An Efficient Approach for Computing Silhouette Coefficients
Abstract
One popular approach for finding the best number of clusters (K) in a data set is through computing the silhouette coefficients. The silhouette coefficients for different values of K, are first found and then the maximum value of these coefficients is chosen. However, computing the silhouette coefficient for different Ks is a very time consuming process. This is due to the amount of CPU time spent on distance calculations. A proposed approach to compute the silhouette coefficient quickly had been presented. The approach was based on decreasing the number of addition operations when computing distances. The results were efficient and more than 50% of the CPU time was achieved when applied to different data sets.
DOI: https://doi.org/10.3844/jcssp.2008.252.255
                                            
                                Copyright: © 2008 Moh'd B. Al- Zoubi and Mohammad al Rawi. This is an open access article distributed under the terms of the
                                                                            Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
                                                                    
- 6,181 Views
 - 7,302 Downloads
 - 73 Citations
 
Download
Keywords
- Silhouette coefficients
 - clustering
 - data mining
 - pattern recognition