Back to the table of contents Previous Next waffles_clusterA command-line tool for clustering data. Here's the usage information: Full Usage Information [Square brackets] are used to indicate required arguments. <Angled brackets> are used to indicate optional arguments. waffles_cluster [command] Cluster data. agglomerative [dataset] [clusters] Performs single-link agglomerative clustering. Outputs the cluster id for each row. fuzzykmeans [dataset] [clusters] <options> Performs fuzzy k-means clustering. Outputs the cluster id for each row. This algorithm is specified in Li, D. and Deogun, J. and Spaulding, W. and Shuart, B., Towards missing data imputation: A study of fuzzy K-means clustering method, In Rough Sets and Current Trends in Computing, Springer, pages 573--579, 2004. [dataset] The filename of a dataset to cluster. <options> -seed [value] Specify a seed for the random number generator. -reps [n] Cluster the data [n] times, and return the clustering that minimizes the sum-weighted-distance between rows and the centroids. -fuzzifier [value] Specify the fuzzifier parameter, which should be greater than 1. kmeans [dataset] [clusters] <options> Performs k-means clustering. Outputs the cluster id for each row. [dataset] The filename of a dataset to cluster. <options> -seed [value] Specify a seed for the random number generator. -reps [n] Cluster the data [n] times, and return the clustering that minimizes the sum-squared-distance between each row and its corresponding centroid. kmedoids [dataset] [clusters] Performs k-medoids clustering. Outputs the cluster id for each row. usage Print usage information. Previous Next Back to the table of contents |