Back to the table of contents

Previous Next
waffles_cluster

A command-line tool for clustering data. Here's the usage information:
Full Usage Information
[Square brackets] are used to indicate required arguments.
<Angled brackets> are used to indicate optional arguments.

waffles_cluster [command]
   Cluster data.
   agglomerative [dataset] [clusters]
      Performs single-link agglomerative clustering. Outputs the cluster id for
      each row.
   fuzzykmeans [dataset] [clusters] <options>
      Performs fuzzy k-means clustering. Outputs the cluster id for each row.
      This algorithm is specified in Li, D. and Deogun, J. and Spaulding, W.
      and Shuart, B., Towards missing data imputation: A study of fuzzy K-means
      clustering method, In Rough Sets and Current Trends in Computing,
      Springer, pages 573--579, 2004.
      [dataset]
         The filename of a dataset to cluster.
      <options>
         -seed [value]
            Specify a seed for the random number generator.
         -reps [n]
            Cluster the data [n] times, and return the clustering that
            minimizes the sum-weighted-distance between rows and the centroids.
         -fuzzifier [value]
            Specify the fuzzifier parameter, which should be greater than 1.
   kmeans [dataset] [clusters] <options>
      Performs k-means clustering. Outputs the cluster id for each row.
      [dataset]
         The filename of a dataset to cluster.
      <options>
         -seed [value]
            Specify a seed for the random number generator.
         -reps [n]
            Cluster the data [n] times, and return the clustering that
            minimizes the sum-squared-distance between each row and its
            corresponding centroid.
   kmedoids [dataset] [clusters]
      Performs k-medoids clustering. Outputs the cluster id for each row.
   usage
      Print usage information.
Previous Next

Back to the table of contents