This page contains reference data and code used in a paper by Jan Poland and myself, published back in 2005.
Variable | Meaning |
---|---|
x | the matrix of the data themselves, as column vectors |
c0 | a row vector of correct cluster assignments, an int for each point |
nn | a row vector of cluster sizes, an int for each cluster |
sgm | a matrix of best kernel widths (sigma) for every algorithm. Each row
contains five empirically found best values for the corresponding
algorithm. The algorithms are ordered as follows:
|
sgmHist | a vector of most probable sigmas according to the histogram method |
The data sets are: