Benchmarks

For benchmark our project we used the dataset provided by Marek Gagolewski. Specificallyh we are using wut dataset. Below you can see the table with our metrics of rand index and accuracy of our algorithms: kmeans and kmeans++ which was benchmarked on wut dataset. Also, we are providing visualization in order to give insights what data we used.

Comparison table

Test CaseKmeans Rand IndexKmeans AccuracyKmeans++ Rand IndexKmeans++ AccuracyBkmeans Rand IndexBkmeans Accuracy
circles1.001.001.001.001.01.0
cross0.59050.59670.59050.59670.73620.7167
graph0.88350.23730.88520.61330.86870.3147
isolation0.55640.34740.55570.34440.55430.3489
labirynth0.75260.45110.75260.45110.78390.2350
mk11.001.001.001.001.01.0
mk20.50100.54670.50100.54670.50100.5467
mk30.97400.97780.97400.97780.97400.9778
mk40.56820.23110.57660.24000.62240.5111
olympic0.71220.29070.73060.27070.71100.2840
smile0.82630.42670.83790.52670.83350.5733
stripes0.49980.51600.49980.51600.49980.5160
trajectories1.001.001.001.001.01.0
trapped_lovers0.59730.58400.59550.58130.58330.4533
twosplashes0.61860.75000.61860.75000.61860.7500
windows0.57450.33330.56770.35350.58200.4855
x11.001.001.001.001.01.0
x20.83010.88890.60780.66670.60780.6667
x30.92590.92860.93390.64290.93390.6429
z10.62560.48280.62560.48280.66500.5862
z20.78030.33330.78030.33330.77720.2963
z31.001.001.001.001.01.0

Visualization K-means

Clustering Circles

Figure 1: Clustering Circles with K-means

Mk1

Figure 2: Mk1 with K-means

Mk1

Figure 3: Mk3 with K-means

Mk1

Figure 4: Z3 with K-means

Mk1

Figure 5: Trajectories with K-means