<分区>
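What is the best way to check the accuracy of a clustering algorithm (K-Means in this case)? I already know the true group separation, which was produced by a more advanced algorithm.
However, I have hit a wall. How can I determine that my group 1 is the same as their group 2? If the two separations were identical, this could be done by hand. But since the results differ, matching them by hand does not seem worthwhile, and a computational or statistical approach would be better.
Do you know of an algorithm I could use to solve this?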
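Please take a look at these two objects. The first is the expected separation of the time series, and the second is how I grouped them.
Checking the first 6 elements of both, group 3 in the expected data is probably the same as my group 3. However, their elements 4 and 5, which are in their groups 4 and 2, were both put in my group 6. So how do I determine whether my group 6 is their group 2 or their group 4, so that I can compute a hit rate?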
c(3L, 3L, 3L, 4L, 2L, 2L, 5L, 6L, 4L, 4L, 6L, 6L, 6L, 5L, 1L,
1L, 2L, 2L, 3L, 6L, 5L, 6L, 4L, 5L, 5L, 5L, 2L, 5L, 2L, 2L, 5L,
5L, 1L, 2L, 1L, 6L, 5L, 4L, 4L, 5L, 2L, 2L, 3L, 2L, 1L, 5L, 6L,
6L, 2L, 6L, 6L, 4L, 3L, 6L, 4L, 3L, 2L, 3L, 5L, 3L, 5L, 1L, 5L,
6L, 5L, 2L, 1L, 6L, 3L, 6L, 3L, 3L, 3L, 5L, 5L, 2L, 2L, 4L, 5L,
4L, 6L, 3L, 2L, 3L, 1L, 4L, 2L, 3L, 4L, 3L, 1L, 3L, 1L, 5L, 3L,
4L, 5L, 1L, 5L, 5L, 5L, 4L, 3L, 6L, 2L, 6L, 6L, 1L, 5L, 4L, 2L,
1L, 2L, 5L, 2L, 5L, 6L, 2L, 3L, 6L, 6L, 3L, 4L, 6L, 4L, 3L, 1L,
6L, 2L, 5L, 6L, 3L, 3L, 1L, 2L, 1L, 1L, 1L, 6L, 2L, 3L, 2L, 1L,
5L, 3L, 4L, 4L, 2L, 5L, 3L, 4L, 1L, 1L, 5L, 2L, 3L, 6L, 2L, 1L,
4L, 2L, 3L, 1L, 6L, 5L, 6L, 1L, 6L, 1L, 1L, 4L, 1L, 1L, 2L, 1L,
4L, 3L, 3L, 3L, 4L, 2L, 4L, 4L, 6L, 3L, 5L, 6L, 5L, 6L, 2L, 4L,
1L, 1L, 2L, 1L, 2L, 4L, 4L, 1L, 4L, 5L, 6L, 3L, 5L, 5L, 5L, 1L,
3L, 6L, 3L, 4L, 1L, 6L, 3L, 3L, 3L, 5L, 6L, 6L, 2L, 4L, 3L, 5L,
6L, 1L, 1L, 1L, 1L, 2L, 5L, 5L, 4L, 2L, 3L, 5L, 1L, 4L, 4L, 3L,
2L, 4L, 6L, 1L, 3L, 3L, 5L, 3L, 3L, 2L, 1L, 4L, 4L, 1L, 6L, 1L,
6L, 6L, 6L, 3L, 2L, 5L, 5L, 2L, 2L, 5L, 2L, 4L, 2L, 4L, 4L, 2L,
5L, 5L, 6L, 4L, 4L, 3L, 3L, 4L, 3L, 2L, 4L, 6L, 2L, 1L, 2L, 3L,
5L, 6L, 5L, 3L, 1L, 6L, 5L, 1L, 4L, 4L, 4L, 4L, 2L, 3L, 1L, 4L,
3L, 5L, 4L, 6L, 1L, 5L, 6L, 1L, 1L, 1L, 1L, 6L, 6L, 2L, 1L, 1L,
5L, 4L, 1L, 3L, 3L, 6L, 2L, 2L, 1L, 6L, 2L, 2L, 3L, 4L, 6L, 1L,
3L, 4L, 6L, 6L, 3L, 6L, 4L, 1L, 4L, 1L, 3L, 4L, 5L, 2L, 5L, 3L,
4L, 6L, 6L, 1L, 1L, 1L, 3L, 4L, 5L, 5L, 4L, 2L, 1L, 1L, 2L, 2L,
2L, 4L, 5L, 5L, 2L, 5L, 2L, 3L, 6L, 4L, 4L, 5L, 6L, 4L, 6L, 4L,
3L, 5L, 5L, 3L, 2L, 2L, 3L, 6L, 2L, 4L, 5L, 1L, 6L, 2L, 4L, 1L,
6L)
--
c(3, 3, 3, 6, 6, 6, 2, 6, 6, 6, 6, 6, 6, 2, 1, 1, 6, 6, 3, 6,
2, 6, 6, 2, 2, 2, 6, 2, 6, 6, 2, 2, 1, 6, 1, 6, 2, 6, 6, 2, 6,
6, 3, 6, 1, 2, 6, 6, 6, 6, 6, 6, 3, 6, 6, 3, 6, 3, 2, 3, 2, 1,
2, 6, 2, 6, 1, 6, 3, 6, 3, 3, 3, 2, 2, 6, 6, 6, 2, 6, 6, 3, 6,
3, 1, 6, 6, 3, 6, 3, 1, 3, 1, 2, 3, 6, 2, 1, 2, 2, 2, 6, 3, 6,
6, 6, 6, 1, 2, 6, 6, 1, 6, 2, 6, 2, 6, 6, 3, 6, 6, 3, 6, 6, 6,
3, 1, 6, 6, 2, 6, 3, 3, 1, 6, 1, 1, 1, 6, 6, 3, 6, 1, 2, 3, 6,
6, 6, 2, 3, 6, 1, 1, 2, 6, 3, 6, 6, 1, 6, 6, 3, 1, 6, 2, 6, 1,
6, 1, 1, 6, 1, 1, 6, 1, 6, 3, 4, 3, 6, 6, 6, 6, 6, 3, 2, 6, 2,
6, 6, 6, 1, 1, 6, 1, 6, 6, 6, 1, 6, 2, 6, 3, 2, 2, 2, 1, 3, 6,
3, 6, 1, 6, 3, 3, 3, 2, 6, 6, 6, 6, 3, 2, 6, 1, 1, 1, 1, 6, 2,
2, 6, 6, 3, 2, 1, 6, 6, 3, 6, 6, 6, 1, 3, 3, 2, 3, 3, 6, 1, 6,
6, 1, 6, 1, 6, 6, 6, 3, 6, 5, 2, 6, 6, 2, 6, 6, 6, 6, 6, 6, 2,
2, 6, 6, 6, 3, 3, 6, 3, 6, 6, 6, 6, 1, 6, 3, 2, 1, 4, 3, 1, 6,
2, 1, 6, 6, 6, 6, 6, 3, 1, 6, 3, 2, 6, 6, 1, 2, 6, 1, 1, 1, 1,
6, 6, 6, 1, 1, 2, 6, 1, 3, 3, 6, 6, 6, 1, 6, 6, 6, 3, 6, 6, 1,
3, 6, 6, 6, 3, 6, 6, 1, 6, 1, 3, 6, 2, 6, 2, 3, 6, 6, 6, 1, 1,
1, 3, 6, 2, 2, 6, 6, 1, 1, 6, 6, 6, 6, 2, 2, 6, 2, 6, 3, 6, 6,
6, 2, 6, 6, 6, 6, 3, 2, 2, 3, 6, 6, 3, 6, 6, 6, 2, 1, 6, 6, 6,
1, 6)
Note: the objects were generated with the dput function in R.
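
To make the matching problem concrete, here is a rough sketch of one possible approach, not a definitive answer: cross-tabulate the two label vectors, then try every one-to-one relabelling of my groups and keep the one with the most agreements. The names `permutations` and `best_match` are hypothetical helpers written for this illustration, and the toy input is only the first 10 labels of each vector above. It assumes both sides use the same number of groups (6 here).

```r
# Toy input: the first 10 labels of each vector from the question.
expected <- c(3L, 3L, 3L, 4L, 2L, 2L, 5L, 6L, 4L, 4L)
mine     <- c(3, 3, 3, 6, 6, 6, 2, 6, 6, 6)
k <- 6  # assumed number of groups on both sides

# Hypothetical helper: all orderings of v, in lexicographic order.
# Brute force is fine for k = 6 (720 permutations) but grows as k!.
permutations <- function(v) {
  if (length(v) <= 1) return(list(v))
  out <- list()
  for (i in seq_along(v)) {
    for (p in permutations(v[-i])) {
      out[[length(out) + 1]] <- c(v[i], p)
    }
  }
  out
}

# Hypothetical helper: find the one-to-one relabelling with the most agreements.
best_match <- function(expected, mine, k) {
  # Rows = expected groups, columns = my groups; fixed levels keep it k x k
  # even when a group never occurs.
  tab <- unclass(table(factor(expected, levels = seq_len(k)),
                       factor(mine, levels = seq_len(k))))
  perms <- permutations(seq_len(k))
  # perm[j] is the expected group assigned to my group j;
  # the score is the number of observations that agree under that assignment.
  hits <- vapply(perms, function(perm) {
    as.numeric(sum(tab[cbind(perm, seq_len(k))]))
  }, numeric(1))
  best <- perms[[which.max(hits)]]
  list(mapping = best, hit_rate = max(hits) / length(expected))
}

res <- best_match(expected, mine, k)
res$mapping   # res$mapping[j] = expected group matched to my group j
res$hit_rate  # share of observations that agree under the best relabelling
```

On this toy input, my group 6 ends up matched to their group 4 (3 shared elements) rather than their group 2 (2 shared elements), and at most 7 of the 10 labels can agree. Groups that never occur in the sample are matched arbitrarily, so their entries in `mapping` carry no information. For a larger number of groups the brute-force search becomes impractical, and an assignment solver (for example `solve_LSAP` from the clue package) could take its place. A label-independent score such as the adjusted Rand index (for example `adjustedRandIndex` from the mclust package) avoids the matching step entirely.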