- Python Machine Learning Cookbook(Second Edition)
- Giuseppe Ciaburro Prateek Joshi
- 148字
- 2021-06-24 15:41:08
There's more…
The vector quantization algorithm can be used to divide a dataset into a number of clusters. The algorithm is based on the calculation of the Euclidean distance for the allocation of the samples to the cluster, to which it belongs. The algorithm consists of the following steps:
- At the beginning, all the vectors are assigned to the same cluster, whose centroid is calculated as the mean value of all the vectors.
- For each centroid, a perturbation is introduced that generates two new cluster centers. The old representative is discarded.
- Each carrier is reassigned to one of the new clusters according to the minimum distance criterion.
- The new representatives are calculated as the average value of the vectors assigned to each cluster. These will be the new centers of the cluster.
- If the end criterion is met, the algorithm terminates. If not, return to step 2.
本周热推: