Distributed K-Means with R-Hadoop
This article describes an R-Hadoop (rmr2) implementation of distributed k-means clustering, using a sample 2-D dataset. First, the dataset shown below is horizontally partitioned into 4 data subsets…
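
The partition-then-combine idea behind distributed k-means can be sketched without Hadoop. What follows is a minimal plain-Python illustration, not the article's rmr2 code: each row-wise partition produces per-cluster coordinate sums and counts (the map step), and those partial results are merged to recompute the centroids (the reduce step). The function names, the synthetic two-blob data, and the starting centroids are all assumptions made for this illustration.

```python
import random


def nearest(point, centroids):
    """Index of the centroid closest to point (squared Euclidean distance)."""
    return min(
        range(len(centroids)),
        key=lambda j: sum((p - c) ** 2 for p, c in zip(point, centroids[j])),
    )


def map_partition(partition, centroids):
    """Map step: per-cluster coordinate sums and point counts for one partition."""
    partial = {}
    for point in partition:
        j = nearest(point, centroids)
        sums, count = partial.get(j, ([0.0] * len(point), 0))
        partial[j] = ([s + p for s, p in zip(sums, point)], count + 1)
    return partial


def reduce_partials(partials, centroids):
    """Reduce step: merge the partial sums and recompute every centroid."""
    new_centroids = []
    for j, old in enumerate(centroids):
        sums, count = [0.0] * len(old), 0
        for partial in partials:
            if j in partial:
                s, n = partial[j]
                sums = [a + b for a, b in zip(sums, s)]
                count += n
        # Keep the previous centroid if no point was assigned to this cluster.
        new_centroids.append([a / count for a in sums] if count else list(old))
    return new_centroids


def distributed_kmeans(partitions, centroids, iterations=10):
    """Run a fixed number of map/reduce rounds over the partitions."""
    for _ in range(iterations):
        partials = [map_partition(part, centroids) for part in partitions]
        centroids = reduce_partials(partials, centroids)
    return centroids


# Hypothetical 2-D data: two well-separated blobs (not the article's dataset).
rng = random.Random(0)
data = [(rng.gauss(0, 0.5), rng.gauss(0, 0.5)) for _ in range(100)]
data += [(rng.gauss(10, 0.5), rng.gauss(10, 0.5)) for _ in range(100)]
rng.shuffle(data)

# Horizontal partitioning: split the rows into 4 subsets, keeping all columns.
partitions = [data[i::4] for i in range(4)]

# Assumed starting centroids, chosen by hand for the illustration.
initial = [[1.0, 1.0], [9.0, 9.0]]
centroids = distributed_kmeans(partitions, initial)
print(centroids)
```

Because only sums and counts leave each partition, the merged result matches what a single pass over the whole dataset would give, up to floating-point rounding. That property is what makes the computation a natural fit for a map/reduce framework.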