Sampling For Large-Scale Clustering