-
Recent Posts
Recent Comments
Yuanqing on Configuration Spaces and the M… Mel on Recurrent Neural Networks Teradata Expands Cap… on K-modes Teradata Expands Cap… on K-modes Qu'est-ce qu… on Graphs and networks Archives
- December 2016
- November 2016
- October 2016
- June 2016
- April 2016
- January 2016
- November 2015
- October 2015
- July 2015
- June 2015
- May 2015
- January 2015
- September 2014
- June 2014
- May 2014
- March 2014
- February 2014
- January 2014
- December 2013
- October 2013
- September 2013
- August 2013
- July 2013
- June 2013
- May 2013
- April 2013
- March 2013
Categories
Meta
Category Archives: Clustering
K-modes
I recently read an interesting Wired story about Chris McKinlay (a fellow alum of Middlebury College), who used a clustering algorithm to understand the pool of users on the dating site OkCupid (and successfully used this information to improve his … Continue reading
Posted in Clustering, Feature extraction
39 Comments
Modularity – Measuring cluster separation
We’ve now seen a number of different clustering algorithms, each of which will divide a data set into a number of subsets. This week, I want to ask the question: How do we know if answer that a clustering algorithm … Continue reading
Posted in Clustering, Unsupervised learning
Leave a comment
Spectral clustering
In the last few posts, we’ve been studying clustering, i.e. algorithms that try to cut a given data set into a number of smaller, more tightly packed subsets, each of which might represent a different phenomenon or a different type … Continue reading
Posted in Clustering, Unsupervised learning
9 Comments
Mapper and the choice of scale
In last week’s post, I described the DBSCAN clustering algorithm, which uses the notion of density to determine which data points in a data set form tightly packed groups called clusters. This algorithm relies on two parameters – a distance … Continue reading
Posted in Clustering, Unsupervised learning
4 Comments
Clusters and DBScan
A few weeks ago, I mentioned the idea of a clustering algorithm, but here’s a recap of the idea: Often, a single data set will be made up of different groups of data points, each of which corresponds to a … Continue reading
Posted in Clustering, Unsupervised learning
7 Comments