Skip to content

Instantly share code, notes, and snippets.

@Abhishek-Deshmukh
Created September 3, 2021 08:11
Show Gist options
  • Save Abhishek-Deshmukh/ac82bc44bfe55f1540fa969a6721ab6a to your computer and use it in GitHub Desktop.
Save Abhishek-Deshmukh/ac82bc44bfe55f1540fa969a6721ab6a to your computer and use it in GitHub Desktop.
CS640 assignment (Clustering)

If the data is less (less than like million points). Then just do a scatter plot and visually count the number of clusters. This is also a good place to decide wether to use KMeans or DBSCAN. If the data is in circle-ish clusters then k-means should work fine, it not DBSCAN would be better.

I am not sure what to do when data is very large. I guess something specific about where the data came from and what we need can be used.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment