Abhishek-Deshmukh/clustering_submission.md

Created September 3, 2021 08:11

CS640 assignment (Clustering)

If the dataset is small (fewer than roughly a million points), just make a scatter plot and count the clusters visually. The plot is also a good place to decide whether to use KMeans or DBSCAN: if the clusters are roughly circular, k-means should work fine; if not, DBSCAN would be better.
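To illustrate the shape rule, here is a minimal sketch using only the Python standard library. Everything in it is an assumption made for illustration and not part of the original note: the toy data (two concentric rings from a hypothetical `make_rings` helper), the simplified `kmeans` and `dbscan` functions, and the `eps=0.8`, `min_pts=3` settings. In practice a library implementation would normally be used instead.

```python
import math
import random


def make_rings(n_per_ring=60, radii=(1.0, 5.0)):
    """Evenly spaced points on concentric circles, plus each point's ring index."""
    points, truth = [], []
    for ring, r in enumerate(radii):
        for i in range(n_per_ring):
            a = 2 * math.pi * i / n_per_ring
            points.append((r * math.cos(a), r * math.sin(a)))
            truth.append(ring)
    return points, truth


def kmeans(points, k, iters=50, seed=0):
    """Plain Lloyd's algorithm; initial centers are k randomly sampled points."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign every point to its nearest center.
        labels = [min(range(k), key=lambda c: math.dist(p, centers[c]))
                  for p in points]
        # Move every non-empty cluster's center to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = tuple(sum(axis) / len(members)
                                   for axis in zip(*members))
    return labels


def dbscan(points, eps, min_pts):
    """Simplified DBSCAN: returns a cluster id per point, or -1 for noise."""
    labels = [None] * len(points)

    def neighbors(i):
        # Brute-force neighborhood query (includes the point itself).
        return [j for j, q in enumerate(points)
                if math.dist(points[i], q) <= eps]

    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        nbrs = neighbors(i)
        if len(nbrs) < min_pts:
            labels[i] = -1  # provisional noise; may become a border point later
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in nbrs if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster  # noise reached from a core point: border
            if labels[j] is not None:
                continue
            labels[j] = cluster
            j_nbrs = neighbors(j)
            if len(j_nbrs) >= min_pts:  # only core points keep expanding
                queue.extend(j_nbrs)
    return labels


def same_partition(labels, truth):
    """True if both labelings group the points identically (names may differ)."""
    return len(set(zip(labels, truth))) == len(set(labels)) == len(set(truth))


points, truth = make_rings()
kmeans_labels = kmeans(points, k=2)
dbscan_labels = dbscan(points, eps=0.8, min_pts=3)
print("k-means recovers the rings:", same_partition(kmeans_labels, truth))
print("DBSCAN recovers the rings: ", same_partition(dbscan_labels, truth))
```

With two centers, k-means always splits the plane along a straight line, so it cannot put an inner ring and the ring surrounding it into separate clusters. DBSCAN follows chains of nearby points instead, which is why it suits non-circular shapes, provided `eps` is smaller than the gap between clusters.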

I am not sure what to do when the dataset is very large. I guess domain knowledge, i.e. where the data came from and what we need from it, could guide the choice.