Skip to content

Instantly share code, notes, and snippets.

@karamanbk
Last active March 31, 2020 01:22
Show Gist options
  • Save karamanbk/ed06a1a0064dd43b42730f71299299f1 to your computer and use it in GitHub Desktop.
Save karamanbk/ed06a1a0064dd43b42730f71299299f1 to your computer and use it in GitHub Desktop.
#remove outliers
tx_merge = tx_merge[tx_merge['m6_Revenue']<tx_merge['m6_Revenue'].quantile(0.99)]
#creating 3 clusters
kmeans = KMeans(n_clusters=3)
kmeans.fit(tx_merge[['m6_Revenue']])
tx_merge['LTVCluster'] = kmeans.predict(tx_merge[['m6_Revenue']])
#order cluster number based on LTV
tx_merge = order_cluster('LTVCluster', 'm6_Revenue',tx_merge,True)
#creatinga new cluster dataframe
tx_cluster = tx_merge.copy()
#see details of the clusters
tx_cluster.groupby('LTVCluster')['m6_Revenue'].describe()
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment