@BadreeshShetty
Created November 6, 2021 11:08
NLP ML (Built-In)
{
"cells": [
{
"cell_type": "markdown",
"metadata": {},
"source": [
"# Intermediate Graphs"
]
},
{
"cell_type": "code",
"execution_count": 29,
"metadata": {},
"outputs": [
{
"data": {
"image/png": "iVBORw0KGgoAAAANSUhEUgAAAX0AAAD8CAYAAACb4nSYAAAABHNCSVQICAgIfAhkiAAAAAlwSFlz\nAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDIuMS4wLCBo\ndHRwOi8vbWF0cGxvdGxpYi5vcmcvpW3flQAAFSdJREFUeJzt3X+M3PWd3/Hn2z+wkxZMz7gRsYE1\nBU62szIExyYqnGQlOHYS4lyAxrTobAUFXYrTwokEfFEQJXe9QNq6VwXlQs4oBNHgK/nlCF84UpM0\nrYDYBnz2hgMW8JU9U+IY5COAwTbv/jHftcbD7s6sdz2zu5/nQ7L2O5/5fHfe853xaz/zmc98JzIT\nSVIZJnW6AElS+xj6klQQQ1+SCmLoS1JBDH1JKoihL0kFMfQlqSCGviQVxNCXpIJM6XQBjU455ZTs\n6urqdBmSNK5s3779N5k5q1m/MRf6XV1dbNu2rdNlSNK4EhF/30o/p3ckqSCGviQVxNCXpIKMuTn9\ngRw8eJC+vj4OHDjQ6VLabvr06cyZM4epU6d2uhRJE8C4CP2+vj5OPPFEurq6iIhOl9M2mcm+ffvo\n6+tj7ty5nS5H0gQwLqZ3Dhw4wMyZM4sKfICIYObMmUW+wpF0fIyL0AeKC/x+pd5vScfHuAl9SdLI\njYs5/UbrH3x6VH/fdRefM6q/T5LGqnEZ+pKaG2pw5ECnXE7vtOi1117jYx/7GAsXLuR973sfGzdu\npKurixtuuIHFixezePFient7Afjxj3/MkiVLOO+88/jwhz/MSy+9BMDNN9/M6tWrWbZsGV1dXXz/\n+9/ni1/8It3d3SxfvpyDBw928i5KKoCh36Kf/OQnvPe972XHjh3s2rWL5cuXA3DSSSfxy1/+krVr\n13LttdcCcOGFF/LII4/w+OOPs2rVKm677bYjv+fZZ5/l/vvv50c/+hFXXnklS5cuZefOnbzrXe/i\n/vvv78h9k1QOQ79F3d3d/PSnP+WGG27gF7/4BTNmzADgiiuuOPLz4YcfBmqfK/jIRz5Cd3c3X/va\n1+jp6Tnye1asWMHUqVPp7u7m8OHDR/54dHd3s3v37vbeKUnFMfRbdM4557B9+3a6u7tZt24dt9xy\nC3D0ksr+7c9//vOsXbuWnTt38s1vfvOodfbTpk0DYNKkSUydOvXIPpMmTeLQoUPtujuSCmXot2jP\nnj28+93v5sorr+T666/nscceA2Djxo1Hfn7wgx8EYP/+/cyePRuAu+66qzMFS9IAxuXqnU6sPNi5\ncydf+MIXjozQv/GNb3DZZZfx5ptvsmTJEt5++22++93vArU3bC+//HJmz57NBRdcwPPPP9/2eiVp\nIJGZna7hKIsWLcrGL1F58sknmTdvXocqGlz/F76ccsopx/V2xur919jmks2yRMT2zFzUrJ/TO5JU\nkHE5vTNWuNpG0njjSF+SCtJS6EfE8oh4KiJ6I+LGAa6fFhEbq+sfjYiuhutPj4jfRsT1o1O2JOlY\nNA39iJgM3A6sAOYDV0TE/IZuVwGvZOZZwHrg1obr1wN/PfJyJUkj0cpIfzHQm5nPZeZbwL3AyoY+\nK4H+Ben3AR+K6lNHEfFJ4DmgB0lSR7XyRu5s4IW6y33AksH6ZOahiNgPzIyIN4AbgIuB0ZvaeejP\nRu1XAbB0XdMuu3fv5uMf/zi7du0a3duWpDZqZaQ/0Fc3NS7uH6zPfwDWZ+Zvh7yBiKsjYltEbNu7\nd28LJUmSjkUrod8HnFZ3eQ6wZ7A+ETEFmAG8TO0VwW0RsRu4FvjjiFjbeAOZeUdmLsrMRbNmzRr2\nnWiXw4cP89nPfpYFCxawbNky3njjDb71rW/xgQ98gIULF3LppZfy+uuvA7BmzRo+97nPsXTpUs48\n80x+/vOf85nPfIZ58+axZs2azt4RScVqJfS3
AmdHxNyIOAFYBWxq6LMJWF1tXwZsyZqLMrMrM7uA\n/wr8x8z8+ijV3nbPPPMM11xzDT09PZx88sl873vf41Of+hRbt25lx44dzJs3jw0bNhzp/8orr7Bl\nyxbWr1/PJZdcwnXXXUdPTw87d+7kiSee6OA9kVSqpqGfmYeAtcADwJPAX2VmT0TcEhGfqLptoDaH\n3wv8EfCOZZ0Twdy5czn33HMBOP/889m9eze7du3ioosuoru7m3vuueeo0yhfcsklRATd3d285z3v\nobu7m0mTJrFgwQI/2CWpI1r6RG5mbgY2N7TdVLd9ALi8ye+4+RjqG1P6T4sMMHnyZN544w3WrFnD\nD3/4QxYuXMi3v/1tfvazn72j/6RJk47a19MoS+oUP5E7Qq+++iqnnnoqBw8e5J577ul0OZI0pPF5\n7p0Wlli2y1e+8hWWLFnCGWecQXd3N6+++mqnS5KkQXlq5XGg9PuvY+OplcviqZUlSe9g6EtSQcZN\n6I+1aah2KfV+Szo+xkXoT58+nX379hUXgJnJvn37mD59eqdLkTRBjIvVO3PmzKGvr48Sz8szffp0\n5syZ0+kyJE0Q4yL0p06dyty5cztdhiSNe+NiekeSNDoMfUkqiKEvSQUx9CWpIIa+JBXE0Jekghj6\nklQQQ1+SCmLoS1JBDH1JKoihL0kFMfQlqSCGviQVxNCXpIIY+pJUEENfkgpi6EtSQQx9SSqIoS9J\nBTH0Jakghr4kFcTQl6SCGPqSVBBDX5IKYuhLUkEMfUkqiKEvSQUx9CWpIIa+JBXE0JekgrQU+hGx\nPCKeiojeiLhxgOunRcTG6vpHI6Kral8cEU9U/3ZExO+PbvmSpOFoGvoRMRm4HVgBzAeuiIj5Dd2u\nAl7JzLOA9cCtVfsuYFFmngssB74ZEVNGq3hJ0vC0EsCLgd7MfA4gIu4FVgK/quuzEri52r4P+HpE\nRGa+XtdnOpAjrlgSAOsffLrTJWgcamV6ZzbwQt3lvqptwD6ZeQjYD8wEiIglEdED7AT+sLpektQB\nrYR+DNDWOGIftE9mPpqZC4APAOsiYvo7biDi6ojYFhHb9u7d20JJkqRj0Uro9wGn1V2eA+wZrE81\nZz8DeLm+Q2Y+CbwGvK/xBjLzjsxclJmLZs2a1Xr1kqRhaSX0twJnR8TciDgBWAVsauizCVhdbV8G\nbMnMrPaZAhARZwC/C+welcolScPW9I3czDwUEWuBB4DJwJ2Z2RMRtwDbMnMTsAG4OyJ6qY3wV1W7\nXwjcGBEHgbeBf5uZvzked0SS1FxLyyczczOwuaHtprrtA8DlA+x3N3D3CGuUJI0SP5ErSQUx9CWp\nIIa+JBXE0Jekghj6klQQQ1+SCmLoS1JBDH1JKoihL0kFMfQlqSCGviQVxNCXpIIY+pJUEENfkgpi\n6EtSQQx9SSqIoS9JBTH0Jakghr4kFcTQl6SCGPqSVBBDX5IKYuhLUkEMfUkqiKEvSQUx9CWpIIa+\nJBXE0Jekghj6klQQQ1+SCmLoS1JBpnS6AEkDW//g050uQROQI31JKoihL0kFMfQlqSCGviQVxNCX\npIIY+pJUEENfkgrS0jr9iFgO/DkwGfjLzPxqw/XTgO8A5wP7gE9n5u6IuBj4KnAC8BbwhczcMor1\njy0P/dnQ1y9d1546JGkQTUf6ETEZuB1YAcwHroiI+Q3drgJeycyzgPXArVX7b4BLMrMbWA3cPVqF\nS5KGr5XpncVAb2Y+l5lvAfcCKxv6rATuqrbvAz4UEZGZj2fmnqq9B5hevSqQJHVAK6E/G3ih7nJf\n1TZgn8w8BOwHZjb0uRR4PDPfPLZSJUkj1cqcfgzQlsPpExELqE35LBvwBiKuBq4GOP3001soSZJ0\nLFoZ6fcBp9VdngPsGaxPREwBZgAvV5fnAD8A/iAznx3oBjLzjsxclJmLZs2aNbx7IElqWSsj/a3A\n2RExF/gH
YBXwrxv6bKL2Ru3DwGXAlszMiDgZuB9Yl5n/Z/TKHqeGWt3jyh5JbdA09DPzUESsBR6g\ntmTzzszsiYhbgG2ZuQnYANwdEb3URvirqt3XAmcBX46IL1dtyzLz16N9RyS1rtlpm6+7+Jw2VaJ2\na2mdfmZuBjY3tN1Ut30AuHyA/f4E+JMR1ihJGiV+IleSCmLoS1JBDH1JKoihL0kFMfQlqSCGviQV\nxNCXpIK0tE5fbeC5+CW1gaE/XvhHQdIocHpHkgpi6EtSQQx9SSqIoS9JBTH0Jakghr4kFcTQl6SC\nuE5/OJqtlZekMc6RviQVxJG+1CHNvqdWOh4c6UtSQQx9SSqIoS9JBTH0Jakghr4kFcTQl6SCGPqS\nVBBDX5IKYuhLUkEMfUkqiKdhkI4TT7OgsciRviQVxNCXpIIY+pJUEENfkgpi6EtSQQx9SSqIoS9J\nBXGd/kTR7Evbl65rTx2SxjRH+pJUkJZCPyKWR8RTEdEbETcOcP20iNhYXf9oRHRV7TMj4qGI+G1E\nfH10S5ckDVfT0I+IycDtwApgPnBFRMxv6HYV8EpmngWsB26t2g8AXwauH7WKJUnHrJWR/mKgNzOf\ny8y3gHuBlQ19VgJ3Vdv3AR+KiMjM1zLzf1MLf0lSh7US+rOBF+ou91VtA/bJzEPAfmDmaBQoSRo9\nrYR+DNCWx9Bn8BuIuDoitkXEtr1797a6myRpmFoJ/T7gtLrLc4A9g/WJiCnADODlVovIzDsyc1Fm\nLpo1a1aru0mShqmV0N8KnB0RcyPiBGAVsKmhzyZgdbV9GbAlM1se6UuS2qPph7My81BErAUeACYD\nd2ZmT0TcAmzLzE3ABuDuiOilNsJf1b9/ROwGTgJOiIhPAssy81ejf1ckSc209InczNwMbG5ou6lu\n+wBw+SD7do2gPknSKPI0DKXwNA2S8DQMklQUQ1+SCmLoS1JBnNNXTbM5/6H4foA0bjjSl6SCGPqS\nVBBDX5IKYuhLUkEMfUkqiKEvSQVxyaZGzlM8SOOGI31JKogjfWkI6x98esjrr7v4nDZVIo0OQ18a\ngWZ/FKSxxukdSSqII30dfx18o9fpGelojvQlqSCGviQVxOkddZ7r/KW2MfRVNFffqDSGvjRGXfB/\n7xjy+kdOv7pNlWgiMfQ19g01/ePUjzQshr40Th3PVwIudZ24XL0jSQVxpK+mHn5u35DXf/DMmW2q\n5J2ajUidF5eOZuhrXGsW6iPdv9kfhaH2H8m+0vFi6EtDGEkwG+oai5zTl6SCGPqSVBCnd+o1Ox2A\nJI1zjvQlqSCO9KUJaiQrizRxGfrjxFheK3+8NbvvklpXVug7Zz8mGepS+5QV+jouDO3yeG6e8cvQ\nlwrk6SnKNfFCv9ApnJHO+Ttal8ow8UJ/DBsqWCfyG7GaeJqfYuI/taUODV9LoR8Ry4E/ByYDf5mZ\nX224fhrwHeB8YB/w6czcXV23DrgKOAz8u8x8YNSqn0AcaWsicc5/7Goa+hExGbgduBjoA7ZGxKbM\n/FVdt6uAVzLzrIhYBdwKfDoi5gOrgAXAe4GfRsQ5mXl4tO9IO5S8bFJl8WRxE1crI/3FQG9mPgcQ\nEfcCK4H60F8J3Fxt3wd8PSKiar83M98Eno+I3ur3PTw65Usai5r+0XhoBAOksfwVmc3eUxwDtbcS\n+rOBF+ou9wFLBuuTmYciYj8ws2p/pGHf2cdc7XE2kadYJvJ90/gzove3mgRrJ1+RN71tOv9HoZXQ\njwHassU+rexLRFwN9K8R+21EPNVCXYM5BfjNCPY/XqxreKxreKxreMZoXX88krrOaKVTK6HfB5xW\nd3kOsGeQPn0RMQWYAbzc4r5k5h3AqEwiRsS2zFw0Gr9rNFnX8FjX8FjX8JRcVytn2dwKnB0RcyPi\nBGpvzG5q6LMJWF1tXwZsycys2ldFxLSImAucDfxydEqXJA1X05F+NUe/Fn
iA2pLNOzOzJyJuAbZl\n5iZgA3B39Ubty9T+MFD1+ytqb/oeAq4Zryt3JGkiaGmdfmZuBjY3tN1Ut30AuHyQff8U+NMR1Dhc\nY3WtmXUNj3UNj3UNT7F1RW0WRpJUAr85S5IKMmFCPyKWR8RTEdEbETd2sI7TIuKhiHgyInoi4t9X\n7TdHxD9ExBPVv492oLbdEbGzuv1tVdvvRMSDEfFM9fOftbmm3607Jk9ExD9GxLWdOF4RcWdE/Doi\ndtW1DXh8oua/Vc+3v42I97e5rq9FxN9Vt/2DiDi5au+KiDfqjttftLmuQR+3iFhXHa+nIuIjba5r\nY11NuyPiiaq9ncdrsGxo73MsM8f9P2pvMD8LnAmcAOwA5neollOB91fbJwJPA/OpfWL5+g4fp93A\nKQ1ttwE3Vts3Ard2+HH8f9TWG7f9eAG/B7wf2NXs+AAfBf6a2mdRLgAebXNdy4Ap1fatdXV11ffr\nwPEa8HGr/g/sAKYBc6v/r5PbVVfD9f8ZuKkDx2uwbGjrc2yijPSPnCoiM98C+k8V0XaZ+WJmPlZt\nvwo8yRj+FDK143RXtX0X8MkO1vIh4NnM/PtO3Hhm/i9qq8/qDXZ8VgLfyZpHgJMj4tR21ZWZf5OZ\nh6qLj1D7DExbDXK8BnPklCyZ+TzQf0qWttYVEQH8K+C7x+O2hzJENrT1OTZRQn+gU0V0PGgjogs4\nD3i0alpbvUy7s93TKJUE/iYitkftU9AA78nMF6H2pAT+eQfq6reKo/8zdvp4weDHZyw95z5DbUTY\nb25EPB4RP4+IizpQz0CP21g5XhcBL2XmM3VtbT9eDdnQ1ufYRAn9lk730E4R8U+B7wHXZuY/At8A\n/gVwLvAitZeY7fYvM/P9wArgmoj4vQ7UMKCoffDvE8D/qJrGwvEayph4zkXEl6h9BuaequlF4PTM\nPA/4I+C/R8RJbSxpsMdtTBwv4AqOHli0/XgNkA2Ddh2gbcTHbKKEfkune2iXiJhK7UG9JzO/D5CZ\nL2Xm4cx8G/gWx+ml7VAyc0/189fAD6oaXup/yVj9/HW766qsAB7LzJeqGjt+vCqDHZ+OP+ciYjXw\nceDfZDUJXE2f7Ku2t1ObO2/byeuHeNzGwvGaAnwK2Njf1u7jNVA20Obn2EQJ/VZOFdEW1ZzhBuDJ\nzPwvde31c3G/D+xq3Pc41/VPIuLE/m1qbwTu4uhTaKwGftTOuuocNQLr9PGqM9jx2QT8QbXC4gJg\nf/9L9HaI2hcb3QB8IjNfr2ufFbXvwCAizqR26pPn2ljXYI/bWDgly4eBv8vMvv6Gdh6vwbKBdj/H\n2vGudTv+UXun+2lqf6m/1ME6LqT2EuxvgSeqfx8F7gZ2Vu2bgFPbXNeZ1FZP7AB6+o8RtVNg/0/g\nmern73TgmL2b2jeuzahra/vxovZH50XgILVR1lWDHR9qL71vr55vO4FFba6rl9p8b/9z7C+qvpdW\nj+8O4DHgkjbXNejjBnypOl5PASvaWVfV/m3gDxv6tvN4DZYNbX2O+YlcSSrIRJnekSS1wNCXpIIY\n+pJUEENfkgpi6EtSQQx9SSqIoS9JBTH0Jakg/x8m8I8uRmS2CQAAAABJRU5ErkJggg==\n",
"text/plain": [
"<matplotlib.figure.Figure at 0x22926dac780>"
]
},
"metadata": {},
"output_type": "display_data"
}
],
"source": [
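"import numpy as np\n",
"import matplotlib.pyplot as plt\n",
"\n",
"# Assumes `data` is a pandas DataFrame with 'label' and 'body_len' columns.\n",
"# Compare the distribution of message body length for spam vs. ham.\n",
"bins = np.linspace(0, 200, 40)\n",
"\n",
"# density=True replaces the deprecated `normed` argument, which newer Matplotlib releases no longer accept.\n",
"plt.hist(data[data['label']=='spam']['body_len'], bins, alpha=0.5, density=True, label='spam')\n",
"plt.hist(data[data['label']=='ham']['body_len'], bins, alpha=0.5, density=True, label='ham')\n",
"plt.legend(loc='upper left')\n",
"plt.show()"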
]
},
{
"cell_type": "code",
"execution_count": 30,
"metadata": {},
"outputs": [
{
"data": {
"image/png": "iVBORw0KGgoAAAANSUhEUgAAAX0AAAD8CAYAAACb4nSYAAAABHNCSVQICAgIfAhkiAAAAAlwSFlz\nAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDIuMS4wLCBo\ndHRwOi8vbWF0cGxvdGxpYi5vcmcvpW3flQAAGI9JREFUeJzt3X+Q1PWd5/Hnix+CF6NGnFjKQGYs\nsQrIRLOOg9aqF0xChovKVoQLZK2FixXuspLbuBsVUndocFOJyd6yW6WVkkRPYjTgGbMh51yIiuel\ntlAH/DWMrHEkHHRIKUHiagzCwPv+6C9c0xno78z0TDP9eT2qKPr7+X6+335/yvbVXz797U8rIjAz\nszSMqnUBZmY2fBz6ZmYJceibmSXEoW9mlhCHvplZQhz6ZmYJceibmSXEoW9mlhCHvplZQsbUuoBy\nZ555ZjQ1NdW6DDOzEWXz5s2/jYiGSv1OuNBvampi06ZNtS7DzGxEkfR/8/Tz9I6ZWUIc+mZmCXHo\nm5kl5ISb0zczy+PAgQMUCgX27dtX61KG1fjx42lsbGTs2LEDOt6hb2YjUqFQ4P3vfz9NTU1IqnU5\nwyIi2LNnD4VCgebm5gGdw9M7ZjYi7du3jwkTJiQT+ACSmDBhwqD+dZMr9CW1S3pFUo+kpX3sv0LS\nc5J6Jc0t2zdZ0s8lbZX0sqSmAVdrZlYipcA/bLBjrhj6kkYDdwGzgWnAAknTyrrtABYBD/Zxiu8D\n346IqUAb8MZgCjYzs4HLM6ffBvRExDYASWuAOcDLhztExPZs36HSA7M3hzER8VjW753qlG1mdrSV\nj/2yque78ZPnV/V8J4o8oT8R2FmyXQBm5Dz/+cDvJD0CNAOPA0sj4mC/qhwmlV409foiMLN05JnT\n72sCKXKefwxwOfAV4GLgXIrTQEc/gbRY0iZJm3bv3p3z1GZmtfX73/+eT3/601xwwQV8+MMfZu3a\ntTQ1NXHLLbfQ1tZGW1sbPT09APz0pz9lxowZfPSjH+UTn/gEr7/+OgC33XYbCxcuZNasWTQ1NfHI\nI49w880309LSQnt7OwcOHKhqzXlCvwBMKtluBHblPH8BeD4itkVEL/BPwJ+Ud4qIVRHRGhGtDQ0V\n1wsyMzsh/OxnP+Occ87hxRdfZMuWLbS3twNw6qmn8uyzz7JkyRK+/OUvA3DZZZfx9NNP8/zzzzN/\n/ny+9a1vHTnPa6+9xqOPPspPfvITrrvuOmbOnElXVxcnn3wyjz76aFVrzhP6ncAUSc2STgLmA+ty\nnr8T+ICkw0l+JSWfBZiZjWQtLS08/vjj3HLLLfziF7/gtNNOA2DBggVH/t64cSNQ/F7Bpz71KVpa\nWvj2t79Nd3f3kfPMnj2bsWPH0tLSwsGDB4+8ebS0tLB9+/aq1lwx9LMr9CXAemAr8FBEdEtaIeka\nAEkXSyoA84C7JXVnxx6kOLXzhKQuilNF363qCMzMauT8889n8+bNtLS0sGzZMlasWAEcfVvl4cdf\n+tKXWLJkCV1dXdx9991H3Ws/btw4AEaNGsXYsWOPHDNq1Ch6e3urWnOub+RGRAfQUda2vORxJ8Vp\nn76OfQz4yCBqNDM7Ie3atYszzjiD6667jlNOOYX77rsPgLVr17J06VLWrl3LpZdeCsBbb73FxIkT\nAVi9enWtSvYyDGZWH2pxd11XVxc33XTTkSv073znO8ydO5f33nuPGTNmcOjQIX74wx8CxQ9s582b\nx8SJE7nkkkv41a9+Nez1Aigi7404w6O1tTVq9SMqvmXTbOTYunUrU6dOrXUZf+TwD0GdeeaZQ/Yc\nfY1d0uaIaK10rNfeMTNLiKd3zMyqqNp321Sbr/TNzBLi0DczS4hD38wsIQ59M7OE+INcM6sPT36j\nuuebuaxil+3bt3PVVVexZcuW6j73EPKVvplZ
Qhz6ZmaDcPDgQb7whS8wffp0Zs2axR/+8Ae++93v\ncvHFF3PBBRdw7bXX8u677wKwaNEivvjFLzJz5kzOPfdcnnrqKT7/+c8zdepUFi1aNCz1OvTNzAbh\n1Vdf5YYbbqC7u5vTTz+dH/3oR3zmM5+hs7OTF198kalTp3LPPfcc6b937142bNjAypUrufrqq7nx\nxhvp7u6mq6uLF154YcjrdeibmQ1Cc3MzF154IQAXXXQR27dvZ8uWLVx++eW0tLTwwAMPHLWM8tVX\nX40kWlpaOOuss2hpaWHUqFFMnz59WL7Y5dA3MxuEw8siA4wePZre3l4WLVrEnXfeSVdXF7feeusx\nl1EuPXYollHui0PfzKzK3n77bc4++2wOHDjAAw88UOtyjuJbNs2sPuS4xXK43H777cyYMYMPfehD\ntLS08Pbbb9e6pCO8tHIJL61sNnKcqEsrD4chX1pZUrukVyT1SFrax/4rJD0nqVfS3D72nyrp15Lu\nzPN8ZmY2NCqGvqTRwF3AbGAasEDStLJuO4BFwIPHOM3twFMDL9PMzKohz5V+G9ATEdsiYj+wBphT\n2iEitkfES8Ch8oMlXQScBfy8CvWamR1xok1PD4fBjjlP6E8EdpZsF7K2iiSNAv4bcFP/SzMzO7bx\n48ezZ8+epII/ItizZw/jx48f8Dny3L2jvp475/n/EuiIiJ1SX6fJnkBaDCwGmDx5cs5Tm1nKGhsb\nKRQK7N69u9alDKvx48fT2Ng44OPzhH4BmFSy3Qjsynn+S4HLJf0lcApwkqR3IuKoD4MjYhWwCop3\n7+Q8t5klbOzYsTQ3N9e6jBEnT+h3AlMkNQO/BuYDn8tz8oj488OPJS0CWssD38zMhk/FOf2I6AWW\nAOuBrcBDEdEtaYWkawAkXSypAMwD7pbUfewzmplZreT6Rm5EdAAdZW3LSx53Upz2Od457gPu63eF\nZmZWNV57x8wsIQ59M7OEOPTNzBLiVTaryAu2mdmJzlf6ZmYJceibmSXEoW9mlhCHvplZQhz6ZmYJ\nceibmSXEoW9mlhCHvplZQvzlrH6o9OUrM7MTna/0zcwS4tA3M0tIUtM7np4xs9T5St/MLCG5Ql9S\nu6RXJPVI+qPfuJV0haTnJPVKmlvSfqGkjZK6Jb0k6bPVLN7MzPqnYuhLGg3cBcwGpgELJE0r67YD\nWAQ8WNb+LvAXETEdaAf+QdLpgy3azMwGJs+cfhvQExHbACStAeYALx/uEBHbs32HSg+MiF+WPN4l\n6Q2gAfjdoCs3M7N+yzO9MxHYWbJdyNr6RVIbcBLwWh/7FkvaJGnT7t27+3tqMzPLKU/oq4+26M+T\nSDobuB/4DxFxqHx/RKyKiNaIaG1oaOjPqc3MrB/yhH4BmFSy3QjsyvsEkk4FHgX+S0Q83b/yzMys\nmvKEficwRVKzpJOA+cC6PCfP+v8Y+H5E/I+Bl2lmZtVQMfQjohdYAqwHtgIPRUS3pBWSrgGQdLGk\nAjAPuFtSd3b4vweuABZJeiH7c+GQjMTMzCrK9Y3ciOgAOsralpc87qQ47VN+3A+AHwyyRjMzqxJ/\nI9fMLCEOfTOzhDj0zcwS4tA3M0uIQ9/MLCFJrac/WJfsWHXc/U9PXjxMlZiZDYyv9M3MEuLQNzNL\niEPfzCwhDn0zs4Q49M3MEuLQNzNLiEPfzCwhvk+/RKX78M3MRjpf6ZuZJcShb2aWEIe+mVlCcoW+\npHZJr0jqkbS0j/1XSHpOUq+kuWX7Fkp6NfuzsFqFm5lZ/1UMfUmjgbuA2cA0YIGkaWXddgCLgAfL\njj0DuBWYAbQBt0r6wODLNjOzgchzpd8G9ETEtojYD6wB5pR2iIjtEfEScKjs2E8Bj0XEmxGxF3gM\naK9C3WZmNgB5Qn8isLNku5C15ZHrWEmLJW2StGn37t05T21mZv2VJ/TVR1vkPH+uYyNiVUS0RkRr\nQ0NDzlOb
mVl/5Qn9AjCpZLsR2JXz/IM51szMqixP6HcCUyQ1SzoJmA+sy3n+9cAsSR/IPsCdlbWZ\nmVkNVAz9iOgFllAM663AQxHRLWmFpGsAJF0sqQDMA+6W1J0d+yZwO8U3jk5gRdZmZmY1kGvtnYjo\nADrK2paXPO6kOHXT17H3AvcOokYzM6sSfyPXzCwhDn0zs4Q49M3MEuLQNzNLiEPfzCwhDn0zs4Q4\n9M3MEuLQNzNLiEPfzCwhDn0zs4TkWobBqmPlY7885r4bP3n+MFZiZqnylb6ZWUIc+mZmCfH0ThVd\nsmPVcfc/PXnxMFViZtY3X+mbmSXEoW9mlpBcoS+pXdIrknokLe1j/zhJa7P9z0hqytrHSlotqUvS\nVknLqlu+mZn1R8XQlzQauAuYDUwDFkiaVtbtemBvRJwHrATuyNrnAeMiogW4CPiPh98QzMxs+OW5\n0m8DeiJiW0TsB9YAc8r6zAFWZ48fBj4uSUAA75M0BjgZ2A/8a1UqNzOzfssT+hOBnSXbhaytzz7Z\nD6m/BUyg+Abwe+A3wA7g7/zD6GZmtZMn9NVHW+Ts0wYcBM4BmoG/kXTuHz2BtFjSJkmbdu/enaMk\nMzMbiDz36ReASSXbjcCuY/QpZFM5pwFvAp8DfhYRB4A3JP0z0ApsKz04IlYBqwBaW1vL31D658lv\nHGfntYM6tZnZSJfnSr8TmCKpWdJJwHxgXVmfdcDC7PFcYENEBMUpnStV9D7gEuBfqlO6mZn1V8XQ\nz+bolwDrga3AQxHRLWmFpGuybvcAEyT1AH8NHL6t8y7gFGALxTeP/x4RL1V5DGZmllOuZRgiogPo\nKGtbXvJ4H8XbM8uPe6evdjMzqw1/I9fMLCEOfTOzhDj0zcwS4tA3M0uIQ9/MLCEOfTOzhDj0zcwS\n4tA3M0uIQ9/MLCEOfTOzhORahsGq45Idq46z9++GrQ4zS5ev9M3MEuLQNzNLiEPfzCwhDn0zs4Q4\n9M3MEuLQNzNLSK7Ql9Qu6RVJPZKW9rF/nKS12f5nJDWV7PuIpI2SuiV1SRpfvfLNzKw/Koa+pNEU\nf+t2NjANWCBpWlm364G9EXEesBK4Izt2DPAD4D9FxHTgY8CBqlVvZmb9kudKvw3oiYhtEbEfWAPM\nKeszB1idPX4Y+LgkAbOAlyLiRYCI2BMRB6tTupmZ9Vee0J8I7CzZLmRtffaJiF7gLWACcD4QktZL\nek7SzYMv2czMBirPMgzqoy1y9hkDXAZcDLwLPCFpc0Q8cdTB0mJgMcDkyZNzlGRmZgOR50q/AEwq\n2W4Edh2rTzaPfxrwZtb+VET8NiLeBTqAPyl/gohYFRGtEdHa0NDQ/1GYmVkueUK/E5giqVnSScB8\nYF1Zn3XAwuzxXGBDRASwHviIpH+TvRn8W+Dl6pRuZmb9VXF6JyJ6JS2hGOCjgXsjolvSCmBTRKwD\n7gHul9RD8Qp/fnbsXkl/T/GNI4COiHh0iMYysj35jePvn7lseOows7qWa2nliOigODVT2ra85PE+\nYN4xjv0Bxds2zcysxvyNXDOzhDj0zcwS4tA3M0uIQ9/MLCFJ/Ubu8X+j1sys/vlK38wsIQ59M7OE\nOPTNzBLi0DczS4hD38wsIQ59M7OEOPTNzBLi0DczS4hD38wsIUl9I3dE83r7ZlYFDv0TxMZte467\n/9JzJxz/BH5TMLMcPL1jZpaQXKEvqV3SK5J6JC3tY/84SWuz/c9IairbP1nSO5K+Up2yzcxsICqG\nvqTRwF3AbGAasEDStLJu1wN7I+I8YCVwR9n+lcD/Gny5ZmY2GHmu9NuAnojYFhH7gTXAnLI+c4DV\n2eOHgY9LEoCkPwO2Ad3VKdnMzAYqT+hPBHaWbBeytj77REQv8BYwQdL7gFuArx3vCSQtlrRJ0qbd\nu3fnrd3MzPopT+irj7bI2edrwMqIeOd4TxARqyKiNSJaGxoacpRkZmYDke
eWzQIwqWS7Edh1jD4F\nSWOA04A3gRnAXEnfAk4HDknaFxF3DrryY6h066OZWcryhH4nMEVSM/BrYD7wubI+64CFwEZgLrAh\nIgK4/HAHSbcB7wxl4JuZ2fFVDP2I6JW0BFgPjAbujYhuSSuATRGxDrgHuF9SD8Ur/PlDWbSZmQ1M\nrm/kRkQH0FHWtrzk8T5gXoVz3DaA+iwz6G/smpnhb+SamSXFoW9mlhCHvplZQhz6ZmYJceibmSXE\noW9mlhCHvplZQhz6ZmYJceibmSXEoW9mlhCHvplZQhz6ZmYJceibmSXEoW9mlhCHvplZQhz6ZmYJ\nyfUjKpLagX+k+MtZ34uIb5btHwd8H7gI2AN8NiK2S/ok8E3gJGA/cFNEbKhi/ZbXk984/v6Zy4an\nDjOrqYpX+pJGA3cBs4FpwAJJ08q6XQ/sjYjzgJXAHVn7b4GrI6KF4m/o3l+tws3MrP/yTO+0AT0R\nsS0i9gNrgDllfeYAq7PHDwMfl6SIeD4idmXt3cD47F8FZmZWA3lCfyKws2S7kLX12ScieoG3gPIf\nbb0WeD4i3htYqWZmNlh55vTVR1v0p4+k6RSnfGb1+QTSYmAxwOTJk3OUZGZmA5HnSr8ATCrZbgR2\nHauPpDHAacCb2XYj8GPgLyLitb6eICJWRURrRLQ2NDT0bwRmZpZbniv9TmCKpGbg18B84HNlfdZR\n/KB2IzAX2BARIel04FFgWUT8c/XKtqrz3T1mSah4pZ/N0S8B1gNbgYciolvSCknXZN3uASZI6gH+\nGliatS8BzgP+q6QXsj8frPoozMwsl1z36UdEB9BR1ra85PE+YF4fx/0t8LeDrNHMzKokV+ibDYqn\njsxOGA59y+d4we3QNhsxHPo2eJWu5M3shOEF18zMEuLQNzNLiEPfzCwhntOvExu37Tnu/kvPLV8K\nycxS5Ct9M7OEOPTNzBLi0DczS4hD38wsIf4g12rPyzSYDRtf6ZuZJcShb2aWEE/vJKLSffyV1PQ+\n/8FM/3jqyOwoDn0b+bzgm1lunt4xM0tIrit9Se3APwKjge9FxDfL9o8Dvg9cBOwBPhsR27N9y4Dr\ngYPAf46I9VWr3mywBvuvBE8P2QhTMfQljQbuAj4JFIBOSesi4uWSbtcDeyPiPEnzgTuAz0qaRvGH\n1KcD5wCPSzo/Ig5WeyBmNTGYN41Kbxj+PMKGQJ4r/TagJyK2AUhaA8wBSkN/DnBb9vhh4E5JytrX\nRMR7wK+yH05vAzZWp3wbLoP9IPh4Kn1IPJjnruuF5k7kf6X4DeuElSf0JwI7S7YLwIxj9YmIXklv\nAROy9qfLjp044GrN6smJ/AF0rUN7MD/PWevaj+cEqC1P6KuPtsjZJ8+xSFoMLM4235H0So66juVM\n4LeDOH4kSm3MqY0XhmTMX63RsbmPP8aYh+W5a+Srg/nv/KE8nfKEfgGYVLLdCOw6Rp+CpDHAacCb\nOY8lIlYBq/IUXImkTRHRWo1zjRSpjTm18YLHnIrhGHOeWzY7gSmSmiWdRPGD2XVlfdYBC7PHc4EN\nERFZ+3xJ4yQ1A1OAZ6tTupmZ9VfFK/1sjn4JsJ7iLZv3RkS3pBXApohYB9wD3J99UPsmxTcGsn4P\nUfzQtxe4wXfumJnVTq779COiA+goa1te8ngfMO8Yx34d+PogauyvqkwTjTCpjTm18YLHnIohH7OK\nszBmZpYCL8NgZpaQugl9Se2SXpHUI2lpresZCpLulfSGpC0lbWdIekzSq9nfH6hljdUmaZKkJyVt\nldQt6a+y9rodt6Txkp6V9GI25q9l7c2SnsnGvDa7saJuSBot6XlJ/zPbruvxAkjaLqlL0guSNmVt\nQ/rarovQL1kqYjYwDViQLQFRb+4D2svalgJPRMQU4Ilsu570An8TEVOBS4Absv+29Tzu94ArI+IC\n4EKgXdIlFJc3WZmNeS/F5U/qyV8BW0
u26328h82MiAtLbtUc0td2XYQ+JUtFRMR+4PBSEXUlIv4P\nxbujSs0BVmePVwN/NqxFDbGI+E1EPJc9fptiKEykjscdRe9km2OzPwFcSXGZE6izMUtqBD4NfC/b\nFnU83gqG9LVdL6Hf11IRqSz3cFZE/AaKAQl8sMb1DBlJTcBHgWeo83FnUx0vAG8AjwGvAb+LiN6s\nS729xv8BuBk4lG1PoL7He1gAP5e0OVuZAIb4tV0vP6KSa7kHG7kknQL8CPhyRPxr8UKwfmXfZ7lQ\n0unAj4GpfXUb3qqGhqSrgDciYrOkjx1u7qNrXYy3zJ9GxC5JHwQek/QvQ/2E9XKln2u5hzr1uqSz\nAbK/36hxPVUnaSzFwH8gIh7Jmut+3AAR8Tvgf1P8POP0bJkTqK/X+J8C10jaTnFq9kqKV/71Ot4j\nImJX9vcbFN/c2xji13a9hH6epSLqVekSGAuBn9SwlqrL5nbvAbZGxN+X7KrbcUtqyK7wkXQy8AmK\nn2U8SXGZE6ijMUfEsohojIgmiv/vboiIP6dOx3uYpPdJev/hx8AsYAtD/Nqumy9nSfp3FK8ODi8V\nMZzfAh4Wkn4IfIzi6oOvA7cC/wQ8BEwGdgDzIqL8w94RS9JlwC+ALv7/fO9XKc7r1+W4JX2E4gd4\noylemD0UESsknUvxSvgM4Hnguuy3KupGNr3zlYi4qt7Hm43vx9nmGODBiPi6pAkM4Wu7bkLfzMwq\nq5fpHTMzy8Ghb2aWEIe+mVlCHPpmZglx6JuZJcShb2aWEIe+mVlCHPpmZgn5fyjZgnDU1A4AAAAA\nAElFTkSuQmCC\n",
"text/plain": [
"<matplotlib.figure.Figure at 0x229015597f0>"
]
},
"metadata": {},
"output_type": "display_data"
}
],
"source": [
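"# Compare the distribution of punctuation percentage for spam vs. ham.\n",
"# Assumes `data` also has a 'punct%' column.\n",
"bins = np.linspace(0, 50, 40)\n",
"\n",
"# density=True replaces the deprecated `normed` argument, which newer Matplotlib releases no longer accept.\n",
"plt.hist(data[data['label']=='spam']['punct%'], bins, alpha=0.5, density=True, label='spam')\n",
"plt.hist(data[data['label']=='ham']['punct%'], bins, alpha=0.5, density=True, label='ham')\n",
"plt.legend(loc='upper right')\n",
"plt.show()"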
]
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.7.2"
},
"toc": {
"base_numbering": 1,
"nav_menu": {},
"number_sections": false,
"sideBar": true,
"skip_h1_title": false,
"title_cell": "Table of Contents",
"title_sidebar": "Contents",
"toc_cell": false,
"toc_position": {},
"toc_section_display": true,
"toc_window_display": false
},
"varInspector": {
"cols": {
"lenName": 16,
"lenType": 16,
"lenVar": 40
},
"kernels_config": {
"python": {
"delete_cmd_postfix": "",
"delete_cmd_prefix": "del ",
"library": "var_list.py",
"varRefreshCmd": "print(var_dic_list())"
},
"r": {
"delete_cmd_postfix": ") ",
"delete_cmd_prefix": "rm(",
"library": "var_list.r",
"varRefreshCmd": "cat(var_dic_list()) "
}
},
"types_to_exclude": [
"module",
"function",
"builtin_function_or_method",
"instance",
"_Feature"
],
"window_display": false
}
},
"nbformat": 4,
"nbformat_minor": 2
}