Skip to content

Instantly share code, notes, and snippets.

View NataliiaRastoropova's full-sized avatar

Nataliia Rastoropova NataliiaRastoropova

View GitHub Profile
fixed acidity volatile acidity citric acid residual sugar chlorides free sulfur dioxide total sulfur dioxide density pH sulphates alcohol quality
1599.000 1599.000 1599.000 1599.000 1599.000 1599.000 1599.000 1599.000 1599.000 1599.000 1599.000 1599.000
8.320 0.528 0.271 2.539 0.087 15.875 46.468 0.997 3.311 0.658 10.423 5.636
1.741 0.179 0.195 1.410 0.047 10.460 32.895 0.002 0.154 0.170 1.066 0.808
4.600 0.120 0.000 0.900 0.012 1.000 6.000 0.990 2.740 0.330 8.400 3.000
7.100 0.390 0.090 1.900 0.070 7.000 22.000 0.996 3.210 0.550 9.500 5.000
7.900 0.520 0.260 2.200 0.079 14.000 38.000 0.997 3.310 0.620 10.200 6.000
9.200 0.640 0.420 2.600 0.090 21.000 62.000 0.998 3.400 0.730 11.100 6.000
15.900 1.580 1.000 15.500 0.611 72.000 289.000 1.004 4.010 2.000 14.900 8.000
fixed acidity volatile acidity citric acid residual sugar chlorides free sulfur dioxide total sulfur dioxide density pH sulphates alcohol quality
1599.0 1599.0 1599.0 1599.0 1599.0 1599.0 1599.0 1599.0 1599.0 1599.0 1599.0 1599.0
8.31963727329581 0.5278205128205128 0.2709756097560976 2.53880550343965 0.08746654158849279 15.874921826141339 46.46779237023139 0.9967466791744841 3.3111131957473416 0.6581488430268917 10.422983114446529 5.6360225140712945
1.7410963181276953 0.17905970415353537 0.19480113740531857 1.4099280595072798 0.0470653020100901 10.460156969809725 32.895324478299074 0.0018873339538425554 0.15438646490354277 0.16950697959010996 1.0656675818473946 0.807569439734705
4.6 0.12 0.0 0.9 0.012 1.0 6.0 0.99007 2.74 0.33 8.4 3.0
7.1 0.39 0.09 1.9 0.07 7.0 22.0 0.9956 3.21 0.55 9.5 5.0
7.9 0.52 0.26 2.2 0.079 14.0 38.0 0.99675 3.31 0.62 10.2 6.0
9.2 0.64 0.42 2.6 0.09 21.0 62.0 0.997835 3.4 0.73 11.1 6.0
15.9 1.58 1.0 15.5 0.611 72.0 289.0 1.00369 4.01 2.0 14.9 8.0
We can make this file beautiful and searchable if this error is corrected: It looks like row 7 should actually have 12 columns, instead of 5 in line 6.
,alcohol,chlorides,citric acid,density,fixed acidity,free sulfur dioxide,pH,residual sugar,sulphates,total sulfur dioxide,volatile acidity
quality,,,,,,,,,,,
3,9.955000000000002,0.12250000000000001,0.17099999999999999,0.9974640000000001,8.36,11.0,3.3979999999999997,2.6350000000000002,0.5700000000000001,24.9,0.8845000000000001
4,10.265094339622639,0.09067924528301885,0.1741509433962264,0.9965424528301886,7.779245283018868,12.264150943396226,3.381509433962264,2.69433962264151,0.5964150943396227,36.24528301886792,0.6939622641509429
5,9.899706314243753,0.09273568281938328,0.24368575624082198,0.9971036270190888,8.167254038179149,16.983847283406753,3.3049486049926546,2.528854625550658,0.6209691629955947,56.51395007342144,0.5770411160058732
6,10.629519331243463,0.08495611285266458,0.2738244514106587,0.9966150626959255,8.347178683385575,15.711598746081505,3.3180721003134837,2.477194357366772,0.6753291536050158,40.86990595611285,0.49748432601880965
7,11.465912897822443,0.07658793969849244,0.37517587939698493,0.9961042
We can make this file beautiful and searchable if this error is corrected: It looks like row 2 should actually have 12 columns, instead of 10 in line 1.
,alcohol,chlorides,citric acid,density,fixed acidity,free sulfur dioxide,pH,residual sugar,sulphates,total sulfur dioxide,volatile acidity
quality,,,,,,,,,
3,9.955000000000002,0.12250000000000001,0.17099999999999999,0.9974640000000001,8.36,11.0,3.3979999999999997,2.6350000000000002,0.5700000000000001,24.9,0.8845000000000001
4,10.265094339622639,0.09067924528301885,0.1741509433962264,0.9965424528301886,7.779245283018868,12.264150943396226,3.381509433962264,2.69433962264151,0.5964150943396227,36.24528301886792,0.6939622641509429
5,9.899706314243753,0.09273568281938328,0.24368575624082198,0.9971036270190888,8.167254038179149,16.983847283406753,3.3049486049926546,2.528854625550658,0.6209691629955947,56.51395007342144,0.5770411160058732
6,10.629519331243463,0.08495611285266458,0.2738244514106587,0.9966150626959255,8.347178683385575,15.711598746081505,3.3180721003134837,2.477194357366772,0.6753291536050158,40.86990595611285,0.49748432601880965
7,11.465912897822443,0.07658793969849244,0.37517587939698493,0.996104271
We can make this file beautiful and searchable if this error is corrected: It looks like row 2 should actually have 12 columns, instead of 13 in line 1.
,alcohol,chlorides,citric acid,density,fixed acidity,free sulfur dioxide,pH,residual sugar,sulphates,total sulfur dioxide,volatile acidity
quality,,,,,,,,,,,,
3,9.955000000000002,0.12250000000000001,0.17099999999999999,0.9974640000000001,8.36,11.0,3.3979999999999997,2.6350000000000002,0.5700000000000001,24.9,0.8845000000000001
4,10.265094339622639,0.09067924528301885,0.1741509433962264,0.9965424528301886,7.779245283018868,12.264150943396226,3.381509433962264,2.69433962264151,0.5964150943396227,36.24528301886792,0.6939622641509429
5,9.899706314243753,0.09273568281938328,0.24368575624082198,0.9971036270190888,8.167254038179149,16.983847283406753,3.3049486049926546,2.528854625550658,0.6209691629955947,56.51395007342144,0.5770411160058732
6,10.629519331243463,0.08495611285266458,0.2738244514106587,0.9966150626959255,8.347178683385575,15.711598746081505,3.3180721003134837,2.477194357366772,0.6753291536050158,40.86990595611285,0.49748432601880965
7,11.465912897822443,0.07658793969849244,0.37517587939698493,0.996104
We can make this file beautiful and searchable if this error is corrected: It looks like row 2 should actually have 12 columns, instead of 2 in line 1.
,alcohol,chlorides,citric acid,density,fixed acidity,free sulfur dioxide,pH,residual sugar,sulphates,total sulfur dioxide,volatile acidity
quality,
3,9.955000000000002,0.12250000000000001,0.17099999999999999,0.9974640000000001,8.36,11.0,3.3979999999999997,2.6350000000000002,0.5700000000000001,24.9,0.8845000000000001
4,10.265094339622639,0.09067924528301885,0.1741509433962264,0.9965424528301886,7.779245283018868,12.264150943396226,3.381509433962264,2.69433962264151,0.5964150943396227,36.24528301886792,0.6939622641509429
5,9.899706314243753,0.09273568281938328,0.24368575624082198,0.9971036270190888,8.167254038179149,16.983847283406753,3.3049486049926546,2.528854625550658,0.6209691629955947,56.51395007342144,0.5770411160058732
6,10.629519331243463,0.08495611285266458,0.2738244514106587,0.9966150626959255,8.347178683385575,15.711598746081505,3.3180721003134837,2.477194357366772,0.6753291536050158,40.86990595611285,0.49748432601880965
7,11.465912897822443,0.07658793969849244,0.37517587939698493,0.9961042713567828,
We can make this file beautiful and searchable if this error is corrected: It looks like row 6 should actually have 11 columns, instead of 6 in line 5.
alcohol,chlorides,citric acid,density,fixed acidity,free sulfur dioxide,pH,residual sugar,sulphates,total sulfur dioxide,volatile acidity
9.955000000000002,0.12250000000000001,0.17099999999999999,0.9974640000000001,8.36,11.0,3.3979999999999997,2.6350000000000002,0.5700000000000001,24.9,0.8845000000000001
10.265094339622639,0.09067924528301885,0.1741509433962264,0.9965424528301886,7.779245283018868,12.264150943396226,3.381509433962264,2.69433962264151,0.5964150943396227,36.24528301886792,0.6939622641509429
9.899706314243753,0.09273568281938328,0.24368575624082198,0.9971036270190888,8.167254038179149,16.983847283406753,3.3049486049926546,2.528854625550658,0.6209691629955947,56.51395007342144,0.5770411160058732
10.629519331243463,0.08495611285266458,0.2738244514106587,0.9966150626959255,8.347178683385575,15.711598746081505,3.3180721003134837,2.477194357366772,0.6753291536050158,40.86990595611285,0.49748432601880965
11.465912897822443,0.07658793969849244,0.37517587939698493,0.9961042713567828,8.872361809045225,14
Confusion matrix
[[285 2]
[ 18 15]]
Classification report
precision recall f1-score support
0 0.94 0.99 0.97 287
1 0.88 0.45 0.60 33
micro avg 0.94 0.94 0.94 320
SupportVectorClassifier: 0.873364 (0.024056)
StochasticGradientDecentC: 0.838976 (0.034672)
RandomForestClassifier: 0.884289 (0.020586)
DecisionTreeClassifier: 0.848321 (0.037492)
GaussianNB: 0.826446 (0.025753)
KNeighborsClassifier: 0.860845 (0.026717)
AdaBoostClassifier: 0.866351 (0.039970)
LogisticRegression: 0.871014 (0.028362)
alcohol 0.476166
sulphates 0.251397
citric acid 0.226373
fixed acidity 0.124052
residual sugar 0.013732
free sulfur dioxide -0.050656
pH -0.057731
chlorides -0.128907
density -0.174919
total sulfur dioxide -0.185100