In Python 3, write a simple program below that calculates the probability of 5% percent of a population having diabetes. You can use this report to collect probabilites.
Using Pandas prepare his dataset to test the probabilities for the reports above: https://archive.ics.uci.edu/ml/datasets/Diabetes+130-US+hospitals+for+years+1999-2008
Using the above dataset train a simple SciKit model to predict the length of stay.
Please share how you would verify and validate your results. Please upload an Markdown file with your results.
Please fork this gist and commit your .py files.