Skip to content

Instantly share code, notes, and snippets.

@odubno
Created October 26, 2014 21:31
Show Gist options
  • Save odubno/e0e0c939b8a8a9159487 to your computer and use it in GitHub Desktop.
Save odubno/e0e0c939b8a8a9159487 to your computer and use it in GitHub Desktop.
Project 2 Data Problem and Hypothesis - Loans
I got a new data set from https://www.lendingclub.com/info/download-data.action
My new dataset deals with loans. There's lots of info and the column headings that I will focus on and work with are 'loan status', 'total current payment', 'annual income', 'employment length' and 'funded amount'.
Using logistic regression I want to see which factor is most responible for people paying back their loans. I'm also currious to see what causes people to get higher loans. High annual income might be the more obvious answer, maybe there are other factors.
I believe that 'employment length' and 'annual income' could predict the status of a loan.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment