github repo for rest of specialization: Data Science Coursera
Which of the following are courses in the Data Science Specialization? Select all that apply:
- Business Analytics
- Python Programming
github repo for rest of specialization: Data Science Coursera
Which of the following are courses in the Data Science Specialization? Select all that apply:
github repo for rest of specialization: Data Science Coursera
We take a random sample of individuals in a population and identify whether they smoke and if they have cancer. We observe that there is a strong relationship between whether a person in the sample smoked or not and whether they have lung cancer. We claim that the smoking is related to lung cancer in the larger population. We explain we think that the reason for this relationship is because cigarette smoke contains known carcinogens such as arsenic and benzene, which make cells in the lungs become cancerous.
Histogram display a sample estimate of the density or mass function by plotting a bar graph of the frequency or proportion of times that a variable takes specific values, or a range of values for continuous data, within a sample
Github repo for the Course: Stanford Machine Learning (Coursera)
Quiz Needs to be viewed here at the repo (because the image solutions cant be viewed as part of a gist)
Answer | Explanation
Get a list of distinct values for a column in a table
SELECT DISTINCT column FROM table;
Get the count of rows in a table
SELECT COUNT(*) FROM table;
""" | |
Minimal character-level Vanilla RNN model. Written by Andrej Karpathy (@karpathy) | |
BSD License | |
""" | |
import numpy as np | |
# data I/O | |
data = open('input.txt', 'r').read() # should be simple plain text file | |
chars = list(set(data)) | |
data_size, vocab_size = len(data), len(chars) |
If you are a newcomer to the Deep Learning area, the first question you may have is "Which paper should I start reading from?"
Here is a reading roadmap of Deep Learning papers!
The roadmap is constructed in accordance with the following four guidelines:
pandas
is the world's most popular Python library, used for everything from data manipulation to data analysis. Learn how to manipulate DataFrames, as you extract, filter, and transform real-world datasets for analysis. Using real-world data, including Walmart sales figures and global temperature time series, you’ll learn how to import, clean, calculate statistics, and create visualizations—using pandas!
Lead by Maggie Matsui, Data Scientist at DataCamp
Inspect DataFrames and perform fundamental manipulations, including sorting rows, subsetting, and adding new columns
https://learn.datacamp.com/career-tracks/data-scientist-with-python