title | author | date |
---|---|---|
Reproducible Research Project 1 |
Michael Galarnyk |
3/26/2017 |
github repo for rest of specialization: Data Science Coursera
python --version | |
# you can even add names of packages after python=3.6 like python=3.6 pandas | |
conda create --name subscribe python=3.6 | |
# Remove your environment | |
conda env remove --name subscribe | |
# List your environments | |
conda env list |
title | author | date |
---|---|---|
Reproducible Research Project 1 |
Michael Galarnyk |
3/26/2017 |
github repo for rest of specialization: Data Science Coursera
Suppose I conduct a study and publish my findings. Which of the following is an example of a replication of my study?
An investigator at another institution conducts a study addressing a different scientific question and publishes her findings.
I give my data to an independent investigator at another institution, she analyzes the data and gets the same results as I originally obtained.
A image video link walkthrough of the project + answers is available at the end of this quiz.
github repo for rest of specialization: Data Science Coursera
The Course Project for this course is pretty straightforward. Now that you’ve learned about the basic tools found in the data scientist’s toolbox, we want to make sure that you’ve assembled those tools and are ready to begin using them. This easy project is your chance to demonstrate that you’ve done the basic software setup (R, RStudio, and Github) that will get you through the rest of the Data Science Specialization.
github repo for rest of specialization: Data Science Coursera
We take a random sample of individuals in a population and identify whether they smoke and if they have cancer. We observe that there is a strong relationship between whether a person in the sample smoked or not and whether they have lung cancer. We claim that the smoking is related to lung cancer in the larger population. We explain we think that the reason for this relationship is because cigarette smoke contains known carcinogens such as arsenic and benzene, which make cells in the lungs become cancerous.
github repo for rest of specialization: Data Science Coursera
Which of the following commands will create a directory called data in your current working directory?
github repo for rest of specialization: Data Science Coursera
Which of the following are courses in the Data Science Specialization? Select all that apply:
The American Community Survey distributes downloadable data about United States communities. Download the 2006 microdata survey about housing for the state of Idaho using download.file() from here:
https://d396qusza40orc.cloudfront.net/getdata%2Fdata%2Fss06hid.csv
and load the data into R. The code book, describing the variable names is here: