Michael Galarnyk mGalarnyk

python --version
# Create an environment; you can list extra packages after python=3.6, e.g. python=3.6 pandas
conda create --name subscribe python=3.6
# Remove your environment
conda env remove --name subscribe
# List your environments
conda env list
mGalarnyk / PA1_Template.md
Last active April 20, 2021 09:50
Reproducible Research Project 1, Johns Hopkins Data Science Specialization (Coursera), for the GitHub repo https://github.com/mGalarnyk/datasciencecoursera
Title: Reproducible Research Project 1
Author: Michael Galarnyk
Date: 3/26/2017

GitHub repo for the rest of the specialization: Data Science Coursera

Introduction

mGalarnyk / quiz2.md
Created March 21, 2017 05:06
Reproducible Research Quiz 2, Johns Hopkins Data Science Specialization (Coursera), for the GitHub repo https://github.com/mGalarnyk/datasciencecoursera

Reproducible Research Quiz 2 (JHU) Coursera

Question 1

Who created Markdown?

  • Robert Gentleman

  • Yihui Xie

mGalarnyk / quiz1.md
Last active July 30, 2023 20:53
Reproducible Research Quiz 1, Johns Hopkins Data Science Specialization (Coursera), for the GitHub repo https://github.com/mGalarnyk/datasciencecoursera

Reproducible Research Quiz 1 (JHU) Coursera

Question 1

Suppose I conduct a study and publish my findings. Which of the following is an example of a replication of my study?

  • An investigator at another institution conducts a study addressing a different scientific question and publishes her findings.

  • I give my data to an independent investigator at another institution, she analyzes the data and gets the same results as I originally obtained.

mGalarnyk / quiz1.md
Created March 15, 2017 03:16
Regression Models Quiz 1 (Week 1), Johns Hopkins Data Science Specialization (Coursera), for the GitHub repo https://github.com/mGalarnyk/datasciencecoursera/tree/master/7_Regression_Models

Regression Models Quiz 1 (JHU) Coursera

Question 1

Consider the data set given below

x <- c(0.18, -1.54, 0.42, 0.95)
mGalarnyk / project1.md
Last active April 12, 2023 11:18
The Data Scientist's Toolbox Project, Johns Hopkins Data Science Specialization (Coursera), for the GitHub repo https://github.com/mGalarnyk/datasciencecoursera

The Data Scientist's Toolbox Project (JHU) Coursera

A video walkthrough of the project and the answers is linked at the end of this quiz.

GitHub repo for the rest of the specialization: Data Science Coursera

Part 1

The Course Project for this course is pretty straightforward. Now that you’ve learned about the basic tools found in the data scientist’s toolbox, we want to make sure that you’ve assembled those tools and are ready to begin using them. This easy project is your chance to demonstrate that you’ve done the basic software setup (R, RStudio, and GitHub) that will get you through the rest of the Data Science Specialization.

  1. Install R
mGalarnyk / quiz3.md
Last active April 19, 2023 03:50
The Data Scientist's Toolbox Quiz 3 (Week 3), Johns Hopkins Data Science Specialization (Coursera), for the GitHub repo https://github.com/mGalarnyk/datasciencecoursera/tree/master/1_Data_Scientist_Toolbox

The Data Scientist's Toolbox Quiz 3 (JHU) Coursera

GitHub repo for the rest of the specialization: Data Science Coursera

Question 1

We take a random sample of individuals in a population and identify whether they smoke and whether they have cancer. We observe that there is a strong relationship between whether a person in the sample smoked or not and whether they have lung cancer. We claim that smoking is related to lung cancer in the larger population. We explain that we think the reason for this relationship is that cigarette smoke contains known carcinogens such as arsenic and benzene, which make cells in the lungs become cancerous.

  • This is an example of a causal data analysis.
mGalarnyk / quiz2.md
Last active September 16, 2022 07:36
The Data Scientist's Toolbox Quiz 2 (Week 2), Johns Hopkins Data Science Specialization (Coursera), for the GitHub repo https://github.com/mGalarnyk/datasciencecoursera/tree/master/1_Data_Scientist_Toolbox

The Data Scientist's Toolbox Quiz 2 (JHU) Coursera

GitHub repo for the rest of the specialization: Data Science Coursera

Question 1

Which of the following commands will create a directory called data in your current working directory?

  • mkdir /Users/data
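
As a rough illustration of the distinction this question turns on, the sketch below assumes a POSIX shell. The scratch directory from mktemp and the variable name workdir are assumptions added for the example, not part of the quiz. A path with no leading slash is resolved against the current working directory, whereas a path such as /Users/data is absolute and is created at that fixed location no matter where the command is run.

```shell
# Work in a throwaway directory so nothing real is touched
# (workdir is a hypothetical name used only for this sketch).
workdir="$(mktemp -d)"
cd "$workdir"

# No leading slash: the path is relative, so the directory is
# created inside the current working directory.
mkdir data

# Show the full path of the directory that was just created.
ls -d "$PWD/data"
```

Running pwd before and after the mkdir call shows that the working directory itself does not change; only a new child directory appears in it.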
mGalarnyk / quiz1.md
Last active June 5, 2020 19:26
The Data Scientist's Toolbox Quiz 1 (Week 1), Johns Hopkins Data Science Specialization (Coursera), for the GitHub repo https://github.com/mGalarnyk/datasciencecoursera/tree/master/1_Data_Scientist_Toolbox

The Data Scientist's Toolbox Quiz 1 (JHU) Coursera

GitHub repo for the rest of the specialization: Data Science Coursera

Question 1

Which of the following are courses in the Data Science Specialization? Select all that apply:

  • Business Analytics
  • Python Programming
mGalarnyk / quiz4.md
Created March 7, 2017 18:15
Getting and Cleaning Data Quiz 4 (Week 4), Johns Hopkins Data Science Specialization (Coursera), for the GitHub repo https://github.com/mGalarnyk/datasciencecoursera/tree/master/3_Getting_and_Cleaning_Data

Getting and Cleaning Data Quiz 4 (JHU) Coursera

Question 1

The American Community Survey distributes downloadable data about United States communities. Download the 2006 microdata survey about housing for the state of Idaho using download.file() from here:

https://d396qusza40orc.cloudfront.net/getdata%2Fdata%2Fss06hid.csv

and load the data into R. The code book describing the variable names is here: