Skip to content

Instantly share code, notes, and snippets.

View HabibMrad's full-sized avatar
🏠
Working from home (Always ready to help)

Habib Mrad HabibMrad

🏠
Working from home (Always ready to help)
  • Beirut - Lebanon
  • 10:55 (UTC +02:00)
View GitHub Profile
@HabibMrad
HabibMrad / quiz1.md
Created November 28, 2017 03:57 — forked from mGalarnyk/quiz1.md
The Data Scientist's Toolbox Quiz 1 (Week 1) John Hopkins Data Science Specialization Coursera for the github repo https://github.com/mGalarnyk/datasciencecoursera/tree/master/1_Data_Scientist_Toolbox

The Data Scientist's Toolbox Quiz 1 (JHU) Coursera

github repo for rest of specialization: Data Science Coursera

Question 1

Which of the following are courses in the Data Science Specialization? Select all that apply:

  • Business Analytics
  • Python Programming
@HabibMrad
HabibMrad / quiz3.md
Created November 28, 2017 03:59 — forked from mGalarnyk/quiz3.md
The Data Scientist's Toolbox Quiz 3 (Week 3) John Hopkins Data Science Specialization Coursera for the github repo https://github.com/mGalarnyk/datasciencecoursera/tree/master/1_Data_Scientist_Toolbox

The Data Scientist's Toolbox Quiz 3 (JHU) Coursera

github repo for rest of specialization: Data Science Coursera

Question 1

We take a random sample of individuals in a population and identify whether they smoke and if they have cancer. We observe that there is a strong relationship between whether a person in the sample smoked or not and whether they have lung cancer. We claim that the smoking is related to lung cancer in the larger population. We explain we think that the reason for this relationship is because cigarette smoke contains known carcinogens such as arsenic and benzene, which make cells in the lungs become cancerous.

  • This is an example of a causal data analysis.
@HabibMrad
HabibMrad / lecture11.md
Created June 5, 2018 16:15 — forked from Y4suyuki/lecture11.md
Lecture note of Mathematical Biostatistics Boot Camp 1 by Brian Caffo, PhD at Coursera https://www.coursera.org/course/biostats

Plotting Data

Histgram

Histogram display a sample estimate of the density or mass function by plotting a bar graph of the frequency or proportion of times that a variable takes specific values, or a range of values for continuous data, within a sample

Pros and Cons

  • Histograms are useful and easy, apply to continuous, discrete and even unordered data
  • They use a lot of ink and space to display very little information
  • It's difficult to display several at the same time for comparisons Also, for this data it's probably preferable to consider log base 10, since the raw histogram simplay says that most islands are small

Stem and leaf plot

@HabibMrad
HabibMrad / machineLearningWeek3Quiz1.md
Created August 8, 2018 23:57 — forked from mGalarnyk/machineLearningWeek3Quiz1.md
Machine Learning (Stanford) Coursera Logistic Regression Quiz (Week 3, Quiz 1) for the github repo: https://github.com/mGalarnyk/datasciencecoursera/tree/master/Stanford_Machine_Learning

Machine Learning Week 3 Quiz 1 (Logistic Regression) Stanford Coursera

Github repo for the Course: Stanford Machine Learning (Coursera)
Quiz Needs to be viewed here at the repo (because the image solutions cant be viewed as part of a gist)

Question 1

Answer | Explanation

@HabibMrad
HabibMrad / IntroToSQLForDataScience_Notes.md
Created January 1, 2019 23:56 — forked from maptastik/IntroToSQLForDataScience_Notes.md
Notes from DataCamp's "Intro to SQL for Data Science" course

Get a list of distinct values for a column in a table

SELECT DISTINCT column FROM table;

Get the count of rows in a table

SELECT COUNT(*) FROM table;
@HabibMrad
HabibMrad / min-char-rnn.py
Created May 30, 2019 10:19 — forked from karpathy/min-char-rnn.py
Minimal character-level language model with a Vanilla Recurrent Neural Network, in Python/numpy
"""
Minimal character-level Vanilla RNN model. Written by Andrej Karpathy (@karpathy)
BSD License
"""
import numpy as np
# data I/O
data = open('input.txt', 'r').read() # should be simple plain text file
chars = list(set(data))
data_size, vocab_size = len(data), len(chars)

Deep Learning Papers Reading Roadmap

If you are a newcomer to the Deep Learning area, the first question you may have is "Which paper should I start reading from?"

Here is a reading roadmap of Deep Learning papers!

The roadmap is constructed in accordance with the following four guidelines:

  • From outline to detail
  • From old to state-of-the-art
@HabibMrad
HabibMrad / image-processing-with-keras-in-python.ipynb
Created July 18, 2020 17:29
Image Processing with Keras in Python.ipynb
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@HabibMrad
HabibMrad / Data Manipulation with pandas.md
Created July 20, 2020 11:51 — forked from misho-kr/Data Manipulation with pandas.md
Summary of "Data Manipulation with pandas" course on Datacamp

pandas is the world's most popular Python library, used for everything from data manipulation to data analysis. Learn how to manipulate DataFrames, as you extract, filter, and transform real-world datasets for analysis. Using real-world data, including Walmart sales figures and global temperature time series, you’ll learn how to import, clean, calculate statistics, and create visualizations—using pandas!

Lead by Maggie Matsui, Data Scientist at DataCamp

Transforming Data

Inspect DataFrames and perform fundamental manipulations, including sorting rows, subsetting, and adding new columns

@HabibMrad
HabibMrad / readme.md
Created July 20, 2020 14:18 — forked from aaizemberg/readme.md
Data Scientist with Python (DataCamp)