Skip to content

Instantly share code, notes, and snippets.

View misho-kr's full-sized avatar

Misho Krastev misho-kr

  • San Jose, California
View GitHub Profile
@misho-kr
misho-kr / Cleaning Data in Python.md
Last active November 2, 2021 03:34
Summary of "Cleaning Data in Python" course on Datacamp

It is commonly said that data scientists spend 80% of their time cleaning and manipulating data, and only 20% of their time actually analyzing it. This course will equip you with all the skills you need to clean your data in Python, from learning how to diagnose problems in your data, to dealing with missing values and outliers.

Lead by Daniel Chen, Data Science Consultant at Lander Analytics

Exploring your data

You've gotten your hands on a brand new dataset and are itching to start exploring it. How can you be sure your dataset is clean? You'll learn how to explore your data with an eye for diagnosing issues such as outliers, missing values, and duplicate rows.

@misho-kr
misho-kr / Intermediate Importing Data in Python.md
Last active February 2, 2020 05:59
Summary of "Intermediate Importing Data in Python" course on Datacamp

In this course, you'll extend this knowledge base by learning to import data from the web and by pulling data from Application Programming Interfaces— APIs—such as the Twitter streaming API, which allows us to stream real-time tweets.

Lead by Hugo Bowne-Anderson, Data Scientist at DataCamp

Importing data from the Internet

The web is a rich source of data from which you can extract various types of insights and findings. In this chapter, you will learn how to get data from the web, whether it is stored in files or in HTML. You'll also learn the basics of scraping and parsing web data.

@misho-kr
misho-kr / Introduction to Importing Data in Python.md
Last active February 2, 2020 05:25
Summary of "Introduction to Importing Data in Python" course on Datacamp

As a data scientist, you will need to clean data, wrangle and munge it, visualize it, build predictive models, and interpret these models. Before you can do so, however, you will need to know how to get data into Python. In this course, you'll learn the many ways to import data into Python: from flat files such as .txt and .csv; from files native to other software such as Excel spreadsheets, Stata, SAS, and MATLAB files; and from relational databases such as SQLite and PostgreSQL.

Lead by Hugo Bowne-Anderson, Data Scientist at DataCamp

Introduction and flat files

In this chapter, you'll learn how to import data into Python from all types of flat files, which are a simple and prevalent form of data storage. You've previously learned how to use NumPy and pandas—you will learn how to use these packages to impor

@misho-kr
misho-kr / Python Data Science Toolbox (Part 2).md
Last active November 2, 2019 05:36
Summary of "Python Data Science Toolbox (Part 2)" course on Datacamp

In this second Python Data Science Toolbox course, you'll continue to build your Python data science skills. First, you'll learn about iterators, objects you have already encountered in the context of for loops. You'll then learn about list comprehensions, which are extremely handy tools for all data scientists working in Python. You'll end the course by working through a case study in which you'll apply all the techniques you learned in both parts of this course.

Using iterators in PythonLand

You'll learn all about iterators and iterables, which you have already worked with when writing for loops.

  • Iterators and iterables, iter() and next()
@misho-kr
misho-kr / Python Data Science Toolbox (Part 1).md
Last active October 27, 2019 09:26
Summary of "Python Data Science Toolbox (Part 1)" course on Datacamp

It's time to push forward and develop your Python chops even further. There are tons of fantastic functions in Python and its library ecosystem. However, as a data scientist, you'll constantly need to write your own functions to solve problems that are dictated by your data. You will learn the art of function writing in this first Python Data Science Toolbox course. You'll come out of this course being able to write your very own custom functions, complete with multiple parameters and multiple return values, along with default arguments and variable-length arguments. You'll gain insight into scoping in Python and be able to write lambda functions and handle errors in your function writing practice. And you'll wrap up each chapter by using your new skills to write functions that analyze Twitter DataFrames.

Lead by [Hugo Bowne-Anderson](https://www.datacamp.com/ins

@misho-kr
misho-kr / Intermediate Python for Data Science.md
Last active October 27, 2019 08:22
Summary of "Intermediate Python for Data Science" course on DataCamp

Intermediate Python for Data Science is crucial for any aspiring data science practitioner learning Python. Learn to visualize real data with Matplotlib's functions and get acquainted with data structures such as the dictionary and the pandas DataFrame. After covering key concepts such as boolean logic, control flow, and loops in Python, you'll be ready to blend together everything you've learned to solve a case study using hacker statistics.

Lead by Filip Schouwenaars, Data Science Instructor at DataCamp

Matplotlib

Data visualization is a key skill for aspiring data scientists. Matplotlib makes it easy to create meaningful and insightful plots. In this chapter, you’ll learn how to build various types of plots, and customize them to be more visually appealing and interpretable.

@misho-kr
misho-kr / Introduction to Python.md
Last active October 22, 2019 04:21
Summary of "Introduction to Python" course on DataCamp

Python is a general-purpose programming language that is becoming ever more popular for data science. Companies worldwide are using Python to harvest insights from their data and gain a competitive edge. Unlike other Python tutorials, this course focuses on Python specifically for data science. In our Introduction to Python course, you’ll learn about powerful ways to store and manipulate data, and helpful data science tools to begin conducting your own analyses. Start DataCamp’s online Python curriculum now.

Lead by Hugo Bowne-Anderson, Data Scientist at DataCamp

Python Basics

An introduction to the basic concepts of Python. Learn how to use Python interactively and by using a script. Create your first variables and acquaint yourself with Python's basic data types.

@misho-kr
misho-kr / DataCamp.md
Last active January 9, 2025 06:55
DataCamp Courses and Career Tracks

DataCamp Courses and Career Tracks

A Data Scientist combines statistical and machine learning techniques with Python programming to analyze and interpret complex data

Python | 100 Hours | 26 Courses

@misho-kr
misho-kr / Getting Started with Google Kubernetes Engine.md
Last active October 31, 2019 19:46
Summary of "Getting Started with Google Kubernetes Engine"

This one-week, accelerated online class equips students to containerize workloads in Docker containers, deploy them to Kubernetes clusters provided by Google Kubernetes Engine, and scale those workloads to handle increased traffic.

Introduction to Containers and Docker

Acquaint yourself with containers, Docker, and the Google Container Registry.

  • In this lab, you learn how to:
@misho-kr
misho-kr / Introduction to Linear Modeling in Python.md
Last active March 25, 2020 19:54
Introduction to Linear Modeling in Python

Lecturer

Jason Vestuto, Data Scientist, University of Texas at Austin

Exploring Linear Trends

We start the course with an initial exploration of linear relationships, including some motivating examples of how linear models are used, and demonstrations of data visualization methods from matplotlib. We then use descriptive statistics to quantify the shape of our data and use correlation to quantify the strength of linear relationships between two variables.