Skip to content

Instantly share code, notes, and snippets.

View misho-kr's full-sized avatar

Misho Krastev misho-kr

  • San Jose, California
View GitHub Profile
@misho-kr
misho-kr / Introduction to Importing Data in Python.md
Last active February 2, 2020 05:25
Summary of "Introduction to Importing Data in Python" course on Datacamp

As a data scientist, you will need to clean data, wrangle and munge it, visualize it, build predictive models, and interpret these models. Before you can do so, however, you will need to know how to get data into Python. In this course, you'll learn the many ways to import data into Python: from flat files such as .txt and .csv; from files native to other software such as Excel spreadsheets, Stata, SAS, and MATLAB files; and from relational databases such as SQLite and PostgreSQL.

Lead by Hugo Bowne-Anderson, Data Scientist at DataCamp

Introduction and flat files

In this chapter, you'll learn how to import data into Python from all types of flat files, which are a simple and prevalent form of data storage. You've previously learned how to use NumPy and pandas—you will learn how to use these packages to impor

@misho-kr
misho-kr / Python Data Science Toolbox (Part 2).md
Last active November 2, 2019 05:36
Summary of "Python Data Science Toolbox (Part 2)" course on Datacamp

In this second Python Data Science Toolbox course, you'll continue to build your Python data science skills. First, you'll learn about iterators, objects you have already encountered in the context of for loops. You'll then learn about list comprehensions, which are extremely handy tools for all data scientists working in Python. You'll end the course by working through a case study in which you'll apply all the techniques you learned in both parts of this course.

Using iterators in PythonLand

You'll learn all about iterators and iterables, which you have already worked with when writing for loops.

  • Iterators and iterables, iter() and next()
@misho-kr
misho-kr / Python Data Science Toolbox (Part 1).md
Last active October 27, 2019 09:26
Summary of "Python Data Science Toolbox (Part 1)" course on Datacamp

It's time to push forward and develop your Python chops even further. There are tons of fantastic functions in Python and its library ecosystem. However, as a data scientist, you'll constantly need to write your own functions to solve problems that are dictated by your data. You will learn the art of function writing in this first Python Data Science Toolbox course. You'll come out of this course being able to write your very own custom functions, complete with multiple parameters and multiple return values, along with default arguments and variable-length arguments. You'll gain insight into scoping in Python and be able to write lambda functions and handle errors in your function writing practice. And you'll wrap up each chapter by using your new skills to write functions that analyze Twitter DataFrames.

Lead by [Hugo Bowne-Anderson](https://www.datacamp.com/ins

@misho-kr
misho-kr / Intermediate Python for Data Science.md
Last active October 27, 2019 08:22
Summary of "Intermediate Python for Data Science" course on DataCamp

Intermediate Python for Data Science is crucial for any aspiring data science practitioner learning Python. Learn to visualize real data with Matplotlib's functions and get acquainted with data structures such as the dictionary and the pandas DataFrame. After covering key concepts such as boolean logic, control flow, and loops in Python, you'll be ready to blend together everything you've learned to solve a case study using hacker statistics.

Lead by Filip Schouwenaars, Data Science Instructor at DataCamp

Matplotlib

Data visualization is a key skill for aspiring data scientists. Matplotlib makes it easy to create meaningful and insightful plots. In this chapter, you’ll learn how to build various types of plots, and customize them to be more visually appealing and interpretable.

@misho-kr
misho-kr / Introduction to Python.md
Last active October 22, 2019 04:21
Summary of "Introduction to Python" course on DataCamp

Python is a general-purpose programming language that is becoming ever more popular for data science. Companies worldwide are using Python to harvest insights from their data and gain a competitive edge. Unlike other Python tutorials, this course focuses on Python specifically for data science. In our Introduction to Python course, you’ll learn about powerful ways to store and manipulate data, and helpful data science tools to begin conducting your own analyses. Start DataCamp’s online Python curriculum now.

Lead by Hugo Bowne-Anderson, Data Scientist at DataCamp

Python Basics

An introduction to the basic concepts of Python. Learn how to use Python interactively and by using a script. Create your first variables and acquaint yourself with Python's basic data types.

@misho-kr
misho-kr / DataCamp.md
Last active January 9, 2025 06:55
DataCamp Courses and Career Tracks

DataCamp Courses and Career Tracks

A Data Scientist combines statistical and machine learning techniques with Python programming to analyze and interpret complex data

Python | 100 Hours | 26 Courses

@misho-kr
misho-kr / Getting Started with Google Kubernetes Engine.md
Last active October 31, 2019 19:46
Summary of "Getting Started with Google Kubernetes Engine"

This one-week, accelerated online class equips students to containerize workloads in Docker containers, deploy them to Kubernetes clusters provided by Google Kubernetes Engine, and scale those workloads to handle increased traffic.

Introduction to Containers and Docker

Acquaint yourself with containers, Docker, and the Google Container Registry.

  • In this lab, you learn how to:
@misho-kr
misho-kr / Introduction to Linear Modeling in Python.md
Last active March 25, 2020 19:54
Introduction to Linear Modeling in Python

Lecturer

Jason Vestuto, Data Scientist, University of Texas at Austin

Exploring Linear Trends

We start the course with an initial exploration of linear relationships, including some motivating examples of how linear models are used, and demonstrations of data visualization methods from matplotlib. We then use descriptive statistics to quantify the shape of our data and use correlation to quantify the strength of linear relationships between two variables.

@misho-kr
misho-kr / Conda2.md
Last active November 19, 2018 06:26
Summary of "Conda for Building & Distributing Packages" course at DataCamp.Com

Summary

In the Conda Essentials course you learned how use the Conda package manager to create and share reproducible environments for data science development.

Anaconda Projects

In this chapter you'll create an Anaconda Project, which is a data science asset that specifies package installs, file downloads, and executable commands. Anaconda projects can be used to run Jupyter notebooks, Bokeh server apps, REST APIs, and command line tools on Windows, Mac OSX, and Linux platforms making deployment easy.

@misho-kr
misho-kr / Conda.md
Last active July 22, 2019 09:39
Summary of "Conda Essentials" course at DataCamp.Com

Summary

Conda packages are files containing a bundle of resources: usually libraries and executables, but not always. In principle, Conda packages can include data, images, notebooks, or other assets.

One of the powerful aspects of conda, both the tool and the package format, is that dependencies are taken care of. That is, when you install any Conda package, any other packages needed get installed automatically.

A Conda package, then, is a file containing all files needed to make a given program execute correctly on a given system.