Skip to content

Instantly share code, notes, and snippets.

View misho-kr's full-sized avatar

Misho Krastev misho-kr

  • San Jose, California
View GitHub Profile
@misho-kr
misho-kr / GitHub Concepts.md
Last active January 22, 2023 07:14
Summary of "GitHub Concepts" from Datacamp.Org

Learn How to Use GitHub and explore Key GitHub Concepts.

Building on the topics covered in Introduction to Version Control with Git, this conceptual course enables you to navigate the user interface of GitHub effectively. Create public and private repositories, create and modify files, branches, and issues, assign tasks, tag users, review pull requests, and merge branches. Clone and fork repositories and generate private access tokens (PAT).

By Jasmin Ludolf, Data Science Content Developer, DataCamp

1. Introduction to GitHub

@misho-kr
misho-kr / Introduction to version control with Git.md
Last active January 3, 2023 09:21
Summary of "Introduction to version control with Git" from Datacamp.Org

Discover the importance of version control when working on data science projects and explore how to use Git to track files, compare differences, modify and save files, undo changes, and allow collaborative development through the use of branches. Introduction to the structure of a repository, create new repositories and clone existing ones, and show how Git stores data. Skills to handle conflicting files.

By George Boorman, Analytics and Data Science Curriculum Manager, DataCamp

Ressources: Git Cheatsheet

1. Introduction to Git

@misho-kr
misho-kr / Introduction to NumPy.md
Last active February 20, 2023 06:48
Summary of "Introduction to NumPy" from Datacamp.Org

NumPy is an essential Python library. TensorFlow and scikit-learn use NumPy arrays as inputs, and pandas and Matplotlib are built on top of NumPy. In this Introduction to NumPy course, you'll become a master wrangler of NumPy's core object: arrays! You'll discover why NumPy is so efficient and use broadcasting and vectorization to make your NumPy code even faster. By the end of the course, you'll be using 3D arrays to alter a Claude Monet painting.

By Izzy Weber, Curriculum Developer @ DataCamp

1. Understanding NumPy Arrays

Create and change array shapes to suit your needs. Discover NumPy's many data types and how they contribute to speedy array operations.

@misho-kr
misho-kr / Interactive Data Visualization with Bokeh.md
Last active July 22, 2021 06:21
Summary of "Interactive Data Visualization with Bokeh" course on Datacamp

Lead by Team Anaconda, Data Science Training

Bokeh is an interactive data visualization library for Python—and other languages—that targets modern web browsers for presentation. It can create versatile, data-driven graphics and connect the full power of the entire Python data science stack to create rich, interactive visualizations.

  • Glyphs
    • Visual shapes
  • circles, squares, triangles
@misho-kr
misho-kr / Object-Oriented Programming in Python.md
Last active September 18, 2023 04:17
Summary of "Object-Oriented Programming in Python" from Datacamp.Org

Object-oriented programming (OOP) is a widely used programming paradigm that reduces development times—making it easier to read, reuse, and maintain your code. OOP shifts the focus from thinking about code as a sequence of actions to looking at your program as a collection of objects that interact with each other. In this course, you’ll learn how to create classes, which act as the blueprints for every object in Python. You’ll then leverage principles called inheritance and polymorphism to reuse and optimize code.

By Alex Yarosh, Curriculum Developer @ Cockroach Labs

OOP Fundamentals

Learn what object-oriented programming (OOP) is, how it differs from procedural-programming, and how it can be applied. Define your own classes, and create methods, attributes, and constructors.

@misho-kr
misho-kr / Introduction to Data Visualization in Python.md
Last active April 18, 2021 09:23
Summary of "Introduction to Data Visualization in Python" course on Datacamp

Lead by Team Anaconda, Data Science Training

This course provides a stronger foundation in data visualization in Python, broader coverage of the Matplotlib library and an overview of seaborn, a package for statistical graphics. Topics covered include customizing graphics, plotting two-dimensional arrays (like pseudocolor plots, contour plots, and images), statistical graphics (like visualizing distributions and regressions), and working with time series and image data.

Customizing plots

Review of basic plotting with Matplotlib, customizing plots using Matplotlib. Overlaying plots, making subplots, controlling axes, adding legends and annotations, and using different plot styles.

@misho-kr
misho-kr / Introduction to Relational Databases in SQL.md
Created March 1, 2021 04:16
Summary of "Summary of "Merging DataFrames with pandas" course on Datacamp

Lead by Timo Grossenbacher, Project Lead Automated Journalism at Tamedia

You can model different phenomena in your data, as well as the relationships between them. This gives your data structure and consistency, which results in better data quality. In this course, you'll experience this firsthand by working with a real-life dataset that was used to investigate questionable university affiliations. You'll learn how to create tables and specify their relationships, as well as how to enforce data integrity. You'll also discover other unique features of database systems, such as constraints.

Your first database

Create your very first database with a set of simple SQL commands. Next, migrate data from existing flat tables into that database. Learn how meta-information about a datab

@misho-kr
misho-kr / Introduction to SQL.md
Created March 1, 2021 04:11
Summary of "Introduction to SQL" course on Datacamp

This course teaches syntax in SQL shared by many types of databases, such as PostgreSQL, MySQL, SQL Server, and Oracle. This course teaches you everything you need to know to begin working with databases today!

Lead by Nick Carchedi, Product Manager at DataCamp

Selecting columns

Brief introduction to working with relational databases. Learn about their structure, how to talk about them using database lingo, and how to begin an analysis using simple SQL commands to select and summarize columns from database tables.

@misho-kr
misho-kr / Analyzing Police Activity with pandas.md
Last active December 24, 2020 10:01
Summary of "Analyzing Police Activity with pandas" course on Datacamp (https://gist.github.com/misho-kr/873ddcc2fc89f1c96414de9e0a58e0fe)

You will explore the Stanford Open Policing Project dataset and analyze the impact of gender on police behavior. Practice cleaning messy data, creating visualizations, combining and reshaping datasets, and manipulating time series data. Analyzing Police Activity with pandas will give you valuable experience analyzing a dataset.

Lead by Kevin Markham Founder of Data School

Preparing the data for analysis

Examine and clean the dataset, to make working with it a more efficient process. Fix data types, handling missing values, and dropping columns and rows while learning about the Stanford Open Policing Project dataset.

@misho-kr
misho-kr / Statistical Thinking in Python (Part 1).md
Last active December 10, 2020 05:53
Summary of "Statistical Thinking in Python (Part 1)" from Datacamp.Org

After acquiring data and getting them into a form you can work with, you want to make clear, succinct conclusions from them. This crucial last step of a data analysis pipeline hinges on the principles of statistical inference. You will start building the foundation to think statistically, speak the language of your data, and understand what your data is telling you. Get up-to-speed and begin thinking statistically.

By Justin Bois Lecturer at the California Institute of Technology

Graphical exploratory data analysis

Before diving into sophisticated statistical inference techniques, you should first explore your data by plotting them and computing simple summary statistics. This process, called exploratory data analysis, is a crucial first step in statistical analysis of data.