Good for visualizing missing data and exploring patterns in the missing values of missing data.
A 1-line tool for data exploration, statistical analysis and EDA. I was working on a similar library, but this is clearely of excellent quality.
import pandas as pd
from pandas_profiling import ProfileReport
data = pd.read_csv('https://raw.githubusercontent.com/mkleinbort/resource-datasets/master/titanic/titanic.csv')
ProfileReport(data)
Big data with Python, scales across clusters and uses parallelism and memory management to allow for amazing things. Extends numpy arrays and pandas dataframes (and other things). Used it to clean ~300,000 dataframes (190GB) in ~1.5h running on my laptop.
...