Skip to content

Instantly share code, notes, and snippets.

View pmbaumgartner's full-sized avatar

Peter Baumgartner pmbaumgartner

View GitHub Profile
@pmbaumgartner
pmbaumgartner / performance_test.py
Last active November 10, 2022 17:24
Django/Postgres Data Load - Performance Comparison
from contextlib import closing
from io import StringIO
from time import time
import pandas as pd
from django.core.management.base import BaseCommand
from django.db import transaction
from faker import Faker
from core.models import Thing
@pmbaumgartner
pmbaumgartner / dockerpredict.sh
Last active September 26, 2019 17:18
Getting Bert Working!
#!/bin/sh
# use this to get predictions on a test.csv located in BERT_DATA_DIR
export OUTCOME={classification_task_name}
export NOTEBOOK=/notebooks # don't change me
docker run --runtime=nvidia -it --rm \
-v $(pwd):$NOTEBOOK/ \
-e "BERT_BASE_DIR=$NOTEBOOK/uncased_L-12_H-768_A-12" \
@pmbaumgartner
pmbaumgartner / applied nlp.ipynb
Last active February 7, 2019 00:29
⚡️Applied Natural Language Processing in Python ⚡️
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@pmbaumgartner
pmbaumgartner / word2vec-umap-plotly.py
Created April 28, 2018 16:34
Load Google News Word2Vec, Reduce Dimension with UMAP, and plot with plot.ly
import gensim.downloader as gensim_api
import umap
import requests
import pandas as pd
from numpy import log10
import plotly
import plotly.graph_objs as go
w2v_model = gensim_api.load('word2vec-google-news-300')
@pmbaumgartner
pmbaumgartner / Procfile
Last active April 5, 2018 21:53
Gensim Celery Issue Replication
redis: redis-server
celery_worker: celery -A tasks worker -l info
@pmbaumgartner
pmbaumgartner / Github Commits - Forecast & Components
Created December 28, 2017 15:34
Forecast & Composition of Github Commits using Prophet
{
"cells": [
{
"cell_type": "markdown",
"metadata": {},
"source": [
"# Github Commits - Forecast Components\n",
"\n",
"Data from [BigQuery Github Dataset](https://cloud.google.com/bigquery/public-data/github) using the following query:\n",
"\n",
@pmbaumgartner
pmbaumgartner / annoytutorial-text8.ipynb
Last active September 11, 2017 10:00
Annoy Tutorial - Text8 Changes
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@pmbaumgartner
pmbaumgartner / hangman.py
Created September 18, 2016 15:46
Hangman (based of TIY assignment)
import random
from string import ascii_uppercase
def get_difficulty():
difficulty_length = {'easy': (4, 6),
'medium': (6, 8),
'hard': (8, 20)
}