Andrew Kornilov frutik

brew install openblas
OPENBLAS="$(brew --prefix openblas)" pip install numpy scipy
pip3 install --no-binary :all: --no-use-pep517 scipy
my preferences
small
tech at heart
flat structure
no HiPPO (highest paid person's opinion)

Kubernetes for Dummies (aka 101)

macOS

Install/Setup

brew install kubernetes-cli
pip3 install awscli --upgrade
aws configure (prompts for your access key ID, secret access key, default region, and output format)
https://towardsdatascience.com/beyond-a-b-testing-multi-armed-bandit-experiments-1493f709f804
Beyond A/B Testing: Multi-armed Bandit Experiments
A study of Google Analytics’ stochastic k-armed bandit test with Thompson sampling and Monte Carlo simulation
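A minimal sketch of Thompson sampling for a Bernoulli k-armed bandit, evaluated by simulation. This is not taken from the linked article: the function name `thompson_sampling`, the uniform Beta(1, 1) prior, and the example conversion rates are assumptions chosen for illustration.

```python
import random


def thompson_sampling(true_rates, rounds, seed=0):
    """Simulate a Bernoulli bandit played with Thompson sampling.

    true_rates: hypothetical per-arm conversion rates (unknown to the policy).
    Returns per-arm success and failure counts after `rounds` pulls.
    """
    rng = random.Random(seed)
    k = len(true_rates)
    successes = [0] * k
    failures = [0] * k
    for _ in range(rounds):
        # Draw one plausible rate per arm from its Beta posterior
        # (Beta(1, 1) prior, i.e. uniform, is an assumption).
        samples = [
            rng.betavariate(successes[i] + 1, failures[i] + 1) for i in range(k)
        ]
        # Play the arm whose sampled rate is highest.
        arm = max(range(k), key=samples.__getitem__)
        # Simulate the reward and update that arm's counts.
        if rng.random() < true_rates[arm]:
            successes[arm] += 1
        else:
            failures[arm] += 1
    return successes, failures
```

Calling `thompson_sampling([0.1, 0.9], 1000)` should send most of the traffic to the second arm, which is the behaviour that distinguishes a bandit test from a fixed 50/50 A/B split.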
brew install apache-drill
drill-embedded
ALTER SESSION SET `store.format` = 'json';
ALTER SESSION SET `store.json.all_text_mode` = true;
ALTER SESSION SET `planner.enable_decimal_data_type` = true;
SELECT * FROM dfs.`/Users/akornilov/work/prices` LIMIT 1;
SELECT AVG(CAST(price AS DECIMAL(28, 2))) AS avg_price FROM dfs.`/Users/akornilov/work/prices`;
https://stackoverflow.com/questions/787496/what-is-the-best-way-to-compute-trending-topics-or-tags/826509#826509
http://sayrohan.blogspot.com/2013/06/finding-trending-topics-and-trending.html
Finding Trending Topics and Trending Sentiments From Live Twitter Stream Using Storm (June 6, 2013)
frutik / System Design.md
Created May 16, 2020 15:23 — forked from vasanthk/System Design.md
System Design Cheatsheet


Picking the right architecture = Picking the right battles + Managing trade-offs

Basic Steps

  1. Clarify and agree on the scope of the system
  • Use cases (description of sequences of events that, taken together, lead to a system doing something useful)
    • Who is going to use it?
    • How are they going to use it?
https://www.revsys.com/tidbits/django-performance-simple-things/
https://devcenter.heroku.com/articles/python-concurrency-and-database-connections
https://serverfault.com/questions/635100/django-conn-max-age-persists-connections-but-doesnt-reuse-them-with-postgresq
https://github.com/snowplow/snowplow/wiki/setting-up-snowplow-google-cloud-storage-loader
https://github.com/snowplow/snowplow/wiki/Snowplow-Google-Cloud-Storage-Loader
https://raw.githubusercontent.com/snowplow/snowplow/master/2-collectors/scala-stream-collector/examples/config.hocon.sample
https://medium.com/tech-at-popxo/recommendation-engine-at-popxo-using-snowplow-and-prediction-io-with-universal-recommender-fa079abe3f8a
http://mahout.apache.org/users/algorithms/intro-cooccurrence-spark.html
https://developer.ibm.com/dwblog/2017/mahout-spark-correlated-cross-occurences/
https://stackoverflow.com/questions/29760935/how-to-get-vector-for-a-sentence-from-the-word2vec-of-tokens-in-sentence
https://stackoverflow.com/questions/29760935/how-to-get-vector-for-a-sentence-from-the-word2vec-of-tokens-in-sentence/36299641
https://radimrehurek.com/gensim/models/word2vec.html
https://github.com/RaRe-Technologies/gensim/blob/develop/docs/notebooks/doc2vec-lee.ipynb
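One common baseline for the sentence-vector question in the links above is to average the word vectors of the tokens in the sentence. The sketch below assumes the vectors are already available as a plain dict of lists; the `sentence_vector` name and the toy two-dimensional vectors are hypothetical, and a real word2vec model would supply the lookup instead.

```python
def sentence_vector(tokens, word_vectors):
    """Average the vectors of the tokens that exist in `word_vectors`.

    word_vectors: mapping token -> list of floats (a stand-in for a trained
    word2vec lookup, which is an assumption here).
    Returns None when no token is in the vocabulary.
    """
    known = [word_vectors[t] for t in tokens if t in word_vectors]
    if not known:
        return None
    dim = len(known[0])
    # Component-wise mean over all known token vectors.
    return [sum(vec[i] for vec in known) / len(known) for i in range(dim)]


# Toy vocabulary for illustration only.
toy_vectors = {"cat": [1.0, 0.0], "dog": [0.0, 1.0]}
```

Out-of-vocabulary tokens are skipped rather than treated as zero vectors, so they do not pull the average toward the origin. Averaging ignores word order, which is the limitation doc2vec-style models are meant to address.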