Skip to content

Instantly share code, notes, and snippets.

@costrouc
costrouc / gist:5fe7338c96d534b0b8ac
Created July 9, 2015 13:54
Search large file with buffer
def fnd(fname, s, start=0):
with open(fname, 'rb') as f:
fsize = os.path.getsize(fname)
bsize = 4096
buffer = None
if start > 0:
f.seek(start)
overlap = len(s) - 1
while True:
if (f.tell() >= overlap and f.tell() < fsize):
# -*- coding: utf-8 -*-
# elements.py
# Copyright (c) 2005-2015, Christoph Gohlke
# All rights reserved.
#
# Redistribution and use in source and binary forms, with or without
# modification, are permitted provided that the following conditions are met:
#
# * Redistributions of source code must retain the above copyright
var request = function(obj, success, failure) {
success = success || function(){};
failure = failure || function(){};
obj.method = obj.method || 'GET';
obj.headers = obj.headers || {};
obj.params = obj.params || {};
var req = new XMLHttpRequest();
@costrouc
costrouc / customjson.py
Created October 13, 2016 18:55
JSONDecode and JSONEncode
import json
class NPEncoder(json.JSONEncoder):
def default(self, obj):
if isinstance(obj, np.ndarray):
return obj.tolist()
return json.JSONEncoder.default(self, obj)
class NPDecoder(json.JSONDecoder):
def __init__(self, **kwargs):
def read_qe_output(filename):
steps = {
'positions': [],
'lattice': []
}
with open(filename, 'r') as f:
for line in f:
if 'lattice parameter (alat) =' in line:
alat = float(line.split()[-2])
@costrouc
costrouc / client.py
Last active April 27, 2019 10:00
Asyncio Majordomo Protocol (18/MDP)
import asyncio
import logging
import zmq
import zmq.asyncio
from scheduler import SchedulerCode
class Client:
@costrouc
costrouc / index.html
Created January 22, 2018 14:59
Questions Simple Python Server
<!doctype html>
<html lang="en">
<head>
<meta charset="utf-8">
<title>Question Example</title>
<meta name="description" content="Simple Questions Asked">
<meta name="author" content="Chris Ostrouchov">
</head>
<body>
<form id="questions">
@costrouc
costrouc / README.md
Last active May 17, 2018 15:00
Workshop: Contributing to Open Source Projects

This workshop is designed to help beginners learn AND practice the basic skills necessary to contribute to open source projects. If you are already familiar with the steps for a pull request we encourage you to attend so that we have more people to help others.

A computer of any operating system is required.

This workshop will follow the guide used at pycon 2018 [1].

Steps that we will cover:

@costrouc
costrouc / README.md
Last active May 17, 2018 15:41
Creating your First Python Package with Testing, Documentation, and Continuous Integration

Packaging in python is a complex topic. In this talk I will be doing an opinionated live demo of developing a pure python package. We will most likely not be able cover everything so I will defer to the audience for what they would like covered. I will also discuss the future of packaging that I heard from the PyPi developers at pycon.

A git repository [1] is provided with all the steps along with documentation [2].

Python packages used:

@costrouc
costrouc / data-engineer-exam-notes.org
Created November 15, 2018 16:45
Google Data Engineer Notes

Practice Exam Attempt (3) Carefully taken with notes

Got an 16/20 on the latest practice exam. Missed questions 5, 7, 12, 13.

Summary of why missed each question:

5
always favor simpler method (cbt (cloud big query tool) vs hbase cli) … I feel uneasy with this answer
7
prefer simpler method. pub/sub more than capable. kafka is overkill. this question showed my lack of knowledge of pub/sub capabilities