kastnerkyle’s gists

kastnerkyle / rover_simple_python.py

Last active July 14, 2024 19:58

ROVER system combination algorithm

	# -- coding: utf-8 --
	from __future__ import print_function
	from __future__ import unicode_literals

	# author: Kyle Kastner

	# References:
	# needleman wunsch (could use other alignment algorithms instead)
	# https://colab.research.google.com/github/zaneveld/full_spectrum_bioinformatics/blob/master/content/08_phylogenetic_trees/needleman_wunsch_alignment.ipynb

kastnerkyle / flow_matching.py

Created June 19, 2024 18:55 — forked from francois-rozet/flow_matching.py

Flow Matching in 100 LOC

	#!/usr/bin/env python

	import math
	import matplotlib.pyplot as plt
	import torch
	import torch.nn as nn

	from sklearn.datasets import make_moons
	from torch import Tensor
	from tqdm import tqdm

kastnerkyle / semantic_search_with_gzip.py

Created July 15, 2023 14:25 — forked from kyo-takano/lexical_search_with_gzip.py

Semantic Search with gzip （gzipによるセマンティック検索）

	import gzip

	def gzip_search(query: str, candidate_chunks: list[str], top_k: int=1):
	"""
	文字列ベースで類似したテキストチャンクを推定するアルゴリズム.
	`query`, `chunk`, および`query + " " + chunk`をそれぞれgzipで圧縮し、編集距離のようなものをベースに評価する.

	Parameters:
	query (str): 検索クエリとして使用する文字列.
	top_k (int, optional): 返される類似チャンクの上位k個を指定する (default: 1).

kastnerkyle / string_match.py

Last active December 8, 2022 04:41

String matching using fft / fht (hartley transform). For learning, not optimal speed

	# Author: Kyle Kastner
	# BSD 3-Clause

	# Thanks to jakevdp for the nice blog post on FFT
	# https://jakevdp.github.io/blog/2013/08/28/understanding-the-fft/
	# Summary
	# http://www.arazim-project.com/sites/default/files/public/lesson_sums/1fft.pdf
	# Details on hartley and many xforms
	# https://caxapa.ru/thumbs/455725/algorithms.pdf
	# pg 332 http://sep.stanford.edu/data/media/public/oldreports/sep38/38_29.pdf

kastnerkyle / find_noise.py

Created September 12, 2022 00:00 — forked from trygvebw/find_noise.py

A "reverse" version of the k_euler sampler for Stable Diffusion, which finds the noise that will reconstruct the supplied image

	import torch
	import k_diffusion as K

	from PIL import Image
	from torch import autocast
	from einops import rearrange, repeat

	def pil_img_to_latent(model, img, batch_size=1, device='cuda', half=True):
	init_image = pil_img_to_torch(img, half=half).to(device)
	init_image = repeat(init_image, '1 ... -> b ...', b=batch_size)

kastnerkyle / adamw_finetune.py

Created August 1, 2022 16:41 — forked from crowsonkb/adamw_finetune.py

	import math
	import torch
	from torch import optim


	class AdamWFinetune(optim.Optimizer):
	r"""Implements AdamW algorithm with optional weight decay toward the starting value, to
	prevent overfitting to the new dataset during fine-tuning.

	The original Adam algorithm was proposed in `Adam: A Method for Stochastic Optimization`_.

kastnerkyle / typical_top_k_top_p.py

Last active June 30, 2022 15:11

My own take on a plug and play setup for typical sampling from Meister et. al. "Typical Decoding for Natural Language Generation". Added top k by typicality for now

	def typical_top_k_filtering(logits, top_k=0, top_p=0.0, temperature=1.0, min_tokens_to_keep=1, filter_value=-1E12):
	""" Filter a distribution of logits using typicality, with optional top-k and/or nucleus (top-p) filtering
	Meister et. al. https://arxiv.org/abs/2202.00666
	Args:
	logits: logits distribution shape (..., vocabulary size)
	top_k >0: keep top k tokens with highest prob (top-k filtering).
	top_p >0.0: keep the top p tokens which compose cumulative probability mass top_p (nucleus filtering).
	min_tokens_to_keep >=1: always keep at least this many tokens through the top_p / nucleus sampling
	"""
	# https://arxiv.org/abs/2202.00666

kastnerkyle / bwv101.7.C-minor-transposed.json

Last active February 16, 2021 03:45

Quick and dirty example of piano roll plotting from "music JSON"

	{
	"seconds_per_quarter": 0.5,
	"parts_names": [
	"Soprano",
	"Alto",
	"Tenor",
	"Bass"
	],
	"parts_cumulative_times": [
	[

kastnerkyle / batch_ar_example.py

Last active October 22, 2020 11:05

	import numpy as np

	# make a minibatch of time, batch, features
	# time length 7
	# batch size 2
	# feature dimension 4:
	# 1:4, 10:14, 20:24, 30:34, etc for first minibatch element
	# 5:8, 15:18, etc second minibatch el
	n_features = 4
	n_timesteps = 7

kastnerkyle / Kiritan singing voice synthesis demo.ipynb

Created May 3, 2020 06:09 — forked from r9y9/Kiritan singing voice synthesis demo.ipynb

Neural_network_based_singing_voice_synthesis_demo_using_kiritan_singing_database_(Japanese)

Sorry, something went wrong. Reload?

Sorry, we cannot display this file.

Sorry, this file is invalid so it cannot be displayed.

Kyle Kastner kastnerkyle