A ZSH theme optimized for people who use:
- Solarized
- Git
- Unicode-compatible fonts and terminals (I use iTerm2 + Menlo)
For Mac users, I highly recommend iTerm2 with the Solarized Dark color preset.
import torch
import torch.nn.functional as F

def top_k_top_p_filtering(logits, top_k=0, top_p=0.0, filter_value=-float('Inf')):
    """Filter a distribution of logits using top-k and/or nucleus (top-p) filtering.

    Args:
        logits: logits distribution shape (vocabulary size)
        top_k > 0: keep only the top k tokens with highest probability (top-k filtering).
        top_p > 0.0: keep the top tokens with cumulative probability >= top_p (nucleus filtering).
            Nucleus filtering is described in Holtzman et al. (http://arxiv.org/abs/1904.09751)
    """
    assert logits.dim() == 1  # batch size 1 for now - could be updated for more but the code would be less clear
    top_k = min(top_k, logits.size(-1))  # Safety check
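    # --- The snippet above ends at the safety check. The lines below are a hedged
    # sketch of the standard top-k and nucleus masking steps this docstring describes,
    # written from the usual PyTorch formulation; they are not taken verbatim from the
    # original source.
    if top_k > 0:
        # Remove all tokens whose logit is below the k-th largest logit
        indices_to_remove = logits < torch.topk(logits, top_k)[0][..., -1, None]
        logits[indices_to_remove] = filter_value

    if top_p > 0.0:
        sorted_logits, sorted_indices = torch.sort(logits, descending=True)
        cumulative_probs = torch.cumsum(F.softmax(sorted_logits, dim=-1), dim=-1)

        # Remove tokens with cumulative probability above the threshold
        sorted_indices_to_remove = cumulative_probs > top_p
        # Shift indices to the right so the first token above the threshold is kept
        sorted_indices_to_remove[..., 1:] = sorted_indices_to_remove[..., :-1].clone()
        sorted_indices_to_remove[..., 0] = 0

        indices_to_remove = sorted_indices[sorted_indices_to_remove]
        logits[indices_to_remove] = filter_value
    return logits

# Illustrative usage (values and the model call are assumptions, not from the original):
# logits = model(input_ids)[0][0, -1, :]          # last-step logits for a batch of 1
# filtered = top_k_top_p_filtering(logits, top_k=50, top_p=0.9)
# next_token = torch.multinomial(F.softmax(filtered, dim=-1), num_samples=1)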
import sys
from collections import OrderedDict

PY2 = sys.version_info[0] == 2

# Attribute names that torch.nn.Module manages internally via __setattr__
# rather than treating as ordinary instance attributes.
_internal_attrs = {'_backend', '_parameters', '_buffers', '_backward_hooks',
                   '_forward_hooks', '_forward_pre_hooks', '_modules'}


class Scope(object):
    def __init__(self):
        self._modules = OrderedDict()
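A minimal sketch of how a Scope instance can be used; this usage is an assumption for illustration, not taken from the original source. It only shows that Scope exposes the same `_modules` registry that nn.Module keeps for its child modules:

import torch.nn as nn

scope = Scope()
scope._modules['embed'] = nn.Embedding(100, 16)  # register child modules by name
scope._modules['proj'] = nn.Linear(16, 100)
for name, module in scope._modules.items():
    print(name, type(module).__name__)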
import torch
from torch.autograd import Variable  # Variable is deprecated since PyTorch 0.4; plain tensors now suffice


def train_fn(model, optimizer, criterion, batch):
    x, y, lengths = batch
    x = Variable(x.cuda())
    y = Variable(y.cuda(), requires_grad=False)
    # Mask shaped like x (seq_len, batch, features): initialized to 1 everywhere,
    # then the first `l` timesteps of each sequence k are zeroed below.
    mask = Variable(torch.ByteTensor(x.size()).fill_(1).cuda(),
                    requires_grad=False)
    for k, l in enumerate(lengths):
        mask[:l, k, :] = 0
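    # --- The original snippet stops after building the mask. The lines below are a
    # hedged, illustrative sketch of how a training step like this typically continues
    # (forward pass, loss, backward pass, optimizer update). The model/criterion call
    # signatures and any use of `mask` inside the loss are assumptions, not taken from
    # the original source.
    optimizer.zero_grad()
    output = model(x)            # assumed call signature
    loss = criterion(output, y)  # the mask would typically be applied here or inside the criterion
    loss.backward()
    optimizer.step()
    return loss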