Bootstrap knowledge of LLMs as quickly as possible, with a bias/focus toward GPT.
Avoid being a link dump; aim to provide only valuable, well-tuned information.
Cover neural-network fundamentals (links first) before starting with transformers.
| { | |
| "name": "Apple Silicon", | |
| "load_params": { | |
| "n_ctx": 2048, | |
| "n_batch": 512, | |
| "rope_freq_base": 10000, | |
| "rope_freq_scale": 1, | |
| "n_gpu_layers": 1, | |
| "use_mlock": false, | |
| "main_gpu": 0, |
| source ~/miniconda3/bin/activate allen | |
| LANG=en | |
| TASK=qa_en_small | |
| for SPLIT in train valid | |
| do | |
| python -m examples.roberta.multiprocessing_bpe_encoder \ | |
| --encoder-json encoder.json \ | |
| --vocab-bpe vocab.bpe \ | |
| --inputs "$TASK/$SPLIT.$LANG" \ |
| fairseq-train qa_en_small-bin \ | |
| --log-interval=10 \ | |
| --log-format=json \ | |
| --tensorboard-logdir=/users/tom/ed/sp/pretrain/tests/fairseq/bart_en_small/logs \ | |
| --seed=1 \ | |
| --cpu \ | |
| --min-loss-scale=0.0001 \ | |
| --model-parallel-size=1 \ | |
| --criterion=cross_entropy \ |
##+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
## Created by: Hang Zhang, Rutgers University, Email: [email protected]
## Modified by Thomas Wolf, HuggingFace Inc., Email: [email protected]
## Copyright (c) 2017-2018
##
## This source code is licensed under the MIT-style license found in the
## LICENSE file in the root directory of this source tree
##+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
"""Encoding Data Parallel"""
# Standard library
import logging
import multiprocessing

# Third-party (gensim): corpus reader and vectorization models.
# NOTE(review): these gensim imports do not obviously match the module
# docstring above — confirm this prologue belongs to a single file.
from gensim.corpora.wikicorpus import WikiCorpus
from gensim.models import TfidfModel
from gensim.models.word2vec import Word2Vec

# logging is important to get the state of the functions:
# configure the root logger once at import time with a timestamped format
# and INFO verbosity so downstream module loggers inherit it.
logging.basicConfig(format='%(asctime)s: %(levelname)s: %(message)s')
logging.root.setLevel(level=logging.INFO)