# This is a modified version of TRL's `SFTTrainer` example (https://github.com/huggingface/trl/blob/main/examples/scripts/sft_trainer.py),
# adapted to run with DeepSpeed ZeRO-3 and Mistral-7B-v0.1. The settings below were run on 1 node of 8 x A100 (80GB) GPUs.
#
# Usage:
# - Install the latest transformers & accelerate versions: `pip install -U transformers accelerate`
# - Install deepspeed: `pip install deepspeed==0.9.5`
# - Install TRL from main: `pip install git+https://github.com/huggingface/trl.git`
# - Clone the repo: `git clone https://github.com/huggingface/trl.git`
# - Copy this Gist into trl/examples/scripts
# - Run from the root of the trl repo with: `accelerate launch --config_file=examples/accelerate_configs/deepspeed_zero3.yaml --gradient_accumulation_steps 8 examples/scripts/sft_trainer.py`
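#
# For orientation, here is a minimal sketch of the kind of script being launched.
# The dataset and hyperparameters are illustrative assumptions using the older TRL
# API (string model id, `dataset_text_field`, `max_seq_length`), not the exact
# contents of the example script:

from datasets import load_dataset
from transformers import TrainingArguments
from trl import SFTTrainer

dataset = load_dataset("timdettmers/openassistant-guanaco", split="train")  # assumed dataset

training_args = TrainingArguments(
    output_dir="sft-mistral-7b",
    per_device_train_batch_size=1,
    gradient_accumulation_steps=8,  # matches the launch flag above
    bf16=True,
    logging_steps=10,
)

trainer = SFTTrainer(
    model="mistralai/Mistral-7B-v0.1",  # TRL loads the model from this id
    args=training_args,
    train_dataset=dataset,
    dataset_text_field="text",          # dataset column containing the raw text
    max_seq_length=512,
)
trainer.train()
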
import argparse
import inspect
from pathlib import Path
from typing import List

import torch

from mistral.cache import RotatingBufferCache
from mistral.model import Transformer
from mistral.tokenizer import Tokenizer
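# A minimal sketch of how these imports fit together, assuming the mistral-src
# reference layout (a local checkpoint folder holding tokenizer.model and the model
# weights); the path and batch size are illustrative assumptions, and
# RotatingBufferCache is the sliding-window KV cache used internally during decoding:

def load_model(model_path: Path):
    tokenizer = Tokenizer(str(model_path / "tokenizer.model"))      # SentencePiece tokenizer file
    model = Transformer.from_folder(model_path, max_batch_size=1)   # load weights from the folder
    return model, tokenizer

model, tokenizer = load_model(Path("mistral-7B-v0.1"))  # assumed checkpoint folder name
prompt_tokens: List[int] = tokenizer.encode("Hello, world!")  # encode is assumed to prepend BOS
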
from typing import Optional, Any

import torch
from transformers.utils import is_accelerate_available, is_bitsandbytes_available
from transformers import (
    AutoTokenizer,
    AutoModelForCausalLM,
    GenerationConfig,
    pipeline,
)
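# A minimal sketch of how these imports are typically combined; the model id and
# generation settings are illustrative assumptions, and the availability checks
# simply gate the optional accelerate / bitsandbytes features:

model_id = "mistralai/Mistral-7B-v0.1"  # assumed model id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,
    device_map="auto" if is_accelerate_available() else None,               # sharded placement needs accelerate
    load_in_4bit=is_accelerate_available() and is_bitsandbytes_available(), # 4-bit quantization needs bitsandbytes
)
model.generation_config = GenerationConfig(do_sample=True, temperature=0.7, top_p=0.9)

generator = pipeline("text-generation", model=model, tokenizer=tokenizer)
print(generator("The capital of France is", max_new_tokens=32)[0]["generated_text"])
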
import cv2
import time

CONFIDENCE_THRESHOLD = 0.2  # minimum detection confidence to keep a box
NMS_THRESHOLD = 0.4         # IoU threshold for non-maximum suppression
COLORS = [(0, 255, 255), (255, 255, 0), (0, 255, 0), (255, 0, 0)]  # BGR box colors, cycled per class

# Load the class labels, one name per line.
with open("classes.txt", "r") as f:
    class_names = [cname.strip() for cname in f.readlines()]
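# A minimal sketch of the detection loop these constants set up, assuming YOLOv4-tiny
# weights/config files next to classes.txt (the file names are assumptions) and
# OpenCV's DNN DetectionModel API:

net = cv2.dnn.readNet("yolov4-tiny.weights", "yolov4-tiny.cfg")  # assumed file names
model = cv2.dnn_DetectionModel(net)
model.setInputParams(size=(416, 416), scale=1 / 255.0, swapRB=True)

vc = cv2.VideoCapture(0)  # default webcam
while cv2.waitKey(1) < 1:
    grabbed, frame = vc.read()
    if not grabbed:
        break
    start = time.time()
    classes, scores, boxes = model.detect(frame, CONFIDENCE_THRESHOLD, NMS_THRESHOLD)
    fps = 1.0 / (time.time() - start)
    for (classid, score, box) in zip(classes, scores, boxes):
        color = COLORS[int(classid) % len(COLORS)]
        label = f"{class_names[int(classid)]}: {score:.2f}"
        cv2.rectangle(frame, box, color, 2)
        cv2.putText(frame, label, (box[0], box[1] - 10), cv2.FONT_HERSHEY_SIMPLEX, 0.5, color, 2)
    cv2.putText(frame, f"FPS: {fps:.1f}", (10, 25), cv2.FONT_HERSHEY_SIMPLEX, 0.7, (0, 0, 255), 2)
    cv2.imshow("detections", frame)
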
model.zero_grad()                                   # Reset gradients tensors
for i, (inputs, labels) in enumerate(training_set):
    predictions = model(inputs)                     # Forward pass
    loss = loss_function(predictions, labels)       # Compute loss function
    loss = loss / accumulation_steps                # Normalize our loss (if averaged)
    loss.backward()                                 # Backward pass
    if (i+1) % accumulation_steps == 0:             # Wait for several backward steps
        optimizer.step()                            # Now we can do an optimizer step
        model.zero_grad()                           # Reset gradients tensors
        if (i+1) % evaluation_steps == 0:           # Evaluate the model when we...
            evaluate_model()                        # ...have no gradients accumulated
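# Note: dividing the loss by accumulation_steps keeps the accumulated gradient on the
# same scale as one large batch when loss_function averages over the batch; with a
# per-step batch of N, the effective batch size becomes accumulation_steps * N. This
# is the same effect the `--gradient_accumulation_steps 8` launch flag requests above.
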
import hashlib
from pathlib import Path

import click
import requests
from tqdm import tqdm

CONTEXT_SETTINGS = dict(help_option_names=['-h', '--help'])
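# A minimal sketch of the kind of CLI these imports suggest: a click command that
# streams a download with a tqdm progress bar and verifies a SHA-256 checksum.
# The command name, option names, and chunk size are illustrative assumptions:

@click.command(context_settings=CONTEXT_SETTINGS)
@click.argument("url")
@click.option("--output", "-o", type=click.Path(), default="download.bin", help="Destination file.")
@click.option("--sha256", "expected_sha256", default=None, help="Expected SHA-256 hex digest.")
def download(url, output, expected_sha256):
    """Download URL to OUTPUT, showing progress and optionally checking its hash."""
    response = requests.get(url, stream=True)
    response.raise_for_status()
    total = int(response.headers.get("content-length", 0))
    digest = hashlib.sha256()
    with Path(output).open("wb") as f, tqdm(total=total, unit="B", unit_scale=True) as bar:
        for chunk in response.iter_content(chunk_size=8192):
            f.write(chunk)
            digest.update(chunk)
            bar.update(len(chunk))
    if expected_sha256 and digest.hexdigest() != expected_sha256:
        raise click.ClickException("Checksum mismatch")

if __name__ == "__main__":
    download()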