Sahil Chaudhary sahil280114

sahil280114 / m.yaml
Created September 28, 2023 20:08
axolotl
base_model: openlm-research/open_llama_3b_v2
base_model_config: openlm-research/open_llama_3b_v2
model_type: LlamaForCausalLM
tokenizer_type: LlamaTokenizer
load_in_8bit: false
load_in_4bit: false
strict: false
push_dataset_to_hub:
datasets:
- path: data1.json
from human_eval.data import write_jsonl, read_problems
from transformers import AutoTokenizer, AutoModelForCausalLM
import torch
problems = read_problems()
tokenizer = AutoTokenizer.from_pretrained("mistralai/Mixtral-8x7B-v0.1", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    "merged_thing",
    torch_dtype=torch.bfloat16,