This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
from transformers import AutoTokenizer | |
import numpy as np | |
from tqdm import tqdm | |
def analyze_token_sizes(dataset, tokenizer): | |
""" | |
Analyze token sizes for all text fields in the dataset | |
""" | |
# Initialize lists to store token counts | |
token_counts = [] |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
# Copyright 2022 EleutherAI and The HuggingFace Inc. team. All rights reserved. | |
# | |
# Licensed under the Apache License, Version 2.0 (the "License"); | |
# you may not use this file except in compliance with the License. | |
# You may obtain a copy of the License at | |
# | |
# http://www.apache.org/licenses/LICENSE-2.0 | |
# | |
# Unless required by applicable law or agreed to in writing, software | |
# distributed under the License is distributed on an "AS IS" BASIS, |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
personal |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
base_model: meta-llama/Meta-Llama-3-8B | |
model_type: AutoModelForCausalLM | |
tokenizer_type: AutoTokenizer | |
load_in_8bit: false | |
load_in_4bit: true | |
strict: false | |
datasets: | |
- path: aaditya/alpaca_subset_1 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
#!/bin/bash | |
#Inspired from mlabonne autogguf work with modifications | |
cat << "EOF" | |
$$$$$$\ $$\ $$\ $$$$$$$$\ $$$$$$\ $$$$$$\ $$$$$$\ $$\ $$\ $$$$$$$$\ | |
$$ __$$\ $$ | $$ |\__$$ __|$$ __$$\ $$ __$$\ $$ __$$\ $$ | $$ |$$ _____| | |
$$ / $$ |$$ | $$ | $$ | $$ / $$ | $$ / \__|$$ / \__|$$ | $$ |$$ | | |
$$$$$$$$ |$$ | $$ | $$ | $$ | $$ |$$$$$$\ $$ |$$$$\ $$ |$$$$\ $$ | $$ |$$$$$\ | |
$$ __$$ |$$ | $$ | $$ | $$ | $$ |\______|$$ |\_$$ |$$ |\_$$ |$$ | $$ |$$ __| | |
$$ | $$ |$$ | $$ | $$ | $$ | $$ | $$ | $$ |$$ | $$ |$$ | $$ |$$ | |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
base_model: google/gemma-7b | |
model_type: AutoModelForCausalLM | |
tokenizer_type: AutoTokenizer | |
load_in_8bit: false | |
load_in_4bit: true | |
strict: false | |
# huggingface repo | |
datasets: |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
{ | |
"Dataset": [ | |
"multimedqa", | |
"medmcqa", | |
"medqa_4options", | |
"mmlu_anatomy", | |
"mmlu_clinical_knowledge", | |
"mmlu_college_biology", | |
"mmlu_college_medicine", | |
"mmlu_medical_genetics", |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
from abc import ABCMeta, abstractmethod | |
from typing import List, Optional, Union, Dict | |
import tenacity | |
class Model(metaclass=ABCMeta): | |
""" | |
Abstract base class for a large language model(llm). |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sciatica Diagnosis and Treatment. | |
here is the conversation between doctor and patient comes to doctors opd,summerzie it clinically | |
Good morning, how can I help you today? | |
Hi, I've been experiencing some intense low back pain, and I was hoping you could give me some guidance on what might be causing it. | |
I'm sorry to hear that you're in pain. Let's start by discussing your symptoms. Can you describe the nature of the pain? Is it a dull ache, a sharp or shooting pain, or something else? |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
{ | |
"suggested_symptoms": "Decreased sensation in right foot, difficulty in walking due to swelling and pain, frequent urination, increased thirst, unexpected weight loss, fatigue, signs of infection around the right foot such as redness, warmth, pus discharge, malaise, severe cough with phlegm or blood, shortness of breath, weight loss, night sweats, history of insect bites or travel to malaria-endemic regions, periods of high fever alternating with periods of normal or low temperature (fever pattern associated with malaria), jaundice, pallor, enlarged spleen or liver, muscle aches, fatigue, nausea, and vomiting", | |
"suggested_diagnosis": "Uncontrolled Type 2 Diabetes Mellitus with complications of foot cellulitis and Malaria. The patient may also be at risk for a respiratory infection like Pneumonia due to severe coughing, which needs further investigation. In addition, given the existing diabetes, neuropathy and peripheral vascular disease should also be considered as part of the diabetes-related compl |
NewerOlder