Hamel Husain hamelsmu
hamelsmu / torchserve.md
Last active August 24, 2024 04:07
Why `torchserve` is great in the age of LLMs

I previously wrote about TorchServe as a nice way to serve models.

There are many ways to optimize LLMs for inference, such as quantization, Paged Attention, kernel fusion, and other compilation techniques. The list keeps expanding as demand for serving OSS LLMs grows: many organizations have tried OpenAI/Anthropic/PaLM but want a solution they can control.


This is why TorchServe is nice: you can use any framework you want in the custom handler, such as GPTQ, transformers, vLLM, etc. (as long as there is a Python client). With TorchServe, there is enough f
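
To make the "any framework in the custom handler" point concrete, here is a minimal sketch of the handler pattern. It is not TorchServe code: the class name `PluggableLLMHandler`, the `backend_factory` argument, and the `echo_backend` stand-in are all hypothetical, and the class only mirrors the `initialize` / `preprocess` / `inference` / `postprocess` / `handle` flow that TorchServe's `BaseHandler` defines. In a real deployment you would presumably subclass `ts.torch_handler.base_handler.BaseHandler` and load GPTQ, transformers, or vLLM inside `initialize`.

```python
from typing import Any, Callable, Dict, List, Optional


class PluggableLLMHandler:
    """Hypothetical stand-in that mirrors the flow of a TorchServe custom handler."""

    def __init__(self, backend_factory: Callable[[], Callable[[str], str]]):
        # backend_factory is an assumption: any callable that returns a
        # prompt -> text function (e.g. a wrapper around vLLM or transformers).
        self.backend_factory = backend_factory
        self.generate: Optional[Callable[[str], str]] = None
        self.initialized = False

    def initialize(self, context: Any = None) -> None:
        # Load the model once per worker, with whatever framework you like.
        self.generate = self.backend_factory()
        self.initialized = True

    def preprocess(self, requests: List[Dict[str, Any]]) -> List[str]:
        # Each request in the batch carries its payload under "data" or "body".
        prompts = []
        for req in requests:
            payload = req.get("data") or req.get("body")
            if isinstance(payload, (bytes, bytearray)):
                payload = payload.decode("utf-8")
            prompts.append(payload)
        return prompts

    def inference(self, prompts: List[str]) -> List[str]:
        return [self.generate(p) for p in prompts]

    def postprocess(self, outputs: List[str]) -> List[str]:
        # Return one response per request in the batch.
        return outputs

    def handle(self, data: List[Dict[str, Any]], context: Any = None) -> List[str]:
        if not self.initialized:
            self.initialize(context)
        return self.postprocess(self.inference(self.preprocess(data)))


def echo_backend() -> Callable[[str], str]:
    # Placeholder backend so the sketch runs without any model installed.
    return lambda prompt: f"echo: {prompt}"


handler = PluggableLLMHandler(echo_backend)
print(handler.handle([{"body": b"hello"}, {"data": "world"}]))
```

Swapping the inference framework then only means passing a different factory; the request parsing and response shaping stay the same.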

hamelsmu / app.ipynb
Last active November 14, 2022 18:16
A demo of creating a gradio app with nbdev
hamelsmu / app.ipynb
Created November 7, 2022 20:20
A demo of creating a gradio app with nbdev
hamelsmu / app.ipynb
Created November 7, 2022 18:21
A demo of creating a gradio app with nbdev
hamelsmu / app.ipynb
Created November 7, 2022 18:10
A demo of creating a gradio app with nbdev
hamelsmu / app.ipynb
Created November 7, 2022 18:07
A demo of creating a gradio app with nbdev
hamelsmu / app.ipynb
Created November 7, 2022 17:54
A demo of creating a gradio app with nbdev
hamelsmu / app.ipynb
Created November 7, 2022 17:53
A demo of creating a gradio app with nbdev
hamelsmu / notes.ipynb
Created August 30, 2022 19:25
How docs workflow works
hamelsmu / get_validation_preds.py
Created July 3, 2022 03:57
Get Validation Predictions For Lesson 3 wandb
import wandb
import pandas as pd
from fastcore.all import Path

def val_pred_table(run_id, entity='wandb_course', proj='lemon-project'):
    "Get prediction table on the validation set for the lemon project."
    api = wandb.Api()
    # Download the predictions table artifact logged by the given run
    path = api.artifact(f'{entity}/{proj}/run-{run_id}-predictions_table:v0').download()
    # `read_json` is a fastcore extension to `Path`
    preds = (Path(path)/'predictions_table.table.json').read_json()
    # Each row is unpacked as four columns; keep the 2nd (prediction) and 4th (label)
    return pd.DataFrame([{'pred':p, 'label':t} for _,p,_,t in preds['data']])
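
The list comprehension in `val_pred_table` unpacks every row of the table JSON into four columns and keeps the second as the prediction and the fourth as the label. The sketch below reproduces that unpacking on an in-memory dictionary so it runs without a wandb download. The column names, the row values, and the helper names `rows_to_records` and `accuracy` are made up for illustration; only the four-column shape and the `'data'` key come from the function above.

```python
# Illustrative stand-in for the parsed 'predictions_table.table.json'.
# Column names and values are assumptions, not taken from the real artifact.
sample_table = {
    "columns": ["image", "pred", "score", "label"],
    "data": [
        [0, "lemon", 0.9, "lemon"],
        [1, "lemon", 0.6, "not_lemon"],
        [2, "not_lemon", 0.8, "not_lemon"],
    ],
}


def rows_to_records(table):
    """Same row unpacking as val_pred_table: keep the 2nd and 4th columns."""
    return [{"pred": p, "label": t} for _, p, _, t in table["data"]]


def accuracy(records):
    """Fraction of rows where the prediction matches the label."""
    return sum(r["pred"] == r["label"] for r in records) / len(records)


records = rows_to_records(sample_table)
print(records[0])
print(accuracy(records))
```

Passing `records` to `pd.DataFrame` gives the same two-column frame that `val_pred_table` returns.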