Thanks for the example. We have deployed many conversational models on SageMaker. The challenge is that this way the endpoint does not stream the response, and for longer responses it often times out.
You could try another conversational pattern for your server/client, such as a WebSocket, so the response is streamed incrementally instead of returned in one blocking call.
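To make the suggestion concrete, here is a minimal sketch of the streaming pattern, not tied to any particular server framework. `generate_tokens` is a hypothetical stand-in for the model's incremental decoding loop, and `send_chunk` stands in for whatever the WebSocket layer provides for pushing a message to the client; the point is only that each chunk is sent as soon as it is produced, so the connection never sits idle waiting for the full response.

```python
def generate_tokens(prompt):
    # Placeholder for the model's incremental decoding loop. A real server
    # would yield tokens as the model produces them (e.g. via a streaming
    # generation API) rather than splitting a precomputed string.
    for word in f"Echo: {prompt}".split():
        yield word + " "

def handle_request(prompt, send_chunk):
    # Server-side handler: push each chunk over the open connection as soon
    # as it is available, instead of buffering the whole response. This is
    # what avoids the timeout on long generations.
    for chunk in generate_tokens(prompt):
        send_chunk(chunk)

# Client side: accumulate chunks as they arrive.
chunks = []
handle_request("hi", chunks.append)
print("".join(chunks).strip())
```

The same shape works with any transport that keeps the connection open (WebSockets, server-sent events, or chunked HTTP); only the `send_chunk` implementation changes.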
Has anyone used an inference config with the code above so that the model can handle embeddings?
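One possible direction, assuming the endpoint runs the Hugging Face inference toolkit: the `HF_TASK` environment variable selects which pipeline the container loads, and setting it to `feature-extraction` makes the endpoint return embeddings rather than generated text. A sketch of the environment config (the model ID below is just an illustrative example, not from the original gist):

```python
# Environment for a SageMaker Hugging Face endpoint that serves embeddings.
# HF_TASK picks the pipeline; "feature-extraction" returns embedding vectors.
embedding_env = {
    "HF_TASK": "feature-extraction",
    # Example model; substitute whatever model the deployment actually uses.
    "HF_MODEL_ID": "sentence-transformers/all-MiniLM-L6-v2",
}

# This dict would then be passed as the `env` argument when constructing the
# model in the SageMaker SDK, e.g.:
#   HuggingFaceModel(env=embedding_env, role=..., transformers_version=..., ...)
print(embedding_env["HF_TASK"])
```

If you need custom pre/post-processing (e.g. pooling strategy), an `inference.py` entry point can still override the default pipeline behavior.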
Thanks for sharing; this is helping me a lot in figuring this topic out. One question: why is there a mismatch between the transformers version in the requirements.txt file and the one in the SageMaker model creation command? What is the difference, and how does it make sense for them to differ?