Goals: Add links that give reasonable, clear explanations of how stuff works. No hype, and no vendor content where possible. Practical first-hand accounts of models in prod are eagerly sought.
- The Illustrated Word2vec - A Gentle Intro to Word Embeddings in Machine Learning (YouTube)
- Transformers as Support Vector Machines
- Survey of LLMs
- Deep Learning Systems
- Fundamental ML Reading List
- What are embeddings
- Concepts from Operating Systems that Found their way into LLMs
- Talking about Large Language Models
- Language Modeling is Compression
- Vector Search - Long-Term Memory in AI
- Eight things to know about large language models
- The Bitter Lesson
- The Hardware Lottery
- The Scaling Hypothesis
- Tokenization
- LLM Course
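
To make the tokenization link above a bit more concrete, here is a toy sketch of the core byte-pair-encoding idea (repeatedly merge the most frequent adjacent pair of symbols). It is purely illustrative; real tokenizers add byte-level handling, pre-tokenization rules, and trained merge tables.

```python
from collections import Counter

def bpe_merges(word, num_merges):
    """Toy BPE on a single 'corpus' word: repeatedly merge the most frequent adjacent pair."""
    symbols = list(word)  # start from individual characters
    merges = []
    for _ in range(num_merges):
        pairs = Counter(zip(symbols, symbols[1:]))
        if not pairs:
            break
        (a, b), _count = pairs.most_common(1)[0]
        merges.append(a + b)
        # rebuild the symbol sequence with the chosen pair merged
        merged, i = [], 0
        while i < len(symbols):
            if i < len(symbols) - 1 and symbols[i] == a and symbols[i + 1] == b:
                merged.append(a + b)
                i += 2
            else:
                merged.append(symbols[i])
                i += 1
        symbols = merged
    return symbols, merges

print(bpe_merges("banananana", 3))  # (['banan', 'anan', 'a'], ['an', 'anan', 'banan'])
```
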
- Seq2Seq
- Attention is all you Need
- BERT
- GPT-1
- Scaling Laws for Neural Language Models - a back-of-the-envelope compute sketch follows this group of links
- T5
- GPT-2: Language Models are Unsupervised Multi-Task Learners
- InstructGPT: Training Language Models to Follow Instructions
- GPT-3: Language Models are Few-Shot Learners
- Transformers from Scratch
- Transformer Math
- Five Years of GPT Progress
- Lost in the Middle: How Language Models Use Long Contexts
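
One back-of-the-envelope number that the scaling-laws and transformer-math links above rely on is the approximation that dense transformer training costs roughly 6 FLOPs per parameter per token (C ≈ 6ND). A rough sketch, with made-up hardware numbers purely for illustration:

```python
def approx_training_flops(n_params, n_tokens):
    """Rule-of-thumb training compute: ~6 FLOPs per parameter per token
    (~2ND for the forward pass, ~4ND for the backward pass)."""
    return 6 * n_params * n_tokens

# Example: a 7e9-parameter model trained on 1e12 tokens.
flops = approx_training_flops(7e9, 1e12)
print(f"~{flops:.2e} FLOPs")  # ~4.2e22 FLOPs

# Very rough wall-clock estimate on hypothetical hardware sustaining 100 TFLOP/s per GPU;
# ignores efficiency losses, data loading, restarts, etc.
sustained_flops_per_gpu = 100e12
n_gpus = 256
seconds = flops / (sustained_flops_per_gpu * n_gpus)
print(f"~{seconds / 86400:.1f} days on {n_gpus} GPUs")
```
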
- Self-attention and transformer networks
- Attention
- Understanding and Coding the Attention Mechanism
- Attention Mechanisms
- Keys, Queries, and Values
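
To ground the attention links above, here is a minimal NumPy sketch of single-head scaled dot-product attention: project the input to queries, keys, and values, score each query against every key, softmax, and take a weighted sum of values. There is no masking, batching, or multi-head logic, so treat it as a toy illustration rather than any library's actual implementation.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)  # subtract max for numerical stability
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(x, w_q, w_k, w_v):
    """Single-head scaled dot-product attention over a sequence x of shape (seq, d_model)."""
    q, k, v = x @ w_q, x @ w_k, x @ w_v          # project inputs to queries, keys, values
    scores = q @ k.T / np.sqrt(k.shape[-1])      # similarity of every query with every key
    weights = softmax(scores, axis=-1)           # each row sums to 1
    return weights @ v                           # weighted sum of values

rng = np.random.default_rng(0)
seq, d_model, d_head = 4, 8, 8
x = rng.normal(size=(seq, d_model))
w_q, w_k, w_v = (rng.normal(size=(d_model, d_head)) for _ in range(3))
print(attention(x, w_q, w_k, w_v).shape)  # (4, 8)
```
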
- What is ChatGPT doing and why does it work
- My own notes from a few months back.
- Karpathy's The State of GPT (YouTube)
- OpenAI Cookbook
- Catching up on the weird world of LLMs
- How open are open architectures?
- Building an LLM from Scratch
- Large Language Models in 2023 and Slides
- Timeline of Transformer Models
- Large Language Model Evolutionary Tree
- Why host your own LLM?
- How to train your own LLMs
- Hugging Face Resources on Training Your Own
- Training Compute-Optimal Large Language Models
- Opt-175B Logbook
- RLHF
- Instruction-tuning for LLMs: Survey
- Direct Preference Optimization: Your Language Model is Secretly a Reward Model
- RLHF and DPO Compared
- The Complete Guide to LLM Fine-tuning
- LLaMAntino: LLaMA 2 Models for Effective Text Generation in Italian Language - Really great overview of SOTA fine-tuning techniques
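
As a companion to the DPO links above: DPO drops the explicit reward model and instead scores each preference pair by log-probability ratios against a frozen reference policy. Below is a minimal NumPy sketch of the per-example loss, assuming you already have summed token log-probs for the chosen and rejected responses; it is a toy illustration, not a training recipe.

```python
import numpy as np

def dpo_loss(logp_chosen, logp_rejected, ref_logp_chosen, ref_logp_rejected, beta=0.1):
    """Direct Preference Optimization loss for one preference pair.
    Inputs are total (summed over tokens) log-probs of each response under
    the policy being trained and under the frozen reference model."""
    chosen_ratio = logp_chosen - ref_logp_chosen
    rejected_ratio = logp_rejected - ref_logp_rejected
    margin = beta * (chosen_ratio - rejected_ratio)
    return -np.log(1.0 / (1.0 + np.exp(-margin)))  # -log(sigmoid(margin))

# Example: the policy already prefers the chosen response slightly more than the reference does.
print(dpo_loss(logp_chosen=-12.0, logp_rejected=-15.0,
               ref_logp_chosen=-13.0, ref_logp_rejected=-14.0))
```
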
- On the Structural Pruning of Large Language Models
- Quantization - a toy int8 quantization sketch follows this group of links
- PEFT
- How is LlamaCPP Possible?
- How to beat GPT-4 with a 13B Model
- Efficient LLM Inference on CPUs
- Tiny Language Models Come of Age
- Efficiency LLM Spectrum
- TinyML at MIT
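
A toy sketch to go with the quantization link above: symmetric per-tensor int8 weight quantization with a single scale, plus dequantization to check the error. Real schemes (group-wise scales, 4-bit formats, outlier handling, activation quantization) are considerably more involved.

```python
import numpy as np

def quantize_int8(w):
    """Symmetric per-tensor quantization: map floats to int8 with a single scale factor."""
    scale = np.abs(w).max() / 127.0
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
w = rng.normal(scale=0.02, size=(256, 256)).astype(np.float32)  # toy weight matrix
q, scale = quantize_int8(w)
w_hat = dequantize(q, scale)
print("max abs error:", np.abs(w - w_hat).max())
print("memory: float32", w.nbytes, "bytes vs int8", q.nbytes, "bytes (plus one scale)")
```
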
- Building LLM Applications for Production
- Challenges and Applications of Large Language Models
- All the Hard Stuff Nobody talks about when building products with LLMs
- Scaling Kubernetes to run ChatGPT
- Numbers every LLM Developer should know
- Against LLM Maximalism
- A Guide to Inference and Performance
- (InThe)WildChat: 570K ChatGPT Interaction Logs In The Wild
- The State of Production LLMs in 2023
- Machine Learning Engineering for successful training of large language models and multi-modal models.
- Fine-tuning RedPajama on Slack Data
- LLM Inference Performance Engineering: Best Practices
- How to Make LLMs go Fast
- Transformer Inference Arithmetic
- Which serving technology to use for LLMs?
- Speeding up the K-V cache
- Large Transformer Model Inference Optimization
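
In the spirit of the inference-arithmetic and KV-cache links above, a small back-of-the-envelope sketch: the KV cache stores one key and one value vector per layer per token, so it grows linearly with batch size and context length. The model dimensions below are made up for illustration, not taken from any particular model.

```python
def kv_cache_bytes(n_layers, n_kv_heads, head_dim, seq_len, batch_size, bytes_per_elem=2):
    """Memory for the key/value cache: 2 tensors (K and V) per layer per token,
    each of size n_kv_heads * head_dim, stored here in fp16 (2 bytes per element)."""
    return 2 * n_layers * n_kv_heads * head_dim * seq_len * batch_size * bytes_per_elem

# Example: a hypothetical 32-layer model with 32 KV heads of dim 128,
# serving a batch of 8 sequences at 4096 tokens of context.
gib = kv_cache_bytes(n_layers=32, n_kv_heads=32, head_dim=128,
                     seq_len=4096, batch_size=8) / 2**30
print(f"~{gib:.1f} GiB of KV cache")  # ~16 GiB
```
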
- On Prompt Engineering
- Prompt Engineering Versus Blind Prompting
- Building RAG-Based Applications for Production
- Full Fine-Tuning, PEFT, or RAG?
- Prompt Engineering Guide
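
To make the RAG links above concrete, a minimal retrieve-then-prompt sketch: embed the documents and the query, pick the closest document by cosine similarity, and paste it into the prompt. The bag-of-words `embed` function here is a deliberately crude stand-in for a real embedding model and vector index.

```python
import numpy as np

def embed(text, vocab):
    """Crude bag-of-words embedding, normalized to unit length; a stand-in for a real model."""
    words = text.lower().split()
    v = np.array([words.count(w) for w in vocab], dtype=float)
    norm = np.linalg.norm(v)
    return v / norm if norm else v

docs = [
    "The KV cache stores keys and values so decoding avoids recomputing attention",
    "LoRA adds low-rank adapter matrices so fine-tuning touches few parameters",
    "Quantization stores weights in fewer bits to shrink memory use",
]
query = "how does the kv cache speed up decoding"

vocab = sorted({w for d in docs + [query] for w in d.lower().split()})
doc_vecs = np.stack([embed(d, vocab) for d in docs])
scores = doc_vecs @ embed(query, vocab)          # cosine similarity (vectors are unit-norm)
best = docs[int(np.argmax(scores))]

prompt = f"Answer using only this context:\n{best}\n\nQuestion: {query}"
print(prompt)  # this prompt would then be sent to whatever LLM you are using
```
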
- The Best GPUs for Deep Learning 2023
- Making Deep Learning Go Brr from First Principles
- Everything about Distributed Training and Efficient Finetuning
- Training LLMs at Scale with AMD MI250 GPUs
- GPU Programming
- Evaluating ChatGPT
- ChatGPT: Jack of All Trades, Master of None
- What's Going on with the Open LLM Leaderboard
- Challenges in Evaluating AI Systems
- LLM Evaluation Papers
- Evaluating LLMs is a Minefield
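
In the spirit of the evaluation links above, a minimal sketch of an exact-match evaluation loop. The `model` callable is hypothetical, and real evals (as these posts stress) need careful prompt templating, answer normalization, and statistical care.

```python
def exact_match_accuracy(model, dataset):
    """dataset: list of (prompt, expected_answer) pairs; model: callable prompt -> text."""
    def normalize(s):
        return " ".join(s.lower().strip().split())
    hits = sum(normalize(model(prompt)) == normalize(answer) for prompt, answer in dataset)
    return hits / len(dataset)

# Toy usage with a stub "model" that always answers "4".
dataset = [("What is 2 + 2?", "4"), ("What is the capital of France?", "Paris")]
print(exact_match_accuracy(lambda prompt: "4", dataset))  # 0.5
```
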
- Generative Interfaces Beyond Chat (YouTube)
- Why Chatbots are not the Future
- The Future of Search is Boutique
- As a Large Language Model, I
- Natural Language is an Unnatural Interface
Thanks to everyone who added suggestions on Twitter, Mastodon, and Bluesky.
It seems that the gzip approach, although really cool, was 'optimistic' and thus overhyped; see https://kenschutte.com/gzip-knn-paper/ (basically, they conflated the k in k-NN with top-k accuracy, so they were effectively reporting top-2 accuracy). More recent studies found that it performs, as expected, at about the level of a bag-of-words baseline: Gzip versus bag-of-words for text classification.
I don't know if you intend to (or are even interested), but I am on the lookout for "use cases for normies".