← Awesome Lists

AWESOME · GENERATIVE AI · CURATED

Awesome Generative AI.

generative-ai llm awesome resources
A curated path through generative AI — courses and reading to learn it, frameworks and serving to build with it, RAG / fine-tuning / eval / agents to make it useful, and the landmark papers that define the field. Opinionated and kept tight: the links worth your time. Links open in a new tab.

Courses & learning

ResourceWhatLink
Karpathy — Neural Nets: Zero to HeroBuild GPT from scratch. The single best intro to how LLMs work.site
Hugging Face — LLM CourseHands-on transformers, fine-tuning, and the HF ecosystem.course
DeepLearning.AI short coursesBite-size, practical RAG/agents/prompting courses.site
CS336 — Language Modeling from Scratch (Stanford)Build a real LLM end-to-end. Deep and current.site
Prompt Engineering GuideThe reference for prompting patterns + techniques.site

Reading & deep dives

ResourceWhatLink
The Illustrated Transformer (Jay Alammar)The visual explainer everyone learns attention from.post
Lil'Log (Lilian Weng)Deep, rigorous posts on agents, hallucination, diffusion, RLHF.blog
What Is ChatGPT Doing… (Wolfram)Intuitive long-form on how LLMs generate text.essay
Hugging Face BlogPractical deep dives on training, quantization, serving.blog

Frameworks & orchestration

ResourceWhatLink
LangChain / LangGraphUbiquitous LLM glue; LangGraph for stateful agent graphs.repo
LlamaIndexThe RAG/data-framework default — ingestion, indexing, retrieval.repo
DSPyProgram-and-compile prompts instead of hand-tuning strings.repo
Vercel AI SDKTypeScript-first streaming UI + model-agnostic provider API.site
LiteLLMOne OpenAI-style API across 100+ providers — gateway + fallbacks.repo

Models & access

ResourceWhatLink
Hugging Face HubThe home of open models, datasets, and spaces.site
Llama / Mistral / Qwen / DeepSeek / GemmaThe leading open-weight model families.browse
OpenAI / Anthropic / GoogleFrontier closed APIs — GPT, Claude, Gemini.docs
LMArena / Open LLM LeaderboardCompare models by human votes + benchmarks.site

Serving & inference

ResourceWhatLink
vLLMHigh-throughput serving — PagedAttention, continuous batching. The default.repo
OllamaRun open models locally with one command. Best DX for local.site
llama.cppCPU/GPU inference + GGUF quantization. Runs anywhere.repo
TGI / TensorRT-LLMProduction serving stacks (HF / NVIDIA).repo

RAG & vector stores

ResourceWhatLink
pgvectorVectors in Postgres — start here before a dedicated DB.repo
Qdrant / Weaviate / MilvusDedicated vector databases for scale + filtering.repo
FAISSThe in-process similarity-search library (Meta).repo
MTEB leaderboardPick your embedding model by benchmark.site

Fine-tuning & training

ResourceWhatLink
PEFT / LoRA (HF)Parameter-efficient fine-tuning — adapters, not full weights.repo
AxolotlConfig-driven fine-tuning across many models + methods.repo
Unsloth2x faster, low-memory fine-tuning.repo
TRL (HF)SFT + RLHF (PPO/DPO/GRPO) for aligning models.repo

Eval & observability

ResourceWhatLink
promptfooTest/compare prompts + models in CI. Catch regressions.repo
RagasMetrics for RAG quality — faithfulness, relevance, recall.repo
LangfuseOpen-source LLM tracing, eval, and cost observability.repo
OpenAI Evals / lm-eval-harnessStandard frameworks for model benchmarking.repo

Agents & tool use

ResourceWhatLink
Model Context Protocol (MCP)The emerging standard for connecting models to tools/data.site
LangGraphStateful, controllable agent graphs. The production default.repo
OpenAI Agents SDK / SwarmLightweight multi-agent orchestration patterns.repo
CrewAI / AutoGenMulti-agent collaboration frameworks.repo

Image / video / audio generation

ResourceWhatLink
Diffusers (HF)The library for diffusion image/video/audio pipelines.repo
ComfyUINode-based pipelines for Stable Diffusion + video models.repo
WhisperRobust open speech-to-text (OpenAI).repo
Stable Diffusion / FLUXThe leading open text-to-image model families.models

Landmark papers

PaperWhy it mattersLink
Attention Is All You Need (2017)The Transformer — everything generative is built on it.arXiv
GPT-3 — Language Models are Few-Shot Learners (2020)Scale + in-context learning changed the game.arXiv
InstructGPT / RLHF (2022)Aligning models to instructions — the ChatGPT recipe.arXiv
Chain-of-Thought (2022)Reasoning emerges from "think step by step".arXiv
RAG (2020)Retrieval-augmented generation — ground models in data.arXiv
LoRA (2021)Low-rank adaptation — cheap fine-tuning everyone uses.arXiv
Latent Diffusion (2021)Made high-quality image generation cheap + open.arXiv
where to start New to GenAI? Do Karpathy's Zero-to-Hero, build a RAG app with LlamaIndex + pgvector, serve with Ollama/vLLM, evaluate with promptfoo, and trace with Langfuse. Read Attention → GPT-3 → RAG → InstructGPT → CoT. Always wire eval before scaling — vibes don't catch regressions.
← prev: Reinforcement Learning next: Robotics →
© cvam — written in plaintext, served warm