===== Large Language Models (LLMs): Resources ===== This is an ad-hoc list of resources about LLMs. ==== Introduction ==== * [[https://arstechnica.com/science/2023/07/a-jargon-free-explanation-of-how-ai-large-language-models-work/|A jargon-free explanation of how AI large language models work]] * [[https://mark-riedl.medium.com/a-very-gentle-introduction-to-large-language-models-without-the-hype-5f67941fa59e|A Very Gentle Introduction to Large Language Models without the Hype]] * [[https://www.youtube.com/watch?v=zjkBMFhNj_g|Intro to Large Language Models]] by Andrej Karpathy * [[https://medium.com/data-science-at-microsoft/how-large-language-models-work-91c362f5b78f|How Large Language Models work: From zero to ChatGPT]] * [[https://www.youtube.com/watch?v=LPZh9BOjkQs|Large Language Models explained briefly]] by 3Blue1Brown * [[https://www.youtube.com/watch?v=wjZofJX0v4M|Transformers (how LLMs work) explained visually]] by 3Blue1Brown * [[https://www.youtube.com/watch?v=eMlx5fFNoYc|Attention in transformers, step-by-step]] by 3Blue1Brown * [[https://www.youtube.com/watch?v=9-Jl0dxWQs8|How might LLMs store facts]] by 3Blue1Brown * [[https://arxiv.org/abs/2501.09223|Foundations of Large Language Models]] by Tong Xiao, Jingbo Zhu * [[https://en.wikipedia.org/wiki/Large_language_model|Large language model]] by Wikipedia * [[https://arxiv.org/abs/2402.06853|History, Development, and Principles of Large Language Models-An Introductory Survey]] by Zichong Wang, Zhibo Chu, Thang Viet Doan, Shiwen Ni, Min Yang, Wenbin Zhang * [[https://www.youtube.com/watch?v=KJtZARuO3JY|Visualizing transformers and attention]] by Grant Sanderson ==== Courses ==== * [[https://karpathy.ai/zero-to-hero.html|Neural Networks: Zero to Hero]] by Andrej Karpathy * A course by Andrej Karpathy on building neural networks, from scratch, in code. * [[https://www.youtube.com/playlist?list=PLZHQObOWTQDNU6R1_67000Dx_ZCJB-3pi| Neural networks]] by 3Blue1Brown * Basics of neural networks, backpropagation, transformers