User Tools

Site Tools


resources_llms

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
resources_llms [2025/02/02 20:32] – removed - external edit (Unknown date) 127.0.0.1resources_llms [2025/02/02 20:48] (current) – Add courses for LLM resources. manish
Line 1: Line 1:
 +===== Large Language Models (LLMs): Resources =====
 +This is an ad-hoc list of resources about LLMs.
  
 +==== Introduction ====
 +  * [[https://arstechnica.com/science/2023/07/a-jargon-free-explanation-of-how-ai-large-language-models-work/|A jargon-free explanation of how AI large language models work]]
 +  * [[https://mark-riedl.medium.com/a-very-gentle-introduction-to-large-language-models-without-the-hype-5f67941fa59e|A Very Gentle Introduction to Large Language Models without the Hype]]
 +  * [[https://www.youtube.com/watch?v=zjkBMFhNj_g|Intro to Large Language Models]] by Andrej Karpathy
 +  * [[https://medium.com/data-science-at-microsoft/how-large-language-models-work-91c362f5b78f|How Large Language Models work: From zero to ChatGPT]]
 +  * [[https://www.youtube.com/watch?v=LPZh9BOjkQs|Large Language Models explained briefly]] by 3Blue1Brown
 +  * [[https://www.youtube.com/watch?v=wjZofJX0v4M|Transformers (how LLMs work) explained visually]] by 3Blue1Brown
 +  * [[https://www.youtube.com/watch?v=eMlx5fFNoYc|Attention in transformers, step-by-step]] by 3Blue1Brown
 +  * [[https://www.youtube.com/watch?v=9-Jl0dxWQs8|How might LLMs store facts]] by 3Blue1Brown
 +  * [[https://arxiv.org/abs/2501.09223|Foundations of Large Language Models]] by Tong Xiao, Jingbo Zhu
 +  * [[https://en.wikipedia.org/wiki/Large_language_model|Large language model]] by Wikipedia
 +  * [[https://arxiv.org/abs/2402.06853|History, Development, and Principles of Large Language Models-An Introductory Survey]] by Zichong Wang, Zhibo Chu, Thang Viet Doan, Shiwen Ni, Min Yang, Wenbin Zhang
 +  * [[https://www.youtube.com/watch?v=KJtZARuO3JY|Visualizing transformers and attention]] by Grant Sanderson
 +
 +
 +==== Courses ====
 +  * [[https://karpathy.ai/zero-to-hero.html|Neural Networks: Zero to Hero]] by Andrej Karpathy
 +    * A course by Andrej Karpathy on building neural networks, from scratch, in code.
 +  * [[https://www.youtube.com/playlist?list=PLZHQObOWTQDNU6R1_67000Dx_ZCJB-3pi| Neural networks]] by 3Blue1Brown
 +    * Basics of neural networks, backpropagation, transformers

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki