Introduction to Language Models Posted on 2025-02-01 Edited on 2026-04-14 In AI Word count in article: 465 Reading time ≈ 2 mins. Chapter 1 - 了解如何加载模型 Read more »
Softmax Posted on 2025-02-01 Edited on 2026-04-14 In AI Word count in article: 2 Reading time ≈ 1 mins. Softmax 笔记 Read more »
Looking Inside Transformer LLMs Posted on 2025-01-31 Edited on 2026-04-14 In AI Word count in article: 529 Reading time ≈ 2 mins. Chapter 3 - 1. 了解 transformers llm 不同输入和输出的区别?以及看看不同的 output 可以有什么用?2. 了解 RMSNorm 和 layernorm 的区别3.了解 KV cache 的原理,以及在推理的时候怎么使用? Read more »