Softmax Posted on 2025-02-01 Edited on 2025-02-15 In AI Word count in article: 2 Reading time ≈ 1 mins. #deepLearning/softmax Read more »
Looking Inside Transformer LLMs Posted on 2025-01-31 Edited on 2025-02-15 In AI Word count in article: 529 Reading time ≈ 2 mins. Chapter 3 - 1. 了解 transformers llm 不同输入和输出的区别?以及看看不同的 output 可以有什么用?2. 了解 RMSNorm 和 layernorm 的区别3.了解 KV cache 的原理,以及在推理的时候怎么使用? Read more »
return_tensors Posted on 2025-01-30 Edited on 2025-02-15 In AI Word count in article: 850 Reading time ≈ 3 mins. Why use "return_tensors="? Read more »