SERIES · 斯坦福CS336: Language Modeling from Scratch

Stanford CS336: lecture 2 Pytorch, Resource Accounting

2025-09-25 · 25 min read · by GUMP

Stanford CS336: lecture 2 Pytorch, Resource Accounting

主要介绍训练模型所需的基本要素,从张量到模型、再到优化器与训练循环,强调资源效率,尤其是内存(GB)与计算量(FLOPs)的核算。课程不涉及 Transformer,而是通过更简单的模型来讲解。