15 March 2025 3 6K Report

When training large models(LLMs), which training speed metric is primarily considered: learning rate, batch size, or batch time?

More Tong Guo's questions See All
Similar questions and discussions