15 March 2025

Does directly adding a relative position embedding and an absolute position embedding give LLMs the same length-extrapolation advantages that RoPE (Rotary Position Embedding) provides?
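To make the comparison concrete, here is a minimal NumPy sketch of one structural difference behind the question. It assumes the standard RoPE formulation (rotating consecutive pairs of dimensions with base 10000) and uses additive sinusoidal embeddings as a stand-in for "absolute position embedding". The helper names `rope`, `sinusoidal`, `rope_score` and `additive_score` are hypothetical and chosen only for illustration. The sketch checks whether the query-key score changes when both positions are shifted by the same amount. It does not measure extrapolation quality in a trained model.

```python
import numpy as np

def rope(x, pos, base=10000.0):
    """Rotate consecutive pairs of x by position-dependent angles (RoPE-style)."""
    d = x.shape[-1]
    freqs = base ** (-np.arange(0, d, 2) / d)  # one frequency per dimension pair
    theta = pos * freqs
    cos, sin = np.cos(theta), np.sin(theta)
    x1, x2 = x[0::2], x[1::2]
    out = np.empty_like(x)
    out[0::2] = x1 * cos - x2 * sin
    out[1::2] = x1 * sin + x2 * cos
    return out

def sinusoidal(pos, d, base=10000.0):
    """Additive absolute embedding (sinusoidal), used here as an assumed baseline."""
    freqs = base ** (-np.arange(0, d, 2) / d)
    pe = np.empty(d)
    pe[0::2] = np.sin(pos * freqs)
    pe[1::2] = np.cos(pos * freqs)
    return pe

rng = np.random.default_rng(0)
d = 8
q, k = rng.normal(size=d), rng.normal(size=d)

def rope_score(m, n):
    # Position enters by rotating q and k before the dot product.
    return float(rope(q, m) @ rope(k, n))

def additive_score(m, n):
    # Position enters by adding an embedding to q and k before the dot product.
    return float((q + sinusoidal(m, d)) @ (k + sinusoidal(n, d)))

# Same offset (n - m = 4), but both positions shifted by 100.
rope_shift_gap = abs(rope_score(3, 7) - rope_score(103, 107))
additive_shift_gap = abs(additive_score(3, 7) - additive_score(103, 107))

print("RoPE score change under shift:    ", rope_shift_gap)
print("Additive score change under shift:", additive_shift_gap)
```

With RoPE the score depends only on the offset n - m, so the first gap is zero up to floating-point error. With additive embeddings the cross terms between content and position depend on the absolute positions, so the second gap is generally nonzero. A relative bias added directly to the attention score is offset-only by construction, but combining it with an additive absolute embedding brings the absolute-position cross terms back. Whether that difference accounts for RoPE's extrapolation behaviour in practice is the open part of the question.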
