Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation Paper • 2604.27263 • Published May 14 • 11
Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation Paper • 2604.27263 • Published May 14 • 11
Hermes 4 Evaluations Collection Evals from the Hermes-4 Technical Report • 20 items • Updated Dec 3, 2025 • 2