Woosung (Reiss) Koh

Main
Experience
News

Experience
News
Projects
Blog
Publications
Recent & Upcoming Talks
- Example Talk
Teaching
- Learn JavaScript
- Learn Python

Large Language Models Can Control Their Own Attention Span

May 22, 2026·

[P2] Namgyu Ho*, Huzama Ahmad*, Woosung Koh*, Se-Young Yun, Tal Schuster, Cicero Nogueira Dos Santos

· 0 min read

Type

Conference paper

Publication

Pre-print

Last updated on May 29, 2026

Long-Context Efficient Inference Sparse Attention

mSFT: Addressing Dataset Mixtures Overfiting Heterogeneously in Multi-task SFT Mar 25, 2026 →

© 2026 Reiss Koh

Published with Hugo Blox Builder — the free, open source website builder that empowers creators.