Woosung (Reiss) Koh
  • Main
  • Experience
  • News
  • Experience
  • News
  • Projects
    • Pandas
    • PyTorch
    • scikit-learn
  • Blog
    • ๐ŸŽ‰ Easily create your own simple yet highly customizable blog
    • ๐Ÿง  Sharpen your thinking with a second brain
    • ๐Ÿ“ˆ Communicate your results effectively with the best data visualizations
    • ๐Ÿ‘ฉ๐Ÿผโ€๐Ÿซ Teach academic courses
    • โœ… Manage your projects
  • Publications
    • mSFT: Addressing Dataset Mixtures Overfiting Heterogeneously in Multi-task SFT
    • Generative Visual Code Mobile World Models
    • Predicting LLM Reasoning Performance with Small Proxy Model
    • AdaSTaR: Adaptive Data Sampling for Training Self-Taught Reasoners
    • C$^2$: Scalable Auto-Feedback for LLM-based Chart Generation
    • FlickerFusion: Intra-trajectory Domain Generalizing Multi-Agent RL
    • Encoding Temporal Statistical-space Priors via Augmented Representation
    • Curriculum Learning and Imitation Learning for Model-free Control on Financial Time-series
    • Network-based exploratory data analysis and explainable three-stage deep clustering for financial customer profiling
  • Recent & Upcoming Talks
    • Example Talk
  • Teaching
    • Learn JavaScript
    • Learn Python

mSFT: Addressing Dataset Mixtures Overfiting Heterogeneously in Multi-task SFT

Mar 25, 2026ยท
[P2] Woosung Koh*
,
Jeyoung Jeon*
,
Youngjin Song
,
Yujin Cheon
,
Soowon Oh
,
Jaehyeong Choi
,
Se-Young Yun
ยท 0 min read
Type
Conference paper
Publication
Pre-print
Last updated on Mar 12, 2026
Post-Training Multi-Task Generalization

Generative Visual Code Mobile World Models Jan 25, 2026 →

ยฉ 2026 Reiss Koh

Published with Hugo Blox Builder โ€” the free, open source website builder that empowers creators.