Woosung (Reiss) Koh

AdaSTaR: Adaptive Data Sampling for Training Self-Taught Reasoners

Apr 19, 2025
[C3] Woosung Koh, Wonbeen Oh, Jaein Jang, MinHyung Lee, Hyeongjin Kim, Ah Yeon Kim, Joonkee Kim, Junghyun Lee, Taehyeon Kim, Se-Young Yun
Type: Conference paper
Publication: NeurIPS 2025
Last updated on Oct 6, 2025

โ† Predicting LLM Reasoning Performance with Small Proxy Model Sep 25, 2025
C2^22: Scalable Auto-Feedback for LLM-based Chart Generation May 1, 2024 โ†’

© 2025 Reiss Koh
