AdaSTaR: Adaptive Data Sampling for Training Self-Taught Reasoners

Apr 19, 2025·

[C3] Woosung Koh

,

Wonbeen Oh

,

Jaein Jang

,

MinHyung Lee

,

Hyeongjin Kim

,

Ah Yeon Kim

,

Joonkee Kim

,

Junghyun Lee

,

Taehyeon Kim

,

Se-Young Yun

· 0 min read

Type

Conference paper

Publication

NeurIPS 2025

Last updated on Feb 7, 2026

Post-Training Reasoning Data Sampling Training Efficiency

← Predicting LLM Reasoning Performance with Small Proxy Model Sep 25, 2025

C$^2$: Scalable Auto-Feedback for LLM-based Chart Generation May 1, 2024 →