Publications

[C4] Incremental Gradient Descent with Small Epoch Counts is Surprisingly Slow on Ill-Conditioned Problems

Yujun Kim*, Jaeyoung Cha*, Chulhee Yun
ICML 2025

[C3] Arithmetic Transformers Can Length-Generalize in Both Operand Length and Count

Hanseul Cho*, Jaeyoung Cha*, Srinadh Bhojanapalli, Chulhee Yun
ICLR 2025

[C2] Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure

Hanseul Cho*, Jaeyoung Cha*, Pranjal Awasthi, Srinadh Bhojanapalli, Anupam Gupta, Chulhee Yun
NeurIPS 2024
Short version at ICML 2024 Workshop on Long-Context Foundation Models (LCFM)

[C1] Tighter Lower Bounds for Shuffling SGD: Random Permutations and Beyond

Jaeyoung Cha, Jaewook Lee, Chulhee Yun
ICML 2023 (Oral)