Publications
[C4] Incremental Gradient Descent with Small Epoch Counts is Surprisingly Slow on Ill-Conditioned Problems
Yujun Kim*, Jaeyoung Cha*, Chulhee Yun
ICML 2025
[C3] Arithmetic Transformers Can Length-Generalize in Both Operand Length and Count
Hanseul Cho*, Jaeyoung Cha*, Srinadh Bhojanapalli, Chulhee Yun
ICLR 2025
[C2] Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure
Hanseul Cho*, Jaeyoung Cha*, Pranjal Awasthi, Srinadh Bhojanapalli, Anupam Gupta, Chulhee Yun
NeurIPS 2024
Short version at ICML 2024 Workshop on Long-Context Foundation Models (LCFM)
[C1] Tighter Lower Bounds for Shuffling SGD: Random Permutations and Beyond
Jaeyoung Cha, Jaewook Lee, Chulhee Yun
ICML 2023 (Oral)
