Publications

Arithmetic Transformers Can Length-Generalize in Both Operand Length and Count

Hanseul Cho*, Jaeyoung Cha*, Srinadh Bhojanapalli, Chulhee Yun
arxiv preprint

Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure

Hanseul Cho*, Jaeyoung Cha*, Pranjal Awasthi, Srinadh Bhojanapalli, Anupam Gupta, Chulhee Yun
NeurIPS 2024
Short version at ICML 2024 Workshop on Long-Context Foundation Models (LCFM)

Tighter Lower Bounds for Shuffling SGD: Random Permutations and Beyond

Jaeyoung Cha, Jaewook Lee, Chulhee Yun
ICML 2023 (Oral)