Publications
Arithmetic Transformers Can Length-Generalize in Both Operand Length and Count
Hanseul Cho*, Jaeyoung Cha*, Srinadh Bhojanapalli, Chulhee Yun
arXiv preprint
Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure
Hanseul Cho*, Jaeyoung Cha*, Pranjal Awasthi, Srinadh Bhojanapalli, Anupam Gupta, Chulhee Yun
NeurIPS 2024
Short version at ICML 2024 Workshop on Long-Context Foundation Models (LCFM)
Tighter Lower Bounds for Shuffling SGD: Random Permutations and Beyond
Jaeyoung Cha, Jaewook Lee, Chulhee Yun
ICML 2023 (Oral)