Zero-th order algorithm for softmax attention optimization

Publication
arXiv preprint arXiv:2307.08352
Yichuan Deng
Yichuan Deng
Ph.D. Student in Computer Science

My research interests lay in Theoretical Computer Science.