Zero-th order algorithm for softmax attention optimization

Publication
2024 IEEE International Conference on Big Data (BigData) (IEEE), 24–33
Yichuan Deng
Yichuan Deng
Ph.D. Student