Zihao Ye (叶子豪)
Ph.D. student @ UW CSE
Bill & Melinda Gates Center, Room 330
Email : zhye [at] cs [dot] washington [dot] edu
Strava : My Account
Github : yzh119
About Me
This is Zihao Ye, a fourth-year Ph.D. student at the University of Washington, advised by Luis Ceze. I also work closely with Tianqi Chen on Machine Learning Compilers. I'm a research intern at NVIDIA, working with Vinod Grover. I'm fortunate to be a recipient of NVIDIA Graduate Fellowship 2024-2025.
I'll be visiting CMU Catalyst starting from 2025.
Prior to joining UW, I spent two years at AWS where Minjie Wang and Zheng Zhang introduced me to the Machine Learning Systems. I obtained my bachelor's degree from ACM Honors Class at Shanghai Jiao Tong University.
Research
My current research focuses on Machine Learning Compilers and Sparse Computation. I'm passionate about building practical systems that have real-world impact and I enjoy dealing with engineering challenges.
I'm focusing on developing FlashInfer and some related research projects, feel free to drop me an email if are interested, and I'm open to collaborations.
Active Projects
- FlashInfer: Kernel Generator and Library for LLM Inference Serving
I'm excited to be part of MLC community and collaborate with a strong team on the following projects in TVM Unity:
- MLC-LLM: Universal Deployment of Large Language Models
- Relax: Composable Abstractions for End-to-End Dynamic Machine Learning
- TensorIR: Tensor-Level Abstractions for Deep Learning Operators
Earlier Projects
Misc
Heaven Sent is my favorite Doctor Who episode and it helped me get through every dark moments during my PhD.