Also on DBLP, Google Scholar and ORCID.
Conference Papers
-
Splitwise: Efficient Generative LLM Inference Using Phase Splitting
Pratyush Patel, Esha Choukse, Chaojie Zhang, Íñigo Goiri, Aashaka Shah, Saeed Maleki, Ricardo Bianchini
ISCA 2024
Best Paper Nominee
[ paper | slides | poster | code | traces | press ]
-
Characterizing Power Management Opportunities for LLMs in the Cloud
Pratyush Patel, Esha Choukse, Chaojie Zhang, Íñigo Goiri, Brijesh Warrier, Nithish Mahalingam, Ricardo Bianchini
ASPLOS 2024
[ paper | slides | lightning slides | lightning talk | poster ]
-
Hybrid Computing for Interactive Datacenter Applications
Pratyush Patel, Katie Lim, Kushal Jhunjhunwalla, Ashlie Martinez, Max Demoulin, Jacob Nelson, Irene Zhang, Thomas Anderson
Preprint
[ paper ]
-
File Systems are not Enough: Rethinking the Storage API for Microsecond-Scale Cloud Applications
Ashlie Martinez, Katie Lim, Pratyush Patel, Irene Zhang, Dan Ports, Jacob Nelson, Thomas Anderson
Preprint
[ paper ]
-
Srifty: Swift and Thrifty Distributed Neural Network Training on the Cloud
Liang Luo, Peter West, Pratyush Patel, Arvind Krishnamurthy, Luis Ceze
MLSys 2022
[ paper ]
-
The Demikernel Datapath OS Architecture for Microsecond-scale Datacenter Systems
Irene Zhang, Amanda Raybuck, Pratyush Patel, Kirk Olynyk, Jacob Nelson, Omar Navarro Leija, Ashlie Martinez, Jing Liu, Anna Kornfeld Simpson, Sujay Jayakar, Pedro Henrique Penna, Max Demoulin, Piali Choudhury, Anirudh Badam
SOSP 2021
[ paper | talk | code ]
-
SoundWatch: Exploring Smartwatch-based Deep Learning Approaches to Support Sound Awareness for Deaf and Hard of Hearing Users
Dhruv Jain, Hung Ngo, Pratyush Patel, Steven Goodman, Leah Findlater, Jon Froehlich
ASSETS 2020
Best Artifact Award, Selected for CACM Research Highlights
[ paper | slides | talk | poster | code | app | press ]
-
The Virtual Block Interface: A Flexible Alternative to Conventional Virtual Memory Frameworks
Nastaran Hajinazar, Pratyush Patel, Minesh Patel, Konstantinos Kanellopoulos, Saugata Ghose, Rachata Ausavarungnirun, Geraldo Francisco de Oliveira Jr., Jonathan Appavoo, Vivek Seshadri, Onur Mutlu
ISCA 2020
[ paper | slides | talk | lightning slides | lightning talk | poster ]
-
Gandiva: Introspective Cluster Scheduling for Deep Learning
Wencong Xiao, Romil Bhardwaj, Ramachandran Ramjee, Muthian Sivathanu, Nipun Kwatra, Zhenhua Han, Pratyush Patel, Xuan Peng, Hanyu Zhao, Quanlu Zhang, Fan Yang, Lidong Zhou
OSDI 2018
[ paper | slides | talk | poster ]
-
Analytical Enhancements and Practical Insights for MPCP with Self-Suspensions
Pratyush Patel, Iljoo Baek, Hyoseung Kim, Raj Rajkumar
RTAS 2018
Best Presentation Award
[ paper | slides ]
-
A Server-Based Approach for Predictable GPU Access Control
Hyoseung Kim, Pratyush Patel, Shige Wang, Raj Rajkumar
RTCSA 2017
Best Paper Award
[ paper | slides ]
-
TimerShield: Protecting High-Priority Tasks from Low-Priority Timer Interference
Pratyush Patel, Manohar Vanga, Björn Brandenburg
RTAS 2017
Best Paper Award
[ paper | slides | web | code ]
Journal Papers
-
Towards Improved Power Management in Cloud GPUs
Pratyush Patel, Zibo Gong, Syeda Rizvi, Esha Choukse, Pulkit Misra, Thomas Anderson, Akshitha Sriraman
IEEE CAL 2023
[ paper ]
-
SoundWatch: Deep Learning for Sound Accessibility on Smartwatches
Dhruv Jain, Hung Ngo, Pratyush Patel, Steven Goodman, Khoa Nguyen, Rachel Grossman-Kahn, Leah Findlater, Jon Froehlich
CACM Research Highlights 2022
[ paper ]
-
A Server-Based Approach for Predictable GPU Access with Improved Analysis
Hyoseung Kim, Pratyush Patel, Shige Wang, Raj Rajkumar
Journal of Systems Architecture 2018
[ paper ]
Workshop Papers and Posters
-
An Agile Pathway Towards Carbon-aware Clouds
Pratyush Patel, Theo Gregersen, Thomas Anderson
HotCarbon 2023
[ paper | slides | talk ]
-
Designing Equitable Data Center Scheduling Systems
Sahana Rangarajan, Xuesi Chen, Pratyush Patel, Sara Mahdizadeh Shahri, Jaylen Wang, Akshitha Sriraman
CWIDCA at MICRO 2022
[ abstract | slides | poster ]
-
Extreme Memoization: Everything in a LUT!
Pratyush Patel, Luis Ceze
WACI at ASPLOS 2020
[ abstract | slides | talk ]
-
μTVM: Deep Learning on Bare-Metal Devices
Logan Weber, Pratyush Patel, Tianqi Chen
ARM Research Summit 2019
[ poster ]
-
μTVM: Deep Learning on Bare-Metal Devices
Pratyush Patel, Tianqi Chen, Luis Ceze
TVM Conference 2018
[ slides | talk | code ]
Theses