Luis Ceze
Paul G. Allen School of Computer Science and Engineering
University of Washington
Box 352350
Seattle, WA 98195

Paul G. Allen Center, Room 576


(206) 543-1896 [phone], (206) 616-3804 [fax]

I help run three groups: Sampa on hardware/software systems, saml on machine learning systems and architecture, and MISL on using DNA for information technology applications.

My official UW CSE webpage, and my perpetually semi-up-to-date CV.

An overview of our work on approximate computing can be found here. And try out our language, compiler and benchmarking infrastructure for approximate computing.

Check out the videos from my Hardware/Software Interface class
Take a look at TVM, our recently-released end-to-end stack for deep learning -



I work on the intersection of computer architecture, programming languages, machine learning and biology. My goals are explore new and better ways to build computing systems.

Some selected publications (full list):

"Learning to Optimize Tensor Programs", NIPS'18 (to appear).
"TVM:An Automated End-to-End Optimizing Compiler for Deep Learning", OSDI'18 (to appear).
"Parameter Hub: a Rack-Scale Parameter Server for Distributed Deep Neural Network Training", SoCC'18 (to appear).
"Architecture Considerations for Stochastic Computing Accelerators", CODES'18 (to appear).
"LightDB: A DBMS for Virtual Reality Video", PVLDB'18.
"Troubleshooting Transiently-Recurring Errors in Production Systems with Blame-Proportional Logging", USENIX ATC'18.
"Random access in large-scale DNA data storage", Nature Biotechnology, Cover Feature in Mar'18.
"Application Codesign of Near-Data Processing for Similarity Search", IPDPS'18
"MATIC: Learning Around Erros for Efficient Low-Voltage Neural Network Accelerators", DATE'18, Best Paper Award.
"Correlation Manipulating Circuits for Stochastic", DATE'18
Clustering Billions of Reads for DNA Data Storage", NIPS'17
"Computer Security, Privacy, and DNA Sequencing: Compromising Computers with Synthesized DNA, Privacy Leaks, and More.", Usenix Security'17
"Exploring Computation-Communication Tradeoffs in Camera Systems", IISWC'17
"Customizing Progressive JPEG for Efficient Image Storage", USENIX HotStorage'17
"A Hardware-Friendly Bilateral Solver for Real-Time Virtual Reality Video", HPG'17
"Similarity Search on Automata Processors", IPDPS'17
"VisualCloud Demonstration: A DBMS for Virtual Reality, SIGMOD'17
"WeLight:Augmenting Interpersonal Communication through Connected Lighting, CHI-LBW'17 (try it out!)
"Approximate Storage for Encrypted and Compressed Videos, ASPLOS'17
"Enabling In-network Computation with a Programmable Network Middlebox, ASPLOS'17
"Energy-Efficient Hybrid Stochastic-Binary Neural Networks for Near-Sensor Computing, DATE'17.
"Disciplined Inconsistency with Consistency Types, SOCC'16.
"A DNA-Based Archival Storage System, ASPLOS'16.
"High-Density Image Storage Using Approximate Memory Cells, ASPLOS'16.
"Optimizing Synthesis with Metasketches”, POPL'16.
"Probability Type Inference for Flexible Approximate Programming”, OOPSLA'15.
"Hardware–Software Co-Design: Not Just a Cliche”, SNAPL'15.
"Latency-Tolerant Software Distributed Shared Memory", USENIX ATC'15.
"Debugging and Monitoring Quality in Approximate Programs", ASPLOS 2015.
"SNNAP: Neural Acceleration on Programmable Logic", HPCA 2015.
"Data Provenance Analysis in Multithreaded Programs", CGO 2015.
"Alembic: Automatic Locality Extraction via Migration", OOPSLA 2014.
"Symbolic Execution of Multithreaded Programs from Arbitrary Program Contexts", OOPSLA 2014.
"General-Purpose Code Acceleration with Limited-Precision Analog Computation", ISCA 2014.
"Expressing and Verifying Probabilistic Assertions", PLDI 2014.
"Low-Level Detection of High-Level Data Races with LARD", ASPLOS 2014.
"Integrated 3D-Stacked Server Designs for Increasing Physical Density of Key-Value Stores", ASPLOS 2014.
"Approximate Storage in Solid-State Memories", MICRO 2013.
"EnerJ, the Language of Good-Enough Computing", IEEE Spectrum Feature Article.
"Input-Covering Schedules for Multithreaded Programs", OOPSLA 2013.
"DNA-based Molecular Architecture with Spatially Localized Components", ISCA 2013.
"Cooperative Empirical Failure Avoidance for Multithreaded Programs", ASPLOS 2013.
"DDOS: Taming Nondeterminism in Distributed Systems", ASPLOS 2013.
"Neural Acceleration for General-Purpose Approximate Programs", MICRO 2012 (Selected as IEEE Micro Top Picks and CACM Research Highlights).
"IFRit: Interference-Free Regions for Dynamic Data-Race Detection", OOPSLA 2012.
"RADISH: Always-On Sound and Complete Race Detection in Software and Hardware", ISCA 2012.
"Architectural Support for Disciplined Approximate Programming", ASPLOS 2012.
"EnerJ: Approximate Data Types for Safe and General Low-Power Computation", PLDI 2011.
"Isolating and Understanding Concurrency Errors Using Reconstructed Execution Fragments", PLDI 2011.
"Operating System Implications of Fast, Cheap, Non-Volatile Memory", Usenix HotOS 2011.
"RCDC: A Relaxed Consistency Deterministic Computer", ASPLOS 2011.
"Checked Load: Architectural Support for JavaScript Type-Checking on Mobile Processors Authors", HPCA 2011.
"A Limit Study of JavaScript Parallelism", IISWC 2010.
"Composable Specifications for Structured Shared-Memory Communication", OOPSLA 2010.
"Deterministic Process Groups in dOS", OSDI 2010.
"Conflict Exceptions: Providing Simple Concurrent Language Semantics with Precise Hardware Exceptions", ISCA 2010.
"ColorSafe: Architectural Support for Debugging and Dynamically Avoiding Multi-variable Atomicity Violations", ISCA 2010.
"CoreDet: A Compiler and Runtime System for Deterministic Multithreaded Execution", ASPLOS 2010.
"Finding Concurrency Bugs with Context-Aware Communication Graphs", MICRO 2009.
"The Case for System Support for Concurrency Exceptions", USENIX HotPar 2009.
"DMP: Deterministic Shared Memory Multiprocessing", ASPLOS 2009. (Selected for the IEEE Micro Top Picks 2009).
"Atom-Aid: Detecting and Surviving Atomicity Violations", ISCA 2008. (Selected for the IEEE Micro Top Picks 2008).

If you need a good architecture simulator, take a look at SESC, a very fast multiprocessor simulator. And here is a good way of choosing your next architecture or compiler conference.

We have released our Grappa runtime system for large-scale irregular applications (e.g., graph analytics), take a look!
We have also recently started publicizing, and ACCEPT, a set of tools and benchmarks for approximate computing research.

Take a look at the collectively written white-paper on 21st Century computer architecture research.

I am lucky to have my research supported by Microsoft, Intel, Google, Qualcomm, NetApp, NSF, and the Pacific Northwest National Laboratory. I am also a member of the Center for Future Architectures Research.

I received an NSF CAREER Award and a Microsoft Research Faculty Fellowship to develop ideas on deterministic multiprocessing and multicore programmability in general. I am also the 2013 recipient of the IEEE TCCA Young Computer Architect Award.


CSE590G Architecture Seminar (ongoing)
CSE351 The Hardware/Software Interface, Winter 2015
CSE548 Computer Systems Architecture, Fall 2014
Coursera The Hardware/Software Interface, Spring 2013. Summer 2014.
CSE351 The Hardware/Software Interface, Spring 2013
CSEP548 Computer Systems Architecture (PMP), Fall 2012
CSE351 The Hardware/Software Interface, Fall 2011
CSE351 The Hardware/Software Interface, Spring 2011
CSEP548 Computer Systems Architecture, Winter 2011
CSE378 Machine Organization & Assembly Language, Spring 2010
CSE548 Computer Systems Architecture, Winter 2010
CSEP548 Computer Systems Architecture (PMP), Spring 2009
CSE378 Machine Organization & Assembly Language, Winter 2009
CSE599Q Topics in Multiprocessor Programmability, Spring 2008
CSE590P Programming Systems Seminar, Winter 2008 (with Dan Grossman)
CSE548 Computer Systems Architecture, Winter 2008
CSE378 Machine Organization & Assembly Language, Fall 2007


I have the pleasure of working with the following incredible graduate students:

James Bornholt (primarily advised by Emina Torlak)
Tianqi Chen (w/ Carlos Guestrin)
Vincent Lee (w/ Mark Oskin)
Ming Liu (primarily advised by Arvind Krishnamurthy)
Liang Luo
Amrita Mazumdar
Thierry Moreau
Lee Organick (w/ Karin Strauss)
Kendall Stewart(w/ Karin Strauss and Georg Seelig)
Eddie Yan
Max Willsey
Meghan Cowan
Luis Vega
Melissa Queen

Armin Alaghi (postdoc, now at Oculus Research)
Djordje Jevdjic (postoc, now faculty atu SNU)
Artem Miniyaylov
Brandon Holt (PhD'16, now at Apple)
Adrian Sampson (PhD'15, now faculty at Cornell)
Ben Wood (PhD'15, now faculty at Wellesley)
Jacob Nelson (PhD'14, now at MSR)
Ben Ransford (now co-founder of Virta Labs)
Peter Hornyack (MS'14, now at Google)
Andre Baixo (MS'15, now at ARM)
Katelin Bailey (MS'15, now at Microsoft)
Tom Bergan (PhD'14, now at Google)
Mike Ringenburg (PhD'14, now at Cray)
Nick Hunt (MS'12)
Hadi Emaseilzadeh (PhD'13, now faculty at Georgia Tech)
Brandon Lucia (PhD'13, now faculty at CMU)
Joe Devietti (PhD'12, now faculty at UPenn)
Emily Fortuna (MS'11, first employment: Google)
Owen Anderson (MS'10, first employment: Apple)
Julian Knutsen (BS'10, first employment: Corensic)
Angda (Andy) Chen (BS'08 Honors, first employment: Amazon)
Danushen Gnanapragasam
Andrew Morgan

About me

I was born in São Paulo, Brazil.

I received my PhD in Computer Science from University of Illinois at Urbana-Champaign. I got my BEng and MEng in Electrical Engineering from University of São Paulo, Brazil.

Self portraits.

I love to cook and eat.

I am very fortunate to have such a happy family.

My (much smarter than me) brother was freezing in Michigan but having fun with aerospace engineering, now he is in sunny California at NASA AMES..

I am always happy because she exists.