Download: PDF, slides (PDF), slides (ODP), Regex Checker implementation.
“A type system for regular expressions” by Eric Spishak, Werner Dietl, and Michael D. Ernst. In FTfJP: 14th Workshop on Formal Techniques for Java-like Programs, (Beijing, China), June 2012, pp. 20-26.
Regular expressions are used to match and extract text. It is easy for developers to make syntactic mistakes when writing regular expressions, because regular expressions are often complex and different across programming languages. Such errors result in exceptions at run time, and there is currently no static support for preventing them.
This paper describes practical experience designing and using a type
system for regular expressions. This type system validates regular
expression syntax and capturing group usage at compile time instead of at
run time — ensuring the absence of
PatternSyntaxException
s from invalid syntax and
IndexOutOfBoundsException
s from accessing invalid capturing
groups.
Our implementation is publicly available and supports the full Java language. In an evaluation on five open-source Java applications (480kLOC), the type system was easy to use, required less than one annotation per two thousand lines, and found 56 previously-unknown bugs.
Download: PDF, slides (PDF), slides (ODP), Regex Checker implementation.
BibTeX entry:
@inproceedings{SpishakDE2012, author = {Eric Spishak and Werner Dietl and Michael D. Ernst}, title = {A type system for regular expressions}, booktitle = {FTfJP: 14th Workshop on Formal Techniques for Java-like Programs}, pages = {20--26}, address = {Beijing, China}, month = jun, year = {2012} }
(This webpage was created with bibtex2web.)
Back to Michael Ernst's publications.