Causality in Databases
Description
When analyzing data sets, users are often interested in the causes of
their observations: "What caused my personalized newsfeed to contain
more than 10 items related to volcanos?", "Why can't I find any
flights with my search criteria?". Database research that addresses
these or similar questions is mainly work on lineage of query results,
such as why or where provenance, and very recently, explanations for
non-answers. While these approaches differ over what the response to
questions should be, all of them seem to be linked through a common
underlying theme: understanding causal relationships in databases.
Causal relationships cannot be explicitly modeled in current database
systems, which offer no specific support for such queries. Mining
techniques can infer statistically significant data patterns but they
are not sufficient to draw conclusions, as correlation does not
necessarily imply causation. The goal of this project is to extend the
capabilities of current database systems by incorporating to them
causal reasoning. This will allow databases to model causal
dependencies, and users to issue queries that can interpret them to
provide explanations for their observations. Starting from the very
basic functionality of justifying the presence or absence of results
for a given query, causality-enabled databases can find many practical
applications. For an intuitive introduction and several motivating
examples see the Data Bulletin article.
A former member of the project, Alexandra Meliou, has a nice Website
on this project
http://people.cs.umass.edu/~ameli/projects/causality/
Supported by:
NSF IIS-0911036
Members
Paul Beame,
JerryLi,
Sudeepa Roy,
Dan Suciu,
Alexandra Meliou,
Wolfgang Gatterbauer,
Kate Moore,
Joe Halpern,
Christoph Koch,
Web Page:
http://homes.cs.washington.edu/~suciu/project-causality.html
Publications
Paul Beame, JerryLi, Sudeepa Roy, Dan Suciu,
Model Counting of Query Expressions: Limitations of Propositional Methods
In ICDT, 2014
Alexandra Meliou, Sudeepa Roy, Dan Suciu,
Causality and Explanations in Databases
Published in PVLDB, vol. 7 , no. 13 , pp. 1715--1716 , 2014
Sudeepa Roy, Dan Suciu,
A formal approach to finding explanations for database queries
In International Conference on Management of Data, SIGMOD 2014, Snowbird, UT, USA, June 22-27, 2014, pp. 1579--1590, 2014
Alexandra Meliou, Dan Suciu,
Tiresias: the database oracle for how-to queries
In SIGMOD Conference, pp. 337-348, 2012
Alexandra Meliou, Wolfgang Gatterbauer, Dan Suciu,
Tracing data errors with view-conditioned causality
In SIGMOD Conference, pp. 505-516, 2011
Alexandra Meliou, Wolfgang Gatterbauer, Dan Suciu,
Reverse Data Management
Published in PVLDB, vol. 4 , no. 12 , pp. 1490-1493 , 2011
Alexandra Meliou, Wolfgang Gatterbauer, Kate Moore, Dan Suciu,
Why so? or Why no? Functional Causality for Explaining Query Answers
In MUD, 2010
Alexandra Meliou, Wolfgang Gatterbauer, Joe Halpern, Christoph Koch, Kate Moore, Dan Suciu,
Causality in Databases
Published in Data Engineering Bulletin, vol. 33 , no. 3 , 2010
Alexandra Meliou, Wolfgang Gatterbauer, Kate Moore, Dan Suciu,
The Complexity of Causality and Responsibility for \\Query Answers and non-Answers
Published in PVLDB, 2010