III: Small: BeliefDB - Adding Belief Annotations to Databases


The goal of this project is the design and implementation of a new data annotation model. In this model, annotations allow users of a shared data repository to retain different possible "belief worlds" of what should be in the database (with partly overlapping and conflicting information) and to engage in structured argumentation with one another about both the content and the annotations. The key idea is to give annotations a clearly defined semantics so that a database can understand them and manage them efficiently. The project studies data models, language design, algorithms for conflict resolution, and query evaluation on uncertain or inconsistent data.

Our motivation comes from many large-scale scientific data applications today, in which a community of users works together to assemble, revise, and curate a shared repository of data. The traces that individual users leave in such databases are commonly known as annotations. Examples of such collaborations include curated protein databases, where users store known protein functions and comment on the stored data, and registries of animal sightings, where scientists strive to keep track of animal populations by having volunteers register animal sightings. As the community accumulates knowledge and the database content evolves over time, the database may come to contain conflicting information, and members may disagree about what it should store. In these instances, the database serves not only as a repository for data, but also as a means of communication within the community.

Supported by:

NSF IIS-0915054


Paraschos Koutris,
Dan Suciu,
Paul Beame,
Sudeepa Roy,
Wolfgang Gatterbauer,
Daniel Li,
Gerome Miklau,
Prasang Upadhyaya,
Magdalena Balazinska,
Bill Howe,
Nodira Khoussainova,
Alexandra Meliou,
Abhay Jha

Web Page:



