We are developing an end-to-end system for validating scientific claims against open data repositories using NLP, machine learning, and data integration techniques.
We are studying the technical foundations for responsible data science, including fair machine learning, semi-synthetic private data, data governance, automatic metadata attachment and curation, and...
Building on our data science incubator program and the University of Chicago's Data Science for Social Good program, we ran an interdisciplinary summer program for...
Part of the Myria project, RACO (the Relational Algebra COmpiler) is a polystore middleware system that provides query translation, optimization, and orchestration across complex multi-system...
Working at the intersection of network science, databases, and high-performance computing, we developed a series of novel parallel algorithms based on Infomap serial graph clustering...
We have developed algorithms, methods, systems, and applications in support of the SeaFlow project in the Armbrust Lab in the UW Department of Oceanography.
VizDeck recommends visualizations based on the statistical properties of the data, tempered by perception heuristics. Dashboards are assembled through a card-game UI.
The Horizon project was one of the early efforts to study the limits of Hadoop for complex analytics. We developed Hadoop algorithms for visualization, machine...