Database Research Group

WSI – Database Systems Research Group

Language-Level Provenance Analysis of SQL


This thesis is about Data Provenance Analysis at the SQL language level.

Data Provenance for SQL is currently one of our group's major research topics. We want you to implement and analyze a recently developed approach that embeds a variant of Data Provenance Analysis directly into SQL queries.

The approach builds on a two-step analysis technique that has been described in recent publications. In that earlier research, we first compiled SQL into imperative code and then ran the provenance analysis. What is new in this topic is that the compilation step is omitted and the analysis stays at the SQL language level.

This means that your implementation (preferably in Haskell) will transform an input SQL query into two new SQL queries. The first query produces execution logs, while the second derives the actual data provenance (see Figure 4 in the paper mentioned above). The execution environment for these generated queries is the DBMS itself.
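
To give a rough idea of the shape such a transformation could take, here is a minimal Haskell sketch. It is not the technique from the publications: the `Query` type, the `prov_log` table, the `rid` row-identifier column, and the functions `phase1`, `phase2`, and `transform` are all hypothetical names chosen for illustration. The sketch only handles queries of the form SELECT ... FROM ... WHERE ... and splices strings together without any escaping. It shows nothing more than the interface: one input query goes in, two SQL query strings come out.

```haskell
import Data.List (intercalate)

-- A deliberately tiny query shape: SELECT columns FROM table WHERE predicate.
-- Hypothetical; a real implementation would operate on a full SQL AST.
data Query = Query
  { qColumns   :: [String]  -- output columns
  , qTable     :: String    -- single input table
  , qPredicate :: String    -- raw SQL boolean expression
  }

-- Name of the log table shared by both generated queries (an assumption).
logTable :: String
logTable = "prov_log"

-- Wrap a string in single quotes to form a SQL string literal (no escaping).
quote :: String -> String
quote s = "'" ++ s ++ "'"

-- First generated query: evaluates the original predicate and logs the
-- identifiers of the qualifying input rows. Assumes a column named "rid".
phase1 :: Query -> String
phase1 q = unwords
  [ "INSERT INTO", logTable, "(rid)"
  , "SELECT rid FROM", qTable q
  , "WHERE", qPredicate q
  ]

-- Second generated query: reads the log and reports, per logged row,
-- which table and which columns the output was taken from.
phase2 :: Query -> String
phase2 q = unwords
  [ "SELECT rid AS source_row,"
  , quote (qTable q) ++ " AS source_table,"
  , quote (intercalate "," (qColumns q)) ++ " AS source_columns"
  , "FROM", logTable
  ]

-- The overall transformation: one input query, two generated SQL queries.
transform :: Query -> (String, String)
transform q = (phase1 q, phase2 q)

-- A sample input query used for experimentation.
example :: Query
example = Query ["name"] "employees" "salary > 1000"
```

Loading this file in GHCi and evaluating `mapM_ putStrLn [fst (transform example), snd (transform example)]` prints the two generated queries. Both are plain SQL text, so they could be handed to the DBMS for execution, which is the property the topic description emphasizes.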

Please meet us in person for a detailed explanation.