Seminar on Recent Developments in Databases and Information Systems

To attend this seminar, students need to register according to the registration procedure explained below. After the end of the registration process, participants are selected and informed. Each selected student will get assigned one research paper that needs to be presented in a talk at the end of the semester. Finally, also a short report about the presented paper needs to be submitted, the latest two weeks after the presentation.

Prerequisites

  • Participants should have successfully attended the core lecture Datenbanksysteme (database systems) or equivalent.
  • Having attended the beginner’s course Informationssysteme (information systems), or equivalent, is assumed anyway.

Log:

  • 2025-03-21 Page created
  • 2025-03-28 List of papers and tentative schedule added, instructions updated

Dates and Deadlines:

  • 16.04.2025 Registration Deadline
  • 28.-30.04.2025 Kickoff (exact date tbd.)
  • 01.06.2025 Initial Report Deadline
  • 06.07. No-Left-ToDo Slides
  • 28.07.-01.08 Presentations (exact date tbd.)
  • 2 weeks after presentations: Final Report Deadline

Information on Registration Process:

  • This seminar is offered in the summer semester of 2025.
  • The number of participants is limited.
  • Registration is not done on a first-come, first-served basis.
  • In order to register, download this JSON template registration file rename it to yourmatriculationnumber.json, edit it to reflect your information, and send it as an attachment via email to Patrick Hansert (patrick.hansert@cs.rptu.de). Make sure the file is valid JSON and ASCII or UTF-8 encoded, the latter without byte order mark. Use the paper ids from the following table to specify your preference for at least four papers. Please use your official university email account @cs.uni-kl.de or @student.uni-kl.de or @rhrk.uni-kl.de or @rptu…. to register and to send the email.
  • Registration due 16.04.2025
  • Soon after the end of the registration, we will let you know whether or not you got a slot in the seminar.

Papers:

Look further than only titel and abstract before deciding for a paper, since some papers might demand math or deeper ML skills you might find difficult. You have to be in the University’s VPN (full tunnel) for accessing some of the papers.

Bachelor

# Paper Title Weblink
1 SQL Has Problems. We Can Fix Them: Pipe Syntax In SQL https://doi.org/10.14778/3685800.3685826
2 Adaptive sorting for large keys, strings, and database rows https://doi.org/10.18420/BTW2025-10
3 CHORUS: Foundation Models for Unified Data Discovery and Exploration https://www.vldb.org/pvldb/vol17/p2104-kayali.pdf
4 Gen-T: Table Reclamation in Data Lakes https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10597787
5 FUDJ: Flexible User-Defined Distributed Joins https://ieeexplore.ieee.org/document/10598050
6 Access path selection in a relational database management system https://doi.org/10.1145/582095.582099
7 Graph-based QSS: A Graph-based Approach to Quantifying Semantic Similarity for Automated Linear SQL Grading https://dl.gi.de/items/95d1dba7-1610-4232-9e2b-a8bc5af83513

Master

# Paper Title Weblink
1 Dynamic Data Layout Optimization with Worst-Case Guarantees https://doi.org/10.1109/icde60146.2024.00327
2 BtrBlocks: Efficient Columnar Compression for Data Lakes https://doi.org/10.1145/3589263
3 QueryArtisan: Generating Data Manipulation Codes for Ad-hoc Analysis in Data Lakes https://www.vldb.org/pvldb/vol18/p108-yao.pdf
4 MultiEM: Efficient and Effective Unsupervised Multi-Table Entity Matching https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10598123
5 Relational Algorithms for Top-k Query Evaluation https://dl.acm.org/doi/10.1145/3654971
6 Optimizing Disjunctive Queries with Tagged Execution. https://dl.acm.org/doi/10.1145/3654961
7 Cornus: atomic commit for a cloud DBMS with storage disaggregation https://dl.acm.org/doi/10.14778/3565816.3565837
8 The Key to Effective UDF Optimization: Before Inlining, First Perform Outlining https://doi.org/10.14778/3696435.3696436
9 SlabCity: Whole-Query Optimization using Program Synthesis https://dl.acm.org/doi/10.14778/3611479.3611515