Seminar Organization

The seminar is jointly organized with Prof. Deßloch’s group.

Below are a few general information:

Registration

All CS department seminar registrations are now handled via OLAT courses. Please register there and choose our seminar(s).

OLAT course for Bachelor’s seminar registration: https://olat.vcrp.de/url/RepositoryEntry/5151064318

OLAT course for Master’s seminar registration: https://olat.vcrp.de/url/RepositoryEntry/5151064324

Prerequisites

  • Participants should have successfully attended the core lecture Datenbanksysteme (database systems) or equivalent.
  • Having attended the beginner’s course Informationssysteme (information systems), or equivalent, is assumed anyway.

Timeline

Remember the following deadlines if you got a slot in the seminar:

  • 16.04.26 Paper Selection Deadline
  • 17.04.26 10:15 Seminar Kickoff in 36-336

Papers

Look further than only titel and abstract before deciding for a paper, since some papers might demand math or deeper ML skills you might find difficult. You have to be in the University’s VPN (full tunnel) for accessing some of the papers.

Bachelor

# Paper Title Weblink
1 HyPer: A Hybrid OLTP&OLAP Main Memory Database System Based on Virtual Memory Snapshots https://cs.brown.edu/courses/cs227/archives/2012/papers/olap/hyper.pdf
2 Building Stateless Serverless Vector DBs via Block-based Data Partitioning https://dl.acm.org/doi/10.1145/3769769
3 The Ubiquitous Skiplist: A Survey of What Cannot Be Skipped About the Skiplist and Its Applications in Data Systems https://dl.acm.org/doi/10.1145/3736754
4 Dataset Discovery in Data Lakes https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9101607
5 SQLStorm: Taking Database Benchmarking into the LLM Era https://www.vldb.org/pvldb/vol18/p4144-schmidt.pdf
6 Improving Unnesting of Complex Queries https://dl.gi.de/bitstreams/c1918e8c-6a87-4da2-930a-bfed289f2388/download
7 Nested Parquet Is Flat, Why Not Use It? How To Scan Nested Data With On-the-Fly Key Generation and Joins https://doi.org/10.1145/3725329
8 Towards Principled, Practical Document Database Design https://doi.org/10.14778/3750601.3750606

Master

# Paper Title Weblink
1 Extending SQL to Return a Subdatabase https://dl.acm.org/doi/10.1145/3725291
2 How to Optimize SQL Queries? A Comparison Between Split, Holistic, and Hybrid Approaches https://www.vldb.org/pvldb/vol18/p3910-gretscher.pdf
3 Are Database System Researchers Making Correct Assumptions about Transaction Workloads? https://dl.acm.org/doi/pdf/10.1145/3725268
4 Hadoop GIS: A High Performance Spatial Data Warehousing System over MapReduce https://www.vldb.org/pvldb/vol6/p1009-aji.pdf
5 DiskJoin: Large-scale Vector Similarity Join with SSD https://dl.acm.org/doi/10.1145/3769780
6 Retrieve-and-Verify: A Table Context Selection Framework for Accurate Column Annotations https://dl.acm.org/doi/10.1145/3769823
7 QuIT Your B+-Tree for the Quick Insertion Tree https://openproceedings.org/2025/conf/edbt/paper-118.pdf
8 Cleaning Time Series under Seasonal and Trend Constraints https://dl.acm.org/doi/10.1145/3769773
9 LogLite: Lightweight Plug-and-Play Streaming Log Compression https://doi.org/10.14778/3749646.3749652
10 GpJSON: High-Performance JSON Data Processing on GPUs https://doi.org/10.14778/3746405.3746439
11 DBCopilot: Natural Language Querying over Massive Databases via Schema Routing https://openproceedings.org/2025/conf/edbt/paper-209.pdf
12 CDA: Cost-Sensitive Data Acquisition for Incomplete Datasets https://ieeexplore.ieee.org/abstract/document/11112862