2024
- Partiton, Don’t Sort! Compression Boosters for Cloud Data Ingestion Pipelines. Patrick Hansert and Sebastian Michel. VLDB. 2024.
- Learning over Sets for Databases. Angjela Davitkova, Damjan Gjurovski, and Sebastian Michel. EDBT. 2024.
2023
- To UDFs and Beyond: Demonstration of a Fully Decomposed Data Processor for General Data Wrangling Tasks. Nico Schäfer, Damjan Gjurovski, Angjela Davitkova, and Sebastian Michel. Proc. VLDB Endow.. 2023.
- Schema-based Column Reordering for Dremel-encoded Data. Patrick Hansert and Sebastian Michel. Big Data in Emergent Distributed Environments (BiDEDE 2023), co-located with ACM SIGMOD. 2023.
- Tuple Bubbles: Learned Tuple Representation for Tunable Approximate Query Processing. Damjan Gjurovski and Sebastian Michel. Sixth International Workshop on Exploiting Artificial Intelligence Techniques for Data Management (aiDM), co-located with ACM SIGMOD. 2023.
2022
- BETZE: Benchmarking Data Exploration Tools with (Almost) Zero Effort. Nico Schäfer and Sebastian Michel. ICDE. 2022.
- LMKG: Learned Models for Cardinality Estimation in Knowledge Graphs. Angjela Davitkova, Damjan Gjurovski, and Sebastian Michel. EDBT. 2022.
- Ameliorating data compression and query performance through cracked Parquet. Patrick Hansert, and Sebastian Michel. BiDEDE@SIGMOD. 2022.
- Summarizing Edge-Device Data via Core Items. Damjan Gjurovski, Jan Heidemann, and Sebastian Michel. ADBIS. 2022.
2021
- Compressing (Multidimensional) Learned Bloom Filters. Angjela Davitkova, Damjan Gjurovski, and Sebastian Michel. DBAI@NeurIPS. 2021.
- Optimizing Multiple Multi-Way Stream Joins. Manuel Dossinger, and Sebastian Michel. CoRR. 2021.
- Utilizing Delta Trees for Efficient, Iterative Exploration and Transformation of Semi-Structured Contents. Nico Schäfer, and Sebastian Michel. ICDE. 2021.
- Optimizing Multiple Multi-Way Stream Joins. Manuel Dossinger, and Sebastian Michel. ICDE. 2021.
- Natural Joins for Schema-free Data. Damjan Gjurovski, and Sebastian Michel. IEEE BigData. 2021.
2020
- JODA: A Vertically Scalable, Lightweight JSON Processor for Big Data Transformations. Nico Schäfer, and Sebastian Michel. ICDE. 2020.
- Scaling Out Schema-free Stream Joins. Damjan Gjurovski, and Sebastian Michel. ICDE. 2020.
- Partially Materializable Delta Trees for Efficient Data Wrangling of Semi-Structured Contents. Nico Schäfer, and Sebastian Michel. EDBT. 2020.
- Distributed Similarity Joins over Top-K Rankings. Evica Milchevski, and Sebastian Michel. EDBT. 2020.
- The ML-Index: A Multidimensional, Learned Index for Point, Range, and Nearest-Neighbor Queries. Angjela Davitkova, Evica Milchevski, and Sebastian Michel. EDBT. 2020.
2019
- Concept and Computation of Ranking-based Dominance. Kiril Panev, and Sebastian Michel. Inf. Syst.. 2019.
- CLASH: A High-Level Abstraction for Optimized, Multi-Way Stream Joins over Apache Storm. Manuel Dossinger, Sebastian Michel, and Constantin Roudsarabi. SIGMOD Conference. 2019.
- Similarity Search and Data Exploration over Entity Rankings (Ähnlichkeitssuche und Datenexploration über Entitäts-Ranglisten). Sebastian Michel. Grundlagen von Datenbanken. 2019.
- Scaling Out Multi-Way Stream Joins using Optimized, Iterative Probing. Manuel Dossinger, and Sebastian Michel. IEEE BigData. 2019.
2018
- Editorial. Sebastian Michel, Rainer Gemulla, Ralf Schenkel, and Theo Härder. Datenbank-Spektrum. 2018.
- Processing Class-Constraint K-NN Queries with MISP. Evica Milchevski, Fabian Neffgen, and Sebastian Michel. WebDB. 2018.
- Learning interesting attributes for automated data categorization. Koninika Pal, and Sebastian Michel. SSDBM. 2018.
- Exploring Pros and Cons of Ranked Entities with COMPETE. Kiril Panev, and Sebastian Michel. ExploreDB@SIGMOD/PODS. 2018.
- Class-constraint similarity queries. Jéssica Andressa de Souza, Agma J. M. Traina, and Sebastian Michel. SAC. 2018.
2017
- Learning Interesting Categorical Attributes for Refined Data Exploration. Koninika Pal, and Sebastian Michel. CoRR. 2017.
- LSH-Based Probabilistic Pruning of Inverted Indices for Sets and Ranked Lists. Koninika Pal, and Sebastian Michel. WebDB. 2017.
- Scaling Out Continuous Multi-Way Theta-Joins. Manuel Hoffmann, and Sebastian Michel. BeyondMR@SIGMOD. 2017.
- Playing Information LEGO at Large Scale. Sebastian Michel. CIDR. 2017.
- Reverse Engineering Top-k Join Queries. Kiril Panev, Nico Weisenauer, and Sebastian Michel. BTW. 2017.
- Exploring Databases via Reverse Engineering Ranking Queries with PALEO. Kiril Panev, Sebastian Michel, Evica Milchevski, and Koninika Pal. BTW. 2017.
2016
- Exploring Databases via Reverse Engineering Ranking Queries with PALEO. Kiril Panev, Sebastian Michel, Evica Milchevski, and Koninika Pal. Proc. VLDB Endow.. 2016.
- Mining Entity Rankings. Fabian Reinartz, Koninika Pal, and Sebastian Michel. Datenbank-Spektrum. 2016.
- Die Arbeitsgruppen für Datenbanken und Informationssysteme an der TU Kaiserslautern. Stefan Deßloch, Theo Härder, and Sebastian Michel. Datenbank-Spektrum. 2016.
- Efficient Similarity Search across Top-k Lists under the Kendall's Tau Distance. Koninika Pal, and Sebastian Michel. SSDBM. 2016.
- Computing similar entity rankings via reverse engineering of top-k database queries. Kiril Panev, Evica Milchevski, and Sebastian Michel. ICDE Workshops. 2016.
- Playing LEGO with JSON: Probabilistic joins over attribute-value fragments. Manuel Hoffmann, Evica Milchevski, and Sebastian Michel. ICDE Workshops. 2016.
- Reverse Engineering Top-k Database Queries with PALEO. Kiril Panev, and Sebastian Michel. EDBT. 2016.
- A Data Mining Approach to Choosing Categorical Attributes for Ranked Lists. Koninika Pal, and Sebastian Michel. EDBT. 2016.
- Quantifying Likelihood of Change through Update Propagation across Top-k Rankings. Evica Milchevski, and Sebastian Michel. EDBT. 2016.
2015
- ligDB - Online Query Processing Without (almost) any Storage. Evica Milchevski, and Sebastian Michel. EDBT. 2015.
- The Sweet Spot between Inverted Indices and Metric-Space Indexing for Top-K-List Similarity Search. Evica Milchevski, Avishek Anand, and Sebastian Michel. EDBT. 2015.
2014
- An LSH Index for Computing Kendall's Tau over Top-k Lists. Koninika Pal, and Sebastian Michel. CoRR. 2014.
- Tracking set correlations at large scale. Foteini Alvanaki, and Sebastian Michel. SIGMOD Conference. 2014.
2013
- Benchmarking Soundtrack Recommendation Systems with SRBench. Aleksandar Stupar, and Sebastian Michel. CoRR. 2013.
- A thin monitoring layer for top-k aggregation queries over a database. Foteini Alvanaki, and Sebastian Michel. DBRank@VLDB. 2013.
- Automated educated guessing. Aleksandar Stupar, and Sebastian Michel. ICDE Workshops. 2013.
- Scalable, continuous tracking of tag co-occurrences between short sets using (almost) disjoint tag partitions. Foteini Alvanaki, and Sebastian Michel. DBSocial. 2013.
- Interesting event detection through hall of fame rankings. Foteini Alvanaki, Evica Ilieva, Sebastian Michel, and Aleksandar Stupar. DBSocial. 2013.
- SRbench-a benchmark for soundtrack recommendation systems. Aleksandar Stupar, and Sebastian Michel. CIKM. 2013.
- The essence of knowledge (bases) through entity rankings. Evica Ilieva, Sebastian Michel, and Aleksandar Stupar. CIKM. 2013.
2012
- Efficient monitoring of personalized hot news over Web 2.0 streams. Parisa Haghani, Sebastian Michel, and Karl Aberer. Comput. Sci. Res. Dev.. 2012.
- Sequoia - An Approach to Declarative Information Retrieval. Christoph Pinkel, Foteini Alvanaki, and Sebastian Michel. Datenbank-Spektrum. 2012.
- Building and Maintaining Halls of Fame over a Database. Foteini Alvanaki, Sebastian Michel, and Aleksandar Stupar. CoRR. 2012.
- Enhancing Locality Sensitive Hashing with Peek Probing and Nearest Neighbor Links. Aleksandar Stupar, and Sebastian Michel. WebDB. 2012.
- Top-k interesting phrase mining in ad-hoc collections using sequence pattern indexing. Chuancong Gao, and Sebastian Michel. EDBT. 2012.
- See what's enBlogue: real-time emergent topic identification in social media. Foteini Alvanaki, Sebastian Michel, Krithi Ramamritham, and Gerhard Weikum. EDBT. 2012.
- Being picky: processing top-k queries with set-defined selections. Aleksandar Stupar, and Sebastian Michel. CIKM. 2012.
2011
- EnBlogue: emergent topic detection in web 2.0 streams. Foteini Alvanaki, Sebastian Michel, Krithi Ramamritham, and Gerhard Weikum. SIGMOD Conference. 2011.
- Picasso - to sing, you must close your eyes and draw. Aleksandar Stupar, and Sebastian Michel. SIGIR. 2011.
- PICASSO: automated soundtrack suggestion for multi-modal data. Aleksandar Stupar, and Sebastian Michel. CIKM. 2011.
- Tracking Hot-k Items over Web 2.0 Streams. Parisa Haghani, Sebastian Michel, and Karl Aberer. BTW. 2011.
- Efficient Interest Group Discovery in Social Networks using an Integrated Structure/Quality Index. Adriana Budura, Sebastian Michel, and Karl Aberer. BTW. 2011.
2010
- RankReduce - Processing K-Nearest Neighbor Queries on Top of MapReduce. Aleksandar Stupar, Sebastian Michel, and Ralf Schenkel. LSDS-IR@SIGIR. 2010.
- Continuous query evaluation over distributed sensor networks. Oana Jurca, Sebastian Michel, Alexandre Herrmann, and Karl Aberer. ICDE. 2010.
- The gist of everything new: personalized top-k processing over web 2.0 streams. Parisa Haghani, Sebastian Michel, and Karl Aberer. CIKM. 2010.
2009
- Workshop on large-scale distributed systems for information retrieval. Sebastian Michel, Gleb Skobeltsyn, and Wai Gen Yee. SIGIR Forum. 2009.
- Distributed top-k aggregation queries at large. Thomas Neumann, Matthias Bender, Sebastian Michel, Ralf Schenkel, Peter Triantafillou, and Gerhard Weikum. Distributed Parallel Databases. 2009.
- Rethinking email message and people search. Sebastian Michel, and Ingmar Weber. WWW. 2009.
- Knowing When to Slide - Efficient Scheduling for Sliding Window Processing. Ali Salehi, Mehdi Riahi, Sebastian Michel, and Karl Aberer. Mobile Data Management. 2009.
- Environmental Monitoring 2.0. Sebastian Michel, Ali Salehi, Liqian Luo, Nicholas Dawes, Karl Aberer, Guillermo Barrenetxea, Mathias Bavay, Aman Kansal, K. Ashwin Kumar, Suman Nath, Marc Parlange, Stewart Tansley, Catharine van Ingen, Feng Zhao, and Yongluan Zhou. ICDE. 2009.
- Neighborhood-Based Tag Prediction. Adriana Budura, Sebastian Michel, Philippe Cudré-Mauroux, and Karl Aberer. ESWC. 2009.
- Distributed similarity search in high dimensions using locality sensitive hashing. Parisa Haghani, Sebastian Michel, and Karl Aberer. EDBT. 2009.
- Processing publish/subscribe queries over distributed data streams. Oana Jurca, Sebastian Michel, Alexandre Herrmann, and Karl Aberer. DEBS. 2009.
- Evaluating top-k queries over incomplete data streams. Parisa Haghani, Sebastian Michel, and Karl Aberer. CIKM. 2009.
- Top-k Aggegation Queries in Large-Scale Distributed Systems. Sebastian Michel. BTW. 2009.
2008
- The Juxtaposed approximate PageRank method for robust PageRank approximation in a peer-to-peer web search network. Josiane Xavier Parreira, Carlos Castillo, Debora Donato, Sebastian Michel, and Gerhard Weikum. VLDB J.. 2008.
- Making SENSE: socially enhanced search and exploration. Tom Crecelius, Mouna Kacimi, Sebastian Michel, Thomas Neumann, Josiane Xavier Parreira, Ralf Schenkel, and Gerhard Weikum. Proc. VLDB Endow.. 2008.
- Efficiently Handling Dynamics in Distributed Link Based Authority Analysis. Josiane Xavier Parreira, Sebastian Michel, and Gerhard Weikum. WISE. 2008.
- Optimizing Distributed Top-k Queries. Thomas Neumann, Matthias Bender, Sebastian Michel, Ralf Schenkel, Peter Triantafillou, and Gerhard Weikum. WISE. 2008.
- From Web 1.0 to Web 2.0 and back -: how did your grandma use to tag?. Sheila Kinsella, Adriana Budura, Gleb Skobeltsyn, Sebastian Michel, John G. Breslin, and Karl Aberer. WIDM. 2008.
- LSH At Large - Distributed KNN Search in High Dimensions. Parisa Haghani, Sebastian Michel, Philippe Cudré-Mauroux, and Karl Aberer. WebDB. 2008.
- Efficient top-k querying over social-tagging networks. Ralf Schenkel, Tom Crecelius, Mouna Kacimi, Sebastian Michel, Thomas Neumann, Josiane Xavier Parreira, and Gerhard Weikum. SIGIR. 2008.
- Social recommendations at work. Tom Crecelius, Mouna Kacimi, Sebastian Michel, Thomas Neumann, Josiane Xavier Parreira, Ralf Schenkel, and Gerhard Weikum. SIGIR. 2008.
- To tag or not to tag -: harvesting adjacent metadata in large-scale tagging systems. Adriana Budura, Sebastian Michel, Philippe Cudré-Mauroux, and Karl Aberer. SIGIR. 2008.
- Exploiting social relations for query expansion and result ranking. Matthias Bender, Tom Crecelius, Mouna Kacimi, Sebastian Michel, Thomas Neumann, Josiane Xavier Parreira, Ralf Schenkel, and Gerhard Weikum. ICDE Workshops. 2008.
- Sensor Metadata Management and Its Application in Collaborative Environmental Research. Nicholas Dawes, K. Ashwin Kumar, Sebastian Michel, Karl Aberer, and Michael Lehning. eScience. 2008.
- Smooth Interpolating Histograms with Error Guarantees. Thomas Neumann, and Sebastian Michel. BNCOD. 2008.
2007
- p2pDating: Real life inspired semantic overlay networks for Web search. Josiane Xavier Parreira, Sebastian Michel, and Gerhard Weikum. Inf. Process. Manag.. 2007.
- Algebraic query optimization for distributed top-k queries. Thomas Neumann, and Sebastian Michel. Inform. Forsch. Entwickl.. 2007.
- Peer-to-Peer Information Search: Semantic, Social, or Spiritual?. Matthias Bender, Tom Crecelius, Mouna Kacimi, Sebastian Michel, Josiane Xavier Parreira, and Gerhard Weikum. IEEE Data Eng. Bull.. 2007.
- Search for the Best but Expect the Worst - Distributed Top-k Queries over Decreasing Aggregated Scores. Sebastian Michel, and Thomas Neumann. WebDB. 2007.
- P2P Authority Analysis for Social Communities. Josiane Xavier Parreira, Sebastian Michel, Matthias Bender, Tom Crecelius, and Gerhard Weikum. VLDB. 2007.
- Peer-to-Peer Information Search. Sebastian Michel, and Josiane Xavier Parreira. SBBD. 2007.
- P2P Web Search: Make It Light, Make It Fly (Demo). Matthias Bender, Sebastian Michel, Josiane Xavier Parreira, and Tom Crecelius. CIDR. 2007.
- Algebraic Query Optimization for Distributed Top-k Queries. Thomas Neumann, and Sebastian Michel. BTW. 2007.
2006
- Global Document Frequency Estimation in Peer-to-Peer Web Search. Matthias Bender, Sebastian Michel, Peter Triantafillou, and Gerhard Weikum. WebDB. 2006.
- Efficient and Decentralized PageRank Approximation in a Peer-to-Peer Web Search Network. Josiane Xavier Parreira, Debora Donato, Sebastian Michel, and Gerhard Weikum. VLDB. 2006.
- P2P Content Search: Give the Web Back to the People. Matthias Bender, Sebastian Michel, Peter Triantafillou, Gerhard Weikum, and Christian Zimmer. IPTPS. 2006.
- P2P Directories for Distributed Web Search: From Each According to His Ability, to Each According to His Needs. Matthias Bender, Sebastian Michel, and Gerhard Weikum. ICDE Workshops. 2006.
- A Reproducible Benchmark for P2P Retrieval. Thomas Neumann, Matthias Bender, Sebastian Michel, and Gerhard Weikum. ExpDB. 2006.
- IQN Routing: Integrating Quality and Novelty in P2P Querying and Ranking. Sebastian Michel, Matthias Bender, Peter Triantafillou, and Gerhard Weikum. EDBT. 2006.
- A Comparative Study of Pub/Sub Methods in Structured P2P Networks. Matthias Bender, Sebastian Michel, Sebastian Parkitny, and Gerhard Weikum. DBISP2P. 2006.
- Size doesn't always matter: exploiting pageRank for query routing in distributed IR. Josiane Xavier Parreira, Sebastian Michel, and Matthias Bender. P2PIR@CIKM. 2006.
- Discovering and exploiting keyword and attribute-value co-occurrences to improve P2P routing indices. Sebastian Michel, Matthias Bender, Nikos Ntarmos, Peter Triantafillou, Gerhard Weikum, and Christian Zimmer. CIKM. 2006.
2005
- Das MINERVA-Projekt: Datenbankselektion für Peer-to-Peer-Websuche. Matthias Bender, Sebastian Michel, Gerhard Weikum, and Christian Zimmer. Inform. Forsch. Entwickl.. 2005.
- KLEE: A Framework for Distributed Top-k Query Algorithms. Sebastian Michel, Peter Triantafillou, and Gerhard Weikum. VLDB. 2005.
- MINERVA: Collaborative P2P Search. Matthias Bender, Sebastian Michel, Peter Triantafillou, Gerhard Weikum, and Christian Zimmer. VLDB. 2005.
- Improving collection selection with overlap awareness in P2P search engines. Matthias Bender, Sebastian Michel, Peter Triantafillou, Gerhard Weikum, and Christian Zimmer. SIGIR. 2005.
- On the Usage of Global Document Occurrences in Peer-to-Peer Information Systems. Odysseas Papapetrou, Sebastian Michel, Matthias Bender, and Gerhard Weikum. OTM Conferences (1). 2005.
- MINERVAinfinity: A Scalable Efficient Peer-to-Peer Search Engine. Sebastian Michel, Peter Triantafillou, and Gerhard Weikum. Middleware. 2005.
- Database Selection and Result Merging in P2P Web Search. Sergey Chernov, Pavel Serdyukov, Matthias Bender, Sebastian Michel, Gerhard Weikum, and Christian Zimmer. DBISP2P. 2005.
- The MINERVA Project: Database Selection in the Context of P2P Search. Matthias Bender, Sebastian Michel, Gerhard Weikum, and Christian Zimmer. BTW. 2005.
2004
- Bookmark-driven Query Routing in Peer-to-Peer Web Search. Matthias Bender, Sebastian Michel, Gerhard Weikum, and Christian Zimmer. Workshop on Peer-to-Peer Information Retrieval. 2004.
- The MINERVAMinerva is the Roman goddess of science, wisdom, and learning. Project: Towards Collaborative Search in Digital Libraries Using Peer-to-Peer Technology. Matthias Bender, Sebastian Michel, Christian Zimmer, and Gerhard Weikum. DELOS. 2004.
- Towards Collaborative Search in Digital Libraries Using Peer-to-Peer Technology. Matthias Bender, Sebastian Michel, Christian Zimmer, and Gerhard Weikum. DELOS. 2004.