The lecture will be held as flipped classroom (like database systems). The Q&A session for the lecture will be on Tuesday 8:15 - 9:45 (alternating with IRDM lecture) at the university, the biweekly exercise session will be on Thursday 15:30-17:00. More details follow soon.

Here is the link to the Mattermost chat for this course: “https://teamwork.rarp-kl.de/signup_user_complete/?id=wdk8rgxqi3bobmojrf7fawx98w

Here is the link to the OLAT lecture (The OLAT password is provided in Mattermost): https://olat.vcrp.de/auth/RepositoryEntry/4105175534/CourseNode/93096824812783

Contents

This course addresses fundamental concepts of distributed data management. Emphasis is put on novel approaches/paradigms to managing Big Data. The course aims at a mixture of system issues and hands on experience (like Hadoop/HDFS) and on fundamental algorithms and techniques (such as consistent hashing or Bloom filters).

  • Big Data, Cloud Computing
  • MapReduce (Hadoop, HDFS, …)
  • Various algorithms on top of MR
  • NoSQL Stores (MongoDB, Amazon Dynamo, Riak, …)
  • (State Machine) Replication, Paxos
  • (Eventual) Consistency Models
  • Synopses: Bloomfilter, count-min sketch, KMV, …
  • Distributed Data Stream Processing: STREAM, Storm, …
  • Gossip protocols, consistent hashing

Regulations

Please read carefully. They might slightly change before start of the teaching, e.g., regarding submission, points etc.

Students need to successfully participate in the exercise sessions, according to the regulations below, in order to get admitted to the final exam.

  • There will be 6 exercise sheets.
  • The teaching assistant presents the solutions and answers questions.
  • There is no mandatory attendance of the exercise sessions; still, we would be happy to see a lively participation.
  • Each sheet consists of 3 assignments on average, which makes 18 assignments in total. Each assignment is equivalent to one point.
  • A student needs to reach a total of at least 13 points throughout the semester to qualify for the final exam.
  • Solutions to exercise sheets have to be submitted in OLAT.
  • Students can work alone or in groups of max. two. In order to submit solutions you have to enrol to a group in the OLAT course.
  • Students need to mark in OLAT which individual assignments they have managed to solve correctly (individually, not per group).
  • Students can only mark an assignment as solved, if they have managed to fully complete the assignment in a reasonable manner. In other words, for every part of the assignment the submitted solution has to present an in-depth approach, which does not necessarily have to include every detail of a correct solution.
  • In addition, only if the solution of an assignment is done correctly to an extent of ½ or more, the point for that assignment will be given.
  • If it is obvious that the mark has been placed in a dishonest attempt to obtain a point without proper engagement with the assignment, the entire sheet is assessed with zero points. For instance, if the marked exercise or parts of it are not done at all.
  • Copying solutions from other groups or taking solutions from previously published solution sheets, if clearly identifiable, will cause all involved groups to get immediately disqualified from the course, independent of the number of points accomplished regularly.

People

Prof. Dr.-Ing. Sebastian Michel

profile picture of Prof. Dr.-Ing. Sebastian Michel

Head of Research Group

Damjan Gjurovski

profile picture of Damjan Gjurovski

PhD Student