Modern Distributed Systems

Brought by: edX

Overview

Modern IT infrastructure is built as distributed systems, an exciting concept that started with the first computers and evolved rapidly into its present form. From online video meetings to internet services, from social media platforms to online games, we all use and interact with distributed systems on a daily basis and increasingly depend on them. Designing and operating such large-scale distributed systems, however, is complex and typically involves making reasonable compromises. There are fundamental technical barriers as well as economic arguments why we cannot make these systems behave as if they were running on a single, perfectly reliable machine..

In this course, learners will be introduced to the essential functional and non-functional concerns of distributed systems and the common problems encountered while designing them, such as consistency, availability, elasticity, and scalability. A variety of practical solutions that have been established in the leading tech industry in recent years will be reviewed. These provide re-usable building blocks to create new large-scale applications. These recent developments, especially around cloud computing, large-scale data processing, distributed machine learning, and other fields are often not reflected in textbooks and are absent from many traditional curricula but are at the heart of this course.

The course will therefore provide learners with the fundamental understanding (theoretical and practical foundations) of how cloud, edge, and big data processing systems work and how they address common challenges for distributed systems such as performance, resilience, and scalability.

The learning progress is assessed through a variety of different activities including quizzes, design exercises, experiments, and open questions, with peer review of other students’ solutions. In the final project, learners will design a distributed system based on the learners’ own experience and interests and describe the functional and non-functional properties of the system.

Syllabus

Module 1: Introduction to Distributed Systems

● Parallel vs. Distributed Systems

● Challenges in Distributed Systems

● The CAP Theorem

● Example 1: Online Gaming

● Example 2: Scientific Computing

Module 2: Functional Requirements

● Functional vs. Non–Functional Properties

● Naming

● Replication

● Consistency

● Consensus

Module 3: Non-Functional Requirements

● Importance of Performance

● Measuring NFRs: Metrics

● Scalability and Elasticity

● Amdahl’s Law

● Gustafson’s Law

● Benchmarking

Module 4: Resource Management and Scheduling

● Scheduling in the Small: Processor Scheduling

● Scheduling in the Large: Scheduling for Distributed Systems

● Workloads in DS

● Centralized Schedulers: Kubernetes, etc.

● Decentralized Schedulers: (HTCondor) etc.

● Portfolio Scheduling

Module 5: System Architectures and Programming Models

● Trade-offs between SAs and PMs

● Programming Models

● System Architectures: communication, big data, machine learning

● Layering

Module 6: Distributed Ecosystems

● Introduction to massive processing

● Theory of ecosystems

● Super-distribution principle

● Distributed ecosystems in science and engineering: cloud, edge, big data

● Distributed ecosystems in online gaming

● The future of distributed ecosystems

Taught by

Alexandru Iosup and Jan Rellermeyer

Modern Distributed Systems
Go to course

Modern Distributed Systems

Brought by: edX

  • edX
  • Free
  • English
  • Certificate Available
  • Certain days
  • intermediate
  • English