Redundant Array of Independent Disks

Introduzione ai Redundant Array of Indipendent Disks # I RAID ne abbiamo citato per la prima volta in Memoria . Come facciamo a stare su alla velocità del processore se questa va a crescere in modo esponenziale? Parallelizzazione della ricerca! . Ecco perché ci serve raid (oltre…

June 4, 2025 · Reading Time: 4 minutes · By Xuanqiang Angelo Huang

Systems for Artificial Intelligence

At the time of writing, the compute requirements for machine learning models and artificial intelligence are growing at a staggering rate of 200% every 3.5 months. Interest in the area is being quantified as 10k papers per month on the topic, while dollar investments on compute…

June 4, 2025 · Reading Time: 14 minutes · By Xuanqiang Angelo Huang

Cloud Computing Services

Cloud Computing: An Overview # Cloud shifted the paradigm from owning hardware to renting computing resources on-demand. Hardware became a service. Key Players in the Cloud Industry # The cloud computing market is dominated by several major providers, often referred to as the…

June 3, 2025 · Reading Time: 15 minutes · By Xuanqiang Angelo Huang

Cloud Reliability

Reliability is the ability of a system to remain operational over time, i.e., to offer the service it was designed for. Cloud Hardware and software fails. In this note, we will try to find methods to analyze and predict when components fail, and how we can prevent this problem.…

June 3, 2025 · Reading Time: 9 minutes · By Xuanqiang Angelo Huang

Cluster Resource Management

We need to find an efficient and effective manner to allocate the resources around. This is what the resource management layer does. Introduction to the problem # What is Cluster Resource Management? # Most of the time, the user specifies an amount of resources, and then the…

June 3, 2025 · Reading Time: 8 minutes · By Xuanqiang Angelo Huang

Confidential Computing

pWith confidential computing we want to guarantee confidentiality and integrity of a user’s computation running on a remote (cloud) system, including: The program Its inputs and outputs Intermediate state, control flow, etc. Even if do not trust the cloud provider ! Usually it…

June 3, 2025 · Reading Time: 6 minutes · By Xuanqiang Angelo Huang

Container Virtualization

Containers # In this note, we introduce the famous docker containers. We also explore how #Linux Containers are implemented, and some parts of how #Docker works. What is a Container # We have explored Virtual Machines in some past section. Containers do not virtualize…

June 3, 2025 · Reading Time: 6 minutes · By Xuanqiang Angelo Huang

Content Delivery Networks

CDNs are intermediary servers that replicate read intensive data to provide better performance when user requests them. A close relative of CDNs is edge computing (e.g. gaming stations) where lots of computation is done directly close to the user. Types of CDNs # Mainly three…

June 3, 2025 · Reading Time: 3 minutes · By Xuanqiang Angelo Huang

Notions of Security

CIAA principles of security # We have already outlined these principles in Sicurezza delle reti and talked about the concepts of authentication and integrity. Here we try to deepen these concepts and delve a little bit more on the attack vectors. This note mainly focuses on the…

Optimizations for DNN

Mixture of Experts # There is a gate that opens a subset of the experts, and the output is the weighted sum of the outputs of the experts. The weights are computed by a gating network. One problem is load balancing, non uniform assignment. And there is a lot of communication…

June 3, 2025 · Reading Time: 11 minutes · By Xuanqiang Angelo Huang