Cloud Storage

Object Stores: Characteristics of Cloud Systems. Object storage design principles. We don't want the hierarchy that is common in filesystems, so we simplify it and adopt these four principles: black-box objects; a flat and global key-value model (trivial model, easy to…

June 7, 2025 · Reading Time: 20 minutes · By Xuanqiang Angelo Huang

Green Computing

The cloud is inefficient, and there appears to be a lot of room for improvement. Computer science and its systems have reached industrial scale and can be compared to airports, highways, and metro systems in terms of public infrastructure; yet, due to their immaterial…

June 6, 2025 · Reading Time: 5 minutes · By Xuanqiang Angelo Huang

Virtual Machines

The fundamental idea behind a virtual machine is to abstract the hardware of a single computer (the CPU, memory, disk drives, network interface cards, and so forth) into several different execution environments, thereby creating the illusion that each separate environment is…

June 6, 2025 · Reading Time: 14 minutes · By Xuanqiang Angelo Huang

Communication in the Cloud

How can we coordinate services so that they actually understand what they are doing, or what the user wants them to do? How do we manage network errors? This note mainly focuses on the high-level protocols used to coordinate this kind of communication. Remote Procedure Calls…

June 4, 2025 · Reading Time: 8 minutes · By Xuanqiang Angelo Huang

Datacenter Hardware

We want to optimize the datacenter hardware so that the cost of operating the datacenter is lower, which means we need to think about it as a whole. Datacenter CPUs: Desktop CPU vs Cloud CPU. Isolation: Desktop CPUs have low isolation, they are used by a…

June 4, 2025 · Reading Time: 21 minutes · By Xuanqiang Angelo Huang

Redundant Array of Independent Disks

Introduction to Redundant Arrays of Independent Disks. We first mentioned RAID in Memoria. How can we keep up with the speed of the processor if it keeps growing exponentially? By parallelizing the lookup! This is why we need RAID (besides…

June 4, 2025 · Reading Time: 4 minutes · By Xuanqiang Angelo Huang

Systems for Artificial Intelligence

At the time of writing, the compute requirements for machine learning models and artificial intelligence are growing at a staggering rate of 200% every 3.5 months. Interest in the area is quantified at 10k papers per month on the topic, while dollar investments in compute…

June 4, 2025 · Reading Time: 14 minutes · By Xuanqiang Angelo Huang

Cloud Computing Services

Cloud Computing: An Overview. The cloud shifted the paradigm from owning hardware to renting computing resources on demand. Hardware became a service. Key Players in the Cloud Industry: The cloud computing market is dominated by several major providers, often referred to as the…

June 3, 2025 · Reading Time: 15 minutes · By Xuanqiang Angelo Huang

Cloud Reliability

Reliability is the ability of a system to remain operational over time, i.e., to offer the service it was designed for. Cloud hardware and software fail. In this note, we look for methods to analyze and predict when components fail, and for ways to prevent such failures.…

June 3, 2025 · Reading Time: 9 minutes · By Xuanqiang Angelo Huang

Cluster Resource Management

We need an efficient and effective way to allocate resources. This is what the resource management layer does. Introduction to the problem: What is Cluster Resource Management? Most of the time, the user specifies an amount of resources, and then the…

June 3, 2025 · Reading Time: 8 minutes · By Xuanqiang Angelo Huang