Understanding Distributed Systems in Depth

Google https://zhuanlan.zhihu.com/p/338161224

FlumeJava: Easy, Efficient Data-Parallel Pipelines

Tenzing: A SQL Implementation On The MapReduce Framework

MillWheel: Fault-Tolerant Stream Processing at Internet Scale

Pregel: a system for large-scale graph processing

MapReduce: Simplified Data Processing on Large Clusters

Large-scale Incremental Processing Using Distributed Transactions and Notifications (Percolator)

Processing a Trillion Cells per Mouse Click (PowerDrill)

Megastore: Providing Scalable, Highly Available Storage for Interactive Services

Spanner: Google's Globally-Distributed Database

Bigtable: A Distributed Storage System for Structured Data

Dapper, a Large-Scale Distributed Systems Tracing Infrastructure

The Google File System

Colossus: Successor to the Google File System (GFS)

CPI²: CPU performance isolation for shared compute clusters

The Chubby lock service for loosely-coupled distributed systems

Large-scale cluster management at Google with Borg

Omega: flexible, scalable schedulers for large compute clusters