“Predictive Load-Balancing: Unfair But Faster & More Robust” by Steve Gury

Client-Side Load Balancing Methods

Random Selection

Round Robin

Least Loaded

Load Balancing Troubles

Servers Are Not All The Same

Thudering Herd

Outliers

Multiple Clients with Independent States

All These Methods Have Issues

predictive_load_balancing_matrix.png

Latency-Based Load-Balancing

Load = Predicted_Latency * (#requests + 1)

Predicted Latency

Issues and Solutions

Reacting to Latency Quickly

Not Perfect