Cloud Responsiveness

Critical mechanisms to ensure responsiveness, improve performance, and effectively handle fluctuating workloads:

Load balancing
- Distributes network traffic across multiple servers or services to improve performance and provide high availability
- are intermediaries (proxies) between users and back-end resources
- distribute incoming requests to different resources using sophisticated algorithms and handle server capacity, response time, and workload
Edge computing
- Optimizes the geographic location of resources and services to enable faster processing and reduced latency
- Instead of routing all data to a centralized cloud datacenter,
  - edge computing utilizes distributed computing resources to minimize the distance data needs to travel
    - reducing network latency and improving responsiveness
- beneficial for applications that require real-time or low-latency processing
  - e.g., IoT devices, CDNs, etc.
Auto-scaling
- Is an automated process that adjusts the computing resources allocated to an application based on demand

adam's notes