Exponential Backoff & Service Limits Increase

  • API Rate Limits
    • the number of calls per second for an API. For example
      • DescribeInstances API for EC2 has a limit of 100 calls per seconds
      • GetObject on S3 has a limit of 5500 GET per second per prefix
      • For Intermittent Errors (sometimes happen): implement Exponential Backoff (any AWS service)
      • If getting it consistent: request API throttling limit increase
  • Service Quotas (Service Limits)
    • how many resources we can run. For example
    • On-demand standard instances: only 1152 vCPU
    • To increase Service limit, we open a ticket in Service Quotas API