This post is a part of a series of posts to keep knowledge around Software quality attributes.

Questions you might want to ask:

  • How much of the downtime system can tolerate?
  • Is this during business hours or around the clock?
  • Which countries/regions are your users from?
  • How fast should the system recover from the failure?
    • If the answer is "immediately", ask if they want to pay $$$$$ for that?
  • How much of the recent data is the system allowed to lose in case of the failure?
    • If the answer is "immediately", ask if they want to pay $$$$$ for that?
  • Can we tolerate downtime in case the Cloud provider experiences an outage?
    • Region outage
    • Core service worldwide outage, AWS S3 outage as an example