[Design-Data-Intensive-Application]- Reliability

发布时间 2023-09-19 10:26:48作者: ylxn

1  How to work under failure 

1) error & solution

error

  • HW-error disk / network / machine 
  • SW-error bug (sandbox-> Experiment)
  • human error (Access control list) .  operation error

 

solution

  • Access control list
  • sandbox
  • CI / CD + Automade Testing (version)
  • Monitory (RPC Error . Metrix (Cpu memory)
  • rollout Feature