Reliability Engineering in the Cloud: Strategies and Practices for AI-Powered Cloud-Based Systems

52,99 €

incl. statutory VAT, free shipping


Product details

Binding

Paperback

Publication date

16 July 2025

Publisher

Pearson Academic

Number of pages

272

Dimensions (L/W/H)

23.1/18.9/1.7 cm

Weight

474 g

Language

English

ISBN

978-0-13-539579-0

Manufacturer address

Libri GmbH
Europaallee 1
36244 Bad Hersfeld
DE

Email: gpsr@libri.de

  • Preface xiii
    Acknowledgments xxiii
    About the Authors xxv

    Chapter 1: Reliability Engineering in the Cloud: How to Design, Build, Operate, and Stress-Test Highly Reliable Systems 1
    Cloud 1
    Resilience 2
    Reliability 2
    Engineering 3
    Engineering Excellence 4
    How to Design and Build Resilient and Reliable Applications 7
    Leveraging Lean Principles 12
    Leveraging Artificial Intelligence 13
    Leveraging Value Stream Mapping 14
    Culture and Values 15
    Operational Excellence 15
    Summary 15
    Q&A 16

    Chapter 2: Resilient, Available, and Scalable Systems: Ensuring That Applications Can Handle Failure in a Controlled Manner 21
    Key Concepts 21
    Design Principles 33
    Chaos Engineering 34
    Validating Resilience 35
    Summary 39
    Q&A 40

    Chapter 3: Incident Response for Fast Recovery: How to Handle Incidents and How to Automate This Process to Improve Time to Detect and Time to Recovery 49
    Incident Response 49
    Fast Recovery 55
    Incident Handling 65
    Summary 67
    Q&A 67

    Chapter 4: Operational Excellence and Change Management: How to Establish Efficient Processes and Maintain Best-in-Class CRE Practices 75
    Key Performance Indicators 76
    Root Cause Analysis 76
    Incident Reviews 78
    Change Management 83
    Case Study 86
    Architecture and Reliability Assessments 89
    Summary 91
    Q&A 91

    Chapter 5: Leveraging Observability, Monitoring, Reliability Metrics, and GenAI: How to Gain Insights, Set Effective Monitoring, Set Service Level Objectives, and Establish Thresholds 95
    Reliability Engineering Capabilities 95
    Ten-Step Process for Creating Effective Monitoring 99
    Maturity Levels 101
    Monitoring and Alerting Tools 102
    Case Study: AI's Impact on CRE 106
    Summary 108
    Q&A 109

    Chapter 6: CRE via Objectives and Key Results (OKRs): How to Build a Culture of Continuous Reliability Improvements Using the OKR Framework 111
    Continuous Improvement in Lean 111
    Application of Lean to CRE 115
    Application of OKRs to CRE 117
    Summary 122
    Q&A 122

    Chapter 7: CRE Tooling: Tools That Support Automatic Failovers, Automatic Rollbacks, Automatic Deployments, Chaos Engineering, Incident Response, Configuration Management, Immutable Infrastructure, and Disaster Recovery 127
    Distributing Load and Volume with Auto-Scaling and Load Balancing 128
    Enabling Automatic Failovers for High Availability 133
    Facilitating Controlled Deployments with Rollback Strategies 136
    Providing Chaos Engineering Capabilities for Resilience Testing 139
    Assisting in Incident Response with Automation 140
    Ensuring Proper Configuration Management 142
    Leveraging Immutable Infrastructure as a Service 144
    Practicing Disaster Recovery Frequently 147
    Case Study 149
    Summary 151
    Q&A 151

    Chapter 8: Cutting-Edge Technologies: How to Use the Power of AI, ML, LLMs, and GenAI Models to Revolutionize Your CRE Practices 155
    Understanding AI, ML, LLMs, and GenAI 155
    Benefits of Integrating These Technologies into CRE Practices 156
    Implementation Considerations 165
    Summary 169
    Q&A 169

    Chapter 9: CRE Value Stream: How to Build Your CRE Strategy Based on Holistic End-to-End Analysis of Your Systems and Customers 175
    What Is a Value Stream? 175
    CRE as a Value Stream 176
    Case Studies 180
    Summary 182
    Q&A 182

    Chapter 10: Culture: How to Build a Psychologically Safe Environment and Culture of Innovation with the CRE Framework 191
    Psychological Safety 191
    Employee Empowerment 192
    Leadership and Ownership 192
    Collaboration and Cross-Functional Teams 194
    Customer Obsession 194
    CRE Culture 195
    Summary 196
    Q&A 197

    Chapter 11: The Business Case for CRE: How to Measure ROI, Ensure Customer Satisfaction, and Promote Business Success 201
    Benefits of CRE 201
    Aligning CRE with Strategic Objectives 204
    Evolution of CRE Practices 206
    Case Studies 208
    Summary 209
    Q&A 210

    Chapter 12: Conclusion 217

    Appendix A: Incident Response Checklist Template 219

    Appendix B: Correction of Error (COE) Document Structure 223

    Appendix C: CRE Change Management Checklist 225

    Glossary 227
    References 231
    Index 235