Site Reliability Engineering

Site Reliability Engineering (SRE) is a discipline that incorporates aspects of software engineering and applies them to infrastructure and operations problems to create scalable and reliable software systems. Coursera's SRE catalogue equips you with the principles of SRE, including service level objectives, error budgets, and automation. You'll learn about the design, deployment, and maintenance of large-scale, efficient, and reliable software systems. By understanding incident management, disaster recovery, and creating monitoring systems, you can enhance system reliability and efficiency, making you valuable to any company that relies on robust software infrastructure.
8credentials
23courses

Filter by

Subject
Required

Language
Required

The language used throughout the course, in both instruction and assessments.

Learning Product
Required

Build job-relevant skills in under 2 hours with hands-on tutorials.
Learn from top instructors with graded assignments, videos, and discussion forums.
Get in-depth knowledge of a subject by completing a series of courses and projects.
Earn career credentials from industry leaders that demonstrate your expertise.

Level
Required

Duration
Required

Subtitles
Required

Educator
Required

Results for "site reliability engineering"

  • Status: Free Trial

    Skills you'll gain: Cloud Management, Site Reliability Engineering, Google Cloud Platform, Cost Management, Cloud Computing, Cloud Infrastructure, Budget Management, Capacity Management, Operational Excellence, Corporate Sustainability, Resource Management, Resource Allocation, Sustainability Reporting, Identity and Access Management, Client Support, Disaster Recovery

  • Status: Free Trial

    Skills you'll gain: Cloud Management, Site Reliability Engineering, Cloud Infrastructure, Google Cloud Platform, Public Cloud, Cloud Computing, DevOps, Budget Management, Scalability, Cost Management, Operational Excellence, Operational Efficiency, Corporate Sustainability, Sustainable Business, Customer Support, Disaster Recovery

  • Status: Free Trial

    Skills you'll gain: Site Reliability Engineering, Google Cloud Platform, Kubernetes, Real Time Data, Big Data, Data Infrastructure, CI/CD, Performance Tuning, Data Pipelines, Databases, Containerization, Data Processing, DevOps, Scalability, Cloud Storage, System Monitoring

  • Status: Free Trial

    Skills you'll gain: Cloud Security, Cloud Management, Site Reliability Engineering, Cost Management, Cloud Computing, Google Cloud Platform, DevOps, IT Security Architecture, Data Security, Multi-Tenant Cloud Environments, Financial Controls, System Monitoring, Cybersecurity, Identity and Access Management

  • Status: Free

    Skills you'll gain: Load Balancing, Kubernetes, Site Reliability Engineering, Scalability, Application Deployment, Disaster Recovery, Containerization, YAML, Servers, System Monitoring

  • Skills you'll gain: Site Reliability Engineering, Safety Culture, Culture Transformation, Continuous Delivery, DevOps, Service Level, Continuous Integration, Performance Measurement, Performance Metric, Change Management, Design Thinking, Automation, Data-Driven Decision-Making, Prototyping

  • Skills you'll gain: Cloud Computing Architecture, Amazon Web Services, Cloud Security, Operational Excellence, Reliability, Solution Architecture, Corporate Sustainability, Performance Tuning, Operational Efficiency, Cost Reduction, Security Strategy, Operational Analysis, System Requirements, Site Reliability Engineering, Scalability, Cost Management, Disaster Recovery, Interviewing Skills

  • Status: New
    Status: Free Trial

    Skills you'll gain: Cloud Computing Architecture, Cloud Computing, Scalability, Cloud Infrastructure, Cloud Platforms, Cloud Services, Solution Architecture, Infrastructure As A Service (IaaS), Public Cloud, Software Architecture, Enterprise Architecture, Platform As A Service (PaaS), Disaster Recovery, Site Reliability Engineering, Requirements Analysis