About the Role
Site Reliability Engineers (SREs) are essential to PandaDoc's success, ensuring customers receive a reliable service with minimal downtime.
The SRE team achieves this by:
- Owning the incident management processes and tools.
- Managing the observability stack and alerting systems to enable timely investigation and mitigation.
- Actively contributing to service codebases to proactively prevent incidents and resolve performance bottlenecks.
In essence, SREs are the cornerstone of production service resiliency, driving efforts in observability, incident management, capacity planning, and reliable operations.
In this role, you will:
- Own and influence the incident management process end-to-end.
- Maintain and evolve the on-prem observability stack.
- Keep production applications running smoothly by participating in the on-call rotation.
- Develop automations and tools to suppor...