← Back to opportunities

Site Reliability Engineer - Data Availability

📍 Location
Singapore
⏰ Job Type
Full-time
📅 Posted
May 26, 2026

About the Role

What You Will Do

Operational Support & Incident Management

  • Provide 2nd level support for production systems and critical business applications.

  • Investigate, troubleshoot, and resolve incidents and performance issues.

  • Perform root cause analysis (RCA) and document findings in a structured manner.
  • Monitoring, Observability & Automation

  • Design, implement, and maintain monitoring dashboards.
    Improve alert quality and reduce noise through effective threshold and metric design.
    Analyze logs, metrics, and system behavior to proactively detect anomalies, automate operational processes using Ansible and scripting.
  • What You Bring

  • Operational Mindset & Collaboration

  • Proven experience in Site Reliability Engineering, DevOps, or 2nd level production support.

  • Effective communication skills and ability to work with cross-functional teams.

  • Technical Skills

    Ready to Join Through a Referral?

    Apply now and get connected directly with the hiring team

    Apply for this Position