Site Reliability Engineer (Egypt, USA, Phillipines, Mexico)
Job Location
Pretoria, South Africa
Job Description
Job title: Site Reliability Engineer (SRE) Location : Egypt (Cairo), USA, Phillipines, India Employment Type: Initial 1-year Fixed term contract with option to move into a permanent position Job Description Summary Overview Our client, a global Business Process Outsourcing (BPO) businesses is looking for Site Reliability Engineers (SRE) to support their global payment technology company that provides platforms to consumers, businesses and organizations to make electronic payments. The successful candidate will be responsible for ensuring site reliability & performance, monitoring & alerting, and supporting emergency response situations. This would require working closely with software engineers, DevOps and product teams to maintain robust infrastructure and automation that supports mission-critical applications. The ideal candidate creates a bridge between development and operations by applying a software engineering mindset to service management. We are seeking an individual who is highly motivated, intellectually curious, and seeks out opportunities for improvement. The Role: This role involves working with a team of talented SREs/DevOps Engineers to support highly scalable services. Responsibilities include: Responsible for pipeline build and maintenance in accordance with the clients tooling and conventions. Participate in the software development lifecycle, working closely with the development team to ensure that designed solutions meet non-functional requirements such as availability, performance, security and maintainability standards. Maintain services through monitoring of metrics, system health, and analysis of reports. Provide support for production and in-house systems. Participate in on- call Production support rota. Incident management, on call support and root cause analysis conducting post incident reviews and 5-Whys Remediate system vulnerability , security and resiliency measures. Improve process and systems within the Program. Lead incident management efforts by proactively monitoring and analyzing ISO 8583 financial transaction messages across the 4-party payment model (Cardholder, Merchant, Acquirer, Issuer). Skills & requirements: Card payment domain knowledge (mandatory) Experience with CI/CD and Build pipelines using Jenkins. Experience in public and private Cloud offerings (PCF, Azure, AWS etc.). Knowledge of NoSQL & SQL databases such as Mongo / Oracle/ Experience and knowledge of managing distributed systems and working with microservices. Familiarity with Unix tooling, with strong scripting skills Exposure to working with Monitoring and Alerting tools such as Splunk, Dynatrace Proficiency in one of the following: Python, Java, GO or equivalent. Familiarity defining SLO’s and SLA’s Prior experience of working in an SRE/DevOps team and excellent understanding of SRE/DevOps principles. High degree of initiative and self-motivation, with a willingness to take on challenging opportunities. Excellent communication and relationship building/collaboration skills.
Location: Pretoria, ZA
Posted Date: 7/6/2025
Location: Pretoria, ZA
Posted Date: 7/6/2025
Contact Information
Contact | Human Resources |
---|