This job listing has expired and may no longer be relevant!
27 Mar 2025

Permanent Site Reliability Engineer – Pepkor Vacancies

Pepkor – Posted by Joblink24 Western Cape, South Africa

Job Description

Pepkor Vacancies – Site Reliability Engineer

Pepkor is seeking a skilled Site Reliability Engineer to join our team. This role is ideal for a professional with a strong background in automation, cloud infrastructure, and system reliability.

Responsibilities:

  • Develop expertise in multiple scripting and programming languages to create efficient, scalable solutions.

  • Design and implement automation tools and processes for managing large-scale systems.

  • Lead critical incident responses with a proactive approach to resolution and post-incident analysis.

  • Shape system architecture and influence high-impact decisions to enhance reliability and performance.

  • Establish and maintain reliability standards to ensure scalable and sustainable system operations.

  • Demonstrate strategic thinking and planning to drive organizational success.

  • Provide technical leadership, influencing key decisions and collaborating with cross-functional teams.

  • Mentor and coach junior and intermediate engineers to foster a culture of learning and growth.

Minimum Requirements:

  • 8-10 years of experience in Site Reliability Engineering, DevOps, or Systems Engineering.

  • Proficiency in scripting languages for automation and system management.

  • Relevant certifications such as Oracle, Cloud, or DevOps.

Technical Skills:

  • Continuous delivery and deployment strategies.

  • Cloud computing expertise and best practices.

  • System and application performance monitoring (observability).

  • Infrastructure as code and configuration management.

  • Containerization and orchestration.

  • Automation of system operations.

  • Effective collaboration and communication within technical teams.

  • Coding and scripting for infrastructure and application management.

  • Experience with Azure DevOps and other CI/CD tools.

  • System uptime optimization and reliability engineering.

  • Service-level objectives (SLOs) and latency management.

  • Incident response, outage resolution, and change management.

  • Capacity planning and scaling strategies.

APPLY NOW

217 total views, 1 today

Apply for this Job