Stellenbosch, South Africa

Production Reliability Engineer

Entersekt is looking for an engineer or developer who is passionate about solving operational and engineering challenges in a SaaS environment. Our operational capability includes a role that we call Production Reliability Engineering. The Production Reliability Engineer’s (PRE) main responsibilities are:

  • Product hardening
  • Operational enablement and automation
  • Security management and networking support
  • Client support at tier 4 level (escalated by tier 3 service desk)

The ideal candidate

  • Hungry and eager to make a difference at a technical and operational level
  • A healthy self-esteem – you will be challenged by some bright minds on a regular basis
  • Good communication skills – you will need to explain your ideas to your teammates
  • The ability to get things done – we don’t like micro-management, but we expect you to do the right thing
  • A tendency to think outside the box – innovation is valued highly at Entersekt
  • A preference to work in a team rather than by yourself
  • An attitude of being proud of your work and taking ownership

The role

  • Product hardening: operational and client use cases will be driven up the product value chain to yield product enhancements that improve reliability, ease of use, and measurability
  • Operational enablement: includes improved monitoring systems and the building of tools, dashboards and automated processes to empower the product and organizational value chain globally, all the way from our own teams to partners to clients
  • Security management and networking support: from internal certificate management and crypto techniques, to assisting clients with troubleshooting firewall and proxy server issues, the PRE is required to understand how our solution integrates with our clients’ environments from a security and networking perspective and use that knowledge to overcome or anticipate challenges
  • Support escalation: our tier 3/4 service desk will refer to the PRE as an escalation point for troubleshooting complex issues and assisting clients at application, network, and system level to resolve production challenges

Responsibilities

  • Script and develop routines and code to automate and improve backend processes as well as microservice and data management
  • Drive product design for operation/maintenance and resilience early in the SDLC
  • Provide solid feedback and recommendations to Product Development teams regarding product reliability, security, operability, and maintainability on existing and future products
  • Identify and contribute to security enhancements, best practices and recommendations from a client and production operations point of view (e.g. platform distribution and client-side deployment methodologies and identified issues)
  • Participate in incident escalations and resolution, which may occur after hours (no shift work – our Service Desk team performs 24x7 emergency standby)

Skills & Experience

Successful candidates for the Production Reliability Engineer role will generally possess the following qualifications:

  • A software or engineering qualification (e.g. engineering or software degree plus 1-2 years’ experience or a diploma with 2-3 years’ experience)
  • Experience in software development lifecycles, from coding to testing, as this role engages strongly with our Software Development and QA teams
  • Understanding of Linux from an administrative perspective: core fundamentals, command-line operations, scripting, service monitoring, iptables, and networking tools
  • Proficiency and experience working with AWS environments
  • Experience with building and maintaining modern monitoring, analysis, and diagnostics systems (e.g. Prometheus, Zabbix, Splunk, ELK, HP Openview, EUEM, SCOM, or similar)
  • Scripting experience (e.g. Perl, PHP, Bash or similar)
  • Application server experience (e.g. Tomcat, Glassfish, or similar)
  • Experience with micro services/containers (e.g. Docker)
  • Some understanding of APIs (SOAP, REST, or similar)
  • Network protocol and security knowledge (e.g. TCP, UDP, DHCP, DNS, SSL)
  • Strong network troubleshooting skills (e.g. Wireshark)

Apply for this position! Email your CV and cover letter to:

This email address is being protected from spambots. You need JavaScript enabled to view it.

logo entersekt

Entersekt is an international software development company based just outside of Cape Town, South Africa.

We are leaders in authentication, app security, and payments enablement technology, offering a highly scalable solution set with a track record of success across multiple continents.