

Technology-Software
The System Reliability Engineer, Consultant at AIA is responsible for ensuring the reliability, scalability, and performance of enterprise systems by applying software engineering principles to operations. This role involves collaboration with development and operations teams to build automation, monitor system health, respond to incidents, and improve service availability and efficiency.
The System / Site Reliability Engineer role involves ensuring the reliability, scalability, and performance of enterprise systems by applying software engineering principles to operations. The engineer collaborates with development and operations teams to build automation, monitor system health, respond to incidents, and improve service availability and efficiency.
Bachelors degree in Computer Science, Software Engineering, Information Technology, or a related field.
3 6 5 years of experience in Site Reliability Engineering, DevOps, or Software Engineering roles.
Prior experience supporting front-end applications in production environments, preferably in financial services or regulated industries.
Frontend Performance Monitoring; Ability to instrument front-end code for custom metrics and traces.
Experience with Real User Monitoring (RUM), Synthetic Monitoring, and Application Performance Monitoring (APM) tools (e.g., New Relic, Dynatrace, Datadog).
Proficiency in setting up dashboards and alerts using tools like Dynatrace, Grafana, Prometheus, Elastic Stack, or Splunk.
Familiarity with OpenTelemetry standards for distributed tracing.
Scripting skills in Python, Bash, or JavaScript for automation and tooling.
Experience with CI/CD pipelines (e.g., GitHub Flow).
Hands-on experience with cloud platforms (AWS, Azure).
Familiarity with containerization (Docker) and orchestration (Kubernetes).
Understanding of secure coding practices for front-end applications.
Awareness of financial compliance standards (e.g., PCI-DSS).
Company
AIA Malaysia (Insurance)
Location
Kuala Lumpur
Salary
Undisclosed
Skills Required
10 skills
Click to submit your application
Site Reliability Engineering
Devops
Frontend Performance Monitoring
Real User Monitoring (RUM)
Application Performance Monitoring (APM)
Dashboard And Alerting Tools
Scripting And Automation
Continuous Integration And Continuous Deployment (CI/CD)
Cloud Platforms
Containerization And Orchestration