Staff Site Reliability Engineer

Electrolux Group
Full_timePetaling Jaya, Malaysia

📍 Job Overview

  • Job Title: Staff Site Reliability Engineer
  • Company: Electrolux Group
  • Location: Petaling Jaya, Selangor, Malaysia
  • Job Type: Full-Time
  • Category: DevOps, Infrastructure
  • Date Posted: June 25, 2025
  • Experience Level: 10+ years
  • Remote Status: On-site (Hybrid)

🚀 Role Summary

  • Lead and guide a team of engineers in building and operating the Connectivity Platform for Electrolux, ensuring operational excellence, reliability, scalability, and efficiency of systems.
  • Drive initiatives to improve the availability, performance, and cost-effectiveness of critical services across the entire service lifecycle.
  • Collaborate with engineering and leadership teams to define and execute on infrastructure roadmaps, architectural best practices, and long-term operational goals.
  • Mentor and guide engineers, sharing best practices, reviewing designs, and fostering a culture of continuous improvement.
  • Enhance incident response processes, lead blameless post-mortems, and refine tooling and automation to reduce manual intervention.
  • Contribute to and lead the development of tools and platforms that improve deployment pipelines, observability, and infrastructure as code.

📝 Enhancement Note: This role combines hands-on engineering with strategic oversight, requiring a strong technical leader with experience in driving complex projects and mentoring engineers.

💻 Primary Responsibilities

  • Operational Excellence: Champion reliability and scalability by driving initiatives to improve the availability, performance, and cost-effectiveness of critical services.
  • Technical Strategy: Lead and collaborate with teams to define and execute on infrastructure roadmaps, architectural best practices, and long-term operational goals.
  • Mentoring & Guidance: Mentor engineers, sharing best practices, reviewing designs, and fostering a culture of continuous improvement.
  • Incident Response & Optimization: Enhance incident response processes, lead blameless post-mortems, and refine tooling and automation to reduce manual intervention.
  • Platform Development: Contribute to and lead the development of tools and platforms that improve deployment pipelines, observability, and infrastructure as code.
  • Global Collaboration: Partner with geographically distributed teams to solve complex problems and deliver reliable, scalable systems.

📝 Enhancement Note: This role requires a strong focus on operational excellence, strategic planning, and global collaboration to ensure the Connectivity Platform remains robust and future-ready.

🎓 Skills & Qualifications

Education: Bachelor's degree in Computer Science, Engineering, or a related field. Relevant experience may be considered in lieu of a degree.

Experience: 8+ years of experience in infrastructure/site reliability engineering, with at least 2+ years in a senior or technical leadership role.

Required Skills:

  • Expertise in designing, building, and operating large-scale, distributed systems.
  • Deep knowledge of cloud infrastructure (AWS preferred), Kubernetes, CI/CD pipelines, and observability practices.
  • Proficiency in one or more programming languages such as Java, Python, or Go, with a strong focus on automation.
  • Strong communication and collaboration skills, with the ability to work effectively in a global team.

Preferred Skills:

  • Experience with infrastructure as code (IaC) tools like Terraform or CloudFormation.
  • Familiarity with monitoring and logging tools like Prometheus, Grafana, ELK Stack, or Datadog.
  • Knowledge of containerization and orchestration tools like Docker and Kubernetes.
  • Experience with Agile methodologies and DevOps practices.

📝 Enhancement Note: Candidates should have a strong background in infrastructure and site reliability engineering, with a proven track record of driving complex projects and mentoring engineers.

📊 Web Portfolio & Project Requirements

Portfolio Essentials:

  • Demonstrate experience in building and operating large-scale, distributed systems.
  • Showcase projects that highlight your expertise in cloud infrastructure, Kubernetes, CI/CD pipelines, and observability practices.
  • Include examples of your programming skills and automation projects.
  • Highlight your ability to collaborate effectively in a global team.

Technical Documentation:

  • Provide documentation for your projects, including code comments, version control, and deployment processes.
  • Include performance metrics, testing methodologies, and optimization techniques used in your projects.
  • Showcase your ability to lead blameless post-mortems and refine tooling and automation to reduce manual intervention.

📝 Enhancement Note: Candidates should emphasize their technical skills, leadership experience, and global collaboration abilities in their portfolio and project requirements.

💵 Compensation & Benefits

Salary Range: The estimated salary range for this role in Petaling Jaya, Malaysia is RM 250,000 - RM 350,000 per year (approximately USD 60,000 - USD 85,000), based on regional market standards and the candidate's experience level.

Benefits:

  • Hybrid working arrangements.
  • Discounts on Electrolux products and services.
  • Medical & Hospitalization coverage.
  • Dental & Optical.

Working Hours: Full-time position with a standard workweek of 40 hours, with flexibility for deployment windows, maintenance, and project deadlines.

📝 Enhancement Note: The salary range is estimated based on regional market standards and the candidate's experience level. Electrolux offers a competitive benefits package, including hybrid working arrangements and discounts on their products and services.

🎯 Team & Company Context

🏢 Company Culture

Industry: Electrolux operates in the home appliances industry, with a focus on innovative and sustainable products.

Company Size: Electrolux is a large, global company with over 40,000 employees worldwide. This role will be part of a dynamic international team where English is the natural language.

Founded: Electrolux was founded in 1901 and has since grown into a leading global appliance company.

Team Structure:

  • The team consists of engineers specializing in infrastructure, site reliability, and cloud services.
  • The team follows a matrix reporting structure, with cross-functional collaboration between engineering, product, and business teams.
  • The team is part of the global Electrolux organization, working with geographically distributed teams to deliver reliable, scalable systems.

Development Methodology:

  • The team follows Agile methodologies, with a focus on continuous improvement and iterative development.
  • The team uses CI/CD pipelines, automated deployment, and infrastructure as code (IaC) practices to ensure efficient and reliable system delivery.
  • The team emphasizes blameless post-mortems and knowledge sharing to foster a culture of learning and growth.

Company Website: Electrolux Group

📝 Enhancement Note: Electrolux is a large, global company with a strong focus on innovation and sustainability. The team structure and development methodology emphasize collaboration, continuous improvement, and global collaboration.

📈 Career & Growth Analysis

Web Technology Career Level: This role is a senior-level position, requiring a strong technical leader with experience in driving complex projects and mentoring engineers.

Reporting Structure: This role reports directly to the Head of Connectivity Platform, with a matrix reporting structure to other functional leaders within the organization.

Technical Impact: The Staff Site Reliability Engineer will have a significant impact on the reliability, scalability, and efficiency of the Connectivity Platform, ensuring that it meets the demanding needs of the business and consumers.

Growth Opportunities:

  • Technical Leadership: This role offers the opportunity to grow into a technical leadership position, driving the evolution of infrastructure, tools, and processes for the Connectivity Platform.
  • Global Collaboration: This role provides the opportunity to work with geographically distributed teams, fostering global collaboration and knowledge sharing.
  • Continuous Learning: This role offers the opportunity to learn and work with cutting-edge technologies, driving continuous learning and skill development.

📝 Enhancement Note: This role offers significant growth opportunities, including technical leadership, global collaboration, and continuous learning. Candidates should be prepared to take on a high level of responsibility and drive complex projects.

🌐 Work Environment

Office Type: The office is a modern, collaborative workspace designed to facilitate cross-functional collaboration and knowledge sharing.

Office Location(s): The role is based in the Petaling Jaya office, with the opportunity to work remotely on a hybrid basis.

Workspace Context:

  • The workspace is equipped with modern development tools, multiple monitors, and testing devices to ensure optimal productivity.
  • The workspace encourages collaboration, with open-plan offices and dedicated team spaces for focused work.
  • The workspace is designed to be flexible, with adjustable workstations and ergonomic equipment to support employee well-being.

Work Schedule: The role follows a hybrid work arrangement, with a standard workweek of 40 hours and flexibility for deployment windows, maintenance, and project deadlines.

📝 Enhancement Note: The workspace is designed to be collaborative, flexible, and supportive of employee well-being. The hybrid work arrangement offers a balance between on-site collaboration and remote work flexibility.

📄 Application & Technical Interview Process

Interview Process:

  1. Technical Assessment: A hands-on technical assessment to evaluate the candidate's programming skills, automation experience, and understanding of cloud infrastructure and Kubernetes.
  2. Architectural Design: A system design discussion to assess the candidate's ability to design and scale large-scale, distributed systems.
  3. Behavioral & Cultural Fit: An interview to evaluate the candidate's communication skills, collaboration abilities, and cultural fit within the Electrolux organization.
  4. Final Evaluation: A final evaluation to assess the candidate's overall fit for the role and the organization.

Portfolio Review Tips:

  • Highlight projects that demonstrate your expertise in cloud infrastructure, Kubernetes, CI/CD pipelines, and observability practices.
  • Include examples of your programming skills and automation projects, with a focus on reducing manual intervention and improving system efficiency.
  • Showcase your ability to lead blameless post-mortems and refine tooling and automation to reduce manual intervention.
  • Emphasize your global collaboration and teamwork skills, with examples of successful cross-functional collaboration and knowledge sharing.

Technical Challenge Preparation:

  • Familiarize yourself with the latest trends and best practices in cloud infrastructure, Kubernetes, and CI/CD pipelines.
  • Brush up on your programming skills, with a focus on automation and reducing manual intervention.
  • Prepare for system design questions, focusing on scalability, availability, and cost-efficiency.
  • Practice communicating technical concepts clearly and effectively, with a focus on global collaboration and knowledge sharing.

ATS Keywords: (Organized by category)

  • Programming Languages: Java, Python, Go, Bash, Shell, PowerShell
  • Cloud Infrastructure: AWS, Kubernetes, Docker, Terraform, CloudFormation
  • CI/CD Pipelines: Jenkins, GitLab CI/CD, CircleCI, GitHub Actions
  • Monitoring & Logging: Prometheus, Grafana, ELK Stack, Datadog, New Relic
  • Infrastructure as Code (IaC): Terraform, CloudFormation, Ansible, Puppet, Chef
  • Containerization: Docker, Kubernetes, Helm, ECS, EKS
  • Version Control: Git, SVN, Mercurial
  • Soft Skills: Communication, Collaboration, Leadership, Mentoring, Problem-Solving, Decision-Making
  • Industry Terms: Site Reliability Engineering, Infrastructure, Cloud Services, DevOps, Agile, Scrum, Kanban

📝 Enhancement Note: The interview process focuses on technical assessments, architectural design, behavioral and cultural fit, and a final evaluation. Candidates should prepare for hands-on technical challenges, system design discussions, and behavioral interviews that emphasize global collaboration and knowledge sharing.

🛠 Technology Stack & Web Infrastructure

Cloud Infrastructure:

  • AWS (Amazon Web Services) - The primary cloud provider for Electrolux's Connectivity Platform.
  • Azure & Google Cloud Platform - Familiarity with these cloud providers is a plus, as Electrolux may expand its cloud footprint in the future.

Containerization & Orchestration:

  • Docker - Used for containerizing applications and services.
  • Kubernetes - Used for orchestrating and managing containerized applications at scale.

CI/CD Pipelines:

  • Jenkins - The primary CI/CD tool used by Electrolux for automated testing, building, and deployment.
  • GitLab CI/CD - Familiarity with GitLab CI/CD pipelines is a plus, as Electrolux uses GitLab for version control and project management.

Monitoring & Logging:

  • Prometheus & Grafana - Used for monitoring and visualizing the performance and health of the Connectivity Platform.
  • ELK Stack (Elasticsearch, Logstash, Kibana) - Used for log aggregation, search, and analysis.
  • Datadog & New Relic - Familiarity with these monitoring tools is a plus, as Electrolux may expand its monitoring stack in the future.

Infrastructure as Code (IaC):

  • Terraform - Used for provisioning and managing infrastructure resources in a declarative, modular, and automated way.
  • CloudFormation - Familiarity with AWS CloudFormation is a plus, as Electrolux uses AWS as its primary cloud provider.

📝 Enhancement Note: The technology stack focuses on cloud infrastructure, containerization, CI/CD pipelines, monitoring, and infrastructure as code. Candidates should have experience with these technologies and be prepared to work with cutting-edge tools and platforms.

👥 Team Culture & Values

Web Development Values:

  • Reliability: Prioritize system availability, performance, and scalability to ensure the Connectivity Platform meets the demanding needs of the business and consumers.
  • Collaboration: Foster a culture of knowledge sharing, global collaboration, and continuous learning to drive operational excellence and technical innovation.
  • Continuous Improvement: Embrace a mindset of continuous improvement, with a focus on iterative development, blameless post-mortems, and refining tooling and automation.
  • Customer Focus: Understand and address the needs of internal and external customers, ensuring that the Connectivity Platform delivers value and meets performance, cost-efficiency, and resiliency objectives.

Collaboration Style:

  • Cross-Functional Integration: Work closely with product, design, and business teams to ensure that the Connectivity Platform meets the needs of the business and consumers.
  • Code Review Culture: Foster a culture of code review and peer programming to ensure high-quality, maintainable, and secure systems.
  • Mentoring & Knowledge Sharing: Encourage mentoring and knowledge sharing to drive technical growth and foster a culture of learning and innovation.

📝 Enhancement Note: The team culture emphasizes reliability, collaboration, continuous improvement, and customer focus. Candidates should be prepared to work in a dynamic, global team that prioritizes knowledge sharing, technical innovation, and operational excellence.

⚡ Challenges & Growth Opportunities

Technical Challenges:

  • Scalability: Design and implement scalable systems that can handle increased traffic and user demand while maintaining performance and cost-efficiency.
  • Resiliency: Ensure that the Connectivity Platform can withstand failures and disasters, with automated failover and recovery mechanisms in place.
  • Cost Optimization: Continuously monitor and optimize the cost-efficiency of the Connectivity Platform, identifying and addressing areas of waste and inefficiency.
  • Emerging Technologies: Stay up-to-date with the latest trends and best practices in cloud infrastructure, Kubernetes, and CI/CD pipelines, driving continuous learning and skill development.

Learning & Development Opportunities:

  • Technical Skill Development: Pursue certifications, attend conferences, and engage with online communities to stay current with the latest trends and best practices in cloud infrastructure, Kubernetes, and CI/CD pipelines.
  • Leadership Development: Participate in leadership training programs and mentoring relationships to develop your leadership skills and prepare for technical leadership roles.
  • Architecture Decision-Making: Contribute to and lead architecture decision-making processes, driving the evolution of the Connectivity Platform and ensuring that it remains robust and future-ready.

📝 Enhancement Note: The technical challenges and growth opportunities for this role focus on scalability, resiliency, cost optimization, and continuous learning. Candidates should be prepared to take on complex technical challenges and drive continuous learning and skill development.

💡 Interview Preparation

Technical Questions:

  • Cloud Infrastructure: Describe your experience with cloud infrastructure, highlighting your expertise in AWS, Kubernetes, CI/CD pipelines, and observability practices.
  • System Design: Walk through a system design exercise, focusing on scalability, availability, and cost-efficiency. Be prepared to discuss trade-offs and make informed decisions based on business requirements and technical constraints.
  • Problem-Solving: Present a challenging technical problem you've faced in the past and explain how you approached it, highlighting your problem-solving skills and ability to drive complex projects to completion.

Company & Culture Questions:

  • Global Collaboration: Describe your experience working with geographically distributed teams and explain how you've fostered collaboration, knowledge sharing, and technical innovation in a global context.
  • Agile Methodologies: Explain your experience with Agile methodologies, focusing on continuous improvement, iterative development, and blameless post-mortems.
  • User Experience Impact: Discuss how you've ensured that the systems you've built have met the needs of internal and external customers, highlighting your focus on user experience and customer satisfaction.

Portfolio Presentation Strategy:

  • Live Demonstration: Prepare a live demonstration of your portfolio projects, highlighting your expertise in cloud infrastructure, Kubernetes, CI/CD pipelines, and observability practices.
  • Code Walkthrough: Prepare a code walkthrough of your portfolio projects, focusing on your programming skills, automation, and system design.
  • Architecture Decision Reasoning: Prepare to explain the architecture decisions you've made in your portfolio projects, highlighting your ability to make informed decisions based on business requirements and technical constraints.

📝 Enhancement Note: The interview preparation focuses on technical questions, company and culture questions, and a portfolio presentation strategy. Candidates should be prepared to discuss their technical expertise, global collaboration, and architecture decision-making skills, with a focus on driving complex projects to completion.

📌 Application Steps

To apply for this Staff Site Reliability Engineer position:

  1. Customize Your Portfolio: Tailor your portfolio to highlight your expertise in cloud infrastructure, Kubernetes, CI/CD pipelines, and observability practices. Include examples of your programming skills, automation projects, and global collaboration experiences.
  2. Optimize Your Resume: Highlight your relevant experience, skills, and accomplishments in infrastructure and site reliability engineering. Include keywords related to cloud infrastructure, Kubernetes, CI/CD pipelines, and observability practices to optimize your resume for web technology roles.
  3. Prepare for Technical Challenges: Familiarize yourself with the latest trends and best practices in cloud infrastructure, Kubernetes, and CI/CD pipelines. Brush up on your programming skills, with a focus on automation and reducing manual intervention. Practice communicating technical concepts clearly and effectively, with a focus on global collaboration and knowledge sharing.
  4. Research the Company: Learn about Electrolux's history, mission, and values. Understand the company's focus on innovation, sustainability, and global collaboration. Prepare questions to ask during the interview process to demonstrate your interest and engagement with the company.

⚠️ Important Notice: This enhanced job description includes AI-generated insights and web technology industry-standard assumptions. All details should be verified directly with the hiring organization before making application decisions.

Application Requirements

Candidates should have 8+ years of experience in infrastructure/site reliability engineering, with at least 2+ years in a senior or technical leadership role. Expertise in cloud infrastructure, distributed systems, and proficiency in programming languages is essential.