📍 Job Overview

Job Title: Site Reliability Engineer
Company: bp
Location: Kuala Lumpur, Kuala Lumpur, Malaysia
Job Type: Full-Time
Category: DevOps, Infrastructure
Date Posted: 2025-07-30
Experience Level: 2-5 years
Remote Status: Hybrid (mix of on-site and remote work)

🚀 Role Summary

Key Responsibilities: Ensure operational integrity, resolve complex incidents, manage process improvements, and collaborate with global teams to deliver value in pricing.
Key Technologies: PROS Pricing Platform, SAP ERP, integration platforms (MuleSoft, IBM Connect-Direct, SaaS EDI), Agile (Kanban, Scrum)
Key Skills: Problem management, stakeholder management, communication, application support, service acceptance, collaboration, and facilitation.

💻 Primary Responsibilities

Operational Integrity: Ensure operational compliance with architectural and security standards and with policy controls.
Incident Resolution: Collaborate with technology teams to resolve complex incidents, requests, and problems. Act as a technical advisor to major digital projects.
Process Improvement: Build awareness of internal and external technology developments, manage process and system improvements, and ensure best practices are shared across the team.
Stakeholder Collaboration: Work with vendors, partners, and global teams to integrate into local markets and legal environments. Facilitate discussions and present to decision-makers.
Communication: Explain technical constraints to business partners, and translate regulatory reporting and business requirements for IT teams.

🎓 Skills & Qualifications

Education: Bachelor's degree in Computer Science, Engineering, or a related field.

Experience:

2-5 years of experience in a similar role, with a focus on application support, service acceptance, and stakeholder management.
Proven experience with PROS Pricing platform, SAP ERP, and integration platforms (MuleSoft, IBM Connect-Direct, SaaS EDI).
Strong problem management skills, with the ability to identify and resolve issues independently.
Experience working in Agile (Kanban, Scrum) frameworks and environments.

Required Skills:

Problem-solving and analytical skills
Strong communication and stakeholder management skills
Experience with application support and service acceptance
Familiarity with integration platforms and cloud platforms
Ability to work independently and in a team environment

Preferred Skills:

Experience with infrastructure as code (IaC) tools (e.g., Terraform, Ansible)
Familiarity with monitoring and logging tools (e.g., Prometheus, ELK Stack)
Knowledge of scripting languages (e.g., Python, Bash)
Experience with containerization and orchestration tools (e.g., Docker, Kubernetes)

📊 Web Portfolio & Project Requirements

Portfolio Essentials:

Demonstrate experience with application support, service acceptance, and stakeholder management through case studies or projects.
Showcase problem-solving skills and the ability to resolve complex incidents.
Highlight experience with pricing platforms, integration platforms, and cloud platforms.

Technical Documentation:

Provide documentation of past projects, including architecture decisions, system design, and performance metrics.
Include examples of process improvements and best practices implemented in previous roles.

💵 Compensation & Benefits

Salary Range: MYR 150,000 - 200,000 per annum (based on market research and industry standards for a Site Reliability Engineer with 2-5 years of experience in Kuala Lumpur)

Benefits:

Generous salary package, including an annual bonus program and individual performance-based incentives
Additional EPF contributions totaling 15%
Excellent work-life balance and flexible working arrangements
Collaborative environment that celebrates achievements, diversity, and culture
Ongoing career development and progression opportunities in a global organization
16 weeks paid parental leave (4 weeks partner leave)

Working Hours: 40 hours per week, with flexible working arrangements available.

🎯 Team & Company Context

Company Culture:

Industry: Energy and petrochemicals
Company Size: Large (over 70,000 employees)
Founded: 1909
Team Structure: The role will be part of the Technology team, supporting the Supply, Trading, and Shipping business unit in Australia and New Zealand. The team consists of IT and S Group professionals working alongside business teams to enable significant growth agendas.
Development Methodology: Agile (Kanban, Scrum) frameworks and environments are used to deliver value to the business.

Career & Growth Analysis:

Career Level: Mid-level Site Reliability Engineer, with opportunities for growth into senior roles or technical leadership positions.
Reporting Structure: The role will report to the Technology Manager for ANZ Supply, Trading, and Shipping.
Technical Impact: The role will have a significant impact on the operational integrity, performance, and reliability of pricing systems, ensuring business continuity and supporting business growth.

Growth Opportunities:

Technical Growth: Develop expertise in pricing platforms, integration technologies, and cloud platforms. Stay up-to-date with emerging technologies and best practices in DevOps and Site Reliability Engineering.
Leadership Growth: Build leadership skills through mentoring, coaching, and stakeholder management. Contribute to process improvements and best practice sharing within the team and across the organization.
Career Progression: Progress into senior roles or technical leadership positions within the Technology team or other business units within bp.

🌐 Work Environment

Office Type: Hybrid, with a mix of on-site and remote work.

Office Location(s): Kuala Lumpur, Malaysia

Workspace Context:

Collaborative Workspace: The role will involve working with global and local teams, requiring strong communication and collaboration skills.
Development Tools: Access to modern tools and technologies, including integration platforms, cloud platforms, and monitoring tools.
Cross-Functional Collaboration: Work with business teams, vendors, and partners to deliver value in pricing and support business growth.

Work Schedule: Flexible working arrangements are available, with core working hours of 8:30 AM to 5:30 PM (MYT).

📄 Application & Technical Interview Process

Interview Process:

Phone Screen: A brief phone call to discuss the role, experience, and expectations.
Technical Assessment: A hands-on assessment or take-home challenge to evaluate problem-solving skills, application support, and service acceptance capabilities.
Behavioral Interview: A structured interview focused on behavioral questions, stakeholder management, and communication skills.
Final Interview: A final interview with the hiring manager and/or other team members to assess cultural fit and alignment with the role's requirements.

Portfolio Review Tips:

Highlight relevant projects and case studies that demonstrate application support, service acceptance, and stakeholder management skills.
Include examples of process improvements and best practices implemented in previous roles.
Showcase problem-solving skills and the ability to resolve complex incidents.

Technical Challenge Preparation:

Brush up on application support, service acceptance, and stakeholder management skills.
Review pricing platforms, integration platforms, and cloud platforms used in the role.
Prepare for behavioral interview questions focused on problem-solving, communication, and collaboration.

ATS Keywords: Application Support, Service Acceptance, Stakeholder Management, Problem Management, Pricing Platforms, Integration Platforms, Cloud Platforms, Agile, Kanban, Scrum, DevOps, Site Reliability Engineering

🛠 Technology Stack & Web Infrastructure

Application Support & Service Acceptance:

PROS Pricing Platform
SAP ERP
Integration platforms (MuleSoft, IBM Connect-Direct, SaaS EDI)
Cloud platforms (AWS, Azure, Google Cloud)

Monitoring & Logging:

Prometheus
ELK Stack (Elasticsearch, Logstash, Kibana)
Datadog or other monitoring tools

Collaboration & Communication:

Jira or other project management tools
Confluence or other documentation platforms
Slack or other team communication tools

Scripting & Automation:

Bash
Python
PowerShell
Ansible or other configuration management tools

👥 Team Culture & Values

Team Values:

Safety and security first
Collaboration and teamwork
Continuous improvement and innovation
Respect and inclusion
Accountability and integrity

Collaboration Style:

Cross-functional collaboration with business teams, vendors, and partners
Agile (Kanban, Scrum) frameworks and environments
Regular team meetings and knowledge-sharing sessions
Open and transparent communication

⚡ Challenges & Growth Opportunities

Technical Challenges:

Resolving complex incidents and problems in a high-availability, mission-critical pricing environment.
Managing process and system improvements in a rapidly evolving business landscape.
Collaborating with global teams and stakeholders to deliver value in pricing.

Learning & Development Opportunities:

Develop expertise in pricing platforms, integration technologies, and cloud platforms.
Stay up-to-date with emerging technologies and best practices in DevOps and Site Reliability Engineering.
Build leadership skills through mentoring, coaching, and stakeholder management.
Contribute to process improvements and best practice sharing within the team and across the organization.

💡 Interview Preparation

Technical Questions:

Application Support: Describe your experience with application support, service acceptance, and stakeholder management. Provide examples of complex incidents you've resolved and the processes you've implemented to prevent future issues.
Problem Management: Walk through your problem management process, including identification, analysis, and resolution of issues. Discuss how you measure the success of your problem management efforts.
Stakeholder Management: Explain your approach to managing stakeholders, including communication strategies, conflict resolution, and collaboration techniques. Provide an example of a successful stakeholder management experience.
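
One way to make the "how do you measure success" part of the problem management question concrete is to bring a small calculation to the interview. The sketch below is only an illustration: the incident records, the root-cause tags, and the choice of mean time to resolve (MTTR) and repeat-incident rate as metrics are all assumptions, not something the posting specifies.

```python
from datetime import datetime
from statistics import mean

FMT = "%Y-%m-%d %H:%M"

# Hypothetical incident records: (opened, resolved, root-cause tag).
# These values are made up purely for illustration.
INCIDENTS = [
    ("2025-01-03 09:00", "2025-01-03 11:00", "edi-timeout"),
    ("2025-01-10 14:00", "2025-01-10 15:00", "price-file-format"),
    ("2025-01-21 08:00", "2025-01-21 11:00", "edi-timeout"),
]


def mttr_hours(incidents):
    """Mean time to resolve, in hours, across all incidents."""
    durations = [
        (datetime.strptime(resolved, FMT) - datetime.strptime(opened, FMT)).total_seconds() / 3600
        for opened, resolved, _cause in incidents
    ]
    return mean(durations)


def repeat_rate(incidents):
    """Share of incidents whose root cause had already been seen before."""
    causes = [cause for _opened, _resolved, cause in incidents]
    repeats = len(causes) - len(set(causes))
    return repeats / len(causes)


if __name__ == "__main__":
    print(f"MTTR: {mttr_hours(INCIDENTS):.1f} h")
    print(f"Repeat-incident rate: {repeat_rate(INCIDENTS):.0%}")
```

A falling repeat-incident rate is arguably the more direct signal for problem management, since it reflects root causes being removed rather than incidents merely being closed faster.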

Company & Culture Questions:

bp's Vision: Describe your understanding of bp's vision to reinvent itself from an international oil company (IOC) to an integrated energy company (IEC), and how this role contributes to that vision.
Agile Methodologies: Explain your experience with Agile (Kanban, Scrum) frameworks and environments. Discuss the benefits and challenges of using these methodologies in a pricing context.
Global Collaboration: Describe your experience working with global teams and stakeholders. Discuss the challenges and opportunities that arise from collaborating across time zones and cultures.

Portfolio Presentation Strategy:

Storytelling: Use a storytelling approach to present your portfolio, focusing on the challenges you faced, the solutions you implemented, and the outcomes you achieved.
Before-and-After: Showcase before-and-after examples of process improvements, highlighting the impact of your work on application support, service acceptance, and stakeholder management.
Data-Driven: Use data and metrics to demonstrate the success of your portfolio projects, emphasizing the value you brought to the organization.
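
For the before-and-after and data-driven points above, a short script can turn raw figures into the percentage changes a portfolio slide would quote. This is a minimal sketch: the metric names and numbers are placeholders invented for the example, and a real portfolio should use figures from your own work.

```python
def percent_change(before, after):
    """Relative change from before to after, in percent (negative means a reduction)."""
    if before == 0:
        raise ValueError("before must be non-zero to compute a relative change")
    return (after - before) / before * 100


# Placeholder metrics as (before, after) pairs. Values are illustrative only.
METRICS = {
    "mttr_hours": (4.0, 3.0),
    "monthly_incidents": (20, 15),
}

SUMMARY = {
    name: round(percent_change(before, after), 1)
    for name, (before, after) in METRICS.items()
}

if __name__ == "__main__":
    for name, change in SUMMARY.items():
        print(f"{name}: {change:+.1f}%")
```

Stating the baseline alongside the percentage (for example, "MTTR from 4 h to 3 h") keeps the claim checkable for interviewers.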

📌 Application Steps

To apply for this Site Reliability Engineer position:

Submit your application through the application link.
Tailor your resume to highlight application support, service acceptance, and stakeholder management skills, as well as experience with relevant technologies and platforms.
Prepare a portfolio showcasing your problem-solving skills, process improvements, and best practices implemented in previous roles.
Research bp's vision, values, and culture, and be prepared to discuss how your experience aligns with the company's goals and objectives.
Practice for the technical assessment and behavioral interview, focusing on application support, service acceptance, and stakeholder management skills.

