Senior Site Reliability Engineer at bp

📍 Job Overview

Job Title: Senior Site Reliability Engineer
Company: bp
Location: Budapest, Budapest, Hungary
Job Type: Full-Time (Hybrid)
Category: DevOps Engineer
Date Posted: 2025-06-18
Experience Level: 5-10 years
Remote Status: Hybrid (2 days remote per week)

🚀 Role Summary

Key Responsibilities: Ensure the reliability, performance, and scalability of large-scale, cloud-based applications and infrastructure. Automate routine tasks and provide technical support to other teams.
Key Skills: Site Reliability Engineering, DevOps, Infrastructure Automation, Cloud Platforms, Linux/Unix Systems, Infrastructure as Code, CI/CD Pipelines, Monitoring Systems, Incident Management, Root Cause Analysis, Collaboration Skills, System Scalability, Performance Tuning, Capacity Planning, Curiosity, Adaptability.

📝 Enhancement Note: This role requires a strong background in site reliability engineering or DevOps, with a focus on infrastructure automation and cloud platforms. The ideal candidate will have a proven track record of ensuring the reliability and performance of large-scale applications and infrastructure.

💻 Primary Responsibilities

Build, Maintain, and Troubleshoot Software Solutions and Infrastructure: Ensure the reliability, performance, and scalability of large-scale, cloud-based applications and infrastructure.
Automate Routine Tasks: Improve operational aspects of the site by creating automated solutions.
Provide Technical Support: Collaborate with software developers, engineers, and operations teams to improve system performance and resolve issues.
Detect and Manage Issues: Keep systems up and running by detecting issues and automatically managing failures.
Analyze Incidents: Conduct post-mortem reviews to prevent future disruptions.

📝 Enhancement Note: This role requires a deep understanding of cloud platforms, infrastructure as code tools, and monitoring systems. The ideal candidate will have experience implementing and managing CI/CD pipelines and be comfortable with system scalability, performance tuning, and capacity planning.

🎓 Skills & Qualifications

Education: A bachelor's degree in Computer Science, Engineering, or a related field is typically required.

Experience: Proven experience in site reliability engineering, DevOps, or infrastructure-focused software development roles is essential. Candidates should have at least 5-10 years of experience in a similar role.

Required Skills:

Strong scripting skills in Python or Bash
Hands-on experience with cloud platforms such as AWS or Azure
Practical knowledge of Linux/Unix systems, including system configuration, networking, and troubleshooting
Familiarity with infrastructure as code tools like Terraform and/or Ansible
Experience implementing and managing CI/CD pipelines using tools such as Jenkins, GitLab CI, or similar
Solid understanding of monitoring and logging systems such as Prometheus, Grafana, and the ELK stack
Competence in incident management, root cause analysis, and post-mortem reviews
Strong collaboration skills and the ability to work effectively with both development and operations teams
Comfortable with system scalability, performance tuning, and capacity planning
Curiosity and adaptability to learn new technologies and improve existing systems
Fluency in English; German is a plus, but not required

Preferred Skills:

Experience with containerization and orchestration tools like Kubernetes
Familiarity with infrastructure provisioning and configuration management tools like Terraform and Ansible
Knowledge of infrastructure as code (IaC) best practices and principles
Experience with cloud-native applications and microservices architectures
Familiarity with Agile development methodologies and DevOps practices

📝 Enhancement Note: This role requires a strong background in site reliability engineering or DevOps, with a focus on infrastructure automation and cloud platforms. The ideal candidate will have a proven track record of ensuring the reliability and performance of large-scale applications and infrastructure.

📊 Web Portfolio & Project Requirements

Portfolio Essentials:

A portfolio showcasing your experience in site reliability engineering, DevOps, or infrastructure-focused software development.
Examples of automated solutions you've created to improve operational aspects of sites.
Case studies demonstrating your ability to ensure the reliability, performance, and scalability of large-scale, cloud-based applications and infrastructure.
Documentation of your experience with incident management, root cause analysis, and post-mortem reviews.

Technical Documentation:

Code quality, commenting, and documentation standards.
Version control, deployment processes, and server configuration.
Testing methodologies, performance metrics, and optimization techniques.

📝 Enhancement Note: For this role, your portfolio should focus on demonstrating your technical skills and experience in site reliability engineering or DevOps. Include examples of your work that showcase your ability to ensure the reliability, performance, and scalability of large-scale, cloud-based applications and infrastructure.

💵 Compensation & Benefits

Salary Range: The estimated salary range for this role in Budapest, Hungary is €60,000 - €80,000 per year, based on industry standards and regional cost of living.

Benefits:

Different bonus opportunities based on performance, wide range of cafeteria elements
Life & health insurance, medical care package
Flexible working schedule: home office up to 2 days per week, based on team agreement
Opportunity to build up long-term career path and develop your skills with wide range of learning options
Family-friendly workplace e.g., extended parental leave, mother-baby room
Employees' wellbeing programs e.g., Employee Assistance Program, Company Recognition Program
Possibility to join social communities and networks
Chill-out and collaboration spaces in beautiful Budapest offices e.g., Play Zones, Office massage, Sport and music equipment
Assets like phone for private usage and company laptop are provided from the first day of employment with other equipment if requested

Working Hours: Full-time (40 hours per week) with flexible working hours and the option to work from home up to 2 days per week.

📝 Enhancement Note: The salary range provided is an estimate based on industry standards and regional cost of living. The actual salary may vary depending on the candidate's experience and qualifications.

🎯 Team & Company Context

🏢 Company Culture

Industry: Energy, Oil & Gas

Company Size: Large (Over 10,000 employees)

Founded: 1901

Team Structure:

The technology team at bp is responsible for building, maintaining, and improving the company's digital platforms and infrastructure.
The team is structured into several sub-teams, including site reliability engineering, software development, data engineering, and DevOps.
The site reliability engineering team works closely with software development and operations teams to ensure the reliability, performance, and scalability of bp's digital platforms.

Development Methodology:

Agile development methodologies are used to manage projects and deliver features to production.
Code reviews, testing, and quality assurance practices are implemented to ensure code quality and maintainability.
Deployment strategies, CI/CD pipelines, and server management are used to automate the deployment process and ensure high availability.

Company Website: https://www.bp.com/

📝 Enhancement Note: bp is a large, multinational energy company with a strong focus on digital transformation. The technology team at bp is responsible for driving the company's digital strategy and ensuring the reliability, performance, and scalability of its digital platforms.

📈 Career & Growth Analysis

Web Technology Career Level: Senior Site Reliability Engineer

Reporting Structure: This role reports directly to the Site Reliability Engineering Manager.

Technical Impact: The Senior Site Reliability Engineer is responsible for ensuring the reliability, performance, and scalability of large-scale, cloud-based applications and infrastructure. This role has a significant impact on the overall performance and availability of bp's digital platforms.

Growth Opportunities:

Technical Growth: Opportunities to specialize in specific technologies or domains, such as cloud platforms, infrastructure as code, or monitoring systems.
Leadership Growth: Opportunities to take on leadership roles within the site reliability engineering team or across the broader technology organization.
Career Transition: Opportunities to transition into other technical roles within the technology organization, such as software development, data engineering, or DevOps.

📝 Enhancement Note: This role offers significant opportunities for technical growth and leadership development within the site reliability engineering team and across the broader technology organization at bp.

🌐 Work Environment

Office Type: Hybrid (2 days remote per week)

Office Location(s): Budapest, Hungary

Workspace Context:

The workspace at bp is designed to foster collaboration and innovation, with open-plan offices, meeting rooms, and breakout spaces.
The technology team at bp uses a variety of tools and technologies to support its work, including cloud platforms, infrastructure as code tools, and monitoring systems.
The team works closely with other departments within bp, including business, marketing, and operations, to ensure that the company's digital platforms meet the needs of its customers and stakeholders.

Work Schedule: Full-time (40 hours per week) with flexible working hours and the option to work from home up to 2 days per week.

📝 Enhancement Note: The hybrid work environment at bp offers the best of both worlds, with the opportunity to work from home up to 2 days per week and collaborate with colleagues in the office.

📄 Application & Technical Interview Process

Interview Process:

Technical Phone Screen: A brief phone call to assess your technical skills and cultural fit for the role.
Technical Deep Dive: A more in-depth technical interview focused on your experience with site reliability engineering, DevOps, and cloud platforms. You may be asked to complete a technical challenge or case study.
Behavioral Interview: An interview focused on your problem-solving skills, communication, and collaboration abilities.
Final Interview: A meeting with the hiring manager to discuss your career aspirations, growth opportunities, and next steps.

Portfolio Review Tips:

Highlight your experience with site reliability engineering, DevOps, and cloud platforms.
Include examples of your work that demonstrate your ability to ensure the reliability, performance, and scalability of large-scale, cloud-based applications and infrastructure.
Showcase your problem-solving skills and your ability to work effectively with development and operations teams.

Technical Challenge Preparation:

Brush up on your knowledge of cloud platforms, infrastructure as code tools, and monitoring systems.
Practice incident management, root cause analysis, and post-mortem review techniques.
Prepare for questions about system scalability, performance tuning, and capacity planning.

ATS Keywords:

Site Reliability Engineering
DevOps
Infrastructure Automation
Cloud Platforms
Linux/Unix Systems
Infrastructure as Code
CI/CD Pipelines
Monitoring Systems
Incident Management
Root Cause Analysis
Collaboration Skills
System Scalability
Performance Tuning
Capacity Planning
Curiosity
Adaptability
Python
Bash
AWS
Azure
Terraform
Ansible
Jenkins
GitLab CI
Prometheus
Grafana
ELK Stack
Agile Development Methodologies
DevOps Practices

📝 Enhancement Note: The interview process for this role is designed to assess your technical skills and cultural fit for the site reliability engineering team at bp. The technical challenge and interview questions will focus on your experience with site reliability engineering, DevOps, and cloud platforms.

🛠 Technology Stack & Web Infrastructure

Frontend Technologies: N/A (This role focuses on backend and infrastructure technologies)

Backend & Server Technologies:

Cloud Platforms: AWS, Azure
Infrastructure as Code: Terraform, Ansible
CI/CD Pipelines: Jenkins, GitLab CI
Monitoring Systems: Prometheus, Grafana, ELK Stack
Linux/Unix Systems: Ubuntu, CentOS, Debian
Containerization: Docker, Kubernetes
Server Management: Nginx, Apache

Development & DevOps Tools:

Version Control: Git
Collaboration: Jira, Confluence
Project Management: Jira, Trello
Communication: Slack, Microsoft Teams

📝 Enhancement Note: The technology stack for this role includes a variety of cloud platforms, infrastructure as code tools, and monitoring systems. The ideal candidate will have experience with these technologies and be comfortable working in a dynamic, fast-paced environment.

👥 Team Culture & Values

Web Development Values:

Reliability: Ensuring the availability, performance, and scalability of large-scale, cloud-based applications and infrastructure.
Automation: Improving operational aspects of the site by creating automated solutions.
Collaboration: Working effectively with development and operations teams to ensure the reliability and performance of bp's digital platforms.
Continuous Learning: Staying up-to-date with the latest technologies and best practices in site reliability engineering and DevOps.

Collaboration Style:

Cross-Functional Collaboration: Working closely with software development, data engineering, and operations teams to ensure the reliability and performance of bp's digital platforms.
Code Review Culture: Collaborating with other site reliability engineers and developers to ensure code quality and maintainability.
Peer Programming: Pairing with other team members to share knowledge, improve skills, and ensure code quality.

📝 Enhancement Note: The team culture at bp values collaboration, continuous learning, and a strong focus on ensuring the reliability and performance of the company's digital platforms.

⚡ Challenges & Growth Opportunities

Technical Challenges:

Large-Scale Infrastructure Management: Ensuring the reliability, performance, and scalability of large-scale, cloud-based applications and infrastructure.
Incident Management: Detecting and managing issues to keep systems up and running.
Automation: Improving operational aspects of the site by creating automated solutions.
Performance Optimization: Identifying and addressing performance bottlenecks and optimization opportunities.
Emerging Technologies: Staying up-to-date with the latest technologies and best practices in site reliability engineering and DevOps.

Learning & Development Opportunities:

Technical Training: Opportunities to attend training sessions, workshops, and conferences to improve your skills and knowledge in site reliability engineering and DevOps.
Mentorship Program: A mentorship program to help you develop your technical skills and career progression.
Leadership Development: Opportunities to take on leadership roles within the site reliability engineering team or across the broader technology organization.

📝 Enhancement Note: This role offers significant opportunities for technical growth and leadership development within the site reliability engineering team and across the broader technology organization at bp.

💡 Interview Preparation

Technical Questions:

Cloud Platforms: Questions about your experience with AWS, Azure, or other cloud platforms.
Infrastructure as Code: Questions about your experience with Terraform, Ansible, or other infrastructure as code tools.
Monitoring Systems: Questions about your experience with Prometheus, Grafana, ELK Stack, or other monitoring systems.
Incident Management: Questions about your experience with incident management, root cause analysis, and post-mortem reviews.
System Scalability: Questions about your experience with system scalability, performance tuning, and capacity planning.

Company & Culture Questions:

Company Culture: Questions about your understanding of bp's company culture, values, and mission.
Team Dynamics: Questions about your ability to work effectively in a team and collaborate with other departments within bp.
Career Growth: Questions about your career aspirations, growth opportunities, and long-term goals.

Portfolio Presentation Strategy:

Technical Deep Dive: Prepare a detailed walkthrough of your portfolio, highlighting your experience with site reliability engineering, DevOps, and cloud platforms.
Incident Management: Include examples of your experience with incident management, root cause analysis, and post-mortem reviews.
System Scalability: Showcase your understanding of system scalability, performance tuning, and capacity planning.

📝 Enhancement Note: The interview process for this role is designed to assess your technical skills and cultural fit for the site reliability engineering team at bp. The technical challenge and interview questions will focus on your experience with site reliability engineering, DevOps, and cloud platforms.

📌 Application Steps

To apply for this Senior Site Reliability Engineer position at bp:

Update Your Portfolio: Highlight your experience with site reliability engineering, DevOps, and cloud platforms. Include examples of your work that demonstrate your ability to ensure the reliability, performance, and scalability of large-scale, cloud-based applications and infrastructure.
Tailor Your Resume: Emphasize your technical skills and experience with site reliability engineering, DevOps, and cloud platforms. Include relevant keywords and phrases to optimize your resume for the ATS system.
Prepare for Technical Challenges: Brush up on your knowledge of cloud platforms, infrastructure as code tools, and monitoring systems. Practice incident management, root cause analysis, and post-mortem review techniques. Prepare for questions about system scalability, performance tuning, and capacity planning.
Research the Company: Familiarize yourself with bp's company culture, values, and mission. Understand the company's business and the role of the technology team in driving its digital strategy.

⚠️ Important Notice: This enhanced job description includes AI-generated insights and web development/server administration industry-standard assumptions. All details should be verified directly with the hiring organization before making application decisions.

Senior Site Reliability Engineer