Senior Cloud Operations Engineer - AWS
📍 Job Overview
- Job Title: Senior Cloud Operations Engineer - AWS
- Company: Endava
- Location: Cluj-Napoca, Cluj, Romania
- Job Type: Full-time
- Category: DevOps Engineer, System Administrator, Cloud Engineer
- Date Posted: June 19, 2025
- Experience Level: Mid-Senior level (5-10 years)
- Remote Status: On-site/Hybrid
🚀 Role Summary
- Cloud Infrastructure Management: Oversee day-to-day operations, ensuring system availability, performance, and reliability in a 24/7 support model.
- Incident & Problem Management: Implement ITIL-based service management, including incident, request, and problem management to minimize business impact and prevent recurring issues.
- Automation & IaC: Build, manage, and automate AWS cloud environments using Infrastructure as Code (Terraform) to reduce manual intervention and improve system consistency.
- Collaboration: Work closely with development and application teams to streamline deployments and operational processes, ensuring infrastructure availability for business-critical services.
📝 Enhancement Note: This role requires a strong focus on operational excellence, with a commitment to 24/7 support coverage and a passion for automation to drive system reliability and efficiency.
💻 Primary Responsibilities
- Cloud Infrastructure Support: Monitor, maintain, and improve system availability, performance, and reliability, ensuring infrastructure availability for business-critical services.
- Incident & Request Management: Detect, investigate, and resolve infrastructure incidents rapidly, and fulfill service requests within agreed service levels, following ITIL standards.
- Problem Management: Identify root causes of recurring issues and implement permanent fixes to prevent reoccurrence, ensuring compliance with security, governance, and operational standards.
- Automation & IaC: Build and manage AWS cloud environments using Terraform, implementing automation for operational tasks to reduce manual intervention and improve system consistency.
- Collaboration & Documentation: Collaborate with development and application teams to streamline deployments and operational processes, maintaining detailed documentation of infrastructure configurations, operating procedures, and troubleshooting guides.
🎓 Skills & Qualifications
Education: Bachelor's degree in Computer Science, Information Technology, or a related field. Relevant certifications (e.g., AWS Solutions Architect, SysOps Administrator) are a plus.
Experience: Proven experience (5-10 years) managing AWS cloud infrastructure, with strong hands-on experience in Infrastructure as Code using Terraform. Familiarity with ITIL-based service management and scripting skills in languages like Bash, Python, or PowerShell are essential.
Required Skills:
- AWS cloud infrastructure management
- Terraform (Infrastructure as Code)
- ITIL-based service management (incident, request, problem management)
- Scripting (Bash, Python, PowerShell)
- Networking, security groups, load balancers, IAM policies, and monitoring in cloud environments
- Strong communication and collaboration skills
Preferred Skills:
- Experience with observability tools (Splunk, CloudWatch, Datadog)
- Familiarity with CI/CD pipelines, configuration management tools, and automation frameworks
- AWS certifications (Solutions Architect, SysOps Administrator)
📊 Web Portfolio & Project Requirements
Portfolio Essentials:
- Demonstrate experience managing AWS cloud infrastructure, highlighting system availability, performance, and reliability improvements.
- Showcase automation and IaC projects using Terraform, emphasizing reduced manual intervention and improved system consistency.
- Display incident and problem management case studies, illustrating rapid issue resolution and permanent fixes implementation.
Technical Documentation:
- Provide detailed documentation of infrastructure configurations, operating procedures, and troubleshooting guides, demonstrating clear and concise communication of technical information.
- Include examples of capacity planning, cost optimization, and performance tuning activities, showcasing proactive approach to infrastructure management.
📝 Enhancement Note: While a portfolio is not explicitly required, demonstrating relevant projects and case studies will strengthen your application and provide valuable insights into your problem-solving approach and technical skills.
💵 Compensation & Benefits
Salary Range: The estimated salary range for this role in Cluj-Napoca, Romania is between 35,000 RON and 50,000 RON per year (approximately 7,500 EUR to 11,000 EUR), based on market research and industry standards for mid-senior level DevOps and cloud engineering roles.
Benefits:
- Competitive salary package
- Share plan
- Company performance bonuses
- Value-based recognition awards
- Referral bonus
- Career coaching
- Global career opportunities
- Non-linear career paths
- Internal development programs for management and technical leadership
- Complex projects, rotations, internal tech communities, training, certifications, coaching, online learning platforms subscriptions, pass-it-on sessions, workshops, conferences
- Hybrid work and flexible working hours
- Employee assistance program
- Global internal wellbeing program, access to wellbeing apps
- Inclusion and diversity programs
Working Hours: 40 hours per week, with a commitment to 24/7 support coverage and participation in on-call rotations.
📝 Enhancement Note: The salary range provided is an estimate based on market research and may vary depending on individual qualifications, experience, and company-specific compensation structures.
🎯 Team & Company Context
🏢 Company Culture
Industry: Endava is a technology company focused on delivering innovative solutions for clients across various industries, including finance, insurance, media, and retail.
Company Size: Endava is a mid-sized company with a global presence, employing over 9,000 professionals across multiple locations. This size offers the benefits of a large organization, such as diverse project opportunities and extensive resources, while maintaining a collaborative and agile work environment.
Founded: Endava was founded in 2000 and has since grown into a global technology partner, delivering innovative solutions that drive business value for clients.
Team Structure:
- The cloud operations team consists of dedicated professionals responsible for managing and maintaining cloud infrastructure, ensuring system availability, performance, and reliability.
- The team works closely with development and application teams to streamline deployments and operational processes, collaborating on projects and sharing technical expertise.
- The team follows an Agile development methodology, with regular sprint planning and code reviews to ensure high-quality deliverables and efficient collaboration.
Development Methodology:
- Endava follows an Agile development methodology, with a focus on iterative development, continuous improvement, and customer satisfaction.
- The company emphasizes collaboration, communication, and knowledge sharing, fostering a culture of innovation and continuous learning.
- Endava's development process includes code reviews, testing, and quality assurance practices to ensure high-quality deliverables and efficient collaboration.
Company Website: Endava
📝 Enhancement Note: Endava's culture emphasizes collaboration, innovation, and continuous learning, providing a supportive environment for professionals seeking to grow and develop their skills in cloud infrastructure management and DevOps.
📈 Career & Growth Analysis
Cloud Operations Engineer Career Level: This role is at the mid-senior level, with a focus on managing and maintaining cloud infrastructure, ensuring system availability, performance, and reliability. The engineer will be responsible for implementing ITIL-based service management, automating operational tasks, and collaborating with development and application teams to streamline deployables and operational processes.
Reporting Structure: The Senior Cloud Operations Engineer will report directly to the Cloud Operations Manager and work closely with development and application teams, as well as other internal stakeholders, to ensure infrastructure availability and support business-critical services.
Technical Impact: The engineer will have a significant impact on the performance, reliability, and security of cloud infrastructure, ensuring that it meets the needs of the business and supports critical applications and services. They will also play a crucial role in driving operational excellence and continuous improvement through automation and process optimization.
Growth Opportunities:
- Technical Leadership: As a senior-level role, this position offers opportunities for technical leadership and mentoring, with the potential to guide junior team members and contribute to the development of best practices and standards.
- Architecture & Design: The engineer may have the opportunity to contribute to the design and architecture of cloud infrastructure, ensuring that it is scalable, secure, and optimized for performance.
- Project Management: With experience in incident, request, and problem management, the engineer may have the opportunity to take on project management responsibilities, leading cross-functional teams to deliver complex projects and initiatives.
📝 Enhancement Note: Endava's global presence and diverse client base provide ample opportunities for career growth and development, with potential for technical leadership, architecture and design, and project management roles.
🌐 Work Environment
Office Type: Endava's Cluj-Napoca office is a modern, collaborative workspace designed to foster innovation and creativity. The office features open-plan workspaces, meeting rooms, and breakout areas, with a focus on employee comfort and productivity.
Office Location(s): Endava's Cluj-Napoca office is located in the city center, with easy access to public transportation and nearby amenities. The office is also easily accessible by car, with dedicated parking available for employees.
Workspace Context:
- Collaborative Workspace: The open-plan office layout encourages collaboration and communication, with dedicated spaces for team meetings, workshops, and brainstorming sessions.
- Technology & Equipment: Endava provides state-of-the-art technology and equipment, including high-performance workstations, multiple monitors, and testing devices, to ensure that employees have the tools they need to perform their jobs effectively.
- Work-Life Balance: Endava offers a hybrid work arrangement, allowing employees to balance their work and personal lives more effectively. The company also provides an employee assistance program to support employees' physical and mental well-being.
Work Schedule: Endava operates on a standard business hours schedule, with a commitment to 24/7 support coverage and participation in on-call rotations for critical infrastructure and business-critical services.
📝 Enhancement Note: Endava's modern, collaborative workspace and flexible work arrangements provide an ideal environment for cloud infrastructure engineers seeking to grow and develop their skills in a supportive and innovative work environment.
📄 Application & Technical Interview Process
Interview Process:
- Phone/Video Screen: A brief conversation to assess your technical skills, experience, and cultural fit with Endava.
- Technical Assessment: A hands-on technical assessment, focusing on your AWS cloud infrastructure management skills, automation experience, and problem-solving abilities.
- On-site/Video Interview: A face-to-face meeting with the hiring manager and other team members to discuss your technical approach, career goals, and cultural fit with Endava.
- Final Decision: A decision will be made based on your technical skills, experience, and cultural fit with the team and company.
Portfolio Review Tips:
- Highlight your experience managing AWS cloud infrastructure, emphasizing system availability, performance, and reliability improvements.
- Showcase your automation and IaC projects using Terraform, demonstrating reduced manual intervention and improved system consistency.
- Include incident and problem management case studies, illustrating your ability to resolve complex issues and implement permanent fixes.
Technical Challenge Preparation:
- Brush up on your AWS cloud infrastructure management skills, focusing on incident, request, and problem management.
- Familiarize yourself with Terraform and other automation tools, ensuring you can demonstrate your ability to reduce manual intervention and improve system consistency.
- Prepare for behavioral questions, focusing on your problem-solving approach, communication skills, and collaboration with development and application teams.
ATS Keywords: [See the comprehensive list of web development and server administration-relevant keywords for resume optimization, organized by category: programming languages, web frameworks, server technologies, databases, tools, methodologies, soft skills, industry terms]
📝 Enhancement Note: Endava's interview process focuses on assessing your technical skills, experience, and cultural fit with the team and company. By preparing for each stage of the interview process and tailoring your portfolio and resume to highlight your relevant experience and skills, you will increase your chances of success.
🛠 Technology Stack & Web Infrastructure
Cloud Infrastructure:
- AWS (Amazon Web Services) - The primary cloud infrastructure provider, with a focus on managing and maintaining AWS cloud environments.
- Terraform - Infrastructure as Code (IaC) tool used to build, manage, and automate AWS cloud environments, reducing manual intervention and improving system consistency.
Monitoring & Logging:
- CloudWatch - AWS's native monitoring and logging service, used to collect and track metrics, collect and monitor log files, set alarms, and automatically react to changes in your AWS resources.
- Datadog - A third-party monitoring and analytics platform that provides real-time visibility into your infrastructure and applications, enabling you to proactively identify and resolve performance issues.
Configuration & Automation:
- Ansible - An open-source automation and configuration management tool used to provision and manage AWS resources, ensuring consistency and reducing manual intervention.
- Jenkins - A popular open-source automation server used to implement CI/CD pipelines, enabling continuous integration and delivery of cloud infrastructure and applications.
Collaboration & Communication:
- Jira - A project management and issue tracking tool used to collaborate with development and application teams, streamline deployments, and manage operational processes.
- Confluence - A collaboration software used to share information and knowledge, fostering a culture of innovation and continuous learning.
📝 Enhancement Note: Endava's technology stack focuses on AWS cloud infrastructure management, with a strong emphasis on automation, monitoring, and collaboration. Familiarity with these tools and technologies will be essential for success in this role.
👥 Team Culture & Values
Cloud Operations Values:
- Reliability: Endava values reliability in its cloud infrastructure management, ensuring that systems are available, performant, and reliable, with a commitment to 20/7 support coverage and minimal downtime.
- Automation: Endava emphasizes automation as a critical enabler of operational excellence, reducing manual intervention and improving system consistency and reliability.
- Collaboration: Endava fosters a culture of collaboration, with a focus on working closely with development and application teams to streamline deployments and operational processes.
- Continuous Improvement: Endava encourages continuous improvement, with a focus on driving operational excellence through process optimization, automation, and innovation.
Collaboration Style:
- Cross-Functional Integration: Endava's cloud operations team works closely with development and application teams, ensuring that infrastructure availability supports critical business functions and services.
- Code Review Culture: Endava emphasizes code reviews and peer programming, fostering a culture of knowledge sharing, collaboration, and continuous learning.
- Knowledge Sharing: Endava encourages knowledge sharing, with a focus on technical mentoring, coaching, and community-building activities that promote a culture of innovation and continuous learning.
📝 Enhancement Note: Endava's cloud operations team values reliability, automation, collaboration, and continuous improvement, fostering a culture of innovation and continuous learning that drives operational excellence and technical expertise.
⚡ Challenges & Growth Opportunities
Technical Challenges:
- Incident & Problem Management: Manage and resolve complex infrastructure incidents and problems, ensuring minimal business impact and implementing permanent fixes to prevent reoccurrence.
- Automation & IaC: Automate operational tasks using Terraform, reducing manual intervention and improving system consistency and reliability.
- Capacity Planning & Optimization: Plan and optimize cloud infrastructure to meet business demands, ensuring cost-efficiency and scalability.
- Security & Compliance: Ensure that cloud infrastructure meets security and compliance standards, protecting sensitive data and maintaining regulatory compliance.
Learning & Development Opportunities:
- Technical Training: Endava offers technical training and certification opportunities, enabling engineers to develop their skills and advance their careers in cloud infrastructure management and DevOps.
- Project Rotations: Endava provides opportunities for engineers to rotate through different projects and teams, gaining diverse experience and exposure to new technologies and methodologies.
- Technical Leadership: As a senior-level role, this position offers opportunities for technical leadership and mentoring, with the potential to guide junior team members and contribute to the development of best practices and standards.
📝 Enhancement Note: Endava's technical challenges and growth opportunities provide ample opportunities for cloud infrastructure engineers to develop their skills, advance their careers, and make a significant impact on the company's operational excellence and technical innovation.
💡 Interview Preparation
Technical Questions:
- AWS Cloud Infrastructure Management: Describe your experience managing AWS cloud infrastructure, highlighting system availability, performance, and reliability improvements.
- Incident & Problem Management: Walkthrough a complex infrastructure incident or problem you've managed, explaining your approach to resolution and permanent fix implementation.
- Automation & IaC: Explain your experience with automation and Infrastructure as Code (IaC) tools, such as Terraform, and how you've used them to reduce manual intervention and improve system consistency.
- Capacity Planning & Optimization: Describe your approach to capacity planning and optimization, ensuring that cloud infrastructure meets business demands while maintaining cost-efficiency and scalability.
Company & Culture Questions:
- Endava Culture: Explain what you understand about Endava's culture and how you think you would fit in with the team and company.
- Collaboration & Communication: Describe your experience working with development and application teams, and how you've collaborated to streamline deployables and operational processes.
- Continuous Improvement: Explain your approach to continuous improvement, and how you've driven operational excellence through process optimization, automation, and innovation.
Portfolio Presentation Strategy:
- Cloud Infrastructure Management: Highlight your experience managing AWS cloud infrastructure, emphasizing system availability, performance, and reliability improvements.
- Incident & Problem Management: Include case studies illustrating your ability to resolve complex infrastructure incidents and problems, with a focus on minimizing business impact and implementing permanent fixes.
- Automation & IaC: Showcase your automation and IaC projects using Terraform, demonstrating reduced manual intervention and improved system consistency.
- Capacity Planning & Optimization: Include examples of capacity planning and optimization projects, demonstrating your ability to meet business demands while maintaining cost-efficiency and scalability.
📝 Enhancement Note: Endava's interview process focuses on assessing your technical skills, experience, and cultural fit with the team and company. By preparing for technical and company-specific questions, and tailoring your portfolio to highlight your relevant experience and skills, you will increase your chances of success.
📌 Application Steps
To apply for this Senior Cloud Operations Engineer - AWS position at Endava:
- Submit Your Application: Click the "Apply" button on the job listing to submit your resume and cover letter.
- Tailor Your Portfolio: Highlight your experience managing AWS cloud infrastructure, automation, and incident management projects, ensuring that your portfolio demonstrates your technical skills and problem-solving approach.
- Optimize Your Resume: Include relevant keywords and phrases from the job listing, ensuring that your resume is optimized for Endava's Applicant Tracking System (ATS) and highlights your relevant experience and skills.
- Prepare for Technical Challenges: Brush up on your AWS cloud infrastructure management skills, automation experience, and incident management techniques, ensuring that you're ready to tackle Endava's technical assessments and interviews.
- Research Endava: Familiarize yourself with Endava's company culture, values, and mission, ensuring that you understand the company's focus on innovation, collaboration, and continuous learning.
⚠️ Important Notice: This enhanced job description includes AI-generated insights and web development/server administration industry-standard assumptions. All details should be verified directly with the hiring organization before making application decisions.
Application Requirements
Candidates must have proven experience managing AWS cloud infrastructure and strong hands-on experience with Infrastructure as Code using Terraform. Familiarity with ITIL-based service management and scripting skills in languages like Bash or Python are also essential.