Senior Manager of SRE
📍 Job Overview
- Job Title: Senior Manager of SRE
- Company: Nexthink
- Location: Madrid, Madrid, Spain
- Job Type: Full-time
- Category: DevOps & Infrastructure
- Date Posted: June 11, 2025
- Experience Level: 10+ years
- Remote Status: Hybrid
🚀 Role Summary
- Lead and drive the adoption of SRE industry best practices in a Security and compliance-centric delivery model.
- Oversee all operations and SRE functions, including incident response and forward-thinking monitoring.
- Collaborate with development, architecture, and security teams to ensure high-quality products and enterprise-grade practices.
- Recruit, manage, and inspire a proficient cloud engineering and SRE team.
📝 Enhancement Note: This role requires a strong background in cloud operations engineering and SRE team management, with a focus on security and compliance. Familiarity with modern CI/CD tools and infrastructure-as-code practices is essential.
💻 Primary Responsibilities
- Incident Response & Monitoring: In charge of incident response and forward-thinking monitoring to ensure service availability and performance.
- Capacity Forecasting & Change Management: Drive capacity forecasting and change management processes to proactively address potential issues.
- Automation & Infrastructure-as-Code: Implement automation for delivery and operations of platform services using infrastructure-as-code and monitoring-as-code.
- Service Availability & Scalability: Tasked with building and managing service availability, performance, and scalability in production environments to enable business-defined SLAs.
- Collaboration & Stakeholder Management: Collaborate with application and business stakeholders to ensure a high-quality product is developed and deployed in production.
- Compliance & Evidence-Gathering: Own and drive compliance and evidence-gathering activities for audits and regulated deployments.
- Team Management & Development: Recruit, manage, and inspire a proficient cloud engineering and SRE team.
📝 Enhancement Note: This role requires a balance of technical depth and leadership skills, with a strong focus on incident management, capacity planning, and process improvement.
🎓 Skills & Qualifications
Education: Degree in Computer Science or Engineering or equivalent professional experience.
Experience: 10+ years in cloud operations engineering leadership roles in SaaS companies, with 5+ years in a senior management/leadership role, leading large SRE and Cloud Operations teams.
Required Skills:
- Deep understanding and experience working with one of the three major Cloud Service Providers (AWS, Azure, or GCP) running native cloud technologies based on Docker, Kubernetes, Istio, Kafka at scale.
- Experience working with modern CI/CD and automation tools such as Jenkins, Ansible, Terraform, etc.
- Experience building, scaling, and monitoring infrastructure needed for SaaS-based applications and services. Experience with APM and infrastructure monitoring tools such as Datadog, New Relic, Sumo Logic, Splunk, Dynatrace, etc.
- Managed on-call 24x7 rotation teams to serve global customers.
- Experience creating a strong and passionate customer-focused SRE-driven operations culture.
Preferred Skills:
- Experience operating workloads in a secured, highly regulated environment such as FedRAMP.
- Knowledge of lean and agile software engineering best practices.
- Excellent interpersonal and communication skills in English.
📝 Enhancement Note: This role requires a strong technical background in cloud operations and SRE, with a focus on incident management, capacity planning, and process improvement. Familiarity with modern CI/CD tools and infrastructure-as-code practices is essential.
📊 Web Portfolio & Project Requirements
Portfolio Essentials:
- Specific examples of incident response and monitoring strategies implemented in previous roles.
- Documentation of capacity forecasting and change management processes.
- Case studies demonstrating successful service availability, performance, and scalability improvements.
- Evidence of compliance and evidence-gathering activities for audits and regulated deployments.
Technical Documentation:
- Code quality, commenting, and documentation standards for infrastructure-as-code and monitoring-as-code implementations.
- Version control, deployment processes, and server configuration management strategies.
- Testing methodologies, performance metrics, and optimization techniques for SRE functions.
📝 Enhancement Note: As this role focuses on SRE and cloud operations management, the portfolio should emphasize incident management, capacity planning, and process improvement. Include specific examples of how you have driven SRE best practices and improved service availability and performance in previous roles.
💵 Compensation & Benefits
Salary Range: €80,000 - €120,000 per year (based on regional market data for senior cloud operations and SRE roles in Madrid)
Benefits:
- Permanent Contract and a competitive compensation package (Stock Options also included).
- Amazing centrally located offices near the Bernabeu Stadium.
- Private Health Insurance (Sanitas) and daily meal vouchers of €11 entirely covered by the company.
- Hybrid work model balancing office and remote work, with a structured approach for new hires to foster connections and onboarding.
- Flexible Hours and unlimited vacation (employees have unlimited paid time off on top of the 23 days of holidays offered) plus 3 company-paid volunteer days.
- Up to €25 per month for a gym subscription.
- Flexible compensation plan for childcare & public transportation.
- Reimbursement of up to 50% of the cost of English & Spanish classes.
- Fresh fruit, cookies, soft drinks, and protein shakes at the office.
- Regular company and team events like Pizza talks, Team Building activities, Christmas parties, hosting Meetups at the office, and more!
- Bonuses for referring successful hires after three months of continuous employment.
- Relocation package for people coming from another country.
📝 Enhancement Note: The salary range for this role is estimated based on regional market data for senior cloud operations and SRE roles in Madrid. Benefits are comprehensive and tailored to the needs of web technology professionals.
🎯 Team & Company Context
🏢 Company Culture
Industry: Nexthink is the leader in digital employee experience management software, operating in the IT market category (DEX) and shaping the future of how the world works.
Company Size: Nexthink has over 1,300 customers and 1,000 employees across 5 continents, operating as one team with a commitment to diversity, inclusion, and equity.
Founded: Nexthink was founded in 2004 and is dual-headquartered in Lausanne, Switzerland, and Boston, Massachusetts.
Team Structure:
- The SRE team will report directly to the Senior Manager of SRE and collaborate with development, architecture, and security teams.
- The team will consist of cloud engineers and SRE professionals responsible for incident response, monitoring, automation, and capacity management.
Development Methodology:
- Nexthink follows lean and agile software engineering best practices, focusing on continuous improvement and customer satisfaction.
- The company uses modern CI/CD tools and infrastructure-as-code practices to ensure high-quality products and enterprise-grade services.
Company Website: Nexthink
📝 Enhancement Note: Nexthink's company culture emphasizes innovation, customer focus, and continuous learning, with a strong commitment to diversity, inclusion, and equity. The SRE team will play a crucial role in driving these values and ensuring the company's digital employee experience management software meets the highest standards.
📈 Career & Growth Analysis
Web Technology Career Level: Senior Manager of SRE is a senior leadership role, responsible for driving SRE best practices and managing a team of cloud engineers and SRE professionals.
Reporting Structure: The Senior Manager of SRE will report directly to the Head of Cloud Engineering and collaborate with development, architecture, and security teams.
Technical Impact: The Senior Manager of SRE will have a significant impact on the company's digital employee experience management software, ensuring high-quality products and enterprise-grade services. They will drive SRE best practices, improve service availability and performance, and enhance the company's security and compliance posture.
Growth Opportunities:
- Technical Growth: Deepen expertise in cloud operations, SRE best practices, and emerging technologies such as Kubernetes, Istio, and Kafka.
- Leadership Development: Expand leadership skills by managing a team of cloud engineers and SRE professionals, driving team growth, and fostering a customer-focused culture.
- Architecture & Design: Contribute to the design and architecture of the company's digital employee experience management software, ensuring it meets the highest standards for scalability, performance, and security.
📝 Enhancement Note: This role offers significant growth opportunities for technical and leadership development, with the potential to make a substantial impact on the company's digital employee experience management software and security posture.
🌐 Work Environment
Office Type: Nexthink's Madrid office is centrally located near the Bernabeu Stadium, offering a modern and collaborative workspace for web technology professionals.
Office Location(s): Madrid, Spain
Workspace Context:
- Collaboration: The office fosters a collaborative environment, with dedicated spaces for team meetings, brainstorming sessions, and social events.
- Tools & Equipment: Nexthink provides state-of-the-art development tools, multiple monitors, and testing devices to ensure high productivity and code quality.
- Team Interaction: The office encourages cross-functional collaboration between web technology teams, designers, and stakeholders, fostering a culture of innovation and continuous learning.
Work Schedule: Nexthink offers a hybrid work model, balancing office and remote work with a structured approach for new hires to foster connections and onboarding. The work schedule is flexible, with unlimited vacation and 3 company-paid volunteer days.
📝 Enhancement Note: Nexthink's work environment emphasizes collaboration, innovation, and continuous learning, with a strong focus on fostering a customer-focused culture and driving technical excellence.
📄 Application & Technical Interview Process
Interview Process:
- Technical Assessment: A hands-on technical assessment focused on cloud operations, SRE best practices, and incident management strategies.
- Behavioral & Cultural Fit: An in-depth discussion of the candidate's leadership style, team management approach, and cultural fit with Nexthink's values.
- Final Evaluation: A final evaluation with the Head of Cloud Engineering, focusing on the candidate's strategic vision for the SRE team and their ability to drive SRE best practices.
Portfolio Review Tips:
- Incident Management: Highlight specific examples of incident response and monitoring strategies implemented in previous roles, demonstrating the candidate's ability to drive SRE best practices and improve service availability and performance.
- Capacity Planning: Showcase the candidate's capacity planning and change management processes, emphasizing their ability to proactively address potential issues and ensure business-defined SLAs.
- Compliance & Security: Demonstrate the candidate's experience with compliance and evidence-gathering activities for audits and regulated deployments, highlighting their commitment to security and enterprise-grade practices.
Technical Challenge Preparation:
- Cloud Operations & SRE: Brush up on cloud operations and SRE best practices, with a focus on incident management, capacity planning, and process improvement.
- Leadership & Team Management: Prepare for behavioral and cultural fit interviews, emphasizing the candidate's leadership style, team management approach, and commitment to driving SRE best practices.
- Company-Specific Knowledge: Research Nexthink's digital employee experience management software, understanding its architecture, features, and competitive advantages.
📝 Enhancement Note: The interview process for this role will focus on the candidate's technical depth in cloud operations and SRE, as well as their leadership skills, team management approach, and cultural fit with Nexthink's values. The portfolio review and technical challenge preparation should emphasize incident management, capacity planning, and process improvement.
🛠 Technology Stack & Web Infrastructure
Cloud Service Providers: Familiarity with one of the three major Cloud Service Providers (AWS, Azure, or GCP) running native cloud technologies based on Docker, Kubernetes, Istio, and Kafka at scale.
Infrastructure-as-Code & Monitoring: Experience with infrastructure-as-code tools such as Terraform, Ansible, and monitoring-as-code tools such as Prometheus, Grafana, and ELK Stack.
CI/CD & Automation: Familiarity with modern CI/CD tools such as Jenkins, GitLab CI/CD, and automation tools such as Ansible, Puppet, and Chef.
APM & Infrastructure Monitoring: Experience with APM and infrastructure monitoring tools such as Datadog, New Relic, Sumo Logic, Splunk, and Dynatrace.
📝 Enhancement Note: This role requires a strong background in cloud operations and SRE, with a focus on incident management, capacity planning, and process improvement. Familiarity with modern CI/CD tools, infrastructure-as-code practices, and APM and infrastructure monitoring tools is essential.
👥 Team Culture & Values
Web Development Values:
- Customer Focus: Nexthink's digital employee experience management software is designed to meet the highest standards for user experience, performance, and security.
- Innovation: Nexthink fosters a culture of innovation, encouraging web technology professionals to explore emerging technologies and drive continuous improvement.
- Collaboration: Nexthink emphasizes cross-functional collaboration between web technology teams, designers, and stakeholders, fostering a culture of innovation and continuous learning.
- Technical Excellence: Nexthink is committed to driving technical excellence in web technology, ensuring high-quality products and enterprise-grade services.
Collaboration Style:
- Cross-Functional Integration: Nexthink encourages cross-functional integration between web technology teams, designers, and stakeholders, fostering a culture of innovation and continuous learning.
- Code Review & Peer Programming: Nexthink follows lean and agile software engineering best practices, emphasizing code review and peer programming to ensure high-quality products and enterprise-grade services.
- Knowledge Sharing & Mentoring: Nexthink fosters a culture of knowledge sharing and mentoring, with a strong commitment to driving technical excellence and continuous learning.
📝 Enhancement Note: Nexthink's web development values emphasize customer focus, innovation, collaboration, and technical excellence. The collaboration style encourages cross-functional integration, code review, and peer programming, fostering a culture of knowledge sharing and mentoring.
⚡ Challenges & Growth Opportunities
Technical Challenges:
- Incident Management: Develop and implement incident response and monitoring strategies to ensure high service availability and performance.
- Capacity Planning: Drive capacity forecasting and change management processes to proactively address potential issues and ensure business-defined SLAs.
- Compliance & Security: Own and drive compliance and evidence-gathering activities for audits and regulated deployments, ensuring enterprise-grade practices and security posture.
- Emerging Technologies: Stay up-to-date with emerging technologies such as Kubernetes, Istio, and Kafka, and integrate them into Nexthink's digital employee experience management software.
Learning & Development Opportunities:
- Technical Skill Development: Deepen expertise in cloud operations, SRE best practices, and emerging technologies such as Kubernetes, Istio, and Kafka.
- Leadership Development: Expand leadership skills by managing a team of cloud engineers and SRE professionals, driving team growth, and fostering a customer-focused culture.
- Architecture & Design: Contribute to the design and architecture of Nexthink's digital employee experience management software, ensuring it meets the highest standards for scalability, performance, and security.
📝 Enhancement Note: This role offers significant technical and leadership development opportunities, with a focus on incident management, capacity planning, and process improvement. Familiarity with modern CI/CD tools, infrastructure-as-code practices, and APM and infrastructure monitoring tools is essential.
💡 Interview Preparation
Technical Questions:
- Cloud Operations & SRE: Prepare for technical questions focused on cloud operations, SRE best practices, incident management strategies, capacity planning, and process improvement.
- Leadership & Team Management: Prepare for behavioral and cultural fit interviews, emphasizing the candidate's leadership style, team management approach, and commitment to driving SRE best practices.
- Company-Specific Knowledge: Research Nexthink's digital employee experience management software, understanding its architecture, features, and competitive advantages.
Company & Culture Questions:
- Prepare for questions about Nexthink's company culture, values, and commitment to driving technical excellence in web technology.
- Research Nexthink's digital employee experience management software, understanding its market position, customer base, and competitive advantages.
- Prepare for questions about the candidate's strategic vision for the SRE team and their ability to drive SRE best practices and improve service availability and performance.
Portfolio Presentation Strategy:
- Incident Management: Highlight specific examples of incident response and monitoring strategies implemented in previous roles, demonstrating the candidate's ability to drive SRE best practices and improve service availability and performance.
- Capacity Planning: Showcase the candidate's capacity planning and change management processes, emphasizing their ability to proactively address potential issues and ensure business-defined SLAs.
- Compliance & Security: Demonstrate the candidate's experience with compliance and evidence-gathering activities for audits and regulated deployments, highlighting their commitment to security and enterprise-grade practices.
- Company-Specific Knowledge: Tailor the portfolio presentation to Nexthink's digital employee experience management software, emphasizing the candidate's understanding of its architecture, features, and competitive advantages.
📝 Enhancement Note: The interview process for this role will focus on the candidate's technical depth in cloud operations and SRE, as well as their leadership skills, team management approach, and cultural fit with Nexthink's values. The portfolio presentation strategy should emphasize incident management, capacity planning, and process improvement.
📌 Application Steps
To apply for this Senior Manager of SRE position:
- Portfolio Customization: Tailor your portfolio to highlight specific examples of incident response and monitoring strategies, capacity planning processes, and compliance and evidence-gathering activities for audits and regulated deployments.
- Resume Optimization: Optimize your resume for web technology roles, emphasizing your experience with cloud operations, SRE best practices, incident management, capacity planning, and process improvement.
- Technical Interview Preparation: Brush up on cloud operations and SRE best practices, with a focus on incident management, capacity planning, and process improvement. Prepare for behavioral and cultural fit interviews, emphasizing your leadership style, team management approach, and commitment to driving SRE best practices.
- Company Research: Research Nexthink's digital employee experience management software, understanding its architecture, features, and competitive advantages. Familiarize yourself with Nexthink's company culture, values, and commitment to driving technical excellence in web technology.
⚠️ Important Notice: This enhanced job description includes AI-generated insights and web technology industry-standard assumptions. All details should be verified directly with the hiring organization before making application decisions.
Application Requirements
Candidates should have a degree in Computer Science or Engineering and over 10 years of experience in cloud operations engineering leadership roles. A strong background in managing SRE teams and familiarity with major cloud service providers is essential.