Site Reliability Engineer - Platform Engineering - STACKIT (m/f/d)

Schwarz
Full_timeBucharest, Romania

📍 Job Overview

  • Job Title: Site Reliability Engineer - Platform Engineering - STACKIT (m/f/d)
  • Company: Schwarz IT
  • Location: Bucharest, Romania
  • Job Type: On-site
  • Category: DevOps Engineer
  • Date Posted: June 20, 2025
  • Experience Level: Mid-Senior level (2-5 years)
  • Remote Status: On-site

🚀 Role Summary

  • Design, develop, and maintain high-quality, scalable, and reliable code for software products based on Kubernetes.
  • Participate in the entire software development lifecycle, from requirements analysis to design, deployment, and maintenance.
  • Conduct code reviews to ensure code quality and promote knowledge sharing within the team.
  • Actively participate in design and architecture discussions to advance the technical vision and roadmap of products.
  • Eliminate toil by continuously working on improving and automating processes.

📝 Enhancement Note: This role requires a strong background in software development and a deep understanding of Site Reliability Engineering (SRE) principles to ensure the reliability and scalability of software products.

💻 Primary Responsibilities

  • Design and Development: Develop high-quality, scalable, and reliable code for software products using Kubernetes.
  • Software Development Lifecycle: Participate in the entire software development lifecycle, from requirements analysis to deployment and maintenance.
  • Code Reviews: Ensure code quality and promote knowledge sharing within the team through code reviews.
  • Architecture and Design: Actively participate in design and architecture discussions to advance the technical vision and roadmap of products.
  • Process Improvement: Continuously work on improving and automating processes to eliminate toil.

📝 Enhancement Note: This role requires a strong focus on software development best practices, including code quality, performance optimization, and responsive design principles.

🎓 Skills & Qualifications

Education: A bachelor's degree in Computer Science, Software Engineering, or a related field. Relevant experience may be considered in lieu of a degree.

Experience: Proven experience (2-5 years) in building and operating large-scale distributed systems, with a strong focus on Site Reliability Engineering (SRE) principles.

Required Skills:

  • Excellent knowledge of one or more of the following programming languages: C, C++, Go (Golang), Rust.
  • Proficient in Kubernetes and containerization technologies.
  • Strong understanding of Site Reliability Engineering (SRE) principles, including monitoring, alerting, error budgets, and error analysis.
  • Experience with distributed systems and large-scale infrastructure.
  • Proactive, reliable, and able to work in an agile environment.
  • Willingness to work with international teams.

Preferred Skills:

  • Experience with cloud platforms (e.g., AWS, GCP, Azure).
  • Familiarity with infrastructure as code (IaC) tools (e.g., Terraform, CloudFormation).
  • Knowledge of CI/CD pipelines and DevOps best practices.
  • Familiarity with European data protection regulations and standards.

📝 Enhancement Note: This role requires a strong technical background in software development and SRE, with a focus on large-scale distributed systems and infrastructure management.

📊 Web Portfolio & Project Requirements

Portfolio Essentials:

  • Demonstrate your ability to design, develop, and maintain high-quality, scalable, and reliable code for software products based on Kubernetes.
  • Showcase your experience with distributed systems and large-scale infrastructure through relevant projects or case studies.
  • Highlight your understanding of Site Reliability Engineering (SRE) principles, including monitoring, alerting, error budgets, and error analysis.

Technical Documentation:

  • Provide clear and concise documentation for your projects, including code comments, version control, deployment processes, and server configuration.
  • Include testing methodologies, performance metrics, and optimization techniques to demonstrate your commitment to code quality and reliability.

📝 Enhancement Note: This role requires a strong focus on technical documentation and code quality to ensure the reliability and scalability of software products.

💵 Compensation & Benefits

Salary Range: The estimated salary range for this role in Bucharest, Romania is between 15,000 RON and 25,000 RON per month, based on market research and industry standards for mid-senior level DevOps engineers with a strong focus on software development and SRE.

Benefits:

  • Competitive salary and benefits package.
  • Opportunity to work with international teams and shape the future of Europe's digital infrastructure.
  • Collaborative and agile work environment focused on continuous learning and improvement.

Working Hours: Full-time position with standard working hours, including flexibility for deployment windows, maintenance, and project deadlines.

📝 Enhancement Note: The salary range provided is an estimate based on market research and industry standards for the given role and location. Actual salary may vary based on experience, skills, and company-specific compensation policies.

🎯 Team & Company Context

🏢 Company Culture

Industry: Schwarz IT is responsible for the entire digital infrastructure and all software solutions of the companies within Schwarz Group. It operates as a cloud and colocation provider, offering secure infrastructure and individual solutions to both internal and external customers.

Company Size: Schwarz IT is a large organization with a significant presence in the European market, providing ample opportunities for growth and collaboration within a diverse team.

Founded: Schwarz IT is a division of Schwarz Group, which was founded in 1913. The IT division has been operating since 1970, providing IT solutions and infrastructure to the group's various businesses.

Team Structure:

  • The STACKIT Products division is responsible for developing, testing, and improving products and services.
  • The team consists of dedicated STACKITEERS who work collaboratively to design, develop, and maintain high-quality, scalable, and reliable code for software products based on Kubernetes.
  • The team follows Agile methodologies and promotes a culture of continuous learning and improvement.

Development Methodology:

  • Agile/Scrum methodologies are used for project management and sprint planning.
  • Code reviews are conducted to ensure code quality and promote knowledge sharing within the team.
  • CI/CD pipelines and automated deployment strategies are employed to streamline the development process.
  • Deployment strategies are designed to minimize downtime and ensure high availability.

Company Website: www.careers.it.schwarz

📝 Enhancement Note: Schwarz IT operates as a large, established organization within the Schwarz Group, providing ample opportunities for growth and collaboration within a diverse team focused on digital infrastructure and software solutions.

📈 Career & Growth Analysis

Web Technology Career Level: This role is at the mid-senior level, focusing on software development and Site Reliability Engineering (SRE) within a large organization. Responsibilities include designing, developing, and maintaining high-quality, scalable, and reliable code for software products based on Kubernetes, as well as participating in the entire software development lifecycle.

Reporting Structure: This role reports directly to the STACKIT Products division, working collaboratively with other STACKITEERS to design, develop, and maintain high-quality, scalable, and reliable code for software products based on Kubernetes.

Technical Impact: This role has a significant impact on the reliability and scalability of software products, ensuring that they meet the needs of both internal and external customers. It also contributes to the continuous improvement of processes and the advancement of the technical vision and roadmap of products.

Growth Opportunities:

  • Technical Growth: Develop your skills in software development, SRE, and large-scale distributed systems to take on more complex projects and leadership roles.
  • Career Progression: Advance your career within the STACKIT Products division or explore opportunities within other divisions of Schwarz IT, such as STACKIT Infrastructure or STACKIT Security.
  • International Opportunities: Work with international teams and explore opportunities to collaborate with external customers, expanding your professional network and gaining diverse experiences.

📝 Enhancement Note: This role offers significant opportunities for growth and career advancement within a large, established organization focused on digital infrastructure and software solutions.

🌐 Work Environment

Office Type: Schwarz IT operates in a modern, collaborative work environment focused on continuous learning and improvement. The office is designed to facilitate teamwork and communication, with state-of-the-art technology and tools to support productivity.

Office Location(s): The primary office location for this role is in Bucharest, Romania. Schwarz IT also has offices in other European locations, providing opportunities for collaboration and growth within a diverse team.

Workspace Context:

  • Collaborative Environment: Schwarz IT fosters a collaborative work environment, with dedicated spaces for team meetings, brainstorming sessions, and social events.
  • Technology and Tools: The office is equipped with modern technology and tools to support productivity, including high-speed internet, multiple monitors, and testing devices.
  • Cross-Functional Collaboration: Schwarz IT encourages collaboration across different teams and disciplines, providing opportunities to work with designers, marketers, and other stakeholders to develop effective IT solutions.

Work Schedule: Full-time position with standard working hours, including flexibility for deployment windows, maintenance, and project deadlines. Schwarz IT offers a flexible work arrangement that allows employees to balance their work and personal lives.

📝 Enhancement Note: Schwarz IT provides a modern, collaborative work environment focused on continuous learning and improvement, with ample opportunities for growth and collaboration within a diverse team.

📄 Application & Technical Interview Process

Interview Process:

  1. Technical Assessment: A technical assessment or coding challenge focused on software development and SRE principles, such as designing, developing, and maintaining high-quality, scalable, and reliable code for software products based on Kubernetes.
  2. Architecture and Design Discussion: A discussion focused on architecture and design principles, including the technical vision and roadmap of products, as well as the ability to work collaboratively with other STACKITEERS.
  3. Team Fit Assessment: An assessment of cultural fit within the STACKIT Products division, focusing on the ability to work collaboratively and effectively within a diverse team.
  4. Final Evaluation: A final evaluation based on the candidate's technical skills, cultural fit, and alignment with the company's values and mission.

Portfolio Review Tips:

  • Project Selection: Choose projects that demonstrate your ability to design, develop, and maintain high-quality, scalable, and reliable code for software products based on Kubernetes.
  • Code Quality: Ensure that your code is well-documented, follows best practices, and is optimized for performance and reliability.
  • Architecture and Design: Highlight your understanding of architecture and design principles, including the ability to work collaboratively with other STACKITEERS to advance the technical vision and roadmap of products.
  • User Experience: Demonstrate your ability to consider the user experience when designing, developing, and maintaining software products, including accessibility and responsiveness.

Technical Challenge Preparation:

  • Technical Fundamentals: Brush up on your technical fundamentals, including software development, SRE principles, and large-scale distributed systems.
  • Problem-Solving: Practice problem-solving techniques and algorithms to ensure that you can design, develop, and maintain high-quality, scalable, and reliable code for software products based on Kubernetes.
  • Communication Skills: Prepare to communicate your technical solutions effectively, including the ability to explain complex concepts to non-technical stakeholders.

ATS Keywords: (See the comprehensive list of web development and server administration-relevant keywords for resume optimization, organized by category, at the end of this document)

📝 Enhancement Note: The interview process for this role is designed to assess the candidate's technical skills, cultural fit, and alignment with the company's values and mission, with a strong focus on software development and SRE principles.

🛠 Technology Stack & Web Infrastructure

Frontend Technologies: (Not applicable for this role, as it focuses on software development and SRE)

Backend & Server Technologies:

  • Programming Languages: C, C++, Go (Golang), Rust
  • Containerization: Kubernetes
  • Monitoring and Alerting: Prometheus, Grafana, or other relevant tools
  • Error Budget and Error Analysis: Custom tools or industry-standard solutions

Development & DevOps Tools:

  • Version Control: Git
  • CI/CD Pipelines: Jenkins, GitLab CI/CD, or other relevant tools
  • Infrastructure as Code (IaC): Terraform, CloudFormation, or other relevant tools
  • Configuration Management: Ansible, Puppet, or other relevant tools

📝 Enhancement Note: This role requires a strong background in software development and SRE, with a focus on large-scale distributed systems and infrastructure management. The technology stack includes a variety of programming languages, containerization technologies, monitoring and alerting tools, and development and DevOps tools.

👥 Team Culture & Values

Web Development Values:

  • Quality: Schwarz IT is committed to delivering high-quality, scalable, and reliable software products that meet the needs of both internal and external customers.
  • Collaboration: Schwarz IT fosters a collaborative work environment, with dedicated spaces for team meetings, brainstorming sessions, and social events.
  • Continuous Learning: Schwarz IT encourages continuous learning and improvement, with regular training and development opportunities to help employees grow both personally and professionally.
  • Innovation: Schwarz IT promotes a culture of innovation, with a focus on staying at the forefront of technology and exploring new solutions to meet the evolving needs of the business.

Collaboration Style:

  • Cross-Functional Integration: Schwarz IT encourages collaboration across different teams and disciplines, providing opportunities to work with designers, marketers, and other stakeholders to develop effective IT solutions.
  • Code Review Culture: Schwarz IT promotes a code review culture, with a focus on ensuring code quality and promoting knowledge sharing within the team.
  • Peer Programming: Schwarz IT encourages peer programming, with a focus on learning from one another and improving the overall quality of the codebase.

📝 Enhancement Note: Schwarz IT fosters a collaborative work environment focused on continuous learning, improvement, and innovation, with a strong commitment to delivering high-quality, scalable, and reliable software products that meet the needs of both internal and external customers.

⚡ Challenges & Growth Opportunities

Technical Challenges:

  • Scalability: Design, develop, and maintain high-quality, scalable, and reliable code for software products based on Kubernetes, ensuring that they can handle the growing demands of both internal and external customers.
  • Reliability: Ensure the reliability and availability of software products, with a focus on minimizing downtime and maximizing performance.
  • Performance Optimization: Continuously optimize the performance of software products, with a focus on improving response times, reducing latency, and enhancing user experience.
  • Emerging Technologies: Stay up-to-date with emerging technologies and trends in software development and SRE, and explore opportunities to incorporate new solutions into the software products and infrastructure.

Learning & Development Opportunities:

  • Technical Skill Development: Develop your skills in software development, SRE, and large-scale distributed systems through training, workshops, and on-the-job experiences.
  • Certification and Community Involvement: Pursue relevant certifications and engage with the local tech community to expand your professional network and gain diverse experiences.
  • Mentorship and Leadership Development: Seek mentorship opportunities within Schwarz IT to develop your leadership skills and advance your career within the organization.

📝 Enhancement Note: This role presents significant technical challenges and growth opportunities, with a focus on designing, developing, and maintaining high-quality, scalable, and reliable code for software products based on Kubernetes, as well as staying up-to-date with emerging technologies and trends in software development and SRE.

💡 Interview Preparation

Technical Questions:

  • Software Development: Prepare for technical questions focused on software development, including designing, developing, and maintaining high-quality, scalable, and reliable code for software products based on Kubernetes.
  • SRE Principles: Brush up on your understanding of SRE principles, including monitoring, alerting, error budgets, and error analysis, and be prepared to discuss how you have applied these principles in previous projects.
  • Large-Scale Distributed Systems: Prepare for technical questions focused on large-scale distributed systems, including the design, development, and maintenance of high-quality, scalable, and reliable code for software products based on Kubernetes.

Company & Culture Questions:

  • Company Values: Research Schwarz IT's values and be prepared to discuss how you align with the company's commitment to quality, collaboration, continuous learning, and innovation.
  • Team Dynamics: Prepare for questions focused on team dynamics, including your ability to work collaboratively with other STACKITEERS and contribute to the advancement of the technical vision and roadmap of products.
  • User Experience: Prepare to discuss your understanding of user experience and how you have considered the user experience when designing, developing, and maintaining software products.

Portfolio Presentation Strategy:

  • Live Demonstration: Prepare a live demonstration of your software products, highlighting your ability to design, develop, and maintain high-quality, scalable, and reliable code for software products based on Kubernetes.
  • Code Walkthrough: Prepare a detailed walkthrough of your code, including your approach to architecture and design, as well as your commitment to code quality and performance optimization.
  • User Experience Showcase: Prepare a showcase of your user experience design and interface development, including your ability to consider the user experience when designing, developing, and maintaining software products.

📝 Enhancement Note: The interview process for this role is designed to assess the candidate's technical skills, cultural fit, and alignment with the company's values and mission, with a strong focus on software development and SRE principles.

📌 Application Steps

To apply for this Site Reliability Engineer - Platform Engineering - STACKIT (m/f/d) position:

  1. Customize Your Portfolio: Tailor your portfolio to highlight your ability to design, develop, and maintain high-quality, scalable, and reliable code for software products based on Kubernetes, with a focus on software development and SRE principles.
  2. Optimize Your Resume: Highlight your technical skills and experience in software development, SRE, and large-scale distributed systems, as well as your ability to work collaboratively with other STACKITEERS and contribute to the advancement of the technical vision and roadmap of products.
  3. Prepare for Technical Challenges: Brush up on your technical fundamentals, including software development, SRE principles, and large-scale distributed systems, and practice problem-solving techniques and algorithms to ensure that you can design, develop, and maintain high-quality, scalable, and reliable code for software products based on Kubernetes.
  4. Research the Company: Familiarize yourself with Schwarz IT's values, culture, and mission, and be prepared to discuss how you align with the company's commitment to quality, collaboration, continuous learning, and innovation.

⚠️ Important Notice: This enhanced job description includes AI-generated insights and web development/server administration industry-standard assumptions. All details should be verified directly with the hiring organization before making application decisions.

🛠 ATS Keywords

Programming Languages:

  • C
  • C++
  • Go (Golang)
  • Rust

Web Frameworks:

  • Kubernetes

Server Technologies:

  • Prometheus
  • Grafana
  • Terraform
  • CloudFormation
  • Ansible
  • Puppet

Databases:

  • (Not applicable for this role, as it focuses on software development and SRE)

Tools:

  • Git
  • Jenkins
  • GitLab CI/CD
  • Infrastructure as Code (IaC)
  • Configuration Management

Methodologies:

  • Agile/Scrum
  • Site Reliability Engineering (SRE)
  • Infrastructure as Code (IaC)
  • Continuous Integration/Continuous Deployment (CI/CD)

Soft Skills:

  • Collaboration
  • Communication
  • Problem-Solving
  • Adaptability
  • Leadership

Industry Terms:

  • Software Development
  • Site Reliability Engineering (SRE)
  • Large-Scale Distributed Systems
  • Containerization
  • Monitoring and Alerting
  • Error Budget and Error Analysis
  • Infrastructure as Code (IaC)
  • Configuration Management
  • Technical Vision and Roadmap
  • User Experience
  • Performance Optimization
  • Scalability
  • Reliability
  • Emerging Technologies
  • Technical Skill Development
  • Certification and Community Involvement
  • Mentorship and Leadership Development

Application Requirements

You have excellent knowledge of one of the following programming languages: C, C++, Go (Golang), Rust. You have experience in building and operating large-scale distributed systems and a comprehensive understanding of the principles of Site Reliability Engineering (SRE).