STACKIT CLOUD TECH LEAD SITE RELIABILITY ENGINEER STORAGE (m/f/d)

Schwarz
Full-time
Sofia, Bulgaria

📍 Job Overview

  • Job Title: STACKIT Cloud Tech Lead - Site Reliability Engineer - Storage (m/f/d)
  • Company: Schwarz
  • Location: Sofia, Sofia-Grad, Bulgaria
  • Job Type: On-site
  • Category: DevOps Engineer, System Administrator
  • Date Posted: 2025-06-26
  • Experience Level: 5-10 years

🚀 Role Summary

  • 📝 Enhancement Note: As a Cloud Tech Lead - Site Reliability Engineer - Storage, you'll play a pivotal role in maintaining and optimizing the stability and availability of Schwarz's storage infrastructure. You'll work with cutting-edge cloud technologies, automate processes, and collaborate with cross-functional teams to ensure high system reliability.

💻 Primary Responsibilities

  • 📝 Enhancement Note: Your primary responsibilities will revolve around ensuring the stability and reliability of Schwarz's storage infrastructure. You'll proactively monitor systems, troubleshoot incidents, and optimize performance to meet future scaling demands.

🔑 Key Responsibilities

  • Stability & Reliability: Maintain and optimize the stability and availability of the highly available, resilient storage infrastructure (Block, Object, Backup, and File Storage).
  • Automation: Automate provisioning and operating processes in the storage environment to continuously optimize products.
  • Architecture: Collaborate with your team to design a robust and efficient storage architecture for long-term stability and reliability.
  • End-to-End Responsibility: Take ownership of the products provided to customers and practice end-to-end responsibility, supported by internal STACKIT service teams.
  • Performance & Capacity Planning: Analyze and optimize system performance, and plan for future capacity needs.
  • Incident & Post-Mortem Analysis: Handle incidents, including major ones, that involve storage, derive mitigating measures, and see them through to implementation.

🎓 Skills & Qualifications

Education: A Bachelor's degree in Computer Science, IT, or a related field is preferred. Relevant experience may substitute for formal education.

Experience: Proven experience (5-10 years) in operating and managing storage infrastructure, with a strong focus on cloud environments and their architectures.

Required Skills:

  • Storage Products: Experience with various storage products (e.g., NetApp, Cohesity, Pure, Ceph) in the area of Block, Object, Backup, or File Storage.
  • Cloud Environments: Proficiency in cloud environments and their architectures.
  • Storage Infrastructure Operation: Expertise in operating storage infrastructure, including solution scenarios, provisioning, scaling, migration, and incident response.
  • Automation: Experience with automation languages and tools (e.g., Golang, Python, Bash, Ansible) to streamline storage infrastructure management.
  • Containerized Systems: Familiarity with containerized system landscapes (e.g., k8s) in the storage environment.
  • Monitoring & Logging: Experience with monitoring, alerting, and logging tools (e.g., Prometheus, Grafana, Elasticsearch) to ensure complete system monitoring.
  • API Development: Experience working with APIs and extending them (e.g., REST APIs in Golang and Python).
  • Troubleshooting & Performance Analysis: Strong troubleshooting skills and experience in performance analysis, high availability, lifecycle management, and incident response.
  • Communication Skills: Excellent communication skills in English (and optionally German) for successful cooperation in international, agile teams.

Preferred Skills:

  • Experience with incident management processes and problem management methodologies.
  • Knowledge of infrastructure as code (IaC) tools (e.g., Terraform, CloudFormation).
  • Familiarity with infrastructure automation and configuration management tools (e.g., Ansible, Puppet, Chef).
  • Experience with version control systems (e.g., Git) and CI/CD pipelines.

📊 Portfolio & Project Requirements

  • 📝 Enhancement Note: As a candidate for a DevOps Engineer or System Administrator role, you should build a portfolio that demonstrates your expertise in storage infrastructure management, automation, and optimization. Highlight projects that showcase your ability to maintain and improve system reliability, performance, and scalability.

Portfolio Essentials:

  • Storage Infrastructure Projects: Showcase projects that demonstrate your experience with various storage products and cloud environments.
  • Automation & Scripting: Include examples of automated provisioning and operating processes using tools like Golang, Python, Bash, or Ansible.
  • Incident Response & Troubleshooting: Highlight projects where you successfully troubleshot and resolved storage-related incidents, improving system reliability.
  • Performance Optimization: Demonstrate your ability to analyze and optimize storage system performance, with a focus on future scaling and capacity planning.

Technical Documentation:

  • Code Quality & Documentation: Showcase well-commented and documented code, adhering to best practices and industry standards.
  • Version Control & Deployment: Demonstrate experience with version control systems (e.g., Git) and CI/CD pipelines for automated deployment and rollback.
  • Testing & Performance Metrics: Include examples of testing methodologies, performance metrics, and optimization techniques to ensure system reliability and scalability.

💵 Compensation & Benefits

Salary Range: The estimated salary range for this role in Sofia, Bulgaria is between 35,000 BGN (gross) and 50,000 BGN (gross) per year, based on experience and market research. This range is inclusive of bonuses and other compensation components.

Benefits:

  • 📝 Enhancement Note: Benefits information was not provided in the job listing. However, as a large international organization, Schwarz is likely to offer competitive benefits packages, including health insurance, retirement plans, and employee discounts.

🎯 Team & Company Context

🏢 Company Culture

Industry: Schwarz operates in the retail and technology sectors, with a strong focus on innovation and digital transformation. As a tech lead in the storage team, you'll work at the intersection of these industries, driving technological advancements that support the company's growth and success.

Company Size: Schwarz is a large, multinational organization with over 500,000 employees across more than 30 countries in Europe and the US. This provides ample opportunities for collaboration, knowledge sharing, and career growth within the tech team and the broader organization.

Founded: Schwarz traces its origins to 1930 and has since grown into one of the largest retail groups in the world. The company's commitment to innovation and continuous improvement is reflected in its tech teams, which drive its digital transformation.

Team Structure: The storage team is part of STACKIT, Schwarz's sovereign cloud platform. The team consists of experienced storage engineers, working collaboratively to maintain and optimize the company's storage infrastructure. As a tech lead, you'll work closely with other team members, as well as with internal service teams, to refine and improve storage services.

Development Methodology: The storage team follows Agile methodologies, with a focus on continuous improvement and iterative development. You'll work in sprints, collaborating with your team to prioritize tasks, plan work, and deliver results. The team also emphasizes code review, testing, and quality assurance practices to ensure the reliability and performance of storage systems.

Company Website: Schwarz Group

📈 Career & Growth Analysis

Career Level: As a Cloud Tech Lead - Site Reliability Engineer - Storage, you'll be responsible for maintaining and optimizing the stability and availability of Schwarz's storage infrastructure. This role requires a deep understanding of storage systems, cloud environments, and automation tools, as well as strong leadership and communication skills.

Reporting Structure: The listing does not name the reporting line for this role. You'll work closely with other storage engineers, as well as with internal service teams, to refine and improve storage services.

Technical Impact: In this role, you'll have a significant impact on the performance, reliability, and scalability of Schwarz's storage infrastructure. Your work will directly contribute to the company's ability to deliver high-quality products and services to its customers.

Growth Opportunities:

  • Technical Growth: As a tech lead, you'll have the opportunity to deepen your expertise in storage systems, cloud environments, and automation tools. You'll also have the chance to explore emerging technologies and stay up-to-date with the latest industry trends.
  • Leadership Growth: In this role, you'll develop your leadership and communication skills, working closely with other team members and internal service teams to drive results and improve storage services.
  • Career Progression: As a large, multinational organization, Schwarz offers numerous opportunities for career growth and progression within the tech team and the broader organization. You may have the opportunity to take on more senior roles, such as a Principal Engineer or a Technical Manager, as you develop your skills and gain experience.

🌐 Work Environment

Office Type: Schwarz's Sofia office is a modern, collaborative workspace designed to foster innovation and creativity. The office features open-plan workspaces, meeting rooms, and breakout areas, as well as on-site amenities such as a cafeteria and fitness center.

Office Location(s): Schwarz's Sofia office is located in the heart of the city, with easy access to public transportation and nearby amenities.

Workspace Context:

  • Collaboration: The open-plan workspace encourages collaboration and knowledge sharing among team members, as well as with other departments and teams within the organization.
  • Technology & Tools: The office is equipped with state-of-the-art technology and tools, including high-speed internet, multiple monitors, and testing devices, to support the development and testing of storage systems.
  • Flexible Working: Schwarz offers flexible working arrangements, including remote work options, to support work-life balance and accommodate individual needs.

Work Schedule: Schwarz operates on a standard business hours schedule, with core hours from 9:00 AM to 5:00 PM. However, as a tech lead, you may be required to work outside of these hours to manage incidents and ensure system reliability.

📄 Application & Technical Interview Process

Interview Process:

  1. Online Assessment: You'll complete an online assessment to evaluate your technical skills and problem-solving abilities.
  2. Technical Interview: You'll participate in a technical interview with a member of the storage team, focusing on your expertise in storage systems, cloud environments, and automation tools. You may be asked to discuss your approach to incident management, performance optimization, and capacity planning.
  3. Team Fit Interview: You'll meet with members of the storage team to discuss your communication skills, leadership style, and cultural fit within the organization.
  4. Final Interview: You'll have a final interview with the hiring manager to discuss your career aspirations, growth opportunities, and next steps.

Portfolio Review Tips:

  • 📝 Enhancement Note: When preparing your portfolio, focus on projects that demonstrate your expertise in storage infrastructure management, automation, and optimization. Highlight your ability to maintain and improve system reliability, performance, and scalability.
  • Storage Infrastructure Projects: Include examples of your experience with various storage products and cloud environments, as well as your ability to automate provisioning and operating processes.
  • Incident Response & Troubleshooting: Showcase your ability to troubleshoot and resolve storage-related incidents, with a focus on improving system reliability and performance.
  • Performance Optimization: Demonstrate your ability to analyze and optimize storage system performance, with a focus on future scaling and capacity planning.

Technical Challenge Preparation:

  • 📝 Enhancement Note: When preparing for technical challenges, focus on your ability to troubleshoot and resolve storage-related incidents, as well as your understanding of storage systems, cloud environments, and automation tools.
  • Incident Response: Practice incident response scenarios to improve your ability to identify, diagnose, and resolve storage-related issues quickly and effectively.
  • Performance Optimization: Familiarize yourself with performance optimization techniques and tools, such as benchmarking, profiling, and capacity planning, to ensure that you can optimize storage system performance for future scaling.
  • Communication & Collaboration: Brush up on your communication and collaboration skills, as you'll need to work closely with other team members and internal service teams to drive results and improve storage services.
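
One way to practice the capacity planning mentioned above is to forecast when a storage pool will fill up. The following is a minimal sketch in Python, not something the listing prescribes: it assumes hypothetical daily usage samples in TiB and a simple least-squares linear trend, and the function name `days_until_full` is made up for illustration.

```python
from typing import Optional, Sequence


def days_until_full(used_tib: Sequence[float], capacity_tib: float) -> Optional[float]:
    """Estimate the days left until a pool is full.

    Assumes one usage sample per day (hypothetical data), fits a
    least-squares line through the samples, and extrapolates it.
    Returns None when usage is flat or shrinking.
    """
    n = len(used_tib)
    if n < 2:
        raise ValueError("need at least two samples")

    # Sample i is taken on day i, so the x values are 0, 1, ..., n - 1.
    mean_x = (n - 1) / 2
    mean_y = sum(used_tib) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in enumerate(used_tib))
    var = sum((x - mean_x) ** 2 for x in range(n))

    slope = cov / var  # growth in TiB per day
    if slope <= 0:
        return None

    intercept = mean_y - slope * mean_x
    # Day index at which the fitted line reaches capacity,
    # expressed relative to the most recent sample.
    day_full = (capacity_tib - intercept) / slope
    return max(0.0, day_full - (n - 1))


if __name__ == "__main__":
    samples = [100, 102, 104, 106, 108]  # hypothetical daily usage in TiB
    print(days_until_full(samples, capacity_tib=128))
```

A linear trend is only a starting point. Real usage often grows in steps or seasonally, so in an interview it helps to explain which model you would choose and why.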

ATS Keywords:

  • Programming Languages & Automation: Bash, Golang, Python, Ansible
  • Server Technologies: NetApp, Cohesity, Pure, Ceph, Kubernetes
  • Tools: Prometheus, Grafana, Elasticsearch, Terraform, CloudFormation
  • Methodologies: Agile, ITIL, DevOps
  • Soft Skills: Communication, Leadership, Problem-Solving, Teamwork
  • Industry Terms: Storage Infrastructure, Cloud Environments, Automation, Incident Management, Performance Optimization, Capacity Planning

🛠 Technology Stack & Infrastructure

Backend & Server Technologies:

  • Storage Products: NetApp, Cohesity, Pure, Ceph
  • Cloud Environments: STACKIT (Schwarz's sovereign cloud platform)
  • Infrastructure Tools: Kubernetes, Terraform, CloudFormation

Development & DevOps Tools:

  • Automation Tools: Golang, Python, Bash, Ansible
  • Monitoring Tools: Prometheus, Grafana, Elasticsearch
  • Version Control Systems: Git
  • CI/CD Pipelines: Jenkins, GitLab CI/CD

👥 Team Culture & Values

Engineering Values:

  • Stability & Reliability: Schwarz places a strong emphasis on the stability and reliability of its storage infrastructure, ensuring that it can deliver high-quality products and services to its customers.
  • Automation & Optimization: The company values automation and optimization, striving to continuously improve its products and services through the use of cutting-edge technologies and best practices.
  • Collaboration & Knowledge Sharing: Schwarz fosters a culture of collaboration and knowledge sharing, encouraging team members to work together to drive results and improve storage services.
  • Innovation & Continuous Learning: The company values innovation and continuous learning, encouraging team members to explore emerging technologies and stay up-to-date with the latest industry trends.

Collaboration Style:

  • Cross-Functional Integration: The storage team works closely with internal STACKIT service teams and other teams within the organization to deliver high-quality products and services to customers.
  • Code Review & Pair Programming: The team emphasizes code review and pair programming practices to ensure the reliability and performance of storage systems.
  • Knowledge Sharing: Schwarz encourages knowledge sharing and technical mentoring, with a focus on continuous learning and skill development.

⚡ Challenges & Growth Opportunities

Technical Challenges:

  • Storage Infrastructure Management: As a Cloud Tech Lead - Site Reliability Engineer - Storage, you'll face technical challenges related to storage infrastructure management, including incident response, performance optimization, and capacity planning.
  • Cloud Environments & Automation: You'll also need to stay up-to-date with the latest developments in cloud environments and automation tools, ensuring that you can optimize storage infrastructure management processes and improve system reliability and performance.
  • Emerging Technologies: As a tech lead, you'll have the opportunity to explore emerging technologies and stay up-to-date with the latest industry trends, ensuring that you can drive innovation and continuous improvement within the storage team.

Learning & Development Opportunities:

  • Technical Skill Development: As a Cloud Tech Lead - Site Reliability Engineer - Storage, you'll have the opportunity to develop your technical skills in storage systems, cloud environments, and automation tools, as well as to explore emerging technologies and stay up-to-date with the latest industry trends.
  • Conference Attendance & Certification: Schwarz supports its team members' professional development by providing opportunities to attend industry conferences and obtain relevant certifications.
  • Technical Mentorship & Leadership Development: As a tech lead, you'll have the opportunity to develop your leadership and communication skills, working closely with other team members and internal service teams to drive results and improve storage services.

💡 Interview Preparation

Technical Questions:

  • Storage Infrastructure Management: Prepare for questions related to storage infrastructure management, including incident response, performance optimization, and capacity planning. Be ready to discuss your approach to maintaining and improving system reliability, performance, and scalability.
  • Cloud Environments & Automation: Brush up on your knowledge of cloud environments and automation tools, and be prepared to discuss your experience with these technologies and how you've used them to optimize storage infrastructure management processes.
  • Problem-Solving: Demonstrate your ability to identify, diagnose, and resolve storage-related issues quickly and effectively, with a focus on improving system reliability and performance.

Company & Culture Questions:

  • Company Culture: Research Schwarz's company culture and values, and be prepared to discuss how you align with the organization's commitment to innovation, continuous improvement, and customer satisfaction.
  • Team Dynamics: Familiarize yourself with the storage team's structure and dynamics, and be prepared to discuss how you'll collaborate with other team members and internal service teams to drive results and improve storage services.
  • Customer Impact: Understand the impact of your role on Schwarz's customers, and be prepared to discuss how you'll work to ensure the reliability and performance of storage systems to support the company's growth and success.

📌 Application Steps

To apply for this Cloud Tech Lead - Site Reliability Engineer - Storage role at Schwarz:

  1. Submit Your Application: Click on the "Apply Now" button on the job listing to submit your application through the Schwarz careers portal.
  2. Customize Your Portfolio: Tailor your portfolio to highlight your expertise in storage infrastructure management, automation, and optimization. Include examples of your experience with various storage products and cloud environments, as well as your ability to automate provisioning and operating processes.
  3. Optimize Your Resume: Highlight your relevant experience and skills in storage infrastructure management, cloud environments, and automation tools. Include any relevant certifications or industry-recognized qualifications.
  4. Prepare for Technical Interviews: Brush up on your knowledge of storage systems, cloud environments, and automation tools, and practice incident response and problem-solving scenarios to improve your ability to identify, diagnose, and resolve storage-related issues quickly and effectively.
  5. Research the Company: Familiarize yourself with Schwarz's company culture, values, and mission, and be prepared to discuss how you align with the organization's commitment to innovation, continuous improvement, and customer satisfaction.

⚠️ Important Notice: This enhanced job description includes AI-generated insights and web development/server administration industry-standard assumptions. All details should be verified directly with the hiring organization before making application decisions.


Application Requirements

Candidates should have experience with various storage products and cloud architectures, along with expertise in operating and automating storage infrastructure. Strong communication skills and a passion for new technologies are essential.