Senior Site Reliability Engineer
📍 Job Overview
- Job Title: Senior Site Reliability Engineer
- Company: Obsidian Security
- Location: Sydney, New South Wales, Australia (Remote)
- Job Type: Full-Time
- Category: DevOps / SRE
- Date Posted: 2025-06-25
- Experience Level: 5-10 years
- Remote Status: Remote OK
🚀 Role Summary
- Maintain and enhance the reliability, scalability, and performance of Obsidian's SaaS security platform
- Collaborate with engineering teams to optimize CI/CD pipelines, monitoring, and deployment processes
- Define service verification strategies and implement them in the CI/CD process to meet service level agreements (SLAs)
- Improve developer experience by optimizing CI/CD workflows and performance
- Participate in an on-call rotation to provide 24/7 support and maintain production infrastructure and services on AWS/GCP
📝 Enhancement Note: This role requires a strong background in DevOps and SRE practices, with a focus on maintaining and enhancing the reliability and performance of SaaS applications. Candidates should be comfortable working in a dynamic, fast-paced environment and have experience with relevant technologies.
💻 Primary Responsibilities
- Maintain and Enhance Platform Reliability: Ensure the stability, scalability, and high performance of Obsidian's customer-facing SaaS security platform by addressing complex challenges around reliability, observability, and cost efficiency.
- Collaborate with Engineering Teams: Work closely with engineering teams to maintain and enhance Helm charts, application deployment, monitoring, and CI/CD pipelines.
- Define and Implement Service Verification Strategies: Develop and implement service verification strategies as part of the CI/CD process to meet SLAs and ensure the quality of releases.
- Optimize CI/CD Workflows and Performance: Improve developer experience by optimizing CI/CD workflows and performance, enabling engineering teams to work more efficiently.
- Provide 24/7 On-Call Support: Participate in an on-call rotation to provide 24/7 support for production infrastructure and services on AWS/GCP, ensuring minimal downtime and quick issue resolution.
- Monitor, Debug, and Optimize Production Infrastructure: Monitor, debug, and optimize production infrastructure and services on AWS/GCP to maintain high availability and performance.
📝 Enhancement Note: This role requires a deep understanding of cloud services, microservices architecture, and CI/CD pipelines. Candidates should be comfortable working with complex systems and have experience with relevant technologies.
🎓 Skills & Qualifications
Education: A Bachelor's degree in Computer Science or a related field is required. Relevant coursework or equivalent experience may be considered.
Experience: Candidates should have 4+ years of experience in a DevOps or SRE role supporting SaaS services on GCP and/or AWS. Experience with Kubernetes, microservices architecture, Helm, GitLab CI/CD, ArgoCD, Prometheus, and Grafana is essential.
Required Skills:
- Strong proficiency in Kubernetes, microservices architecture, Helm, GitLab CI/CD, ArgoCD, Prometheus, and Grafana
- Programming experience in at least one language; Golang or Python preferred
- Deep understanding of autoscaling, version upgrades, and cloud service optimization
- Experience with AWS and/or GCP services
Preferred Skills:
- Familiarity with technologies such as Kafka, Elasticsearch, PostgreSQL, ScyllaDB, Databricks, Dagster, Sentry, and Kong
- Experience with infrastructure as code (IaC) tools such as Terraform or CloudFormation
- Familiarity with chaos engineering principles and practices
- Knowledge of container security best practices
- Experience with incident management and post-mortem processes
📝 Enhancement Note: This role requires a strong technical skill set, with a focus on cloud services, microservices architecture, and CI/CD pipelines. Candidates should have experience with relevant technologies and be comfortable working in a dynamic, fast-paced environment.
📊 Portfolio & Project Requirements
Portfolio Essentials:
- A portfolio showcasing your experience with cloud services, microservices architecture, and CI/CD pipelines
- Examples of your work optimizing platform reliability, scalability, and performance
- Case studies demonstrating your ability to define and implement service verification strategies
- Documentation of your experience with on-call rotations and incident management
Technical Documentation:
- Code samples and documentation demonstrating your proficiency with relevant technologies
- Documentation of your experience with incident management and post-mortem processes
- Examples of your work optimizing CI/CD workflows and performance
💵 Compensation & Benefits
Salary Range: The base salary range for this role is AUD $216,000 to $256,000 per year. The actual base pay will vary based on factors such as work location, knowledge, skills, and experience.
Benefits:
- Competitive compensation with equity and 401k
- Comprehensive healthcare with dental and vision coverage
- Flexible paid time off and paid holiday time off
- 12 weeks of new parent or family leave
- Personal and professional development resources
Working Hours: This role requires a standard full-time workweek of 40 hours, with the possibility of occasional overtime to ensure minimal downtime and quick issue resolution. Flexible working hours may be available to accommodate different time zones.
📝 Enhancement Note: The salary range provided is a guideline and may vary based on the candidate's location, skills, and experience. Obsidian Security offers a competitive benefits package designed to support employees' well-being, both at work and at home.
🎯 Team & Company Context
🏢 Company Culture
Industry: Obsidian Security operates in the cybersecurity industry, focusing on securing SaaS applications where modern business happens.
Company Size: Obsidian Security is a growing startup with a strong global presence, protecting more than 200 organizations across North America, Europe, the Middle East, Southeast Asia, Australia, and New Zealand, including many of the world's largest Fortune 1000 and Global 2000 companies.
Founded: Obsidian Security was founded in 2017, with a mission to close a critical gap in securing SaaS applications.
Team Structure:
- The DevOps/SRE team works closely with Engineering, Quality Engineering, and Customer Support to deliver end-to-end services that bring code to life and maintain the world-class SaaS security platform.
- The team is responsible for supporting and maintaining the service quality of the customer-facing SaaS security platform, addressing complex challenges around scalability, reliability, observability, and cost efficiency.
- The DevOps/SRE team collaborates with engineering teams to maintain and enhance Helm charts, application deployment, monitoring, and CI/CD pipelines.
Development Methodology:
- Obsidian Security follows Agile development methodologies, with a focus on continuous integration, continuous deployment, and continuous improvement.
- The company uses GitLab for version control, CI/CD, and project management.
- Obsidian Security employs a microservices architecture, with a focus on scalability, reliability, and maintainability.
Company Website: obsidiansecurity.com
📝 Enhancement Note: Obsidian Security's company culture is driven by its mission to close a critical gap in securing SaaS applications. The company values innovation, collaboration, and a strong focus on customer success. The DevOps/SRE team plays a crucial role in maintaining the reliability and performance of the company's SaaS security platform.
📈 Career & Growth Analysis
Career Level: This role is a senior-level position within the DevOps/SRE team at Obsidian Security. The ideal candidate will have extensive experience in DevOps and SRE practices, with a strong background in cloud services, microservices architecture, and CI/CD pipelines.
Reporting Structure: The Senior Site Reliability Engineer will report directly to the Director of Engineering or a similar role within the engineering leadership team.
Technical Impact: This role has a significant impact on the reliability, scalability, and performance of Obsidian's SaaS security platform. The Senior Site Reliability Engineer will work closely with engineering teams to maintain and enhance the platform, ensuring minimal downtime and quick issue resolution.
Growth Opportunities:
- Technical Growth: The Senior Site Reliability Engineer will have the opportunity to grow their technical skills by working with cutting-edge technologies and collaborating with a talented team of engineers.
- Leadership Development: As the company grows, there may be opportunities for the Senior Site Reliability Engineer to take on more leadership responsibilities, mentoring junior team members and helping to shape the future of the DevOps/SRE team.
- Architecture Decisions: The Senior Site Reliability Engineer will have the opportunity to influence architecture decisions, helping to ensure that the platform remains scalable, reliable, and performant as the company grows.
📝 Enhancement Note: This role offers significant opportunities for technical growth and leadership development. The Senior Site Reliability Engineer will have the chance to work with cutting-edge technologies and collaborate with a talented team of engineers, helping to shape the future of Obsidian's SaaS security platform.
🌐 Work Environment
Office Type: Obsidian Security has a remote-first work environment, with team members located across North America, Europe, the Middle East, Southeast Asia, Australia, and New Zealand.
Office Location(s): This role is listed as remote, based in Sydney, New South Wales, Australia. Team members are encouraged to work from a location that suits their needs and preferences.
Workspace Context:
- Remote Work: Obsidian Security provides the necessary tools and resources for remote work, including laptops, monitors, and software licenses.
- Collaboration: The DevOps/SRE team uses collaboration tools such as Slack, Google Workspace, and GitLab to communicate and work together effectively.
- Work-Life Balance: Obsidian Security values work-life balance and encourages team members to prioritize their well-being and personal lives.
📝 Enhancement Note: Obsidian Security's remote-first work environment provides team members with the flexibility to work from a location that suits their needs and preferences. The company values work-life balance and encourages team members to prioritize their well-being and personal lives.
📄 Application & Technical Interview Process
Interview Process:
- Technical Phone Screen: A brief phone or video call to assess your technical skills and cultural fit with the team.
- Technical Deep Dive: A more in-depth technical interview, focusing on your experience with cloud services, microservices architecture, and CI/CD pipelines. You may be asked to complete a take-home assignment or participate in a live coding exercise.
- Behavioral Interview: A discussion-focused interview to assess your problem-solving skills, communication abilities, and cultural fit with the team.
- Final Review: A final review of your application and interview performance by the hiring manager and other stakeholders.
Portfolio Review Tips:
- Highlight your experience with cloud services, microservices architecture, and CI/CD pipelines
- Include case studies demonstrating your ability to optimize platform reliability, scalability, and performance
- Showcase your experience with incident management and on-call rotations
- Provide examples of your work defining and implementing service verification strategies
Technical Challenge Preparation:
- Brush up on your knowledge of cloud services, microservices architecture, and CI/CD pipelines
- Review your experience with relevant technologies, such as Kubernetes, Helm, GitLab CI/CD, ArgoCD, Prometheus, and Grafana
- Prepare for live coding exercises and technical deep dives, focusing on your ability to optimize platform reliability, scalability, and performance
📝 Enhancement Note: This role requires a strong technical skill set, with a focus on cloud services, microservices architecture, and CI/CD pipelines. Candidates should be comfortable working with complex systems and have experience with relevant technologies.
🛠 Technology Stack & Infrastructure
Frontend Technologies: N/A (This role focuses on backend and infrastructure technologies)
Backend & Server Technologies:
- Cloud Services: AWS and GCP
- Container Orchestration: Kubernetes
- Kubernetes Package Management: Helm
- CI/CD Pipelines: GitLab CI/CD and ArgoCD
- Monitoring: Prometheus and Grafana
- Databases & Search: PostgreSQL, ScyllaDB, and Elasticsearch
- Streaming: Kafka
- Data Processing: Databricks and Dagster
- API Gateway: Kong
- Error Tracking: Sentry
Development & DevOps Tools:
- Version Control: GitLab
- Project Management: Jira and GitLab
- Collaboration: Slack and Google Workspace
- Incident Management: PagerDuty and OpsGenie
📝 Enhancement Note: This role requires a strong background in cloud services, microservices architecture, and CI/CD pipelines. Candidates should have experience with relevant technologies and be comfortable working with complex systems.
👥 Team Culture & Values
Team Values:
- Reliability: Obsidian Security values reliability above all else. The DevOps/SRE team is responsible for ensuring the stability, scalability, and high performance of the company's SaaS security platform.
- Innovation: Obsidian Security encourages innovation and continuous improvement. The DevOps/SRE team is always looking for ways to optimize platform reliability, scalability, and performance.
- Collaboration: Obsidian Security values collaboration and cross-functional teamwork. The DevOps/SRE team works closely with engineering, quality engineering, and customer support teams to deliver end-to-end services that bring code to life and maintain the world-class SaaS security platform.
- Customer Focus: Obsidian Security is committed to customer success. The DevOps/SRE team works tirelessly to ensure minimal downtime and quick issue resolution, providing the best possible experience for Obsidian's customers.
Collaboration Style:
- Cross-Functional Integration: The DevOps/SRE team works closely with engineering, quality engineering, and customer support teams to deliver end-to-end services that bring code to life and maintain the world-class SaaS security platform.
- Code Review Culture: The DevOps/SRE team follows a code review culture, ensuring that all changes to the platform are thoroughly reviewed and tested before deployment.
- Knowledge Sharing: The DevOps/SRE team encourages knowledge sharing and continuous learning. Team members are encouraged to share their expertise and learn from one another.
📝 Enhancement Note: Obsidian Security's DevOps/SRE team values reliability, innovation, collaboration, and customer focus. The team works closely with other departments to deliver end-to-end services that bring code to life and maintain the world-class SaaS security platform.
⚡ Challenges & Growth Opportunities
Technical Challenges:
- Scalability: As Obsidian Security continues to grow, the Senior Site Reliability Engineer will face technical challenges around scaling the platform to meet increasing demand.
- Reliability: The Senior Site Reliability Engineer will be responsible for maintaining the reliability of the platform, ensuring minimal downtime and quick issue resolution.
- Observability: The Senior Site Reliability Engineer will need to ensure that the platform is highly observable, with clear insights into its performance, reliability, and scalability.
- Cost Efficiency: The Senior Site Reliability Engineer will be tasked with optimizing the platform's cost efficiency, ensuring that it remains scalable, reliable, and performant while minimizing wasteful spending.
💡 Interview Preparation
Technical Questions:
- Cloud Services: Questions related to your experience with AWS and GCP, focusing on your ability to optimize platform reliability, scalability, and performance.
- Microservices Architecture: Questions related to your understanding of microservices architecture and your ability to work with complex systems.
- CI/CD Pipelines: Questions related to your experience with CI/CD pipelines, focusing on your ability to optimize workflows and performance.
- Incident Management: Questions related to your experience with incident management and on-call rotations, focusing on your ability to ensure minimal downtime and quick issue resolution.
Company & Culture Questions:
- Company Values: Questions related to your understanding of Obsidian Security's values and your ability to contribute to the company's mission.
- Team Dynamics: Questions related to your ability to work collaboratively with other teams and contribute to a positive work environment.
- Adaptability: Questions related to your ability to adapt to a dynamic, fast-paced environment and learn new technologies as needed.
Portfolio Presentation Strategy:
- Cloud Services: Highlight your experience with AWS and GCP, focusing on your ability to optimize platform reliability, scalability, and performance.
- Microservices Architecture: Showcase your understanding of microservices architecture and your ability to work with complex systems.
- CI/CD Pipelines: Demonstrate your experience with CI/CD pipelines, focusing on your ability to optimize workflows and performance.
- Incident Management: Provide examples of your experience with incident management and on-call rotations, focusing on your ability to ensure minimal downtime and quick issue resolution.
📌 Application Steps
To apply for this Senior Site Reliability Engineer position at Obsidian Security:
- Submit Your Application: Click the "Apply Now" button on the job listing to submit your application through Greenhouse.
- Prepare Your Portfolio: Tailor your portfolio to showcase your experience with cloud services, microservices architecture, and CI/CD pipelines. Include case studies demonstrating your ability to optimize platform reliability, scalability, and performance, as well as your experience with incident management and on-call rotations.
- Optimize Your Resume: Highlight your relevant skills and experience, focusing on your ability to work with complex systems and your experience with relevant technologies.
- Prepare for Technical Interviews: Brush up on your knowledge of cloud services, microservices architecture, and CI/CD pipelines. Review your experience with relevant technologies and prepare for live coding exercises and technical deep dives, focusing on your ability to optimize platform reliability, scalability, and performance.
- Research the Company: Familiarize yourself with Obsidian Security's mission, values, and culture. Understand the company's focus on securing SaaS applications and its commitment to customer success.
⚠️ Important Notice: This enhanced job description includes AI-generated insights and industry-standard assumptions. All details should be verified directly with the hiring organization before making application decisions.
Application Requirements
Candidates should have 4+ years of experience in a DevOps or SRE role supporting SaaS services on GCP and/or AWS, along with a Bachelor's degree in Computer Science or a related field. Strong proficiency in relevant technologies and programming languages is essential.