Site Reliability Engineer

Marsello
Full-time · New Zealand

📍 Job Overview

  • Job Title: Site Reliability Engineer
  • Company: Marsello
  • Location: Wellington Central, Wellington (Hybrid)
  • Job Type: Full-time
  • Category: DevOps & Infrastructure
  • Date Posted: 2025-06-24
  • Experience Level: Mid-Level (2-5 years)
  • Remote Status: On-site/Hybrid

🚀 Role Summary

  • 📝 Enhancement Note: Marsello is seeking a Site Reliability Engineer to join its Wellington team. The role focuses on ensuring platform uptime, reliability, and scalability, and on supporting the platform through incident resolution and performance optimization. It suits a proactive problem solver with a strong background in DevOps, SRE, or platform engineering who wants to contribute to strategic decisions around infrastructure, security, monitoring, and data operations.

💻 Primary Responsibilities

  • Ensure Platform Uptime & Reliability: Proactively monitor and maintain high-availability systems to minimize downtime and maximize user experience.
  • Incident Resolution: Lead incident response efforts, troubleshoot issues, and implement solutions to prevent future occurrences.
  • Performance Optimization: Continuously monitor and optimize platform performance, ensuring it scales to meet growing user demands.
  • Collaborate with Cross-Functional Teams: Work closely with product and customer teams to understand user needs, gather requirements, and drive platform improvements.
  • Strategic Infrastructure Decisions: Contribute to strategic decisions around infrastructure, security, monitoring, and data operations, influencing architecture to support company growth.

🎓 Skills & Qualifications

Education: Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent experience.

Experience: Proven experience (3+ years) in a DevOps, SRE, or platform engineering role.

Required Skills:

  • AWS Services Proficiency: Extensive experience with AWS services, including EC2, ECS, S3, RDS, and IAM, as well as Kubernetes.
  • CI/CD Pipeline Management: Experience managing CI/CD pipelines with Azure DevOps or GitHub Actions.
  • Git-Based Version Control: Strong skills in Git-based version control workflows.
  • High-Availability Systems Experience: Proven track record in monitoring and supporting high-availability systems.
  • Infrastructure as Code: Familiarity with Infrastructure as Code tools such as Terraform, CloudFormation, or Bicep.
  • Database Management: Strong database experience, including MongoDB, Snowflake, and ClickHouse.
  • Containerized Workloads: Experience with containerized workloads using Docker, ECS, or Kubernetes.

Preferred Skills:

  • Experience with cloud-native applications and microservices architectures.
  • Familiarity with chaos engineering principles and practices.
  • Knowledge of observability tools and techniques.
  • Experience with Agile methodologies and DevOps practices.

📊 Web Portfolio & Project Requirements

Portfolio Essentials:

  • Demonstrate your ability to manage and optimize high-availability systems with real-world examples.
  • Showcase your problem-solving skills by presenting complex issues you've faced and how you resolved them.
  • Highlight your experience with AWS services, CI/CD pipelines, and Infrastructure as Code tools.
  • Include any relevant certifications or training in AWS, DevOps, or related technologies.

Technical Documentation:

  • Provide clear and concise documentation of your code, including comments, version control, and deployment processes.
  • Include performance metrics, testing methodologies, and optimization techniques used in your projects.
  • Demonstrate your understanding of security best practices and how you've implemented them in your projects.

💵 Compensation & Benefits

Salary Range: NZD 100,000 - 130,000 per annum (Based on market research and role complexity)

Benefits:

  • Discounted Health Insurance
  • Phone and Internet Allowance
  • Professional Development Allowance
  • Gym Subsidy

Working Hours: Full-time (40 hours/week) with flexible work-from-home options

🎯 Team & Company Context

🏢 Company Culture

Industry: Marsello operates in the marketing technology industry, focusing on personalized omnichannel loyalty solutions for brands.

Company Size: Marsello is a mid-sized company with teams based in New Zealand, Canada, and Australia, offering a supportive, fun, and hardworking culture that values employee well-being and growth.

Founded: 2016

Team Structure:

  • Marsello's platform team consists of developers, QA engineers, and DevOps/SRE engineers working collaboratively to maintain and improve the platform.
  • The team follows an Agile/Scrum methodology, with regular sprint planning and code reviews.
  • Cross-functional collaboration is encouraged, with close interaction between platform, product, and customer teams.

Development Methodology:

  • Marsello follows Agile/Scrum methodologies for development, with regular sprint planning, daily stand-ups, and bi-weekly retrospectives.
  • Code reviews and pair programming are encouraged to ensure code quality and knowledge sharing.
  • CI/CD pipelines are used to automate deployment and ensure consistent, reliable releases.

Company Website: Marsello

📝 Enhancement Note: Marsello's company culture emphasizes empathy, professionalism, and a positive attitude, with a focus on maintaining a calm presence during high-pressure incidents.

📈 Career & Growth Analysis

Web Technology Career Level: Mid-Level Site Reliability Engineer, responsible for ensuring platform uptime, reliability, and performance, contributing to strategic infrastructure decisions, and collaborating with cross-functional teams to drive platform improvements.

Reporting Structure: This role reports directly to the Head of Engineering and works closely with the platform, product, and customer teams.

Technical Impact: The Site Reliability Engineer plays a crucial role in ensuring Marsello's platform remains highly available, scalable, and performant, directly impacting user experience and business growth.

Growth Opportunities:

  • Technical Leadership: As Marsello grows, there will be opportunities for the Site Reliability Engineer to take on more technical leadership responsibilities, mentoring junior team members and influencing architecture decisions.
  • Specialization: With experience, the Site Reliability Engineer may choose to specialize in specific areas, such as security, monitoring, or data operations.
  • Career Progression: Proven success in this role may lead to opportunities for advancement into senior or management positions within the engineering organization.

📝 Enhancement Note: Marsello's growth-focused culture provides ample opportunities for professional development and career progression for motivated and talented Site Reliability Engineers.

🌐 Work Environment

Office Type: Marsello operates an 'office-first' policy, with flexibility to work from home when needed, offering the best of both worlds for employees.

Office Location(s): Wellington Central, Wellington

Workspace Context:

  • Marsello's Wellington office provides a collaborative workspace with multiple monitors and testing devices available to support employee productivity.
  • The office encourages cross-functional interaction, with regular team-building activities and social events.
  • Marsello offers flexible work arrangements, allowing employees to balance their work and personal lives effectively.

Work Schedule: Full-time (40 hours/week) with flexible work-from-home options, allowing employees to maintain a healthy work-life balance.

📝 Enhancement Note: Marsello's flexible work arrangements and collaborative work environment foster a positive and productive atmosphere for employees.

📄 Application & Technical Interview Process

Interview Process:

  1. Technical Screening: A phone or video call to discuss your experience, technical skills, and problem-solving approach, with a focus on AWS services, CI/CD pipelines, and high-availability systems.
  2. Technical Challenge: A hands-on assessment, involving a real-world scenario that tests your ability to manage and optimize high-availability systems, with a focus on AWS services and Infrastructure as Code tools.
  3. Behavioral Interview: A conversation to understand your problem-solving approach, communication skills, and cultural fit with Marsello's team.
  4. Final Interview: A meeting with the Head of Engineering to discuss your career aspirations, technical vision, and alignment with Marsello's mission and values.

Portfolio Review Tips:

  • Highlight your experience with AWS services, CI/CD pipelines, and Infrastructure as Code tools.
  • Include real-world examples of complex issues you've faced and how you resolved them.
  • Showcase your problem-solving skills and ability to maintain a calm presence during high-pressure incidents.

Technical Challenge Preparation:

  • Brush up on your knowledge of AWS services (EC2, ECS, S3, RDS, and IAM) and Kubernetes.
  • Review your experience with CI/CD pipelines, Infrastructure as Code tools, and high-availability systems.
  • Practice explaining technical concepts clearly and concisely, with a focus on user impact and business value.

ATS Keywords: AWS, Kubernetes, EC2, ECS, S3, RDS, IAM, CI/CD, Infrastructure as Code, High-Availability, MongoDB, Snowflake, ClickHouse, Docker, Git, Agile, Scrum, DevOps, Site Reliability Engineering, Platform Engineering, Technical Leadership, Problem-Solving, Incident Resolution, Performance Optimization, Cloud-Native Applications, Microservices Architectures, Chaos Engineering, Observability, Security, Database Management, Containerized Workloads

📝 Enhancement Note: Marsello's interview process focuses on assessing your technical skills, problem-solving approach, and cultural fit, with a strong emphasis on real-world examples and hands-on challenges.

🛠 Technology Stack & Web Infrastructure

AWS Services:

  • Compute: EC2, ECS, Kubernetes
  • Storage: S3, EFS, RDS
  • Networking: VPC, Route 53, ELB, CloudFront
  • Security: IAM, WAF, Shield, Cognito
  • Monitoring & Logging: CloudWatch, X-Ray, CloudTrail

Databases:

  • MongoDB
  • Snowflake
  • ClickHouse

CI/CD Tools:

  • Azure DevOps
  • GitHub Actions

Infrastructure as Code (IaC) Tools:

  • Terraform
  • CloudFormation
  • Bicep

Containerization & Orchestration:

  • Docker
  • ECS
  • Kubernetes

📝 Enhancement Note: Marsello's technology stack is built on AWS services, with a strong focus on Infrastructure as Code, containerization, and orchestration to ensure platform reliability, scalability, and performance.

👥 Team Culture & Values

Web Development Values:

  • Empathy & User-Centric: Marsello prioritizes empathy and a user-centric approach, ensuring that all decisions consider the end-user experience.
  • Proactive Problem-Solving: The team values proactive problem-solving, with a focus on preventing issues before they occur and minimizing downtime when they do.
  • Collaboration & Knowledge Sharing: Marsello encourages collaboration and knowledge sharing, with a strong emphasis on learning from one another and continuously improving.
  • Continuous Learning & Improvement: The team values continuous learning and improvement, with a commitment to staying up-to-date with the latest technologies and best practices.

Collaboration Style:

  • Cross-Functional Collaboration: Marsello's teams work closely together, with regular communication and collaboration between platform, product, and customer teams.
  • Code Reviews & Pair Programming: The team encourages code reviews and pair programming to ensure code quality and knowledge sharing.
  • Agile Methodologies: Marsello follows Agile/Scrum methodologies for development, with regular sprint planning, daily stand-ups, and bi-weekly retrospectives.

📝 Enhancement Note: Marsello's team culture emphasizes empathy, user-centricity, proactive problem-solving, collaboration, and continuous learning, fostering a positive and productive work environment.

⚡ Challenges & Growth Opportunities

Technical Challenges:

  • High-Availability Systems: Design, implement, and maintain high-availability systems that can scale to meet growing user demands and minimize downtime.
  • Incident Response: Develop and refine incident response processes to minimize downtime and ensure quick recovery when issues do occur.
  • Performance Optimization: Continuously monitor and optimize platform performance, ensuring it can scale to meet growing user demands and maintain a high level of user experience.
  • Strategic Infrastructure Decisions: Contribute to strategic decisions around infrastructure, security, monitoring, and data operations, influencing architecture to support company growth.

Learning & Development Opportunities:

  • AWS Training & Certification: Marsello encourages employees to pursue AWS training and certification to deepen their knowledge and skills in AWS services.
  • Conferences & Events: Marsello supports employee attendance at relevant conferences and events, providing opportunities to learn from industry experts and network with peers.
  • Mentorship & Coaching: Marsello offers mentorship and coaching opportunities to help employees develop their technical and leadership skills.

📝 Enhancement Note: Marsello's technical challenges and learning opportunities provide Site Reliability Engineers with the chance to grow their skills, contribute to strategic decisions, and drive platform improvements that directly impact user experience and business growth.

💡 Interview Preparation

Technical Questions:

  • AWS Services: Be prepared to discuss your experience with AWS services (EC2, ECS, S3, RDS, and IAM) and with Kubernetes.
  • CI/CD Pipelines: Expect questions about your experience managing CI/CD pipelines with Azure DevOps or GitHub Actions.
  • High-Availability Systems: Be ready to discuss your experience monitoring and supporting high-availability systems, with a focus on incident resolution and performance optimization.
  • Infrastructure as Code: Prepare to discuss your familiarity with Infrastructure as Code tools, such as Terraform, CloudFormation, or Bicep.

Company & Culture Questions:

  • Marsello's Mission & Values: Be prepared to discuss how your experience and skills align with Marsello's mission and values, and how you can contribute to the company's growth.
  • Agile Methodologies: Expect questions about your experience with Agile methodologies, such as Scrum, and how you've applied them in previous roles.
  • Problem-Solving Approach: Be ready to discuss your problem-solving approach, with a focus on real-world examples and how you've handled high-pressure incidents in the past.

Portfolio Presentation Strategy:

  • AWS Services: Highlight your experience with AWS services, with a focus on the specific services relevant to the role and how you've used them to manage and optimize high-availability systems.
  • CI/CD Pipelines: Showcase your experience managing CI/CD pipelines, with a focus on the tools and processes you've used to ensure consistent, reliable releases.
  • Incident Resolution: Include real-world examples of complex issues you've faced and how you resolved them, demonstrating your ability to maintain a calm presence and lead through high-pressure incidents.
  • Performance Optimization: Highlight your experience with performance optimization, with a focus on the techniques and tools you've used to ensure platforms can scale to meet growing user demands.

📌 Application Steps

To apply for this Site Reliability Engineer position at Marsello:

  1. Tailor Your Resume: Highlight your relevant experience with AWS services, CI/CD pipelines, and Infrastructure as Code tools, with a focus on high-availability systems, incident resolution, and performance optimization.
  2. Prepare Your Portfolio: Include real-world examples of complex issues you've faced and how you resolved them, demonstrating your problem-solving skills and ability to maintain a calm presence during high-pressure incidents.
  3. Research Marsello: Familiarize yourself with Marsello's mission, values, and technology stack, ensuring your application and interview responses align with the company's goals and culture.
  4. Practice Technical Challenges: Brush up on your knowledge of AWS services (EC2, ECS, S3, RDS, and IAM) and Kubernetes. Review your experience with CI/CD pipelines, Infrastructure as Code tools, and high-availability systems.
  5. Prepare for Behavioral Interviews: Reflect on your problem-solving approach, communication skills, and cultural fit with Marsello's team, with a focus on real-world examples and how you've handled high-pressure incidents in the past.

📝 Enhancement Note: Marsello's application and interview process prioritizes real-world examples, technical challenges, and cultural fit, providing candidates with the opportunity to demonstrate their skills and align with the company's mission and values.

Application Requirements

At least 3 years of experience in a DevOps, SRE, or platform engineering role, with proven experience in AWS services and in managing CI/CD pipelines. Strong skills in Git-based workflows and experience with high-availability systems are essential.