Site Reliability Engineer (SRE)

Workday (NZ) Unlimited
Full-time, New Zealand

📍 Job Overview

  • Job Title: Site Reliability Engineer (SRE)
  • Company: Workday (NZ) Unlimited
  • Location: Auckland, New Zealand
  • Job Type: Full-time, hybrid (2 office days per week)
  • Category: DevOps Engineer
  • Date Posted: July 1, 2025
  • Experience Level: 2-5 years
  • Remote Status: Hybrid (on-site 2 days per week, remote otherwise)

🚀 Role Summary

  • Drive reliability and availability of customer environments through automation and continuous improvement
  • Collaborate with a learning-focused team to deliver daily tasks and reduce manual effort
  • Leverage Linux systems, Bash, Python, and Kubernetes to maintain and enhance production environments
  • Balance multiple tasks, prioritize effectively, and make business decisions under pressure

📝 Enhancement Note: This role combines SRE and software development responsibilities, focusing on automation, reliability, and customer satisfaction.

💻 Primary Responsibilities

  • Environment Provisioning & Management: Automate and manage public and private cloud environments using scripts (Bash, Ruby, Python)
  • Performance Monitoring: Monitor and optimize environment performance to ensure high availability and reliability
  • Troubleshooting & Problem-Solving: Diagnose and resolve issues in production environments, minimizing downtime and impact on customers
  • Automation & Tooling: Develop and maintain tools to reduce manual effort and improve efficiency
  • Collaboration & Communication: Work closely with cross-functional teams to deliver results and ensure customer satisfaction

📝 Enhancement Note: This role requires a strong focus on collaboration, transparency, and effective communication to succeed in a dynamic, customer-centric environment.

🎓 Skills & Qualifications

Education: Bachelor's or Master's degree in Computer Science, Engineering, or related technical field, or equivalent experience

Experience:

  • 2+ years with Linux Systems
  • 2+ years using Bash or Python
  • 2+ years with Kubernetes
  • 2+ years running and maintaining a 24x7 large-scale production environment

Required Skills:

  • Linux systems administration
  • Scripting (Bash, Python)
  • Kubernetes
  • Production environment management
  • Problem-solving and troubleshooting skills
  • Collaboration and communication skills

Preferred Skills:

  • Experience with Apache Tomcat, HTTPd, MySQL, and Java Web Applications
  • Familiarity with Chef, Puppet, OSSEC, Splunk, Elasticsearch, Ansible, JIRA, Confluence, Grafana, and Prometheus
  • Enterprise-level thinking, including documentation, runbooks, root cause analysis, capacity-trending, bug fixes, and scripting
  • Secret passion for monitoring and addressing false positives

📊 Web Portfolio & Project Requirements

Portfolio Essentials:

  • Demonstrate experience with Linux systems, scripting, and Kubernetes through relevant projects and case studies
  • Showcase problem-solving skills and ability to optimize production environments
  • Highlight collaboration and communication skills through team projects or client-facing work

Technical Documentation:

  • Provide examples of runbooks, root cause analysis, and capacity-trending documentation
  • Include commented code, version control history, and deployment process documentation
  • Demonstrate understanding of monitoring tools and metrics through project documentation

📝 Enhancement Note: While a portfolio is not explicitly required, providing relevant projects and case studies will strengthen your application and demonstrate your skills and experience.

💵 Compensation & Benefits

Salary Range: NZD 100,000 - 140,000 per year (estimate based on market research and role requirements)

Benefits:

  • Competitive compensation packages with base salary, bonus, and stock
  • Time and support for skill development and career growth
  • Hybrid work model with flexibility to work from home and in-person collaboration
  • Amazing events and snacks

Working Hours: Full-time, 40 hours per week, with some nights and weekends required for on-call support and production update rotation

📝 Enhancement Note: The salary range is estimated based on market research and role requirements. Actual compensation may vary depending on experience, skills, and other factors.

🎯 Team & Company Context

🏢 Company Culture

Industry: Enterprise Software

Company Size: Large (10,000+ employees)

Founded: 2005

Team Structure:

  • Small, collaborative SRE team focused on reliability and availability
  • Cross-functional teams, including software developers, designers, and marketers
  • Flat hierarchy with emphasis on learning, continuous improvement, and engineering focus

Development Methodology:

  • Agile and iterative development processes
  • Regular code reviews, testing, and quality assurance practices
  • Continuous integration and deployment pipelines

Company Website: Workday Careers

📝 Enhancement Note: Workday's culture emphasizes employee-centric, collaborative, and innovative work environments. The SRE team focuses on learning, continuous improvement, and engineering-driven solutions.

📈 Career & Growth Analysis

Web Technology Career Level: Mid-level SRE with a focus on automation, reliability, and customer satisfaction

Reporting Structure: Reports directly to the SRE Manager, collaborating with cross-functional teams and other SREs

Technical Impact: Ensures high availability and reliability of customer environments through automation, monitoring, and troubleshooting

Growth Opportunities:

  • Develop expertise in specific technologies and tools
  • Contribute to open-source projects and take part in the wider community
  • Advance to senior SRE roles, focusing on architecture, leadership, and mentoring

📝 Enhancement Note: This role offers opportunities for growth and development in automation, reliability, and customer-centric technologies. Career progression may include advancing to senior SRE roles or exploring other technical leadership paths.

🌐 Work Environment

Office Type: Hybrid, with a focus on in-person collaboration and remote flexibility

Office Location(s): Auckland, New Zealand

Workspace Context:

  • Modern, collaborative workspaces designed for team interaction and innovation
  • Access to multiple monitors, testing devices, and development tools
  • Cross-functional collaboration opportunities with designers, marketers, and other teams

Work Schedule: Full-time, 40 hours per week, with some nights and weekends required for on-call support and production update rotation

📝 Enhancement Note: Workday's hybrid work model offers flexibility and the benefits of in-person collaboration, enabling teams to deepen connections, maintain a strong community, and do their best work.

📄 Application & Technical Interview Process

Interview Process:

  1. Online assessment or coding challenge focused on Linux systems, scripting, and Kubernetes
  2. Technical deep dive into production environment management, automation, and troubleshooting
  3. Behavioral and cultural fit assessment, focusing on collaboration, communication, and problem-solving skills
  4. Final evaluation and decision-making

Portfolio Review Tips:

  • Highlight relevant projects and case studies demonstrating Linux systems, scripting, and Kubernetes experience
  • Showcase problem-solving skills and ability to optimize production environments
  • Emphasize collaboration and communication skills through team projects or client-facing work

Technical Challenge Preparation:

  • Brush up on Linux systems administration, scripting, and Kubernetes skills
  • Familiarize yourself with relevant tools and technologies, such as Apache Tomcat, HTTPd, MySQL, and Java Web Applications
  • Prepare for behavioral and cultural fit questions, focusing on collaboration, communication, and problem-solving skills

ATS Keywords: Linux, Bash, Python, Kubernetes, Production Environment, Apache Tomcat, MySQL, Java Web Applications, Chef, Puppet, OSSEC, Splunk, Elasticsearch, Ansible, JIRA, Confluence, Grafana, Prometheus, SRE, DevOps, Reliability, Availability, Automation, Troubleshooting, Collaboration, Communication

📝 Enhancement Note: The interview process focuses on technical skills, problem-solving, and cultural fit. Prepare for a comprehensive assessment of your Linux systems, scripting, and Kubernetes expertise, as well as your ability to collaborate and communicate effectively.

🛠 Technology Stack & Web Infrastructure

Frontend Technologies: N/A (focus on backend and infrastructure)

Backend & Server Technologies:

  • Linux and Unix systems (CentOS, SunOS, Solaris)
  • Bash, Python, Ruby
  • Kubernetes
  • Apache Tomcat, HTTPd
  • MySQL
  • Java Web Applications

Development & DevOps Tools:

  • Chef, Puppet, OSSEC, Splunk, Elasticsearch, Ansible
  • JIRA, Confluence, Grafana, Prometheus
  • Git, GitHub, or other version control systems
  • CI/CD pipelines and automation tools

📝 Enhancement Note: This role focuses on backend and infrastructure technologies, with a strong emphasis on Linux systems, scripting, and Kubernetes. Familiarize yourself with relevant tools and technologies to excel in the interview process and on the job.

👥 Team Culture & Values

Web Development Values:

  • Customer-centric approach, focusing on reliability and availability
  • Collaboration and learning, with a strong emphasis on continuous improvement
  • Innovation and automation, driving efficiency and reducing manual effort
  • Transparency and communication, ensuring effective teamwork and customer satisfaction

Collaboration Style:

  • Cross-functional teams, working closely with software developers, designers, and marketers
  • Code reviews and pair programming practices
  • Knowledge sharing, technical mentoring, and continuous learning

📝 Enhancement Note: Workday's SRE team values collaboration, learning, and continuous improvement. Emphasize these values in your application and interview process to demonstrate your fit for the role and the team.

⚡ Challenges & Growth Opportunities

Technical Challenges:

  • Automating and optimizing production environments for high availability and reliability
  • Troubleshooting and resolving complex issues in large-scale production environments
  • Developing and maintaining tools to reduce manual effort and improve efficiency
  • Balancing multiple tasks, prioritizing effectively, and making business decisions under pressure

Learning & Development Opportunities:

  • Expanding expertise in Linux systems, scripting, and Kubernetes
  • Contributing to open-source projects and taking part in the wider community
  • Advancing to senior SRE roles, focusing on architecture, leadership, and mentoring

📝 Enhancement Note: This role presents technical challenges and growth opportunities in automation, reliability, and customer-centric technologies. Leverage these opportunities to develop your skills and advance your career in SRE and DevOps roles.

💡 Interview Preparation

Technical Questions:

  • Linux systems administration, scripting, and Kubernetes
  • Production environment management, automation, and troubleshooting
  • Relevant tools and technologies, such as Apache Tomcat, HTTPd, MySQL, and Java Web Applications
  • Problem-solving and troubleshooting skills

Company & Culture Questions:

  • Workday's customer-centric approach and focus on reliability and availability
  • Collaboration and learning within the SRE team and across functional areas
  • Innovation and automation to drive efficiency and reduce manual effort
  • Transparency and communication to ensure effective teamwork and customer satisfaction

📝 Enhancement Note: Prepare for a comprehensive technical assessment of your Linux systems, scripting, and Kubernetes expertise, as well as your ability to collaborate and communicate effectively. Familiarize yourself with Workday's customer-centric approach and focus on reliability and availability to excel in the interview process.

📌 Application Steps

To apply for this Site Reliability Engineer (SRE) position at Workday:

  1. Review and update your resume, highlighting relevant Linux systems, scripting, and Kubernetes experience
  2. Prepare a portfolio showcasing relevant projects and case studies, demonstrating your skills and expertise
  3. Research Workday's customer-centric approach and its focus on reliability and availability, then prepare for behavioral and cultural fit questions
  4. Complete the online assessment or coding challenge, focusing on Linux systems, scripting, and Kubernetes
  5. Prepare for the technical deep dive, emphasizing your production environment management, automation, and troubleshooting skills
  6. Attend the behavioral and cultural fit assessment, showcasing your collaboration, communication, and problem-solving skills
  7. Review and address any feedback provided during the interview process, and prepare for the final evaluation and decision-making

⚠️ Important Notice: This enhanced job description includes AI-generated insights and web development industry-standard assumptions. All details should be verified directly with the hiring organization before making application decisions.


Application Requirements

Candidates must have at least 2 years of experience with Linux systems, Bash or Python, and Kubernetes, along with experience in maintaining a large-scale production environment. A strong understanding of automation and a proactive approach to problem-solving is essential.