Site Reliability Engineer (SRE)
📍 Job Overview
- Job Title: Site Reliability Engineer (SRE)
- Company: Workday (NZ) Unlimited
- Location: Auckland, New Zealand
- Job Type: Full-time, hybrid (2 office days per week)
- Category: DevOps Engineer
- Date Posted: July 1, 2025
- Experience Level: 2-5 years
- Remote Status: Hybrid (on-site 2 days per week)
🚀 Role Summary
- Drive reliability and availability of customer environments through automation and continuous improvement
- Collaborate with a learning-focused team to deliver daily tasks and reduce manual effort
- Leverage Linux systems, Bash, Python, and Kubernetes to maintain and enhance production environments
- Balance multiple tasks, prioritize effectively, and make business decisions under pressure
📝 Enhancement Note: This role combines SRE and software development responsibilities, focusing on automation, reliability, and customer satisfaction.
💻 Primary Responsibilities
- Environment Provisioning & Management: Automate and manage public and private cloud environments using scripts (Bash, Ruby, Python)
- Performance Monitoring: Monitor and optimize environment performance to ensure high availability and reliability
- Troubleshooting & Problem-Solving: Diagnose and resolve issues in production environments, minimizing downtime and impact on customers
- Automation & Tooling: Develop and maintain tools to reduce manual effort and improve efficiency
- Collaboration & Communication: Work closely with cross-functional teams to deliver results and ensure customer satisfaction
📝 Enhancement Note: This role requires a strong focus on collaboration, transparency, and effective communication to succeed in a dynamic, customer-centric environment.
🎓 Skills & Qualifications
Education: Bachelor's or Master's degree in Computer Science, Engineering, or related technical field, or equivalent experience
Experience:
- 2+ years with Linux Systems
- 2+ years using Bash or Python
- 2+ years with Kubernetes
- 2+ years running and maintaining a 24x7 large-scale production environment
Required Skills:
- Linux systems administration
- Scripting (Bash, Python)
- Kubernetes
- Production environment management
- Problem-solving and troubleshooting skills
- Collaboration and communication skills
Preferred Skills:
- Experience with Apache Tomcat, HTTPd, MySQL, and Java Web Applications
- Familiarity with Chef, Puppet, OSSEC, Splunk, Elasticsearch, Ansible, JIRA, Confluence, Grafana, and Prometheus
- Enterprise-level thinking, including documentation, runbooks, root cause analysis, capacity-trending, bug fixes, and scripting
- Secret passion for monitoring and addressing false positives
📊 Portfolio & Project Requirements
Portfolio Essentials:
- Demonstrate experience with Linux systems, scripting, and Kubernetes through relevant projects and case studies
- Showcase problem-solving skills and ability to optimize production environments
- Highlight collaboration and communication skills through team projects or client-facing work
Technical Documentation:
- Provide examples of runbooks, root cause analysis, and capacity-trending documentation
- Include code comments, version control, and deployment processes documentation
- Demonstrate understanding of monitoring tools and metrics through project documentation
📝 Enhancement Note: While a portfolio is not explicitly required, providing relevant projects and case studies will strengthen your application and demonstrate your skills and experience.
💵 Compensation & Benefits
Salary Range: NZD 100,000 - 140,000 per year (estimate based on market research and role requirements)
Benefits:
- Competitive compensation packages with base salary, bonus, and stock
- Time and support for skill development and career growth
- Hybrid work model with flexibility to work from home and in-person collaboration
- Amazing events and snacks
Working Hours: Full-time, 40 hours per week, with some nights and weekends required for on-call support and production update rotation
📝 Enhancement Note: The salary range is estimated based on market research and role requirements. Actual compensation may vary depending on experience, skills, and other factors.
🎯 Team & Company Context
🏢 Company Culture
Industry: Enterprise Software
Company Size: Large (10,000+ employees)
Founded: 2005
Team Structure:
- Small, collaborative SRE team focused on reliability and availability
- Cross-functional teams, including software developers, designers, and marketers
- Flat hierarchy with emphasis on learning, continuous improvement, and engineering focus
Development Methodology:
- Agile and iterative development processes
- Regular code reviews, testing, and quality assurance practices
- Continuous integration and deployment pipelines
Company Website: Workday Careers
📝 Enhancement Note: Workday's culture emphasizes employee-centric, collaborative, and innovative work environments. The SRE team focuses on learning, continuous improvement, and engineering-driven solutions.
📈 Career & Growth Analysis
Career Level: Mid-level SRE with a focus on automation, reliability, and customer satisfaction
Reporting Structure: Reports directly to the SRE Manager, collaborating with cross-functional teams and other SREs
Technical Impact: Ensures high availability and reliability of customer environments through automation, monitoring, and troubleshooting
Growth Opportunities:
- Develop expertise in specific technologies and tools
- Contribute to open-source projects and get involved in the community
- Advance to senior SRE roles, focusing on architecture, leadership, and mentoring
📝 Enhancement Note: This role offers opportunities for growth and development in automation, reliability, and customer-centric technologies. Career progression may include advancing to senior SRE roles or exploring other technical leadership paths.
🌐 Work Environment
Office Type: Hybrid, with a focus on in-person collaboration and remote flexibility
Office Location(s): Auckland, New Zealand
Workspace Context:
- Modern, collaborative workspaces designed for team interaction and innovation
- Access to multiple monitors, testing devices, and development tools
- Cross-functional collaboration opportunities with designers, marketers, and other teams
Work Schedule: Full-time, 40 hours per week, with some nights and weekends required for on-call support and production update rotation
📝 Enhancement Note: Workday's hybrid work model offers flexibility and the benefits of in-person collaboration, enabling teams to deepen connections, maintain a strong community, and do their best work.
📄 Application & Technical Interview Process
Interview Process:
- Online assessment or coding challenge focused on Linux systems, scripting, and Kubernetes
- Technical deep dive into production environment management, automation, and troubleshooting
- Behavioral and cultural fit assessment, focusing on collaboration, communication, and problem-solving skills
- Final evaluation and decision-making
Portfolio Review Tips:
- Highlight relevant projects and case studies demonstrating Linux systems, scripting, and Kubernetes experience
- Showcase problem-solving skills and ability to optimize production environments
- Emphasize collaboration and communication skills through team projects or client-facing work
Technical Challenge Preparation:
- Brush up on Linux systems administration, scripting, and Kubernetes skills
- Familiarize yourself with relevant tools and technologies, such as Apache Tomcat, HTTPd, MySQL, and Java Web Applications
- Prepare for behavioral and cultural fit questions, focusing on collaboration, communication, and problem-solving skills
ATS Keywords: Linux, Bash, Python, Kubernetes, Production Environment, Apache Tomcat, MySQL, Java Web Applications, Chef, Puppet, OSSEC, Splunk, Elasticsearch, Ansible, JIRA, Confluence, Grafana, Prometheus, SRE, DevOps, Reliability, Availability, Automation, Troubleshooting, Collaboration, Communication
📝 Enhancement Note: The interview process focuses on technical skills, problem-solving, and cultural fit. Prepare for a comprehensive assessment of your Linux systems, scripting, and Kubernetes expertise, as well as your ability to collaborate and communicate effectively.
🛠 Technology Stack & Infrastructure
Frontend Technologies: N/A (focus on backend and infrastructure)
Backend & Server Technologies:
- Linux and Unix systems (CentOS, SunOS, Solaris)
- Bash, Python, Ruby
- Kubernetes
- Apache Tomcat, HTTPd
- MySQL
- Java Web Applications
Development & DevOps Tools:
- Chef, Puppet, OSSEC, Splunk, Elasticsearch, Ansible
- JIRA, Confluence, Grafana, Prometheus
- Git, GitHub, or other version control systems
- CI/CD pipelines and automation tools
📝 Enhancement Note: This role focuses on backend and infrastructure technologies, with a strong emphasis on Linux systems, scripting, and Kubernetes. Familiarize yourself with relevant tools and technologies to excel in the interview process and on the job.
👥 Team Culture & Values
Team Values:
- Customer-centric approach, focusing on reliability and availability
- Collaboration and learning, with a strong emphasis on continuous improvement
- Innovation and automation, driving efficiency and reducing manual effort
- Transparency and communication, ensuring effective teamwork and customer satisfaction
Collaboration Style:
- Cross-functional teams, working closely with software developers, designers, and marketers
- Code reviews and pair programming practices
- Knowledge sharing, technical mentoring, and continuous learning
📝 Enhancement Note: Workday's SRE team values collaboration, learning, and continuous improvement. Emphasize these values in your application and interview process to demonstrate your fit for the role and the team.
⚡ Challenges & Growth Opportunities
Technical Challenges:
- Automating and optimizing production environments for high availability and reliability
- Troubleshooting and resolving complex issues in large-scale production environments
- Developing and maintaining tools to reduce manual effort and improve efficiency
- Balancing multiple tasks, prioritizing effectively, and making business decisions under pressure
Learning & Development Opportunities:
- Expanding expertise in Linux systems, scripting, and Kubernetes
- Contributing to open-source projects and getting involved in the community
- Advancing to senior SRE roles, focusing on architecture, leadership, and mentoring
📝 Enhancement Note: This role presents technical challenges and growth opportunities in automation, reliability, and customer-centric technologies. Leverage these opportunities to develop your skills and advance your career in SRE and DevOps roles.
💡 Interview Preparation
Technical Questions:
- Linux systems administration, scripting, and Kubernetes
- Production environment management, automation, and troubleshooting
- Relevant tools and technologies, such as Apache Tomcat, HTTPd, MySQL, and Java Web Applications
- Problem-solving and troubleshooting skills
Company & Culture Questions:
- Workday's customer-centric approach and focus on reliability and availability
- Collaboration and learning within the SRE team and across functional areas
- Innovation and automation to drive efficiency and reduce manual effort
- Transparency and communication to ensure effective teamwork and customer satisfaction
📝 Enhancement Note: Prepare for a comprehensive technical assessment of your Linux systems, scripting, and Kubernetes expertise, as well as your ability to collaborate and communicate effectively. Familiarize yourself with Workday's customer-centric approach and focus on reliability and availability to excel in the interview process.
📌 Application Steps
To apply for this Site Reliability Engineer (SRE) position at Workday:
- Review and update your resume, highlighting relevant Linux systems, scripting, and Kubernetes experience
- Prepare a portfolio showcasing relevant projects and case studies, demonstrating your skills and expertise
- Research Workday's customer-centric approach, focus on reliability and availability, and prepare for behavioral and cultural fit questions
- Complete the online assessment or coding challenge, focusing on Linux systems, scripting, and Kubernetes
- Prepare for the technical deep dive, emphasizing your production environment management, automation, and troubleshooting skills
- Attend the behavioral and cultural fit assessment, showcasing your collaboration, communication, and problem-solving skills
- Review and address any feedback provided during the interview process, and prepare for the final evaluation and decision-making
⚠️ Important Notice: This enhanced job description includes AI-generated insights and web development industry-standard assumptions. All details should be verified directly with the hiring organization before making application decisions.
Application Requirements
Candidates must have at least 2 years of experience with Linux systems, Bash or Python, and Kubernetes, along with experience maintaining a large-scale production environment. A strong understanding of automation and a proactive approach to problem-solving are essential.