Site Reliability Engineer Pleno – SRE (Remoto)
📍 Job Overview
- Job Title: Site Reliability Engineer Pleno – SRE (Remoto)
- Company: ZENVIA
- Location: São Paulo, São Paulo, Brazil
- Job Type: Full-Time
- Category: DevOps, Infrastructure
- Date Posted: 2025-07-24
- Experience Level: Mid-Senior level (2-5 years)
- Remote Status: Remote (Brazil)
🚀 Role Summary
- Key Responsibilities: Support project development, implement improvements for high availability, scalability, reliability, and security. Automate infrastructure and troubleshoot production issues.
- Key Skills: Infrastructure, Linux, Cloud Computing, IaC, Kubernetes, Observability, Scripting, Creativity, Communication.
- 📝 Enhancement Note: This role focuses on maintaining and improving the reliability and performance of Zenvia's on-premise infrastructure, working collaboratively with various tech teams.
💻 Primary Responsibilities
- 1. Project Support: Collaborate with technology teams to develop projects and implement improvements.
- 2. Infrastructure Automation: Assist in automating infrastructure using tools like Ansible and GitLab.
- 3. Troubleshooting: Help troubleshoot and resolve issues in the production environment.
- 4. Monitoring and Observability: Work with observability tools like Prometheus, Grafana, Elastic, and Zabbix to monitor and improve system performance.
- 5. Documentation: Document processes and actions to enable knowledge sharing and future automation.
- 📝 Enhancement Note: This role requires a strong focus on problem-solving, communication, and collaboration to ensure the smooth operation of Zenvia's infrastructure.
🎓 Skills & Qualifications
Education: Bachelor's degree in Computer Science, Computer Engineering, or a related field. Relevant experience may be considered in lieu of a degree.
Experience: Proven experience (2-5 years) in infrastructure management, with a focus on on-premise environments.
Required Skills:
- Experience with on-premise infrastructure and Linux operating systems (Redhat, Debian, Ubuntu)
- Knowledge of cloud services (AWS) and IaC tools (Ansible, GitLab)
- Familiarity with Kubernetes and container orchestration
- Proficiency in scripting languages (Bash, Shell)
- Strong communication and collaboration skills
Preferred Skills:
- Experience with message brokers (Kafka, Redis)
- Knowledge of SQL and NoSQL databases (PostgreSQL, MySQL)
- Performance tuning and optimization
- Familiarity with SMPP and SIP protocols
- Support for internal teams related to infrastructure
📊 Web Portfolio & Project Requirements
Portfolio Essentials:
- Demonstrate your experience with infrastructure management, focusing on on-premise environments and Linux systems.
- Showcase your scripting skills and ability to automate infrastructure tasks.
- Highlight your problem-solving skills and experience with troubleshooting production issues.
Technical Documentation:
- Provide examples of well-documented infrastructure projects, including code comments and version control.
- Showcase your understanding of system design and architecture, with a focus on high availability and scalability.
💵 Compensation & Benefits
Salary Range: The salary range for this role is not specified. However, based on market research, the average salary for a Site Reliability Engineer in Brazil is approximately R$8,000 - R$15,000 per month, depending on experience and skills.
Benefits:
- Zenvia Care: A benefits program for you and your family, including healthcare, wellness, parental, and remote care.
- Healthcare: Comprehensive health and dental plans, life insurance, and telemedicine services.
- Wellness Care: Personal loans, day off on your birthday, TotalPass, and access to coaches for sports and nutrition.
- Parental Care: Extended maternity and paternity leave, pregnancy support, and childcare assistance.
- Remote Care: Policy for temporary relocation, equipment allowance, home office allowance, and access to co-working spaces.
- Career Development: Internal mobility program, language courses, and access to a learning hub for personal growth.
- Participation in Results: Share in the success of the company through performance-based compensation.
- Company Culture: Flexible work hours, remote work, and a collaborative, innovative environment.
🎯 Team & Company Context
🏢 Company Culture
Industry: Technology, with a focus on cloud-based communication solutions.
Company Size: Medium to large (1,000+ employees)
Founded: 2011
Team Structure:
- The Site Reliability Engineer will work closely with various technology teams, including software development, quality assurance, and other infrastructure professionals.
- The role reports directly to the SRE team lead and collaborates with other SREs to maintain and improve the overall infrastructure.
Development Methodology:
- Agile/Scrum methodologies for project management and development.
- Infrastructure as Code (IaC) for automated deployment and configuration management.
- Continuous Integration/Continuous Deployment (CI/CD) pipelines for automated testing and deployment.
Company Website: ZENVIA
📝 Enhancement Note: Zenvia values innovation, collaboration, and autonomy, providing an environment where employees can contribute to the company's growth and success.
📈 Career & Growth Analysis
Web Technology Career Level: Mid-Senior level Site Reliability Engineer, responsible for maintaining and improving the reliability, performance, and scalability of Zenvia's infrastructure.
Reporting Structure: Reports directly to the SRE team lead and collaborates with other SREs and technology teams.
Technical Impact: Directly influences the availability, performance, and security of Zenvia's communication platforms, impacting millions of users daily.
Growth Opportunities:
- 1. Technical Specialization: Deepen your expertise in specific areas of infrastructure management, such as databases, messaging systems, or cloud services.
- 2. Team Leadership: Develop your leadership skills and take on more responsibilities within the SRE team or mentor junior team members.
- 3. Architecture and Design: Contribute to the design and architecture of Zenvia's infrastructure, driving innovation and scalability.
📝 Enhancement Note: Zenvia offers opportunities for growth and development, both technically and in leadership roles, for motivated and talented professionals.
🌐 Work Environment
Office Type: Remote, with occasional in-person meetings and events.
Office Location(s): São Paulo, Brazil (with remote work options)
Workspace Context:
- 1. Remote Work: Zenvia provides a remote work allowance of R$250 per month (with an option to allocate R$50 to previdência privada) and access to co-working spaces through a partnership with Woba.
- 2. Equipment: Zenvia offers an equipment allowance of R$1,500 to help set up a home office.
- 3. Collaboration: Zenvia fosters a collaborative work environment, with regular team meetings and open communication channels.
Work Schedule: Flexible work hours, with a focus on results and productivity.
📝 Enhancement Note: Zenvia's remote work policy and flexible hours allow employees to balance their personal and professional lives effectively.
📄 Application & Technical Interview Process
Interview Process:
- 1. Phone Screen: A brief call to discuss your experience and qualifications for the role.
- 2. Technical Assessment: A hands-on assessment of your infrastructure management and scripting skills.
- 3. Behavioral Interview: An in-depth discussion of your problem-solving skills, communication, and collaboration abilities.
- 4. Final Interview: A meeting with the hiring manager or a member of the leadership team to discuss your fit for the role and Zenvia's company culture.
Portfolio Review Tips:
- 1. Infrastructure Projects: Highlight your experience with on-premise infrastructure management and automation.
- 2. Scripting Examples: Showcase your proficiency in scripting languages and your ability to automate infrastructure tasks.
- 3. Problem-Solving Scenarios: Demonstrate your ability to troubleshoot and resolve complex infrastructure issues.
Technical Challenge Preparation:
- 1. Infrastructure Management: Brush up on your knowledge of Linux operating systems, cloud services, and IaC tools.
- 2. Scripting: Practice your scripting skills and prepare for coding challenges related to infrastructure automation.
- 3. Problem-Solving: Review common infrastructure issues and develop strategies to troubleshoot and resolve them effectively.
ATS Keywords: Infrastructure, Linux, Cloud Computing, IaC, Kubernetes, Observability, Scripting, Problem-Solving, Collaboration, Communication, On-Premise, Remote Work, Agile, Scrum, CI/CD, SRE, DevOps, Infrastructure Management.
📝 Enhancement Note: Zenvia's interview process focuses on assessing your technical skills, problem-solving abilities, and cultural fit within the organization.
🛠 Technology Stack & Web Infrastructure
Frontend Technologies: (Not applicable for this role)
Backend & Server Technologies:
- Linux (Redhat, Debian, Ubuntu)
- Cloud Services (AWS)
- IaC Tools (Ansible, GitLab)
- Kubernetes and container orchestration
- Observability tools (Prometheus, Grafana, Elastic, Zabbix)
- Scripting languages (Bash, Shell)
Development & DevOps Tools:
- Version control systems (Git)
- CI/CD pipelines (Jenkins, GitLab CI/CD)
- Infrastructure as Code (IaC) tools (Terraform, CloudFormation)
- Containerization tools (Docker, Kubernetes)
📝 Enhancement Note: Zenvia's technology stack focuses on open-source tools and platforms, providing opportunities for professionals to gain experience with cutting-edge technologies.
👥 Team Culture & Values
Web Development Values:
- 1. Innovation: Zenvia encourages continuous learning and innovation, driving progress in the communication technology industry.
- 2. Collaboration: Zenvia fosters a collaborative work environment, with open communication channels and regular team meetings.
- 3. Autonomy: Zenvia values employee autonomy, providing the freedom to create, innovate, and make decisions within the role.
- 4. Customer Focus: Zenvia prioritizes customer satisfaction, ensuring that its communication platforms meet the needs of users worldwide.
Collaboration Style:
- 1. Cross-Functional Teams: Zenvia encourages collaboration between different teams, including software development, quality assurance, and other infrastructure professionals.
- 2. Knowledge Sharing: Zenvia fosters a culture of knowledge sharing, with regular team meetings and training opportunities.
- 3. Continuous Learning: Zenvia supports the continuous learning and development of its employees, providing access to a learning hub and language courses.
📝 Enhancement Note: Zenvia's team culture values innovation, collaboration, and autonomy, providing an environment where employees can thrive and grow both personally and professionally.
⚡ Challenges & Growth Opportunities
Technical Challenges:
- 1. Infrastructure Complexity: Manage and maintain complex on-premise infrastructure, ensuring high availability, scalability, and performance.
- 2. Automation: Develop and implement automated infrastructure management solutions to improve efficiency and reduce manual effort.
- 3. Troubleshooting: Diagnose and resolve complex infrastructure issues, often under tight deadlines and high pressure.
- 4. Performance Optimization: Continuously monitor and optimize infrastructure performance, ensuring that Zenvia's communication platforms meet the needs of millions of users worldwide.
- 5. Emerging Technologies: Stay up-to-date with emerging infrastructure technologies and adapt Zenvia's infrastructure to take advantage of new opportunities.
Learning & Development Opportunities:
- 1. Technical Specialization: Deepen your expertise in specific areas of infrastructure management, such as databases, messaging systems, or cloud services.
- 2. Leadership Development: Develop your leadership skills and take on more responsibilities within the SRE team or mentor junior team members.
- 3. Architecture and Design: Contribute to the design and architecture of Zenvia's infrastructure, driving innovation and scalability.
📝 Enhancement Note: Zenvia offers opportunities for growth and development, both technically and in leadership roles, for motivated and talented professionals.
💡 Interview Preparation
Technical Questions:
- 1. Infrastructure Management: Be prepared to discuss your experience with on-premise infrastructure management and automation.
- 2. Scripting: Demonstrate your proficiency in scripting languages and your ability to automate infrastructure tasks.
- 3. Problem-Solving: Showcase your ability to troubleshoot and resolve complex infrastructure issues, with a focus on high availability, scalability, and performance.
Company & Culture Questions:
- 1. Company Culture: Demonstrate your understanding of Zenvia's company culture, values, and work environment.
- 2. Collaboration: Explain your approach to collaboration and working effectively within a remote, distributed team.
- 3. Adaptability: Discuss your ability to adapt to new technologies, tools, and working environments.
Portfolio Presentation Strategy:
- 1. Infrastructure Projects: Highlight your experience with on-premise infrastructure management and automation, focusing on your ability to deliver high availability, scalability, and performance.
- 2. Scripting Examples: Showcase your proficiency in scripting languages and your ability to automate infrastructure tasks, with a focus on efficiency and reliability.
- 3. Problem-Solving Scenarios: Demonstrate your ability to troubleshoot and resolve complex infrastructure issues, with a focus on customer impact and business value.
📝 Enhancement Note: Zenvia's interview process focuses on assessing your technical skills, problem-solving abilities, and cultural fit within the organization.
📌 Application Steps
To apply for this Site Reliability Engineer Pleno – SRE (Remoto) position at ZENVIA:
- Submit your application through the application link.
- Customize your portfolio to highlight your experience with on-premise infrastructure management and automation, focusing on high availability, scalability, and performance.
- Optimize your resume for web technology roles, emphasizing your project highlights and technical skills.
- Prepare for technical interviews by practicing coding challenges and reviewing common infrastructure issues and problem-solving strategies.
- Research Zenvia's company culture, values, and work environment to ensure a strong fit for your personal and professional goals.
📝 Enhancement Note: This enhanced job description provides a comprehensive overview of the Site Reliability Engineer Pleno – SRE (Remoto) role at ZENVIA, including key responsibilities, required skills, and growth opportunities. Use this information to tailor your application and prepare for a successful interview process.
Application Requirements
Candidates should have experience with on-premise infrastructure and Linux operating systems, as well as knowledge of cloud services and IaC tools. Familiarity with Kubernetes, observability tools, and scripting languages is also required.