Staff Site Reliability Engineer

Lucid Motors
Full-time | $138k-203k/year (USD) | Newark, United States

📍 Job Overview

  • Job Title: Staff Site Reliability Engineer
  • Company: Lucid Motors
  • Location: Newark, California, United States
  • Job Type: Full-time
  • Category: DevOps Engineer, Site Reliability Engineer
  • Date Posted: 2025-06-04
  • Experience Level: 8+ years
  • Remote Status: On-site

🚀 Role Summary

Lucid Motors is seeking a Staff Site Reliability Engineer to own and enhance the reliability of services deployed across various cloud regions. The role leads the containerization and deployment of microservices and data pipelines on Kubernetes with Helm charts, applying best practices for scalability and fault tolerance. The ideal candidate has a strong background in Cloud Infrastructure, Site Reliability Engineering (SRE), or DevOps Engineering, with a minimum of 8 years of experience.

💻 Primary Responsibilities

🔄 Reliability Engineering

  • Own and enhance the reliability of services deployed across various cloud regions.
  • Proactively monitor, automate, and scale services to ensure seamless uptime and performance.
  • Implement autoscaling strategies and monitor application performance using tools like Prometheus and Grafana.
  • Perform SRE tasks such as availability monitoring, incident response, post-mortem analysis, and preparing reliability reports for leadership and stakeholders.

📦 Containerization & Microservices Deployment

  • Lead the containerization and deployment of microservices and data pipelines on Kubernetes, using Helm charts.
  • Ensure best practices for scalability and fault tolerance in containerized applications.
  • Collaborate with development teams to manage and deploy applications seamlessly with minimal intervention.

🤝 DevOps Advocacy

  • Foster and advocate for a DevOps culture that emphasizes automation, self-service, and engineering excellence.
  • Enable development teams to manage and deploy applications seamlessly with minimal intervention.
  • Collaborate with cross-functional teams in Agile Scrum and Kanban workflows to deliver iterative improvements and meet evolving business needs.

🛠️ Tool Deployment & Maintenance

  • Deploy, configure, and maintain essential cloud services and tools including Kafka, Spark, Presto, Airflow, MQTT, and other microservices platforms in a cloud-native environment.
  • Set up and manage cloud infrastructure using tools like Terraform, Cluster API, and other IaC frameworks, ensuring seamless provisioning, management, and scaling of resources.

🚨 Automated Alerts & Recovery

  • Continuously enhance and automate alerting, incident detection, and recovery mechanisms for critical applications and services to minimize downtime and improve system reliability.
  • Participate in an on-call rotation to meet business SLAs, quickly troubleshoot and resolve issues, and document runbooks for consistent incident management processes.

🎓 Skills & Qualifications

🎓 Education

  • B.S. or M.S. degree in Computer Science, Engineering, or a related technical field; equivalent experience may be considered in lieu of a degree.

🕒 Experience

  • 8+ years in Cloud Infrastructure, Site Reliability Engineering (SRE), DevOps Engineering, or related fields.
  • 4+ years of hands-on experience deploying, managing, and optimizing containerized applications using Kubernetes in both public and private cloud environments (AWS, GCP, Azure, etc.).
  • 4+ years in Infrastructure-as-Code (IaC) using Terraform, Cluster API, or similar automation frameworks to manage cloud infrastructure.
  • Experience in scripting or programming with Python, Go, Bash/Shell, or similar languages.
  • Strong working knowledge of Prometheus, Grafana, and other monitoring and observability tools.
  • Ability to effectively diagnose and resolve performance bottlenecks within AWS at the infrastructure and application layers.
  • Experience with configuration management and automation tools such as Ansible, Chef, or Puppet (preferred but not required).

📊 Portfolio & Project Requirements

  • A comprehensive portfolio showcasing your experience in Site Reliability Engineering, DevOps, and cloud infrastructure management.
  • Examples of containerized applications deployed on Kubernetes and other cloud platforms.
  • Case studies demonstrating your ability to enhance system reliability, optimize performance, and automate alerting and recovery mechanisms.
  • Documentation of your experience with incident management, post-mortem analysis, and preparing reliability reports.

💵 Compensation & Benefits

💰 Salary Range

The compensation range for this position is specific to the locations listed below and is the range Lucid reasonably and in good faith expects to pay for the position taking into account the wide variety of factors that are considered in making compensation decisions, including job-related knowledge; skillset; experience, education and training; certifications; and other relevant business and organizational factors.

  • Base Pay Range (Annual): $138,200 - $202,730 USD

🎁 Additional Compensation and Benefits

  • Medical, dental, vision, life insurance, disability insurance, vacation, and 401k.
  • Equity program and discretionary annual incentive program, subject to the rules governing such programs.

🎯 Team & Company Context

🏢 Company Culture

Industry: Lucid Motors is a luxury electric vehicle manufacturer, focusing on designing and producing high-performance, sustainable vehicles.

Company Size: Lucid Motors is a growing company with a team of over 2,500 employees, providing ample opportunities for career growth and innovation.

Founded: Lucid Motors was founded in 2007, with a mission to create the most captivating, luxury electric vehicles that elevate the human experience and transcend the perceived limitations of space, performance, and intelligence.

Team Structure:

  • The Site Reliability Engineering team works closely with development teams, product owners, and engineering managers to deliver reliable, scalable, and performant services.
  • The team is structured to emphasize collaboration, knowledge sharing, and continuous learning, fostering a culture of engineering excellence.

Development Methodology:

  • Lucid Motors follows Agile Scrum and Kanban workflows, emphasizing iterative development, continuous integration, and delivery.
  • The company uses tools like Jira, Confluence, and Git for project management, documentation, and version control.
  • Lucid Motors encourages a culture of automation, self-service, and infrastructure as code, utilizing tools like Terraform and Kubernetes to manage and deploy applications seamlessly.

Company Website: Lucid Motors

📝 Enhancement Note: Lucid Motors is a dynamic and growing company, providing an excellent opportunity for professionals seeking to make an immediate and significant impact in the luxury electric vehicle industry.

📈 Career & Growth Analysis

Career Level: The Staff Site Reliability Engineer position at Lucid Motors is a senior-level role, responsible for leading and driving reliability engineering efforts across multiple teams and projects.

Reporting Structure: This role reports directly to the Director of Site Reliability Engineering, working closely with development teams, product owners, and other senior stakeholders.

Technical Impact: The Staff Site Reliability Engineer at Lucid Motors has a significant impact on the reliability, performance, and scalability of critical services and applications, ensuring seamless uptime and user experience.

Growth Opportunities:

  • Technical Leadership: This role offers opportunities to mentor junior engineers, drive technical decision-making, and contribute to the architecture and design of large-scale, distributed systems.
  • Emerging Technologies: Lucid Motors encourages its engineers to stay up-to-date with the latest trends and tools in cloud infrastructure, Site Reliability Engineering, and DevOps, providing ample opportunities to gain experience with emerging technologies.
  • Career Progression: As the company continues to grow, there will be opportunities for advancement into roles such as Principal Site Reliability Engineer, Senior Manager, or even Director of Site Reliability Engineering.

📝 Enhancement Note: Lucid Motors' commitment to innovation, growth, and continuous improvement provides an ideal environment for professionals seeking to advance their careers in Site Reliability Engineering and DevOps.

🌐 Work Environment

Office Type: Lucid Motors' Newark, California headquarters features a modern, collaborative workspace designed to foster creativity, innovation, and teamwork.

Office Location(s): The Newark, California office is conveniently located near the intersection of I-880 and I-680, providing easy access to major transportation routes and nearby amenities.

Workspace Context:

  • Collaborative Workspace: The office features open-plan workspaces, conference rooms, and breakout areas designed to facilitate collaboration and communication among team members.
  • State-of-the-Art Technology: Lucid Motors provides its employees with access to the latest hardware, software, and development tools to ensure they have everything they need to succeed in their roles.
  • On-Site Amenities: The Newark, California office offers on-site amenities such as a fully-equipped gym, cafeteria, and outdoor patio areas, providing employees with a comfortable and convenient work environment.

Work Schedule: Lucid Motors operates on a standard 40-hour workweek, with flexible working hours to accommodate individual needs and preferences. The company encourages a healthy work-life balance and provides generous vacation time to ensure employees have ample opportunities to recharge and rejuvenate.

📝 Enhancement Note: Lucid Motors' commitment to providing a collaborative, innovative, and comfortable work environment ensures that employees have the resources and support they need to thrive in their roles and achieve their career goals.

📄 Application & Technical Interview Process

Interview Process:

  1. Phone Screen: A brief phone or video call to assess communication skills, cultural fit, and basic technical competencies.
  2. Technical Deep Dive: A comprehensive technical interview focused on cloud infrastructure, Site Reliability Engineering, and DevOps concepts, as well as hands-on coding challenges and system design exercises.
  3. On-Site Interview: A final on-site interview with stakeholders, including the Director of Site Reliability Engineering, to assess cultural fit, leadership potential, and technical depth.

Portfolio Review Tips:

  • Highlight your experience with containerization, microservices deployment, and cloud infrastructure management.
  • Include case studies demonstrating your ability to enhance system reliability, optimize performance, and automate alerting and recovery mechanisms.
  • Showcase your experience with incident management, post-mortem analysis, and preparing reliability reports.
  • Tailor your portfolio to emphasize your understanding of Lucid Motors' products, services, and business objectives.

Technical Challenge Preparation:

  • Brush up on your knowledge of cloud infrastructure, Site Reliability Engineering, and DevOps concepts.
  • Practice hands-on coding challenges and system design exercises to ensure you are prepared for the technical deep dive interview.
  • Familiarize yourself with Lucid Motors' products, services, and business objectives to demonstrate your understanding of the company's goals and mission.

📝 Enhancement Note: Lucid Motors' interview process is designed to assess both technical competencies and cultural fit, ensuring that candidates possess the skills and qualities necessary to succeed in the role and contribute to the company's mission.

📌 Application Steps

To apply for this Staff Site Reliability Engineer position at Lucid Motors:

  1. Submit your application through the application link provided.
  2. Tailor your resume to highlight your experience with cloud infrastructure, Site Reliability Engineering, and DevOps, emphasizing relevant keywords and skills.
  3. Prepare a comprehensive portfolio showcasing your experience with containerization, microservices deployment, and cloud infrastructure management.
  4. Research Lucid Motors' products, services, and business objectives to demonstrate your understanding of the company's mission and goals.
  5. Practice hands-on coding challenges and system design exercises to ensure you are prepared for the technical interview.

⚠️ Important Notice: This enhanced job description includes AI-generated insights and web development/server administration industry-standard assumptions. All details should be verified directly with the hiring organization before making application decisions.

Application Requirements

Candidates should have a B.S. or M.S. degree in a related technical field and 8+ years of experience in Cloud Infrastructure or Site Reliability Engineering. Hands-on experience with Kubernetes and Infrastructure-as-Code tools is essential.