Cloud Engineer - Observability
Based on the provided job listing, here's the enhanced job description:
π Job Overview
- Job Title: Cloud Engineer - Observability
- Company: NetDocuments
- Location: Lehi, Utah, United States
- Job Type: Hybrid
- Category: DevOps, Cloud Infrastructure, Web Technology
- Date Posted: 2025-07-28
- Experience Level: 2-5 years
π Role Summary
-
Key Responsibilities:
- Lead and evolve the company-wide observability strategy across infrastructure, applications, and end-user experience.
- Manage and optimize observability tools, including Datadog, OpenTelemetry, AWS CloudTrail, and CloudWatch.
- Collaborate with engineering, platform, and operations teams to define standards for logging, metrics, tracing, and alerting.
- Develop and maintain standardized instrumentation libraries, dashboards, and alerting frameworks.
- Build and manage synthetic monitors, golden signals, and runbooks to support rapid incident detection and response.
-
Key Qualifications:
- 3+ years of experience in Observability, SRE, DevOps, or Cloud Infrastructure roles.
- Strong hands-on experience with Datadog, OpenTelemetry, AWS CloudTrail, and related observability tools.
- Deep understanding of metrics, logs, and traces, with the ability to implement and interpret golden signals.
- Experience supporting observability platforms at scale in modern, distributed environments.
- Proficiency with AWS services, including VPC Flow Logs, CloudWatch, EC2, S3, Route53, Load Balancers, and ECS.
π» Primary Responsibilities
π Observability Strategy & Tool Management
-
Lead and Evolve Observability Strategy:
- Define and execute the company-wide observability strategy, ensuring meaningful dashboards and actionable alerts for all development teams.
- Collaborate with stakeholders to understand business needs and align observability goals with company objectives.
-
Manage and Optimize Observability Tools:
- Administer and optimize Datadog, OpenTelemetry, AWS CloudTrail, and CloudWatch for effective monitoring and alerting.
- Ensure meaningful dashboards and actionable alerts are available to all development teams, driving proactive issue detection and resolution.
-
Collaborate with Engineering Teams:
- Work closely with engineering, platform, and operations teams to define standards for logging, metrics, tracing, and alerting.
- Contribute to a culture of proactive monitoring, continuous improvement, and incident prevention.
-
Develop and Maintain Instrumentation Libraries:
- Create and maintain standardized instrumentation libraries, dashboards, and alerting frameworks to support rapid incident detection and response.
- Ensure consistent and meaningful metrics, logs, and traces across the organization.
-
Build and Manage Synthetic Monitors:
- Develop and maintain synthetic monitors, golden signals, and runbooks to support rapid incident detection and response.
- Monitor and optimize the cost and performance of observability tooling, ensuring efficient resource utilization.
π Skills & Qualifications
π Required Skills and Experience
-
Experience:
- 3+ years in an Observability, SRE, DevOps, or Cloud Infrastructure role.
- Strong hands-on experience with Datadog, OpenTelemetry, AWS CloudTrail, and related observability tools.
- Deep understanding of metrics, logs, and traces, with the ability to implement and interpret golden signals.
- Experience supporting observability platforms at scale in modern, distributed environments.
- Proficiency with AWS services, including VPC Flow Logs, CloudWatch, EC2, S3, Route53, Load Balancers, and ECS.
-
Education:
- Bachelorβs degree in Computer Science, Engineering, or a related field, or equivalent experience.
π‘ Preferred Skills
-
Experience:
- Familiarity with CI/CD pipelines and Infrastructure-as-Code (IaC) tools, especially Terraform.
- Strong collaboration and communication skills, with the ability to influence engineering teams and drive change.
- Proficiency in microservices, SaaS architecture, and DR/resiliency practices in AWS.
- Experience with Terraform, Kubernetes, and containerization technologies.
-
Certifications:
- AWS certifications and a commitment to staying current on the latest platform capabilities and best practices.
π° Compensation & Benefits
π° Salary Range
- Salary: $100,000 - $140,000 per year (Estimated, based on regional market rates and experience level)
π Benefits
- Healthcare: 90% healthcare premiums company covered, with HSA and wellness contributions.
- Time Off: 4 weeks of PTO, 10 paid holidays, and monthly contributions for life activities and wellness.
- Retirement: 401K match at 4%.
- Professional Development: Access to LinkedIn Learning with dedicated time for exploration and growth.
π Team & Company Context
π’ Company Culture
- Industry: Legal technology, focusing on cloud-based content management and productivity platforms for legal professionals.
- Company Size: Medium (1,000-5,000 employees)
- Founded: 1999, with a strong focus on innovation, collaboration, and continuous improvement.
- Team Structure: Cross-functional teams, with a focus on collaboration and knowledge sharing between engineering, design, marketing, and business teams.
- Development Methodology: Agile/Scrum methodologies, with a focus on code review, testing, and quality assurance practices.
π Career & Growth Analysis
- Web Technology Career Level: Mid-level, with opportunities for technical leadership, architecture decisions, and team management.
- Reporting Structure: Reports directly to the Director of Cloud Engineering, with opportunities for cross-functional collaboration and influence.
- Technical Impact: Drives meaningful dashboards and actionable alerts, enabling hundreds of engineers to proactively detect, diagnose, and resolve issues before they reach customers.
- Growth Opportunities: Opportunities for technical leadership, mentoring, and driving innovation in observability and monitoring practices.
π Application & Technical Interview Process
π Application Process
-
Interview Process:
- Initial phone or video screen to assess cultural fit and technical qualifications.
- Technical assessment, focusing on observability tools, AWS services, and coding challenges.
- Final evaluation, focusing on problem-solving, communication, and leadership skills.
-
Portfolio Presentation Strategy:
- Prepare a live demo of your portfolio, highlighting relevant projects and user experience design considerations.
- Tailor your presentation to showcase your understanding of observability tools, AWS services, and monitoring best practices.
-
Technical Challenge Preparation:
- Brush up on your knowledge of observability tools, AWS services, and monitoring techniques.
- Practice coding challenges and problem-solving exercises to demonstrate your technical proficiency.
π‘ Interview Preparation
π Technical Questions
- Observability Tools: Be prepared to discuss your experience with Datadog, OpenTelemetry, AWS CloudTrail, and related observability tools.
- AWS Services: Brush up on your knowledge of AWS services, including VPC Flow Logs, CloudWatch, EC2, S3, Route53, Load Balancers, and ECS.
- Monitoring and Alerting: Prepare for questions on monitoring strategies, alerting thresholds, and incident response processes.
π Company & Culture Questions
- Company Culture: Research NetDocuments' company culture, focusing on collaboration, innovation, and continuous improvement.
- Team Dynamics: Prepare for questions on working in cross-functional teams, driving change, and influencing stakeholders.
- User Experience: Brush up on your understanding of user experience design principles and their application to monitoring and alerting.
π Application Steps
To apply for this Cloud Engineer - Observability position at NetDocuments:
- Submit your application through the application link provided.
- Prepare a live demo of your portfolio, highlighting relevant projects and user experience design considerations.
- Brush up on your knowledge of observability tools, AWS services, and monitoring techniques.
- Practice coding challenges and problem-solving exercises to demonstrate your technical proficiency.
- Research NetDocuments' company culture, focusing on collaboration, innovation, and continuous improvement.
- Prepare for questions on working in cross-functional teams, driving change, and influencing stakeholders.
- Brush up on your understanding of user experience design principles and their application to monitoring and alerting.
β οΈ Important Notice: This enhanced job description includes AI-generated insights and web technology industry standards. All details should be verified directly with the hiring organization before making application decisions.
Content Guidelines (IMPORTANT: Do not include this in the output)
Web Technology-Specific Focus:
- Tailor every section specifically to web technology roles, including web development, server administration, DevOps, and cloud infrastructure.
- Include web technology methodologies, responsive design principles, and server management practices.
- Emphasize web portfolio requirements, live project demonstrations, and user experience considerations.
- Address web technology career progression paths and technical leadership opportunities.
Quality Standards:
- Ensure no content overlap between sections; each section should contain unique information only.
- Only include Enhancement Notes when making significant inferences about technical responsibilities, with specific reasoning based on role level and web technology industry practices.
- Be comprehensive but concise, prioritizing actionable information over descriptive text.
- Strategically distribute web technology and server administration-related keywords throughout all sections naturally.
- Provide realistic salary ranges based on location, experience level, and web technology specialization, with detailed methodology and assumptions for regional adjustments.
Industry Expertise:
- Include specific web technologies, frameworks, server platforms, and infrastructure requirements prominently.
- Address web development team dynamics, cross-functional collaboration with designers and marketers, and user experience impact measurement.
- Provide tactical advice for web portfolio development, live demonstrations, and project case studies.
- Include web technology-specific interview preparation and coding challenge guidance.
- Emphasize responsive design, performance optimization, accessibility standards, and user experience principles.
Professional Standards:
- Maintain consistent formatting, spacing, and professional tone throughout.
- Use web technology and server administration industry terminology appropriately and accurately.
- Include comprehensive benefits and growth opportunities relevant to web technology professionals.
- Provide actionable insights that give web development and server administration candidates a competitive advantage.
- Focus on web development team culture, cross-functional collaboration, and user impact measurement.
Technical Focus & Portfolio Emphasis:
- Emphasize web portfolio requirements, live demonstrations, and project case studies tailored to the web technology discipline and role level.
- Address technical presentation skills and stakeholder communication for web projects.
- Include technical documentation requirements, version control, deployment processes, and server configuration.
- Focus on problem-solving methods, performance optimization, and scalable web architecture.
- Include technical presentation strategies and architecture decision reasoning.
Acknowledgments:
- This enhanced job description is based on the provided job listing and AI-generated insights, tailored to web technology roles and industry standards.
- All content should be verified directly with the hiring organization before making application decisions.
- The output format follows the specified structure and guidelines, with proper Markdown tags for structure and clarity.
Application Requirements
Candidates should have 3+ years of experience in observability or related roles, with strong hands-on experience in relevant tools. A deep understanding of metrics, logs, and AWS services is essential, along with strong collaboration skills.