Site Reliability Engineer

Cambix
Full_time€50k-70k/year (EUR)

📍 Job Overview

  • Job Title: Site Reliability Engineer
  • Company: Cambix
  • Location: Vilnius, Lithuania
  • Job Type: Hybrid (On-site & Remote)
  • Category: DevOps & Infrastructure
  • Date Posted: 2025-07-21
  • Experience Level: 10+ years
  • Remote Status: Hybrid

🚀 Role Summary

  • 📝 Enhancement Note: This role combines the responsibilities of managing and maintaining a scalable, resilient cloud infrastructure with leading the response to operational and security incidents. It requires a strong background in both DevOps and Site Reliability Engineering.

  • 🔒 Secure and high-performance cloud infrastructure management using AWS (EKS, RDS)

  • 🔄 CI/CD pipeline management and automation using GitHub Actions

  • 🏗 Infrastructure as Code (IaC) development and maintenance using Terraform

  • 🛡️ Incident response and management, ensuring business continuity and compliance with regulations like DORA and MiCA

💻 Primary Responsibilities

  • 🛠️ Manage, automate, and maintain production infrastructure on AWS, including EKS, RDS, and other services
  • 🔄 Develop, manage, and improve CI/CD pipelines to ensure smooth and reliable deployments
  • 🏗 Own and advance IaC practices using Terraform to ensure infrastructure is reproducible, scalable, and secure
  • 🛡️ Lead incident response efforts, from initial detection to post-mortem analysis and implementation of corrective actions
  • 🔍 Monitor and manage system capacity and performance, ensuring high availability and low latency for users
  • 🔐 Implement and enforce security best practices across the infrastructure, including network segmentation, secret management, and access controls

🎓 Skills & Qualifications

Education

  • 📚 Relevant degree or equivalent experience in Computer Science, Engineering, or a related field

Experience

  • 🕒 10+ years of experience in Site Reliability Engineering, DevOps, or a similar role

Required Skills

  • 🌟 Proven expertise in AWS, particularly EKS, RDS, VPC, IAM, and security services like GuardDuty and Security Hub
  • 🐳 Strong proficiency with containerization (Docker) and Kubernetes orchestration in a production environment
  • 🏗 Expert-level knowledge of Infrastructure as Code, with extensive experience using Terraform
  • 🔄 Solid experience in building and managing CI/CD pipelines, preferably with GitHub Actions
  • 🛡️ Demonstrated experience leading incident response efforts, including incident command, diagnostics, and post-incident review
  • 🔗 A strong understanding of networking principles, including VPCs, subnets, load balancing (NLB), and edge security (WAF, DDoS protection) with platforms like Cloudflare
  • 🔍 Familiarity with modern monitoring, logging, and observability principles and tools

Preferred Skills

  • 🏢 Experience working in a highly regulated environment, such as FinTech, banking, or crypto services
  • 🛠️ Familiarity with the wider tech stack, including Vercel, Fireblocks, and NGINX
  • 🔐 Experience with security scanning tools for containers and dependencies (e.g., Trivy)
  • 🔑 Knowledge of authentication mechanisms like JWE and best practices for secrets management (e.g., credential stores, AWS KMS)
  • 🛠️ Scripting skills in languages such as Python or Bash for automation tasks

📊 Web Portfolio & Project Requirements

  • 🏗 Demonstrate experience with Infrastructure as Code (IaC) using Terraform and other relevant tools
  • 🔄 Showcase your CI/CD pipeline management skills with real-world examples and live demos
  • 🛡️ Highlight your incident response experience with case studies and walkthroughs of your approach to incident management
  • 🔍 Provide examples of system monitoring and performance optimization techniques you've implemented in previous roles

💵 Compensation & Benefits

Salary Range

  • 💰 50,000 - 70,000 EUR per year gross (research-based on regional web development industry standards and cost of living in Vilnius)

Benefits

  • 💰 Competitive salary and benefits package
  • 🌱 Flexible working arrangements

🎯 Team & Company Context

🏢 Company Culture

  • 🌐 Industry: FinTech, focusing on cryptocurrency exchange services
  • 🏢 Company Size: Medium to large (100-500 employees)
  • 📅 Founded: Not specified (but growing and established in the industry)
  • 🌟 Team Structure: Collaborative and dynamic, with a strong focus on security and resilience
  • 🔄 Development Methodology: Agile, with a focus on continuous integration, delivery, and improvement

📈 Career & Growth Analysis

  • 🌱 Web Technology Career Level: Senior to Staff-level Site Reliability Engineer or DevOps Engineer
  • 💼 Reporting Structure: Reports directly to the Head of Engineering or a similar role, with cross-functional collaboration with development, security, and compliance teams
  • 💡 Technical Impact: Responsible for the reliability, performance, and security of the UNX platform, ensuring high availability and low latency for users

🌐 Work Environment

  • 🏢 Office Type: Hybrid (On-site & Remote), with flexible working arrangements
  • 📍 Office Location(s): Vilnius, Lithuania
  • 💻 Workspace Context: Modern, collaborative workspace with a focus on security and resilience
  • 🕒 Work Schedule: Full-time (40 hours per week), with flexible hours and remote work options

📄 Application & Technical Interview Process

  • 📝 Interview Process:

    • 🔄 Technical assessment: Demonstrate your expertise in AWS, Terraform, and incident response with hands-on exercises and case studies
    • 💼 Cultural fit assessment: Showcase your communication skills and cultural fit with the team through behavioral and situational interviews
    • 💡 Final evaluation: Present your findings from the technical assessment and discuss your approach to incident management and infrastructure security
  • 📝 Portfolio Review Tips:

    • 🏗 Highlight your Infrastructure as Code (IaC) experience with Terraform and other relevant tools
    • 🔄 Showcase your CI/CD pipeline management skills with real-world examples and live demos
    • 🛡️ Demonstrate your incident response experience with case studies and walkthroughs of your approach to incident management
    • 🔍 Provide examples of system monitoring and performance optimization techniques you've implemented in previous roles
  • 📝 Technical Challenge Preparation:

    • 🌟 Familiarize yourself with the AWS services relevant to the role, including EKS, RDS, and security services like GuardDuty and Security Hub
    • 🛠️ Brush up on your Terraform skills and prepare for hands-on exercises demonstrating your Infrastructure as Code experience
    • 🛡️ Review incident response best practices and prepare for case studies and role-playing exercises
  • 🔑 ATS Keywords: Site Reliability Engineering, DevOps, AWS, EKS, RDS, Terraform, CI/CD, Incident Response, Security, Docker, Kubernetes, Networking, Monitoring, Logging, Observability, Python, Bash, FinTech, Crypto, Cloudflare, DORA, MiCA

📌 Application Steps

To apply for this Site Reliability Engineer position:

  1. 📝 Tailor your resume and cover letter to highlight your relevant experience and skills for this role
  2. 📝 Prepare for the technical assessment by brushing up on your AWS, Terraform, and incident response skills
  3. 📝 Research Cambix and the UNX platform to demonstrate your understanding of the company and its services
  4. 📝 Submit your application through the provided link, including your resume, cover letter, and any relevant portfolio materials

⚠️ Important Notice: This enhanced job description includes AI-generated insights and web development industry-standard assumptions. All details should be verified directly with the hiring organization before making application decisions.

Application Requirements

Candidates should have over 10 years of experience in Site Reliability Engineering or a similar role, with deep expertise in AWS and Infrastructure as Code using Terraform. Strong incident response skills and familiarity with cloud-native technologies are essential.