DevOps Team Leader

Cellebrite • Remote, Virginia, United States • 2d ago

Cellebrite

Title: DevOps Team Leader
Location: Remote, VA, US

About Cellebrite:

Cellebrites (Nasdaq: CLBT) mission is to enable its global customers to protect and save lives by enhancing digital investigations and intelligence gathering to accelerate justice in communities around the world. Cellebrites AI-powered Digital Investigation Platform enables customers to lawfully access, collect, analyze and share digital evidence in legally sanctioned investigations while preserving data privacy. Thousands of public safety organizations, intelligence agencies and businesses rely on Cellebrites digital forensic and investigative solutionsavailable via cloud, on-premises and hybrid deploymentsto close cases faster and safeguard communities.

To learn more, visit us at www.cellebrite.com,https://investors.cellebrite.com/investors and find us on social media @Cellebrite.

We are looking for a highly talented and hands-on DevOps Team Leader to join our team and drive infrastructure automation, CI/CD excellence, cloud scalability, and production reliability at scale. This role requires a strong technical leader with deep production ownership experience, SRE mindset, and the ability to lead a DevOps/SRE team responsible for mission critical systems.

What We Expect From You:

Lead and manage the DevOps/SRE team responsible for production environments, providing technical leadership and operational excellence.

Mentor engineers and help elevate the DevOps/SRE teams capabilities.

Own production AWS accounts, including availability, stability, security, compliance, and cost efficiency.

Define, implement, and continuously improve SRE practices, including:

o SLAs and SLOs aligned with business and customer expectations

o Error budgets to balance reliability and delivery velocity

o Production readiness reviews and reliability standards

Take a hands-on approach to designing, developing, and modifying CI/CD pipelines, automation processes, and cloud environments.

Own and optimize CI/CD processes for production-grade cloud environments, focusing on AWS Commercial & AWS GovCloud.

Design and maintain highly available, resilient, and scalable production infrastructure, with a strong emphasis on networking and AWS Landing Zone architectures.

Act as a production owner, leading:

o Incident response and escalation

o Post-incident reviews (RCA / postmortems)

o Preventative reliability improvements

Collaborate closely with development, security, compliance, and cloud engineering teams to ensure reliability is integrated throughout the SDLC.

Enforce SDLC and DevOps best practices, including clean code, version control, testing, and high-quality automation.

Improve and shorten development and deployment cycles while protecting production stability through controlled rollouts, canary deployments, and rollback strategies.

Drive observability standards across production systems (monitoring, alerting, logging, metrics).

Troubleshoot and resolve complex, realtime production incidents, ensuring system stability, performance, and compliance.

What Youll Love About This Role:

The opportunity to define and embed SRE standards (SLOs, error budgets, reliability metrics) across a large-scale platform.

Ownership of mission-critical, highcritical, highsecuritysecurity production environments, including regulated cloud workloads.

Working in an environment where production reliability, security, and automation are top priorities.

High-impact leadership role influencing architecture, availability, and delivery velocity across the organization.

Office Location:
Vienna

Requirements:

• 10+ years of DevOps experience, with 5+ years leading a DevOps or SRE team responsible for production systems.

• Proven hands-on experience owning and operating production AWS accounts, including:

o On-call rotations

o Incident management

o SLA/SLO ownership

• Strong hands-on experience with:

o CI/CD tools (GitHub Actions, Artifactory (JFROG), ECR)

o Containerization (Docker, Docker Compose, Kubernetes)

o Cloud platforms (AWS & AWS Gov - must)

o Linux administration, IaC, and automation

o Scripting and/or languages such as Bash, Python, NodeJS, or similar languages.

o Networking concepts and AWS Landing Zone architectures

• Strong understanding of Site Reliability Engineering concepts, including monitoring strategies, alert fatigue reduction, and reliability KPIs.

• Experience building production-safe CI/CD pipelines with quality gates and automated validations.

• Software development experience with focus on automation, maintainability, and quality.

• Experience working in Agile/Scrum teams.

• A proactive leader with a production-first and reliability-driven mindset.

Nice to Have / Advantage:

• Experience with FedRAMP, IRAP, or other regulated cloud compliance frameworks.

• Exposure to highly regulated, air-gapped or government cloud environments.

• Experience defining enterprise-level reliability metrics and reporting to leadership.

• Familiarity with cost optimization practices (FinOps) in production environments.

Equal employment opportunity, including veterans and individuals with disabilities.

PI282129964