Staff Devops Engineer
Upload My Resume
Drop here or click to browse · PDF, DOCX, DOC, RTF, TXT
Requirements
• 6+ years of experience in DevOps, SRE, or infrastructure engineering, with a track record of owning production systems end to end • Deep expertise with Kubernetes, container orchestration, and cloud-native architecture on AWS • Strong CI/CD experience: you've built pipelines that teams actually trust and use daily • Fluent in infrastructure-as-code (Terraform, Pulumi, or equivalent) and GitOps workflows • Solid understanding of networking, DNS, load balancing, and security fundamentals • Experience with monitoring and observability tooling (Datadog, Grafana, Prometheus, or similar) • Comfortable scripting in Python, Bash, or Go for automation and tooling • Experience operating databases in production (PostgreSQL, Redis, or similar) • Strong opinions on reliability, incident response, and on-call practices, but pragmatic about when to move fast • Bonus: experience with email infrastructure at scale (deliverability, domain/IP management, SMTP) • Bonus: experience supporting ML/AI workloads in production • Bonus: startup experience where you built the infrastructure function from scratch
Responsibilities
• Own and manage Kubernetes cluster management for orchestration of AI agents at scale across AWS infrastructure. • Develop CI/CD pipelines to enable frequent releases with automated testing, staging environments, rollback strategies, feature flags. • Construct the observability layer comprising monitoring, alerting, logging, and tracing systems for real-time insights into AI agent performance and pipeline processes. • Implement infrastructure supporting email deliverability across thousands of domains and mailboxes with sender reputation management, domain warming strategies, DNS configuration, IP address handling. • Establish security posture by managing secrets, access controls, network policies, vulnerability scanning to ensure SOC 2 compliance for customer data protection. • Optimize cloud costs associated with AI workloads and pipelines through efficient infrastructure design and spend visibility tools. • Enhance developer experience via provisioning of local development environments, ensuring fast builds, reliable deployments, and comprehensive documentation accessibility.
Benefits
• You're building the infrastructure layer for AI employees. This isn't a standard SaaS platform. Ava makes autonomous decisions, sends real emails to real people, and operates 24/7 on behalf of customers. The reliability bar is high because the AI doesn't stop working at 6pm. • You're building the infrastructure layer for AI employees. • The surface area is huge. Email infrastructure, real-time AI orchestration, a database of hundreds of millions of leads, data pipelines, a multi-product platform. You won't get bored. • The surface area is huge. • You'll define the practice, not just execute it. This is a staff-level hire where you'll set the standards for how Artisan builds, deploys, monitors, and secures its infrastructure. Your decisions will shape how the engineering team operates for years. • You'll define the practice, not just execute it. • Competitive salary + meaningful equity. We compensate for impact. • Details • Details • Location: San Francisco, New York, or Remote (US) • Team: Engineering • Reports to: Sam Stallings, CPTO