wagey.ggwagey.gg
Open Tech JobsCompaniesPricing
Log InGet Started Free
Jobs/DevOps Engineer Role/Devops Engineer

Devops Engineer

cosineLondon, England, United Kingdom$170k – $114k2w ago
In OfficeSeniorEMEACloud ComputingDevOps EngineerAWSGCPAzureKubernetesChange Management

Upload My Resume

Drop here or click to browse · PDF, DOCX, DOC, RTF, TXT

Apply in One Click

Requirements

• 5+ years building and operating production infrastructure on a major cloud (AWS, GCP, or Azure). • Significant hands-on experience running Kubernetes in production (EKS/GKE/AKS or self-managed): • Cluster upgrades, autoscaling, node group design, and multi-env setups. • Helm or similar for packaging services. • Think in infrastructure-as-code • Deep experience with IaC tools (Pulumi, Terraform, CDK, or similar). • Comfortable managing infra changes via code review, CI, and automated rollouts. • AWS primitives like EKS, ECS/Fargate, ECR, SQS, ElastiCache/Redis. • Argo CD or other GitOps tools for Kubernetes. • On-prem, air-gapped, or regulated industry deployments (e.g. finance, healthcare). • AI/ML infrastructure (GPU workloads, model hosting, feature stores). • Prior experience as an early infra / platform hire at a startup. • ___________________________________________________________________________

Responsibilities

• Own core infrastructure • Design, operate, and evolve our Kubernetes-based platform (EKS or similar), including cluster topology, node groups, autoscaling, and multi-environment isolation. • Manage supporting cloud resources: container registries, load balancers, queues, caches, and data infra needed to run our APIs and agents. • Build the deployment & tooling layer • Design and maintain CI/CD pipelines for image builds and infra rollouts (e.g. Pulumi/Terraform + Helm/Docker). • Implement safe rollout strategies (blue/green, canary, staged rollouts) and fast rollback paths. • Build internal tools and abstractions that make it easy for product teams to self-serve infra safely. • Own reliability & operations (SRE-ish) • Define and track SLOs/SLIs for key services (latency, error rates, availability). • Improve our observability stack (metrics, logs, traces, alerts) so issues are obvious, actionable, and debuggable. • Participate in the on-call rotation, lead incident response when needed, and drive blameless post-mortems and fixes. • Shape networking & security • Design and maintain networking: VPCs, subnets, ingress/egress, service meshes / L7 routing, DNS, and TLS. • Implement least-privilege access via IAM, secure secret management, and hardened configurations for multi-tenant and isolated customer environments. • Help design patterns for secure enterprise and on-prem / regulated deployments. • Partner with product & research • Work closely with application, ML, and research teams to understand their needs and translate them into reusable infra building blocks. • Provide guidance on “how to run this in production” — capacity planning, failure modes, and operational readiness reviews. • ___________________________________________________________________________

Benefits

• ___________________________________________________________________________ • COSINE AT A GLANCE • At Cosine, we’re building autonomous AI engineers that plan, write, and ship code inside real development workflows. • Cosine is designed for on-premise and virtual private cloud (VPC) deployments, including fully air-gapped environments. We build our agent tooling entirely in-house and post-train open-source models to deliver reliable, enterprise-grade coding performance in security-critical settings. • In 2024, Cosine achieved a 72% score on OpenAI’s SWE-Lancer benchmark, placing us among the strongest real-world software-engineering AI systems evaluated. • YC-backed and well-funded, Cosine was founded by experienced operators focused on building dependable, production-grade AI. • This role is based in our Hoxton office, five days a week, because close collaboration, fast feedback, and shared context matter for the problems we’re solving.

Similar Jobs

Software Engineer II (Full Stack, Backend-leaning)1h ago
Jerry.aiJerry.ai·Remote - Toronto, Ontario, Canada·Equity
RemoteNAMidInsuranceCloud ComputingSoftware EngineerFull StackSlackAsanaCustomer RetentionReact NativeReactNext.jsTypeScriptRedisExpoClickHouseAWSNestJS
Looking for Fintech Jobs?1h ago
SoFiSoFi·Engineering WA - Seattle·$173k – $297k/year
In OfficeNASeniorFintechPaymentsBlockchain Security AnalystJavaSpringAWSPostgreSQLKotlinPythonKafkaDockerKubernetesTerraformReportingFinancial Reporting
Intermediate Full Stack Software Developer, Platform1h ago
KOHOKOHO·Remote - Canada·$73k – $95k/year
RemoteNAMidCloud ComputingSenior Full Stack DeveloperGoSQLPythonFull StackClaudeExcelJavaScriptAngularReactKubernetesIonicAWSDatadog
Senior Solutions Engineer1h ago
SprintoSprinto·Remote - India
RemoteAPACSeniorCybersecurityCloud ComputingSolutions EngineerSenior Software EngineerStakeholder ManagementProspectingProduct MarketingCSMGCPAWSAzureObjection HandlingMid-Market
Associate, Credit Card Growth5h ago
Wealthsimple TechnologiesWealthsimple Technologies·Remote - Canada·$111k – $138k/year + Equity
RemoteNAJuniorFintechCloud ComputingTravel AgentGrowth LeadReportingSQLAirflowAWSPythondbtRedshiftFinancial Modeling

Stop filling. Start chilling.Start chilling.

Get Started Free

No credit card. Takes 10 seconds.

© 2026 Dominic Morris. All rights reserved.·Privacy·Terms·