Senior Engineer: Kubernetes Platforms
CoreWeave is a specialized cloud provider focused on GPU accelerated use cases including VFX, AI/ML, Batch Processing and Real Time Experiences. We support countless AI/ML services in the text to image, NLP and broader AI/ML space, reducing client’s infrastructure management requirements with our Kubernetes based serverless GPU cloud offerings.
Job Description
About the role:
The Kubernetes Platforms Team supports and advances the heart of industry at CoreWeave. Nearly every CoreWeave product and technology utilizes Kubernetes in some capacity and with tens of thousands of Kublets and their associated control planes and supporting services, the Kubernetes Platforms Team sets the tone for reliability, efficiency, and simplicity. This team is responsible for the software development and operations lifecycle of all Kubernetes clusters across CoreWeave’s multiple levels of Kube-ception and has a major role in the integration of custom components into the CoreWeave Kubernetes product experience.
We are seeking a Senior Engineer to join the Kubernetes Platforms Team and help us grow our orchestration platforms in scale, reliability, and featureset. This individual will join a team of 6-10 mixed-skill engineers and have the opportunity to work on the full gamut of rewarding challenges that come with the business of building a cloud in a communicative, supportive, and high-performing environment. As a member of the Kubernetes Platforms Team, you would have the opportunity to:
- Design and implement solutions to fascinating problems of scale for provisioning and managing bare-metal and virtual Kubernetes clusters with Cluster API and other tools.
- Create custom Kubernetes interfaces, gateways, and orchestrators all managed using Gitops tools such as Argo CD and Helm.
- Improve the performance, security, and reliability of our Kubernetes products and participate in the Kubernetes Platforms on-call rotation.
- Develop dashboards, alerts, and insights into the customer experience using Grafana-ecosystem tools such as Mimir and Loki.
- Grow, change, invest in your teammates, be invested-in, share your ideas, listen to others, be curious, have fun, and, above all, be yourself.
Wondering if you’re a good fit? We believe in investing in our people, and value candidates who can bring their own diversified experiences to our teams – even if you aren't a 100% skill or experience match. Here are some qualities we’ve found compatible with our team. If a portion of this resonates with you, we’d love to talk.
- You have four or more years of experience in a software or infrastructure engineering industry
- You have experience operating services in production and at scale.
- You’re comfortable with the idea of using Go as your primary programming language.
- You have some experience using Kubernetes with a conceptual understanding of its major components and/or have operated Kubernetes clusters with some form of automation.
- You know your way around a Linux distro, shell scripting, and/or the Linux storage and networking stacks.
- You’re interested in reliability engineering concepts such as the different types of testing, progressive deployments, error budgets, the role observability, and fault-tolerant design.
- You can transform problems in elastic architectures, decompose them into achievable tasks, and socialize both to your teammates.
- You’re excited about being part of a team of diverse perspectives and backgrounds that believe in tackling challenges, growing hand in hand, and winning together.
Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $165,000/year in our lowest geographic market up to $220,000/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.
About the role:
The Kubernetes Platforms Team supports and advances the heart of industry at CoreWeave. Nearly every CoreWeave product and technology utilizes Kubernetes in some capacity and with tens of thousands of Kublets and their associated control planes and supporting services, the Kubernetes Platforms Team sets the tone for reliability, efficiency, and simplicity. This team is responsible for the software development and operations lifecycle of all Kubernetes clusters across CoreWeave’s multiple levels of Kube-ception and has a major role in the integration of custom components into the CoreWeave Kubernetes product experience.
We are seeking a Senior Engineer to join the Kubernetes Platforms Team and help us grow our orchestration platforms in scale, reliability, and featureset. This individual will join a team of 6-10 mixed-skill engineers and have the opportunity to work on the full gamut of rewarding challenges that come with the business of building a cloud in a communicative, supportive, and high-performing environment. As a member of the Kubernetes Platforms Team, you would have the opportunity to:
Design and implement solutions to fascinating problems of scale for provisioning and managing bare-metal and virtual Kubernetes clusters with Cluster API and other tools.
Create custom Kubernetes interfaces, gateways, and orchestrators all managed using Gitops tools such as Argo CD and Helm.
Improve the performance, security, and reliability of our Kubernetes products and participate in the Kubernetes Platforms on-call rotation.
Develop dashboards, alerts, and insights into the customer experience using Grafana-ecosystem tools such as Mimir and Loki.
Grow, change, invest in your teammates, be invested-in, share your ideas, listen to others, be curious, have fun, and, above all, be yourself.
Wondering if you’re a good fit? We believe in investing in our people, and value candidates who can bring their own diversified experiences to our teams – even if you aren't a 100% skill or experience match. Here are some qualities we’ve found compatible with our team. If a portion of this resonates with you, we’d love to talk.
You have four or more years of experience in a software or infrastructure engineering industry
You have experience operating services in production and at scale.
You’re comfortable with the idea of using Go as your primary programming language.
You have some experience using Kubernetes with a conceptual understanding of its major components and/or have operated Kubernetes clusters with some form of automation.
You know your way around a Linux distro, shell scripting, and/or the Linux storage and networking stacks.
You’re interested in reliability engineering concepts such as the different types of testing, progressive deployments, error budgets, the role observability, and fault-tolerant design.
You can transform problems in elastic architectures, decompose them into achievable tasks, and socialize both to your teammates.
You’re excited about being part of a team of diverse perspectives and backgrounds that believe in tackling challenges, growing hand in hand, and winning together.
Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $165,000/year in our lowest geographic market up to $220,000/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.