Posted on 
Jun 6, 2024

Senior Engineer, Kubernetes Services

Roseland
Mid-Senior ICs
Engineering
CoreWeave
CoreWeave
CoreWeave
Private
101-250
Software, Security & Developer Tools

CoreWeave is a specialized cloud provider focused on GPU accelerated use cases including VFX, AI/ML, Batch Processing and Real Time Experiences. We support countless AI/ML services in the text to image, NLP and broader AI/ML space, reducing client’s infrastructure management requirements with our Kubernetes based serverless GPU cloud offerings.

Job Description

About the role:

The Kubernetes Services Team ensures the reliability, scalability, and usability of the core application management platform within CoreWeave. This team is responsible for the services, processes and tooling that enable internal consumers to install, upgrade, and manage the complete lifecycle of applications within CoreWeave’s Kubernetes-native infrastructure platform. Members of this team will be challenged to combine the tools and patterns of the modern Kubernetes ecosystem with learnings gained from experiences with CI/CD, distributed systems orchestration, traffic shaping, progressive rollouts, application packaging, channel management, and SRE fundamentals to develop CoreWeave’s next-generation solution for internal service delivery.

We are seeking a Senior Engineer to join the Kubernetes Services Team and help us develop and advance the tools and processes used to deliver internal services reliably, at-scale, and with minimal engineer friction. This individual will join a team of 6-8 mixed-skill engineers and have the opportunity to work on the full gamut of rewarding challenges that come with the business of building a cloud in a communicative, supportive, and high-performing environment. As a member of the Kubernetes Services Team, you would have the opportunity to:

  • Design and implement solutions to fascinating problems of scale for orchestrating the services that run one of the most exciting clouds in the world, today.
  • Integrate the right mix of innovative and battle-hardened tools and practices of the Kubernetes service delivery ecosystem into how CoreWeave deploys and manages services at scale.
  • Coordinate closely with other teams in the Kubernetes Engineering division and throughout CoreWeave to drive adoption of best practices.
  • Create custom Kubernetes interfaces, gateways, and orchestrators such as ArgoCD to enable declarative, reliable delivery of applications at scale.
  • Improve the performance, security, and reliability of our Kubernetes products and participate in the Kubernetes Services on-call rotation.
  • Develop dashboards, alerts, and insights into the customer experience using Grafana-ecosystem tools such as Mimir and Loki.
  • Grow, change, invest in your teammates, be invested-in, share your ideas, listen to others, be curious, have fun, and, above all, be yourself.

Wondering if you’re a good fit? We believe in investing in our people, and value candidates who can bring their own diversified experiences to our teams – even if you aren't a 100% skill or experience match. Here are some qualities we’ve found compatible with our team. If a portion of this resonates with you, we’d love to talk.

  • You have four or more years of experience in a software or infrastructure engineering industry
  • You have experience operating services in production and at scale.
  • You’re comfortable with the idea of using Go as your primary programming language.
  • You have some experience using Kubernetes with a conceptual understanding of its major components and/or have operated Kubernetes clusters with some form of automation.
  • You know your way around a Linux distro, shell scripting, and/or the Linux storage and networking stacks.
  • You’re interested in reliability engineering concepts such as the different types of testing, progressive deployments, error budgets, the role observability, and fault-tolerant design.
  • You can transform problems in elastic architectures, decompose them into achievable tasks, and socialize both to your teammates.
  • You’re excited about being part of a team of diverse perspectives and backgrounds that believe in tackling challenges, growing hand in hand, and winning together.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $165,000 to $200,000 annually. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. 

Hybrid Workplace

Successful candidates will be expected to attend onboarding training at our NJ Headquarters within their first several weeks of employment, with subsequent quarterly travel requirements of 1 week duration.

If you reside within a 30-mile radius of our New Jersey, New York, or Philadelphia offices, we're excited for you to join us at the office at least three times a week, recognizing the significance we place on fostering connections, collaboration, and creativity within our office culture. Our commitment to operating as a hybrid workplace underscores our dedication to enabling our employees to tailor their work-life balance to their individual preferences.

CoreWeave is a specialized cloud provider, delivering a massive scale of GPU compute resources on top of the industry’s fastest and most flexible infrastructure. CoreWeave builds cloud solutions for compute intensive use cases — VFX and rendering, machine learning and AI, batch processing, and Pixel Streaming — that are up to 35 times faster and 80% less expensive than the large, generalized public clouds. Learn more at www.coreweave.com.

About the role:

The Kubernetes Services Team ensures the reliability, scalability, and usability of the core application management platform within CoreWeave. This team is responsible for the services, processes and tooling that enable internal consumers to install, upgrade, and manage the complete lifecycle of applications within CoreWeave’s Kubernetes-native infrastructure platform. Members of this team will be challenged to combine the tools and patterns of the modern Kubernetes ecosystem with learnings gained from experiences with CI/CD, distributed systems orchestration, traffic shaping, progressive rollouts, application packaging, channel management, and SRE fundamentals to develop CoreWeave’s next-generation solution for internal service delivery.

We are seeking a Senior Engineer to join the Kubernetes Services Team and help us develop and advance the tools and processes used to deliver internal services reliably, at-scale, and with minimal engineer friction. This individual will join a team of 6-8 mixed-skill engineers and have the opportunity to work on the full gamut of rewarding challenges that come with the business of building a cloud in a communicative, supportive, and high-performing environment. As a member of the Kubernetes Services Team, you would have the opportunity to:

  • Design and implement solutions to fascinating problems of scale for orchestrating the services that run one of the most exciting clouds in the world, today.
  • Integrate the right mix of innovative and battle-hardened tools and practices of the Kubernetes service delivery ecosystem into how CoreWeave deploys and manages services at scale.
  • Coordinate closely with other teams in the Kubernetes Engineering division and throughout CoreWeave to drive adoption of best practices.
  • Create custom Kubernetes interfaces, gateways, and orchestrators such as ArgoCD to enable declarative, reliable delivery of applications at scale.
  • Improve the performance, security, and reliability of our Kubernetes products and participate in the Kubernetes Services on-call rotation.
  • Develop dashboards, alerts, and insights into the customer experience using Grafana-ecosystem tools such as Mimir and Loki.
  • Grow, change, invest in your teammates, be invested-in, share your ideas, listen to others, be curious, have fun, and, above all, be yourself.

Wondering if you’re a good fit? We believe in investing in our people, and value candidates who can bring their own diversified experiences to our teams – even if you aren't a 100% skill or experience match. Here are some qualities we’ve found compatible with our team. If a portion of this resonates with you, we’d love to talk. 

  • You have four or more years of experience in a software or infrastructure engineering industry
  • You have experience operating services in production and at scale.
  • You’re comfortable with the idea of using Go as your primary programming language.
  • You have some experience using Kubernetes with a conceptual understanding of its major components and/or have operated Kubernetes clusters with some form of automation.
  • You know your way around a Linux distro, shell scripting, and/or the Linux storage and networking stacks.
  • You’re interested in reliability engineering concepts such as the different types of testing, progressive deployments, error budgets, the role observability, and fault-tolerant design.
  • You can transform problems in elastic architectures, decompose them into achievable tasks, and socialize both to your teammates.
  • You’re excited about being part of a team of diverse perspectives and backgrounds that believe in tackling challenges, growing hand in hand, and winning together.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $165,000 to $200,000 annually. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. 

Hybrid Workplace

Successful candidates will be expected to attend onboarding training at our NJ Headquarters within their first several weeks of employment, with subsequent quarterly travel requirements of 1 week duration.

If you reside within a 30-mile radius of our New Jersey, New York, or Philadelphia offices, we're excited for you to join us at the office at least three times a week, recognizing the significance we place on fostering connections, collaboration, and creativity within our office culture. Our commitment to operating as a hybrid workplace underscores our dedication to enabling our employees to tailor their work-life balance to their individual preferences.

Receive Tech Ladies'
newest jobs in your inbox,
every week.

Join Tech Ladies for full-access to the job board, member-only events, and more!

If you're already a member, we haven't forgotten you. We promise. It's a new system. If you fill out the form once, it'll remember you going forward. Apologies for the inconvenience.

Roseland
Roseland
No items found.
Engineering
Engineering
Hybrid
Hybrid