Posted on 
Jun 18, 2024

Infrastructure Engineer

Roseland
Mid-Senior ICs
Engineering, IT
CoreWeave
CoreWeave
CoreWeave
Private
101-250
Software, Security & Developer Tools

CoreWeave is a specialized cloud provider focused on GPU accelerated use cases including VFX, AI/ML, Batch Processing and Real Time Experiences. We support countless AI/ML services in the text to image, NLP and broader AI/ML space, reducing client’s infrastructure management requirements with our Kubernetes based serverless GPU cloud offerings.

Job Description

About this Role:

CoreWeave is seeking a highly skilled and motivated Infrastructure Engineer to join our Hardware Engineering Development team (METALDEV), reporting to the Director of Compute Architecture. In this role, you will play a crucial part in the development of the services that automate and test our server infrastructure. You will collaborate closely with cross-functional teams, external vendors, and stakeholders to ensure the successful delivery of highly performant and reliable hardware solutions.

Responsibilities:

  • Develop and maintain Go and Python server management services
  • Collaborate with upstream communities, including Go and Redfish based projects
  • Document hardware automation workflows and processes
  • Create CI/CD pipelines for server hardware compliance tests
  • Develop and maintain hardware/firmware management services
  • Automate all aspects of the server hardware lifecycle
  • Serve as the senior point of contact for hardware escalation and troubleshooting
  • Collaborate with cross-functional teams to define hardware requirements, specifications, and system architecture
  • Create and maintain accurate documentation of hardware designs, specifications, test procedures, and results
  • Analyze and optimize the performance of hardware systems, identify bottlenecks, and propose improvements for enhanced efficiency
  • Establish processes for internal hardware testing, deployment, and performance optimization

Requirements:

  • Must have at least 2 years of profession experience:
  • Proficiency with Go and Python
  • Previous experience deploying containerized applications using Kubernetes
  • Excellent documentation skills and attention to detail
  • Strong analytical and problem-solving abilities

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $160,000-$185,000. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.

Hybrid Workplace

Successful candidates will be expected to attend onboarding training at our NJ Headquarters within their first several weeks of employment, with subsequent quarterly travel requirements of 1 week duration.

If you reside within a 30-mile radius of our New Jersey, New York, or Philadelphia offices, we're excited for you to join us at the office at least three times a week, recognizing the significance we place on fostering connections, collaboration, and creativity within our office culture. Our commitment to operating as a hybrid workplace underscores our dedication to enabling our employees to tailor their work-life balance to their individual preferences.

About this Role:

CoreWeave is seeking a highly skilled and motivated Infrastructure Engineer to join our Hardware Engineering Development team (METALDEV), reporting to the Director of Compute Architecture. In this role, you will play a crucial part in the development of the services that automate and test our server infrastructure. You will collaborate closely with cross-functional teams, external vendors, and stakeholders to ensure the successful delivery of highly performant and reliable hardware solutions.

Responsibilities:

  • Develop and maintain Go and Python server management services
  • Collaborate with upstream communities, including Go and Redfish based projects
  • Document hardware automation workflows and processes
  • Create CI/CD pipelines for server hardware compliance tests
  • Develop and maintain hardware/firmware management services
  • Automate all aspects of the server hardware lifecycle
  • Serve as the senior point of contact for hardware escalation and troubleshooting
  • Collaborate with cross-functional teams to define hardware requirements, specifications, and system architecture
  • Create and maintain accurate documentation of hardware designs, specifications, test procedures, and results
  • Analyze and optimize the performance of hardware systems, identify bottlenecks, and propose improvements for enhanced efficiency
  • Establish processes for internal hardware testing, deployment, and performance optimization

Requirements:

  • Must have at least 2 years of profession experience:
  • Proficiency with Go and Python
  • Previous experience deploying containerized applications using Kubernetes
  • Excellent documentation skills and attention to detail
  • Strong analytical and problem-solving abilities

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $160,000-$185,000. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.

Hybrid Workplace

Successful candidates will be expected to attend onboarding training at our NJ Headquarters within their first several weeks of employment, with subsequent quarterly travel requirements of 1 week duration.

If you reside within a 30-mile radius of our New Jersey, New York, or Philadelphia offices, we're excited for you to join us at the office at least three times a week, recognizing the significance we place on fostering connections, collaboration, and creativity within our office culture. Our commitment to operating as a hybrid workplace underscores our dedication to enabling our employees to tailor their work-life balance to their individual preferences.

Receive Tech Ladies'
newest jobs in your inbox,
every week.

Join Tech Ladies for full-access to the job board, member-only events, and more!

If you're already a member, we haven't forgotten you. We promise. It's a new system. If you fill out the form once, it'll remember you going forward. Apologies for the inconvenience.

Roseland
Roseland
No items found.
Engineering
Engineering
IT
IT
Hybrid
Hybrid