Posted on
Sep 26, 2024
Senior Infrastructure Engineer, Metal Dev
Roseland
Mid-Senior ICs
Engineering, IT
CoreWeave
CoreWeave is a specialized cloud provider focused on GPU-accelerated use cases including VFX, AI/ML, Batch Processing and Real Time Experiences. We support countless AI/ML services in the text-to-image, NLP and broader AI/ML space, reducing clients' infrastructure management requirements with our Kubernetes-based serverless GPU cloud offerings.
Job Description
About this Role:
CoreWeave is seeking a highly skilled and motivated Infrastructure Engineer to join our Hardware Engineering Development team (METALDEV), reporting to the Director of Compute Architecture. In this role, you will play a crucial part in the development of the services that automate and test our server infrastructure. You will collaborate closely with cross-functional teams, external vendors, and stakeholders to ensure the successful delivery of highly performant and reliable hardware solutions.
Responsibilities:
- Develop and maintain Go and Python server management services
- Collaborate with upstream communities, including Go and Redfish-based projects
- Document hardware automation workflows and processes
- Create CI/CD pipelines for server hardware compliance tests
- Develop and maintain hardware/firmware management services
- Automate all aspects of the server hardware lifecycle
- Serve as the senior point of contact for hardware escalation and troubleshooting
- Collaborate with cross-functional teams to define hardware requirements, specifications, and system architecture
- Create and maintain accurate documentation of hardware designs, specifications, test procedures, and results
- Analyze and optimize the performance of hardware systems, identify bottlenecks, and propose improvements for enhanced efficiency
- Establish processes for internal hardware testing, deployment, and performance optimization
Requirements:
- Must have at least 5 years of professional experience
- Proficiency with Go and Python
- Previous experience deploying containerized applications using Kubernetes
- Excellent documentation skills and attention to detail
- Strong analytical and problem-solving abilities
Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $175,000 to $210,000. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.