Software Engineer, Inference AI/ML

Remote Full-time
CoreWeave is The Essential Cloud for AI™, providing a platform for innovators to build and scale AI. The role involves joining the Inference team to implement features that enhance model serving on the GPU platform, focusing on improving latency, reliability, and cost. Responsibilities Implement well-scoped features and fixes in Python/Go/C++ for model-serving services (e.g., Triton, vLLM, TensorRT-LLM, Ray Serve) Write tests, code comments, and short design docs; participate in code reviews Add basic metrics and dashboards; assist with alarms and runbooks Follow on-call runbooks and learn incident response in a guided rotation Contribute to performance experiments (e.g., request batching, concurrency, caching) with guidance Skills BS/MS in CS, EE, or related field, or equivalent practical experience Foundations in data structures, algorithms, and networked services Experience with Python or Go (C++ a plus) and Linux fundamentals; Git/CI basics Exposure to containers and Kubernetes (coursework or projects welcome) Curiosity about GPU inference concepts (micro-batching, KV cache, streaming) Internship or project that deployed a microservice or ML inference demo Coursework/research with PyTorch or TensorFlow; simple CUDA projects a plus Familiarity with Grafana/Prometheus/OpenTelemetry or similar tooling Benefits Medical, dental, and vision insurance - 100% paid for by CoreWeave Company-paid Life Insurance Voluntary supplemental life insurance Short and long-term disability insurance Flexible Spending Account Health Savings Account Tuition Reimbursement Ability to Participate in Employee Stock Purchase Program (ESPP) Mental Wellness Benefits through Spring Health Family-Forming support provided by Carrot Paid Parental Leave Flexible, full-service childcare support with Kinside 401(k) with a generous employer match Flexible PTO Catered lunch each day in our office and data center locations A casual work environment A work culture focused on innovative disruption Company Overview CoreWeave is a cloud-based AI infrastructure company offering GPU cloud services to simplify AI and machine learning workloads. It was founded in 2017, and is headquartered in Livingston, New Jersey, USA, with a workforce of 1001-5000 employees. Its website is
Apply Now

Similar Opportunities

Accountant l

Remote Full-time

Associate Product Manager

Remote Full-time

[Remote] Laravel Full Stack Developer

Remote Full-time

OPS Clinician SBS

Remote Full-time

Account Manager / Outside Sales Representative - Virginia Beach, VA area

Remote Full-time

Project Assistant

Remote Full-time

Phoenix, AZ Account Executive - Bilingual Spanish

Remote Full-time

Associate Equipment Specialist - Solar (Traveler) | Mortenson

Remote Full-time

Project Coordinator

Remote Full-time

Social Video Editor

Remote Full-time

Partnership Sales Manager | Remote | AI SaaS Sales Role

Remote Full-time

Experienced Customer Service Representative – Hybrid Work Solution – arenaflex

Remote Full-time

**Experienced Part-Time Customer Service Chat Representative – Remote Work Opportunity with blithequark**

Remote Full-time

**Experienced Remote Customer Service Representative – Call Center Support for blithequark**

Remote Full-time

Experienced Customer Success Manager – Delivering Exceptional Customer Experiences through Strategic Relationship Building and Project Management

Remote Full-time

Sr. Software Engineer/Architect: WFM (Remote)

Remote Full-time

New Business Account Executive

Remote Full-time

Experienced Customer Service Representative – Full-Time Remote Opportunity with blithequark for $25/Hour

Remote Full-time

Senior Managing Counsel, Privacy & Cybersecurity (Americas)

Remote Full-time

Experienced Spanish Bilingual Customer Service Representative – Remote Contractor Role for Prestigious Clients with Flexible Scheduling and Competitive Pay

Remote Full-time
← Back to Home