Senior Deep Learning Engineer – Autonomous Vehicles

Remote Full-time
Job Description: • Crafting, scaling, and hardening deep learning infrastructure libraries and frameworks for training on multi-thousand GPU clusters. • Improving efficiency throughout the training stack: data loaders, distributed training, scheduling, and performance monitoring. • Building robust training pipelines and libraries to handle massive video datasets and enable rapid experimentation. • Collaborating with researchers, model engineers, and internal platform teams to enhance efficiency, minimize stalls, and improve training availability. • Owning core infrastructure components such as orchestration libraries, distributed training frameworks, and fault-resilient training systems. • Partnering with leadership to ensure infrastructure scales with growing GPU capacity and dataset size while maintaining developer efficiency and stability. Requirements: • BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, or a related field, or equivalent experience. • 12+ years of professional experience building and scaling high-performance distributed systems, ideally in ML, HPC, or large-scale data infrastructure. • Extensive knowledge in deep learning frameworks (PyTorch is preferred), large scale training (DDP/FSDP, NCCL, tensor/pipeline parallelism), and performance profiling. • Strong systems background: datacenter networking (RoCE, IB), parallel filesystems (Lustre), storage systems, schedulers (Slurm, Kubernetes, etc.). • Proficiency in Python and C++, with experience writing production-grade libraries, orchestration layers, and automation tools. • Ability to work closely with multi-functional teams (ML researchers, infra engineers, product leads) and translate requirements into robust systems. Benefits: • equity • benefits Apply tot his job
Apply Now

Similar Opportunities

[Hiring] Machine Learning Scientist - Digital Pathology and Medicine @Memorial Sloan Kettering Cancer Center

Remote Full-time

Research Engineer/Research Scientist, RL/Reasoning

Remote Full-time

Staff Machine Learning Research Scientist, LLM Evals

Remote Full-time

AI Engineer / Scientist — Computer Vision & Deep Learning (Health Technology) - Contract to Hire

Remote Full-time

Senior Manager, Data Scientist (Machine Learning)

Remote Full-time

Co-op, Aviation Management - FAA/DOT Disabilities Compliance Programs (Summer 2026)

Remote Full-time

Ramp Agent (Customer Service Agent) - FLL

Remote Full-time

Rochester,NY:Delta Airlines Flight Attendant Needed(Full-time)

Remote Full-time

Specialist, Reservations Forecasting and Insights

Remote Full-time

Delta Vacations Program Manager, Email Marketing and Advertising

Remote Full-time

**Experienced Full Stack Customer Success Advocate – Live Chat Support & Digital Experience Enhancement**

Remote Full-time

Entry Level Customer Service Representative – Remote Part-Time Opportunity for Exceptional Support Professionals to Join blithequark's Dynamic Team

Remote Full-time

**Experienced Customer Service Representative – Healthcare – Join the blithequark Team**

Remote Full-time

[Remote] Associate SOC Analyst

Remote Full-time

Customer Success Manager

Remote Full-time

Mentor A Promise is hiring: Blog Writer (Volunteer) in New York

Remote Full-time

**Experienced Data Entry and Administrative Data Clerk – Supporting Excellence in Home Care Services**

Remote Full-time

**Experienced Full Stack Customer Service Representative – US Government and Enterprise Client Support**

Remote Full-time

**Experienced Full Stack Distributed Systems Engineer – Web & Cloud Application Development**

Remote Full-time

Manager, Client Development Affiliate Marketing, Capital One Ad Solutions (Remote)

Remote Full-time
← Back to Home