[Pfizer] Senior Manager, AI Infrastructure and Operations

Remote Full-time
ROLE SUMMARY The Sr. Manager/Staff Engineer, AI Infrastructure & MLOps Engineering is a senior technical leader responsible for architecting, building, and scaling Pfizer’s AI infrastructure and developer platforms. This role leverages extensive experience in cloud engineering, DevOps, and MLOps to deliver robust, high-performance solutions supporting advanced AI/ML workloads in biotechnology, healthcare, and enterprise technology. The successful candidate will drive innovation in automation, reliability, and scalability, enabling scientists and engineers to rapidly develop, deploy, and monitor machine learning models in production environments. ROLE RESPONSIBILITIES Platform Architecture & Engineering • Design, implement, and own large-scale cloud-based HPC and MLOps platforms supporting AI model training, genomic sequencing, and precision medicine. • Architect multi-environment clusters (AWS, GCP, Azure), enabling GPU/FPGA workloads and advanced observability. • Lead the development of developer and cloud platforms, including internal engineering accelerators and reusable toolsets. Platform Catalog & Developer Experience • Design, implement, and manage unified platform catalogs using Backstage, enhancing developer experience and application metadata management. • Develop custom plugins and APIs for Backstage to support internal engineering workflows and documentation. Automation & DevOps Excellence • Build and maintain Python-based automation frameworks, CI/CD pipelines, and Infrastructure-as-Code (Terraform, Helm, Pulumi, AWS CDK). • Operationalize containerized solutions using Docker and Kubernetes, integrating MLflow, Kubeflow, and other orchestration platforms. • Implement robust automation for provisioning, configuring, and managing cloud resources across multiple environments. MLOps & Reliability Engineering • Lead the implementation of Service Level Indicators (SLIs), Service Level Objectives (SLOs), and advanced observability (Prometheus, Grafana, PagerDuty). • Develop and maintain APIs and services for model management, feature stores, and inference pipelines. • Operationalize ML model serving at scale using frameworks such as TensorFlow Serving, TorchServe, KServe, and Seldon Core. • Ensure compliance with industry standards (e.g., HIPAA, FDA) for data protection and reliability. Collaboration & Leadership • Mentor engineers and lead cross-functional teams to deliver integrated solutions. • Champion engineering excellence through design documentation, code reviews, and testing automation. • Present at industry summits, author technical proposals, and contribute to open-source projects (Kubernetes, Helm, Go, Envoy). Continuous Improvement • Drive agile delivery, sprint planning, and performance optimization. • Lead incident response and disaster recovery initiatives for mission-critical platforms. • Foster a culture of shared ownership, transparency, and innovation BASIC QUALIFICATIONS • 8+ years of hands-on software engineering experience in cloud infrastructure, DevOps, and MLOps. • Deep expertise in Python, Kubernetes, Terraform, Helm, and CI/CD pipeline development. • Proven experience architecting and operating containerized solutions on AWS, GCP, and Azure. • Strong knowledge of Infrastructure-as-Code, distributed systems, and production system reliability. • Bachelor’s or Master’s degree in Computer Science, Engineering, or related field. PREFERRED QUALIFICATIONS • Expertise in AWS cloud services (EC2, S3, Lambda, EKS, SageMaker, API Gateway, CloudFormation, IAM, etc.). • Experience deploying and customizing Backstage as a unified catalog for teams, services, and technical documentation. • Experience building and deploying microservices and REST/gRPC APIs for AI model delivery. • Familiarity with MLflow, Kubeflow, and other MLOps orchestration platforms. • Proficiency with model serving frameworks (TensorFlow Serving, TorchServe, KServe, Seldon Core, BentoML, etc.). Work Location Assignment: Remote Pfizer is an equal opportunity employer and complies with all applicable equal employment opportunity legislation in each jurisdiction in which it operates. Information & Business Tech Apply tot his job
Apply Now

Similar Opportunities

CRM Practice Growth Strategist

Remote Full-time

Mental Health Coach - Part-time (W2)

Remote Full-time

Client Growth Strategist - Base + Commission - Remote

Remote Full-time

Growth Marketing Specialist

Remote Full-time

Remote Growth Strategist

Remote Full-time

Experienced Customer Success Manager - Digital Transformation and Growth Strategist

Remote Full-time

[Remote] Senior Revenue Integrity Charge Analyst- Cardiac

Remote Full-time

Senior Clinical Implementation Analyst job at HCA - Hospital Corporation of America in Nashville, TN

Remote Full-time

Sr EHR Analyst Education WFH

Remote Full-time

Manager, Data Science - Core TV & Interactive Discovery

Remote Full-time

Experienced Part-Time Data Entry Associate - Flexible Work from Home Opportunity for Beginners in Digital Administration with blithequark

Remote Full-time

Experienced Retail Seasonal Sales Associate – Palm Court at Bath & Body Works, Delivering Exceptional Customer Experiences and Driving Sales Growth

Remote Full-time

Part-Time Remote Data Entry Clerk at blithequark – Flexible Schedule, Competitive Hourly Rate

Remote Full-time

**Experienced Business Assessment Lead – Data-Driven Client Support Transformation**

Remote Full-time

Senior Learning Skills Coach – 25/hr-40/hr – Remote in Boulder, CO

Remote Full-time

Chat Support Agent Jobs Hiring Immediately – Remote, No Phone Calls | $25–$35/hr

Remote Full-time

Experienced Online Live Chat Assistant – Customer Service and Communication Entry-Level Role with Immediate Start at Blithequark

Remote Full-time

Regional Director, Senior Enterprise - West Region

Remote Full-time

Aetna Work From Home Customer Service Representative (Entry Level) -US

Remote Full-time

**Experienced Data Entry Specialist – Remote Work Opportunities at arenaflex**

Remote Full-time
← Back to Home