AI Dojo Databricks SRE/Support Engineer

Remote Full-time
As Databricks SRE and Support Engineer, you will work on operations related to AI Dojo (an AI/ML upskilling program on Databricks). This individual contributor (IC) role requires experience working on large-scale AI/ML platforms while guaranteeing their stability, reliability, scalability, and performance. Experience with modern infrastructure and DevOps tools and paradigms, as well as proven hands-on knowledge of Databricks, is a must.

Primary Responsibilities:
Continuous support: Provide continuous SRE support to thousands of geographically distributed users on the AI Dojo Databricks platform: respond to tickets, triage support requests, and liaise with customers.
Automation & DevOps: Improve existing Infrastructure as Code (IaC) according to DevOps best practices.
Systems Monitoring: Develop and maintain monitoring frameworks to respond promptly to outages and other service interruptions.
Security & Compliance: Collaborate with internal cybersecurity teams to ensure all systems and operations comply with industry standards and are secure against evolving threats.
Capacity Planning & Cost Optimization: Forecast and manage capacity requirements for the AI/ML training environment, while identifying opportunities to reduce costs without compromising performance.

Required Qualifications:
Bachelor’s degree in computer science, information technology, or a related field.
6+ years of infrastructure experience: Proven experience working on large-scale, cloud-based, enterprise-level software platforms and a deep understanding of the Databricks environment. In particular:
  Experience building GitHub Actions pipelines, including composite actions, OIDC federation for cloud provider identity acquisition, and use of environments and deployment controls.
  Experience building Databricks Asset Bundle and Terraform pipelines to manage and deploy Databricks Platform and Workspace resources.
  Fluency in Python, experience with the Databricks Python SDK to perform Workspace operations, and familiarity with PySpark and Delta Lake.
  Deep familiarity with Databricks APIs, and use of the Databricks CLI for provisioning Workspace identities and filesystem resources and for querying account- and workspace-level Users, Groups, and Service Principals.
Strong understanding of security best practices and experience ensuring compliance with relevant regulatory frameworks.
3+ years of practical experience with Infrastructure-as-Code and CI/CD tools such as Terraform, GitHub Actions, and the like.
3+ years of experience working in geographically distributed support teams.
