Machine Learning Engineering Manager – LLM Serving, Infrastructure

Remote, USA Full-time
• Lead a high-performing engineering team to develop, build, and deploy a high-scale, low-latency LLM serving infrastructure.
• Drive the implementation of a unified serving layer to support multiple LLMs and inference types (batch, offline evaluation flows, and real-time/streaming).
• Lead all aspects of the development of the Model Registry for deploying, versioning, and running LLMs across production environments.
• Ensure successful integration with the core Personalization and Recommendation systems to deliver LLM-powered features.
• Define and champion standardized technical interfaces and protocols for efficient model deployment and scaling.
• Establish and monitor the serving infrastructure's performance, cost, and reliability, including load balancing, autoscaling, and failure recovery.
• Collaborate closely with data science, machine learning research, and feature teams (Autoplay, Home, Search, etc.) to drive active adoption of the serving infrastructure.
• Scale up the serving architecture to handle hundreds of millions of users and high-volume inference requests for internal domain-specific LLMs.
• Drive latency and cost optimization: partner with SRE and ML teams to implement techniques like quantization, pruning, and efficient batching to minimize serving latency and cloud compute costs.
• Develop observability and monitoring: build dashboards and alerting for service health, tracing, A/B test traffic, and latency trends to ensure adherence to defined SLAs.
• Contribute to core LPM serving: focus on the technical strategy for deploying and maintaining the core Large Personalization Model (LPM).