[Remote] MLOps / LLM Engineer (Google Cloud Platform & Vertex AI)

Remote, USA Full-time
Note: The job is a remote job and is open to candidates in USA. Dice is the leading career destination for tech experts at every stage of their careers. Our client, Hexacorp, is seeking an MLOps / LLM Engineer with expertise in Google Cloud Platform and Vertex AI to design, deploy, and manage large-scale AI/ML solutions. The role involves building MLOps pipelines, optimizing large language models, and ensuring the production-grade deployment of advanced AI solutions. Responsibilities • Build and manage MLOps pipelines for training, evaluation, deployment, and monitoring of ML/LLM models using Vertex AI Pipelines. • Deploy, fine-tune, and optimize LLMs (PaLM, Gemini, BERT, Llama, GPT-based models) on Vertex AI / GKE. • Automate infrastructure provisioning using Terraform / Deployment Manager. • Implement CI/CD pipelines with Google Cloud Platform tools (Cloud Build, Artifact Registry, GitOps/ArgoCD). • Develop and manage feature stores, model registries, and monitoring solutions. • Optimize cost and performance for AI/ML workloads on Google Cloud Platform. • Implement observability (logging, monitoring, and alerting) for ML/LLM production systems. • Collaborate with Data Scientists, ML Engineers, and Cloud Architects to integrate AI solutions into enterprise systems. • Ensure security, governance, and compliance for LLM/AI workloads. Skills • 7+ years of experience in DevOps/MLOps/Cloud Engineering. • Hands-on expertise with Google Cloud Platform (IAM, VPC, GKE, BigQuery, Dataflow, Pub/Sub). • Strong experience with Vertex AI (training, endpoints, pipelines, feature store). • Proven experience with LLMs: fine-tuning, prompt engineering, serving APIs, and optimizing performance. • Proficiency in Python and ML frameworks (TensorFlow, PyTorch, Hugging Face, LangChain). • Strong knowledge of CI/CD pipelines and automation tools. • Experience with Kubernetes (GKE), Docker, Helm. • Knowledge of monitoring & observability tools (Prometheus, Grafana, Stackdriver). • Google Professional ML Engineer or Cloud Architect certification. • Prior experience with LangChain, RAG (Retrieval Augmented Generation), vector databases (Pinecone, FAISS, Vertex Matching Engine). • Experience in deploying GenAI applications on Google Cloud Platform. • Understanding of MLOps frameworks (Kubeflow, MLflow, TFX). Company Overview • Welcome to Jobs via Dice, the go-to destination for discovering the tech jobs you want. It was founded in undefined, and is headquartered in , with a workforce of 0-1 employees. Its website is Apply tot his job
Apply Now

Similar Jobs

Google Cloud Platform Architect

Remote, USA Full-time

Google Cloud Platform Infrastructure Engineer- W2 only

Remote, USA Full-time

Senior Google Cloud Computing Engineer

Remote, USA Full-time

Senior Google Cloud Specialist

Remote, USA Full-time

Cloud Platform Architect (AWS/Azure/Google Cloud Platform)

Remote, USA Full-time

Customer Engineer II, Google Cloud AI/ML

Remote, USA Full-time

Senior Google Cloud Practice Director | Remote Work | San Francisco, California, United States

Remote, USA Full-time

[Remote] SAP BW/4HANA/Google Cloud Architect

Remote, USA Full-time

ETL designer with Google Cloud Platform_Remote

Remote, USA Full-time

Remote DevSecOps (Google Cloud Platform)

Remote, USA Full-time

Student Writer & Content Creator

Remote, USA Full-time

**Experienced Bilingual Korean Customer Support Specialist – Delivering Exceptional Chat and Email Experiences**

Remote, USA Full-time

Content Creator Job at Publicis Groupe Holdings B.V in Los Angeles

Remote, USA Full-time

**Experienced Online Chat Assistant – Customer Service Representative – Remote Opportunity at arenaflex**

Remote, USA Full-time

Delivery Manager

Remote, USA Full-time

Associate Medical Director

Remote, USA Full-time

Experienced Remote Data Entry Specialist – Join the Magical World of Disney as a Data Entry Professional with Opportunities for Growth and Development

Remote, USA Full-time

Cyber Threat & Response Engineer (L2)

Remote, USA Full-time

Hybrid Dosimetrist I, Radiation Oncology, Helen F. Graham Center, Newark, DE (hybrid)

Remote, USA Full-time

AVP, Asset Management – Multifamily

Remote, USA Full-time
Back to Home