Job Description


Job Overview

  • Job ID:

    J53176

  • Job Title:

    AI/ML Tech Lead with Gen AI

  • Location:

    Rosemead, CA

  • Duration:

    22 Months + Extension

  • Hourly Rate:

    Depending on Experience (DOE)

  • Work Authorization:

    US Citizen, Green Card, OPT-EAD, CPT, H-1B,
    H4-EAD, L2-EAD, GC-EAD

  • Client:

    To Be Discussed Later

  • Employment Type:

    W-2, 1099, C2C

Job Description:

Design and deploy scalable, secure, and reliable LLM-based solutions, implementing Retrieval-Augmented Generation (RAG), model fine-tuning, and agentic workflows to solve complex business problems.

Key Responsibilities

  • Design, build, and maintain production-grade Generative AI applications and APIs on GCP, focusing on Gemini models, RAG architectures, and vector databases.
  • Develop automated MLOps pipelines (training, evaluation, monitoring, deployment) using Vertex AI, Kubeflow, Cloud Build, and Terraform.
  • Implement techniques to enhance AI model performance, including fine-tuning, quantization (e.g., GPTQ, AWQ), and prompt engineering to improve accuracy and reduce latency.
  • Optimize GCP resources for high-performance computing, ensuring scalability, cost-efficiency, and security (IAM, VPC).
  • Partner with Data Science, Data Engineering, and Product teams to translate business requirements into technical AI/ML roadmaps.
  • Ensure compliance with data privacy and security regulations (e.g., HIPAA and GDPR, where applicable) and with ethical AI standards.



Required Qualifications

  • 5-8+ years of industry experience in Machine Learning, including at least 3 years of hands-on experience building and deploying Generative AI models and LLMs in a production environment.
  • Proven experience with Google Cloud Platform (GCP) and its AI suite (Vertex AI, BigQuery, Dataflow, Cloud Run).
  • Strong expertise in Python and standard data science libraries (scikit-learn, TensorFlow, PyTorch).
  • Hands-on experience with LLM frameworks and tooling such as LangChain, LlamaIndex, or Hugging Face.
  • Strong understanding of SQL and unstructured data management.
  • Familiarity with Docker, Kubernetes (GKE), and CI/CD tools.



Preferred Qualifications

  • Experience with multi-agent systems and orchestration (e.g., LangGraph, AutoGen).
  • Deep knowledge of Vector Databases (e.g., Vertex AI Vector Search, Pinecone, Chroma).
  • Google Cloud Professional Machine Learning Engineer certification.
  • Demonstrated experience leading team projects and mentoring junior engineers.

Equal Opportunity Employer

MACHINE LEARNING TECHNOLOGIES LLC is an equal opportunity employer inclusive of women, minorities, individuals with disabilities, and veterans (M/F/D/V). Hiring, promotion, transfer, compensation, benefits, discipline, termination and all other employment decisions are made without regard to race, color, religion, sex, sexual orientation, gender identity, age, disability, national origin, citizenship/immigration status, veteran status or any other protected status. MACHINE LEARNING TECHNOLOGIES LLC will not make any posting or employment decision that does not comply with applicable laws relating to labor and employment, equal opportunity, employment eligibility requirements or related matters. Nor will MACHINE LEARNING TECHNOLOGIES LLC require in a posting or otherwise U.S. citizenship or lawful permanent residency in the U.S. as a condition of employment except as necessary to comply with law, regulation, executive order, or federal, state, or local government contract.