Job ID: JOB_ID_3777

Job Summary:

We are seeking a highly skilled and experienced Generative AI Engineer with over 15 years of professional experience to design, develop, and deploy cutting-edge AI solutions. The ideal candidate will possess a strong Python background and hands-on experience with Large Language Models (LLMs), prompt engineering, Gen AI frameworks, and building scalable AI applications. Experience in developing Agentic AI solutions is crucial.

Key Responsibilities:

  • Design, develop, and implement advanced Generative AI models for text, image, or multimodal applications.
  • Develop and refine prompt engineering strategies and embedding-based retrieval systems (RAG).
  • Integrate Gen AI capabilities into web applications and enterprise workflows, ensuring scalability and performance.
  • Build and deploy agentic AI applications using context engineering and multi-agent orchestration tools.
  • Develop and maintain MLOps/LLMOps pipelines for CI/CD automation and model deployment.
  • Collaborate with cross-functional teams to define AI project requirements and deliverables.
  • Troubleshoot and optimize AI models and applications for performance and efficiency.
  • Stay abreast of the latest advancements in AI, ML, Gen AI, and related technologies.
  • Contribute to the architectural design of AI solutions, ensuring robustness and security.
  • Develop microservices and API endpoints for AI model serving using FastAPI, Docker, and Kubernetes.

Must-Have Skills:

  • Overall 15+ years of professional experience.
  • 10+ years of hands-on experience in AI, Data Science, ML, and Gen AI.
  • Strong hands-on experience designing and deploying Retrieval-Augmented Generation (RAG) pipelines.
  • Extensive experience with LangChain, LangGraph, CrewAI, and multi-agent orchestration.
  • Strong MLOps/LLMOps experience with CI/CD automation.
  • Experience with cloud AI services on AWS (SageMaker, Lambda, EKS, S3) and GCP (Vertex AI).
  • Strong hands-on experience with vector databases (Pinecone, FAISS, ChromaDB) and embedding lifecycle management.
  • Experience developing microservices and APIs using FastAPI, REST, Pydantic/JSON schemas, Docker, and Kubernetes for low-latency serving.
  • Strong proficiency in Python and AI/ML frameworks (PyTorch, TensorFlow).
  • Hands-on experience using session and memory management to build multi-agent systems, along with MCP tools.
  • Hands-on experience with LLMs, transformers, and Hugging Face ecosystem.
  • Knowledge of and experience with vector databases and RAG techniques for semantic search.
  • Familiarity with cloud AI services (AWS SageMaker, Azure OpenAI, GCP Vertex AI).
  • Understanding of MLOps practices for scalable AI deployment.
  • Strong experience fine-tuning LLMs with LoRA, QLoRA, and PEFT.
  • Strong experience architecting advanced RAG systems using Pinecone, FAISS, Weaviate, Chroma, hybrid retrieval, and custom embeddings.
  • Strong experience designing end-to-end LLMOps/MLOps pipelines using MLflow, DVC, SageMaker Pipelines, Vertex AI Pipelines, and GitHub Actions.
  • Experience with cloud-native AI systems on AWS (SageMaker, Lambda, EKS, EC2, Step Functions, S3, Glue) and GCP Vertex AI, supporting high-volume inference and secure enterprise operations.
  • Experience in developing multi-agent orchestration workflows using LangGraph and CrewAI for tool-calling, validation agents, automated reasoning, and workflow supervision.

Nice-to-Have Skills:

  • GCP
  • Prompt Engineering

Work Location & Reporting Address:

Irving, TX 75039 (Irving or Dallas, TX, or Charlotte, NC). Onsite-hybrid. Candidates willing to relocate to the client's location will be considered.

Contract Duration:

12 Months

Candidate Requirements:

  • Must be a U.S. citizen (USC) or Green Card (GC) holder.
  • Do not send FAKE GC candidates.
  • Candidates must be willing to relocate to the client’s location if not already local.
  • Provide basic details with the resume: name, location, ZIP code, willingness to relocate, USC or GC status, C2C rate, year of arrival in the USA and visa type at entry, and education with year of graduation.


Compensation & Location

Salary: $70 – $90 per year

Location: Irving, TX


Recruiter / Company – Contact Information

Email: ck@efulgent.net

