Job ID: JOB_ID_3777
Job Summary:
We are seeking a highly skilled and experienced Generative AI Engineer with over 15 years of professional experience to design, develop, and deploy cutting-edge AI solutions. The ideal candidate will possess a strong Python background and hands-on experience with Large Language Models (LLMs), prompt engineering, Gen AI frameworks, and building scalable AI applications. Experience in developing Agentic AI solutions is crucial.
Key Responsibilities:
- Design, develop, and implement advanced Generative AI models for text, image, or multimodal applications.
- Develop and refine prompt engineering strategies and embedding-based retrieval systems (RAG).
- Integrate Gen AI capabilities into web applications and enterprise workflows, ensuring scalability and performance.
- Build and deploy agentic AI applications using context engineering and multi-agent orchestration tools.
- Develop and maintain MLOps/LLMOps pipelines for CI/CD automation and model deployment.
- Collaborate with cross-functional teams to define AI project requirements and deliverables.
- Troubleshoot and optimize AI models and applications for performance and efficiency.
- Stay abreast of the latest advancements in AI, ML, Gen AI, and related technologies.
- Contribute to the architectural design of AI solutions, ensuring robustness and security.
- Develop microservices and API endpoints for AI model serving using FastAPI, Docker, and Kubernetes.
Must-Have Skills:
- Overall 15+ years of professional experience.
- 10+ years of hands-on experience in AI, Data Science, ML, and Gen AI.
- Strong hands-on experience designing and deploying Retrieval-Augmented Generation (RAG) pipelines.
- Extensive experience with LangChain, LangGraph, CrewAI, and multi-agent orchestration.
- Strong MLOps/LLMOps experience with CI/CD automation.
- Experience with cloud AI services on AWS (SageMaker, Lambda, EKS, S3) and GCP (Vertex AI).
- API & microservices development using FastAPI, REST, Docker, Kubernetes.
- Strong Python proficiency with PyTorch / TensorFlow.
- Strong hands-on experience with vector databases (Pinecone, FAISS, ChromaDB) and embedding lifecycle management.
- Experience developing microservices and APIs using FastAPI, REST, Pydantic/JSON schemas, Docker, and Kubernetes for low-latency serving.
- Strong proficiency in Python and AI/ML frameworks (PyTorch, TensorFlow).
- Hands-on experience using session and memory management to build multi-agent systems, along with MCP tools.
- Hands-on experience with LLMs, transformers, and Hugging Face ecosystem.
- Knowledge of and experience with vector databases and RAG techniques for semantic search.
- Familiarity with cloud AI services (AWS SageMaker, Azure OpenAI, GCP Vertex AI).
- Understanding of MLOps practices for scalable AI deployment.
- Strong experience fine-tuning LLMs with LoRA, QLoRA, and PEFT.
- Strong experience architecting advanced RAG systems using Pinecone, FAISS, Weaviate, Chroma, hybrid retrieval, and custom embeddings.
- Strong experience designing end-to-end LLMOps/MLOps pipelines using MLflow, DVC, SageMaker Pipelines, Vertex AI Pipelines, and GitHub Actions.
- Experience in using cloud-native AI systems on AWS (SageMaker, Lambda, EKS, EC2, Step Functions, S3, Glue) and GCP Vertex AI, supporting high-volume inference and secure enterprise operations.
- Experience in developing multi-agent orchestration workflows using LangGraph and CrewAI for tool-calling, validation agents, automated reasoning, and workflow supervision.
Nice-to-Have Skills:
- GCP
- Prompt Engineering
Work Location & Reporting Address:
Irving, TX 75039 (Irving, Dallas, TX or Charlotte, NC). Onsite-hybrid. Candidates willing to relocate to the client's location will be considered.
Contract Duration:
12 Months
Candidate Requirements:
- Must be a U.S. citizen (USC) or Green Card (GC) holder.
- Do not send FAKE GC candidates.
- Candidates must be willing to relocate to the client’s location if not already local.
- Provide basic details with the resume: name, location, ZIP code, willingness to relocate, USC or GC status, C2C rate, year of arrival in the USA and visa type at entry, and education with year of graduation.
Special Requirements
USC or GC only. Do not send FAKE GC candidates. Candidates willing to relocate to client's location. Onsite-Hybrid role.
Compensation & Location
Salary: $70 – $90 per hour
Location: Irving, TX
Recruiter / Company – Contact Information
Email: ck@efulgent.net