NEWPosted 3 hours ago

Job ID: JOB_ID_3777

Job Summary:

We are seeking a highly skilled and experienced Generative AI Engineer with over 15 years of professional experience to design, develop, and deploy cutting-edge AI solutions. The ideal candidate will possess a strong Python background and hands-on experience with Large Language Models (LLMs), prompt engineering, Gen AI frameworks, and building scalable AI applications. Experience in developing Agentic AI solutions is crucial.

Key Responsibilities:

Design, develop, and implement advanced Generative AI models for text, image, or multimodal applications.
Develop and refine prompt engineering strategies and embedding-based retrieval systems (RAG).
Integrate Gen AI capabilities into web applications and enterprise workflows, ensuring scalability and performance.
Build and deploy agentic AI applications using context engineering and multi-agent orchestration tools.
Develop and maintain MLOps/LLMOps pipelines for CI/CD automation and model deployment.
Collaborate with cross-functional teams to define AI project requirements and deliverables.
Troubleshoot and optimize AI models and applications for performance and efficiency.
Stay abreast of the latest advancements in AI, ML, Gen AI, and related technologies.
Contribute to the architectural design of AI solutions, ensuring robustness and security.
Develop microservices and API endpoints for AI model serving using FastAPI, Docker, and Kubernetes.

Must-Have Skills:

Overall 15+ years of professional experience.
10+ years of hands-on experience in AI, Data Science, ML, and GEN AI.
Strong hands-on experience designing and deploying Retrieval-Augmented Generation (RAG) pipelines.
Extensive experience with LangChain, LangGraph, CrewAI, and multi-agent orchestration.
Strong MLOps/LLMOps experience with CI/CD automation.
Experience with cloud AI services on AWS (SageMaker, Lambda, EKS, S3) and GCP (Vertex AI).
API & microservices development using FastAPI, REST, Docker, Kubernetes.
Strong Python proficiency with PyTorch / TensorFlow.
Strong hands-on experience with vector databases (Pinecone, FAISS, ChromaDB) and embedding lifecycle management.
Experience in developing microservices and API development using FastAPI, REST APIs, Pydantic/JSON schemas, Docker, and Kubernetes for low-latency serving.
Strong proficiency in Python and AI/ML frameworks (PyTorch, TensorFlow).
Hands-on experience using session and memory for building multi-agent systems along with using MCP tools.
Hands-on experience with LLMs, transformers, and Hugging Face ecosystem.
Knowledge and experience with vector databases and RAG technique for semantic search.
Familiarity with cloud AI services (AWS SageMaker, Azure OpenAI, GCP Vertex AI).
Understanding of MLOps practices for scalable AI deployment.
Strong experience in working with LLM fine-tuning with LoRA, QLoRA, PEFT.
Strong experience in Architected advanced RAG systems using Pinecone, FAISS, Weaviate, Chroma, hybrid retrieval, and custom embeddings.
Strong experience in Designing end-to-end LLMOps/MLOps pipelines using MLflow, DVC, SageMaker Pipelines, Vertex AI Pipelines, and GitHub Actions.
Experience in using cloud-native AI systems on AWS (SageMaker, Lambda, EKS, EC2, Step Functions, S3, Glue) and GCP Vertex AI, supporting high-volume inference and secure enterprise operations.
Experience in developing multi-agent orchestration workflows using LangGraph and CrewAI for tool-calling, validation agents, automated reasoning, and workflow supervision.

Nice-to-Have Skills:

GCP
Prompt Engineering

Work Location & Reporting Address:

Irving, TX 75039 (Irving, Dallas, TX or Charlotte, NC. Onsite-Hybrid. Will consider candidates willing to relocate to clients location)

Contract Duration:

12 Months

Candidate Requirements:

Must be USC or GC.
Do not send FAKE GC candidates.
Candidates must be willing to relocate to the client’s location if not already local.
Provide basic details with resume: Name, location, Zip code, willingness to relocate, USC or GC status, Rate on C2C, Year came to USA & on what visa, Education with year of pass out.

Special Requirements

USC or GC only. Do not send FAKE GC candidates. Candidates willing to relocate to client's location. Onsite-Hybrid role.

Compensation & Location

Salary: $70 – $90 per year

Location: Irving, TX

Recruiter / Company – Contact Information

Email: ck@efulgent.net

Interested in this position?
Apply via Email

Recruiter Notice:
To remove this job posting, please send an email from
ck@efulgent.net with the subject:

DELETE_JOB_ID_3777

to delete@join-this.com.