Job ID: JOB_ID_9479
Job Summary:
We are seeking an experienced Senior Generative AI Developer to design and implement AI solutions built on Retrieval-Augmented Generation (RAG). The ideal candidate has strong expertise in Python, FastAPI, and at least one major cloud platform (AWS, Azure, or GCP). The role requires a deep understanding of system architecture design, scalable APIs, and end-to-end AI solution development. You will architect and develop enterprise-scale Generative AI applications with a focus on scalability, security, and performance.
Key Responsibilities:
- Architect and develop enterprise-scale Generative AI applications using RAG frameworks.
- Design and implement robust system architectures for AI-driven platforms ensuring scalability, security, and performance.
- Build and optimize APIs using FastAPI for seamless integration with AI models and data pipelines.
- Collaborate with cross-functional teams to integrate AI solutions into existing systems and workflows.
- Implement data ingestion, preprocessing, and retrieval mechanisms for large-scale knowledge bases.
- Ensure compliance with best practices for cloud deployment on AWS, Azure, or GCP.
- Conduct performance tuning and optimization of AI models and APIs.
- Stay updated with the latest advancements in Generative AI, LLMs, and RAG methodologies.
- Work with containerization technologies like Docker and orchestration tools like Kubernetes.
- Implement CI/CD pipelines and DevOps practices.
Required Skills & Qualifications:
- 8+ years of professional experience in software development and system design.
- Strong proficiency in Python and experience with FastAPI for API development.
- Hands-on experience with Generative AI frameworks and RAG architectures.
- Solid understanding of system and architecture design principles for distributed applications.
- Experience deploying solutions on at least one major cloud platform (AWS, Azure, or GCP).
- Familiarity with vector databases, embedding models, and retrieval pipelines.
- Strong problem-solving skills and ability to work in a fast-paced environment.
Preferred Qualifications:
- Experience with LLM fine-tuning, prompt engineering, and model evaluation.
- Knowledge of containerization (Docker) and orchestration (Kubernetes).
- Exposure to CI/CD pipelines and DevOps practices.
Employment Type:
- Contract (Convertible to Hire after 6 months)
Locations:
- Dallas, TX
- Tampa, FL
- Jersey City, NJ
Work Model:
- Hybrid (3 days per week)
Special Requirements:
All visa types are accepted for this contract-to-hire (C2H) role. Candidates must take the BB test within 2 days of submission.
Compensation & Location:
Salary: $140,000 – $160,000 per year (Estimated)
Location: Dallas, TX
Recruiter / Company – Contact Information:
Recruiter / Employer: Metrix IT Solutions INC
Email: ur.g@metrixit.com