Posted 3 hours ago

Job ID: JOB_ID_9479

Job Summary:

We are seeking an experienced Senior Generative AI Developer to design and implement cutting-edge AI solutions leveraging Retrieval-Augmented Generation (RAG) techniques. The ideal candidate will have strong expertise in Python programming, FastAPI, and cloud platforms (AWS, Azure, or GCP). This role requires a deep understanding of system architecture design, scalable APIs, and end-to-end AI solution development. You will be responsible for architecting and developing Generative AI applications for enterprise-scale solutions, ensuring scalability, security, and performance.

Key Responsibilities:

  • Architect and develop Generative AI applications using RAG frameworks for enterprise-scale solutions.
  • Design and implement robust system architectures for AI-driven platforms ensuring scalability, security, and performance.
  • Build and optimize APIs using FastAPI for seamless integration with AI models and data pipelines.
  • Collaborate with cross-functional teams to integrate AI solutions into existing systems and workflows.
  • Implement data ingestion, preprocessing, and retrieval mechanisms for large-scale knowledge bases.
  • Ensure compliance with best practices for cloud deployment on AWS, Azure, or GCP.
  • Conduct performance tuning and optimization of AI models and APIs.
  • Stay updated with the latest advancements in Generative AI, LLMs, and RAG methodologies.
  • Work with containerization technologies like Docker and orchestration tools like Kubernetes.
  • Implement CI/CD pipelines and DevOps practices.

Required Skills & Qualifications:

  • 8+ years of professional experience in software development and system design.
  • Strong proficiency in Python and experience with FastAPI for API development.
  • Hands-on experience with Generative AI frameworks and RAG architectures.
  • Solid understanding of system and architecture design principles for distributed applications.
  • Experience deploying solutions on any major cloud platform (AWS, Azure, GCP).
  • Familiarity with vector databases, embedding models, and retrieval pipelines.
  • Strong problem-solving skills and ability to work in a fast-paced environment.

Preferred Qualifications:

  • Experience with LLM fine-tuning, prompt engineering, and model evaluation.
  • Knowledge of containerization (Docker) and orchestration (Kubernetes).
  • Exposure to CI/CD pipelines and DevOps practices.

Employment Type:

  • Contract (Convertible to Hire after 6 months)

Locations:

  • Dallas, TX
  • Tampa, FL
  • Jersey City, NJ

Work Model:

  • Hybrid, 3 days per week

Special Requirements:

All visa types are accepted for this C2H (Contract to Hire) role. Candidates should take the BB test within 2 days of submission.


Compensation & Location

Salary: $140,000 – $160,000 per year (Estimated)

Location: Dallas, TX


Recruiter / Company – Contact Information

Recruiter / Employer: Metrix IT Solutions INC

Email: ur.g@metrixit.com

