NewVision logo

Gen AI Engineer

NewVision
3 days ago
Full-time
Remote
United States
AI Services

Role: Gen AI Jr role

Location: USA

Client: Deloitte

 

About the Role

 

We are looking for a passionate Gen AI Junior Engineer to join our growing AI team. This role is ideal for someone eager to build real-world applications powered by Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and agentic AI systems.

 

You will work on designing, developing, and optimizing AI-driven applications using modern GenAI frameworks, APIs, and cloud-based AI services such as Azure AI Foundry and OpenAI technologies.

 

Key Responsibilities

Develop and deploy LLM-powered applications using Python.

Design and implement Agentic AI workflows (multi-step reasoning, tool usage, memory handling).

Build and optimize Prompt Engineering strategies for accuracy, cost-efficiency, and reliability.

Manage token usage and cost optimization for production-grade AI systems.

Implement RAG (Retrieval-Augmented Generation) pipelines using vector databases.

Work with embeddings, chunking strategies, and semantic search.

Integrate AI models using REST APIs and SDKs.

Build and test AI services using Azure OpenAI Service.

Develop proof-of-concepts using frameworks like:

LangChain

Semantic Kernel

Model Context Protocol

Work with vector databases (e.g., Azure AI Search, Pinecone, etc.).

Collaborate with frontend teams to integrate AI into UI frameworks (React/Streamlit/Gradio).

Continuously evaluate and experiment with emerging GenAI tools and frameworks.

 

Required Skills & Qualifications

Bachelor’s degree in computer science, AI, Data Science, or related field.

Strong proficiency in Python.

Understanding of:

Large Language Models (LLMs)

Transformers basics

Embeddings and semantic similarity

Tokenization concepts

Experience working with APIs (REST, JSON).

 

Familiarity with:

LangChain or Semantic Kernel

Azure OpenAI Service or OpenAI APIs

Vector databases (Azure AI Search, Pinecone, FAISS, etc.)

Basic knowledge of RAG architecture and prompt optimization.

Understanding of token cost management and inference optimization.

Strong problem-solving and debugging skills.

Eagerness to learn and experiment with GenAI ecosystems.

 

Nice-to-Have

Exposure to multi-agent systems and agent orchestration.

Knowledge of fine-tuning techniques.

Experience with UI frameworks (React, Streamlit, Gradio).

Familiarity with cloud environments (Azure preferred).

Basic DevOps knowledge (Docker, CI/CD).

 

What You’ll Gain

Hands-on experience building production-grade AI systems.

Exposure to cutting-edge Agentic AI architectures.

Opportunity to work with enterprise AI platforms like Azure AI Foundry.

Mentorship and learning opportunities in advanced LLM systems.

A fast-paced innovation-driven environment.