Description
Gramian Consultancy is a boutique consultancy specializing in IT professional services and engineering talent solutions. With a strong background in software engineering and leadership, we help companies build high-performing teams by matching them with professionals who truly fit their needs.
Role Overview
We are looking for a Lead AI Engineer to lead the design, delivery, and scaling of production-grade AI systems powering real-world products and workflows.
This role sits at the intersection of engineering leadership, applied machine learning, and large-scale AI platform delivery, owning both team growth and technical strategy. You will work cross-functionally with product, data, and business stakeholders to turn cutting-edge AI capabilities into reliable, enterprise-ready solutions.
This is a high-impact leadership role for someone who combines hands-on understanding of modern LLM ecosystems with a strong track record of building and scaling engineering organizations.
Commitment required: 8 hours per day, including 4 hours of overlap with PST.
Employment type: Contractor assignment (no medical/paid leave); 100% REMOTE
Duration of contract: 12+ months (long-term)
Locations: LATAM
Responsibilities
- Lead and scale engineering teams delivering production AI systems and platforms
- Drive architecture and implementation of LLM-powered applications, including RAG, embeddings, fine-tuning, and evaluation pipelines
- Establish best practices for LLM testing, benchmarking, grounding, hallucination detection, and performance validation
- Partner with product and business leaders to translate AI opportunities into scalable technical solutions
- Oversee deployment and lifecycle management of AI models in cloud production environments
- Guide development of robust data pipelines, integration patterns, and MLOps workflows
- Mentor engineering leaders and senior engineers, fostering a culture of strong technical ownership and execution
- Ensure systems meet enterprise standards for reliability, security, privacy, and scalability
Requirements
- 8+ years of software engineering experience, including leadership of engineering teams
- Strong recent experience applying AI/LLM systems to real production products and workflows
- Proven success building and scaling engineering teams in production AI environments
- Deep experience with the modern LLM ecosystem, including:
  - embeddings
  - RAG architectures
  - RLHF
  - fine-tuning approaches
- Experience designing or implementing LLM evaluation frameworks, including prompt testing, grounding strategies, hallucination mitigation, and benchmarking
- Strong programming experience in Python and/or TypeScript/JavaScript
- Excellent communication skills with experience leading cross-functional initiatives
Preferred Qualifications
- Experience working with large-scale datasets in AdTech or advertising platforms
- Experience with Snowflake, Databricks, or similar large-scale data platforms
- Hands-on experience with AI frameworks such as PyTorch, TensorFlow, or Hugging Face
- Experience deploying models into production cloud environments
- Familiarity with data pipelines, integration patterns, and MLOps practices
- Direct experience implementing RAG systems at scale and/or fine-tuning large language models
- Exposure to diffusion, image, or video generation models
- Experience with creative pipelines or AI-driven content workflows
- Background in statistical modeling or mathematics
- Awareness of AI privacy, copyright, or ethical considerations
- Experience in media, entertainment, or content-tech industries