Founded over 25 years ago, Forte Group has transformed from a Quality Assurance-focused company into a dynamic player in the tech industry, delivering innovative solutions globally. Based in Boca Raton, USA, we proudly partner with over 400 clients, including Fortune 500 companies, and our software impacts more than 9 million users—comparable to the entire population of New York or Switzerland!

Forte Group is looking for an AI Product Engineer. We build products where AI is the core capability. Our AI Product Engineers work on client engagements across the US and Europe, delivering LLM-powered features, retrieval systems, agentic workflows, and fully AI-native applications into production.

Recent projects include:

An AI agent extracting structured settlement data from unstructured financial documents with 95%+ accuracy, including full audit trails and multi-tenant enterprise deployment
A clinical AI platform that reduced prescription validation time from 20 minutes to 5 minutes across 47 healthcare centers, built and maintained by a team of five engineers
A revenue optimization platform processing booking data across thousands of facilities using multi-model LLM orchestration, large-scale vector search, and document intelligence pipelines

These are not prototypes. They operate on real data, serve real users, meet compliance requirements, and carry real consequences when they fail.

If that’s the kind of work you want to do, keep reading.

WHAT YOU WILL DO

You will design and deliver AI-powered features end-to-end — from architecture to production and ongoing optimization.

On a typical engagement, you will:

Build RAG pipelines: document ingestion, chunking strategies, embeddings, vector stores, retrieval evaluation, and re-ranking
Integrate LLM APIs into production systems using structured outputs, function calling, and multi-model routing
Balance real-world tradeoffs between accuracy, latency, and cost
Develop agentic workflows with tool usage, human-in-the-loop checkpoints, and multi-agent coordination
Build evaluation systems: automated test sets, field-level accuracy metrics, regression detection in CI/CD, and production monitoring
Design human-in-the-loop workflows for low-confidence outputs and feedback loops
Own production operations: latency optimization, cost control, drift monitoring, and incident response

SKILLS & EXPERIENCE

Experience

5+ years in software engineering
2+ years building AI-powered features or products in production

Languages & Frameworks

Python (primary), TypeScript/Node.js
Experience with at least one: FastAPI, Flask, Django, Express, or Next.js
Solid understanding of async patterns, streaming (SSE/WebSockets), and batch processing

LLM & AI Systems

Experience with LLM APIs: OpenAI, Anthropic, or open-source models (Ollama, vLLM, etc.)
Strong prompt engineering at a systems level (structured outputs, tool use, few-shot, etc.)
Multi-model architectures: routing, tiering by cost/complexity, fallback strategies
Understanding of when to use fine-tuning vs. RAG vs. prompt engineering
RAG & Retrieval
Vector databases: Pinecone, Weaviate, pgvector, Chroma, OpenSearch, or similar
Deep understanding of embeddings and chunking strategies (fixed, semantic, sentence-based)
Hybrid search (vector + keyword), re-ranking, and retrieval evaluation
Document processing pipelines, including OCR and multi-format ingestion

Agent Framework & Orchestration

Experience with LangChain, LangGraph, CrewAI, Autogen, or similar tools
Understanding of MCP (Model Context Protocol) or equivalent multi-agent coordination patterns
Ability to choose between frameworks and custom orchestration
Secure tool usage patterns (sandboxing, permissions, approval workflows)

Evaluation & Observability

Evaluation frameworks (LangSmith, Braintrust, Ragas, or custom setups)
Production monitoring: output quality, latency, cost, model drift
Tracing across multi-step pipelines
Human feedback loops and confidence-based routing

Infrastructure & Production

Containerization and cloud deployment (AWS, Azure, or GCP)
CI/CD pipelines for AI systems, including automated evaluation
Data pipelines (Snowflake, dbt, or similar tools are a plus)
Latency optimization (streaming, caching, async processing)
Cost management at scale (model tiering, caching, per-query tracking)
Data governance: PII handling, auditability, compliance

WHAT SETS YOU APART

You can walk us through a production AI feature you built — including what failed and what you learned
You speak concretely about RAG (chunking tradeoffs, retrieval accuracy), not just tools
You’ve handled real-world constraints: latency, cost spikes, unpredictable inputs
You can explain AI system behavior to stakeholders who care about outcomes, not architecture
You build evaluation into systems from day one — because you’ve seen what happens when you don’t

We Offer

Work your way — anywhere, anytime. Our remote-first approach lets you choose where and how you work best!
Experience working with diverse teams and gaining international expertise
A friendly, supportive team and an enjoyable work environment where your ideas matter
A chance to work on exciting, challenging projects using cutting-edge technologies that make a real impact
Comprehensive health insurance, corporate psychologist access, and partial sports activity coverage
Free training programs, reimbursement for certifications, and access to online learning platforms to fuel your growth
Paid vacation, public holidays, and sick leave are fully covered by Forte Group
Referral bonuses, regular performance reviews, and full support for business trips
Corporate events and holiday presents

Join a team that invests in your well-being, growth, and success!

AI Product Engineer

About this role

Similar jobs

Similar jobs

Similar jobs