Founded over 25 years ago, Forte Group has transformed from a Quality Assurance-focused company into a dynamic player in the tech industry, delivering innovative solutions globally. Based in Boca Raton, USA, we proudly partner with over 400 clients, including Fortune 500 companies, and our software impacts more than 9 million users, comparable to the entire population of New York City or Switzerland!
Forte Group is looking for an AI Product Engineer. We build products where AI is the core capability. Our AI Product Engineers work on client engagements across the US and Europe, delivering LLM-powered features, retrieval systems, agentic workflows, and fully AI-native applications into production.
Recent projects include:
- An AI agent extracting structured settlement data from unstructured financial documents with 95%+ accuracy, including full audit trails and multi-tenant enterprise deployment
- A clinical AI platform that reduced prescription validation time from 20 minutes to 5 minutes across 47 healthcare centers, built and maintained by a team of five engineers
- A revenue optimization platform processing booking data across thousands of facilities using multi-model LLM orchestration, large-scale vector search, and document intelligence pipelines
These are not prototypes. They operate on real data, serve real users, meet compliance requirements, and carry real consequences when they fail.
If that’s the kind of work you want to do, keep reading.
WHAT YOU WILL DO
You will design and deliver AI-powered features end-to-end — from architecture to production and ongoing optimization.
On a typical engagement, you will:
- Build RAG pipelines: document ingestion, chunking strategies, embeddings, vector stores, retrieval evaluation, and re-ranking
- Integrate LLM APIs into production systems using structured outputs, function calling, and multi-model routing
- Balance real-world tradeoffs between accuracy, latency, and cost
- Develop agentic workflows with tool usage, human-in-the-loop checkpoints, and multi-agent coordination
- Build evaluation systems: automated test sets, field-level accuracy metrics, regression detection in CI/CD, and production monitoring
- Design human-in-the-loop review workflows for low-confidence outputs, along with feedback loops
- Own production operations: latency optimization, cost control, drift monitoring, and incident response
SKILLS & EXPERIENCE
Experience
- 5+ years in software engineering
- 2+ years building AI-powered features or products in production
Languages & Frameworks
- Python (primary), TypeScript/Node.js
- Experience with at least one: FastAPI, Flask, Django, Express, or Next.js
- Solid understanding of async patterns, streaming (SSE/WebSockets), and batch processing
LLM & AI Systems
- Experience with LLM APIs: OpenAI, Anthropic, or open-source models (Ollama, vLLM, etc.)
- Strong prompt engineering at a systems level (structured outputs, tool use, few-shot, etc.)
- Multi-model architectures: routing, tiering by cost/complexity, fallback strategies
- Understanding of when to use fine-tuning vs. RAG vs. prompt engineering
RAG & Retrieval
- Vector databases: Pinecone, Weaviate, pgvector, Chroma, OpenSearch, or similar
- Deep understanding of embeddings and chunking strategies (fixed, semantic, sentence-based)
- Hybrid search (vector + keyword), re-ranking, and retrieval evaluation
- Document processing pipelines, including OCR and multi-format ingestion
Agent Frameworks & Orchestration
- Experience with LangChain, LangGraph, CrewAI, AutoGen, or similar tools
- Understanding of MCP (Model Context Protocol) or equivalent tool-integration protocols, and of multi-agent coordination patterns
- Ability to choose between frameworks and custom orchestration
- Secure tool usage patterns (sandboxing, permissions, approval workflows)
Evaluation & Observability
- Evaluation frameworks (LangSmith, Braintrust, Ragas, or custom setups)
- Production monitoring: output quality, latency, cost, model drift
- Tracing across multi-step pipelines
- Human feedback loops and confidence-based routing
Infrastructure & Production
- Containerization and cloud deployment (AWS, Azure, or GCP)
- CI/CD pipelines for AI systems, including automated evaluation
- Data pipelines (Snowflake, dbt, or similar tools are a plus)
- Latency optimization (streaming, caching, async processing)
- Cost management at scale (model tiering, caching, per-query tracking)
- Data governance: PII handling, auditability, compliance
WHAT SETS YOU APART
- You can walk us through a production AI feature you built — including what failed and what you learned
- You speak concretely about RAG (chunking tradeoffs, retrieval accuracy), not just tools
- You’ve handled real-world constraints: latency, cost spikes, unpredictable inputs
- You can explain AI system behavior to stakeholders who care about outcomes, not architecture
- You build evaluation into systems from day one — because you’ve seen what happens when you don’t
WE OFFER
- Work your way — anywhere, anytime. Our remote-first approach lets you choose where and how you work best!
- Experience working with diverse teams and gaining international expertise
- A friendly, supportive team and an enjoyable work environment where your ideas matter
- A chance to work on exciting, challenging projects using cutting-edge technologies that make a real impact
- Comprehensive health insurance, corporate psychologist access, and partial sports activity coverage
- Free training programs, reimbursement for certifications, and access to online learning platforms to fuel your growth
- Paid vacation, public holidays, and sick leave, all fully covered by Forte Group
- Referral bonuses, regular performance reviews, and full support for business trips
- Corporate events and holiday presents
Join a team that invests in your well-being, growth, and success!