Most fitness AIs are a model and a prompt. Fitly is a ten-layer orchestrator: nine systems run before the model is allowed to speak, and a tenth runs after. Here is what happens beneath the chat bubble.
The naive AI app takes your message, glues on some profile data, sends it to a model, and renders the reply. It works for demos and collapses the moment a real person shows up with a real life, a shoulder injury, a hatred of quinoa, and a "make it harder" with no other context.
A model is a smart, expensive, occasionally-wrong text engine. Everything that makes Fitly feel like a coach instead of a chatbot is engineered around it.
You type one sentence. Nine systems run before the model replies, and a tenth runs after.
Every request is verified, schema-checked, and rate-limited before a single token is spent. The layer that says no is what lets the rest feel generous.
Photos are sorted into eight types (meal, label, barcode, scale, equipment, body, screenshot, other) so each one triggers the right pipeline instead of one fuzzy guess.
Nine queries fan out at once: profile, macro targets, today's logs, recent workouts, active plans, history, workflow state, long-term memory, and streaks.
Detects whether "yes, the bigger one" is a fresh request or a reply mid-task, across nine multi-turn workflows, so the conversation just flows.
The system prompt is assembled per turn from 20+ conditional blocks: personality, diet, targets, tools, refusal rules. Every irrelevant token is a tax on quality.
50+ tools exist; the model only sees the few that fit this turn. Smaller tool lists mean faster, more accurate tool choices.
Cheap tasks run on a fast model; hard ones route to a stronger one, and a turn can escalate itself mid-flight. Thoughtful coaching at consumer prices.
The model can call tools, read results, and call more, up to ten iterations, before it replies. One bubble can hide six tool calls. It checks its work.
Three guards run before the reply ships: it really saved what it claimed, it can't write into future days, and it can't quietly drop a logged food.
After the reply, the turn is stored, memory candidates are extracted, streaks advance. The layer that makes the next conversation sharper than the last.
We wrote a long-form breakdown of all ten layers, what runs, why it runs, and what it costs to get right.
Read: The Orchestrator Inside Fitly →Fitly isn't just an agent you talk to. It runs a Model Context Protocol (MCP) server, so your Claude, ChatGPT, Gemini, or any custom agent can connect and use the same tools your coach uses. Your fitness data becomes a capability any agent can call, no app required. Available on the Pro plan.
The model is one component, not the product. It reasons, picks from 50+ structured tools, calls them in a loop, and only answers once it has checked the data. That is the difference between a chatbot and an agent.
Before the model speaks, nine queries assemble your present-tense state: goals, injuries, allergies, today's macros, recent workouts, and long-term memory. It never asks you to repeat yourself.
Fitly runs on frontier models (Claude by default) behind an abstraction, and routes each turn to the right one for the job, escalating mid-turn when a task turns out to be hard.
A fast classifier sorts every image into eight categories first, so a nutrition label gets OCR'd, a barcode gets looked up, and a plate gets portion-estimated, each by the right pipeline.
A write-confirmation guard, a future-write guard, and a partial-logging guard run on every response. We don't let the model lie about what it saved, log into next Tuesday, or drop a food.
Health data is encrypted, served through pre-signed URLs, and every data change is written to an audit log. Your data is yours; we never sell it.
After every turn, Fitly extracts what it learned about you into long-term memory, so the next conversation starts further ahead than the last.
A stateless API, cursor-based pagination, Redis-cached context, and queue-backed processing on AWS, architected to serve millions from day one, not bolted on later.
Every fitness AI in the App Store has access to roughly the same models we do. The orchestration around the model is the moat, and it is where we have spent our build cycles. That is why a sentence as ordinary as "I had a chicken bowl from Chipotle" can do as much real work as it does inside Fitly.
Drop your email and we'll let you know when spots open up.