Technology | Fitly AI

The model is the last thing we trust.

The naive AI app takes your message, glues on some profile data, sends it to a model, and renders the reply. It works for demos and collapses the moment a real person shows up with a real life, a shoulder injury, a hatred of quinoa, and a "make it harder" with no other context.

A model is a smart, expensive, occasionally-wrong text engine. Everything that makes Fitly feel like an agent instead of a chatbot is engineered around it.

Ten layers run on every message.

You type one sentence. Nine systems run before the model replies, and a tenth runs after.

Auth, validation & limits

Every request is verified, schema-checked, and rate-limited before a single token is spent. The layer that says no is what lets the rest feel generous.

Image classification

Photos are sorted into eight types (meal, label, barcode, scale, equipment, body, screenshot, other) so each one triggers the right pipeline instead of one fuzzy guess.

Parallel context assembly

Nine queries fan out at once: profile, macro targets, today's logs, recent workouts, active plans, history, workflow state, long-term memory, and streaks.

Workflow continuation

Detects whether "yes, the bigger one" is a fresh request or a reply mid-task, across nine multi-turn workflows, so the conversation just flows.

Dynamic prompt builder

The system prompt is assembled per turn from 20+ conditional blocks: personality, diet, targets, tools, refusal rules. Every irrelevant token is a tax on quality.

Dynamic tool scoping

50+ tools exist; the model only sees the few that fit this turn. Smaller tool lists mean faster, more accurate tool choices.

Multi-model routing

Cheap tasks run on a fast model; hard ones route to a stronger one, and a turn can escalate itself mid-flight. Thoughtful guidance at consumer prices.

Agentic tool loop

The model can call tools, read results, and call more, up to ten iterations, before it replies. One bubble can hide six tool calls. It checks its work.

Response guards

Three guards run before the reply ships: it really saved what it claimed, it can't write into future days, and it can't quietly drop a logged food.

Persistence & memory

After the reply, the turn is stored, memory candidates are extracted, streaks advance. The layer that makes the next conversation sharper than the last.

Want the full deep dive?

We wrote a long-form breakdown of all ten layers, what runs, why it runs, and what it costs to get right.

Read: The Orchestrator Inside Fitly →

An agent other agents can talk to.

Fitly isn't just an agent you talk to. It runs a Model Context Protocol (MCP) server, so your Claude, ChatGPT, Gemini, or any custom agent can connect and use the same tools your agent uses. Your fitness data becomes a capability any agent can call, no app required. Available on the Fitly AI Pro plan.

Your AI agents

ClaudeChatGPTGeminiCustom

→MCP

your agent + 50+ tools

→

Your fitness data

read & write

Read your profile, goals, and history

Log meals, workouts, weight, and water

Check macros and today's progress

Pull up and activate your plans

Scoped, token-based access per agent

Streamable HTTP transport

What the orchestrator buys you.

An agent, not a prompt

The model is one component, not the product. It reasons, picks from 50+ structured tools, calls them in a loop, and only answers once it has checked the data. That is the difference between a chatbot and an agent.

Full context, every turn

Before the model speaks, nine queries assemble your present-tense state: goals, injuries, allergies, today's macros, recent workouts, and long-term memory. It never asks you to repeat yourself.

Provider-agnostic, multi-model

Fitly runs on frontier models (Claude by default) behind an abstraction, and routes each turn to the right one for the job, escalating mid-turn when a task turns out to be hard.

Photo intelligence

A fast classifier sorts every image into eight categories first, so a nutrition label gets OCR'd, a barcode gets looked up, and a plate gets portion-estimated, each by the right pipeline.

Guards you can trust

A write-confirmation guard, a future-write guard, and a partial-logging guard run on every response. We don't let the model lie about what it saved, log into next Tuesday, or drop a food.

Private by design

Health data is encrypted, served through pre-signed URLs, and every data change is written to an audit log. Your data is yours; we never sell it.

Sharper every week

After every turn, Fitly extracts what it learned about you into long-term memory, so the next conversation starts further ahead than the last.

Built to scale

A stateless API, cursor-based pagination, Redis-cached context, and queue-backed processing on AWS, architected to serve millions from day one, not bolted on later.

The engine behind your agent.

The model is the last thing we trust.

Ten layers run on every message.

Auth, validation & limits

Image classification

Parallel context assembly

Workflow continuation

Dynamic prompt builder

Dynamic tool scoping

Multi-model routing

Agentic tool loop

Response guards

Persistence & memory

Want the full deep dive?

An agent other agents can talk to.

What the orchestrator buys you.

An agent, not a prompt

Full context, every turn

Provider-agnostic, multi-model

Photo intelligence

Guards you can trust

Private by design

Sharper every week

Built to scale

Everyone has the same models.
The difference is what runs around them.

Ready to start?

The engine behind your agent.

The model is the last thing we trust.

Ten layers run on every message.

Auth, validation & limits

Image classification

Parallel context assembly

Workflow continuation

Dynamic prompt builder

Dynamic tool scoping

Multi-model routing

Agentic tool loop

Response guards

Persistence & memory

Want the full deep dive?

An agent other agents can talk to.

What the orchestrator buys you.

An agent, not a prompt

Full context, every turn

Provider-agnostic, multi-model

Photo intelligence

Guards you can trust

Private by design

Sharper every week

Built to scale

Everyone has the same models.The difference is what runs around them.

Ready to start?

Everyone has the same models.
The difference is what runs around them.