Now in private beta · 240 early users

A coach for your AI stack.

You use Claude, ChatGPT, Cursor, Perplexity, and more. ai/ops watches how you actually use them, finds the patterns no one sees, and tells you exactly what to change — with receipts.

Open demo · Create account
Start with demo. See the product instantly with no login.
Then create account. Save your setup and move into onboarding.
Local-first. Your prompts never leave your device unless you opt in.
Open methodology. Every recommendation shows its receipts.
ai/ops · app.aiops.so/intelligence
View 01 · Intelligence

Your AI Stack Health

7 active recommendations · ⌘K
Verdict · Updated 14 min ago
Stack drift detected.
Coding work is fragmented across three tools, and short queries aren't routed to a smaller model. Three high-confidence fixes will move you back into the Efficient zone.
72 / 100 · +4 week-over-week
◧ Template · 92%

Templatize your weekly research pull

You've run a variant 11 times this week. Save ~3.2h.

+3.2h / wk · View →
⇄ Swap · 88%

Move multi-file edits to Cursor

Repo-aware IDE saves the "you forgot the import" loop.

+38% throughput · View →
◫ Config · 81%

Route short asks to a smaller model

62% of spend, 18% of sessions. Router will fix it.

−$120 / mo · View →
◆ Nudge · 69%

Pick a primary for long-form thinking

Bouncing breaks reasoning chains. Pick one for >20-turn sessions.

+ coherence · View →
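
To make the "Route short asks to a smaller model" card concrete, here is a minimal sketch of what such a router could look like. Everything in it is an assumption for illustration: the `route` and `estimateTokens` names, the two tiers, the rough four-characters-per-token estimate, and the threshold value are hypothetical, not ai/ops's actual configuration.

```typescript
// Hypothetical router sketch: short prompts go to a smaller model tier.
type ModelTier = "small" | "large";

interface RouterConfig {
  // Assumed cutoff: prompts at or under this many estimated tokens count as "short asks".
  shortAskMaxTokens: number;
}

// Crude estimate (about 4 characters per token); a real tokenizer would differ.
function estimateTokens(text: string): number {
  return Math.ceil(text.length / 4);
}

function route(prompt: string, cfg: RouterConfig): ModelTier {
  return estimateTokens(prompt) <= cfg.shortAskMaxTokens ? "small" : "large";
}

const cfg: RouterConfig = { shortAskMaxTokens: 50 };
const shortAsk = route("What is a monad?", cfg);
const longAsk = route("x".repeat(1000), cfg);
```

The threshold is the only knob here, which keeps the change easy to reverse: raise it and more traffic goes to the small tier, lower it and the router steps aside.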

Four promises we're betting the company on.

If we break any of these, the product fails. They're not features — they're the floor.
01 · Trust

Local-first, by default.

Your prompts are captured on your device. PII and secrets are scrubbed before anything is summarized. Cloud sync is opt-in, granular, and one-tap reversible.
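
As a rough illustration of "scrubbed before anything is summarized", the sketch below redacts two kinds of sensitive strings on-device before building a summary. The regular expressions, the `sk-`/`pk-` key prefix, and the `scrub` and `summarizeLocally` names are all hypothetical; a production scrubber would need a far broader rule set.

```typescript
// Hypothetical patterns for illustration only.
const EMAIL = /[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}/g;
const API_KEY = /\b(?:sk|pk)-[A-Za-z0-9]{16,}\b/g;

// Replace secrets and email addresses with placeholders.
function scrub(prompt: string): string {
  return prompt.replace(API_KEY, "[SECRET]").replace(EMAIL, "[EMAIL]");
}

// Scrub first, then summarize: the summary never sees the raw prompt text.
function summarizeLocally(prompts: string[]): { count: number; samples: string[] } {
  const cleaned = prompts.map(scrub);
  return { count: cleaned.length, samples: cleaned.slice(0, 3) };
}

const scrubbed = scrub("mail me at a@b.co with key sk-ABCDEFGHIJKLMNOP1234");
const summary = summarizeLocally(["ping a@b.co", "rename hook across 4 files"]);
```

The ordering is the point of the sketch: redaction runs before any summary is produced, so an opt-in sync step would only ever receive placeholders.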

02 · Freshness

Every recommendation cites an "as of" date.

Model capabilities shift weekly. So do we. Our eval harness runs every Monday and we publish our reversal rate — how often last quarter's recommendations got revised. Lower is better. We show ours.
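
One plausible reading of the reversal rate is the share of last quarter's recommendations that were later revised. The sketch below computes it under that assumption; the `IssuedRecommendation` shape and the `reversalRate` name are hypothetical, and the published metric may be defined differently.

```typescript
// Hypothetical record of a recommendation issued last quarter.
interface IssuedRecommendation {
  id: string;
  revised: boolean; // true if the recommendation was later changed or withdrawn
}

// Fraction of issued recommendations that were revised (0 when none were issued).
function reversalRate(lastQuarter: IssuedRecommendation[]): number {
  if (lastQuarter.length === 0) return 0;
  const revised = lastQuarter.filter((r) => r.revised).length;
  return revised / lastQuarter.length;
}

const sampleRate = reversalRate([
  { id: "a", revised: false },
  { id: "b", revised: true },
  { id: "c", revised: false },
  { id: "d", revised: false },
]);
```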

03 · Cold start

Day 1 is never empty.

Pick an archetype on signup and we'll seed your feed with cohort-validated recommendations — clearly labeled "based on people like you." By day seven, most are personal.

04 · Receipts

Every claim shows its work.

Open any recommendation and you'll see the actual prompts, sessions, and metrics it learned from. We don't ask you to trust a black box — we hand you the file.

How it works

Anatomy of a recommendation.

Every card on your feed follows the same structure. Observation, the change we suggest, the impact, the evidence, the confidence, and the date. No mystery. No "trust us."

Kind · One of five: Swap, Config, Automate, Nudge, Template.
Observation · The specific pattern we saw in your behavior.
Receipts · The actual sessions / prompts / metrics behind it.
Confidence · A real number, with a breakdown of the signals.
As of · When the underlying tool comparison was last verified.
Apply · One click. Always reversible.
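
The six fields above map naturally onto a small data structure. This is a sketch under stated assumptions, not the product's schema: the field names, the 0 to 1 confidence scale, the signal names, and the `applyRec`, `revertRec`, and `badge` helpers are all hypothetical, and the `asOf` value is a placeholder date.

```typescript
// Hypothetical model of a recommendation card.
type Kind = "Swap" | "Config" | "Automate" | "Nudge" | "Template";

interface Recommendation {
  kind: Kind;
  observation: string;             // the pattern seen in the user's behavior
  receipts: string[];              // labels of the sessions or prompts behind it
  confidence: number;              // assumed scale: 0 to 1
  signals: Record<string, number>; // breakdown behind the confidence
  asOf: string;                    // ISO date of last verification (placeholder below)
  applied: boolean;
}

// Apply and revert return new objects, so a change is always reversible.
function applyRec(rec: Recommendation): Recommendation {
  return { ...rec, applied: true };
}

function revertRec(rec: Recommendation): Recommendation {
  return { ...rec, applied: false };
}

// Render the "Kind · NN%" badge shown on a card.
function badge(rec: Recommendation): string {
  return `${rec.kind} · ${Math.round(rec.confidence * 100)}%`;
}

const swap: Recommendation = {
  kind: "Swap",
  observation: "You ran 23 multi-file refactors in Claude this week.",
  receipts: [
    "refactor: rename hook across 4 files",
    "add types to api/ + handlers/",
    "extract useAuth from App.tsx",
  ],
  confidence: 0.88,
  signals: { sessions: 23 },  // illustrative signal name
  asOf: "2000-01-01",         // placeholder, not a real verification date
  applied: false,
};

const roundTrip = revertRec(applyRec(swap));
```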
⇄ Swap

Move multi-file code edits to Cursor.

You ran 23 multi-file refactors in Claude this week. Repo-aware IDEs see the whole project — fewer "you forgot the import" loops.

Sessions · 23
Retry rate · 32% → 7%
Cost / task · $0.42 → $0.18
refactor: rename hook across 4 files
add types to api/ + handlers/
extract useAuth from App.tsx
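
For a sense of what the card's numbers add up to, the sketch below works through the arithmetic. It assumes all 23 sessions move to the new tool, one task per session, and that the per-task costs shown on the card hold; the `SwapMetrics` shape and helper names are hypothetical, and the resulting weekly figure is derived here rather than quoted from the product.

```typescript
// Figures taken from the Swap card; costs kept in cents to avoid float drift.
interface SwapMetrics {
  sessions: number;
  retryBeforePct: number;
  retryAfterPct: number;
  costBeforeCents: number;
  costAfterCents: number;
}

const card: SwapMetrics = {
  sessions: 23,
  retryBeforePct: 32,
  retryAfterPct: 7,
  costBeforeCents: 42,
  costAfterCents: 18,
};

// Weekly savings if every session moves: sessions x (before - after).
function weeklySavingsCents(m: SwapMetrics): number {
  return m.sessions * (m.costBeforeCents - m.costAfterCents);
}

// Drop in retry rate, in percentage points.
function retryDropPoints(m: SwapMetrics): number {
  return m.retryBeforePct - m.retryAfterPct;
}

function formatDollars(cents: number): string {
  return `$${(cents / 100).toFixed(2)}`;
}

const savings = formatDollars(weeklySavingsCents(card)); // "$5.52"
const drop = retryDropPoints(card);                      // 25
```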

Five kinds of advice. Nothing more.

Swap

Move a workflow from one tool to a better-suited one.

Verb · move

Config

Change a setting or router. Same tool, better setup.

Verb · tune

Automate

Schedule, batch, or run a repetitive workflow in the background.

Verb · cycle

Nudge

A behavioral fix. Pick a primary tool. Stop context-switching.

Verb · choose

Template

Capture a repeated pattern as a reusable starting point.

Verb · capture
"I don't think about my AI stack anymore. It just works — and I have receipts."
— early user · staff engineer · 11 weeks in
3.4h · avg saved / week
$184 · avg spend cut / mo
7.2 · recommendations applied / mo
94% · kept after 30 days
Self-reported, n=240 early users, weeks 6–12.

Get a coach for your AI stack.

We're opening 100 more seats this month. Tell us what you use and we'll prepare your first verdict before you even install.

No spam. We only use this to manage early access and product updates.