sunmoon.dev
All writing

The Fractional CTO Playbook for Early-Stage AI Startups

Seunghun Lee
fractional CTOstartupsAI strategy

You raised a small round or you're still pre-revenue. You have a working demo, two or three engineers, and a roadmap that grows faster than your runway. You can't justify a $250k full-time CTO yet, but you can feel the cost of not having one: every architecture call gets made by whoever is loudest, your model bill is creeping up for reasons nobody can explain, and you keep almost-hiring the wrong person.

That gap is exactly where a fractional CTO earns their fee. I've spent years as a senior engineer at Spotify and Klarna, I was early at a Y Combinator–backed startup, and I now build and run my own AI products. So this isn't theory. Here's what the job actually is when it's done well.

What a fractional CTO is not

Let me clear the air first, because the title gets abused.

A fractional CTO is not a part-time senior dev you rent to clear tickets. If someone is mostly writing your features, you've hired a contractor with a fancy title. The whole point of the role is leverage: a few days a month spent on the decisions that are expensive to reverse, so your full-time engineers can move fast on the ones that aren't.

The cheapest line of code is the one you didn't have to delete six months later because someone made the right call early.

The fractional CTO's job is to be that someone. Concretely, that breaks into four areas.

1. Architecture you won't have to rip out

Early AI products fail architecturally in predictable ways. Founders wire their app directly to a single model provider, bury prompts inside React components, and treat the LLM call like any other synchronous API. Then latency spikes, a provider has an outage, or costs triple, and the whole thing has to be unpicked under pressure.

The fractional CTO's first week is usually about boundaries:

  • An abstraction layer between your product and the model. You should be able to swap GPT for Claude for an open-weights model in a config change, not a refactor. I learned this the hard way building transcribe.so — the moment a cheaper, faster model shipped, the teams who'd hard-coded a provider were stuck and we weren't.
  • Async by default. Anything that touches a model goes through a queue with retries, timeouts, and idempotency. A user clicking "summarize" should never block on a 40-second generation.
  • Eval and observability from day one. You cannot improve what you can't measure. Capture inputs, outputs, latency, and cost per request before you have a quality problem, not after.

None of this is exotic. It's the same discipline I applied to payment flows at Klarna, where a retry done wrong moves real money. AI just adds non-determinism on top.

2. Hiring — and not hiring

The most expensive mistake an early startup makes isn't a bad architecture decision. It's a bad hire. A wrong senior engineer costs you six months and poisons the team's standards.

A fractional CTO does three things here that founders usually can't do alone:

  • Writes the technical bar. What does "good" look like for your stage? At three engineers you want generalists who ship; the platform specialists come later.
  • Runs real technical interviews. Most founders can't tell a strong systems engineer from a confident one. I can, because I've sat on both sides of the table at companies that hire hard.
  • Sequences the hires. The order matters more than the count. Hiring an ML researcher before you have someone who can ship to production is a classic, costly inversion.

Here's the part founders underrate: a good fractional CTO will tell you not to hire. Most early AI startups need fewer people and better tooling, not a bigger team. When I built goodlisten.co, the leverage came from picking the right managed services, not from headcount.

3. Model and cost decisions

This is the area founders most often get wrong, because the numbers look small until they aren't. A demo that costs $0.02 per run feels free. At 100,000 runs a month it's a $2,000 line item, and if you priced your product before you measured that, your margins are already underwater.

Here's the framing I bring to every model decision:

Decision The naive instinct What I usually recommend
Which model The biggest, smartest one The smallest model that passes your evals
When to call it Every request, live Cache aggressively; precompute what you can
Fine-tune vs. prompt Fine-tune early to feel serious Prompt + retrieval first; fine-tune only when evals prove it pays
Build vs. buy Build the ML in-house Buy the model, own the product and the data
Cost tracking Check the invoice monthly Track cost per request in real time, per feature

The single highest-leverage habit is measuring cost per unit of value — per transcript, per summary, per user action — not per token. Tokens are an implementation detail. Margin is the business. Getting that visibility in place is often the first thing I do, because every other model decision depends on it.

4. Derisking the things that kill startups quietly

The dramatic failures get the headlines. The quiet ones kill more companies. A fractional CTO's real value is spotting these before they compound:

  • Security and secrets hygiene. One committed API key in a "private" repo can burn your sender domain or drain your model budget overnight. This gets ignored until it's a crisis. I treat it as week-one hygiene, not a someday project.
  • Vendor lock-in. If your entire product is one provider's API with no abstraction, you've handed them your roadmap and your pricing power.
  • The bus factor. When one engineer holds the whole system in their head and no one else can deploy, you're one resignation away from a frozen roadmap.
  • Compliance creep. If you're touching user audio, transcripts, or anything regulated, the data decisions you make now are very hard to reverse later.

This is where lived operating experience matters more than a framework. I've watched these failure modes play out across large companies and my own products, which means I recognize the early shape of them before they're expensive.

How to actually use a fractional CTO

A few hard-won rules for getting value, not just hours:

  1. Give them decision rights, not just an advisory seat. If they can only suggest, you've bought a consultant. Let them own the architecture and hiring calls.
  2. Keep them close to the code. A fractional CTO who never reads your repo is guessing. The good ones stay technical.
  3. Set a clear handoff. The role is transitional by design. The goal is a system and a team strong enough that you eventually promote internally or hire full-time — and the fractional CTO works themselves out of a job.

Frequently Asked Questions

When should an AI startup hire a fractional CTO instead of a full-time one?

When you have real engineering decisions to make but not enough scale to justify a full-time executive salary — typically pre-seed to early seed, with a small team and a product that's live or close to it. A fractional CTO gives you senior judgment on the irreversible calls without the equity and cash burn of a full-time hire. You graduate to full-time when the technical org is large enough that day-to-day leadership becomes a full job.

How many days a month does a fractional CTO actually need?

Less than founders expect — often two to six days a month. The value is concentrated in a few high-leverage decisions, not in volume of hours. If the role starts to look like full-time contracting, either the scope has drifted into hands-on development or the company has outgrown the fractional model and needs a permanent hire.

Can a fractional CTO help us choose between AI model providers?

Yes, and it's one of the highest-value things they do. The right answer is rarely the biggest model — it's the smallest one that passes your evaluations at a cost that protects your margins, behind an abstraction so you can switch later. A good fractional CTO sets up cost-per-request tracking and an eval harness so the decision is driven by data, not vendor marketing.

What's the biggest risk a fractional CTO derisks for an early AI startup?

Quiet, compounding mistakes: secrets leaks, single-provider lock-in, a one-person bus factor, and pricing set before anyone measured unit cost. These rarely look urgent until they trigger a crisis, and by then they're expensive to unwind. Catching them early — as routine hygiene rather than firefighting — is where an experienced operator pays for themselves many times over.

Where this leaves you

If your AI startup is making expensive architecture and hiring calls by gut feel, you don't need a full-time CTO yet — you need senior judgment applied to the decisions you can't easily reverse. That's the whole playbook: derisk the irreversible, measure what matters, and build a team and system that outlast the engagement.

If that's the gap you're feeling, book a call and let's talk through where your product actually stands.

Have something that needs shipping?

I'm Seunghun Lee — I design, build, and ship production AI agents and full-stack SaaS. Tell me what you're building.