Case studies

Selected work. Honest scope. Real numbers.

The work that earned the opinions.

NS Studio is a young studio. We publish only the work we can write about with full context, what we did, what we measured, what we killed, and what we'd do differently. New case studies added each quarter.

Featured · EdTech · End-to-end

The Dreamers Education.

A pre-launch education company came to NS Studio with a mission and a domain name. They left with a brand system, a shipped product, and an AI-driven learning workflow validated against a real pilot cohort of 82 families.

82 Beta families onboarded
68% Weekly session completion
3 Features killed with data
<role>
Act as the founding team of a
pre-launch education company.
You have a mission, a domain,
and a budget. You need a brand,
a product, and a learning loop
that survives a beta cohort.
</role>

<task>
Ship the smallest version that
proves families come back for
week 4, not the demo.
</task>

<output>
82 families · 68% completion ·
3 features killed with data
</output>

The prompt that ran the engagement.

82
Beta families
onboarded
2.3×
More practice
vs. worksheets
74%
Parents reported
confidence lift
3
Features killed
with usage data

The brief

The Dreamers Education was a pre-launch venture aimed at students underserved by mainstream EdTech, kids who don't fit the standard curriculum but have outsized capacity. The founder had a name, a hunch, and a small budget. What they didn't have: a brand, a product, a positioning, or any clear sense of how AI fit into what they were building.

They needed a partner who could do all four, and tell them honestly which corners to cut.

What we actually built

Five components. We deliberately resisted the urge to over-engineer. The discipline of saying "no, that's not in scope" was worth more than any model choice.

01 · Generation
GPT-4.1
Adaptive practice question generation, calibrated to a 70% target success rate per student. No fine-tuning at this layer, prompts handle the work.
02 · Diagnostic
Rule-based skill detection
Deterministic classifier that flags where each student is actually stuck. Cheap, auditable, and explainable to parents and teachers.
03 · Personalization
Light fine-tune · 500 responses
Small fine-tune on anonymized pilot responses to bias the model toward Dreamers' tone and pedagogy. Not the model's intelligence, its manners.
04 · Grounding
K–2 question bank
Curriculum-aligned math and reading question bank, hand-vetted, used as a retrieval layer to keep generations on-curriculum.
05 · Feedback
Weekly parent surveys
Three-question survey emailed every Friday. The single highest-leverage signal we had, and the cheapest to operate.
Pilot results · Q1 2026

The numbers we measured.
And the ones we didn't dress up.

We ran a 10-week pilot with 82 beta families recruited through a parent waitlist. These are the four charts we presented to the founder at the end of the pilot, verbatim.

01 · Engagement

From waitlist to weekly habit.

Conversion through the pilot funnel. The two numbers we cared about: did families show up, and did they come back the second week?

100% 60% 20% 100% Waitlist n=143 57% Onboarded n=82 47% Wk 1 active n=68 38% Wk 4 active n=55 28% Wk 10 n=40

28% week-10 retention is strong for early-stage EdTech (industry median ≈ 12–18%). Source: Dreamers internal product analytics, pilot cohort.

02 · Behavior change

What we actually replaced.

The invisible competitor wasn't another EdTech app. It was worksheets and ChatGPT. Self-reported parent behavior, week 0 vs. week 8.

100% 40% 0% 80% 35% Worksheets 40% 15% ChatGPT (HW) , 61% Dreamers
Week 0 (baseline) Week 8 (post-pilot)

Parent-reported "primary tool used for at-home practice." Sample: 55 active families at week 8.

03 · Learning outcomes

Confidence, before and after.

Parent-reported student confidence on a 1–10 scale, taken at intake and at week 8. Distribution shifted right; the long tail compressed.

median 3.2 median 6.8 1 3 5 7 10 Confidence score (parent-reported)
Intake (week 0) Week 8 (post-pilot)

74% of parents reported a 2+ point lift on the 10-point scale. Self-reported data, directional, not clinical.

04 · Trade-off

Gamified got more clicks. And finished less work.

Two-week A/B with 40 students, split evenly. Gamified version had streaks, points, and avatar rewards. Non-gamified had none.

100% 66% 33% +44% base Clicks/session −22% +31% Lessons completed −38 +12 Parent trust (NPS)
Gamified version Non-gamified (shipped)

Result: we shipped the non-gamified version. Engagement-as-clicks was the wrong North Star. n=40, two-week test.

The four-disciplines-in-one engagement

1. Brand identity

We built the visual system in two weeks: name lockup, logo system, color (a warm, dreamy palette anchored in earned-not-given rust amber and serif typography that signals seriousness without feeling academic), tone-of-voice guidelines, and a launch-ready brand book. The constraint was clarity, students, parents, and teachers all needed to see themselves in the brand.

2. Product & UX

Core flows designed end-to-end: onboarding, learning module navigation, practice loops, and progress feedback. We made one opinionated call early, no gamification gimmicks, no streaks, no points, and then we proved it with the A/B test above. Trust was the differentiator.

3. Go-to-market positioning

We positioned The Dreamers around a single contrast: most EdTech optimizes for engagement, we optimize for outcomes parents actually pay for. That sentence drove the homepage, the pricing page, the email sequence, and every paid acquisition test. It also told us which channels to ignore (social spectacle) and which to invest in (parent communities, school partnerships).

4. AI-driven learning workflow

The heart of the engagement. Five components (above), one non-negotiable: every model decision had to be explainable in plain English to a parent who'd never used AI before. The rule-based diagnostic layer exists because of that constraint, not in spite of it.

The hardest part of an engagement like this isn't the work. It's the ruthless prioritization. Every week, three things ask to be built. We ship one and tell the founder why we killed the other two.

What we killed

Three features made it into the prototype and didn't make it to launch. All three had champions inside the team. None of them survived contact with usage data.

Voice-based AI tutor
Demoed beautifully. Parents loved it in pitches. But repeat usage collapsed after the first session, the latency and conversational fragility didn't survive an impatient seven-year-old.
→ 8% week-2 usage · killed wk 4
Streaks & points system
Increased clicks (+44%) but reduced lesson completion (−22%) and tanked parent trust scores. Optimized for the wrong outcome. The A/B test settled the argument in two weeks.
→ −22% completion · killed wk 5
AI storytelling mode
High novelty, high first-session engagement, near-zero second-session use. Kids treated it as a toy, not a tool. Parents stopped recommending it within ten days.
→ 11% repeat use · killed wk 6

The honest timeline

An earlier version of this case study said "two weeks to launch-ready." That was true for the brand system, not the product. Here's the real arc.

Weeks 1–2 · Strategy & brand
Positioning, naming, brand system, prototype scaffolding.
Two weeks of intense scoping. We resisted the temptation to start building the product. The brand decisions made the product decisions easier.
Weeks 3–6 · MVP build
Core practice loop, diagnostic layer, question bank, first model integration.
Three components had champions. We shipped two and parked the third. The voice tutor went into a "later" pile that turned into a "never" pile.
Weeks 7–10 · Closed beta
82 families onboarded. Weekly parent surveys. The A/B test that killed gamification.
This is where the case study earns its numbers. Three features died in this window. The product got smaller and better every week.
Weeks 11+ · Iteration
Public launch, onboarding rewrite, school-partnership outreach.
The product that launched looked nothing like the prototype. That's the point.

What we'd do differently

We over-invested in the brand book in week one. The founder needed less documentation and more momentum. Future end-to-end engagements start with the AI workflow, not the brand system, even when the client thinks they want the opposite.

We also waited too long to run the gamification A/B. We had a hunch in week three. We ran the test in week eight. Five weeks of avoidable arguing.

That lesson became part of the standard NS Studio engagement shape. It's why every engagement now starts with the assumption test, not the deliverable inventory.

More case studies

In progress.

We publish two to three case studies per year, with full context and client permission. Subscribe to be notified when new work goes live.

Mid-market · OpsQ3 2026

Workflow audit at a 200-person SaaS company

How we returned 4 hours per CSM per week without buying a single new tool. Coming soon.

Research · ActiveFall 2026 cohort

The Calibration Program

Two pre-registered empirical studies on what AI is doing to executive prioritization at F500 scale. Open call for partner companies and research collaborators.

Read the program →
Public sectorQ4 2026

AI governance for a regulated workforce

Drafting an AI use policy that legal, ops, and the union could all sign. Coming soon.

Could your engagement be the next case study?

We work with a small number of clients at any one time, by referral or after a fit call. If the conversation is mutually right, we move fast.

Book a 30-min call
Or first Submit discovery form

METHODOLOGY · Pilot ran Q1 2026 with 82 beta families recruited from a 143-person waitlist. Engagement metrics from internal product analytics. Confidence scores and behavior change are parent-reported via weekly surveys (n=55 active families at week 8). Gamification A/B was a two-week test with 40 students randomly assigned. Numbers rounded for legibility. Figures are illustrative of pilot performance, not a forecast. Published with founder permission.