// case study

The rebuild that got us off a platform we didn’t own

A reference build we commissioned ourselves, because NDAs block most of the before-and-afters we’d otherwise publish.

scope Full rebuild of a base44-generated booking platform

stack Next.js · TypeScript · Supabase · Vercel

time ~2 weeks of senior engineering

// the short version

Same product. Different posture.

We built the same booking app twice. First in base44, a vibe-coding platform that promises a working product in a single prompt. Then again, from scratch, in Next.js and Supabase, using the same product spec and the same screens.

The base44 version demoed fine. It was also unshippable: no way to export the code, no auth we could trust, validation that accepted anything, and the kind of soft failures you only find out about when a real customer hits them.

The rebuild is the version in the screenshots on this page. This is the gap we fix for a living.

The rebuild that got us off a platform we didn’t own screenshot

// why we did this

Most rescues start with a screenshot and a bad feeling.

Most of our rescue engagements start with a founder sending us a Lovable, Bolt, Cursor, or base44 repo and asking some version of, "can this be saved?" The answer is almost always yes. But the why is hard to explain without showing the work, and client NDAs mean we can’t always publish the before-and-after.

So we built one ourselves, on purpose, from a tool we hadn’t yet put through its paces. BookBase is a booking platform in the Calendly / Acuity / Square Appointments lane. We picked it because the category is common, the happy path looks easy, and the edge cases (double-booking, timezone drift, cancellation logic, payment validation, multi-tenant data isolation) are the exact places vibe-coded software falls apart.

The point isn’t to dunk on base44. Base44 is good at what it’s good at: going from zero to a working demo in about an hour. The point is what happens on day 90, when a real user submits a booking with a $-50 price in the notes field and the whole thing silently accepts it.

// what we found

Seven issues, none of them visible in a demo.

We gave base44 the same product brief we would give any engineer starting from scratch. It produced a working app in roughly an hour. Here is what we found when we stopped watching and started looking.

[01]

The code is not yours.

Base44 does not expose the generated source. You cannot clone the repo, audit the schema, inspect the auth flow, read the RLS policies, set up staging, or run it locally. You are renting an app you do not own. This is the single issue that makes every other issue unfixable.

[02]

Validation is cosmetic.

Empty names were accepted through the API. Malformed emails never surfaced an error. The phone field accepted arbitrary text. Notes had no length cap. Past-date bookings via crafted requests went through. Double-booking was a race condition.

[03]

Multi-tenant isolation relies on trust.

No query-level scoping we could verify. The isolation guarantee was "the UI does not show you the other data." That is not a guarantee. Anyone with the API shape and a login token could have queried across tenants.

[04]

No error handling worth the name.

When a booking failed, the user got either a spinner that never resolved or a generic "something went wrong." The server-side error was swallowed. No monitoring, no alerts. If a user hit an error, we would find out when they emailed us.

[05]

No tests.

Not a unit test. Not an integration test. Not an end-to-end test. Every deploy is a leap of faith.

[06]

Performance is accidentally fine.

With a dozen bookings, everything was fast. We loaded it with 5,000. Dashboard queries took seconds. The appointments list had no pagination. The calendar view did the same.

[07]

Observability is zero.

No logs, no metrics, no uptime monitoring. The only way to diagnose a production issue was to refresh the app and hope you could reproduce it.

None of this is visible in a demo. All of it matters the moment you have a real customer.

// the rebuild

Two weeks of senior engineering. Boring stack, on purpose.

Because base44 would not export the code, a refactor was not on the table. We rebuilt from scratch, using the base44 product as a working spec. Every choice below is defensible. None of it is exotic. This is the stack you would pick if you were planning to run the app for five years instead of five days.

// app

Application layer

Next.js 15 (App Router)
TypeScript strict mode
React Hook Form (client validation)
Zod (runtime validation on every API boundary)

// data

Data & security

Supabase (Postgres + Auth)
RLS scoped by tenant on every table
Transaction-guarded slot booking
Cursor-based pagination

// ops

Observability & delivery

Sentry error monitoring
Vercel hosting + preview deploys
Playwright E2E tests
Vitest unit tests on business logic

// before / after

The hour feels cheaper until the first real customer.

A selection of the differences, in concrete numbers. The rebuild took two weeks. The base44 version took an hour.

Metric

base44 version

Rebuild

Source code access

None (platform-locked)

Full (owned repo)

TypeScript strict mode

N/A (no access)

On, 0 errors

Query-level tenant isolation

UI-filtered only

RLS-enforced at DB

Runtime validation on booking API

None

Zod schema, every field

Form validation checks

1 (empty name)

11 (name, email, phone, notes length, service, slot, date, timezone, price, status, tenant)

Double-booking prevention

Race condition

Transaction + unique constraint

End-to-end tests

Unit tests (business logic)

Appointments list pagination

None (full-table fetch)

Cursor-based, 25 / page

Error monitoring

None

Sentry + Slack alert routing

Exportable code

Yes (Git repo handed off)

Staging environment

Not possible

Preview deploy per PR

Deploy rollback

Not possible

One click (Vercel)

Time to diagnose a production issue

Unbounded (no logs)

Minutes (Sentry + structured logs)

The rebuild delivered every feature the base44 version had, plus ownership, auditability, tests, and observability. Full table in the write-up above.

// the public booking flow

Where prototype-versus-production gets concrete.

A customer-facing booking form has exactly one job: collect a valid booking and put it in the database. Everything it gets wrong is visible to a paying customer.

In the base44 version, the name field accepted any string including empty whitespace, the email field had no format validation, the phone field accepted text, notes had no length limit, and the time slot did not re-check availability on submit. If two people picked the same slot within a few seconds, one of them got a confirmation page for a booking that did not actually exist. The confirm button had no disabled state, so rapid clicks created duplicates.

In the rebuild, name is validated on both sides. Email format is checked client and server. Phone is optional but format-checked when present. Notes are capped at 1,000 characters with a visible counter. Slot availability is re-checked inside a database transaction at submission time: the booking commits atomically or fails with a clear error asking the user to pick a new slot. The submit button enters a loading state and is disabled until the request resolves. All of it is covered by end-to-end tests that simulate real-world race conditions.

This is the last 30% of the work. It does not look like anything in a demo. It is the entire difference between "works" and "ships."

// next step

Got a base44, Lovable, Bolt, or Cursor app in the same spot?

Book a production audit. A senior engineer reads your code, runs the diagnostics, and tells you honestly whether a rescue, a rebuild, or staying on the platform for another month is the right call for your situation.

Book a production audit

Follow :

The rebuild that got us off a platform we didn’t own

Same product. Different posture.

Most rescues start with a screenshot and a bad feeling.

Seven issues, none of them visible in a demo.

The code is not yours.

Validation is cosmetic.

Multi-tenant isolation relies on trust.

No error handling worth the name.

No tests.

Performance is accidentally fine.

Observability is zero.

Two weeks of senior engineering. Boring stack, on purpose.

Application layer

Data & security

Observability & delivery

The hour feels cheaper until the first real customer.

Where prototype-versus-production gets concrete.

Got a base44, Lovable, Bolt, or Cursor app in the same spot?

Company

Services

Get in Touch

Email us:

Locations:

Follow :

The rebuild that got us off a platform we didn’t own

Same product. Different posture.

Most rescues start with a screenshot and a bad feeling.

Seven issues, none of them visible in a demo.

The code is not yours.

Validation is cosmetic.

Multi-tenant isolation relies on trust.

No error handling worth the name.

No tests.

Performance is accidentally fine.

Observability is zero.

Two weeks of senior engineering. Boring stack, on purpose.

Application layer

Data & security

Observability & delivery

The hour feels cheaper until the first real customer.

Where prototype-versus-production gets concrete.

Got a base44, Lovable, Bolt, or Cursor app in the same spot?

Other case studies in the same lane.

From one-file prototype to App Store product

Same UI, production-grade underneath

Email us:

Locations: