Changelog

What's new in AI Company. Every feature, improvement, and fix.

v3.35

CurrentMarch 13, 2026

Mobile Sticky Header Fix, Improvement Cycle Hang Fix & QC Hardening

Generated output headers no longer stick to the screen on mobile — both Streamdown prose and HtmlPreview iframe outputs are now fully scrollable. Improvement cycle LLM calls that hung indefinitely are now capped with a 90s timeout. Comprehensive QC audit performed by ChatGPT agent covering content accuracy, navigation, trust messaging, and mobile UX.

FixedMobile sticky header fix — generated output headers (titles, tags, search bars, buttons) no longer stick to screen while scrolling on mobile

FixedHtmlPreview iframe: injected JS neutralizes all position:sticky/fixed elements after page load — catches CSS classes, <style> blocks, and inline styles

FixedStreamdown prose: added contain:paint CSS containment + strips <style> tags rendered by rehype-raw to prevent sticky escaping scroll container

FixedImprovement cycle hang fix — LLM calls now have a 90s timeout cap, preventing indefinite hangs with only heartbeat messages

ImprovedQC audit by ChatGPT agent: verified three-agent role consistency (CEO=ChatGPT, Operator=Built-in LLM, Reviewer=Gemini) across all pages

v3.34

March 12, 2026

Bold New Logo, One-Click Install, Performance & Accessibility

Complete logo overhaul with bold white 'A' on deep blue background — clearly visible at all icon sizes including Android home screen. Smart PWA install banner auto-detects installability and offers one-tap install. Lazy loading splits 17 routes into separate chunks, boosting Lighthouse Performance from 36 to 54. Accessibility score raised to perfect 100. App background fixed to clean white.

NewBold new app icon — programmatically generated white 'A' on deep blue gradient, visible at all sizes (favicon, 192px, 512px, maskable variants)

NewSmart install banner — auto-detects if app isn't installed, shows dismissable bottom bar with one-tap Install button (remembers dismissal for 7 days)

New'Install App' link added to SiteFooter on every page — appears only when app is installable

NewShared useInstallPrompt hook — reused by InstallBanner, SiteFooter, and About page install section

ImprovedLazy loading — 17 route components converted to React.lazy() with Suspense, only Home loads eagerly

ImprovedRoute prefetching — Invite, AppMain, and About chunks preloaded 2 seconds after Home mounts for instant navigation

ImprovedLighthouse Performance score improved from 36 to 54 (+50%) via code splitting and prefetching

ImprovedLighthouse Accessibility score raised from 80 to 100 — removed zoom restriction, added landmarks and aria-labels

FixedApp background restored to clean white — explicit bg-white on all desktop columns (Plan, Outputs) and mobile layout

FixedLoading screen converted from dark (#0F172A) to white with light-themed text, icons, and tip cards

FixedMaskable icons now use full-bleed blue background — no more double-rounding on Android adaptive icons

ImprovedColor contrast improvements — agent name text upgraded to -700 shades, version badge uses gray-600 on gray-100

NewPlay Store prep — manifest icons split into 'any' and 'maskable' entries, .well-known/assetlinks.json template created

v3.32

March 6, 2026

QA Bug Fixes + PWA Cache Busting

Fixed reviewer FAIL label showing misleading 'In Progress' text, updated service worker cache to force new logo propagation on PWA reinstall, and verified all v3.22 QA report items.

FixedReviewer FAIL verdict now shows 'Needs Review' instead of misleading 'In Progress' label

NewWarning notice when build saved despite reviewer FAIL — suggests increasing Max Cycles

FixedService worker cache bumped to v3.32 to force new logo propagation on PWA reinstall

Improvedmanifest.json now included in service worker cache for faster PWA icon updates

FixedVerified all v3.22 QA report items: abort cooldown, Run badge, /builds footer, Share toast

v3.31

March 6, 2026

New 'A' Circuit Logo + Educational Loading Screen

Brand new 'A' Circuit logo across the entire app — a stylized letter A morphing into neural network circuits with glowing blue nodes. The loading screen now teaches you about Run, Build, and Pro modes with rotating example cards.

NewNew 'A' Circuit logo — stylized letter A with neural network circuits and glowing blue nodes

NewLogo updated across all pages: header, footer, favicon, PWA icons, loading screen

NewEducational loading screen with 6 rotating tip cards for Run, Build, and Pro modes

ImprovedEach tip shows mode name, example prompt, and description with smooth fade animations

ImprovedMode pills highlight the current tip's mode (Run/Build/Pro) with color-coded indicators

ImprovedPre-React HTML loading screen shows logo and mode pills before JS bundle loads

v3.30

March 6, 2026

New Logo, No Splash Screen & PWA Improvements

New circuit-brain app icon replaces the generic white circle. PWA splash screen eliminated — dark background loads instantly without white flash on app open.

NewNew circuit-brain logo and favicon — distinctive dark navy design with glowing blue neural network

FixedPWA splash screen eliminated — dark background (#0F172A) loads instantly, no white flash on app open

ImprovedUpdated PWA manifest with new 192x192 and 512x512 icons

ImprovedService worker cache busted (v2) to ensure new logo propagates to all users

v3.29

March 5, 2026

Auto Deploy Verification & Owner Notification

After every publish, the server automatically runs the full 12-check Deploy Verification suite (database, APIs, version, changelog, LLM connectivity, etc.) and sends the results as a notification to the project owner. No manual steps needed — just publish and you'll be notified if anything broke.

NewAuto-run Deploy Verification on server startup in production — 12 checks run automatically 10 seconds after deploy

NewOwner notification with full pass/fail report including individual check results and durations

ImprovedFailed checks highlighted separately in notification for quick triage

ImprovedError handling: if verification itself crashes, owner is notified about the error

v3.28

March 5, 2026

CEO Revision Creates New Build Versions (V4, V5, etc.)

CEO revisions in Build mode now create new Build versions instead of overwriting the latest. Each revision saves as V4, V5, V6, etc. so users can compare all iterations side-by-side in the Build Versions viewer. The new version appears instantly in the left panel after the revision completes.

NewCEO revision in Build mode creates a new Build version (V4, V5, etc.) instead of overwriting the latest

NewNew Build version appears instantly in the left-panel Build Versions viewer after revision completes

ImprovedUsers can now compare all CEO revision iterations side-by-side via version buttons

ImprovedBuild version label shows 'CEO Revision' source to distinguish from original build cycles

v3.27

March 5, 2026

Shared Footer Component, Playwright E2E Smoke Tests & QA Automation

Extracted the 8-link footer into a shared SiteFooter component used by all 6 pages (Home, About, Examples, Instructions, Changelog, Builds, ComponentShowcase) — eliminates duplication and ensures consistent navigation. Created a comprehensive Playwright e2e smoke test suite (pre-deploy-smoke.spec.ts) with 22 automated tests covering footer links, version badges, mobile overflow, changelog integrity, about page model names, invite flow, and cooldown persistence. All tests run before every publish to catch regressions automatically.

ImprovedExtracted footer into shared SiteFooter component — all 7 pages now use a single source of truth for navigation links

NewPlaywright e2e pre-deploy smoke tests (22 tests) covering footer links, version badges, mobile overflow, changelog, and more

ImprovedSiteFooter supports 'dark' variant for pages with dark backgrounds (e.g., Examples page)

FixedComponentShowcase page now shows full navigation footer instead of minimal 'Shadcn/ui Component Showcase' text

v3.26

March 5, 2026

Perplexity QA Fixes: Cooldown Persistence, Run Badge, Footer & Share Toast

Addresses all findings from the Perplexity QA v3.22 test report. The 15-second abort cooldown now persists across page refreshes via localStorage, preventing bypass by reloading. The blue 'Run' badge now appears in logged-in user history (userHistory endpoint was missing runMode). Rate-limit errors from the server now auto-trigger a client-side cooldown countdown. The /builds page now includes the standard 8-link footer, and all copy-to-clipboard actions show a consistent toast notification.

FixedAbort cooldown now persists across page refreshes via localStorage — can't bypass by reloading

FixedBlue 'Run' badge now shows in logged-in user history (userHistory endpoint was missing runMode field)

FixedServer rate-limit errors now auto-trigger client-side cooldown countdown with extracted wait time

FixedDesktop Versions tab trigger now checks buildVersions in addition to cycleVersions

Improved/builds page now includes standard 8-link footer navigation

ImprovedAll copy-to-clipboard actions now show consistent toast notification ('Copied to clipboard!')

NewPre-deployment QA checklist (QA-CHECKLIST.md) with 40+ regression tests organized by category

v3.25

March 4, 2026

Mobile Versions Tab, Revision Timeout Fix & Time Remaining Estimate

Fixed the missing Versions tab on mobile Build mode — V1/V2/V3 website version buttons now appear on mobile just like desktop. Increased CEO revision timeout from 5 to 8 minutes and added per-operation timeouts (30s per search, 20s per image scrape) to prevent indefinite hangs. The modification button now shows an estimated time remaining during each phase.

FixedMobile Build mode now shows Versions tab with V1/V2/V3 website version buttons (was missing entirely)

FixedCEO revision timeout increased from 5 to 8 minutes to accommodate complex research requests

FixedPer-operation timeouts: 30s per search query, 20s per image scrape — prevents indefinite hangs

NewModification button shows estimated time remaining during each phase (Researching / Thinking / Streaming)

ImprovedTimed-out search queries and image scrapes are gracefully skipped instead of failing the entire revision

v3.24

March 4, 2026

CEO Revision Progress Tracking, Retry Buttons & Stale Connection UX

Major improvement to the CEO modification flow: the button now shows real-time progress during research, image scraping, and LLM generation phases with an elapsed time counter. If a revision fails, a Retry button appears inline. All three stale connection warning banners now include a Retry button for immediate manual recovery. Progress events are streamed from the server during the research phase so users see exactly what the CEO is doing.

ImprovedCEO revision button shows real-time status: 'Researching...', 'Scraping images...', 'CEO is thinking...', 'Streaming revision...' with elapsed time counter

ImprovedServer sends progress events during web research and image scraping phases — no more silent waiting

NewRetry button appears inline after a failed CEO revision — no need to re-type the modification

NewRetry button added to all stale connection warning banners (mobile + desktop) for immediate manual recovery

ImprovedProgress messages logged to activity log so users can see research/scraping steps in the Log tab

FixedSafety timeout now properly clears all progress state and shows retry option

v3.23

March 4, 2026

Abort Cooldown Enforcement, Rate Limit Error Handling & Run Badge Fix

Fixes three bugs found during QA testing: (1) After aborting a run, buttons now show a visible 15-second cooldown countdown preventing rapid re-runs. (2) 'Rate exceeded' plain-text errors from upstream proxies are now caught and shown as a friendly message instead of a JSON parse crash. (3) The blue 'Run' badge now correctly appears in the history sidebar for all Run-mode tasks.

FixedAbort cooldown enforced — buttons disabled for 15s with visible countdown after abort

Fixed'Rate exceeded' JSON parse error fixed — upstream plain-text errors now shown as friendly messages

FixedBlue 'Run' badge now shows in history sidebar (was checking wrong value 'run' instead of 'operator')

ImprovedCooldown countdown visible on both mobile and desktop Run/Build/Pro buttons

ImprovedToast notification after abort informs user of 15s wait

v3.22

March 4, 2026

Mobile Stall Watchdog, SSE Reconnection UX & Share Fixes

Critical fix for mobile execution stalls: added a client-side watchdog that detects when no server events arrive for 45+ seconds, shows a visible warning, and auto-aborts at 90 seconds with a retry option. SSE reconnection now shows toast notifications ('Reconnecting...' / 'Reconnected!'). Stale connection warning banner appears on both Task and Log tabs. Share button uses on-demand S3 upload so blob URLs are never shared.

NewMid-run stall watchdog — warns at 45s of no events, auto-aborts at 90s with retry option

NewStale connection warning banner on Task tab — amber bar shows 'No server updates for Xs' with WifiOff icon

NewSSE reconnection toasts — 'Connection interrupted. Reconnecting...' and 'Reconnected! Resuming updates.'

FixedShare button always uploads HTML to S3 on-demand — never copies blob: URLs

ImprovedStale warning visible on both mobile Task tab and desktop main content area

ImprovedAuto-abort message guides user to retry with same task

v3.21

March 3, 2026

On-Demand S3 Upload for Share & Run Mode Badge

Share and Open buttons now upload HTML to S3 on-demand via a new tRPC endpoint, ensuring shared links are always real CloudFront URLs. Added 'Run' mode badge (blue) to history sidebar so all three modes have visible badges.

Newproject.shareHtml tRPC endpoint — uploads HTML to S3 on-demand when user clicks Share or Open

New'Run' mode badge (blue) in history sidebar — matches Build (green) and Pro (violet) badges

FixedShare button always visible in HtmlPreview toolbar with loading spinner during upload

ImprovedOpen in new tab uses CloudFront URL instead of blob URL

v3.20

March 3, 2026

Share Link Blob URL Fix & Cooldown Reduction

Fixed all Open/Share buttons to use real CloudFront URLs when available instead of blob: URLs. Reduced abort cooldown from 60s to 15s for faster re-testing.

FixedAll Open/Share buttons now use CloudFront URLs when savedProject exists

FixedAbort cooldown reduced from 60s to 15s — allows quick re-testing after abort

v3.19

March 3, 2026

Cooldown Reduction & Planning Latency Investigation

Reduced run cooldown from 60 seconds to 15 seconds for faster iteration. Investigated 'stuck in planning' report — confirmed it was normal LLM latency with heartbeat working correctly.

FixedRun cooldown reduced from 60s to 15s across all references

ImprovedVerified heartbeat mechanism works during long planning phases

v3.18

March 3, 2026

Share Button Blob URL Fix

Fixed Share button to only appear when a real CloudFront URL exists. Open in new tab now uses the hosted URL instead of blob URL.

FixedShare button only shows when savedProject has a real CloudFront URL

FixedOpen in new tab uses CloudFront URL when available

FixedBuildVersionsViewer Share button confirmed using correct S3 htmlUrl

v3.17

March 3, 2026

Dedicated Share Button & Button Clarity

Added a dedicated Share button to Build Versions and HtmlPreview toolbar that copies the CloudFront website URL. Renamed 'Play' to 'Preview' and 'View & Play' to 'View Latest' for clarity.

NewDedicated Share button in Build Versions — copies CloudFront URL to clipboard

NewShare button in HtmlPreview toolbar when saved project exists

ImprovedRenamed 'Play' to 'Preview' for clarity

ImprovedRenamed 'View & Play' to 'View Latest'

v3.16

March 3, 2026

Image Scraping for CEO Revisions & Scroll Bug Fix

CEO revision flow can now scrape real images from websites mentioned in modification instructions. When users say 'add pictures from example.com', the system fetches actual image URLs from the HTML instead of guessing. Defensive CSS fixes applied to all marketing pages to prevent white-page scroll issues across browsers.

NewfetchWebpageImages() scrapes real image URLs from websites mentioned in CEO revision instructions

NewImage scraping integrated into both modifyDraft and modifyDraftStream flows

FixedDefensive CSS fixes on all marketing pages — explicit background-color and color fallbacks

FixedAbout page now shows version badge and model names (ChatGPT, Built-in LLM, Gemini)

FixedAll page footers now include complete 8-link navigation

FixedChangelog footer updated with full navigation links

ImprovedImage URL verification via HEAD requests filters broken/tiny/icon images

ImprovedData URI filtering prevents base64-encoded images from being extracted

v3.15

March 2, 2026

Mode Suggestion Toast, Heartbeat & Public Changelog

Mode suggestion toast appears when a task would work better in a different mode. Heartbeat entries show progress during long operations. Changelog page made public. Footer navigation updated on most pages. Pricing chips visible below status bar.

NewMode suggestion toast — appears ~20-30s into a run if a different mode would be better

NewHeartbeat entries in Log tab — 'Still working' messages every ~30 seconds

NewChangelog page now publicly accessible (was owner-only)

NewPricing chips visible below status bar: Run $0.10 / Build $0.60 / Pro $6.10

FixedFooter navigation updated to 7 links on most marketing pages

ImprovedMode suggestion threshold lowered for better detection

v3.14

March 2, 2026

CEO Revision Web Research & Build Timeout Increase

CEO revision flow now does live web research when users mention URLs or ask to improve content. Build timeout increased to 20 minutes with automatic retry. Better error messages for skipped cycles.

NewCEO revision triggers web research when URLs detected in modification instructions

NewextractResearchNeeds() analyzes modification text for URLs and generates search queries

FixedBuild timeout increased from 10 to 20 minutes with automatic retry on timeout

FixedBetter error messages when cycles are skipped due to budget constraints

ImprovedModification instructions parsed for domain-specific search queries

v3.13

March 2, 2026

Streaming CEO Revisions & Activity Log Improvements

CEO revision process now streams changes in real-time instead of waiting for the full response. Activity log entries improved with better formatting and timestamps.

NewCEO revisions stream in real-time — watch changes appear as they're generated

NewRevision counter shows number of CEO modifications applied

ImprovedActivity log entries have better formatting with actor badges

ImprovedModification request entries show in cyan for visual distinction

v3.12

March 2, 2026

CEO Modification Flow & Send to CEO Button

New 'Send to CEO for Revision' button lets users request modifications to completed outputs. CEO analyzes the request and applies targeted changes to the existing draft.

New'Send to CEO for Revision' button on completed outputs

NewCEO analyzes modification requests and applies targeted changes

NewModification log entries track all revision requests

ImprovedCEO modifies existing draft instead of rewriting from scratch

v3.11

March 2, 2026

Quick Test, Automated Verification & Revision Upgrades

Major admin tooling update: Quick Test button runs lightweight smoke tests ($0.15-0.30) to verify the full Run + Build pipeline without external agents. All 12 manual deploy checklist items are now automated server-side checks. Revision process upgraded to modify existing drafts instead of rewriting from scratch. Credit dashboard crash fixed.

NewQuick Test admin button — runs programmatic Run + Build smoke test from /admin/health

NewAutomated Deploy Verification — all 12 manual checklist items now run as server-side tests

NewUser modification log entries — 'User requested modification' appears in activity log with cyan styling

FixedCredit Dashboard crash fixed — TypeError on toFixed when budget fields were undefined

FixedRevision process now upgrades existing draft instead of rewriting from scratch

FixedCEO prompt reduced over-asking — proceeds with reasonable assumptions instead of excessive clarification

ImprovedDeploy checklist page redesigned with automated verification results and timing

ImprovedAll .toFixed() calls wrapped with null safety guards across Credit Dashboard

v2.47

February 18, 2026

Stale Run Recovery & Deployment Resilience

Fixed the 'Unable to start run' error that occurred after server deployments. When a run was interrupted by a deployment (stuck in PLAN/EXECUTE state but no longer active in memory), the system now automatically recovers it by restarting with the stored task brief. Users see a seamless restart instead of an error message.

FixedFixed 'Unable to start run' error after server deployments — stale runs now auto-recover

NewStale run detection: identifies runs in non-terminal DB state but not active in memory

NewAutomatic recovery: restarts stale runs with stored task brief from database

ImprovedClear error messages when recovery isn't possible ('interrupted by server update')

ImprovedDiagnostic logging for stale run recovery to aid debugging

v2.46

February 18, 2026

Connection Health Indicator & Retry for Stuck Steps

Added a real-time 'Last update: Xs ago' indicator with color-coded connection health (green/yellow/red). When no server event is received for 90+ seconds, a 'Retry Step' button appears to abort the stuck step and let users re-submit. Visible on both desktop and mobile, on all tabs.

New'Last update: Xs ago' indicator shows time since last server event with Wifi/WifiOff icon

NewColor-coded: green (<30s), yellow (30-59s), red (60s+) for instant connection health

New'Retry Step' button appears after 90s of no response — aborts stuck step with toast guidance

ImprovedIndicator visible on both desktop sidebar and mobile sticky progress bar

ImprovedAll 14 SSE event handlers now track last event time for accurate monitoring

v2.45

February 18, 2026

Heartbeat Progress Updates During Long LLM Calls

Added heartbeat progress updates every 15 seconds during all long-running LLM calls. Users now see live status messages like 'CEO is analyzing the task (30s)...' instead of a frozen screen. Applied to all 7 LLM call points: CEO planning, Operator drafting, Reviewer checking, Replanning, Improvement cycles, and Finalization.

NewHeartbeat wrapper sends status updates every 15 seconds during long LLM calls

NewAnimated dots ('.', '..', '...') cycle to show the system is alive

NewProgress bar increments by 2% per heartbeat to show forward movement

ImprovedApplied to all 7 LLM call points: CEO, Operator, Reviewer, Replan, Improvement, Finalize

ImprovedCombined with v2.44 client-side timer for fully responsive progress tracking

v2.44

February 18, 2026

Mobile UX: Live Timer, Progress on All Tabs, Bigger Fonts

Fixed the frozen elapsed timer so it counts up every second (not just on server events). Added a sticky progress bar visible on Plan and Outputs tabs. Increased font sizes for credit balance and settings on mobile. Hidden confusing dual ETAs during active runs.

FixedElapsed timer now ticks every second via client-side interval, independent of SSE updates

FixedSticky progress bar (phase stepper + bar + timer) visible on ALL mobile tabs, not just Task

FixedCredit balance and settings text increased from 9-10px to 11-12px for mobile readability

FixedConfusing 'ETA: ~2 min / ~5 min' hidden during active runs — relabeled as 'Avg Run / Avg Build' when idle

ImprovedProgress bar height increased from 1px to 1.5px for better visibility

v2.43

February 18, 2026

LLM Retry Logic & Invite Code Fix

Added retry logic with exponential backoff to all three LLM clients (Built-in, ChatGPT, Gemini) to handle transient upstream errors. Fixed the 'Change Invite Code' button so it properly clears the saved session and navigates to the invite page.

FixedBuilt-in LLM (Operator) now retries up to 3 times with 2s/5s/10s backoff on 500/502/503/504 errors

FixedChatGPT (CEO) now retries up to 3 times for both regular and streaming calls

Fixed'Change Invite Code' button now clears localStorage saved code, preventing auto-redirect back to app

FixedSidebar sheet closes before logout mutation fires for smoother UX

ImprovedE2E tested: Full Run+Build pipeline completed successfully (to-do list app, 5 versions, 15 sources)

v2.42

February 17, 2026

Run Duration Tracking & Completion Notifications

Accurate run duration tracking with completedAt timestamps. Notification sound and vibration on mobile when runs complete so you don't have to watch the screen.

NewcompletedAt timestamp stored in database for accurate run duration tracking

NewTwo-tone completion chime (A5 + E6) plays when a run finishes

NewVibration pattern (100ms-50ms-100ms) on mobile when run completes

ImprovedETA calculation now uses completedAt for accuracy, with runLog fallback for older runs

v2.41

February 17, 2026

Pre-Deploy Checklist & ETA

New admin deploy checklist page with automated health checks and manual verification items. Estimated completion time shown near Run buttons based on historical run durations.

NewPre-deployment checklist at /admin/deploy with auto-run health checks (DB, APIs, search, version)

New12 manual verification items for deployment readiness (mobile, desktop, credit tracking, etc.)

NewEstimated completion time (ETA) shown near Run buttons based on historical run durations

ImprovedRun stats endpoint calculates average duration from completed runs

Fixed'Awaiting Approval' status no longer gets stuck after auto-approve — transitions directly to Executing

v2.40

February 17, 2026

Run History Mobile UX

Shows dollar amount alongside remaining runs in the Run History sidebar. Trash/delete icon is now always visible on mobile instead of requiring hover.

ImprovedRemaining section now shows dollar amount with color-coded warning (red <$3, yellow <$10)

FixedTrash icon always visible on mobile (was hover-only, invisible on touch devices)

v2.39

February 17, 2026

Build Quality & Credit UX Overhaul

Major fix: CEO and Operator now produce working HTML code instead of design documents for build tasks. Added low balance warnings, per-run cost tracking in history, and auto-refreshing credit balance.

FixedCEO no longer recommends Unity/Godot — enforces HTML/Canvas/JS single-file output for all build tasks

FixedImprovement cycles now improve the actual CODE, not polish a design document

FixedReviewer now auto-rejects design documents when user asked for a working product

NewLow balance warning toast: blocks Run+Build if <$3.50, warns if <$7, blocks if 0 runs left

NewPer-run cost breakdown in run history with Build badge and total cost

ImprovedCredit balance auto-refreshes after run completes, errors, or aborts

v2.38

February 17, 2026

Invite Code Cleanup & Credit Balance Display

Cleaned up old invite codes (BETA/GAMMA/DELTA/EPSILON/ZETA) to shorter names. Enhanced credit balance display next to Run buttons with visual progress bar, color-coded warnings, and estimated cost per run type.

ImprovedRenamed 8 invite codes to shorter names (BETA01, GAMMA1, DELTA1, EPSLN1, ZETA01, PRO001, TEAM01, VIP001) — all sessions and runs migrated

ImprovedDeactivated 9 old invite codes and QC test code

NewEnhanced credit balance display with visual progress bar showing budget usage

NewColor-coded budget warnings: green (healthy), yellow (<$10), red (<$3)

NewEstimated cost shown next to Run buttons: Run ~$0.10, Run + Build ~$6.10

v2.37

February 17, 2026

Mobile Reliability Fix — Task Brief Persistence

Fixed 'Invalid run state or missing task brief' error on mobile devices (Samsung S24 Plus and similar). Task briefs are now persisted to the database immediately when a run starts, eliminating race conditions between the run mutation and SSE connection on slow mobile networks.

FixedFixed mobile error: 'Invalid run state or missing task brief' — task brief now stored in DB as ultimate fallback

FixedIncreased SSE retry timeout from 3×500ms to 5×1000ms for slow mobile connections

ImprovedThree-tier task brief resolution: in-memory map → URL parameter → database row

ImprovedBetter error diagnostics: SSE handler now logs detailed debug info when falling through to error state

ImprovedUser-friendly error message: 'Unable to start run. Please try again' instead of cryptic 'Invalid run state'

v2.36

February 17, 2026

Gemini Fallback, Operator Code Output & Builder Auto-Trigger

Gemini API now retries with fallback models when overloaded. Operator prompt rewritten to always produce working code (games, apps, tools) instead of design documents. Builder auto-triggers immediately when 'Run + Build' completes — no manual click needed.

FixedGemini API retry logic with fallback model (gemini-2.0-flash) — no more 'high demand' errors killing the Reviewer

FixedOperator now produces working HTML/JS code for build tasks instead of design documents — games are playable, apps are functional

NewBuilder auto-trigger — when using 'Run + Build', the Builder agent starts automatically after the Operator finishes

ImprovedOperator prompt strengthened with explicit anti-patterns: no 'Game Design Documents', no 'Technical Specifications' — only working code

ImprovedFor games: Operator now uses Canvas API / pure JS instead of external frameworks (Phaser, Unity) to keep everything in one HTML file

ImprovedCEO search aggressiveness increased — now searches by default for almost all tasks

v2.35

February 17, 2026

Owner-Only Changelog, Instructions Rewrite & Code Update

Changelog page is now restricted to the project owner only (verified by identity, not invite code). Instructions page completely rewritten to reflect two run modes, credit system, and current features. Invite code simplified for easier entry.

NewChangelog page now owner-only — verified by OWNER_OPEN_ID, not visible to regular users

NewNew auth.isOwner tRPC endpoint for frontend owner identity checks

ImprovedInstructions page completely rewritten — now covers two run modes (Operator vs Builder), credit system, key features, and updated quick reference table

ImprovedRemoved all invite code references from changelog text — no codes exposed in UI

ImprovedChangelog navigation links removed from public pages — only accessible via direct URL by owner

ImprovedOwner invite code simplified for easier entry

v2.34

February 17, 2026

Credit System, Two Run Modes & Branding Update

Major update introducing a credit system with per-invite budgets, two run modes (Operator for fast AI runs, Builder for full website builds), admin credit dashboard, and complete removal of third-party branding from all user-facing text.

NewTwo run buttons: 'Run' (Operator mode) for fast AI execution, 'Run + Build' (Builder mode) for full website builds

NewCredit system with per-invite dollar budgets and run limits — budget enforcement blocks runs when exhausted

NewAdmin credit dashboard at /admin/credits — shows per-invite budget usage, total spend, and builder cost estimates

NewBuilder cost estimation at ~$6/build with calibration tracking for accuracy over time

NewCredit balance display in invite status — users see remaining budget and runs

ImprovedRemoved all third-party branding from user-facing text — login, activity logs, changelog, and dialogs

ImprovedRenamed 'manus' router to 'builder' — cleaner API naming throughout

ImprovedOperator model label changed from 'Manus LLM' to 'Built-in LLM' in activity logs

ImprovedOwner invite code set to $999 budget / 999 runs; other codes set to $25 / 99 runs

ImprovedCEO prompt updated to search by default — much better results for creative and product tasks

ImprovedSearch API (Tavily) health check added to admin health dashboard

New40+ new tests covering credit system, branding removal, run modes, and builder router

v2.32

February 17, 2026

Admin Health Dashboard, E2E Browser Tests & Post-Publish Verification

Added an admin-only health dashboard at /admin/health with live status indicators for database, API keys, environment, and system health. Set up Playwright E2E browser tests (25 tests) covering all critical user flows. Created a post-publish auto-verification suite (23 tests) that can run against production after every deploy.

NewAdmin health dashboard (/admin/health) — live status indicators with green/yellow/red for all system components

NewPlaywright E2E browser tests (25 tests) — homepage, navigation, all public pages, invite flow, auth, changelog, responsive design

NewPost-publish verification suite (23 tests) — site availability, API health, page rendering, auth flow, changelog integrity, performance

ImprovedAdmin dashboard protected by admin secret with localStorage persistence and auto-refresh

ImprovedNew npm scripts: test:e2e, verify:local, verify:prod for running different test suites

v2.31

February 17, 2026

Deep Testing, Health Check & Regression Guards

Added comprehensive test suites covering the full run lifecycle, post-deployment verification, and regression guards for every version since v2.26. New /api/trpc/version.health endpoint provides real-time system health monitoring with database, API key, and environment checks.

NewHealth check endpoint (version.health) — monitors database, API keys, environment, system status

NewRun lifecycle test suite (55 tests) — covers config validation, state machine, timeout, model detection, plan steps

NewPost-deployment verification suite (41 tests) — API contracts, regression guards, security checks, UI data contracts

ImprovedRegression guards for every version since v2.26 — GPT-5.2 fix, plan step cap, timeout handling, progress phases

ImprovedTotal test count increased from 246 to 340+ covering all critical paths

v2.30

February 16, 2026

Time Management: Countdown, Configurable Timeout & Estimated Completion

Added real-time elapsed/remaining time countdown to the progress bar, configurable per-run timeout (via Max Time setting), and estimated completion time displayed in the Cost Estimate card based on token complexity.

NewLive elapsed time countdown (MM:SS) shown in progress bar during runs

NewTime limit displayed as elapsed/timeout (e.g., 2:30/10:00) so users see remaining time

NewEstimated completion time (~1-2, 2-4, or 4-8 min) shown in Cost Estimate card based on token complexity

ImprovedTimeout is now configurable per-run via Max Time setting and applied to the hard timeout

ImprovedProgress stats row always visible during runs (searches, replans, estimated time left)

v2.29

February 16, 2026

Increased Hard Timeout to 10 Minutes

Doubled the hard timeout from 5 minutes to 10 minutes, giving complex tasks with multiple improvement cycles enough time to complete without hitting the time limit.

ImprovedHard timeout increased from 5 minutes (300s) to 10 minutes (600s)

ImprovedComplex tasks with 4 improvement cycles now have sufficient time to complete

v2.28

February 16, 2026

Graceful Timeout Handling for Improvement Cycles

Fixed a critical issue where runs that completed successfully (reviewer-approved) would show as ERROR when improvement cycles couldn't start due to time limits. Now, if time runs out during improvements, remaining cycles are gracefully skipped and the reviewer-approved output is saved. Finalization also gracefully handles timeouts by using the draft directly.

FixedRuns no longer show ERROR when improvement cycles are skipped due to time limit

FixedTimeout during finalization now saves the reviewer-approved draft instead of erroring

ImprovedTimeout with existing output now completes as END state, not ERROR

ImprovedClear log messages explain which improvement cycles were skipped and why

ImprovedImprovement completion counter accurately tracks partial progress

v2.27

February 16, 2026

QC Smoke Tests, Plan Step Badge & Progress Improvements

Added a comprehensive pre-deployment QC smoke test suite with 41 automated checks covering model selection, run creation, progress tracking, plan steps, version consistency, SSE events, and more. Plan step count is now shown as a badge next to the Execution Plan header. Progress display improved with replanning and improving phases.

NewPre-deployment QC smoke test suite — 41 automated checks across 10 categories

NewPlan step count badge shown next to Execution Plan header in both mobile and desktop

ImprovedProgress bar now shows 'Replanning' and 'Improving' phases with live updates

ImprovedCEO plans soft-capped at 3-7 steps to prevent Operator context bloat

ImprovedPlans over 7 steps are automatically truncated before passing to Operator

v2.26

February 16, 2026

GPT-5.2 API Compatibility Fix

Fixed a critical API error where GPT-5.2 was rejecting the max_tokens parameter. All GPT-5 family models (5, 5-mini, 5-nano, 5.2, 5.2-pro) now correctly use max_completion_tokens as required by OpenAI's API. Legacy models (GPT-4o) still use max_tokens.

FixedGPT-5.2 no longer throws 'Unsupported parameter: max_tokens' error

FixedAll GPT-5 family models now use max_completion_tokens instead of max_tokens

ImprovedNew requiresMaxCompletionTokens() helper for clean model detection

ImprovedBoth standard and streaming ChatGPT calls updated for compatibility

v2.25

February 16, 2026

PWA Popup Fix + QC Testing

Fixed the browser's automatic PWA install popup that was appearing on every page. The install prompt is now fully suppressed — users can only install the app by tapping the link at the bottom of the About page. No more intrusive banners.

FixedSuppressed browser's native PWA install popup globally — no more banner on homepage

ImprovedInstall link remains only at the bottom of the About page, user-initiated

ImprovedComprehensive QC testing checklist created for all 16 feature areas

v2.24

February 16, 2026

PWA Install — Add to Home Screen

AI Company can now be installed as a Progressive Web App (PWA) on your phone or desktop. Visit the About page and tap 'Install AI Company as an App' at the bottom. Works offline with service worker caching. No app store needed.

NewPWA manifest with app name, icons, and theme color for home screen installation

NewService worker for offline caching of static pages

NewNon-intrusive install link at bottom of About page — no popup, user-initiated only

ImprovedApple touch icon and meta tags for iOS home screen support

ImprovedShows 'App installed' confirmation after successful installation

v2.23

February 16, 2026

Auto-Migrate Existing User Model Defaults

Existing users who had old/retired models cached in localStorage (like GPT-4o or Gemini 2.0 Flash) are now automatically upgraded to the new defaults: GPT-5.2 for CEO and Gemini 3 Pro Preview for Reviewer. No manual action needed — the migration happens silently on page load.

FixedAuto-migrate cached CEO model from GPT-4o/GPT-4o-mini/o1/o3-mini to GPT-5.2

FixedAuto-migrate cached Reviewer model from Gemini 2.0 Flash/2.5 Flash/2.0 Pro to Gemini 3 Pro Preview

ImprovedMigration is silent and automatic — existing users see updated defaults immediately

ImprovedFuture model retirements can be handled by adding to the migration list

v2.22

February 16, 2026

Builds Gallery + Access Code Protection

New public Builds Gallery page at /builds showing all Builder builds with status, progress, and credit usage. Each build is protected by a unique 8-character access code — you need the code to view details, URLs, and results. The Examples page remains fully public. Access codes are shown to the build creator after triggering a build, and can be shared with others for controlled access.

NewNew /builds page — public gallery showing all Builder builds with status and progress

NewAccess code protection — each build gets a unique 8-character code required to view details

NewAccess code displayed to build creator in the Build component

NewPublic gallery shows summary only (title, status, date) — no URLs or details without code

NewStats dashboard showing total builds, completed, in-progress, and credits used

ImprovedExamples page (/examples) remains fully public — no code needed

ImprovedBuilds link added to homepage navigation (desktop + mobile)

ImprovedBuild title auto-extracted from CEO output prompt

v2.21

February 16, 2026

Builder Agent — CEO Output → Real Websites

The CEO's output can now be sent directly to the Builder agent for real-world execution. After a run completes, use 'Run + Build' mode to have the Builder actually build what the CEO planned — websites, code, documents, anything. Real-time progress tracking with polling, result links, and shareable URLs. This is the first step toward the full AI Council vision.

New'Run + Build' mode triggers Builder agent after every completed run

NewBuilder API integration — creates full agent tasks from CEO output

NewReal-time build status tracking with 5-second polling

NewView Build link and shareable URL for completed builds

NewResult viewer shows Builder output, attachments, and credit usage

NewRetry button for failed builds

ImprovedBuild status persisted in database — survives page refresh

ImprovedNormalized Builder API response handling (id vs task_id)

v2.20

February 15, 2026

Default Model Upgrade — GPT-5.2 & Gemini 3 Pro

Upgraded default AI models to the latest and most capable versions. CEO agent now defaults to GPT-5.2 (OpenAI's best model for complex tasks) instead of GPT-5 Mini. Reviewer agent now defaults to Gemini 3 Pro instead of Gemini 2.5 Flash. Fixed model detection logic so GPT-5 base is correctly treated as a reasoning model, while GPT-5.2 uses standard API parameters. Also fixed 3 hardcoded fallback model references that were still pointing to retired GPT-4o.

ImprovedCEO default model upgraded from GPT-5 Mini to GPT-5.2 — OpenAI's most capable standard model

ImprovedReviewer default model upgraded from Gemini 2.5 Flash to Gemini 3 Pro — Google's most intelligent model

FixedFixed 3 hardcoded gpt-4o fallbacks in agents.ts — now correctly fall back to gpt-5.2

FixedFixed Reviewer fallback from gemini-2.0-pro to gemini-3-pro-preview

FixedGPT-5 base now correctly detected as a reasoning model (uses developer role, no temperature)

FixedGPT-5.2 correctly uses standard API parameters (system role, temperature, max_tokens)

NewAdded GPT-5 (Reasoning) to the CEO model selector dropdown

ImprovedGPT-4o marked as Retired (Feb 13, 2026) in model selector

v2.19

February 15, 2026

Mobile UX Improvements

Fixed mobile readability issues on Samsung Galaxy S24+ and similar devices. Log entry text is no longer truncated — full descriptions are visible without needing to expand. Transcript sections now use larger font sizes with better line spacing for comfortable reading on mobile screens.

FixedLog entry text no longer truncated on mobile — full descriptions visible

FixedTranscript prompt/response sections use larger, more readable font (text-xs instead of text-[9px])

ImprovedBetter line spacing (leading-relaxed) in transcript sections for mobile readability

ImprovedIncreased max-height for transcript sections so more content is visible before scrolling

v2.18

February 10, 2026

GPT-5 Temperature Fix

Fixed API error where GPT-5 models rejected custom temperature values. GPT-5 Mini and other GPT-5 models only support the default temperature (1), so the temperature parameter is now omitted for all GPT-5+ models. Legacy models like GPT-4o still support custom temperature.

FixedGPT-5 models no longer send temperature parameter (only default value 1 is supported)

FixedStreaming calls also omit temperature for GPT-5+ models

v2.17

February 10, 2026

GPT-5 API Compatibility Fix

Fixed a critical API compatibility issue where GPT-5 models rejected the legacy 'max_tokens' parameter. GPT-5 and newer models now correctly use 'max_completion_tokens' while legacy models (GPT-4o) continue using 'max_tokens'. The fix applies to both standard and streaming API calls.

FixedGPT-5 models now use max_completion_tokens instead of max_tokens (API requirement)

FixedStreaming calls also updated to use correct token parameter per model

ImprovedAutomatic detection of model generation for correct API parameter selection

v2.16

February 10, 2026

Smarter AI Models — GPT-5 & Gemini 3

Upgraded to the latest AI models for dramatically better output quality. CEO agent now defaults to GPT-5 Mini with GPT-5.2 and GPT-5.2 Pro available for complex tasks. Reviewer agent defaults to Gemini 2.5 Flash with Gemini 3 Pro Preview available for the deepest analysis. All models are selectable per-run in the Run Settings dropdown. Legacy models (GPT-4o, Gemini 2.0 Flash) still available but marked as deprecated.

NewCEO model selector — choose from GPT-5 Nano, GPT-5 Mini, GPT-5.2, GPT-5.2 Pro, or o3-mini per run

NewReviewer model selector — choose from Gemini 2.5 Flash-Lite, 2.5 Flash, 2.5 Pro, 3 Flash, or 3 Pro per run

ImprovedDefault CEO model upgraded from GPT-4o to GPT-5 Mini for better reasoning

ImprovedDefault Reviewer model upgraded from Gemini 2.0 Flash to Gemini 2.5 Flash

ImprovedModels configurable via CHATGPT_MODEL and GEMINI_MODEL env vars

FixedRemoved non-existent model IDs (gemini-2.0-pro) that caused API errors

v2.15

February 9, 2026

Agent Transcripts, Progress Stepper & Bug Fixes

Full transparency into what each AI agent said and did. Every log entry now has an expandable 'Agent Transcript' showing the actual prompt sent and response received, with model name and token estimates. The progress bar is replaced with a visual phase stepper showing Planning → Executing → Reviewing → Complete with checkmarks. Five Tier 1 bugs fixed including long brief validation, CEO prompt disambiguation, clarification handling, SSE race condition, and data retention cleanup.

NewAgent Transcript view — click to expand and see exact prompts/responses for CEO, Operator, and Reviewer

NewVisual phase stepper with checkmarks: Planning → Executing → Reviewing → Complete

NewData retention cleanup endpoint — admin can purge old runs via maintenance.cleanup

NewCEO clarification mechanism — ambiguous tasks are flagged with a question in the log

ImprovedProgress bar now shows phase-specific icons and animated pulse indicator for current phase

ImprovedCycle counter and stats (searches, replans) displayed with icons

ImprovedCEO prompt strengthened to better distinguish 'build' vs 'display' tasks

FixedTask brief validation increased to 50K chars to match SSE sanitizer

FixedSSE pendingRunBriefs race condition fixed with retry logic

v2.14

February 8, 2026

Reviewer Details, Improvement Cycles & History Fixes

The Reviewer's activity log now shows detailed findings — every issue, veto reason, and suggestion is visible instead of just a summary count. A new 'Post-Complete Improvements' setting (default 3) makes the CEO do additional research and improvement cycles even after the Reviewer approves, ensuring higher quality output. History sidebar no longer shows duplicate or non-clickable entries.

NewReviewer log entries now show full details — issues, veto reasons, and suggestions are all visible in the activity log

NewPost-Complete Improvements setting (0-10, default 3) — CEO does extra improvement cycles after Reviewer approves

NewImprovement agent researches how to enhance the output and applies changes automatically

FixedHistory sidebar no longer shows duplicate entries for the same task

FixedHistory items that were not clickable (IDLE runs) are now filtered out

ImprovedToast notifications when loading runs from history (success/error feedback)

v2.13

February 7, 2026

Execution Progress & What's Next

All output tabs now show live progress during execution instead of blank placeholders. The Final tab displays which agent is currently working (CEO planning, Operator building, Reviewer checking) with recent activity feed. Draft and Review tabs show contextual loading states. After run completion, a 'What's Next' section suggests modifications, new tasks, or export options.

NewLive execution progress on Final tab — shows current agent, cycle count, status, and recent activity feed

NewDraft tab shows 'CEO is drafting the plan...' spinner during planning phase

NewReview tab shows 'Reviewer is analyzing...' spinner during review phase

NewWhat's Next section after run completion — suggests modifications, new tasks, and export options

ImprovedEmpty states now show 'Click Run to start' hint when idle

ImprovedDesktop progress shows last 5 activity entries, mobile shows last 3

v2.12

February 7, 2026

Streaming Revisions & Extreme QC

CEO revisions now stream in real-time instead of waiting for the full response — you see the output being written word by word with a typing cursor. This also eliminates mobile timeout issues on slow connections. Full extreme quality control pass across all pages, APIs, and features.

NewStreaming revisions — CEO revision output now streams in real-time with a typing cursor animation

NewNew SSE endpoint /api/run/modify-stream for real-time revision delivery

ImprovedNo more 'Failed to fetch' on mobile — streaming eliminates timeout issues on slow connections

ImprovedRevision button shows 'Streaming...' indicator while CEO is writing

FixedFull extreme QC pass: all 7 pages, 6 API endpoints, 5 output tabs, mobile layout verified

v2.11

February 7, 2026

Smarter CEO — Build vs Display Intelligence

Major upgrade to CEO agent intelligence. The CEO now correctly distinguishes between 'build this' and 'display this' instructions — pasting game dev instructions will build the actual game, not a documentation page. Added clarification mechanism for ambiguous tasks. Improved log visibility with AI model labels. Better error handling for revision requests.

NewCEO now distinguishes 'build this' vs 'display this' — pasting game instructions builds the actual playable game, not a docs page

NewCEO asks clarifying questions when tasks are genuinely ambiguous instead of guessing wrong

NewActivity log now shows AI model labels — CEO (ChatGPT), Reviewer (Gemini), Operator (Built-in LLM)

ImprovedInput sanitizer limit increased from 5K to 20K characters to match the task brief validation limit

ImprovedWebsite detection expanded to recognize games, interactive apps, calculators, quizzes, and more

Fixed'Failed to fetch' on revision requests now shows a friendly network timeout message with retry guidance

v2.10

February 7, 2026

Unlimited Runs — Daily Limit Removed

Removed the 10 runs/day per-invite limit and the 30 runs/day per-IP limit. Users can now run unlimited tasks with only a 60-second cooldown between runs. Cleaned up unused rate limit constants.

NewUnlimited runs — removed the 10 runs/day daily limit per invite code

NewRemoved IP-based daily limit (was 30 runs/day per IP address)

Improved60-second cooldown between runs preserved to prevent accidental double-runs

ImprovedCleaned up unused RATE_LIMIT_RUNS_PER_DAY and RATE_LIMIT_IP_RUNS_PER_DAY constants

FixedUpdated Home page and Terms of Service to remove daily limit references

v2.9

February 7, 2026

Quality Check & Mobile Error Visibility

Full quality check across all pages, APIs, and mobile layouts. Error messages now display on all mobile tabs (Task, Plan, Outputs) instead of only the Task tab. Auto-switches to Task tab when an error occurs. Fixed all changelog dates from 2025 to 2026.

FixedMobile error visibility — errors now show on all three mobile tabs (Task, Plan, Outputs) instead of only the Task tab

FixedAuto-switch to Task tab on error — mobile users immediately see the error message instead of a blank Plan or Outputs tab

FixedChangelog dates corrected from 2025 to 2026 across all version entries

ImprovedFull QC pass — all 7 pages verified (Homepage, About, Examples, Instructions, Changelog, App, Projects)

ImprovedAll API endpoints verified (version.current, auth.me, project.listPublic, SSE stream)

Improved172 automated tests passing across 16 test files with 0 failures

v2.8

February 7, 2026

Friendly Error Messages & 20K Character Limit

Error messages are now human-readable instead of showing raw JSON. Task brief character limit increased from 5,000 to 20,000 characters. Validation errors display clear, actionable messages like 'Task brief is too long (max 20,000 characters)' instead of cryptic Zod error objects.

FixedError messages now display user-friendly text instead of raw JSON — e.g., 'Task brief is too long (max 20,000 characters)' instead of [{"origin":"string","code":"too_big"...}]

FixedtRPC error parser handles all error shapes: Zod validation arrays, nested tRPC errors, string messages, and unknown formats

ImprovedTask brief character limit increased from 5,000 to 20,000 characters for detailed prompts

ImprovedError helper function parseTrpcError() provides consistent error formatting across the entire app

ImprovedMobile error display properly wraps long error messages without overflow

v2.7

February 7, 2026

Mobile-First Headers & Version Visibility

All pages now have fully responsive headers that work on Samsung S24+ and other mobile devices. Version badge is always visible on every screen. Home page has a hamburger menu on mobile. Navigation buttons are compact on small screens with abbreviated labels.

ImprovedHome page: hamburger menu on mobile replaces horizontal nav that was overflowing off-screen

ImprovedVersion badge (v2.7) now always visible on all pages including mobile — uses smaller font (10px) and flex-shrink-0 to prevent squeezing

ImprovedInstructions, Changelog, Examples, Projects pages: responsive headers with abbreviated button labels on mobile ('Back' instead of 'Back to Home', 'Start' instead of 'Start Using AI Company')

ImprovedAll header layouts use gap-2 and min-w-0 constraints to prevent overflow on narrow screens

v2.6

February 6, 2026

Try This, History Search, Cost Tracking & Mobile Fixes

Three new features plus critical mobile fixes. 'Try This' buttons on instruction examples auto-fill the app with pre-configured tasks. Run history now has search and status filtering. Activity Log shows per-agent cost estimates with a running total breakdown. Mobile session restore no longer gets stuck, and viewport overflow is fixed.

NewTry This button on each instruction example — one click auto-fills the task brief, budget, cycles, and settings in the app

NewRun history search & filter — search by task brief text, filter by status (Complete, Error, Running, Aborted), or combine both

NewCost tracking per agent — each Activity Log entry shows its estimated cost, with a running total and per-agent breakdown (CEO, Operator, Reviewer, System)

FixedMobile: 'Restoring your session...' no longer gets stuck forever — 8-second timeout with manual retry option

FixedMobile: Content no longer cut off on the left side — fixed viewport overflow with proper CSS constraints

ImprovedDesktop Activity Log shows color-coded per-agent cost breakdown (CEO=blue, Operator=green, Reviewer=purple)

v2.5

February 6, 2026

Enhanced Instructions & Ready-to-Use Commands

Completely revamped instructions page with 4 detailed, copy-paste-ready example prompts covering deep research, brainstorming, website creation, and competitive analysis. Each example includes full task description, recommended settings with dollar budgets, and explanation of what each AI agent does. Added new Website Template, Pro Tips section, and Budget column to the Quick Reference table.

New4 complete ready-to-use command examples — Deep Research Report, Brainstorm & Best Answer, Website Creation, Competitive Analysis — each with full multi-paragraph prompts ready to copy-paste

NewPro Tips section — guidance on using Request Modifications, Activity Log, budget controls, and file attachments

NewWebsite Template added to Task Templates section

ImprovedQuick Reference table now includes Budget column with dollar ranges for each task type

ImprovedEach example includes 'What happens' explanation showing how CEO, Operator, and Reviewer collaborate

ImprovedRecommended settings now include dollar budget and time estimates alongside cycles/searches/replans

v2.4

February 6, 2026

Activity Log, HTML Preview & Post-Run Summary

New Activity Log tab shows every agent action in real-time — see what the CEO planned, what the Operator searched and drafted, and what the Reviewer decided. Draft tab now renders HTML websites as live previews instead of raw code. Post-run summary card shows completion stats with quick navigation links.

NewActivity Log tab — real-time feed of all agent actions with role-colored entries, timestamps, action badges, and search queries

NewPost-run summary card — shows action count, cycle count, sources gathered, reviews done, and quick links to Log/Output/Sources

FixedDraft tab now renders HTML content as live website preview (iframe) instead of showing raw HTML code

FixedOutput tabs (Draft, Review, Final, Evidence, Log) now always visible — no longer disappear after run completion

ImprovedActivity log loads from history — view the full log of any past run, not just live runs

ImprovedError summary card — errors now shown in a dedicated card with clear formatting

v2.3

February 6, 2026

Revision File Attachments

You can now attach files (images, PDFs, documents) when requesting modifications to a completed run. The CEO agent uses attached files as context for more accurate revisions. Both mobile and desktop layouts support the new file upload UI.

NewFile attachments in Request Modifications — attach up to 10 files (images, PDFs, text files) when requesting revisions to completed runs

NewImage reference support — upload screenshots or design mockups and the CEO agent uses them as visual context for website revisions

ImprovedMultimodal LLM support — ChatGPT client updated to support vision/image content in messages

ImprovedFile chips UI — attached files shown as removable chips with file name, size, and type

ImprovedBoth mobile and desktop modification sections now have consistent file upload UI

v2.2

February 6, 2026

Run Reliability & Refresh

Fixed the 'Invalid run state' error when running tasks with long briefs. Task briefs are now stored server-side before SSE connection, eliminating URL length limits. Added a Refresh button to reload run data without a full page refresh.

FixedFixed 'Invalid run state or missing task brief' error — task briefs are now stored server-side via in-memory bridge, eliminating URL length limits for long briefs

NewRefresh button — reload run data (plan, draft, final output, evidence) without refreshing the entire page

ImprovedPending run briefs auto-expire after 60 seconds to prevent memory leaks

v2.1

February 6, 2026

Live Progress, Thumbnails & Cross-Device Access

Real-time progress indicators show cycle count, current phase, and estimated time. Projects gallery now shows live thumbnail previews. Long task briefs no longer cause errors. Runs persist in history and are accessible from any device via invite code.

NewLive progress indicators — shows 'Cycle 2 of 5' with color-coded phase badges (Planning/Executing/Reviewing) and estimated time remaining

NewPhase-colored progress bar — blue for planning, amber for executing, purple for reviewing, with search/replan counters

NewProject thumbnail previews — Projects gallery shows live iframe previews of each generated website with hover-to-view overlay

FixedLong task brief error — increased limit from 10K to 50K characters, fixing database errors on detailed briefs

FixedSSE ownership check — fixed to use invite code instead of session ID, ensuring cross-session access

ImprovedCross-device access — runs are tied to invite code, so Yotam can view results on his iPhone or Windows PC by entering the same invite code

ImprovedProgress state reset — all progress indicators properly reset when starting a new task

v2.0

February 6, 2026

History Fix, Start New Task & Tailwind CDN

Run history is now fully clickable across sessions, a 'Start New Task' button lets you reset and begin fresh, and all generated websites automatically include Tailwind CSS + Inter font for polished styling.

FixedRun history click-to-view — fixed ownership check so runs from previous sessions are accessible (uses invite code instead of session ID)

FixedAll run endpoints (get, approve, abort, modify) now validate by invite code for cross-session access

NewStart New Task button — appears next to Run button after a task completes, resets all state for a fresh start

NewTailwind CDN auto-injection — generated HTML websites automatically include Tailwind CSS, Google Fonts (Inter), and base typography

ImprovedTailwind injection is smart — skips if Tailwind is already present, injects before </head> or creates <head> if missing

ImprovedModified websites also get Tailwind CDN injection when re-uploaded to S3

v1.9

February 6, 2026

Website Preview & Full HTML Pipeline

The AI CEO now generates complete, self-contained HTML websites when you ask for a website, landing page, or portfolio. Preview them live in an iframe with code view, fullscreen, and download options.

NewLive HTML Preview — website outputs render in an interactive iframe with Preview/Code toggle, fullscreen, download, and open-in-new-tab

NewSmart website detection — the Operator now automatically detects website tasks and outputs complete HTML instead of text

NewHTML-aware pipeline — Reviewer evaluates HTML quality (not citations), Finalizer preserves HTML as-is, Modifier re-uploads to S3

ImprovedHigher output limits — website tasks use 16K max tokens for richer, more complete HTML output

ImprovedClean HTML output — markdown code block wrapping and stray text are automatically stripped from LLM responses

ImprovedDesktop export buttons hidden for HTML — the HtmlPreview toolbar replaces generic .txt/.md/PDF buttons

FixedDraft modifications for websites now re-upload the updated HTML to S3 and update the project record

v1.8

February 6, 2026

Auto-Save Website Projects

When the CEO generates a website (HTML output), it is automatically saved as a project with a permanent shareable URL at /projects/:slug.

NewAuto-detect website output — HTML content is automatically identified when the CEO generates a full website

NewAuto-save to S3 — website HTML is uploaded to cloud storage with a permanent URL

NewProject creation — a project entry is created in the database with slug, title, and S3 link

NewGreen banner notification — after a website run completes, a banner shows with 'View Live' and 'Project Page' links

ImprovedSmart slug generation — URL-safe slugs are auto-generated from the task brief

ImprovedTitle extraction — project title is extracted from HTML <title> or <h1> tags

v1.7

February 6, 2026

Versions & Website Preview

Added Versions tab to mobile layout, View Website button for HTML outputs, and export buttons (Copy, .txt, .md, PDF) on every version.

NewView Website button — opens HTML output in a new tab as a live preview when version contains website code

NewVersions tab on mobile — now accessible on all screen sizes, not just desktop

NewExport buttons on Versions — Copy, .txt, .md, and PDF export for each version's draft output

ImprovedMobile tabs redesigned with flexible layout to accommodate all tabs including Versions

v1.6

February 6, 2026

Simplified Settings & Reliability

Redesigned run settings as a clean flat list, added delete run capability, and improved stability across the board.

NewRedesigned run settings — flat list of inputs instead of 4-category cards, type a number or 'E' for endless

NewDelete Run button — clean up failed or test runs from your history

NewSettings persistence — last-used values saved to localStorage, restored automatically

NewUntil Complete toggle — Yes/No switch with safety cap of 50 cycles

ImprovedAll limits configurable independently — Max Dollar, Max Time, Max Cycles, Max Replans, Max Web Searches

FixedtRPC error on invite page — improved error handling for HTML-instead-of-JSON responses

FixedGlobal query error handling — prevents app crashes from transient API errors

v1.5

February 6, 2026

Persistent Run History

Run history now persists across version updates and re-authentication — your past runs will never disappear again.

FixedRun history now tied to invite code instead of session ID — survives re-publishing and re-authentication

FixedAll previously 'lost' runs restored — data was always in the database, just queried by wrong field

ImprovedRate limiting cooldown now uses invite code for consistency across sessions

v1.4

February 6, 2026

Run Modes, Projects & Reliability

Four flexible run modes, live project previews, persistent invite codes, and critical stability fixes.

New4 Run Modes — choose Cycles, Time Limit, Cost Limit, or Until Complete (autonomous mode)

NewProject Generation — website outputs get live preview URLs at /projects/:slug

NewProjects listing page — browse all public projects built by the AI CEO

NewPersistent invite code — saved in localStorage so you don't re-enter it each version

NewToast notifications on draft modification completion

ImprovedExamples page updated to show all 18 game iterations

ImprovedRun mode selector with visual cards for easy selection

FixedReact error #310 — eliminated all early returns to prevent hook ordering crashes on login

FixedTyping lag — converted inline layout components to stable JSX variables

v1.3

February 6, 2026

Draft Modifications & CEO Attribution

Iterative draft refinement — ask the CEO to revise outputs, with clear attribution showing which agent wrote each draft.

NewDraft modification — request changes to completed drafts and the CEO (ChatGPT) will revise them

NewIterative refinement — submit multiple modification requests to progressively improve outputs

NewCEO attribution — 'CEO (ChatGPT) says:' header clearly shows which agent authored the draft

ImprovedModification counter tracks how many revisions have been made

ImprovedUpdated output automatically appears in the Final tab after each revision

v1.2

February 6, 2026

File Intelligence & Polish

Major upgrade to file handling with content extraction, drag-and-drop uploads, and version tracking across the app.

NewPDF and text file content extraction — the AI CEO can now read and analyze uploaded documents

NewDrag-and-drop file upload — drop files directly onto the task input area

NewVersion display across all pages (homepage, dashboard, instructions, examples)

NewChangelog page to track what's new in each release

NewSession recovery UI — resume incomplete runs from earlier sessions

NewVersion update notification — get alerted when a new version is available

ImprovedFile size limits displayed before upload (10MB per file, 20 files max)

ImprovedExtracted document content included in CEO prompt for deeper analysis

FixedSession persistence ensured across deployments — users never lose work

v1.1

January 28, 2026

File Uploads & Run History

Added file upload support, run history sidebar, and configurable run settings.

NewFile upload support — attach images, PDFs, and documents to tasks

NewRun history sidebar — browse and reload previous runs

NewConfigurable run settings — adjust cycles, searches, and replans per run

NewUser history page for logged-in users across all sessions

ImprovedReal-time progress bar during run execution

ImprovedCycle version tracking — compare outputs across refinement cycles

FixedFixed rate limiting to properly track per-invite daily limits

v1.0

January 15, 2026

Initial Release

First public release of AI Company — the AI CEO which thinks before acting.

NewThree-agent architecture: CEO (planner), Operator (executor), Reviewer (quality)

NewReal-time SSE streaming of all agent states and outputs

NewOption evaluation with risk assessment before execution

NewPlan approval gate — review and approve the CEO's plan before execution

NewEvidence tracking with source URLs for every claim

NewInvite-only access control with rate limiting

NewCopy and download final outputs

ImprovedMobile-responsive design with dedicated mobile and desktop layouts

ImprovedTrust banner — transparent about verification limitations

Have a feature request or found a bug?