Changelog

What's new in AI Company. Every feature, improvement, and fix.

v3.35
CurrentMarch 13, 2026

Mobile Sticky Header Fix, Improvement Cycle Hang Fix & QC Hardening

Generated output headers no longer stick to the screen on mobile — both Streamdown prose and HtmlPreview iframe outputs are now fully scrollable. Improvement cycle LLM calls that hung indefinitely are now capped with a 90s timeout. Comprehensive QC audit performed by ChatGPT agent covering content accuracy, navigation, trust messaging, and mobile UX.

FixedMobile sticky header fix — generated output headers (titles, tags, search bars, buttons) no longer stick to screen while scrolling on mobile
FixedHtmlPreview iframe: injected JS neutralizes all position:sticky/fixed elements after page load — catches CSS classes, <style> blocks, and inline styles
FixedStreamdown prose: added contain:paint CSS containment + strips <style> tags rendered by rehype-raw to prevent sticky escaping scroll container
FixedImprovement cycle hang fix — LLM calls now have a 90s timeout cap, preventing indefinite hangs with only heartbeat messages
ImprovedQC audit by ChatGPT agent: verified three-agent role consistency (CEO=ChatGPT, Operator=Built-in LLM, Reviewer=Gemini) across all pages
v3.34
March 12, 2026

Bold New Logo, One-Click Install, Performance & Accessibility

Complete logo overhaul with bold white 'A' on deep blue background — clearly visible at all icon sizes including Android home screen. Smart PWA install banner auto-detects installability and offers one-tap install. Lazy loading splits 17 routes into separate chunks, boosting Lighthouse Performance from 36 to 54. Accessibility score raised to perfect 100. App background fixed to clean white.

NewBold new app icon — programmatically generated white 'A' on deep blue gradient, visible at all sizes (favicon, 192px, 512px, maskable variants)
NewSmart install banner — auto-detects if app isn't installed, shows dismissable bottom bar with one-tap Install button (remembers dismissal for 7 days)
New'Install App' link added to SiteFooter on every page — appears only when app is installable
NewShared useInstallPrompt hook — reused by InstallBanner, SiteFooter, and About page install section
ImprovedLazy loading — 17 route components converted to React.lazy() with Suspense, only Home loads eagerly
ImprovedRoute prefetching — Invite, AppMain, and About chunks preloaded 2 seconds after Home mounts for instant navigation
ImprovedLighthouse Performance score improved from 36 to 54 (+50%) via code splitting and prefetching
ImprovedLighthouse Accessibility score raised from 80 to 100 — removed zoom restriction, added landmarks and aria-labels
FixedApp background restored to clean white — explicit bg-white on all desktop columns (Plan, Outputs) and mobile layout
FixedLoading screen converted from dark (#0F172A) to white with light-themed text, icons, and tip cards
FixedMaskable icons now use full-bleed blue background — no more double-rounding on Android adaptive icons
ImprovedColor contrast improvements — agent name text upgraded to -700 shades, version badge uses gray-600 on gray-100
NewPlay Store prep — manifest icons split into 'any' and 'maskable' entries, .well-known/assetlinks.json template created
v3.32
March 6, 2026

QA Bug Fixes + PWA Cache Busting

Fixed reviewer FAIL label showing misleading 'In Progress' text, updated service worker cache to force new logo propagation on PWA reinstall, and verified all v3.22 QA report items.

FixedReviewer FAIL verdict now shows 'Needs Review' instead of misleading 'In Progress' label
NewWarning notice when build saved despite reviewer FAIL — suggests increasing Max Cycles
FixedService worker cache bumped to v3.32 to force new logo propagation on PWA reinstall
Improvedmanifest.json now included in service worker cache for faster PWA icon updates
FixedVerified all v3.22 QA report items: abort cooldown, Run badge, /builds footer, Share toast
v3.31
March 6, 2026

New 'A' Circuit Logo + Educational Loading Screen

Brand new 'A' Circuit logo across the entire app — a stylized letter A morphing into neural network circuits with glowing blue nodes. The loading screen now teaches you about Run, Build, and Pro modes with rotating example cards.

NewNew 'A' Circuit logo — stylized letter A with neural network circuits and glowing blue nodes
NewLogo updated across all pages: header, footer, favicon, PWA icons, loading screen
NewEducational loading screen with 6 rotating tip cards for Run, Build, and Pro modes
ImprovedEach tip shows mode name, example prompt, and description with smooth fade animations
ImprovedMode pills highlight the current tip's mode (Run/Build/Pro) with color-coded indicators
ImprovedPre-React HTML loading screen shows logo and mode pills before JS bundle loads
v3.30
March 6, 2026

New Logo, No Splash Screen & PWA Improvements

New circuit-brain app icon replaces the generic white circle. PWA splash screen eliminated — dark background loads instantly without white flash on app open.

NewNew circuit-brain logo and favicon — distinctive dark navy design with glowing blue neural network
FixedPWA splash screen eliminated — dark background (#0F172A) loads instantly, no white flash on app open
ImprovedUpdated PWA manifest with new 192x192 and 512x512 icons
ImprovedService worker cache busted (v2) to ensure new logo propagates to all users
v3.29
March 5, 2026

Auto Deploy Verification & Owner Notification

After every publish, the server automatically runs the full 12-check Deploy Verification suite (database, APIs, version, changelog, LLM connectivity, etc.) and sends the results as a notification to the project owner. No manual steps needed — just publish and you'll be notified if anything broke.

NewAuto-run Deploy Verification on server startup in production — 12 checks run automatically 10 seconds after deploy
NewOwner notification with full pass/fail report including individual check results and durations
ImprovedFailed checks highlighted separately in notification for quick triage
ImprovedError handling: if verification itself crashes, owner is notified about the error
v3.28
March 5, 2026

CEO Revision Creates New Build Versions (V4, V5, etc.)

CEO revisions in Build mode now create new Build versions instead of overwriting the latest. Each revision saves as V4, V5, V6, etc. so users can compare all iterations side-by-side in the Build Versions viewer. The new version appears instantly in the left panel after the revision completes.

NewCEO revision in Build mode creates a new Build version (V4, V5, etc.) instead of overwriting the latest
NewNew Build version appears instantly in the left-panel Build Versions viewer after revision completes
ImprovedUsers can now compare all CEO revision iterations side-by-side via version buttons
ImprovedBuild version label shows 'CEO Revision' source to distinguish from original build cycles
v3.27
March 5, 2026

Shared Footer Component, Playwright E2E Smoke Tests & QA Automation

Extracted the 8-link footer into a shared SiteFooter component used by all 6 pages (Home, About, Examples, Instructions, Changelog, Builds, ComponentShowcase) — eliminates duplication and ensures consistent navigation. Created a comprehensive Playwright e2e smoke test suite (pre-deploy-smoke.spec.ts) with 22 automated tests covering footer links, version badges, mobile overflow, changelog integrity, about page model names, invite flow, and cooldown persistence. All tests run before every publish to catch regressions automatically.

ImprovedExtracted footer into shared SiteFooter component — all 7 pages now use a single source of truth for navigation links
NewPlaywright e2e pre-deploy smoke tests (22 tests) covering footer links, version badges, mobile overflow, changelog, and more
ImprovedSiteFooter supports 'dark' variant for pages with dark backgrounds (e.g., Examples page)
FixedComponentShowcase page now shows full navigation footer instead of minimal 'Shadcn/ui Component Showcase' text
v3.26
March 5, 2026

Perplexity QA Fixes: Cooldown Persistence, Run Badge, Footer & Share Toast

Addresses all findings from the Perplexity QA v3.22 test report. The 15-second abort cooldown now persists across page refreshes via localStorage, preventing bypass by reloading. The blue 'Run' badge now appears in logged-in user history (userHistory endpoint was missing runMode). Rate-limit errors from the server now auto-trigger a client-side cooldown countdown. The /builds page now includes the standard 8-link footer, and all copy-to-clipboard actions show a consistent toast notification.

FixedAbort cooldown now persists across page refreshes via localStorage — can't bypass by reloading
FixedBlue 'Run' badge now shows in logged-in user history (userHistory endpoint was missing runMode field)
FixedServer rate-limit errors now auto-trigger client-side cooldown countdown with extracted wait time
FixedDesktop Versions tab trigger now checks buildVersions in addition to cycleVersions
Improved/builds page now includes standard 8-link footer navigation
ImprovedAll copy-to-clipboard actions now show consistent toast notification ('Copied to clipboard!')
NewPre-deployment QA checklist (QA-CHECKLIST.md) with 40+ regression tests organized by category
v3.25
March 4, 2026

Mobile Versions Tab, Revision Timeout Fix & Time Remaining Estimate

Fixed the missing Versions tab on mobile Build mode — V1/V2/V3 website version buttons now appear on mobile just like desktop. Increased CEO revision timeout from 5 to 8 minutes and added per-operation timeouts (30s per search, 20s per image scrape) to prevent indefinite hangs. The modification button now shows an estimated time remaining during each phase.

FixedMobile Build mode now shows Versions tab with V1/V2/V3 website version buttons (was missing entirely)
FixedCEO revision timeout increased from 5 to 8 minutes to accommodate complex research requests
FixedPer-operation timeouts: 30s per search query, 20s per image scrape — prevents indefinite hangs
NewModification button shows estimated time remaining during each phase (Researching / Thinking / Streaming)
ImprovedTimed-out search queries and image scrapes are gracefully skipped instead of failing the entire revision
v3.24
March 4, 2026

CEO Revision Progress Tracking, Retry Buttons & Stale Connection UX

Major improvement to the CEO modification flow: the button now shows real-time progress during research, image scraping, and LLM generation phases with an elapsed time counter. If a revision fails, a Retry button appears inline. All three stale connection warning banners now include a Retry button for immediate manual recovery. Progress events are streamed from the server during the research phase so users see exactly what the CEO is doing.

ImprovedCEO revision button shows real-time status: 'Researching...', 'Scraping images...', 'CEO is thinking...', 'Streaming revision...' with elapsed time counter
ImprovedServer sends progress events during web research and image scraping phases — no more silent waiting
NewRetry button appears inline after a failed CEO revision — no need to re-type the modification
NewRetry button added to all stale connection warning banners (mobile + desktop) for immediate manual recovery
ImprovedProgress messages logged to activity log so users can see research/scraping steps in the Log tab
FixedSafety timeout now properly clears all progress state and shows retry option
v3.23
March 4, 2026

Abort Cooldown Enforcement, Rate Limit Error Handling & Run Badge Fix

Fixes three bugs found during QA testing: (1) After aborting a run, buttons now show a visible 15-second cooldown countdown preventing rapid re-runs. (2) 'Rate exceeded' plain-text errors from upstream proxies are now caught and shown as a friendly message instead of a JSON parse crash. (3) The blue 'Run' badge now correctly appears in the history sidebar for all Run-mode tasks.

FixedAbort cooldown enforced — buttons disabled for 15s with visible countdown after abort
Fixed'Rate exceeded' JSON parse error fixed — upstream plain-text errors now shown as friendly messages
FixedBlue 'Run' badge now shows in history sidebar (was checking wrong value 'run' instead of 'operator')
ImprovedCooldown countdown visible on both mobile and desktop Run/Build/Pro buttons
ImprovedToast notification after abort informs user of 15s wait
v3.22
March 4, 2026

Mobile Stall Watchdog, SSE Reconnection UX & Share Fixes

Critical fix for mobile execution stalls: added a client-side watchdog that detects when no server events arrive for 45+ seconds, shows a visible warning, and auto-aborts at 90 seconds with a retry option. SSE reconnection now shows toast notifications ('Reconnecting...' / 'Reconnected!'). Stale connection warning banner appears on both Task and Log tabs. Share button uses on-demand S3 upload so blob URLs are never shared.

NewMid-run stall watchdog — warns at 45s of no events, auto-aborts at 90s with retry option
NewStale connection warning banner on Task tab — amber bar shows 'No server updates for Xs' with WifiOff icon
NewSSE reconnection toasts — 'Connection interrupted. Reconnecting...' and 'Reconnected! Resuming updates.'
FixedShare button always uploads HTML to S3 on-demand — never copies blob: URLs
ImprovedStale warning visible on both mobile Task tab and desktop main content area
ImprovedAuto-abort message guides user to retry with same task
v3.21
March 3, 2026

On-Demand S3 Upload for Share & Run Mode Badge

Share and Open buttons now upload HTML to S3 on-demand via a new tRPC endpoint, ensuring shared links are always real CloudFront URLs. Added 'Run' mode badge (blue) to history sidebar so all three modes have visible badges.

Newproject.shareHtml tRPC endpoint — uploads HTML to S3 on-demand when user clicks Share or Open
New'Run' mode badge (blue) in history sidebar — matches Build (green) and Pro (violet) badges
FixedShare button always visible in HtmlPreview toolbar with loading spinner during upload
ImprovedOpen in new tab uses CloudFront URL instead of blob URL
v3.20
March 3, 2026

Share Link Blob URL Fix & Cooldown Reduction

Fixed all Open/Share buttons to use real CloudFront URLs when available instead of blob: URLs. Reduced abort cooldown from 60s to 15s for faster re-testing.

FixedAll Open/Share buttons now use CloudFront URLs when savedProject exists
FixedAbort cooldown reduced from 60s to 15s — allows quick re-testing after abort
v3.19
March 3, 2026

Cooldown Reduction & Planning Latency Investigation

Reduced run cooldown from 60 seconds to 15 seconds for faster iteration. Investigated 'stuck in planning' report — confirmed it was normal LLM latency with heartbeat working correctly.

FixedRun cooldown reduced from 60s to 15s across all references
ImprovedVerified heartbeat mechanism works during long planning phases
v3.18
March 3, 2026

Share Button Blob URL Fix

Fixed Share button to only appear when a real CloudFront URL exists. Open in new tab now uses the hosted URL instead of blob URL.

FixedShare button only shows when savedProject has a real CloudFront URL
FixedOpen in new tab uses CloudFront URL when available
FixedBuildVersionsViewer Share button confirmed using correct S3 htmlUrl
v3.17
March 3, 2026

Dedicated Share Button & Button Clarity

Added a dedicated Share button to Build Versions and HtmlPreview toolbar that copies the CloudFront website URL. Renamed 'Play' to 'Preview' and 'View & Play' to 'View Latest' for clarity.

NewDedicated Share button in Build Versions — copies CloudFront URL to clipboard
NewShare button in HtmlPreview toolbar when saved project exists
ImprovedRenamed 'Play' to 'Preview' for clarity
ImprovedRenamed 'View & Play' to 'View Latest'
v3.16
March 3, 2026

Image Scraping for CEO Revisions & Scroll Bug Fix

CEO revision flow can now scrape real images from websites mentioned in modification instructions. When users say 'add pictures from example.com', the system fetches actual image URLs from the HTML instead of guessing. Defensive CSS fixes applied to all marketing pages to prevent white-page scroll issues across browsers.

NewfetchWebpageImages() scrapes real image URLs from websites mentioned in CEO revision instructions
NewImage scraping integrated into both modifyDraft and modifyDraftStream flows
FixedDefensive CSS fixes on all marketing pages — explicit background-color and color fallbacks
FixedAbout page now shows version badge and model names (ChatGPT, Built-in LLM, Gemini)
FixedAll page footers now include complete 8-link navigation
FixedChangelog footer updated with full navigation links
ImprovedImage URL verification via HEAD requests filters broken/tiny/icon images
ImprovedData URI filtering prevents base64-encoded images from being extracted
v3.15
March 2, 2026

Mode Suggestion Toast, Heartbeat & Public Changelog

Mode suggestion toast appears when a task would work better in a different mode. Heartbeat entries show progress during long operations. Changelog page made public. Footer navigation updated on most pages. Pricing chips visible below status bar.

NewMode suggestion toast — appears ~20-30s into a run if a different mode would be better
NewHeartbeat entries in Log tab — 'Still working' messages every ~30 seconds
NewChangelog page now publicly accessible (was owner-only)
NewPricing chips visible below status bar: Run $0.10 / Build $0.60 / Pro $6.10
FixedFooter navigation updated to 7 links on most marketing pages
ImprovedMode suggestion threshold lowered for better detection
v3.14
March 2, 2026

CEO Revision Web Research & Build Timeout Increase

CEO revision flow now does live web research when users mention URLs or ask to improve content. Build timeout increased to 20 minutes with automatic retry. Better error messages for skipped cycles.

NewCEO revision triggers web research when URLs detected in modification instructions
NewextractResearchNeeds() analyzes modification text for URLs and generates search queries
FixedBuild timeout increased from 10 to 20 minutes with automatic retry on timeout
FixedBetter error messages when cycles are skipped due to budget constraints
ImprovedModification instructions parsed for domain-specific search queries
v3.13
March 2, 2026

Streaming CEO Revisions & Activity Log Improvements

CEO revision process now streams changes in real-time instead of waiting for the full response. Activity log entries improved with better formatting and timestamps.

NewCEO revisions stream in real-time — watch changes appear as they're generated
NewRevision counter shows number of CEO modifications applied
ImprovedActivity log entries have better formatting with actor badges
ImprovedModification request entries show in cyan for visual distinction
v3.12
March 2, 2026

CEO Modification Flow & Send to CEO Button

New 'Send to CEO for Revision' button lets users request modifications to completed outputs. CEO analyzes the request and applies targeted changes to the existing draft.

New'Send to CEO for Revision' button on completed outputs
NewCEO analyzes modification requests and applies targeted changes
NewModification log entries track all revision requests
ImprovedCEO modifies existing draft instead of rewriting from scratch
v3.11
March 2, 2026

Quick Test, Automated Verification & Revision Upgrades

Major admin tooling update: Quick Test button runs lightweight smoke tests ($0.15-0.30) to verify the full Run + Build pipeline without external agents. All 12 manual deploy checklist items are now automated server-side checks. Revision process upgraded to modify existing drafts instead of rewriting from scratch. Credit dashboard crash fixed.

NewQuick Test admin button — runs programmatic Run + Build smoke test from /admin/health
NewAutomated Deploy Verification — all 12 manual checklist items now run as server-side tests
NewUser modification log entries — 'User requested modification' appears in activity log with cyan styling
FixedCredit Dashboard crash fixed — TypeError on toFixed when budget fields were undefined
FixedRevision process now upgrades existing draft instead of rewriting from scratch
FixedCEO prompt reduced over-asking — proceeds with reasonable assumptions instead of excessive clarification
ImprovedDeploy checklist page redesigned with automated verification results and timing
ImprovedAll .toFixed() calls wrapped with null safety guards across Credit Dashboard
v2.47
February 18, 2026

Stale Run Recovery & Deployment Resilience

Fixed the 'Unable to start run' error that occurred after server deployments. When a run was interrupted by a deployment (stuck in PLAN/EXECUTE state but no longer active in memory), the system now automatically recovers it by restarting with the stored task brief. Users see a seamless restart instead of an error message.

FixedFixed 'Unable to start run' error after server deployments — stale runs now auto-recover
NewStale run detection: identifies runs in non-terminal DB state but not active in memory
NewAutomatic recovery: restarts stale runs with stored task brief from database
ImprovedClear error messages when recovery isn't possible ('interrupted by server update')
ImprovedDiagnostic logging for stale run recovery to aid debugging
v2.46
February 18, 2026

Connection Health Indicator & Retry for Stuck Steps

Added a real-time 'Last update: Xs ago' indicator with color-coded connection health (green/yellow/red). When no server event is received for 90+ seconds, a 'Retry Step' button appears to abort the stuck step and let users re-submit. Visible on both desktop and mobile, on all tabs.

New'Last update: Xs ago' indicator shows time since last server event with Wifi/WifiOff icon
NewColor-coded: green (<30s), yellow (30-59s), red (60s+) for instant connection health
New'Retry Step' button appears after 90s of no response — aborts stuck step with toast guidance
ImprovedIndicator visible on both desktop sidebar and mobile sticky progress bar
ImprovedAll 14 SSE event handlers now track last event time for accurate monitoring
v2.45
February 18, 2026

Heartbeat Progress Updates During Long LLM Calls

Added heartbeat progress updates every 15 seconds during all long-running LLM calls. Users now see live status messages like 'CEO is analyzing the task (30s)...' instead of a frozen screen. Applied to all 7 LLM call points: CEO planning, Operator drafting, Reviewer checking, Replanning, Improvement cycles, and Finalization.

NewHeartbeat wrapper sends status updates every 15 seconds during long LLM calls
NewAnimated dots ('.', '..', '...') cycle to show the system is alive
NewProgress bar increments by 2% per heartbeat to show forward movement
ImprovedApplied to all 7 LLM call points: CEO, Operator, Reviewer, Replan, Improvement, Finalize
ImprovedCombined with v2.44 client-side timer for fully responsive progress tracking
v2.44
February 18, 2026

Mobile UX: Live Timer, Progress on All Tabs, Bigger Fonts

Fixed the frozen elapsed timer so it counts up every second (not just on server events). Added a sticky progress bar visible on Plan and Outputs tabs. Increased font sizes for credit balance and settings on mobile. Hidden confusing dual ETAs during active runs.

FixedElapsed timer now ticks every second via client-side interval, independent of SSE updates
FixedSticky progress bar (phase stepper + bar + timer) visible on ALL mobile tabs, not just Task
FixedCredit balance and settings text increased from 9-10px to 11-12px for mobile readability
FixedConfusing 'ETA: ~2 min / ~5 min' hidden during active runs — relabeled as 'Avg Run / Avg Build' when idle
ImprovedProgress bar height increased from 1px to 1.5px for better visibility
v2.43
February 18, 2026

LLM Retry Logic & Invite Code Fix

Added retry logic with exponential backoff to all three LLM clients (Built-in, ChatGPT, Gemini) to handle transient upstream errors. Fixed the 'Change Invite Code' button so it properly clears the saved session and navigates to the invite page.

FixedBuilt-in LLM (Operator) now retries up to 3 times with 2s/5s/10s backoff on 500/502/503/504 errors
FixedChatGPT (CEO) now retries up to 3 times for both regular and streaming calls
Fixed'Change Invite Code' button now clears localStorage saved code, preventing auto-redirect back to app
FixedSidebar sheet closes before logout mutation fires for smoother UX
ImprovedE2E tested: Full Run+Build pipeline completed successfully (to-do list app, 5 versions, 15 sources)
v2.42
February 17, 2026

Run Duration Tracking & Completion Notifications

Accurate run duration tracking with completedAt timestamps. Notification sound and vibration on mobile when runs complete so you don't have to watch the screen.

NewcompletedAt timestamp stored in database for accurate run duration tracking
NewTwo-tone completion chime (A5 + E6) plays when a run finishes
NewVibration pattern (100ms-50ms-100ms) on mobile when run completes
ImprovedETA calculation now uses completedAt for accuracy, with runLog fallback for older runs
v2.41
February 17, 2026

Pre-Deploy Checklist & ETA

New admin deploy checklist page with automated health checks and manual verification items. Estimated completion time shown near Run buttons based on historical run durations.

NewPre-deployment checklist at /admin/deploy with auto-run health checks (DB, APIs, search, version)
New12 manual verification items for deployment readiness (mobile, desktop, credit tracking, etc.)
NewEstimated completion time (ETA) shown near Run buttons based on historical run durations
ImprovedRun stats endpoint calculates average duration from completed runs
Fixed'Awaiting Approval' status no longer gets stuck after auto-approve — transitions directly to Executing
v2.40
February 17, 2026

Run History Mobile UX

Shows dollar amount alongside remaining runs in the Run History sidebar. Trash/delete icon is now always visible on mobile instead of requiring hover.

ImprovedRemaining section now shows dollar amount with color-coded warning (red <$3, yellow <$10)
FixedTrash icon always visible on mobile (was hover-only, invisible on touch devices)
v2.39
February 17, 2026

Build Quality & Credit UX Overhaul

Major fix: CEO and Operator now produce working HTML code instead of design documents for build tasks. Added low balance warnings, per-run cost tracking in history, and auto-refreshing credit balance.

FixedCEO no longer recommends Unity/Godot — enforces HTML/Canvas/JS single-file output for all build tasks
FixedImprovement cycles now improve the actual CODE, not polish a design document
FixedReviewer now auto-rejects design documents when user asked for a working product
NewLow balance warning toast: blocks Run+Build if <$3.50, warns if <$7, blocks if 0 runs left
NewPer-run cost breakdown in run history with Build badge and total cost
ImprovedCredit balance auto-refreshes after run completes, errors, or aborts
v2.38
February 17, 2026

Invite Code Cleanup & Credit Balance Display

Cleaned up old invite codes (BETA/GAMMA/DELTA/EPSILON/ZETA) to shorter names. Enhanced credit balance display next to Run buttons with visual progress bar, color-coded warnings, and estimated cost per run type.

ImprovedRenamed 8 invite codes to shorter names (BETA01, GAMMA1, DELTA1, EPSLN1, ZETA01, PRO001, TEAM01, VIP001) — all sessions and runs migrated
ImprovedDeactivated 9 old invite codes and QC test code
NewEnhanced credit balance display with visual progress bar showing budget usage
NewColor-coded budget warnings: green (healthy), yellow (<$10), red (<$3)
NewEstimated cost shown next to Run buttons: Run ~$0.10, Run + Build ~$6.10
v2.37
February 17, 2026

Mobile Reliability Fix — Task Brief Persistence

Fixed 'Invalid run state or missing task brief' error on mobile devices (Samsung S24 Plus and similar). Task briefs are now persisted to the database immediately when a run starts, eliminating race conditions between the run mutation and SSE connection on slow mobile networks.

FixedFixed mobile error: 'Invalid run state or missing task brief' — task brief now stored in DB as ultimate fallback
FixedIncreased SSE retry timeout from 3×500ms to 5×1000ms for slow mobile connections
ImprovedThree-tier task brief resolution: in-memory map → URL parameter → database row
ImprovedBetter error diagnostics: SSE handler now logs detailed debug info when falling through to error state
ImprovedUser-friendly error message: 'Unable to start run. Please try again' instead of cryptic 'Invalid run state'
v2.36
February 17, 2026

Gemini Fallback, Operator Code Output & Builder Auto-Trigger

Gemini API now retries with fallback models when overloaded. Operator prompt rewritten to always produce working code (games, apps, tools) instead of design documents. Builder auto-triggers immediately when 'Run + Build' completes — no manual click needed.

FixedGemini API retry logic with fallback model (gemini-2.0-flash) — no more 'high demand' errors killing the Reviewer
FixedOperator now produces working HTML/JS code for build tasks instead of design documents — games are playable, apps are functional
NewBuilder auto-trigger — when using 'Run + Build', the Builder agent starts automatically after the Operator finishes
ImprovedOperator prompt strengthened with explicit anti-patterns: no 'Game Design Documents', no 'Technical Specifications' — only working code
ImprovedFor games: Operator now uses Canvas API / pure JS instead of external frameworks (Phaser, Unity) to keep everything in one HTML file
ImprovedCEO search aggressiveness increased — now searches by default for almost all tasks
v2.35
February 17, 2026

Owner-Only Changelog, Instructions Rewrite & Code Update

Changelog page is now restricted to the project owner only (verified by identity, not invite code). Instructions page completely rewritten to reflect two run modes, credit system, and current features. Invite code simplified for easier entry.

NewChangelog page now owner-only — verified by OWNER_OPEN_ID, not visible to regular users
NewNew auth.isOwner tRPC endpoint for frontend owner identity checks
ImprovedInstructions page completely rewritten — now covers two run modes (Operator vs Builder), credit system, key features, and updated quick reference table
ImprovedRemoved all invite code references from changelog text — no codes exposed in UI
ImprovedChangelog navigation links removed from public pages — only accessible via direct URL by owner
ImprovedOwner invite code simplified for easier entry
v2.34
February 17, 2026

Credit System, Two Run Modes & Branding Update

Major update introducing a credit system with per-invite budgets, two run modes (Operator for fast AI runs, Builder for full website builds), admin credit dashboard, and complete removal of third-party branding from all user-facing text.

NewTwo run buttons: 'Run' (Operator mode) for fast AI execution, 'Run + Build' (Builder mode) for full website builds
NewCredit system with per-invite dollar budgets and run limits — budget enforcement blocks runs when exhausted
NewAdmin credit dashboard at /admin/credits — shows per-invite budget usage, total spend, and builder cost estimates
NewBuilder cost estimation at ~$6/build with calibration tracking for accuracy over time
NewCredit balance display in invite status — users see remaining budget and runs
ImprovedRemoved all third-party branding from user-facing text — login, activity logs, changelog, and dialogs
ImprovedRenamed 'manus' router to 'builder' — cleaner API naming throughout
ImprovedOperator model label changed from 'Manus LLM' to 'Built-in LLM' in activity logs
ImprovedOwner invite code set to $999 budget / 999 runs; other codes set to $25 / 99 runs
ImprovedCEO prompt updated to search by default — much better results for creative and product tasks
ImprovedSearch API (Tavily) health check added to admin health dashboard
New40+ new tests covering credit system, branding removal, run modes, and builder router
v2.32
February 17, 2026

Admin Health Dashboard, E2E Browser Tests & Post-Publish Verification

Added an admin-only health dashboard at /admin/health with live status indicators for database, API keys, environment, and system health. Set up Playwright E2E browser tests (25 tests) covering all critical user flows. Created a post-publish auto-verification suite (23 tests) that can run against production after every deploy.

NewAdmin health dashboard (/admin/health) — live status indicators with green/yellow/red for all system components
NewPlaywright E2E browser tests (25 tests) — homepage, navigation, all public pages, invite flow, auth, changelog, responsive design
NewPost-publish verification suite (23 tests) — site availability, API health, page rendering, auth flow, changelog integrity, performance
ImprovedAdmin dashboard protected by admin secret with localStorage persistence and auto-refresh
ImprovedNew npm scripts: test:e2e, verify:local, verify:prod for running different test suites
v2.31
February 17, 2026

Deep Testing, Health Check & Regression Guards

Added comprehensive test suites covering the full run lifecycle, post-deployment verification, and regression guards for every version since v2.26. New /api/trpc/version.health endpoint provides real-time system health monitoring with database, API key, and environment checks.

NewHealth check endpoint (version.health) — monitors database, API keys, environment, system status
NewRun lifecycle test suite (55 tests) — covers config validation, state machine, timeout, model detection, plan steps
NewPost-deployment verification suite (41 tests) — API contracts, regression guards, security checks, UI data contracts
ImprovedRegression guards for every version since v2.26 — GPT-5.2 fix, plan step cap, timeout handling, progress phases
ImprovedTotal test count increased from 246 to 340+ covering all critical paths
v2.30
February 16, 2026

Time Management: Countdown, Configurable Timeout & Estimated Completion

Added real-time elapsed/remaining time countdown to the progress bar, configurable per-run timeout (via Max Time setting), and estimated completion time displayed in the Cost Estimate card based on token complexity.

NewLive elapsed time countdown (MM:SS) shown in progress bar during runs
NewTime limit displayed as elapsed/timeout (e.g., 2:30/10:00) so users see remaining time
NewEstimated completion time (~1-2, 2-4, or 4-8 min) shown in Cost Estimate card based on token complexity
ImprovedTimeout is now configurable per-run via Max Time setting and applied to the hard timeout
ImprovedProgress stats row always visible during runs (searches, replans, estimated time left)
v2.29
February 16, 2026

Increased Hard Timeout to 10 Minutes

Doubled the hard timeout from 5 minutes to 10 minutes, giving complex tasks with multiple improvement cycles enough time to complete without hitting the time limit.

ImprovedHard timeout increased from 5 minutes (300s) to 10 minutes (600s)
ImprovedComplex tasks with 4 improvement cycles now have sufficient time to complete
v2.28
February 16, 2026

Graceful Timeout Handling for Improvement Cycles

Fixed a critical issue where runs that completed successfully (reviewer-approved) would show as ERROR when improvement cycles couldn't start due to time limits. Now, if time runs out during improvements, remaining cycles are gracefully skipped and the reviewer-approved output is saved. Finalization also gracefully handles timeouts by using the draft directly.

FixedRuns no longer show ERROR when improvement cycles are skipped due to time limit
FixedTimeout during finalization now saves the reviewer-approved draft instead of erroring
ImprovedTimeout with existing output now completes as END state, not ERROR
ImprovedClear log messages explain which improvement cycles were skipped and why
ImprovedImprovement completion counter accurately tracks partial progress
v2.27
February 16, 2026

QC Smoke Tests, Plan Step Badge & Progress Improvements

Added a comprehensive pre-deployment QC smoke test suite with 41 automated checks covering model selection, run creation, progress tracking, plan steps, version consistency, SSE events, and more. Plan step count is now shown as a badge next to the Execution Plan header. Progress display improved with replanning and improving phases.

NewPre-deployment QC smoke test suite — 41 automated checks across 10 categories
NewPlan step count badge shown next to Execution Plan header in both mobile and desktop
ImprovedProgress bar now shows 'Replanning' and 'Improving' phases with live updates
ImprovedCEO plans soft-capped at 3-7 steps to prevent Operator context bloat
ImprovedPlans over 7 steps are automatically truncated before passing to Operator
v2.26
February 16, 2026

GPT-5.2 API Compatibility Fix

Fixed a critical API error where GPT-5.2 was rejecting the max_tokens parameter. All GPT-5 family models (5, 5-mini, 5-nano, 5.2, 5.2-pro) now correctly use max_completion_tokens as required by OpenAI's API. Legacy models (GPT-4o) still use max_tokens.

FixedGPT-5.2 no longer throws 'Unsupported parameter: max_tokens' error
FixedAll GPT-5 family models now use max_completion_tokens instead of max_tokens
ImprovedNew requiresMaxCompletionTokens() helper for clean model detection
ImprovedBoth standard and streaming ChatGPT calls updated for compatibility
v2.25
February 16, 2026

PWA Popup Fix + QC Testing

Fixed the browser's automatic PWA install popup that was appearing on every page. The install prompt is now fully suppressed — users can only install the app by tapping the link at the bottom of the About page. No more intrusive banners.

FixedSuppressed browser's native PWA install popup globally — no more banner on homepage
ImprovedInstall link remains only at the bottom of the About page, user-initiated
ImprovedComprehensive QC testing checklist created for all 16 feature areas
v2.24
February 16, 2026

PWA Install — Add to Home Screen

AI Company can now be installed as a Progressive Web App (PWA) on your phone or desktop. Visit the About page and tap 'Install AI Company as an App' at the bottom. Works offline with service worker caching. No app store needed.

NewPWA manifest with app name, icons, and theme color for home screen installation
NewService worker for offline caching of static pages
NewNon-intrusive install link at bottom of About page — no popup, user-initiated only
ImprovedApple touch icon and meta tags for iOS home screen support
ImprovedShows 'App installed' confirmation after successful installation
v2.23
February 16, 2026

Auto-Migrate Existing User Model Defaults

Existing users who had old/retired models cached in localStorage (like GPT-4o or Gemini 2.0 Flash) are now automatically upgraded to the new defaults: GPT-5.2 for CEO and Gemini 3 Pro Preview for Reviewer. No manual action needed — the migration happens silently on page load.

FixedAuto-migrate cached CEO model from GPT-4o/GPT-4o-mini/o1/o3-mini to GPT-5.2
FixedAuto-migrate cached Reviewer model from Gemini 2.0 Flash/2.5 Flash/2.0 Pro to Gemini 3 Pro Preview
ImprovedMigration is silent and automatic — existing users see updated defaults immediately
ImprovedFuture model retirements can be handled by adding to the migration list
v2.22
February 16, 2026

Builds Gallery + Access Code Protection

New public Builds Gallery page at /builds showing all Builder builds with status, progress, and credit usage. Each build is protected by a unique 8-character access code — you need the code to view details, URLs, and results. The Examples page remains fully public. Access codes are shown to the build creator after triggering a build, and can be shared with others for controlled access.

NewNew /builds page — public gallery showing all Builder builds with status and progress
NewAccess code protection — each build gets a unique 8-character code required to view details
NewAccess code displayed to build creator in the Build component
NewPublic gallery shows summary only (title, status, date) — no URLs or details without code
NewStats dashboard showing total builds, completed, in-progress, and credits used
ImprovedExamples page (/examples) remains fully public — no code needed
ImprovedBuilds link added to homepage navigation (desktop + mobile)
ImprovedBuild title auto-extracted from CEO output prompt
v2.21
February 16, 2026

Builder Agent — CEO Output → Real Websites

The CEO's output can now be sent directly to the Builder agent for real-world execution. After a run completes, use 'Run + Build' mode to have the Builder actually build what the CEO planned — websites, code, documents, anything. Real-time progress tracking with polling, result links, and shareable URLs. This is the first step toward the full AI Council vision.

New'Run + Build' mode triggers Builder agent after every completed run
NewBuilder API integration — creates full agent tasks from CEO output
NewReal-time build status tracking with 5-second polling
NewView Build link and shareable URL for completed builds
NewResult viewer shows Builder output, attachments, and credit usage
NewRetry button for failed builds
ImprovedBuild status persisted in database — survives page refresh
ImprovedNormalized Builder API response handling (id vs task_id)
v2.20
February 15, 2026

Default Model Upgrade — GPT-5.2 & Gemini 3 Pro

Upgraded default AI models to the latest and most capable versions. CEO agent now defaults to GPT-5.2 (OpenAI's best model for complex tasks) instead of GPT-5 Mini. Reviewer agent now defaults to Gemini 3 Pro instead of Gemini 2.5 Flash. Fixed model detection logic so GPT-5 base is correctly treated as a reasoning model, while GPT-5.2 uses standard API parameters. Also fixed 3 hardcoded fallback model references that were still pointing to retired GPT-4o.

ImprovedCEO default model upgraded from GPT-5 Mini to GPT-5.2 — OpenAI's most capable standard model
ImprovedReviewer default model upgraded from Gemini 2.5 Flash to Gemini 3 Pro — Google's most intelligent model
FixedFixed 3 hardcoded gpt-4o fallbacks in agents.ts — now correctly fall back to gpt-5.2
FixedFixed Reviewer fallback from gemini-2.0-pro to gemini-3-pro-preview
FixedGPT-5 base now correctly detected as a reasoning model (uses developer role, no temperature)
FixedGPT-5.2 correctly uses standard API parameters (system role, temperature, max_tokens)
NewAdded GPT-5 (Reasoning) to the CEO model selector dropdown
ImprovedGPT-4o marked as Retired (Feb 13, 2026) in model selector
v2.19
February 15, 2026

Mobile UX Improvements

Fixed mobile readability issues on Samsung Galaxy S24+ and similar devices. Log entry text is no longer truncated — full descriptions are visible without needing to expand. Transcript sections now use larger font sizes with better line spacing for comfortable reading on mobile screens.

FixedLog entry text no longer truncated on mobile — full descriptions visible
FixedTranscript prompt/response sections use larger, more readable font (text-xs instead of text-[9px])
ImprovedBetter line spacing (leading-relaxed) in transcript sections for mobile readability
ImprovedIncreased max-height for transcript sections so more content is visible before scrolling
v2.18
February 10, 2026

GPT-5 Temperature Fix

Fixed API error where GPT-5 models rejected custom temperature values. GPT-5 Mini and other GPT-5 models only support the default temperature (1), so the temperature parameter is now omitted for all GPT-5+ models. Legacy models like GPT-4o still support custom temperature.

FixedGPT-5 models no longer send temperature parameter (only default value 1 is supported)
FixedStreaming calls also omit temperature for GPT-5+ models
v2.17
February 10, 2026

GPT-5 API Compatibility Fix

Fixed a critical API compatibility issue where GPT-5 models rejected the legacy 'max_tokens' parameter. GPT-5 and newer models now correctly use 'max_completion_tokens' while legacy models (GPT-4o) continue using 'max_tokens'. The fix applies to both standard and streaming API calls.

FixedGPT-5 models now use max_completion_tokens instead of max_tokens (API requirement)
FixedStreaming calls also updated to use correct token parameter per model
ImprovedAutomatic detection of model generation for correct API parameter selection
v2.16
February 10, 2026

Smarter AI Models — GPT-5 & Gemini 3

Upgraded to the latest AI models for dramatically better output quality. CEO agent now defaults to GPT-5 Mini with GPT-5.2 and GPT-5.2 Pro available for complex tasks. Reviewer agent defaults to Gemini 2.5 Flash with Gemini 3 Pro Preview available for the deepest analysis. All models are selectable per-run in the Run Settings dropdown. Legacy models (GPT-4o, Gemini 2.0 Flash) still available but marked as deprecated.

NewCEO model selector — choose from GPT-5 Nano, GPT-5 Mini, GPT-5.2, GPT-5.2 Pro, or o3-mini per run
NewReviewer model selector — choose from Gemini 2.5 Flash-Lite, 2.5 Flash, 2.5 Pro, 3 Flash, or 3 Pro per run
ImprovedDefault CEO model upgraded from GPT-4o to GPT-5 Mini for better reasoning
ImprovedDefault Reviewer model upgraded from Gemini 2.0 Flash to Gemini 2.5 Flash
ImprovedModels configurable via CHATGPT_MODEL and GEMINI_MODEL env vars
FixedRemoved non-existent model IDs (gemini-2.0-pro) that caused API errors
v2.15
February 9, 2026

Agent Transcripts, Progress Stepper & Bug Fixes

Full transparency into what each AI agent said and did. Every log entry now has an expandable 'Agent Transcript' showing the actual prompt sent and response received, with model name and token estimates. The progress bar is replaced with a visual phase stepper showing Planning → Executing → Reviewing → Complete with checkmarks. Five Tier 1 bugs fixed including long brief validation, CEO prompt disambiguation, clarification handling, SSE race condition, and data retention cleanup.

NewAgent Transcript view — click to expand and see exact prompts/responses for CEO, Operator, and Reviewer
NewVisual phase stepper with checkmarks: Planning → Executing → Reviewing → Complete
NewData retention cleanup endpoint — admin can purge old runs via maintenance.cleanup
NewCEO clarification mechanism — ambiguous tasks are flagged with a question in the log
ImprovedProgress bar now shows phase-specific icons and animated pulse indicator for current phase
ImprovedCycle counter and stats (searches, replans) displayed with icons
ImprovedCEO prompt strengthened to better distinguish 'build' vs 'display' tasks
FixedTask brief validation increased to 50K chars to match SSE sanitizer
FixedSSE pendingRunBriefs race condition fixed with retry logic
v2.14
February 8, 2026

Reviewer Details, Improvement Cycles & History Fixes

The Reviewer's activity log now shows detailed findings — every issue, veto reason, and suggestion is visible instead of just a summary count. A new 'Post-Complete Improvements' setting (default 3) makes the CEO do additional research and improvement cycles even after the Reviewer approves, ensuring higher quality output. History sidebar no longer shows duplicate or non-clickable entries.

NewReviewer log entries now show full details — issues, veto reasons, and suggestions are all visible in the activity log
NewPost-Complete Improvements setting (0-10, default 3) — CEO does extra improvement cycles after Reviewer approves
NewImprovement agent researches how to enhance the output and applies changes automatically
FixedHistory sidebar no longer shows duplicate entries for the same task
FixedHistory items that were not clickable (IDLE runs) are now filtered out
ImprovedToast notifications when loading runs from history (success/error feedback)
v2.13
February 7, 2026

Execution Progress & What's Next

All output tabs now show live progress during execution instead of blank placeholders. The Final tab displays which agent is currently working (CEO planning, Operator building, Reviewer checking) with recent activity feed. Draft and Review tabs show contextual loading states. After run completion, a 'What's Next' section suggests modifications, new tasks, or export options.

NewLive execution progress on Final tab — shows current agent, cycle count, status, and recent activity feed
NewDraft tab shows 'CEO is drafting the plan...' spinner during planning phase
NewReview tab shows 'Reviewer is analyzing...' spinner during review phase
NewWhat's Next section after run completion — suggests modifications, new tasks, and export options
ImprovedEmpty states now show 'Click Run to start' hint when idle
ImprovedDesktop progress shows last 5 activity entries, mobile shows last 3
v2.12
February 7, 2026

Streaming Revisions & Extreme QC

CEO revisions now stream in real-time instead of waiting for the full response — you see the output being written word by word with a typing cursor. This also eliminates mobile timeout issues on slow connections. Full extreme quality control pass across all pages, APIs, and features.

NewStreaming revisions — CEO revision output now streams in real-time with a typing cursor animation
NewNew SSE endpoint /api/run/modify-stream for real-time revision delivery
ImprovedNo more 'Failed to fetch' on mobile — streaming eliminates timeout issues on slow connections
ImprovedRevision button shows 'Streaming...' indicator while CEO is writing
FixedFull extreme QC pass: all 7 pages, 6 API endpoints, 5 output tabs, mobile layout verified
v2.11
February 7, 2026

Smarter CEO — Build vs Display Intelligence

Major upgrade to CEO agent intelligence. The CEO now correctly distinguishes between 'build this' and 'display this' instructions — pasting game dev instructions will build the actual game, not a documentation page. Added clarification mechanism for ambiguous tasks. Improved log visibility with AI model labels. Better error handling for revision requests.

NewCEO now distinguishes 'build this' vs 'display this' — pasting game instructions builds the actual playable game, not a docs page
NewCEO asks clarifying questions when tasks are genuinely ambiguous instead of guessing wrong
NewActivity log now shows AI model labels — CEO (ChatGPT), Reviewer (Gemini), Operator (Built-in LLM)
ImprovedInput sanitizer limit increased from 5K to 20K characters to match the task brief validation limit
ImprovedWebsite detection expanded to recognize games, interactive apps, calculators, quizzes, and more
Fixed'Failed to fetch' on revision requests now shows a friendly network timeout message with retry guidance
v2.10
February 7, 2026

Unlimited Runs — Daily Limit Removed

Removed the 10 runs/day per-invite limit and the 30 runs/day per-IP limit. Users can now run unlimited tasks with only a 60-second cooldown between runs. Cleaned up unused rate limit constants.

NewUnlimited runs — removed the 10 runs/day daily limit per invite code
NewRemoved IP-based daily limit (was 30 runs/day per IP address)
Improved60-second cooldown between runs preserved to prevent accidental double-runs
ImprovedCleaned up unused RATE_LIMIT_RUNS_PER_DAY and RATE_LIMIT_IP_RUNS_PER_DAY constants
FixedUpdated Home page and Terms of Service to remove daily limit references
v2.9
February 7, 2026

Quality Check & Mobile Error Visibility

Full quality check across all pages, APIs, and mobile layouts. Error messages now display on all mobile tabs (Task, Plan, Outputs) instead of only the Task tab. Auto-switches to Task tab when an error occurs. Fixed all changelog dates from 2025 to 2026.

FixedMobile error visibility — errors now show on all three mobile tabs (Task, Plan, Outputs) instead of only the Task tab
FixedAuto-switch to Task tab on error — mobile users immediately see the error message instead of a blank Plan or Outputs tab
FixedChangelog dates corrected from 2025 to 2026 across all version entries
ImprovedFull QC pass — all 7 pages verified (Homepage, About, Examples, Instructions, Changelog, App, Projects)
ImprovedAll API endpoints verified (version.current, auth.me, project.listPublic, SSE stream)
Improved172 automated tests passing across 16 test files with 0 failures
v2.8
February 7, 2026

Friendly Error Messages & 20K Character Limit

Error messages are now human-readable instead of showing raw JSON. Task brief character limit increased from 5,000 to 20,000 characters. Validation errors display clear, actionable messages like 'Task brief is too long (max 20,000 characters)' instead of cryptic Zod error objects.

FixedError messages now display user-friendly text instead of raw JSON — e.g., 'Task brief is too long (max 20,000 characters)' instead of [{"origin":"string","code":"too_big"...}]
FixedtRPC error parser handles all error shapes: Zod validation arrays, nested tRPC errors, string messages, and unknown formats
ImprovedTask brief character limit increased from 5,000 to 20,000 characters for detailed prompts
ImprovedError helper function parseTrpcError() provides consistent error formatting across the entire app
ImprovedMobile error display properly wraps long error messages without overflow
v2.7
February 7, 2026

Mobile-First Headers & Version Visibility

All pages now have fully responsive headers that work on Samsung S24+ and other mobile devices. Version badge is always visible on every screen. Home page has a hamburger menu on mobile. Navigation buttons are compact on small screens with abbreviated labels.

ImprovedHome page: hamburger menu on mobile replaces horizontal nav that was overflowing off-screen
ImprovedVersion badge (v2.7) now always visible on all pages including mobile — uses smaller font (10px) and flex-shrink-0 to prevent squeezing
ImprovedInstructions, Changelog, Examples, Projects pages: responsive headers with abbreviated button labels on mobile ('Back' instead of 'Back to Home', 'Start' instead of 'Start Using AI Company')
ImprovedAll header layouts use gap-2 and min-w-0 constraints to prevent overflow on narrow screens
v2.6
February 6, 2026

Try This, History Search, Cost Tracking & Mobile Fixes

Three new features plus critical mobile fixes. 'Try This' buttons on instruction examples auto-fill the app with pre-configured tasks. Run history now has search and status filtering. Activity Log shows per-agent cost estimates with a running total breakdown. Mobile session restore no longer gets stuck, and viewport overflow is fixed.

NewTry This button on each instruction example — one click auto-fills the task brief, budget, cycles, and settings in the app
NewRun history search & filter — search by task brief text, filter by status (Complete, Error, Running, Aborted), or combine both
NewCost tracking per agent — each Activity Log entry shows its estimated cost, with a running total and per-agent breakdown (CEO, Operator, Reviewer, System)
FixedMobile: 'Restoring your session...' no longer gets stuck forever — 8-second timeout with manual retry option
FixedMobile: Content no longer cut off on the left side — fixed viewport overflow with proper CSS constraints
ImprovedDesktop Activity Log shows color-coded per-agent cost breakdown (CEO=blue, Operator=green, Reviewer=purple)
v2.5
February 6, 2026

Enhanced Instructions & Ready-to-Use Commands

Completely revamped instructions page with 4 detailed, copy-paste-ready example prompts covering deep research, brainstorming, website creation, and competitive analysis. Each example includes full task description, recommended settings with dollar budgets, and explanation of what each AI agent does. Added new Website Template, Pro Tips section, and Budget column to the Quick Reference table.

New4 complete ready-to-use command examples — Deep Research Report, Brainstorm & Best Answer, Website Creation, Competitive Analysis — each with full multi-paragraph prompts ready to copy-paste
NewPro Tips section — guidance on using Request Modifications, Activity Log, budget controls, and file attachments
NewWebsite Template added to Task Templates section
ImprovedQuick Reference table now includes Budget column with dollar ranges for each task type
ImprovedEach example includes 'What happens' explanation showing how CEO, Operator, and Reviewer collaborate
ImprovedRecommended settings now include dollar budget and time estimates alongside cycles/searches/replans
v2.4
February 6, 2026

Activity Log, HTML Preview & Post-Run Summary

New Activity Log tab shows every agent action in real-time — see what the CEO planned, what the Operator searched and drafted, and what the Reviewer decided. Draft tab now renders HTML websites as live previews instead of raw code. Post-run summary card shows completion stats with quick navigation links.

NewActivity Log tab — real-time feed of all agent actions with role-colored entries, timestamps, action badges, and search queries
NewPost-run summary card — shows action count, cycle count, sources gathered, reviews done, and quick links to Log/Output/Sources
FixedDraft tab now renders HTML content as live website preview (iframe) instead of showing raw HTML code
FixedOutput tabs (Draft, Review, Final, Evidence, Log) now always visible — no longer disappear after run completion
ImprovedActivity log loads from history — view the full log of any past run, not just live runs
ImprovedError summary card — errors now shown in a dedicated card with clear formatting
v2.3
February 6, 2026

Revision File Attachments

You can now attach files (images, PDFs, documents) when requesting modifications to a completed run. The CEO agent uses attached files as context for more accurate revisions. Both mobile and desktop layouts support the new file upload UI.

NewFile attachments in Request Modifications — attach up to 10 files (images, PDFs, text files) when requesting revisions to completed runs
NewImage reference support — upload screenshots or design mockups and the CEO agent uses them as visual context for website revisions
ImprovedMultimodal LLM support — ChatGPT client updated to support vision/image content in messages
ImprovedFile chips UI — attached files shown as removable chips with file name, size, and type
ImprovedBoth mobile and desktop modification sections now have consistent file upload UI
v2.2
February 6, 2026

Run Reliability & Refresh

Fixed the 'Invalid run state' error when running tasks with long briefs. Task briefs are now stored server-side before SSE connection, eliminating URL length limits. Added a Refresh button to reload run data without a full page refresh.

FixedFixed 'Invalid run state or missing task brief' error — task briefs are now stored server-side via in-memory bridge, eliminating URL length limits for long briefs
NewRefresh button — reload run data (plan, draft, final output, evidence) without refreshing the entire page
ImprovedPending run briefs auto-expire after 60 seconds to prevent memory leaks
v2.1
February 6, 2026

Live Progress, Thumbnails & Cross-Device Access

Real-time progress indicators show cycle count, current phase, and estimated time. Projects gallery now shows live thumbnail previews. Long task briefs no longer cause errors. Runs persist in history and are accessible from any device via invite code.

NewLive progress indicators — shows 'Cycle 2 of 5' with color-coded phase badges (Planning/Executing/Reviewing) and estimated time remaining
NewPhase-colored progress bar — blue for planning, amber for executing, purple for reviewing, with search/replan counters
NewProject thumbnail previews — Projects gallery shows live iframe previews of each generated website with hover-to-view overlay
FixedLong task brief error — increased limit from 10K to 50K characters, fixing database errors on detailed briefs
FixedSSE ownership check — fixed to use invite code instead of session ID, ensuring cross-session access
ImprovedCross-device access — runs are tied to invite code, so Yotam can view results on his iPhone or Windows PC by entering the same invite code
ImprovedProgress state reset — all progress indicators properly reset when starting a new task
v2.0
February 6, 2026

History Fix, Start New Task & Tailwind CDN

Run history is now fully clickable across sessions, a 'Start New Task' button lets you reset and begin fresh, and all generated websites automatically include Tailwind CSS + Inter font for polished styling.

FixedRun history click-to-view — fixed ownership check so runs from previous sessions are accessible (uses invite code instead of session ID)
FixedAll run endpoints (get, approve, abort, modify) now validate by invite code for cross-session access
NewStart New Task button — appears next to Run button after a task completes, resets all state for a fresh start
NewTailwind CDN auto-injection — generated HTML websites automatically include Tailwind CSS, Google Fonts (Inter), and base typography
ImprovedTailwind injection is smart — skips if Tailwind is already present, injects before </head> or creates <head> if missing
ImprovedModified websites also get Tailwind CDN injection when re-uploaded to S3
v1.9
February 6, 2026

Website Preview & Full HTML Pipeline

The AI CEO now generates complete, self-contained HTML websites when you ask for a website, landing page, or portfolio. Preview them live in an iframe with code view, fullscreen, and download options.

NewLive HTML Preview — website outputs render in an interactive iframe with Preview/Code toggle, fullscreen, download, and open-in-new-tab
NewSmart website detection — the Operator now automatically detects website tasks and outputs complete HTML instead of text
NewHTML-aware pipeline — Reviewer evaluates HTML quality (not citations), Finalizer preserves HTML as-is, Modifier re-uploads to S3
ImprovedHigher output limits — website tasks use 16K max tokens for richer, more complete HTML output
ImprovedClean HTML output — markdown code block wrapping and stray text are automatically stripped from LLM responses
ImprovedDesktop export buttons hidden for HTML — the HtmlPreview toolbar replaces generic .txt/.md/PDF buttons
FixedDraft modifications for websites now re-upload the updated HTML to S3 and update the project record
v1.8
February 6, 2026

Auto-Save Website Projects

When the CEO generates a website (HTML output), it is automatically saved as a project with a permanent shareable URL at /projects/:slug.

NewAuto-detect website output — HTML content is automatically identified when the CEO generates a full website
NewAuto-save to S3 — website HTML is uploaded to cloud storage with a permanent URL
NewProject creation — a project entry is created in the database with slug, title, and S3 link
NewGreen banner notification — after a website run completes, a banner shows with 'View Live' and 'Project Page' links
ImprovedSmart slug generation — URL-safe slugs are auto-generated from the task brief
ImprovedTitle extraction — project title is extracted from HTML <title> or <h1> tags
v1.7
February 6, 2026

Versions & Website Preview

Added Versions tab to mobile layout, View Website button for HTML outputs, and export buttons (Copy, .txt, .md, PDF) on every version.

NewView Website button — opens HTML output in a new tab as a live preview when version contains website code
NewVersions tab on mobile — now accessible on all screen sizes, not just desktop
NewExport buttons on Versions — Copy, .txt, .md, and PDF export for each version's draft output
ImprovedMobile tabs redesigned with flexible layout to accommodate all tabs including Versions
v1.6
February 6, 2026

Simplified Settings & Reliability

Redesigned run settings as a clean flat list, added delete run capability, and improved stability across the board.

NewRedesigned run settings — flat list of inputs instead of 4-category cards, type a number or 'E' for endless
NewDelete Run button — clean up failed or test runs from your history
NewSettings persistence — last-used values saved to localStorage, restored automatically
NewUntil Complete toggle — Yes/No switch with safety cap of 50 cycles
ImprovedAll limits configurable independently — Max Dollar, Max Time, Max Cycles, Max Replans, Max Web Searches
FixedtRPC error on invite page — improved error handling for HTML-instead-of-JSON responses
FixedGlobal query error handling — prevents app crashes from transient API errors
v1.5
February 6, 2026

Persistent Run History

Run history now persists across version updates and re-authentication — your past runs will never disappear again.

FixedRun history now tied to invite code instead of session ID — survives re-publishing and re-authentication
FixedAll previously 'lost' runs restored — data was always in the database, just queried by wrong field
ImprovedRate limiting cooldown now uses invite code for consistency across sessions
v1.4
February 6, 2026

Run Modes, Projects & Reliability

Four flexible run modes, live project previews, persistent invite codes, and critical stability fixes.

New4 Run Modes — choose Cycles, Time Limit, Cost Limit, or Until Complete (autonomous mode)
NewProject Generation — website outputs get live preview URLs at /projects/:slug
NewProjects listing page — browse all public projects built by the AI CEO
NewPersistent invite code — saved in localStorage so you don't re-enter it each version
NewToast notifications on draft modification completion
ImprovedExamples page updated to show all 18 game iterations
ImprovedRun mode selector with visual cards for easy selection
FixedReact error #310 — eliminated all early returns to prevent hook ordering crashes on login
FixedTyping lag — converted inline layout components to stable JSX variables
v1.3
February 6, 2026

Draft Modifications & CEO Attribution

Iterative draft refinement — ask the CEO to revise outputs, with clear attribution showing which agent wrote each draft.

NewDraft modification — request changes to completed drafts and the CEO (ChatGPT) will revise them
NewIterative refinement — submit multiple modification requests to progressively improve outputs
NewCEO attribution — 'CEO (ChatGPT) says:' header clearly shows which agent authored the draft
ImprovedModification counter tracks how many revisions have been made
ImprovedUpdated output automatically appears in the Final tab after each revision
v1.2
February 6, 2026

File Intelligence & Polish

Major upgrade to file handling with content extraction, drag-and-drop uploads, and version tracking across the app.

NewPDF and text file content extraction — the AI CEO can now read and analyze uploaded documents
NewDrag-and-drop file upload — drop files directly onto the task input area
NewVersion display across all pages (homepage, dashboard, instructions, examples)
NewChangelog page to track what's new in each release
NewSession recovery UI — resume incomplete runs from earlier sessions
NewVersion update notification — get alerted when a new version is available
ImprovedFile size limits displayed before upload (10MB per file, 20 files max)
ImprovedExtracted document content included in CEO prompt for deeper analysis
FixedSession persistence ensured across deployments — users never lose work
v1.1
January 28, 2026

File Uploads & Run History

Added file upload support, run history sidebar, and configurable run settings.

NewFile upload support — attach images, PDFs, and documents to tasks
NewRun history sidebar — browse and reload previous runs
NewConfigurable run settings — adjust cycles, searches, and replans per run
NewUser history page for logged-in users across all sessions
ImprovedReal-time progress bar during run execution
ImprovedCycle version tracking — compare outputs across refinement cycles
FixedFixed rate limiting to properly track per-invite daily limits
v1.0
January 15, 2026

Initial Release

First public release of AI Company — the AI CEO which thinks before acting.

NewThree-agent architecture: CEO (planner), Operator (executor), Reviewer (quality)
NewReal-time SSE streaming of all agent states and outputs
NewOption evaluation with risk assessment before execution
NewPlan approval gate — review and approve the CEO's plan before execution
NewEvidence tracking with source URLs for every claim
NewInvite-only access control with rate limiting
NewCopy and download final outputs
ImprovedMobile-responsive design with dedicated mobile and desktop layouts
ImprovedTrust banner — transparent about verification limitations

Have a feature request or found a bug?