Native macOS alpha today. Managed AI, STT, vision, wallet, and account sync are wired through Bluey Cloud.
Bluey
Auto -> Deep/System Design
Balance $30.00
Transcript / You
Walk me through a production AI architecture for three workflows with safety, latency, and testing.
Bluey response
Start with a central orchestrator. Keep channels thin, route each workflow to a specialist lane, and make policy checks explicit before any agent calls tools.
Why this answer
Classified as deep system design from transcript length, architecture terms, and multi-workflow constraints. Canvas opened for structure.
CanvasArchitecture
+
Ask anything or let transcript fill in...
Docs
Screen
↑
Answer flow
Bluey turns live context into a response you can use.
Typed question, transcript, screen analysis, and attached files all become one clean request. The UI keeps the question on the right, the answer on the left, and opens canvas only when structure helps.
1. Capture intent
Start listening, type a question, attach docs, or analyse the screen. Bluey keeps source labels and waits for user action.
2. Route the task
Quick prompts start fast. Coding, system design, documents, and vision move to stronger lanes automatically.
3. Stream the answer
Answers appear line by line. When the output needs code, design, or structured edits, canvas opens beside chat.
What it helps with
One overlay for calls, code, documents, and follow-ups.
STT
Live calls
System and microphone transcript stay source-labeled so answers know who said what.
DEV
Coding help
Code-style answers open canvas for implementation, complexity, tests, and follow-up changes.
DOC
Document context
Attach readable files and turn them into session context and searchable account knowledge.
VIS
Screen analysis
Analyse the current screen with consent, then route through vision and normal answer lanes.
Managed routing
No provider menu for the user. Bluey picks the lane.
Users see Auto, not a wall of models. Internally Bluey routes across quick, balanced, deep, and vision paths based on task shape and attached context.
1
Instant
Short asks and quick clarifications start in the lowest-latency lane.
2
Balanced
General work and normal technical answers use the default managed lane.
3
Deep
Architecture, coding, reasoning, and long-form tasks get stronger planning.
4
Vision
Screen, image, and diagram asks include visual context and can still escalate.
Credits
Pay for the managed lanes you actually use.
Bluey uses a prepaid wallet. Reload credits, run until zero, and avoid surprise usage. The account page shows balance, recent usage, and tier classification.
$30 reload
Minimum reload for alpha. Credits expire after one year and stop at zero.
Cost labels
Each managed response can show the lane and cost once billing events are recorded.
Capacity aware
Provider keys, capacity buckets, and Redis-backed health tracking are part of the managed server path.
Local fallback
Local models are availability fallback, not the customer-facing paid model menu.
Control
Context should be obvious, permissioned, and recoverable.
Visible controls
Listen, docs, screen, style, and answer actions live in the overlay composer.
Session history
Past sessions can be reopened, renamed, and continued instead of disappearing.
Knowledge base
Useful documents and transcripts can sync to cloud RAG for follow-up work.
Supportable logs
Trace IDs, redaction, and support bundles make real debugging possible.
Start with one command.
Install Bluey, run the overlay, and connect your account when cloud answers are needed.