|
| 1 | +# Cua Playground: Agents + Sandboxes in Your Browser |
| 2 | + |
| 3 | +Building computer-use agents means constant iteration—writing code, deploying to a sandbox, testing behavior, debugging issues, then repeating the cycle. Every test requires switching between your code editor, terminal, and VNC viewer. Want to try a different prompt? Edit your code, redeploy, and wait for the agent to restart. It works, but it's slow. |
| 4 | + |
| 5 | +Today we're launching the **Cua Playground**: a browser-based environment for testing computer-use agents without writing code. Send messages to your sandboxes, watch them execute in real-time, and iterate on prompts instantly—all from your dashboard at cua.ai. |
| 6 | + |
| 7 | + |
| 8 | + |
| 9 | +**What's new with this release:** |
| 10 | + |
| 11 | +- Instant testing—send messages to any running sandbox directly from your browser |
| 12 | +- Real-time execution—watch your agent work with live tool call updates and screenshots |
| 13 | +- Multi-model support—test with Claude Sonnet 4.5, Haiku 4.5, and more |
| 14 | +- Persistent chat history—conversations save automatically to local storage |
| 15 | + |
| 16 | +The Playground connects to your existing Cua sandboxes—the same ones you use with the Agent SDK. Select a running sandbox and a model, then start chatting. The agent uses computer-use tools (mouse, keyboard, bash, editor) to complete your tasks, and you see every action it takes. |
| 17 | + |
| 18 | +## Getting Started Today |
| 19 | + |
| 20 | +<div align="center"> |
| 21 | + <video src="https://github.com/user-attachments/assets/9fef0f30-1024-4833-8b7a-6a2c02d8eb99" width="600" controls></video> |
| 22 | +</div> |
| 23 | + |
| 24 | + |
| 25 | +Sign up at [cua.ai/signin](https://cua.ai/signin) and grab your API key from the dashboard. Then navigate to the Playground: |
| 26 | + |
| 27 | +1. Navigate to Dashboard > Playground |
| 28 | +2. Select a sandbox from the dropdown (must be "running" status) |
| 29 | +3. Choose a model (we recommend Claude Sonnet 4.5 to start) |
| 30 | +4. Send a message: "Take a screenshot and describe what you see" |
| 31 | +5. Watch the agent execute computer actions in real-time |
| 32 | + |
| 33 | +Example use cases: |
| 34 | + |
| 35 | +**Prompt Testing** |
| 36 | +``` |
| 37 | +❌ "Check the website" |
| 38 | +✅ "Navigate to example.com in Firefox and take a screenshot of the homepage" |
| 39 | +``` |
| 40 | + |
| 41 | +**Model Comparison** |
| 42 | +Run the same task with different models to compare quality, speed, and cost. |
| 43 | + |
| 44 | +**Debugging Agent Behavior** |
| 45 | +1. Send: "Find the login button and click it" |
| 46 | +2. View tool calls to see each mouse movement |
| 47 | +3. Check screenshots to verify the agent found the right element |
| 48 | +4. Adjust your prompt based on what you observe |
| 49 | + |
| 50 | +## FAQs |
| 51 | + |
| 52 | +<details> |
| 53 | +<summary><strong>Do I need to know how to code?</strong></summary> |
| 54 | + |
| 55 | +No. The Playground is designed for testing agent behavior without writing code. However, for production deployments, you'll need to use the Agent SDK (Python/TypeScript). |
| 56 | + |
| 57 | +</details> |
| 58 | + |
| 59 | +<details> |
| 60 | +<summary><strong>Does this replace the Agent SDK?</strong></summary> |
| 61 | + |
| 62 | +No. The Playground is for rapid testing and experimentation. For production deployments, scheduled tasks, or complex workflows, use the Agent SDK. |
| 63 | + |
| 64 | +</details> |
| 65 | + |
| 66 | +<details> |
| 67 | +<summary><strong>How much does it cost?</strong></summary> |
| 68 | + |
| 69 | +Playground requests use the same credit system as Agent SDK requests. You're charged for model inference (varies by model) and sandbox runtime (billed per hour while running). |
| 70 | + |
| 71 | +</details> |
| 72 | + |
| 73 | +<details> |
| 74 | +<summary><strong>Why is my sandbox not showing up?</strong></summary> |
| 75 | + |
| 76 | +The sandbox must have `status = "running"` to appear in the dropdown. Check Dashboard > Sandboxes to verify status. If stopped, click "Start" and wait ~30 seconds for it to become available. |
| 77 | + |
| 78 | +</details> |
| 79 | + |
| 80 | +## Need help? |
| 81 | + |
| 82 | +If you hit issues getting the Playground working, reach out in [Discord](https://discord.gg/cua-ai). We respond fast and fix based on what people actually use. |
| 83 | + |
| 84 | +--- |
| 85 | + |
| 86 | +Get started at [cua.ai](https://cua.ai) or try the Playground at [cua.ai/dashboard/playground](https://cua.ai/dashboard/playground). |
0 commit comments