trycua
diff --git a/.github/workflows/npm-publish-cli.yml b/.github/workflows/npm-publish-cli.yml
Lines changed: 43 additions & 0 deletions
diff --git a/.prettierignore b/.prettierignore
Lines changed: 4 additions & 1 deletion
diff --git a/README.md b/README.md
Lines changed: 10 additions & 8 deletions
diff --git a/blog/build-your-own-operator-on-macos-2.md b/blog/build-your-own-operator-on-macos-2.md
Lines changed: 1 addition & 1 deletion
diff --git a/docs/content/docs/agent-sdk/agent-loops.mdx b/docs/content/docs/agent-sdk/agent-loops.mdx
Lines changed: 1 addition & 1 deletion
diff --git a/docs/content/docs/agent-sdk/custom-computer-handlers.mdx b/docs/content/docs/agent-sdk/custom-computer-handlers.mdx
Lines changed: 2 additions & 2 deletions
diff --git a/docs/content/docs/agent-sdk/custom-tools.mdx b/docs/content/docs/agent-sdk/custom-tools.mdx
Lines changed: 2 additions & 2 deletions
diff --git a/docs/content/docs/agent-sdk/integrations/meta.json b/docs/content/docs/agent-sdk/integrations/meta.json
Lines changed: 2 additions & 5 deletions
diff --git a/docs/content/docs/agent-sdk/meta.json b/docs/content/docs/agent-sdk/meta.json
Lines changed: 1 addition & 0 deletions
diff --git a/docs/content/docs/telemetry.mdx b/docs/content/docs/agent-sdk/telemetry.mdx
Renamed: docs/content/docs/telemetry.mdx to docs/content/docs/agent-sdk/telemetry.mdx
Lines changed: 41 additions & 58 deletions
@@ -0,0 +1,43 @@
+name: Publish @trycua/cli to npm
+
+on:
+  push:
+    branches: main
+
+jobs:
+  publish:
+    permissions:
+      id-token: write
+      contents: read
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@v4
+
+      - name: Setup Bun
+        uses: oven-sh/setup-bun@v2
+        with:
+          bun-version: latest
+
+      - name: Check if version changed
+        id: check-version
+        uses: EndBug/version-check@v2
+        with:
+          file-name: libs/typescript/cua-cli/package.json
+          diff-search: true
+
+      - name: Install dependencies
+        if: steps.check-version.outputs.changed == 'true'
+        working-directory: ./libs/typescript/cua-cli
+        run: bun install --frozen-lockfile
+
+      - name: Build package
+        if: steps.check-version.outputs.changed == 'true'
+        working-directory: ./libs/typescript/cua-cli
+        run: bun run build --if-present
+
+      - name: Publish to npm
+        if: steps.check-version.outputs.changed == 'true'
+        working-directory: ./libs/typescript/cua-cli
+        run: bun publish --access public
+        env:
+          NPM_TOKEN: ${{ secrets.NPM_TOKEN }}
@@ -29,4 +29,7 @@ venv/
 *.db
 *.sqlite
 pnpm-lock.yaml
-uv.lock
+uv.lock
+
+# Docs with complex JSX formatting
+docs/content/docs/get-started/quickstart.mdx
@@ -6,11 +6,7 @@
   </picture>
 
 [![Python](https://img.shields.io/badge/Python-333333?logo=python&logoColor=white&labelColor=333333)](#)
-[![Swift](https://img.shields.io/badge/Swift-F05138?logo=swift&logoColor=white)](#)
-[![macOS](https://img.shields.io/badge/macOS-000000?logo=apple&logoColor=F0F0F0)](#)
 [![Discord](https://img.shields.io/badge/Discord-%235865F2.svg?&logo=discord&logoColor=white)](https://discord.com/invite/mVnXXpdE85)
-[![OSWorld](https://img.shields.io/badge/OSWorld-Benchmark-blue)](https://os-world.github.io/)
-[![HUD](https://img.shields.io/badge/HUD-Integration-green)](https://hud.so)
 <br>
 <a href="https://trendshift.io/repositories/13685" target="_blank"><img src="https://trendshift.io/api/badge/repositories/13685" alt="trycua%2Fcua | Trendshift" style="width: 250px; height: 55px;" width="250" height="55"/></a>
 
@@ -121,7 +117,7 @@ from agent import ComputerAgent
 # ComputerAgent works with any computer initialized with the Computer SDK
 
 agent = ComputerAgent(
-    model="anthropic/claude-3-5-sonnet-20241022",
+    model="anthropic/claude-sonnet-4-5-20250929",
     tools=[computer],
     max_trajectory_budget=5.0
 )
@@ -200,9 +196,9 @@ Cua uses the OpenAI Agent response format.
 
 These are the valid model configurations for `ComputerAgent(model="...")`:
 
-| Configuration                            | Description                                                                                                                                     |
-| ---------------------------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------- |
-| `{computer-use-model}`                   | A single model to perform all computer-use tasks                                                                                                |
+| Configuration                            | Description                                                                                                                                |
+| ---------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------ |
+| `{computer-use-model}`                   | A single model to perform all computer-use tasks                                                                                           |
 | `{grounding-model}+{any-vlm-with-tools}` | [Composed](https://cua.ai/docs/agent-sdk/supported-agents/composed-agents) with VLM for captioning and grounding LLM for element detection |
 | `moondream3+{any-llm-with-tools}`        | [Composed](https://cua.ai/docs/agent-sdk/supported-agents/composed-agents) with Moondream3 for captioning and UI element detection         |
 | `human/human`                            | A [human-in-the-loop](https://cua.ai/docs/agent-sdk/supported-agents/human-in-the-loop) in place of a model                                |
@@ -372,32 +368,38 @@ Learn more in the [SOM documentation](./libs/python/som/README.md).
 ## 2025
 
 ### September 2025
+
 - **Hack the North Competition**: First benchmark-driven hackathon track with guaranteed YC interview prize. Winner achieved 68.3% on OSWorld-Tiny ([Blog Post](https://www.cua.ai/blog/hack-the-north))
 - **Global Hackathon Launch**: Ollama × Cua global online competition for creative local/hybrid agents
 
 ### August 2025
+
 - **v0.4 Release - Composite Agents**: Mix grounding + planning models with `+` operator (e.g., `"GTA-7B+GPT-4o"`) ([Blog Post](https://www.cua.ai/blog/composite-agents))
 - **HUD Integration**: One-line benchmarking on OSWorld-Verified with live trace visualization ([Blog Post](https://www.cua.ai/blog/hud-agent-evals))
 - **Human-in-the-Loop**: Interactive agent mode with `human/human` model string
 - **Web-Based Computer Use**: Browser-based agent execution ([Blog Post](https://www.cua.ai/blog/bringing-computer-use-to-the-web))
 
 ### June 2025
+
 - **Windows Sandbox Support**: Native Windows agent execution ([Blog Post](https://www.cua.ai/blog/windows-sandbox))
 - **Containerization Evolution**: From Lume to full Docker support ([Blog Post](https://www.cua.ai/blog/lume-to-containerization))
 - **Sandboxed Python Execution**: Secure code execution in agent workflows
 
 ### May 2025
+
 - **Cua Cloud Containers**: Production-ready cloud deployment with elastic scaling ([Blog Post](https://www.cua.ai/blog/introducing-cua-cloud-containers))
 - **Trajectory Viewer**: Visual debugging tool for agent actions ([Blog Post](https://www.cua.ai/blog/trajectory-viewer))
 - **Training Data Collection**: Tools for creating computer-use training datasets ([Blog Post](https://www.cua.ai/blog/training-computer-use-models-trajectories-1))
 - **App-Use Framework**: Mobile and desktop app automation capabilities
 
 ### April 2025
+
 - **Agent Framework v0.4**: Unified API for 100+ model configurations
 - **UI-TARS Integration**: Local inference support for ByteDance's desktop-optimized model
 - **Blog Series**: "Build Your Own Operator" tutorials ([Part 1](https://www.cua.ai/blog/build-your-own-operator-on-macos-1) | [Part 2](https://www.cua.ai/blog/build-your-own-operator-on-macos-2))
 
 ### March 2025
+
 - **Initial Public Release**: Core Agent SDK and Computer SDK
 - **Lume VM Manager**: macOS VM management tool for local development
 
 
@@ -268,7 +268,7 @@ from agent import ComputerAgent
 async def run_multi_task_workflow():
     async with Computer() as macos_computer:
         agent = ComputerAgent(
-            model="anthropic/claude-3-5-sonnet-20241022",
+            model="anthropic/claude-sonnet-4-5-20250929",
             tools=[macos_computer]
         )
 
 
@@ -89,7 +89,7 @@ Use the following environment variables to configure the agent and its access to
 
 ```bash
 # Computer instance (cloud)
-export CUA_CONTAINER_NAME="your-container-name"
+export CUA_SANDBOX_NAME="your-sandbox-name"
 export CUA_API_KEY="your-cua-api-key"
 
 # LLM API keys
 
@@ -34,7 +34,7 @@ You can then use this as a tool for your agent:
 from agent import ComputerAgent
 
 agent = ComputerAgent(
-    model="anthropic/claude-3-5-sonnet-20241022",
+    model="anthropic/claude-sonnet-4-5-20250929",
     tools=[custom_computer],
 )
 
@@ -122,7 +122,7 @@ class MyCustomComputer(AsyncComputerHandler):
 custom_computer = MyCustomComputer()
 
 agent = ComputerAgent(
-    model="anthropic/claude-3-5-sonnet-20241022",
+    model="anthropic/claude-sonnet-4-5-20250929",
     tools=[custom_computer],
 )
 
 
@@ -16,7 +16,7 @@ def calculate(a: int, b: int) -> int:
 
 # Use with agent
 agent = ComputerAgent(
-    model="anthropic/claude-3-5-sonnet-20241022",
+    model="anthropic/claude-sonnet-4-5-20250929",
     tools=[computer, calculate]
 )
 ```
@@ -43,7 +43,7 @@ from computer import Computer
 
 computer = Computer(...)
 agent = ComputerAgent(
-    model="anthropic/claude-3-5-sonnet-20241022",
+    model="anthropic/claude-sonnet-4-5-20250929",
     tools=[computer, read_file],
 )
 ```
@@ -1,7 +1,4 @@
 {
   "title": "Integrations",
-  "pages": [
-    "hud",
-    "observability"
-  ]
-}
+  "pages": ["hud", "observability"]
+}
@@ -13,6 +13,7 @@
     "custom-computer-handlers",
     "prompt-caching",
     "usage-tracking",
+    "telemetry",
     "benchmarks",
     "migration-guide",
     "integrations"
 
@@ -1,135 +1,118 @@
 ---
 title: Telemetry
-description: This document explains how telemetry works in CUA libraries and how you can control it.
-icon: RadioTower
+description: How telemetry works in Cua and how to control it
 ---
 
-# Telemetry in CUA
+# Telemetry
 
-CUA tracks anonymized usage and error report statistics; we ascribe to Posthog's approach as detailed [here](https://posthog.com/blog/open-source-telemetry-ethical). If you would like to opt out of sending anonymized info, you can set `telemetry_enabled` to false.
+Cua collects anonymized usage and error statistics. We follow [Posthog's ethical telemetry approach](https://posthog.com/blog/open-source-telemetry-ethical). To opt out, set `telemetry_enabled` to false.
 
-## What telemetry data we collect
+## What we collect
 
-CUA libraries collect usage data to help improve our software. We have two categories of telemetry:
+### Enabled by default (opt-out)
 
-### Opt-Out Telemetry (Enabled by Default)
+- System info: OS, OS version, Python version
+- Module initialization: When modules are imported and their versions
+- Performance: Agent run durations, step counts, token usage, API costs
+- Session tracking: Anonymous session IDs and run IDs
 
-Basic performance metrics and system information that help us understand usage patterns:
-
-- **System Information**: Operating system, OS version, Python version
-- **Module Initialization**: When modules are imported and their versions
-- **Performance Metrics**: Agent run durations, step counts, token usage, and API costs
-- **Session Tracking**: Anonymous session IDs and run IDs for performance analysis
-
-### Opt-In Telemetry (Disabled by Default)
-
-**Conversation Trajectory Logging**: Full conversation history including:
+### Disabled by default (opt-in)
 
+**Trajectory logging** captures full conversation history:
 - User messages and agent responses
-- Computer actions and their outputs
-- Reasoning traces from the agent
+- Computer actions and outputs
+- Agent reasoning traces
 
-**Important**: Trajectory logging is **opt-in only** and must be explicitly enabled.
+Must be explicitly enabled.
 
-### We do NOT collect:
+### We don't collect
 
 - Personal information or user identifiers
 - API keys or credentials
 - File contents or application data
-- Information about files being accessed
-- Actual screenshots or screen contents (unless trajectory logging is enabled)
-- Specific text being typed, including user inputs, model outputs, computer outputs, or tool call outputs (unless trajectory logging is enabled)
-
-## Controlling Telemetry
+- Files being accessed
+- Screenshots or screen contents (unless trajectory logging is enabled)
+- Text being typed, user inputs, model outputs, computer outputs, or tool call outputs (unless trajectory logging is enabled)
 
-We are committed to transparency and user control over telemetry. There are two ways to control telemetry:
+## How to disable
 
-### 1. Environment Variable (Global Control)
+### Environment variable (global)
 
-Telemetry is enabled by default. To disable telemetry, set the `CUA_TELEMETRY_ENABLED` environment variable to a falsy value (`0`, `false`, `no`, or `off`):
+Set `CUA_TELEMETRY_ENABLED` to a falsy value (`0`, `false`, `no`, or `off`):
 
 ```bash
-# Disable telemetry before running your script
 export CUA_TELEMETRY_ENABLED=false
-
-# Or as part of the command
-CUA_TELEMETRY_ENABLED=1 python your_script.py
-
 ```
 
-Or from Python:
+Or in Python:
 
 ```python
 import os
 os.environ["CUA_TELEMETRY_ENABLED"] = "false"
 ```
 
-### 2. Instance-Level Control
+### Per instance
 
-#### Computer SDK
+**Computer SDK:**
 
 ```python
 from computer import Computer
 
-# Enable telemetry (default)
-computer = Computer(telemetry_enabled=True)
-
-# Disable telemetry
 computer = Computer(telemetry_enabled=False)
 ```
 
-#### Agent SDK
+**Agent SDK:**
 
 ```python
 from agent import ComputerAgent
 import os
 
 # Basic telemetry - performance metrics only (opt-out, enabled by default)
 agent = ComputerAgent(
-    model="claude-3-5-sonnet-20241022",
+    model="claude-sonnet-4-5-20250929",
     telemetry_enabled=True  # Default is True
 )
 
 # Enable telemetry with full conversation trajectory logging (opt-in)
 agent = ComputerAgent(
-    model="claude-3-5-sonnet-20241022",
+    model="claude-sonnet-4-5-20250929",
     telemetry_enabled={
         "log_trajectory": True  # Logs full conversation items
     }
 )
 
-# Disable telemetry completely
+# Disable completely
 agent = ComputerAgent(
-    model="claude-3-5-sonnet-20241022",
+    model="claude-sonnet-4-5-20250929",
     telemetry_enabled=False
 )
 
-# Disable telemetry completely using environment variables
-os.environ["CUA_TELEMETRY_ENABLED"] = "false"
+# Enable trajectory logging (opt-in)
 agent = ComputerAgent(
-    model="claude-3-5-sonnet-20241022"
+    model="claude-sonnet-4-5-20250929",
+    telemetry_enabled={"log_trajectory": True}
 )
 ```
 
-You can check if telemetry is enabled for an instance:
+Check status:
 
 ```python
-print(computer.telemetry_enabled)  # Will print True or False
-print(agent.telemetry_enabled)     # Will print True, False, or dict
+print(computer.telemetry_enabled)  # True or False
+print(agent.telemetry_enabled)     # True, False, or dict
 ```
 
-Note that telemetry settings must be configured during initialization and cannot be changed after the object is created.
+Telemetry settings are configured at initialization and can't be changed afterward.
 
-## Detailed Telemetry Events
+## Events collected
 
-### Computer SDK Events
+### Computer SDK
 
 | Event Name               | Data Collected                                                                                                                        | Trigger Notes                                                           |
 | ------------------------ | ------------------------------------------------------------------------------------------------------------------------------------- | ----------------------------------------------------------------------- |
 | **computer_initialized** | • `os`: Operating system (e.g., 'windows', 'darwin', 'linux')<br />• `os_version`: OS version<br />• `python_version`: Python version | Triggered when a Computer instance is created                           |
 | **module_init**          | • `module`: "computer"<br />• `version`: Package version<br />• `python_version`: Full Python version string                          | Triggered once when the computer package is imported for the first time |
 
-### Agent SDK Events
+### Agent SDK
 
 | Event Name              | Data Collected                                                                                                                                                                                                                                                                                                        | Trigger Notes                                                         |
 | ----------------------- | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | --------------------------------------------------------------------- |
@@ -140,6 +123,6 @@ Note that telemetry settings must be configured during initialization and cannot
 | **agent_step**          | • `session_id`: Agent session UUID<br />• `run_id`: Run UUID<br />• `step`: Step number (incremental)<br />• `timestamp`: Unix timestamp<br />• `duration_seconds`: Duration of previous step                                                                                                                         | Triggered on each agent response/step during a run                    |
 | **agent_usage**         | • `session_id`: Agent session UUID<br />• `run_id`: Run UUID<br />• `step`: Current step number<br />• `prompt_tokens`: Tokens in prompt<br />• `completion_tokens`: Tokens in response<br />• `total_tokens`: Total tokens used<br />• `response_cost`: Cost of this API call                                        | Triggered whenever usage information is received from LLM API         |
 
-## Transparency
+## Questions
 
-We believe in being transparent about the data we collect. If you have any questions about our telemetry practices, please open an issue on our GitHub repository.
+Questions about telemetry? Open an issue on our [GitHub repository](https://github.com/trycua/cua).
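
The telemetry doc in this diff says `CUA_TELEMETRY_ENABLED` disables telemetry when set to a falsy value (`0`, `false`, `no`, or `off`) and that telemetry is on by default. A minimal sketch of how such a check could be written is shown here. The helper name `telemetry_enabled_from_env`, the case-insensitive matching, and the whitespace stripping are assumptions made for illustration; this is not taken from the Cua source.

```python
import os

# Falsy spellings listed in the telemetry doc.
FALSY_VALUES = {"0", "false", "no", "off"}


def telemetry_enabled_from_env(environ=None) -> bool:
    """Hypothetical helper: False only if CUA_TELEMETRY_ENABLED holds a falsy value."""
    env = os.environ if environ is None else environ
    raw = env.get("CUA_TELEMETRY_ENABLED")
    if raw is None:
        # Unset variable: telemetry stays on, matching the documented opt-out default.
        return True
    # Assumption: compare case-insensitively and ignore surrounding whitespace.
    return raw.strip().lower() not in FALSY_VALUES


if __name__ == "__main__":
    print(telemetry_enabled_from_env({"CUA_TELEMETRY_ENABLED": "off"}))
```

Passing a plain dict instead of `os.environ` keeps the check easy to exercise without mutating the process environment.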