Skip to main content

What it does

Launches a headless Chromium browser that the assistant can control — navigate to URLs, click elements, fill forms, read page content, and wait for changes. Uses accessibility snapshots to give the LLM numbered element references instead of fragile CSS selectors.

Requirements

Install Playwright and Chromium (one-time):
bunx playwright install chromium
The browser tool is automatically registered at gateway startup when Playwright is available.

How the assistant uses it

The browser tool exposes these actions:
ActionWhat it does
navigateGo to a URL
clickClick an element by its ref number
typeType text into an input field
snapshotGet the current page structure as numbered elements
waitWait for a selector, URL pattern, page state, or fixed delay
A typical flow:
  1. Assistant calls navigate to open a page
  2. Assistant calls snapshot to see the page structure
  3. Assistant identifies the right element by its ref number
  4. Assistant calls click or type to interact

Configuration

SettingDefaultDescription
headlesstrueRun without a visible browser window
maxResultChars50,000Max characters returned from snapshots
defaultTimeout30,000 msTimeout for page actions
The browser tool is heavier than web fetch. The assistant prefers web_fetch for simple page reads and only launches the browser when it needs to interact with a page (click, fill forms, handle JavaScript-rendered content).

When to use browser vs web fetch

Use browser whenUse web fetch when
Page requires JavaScript to renderPage is server-rendered HTML
You need to click or fill formsYou just need the text content
Content is behind interactionsContent is at a direct URL
You need to navigate through pagesSingle page read is enough