Tool Categories
Core Workflow Pattern
Most browser automation follows this pattern:
- Navigate to the target page
- Snapshot to understand page structure
- Interact with specific elements using refs from snapshot
- Screenshot or capture results
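The four steps above can be sketched as an ordered plan of tool calls. This is a minimal illustration rather than Browseagent's documented API: only the tool names come from this page, while the argument names (`url`, `element`, `ref`), the helper `build_workflow`, and the example ref `e12` are assumptions.

```python
from typing import Any


def build_workflow(url: str, element: str, ref: str) -> list[dict[str, Any]]:
    """Return the navigate -> snapshot -> interact -> screenshot sequence.

    Argument names are assumptions made for illustration.
    """
    return [
        {"tool": "browser_navigate", "arguments": {"url": url}},
        {"tool": "browser_snapshot", "arguments": {}},
        # In a real run the ref is read from the snapshot taken in the previous step.
        {"tool": "browser_click", "arguments": {"element": element, "ref": ref}},
        {"tool": "browser_screenshot", "arguments": {}},
    ]


plan = build_workflow("https://example.com", "Sign in button", "e12")
for step in plan:
    print(step["tool"], step["arguments"])
```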
Essential Tools for Getting Started
browser_navigate
Navigate to any website.
browser_snapshot
Get page structure and element references. The snapshot output includes ref attributes for targeting elements.
browser_click
Click page elements using refs from snapshot.
browser_screenshot
Capture visual results.
Element Reference System
Browseagent uses a reference system to reliably target page elements:
- Take snapshot to get current page structure
- Find target element in snapshot output
- Use the ref attribute for precise targeting
- Provide human description for context
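The lookup described above might look like the following sketch. The snapshot output format is not shown on this page, so the sample text, the `[ref=...]` marker syntax, and the helper `find_ref` are all hypothetical; the `element` and `ref` argument names are likewise assumptions.

```python
import re
from typing import Optional

# Hypothetical snapshot text; the real output format may differ.
SNAPSHOT = """\
- heading "Welcome" [ref=e1]
- textbox "Email" [ref=e2]
- button "Sign in" [ref=e3]
"""

# Matches a quoted label followed by a [ref=...] marker (assumed syntax).
REF_PATTERN = re.compile(r'"(?P<label>[^"]+)"\s+\[ref=(?P<ref>[^\]]+)\]')


def find_ref(snapshot: str, label: str) -> Optional[str]:
    """Return the ref of the first element whose quoted label matches exactly."""
    for match in REF_PATTERN.finditer(snapshot):
        if match.group("label") == label:
            return match.group("ref")
    return None


ref = find_ref(SNAPSHOT, "Sign in")
# Pair the ref with a human-readable description for context.
click_args = {"element": "Sign in button", "ref": ref}
print(click_args)
```

Because refs belong to one snapshot, a lookup like this should run against the most recent snapshot rather than a cached one.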
Common Usage Patterns
Form Automation
- Navigate to page with form
- Snapshot to see form structure
- Type into input fields using refs
- Click submit button
- Screenshot results
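As a rough sketch of the form steps that follow the snapshot, the helper below builds one typing call per field, then the submit click and the final screenshot. The typing tool is not named on this page, so `browser_type` is a hypothetical name, as are the `ref`, `text`, and `element` argument names and the example refs.

```python
from typing import Any


def form_calls(field_values: dict[str, str], submit_ref: str) -> list[dict[str, Any]]:
    """Build the calls that follow the snapshot: type into each field, submit, capture.

    field_values maps a field's ref (read from the snapshot) to the text to type.
    "browser_type" and all argument names are assumptions.
    """
    calls: list[dict[str, Any]] = [
        {"tool": "browser_type", "arguments": {"ref": ref, "text": text}}
        for ref, text in field_values.items()
    ]
    calls.append(
        {"tool": "browser_click",
         "arguments": {"element": "Submit button", "ref": submit_ref}}
    )
    calls.append({"tool": "browser_screenshot", "arguments": {}})
    return calls


calls = form_calls({"e2": "user@example.com", "e4": "example-password"}, submit_ref="e7")
for call in calls:
    print(call)
```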
Data Extraction
- Navigate to data source page
- Screenshot for visual confirmation
- Snapshot to get structured data
- Extract specific information from snapshot
Multi-step Workflows
- Navigate to starting page
- For each step:
  - Snapshot current state
  - Interact with elements
  - Wait if needed for page changes
- Screenshot final results
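The loop above can be sketched with a generic `call_tool` function standing in for whatever client sends tool calls to Browseagent. That function, the `run_workflow` helper, the `time` argument of `browser_wait`, and the example refs are assumptions; a recording stub replaces the real browser so the sketch runs on its own.

```python
from typing import Any, Callable

ToolCaller = Callable[[str, dict[str, Any]], Any]
# A step inspects the latest snapshot and returns (tool name, arguments).
Step = Callable[[Any], tuple[str, dict[str, Any]]]


def run_workflow(call_tool: ToolCaller, start_url: str,
                 steps: list[Step], wait_seconds: int = 0) -> Any:
    """Navigate once, then snapshot, interact, and optionally wait for each step."""
    call_tool("browser_navigate", {"url": start_url})
    for step in steps:
        snapshot = call_tool("browser_snapshot", {})
        tool, arguments = step(snapshot)
        call_tool(tool, arguments)
        if wait_seconds:
            # "time" is an assumed parameter name for browser_wait.
            call_tool("browser_wait", {"time": wait_seconds})
    return call_tool("browser_screenshot", {})


# Stub client that records which tools were called, in order.
log: list[str] = []


def fake_call_tool(name: str, arguments: dict[str, Any]) -> str:
    log.append(name)
    return f"<{name} result>"


steps: list[Step] = [
    lambda snapshot: ("browser_click", {"element": "Next button", "ref": "e5"}),
    lambda snapshot: ("browser_click", {"element": "Confirm button", "ref": "e9"}),
]
run_workflow(fake_call_tool, "https://example.com/start", steps, wait_seconds=1)
print(log)
```

Taking a fresh snapshot inside the loop, rather than once up front, keeps each interaction tied to the page state it actually targets.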
Error Handling
Element Not Found
If an element ref doesn’t work:
- Take a new snapshot (page may have changed)
- Find the updated ref for your target element
- Retry the interaction
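One way to automate this recovery is a small retry wrapper that re-snapshots before every attempt. How Browseagent reports a failed ref is not described here, so the `StaleRefError` exception, the `call_tool` and `find_ref` callables, and the argument names are hypothetical; the stub simulates a page whose ref changes after the first failed click.

```python
from typing import Any, Callable, Optional


class StaleRefError(Exception):
    """Hypothetical error raised when a ref no longer matches an element."""


def click_with_retry(call_tool: Callable[[str, dict[str, Any]], Any],
                     find_ref: Callable[[Any, str], Optional[str]],
                     label: str, retries: int = 2) -> Any:
    """Snapshot, look up the ref, click; on failure repeat with a fresh snapshot."""
    for _ in range(retries + 1):
        snapshot = call_tool("browser_snapshot", {})
        ref = find_ref(snapshot, label)
        if ref is None:
            continue  # element not in this snapshot; try a newer one
        try:
            return call_tool("browser_click", {"element": label, "ref": ref})
        except StaleRefError:
            pass  # page changed between snapshot and click; loop re-snapshots
    raise StaleRefError(f"could not click {label!r} after {retries + 1} attempts")


# Stub: the first snapshot hands out ref e3, which fails; later ones hand out e8.
state = {"clicks": 0}


def fake_call_tool(name: str, arguments: dict[str, Any]) -> Any:
    if name == "browser_snapshot":
        return {"Submit": "e3" if state["clicks"] == 0 else "e8"}
    state["clicks"] += 1
    if arguments["ref"] == "e3":
        raise StaleRefError("ref e3 not found")
    return "clicked " + arguments["ref"]


result = click_with_retry(fake_call_tool, lambda snap, label: snap.get(label), "Submit")
print(result)
```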
Page Loading Issues
If pages don’t load completely:
- Use browser_wait to allow loading time
- Take screenshot to visually verify page state
- Retry snapshot once page is fully loaded
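A wait-and-retry loop along these lines is sketched below. The `time` argument of `browser_wait`, the `call_tool` client, the `snapshot_when_ready` helper, and the `is_ready` check are assumptions. The screenshot is taken only on timeout here, since visual verification is a manual step.

```python
from typing import Any, Callable


def snapshot_when_ready(call_tool: Callable[[str, dict[str, Any]], Any],
                        is_ready: Callable[[Any], bool],
                        wait_seconds: int = 2, max_waits: int = 5) -> Any:
    """Wait, re-snapshot, and return the first snapshot that passes is_ready."""
    for _ in range(max_waits):
        # "time" is an assumed parameter name for browser_wait.
        call_tool("browser_wait", {"time": wait_seconds})
        snapshot = call_tool("browser_snapshot", {})
        if is_ready(snapshot):
            return snapshot
    # Capture the page for manual inspection before giving up.
    call_tool("browser_screenshot", {})
    raise TimeoutError(f"page not ready after {max_waits} waits")


# Stub: the page reports "Loading..." twice before its real content appears.
pages = iter(["Loading...", "Loading...", "Dashboard [ref=e1]"])
calls: list[str] = []


def fake_call_tool(name: str, arguments: dict[str, Any]) -> Any:
    calls.append(name)
    if name == "browser_snapshot":
        return next(pages)
    return None


snapshot = snapshot_when_ready(fake_call_tool, lambda s: "Loading" not in s, wait_seconds=1)
print(snapshot, calls)
```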
Connection Issues
If tools return connection errors:
- Verify Chrome extension is connected
- Check extension popup status
- Restart browser or reconnect extension