AI Agent Index
Browser Use logo

Browser Use

4.2/ 5

by Browser Use

MCPEditorial review
Visit site →

Open-source Python library for AI browser automation using LLMs and computer vision. Free tier with 10 tasks/month, cloud from $29/month. MCP server included. 90,000+ GitHub stars.

Browser Use is an open-source Python library that lets AI agents control a real browser using LLMs instead of brittle CSS selectors or XPath scripts. Rather than pre-programming every interaction with fragile selector code, agents describe what they want in natural language and Browser Use figures out how to execute it against whatever website it encounters, including sites it has never seen before. With over 90,000 GitHub stars, it is one of the most widely adopted open-source browser automation libraries in the AI agent ecosystem. The library provides a self-healing automation harness: when a website's layout changes, Browser Use adapts because it reasons from the page's current visual and DOM state rather than hardcoded selectors. This makes it well-suited for automating legacy enterprise web applications, government portals, insurance platforms, healthcare systems, and any site without a public API. The cloud platform adds stealth browsers that mimic human browser fingerprints to avoid bot detection, handling canvas, WebGL, and audio fingerprinting automatically. The product line has multiple tiers. The open-source library (MIT license, pip install browser-use) is free and self-hosted, giving direct browser automation access with your own LLM API keys. The cloud platform adds managed infrastructure: Free tier (10 tasks/month, 3 concurrent sessions, advanced stealth), Dev ($29/month, $29 in credits, 25 concurrent sessions), Business and Scaleup tiers for higher volume. Browser Use Box is a 24/7 cloud agent product that runs continuously on your behalf. Browser Use ships both a local MCP server (stdio) and a hosted Cloud MCP server. This means Claude Desktop, Cursor, and other MCP-compatible agents can directly invoke browser automation tools including navigate, fill forms, extract data, manage tabs, and run autonomous agent tasks, without custom integration code. Benchmarks report 97% task accuracy on the Browser Use cloud benchmark. The library supports any LLM via BYOK, including OpenAI, Anthropic, Google Gemini, Groq, and local models via Ollama. Works with n8n, LangChain, and other orchestration platforms. Key limitations: the open-source library requires your own compute and API key management, complex multi-step tasks on JavaScript-heavy or bot-detection sites can still fail, and the cloud platform's usage-based pricing offers less cost predictability than flat-rate enterprise RPA tools.

Pricing

freemium · Free

Segment

b2b

Setup

easy

Verified

May 14, 2026

Capabilities

web-searchdata-analysisautonomousno-codeworkflow-builder

Pros & Limitations

Editorial assessment

Pros

  • Self-healing browser automation using LLMs and computer vision instead of brittle CSS selectors or XPath, adapting to website layout changes without code maintenance and making it reliable for automating legacy web apps, government portals, and sites without public APIs
  • Native MCP server integration (local stdio and hosted cloud) lets Claude Desktop, Cursor, and other MCP-compatible agents invoke browser automation directly without custom integration code, exposing navigate, fill forms, extract data, and manage tabs as native agent tools
  • MIT-licensed open-source library with 90,000+ GitHub stars and BYOK model support covering OpenAI, Anthropic, Gemini, Groq, and local models via Ollama, giving teams full model flexibility and zero vendor lock-in

Limitations

  • Complex multi-step workflows on JavaScript-heavy or bot-detection sites can fail, as the LLM reasoning approach adds latency and cost per task compared to deterministic selector-based automation, and success rates on adversarial sites vary significantly
  • Open-source self-hosting requires managing your own compute, LLM API keys, and browser infrastructure, meaning teams without DevOps resources face meaningful operational overhead compared to fully managed browser automation alternatives
  • Cloud pricing is usage-based and the platform is early-stage, with the free tier limited to 10 tasks/month, and high-volume production workloads face less cost predictability than flat-rate enterprise RPA tools

Technical Details

Deployment
cloudself-hostedapi
Avg setup timeUnder 5 minutes for cloud (API key only); under 10 minutes for open-source (pip install browser-use, LLM API key)
Autonomous rateAutonomously navigates websites, fills forms, clicks elements, extracts data, and completes multi-step browser workflows without per-step human approval once given a natural language task description.
MCP compatibleYes
Integrations
OpenAIAnthropicGoogle GeminiGroqOllamaLangChainn8nClaude DesktopCursor

Similar agents

Rating

4.2/ 5

Editorial score

Industries

SaaSEnterpriseB2BDevToolsOpen Source

Leave a review

Never displayed publicly.

Agent Stacks

See workflow stacks that feature Browser Use.

Compare

Related Content

Is this your tool?

Claim this listing to update your details and get a Verified badge.

Claim listing →