Open-source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.

mogoweb/nanobrowser

🌐 Nanobrowser

Nanobrowser is an open-source AI web automation tool that runs in your browser. It is a free alternative to OpenAI Operator with flexible LLM options and a multi-agent system.

⬇️ Get Nanobrowser for free

πŸ‘ Join the community in Discord | X

❤️ Loving Nanobrowser? Give us a star 🌟 and help spread the word!

Nanobrowser Demo GIF

Nanobrowser's multi-agent system analyzing HuggingFace in real-time, with the Planner intelligently self-correcting when encountering obstacles and dynamically instructing the Navigator to adjust its approach, all running locally in your browser.

🔥 Why Nanobrowser?

Looking for a powerful AI web agent without the $200/month price tag of OpenAI Operator? Nanobrowser, as a Chrome extension, delivers premium web automation capabilities while keeping you in complete control:

  • 100% Free - No subscription fees or hidden costs. Just install the extension and use your own API keys; you pay only for what you use.
  • Privacy-Focused - Everything runs in your local browser. Your credentials stay with you, never shared with any cloud service.
  • Flexible LLM Options - Connect to your preferred LLM providers with the freedom to choose different models for different agents.
  • Fully Open Source - Complete transparency in how your browser is automated. No black boxes or hidden processes.

Note: We currently support OpenAI, Anthropic and Gemini, with more LLM integrations coming soon!

📊 Key Features

  • Multi-agent System: Specialized AI agents collaborate to accomplish complex web workflows
  • Interactive Side Panel: Intuitive chat interface with real-time status updates
  • Task Automation: Seamlessly automate repetitive tasks across websites
  • Follow-up Questions: Ask contextual follow-up questions about completed tasks
  • Conversation History: Easily access and manage your AI agent interaction history
  • Multiple LLM Support: Connect your preferred LLM providers and assign different models to different agents
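
To make the multi-agent idea concrete, here is a minimal TypeScript sketch of how a Planner, Navigator, and Validator might hand work to one another. It is an illustration only, not Nanobrowser's actual implementation: the `PlannerAgent`, `NavigatorAgent`, and `ValidatorAgent` interfaces, the `runTask` function, and the stub agents are all hypothetical names, and real agents would call an LLM and drive the browser rather than return canned values.

```typescript
// Hypothetical sketch: none of these types come from the Nanobrowser codebase.
type Step = { instruction: string };
type StepResult = { instruction: string; ok: boolean };

interface PlannerAgent {
  // Produces the next steps, given the task and what has happened so far.
  plan(task: string, results: StepResult[]): Step[];
}
interface NavigatorAgent {
  // Carries out one step and reports whether it worked.
  act(step: Step): StepResult;
}
interface ValidatorAgent {
  // Decides whether the accumulated results satisfy the task.
  isDone(task: string, results: StepResult[]): boolean;
}

function runTask(
  task: string,
  planner: PlannerAgent,
  navigatorAgent: NavigatorAgent,
  validator: ValidatorAgent,
  maxRounds = 3,
): { done: boolean; results: StepResult[] } {
  const results: StepResult[] = [];
  for (let round = 0; round < maxRounds; round++) {
    // The planner sees earlier failures, so it can adjust its approach.
    for (const step of planner.plan(task, results)) {
      results.push(navigatorAgent.act(step));
    }
    if (validator.isDone(task, results)) {
      return { done: true, results };
    }
  }
  return { done: false, results };
}

// Stub agents standing in for LLM-backed ones.
const stubPlanner: PlannerAgent = {
  plan: (_task, results) =>
    results.some((r) => !r.ok)
      ? [{ instruction: "retry via site search" }]
      : [{ instruction: "open page" }],
};
const stubNavigator: NavigatorAgent = {
  // Pretend the first attempt ("open page") hits an obstacle.
  act: (step) => ({
    instruction: step.instruction,
    ok: step.instruction !== "open page",
  }),
};
const stubValidator: ValidatorAgent = {
  // Treat the task as complete once the most recent step succeeded.
  isDone: (_task, results) => {
    const last = results[results.length - 1];
    return last !== undefined && last.ok;
  },
};

const outcome = runTask("find trending repos", stubPlanner, stubNavigator, stubValidator);
```

With these stubs, the first round fails, the planner switches to a different instruction in the second round, and the validator then accepts the result. The `maxRounds` cap is a design choice in this sketch to keep a stuck task from looping indefinitely.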

🚀 Quick Start

  1. Download

    • Download the latest nanobrowser.zip file from the official GitHub release page.
  2. Install:

    • Unzip nanobrowser.zip.
    • Open chrome://extensions/ in Chrome
    • Enable Developer mode (top right)
    • Click Load unpacked (top left)
    • Select the unzipped nanobrowser folder.
  3. Configure Agent Models

    • Click the Nanobrowser icon in your toolbar to open the sidebar
    • Click the Settings icon (top right).
    • Add your LLM API keys.
    • Choose which model to use for different agents (Navigator, Planner, Validator)

πŸ› οΈ Build from Source

If you prefer to build Nanobrowser yourself, follow these steps:

  1. Prerequisites:

    • Node.js (v22.12.0 or higher)
    • pnpm (v9.15.1 or higher)
  2. Clone the Repository:

    git clone https://github.com/nanobrowser/nanobrowser.git
    cd nanobrowser
  3. Install Dependencies:

    pnpm install
  4. Build the Extension:

    pnpm build
  5. Load the Extension:

    • The built extension will be in the dist directory
    • Follow the installation steps from the Quick Start section to load the extension into your browser
  6. Development Mode (optional):

    pnpm dev
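
The version prerequisites in step 1 can be checked mechanically. The following is a small, self-contained TypeScript sketch of a minimum-version comparison. `meetsMinimum` is a hypothetical helper written for illustration (it is not part of Nanobrowser, Node.js, or pnpm), and it assumes plain `major.minor.patch` version strings with an optional leading `v`.

```typescript
// Hypothetical helper, for illustration only.
// Returns true when `actual` is at least `minimum`; both are "major.minor.patch".
function meetsMinimum(actual: string, minimum: string): boolean {
  const parse = (v: string): number[] =>
    v.replace(/^v/, "").split(".").map((part) => Number.parseInt(part, 10));
  const a = parse(actual);
  const m = parse(minimum);
  // Compare major, then minor, then patch; the first difference decides.
  for (let i = 0; i < 3; i++) {
    const x = a[i] ?? 0;
    const y = m[i] ?? 0;
    if (x !== y) return x > y;
  }
  return true; // versions are equal
}

// Minimums taken from the prerequisites list above; the "actual" values are examples.
const nodeOk = meetsMinimum("v22.12.0", "22.12.0");
const pnpmTooOld = meetsMinimum("9.14.4", "9.15.1");
```

In a Node.js script, the actual Node version could be read from `process.version`, and the pnpm version from the output of `pnpm --version`.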

🤖 Choosing Your Models

Nanobrowser allows you to configure a different LLM for each agent to balance performance and cost. Here are recommended configurations:

Better Performance

  • Planner & Validator: Claude 3.7 Sonnet
    • Better reasoning and planning capabilities
    • More reliable task validation
  • Navigator: Claude 3.5 Haiku
    • Efficient for web navigation tasks
    • Good balance of performance and cost

Cost-Effective Configuration

  • Planner & Validator: Claude Haiku or GPT-4o
    • Reasonable performance at lower cost
    • May require more iterations for complex tasks
  • Navigator: Gemini 2.0 Flash or GPT-4o-mini
    • Lightweight and cost-efficient
    • Suitable for basic navigation tasks

Note: The cost-effective configuration may produce less stable outputs and require more iterations for complex tasks.

Tip: Feel free to experiment with your own model configurations! Found a great combination? Share it with the community in our Discord to help others optimize their setup.
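
The two recommended configurations can be pictured as a simple mapping from agent to model. The TypeScript sketch below is only a way to visualize that mapping: `AgentName`, `AgentModels`, and `modelsInUse` are hypothetical names, not Nanobrowser's real settings schema, and the strings are the display names used above rather than exact provider API model identifiers. For the cost-effective setup, where the text offers alternatives, the sketch arbitrarily picks GPT-4o and GPT-4o-mini.

```typescript
// Hypothetical types: this is not Nanobrowser's real settings schema.
type AgentName = "planner" | "navigator" | "validator";
type AgentModels = Record<AgentName, string>;

// Model names are display labels copied from the recommendations above.
const betterPerformance: AgentModels = {
  planner: "Claude 3.7 Sonnet",
  validator: "Claude 3.7 Sonnet",
  navigator: "Claude 3.5 Haiku",
};

// One of several possible cost-effective combinations listed above.
const costEffective: AgentModels = {
  planner: "GPT-4o",
  validator: "GPT-4o",
  navigator: "GPT-4o-mini",
};

// Lists the distinct models a configuration uses,
// for example to see which providers need an API key.
function modelsInUse(config: AgentModels): string[] {
  return [config.planner, config.navigator, config.validator].filter(
    (model, index, all) => all.indexOf(model) === index,
  );
}

const performanceModels = modelsInUse(betterPerformance);
const budgetModels = modelsInUse(costEffective);
```

Because the Planner and Validator share a model in both setups, each configuration here uses only two distinct models.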

💡 See It In Action

Here are some powerful tasks you can accomplish with just a sentence:

  1. News Summary:

    "Go to TechCrunch and extract top 10 headlines from the last 24 hours"

  2. GitHub Research:

    "Look for the trending Python repositories on GitHub with most stars"

  3. Shopping Research:

    "Find a portable Bluetooth speaker on Amazon with a water-resistant design, under $50. It should have a minimum battery life of 10 hours"

πŸ› οΈ Roadmap

We're actively developing Nanobrowser with exciting features on the horizon, and you're welcome to join us!

  • Expand LLM Support: Add integration with more large language model providers for AI agents
  • Security Firewall: Implement protective measures to ensure safe browsing operations
  • Memory Optimization: Reduce token usage while maintaining context awareness
  • Session Replay: Efficiently replay historical tasks with minimal token consumption
  • Specialized Agents: Develop purpose-built agents for complex domain-specific tasks

🤝 Contributing

We need your help to make Nanobrowser even better! Contributions of all kinds are welcome:

  • Share Prompts & Use Cases
    • Join our Discord server.
    • Share how you're using Nanobrowser. Help us build a library of useful prompts and real-world use cases.
  • Provide Feedback
    • Try Nanobrowser and give us feedback on its performance or suggest improvements in our Discord server.
  • Contribute Code
    • Create an issue
    • Fork the repository
    • Open a pull request

We believe in the power of open source and community collaboration. Join us in building the future of web automation!

💬 Community

Join our growing community of developers and users:

πŸ‘ Acknowledgments

Nanobrowser builds on top of other awesome open-source projects:

Huge thanks to their creators and contributors!

📄 License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

Made with ❤️ by the Nanobrowser Team.

Like Nanobrowser? Give us a star 🌟 and join us in Discord | X
