Project Mariner

Paid

A research prototype from Google DeepMind that explores the future of human-agent interaction, starting with the browser. Built with Gemini 2.0, it combines multimodal understanding and reasoning to automate tasks in the browser.

Features

Multimodal Understanding

Understands and reasons about everything on the browser screen, including pixels and web elements such as text, code, images, and forms

Browser Automation

Understands and navigates complex websites, executing tasks on behalf of users

Advanced Benchmark Performance

Achieved a state-of-the-art result of 83.5% on the WebVoyager benchmark when running as a single agent

Pricing

Limited Testing

Only open to trusted testers

Not currently available to the public
Requires joining a waitlist

Use Cases

Web Navigation & Interaction

Automatically browses and interacts with complex websites

Automating Repetitive Tasks

Handles repetitive tasks on web pages, saving users time