Browser
Quickstart: Controlling a Browser in a Runloop Devbox
Introduction
This guide will walk you through using the Runloop SDK to control a browser inside a Runloop Devbox. The Runloop API provides a browser-ready Devbox, enabling AI agents to interact with web pages programmatically.
Set Up Your Environment
Set up your authentication key:
Install and Initialize the Runloop SDK
First, install the Runloop SDK if you haven’t already:
Then, import and initialize the SDK:
This client
object allows interaction with the Runloop API.
Create a Devbox and Start the Browser
Set up your browser-ready Devbox and obtain the connection details:
Connect to the Browser using Playwright
To interact with the browser, you can use automation tools like Selenium, Puppeteer, or Playwright. Here’s an example using Playwright’s Chrome DevTools Protocol (CDP):
Defining Tools for AI Agents
You can create custom tools for AI agents to interact with the browser programmatically. Here’s an example of a navigation tool using Playwright:
Passing Tools to an AI Agent
Now, you can pass this tool to an AI agent, enabling it to use the browser autonomously:
Different LLM providers have their own specific formats and requirements for defining and passing tools. Make sure to reference your LLM provider’s documentation for the correct implementation details of tool schemas and function calling.
Properly Freeing Resources
To ensure efficient resource management, always shut down the Devbox when you’re done:
Additional Resources
- Runloop GitHub Repository - Explore more examples.
- Runloop API Documentation - Official API reference.
Was this page helpful?