Computer
Introduction
This guide will walk you through using the Runloop SDK to control a remote computer inside a Runloop Devbox. The Runloop API provides a computer-ready Devbox, enabling AI agents to interact with the system programmatically.
Set Up Your Environment
Set up your authentication key:
Install and Initialize the Runloop SDK
First, install the Runloop SDK if you haven’t already:
Then, import and initialize the SDK:
This client
object allows interaction with the Runloop API.
Create a Devbox and Start the Computer Tool
Create your Devbox, wait for it to be ready, and retrieve the connection details:
Interacting with the Computer
The computer-ready Devbox offers a suite of Computer Tools for agent interactions. The available actions include:
- Keyboard interaction:
key
,type
- Mouse interaction:
mouse_move
,left_click
,left_click_drag
,right_click
,middle_click
,double_click
- Screen interaction:
screenshot
,cursor_position
You can access these tools using the Runloop client as shown below:
Using API Tools with the Computer Tool
Once you create tools for your agent, you can integrate them with your preferred LLM. Here’s an example of integrating it with Anthropic’s Claude:
Different LLM providers have their own specific formats and requirements for defining and passing tools. Make sure to reference your LLM provider’s documentation for the correct implementation details of tool schemas and function calling.
Properly Freeing Resources
To ensure efficient resource management, always shut down the Devbox when you’re done:
Additional Resources
- Runloop GitHub Repository - Explore more examples.
- Runloop API Documentation - Official API reference.
Was this page helpful?