OpenAI CUA (computer use)

Integrate OpenAI CUA with Votte Browser Sessions

Overview

This guide explains how to integrate OpenAI’s Computer Use Agent (CUA) with Votte’s browser infrastructure for automated web interactions.

CUA enables programmatic control of web interfaces through visual processing and contextual understanding. When integrated with Votte’s browser infrastructure, it provides a scalable environment for running these automations in the cloud.

A demo is available at .https://votte.cc/#tutorial

Requirements

  • An OpenAI API key with CUA access

  • A Votte API key

  • Python 3.11 or later

Setup

Follow these steps to integrate CUA with Votte:

  1. Clone the repository:

Copy

git clone https://github.com/openai/openai-cua-sample-app.git
  1. Install dependencies:

Copy

  1. Set environment variables:

Copy

  1. Run the example:

Copy

CLI Options

Available command-line arguments:

  • --input: Automation instructions (prompts if not provided)

  • --debug: Enable debug logging

  • --show: Enable screenshot capture

  • --start-url: Set initial URL (default: https://bing.com)

Last updated