T
ToolShelf
SCALECUA
// ScaleCUA is the open-sourced computer use agents that can operate on cross-platform environments (Windows, macOS, Ubu...

ScaleCUA

ScaleCUA is the open-sourced computer use agents that can operate on cross-platform environments (Windows, macOS, Ubu...

13EmergingUnknown
License
Apache-2.0
Updated
Today

What it does

&nbsp&nbspπŸ“‘ Paper&nbsp&nbsp | &nbsp&nbspπŸ€— Dataset&nbsp&nbsp | &nbsp&nbspπŸ€– Model&nbsp&nbsp | &nbsp&nbspπŸ–₯️ Model Demo&nbsp&nbsp Vision-Language Models (VLMs) have enabled computer use agents (CUAs) that operate GUIs autonomously with great potential. However, developing robust CUAs requires extensive in-domain knowledge about software interfaces and operations. Unlike image–text pairs that are

Getting Started

git
git clone https://github.com/OpenGVLab/ScaleCUA

Platforms

πŸͺŸwindows🍎mac🐧linux

Install Difficulty

moderate

Built With

python

Community Reactions