SCALECUA
ScaleCUA provides open-source computer use agents that operate across platforms (Windows, macOS, Ubu...
What it does
Vision-Language Models (VLMs) have enabled computer use agents (CUAs) that operate GUIs autonomously, showing great potential. However, developing robust CUAs requires extensive in-domain knowledge about software interfaces and operations. Unlike image–text pairs that are
Getting Started
git clone https://github.com/OpenGVLab/ScaleCUA