REX OMNI

// Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)

Rex Omni

Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)

13EmergingUnknown

See Alternatives Compare...

License

NOASSERTION

Updated

1mo ago

What it does

Detect Anything via Next Point Prediction > Rex-Omni is a 3B-parameter Multimodal Large Language Model (MLLM) that redefines object detection and a wide range of other visual perception tasks as a simple next-token prediction problem. - [2026-01-10] Pointing Task Finetuning is now supported! Train Rex-Omni on custom pointing datasets with SFT and GRPO. See Fine-tuning Guide for details. -

Getting Started

git

git clone https://github.com/IDEA-Research/Rex-Omni

Links

Platforms

🪟windows🍎mac🐧linux

Install Difficulty

moderate

Built With

jupyter notebook

Community Reactions

Similar Tools

See all alternatives →

36Emerging

Openvpn_webpanel_manager

A powerful, self-hosted web panel for managing OpenVPN servers, users, resellers (sub-admins), and multi-node deploym...

Active190open-source

36Emerging

Hink

Link Shortener for Hackers

Active158open-source

36Emerging

HackerBook

Hacker Book - COMMUNITY, ALL THE HN ARE BELONG TO YOU. An unkillable, static offline archive of all of Hacker News.

Active187open-source

36Emerging

Obsidian Life Tracker Base View

Capture and visualize the data that matters in your life

Active179open-source

36Emerging

Vyuh_node_flow

A flexible, high-performance node-based flow editor for Flutter. Build visual programming interfaces, workflow editor...

Active171open-source

36Emerging

Db Studio

The modern pgAdmin alternative that works with every database.

Active171open-source