DOLPHIN
// The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
Dolphin
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
13EmergingUnknown
What it does
Dolphin-v2 is an enhanced universal document parsing model that substantially improves upon the original Dolphin. It seamlessly handles any document type—whether digital-born or photographed—through a document-type-aware two-stage architecture with scalable anchor prompting. Document image parsing is challenging due to diverse document types and complexly intertwined elements such as text
Getting Started
git
git clone https://github.com/bytedance/Dolphin