T
ToolShelf
PUFFIN
// Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Puffin

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

13EmergingUnknown
License
NOASSERTION
Updated
Today

What it does

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation > Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation > > Kang Liao, Size Wu, Zhonghua Wu, Linyi Jin, Chao Wang, Yikai Wang, Fei Wang, Wei Li, Chen Change Loy > > > > > > We introduce Puffin, a camera-centric unified multimodal model designed to advance

Getting Started

git
git clone https://github.com/KangLiao929/Puffin

Platforms

🪟windows🍎mac🐧linux

Install Difficulty

moderate

Built With

python

Community Reactions