PUFFIN
// Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation
Puffin
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation
13EmergingUnknown
What it does
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation > Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation > > Kang Liao, Size Wu, Zhonghua Wu, Linyi Jin, Chao Wang, Yikai Wang, Fei Wang, Wei Li, Chen Change Loy > > > > > > We introduce Puffin, a camera-centric unified multimodal model designed to advance
Getting Started
git
git clone https://github.com/KangLiao929/Puffin