Panwang Pan | ๆฝๆๆ
I am currently employed as a Researcher and Developer at PICO within ByteDance Ltd. Previously, I held the position of Senior Algorithm Engineer at Alibaba Cloud, where I specialized in 3D Reconstruction and 6DoF Pose Estimation.
In 2019, I earned my Master's degree from Xiamen University, where I was enrolled in the School of Informatics.
I focused on generative models and multi-modal representation learning, particularly in the 3D realm. Research contributions have been integrated into XR devices, Aliyun Cloud AI-Box, and various commercial products.
Email
 / 
Google Scholar
 / 
Github
 / 
Twitter
 / 
Wechat
|
|
๐ข Latest News
[2025-06] We released PartCrafter, a 3D-native DiT model designed to generate 3D objects in modular parts.
[2025-02] One paper about VLM + RRHF (JarvisIR) was accepted to CVPR 2025 ๐ .
[2025-01] 4K4DGEN was Selected as ICLR25 Spotlight, top 3.2% among 11672 ๐.
[2025-01] Three papers about 3D/4D Generative Models (InstantSplamp & DiffSplat & 4K4DGEN) were accepted to ICLR 2025.
[2024-09] One paper about generalizable single-view human reconstruction (HumanSplat) was accepted to NeurIPS 2024 ๐ .
[2024-09] One paper about VLM Distillation (MRD) was accepted to ECCV 2024 ๐ .
|
Preprint 2025
|
JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent
Yunlong Lin, Zixu Lin, Kunjie Lin, Jinbin Bai, Panwang Pan, Chenxin Li, Haoyu Chen, Zhongdao Wang, Xinghao Dingโ , Wenbo Li, Shuicheng Yanโ
[Paper]
[Project]
[Code]
JarvisArt outperforms GPT-4o with a 60% improvement in average pixel-level metrics on MMArt-Bench for content fidelity, while maintaining comparable instruction-following capabilities.
|
ICLR 2025 ๐ spotlight ๐
|
4K4DGEN: Panoramic 4D Generation at 4K Resolution
Renjie Li*, Panwang Pan*โก, Bangbang Yang, Dejia Xu, Shijie Zhou, Xuanyang Zhang,
Zeming Li, Achuta Kadambi, Zhangyang Wang, Zhengzhong Tu, Zhiwen Fan
[Openreview]
[Paper]
[Project]
[Code]
4K4DGEN achieves high-quality Panorama-to-4D generation at a resolution of 4K for the first time using efficient splatting techniques for real-time exploration.
|
|
DynamicVerse: Physically-Aware Multimodal Modeling for Dynamic 4D Worlds
Kairun Wen, Yuzhi Huang, Runyu Chen, Hui Zheng, Yunlong Lin, Panwang Pan, Chenxin Li, Wenyan Cong, Jian Zhang, Junbin Lu, Chenguo Lin, Dilin Wang, Zhicheng Yan, Hongyu Xu, Justin Theiss, Yue Huang, Xinghao Ding, Rakesh Ranjan, Zhiwen Fan
[Paper]
[Project]
[Code]
DynamicVerse is a physicalโscale, multimodal 4D modeling framework for real-world video.
|
ByteDance Ltd, Beijing, China, Senior Computer Vision Algorithm Engineer, advised by Cheng Chen and Zeming Li.
|
08/2022 - Present |
Alibaba Cloud, Hangzhou, China, Senior Computer Vision Algorithm Engineer
|
07/2019 - 07/2022 |
DevTech Compute, NVIDIA, Beijing, China,
AI Developer Technology Engineer Intern
advised by Xipeng Li .
|
07/2018 - 10/2018 |
๐ Selected Awards
2023,2024: ByteStyle Award, Bytedance
2019: Outstanding Graduates of Xiamen University
2018: National Scholarship for Postgraduates, Ministry of Education
2018: First Prize of GEDC, Second Prize of MCM & CPIPC
2017: ZhongXian Huang Scholarship, Xiamen University (about 10 awards per year)
2015: National Scholarship for Undergraduates (the highest honor scholarship in China)
|
|