Gordon Qian is a Senior Research Scientist at Snap Inc. with hands-on experience in multimodal generation across images, video, and 3D. He is a member of the video generation team, where he works on pretraining and post-training for 5B to 30B VLM-diffusion models, with a current focus on self-forcing for real-time streamable video generation. Previously, he served as tech lead for personalized generation, building end-to-end systems spanning large-scale data curation, model post-training, model evaluation, and product deployment. He earned his Ph.D. in Computer Science from KAUST, advised by Prof. Bernard Ghanem.
He has authored 20+ top-tier publications with 3,500+ citations, filed six patents, and shipped three research innovations to Snapchat products, including Dreams, EasyLens, and AI Lens, impacting 900M+ monthly active users.
His representative works include PointNeXt (NeurIPS, 1.2K+ citations, 1K+ GitHub stars), Magic123 (ICLR, 450+ citations, 1.6K+ GitHub stars), and Omni-ID (CVPR, product-integrated, patent filed). His research interests include streamable and unified multimodal understanding and generation. He serves as an Area Chair for ICLR 2026.
If you are interested in working on image and video generative models with me, please reach out at gordonqian2017 [at] gmail.com.

Ph.D. in CS
KAUST, 2019–2023

B.Eng. in ME
XJTU, 2014–2018
Tech lead of full-stack R&D. Shipped three products and filed three patents.
Worked on video pretraining, post-training (camera control, editing), and alignment tuning.
Worked on VLM post-training, VLM diffusion, and VLM-based reward tuning.
Worked across image-to-3D and text-to-3D generation, including high-impact systems for geometry, mesh generation, and controllable 3D content.
First-authored multiple projects on scalable spatial understanding and efficient 3D representation learning.
Selected projects below; * / † denote equal contribution / corresponding author. See the full publication list.
Canvas-to-Image introduces a unified framework that consolidates heterogeneous controls (subject references, bounding boxes, pose skeletons) into a single canvas interface for high-fidelity compositional image generation.
Diffusion-DRF uses a frozen VLM critic to provide free, rich, and differentiable feedback for stable video diffusion fine-tuning.
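As a hedged illustration of this reward-tuning pattern, the sketch below scores generator samples with a frozen critic and backpropagates the reward into the generator. The tiny `critic` and `generator` modules are toy stand-ins (a real setup would use a VLM critic and a video diffusion model); this is not the Diffusion-DRF code.

```python
# Minimal sketch: a frozen critic provides differentiable feedback to a generator.
import torch
import torch.nn as nn

# Stand-in "critic": frozen, but still differentiable w.r.t. its *inputs*.
critic = nn.Sequential(nn.Flatten(), nn.Linear(3 * 8 * 8, 1))
for p in critic.parameters():
    p.requires_grad_(False)  # frozen: no critic updates, gradients still flow to inputs

# Stand-in "generator": the only module we update.
generator = nn.Sequential(nn.Linear(16, 3 * 8 * 8), nn.Unflatten(1, (3, 8, 8)))
opt = torch.optim.Adam(generator.parameters(), lr=1e-4)

noise = torch.randn(4, 16)
frames = generator(noise)          # fake "video frames" (B, C, H, W)
reward = critic(frames).mean()     # frozen critic scores the samples
loss = -reward                     # maximize reward
opt.zero_grad()
loss.backward()                    # gradients reach the generator through the frozen critic
opt.step()
```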
EasyV2V is an instruction-based video editing framework that enables intuitive, high-quality video manipulation through natural language.
Omni-Attribute isolates a specific attribute, whether concrete or abstract, from any image and merges selected attributes from multiple images into a coherent generation.
We prevent shortcut learning in adapter training by explicitly providing the shortcuts during training, forcing the model to learn more robust representations.
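A minimal sketch of the idea, with all module names, dimensions, and the MSE target assumed for illustration: if a shortcut signal such as pose is supplied through its own branch, the adapter has no incentive to smuggle it through the identity features.

```python
import torch
import torch.nn as nn

id_adapter = nn.Linear(64, 32)   # encodes reference-image features (identity branch)
pose_proj = nn.Linear(6, 32)     # encodes the "shortcut" signal (e.g., pose) directly

ref_feats = torch.randn(4, 64)   # reference features: identity plus leaked pose cues
pose = torch.randn(4, 6)         # the shortcut, handed to the model explicitly
target = torch.randn(4, 32)      # stand-in for the downstream training target

# Because pose is already supplied through its own branch, the identity
# adapter gains nothing from re-encoding it, which pushes it toward
# pose-invariant identity features.
cond = id_adapter(ref_feats) + pose_proj(pose)
loss = nn.functional.mse_loss(cond, target)
loss.backward()
```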
ComposeMe is a human-centric generative model that enables disentangled control over multiple visual attributes (such as identity, hair, and garment) across multiple subjects, while also supporting text-based control.
ThinkDiff enables multimodal in-context reasoning in diffusion models by aligning vision-language models to LLM decoders, transferring reasoning capabilities without requiring complex reasoning-based datasets.
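As a hedged sketch of this alignment pattern, the snippet below trains a small projector from an assumed VLM feature space into an LLM decoder's embedding space with a simple feature-matching loss; the dimensions and the MSE objective are illustrative assumptions, not the paper's exact recipe.

```python
import torch
import torch.nn as nn

proj = nn.Linear(512, 1024)              # assumed dims: VLM features -> LLM decoder embeddings
opt = torch.optim.Adam(proj.parameters(), lr=1e-4)

vlm_feats = torch.randn(8, 512)          # outputs of a frozen vision-language encoder
target_embeds = torch.randn(8, 1024)     # frozen LLM-decoder embeddings of paired text

loss = nn.functional.mse_loss(proj(vlm_feats), target_embeds)
opt.zero_grad()
loss.backward()
opt.step()
```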
Omni-ID is a novel facial representation tailored for generative tasks, encoding identity features from unstructured images into a fixed-size representation that captures diverse expressions and poses.
WonderLand is a video-latent-based approach for large-scale 3D scene reconstruction from a single image.
AC3D studies when and how camera signals should be injected into a video diffusion model for better camera control and higher video quality.
Magic123 proposes a hybrid score distillation algorithm and a coarse-to-fine image-to-3D pipeline that produces high-quality, high-resolution 3D content from a single unposed image.
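A minimal sketch of a hybrid score-distillation update under stated assumptions: two stand-in functions play the role of the 2D and 3D-aware prior scores, and a weight blends them before the result is injected as the rendering gradient, as SDS-style methods commonly do. The placeholder gradients are random, not real denoiser outputs; see the paper for the exact formulation.

```python
import torch

def sds_grad_2d(rendered: torch.Tensor) -> torch.Tensor:
    # Placeholder for the 2D text-to-image prior's score residual.
    return torch.randn_like(rendered)

def sds_grad_3d(rendered: torch.Tensor) -> torch.Tensor:
    # Placeholder for the 3D-aware, reference-conditioned prior's score residual.
    return torch.randn_like(rendered)

rendered = torch.randn(1, 3, 64, 64, requires_grad=True)
lam = 0.5  # trades off 2D imagination against 3D consistency

grad = lam * sds_grad_2d(rendered) + (1.0 - lam) * sds_grad_3d(rendered)
rendered.backward(gradient=grad)  # SDS-style: inject the blended score as the gradient
```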
Pix4Point shows that image pretraining significantly improves point cloud understanding.
PointNeXt boosts the performance of PointNet++ to the state-of-the-art level with improved training and scaling strategies.
ASSANet makes PointNet++ faster and more accurate.

