About

I am a Computer Vision researcher pursuing my Ph.D. at the University of Macau. My research encompasses generative models, diffusion models, arbitrary style transfer, semantic segmentation, and manipulation detection.

I have proposed and executed novel solutions for multiple computer vision problems, with my work recognized in conferences such as CVPR, ICCV, NeurIPS, ICLR, and AAAI. Beyond academia, I have industry experience from internships at Amazon AWS and Amazon Alexa, shipping state-of-the-art visual feature models.

Generative Models Diffusion Models Style Transfer Computer Vision PyTorch

Timeline

2024 — Present

Ph.D. in Computer Sciences

University of Macau

Studying conditional layout-to-image generation and granular edge detection problems.

2022 — 2024

Applied Scientist Intern

Amazon AWS (Shanghai)

Engineered a video-to-video transfer model utilizing diffusion for user-guided video editing (published in ICLR 2024) and developed efficient diffusion models for layout-to-image synthesis.

Summer 2019, 2020, 2021

Applied Scientist Intern

Amazon Alexa

Pioneered one-shot style transfer platforms, halving deployment times. Conducted vital research bridging computer vision algorithms with Amazon's Alexa platform (published in AAAI 2023, CVPR 2021 Oral).

2017 — 2021

Master/Ph.D. in Computer Sciences

University of Southern California

Innovated generalized zero-shot semantic segmentation and deep fake detection mechanisms, solving diverse CV tasks mapping to key conferences (ICCV, CVPR).

2013 — 2017

B.S. in Information Security

Shanghai Jiao Tong University

Recipient of Third-Class Scholarship for three consecutive years.

Selected Publications

MEMO paper teaser
CVPR 2026 CCF-A

MEMO: Human-like Crisp Edge Detection Using Masked Edge Prediction

Jiaxin Cheng, Yue Wu, Yicong Zhou

Rich-context L2I paper teaser
NeurIPS 2024 CCF-A

Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation

Jiaxin Cheng, Zixu Zhao, Tong He, Tianjun Xiao, Zheng Zhang, Yicong Zhou

Video transfer paper teaser
ICLR 2024

Consistent Video-to-Video Transfer Using Synthetic Dataset

Jiaxin Cheng, Tianjun Xiao, Tong He

Style transfer paper teaser
AAAI 2023 CCF-A

User-controllable arbitrary style transfer via entropy regularization

Jiaxin Cheng, Yue Wu, Ayush Jaiswal, Xu Zhang, Pradeep Natarajan, Prem Natarajan

LayoutDiffuse paper teaser
CVPRW 2023

LayoutDiffuse: Adapting Foundational Diffusion Model for Layout-to-image Generation

Jiaxin Cheng, Xiao Liang, Xingjian Shi, Tong He, Tianjun Xiao, Mu Li

HEAD paper teaser
NeurIPSW 2022

Attack-Agnostic Adversarial Detection

Jiaxin Cheng, Mohamed Hussein, Jay Billa, Wael AbdAlmageed

SIGN paper teaser
ICCV 2021 CCF-A

SIGN: Spatial-information incorporated generative network for generalized zero-shot semantic segmentation

Jiaxin Cheng, Soumyaroop Nandi, Prem Natarajan, Wael Abd-Almageed

Style loss paper teaser
CVPR 2021 ORAL CCF-A

Style-aware normalized loss for improving arbitrary style transfer

Jiaxin Cheng, Ayush Jaiswal, Yue Wu, Pradeep Natarajan, Prem Natarajan

QATM paper teaser
CVPR 2019 CCF-A

QATM: Quality-aware template matching for deep learning

Jiaxin Cheng, Yue Wu, Wael AbdAlmageed, Premkumar Natarajan

Text detection paper teaser
ICDAR 2019 CCF-C

A Study of Script Language Effects in Deep Neural-Network-Based Scene Text Detection

Jiaxin Cheng, Achin Gupta, Yue Wu, Premkumar Natarajan

Image-GPS paper teaser
ACCV 2018 CCF-C

Image-to-GPS verification through a bottom-up pattern matching network

Jiaxin Cheng, Yue Wu, Wael AbdAlmageed, Premkumar Natarajan

Homography estimation paper teaser
CVPR 2026 CCF-A

Towards Generalized Multimodal Homography Estimation

Jinkun You, Jiaxin Cheng, Jie Zhang, Yicong Zhou

Face manipulation paper teaser
CVPRW 2019

Recurrent Convolutional Strategies for Face Manipulation Detection in Videos

Ekraam Sabir, Jiaxin Cheng, Ayush Jaiswal, Wael AbdAlmageed, Iacopo Masi, Prem Natarajan