Selected Publications


Yijie Hu, Zihao Zhou, Kaizhu Huang, Xiaowei Huang, Qiufeng Wang
Can MLLMs Absorb Math Reasoning Abilities from LLMs as Free Lunch?
The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeruIPS), 2025.
   

Chaolong Yang*, Kai Yao*, Yuyao Yan, Chenru Jiang, Weiguang Zhao, Jie Sun, Guangliang Cheng, Yifei Zhang, Bin Dong, Kaizhu Huang
Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait
International Journal of Computer Vision (IJCV), 2025. [280+ Star]
GitHub Repo PDF    

Xiaoqiang Kang, Shengen Wu, Zimu Wang, Yilin Liu, Xiaobo Jin, Kaizhu Huang, Wei Wang, Yutao Yue, Xiaowei Huang, Qiufeng Wang
Can GRPO Boost Complex Multimodal Table Reasoning?
The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025.
PDF    

Tianyi Liu, Zhaorui Tan, Muyin Chen, Xi Yang, Haochuan Jiang, Kaizhu Huang.
MedMAP: Promoting Incomplete Multi-modal Brain Tumor Segmentation with Alignment
IEEE Journal of Biomedical and Health Informatics (JBHI), 2025.
PDF    

Weiguang Zhang, Huangcheng Lu, Maizhen Ning, Xiaowei Huang, Wei Wang, Kaizhu Huang, Qiufeng Wang
DvD: Unleashing a Generative Paradigm for Document Dewarping via Coordinates-based Diffusion Model
The 17th ACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH Asia), 2025.
PDF    

Xinzhe Xia, Weiguang Zhao, Yuyao Yan, Guanyu Yang, Rui Zhang, Kaizhu Huang, Xi Yang
Towards Training-Free Open-World Classification with 3D Generative Models
The 33rd ACM International Conference on Multimedia (ACM MM), 2025.
PDF    

Chaolong Yang, Yinuo Guo, Kai Yao, Yuyao Yan, Jie Sun, Kaizhu Huang
KDTalker++: Controllable Talking Portrait Generation with Audio, Text, and Expression Editing
The 33rd ACM International Conference on Multimedia - Demo Track (ACM MM), 2025.
GitHub Repo    

Zhaorui Tan, Xi Yang, Tan Pan, Tianyi Liu, Chen Jiang, Xin Guo, Qiufeng Wang, Anh Nguyen, Yuan Qi, Kaizhu Huang, Yuan Cheng
Towards a Universal 3D Medical Multi-modality Generalization via Learning Personalized Invariant Representation
The International Conference on Computer Vision (ICCV), 2025.
PDF    

Kai Yao, Zhaorui Tan, Zixian Su, Xi Yang, Jie Sun, Kaizhu Huang.
SCMix: Stochastic Compound Mixing for Open Compound Domain Adaptation in Semantic Segmentation
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2025.
PDF    

Tianyi Liu, Haochuan Jiang, Kaizhu Huang
KMD: Koopman Multi-modality Decomposition for Generalized Brain Tumor Segmentation under Incomplete Modalities
The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR), 2025.
   

Jianan Ye, Weiguang Zhao, Xi Yang, Guangliang Cheng, Kaizhu Huang
PO3AD: Predicting Point Offsets toward Better 3D Point Cloud Anomaly Detection
The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR), 2025.
PDF    

Weiguang Zhao, Rui Zhang, Qiufeng Wang, Guangliang Cheng, Kaizhu Huang
BFANet: Revisiting 3D Semantic Segmentation with Boundary Feature Analysis
The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR), 2025.
PDF    

Zihao Zhou, Shudong Liu, Maizhen Ning, Wei Liu, Jindong Wang, Derek F. Wong, Xiaowei Huang, Qiufeng Wang, Kaizhu Huang
Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist
The Thirteenth International Conference on Learning Representations (ICLR), 2025.
PDF    

Zihan Ye, Shreyank N Gowda, Shiming Chen, Xiaowei Huang, Haotian Xu, Fahad Shahbaz Khan, Yaochu Jin, Kaizhu Huang, Xiaobo Jin
ZeroDiff: Solidified Visual-semantic Correlation in Zero-Shot Learning
The Thirteenth International Conference on Learning Representations (ICLR), 2025.
PDF    

Jianan Ye, Zhaorui Tan, Yijie Hu, Xi Yang, Guangliang Cheng, Kaizhu Huang.
Disentangling Tabular Data towards Better One-Class Anomaly Detection
The 39th Annual AAAI Conference on Artificial Intelligence (AAAI), 2025.
PDF    

Maizhen Ning, Zihao Zhou, Qiufeng Wang, Xiaowei Huang, Kaizhu Huang.
GNS: Solving Plane Geometry Problems by Neural-Symbolic Reasoning with Multi-Modal LLMs
The 39th Annual AAAI Conference on Artificial Intelligence (AAAI), 2025.
PDF    

Huiru Shao, Kaizhu Huang, Wei Wang, Xiaowei Huang, Qiufeng Wang.
Towards Better Robustness Against Natural Corruptions in Document Tampering Localization
The 39th Annual AAAI Conference on Artificial Intelligence (AAAI), 2025.
PDF    

Xiaoqiang Kang, Zimu Wang, Xiaobo Jin, Wei Wang, Kaizhu Huang, Qiufeng Wang.
Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation
The 39th Annual AAAI Conference on Artificial Intelligence (AAAI), 2025.
PDF    

Yijie Hu, Qiufeng Wang, Guanyu Yang, Zhaorui Tan, Kaizhu Huang, Xiaowei Huang.
Covariance-based Space Regularization for Few-shot Class Incremental Learning
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2025.
PDF    

Zhaorui Tan, Xi Yang, Qiufeng Wang, Anh Nguyen, Kaizhu Huang.
Interpret Your Decision: Logical Reasoning Regularization for Generalization in Visual Classification
Neural Information Processing Systems (NeurIPS), 2024. [Spotlight]
PDF    

Jingwei Guo, Kaizhu Huang, Rui Zhang, Xinping Yi.
ES-GNN: Generalizing Graph Neural Networks Beyond Homophily with Edge Splitting
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024.
DOI     PDF    

Shufei Zhang, Zhuang Qian, Kaizhu Huang, Qiufeng Wang, Bin Gu, Huan Xiong, Xinping Yi.
Inter-feature Relationship Certifies Robust Generalization of Adversarial Training
International Journal of Computer Vision (IJCV), 2024.
DOI     PDF    

Latest Breaking Work


Chaolong Yang*, Kai Yao*, Yuyao Yan, Chenru Jiang, Weiguang Zhao, Jie Sun, Guangliang Cheng, Yifei Zhang, Bin Dong, Kaizhu Huang
Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait
International Journal of Computer Vision (IJCV), 2025. [280+ Star]
GitHub Repo PDF    

Zhaorui Tan, Xi Yang, Qiufeng Wang, Anh Nguyen, Kaizhu Huang.
Interpret Your Decision: Logical Reasoning Regularization for Generalization in Visual Classification
Neural Information Processing Systems (NeurIPS), 2024. [Spotlight]
DOI     PDF