Zhendong Mao (毛震东)  (Specially Appointed Professor)

Doctoral Supervisor, Master's Supervisor

Email:

Education: Doctoral graduate

Degree: Ph.D.

Alma mater: Institute of Computing Technology, Chinese Academy of Sciences

   

Representative Publications


Representative conference papers from the past three years:

● Zheren Fu, Lei Zhang, Hou Xia, Zhendong Mao*. “Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment.” CVPR 2024. CCF-A

● Mengqi Huang, Zhendong Mao*, Mingcong Liu, Qian He, Yongdong Zhang. “RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization.” CVPR 2024. CCF-A

● Huatian Zhang, Lei Zhang, Kun Zhang, Zhendong Mao*. “Identification of Necessary Semantic Undertakers in the Causal View for Image-Text Matching.” AAAI 2024. CCF-A

● Yihan Chen, Benfeng Xu, Quan Wang, Yi Liu, Zhendong Mao*. “Benchmarking Large Language Models on Controllable Generation under Diversified Instructions.” AAAI 2024. CCF-A

● Hao Li, Mengqi Huang, Lei Zhang, Bo Hu, Yi Liu, Zhendong Mao*. “Gradual Residuals Alignment: A Dual-Stream Framework for GAN Inversion and Image Attribute Editing.” AAAI 2024. CCF-A

● Zhuowei Chen, Shancheng Fang, Wei Liu, Qian He, Mengqi Huang, Zhendong Mao*. “DreamIdentity: Enhanced Editability for Efficient Face-identity Preserved Image Generation.” AAAI 2024. CCF-A

● Benfeng Xu, Quan Wang, Yajuan Lyu, Dai Dai, Yongdong Zhang and Zhendong Mao*. “S2ynRE: Two-stage Self-training with Synthetic data for Low-resource Relation Extraction.” ACL 2023. CCF-A

● Jingxuan Han, Quan Wang, Licheng Zhang, Weidong Chen, Yan Song and Zhendong Mao*. “Text Style Transfer with Contrastive Transfer Pattern Mining.” ACL 2023. CCF-A

● Mengqi Huang, Zhendong Mao*, Zhuowei Chen, Yongdong Zhang. “Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization.” CVPR 2023. (highlight paper)  CCF-A

● Mengqi Huang, Zhendong Mao*, Quan Wang, Yongdong Zhang. “Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation.” CVPR 2023.  CCF-A

● Zheren Fu, Zhendong Mao*, Yan Song, Yongdong Zhang. “Learning Semantic Relationship among Instances for Image-Text Matching.” CVPR 2023.  CCF-A

● Yuchen Ren, Zhendong Mao*, Shancheng Fang, Yan Lu, Tong He, Hao Du, Yongdong Zhang, Wanli Ouyang. “Crossing the Gap: Domain Generalization for Image Captioning.” CVPR 2023.  CCF-A

● Benfeng Xu, Quan Wang, Zhendong Mao*, Yajuan Lyu, Qiaoqiao She, Yongdong Zhang. “kNN Prompting: Learning Beyond the Context with Nearest Neighbor Inference.” ICLR 2023.

● Kun Zhang, Lei Zhang, Bo Hu, Mengxiao Zhu, Zhendong Mao*. “Unlocking the Power of Cross-Dimensional Semantic Dependency for Image-Text Matching.” ACM MM 2023. CCF-A

● Mengqi Huang, Zhendong Mao*, Penghui Wang, Quan Wang, Yongdong Zhang. “DSE-GAN: Dynamic Semantic Evolution Generative Adversarial Network for Text-to-Image Generation.” In Proceedings of the 30th ACM International Conference on Multimedia (ACM MM), Pages 4345–4354, 2022. (best student paper, 1/3009)  CCF-A

● Zhuowei Chen, Zhendong Mao*, Shancheng Fang, Bo Hu. “Background Layout Generation and Object Knowledge Transfer for Text-to-Image Generation.” ACM MM 2022. (oral paper)  CCF-A

● Jingyu Li, Zhendong Mao*, Shancheng Fang, Hao Li. “ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.” IJCAI 2022. (oral paper)  CCF-A

● Kun Zhang, Zhendong Mao*, Quan Wang, Yongdong Zhang. “Negative-Aware Attention Framework for Image-Text Matching.” CVPR 2022.  CCF-A

● Huatian Zhang, Zhendong Mao*, Kun Zhang, Yongdong Zhang. “Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.” CVPR 2022.  CCF-A


Representative journal papers from the past three years:

● Zhe Li, Lei Zhang, Kun Zhang, Yongdong Zhang, Zhendong Mao*. “Improving Image-Text Matching with Bidirectional Consistency of Cross-Modal Alignment.” IEEE Transactions on Circuits and Systems for Video Technology, 2024. CAS Tier 1

● Zhe Li, Lei Zhang, Kun Zhang, Yongdong Zhang, Zhendong Mao*. “Fast, Accurate, and Lightweight Memory-Enhanced Embedding Learning Framework for Image-Text Retrieval.” IEEE Transactions on Circuits and Systems for Video Technology, 2024. CAS Tier 1

● Jingyu Li, Lei Zhang, Kun Zhang, Bo Hu, Hongtao Xie, Zhendong Mao*. “Cascade Semantic Prompt Alignment Network for Image Captioning.” IEEE Transactions on Circuits and Systems for Video Technology, 2023. CAS Tier 1

● Kun Zhang, Bo Hu, Huatian Zhang, Zhe Li, Zhendong Mao*. “Enhanced Semantic Similarity Learning Framework for Image-Text Matching.” IEEE Transactions on Circuits and Systems for Video Technology, 2023. CAS Tier 1

● Kun Zhang, Zhendong Mao*, An-An Liu, Yongdong Zhang. “Unified Adaptive Relevance Distinguishable Attention Network for Image-Text Matching.” IEEE Transactions on Multimedia, Volume 25, Pages 1320–1332, 2023. CAS Tier 1

● Shancheng Fang, Zhendong Mao*, Hongtao Xie*, Yuxin Wang, Chenggang Yan, Yongdong Zhang. “ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting.” IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022. CAS Tier 1

● Zheren Fu, Zhendong Mao*, Bo Hu, An-An Liu, Yongdong Zhang. “Intra-class Adaptive Augmentation with Neighbor Correction for Deep Metric Learning.” IEEE Transactions on Multimedia, 2022. CAS Tier 1

● Chunxiao Liu, Zhendong Mao*, Tianzhu Zhang, An-An Liu, Bin Wang, Yongdong Zhang. “Focus Your Attention: A Focal Attention for Multimodal Learning.” IEEE Transactions on Multimedia, Volume 24, Pages 103–115, 2022. CAS Tier 1

● Bowen Zhao, Zhendong Mao*, Shancheng Fang, Wenyu Zang, Yongdong Zhang. “Semantically Similarity-wise Dual-branch Network for Scene Graph Generation.” IEEE Transactions on Circuits and Systems for Video Technology, Volume 32, Issue 7, Pages 4573–4583, 2022. CAS Tier 1

● Zheren Fu, Zhendong Mao*, Chenggang Yan, An-An Liu, Hongtao Xie, Yongdong Zhang. “Self-supervised Synthesis Ranking for Deep Metric Learning.” IEEE Transactions on Circuits and Systems for Video Technology, Volume 32, Issue 7, Pages 4736–4750, 2022. CAS Tier 1