研究方向:人工智能生成内容( Artificial Intelligence Generated Content ,AIGC)

AIGC 生成式人工智能 计算可视媒体 领域的重要研究方向, 指的是利用人工智能技术,通过已有数据寻找规律, 从人类创造行为的角度来构建算法 ,并通过预训练大模型、扩散模型等方法,自动生成各种类型的内容,是继专业生产内容(PGC)、用户生产内容(UGC)之后的新型内容创作方式,可以在 图像、视频 、对话、故事、设计和音乐制作等方面,打造新的数字内容生成与交互形式,使得计算机可自行生成内容的同时增强人类的创造力。

我们的理念: 健康生活,快乐科研, 万物可爱,与美同行

我们提供: 宽松的学习科研环境、丰富多彩的小组活动、与海内外顶尖学者长期学术合作的机会、六个月以上互联网大厂实习机会、六个月以上国际交换生留学机会。

招生宣讲材料 (2024年自动化所夏令营) 文件下载 提取码:uvsu

招生宣讲材料 (2023年自动化所夏令营) 文件下载 提取码:81qz

招生宣讲视频 (2020自动化所夏令营“云游AI”片段) 视频播放 提取码: 2x6a

招生宣讲视频(2022年自动化所夏令营): 视频播放 提取码: cue6

微信公众号“计算创意与艺术”:

目前主要研究课题:

2022-11至今,         中国科学院自动化研究所 多模态人工智能系统全国重点实验室 ,研究员

2016-11~2022-10,中国科学院自动化研究所 模式识别国家重点实验室 ,研究员

2010-11~2016-10,中国科学院自动化研究所 模式识别国家重点实验室 ,副研究员

2009-11~2010-10,中国科学院自动化研究所 模式识别国家重点实验室 ,助理研究员

2007-10~2009-10,中国科学院自动化研究所 中欧信息、自动化与应用数学联合实验室 ,博士后

2004-04~2007-06,法国国立信息与自动化研究院( INRIA )/法国 亨利▪庞加莱南锡第一大学 ,博士

2001-09~2004-01 ,清华大学 计算机科学与技术系,工学硕士

1997-09~2001-07, 清华大学 计算机科学与技术系,工学学士

学术论文

  1. Yuxin Zhang, Minyan Luo, Weiming Dong, Xiao Yang, Haibin Huang, Chongyang Ma, Oliver Deussen, Tong-Yee Lee, Changsheng Xu: IP-Prompter: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting. ACM SIGGRAPH (Conference Paper Track) 2025: 122:1-122:12 [ Project Page ][ Paper ]
  2. Yuxin Zhang, Weiming Dong, Fan Tang, Nisha Huang, Haibin Huang, Chongyang Ma, Pengfei Wan, Tong-Yee Lee, Changsheng Xu: MotionCrafter: Plug-and-play Motion Guidance for Diffusion Models. IEEE Transactions on Visualization and Computer Graphics 31(10): 8372-8384 (2025) [ Project Page ][ Paper ]
  3. Nisha Huang, Weiming Dong, Yuxin Zhang, Fan Tang, Ronghui Li, Chongyang Ma, Xiu Li, Tong-Yee Lee, Changsheng Xu: CreativeSynth: Cross-Art-Attention for Artistic Image Synthesis with Multimodal Diffusion. IEEE Transactions on Visualization and Computer Graphics 31(10) : 8425-8438 (2025) [ Project Page ][ Paper ]
  4. Zijun Zhou, Yingying Deng, Xiangyu He, Weiming Dong, Fan Tang: Multi-turn Consistent Image Editing. IEEE/CVF International Conference on Computer Vision (ICCV) 2025
  5. Yandan Wang, Chenqi Guo, Yinglong Ma, Jiangyan Chen, Yuan Gao, Weiming Dong: Bridging Class Imbalance and Partial Labeling via Spectral-Balanced Energy Propagation for Skeleton-based Action Recognition. IEEE/CVF International Conference on Computer Vision (ICCV) 2025
  6. Yuyang Wanyan, Xiaoshan Yang, Weiming Dong, Changsheng Xu: A Comprehensive Review of Few-Shot Action Recognition. International Journal of Computer Vision (2025)
  7. Yingying Deng, Xiangyu He, Fan Tang, Weiming Dong: Z-Magic: Zero-shot Multiple Attributes Guided Image Creator. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2025: 18390-18400
  8. Yu Xu, Fan Tang, Juan Cao, Yuxin Zhang, Oliver Deussen, Weiming Dong, Jintao Li, Tong-Yee Lee: B4M: Breaking Low-Rank Adapter for Making Content-Style Customization. ACM Transactions on Graphics 44(2): 21:1--21:17 (2025)
  9. Nisha Huang, Yuxin Zhang, Fan Tang, Chongyang Ma, Haibin Huang, Weiming Dong, Changsheng Xu: DiffStyler: Controllable Dual Diffusion for Text-Driven Image Stylization. IEEE Transactions on Neural Networks and Learning Systems 36(2): 3370-3383 (2025) [ Code ]
  10. Zhenyu Yang, Yuhang Hu, Zemin Du, Dizhan Xue, Shengsheng Qian, Jiahong Wu, Fan Yang, Weiming Dong, Changsheng Xu. SVBench: A Benchmark with Temporal Multi-Turn Dialogues for Streaming Video Understanding. International Conference on Learning Representations (ICLR) 2025 (Spotlight)
  11. Sifei Li, Weiming Dong, Yuxin Zhang, Fan Tang, Chongyang Ma, Oliver Deussen, Tong-Yee Lee, Changsheng Xu: Dance-to-Music Generation with Encoder-based Textual Inversion. ACM SIGGRAPH Asia (Conference Paper Track) 2024: 135:1-135:11 [ Code&Demo ]
  12. Minyan Luo, Yuxin Zhang, Peng Xu, Tianle Wang, Yihang Bo, Xin Jin, Weiming Dong: Dance Montage through Style Transfer and Music Generation. ACM SIGGRAPH Asia (Art Paper) 2024: 10:1-10:5
  13. Zijun Zhou, Fan Tang, Yuxin Zhang, Oliver Deussen, Juan Cao, Weiming Dong, Xiangtao Li, Tong-Yee Lee: A Comprehensive Evaluation of Arbitrary Image Style Transfer Methods. IEEE Transactions on Visualization and Computer Graphics (2024)
  14. Yingying Deng, Xiangyu He, Fan Tang, Weiming Dong: Z *: Z ero-shot S tyle T ransfer via A ttention R eweighting. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024: 6934-6944 [ Paper ][ Code ]
  15. 董未名:通用人工智能时代的绘画教育与数字美育教学成果评价. 艺术教育 (总第408期): 42-43 (2024)
  16. Hairui Ren, Fan Tang, Xingjia Pan, Juan Cao, Weiming Dong, Zhiwen Lin, Ke Yan, Changsheng Xu: A 2 Pt: Anti-Associative Prompt Tuning for Open Set Visual Recognition. IEEE Transactions on Multimedia 26: 8419-8431 (2024) [ Code ]
  17. Xiaoyu Kong, Yingying Deng, Fan Tang, Weiming Dong, Chongyang Ma, Yongyong Chen, Zhenyu He, Changsheng Xu: Exploring the Temporal Consistency of Arbitrary Style Transfer: A Channel-wise Perspective. IEEE Transactions on Neural Networks and Learning Systems 35(6): 8482-8496 (2024) [ Code ]
  18. Yunbing Jia, Xiaoyu Kong, Fan Tang, Yixing Gao, Weiming Dong, Yi Yang: Revealing the Two Sides of Data Augmentation: An Asymmetric Distillation-based Win-Win Solution for Open-Set Recognition. The 33rd International Joint Conference on Artificial Intelligence (IJCAI) 2024: 911-919
  19. Sifei Li , Yuxin Zhang, Fan Tang, Chongyang Ma, Weiming Dong, Changsheng Xu: Music Style Transfer with Time-Varying Inversion of Diffusion Models. The 38th AAAI Conference on Artificial Intelligence (AAAI) 2024: 547-555 [ Code ] [ Paper ]
  20. Zhenyu Yang, Shengsheng Qian, Dizhan Xue, Jiahong Wu, Fan Yang, Weiming Dong, Changsheng Xu: Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval. ACM Multimedia 2024: 1245-1254
  21. Nisha Huang , Yuxin Zhang , Weiming Dong: Style-A-Video: Agile Diffusion for Arbitrary Text-based Video Style Transfer. Signal Processing Letters 31: 1494-1498(2024)[ Paper ]
  22. Chengcheng Ma, Ismail Elezi , Jiankang Deng , Weiming Dong, Changsheng Xu: Three Heads Are Better than One: Complementary Experts for Long-Tailed Semi-supervised Learning. The 38th AAAI Conference on Artificial Intelligence (AAAI) 2024: 14229-14237 [ Code ] [ Paper ]
  23. Zhenyu Yang, Dizhan Xue, Shengsheng Qian, Weiming Dong, Changsheng Xu: LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Composed Image Retrieval. SIGIR 2024: 80-90 (Best paper honorary mention)
  24. Wu-Qin Liu, Minxuan Lin, Haibin Huang, Chongyang Ma, Weiming Dong: FreeStyler: A Free-Form Stylization Method via Multimodal Vector Quantization. CVM (2) 2024: 259-278
  25. Kexin Wu, Fan Tang, Ning Liu, Oliver Deussen, Thi Ngoc Hanh Le, Weiming Dong, Tong-Yee Lee: Lighting Image/Video Style Transfer Methods by Iterative Channel Pruning. ICASSP 2024: 3800-3804
  26. Yuxin Zhang, Weiming Dong, Fan Tang, Nisha Huang, Haibin Huang, Chongyang Ma, Tong-Yee Lee, Oliver Deussen, Changsheng Xu: ProSpect: Prompt Spectrum for Attribute-Aware Personalization of Diffusion Models. ACM Transactions on Graphics 42(6): 244:1-244:14 (2023) [ Code ][ Paper ]
  27. Yuxin Zhang, Fan Tang, Weiming Dong, Haibin Huang, Chongyang Ma, Tong-Yee Lee, Changsheng Xu: A Unified Arbitrary Style Transfer Framework via Adaptive Contrastive Learning. ACM Transactions on Graphics 42(5): 169:1-169:16 (2023) [ Code ][ Paper ]
  28. Yuxin Zhang, Nisha Huang, Fan Tang, Haibin Huang, Chongyang Ma, Weiming Dong, Changsheng Xu: Inversion-Based Style Transfer with Diffusion Models. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023: 10146-10156 [Code] [ Paper ]
  29. Yuxin Zhang, Fan Tang, Weiming Dong, Thi-Ngoc-Hanh Le, Changsheng Xu, Tong-Yee Lee: Portrait Map Art Generation by Asymmetric Image-to-Image Translation. Leonardo 56(1): 28-36 (2023) (Cover Paper)
  30. Dong Chen, Xingjia Pan, Fan Tang, Weiming Dong, Changsheng Xu: SPA 2 Net: Structure-Preserved Attention Activated Network for Weakly Supervised Object Localization. IEEE Transactions on Image Processing 32: 5779-5793 (2023)
  31. Wuqin Liu, Minxuan Lin, Haibin Huang, Chongyang Ma, Yu Song, Weiming Dong, Changsheng Xu: Emotion-Aware Music Driven Movie Montage . Journal of Computer Science and Technology 38(3): 540-553 (2023)
  32. Sifei Li, Fuzhang Wu, Yuqing Fan, Xue Song, Weiming Dong: PLDGAN: Portrait Line Drawing Generation with Prior Knowledge and Conditioning Target. The Visual Computer 39: 3507–3518 (2023)
  33. 董未名, 邓盈盈, 张宇欣, 黄妮莎: 面向影视制作的风格迁移技术及展望. 影视文化 2022(01): 12-19 (2022)
  34. Shuwei Dong, Xiaoyu Kong, Xingjia Pan, Fan Tang, Wei Li, Yi Chang, Weiming Dong: Semantic-Context Graph Network for Point-based 3D Object Detection. IEEE Transactions on Circuits and Systems for Video Technology 33(11): 6474-6486 (2023)
  35. Chengcheng Ma, Yang Liu, Jiankang Deng, Lingxi Xie, Weiming Dong, Changsheng Xu: Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models. IEEE Transactions on Circuits and Systems for Video Technology 33(9): 4616-4629 (2023) [ Code ]
  36. Yu Song, Fan Tang, Weiming Dong, Feiyue Huang, Tong-Yee Lee, Changsheng Xu: Balance-Aware Grid Collage for Small Image Collections. IEEE Transactions on Visualization and Computer Graphics 29(2): 1330-1344 (2023)
  37. Pei Lv, Jianqi Fan, Xixi Nie, Weiming Dong, Xiaoheng Jiang, Bing Zhou, Mingliang Xu, Changsheng Xu: User-Guided Personalized Image Aesthetic Assessment based on Deep Reinforcement Learning. IEEE Transactions on Multimedia 25: 736-749 (2023)
  38. Shideng Lin, Fan Tang, Weiming Dong, Xingjia Pan, Changsheng Xu: SMNet: Synchronous Multi-scale Low Light Enhancement Network with Local and Global Concern. IEEE Transactions on Multimedia 25: 9506-9517 (2023)
  39. Cong Wang, Fan Tang, Yong Zhang, Tieru Wu, Weiming Dong: Towards Harmonized Regional Style Transfer and Manipulation for Facial Images. Computational Visual Media 9(2): 351-366 (2023)
  40. Zhiyong Huang, Kekai Sheng, Ke Li, Jian Liang, Taiping Yao, Weiming Dong, Dengwen Zhou, Xing Sun: Reciprocal Normalization for Domain Adaptation. Pattern Recognition 140: 109533 (2023)
  41. Chengcheng Ma, Xingjia Pan, Qixiang Ye, Fan Tang, Weiming Dong, Changsheng Xu: CrossRectify: Leveraging Disagreement for Semi-Supervised Object Detection. Pattern Recognition 137: 109280 (2023)
  42. Xue Song, Jiawei Pan, Fuzhang Wu, Weiming Dong: Optimal Composition Recommendation for Portrait Photography. SIGGRAPH Asia Posters 2022: 20:1-20:2
  43. Rui Wang, Nisha Huang, Fan Tang, Weiming Dong, Tong-Yee Lee: Language-driven Diversified Image Retargeting. SIGGRAPH Asia Posters 2022: 19:1-19:2
  44. Nisha Huang, Fan Tang, Weiming Dong, Changsheng Xu: Draw Your Art Dream: Diverse Digital Art Synthesis with Multimodal Guided Diffusion. ACM Multimedia 2022: 1085-1094 [Code]
  45. Yuxin Zhang, Fan Tang, Weiming Dong, Changsheng Xu: Quantification of Artist Representativity within an Art Movement. ICME Workshops on AIART 2022: 1-6
  46. Yuxin Zhang, Fan Tang, Weiming Dong, Haibin Huang, Chongyang Ma, Tong-Yee Lee, Changsheng Xu: Domain Enhanced Arbitrary Image Style Transfer via Contrastive Learning. ACM SIGGRAPH (Conference Paper Track) 2022: 12:1-12:8  [ Code ]
  47. Yingying Deng, Fan Tang, Weiming Dong, Chongyang Ma, Xingjia Pan, Lei Wang, Changsheng Xu: StyTr 2 : Image Style Transfer with Transformers. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022: 11326-11336 [ Code ]
  48. Huapeng Wei, Yingying Deng, Fan Tang, Xingjia Pan, Weiming Dong: A Comparative Study of CNN- and Transformer-Based Visual Style Transfer. Journal of Computer Science and Technology 37(3): 601-614 (2022)
  49. Yifan Xu, Zhijie Zhang, Mengdan Zhang, Kekai Sheng, Ke Li, Weiming Dong, Liqing Zhang, Changsheng Xu, Xing Sun: Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer. The 36th AAAI Conference on Artificial Intelligence (AAAI) 2022: 2964-2972 [ Code&Paper]
  50. Yifan Xu, Kekai Sheng, Weiming Dong, Baoyuan Wu, Changsheng Xu, Bao-Gang Hu: Towards Corruption-Agnostic Robust Domain Adaptation. ACM Transactions on Multimedia Computing, Communications, and Applications 18(4): 99:1-99:16 (2022)
  51. Yifan Xu, Huapeng Wei, Minxuan Lin, Yingying Deng, Kekai Sheng, Mengdan Zhang, Fan Tang, Weiming Dong, Feiyue Huang, Changsheng Xu: Transformers in computational visual media: A survey. Computational Visual Media 8(1): 32-62 (2022) [ Paper ]
  52. Yu Song, Fan Tang, Weiming Dong, Changsheng Xu: Non-dominated sorting based multi-page photo collage. Computational Visual Media 8(2): 199-212 (2022)
  53. Huaiyu Li, Weiming Dong, Bao-Gang Hu: Incremental Concept Learning via Online Generative Memory Recall. IEEE Transactions on Neural Networks and Learning Systems 32(7): 3206-3216 (2021)
  54. Yingying Deng, Fan Tang, Weiming Dong, Chongyang Ma, Feiyue Huang, Oliver Deussen, Changsheng Xu: Exploring the Representativity of Art Paintings. IEEE Transactions on Multimedia 23: 2794-2805 (2021) [ Code ]
  55. Minxuan Lin, Fan Tang, Weiming Dong, Xiao Li, Changsheng Xu, Chongyang Ma: Distribution Aligned Multimodal and Multi-Domain Image Stylization. ACM Transactions on Multimedia Computing, Communications, and Applications 17(3): 96:1-96:17 (2021)
  56. Xingjia Pan, Yingguo Gao, Zhiwen Lin, Fan Tang, Weiming Dong, Haolei Yuan, Feiyue Huang, Changsheng Xu: Unveiling the Potential of Structure-preserving for Weakly Supervised Object Localization. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021: 11642-11651 [ Code ]
  57. Yingying Deng, Fan Tang, Weiming Dong, Haibin Huang, Chongyang Ma, Changsheng Xu: Arbitrary Video Style Transfer via Multi-Channel Correlation. The 35th AAAI Conference on Artificial Intelligence (AAAI) 2021: 1210-1217 [ Code ]
  58. Dong Chen, Fan Tang, Weiming Dong, Hanxing Yao, Changsheng Xu: SiamCPN: Visual Tracking with the Siamese Center-Prediction Network. Computational Visual Media 7(2): 253-265 (2021)
  59. Xingjia Pan, Fan Tang, Weiming Dong, Chongyang Ma, Yiping Meng, Feiyue Huang, Tong-Yee Lee, Changsheng Xu: Content-Based Visual Summarization for Image Collections. IEEE Transactions on Visualization and Computer Graphics 27(4): 2298-2312 (2021) [ Project Page ]
  60. Kekai Sheng, Weiming Dong, Haibin Huang, Guohui Wang, Yong Zhang, Chongyang Ma, Bao-Gang Hu: Learning to Assess Visual Aesthetics of Food Images. Computational Visual Media 7(1): 139-152 (2021) [ Data & Code ]
  61. Yuting Ma, Fan Tang, Weiming Dong, Changsheng Xu: Destylization of Text with Decorative Elements. ACM Multimedia Asia 2020: 14:1-14:7
  62. Yingying Deng, Fan Tang, Weiming Dong, Wen Sun, Feiyue Huang, Changsheng Xu: Arbitrary Style Transfer via Multi-Adaptation Network. ACM Multimedia 2020: 2719-2727 [ Paper ][ Code ]
  63. Xingjia Pan, Fan Tang, Weiming Dong, Yang Gu, Zhichao Song, Yiping Meng, Pengfei Xu, Oilver Deussen, Changsheng Xu: Self-Supervised Feature Augmentation for Large Image Object Detection. IEEE Transactions on Image Processing 29: 6745-6758 (2020)
  64. Xingjia Pan, Yuqiang Ren, Kekai Sheng, Weiming Dong, Haolei Yuan, Xiaowei Guo, Chongyang Ma, Changsheng Xu: Dynamic Refinement Network for Oriented and Densely Packed Object Detection. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020: 11204-11213 (Oral) [ Paper ][ Data & Code ]
  65. Minxuan Lin, Yingying Deng, Fan Tang, Weiming Dong, Changsheng Xu:  Multi-Attribute Guided Painting Generation. The 2nd IEEE Workshop on Artificial Intelligence for Art Creation (AIART) 2020: 400-403 [ Paper ]
  66. Kekai Sheng, Weiming Dong, Menglei Chai, Guohui Wang, Peng Zhou, Feiyue Huang, Bao-Gang Hu, Rongrong Ji, Chongyang Ma: Revisiting Image Aesthetic Assessment via Self-Supervised Feature Learning. The 34th AAAI Conference on Artificial Intelligence (AAAI) 2020: 5709-5716 (Spotlight) [ Paper ]
  67. Fan Tang, Weiming Dong, Yiping Meng, Chongyang Ma, Fuzhang Wu, Xinrui Li, Tong-Yee Lee: Image Retargetability. IEEE Transactions on Multimedia 22(3): 641-654 (2020)
  68. Huaiyu Li, Weiming Dong, Xing Mei, Chongyang Ma, Feiyue Huang, Bao-Gang Hu: LGM-Net: Learning to Generate Matching Networks for Few Shot Learning. International Conference on Machine Learning (ICML) 2019: 3825-3834 [ Code ]
  69. Yong Zhang, Baoyuan Wu, Weiming Dong, Zhifeng Li, Wei Liu, Bao-Gang Hu, Qiang Ji: Joint Representation and Estimator Learning for Facial Action Unit Intensity Estimation. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019: 3457-3466
  70. Fuzhang Wu, Yan Kong, Weiming Dong, Yanjun Wu: Gradient-aware blind face inpainting for deep face verification. Neurocomputing 331: 301-311 (2019)
  71. Yucheng Zhao, Fan Tang, Weiming Dong, Feiyue Huang, Xiaopeng Zhang: Joint face alignment and segmentation via deep multi-task learning. Multimedia Tools and Applications 78(10): 13131–13148 (2019)
  72. Fan Tang, Weiming Dong, Yiping Meng, Xing Mei, Feiyue Huang, Xiaopeng Zhang, Oliver Deussen: Animated Construction of Chinese Brush Paintings. IEEE Transactions on Visualization and Computer Graphics 24(12): 3019-3031 (2018) [ Project Page ]
  73. Kekai Sheng, Weiming Dong, Haibin Huang, Chongyang Ma, Bao-Gang Hu: Gourmet photography dataset for aesthetic assessment of food images. SIGGRAPH Asia Technical Briefs 2018: 20:1-20:4 [ Data & Code ]
  74. Yu Song, Fan Tang, Weiming Dong, Xiaopeng Zhang, Oliver Deussen, Tong-Yee Lee. Photo Squarization by Deep Multi-Operator Retargeting. ACM Multimedia 2018: 1047-1055
  75. Kekai Sheng, Weiming Dong, Chongyang Ma, Xing Mei, Feiyue Huang, Bao-Gang Hu. Attention-based Multi-Patch Aggregation for Image Aesthetic Assessment. ACM Multimedia 2018: 879-886 [ Data & Code ]
  76. Yong Zhang, Weiming Dong, Bao-Gang Hu, Qiang Ji: Weakly-supervised Deep Convolutional Neural Network Learning for Facial Action Unit Intensity Estimation. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2018: 2314-2323
  77. Yong Zhang, Weiming Dong, Bao-Gang Hu, Qiang Ji: Classifier Learning with Prior Probabilities for Facial Action Unit Recognition. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2018: 5108-5116
  78. Yong Zhang, Rui Zhao, Weiming Dong, Bao-Gang Hu, Qiang Ji: Bilateral Ordinal Relevance Multi-Instance Regression for Facial Action Unit Intensity Estimation. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2018: 7034-7043
  79. Huaiyu Li, Weiming Dong, Bao-Gang Hu: Facial Image Attributes Transformation via Conditional Recycle Generative Adversarial Networks. Journal of Computer Science and Technology 33(3): 511-521 (2018)
  80. Kekai Sheng, Weiming Dong, Wei Li, Joseph Razik, Feiyue Huang, Bao-Gang Hu: Centroid-aware local discriminative metric learning in speaker verification. Pattern Recognition 72: 176-185 (2017)
  81. Yingying Deng, Fan Tang, Weiming Dong, Hanxing Yao, Bao-Gang Hu: Style-oriented representative paintings selection. SIGGRAPH ASIA Posters 2017: 12:1-12:2
  82. Xingjia Pan, Juntao Ye, Fan Tang, Weiming Dong, Feiyue Huang, Xiaopeng Zhang: Content-based measure of image set diversity. SIGGRAPH ASIA Posters 2017: 43:1-43:2
  83. Jia Liu, Weiming Dong, Xiaopeng Zhang, Zhiguo Jiang: Orientation judgment for abstract paintings. Multimedia Tools and Applications 76(1): 1017-1036 (2017)
  84. Yong Zhang, Weiming Dong, Chongyang Ma, Xing Mei, Ke Li, Feiyue Huang, Bao-Gang Hu, Oliver Deussen: Data-Driven Synthesis of Cartoon Faces Using Different Styles. IEEE Transactions on Image Processing 26(1): 464-478 (2017)
  85. Weiming Dong, Fuzhang Wu, Yan Kong, Xing Mei, Tong-Yee Lee, Xiaopeng Zhang: Image Retargeting by Texture-Aware Synthesis. IEEE Transactions on Visualization and Computer Graphics 22(2): 1088-1101 (2016)
  86. Haiyong Jiang, Liangliang Nan, Dong-Ming Yan, Weiming Dong, Xiaopeng Zhang, Peter Wonka: Automatic Constraint Detection for 2D Layout Regularization. IEEE Transactions on Visualization and Computer Graphics 22(8): 1933-1944 (2016)
  87. Yan Kong, Weiming Dong, Xing Mei, Chongyang Ma, Tong-Yee Lee, Siwei Lyu, Feiyue Huang, Xiaopeng Zhang: Measuring and Predicting Visual Importance of Similar Objects. IEEE Transactions on Visualization and Computer Graphics 22(12): 2564-2578 (2016)
  88. Fuzhang Wu, Weiming Dong, Yan Kong, Xing Mei, Dong-Ming Yan, Xiaopeng Zhang, Jean-Claude Paul: Feature-aware natural texture synthesis. The Visual Computer 32(1): 43-55 (2016)
  89. Yiping Meng, Fan Tang, Weiming Dong, Xiaopeng Zhang: Optimal character composing for Chinese calligraphic artwork. SIGGRAPH Asia Posters 2016: 25
  90. Kekai Sheng, Weiming Dong, Yan Kong, Xing Mei, Jilin Li, Chengjie Wang, Feiyue Huang, Bao-Gang Hu: Evaluating the Quality of Face Alignment without Ground Truth. Computer Graphics Forum 34(7): 213-223 (2015)
  91. Xing Mei, Weiming Dong, Bao-Gang Hu, Siwei Lyu: UniHIST: A unified framework for image restoration with marginal histogram constraints. IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2015: 3753-3761
  92. Fuzhang Wu, Dong-Ming Yan, Weiming Dong, Xiaopeng Zhang, Peter Wonka: Inverse Procedural Modeling of Facade Layouts. A CM Transactions on Graphics (Proceedings of SIGGRAPH) 33(4): 121:1-121:10 (2014)
  93. Weiming Dong, Ning Zhou, Tong-Yee Lee, Fuzhang Wu, Yan Kong, Xiaopeng Zhang: Summarization-Based Image Resizing by Intelligent Object Carving. IEEE Transactions on Visualization and Computer Graphics 20(1): 111-124 (2014)
  94. Dengwen Zhou, Weiming Dong, Wengang Chen. Joint demosaicking and zooming using moderate spectral correlation and consistent edge map . Journal of Electronic Imaging 23(4): 034310 (2014)
  95. Yong Zhang, Weiming Dong, Oliver Deussen, Feiyue Huang, Ke Li, Bao-Gang Hu: Data-driven face cartoon stylization. SIGGRAPH ASIA Technical Briefs 2014: 14:1-14:4
  96. Fuzhang Wu, Weiming Dong, Yan Kong, Xing Mei, Jean-Claude Paul, Xiaopeng Zhang: Content-Based Colour Transfer. Computer Graphics Forum 32(1): 190-203 (2013)
  97. Yan Kong, Weiming Dong, Xing Mei, Xiaopeng Zhang, Jean-Claude Paul: SimLocator: robust locator of similar objects in images. The Visual Computer 29(9): 861-870 (2013)
  98. Xing Mei, Xun Sun, Weiming Dong, Haitao Wang, Xiaopeng Zhang: Segment-Tree Based Cost Aggregation for Stereo Matching. IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2013: 313-320
  99. Dengwen Zhou, Xiaoliu Shen, Weiming Dong: Image zooming using directional cubic convolution interpolation. IET Image Processing (6): 627–634 (2012)
  100. Weiming Dong, Guan-Bo Bao, Xiaopeng Zhang, Jean-Claude Paul: Fast Multi-Operator Image Resizing and Evaluation. Journal of Computer Science and Technology 27(1): 121-134 (2012)
  101. Dengwen Zhou, Xiaoliu Shen, Weiming Dong: Colour demosaicking with directional filtering and weighting. IET Image Processing 6(8): 1084–1092 (2012)
  102. Weiming Dong, Guanbo Bao, Xiaopeng Zhang, Jean-Claude Paul: Fast Local Color Transfer via Dominant Colors Mapping. SIGGRAPH ASIA Technical Sketches 2010: 46:1-46:2
  103. Weiming Dong, Ning Zhou, Jean-Claude Paul, Xiaopeng Zhang: Optimized image resizing using seam carving and scaling. ACM Transactions on Graphics 28(5): 125:1-125:10 (2009)
  104. Weiming Dong, Ning Zhou, Jean-Claude Paul: Robust tile-based texture synthesis using artificial immune system. Neural Computing and Applications 18(3): 223-235 (2009)
  105. Weiming Dong, Ning Zhou, Jean-Claude Paul. Perspective-aware texture analysis and synthesis. The Visual Computer 24(7-9): 515-523 (2008)
  106. Ning Zhou, Jiaxin Wang, Weiming Dong, Jean-Claude Paul: Modeling and Visualization of Flower Color Patterns. CAD/Graphics 2007: 150-155
  107. Weiming Dong, Ning Zhou, Jean-Claude Paul: Optimized tile-based texture synthesis. Graphics Interface 2007: 249-256
  108. Weiming Dong: Rendering Optical Effects Based on Spectra Representation in Complex Scenes. Computer Graphics International 2006: 719-726

发明专利

  1. 盛柯恺; 董未名; 马重阳; 梅星; 胡包钢 ; 基于注意力机制的通用图像美学评估方法、系统及设备, 2021-4-27, 中国, ZL201910086789.X
  2. 宋玉; 唐帆; 董未名; 徐常胜 ; 图片方形化缩放方法、系统及装置, 2020-11-10, 中国, ZL201811545250.8
  3. 邓盈盈; 唐帆; 董未名; 徐常胜 ; 自动挑选画家代表作的方法及装置, 2020-11-30, 中国, ZL201810759512.4
  4. 潘兴甲; 董未名; 袁豪磊; 盛柯恺; 林志文; 高英国; 任玉强; 郭晓威; 黄小明; 黄飞跃 ; 目标检测方法、装置、设备及存储介质, 2020-10-12, 中国, ZL202011085853.1
  5. 唐帆; 余宗桥; 黄飞跃; 李季檩; 李科; 吴永坚; 董未名; 孟一平 ; 图像处理方法和装置(水墨动态绘制过程重构), 2019-8-27, 中国, ZL201410505493.4
  6. Wang, Chengjie; Li, Jilin; Huang, Feiyue; Sheng, Kekai; Dong, Weiming ; Evaluation method and evaluation device for facial key point positioning result, 2017-8-7, 美国, US 10,706,263 B2
  7. 樊艳波; 董未名; 胡包钢 ; 基于自适应阈值调整拒识子空间学习的人脸检测方法, 2016-04-13, 中 国, ZL201510811406.2

  • 张宇欣,可视媒体视觉属性表示与可控生成,博士,2025 [ 论文下载 ,百度网盘提取码: f4s4 ]

  • 黄妮莎,多模态信息引导的艺术图像与视频生成研究,硕士,2024 [ 论文下载 ,百度网盘提取码:hnk9]

  • 刘伍琴,面向影视再创作的多模态引导可视媒体编辑,硕士,2024 [ 论文下载 ,百度网盘提取码:8ahj]

  • 李岚祺,基于图神经网络的分子相互作用关系预测方法研究,硕士,2024 [ 论文下载 百度网盘提取码:j6bn]

  • 宋雪,知识与数据共同驱动的人像构图推荐算法研究与应用,硕士,2023 [ 论文下载 ,百度网盘提取码:4rvz]

  • 许逸凡,标签稀缺条件下的视觉模型可迁移性研究,硕士,2022 [ 论文下载 ,百度网盘提取码:3xt7]

  • 邓盈盈,风格导向的绘画作品生成与分析,博士,2022 [ 论文下载 ,百度网盘提取码:ff68]

  • 林诗登,基于深度学习的暗光图像与视频增强,硕士,2022 [ 论文下载 ,百度网盘提取码:vmhq]

  • 范宇擎,基于属性学习的图像质量评估算法研究,硕士,2022 [ 论文下载 ,百度网盘提取码:niq1]

  • 宋玉,内容相关的图像呈现方法研究及应用,博士,2022 [ 论文下载 ,百度网盘提取码:tw7i]

  • 林敏轩,基于对抗学习的多域艺术图像生成,硕士,2021 [ 论文下载 ,百度网盘提取码:t3jx]

  • 潘兴甲,复杂环境下的图像目标检测与可视化,博士,2020 [ 论文下载 ,百度网盘提取码: djd7]

  • 李怀宇,面向非平稳环境的知识迁移方法研究,博士,2020 [ 论文下载 ,百度网盘提取码:pwhz]

  • 唐帆,中国水墨作品数字化创作重构研究,博士,2019 [ 论文下载 ,百度网盘提取码:p59i]

  • 盛柯恺,图像美学质量评估的方法与应用,博士,2019 [ 论文下载 ,百度网盘提取码:ghnk]

  • 张勇,知识与数据共同驱动的面部行为分析与人脸卡通画合成,博士,2018 [ 论文下载 ,百度网盘提取码:vi4k]

  • 赵昱程,人脸图像分析与妆容图像合成,硕士,2018 [ 论文下载 ,百度网盘提取码:djmm]

  • 孟一平,图像可缩放度的研究与应用,硕士,2017 [ 论文下载 ,百度网盘提取码:3338]

  • 孔彦,图像内容的相似模式分析,博士,2016 [ 论文下载 ,百度网盘提取码:qhyf]

  • 吴富章,内容相关的图像合成,博士,2015 [ 论文下载 ,百度网盘提取码:vdgk]

  • 百度奖学金 (全球10人/年):张宇欣(2023)

  • CCF-凌迪图形学奖学金 (全国10人/年):张宇欣(2023)

  • 中国科学院院长优秀奖 :张宇欣(2025)

  • 国家奖学金(研究生) :李思霏(2024)、杨振宇(2024)、周梓骏(2024)、张宇欣(2022)、邓盈盈(2021)、吴富章(2015)

  • 国家奖学金(本科生) :王诗文(2024)、谭米宁(2023 )、李思霏(2021)

  • 北京市优秀毕业生 :骆敏言(2025,本科生)、黄妮莎(2024,研究生)

  • 中国科学院大学校级优秀毕业论文(本科生) :沈菲尔(2025)

  • 本科生启研项目 :骆敏言(2024)

  • 本科生大创项目 :尹子娇(2025)、骆敏言(2024)

  • 腾讯技术大咖: 潘兴甲(2020)、盛柯恺(2019)、张勇(2018)、孔彦(2016)、吴富章(2015)

  • 腾讯犀牛鸟精英人才计划 :李思霏(2024)、许逸凡(2022)

  • 中国电子学会—腾讯博士生科研激励计划(混元大模型专项) :杨振宇(2025)

科研项目

  1. 蚂蚁集团,可控视频生成技术,2024/12-2025/12
  2. 快手,可控高质量视频生成和编辑,2024/08-2025/07
  3. 快手,扩散模型的可解释性与可控性研究,2023/06-2024/07
  4. 新一代人工智能国家科技重大专项,认知计算基础理论与方法研究,2020/11-2023/10
  5. 北京市自然科学基金-丰台轨道交通前沿研究联合项目,恶劣天气下列车前向障碍物检测关键技术研究,2023/01-2025/12
  6. 国家自然科学基金重点项目,基于视觉认知的可视媒体合成与评价,2019/01-2023/12
  7. NSFC企业创新发展 联合 基金重点项目,知识和数据共同驱动的小样本目标识别理论和方法,2021/01-2024/12
  8. 中文在线,真人照片转指定风格人像技术开发,2022/06-2022/10
  9. 腾讯优图实验室,“优图研究”联合项目第九期,2021/08-2022/07
  10. 腾讯优图实验室,“优图研究”联合项目第八期,2020/08-2021/07
  11. 腾讯优图实验室,“优图研究”联合项目第七期,2019/08-2020/07
  12. 远鉴科技,图像内容合成与质量评价,2019/08-2020/07
  13. 腾讯优图实验室,“优图研究”联合项目第六期,2018/08-2019/07
  14. 国家重点研发计划,社会安全事件智能监测与预警关键技术与装备,2018/07-2021/06
  15. 中科院自动化所-亮亮视野“第一视角计算”联合实验室,2018/07-2021/06
  16. 咪咕视频,视频精细化标签AI能力定制,2018/08-2019/07
  17. 腾讯优图实验室,“优图研究”联合项目第五期,2017/08-2018/07
  18. 中国科学院,卢嘉锡国际合作团队项目,2018/01-2020/12
  19. 国家自然科学基金面上项目,数据驱动的图像合成,2017/01-2020/12
  20. 北京市自然基金面上项目,单图像超分辨率技术与应用,2016/01-2018/12
  21. 腾讯,“优图研究”联合项目第四期,2016/08-2017/07
  22. 爱奇艺,视频智能编辑创作系统,2016/02-2016/12
  23. “优图研究”联合项目第三期,2015/08-2016/07
  24. 核高基重大专项课题分任务,开源操作系统内核分析和安全性评估:基于人脸识别的关键应用程序保护,2015/04-2015/12
  25. 腾讯,“优图研究”联合项目第二期,2014/08-2015/07
  26. 国家自然科学基金面上项目,基于梯度场的计算成像和恢复技术,2014/01-2017/12
  27. 腾讯,“优图研究”联合项目第一期,2013/08-2014/07
  28. 北京市自然基金面上项目,内容相关的图像合成研究与应用,2011/01-2013/12
  29. 法国国家科研署国际合作项目,Shape Modeling: New theories and new algorithms,2010/01-2012/12
  30. 企业委托(上海市科技信息中心),个性化影视动漫制作关键技术研发,2009/12-2010/11
  31. 中国博士后科学基金特别资助,基于图像的植物建模与绘制,2008/11-2009/10
  32. 教育部留学归国人员启动基金,自然景物建模与渲染中的若干问题研究,2008/08-2010/07
  33. 中国博士后科学基金面上项目,真实植物场景数字化与可视化,2008/08-2009/07
  34. 科技部国际合作项目,自然植被景观的动态演变模拟与应用,2007/09-2010/10


媒体报道/采访


学生姓名  培养单位 (包括联合培养) 学位类别   入学/毕业时间    研究课题                                  毕业去向

吴富章     中国科学院自动化研究所     硕博          2010/2015         内容相关的图像合成                  腾讯优图实验室(技术大咖)

李超        北京大学                               硕士          2010/2012         图像物体材质分析与传递           德克萨斯大学达拉斯分校(读博)

沈思成     西北师范大学                       硕士          2013/2015         基于深度学习的人脸识别           远鉴科技(产品部总监)

孔彦        中国科学院自动化研究所     硕博           2011/2016          图像内容的相似模式分析          远鉴科技(图像部总监)、腾讯技术大咖

李志磊     西北师范大学                      硕士           2014/2016        人脸识别与活体验证                  中国邮政集团公司

温祥        北京交通大学                      硕士           2014/2016         基于深度学习的图像内容识别     网易

徐国智     华北电力大学                      硕士           2014/2017        基于深度学习的人脸配准           网易

刘园园     华北电力大学                      硕士          2014/2017         基于深度学习的花卉图像分类    国家电网

孟一平     中国科学院自动化研究所    硕士          2014/2017         图像可缩放度研究与应用           滴滴出行->快手

唐帆        中国科学院自动化研究所     硕博           2013/2019        中国水墨作品数字化创作重构    远鉴科技->吉林大学

张勇        中国科学院自动化研究所     硕博           2012/2018        人脸面部行为分析与卡通合成    腾讯AI Lab

赵昱程     中国科学院自动化研究所     硕士           2015/2018       人脸妆容图像分析与合成          阿里巴巴->字节跳动

盛柯恺     中国科学院自动化研究所     硕博           2014/2019       图像美学评估的方法与应用       腾讯优图实验室(技术大咖)

李怀宇     中国科学院自动化研究所     硕博           2014/2020        面向非平稳环境的知识迁移方法研究  快手

潘兴甲     中国科学院自动化研究所     硕博           2015/2020         通用目标检测                          腾讯优图实验室(技术大咖)

胡广宇     华北电力大学                       硕士           2015/2018        视频人脸虚拟美颜                    远鉴科技

李欣芮     华北电力大学                       硕士           2016/2019        时序数据分析与挖掘                国家电网研究院

周鹏        华北电力大学                        硕士           2016/2019        目标检测与分割                       远鉴科技

邓盈盈     中国科学院自动化研究所     硕博           2017/2022         艺术图像分析与合成                  华为

宋玉        中国科学院自动化研究所      普博           2017/2021        可视媒体呈现                           北京科技大学

孙秀秀     华北电力大学                       硕士           2017/2020        商品图像识别                          上海电网

林敏轩     中国科学院自动化研究所     硕士           2018/2021         图像风格化                              快手

马雨廷     中国科学院自动化研究所     硕士           2018/2021         文字风格化与去风格化             中信银行

张旭龙     国科大人工智能学院            硕士           2018/2021        面向智能眼镜的目标检测          建设银行

陈东         国科大人工智能学院            硕士           2018/2021         目标跟踪                                吉林大学(读博)

黄志勇     华北电力大学                       硕士           2018/2021        领域自适应                               字节跳动

许逸凡     中国科学院自动化研究所     硕士           2019/2022        视觉模型可迁移性                     中科院自动化所(读博)

范宇擎     国科大人工智能学院            硕士           2019/2022        图像质量评价                            中科院软件所

林诗登     国科大人工智能学院            硕士           2019/2022        图像与视频暗光增强                   京东

宋雪        郑州大学                             硕士           2020/2023        人像拍照姿态推荐                       中科院软件所

张宇欣     中国科学院自动化研究所     硕博           2020/2025       可视媒体可控生成                      字节跳动

刘伍琴     国科大人工智能学院            硕士           2021/2024       多模态可视媒体生成                    快手

黄妮莎     国科大人工智能学院            硕士           2021/2024       多模态可视媒体生成                    清华大学(读博)

李岚琪     郑州大学                             硕士           2021/2024       AI+化学                                      深圳证券交易所

李思霏     中国科学院自动化研究所     硕博           2022/                音乐生成

杨晗        国科大人工智能学院            硕士           2022/2025     多模态可视媒体呈现                        美团

马赛赛     郑州大学                             硕士           2022/2025         多模态可视媒体生成

任宥衡     郑州大学                             硕士           2022/                多模态可视媒体生成                   字节跳动

杜俊萱     国科大人工智能学院            直博           2023/                多模态可视媒体生成

谭米宁    中国科学院自动化研究所      硕士           2024/                多模态可视媒体生成

骆敏言   中国科学院自动化研究所       直博           2025/                多模态可视媒体生成

沈菲尔 国科大人工智能学院 直博           2025/                舞蹈生成