研究方向:人工智能生成内容(
Artificial Intelligence Generated Content
,AIGC)
AIGC
是
生成式人工智能
和
计算可视媒体
领域的重要研究方向,
指的是利用人工智能技术,通过已有数据寻找规律,
从人类创造行为的角度来构建算法
,并通过预训练大模型、扩散模型等方法,自动生成各种类型的内容,是继专业生产内容(PGC)、用户生产内容(UGC)之后的新型内容创作方式,可以在
图像、视频
、对话、故事、设计和音乐制作等方面,打造新的数字内容生成与交互形式,使得计算机可自行生成内容的同时增强人类的创造力。
我们的理念:
健康生活,快乐科研,
万物可爱,与美同行
我们提供:
宽松的学习科研环境、丰富多彩的小组活动、与海内外顶尖学者长期学术合作的机会、六个月以上互联网大厂实习机会、六个月以上国际交换生留学机会。
招生宣讲材料
(2024年自动化所夏令营)
:
文件下载
提取码:uvsu
招生宣讲材料
(2023年自动化所夏令营)
:
文件下载
提取码:81qz
招生宣讲视频
(2020自动化所夏令营“云游AI”片段)
:
视频播放
提取码: 2x6a
招生宣讲视频(2022年自动化所夏令营):
视频播放
提取码: cue6
微信公众号“计算创意与艺术”:
目前主要研究课题:
2022-11至今, 中国科学院自动化研究所
多模态人工智能系统全国重点实验室
,研究员
2016-11~2022-10,中国科学院自动化研究所
模式识别国家重点实验室
,研究员
2010-11~2016-10,中国科学院自动化研究所
模式识别国家重点实验室
,副研究员
2009-11~2010-10,中国科学院自动化研究所
模式识别国家重点实验室
,助理研究员
2007-10~2009-10,中国科学院自动化研究所
中欧信息、自动化与应用数学联合实验室
,博士后
2004-04~2007-06,法国国立信息与自动化研究院(
INRIA
)/法国
亨利▪庞加莱南锡第一大学
,博士
2001-09~2004-01
,清华大学
计算机科学与技术系,工学硕士
1997-09~2001-07,
清华大学
计算机科学与技术系,工学学士
学术论文
-
Yuxin Zhang, Minyan Luo, Weiming Dong, Xiao Yang, Haibin Huang, Chongyang Ma, Oliver Deussen, Tong-Yee Lee, Changsheng Xu: IP-Prompter: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting.
ACM SIGGRAPH
(Conference Paper Track) 2025: 122:1-122:12 [
Project Page
][
Paper
]
-
Yuxin Zhang, Weiming Dong, Fan Tang, Nisha Huang, Haibin Huang, Chongyang Ma, Pengfei Wan, Tong-Yee Lee, Changsheng Xu: MotionCrafter: Plug-and-play Motion Guidance for Diffusion Models.
IEEE Transactions on Visualization and Computer Graphics
31(10): 8372-8384
(2025) [
Project Page
][
Paper
]
-
Nisha Huang, Weiming Dong, Yuxin Zhang, Fan Tang, Ronghui Li, Chongyang Ma, Xiu Li, Tong-Yee Lee, Changsheng Xu: CreativeSynth: Cross-Art-Attention for Artistic Image Synthesis with Multimodal Diffusion.
IEEE Transactions on Visualization and Computer Graphics
31(10)
:
8425-8438
(2025) [
Project Page
][
Paper
]
-
Zijun Zhou, Yingying Deng, Xiangyu He, Weiming Dong, Fan Tang: Multi-turn Consistent Image Editing.
IEEE/CVF International Conference on Computer Vision (ICCV)
2025
-
Yandan Wang, Chenqi Guo, Yinglong Ma, Jiangyan Chen, Yuan Gao, Weiming Dong: Bridging Class Imbalance and Partial Labeling via Spectral-Balanced Energy Propagation for Skeleton-based Action Recognition.
IEEE/CVF International Conference on Computer Vision (ICCV)
2025
-
Yuyang Wanyan, Xiaoshan Yang, Weiming Dong, Changsheng Xu: A Comprehensive Review of Few-Shot Action Recognition.
International Journal of Computer Vision
(2025)
-
Yingying Deng, Xiangyu He, Fan Tang, Weiming Dong: Z-Magic: Zero-shot Multiple Attributes Guided Image Creator.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
2025:
18390-18400
-
Yu Xu, Fan Tang, Juan Cao, Yuxin Zhang, Oliver Deussen, Weiming Dong, Jintao Li, Tong-Yee Lee: B4M: Breaking Low-Rank Adapter for Making Content-Style Customization.
ACM Transactions on Graphics
44(2): 21:1--21:17
(2025)
-
Nisha Huang, Yuxin Zhang, Fan Tang, Chongyang Ma, Haibin Huang, Weiming Dong, Changsheng Xu: DiffStyler: Controllable Dual Diffusion for Text-Driven Image Stylization.
IEEE Transactions on Neural Networks and Learning Systems
36(2): 3370-3383 (2025) [
Code
]
-
Zhenyu Yang, Yuhang Hu, Zemin Du, Dizhan Xue, Shengsheng Qian, Jiahong Wu, Fan Yang, Weiming Dong, Changsheng Xu. SVBench: A Benchmark with Temporal Multi-Turn Dialogues for Streaming Video Understanding.
International Conference on Learning Representations (ICLR)
2025 (Spotlight)
-
Sifei Li, Weiming Dong, Yuxin Zhang, Fan Tang, Chongyang Ma, Oliver Deussen, Tong-Yee Lee, Changsheng Xu: Dance-to-Music Generation with Encoder-based Textual Inversion.
ACM SIGGRAPH Asia
(Conference Paper Track) 2024: 135:1-135:11 [
Code&Demo
]
-
Minyan Luo, Yuxin Zhang, Peng Xu, Tianle Wang, Yihang Bo, Xin Jin, Weiming Dong: Dance Montage through Style Transfer and Music Generation.
ACM SIGGRAPH Asia
(Art Paper) 2024: 10:1-10:5
-
Zijun Zhou, Fan Tang, Yuxin Zhang, Oliver Deussen, Juan Cao, Weiming Dong, Xiangtao Li, Tong-Yee Lee: A Comprehensive Evaluation of Arbitrary Image Style Transfer Methods.
IEEE Transactions on Visualization and Computer Graphics
(2024)
-
Yingying Deng, Xiangyu He, Fan Tang, Weiming Dong:
Z
*:
Z
ero-shot
S
tyle
T
ransfer via
A
ttention
R
eweighting.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
2024: 6934-6944 [
Paper
][
Code
]
-
董未名:通用人工智能时代的绘画教育与数字美育教学成果评价.
艺术教育
(总第408期): 42-43 (2024)
-
Hairui Ren, Fan Tang, Xingjia Pan, Juan Cao, Weiming Dong, Zhiwen Lin, Ke Yan, Changsheng Xu: A
2
Pt: Anti-Associative Prompt Tuning for Open Set Visual Recognition.
IEEE Transactions on Multimedia
26: 8419-8431 (2024) [
Code
]
-
Xiaoyu Kong,
Yingying Deng, Fan Tang, Weiming Dong, Chongyang Ma, Yongyong Chen,
Zhenyu He, Changsheng Xu: Exploring the Temporal Consistency of
Arbitrary Style Transfer: A Channel-wise Perspective.
IEEE Transactions on Neural Networks and Learning Systems
35(6): 8482-8496 (2024) [
Code
]
-
Yunbing Jia, Xiaoyu Kong, Fan Tang, Yixing Gao, Weiming Dong, Yi Yang: Revealing the Two Sides of Data Augmentation: An Asymmetric Distillation-based Win-Win Solution for Open-Set Recognition.
The 33rd International Joint Conference on Artificial Intelligence (IJCAI)
2024: 911-919
-
Sifei Li
,
Yuxin Zhang, Fan Tang, Chongyang Ma, Weiming Dong, Changsheng Xu: Music
Style Transfer with Time-Varying Inversion of Diffusion Models.
The 38th AAAI Conference on Artificial Intelligence (AAAI)
2024: 547-555
[
Code
]
[
Paper
]
-
Zhenyu Yang, Shengsheng Qian, Dizhan Xue, Jiahong Wu, Fan Yang, Weiming Dong, Changsheng Xu: Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval.
ACM Multimedia
2024: 1245-1254
-
Nisha Huang , Yuxin Zhang , Weiming Dong: Style-A-Video: Agile Diffusion for Arbitrary Text-based Video Style Transfer.
Signal Processing Letters
31: 1494-1498(2024)[
Paper
]
-
Chengcheng Ma,
Ismail Elezi
,
Jiankang Deng
, Weiming Dong, Changsheng Xu: Three Heads Are Better than One: Complementary Experts for Long-Tailed Semi-supervised Learning.
The 38th AAAI Conference on Artificial Intelligence (AAAI)
2024: 14229-14237
[
Code
]
[
Paper
]
-
Zhenyu Yang, Dizhan Xue, Shengsheng Qian, Weiming Dong, Changsheng Xu: LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Composed Image Retrieval.
SIGIR
2024: 80-90 (Best paper honorary mention)
-
Wu-Qin Liu, Minxuan Lin, Haibin Huang, Chongyang Ma, Weiming Dong: FreeStyler: A Free-Form Stylization Method via Multimodal Vector Quantization.
CVM
(2) 2024: 259-278
-
Kexin Wu, Fan Tang, Ning Liu, Oliver Deussen, Thi Ngoc Hanh Le, Weiming Dong, Tong-Yee Lee: Lighting Image/Video Style Transfer Methods by Iterative Channel Pruning.
ICASSP
2024: 3800-3804
-
Yuxin Zhang, Weiming Dong,
Fan Tang,
Nisha Huang, Haibin Huang, Chongyang Ma,
Tong-Yee Lee, Oliver Deussen, Changsheng Xu: ProSpect: Prompt Spectrum for Attribute-Aware Personalization of Diffusion Models.
ACM Transactions on Graphics
42(6): 244:1-244:14 (2023) [
Code
][
Paper
]
-
Yuxin Zhang, Fan Tang, Weiming Dong, Haibin Huang, Chongyang Ma,
Tong-Yee Lee, Changsheng Xu: A Unified Arbitrary Style Transfer Framework via Adaptive Contrastive Learning.
ACM Transactions on Graphics
42(5): 169:1-169:16 (2023) [
Code
][
Paper
]
-
Yuxin Zhang, Nisha Huang, Fan Tang, Haibin Huang, Chongyang Ma, Weiming Dong, Changsheng Xu: Inversion-Based Style Transfer with Diffusion Models.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
2023: 10146-10156
[Code]
[
Paper
]
-
Yuxin
Zhang, Fan Tang, Weiming Dong, Thi-Ngoc-Hanh Le, Changsheng Xu,
Tong-Yee Lee: Portrait Map Art Generation by Asymmetric Image-to-Image
Translation.
Leonardo
56(1): 28-36 (2023) (Cover Paper)
-
Dong Chen, Xingjia Pan, Fan Tang, Weiming Dong, Changsheng Xu: SPA
2
Net: Structure-Preserved Attention Activated Network for Weakly Supervised Object Localization.
IEEE Transactions on Image Processing
32: 5779-5793 (2023)
-
Wuqin Liu, Minxuan Lin, Haibin Huang, Chongyang Ma, Yu Song, Weiming Dong, Changsheng Xu:
Emotion-Aware Music Driven Movie Montage
.
Journal of Computer Science and Technology
38(3): 540-553 (2023)
-
Sifei Li, Fuzhang Wu, Yuqing Fan, Xue Song, Weiming Dong: PLDGAN: Portrait Line Drawing Generation with Prior Knowledge and Conditioning Target.
The Visual Computer
39: 3507–3518 (2023)
-
董未名, 邓盈盈, 张宇欣, 黄妮莎: 面向影视制作的风格迁移技术及展望.
影视文化
2022(01): 12-19 (2022)
-
Shuwei Dong, Xiaoyu Kong, Xingjia Pan, Fan Tang, Wei Li, Yi Chang, Weiming Dong: Semantic-Context Graph Network for Point-based 3D Object Detection.
IEEE Transactions on Circuits and Systems for Video Technology
33(11): 6474-6486 (2023)
-
Chengcheng Ma, Yang Liu, Jiankang Deng, Lingxi Xie, Weiming Dong, Changsheng Xu: Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models.
IEEE Transactions on Circuits and Systems for Video Technology
33(9): 4616-4629 (2023) [
Code
]
-
Yu Song, Fan
Tang, Weiming Dong, Feiyue Huang, Tong-Yee Lee, Changsheng Xu:
Balance-Aware Grid Collage for Small Image Collections.
IEEE Transactions on Visualization and Computer Graphics
29(2): 1330-1344 (2023)
-
Pei Lv, Jianqi Fan, Xixi Nie, Weiming Dong, Xiaoheng Jiang, Bing Zhou, Mingliang Xu, Changsheng Xu: User-Guided Personalized Image Aesthetic Assessment based on Deep Reinforcement Learning.
IEEE Transactions on Multimedia
25: 736-749 (2023)
-
Shideng Lin, Fan Tang, Weiming Dong, Xingjia Pan, Changsheng Xu: SMNet: Synchronous Multi-scale Low Light Enhancement Network with Local and Global Concern.
IEEE Transactions on Multimedia
25: 9506-9517 (2023)
-
Cong Wang, Fan Tang, Yong Zhang, Tieru Wu, Weiming Dong: Towards Harmonized Regional Style Transfer and Manipulation for Facial Images.
Computational Visual Media
9(2): 351-366 (2023)
-
Zhiyong Huang, Kekai Sheng, Ke Li, Jian Liang, Taiping Yao, Weiming Dong, Dengwen Zhou, Xing Sun: Reciprocal Normalization for Domain Adaptation.
Pattern Recognition
140: 109533 (2023)
-
Chengcheng Ma, Xingjia Pan, Qixiang Ye, Fan Tang, Weiming Dong, Changsheng Xu: CrossRectify: Leveraging Disagreement for Semi-Supervised Object Detection.
Pattern Recognition
137: 109280 (2023)
-
Xue Song, Jiawei Pan, Fuzhang Wu, Weiming Dong: Optimal Composition Recommendation for Portrait Photography.
SIGGRAPH Asia
Posters 2022: 20:1-20:2
-
Rui Wang, Nisha Huang, Fan Tang, Weiming Dong, Tong-Yee Lee: Language-driven Diversified Image Retargeting.
SIGGRAPH Asia
Posters 2022: 19:1-19:2
-
Nisha Huang, Fan Tang, Weiming Dong, Changsheng Xu: Draw Your Art Dream: Diverse Digital Art Synthesis with Multimodal Guided Diffusion.
ACM Multimedia
2022: 1085-1094
[Code]
-
Yuxin Zhang, Fan Tang, Weiming Dong, Changsheng Xu: Quantification of Artist Representativity within an Art Movement.
ICME Workshops on AIART
2022: 1-6
-
Yuxin Zhang, Fan Tang, Weiming Dong, Haibin Huang, Chongyang Ma, Tong-Yee Lee, Changsheng Xu: Domain Enhanced Arbitrary Image Style Transfer via Contrastive Learning.
ACM SIGGRAPH
(Conference Paper Track) 2022: 12:1-12:8 [
Code
]
-
Yingying Deng, Fan Tang, Weiming Dong, Chongyang Ma, Xingjia Pan, Lei Wang, Changsheng Xu: StyTr
2
: Image Style Transfer with Transformers.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
2022: 11326-11336
[
Code
]
-
Huapeng Wei, Yingying Deng, Fan Tang, Xingjia Pan, Weiming Dong: A Comparative Study of CNN- and Transformer-Based Visual Style Transfer.
Journal of Computer Science and Technology
37(3): 601-614 (2022)
-
Yifan Xu, Zhijie Zhang, Mengdan Zhang, Kekai Sheng, Ke Li, Weiming Dong, Liqing Zhang, Changsheng Xu, Xing Sun: Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer.
The 36th AAAI Conference on Artificial Intelligence (AAAI)
2022: 2964-2972
[
Code&Paper]
-
Yifan Xu, Kekai Sheng, Weiming Dong, Baoyuan Wu, Changsheng Xu, Bao-Gang Hu: Towards Corruption-Agnostic Robust Domain Adaptation.
ACM Transactions on Multimedia Computing, Communications, and Applications
18(4): 99:1-99:16 (2022)
-
Yifan Xu, Huapeng Wei, Minxuan Lin, Yingying Deng, Kekai Sheng, Mengdan Zhang, Fan Tang, Weiming Dong, Feiyue Huang, Changsheng Xu: Transformers in computational visual media: A survey.
Computational Visual Media
8(1): 32-62 (2022) [
Paper
]
-
Yu Song, Fan
Tang, Weiming Dong, Changsheng Xu: Non-dominated sorting based
multi-page photo collage.
Computational Visual Media
8(2): 199-212
(2022)
-
Huaiyu Li, Weiming Dong, Bao-Gang Hu: Incremental Concept Learning via Online Generative Memory Recall.
IEEE Transactions on Neural Networks and Learning Systems
32(7): 3206-3216 (2021)
-
Yingying Deng, Fan Tang, Weiming Dong, Chongyang Ma, Feiyue Huang,
Oliver Deussen, Changsheng Xu: Exploring the Representativity of Art
Paintings.
IEEE Transactions on Multimedia
23: 2794-2805 (2021) [
Code
]
-
Minxuan Lin, Fan Tang, Weiming Dong, Xiao Li, Changsheng Xu, Chongyang Ma: Distribution Aligned Multimodal and Multi-Domain Image Stylization.
ACM Transactions on Multimedia Computing, Communications, and Applications
17(3): 96:1-96:17 (2021)
-
Xingjia Pan, Yingguo Gao, Zhiwen Lin, Fan Tang, Weiming Dong, Haolei Yuan, Feiyue Huang, Changsheng Xu: Unveiling the Potential of Structure-preserving for Weakly Supervised Object Localization.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
2021: 11642-11651 [
Code
]
-
Yingying Deng, Fan Tang, Weiming Dong, Haibin Huang, Chongyang Ma,
Changsheng Xu: Arbitrary Video Style Transfer via Multi-Channel Correlation.
The 35th AAAI Conference on Artificial Intelligence (AAAI)
2021: 1210-1217 [
Code
]
-
Dong Chen, Fan Tang, Weiming Dong, Hanxing Yao, Changsheng Xu: SiamCPN: Visual Tracking with the Siamese Center-Prediction Network.
Computational Visual Media
7(2): 253-265 (2021)
-
Xingjia Pan, Fan Tang, Weiming Dong, Chongyang Ma, Yiping Meng, Feiyue
Huang, Tong-Yee Lee, Changsheng Xu: Content-Based Visual Summarization
for Image Collections.
IEEE Transactions on Visualization and Computer Graphics
27(4): 2298-2312 (2021) [
Project Page
]
-
Kekai Sheng, Weiming Dong, Haibin Huang, Guohui Wang, Yong Zhang,
Chongyang Ma, Bao-Gang Hu: Learning to Assess Visual Aesthetics of Food
Images.
Computational Visual Media
7(1): 139-152 (2021) [
Data & Code
]
-
Yuting Ma, Fan Tang, Weiming Dong, Changsheng Xu: Destylization of Text with Decorative Elements.
ACM Multimedia Asia
2020: 14:1-14:7
-
Yingying Deng, Fan Tang, Weiming Dong, Wen Sun, Feiyue Huang, Changsheng Xu: Arbitrary Style Transfer via Multi-Adaptation Network.
ACM Multimedia
2020: 2719-2727 [
Paper
][
Code
]
-
Xingjia Pan, Fan Tang, Weiming Dong, Yang Gu, Zhichao Song, Yiping Meng, Pengfei Xu, Oilver Deussen, Changsheng Xu: Self-Supervised Feature Augmentation for Large Image Object Detection.
IEEE Transactions on Image Processing
29: 6745-6758 (2020)
-
Xingjia Pan, Yuqiang Ren, Kekai Sheng, Weiming Dong, Haolei Yuan, Xiaowei Guo, Chongyang Ma, Changsheng Xu: Dynamic Refinement Network for Oriented and Densely Packed Object Detection.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
2020: 11204-11213 (Oral) [
Paper
][
Data & Code
]
-
Minxuan Lin, Yingying Deng, Fan Tang, Weiming Dong, Changsheng Xu: Multi-Attribute Guided Painting Generation.
The 2nd IEEE Workshop on Artificial Intelligence for Art Creation (AIART)
2020: 400-403 [
Paper
]
-
Kekai Sheng, Weiming Dong, Menglei Chai, Guohui Wang, Peng Zhou, Feiyue Huang, Bao-Gang Hu, Rongrong Ji, Chongyang Ma: Revisiting Image Aesthetic Assessment via Self-Supervised Feature Learning.
The 34th AAAI Conference on Artificial Intelligence (AAAI)
2020: 5709-5716 (Spotlight) [
Paper
]
-
Fan
Tang, Weiming Dong, Yiping Meng, Chongyang Ma, Fuzhang Wu, Xinrui Li, Tong-Yee Lee: Image Retargetability.
IEEE Transactions on Multimedia
22(3): 641-654 (2020)
-
Huaiyu Li, Weiming Dong, Xing Mei, Chongyang Ma, Feiyue Huang, Bao-Gang Hu: LGM-Net: Learning to Generate Matching Networks for Few Shot Learning.
International Conference on Machine Learning (ICML)
2019: 3825-3834 [
Code
]
-
Yong Zhang, Baoyuan Wu, Weiming Dong, Zhifeng Li, Wei Liu, Bao-Gang Hu, Qiang Ji: Joint Representation and Estimator Learning for Facial Action Unit Intensity Estimation.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
2019: 3457-3466
-
Fuzhang Wu, Yan Kong, Weiming Dong, Yanjun Wu: Gradient-aware blind face inpainting for deep face verification.
Neurocomputing
331: 301-311 (2019)
-
Yucheng
Zhao, Fan Tang, Weiming Dong, Feiyue Huang, Xiaopeng Zhang: Joint face
alignment and segmentation via deep multi-task learning.
Multimedia Tools and Applications
78(10): 13131–13148 (2019)
-
Fan
Tang, Weiming Dong, Yiping Meng, Xing Mei, Feiyue Huang, Xiaopeng
Zhang, Oliver Deussen: Animated Construction of Chinese Brush Paintings.
IEEE Transactions on Visualization and Computer Graphics
24(12): 3019-3031 (2018) [
Project Page
]
-
Kekai Sheng, Weiming Dong, Haibin Huang, Chongyang Ma, Bao-Gang Hu: Gourmet photography dataset for aesthetic assessment of food images.
SIGGRAPH Asia
Technical Briefs 2018: 20:1-20:4 [
Data & Code
]
-
Yu Song, Fan Tang, Weiming Dong, Xiaopeng Zhang, Oliver Deussen, Tong-Yee Lee. Photo Squarization by Deep Multi-Operator Retargeting.
ACM Multimedia
2018: 1047-1055
-
Kekai Sheng, Weiming Dong, Chongyang Ma, Xing Mei, Feiyue Huang, Bao-Gang Hu. Attention-based Multi-Patch Aggregation for Image Aesthetic Assessment.
ACM Multimedia
2018: 879-886 [
Data & Code
]
-
Yong Zhang, Weiming Dong, Bao-Gang Hu, Qiang Ji: Weakly-supervised Deep Convolutional Neural Network Learning for Facial Action Unit Intensity Estimation.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
2018: 2314-2323
-
Yong Zhang, Weiming Dong, Bao-Gang Hu, Qiang Ji: Classifier Learning with Prior Probabilities for Facial Action Unit Recognition.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
2018: 5108-5116
-
Yong Zhang, Rui Zhao, Weiming Dong, Bao-Gang Hu, Qiang Ji: Bilateral Ordinal Relevance Multi-Instance Regression for Facial Action Unit Intensity Estimation.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
2018: 7034-7043
-
Huaiyu Li, Weiming Dong, Bao-Gang Hu: Facial Image Attributes Transformation via Conditional Recycle Generative Adversarial Networks.
Journal of Computer Science and Technology
33(3): 511-521 (2018)
-
Kekai Sheng, Weiming Dong, Wei Li, Joseph Razik, Feiyue Huang, Bao-Gang Hu: Centroid-aware local discriminative metric learning in speaker verification.
Pattern Recognition
72: 176-185 (2017)
-
Yingying Deng, Fan Tang, Weiming Dong, Hanxing Yao, Bao-Gang Hu: Style-oriented representative paintings selection.
SIGGRAPH ASIA
Posters 2017: 12:1-12:2
-
Xingjia Pan, Juntao Ye, Fan Tang, Weiming Dong, Feiyue Huang, Xiaopeng Zhang: Content-based measure of image set diversity.
SIGGRAPH ASIA
Posters 2017: 43:1-43:2
-
Jia Liu, Weiming Dong, Xiaopeng Zhang, Zhiguo Jiang: Orientation judgment for abstract paintings.
Multimedia Tools and Applications
76(1): 1017-1036 (2017)
-
Yong Zhang, Weiming Dong, Chongyang Ma, Xing Mei, Ke Li, Feiyue Huang, Bao-Gang Hu, Oliver Deussen: Data-Driven Synthesis of Cartoon Faces Using Different Styles.
IEEE Transactions on Image Processing
26(1): 464-478 (2017)
-
Weiming Dong, Fuzhang Wu, Yan Kong, Xing Mei, Tong-Yee Lee, Xiaopeng Zhang: Image Retargeting by Texture-Aware Synthesis.
IEEE Transactions on Visualization and Computer Graphics
22(2): 1088-1101 (2016)
-
Haiyong Jiang, Liangliang Nan, Dong-Ming Yan, Weiming Dong, Xiaopeng Zhang, Peter Wonka: Automatic Constraint Detection for 2D Layout Regularization.
IEEE Transactions on Visualization and Computer Graphics
22(8): 1933-1944 (2016)
-
Yan Kong, Weiming Dong, Xing Mei, Chongyang Ma, Tong-Yee Lee, Siwei Lyu, Feiyue Huang, Xiaopeng Zhang: Measuring and Predicting Visual Importance of Similar Objects.
IEEE Transactions on Visualization and Computer Graphics
22(12): 2564-2578 (2016)
-
Fuzhang Wu, Weiming Dong, Yan Kong, Xing Mei, Dong-Ming Yan, Xiaopeng Zhang, Jean-Claude Paul: Feature-aware natural texture synthesis.
The Visual Computer
32(1): 43-55 (2016)
-
Yiping Meng, Fan Tang, Weiming Dong, Xiaopeng Zhang: Optimal character composing for Chinese calligraphic artwork.
SIGGRAPH Asia
Posters 2016: 25
-
Kekai Sheng, Weiming Dong, Yan Kong, Xing Mei, Jilin Li, Chengjie Wang, Feiyue Huang, Bao-Gang Hu: Evaluating the Quality of Face Alignment without Ground Truth.
Computer Graphics Forum
34(7): 213-223 (2015)
-
Xing Mei, Weiming Dong, Bao-Gang Hu, Siwei Lyu: UniHIST: A unified framework for image restoration with marginal histogram constraints.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
2015: 3753-3761
-
Fuzhang Wu, Dong-Ming Yan, Weiming Dong, Xiaopeng Zhang, Peter Wonka: Inverse Procedural Modeling of Facade Layouts. A
CM Transactions on Graphics (Proceedings of SIGGRAPH)
33(4): 121:1-121:10 (2014)
-
Weiming Dong, Ning Zhou, Tong-Yee Lee, Fuzhang Wu, Yan Kong, Xiaopeng Zhang: Summarization-Based Image Resizing by Intelligent Object Carving.
IEEE Transactions on Visualization and Computer Graphics
20(1): 111-124 (2014)
-
Dengwen Zhou, Weiming Dong, Wengang Chen.
Joint demosaicking and zooming using moderate spectral correlation and consistent edge map
. Journal of Electronic Imaging 23(4): 034310 (2014)
-
Yong Zhang, Weiming Dong, Oliver Deussen, Feiyue Huang, Ke Li, Bao-Gang Hu: Data-driven face cartoon stylization.
SIGGRAPH ASIA
Technical Briefs 2014: 14:1-14:4
-
Fuzhang Wu, Weiming Dong, Yan Kong, Xing Mei, Jean-Claude Paul, Xiaopeng Zhang: Content-Based Colour Transfer.
Computer Graphics Forum
32(1): 190-203 (2013)
-
Yan Kong, Weiming Dong, Xing Mei, Xiaopeng Zhang, Jean-Claude Paul: SimLocator: robust locator of similar objects in images.
The Visual Computer
29(9): 861-870 (2013)
-
Xing Mei, Xun Sun, Weiming Dong, Haitao Wang, Xiaopeng Zhang: Segment-Tree Based Cost Aggregation for Stereo Matching.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
2013: 313-320
-
Dengwen Zhou, Xiaoliu Shen, Weiming Dong: Image zooming using directional cubic convolution interpolation.
IET Image Processing
(6): 627–634 (2012)
-
Weiming Dong, Guan-Bo Bao, Xiaopeng Zhang, Jean-Claude Paul: Fast Multi-Operator Image Resizing and Evaluation.
Journal of Computer Science and Technology
27(1): 121-134 (2012)
-
Dengwen Zhou, Xiaoliu Shen, Weiming Dong: Colour demosaicking with directional filtering and weighting.
IET Image Processing
6(8): 1084–1092 (2012)
-
Weiming Dong, Guanbo Bao, Xiaopeng Zhang, Jean-Claude Paul: Fast Local Color Transfer via Dominant Colors Mapping.
SIGGRAPH ASIA
Technical Sketches 2010: 46:1-46:2
-
Weiming Dong, Ning Zhou, Jean-Claude Paul, Xiaopeng Zhang: Optimized image resizing using seam carving and scaling.
ACM Transactions on Graphics
28(5): 125:1-125:10 (2009)
-
Weiming Dong, Ning Zhou, Jean-Claude Paul: Robust tile-based texture synthesis using artificial immune system.
Neural Computing and Applications
18(3): 223-235 (2009)
-
Weiming Dong, Ning Zhou, Jean-Claude Paul. Perspective-aware texture analysis and synthesis.
The Visual Computer
24(7-9): 515-523 (2008)
-
Ning Zhou, Jiaxin Wang, Weiming Dong, Jean-Claude Paul: Modeling and Visualization of Flower Color Patterns.
CAD/Graphics
2007: 150-155
-
Weiming Dong, Ning Zhou, Jean-Claude Paul: Optimized tile-based texture synthesis.
Graphics Interface
2007: 249-256
-
Weiming Dong: Rendering Optical Effects Based on Spectra Representation in Complex Scenes.
Computer Graphics International
2006: 719-726
发明专利
-
盛柯恺; 董未名; 马重阳; 梅星; 胡包钢 ; 基于注意力机制的通用图像美学评估方法、系统及设备, 2021-4-27, 中国, ZL201910086789.X
-
宋玉; 唐帆; 董未名; 徐常胜 ; 图片方形化缩放方法、系统及装置, 2020-11-10, 中国, ZL201811545250.8
-
邓盈盈; 唐帆; 董未名; 徐常胜 ; 自动挑选画家代表作的方法及装置, 2020-11-30, 中国, ZL201810759512.4
-
潘兴甲; 董未名; 袁豪磊; 盛柯恺; 林志文; 高英国; 任玉强; 郭晓威; 黄小明; 黄飞跃 ; 目标检测方法、装置、设备及存储介质, 2020-10-12, 中国, ZL202011085853.1
-
唐帆; 余宗桥; 黄飞跃; 李季檩; 李科; 吴永坚; 董未名; 孟一平 ; 图像处理方法和装置(水墨动态绘制过程重构), 2019-8-27, 中国, ZL201410505493.4
-
Wang, Chengjie; Li, Jilin; Huang, Feiyue; Sheng, Kekai; Dong, Weiming ; Evaluation method and evaluation device for facial key point positioning result, 2017-8-7, 美国, US 10,706,263 B2
-
樊艳波; 董未名; 胡包钢 ; 基于自适应阈值调整拒识子空间学习的人脸检测方法, 2016-04-13, 中 国, ZL201510811406.2
-
张宇欣,可视媒体视觉属性表示与可控生成,博士,2025 [
论文下载
,百度网盘提取码:
f4s4
]
-
黄妮莎,多模态信息引导的艺术图像与视频生成研究,硕士,2024 [
论文下载
,百度网盘提取码:hnk9]
-
刘伍琴,面向影视再创作的多模态引导可视媒体编辑,硕士,2024 [
论文下载
,百度网盘提取码:8ahj]
-
李岚祺,基于图神经网络的分子相互作用关系预测方法研究,硕士,2024 [
论文下载
,
百度网盘提取码:j6bn]
-
宋雪,知识与数据共同驱动的人像构图推荐算法研究与应用,硕士,2023 [
论文下载
,百度网盘提取码:4rvz]
-
许逸凡,标签稀缺条件下的视觉模型可迁移性研究,硕士,2022 [
论文下载
,百度网盘提取码:3xt7]
-
邓盈盈,风格导向的绘画作品生成与分析,博士,2022 [
论文下载
,百度网盘提取码:ff68]
-
林诗登,基于深度学习的暗光图像与视频增强,硕士,2022 [
论文下载
,百度网盘提取码:vmhq]
-
范宇擎,基于属性学习的图像质量评估算法研究,硕士,2022 [
论文下载
,百度网盘提取码:niq1]
-
宋玉,内容相关的图像呈现方法研究及应用,博士,2022 [
论文下载
,百度网盘提取码:tw7i]
-
林敏轩,基于对抗学习的多域艺术图像生成,硕士,2021 [
论文下载
,百度网盘提取码:t3jx]
-
潘兴甲,复杂环境下的图像目标检测与可视化,博士,2020 [
论文下载
,百度网盘提取码: djd7]
-
李怀宇,面向非平稳环境的知识迁移方法研究,博士,2020 [
论文下载
,百度网盘提取码:pwhz]
-
唐帆,中国水墨作品数字化创作重构研究,博士,2019 [
论文下载
,百度网盘提取码:p59i]
-
盛柯恺,图像美学质量评估的方法与应用,博士,2019 [
论文下载
,百度网盘提取码:ghnk]
-
张勇,知识与数据共同驱动的面部行为分析与人脸卡通画合成,博士,2018 [
论文下载
,百度网盘提取码:vi4k]
-
赵昱程,人脸图像分析与妆容图像合成,硕士,2018 [
论文下载
,百度网盘提取码:djmm]
-
孟一平,图像可缩放度的研究与应用,硕士,2017 [
论文下载
,百度网盘提取码:3338]
-
孔彦,图像内容的相似模式分析,博士,2016 [
论文下载
,百度网盘提取码:qhyf]
-
吴富章,内容相关的图像合成,博士,2015 [
论文下载
,百度网盘提取码:vdgk]
-
百度奖学金
(全球10人/年):张宇欣(2023)
-
CCF-凌迪图形学奖学金
(全国10人/年):张宇欣(2023)
-
中国科学院院长优秀奖
:张宇欣(2025)
-
国家奖学金(研究生)
:李思霏(2024)、杨振宇(2024)、周梓骏(2024)、张宇欣(2022)、邓盈盈(2021)、吴富章(2015)
-
国家奖学金(本科生)
:王诗文(2024)、谭米宁(2023
)、李思霏(2021)
-
北京市优秀毕业生
:骆敏言(2025,本科生)、黄妮莎(2024,研究生)
-
中国科学院大学校级优秀毕业论文(本科生)
:沈菲尔(2025)
-
本科生启研项目
:骆敏言(2024)
-
本科生大创项目
:尹子娇(2025)、骆敏言(2024)
-
腾讯技术大咖:
潘兴甲(2020)、盛柯恺(2019)、张勇(2018)、孔彦(2016)、吴富章(2015)
-
腾讯犀牛鸟精英人才计划
:李思霏(2024)、许逸凡(2022)
-
中国电子学会—腾讯博士生科研激励计划(混元大模型专项)
:杨振宇(2025)
科研项目
-
蚂蚁集团,可控视频生成技术,2024/12-2025/12
-
快手,可控高质量视频生成和编辑,2024/08-2025/07
-
快手,扩散模型的可解释性与可控性研究,2023/06-2024/07
-
新一代人工智能国家科技重大专项,认知计算基础理论与方法研究,2020/11-2023/10
-
北京市自然科学基金-丰台轨道交通前沿研究联合项目,恶劣天气下列车前向障碍物检测关键技术研究,2023/01-2025/12
-
国家自然科学基金重点项目,基于视觉认知的可视媒体合成与评价,2019/01-2023/12
-
NSFC企业创新发展
联合
基金重点项目,知识和数据共同驱动的小样本目标识别理论和方法,2021/01-2024/12
-
中文在线,真人照片转指定风格人像技术开发,2022/06-2022/10
-
腾讯优图实验室,“优图研究”联合项目第九期,2021/08-2022/07
-
腾讯优图实验室,“优图研究”联合项目第八期,2020/08-2021/07
-
腾讯优图实验室,“优图研究”联合项目第七期,2019/08-2020/07
-
远鉴科技,图像内容合成与质量评价,2019/08-2020/07
-
腾讯优图实验室,“优图研究”联合项目第六期,2018/08-2019/07
-
国家重点研发计划,社会安全事件智能监测与预警关键技术与装备,2018/07-2021/06
-
中科院自动化所-亮亮视野“第一视角计算”联合实验室,2018/07-2021/06
-
咪咕视频,视频精细化标签AI能力定制,2018/08-2019/07
-
腾讯优图实验室,“优图研究”联合项目第五期,2017/08-2018/07
-
中国科学院,卢嘉锡国际合作团队项目,2018/01-2020/12
-
国家自然科学基金面上项目,数据驱动的图像合成,2017/01-2020/12
-
北京市自然基金面上项目,单图像超分辨率技术与应用,2016/01-2018/12
-
腾讯,“优图研究”联合项目第四期,2016/08-2017/07
-
爱奇艺,视频智能编辑创作系统,2016/02-2016/12
-
“优图研究”联合项目第三期,2015/08-2016/07
-
核高基重大专项课题分任务,开源操作系统内核分析和安全性评估:基于人脸识别的关键应用程序保护,2015/04-2015/12
-
腾讯,“优图研究”联合项目第二期,2014/08-2015/07
-
国家自然科学基金面上项目,基于梯度场的计算成像和恢复技术,2014/01-2017/12
-
腾讯,“优图研究”联合项目第一期,2013/08-2014/07
-
北京市自然基金面上项目,内容相关的图像合成研究与应用,2011/01-2013/12
-
法国国家科研署国际合作项目,Shape Modeling: New theories and new algorithms,2010/01-2012/12
-
企业委托(上海市科技信息中心),个性化影视动漫制作关键技术研发,2009/12-2010/11
-
中国博士后科学基金特别资助,基于图像的植物建模与绘制,2008/11-2009/10
-
教育部留学归国人员启动基金,自然景物建模与渲染中的若干问题研究,2008/08-2010/07
-
中国博士后科学基金面上项目,真实植物场景数字化与可视化,2008/08-2009/07
-
科技部国际合作项目,自然植被景观的动态演变模拟与应用,2007/09-2010/10
媒体报道/采访
学生姓名 培养单位
(包括联合培养)
学位类别 入学/毕业时间 研究课题 毕业去向
吴富章 中国科学院自动化研究所 硕博 2010/2015 内容相关的图像合成 腾讯优图实验室(技术大咖)
李超 北京大学 硕士 2010/2012 图像物体材质分析与传递 德克萨斯大学达拉斯分校(读博)
沈思成 西北师范大学 硕士 2013/2015 基于深度学习的人脸识别 远鉴科技(产品部总监)
孔彦 中国科学院自动化研究所 硕博 2011/2016 图像内容的相似模式分析 远鉴科技(图像部总监)、腾讯技术大咖
李志磊 西北师范大学 硕士 2014/2016 人脸识别与活体验证 中国邮政集团公司
温祥 北京交通大学 硕士 2014/2016 基于深度学习的图像内容识别 网易
徐国智 华北电力大学 硕士 2014/2017 基于深度学习的人脸配准 网易
刘园园 华北电力大学 硕士 2014/2017 基于深度学习的花卉图像分类 国家电网
孟一平 中国科学院自动化研究所 硕士 2014/2017 图像可缩放度研究与应用 滴滴出行->快手
唐帆 中国科学院自动化研究所 硕博 2013/2019 中国水墨作品数字化创作重构 远鉴科技->吉林大学
张勇 中国科学院自动化研究所 硕博 2012/2018 人脸面部行为分析与卡通合成 腾讯AI Lab
赵昱程 中国科学院自动化研究所 硕士 2015/2018 人脸妆容图像分析与合成 阿里巴巴->字节跳动
盛柯恺 中国科学院自动化研究所 硕博 2014/2019 图像美学评估的方法与应用 腾讯优图实验室(技术大咖)
李怀宇 中国科学院自动化研究所 硕博 2014/2020 面向非平稳环境的知识迁移方法研究 快手
潘兴甲 中国科学院自动化研究所 硕博 2015/2020 通用目标检测 腾讯优图实验室(技术大咖)
胡广宇 华北电力大学 硕士 2015/2018 视频人脸虚拟美颜 远鉴科技
李欣芮 华北电力大学 硕士 2016/2019 时序数据分析与挖掘 国家电网研究院
周鹏 华北电力大学 硕士 2016/2019 目标检测与分割 远鉴科技
邓盈盈 中国科学院自动化研究所 硕博 2017/2022 艺术图像分析与合成 华为
宋玉 中国科学院自动化研究所 普博 2017/2021 可视媒体呈现 北京科技大学
孙秀秀 华北电力大学 硕士 2017/2020 商品图像识别 上海电网
林敏轩 中国科学院自动化研究所 硕士 2018/2021 图像风格化 快手
马雨廷 中国科学院自动化研究所 硕士 2018/2021 文字风格化与去风格化 中信银行
张旭龙 国科大人工智能学院 硕士 2018/2021 面向智能眼镜的目标检测 建设银行
陈东 国科大人工智能学院 硕士 2018/2021 目标跟踪 吉林大学(读博)
黄志勇 华北电力大学 硕士 2018/2021 领域自适应 字节跳动
许逸凡 中国科学院自动化研究所 硕士 2019/2022 视觉模型可迁移性 中科院自动化所(读博)
范宇擎 国科大人工智能学院 硕士 2019/2022 图像质量评价 中科院软件所
林诗登 国科大人工智能学院 硕士 2019/2022 图像与视频暗光增强 京东
宋雪 郑州大学 硕士 2020/2023 人像拍照姿态推荐 中科院软件所
张宇欣 中国科学院自动化研究所 硕博 2020/2025 可视媒体可控生成 字节跳动
刘伍琴 国科大人工智能学院 硕士 2021/2024 多模态可视媒体生成 快手
黄妮莎 国科大人工智能学院 硕士 2021/2024 多模态可视媒体生成 清华大学(读博)
李岚琪 郑州大学 硕士 2021/2024 AI+化学 深圳证券交易所
李思霏 中国科学院自动化研究所 硕博 2022/ 音乐生成
杨晗 国科大人工智能学院 硕士 2022/2025 多模态可视媒体呈现 美团
马赛赛 郑州大学 硕士 2022/2025 多模态可视媒体生成
任宥衡 郑州大学 硕士 2022/ 多模态可视媒体生成 字节跳动
杜俊萱 国科大人工智能学院 直博 2023/ 多模态可视媒体生成
谭米宁 中国科学院自动化研究所 硕士 2024/ 多模态可视媒体生成
骆敏言 中国科学院自动化研究所 直博 2025/ 多模态可视媒体生成
沈菲尔
国科大人工智能学院
直博 2025/ 舞蹈生成