Jingjing Wang

Jingjing Wang 王晶晶

Associate Professor, Natural Language Processing Lab, Soochow University

Email: djingwang [at] suda [dot] edu [dot] cn

Address: Shizi Street 1#, Suzhou, China, 215006

I am an Associate Professor at School of Computer Science and Technology, Soochow University. I am also a Senior Technical Consultant (Part-time) at Microsoft (Asia), China. My research interests focus on Multimodal Computing (especially for Visual-Language Understanding and Generation), Affective Computing, Natural Language Processing, Large Language Models, and AI for Medical Diagnosis. I received my Ph.D. degree from Soochow University in 2019, fortunately advised by Prof. Guodong Zhou and Prof. Shoushan Li. During my career, I am also working with Prof. Min Zhang for advancing Natural Language Processing and Artificial Intelligence technology to benefit humanity.

I am actively seeking dedicated students with intellectual curiosity and a strong work ethic to join my research team. Prospective candidates are welcome to submit their academic CV and research interests via email for initial consultation. Regarding recommendation letters, please be advised that I can only provide substantive evaluations for candidates with whom I have maintained at least six months of meaningful academic collaboration. This duration allows me to objectively assess your research competencies, scholarly contributions, and professional development through sustained engagement.

What's New

[Jan. 21, 2025] Two Papers ("Omni-SILA" & "Sherlock") are accepted by WWW 2025 (CCF A).

[Dec. 28, 2024] Two students (Yu Tan & Jia Ning) in my team receive the National Scholarship for Graduate Students.

[Dec. 10, 2024] "SQLFixAgent: Towards Semantic-Accurate Text-to-SQL Parsing" is accepted by AAAI 2025 (CCF A).

[Dec. 01, 2024] "TACL: A Trusted Action-enhanced Curriculum Learning Approach" is accepted by Neurocomputing (JCR Q1).

[Jul. 16, 2024] Two Papers ("Emotion-enriched Text-to-Motion Generation" & "Hawkeye") are accepted by ACM MM 2024 (CCF A).

[Feb. 20, 2024] Three Papers ("ChatASU", "Weakly-supervised Phrase Grounding" & "TopicDiff") are accepted by COLING 2024 (CCF B).

Recent Research

The goal is to establish a unified framework for video anomaly detection, advancing precision in identifying and localizing abnormal events across dynamic scenes while enabling interpretable analysis of complex visual patterns.

Starting from real-world applications in surveillance and social media analysis, we introduce Hawkeye (Zhang et al., ACM MM'24) , the first scene-enhanced video-language model designed for anomaly detection. Hawkeye integrates multimodal context (visual-textual-temporal cues) to recognize subtle anomalies and pinpoint their temporal boundaries in untrimmed videos, This work lays a critical foundation for event typing and spatiotemporal localization in short video understanding.

Building on this, we investigate low-resource scenarios where annotated anomaly data is scarce. Our Continuous Attention Modeling method (Zhang et al.,JOS'23) enhances adaptability by capturing long-range dependencies in sparse anomaly signals. Further, we extend Hawkeye with self-supervised learning to uncover latent patterns across unlabeled videos, improving generalization to unseen anomaly types. To scale solutions, we construct a benchmark suite combining large-scale anomaly annotations and instruction-tuned datasets. This addresses the challenge of diverse event types (e.g., accidents, unusual behaviors) and supports downstream tasks like explainable reasoning (Liu et al., COLING'2024).

Papers: (Zhang et al., ACM MM'24) (Zhang et al.,JOS'23) (Liu et al., COLING'2024).

The goal is to establish an emotion-driven framework for multimodal large language model (LLM) video generation , achieving cross-modal semantic alignment and high-fidelity emotional action synthesis while enhancing controllability and realism in generated content.

Starting from the core demand of multimodal interaction, we propose LLM-Guided Emotion-Action Synthesis (Yu et al., ACM MM'2024), the first method to jointly model text emotion semantics and human motion sequences for emotionally rich video generation This work enables precise conveyance of complex emotions (e.g., "excited waving") in synthetic videos, providing critical support for emotion-aware short video editing.

To address cross-modal alignment challenges, we introduce two innovations: 1) A Trustworthy Reflection Mechanism (Liu et al., COLING'24) that dynamically calibrates generation intent with emotional labels, improving semantic coherence in action outputs; 2) Latent Diffusion Models (Luo et al., COLING'24) that jointly model emotion-themed distributions across text, audio, and motion modalities in latent space, overcoming efficiency bottlenecks of pixel-level generation. To scale practical applications, we construct the first benchmark dataset for emotion-driven action generation, containing 100K+ emotion-annotated human motion sequences and multimodal instruction pairs. Based on this, we propose AutoEmoDirector, a framework enabling users to edit video emotion intensity and motion styles via natural language instructions (e.g., "transform a joyful dance into a sorrowful stroll"), significantly lowering the barrier for video content creation.

Papers: (Yu et al., ACM MM'2024) (Luo et al., COLING'24) (Liu et al., COLING'24)

Publications

2025

Junxiao Ma, Jingjing Wang, Jiamin Luo, Peiying Yu, Guodong Zhou. Sherlock: Towards Multi-scene Video Abnormal Event Extraction and Localization via a Global-local Spatial-sensitive LLM. The Web Conference (WWW), 2025. (CCF A)
Jiamin Luo, Jingjing Wang, Junxiao Ma, Yujie Jin, Shoushan Li, Guodong Zhou. Omni-SILA: Towards Omni-scene Driven Visual Sentiment Identifying, Locating and Attributing in Videos. The Web Conference (WWW), 2025. (CCF A)
Jipeng Cen, Jiaxin Liu, Zhixu Li, Jingjing Wang. SQLFixAgent: Towards Semantic-Accurate Text-to-SQL Parsing via Consistency-Enhanced Multi-Agent Collaboration. The AAAI Conference on Artificial Intelligence (AAAI), 2025. (CCF A)
Luo J., Wang J., Zhou G. Multi-modal Reliability-aware Affective Computing. Ruan Jian Xue Bao/Journal of Software (JOS), 2025, 36(2):537-553. (CCF A in Chinese Journal)
Tan Yu, Jingjing Wang, Jiamin Luo, Jiawen Wang, Guodong Zhou. TACL: A Trusted Action-enhanced Curriculum Learning Approach to Multimodal Affective Computing. Neurocomputing, 2025, 620:129195. (SCI, JCR Q1)

2024

Jianing Zhao, Jingjing Wang, Yujie Jin, Jiamin Luo, Guodong Zhou. Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanced Video Large Language Model. Proceedings of ACM International Conference on Multimedia (MM), 2024, 592-601. (CCF A)
Tan Yu, Jingjing Wang, Jiawen Wang, Jiamin Luo, Guodong Zhou. Towards Emotion-enriched Text-to-Motion Generation via LLM-guided Limb-level Emotion Manipulating. Proceedings of ACM International Conference on Multimedia (MM), 2024, 612-621. (CCF A)
Han Zhang, Jingjing Wang, Jiamin Luo, Guodong Zhou. Continual Attention Modeling for Sucessive Sentiment Analysis in Low resource Scenarios. Ruan Jian Xue Bao/Journal of Software (JOS), 2024, 35(12):5470-5486. (CCF A in Chinese Journal)
Yiding Liu, Jingjing Wang, Jiamin Luo, Tao Zeng, Guodong Zhou. ChatASU: Evoking LLM's Reflexion to Truly Understand Aspect Sentiment in Dialogues. Proceedings of International Conference on Computational Linguistics (COLING), 2024, 3075-3085. (CCF B)
Jiamin Luo, Jianing Zhao, Jingjing Wang, Guodong Zhou. How to Understand 'Support'? An Implicit-enhanced Causal Inference Approach for Weakly-supervised Phrase Grounding. International Conference on Computational Linguistics (COLING), 2024. (CCF B)
Jiamin Luo, Jingjing Wang, Guodong Zhou. TopicDiff: A Topic-enriched Diffusion Approach for Multimodal Conversational Emotion Detection. International Conference on Computational Linguistics (COLING), 2024. (CCF B)
Jiamin Luo, Jingjing Wang, Guodong Zhou. Topic-Enriched Variational Transformer for Conversational Emotion Detection. CCF International Conference on Natural Language Processing and Chinese Computing (NLPCC), 2024, 3-15. (CCF C)

2023

Yiding Liu, Jingjing Wang, Jiamin Luo, Guodong Zhou. LLM-Grounded Conversation Aspect Sentiment Understanding via Muti-Agent Consistency Reflection. Journal of Software (JOS), 2021. (CCF A in Chinese Journal)
Jianing Zhao, Jingjing Wang, Jiamin Luo, Guodong Zhou. Implicit-enhanced Causal Modeling for Phrasal Visual Grounding. Ruan Jian Xue Bao/Journal of Software (JOS), 2021. (CCF A in Chinese Journal)
Jingjing Wang, Jiamin Luo, Guodong Zhou. Fine-Grained Question-Answer Matching via Sentence-Aware Contrastive Self-supervised Transfer. CCF International Conference on Natural Language Processing and Chinese Computing (NLPCC), 2023, 616-628. (CCF C)

2022

Xiaoya Gao, Jingjing Wang, Shoushan Li, Min Zhang, Guodong Zhou. Cognition-driven multimodal personality classification. Science China Information Sciences (SCIS), 2022, 65(10). (CCF A, SCI Q1)

2021

Xiaoya Gao, Jingjing Wang, Shoushan Li, Guodong Zhou. Cognition-Driven Real-Time Personality Detection via Language-Guided Contrastive Visual Attention. IEEE International Conference on Multimedia and Expo (ICME), 2021, 1-6. (CCF B)

2020

Xiao Chen, Changlong Sun, Jingjing Wang, Shoushan Li, Luo Si, Min Zhang, Guodong Zhou. Aspect Sentiment Classification with Document-level Sentiment Preference Modeling. Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL), 2020, 3667-3677. (CCF A)
Jingjing Wang, Jiancheng Wang, Changlong Sun, Shoushan Li, Xiaozhong Liu, Luo Si, Min Zhang, Guodong Zhou. Sentiment classification in customer service dialogue with topic-aware multi-task learning. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2020, 9177-9184. (CCF A)
Minghui An, Jingjing Wang, Shoushan Li, Guodong Zhou. Multimodal topic-enriched auxiliary learning for depression detection. Proceedings of the International Conference on Computational Linguistics (COLING), 2020, 1078-1089. (CCF B)

2019

Jingjing Wang, Changlong Sun, Shoushan Li, Jiancheng Wang, Luo Si, Min Zhang, Xiaozhong Liu, Guodong Zhou. Human-Like Decision Making: Document-level Aspect Sentiment Classification via Hierarchical Reinforcement Learning. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019, 5585-5594. (CCF B)
Jingjing Wang, Changlong Sun, Shoushan Li, Xiaozhong Liu, Min Zhang, Luo Si, Guodong Zhou. Aspect Sentiment Classification Towards Question-Answering with Reinforced Bidirectional Attention Network. Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL), 2019. (CCF A)
Hanqian Wu, Shangbin Zhang, Jingjing Wang, Mumu Liu, Shoushan Li. Multi-label Aspect Classification on Question-Answering Text with Contextualized Attention-Based Neural Network. Proceedings of China Conference on Chinese Language Processing (CCL), 2019, 479-491. (EI)

2018

Jingjing Wang, Jie Li, Shoushan Li, Yangyang Kang, Min Zhang, Luo Si, Guodong Zhou. Aspect Sentiment Classification with both Word-level and Clause-level Attention Networks. Proceedings of International Joint Conference on Artificial Intelligence (IJCAI), 2018, 4439-4445. (CCF A)
Jingjing Wang, Shoushan Li, Mingqi Jiang, Hanqian Wu, Guodong Zhou. Cross-media User Profiling with Joint Textual and Social User Embedding. Proceedings of International Conference on Computational Linguistics (COLING), 2018, 246-251. (CCF B)
Chenlin Shen, Changlong Sun, Jingjing Wang, Yangyang Kang, Shoushan Li, Xiaozhong Liu, Luo Si, Min Zhang, Guodong Zhou. Sentiment Classification towards Question-Answering with Hierarchical Matching Network. Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018, 3654-3663. (CCF B)
Huan Liu, Jingjing Wang, Shoushan Li, Guodong Zhou. Semi-supervised Sentiment Classification Based on Auxiliary Task Learning. In Proceeding of NLPCC-2018. (CCF C)
Hanqian Wu, Mumu Liu, Jingjing Wang, Jue Xie, Chenlin Shen. Question-Answering Aspect Classification with Hierarchical Attention Network. In Proceeding of CCL-2018, pp. 225-237. (EI)
Hanqian Wu, Mumu Liu, Jingjing Wang, Jue Xie, Shoushan Li. Question-Answering Aspect Classification with Multi-attention Representation. In Proceeding of CCIR-2018, pp.78-89. (EI)

2017

Jingjing Wang, Shoushan Li, Guodong Zhou. Joint Learning on Relevant User Attributes in Micro-blog. Proceedings of International Joint Conference on Artificial Intelligence (IJCAI), 2017, 4130-4136. (CCF A)
Dong Zhang, Shoushan Li, Jingjing Wang. Semi-supervised Question Classification with Jointly Learning Question and Answer Representations. Journal of Chinese Information Processing, vol. 31(1), 2017. (Chinese Core Journal)
Jing Chen, Shoushan Li, Jingjing Wang, Guodong Zhou. User age prediction by combining classification and regression. Sci Sin Inform, 2017, 47: 1095–1108, doi: 10.1360/N112016-00278. (Chinese Core Journal)

2015

Jingjing Wang, Yunxia Xue, Shoushan Li, Guodong Zhou. Leveraging Interactive Knowledge and Unlabeled Data in Gender Classification with Co-training. Proceedings of International Conference on Database Systems for Advanced Applications (DASFAA), 2015, 246-251. (CCF B)
Shoushan Li, Jingjing Wang, Guodong Zhou, Hanxiao Shi. Interactive Gender Inference with Integer Linear Programming. Proceedings of International Joint Conference on Artificial Intelligence (IJCAI), 2015, 2341-2347. (CCF A)
Shoushan Li, Lei Huang, Jingjing Wang, Guodong Zhou. Semi-Stacking for Semi-supervised Sentiment Classification. Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL), 2015, 27-31. (CCF A)
Lei Huang, Shoushan Li, Jingjing Wang. User-Type Classification in Micro-Blog Based on Information of Authenticated User. Journal of Frontiers of Computer Science and Technology, vol. 9(6), 2015. (Chinese Core Journal)
Jingjing Wang, Shoushan Li, Lei Huang. User Gender Classification in Chinese Microblog. Journal of Chinese Information Processing, vol. 28(6), 2014. (Chinese Core Journal)
Zhu zhu, Jingjing Wang, Shoushan Li, Guodong Zhou. Interactive Gender Inference in Social Media. Proceedings of International Conference on Database Systems for Advanced Applications (DASFAA), 2015, 252-258. (CCF B)

Awards & Honors

Outstanding Expert and Supervisor of Microsoft (2022)
Outstanding Graduate PhD Student of Soochow University (2019)
Suzhou Industrial Park Scholarship (2018)
National Scholarship for Ph.D. (2017)
Ph.D. Scholarship of Soochow University (2017)
National Scholarship for Master (2016)
Outstanding Graduate Student of Soochow University (2016)
Suzhou Industrial Park Scholarship (2015)
etc.

Academic Services

Technical Program Committee (Area Chair & PC)
ACL: Annual Meeting of the Association for Computational Linguistics, Area Chair
EMNLP: Conference on Empirical Methods in Natural Language Processing, Area Chair
AAAI: Association for the Advancement of Artificial Intelligence, PC
IJCAI: International Joint Conference on Artificial Intelligence, PC
etc.
Journal Reviewer
TASLP: IEEE/ACM Transactions on Audio, Speech, and Language Processing
TALLIP: ACM Transactions on Asian and Low-Resource Language Information Processing
SCIS: Science China Information Sciences
Science China
Acta Automatica Sinica
Journal of Chinese Information Processing
etc.
Academic Presentations and Exchanges
2016-2021: Academic reports and exchanges at top conferences including ACL, AAAI, IJCAI
2019: Academic report and exchange at Zhejiang Tailong Commercial Bank, Suzhou Industrial Park Headquarters
2019: Invited talk at Ecovacs, Suzhou
2022: Academic report and exchange at Alibaba Ant Financial
2023: Academic report and exchange at the establishment of NLPAI-SCHOOL, Microsoft Asia Engineering Institute, Suzhou
etc.

Research Grants

As Principle Investigator

Key Technology Research on Attribute-level Sentiment Analysis for Conversational Texts (No. 62006166: 240K RMB: 2021.01–2023.12)
Supported by the National Natural Science Foundation of China (NSFC Young Scientist Fund Project)
Research on Chinese Single-document Automatic Summarization Based on Discourse Structure Analysis (No. 61976146: 560K RMB: 2020.01–2023.12)
Supported by the National Natural Science Foundation of China (NSFC General Program)
Resource Construction and Key Technology Research on Sentiment Information Extraction from Question-answer Texts (No. 2019M661930: 80K RMB: 2020.01–2022.12)
Supported by China Postdoctoral Science Foundation (CPSF)

As Co-investigator

Scene-based Knowledge Graph for Language Understanding and Generation (Sub-project No. 2020AAA0108604: 6,650K RMB: 2020.11–2023.10)
Supported by the National Key Research and Development Program