TY - GEN
T1 - Label-Free Contrastive Learning for Open-World Multimodal Social Event Detection
AU - Yang, Zhiwei
AU - Qin, Haimei
AU - Peng, Hao
AU - Yu, Xiaoyan
AU - Sun, Li
AU - Jiang, Lei
N1 - Publisher Copyright:
© 2026 Owner/Author.
PY - 2026/2/21
Y1 - 2026/2/21
N2 - Multimodal content on social media contains abundant cues about real-world events, and its automatic detection is critical for public safety and social governance. However, Multimodal Social Event Detection in the open world faces two major challenges: (1) They depend on supervised event labels or structured information; however, social media data in open-world settings often lack both, making it challenging for such methods to adapt to the dynamic nature of social media. (2) They rely on predefined label sets, i.e., the total number of events must generally be specified during the detection process. In contrast, in the open world, the total number of events is inherently difficult to estimate. To tackle these challenges, this paper proposes LFEvent, a label-free contrastive learning framework for Multimodal Social Event Detection. To address the first challenge, we design a label-free multimodal contrastive learning strategy that relies solely on positive samples. Specifically, we design a multimodal large language model-based semantic enhancement strategy. Leveraging carefully crafted prompts, it enriches raw image-text pairs across three dimensions - event theme, event type, and image description - to construct robust positive samples. Subsequently, a dedicated Siamese Network enables self-supervised cross-modal alignment and representation learning. To address the second challenge, we introduce unsupervised clustering into the MSED task for the first time. A novel structure entropy-guided hierarchical clustering method is proposed, which automatically determines the number of event clusters and enables the detection of unseen events in the training set. Experiments on multiple social media datasets demonstrate that LFEvent significantly outperforms existing methods, especially in detecting previously unseen events.
AB - Multimodal content on social media contains abundant cues about real-world events, and its automatic detection is critical for public safety and social governance. However, Multimodal Social Event Detection in the open world faces two major challenges: (1) They depend on supervised event labels or structured information; however, social media data in open-world settings often lack both, making it challenging for such methods to adapt to the dynamic nature of social media. (2) They rely on predefined label sets, i.e., the total number of events must generally be specified during the detection process. In contrast, in the open world, the total number of events is inherently difficult to estimate. To tackle these challenges, this paper proposes LFEvent, a label-free contrastive learning framework for Multimodal Social Event Detection. To address the first challenge, we design a label-free multimodal contrastive learning strategy that relies solely on positive samples. Specifically, we design a multimodal large language model-based semantic enhancement strategy. Leveraging carefully crafted prompts, it enriches raw image-text pairs across three dimensions - event theme, event type, and image description - to construct robust positive samples. Subsequently, a dedicated Siamese Network enables self-supervised cross-modal alignment and representation learning. To address the second challenge, we introduce unsupervised clustering into the MSED task for the first time. A novel structure entropy-guided hierarchical clustering method is proposed, which automatically determines the number of event clusters and enables the detection of unseen events in the training set. Experiments on multiple social media datasets demonstrate that LFEvent significantly outperforms existing methods, especially in detecting previously unseen events.
KW - contrastive learning
KW - multimodal social event detection
KW - structural entropy
UR - https://www.scopus.com/pages/publications/105033151413
U2 - 10.1145/3773966.3777919
DO - 10.1145/3773966.3777919
M3 - 会议稿件
AN - SCOPUS:105033151413
T3 - WSDM 2026 - Proceedings of the 19th ACM International Conference on Web Search and Data Mining
SP - 818
EP - 827
BT - WSDM 2026 - Proceedings of the 19th ACM International Conference on Web Search and Data Mining
PB - Association for Computing Machinery, Inc
T2 - 19th ACM International Conference on Web Search and Data Mining, WSDM 2026
Y2 - 22 February 2026 through 26 February 2026
ER -