Research on Short Video Multi-label Classification Based on Deep Multimodal Association Learning

doi:10.11925/infotech.2096-3467.2023.0587

Li Yun,Lu Zhixiang,Liu Shuyi,WangSu,Lv Zimin,Jing Peiguang

(School of Big Data and Artificial Intelligence, Guangxi University of Finance and Economics, Guangxi 530003, China) (College of Artificial Intelligence and Software, NanNing University, Guangxi 530200, China) (School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China) (School of Electronic Information, Guangxi University for Nationalities, Guangxi 530006, China) (School of Computer,Electronic and Information, Guangxi University, Guangxi 530004, China)

Download:
Export: BibTeX | EndNote (RIS)

Abstract

[Objective] The research extensively utilizes the complementarity of modalities to enhance the correlation between different modalities and between modalities and labels, leading to highly accurate classification results. [Methods] The research introduces a novel algorithm for multi-label classification of short videos, which leverages multi-modal semantic enhancement and graph convolutional networks. The algorithm seamlessly integrates both multi-modal learning and label semantic learning within a unified network framework. [Results]This paper verifies the effectiveness of the proposed algorithm through a large number of experimental analyses, and the algorithm's classification accuracy reaches 87%, which is 6.82% higher than the optimal benchmark algorithm.[Limitations] The process of modality fusion to enhance information is hindered by the presence of redundant data, which in turn obscures the correlation between modalities. Furthermore, the domain of modality-based multi-label classification remains relatively unexplored with limited research available. [Conclusions] The algorithm effectively enhances the complementarity among modalities, strengthens the correlation between modalities and categories, and greatly improves the accuracy of classification.

Key words： multimodal fusion Semantic Enhancement Graph convolutional network Short video

Published: 19 April 2024

ZTFLH:

TP391.41

	Service

	E-mail this article
	Add to my bookshelf
	Add to citation manager
	E-mail Alert
	RSS
	Articles by authors

Cite this article:

Li Yun, Lu Zhixiang, Liu Shuyi, WangSu, Lv Zimin, Jing Peiguang. Research on Short Video Multi-label Classification Based on Deep Multimodal Association Learning . Data Analysis and Knowledge Discovery, 0, (): 1-.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.2096-3467.2023.0587 OR https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y0/V/I/1

[1]	He Yu, Zhang Xiaodong, Zheng Xin. Constructing Patent Knowledge Graph with SpERT-Aggcn Model[J]. 数据分析与知识发现, 2024, 8(1): 146-156.
[2]	Liu Yang, Ding Xingchen, Ma Lili, Wang Chunyang, Zhu Lifang. Usefulness Detection of Travel Reviews Based on Multi-dimensional Graph Convolutional Networks[J]. 数据分析与知识发现, 2023, 7(8): 95-104.
[3]	Zhao Meng, Wang Hao, Li Xiaomin. Recognition of Emotions and Analysis of Emotional Changes in Chinese Folk Songs[J]. 数据分析与知识发现, 2023, 7(7): 111-124.
[4]	Xu Guixian, Zhang Zixin, Yu Shaona, Dong Yushuang, Tian Yuan. Tibetan News Text Classification Based on Graph Convolutional Networks[J]. 数据分析与知识发现, 2023, 7(6): 73-85.
[5]	Xu Kang, Yu Shengnan, Chen Lei, Wang Chuandong. Linguistic Knowledge-Enhanced Self-Supervised Graph Convolutional Network for Event Relation Extraction[J]. 数据分析与知识发现, 2023, 7(5): 92-104.
[6]	Wang Hao, Gong Lijuan, Zhou Zeyu, Fan Tao, Wang Yongsheng. Detecting Mis/Dis-information from Social Media with Semantic Enhancement[J]. 数据分析与知识发现, 2023, 7(2): 48-60.
[7]	Zhang Zhengang, Yu Chuanming. Knowledge Graph Completion Model Based on Entity and Relation Fusion[J]. 数据分析与知识发现, 2023, 7(2): 15-25.
[8]	Lyu Xueqiang, Du Yifan, Zhang Le, Pan Huiping, Tian Chi. GKTR Retrieval Model for Engineering Consulting Reports with Graph Convolution Topological and Keyword Features[J]. 数据分析与知识发现, 2023, 7(12): 155-163.
[9]	Gao Haoxin, Sun Lijuan, Wu Jingchen, Gao Yutong, Wu Xu. Online Sensitive Text Classification Model Based on Heterogeneous Graph Convolutional Network[J]. 数据分析与知识发现, 2023, 7(11): 26-36.
[10]	Li Xuemei,Jiang Jianhong. Identifying Useful Reviews with Improved Graph Convolutional Neural Network[J]. 数据分析与知识发现, 2022, 6(11): 38-51.
[11]	Zhou Zeyu,Wang Hao,Zhao Zibo,Li Yueyan,Zhang Xiaoqin. Construction and Application of GCN Model for Text Classification with Associated Information[J]. 数据分析与知识发现, 2021, 5(9): 31-41.
[12]	Ren Qiutong, Wang Hao, Xiong Xin, Fan Tao. Extracting Drama Terms with GCN Long-distance Constrain[J]. 数据分析与知识发现, 2021, 5(12): 123-136.

Viewed

Full text

Abstract

Cited

Shared

Discussed