
arXiv:2311.10899 [cs.CV]

Extraction and Summarization of Explicit Video Content using Multi-Modal Deep Learning

Published 2023-11-17, Version 1

As video-sharing platforms multiply across the internet, it has become difficult for human moderators to screen video data for explicit content, so an automated pipeline that scans videos for such content is needed. We propose a novel pipeline that uses multi-modal deep learning to first extract the explicit segments of input videos and then summarize their content using text to determine age appropriateness and an age rating. Finally, we evaluate the pipeline's effectiveness using standard metrics.

Comments: 8 pages, 3 figures

Categories: cs.CV, cs.CL, cs.LG

Keywords: multi-modal deep learning, explicit video content, extraction, explicit content, summarization
