arXiv Analytics

arXiv:2206.10861 [cs.CV]

UniCon+: ICTCAS-UCAS Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 2022

Yuanhang Zhang, Susan Liang, Shuang Yang, Shiguang Shan

Published 2022-06-22, Version 1

This report briefly describes our winning solution to the AVA Active Speaker Detection (ASD) task at the ActivityNet Challenge 2022. Our underlying model, UniCon+, builds on our previous work, the Unified Context Network (UniCon) and Extended UniCon, which are designed for robust scene-level ASD. We augment the architecture with a simple GRU-based module that allows information about recurring identities to flow across scenes through read and update operations. We report a best result of 94.47% mAP on the AVA-ActiveSpeaker test set, which continues to rank first on this year's challenge leaderboard and significantly advances the state of the art.
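As a rough illustration of the read and update idea mentioned in the abstract, the sketch below keeps one state vector per identity and refreshes it with a standard GRU step whenever that identity appears in a new scene. It is not the authors' implementation: the class name `IdentityMemory`, the feature dimension, the random weights, the omission of bias terms, and the zero vector returned for unseen identities are all assumptions made for the example.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(W, v):
    # Multiply a matrix (list of rows) by a vector.
    return [sum(w * x for w, x in zip(row, v)) for row in W]


class IdentityMemory:
    """Hypothetical per-identity memory with a GRU-style update (illustrative only)."""

    def __init__(self, dim, seed=0):
        rng = random.Random(seed)

        def mat():
            return [[rng.uniform(-0.1, 0.1) for _ in range(dim)] for _ in range(dim)]

        self.dim = dim
        # Input (W*) and recurrent (U*) weights for the update gate z,
        # the reset gate r, and the candidate state n. Biases are omitted.
        self.Wz, self.Uz = mat(), mat()
        self.Wr, self.Ur = mat(), mat()
        self.Wn, self.Un = mat(), mat()
        self.slots = {}  # identity -> state vector

    def read(self, identity):
        # Assumption: an identity not seen before reads as a zero vector.
        return list(self.slots.get(identity, [0.0] * self.dim))

    def update(self, identity, x):
        # One GRU step: blend the stored state h with the new scene feature x.
        h = self.read(identity)
        z = [sigmoid(a + b) for a, b in zip(matvec(self.Wz, x), matvec(self.Uz, h))]
        r = [sigmoid(a + b) for a, b in zip(matvec(self.Wr, x), matvec(self.Ur, h))]
        rh = [ri * hi for ri, hi in zip(r, h)]
        n = [math.tanh(a + b) for a, b in zip(matvec(self.Wn, x), matvec(self.Un, rh))]
        new_h = [(1.0 - zi) * ni + zi * hi for zi, ni, hi in zip(z, n, h)]
        self.slots[identity] = new_h
        return new_h


memory = IdentityMemory(dim=4)
memory.update("speaker_A", [0.5, -0.2, 0.1, 0.9])  # feature from a first scene
carried = memory.read("speaker_A")                 # state available in a later scene
```

In a real system the weights would be learned jointly with the rest of the model, and the per-scene feature would come from the detector's own representation of each candidate speaker; both are left abstract here.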

Comments: 5 pages, 3 figures; technical report for AVA Challenge (see https://research.google.com/ava/challenge.html) at the International Challenge on Activity Recognition (ActivityNet), CVPR 2022
Categories: cs.CV, cs.SD, eess.AS
Related articles:
arXiv:1710.08011 [cs.CV] (Published 2017-10-22)
ActivityNet Challenge 2017 Summary
arXiv:1806.04391 [cs.CV] (Published 2018-06-12)
Qiniu Submission to ActivityNet Challenge 2018
arXiv:1807.00686 [cs.CV] (Published 2018-06-29)
YH Technologies at ActivityNet Challenge 2018