File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Video event recognition using kernel methods with multilevel temporal alignment

TitleVideo event recognition using kernel methods with multilevel temporal alignment
Authors
KeywordsConcept ontology
Earth mover's distance
Event recognition
News video
Temporally aligned pyramid matching
Video indexing
Issue Date2008
Citation
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008, v. 30, n. 11, p. 1985-1997 How to Cite?
AbstractIn this work, we systematically study the problem of event recognition in unconstrained news video sequences. We adopt the discriminative kernel-based method for which video clip similarity plays an important role. First, we represent a video clip as a bag of orderless descriptors extracted from all of the constituent frames and apply the Earth Mover's Distance (EMD) to integrate similarities among frames from two clips. Observing that a video clip is usually comprised of multiple subclips corresponding to event evolution over time, we further build a multi-level temporal pyramid. At each pyramid level, we integrate the information from different subclips with Integer-valueconstrained EMD to explicitly align the subclips. By fusing the information from the different pyramid levels, we develop Temporally Aligned Pyramid Matching (TAPM) for measuring video similarity. We conduct comprehensive experiments on the Trecvid 2005 corpus, which contains more than 6,800 clips. Our experiments demonstrate that 1) the TAPM multi-level method clearly outperforms single-level EMD, and 2) single-level EMD outperforms keyframe and multi-frame based detection methods by a large margin. In addition, we conduct in-depth investigation of various aspects of the proposed techniques, such as weight selection in single-level EMD, sensitivity to temporal clustering, the effect of temporal alignment, and possible approaches for speedup. © 2008 IEEE.
Persistent Identifierhttp://hdl.handle.net/10722/321355
ISSN
2021 Impact Factor: 24.314
2020 SCImago Journal Rankings: 3.811
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorXu, Dong-
dc.contributor.authorChang, Shih Fu-
dc.date.accessioned2022-11-03T02:18:21Z-
dc.date.available2022-11-03T02:18:21Z-
dc.date.issued2008-
dc.identifier.citationIEEE Transactions on Pattern Analysis and Machine Intelligence, 2008, v. 30, n. 11, p. 1985-1997-
dc.identifier.issn0162-8828-
dc.identifier.urihttp://hdl.handle.net/10722/321355-
dc.description.abstractIn this work, we systematically study the problem of event recognition in unconstrained news video sequences. We adopt the discriminative kernel-based method for which video clip similarity plays an important role. First, we represent a video clip as a bag of orderless descriptors extracted from all of the constituent frames and apply the Earth Mover's Distance (EMD) to integrate similarities among frames from two clips. Observing that a video clip is usually comprised of multiple subclips corresponding to event evolution over time, we further build a multi-level temporal pyramid. At each pyramid level, we integrate the information from different subclips with Integer-valueconstrained EMD to explicitly align the subclips. By fusing the information from the different pyramid levels, we develop Temporally Aligned Pyramid Matching (TAPM) for measuring video similarity. We conduct comprehensive experiments on the Trecvid 2005 corpus, which contains more than 6,800 clips. Our experiments demonstrate that 1) the TAPM multi-level method clearly outperforms single-level EMD, and 2) single-level EMD outperforms keyframe and multi-frame based detection methods by a large margin. In addition, we conduct in-depth investigation of various aspects of the proposed techniques, such as weight selection in single-level EMD, sensitivity to temporal clustering, the effect of temporal alignment, and possible approaches for speedup. © 2008 IEEE.-
dc.languageeng-
dc.relation.ispartofIEEE Transactions on Pattern Analysis and Machine Intelligence-
dc.subjectConcept ontology-
dc.subjectEarth mover's distance-
dc.subjectEvent recognition-
dc.subjectNews video-
dc.subjectTemporally aligned pyramid matching-
dc.subjectVideo indexing-
dc.titleVideo event recognition using kernel methods with multilevel temporal alignment-
dc.typeArticle-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1109/TPAMI.2008.129-
dc.identifier.pmid18787246-
dc.identifier.scopuseid_2-s2.0-54749131961-
dc.identifier.volume30-
dc.identifier.issue11-
dc.identifier.spage1985-
dc.identifier.epage1997-
dc.identifier.isiWOS:000259110000011-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats