Conference Paper: Detecting generic visual events with temporal cues

Title: Detecting generic visual events with temporal cues
Authors: Xie, Lexing; Xu, Dong; Ebadollahi, Shahram; Scheinberg, Katya; Chang, Shih-Fu; Smith, John R.
Issue Date: 2006
Citation: Conference Record - Asilomar Conference on Signals, Systems and Computers, 2006, p. 54-58
Abstract: We present novel algorithms for detecting generic visual events from video. Target event models will produce binary decisions on each shot about classes of events involving object actions and their interactions with the scene, such as airplane taking off, exiting car, riot. While event detection has been studied in scenarios with strong scene and imaging assumptions, the detection of generic visual events from an unconstrained domain such as broadcast news has not been explored. This work extends our recent work [3] on event detection by (1) using a novel bag-of-features representation along with the earth mover's distance to account for the temporal variations within a shot, and (2) learning the importance among input modalities with a double-convex combination along both different kernels and different support vectors, which is in turn solved via multiple kernel learning. Experiments show that the bag-of-features representation significantly outperforms the static baseline; multiple kernel learning yields promising performance improvement while providing intuitive explanations for the importance of the input kernels.
Persistent Identifier: http://hdl.handle.net/10722/321350
ISSN: 1058-6393
2023 SCImago Journal Rankings: 0.376

 

DC Field | Value | Language
dc.contributor.author | Xie, Lexing | -
dc.contributor.author | Xu, Dong | -
dc.contributor.author | Ebadollahi, Shahram | -
dc.contributor.author | Scheinberg, Katya | -
dc.contributor.author | Chang, Shih-Fu | -
dc.contributor.author | Smith, John R. | -
dc.date.accessioned | 2022-11-03T02:18:19Z | -
dc.date.available | 2022-11-03T02:18:19Z | -
dc.date.issued | 2006 | -
dc.identifier.citation | Conference Record - Asilomar Conference on Signals, Systems and Computers, 2006, p. 54-58 | -
dc.identifier.issn | 1058-6393 | -
dc.identifier.uri | http://hdl.handle.net/10722/321350 | -
dc.description.abstract | We present novel algorithms for detecting generic visual events from video. Target event models will produce binary decisions on each shot about classes of events involving object actions and their interactions with the scene, such as airplane taking off, exiting car, riot. While event detection has been studied in scenarios with strong scene and imaging assumptions, the detection of generic visual events from an unconstrained domain such as broadcast news has not been explored. This work extends our recent work [3] on event detection by (1) using a novel bag-of-features representation along with the earth mover's distance to account for the temporal variations within a shot, and (2) learning the importance among input modalities with a double-convex combination along both different kernels and different support vectors, which is in turn solved via multiple kernel learning. Experiments show that the bag-of-features representation significantly outperforms the static baseline; multiple kernel learning yields promising performance improvement while providing intuitive explanations for the importance of the input kernels. | -
dc.language | eng | -
dc.relation.ispartof | Conference Record - Asilomar Conference on Signals, Systems and Computers | -
dc.title | Detecting generic visual events with temporal cues | -
dc.type | Conference_Paper | -
dc.description.nature | link_to_subscribed_fulltext | -
dc.identifier.doi | 10.1109/ACSSC.2006.356582 | -
dc.identifier.scopus | eid_2-s2.0-47049109638 | -
dc.identifier.spage | 54 | -
dc.identifier.epage | 58 | -
