File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/TIP.2017.2740160
- Scopus: eid_2-s2.0-85028456098
- PMID: 28816665
- Find via
Supplementary
- Citations:
- Appears in Collections:
Article: Body structure aware deep crowd counting
Title | Body structure aware deep crowd counting |
---|---|
Authors | |
Keywords | Convolutional neural networks Crowd counting Pedestrian semantic analysis Visual context structure |
Issue Date | 2018 |
Citation | IEEE Transactions on Image Processing, 2018, v. 27, n. 3, p. 1049-1059 How to Cite? |
Abstract | Crowd counting is a challenging task, mainly due to the severe occlusions among dense crowds. This paper aims to take a broader view to address crowd counting from the perspective of semantic modeling. In essence, crowd counting is a task of pedestrian semantic analysis involving three key factors: pedestrians, heads, and their context structure. The information of different body parts is an important cue to help us judge whether there exists a person at a certain position. Existing methods usually perform crowd counting from the perspective of directly modeling the visual properties of either the whole body or the heads only, without explicitly capturing the composite body-part semantic structure information that is crucial for crowd counting. In our approach, we first formulate the key factors of crowd counting as semantic scene models. Then, we convert the crowd counting problem into a multi-task learning problem, such that the semantic scene models are turned into different sub-tasks. Finally, the deep convolutional neural networks are used to learn the sub-tasks in a unified scheme. Our approach encodes the semantic nature of crowd counting and provides a novel solution in terms of pedestrian semantic analysis. In experiments, our approach outperforms the state-ofthe- art methods on four benchmark crowd counting data sets. The semantic structure information is demonstrated to be an effective cue in scene of crowd counting. |
Persistent Identifier | http://hdl.handle.net/10722/345090 |
ISSN | 2023 Impact Factor: 10.8 2023 SCImago Journal Rankings: 3.556 |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Huang, Siyu | - |
dc.contributor.author | Li, Xi | - |
dc.contributor.author | Zhang, Zhongfei | - |
dc.contributor.author | Wu, Fei | - |
dc.contributor.author | Gao, Shenghua | - |
dc.contributor.author | Ji, Rongrong | - |
dc.contributor.author | Han, Junwei | - |
dc.date.accessioned | 2024-08-15T09:25:10Z | - |
dc.date.available | 2024-08-15T09:25:10Z | - |
dc.date.issued | 2018 | - |
dc.identifier.citation | IEEE Transactions on Image Processing, 2018, v. 27, n. 3, p. 1049-1059 | - |
dc.identifier.issn | 1057-7149 | - |
dc.identifier.uri | http://hdl.handle.net/10722/345090 | - |
dc.description.abstract | Crowd counting is a challenging task, mainly due to the severe occlusions among dense crowds. This paper aims to take a broader view to address crowd counting from the perspective of semantic modeling. In essence, crowd counting is a task of pedestrian semantic analysis involving three key factors: pedestrians, heads, and their context structure. The information of different body parts is an important cue to help us judge whether there exists a person at a certain position. Existing methods usually perform crowd counting from the perspective of directly modeling the visual properties of either the whole body or the heads only, without explicitly capturing the composite body-part semantic structure information that is crucial for crowd counting. In our approach, we first formulate the key factors of crowd counting as semantic scene models. Then, we convert the crowd counting problem into a multi-task learning problem, such that the semantic scene models are turned into different sub-tasks. Finally, the deep convolutional neural networks are used to learn the sub-tasks in a unified scheme. Our approach encodes the semantic nature of crowd counting and provides a novel solution in terms of pedestrian semantic analysis. In experiments, our approach outperforms the state-ofthe- art methods on four benchmark crowd counting data sets. The semantic structure information is demonstrated to be an effective cue in scene of crowd counting. | - |
dc.language | eng | - |
dc.relation.ispartof | IEEE Transactions on Image Processing | - |
dc.subject | Convolutional neural networks | - |
dc.subject | Crowd counting | - |
dc.subject | Pedestrian semantic analysis | - |
dc.subject | Visual context structure | - |
dc.title | Body structure aware deep crowd counting | - |
dc.type | Article | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1109/TIP.2017.2740160 | - |
dc.identifier.pmid | 28816665 | - |
dc.identifier.scopus | eid_2-s2.0-85028456098 | - |
dc.identifier.volume | 27 | - |
dc.identifier.issue | 3 | - |
dc.identifier.spage | 1049 | - |
dc.identifier.epage | 1059 | - |