Body structure aware deep crowd counting

Huang, Siyu; Li, Xi; Zhang, Zhongfei; Wu, Fei; Gao, Shenghua; Ji, Rongrong; Han, Junwei

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1109/TIP.2017.2740160
Scopus: eid_2-s2.0-85028456098
PMID: 28816665
WOS: WOS:000418092400002

Citations:
- Scopus: 0
- Web of Science: 0
- PubMed Central: 0
Appears in Collections:
- Computer Science: Journal/Magazine Articles

Article: Body structure aware deep crowd counting

Title	Body structure aware deep crowd counting
Authors	Huang, Siyu; Li, Xi; Zhang, Zhongfei; Wu, Fei; Gao, Shenghua; Ji, Rongrong; Han, Junwei
Keywords	Convolutional neural networks; Crowd counting; Pedestrian semantic analysis; Visual context structure
Issue Date	2018
Citation	IEEE Transactions on Image Processing, 2018, v. 27, n. 3, p. 1049-1059. DOI: http://dx.doi.org/10.1109/TIP.2017.2740160
Abstract	Crowd counting is a challenging task, mainly due to the severe occlusions among dense crowds. This paper aims to take a broader view to address crowd counting from the perspective of semantic modeling. In essence, crowd counting is a task of pedestrian semantic analysis involving three key factors: pedestrians, heads, and their context structure. The information of different body parts is an important cue to help us judge whether there exists a person at a certain position. Existing methods usually perform crowd counting from the perspective of directly modeling the visual properties of either the whole body or the heads only, without explicitly capturing the composite body-part semantic structure information that is crucial for crowd counting. In our approach, we first formulate the key factors of crowd counting as semantic scene models. Then, we convert the crowd counting problem into a multi-task learning problem, such that the semantic scene models are turned into different sub-tasks. Finally, the deep convolutional neural networks are used to learn the sub-tasks in a unified scheme. Our approach encodes the semantic nature of crowd counting and provides a novel solution in terms of pedestrian semantic analysis. In experiments, our approach outperforms the state-of-the-art methods on four benchmark crowd counting data sets. The semantic structure information is demonstrated to be an effective cue in scene of crowd counting.
Persistent Identifier	http://hdl.handle.net/10722/345090
ISSN	1057-7149 (2023 Impact Factor: 10.8; 2023 SCImago Journal Rankings: 3.556)
ISI Accession Number ID	WOS:000418092400002

DC Field	Value	Language
dc.contributor.author	Huang, Siyu	-
dc.contributor.author	Li, Xi	-
dc.contributor.author	Zhang, Zhongfei	-
dc.contributor.author	Wu, Fei	-
dc.contributor.author	Gao, Shenghua	-
dc.contributor.author	Ji, Rongrong	-
dc.contributor.author	Han, Junwei	-
dc.date.accessioned	2024-08-15T09:25:10Z	-
dc.date.available	2024-08-15T09:25:10Z	-
dc.date.issued	2018	-
dc.identifier.citation	IEEE Transactions on Image Processing, 2018, v. 27, n. 3, p. 1049-1059	-
dc.identifier.issn	1057-7149	-
dc.identifier.uri	http://hdl.handle.net/10722/345090	-
dc.description.abstract	Crowd counting is a challenging task, mainly due to the severe occlusions among dense crowds. This paper aims to take a broader view to address crowd counting from the perspective of semantic modeling. In essence, crowd counting is a task of pedestrian semantic analysis involving three key factors: pedestrians, heads, and their context structure. The information of different body parts is an important cue to help us judge whether there exists a person at a certain position. Existing methods usually perform crowd counting from the perspective of directly modeling the visual properties of either the whole body or the heads only, without explicitly capturing the composite body-part semantic structure information that is crucial for crowd counting. In our approach, we first formulate the key factors of crowd counting as semantic scene models. Then, we convert the crowd counting problem into a multi-task learning problem, such that the semantic scene models are turned into different sub-tasks. Finally, the deep convolutional neural networks are used to learn the sub-tasks in a unified scheme. Our approach encodes the semantic nature of crowd counting and provides a novel solution in terms of pedestrian semantic analysis. In experiments, our approach outperforms the state-of-the-art methods on four benchmark crowd counting data sets. The semantic structure information is demonstrated to be an effective cue in scene of crowd counting.	-
dc.language	eng	-
dc.relation.ispartof	IEEE Transactions on Image Processing	-
dc.subject	Convolutional neural networks	-
dc.subject	Crowd counting	-
dc.subject	Pedestrian semantic analysis	-
dc.subject	Visual context structure	-
dc.title	Body structure aware deep crowd counting	-
dc.type	Article	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1109/TIP.2017.2740160	-
dc.identifier.pmid	28816665	-
dc.identifier.scopus	eid_2-s2.0-85028456098	-
dc.identifier.volume	27	-
dc.identifier.issue	3	-
dc.identifier.spage	1049	-
dc.identifier.epage	1059	-
dc.identifier.isi	WOS:000418092400002	-
