RGBD based gaze estimation via multi-task CNN

Lian, Dongze; Zhang, Ziheng; Luo, Weixin; Hu, Lina; Wu, Minye; Li, Zechao; Yu, Jingyi; Gao, Shenghua

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Scopus: eid_2-s2.0-85083883072

Supplementary

Citations:
- Scopus: 0
Appears in Collections:
- Computer Science: Conference papers

Conference Paper: RGBD based gaze estimation via multi-task CNN

Title	RGBD based gaze estimation via multi-task CNN
Authors	Lian, Dongze Zhang, Ziheng Luo, Weixin Hu, Lina Wu, Minye Li, Zechao Yu, Jingyi Gao, Shenghua
Issue Date	2019
Citation	33rd AAAI Conference on Artificial Intelligence, AAAI 2019, 31st Innovative Applications of Artificial Intelligence Conference, IAAI 2019 and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019, 2019, p. 2488-2495 How to Cite?
Abstract	This paper tackles RGBD based gaze estimation with Convolutional Neural Networks (CNNs). Specifically, we propose to decompose gaze point estimation into eyeball pose, head pose, and 3D eye position estimation. Compared with RGB image-based gaze tracking, having depth modality helps to facilitate head pose estimation and 3D eye position estimation. The captured depth image, however, usually contains noise and black holes which noticeably hamper gaze tracking. Thus we propose a CNN-based multi-task learning framework to simultaneously refine depth images and predict gaze points. We utilize a generator network for depth image generation with a Generative Neural Network (GAN), where the generator network is partially shared by both the gaze tracking network and GAN-based depth synthesizing. By optimizing the whole network simultaneously, depth image synthesis improves gaze point estimation and vice versa. Since the only existing RGBD dataset (EYEDIAP) is too small, we build a large-scale RGBD gaze tracking dataset for performance evaluation. As far as we know, it is the largest RGBD gaze dataset in terms of the number of participants. Comprehensive experiments demonstrate that our method outperforms existing methods by a large margin on both our dataset and the EYEDIAP dataset.
Persistent Identifier	http://hdl.handle.net/10722/345118

DC Field	Value	Language
dc.contributor.author	Lian, Dongze	-
dc.contributor.author	Zhang, Ziheng	-
dc.contributor.author	Luo, Weixin	-
dc.contributor.author	Hu, Lina	-
dc.contributor.author	Wu, Minye	-
dc.contributor.author	Li, Zechao	-
dc.contributor.author	Yu, Jingyi	-
dc.contributor.author	Gao, Shenghua	-
dc.date.accessioned	2024-08-15T09:25:22Z	-
dc.date.available	2024-08-15T09:25:22Z	-
dc.date.issued	2019	-
dc.identifier.citation	33rd AAAI Conference on Artificial Intelligence, AAAI 2019, 31st Innovative Applications of Artificial Intelligence Conference, IAAI 2019 and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019, 2019, p. 2488-2495	-
dc.identifier.uri	http://hdl.handle.net/10722/345118	-
dc.description.abstract	This paper tackles RGBD based gaze estimation with Convolutional Neural Networks (CNNs). Specifically, we propose to decompose gaze point estimation into eyeball pose, head pose, and 3D eye position estimation. Compared with RGB image-based gaze tracking, having depth modality helps to facilitate head pose estimation and 3D eye position estimation. The captured depth image, however, usually contains noise and black holes which noticeably hamper gaze tracking. Thus we propose a CNN-based multi-task learning framework to simultaneously refine depth images and predict gaze points. We utilize a generator network for depth image generation with a Generative Neural Network (GAN), where the generator network is partially shared by both the gaze tracking network and GAN-based depth synthesizing. By optimizing the whole network simultaneously, depth image synthesis improves gaze point estimation and vice versa. Since the only existing RGBD dataset (EYEDIAP) is too small, we build a large-scale RGBD gaze tracking dataset for performance evaluation. As far as we know, it is the largest RGBD gaze dataset in terms of the number of participants. Comprehensive experiments demonstrate that our method outperforms existing methods by a large margin on both our dataset and the EYEDIAP dataset.	-
dc.language	eng	-
dc.relation.ispartof	33rd AAAI Conference on Artificial Intelligence, AAAI 2019, 31st Innovative Applications of Artificial Intelligence Conference, IAAI 2019 and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019	-
dc.title	RGBD based gaze estimation via multi-task CNN	-
dc.type	Conference_Paper	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.scopus	eid_2-s2.0-85083883072	-
dc.identifier.spage	2488	-
dc.identifier.epage	2495	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Conference Paper: RGBD based gaze estimation via multi-task CNN

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats