File Download

There are no files associated with this item.

Supplementary

Conference Paper: AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting

TitleAE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting
Authors
KeywordsText Spotting
Text Detection
Text Recognition
Text Detection Ambiguit
Issue Date2020
Citation
The 16th European Conference on Computer Vision (ECCV), Online, 23-28 August 2020 How to Cite?
AbstractScene text spotting aims to detect and recognize the entire word or sentence with multiple characters in natural images. It is still challenging because ambiguity often occurs when the spacing between characters is large or the characters are evenly spread in multiple rows and columns, making many visually plausible groupings of the characters (e.g. 'BERLIN' is incorrectly detected as 'BERL' and 'IN' in Fig. 1(c)). Unlike previous works that merely employed visual features for text detection, this work proposes a novel text spotter, named Ambiguity Eliminating Text Spotter (AE TextSpotter), which learns both visual and linguistic features to significantly reduce ambiguity in text detection. The proposed AE TextSpotter has three important benefits. 1) The linguistic representation is learned together with the visual representation in a framework. To our knowledge, it is the first time to improve text detection by using a language model. 2) A carefully designed language module is utilized to reduce the detection confidence of incorrect text lines, making them easily pruned in the detection stage. 3) Extensive experiments show that AE TextSpotter outperforms other state-of-the-art methods by a large margin. For example, we carefully select a set of extremely ambiguous samples from the IC19-ReCTS dataset, where our approach surpasses other methods by more than 4%
DescriptionECCV 2020 take place virtually due to COVID-19
Poster Presentation - Paper ID: 2183
Persistent Identifierhttp://hdl.handle.net/10722/284148

 

DC FieldValueLanguage
dc.contributor.authorWang, W-
dc.contributor.authorLiu, X-
dc.contributor.authorJi, X-
dc.contributor.authorXie, E-
dc.contributor.authorLiang, D-
dc.contributor.authorYang, Z-
dc.contributor.authorLu, T-
dc.contributor.authorShen, C-
dc.contributor.authorLuo, P-
dc.date.accessioned2020-07-20T05:56:28Z-
dc.date.available2020-07-20T05:56:28Z-
dc.date.issued2020-
dc.identifier.citationThe 16th European Conference on Computer Vision (ECCV), Online, 23-28 August 2020-
dc.identifier.urihttp://hdl.handle.net/10722/284148-
dc.descriptionECCV 2020 take place virtually due to COVID-19-
dc.descriptionPoster Presentation - Paper ID: 2183-
dc.description.abstractScene text spotting aims to detect and recognize the entire word or sentence with multiple characters in natural images. It is still challenging because ambiguity often occurs when the spacing between characters is large or the characters are evenly spread in multiple rows and columns, making many visually plausible groupings of the characters (e.g. 'BERLIN' is incorrectly detected as 'BERL' and 'IN' in Fig. 1(c)). Unlike previous works that merely employed visual features for text detection, this work proposes a novel text spotter, named Ambiguity Eliminating Text Spotter (AE TextSpotter), which learns both visual and linguistic features to significantly reduce ambiguity in text detection. The proposed AE TextSpotter has three important benefits. 1) The linguistic representation is learned together with the visual representation in a framework. To our knowledge, it is the first time to improve text detection by using a language model. 2) A carefully designed language module is utilized to reduce the detection confidence of incorrect text lines, making them easily pruned in the detection stage. 3) Extensive experiments show that AE TextSpotter outperforms other state-of-the-art methods by a large margin. For example, we carefully select a set of extremely ambiguous samples from the IC19-ReCTS dataset, where our approach surpasses other methods by more than 4%-
dc.languageeng-
dc.relation.ispartofEuropean Conference on Computer Vision (ECCV)-
dc.subjectText Spotting-
dc.subjectText Detection-
dc.subjectText Recognition-
dc.subjectText Detection Ambiguit-
dc.titleAE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting-
dc.typeConference_Paper-
dc.identifier.emailLuo, P: pluo@hku.hk-
dc.identifier.authorityLuo, P=rp02575-
dc.identifier.hkuros311006-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats