TextContourNet: A flexible and effective framework for improving scene text detection architecture with a multi-task cascade

Dafang He, Xiao Yang, Daniel Kifer, C. Lee Giles

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We study the problem of extracting text instance contour information from images and use it to assist scene text detection. We propose a novel and effective framework for this and experimentally demonstrate that: (1) A CNN that can be effectively used to extract instance-level text contour from natural images. (2) The extracted contour information can be used for better scene text detection. We propose two ways for learning the contour task together with the scene text detection: (1) as an auxiliary task and (2) as multi-task cascade. Extensive experiments with different benchmark datasets demonstrate that both designs improve the performance of a state-of-the-art scene text detector and that a multi-task cascade design achieves the best performance.

Original languageEnglish (US)
Title of host publicationProceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages676-685
Number of pages10
ISBN (Electronic)9781728119755
DOIs
StatePublished - Mar 4 2019
Event19th IEEE Winter Conference on Applications of Computer Vision, WACV 2019 - Waikoloa Village, United States
Duration: Jan 7 2019Jan 11 2019

Publication series

NameProceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019

Conference

Conference19th IEEE Winter Conference on Applications of Computer Vision, WACV 2019
CountryUnited States
CityWaikoloa Village
Period1/7/191/11/19

Fingerprint

Detectors
Experiments

All Science Journal Classification (ASJC) codes

  • Computer Vision and Pattern Recognition
  • Computer Science Applications

Cite this

He, D., Yang, X., Kifer, D., & Giles, C. L. (2019). TextContourNet: A flexible and effective framework for improving scene text detection architecture with a multi-task cascade. In Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019 (pp. 676-685). [8659067] (Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/WACV.2019.00077
He, Dafang ; Yang, Xiao ; Kifer, Daniel ; Giles, C. Lee. / TextContourNet : A flexible and effective framework for improving scene text detection architecture with a multi-task cascade. Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019. Institute of Electrical and Electronics Engineers Inc., 2019. pp. 676-685 (Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019).
@inproceedings{760bf363cc7e4eb7bc34fa8e2be0f163,
title = "TextContourNet: A flexible and effective framework for improving scene text detection architecture with a multi-task cascade",
abstract = "We study the problem of extracting text instance contour information from images and use it to assist scene text detection. We propose a novel and effective framework for this and experimentally demonstrate that: (1) A CNN that can be effectively used to extract instance-level text contour from natural images. (2) The extracted contour information can be used for better scene text detection. We propose two ways for learning the contour task together with the scene text detection: (1) as an auxiliary task and (2) as multi-task cascade. Extensive experiments with different benchmark datasets demonstrate that both designs improve the performance of a state-of-the-art scene text detector and that a multi-task cascade design achieves the best performance.",
author = "Dafang He and Xiao Yang and Daniel Kifer and Giles, {C. Lee}",
year = "2019",
month = "3",
day = "4",
doi = "10.1109/WACV.2019.00077",
language = "English (US)",
series = "Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
pages = "676--685",
booktitle = "Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019",
address = "United States",

}

He, D, Yang, X, Kifer, D & Giles, CL 2019, TextContourNet: A flexible and effective framework for improving scene text detection architecture with a multi-task cascade. in Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019., 8659067, Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019, Institute of Electrical and Electronics Engineers Inc., pp. 676-685, 19th IEEE Winter Conference on Applications of Computer Vision, WACV 2019, Waikoloa Village, United States, 1/7/19. https://doi.org/10.1109/WACV.2019.00077

TextContourNet : A flexible and effective framework for improving scene text detection architecture with a multi-task cascade. / He, Dafang; Yang, Xiao; Kifer, Daniel; Giles, C. Lee.

Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019. Institute of Electrical and Electronics Engineers Inc., 2019. p. 676-685 8659067 (Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - TextContourNet

T2 - A flexible and effective framework for improving scene text detection architecture with a multi-task cascade

AU - He, Dafang

AU - Yang, Xiao

AU - Kifer, Daniel

AU - Giles, C. Lee

PY - 2019/3/4

Y1 - 2019/3/4

N2 - We study the problem of extracting text instance contour information from images and use it to assist scene text detection. We propose a novel and effective framework for this and experimentally demonstrate that: (1) A CNN that can be effectively used to extract instance-level text contour from natural images. (2) The extracted contour information can be used for better scene text detection. We propose two ways for learning the contour task together with the scene text detection: (1) as an auxiliary task and (2) as multi-task cascade. Extensive experiments with different benchmark datasets demonstrate that both designs improve the performance of a state-of-the-art scene text detector and that a multi-task cascade design achieves the best performance.

AB - We study the problem of extracting text instance contour information from images and use it to assist scene text detection. We propose a novel and effective framework for this and experimentally demonstrate that: (1) A CNN that can be effectively used to extract instance-level text contour from natural images. (2) The extracted contour information can be used for better scene text detection. We propose two ways for learning the contour task together with the scene text detection: (1) as an auxiliary task and (2) as multi-task cascade. Extensive experiments with different benchmark datasets demonstrate that both designs improve the performance of a state-of-the-art scene text detector and that a multi-task cascade design achieves the best performance.

UR - http://www.scopus.com/inward/record.url?scp=85063584864&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85063584864&partnerID=8YFLogxK

U2 - 10.1109/WACV.2019.00077

DO - 10.1109/WACV.2019.00077

M3 - Conference contribution

AN - SCOPUS:85063584864

T3 - Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019

SP - 676

EP - 685

BT - Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019

PB - Institute of Electrical and Electronics Engineers Inc.

ER -

He D, Yang X, Kifer D, Giles CL. TextContourNet: A flexible and effective framework for improving scene text detection architecture with a multi-task cascade. In Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019. Institute of Electrical and Electronics Engineers Inc. 2019. p. 676-685. 8659067. (Proceedings - 2019 IEEE Winter Conference on Applications of Computer Vision, WACV 2019). https://doi.org/10.1109/WACV.2019.00077