TextBoxes++: A Single-Shot Oriented Scene Text Detector

Structured data

A study conducted in the UK from 2009 to 2010 by leading scientists explored neonatal resuscitation practices in various neonatal units, aiming to assess adherence to international guidelines and identify differences between tertiary and non-tertiary care providers...

Read on arXiv Cardiology Lorem ipsum dolor sit amet, consectetur adipiscing elit. Sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat.

One Sentence Abstract

"TextBoxes++ is presented as an end-to-end trainable, fast scene text detector that achieves high accuracy and efficiency, outperforming competing methods on four public datasets, and significantly improving state-of-the-art approaches for word spotting and end-to-end text recognition tasks."

Simplified Abstract

Imagine you're trying to read the text on a photo, but the words are twisted, tiny, or just hard to see. This is a tricky problem for computers too. Scientists have developed a new tool to help computers quickly and accurately detect text in photos, no matter how it's oriented, how small it is, or how different it looks from other text.

The new tool, called TextBoxes++, is a smart way for computers to find and identify text in photos. It's much better than previous methods and can find text more quickly. When put to the test, TextBoxes++ did a great job on various types of photos, finding the text faster and more accurately than other methods.

This new tool is important because it helps computers understand the text in photos better, which can be useful in many situations, like helping you find a specific word in a photo or understanding signs and labels in real life. The scientists made this tool open-source, so others can use it and build on it to make it even better.

Study Fields

Main fields:

Scene text detection
Scene text recognition

Subfields:

Arbitrary orientations
Small sizes
Significantly variant aspect ratios of text in natural images
End-to-end trainable fast scene text detector (TextBoxes++)
Text localization accuracy
Runtime
Public datasets (ICDAR 2015, COCO-Text)
Post-processing (non-maximum suppression)
Word spotting
End-to-end text recognition

Study Objectives

Develop an end-to-end trainable fast scene text detector named TextBoxes++
Detect arbitrary-oriented scene text with high accuracy and efficiency in a single network forward pass
No post-processing other than an efficient non-maximum suppression involved
Evaluate TextBoxes++ on four public datasets
Outperform competing methods in terms of text localization accuracy and runtime
Achieve high f-measure and runtime performance on ICDAR 2015 Incidental text images and COCO-Text images
Combine TextBoxes++ with a text recognizer for word spotting and end-to-end text recognition tasks on popular benchmarks

Conclusions

The authors present an end-to-end trainable fast scene text detector named TextBoxes++ that can detect arbitrary-oriented scene text with high accuracy and efficiency.
The proposed method achieves text localization accuracy and runtime performance superior to competing methods on four public datasets.
TextBoxes++ has an f-measure of 0.817 at 11.6fps for 1024x1024 ICDAR 2015 Incidental text images, and an f-measure of 0.5591 at 19.8fps for 768x768 COCO-Text images.
TextBoxes++ significantly outperforms state-of-the-art approaches for word spotting and end-to-end text recognition tasks on popular benchmarks when combined with a text recognizer.
The code for TextBoxes++ is available at: https://github.com/MhLiao/TextBoxes_plusplusLorem ipsum dolor sit amet, consectetur adipiscing elit. Sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat.

References

C. Yi, Y. Tian•IEEE Trans. Image Processing

C. Yi and Y. Tian, “Scene text recognition in mobile applications by character descriptor and structure configuration,” IEEE Trans. Image Processing, vol. 23, no. 7, pp. 2972–2982, 2014.

TextBoxes++: A Single-Shot Oriented Scene Text Detector

One Sentence Abstract

Simplified Abstract

Study Fields

Study Objectives

Conclusions

References

References

Unlock full article access by joining Solve