TextBoxes++: A Single-Shot Oriented Scene Text Detector
One Sentence Abstract
TextBoxes++ is an end-to-end trainable, fast scene text detector that outperforms competing methods in both localization accuracy and runtime on four public datasets and, when combined with a text recognizer, significantly outperforms state-of-the-art approaches on word spotting and end-to-end text recognition.
Simplified Abstract
Imagine you're trying to read the text on a photo, but the words are twisted, tiny, or just hard to see. This is a tricky problem for computers too. Scientists have developed a new tool to help computers quickly and accurately detect text in photos, no matter how it's oriented, how small it is, or how different it looks from other text.
The new tool, called TextBoxes++, finds the text in a photo in a single pass through one neural network. When tested on four public collections of photos, it located text both faster and more accurately than competing methods.
This matters because it helps computers understand the text in photos, which is useful in many situations, such as finding a specific word in a photo or reading signs and labels in real life. The authors have released the code publicly, so others can use it and build on it.
Study Fields
Main fields:
- Scene text detection
- Scene text recognition
Subfields:
- Arbitrary orientations
- Small sizes
- Widely varying aspect ratios of text in natural images
- End-to-end trainable fast scene text detector (TextBoxes++)
- Text localization accuracy
- Runtime
- Public datasets (ICDAR 2015, COCO-Text)
- Post-processing (non-maximum suppression)
- Word spotting
- End-to-end text recognition
Study Objectives
- Develop an end-to-end trainable fast scene text detector named TextBoxes++
- Detect arbitrary-oriented scene text with high accuracy and efficiency in a single network forward pass
- Require no post-processing other than an efficient non-maximum suppression step
- Evaluate TextBoxes++ on four public datasets
- Outperform competing methods in terms of text localization accuracy and runtime
- Achieve a high f-measure at a high frame rate on ICDAR 2015 Incidental text images and COCO-Text images
- Combine TextBoxes++ with a text recognizer for word spotting and end-to-end text recognition tasks on popular benchmarks
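The only post-processing step named in the objectives is non-maximum suppression (NMS), which discards lower-scoring detections that overlap a higher-scoring one. The sketch below is a generic greedy NMS, not the authors' implementation: it assumes axis-aligned boxes in `(x_min, y_min, x_max, y_max)` form, whereas TextBoxes++ targets arbitrarily oriented text, and the function names and the 0.5 overlap threshold are illustrative choices.

```python
from typing import List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms(boxes: Sequence[Box], scores: Sequence[float],
        iou_threshold: float = 0.5) -> List[int]:
    """Greedy NMS: return indices of kept boxes, highest score first."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        # Keep a box only if it does not overlap any already-kept box too much.
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


# Two near-duplicate detections of one word plus a separate detection.
boxes = [(10, 10, 110, 40), (12, 12, 112, 42), (200, 50, 300, 80)]
scores = [0.9, 0.8, 0.7]
kept = nms(boxes, scores)
print(kept)
```

For oriented text, the same greedy loop would apply with `iou` replaced by a polygon-overlap computation, which costs more per pair of boxes.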
Conclusions
- The authors present an end-to-end trainable fast scene text detector named TextBoxes++ that can detect arbitrary-oriented scene text with high accuracy and efficiency.
- The proposed method achieves text localization accuracy and runtime performance superior to competing methods on four public datasets.
- TextBoxes++ achieves an f-measure of 0.817 at 11.6 fps on 1024×1024 ICDAR 2015 Incidental text images, and an f-measure of 0.5591 at 19.8 fps on 768×768 COCO-Text images.
- TextBoxes++ significantly outperforms state-of-the-art approaches for word spotting and end-to-end text recognition tasks on popular benchmarks when combined with a text recognizer.
- The code for TextBoxes++ is available at: https://github.com/MhLiao/TextBoxes_plusplus
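To make the reported numbers easier to interpret, the sketch below shows how an f-measure is derived from precision and recall (as their harmonic mean) and converts the frame rates quoted above into average time per image. The precision/recall pair is a made-up illustration, not a figure from the paper, and the helper names are hypothetical.

```python
def f_measure(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def ms_per_image(fps: float) -> float:
    """Convert frames per second into average milliseconds per image."""
    return 1000.0 / fps


# Hypothetical precision/recall pair, chosen only to illustrate the formula.
example_f = f_measure(0.8, 0.6)

# Frame rates quoted in the conclusions.
icdar_ms = ms_per_image(11.6)  # ICDAR 2015 Incidental, 1024x1024 input
coco_ms = ms_per_image(19.8)   # COCO-Text, 768x768 input

print(f"example f-measure: {example_f:.3f}")
print(f"ICDAR 2015: {icdar_ms:.1f} ms/image, COCO-Text: {coco_ms:.1f} ms/image")
```

Because the harmonic mean is pulled toward the smaller of its two inputs, a detector cannot reach a high f-measure by maximizing precision or recall alone.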





