Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

QEST: Quantized and Efficient Scene Text Detector Using Deep Learning

QEST: Quantized and Efficient Scene Text Detector Using Deep Learning Scene text detection is complicated and one of the most challenging tasks due to different environmental restrictions, such as illuminations, lighting conditions, tiny and curved texts, and many more. Most of the works on scene text detection have overlooked the primary goal of increasing model accuracy and efficiency, resulting in heavy-weight models that require more processing resources. A novel lightweight model has been developed in this article to improve the accuracy and efficiency of scene text detection. The proposed model relies on ResNet50 and MobileNetV2 as backbones with quantization used to make the resulting model lightweight. During quantization, the precision has been changed from float32 to float16 and int8 for making the model lightweight. In terms of inference time and Floating-Point Operations Per Second, the proposed method outperforms the state-of-the-art techniques by around 30–100 times. Here, well-known datasets, i.e., ICDAR2015 and ICDAR2019, have been utilized for training and testing to validate the performance of the proposed model. Finally, the findings and discussion indicate that the proposed model is more efficient than the existing schemes. http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP) Association for Computing Machinery

Loading next page...
 
/lp/association-for-computing-machinery/qest-quantized-and-efficient-scene-text-detector-using-deep-learning-MC00ag0dU0

References (70)

Publisher
Association for Computing Machinery
Copyright
Copyright © 2023 Association for Computing Machinery.
ISSN
2375-4699
eISSN
2375-4702
DOI
10.1145/3526217
Publisher site
See Article on Publisher Site

Abstract

Scene text detection is complicated and one of the most challenging tasks due to different environmental restrictions, such as illuminations, lighting conditions, tiny and curved texts, and many more. Most of the works on scene text detection have overlooked the primary goal of increasing model accuracy and efficiency, resulting in heavy-weight models that require more processing resources. A novel lightweight model has been developed in this article to improve the accuracy and efficiency of scene text detection. The proposed model relies on ResNet50 and MobileNetV2 as backbones with quantization used to make the resulting model lightweight. During quantization, the precision has been changed from float32 to float16 and int8 for making the model lightweight. In terms of inference time and Floating-Point Operations Per Second, the proposed method outperforms the state-of-the-art techniques by around 30–100 times. Here, well-known datasets, i.e., ICDAR2015 and ICDAR2019, have been utilized for training and testing to validate the performance of the proposed model. Finally, the findings and discussion indicate that the proposed model is more efficient than the existing schemes.

Journal

ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP)Association for Computing Machinery

Published: May 8, 2023

Keywords: Deep neural network

There are no references for this article.