• Phan Thị Thanh Nga, Faculty of Information Technology, Dalat University, Viet Nam
• Nguyễn Thị Huyền Trang, Faculty of Information Technology, Dalat University, Viet Nam
• Nguyễn Văn Phúc, Devsoft Company, Viet Nam
• Thái Duy Quý, The Research Management and International Cooperation Department, Dalat University, Viet Nam
• Võ Phương Bình, Faculty of Information Technology, Dalat University, Viet Nam



Book cover, OCR (Optical Character Recognition), Text information extraction, Vietnamese text detection.


Automatic information extraction from images reduces cost, human intervention, and processing time. Converting printed book covers to machine-readable text for later automated processing would be useful for a wide range of users, such as librarians, bookshop keepers, and individual readers. In this paper, we present a novel method for extracting Vietnamese text from images of scanned book covers. The proposed system accepts a snapshot of a book cover, filters the input image to enhance its quality, locates the regions that contain text, and then uses an optical character recognizer (OCR) to extract the text. The last step filters the extracted text against a dictionary to produce the final result. Experiments with the proposed system on our dataset delivered encouraging results.
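
The dictionary-filtering step described above can be illustrated with a minimal sketch. The abstract does not say how extracted text is matched against the dictionary, so the approach here (Unicode normalization followed by closest-match lookup with Python's standard difflib module) is an assumption chosen for illustration, not the authors' method. The function name filter_with_dictionary, the cutoff value, and the sample words are all hypothetical.

```python
import difflib
import unicodedata


def filter_with_dictionary(tokens, dictionary, cutoff=0.6):
    """Replace each OCR token with its closest dictionary word, if any.

    Assumption: similarity is difflib's sequence ratio; the paper does not
    specify the matching rule. Tokens with no close match are kept unchanged.
    """
    # Normalize to NFC so that Vietnamese diacritics compare consistently
    # regardless of whether the OCR emitted composed or decomposed characters.
    vocab = {}
    for word in dictionary:
        norm = unicodedata.normalize("NFC", word)
        vocab[norm.lower()] = norm
    candidates = sorted(vocab)

    result = []
    for token in tokens:
        key = unicodedata.normalize("NFC", token).lower()
        if key in vocab:
            # Exact (case-insensitive) hit: use the dictionary spelling.
            result.append(vocab[key])
            continue
        # Otherwise take the single best match at or above the cutoff.
        matches = difflib.get_close_matches(key, candidates, n=1, cutoff=cutoff)
        result.append(vocab[matches[0]] if matches else token)
    return result


# Hypothetical OCR output with a dropped diacritic and a digit/letter confusion.
sample_dictionary = {"toán", "học", "cao", "cấp"}
sample_tokens = ["toan", "học", "ca0", "cấp", "xyz"]
corrected = filter_with_dictionary(sample_tokens, sample_dictionary)
```

A lower cutoff corrects more OCR errors but risks replacing valid out-of-dictionary words, such as author names, with unrelated entries, so in practice the threshold would need tuning on real book-cover data.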


Nga, P. T. T., Trang, N. T. H., Phúc, N. V., Quý, T. D., & Bình, V. P. (2017). VIETNAMESE TEXT EXTRACTION FROM BOOK COVERS. Dalat University Journal of Science, 7(2), 142-152.