Extracting Words from Scanned Books: A Step-by-Step Tutorial with Python and OpenCV
Eran Feit
Have you ever wanted to extract individual words from scanned book pages, but weren't sure how to do it? In this video tutorial, we'll show you how to use Python and OpenCV to perform word segmentation on scanned book pages, allowing you to easily extract words and use them in your own projects. We'll start by converting the image to grayscale and applying thresholding to enhance the contrast between the text and the page background. Then, we'll use OpenCV functions to detect the lines of text and sort them by their position on the page. From there, we'll zoom in on each line and repeat the process to detect individual words. Finally, we'll store the words in an array along with their corresponding coordinates, making it easy to display any word on the page by selecting the corresponding item in the array. Whether you're working on a digital library project or just want to extract text from scanned documents, this tutorial will give you the tools you need to succeed. So join us as we explore the exciting world of word segmentation with Python and OpenCV!
Code for this video: https://github.com/feitgemel/Open-CV/tree/main/Words-Segmentation
More relevant content in this playlist: https://youtube.com/playlist?list=PLdkryDe59y4bxVvpexwR6PMTHH6_vFXjA
A course for learning modern computer vision, with a deep dive into TensorFlow, Keras and PyTorch. You can find it here: http://bit.ly/3HeDy1V
I also recommend this book for a deep dive into deep learning: https://amzn.to/3TTZym7
~~~~~~~~~~~~~~~ CONNECT ~~~~~~~~~~~~~~~
Email : feitgemel@gmail.com
Twitter - https://twitter.com/eran_feit
GitHub - https://github.com/feitgemel
Instagram - https://www.instagram.com/eran_feit/
Subscribe - youtube.com/@eranfeit?sub_confirmation=1
Facebook - https://www.facebook.com/groups/3080601358933585
~~~~~~~~~~~~~~ SUPPORT ME ~~~~~~~~~~~~~~
Patreon - https://www.patreon.com/EranFeit
#OpenCV #EranFeit #opencv #python
Music by Vincent Rubinetti
Download the music on Bandcamp: https://vincerubinetti.bandcamp.com/album/the-music-of-3blue1brown
Stream the music on Spotify: https://open.spotify.com/album/1dVyjwS8FBqXhRunaG5W5u
https://www.youtube.com/watch?v=c61w6H8pdzs