Freeware image rescaler to improve OCR Autor da sequência: Ilya Razmanov
| Ilya Razmanov Federação Russa Local time: 12:42 Inglês para Russo
Greetings,
frequently I receive source files to translate as low resolution scans (supposedly I'm not the only one), so OCR software starts to complain and tell me to encrease the resolution. Sure enough, rescaling images using common algorithms like nearest neighbour, bilinear and bicubic doesn't help much (except for that OCR software stops moaning).
Recently I've got a weird idea of... See more Greetings,
frequently I receive source files to translate as low resolution scans (supposedly I'm not the only one), so OCR software starts to complain and tell me to encrease the resolution. Sure enough, rescaling images using common algorithms like nearest neighbour, bilinear and bicubic doesn't help much (except for that OCR software stops moaning).
Recently I've got a weird idea of using pixelart algorithms like Scale2x (https://www.scale2x.it/) for scaling up the low-res scans. Surely Scale2x was developed for a completely different area, but why not?
So I wrote some small utilities for that, and preliminary test show that they improve OCR results.
Utilities forementioned are available here:
https://github.com/Dnyarri/PixelArtScaling
and include single file Scale2x and Scale3x PNG image rescalers with GUI, similar ones in commandline variant, and batch processing ones to batch process selected folder (recursively).
Programs open PNG image files, scale them up 2 times (Scale2x) or 3 times (Scale3x), and save as PNG files.
Everything is written on 100% Python so can be run on anything (Windows or Unixes) having Python available. For Windows users, there are compiled exe files it Release section:
https://github.com/Dnyarri/PixelArtScaling/releases/tag/2024.02.18
and sourcecodes available for everyone.
Main disadvantage is that everything is written on 100% Python so is quite slow when it comes to large images in batch mode.
Anyway, so far it seems to give some advantage to me and may be worth trying to you. Constructive feedback is appreciated. ▲ Collapse | | | Michael Beijer Reino Unido Local time: 09:42 Membro Holandês para Inglês + ... cool thanks! | Feb 19, 2024 |
I've bookmarked this as it is something that might actually come in handy on the future.
Michael | | | Ilya Razmanov Federação Russa Local time: 12:42 Inglês para Russo Autor do assunto Always in motion future is ;-) | Feb 20, 2024 |
Michael Beijer wrote:
I've bookmarked this as it is something that might actually come in handy on the future.
Michael
I'm not sure whether I will implement other rescaling algorithms of that weird "pixel art" class. They are specifically developed for scaling up old games graphics, and some of them, for example, use chromaticity info for ranking pixels "closeness". Since b/w text scans obviously have no chroma, trying to take chroma into account is likely to make things worse. Scale2x and Scale3x are not only simple, but seem to work well with images they are initially not intended to be used for. So for a while I'll stick with that, and only add some minor improvements (next release will contain more informative progress indication, but I still need to test it). Bugfixes are not expected, since I seem to have stepped onto every bug during development already | | | Daryo Reino Unido Local time: 09:42 Sérvio para Inglês + ... You should send them back | Feb 22, 2024 |
Ilya Razmanov wrote:
Greetings,
frequently I receive source files to translate as low resolution scans (supposedly I'm not the only one), so OCR software starts to complain and tell me to encrease the resolution. Sure enough, rescaling images using common algorithms like nearest neighbour, bilinear and bicubic doesn't help much (except for that OCR software stops moaning).
Recently I've got a weird idea of using pixelart algorithms like Scale2x ( https://www.scale2x.it/) for scaling up the low-res scans. Surely Scale2x was developed for a completely different area, but why not?
So I wrote some small utilities for that, and preliminary test show that they improve OCR results.
Utilities forementioned are available here:
https://github.com/Dnyarri/PixelArtScaling
and include single file Scale2x and Scale3x PNG image rescalers with GUI, similar ones in commandline variant, and batch processing ones to batch process selected folder (recursively).
Programs open PNG image files, scale them up 2 times (Scale2x) or 3 times (Scale3x), and save as PNG files.
Everything is written on 100% Python so can be run on anything (Windows or Unixes) having Python available. For Windows users, there are compiled exe files it Release section:
https://github.com/Dnyarri/PixelArtScaling/releases/tag/2024.02.18
and sourcecodes available for everyone.
Main disadvantage is that everything is written on 100% Python so is quite slow when it comes to large images in batch mode.
Anyway, so far it seems to give some advantage to me and may be worth trying to you. Constructive feedback is appreciated.
If you are given low resolution scans to work from, you should send them back and ask for either a .doc / .rtf file or a better scan. *The client is NOT always right*. If you never say anything, you make it "normal/acceptable" to send any rubbish scan to translators to work from. Also, many clients are blissfully unaware of the headaches they cause by sending crap scans, and will remain unaware unless they start getting feedback.
If you can read easily the text, there are good chances of making the text usable for OCR. If even you struggle to read the text, no point in wasting your time trying to make it digestible for OCR.
With even only mid-level software for image processing, you can try resizing combined with modifying contrast, playing with histogram curves etc. You need to experiment with varying parameters - there is no one universal solution. One surprisingly effective trick is to ADD (the right amount of) blur to the enlarged picture of the text and then increase contrast. May sound strange, but it does improve readability. Maybe even to the point that OCR starts liking it.
If you don't mind spending a lot of time on it, you could make OCR-friendly a lot of scanned texts - surely an interesting game for an imaging software user. But for me the first question, from the point of view of a translator, is why should you have to? | | | To report site rules violations or get help, contact a site moderator: You can also contact site staff by submitting a support request » Freeware image rescaler to improve OCR Protemos translation business management system | Create your account in minutes, and start working! 3-month trial for agencies, and free for freelancers!
The system lets you keep client/vendor database, with contacts and rates, manage projects and assign jobs to vendors, issue invoices, track payments, store and manage project files, generate business reports on turnover profit per client/manager etc.
More info » |
| Wordfast Pro | Translation Memory Software for Any Platform
Exclusive discount for ProZ.com users!
Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value
Buy now! » |
|
| | | | X Sign in to your ProZ.com account... | | | | | |