NEW FEATURES:
Automatically downloads Tesseract v4.0 training files from GitHub.
Use -y for no prompting. See -ocrlang command usage for full details.
Added new -go (grid order) option to specify the order in which
areas in gridded pages are processed. Grid area processing also
defaults to right-to-left if -r is specified.
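
The effect of a grid order combined with a right-to-left default can be
pictured with a small sketch. This is not k2pdfopt source code: the enum,
the function name grid_visit_order(), and the cell encoding are all
hypothetical, and the values that -go actually accepts are described in the
command usage, not here. The sketch only assumes that a gridded page is
split into cols x rows areas, visited either row by row or column by column,
with the horizontal direction flipped when right-to-left is requested.

```c
/* Hypothetical ordering modes for illustration; not k2pdfopt identifiers. */
enum grid_order { GRID_ROWS_FIRST, GRID_COLS_FIRST };

/*
 * Fill cells[] with the order in which the areas of a cols x rows grid
 * are visited. Each entry encodes one area as row * cols + col, counted
 * from the top-left corner. cells[] must hold at least cols * rows ints.
 * Returns the number of entries written.
 */
static int grid_visit_order(int cols, int rows, enum grid_order order,
                            int right_to_left, int *cells)
{
    int n = 0;
    int outer_max = (order == GRID_ROWS_FIRST) ? rows : cols;
    int inner_max = (order == GRID_ROWS_FIRST) ? cols : rows;

    for (int i = 0; i < outer_max; i++) {
        for (int j = 0; j < inner_max; j++) {
            int row = (order == GRID_ROWS_FIRST) ? i : j;
            int col = (order == GRID_ROWS_FIRST) ? j : i;

            /* Right-to-left mirrors the column index only. */
            if (right_to_left)
                col = cols - 1 - col;

            cells[n++] = row * cols + col;
        }
    }
    return n;
}
```

Calling grid_visit_order(3, 2, GRID_ROWS_FIRST, 1, cells) in this sketch
visits the top row from its rightmost area to its leftmost, then the bottom
row the same way.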
ENHANCEMENTS:
Now prints Tesseract init info at the beginning of each file conversion,
in the GUI as well.
Redirects Tesseract library debug statements to a log file. Prints
them out if it cannot initialize correctly.
Re-edited a lot of the -mode usage descriptions.
CMakeLists.txt files updated.
Added Kobo Clara HD, Kobo Forma, and Kobo Libra H2O to device list
(using preliminary resolution numbers).
Windows version now compiled with gcc 9.3.1 on a core i9-9900.
MS WINDOWS GUI:
Tesseract language selection and download fully supported.
Added three more preset buttons for a total of seven.
Added "View" button to GUI to view PDF source file.
BUG FIXES:
Fixed issue where OCR layer placement from an already generated
OCR layer was not correct if a document scale factor not equal
to 1 was used (e.g. -ds 2).
Hopefully improved detection of out-of-family small rows.
Modified textrows_remove_small_rows() in textrows.c and also added
rhmin option to remove any row with text height less than a certain value.
https://www.mobileread.com/forums/showthread.php?p=3693210#post3693210
https://www.mobileread.com/forums/showthread.php?p=2781815#post2781815
Fixed some issues with -fr option. Added -ddr option (-ddr- to turn
off double-row detect) to prevent tall figures from potentially
getting broken up inadvertently. Modified k2proc.c to
"rotate the trow" structure near the end of the bmpregion_add() function.
https://www.mobileread.com/forums/showthread.php?p=3549163#post3549163
Fixed issue where the output margins weren't exactly right in the
case of -mode trim. See k2settings_adjust_devdims_for_om() function
in k2settings.c. (PM from Tex2002ans on MR.)
Added textrows_remove_defects() function to bmpregion_find_textrows()
in bmpregion.c. This helped fix an issue where rows were not
getting selected properly in a document due to small scanning
defects.
Fixed implementation of -bp <gap>. This had been broken for some
time. It was getting overridden by other features. See mods to
k2proc.c and k2master.c.