[Libre-soc-bugs] [Bug 602] low performance bare minimum functionality SIMD emulator required
bugzilla-daemon at libre-soc.org
bugzilla-daemon at libre-soc.org
Mon Jun 7 09:12:36 BST 2021
https://bugs.libre-soc.org/show_bug.cgi?id=602
--- Comment #14 from Jacob Lifshay <programmerjake at gmail.com> ---
(In reply to Luke Kenneth Casson Leighton from comment #11)
> yes, we are however using OCR being developed by richard before
> doing it by hand. this will save massive amounts of time.
I ended up looking through Wikipedia's list of OCR programs, and I noticed
Tessarect (and several others) supports outputting to hOCR format, an
HTML-based format, which seems like it would be waay easier to parse than
trying to manually roll-your-own text column/row/formatting detector based on
Octave and FFTs...
hOCR:
http://kba.cloud/hocr-spec/1.2/
--
You are receiving this mail because:
You are on the CC list for the bug.
More information about the libre-soc-bugs
mailing list