Tesseract + gscan2pdf: 5-star OCR.

I based this on an article published by "Linux on the root", adapting it for my Arch Linux setup.

Installing the software is fairly simple:

yaourt -S tesseract gscan2pdf

Then, after launching gscan2pdf, you can either scan a page (300 dpi is a practical setting for good recognition) or load an image that has already been scanned.
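
gscan2pdf drives the OCR from its own interface, but the same engine can also be called directly on an image file. Here is a minimal sketch, assuming a scan saved as `page.png` (a hypothetical file name) and English language data installed for tesseract; adjust both to your setup:

```shell
# Hypothetical file names; replace them with your own scan.
input="page.png"       # image scanned at 300 dpi
output_base="page"     # tesseract appends .txt to this base name itself
lang="eng"             # the matching language data must be installed

# Only run the OCR if tesseract is available and the scan exists.
if command -v tesseract >/dev/null 2>&1 && [ -f "$input" ]; then
    tesseract "$input" "$output_base" -l "$lang"
    cat "$output_base.txt"
else
    echo "tesseract or $input not found; skipping OCR" >&2
fi
```

The recognized text ends up in `page.txt`, which makes it easy to compare the raw result with what gscan2pdf shows.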

And I have to admit the result is frankly... astonishing. In short, my previous post seems to have been a bit too "premature ejaculation" 😉

So what will be left for proprietary software, apart from the big, fat, rubbish games? 😉

Added January 10, 8:08 pm:

Here is the raw output from tesseract, uncorrected:

IT WAS a bright cold day in April, and the clocks were
striking thirteen. Winston Smith, his chin nuzzled into his
breast in an effort to escape the vile wind, slipped quickly
through the glass doors of Victory Mansions, though not
quickly enough to prevent a swirl of gritty dust from
entering along with him.
The hallway smelt of boiled cabbage and old rag mats.
At one end of it a coloured poster, too large for indoor
display, had been tacked to the wall. It depicted simply an
enormous face, more than a metre wide: the face of a man
of about forty—five, with a heavy black moustache and
ruggcdly handsome features. Winston made for the stairs.
It was no use trying the lift. Even at the best of times it was
seldom working, and at present the electric current was cut
ull`during daylight hours. It was part ofthe economy drive
un preparation for Hate Week. The Hat was seven flights
up, mul Winston, who was thirty-nine and had a varicose
ulcer nhovc his right ankle, went slowly, resting several
Ilmes on thc way. On each landing, opposite the lift shaft,
the poster with the enormous face gazed from the wall. It
WAH an ie ufthose pictures which are so contrived that the eyes
hollow you about when you move. BIG BROTHER
li W /\’l`( II IING YOU, the caption beneath it ran.
Iusule the Flat a fruity voice was reading out a list of
flguteu which had something to do with the production of
pig in in. ‘l`l1e voice came from an oblong metal plaque like
S tlulleil mirror which formed part of the surface of the
Eiglitelmnal wall. Winston turned a switch and the voice
QQHL nmm·w|ml, though the words were still distinguish-

