lucene-solr-user mailing list archives

From Абрашин, Игорь Олегович <Igor.Abras...@novatek.ru>
Subject Problem with cyrillics letters through Tika OCR indexing
Date Fri, 10 Feb 2017 07:50:44 GMT
Hello, everyone. I've run into the problem described in the subject line: Cyrillic text comes out garbled when indexed through Tika OCR.
The original image is attached, and the recognized text is below:
3ApaBCTyI7ITe 9| )KVIBy xopomo

Has anyone faced a similar problem?
I should mention that tesseract itself recognizes the text more accurately when run with the -l rus option.
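
For reference, the standalone check can be sketched roughly as follows. This is only a minimal sketch, assuming the tesseract executable is on PATH and the Russian language data (rus) is installed; the helper names build_tesseract_command and ocr_image are hypothetical and are not part of Tika or Solr.

```python
import shutil
import subprocess


def build_tesseract_command(image_path, lang="rus"):
    """Return the argument list for a tesseract run that prints text to stdout."""
    # Passing "stdout" as the output base makes tesseract print the text
    # instead of writing it to a file.
    return ["tesseract", image_path, "stdout", "-l", lang]


def ocr_image(image_path, lang="rus"):
    """Run tesseract on image_path and return the recognized text.

    Assumes the tesseract CLI and the requested language data are installed.
    """
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    result = subprocess.run(
        build_tesseract_command(image_path, lang),
        capture_output=True,
        check=True,
    )
    return result.stdout.decode("utf-8")
```

Calling ocr_image("scan.png", lang="rus") and then ocr_image("scan.png", lang="eng") on the same file (the file name is a placeholder) should show whether the language setting alone accounts for the difference.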

Thanks in advance!


Best regards,
Игорь Абрашин
ООО <НОВАТЭК НТЦ>
Work tel.: +7 (3452) 680-386
Internal corporate tel.: 22-586

