uima-user mailing list archives

From Philippe de Rochambeau <phi...@free.fr>
Subject Analysing archive PDFs
Date Thu, 19 Feb 2015 20:28:06 GMT
Hello,

Over the past few months, I have used Solr to index tens of thousands of PDFs for my company.
They contain newspaper articles published between 1887 and 1940.

Every day, my colleagues in the Archive Department spend hours searching the archives
with Solr, looking for articles that are potentially interesting from a social and
historical point of view.

Can UIMA or OpenNLP be used to automate their work and/or to analyze patterns in the data?

Many thanks.

Philippe