Message view | |
---|---|
From | Philippe de Rochambeau <phi...@free.fr> |
Subject | Analysing archive PDFs |
Date | Thu, 19 Feb 2015 20:28:06 GMT |
Hello, Over the past few months, I have used Solr to index tens of thousands of PDFs for my company; they contain newspaper articles published between 1887 and 1940. Every day, my colleagues in the Archive Department spend hours searching these archives with Solr, looking for articles that are potentially interesting from a social and historical point of view. Can UIMA or OpenNLP be used to automate their work and/or to analyze patterns in the data? Many thanks. Philippe | |