lucene-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jack Krupansky <jack.krupan...@gmail.com>
Subject Re: Apache Solr and Tika used to index Panama Papers
Date Thu, 07 Apr 2016 15:25:45 GMT
Hmmm... I seem to have missed it, but remind me where the link is for
public access? I mean, if this is all open source, it should be available
to me, right?

-- Jack Krupansky

On Thu, Apr 7, 2016 at 6:52 AM, Erik Hatcher <erik.hatcher@gmail.com> wrote:

> Also of note, Blacklight was used for the Solr-based UI -
> http://projectblacklight.org
>
> And another link about the data analysis process -
> https://ijnet.org/en/blog/how-icij-pulled-large-scale-cross-border-investigative-collaboration
>
> "Layered on top was the shiny interface, built using Blacklight, another
> open source development."
>
>
>
> > On Apr 6, 2016, at 04:45, Uwe Schindler <uschindler@apache.org> wrote:
> >
> > Hi all,
> >
> > I just wanted to repost the following by Chris Mattman on the TIKA list:
> >
> > If you have been following the news you’ve seen the Panama papers and
> how the world’s rich and elite have been storing all their money offshore
> to hide it. Two of the ASF’s key technologies were used in uncovering that
> story and showing the world what was going on: Apache Tika and Apache Solr.
> >
> > Solr was used for making the Terabytes of Panama Papers available to
> journalists. The preprocessing of the documents for indexing was done with
> Tika (maybe through the contrib/extraction module).
> >
> > Here is the article by Forbes about that:
> >
> http://www.forbes.com/sites/thomasbrewster/2016/04/05/panama-papers-amazon-encryption-epic-leak
> >
> > Uwe
> >
> > -----
> > Uwe Schindler
> > uschindler@apache.org
> > ASF Member, Apache Lucene PMC / Committer
> > Bremen, Germany
> > http://lucene.apache.org/
> >
> >
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
> > For additional commands, e-mail: dev-help@lucene.apache.org
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message