incubator-jena-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Paolo Castagna <>
Subject Re: [jira] [Commented] (JENA-117) A pure Java version of tdbloader2, a.k.a. tdbloader3
Date Fri, 02 Mar 2012 20:51:53 GMT
Sarven Capadisli (Commented) (JIRA) wrote:
> Given my preferences, do you reckon that tdbloader2 is more suitable?

Hi Sarven,
I use tdbloader2 myself and my advise is alone the lines of what others have said.

I use a 64 bit (Linux) machine with enough RAM (to fit the node table in it).
If I have RDF data not in N-Triple|N-Quad format, I convert (and validate) it into N-Triple|N-Quad
I find it easier to deal with a large and compressed .nt|.nq file, rather than a lot of small
files (this is what Hadoop also likes, uncompressed in that case).

With enough RAM you should not have problems loading 500 million triples into an empty DB.


View raw message