Mailing-List: contact dev-help@stanbol.apache.org; run by ezmlm
Reply-To: dev@stanbol.apache.org
Date: Tue, 17 Sep 2013 09:15:40 +0200
Subject: Re: Working with large RDF data
From: Rupert Westenthaler
To: "dev@stanbol.apache.org"

Hi

It should be possible to reason over the enhancement results and store all
triples (including the deduced ones) in Jena TDB. After that you can use
SPARQL on the Jena TDB as suggested by Reto. However, note that any change in
the ontology will not be reflected in the Jena TDB, as there is no truth
maintenance.

If the data fits into memory you can just store the plain RDF data and load
it into a reasoning session to get the results. After that you can store the
results in another RDF store (e.g. Jena TDB) for later queries.

IMO if you need reasoning support over the whole knowledge base you should
use a system that natively supports it. While the above workflows would allow
you to mimic such functionality, they will become impractical as the amount
of data grows.

best
Rupert

On Mon, Sep 16, 2013 at 3:29 PM, Reto Bachmann-Gmür wrote:

> Why in memory? TDB based clerezza store is quite efficient, so why not add
> the data to such a graph?
>
> reto
>
>
> On Sat, Sep 14, 2013 at 9:14 AM, wrote:
>
>> Thanks a lot Rupert
>> If the RDF data is smaller (can fit into memory) is there a way we can
>> import into Stanbol and do a joint search across the enhancements from
>> unstructured text as well as the imported RDF data.
>> If yes would this import be permanent or needs to be repeated each time.
>>
>>
>> Thanks and Rgds,
>> Arthi
>>
>>
>> -----Original Message-----
>> From: Rupert Westenthaler [mailto:rupert.westenthaler@gmail.com]
>> Sent: Saturday, September 14, 2013 12:40 PM
>> To: dev@stanbol.apache.org
>> Subject: Re: Working with large RDF data
>>
>> Hi Arthi
>>
>> AFAIK the reasoning and rule components of Apache Stanbol are intended to
>> be used in "Sessions". They are not intended to be used on a whole
>> knowledge base. A typical use case could be validating RDF data retrieved
>> from a remote Server (e.g. Linked Data) against some validation rules.
>> Rewriting RDF generated by the Enhancer (Refactor
>> Engine) ...
>>
>> Applying Rules and Reasoning on a whole knowledge base (RDF data that do
>> not fit in-memory) is not a typical use case.
>>
>> Based on your problem description you might want to have a look at
>>
>> * Apache Marmotta and the Kiwi Triple Store
>> (http://marmotta.incubator.apache.org/kiwi/introduction.html): This is a
>> Sesame Sail implementation that supports reasoning
>> * OWLIM (http://www.ontotext.com/owlim): Commercial product also
>> implementing Reasoning on top of the Sesame API.
>>
>> But I am not an expert in those topics so there might be additional
>> options I am not aware of.
>>
>> hope this helps
>> best
>> Rupert
>>
>>
>> On Fri, Sep 13, 2013 at 1:48 PM, wrote:
>> > Hi,
>> >
>> > I have large RDF data.
>> >
>> > The requirement is to be able to reason / run rules on this data /
>> > search this data along with any other unstructured data which I have
>> > enhanced using Stanbol.
>> >
>> > Any pointers on how I can achieve this?
>> >
>> > Thanking you and Rgds,
>> >
>> > Arthi
-- 
| Rupert Westenthaler             rupert.westenthaler@gmail.com
| Bodenlehenstraße 11             ++43-699-11108907
| A-5500 Bischofshofen
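
The "reason once, store everything, query later" workflow from the reply at the top of this message, and its lack of truth maintenance, can be illustrated with a minimal sketch. This is not Stanbol or Jena code: a plain Python set stands in for the Jena TDB store, a set lookup stands in for the SPARQL query, the reasoner only applies two RDFS subclass rules, and all names (materialize, instances_of, the ex: resources) are hypothetical.

```python
from typing import Iterable, Set, Tuple

Triple = Tuple[str, str, str]

RDF_TYPE = "rdf:type"
SUBCLASS_OF = "rdfs:subClassOf"


def materialize(data: Iterable[Triple], ontology: Iterable[Triple]) -> Set[Triple]:
    """Return data + ontology plus every triple deducible by two RDFS rules.

    Naive forward chaining, repeated until no new triple appears:
      (A subClassOf B), (B subClassOf C)  =>  (A subClassOf C)
      (x type A),       (A subClassOf B)  =>  (x type B)
    """
    triples: Set[Triple] = set(data) | set(ontology)
    changed = True
    while changed:
        changed = False
        subclass_pairs = [(s, o) for s, p, o in triples if p == SUBCLASS_OF]
        derived: Set[Triple] = set()
        for sub, sup in subclass_pairs:
            for s, p, o in triples:
                if p == SUBCLASS_OF and s == sup:
                    derived.add((sub, SUBCLASS_OF, o))
                elif p == RDF_TYPE and o == sub:
                    derived.add((s, RDF_TYPE, sup))
        if not derived <= triples:
            triples |= derived
            changed = True
    return triples


def instances_of(store: Set[Triple], cls: str) -> Set[str]:
    """Stand-in for a SPARQL query: SELECT ?x WHERE { ?x rdf:type <cls> }."""
    return {s for s, p, o in store if p == RDF_TYPE and o == cls}


# Hypothetical example data: one enhancement result and a tiny ontology.
enhancements = {("ex:Salzburg", RDF_TYPE, "ex:City")}
ontology = {
    ("ex:City", SUBCLASS_OF, "ex:Place"),
    ("ex:Place", SUBCLASS_OF, "ex:Thing"),
}

# Reason once and persist the result (the set stands in for the triple store).
store = materialize(enhancements, ontology)

# Later queries only see what was materialized at that point.
found_before = instances_of(store, "ex:Thing")

# The ontology changes afterwards; the store is NOT updated automatically.
ontology_v2 = ontology | {("ex:Thing", SUBCLASS_OF, "ex:Entity")}
stale = instances_of(store, "ex:Entity")
fresh = instances_of(materialize(enhancements, ontology_v2), "ex:Entity")
```

In this sketch the stored result answers queries against the ontology it was built from, but stays stale after ontology_v2 is introduced until the whole materialization is repeated. That repeated full pass is the cost that grows with the amount of data.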