manifoldcf-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Karl Wright <daddy...@gmail.com>
Subject Re: ManifoldCF transformation connector for Apache Stanbol
Date Tue, 26 Jan 2016 12:39:16 GMT
If you set up a pull request, and download a .diff, it should be easy to
confirm that this will "svn patch" onto a workarea.

Karl


On Tue, Jan 26, 2016 at 7:35 AM, Rafa Haro <rharo@apache.org> wrote:

> Hi Karl,
>
> I will proceed in the same way than the OpenNLP connector, so as I ask in
> the other email, should we do this using Pull request instead of manually
> importing the master branch of Dileepa's repo?
>
> Cheers,
> Rafa
>
> On Tue, Jan 26, 2016 at 12:07 PM Karl Wright <daddywri@gmail.com> wrote:
>
> > Hi Rafa,
> >
> > Any time you are ready, please import this into a branch.  I'll need to
> > look over licensing and build before committing to trunk.
> >
> > Thanks!
> > Karl
> >
> >
> > On Tue, Jan 26, 2016 at 3:20 AM, Dileepa Jayakody <djayakody@zaizi.com>
> > wrote:
> >
> > > Hi All,
> > >
> > > I have done the discussed modifications to the Stanbol connector. Now
> the
> > > users can either define dereference fields or define a LDPath program
> to
> > > extract entity properties from Stanbol entities and add them to the
> > > document as fields.
> > >
> > > The latest code is available here for your review:
> > >
> > >
> >
> https://github.com/zaizi/sensefy-connectors/tree/feature/SENSEFY-1453-modify-stanbol-connector/transformation/mcf-stanbol-connector
> > >
> > > I have also written a blog post on how to configure the connector:
> > >
> > >
> >
> http://dileepajayakody.blogspot.com/2016/01/enhancing-documents-in-apache.html
> > >
> > > Looking forward to your comments.
> > >
> > > Thanks,
> > > Dileepa
> > >
> > >
> > > On Mon, Dec 14, 2015 at 1:18 PM, Rafa Haro <rharo@apache.org> wrote:
> > >
> > > > Hi Karl,
> > > >
> > > > I will import this one, don't worry.
> > > >
> > > > Cheers,
> > > > Rafa
> > > > El El sáb, 12 dic 2015 a las 20:36, Dileepa Jayakody <
> > > djayakody@zaizi.com>
> > > > escribió:
> > > >
> > > > > Hi Karl,
> > > > >
> > > > > Yes, I will improve the code with Rafa's reviews and then we can
> > import
> > > > it
> > > > > to mcf code base.
> > > > >
> > > > > Thanks
> > > > > Dileepa
> > > > >
> > > > > On Sat, Dec 12, 2015 at 5:26 PM, Karl Wright <daddywri@gmail.com>
> > > wrote:
> > > > >
> > > > > > Ok, it seems premature for me to try to import this from Github
> > > today,
> > > > so
> > > > > > I'll wait until the dust settles a bit further first.
> > > > > >
> > > > > > Karl
> > > > > >
> > > > > >
> > > > > > On Fri, Dec 11, 2015 at 1:45 PM, Dileepa Jayakody <
> > > djayakody@zaizi.com
> > > > >
> > > > > > wrote:
> > > > > >
> > > > > > > Thanks a lot Rafa for pointing that out. big miss as  I didn't
> > test
> > > > the
> > > > > > > LDPath configuration part yet. More improvements to be done.
> > > > > > > I will do the required mprovements as pointed out.
> > > > > > >
> > > > > > > Regards,
> > > > > > > Dileepa
> > > > > > >
> > > > > > >
> > > > > > > On Fri, Dec 11, 2015 at 8:42 PM, Rafa Haro <rharo@apache.org>
> > > wrote:
> > > > > > >
> > > > > > > > Hi Dileepa,
> > > > > > > >
> > > > > > > > The problem is not in that part on the code, it is rather on
> > this
> > > > > part:
> > > > > > > >
> > > > > > > > if (entity != null) { Collection<String> properties = entity.
> > > > > > > > getProperties(); for (String property : properties) { String
> > > > > > > > targetFieldName = derefFields.get(property); Set<String>
> > > > propValues =
> > > > > > > > entityPropertyMap.get(targetFieldName); if (propValues ==
> > null) {
> > > > > > > > propValues = new HashSet<String>(); } Collection<String>
> > > > > > > entityPropValues =
> > > > > > > > entity.getPropertyValues(property);
> > > > > > propValues.addAll(entityPropValues);
> > > > > > > > entityPropertyMap.put(targetFieldName, propValues); } }
> > > > > > > > You are collecting from the EnhancementStructure response
> just
> > > only
> > > > > the
> > > > > > > > configured dereferenced fields and LDPath fields are ignored.
> > > Also,
> > > > > > there
> > > > > > > > is a potential bug in that code if there is no dereferencing
> > > field
> > > > > > > > configured for a certain entity property here:
> > > > > > > >
> > > > > > > > String targetFieldName = derefFields.get(property);
> > > > > > > >
> > > > > > > > targetFieldName would be Null then. Instead of trying to
> index
> > > > every
> > > > > > > > property, you should just collect the configured ones by the
> > user
> > > > (or
> > > > > > at
> > > > > > > > least, if the user wants all of them, provide a configuration
> > > > option
> > > > > > for
> > > > > > > > that).
> > > > > > > >
> > > > > > > > Anyway, going back to LDPath issue, please take into account
> > that
> > > > > when
> > > > > > > you
> > > > > > > > define a field you must use a custom Namespace and Prefix for
> > > later
> > > > > > being
> > > > > > > > able to retrieve that property from the entity. If you don't
> do
> > > > that,
> > > > > > > > Stanbol will provide a random namespace for that property.
> > Check
> > > > this
> > > > > > > > example from RedLink SDK:
> > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> https://github.com/redlink-gmbh/redlink-java-sdk/blob/master/src/test/java/io/redlink/sdk/AnalysisTest.java#L423-443
> > > > > > > >
> > > > > > > > Hope that helps
> > > > > > > >
> > > > > > > > On Fri, Dec 11, 2015 at 3:57 PM Karl Wright <
> > daddywri@gmail.com>
> > > > > > wrote:
> > > > > > > >
> > > > > > > > > The next step would be to pull this code into an svn
> branch.
> > > > This
> > > > > is
> > > > > > > > > something I can tackled after the 2.3 release candidate is
> > put
> > > > > > > together.
> > > > > > > > >
> > > > > > > > > Thanks,
> > > > > > > > > Karl
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > On Fri, Dec 11, 2015 at 9:07 AM, Dileepa Jayakody <
> > > > > > djayakody@zaizi.com
> > > > > > > >
> > > > > > > > > wrote:
> > > > > > > > >
> > > > > > > > > > Hi Rafa,
> > > > > > > > > >
> > > > > > > > > > Thanks for reviewing my code and for your feedback.
> Please
> > > see
> > > > my
> > > > > > > > > comments
> > > > > > > > > > inline below.
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > On Fri, Dec 11, 2015 at 6:51 PM, Rafa Haro <
> > rharo@apache.org
> > > >
> > > > > > wrote:
> > > > > > > > > >
> > > > > > > > > > > Hi Dileepa,
> > > > > > > > > > >
> > > > > > > > > > > This seems to be going in the right direction clearly
> now
> > > in
> > > > my
> > > > > > > > > opinion.
> > > > > > > > > > > Quick comments after a first review:
> > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > >    - Rejecting a document because it can't be enhanced
> is
> > > > kind
> > > > > of
> > > > > > > > > tough.
> > > > > > > > > > >    You are preventing a document to be finally indexed
> > > > because
> > > > > > the
> > > > > > > > > > > enhancement
> > > > > > > > > > >    didn't perform correctly, probably it is better just
> > to
> > > > let
> > > > > > them
> > > > > > > > > > > continue
> > > > > > > > > > >    the workflow within the system
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > Got your point. Will remove that part from the code
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > >    - As I can deduce for the code, you are correctly
> > > > extracting
> > > > > > the
> > > > > > > > > > >    configured dereferenced fields, but you are not
> > > processing
> > > > > at
> > > > > > > all
> > > > > > > > > the
> > > > > > > > > > >    LDPath results
> > > > > > > > > > >
> > > > > > > > > > > I'm passing the LDPath program as an enhancer parameter
> > to
> > > > > > Stanbol
> > > > > > > to
> > > > > > > > > > retrieve the enhancement result according to the LDPath
> > > program
> > > > > > > (which
> > > > > > > > is
> > > > > > > > > > given as a text string in the connector UI).
> > > > > > > > > > If the user has not defined a LDPath program and added
> > > > derefence
> > > > > > > fields
> > > > > > > > > in
> > > > > > > > > > the UI instead, then the enhancement request will be
> built
> > > > using
> > > > > > the
> > > > > > > > > > dereference fields as enhancer parameters.
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > If neither a LDPath or dereference fields are given in
> the
> > > > > > > > transformation
> > > > > > > > > > UI, then I just call the given enhancement chain without
> > any
> > > > > other
> > > > > > > > > enhancer
> > > > > > > > > > paramaters.
> > > > > > > > > >
> > > > > > > > > > Please refer below code segment where I do this and let
> me
> > > know
> > > > > if
> > > > > > it
> > > > > > > > > needs
> > > > > > > > > > more improvements.
> > > > > > > > > >
> > > > > > > > > >             // ldpath program is given priority if it's
> set
> > > > > > > > > >             if (ldPath != null)
> > > > > > > > > >             {
> > > > > > > > > >                 parameters =
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> EnhancerParameters.builder().setChain(chain).setContent(content).setLDpathProgram(ldPath).build();
> > > > > > > > > >             }
> > > > > > > > > >             else if (!derefFields.isEmpty())
> > > > > > > > > >             {
> > > > > > > > > >                 parameters =
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> EnhancerParameters.builder().setChain(chain).setContent(content).setDereferencingFields(
> > > > > > > > > >                         derefFields.keySet()).build();
> > > > > > > > > >             }
> > > > > > > > > >             else
> > > > > > > > > >             {
> > > > > > > > > >                 parameters =
> > > > > > > > > >
> > > > > > > >
> > > > > >
> > > >
> > EnhancerParameters.builder().setChain(chain).setContent(content).build();
> > > > > > > > > >             }
> > > > > > > > > >             eRes = enhancerClient.enhance(parameters);
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > Thanks,
> > > > > > > > > > Dileepa
> > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > > Cheers,
> > > > > > > > > > > Rafa
> > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > > On Fri, Dec 11, 2015 at 1:05 PM Dileepa Jayakody <
> > > > > > > > djayakody@zaizi.com>
> > > > > > > > > > > wrote:
> > > > > > > > > > >
> > > > > > > > > > > > Hi All,
> > > > > > > > > > > >
> > > > > > > > > > > > As per our discussion I have modified the Stanbol
> > > Connector
> > > > > so
> > > > > > > that
> > > > > > > > > it
> > > > > > > > > > > adds
> > > > > > > > > > > > all extracted entity URIs and entity attributes to
> the
> > > > > > repository
> > > > > > > > > > > document
> > > > > > > > > > > > as fields.
> > > > > > > > > > > >
> > > > > > > > > > > > On a separate branch I have committed this code to
> our
> > > > github
> > > > > > > > project
> > > > > > > > > > > > sensefy-connectors.
> > > > > > > > > > > > You can find the source code here:
> > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> https://github.com/zaizi/sensefy-connectors/tree/feature/SENSEFY-1453-modify-stanbol-connector/transformation/mcf-stanbol-connector
> > > > > > > > > > > > Let me know your feedback.
> > > > > > > > > > > >
> > > > > > > > > > > > I will write a blog post on how to add it in a
> > connection
> > > > and
> > > > > > get
> > > > > > > > > > > > ehancement results and share it with you.
> > > > > > > > > > > >
> > > > > > > > > > > > Thanks,
> > > > > > > > > > > > Dileepa
> > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > > On Mon, Dec 7, 2015 at 6:29 PM, Karl Wright <
> > > > > > daddywri@gmail.com>
> > > > > > > > > > wrote:
> > > > > > > > > > > >
> > > > > > > > > > > > > Hi Dileepa,
> > > > > > > > > > > > >
> > > > > > > > > > > > > You cannot create sub-documents in a transformation
> > > > > > connector.
> > > > > > > > And
> > > > > > > > > > > > adding
> > > > > > > > > > > > > that capability to the framework is not possible;
> we
> > > > would
> > > > > be
> > > > > > > > > missing
> > > > > > > > > > > key
> > > > > > > > > > > > > bookkeeping logic if that was allowed.
> > > > > > > > > > > > >
> > > > > > > > > > > > > Karl
> > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > > On Mon, Dec 7, 2015 at 6:59 AM, Dileepa Jayakody <
> > > > > > > > > > djayakody@zaizi.com>
> > > > > > > > > > > > > wrote:
> > > > > > > > > > > > >
> > > > > > > > > > > > > > Hi Karl,
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > Thanks a lot for the pointer.
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > Stanbol doesn't update an existing document, it
> > > > > generates a
> > > > > > > new
> > > > > > > > > > > > response
> > > > > > > > > > > > > > with requested enhancement details for the
> content
> > > > > > enhansment
> > > > > > > > > > > request.
> > > > > > > > > > > > > > For example for a request like : "Paris is a city
> > in
> > > > > > France"
> > > > > > > > > > > following
> > > > > > > > > > > > > RDF
> > > > > > > > > > > > > > response [1] is given by Stanbol.
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > In the Stanbol connector, enhancement artifacts
> > such
> > > as
> > > > > > > > > > > TextAnnotations
> > > > > > > > > > > > > > and EntityAnnotations are extracted from the RDF
> > > > > response,
> > > > > > to
> > > > > > > > > > > generate
> > > > > > > > > > > > > the
> > > > > > > > > > > > > > entity abstractions and add them to the mcf
> > > repository
> > > > > > > > document.
> > > > > > > > > > > > > Currently
> > > > > > > > > > > > > > in the Stanbol connector we have added these
> entity
> > > > > > > > abstractions
> > > > > > > > > as
> > > > > > > > > > > > JSON
> > > > > > > > > > > > > > strings to a multi-valued 'entities' field in the
> > > > > > repository
> > > > > > > > > > document
> > > > > > > > > > > > and
> > > > > > > > > > > > > > we parse that JSON in the SolrWrapper output
> > > connector
> > > > to
> > > > > > > index
> > > > > > > > > in
> > > > > > > > > > > > > separate
> > > > > > > > > > > > > > Solr cores (primary documents, linked entities
> and
> > > > entity
> > > > > > > types
> > > > > > > > > > with
> > > > > > > > > > > > > their
> > > > > > > > > > > > > > attributes).
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > Can we can have a primary repository document and
> > > > create
> > > > > > sub
> > > > > > > > > > > documents
> > > > > > > > > > > > > for
> > > > > > > > > > > > > > the extracted entities? Is it possible to
> generate
> > > sub
> > > > > > > > documents
> > > > > > > > > > for
> > > > > > > > > > > a
> > > > > > > > > > > > > > repo-document in a transformation connector?
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > Thanks.
> > > > > > > > > > > > > > Dileepa
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > [1] Sample Stanbol response
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > {
> > > > > > > > > > > > > >   "@context": {
> > > > > > > > > > > > > >     "dbp-ont": "http://dbpedia.org/ontology/",
> > > > > > > > > > > > > >     "dc": "http://purl.org/dc/terms/",
> > > > > > > > > > > > > >     "dc:created": {
> > > > > > > > > > > > > >       "@type": "xsd:dateTime"
> > > > > > > > > > > > > >     },
> > > > > > > > > > > > > >     "enhancer": "
> > > http://fise.iks-project.eu/ontology/
> > > > ",
> > > > > > > > > > > > > >     "enhancer:confidence": {
> > > > > > > > > > > > > >       "@type": "xsd:double"
> > > > > > > > > > > > > >     },
> > > > > > > > > > > > > >     "enhancer:end": {
> > > > > > > > > > > > > >       "@type": "xsd:int"
> > > > > > > > > > > > > >     },
> > > > > > > > > > > > > >     "enhancer:entity-reference": {
> > > > > > > > > > > > > >       "@type": "@id"
> > > > > > > > > > > > > >     },
> > > > > > > > > > > > > >     "enhancer:entity-type": {
> > > > > > > > > > > > > >       "@type": "@id"
> > > > > > > > > > > > > >     },
> > > > > > > > > > > > > >     "enhancer:extracted-from": {
> > > > > > > > > > > > > >       "@type": "@id"
> > > > > > > > > > > > > >     },
> > > > > > > > > > > > > >     "enhancer:start": {
> > > > > > > > > > > > > >       "@type": "xsd:int"
> > > > > > > > > > > > > >     },
> > > > > > > > > > > > > >     "entityhub": "
> > > > > > > > > > > > >
> > > http://stanbol.apache.org/ontology/entityhub/entityhub#
> > > > > > > > > > > > > > ",
> > > > > > > > > > > > > >     "foaf": "http://xmlns.com/foaf/0.1/",
> > > > > > > > > > > > > >     "foaf:depiction": {
> > > > > > > > > > > > > >       "@type": "@id"
> > > > > > > > > > > > > >     },
> > > > > > > > > > > > > >     "owl": "http://www.w3.org/2002/07/owl#",
> > > > > > > > > > > > > >     "rdfs": "
> http://www.w3.org/2000/01/rdf-schema#
> > ",
> > > > > > > > > > > > > >     "schema": "http://schema.org/",
> > > > > > > > > > > > > >     "xsd": "http://www.w3.org/2001/XMLSchema#"
> > > > > > > > > > > > > >   },
> > > > > > > > > > > > > >   "@graph": [
> > > > > > > > > > > > > >     {
> > > > > > > > > > > > > >       "@id": "http://dbpedia.org/resource/France
> ",
> > > > > > > > > > > > > >       "@type": [
> > > > > > > > > > > > > >         "dbp-ont:Country",
> > > > > > > > > > > > > >         "dbp-ont:Place",
> > > > > > > > > > > > > >         "dbp-ont:PopulatedPlace",
> > > > > > > > > > > > > >         "http://www.opengis.net/gml/_Feature",
> > > > > > > > > > > > > >         "owl:Thing",
> > > > > > > > > > > > > >         "schema:Country",
> > > > > > > > > > > > > >         "schema:Place"
> > > > > > > > > > > > > >       ],
> > > > > > > > > > > > > >       "foaf:depiction": [
> > > > > > > > > > > > > >         "
> > > > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > >
> > > > >
> > http://upload.wikimedia.org/wikipedia/commons/c/c3/Flag_of_France.svg
> > > > > > > > > > > > ",
> > > > > > > > > > > > > >         "
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> http://upload.wikimedia.org/wikipedia/commons/thumb/c/c3/Flag_of_France.svg/200px-Flag_of_France.svg.png
> > > > > > > > > > > > > > "
> > > > > > > > > > > > > >       ],
> > > > > > > > > > > > > >       "rdfs:comment": {
> > > > > > > > > > > > > >         "@language": "en",
> > > > > > > > > > > > > >         "@value": "France, officially the French
> > > > > Republic,
> > > > > > > is a
> > > > > > > > > > > > > > unitary semi-presidential republic in Western
> > Europe
> > > > with
> > > > > > > > several
> > > > > > > > > > > > > > overseas territories and islands located on other
> > > > > > continents
> > > > > > > > and
> > > > > > > > > in
> > > > > > > > > > > > > > the Indian, Pacific, and Atlantic oceans.
> > > Metropolitan
> > > > > > France
> > > > > > > > > > extends
> > > > > > > > > > > > > > from the Mediterranean Sea to the English Channel
> > and
> > > > the
> > > > > > > North
> > > > > > > > > > Sea,
> > > > > > > > > > > > > > and from the Rhine to the Atlantic Ocean. It is
> > often
> > > > > > > referred
> > > > > > > > to
> > > > > > > > > > as
> > > > > > > > > > > > > > l’Hexagone because of the geometric shape of its
> > > > > > territory."
> > > > > > > > > > > > > >       },
> > > > > > > > > > > > > >       "rdfs:label": [
> > > > > > > > > > > > > >         {
> > > > > > > > > > > > > >           "@language": "en",
> > > > > > > > > > > > > >           "@value": "France"
> > > > > > > > > > > > > >         },
> > > > > > > > > > > > > >         {
> > > > > > > > > > > > > >           "@language": "fr",
> > > > > > > > > > > > > >           "@value": "France"
> > > > > > > > > > > > > >         },
> > > > > > > > > > > > > >       ]
> > > > > > > > > > > > > >     },
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >     {
> > > > > > > > > > > > > >       "@id": "http://dbpedia.org/resource/Paris
> ",
> > > > > > > > > > > > > >       "@type": [
> > > > > > > > > > > > > >         "dbp-ont:Place",
> > > > > > > > > > > > > >         "dbp-ont:PopulatedPlace",
> > > > > > > > > > > > > >         "dbp-ont:Settlement",
> > > > > > > > > > > > > >         "http://www.opengis.net/gml/_Feature",
> > > > > > > > > > > > > >         "owl:Thing",
> > > > > > > > > > > > > >         "schema:Place"
> > > > > > > > > > > > > >       ],
> > > > > > > > > > > > > >       "foaf:depiction": [
> > > > > > > > > > > > > >         "
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> http://upload.wikimedia.org/wikipedia/commons/6/6e/Paris_-_Eiffelturm_und_Marsfeld2.jpg
> > > > > > > > > > > > > > ",
> > > > > > > > > > > > > >         "
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> http://upload.wikimedia.org/wikipedia/commons/thumb/6/6e/Paris_-_Eiffelturm_und_Marsfeld2.jpg/200px-Paris_-_Eiffelturm_und_Marsfeld2.jpg
> > > > > > > > > > > > > > "
> > > > > > > > > > > > > >       ],
> > > > > > > > > > > > > >       "geo:lat": 48.8567,
> > > > > > > > > > > > > >       "geo:long": 2.3508,
> > > > > > > > > > > > > >       "rdfs:comment": {
> > > > > > > > > > > > > >         "@language": "en",
> > > > > > > > > > > > > >         "@value": "Paris is the capital and
> largest
> > > > city
> > > > > of
> > > > > > > > > France.
> > > > > > > > > > > It
> > > > > > > > > > > > > > is situated on the river Seine, in northern
> France,
> > > at
> > > > > the
> > > > > > > > heart
> > > > > > > > > of
> > > > > > > > > > > > > > the Île-de-France region (or Paris Region,
> French:
> > > > Région
> > > > > > > > > > > parisienne).
> > > > > > > > > > > > > > As of January 2008 the city of Paris, within its
> > > > > > > administrative
> > > > > > > > > > > limits
> > > > > > > > > > > > > > largely unchanged since 1860, has an estimated
> > > > population
> > > > > > of
> > > > > > > > > > > 2,211,297
> > > > > > > > > > > > > > and a metropolitan population of 12,089,098, and
> is
> > > one
> > > > > of
> > > > > > > the
> > > > > > > > > most
> > > > > > > > > > > > > > populated metropolitan areas in Europe."
> > > > > > > > > > > > > >       },
> > > > > > > > > > > > > >       "rdfs:label": [
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >         {
> > > > > > > > > > > > > >           "@language": "en",
> > > > > > > > > > > > > >           "@value": "Paris"
> > > > > > > > > > > > > >         },
> > > > > > > > > > > > > >         {
> > > > > > > > > > > > > >           "@language": "fr",
> > > > > > > > > > > > > >           "@value": "Paris"
> > > > > > > > > > > > > >         },
> > > > > > > > > > > > > >       ]
> > > > > > > > > > > > > >     },
> > > > > > > > > > > > > >    }
> > > > > > > > > > > > > >     {
> > > > > > > > > > > > > >       "@id":
> > > > > > > > > > "urn:enhancement-8db13707-1ecd-b4df-90ad-52447c8f2c84",
> > > > > > > > > > > > > >       "@type": [
> > > > > > > > > > > > > >         "enhancer:Enhancement",
> > > > > > > > > > > > > >         "enhancer:TextAnnotation"
> > > > > > > > > > > > > >       ],
> > > > > > > > > > > > > >       "dc:created": "2015-12-07T11:22:07.740Z",
> > > > > > > > > > > > > >       "dc:creator":
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> "org.apache.stanbol.enhancer.engines.opennlp.impl.NamedEntityExtractionEnhancementEngine",
> > > > > > > > > > > > > >       "dc:type": "dbp-ont:Place",
> > > > > > > > > > > > > >       "enhancer:confidence": 0.6017613,
> > > > > > > > > > > > > >       "enhancer:end": 5,
> > > > > > > > > > > > > >       "enhancer:extracted-from":
> > > > > > > > > > > > > >
> > > > > > > >
> > "urn:content-item-sha1-c8ae372ed26679df14da13050dd432fd32c527e3",
> > > > > > > > > > > > > >       "enhancer:selected-text": {
> > > > > > > > > > > > > >         "@language": "en",
> > > > > > > > > > > > > >         "@value": "Paris"
> > > > > > > > > > > > > >       },
> > > > > > > > > > > > > >       "enhancer:selection-context": {
> > > > > > > > > > > > > >         "@language": "en",
> > > > > > > > > > > > > >         "@value": "Paris is in France"
> > > > > > > > > > > > > >       },
> > > > > > > > > > > > > >       "enhancer:start": 0
> > > > > > > > > > > > > >     },
> > > > > > > > > > > > > >     {
> > > > > > > > > > > > > >       "@id":
> > > > > > > > > > "urn:enhancement-b2855552-0e46-62f5-cd33-9f84ab32e547",
> > > > > > > > > > > > > >       "@type": [
> > > > > > > > > > > > > >         "enhancer:Enhancement",
> > > > > > > > > > > > > >         "enhancer:EntityAnnotation"
> > > > > > > > > > > > > >       ],
> > > > > > > > > > > > > >       "dc:created": "2015-12-07T11:22:07.748Z",
> > > > > > > > > > > > > >       "dc:creator":
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> "org.apache.stanbol.enhancer.engines.entitytagging.impl.NamedEntityTaggingEngine",
> > > > > > > > > > > > > >       "dc:relation":
> > > > > > > > > > > > > >
> > > "urn:enhancement-e9c9c187-2d69-2c1f-6552-e76111430d4a",
> > > > > > > > > > > > > >       "enhancer:confidence": 1.0,
> > > > > > > > > > > > > >       "enhancer:entity-label": {
> > > > > > > > > > > > > >         "@language": "en",
> > > > > > > > > > > > > >         "@value": "France"
> > > > > > > > > > > > > >       },
> > > > > > > > > > > > > >       "enhancer:entity-reference": "
> > > > > > > > > > > http://dbpedia.org/resource/France
> > > > > > > > > > > > ",
> > > > > > > > > > > > > >       "enhancer:entity-type": [
> > > > > > > > > > > > > >         "dbp-ont:Country",
> > > > > > > > > > > > > >         "dbp-ont:Place",
> > > > > > > > > > > > > >         "dbp-ont:PopulatedPlace",
> > > > > > > > > > > > > >         "schema:Country",
> > > > > > > > > > > > > >         "schema:Place",
> > > > > > > > > > > > > >         "http://www.opengis.net/gml/_Feature",
> > > > > > > > > > > > > >         "owl:Thing"
> > > > > > > > > > > > > >       ],
> > > > > > > > > > > > > >       "enhancer:extracted-from":
> > > > > > > > > > > > > >
> > > > > > > >
> > "urn:content-item-sha1-c8ae372ed26679df14da13050dd432fd32c527e3",
> > > > > > > > > > > > > >       "entityhub:site": "dbpedia"
> > > > > > > > > > > > > >     },
> > > > > > > > > > > > > >     {
> > > > > > > > > > > > > >       "@id":
> > > > > > > > > > "urn:enhancement-c50474e4-ea0e-03ff-5db5-a25f4c8dae45",
> > > > > > > > > > > > > >       "@type": [
> > > > > > > > > > > > > >         "enhancer:Enhancement",
> > > > > > > > > > > > > >         "enhancer:EntityAnnotation"
> > > > > > > > > > > > > >       ],
> > > > > > > > > > > > > >       "dc:created": "2015-12-07T11:22:07.748Z",
> > > > > > > > > > > > > >       "dc:creator":
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> "org.apache.stanbol.enhancer.engines.entitytagging.impl.NamedEntityTaggingEngine",
> > > > > > > > > > > > > >       "dc:relation":
> > > > > > > > > > > > > >
> > > "urn:enhancement-e9c9c187-2d69-2c1f-6552-e76111430d4a",
> > > > > > > > > > > > > >       "enhancer:confidence": 0.25715446,
> > > > > > > > > > > > > >       "enhancer:entity-label": {
> > > > > > > > > > > > > >         "@language": "en",
> > > > > > > > > > > > > >         "@value": "Vichy France"
> > > > > > > > > > > > > >       },
> > > > > > > > > > > > > >       "enhancer:entity-reference": "
> > > > > > > > > > > > > > http://dbpedia.org/resource/Vichy_France",
> > > > > > > > > > > > > >       "enhancer:entity-type": [
> > > > > > > > > > > > > >         "dbp-ont:Country",
> > > > > > > > > > > > > >         "dbp-ont:Place",
> > > > > > > > > > > > > >         "dbp-ont:PopulatedPlace",
> > > > > > > > > > > > > >         "schema:Country",
> > > > > > > > > > > > > >         "schema:Place",
> > > > > > > > > > > > > >         "http://www.opengis.net/gml/_Feature",
> > > > > > > > > > > > > >         "owl:Thing"
> > > > > > > > > > > > > >       ],
> > > > > > > > > > > > > >       "enhancer:extracted-from":
> > > > > > > > > > > > > >
> > > > > > > >
> > "urn:content-item-sha1-c8ae372ed26679df14da13050dd432fd32c527e3",
> > > > > > > > > > > > > >       "entityhub:site": "dbpedia"
> > > > > > > > > > > > > >     },
> > > > > > > > > > > > > >     {
> > > > > > > > > > > > > >       "@id":
> > > > > > > > > > "urn:enhancement-de07bc41-e4a1-f510-3f93-99ebfd8c39f4",
> > > > > > > > > > > > > >       "@type": [
> > > > > > > > > > > > > >         "enhancer:Enhancement",
> > > > > > > > > > > > > >         "enhancer:EntityAnnotation"
> > > > > > > > > > > > > >       ],
> > > > > > > > > > > > > >       "dc:created": "2015-12-07T11:22:07.748Z",
> > > > > > > > > > > > > >       "dc:creator":
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> "org.apache.stanbol.enhancer.engines.entitytagging.impl.NamedEntityTaggingEngine",
> > > > > > > > > > > > > >       "dc:relation":
> > > > > > > > > > > > > >
> > > "urn:enhancement-8db13707-1ecd-b4df-90ad-52447c8f2c84",
> > > > > > > > > > > > > >       "enhancer:confidence": 0.1493264,
> > > > > > > > > > > > > >       "enhancer:entity-label": {
> > > > > > > > > > > > > >         "@language": "en",
> > > > > > > > > > > > > >         "@value": "Paris Commune"
> > > > > > > > > > > > > >       },
> > > > > > > > > > > > > >       "enhancer:entity-reference": "
> > > > > > > > > > > > > > http://dbpedia.org/resource/Paris_Commune",
> > > > > > > > > > > > > >       "enhancer:entity-type": [
> > > > > > > > > > > > > >         "dbp-ont:Country",
> > > > > > > > > > > > > >         "dbp-ont:Place",
> > > > > > > > > > > > > >         "dbp-ont:PopulatedPlace",
> > > > > > > > > > > > > >         "schema:Country",
> > > > > > > > > > > > > >         "schema:Place",
> > > > > > > > > > > > > >         "owl:Thing"
> > > > > > > > > > > > > >       ],
> > > > > > > > > > > > > >       "enhancer:extracted-from":
> > > > > > > > > > > > > >
> > > > > > > >
> > "urn:content-item-sha1-c8ae372ed26679df14da13050dd432fd32c527e3",
> > > > > > > > > > > > > >       "entityhub:site": "dbpedia"
> > > > > > > > > > > > > >     },
> > > > > > > > > > > > > >     {
> > > > > > > > > > > > > >       "@id":
> > > > > > > > > > "urn:enhancement-e9c9c187-2d69-2c1f-6552-e76111430d4a",
> > > > > > > > > > > > > >       "@type": [
> > > > > > > > > > > > > >         "enhancer:Enhancement",
> > > > > > > > > > > > > >         "enhancer:TextAnnotation"
> > > > > > > > > > > > > >       ],
> > > > > > > > > > > > > >       "dc:created": "2015-12-07T11:22:07.740Z",
> > > > > > > > > > > > > >       "dc:creator":
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> "org.apache.stanbol.enhancer.engines.opennlp.impl.NamedEntityExtractionEnhancementEngine",
> > > > > > > > > > > > > >       "dc:type": "dbp-ont:Place",
> > > > > > > > > > > > > >       "enhancer:confidence": 0.99354976,
> > > > > > > > > > > > > >       "enhancer:end": 18,
> > > > > > > > > > > > > >       "enhancer:extracted-from":
> > > > > > > > > > > > > >
> > > > > > > >
> > "urn:content-item-sha1-c8ae372ed26679df14da13050dd432fd32c527e3",
> > > > > > > > > > > > > >       "enhancer:selected-text": {
> > > > > > > > > > > > > >         "@language": "en",
> > > > > > > > > > > > > >         "@value": "France"
> > > > > > > > > > > > > >       },
> > > > > > > > > > > > > >       "enhancer:selection-context": {
> > > > > > > > > > > > > >         "@language": "en",
> > > > > > > > > > > > > >         "@value": "Paris is in France"
> > > > > > > > > > > > > >       },
> > > > > > > > > > > > > >       "enhancer:start": 12
> > > > > > > > > > > > > >     }
> > > > > > > > > > > > > >   ]
> > > > > > > > > > > > > > }
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > On Mon, Dec 7, 2015 at 4:23 PM, Karl Wright <
> > > > > > > > daddywri@gmail.com>
> > > > > > > > > > > > wrote:
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > > Hi Dileepa,
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > Repository connectors have an abstraction that
> > > allows
> > > > > > them
> > > > > > > to
> > > > > > > > > > > > generate
> > > > > > > > > > > > > > > compound documents (where a document has a
> > primary
> > > > > > > > identifier,
> > > > > > > > > > and
> > > > > > > > > > > > > there
> > > > > > > > > > > > > > > are subdocuments that share that primary
> > identifier
> > > > and
> > > > > > > have
> > > > > > > > a
> > > > > > > > > > > > > secondary
> > > > > > > > > > > > > > > identifier).  This sounds a bit like what you
> are
> > > > > > > describing.
> > > > > > > > > > Does
> > > > > > > > > > > > > > Stanbol
> > > > > > > > > > > > > > > work by decorating an existing document, or
> does
> > it
> > > > > work
> > > > > > by
> > > > > > > > > > > > generating
> > > > > > > > > > > > > > all
> > > > > > > > > > > > > > > content for a document?
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > Karl
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > On Mon, Dec 7, 2015 at 5:12 AM, Dileepa
> Jayakody
> > <
> > > > > > > > > > > > djayakody@zaizi.com>
> > > > > > > > > > > > > > > wrote:
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > Hi All,
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > While thanking you all for your input on
> > Stanbol
> > > > > > > connector
> > > > > > > > > > > > > > requirement, I
> > > > > > > > > > > > > > > > would like to continue with modifying the
> > Stanbol
> > > > > > > connector
> > > > > > > > > to
> > > > > > > > > > be
> > > > > > > > > > > > > > > > compatible with any output connector. If you
> > guys
> > > > can
> > > > > > > give
> > > > > > > > > some
> > > > > > > > > > > > > > guidance
> > > > > > > > > > > > > > > on
> > > > > > > > > > > > > > > > how the entity metadata should be added to
> the
> > > > > > repository
> > > > > > > > > > > document
> > > > > > > > > > > > I
> > > > > > > > > > > > > > can
> > > > > > > > > > > > > > > > modify the stanbol connector accordingly.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > From Rafa's comments, I gathered we can add
> the
> > > > > entity
> > > > > > > > > metadata
> > > > > > > > > > > to
> > > > > > > > > > > > > the
> > > > > > > > > > > > > > > > repo.doc as key value pairs.
> > > > > > > > > > > > > > > > However this idea is not yet clear to me.
> There
> > > > could
> > > > > > be
> > > > > > > > 'N'
> > > > > > > > > > > number
> > > > > > > > > > > > > of
> > > > > > > > > > > > > > > > entities in a document and each of them will
> > have
> > > > > some
> > > > > > > > common
> > > > > > > > > > > > > > attributes
> > > > > > > > > > > > > > > > such as name, id, type and specific
> attributes
> > > for
> > > > > > > > particular
> > > > > > > > > > > > entity
> > > > > > > > > > > > > > > type.
> > > > > > > > > > > > > > > > I'm not clear on how to maintain that
> structure
> > > of
> > > > N
> > > > > > > number
> > > > > > > > > of
> > > > > > > > > > > > > entities
> > > > > > > > > > > > > > > > with their attributes in a repo.document as
> key
> > > > value
> > > > > > > pairs
> > > > > > > > > and
> > > > > > > > > > > > make
> > > > > > > > > > > > > > them
> > > > > > > > > > > > > > > > LDPath compatible for retrieval in an output
> > > > > connector.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > @Rafa
> > > > > > > > > > > > > > > > If you can please elaborate on your
> suggestion
> > it
> > > > > would
> > > > > > > be
> > > > > > > > > > > greatly
> > > > > > > > > > > > > > > helpful
> > > > > > > > > > > > > > > > to me.
> > > > > > > > > > > > > > > > All other suggestions are also welcome.
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > Thanks,
> > > > > > > > > > > > > > > > Dileepa
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > On Fri, Nov 13, 2015 at 7:00 PM, Karl Wright
> <
> > > > > > > > > > daddywri@gmail.com
> > > > > > > > > > > >
> > > > > > > > > > > > > > wrote:
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > I, too, agree.  Somebody will need to turn
> > this
> > > > > > > connector
> > > > > > > > > > into
> > > > > > > > > > > > one
> > > > > > > > > > > > > > that
> > > > > > > > > > > > > > > > > plays by the rules.  It may be possible for
> > > > someone
> > > > > > on
> > > > > > > > the
> > > > > > > > > > team
> > > > > > > > > > > > > here
> > > > > > > > > > > > > > to
> > > > > > > > > > > > > > > > do
> > > > > > > > > > > > > > > > > that, but it won't be me; I'm seriously
> > > > > overextended
> > > > > > at
> > > > > > > > the
> > > > > > > > > > > > moment.
> > > > > > > > > > > > > > It
> > > > > > > > > > > > > > > > > would be best if someone who knew the
> > connector
> > > > > well
> > > > > > > > could
> > > > > > > > > do
> > > > > > > > > > > the
> > > > > > > > > > > > > > > > necessary
> > > > > > > > > > > > > > > > > work.
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > Karl
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > On Fri, Nov 13, 2015 at 5:45 AM, Rafa Haro
> <
> > > > > > > > > > > > rharoapache@gmail.com>
> > > > > > > > > > > > > > > > wrote:
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > I must agree with Antonio. When I started
> > to
> > > > work
> > > > > > on
> > > > > > > > > this I
> > > > > > > > > > > was
> > > > > > > > > > > > > > > > expecting
> > > > > > > > > > > > > > > > > > the connector to work by just extracting
> > the
> > > > > > entities
> > > > > > > > and
> > > > > > > > > > > > > entities
> > > > > > > > > > > > > > > > > metadata
> > > > > > > > > > > > > > > > > > and put them as plain metadata of the
> > > > documents,
> > > > > > > > probably
> > > > > > > > > > > > > following
> > > > > > > > > > > > > > > > > LDPATH
> > > > > > > > > > > > > > > > > > queries configuration
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > This is probably ok for Sensefy but I
> don’t
> > > > think
> > > > > > > this
> > > > > > > > > > could
> > > > > > > > > > > be
> > > > > > > > > > > > > > > > suitable
> > > > > > > > > > > > > > > > > > to be included in the project. But this
> is
> > > only
> > > > > my
> > > > > > > > > opinion.
> > > > > > > > > > > Of
> > > > > > > > > > > > > > > course,
> > > > > > > > > > > > > > > > a
> > > > > > > > > > > > > > > > > > version of the connector that fully
> respect
> > > the
> > > > > > > > > ManifoldCF
> > > > > > > > > > > > > > > architecture
> > > > > > > > > > > > > > > > > > would be more than welcome in my opinion
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > On Fri, Nov 13, 2015 at 11:38 AM, Antonio
> > > David
> > > > > > Pérez
> > > > > > > > > > Morales
> > > > > > > > > > > > > > > > > > <adperezmorales@gmail.com> wrote:
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > > Hi
> > > > > > > > > > > > > > > > > > > The removal of the SolrWrapper is a
> must.
> > > It
> > > > > was
> > > > > > a
> > > > > > > > > > > > requirement
> > > > > > > > > > > > > > for
> > > > > > > > > > > > > > > an
> > > > > > > > > > > > > > > > > > > internal project which has nothing to
> do
> > > here
> > > > > > with
> > > > > > > a
> > > > > > > > > > normal
> > > > > > > > > > > > > > > operation
> > > > > > > > > > > > > > > > > of
> > > > > > > > > > > > > > > > > > > Manifold, so forcing the users to use
> > Solr
> > > > does
> > > > > > not
> > > > > > > > fit
> > > > > > > > > > the
> > > > > > > > > > > > > > > Manifold
> > > > > > > > > > > > > > > > > > > philosophy.
> > > > > > > > > > > > > > > > > > > In my opinion, at this moment, a
> Stanbol
> > > > > > connector
> > > > > > > > with
> > > > > > > > > > > such
> > > > > > > > > > > > a
> > > > > > > > > > > > > > big
> > > > > > > > > > > > > > > > > > > dependency which will not fit almost
> any
> > > use
> > > > > case
> > > > > > > is
> > > > > > > > > not
> > > > > > > > > > > very
> > > > > > > > > > > > > > > useful.
> > > > > > > > > > > > > > > > > > > You should think a way to convert
> Stanbol
> > > > > > connector
> > > > > > > > > into
> > > > > > > > > > a
> > > > > > > > > > > > > normal
> > > > > > > > > > > > > > > > > > > Transformation connector without
> assuming
> > > > that
> > > > > a
> > > > > > > > > specific
> > > > > > > > > > > > > output
> > > > > > > > > > > > > > > > > > connector
> > > > > > > > > > > > > > > > > > > will be used.
> > > > > > > > > > > > > > > > > > > Regards
> > > > > > > > > > > > > > > > > > > 2015-11-13 11:20 GMT+01:00 Dileepa
> > > Jayakody <
> > > > > > > > > > > > > djayakody@zaizi.com
> > > > > > > > > > > > > > >:
> > > > > > > > > > > > > > > > > > >> Hi guys,
> > > > > > > > > > > > > > > > > > >>
> > > > > > > > > > > > > > > > > > >> I have developed a Stanbol connector
> for
> > > > MCF.
> > > > > > You
> > > > > > > > can
> > > > > > > > > > > check
> > > > > > > > > > > > it
> > > > > > > > > > > > > > out
> > > > > > > > > > > > > > > > > from
> > > > > > > > > > > > > > > > > > our
> > > > > > > > > > > > > > > > > > >> github repo here:
> > > > > > > > > > > > > > > > > > >>
> > > > > > > > > > > > > > > > > > >>
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> https://github.com/zaizi/sensefy-connectors/tree/master/transformation/mcf-stanbol-connector
> > > > > > > > > > > > > > > > > > >>
> > > > > > > > > > > > > > > > > > >> It requires the SolrWrapper output
> > > connector
> > > > > > which
> > > > > > > > > > indexes
> > > > > > > > > > > > > > > enhanced
> > > > > > > > > > > > > > > > > > >> documents, entities and entityTypes in
> > > > > separate
> > > > > > > Solr
> > > > > > > > > > > cores.
> > > > > > > > > > > > > > > > Basically
> > > > > > > > > > > > > > > > > it
> > > > > > > > > > > > > > > > > > >> requires 3 separate solr cores
> > configured
> > > > > with a
> > > > > > > > > > specific
> > > > > > > > > > > > Solr
> > > > > > > > > > > > > > > > schema
> > > > > > > > > > > > > > > > > > for
> > > > > > > > > > > > > > > > > > >> primary documents, entities and
> > > entityTypes
> > > > > > > > > separately.
> > > > > > > > > > > This
> > > > > > > > > > > > > was
> > > > > > > > > > > > > > > > done
> > > > > > > > > > > > > > > > > > for
> > > > > > > > > > > > > > > > > > >> our specific use-case.
> > > > > > > > > > > > > > > > > > >>
> > > > > > > > > > > > > > > > > > >> The SolrWrapper code is here :
> > > > > > > > > > > > > > > > > > >>
> > > > > > > > > > > > > > > > > > >>
> > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> https://github.com/zaizi/sensefy-connectors/tree/master/output/mcf-solrwrapperconnector
> > > > > > > > > > > > > > > > > > >>
> > > > > > > > > > > > > > > > > > >> Perhaps we can discuss and remove the
> > > > Stanbol
> > > > > > > > > > connector's
> > > > > > > > > > > > > > > dependency
> > > > > > > > > > > > > > > > > > with
> > > > > > > > > > > > > > > > > > >> SolrWrapper and have it working with
> any
> > > > > output
> > > > > > > > > > connector.
> > > > > > > > > > > > > > > > > > >> Please note that the Stanbol connector
> > > > > currently
> > > > > > > > has a
> > > > > > > > > > bug
> > > > > > > > > > > > in
> > > > > > > > > > > > > > the
> > > > > > > > > > > > > > > UI
> > > > > > > > > > > > > > > > > > >> (editSpecification) which I'm working
> on
> > > at
> > > > > the
> > > > > > > > > moment.
> > > > > > > > > > > > After
> > > > > > > > > > > > > > > fixing
> > > > > > > > > > > > > > > > > > that I
> > > > > > > > > > > > > > > > > > >> will update here. And also I will
> > provide
> > > > > > > > > documentations
> > > > > > > > > > > for
> > > > > > > > > > > > > > > > > configuring
> > > > > > > > > > > > > > > > > > >> the connector.
> > > > > > > > > > > > > > > > > > >>
> > > > > > > > > > > > > > > > > > >> Thanks,
> > > > > > > > > > > > > > > > > > >> Dileepa
> > > > > > > > > > > > > > > > > > >>
> > > > > > > > > > > > > > > > > > >> On Thu, Jul 9, 2015 at 8:36 PM,
> Antonio
> > > > David
> > > > > > > Pérez
> > > > > > > > > > > Morales
> > > > > > > > > > > > <
> > > > > > > > > > > > > > > > > > >> adperezmorales@gmail.com> wrote:
> > > > > > > > > > > > > > > > > > >>
> > > > > > > > > > > > > > > > > > >> > Hi Joshua
> > > > > > > > > > > > > > > > > > >> >
> > > > > > > > > > > > > > > > > > >> > It is not the list for that, but
> > > Marmotta
> > > > is
> > > > > > > > already
> > > > > > > > > > > > > > integrated
> > > > > > > > > > > > > > > in
> > > > > > > > > > > > > > > > > > Apache
> > > > > > > > > > > > > > > > > > >> > Stanbol. You can take a look at this
> > > issue
> > > > > > > > > > > > > > > > > > >> >
> > > > > > > > https://issues.apache.org/jira/browse/STANBOL-1165
> > > > > > > > > .
> > > > > > > > > > > > > > > > > > >> >
> > > > > > > > > > > > > > > > > > >> > Anyway, as I said this is not the
> list
> > > for
> > > > > > that,
> > > > > > > > so
> > > > > > > > > > > let's
> > > > > > > > > > > > > use
> > > > > > > > > > > > > > > the
> > > > > > > > > > > > > > > > > > proper
> > > > > > > > > > > > > > > > > > >> > list for these things.
> > > > > > > > > > > > > > > > > > >> >
> > > > > > > > > > > > > > > > > > >> > Regards
> > > > > > > > > > > > > > > > > > >> >
> > > > > > > > > > > > > > > > > > >> >
> > > > > > > > > > > > > > > > > > >> >
> > > > > > > > > > > > > > > > > > >> > 2015-07-09 15:29 GMT+02:00 Joshua
> > > Dunham <
> > > > > > > > > > > > > > > joshua.dunham@gmail.com
> > > > > > > > > > > > > > > > >:
> > > > > > > > > > > > > > > > > > >> >
> > > > > > > > > > > > > > > > > > >> > > Hey Dileepa,
> > > > > > > > > > > > > > > > > > >> > >
> > > > > > > > > > > > > > > > > > >> > >       In case you were
> interested, I
> > > > > pinged
> > > > > > > the
> > > > > > > > > > list a
> > > > > > > > > > > > few
> > > > > > > > > > > > > > > days
> > > > > > > > > > > > > > > > > ago
> > > > > > > > > > > > > > > > > > >> > asking
> > > > > > > > > > > > > > > > > > >> > > for integration tips for Apache
> > > > Marmotta.
> > > > > > > > > > > > > > > > > > >> > >
> > > > > > > > > > > > > > > > > > >> > > I got some great tips on how to do
> > > this
> > > > > > which
> > > > > > > > > could
> > > > > > > > > > > help
> > > > > > > > > > > > > > you.
> > > > > > > > > > > > > > > > > Since
> > > > > > > > > > > > > > > > > > >> > > Marmotta is a drop in replacement
> > for
> > > > > > Clarezza
> > > > > > > > on
> > > > > > > > > > > > Stanbol
> > > > > > > > > > > > > it
> > > > > > > > > > > > > > > may
> > > > > > > > > > > > > > > > > be
> > > > > > > > > > > > > > > > > > >> > easier
> > > > > > > > > > > > > > > > > > >> > > for you to take this way.
> > > > > > > > > > > > > > > > > > >> > >
> > > > > > > > > > > > > > > > > > >> > > I'm not a Java programmer but I'm
> > > > bringing
> > > > > > > this
> > > > > > > > > > > problem
> > > > > > > > > > > > to
> > > > > > > > > > > > > > the
> > > > > > > > > > > > > > > > > > >> > development
> > > > > > > > > > > > > > > > > > >> > > staff at my company for
> assistance.
> > If
> > > > you
> > > > > > > like
> > > > > > > > > the
> > > > > > > > > > > > > Marmotta
> > > > > > > > > > > > > > > > > > approach
> > > > > > > > > > > > > > > > > > >> we
> > > > > > > > > > > > > > > > > > >> > > may gain more traction solving the
> > > same
> > > > > > > > > integration.
> > > > > > > > > > > > > > > > > > >> > >
> > > > > > > > > > > > > > > > > > >> > > I'm also integrating Marmotta with
> > > > Stanbol
> > > > > > so
> > > > > > > > the
> > > > > > > > > > > effect
> > > > > > > > > > > > > > would
> > > > > > > > > > > > > > > > be
> > > > > > > > > > > > > > > > > > the
> > > > > > > > > > > > > > > > > > >> > same
> > > > > > > > > > > > > > > > > > >> > > except not using the Stanbol API
> for
> > > > data
> > > > > > > import
> > > > > > > > > in
> > > > > > > > > > > > favor
> > > > > > > > > > > > > of
> > > > > > > > > > > > > > > > > > Marmotta.
> > > > > > > > > > > > > > > > > > >> > >
> > > > > > > > > > > > > > > > > > >> > > Best,
> > > > > > > > > > > > > > > > > > >> > >
> > > > > > > > > > > > > > > > > > >> > > -J
> > > > > > > > > > > > > > > > > > >> > >
> > > > > > > > > > > > > > > > > > >> > > > On Jul 9, 2015, at 1:03 AM,
> > Dileepa
> > > > > > > Jayakody <
> > > > > > > > > > > > > > > > > djayakody@zaizi.com
> > > > > > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > > > > >> > > wrote:
> > > > > > > > > > > > > > > > > > >> > > >
> > > > > > > > > > > > > > > > > > >> > > > Hi all,
> > > > > > > > > > > > > > > > > > >> > > >
> > > > > > > > > > > > > > > > > > >> > > > Thanks you for the feedback and
> > > > offering
> > > > > > > your
> > > > > > > > > help
> > > > > > > > > > > in
> > > > > > > > > > > > > > this.
> > > > > > > > > > > > > > > > > > >> > > > Let me get back to you on where
> to
> > > > start
> > > > > > the
> > > > > > > > > code
> > > > > > > > > > > > base.
> > > > > > > > > > > > > > > > > > >> > > > As the first step, I would like
> to
> > > > start
> > > > > > by
> > > > > > > > > > > creating a
> > > > > > > > > > > > > > > > > > architecture
> > > > > > > > > > > > > > > > > > >> > > diagram
> > > > > > > > > > > > > > > > > > >> > > > for the connector.
> > > > > > > > > > > > > > > > > > >> > > > I will send the diagram for your
> > > > review
> > > > > > > soon.
> > > > > > > > > > > > > > > > > > >> > > >
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message