From: David Arroyo
Date: Thu, 4 Aug 2016 17:43:47 +0200
Subject: Re: CONNECTORS-1290 [GSOC 2016] Nuxeo repository and Authority connector for Apache ManifoldCF
To: dev@manifoldcf.apache.org

Hi Karl,

Let me answer your questions here; next week I will also provide a document
describing both connectors in detail.

- what the document ID's are

The document IDs are the original document UIDs, which are unique IDs
provided by Nuxeo.

- what the access tokens are

For each document, the usernames and groups that have access to the document
are stored. When a user searches, their username and all of their groups are
matched against these tokens.

- what the document specification has in it

Sorry, I don't understand this question.

- what the configuration has in it

The configuration currently supported is:

  - Select the domains to be crawled.
  - Select the document types to be crawled.
  - Choose whether the tags must be processed.

- a description of any metadata you include with the document

The metadata included with the document are: UID, title, last modified,
state, path, type, is checked out, repository, parent reference,
description, language, coverage, valid, creators, contributors, last
contributor, rights, expired, created, issued, nature, source, publisher and
subjects. Specifically, the notes and MIME type are also included.

I have implemented some unit tests. Furthermore, I have been looking into how
to write an integration test, but it is not easy because Nuxeo doesn't seem
to provide Maven resources that ease the development of integration tests.

Regards.

On 3 August 2016 at 09:56, Karl Wright wrote:

> Thanks, David.
>
> Since we'll be picking up the nuxeo connector and authority into the
> project, it would be great to have a high-level description of what you did
> here.  Specifically:
>
> - what the document ID's are
> - what the access tokens are
> - what the document specification has in it
> - what the configuration has in it
> - a description of any metadata you include with the document
>
> I hope you also will consider supplying an integration test, ideally one
> that sets up its own Nuxeo instance.  If that's not possible that's OK but
> it is usually quite helpful to have such a thing.
>
> Thanks again!
> Karl
>
>
> On Wed, Aug 3, 2016 at 3:35 AM, David Arroyo wrote:
>
> > Hi,
> >
> > Last week, I finished developing the authority connector and I have been
> > testing it directly with a ManifoldCF instance. The authority connector has
> > been developed using Acls [1]. Each document stores users and groups which
> > have read permissions. When a user searches, they can only see the
> > documents which contain his groups or his username.
> >
> > The repository connector has been developed using an incremental mode for
> > seeding. At the moment, it just stores the basic properties of the
> > documents.
> >
> > During the next weeks, I will be doing some improvements until the last
> > week of my work period. During this last week, the connectors will be
> > tested and documented.
> >
> > Some improvements are:
> > - Get the document tags
> > - Give the option of choosing the domain and the documents type to be
> > crawled.
> >
> > Regards.
> >
> >
> > [1] https://doc.nuxeo.com/display/NXDOC/ACLs
> >
> > --
> > David Arroyo Escobar
> >
>

--
David Arroyo Escobar
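
The access-token matching described in the reply (each document stores the
usernames and groups allowed to read it, and a search matches the user's
username and groups against them) can be illustrated with a minimal sketch.
This is not the connector's code: the names IndexedDocument, user_tokens and
visible_documents are hypothetical, and treating usernames and groups as
plain strings in one shared set is a simplifying assumption.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class IndexedDocument:
    """A crawled document with its access tokens (hypothetical shape)."""
    uid: str
    allowed_tokens: frozenset = field(default_factory=frozenset)


def user_tokens(username, groups):
    """Tokens a user presents at search time: the username plus every group."""
    return frozenset([username, *groups])


def visible_documents(documents, username, groups):
    """Keep the documents whose stored tokens share at least one token with the user."""
    tokens = user_tokens(username, groups)
    return [doc for doc in documents if doc.allowed_tokens & tokens]


# Example data, invented for illustration only.
docs = [
    IndexedDocument("doc-1", frozenset({"alice", "editors"})),
    IndexedDocument("doc-2", frozenset({"managers"})),
]

# "bob" is not listed on any document, but his group "editors" is on doc-1.
visible = [doc.uid for doc in visible_documents(docs, "bob", ["editors"])]
```

In this sketch a document is visible as soon as one token matches, so a user
with no matching username or group sees nothing.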