Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id AEA7F200D3E for ; Thu, 12 Oct 2017 01:44:05 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id AD1C01609E5; Wed, 11 Oct 2017 23:44:05 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id F3686160BE3 for ; Thu, 12 Oct 2017 01:44:04 +0200 (CEST) Received: (qmail 55158 invoked by uid 500); 11 Oct 2017 23:44:04 -0000 Mailing-List: contact dev-help@manifoldcf.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@manifoldcf.apache.org Delivered-To: mailing list dev@manifoldcf.apache.org Received: (qmail 54848 invoked by uid 99); 11 Oct 2017 23:44:04 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 11 Oct 2017 23:44:04 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 4B19A1849A2 for ; Wed, 11 Oct 2017 23:44:03 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -99.202 X-Spam-Level: X-Spam-Status: No, score=-99.202 tagged_above=-999 required=6.31 tests=[KAM_ASCII_DIVIDERS=0.8, RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id gTHgHfHHuON1 for ; Wed, 11 Oct 2017 23:44:01 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTP id 609CC5FBE9 for ; Wed, 11 Oct 2017 23:44:01 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id DA8FCE0EB9 for ; Wed, 11 Oct 2017 23:44:00 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 8D0B82539B for ; Wed, 11 Oct 2017 23:44:00 +0000 (UTC) Date: Wed, 11 Oct 2017 23:44:00 +0000 (UTC) From: "Karl Wright (JIRA)" To: dev@manifoldcf.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Comment Edited] (CONNECTORS-1356) Initial implementation of the CMIS Output Connector MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Wed, 11 Oct 2017 23:44:05 -0000 [ https://issues.apache.org/jira/browse/CONNECTORS-1356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16201204#comment-16201204 ] Karl Wright edited comment on CONNECTORS-1356 at 10/11/17 11:43 PM: -------------------------------------------------------------------- [~piergiorgiolucidi@gmail.com], you should return MODEL_ADD_CHANGE_DELETE *only* if your seeding query returns documents that have been deleted in addition to those that have been added or changed. If your seeding query does not return deleted documents, then maybe MODEL_ADD_CHANGE is appropriate, if the seeding query is the only way new or changed documents can be discovered. But if new documents can be discovered by scanning crawled documents, then MODEL_CHAINED_ADD_CHANGE is the right model. I've been looking for a connector that uses straight MODEL_ADD_CHANGE where there's an integration test, and I haven't found one yet. But MODEL_CHAINED_ADD_CHANGE is very similar and the file system connector uses it. The current trunk Cmis connector uses MODEL_CHAINED_ADD_CHANGE too. The web and rss connectors use MODEL_ALL, which stipulates that the seeding query return all documents in the set, so these won't help. So, this is how I would debug this: (1) Try running the file system connector integration tests. Do these pass? (2) If they don't, figure out what changes have been made to the framework in your branch that aren't on trunk. (3) If these tests pass, try changing your connector model to MODEL_CHAINED_ADD_CHANGE. If that *does* work, maybe the framework is doing something wrong with the unchained models, and I'll have to look into that. In that case it would help if you could describe to me all ways that the CMIS connector discovers added and changed documents. Thanks! was (Author: kwright@metacarta.com): [~piergiorgiolucidi@gmail.com], you should return MODEL_ADD_CHANGE_DELETE *only* if your seeding query returns documents that have been deleted in addition to those that have been added or changed. If your seeding query does not return deleted documents, then maybe MODEL_ADD_CHANGE is appropriate, if the seeding query is the only way new or changed documents can be discovered. But if new documents can be discovered by scanning crawled documents, then MODEL_CHAINED_ADD_CHANGE is the right model. I've been looking for a connector that uses straight MODEL_ADD_CHANGE where there's an integration test, and I haven't found one yet. But MODEL_CHAINED_ADD_CHANGE is very similar and the file system connector uses it. The current trunk Cmis connector uses MODEL_CHAINED_ADD_CHANGE too. The web and rss connectors use MODEL_ALL, which stipulates that the seeding query return all documents in the set, so these won't help. So, this is how I would debug this: (1) Try running the file system connector integration tests. Do these pass? (2) If they don't, figure out what changes have been made to the framework in your branch that aren't on trunk. (3) If these tests pass, try changing your connector model to MODEL_CHAINED_ADD_CHANGE. If that *does* work, maybe the framework is doing something wrong with the unchained models, and I'll have to look into that. It would help if you Thanks! > Initial implementation of the CMIS Output Connector > --------------------------------------------------- > > Key: CONNECTORS-1356 > URL: https://issues.apache.org/jira/browse/CONNECTORS-1356 > Project: ManifoldCF > Issue Type: Task > Components: CMIS Output Connector > Reporter: Piergiorgio Lucidi > Assignee: Piergiorgio Lucidi > Fix For: ManifoldCF 2.9 > > Attachments: CMISOutputConnectorSettings.png, ResultWithTimestampTreeOnTargetRepo.png, SourceRepoFolder.png > > Original Estimate: 672h > Remaining Estimate: 672h > > After we have discussed about the possibility to have some output connector dedicated to migrate contents from a specific repo to another one, this is the first implementation for the CMIS protocol. > The discussion is the following: > http://mail-archives.apache.org/mod_mbox/manifoldcf-dev/201611.mbox/%3CCAEO2op-bjNv4xSTPwGN%3DV2v47Sy8d%2BwKNtd1RpV2PC85y_JAgw%40mail.gmail.com%3E > The main scenario is migrate contents from any repository type to a CMIS-compliant repo. -- This message was sent by Atlassian JIRA (v6.4.14#64029)