From dev-return-18805-archive-asf-public=cust-asf.ponee.io@manifoldcf.apache.org Sat Dec 1 21:05:54 2018 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by mx-eu-01.ponee.io (Postfix) with SMTP id D2A4C18064E for ; Sat, 1 Dec 2018 21:05:53 +0100 (CET) Received: (qmail 3348 invoked by uid 500); 1 Dec 2018 20:05:52 -0000 Mailing-List: contact dev-help@manifoldcf.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@manifoldcf.apache.org Delivered-To: mailing list dev@manifoldcf.apache.org Received: (qmail 3272 invoked by uid 99); 1 Dec 2018 20:05:52 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 01 Dec 2018 20:05:52 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id D81581869F5 for ; Sat, 1 Dec 2018 20:05:51 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.297 X-Spam-Level: ** X-Spam-Status: No, score=2.297 tagged_above=-999 required=6.31 tests=[DKIMWL_WL_MED=-0.001, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HTML_MESSAGE=2, KAM_NUMSUBJECT=0.5, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd3-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id E31n-xqsNTkm for ; Sat, 1 Dec 2018 20:05:49 +0000 (UTC) Received: from mail-wm1-f68.google.com (mail-wm1-f68.google.com [209.85.128.68]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id 980D160D2B for ; Sat, 1 Dec 2018 20:05:48 +0000 (UTC) Received: by mail-wm1-f68.google.com with SMTP id c126so2085525wmh.0 for ; Sat, 01 Dec 2018 12:05:48 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to; bh=cY9+fkaKrznNdDlcZUxlD1XBjbxebUWDQ1ZrlXdn3u4=; b=CU0TGxaZ/XKaxLrsnyihQrhHJiGCq0dLjXkE503yuIN4Lx+qnvReARNzR6dcab8xbl wRpjxt6ewDcTVZViTIS+fF/n3QdyIY6UR6WM47eNyD0s8YaYa5bO2vEjYNBX+yzm8Skv e63kL85hno2q9WI4CLwNhFqWfbfoBTrlczUjjwp7/lud1YtNfoVhDwKd/UVe/yV36/z4 Jvt2i0R1uorGuDC9LATeBWry5plKtXp7GwesqKCLDDQQ49DgfQmTMf1RNo6mWKzCaugc EFAmXo/bPpZEbWz0uzEx+FbeualsdfOFboLx+/T6oEuBp9Z/sEWwnf7k1DWZAuznPXp5 ZP5w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to; bh=cY9+fkaKrznNdDlcZUxlD1XBjbxebUWDQ1ZrlXdn3u4=; b=ZQ06laiNaJUuvA2fLj3EE8/9UyfRTGN3p6C+ILNd2CA3Ox9wb0wCIKeTLT4oqX8jm2 tUuQtI5OidEoNA5sULOOfqLILA2+A+2RksrfhGjMv0CnTIKdEQudeIAFgtgYp59YVEMi QvHwgp4oyiMNIEx9ne741QnMob/sfqsmiBG6abQRjhQY7yr2P6BFmRxM78+VPoPm3Pco Inp59MZIf6oxgRmxlN1CYckaRPZkGVe3h1TbQgR/vX63y5r7Y/Qp7dq3tiQWwcbpx/yY hus80kfXZJPuZ3nGL+ZavCKKGScouPgNLVBobwvVs08XAf0tZWoxkLtmxKZ6PxHWA7mr IVnw== X-Gm-Message-State: AA+aEWb96FgBj8sdEG9v8p9u3ia66CUH/ZG38W/psDy+rdMvoewdn1Qa nuRUe6LA1GnqC2eMvPWYJiEH/l8uuR32xaD2u8Bksg== X-Google-Smtp-Source: AFSGD/Vpc53LDr+xcB8jjPQs1JgUSvwvpQCzjs0l56hn7obMLk9o8V2Cmt86mUTbbNU76fvNpcTmInbbmoJteYDdv0s= X-Received: by 2002:a1c:414:: with SMTP id 20mr3116477wme.67.1543694747737; Sat, 01 Dec 2018 12:05:47 -0800 (PST) MIME-Version: 1.0 References: In-Reply-To: From: Karl Wright Date: Sat, 1 Dec 2018 15:05:36 -0500 Message-ID: Subject: Re: Apache Manifold 2.10 To: dev Content-Type: multipart/alternative; boundary="00000000000014b196057bfb71a2" --00000000000014b196057bfb71a2 Content-Type: text/plain; charset="UTF-8" Another thing: it's quite important to guarantee a working setup here, otherwise you're just wasting everyone's time. So, please base your installation on the multiprocess-zk-example. Start off by running the example as is, on a small test crawl. Once you know how it works, then move next to changing only what you have to -- namely, the database properties in the global properties file, to point to your MySQL instance. Try that also on a small test case (crawl some files for instance), before trying it on your large case. Every step of the way should work, and if it doesn't, figure out why not before you move onto the next step. Thanks, Karl On Sat, Dec 1, 2018 at 2:59 PM Karl Wright wrote: > Zookeeper does not require a locking directory. It is a process that > synchronizes other processes, and they connect to it by port. > > Karl > > > On Sat, Dec 1, 2018 at 2:55 PM krishna agrawal > wrote: > >> Thanks for the information. >> if we use Zookeeper how can we make sure all our ManifoldCF processes use >> same locking directory does it can be done at the configuration level >> while >> installing. >> >> thanks, >> Krishna A >> >> On Sat, Dec 1, 2018 at 1:39 PM Karl Wright wrote: >> >> > That error is the result of the database not managing transactions >> > properly. It can occur if the locking system is not set up properly, >> or if >> > you are using multiple agents processes and each process does not have >> its >> > own ID. We have also seen it reported before just because MySQL seems >> to >> > have bugs and sometimes writes are delayed or don't go through. >> > >> > My recommendation would be to: >> > (1) use zookeeper, not file locking >> > (2) Make sure all your ManifoldCF processes use the SAME locking >> directory >> > or Zookeeper instance >> > (3) If you are using multiple agents process, be certain that each such >> > process gets its own ID (as is done in the examples). >> > >> > Karl >> > >> > >> > On Sat, Dec 1, 2018 at 11:43 AM krishna agrawal >> > wrote: >> > >> > > Thanks Karl, >> > > >> > > I will take a look at it >> > > >> > > But there is the error keep on tossing at manifold log >> > > >> > > ERROR 2018-12-01T11:13:26,297 (Job reset thread) - Exception tossed: >> > > Unexpected job status encountered: 33 >> > > org.apache.manifoldcf.core.interfaces.ManifoldCFException: Unexpected >> job >> > > status encountered: 33 >> > > at >> > > >> org.apache.manifoldcf.crawler.jobs.Jobs.returnJobToActive(Jobs.java:2145) >> > > ~[mcf-pull-agent.jar:?] >> > > at >> > > >> > > >> > >> org.apache.manifoldcf.crawler.jobs.JobManager.resetJobs(JobManager.java:8449) >> > > ~[mcf-pull-agent.jar:?] >> > > at >> > > >> > > >> > >> org.apache.manifoldcf.crawler.system.JobResetThread.run(JobResetThread.java:77) >> > > [mcf-pull-agent.jar:?] >> > > >> > > Thanks, >> > > Krishna A >> > > >> > > >> > > On Fri, Nov 30, 2018 at 7:00 PM Karl Wright >> wrote: >> > > >> > > > Hi Krishna, >> > > > >> > > > First of all I suggest that you *not* use multiprocess-file-example, >> > and >> > > > instead use multiprocess-zk-example. >> > > > >> > > > Your symptoms suggest many possibilities. But if you move to >> Zookeeper >> > > we >> > > > will be able to eliminate dangling file locks as a complication. So >> > > please >> > > > do that first. >> > > > >> > > > Karl >> > > > >> > > > >> > > > On Fri, Nov 30, 2018 at 6:29 PM krishna agrawal < >> krish.agwl@gmail.com> >> > > > wrote: >> > > > >> > > > > Yeah in our local set up we did Simple example but in server we >> did >> > > > > multiprocess-file-example are you suggesting us to upgrade from >> 2.10 >> > to >> > > > > 2.11 ? >> > > > > >> > > > > and we are using MY Sql database , >> > > > > >> > > > > So most of time i saw nothing is running and still it say job is >> > > running >> > > > > and you have to wait for it to complete. >> > > > > >> > > > > and restarting also not helping. >> > > > > >> > > > > Any other solution woould be greatly appreciated. >> > > > > >> > > > > Thanks, >> > > > > Krishna A >> > > > > >> > > > > On Fri, Nov 30, 2018 at 10:50 AM Karl Wright >> > > wrote: >> > > > > >> > > > > > It also may be useful to start with the simple example, which is >> > not >> > > > > > multiprocess, and get familiar with using ManifoldCF that way, >> > before >> > > > you >> > > > > > try to go to a more complicated setup. >> > > > > > >> > > > > > Thanks, >> > > > > > Karl >> > > > > > >> > > > > > >> > > > > > On Fri, Nov 30, 2018 at 9:46 AM Karl Wright > > >> > > > wrote: >> > > > > > >> > > > > > > "simplified multi-process"? There is no such example. >> > > > > > > >> > > > > > > These are the examples available. Which one are you using? >> > > > > > > >> > > > > > > 11/15/2018 03:40 AM example >> > > > > > > 11/15/2018 03:40 AM example-proprietary >> > > > > > > 11/15/2018 03:40 AM >> multiprocess-file-example >> > > > > > > 11/15/2018 03:40 AM >> > > > > > > multiprocess-file-example-proprietary >> > > > > > > 11/15/2018 03:40 AM multiprocess-zk-example >> > > > > > > 11/15/2018 03:40 AM >> > > > > > multiprocess-zk-example-proprietary >> > > > > > > >> > > > > > > Cleaning locks makes no sense unless you are using the >> > > > > multiprocess-file >> > > > > > > setup. This is deprecated, by the way, in favor of the >> Zookeeper >> > > > > setup. >> > > > > > > >> > > > > > > As for the buttons, please read: >> > > > > > > >> > > > > > > >> > > > > > > >> > > > > > >> > > > > >> > > > >> > > >> > >> https://manifoldcf.apache.org/release/release-2.11/en_US/end-user-documentation.html#outputs >> > > > > > > >> > > > > > > The buttons in question are "Reindex all..." and "Remove >> all..." >> > > > > > > >> > > > > > > Karl >> > > > > > > >> > > > > > > >> > > > > > > On Fri, Nov 30, 2018 at 9:36 AM krishna agrawal < >> > > > krish.agwl@gmail.com> >> > > > > > > wrote: >> > > > > > > >> > > > > > >> We have deployed the Manifold using >> > > > > > >> >> > > > > > >> - Simplified multi-process model >> > > > > > >> >> > > > > > >> We did try clean up of lock Sh but that also did not work. >> > > > > > >> >> > > > > > >> I dont have forget all document button in output connector. >> > > > > > >> >> > > > > > >> [image: image.png] >> > > > > > >> >> > > > > > >> On Thu, Nov 29, 2018 at 6:52 PM Karl Wright < >> daddywri@gmail.com >> > > >> > > > > wrote: >> > > > > > >> >> > > > > > >>> Hi Krishna, >> > > > > > >>> >> > > > > > >>> Please give us some background as to how you've deployed >> > > > ManifoldCF. >> > > > > > Are >> > > > > > >>> you using one of the examples? If so, which one? >> > > > > > >>> >> > > > > > >>> The detailed answer to your question is: the job must delete >> > all >> > > > > > >>> documents >> > > > > > >>> it indexed before it can be deleted. That is the typical >> way >> > > jobs >> > > > > > work. >> > > > > > >>> Thus, if you shut down the target of your output connection, >> > you >> > > > may >> > > > > be >> > > > > > >>> blocked in deleting your job. >> > > > > > >>> >> > > > > > >>> At that point, you can either (a) restart the target of your >> > > output >> > > > > > >>> connection, or (b) go to the "view" page for the output >> > > connection >> > > > > and >> > > > > > >>> click both of the "forget all documents" buttons on it. >> (b) is >> > > not >> > > > > > >>> recommended unless you really want to start over fresh on >> your >> > > > output >> > > > > > >>> index. >> > > > > > >>> >> > > > > > >>> Thanks, >> > > > > > >>> Karl >> > > > > > >>> >> > > > > > >>> >> > > > > > >>> On Thu, Nov 29, 2018 at 3:21 PM krishna agrawal < >> > > > > krish.agwl@gmail.com> >> > > > > > >>> wrote: >> > > > > > >>> >> > > > > > >>> > Hi We are facing issue of action button is not available >> > > > > > >>> > >> > > > > > >>> > [image: image.png] >> > > > > > >>> > >> > > > > > >>> > I have stop the agent process but still i am not able to >> > > remove >> > > > > the >> > > > > > >>> job >> > > > > > >>> > it say it >> > > > > > >>> > >> > > > > > >>> > there should be some way to forcefully restart and stop >> the >> > > > running >> > > > > > >>> > process ? >> > > > > > >>> > >> > > > > > >>> > Job 1542835910915 is busy; you must wait and/or shut it >> down >> > > > before >> > > > > > >>> > deleting it >> > > > > > >>> > but there is no job running, and i am seeing this message >> > from >> > > > > past 3 >> > > > > > >>> days. >> > > > > > >>> > >> > > > > > >>> > is there any ways to clear this? >> > > > > > >>> > >> > > > > > >>> > >> > > > > > >>> > Any help in this matter will be appreciated. >> > > > > > >>> > >> > > > > > >>> > Thanks, >> > > > > > >>> > Krishna A >> > > > > > >>> > >> > > > > > >>> >> > > > > > >> >> > > > > > >> > > > > >> > > > >> > > >> > >> > --00000000000014b196057bfb71a2--