From user-return-5559-archive-asf-public=cust-asf.ponee.io@manifoldcf.apache.org Wed Nov 21 10:05:37 2018 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by mx-eu-01.ponee.io (Postfix) with SMTP id AAE7D180675 for ; Wed, 21 Nov 2018 10:05:36 +0100 (CET) Received: (qmail 47375 invoked by uid 500); 21 Nov 2018 09:05:35 -0000 Mailing-List: contact user-help@manifoldcf.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@manifoldcf.apache.org Delivered-To: mailing list user@manifoldcf.apache.org Received: (qmail 47307 invoked by uid 99); 21 Nov 2018 09:05:35 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 21 Nov 2018 09:05:35 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 54579190B1D for ; Wed, 21 Nov 2018 09:05:35 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.343 X-Spam-Level: X-Spam-Status: No, score=0.343 tagged_above=-999 required=6.31 tests=[DKIMWL_WL_MED=-1.459, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=0.001, RCVD_IN_MSPIKE_WL=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd3-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id 9vgypMO37s-t for ; Wed, 21 Nov 2018 09:05:33 +0000 (UTC) Received: from mail-yw1-f67.google.com (mail-yw1-f67.google.com [209.85.161.67]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 9DFE05F1E3 for ; Wed, 21 Nov 2018 09:05:33 +0000 (UTC) Received: by mail-yw1-f67.google.com with SMTP id h193so716458ywc.4 for ; Wed, 21 Nov 2018 01:05:33 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to; bh=FumiG/5Mwcua9XOsFQUGjGmxdArP1wAqbCYOZGnqbW4=; b=S6w9/f+YVHjC3KjyiK+RzZB83pHxVepm8dBwxXZpJ8yvhfZQNRW5yLcCYhA4UKUK3k LH6Ly0JV0sYxH/XJmNnHnMcDZS6j3DLmKHcGrpvQRIg9aIDpc7kmjCsCn+eadjD+DMi/ CmoCmP3HlHynAd2KobldqU7nxz63RP5iPctJVaWaxe2E36zTS6J1bc64MbgPpt3yir8T 4VskNRRqA+ITyx0eGuQTzLYWWbc4Yclfu1N3BiQE7KnKcrPTfmIE6fXP5dTduJsjoMat YTatE3LyKwSjXP+Sx8FsFlI7eJ0Vu0Av16bcknzxaXfXmCbFFQleY6OC2A/7t9p0B124 T7zQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to; bh=FumiG/5Mwcua9XOsFQUGjGmxdArP1wAqbCYOZGnqbW4=; b=dL4KLWjPNdQKTow/a9TlH2ZgArEHngxv4uUaB8kPcZwnmoLbRfk1linT0aeg9R3tJr 09ynuxmWv8pAcl6dhAGxKVCMtRX4jE8dUna0xm4lURr6UVVrH+baVx59ylmqDzsPrCu6 /y17ahX/aWTPXI0FJ3hM4BigsmvimEbnfcyOr0rvVxTgQCGoBEjOVPp0w1UPPBzfRRsc SAfC1CfiTcro2Quo9ymKqRsTnHlqM/9oC2cYeCH2vxWNCu6epBTYhCrzX4PIMQAB7uTV h0Kt7BcdhO5Is/k7g3NhkxQ/p3ER6VTN92Fb7PuPe8KrjvidzFZrygW6Rfxb8eg8G4Gx XEpQ== X-Gm-Message-State: AGRZ1gJFLaLO/yf8O8Yo1T7HMGscXxd62Mdb8BXiWcG+2AKSlvLaav7K ppF027MDL/DgTjSAqdvrMzCqfjcos1q0XKoKLyloTG3ZLd0= X-Google-Smtp-Source: AJdET5eudjdWBr6L17c9w9dg5xleaKFJ006jIRJkVhql5LWKKHzHaGPLfVLZO+Z+Slgiy/OAngFzF6NxHbmTquWL6Xo= X-Received: by 2002:a0d:dc87:: with SMTP id f129-v6mr5468969ywe.500.1542791132841; Wed, 21 Nov 2018 01:05:32 -0800 (PST) MIME-Version: 1.0 References: In-Reply-To: From: Furkan KAMACI Date: Wed, 21 Nov 2018 12:05:20 +0300 Message-ID: Subject: Re: Language Detection for the data To: user@manifoldcf.apache.org Content-Type: multipart/alternative; boundary="0000000000006f8eb2057b290d63" --0000000000006f8eb2057b290d63 Content-Type: text/plain; charset="UTF-8" Hi Nikita, First of all, OpenNLP is a transformation connector at ManifoldCF and should be enabled by default. It extracts named entities (people, locations and organizations) from document. You should download trained models to run OpenNLP connector. You can check here for such purpose: https://opennlp.apache.org/models.html Check here for a detailed explanation: https://github.com/ChalithaUdara/OpenNLP-Manifold-Connector Feel free to ask any questions when you try to integrate it. Also, you should explain the points if you cannot success to run it. Kind Regards, Furkan KAMACI On Wed, Nov 21, 2018 at 11:54 AM Karl Wright wrote: > Hi Nikita, > > Can you be more specific when you say "OpenNLP is not working"? All that > this connector does is integrate OpenNLP as a ManifoldCF transformer. It > uses a specific directory to deliver the models that OpenNLP uses to match > and extract content from documents. Thus, you can provide any models you > want that are compatible with the OpenNLP version we're including. > > Can you describe the steps you are taking and what you are seeing? > > On Wed, Nov 21, 2018 at 12:44 AM Nikita Ahuja > wrote: > >> Hi, >> >> I have query related to detect the language of the records/data which is >> going to be ingest in the Output Connector. >> >> OpenNLP connector is not working for the detection as per the user >> documentation, but this is not working appropriately. Please suggest is NLP >> has to be used if yes, then how it should be used or is there any other >> solution for this? >> >> -- >> Thanks and Regards, >> Nikita >> Email: nikita@smartshore.nl >> United Sources Service Pvt. Ltd. >> a "Smartshore" Company >> Mobile: +91 99 888 57720 >> http://www.smartshore.nl >> > --0000000000006f8eb2057b290d63 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
Hi Niki= ta,

First of all, OpenNLP is a transformation connector = at ManifoldCF and should be enabled by default.=C2=A0It extracts named enti= ties (people, locations and organizations) from document.

You should download trained models to run OpenNLP connector. You ca= n check here for such purpose:=C2=A0https://opennlp.apache.org/models.html

=
Check here for a detailed explanation:=C2=A0https://github.com/ChalithaUd= ara/OpenNLP-Manifold-Connector

Feel free to as= k any questions when you try to integrate it. Also, you should explain the = points if you cannot success to run it.

Kind Regar= ds,
Furkan KAMACI

On Wed, Nov 21, 2018 at 11:54 = AM Karl Wright <daddywri@gmail.com= > wrote:
Hi= Nikita,

Can you be more specific when you say "Ope= nNLP is not working"?=C2=A0 All that this connector does is integrate = OpenNLP as a ManifoldCF transformer.=C2=A0 It uses a specific directory to = deliver the models that OpenNLP uses to match and extract content from docu= ments.=C2=A0 Thus, you can provide any models you want that are compatible = with the OpenNLP version we're including.

Can you describe the s= teps you are taking and what you are seeing?

On Wed, Nov 21, 2018 at 12:44 AM Nikita Ahuja = <nikita@smarts= hore.nl> wrote:
Hi,

I have query related to detect the language of = the records/data which is going to be ingest in the Output Connector.
OpenNLP connector is not working for the detection as per the user docume= ntation, but this is not working appropriately. Please suggest is NLP has t= o be used if yes, then how it should be used or is there any other solution= for=C2=A0this?

--
Thanks and Regards,
Nikita
Email: nikita@smartshore.nl
United Sources Service Pvt. Ltd.
a "Smartshore" Company
Mobile: += 91 99 888 57720

http://www.smartshore.nl
--0000000000006f8eb2057b290d63--