From general-return-5044-apmail-lucene-general-archive=lucene.apache.org@lucene.apache.org Fri Mar 9 16:01:32 2018 Return-Path: X-Original-To: apmail-lucene-general-archive@www.apache.org Delivered-To: apmail-lucene-general-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5C96C18AF3 for ; Fri, 9 Mar 2018 16:01:32 +0000 (UTC) Received: (qmail 21123 invoked by uid 500); 9 Mar 2018 16:01:31 -0000 Delivered-To: apmail-lucene-general-archive@lucene.apache.org Received: (qmail 21062 invoked by uid 500); 9 Mar 2018 16:01:31 -0000 Mailing-List: contact general-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@lucene.apache.org Delivered-To: mailing list general@lucene.apache.org Received: (qmail 21045 invoked by uid 99); 9 Mar 2018 16:01:30 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 09 Mar 2018 16:01:30 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 1C70A1A08A4 for ; Fri, 9 Mar 2018 16:01:30 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.129 X-Spam-Level: X-Spam-Status: No, score=0.129 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, KAM_COUK=0.85, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_HELO_PASS=-0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=messagingengine.com Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id sx30WCdLtK-6 for ; Fri, 9 Mar 2018 16:01:28 +0000 (UTC) Received: from out2-smtp.messagingengine.com (out2-smtp.messagingengine.com [66.111.4.26]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id 1B58D5F51F for ; Fri, 9 Mar 2018 16:01:28 +0000 (UTC) Received: from compute5.internal (compute5.nyi.internal [10.202.2.45]) by mailout.nyi.internal (Postfix) with ESMTP id 3189020E75 for ; Fri, 9 Mar 2018 11:01:26 -0500 (EST) Received: from web3 ([10.202.2.213]) by compute5.internal (MEProxy); Fri, 09 Mar 2018 11:01:26 -0500 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d= messagingengine.com; h=content-transfer-encoding:content-type :date:from:in-reply-to:message-id:mime-version:references :subject:to:x-me-sender:x-me-sender:x-sasl-enc; s=fm2; bh=c279LZ cvFCOVcpjlipzpNlyMpjL1/UefvvuD/TwzbrU=; b=lBBwVNNL2UVM+gMiYEuptv 9OaOMnxLFBFV1pY9x8WGlD13vYQ+YAQFqbQcQmcLN7ECYhK986gvJVIE0uD2xe61 f8fF/dn516I2uKIVErt9SJKekJ+NoCDBsMW+1XMvCiwq8uIDeRQii/EDFV1LwJAX CulXaIR314lwsmu9OgPrz8TcDcztxDBqC1LPt0Jx7/8i4LlsrEm0HZ9E5AAshiwX X+2YMI+xtGxTq5xH7Y9Jz7uXpCWYUjSt5ZJHCRiaWPcmGo9nX3HfGXctq9Hq6q3F S6EVjqH0/iT9hlRAjsCF65BP1orceoZqibEftDOg4ztRxhRqyG+VxOZ6rVCfCdmg == X-ME-Sender: Received: by mailuser.nyi.internal (Postfix, from userid 99) id 0D82B9E4B3; Fri, 9 Mar 2018 11:01:26 -0500 (EST) Message-Id: <1520611285.2526097.1297472920.6780F377@webmail.messagingengine.com> From: Upayavira To: general@lucene.apache.org MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Type: text/plain; charset="utf-8" X-Mailer: MessagingEngine.com Webmail Interface - ajax-54087d22 In-Reply-To: <0110B4CA-088A-4AB4-A3C8-15D51AB40A35@alias-i.com> Date: Fri, 09 Mar 2018 16:01:25 +0000 References: <504f7506-3eef-f32b-a30e-88ca537caaf2@gmail.com> <0110B4CA-088A-4AB4-A3C8-15D51AB40A35@alias-i.com> Subject: Re: CLOB Hadoop and Solr I will unsubscribe you both now. Upayavira On Fri, 9 Mar 2018, at 2:05 PM, Bob Carpenter wrote: > Me, too. Can someone fix the unsubscribe mechanism? > > Thanks. > > > > On Mar 8, 2018, at 10:23 PM, john spooner wrote: > > > > I keep trying to unsubscribe but I am still getting endless emails. > > > > > > On 3/8/2018 10:30 AM, Jon Morisi wrote: > >> Hi, > >> I'm doing some preliminary investigation and am wondering if anyone can provide guidance. > >> I have a lot of CLOB data in an Oracle database. I also have a Hadoop cluster and am planning to install Solr (HDP Search). > >> > >> What would be the best way to use Solr for indexing this data? Sqoop to Hive and index that? Dump the clobs as individual txt files and index those? > >> > >> There seem to be a lot of options. Using the ClobTransformer directly on the Oracle DB is something I'd like to avoid. I'd rather move the data to Hadoop and manage my full-text indexing there. (I don't want to stress the DB with the indexing). > >> > >> Thanks, > >> Jon > >> > > >