Date: Sun, 29 Mar 2015 16:25:31 -0400
Subject: Re: Structured and Unstructured data indexing in SolrCloud
From: Jack Krupansky
To: solr-user@lucene.apache.org

The first step is to work out the queries that you wish to perform - that
will determine how the data should be organized in the Solr schema.

-- Jack Krupansky

On Sun, Mar 29, 2015 at 4:04 PM, Vijay Bhoomireddy <vijaya.bhoomireddy@whishworks.com> wrote:

> Hi,
>
> We have a requirement where both structured and unstructured data comes
> into the system. We need to index both of them and then enable search
> functionality on it. We are using SolrCloud on Hadoop platform. For
> structured data, we are planning to put the data into HBase and for
> unstructured, directly into HDFS.
>
> My question is how to index these sources under a single Solr core? Would
> that be possible to index both structured and unstructured data under a
> single core/collection in SolrCloud and then enable search functionality
> over that index?
>
> Thanks in advance.