Return-Path: X-Original-To: apmail-lucene-solr-user-archive@minotaur.apache.org Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 358E110C82 for ; Tue, 18 Mar 2014 06:19:11 +0000 (UTC) Received: (qmail 70079 invoked by uid 500); 18 Mar 2014 06:19:05 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 69671 invoked by uid 500); 18 Mar 2014 06:19:04 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 69655 invoked by uid 99); 18 Mar 2014 06:19:01 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 18 Mar 2014 06:19:01 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of arafalov@gmail.com designates 209.85.192.177 as permitted sender) Received: from [209.85.192.177] (HELO mail-pd0-f177.google.com) (209.85.192.177) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 18 Mar 2014 06:18:57 +0000 Received: by mail-pd0-f177.google.com with SMTP id y10so6576566pdj.22 for ; Mon, 17 Mar 2014 23:18:35 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding; bh=Zw3F4Pi62WUi1JQei9/JdMZHaWuXc97LHAC+0OCV/ww=; b=pvBBrH32YxCaeAzlLXiRBSd6Ye1OnWgID2jDqvYgWtA0hCuZsZFkBLGRT5wjlfdkDd juQeBPsZQbWDbJNvVibtx5rZXvHW0kQ5DzMZVcefz8qeKAQ6NkJ/5H+n3WbyFQgI9PR6 ObIjNLAELKVWEV3v7JvurvzH6K/qzv2fSlrTb5faOsLKnWEVZ5iQSjz/CXkmQZex0Yw4 bUNCHsbnZwHPXP+1Zt2fPtUX1CcMUkGln1TywyyueUzBfBdJYMnW+4jl3MdgKsfstOcE lVDWJtiDhBT1hVFvxGMqK05tgYXoo0pPRRkTSw1ld5HYIKCgROGZPMEFJcG2lOEDqZLe uxTw== X-Received: by 10.66.50.166 with SMTP id d6mr224513pao.88.1395123515347; Mon, 17 Mar 2014 23:18:35 -0700 (PDT) MIME-Version: 1.0 Received: by 10.66.50.10 with HTTP; Mon, 17 Mar 2014 23:17:55 -0700 (PDT) In-Reply-To: <1395122140.194443.21291.24191@mail.rambler.ru> References: <1395122140.194443.21291.24191@mail.rambler.ru> From: Alexandre Rafalovitch Date: Tue, 18 Mar 2014 13:17:55 +0700 Message-ID: Subject: Re: Send many files to update/extract To: solr-user@lucene.apache.org, =?UTF-8?B?0JDQu9C10LrRgdCw0L3QtNGAINCS0LDQvdC00YvRiNC10LI=?= Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org HttpSolrServer allows to send multiple documents at once. But they need to be extracted/converted on the client. However, if you know you will be sending a lot of documents to Solr, you are better off to run Tika locally on the client (or as a standalone network server). A lot more performant. I am not sure if ExtractingRequestHandler takes multipart MIME format, but that would be the thing to check if you still want to process on the server. Regards, Alex. Personal website: http://www.outerthoughts.com/ LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch - Time is the quality of nature that keeps events from happening all at once. Lately, it doesn't seem to be working. (Anonymous - via GTD book) On Tue, Mar 18, 2014 at 12:55 PM, =D0=90=D0=BB=D0=B5=D0=BA=D1=81=D0=B0=D0= =BD=D0=B4=D1=80 =D0=92=D0=B0=D0=BD=D0=B4=D1=8B=D1=88=D0=B5=D0=B2 wrote: > Who knows how to index a lot of files with ExtractingRequestHandler using= a > single query?