Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id A362E200C45 for ; Tue, 28 Mar 2017 21:44:47 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id A1F11160B6B; Tue, 28 Mar 2017 19:44:47 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id E8973160B89 for ; Tue, 28 Mar 2017 21:44:46 +0200 (CEST) Received: (qmail 61185 invoked by uid 500); 28 Mar 2017 19:44:45 -0000 Mailing-List: contact dev-help@community.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@community.apache.org Delivered-To: mailing list dev@community.apache.org Received: (qmail 60859 invoked by uid 99); 28 Mar 2017 19:44:45 -0000 Received: from mail-relay.apache.org (HELO mail-relay.apache.org) (140.211.11.15) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 28 Mar 2017 19:44:45 +0000 Received: from mail-qt0-f176.google.com (mail-qt0-f176.google.com [209.85.216.176]) by mail-relay.apache.org (ASF Mail Server at mail-relay.apache.org) with ESMTPSA id EC5101A05A7 for ; Tue, 28 Mar 2017 19:44:44 +0000 (UTC) Received: by mail-qt0-f176.google.com with SMTP id n21so73620647qta.1 for ; Tue, 28 Mar 2017 12:44:44 -0700 (PDT) X-Gm-Message-State: AFeK/H0XsoOGwMcAeOLPDTNU68I6YQKe2YetrI9UcGSJONmVLr4eoomThnqWb0Ne/QP4k0cbA7QZV+CQFlBPhg== X-Received: by 10.237.53.48 with SMTP id a45mr26961861qte.286.1490730284013; Tue, 28 Mar 2017 12:44:44 -0700 (PDT) MIME-Version: 1.0 References: <7e0cdb00-b89a-958b-8196-2bd54619d370@shanecurcuru.org> In-Reply-To: From: Grant Ingersoll Date: Tue, 28 Mar 2017 19:44:33 +0000 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: Using Solr/Lucene to provide our own site search? To: dev@community.apache.org Content-Type: multipart/alternative; boundary=001a11c000ae08e1c9054bcfb188 archived-at: Tue, 28 Mar 2017 19:44:47 -0000 --001a11c000ae08e1c9054bcfb188 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable https://github.com/lucidworks/searchhub has all the crawlers/setup already setup for a number of ASF projects (email, Github, websites, wikis, Stack Overflow) and a pretty easy framework for specifying others (I looked at the FOAF stuff, but it wasn't consistent enough to automate). Lucidworks (my employer/company) is happy to donate licenses of Fusion, our commercial product on top of Solr and Spark, if the ASF will provide hardware. Or, if someone will put up the Pull Request to add all the projects, we can host it, as we already have a multinode cluster setup and we have read only APIs available, so it would just take UI integration. -Grant On Tue, Mar 28, 2017 at 1:16 PM Dave Fisher wrote: > Hi - > > I=E2=80=99ve got knowledge too and I also have some ideas I am thinking a= bout. I > also have some bandwidth now that I am going into job search mode. > > I think an important step is to think through what the taxonomy should be > as that will help inform the common schema. > > Regards, > Dave > > > On Mar 28, 2017, at 9:34 AM, Alexandre Rafalovitch > wrote: > > > > Just to provide links: > > http://jirasearch.mikemccandless.com/search.py?index=3Djira - Lucene > > (not Solr) based search of issues for several projects. Very deep > > understanding of the domain. Adding more is probably not that hard. > > http://search-lucene.com/ - Solr-based, search over mailing lists, > > wikis, issues, etc for a bunch (a larger number) of projects. Run by > > Sematext (Otis' company) > > http://find.searchhub.org/ - commercial LucidWorks' Fusion-based IIRC > > (though some bits are open-source). Lots of projects and sources. But > > it feels a bit dogfoody, so the attention it gets is uneven. > > > > So, I think Nick/Chris' point is valid that the definition of the > > project may need to take this into account and it is entirely possible > > that expanding these (if the project owners would agree) might be > > actually the easiest path forward. > > > > > > Regards, > > Alex. > > ---- > > http://www.solr-start.com/ - Resources for Solr users, new and > experienced > > > > > > On 28 March 2017 at 12:20, Chris Mattmann wrote: > >> +1 I think that minimizing the requirement to run specific > infrastructure, and trying > >> to convince those already running such services I believe like Otis an= d > Grant/others > >> from Lucid are optimal choices. > >> > >> Cheers, > >> Chris > >> > >> > >> > >> > >> On 3/28/17, 12:19 PM, "Nick Burch" wrote: > >> > >> On Tue, 28 Mar 2017, Shane Curcuru wrote: > >>> As has been pondered many times (recently by Rich and Sally, among ma= ny > >>> others), it would be really nice to better help newcomers find the > right > >>> information at the ASF or our projects. We have one of the industry'= s > >>> leading search tools right here: why aren't we using it, and even > >>> better, semi-consistently across apache.org sites that want to? > >> > >> Some Apache projects do have externally hosted instances of SOLR > indexing > >> and searching their project sites. Tika and Lucene are two such > sites, off > >> the top of my head. Would asking the committers maintaining those > about > >> adding some more sites be an option? > >> > >> Nick > >> > >> -------------------------------------------------------------------= -- > >> To unsubscribe, e-mail: dev-unsubscribe@community.apache.org > >> For additional commands, e-mail: dev-help@community.apache.org > >> > >> > >> > >> > >> > >> --------------------------------------------------------------------- > >> To unsubscribe, e-mail: dev-unsubscribe@community.apache.org > >> For additional commands, e-mail: dev-help@community.apache.org > >> > > > > --------------------------------------------------------------------- > > To unsubscribe, e-mail: dev-unsubscribe@community.apache.org > > For additional commands, e-mail: dev-help@community.apache.org > > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: dev-unsubscribe@community.apache.org > For additional commands, e-mail: dev-help@community.apache.org > > --001a11c000ae08e1c9054bcfb188--