Reply-To: user@lucenenet.apache.org
MIME-Version: 1.0
Sender: itamar.synhershko@gmail.com
In-Reply-To: <4606B227B7AF19498F107C2C59CC9849867390F1@Ex10DAG10-N1.apps4rent.net>
References:
 <4606B227B7AF19498F107C2C59CC9849867390F1@Ex10DAG10-N1.apps4rent.net>
From: Itamar Syn-Hershko
Date: Wed, 9 Nov 2016 18:40:15 -0800
X-Google-Sender-Auth: e5DapqqRlOD7mPobSKDTmaxhIKs
Message-ID:
Subject: Re: Lucene.NET 4.8 demo
To: Shad Storhaug
Cc: "user@lucenenet.apache.org", "dev@lucenenet.apache.org"
Content-Type: multipart/alternative; boundary=001a114b007a2c729c0540e94bae
archived-at: Thu, 10 Nov 2016 02:41:23 -0000

--001a114b007a2c729c0540e94bae
Content-Type: text/plain; charset=UTF-8

Great feedback Shad, thanks

Yes, let's add AnonymousAnalyzer to core - or see if we can just use the
Analyzer class for this. This indeed looks better - the demo was just a
quick'n'dirty something I wrote, but it was intended especially for finding
pain-points like you just did. Another pain-point is the LuceneVersion
argument that we currently have as a requirement - and I'm mid-work on
removing it and setting its value via a default.

Anything else that you can think of would probably make sense to add too :)

More comments inline.

--

Itamar Syn-Hershko
http://code972.com | @synhershko
Freelance Developer & Consultant
Lucene.NET committer and PMC member

On Wed, Nov 9, 2016 at 1:27 AM, Shad Storhaug wrote:

>
> Another thing I noticed is that we should probably move the
> TokenStreamComponents class so it is not a nested class of Analyzer to
> match the syntax more closely to Lucene.
>
>

Not sure, I think the current syntax makes things quite concise. Maybe a
helper/shortcut method would be helpful instead.

> A few thoughts on the demo:
>
> 1. Not everyone is familiar with a GitHub organization. Perhaps the demo
> should provide a list to choose from? Currently, if you type something that
> doesn't exist you get an exception. I had to do a Google search to come up
> with something, since my own username didn't work.
> One of the top results (before an actual list of organizations) was an
> API that can be utilized to read all of the GitHub organizations:
> https://developer.github.com/v3/orgs/
>

microsoft, facebook, github, apache - there are plenty. I will fix the
exceptions, and some WriteLine output and docs in the Readme will fix the
rest.

>
> 2. Maybe there should be some kind of estimate given on how long it will
> take to index the organization. When I ultimately chose "apache" it took
> several minutes to index the results, which I was not expecting.
>

Shouldn't take minutes. The demo is pulling the Readme HTML for each
repository, so that might be slowing things down, but I don't have the time
to add a progress bar :)

> 3. Perhaps the API key should be put into a separate (config) file rather
> than inline in the code. And you could pre-define the name of this file and
> put it into a .gitignore file. This would help prevent anyone from
> accidentally committing their API key to the Git repo.
>

Yup, maybe in the future :)

> 4. The search results seemed a bit underwhelming. Maybe there should be
> some kind of indicators how many results Lucene.Net had to sift through to
> come up with the short list. Or at least there should be some kind of
> explanation what is happening to put things into perspective. Think of a
> crime scene investigation. If the investigators enter the search criteria
> and it comes up with 50,000 suspects it would ruin their day. If it comes
> up with 3, then their work is much easier. But without some kind of
> indicator showing that 3 is better than 50,000, the latter seems much more
> impressive in a demo.
>

The total number of results is displayed - along with the 10 top-rated
results. I could prettify it and add highlighted snippets (which would be a
nice addition to the demo!), and give more context etc - but as I said this
is a quick'n'dirty job. I will probably do this later on to try and find
more pain points / improvements we could make to the API.
Contributions welcome.

> 5. Perhaps there should be some way to reset the index? I entered another
> organization to test my updates to the code and it added that
> organization's results to the original index, which I wasn't expecting.
>

That should be easy to do.

>
>
> Thanks,
> Shad Storhaug (NightOwl888)
>
>
> -----Original Message-----
> From: itamar.synhershko@gmail.com [mailto:itamar.synhershko@gmail.com] On
> Behalf Of Itamar Syn-Hershko
> Sent: Wednesday, November 9, 2016 6:45 AM
> To: dev@lucenenet.apache.org; user@lucenenet.apache.org
> Subject: Lucene.NET 4.8 demo
>
> Hey folks,
>
> I just pushed a working demo for Lucene.NET 4.8 using the latest bits to
> index and search public repositories on github. Check it out:
> https://github.com/synhershko/LuceneNetDemo
>
> I also recorded a Channel 9 video walking through the demo - I will post
> it here again as soon as it's released on the nets.
>
> This should clarify some mysteries around the new-ish API and hopefully
> drive confidence in what we consider a stable beta release.
>
> Cheers,
>
> --
>
> Itamar Syn-Hershko
> http://code972.com | @synhershko
> Freelance Developer & Consultant Lucene.NET committer and PMC member
>

--001a114b007a2c729c0540e94bae--
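Point 3 in the thread above (keeping the API key out of the source and in a
git-ignored file) could look roughly like the sketch below. It is written in
Python only to keep it short and is not the demo's own code. The file name
apikey.local.json and the "api_key" field are assumptions made up for
illustration; the idea is that this one file name is listed in .gitignore
and the program reads the key from it at startup.

```python
import json
import tempfile
from pathlib import Path

# Hypothetical file name (an assumption, not taken from the demo).
# It would be listed in .gitignore so the key is never committed.
CONFIG_FILE = "apikey.local.json"


def load_api_key(path):
    """Return the value stored under "api_key" in a JSON file.

    Returns None when the file does not exist, so the caller can fall
    back to unauthenticated access or print a helpful message.
    """
    config_path = Path(path)
    if not config_path.exists():
        return None
    with config_path.open(encoding="utf-8") as handle:
        return json.load(handle).get("api_key")


def demo():
    # Write a throwaway config into a temp directory and read it back,
    # then try a path that does not exist.
    with tempfile.TemporaryDirectory() as tmp:
        config_path = Path(tmp) / CONFIG_FILE
        config_path.write_text(
            json.dumps({"api_key": "not-a-real-key"}), encoding="utf-8"
        )
        found = load_api_key(config_path)
        missing = load_api_key(Path(tmp) / "does-not-exist.json")
    return found, missing
```

Returning None for a missing file, rather than raising, is one possible
choice: it lets a fresh clone run without any setup while still picking up
a key when the local file is present.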