From: Schnell Henrik
Date: Fri, 10 Apr 2015 15:24:48 +0200
Subject: Classification and WhitespaceAnalyzer problems
To: user@lucenenet.apache.org

Hello,

I noticed that you have added Lucene.Net.Classification implementations, so I thought I would try them on a large corpus to see how the different algorithms perform at classifying different texts.

I cloned the latest branch from GitHub and opened the solution. It built successfully, so I grabbed the DLLs and included them in my project. So far so good.

But then I noticed that I cannot instantiate an IndexWriter, because it needs an IndexWriterConfig, which in turn needs an Analyzer, and I could not find any Analyzer implementations, only the abstract Analyzer class. Then I noticed that WhitespaceAnalyzer.cs is present in the src\Lucene.Net.Core\Analysis directory but is not included in the Lucene.Net project in the solution, so it was not built into the DLLs.

So I tried to include all the necessary files for the WhitespaceAnalyzer, but they do not build; they seem to use an older Lucene API and are no longer compatible.

So my question is: how can I try the new Classification features? I am not sure I could write my own Analyzer. Are there any working implementations currently?

Thank you,
Henrik