Return-Path: X-Original-To: apmail-lucenenet-user-archive@www.apache.org Delivered-To: apmail-lucenenet-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 4B6A6174F1 for ; Sat, 11 Apr 2015 17:51:26 +0000 (UTC) Received: (qmail 78033 invoked by uid 500); 11 Apr 2015 17:51:21 -0000 Delivered-To: apmail-lucenenet-user-archive@lucenenet.apache.org Received: (qmail 77989 invoked by uid 500); 11 Apr 2015 17:51:21 -0000 Mailing-List: contact user-help@lucenenet.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@lucenenet.apache.org Delivered-To: mailing list user@lucenenet.apache.org Received: (qmail 77976 invoked by uid 99); 11 Apr 2015 17:51:20 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 11 Apr 2015 17:51:20 +0000 X-ASF-Spam-Status: No, hits=2.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS,TVD_FROM_1 X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of itamar.synhershko@gmail.com designates 209.85.220.174 as permitted sender) Received: from [209.85.220.174] (HELO mail-qk0-f174.google.com) (209.85.220.174) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 11 Apr 2015 17:51:16 +0000 Received: by qkx62 with SMTP id 62so88906455qkx.0 for ; Sat, 11 Apr 2015 10:50:11 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:date:message-id:subject :from:to:content-type; bh=jIMk8OY8jfoTOgvguoYfZEbKarLxS+siK+5sjywENms=; b=sIA9hXAn6Ltc7naL8dyL2W3XcjgNZN/pUSfo/skjSPmfnmhTtTxXlAnDvkbXMtieTq Cz0vudKSv94njW2RRIT193A37tR9yT0WTnvdPVFYK9YZS0lb9nTT7anws5qOiho6mFcp oSBSV2ccmZDao78kDN/8rGBaeg/Vu2oKFioHYSHooT+Z4WJIcr+e9t72ORYVQtF0koTy eclzbK1fmserB/75fJa2Lo/C8CyAFkPsVmnLR+9dkGz02Ln+Hqhqz3H5/2UGNen0bZTk 8pWRZ4kgEhqD1wqQnW6nTiB+Tv2khqI2ovd2JtTZWg6V1jOew2wCU42jtXwy2N30FexU gT4Q== MIME-Version: 1.0 X-Received: by 10.182.105.66 with SMTP id gk2mr6864322obb.76.1428774610056; Sat, 11 Apr 2015 10:50:10 -0700 (PDT) Sender: itamar.synhershko@gmail.com Received: by 10.60.48.8 with HTTP; Sat, 11 Apr 2015 10:50:10 -0700 (PDT) In-Reply-To: References: Date: Sat, 11 Apr 2015 20:50:10 +0300 X-Google-Sender-Auth: P79SmewAucGJpZaeUjufMrNnz0s Message-ID: Subject: Re: Classification and WhitespaceAnalyzer problems From: Itamar Syn-Hershko To: user@lucenenet.apache.org Content-Type: multipart/alternative; boundary=e89a8ff2513418bb61051376839a X-Virus-Checked: Checked by ClamAV on apache.org --e89a8ff2513418bb61051376839a Content-Type: text/plain; charset=UTF-8 This is still WIP, we haven't yet released the new Lucene.NET -- Itamar Syn-Hershko http://code972.com | @synhershko Freelance Developer & Consultant Lucene.NET committer and PMC member On Fri, Apr 10, 2015 at 4:24 PM, Schnell Henrik wrote: > Hello, > > I have noticed that you have added Lucene.Net.Classification > implementations, so I thought I would try them with a large corpus to see > how the different algorithms perform with classifying different texts. I > cloned the latest branch from github and opened the solution. I could build > it successfully, so I grabbed the dll's and included them in my project. So > far so good. > > But then I noticed that I cannot instantiate an IndexWriter because it > needs an IndexWriterConfig which needs an Analyzer and I could not find any > Analyzer implementations, only the abstract Analyzer class. > > Then I have noticed that the WhitespaceAnalyzer.cs is there in the > src\Lucene.Net.Core\Analysis directory but it is not included in the > Lucene.Net project that is in the solution, so it was not built into the > dll's. Ok, so I tried to include all the neccessary files for the > WhitespaceAnalyzer, but they don't build, it seems they use an older lucene > api and are not compatible anymore. > > So my question is: how could I try the new Classification features? I'm not > sure I could write an own Analyzer. Are there any working implementations > currently? > > Thank you, > Henrik > --e89a8ff2513418bb61051376839a--