lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Robert Muir <rcm...@gmail.com>
Subject Re: [jira] Commented: (LUCENE-1628) Persian Analyzer
Date Tue, 14 Jul 2009 21:10:29 GMT
its not for laziness that this doesn't have new api, the problem is
that it shares code with ArabicAnalyzer.

at the time I submitted it you could not mix APIs. Let me know if you
need a new patch, but I agree it might be best to do all of
contrib/analysis at once...

On Tue, Jul 14, 2009 at 3:11 PM, Mark Miller<markrmiller@gmail.com> wrote:
> Fine with me.
>
> On Tue, Jul 14, 2009 at 2:25 PM, Michael McCandless (JIRA) <jira@apache.org>
> wrote:
>>
>>    [
>> https://issues.apache.org/jira/browse/LUCENE-1628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12731017#action_12731017
>> ]
>>
>> Michael McCandless commented on LUCENE-1628:
>> --------------------------------------------
>>
>> I think we should go ahead and commit this and cutover to the new API as a
>> separate step?
>>
>> > Persian Analyzer
>> > ----------------
>> >
>> >                 Key: LUCENE-1628
>> >                 URL: https://issues.apache.org/jira/browse/LUCENE-1628
>> >             Project: Lucene - Java
>> >          Issue Type: New Feature
>> >          Components: contrib/analyzers
>> >            Reporter: Robert Muir
>> >            Assignee: Mark Miller
>> >            Priority: Minor
>> >             Fix For: 2.9
>> >
>> >         Attachments: LUCENE-1628.patch, LUCENE-1628.patch
>> >
>> >
>> > A simple persian analyzer.
>> > i measured trec scores with the benchmark package below against
>> > http://ece.ut.ac.ir/DBRG/Hamshahri/ :
>> > SimpleAnalyzer:
>> > SUMMARY
>> >   Search Seconds:         0.012
>> >   DocName Seconds:        0.020
>> >   Num Points:           981.015
>> >   Num Good Points:       33.738
>> >   Max Good Points:       36.185
>> >   Average Precision:      0.374
>> >   MRR:                    0.667
>> >   Recall:                 0.905
>> >   Precision At 1:         0.585
>> >   Precision At 2:         0.531
>> >   Precision At 3:         0.513
>> >   Precision At 4:         0.496
>> >   Precision At 5:         0.486
>> >   Precision At 6:         0.487
>> >   Precision At 7:         0.479
>> >   Precision At 8:         0.465
>> >   Precision At 9:         0.458
>> >   Precision At 10:        0.460
>> >   Precision At 11:        0.453
>> >   Precision At 12:        0.453
>> >   Precision At 13:        0.445
>> >   Precision At 14:        0.438
>> >   Precision At 15:        0.438
>> >   Precision At 16:        0.438
>> >   Precision At 17:        0.429
>> >   Precision At 18:        0.429
>> >   Precision At 19:        0.419
>> >   Precision At 20:        0.415
>> > PersianAnalyzer:
>> > SUMMARY
>> >   Search Seconds:         0.004
>> >   DocName Seconds:        0.011
>> >   Num Points:           987.692
>> >   Num Good Points:       36.123
>> >   Max Good Points:       36.185
>> >   Average Precision:      0.481
>> >   MRR:                    0.833
>> >   Recall:                 0.998
>> >   Precision At 1:         0.754
>> >   Precision At 2:         0.715
>> >   Precision At 3:         0.646
>> >   Precision At 4:         0.646
>> >   Precision At 5:         0.631
>> >   Precision At 6:         0.621
>> >   Precision At 7:         0.593
>> >   Precision At 8:         0.577
>> >   Precision At 9:         0.573
>> >   Precision At 10:        0.566
>> >   Precision At 11:        0.572
>> >   Precision At 12:        0.562
>> >   Precision At 13:        0.554
>> >   Precision At 14:        0.549
>> >   Precision At 15:        0.542
>> >   Precision At 16:        0.538
>> >   Precision At 17:        0.533
>> >   Precision At 18:        0.527
>> >   Precision At 19:        0.525
>> >   Precision At 20:        0.518
>>
>> --
>> This message is automatically generated by JIRA.
>> -
>> You can reply to this email to add a comment to the issue online.
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
>> For additional commands, e-mail: java-dev-help@lucene.apache.org
>>
>
>
>
> --
> --
> - Mark
>
> http://www.lucidimagination.com
>
>



-- 
Robert Muir
rcmuir@gmail.com

---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message