<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
<title>java-dev@lucene.apache.org Archives</title>
<link rel="self" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/?format=atom"/>
<link href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/"/>
<id>http://mail-archives.apache.org/mod_mbox/lucene-java-dev/</id>
<updated>2009-12-09T18:29:16Z</updated>
<entry>
<title>[jira] Commented: (LUCENE-1606) Automaton Query/Filter (scalable regex)</title>
<author><name>&quot;Robert Muir (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c1563489283.1260381258281.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c1563489283-1260381258281-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T17:54:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

    [ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&amp;focusedCommentId=12788189#action_12788189
] 

Robert Muir commented on LUCENE-1606:
-------------------------------------

btw, Thanks to Uwe, Mike, Mark for all the help here!


&gt; Automaton Query/Filter (scalable regex)
&gt; ---------------------------------------
&gt;
&gt;                 Key: LUCENE-1606
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-1606
&gt;             Project: Lucene - Java
&gt;          Issue Type: New Feature
&gt;          Components: Search
&gt;            Reporter: Robert Muir
&gt;            Assignee: Robert Muir
&gt;            Priority: Minor
&gt;             Fix For: Flex Branch
&gt;
&gt;         Attachments: automaton.patch, automatonMultiQuery.patch, automatonmultiqueryfuzzy.patch,
automatonMultiQuerySmart.patch, automatonWithWildCard.patch, automatonWithWildCard2.patch,
BenchWildcard.java, LUCENE-1606-flex.patch, LUCENE-1606-flex.patch, LUCENE-1606-flex.patch,
LUCENE-1606-flex.patch, LUCENE-1606-flex.patch, LUCENE-1606-flex.patch, LUCENE-1606-flex.patch,
LUCENE-1606-flex.patch, LUCENE-1606-flex.patch, LUCENE-1606-flex.patch, LUCENE-1606-flex.patch,
LUCENE-1606-flex.patch, LUCENE-1606.patch, LUCENE-1606.patch, LUCENE-1606.patch, LUCENE-1606.patch,
LUCENE-1606.patch, LUCENE-1606.patch, LUCENE-1606.patch, LUCENE-1606.patch, LUCENE-1606.patch,
LUCENE-1606.patch, LUCENE-1606.patch, LUCENE-1606.patch, LUCENE-1606.patch, LUCENE-1606.patch,
LUCENE-1606.patch, LUCENE-1606_nodep.patch
&gt;
&gt;
&gt; Attached is a patch for an AutomatonQuery/Filter (name can change if its not suitable).
&gt; Whereas the out-of-box contrib RegexQuery is nice, I have some very large indexes (100M+
unique tokens) where queries are quite slow, 2 minutes, etc. Additionally all of the existing
RegexQuery implementations in Lucene are really slow if there is no constant prefix. This
implementation does not depend upon constant prefix, and runs the same query in 640ms.
&gt; Some use cases I envision:
&gt;  1. lexicography/etc on large text corpora
&gt;  2. looking for things such as urls where the prefix is not constant (http:// or ftp://)
&gt; The Filter uses the BRICS package (http://www.brics.dk/automaton/) to convert regular
expressions into a DFA. Then, the filter "enumerates" terms in a special way, by using the
underlying state machine. Here is my short description from the comments:
&gt;      The algorithm here is pretty basic. Enumerate terms but instead of a binary accept/reject
do:
&gt;       
&gt;      1. Look at the portion that is OK (did not enter a reject state in the DFA)
&gt;      2. Generate the next possible String and seek to that.
&gt; the Query simply wraps the filter with ConstantScoreQuery.
&gt; I did not include the automaton.jar inside the patch but it can be downloaded from http://www.brics.dk/automaton/
and is BSD-licensed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Resolved: (LUCENE-1606) Automaton Query/Filter (scalable regex)</title>
<author><name>&quot;Robert Muir (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c392851865.1260380898249.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c392851865-1260380898249-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T17:48:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

     [ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Robert Muir resolved LUCENE-1606.
---------------------------------

       Resolution: Fixed
    Fix Version/s:     (was: 3.1)
                   Flex Branch

Committed revision 888891.

&gt; Automaton Query/Filter (scalable regex)
&gt; ---------------------------------------
&gt;
&gt;                 Key: LUCENE-1606
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-1606
&gt;             Project: Lucene - Java
&gt;          Issue Type: New Feature
&gt;          Components: Search
&gt;            Reporter: Robert Muir
&gt;            Assignee: Robert Muir
&gt;            Priority: Minor
&gt;             Fix For: Flex Branch
&gt;
&gt;         Attachments: automaton.patch, automatonMultiQuery.patch, automatonmultiqueryfuzzy.patch,
automatonMultiQuerySmart.patch, automatonWithWildCard.patch, automatonWithWildCard2.patch,
BenchWildcard.java, LUCENE-1606-flex.patch, LUCENE-1606-flex.patch, LUCENE-1606-flex.patch,
LUCENE-1606-flex.patch, LUCENE-1606-flex.patch, LUCENE-1606-flex.patch, LUCENE-1606-flex.patch,
LUCENE-1606-flex.patch, LUCENE-1606-flex.patch, LUCENE-1606-flex.patch, LUCENE-1606-flex.patch,
LUCENE-1606-flex.patch, LUCENE-1606.patch, LUCENE-1606.patch, LUCENE-1606.patch, LUCENE-1606.patch,
LUCENE-1606.patch, LUCENE-1606.patch, LUCENE-1606.patch, LUCENE-1606.patch, LUCENE-1606.patch,
LUCENE-1606.patch, LUCENE-1606.patch, LUCENE-1606.patch, LUCENE-1606.patch, LUCENE-1606.patch,
LUCENE-1606.patch, LUCENE-1606_nodep.patch
&gt;
&gt;
&gt; Attached is a patch for an AutomatonQuery/Filter (name can change if its not suitable).
&gt; Whereas the out-of-box contrib RegexQuery is nice, I have some very large indexes (100M+
unique tokens) where queries are quite slow, 2 minutes, etc. Additionally all of the existing
RegexQuery implementations in Lucene are really slow if there is no constant prefix. This
implementation does not depend upon constant prefix, and runs the same query in 640ms.
&gt; Some use cases I envision:
&gt;  1. lexicography/etc on large text corpora
&gt;  2. looking for things such as urls where the prefix is not constant (http:// or ftp://)
&gt; The Filter uses the BRICS package (http://www.brics.dk/automaton/) to convert regular
expressions into a DFA. Then, the filter "enumerates" terms in a special way, by using the
underlying state machine. Here is my short description from the comments:
&gt;      The algorithm here is pretty basic. Enumerate terms but instead of a binary accept/reject
do:
&gt;       
&gt;      1. Look at the portion that is OK (did not enter a reject state in the DFA)
&gt;      2. Generate the next possible String and seek to that.
&gt; the Query simply wraps the filter with ConstantScoreQuery.
&gt; I did not include the automaton.jar inside the patch but it can be downloaded from http://www.brics.dk/automaton/
and is BSD-licensed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Commented: (LUCENE-2139) Cleanup and Improvement of Spatial Contrib</title>
<author><name>&quot;Simon Willnauer (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c1406192845.1260379038912.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c1406192845-1260379038912-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T17:17:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

    [ https://issues.apache.org/jira/browse/LUCENE-2139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&amp;focusedCommentId=12788169#action_12788169
] 

Simon Willnauer commented on LUCENE-2139:
-----------------------------------------

Chris, I have a couple of issues with your patch. It seems that you renamed a couple of files
which doesn't work well with patches for some reason. I will comment on this later again.
The other thing is that you use 1.6 classes like [http://java.sun.com/javase/6/docs/api/java/util/concurrent/LinkedBlockingDeque.html|LinkedBlockingDeque]
but we should try to keep the contrib 1.5 dependent.
could you fix those 1.6 references please.

simon

&gt; Cleanup and Improvement of Spatial Contrib
&gt; ------------------------------------------
&gt;
&gt;                 Key: LUCENE-2139
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2139
&gt;             Project: Lucene - Java
&gt;          Issue Type: Improvement
&gt;          Components: contrib/spatial
&gt;    Affects Versions: 3.1
&gt;            Reporter: Chris Male
&gt;            Assignee: Simon Willnauer
&gt;         Attachments: LUCENE-2139.patch
&gt;
&gt;
&gt; The current spatial contrib can be improved by adding documentation, tests, removing
unused classes and code, repackaging the classes and improving the performance of the distance
filtering.  The latter will incorporate the multi-threaded functionality introduced in LUCENE-1732.
 
&gt; Other improvements involve adding better support for different distance units, different
distance calculators and different data formats (whether it be lat/long fields, geohashes,
or something else in the future).
&gt; Patch to be added soon.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Updated: (LUCENE-2123) Move FuzzyQuery rewrite as separate RewriteMode into MTQ, was: Highlighter fails to highlight FuzzyQuery</title>
<author><name>&quot;Uwe Schindler (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c36769497.1260378918315.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c36769497-1260378918315-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T17:15:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

     [ https://issues.apache.org/jira/browse/LUCENE-2123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Uwe Schindler updated LUCENE-2123:
----------------------------------

    Attachment: LUCENE-2123.patch
                LUCENE-2123-flex.patch

Here the final patches with updated JavaDocs. I want to apply them in this form to trunk and
flex. If nobody objects I will do this tomorrow.

WIth this patch, FuzzyQuery will always highlight correctly, as it can be switched to boolean
query rewrite mode.

&gt; Move FuzzyQuery rewrite as separate RewriteMode into MTQ, was: Highlighter fails to highlight
FuzzyQuery
&gt; --------------------------------------------------------------------------------------------------------
&gt;
&gt;                 Key: LUCENE-2123
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2123
&gt;             Project: Lucene - Java
&gt;          Issue Type: Bug
&gt;          Components: contrib/highlighter
&gt;    Affects Versions: Flex Branch
&gt;            Reporter: Uwe Schindler
&gt;            Assignee: Uwe Schindler
&gt;             Fix For: Flex Branch
&gt;
&gt;         Attachments: LUCENE-2123-flex.patch, LUCENE-2123-flex.patch, LUCENE-2123-flex.patch,
LUCENE-2123-flex.patch, LUCENE-2123.patch
&gt;
&gt;
&gt; As FuzzyQuery does not allow to change the rewrite mode, highlighter fails with UOE in
flex since LUCENE-2110, because it changes the rewrite mode to Boolean query. The fix is:
Allow MTQ to change rewrite method and make FUZZY_REWRITE public for that.
&gt; The rewrite mode will live in MTQ as TOP_TERMS_SCORING_BOOLEAN_REWRITE. Also the code
will be refactored to make heavy reuse of term enumeration code and only plug in the PQ for
filtering the top terms.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Updated: (LUCENE-2123) Move FuzzyQuery rewrite as separate RewriteMode into MTQ, was: Highlighter fails to highlight FuzzyQuery</title>
<author><name>&quot;Uwe Schindler (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c814915282.1260376281048.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c814915282-1260376281048-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T16:31:21Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

     [ https://issues.apache.org/jira/browse/LUCENE-2123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Uwe Schindler updated LUCENE-2123:
----------------------------------

    Description: 
As FuzzyQuery does not allow to change the rewrite mode, highlighter fails with UOE in flex
since LUCENE-2110, because it changes the rewrite mode to Boolean query. The fix is: Allow
MTQ to change rewrite method and make FUZZY_REWRITE public for that.

The rewrite mode will live in MTQ as TOP_TERMS_SCORING_BOOLEAN_REWRITE. Also the code will
be refactored to make heavy reuse of term enumeration code and only plug in the PQ for filtering
the top terms.

  was:
As FuzzyQuery does not allow to change the rewrite mode, highlighter fails with UOE in flex
since LUCENE-2110, because it changes the rewrite mode to Boolean query. The fix is: Allow
MTQ to change rewrite method and make FUZZY_REWRITE public for that.


        Summary: Move FuzzyQuery rewrite as separate RewriteMode into MTQ, was: Highlighter
fails to highlight FuzzyQuery  (was: Highlighter fails to highlight FuzzyQuery)

Trunk patch comes soon.

&gt; Move FuzzyQuery rewrite as separate RewriteMode into MTQ, was: Highlighter fails to highlight
FuzzyQuery
&gt; --------------------------------------------------------------------------------------------------------
&gt;
&gt;                 Key: LUCENE-2123
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2123
&gt;             Project: Lucene - Java
&gt;          Issue Type: Bug
&gt;          Components: contrib/highlighter
&gt;    Affects Versions: Flex Branch
&gt;            Reporter: Uwe Schindler
&gt;            Assignee: Uwe Schindler
&gt;             Fix For: Flex Branch
&gt;
&gt;         Attachments: LUCENE-2123-flex.patch, LUCENE-2123-flex.patch, LUCENE-2123-flex.patch
&gt;
&gt;
&gt; As FuzzyQuery does not allow to change the rewrite mode, highlighter fails with UOE in
flex since LUCENE-2110, because it changes the rewrite mode to Boolean query. The fix is:
Allow MTQ to change rewrite method and make FUZZY_REWRITE public for that.
&gt; The rewrite mode will live in MTQ as TOP_TERMS_SCORING_BOOLEAN_REWRITE. Also the code
will be refactored to make heavy reuse of term enumeration code and only plug in the PQ for
filtering the top terms.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Commented: (LUCENE-2139) Cleanup and Improvement of Spatial Contrib</title>
<author><name>&quot;Chris Male (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c830994414.1260373758343.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c830994414-1260373758343-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T15:49:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

    [ https://issues.apache.org/jira/browse/LUCENE-2139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&amp;focusedCommentId=12788130#action_12788130
] 

Chris Male commented on LUCENE-2139:
------------------------------------

I have also included LUCENE-1934 in this, and tried to include LUCENE-1930 but was unable
to get 1930 to work.

&gt; Cleanup and Improvement of Spatial Contrib
&gt; ------------------------------------------
&gt;
&gt;                 Key: LUCENE-2139
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2139
&gt;             Project: Lucene - Java
&gt;          Issue Type: Improvement
&gt;          Components: contrib/spatial
&gt;    Affects Versions: 3.1
&gt;            Reporter: Chris Male
&gt;            Assignee: Simon Willnauer
&gt;         Attachments: LUCENE-2139.patch
&gt;
&gt;
&gt; The current spatial contrib can be improved by adding documentation, tests, removing
unused classes and code, repackaging the classes and improving the performance of the distance
filtering.  The latter will incorporate the multi-threaded functionality introduced in LUCENE-1732.
 
&gt; Other improvements involve adding better support for different distance units, different
distance calculators and different data formats (whether it be lat/long fields, geohashes,
or something else in the future).
&gt; Patch to be added soon.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Updated: (LUCENE-2139) Cleanup and Improvement of Spatial Contrib</title>
<author><name>&quot;Chris Male (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c23238359.1260372798100.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c23238359-1260372798100-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T15:33:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

     [ https://issues.apache.org/jira/browse/LUCENE-2139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Chris Male updated LUCENE-2139:
-------------------------------

    Attachment: LUCENE-2139.patch

Added patch.

Couple of TODOs still noted in the patch related to distances.  We need to decide what distances
we are going to use for the radius and circumference of the Earth and then use them in SpatialConstants.
 Currently the SpatialConstants values are taken from Wikipedia and other sites, yet differ
from some of the distances in the coded.

Also the patch doesn't seem to remove a couple of empty packages.  Too many changes in 1 patch
confusing the IDE I think.  Help cleaning this up would be appreciated.

&gt; Cleanup and Improvement of Spatial Contrib
&gt; ------------------------------------------
&gt;
&gt;                 Key: LUCENE-2139
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2139
&gt;             Project: Lucene - Java
&gt;          Issue Type: Improvement
&gt;          Components: contrib/spatial
&gt;    Affects Versions: 3.1
&gt;            Reporter: Chris Male
&gt;            Assignee: Simon Willnauer
&gt;         Attachments: LUCENE-2139.patch
&gt;
&gt;
&gt; The current spatial contrib can be improved by adding documentation, tests, removing
unused classes and code, repackaging the classes and improving the performance of the distance
filtering.  The latter will incorporate the multi-threaded functionality introduced in LUCENE-1732.
 
&gt; Other improvements involve adding better support for different distance units, different
distance calculators and different data formats (whether it be lat/long fields, geohashes,
or something else in the future).
&gt; Patch to be added soon.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>Re: [jira] Commented: (LUCENE-2122) Use JUnit4 capabilites for more	thorough Locale testing for classes deriving from LocalizedTestCase</title>
<author><name>Erick Erickson &lt;erickerickson@gmail.com&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c359a92830912090617u2f94f3d5k2b0ce4e94b2ad9f4@mail.gmail.com%3e"/>
<id>urn:uuid:%3c359a92830912090617u2f94f3d5k2b0ce4e94b2ad9f4@mail-gmail-com%3e</id>
<updated>2009-12-09T14:17:17Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>
Siiiiggggh. I'll look at it again tonight....

On Wed, Dec 9, 2009 at 9:13 AM, Robert Muir (JIRA) &lt;jira@apache.org&gt; wrote:

&gt;
&gt;    [
&gt; https://issues.apache.org/jira/browse/LUCENE-2122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&amp;focusedCommentId=12788100#action_12788100]
&gt;
&gt; Robert Muir commented on LUCENE-2122:
&gt; -------------------------------------
&gt;
&gt; Hi Erick, in the Date tools test I think you can delete the public static
&gt; Collection&lt;Locale[]&gt; data(), I think you might have accidentally included
&gt; it?
&gt;
&gt;
&gt; &gt; Use JUnit4 capabilites for more thorough Locale testing for classes
&gt; deriving from LocalizedTestCase
&gt; &gt;
&gt; ---------------------------------------------------------------------------------------------------
&gt; &gt;
&gt; &gt;                 Key: LUCENE-2122
&gt; &gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2122
&gt; &gt;             Project: Lucene - Java
&gt; &gt;          Issue Type: Improvement
&gt; &gt;          Components: Other
&gt; &gt;    Affects Versions: 3.1
&gt; &gt;            Reporter: Erick Erickson
&gt; &gt;            Assignee: Erick Erickson
&gt; &gt;            Priority: Minor
&gt; &gt;             Fix For: 3.1
&gt; &gt;
&gt; &gt;         Attachments: LUCENE-2122-r2.patch, LUCENE-2122-r3.patch,
&gt; LUCENE-2122.patch
&gt; &gt;
&gt; &gt;
&gt; &gt; Use the @Parameterized capabilities of Junit4 to allow more extensive
&gt; testing of Locales.
&gt;
&gt; --
&gt; This message is automatically generated by JIRA.
&gt; -
&gt; You can reply to this email to add a comment to the issue online.
&gt;
&gt;
&gt; ---------------------------------------------------------------------
&gt; To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
&gt; For additional commands, e-mail: java-dev-help@lucene.apache.org
&gt;
&gt;


</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Commented: (LUCENE-2122) Use JUnit4 capabilites for more thorough Locale testing for classes deriving from LocalizedTestCase</title>
<author><name>&quot;Robert Muir (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c778083366.1260367998224.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c778083366-1260367998224-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T14:13:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

    [ https://issues.apache.org/jira/browse/LUCENE-2122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&amp;focusedCommentId=12788100#action_12788100
] 

Robert Muir commented on LUCENE-2122:
-------------------------------------

Hi Erick, in the Date tools test I think you can delete the public static Collection&lt;Locale[]&gt;
data(), I think you might have accidentally included it?


&gt; Use JUnit4 capabilites for more thorough Locale testing for classes deriving from LocalizedTestCase
&gt; ---------------------------------------------------------------------------------------------------
&gt;
&gt;                 Key: LUCENE-2122
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2122
&gt;             Project: Lucene - Java
&gt;          Issue Type: Improvement
&gt;          Components: Other
&gt;    Affects Versions: 3.1
&gt;            Reporter: Erick Erickson
&gt;            Assignee: Erick Erickson
&gt;            Priority: Minor
&gt;             Fix For: 3.1
&gt;
&gt;         Attachments: LUCENE-2122-r2.patch, LUCENE-2122-r3.patch, LUCENE-2122.patch
&gt;
&gt;
&gt; Use the @Parameterized capabilities of Junit4 to allow more extensive testing of Locales.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Assigned: (LUCENE-2122) Use JUnit4 capabilites for more thorough Locale testing for classes deriving from LocalizedTestCase</title>
<author><name>&quot;Robert Muir (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c2060593427.1260367998251.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c2060593427-1260367998251-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T14:13:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

     [ https://issues.apache.org/jira/browse/LUCENE-2122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Robert Muir reassigned LUCENE-2122:
-----------------------------------

    Assignee: Robert Muir  (was: Erick Erickson)

&gt; Use JUnit4 capabilites for more thorough Locale testing for classes deriving from LocalizedTestCase
&gt; ---------------------------------------------------------------------------------------------------
&gt;
&gt;                 Key: LUCENE-2122
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2122
&gt;             Project: Lucene - Java
&gt;          Issue Type: Improvement
&gt;          Components: Other
&gt;    Affects Versions: 3.1
&gt;            Reporter: Erick Erickson
&gt;            Assignee: Robert Muir
&gt;            Priority: Minor
&gt;             Fix For: 3.1
&gt;
&gt;         Attachments: LUCENE-2122-r2.patch, LUCENE-2122-r3.patch, LUCENE-2122.patch
&gt;
&gt;
&gt; Use the @Parameterized capabilities of Junit4 to allow more extensive testing of Locales.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Commented: (LUCENE-2126) Split up IndexInput and IndexOutput into DataInput and DataOutput</title>
<author><name>&quot;Marvin Humphrey (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c1100901971.1260367758142.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c1100901971-1260367758142-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T14:09:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

    [ https://issues.apache.org/jira/browse/LUCENE-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&amp;focusedCommentId=12788098#action_12788098
] 

Marvin Humphrey commented on LUCENE-2126:
-----------------------------------------

&gt; These methods should only be able to call the read/write methods (which this
&gt; issue moves to DataInput/Output), but not methods like close(), seek() etc..

Ah, so that's what it is.  

In that case, let me vote my (non-binding) -1.  I don't believe that the
enforcement of such a restriction justifies the complexity cost of adding a
new class to the public API.

First, adding yet another class to the hierarchy steepens the learning curve
for users and contributors.  If you aren't in the rarefied echelon of
exceptional brilliance occupied by people named Michael who work for IBM :),
the gradual accumulation of complexity in the Lucene code base matters.  Inch
by inch, things move out of reach.

Second, changing things now for what seems to me like a minor reason makes it
harder to refactor the class hierarchy in the future when other, more
important reasons are inevitably discovered.

For LUCENE-2125, I recommend two possible options. 

  * Do nothing and assume that the sort of advanced user who writes a posting 
    codec won't do something incredibly stupid like call indexInput.close().
  * Add a note to the docs for writing posting codecs indicating which sort of
    of IO methods you ought not to call.

&gt; once we see a need to allow users to extend DataInput/Output outside of
&gt; Lucene we can go ahead and make the additional changes that are mentioned in
&gt; your in my comments here.

In Lucy, there are three tiers of IO usage:

    * For low-level IO, use FileHandle.
    * For most applications, use InStream's encoder/decoder methods.
    * For performance-critical inner-loop material (e.g. posting decoders, 
      SortCollector), access the raw memory-mapped IO buffer using
      InStream_Buf()/InStream_Advance_Buf() and use static inline functions 
      such as NumUtil_decode_c32 (which does no bounds checking) from
      Lucy::Util::NumberUtils.

While you can extend InStream to add a codec, that's not generally the best
way to go about it, because adding a method to InStream requires that all of
your users both use your InStream class and use a subclassed Folder which
overrides the Folder_Open_In() factory method (analogous to 
Directory.openInput()).  Better is to use the extension point provided by
InStream_Buf()/InStream_Advance_Buf() and write a utility function which
accepts an InStream as an argument.

I don't expect and am not advocating that Lucene adopt the same IO hierarchy
as Lucy, but I wanted to provide an example of other reasons why you might
change things.  (What I'd really like to see is for Lucene to come up with
something *better* than the Lucy IO hierarchy.)  

One of the reasons Lucene has so many backwards compatibility headaches is
because the public APIs are so extensive and thus constitute such an elaborate
set of backwards compatibility promises.  IMO, DataInput and DataOutput do 
not offer sufficient benefit to compensate for the increased intricacy they add 
to that backwards compatibility contract.


&gt; Split up IndexInput and IndexOutput into DataInput and DataOutput
&gt; -----------------------------------------------------------------
&gt;
&gt;                 Key: LUCENE-2126
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2126
&gt;             Project: Lucene - Java
&gt;          Issue Type: Improvement
&gt;    Affects Versions: Flex Branch
&gt;            Reporter: Michael Busch
&gt;            Assignee: Michael Busch
&gt;            Priority: Minor
&gt;             Fix For: Flex Branch
&gt;
&gt;         Attachments: lucene-2126.patch
&gt;
&gt;
&gt; I'd like to introduce the two new classes DataInput and DataOutput
&gt; that contain all methods from IndexInput and IndexOutput that actually
&gt; decode or encode data, such as readByte()/writeByte(),
&gt; readVInt()/writeVInt().
&gt; Methods like getFilePointer(), seek(), close(), etc., which are not
&gt; related to data encoding, but to files as input/output source stay in
&gt; IndexInput/IndexOutput.
&gt; This patch also changes ByteSliceReader/ByteSliceWriter to extend
&gt; DataInput/DataOutput. Previously ByteSliceReader implemented the
&gt; methods that stay in IndexInput by throwing RuntimeExceptions.
&gt; See also LUCENE-2125.
&gt; All tests pass.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Resolved: (LUCENE-2124) move JDK collation to core, ICU collation to ICU contrib</title>
<author><name>&quot;Robert Muir (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c2113083248.1260367219283.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c2113083248-1260367219283-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T14:00:19Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

     [ https://issues.apache.org/jira/browse/LUCENE-2124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Robert Muir resolved LUCENE-2124.
---------------------------------

    Resolution: Fixed

website updated in revision 888803

&gt; move JDK collation to core, ICU collation to ICU contrib
&gt; --------------------------------------------------------
&gt;
&gt;                 Key: LUCENE-2124
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2124
&gt;             Project: Lucene - Java
&gt;          Issue Type: Task
&gt;          Components: contrib/*, Search
&gt;            Reporter: Robert Muir
&gt;            Assignee: Robert Muir
&gt;            Priority: Minor
&gt;             Fix For: 3.1
&gt;
&gt;         Attachments: LUCENE-2124.patch, LUCENE-2124.patch
&gt;
&gt;
&gt; As mentioned on the list, I propose we move the JDK-based CollationKeyFilter/CollationKeyAnalyzer,
currently located in contrib/collation into core for collation support (language-sensitive
sorting)
&gt; These are not much code (the heavy duty stuff is already in core, IndexableBinaryString).

&gt; And I would also like to move the ICUCollationKeyFilter/ICUCollationKeyAnalyzer (along
with the jar file they depend on) also currently located in contrib/collation into a contrib/icu.
&gt; This way, we can start looking at integrating other functionality from ICU into a fully-fleshed
out icu contrib.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Resolved: (LUCENE-2100) Make contrib analyzers final</title>
<author><name>&quot;Simon Willnauer (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c309635135.1260365660633.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c309635135-1260365660633-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T13:34:20Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

     [ https://issues.apache.org/jira/browse/LUCENE-2100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Simon Willnauer resolved LUCENE-2100.
-------------------------------------

    Resolution: Fixed

committed in revision 888799

thanks robert for review

&gt; Make contrib analyzers final
&gt; ----------------------------
&gt;
&gt;                 Key: LUCENE-2100
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2100
&gt;             Project: Lucene - Java
&gt;          Issue Type: Improvement
&gt;          Components: contrib/analyzers
&gt;    Affects Versions: 1.9, 2.0.0, 2.1, 2.2, 2.3, 2.3.1, 2.3.2, 2.4, 2.4.1, 2.9, 2.9.1,
3.0
&gt;            Reporter: Simon Willnauer
&gt;            Assignee: Simon Willnauer
&gt;            Priority: Minor
&gt;             Fix For: 3.1
&gt;
&gt;         Attachments: LUCENE-2100.patch, LUCENE-2100.patch
&gt;
&gt;
&gt; The analyzers in contrib/analyzers should all be marked final. None of the Analyzers
should ever be subclassed - users should build their own analyzers if a different combination
of filters and Tokenizers is desired.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Commented: (LUCENE-2100) Make contrib analyzers final</title>
<author><name>&quot;Robert Muir (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c1975153816.1260365538078.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c1975153816-1260365538078-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T13:32:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

    [ https://issues.apache.org/jira/browse/LUCENE-2100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&amp;focusedCommentId=12788084#action_12788084
] 

Robert Muir commented on LUCENE-2100:
-------------------------------------

patch looks good to me!

&gt; Make contrib analyzers final
&gt; ----------------------------
&gt;
&gt;                 Key: LUCENE-2100
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2100
&gt;             Project: Lucene - Java
&gt;          Issue Type: Improvement
&gt;          Components: contrib/analyzers
&gt;    Affects Versions: 1.9, 2.0.0, 2.1, 2.2, 2.3, 2.3.1, 2.3.2, 2.4, 2.4.1, 2.9, 2.9.1,
3.0
&gt;            Reporter: Simon Willnauer
&gt;            Assignee: Simon Willnauer
&gt;            Priority: Minor
&gt;             Fix For: 3.1
&gt;
&gt;         Attachments: LUCENE-2100.patch, LUCENE-2100.patch
&gt;
&gt;
&gt; The analyzers in contrib/analyzers should all be marked final. None of the Analyzers
should ever be subclassed - users should build their own analyzers if a different combination
of filters and Tokenizers is desired.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Updated: (LUCENE-2100) Make contrib analyzers final</title>
<author><name>&quot;Simon Willnauer (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c1655130955.1260365178303.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c1655130955-1260365178303-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T13:26:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

     [ https://issues.apache.org/jira/browse/LUCENE-2100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Simon Willnauer updated LUCENE-2100:
------------------------------------

    Attachment: LUCENE-2100.patch

Updated to latest trunk

&gt; Make contrib analyzers final
&gt; ----------------------------
&gt;
&gt;                 Key: LUCENE-2100
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2100
&gt;             Project: Lucene - Java
&gt;          Issue Type: Improvement
&gt;          Components: contrib/analyzers
&gt;    Affects Versions: 1.9, 2.0.0, 2.1, 2.2, 2.3, 2.3.1, 2.3.2, 2.4, 2.4.1, 2.9, 2.9.1,
3.0
&gt;            Reporter: Simon Willnauer
&gt;            Assignee: Simon Willnauer
&gt;            Priority: Minor
&gt;             Fix For: 3.1
&gt;
&gt;         Attachments: LUCENE-2100.patch, LUCENE-2100.patch
&gt;
&gt;
&gt; The analyzers in contrib/analyzers should all be marked final. None of the Analyzers
should ever be subclassed - users should build their own analyzers if a different combination
of filters and Tokenizers is desired.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>Patch for LUCENE-2122 ready to go</title>
<author><name>Erick Erickson &lt;erickerickson@gmail.com&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c359a92830912090521t76c4d154pfb328d7c596dfe1d@mail.gmail.com%3e"/>
<id>urn:uuid:%3c359a92830912090521t76c4d154pfb328d7c596dfe1d@mail-gmail-com%3e</id>
<updated>2009-12-09T13:21:22Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>
Does someone with commit rights want to pick this up? I've incorporated the
changes suggested by Robert (Thanks!) and think it's ready to go.....

Erick


</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Commented: (LUCENE-2104) IndexWriter.unlock does does nothing if NativeFSLockFactory is used</title>
<author><name>&quot;Shai Erera (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c1198916474.1260363798114.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c1198916474-1260363798114-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T13:03:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

    [ https://issues.apache.org/jira/browse/LUCENE-2104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&amp;focusedCommentId=12788077#action_12788077
] 

Shai Erera commented on LUCENE-2104:
------------------------------------

I think that if I move those lines (in NativeFSLock.release()):
{code}
      if (!path.delete())
        throw new LockReleaseFailedException("failed to delete " + path);
{code}
to outside the if(lockExists()) section, this should work? Because then the new NativeFSLock
will attempt to release an lock that's held by someone else, and fail. If the lock exists
for some reason, but nobody is holding it, that line should succeed.

In order to test it, I think I'll need to spawn two processes, which is trickier. Let me know
what you think about the fix in the meantime.

&gt; IndexWriter.unlock does does nothing if NativeFSLockFactory is used
&gt; -------------------------------------------------------------------
&gt;
&gt;                 Key: LUCENE-2104
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2104
&gt;             Project: Lucene - Java
&gt;          Issue Type: Bug
&gt;            Reporter: Shai Erera
&gt;             Fix For: 3.1
&gt;
&gt;
&gt; If NativeFSLockFactory is used, IndexWriter.unlock will return, silently doing nothing.
The reason is that NativeFSLockFactory's makeLock always creates a new NativeFSLock. NativeFSLock's
release first checks if its lock is not null. However, only if obtain() is called, that lock
is not null. So release actually does nothing, and so IndexWriter.unlock does not delete the
lock, or fail w/ exception.
&gt; This is only a problem in NativeFSLock, and not in other Lock implementations, at least
as I was able to see.
&gt; Need to think first how to reproduce in a test, and then fix it. I'll work on it.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Resolved: (LUCENE-2117) Fix SnowballAnalyzer casing behavior for Turkish Language</title>
<author><name>&quot;Simon Willnauer (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c56004925.1260363078129.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c56004925-1260363078129-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T12:51:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

     [ https://issues.apache.org/jira/browse/LUCENE-2117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Simon Willnauer resolved LUCENE-2117.
-------------------------------------

    Resolution: Fixed

committed in revision 888787

thanks robert

&gt; Fix SnowballAnalyzer casing behavior for Turkish Language
&gt; ---------------------------------------------------------
&gt;
&gt;                 Key: LUCENE-2117
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2117
&gt;             Project: Lucene - Java
&gt;          Issue Type: Bug
&gt;          Components: contrib/*
&gt;    Affects Versions: 3.0
&gt;            Reporter: Simon Willnauer
&gt;            Assignee: Simon Willnauer
&gt;            Priority: Minor
&gt;             Fix For: 3.1
&gt;
&gt;         Attachments: LUCENE-2117.patch, LUCENE-2117.patch
&gt;
&gt;
&gt; LUCENE-2102 added a new TokenFilter to handle Turkish unique casing behavior correctly.
We should fix the casing behavior in SnowballAnalyzer too as it supports a TurkishStemmer.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Commented: (LUCENE-2124) move JDK collation to core, ICU collation to ICU contrib</title>
<author><name>&quot;Robert Muir (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c878464381.1260360559779.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c878464381-1260360559779-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T12:09:19Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

    [ https://issues.apache.org/jira/browse/LUCENE-2124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&amp;focusedCommentId=12788054#action_12788054
] 

Robert Muir commented on LUCENE-2124:
-------------------------------------

Committed revision 888780.

I will keep this open until i regen the website and commit the changes.

&gt; move JDK collation to core, ICU collation to ICU contrib
&gt; --------------------------------------------------------
&gt;
&gt;                 Key: LUCENE-2124
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2124
&gt;             Project: Lucene - Java
&gt;          Issue Type: Task
&gt;          Components: contrib/*, Search
&gt;            Reporter: Robert Muir
&gt;            Assignee: Robert Muir
&gt;            Priority: Minor
&gt;             Fix For: 3.1
&gt;
&gt;         Attachments: LUCENE-2124.patch, LUCENE-2124.patch
&gt;
&gt;
&gt; As mentioned on the list, I propose we move the JDK-based CollationKeyFilter/CollationKeyAnalyzer,
currently located in contrib/collation into core for collation support (language-sensitive
sorting)
&gt; These are not much code (the heavy duty stuff is already in core, IndexableBinaryString).

&gt; And I would also like to move the ICUCollationKeyFilter/ICUCollationKeyAnalyzer (along
with the jar file they depend on) also currently located in contrib/collation into a contrib/icu.
&gt; This way, we can start looking at integrating other functionality from ICU into a fully-fleshed
out icu contrib.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Updated: (LUCENE-2123) Highlighter fails to highlight FuzzyQuery</title>
<author><name>&quot;Uwe Schindler (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c1190390623.1260360498637.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c1190390623-1260360498637-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T12:08:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

     [ https://issues.apache.org/jira/browse/LUCENE-2123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Uwe Schindler updated LUCENE-2123:
----------------------------------

    Attachment: LUCENE-2123-flex.patch

Now i also made the strange anonymous inner class a named inner class to get rid of the strange
boolean holder, implemented by an array.

&gt; Highlighter fails to highlight FuzzyQuery
&gt; -----------------------------------------
&gt;
&gt;                 Key: LUCENE-2123
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2123
&gt;             Project: Lucene - Java
&gt;          Issue Type: Bug
&gt;          Components: contrib/highlighter
&gt;    Affects Versions: Flex Branch
&gt;            Reporter: Uwe Schindler
&gt;            Assignee: Uwe Schindler
&gt;             Fix For: Flex Branch
&gt;
&gt;         Attachments: LUCENE-2123-flex.patch, LUCENE-2123-flex.patch, LUCENE-2123-flex.patch
&gt;
&gt;
&gt; As FuzzyQuery does not allow to change the rewrite mode, highlighter fails with UOE in
flex since LUCENE-2110, because it changes the rewrite mode to Boolean query. The fix is:
Allow MTQ to change rewrite method and make FUZZY_REWRITE public for that.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Updated: (LUCENE-2123) Highlighter fails to highlight FuzzyQuery</title>
<author><name>&quot;Uwe Schindler (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c31244610.1260359898228.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c31244610-1260359898228-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T11:58:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

     [ https://issues.apache.org/jira/browse/LUCENE-2123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Uwe Schindler updated LUCENE-2123:
----------------------------------

    Attachment: LUCENE-2123-flex.patch

More refactoring. No also AUTO_REWRITE uses the new TermCollector. It gets less and less code.

&gt; Highlighter fails to highlight FuzzyQuery
&gt; -----------------------------------------
&gt;
&gt;                 Key: LUCENE-2123
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2123
&gt;             Project: Lucene - Java
&gt;          Issue Type: Bug
&gt;          Components: contrib/highlighter
&gt;    Affects Versions: Flex Branch
&gt;            Reporter: Uwe Schindler
&gt;            Assignee: Uwe Schindler
&gt;             Fix For: Flex Branch
&gt;
&gt;         Attachments: LUCENE-2123-flex.patch, LUCENE-2123-flex.patch
&gt;
&gt;
&gt; As FuzzyQuery does not allow to change the rewrite mode, highlighter fails with UOE in
flex since LUCENE-2110, because it changes the rewrite mode to Boolean query. The fix is:
Allow MTQ to change rewrite method and make FUZZY_REWRITE public for that.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Updated: (LUCENE-2123) Highlighter fails to highlight FuzzyQuery</title>
<author><name>&quot;Uwe Schindler (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c1543999704.1260358218079.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c1543999704-1260358218079-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T11:30:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

     [ https://issues.apache.org/jira/browse/LUCENE-2123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Uwe Schindler updated LUCENE-2123:
----------------------------------

    Attachment: LUCENE-2123-flex.patch

Here a refactoring of the rewrite modes in Flex. I'll port to trunk, too.

FuzzyQuery now uses per default TOP_TERMS_SCORING_BOOLEAN_REWRITE which is part of MTQ and
can now also be used by e.g. MoreLikeThis.

&gt; Highlighter fails to highlight FuzzyQuery
&gt; -----------------------------------------
&gt;
&gt;                 Key: LUCENE-2123
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2123
&gt;             Project: Lucene - Java
&gt;          Issue Type: Bug
&gt;          Components: contrib/highlighter
&gt;    Affects Versions: Flex Branch
&gt;            Reporter: Uwe Schindler
&gt;            Assignee: Uwe Schindler
&gt;             Fix For: Flex Branch
&gt;
&gt;         Attachments: LUCENE-2123-flex.patch
&gt;
&gt;
&gt; As FuzzyQuery does not allow to change the rewrite mode, highlighter fails with UOE in
flex since LUCENE-2110, because it changes the rewrite mode to Boolean query. The fix is:
Allow MTQ to change rewrite method and make FUZZY_REWRITE public for that.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Resolved: (LUCENE-1512) Incorporate GeoHash in contrib/spatial</title>
<author><name>&quot;Grant Ingersoll (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c1618391324.1260357978197.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c1618391324-1260357978197-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T11:26:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

     [ https://issues.apache.org/jira/browse/LUCENE-1512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Grant Ingersoll resolved LUCENE-1512.
-------------------------------------

       Resolution: Fixed
    Fix Version/s: 2.9
    Lucene Fields:   (was: [New])

&gt; Incorporate GeoHash in contrib/spatial
&gt; --------------------------------------
&gt;
&gt;                 Key: LUCENE-1512
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-1512
&gt;             Project: Lucene - Java
&gt;          Issue Type: New Feature
&gt;          Components: contrib/spatial
&gt;            Reporter: patrick o'leary
&gt;            Assignee: Ryan McKinley
&gt;            Priority: Minor
&gt;             Fix For: 2.9
&gt;
&gt;         Attachments: LUCENE-1512.patch, LUCENE-1512.patch
&gt;
&gt;
&gt; Based on comments from Yonik and Ryan in SOLR-773 
&gt; GeoHash provides the ability to store latitude / longitude values in a single field consistent
hash field.
&gt; Which elements the need to maintain 2 field caches for latitude / longitude fields, reducing
the size of an index
&gt; and the amount of memory needed for a spatial search.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Commented: (LUCENE-2124) move JDK collation to core, ICU collation to ICU contrib</title>
<author><name>&quot;Simon Willnauer (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c769555597.1260357738084.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c769555597-1260357738084-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T11:22:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

    [ https://issues.apache.org/jira/browse/LUCENE-2124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&amp;focusedCommentId=12788040#action_12788040
] 

Simon Willnauer commented on LUCENE-2124:
-----------------------------------------

Robert patch looks good to me!
Go for it!

&gt; move JDK collation to core, ICU collation to ICU contrib
&gt; --------------------------------------------------------
&gt;
&gt;                 Key: LUCENE-2124
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2124
&gt;             Project: Lucene - Java
&gt;          Issue Type: Task
&gt;          Components: contrib/*, Search
&gt;            Reporter: Robert Muir
&gt;            Assignee: Robert Muir
&gt;            Priority: Minor
&gt;             Fix For: 3.1
&gt;
&gt;         Attachments: LUCENE-2124.patch, LUCENE-2124.patch
&gt;
&gt;
&gt; As mentioned on the list, I propose we move the JDK-based CollationKeyFilter/CollationKeyAnalyzer,
currently located in contrib/collation into core for collation support (language-sensitive
sorting)
&gt; These are not much code (the heavy duty stuff is already in core, IndexableBinaryString).

&gt; And I would also like to move the ICUCollationKeyFilter/ICUCollationKeyAnalyzer (along
with the jar file they depend on) also currently located in contrib/collation into a contrib/icu.
&gt; This way, we can start looking at integrating other functionality from ICU into a fully-fleshed
out icu contrib.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Commented: (LUCENE-1512) Incorporate GeoHash in contrib/spatial</title>
<author><name>&quot;Simon Willnauer (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c2110655762.1260357258103.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c2110655762-1260357258103-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T11:14:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

    [ https://issues.apache.org/jira/browse/LUCENE-1512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&amp;focusedCommentId=12788037#action_12788037
] 

Simon Willnauer commented on LUCENE-1512:
-----------------------------------------

Is this isssue still relevant? seems like it has been committted

&gt; Incorporate GeoHash in contrib/spatial
&gt; --------------------------------------
&gt;
&gt;                 Key: LUCENE-1512
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-1512
&gt;             Project: Lucene - Java
&gt;          Issue Type: New Feature
&gt;          Components: contrib/spatial
&gt;            Reporter: patrick o'leary
&gt;            Assignee: Ryan McKinley
&gt;            Priority: Minor
&gt;         Attachments: LUCENE-1512.patch, LUCENE-1512.patch
&gt;
&gt;
&gt; Based on comments from Yonik and Ryan in SOLR-773 
&gt; GeoHash provides the ability to store latitude / longitude values in a single field consistent
hash field.
&gt; Which elements the need to maintain 2 field caches for latitude / longitude fields, reducing
the size of an index
&gt; and the amount of memory needed for a spatial search.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Assigned: (LUCENE-2139) Cleanup and Improvement of Spatial Contrib</title>
<author><name>&quot;Simon Willnauer (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c869569121.1260356898326.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c869569121-1260356898326-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T11:08:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

     [ https://issues.apache.org/jira/browse/LUCENE-2139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Simon Willnauer reassigned LUCENE-2139:
---------------------------------------

    Assignee: Simon Willnauer

&gt; Cleanup and Improvement of Spatial Contrib
&gt; ------------------------------------------
&gt;
&gt;                 Key: LUCENE-2139
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2139
&gt;             Project: Lucene - Java
&gt;          Issue Type: Improvement
&gt;          Components: contrib/spatial
&gt;    Affects Versions: 3.1
&gt;            Reporter: Chris Male
&gt;            Assignee: Simon Willnauer
&gt;
&gt; The current spatial contrib can be improved by adding documentation, tests, removing
unused classes and code, repackaging the classes and improving the performance of the distance
filtering.  The latter will incorporate the multi-threaded functionality introduced in LUCENE-1732.
 
&gt; Other improvements involve adding better support for different distance units, different
distance calculators and different data formats (whether it be lat/long fields, geohashes,
or something else in the future).
&gt; Patch to be added soon.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Commented: (LUCENE-2139) Cleanup and Improvement of Spatial Contrib</title>
<author><name>&quot;Simon Willnauer (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c1297884265.1260356898560.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c1297884265-1260356898560-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T11:08:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

    [ https://issues.apache.org/jira/browse/LUCENE-2139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&amp;focusedCommentId=12788034#action_12788034
] 

Simon Willnauer commented on LUCENE-2139:
-----------------------------------------

cant wait to see you patch - its gonna be huge I guess :) 

I will be here to help you splitting it apart and get you good work into contrib/spatial

&gt; Cleanup and Improvement of Spatial Contrib
&gt; ------------------------------------------
&gt;
&gt;                 Key: LUCENE-2139
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2139
&gt;             Project: Lucene - Java
&gt;          Issue Type: Improvement
&gt;          Components: contrib/spatial
&gt;    Affects Versions: 3.1
&gt;            Reporter: Chris Male
&gt;            Assignee: Simon Willnauer
&gt;
&gt; The current spatial contrib can be improved by adding documentation, tests, removing
unused classes and code, repackaging the classes and improving the performance of the distance
filtering.  The latter will incorporate the multi-threaded functionality introduced in LUCENE-1732.
 
&gt; Other improvements involve adding better support for different distance units, different
distance calculators and different data formats (whether it be lat/long fields, geohashes,
or something else in the future).
&gt; Patch to be added soon.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Created: (LUCENE-2139) Cleanup and Improvement of Spatial Contrib</title>
<author><name>&quot;Chris Male (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c1384348470.1260356778111.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c1384348470-1260356778111-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T11:06:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>
Cleanup and Improvement of Spatial Contrib
------------------------------------------

                 Key: LUCENE-2139
                 URL: https://issues.apache.org/jira/browse/LUCENE-2139
             Project: Lucene - Java
          Issue Type: Improvement
          Components: contrib/spatial
    Affects Versions: 3.1
            Reporter: Chris Male


The current spatial contrib can be improved by adding documentation, tests, removing unused
classes and code, repackaging the classes and improving the performance of the distance filtering.
 The latter will incorporate the multi-threaded functionality introduced in LUCENE-1732. 


Other improvements involve adding better support for different distance units, different distance
calculators and different data formats (whether it be lat/long fields, geohashes, or something
else in the future).

Patch to be added soon.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Resolved: (LUCENE-2107) Add contrib/fast-vector-highlighter to Maven central repo</title>
<author><name>&quot;Simon Willnauer (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c518358972.1260354738207.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c518358972-1260354738207-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T10:32:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

     [ https://issues.apache.org/jira/browse/LUCENE-2107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Simon Willnauer resolved LUCENE-2107.
-------------------------------------

       Resolution: Fixed
    Fix Version/s: 2.9.1
                   3.0

The missing maven artifacts for the fast-vector-highlighter contrib of
Lucene Java in version 2.9.1 and 3.0.0 are now available at:

http://repo1.maven.org/maven2/org/apache/lucene/
http://repo2.maven.org/maven2/org/apache/lucene/


&gt; Add contrib/fast-vector-highlighter to Maven central repo
&gt; ---------------------------------------------------------
&gt;
&gt;                 Key: LUCENE-2107
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2107
&gt;             Project: Lucene - Java
&gt;          Issue Type: Task
&gt;          Components: contrib/*
&gt;    Affects Versions: 2.9.1, 3.0
&gt;            Reporter: Chas Emerick
&gt;            Assignee: Simon Willnauer
&gt;             Fix For: 3.0, 2.9.1
&gt;
&gt;         Attachments: LUCENE-2107.patch
&gt;
&gt;
&gt; I'm not at all familiar with the Lucene build/deployment process, but it would be very
nice if releases of the fast vector highlighter were pushed to the maven central repository,
as is done with other contrib modules.
&gt; (Issue filed at the request of Grant Ingersoll.)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>Re: [VOTE] Push fast-vector-highlighter mvn artifacts for 3.0.0 and	2.9.1</title>
<author><name>Simon Willnauer &lt;simon.willnauer@googlemail.com&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3cf18c9dde0912090232s3d796dc6hf7b9b9fb98a249d1@mail.gmail.com%3e"/>
<id>urn:uuid:%3cf18c9dde0912090232s3d796dc6hf7b9b9fb98a249d1@mail-gmail-com%3e</id>
<updated>2009-12-09T10:32:12Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>
nice - I closed the issue.
thanks uwe

On Wed, Dec 9, 2009 at 10:28 AM, Uwe Schindler &lt;uwe@thetaphi.de&gt; wrote:
&gt; Hi all,
&gt;
&gt; The missing maven artifacts for the fast-vector-highlighter contrib of
&gt; Lucene Java in version 2.9.1 and 3.0.0 are now available at:
&gt;
&gt; http://repo1.maven.org/maven2/org/apache/lucene/
&gt; http://repo2.maven.org/maven2/org/apache/lucene/
&gt;
&gt; Uwe
&gt;
&gt; -----
&gt; Uwe Schindler
&gt; uschindler@apache.org
&gt; Apache Lucene Java Committer
&gt; Bremen, Germany
&gt; http://lucene.apache.org/java/docs/
&gt;
&gt;&gt; From: Uwe Schindler [mailto:uwe@thetaphi.de]
&gt;&gt; Sent: Tuesday, December 08, 2009 10:41 PM
&gt;&gt; To: java-dev@lucene.apache.org; general@lucene.apache.org
&gt;&gt; Subject: RE: [VOTE] Push fast-vector-highlighter mvn artifacts for 3.0.0
&gt;&gt; and 2.9.1
&gt;&gt;
&gt;&gt; I got 3 binding votes from Grant, Mike, and Ted (and one from Simon, who
&gt;&gt; was
&gt;&gt; a big help on Sunday evening when I created the artifacts), so I push the
&gt;&gt; maven artifacts onto the rsync repo in few minutes.
&gt;&gt;
&gt;&gt; Thanks!
&gt;&gt;
&gt;&gt; -----
&gt;&gt; Uwe Schindler
&gt;&gt; H.-H.-Meier-Allee 63, D-28213 Bremen
&gt;&gt; http://www.thetaphi.de
&gt;&gt; eMail: uwe@thetaphi.de
&gt;&gt;
&gt;&gt; &gt; -----Original Message-----
&gt;&gt; &gt; From: Uwe Schindler [mailto:uwe@thetaphi.de]
&gt;&gt; &gt; Sent: Tuesday, December 08, 2009 7:03 PM
&gt;&gt; &gt; To: java-dev@lucene.apache.org
&gt;&gt; &gt; Subject: [VOTE] Push fast-vector-highlighter mvn artifacts for 3.0.0 and
&gt;&gt; &gt; 2.9.1
&gt;&gt; &gt;
&gt;&gt; &gt; Sorry,
&gt;&gt; &gt;
&gt;&gt; &gt; I initially didn't want to start a vote, as Grant only proposed to
&gt;&gt; "maybe
&gt;&gt; &gt; start one". But nobody responded (esp. to the questions in this mail) I
&gt;&gt; &gt; ask
&gt;&gt; &gt; again, an I will start the vote for now.
&gt;&gt; &gt;
&gt;&gt; &gt;
&gt;&gt; ==========================================================================
&gt;&gt; &gt; ==
&gt;&gt; &gt; Please vote, that the missing artifacts for of fast-verctor-highlighter
&gt;&gt; of
&gt;&gt; &gt; Lucene Java 2.9.1 and 3.0.0 should be pushed to repoX.maven.org.
&gt;&gt; &gt;
&gt;&gt; &gt; You can find the artifacts here:
&gt;&gt; &gt; http://people.apache.org/~uschindler/staging-area/
&gt;&gt; &gt;
&gt;&gt; &gt; This dir contains only the maven folder to be copied to maven-rsync dir
&gt;&gt; on
&gt;&gt; &gt; p.a.o. The top-level version in the maven metadata is 3.0.0, which
&gt;&gt; &gt; conforms
&gt;&gt; &gt; to the current state on maven (so during merging both folders during
&gt;&gt; &gt; build,
&gt;&gt; &gt; I set preference to metadata.xml of 3.0.0).
&gt;&gt; &gt;
&gt;&gt; &gt; All files are signed by my PGP key (even the 2.9.1 ones; that release
&gt;&gt; was
&gt;&gt; &gt; originally built by Mike McCandless).
&gt;&gt; &gt;
&gt;&gt; ==========================================================================
&gt;&gt; &gt; ==
&gt;&gt; &gt;
&gt;&gt; &gt; What I additionally found out until now (because Simon nagged me):
&gt;&gt; &gt;
&gt;&gt; &gt; If you compare the JAR files inside the binary ZIP file from the apache
&gt;&gt; &gt; archive and the JAR files directly published on maven (for the other
&gt;&gt; &gt; contribs), the MD5s/SHA1s are different even as they are created from
&gt;&gt; the
&gt;&gt; &gt; same source code (because the timestamps inside the JAR are different,
&gt;&gt; for
&gt;&gt; &gt; 2.9.1 another JDK compiler/platform was used). This interestingly does
&gt;&gt; not
&gt;&gt; &gt; apply to lucene-core.jar in 3.0. Because of that I see no problem with
&gt;&gt; &gt; this
&gt;&gt; &gt; maven release, even that they are not the orginal JAR files from the
&gt;&gt; &gt; binary
&gt;&gt; &gt; distrib.
&gt;&gt; &gt;
&gt;&gt; &gt; What is not nice, is that the svn revision number in the manifest is
&gt;&gt; &gt; different, but else is exactly the same, see my comments below in
&gt;&gt; earlier
&gt;&gt; &gt; mails about changing the ant script for showing the SVN rev of the last
&gt;&gt; &gt; changed file.
&gt;&gt; &gt;
&gt;&gt; &gt; So if nobody objects to release these rebuild jar files, all signed by
&gt;&gt; my
&gt;&gt; &gt; key, I would like to simply put them on the maven-rsync folder.
&gt;&gt; &gt;
&gt;&gt; &gt; Uwe
&gt;&gt; &gt;
&gt;&gt; &gt; -----
&gt;&gt; &gt; Uwe Schindler
&gt;&gt; &gt; H.-H.-Meier-Allee 63, D-28213 Bremen
&gt;&gt; &gt; http://www.thetaphi.de
&gt;&gt; &gt; eMail: uwe@thetaphi.de
&gt;&gt; &gt;
&gt;&gt; &gt;
&gt;&gt; &gt; &gt; -----Original Message-----
&gt;&gt; &gt; &gt; From: Chris Hostetter [mailto:hossman_lucene@fucit.org]
&gt;&gt; &gt; &gt; Sent: Tuesday, December 08, 2009 6:48 PM
&gt;&gt; &gt; &gt; To: java-dev@lucene.apache.org
&gt;&gt; &gt; &gt; Subject: Re: (NAG) Push fast-vector-highlighter mvn artifacts for 3.0
&gt;&gt; &gt; and
&gt;&gt; &gt; &gt; 2.9
&gt;&gt; &gt; &gt;
&gt;&gt; &gt; &gt;
&gt;&gt; &gt; &gt; : What to do now, any votes on adding the missing maven artifacts for
&gt;&gt; &gt; &gt; : fast-vector-highlighter to 2.9.1 and 3.0.0 on the apache maven
&gt;&gt; &gt; &gt; reposititory?
&gt;&gt; &gt; &gt;
&gt;&gt; &gt; &gt; It's not even clear to me that anything special needs to be done
&gt;&gt; before
&gt;&gt; &gt; &gt; publishing those jars to maven.  2.9.1 and 3.0.0 were already voted on
&gt;&gt; &gt; and
&gt;&gt; &gt; &gt; released -- including all of the source code in them.
&gt;&gt; &gt; &gt;
&gt;&gt; &gt; &gt; The safest bet least likely to anger the process gods is just to call
&gt;&gt; a
&gt;&gt; &gt; &gt; vote (new thread with VOTE in the subject) and cast a vote ...
&gt;&gt; &gt; considering
&gt;&gt; &gt; &gt; the sources has already been reviewed it should go pretty quick.
&gt;&gt; &gt; &gt;
&gt;&gt; &gt; &gt; :
&gt;&gt; &gt; &gt; : &gt; I rebuilt the maven-dir for 2.9.1 and 3.0.0, merged them (3.0.0
is
&gt;&gt; &gt; &gt; top-
&gt;&gt; &gt; &gt; : &gt; level
&gt;&gt; &gt; &gt; : &gt; version) and extracted only fast-vector-highlighter:
&gt;&gt; &gt; &gt; : &gt;
&gt;&gt; &gt; &gt; : &gt; http://people.apache.org/~uschindler/staging-area/
&gt;&gt; &gt; &gt; : &gt;
&gt;&gt; &gt; &gt; : &gt; I will copy this dir to the maven folder on people.a.o, when I got
&gt;&gt; &gt; &gt; votes
&gt;&gt; &gt; &gt; : &gt; (how many)? At least someone should check the signatures.
&gt;&gt; &gt; &gt; : &gt;
&gt;&gt; &gt; &gt; : &gt; By the way, we have a small error in our ant build.xml that
&gt;&gt; inserts
&gt;&gt; &gt; &gt; : &gt; svnversion into the manifest file. This version is not the version
&gt;&gt; &gt; of
&gt;&gt; &gt; &gt; the
&gt;&gt; &gt; &gt; : &gt; last changed item (would be svnversion -c) but the current svn
&gt;&gt; &gt; &gt; version,
&gt;&gt; &gt; &gt; : &gt; even
&gt;&gt; &gt; &gt; : &gt; that I checked out the corresponding tags. It's no problem at all,
&gt;&gt; &gt; but
&gt;&gt; &gt; &gt; not
&gt;&gt; &gt; &gt; : &gt; very nice.
&gt;&gt; &gt; &gt; : &gt;
&gt;&gt; &gt; &gt; : &gt; Maybe we should change build.xml to call "svnversion -c" in
&gt;&gt; future,
&gt;&gt; &gt; to
&gt;&gt; &gt; &gt; get
&gt;&gt; &gt; &gt; : &gt; the real number.
&gt;&gt; &gt; &gt; : &gt;
&gt;&gt; &gt; &gt; : &gt; Uwe
&gt;&gt; &gt; &gt; : &gt;
&gt;&gt; &gt; &gt; : &gt; -----
&gt;&gt; &gt; &gt; : &gt; Uwe Schindler
&gt;&gt; &gt; &gt; : &gt; H.-H.-Meier-Allee 63, D-28213 Bremen
&gt;&gt; &gt; &gt; : &gt; http://www.thetaphi.de
&gt;&gt; &gt; &gt; : &gt; eMail: uwe@thetaphi.de
&gt;&gt; &gt; &gt; : &gt;
&gt;&gt; &gt; &gt; : &gt;
&gt;&gt; &gt; &gt; : &gt; &gt; -----Original Message-----
&gt;&gt; &gt; &gt; : &gt; &gt; From: Grant Ingersoll [mailto:gsingers@apache.org]
&gt;&gt; &gt; &gt; : &gt; &gt; Sent: Saturday, December 05, 2009 10:26 PM
&gt;&gt; &gt; &gt; : &gt; &gt; To: java-dev@lucene.apache.org
&gt;&gt; &gt; &gt; : &gt; &gt; Subject: Re: Push fast-vector-highlighter mvn artifacts for
3.0
&gt;&gt; &gt; and
&gt;&gt; &gt; &gt; 2.9
&gt;&gt; &gt; &gt; : &gt; &gt;
&gt;&gt; &gt; &gt; : &gt; &gt; I suppose we could put up the artifacts on a dev site and then
&gt;&gt; we
&gt;&gt; &gt; &gt; could
&gt;&gt; &gt; &gt; : &gt; &gt; vote to release both of them pretty quickly.  I think that
&gt;&gt; should
&gt;&gt; &gt; be
&gt;&gt; &gt; &gt; : &gt; easy
&gt;&gt; &gt; &gt; : &gt; &gt; to do, since it pretty much only involves verifying the jar
and
&gt;&gt; &gt; the
&gt;&gt; &gt; &gt; : &gt; &gt; signatures.
&gt;&gt; &gt; &gt; : &gt; &gt;
&gt;&gt; &gt; &gt; : &gt; &gt; On Dec 5, 2009, at 1:03 PM, Simon Willnauer wrote:
&gt;&gt; &gt; &gt; : &gt; &gt;
&gt;&gt; &gt; &gt; : &gt; &gt; &gt; hi folks,
&gt;&gt; &gt; &gt; : &gt; &gt; &gt; The maven artifacts for fast-vector-highlighter have never
&gt;&gt; been
&gt;&gt; &gt; &gt; pushed
&gt;&gt; &gt; &gt; : &gt; &gt; &gt; since it was released because there were no pom.xml.template
&gt;&gt; &gt; &gt; inside
&gt;&gt; &gt; &gt; : &gt; &gt; &gt; the module. I added a pom file a day ago in the context
of
&gt;&gt; &gt; &gt; : &gt; &gt; &gt; LUCENE-2107. I already talked to uwe and grant how to
deal
&gt;&gt; with
&gt;&gt; &gt; &gt; this
&gt;&gt; &gt; &gt; : &gt; &gt; &gt; issues and if we should push the artifact for Lucene 2.9
/
&gt;&gt; 3.0.
&gt;&gt; &gt; &gt; Since
&gt;&gt; &gt; &gt; : &gt; &gt; &gt; this is only a metadata file we could consider rebuilding
the
&gt;&gt; &gt; &gt; : &gt; &gt; &gt; artefacts and publish them for those releases. I can not
&gt;&gt; &gt; remember
&gt;&gt; &gt; &gt; that
&gt;&gt; &gt; &gt; : &gt; &gt; &gt; anything like that happened before, so we should discuss
how
&gt;&gt; to
&gt;&gt; &gt; &gt; deal
&gt;&gt; &gt; &gt; : &gt; &gt; &gt; with this situation and if we should wait until 3.1.
&gt;&gt; &gt; &gt; : &gt; &gt; &gt;
&gt;&gt; &gt; &gt; : &gt; &gt; &gt; simon
&gt;&gt; &gt; &gt; : &gt; &gt; &gt;
&gt;&gt; &gt; &gt; : &gt; &gt; &gt; --------------------------------------------------------------
&gt;&gt; --
&gt;&gt; &gt; --
&gt;&gt; &gt; &gt; ---
&gt;&gt; &gt; &gt; : &gt; &gt; &gt; To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
&gt;&gt; &gt; &gt; : &gt; &gt; &gt; For additional commands, e-mail: java-dev-
&gt;&gt; help@lucene.apache.org
&gt;&gt; &gt; &gt; : &gt; &gt; &gt;
&gt;&gt; &gt; &gt; : &gt; &gt;
&gt;&gt; &gt; &gt; : &gt; &gt;
&gt;&gt; &gt; &gt; : &gt; &gt;
&gt;&gt; &gt; &gt; : &gt; &gt; ----------------------------------------------------------------
&gt;&gt; --
&gt;&gt; &gt; --
&gt;&gt; &gt; &gt; -
&gt;&gt; &gt; &gt; : &gt; &gt; To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
&gt;&gt; &gt; &gt; : &gt; &gt; For additional commands, e-mail: java-dev-help@lucene.apache.org
&gt;&gt; &gt; &gt; : &gt;
&gt;&gt; &gt; &gt; : &gt;
&gt;&gt; &gt; &gt; : &gt;
&gt;&gt; &gt; &gt; : &gt; ------------------------------------------------------------------
&gt;&gt; --
&gt;&gt; &gt; -
&gt;&gt; &gt; &gt; : &gt; To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
&gt;&gt; &gt; &gt; : &gt; For additional commands, e-mail: java-dev-help@lucene.apache.org
&gt;&gt; &gt; &gt; :
&gt;&gt; &gt; &gt; :
&gt;&gt; &gt; &gt; :
&gt;&gt; &gt; &gt; : --------------------------------------------------------------------
&gt;&gt; -
&gt;&gt; &gt; &gt; : To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
&gt;&gt; &gt; &gt; : For additional commands, e-mail: java-dev-help@lucene.apache.org
&gt;&gt; &gt; &gt; :
&gt;&gt; &gt; &gt;
&gt;&gt; &gt; &gt;
&gt;&gt; &gt; &gt;
&gt;&gt; &gt; &gt; -Hoss
&gt;&gt; &gt; &gt;
&gt;&gt; &gt; &gt;
&gt;&gt; &gt; &gt; ---------------------------------------------------------------------
&gt;&gt; &gt; &gt; To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
&gt;&gt; &gt; &gt; For additional commands, e-mail: java-dev-help@lucene.apache.org
&gt;&gt; &gt;
&gt;&gt; &gt;
&gt;&gt; &gt;
&gt;&gt; &gt; ---------------------------------------------------------------------
&gt;&gt; &gt; To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
&gt;&gt; &gt; For additional commands, e-mail: java-dev-help@lucene.apache.org
&gt;&gt;
&gt;
&gt;
&gt;
&gt; ---------------------------------------------------------------------
&gt; To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
&gt; For additional commands, e-mail: java-dev-help@lucene.apache.org
&gt;
&gt;

---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Commented: (LUCENE-2138) Allow custom index readers when using IndexWriter.getReader</title>
<author><name>&quot;Michael McCandless (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c749923994.1260354498079.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c749923994-1260354498079-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T10:28:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

    [ https://issues.apache.org/jira/browse/LUCENE-2138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&amp;focusedCommentId=12788018#action_12788018
] 

Michael McCandless commented on LUCENE-2138:
--------------------------------------------

Could we maybe instead factor out ReaderPool from IW, and somehow enable this extensibility,
there?

This would be the first step in LUCENE-2026, I guess.

The mergedSegmentWarmer should then also go into ReaderPool.


&gt; Allow custom index readers when using IndexWriter.getReader
&gt; -----------------------------------------------------------
&gt;
&gt;                 Key: LUCENE-2138
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2138
&gt;             Project: Lucene - Java
&gt;          Issue Type: Improvement
&gt;          Components: Index
&gt;    Affects Versions: 3.0
&gt;            Reporter: Jason Rutherglen
&gt;            Priority: Minor
&gt;             Fix For: 3.1
&gt;
&gt;         Attachments: LUCENE-2138.patch
&gt;
&gt;
&gt; This is needed for backwards compatible support with Solr, and is a spin-off from SOLR-1606.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Commented: (LUCENE-2126) Split up IndexInput and IndexOutput into DataInput and DataOutput</title>
<author><name>&quot;Michael Busch (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c348464405.1260352098096.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c348464405-1260352098096-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T09:48:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

    [ https://issues.apache.org/jira/browse/LUCENE-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&amp;focusedCommentId=12788001#action_12788001
] 

Michael Busch commented on LUCENE-2126:
---------------------------------------

The main reason why I'd like to separate DataInput/Output from IndexInput/Output now is LUCENE-2125.
Users should be able to implement methods that serialize/deserialize attributes into/from
a postinglist. These methods should only be able to call the read/write methods (which this
issue moves to DataInput/Output), but not methods like close(), seek() etc.. 

Thanks for spending time reviewing this and giving feedback from Lucy land, Marvin!
I think I will go ahead and commit this, and once we see a need to allow users to extend DataInput/Output
outside of Lucene we can go ahead and make the additional changes that are mentioned in your
in my comments here.

So I will commit this tomorrow if nobody objects.

&gt; Split up IndexInput and IndexOutput into DataInput and DataOutput
&gt; -----------------------------------------------------------------
&gt;
&gt;                 Key: LUCENE-2126
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2126
&gt;             Project: Lucene - Java
&gt;          Issue Type: Improvement
&gt;    Affects Versions: Flex Branch
&gt;            Reporter: Michael Busch
&gt;            Assignee: Michael Busch
&gt;            Priority: Minor
&gt;             Fix For: Flex Branch
&gt;
&gt;         Attachments: lucene-2126.patch
&gt;
&gt;
&gt; I'd like to introduce the two new classes DataInput and DataOutput
&gt; that contain all methods from IndexInput and IndexOutput that actually
&gt; decode or encode data, such as readByte()/writeByte(),
&gt; readVInt()/writeVInt().
&gt; Methods like getFilePointer(), seek(), close(), etc., which are not
&gt; related to data encoding, but to files as input/output source stay in
&gt; IndexInput/IndexOutput.
&gt; This patch also changes ByteSliceReader/ByteSliceWriter to extend
&gt; DataInput/DataOutput. Previously ByteSliceReader implemented the
&gt; methods that stay in IndexInput by throwing RuntimeExceptions.
&gt; See also LUCENE-2125.
&gt; All tests pass.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>RE: [VOTE] Push fast-vector-highlighter mvn artifacts for 3.0.0 and 2.9.1</title>
<author><name>&quot;Uwe Schindler&quot; &lt;uwe@thetaphi.de&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c01AF1109550C4822A317DE46CFE66280@VEGA%3e"/>
<id>urn:uuid:%3c01AF1109550C4822A317DE46CFE66280@VEGA%3e</id>
<updated>2009-12-09T09:28:10Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>
Hi all,

The missing maven artifacts for the fast-vector-highlighter contrib of
Lucene Java in version 2.9.1 and 3.0.0 are now available at:

http://repo1.maven.org/maven2/org/apache/lucene/
http://repo2.maven.org/maven2/org/apache/lucene/

Uwe

-----
Uwe Schindler
uschindler@apache.org 
Apache Lucene Java Committer
Bremen, Germany
http://lucene.apache.org/java/docs/

&gt; From: Uwe Schindler [mailto:uwe@thetaphi.de]
&gt; Sent: Tuesday, December 08, 2009 10:41 PM
&gt; To: java-dev@lucene.apache.org; general@lucene.apache.org
&gt; Subject: RE: [VOTE] Push fast-vector-highlighter mvn artifacts for 3.0.0
&gt; and 2.9.1
&gt; 
&gt; I got 3 binding votes from Grant, Mike, and Ted (and one from Simon, who
&gt; was
&gt; a big help on Sunday evening when I created the artifacts), so I push the
&gt; maven artifacts onto the rsync repo in few minutes.
&gt; 
&gt; Thanks!
&gt; 
&gt; -----
&gt; Uwe Schindler
&gt; H.-H.-Meier-Allee 63, D-28213 Bremen
&gt; http://www.thetaphi.de
&gt; eMail: uwe@thetaphi.de
&gt; 
&gt; &gt; -----Original Message-----
&gt; &gt; From: Uwe Schindler [mailto:uwe@thetaphi.de]
&gt; &gt; Sent: Tuesday, December 08, 2009 7:03 PM
&gt; &gt; To: java-dev@lucene.apache.org
&gt; &gt; Subject: [VOTE] Push fast-vector-highlighter mvn artifacts for 3.0.0 and
&gt; &gt; 2.9.1
&gt; &gt;
&gt; &gt; Sorry,
&gt; &gt;
&gt; &gt; I initially didn't want to start a vote, as Grant only proposed to
&gt; "maybe
&gt; &gt; start one". But nobody responded (esp. to the questions in this mail) I
&gt; &gt; ask
&gt; &gt; again, an I will start the vote for now.
&gt; &gt;
&gt; &gt;
&gt; ==========================================================================
&gt; &gt; ==
&gt; &gt; Please vote, that the missing artifacts for of fast-verctor-highlighter
&gt; of
&gt; &gt; Lucene Java 2.9.1 and 3.0.0 should be pushed to repoX.maven.org.
&gt; &gt;
&gt; &gt; You can find the artifacts here:
&gt; &gt; http://people.apache.org/~uschindler/staging-area/
&gt; &gt;
&gt; &gt; This dir contains only the maven folder to be copied to maven-rsync dir
&gt; on
&gt; &gt; p.a.o. The top-level version in the maven metadata is 3.0.0, which
&gt; &gt; conforms
&gt; &gt; to the current state on maven (so during merging both folders during
&gt; &gt; build,
&gt; &gt; I set preference to metadata.xml of 3.0.0).
&gt; &gt;
&gt; &gt; All files are signed by my PGP key (even the 2.9.1 ones; that release
&gt; was
&gt; &gt; originally built by Mike McCandless).
&gt; &gt;
&gt; ==========================================================================
&gt; &gt; ==
&gt; &gt;
&gt; &gt; What I additionally found out until now (because Simon nagged me):
&gt; &gt;
&gt; &gt; If you compare the JAR files inside the binary ZIP file from the apache
&gt; &gt; archive and the JAR files directly published on maven (for the other
&gt; &gt; contribs), the MD5s/SHA1s are different even as they are created from
&gt; the
&gt; &gt; same source code (because the timestamps inside the JAR are different,
&gt; for
&gt; &gt; 2.9.1 another JDK compiler/platform was used). This interestingly does
&gt; not
&gt; &gt; apply to lucene-core.jar in 3.0. Because of that I see no problem with
&gt; &gt; this
&gt; &gt; maven release, even that they are not the orginal JAR files from the
&gt; &gt; binary
&gt; &gt; distrib.
&gt; &gt;
&gt; &gt; What is not nice, is that the svn revision number in the manifest is
&gt; &gt; different, but else is exactly the same, see my comments below in
&gt; earlier
&gt; &gt; mails about changing the ant script for showing the SVN rev of the last
&gt; &gt; changed file.
&gt; &gt;
&gt; &gt; So if nobody objects to release these rebuild jar files, all signed by
&gt; my
&gt; &gt; key, I would like to simply put them on the maven-rsync folder.
&gt; &gt;
&gt; &gt; Uwe
&gt; &gt;
&gt; &gt; -----
&gt; &gt; Uwe Schindler
&gt; &gt; H.-H.-Meier-Allee 63, D-28213 Bremen
&gt; &gt; http://www.thetaphi.de
&gt; &gt; eMail: uwe@thetaphi.de
&gt; &gt;
&gt; &gt;
&gt; &gt; &gt; -----Original Message-----
&gt; &gt; &gt; From: Chris Hostetter [mailto:hossman_lucene@fucit.org]
&gt; &gt; &gt; Sent: Tuesday, December 08, 2009 6:48 PM
&gt; &gt; &gt; To: java-dev@lucene.apache.org
&gt; &gt; &gt; Subject: Re: (NAG) Push fast-vector-highlighter mvn artifacts for 3.0
&gt; &gt; and
&gt; &gt; &gt; 2.9
&gt; &gt; &gt;
&gt; &gt; &gt;
&gt; &gt; &gt; : What to do now, any votes on adding the missing maven artifacts for
&gt; &gt; &gt; : fast-vector-highlighter to 2.9.1 and 3.0.0 on the apache maven
&gt; &gt; &gt; reposititory?
&gt; &gt; &gt;
&gt; &gt; &gt; It's not even clear to me that anything special needs to be done
&gt; before
&gt; &gt; &gt; publishing those jars to maven.  2.9.1 and 3.0.0 were already voted on
&gt; &gt; and
&gt; &gt; &gt; released -- including all of the source code in them.
&gt; &gt; &gt;
&gt; &gt; &gt; The safest bet least likely to anger the process gods is just to call
&gt; a
&gt; &gt; &gt; vote (new thread with VOTE in the subject) and cast a vote ...
&gt; &gt; considering
&gt; &gt; &gt; the sources has already been reviewed it should go pretty quick.
&gt; &gt; &gt;
&gt; &gt; &gt; :
&gt; &gt; &gt; : &gt; I rebuilt the maven-dir for 2.9.1 and 3.0.0, merged them (3.0.0 is
&gt; &gt; &gt; top-
&gt; &gt; &gt; : &gt; level
&gt; &gt; &gt; : &gt; version) and extracted only fast-vector-highlighter:
&gt; &gt; &gt; : &gt;
&gt; &gt; &gt; : &gt; http://people.apache.org/~uschindler/staging-area/
&gt; &gt; &gt; : &gt;
&gt; &gt; &gt; : &gt; I will copy this dir to the maven folder on people.a.o, when I got
&gt; &gt; &gt; votes
&gt; &gt; &gt; : &gt; (how many)? At least someone should check the signatures.
&gt; &gt; &gt; : &gt;
&gt; &gt; &gt; : &gt; By the way, we have a small error in our ant build.xml that
&gt; inserts
&gt; &gt; &gt; : &gt; svnversion into the manifest file. This version is not the version
&gt; &gt; of
&gt; &gt; &gt; the
&gt; &gt; &gt; : &gt; last changed item (would be svnversion -c) but the current svn
&gt; &gt; &gt; version,
&gt; &gt; &gt; : &gt; even
&gt; &gt; &gt; : &gt; that I checked out the corresponding tags. It's no problem at all,
&gt; &gt; but
&gt; &gt; &gt; not
&gt; &gt; &gt; : &gt; very nice.
&gt; &gt; &gt; : &gt;
&gt; &gt; &gt; : &gt; Maybe we should change build.xml to call "svnversion -c" in
&gt; future,
&gt; &gt; to
&gt; &gt; &gt; get
&gt; &gt; &gt; : &gt; the real number.
&gt; &gt; &gt; : &gt;
&gt; &gt; &gt; : &gt; Uwe
&gt; &gt; &gt; : &gt;
&gt; &gt; &gt; : &gt; -----
&gt; &gt; &gt; : &gt; Uwe Schindler
&gt; &gt; &gt; : &gt; H.-H.-Meier-Allee 63, D-28213 Bremen
&gt; &gt; &gt; : &gt; http://www.thetaphi.de
&gt; &gt; &gt; : &gt; eMail: uwe@thetaphi.de
&gt; &gt; &gt; : &gt;
&gt; &gt; &gt; : &gt;
&gt; &gt; &gt; : &gt; &gt; -----Original Message-----
&gt; &gt; &gt; : &gt; &gt; From: Grant Ingersoll [mailto:gsingers@apache.org]
&gt; &gt; &gt; : &gt; &gt; Sent: Saturday, December 05, 2009 10:26 PM
&gt; &gt; &gt; : &gt; &gt; To: java-dev@lucene.apache.org
&gt; &gt; &gt; : &gt; &gt; Subject: Re: Push fast-vector-highlighter mvn artifacts for 3.0
&gt; &gt; and
&gt; &gt; &gt; 2.9
&gt; &gt; &gt; : &gt; &gt;
&gt; &gt; &gt; : &gt; &gt; I suppose we could put up the artifacts on a dev site and then
&gt; we
&gt; &gt; &gt; could
&gt; &gt; &gt; : &gt; &gt; vote to release both of them pretty quickly.  I think that
&gt; should
&gt; &gt; be
&gt; &gt; &gt; : &gt; easy
&gt; &gt; &gt; : &gt; &gt; to do, since it pretty much only involves verifying the jar and
&gt; &gt; the
&gt; &gt; &gt; : &gt; &gt; signatures.
&gt; &gt; &gt; : &gt; &gt;
&gt; &gt; &gt; : &gt; &gt; On Dec 5, 2009, at 1:03 PM, Simon Willnauer wrote:
&gt; &gt; &gt; : &gt; &gt;
&gt; &gt; &gt; : &gt; &gt; &gt; hi folks,
&gt; &gt; &gt; : &gt; &gt; &gt; The maven artifacts for fast-vector-highlighter have never
&gt; been
&gt; &gt; &gt; pushed
&gt; &gt; &gt; : &gt; &gt; &gt; since it was released because there were no pom.xml.template
&gt; &gt; &gt; inside
&gt; &gt; &gt; : &gt; &gt; &gt; the module. I added a pom file a day ago in the context of
&gt; &gt; &gt; : &gt; &gt; &gt; LUCENE-2107. I already talked to uwe and grant how to deal
&gt; with
&gt; &gt; &gt; this
&gt; &gt; &gt; : &gt; &gt; &gt; issues and if we should push the artifact for Lucene 2.9 /
&gt; 3.0.
&gt; &gt; &gt; Since
&gt; &gt; &gt; : &gt; &gt; &gt; this is only a metadata file we could consider rebuilding
the
&gt; &gt; &gt; : &gt; &gt; &gt; artefacts and publish them for those releases. I can not
&gt; &gt; remember
&gt; &gt; &gt; that
&gt; &gt; &gt; : &gt; &gt; &gt; anything like that happened before, so we should discuss how
&gt; to
&gt; &gt; &gt; deal
&gt; &gt; &gt; : &gt; &gt; &gt; with this situation and if we should wait until 3.1.
&gt; &gt; &gt; : &gt; &gt; &gt;
&gt; &gt; &gt; : &gt; &gt; &gt; simon
&gt; &gt; &gt; : &gt; &gt; &gt;
&gt; &gt; &gt; : &gt; &gt; &gt; --------------------------------------------------------------
&gt; --
&gt; &gt; --
&gt; &gt; &gt; ---
&gt; &gt; &gt; : &gt; &gt; &gt; To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
&gt; &gt; &gt; : &gt; &gt; &gt; For additional commands, e-mail: java-dev-
&gt; help@lucene.apache.org
&gt; &gt; &gt; : &gt; &gt; &gt;
&gt; &gt; &gt; : &gt; &gt;
&gt; &gt; &gt; : &gt; &gt;
&gt; &gt; &gt; : &gt; &gt;
&gt; &gt; &gt; : &gt; &gt; ----------------------------------------------------------------
&gt; --
&gt; &gt; --
&gt; &gt; &gt; -
&gt; &gt; &gt; : &gt; &gt; To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
&gt; &gt; &gt; : &gt; &gt; For additional commands, e-mail: java-dev-help@lucene.apache.org
&gt; &gt; &gt; : &gt;
&gt; &gt; &gt; : &gt;
&gt; &gt; &gt; : &gt;
&gt; &gt; &gt; : &gt; ------------------------------------------------------------------
&gt; --
&gt; &gt; -
&gt; &gt; &gt; : &gt; To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
&gt; &gt; &gt; : &gt; For additional commands, e-mail: java-dev-help@lucene.apache.org
&gt; &gt; &gt; :
&gt; &gt; &gt; :
&gt; &gt; &gt; :
&gt; &gt; &gt; : --------------------------------------------------------------------
&gt; -
&gt; &gt; &gt; : To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
&gt; &gt; &gt; : For additional commands, e-mail: java-dev-help@lucene.apache.org
&gt; &gt; &gt; :
&gt; &gt; &gt;
&gt; &gt; &gt;
&gt; &gt; &gt;
&gt; &gt; &gt; -Hoss
&gt; &gt; &gt;
&gt; &gt; &gt;
&gt; &gt; &gt; ---------------------------------------------------------------------
&gt; &gt; &gt; To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
&gt; &gt; &gt; For additional commands, e-mail: java-dev-help@lucene.apache.org
&gt; &gt;
&gt; &gt;
&gt; &gt;
&gt; &gt; ---------------------------------------------------------------------
&gt; &gt; To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
&gt; &gt; For additional commands, e-mail: java-dev-help@lucene.apache.org
&gt; 



---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Commented: (LUCENE-2089) explore using automaton for fuzzyquery</title>
<author><name>&quot;Uwe Schindler (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c1252681990.1260350058195.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c1252681990-1260350058195-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T09:14:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

    [ https://issues.apache.org/jira/browse/LUCENE-2089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&amp;focusedCommentId=12787989#action_12787989
] 

Uwe Schindler commented on LUCENE-2089:
---------------------------------------

Cool! The code looks quite simple (but maybe this is because of n=1). But FuzzyQuery with
n&gt;1 are used seldom, or not? And how slow it is?

&gt; explore using automaton for fuzzyquery
&gt; --------------------------------------
&gt;
&gt;                 Key: LUCENE-2089
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2089
&gt;             Project: Lucene - Java
&gt;          Issue Type: Wish
&gt;          Components: Search
&gt;            Reporter: Robert Muir
&gt;            Assignee: Mark Miller
&gt;            Priority: Minor
&gt;         Attachments: Moman-0.2.1.tar.gz, TestFuzzy.java
&gt;
&gt;
&gt; Mark brought this up on LUCENE-1606 (i will assign this to him, I know he is itching
to write that nasty algorithm)
&gt; we can optimize fuzzyquery by using AutomatonTermEnum, here is my idea
&gt; * up front, calculate the maximum required K edits needed to match the users supplied
float threshold.
&gt; * for at least common K (1,2,3, etc) we should use automatontermenum. if its outside
of that, maybe use the existing slow logic. At high K, it will seek too much to be helpful
anyway.
&gt; i modified my wildcard benchmark to generate random fuzzy queries.
&gt; * Pattern: 7N stands for NNNNNNN, etc.
&gt; * AvgMS_DFA: this is the time spent creating the automaton (constructor)
&gt; ||Pattern||Iter||AvgHits||AvgMS(old)||AvgMS (new,total)||AvgMS_DFA||
&gt; |7N|10|64.0|4155.9|38.6|20.3|
&gt; |14N|10|0.0|2511.6|46.0|37.9|	
&gt; |28N|10|0.0|2506.3|93.0|86.6|
&gt; |56N|10|0.0|2524.5|304.4|298.5|
&gt; as you can see, this prototype is no good yet, because it creates the DFA in a slow way.
right now it creates an NFA, and all this wasted time is in NFA-&gt;DFA conversion.
&gt; So, for a very long string, it just gets worse and worse. This has nothing to do with
lucene, and here you can see, the TermEnum is fast (AvgMS - AvgMS_DFA), there is no problem
there.
&gt; instead we should just build a DFA to begin with, maybe with this paper: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.16.652
&gt; we can precompute the tables with that algorithm up to some reasonable K, and then I
think we are ok.
&gt; the paper references using http://portal.acm.org/citation.cfm?id=135907 for linear minimization,
if someone wants to implement this they should not worry about minimization.
&gt; in fact, we need to at some point determine if AutomatonQuery should even minimize FSM's
at all, or if it is simply enough for them to be deterministic with no transitions to dead
states. (The only code that actually assumes minimal DFA is the "Dumb" vs "Smart" heuristic
and this can be rewritten as a summation easily). we need to benchmark really complex DFAs
(i.e. write a regex benchmark) to figure out if minimization is even helping right now.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Commented: (LUCENE-1815) Geohash encode/decode floating point problems</title>
<author><name>&quot;Wouter Heijke (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c1852937402.1260347780347.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c1852937402-1260347780347-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T08:36:20Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

    [ https://issues.apache.org/jira/browse/LUCENE-1815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&amp;focusedCommentId=12787973#action_12787973
] 

Wouter Heijke commented on LUCENE-1815:
---------------------------------------

I'm happily using now for some time:

http://code.google.com/p/geospatialweb/source/browse/trunk/geohash/src/Geohash.java


&gt; Geohash encode/decode floating point problems
&gt; ---------------------------------------------
&gt;
&gt;                 Key: LUCENE-1815
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-1815
&gt;             Project: Lucene - Java
&gt;          Issue Type: Bug
&gt;          Components: contrib/spatial
&gt;    Affects Versions: 2.9
&gt;            Reporter: Wouter Heijke
&gt;            Priority: Minor
&gt;
&gt; i'm finding the Geohash support in the spatial package to be rather unreliable.
&gt; Here is the outcome of a test that encodes/decodes the same lat/lon and geohash a few
times.
&gt; the format:
&gt; action geohash=(latitude, longitude)
&gt; the result:
&gt; encode u173zq37x014=(52.3738007,4.8909347)
&gt; decode u173zq37x014=(52.373799999999996,4.890934)
&gt; encode u173zq37rpbw=(52.373799999999996,4.890934)
&gt; decode u173zq37rpbw=(52.373799999999996,4.8909329999999995)
&gt; encode u173zq37qzzy=(52.373799999999996,4.8909329999999995)
&gt; if I now change to the google code implementation:
&gt; encode u173zq37x014=(52.3738007,4.8909347)
&gt; decode u173zq37x014=(52.37380061298609,4.890934377908707)
&gt; encode u173zq37x014=(52.37380061298609,4.890934377908707)
&gt; decode u173zq37x014=(52.37380061298609,4.890934377908707)
&gt; encode u173zq37x014=(52.37380061298609,4.890934377908707)
&gt; Note the differences between the geohashes in both situations and the lat/lon's!
&gt; Now things get worse if you work on low-precision geohashes:
&gt; decode u173=(52.0,4.0)
&gt; encode u14zg429yy84=(52.0,4.0)
&gt; decode u14zg429yy84=(52.0,3.999999)
&gt; encode u14zg429ywx6=(52.0,3.999999)
&gt; and google:
&gt; decode u173=(52.20703125,4.5703125)
&gt; encode u17300000000=(52.20703125,4.5703125)
&gt; decode u17300000000=(52.20703125,4.5703125)
&gt; encode u17300000000=(52.20703125,4.5703125)
&gt; We are using geohashes extensively and will now use the google code version unfortunately.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Commented: (LUCENE-1815) Geohash encode/decode floating point problems</title>
<author><name>&quot;patrick o'leary (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c2070655500.1260338958230.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c2070655500-1260338958230-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T06:09:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

    [ https://issues.apache.org/jira/browse/LUCENE-1815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&amp;focusedCommentId=12787934#action_12787934
] 

patrick o'leary commented on LUCENE-1815:
-----------------------------------------

What google code are you working with?


&gt; Geohash encode/decode floating point problems
&gt; ---------------------------------------------
&gt;
&gt;                 Key: LUCENE-1815
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-1815
&gt;             Project: Lucene - Java
&gt;          Issue Type: Bug
&gt;          Components: contrib/spatial
&gt;    Affects Versions: 2.9
&gt;            Reporter: Wouter Heijke
&gt;            Priority: Minor
&gt;
&gt; i'm finding the Geohash support in the spatial package to be rather unreliable.
&gt; Here is the outcome of a test that encodes/decodes the same lat/lon and geohash a few
times.
&gt; the format:
&gt; action geohash=(latitude, longitude)
&gt; the result:
&gt; encode u173zq37x014=(52.3738007,4.8909347)
&gt; decode u173zq37x014=(52.373799999999996,4.890934)
&gt; encode u173zq37rpbw=(52.373799999999996,4.890934)
&gt; decode u173zq37rpbw=(52.373799999999996,4.8909329999999995)
&gt; encode u173zq37qzzy=(52.373799999999996,4.8909329999999995)
&gt; if I now change to the google code implementation:
&gt; encode u173zq37x014=(52.3738007,4.8909347)
&gt; decode u173zq37x014=(52.37380061298609,4.890934377908707)
&gt; encode u173zq37x014=(52.37380061298609,4.890934377908707)
&gt; decode u173zq37x014=(52.37380061298609,4.890934377908707)
&gt; encode u173zq37x014=(52.37380061298609,4.890934377908707)
&gt; Note the differences between the geohashes in both situations and the lat/lon's!
&gt; Now things get worse if you work on low-precision geohashes:
&gt; decode u173=(52.0,4.0)
&gt; encode u14zg429yy84=(52.0,4.0)
&gt; decode u14zg429yy84=(52.0,3.999999)
&gt; encode u14zg429ywx6=(52.0,3.999999)
&gt; and google:
&gt; decode u173=(52.20703125,4.5703125)
&gt; encode u17300000000=(52.20703125,4.5703125)
&gt; decode u17300000000=(52.20703125,4.5703125)
&gt; encode u17300000000=(52.20703125,4.5703125)
&gt; We are using geohashes extensively and will now use the google code version unfortunately.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Updated: (LUCENE-2089) explore using automaton for fuzzyquery</title>
<author><name>&quot;Robert Muir (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c387469272.1260326538097.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c387469272-1260326538097-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T02:42:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

     [ https://issues.apache.org/jira/browse/LUCENE-2089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Robert Muir updated LUCENE-2089:
--------------------------------

    Attachment: TestFuzzy.java

attached is a testcase that builds n=1 for some string the slow way.
you can use this to verify that some faster method is correct, via assertEquals


&gt; explore using automaton for fuzzyquery
&gt; --------------------------------------
&gt;
&gt;                 Key: LUCENE-2089
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2089
&gt;             Project: Lucene - Java
&gt;          Issue Type: Wish
&gt;          Components: Search
&gt;            Reporter: Robert Muir
&gt;            Assignee: Mark Miller
&gt;            Priority: Minor
&gt;         Attachments: Moman-0.2.1.tar.gz, TestFuzzy.java
&gt;
&gt;
&gt; Mark brought this up on LUCENE-1606 (i will assign this to him, I know he is itching
to write that nasty algorithm)
&gt; we can optimize fuzzyquery by using AutomatonTermEnum, here is my idea
&gt; * up front, calculate the maximum required K edits needed to match the users supplied
float threshold.
&gt; * for at least common K (1,2,3, etc) we should use automatontermenum. if its outside
of that, maybe use the existing slow logic. At high K, it will seek too much to be helpful
anyway.
&gt; i modified my wildcard benchmark to generate random fuzzy queries.
&gt; * Pattern: 7N stands for NNNNNNN, etc.
&gt; * AvgMS_DFA: this is the time spent creating the automaton (constructor)
&gt; ||Pattern||Iter||AvgHits||AvgMS(old)||AvgMS (new,total)||AvgMS_DFA||
&gt; |7N|10|64.0|4155.9|38.6|20.3|
&gt; |14N|10|0.0|2511.6|46.0|37.9|	
&gt; |28N|10|0.0|2506.3|93.0|86.6|
&gt; |56N|10|0.0|2524.5|304.4|298.5|
&gt; as you can see, this prototype is no good yet, because it creates the DFA in a slow way.
right now it creates an NFA, and all this wasted time is in NFA-&gt;DFA conversion.
&gt; So, for a very long string, it just gets worse and worse. This has nothing to do with
lucene, and here you can see, the TermEnum is fast (AvgMS - AvgMS_DFA), there is no problem
there.
&gt; instead we should just build a DFA to begin with, maybe with this paper: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.16.652
&gt; we can precompute the tables with that algorithm up to some reasonable K, and then I
think we are ok.
&gt; the paper references using http://portal.acm.org/citation.cfm?id=135907 for linear minimization,
if someone wants to implement this they should not worry about minimization.
&gt; in fact, we need to at some point determine if AutomatonQuery should even minimize FSM's
at all, or if it is simply enough for them to be deterministic with no transitions to dead
states. (The only code that actually assumes minimal DFA is the "Dumb" vs "Smart" heuristic
and this can be rewritten as a summation easily). we need to benchmark really complex DFAs
(i.e. write a regex benchmark) to figure out if minimization is even helping right now.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Commented: (LUCENE-2126) Split up IndexInput and IndexOutput into DataInput and DataOutput</title>
<author><name>&quot;Marvin Humphrey (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c56602394.1260324618265.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c56602394-1260324618265-JavaMail-jira@brutus%3e</id>
<updated>2009-12-09T02:10:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

    [ https://issues.apache.org/jira/browse/LUCENE-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&amp;focusedCommentId=12787876#action_12787876
] 

Marvin Humphrey commented on LUCENE-2126:
-----------------------------------------

I spent a long time today trying to understand why DataInput and 
DataOutput are justified so that I could formulate an intelligent reply, 
but I had to give up. :\ Please carry on.

&gt; Split up IndexInput and IndexOutput into DataInput and DataOutput
&gt; -----------------------------------------------------------------
&gt;
&gt;                 Key: LUCENE-2126
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2126
&gt;             Project: Lucene - Java
&gt;          Issue Type: Improvement
&gt;    Affects Versions: Flex Branch
&gt;            Reporter: Michael Busch
&gt;            Assignee: Michael Busch
&gt;            Priority: Minor
&gt;             Fix For: Flex Branch
&gt;
&gt;         Attachments: lucene-2126.patch
&gt;
&gt;
&gt; I'd like to introduce the two new classes DataInput and DataOutput
&gt; that contain all methods from IndexInput and IndexOutput that actually
&gt; decode or encode data, such as readByte()/writeByte(),
&gt; readVInt()/writeVInt().
&gt; Methods like getFilePointer(), seek(), close(), etc., which are not
&gt; related to data encoding, but to files as input/output source stay in
&gt; IndexInput/IndexOutput.
&gt; This patch also changes ByteSliceReader/ByteSliceWriter to extend
&gt; DataInput/DataOutput. Previously ByteSliceReader implemented the
&gt; methods that stay in IndexInput by throwing RuntimeExceptions.
&gt; See also LUCENE-2125.
&gt; All tests pass.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Updated: (LUCENE-2138) Allow custom index readers when using IndexWriter.getReader</title>
<author><name>&quot;Jason Rutherglen (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c1040979000.1260315618244.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c1040979000-1260315618244-JavaMail-jira@brutus%3e</id>
<updated>2009-12-08T23:40:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>

     [ https://issues.apache.org/jira/browse/LUCENE-2138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Jason Rutherglen updated LUCENE-2138:
-------------------------------------

    Attachment: LUCENE-2138.patch

IndexWriter has an API to set and get the IndexReaderFactory which creates SegmentReaders.
 

&gt; Allow custom index readers when using IndexWriter.getReader
&gt; -----------------------------------------------------------
&gt;
&gt;                 Key: LUCENE-2138
&gt;                 URL: https://issues.apache.org/jira/browse/LUCENE-2138
&gt;             Project: Lucene - Java
&gt;          Issue Type: Improvement
&gt;          Components: Index
&gt;    Affects Versions: 3.0
&gt;            Reporter: Jason Rutherglen
&gt;            Priority: Minor
&gt;             Fix For: 3.1
&gt;
&gt;         Attachments: LUCENE-2138.patch
&gt;
&gt;
&gt; This is needed for backwards compatible support with Solr, and is a spin-off from SOLR-1606.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
<entry>
<title>[jira] Created: (LUCENE-2138) Allow custom index readers when using IndexWriter.getReader</title>
<author><name>&quot;Jason Rutherglen (JIRA)&quot; &lt;jira@apache.org&gt;</name></author>
<link rel="alternate" href="http://mail-archives.apache.org/mod_mbox/lucene-java-dev/200912.mbox/%3c1378444889.1260314358098.JavaMail.jira@brutus%3e"/>
<id>urn:uuid:%3c1378444889-1260314358098-JavaMail-jira@brutus%3e</id>
<updated>2009-12-08T23:19:18Z</updated>
<content type="xhtml">
<div xmlns="http://www.w3.org/1999/xhtml">
<pre>
Allow custom index readers when using IndexWriter.getReader
-----------------------------------------------------------

                 Key: LUCENE-2138
                 URL: https://issues.apache.org/jira/browse/LUCENE-2138
             Project: Lucene - Java
          Issue Type: Improvement
          Components: Index
    Affects Versions: 3.0
            Reporter: Jason Rutherglen
            Priority: Minor
             Fix For: 3.1


This is needed for backwards compatible support with Solr, and is a spin-off from SOLR-1606.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org



</pre>
</div>
</content>
</entry>
</feed>
