lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erick Erickson <>
Subject Re: Searching Subversion comments:
Date Mon, 08 Mar 2010 21:30:03 GMT
Hi Otis!

Your examples look JIRA-centric, not code-centric. Frankly I'm not
sure there's a difference for my use-case, but....

Let's say I want to answer the question "what source files were
changed for JIRA-1234". Currently I'd have to open up the JIRA and
collate all the changed files by opening the patches, writing down the
file names and making a list. Given the number of patches sometimes
attached, that looks like it can get tedious.

Either the committers have run across this many times and cursed
every time or there's *already* a way to handle it....

The link in my original mail allegedly lets me query on such a thing. As
well as the source code with full Lucene syntax. Of course, answering
the question I posed above depends on the fidelity of the SVN comments
at commit time...

You can always to a grep of the source code for stuff *in* the source
but not with full Lucene syntax, and not in the SVN comments. And I've run
into a similar situation at work, so I'd be gathering information for there

I'm not wedded to the idea, but I'd be willing to devote some time to it
if others thought it *might* be useful. I'd imagine a proof-of-concept hack,
one of the outcomes of which is "interesting, but not worth the
or "utterly and completely useless in our situation", or "Why the heck
we have this ages ago?". But I don't even want  to do a POC if there's
already a mechanism that people like. I just hate manual collation....

Although I have the vision of *this* group saying "we could improve that,
where's the source code?"....


On Mon, Mar 8, 2010 at 4:06 PM, Otis Gospodnetic <
> wrote:

> Hi Erick,
> For what it's worth, we are considering indexing JIRA comments over on
> , though I'm not entirely convinced searching in
> comments would be super valuable.  Would it?
> But note that JIRA (and LucidFind) already do that.  For example, go to
> and search for "Attached
> first cut python script"~10 (it's in that issue's comments) and
> JIRA will find that issue.
> What exactly are you lokoing to do/build?
> Otis
> ----
> Sematext :: :: Solr - Lucene - Nutch
> Hadoop ecosystem search ::
> ----- Original Message ----
> > From: Erick Erickson <>
> > To: java-user <>
> > Sent: Mon, March 8, 2010 3:48:41 PM
> > Subject: Searching Subversion comments:
> >
> > Before I reinvent the wheel.....
> >
> > Is there any convenient way to, say, find all the files associated with
> > patch XXXX? I realize one can (hopefully) get this information from JIRA,
> > but... This is a subset of the problem of searching Subversion comments.
> >
> > I can see it being useful, especially for people coming into the code
> fresh.
> > Grep (or the equivalent in the IDE) only goes so far. If there's any
> > interest, I'm thinking of playing with
> to
> > see what I could see and report back. It should be easy enough to set up
> on
> > my machine at home, although I'm not set up to show it to others.
> >
> > And it's even based on Lucene. This is feeling recursive..
> >
> > Mostly I'm checking to see if something like this has already been done
> and
> > I just missed the boat. Besides, I'm curious...
> >
> > Erick
> ---------------------------------------------------------------------
> To unsubscribe, e-mail:
> For additional commands, e-mail:

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message