oodt-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Nguyen, Ricky" <rngu...@chla.usc.edu>
Subject Re: crawler with unique action fails on first ingest
Date Sat, 19 Nov 2011 01:04:06 GMT
Thanks Chris! (and Paul if he ends up working on it :P )

On Nov 18, 2011, at 4:55 PM, Mattmann, Chris A (388J) wrote:

> Hey Ricky,
> 
> I've ran into this a number of times myself and recently Paul Ramirez and I were talking
about this too. Paul even 
> said he would try and fix it (ha! I'm signing him up for work :P ). Actually I'll just
look at it myself.
> 
> In the meanwhile, the workaround is exactly the one you stated. Ingest a file, that gets
you a catalog. Then, you can 
> simply delete the file if you want using fmquery | fmdel and then Unique works just fine.
> 
> Cheers,
> Chris
> 
> On Nov 18, 2011, at 4:52 PM, Nguyen, Ricky wrote:
> 
>> Hi,
>> 
>> I am trying to run a crawler using "--actionIds Unique". Since this is the first
time I am ingesting a file into FileMgr, the user guide [1] says that the catalog dir MUST
NOT exist so that Lucene can create it. However, the crawler fails with the error:
>> 
>> IOException when opening index directory: [/Users/rnguyen/vpicu/data/catalog] for
search: Message: /Users/rnguyen/vpicu/data/catalog is not a directory
>> 
>> Seems like crawler is trying to search for a product (to determine it's uniqueness),
but the catalog hasn't been created yet. I guess since I have no catalog, the workaround is
to omit the "Unique" action.
>> 
>> But if I use crawler as a daemon, it would be useful to leave "Unique" as an action.
Any thoughts on the right course?
>> 
>> Thanks,
>> Ricky
>> 
>> [1] http://oodt.apache.org/components/maven/filemgr/user/basic.html
>> 
>> 
>> ---------------------------------------------------------------------
>> CONFIDENTIALITY NOTICE: This e-mail message, including any attachments, 
>> is for the sole use of the intended recipient(s) and may contain confidential
>> or legally privileged information. Any unauthorized review, use, disclosure
>> or distribution is prohibited. If you are not the intended recipient, please
>> contact the sender by reply e-mail and destroy all copies of this original message.
 
>> 
>> ---------------------------------------------------------------------
>> 
> 
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 




---------------------------------------------------------------------
CONFIDENTIALITY NOTICE: This e-mail message, including any attachments, 
is for the sole use of the intended recipient(s) and may contain confidential
or legally privileged information. Any unauthorized review, use, disclosure
or distribution is prohibited. If you are not the intended recipient, please
contact the sender by reply e-mail and destroy all copies of this original message.  

---------------------------------------------------------------------


Mime
View raw message