lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Günter Kukies <guenter.kuk...@heuft.com>
Subject Re: IndexWriter addDocument NullPointerException
Date Mon, 24 Feb 2003 07:22:25 GMT
So, weekend is over.

here is some code :

   private void addDocument(IndexWriter writer, File file ) throws
IOException, InterruptedException {
        String path = file.getName();
        log( "-------------start Indexing:" + path );
        Document doc = null;
        if(path.matches("\\d+_\\d{4}_[a-z]{2,3}\\.pdf")) {
            doc = PDF_Document_Parser.getDocument(this,RealPath,file);
        }
        else if(path.matches("\\d+_\\d{4}_[a-z]{2,3}\\.txt")) {
            doc = TXT_Document_Parser.getDocument(this,RealPath,file);
        }
        else {
            log("do nothing");
        }

        log("doc: "+doc);
        if( doc != null ) {
            try {
                writer.addDocument(doc);
            }
            catch(Exception ex) {
                System.err.println("hallo1 "+doc);
                ex.printStackTrace();
                System.err.println("hallo2");
                log("ERROR writer.addDocument(doc);");
            }
        }
        else {
            log( "Skipping " + path );
        }
        log( "-------------end Indexing:" + path );
    }



Here is the output:

hallo1 Document<Text<contents:java.io.InputStreamReader@18a3a1d>
Unindexed<email:hansdampf@xxx.com> Unindexed<name:Hans Dampf>
Text<summary:Equipo de deteccion 2002> Text<title:Equipo de deteccion 2002>
Text<doctypeid:0001> Unindexed<lifetime:0>
org.apache.lucene.document.Field@1ed688f Keyword<modified:0dcek766w>
Keyword<username:hda>
Unindexed<relative_path_xml:documents/news_new/sub1/sub11/sub111/10457359746
80_0001_hda.xml> Unindexed<category:documents/news_new/sub1/sub11/sub111/>
Keyword<search_all:all> org.apache.lucene.document.Field@16b98c3
Unindexed<relative_path:documents/news_new/sub1/sub11/sub111/1045735974680_0
001_hda.pdf>>
java.lang.NullPointerException
hallo2
hallo1 Document<Text<contents:java.io.InputStreamReader@c12978>
Unindexed<email:hansdampf@xxx.com> Unindexed<name:Hans Dampf>
Text<summary:testsummary> Text<title:testtitle> Text<doctypeid:0001>
Unindexed<lifetime:0> org.apache.lucene.document.Field@189c12a
Keyword<modified:0dcek76bm> Keyword<username:hda>
Unindexed<relative_path_xml:documents/news_new/sub1/sub11/sub111/10457359748
50_0001_hda.xml> Unindexed<category:documents/news_new/sub1/sub11/sub111/>
Keyword<search_all:all> org.apache.lucene.document.Field@e8c7db
Unindexed<relative_path:documents/news_new/sub1/sub11/sub111/1045735974850_0
001_hda.pdf>>
java.lang.NullPointerException
hallo2

Günter

----- Original Message -----
From: "Tatu Saloranta" <tatu@hypermall.net>
To: "Lucene Users List" <lucene-user@jakarta.apache.org>
Sent: Saturday, February 22, 2003 5:32 PM
Subject: Re: IndexWriter addDocument NullPointerException


> On Friday 21 February 2003 13:22, Günter Kukies wrote:
> > Hello,
> >
> > I don't have any line number.
>
> You unfortunately do need to know the line number, if you do get an
exception
> and try to see where it occurs.
> Another less frequent problem is that you actually get the exception as an
> object and print out that exception; in that case you would just see
> "java.lang.NullPointerException", and nothing else?
> Otherwise, based on your code, you should see a stack trace, with or
without
> line numbers. But you would at least see the method call stack, which
would
> help in figuring out where problem occured.
>
> However, if you do catch an exception, and stack trace doesn't have line
> numbers (it seems that some JVMs do not have line number info available
when
> running JIT'ed code) there are basically two ways to figure out exact
> location:
>
> (1) Try to make JVM get the line number info (either running in
interpreted
>   mode; I think there was option, something like '-Djava.compiler= ' to
>   disable JIT?)
> (2) Run code in a debugger. One nice free debugger (if you are not using
an
>   IDE that has one is JSwat:
>      http://www.bluemarsh.com/java/jswat/
>
> Hope this helps,
>
> -+ Tatu +-
>
> >
> > this is the code snippet:
> >
> > Document doc;
> > IndexWriter writer;
> >
> > .....
> >
> > try{
> > writer.addDocument(doc);
> > }
> > catch(Exception ex){
> > ex.printStackTrace();
> > }
> >
> > this is the output on Standard.out:
> >
> > java.lang.NullPointerException
> >
> >
> > and nothing more.
> >
> > The doc is not null and System.out.println(doc) seems to be ok. There is
> > no difference between the working 80% and the not working 20% doc's.
> > Thanks,
> >
> > Günter
> >
> > > On Friday 21 February 2003 05:33, Günter Kukies wrote:
> > >> Hello,
> > >>
> > >> writer.addDocument(doc) is throwing an NullPointerException. The
> > >> stacktrace from the catched Exception is only one line
> > >> "NullPointerException" without anything else. I open the IndexWriter
> > >> with create true. Run over the files in a Directory and add all found
> > >> documents. After that i close the indexwriter. 80% of the documents
> > >> were added without problems. The rest gets that NullPointerException.
> > >>
> > >> Any Ideas?
> > >
> > > Perhaps look at the line where the null pointer exception is thrown
and
> > > see  what happens? NullPointerException is thrown when a null
reference
> > > is being  de-referenced. Seeing the immediate cause should be easy,
> > > given line number.
> > >
> > > Perhaps you have added a field with null value? (just a guess, I don't
> > > know if  that's even illegal).
> > >
> > > -+ Tatu +-
> > >
> > >
> > >
> > > ---------------------------------------------------------------------
> > > To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For
> > > additional commands, e-mail: lucene-user-help@jakarta.apache.org
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> > For additional commands, e-mail: lucene-user-help@jakarta.apache.org
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
>
>


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message