lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Curtis d'Entremont (JIRA)" <j...@apache.org>
Subject [jira] Created: (LUCENE-591) Add meta keywords to HTMLParser
Date Wed, 07 Jun 2006 15:37:29 GMT
Add meta keywords to HTMLParser
-------------------------------

         Key: LUCENE-591
         URL: http://issues.apache.org/jira/browse/LUCENE-591
     Project: Lucene - Java
        Type: Improvement

  Components: Examples  
    Versions: 2.0.0    
    Reporter: Curtis d'Entremont
    Priority: Minor



It would be good if the HTMLParser could give us the keywords specified in the meta tags,
so that we can index them.

In HTMLParser.jj:

  void addMetaTag() {
      metaTags.setProperty(currentMetaTag, currentMetaContent);
      currentMetaTag = null;
      currentMetaContent = null;
      return;
  }

One way to do it:

  void addMetaTag() throws IOException {
      metaTags.setProperty(currentMetaTag, currentMetaContent);
      if (currentMetaTag.equalsIgnoreCase("keywords")) {
          pipeOut.write(currentMetaContent);
      }
      currentMetaTag = null;
      currentMetaContent = null;
      return;
  }


-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message