incubator-couchdb-user mailing list archives

From Hank Knight
Subject couchdb-lucene: ignore certain elements of HTML attachments
Date Mon, 07 Apr 2014 20:29:26 GMT
Using couchdb-lucene, is there a way to ignore all content inside a
blacklisted element of HTML attachments? Certain common information
is found in the header of every HTML document, including links to
other pages, and it would be ideal for these common areas not to be
indexed. For example:

<div id="header">Hello</div>
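
To make the desired behaviour concrete, here is a minimal Python sketch of the kind of filtering meant. It is not a couchdb-lucene feature or API. The names BlacklistStripper and indexable_text are hypothetical, and it assumes reasonably well-formed HTML that would be pre-processed before the remaining text is handed to an indexer.

```python
from html.parser import HTMLParser

# Void elements never get a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class BlacklistStripper(HTMLParser):
    """Hypothetical helper: collects text, skipping blacklisted elements."""

    def __init__(self, blacklisted_ids):
        super().__init__(convert_charrefs=True)
        self.blacklisted_ids = set(blacklisted_ids)
        self.skip_depth = 0   # > 0 while inside a blacklisted element
        self.chunks = []      # text fragments that should be indexed

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        if self.skip_depth:
            # Already inside a blacklisted element: track nesting.
            self.skip_depth += 1
        elif dict(attrs).get("id") in self.blacklisted_ids:
            self.skip_depth = 1

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if self.skip_depth:
            self.skip_depth -= 1

    def handle_data(self, data):
        if not self.skip_depth:
            self.chunks.append(data)


def indexable_text(html, blacklisted_ids=("header",)):
    """Return the text of `html` with blacklisted elements removed."""
    parser = BlacklistStripper(blacklisted_ids)
    parser.feed(html)
    parser.close()
    # Collapse runs of whitespace left behind by removed markup.
    return " ".join(" ".join(parser.chunks).split())
```

With this sketch, indexable_text('<div id="header">Hello</div><p>Body text</p>') would yield only the body text, which is the result wanted from the index as well.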
