lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Simon Willnauer" <simon.willna...@googlemail.com>
Subject GData index html documents
Date Sun, 30 Jul 2006 17:39:25 GMT
Hello all,

I'm at a point where I have to retrieve data from entry elements which
could contain text, html, xhtml or even xml. So there is not problem
so far. Detecting which format the element contains is also pretty
easy as each element has a "type" attribute. if there is not such type
attribute I treat it like html and remove all html tags.
So my kind of problem is a licence problem. I'd like to use CyberNeko
HTML parser the licence looks different to the apache licene although
the licence has this sentence at the very bottom:
"This license is based on the Apache Software License, version 1.1."

http://people.apache.org/~andyc/neko/LICENSE

I know that any software, lib, jar whatever distributed with apache
project must be apache licenced. I'm not familiar with all the licence
stuff so some help would be greatly appreciated.
So can I add the cyberneko jar to the gdata project?
I might send Andy Clark an email if he grands me a licence...

regards simon

---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message