lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Che Dong" <ched...@hotmail.com>
Subject Re: making XML from articles
Date Mon, 07 Jul 2003 10:32:58 GMT
>>        // just remove invalid characters: in php
>>        $pattern ="/[\x-\x8\xb-\xc\xe-\x1f]/";
>>        $string = preg_replace($pattern,'',$string);

----- Original Message ----- 
From: "Jagdip Singh" <jxs1878@cs.rit.edu>
To: "'Lucene Users List'" <lucene-user@jakarta.apache.org>
Sent: Monday, July 07, 2003 7:53 AM
Subject: making XML from articles


> Hi,
> I am trying to use Lucene for searching articles (text files) and web
> pages. I am thinking of converting those articles to XML files and then
> feed to Lucene for indexing.
> I have not done anything much with XML before and trying to know if this
> is going to be a better idea in term of searching. 
> How can I convert text into XML?
>  
> Please suggest me if someone has faced similar situation before.
>  
> Regards, 
> Jagdip
> 
Mime
View raw message