Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 72027 invoked from network); 12 Dec 2007 06:31:21 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 12 Dec 2007 06:31:21 -0000 Received: (qmail 6754 invoked by uid 500); 12 Dec 2007 06:31:04 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 6712 invoked by uid 500); 12 Dec 2007 06:31:04 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 6701 invoked by uid 99); 12 Dec 2007 06:31:04 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 11 Dec 2007 22:31:04 -0800 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [206.190.38.62] (HELO web50308.mail.re2.yahoo.com) (206.190.38.62) by apache.org (qpsmtpd/0.29) with SMTP; Wed, 12 Dec 2007 06:30:41 +0000 Received: (qmail 22778 invoked by uid 60001); 12 Dec 2007 06:30:43 -0000 DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com; h=X-YMail-OSG:Received:X-Mailer:Date:From:Subject:To:MIME-Version:Content-Type:Content-Transfer-Encoding:Message-ID; b=Z3XWWYxV0qejitwH1mVOFOpYuqOA1Rz+WkXYB44x0NoVWXt0/TL00nUSKGxNNKEVBMKzphZvpqbJ+X6WTlxP3cAJW+lI4Q5+aHVS8nyvyJMVN4MZG6sIhpt4Q3nXmWzVWG1U0JgU9oiXTYVAb7p6VjeOYjWghHhO8rzScSUNUWE=; X-YMail-OSG: qnsNYyMVM1lO..pH_2INR2XXhFrkH71WaAR1S4TVNVqfnZ8NWTOlptCCvkAFkB5MZuVB_Ou2xX6le2DWqMW5i_LLqyw8S3GGDBf6HPlKRqyZkc9TzVZ4xUj0C8z9QRmuqGgb_VAm_ThaclQ- Received: from [72.231.9.236] by web50308.mail.re2.yahoo.com via HTTP; Tue, 11 Dec 2007 22:30:43 PST X-Mailer: YahooMailRC/818.31 YahooMailWebService/0.7.158.1 Date: Tue, 11 Dec 2007 22:30:43 -0800 (PST) From: Otis Gospodnetic Subject: Re: Indexing XML document To: java-user@lucene.apache.org MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Message-ID: <555499.22042.qm@web50308.mail.re2.yahoo.com> X-Virus-Checked: Checked by ClamAV on apache.org Liaqat,=0A=0AOut of curiosity - what are you using to analyze and index Urd= u? AraMorph or something else?=0A=0AThanks,=0AOtis=0A--=0ASematext -- http= ://sematext.com/ -- Lucene - Solr - Nutch=0A=0A----- Original Message ----= =0AFrom: Liaqat Ali =0ATo: java-user@lucene.apache= .org=0ASent: Tuesday, December 4, 2007 1:04:45 PM=0ASubject: Indexing XML d= ocument=0A=0AHi all,=0A=0AI want to index an XML file,containing 200 Urdu l= anguage (Varient of =0AArabic and Persian) documents. This corpus is in CES= format,consisting =0Aof information about author and many more, I just wan= t to extract =0Atextual data of each document and relative Doc number and t= itle in each=0A =0Adocument using SAX.=0A=0AThe problem I m facing that wha= t should be the output of this whole =0Aprocessing, which is acceptable to = Lucene Indexer. I just want to store=0A =0ADocument number, and Title with = each document. The example given below =0Ais Doc 2 from that XML file. I wa= nt to make complete index of 200 =0Adocuments with Doc number and title... = Kindly guide me......=0A=0A=0ADoc 2=0A=0A=D8=AD= =DA=A9=D9=85=D8=AA =DB=8C=D8=A7=D8=B1 =DA=A9=D9=88 =D8=A7=DB=8C=D8=B1=D8=A7= =D9=86 =D8=A8=D8=AF=D8=B1 =DA=A9=D8=B1=D9=86=DB=92 =D9=BE=D8=B1=0A =D8=BA= =D9=88=D8=B1=0A

=0A

=D8=A7=D9=88=D8=B1 =D8=AE=D8=A8=D8=B1=DB= =8C=DA=BA =DB=81=DB=8C=DA=BA =DA=A9=DB=81 =D8=A7=D9=86=DA=BE=DB=8C=DA=BA = =D8=A7=DB=8C=D8=B1=D8=A7=D9=86 =D8=A8=D8=AF=D8=B1 =DA=A9=D8=B1=D9=86=DB=92= =0A =D9=BE=D8=B1 =D8=A8=DA=BE=DB=8C =D8=BA=D9=88=D8=B1 =DA=A9=DB=8C=D8=A7 = =D8=AC=D8=A7 =D8=B1=DB=81=D8=A7 =DB=81=DB=92=DB=94 =D8=AD=DA=A9=D9=85=D8=AA= =0A=DB=8C=D8=A7=D8=B1 =D8=AC=D9=88 =D8=B3=D8=A7=D8=A8=D9=82 =D8=B3=D9=88= =D9=88=DB=8C=D8=AA =DB=8C=D9=88=D9=86=DB=8C=D9=86 =DA=A9=DB=8C =D9=85=D8=AF= =D8=A7=D8=AE=D9=84=D8=AA =DA=A9=DB=92=0A =D8=AE=D9=84=D8=A7=D9=81 =D8=A7=D9= =85=D8=B1=DB=8C=DA=A9=DB=8C =D8=AD=D9=85=D8=A7=DB=8C=D8=AA =D8=B3=DB=92 =DA= =86=D9=84=DB=92 =D9=88=D8=A7=D9=84=DB=8C =0A=D9=85=D8=B2=D8=A7=D8=AD=D9=85= =D8=AA =D9=85=DB=8C=DA=BA =D8=B3=D8=A7=D9=85=D9=86=DB=92 =D8=A2=DB=93 =D8= =AA=DA=BE=DB=92 =D8=A7=D8=A8 =D9=85=D8=AE=D8=A7=D9=84=D9=81 =D8=AE=DB=8C=D8= =A7=D9=84=D8=A7=D8=AA=0A =DA=A9=DB=92 =D9=84=DB=93 =D8=AC=D8=A7=D9=86=DB=92= =D8=AC=D8=A7=D8=AA=DB=92 =DB=81=DB=8C=DA=BA =D8=A7=D9=88=D8=B1 =D8=A7=D8= =A8 =D9=88=DB=81 =0A=DA=A9=D8=B1=D8=B2=D8=A6=DB=8C =D8=A7=D9=86=D8=AA=D8=B8= =D8=A7=D9=85=DB=8C=DB=81 =DA=A9=DB=8C =D8=A8=DA=BE=DB=8C =D9=85=D8=AE=D8=A7= =D9=84=D9=81=D8=AA =DA=A9=D8=B1=D8=B1=DB=81=DB=92=0A =D8=AA=DA=BE=DB=92=DB= =94 =DA=AF=D8=B0=D8=B4=D8=AA=DB=81 =DB=81=D9=81=D8=AA=DB=92 =D8=A7=DB=8C=D8= =B1=D8=A7=D9=86 =D9=86=DB=92 =D8=AD=DA=A9=D9=85=D8=AA =DB=8C=D8=A7=D8=B1 = =D9=BE=D8=B1 =0A=D8=A7=D9=84=D8=B2=D8=A7=D9=85 =D9=84=DA=AF=D8=A7=DB=8C=D8= =A7 =D8=AA=DA=BE=D8=A7 =DA=A9=DB=81 =D9=88=DB=81 =D8=A7=DB=8C=D8=B1=D8=A7= =D9=86 =DA=A9=DB=8C =D8=B3=D8=B1=D8=B2=D9=85=DB=8C=D9=86=0A =DA=A9=D9=88 = =D8=A7=D9=81=D8=BA=D8=A7=D9=86 =D8=A7=D9=86=D8=AA=D8=B8=D8=A7=D9=85=DB=8C= =DB=81 =DA=A9=DB=92 =D8=AE=D9=84=D8=A7=D9=81 =0A=DA=A9=D8=A7=D8=B1=D9=88=D8= =A7=D8=A6=DB=8C=D8=A7=DA=BA =DA=A9=D8=B1=D9=86=DB=92 =DA=A9=DB=92 =D9=84=DB= =93 =D8=A7=D8=B3=D8=AA=D8=B9=D9=85=D8=A7=D9=84 =DA=A9=D8=B1=D8=B1=DB=81=DB= =92 =DB=81=DB=8C=DA=BA=0A =D8=AC=D8=A8 =DA=A9=DB=81 =D8=A7=DB=8C=D8=B1=D8= =A7=D9=86 =DA=A9=D8=A7 =DA=A9=DB=81=D9=86=D8=A7 =DB=81=DB=92 =DA=A9=DB=81 = =D9=88=DB=81 =0A=D8=B7=D8=A7=D9=84=D8=A8=D8=A7=D9=86 =DA=A9=DB=92 =D8=AE=D9= =84=D8=A7=D9=81 =D9=85=D8=B2=D8=A7=D8=AD=D9=85 =D8=AF=DA=BE=DA=91=D9=88=DA= =BA =DA=A9=D9=88 =D8=AC=D9=88 =D8=AD=D9=85=D8=A7=DB=8C=D8=AA=0A =D9=81=D8= =B1=D8=A7=D8=AD=D9=85 =DA=A9=D8=B1 =D8=B1=DB=81=D8=A7 =D8=AA=DA=BE=D8=A7 = =D9=88=DB=81 =D8=B7=D8=A7=D9=84=D8=A8=D8=A7=D9=86 =DA=A9=D8=A7 =0A=DA=A9=D9= =86=D9=B9=D8=B1=D9=88=D9=84 =D8=AE=D8=AA=D9=85 =DB=81=D9=88=D9=86=DB=92 =DA= =A9=DB=92 =D8=A8=D8=B9=D8=AF =D8=A8=D9=86=D8=AF =DA=A9=D8=B1 =D8=AF=DB=8C = =DA=AF=D8=A6=DB=8C =DB=81=DB=92=DB=94=0A =D8=AA=D8=A7=DB=81=D9=85 =D8=A8=D8= =B9=D8=B6 =D8=B0=D8=B1=D8=A7=D8=A6=D8=B9 =DA=A9=D8=A7 =D8=AE=DB=8C=D8=A7=D9= =84 =DB=81=DB=92 =DA=A9=DB=81 =0A=D8=A7=DB=8C=D8=B1=D8=A7=D9=86 =D9=86=DB= =92 =D8=AD=DA=A9=D9=85=D8=AA =DB=8C=D8=A7=D8=B1 =DA=A9=DB=92 =D8=AE=D9=84= =D8=A7=D9=81 =D8=A7=D9=82=D8=AF=D8=A7=D9=85 =D8=A7=D9=85=D8=B1=DB=8C=DA=A9= =DB=81=0A =DA=A9=DB=92 =D8=A7=D8=B9=D8=AA=D8=B1=D8=A7=D8=B6=D8=A7=D8=AA =DA= =A9=DB=92 =D8=A8=D8=B9=D8=AF =DA=A9=DB=8C=DB=92 =DB=81=DB=8C=DA=BA=DB=94=0A=0A=0A=0AThanks ..... Liaqat=0A=0A-------------------------------------= --------------------------------=0ATo unsubscribe, e-mail: java-user-unsubs= cribe@lucene.apache.org=0AFor additional commands, e-mail: java-user-help@l= ucene.apache.org=0A=0A=0A=0A --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org