lucene-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Will Martin" <wmartin...@gmail.com>
Subject RE: index duplicate records from data source into 1 document
Date Thu, 19 Mar 2015 02:07:19 GMT
Did you search for the answer? Updating parts of documents sir.
You have to make some ugly choices in schema tho.

-----Original Message-----
From: Derek Poh [mailto:dpoh@globalsources.com] 
Sent: Wednesday, March 18, 2015 5:47 AM
To: general@lucene.apache.org
Subject: index duplicate records from data source into 1 document

Hi

If I have duplicaterecords in my source data (DB or delimited files). 
For simplicity sake they are of the following nature

Product Id    Business Type
-----------------------------------
12345         Exporter
12345     Agent
12366     Manufacturer
12377         Exporter
12377 Distributor

There are other fields with multiple values as well.

How do I index theduplicate records into 1 document. Eg. Product Id 
12345 will be 1 document,12366 as 1 document and 12377 as 1 document.

-Derek


Mime
View raw message