lucene-solr-commits mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Solr Wiki] Update of "DataImportHandler" by FergusMcMenemie
Date Thu, 20 May 2010 17:24:42 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The "DataImportHandler" page has been changed by FergusMcMenemie.
http://wiki.apache.org/solr/DataImportHandler?action=diff&rev1=239&rev2=240

--------------------------------------------------

   * '''`commonField`''' : can be (true| false) . If true, this field once encountered in
a record will be copied to other records before creating a Solr document
  
  If an API supports chunking (when the dataset is too large) multiple calls need to be made
to complete the process.
- X!PathEntityprocessor supports this with a transformer. If transformer returns a row which
contains a field '''`$hasMore`''' with a the value `"true"` the Processor makes another request
with the same url template (The actual value is recomputed before invoking ). A transformer
can pass a totally new url too for the next call by returning a row which contains a field
'''`$nextUrl`''' whose value must be the complete url for the next call.
+ XPathEntityprocessor supports this with a transformer. If transformer returns a row which
contains a field '''`$hasMore`''' with a the value `"true"` the Processor makes another request
with the same url template (The actual value is recomputed before invoking ). A transformer
can pass a totally new url too for the next call by returning a row which contains a field
'''`$nextUrl`''' whose value must be the complete url for the next call.
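
A minimal sketch of such a transformer follows; it is not taken from the wiki page. It assumes the remote API puts a link to the next chunk in each response and that an xpath field mapping has copied that link into a row field called `nextPage`. Both the field name and the class name `PagingTransformer` are hypothetical. It also assumes DataImportHandler accepts a plain class exposing a `transformRow(Map<String, Object>)` method, so no Solr classes are imported. Setting `$hasMore` together with `$nextUrl` is a cautious choice made here, not something the text above requires.

```java
import java.util.Map;

// Hypothetical transformer for an API that returns its results in chunks.
// Assumption: an xpath field mapping has already stored the link to the
// next chunk in a row field named "nextPage" (a made-up name).
// To deploy it, declare the class public in its own source file.
class PagingTransformer {

    // Receives one row and returns it, possibly with paging hints added.
    // $hasMore = "true" asks the processor to make another request;
    // $nextUrl supplies the complete URL for that request.
    public Object transformRow(Map<String, Object> row) {
        Object next = row.get("nextPage");
        if (next != null && !next.toString().isEmpty()) {
            row.put("$hasMore", "true");
            row.put("$nextUrl", next.toString());
        }
        // Rows without a next-page link are returned unchanged,
        // so no further request is triggered by this transformer.
        return row;
    }
}
```

If the API only needs the same url template re-evaluated, for example because the template contains a page counter, the `$nextUrl` line can be dropped and `$hasMore` alone returned.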
  
- The X!PathEntityProcessor implements a streaming parser which supports a subset of xpath
syntax. Complete xpath syntax is not supported but most of the common use cases are covered
as follows:-
+ The XPathEntityProcessor implements a streaming parser which supports a subset of xpath
syntax. Complete xpath syntax is not supported but most of the common use cases are covered
as follows:-
  {{{
     xpath="/a/b/subject[@qualifier='fullTitle']"
     xpath="/a/b/subject/@qualifier"
     xpath="/a/b/c"
+    xpath="//a/..."
+    xpath="/a//b..."
  }}}
  
  
  == HttpDataSource Example ==
- <!> Http!DataSource is being deprecated in favour of URL!DataSource in [[Solr1.4]]
+ <!> HttpDataSource is being deprecated in favour of URL!DataSource in [[Solr1.4]]
  
  Download the full import example given in the DB section to try this out. We'll try indexing
the [[http://rss.slashdot.org/Slashdot/slashdot|Slashdot RSS feed]] for this example.
  
