lucene-solr-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Solr Wiki] Update of "DataImportHandler" by NoblePaul
Date Mon, 31 Mar 2008 07:13:13 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The following page has been changed by NoblePaul:
http://wiki.apache.org/solr/DataImportHandler

The comment on the change is:
simplified httpdatasource documentation

------------------------------------------------------------------------------
  
  It moves ahead and encounters `/RDF/item` and processes the rows one by one . It gets the
values for all the fields except for the 3 fields in the header. But as they were marked as
common fields, the processor puts those fields into the record just before creating the document.
  
- What about this ''transformer=!DateFormatTransformer'' attribute in the entity? This is
an inbuilt utility transformer helps the user parse his date strings in custom format to 'Date'
objects . Note the field `<field column="date" xpath="/RDF/item/date" dateTimeFormat="yyyy-MM-dd'T'hh:mm:ss"
/>` . The transformer only applies to a field which has the attribute 'dateTimeFormat'
and it uses the syntax of [ http://java.sun.com/j2se/1.4.2/docs/api/java/text/SimpleDateFormat.html
java's !SimpleDateFormat].
+ What about this ''transformer=!DateFormatTransformer'' attribute in the entity? This is
an inbuilt utility transformer helps the user parse his date strings in custom format to 'Date'
objects . Note the field `<field column="date" xpath="/RDF/item/date" dateTimeFormat="yyyy-MM-dd'T'hh:mm:ss"
/>` . The transformer only applies to a field which has the attribute 'dateTimeFormat'
and it uses the syntax of java's [http://java.sun.com/j2se/1.4.2/docs/api/java/text/SimpleDateFormat.html
SimpleDateFormat].
  
  
  You can use this feature for indexing from REST API's such as rss/atom feeds, XML data feeds
, other SOLR servers or even well formed xhtml documents . Our XPath support has its limitations
but we have tried to make sure that common use-cases are covered and since it's based on a
streaming parser, it is extremely fast and consumes constant amount of memory even for large
XMLs. Easy, isn't it? And you didn't need to write one line of code! Enjoy :)

Mime
View raw message