lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Yonik Seeley (JIRA)" <j...@apache.org>
Subject [jira] Commented: (SOLR-1925) CSV Response Writer
Date Thu, 15 Jul 2010 02:28:51 GMT

    [ https://issues.apache.org/jira/browse/SOLR-1925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12888676#action_12888676
] 

Yonik Seeley commented on SOLR-1925:
------------------------------------

Excel (at least the version I just tried) handled embedded newlines just fine.

> > always encapsulates with quotes - not as readable
> See the CSV spec, via Wikipedia in the links in the code

AFAIK, the CSV spec doesn't recommend always using encapsulators.

> > doesn't escape encapsulator in values
> Is there a need to do this? I don't think so...

Proper escaping is an absolute necessity.  You can't represent arbitrary text field values
without it.

> > What do you mean not compatible with CSV loader?

If we do things correctly, we should be able to round-trip with http://wiki.apache.org/solr/UpdateCSV

> > uses System.getProperty("line.separator")... we should avoid different behavior
on different platforms
> Hmm, I've never been dinged before for writing platform independent code.

Having a server process act differently on different hosts is bad.  We strive to never use
the default locale - it's a recipe for non-portability.  All file encodings (stopword lists,
etc) default to UTF-8 instead of the system locale.  Date and number formatting is standardized
and does not use the system locale.  We missed some of these in the past (and sure enough,
Solr wouldn't work properly when installed on a machine of a certain locale), but Robert cleaned
all that up.



> CSV Response Writer
> -------------------
>
>                 Key: SOLR-1925
>                 URL: https://issues.apache.org/jira/browse/SOLR-1925
>             Project: Solr
>          Issue Type: New Feature
>          Components: Response Writers
>         Environment: indep. of env.
>            Reporter: Chris A. Mattmann
>            Assignee: Erik Hatcher
>             Fix For: Next
>
>         Attachments: SOLR-1925.Chheng.071410.patch.txt, SOLR-1925.Mattmann.053010.patch.2.txt,
SOLR-1925.Mattmann.053010.patch.3.txt, SOLR-1925.Mattmann.053010.patch.txt, SOLR-1925.Mattmann.061110.patch.txt
>
>
> As part of some work I'm doing, I put together a CSV Response Writer. It currently takes
all the docs resultant from a query and then outputs their metadata in simple CSV format.
The use of a delimeter is configurable (by default if there are multiple values for a particular
field they are separated with a | symbol).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message