lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chris A. Mattmann (JIRA)" <j...@apache.org>
Subject [jira] Commented: (SOLR-1925) CSV Response Writer
Date Thu, 15 Jul 2010 15:13:51 GMT

    [ https://issues.apache.org/jira/browse/SOLR-1925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12888821#action_12888821
] 

Chris A. Mattmann commented on SOLR-1925:
-----------------------------------------

bq. I tried excel 2003 and excel 2007 - both work fine with embedded newlines. Perhaps you're
not encoding your test file correctly. 

Huh? It has nothing to do with any test file? There is no test file.

Process

# Load data into Solr with embedded newlines
# Do query to Solr, call wt=csv and save results to a file (don't specify excel=true, preserving
embedded newlines)
# Load .csv file into Excel on the platforms I mentioned and watch it break

You continue to omit the platforms you're testing on. Why do you continue to do this?

bq. I'll repeat: "the CSV spec doesn't recommend always using encapsulators". Why do you keep
suggesting that it does?

Because your statement that it doesn't recommend always using encapsulators neglects to take
into account my perspective. Also the spec isn't black and white, as you imply. There is room
for interpretation. I believe my interpretation allows for the most flexibility. 

bq. I didn't need to try it... I just looked at this patch, which doesn't do proper CSV encoding/escaping.

Right, and you selectively quoted me. If you're going to quote me, include the whole quote.
Look at the 2nd part of what I wrote that mentions that *there are plenty of cases* with Solr
data loading and Response Writers where it doesn't round trip.

bq. You pick one (like \n)... if there is a need for a different one, you let it be configured
/ specified by the client.

Hmmm, not sure I understand this statement.

All I know is that regardless, I'm using this and it's been working fine for me in production
for weeks by applying this patch to branch-1.5. 


> CSV Response Writer
> -------------------
>
>                 Key: SOLR-1925
>                 URL: https://issues.apache.org/jira/browse/SOLR-1925
>             Project: Solr
>          Issue Type: New Feature
>          Components: Response Writers
>         Environment: indep. of env.
>            Reporter: Chris A. Mattmann
>            Assignee: Erik Hatcher
>             Fix For: Next
>
>         Attachments: SOLR-1925.Chheng.071410.patch.txt, SOLR-1925.Mattmann.053010.patch.2.txt,
SOLR-1925.Mattmann.053010.patch.3.txt, SOLR-1925.Mattmann.053010.patch.txt, SOLR-1925.Mattmann.061110.patch.txt
>
>
> As part of some work I'm doing, I put together a CSV Response Writer. It currently takes
all the docs resultant from a query and then outputs their metadata in simple CSV format.
The use of a delimeter is configurable (by default if there are multiple values for a particular
field they are separated with a | symbol).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message