lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Yonik Seeley" <yo...@apache.org>
Subject Re: Another text I cannot get into SOLR with csv
Date Tue, 08 Jan 2008 15:11:14 GMT
On Jan 8, 2008 9:58 AM, Yonik Seeley <yonik@apache.org> wrote:
> On Jan 8, 2008 3:07 AM, Michael Lackhoff <michael@lackhoff.de> wrote:
> > After a long weekend I could do a deeper look into this one and it looks
> > as if the problem has to do with splitting.
> >
> > > This one works for me fine.
> > >
> > > $ cat t2.csv
> > > id,name
> > > 12345,"'s-Gravenhage"
> > > 12345,'s-Gravenhage
> > > 12345,"""s-Gravenhage"
> > >
> > > $ curl http://localhost:8983/solr/update/csv?commit=true --data-binary
> > > @t2.csv -H 'Content-type:text/csv; charset=utf-8'
> >
> > My csv-file:
> > DBRECORDID,PUBLPLACE
> > 43298,"'s-Gravenhage"
> >
> > The URL (giving a 400 error):
> > http://localhost:8983/solr/update/csv?f.PUBLPLACE.split=true&commit=true"
> > (PUBLPLACE is defined as multivalued field)
> >
> > If I remove the "f.PUBLPLACE.split=true" parameter OR make sure that the
> > apostrophe is not the first character, everything is fine.
>
> Indeed... looks like you hit another bug.
> Could you file another bug (this time with Solr)?
> If it turns out to be a commons-csv bug, I'll file another bug there.

Ahh, wait, it looks a single quote as the encapsulator for split field
values by default.
Try adding f.PUBLPLACE.encapsulator=%00
to disable the encapsulation.

-Yonik

Mime
View raw message