pig-user mailing list archives

From "James Brown" <jbr...@buzzlogic.com>
Subject RE: Encoding byte code 254 in pig.
Date Mon, 06 Dec 2010 19:15:54 GMT
In the bash script that calls the Pig job, replace the interpreter definition:


with this:

#!/bin/bash -l
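
One possible explanation for why a login shell helps, offered as an assumption and not something confirmed in this thread: cron starts jobs with a minimal environment, so locale variables such as LANG may be unset, and the 'þ' in the script may then be decoded as ASCII instead of UTF-8. The Python sketch below is only an illustration of that effect (it is not Pig code). Decoding the two UTF-8 bytes of 'þ' as ASCII yields two replacement characters, which would be consistent with the '[??]' in the quoted error.

```python
# Illustration only: how the UTF-8 bytes of U+00FE ('þ') decode under
# different charsets. The ASCII case is an assumed stand-in for a cron
# environment with no locale configured.
raw = "\u00fe".encode("utf-8")  # the two bytes 0xC3 0xBE

as_utf8 = raw.decode("utf-8")                     # one character: 'þ'
as_ascii = raw.decode("ascii", errors="replace")  # one U+FFFD per invalid byte

print(len(raw), repr(as_utf8), len(as_ascii))
```

If this is the cause, checking the output of `locale` from inside the cron job versus an interactive shell should show the difference.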

-----Original Message-----
From: Marilson Campos [mailto:mbc_active@yahoo.com] 
Sent: Wednesday, November 10, 2010 3:53 PM
To: user@pig.apache.org
Subject: Re: Encoding byte code 254 in pig.

Hey, I've got it working fine in the shell. When I installed it in the crontab, I get this error:


2010-11-10 12:29:02,684 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2999: Unexpected
internal error. could not instantiate 'PigStorage' with arguments '[??]'
Free consulting. :-) Great!
Marilson Campos


--- On Wed, 11/10/10, Stefan Will <stefan.will@gmx.net> wrote:

From: Stefan Will <stefan.will@gmx.net>
Subject: Re: Encoding byte code 254 in pig.
To: user@pig.apache.org
Date: Wednesday, November 10, 2010, 3:47 PM

I don't think character sequences actually work in Pig scripts. IIRC there
is even a bug reported for it. So using the unescaped character might be
your only choice.

-- Stefan

On 11/11/10 12:34 AM, "Marilson Campos" <mbc_active@yahoo.com> wrote:

>I have a file that has the char (254) as a separator. I can force the
>character into the file, but wanted to
>LOAD 'file.log.gz' USING PigStorage('þ')
>when encoding as UTF-8
>LOAD 'file.log.gz' USING PigStorage('\u00FE')
>It does not parse. I believe \u notation works only with chars that take
>1 byte.
>I need to encode the 0xFE byte
>Thanks for the help in advance.
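
For context on the byte-versus-character distinction raised in the question, here is a minimal Python sketch (not Pig code, and not a statement about how PigStorage parses its argument). It shows that the character with code point 254 is a single byte 0xFE only in a one-byte charset such as Latin-1. In UTF-8 it takes two bytes, and a lone 0xFE byte is not valid UTF-8 at all.

```python
# Illustration only: the same character, U+00FE ('þ'), in two encodings.
ch = "\u00fe"

utf8_bytes = ch.encode("utf-8")      # two bytes
latin1_bytes = ch.encode("latin-1")  # one byte

print(utf8_bytes.hex(), latin1_bytes.hex())

# A lone 0xFE byte cannot be decoded as UTF-8.
try:
    b"\xfe".decode("utf-8")
    lone_fe_is_valid_utf8 = True
except UnicodeDecodeError:
    lone_fe_is_valid_utf8 = False

print(lone_fe_is_valid_utf8)
```

So whether the separator matches depends on which encoding the data file actually uses and which one the script is read in.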
