hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Wouter de Bie <wou...@spotify.com>
Subject Re: jets3t 0.7.4
Date Fri, 22 Jul 2011 14:54:10 GMT
 Hi,

When I use 0.6.1, hive gets stuck at the following point after about 6 "select * from table
limit 10":

2011-07-22 14:50:44,153 DEBUG httpclient.HttpClient (HttpClient.java:executeMethod(322)) -
enter HttpClient.executeMethod(HttpMethod)
2011-07-22 14:50:44,153 DEBUG httpclient.HttpClient (HttpClient.java:executeMethod(373)) -
enter HttpClient.executeMethod(HostConfiguration,HttpMethod,HttpState)
2011-07-22 14:50:44,153 DEBUG httpclient.MultiThreadedHttpConnectionManager (MultiThreadedHttpConnectionManager.java:getConnectionWithTimeout(383))
- enter HttpConnectionManager.getConnectionWithTimeout(HostConfiguration, long)
2011-07-22 14:50:44,154 DEBUG httpclient.MultiThreadedHttpConnectionManager (MultiThreadedHttpConnectionManager.java:getConnectionWithTimeout(390))
- HttpConnectionManager.getConnection: config = HostConfiguration[host=https://MYBUCKET.s3.amazonaws.com],
timeout = 0
2011-07-22 14:50:44,154 DEBUG httpclient.MultiThreadedHttpConnectionManager (MultiThreadedHttpConnectionManager.java:getHostPool(775))
- enter HttpConnectionManager.ConnectionPool.getHostPool(HostConfiguration)
2011-07-22 14:50:44,155 DEBUG httpclient.MultiThreadedHttpConnectionManager (MultiThreadedHttpConnectionManager.java:doGetConnection(494))
- Unable to get a connection, waiting..., hostConfig=HostConfiguration[host=https://MYBUCKET.s3.amazonaws.com]


--Wouter de Bie
Developer Business Intelligence, Spotify
wouter@spotify.com (mailto:wouter@spotify.com)
+46 72 018 0777

On Friday, July 22, 2011 at 11:30 AM, Florin Diaconeasa wrote:

> Hi,
> 
> First of all i think jets3t is called by hadoop and not by hive. Hive, from my knowledge,
doesn't know about the storage system and i see that the jets3t jar is actually inside the
lib directory of hadoop. 
> 
> Apparently there is a way to configure jets3t: http://www.jets3t.org/toolkit/configuration.html
> 
> I tried putting a jets3t.properties file with httpclient.max-connections=100 in the conf
dir of hadoop, but not sure it took into consideration. 
> 
> Also, another question: How did you reach the conclusion that the pool got depleted?
> 
> On 21 July 2011 23:45, Wouter de Bie <wouter@spotify.com (mailto:wouter@spotify.com)>
wrote:
> > hive 0.7.0+27.1-2~maverick-cdh3 and hadoop 0.20.2+923.21-1
> > 
> > --Wouter de Bie
> > Developer Business Intelligence, Spotify
> > wouter@spotify.com (mailto:wouter@spotify.com)
> > +46 72 018 0777
> > 
> > On Thursday, July 21, 2011 at 9:05 PM, Florin Diaconeasa wrote:
> > 
> > > What hive version are you using?
> > > 
> > > On Jul 21, 2011, at 1:10 PM, Wouter de Bie wrote:
> > > >  Hi guys,
> > > > 
> > > > I've just trying to upgrade to jets3t 0.7.4 from 0.6.1, because the connection
pool gets depleted after 20 requests. Now, I'm getting the following stack trace when trying
to access s3. Does anyone have a clue how to solve this? 
> > > > 
> > > > FAILED: Hive Internal Error: java.lang.RuntimeException(org.apache.hadoop.fs.s3.S3Exception:
org.jets3t.service.S3ServiceException: Request Error. HEAD '/tmp' on Host MYBUCKET.s3.amazonaws.com
(http://MYBUCKET.s3.amazonaws.com)' @ 'Thu, 21 Jul 2011 09:10:20 GMT' -- ResponseCode: 404,
ResponseStatus: Not Found, RequestId: 37E8E1DD4E8FC3D8, HostId: abkdf12EFCJBY9FyBwZrC9SEcSibtW7moeZHnSBdUoyKwIO8btedy0Q8AjdNQqop)

> > > > java.lang.RuntimeException: org.apache.hadoop.fs.s3.S3Exception: org.jets3t.service.S3ServiceException:
Request Error. HEAD '/tmp' on Host 'MYBUCKET.s3.amazonaws.com (http://MYBUCKET.s3.amazonaws.com)'
@ 'Thu, 21 Jul 2011 09:10:20 GMT' -- ResponseCode: 404, ResponseStatus: Not Found, RequestId:
37E8E1DD4E8FC3D8, HostId: abkdf12EFCJBY9FyBwZrC9SEcSibtW7moeZHnSBdUoyKwIO8btedy0Q8AjdNQqop
> > > > at org.apache.hadoop.hive.ql.Context.getScratchDir(Context.java:151)
> > > > at org.apache.hadoop.hive.ql.Context.getMRScratchDir(Context.java:190)
> > > > at org.apache.hadoop.hive.ql.Context.getMRTmpFileURI(Context.java:247)
> > > > at org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.getMetaData(SemanticAnalyzer.java:900)
> > > > at org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.analyzeInternal(SemanticAnalyzer.java:6594)
> > > > at org.apache.hadoop.hive.ql.parse.BaseSemanticAnalyzer.analyze(BaseSemanticAnalyzer.java:238)
> > > > at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:340)
> > > > at org.apache.hadoop.hive.ql.Driver.run(Driver.java:736)
> > > > at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:164)
> > > > at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:241)
> > > > at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:456)
> > > > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> > > > at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> > > > at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> > > > at java.lang.reflect.Method.invoke(Method.java:597)
> > > > at org.apache.hadoop.util.RunJar.main(RunJar.java:186)
> > > > Caused by: org.apache.hadoop.fs.s3.S3Exception: org.jets3t.service.S3ServiceException:
Request Error. HEAD '/tmp' on Host 'MYBUCKET.s3.amazonaws.com (http://MYBUCKET.s3.amazonaws.com)'
@ 'Thu, 21 Jul 2011 09:10:20 GMT' -- ResponseCode: 404, ResponseStatus: Not Found, RequestId:
37E8E1DD4E8FC3D8, HostId: abkdf12EFCJBY9FyBwZrC9SEcSibtW7moeZHnSBdUoyKwIO8btedy0Q8AjdNQqop
> > > > at org.apache.hadoop.fs.s3native.Jets3tNativeFileSystemStore.handleServiceException(Jets3tNativeFileSystemStore.java:229)
> > > > at org.apache.hadoop.fs.s3native.Jets3tNativeFileSystemStore.retrieveMetadata(Jets3tNativeFileSystemStore.java:111)
> > > > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> > > > at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> > > > at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> > > > at java.lang.reflect.Method.invoke(Method.java:597)
> > > > at org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:82)
> > > > at org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:59)
> > > > at org.apache.hadoop.fs.s3native.$Proxy4.retrieveMetadata(Unknown Source)
> > > > at org.apache.hadoop.fs.s3native.NativeS3FileSystem.getFileStatus(NativeS3FileSystem.java:394)
> > > > at org.apache.hadoop.fs.s3native.NativeS3FileSystem.mkdir(NativeS3FileSystem.java:507)
> > > > at org.apache.hadoop.fs.s3native.NativeS3FileSystem.mkdirs(NativeS3FileSystem.java:500)
> > > > at org.apache.hadoop.fs.FileSystem.mkdirs(FileSystem.java:1226)
> > > > at org.apache.hadoop.hive.ql.Context.getScratchDir(Context.java:147)
> > > > ... 15 more
> > > > Caused by: org.jets3t.service.S3ServiceException: Request Error. HEAD
'/tmp' on Host 'MYBUCKET.s3.amazonaws.com (http://MYBUCKET.s3.amazonaws.com)' @ 'Thu, 21 Jul
2011 09:10:20 GMT' -- ResponseCode: 404, ResponseStatus: Not Found, RequestId: 37E8E1DD4E8FC3D8,
HostId: abkdf12EFCJBY9FyBwZrC9SEcSibtW7moeZHnSBdUoyKwIO8btedy0Q8AjdNQqop
> > > > at org.jets3t.service.impl.rest.httpclient.RestS3Service.performRequest(RestS3Service.java:520)
> > > > at org.jets3t.service.impl.rest.httpclient.RestS3Service.performRestHead(RestS3Service.java:868)
> > > > at org.jets3t.service.impl.rest.httpclient.RestS3Service.getObjectImpl(RestS3Service.java:2016)
> > > > at org.jets3t.service.impl.rest.httpclient.RestS3Service.getObjectDetailsImpl(RestS3Service.java:1944)
> > > > at org.jets3t.service.S3Service.getObjectDetails(S3Service.java:3059)
> > > > at org.jets3t.service.S3Service.getObjectDetails(S3Service.java:1940)
> > > > at org.apache.hadoop.fs.s3native.Jets3tNativeFileSystemStore.retrieveMetadata(Jets3tNativeFileSystemStore.java:103)
> > > > ... 27 more
> > > > Caused by: org.jets3t.service.impl.rest.HttpException
> > > > at org.jets3t.service.impl.rest.httpclient.RestS3Service.performRequest(RestS3Service.java:518)
> > > > ... 33 more
> > > > 
> > > > 
> > > > --Wouter de Bie
> > > > Developer Business Intelligence, Spotify
> > > > wouter@spotify.com (mailto:wouter@spotify.com)
> > > > +46 72 018 0777
> > > 
> > 
> 
> 
> 
> -- 
> 
> 
> Florin


Mime
View raw message