hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Marc-Olivier Fleury (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-4635) Memory leak ?
Date Tue, 11 Nov 2008 22:43:44 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-4635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12646706#action_12646706
] 

Marc-Olivier Fleury commented on HADOOP-4635:
---------------------------------------------

Great, I see that you have some good ideas on what parts of the code should cause issues.

I am using the latest version from svn, 0.20.0, I think. (had to upgrade to be able to use
the writing functionalities)

I spent some time looking at the code, and I noticed a strange little quirk (fuse_dfs.c:608).
The '+1' in the malloc. I don't get why it is there, must be a relic... it is not important,
since it will be freed anyway, still...

Anyway, I am chasing the leaks right now and I am happy to see that some of the leaks are
already located. Is the hdfsConnectAsUser difficult to fix? I will take a look and try to
fix it, but if you have any insight, pleease let me know!


> Memory leak ?
> -------------
>
>                 Key: HADOOP-4635
>                 URL: https://issues.apache.org/jira/browse/HADOOP-4635
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: contrib/fuse-dfs
>    Affects Versions: 0.20.0
>            Reporter: Marc-Olivier Fleury
>
> I am running a process that needs to crawl a tree structure containing ~10K images, copy
the images to the local disk, process these images, and copy them back to HDFS.
> My problem is the following : after about 10h of processing, the processes crash, complaining
about a std::bad_alloc exception (I use hadoop pipes to run existing software). When running
fuse_dfs in debug mode, I get an outOfMemoryError, telling that there is no more room in the
heap.
> While the process is running, using top or ps, I notice that fuse is using up an increasing
amount of memory, until some limit is reached. At that point , the memory used is oscillating.
I suppose that this is due to the use of the virtual memory.
> This leads me to the conclusion that there is some memory leak in fuse_dfs, since the
only other programs running are Hadoop and the existing software, both thoroughly tested in
the past.
> My problem is that my knowledge concerning memory leak tracking is rather limited, so
I will need some instructions to get more insight concerning this issue.
> Thank you

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message