flink-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Aljoscha Krettek <aljos...@apache.org>
Subject Re: RollingSink
Date Thu, 24 Mar 2016 17:45:00 GMT
Hi,
I’m not aware of anyone having tested the RollingSink with anything besides
“hdfs://“ and “file://“. That the file is empty is strange. Is something
like revokeLease() necessary for your custom HCFS?

Cheers,
Aljoscha

On Wed, 23 Mar 2016 at 17:53 Vijay Srinivasaraghavan
<vijikarthi@yahoo.com.invalid> wrote:

> Hi Aljoscha,
> It was my bad that I have copied some wrong class files during one of the
> step. I have retried the same steps that I mentioned earlier and with that
> I am able to see all the debug statements that I have added to the
> RollingSink..
> I have found another interesting issue here. I am using HCFS (Hadoop
> Compatible File System) implementation of the filesystem that we have built
> in-house (not stock HDFC). As part of the recovery process in the
> restoreState() method of RollingSink class, we are trying to invoke
> revokeLease() API which is available only in DistributedFileSystem (or any
> inherited class) whereas the HCFS contact class that we have implemented is
> FileSystem. Since the codepath will not invoke revokeLease() for our HCFS
> implementation class, I am seeing the part file with empty content though
> the file is renamed from "in-progress" to actual file name.
> Question: Do you know if RollingSink implementation is tested with any
> Hadoop Compatible File System like GlusterFS, etc.,?
> RegardsVijay
>
>     On Wednesday, March 23, 2016 7:42 AM, Aljoscha Krettek <
> aljoscha@apache.org> wrote:
>
>
>  Hmm, that’s strange. Could you maybe send one of the TaskManager logs?
>
> Cheers,
> Aljoscha
> > On 23 Mar 2016, at 15:28, Vijay <vijikarthi@yahoo.com.INVALID> wrote:
> >
> > Yes, I have updated on all cluster nodes and restarted entire cluster.
> >
> > Do you see any problems with the steps that I followed?
> >
> > Regards,
> > Vijay
> >
> > Sent from my iPhone
> >
> >> On Mar 23, 2016, at 7:18 AM, Aljoscha Krettek <aljoscha@apache.org>
> wrote:
> >>
> >> Hi,
> >> did you update the log4j.properties file on all nodes where the
> TaskManagers run and did you restart the whole cluster?
> >>
> >> Cheers,
> >> Aljoscha
> >>> On 23 Mar 2016, at 15:02, Vijay <vijikarthi@yahoo.com.INVALID> wrote:
> >>>
> >>> Hi Aljoscha,
> >>>
> >>> I am using standalone flink cluster (3 node). I am running flink job
> by submitting/uploading jar through Flink UI.
> >>>
> >>> I have built flink from maven and modified the RollingSink code to add
> new debug statements.
> >>>
> >>> I have also packaged the streaming file system connector package
> (including RollingSink changes) to the job jar file. Modified changes
> include both Sytem.out as well as logger statements.
> >>>
> >>> Updated log4j property file to DEBUG
> >>>
> >>> Regards,
> >>> Vijay
> >>>
> >>> Sent from my iPhone
> >>>
> >>>> On Mar 23, 2016, at 6:48 AM, Aljoscha Krettek <aljoscha@apache.org>
> wrote:
> >>>>
> >>>> Hi,
> >>>> what where the steps you took? By the way, are you running this on
> yarn or in standalone mode? How are you starting the Flink job? Do you
> still don’t see DEBUG entries in the log?
> >>>>
> >>>> Cheers,
> >>>> Aljoscha
> >>>>> On 23 Mar 2016, at 14:32, Vijay <vijikarthi@yahoo.com> wrote:
> >>>>>
> >>>>> I have changed the properties file but it did not help.
> >>>>>
> >>>>> Regards,
> >>>>> Vijay
> >>>>>
> >>>>> Sent from my iPhone
> >>>>>
> >>>>>> On Mar 23, 2016, at 5:39 AM, Aljoscha Krettek <aljoscha@apache.org>
> wrote:
> >>>>>>
> >>>>>> Ok, then you should be able to change the log level to DEBUG
in
> conf/log4j.properties.
> >>>>>>
> >>>>>>> On 23 Mar 2016, at 12:41, Vijay <vijikarthi@yahoo.com>
wrote:
> >>>>>>>
> >>>>>>> I think only the ERROR category gets displayed in the log
file
> >>>>>>>
> >>>>>>> Regards,
> >>>>>>> Vijay
> >>>>>>>
> >>>>>>> Sent from my iPhone
> >>>>>>>
> >>>>>>>> On Mar 23, 2016, at 2:30 AM, Aljoscha Krettek <
> aljoscha@apache.org> wrote:
> >>>>>>>>
> >>>>>>>> Hi,
> >>>>>>>> are you seeing the regular log output from the RollingSink
in the
> TaskManager logs?
> >>>>>>>>
> >>>>>>>> Cheers,
> >>>>>>>> Aljoscha
> >>>>>>>>> On 22 Mar 2016, at 20:03, Vijay Srinivasaraghavan
<
> vijikarthi@yahoo.com> wrote:
> >>>>>>>>>
> >>>>>>>>> I have tried both log4j logger as well as System.out.println
> option but none of these worked.
> >>>>>>>>>
> >>>>>>>>> From what I have seen so far is the Filesystem streaming
> connector classes are not packaged in the grand jar
> (flink-dist_2.10-1.1-SNAPSHOT.jar) that is copied under
> <FLINK_HOME>/build-target/lib location as part of Flink maven build step.
> >>>>>>>>>
> >>>>>>>>> So, I manually copied (overwrite) the compiled class
files from
> org.apache.flink.streaming.connectors.fs package to the my "Flink job"
> distribution jar (otherwise it was using standard jars that are defined as
> mvn dependency in Articatory) and then uploaded the jar to Job Manager.
> >>>>>>>>>
> >>>>>>>>> Am I missing something? How do I enable logging
for the
> RollingSink class?
> >>>>>>>>>
> >>>>>>>>> <dependency>
> >>>>>>>>> <groupId>org.apache.flink</groupId>
> >>>>>>>>> <artifactId>flink-connector-filesystem_2.11</artifactId>
> >>>>>>>>> <version>${flink.version}</version>
> >>>>>>>>> <scope>provided</scope>
> >>>>>>>>> </dependency>
> >>>>>>>>>
> >>>>>>>>>
> >>>>>>>>> On Tuesday, March 22, 2016 3:04 AM, Aljoscha Krettek
<
> aljoscha@apache.org> wrote:
> >>>>>>>>>
> >>>>>>>>>
> >>>>>>>>> Hi,
> >>>>>>>>> how are you printing the debug statements?
> >>>>>>>>>
> >>>>>>>>> But yeah all the logic of renaming in progress files
and
> cleaning up after a failed job happens in restoreState(BucketState state).
> The steps are roughly these:
> >>>>>>>>>
> >>>>>>>>> 1. Move current in-progress file to final location
> >>>>>>>>> 2. truncate the file if necessary (if truncate is
not available
> write a .valid-length file)
> >>>>>>>>> 3. Move pending files to final location that where
part of the
> checkpoint
> >>>>>>>>> 4. cleanup any leftover pending/in-progress files
> >>>>>>>>>
> >>>>>>>>> Cheers,
> >>>>>>>>> Aljoscha
> >>>>>>>>>
> >>>>>>>>>> On 22 Mar 2016, at 10:08, Vijay Srinivasaraghavan
> <vijikarthi@yahoo.com.INVALID> wrote:
> >>>>>>>>>>
> >>>>>>>>>> Hello,
> >>>>>>>>>> I have enabled checkpoint and I am using RollingSink
to sink
> the data to HDFS (2.7.x) from KafkaConsumer. To simulate failover/recovery,
> I stopped TaskManager and the job gets rescheduled to other Taskmanager
> instance. During this momemnt, the current "in-progress" gets closed and
> renamed to part-0-1 from _part-0-1_in-progress.
> >>>>>>>>>> I was hoping to see the debug statement that
I have added to
> "restoreState" method but none of my debug statement gets printed. I am not
> sure if the restoreState() method gets invoked during this scenario. Could
> you please help me understand the flow during "failover" scenario?
> >>>>>>>>>> P.S: Functionally the code appears to be working
fine but I am
> trying to understand the underlying implementation details. public void
> restoreState(BucketState state)
> >>>>>>>>>> Regards
> >>>>>>>>>> Vijay
> >>
>
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message