Subject: Re: Uncaught Exception When Using Spooling Directory Source
From: Connor Woodson <cwoodson.dev@gmail.com>
To: user@flume.apache.org
Date: Fri, 18 Jan 2013 01:13:27 -0800

The Spooling Directory Source is best used for sending old data / backups
through Flume, rather than for real-time data: as you discovered, you
aren't supposed to write directly to a file in that directory, only to
place closed files there. You could implement what Mike mentioned above
about rolling the logs into the spooling directory, but there are other
options.

If you are looking to pull data in real time, the Exec Source mentioned
above does work. The one downside is that this source is not the most
reliable, as the red box in that link notes, and you will have to monitor
it to make sure it hasn't crashed. However, other than the Spooling
Directory source and any custom source you write, it is the only pulling
source.

Depending on how your system is set up, though, you could instead push
your logs into Flume. Here are some options:

If the log files you want to capture use Log4J, there is a Log4JAppender
which will send events directly to Flume. The benefit is that you let
Flume take control of the events right as they are generated; they are
sent through Avro to your specified host/ip, where you will have a Flume
agent with an Avro Source running.
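For concreteness, the wiring might look roughly like this; the host name
and port below are placeholders, not anything from this thread, and you
will also need the flume-ng-log4jappender jar on the application's
classpath:

# log4j.properties on the application side (hypothetical host/port)
log4j.rootLogger = INFO, flume
log4j.appender.flume = org.apache.flume.clients.log4jappender.Log4jAppender
log4j.appender.flume.Hostname = collector.example.com
log4j.appender.flume.Port = 41414

# and the matching source on the receiving Flume agent
a1.sources.r1.type = avro
a1.sources.r1.bind = 0.0.0.0
a1.sources.r1.port = 41414
a1.sources.r1.channels = c1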
Another alternative, if you don't use Log4J but do have direct control
over the application, is the Embedded Flume Agent. This is even more
powerful than the Log4J appender: you have more control over how it
works, and you are able to use Flume's channels with it. It ends up
pushing events via Avro to your Flume agent to collect/process/store.
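A rough sketch of what embedding looks like. The property names follow
the developer guide for the embedded agent, which had not yet shipped in
a release at the time of this thread, so check them against your build;
the collector host/port are made up:

import java.nio.charset.Charset;
import java.util.HashMap;
import java.util.Map;

import org.apache.flume.Event;
import org.apache.flume.agent.embedded.EmbeddedAgent;
import org.apache.flume.event.EventBuilder;

public class EmbeddedAgentSketch {
    public static void main(String[] args) throws Exception {
        // In-process agent: memory channel plus an Avro sink pointing
        // at the downstream Flume agent (hypothetical host/port).
        Map<String, String> props = new HashMap<String, String>();
        props.put("channel.type", "memory");
        props.put("channel.capacity", "10000");
        props.put("sinks", "sink1");
        props.put("sink1.type", "avro");
        props.put("sink1.hostname", "collector.example.com");
        props.put("sink1.port", "41414");
        // the embedded agent expects "failover" or "load_balance" here
        props.put("processor.type", "load_balance");

        EmbeddedAgent agent = new EmbeddedAgent("myapp-agent");
        agent.configure(props);
        agent.start();

        // Hand each log line to Flume as your application produces it.
        Event event = EventBuilder.withBody("one log line",
                Charset.forName("UTF-8"));
        agent.put(event);

        agent.stop();
    }
}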
There are also a variety of network methods that can communicate with
Flume: it supports listening on a specified port with the Netcat Source,
receiving events via HTTP POST messages, and, if your application uses
Syslog, that is supported as well.

In summary, if you need a pulling system, you will have to place a Flume
agent on each of your servers and have it use a Spooling Directory or
Exec source; if your system is configurable enough, you can instead
modify it in one of the ways above to push the logs to Flume.
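To make the pulling route concrete, a per-server agent might look
roughly like this. The paths are illustrative, and the ignore-filter
property shown (ignorePattern) only existed in trunk at the time, so
verify the name against your build:

# per-server pulling agent (sketch; paths are hypothetical)
a1.sources = r1
a1.channels = c1
a1.sinks = k1

# watch a directory of rolled, immutable files
a1.sources.r1.type = spooldir
a1.sources.r1.spoolDir = /var/log/app/spool
a1.sources.r1.channels = c1
# trunk-only at the time: skip the live file that is still being
# written, so only rolled files are picked up
a1.sources.r1.ignorePattern = ^app\.log$

# forward to the central collector over Avro
a1.sinks.k1.type = avro
a1.sinks.k1.hostname = collector.example.com
a1.sinks.k1.port = 41414
a1.sinks.k1.channel = c1

a1.channels.c1.type = memory

# less reliable alternative: tail the live file with the exec source
#a1.sources.r1.type = exec
#a1.sources.r1.command = tail -F /var/log/app/app.log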
I hope some of that was helpful,

- Connor

On Fri, Jan 18, 2013 at 12:18 AM, Henry Ma <henry.ma.1986@gmail.com> wrote:

> We have an advertisement system, which owns hundreds of servers running
> services such as resin/nginx, and each of them generates log files to a
> local location every second. What we need is to collect all the log
> files promptly into a central storage such as MooseFS for real-time
> analysis, and then archive them to HDFS every hour.
>
> We want to deploy Flume to collect log files as soon as they are
> generated from nearly one hundred servers (servers may be added or
> removed at any time) to a central location, and then archive to HDFS
> every hour.
>
> For now the log files cannot be pushed to any collecting system; we
> want the collecting system to PULL all of them remotely.
>
> Can you give me some guidance? Thanks!
>
>
> On Fri, Jan 18, 2013 at 3:45 PM, Mike Percy <mpercy@apache.org> wrote:
>
>> Can you provide more detail about what kinds of services?
>>
>> If you roll the logs every 5 minutes or so, then you can configure the
>> spooling source to pick them up once they are rolled, either by rolling
>> them into a directory for immutable files or by using the trunk version
>> of the spooling file source to specify a filter that ignores files that
>> don't match a "rolled" pattern.
>>
>> You could also use the exec source with "tail -F", but that is much
>> more unreliable than the spooling file source.
>>
>> Regards,
>> Mike
>>
>>
>> On Thu, Jan 17, 2013 at 10:23 PM, Henry Ma <henry.ma.1986@gmail.com> wrote:
>>
>>> OK, thank you very much; now I know why the problem occurs.
>>>
>>> I am a newcomer to Flume. Here is my scenario: using Flume to collect
>>> from hundreds of directories on dozens of servers into a central
>>> storage. It seems that the spooling directory source may not be the
>>> best choice. Can someone give me some advice on how to design the
>>> architecture? Which types of source and sink would fit?
>>>
>>> Thanks!
>>>
>>>
>>> On Fri, Jan 18, 2013 at 2:05 PM, Mike Percy <mpercy@apache.org> wrote:
>>>
>>>> Hi Henry,
>>>> The files must be immutable before being put into the spooling
>>>> directory, so if you copy them from a different file system you can
>>>> run into this issue. The right way to do it is to copy them to the
>>>> same file system and then atomically move them into the spooling
>>>> directory.
>>>>
>>>> Regards,
>>>> Mike
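A sketch of the copy-then-rename dance Mike describes, using Java 7's
NIO purely for illustration; the .staging directory is invented for the
example, and a shell cp followed by mv on the same filesystem does the
same thing:

import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.nio.file.StandardCopyOption;

public class SpoolDropper {
    public static void main(String[] args) throws Exception {
        Path remote  = Paths.get("/mnt/nfs/logs/sspstat.log.rolled");
        Path staging = Paths.get("/disk2/mahy/FLUME_TEST/.staging/sspstat.log.rolled");
        Path spool   = Paths.get("/disk2/mahy/FLUME_TEST/source/sspstat.log.rolled");

        // 1. Copy onto the SAME filesystem as the spool directory; the
        //    file may be visible here while only partially written.
        Files.copy(remote, staging, StandardCopyOption.REPLACE_EXISTING);

        // 2. Rename into the spool directory; within one filesystem this
        //    is an atomic rename, so the source never observes a file
        //    that is still changing size.
        Files.move(staging, spool, StandardCopyOption.ATOMIC_MOVE);
    }
}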
>>>>
>>>>
>>>> On Thu, Jan 17, 2013 at 9:59 PM, Henry Ma <henry.ma.1986@gmail.com> wrote:
>>>>
>>>>> Thank you very much! I cleaned all the related dirs and restarted.
>>>>> I kept the source spooling dir empty, started Flume, and then put
>>>>> some files into the spooling dir. But this time a new error occurred:
>>>>>
>>>>> 13/01/18 13:44:24 INFO avro.SpoolingFileLineReader: Preparing to move
>>>>> file /disk2/mahy/FLUME_TEST/source/sspstat.log.20130118112700-20130118112800.hs016.ssp
>>>>> to /disk2/mahy/FLUME_TEST/source/sspstat.log.20130118112700-20130118112800.hs016.ssp.COMPLETED
>>>>> 13/01/18 13:44:24 ERROR source.SpoolDirectorySource: Uncaught
>>>>> exception in Runnable
>>>>> java.lang.IllegalStateException: File has changed size since being read:
>>>>> /disk2/mahy/FLUME_TEST/source/sspstat.log.20130118112700-20130118112800.hs016.ssp
>>>>>   at org.apache.flume.client.avro.SpoolingFileLineReader.retireCurrentFile(SpoolingFileLineReader.java:241)
>>>>>   at org.apache.flume.client.avro.SpoolingFileLineReader.readLines(SpoolingFileLineReader.java:185)
>>>>>   at org.apache.flume.source.SpoolDirectorySource$SpoolDirectoryRunnable.run(SpoolDirectorySource.java:135)
>>>>>   at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
>>>>>   at java.util.concurrent.FutureTask$Sync.innerRunAndReset(FutureTask.java:317)
>>>>>   at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:150)
>>>>>   at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$101(ScheduledThreadPoolExecutor.java:98)
>>>>>   at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.runPeriodic(ScheduledThreadPoolExecutor.java:180)
>>>>>   at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:204)
>>>>>   at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
>>>>>   at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
>>>>>   at java.lang.Thread.run(Thread.java:662)
>>>>> 13/01/18 13:44:24 ERROR source.SpoolDirectorySource: Uncaught
>>>>> exception in Runnable
>>>>> java.io.IOException: Stream closed
>>>>>   at java.io.BufferedReader.ensureOpen(BufferedReader.java:97)
>>>>>   at java.io.BufferedReader.readLine(BufferedReader.java:292)
>>>>>   at java.io.BufferedReader.readLine(BufferedReader.java:362)
>>>>>   at org.apache.flume.client.avro.SpoolingFileLineReader.readLines(SpoolingFileLineReader.java:180)
>>>>>   at org.apache.flume.source.SpoolDirectorySource$SpoolDirectoryRunnable.run(SpoolDirectorySource.java:135)
>>>>>   at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
>>>>>   at java.util.concurrent.FutureTask$Sync.innerRunAndReset(FutureTask.java:317)
>>>>>   at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:150)
>>>>>   at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$101(ScheduledThreadPoolExecutor.java:98)
>>>>>   at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.runPeriodic(ScheduledThreadPoolExecutor.java:180)
>>>>>   at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:204)
>>>>>   at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
>>>>>   at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
>>>>>   at java.lang.Thread.run(Thread.java:662)
>>>>> 13/01/18 13:44:25 ERROR source.SpoolDirectorySource: Uncaught
>>>>> exception in Runnable
>>>>> java.io.IOException: Stream closed
>>>>>   at java.io.BufferedReader.ensureOpen(BufferedReader.java:97)
>>>>>
>>>>>
>>>>> I think this is a typical scenario: Flume watches some dirs and
>>>>> collects newly arriving files. I don't know why the exception "File
>>>>> has changed size since being read" was thrown or how to avoid it.
>>>>> Can you give some advice and guidance? Thanks!
>>>>>
>>>>>
>>>>> On Fri, Jan 18, 2013 at 1:48 PM, Patrick Wendell <pwendell@gmail.com> wrote:
>>>>>
>>>>>> Hey Henry,
>>>>>>
>>>>>> The Spooling source assumes that each file is uniquely named. If it
>>>>>> sees that a new file name has arrived that it already processed (and
>>>>>> has rolled over to a COMPLETED file), it throws an error and shuts
>>>>>> down. This is to try to prevent sending duplicate data downstream.
>>>>>>
>>>>>> Probably the best idea is to clear out the COMPLETED file (and the
>>>>>> original file, if they are indeed the same one) and restart.
>>>>>>
>>>>>> - Patrick
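One way to keep Patrick's uniqueness assumption safe is to bake a fresh
timestamp and the hostname into every rolled name, never reusing a name
even when re-copying the same data after a failure, along the lines of
Henry's sspstat.log.<start>-<end>.<host>.ssp files. A hypothetical
sketch:

import java.net.InetAddress;
import java.text.SimpleDateFormat;
import java.util.Date;

public class UniqueRollName {
    public static void main(String[] args) throws Exception {
        // e.g. sspstat.log.20130118134400.hs016
        String stamp = new SimpleDateFormat("yyyyMMddHHmmss").format(new Date());
        String host = InetAddress.getLocalHost().getHostName();
        System.out.println("sspstat.log." + stamp + "." + host);
    }
}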
>>>>>>
>>>>>> On Thu, Jan 17, 2013 at 9:31 PM, Brock Noland <brock@cloudera.com> wrote:
>>>>>> > Hmm, I think this is probably the root cause. It looks like there
>>>>>> > was already a file with that name.
>>>>>> >
>>>>>> > 13/01/18 13:16:59 ERROR source.SpoolDirectorySource: Uncaught
>>>>>> > exception in Runnable
>>>>>> > java.lang.IllegalStateException: File name has been re-used with
>>>>>> > different files. Spooling assumption violated for
>>>>>> > /disk2/mahy/FLUME_TEST/source/sspstat.log.20130118100000-20130118100100.hs009.ssp.COMPLETED
>>>>>> >   at org.apache.flume.client.avro.SpoolingFileLineReader.retireCurrentFile(SpoolingFileLineReader.java:272)
>>>>>> >   at org.apache.flume.client.avro.SpoolingFileLineReader.readLines(SpoolingFileLineReader.java:185)
>>>>>> >   at org.apache.flume.source.SpoolDirectorySource$SpoolDirectoryRunnable.run(SpoolDirectorySource.java:135)
>>>>>> >   at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
>>>>>> >   at java.util.concurrent.FutureTask$Sync.innerRunAndReset(FutureTask.java:317)
>>>>>> >   at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:150)
>>>>>> >   at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$101(ScheduledThreadPoolExecutor.java:98)
>>>>>> >   at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.runPeriodic(ScheduledThreadPoolExecutor.java:180)
>>>>>> >   at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:204)
>>>>>> >   at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
>>>>>> >   at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
>>>>>> >   at java.lang.Thread.run(Thread.java:662)
>>>>>> >
>>>>>> > On Thu, Jan 17, 2013 at 9:22 PM, Henry Ma <henry.ma.1986@gmail.com> wrote:
>>>>>> >> attached is the log file.
>>>>>> >>
>>>>>> >> the content of the conf file:
>>>>>> >> # Name the components on this agent
>>>>>> >> a1.sources = r1
>>>>>> >> a1.sinks = k1
>>>>>> >> a1.channels = c1
>>>>>> >>
>>>>>> >> # Describe/configure the source
>>>>>> >> a1.sources.r1.type = spooldir
>>>>>> >> a1.sources.r1.spoolDir = /disk2/mahy/FLUME_TEST/source
>>>>>> >> a1.sources.r1.channels = c1
>>>>>> >>
>>>>>> >> # Describe the sink
>>>>>> >> a1.sinks.k1.type = file_roll
>>>>>> >> a1.sinks.k1.sink.directory = /disk2/mahy/FLUME_TEST/sink
>>>>>> >> a1.sinks.k1.sink.rollInterval = 0
>>>>>> >>
>>>>>> >> # Use a channel which buffers events in memory
>>>>>> >> a1.channels.c1.type = memory
>>>>>> >> a1.channels.c1.capacity = 99999
>>>>>> >> #a1.channels.c1. = /disk2/mahy/FLUME_TEST/check
>>>>>> >> #a1.channels.c1.dataDirs = /disk2/mahy/FLUME_TEST/channel-data
>>>>>> >>
>>>>>> >> # Bind the source and sink to the channel
>>>>>> >> a1.sources.r1.channels = c1
>>>>>> >> a1.sinks.k1.channel = c1
>>>>>> >>
>>>>>> >>
>>>>>> >> On Fri, Jan 18, 2013 at 12:39 PM, Brock Noland <brock@cloudera.com> wrote:
>>>>>> >>>
>>>>>> >>> Hi,
>>>>>> >>>
>>>>>> >>> Would you mind turning logging to debug and then posting your full
>>>>>> >>> log/config?
>>>>>> >>>
>>>>>> >>> Brock
>>>>>> >>>
>>>>>> >>> On Thu, Jan 17, 2013 at 8:24 PM, Henry Ma <henry.ma.1986@gmail.com> wrote:
>>>>>> >>> > Hi,
>>>>>> >>> >
>>>>>> >>> > When using the Spooling Directory Source in Flume NG 1.3.1, this
>>>>>> >>> > exception happens:
>>>>>> >>> >
>>>>>> >>> > 13/01/18 11:37:09 ERROR source.SpoolDirectorySource: Uncaught
>>>>>> >>> > exception in Runnable
>>>>>> >>> > java.io.IOException: Stream closed
>>>>>> >>> >   at java.io.BufferedReader.ensureOpen(BufferedReader.java:97)
>>>>>> >>> >   at java.io.BufferedReader.readLine(BufferedReader.java:292)
>>>>>> >>> >   at java.io.BufferedReader.readLine(BufferedReader.java:362)
>>>>>> >>> >   at org.apache.flume.client.avro.SpoolingFileLineReader.readLines(SpoolingFileLineReader.java:180)
>>>>>> >>> >   at org.apache.flume.source.SpoolDirectorySource$SpoolDirectoryRunnable.run(SpoolDirectorySource.java:135)
>>>>>> >>> >   at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
>>>>>> >>> >   at java.util.concurrent.FutureTask$Sync.innerRunAndReset(FutureTask.java:317)
>>>>>> >>> >   at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:150)
>>>>>> >>> >   at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$101(ScheduledThreadPoolExecutor.java:98)
>>>>>> >>> >   at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.runPeriodic(ScheduledThreadPoolExecutor.java:180)
>>>>>> >>> >   at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:204)
>>>>>> >>> >   at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
>>>>>> >>> >   at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
>>>>>> >>> >   at java.lang.Thread.run(Thread.java:662)
>>>>>> >>> >
>>>>>> >>> > It usually happens when dropping some new files into the spooling
>>>>>> >>> > dir, and then it stops collecting files. Does someone know the
>>>>>> >>> > reason and how to avoid it?
>>>>>> >>> >
>>>>>> >>> > Thanks very much!
>>>>>> >>> > --
>>>>>> >>> > Best Regards,
>>>>>> >>> > Henry Ma
>>>>>> >>>
>>>>>> >>>
>>>>>> >>> --
>>>>>> >>> Apache MRUnit - Unit testing MapReduce -
>>>>>> >>> http://incubator.apache.org/mrunit/
>>>>>> >>
>>>>>> >>
>>>>>> >> --
>>>>>> >> Best Regards,
>>>>>> >> Henry Ma
>>>>>> >
>>>>>> >
>>>>>> > --
>>>>>> > Apache MRUnit - Unit testing MapReduce -
>>>>>> > http://incubator.apache.org/mrunit/
>>>>>
>>>>>
>>>>> --
>>>>> Henry Ma
>>>>
>>>
>>>
>>> --
>>> Henry Ma
>>
>
>
> --
> Best Regards,
> Ma Huanyu (马环宇)
> NetEase Youdao (网易有道) EAD-Platform
> POPO: mahy@corp.netease.com
> MSN: henry.ma.1986@gmail.com
> MOBILE: 18600601996