nutch-user mailing list archives

From Lewis John Mcgibbney <lewis.mcgibb...@gmail.com>
Subject Re: bin/nutch
Date Mon, 27 Aug 2012 13:05:35 GMT
If you wish to use HBase to store your crawl data, then yes.
Alternatively, you can currently use Cassandra, Accumulo, MySQL or
HSQLDB, and soon Amazon's DynamoDB.

Check out the HBase tutorial; it includes everything you
require to get going with this.

Lewis

On Mon, Aug 27, 2012 at 1:46 PM, Tolga <tolga@ozses.net> wrote:
> Do I need HBase as well?
>
> On 08/27/2012 03:00 PM, Lewis John Mcgibbney wrote:
>>
>> Try "ant runtime".
>>
>> This will generate the runtime deployment(s) you require to get going,
>> however it _does_not_ give you a ready to rock deployment.
>>
>> You should check out the following tutorials:
>>
>> http://wiki.apache.org/nutch/Nutch2Tutorial
>> http://nlp.solutions.asia/?p=180
>>
>> Lewis
>>
>> On Mon, Aug 27, 2012 at 12:34 PM, Tolga <tolga@ozses.net> wrote:
>>>
>>> Hi, and thanks for your fast reply.
>>>
>>> I found a tutorial on the interwebz, and it said to use ant in $NUTCH.
>>> However, when I used it, I got:
>>>
>>>
>>> [mtozses@atlas NUTCH]$ time ant
>>> Buildfile: build.xml
>>>    [taskdef] Could not load definitions from resource
>>> org/sonar/ant/antlib.xml. It could not be found.
>>>
>>> ivy-probe-antlib:
>>>
>>> BUILD FAILED
>>> /usr/local/solr/NUTCH/build.xml:472: Class
>>> org.apache.tools.ant.taskdefs.ConditionTask doesn't support the nested
>>> "typefound" element.
>>>
>>> What to do now?
>>>
>>> Regards,
>>>
>>>
>>> On 08/27/2012 11:45 AM, hugo.ma wrote:
>>>>
>>>> Yes, you need to compile Nutch 2.0.
>>>>
>>>>
>>>>
>>>> --
>>>> View this message in context:
>>>> http://lucene.472066.n3.nabble.com/bin-nutch-tp4003408p4003427.html
>>>> Sent from the Nutch - User mailing list archive at Nabble.com.
>>>
>>>
>>
>>
>
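
For anyone following the "ant runtime" advice quoted above, here is a
minimal sketch for checking whether the build has already produced the
local deployment. It assumes the script is run from the Nutch 2.x source
checkout and that the build places the launcher at
runtime/local/bin/nutch. NUTCH_HOME and has_local_runtime are
hypothetical names used only for illustration; they are not part of
Nutch.

```shell
#!/bin/sh
# Sketch only. NUTCH_HOME is a hypothetical variable pointing at the
# Nutch 2.x source checkout; it defaults to the current directory.
NUTCH_HOME="${NUTCH_HOME:-$PWD}"

# Hypothetical helper: succeeds if the given checkout already contains
# the launcher script that "ant runtime" is assumed to generate.
has_local_runtime() {
    [ -f "$1/runtime/local/bin/nutch" ]
}

if has_local_runtime "$NUTCH_HOME"; then
    echo "local runtime found under $NUTCH_HOME/runtime/local"
else
    echo "no local runtime yet; run 'ant runtime' in $NUTCH_HOME first"
fi
```

This only confirms that the build step has run. As noted in the quoted
reply, a storage backend still has to be configured before crawling.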



-- 
Lewis
