Hi,
I think that the checksum error during fetch is leading a bunch of
other errors I am getting when I try to run updateb and generate after
a fetch.
errors during updatedb:
---------------
java.lang.RuntimeException: problem advancing post rec#1018238
Caused by: java.io.IOException: can't find class:
org.apache.nutch.protocgl.ProtocolStatus because
org.apache.nutch.protocgl.ProtocolStatus
---------------
errors during generate:
---------------
java.lang.ArrayIndexOutOfBoundsException: 1107937
org.apache.hadoop.fs.ChecksumException: Checksum Error
java.io.IOException: Task: attempt_200910271443_0022_r_000006_0 - The
reduce copier failed
.
.
.
--------------
Any help would greatly be appreciated, I don't really know where to
start to fix these problems since this is first time I have
encountered - my guess is that they are rooted in the checksum error I
get when fetching sometimes.
Thanks for the help,
Eric Osgood
---------------------------------------------
Cal Poly - Computer Engineering, Moon Valley Software
---------------------------------------------
eosgood@...,
eric@...
---------------------------------------------
www.calpoly.edu/~eosgood, www.lakemeadonline.com