HELP - ERROR: org.apache.hadoop.fs.ChecksumException: Checksum Error

View: New views
1 Messages — Rating Filter:   Alert me  

HELP - ERROR: org.apache.hadoop.fs.ChecksumException: Checksum Error

by Eric Osgood :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

Hi,

I think that the checksum error during fetch is leading a bunch of  
other errors I am getting when I try to run updateb and generate after  
a fetch.

errors during updatedb:
---------------
java.lang.RuntimeException: problem advancing post rec#1018238
Caused by: java.io.IOException: can't find class:  
org.apache.nutch.protocgl.ProtocolStatus because  
org.apache.nutch.protocgl.ProtocolStatus
---------------
errors during generate:
---------------
java.lang.ArrayIndexOutOfBoundsException: 1107937
org.apache.hadoop.fs.ChecksumException: Checksum Error
java.io.IOException: Task: attempt_200910271443_0022_r_000006_0 - The  
reduce copier failed
.
.
.
--------------

Any help would greatly be appreciated, I don't really know where to  
start to fix these problems since this is first time I have  
encountered - my guess is that they are rooted in the checksum error I  
get when fetching sometimes.

Thanks for the help,

Eric Osgood
---------------------------------------------
Cal Poly - Computer Engineering, Moon Valley Software
---------------------------------------------
eosgood@..., eric@...
---------------------------------------------
www.calpoly.edu/~eosgood, www.lakemeadonline.com