<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
	<id>tag:old.nabble.com,2006:forum-30592</id>
	<title>Nabble - Hadoop Core</title>
	<updated>2009-12-15T03:36:10Z</updated>
	<link rel="self" type="application/atom+xml" href="http://old.nabble.com/Hadoop-Core-f30592.xml" />
	<link rel="alternate" type="text/html" href="http://old.nabble.com/Hadoop-Core-f30592.html" />
	<subtitle type="html"></subtitle>
	
<entry>
	<id>tag:old.nabble.com,2006:post-26793409</id>
	<title>File _partition.lst does not exist.</title>
	<published>2009-12-15T03:36:10Z</published>
	<updated>2009-12-15T03:36:10Z</updated>
	<author>
		<name>afarsek</name>
	</author>
	<content type="html">Hi,
&lt;br&gt;I'm using the InputSampler.RandomSampler to perform a partition sampling. It should create a file called _partition.lst that should be use later on by the partitionner class.
&lt;br&gt;&lt;br&gt;For some reason it doesn't work and I get a 
&lt;br&gt;java.io.FileNotFoundException: File _partition.lst does not exist.
&lt;br&gt;Below the code: it consists of a mapper only job, taking as input a file in a SequenceFileInputFormat that was generated by a previous job.
&lt;br&gt;&lt;br&gt;Thanks a lot in advance for any insights.
&lt;br&gt;&lt;br&gt;public class WordCountSorted {
&lt;br&gt;&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; public static class Map extends MapReduceBase implements Mapper&amp;lt;LongWritable, Text, IntWritable, Text&amp;gt; {
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; //		private final static IntWritable one = new IntWritable(1);
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; private Text word = new Text();
&lt;br&gt;&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; public void map(LongWritable key, Text value, OutputCollector&amp;lt;IntWritable, Text&amp;gt; output, Reporter reporter) throws IOException {
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; String line = value.toString();
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; String[] tokens = line.split(&amp;quot;\t&amp;quot;);
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; int nbOccurences = Integer.parseInt(tokens[1]);
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; word.set(tokens[0]);
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; output.collect(new IntWritable(nbOccurences),word );
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; }
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; }
&lt;br&gt;&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; public static void main(String[] args) throws Exception {
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; JobConf conf = new JobConf(WordCountSorted.class);
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; conf.setJobName(&amp;quot;wordcountsorted&amp;quot;);
&lt;br&gt;&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; FileInputFormat.setInputPaths(conf, new Path(args[0]));
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; FileOutputFormat.setOutputPath(conf, new Path(args[1]));
&lt;br&gt;&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; conf.setInputFormat(SequenceFileInputFormat.class);
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; conf.setOutputKeyClass(IntWritable.class);
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; conf.setOutputValueClass(Text.class);
&lt;br&gt;&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; conf.setMapperClass(Map.class);
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; conf.setReducerClass(IdentityReducer.class);
&lt;br&gt;&lt;br&gt;&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; conf.setNumReduceTasks(2);
&lt;br&gt;&lt;br&gt;&lt;br&gt;&lt;br&gt;&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; InputSampler.Sampler&amp;lt;IntWritable, Text&amp;gt; sampler =
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; new InputSampler.RandomSampler&amp;lt;IntWritable, Text&amp;gt;(0.1, 100, 10);
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; InputSampler.writePartitionFile(conf, sampler);
&lt;br&gt;&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; conf.setPartitionerClass(TotalOrderPartitioner.class);
&lt;br&gt;&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; JobClient.runJob(conf);
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; }
&lt;br&gt;}&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/File-_partition.lst-does-not-exist.-tp26793409p26793409.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-26774461</id>
	<title>0.18.3-14.cloudera.CH0_3 RHEL5: hadoop-site.xml properties not being read when formatting the Hadoop namenode</title>
	<published>2009-12-14T00:38:43Z</published>
	<updated>2009-12-14T00:38:43Z</updated>
	<author>
		<name>David Stemmer</name>
	</author>
	<content type="html">I'm trying to set up a basic Hadoop single-node cluster on a Red Hat Enterprise Linux 5 system. The Hadoop version is 0.18.3-14.cloudera.CH0_3. Unfortunately I am having problems getting Hadoop to read the configuration properties from hadoop-site.xml. Specifically, when I try to run the following command:
&lt;br&gt;&lt;br&gt;${HADOOP_HOME}/bin/hadoop namenode -format
&lt;br&gt;&lt;br&gt;...it seems that Hadoop is not using the storage directory that I have configured for HDFS. I get the following output:
&lt;br&gt;&lt;br&gt;---------------------------------------------------------------------
&lt;br&gt;---------------------------------------------------------------------
&lt;br&gt;&lt;br&gt;&amp;nbsp; 09/12/14 02:24:14 INFO dfs.NameNode: STARTUP_MSG: 
&lt;br&gt;&amp;nbsp; /************************************************************
&lt;br&gt;&amp;nbsp; STARTUP_MSG: Starting NameNode
&lt;br&gt;&amp;nbsp; STARTUP_MSG: &amp;nbsp; host = #####/###.##.#.##
&lt;br&gt;&amp;nbsp; STARTUP_MSG: &amp;nbsp; args = [-format]
&lt;br&gt;&amp;nbsp; STARTUP_MSG: &amp;nbsp; version = 0.18.3-14.cloudera.CH0_3
&lt;br&gt;&amp;nbsp; STARTUP_MSG: &amp;nbsp; build = &amp;nbsp;-r HEAD; compiled by 'root' on Mon Jul &amp;nbsp;6 15:02:31 EDT 2009
&lt;br&gt;&amp;nbsp; ************************************************************/
&lt;br&gt;&amp;nbsp; Re-format filesystem in /tmp/hadoop-myusername/dfs/name ? (Y or N) Y
&lt;br&gt;&amp;nbsp; 09/12/14 02:24:17 INFO fs.FSNamesystem: fsOwner=myusername,ugrad
&lt;br&gt;&amp;nbsp; 09/12/14 02:24:17 INFO fs.FSNamesystem: supergroup=supergroup
&lt;br&gt;&amp;nbsp; 09/12/14 02:24:17 INFO fs.FSNamesystem: isPermissionEnabled=true
&lt;br&gt;&amp;nbsp; 09/12/14 02:24:17 INFO dfs.Storage: Image file of size 82 saved in 0 seconds.
&lt;br&gt;&amp;nbsp; 09/12/14 02:24:17 INFO dfs.Storage: Storage directory /tmp/hadoop-myusername/dfs/name has been successfully formatted.
&lt;br&gt;&amp;nbsp; 09/12/14 02:24:17 INFO dfs.NameNode: SHUTDOWN_MSG: 
&lt;br&gt;&amp;nbsp; /************************************************************
&lt;br&gt;&amp;nbsp; SHUTDOWN_MSG: Shutting down NameNode at #####/###.##.#.##
&lt;br&gt;&amp;nbsp; ************************************************************/
&lt;br&gt;&lt;br&gt;---------------------------------------------------------------------
&lt;br&gt;&lt;br&gt;I do not want it to format /tmp/hadoop-myusername/dfs/name as the storage directory. Here is what is in my hadoop-site.xml file. Notice that hadoop.tmp.dir is set /home/u/fall06/myusername/hadoop_tmp:
&lt;br&gt;&lt;br&gt;---------------------------------------------------------------------
&lt;br&gt;---------------------------------------------------------------------
&lt;br&gt;&lt;br&gt;&amp;nbsp; &amp;lt;?xml version=&amp;quot;1.0&amp;quot;?&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;?xml-stylesheet type=&amp;quot;text/xsl&amp;quot; href=&amp;quot;configuration.xsl&amp;quot;?&amp;gt;
&lt;br&gt;&lt;br&gt;&amp;nbsp; &amp;lt;!-- Put site-specific property overrides in this file. --&amp;gt;
&lt;br&gt;&lt;br&gt;&amp;nbsp; &amp;lt;configuration&amp;gt;
&lt;br&gt;&lt;br&gt;&amp;nbsp; &amp;lt;property&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;lt;name&amp;gt;hadoop.tmp.dir&amp;lt;/name&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;lt;value&amp;gt;/home/u/fall06/myusername/hadoop_tmp&amp;lt;/value&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;/property&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;nbsp;
&lt;br&gt;&amp;nbsp; &amp;lt;property&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;lt;name&amp;gt;fs.default.name&amp;lt;/name&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;lt;value&amp;gt;hdfs://localhost:50031&amp;lt;/value&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;/property&amp;gt;
&lt;br&gt;&lt;br&gt;&amp;nbsp; &amp;lt;property&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;lt;name&amp;gt;mapred.job.tracker&amp;lt;/name&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;lt;value&amp;gt;localhost:50032&amp;lt;/value&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;/property&amp;gt;
&lt;br&gt;&lt;br&gt;&amp;nbsp; &amp;lt;property&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;lt;name&amp;gt;dfs.replication&amp;lt;/name&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;lt;value&amp;gt;1&amp;lt;/value&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;/property&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;/configuration&amp;gt;
&lt;br&gt;&lt;br&gt;---------------------------------------------------------------------
&lt;br&gt;&lt;br&gt;&lt;br&gt;Here are the relevent lines of my .bash_profile where I set some of the Hadoop environment variables:
&lt;br&gt;&lt;br&gt;---------------------------------------------------------------------
&lt;br&gt;---------------------------------------------------------------------
&lt;br&gt;&lt;br&gt;&amp;nbsp; export HADOOP_HOME=/home/u/fall06/myusername/Desktop/hadoop_project/hadoop
&lt;br&gt;&amp;nbsp; export HADOOP_CONF_DIR=/home/u/fall06/myusername/Desktop/hadoop_project/hadoop/conf
&lt;br&gt;&lt;br&gt;---------------------------------------------------------------------
&lt;br&gt;&lt;br&gt;A few more details on the system setup:
&lt;br&gt;&lt;br&gt;- I did not install Hadoop on the system myself; it was installed by a system administrator.
&lt;br&gt;- Because the default Hadoop directory is read/execute only, I copied the Hadoop directory to a folder that I own and ran in from there. My HADOOP_HOME enviromnent variable is set to this duplicate directory.
&lt;br&gt;- hadoop_site.xml is in my ${HADOOP_HOME}/conf directory.
&lt;br&gt;- I did try to echo what ${HADOOP_CONF_DIR} was being set to while '{HADOOP_HOME}/bin/hadoop namenode -format' was running. The result was this:
&lt;br&gt;&lt;br&gt;---------------------------------------------------------------------
&lt;br&gt;---------------------------------------------------------------------
&lt;br&gt;&amp;nbsp; /home/u/fall06/myusername/Desktop/hadoop_project/conf
&lt;br&gt;---------------------------------------------------------------------
&lt;br&gt;&lt;br&gt;That's what I would expect, so I'm stumped. Does anyone have any ideas what I might be doing wrong, or know of more tests I can run to figure out the problem? Any and all advice is appreciated.&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/0.18.3-14.cloudera.CH0_3-RHEL5%3A-hadoop-site.xml-properties-not-being-read-when-formatting-the-Hadoop-namenode-tp26774461p26774461.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-26694569</id>
	<title>Re: multiple file input</title>
	<published>2009-12-08T06:18:38Z</published>
	<updated>2009-12-08T06:18:38Z</updated>
	<author>
		<name>laser08150815</name>
	</author>
	<content type="html">&lt;blockquote class=&quot;quote light-black dark-border-color&quot;&gt;&lt;div class=&quot;quote light-border-color&quot;&gt;
&lt;div class=&quot;quote-author&quot; style=&quot;font-weight: bold;&quot;&gt;pmg wrote:&lt;/div&gt;
&lt;div class=&quot;quote-message shrinkable-quote&quot;&gt;I am evaluating hadoop for a problem that do a Cartesian product of input from one file of 600K (File A) with another set of file set (FileB1, FileB2, FileB3) with 2 millions line in total.
&lt;br&gt;&lt;br&gt;Each line from FileA gets compared with every line from FileB1, FileB2 etc. etc. FileB1, FileB2 etc. are in a different input directory
&lt;br&gt;&lt;br&gt;So....
&lt;br&gt;&lt;br&gt;Two input directories 
&lt;br&gt;&lt;br&gt;1. input1 directory with a single file of 600K records - FileA
&lt;br&gt;2. input2 directory segmented into different files with 2Million records - FileB1, FileB2 etc.
&lt;br&gt;&lt;br&gt;How can I have a map that reads a line from a FileA in directory input1 and compares the line with each line from input2? 
&lt;br&gt;&lt;br&gt;What is the best way forward? I have seen plenty of examples that maps each record from single input file and reduces into an output forward.
&lt;br&gt;&lt;br&gt;thanks
&lt;/div&gt;
&lt;/div&gt;&lt;/blockquote&gt;
&lt;br&gt;I had a similar problem and solved it by writing a custom InputFormat (see attachment). You should improve the methods ACrossBInputSplit.getLength , ACrossBRecordReader.getPos and ACrossBRecordReader.getProgress.&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/multiple-file-input-tp24095358p26694569.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-26683812</id>
	<title>Hadoop / Data Migration Specialist Needed</title>
	<published>2009-12-07T12:43:04Z</published>
	<updated>2009-12-07T12:43:04Z</updated>
	<author>
		<name>alevin</name>
	</author>
	<content type="html">Hello. &amp;nbsp;My name is Alex Levin and I am the COO of Brilig (www.brilig.com) a startup in New York focused on the online advertising space. &amp;nbsp;We are looking to hire a Hadoop / Data Migration Specialist to play a crucial role in converting new client's data onto Brilig's service platforms. We are looking for a talented and energetic full-time freelance programmer to work both remotely and onsite at our Manhattan location. The Specialist will work with our clients' technical teams to determine the optimal formats and requirements to create files for subsequent import into a Brilig distributed file system during the implementation process. This process involves extracting, scrubbing, combining, transforming, validating and importing large data tables into final data sets suitable for loading into Brilig's defined databases. The Specialist will be responsible for creating/editing the database structure and writing of all import scripts and programs. The ability to work on multiple projects simultaneously while meeting tight deadlines is critical. This position is initially contract but may eventually lead to a full time position in our fun and exciting startup. Must be able to travel to client meetings and work independently. 
&lt;br&gt;&lt;br&gt;Responsibilities: 
&lt;br&gt;- Subject Matter Expert on software tools used in the entire data migration process from extraction to validation and load
&lt;br&gt;- Design, develop and execute quality data movement processes that are consistent, repeatable and scalable 
&lt;br&gt;&lt;br&gt;- Streamline testing, audit and validation processes through data scrubbing routines and presentation of audit reports prior to load
&lt;br&gt;- Roll out newly developed processes via documentation and training
&lt;br&gt;- Maintain and manage a template library of executed solutions to leverage against future opportunities
&lt;br&gt;- Identify, clarify, and resolve issues and risks, escalating them as needed
&lt;br&gt;- Build and nourish strong business relationships with external clients 
&lt;br&gt;&lt;br&gt;Please include: 
&lt;br&gt;&lt;br&gt;Salary Requirements 
&lt;br&gt;Availability 
&lt;br&gt;&lt;br&gt;Experience: 
&lt;br&gt;&lt;br&gt;At least 2 years experience with Htable, HBase and Hadoop 
&lt;br&gt;At least 3-5 years experience in the development of java applications 
&lt;br&gt;Use of XML and other protocols for data exchange between systems 
&lt;br&gt;SQL database design and implementation 
&lt;br&gt;Experience with Eclipse, Maven, and SVN a plus 
&lt;br&gt;Excellent communication skills with both technical and non-technical colleagues 
&lt;br&gt;Upper management and client facing skills 
&lt;br&gt;Interest in keeping up with technology advances 
&lt;br&gt;&lt;br&gt;PLEASE NOTE:
&lt;br&gt;US citizens and Green Card Holders and those authorized to work in the US only. We are unable to sponsor or transfer H-1B candidates. 
&lt;br&gt;&lt;br&gt;Contact:
&lt;br&gt;Alex Levin, COO
&lt;br&gt;Brilig
&lt;br&gt;alevin@brilig.com&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/Hadoop---Data-Migration-Specialist-Needed-tp26683812p26683812.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-26681351</id>
	<title>writing files to HDFS (from c++/pipes)</title>
	<published>2009-12-07T10:05:54Z</published>
	<updated>2009-12-07T10:05:54Z</updated>
	<author>
		<name>horson</name>
	</author>
	<content type="html">Hi,
&lt;br&gt;i want to write a file to hdfs, using hadoop pipes. can anyone tell me how to do that?
&lt;br&gt;Im using an external library that writes its output to disk, so probably i have to read that output and write it to the distributed filesystem?
&lt;br&gt;I found only FSDataOutputStream, a java class. 
&lt;br&gt;Can anyone help?
&lt;br&gt;Moreover, can anyone tell me where i can find goot documentation about hadoop pipes? Nearly everything i find is java specific or general information about MapReduce. I looked at the hadoop pipes source and it looked very restricted, can i do everything in hadoop pipes that possible in java?
&lt;br&gt;&lt;br&gt;Thank for your help
&lt;br&gt;horson&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/writing-files-to-HDFS-%28from-c%2B%2B-pipes%29-tp26681351p26681351.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-26635758</id>
	<title>Start Hadoop env using JAVA or HADOOP APIs (InProcess)</title>
	<published>2009-12-03T20:25:47Z</published>
	<updated>2009-12-03T20:25:47Z</updated>
	<author>
		<name>samuellawrence</name>
	</author>
	<content type="html">Hai,
&lt;br&gt;&lt;br&gt;I have to start the HADOOP environment using java code (inprocess). I would like to use the APIs to start it. &amp;nbsp;
&lt;br&gt;&lt;br&gt;Could anyone please give me snippet or a link.
&lt;br&gt;&lt;br&gt;Thanks in Advance.&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/Start-Hadoop-env-using-JAVA-or-HADOOP-APIs-%28InProcess%29-tp26635758p26635758.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-26605201</id>
	<title>Re: About Hadoop pseudo distribution</title>
	<published>2009-12-02T00:30:40Z</published>
	<updated>2009-12-02T00:30:40Z</updated>
	<author>
		<name>Doss_IPH</name>
	</author>
	<content type="html">Hi 
&lt;br&gt;Hadoop pseudo distribution runs five java processors which are given below 
&lt;br&gt;&amp;nbsp; 1. namenode, 
&lt;br&gt;&amp;nbsp; 2. secondarynamenode
&lt;br&gt;&amp;nbsp; 3. datanode
&lt;br&gt;&amp;nbsp; 4 jobtracker
&lt;br&gt;&amp;nbsp; 5. tasktracker
&lt;br&gt;&lt;br&gt;As you know namenode, secondarynamenode, datanode processors are for HDFS and jobtracker ,tastktracker are for MR (Map Reduce).
&lt;br&gt;&lt;br&gt;&lt;blockquote class=&quot;quote light-black dark-border-color&quot;&gt;&lt;div class=&quot;quote light-border-color&quot;&gt;
&lt;div class=&quot;quote-author&quot; style=&quot;font-weight: bold;&quot;&gt;kvorion wrote:&lt;/div&gt;
&lt;div class=&quot;quote-message shrinkable-quote&quot;&gt;Hi All,
&lt;br&gt;&lt;br&gt;I have been trying to set up a hadoop cluster on a number of machines, a few of which are multicore machines. I have been wondering whether the hadoop pseudo distribution is something that can help me take advantage of the multiple cores on my machines. All the tutorials say that the pseudo distribution mode lets you start each daemon in a separate java process. I have the following configuration settings for hadoop-site.xml:
&lt;br&gt;&lt;br&gt;&amp;lt;property&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;name&amp;gt;fs.default.name&amp;lt;/name&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;value&amp;gt;hdfs://athena:9000&amp;lt;/value&amp;gt;
&lt;br&gt;&amp;lt;/property&amp;gt;
&lt;br&gt;&lt;br&gt;&amp;lt;property&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;name&amp;gt;mapred.job.tracker&amp;lt;/name&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;value&amp;gt;athena:9001&amp;lt;/value&amp;gt;
&lt;br&gt;&amp;lt;/property&amp;gt;
&lt;br&gt;&lt;br&gt;&amp;lt;property&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;name&amp;gt;dfs.replication&amp;lt;/name&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;value&amp;gt;2&amp;lt;/value&amp;gt;
&lt;br&gt;&amp;lt;/property&amp;gt;
&lt;br&gt;&lt;br&gt;I am not sure if this is really running in the pseudo-distribution mode. Are there any indicators or outputs that confirm what mode you are running in?
&lt;br&gt;&lt;/div&gt;
&lt;/div&gt;&lt;/blockquote&gt;
&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/About-Hadoop-pseudo-distribution-tp26322382p26605201.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-26588090</id>
	<title>Help! FSNamesystem initialization failed.</title>
	<published>2009-12-01T00:51:25Z</published>
	<updated>2009-12-01T00:51:25Z</updated>
	<author>
		<name>Sergio Puchini</name>
	</author>
	<content type="html">Hi, suddenly I’ve got problem with starting Namenode:
&lt;br&gt;&lt;br&gt;[hadoop@hadoop1 hadoop]$ bin/hadoop namenode
&lt;br&gt;09/12/01 11:26:51 INFO namenode.NameNode: STARTUP_MSG:
&lt;br&gt;/************************************************************
&lt;br&gt;STARTUP_MSG: Starting NameNode
&lt;br&gt;STARTUP_MSG: &amp;nbsp; host = hadoop1.bcr.ru/81.68.243.18
&lt;br&gt;STARTUP_MSG: &amp;nbsp; args = []
&lt;br&gt;STARTUP_MSG: &amp;nbsp; version = 0.19.2-dev
&lt;br&gt;STARTUP_MSG: &amp;nbsp; build = &lt;a href=&quot;http://svn.apache.org/repos/asf/hadoop/core/branches/bran&quot; target=&quot;_top&quot; rel=&quot;nofollow&quot;&gt;http://svn.apache.org/repos/asf/hadoop/core/branches/bran&lt;/a&gt;&lt;br&gt;ch-0.19 -r 755955; compiled by 'maksim07' on Sun Mar 22 14:29:37 MSK 2009
&lt;br&gt;************************************************************/
&lt;br&gt;09/12/01 11:26:51 INFO metrics.RpcMetrics: Initializing RPC Metrics with hostName=NameNode, port=9000
&lt;br&gt;09/12/01 11:26:51 INFO namenode.NameNode: Namenode up at: hadoop1.bcr.ru/81.68.243.18:9000
&lt;br&gt;09/12/01 11:26:51 INFO jvm.JvmMetrics: Initializing JVM Metrics with processName=NameNode, sessionId=null
&lt;br&gt;09/12/01 11:26:51 INFO metrics.NameNodeMetrics: Initializing NameNodeMeterics using context object:org.apache.hadoop.metrics.spi.NullContext
&lt;br&gt;09/12/01 11:26:52 INFO namenode.FSNamesystem: fsOwner=hadoop,hadoop,dba
&lt;br&gt;09/12/01 11:26:52 INFO namenode.FSNamesystem: supergroup=supergroup
&lt;br&gt;09/12/01 11:26:52 INFO namenode.FSNamesystem: isPermissionEnabled=true
&lt;br&gt;09/12/01 11:26:52 INFO metrics.FSNamesystemMetrics: Initializing FSNamesystemMet
&lt;br&gt;rics using context object:org.apache.hadoop.metrics.spi.NullContext
&lt;br&gt;09/12/01 11:26:52 INFO namenode.FSNamesystem: Registered FSNamesystemStatusMBean
&lt;br&gt;09/12/01 11:26:52 INFO common.Storage: Number of files = 30079
&lt;br&gt;09/12/01 11:26:53 INFO common.Storage: Number of files under construction = 0
&lt;br&gt;09/12/01 11:26:53 INFO common.Storage: Image file of size 4451066 loaded in 1 seconds.
&lt;br&gt;09/12/01 11:27:11 ERROR namenode.FSNamesystem: FSNamesystem initialization failed.
&lt;br&gt;java.io.IOException: Incorrect data format. logVersion is -18 but writables.length is 0.
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.FSEditLog.loadFSEdits(FSEditLog.java:542)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.FSImage.loadFSEdits(FSImage.java:973)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.FSImage.loadFSImage(FSImage.java:793)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.FSImage.recoverTransitionRead(FSImage.java:352)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.FSDirectory.loadFSImage(FSDirectory.java:87)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.initialize(FSNamesystem.java:309)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.&amp;lt;init&amp;gt;(FSNamesystem.java:288)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:163)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.NameNode.&amp;lt;init&amp;gt;(NameNode.java:208)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.NameNode.&amp;lt;init&amp;gt;(NameNode.java:194)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:859)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:868)
&lt;br&gt;09/12/01 11:27:11 INFO ipc.Server: Stopping server on 9000
&lt;br&gt;09/12/01 11:27:11 ERROR namenode.NameNode: java.io.IOException: Incorrect data f
&lt;br&gt;ormat. logVersion is -18 but writables.length is 0.
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.FSEditLog.loadFSEdits(FSEditLog.java:542)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.FSImage.loadFSEdits(FSImage.java:973)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.FSImage.loadFSImage(FSImage.java:793)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.FSImage.recoverTransitionRead(FSImage.java:352)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.FSDirectory.loadFSImage(FSDirectory.java:87)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.initialize(FSNamesystem.java:309)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.&amp;lt;init&amp;gt;(FSNamesystem.java:288)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:163)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.NameNode.&amp;lt;init&amp;gt;(NameNode.java:208)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.NameNode.&amp;lt;init&amp;gt;(NameNode.java:194)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:859)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:868)
&lt;br&gt;09/12/01 11:27:11 INFO namenode.NameNode: SHUTDOWN_MSG:
&lt;br&gt;/************************************************************
&lt;br&gt;SHUTDOWN_MSG: Shutting down NameNode at hadoop1.bcr.ru/81.68.243.18
&lt;br&gt;************************************************************/
&lt;br&gt;&lt;br&gt;Any ideas?
&lt;br&gt;&lt;br&gt;Thanks!
&lt;br&gt;Sergio Puchini.
&lt;br&gt;&lt;br&gt;&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/Help%21-FSNamesystem-initialization-failed.-tp26588090p26588090.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-26574876</id>
	<title>How to change the MapReduce output coder?</title>
	<published>2009-11-30T05:41:18Z</published>
	<updated>2009-11-30T05:41:18Z</updated>
	<author>
		<name>gpcuster</name>
	</author>
	<content type="html">I know the default output coder is &lt;b&gt;utf-8&lt;/b&gt;, but how can we change it into a other coder? like gbk?
&lt;br&gt;&lt;br&gt;thx.&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/How-to-change-the-MapReduce-output-coder--tp26574876p26574876.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-26558264</id>
	<title>hadoop-0.20.1\src\hdfs\org\apache\hadoop\hdfs\server</title>
	<published>2009-11-28T16:11:34Z</published>
	<updated>2009-11-28T16:11:34Z</updated>
	<author>
		<name>dzisaacs</name>
	</author>
	<content type="html">why is this folder called &amp;quot;server&amp;quot;
&lt;br&gt;&lt;br&gt;what is this folder for?&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/hadoop-0.20.1%5Csrc%5Chdfs%5Corg%5Capache%5Chadoop%5Chdfs%5Cserver-tp26558264p26558264.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-26491406</id>
	<title>Re: Using hadoop for Matrix Multiplication in NFS?</title>
	<published>2009-11-23T23:17:59Z</published>
	<updated>2009-11-23T23:17:59Z</updated>
	<author>
		<name>Martin Mituzas</name>
	</author>
	<content type="html">ii) I once run the mapred program DistCp to copy data from NFS into HDFS. I mount the file system directory to each node. Thus it can be connected.
&lt;br&gt;&amp;nbsp;
&lt;br&gt;&lt;blockquote class=&quot;quote light-black dark-border-color&quot;&gt;&lt;div class=&quot;quote light-border-color&quot;&gt;
&lt;div class=&quot;quote-author&quot; style=&quot;font-weight: bold;&quot;&gt;Gimick wrote:&lt;/div&gt;
&lt;div class=&quot;quote-message shrinkable-quote&quot;&gt;Hi, I am new to hadoop. &amp;nbsp;I am planning to do matrix multiplication(of order millions) using hadoop. &amp;nbsp;
&lt;br&gt;&lt;br&gt;I have a few queries regarding the above.
&lt;br&gt;&lt;br&gt;i) Will using hadoop be a fix for this or should I try some other approaches?
&lt;br&gt;ii) I will be using it in NFS. &amp;nbsp;Will using hadoop still be a good option?
&lt;br&gt;&lt;br&gt;If I can use hadoop for this problem, could you plz send links to configure hadoop-site.xml file for a nfs system.
&lt;br&gt;&lt;br&gt;P.S. I tried a few setup instructions via search, but everything seems to give &amp;quot;Unable to connect to ....&amp;quot; error.
&lt;/div&gt;
&lt;/div&gt;&lt;/blockquote&gt;
&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/Using-hadoop-for-Matrix-Multiplication-in-NFS--tp26332382p26491406.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-26443615</id>
	<title>PIG bin/labeling relation</title>
	<published>2009-11-20T10:17:59Z</published>
	<updated>2009-11-20T10:17:59Z</updated>
	<author>
		<name>drd_</name>
	</author>
	<content type="html">I am using PIG and this is what I am trying to do:
&lt;br&gt;&lt;br&gt;1) Sort a relation A into B by a field x. The smallest value of x is first. Just use SORT.
&lt;br&gt;&lt;br&gt;2) Label each tuple in B with a number denoting its order in the sorted relation. So the first tuple would be labeled with a 1, the second tuple with a 2, the third with a 3 and so on. Not certain how to do this.
&lt;br&gt;&lt;br&gt;3) Derive a relation C where each row is a bag of tuples. The first row contains the first n1 tuples from relation B, the second row contains the tuples from B labeled (n1 + 1) to n2 from, the third row contains the tuples from B labeled (n2 + 1) to n3 and so on to n100. This step is simple (just use filter) once we've labeled each tuple in B with a number. 
&lt;br&gt;&lt;br&gt;The question: how do I do step 2).
&lt;br&gt;&lt;br&gt;thanks&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/PIG-bin-labeling-relation-tp26443615p26443615.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-26332382</id>
	<title>Using hadoop for Matrix Multiplication in NFS?</title>
	<published>2009-11-12T23:21:50Z</published>
	<updated>2009-11-12T23:21:50Z</updated>
	<author>
		<name>Gimick</name>
	</author>
	<content type="html">Hi, I am new to hadoop. &amp;nbsp;I am planning to do matrix multiplication(of order millions) using hadoop. &amp;nbsp;
&lt;br&gt;&lt;br&gt;I have a few queries regarding the above.
&lt;br&gt;&lt;br&gt;i) Will using hadoop be a fix for this or should I try some other approaches?
&lt;br&gt;ii) I will be using it in NFS. &amp;nbsp;Will using hadoop still be a good option?
&lt;br&gt;&lt;br&gt;If I can use hadoop for this problem, could you plz send links to configure hadoop-site.xml file for a nfs system.
&lt;br&gt;&lt;br&gt;P.S. I tried a few setup instructions via search, but everything seems to give &amp;quot;Unable to connect to ....&amp;quot; error.
&lt;br&gt;&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/Using-hadoop-for-Matrix-Multiplication-in-NFS--tp26332382p26332382.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-26322618</id>
	<title>Help !! Hadoop installation to One machine has 24 CPU 16 disk (Each one 2 TB)</title>
	<published>2009-11-12T09:17:01Z</published>
	<updated>2009-11-12T09:17:01Z</updated>
	<author>
		<name>dgoker</name>
	</author>
	<content type="html">Hi
&lt;br&gt;&lt;br&gt;I installed the hadoop to one server which has following configurations
&lt;br&gt;&lt;br&gt;&amp;nbsp;24 CPU, 
&lt;br&gt;&amp;nbsp;72 GB RAM
&lt;br&gt;&amp;nbsp;17 Disk (2 TB)
&lt;br&gt;&lt;br&gt;&amp;nbsp;
&lt;br&gt;All configuration belongs to Hadoop and Pig are is default settings. ın order to run process efficiently waht should be the following configuration settings. The settings i find on forums usually 4 CPU machines and clustered system.
&lt;br&gt;&lt;br&gt;&lt;br&gt;What do you suggest me following settings?
&lt;br&gt;&lt;br&gt;&lt;br&gt;mapred.tasktracker.reduce.tasks.maximum &amp;nbsp; ?
&lt;br&gt;mapred.map.tasks ?
&lt;br&gt;mapred.reduce.tasks ?
&lt;br&gt;dfs.datanode.handler.count ?
&lt;br&gt;&lt;br&gt;&lt;br&gt;&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/Help-%21%21-Hadoop-installation-to-One-machine-has-24-CPU-16-disk-%28Each-one-2-TB%29-tp26322618p26322618.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-26322437</id>
	<title>java code - parsing string -outofindex  error -File on HDFS</title>
	<published>2009-11-12T09:05:50Z</published>
	<updated>2009-11-12T09:05:50Z</updated>
	<author>
		<name>dgoker</name>
	</author>
	<content type="html">When i try to read a file from HDFS, i want to substring the record by java code (javada line.substring(1,9)) but i am getting outofindex error. When i do the same operation on same file by PIG code there is no problem.
&lt;br&gt;&lt;br&gt;I took following error, What shoul i do in Java. 
&lt;br&gt;&lt;br&gt;&lt;br&gt;java.lang.StringIndexOutOfBoundsException: String index out of range: 9
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at java.lang.String.substring(String.java:1934)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at WordCount$Map.map(WordCount.java:19)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at WordCount$Map.map(WordCount.java:12)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:358)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.mapred.MapTask.run(MapTask.java:307)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.mapred.Child.main(Child.java:170)
&lt;br&gt;&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-dev-f30593.html&quot; embed=&quot;fixTarget[30593]&quot; target=&quot;_top&quot; &gt;Hadoop core-dev&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/java-code---parsing-string--outofindex--error--File-on-HDFS-tp26322437p26322437.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-26322382</id>
	<title>About Hadoop pseudo distribution</title>
	<published>2009-11-12T09:02:57Z</published>
	<updated>2009-11-12T09:02:57Z</updated>
	<author>
		<name>kvorion</name>
	</author>
	<content type="html">Hi All,
&lt;br&gt;&lt;br&gt;I have been trying to set up a hadoop cluster on a number of machines, a few of which are multicore machines. I have been wondering whether the hadoop pseudo distribution is something that can help me take advantage of the multiple cores on my machines. All the tutorials say that the pseudo distribution mode lets you start each daemon in a separate java process. I have the following configuration settings for hadoop-site.xml:
&lt;br&gt;&lt;br&gt;&amp;lt;property&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;name&amp;gt;fs.default.name&amp;lt;/name&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;value&amp;gt;hdfs://athena:9000&amp;lt;/value&amp;gt;
&lt;br&gt;&amp;lt;/property&amp;gt;
&lt;br&gt;&lt;br&gt;&amp;lt;property&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;name&amp;gt;mapred.job.tracker&amp;lt;/name&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;value&amp;gt;athena:9001&amp;lt;/value&amp;gt;
&lt;br&gt;&amp;lt;/property&amp;gt;
&lt;br&gt;&lt;br&gt;&amp;lt;property&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;name&amp;gt;dfs.replication&amp;lt;/name&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;value&amp;gt;2&amp;lt;/value&amp;gt;
&lt;br&gt;&amp;lt;/property&amp;gt;
&lt;br&gt;&lt;br&gt;I am not sure if this is really running in the pseudo-distribution mode. Are there any indicators or outputs that confirm what mode you are running in?
&lt;br&gt;&lt;br&gt;&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/About-Hadoop-pseudo-distribution-tp26322382p26322382.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-26305272</id>
	<title>Re: Could not find any valid local directory for taskTracker</title>
	<published>2009-11-11T09:33:28Z</published>
	<updated>2009-11-11T09:33:28Z</updated>
	<author>
		<name>Saju K K</name>
	</author>
	<content type="html">Hi,
&lt;br&gt;&lt;br&gt;Did you get a solution for this problem ,we are facing a similar problem 
&lt;br&gt;&lt;br&gt;saju 
&lt;br&gt;&lt;blockquote class=&quot;quote light-black dark-border-color&quot;&gt;&lt;div class=&quot;quote light-border-color&quot;&gt;
&lt;div class=&quot;quote-author&quot; style=&quot;font-weight: bold;&quot;&gt;Pallavi Palleti wrote:&lt;/div&gt;
&lt;div class=&quot;quote-message shrinkable-quote&quot;&gt;Hi,
&lt;br&gt;&amp;nbsp;I got below error while running my hadoop task. But, when I tried after few hours, it worked fine. 
&lt;br&gt;Can some one please tell me why this error occured?
&lt;br&gt;&lt;br&gt;ERROR Below:
&lt;br&gt;&lt;br&gt;Error initializing task_200805161358_0158_m_000000_0:
&lt;br&gt;org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find any valid local directory for taskTracker/jobcache/job_200805161358_0158/job.xml
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:313)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:124)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.mapred.TaskTracker.localizeJob(TaskTracker.java:590)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.mapred.TaskTracker.startNewTask(TaskTracker.java:1172)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.mapred.TaskTracker.offerService(TaskTracker.java:836)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.mapred.TaskTracker.run(TaskTracker.java:1208)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.mapred.TaskTracker.main(TaskTracker.java:1924)
&lt;br&gt;&lt;br&gt;08/05/29 23:30:35 WARN mapred.JobClient: Error reading task outputhttp://hadoop31:50060/tasklog?plaintext=true&amp;taskid=task_200805161358_0158_m_000000_0&amp;filter=stdout
&lt;br&gt;08/05/29 23:30:35 WARN mapred.JobClient: Error reading task outputhttp://hadoop31:50060/tasklog?plaintext=true&amp;taskid=task_200805161358_0158_m_000000_0&amp;filter=stderr
&lt;br&gt;08/05/29 23:30:37 INFO mapred.JobClient: &amp;nbsp;map 83% reduce 0%
&lt;br&gt;&lt;br&gt;&lt;br&gt;Thanks
&lt;/div&gt;
&lt;/div&gt;&lt;/blockquote&gt;
&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/Could-not-find-any-valid-local-directory-for-taskTracker-tp17553998p26305272.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-26266407</id>
	<title>Using jobtracker api for developing UI in hadoop 0.19.1</title>
	<published>2009-11-09T05:28:06Z</published>
	<updated>2009-11-09T05:28:06Z</updated>
	<author>
		<name>pnd</name>
	</author>
	<content type="html">&amp;nbsp;Hi i am new to hadoop been using it for few weeks now, i tried few map/reduce example and could see logs in jobtracker.jsp.
&lt;br&gt;Is there any way to use jobtracker api(&lt;a href=&quot;http://hadoop.apache.org/common/docs/r0.19.1/api/org/apache/hadoop/mapred/JobTracker.html&quot; target=&quot;_top&quot; rel=&quot;nofollow&quot;&gt;http://hadoop.apache.org/common/docs/r0.19.1/api/org/apache/hadoop/mapred/JobTracker.html&lt;/a&gt;) &amp;nbsp;for tracking the jobs instead using jobtracker jsp because I am trying to build new UI to show the job progress according to my requirements.
&lt;br&gt;&lt;br&gt;Thanks in advance.
&lt;br&gt;&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/Using-jobtracker-api-for-developing-UI-in-hadoop-0.19.1-tp26266407p26266407.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-26142981</id>
	<title>Re: What if an XML file cross boundary of HDFS chunks?</title>
	<published>2009-10-31T07:38:57Z</published>
	<updated>2009-10-31T07:38:57Z</updated>
	<author>
		<name>Oliver Fischer-3</name>
	</author>
	<content type="html">-----BEGIN PGP SIGNED MESSAGE-----
&lt;br&gt;Hash: SHA1
&lt;br&gt;&lt;br&gt;Hello Jeff,
&lt;br&gt;&lt;br&gt;does it means, that there is no programmatic possibility to define where
&lt;br&gt;a logical file will be splitted independent of the distribution of it
&lt;br&gt;blocks in the HDFS?
&lt;br&gt;&lt;br&gt;Regards
&lt;br&gt;&lt;br&gt;Oliver
&lt;br&gt;&lt;br&gt;Jeff Zhang schrieb:
&lt;div class='shrinkable-quote'&gt;&lt;br&gt;&amp;gt; Hi Steve,
&lt;br&gt;&amp;gt; 
&lt;br&gt;&amp;gt; When you want to read xml, you should provide your custom InputFormat which
&lt;br&gt;&amp;gt; extends FileInputFormat.
&lt;br&gt;&amp;gt; 
&lt;br&gt;&amp;gt; and override the method isSplitable to not split a file , that means one xml
&lt;br&gt;&amp;gt; file for one mapper.
&lt;br&gt;&amp;gt; 
&lt;br&gt;&amp;gt; 
&lt;br&gt;&amp;gt; &amp;nbsp; protected boolean isSplitable(FileSystem fs, Path filename) {
&lt;br&gt;&amp;gt; &amp;nbsp; &amp;nbsp; return false;
&lt;br&gt;&amp;gt; &amp;nbsp; }
&lt;/div&gt;&lt;br&gt;&lt;br&gt;- --
&lt;br&gt;Oliver B. Fischer, Schönhauser Allee 64, 10437 Berlin
&lt;br&gt;Tel. +49 30 44793251, Mobil: +49 178 7903538
&lt;br&gt;Mail: &lt;a href=&quot;http://old.nabble.com/user/SendEmail.jtp?type=post&amp;post=26142981&amp;i=0&quot; target=&quot;_top&quot; rel=&quot;nofollow&quot;&gt;o.b.fischer@...&lt;/a&gt; Blog: &lt;a href=&quot;http://www.swe-blog.net&quot; target=&quot;_top&quot; rel=&quot;nofollow&quot;&gt;http://www.swe-blog.net&lt;/a&gt;&lt;br&gt;-----BEGIN PGP SIGNATURE-----
&lt;br&gt;Version: GnuPG v1.4.9 (MingW32)
&lt;br&gt;Comment: Using GnuPG with Mozilla - &lt;a href=&quot;http://enigmail.mozdev.org/&quot; target=&quot;_top&quot; rel=&quot;nofollow&quot;&gt;http://enigmail.mozdev.org/&lt;/a&gt;&lt;br&gt;&lt;br&gt;iQEcBAEBAgAGBQJK7EwBAAoJELeiwuwqd1DGO/wIAJl8wwf6uNgm/ZwsGh8M1xvz
&lt;br&gt;wSEH9sD2cfjUSV3rmpHndKEfSTEOeHvvaJmJn24K9HhB9w8QyDogAgHawCdBY2TE
&lt;br&gt;K27n4wqSGlbLpQz4XmKUOVtFSooeEPUT58Jn2aMAno+nrWHM7oq9tuCJAAYkBexV
&lt;br&gt;wCrc7eE+o55TlAlx+LDWWS9mJrdTNBYqzoHh0gnWsEGm98CWvzn08tNA/L2moJbQ
&lt;br&gt;HZwnWzfgEBKBwAZUOYLFt2GigIYN3GE0pMp33BgjWi91zPwGSk7Bcq7XhObLK7o/
&lt;br&gt;uYxS+s3BTkLy+R6ngjOW1NLvg6STX37FpFNZowDmPt8Bzd8GxAefnqcxkVcnb90=
&lt;br&gt;=t6vV
&lt;br&gt;-----END PGP SIGNATURE-----
&lt;br&gt;&lt;br&gt;&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-dev-f30593.html&quot; embed=&quot;fixTarget[30593]&quot; target=&quot;_top&quot; &gt;Hadoop core-dev&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/What-if-an-XML-file-cross-boundary-of-HDFS-chunks--tp26120236p26142981.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-26123313</id>
	<title>Re: What if an XML file cross boundary of HDFS chunks?</title>
	<published>2009-10-29T17:42:37Z</published>
	<updated>2009-10-29T17:42:37Z</updated>
	<author>
		<name>Jeff Zhang-4</name>
	</author>
	<content type="html">Hi Steve,
&lt;br&gt;&lt;br&gt;When you want to read xml, you should provide your custom InputFormat which
&lt;br&gt;extends FileInputFormat.
&lt;br&gt;&lt;br&gt;and override the method isSplitable to not split a file , that means one xml
&lt;br&gt;file for one mapper.
&lt;br&gt;&lt;br&gt;&lt;br&gt;&amp;nbsp; protected boolean isSplitable(FileSystem fs, Path filename) {
&lt;br&gt;&amp;nbsp; &amp;nbsp; return false;
&lt;br&gt;&amp;nbsp; }
&lt;br&gt;&lt;br&gt;&lt;br&gt;&lt;br&gt;Best Regards,
&lt;br&gt;&lt;br&gt;Jeff zhang
&lt;br&gt;&lt;br&gt;&lt;br&gt;&lt;br&gt;On Thu, Oct 29, 2009 at 12:32 PM, Steve Gao &amp;lt;&lt;a href=&quot;http://old.nabble.com/user/SendEmail.jtp?type=post&amp;post=26123313&amp;i=0&quot; target=&quot;_top&quot; rel=&quot;nofollow&quot;&gt;steve.gao@...&lt;/a&gt;&amp;gt; wrote:
&lt;br&gt;&lt;div class='shrinkable-quote'&gt;&lt;br&gt;&amp;gt;
&lt;br&gt;&amp;gt; Does anybody have the similar issue? If you store XML files in HDFS, how
&lt;br&gt;&amp;gt; can you make sure a chunk reads by a mapper does not contain partial data of
&lt;br&gt;&amp;gt; an XML segment?
&lt;br&gt;&amp;gt;
&lt;br&gt;&amp;gt; For example:
&lt;br&gt;&amp;gt;
&lt;br&gt;&amp;gt; &amp;lt;title&amp;gt;
&lt;br&gt;&amp;gt; &amp;lt;book&amp;gt;book1&amp;lt;/book&amp;gt;
&lt;br&gt;&amp;gt; &amp;lt;author&amp;gt;me&amp;lt;/author&amp;gt;
&lt;br&gt;&amp;gt; ..............what if this is the boundary of a chunk?...................
&lt;br&gt;&amp;gt; &amp;lt;year&amp;gt;2009&amp;lt;/year&amp;gt;
&lt;br&gt;&amp;gt; &amp;lt;book&amp;gt;book2&amp;lt;/book&amp;gt;
&lt;br&gt;&amp;gt;
&lt;br&gt;&amp;gt; &amp;lt;author&amp;gt;me&amp;lt;/author&amp;gt;
&lt;br&gt;&amp;gt;
&lt;br&gt;&amp;gt; &amp;lt;year&amp;gt;2009&amp;lt;/year&amp;gt;
&lt;br&gt;&amp;gt; &amp;lt;book&amp;gt;book3&amp;lt;/book&amp;gt;
&lt;br&gt;&amp;gt;
&lt;br&gt;&amp;gt; &amp;lt;author&amp;gt;me&amp;lt;/author&amp;gt;
&lt;br&gt;&amp;gt;
&lt;br&gt;&amp;gt; &amp;lt;year&amp;gt;2009&amp;lt;/year&amp;gt;
&lt;br&gt;&amp;gt; &amp;lt;title&amp;gt;
&lt;br&gt;&amp;gt;
&lt;br&gt;&amp;gt;
&lt;br&gt;&amp;gt;
&lt;br&gt;&amp;gt;
&lt;br&gt;&amp;gt;
&lt;br&gt;&amp;gt;
&lt;br&gt;&amp;gt;
&lt;br&gt;&lt;/div&gt;&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-dev-f30593.html&quot; embed=&quot;fixTarget[30593]&quot; target=&quot;_top&quot; &gt;Hadoop core-dev&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/What-if-an-XML-file-cross-boundary-of-HDFS-chunks--tp26120236p26123313.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-26120236</id>
	<title>What if an XML file cross boundary of HDFS chunks?</title>
	<published>2009-10-29T13:32:21Z</published>
	<updated>2009-10-29T13:32:21Z</updated>
	<author>
		<name>meili100</name>
	</author>
	<content type="html">&lt;br&gt;Does anybody have the similar issue? If you store XML files in HDFS, how can you make sure a chunk reads by a mapper does not contain partial data of an XML segment?
&lt;br&gt;&lt;br&gt;For example:
&lt;br&gt;&lt;br&gt;&amp;lt;title&amp;gt;
&lt;br&gt;&amp;lt;book&amp;gt;book1&amp;lt;/book&amp;gt;
&lt;br&gt;&amp;lt;author&amp;gt;me&amp;lt;/author&amp;gt;
&lt;br&gt;..............what if this is the boundary of a chunk?...................
&lt;br&gt;&amp;lt;year&amp;gt;2009&amp;lt;/year&amp;gt;
&lt;br&gt;&amp;lt;book&amp;gt;book2&amp;lt;/book&amp;gt;
&lt;br&gt;&lt;br&gt;&amp;lt;author&amp;gt;me&amp;lt;/author&amp;gt;
&lt;br&gt;&lt;br&gt;&amp;lt;year&amp;gt;2009&amp;lt;/year&amp;gt;
&lt;br&gt;&amp;lt;book&amp;gt;book3&amp;lt;/book&amp;gt;
&lt;br&gt;&lt;br&gt;&amp;lt;author&amp;gt;me&amp;lt;/author&amp;gt;
&lt;br&gt;&lt;br&gt;&amp;lt;year&amp;gt;2009&amp;lt;/year&amp;gt;
&lt;br&gt;&amp;lt;title&amp;gt;
&lt;br&gt;&lt;br&gt;&lt;br&gt;&lt;br&gt;      
&lt;br&gt;&lt;br&gt;&lt;br&gt;&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-dev-f30593.html&quot; embed=&quot;fixTarget[30593]&quot; target=&quot;_top&quot; &gt;Hadoop core-dev&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/What-if-an-XML-file-cross-boundary-of-HDFS-chunks--tp26120236p26120236.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-26018722</id>
	<title>Can I have multiple reducers?</title>
	<published>2009-10-22T16:17:18Z</published>
	<updated>2009-10-22T16:17:18Z</updated>
	<author>
		<name>Forhadoop</name>
	</author>
	<content type="html">Hello,

In my application I need to reduce the original reducer output keys further.

I was reading about Chainreducer and Chainmappers but looks like it is for : 
one or more mapper -&gt; reducer -&gt; 0 or more mappers

I need something like:
one or more mapper -&gt; reducer -&gt; reducer

Please help me figure out the best way to achieve it. Currently, the only options seems like I write another map reduce application and run it separately after the first map-reduce application. In this second application, the mapper will be dummy and won't do anything. The reducer will further club the first run outputs.

Any other comments such as this is not a good programming practice are welcome, so that I know I am in the wrong direction..&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/Can-I-have-multiple-reducers--tp26018722p26018722.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-26003660</id>
	<title>Re: Need Info</title>
	<published>2009-10-21T20:49:16Z</published>
	<updated>2009-10-21T20:49:16Z</updated>
	<author>
		<name>Doss_IPH</name>
	</author>
	<content type="html">&lt;br&gt;Hi,
&lt;br&gt;&amp;nbsp;you can use this pseudo code for loading data to HDFS.
&lt;br&gt;&lt;br&gt;import java.io.File;
&lt;br&gt;import java.net.URI;
&lt;br&gt;import org.apache.hadoop.conf.Configuration;
&lt;br&gt;import org.apache.hadoop.fs.FileStatus;
&lt;br&gt;import org.apache.hadoop.fs.FileSystem;
&lt;br&gt;import org.apache.hadoop.fs.Path;
&lt;br&gt;import org.apache.hadoop.hdfs.DistributedFileSystem;
&lt;br&gt;&lt;br&gt;/**
&lt;br&gt;&amp;nbsp;* @author: Arockia Doss S
&lt;br&gt;&amp;nbsp;* @emailto: doss@intellipowerhive.com
&lt;br&gt;&amp;nbsp;* @url: &lt;a href=&quot;http://www.intellipowerhive.com,http://www.dossinfotech.com&quot; target=&quot;_top&quot; rel=&quot;nofollow&quot;&gt;http://www.intellipowerhive.com,http://www.dossinfotech.com&lt;/a&gt;&lt;br&gt;&amp;nbsp;* @comments: You can use and modify this code for your use.
&lt;br&gt;&amp;nbsp;* @About this: This below code works in hadoop-0.19.0 version platform. 
&lt;br&gt;&amp;nbsp;* &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;If you want to test this code, you have set the hadoop libraries in your class path.
&lt;br&gt;&amp;nbsp;* &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;You need to give set of parameters before running it (like Hadoop path, Host, Users).
&lt;br&gt;&amp;nbsp;*/
&lt;br&gt;&lt;br&gt;public class HadoopConfiguration { &amp;nbsp; &amp;nbsp;
&lt;br&gt;&amp;nbsp; &amp;nbsp; //Hadoop Absolute Path
&lt;br&gt;&amp;nbsp; &amp;nbsp; private static final String CLUSTERPATH=&amp;quot;/home/hadoop-0.19.0/&amp;quot;;
&lt;br&gt;&amp;nbsp; &amp;nbsp; private static final String SITEFILE = &amp;quot;conf/hadoop-site.xml&amp;quot;;
&lt;br&gt;&amp;nbsp; &amp;nbsp; private static final String DEFAULTFILE = &amp;quot;conf/hadoop-default.xml&amp;quot;;
&lt;br&gt;&amp;nbsp; &amp;nbsp; //Hadoop Name Node Host
&lt;br&gt;&amp;nbsp; &amp;nbsp; private static final String HADOOPHOST = &amp;quot;192.168.1.11&amp;quot;;
&lt;br&gt;&amp;nbsp; &amp;nbsp; //Hadoop Root and its users list
&lt;br&gt;&amp;nbsp; &amp;nbsp; private static final String HOSTUSERS = &amp;quot;root,doss&amp;quot;; &amp;nbsp; &amp;nbsp;
&lt;br&gt;&amp;nbsp; &amp;nbsp; private static Configuration conf = new Configuration();
&lt;br&gt;&amp;nbsp; &amp;nbsp; private static DistributedFileSystem dfs = new DistributedFileSystem();
&lt;br&gt;&amp;nbsp; &amp;nbsp; 
&lt;br&gt;&amp;nbsp; &amp;nbsp; public HadoopConfiguration() &amp;nbsp;throws java.lang.Exception{
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; Path sitepath = new Path(CLUSTERPATH+SITEFILE);
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; Path defaultpath = new Path(CLUSTERPATH+DEFAULTFILE);
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; getConf().set(&amp;quot;fs.default.name&amp;quot;,&amp;quot;hdfs://&amp;quot;+HADOOPHOST+&amp;quot;:9000/&amp;quot;);
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; getConf().addResource(sitepath);
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; getConf().addResource(defaultpath);
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; getConf().set(&amp;quot;hadoop.job.ugi&amp;quot;, HOSTUSERS);
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; dfs.initialize(new URI(&amp;quot;hdfs://&amp;quot;+HADOOPHOST+&amp;quot;:9000/&amp;quot;), conf); &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;
&lt;br&gt;&amp;nbsp; &amp;nbsp; }
&lt;br&gt;&amp;nbsp; &amp;nbsp; 
&lt;br&gt;&amp;nbsp; &amp;nbsp; public static Configuration getConf(){
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; return conf;
&lt;br&gt;&amp;nbsp; &amp;nbsp; }
&lt;br&gt;&amp;nbsp; &amp;nbsp; 
&lt;br&gt;&amp;nbsp; &amp;nbsp; public static void main(String[] args){
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; try{
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; HadoopConfiguration h = new HadoopConfiguration();
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; FileSystem fs = FileSystem.get(h.getConf());
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; //Copy sample.xls file to HDFS, The file will be there after copying it.
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; fs.copyFromLocalFile(new Path(&amp;quot;/home/sample.xls&amp;quot;),new Path(&amp;quot;/home/xls/&amp;quot;));
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; //Move sample.doc file to HDFS, The file will not be there after moving it.
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; fs.moveFromLocalFile(new Path(&amp;quot;/home/sample.doc&amp;quot;),new Path(&amp;quot;/home/doc/&amp;quot;));
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; //This below code gives to list the files from HDFS
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; FileStatus[] &amp;nbsp;fileStatus = fs.listStatus(new Path(&amp;quot;/home/xls&amp;quot;));
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; for(int i=0;i&amp;lt;fileStatus.length;i++){
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; Path path = fileStatus[i].getPath();
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; }
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; }catch(java.lang.Exception e){
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; System.out.println(e);
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; }
&lt;br&gt;&amp;nbsp; &amp;nbsp; }
&lt;br&gt;&amp;nbsp; &amp;nbsp; 
&lt;br&gt;}
&lt;br&gt;&lt;br&gt;&lt;br&gt;&lt;br&gt;&lt;quote author=&quot;shwitzu&quot;&gt;&lt;br&gt;Thanks for Responding.
&lt;br&gt;&lt;br&gt;I read about HDFS and understood how it works and I also installed hadoop in my windows using cygwin and tried a sample driver code and made sure it works.
&lt;br&gt;&lt;br&gt;But my concern is, given the problem statement how should I proceed
&lt;br&gt;&lt;br&gt;Could you please give me some clue/ pseudo code or a design.
&lt;br&gt;&lt;br&gt;Thanks in anticipation. 
&lt;br&gt;&lt;br&gt;&lt;br&gt;&lt;blockquote class=&quot;quote light-black dark-border-color&quot;&gt;&lt;div class=&quot;quote light-border-color&quot;&gt;
&lt;div class=&quot;quote-author&quot; style=&quot;font-weight: bold;&quot;&gt;Doss_IPH wrote:&lt;/div&gt;
&lt;div class=&quot;quote-message shrinkable-quote&quot;&gt;First and for most, you need to understand about hadoop platform infrastructures. 
&lt;br&gt;Currently, I am working in real time application using hadoop. I think that Hadoop will be fit to your requirements. 
&lt;br&gt;Hadoop is mainly for three things,
&lt;br&gt;1. Scalability no limit for storage
&lt;br&gt;2. Peta bytes of data processing in distributed parallel mode.
&lt;br&gt;3. Fault tolerance (Automatically Block Replication) recovering data from failure. 
&lt;br&gt;&lt;br&gt;&lt;blockquote class=&quot;quote light-black dark-border-color&quot;&gt;&lt;div class=&quot;quote light-border-color&quot;&gt;
&lt;div class=&quot;quote-author&quot; style=&quot;font-weight: bold;&quot;&gt;shwitzu wrote:&lt;/div&gt;
&lt;div class=&quot;quote-message shrinkable-quote&quot;&gt;Hello Sir!
&lt;br&gt;&lt;br&gt;I am new to hadoop. I have a project &amp;nbsp;based on webservices. I have my information in 4 databases with different files in each one of them. Say, images in one, video, documents etc. My task is to develop a web service which accepts the keyword from the client and process the request and send back the actual requested file back to the user. Now I have to use Hadoop distributed file system in this project.
&lt;br&gt;&lt;br&gt;I have the following questions:
&lt;br&gt;&lt;br&gt;1) How should I start with the design?
&lt;br&gt;2) &amp;nbsp;Should I upload all the files and create Map, Reduce and Driver code and once I run my application will it automatically go the file system and get back the results to me?
&lt;br&gt;3) How do i handle the binary data? I want to store binary format data using MTOM in my databse.
&lt;br&gt;&lt;br&gt;Please let me know how I should proceed. I dont know much about this hadoop and am I searching for some help. It would be great if you could assist me. Thanks again
&lt;/div&gt;
&lt;/div&gt;&lt;/blockquote&gt;
&lt;/quote&gt;&lt;/div&gt;
&lt;/div&gt;&lt;/blockquote&gt;
&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/Need-Info-tp25901902p26003660.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-25996385</id>
	<title>Regd. Hadoop Implementation</title>
	<published>2009-10-21T10:02:55Z</published>
	<updated>2009-10-28T15:51:40Z</updated>
	<author>
		<name>shwitzu</name>
	</author>
	<content type="html">Thanks for Responding,
&lt;br&gt;&lt;br&gt;I read about HDFS and understood how it works and I also installed hadoop in my windows using cygwin and tried a sample driver code and made sure it works.
&lt;br&gt;&lt;br&gt;But my concern is, given the problem statement how should I proceed
&lt;br&gt;&lt;br&gt;Could you please give me some clue/ pseudo code or a design.
&lt;br&gt;&lt;br&gt;Thanks in anticipation. 
&lt;br&gt;&lt;br&gt;&lt;br&gt;&lt;quote author=&quot;Doss_IPH&quot;&gt;&lt;br&gt;First and for most, you need to understand about hadoop platform infrastructures. 
&lt;br&gt;Currently, I am working in real time application using hadoop. I think that Hadoop will be fit to your requirements. 
&lt;br&gt;Hadoop is mainly for three things,
&lt;br&gt;1. Scalability no limit for storage
&lt;br&gt;2. Peta bytes of data processing in distributed parallel mode.
&lt;br&gt;3. Fault tolerance (Automatically Block Replication) recovering data from failure. 
&lt;br&gt;&lt;br&gt;&lt;blockquote class=&quot;quote light-black dark-border-color&quot;&gt;&lt;div class=&quot;quote light-border-color&quot;&gt;
&lt;div class=&quot;quote-author&quot; style=&quot;font-weight: bold;&quot;&gt;shwitzu wrote:&lt;/div&gt;
&lt;div class=&quot;quote-message shrinkable-quote&quot;&gt;Hello Sir!
&lt;br&gt;&lt;br&gt;I am new to hadoop. I have a project &amp;nbsp;based on webservices. I have my information in 4 databases with different files in each one of them. Say, images in one, video, documents etc. My task is to develop a web service which accepts the keyword from the client and process the request and send back the actual requested file back to the user. Now I have to use Hadoop distributed file system in this project.
&lt;br&gt;&lt;br&gt;I have the following questions:
&lt;br&gt;&lt;br&gt;1) How should I start with the design?
&lt;br&gt;2) &amp;nbsp;Should I upload all the files and create Map, Reduce and Driver code and once I run my application will it automatically go the file system and get back the results to me?
&lt;br&gt;3) How do i handle the binary data? I want to store binary format data using MTOM in my databse.
&lt;br&gt;&lt;br&gt;Please let me know how I should proceed. I dont know much about this hadoop and am I searching for some help. It would be great if you could assist me. Thanks again
&lt;/div&gt;
&lt;/div&gt;&lt;/blockquote&gt;
&lt;/quote&gt;&lt;br&gt;&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/Need-Info-tp25901902p25996385.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-25995151</id>
	<title>Re: Datanode Throwing NoRouteToHostException</title>
	<published>2009-10-21T08:46:18Z</published>
	<updated>2009-10-21T08:46:18Z</updated>
	<author>
		<name>lazikid</name>
	</author>
	<content type="html">Thanks very much for your reply and time. I found it was a firewall issue. After stopping iptables, everything worked fine.
&lt;br&gt;&lt;br&gt;Thanks again.
&lt;br&gt;&lt;br&gt;&lt;quote author=&quot;Doss_IPH&quot;&gt;&lt;br&gt;Comment all the IPs in /etc/hosts first except localhost or 127.0.0.1 as i have given below. Then, give in conf/hadoop-site.xml
&lt;br&gt;fs.default.name parameter value is hdfs://hdfs://[Master Node IP]:9000/
&lt;br&gt;mapred.job.tracker parameter value is hdfs://[Job Tracker IP]:9001
&lt;br&gt;Make sure you have enabled ssh and sshd running without password.
&lt;br&gt;&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/Datanode-Throwing-NoRouteToHostException-tp25946798p25995151.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-25986574</id>
	<title>Re: DataNode is shutting down</title>
	<published>2009-10-20T22:09:25Z</published>
	<updated>2009-10-20T22:09:25Z</updated>
	<author>
		<name>Doss_IPH</name>
	</author>
	<content type="html">Hi!
&lt;br&gt;&lt;br&gt;Minimum HDFS replication must be 3.
&lt;br&gt;&lt;br&gt;&lt;blockquote class=&quot;quote light-black dark-border-color&quot;&gt;&lt;div class=&quot;quote light-border-color&quot;&gt;
&lt;div class=&quot;quote-author&quot; style=&quot;font-weight: bold;&quot;&gt;yibo820217 wrote:&lt;/div&gt;
&lt;div class=&quot;quote-message shrinkable-quote&quot;&gt;hi all,there is my problem.
&lt;br&gt;when add a datanode to hadoop,the way is;
&lt;br&gt;1.in namenode add the new datanode to conf/slave
&lt;br&gt;&lt;br&gt;2.in new datanode cd $HADOOP_HOME then
&lt;br&gt;$ bin/hadoop-daemon.sh start datanode 
&lt;br&gt;$ bin/hadoop-daemon.sh start tasktracker
&lt;br&gt;&lt;br&gt;3.in namenode,
&lt;br&gt;$bin/hadoop &amp;nbsp;balancer
&lt;br&gt;&lt;br&gt;and the new datanode is added to hadoop,but a old datanode is shutdown
&lt;br&gt;and here is the logs in datanode which is shutdown
&lt;br&gt;&lt;br&gt;2009-10-14 13:16:30,604 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: BlockReport of 3 blocks got processed in 5 msecs
&lt;br&gt;2009-10-14 13:48:44,395 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: DatanodeCommand action: DNA_REGISTER
&lt;br&gt;2009-10-14 13:48:47,402 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: DatanodeCommand action: DNA_REGISTER
&lt;br&gt;2009-10-14 13:48:50,403 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: DatanodeCommand action: DNA_REGISTER
&lt;br&gt;2009-10-14 13:48:53,407 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: DatanodeCommand action: DNA_REGISTER
&lt;br&gt;2009-10-14 13:48:56,418 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: DatanodeCommand action: DNA_REGISTER
&lt;br&gt;2009-10-14 13:48:59,415 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: DatanodeCommand action: DNA_REGISTER
&lt;br&gt;2009-10-14 13:49:02,420 WARN org.apache.hadoop.hdfs.server.datanode.DataNode: DataNode is shutting down: org.apache.hadoop.ipc.RemoteException: org.apache.hadoop.hdfs.protocol.UnregisteredDatanodeException: Data node 100.207.100.33:50010 is attempting to report storage ID DS-1277539940-100.207.100.33-50010-1255486116525. Node 100.207.100.25:50010 is expected to serve this storage.
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getDatanode(FSNamesystem.java:3914)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.processReport(FSNamesystem.java:2885)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.NameNode.blockReport(NameNode.java:715)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at sun.reflect.GeneratedMethodAccessor11.invoke(Unknown Source)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at java.lang.reflect.Method.invoke(Method.java:597)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:508)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:959)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:955)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at java.security.AccessController.doPrivileged(Native Method)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at javax.security.auth.Subject.doAs(Subject.java:396)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Server$Handler.run(Server.java:953)
&lt;br&gt;&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Client.call(Client.java:739)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:220)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at $Proxy4.blockReport(Unknown Source)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.datanode.DataNode.offerService(DataNode.java:756)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.datanode.DataNode.run(DataNode.java:1186)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at java.lang.Thread.run(Thread.java:619)
&lt;br&gt;&lt;br&gt;2009-10-14 13:49:02,527 INFO org.apache.hadoop.ipc.Server: Stopping server on 50020
&lt;br&gt;2009-10-14 13:49:02,528 INFO org.apache.hadoop.ipc.Server: IPC Server handler 1 on 50020: exiting
&lt;br&gt;2009-10-14 13:49:02,529 INFO org.apache.hadoop.ipc.Server: Stopping IPC Server Responder
&lt;br&gt;2009-10-14 13:49:02,529 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: Waiting for threadgroup to exit, active threads is 1
&lt;br&gt;2009-10-14 13:49:02,528 INFO org.apache.hadoop.ipc.Server: IPC Server handler 0 on 50020: exiting
&lt;br&gt;2009-10-14 13:49:02,528 INFO org.apache.hadoop.ipc.Server: IPC Server handler 2 on 50020: exiting
&lt;br&gt;2009-10-14 13:49:02,529 WARN org.apache.hadoop.hdfs.server.datanode.DataNode: DatanodeRegistration(10.207.0.33:50010, storageID=DS-1277539940-100.207.100.33-50010-1255486116525, infoPort=50075, ipcPort=50020):DataXceiveServer: java.nio.channels.AsynchronousCloseException
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at java.nio.channels.spi.AbstractInterruptibleChannel.end(AbstractInterruptibleChannel.java:185)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at sun.nio.ch.ServerSocketChannelImpl.accept(ServerSocketChannelImpl.java:152)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at sun.nio.ch.ServerSocketAdaptor.accept(ServerSocketAdaptor.java:84)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.datanode.DataXceiverServer.run(DataXceiverServer.java:130)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at java.lang.Thread.run(Thread.java:619)
&lt;br&gt;&lt;br&gt;2009-10-14 13:49:02,530 INFO org.apache.hadoop.ipc.Server: Stopping IPC Server listener on 50020
&lt;br&gt;2009-10-14 13:49:03,267 INFO org.apache.hadoop.hdfs.server.datanode.DataBlockScanner: Exiting DataBlockScanner thread.
&lt;br&gt;2009-10-14 13:49:03,530 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: Waiting for threadgroup to exit, active threads is 0
&lt;br&gt;2009-10-14 13:49:03,635 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: DatanodeRegistration(10.207.0.33:50010, storageID=DS-1277539940-100.207.100.33-50010-1255486116525, infoPort=50075, ipcPort=50020):Finishing DataNode in: FSDataset{dirpath='/data0/hadoop/hadoopfs/data/current'}
&lt;br&gt;2009-10-14 13:49:03,635 INFO org.apache.hadoop.ipc.Server: Stopping server on 50020
&lt;br&gt;2009-10-14 13:49:03,635 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: Waiting for threadgroup to exit, active threads is 0
&lt;br&gt;2009-10-14 13:49:03,636 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: SHUTDOWN_MSG: 
&lt;br&gt;/************************************************************
&lt;br&gt;SHUTDOWN_MSG: Shutting down DataNode at cent52ip33/100.207.100.33
&lt;br&gt;************************************************************/
&lt;br&gt;&lt;br&gt;----hdfs-core.xml----
&lt;br&gt;...
&lt;br&gt;&amp;nbsp; &amp;lt;property&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;lt;name&amp;gt;dfs.replication&amp;lt;/name&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;lt;value&amp;gt;1&amp;lt;/value&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;/property&amp;gt;
&lt;br&gt;...
&lt;br&gt;&lt;br&gt;can anybody help me please?
&lt;br&gt;&lt;br&gt;Thanks!
&lt;br&gt;&lt;br&gt;Darren.
&lt;br&gt;&lt;br&gt;&lt;br&gt;&lt;/div&gt;
&lt;/div&gt;&lt;/blockquote&gt;
&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/DataNode-is-shutting-down-tp25885861p25986574.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-25986550</id>
	<title>Re: Need Info</title>
	<published>2009-10-20T22:05:18Z</published>
	<updated>2009-10-20T22:05:18Z</updated>
	<author>
		<name>Doss_IPH</name>
	</author>
	<content type="html">First and for most, you need to understand about hadoop platform infrastructures. 
&lt;br&gt;Currently, I am working in real time application using hadoop. I think that Hadoop will be fit to your requirements. 
&lt;br&gt;Hadoop is mainly for three things,
&lt;br&gt;1. Scalability no limit for storage
&lt;br&gt;2. Peta bytes of data processing in distributed parallel mode.
&lt;br&gt;3. Fault tolerance (Automatically Block Replication) recovering data from failure. 
&lt;br&gt;&lt;br&gt;&lt;blockquote class=&quot;quote light-black dark-border-color&quot;&gt;&lt;div class=&quot;quote light-border-color&quot;&gt;
&lt;div class=&quot;quote-author&quot; style=&quot;font-weight: bold;&quot;&gt;shwitzu wrote:&lt;/div&gt;
&lt;div class=&quot;quote-message shrinkable-quote&quot;&gt;Hello Sir!
&lt;br&gt;&lt;br&gt;I am new to hadoop. I have a project &amp;nbsp;based on webservices. I have my information in 4 databases with different files in each one of them. Say, images in one, video, documents etc. My task is to develop a web service which accepts the keyword from the client and process the request and send back the actual requested file back to the user. Now I have to use Hadoop distributed file system in this project.
&lt;br&gt;&lt;br&gt;I have the following questions:
&lt;br&gt;&lt;br&gt;1) How should I start with the design?
&lt;br&gt;2) &amp;nbsp;Should I upload all the files and create Map, Reduce and Driver code and once I run my application will it automatically go the file system and get back the results to me?
&lt;br&gt;3) How do i handle the binary data? I want to store binary format data using MTOM in my databse.
&lt;br&gt;&lt;br&gt;Please let me know how I should proceed. I dont know much about this hadoop and am I searching for some help. It would be great if you could assist me. Thanks again
&lt;/div&gt;
&lt;/div&gt;&lt;/blockquote&gt;
&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/Need-Info-tp25901902p25986550.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-25986468</id>
	<title>Re: Datanode Throwing NoRouteToHostException</title>
	<published>2009-10-20T21:46:49Z</published>
	<updated>2009-10-20T21:46:49Z</updated>
	<author>
		<name>Doss_IPH</name>
	</author>
	<content type="html">Comment all the IPs in /etc/hosts first except localhost or 127.0.0.1 as i have given below. Then, give in conf/hadoop-site.xml
&lt;br&gt;fs.default.name parameter value is hdfs://hdfs://[Master Node IP]:9000/
&lt;br&gt;mapred.job.tracker parameter value is hdfs://[Job Tracker IP]:9001
&lt;br&gt;Make sure you have enabled ssh and sshd running without password.
&lt;br&gt;&lt;br&gt;&lt;blockquote class=&quot;quote light-black dark-border-color&quot;&gt;&lt;div class=&quot;quote light-border-color&quot;&gt;
&lt;div class=&quot;quote-author&quot; style=&quot;font-weight: bold;&quot;&gt;lazikid wrote:&lt;/div&gt;
&lt;div class=&quot;quote-message shrinkable-quote&quot;&gt;I need some help with setting up a Hadoop cluster. The datanode on the slave is not coming up throwing java.net.NoRouteToHostException: No route to host. Please see the details below.
&lt;br&gt;&lt;br&gt;I have a centos master and a fedora slave. Both have Java 6 and Hadoop 0.20.1. I instaleed hadoop under /opt in both machines.
&lt;br&gt;&lt;br&gt;The machines can ping one another using hostname and ip. They can also make password-less ssh connection to one another using both hostname and ip.
&lt;br&gt;&lt;br&gt;This is what I have in the /etc/hosts file for the master :
&lt;br&gt;&lt;br&gt;************************************************
&lt;br&gt;127.0.0.1 &amp;nbsp; &amp;nbsp; &amp;nbsp; localhost.localdomain localhost
&lt;br&gt;#192.168.1.125 &amp;nbsp; centos
&lt;br&gt;#192.168.1.102 &amp;nbsp; fedora
&lt;br&gt;::1 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; localhost6.localdomain6 localhost6
&lt;br&gt;************************************************
&lt;br&gt;&lt;br&gt;This is what I have in the /etc/hosts for the slave
&lt;br&gt;&lt;br&gt;************************************************
&lt;br&gt;127.0.0.1	localhost.localdomain localhost
&lt;br&gt;#192.168.1.104	ubuntu64
&lt;br&gt;#192.168.1.102	fedora
&lt;br&gt;#192.168.1.125	centos
&lt;br&gt;::1 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; localhost localhost.localdomain localhost6 localhost6.localdomain6
&lt;br&gt;************************************************
&lt;br&gt;&lt;br&gt;&lt;br&gt;&lt;br&gt;These are my configuration files :
&lt;br&gt;&lt;br&gt;1. core-site.xml 
&lt;br&gt;&lt;br&gt;&amp;lt;configuration&amp;gt;
&lt;br&gt;&amp;lt;property&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;name&amp;gt;fs.default.name&amp;lt;/name&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;value&amp;gt;hdfs://centos:54310&amp;lt;/value&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;description&amp;gt;determine the host, port, etc. for a filesystem.&amp;lt;/description&amp;gt;
&lt;br&gt;&amp;lt;/property&amp;gt;
&lt;br&gt;&amp;lt;/configuration&amp;gt;
&lt;br&gt;&lt;br&gt;2. mapred-site.xml
&lt;br&gt;&lt;br&gt;&amp;lt;configuration&amp;gt;
&lt;br&gt;&amp;lt;property&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;name&amp;gt;mapred.job.tracker&amp;lt;/name&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;value&amp;gt;hdfs://centos:54311&amp;lt;/value&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;description&amp;gt;....&amp;lt;/description&amp;gt;
&lt;br&gt;&amp;lt;/property&amp;gt;
&lt;br&gt;&amp;lt;/configuration&amp;gt;
&lt;br&gt;&lt;br&gt;3. masters
&lt;br&gt;&amp;nbsp; 
&lt;br&gt;&amp;nbsp; &amp;nbsp;centos
&lt;br&gt;&lt;br&gt;4. slaves
&lt;br&gt;&lt;br&gt;&amp;nbsp; &amp;nbsp;fedora
&lt;br&gt;&lt;br&gt;&lt;br&gt;I issued bin/start-dfs.sh from the master(centos) machine and this is what I see :
&lt;br&gt;&lt;br&gt;starting namenode, logging to xxxxxxxxxxx
&lt;br&gt;fedora: starting datanode, logging to xxxxxxx
&lt;br&gt;centos: starting secondarynamenode, logging to xxxxxx
&lt;br&gt;&lt;br&gt;When I checked the slave(fedora) logs, this what I see :
&lt;br&gt;&lt;br&gt;*************************************************************************
&lt;br&gt;STARTUP_MSG: Starting DataNode
&lt;br&gt;STARTUP_MSG: &amp;nbsp; host = fedora/192.168.1.102
&lt;br&gt;STARTUP_MSG: &amp;nbsp; args = []
&lt;br&gt;STARTUP_MSG: &amp;nbsp; version = 0.20.1
&lt;br&gt;STARTUP_MSG: &amp;nbsp; build = &lt;a href=&quot;http://svn.apache.org/repos/asf/hadoop/common/tags/release-0.20.1-rc1&quot; target=&quot;_top&quot; rel=&quot;nofollow&quot;&gt;http://svn.apache.org/repos/asf/hadoop/common/tags/release-0.20.1-rc1&lt;/a&gt;&amp;nbsp;-r 810220; compiled by 'oom' on Tue Sep &amp;nbsp;1 20:55:56 UTC 
&lt;br&gt;&lt;br&gt;2009
&lt;br&gt;************************************************************/
&lt;br&gt;2009-10-17 17:05:43,385 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: centos/192.168.1.125:54310. Already tried 0 time(s).
&lt;br&gt;2009-10-17 17:05:44,387 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: centos/192.168.1.125:54310. Already tried 1 time(s).
&lt;br&gt;2009-10-17 17:05:45,389 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: centos/192.168.1.125:54310. Already tried 2 time(s).
&lt;br&gt;2009-10-17 17:05:46,390 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: centos/192.168.1.125:54310. Already tried 3 time(s).
&lt;br&gt;2009-10-17 17:05:47,388 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: centos/192.168.1.125:54310. Already tried 4 time(s).
&lt;br&gt;2009-10-17 17:05:48,390 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: centos/192.168.1.125:54310. Already tried 5 time(s).
&lt;br&gt;2009-10-17 17:05:49,393 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: centos/192.168.1.125:54310. Already tried 6 time(s).
&lt;br&gt;2009-10-17 17:05:50,394 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: centos/192.168.1.125:54310. Already tried 7 time(s).
&lt;br&gt;2009-10-17 17:05:51,395 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: centos/192.168.1.125:54310. Already tried 8 time(s).
&lt;br&gt;2009-10-17 17:05:52,398 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: centos/192.168.1.125:54310. Already tried 9 time(s).
&lt;br&gt;2009-10-17 17:05:52,404 ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: java.io.IOException: Call to centos/192.168.1.125:54310 failed on local 
&lt;br&gt;&lt;br&gt;exception: java.net.NoRouteToHostException: No route to host
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Client.wrapException(Client.java:774)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Client.call(Client.java:742)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:220)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at $Proxy4.getProtocolVersion(Unknown Source)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:359)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:346)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:383)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.RPC.waitForProxy(RPC.java:314)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.RPC.waitForProxy(RPC.java:291)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.datanode.DataNode.startDataNode(DataNode.java:269)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.datanode.DataNode.&amp;lt;init&amp;gt;(DataNode.java:216)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.datanode.DataNode.makeInstance(DataNode.java:1283)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.datanode.DataNode.instantiateDataNode(DataNode.java:1238)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.datanode.DataNode.createDataNode(DataNode.java:1246)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.datanode.DataNode.main(DataNode.java:1368)
&lt;br&gt;Caused by: java.net.NoRouteToHostException: No route to host
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:574)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:206)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:404)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:304)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Client$Connection.access$1700(Client.java:176)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Client.getConnection(Client.java:859)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Client.call(Client.java:719)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ... 13 more
&lt;br&gt;&lt;br&gt;2009-10-17 17:05:52,407 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: SHUTDOWN_MSG: 
&lt;br&gt;/************************************************************
&lt;br&gt;SHUTDOWN_MSG: Shutting down DataNode at fedora/192.168.1.102
&lt;br&gt;************************************************************/
&lt;br&gt;&lt;br&gt;*************************************************************************
&lt;br&gt;&lt;br&gt;&lt;br&gt;&lt;br&gt;&lt;br&gt;&lt;br&gt;The log on the master shows :
&lt;br&gt;&lt;br&gt;2009-10-17 17:05:39,352 INFO org.apache.hadoop.hdfs.server.namenode.NameNode: STARTUP_MSG: 
&lt;br&gt;/************************************************************
&lt;br&gt;STARTUP_MSG: Starting NameNode
&lt;br&gt;STARTUP_MSG: &amp;nbsp; host = centos/192.168.1.125
&lt;br&gt;STARTUP_MSG: &amp;nbsp; args = []
&lt;br&gt;STARTUP_MSG: &amp;nbsp; version = 0.20.1
&lt;br&gt;STARTUP_MSG: &amp;nbsp; build = &lt;a href=&quot;http://svn.apache.org/repos/asf/hadoop/common/tags/release-0.20.1-rc1&quot; target=&quot;_top&quot; rel=&quot;nofollow&quot;&gt;http://svn.apache.org/repos/asf/hadoop/common/tags/release-0.20.1-rc1&lt;/a&gt;&amp;nbsp;-r 810220; compiled by 'oom' on Tue Sep &amp;nbsp;1 20:55:56 UTC 
&lt;br&gt;&lt;br&gt;2009
&lt;br&gt;************************************************************/
&lt;br&gt;2009-10-17 17:05:39,615 INFO org.apache.hadoop.ipc.metrics.RpcMetrics: Initializing RPC Metrics with hostName=NameNode, port=54310
&lt;br&gt;2009-10-17 17:05:39,620 INFO org.apache.hadoop.hdfs.server.namenode.NameNode: Namenode up at: centos/192.168.1.125:54310
&lt;br&gt;2009-10-17 17:05:39,622 INFO org.apache.hadoop.metrics.jvm.JvmMetrics: Initializing JVM Metrics with processName=NameNode, sessionId=null
&lt;br&gt;2009-10-17 17:05:39,624 INFO org.apache.hadoop.hdfs.server.namenode.metrics.NameNodeMetrics: Initializing NameNodeMeterics using context 
&lt;br&gt;&lt;br&gt;object:org.apache.hadoop.metrics.spi.NullContext
&lt;br&gt;2009-10-17 17:05:39,779 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: fsOwner=joe,joe
&lt;br&gt;2009-10-17 17:05:39,780 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: supergroup=supergroup
&lt;br&gt;2009-10-17 17:05:39,781 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: isPermissionEnabled=true
&lt;br&gt;2009-10-17 17:05:39,877 INFO org.apache.hadoop.hdfs.server.namenode.metrics.FSNamesystemMetrics: Initializing FSNamesystemMetrics using context 
&lt;br&gt;&lt;br&gt;object:org.apache.hadoop.metrics.spi.NullContext
&lt;br&gt;2009-10-17 17:05:39,883 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Registered FSNamesystemStatusMBean
&lt;br&gt;2009-10-17 17:05:39,929 INFO org.apache.hadoop.hdfs.server.common.Storage: Number of files = 1
&lt;br&gt;2009-10-17 17:05:39,933 INFO org.apache.hadoop.hdfs.server.common.Storage: Number of files under construction = 0
&lt;br&gt;2009-10-17 17:05:39,933 INFO org.apache.hadoop.hdfs.server.common.Storage: Image file of size 97 loaded in 0 seconds.
&lt;br&gt;2009-10-17 17:05:39,934 INFO org.apache.hadoop.hdfs.server.common.Storage: Edits file /tmp/hadoop-joe/dfs/name/current/edits of size 4 edits # 0 
&lt;br&gt;&lt;br&gt;loaded in 0 seconds.
&lt;br&gt;2009-10-17 17:05:39,972 INFO org.apache.hadoop.hdfs.server.common.Storage: Image file of size 97 saved in 0 seconds.
&lt;br&gt;2009-10-17 17:05:40,130 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Finished loading FSImage in 469 msecs
&lt;br&gt;2009-10-17 17:05:40,131 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Total number of blocks = 0
&lt;br&gt;2009-10-17 17:05:40,131 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Number of invalid blocks = 0
&lt;br&gt;2009-10-17 17:05:40,131 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Number of under-replicated blocks = 0
&lt;br&gt;2009-10-17 17:05:40,132 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Number of &amp;nbsp;over-replicated blocks = 0
&lt;br&gt;2009-10-17 17:05:40,132 INFO org.apache.hadoop.hdfs.StateChange: STATE* Leaving safe mode after 0 secs.
&lt;br&gt;2009-10-17 17:05:40,132 INFO org.apache.hadoop.hdfs.StateChange: STATE* Network topology has 0 racks and 0 datanodes
&lt;br&gt;2009-10-17 17:05:40,132 INFO org.apache.hadoop.hdfs.StateChange: STATE* UnderReplicatedBlocks has 0 blocks
&lt;br&gt;2009-10-17 17:05:40,320 INFO org.mortbay.log: Logging to org.slf4j.impl.Log4jLoggerAdapter(org.mortbay.log) via org.mortbay.log.Slf4jLog
&lt;br&gt;2009-10-17 17:05:40,518 INFO org.apache.hadoop.http.HttpServer: Port returned by webServer.getConnectors()[0].getLocalPort() before open() is -1. 
&lt;br&gt;&lt;br&gt;Opening the listener on 50070
&lt;br&gt;2009-10-17 17:05:40,520 INFO org.apache.hadoop.http.HttpServer: listener.getLocalPort() returned 50070 webServer.getConnectors()[0].getLocalPort() 
&lt;br&gt;&lt;br&gt;returned 50070
&lt;br&gt;2009-10-17 17:05:40,520 INFO org.apache.hadoop.http.HttpServer: Jetty bound to port 50070
&lt;br&gt;2009-10-17 17:05:40,520 INFO org.mortbay.log: jetty-6.1.14
&lt;br&gt;2009-10-17 17:05:46,612 INFO org.mortbay.log: Started SelectChannelConnector@0.0.0.0:50070
&lt;br&gt;2009-10-17 17:05:46,612 INFO org.apache.hadoop.hdfs.server.namenode.NameNode: Web-server up at: 0.0.0.0:50070
&lt;br&gt;2009-10-17 17:05:46,613 INFO org.apache.hadoop.ipc.Server: IPC Server Responder: starting
&lt;br&gt;2009-10-17 17:05:46,615 INFO org.apache.hadoop.ipc.Server: IPC Server listener on 54310: starting
&lt;br&gt;2009-10-17 17:05:46,629 INFO org.apache.hadoop.ipc.Server: IPC Server handler 0 on 54310: starting
&lt;br&gt;2009-10-17 17:05:46,634 INFO org.apache.hadoop.ipc.Server: IPC Server handler 1 on 54310: starting
&lt;br&gt;2009-10-17 17:05:46,636 INFO org.apache.hadoop.ipc.Server: IPC Server handler 2 on 54310: starting
&lt;br&gt;2009-10-17 17:05:46,637 INFO org.apache.hadoop.ipc.Server: IPC Server handler 4 on 54310: starting
&lt;br&gt;2009-10-17 17:05:46,641 INFO org.apache.hadoop.ipc.Server: IPC Server handler 6 on 54310: starting
&lt;br&gt;2009-10-17 17:05:46,642 INFO org.apache.hadoop.ipc.Server: IPC Server handler 7 on 54310: starting
&lt;br&gt;2009-10-17 17:05:46,643 INFO org.apache.hadoop.ipc.Server: IPC Server handler 8 on 54310: starting
&lt;br&gt;2009-10-17 17:05:46,643 INFO org.apache.hadoop.ipc.Server: IPC Server handler 3 on 54310: starting
&lt;br&gt;2009-10-17 17:05:46,900 INFO org.apache.hadoop.ipc.Server: IPC Server handler 5 on 54310: starting
&lt;br&gt;2009-10-17 17:05:46,916 INFO org.apache.hadoop.ipc.Server: IPC Server handler 9 on 54310: starting
&lt;br&gt;2009-10-17 17:11:10,967 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Roll Edit Log from 192.168.1.125
&lt;br&gt;2009-10-17 17:11:10,968 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Number of transactions: 0 Total time for transactions(ms): 0Number of 
&lt;br&gt;&lt;br&gt;transactions batched in Syncs: 0 Number of syncs: 0 SyncTimes(ms): 0 
&lt;br&gt;2009-10-17 17:11:11,518 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Roll FSImage from 192.168.1.125
&lt;br&gt;2009-10-17 17:11:11,519 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Number of transactions: 0 Total time for transactions(ms): 0Number of 
&lt;br&gt;&lt;br&gt;transactions batched in Syncs: 0 Number of syncs: 1 SyncTimes(ms): 2 
&lt;br&gt;&lt;br&gt;&lt;br&gt;&amp;nbsp; &amp;nbsp;
&lt;/div&gt;
&lt;/div&gt;&lt;/blockquote&gt;
&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/Datanode-Throwing-NoRouteToHostException-tp25946798p25986468.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-25973811</id>
	<title>Re: editing etc hosts files of a cluster</title>
	<published>2009-10-20T04:58:33Z</published>
	<updated>2009-10-20T04:58:33Z</updated>
	<author>
		<name>Ramesh.Ramasamy</name>
	</author>
	<content type="html">This following script does the work, but testing against /etc/hosts file instead of /home/hadoop/hosts/hosts fails, I'm getting permission denied error..
&lt;br&gt;&lt;br&gt;hadoop@master:sed$ cat applyhadoopcluster.sh 
&lt;br&gt;#!/bin/bash
&lt;br&gt;for i in `cat clusternodes`
&lt;br&gt;do
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; # Remove the exiting entiries of Hadoop cluster, if any
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ssh hadoop@$i &amp;quot;sed '/#HADOOP_HOST_ENTRIES_STARTING/,/#HADOOP_HOST_ENTRIES_ENDING/d' /home/hadoop/hosts/hosts -i&amp;quot;
&lt;br&gt;&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; # Insert the Hadoop cluster entries
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ssh hadoop@$i &amp;quot;sed '1i\#HADOOP_HOST_ENTRIES_STARTING' /home/hadoop/hosts/hosts -i&amp;quot;
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; count=2
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; while read j
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; do
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; #echo $j
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ssh -n hadoop@$i &amp;quot;sed '2i$j' /home/hadoop/hosts/hosts -i&amp;quot;
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; count=`expr $count + 1`
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; #echo $count
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; done &amp;lt; clusterhostnames
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ssh hadoop@$i &amp;quot;sed ${count}i\#HADOOP_HOST_ENTRIES_ENDING /home/hadoop/hosts/hosts -i&amp;quot;
&lt;br&gt;done
&lt;br&gt;&lt;br&gt;hadoop@master:sed$ cat clusterhostnames 
&lt;br&gt;172.25.149.37	 &amp;nbsp;master
&lt;br&gt;172.25.149.38	 &amp;nbsp;slave2
&lt;br&gt;172.25.149.66 &amp;nbsp; slave1
&lt;br&gt;&lt;br&gt;hadoop@master:sed$ cat clusternodes 
&lt;br&gt;master
&lt;br&gt;slave2
&lt;br&gt;slave1
&lt;br&gt;&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/editing-etc-hosts-files-of-a-cluster-tp25958579p25973811.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-25958579</id>
	<title>editing etc hosts files of a cluster</title>
	<published>2009-10-19T06:40:18Z</published>
	<updated>2009-10-19T06:40:18Z</updated>
	<author>
		<name>Ramesh.Ramasamy</name>
	</author>
	<content type="html">Hi,
&lt;br&gt;&lt;br&gt;I have a cluster setup with 3 nodes, and I'm adding hostname details (in /etc/hosts) manually in each node. Seems it is not an effective approach. How this scenario is handled in big clusters?
&lt;br&gt;&lt;br&gt;Is there any simple of way to add the hostname details in all the nodes by editing a single entry/file/script? 
&lt;br&gt;&lt;br&gt;Thanks and Regards,
&lt;br&gt;Ramesh
&lt;br&gt;&lt;br&gt;&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/editing-etc-hosts-files-of-a-cluster-tp25958579p25958579.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-25951789</id>
	<title>Seattle / NW Hadoop, Lucene, Apache &quot;Cloud Stack&quot; Meetup, Wed Oct 28  6:45pm</title>
	<published>2009-10-18T17:10:49Z</published>
	<updated>2009-10-18T17:10:49Z</updated>
	<author>
		<name>BrickMcLargeHuge</name>
	</author>
	<content type="html">Greetings,
&lt;br&gt;&lt;br&gt;(You're receiving this e-mail because you're on a DL or I think you'd
&lt;br&gt;be interested)
&lt;br&gt;&lt;br&gt;It's time for another Hadoop/Lucene/Apache &amp;quot;Cloud&amp;quot; stack meetup! This
&lt;br&gt;month it'll be on Wednesday, the 28th, at 6:45 pm.
&lt;br&gt;&lt;br&gt;A *huge* thanks for everyone who showed up last month, and to Facebook
&lt;br&gt;for sending someone awesome to speak about Hive. We learned quite a
&lt;br&gt;bit!
&lt;br&gt;&lt;br&gt;For October, we will have someone speaking about Cascading, and how it
&lt;br&gt;helps workflow abstraction with MapReduce. Very useful stuff to know.
&lt;br&gt;&lt;br&gt;We've had great attendance in the past few months, let's keep it up!
&lt;br&gt;I'm always amazed by the things I learn from everyone.
&lt;br&gt;&lt;br&gt;We're at the University of Washington, Allen Computer Science Center
&lt;br&gt;(not Electrical Engineering)
&lt;br&gt;&lt;br&gt;Map: &lt;a href=&quot;http://www.washington.edu/home/maps/?CSE&quot; target=&quot;_top&quot; rel=&quot;nofollow&quot;&gt;http://www.washington.edu/home/maps/?CSE&lt;/a&gt;&lt;br&gt;&lt;br&gt;Room: 303 -or- the Entry level. If there are changes, signs will be posted.
&lt;br&gt;&lt;br&gt;More Info:
&lt;br&gt;&lt;br&gt;The meetup is about 2 hours (and there's usually food): we'll have two
&lt;br&gt;in-depth talks, and then several &amp;quot;lightning talks&amp;quot; of 5 minutes. We'll
&lt;br&gt;then have discussion and 'social time'. Let me know if you're
&lt;br&gt;interested in speaking or attending. We'd like to focus on education,
&lt;br&gt;so feel free to ask questions.
&lt;br&gt;&lt;br&gt;Contact: Bradford Stephens, 904-415-3009, &lt;a href=&quot;http://old.nabble.com/user/SendEmail.jtp?type=post&amp;post=25951789&amp;i=0&quot; target=&quot;_top&quot; rel=&quot;nofollow&quot;&gt;bradfordstephens@...&lt;/a&gt;
&lt;br&gt;&lt;br&gt;-- 
&lt;br&gt;&lt;a href=&quot;http://www.drawntoscaleconsulting.com&quot; target=&quot;_top&quot; rel=&quot;nofollow&quot;&gt;http://www.drawntoscaleconsulting.com&lt;/a&gt;&amp;nbsp;- Scalability, Hadoop, HBase,
&lt;br&gt;and Distributed Lucene Consulting
&lt;br&gt;&lt;br&gt;&lt;a href=&quot;http://www.roadtofailure.com&quot; target=&quot;_top&quot; rel=&quot;nofollow&quot;&gt;http://www.roadtofailure.com&lt;/a&gt;&amp;nbsp;-- The Fringes of Scalability, Social
&lt;br&gt;Media, and Computer Science
&lt;br&gt;&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/Seattle---NW-Hadoop%2C-Lucene%2C-Apache-%22Cloud-Stack%22-Meetup%2C-Wed-Oct-28--6%3A45pm-tp25951789p25951789.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-25947843</id>
	<title>HTTP secure access</title>
	<published>2009-10-18T08:50:09Z</published>
	<updated>2009-10-18T08:50:09Z</updated>
	<author>
		<name>BM-13</name>
	</author>
	<content type="html">Hi!
&lt;br&gt;Well, I have a kinda simple question, but I can not spot a proper doc
&lt;br&gt;for it: how you, guys, restricting access to the web interfaces? :-)
&lt;br&gt;&lt;br&gt;It is somewhere in jetty or there is no feature like this? I am OK
&lt;br&gt;with a simple basic authentication, but I don't really like when
&lt;br&gt;others are staring at logs that are easily accessible through the web
&lt;br&gt;within the same network...
&lt;br&gt;&lt;br&gt;-- 
&lt;br&gt;Kind regards, BM
&lt;br&gt;&lt;br&gt;Things, that are stupid at the beginning, rarely ends up wisely.
&lt;br&gt;&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/HTTP-secure-access-tp25947843p25947843.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-25946798</id>
	<title>Datanode Throwing NoRouteToHostException</title>
	<published>2009-10-18T06:44:24Z</published>
	<updated>2009-10-18T06:44:24Z</updated>
	<author>
		<name>lazikid</name>
	</author>
	<content type="html">I need some help with setting up a Hadoop cluster. The datanode on the slave is not coming up throwing java.net.NoRouteToHostException: No route to host. Please see the details below.
&lt;br&gt;&lt;br&gt;I have a centos master and a fedora slave. Both have Java 6 and Hadoop 0.20.1. I instaleed hadoop under /opt in both machines.
&lt;br&gt;&lt;br&gt;The machines can ping one another using hostname and ip. They can also make password-less ssh connection to one another using both hostname and ip.
&lt;br&gt;&lt;br&gt;This is what I have in the /etc/hosts file for the master :
&lt;br&gt;&lt;br&gt;************************************************
&lt;br&gt;127.0.0.1 &amp;nbsp; &amp;nbsp; &amp;nbsp; localhost.localdomain localhost
&lt;br&gt;192.168.1.125 &amp;nbsp; centos
&lt;br&gt;192.168.1.102 &amp;nbsp; fedora
&lt;br&gt;::1 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; localhost6.localdomain6 localhost6
&lt;br&gt;************************************************
&lt;br&gt;&lt;br&gt;This is what I have in the /etc/hosts for the slave
&lt;br&gt;&lt;br&gt;************************************************
&lt;br&gt;127.0.0.1	localhost.localdomain localhost
&lt;br&gt;192.168.1.104	ubuntu64
&lt;br&gt;192.168.1.102	fedora
&lt;br&gt;192.168.1.125	centos
&lt;br&gt;::1 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; localhost localhost.localdomain localhost6 localhost6.localdomain6
&lt;br&gt;************************************************
&lt;br&gt;&lt;br&gt;&lt;br&gt;&lt;br&gt;These are my configuration files :
&lt;br&gt;&lt;br&gt;1. core-site.xml 
&lt;br&gt;&lt;br&gt;&amp;lt;configuration&amp;gt;
&lt;br&gt;&amp;lt;property&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;name&amp;gt;fs.default.name&amp;lt;/name&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;value&amp;gt;hdfs://centos:54310&amp;lt;/value&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;description&amp;gt;determine the host, port, etc. for a filesystem.&amp;lt;/description&amp;gt;
&lt;br&gt;&amp;lt;/property&amp;gt;
&lt;br&gt;&amp;lt;/configuration&amp;gt;
&lt;br&gt;&lt;br&gt;2. mapred-site.xml
&lt;br&gt;&lt;br&gt;&amp;lt;configuration&amp;gt;
&lt;br&gt;&amp;lt;property&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;name&amp;gt;mapred.job.tracker&amp;lt;/name&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;value&amp;gt;hdfs://centos:54311&amp;lt;/value&amp;gt;
&lt;br&gt;&amp;nbsp; &amp;lt;description&amp;gt;....&amp;lt;/description&amp;gt;
&lt;br&gt;&amp;lt;/property&amp;gt;
&lt;br&gt;&amp;lt;/configuration&amp;gt;
&lt;br&gt;&lt;br&gt;3. masters
&lt;br&gt;&amp;nbsp; 
&lt;br&gt;&amp;nbsp; &amp;nbsp;centos
&lt;br&gt;&lt;br&gt;4. slaves
&lt;br&gt;&lt;br&gt;&amp;nbsp; &amp;nbsp;fedora
&lt;br&gt;&lt;br&gt;&lt;br&gt;I issued bin/start-dfs.sh from the master(centos) machine and this is what I see :
&lt;br&gt;&lt;br&gt;starting namenode, logging to xxxxxxxxxxx
&lt;br&gt;fedora: starting datanode, logging to xxxxxxx
&lt;br&gt;centos: starting secondarynamenode, logging to xxxxxx
&lt;br&gt;&lt;br&gt;When I checked the slave(fedora) logs, this what I see :
&lt;br&gt;&lt;br&gt;*************************************************************************
&lt;br&gt;STARTUP_MSG: Starting DataNode
&lt;br&gt;STARTUP_MSG: &amp;nbsp; host = fedora/192.168.1.102
&lt;br&gt;STARTUP_MSG: &amp;nbsp; args = []
&lt;br&gt;STARTUP_MSG: &amp;nbsp; version = 0.20.1
&lt;br&gt;STARTUP_MSG: &amp;nbsp; build = &lt;a href=&quot;http://svn.apache.org/repos/asf/hadoop/common/tags/release-0.20.1-rc1&quot; target=&quot;_top&quot; rel=&quot;nofollow&quot;&gt;http://svn.apache.org/repos/asf/hadoop/common/tags/release-0.20.1-rc1&lt;/a&gt;&amp;nbsp;-r 810220; compiled by 'oom' on Tue Sep &amp;nbsp;1 20:55:56 UTC 
&lt;br&gt;&lt;br&gt;2009
&lt;br&gt;************************************************************/
&lt;br&gt;2009-10-17 17:05:43,385 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: centos/192.168.1.125:54310. Already tried 0 time(s).
&lt;br&gt;2009-10-17 17:05:44,387 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: centos/192.168.1.125:54310. Already tried 1 time(s).
&lt;br&gt;2009-10-17 17:05:45,389 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: centos/192.168.1.125:54310. Already tried 2 time(s).
&lt;br&gt;2009-10-17 17:05:46,390 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: centos/192.168.1.125:54310. Already tried 3 time(s).
&lt;br&gt;2009-10-17 17:05:47,388 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: centos/192.168.1.125:54310. Already tried 4 time(s).
&lt;br&gt;2009-10-17 17:05:48,390 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: centos/192.168.1.125:54310. Already tried 5 time(s).
&lt;br&gt;2009-10-17 17:05:49,393 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: centos/192.168.1.125:54310. Already tried 6 time(s).
&lt;br&gt;2009-10-17 17:05:50,394 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: centos/192.168.1.125:54310. Already tried 7 time(s).
&lt;br&gt;2009-10-17 17:05:51,395 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: centos/192.168.1.125:54310. Already tried 8 time(s).
&lt;br&gt;2009-10-17 17:05:52,398 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: centos/192.168.1.125:54310. Already tried 9 time(s).
&lt;br&gt;2009-10-17 17:05:52,404 ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: java.io.IOException: Call to centos/192.168.1.125:54310 failed on local 
&lt;br&gt;&lt;br&gt;exception: java.net.NoRouteToHostException: No route to host
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Client.wrapException(Client.java:774)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Client.call(Client.java:742)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:220)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at $Proxy4.getProtocolVersion(Unknown Source)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:359)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:346)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:383)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.RPC.waitForProxy(RPC.java:314)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.RPC.waitForProxy(RPC.java:291)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.datanode.DataNode.startDataNode(DataNode.java:269)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.datanode.DataNode.&amp;lt;init&amp;gt;(DataNode.java:216)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.datanode.DataNode.makeInstance(DataNode.java:1283)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.datanode.DataNode.instantiateDataNode(DataNode.java:1238)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.datanode.DataNode.createDataNode(DataNode.java:1246)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.datanode.DataNode.main(DataNode.java:1368)
&lt;br&gt;Caused by: java.net.NoRouteToHostException: No route to host
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:574)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:206)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:404)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:304)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Client$Connection.access$1700(Client.java:176)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Client.getConnection(Client.java:859)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Client.call(Client.java:719)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ... 13 more
&lt;br&gt;&lt;br&gt;2009-10-17 17:05:52,407 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: SHUTDOWN_MSG: 
&lt;br&gt;/************************************************************
&lt;br&gt;SHUTDOWN_MSG: Shutting down DataNode at fedora/192.168.1.102
&lt;br&gt;************************************************************/
&lt;br&gt;&lt;br&gt;*************************************************************************
&lt;br&gt;&lt;br&gt;&lt;br&gt;&lt;br&gt;&lt;br&gt;&lt;br&gt;The log on the master shows :
&lt;br&gt;&lt;br&gt;2009-10-17 17:05:39,352 INFO org.apache.hadoop.hdfs.server.namenode.NameNode: STARTUP_MSG: 
&lt;br&gt;/************************************************************
&lt;br&gt;STARTUP_MSG: Starting NameNode
&lt;br&gt;STARTUP_MSG: &amp;nbsp; host = centos/192.168.1.125
&lt;br&gt;STARTUP_MSG: &amp;nbsp; args = []
&lt;br&gt;STARTUP_MSG: &amp;nbsp; version = 0.20.1
&lt;br&gt;STARTUP_MSG: &amp;nbsp; build = &lt;a href=&quot;http://svn.apache.org/repos/asf/hadoop/common/tags/release-0.20.1-rc1&quot; target=&quot;_top&quot; rel=&quot;nofollow&quot;&gt;http://svn.apache.org/repos/asf/hadoop/common/tags/release-0.20.1-rc1&lt;/a&gt;&amp;nbsp;-r 810220; compiled by 'oom' on Tue Sep &amp;nbsp;1 20:55:56 UTC 
&lt;br&gt;&lt;br&gt;2009
&lt;br&gt;************************************************************/
&lt;br&gt;2009-10-17 17:05:39,615 INFO org.apache.hadoop.ipc.metrics.RpcMetrics: Initializing RPC Metrics with hostName=NameNode, port=54310
&lt;br&gt;2009-10-17 17:05:39,620 INFO org.apache.hadoop.hdfs.server.namenode.NameNode: Namenode up at: centos/192.168.1.125:54310
&lt;br&gt;2009-10-17 17:05:39,622 INFO org.apache.hadoop.metrics.jvm.JvmMetrics: Initializing JVM Metrics with processName=NameNode, sessionId=null
&lt;br&gt;2009-10-17 17:05:39,624 INFO org.apache.hadoop.hdfs.server.namenode.metrics.NameNodeMetrics: Initializing NameNodeMeterics using context 
&lt;br&gt;&lt;br&gt;object:org.apache.hadoop.metrics.spi.NullContext
&lt;br&gt;2009-10-17 17:05:39,779 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: fsOwner=joe,joe
&lt;br&gt;2009-10-17 17:05:39,780 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: supergroup=supergroup
&lt;br&gt;2009-10-17 17:05:39,781 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: isPermissionEnabled=true
&lt;br&gt;2009-10-17 17:05:39,877 INFO org.apache.hadoop.hdfs.server.namenode.metrics.FSNamesystemMetrics: Initializing FSNamesystemMetrics using context 
&lt;br&gt;&lt;br&gt;object:org.apache.hadoop.metrics.spi.NullContext
&lt;br&gt;2009-10-17 17:05:39,883 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Registered FSNamesystemStatusMBean
&lt;br&gt;2009-10-17 17:05:39,929 INFO org.apache.hadoop.hdfs.server.common.Storage: Number of files = 1
&lt;br&gt;2009-10-17 17:05:39,933 INFO org.apache.hadoop.hdfs.server.common.Storage: Number of files under construction = 0
&lt;br&gt;2009-10-17 17:05:39,933 INFO org.apache.hadoop.hdfs.server.common.Storage: Image file of size 97 loaded in 0 seconds.
&lt;br&gt;2009-10-17 17:05:39,934 INFO org.apache.hadoop.hdfs.server.common.Storage: Edits file /tmp/hadoop-joe/dfs/name/current/edits of size 4 edits # 0 
&lt;br&gt;&lt;br&gt;loaded in 0 seconds.
&lt;br&gt;2009-10-17 17:05:39,972 INFO org.apache.hadoop.hdfs.server.common.Storage: Image file of size 97 saved in 0 seconds.
&lt;br&gt;2009-10-17 17:05:40,130 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Finished loading FSImage in 469 msecs
&lt;br&gt;2009-10-17 17:05:40,131 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Total number of blocks = 0
&lt;br&gt;2009-10-17 17:05:40,131 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Number of invalid blocks = 0
&lt;br&gt;2009-10-17 17:05:40,131 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Number of under-replicated blocks = 0
&lt;br&gt;2009-10-17 17:05:40,132 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Number of &amp;nbsp;over-replicated blocks = 0
&lt;br&gt;2009-10-17 17:05:40,132 INFO org.apache.hadoop.hdfs.StateChange: STATE* Leaving safe mode after 0 secs.
&lt;br&gt;2009-10-17 17:05:40,132 INFO org.apache.hadoop.hdfs.StateChange: STATE* Network topology has 0 racks and 0 datanodes
&lt;br&gt;2009-10-17 17:05:40,132 INFO org.apache.hadoop.hdfs.StateChange: STATE* UnderReplicatedBlocks has 0 blocks
&lt;br&gt;2009-10-17 17:05:40,320 INFO org.mortbay.log: Logging to org.slf4j.impl.Log4jLoggerAdapter(org.mortbay.log) via org.mortbay.log.Slf4jLog
&lt;br&gt;2009-10-17 17:05:40,518 INFO org.apache.hadoop.http.HttpServer: Port returned by webServer.getConnectors()[0].getLocalPort() before open() is -1. 
&lt;br&gt;&lt;br&gt;Opening the listener on 50070
&lt;br&gt;2009-10-17 17:05:40,520 INFO org.apache.hadoop.http.HttpServer: listener.getLocalPort() returned 50070 webServer.getConnectors()[0].getLocalPort() 
&lt;br&gt;&lt;br&gt;returned 50070
&lt;br&gt;2009-10-17 17:05:40,520 INFO org.apache.hadoop.http.HttpServer: Jetty bound to port 50070
&lt;br&gt;2009-10-17 17:05:40,520 INFO org.mortbay.log: jetty-6.1.14
&lt;br&gt;2009-10-17 17:05:46,612 INFO org.mortbay.log: Started SelectChannelConnector@0.0.0.0:50070
&lt;br&gt;2009-10-17 17:05:46,612 INFO org.apache.hadoop.hdfs.server.namenode.NameNode: Web-server up at: 0.0.0.0:50070
&lt;br&gt;2009-10-17 17:05:46,613 INFO org.apache.hadoop.ipc.Server: IPC Server Responder: starting
&lt;br&gt;2009-10-17 17:05:46,615 INFO org.apache.hadoop.ipc.Server: IPC Server listener on 54310: starting
&lt;br&gt;2009-10-17 17:05:46,629 INFO org.apache.hadoop.ipc.Server: IPC Server handler 0 on 54310: starting
&lt;br&gt;2009-10-17 17:05:46,634 INFO org.apache.hadoop.ipc.Server: IPC Server handler 1 on 54310: starting
&lt;br&gt;2009-10-17 17:05:46,636 INFO org.apache.hadoop.ipc.Server: IPC Server handler 2 on 54310: starting
&lt;br&gt;2009-10-17 17:05:46,637 INFO org.apache.hadoop.ipc.Server: IPC Server handler 4 on 54310: starting
&lt;br&gt;2009-10-17 17:05:46,641 INFO org.apache.hadoop.ipc.Server: IPC Server handler 6 on 54310: starting
&lt;br&gt;2009-10-17 17:05:46,642 INFO org.apache.hadoop.ipc.Server: IPC Server handler 7 on 54310: starting
&lt;br&gt;2009-10-17 17:05:46,643 INFO org.apache.hadoop.ipc.Server: IPC Server handler 8 on 54310: starting
&lt;br&gt;2009-10-17 17:05:46,643 INFO org.apache.hadoop.ipc.Server: IPC Server handler 3 on 54310: starting
&lt;br&gt;2009-10-17 17:05:46,900 INFO org.apache.hadoop.ipc.Server: IPC Server handler 5 on 54310: starting
&lt;br&gt;2009-10-17 17:05:46,916 INFO org.apache.hadoop.ipc.Server: IPC Server handler 9 on 54310: starting
&lt;br&gt;2009-10-17 17:11:10,967 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Roll Edit Log from 192.168.1.125
&lt;br&gt;2009-10-17 17:11:10,968 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Number of transactions: 0 Total time for transactions(ms): 0Number of 
&lt;br&gt;&lt;br&gt;transactions batched in Syncs: 0 Number of syncs: 0 SyncTimes(ms): 0 
&lt;br&gt;2009-10-17 17:11:11,518 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Roll FSImage from 192.168.1.125
&lt;br&gt;2009-10-17 17:11:11,519 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Number of transactions: 0 Total time for transactions(ms): 0Number of 
&lt;br&gt;&lt;br&gt;transactions batched in Syncs: 0 Number of syncs: 1 SyncTimes(ms): 2 
&lt;br&gt;&lt;br&gt;&lt;br&gt;&amp;nbsp; &amp;nbsp;&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/Datanode-Throwing-NoRouteToHostException-tp25946798p25946798.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-25925504</id>
	<title>Re: fuse:The ratio of reported blocks 0.0000 has not reached the threshold 0.9990</title>
	<published>2009-10-16T06:31:18Z</published>
	<updated>2009-10-16T06:31:18Z</updated>
	<author>
		<name>Brian Bockelman</name>
	</author>
	<content type="html">Hey,
&lt;br&gt;&lt;br&gt;I thought this sort of basic safemode operation is covered well in the &amp;nbsp;
&lt;br&gt;manual. &amp;nbsp;Do you have any questions or suggestions for the documentation?
&lt;br&gt;&lt;br&gt;Brian
&lt;br&gt;&lt;br&gt;On Oct 16, 2009, at 3:12 AM, yibo820217 wrote:
&lt;br&gt;&lt;div class='shrinkable-quote'&gt;&lt;div class='shrinkable-quote'&gt;&lt;br&gt;&amp;gt;
&lt;br&gt;&amp;gt;
&lt;br&gt;&amp;gt; It seems lke the normal procedure to restart the hdfs.
&lt;br&gt;&amp;gt;
&lt;br&gt;&amp;gt;
&lt;br&gt;&amp;gt;
&lt;br&gt;&amp;gt;
&lt;br&gt;&amp;gt; yibo820217 wrote:
&lt;br&gt;&amp;gt;&amp;gt;
&lt;br&gt;&amp;gt;&amp;gt; hi,recently,i got some problem.
&lt;br&gt;&amp;gt;&amp;gt; at first,I start the hadoop
&lt;br&gt;&amp;gt;&amp;gt; #bin/start-all.sh
&lt;br&gt;&amp;gt;&amp;gt; then I look the logs
&lt;br&gt;&amp;gt;&amp;gt; #cat hadoop-root-namenode-cent52ip32.log
&lt;br&gt;&amp;gt;&amp;gt;
&lt;br&gt;&amp;gt;&amp;gt; 2009-10-13 22:12:29,733 INFO
&lt;br&gt;&amp;gt;&amp;gt; org.apache.hadoop.hdfs.server.namenode.FSNamesystem.audit:
&lt;br&gt;&amp;gt;&amp;gt;
&lt;br&gt;&amp;gt;&amp;gt; ugi=root,root,bin,daemon,sys,adm,disk,wheel &amp;nbsp; &amp;nbsp;ip=/127.0.0.1
&lt;br&gt;&amp;gt;&amp;gt; cmd=listStatus &amp;nbsp;src=/tmp/hadoop-root/mapred/system
&lt;br&gt;&amp;gt;&amp;gt;
&lt;br&gt;&amp;gt;&amp;gt; &amp;nbsp;dst=null &amp;nbsp; &amp;nbsp; &amp;nbsp;perm=null
&lt;br&gt;&amp;gt;&amp;gt; 2009-10-13 22:12:29,736 INFO org.apache.hadoop.ipc.Server: IPC Server
&lt;br&gt;&amp;gt;&amp;gt; handler 5 on 9000, call delete(/tmp/hadoop-
&lt;br&gt;&amp;gt;&amp;gt;
&lt;br&gt;&amp;gt;&amp;gt; root/mapred/system, true) from 127.0.0.1:37371: error:
&lt;br&gt;&amp;gt;&amp;gt; org.apache.hadoop.hdfs.server.namenode.SafeModeException:
&lt;br&gt;&amp;gt;&amp;gt;
&lt;br&gt;&amp;gt;&amp;gt; Cannot delete /tmp/hadoop-root/mapred/system. Name node is in safe &amp;nbsp;
&lt;br&gt;&amp;gt;&amp;gt; mode.
&lt;br&gt;&amp;gt;&amp;gt; The ratio of reported blocks 0.0000 has not reached the threshold &amp;nbsp;
&lt;br&gt;&amp;gt;&amp;gt; 0.9990.
&lt;br&gt;&amp;gt;&amp;gt; Safe mode will be turned off
&lt;br&gt;&amp;gt;&amp;gt; automatically.
&lt;br&gt;&amp;gt;&amp;gt; org.apache.hadoop.hdfs.server.namenode.SafeModeException: Cannot &amp;nbsp;
&lt;br&gt;&amp;gt;&amp;gt; delete
&lt;br&gt;&amp;gt;&amp;gt; /tmp/hadoop-root/mapred/system. Name node is
&lt;br&gt;&amp;gt;&amp;gt;
&lt;br&gt;&amp;gt;&amp;gt; in safe mode.
&lt;br&gt;&amp;gt;&amp;gt; The ratio of reported blocks 0.0000 has not reached the threshold &amp;nbsp;
&lt;br&gt;&amp;gt;&amp;gt; 0.9990.
&lt;br&gt;&amp;gt;&amp;gt; Safe mode will be turned off
&lt;br&gt;&amp;gt;&amp;gt;
&lt;br&gt;&amp;gt;&amp;gt; automatically.
&lt;br&gt;&amp;gt;&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;at
&lt;br&gt;&amp;gt;&amp;gt; org.apache.hadoop.hdfs.server.namenode.FSNamesystem.deleteInternal 
&lt;br&gt;&amp;gt;&amp;gt; (FSNamesystem.java:1696)
&lt;br&gt;&amp;gt;&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;at
&lt;br&gt;&amp;gt;&amp;gt; org.apache.hadoop.hdfs.server.namenode.FSNamesystem.delete 
&lt;br&gt;&amp;gt;&amp;gt; (FSNamesystem.java:1676)
&lt;br&gt;&amp;gt;&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;at
&lt;br&gt;&amp;gt;&amp;gt; org.apache.hadoop.hdfs.server.namenode.NameNode.delete 
&lt;br&gt;&amp;gt;&amp;gt; (NameNode.java:517)
&lt;br&gt;&amp;gt;&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
&lt;br&gt;&amp;gt;&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;at
&lt;br&gt;&amp;gt;&amp;gt; sun.reflect.NativeMethodAccessorImpl.invoke 
&lt;br&gt;&amp;gt;&amp;gt; (NativeMethodAccessorImpl.java:39)
&lt;br&gt;&amp;gt;&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;at
&lt;br&gt;&amp;gt;&amp;gt; sun.reflect.DelegatingMethodAccessorImpl.invoke 
&lt;br&gt;&amp;gt;&amp;gt; (DelegatingMethodAccessorImpl.java:25)
&lt;br&gt;&amp;gt;&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;at java.lang.reflect.Method.invoke(Method.java:597)
&lt;br&gt;&amp;gt;&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:508)
&lt;br&gt;&amp;gt;&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:959)
&lt;br&gt;&amp;gt;&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:955)
&lt;br&gt;&amp;gt;&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;at java.security.AccessController.doPrivileged(Native Method)
&lt;br&gt;&amp;gt;&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;at javax.security.auth.Subject.doAs(Subject.java:396)
&lt;br&gt;&amp;gt;&amp;gt; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;at org.apache.hadoop.ipc.Server$Handler.run(Server.java:953)
&lt;br&gt;&amp;gt;&amp;gt;
&lt;br&gt;&amp;gt;&amp;gt; the same message have appear 10 times,the the safemode is off
&lt;br&gt;&amp;gt;&amp;gt; [root@cent52ip32 hadoop]# bin/hadoop dfsadmin -safemode get
&lt;br&gt;&amp;gt;&amp;gt; Safe mode is OFF
&lt;br&gt;&amp;gt;&amp;gt;
&lt;br&gt;&amp;gt;&amp;gt; I don't know the reason,please help me.
&lt;br&gt;&amp;gt;&amp;gt; thanks very much.
&lt;br&gt;&amp;gt;&amp;gt;
&lt;br&gt;&amp;gt;
&lt;br&gt;&amp;gt; -- 
&lt;br&gt;&amp;gt; View this message in context: &lt;a href=&quot;http://www.nabble.com/fuse%3AThe-ratio-of-reported-blocks-0.0000-has-not-reached-the-threshold-0.9990-tp25868687p25921487.html&quot; target=&quot;_top&quot;&gt;http://www.nabble.com/fuse%3AThe-ratio-of-reported-blocks-0.0000-has-not-reached-the-threshold-0.9990-tp25868687p25921487.html&lt;/a&gt;&lt;br&gt;&amp;gt; Sent from the Hadoop core-user mailing list archive at Nabble.com.
&lt;/div&gt;&lt;/div&gt;&lt;br /&gt; &lt;div class=&quot;small&quot;&gt;&lt;br/&gt;&lt;img src=&quot;http://old.nabble.com/images/icon_attachment.gif&quot; &gt; &lt;strong&gt;smime.p7s&lt;/strong&gt; (3K) &lt;a href=&quot;http://old.nabble.com/attachment/25925504/0/smime.p7s&quot; target=&quot;_top&quot;&gt;Download Attachment&lt;/a&gt;&lt;/div&gt;&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/fuse%3AThe-ratio-of-reported-blocks-0.0000-has-not-reached-the-threshold-0.9990-tp25868687p25925504.html" />
</entry>

<entry>
	<id>tag:old.nabble.com,2006:post-25921487</id>
	<title>Re: fuse:The ratio of reported blocks 0.0000 has not reached the threshold 0.9990</title>
	<published>2009-10-16T01:12:00Z</published>
	<updated>2009-10-16T01:12:00Z</updated>
	<author>
		<name>yibo820217</name>
	</author>
	<content type="html">&lt;br&gt;It seems lke the normal procedure to restart the hdfs.
&lt;br&gt;&lt;br&gt;&lt;br&gt;&lt;br&gt;&lt;blockquote class=&quot;quote light-black dark-border-color&quot;&gt;&lt;div class=&quot;quote light-border-color&quot;&gt;
&lt;div class=&quot;quote-author&quot; style=&quot;font-weight: bold;&quot;&gt;yibo820217 wrote:&lt;/div&gt;
&lt;div class=&quot;quote-message shrinkable-quote&quot;&gt;hi,recently,i got some problem.
&lt;br&gt;at first,I start the hadoop
&lt;br&gt;#bin/start-all.sh
&lt;br&gt;then I look the logs
&lt;br&gt;#cat hadoop-root-namenode-cent52ip32.log 
&lt;br&gt;&lt;br&gt;2009-10-13 22:12:29,733 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem.audit: 
&lt;br&gt;&lt;br&gt;ugi=root,root,bin,daemon,sys,adm,disk,wheel &amp;nbsp; &amp;nbsp;ip=/127.0.0.1 &amp;nbsp; cmd=listStatus &amp;nbsp;src=/tmp/hadoop-root/mapred/system &amp;nbsp; &amp;nbsp;
&lt;br&gt;&lt;br&gt;&amp;nbsp; dst=null &amp;nbsp; &amp;nbsp; &amp;nbsp;perm=null
&lt;br&gt;2009-10-13 22:12:29,736 INFO org.apache.hadoop.ipc.Server: IPC Server handler 5 on 9000, call delete(/tmp/hadoop-
&lt;br&gt;&lt;br&gt;root/mapred/system, true) from 127.0.0.1:37371: error: org.apache.hadoop.hdfs.server.namenode.SafeModeException: 
&lt;br&gt;&lt;br&gt;&lt;b&gt;&lt;i&gt;Cannot delete /tmp/hadoop-root/mapred/system. Name node is in safe mode.
&lt;br&gt;The ratio of reported blocks 0.0000 has not reached the threshold 0.9990. Safe mode will be turned off &lt;/i&gt;&lt;/b&gt;&lt;br&gt;automatically.
&lt;br&gt;org.apache.hadoop.hdfs.server.namenode.SafeModeException: Cannot delete /tmp/hadoop-root/mapred/system. Name node is 
&lt;br&gt;&lt;br&gt;in safe mode.
&lt;br&gt;The ratio of reported blocks 0.0000 has not reached the threshold 0.9990. Safe mode will be turned off 
&lt;br&gt;&lt;br&gt;automatically.
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.deleteInternal(FSNamesystem.java:1696)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.delete(FSNamesystem.java:1676)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.hdfs.server.namenode.NameNode.delete(NameNode.java:517)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at java.lang.reflect.Method.invoke(Method.java:597)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:508)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:959)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:955)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at java.security.AccessController.doPrivileged(Native Method)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at javax.security.auth.Subject.doAs(Subject.java:396)
&lt;br&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; at org.apache.hadoop.ipc.Server$Handler.run(Server.java:953)
&lt;br&gt;&lt;br&gt;the same message have appear 10 times,the the safemode is off
&lt;br&gt;[root@cent52ip32 hadoop]# bin/hadoop dfsadmin -safemode get
&lt;br&gt;Safe mode is OFF
&lt;br&gt;&lt;br&gt;I don't know the reason,please help me.
&lt;br&gt;thanks very much.
&lt;/div&gt;
&lt;/div&gt;&lt;/blockquote&gt;
&lt;p&gt;From forum: &lt;a href=&quot;http://old.nabble.com/Hadoop-core-user-f30590.html&quot; embed=&quot;fixTarget[30590]&quot; target=&quot;_top&quot; &gt;Hadoop core-user&lt;/a&gt;&lt;/p&gt;</content>
	<link rel="alternate" type="text/html" href="http://old.nabble.com/fuse%3AThe-ratio-of-reported-blocks-0.0000-has-not-reached-the-threshold-0.9990-tp25868687p25921487.html" />
</entry>

</feed>
