Headers and footers on Hadoop output results

View: New views
1 Messages — Rating Filter:   Alert me  

Headers and footers on Hadoop output results

by ncardoso :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

Hello.

I'm using Hadoop to process several XML files, each with several XML records, through a group of Linux servers. I am using an XMLInputFormat that I found here in Nabble, and I'm using the TextOutputFormat with an overrided write funcion, to output XML.

Yet, the XML needs its root tag and the ?xml line. Where is the best place to place two writing functions like header() and footer()? I've tried, but all I manage was to write in the local task, not in the synchronized "part-0000" file.