« Return to Thread: How to crawl URLs getting from RSSParser

How to crawl URLs getting from RSSParser

by Saurabh Suman :: Rate this Message:

Reply to Author | View in Thread

Hi Nutch guys
I used org.apache.nutch.parse.rss.RSSParser , for parsing RSS feeds. It is showing urls on console.Now i want to crawl those urls.
              How will i do this? Does RSSPrser class store it in crawldb or i  need to send to all URLs to crawldb.Then run the crawl command.
   Is there another approach?

 « Return to Thread: How to crawl URLs getting from RSSParser