Reg Crowbar

View: New views
1 Messages — Rating Filter:   Alert me  

Reg Crowbar

by Rakesh Soni :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

Hi,

I tried to get source(raw_page) of multiple websites using Crowbar, but crowbar didn't return me correct source.
It returned correct source of only one site.
for other sites, it didn't execute javascript embedded in sites.

Sites I tried are:

http://www.jr.com/viewsonic/pe/VIW_VT2430_hy_1M#productTabReviews
http://www.jr.com/audiovox/pe/VOX_D1917PK/#productTabReviews
http://www.jr.com/nokia/pe/NOK_N97BLK/#productTabReviews
http://www.jr.com/acer-computer/pe/ACE_AS3810T6415/#productTabReviews
http://www.jr.com/polk-psw505-powered-subwoofer/pe/POK_PSW505/#productTabReviews

Kindly let me know if many pages will hit crowbar at same time then it will work or not?
If it will work then how to test it?

PS: I am using Python programming language to scrap.

Thanks in advance
--
Rakesh Soni
Chennai



_______________________________________________
General mailing list
General@...
http://simile.mit.edu/mailman/listinfo/general