• Post Reply Bookmark Topic Watch Topic
  • New Topic

suggestions for web scraping in Java?  RSS feed

 
Aftab Hassan
Ranch Hand
Posts: 40
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Hello,
I would like to start learning "web scraping in java'. Can you suggest me what library I should be using ?

I have seen a few on the internet such as : 1.JSoup 2.JTidy etc. But would be good to get some suggestions before I get started on one.
Or should I use import java.net.* ?

Thanks
 
Ulf Dittmer
Rancher
Posts: 42972
73
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
HtmlUnit is the best, IMO.
 
Aftab Hassan
Ranch Hand
Posts: 40
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Ulf Dittmer wrote:HtmlUnit is the best, IMO.

Thanks Ulf, that was a quick reply
Let me just wait for some more time before I get started on one. Meanwhile, I've already started reading up on HtmlUnit.

Cheers!
 
Aftab Hassan
Ranch Hand
Posts: 40
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
While I'm just waiting to get a few suggestions, I've taken some baby steps. Please see my code below.
I'm glad to see the first output.
 
Ulf Dittmer
Rancher
Posts: 42972
73
 
  • Post Reply Bookmark Topic Watch Topic
  • New Topic
Boost this thread!