• Post Reply Bookmark Topic Watch Topic
  • New Topic

reading a web page and extract its contents  RSS feed

 
Pezhman Na
Greenhorn
Posts: 11
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Hi, I was wondering how and with which library it's possible to read a url (for example this page) and extract its contents ? thanks
 
Bear Bibeault
Author and ninkuma
Marshal
Posts: 65833
134
IntelliJ IDE Java jQuery Mac Mac OS X
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Check out the URL and URLConnection classes of java.net.
 
Hauke Ingmar Schmidt
Rancher
Posts: 436
2
  • Likes 1
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
And you probably need something like HTMLUnit for getting / manipulating the content. If the page is XHTML, any XML parser would do.
  • Post Reply Bookmark Topic Watch Topic
  • New Topic
Boost this thread!