This week's book giveaway is in the OCAJP forum.
We're giving away four copies of Programmer's Guide to Java SE 8 Oracle Certified Associate (OCA) and have Khalid A Mughal & Rolf W Rasmussen on-line!
See this thread for details.
Win a copy of Programmer's Guide to Java SE 8 Oracle Certified Associate (OCA) this week in the OCAJP forum!
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic

Html document to java

 
Gabriel Beres
Ranch Hand
Posts: 61
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Hi!

I'm searching for a java library, that allows me, to parse a html formatted string, separate the content from html tags, and bind it to java objects, so i can save it to database with hibernate, and load the whole page later.

Thanks for help.
 
Ulf Dittmer
Rancher
Posts: 42968
73
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
There are several libraries that will transform HTML into a DOM tree as best as they can (NekoXNI, TagSoup, JTidy). From that DOM tree you could then construct a hibernatable presentation that suits your needs.
 
Gabriel Beres
Ranch Hand
Posts: 61
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Thanks. I found jerichohtml, and it looks like it suids my needs.
 
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic