• Post Reply Bookmark Topic Watch Topic
  • New Topic
programming forums Java Mobile Certification Databases Caching Books Engineering Micro Controllers OS Languages Paradigms IDEs Build Tools Frameworks Application Servers Open Source This Site Careers Other all forums
this forum made possible by our volunteer staff, including ...
  • Campbell Ritchie
  • Ron McLeod
  • Paul Clapham
  • Bear Bibeault
  • Junilu Lacar
  • Jeanne Boyarsky
  • Tim Cooke
  • Henry Wong
Saloon Keepers:
  • Tim Moores
  • Stephan van Hulst
  • Tim Holloway
  • salvin francis
  • Frits Walraven
  • Scott Selikoff
  • Piet Souris
  • Carey Brown

Parsing html in frames?

Ranch Hand
Posts: 625
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
I spent a couple of days last week creating a servlet that goes to espn.go.com and retrieves the headlines. In order to do this I created a custom parser that shed all the html tags and pulled out the text of the headlines. I watched their site for days and noticed that setup for the headlines varies very slightly and I made my parser able to handle either way. Today I logged on and their whole website has changed. I think it's using frames now. I go to try to read the page source, and their is very little their to read. Is this the result of using frames? Or is this something else? Can I still access the html of the page? If anyone wants a look I'll include the link.
click here for espn.go.com
Ranch Hand
Posts: 4716
Scala Java
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
They are not using frames. they use a lot of javascript and might be using css too (not sure). You can view the html from IE by choosing view from the menu and selecting source. you can do the same thing in netscape. it looks to me like you can still parse the html. there is just not a lot of text there.
Remember to always leap before you look. But always take the time to smell the tiny ads:
Building a Better World in your Backyard by Paul Wheaton and Shawn Klassen-Koop
    Bookmark Topic Watch Topic
  • New Topic