• Post Reply
  • Bookmark Topic Watch Topic
  • New Topic

Data mining a webpage

 
Gorby Green
Greenhorn
Posts: 12
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Hi!
I want to create at data mining applikation that collects info from a dropdown list on a webpage. My purpose with this is to store that information into a single xml-file.
What is the best way to do this?

Does anyone know some good site or some good api?

My example:
I would like to store data from date and country- droplists from this html- site.

<html
<head>
<title>Testpage</title>
</head>
<body>
<form name="date" id="date" method="post" action="">
<select name="select">
<option value="20060101">01 Jan 06</option>
<option value="20060102">02 Jan 06</option>
<option value="20060103">03 Jan 06</option>
<option value="20060104">04 Jan 06</option>
</select>
</form>
<form name="country" id="country" method="country" action="">
<select name="select">
<option value="au">Australia</option>
<option value="dk">Denmark</option>
<option value="fi">Finland</option>
<option value="/fr/">France</option>
<option value="/de/">Germany</option>
</select>
</form>
</body>
</html>

/Gorby
 
Ulf Dittmer
Rancher
Posts: 42968
73
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
For all these purposes I recommened HttpUnit (on SourceForge). It's meant to be a web testing extension to JUnit, but can very nicely be used for accessing web pages programmatically.
 
Tom Blough
Ranch Hand
Posts: 263
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Search the forums. This topic has been covered recently.

Cheers,
 
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic