Win a copy of Cross-Platform Desktop Applications: Using Node, Electron, and NW.js this week in the JavaScript forum!
  • Post Reply Bookmark Topic Watch Topic
  • New Topic

Retrieve spreadsheets from a web site  RSS feed

 
Anna Husten
Greenhorn
Posts: 2
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Hello guys,

I am a data analyst and I want to download spreadsheets from a site.

The site looks like that, that a lot of spreadsheets are on the site and below there is a next button.

However, I want to download all these sheets to my local computer.

I have looked for web scrapping lib, but I am very inexperienced in such an operation. Therefore, it would be extremely kind if you could help me with my problem.

I appreciate your answer!

btw the site is: https://portal.mvp.bafin.de/database/DealingsInfo/sucheForm.do?emittentButton=Suche%20Emittent&RAP=-248889fb%3A1404d63cf50%3A-7fd1
 
Ulf Dittmer
Rancher
Posts: 42972
73
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
For programmatic web access I generally recommend the HtmlUnit library. There are tutorials on the web site, although I don't know whether they cover downloading files.
 
Karthik Shiraly
Bartender
Posts: 1210
25
Android C++ Java Linux PHP Python
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Anna, I think you can do this in a much simpler way, without doing any web scraping at all.
When you open that URL in your post, it shows a "CSV" link at bottom. That CSV file has *all* the records (of all 275 companies) in a single file. It should be much easier to process that single CSV, than scrape 14 odd pages.
 
  • Post Reply Bookmark Topic Watch Topic
  • New Topic
Boost this thread!