
I want to create a project this week, what should I know?

 
Sergiu Dobozi
Ranch Hand
Posts: 107
I want to create a program in Java that will extract all the images from a Reddit page and place them into a folder on my computer. Alternatively, it should extract only the titles of the posts and place them into a doc file. I have no idea which one would be more difficult to do.
Below is an example of a Reddit page:
https://www.reddit.com/r/funny/
What kind of knowledge should I possess to achieve this? Do I need to read up on nodes, XML files or something else?

 
Jeanne Boyarsky
author & internet detective
Marshal
Posts: 37462
I think getting the post titles will be easier. Consider using Selenium to load the web page. Then you need to know XPath or CSS selectors to pick the titles out of the page.
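To give a feel for the XPath part, here is a minimal sketch that uses only the XPath support built into the JDK, so it runs without Selenium or any other dependency. The class name `TitleExtractor`, the sample markup, and the `//a[@class='title']` expression are all assumptions made for illustration. Reddit's real markup is different and changes over time, and real-world HTML is usually not well-formed XML, so you would likely need Selenium or an HTML parser to get a usable document first.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

public class TitleExtractor {

    // Hypothetical selector: assumes every post title is an <a class="title"> element.
    static final String TITLE_XPATH = "//a[@class='title']";

    // Parses a well-formed XHTML string and returns the text of each matching link.
    static List<String> extractTitles(String xhtml) throws Exception {
        Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(xhtml)));

        XPath xpath = XPathFactory.newInstance().newXPath();
        NodeList nodes = (NodeList) xpath.evaluate(TITLE_XPATH, doc, XPathConstants.NODESET);

        List<String> titles = new ArrayList<>();
        for (int i = 0; i < nodes.getLength(); i++) {
            titles.add(nodes.item(i).getTextContent().trim());
        }
        return titles;
    }

    public static void main(String[] args) throws Exception {
        // Made-up page fragment standing in for a downloaded page.
        String page = "<html><body>"
                + "<div class=\"post\"><a class=\"title\" href=\"/a\">First post</a></div>"
                + "<div class=\"post\"><a class=\"title\" href=\"/b\">Second post</a></div>"
                + "<div class=\"ad\"><a class=\"promo\" href=\"/c\">Not a post</a></div>"
                + "</body></html>";

        for (String title : extractTitles(page)) {
            System.out.println(title);
        }
    }
}
```

Once you have the list of titles, `java.nio.file.Files.write` can save them to a text file, one title per line.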