
Read a word document

 
Ram Kas
Ranch Hand
Posts: 83
Hi,

I want to read a Word document and print its contents to the console. But when I read it the way I would read a text file, it displays some weird characters. Can anyone shed some light on how I should proceed?

Thanks in advance.

Dinakar Kasturi.
 
Ulf Dittmer
Rancher
Posts: 42968
DOC is a binary file format; you can't treat it like you would treat text files. An API that can extract the text from a doc file is Jakarta POI; you can find some usage examples here.
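
To illustrate why reading the file as text goes wrong, here is a minimal sketch that uses only the standard library (Java 11 or later). It checks whether a file starts with the 8-byte signature of the OLE2 compound file container, which is what legacy .doc files are stored in. The class name DocSniffer and the method looksLikeOle2 are made up for this example and are not part of POI or any other library. The sketch does not extract any text; it only shows how a program could notice that it has been handed a binary document instead of a text file.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;

// Hypothetical helper for illustration only; not part of any library.
class DocSniffer {

    // Signature found at the start of every OLE2 compound file,
    // the binary container used by legacy .doc files.
    private static final byte[] OLE2_MAGIC = {
        (byte) 0xD0, (byte) 0xCF, 0x11, (byte) 0xE0,
        (byte) 0xA1, (byte) 0xB1, 0x1A, (byte) 0xE1
    };

    // Returns true if the given leading bytes start with the OLE2 signature.
    static boolean looksLikeOle2(byte[] head) {
        return head.length >= OLE2_MAGIC.length
            && Arrays.equals(Arrays.copyOf(head, OLE2_MAGIC.length), OLE2_MAGIC);
    }

    // Reads just the first 8 bytes of the file and checks them.
    static boolean looksLikeOle2(Path file) throws IOException {
        try (InputStream in = Files.newInputStream(file)) {
            return looksLikeOle2(in.readNBytes(OLE2_MAGIC.length));
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java DocSniffer.java <file>");
            return;
        }
        Path file = Path.of(args[0]);
        if (looksLikeOle2(file)) {
            // Printing these bytes as characters is what produces the "weird characters".
            System.out.println("Binary OLE2 document: use a DOC-aware library to extract the text.");
        } else {
            // Assumes the file is UTF-8 text; other encodings would need an explicit charset.
            System.out.println(Files.readString(file));
        }
    }
}
```

The branch that reports a binary document is where a library such as POI would take over in a real program. That call is left out here because the exact API depends on the POI version in use.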
 