
Read a Word document

 
Ram Kas
Ranch Hand
Posts: 83
Hi,

I want to read a Word document and print its contents to the console. But when I read it the same way I would read a text file, it displays some weird characters. Can anyone shed some light on how I should proceed?

Thanks in advance.

Dinakar Kasturi.
 
Ulf Dittmer
Rancher
Posts: 42970
DOC is a binary file format, so you can't read it the way you would read a text file. One API that can extract the text from a .doc file is Jakarta POI (now Apache POI); you can find some usage examples here.
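
To make the "binary format" point concrete, here is a minimal sketch that uses only the JDK. It checks whether a file begins with the 8-byte signature of an OLE2 compound file, which is the container that binary .doc files are stored in. The class and method names (DocSniffer, hasOle2Signature, isOle2File) are made up for illustration and are not part of POI. Note that .xls and .ppt files share the same container, so a match only tells you the file is not plain text.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Arrays;

// Hypothetical helper for illustration; not part of POI or any other library.
final class DocSniffer {

    // Signature found at the start of every OLE2 compound file.
    private static final byte[] OLE2_MAGIC = {
        (byte) 0xD0, (byte) 0xCF, 0x11, (byte) 0xE0,
        (byte) 0xA1, (byte) 0xB1, 0x1A, (byte) 0xE1
    };

    // True if the given bytes start with the OLE2 signature.
    static boolean hasOle2Signature(byte[] data) {
        if (data.length < OLE2_MAGIC.length) {
            return false;
        }
        return Arrays.equals(Arrays.copyOf(data, OLE2_MAGIC.length), OLE2_MAGIC);
    }

    // Reads only the first 8 bytes of the file and checks them.
    static boolean isOle2File(Path file) throws IOException {
        byte[] head = new byte[OLE2_MAGIC.length];
        int total = 0;
        try (InputStream in = Files.newInputStream(file)) {
            while (total < head.length) {
                int n = in.read(head, total, head.length - total);
                if (n < 0) {
                    break; // file is shorter than the signature
                }
                total += n;
            }
        }
        return total == head.length && hasOle2Signature(head);
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java DocSniffer <file>");
            return;
        }
        Path file = Paths.get(args[0]);
        if (isOle2File(file)) {
            System.out.println("Binary OLE2 container: use a library such as POI to extract the text.");
        } else {
            System.out.println("No OLE2 signature: this is not a binary .doc file.");
        }
    }
}
```

Once you know the file is a binary document, the extraction itself is the library's job. As far as I know, the usual entry point in POI's HWPF module is org.apache.poi.hwpf.extractor.WordExtractor, constructed from an InputStream and queried with getText(), but check the POI documentation for the version you are using.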
 