This code example shows how to use the Apache POI library to read a PowerPoint presentation file, and how to extract text, images and notes from it.
This code works with binary PPT files (.ppt), not the XML format (.pptx). POI's APIs for both are pretty similar, though. One would use JavaDoc:org.apache.poi.xslf.XSLFSlideShow instead of HSLSFSlideShow, and then use the classes in org.apache.poi.xslf.* instead of the ones in org.apache.poi.hssf.*