Perforce Chronicle 2012.2/486814
API Documentation

P4Cms_Filter_PptxToText Class Reference

Filter to convert a Microsoft PowerPoint 2007 document to text.

Public Member Functions

 filter ($pptx)
 Extract the text contents of a PowerPoint document.

Detailed Description

Filter to convert a Microsoft PowerPoint 2007 document to text.

This implementation uses Zend_Search_Lucene_Document_Pptx to extract the text contents of a PowerPoint document. Only the PowerPoint 2007 format is supported.

Copyright:
2011-2012 Perforce Software. All rights reserved
License:
Please see LICENSE.txt in top-level folder of this distribution.
Version:
2012.2/486814

Member Function Documentation

P4Cms_Filter_PptxToText::filter ($pptx)

Extract the text contents of a PowerPoint document.

Parameters:
string $pptx the PowerPoint contents to be filtered.
Returns:
string the plain text output; nothing is returned when $pptx is empty.
    {
        // shortcut if we have an empty string
        if (!$pptx) {
            return;
        }

        // write contents to a tmp file
        $tempFile = tempnam(sys_get_temp_dir(), 'powerpoint');
        file_put_contents($tempFile, $pptx);

        $document = Zend_Search_Lucene_Document_Pptx::loadPptxFile($tempFile);

        // remove the temp file
        unlink($tempFile);
        
        return $document->getFieldValue('body');
    }
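
The temporary-file step in the listing above can be isolated into a small helper. The following is a minimal sketch, not part of Chronicle: the function name withTempFile and its parameters are hypothetical, and it assumes PHP 5.5 or later for try/finally. It writes the contents to a temporary file, hands the path to a callback, and removes the file even if the callback throws.

```php
<?php
/**
 * Hypothetical helper (not part of Chronicle): write $contents to a
 * temporary file, pass the file's path to $callback, and remove the
 * file afterwards, even if the callback throws.
 */
function withTempFile($contents, callable $callback, $prefix = 'powerpoint')
{
    $tempFile = tempnam(sys_get_temp_dir(), $prefix);
    if ($tempFile === false) {
        throw new RuntimeException('Unable to create a temporary file.');
    }

    try {
        if (file_put_contents($tempFile, $contents) === false) {
            throw new RuntimeException('Unable to write to the temporary file.');
        }
        return $callback($tempFile);
    } finally {
        // runs on both the normal and the exceptional path
        if (is_file($tempFile)) {
            unlink($tempFile);
        }
    }
}

// Usage: the callback stands in for the document loader.
$seenPath = null;
$length = withTempFile('example bytes', function ($path) use (&$seenPath) {
    $seenPath = $path;
    return strlen(file_get_contents($path));
});
```

In filter(), the callback would presumably call Zend_Search_Lucene_Document_Pptx::loadPptxFile() with the path and return the document's 'body' field, as the listing does inline.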

The documentation for this class was generated from the following file: