ExtractFilePageTextBlocks

Text, Extraction, Page properties

Description

Similar to the ExtractFilePageText function but the results are stored in a text block list rather than returned as a CSV string.

This function internally uses the direct access functionality.

Once the results are in the text block list, functions such as DAGetTextBlockCount, DAGetTextBlockText and DAGetTextBlockColor can be used to retrieve the properties of each block of text.

Syntax

Delphi

function TPDFlib.ExtractFilePageTextBlocks(Const InputFileName, Password: WideString; Page, Options: Integer): Integer;

ActiveX

Function PDFlib::ExtractFilePageTextBlocks(InputFileName As String, Password As String, Page As Long, Options As Long) As Long

DLL

int DLExtractFilePageTextBlocks(int InstanceID, wchar_t * InputFileName, wchar_t * Password, int Page, int Options);

Parameters

InputFileNameThe path and file name of the file to extract text from. The password to use, if any, when opening the file
Password
PageThe number of the page that must be extracts. The first page in the document is page 1.
Options3 = Normal extraction 4 = Split words

Return values

0The text could not be extracted
1A TextBlockListID value