2.4 Document Content


This section specifies algorithms that are used to analyze document content and determine its properties. These algorithms take CPs as input and return some piece of information about the document content at that location. For example, the algorithm in section 2.4.1 returns the text at that CP.

Collectively, these algorithms specify relationships among data structures in the file types that are specified in this documentation. These relationships MUST be maintained. These algorithms are not examples, but definitions of how to interpret these data structures.

These algorithms can derive significant performance benefits from common programming practices such as caching the results from previous input.