Patent attributes
A device, a control method, and a program to increase the accuracy of voice read-out and text mining by automatically structuring a presentation file. The arrangement and practice of the invention involves an overlap grouping part for extracting overlap information between objects in a presentation file and grouping the objects as a parent-child relationship; a graph dividing grouping part for grouping the objects as a sibling relationship by representing the objects as nodes of a graph and by recursively dividing the graph so that a predefined cost between the nodes is minimized; a distance information grouping part for further grouping the objects as a sibling relationship if distance information between the objects is below a threshold determined by a predefined computation from a distribution histogram of the distance information; and a link information extraction part for extracting arrow graphics that represents a link relationship and generating link information including the link relationship and a link label. The resulting structured data is output as meta-information.