U.S. Patent 7280052 was granted by the United States Patent and Trademark Office in October 2007 and assigned to Intel.
A method and system for compressing an input data item, such as a cache line of machine-executable code, by selecting a partition of the data item into subsequences that appear as words in the dictionary of a probability-based encoding scheme, such that encoding the subsequences produces a compressed representation of the data that satisfies a desired efficiency parameter, e.g., minimized length. The method may implement an algorithm that identifies one or more possible compressed representations of the input data item and selects the more efficient one.
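The partition selection described above can be illustrated with a short dynamic-programming sketch. This is not the algorithm claimed in the patent, only one plausible way to pick a minimum-length partition. The function name `best_partition`, the `code_lengths` table (dictionary word mapped to its encoded length in bits), and the sample values are all hypothetical and chosen purely for illustration.

```python
from typing import Dict, List, Optional, Tuple


def best_partition(
    data: str, code_lengths: Dict[str, int]
) -> Optional[Tuple[int, List[str]]]:
    """Return (total_bits, words) for the cheapest partition of `data`
    into dictionary words, or None if no such partition exists.

    `code_lengths` is a hypothetical table giving the encoded length,
    in bits, of each dictionary word.
    """
    n = len(data)
    inf = float("inf")
    cost = [inf] * (n + 1)   # cost[i]: fewest bits to encode data[:i]
    back = [0] * (n + 1)     # back[i]: start index of the last word used
    cost[0] = 0
    max_len = max((len(w) for w in code_lengths), default=0)

    for end in range(1, n + 1):
        # Only consider candidate words no longer than the longest entry.
        for start in range(max(0, end - max_len), end):
            bits = code_lengths.get(data[start:end])
            if bits is not None and cost[start] + bits < cost[end]:
                cost[end] = cost[start] + bits
                back[end] = start

    if cost[n] == inf:
        return None  # the input cannot be covered by dictionary words

    # Walk the back-pointers to recover the chosen subsequences.
    words: List[str] = []
    pos = n
    while pos > 0:
        words.append(data[back[pos]:pos])
        pos = back[pos]
    words.reverse()
    return cost[n], words


# Illustrative table: encoding "ab" twice (4 + 4 bits) beats the single
# word "abab" (9 bits) and the four single symbols (12 bits).
example_lengths = {"a": 3, "b": 3, "ab": 4, "abab": 9}
result = best_partition("abab", example_lengths)
```

In this sketch the efficiency parameter is total encoded length. A different parameter could be substituted by changing what `cost` accumulates.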