[cairo] PDF Text Extraction: Past and Present

Baz brian.ewins at gmail.com
Sat Feb 3 13:13:57 PST 2007


On 03/02/07, Adrian Johnson <ajohnson at redneon.com> wrote:
> The attached patch
>  - Generates ToUnicode mappings for all fonts
>  - Adds a TrueType/OpenType reverse cmap lookup function.
>  - Adds FT and Win32 font backend functions for mapping glyphs to
>    unicode. These backend functions are fallbacks for when the
>    reverse cmap fails (although for win32 the backend function
>    only supports Type1 fonts).
>
> Text selection works well in acroread however evince does not
> correctly select TrueType fonts. This seems to be caused by
> the individual glyph positioning in the content stream.

I take it that I just need to implement load_truetype_table for this
to work for atsui too? (From reading the patch, it seems I won't even
get the reverse cmap lookup at the moment.)

-Baz

