Hi,
when I extract non-english text I get wrong(chinese) symbols from time to time.
How can I fix it?
Example:
page.ObjsStart();
int pageLength = page.ObjsGetCharCount();
String tempStr = page.ObjsGetString(0, pageLength);
Results I get:
섄됵ксей Голощапов
nроrра�ирование
дn茐1 мо6иnьных
舠' у섐Ё⑀оиств
Сан섎т-Петербург
Results I expect
Алексей Голощапов
програмирование
для мобильных
устройств
Санкт-Петербург
I understand that this can be because of bad OCR text recognition,
but on images or PDF viewers it looks OK.
Maybe it is possible to force some encoding or something?