Identify a searchable PDF
- arcmobile.div
- Topic Author
- Offline
- Premium Member
11 years 1 week ago - 11 years 1 week ago #8567
by arcmobile.div
Identify a searchable PDF was created by arcmobile.div
How can I identify whether a PDF is searchable or not?
Kindly provide the API.
A searchable PDF is essentially a PDF image file. Unlike static image formats such as TIFF, JPEG and BMP, every PDF document can contain several layers of information, i.e. an image layer and a text layer. The image layer carries information including the actual image, resolution, compression method, color depth, etc. Similarly, the text layer includes the actual ASCII text and the location of that text on the page. In simple terms, the text portions of the scanned document are stored in a text layer, allowing the user to search for and locate any keyword within the scanned document.
Last edit: 11 years 1 week ago by arcmobile.div.
11 years 1 week ago #8568
by arcmobile.div
Replied by arcmobile.div on topic Identify a searchable PDF
Any update?
11 years 5 days ago - 11 years 17 hours ago #8573
by arcmobile.div
Replied by arcmobile.div on topic Identify a searchable PDF
Kindly update
Last edit: 11 years 17 hours ago by arcmobile.div.
10 years 11 months ago - 10 years 11 months ago #8631
by support
Replied by support on topic Identify a searchable PDF
Dear user, we evaluated the request and concluded that this kind of API isn't of interest to our user base.
The request has been marked and put in our requests list with low priority.
You can easily reproduce it on your side by extracting the page text and checking for a non-empty text string.
Note: please do not add new empty posts to your thread.
Even if your aim is to push us to give you an answer, you're getting the opposite result: the thread appears to contain replies and disappears from the "thread without answer" list, so we might miss the request.
Last edit: 10 years 11 months ago by support.
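The check suggested above (extract the page text, then look for a non-empty string) can be sketched independently of any PDF library. This is a minimal illustration only: `TextLayerCheck` and `hasText` are hypothetical names, and the list of per-page strings is assumed to have been extracted already by whatever text API is in use.

```java
import java.util.List;

// Hypothetical helper, not part of any PDF library: it only inspects strings
// that the caller has already extracted, one entry per page.
final class TextLayerCheck {
    private TextLayerCheck() {}

    // Returns true as soon as one page yields non-blank text.
    static boolean hasText(List<String> pageTexts) {
        if (pageTexts == null) {
            return false;
        }
        for (String text : pageTexts) {
            // A null or whitespace-only string counts as "no text" for that page.
            if (text != null && !text.trim().isEmpty()) {
                return true;
            }
        }
        return false;
    }
}
```

Stopping at the first page with text keeps the check cheap for long documents, at the cost of treating a document with a single text page the same as a fully searchable one.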
10 years 11 months ago - 10 years 11 months ago #8633
by Davide
Replied by Davide on topic Identify a searchable PDF
Hi,
to extract text from a PDF you can use:
Code:
ObjsStart();
which loads the page's text objects into memory, and then:
Code:
ObjsGetString(int from, int to);
which returns the string for the given range.
For more info check this: www.radaeepdf.com/documentation/javadocs...e.html#ObjsGetString (int, int)
Last edit: 10 years 11 months ago by Davide.
10 years 11 months ago - 10 years 11 months ago #8680
by arcmobile.div
Replied by arcmobile.div on topic Identify a searchable PDF
I am using the following code to determine whether the PDF is searchable or not.
I call the method after the document is loaded, but even with a searchable PDF the String text comes back null.
Code:
private boolean checkSearchablePDF() {
    boolean isSearchablePDF = false;
    if (mDoc != null) {
        int totalPageCount = mDoc.GetPageCount();
        for (int pageCount = 0; pageCount < totalPageCount; pageCount++) {
            Page page = mDoc.GetPage(pageCount + 1);
            page.ObjsStart();
            String text = page.ObjsGetString(0, page.ObjsGetCharCount() - 1);
            if (text != null && text.trim().length() > 0) {
                isSearchablePDF = true;
                break;
            }
        }
    }
    return isSearchablePDF;
}
Kindly provide your inputs.
Last edit: 10 years 11 months ago by arcmobile.div.
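One possible cause worth ruling out in the code above, offered as an assumption rather than a confirmed diagnosis: if GetPage expects a 0-based page number, then passing pageCount + 1 skips the first page and asks for a page past the end on the last iteration. A page with no characters would also lead to ObjsGetString(0, -1). The sketch below guards against both. `TextPage` and `TextDocument` are hypothetical stand-ins that mirror only the calls quoted in this thread (plus an assumed `Close()`), so the logic can be compiled and tested without the library. Whether `to` is inclusive is not verified here, so the range is kept exactly as in the post.

```java
// Hypothetical stand-ins that mirror only the calls used in this thread;
// they are NOT the RadaeePDF classes themselves.
interface TextPage {
    void ObjsStart();
    int ObjsGetCharCount();
    String ObjsGetString(int from, int to);
    void Close(); // assumed cleanup call
}

interface TextDocument {
    int GetPageCount();
    TextPage GetPage(int pageNo); // assumed to be 0-based in this sketch
}

final class SearchablePdfCheck {
    private SearchablePdfCheck() {}

    // Returns true when at least one page yields non-blank text.
    static boolean isSearchable(TextDocument doc) {
        if (doc == null) {
            return false;
        }
        int pageCount = doc.GetPageCount();
        for (int pageNo = 0; pageNo < pageCount; pageNo++) {
            TextPage page = doc.GetPage(pageNo); // no "+ 1" under the 0-based assumption
            if (page == null) {
                continue;
            }
            try {
                page.ObjsStart();
                int charCount = page.ObjsGetCharCount();
                if (charCount <= 0) {
                    continue; // image-only page, avoid asking for the range (0, -1)
                }
                String text = page.ObjsGetString(0, charCount - 1);
                if (text != null && !text.trim().isEmpty()) {
                    return true;
                }
            } finally {
                page.Close(); // runs on every path, including continue and return
            }
        }
        return false;
    }
}
```

If the text still comes back null with valid page numbers and a non-zero character count, the cause lies elsewhere and the page-index assumption can be discarded.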