RadaeePDF.com :: Topic: Search regular expression in pdf (1/2)
Welcome, Guest
Username: Password: Remember me

Signin/Signup with:

Questions about Android development and PDF
  • Page:
  • 1
  • 2

TOPIC: Search regular expression in pdf

Search regular expression in pdf 1 year 9 months ago #10426

  • Rignanese
  • Rignanese's Avatar
  • OFFLINE
  • Fresh Boarder
  • Posts: 17
  • Karma: 0
Hi,
is it possibile with Radaee library to search Regular Expression in a pdf?
Regards
The administrator has disabled public write access.

Search regular expression in pdf 1 year 9 months ago #10427

  • radaee
  • radaee's Avatar
  • OFFLINE
  • Moderator
  • Posts: 826
  • Thank you received: 10
  • Karma: 43
consider Regular Expression is most popular, and lots open source enabled.
this feature is possible,
currently, u can implement it on APP layer with java.
The administrator has disabled public write access.

Search regular expression in pdf 1 year 9 months ago #10429

  • ankur123987
  • ankur123987's Avatar
  • OFFLINE
  • Expert Boarder
  • Available as freelancer (ex website to app 30min)
  • Posts: 97
  • Karma: 0
hey Radaee could you provide any demo for serach the text with in pdf or write separate thread for RDPDFViewer Module?

Thanks
Last Edit: 1 year 9 months ago by ankur123987.
The administrator has disabled public write access.

Search regular expression in pdf 1 year 9 months ago #10430

  • support
  • support's Avatar
  • OFFLINE
  • Administrator
  • Posts: 513
  • Thank you received: 43
  • Karma: 6
It's mostly an android development, we could work on it within some days starting from now but it could be a proof of concept and not a full usable class.

You should develop on your own:
- cycle through pages ("n" page)
- extract text from pages and put it in an array of "n" elements
- apply your regular expression to each array element
- find the first an last char extracted by regex from page "m"
- inspect for the whole substring in the "m" page and highlight it on the page
- when moving to a different page you should apply step of above point using the proper substring

Different approach:
- cycle through pages ("n" page)
- extract text from actual page
- if regex is false continue cycling
- if regex is good, find the first an last char extracted by regex from page "m"
- inspect for the whole substring in the "m" page and highlight it on the page
- when moving to a different page you should cycle through pages starting from actual one and looking for a new positive result

As you could guess the whole process couldn't be fast.
The first approach will be slower at start and quicker when array result is ready.
The first approach could be adopted to provide a full result list to user.
The second approach will be quicker while starting up but slower when moving.
The administrator has disabled public write access.

Search regular expression in pdf 1 week 6 days ago #13310

  • sivus
  • sivus's Avatar
  • OFFLINE
  • Fresh Boarder
  • Posts: 3
  • Karma: 0
Hi,
is it now (01.2018) possibile with Radaee library to search Regular Expression in a pdf?
Regards
The administrator has disabled public write access.

Search regular expression in pdf 1 week 6 days ago #13311

  • support
  • support's Avatar
  • OFFLINE
  • Administrator
  • Posts: 513
  • Thank you received: 43
  • Karma: 6
No. Main reason is that such an approach would be quite slow and we prefer not to offer a feature with poor performances.

We can offer a full indexing and FTS approach performances are really great (after having indexed the file, the FTS search relies on sqlite-fts tables).
We've developed both clients-side (Android and iOS, coming soon Windows) and server side (Linux / Windows Server).

We offer it as add-on module.
The administrator has disabled public write access.
  • Page:
  • 1
  • 2
Powered by Kunena Forum