|
|
#1 |
|
New Member
Join Date: Nov 2009
Posts: 4
|
Question from a trial user:
I have many PDFs of books I do in my research. Is there a way to get them into EF in a searchable way? Thanks |
|
|
|
#2 |
|
Developer
Join Date: Aug 2006
Posts: 4,128
|
If the PDFs contain text, EagleFiler can search them. If the PDFs only contain images, you would need to run them through OCR software, which adds a text layer that EagleFiler and other PDF software can read.
|
|
|
|
#3 |
|
Join Date: Oct 2008
Posts: 17
|
ABBYY FineReader and Acrobat can OCR PDFs into Searchable PDFs (text overlay)
|
|
|
|
#5 |
|
Join Date: Nov 2008
Posts: 89
|
I am always confused by the topic of OCR. My Brother printer/scanner appears to come with built-in OCR software. By using the "Brother Control Center" I can select an OCR option and get searchable text.
At the same time, doesn't Image Capture, which comes on the iMac, provide OCR capability? I think I've used that as an alternative to the Brother stuff. What is gained by using the other programs mentioned in this thread? |
|
|
|
#6 |
|
Join Date: Nov 2006
Posts: 211
|
Are there plans to integrate an OCR engine in EF?
If not, could someone share a script that automatically does OCR on PDFs using PDFPen on import? Thanks! |
|
|
|
#7 | |
|
Developer
Join Date: Aug 2006
Posts: 4,128
|
Quote:
That’s something I’m considering. Sorry, that’s all I want to say for now. That sounds like a great idea for a script! I’ll see what I can do. |
|
|
|
|
#8 |
|
Developer
Join Date: Aug 2006
Posts: 4,128
|
I’ve just written a script to do this, but there seem to be two bugs in PDFpen that prevent it from working. I’ve reported them to SmileOnMyMac, and I’ll update this thread when we have a resolution.
|
|
|
|
#9 |
|
Join Date: Nov 2006
Posts: 211
|
Great news, thanks!
|
|
|
|
#10 |
|
Join Date: Nov 2006
Posts: 211
|
I guess that no update to this thread means there has been no news on this front?
|
|
|
|
#11 |
|
Developer
Join Date: Aug 2006
Posts: 4,128
|
|
|
|
|
#12 |
|
Developer
Join Date: Aug 2006
Posts: 4,128
|
The PDFpen developer sent me a workaround for the problem, so I’ve posted the OCR With PDFpen script.
|
|
|
|
#13 |
|
Join Date: Nov 2006
Posts: 211
|
Great news, thanks!
I have a tiny suggestion: maybe the script could add the tag "ocred" or something similar to help finding pdfs which have not yet been ocred. |
|
|
|
#14 |
|
Join Date: Sep 2009
Posts: 19
|
Another solution would be to use the ocr to EagleFiler script that michael has. Make a target folder, and add that script to the folder as a folder action, have you scanner deposit the pdfs it makes to that folder, and the rest just happens. I know it works because I've been using it to scan my receipts into EagleFiler. My Backlog went away in short order with that script which is here.
|
|
|
|
#15 |
|
Join Date: Jan 2010
Posts: 8
|
Thank you, thank you, thank you for the the OCR With PDFpen script! This was really the last feature that made me vacillate between EagleFiler and DevonThink. Now, for me, EagleFiler is a clear winner!
|
|
|
|
#16 |
|
Join Date: Jan 2010
Posts: 8
|
There is a 20% discount on PDFpen and PDFpen Pro available until 02/28/10:
http://www.smileonmymac.com/mpu/ |
|
![]() |
| Thread Tools | |
| Display Modes | |
|
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Links to specific pages within PDFs | marick | EagleFiler | 1 | 03-03-2008 01:15 PM |
| Issues importing OCR PDFs from DevonThink Pro | spi | EagleFiler | 4 | 02-19-2008 03:04 PM |
| rotated pdfs | chipbrock | EagleFiler | 2 | 11-30-2007 10:39 AM |
| Viewing PDFs | brab | EagleFiler | 5 | 08-02-2007 04:11 PM |