Go Back   C-Command Forums > EagleFiler

Reply
 
Thread Tools Display Modes
Old 02-01-2010, 04:29 PM   #1
Josef
 
Join Date: Jan 2010
Posts: 90
Default How to get info from Journal pdf

I try to import IEEE Journal files (pdf) like 'yyyynnpppp.pdf' (year, issue number, page).

EF's Info Inspector shows their info like
[title] : title of Journal
[from] : authors of Journal
for ONE group of files, however, shows like
[title] : yyyynnpp (file name itself)
[from] : empty
for another group of files.

I guess this difference comes from different pdf format somehow. And so, if I know about what kind of way EF extracts such info from pdf file, I can import them (9700 files : 7 GB) with correct info via appropriate Script.

Please suggest any information or hint about this.
  Reply With Quote
Old 02-01-2010, 07:44 PM   #2
Michael Tsai
Developer
 
Join Date: Aug 2006
Posts: 4,128
Default

Quote:
Originally Posted by Josef View Post
I guess this difference comes from different pdf format somehow.
EagleFiler extracts the title from the PDF’s title field (which you can view using Get Info in the Finder, for example).
  Reply With Quote
Old 02-01-2010, 09:10 PM   #3
Josef
 
Join Date: Jan 2010
Posts: 90
Default

Thank you for kind reply as always.

Your information has made 1st step forward to get the same results as Mendeley.

I will try to make script according to your previous suggestion in this forum.

Regards,
  Reply With Quote
Old 02-02-2010, 09:36 AM   #4
Michael Tsai
Developer
 
Join Date: Aug 2006
Posts: 4,128
Default

Also, if the PDF’s title field is empty EagleFiler will use the filename.
  Reply With Quote
Old 02-02-2010, 06:07 PM   #5
Josef
 
Join Date: Jan 2010
Posts: 90
Default

Thank you for additional advice.

Is there any way to get info from pdf files that have standard expression (title, authors, abstract, ...) with file name as yyyynnpppp.pdf.

Practically, I have to make script for that purpose?
  Reply With Quote
Old 02-02-2010, 07:32 PM   #6
Michael Tsai
Developer
 
Join Date: Aug 2006
Posts: 4,128
Default

Quote:
Originally Posted by Josef View Post
Is there any way to get info from pdf files that have standard expression (title, authors, abstract, ...) with file name as yyyynnpppp.pdf.
Well, EagleFiler will extract the title and author automatically and make them available in the user interface as well as via AppleScript. If you want to directly get at the metadata in the PDF file, you could perhaps write a script that uses PDFpen or mdls.
  Reply With Quote
Old 02-08-2010, 09:22 AM   #7
Josef
 
Join Date: Jan 2010
Posts: 90
Default

Using mdls command, I can extract the title and author automatically from kMDItemTitle and kMDItemAuthors even in the case of yyyynnpppp.pdf.

Thank you for your kind advice, that is truly helpful for beginners of EF and OSX like me.
  Reply With Quote
Reply

Thread Tools
Display Modes


Similar Threads
Thread Thread Starter Forum Replies Last Post
Info Panel sychou EagleFiler 2 11-13-2008 10:19 AM
Automating Web Snippet Info davet EagleFiler 10 10-28-2008 02:24 PM
show iCal info talazem EagleFiler 4 10-16-2008 04:08 PM
Tags not sticking in Info Inspector cdavisjr EagleFiler 2 06-03-2008 03:10 PM
tagging and info for folders talazem EagleFiler 2 12-16-2006 11:24 AM


All times are GMT -4. The time now is 06:54 AM.


Powered by vBulletin® Version 3.8.6
Copyright ©2000 - 2010, Jelsoft Enterprises Ltd.