Excel friends,
I need to open and close 1000 PDF files, search for several text strings inside each PDF, extract the text located to the right of each string, and build a database from the extracted data.
Some more detail: these files are NOT PDF forms! They all have the same layout and the same kind of data. Each one has the basic ID data for an inventory item, for example:
Item No.: 123456 [] Storage: Deposit 1
and then, several text rows or PDF pages later:
City: Miami [] Telf: 305-XXX-YYYY [] Contact Person: John Doe
plus a lot more data and text inside the PDF that I am not interested in.
I need to extract to Excel a database with 1000 rows showing:
Item No. [] Storage [] City [] Telf [] Contact
123456 [] Deposit 1 [] Miami [] 305-XXX-YYYY [] John Doe
[The extracted database above should be aligned in columns, but this posting does not allow it.] I think the reader gets the idea of what I need.
Some PDFs have no phone or no contact info, but the field name is still there (e.g. "Contact" with nothing after it). Storage locations and city names vary in length and may contain spaces.
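To make the rule concrete, here is a rough sketch of the logic I have in mind. It is written in Python rather than VBA just to show the idea, and it assumes the text of each PDF has already been pulled out into a plain string (that step is not shown, and it is the part I do not know how to do). The name extract_fields and the label list are only illustrative. A value is taken to be whatever follows "Label:" up to the next known label on the same line, or the end of the line, so an empty field simply gives an empty cell.

```python
import re

# Field labels as they appear in the example lines above.
LABELS = ["Item No.", "Storage", "City", "Telf", "Contact Person"]

# Matches any known "Label:" so a value can be cut off where the next label starts.
_ANY_LABEL = re.compile("|".join(re.escape(label) + r"\s*:" for label in LABELS))


def extract_fields(text):
    """Return {label: value} for one PDF's text; missing values stay ''."""
    row = {label: "" for label in LABELS}
    for line in text.splitlines():
        matches = list(_ANY_LABEL.finditer(line))
        for i, m in enumerate(matches):
            # The value ends where the next label begins, or at end of line.
            end = matches[i + 1].start() if i + 1 < len(matches) else len(line)
            label = m.group(0).rstrip(":").rstrip()
            value = line[m.end():end].strip()
            # Keep the first non-empty value found for each label.
            if not row[label]:
                row[label] = value
    return row


sample = (
    "Item No.: 123456   Storage: Deposit 1\n"
    "other text I do not care about\n"
    "City: Miami   Telf: 305-XXX-YYYY   Contact Person: John Doe\n"
)
print(extract_fields(sample))
```

Calling this once per file and writing each result as one row would give the 1000-row table described above.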
Is it possible to extract this data with VBA? Can somebody point out a possible way to do this: a website with VBA code, or a solution using reasonably inexpensive software? I do have Adobe Acrobat Professional if needed.
Thanks!