gregthenovelist
New Member
- Joined
- Jul 2, 2021
- Messages
- 4
- Office Version
- 2016
- Platform
- MacOS
Hey Community,
I quickly skimmed through a few related articles on the forum, but I couldn't find a good answer. Maybe you can help me here!
I have transformed a pdf into an excel spreadsheet. I want to use this excel spreadsheet for some data analysis with STATA. Just that you understand what I am talking about, here is the pdf and the new excel spreadsheet:
To use the Excel data in STATA, I have to keep the structure of the pdf document somehow. So, I need a further section that, for instance, indicates that this is "Official Creditor" embedded in "of which interest arrears on LDOD" embedded in "short-term debt" embedded in "total debt stocks" under the category of "Albania". This is necessary so that I can compare them with a table from the year before and after.
One not very efficient option would be to introduce new columns to the left of the written part and indicate: Albania; Total Debt Stock; etc. and then merge them. This would be quite time-consuming as there are 147 countries on 10 pdfs.
Does anyone here know how I can most efficiently solve this problem? Is there potentially a "if - then" command that I could employ? How can I keep the "embeddedness" structure of the pdf in my excel spreadsheet? I would be very grateful for some ideas.
All the best,
Greg
I quickly skimmed through a few related articles on the forum, but I couldn't find a good answer. Maybe you can help me here!
I have transformed a pdf into an excel spreadsheet. I want to use this excel spreadsheet for some data analysis with STATA. Just that you understand what I am talking about, here is the pdf and the new excel spreadsheet:
To use the Excel data in STATA, I have to keep the structure of the pdf document somehow. So, I need a further section that, for instance, indicates that this is "Official Creditor" embedded in "of which interest arrears on LDOD" embedded in "short-term debt" embedded in "total debt stocks" under the category of "Albania". This is necessary so that I can compare them with a table from the year before and after.
One not very efficient option would be to introduce new columns to the left of the written part and indicate: Albania; Total Debt Stock; etc. and then merge them. This would be quite time-consuming as there are 147 countries on 10 pdfs.
Does anyone here know how I can most efficiently solve this problem? Is there potentially a "if - then" command that I could employ? How can I keep the "embeddedness" structure of the pdf in my excel spreadsheet? I would be very grateful for some ideas.
All the best,
Greg