Combine Word Files into Excel

Tofik

Office Version
  1. 365
  2. 2016
Platform
  1. Windows
Hi guys, I need help. This is very important to me, so if anyone knows how to do it, please help.
1) The Word files all contain the same kind of information. I need to combine all the Word files into one Excel file (only the yellow lines); see Example 2. I have also attached a Word document for you.


1632925814551.png


Example 2:
In the Excel file it should look like this.

1632926007453.png
 

If you save your file as a .PDF, you can import it into Power Query and clean it up as shown in the following M code. Then load the result to a native Excel sheet.

Power Query:
let
    Source = Pdf.Tables(File.Contents("C:\Users\alans\Downloads\05-B92-4-883(1-6).pdf"), [Implementation="1.3"]),
    Table002 = Source{[Id="Table002"]}[Data],
    #"Changed Type" = Table.TransformColumnTypes(Table002,{{"Column1", type text}, {"Column2", type text}, {"Column3", type text}, {"Column4", type text}, {"Column5", type text}, {"Column6", type text}, {"Column7", type text}, {"Column8", type text}, {"Column9", type text}, {"Column10", type text}, {"Column11", type text}, {"Column12", type text}, {"Column13", type text}, {"Column14", type text}, {"Column15", type text}, {"Column16", type text}, {"Column17", type text}, {"Column18", type text}, {"Column19", type text}}),
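    // Skip the three rows that sit above the header row
    #"Removed Top Rows" = Table.Skip(#"Changed Type",3),
    // Keep the header row plus the five data rows
    #"Kept First Rows" = Table.FirstN(#"Removed Top Rows",6),
    // Use the first kept row as the column headers
    #"Promoted Headers" = Table.PromoteHeaders(#"Kept First Rows", [PromoteAllScalars=true]),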
    #"Changed Type1" = Table.TransformColumnTypes(#"Promoted Headers",{{"", Int64.Type}, {"Drawing No", type text}, {"Rev#(lf)No.", Int64.Type}, {"*** No", type text}, {"Material 1#(lf)Grade No", type text}, {"Material 2#(lf)Grade No", type text}, {"Thickness#(lf)mm", Int64.Type}, {"Wel#(lf)d#(lf)No.", Int64.Type}, {"Weld#(lf)Type", type text}, {"Penet#(lf)ration#(lf)Type", type text}, {"Welder#(lf)No.", type text}, {"WPS#(lf)No.", Int64.Type}, {"Weld Date", type date}, {"Acc", type text}, {"Rej", type text}, {"RT", type text}, {"UT", type text}, {"MT", type text}, {"PT", type text}})
in
    #"Changed Type1"

Book2, Sheet1:
Column1 | Drawing No | Rev No. | *** No | Material 1 Grade No | Material 2 Grade No | Thickness mm | Wel d No. | Weld Type | Penet ration Type | Welder No. | WPS No. | Weld Date | Acc | Rej | RT | UT | MT | PT
1 | AF-SS01-CIV-SSD-0083 | 1 | 05-B92-4-883 | S275JR | S275JR | 5 | 1 | FW | N/A | SW-066 | 24 | 10/2/2020 | | | | | |
2 | AF-SS01-CIV-SSD-0083 | 1 | 05-B92-4-883 | S275JR | S275JR | 3 | 2 | FW | N/A | SW-066 | 24 | 10/2/2020 | | | | | |
3 | AF-SS01-CIV-SSD-0083 | 1 | 05-B92-4-883 | S275JR | S275JR | 5 | 3 | FW | N/A | SW-066 | 24 | 10/2/2020 | | | | | |
4 | AF-SS01-CIV-SSD-0083 | 1 | 05-B92-4-883 | S275JR | S275JR | 5 | 4 | FW | N/A | SW-069 | 24 | 10/2/2020 | | | | | |
5 | AF-SS01-CIV-SSD-0083 | 1 | 05-B92-4-883 | S275JR | S275JR | 3 | 5 | FW | N/A | SW-069 | 24 | 10/2/2020 | | | | | |


Power Query is a free add-in for Excel 2010 and 2013, and is built into Excel 2016 onwards (where it is referred to as "Get & Transform Data").

It is a powerful yet simple way of getting, changing and using data from a broad variety of sources, creating steps which may be easily repeated and refreshed. I strongly recommend learning how to use Power Query; it is among the most powerful features of Excel.

 
The problem is that I have a lot of these Word documents, nearly 4,500, all in the same format, and I just want to create a database quickly. Converting them to PDF takes too much time.
I have 1 day to create it. I watched a lot of videos on YouTube, but they were useless. I need another option.
 
Good luck. The only thing I can envision is some VBA to loop through all 4,500 documents (they should all be in the same folder) and convert them to .pdf or .html files. Then use Power Query to bulk-load them and combine them into one big table. You could have an issue with row availability, though: an Excel worksheet holds at most 1,048,576 rows.
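
For the bulk-load step, something along these lines might work once the PDFs exist. It is only a rough sketch, not tested against your files: the folder path "C:\Converted", the helper name ExtractRows and the "Source File" column are made up for illustration, and it assumes every PDF exposes the table under the same Id ("Table002") with the same three rows above the header and five data rows, as in the single-file query earlier in the thread.

```powerquery
let
    // Hypothetical folder holding the converted PDFs; change to your own path
    FolderPath = "C:\Converted",
    Files = Folder.Files(FolderPath),
    // Keep only the PDF files, whatever the case of the extension
    PdfFiles = Table.SelectRows(Files, each Text.Lower([Extension]) = ".pdf"),

    // Same clean-up steps as the single-file query, applied to one file's content
    ExtractRows = (content as binary, fileName as text) as table =>
        let
            Source = Pdf.Tables(content, [Implementation="1.3"]),
            Table002 = Source{[Id="Table002"]}[Data],
            RemovedTopRows = Table.Skip(Table002, 3),
            KeptRows = Table.FirstN(RemovedTopRows, 6),
            Promoted = Table.PromoteHeaders(KeptRows, [PromoteAllScalars=true]),
            // Tag each row with its file name so it can be traced back
            Tagged = Table.AddColumn(Promoted, "Source File", each fileName, type text)
        in
            Tagged,

    // Run the helper once per file, then stack all results into one table
    WithData = Table.AddColumn(PdfFiles, "Data", each ExtractRows([Content], [Name])),
    Combined = Table.Combine(WithData[Data])
in
    Combined
```

If some documents have a different number of rows, the fixed Table.Skip and Table.FirstN counts would need to be replaced with a filter on the row contents. Column types are left untyped here; the "Changed Type1" step from the single-file query could be added after Combined.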

You may want to consider bringing them into Access instead. Access can import text files (but not Word documents). Again, this would require you to convert your documents and write some code to load them all. There is no row limit there, but there is a 2 GB database size limit.

This is the best I can offer. Maybe someone else can provide some alternative solutions. One day for a project like this is very tight.
 