I want to web-scrape several pages from a password-protected site to which I have access. I can navigate with VBA to the first page I want to scrape, but I am stuck from there. My VBA knowledge is basic and my HTML skills are non-existent, so I need help with exactly what to do next. The first page I want to scrape is shown below. I want the headings (class 'blueheadings') and the data (class 'readablerowlink'). If someone can show me how to scrape this page and paste the result into my active workbook, I can handle the other, similar pages on the site myself. Any help is greatly appreciated.
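Dim a As String, url As String
Dim ie As Object
' Start Internet Explorer and open the login page
Set ie = CreateObject("InternetExplorer.Application")
url = "Horse Racing Database Solutions"
With ie
    .Visible = True
    .Navigate url
    ' Wait until the login page has finished loading
    Do Until .ReadyState = 4
        DoEvents
    Loop
    ' Fill in the credentials and submit the login form
    .Document.all.Item("Login").Value = "****"
    .Document.all.Item("Password").Value = "*********"
    .Document.forms(4).submit
    ' Wait for the login to complete before navigating again
    While .Busy Or .ReadyState <> 4: DoEvents: Wend
End With
' Open the report page and wait for it to load
url = "Daily Jockeys Report"
ie.Navigate url
While ie.Busy Or ie.ReadyState <> 4: DoEvents: Wend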
Below is an extract from the page concerned:
<a target=_blank href='helpsection.php#jocktrainreport' title='Jockey Report Help Section'>Understand the Jockey Report.</a>
</td></tr></table></td></tr></table><br><hr><table><tr><th></th>
<th class=blueheadings valign=top colspan=6 title='Stats relate to the last 14 days'>Last 14 Days</th>
<th class=blueheadings valign=top colspan=6 title='Stats relate to the last 2 years'>Last 2 Years</th>
<th class=blueheadings valign=top colspan=2 title='Stats refer to time lapsed since last win'>Since Win</th>
<th class=blueheadings valign=top title='Booked rides for today by track'>TODAY</th>
<th class=blueheadings valign=top title='Runs, wins & places for each of the last seven days'>7 Day Form Figures</th></tr>
<tr><th class=readablerowlink valign=top title='Name of jockey, click to open'><a href='?sortselection=1'>Jockey</a></th>
<th class=readablerowlink valign=top title='Number of rides in last 14 days'><a href='?sortselection=2'>R</a></th>
<th class=readablerowlink valign=top title='Number of winning rides in last 14 days'><a href='?sortselection=3'>W</a></th>
<th class=readablerowlink valign=top title='Number of placed rides in last 14 days'><a href='?sortselection=4'>P</a></th>
<th class=readablerowlink valign=top title='14 day winning strike rate percentage'><a href='?sortselection=5'>W%</a></th>
<th class=readablerowlink valign=top title='14 day place strike rate percentage'><a href='?sortselection=6'>P%</a></th>
<th class=readablerowlink valign=top title='14 day profit/loss when backing at SP'><a href='?sortselection=7'>P/L</a></th>
<th class=readablerowlink valign=top title='Number of rides in last 2 years'><a href='?sortselection=8'>R</a></th>
<th class=readablerowlink valign=top title='Number of winning rides in last 2 years'><a href='?sortselection=9'>W</a></th>
<th class=readablerowlink valign=top title='Number of placed rides in last 2 years'><a href='?sortselection=10'>P</a></th>
<th class=readablerowlink valign=top title='2 year winning strike rate percentage'><a href='?sortselection=11'>W%</a></th>
<th class=readablerowlink valign=top title='2 year place strike rate percentage'><a href='?sortselection=12'>P%</a></th>
<th class=readablerowlink valign=top title='2 year profit/loss when backing at SP'><a href='?sortselection=13'>P/L</a></th>
<th class=readablerowlink valign=top title='The number of days since last rode a winner'><a href='?sortselection=14'>Days</a></th>
<th class=readablerowlink valign=top title='The number of runs since riding a winner'><a href='?sortselection=15'>Runs</a></th>
<th class=readablerowlink valign=top></th>
<th class=readablerowlink valign=top>(4th)(3rd)(2nd)(1st)(31st)(30th)(29th)</th></tr><tr><td valign=top class=readablerowlink><a href='jockeys.php?id=314' target='_blank' title='Allan, David, click to open form page'>Allan, David</a> <a title='FC Odds include 6/1 shot or less' class='texttype926'> $</a></td><td class=readablerow valign=top title='Allan, David has had 65 rides in last 14 days'>65</td>
<td class=readablerow valign=top title='Allan, David has had 10 winning rides in last 14 days'>10</td>
<td class=readablerow valign=top title='Allan, David has had 27 placed rides in last 14 days'>27</td>
<td class=readablerow valign=top title='Allan, David winning strike rate in last 14 days is 15%'>15%</td>
etc. down to row 3000
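For reference, one possible way to get from the loaded page to the worksheet is sketched below. It is only a sketch under some assumptions: the procedure name ScrapeReportTable is made up for illustration, the report is assumed to be the table that contains the first th cell with class 'blueheadings', and the output is assumed to start at cell A1 of the active sheet. It expects the same ie object that the code above has already logged in and navigated.

```vba
' Hypothetical helper: copies every row of the report table to the active sheet.
' Assumes ie is the logged-in InternetExplorer object with the report loaded.
Sub ScrapeReportTable(ie As Object)
    Dim cel As Object, tbl As Object, rw As Object
    Dim r As Long, c As Long

    ' Locate the first header cell with class "blueheadings"
    For Each cel In ie.Document.getElementsByTagName("th")
        If cel.className = "blueheadings" Then
            Set tbl = cel
            Exit For
        End If
    Next cel
    If tbl Is Nothing Then
        MsgBox "No blueheadings cell found; check that the login succeeded."
        Exit Sub
    End If

    ' Walk up from that cell to the table that contains it
    Do While UCase$(tbl.tagName) <> "TABLE"
        Set tbl = tbl.parentNode
    Loop

    ' Copy the text of each cell, row by row, starting at A1 (assumed target)
    Application.ScreenUpdating = False
    r = 0
    For Each rw In tbl.Rows
        r = r + 1
        c = 1
        For Each cel In rw.Cells
            ActiveSheet.Cells(r, c).Value = Trim$(cel.innerText)
            c = c + cel.colSpan   ' skip the columns a spanning heading covers
        Next cel
    Next rw
    Application.ScreenUpdating = True
End Sub
```

It would be called as ScrapeReportTable ie after the final wait loop. Advancing the column by colSpan keeps the group headings ('Last 14 Days' and so on) lined up over the column headings beneath them. Excel may reinterpret some strings, such as percentages, when it stores them, so the target columns might need formatting as text first.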