Web Scrape

mikeymay

Well-known Member
Joined
Jan 17, 2006
Messages
1,633
Office Version
  1. 365
Platform
  1. Windows
I have a routine that scrapes a website but occasionally the web address entered returns a 502 Bad Gateway page

1729503053566.png


How do I get the h1 text to test if I need to refresh the page or not?


TIA
 

Excel Facts

Do you hate GETPIVOTDATA?
Prevent GETPIVOTDATA. Select inside a PivotTable. In the Analyze tab of the ribbon, open the dropown next to Options and turn it off
Can you post the relevant bit of your code, that makes it much easier to answer your question.

If I elaborate a bit on this example: VBA Web Scraping to Excel: The Complete Step-by-Step Guide for Beginners - 33rd Square , you could try a simple Instr to find if a 502 was returned.
VBA Code:
Sub ScrapeWebsite()

Dim http As New MSXML2.XMLHTTP
http.Open "GET", "https://example.com"
http.send

Dim html As String
html = http.responseText

If InStr(html, "<h1>502") > 0 Then
    'Error page, do something..
End If

End Sub
 
Upvote 0
Solution

Forum statistics

Threads
1,223,931
Messages
6,175,465
Members
452,646
Latest member
tudou

We've detected that you are using an adblocker.

We have a great community of people providing Excel help here, but the hosting costs are enormous. You can help keep this site running by allowing ads on MrExcel.com.
Allow Ads at MrExcel

Which adblocker are you using?

Disable AdBlock

Follow these easy steps to disable AdBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the icon in the browser’s toolbar.
2)Click on the "Pause on this site" option.
Go back

Disable AdBlock Plus

Follow these easy steps to disable AdBlock Plus

1)Click on the icon in the browser’s toolbar.
2)Click on the toggle to disable it for "mrexcel.com".
Go back

Disable uBlock Origin

Follow these easy steps to disable uBlock Origin

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back

Disable uBlock

Follow these easy steps to disable uBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back
Back
Top