WinHTTPrequest and MSXML2.DOMDocument60

lzr · Oct 30, 2017

I'm trying to put together a macro that will access a site's API and download files.

First, it accesses the site and searches for a particular document. If it is found, the site returns various information about the document, one item being the 'view_address', which is the url to view or download the document. Below is part of the code. It seems to work down to just before the MSXML2.DOMDocument60 portion. So, the Debug.Print WHttp.responseText returns all the information about my particular document.

Next, I need to get the url link that is at "view_address", as seen in the below snippet of what is returned at WHttp.responseText.
],
"images": [
{
"number": 5294136,
"book": "000692",
"page": "0347",
"view_address": "\/api\/v1\/images?city=NYC&number=5294136&action=view"

I'm trying to load this information so that the portion I want can be extracted, but xmlDoc.LoadXML(SearchResponse) does not work. The Msgbox "Load Error" shows and it appears that xmldoc is blank.

Clearly I'm stumbling around and don't know what I'm doing, but can anyone offer any ideas of what I'm doing wrong?

Thanks in advance.

Code:

WHttp.Open "GET", FullSearchPath, False
'WHttp.setRequestHeader "Authorization", "Basic " & APIToken
WHttp.SetCredentials APIToken, APIToken, HTTPREQUEST_SETCREDENTIALS_FOR_SERVER
WHttp.send

SearchResponse = WHttp.responseText

Debug.Print WHttp.responseText

Dim xmlDoc As New MSXML2.DOMDocument60

    If Not xmlDoc.LoadXML(SearchResponse) Then
         MsgBox "Load Error"
    End If

John_w · Oct 30, 2017

The response looks like JSON, not XML. There are several JSON parsers you could use, but the following code uses string functions to extract the view_address:

Code:

    Dim p1 As Long, p2 As Long, view_address As String
    
    p1 = InStr(WHttp.responseText, """view_address"": ") + Len("""view_address"": ") + 1
    p2 = InStr(p1, WHttp.responseText, Chr(34))    
    view_address = Replace(Mid(WHttp.responseText, p1, p2 - p1), "\", "")
    Debug.Print view_address

lzr · Oct 30, 2017

John_w said:
The response looks like JSON, not XML. There are several JSON parsers you could use, but the following code uses string functions to extract the view_address:

Code:

Dim p1 As Long, p2 As Long, view_address As String p1 = InStr(WHttp.responseText, """view_address"": ") + Len("""view_address"": ") + 1 p2 = InStr(p1, WHttp.responseText, Chr(34)) view_address = Replace(Mid(WHttp.responseText, p1, p2 - p1), "\", "") Debug.Print view_address

Thank you, this works great.

So, then I inserted the following:

Code:

WHttp.Open "GET", BaseAddress & view_address, False
'WHttp.setRequestHeader "Authorization", "Basic " & APIToken
WHttp.SetCredentials APIToken, APIToken, HTTPREQUEST_SETCREDENTIALS_FOR_SERVER
WHttp.send

I believe that the code should be the equivalent of typing the url in a browser, which leaves me looking at a pdf of my document. Now I'd like to download and saveAs the file as "B-P.pdf" to a folder called "Test Folder" on "C:". Any guidance or ideas on how to do that?

And a side question, is the .setcredentials way that I'm passing the API token the best way to handle this? It works, but doesn't look quite right to me. The site simply wants the APIToken and their documentation says Basic authentication. But the way I've entered it here is the only way I've gotten it to work.

Thanks again for your help.

John_w · Oct 31, 2017

Try this to download the file:

Code:

    Dim fileNum As Integer, buffer() As Byte
    fileNum = FreeFile
    Open "C:\Folder\file.pdf" For Binary Access Write As [URL=https://www.mrexcel.com/forum/usertag.php?do=list&action=hash&hash=fileNum]#fileNum[/URL] 
    buffer = WHttp.responseBody
    Put [URL=https://www.mrexcel.com/forum/usertag.php?do=list&action=hash&hash=fileNum]#fileNum[/URL] , , buffer
    Close [URL=https://www.mrexcel.com/forum/usertag.php?do=list&action=hash&hash=fileNum]#fileNum[/URL]

I don't know about the API token because I don't know which API you're using. With basic authentication some sites require the token to be encoded as a base64 string, but it all depends on the API.

lzr · Nov 1, 2017

John_w said:
The response looks like JSON, not XML. There are several JSON parsers you could use, but the following code uses string functions to extract the view_address:

Code:

Dim p1 As Long, p2 As Long, view_address As String p1 = InStr(WHttp.responseText, """view_address"": ") + Len("""view_address"": ") + 1 p2 = InStr(p1, WHttp.responseText, Chr(34)) view_address = Replace(Mid(WHttp.responseText, p1, p2 - p1), "\", "") Debug.Print view_address

Thanks, I think I'm on the home stretch. As it turns out, my results could contain several instances of 'view_address'. Is there a way for the above code to reflect the first 'view_address' and ignore any others?

lzr · Nov 1, 2017

Please ignore #5 , I was wrong in how the API worked. The API actually returns a 'view_address' tag and information for every page in the document. So, I need to cycle through all the 'view_address' tags and info found in Whttp.responseText, then when done, combine them all together in one document and save locally. Or append each new page as it finds more 'view_page' items and then download the result. Any thoughts or ideas?

Below is a rough example of what whttp.responseText might look like.

],
"images": [
{
"number": 5294136,
"book": "000692",
"page": "0347",
"view_address": "\/api\/v1\/images?city=NYC&number=5294136&action=view"

"number": 5294137,
"book": "000692",
"page": "0348",
"view_address": "\/api\/v1\/images?city=NYC&number=5294137&action=view"

John_w · Nov 2, 2017

Try this:

Code:

    Dim p1 As Long, p2 As Long, view_address As String
    
    p1 = 1
    Do
        p1 = InStr(p1, WHttp.responseText, """view_address"": ", vbTextCompare)
        If p1 <> 0 Then
            p1 = p1 + Len("""view_address"": ") + 1
            p2 = InStr(p1, WHttp.responseText, Chr(34))
            view_address = Replace(Mid(WHttp.responseText, p1, p2 - p1), "\", "")
            Debug.Print view_address
            p1 = p2 + 1
        End If
    Loop While p1 <> 0

lzr · Nov 2, 2017

John_w said:

Try this:

Code:

    Dim p1 As Long, p2 As Long, view_address As String
    
    p1 = 1
    Do
        p1 = InStr(p1, WHttp.responseText, """view_address"": ", vbTextCompare)
        If p1 <> 0 Then
            p1 = p1 + Len("""view_address"": ") + 1
            p2 = InStr(p1, WHttp.responseText, Chr(34))
            view_address = Replace(Mid(WHttp.responseText, p1, p2 - p1), "\", "")
            Debug.Print view_address
            p1 = p2 + 1
        End If
    Loop While p1 <> 0

Would it be better to loop through and save all the view_addresses in an array and then use the array to build my document, or find a view_address and add/append it to a file, find the next view_address and append it and so on..?

WinHTTPrequest and MSXML2.DOMDocument60

lzr

Board Regular

John_w

MrExcel MVP

lzr

Board Regular

John_w

MrExcel MVP

lzr

Board Regular

lzr

Board Regular

John_w

MrExcel MVP

lzr

Board Regular

Share this page

WinHTTPrequest and MSXML2.DOMDocument60

Board Regular

MrExcel MVP

Board Regular

MrExcel MVP

Board Regular

Board Regular

MrExcel MVP

Board Regular

Share this page

We've detected that you are using an adblocker.

Which adblocker are you using?

Disable AdBlock

Disable AdBlock Plus

Disable uBlock Origin

Disable uBlock