mauricem72
New Member
- Joined
- Mar 13, 2018
- Messages
- 4
I have a spreadsheet that I'm using to import data from a webpage. I can pull in everything from the page, but I only want the information in a specific node, and I'm having trouble isolating it.
Here's the info
Site: https://leagueofcomicgeeks.com/profile/mauricem72/pull-list
I'm trying to pull the URLs from the links on the page.
Node:
<div class="comic-title"><a href="/comic/9487913/action-comics-999">Action Comics #999</a></div>
<a href="/comic/9487913/action-comics-999">Action Comics #999</a>
I can pull the URLs in using =importXML("https://leagueofcomicgeeks.com/profile/mauricem72/pull-list","//a/@href") but it pulls in all the URLs on the page. How do I filter it down to only the ones that start with "/comic/"? Preferably just the first matching link from each entry?
I tried =importXML("https://leagueofcomicgeeks.com/profile/mauricem72/pull-list","//a/@href[comics]") and it didn't work.
Thanks!
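For anyone who wants to try the filtering idea outside the spreadsheet, here is a minimal sketch in Python. It assumes the page markup looks like the node quoted above; the `/profile/mauricem72` link in the sample is made up to stand in for the unwanted URLs, and `comic_links` is just an illustrative helper name. It restricts the match to anchors inside `<div class="comic-title">`, which also keeps only the first of the two identical links per entry.

```python
import xml.etree.ElementTree as ET

# Sample markup: the comic node is copied from the post; the profile link
# is a hypothetical stand-in for the other links on the page.
SAMPLE = """
<html><body>
  <a href="/profile/mauricem72">Profile</a>
  <div class="comic-title"><a href="/comic/9487913/action-comics-999">Action Comics #999</a></div>
  <a href="/comic/9487913/action-comics-999">Action Comics #999</a>
</body></html>
"""

def comic_links(markup):
    """Return the href of every <a> directly inside <div class="comic-title">."""
    root = ET.fromstring(markup.strip())
    # The predicate [@class='comic-title'] filters the div elements,
    # then /a selects only the anchors that are children of those divs.
    anchors = root.findall(".//div[@class='comic-title']/a")
    return [a.get("href") for a in anchors]

links = comic_links(SAMPLE)
print(links)
```

The same idea written as plain XPath 1.0 would be something like `//div[@class='comic-title']/a/@href`, or `//a[starts-with(@href,'/comic/')]/@href` to filter by the URL prefix instead. Note that in XPath the predicate goes on the element being filtered, not after the attribute, which is probably why `//a/@href[comics]` returned nothing. I have not confirmed either expression against the live page.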