google-sheetsweb-scrapingscripting

Google Sheets IMPORTXML - Scrape Exhibitor List


Screenshot of company names: Trying to scrape company names for my CRM (Google Sheets), but getting Formula Parse Error.

Website: https://exhibitormanual.oceanbusiness.com/

Tried to change child/class as a xpath quary parameter. Tried also IMPORTHTML.

Used code: =importxml ("https://exhibitormanual.oceanbusiness.com/", "//*[@id="drts-platform-render-66fc2684827da-2-post-20696"]/div/div/div[2]/div1/a")

Formula parse Error.


Solution

  • Here's one approach which pulls the 293 company names spread across 2 pages; some unwanted titles(top 11) are pulled as well. You can mess around with the code if you want a more targeted xpath:

    =reduce(tocol(,1),sequence(2),lambda(a,c,vstack(a,importxml("https://exhibitormanual.oceanbusiness.com/?_page="&c&"&num=200&sort=post_title","//*[@title]"))))
    

    enter image description here