pythonselenium-webdrivercss-selectorsselenium-chromedriver

How do I code a CSS Selector to find a specific button, click on it, and wait until the webpage has completed the clicked task?


I have a working web scraper written in Python, Selenium and Chromedriver (all up-to-date version wise) and various other software packages. The target webpage has a field for the phone number, but you have to click a button to reveal it:

<button aria-busy="false" class="sc-adc2db3f-0 cLDdWI sc-608879d0-0 cbUwJX">
 <div class="sc-eb45309b-0 kexGZq">
  <svg aria-hidden="true" color="#373373" fill="none" focusable="false" height="24" viewbox="0 0 24 24" width="24" xmlns="http://www.w3.org/2000/svg">
   <path d="M18.078 14.702c-.502-.6-1.506-.5-1.908.2-.602 1-.903 1.3-1.204 1.3-2.21-.3-6.928-5-7.129-7.2 0-.1 0-.3 1.406-1.2.703-.4.803-1.3.2-1.8-2.61-2.4-2.71-3-3.513-3-1.607 0-3.113 4-2.912 5.7.502 4 8.835 12.3 12.65 12.3 1.707 0 5.322-1.3 5.322-2.8.1-.8-.502-.9-2.912-3.5z" fill="#fff" stroke="currentColor" stroke-linecap="round" stroke-linejoin="round" stroke-miterlimit="10">
   </path>
  </svg>
  <p class="sc-991ea11d-0 hCxFrC">
   +1-XXX-XXX-XXXX
  </p>
 </div>
 <p aria-label="Reveal phone number" class="sc-991ea11d-0 hCxFrC">
 Reveal
 </p>
</button>

Notice that the class names are all gobbledegook and have a tendency to change on me so I am trying to use names that will stay constant. The only thing I can think of is to use the TAG "button" and the ID 'p' with the attribute 'aria-label="Reveal phone number"'. I cannot figure out a CSS_SELECTOR that I can use with these values to do a fluent wait after I make sure that field is on the webpage. Here is the code I have tried. I know this is not enough to test, but I am hoping for some ideas on what else to try:

results = soup.find_all('p',attrs={'aria-label':'Reveal phone number'})
if re.search('Reveal phone number',str(results)):
    elems = ''
    try:
        elems = webdwait.until(EC.presence_of_all_elements_located((By.CSS_SELECTOR,"[aria-label='Reveal phone number']")))
    except Exception as err:
        logging.info(f"ERROR F{flag}-70: Webpage element (PhoneRevealButton) search failed ...")
        logging.info(f"ERROR F{flag}-70: type={type(err)}: {err}")
        print(f"                    WARNING F{flag}-70: Webpage search failed ({type(err)}): trying again ...",flush=True)
        failed = True

    if not failed:
        for elem in elems:
            if re.search('Reveal phone number',str(elem.get_attribute('outerHTML'))):
                driver.execute_script("arguments[0].scrollIntoView(true);",elem)
                try:
                    driver.execute_script("arguments[0].click();",elem)
                except Exception as err:
                    logging.info(f"ERROR F{flag}-71: Webpage element (PhoneRevealButton) search failed ...")
                    logging.info(f"ERROR F{flag}-71: type={type(err)}: {err}")
                    print(f"                    WARNING F{flag}-71: Webpage click failed ({type(err)}): trying again ...",flush=True)
                    failed = True

    if not failed:
        # Wait until these elements are updated
        lcnt = 0
        while True:
            try:
                elems = webdwait.until(EC.presence_of_all_elements_located((By.CSS_SELECTOR,"[aria-label='Reveal phone number']")))
            except Exception as err:
                # Element is gone?
                logging.info(f"INFO F{flag}-72: type={type(err)}: {err}")
                break
            cnt = 0
            for elem in elems:
                if re.search('Reveal phone number',str(elem.get_attribute('outerHTML'))):
                    # Element still exists
                    cnt += 1
            if cnt == 0:
                break
            lcnt += 1
            if lcnt * timeout_wait2 >= timeout_dri:
                print(f"                    ERROR: Waited too long for webpage to update - phone number not retrieved")
                break
            time.sleep(timeout_wait2)

Solution

  • It looks as though you can find the p element by its unique aria-label attribute. Therefore you can find the button element (its parent) like this:

    if (p := soup.select_one("button p[aria-label='Reveal phone number']")) is not None:
        button = p.find_parent()
        print(button)