pythonurlbeautifulsoupyoutube

Get YouTube Playlist urls with python


How I can get the playlist urls stored like

enter image description here

here: https://www.youtube.com/watch?v=VpTRlS7EO6E&list=RDOIhVs0FQ8xc&index=5 with bs4?

Using

from bs4 import BeautifulSoup as bs
import requests

r = requests.get('https://www.youtube.com/watch?v=OIhVs0FQ8xc&list=RDOIhVs0FQ8xc&index=1')
page = r.text
soup=bs(page,'html.parser')
#print(soup)
res=soup.find_all('ytd-playlist-panel-video-renderer')
print(res)

doesn't return anything. Even printing the soup itself doesn't contain the link I'am looking for (like href="/watch?v=puNOG62lf-Y&list=RDOIhVs0FQ8xc&index=2")


Solution

  • It is a javascript rendered page. You have to use selenium.

    from bs4 import BeautifulSoup as bs
    from selenium import webdriver
    from webdriver_manager.chrome import ChromeDriverManager
    import time
    
    url = 'https://www.youtube.com/watch?v=OIhVs0FQ8xc&list=RDOIhVs0FQ8xc&index=1'
    driver = webdriver.Chrome(ChromeDriverManager().install())
    driver.set_window_size(1024, 600)
    driver.maximize_window()
    driver.get(url)
    time.sleep(2)
    
    soup=bs(driver.page_source,'html.parser')
    res=soup.find_all('ytd-playlist-panel-video-renderer')
    print(res)
    

    Install the required package using pip install webdriver-manager