@taka-wang
Forked from korakot/selenium.py
Created October 26, 2021 05:05
Use selenium in Colab
# install chromium, its driver, and selenium
!apt update
!apt install chromium-chromedriver
!pip install selenium
# set options: headless mode, no sandbox (Colab runs as root), and /tmp instead of /dev/shm
from selenium import webdriver
options = webdriver.ChromeOptions()
options.add_argument('--headless')
options.add_argument('--no-sandbox')
options.add_argument('--disable-dev-shm-usage')
# open it, go to a website, and get results
wd = webdriver.Chrome(options=options)
wd.get("https://www.website.com")
print(wd.page_source) # results
# divs = wd.find_elements('css selector', 'div')  # find_elements_by_css_selector was removed in Selenium 4.3
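
Since `wd.page_source` is a plain HTML string, it can also be inspected without going back to the driver. The following is a minimal sketch using only the standard library, not part of selenium or kora; `DivTextCollector`, `div_texts`, and `sample_html` are hypothetical names introduced for illustration, and `sample_html` stands in for a real `wd.page_source`.

```python
from html.parser import HTMLParser


class DivTextCollector(HTMLParser):
    """Collect the text inside each outermost <div> (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.depth = 0    # how many <div> elements are currently open
        self.texts = []   # one entry per outermost <div>
        self._buf = []    # text pieces of the <div> being read

    def handle_starttag(self, tag, attrs):
        if tag == "div":
            if self.depth == 0:
                self._buf = []
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == "div" and self.depth > 0:
            self.depth -= 1
            if self.depth == 0:
                # Join the pieces and normalize whitespace
                self.texts.append(" ".join("".join(self._buf).split()))

    def handle_data(self, data):
        if self.depth > 0:
            self._buf.append(data)


def div_texts(page_source):
    """Return the text of every outermost <div> in an HTML string."""
    parser = DivTextCollector()
    parser.feed(page_source)
    parser.close()
    return parser.texts


# Assumption: this string stands in for wd.page_source
sample_html = (
    "<html><body><div>Hello <span>world</span></div>"
    "<div>Second</div><p>skip</p></body></html>"
)
print(div_texts(sample_html))  # ['Hello world', 'Second']
```

With a live driver, `div_texts(wd.page_source)` would be the equivalent call. For anything beyond simple text extraction, the driver's own element lookup is likely the better tool.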
# I created my own library, kora, to make this even easier
!pip install kora -q
from kora.selenium import wd
wd.get("https://www.website.com")
print(wd.page_source) # results
# kora adds a few helpers
divs = wd.select("div") # CSS selector
div = divs[0]
span = div.select1("span") # returns the first match
wd # as the last line of a cell, shows a screenshot
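
Neither snippet above closes the browser when it is done. In a long Colab session it may be worth calling the driver's `quit()` method so Chromium processes do not pile up. The following is a minimal sketch of one way to do that; `managed_driver` and `FakeDriver` are hypothetical names, and `FakeDriver` is only a stand-in so the example runs without Chrome installed.

```python
from contextlib import contextmanager


@contextmanager
def managed_driver(factory):
    """Yield a driver built by `factory` and always call quit() afterwards."""
    driver = factory()
    try:
        yield driver
    finally:
        driver.quit()


class FakeDriver:
    """Stand-in for a real WebDriver; mimics only get, page_source, and quit."""

    def __init__(self):
        self.page_source = ""
        self.closed = False

    def get(self, url):
        self.page_source = f"<html><body>{url}</body></html>"

    def quit(self):
        self.closed = True


with managed_driver(FakeDriver) as wd_demo:
    wd_demo.get("https://www.website.com")
    html = wd_demo.page_source

print(wd_demo.closed)  # True
```

With the real driver, the factory would be something like `lambda: webdriver.Chrome(options=options)`, assuming `options` is built as in the first snippet.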