Skip to content

Instantly share code, notes, and snippets.

@nniiicc
Last active May 25, 2022 00:15
Show Gist options
  • Save nniiicc/a079b76edb6d44a34a72722d8e77833c to your computer and use it in GitHub Desktop.
Save nniiicc/a079b76edb6d44a34a72722d8e77833c to your computer and use it in GitHub Desktop.
from selenium import webdriver
driver.get(https://weblink.co.thurston.wa.us/DCCASES/CustomSearch.aspx?SearchName=CivilCases) #this retrieves page
search_by_case = driver.find_element_by_id("CivilCases_Input0").send_keys(CASE%20NUMBER) #this inputs case number to search interface
submit = driver.find_element_by_id("CivilCases_Button2']").click() #this clicks 'submit' and makes laserfiche go vroom
#on next page
Docuemnt_title = driver.find_element_by_class_name('DocumentTitle') #this is where the document title element appears from the java call
Extract id from Document_Title where the HREF element = DocView.aspx?id=14375719&searchid=3152c8ac-cfa7-44b8-a5a3-bca6a23d190c&dbid=0 #this is pseudo code about where to find the id in the returned value
###
To finish this
- write a statement to loop through the `search_by_case` function with each case number
- Somehow use selenium to connect pages
- write function to extract and publish document ids (this is so we have a list of ids and case numbers)
- Retrieve each pdf by inserting id -> https://weblink.co.thurston.wa.us/DCCASES/DocView.aspx?id=""&openpdf=true
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment