Skip to content

Instantly share code, notes, and snippets.

@cpsievert
Created November 20, 2014 19:45
Show Gist options
  • Save cpsievert/a1435d95acb00088f4a2 to your computer and use it in GitHub Desktop.
Save cpsievert/a1435d95acb00088f4a2 to your computer and use it in GitHub Desktop.
library(XML)
library(RSelenium)
pJS <- phantom()
Sys.sleep(5) # give the binary a moment
remDr <- remoteDriver(browserName = 'phantomjs')
remDr$open(silent = TRUE)
remDr$navigate("http://www.cbssports.com/collegebasketball/gametracker/live/NCAAB_20140407_UK@UCONN")
src <- remDr$getPageSource()[[1]]
doc <- htmlParse(src, asText = TRUE)
nodes <- getNodeSet(doc, path = "//div[@class='court']")
kids <- sapply(nodes, xmlChildren)
recs <- sapply(kids, xmlAttrs)
head(t(recs))
pJS$stop()
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment