Learning how to web scrape empowers you to apply your XQuery skills to any data residing on the web. You can fetch data from remote sites and services—for example, entire web pages or just the pieces of a page that matter to you. Once fetched, you can perform further analysis on the data, clean it up, mash it up with other data, transform it into different formats, etc.
XPath-based languages like XQuery offer an standard function for accessing remote documents, the fn:doc()
function.
However, a limitation of this function is that it only works if the URI returns a well-formed XML document.