Skip to content

Instantly share code, notes, and snippets.

@naranjja
Created November 21, 2018 17:46
Show Gist options
  • Save naranjja/6026eb7104ca3d0731bd33b8f7ade24e to your computer and use it in GitHub Desktop.
Save naranjja/6026eb7104ca3d0731bd33b8f7ade24e to your computer and use it in GitHub Desktop.
Get body HTML from HTML string using LXML
# pip install lxml cssselect
import lxml.html
from lxml import etree
html = """
<html>
<body>
<p>Hello world</p>
</body>
</html>
"""
tree = lxml.html.fromstring(html)
body_html = etree.tostring(tree.cssselect("body")[0]).decode("utf-8")
print(body_html)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment