Skip to content

Instantly share code, notes, and snippets.

@jtemporal
Created January 30, 2017 16:43
Show Gist options
  • Save jtemporal/cb85aab543da82cc8888b1933cfddfa6 to your computer and use it in GitHub Desktop.
Save jtemporal/cb85aab543da82cc8888b1933cfddfa6 to your computer and use it in GitHub Desktop.
parsing error
root@rosie-staging-16gb:~/rosie# docker run --rm -v /tmp/serenata-data:/tmp/serenata-data rosie
2017-01-30 16:16:03 Creating the CSV file
2017-01-30 16:16:03 Reading the XML file
2017-01-30 16:16:04 Writing record #3,200 to the CSV
2017-01-30 16:16:04 Done!
2017-01-30 16:16:04 Creating the CSV file
2017-01-30 16:16:04 Reading the XML file
2017-01-30 16:18:07 Writing record #342,225 to the CSV
2017-01-30 16:18:07 Done!
2017-01-30 16:18:07 Creating the CSV file
2017-01-30 16:18:08 Reading the XML file
Traceback (most recent call last): #1,285,075 to the CSV
File "rosie.py", line 36, in <module>
command()
File "rosie.py", line 23, in run
rosie.main(target_directory)
File "/rosie/__init__.py", line 64, in main
dataset = Dataset(target_directory).get()
File "/rosie/dataset.py", line 16, in get
self.update_datasets()
File "/rosie/dataset.py", line 28, in update_datasets
ceap.convert_to_csv()
File "/usr/local/lib/python3.5/site-packages/serenata_toolbox/ceap_dataset.py", line 36, in convert_to_csv
convert_xml_to_csv(xml_path, csv_path)
File "/usr/local/lib/python3.5/site-packages/serenata_toolbox/xml2csv.py", line 70, in convert_xml_to_csv
for json_io in xml_parser(xml_file_path):
File "/usr/local/lib/python3.5/site-packages/serenata_toolbox/xml2csv.py", line 23, in xml_parser
for _, node in iterparse(xml_path, tag=tag):
File "src/lxml/iterparse.pxi", line 208, in lxml.etree.iterparse.__next__ (src/lxml/lxml.etree.c:148582)
File "src/lxml/iterparse.pxi", line 193, in lxml.etree.iterparse.__next__ (src/lxml/lxml.etree.c:148280)
File "src/lxml/iterparse.pxi", line 224, in lxml.etree.iterparse._read_more_events (src/lxml/lxml.etree.c:148818)
File "src/lxml/parser.pxi", line 1374, in lxml.etree._FeedParser.close (src/lxml/lxml.etree.c:114116)
File "src/lxml/parser.pxi", line 586, in lxml.etree._ParserContext._handleParseResult (src/lxml/lxml.etree.c:104990)
File "src/lxml/parser.pxi", line 595, in lxml.etree._ParserContext._handleParseResultDoc (src/lxml/lxml.etree.c:105109)
File "src/lxml/parser.pxi", line 706, in lxml.etree._handleParseResult (src/lxml/lxml.etree.c:106817)
File "src/lxml/parser.pxi", line 635, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:105671)
File "/tmp/serenata-data/AnosAnteriores.xml", line 2
lxml.etree.XMLSyntaxError: expected '>', line 2, column 1334886371
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment