Apache Tika extraction and Grobid extraction on https://www.overleaf.com/latex/examples/simple-draft-manuscript-template-with-line-numbers/wnbtffygpkwz#.WrEys9Yh30o
'<?xml version="1.0" encoding="UTF-8"?>\n<TEI xmlns="http://www.tei-c.org/ns/1.0" \nxmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" \nxsi:schemaLocation="http://www.tei-c.org/ns/1.0 /opt/grobid/grobid-home/schemas/xsd/Grobid.xsd"\n xmlns:xlink="http://www.w3.org/1999/xlink">\n\t<teiHeader xml:lang="fr">\n\t\t<encodingDesc>\n\t\t\t<appInfo>\n\t\t\t\t<application version="0.5.1-SNAPSHOT" ident="GROBID" when="2018-03-20T16:03+0000">\n\t\t\t\t\t<ref target="https://github.com/kermitt2/grobid">GROBID - A machine learning software for extracting information from scholarly documents</ref>\n\t\t\t\t</application>\n\t\t\t</appInfo>\n\t\t</encodingDesc>\n\t\t<fileDesc>\n\t\t\t<titleStmt>\n\t\t\t\t<title level="a" type="main">Manuscript Title Author Name 1 2</title>\n\t\t\t</titleStmt>\n\t\t\t<publicationStmt>\n\t\t\t\t<publisher/>\n\t\t\t\t<availability status="unknown"><licence/></availability>\n\t\t\t</publicationStmt>\n\t\t\t<sourceDesc>\n\t\t\t\t<biblStruct>\n\t\t\t\t\t<analytic>\n\t\t\t\t\t\t<title level="a" type="main">Manuscript Title Author Name 1 2</title>\n\t\t\t\t\t</analytic>\n\t\t\t\t\t<monogr>\n\t\t\t\t\t\t<imprint>\n\t\t\t\t\t\t\t<date/>\n\t\t\t\t\t\t</imprint>\n\t\t\t\t\t</monogr>\n\t\t\t\t\t<note>1</note>\n\t\t\t\t</biblStruct>\n\t\t\t</sourceDesc>\n\t\t</fileDesc>\n\t\t<profileDesc>\n\t\t\t<abstract/>\n\t\t</profileDesc>\n\t</teiHeader>\n\t<text xml:lang="fr">\n\t\t<body>\n<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figures 46 (insert any figures here) 47</figDesc></figure>\n<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0" validated="false"><head>nibh sagittis donec porttitor sagittis orci cras tempor parturient purus quisque risus</head><label></label><figDesc>nullam non vulputate turpis quis non vitae id sit eget massa commodo etiam pharetra magna 7 ante tincidunt eget ullamcorper dui a sed adipiscing nec nec euismod sit ante mus mattis 8 mollis cursus in placerat 
lacus odio fusce libero non fringilla morbi sem massa at libero sed 9 justo faucibus tellus amet vivamus vestibulum vestibulum mi proin libero vel scelerisque 10 amet nam sem ornare pellentesque ac vestibulum nonummy orci nulla id ligula sit porttitor 11 sed quis ut posuere amet pellentesque morbi at tincidunt vel elementum neque netus habitant 12 ridiculus tempus vel elit et sollicitudin lacinia at id lobortis ac non mollis nec sapien nisl 13 dictum ipsum sociosqu pede eu sit semper dapibus in pede habitant neque vestibulum nullam 14 vehicula sociis posuere aliquam ac ac enim sed suspendisse ac nunc integer ut lectus enim 15 suspendisse lorem ut pellentesque purus sapien sed parturient morbi numquam nibh vitae 16 morbi ut ut elit et lorem ligula torquent nec morbi nunc nisl libero sapien maecenas quis 17 dui pretium mollis quam sint pellentesque at voluptate nunc neque in rhoncus tincidunt 18 ipsum neque odio at iaculis iaculis maecenas felis tortor velit et rhoncus quis eleifend porta 19 quisque vestibulum sagittis semper enim viverra scelerisque praesent ea praesent quis nec 20 eget posuere elit nunc metus varius justo sit aliquet fermentum mattis risus quis ante risus 21 a quis mauris blandit orci urna arcu pellentesque suspendisse in magna dolor turpis sed etLorem ipsum dolor sit amet, elementum pharetra dui volutpat ligula, risus et repellat penati- 26 bus, eget sapien viverra eget, sed fringilla sed arcu ut. Morbi mollis et accumsan, ullamcorper 27 quis euismod sed ipsum, lorem nam iaculis faucibus, et donec diam eleifend convallis. Lacus 28 non augue sed mollis, purus nunc faucibus eros. Nibh eleifend nullam placerat, magna quam 29 tellus pellentesque (Aamport, 1986a), convallis amet amet libero vestibulum, integer biben- 30 dum maecenas sodales eget. Vivamus tellus in quisque felis, ac ipsum non neque. Nonummy 31 potenti id aliquet, phasellus per a pellentesque nisl, feugiat convallis non commodo. 
Integer 32 aut auctor donec, iaculis elit nunc amet, in sed mi interdum, tortor etiam lacus rutrum 33 penatibus. Eget montes metus velit commodo, sed quisque cras libero quisque, aliquet ac- 34 cumsan mauris nunc, libero ullamcorper pretium vehicula amet. Viverra sit non tristique, 35 sollicitudin at massa magna vel. Lacus pellentesque lectus suscipit suspendisse, lectus rhon- 36 cus vel et dolor. Tempor nec sed viverra amet, ac mattis dolor enim, metus ipsum sapien 37 cras (Aamport, 1986b). 38 Vestibulum nec faucibus facilisis aliquet, luctus adipiscing cursus lectus. Risus pede lacus 39 accumsan, lacus aliquam tincidunt integer. Lorem pellentesque dolor fringilla mollis, cras id 40 justo praesent vestibulum, at duis aptent dapibus eros. In et nisl integer, et primis vel quis, 41 sem elementum suspendisse mi. Sit etiam laoreet viverra quam, ac urna fermentum massa 42 congue, lectus non ultricies vehicula conubia, sem quam dui nunc sem.</figDesc><table>1 Author Affiliation \n\n3 \n\nFebruary 3, 2016 \n\n4 \n\n1 \n\nAbstract \n\n5 \n\nut vel 6 \n\n22 \n\nvivamus nunc fringilla dictum eu magna massa \n\n23 1 Section 1 \n\n24 \n\n1.1 Subsection 1.1 \n\n25 \n\n</table></figure>\n\t\t</body>\n\t\t<back>\n\t\t\t<div type="references">\n\n\t\t\t\t<listBibl>\n\n<biblStruct xml:id="b0">\n\t<analytic>\n\t\t<title level="a" type="main">The gnats and gnus document preparation system. 
G-Animal's 49</title>\n\t\t<author>\n\t\t\t<persName xmlns="http://www.tei-c.org/ns/1.0"><forename type="first">L</forename><forename type="middle">A</forename><surname>Aamport</surname></persName>\n\t\t</author>\n\t</analytic>\n\t<monogr>\n\t\t<title level="j">Journal</title>\n\t\t<imprint>\n\t\t\t<biblScope unit="volume">41</biblScope>\n\t\t\t<biblScope unit="issue">7</biblScope>\n\t\t\t<date type="published" when="1986" />\n\t\t</imprint>\n\t</monogr>\n\t<note>this is a full ARTICLE entry</note>\n</biblStruct>\n\n<biblStruct xml:id="b1">\n\t<monogr>\n\t\t<title level="m" type="main">The gnats and gnus document preparation system. G-Animal's 51 Journal</title>\n\t\t<author>\n\t\t\t<persName xmlns="http://www.tei-c.org/ns/1.0"><forename type="first">L</forename><forename type="middle">A</forename><surname>Aamport</surname></persName>\n\t\t</author>\n\t\t<imprint>\n\t\t\t<date type="published" when="1986" />\n\t\t</imprint>\n\t</monogr>\n</biblStruct>\n\n\t\t\t\t</listBibl>\n\t\t\t</div>\n\t\t</back>\n\t</text>\n</TEI>\n'
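The Grobid result above is a TEI XML document held in a Python string literal. As a minimal sketch of how such a string could be inspected, the snippet below pulls the main title and the reference entries out of a TEI document using only the standard library. The function name `summarize_tei` and the abridged `SAMPLE` document are illustrative assumptions, not part of the original gist; the sample keeps only a few elements copied from the output above.

```python
import xml.etree.ElementTree as ET

# TEI namespace as declared on the root element of the Grobid output.
TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}
# ElementTree exposes xml:id under the reserved XML namespace.
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"


def summarize_tei(tei_xml: str) -> dict:
    """Return the main title and (id, title, year) for each biblStruct."""
    # Parse from bytes so the encoding declaration in the prolog is honoured.
    root = ET.fromstring(tei_xml.encode("utf-8"))
    title = root.findtext(
        ".//tei:titleStmt/tei:title", default="", namespaces=TEI_NS
    )
    references = []
    for bibl in root.iterfind(".//tei:listBibl/tei:biblStruct", TEI_NS):
        ref_title = bibl.findtext(
            ".//tei:title[@type='main']", default="", namespaces=TEI_NS
        )
        date = bibl.find(".//tei:date[@when]", TEI_NS)
        year = date.get("when") if date is not None else None
        references.append((bibl.get(XML_ID), ref_title, year))
    return {"title": title, "references": references}


# Hypothetical, abridged sample: a few elements taken from the output above.
SAMPLE = """<?xml version="1.0" encoding="UTF-8"?>
<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader><fileDesc><titleStmt>
    <title level="a" type="main">Manuscript Title Author Name 1 2</title>
  </titleStmt></fileDesc></teiHeader>
  <text><back><div type="references"><listBibl>
    <biblStruct xml:id="b0">
      <analytic><title level="a" type="main">The gnats and gnus document preparation system. G-Animal's 49</title></analytic>
      <monogr><title level="j">Journal</title><imprint><date type="published" when="1986" /></imprint></monogr>
    </biblStruct>
  </listBibl></div></back></text>
</TEI>"""

if __name__ == "__main__":
    summary = summarize_tei(SAMPLE)
    print(summary["title"])
    for ref in summary["references"]:
        print(ref)
```

Running this on the sample shows the same symptom visible in the full output: the manuscript's margin line numbers ("1 2", "49") end up inside the extracted title fields.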
Manuscript Title1
Author Name12
1Author Affiliation3
February 3, 20164
1
Abstract5
ut vel nibh sagittis donec porttitor sagittis orci cras tempor parturient purus quisque risus6
nullam non vulputate turpis quis non vitae id sit eget massa commodo etiam pharetra magna7
ante tincidunt eget ullamcorper dui a sed adipiscing nec nec euismod sit ante mus mattis8
mollis cursus in placerat lacus odio fusce libero non fringilla morbi sem massa at libero sed9
justo faucibus tellus amet vivamus vestibulum vestibulum mi proin libero vel scelerisque10
amet nam sem ornare pellentesque ac vestibulum nonummy orci nulla id ligula sit porttitor11
sed quis ut posuere amet pellentesque morbi at tincidunt vel elementum neque netus habitant12
ridiculus tempus vel elit et sollicitudin lacinia at id lobortis ac non mollis nec sapien nisl13
dictum ipsum sociosqu pede eu sit semper dapibus in pede habitant neque vestibulum nullam14
vehicula sociis posuere aliquam ac ac enim sed suspendisse ac nunc integer ut lectus enim15
suspendisse lorem ut pellentesque purus sapien sed parturient morbi numquam nibh vitae16
morbi ut ut elit et lorem ligula torquent nec morbi nunc nisl libero sapien maecenas quis17
dui pretium mollis quam sint pellentesque at voluptate nunc neque in rhoncus tincidunt18
ipsum neque odio at iaculis iaculis maecenas felis tortor velit et rhoncus quis eleifend porta19
quisque vestibulum sagittis semper enim viverra scelerisque praesent ea praesent quis nec20
eget posuere elit nunc metus varius justo sit aliquet fermentum mattis risus quis ante risus21
a quis mauris blandit orci urna arcu pellentesque suspendisse in magna dolor turpis sed et22
vivamus nunc fringilla dictum eu magna massa23
2
1 Section 124
1.1 Subsection 1.125
Lorem ipsum dolor sit amet, elementum pharetra dui volutpat ligula, risus et repellat penati-26
bus, eget sapien viverra eget, sed fringilla sed arcu ut. Morbi mollis et accumsan, ullamcorper27
quis euismod sed ipsum, lorem nam iaculis faucibus, et donec diam eleifend convallis. Lacus28
non augue sed mollis, purus nunc faucibus eros. Nibh eleifend nullam placerat, magna quam29
tellus pellentesque (Aamport, 1986a), convallis amet amet libero vestibulum, integer biben-30
dum maecenas sodales eget. Vivamus tellus in quisque felis, ac ipsum non neque. Nonummy31
potenti id aliquet, phasellus per a pellentesque nisl, feugiat convallis non commodo. Integer32
aut auctor donec, iaculis elit nunc amet, in sed mi interdum, tortor etiam lacus rutrum33
penatibus. Eget montes metus velit commodo, sed quisque cras libero quisque, aliquet ac-34
cumsan mauris nunc, libero ullamcorper pretium vehicula amet. Viverra sit non tristique,35
sollicitudin at massa magna vel. Lacus pellentesque lectus suscipit suspendisse, lectus rhon-36
cus vel et dolor. Tempor nec sed viverra amet, ac mattis dolor enim, metus ipsum sapien37
cras (Aamport, 1986b).38
Vestibulum nec faucibus facilisis aliquet, luctus adipiscing cursus lectus. Risus pede lacus39
accumsan, lacus aliquam tincidunt integer. Lorem pellentesque dolor fringilla mollis, cras id40
justo praesent vestibulum, at duis aptent dapibus eros. In et nisl integer, et primis vel quis,41
sem elementum suspendisse mi. Sit etiam laoreet viverra quam, ac urna fermentum massa42
congue, lectus non ultricies vehicula conubia, sem quam dui nunc sem.43
3
Tables44
(insert any tables here)45
Figures46
(insert any figures here)47
4
Figure 1: This is a caption for your figure. Here, have some fish.
5
References48
Aamport, L. A., 1986a: The gnats and gnus document preparation system. G-Animal’s49
Journal, 41 (7), 73+, this is a full ARTICLE entry.50
Aamport, L. A., 1986b: The gnats and gnus document preparation system. G-Animal’s51
Journal.52
6
Section 1
Subsection 1.1