Skip to content

Instantly share code, notes, and snippets.

View drvenabili's full-sized avatar

Simon Hengchen drvenabili

View GitHub Profile
import numpy as np
import matplotlib.pyplot as plt
valeurs = {"p1":[0.4, 0.55, 0.05, 0.0], "p2":[0.2, 0.3, 0.5, 0.0], "p3":[0.4, 0.2, 0.2, 0.2], "p4":[0.2, 0.2, 0.2, 0.4], "p5":[0.4, 0.55, 0.05, 0.0], "p6":[0.4, 0.55, 0.05, 0.0], "p7":[0.4, 0.55, 0.05, 0.0]}
colours = ['b','g','r','c','m','y','k']
valeurs2 = dict()
for key in valeurs.keys():
#print(key)
2017-09-18 15:14:46,932 INFO - Processing file '/home/sigmund/work/hartlib/OneDrive/lemmatization/cleaned_EN_input/9D_17_55_cleaned.xml' .
org.jdom2.input.JDOMParseException: Error on line 1: The reference to entity "c" must end with the ';' delimiter.
at org.jdom2.input.sax.SAXBuilderEngine.build(SAXBuilderEngine.java:232)
at org.jdom2.input.sax.SAXBuilderEngine.build(SAXBuilderEngine.java:303)
at org.jdom2.input.SAXBuilder.build(SAXBuilder.java:1196)
at edu.northwestern.at.morphadorner.corpuslinguistics.inputter.XMLTextInputter.doLoadText(XMLTextInputter.java:328)
at edu.northwestern.at.morphadorner.corpuslinguistics.inputter.XMLTextInputter.loadText(XMLTextInputter.java:415)
at edu.northwestern.at.morphadorner.MorphAdorner.adornXML(MorphAdorner.java:817)
at edu.northwestern.at.morphadorner.MorphAdorner.processInputFiles(MorphAdorner.java:718)
at edu.northwestern.at.morphadorner.MorphAdorner.main(MorphAdorner.java:2610)
\begin{figure}[H]
\centering
\caption{\todo{écrire une caption}}
\label{fig:distrib3}
\begin{tikzpicture}[scale=1]
\begin{axis}[
area style,
xtick=data,
tick label style={font=\small},
xticklabel interval boundaries,
f = open("fichier","r")
f = readlines()
x = 0
sample = open("sample.txt","w")
for line in f:
if x%20 == 0:
sample.write(line+"\n")
x = x+1
Warning: JAVA_HOME environment variable is not set.
[INFO] Scanning for projects...
[INFO]
[INFO] ------------------------------------------------------------------------
[INFO] Building Mallet Topic-Modeling-Tool GUI 0.99-SNAPSHOT
[INFO] ------------------------------------------------------------------------
[INFO]
[INFO] --- maven-resources-plugin:2.6:copy-resources (copy-resources) @ TopicModelingTool ---
[INFO] Using 'UTF-8' encoding to copy filtered resources.
[INFO] Copying 1 resource
vagrant@lamachine:~$ sudo lamachine-update.sh ticcl
=====================================================================
, LaMachine - NLP Software distribution
~) (http://proycon.github.io/LaMachine)
(----í Language Machines research group
/| |\ & Centre of Language and Speech Technology
/ / /| Radboud University Nijmegen
=====================================================================
Bootstrapping Virtual Machine or Docker image....
vagrant@lamachine:~/TICCL$ sudo perl TICCLops.PICCL.pl TICCL.Black.config
TICCL_OPTSin: abcmdef TXT /home/vagrant/TICCL/ticclops /home/vagrant/TICCL/data/int/nld/nld.aspell.dict.c20.d2.confusion empty.txt xml 100000000 /home/vagrant/TICCL/data/int/nld/nld.aspell.dict.lc.chars /home/vagrant/vooruit_preprocessed /home/vagrant/TICCL/data/int/nld/nuTICCL.OldandINLlexandINLNamesAspell.v2.COL1.tsv 2 /home/vagrant/OUT TESTTWO 3 nld /usr/bin/ 30 5 50
TICCL_OPTSin2: MODE: abcmdef TEXTTYPE: TXT ROOTDIR: /home/vagrant/TICCL/ticclops CHARCONFUS: /home/vagrant/TICCL/data/int/nld/nld.aspell.dict.c20.d2.confusion KHC: empty.txt EXT: xml$ ARTIFRQ: 100000000 ALPH: /home/vagrant/TICCL/data/int/nld/nld.aspell.dict.lc.chars INPUTDIR: /home/vagrant/vooruit_preprocessed DIR: LEX: /home/vagrant/TICCL/data/int/nld/nuTICCL.OldandINLlexandINLNamesAspell.v2.COL1.tsv LD: 2 OUTPUTDIR: /home/vagrant/OUT PREFIX: TESTTWO RANK: 3 LANG: nld TOOLDIR: /usr/bin/ THREADS: 30 MINLENGTH: 5 MAXLENGTH: 50
OUT1:
OUT2: /home/vagrant/OUT/zzz/TICCL/TES
sigmund@debian:~/git/TICCL$ perl TICCLops.PICCL.pl TICCL.Black.config
TICCL_OPTSin: abcmdef TXT /home/sigmund/git/TICCL/ticclops /home/sigmund/git/TICCL/data/int/nld/nld.aspell.dict.c20.d2.confusion empty.txt xml 100000000 /home/sigmund/git/TICCL/data/int/nld/nld.aspell.dict.lc.chars /home/sigmund/Desktop/Vooruit/vooruit_preprocessed /home/sigmund/git/TICCL/data/int/nld/nuTICCL.OldandINLlexandINLNamesAspell.v2.COL1.tsv 2 /home/sigmund/Desktop/Vooruit/OUT TESTTWO 3 nld /usr/local/bin 30 5 50
TICCL_OPTSin2: MODE: abcmdef TEXTTYPE: TXT ROOTDIR: /home/sigmund/git/TICCL/ticclops CHARCONFUS: /home/sigmund/git/TICCL/data/int/nld/nld.aspell.dict.c20.d2.confusion KHC: empty.txt EXT: xml$ ARTIFRQ: 100000000 ALPH: /home/sigmund/git/TICCL/data/int/nld/nld.aspell.dict.lc.chars INPUTDIR: /home/sigmund/Desktop/Vooruit/vooruit_preprocessed DIR: LEX: /home/sigmund/git/TICCL/data/int/nld/nuTICCL.OldandINLlexandINLNamesAspell.v2.COL1.tsv LD: 2 OUTPUTDIR: /home/sigmund/Desktop/Vooruit/OUT PREFIX: TESTTWO RANK: 3 LANG: nld TOOLD
-a abcmdef ## $mode : The mode specifies which submodules will be run.
-b TXT ##$texttype : What is the type of your input files? Are these IM : images, PDF : images in PDF files, TXT : plain text files, XML : an XML format, FOLIA : FoLiA XML format, TSV : a frequency file (word type -tab - frequency)
-z /home/sigmund/git/TICCL/ticclops/ ## $ROOTDIR : The directory where your version of the TICCL system files are located.
-c /home/sigmund/git/TICCL/data/int/nld/nld.aspell.dict.c20.d2.confusion ## $charconfus : A file listing the particular character confusions the system will gather word pairs for.
-d empty.txt ## $KHC Specify the name of the Known Historical Confusions file (if you have one). If not, create an empty file in TICCL's root directory.
-e xml ## $ext : The extension ending your input file names. Can be single, e.g. '.xml' or double '.folia.xml'.
-f 100000000 ## $artifrq : The artificial frequency. Should be higher than the highest word frequency in your input files frequency list. Typically set
sigmund@debian:~/git/TICCL$ perl TICCLops.PICCL.pl TICCL.Black.config
TICCL_OPTSin: abcmdef TXT /home/sigmund/git/TICCL/ticclops/ /home/sigmund/git/TICCL/data/int/nld/nld.aspell.dict.c20.d2.confusion empty.txt xml 100000000 /home/sigmund/git/TICCL/data/int/nld/nld.aspell.dict.lc.chars /home/sigmund/Desktop/Vooruit/vooruit_preprocessed /home/sigmund/git/TICCL/data/int/nld/nuTICCL.OldandINLlexandINLNamesAspell.v2.COL1.tsv 2 /home/sigmund/Desktop/Vooruit/OUT TESTTWO 3 nld /usr/local/bin 30 5 50
TICCL_OPTSin2: MODE: abcmdef TEXTTYPE: TXT ROOTDIR: /home/sigmund/git/TICCL/ticclops/ CHARCONFUS: /home/sigmund/git/TICCL/data/int/nld/nld.aspell.dict.c20.d2.confusion KHC: empty.txt EXT: xml$ ARTIFRQ: 100000000 ALPH: /home/sigmund/git/TICCL/data/int/nld/nld.aspell.dict.lc.chars INPUTDIR: /home/sigmund/Desktop/Vooruit/vooruit_preprocessed DIR: LEX: /home/sigmund/git/TICCL/data/int/nld/nuTICCL.OldandINLlexandINLNamesAspell.v2.COL1.tsv LD: 2 OUTPUTDIR: /home/sigmund/Desktop/Vooruit/OUT PREFIX: TESTTWO RANK: 3 LANG: nld TOO