Skip to content

Instantly share code, notes, and snippets.

@rotzbouw
Created November 20, 2013 20:25
Show Gist options
  • Select an option

  • Save rotzbouw/7570380 to your computer and use it in GitHub Desktop.

Select an option

Save rotzbouw/7570380 to your computer and use it in GitHub Desktop.
postgresql table information & log of 3 consecutive runs on same file with different settings.
aida=# \d entity_keywords
Table "public.entity_keywords"
Column | Type | Modifiers
---------+------------------+-------------
entity | integer |
keyword | integer |
count | integer |
tf | integer |
weight | double precision | default 0.0
Indexes:
"entity_keyword_counts_eindex" btree (entity)
"entity_keyword_counts_ekindex" btree (entity, keyword, count)
aida=# \d entity_keyphrases
Table "public.entity_keyphrases"
Column | Type | Modifiers
-------------------------+------------------------+-------------
entity | integer |
keyphrase | integer |
keyphrase_tokens | integer[] |
keyphrase_token_weights | double precision[] |
source | character varying(100) |
count | integer | default 1
weight | double precision | default 0.0
Indexes:
"entity_keyphrase_eindex" btree (entity)
"entity_keyphrase_ekindex" btree (entity, keyphrase)
Wed Nov 20 19:06:59 CET 2013 Start
---> Disambiguating <---
Setting maxEntityKeyphraseCount to 1000
Set to 1000
Nov 20, 2013 7:06:59 PM elkfed.knowledge.yago.wsd.AIDA main
INFO: Loading a2e_0030_part_00.mmax [1]
Disambiguating text with 423 tokens ...
100 [main] WARN mpi.aida.AidaManager - Could not read cache file, reading from DB.
java.io.FileNotFoundException: database_aida.cache (No such file or directory)
at java.io.FileInputStream.open(Native Method)
at java.io.FileInputStream.<init>(FileInputStream.java:137)
at mpi.aida.AidaManager.createAndLoadCache(AidaManager.java:164)
at mpi.aida.AidaManager.initWordExpansion(AidaManager.java:144)
at mpi.aida.AidaManager.<init>(AidaManager.java:542)
at mpi.aida.AidaManager.getTasksInstance(AidaManager.java:136)
at mpi.aida.AidaManager.prepareInputData(AidaManager.java:211)
at mpi.aida.Preparator.prepare(Preparator.java:65)
at mpi.aida.Preparator.prepare(Preparator.java:37)
at elkfed.knowledge.yago.wsd.AIDA.main(AIDA.java:86)
120 [main] INFO mpi.aida.access.DataAccessSQL - Reading word expansions.
240 [main] INFO mpi.aida.AidaManager - Connecting to postgres database goeckety@localhost:5432/aida
25814 [main] INFO mpi.aida.AidaManager - Done loading word_expansions.
25818 [main] INFO mpi.tokenizer.data.TokenizerManager - Loading Tokenizer (ssplit, tokenize, pos, ner)
Adding annotator tokenize
Adding annotator ssplit
Adding annotator pos
Loading default properties from tagger edu/stanford/nlp/models/pos-tagger/english-left3words/english-left3words-distsim.tagger
Reading POS tagger model from edu/stanford/nlp/models/pos-tagger/english-left3words/english-left3words-distsim.tagger ... done [1.6 sec].
Adding annotator lemma
Adding annotator ner
Loading classifier from edu/stanford/nlp/models/ner/english.all.3class.distsim.crf.ser.gz ... done [6.5 sec].
Loading classifier from edu/stanford/nlp/models/ner/english.muc.7class.distsim.crf.ser.gz ... done [4.6 sec].
Loading classifier from edu/stanford/nlp/models/ner/english.conll.4class.distsim.crf.ser.gz ... done [5.4 sec].
53268 [main] INFO mpi.aida.access.DataAccessSQL - Querying keywords with 69092 keyword IDs ...
1299190 [main] INFO mpi.aida.graph.GraphGenerator - Computing the mention-entity similarities...
1299894 [main] INFO mpi.aida.graph.GraphGenerator - Building the graph...
1299902 [main] INFO mpi.aida.graph.similarity.measure.MilneWittenEntityEntitySimilarity - Initializing MilneWittenEntityEntitySimilarity for 97 entities
1300150 [main] INFO mpi.aida.graph.similarity.measure.MilneWittenEntityEntitySimilarity - Done initializing MilneWittenEntityEntitySimilarity for 97 entities
1300154 [main] INFO mpi.aida.graph.extraction.ExtractGraph - Computing EE sims on '1' for 97 entities (30 threads)
1300330 [main] INFO mpi.aida.graph.extraction.ExtractGraph - Done calculating EE sims on '1' (0s)
1300374 [main] INFO mpi.aida.Disambiguator - Disambiguating '1' (21 mentions)
1300382 [main] INFO mpi.aida.graph.algorithms.CocktailParty - Initial minimum weighted degree: 2.4643908870308127E-5
1300594 [main] INFO mpi.aida.graph.algorithms.CocktailParty - No node can be removed without violating constraints.
1300598 [main] INFO mpi.aida.graph.algorithms.GreedyHillClimbing - Computing the initial solution...
1300606 [main] INFO mpi.aida.graph.algorithms.GreedyHillClimbing - Checked 32 combinations
1300606 [main] INFO mpi.aida.graph.algorithms.GreedyHillClimbing - The final solution has total weight 0.24818580922959216
1300606 [main] INFO mpi.aida.Disambiguator - Document '1' done in 1,254.47s
Golan (22) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Syrian (165) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Syrian Golan (202) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Muslim (338) -> Islam (http://en.wikipedia.org/wiki/Islam)
Israeli (955) -> Israel (http://en.wikipedia.org/wiki/Israel)
Arabic (1004) -> Arabic_language (http://en.wikipedia.org/wiki/Arabic_language)
Syrian (1243) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Muslim (1375) -> Islam (http://en.wikipedia.org/wiki/Islam)
Israeli (1434) -> Israel (http://en.wikipedia.org/wiki/Israel)
Golan (1465) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Syrian (1665) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Golan (1723) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Arab (2053) -> Arabic_language (http://en.wikipedia.org/wiki/Arabic_language)
Syrian (2062) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Setting maxEntityKeyphraseCount to 100
Set to 100
Nov 20, 2013 7:28:40 PM elkfed.knowledge.yago.wsd.AIDA main
INFO: Loading a2e_0030_part_00.mmax [1]
Disambiguating text with 423 tokens ...
1303966 [main] INFO mpi.aida.access.DataAccessSQL - Querying keywords with 69092 keyword IDs ...
2574981 [main] INFO mpi.aida.graph.GraphGenerator - Computing the mention-entity similarities...
2575705 [main] INFO mpi.aida.graph.GraphGenerator - Building the graph...
2575705 [main] INFO mpi.aida.graph.similarity.measure.MilneWittenEntityEntitySimilarity - Initializing MilneWittenEntityEntitySimilarity for 97 entities
2575785 [main] INFO mpi.aida.graph.similarity.measure.MilneWittenEntityEntitySimilarity - Done initializing MilneWittenEntityEntitySimilarity for 97 entities
2575785 [main] INFO mpi.aida.graph.extraction.ExtractGraph - Computing EE sims on '2' for 97 entities (30 threads)
2575881 [main] INFO mpi.aida.graph.extraction.ExtractGraph - Done calculating EE sims on '2' (0s)
2575917 [main] INFO mpi.aida.Disambiguator - Disambiguating '2' (21 mentions)
2575921 [main] INFO mpi.aida.graph.algorithms.CocktailParty - Initial minimum weighted degree: 2.4643908870308127E-5
2575997 [main] INFO mpi.aida.graph.algorithms.CocktailParty - No node can be removed without violating constraints.
2575997 [main] INFO mpi.aida.graph.algorithms.GreedyHillClimbing - Computing the initial solution...
2575997 [main] INFO mpi.aida.graph.algorithms.GreedyHillClimbing - Checked 32 combinations
2575997 [main] INFO mpi.aida.graph.algorithms.GreedyHillClimbing - The final solution has total weight 0.24818580922959216
2575997 [main] INFO mpi.aida.Disambiguator - Document '2' done in 1,274.53s
Golan (22) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Syrian (165) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Syrian Golan (202) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Muslim (338) -> Islam (http://en.wikipedia.org/wiki/Islam)
Israeli (955) -> Israel (http://en.wikipedia.org/wiki/Israel)
Arabic (1004) -> Arabic_language (http://en.wikipedia.org/wiki/Arabic_language)
Syrian (1243) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Muslim (1375) -> Islam (http://en.wikipedia.org/wiki/Islam)
Israeli (1434) -> Israel (http://en.wikipedia.org/wiki/Israel)
Golan (1465) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Syrian (1665) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Golan (1723) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Arab (2053) -> Arabic_language (http://en.wikipedia.org/wiki/Arabic_language)
Syrian (2062) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Setting maxEntityKeyphraseCount to 5000
Set to 5000
Nov 20, 2013 7:49:55 PM elkfed.knowledge.yago.wsd.AIDA main
INFO: Loading a2e_0030_part_00.mmax [1]
Disambiguating text with 423 tokens ...
2579258 [main] INFO mpi.aida.access.DataAccessSQL - Querying keywords with 69092 keyword IDs ...
3863786 [main] INFO mpi.aida.graph.GraphGenerator - Computing the mention-entity similarities...
3864734 [main] INFO mpi.aida.graph.GraphGenerator - Building the graph...
3864734 [main] INFO mpi.aida.graph.similarity.measure.MilneWittenEntityEntitySimilarity - Initializing MilneWittenEntityEntitySimilarity for 97 entities
3864850 [main] INFO mpi.aida.graph.similarity.measure.MilneWittenEntityEntitySimilarity - Done initializing MilneWittenEntityEntitySimilarity for 97 entities
3864850 [main] INFO mpi.aida.graph.extraction.ExtractGraph - Computing EE sims on '3' for 97 entities (30 threads)
3864922 [main] INFO mpi.aida.graph.extraction.ExtractGraph - Done calculating EE sims on '3' (0s)
3864938 [main] INFO mpi.aida.Disambiguator - Disambiguating '3' (21 mentions)
3864942 [main] INFO mpi.aida.graph.algorithms.CocktailParty - Initial minimum weighted degree: 2.4643908870308127E-5
3865026 [main] INFO mpi.aida.graph.algorithms.CocktailParty - No node can be removed without violating constraints.
3865026 [main] INFO mpi.aida.graph.algorithms.GreedyHillClimbing - Computing the initial solution...
3865026 [main] INFO mpi.aida.graph.algorithms.GreedyHillClimbing - Checked 32 combinations
3865026 [main] INFO mpi.aida.graph.algorithms.GreedyHillClimbing - The final solution has total weight 0.24818580922959216
3865026 [main] INFO mpi.aida.Disambiguator - Document '3' done in 1,288.46s
Golan (22) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Syrian (165) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Syrian Golan (202) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Muslim (338) -> Islam (http://en.wikipedia.org/wiki/Islam)
Israeli (955) -> Israel (http://en.wikipedia.org/wiki/Israel)
Arabic (1004) -> Arabic_language (http://en.wikipedia.org/wiki/Arabic_language)
Syrian (1243) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Muslim (1375) -> Islam (http://en.wikipedia.org/wiki/Islam)
Israeli (1434) -> Israel (http://en.wikipedia.org/wiki/Israel)
Golan (1465) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Syrian (1665) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Golan (1723) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Arab (2053) -> Arabic_language (http://en.wikipedia.org/wiki/Arabic_language)
Syrian (2062) -> Golan_Heights (http://en.wikipedia.org/wiki/Golan_Heights)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment