#!/bin/bash
file="$1"
colorpages=0
# count all pages
totalpages=$(gs -q -dNODISPLAY -c "($1) (r) file runpdfbegin pdfpagecount = quit")
echo "Total pages : $totalpages"
# find pages with colors
for page in $(identify -density 12 -format '%p ' "$file") ; do
#!/usr/bin/python
# convert citations into latex format
#
# (Nivre et al., 2007)
# (Sagae and Tsujii 2007)
# Nivre (2007)
# (Chen et al., 2007; Dredze et al., 2007).
#
# \cite{Nivre2007}
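The preview above stops before the conversion code. As a minimal sketch of the idea in the header comment (not the author's implementation), the snippet below rewrites parenthetical author-year citations as `\cite{...}` commands. The key scheme (first author's surname plus year) is inferred from the single `\cite{Nivre2007}` example, and the names `cite_key` and `convert` are hypothetical. Textual citations such as `Nivre (2007)` are left untouched here, because the header does not show how they should be rendered.

```python
import re

# One citation inside the parentheses: first surname, an optional
# "et al." or "and Second", an optional comma, then a four-digit year.
_ENTRY = re.compile(
    r"^\s*(?P<first>[A-Z][\w-]+)"
    r"(?:\s+et\s+al\.|\s+and\s+[A-Z][\w-]+)?"
    r",?\s+(?P<year>\d{4})\s*$"
)
# A parenthesised group with no nested parentheses.
_PAREN = re.compile(r"\(([^()]+)\)")


def cite_key(entry):
    """Return an assumed BibTeX key (surname + year), or None if no match."""
    m = _ENTRY.match(entry)
    if m is None:
        return None
    return m.group("first") + m.group("year")


def convert(text):
    """Replace parenthetical author-year citations with \\cite{...}."""
    def repl(match):
        keys = [cite_key(part) for part in match.group(1).split(";")]
        # Leave the group alone unless every part looks like a citation.
        if any(key is None for key in keys):
            return match.group(0)
        return r"\cite{" + ",".join(keys) + "}"
    return _PAREN.sub(repl, text)


if __name__ == "__main__":
    print(convert("(Chen et al., 2007; Dredze et al., 2007)."))
```

Groups separated by semicolons are merged into a single `\cite` with comma-separated keys; any parenthesised text that does not parse as a citation is returned unchanged.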
from pyelasticsearch import ElasticSearch
import pandas as pd
from time import time
root_path="/home/clemsos/Dev/mitras/"
raw_data_path=root_path+"data/"
csv_filename="week10.csv"
t0=time()
#!/usr/bin/env python
# -*- coding: utf-8 -*-
'''
This script shows the basic workflow for computing a TF-IDF similarity matrix with Gensim
OUTPUT :