TsvHttpData-1.0
https://s3-us-west-1.amazonaws.com/fasttext-vectors/wiki.en.vec	6597238061	RBesTphc+YsGkOD1gBEqaA==
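
The URL list above can be read back programmatically. The following is a minimal sketch, assuming the usual three-column TsvHttpData-1.0 layout (URL, size in bytes, base64-encoded MD5, separated by tabs). The helper names `parse_url_list` and `md5_base64` are hypothetical and not part of any library.

```python
import base64
import hashlib


def parse_url_list(text):
    """Parse a TsvHttpData-1.0 URL list into (url, size, md5_b64) tuples.

    Assumes tab-separated columns; size and MD5 are treated as optional.
    """
    lines = [ln for ln in text.splitlines() if ln.strip()]
    if not lines or lines[0].strip() != "TsvHttpData-1.0":
        raise ValueError("missing TsvHttpData-1.0 header")
    entries = []
    for ln in lines[1:]:
        fields = [f.strip() for f in ln.split("\t")]
        url = fields[0]
        size = int(fields[1]) if len(fields) > 1 and fields[1] else None
        md5_b64 = fields[2] if len(fields) > 2 and fields[2] else None
        entries.append((url, size, md5_b64))
    return entries


def md5_base64(data):
    """Return the base64-encoded MD5 digest of a bytes object."""
    return base64.b64encode(hashlib.md5(data).digest()).decode("ascii")


sample = (
    "TsvHttpData-1.0\n"
    "https://s3-us-west-1.amazonaws.com/fasttext-vectors/wiki.en.vec"
    "\t6597238061\tRBesTphc+YsGkOD1gBEqaA==\n"
)
entries = parse_url_list(sample)
```

To check a downloaded file against its entry, compare `md5_base64(contents)` with the third field. For a file of this size, feeding the hash in chunks rather than reading everything into memory would be the safer choice.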
#%Module1.0
## Spark 2.4.3 modulefile
##
proc ModulesHelp { } {
    puts stderr "Provides the Spark environment for large-scale data processing"
    puts stderr "This version has been prebuilt with Scala 2.11."
}
module-whatis "Apache Spark™ is a unified analytics engine for large-scale data processing."
#!/bin/bash
#SBATCH --job-name spark-cluster
#SBATCH --account=qh82
#SBATCH --time=02:00:00
# --- Master resources ---
#SBATCH --nodes=1
#SBATCH --mem-per-cpu=1G
#SBATCH --cpus-per-task=1
#SBATCH --ntasks-per-node=1
# --- Worker resources ---
#!/bin/bash
if [ "$#" -ne 1 ]; then
    echo "Network device must be specified"
    exit 1
fi
DEVICE="$1"
OLD_MAC=$(ifconfig | grep "$DEVICE:" -A3 | grep ether | head -1 | cut -d ' ' -f 2)
import csv
import glob
import gzip
import hashlib
import xml.sax

import tqdm

file_list = glob.glob("./pubmed/*.xml.gz")