Created September 21, 2018 12:47
Transliterate an Indic-script CSV dataset to Roman script using indictrans: https://github.com/libindic/indic-trans/tree/master/indictrans
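#!/bin/bash
# Arguments: source language code, input file, output file.
SOURCE=$1
TARGET=eng
IN=$2
OUT=$3
# Keep the first whitespace-delimited field as-is; transliterate the rest of each line.
while read -r col1 rest; do
  printf '%s\t%s\n' "$col1" "$(indictrans -s "$SOURCE" -t "$TARGET" --ml --build-lookup <<<"$rest")"
done < "$IN" > "$OUT"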
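The loop's field handling can be tried without indictrans installed. The sketch below is only an illustration: `fake_translit` and `convert_lines` are hypothetical names, and `fake_translit` is a stand-in that upper-cases its input instead of transliterating it. It uses a pipe rather than a here-string so that it also runs under plain `sh`.

```shell
#!/bin/bash
# Hypothetical stand-in for indictrans: reads stdin, writes upper-cased text.
fake_translit() {
  tr '[:lower:]' '[:upper:]'
}

# Same loop shape as the script: `read` puts the first whitespace-delimited
# field in col1 and everything after it (inner spaces and tabs kept) in rest.
convert_lines() {
  while read -r col1 rest; do
    printf '%s\t%s\n' "$col1" "$(printf '%s\n' "$rest" | fake_translit)"
  done
}

# Two sample rows: an identifier followed by text to transform.
printf 'id1 hello world\nid2 foo bar\n' | convert_lines
```

The first column passes through unchanged and is joined to the transformed remainder with a tab. A final input line with no trailing newline would be skipped by `read`, so the input should end with a newline.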
Example usage:
Please find an example dataset for Kannada (ISO 639-1 kn) here.
Example input data
Output data
Please see this SF question for more info about this task.