Skip to content

Instantly share code, notes, and snippets.

@siddMahen
Created June 19, 2014 16:30
Show Gist options
  • Save siddMahen/9bb5df007843f71d9ab3 to your computer and use it in GitHub Desktop.
Save siddMahen/9bb5df007843f71d9ab3 to your computer and use it in GitHub Desktop.
#!/bin/bash
for i in {3..229}
do
curl -s "http://twssstories.com/node?page=$i" | perl -n -e '/"([^"]*)" TWSS/ && print "$1\n"' >> twss_test.txt
done
cat eng-ca_web_2002_100K-sentences.txt | perl -n -e '/^[0-9]+\s+([A-Z].*\.)$/ && print "$1\n"' | grep " I " >> normal.txt
cat eng-uk_web_2002_100K-sentences.txt | perl -n -e '/^[0-9]+\s+([A-Z].*\.)$/ && print "$1\n"' | grep " I " >> normal.txt
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment