This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
AMAZON_ACCESS_KEY = 'WHATEVER' | |
AMAZON_SECRET_KEY = 'SECRET_WHATEVER' | |
# I'm old-school, so I like the AWS-S3 gem. It's just a lightweight wrapper around Amazon's API. | |
# https://github.com/marcel/aws-s3 | |
require "aws/s3" | |
include AWS::S3 | |
def publish_json!(bucket='int.nyt.com', path='applications/represent-json/', filename='foo.json') |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
''' | |
pairwise.py | |
This script uses the Python Gensim library and heapq from the standard library to make | |
massively fast and scalable pairwise comparisons between an aribtrarily large number of | |
documents using TF-IDF and cosine distance. | |
The script first generates a similarity matrix between all documents in a set, then uses | |
heapq to retrieve the top K most similar matches to each document in that set. It has been | |
tested on sets as large as 400,000 documents on a Macbook Air. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
name|id | |
1911 United|C00508200 | |
50 State Strategy|C00502633 | |
9-9-9 FUND|C00504241 | |
Accountability 2010|C00489641 | |
AFL-CIO Workers' Voices PAC|C00484287 | |
Alaskans Standing Together|C00489385 | |
America for the People|C00497081 | |
America Get Up|C00494278 | |
America Votes Action Fund|C00492520 |
NewerOlder