Visualizing Twitter Search Results with Protovis and/or Graphviz is this easy:
$ easy_install twitter # See https://github.com/sixohsix/twitter and http://pypi.python.org/pypi/setuptools
$ git clone https://github.com/ptwobrussell/Mining-the-Social-Web.git
$ cd Mining-the-Social-Web/python_code
$ python introduction__retweet_visualization.py TeaParty # or whatever you want to search for
Your browser should pop open and display the results as a force-directed graph, but also check your console for some useful output.
You can create an image file from the DOT language output with a command like the following:
# Twitter's Trends API has been in flux since February 2011, when Mining the Social Web was published,
# and unfortunately, this is causing some confusion in the earliest examples.
# See also https://dev.twitter.com/docs/api/1/get/trends
# Note that the twitter package that's being imported is from https://github.com/sixohsix/twitter
# If you have first done an "easy_install pip" to get pip, you could easily install the latest
# version directly from GitHub as follows:
# $ pip install -e git+http://github.com/sixohsix/twitter.git#egg=github-pip-install
# -*- coding: utf-8 -*-
# Studying this script might be helpful in understanding why UnicodeDecode errors
# sometimes happen when trying to capture utf-8 output to files with Python 2 even
# though the output prints to your (utf-8 capable) terminal.
# Note that the first line of this file is an encoding declaration (see PEP 263), not a
# Byte Order Marker (BOM). It is a directive that tells Python to treat this file as utf-8
# (i.e. comments and string values may be utf-8)
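
One likely cause of the errors described above is that Python 2 falls back to the ASCII codec when output is redirected to a file or pipe, because no terminal encoding is available. The following is a minimal sketch of that failure mode and of the usual workaround (encoding explicitly as utf-8 before writing). The sample string and the in-memory buffer standing in for a file are assumptions for illustration, not part of the original script.

```python
# -*- coding: utf-8 -*-
import io

# A sample Unicode value containing a non-ASCII character (hypothetical data)
text = u"caf\u00e9"

# Encoding with the ASCII codec fails, which is roughly what happens
# implicitly when Python 2 output is redirected and no encoding is known
try:
    text.encode("ascii")
    ascii_ok = True
except UnicodeEncodeError:
    ascii_ok = False

# Encoding explicitly as utf-8 always succeeds for valid Unicode text
encoded = text.encode("utf-8")

# An in-memory byte buffer stands in for a file opened in binary mode
buf = io.BytesIO()
buf.write(encoded)

print(ascii_ok)
print(repr(buf.getvalue()))
```

Encoding at the point of output, rather than relying on the interpreter's default, keeps the behavior the same whether the script writes to a terminal or to a file.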
# On a remote AWS VM, you'll need to create and save your
# CSV connections to the remote VM before executing Example 6
# since you're not using Vagrant (and since we won't be using SSH
# as part of the workshop).
# Copy/paste your connections (or a large subset of them) into a string value
# that's bounded by triple quotes like the following example (which defines only
# a single contact for brevity).
csv_as_string = \
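
The preview above is cut off before the string itself. As a rough, self-contained sketch of the idea in the comments, a triple-quoted CSV string can be parsed with the standard library as shown here. The column names and the single contact are hypothetical placeholders, not the author's data, and the sketch targets Python 3 (under Python 2, `io.StringIO` would need a `unicode` value).

```python
import csv
import io

# Hypothetical header and contact; real exports will have different columns
csv_as_string = """First Name,Last Name,Company,Job Title
Jane,Doe,Example Corp,Engineer
"""

# Wrap the string in a file-like object so csv.DictReader can read it
rows = list(csv.DictReader(io.StringIO(csv_as_string)))

for row in rows:
    print(row["First Name"], row["Last Name"], "-", row["Company"])
```

Parsing from a string avoids having to copy a file onto the remote VM, which is the constraint the comments describe.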
########################################################################
#
# An example of how to deploy a custom predictive model to yhat
# and "predict" the summary for a news article.
#
# Input: URL for a web page containing a news article
#
# Output: Summary of the "story" in the web page for the URL
#
# Example usage: $ python summarizer.py <username> <apikey> <url>
import geojson
import sys

# Each input line is tab-separated: x, y, id, text, screen_name, utc
with open(sys.argv[1]) as f:
    lines = [line.strip().split("\t") for line in f]

features = []
for (x, y, _id, text, screen_name, utc) in lines:
    props = dict(text=text, screen_name=screen_name, utc=utc)
    # Coordinates are read as strings, so convert them to numbers for GeoJSON
    point = geojson.Point(coordinates=(float(x), float(y)))
    features.append(geojson.Feature(id=_id, geometry=point, properties=props))
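
For readers without the geojson package installed, the same conversion can be sketched with plain dictionaries and the standard json module. This is an illustration under assumptions, not the author's code: the function name `tsv_to_feature_collection` and the sample row are hypothetical, and the field order is taken from the loop above.

```python
import json

def tsv_to_feature_collection(tsv_text):
    """Build a GeoJSON-style FeatureCollection dict from tab-separated rows."""
    features = []
    for line in tsv_text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        x, y, _id, text, screen_name, utc = line.strip().split("\t")
        features.append({
            "type": "Feature",
            "id": _id,
            "geometry": {"type": "Point", "coordinates": [float(x), float(y)]},
            "properties": {"text": text, "screen_name": screen_name, "utc": utc},
        })
    return {"type": "FeatureCollection", "features": features}

# Hypothetical sample row: x, y, id, text, screen_name, utc
sample = "-86.78\t36.16\t1\thello world\texample_user\t2013-01-01T00:00:00Z"
collection = tsv_to_feature_collection(sample)
print(json.dumps(collection, indent=2))
```

Note that GeoJSON expects coordinates in longitude, latitude order, so `x` should hold the longitude if the output is meant for a mapping tool.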
"""
A modification of MTSW2E Example 6-3 (http://bit.ly/1aWYgAv) with the following changes:
* Extra debugging information is written to sys.stderr to help isolate any problematic content
that may be encountered.
* A (hopeful) fix to a blasted UnicodeEncodeError in cleanContent() that may be triggered from
quopri.decodestring attempting to decode an already decoded Unicode value.
* The JSONification in jsonifyMessage now ignores any content that's not text. MIME-encoded content
such as images, PDFs, and other non-text data that is not useful for textual analysis without
significant additional work is now no longer carried forward into the JSON for import into MongoDB.
import json
import pymongo # pip install pymongo
from bson import json_util # Comes with pymongo
import re
# The basis of our query
FROM = "[email protected]" # As opposed to a value like "Coursera <[email protected]>"
client = pymongo.MongoClient()
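
The comment on `FROM` points at a practical wrinkle: a sender can appear either as a bare address or as "Name <address>". One way to match both forms, sketched here under assumptions, is to build a filter around a compiled regular expression, which pymongo accepts as a value in a query document. The helper name `from_query`, the `From` field name, and the address `noreply@example.com` are hypothetical stand-ins, since the real address is obscured in the source.

```python
import re

def from_query(address):
    """Return a MongoDB-style filter matching a bare or bracketed address."""
    # Match the address at the start of the value or right after "<",
    # and require it to end the value or be followed by ">"
    pattern = re.compile(r"(^|<)" + re.escape(address) + r"(>|$)", re.IGNORECASE)
    return {"From": pattern}

# Hypothetical address used only for illustration
query = from_query("noreply@example.com")

# The same pattern can be checked locally before querying a database
for candidate in ["noreply@example.com",
                  "Example <noreply@example.com>",
                  "other@example.com"]:
    print(candidate, bool(query["From"].search(candidate)))
```

A filter like this could then be passed to a collection's `find` method; the database and collection names depend on how the mail was imported and are not shown in the preview.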