Tomer Elmalem tomelm

tomelm / classify.py

Created May 27, 2015 17:11

	# clf = sklearn.linear_model.LogisticRegression
	# significant_terms = set of terms appearing more than n times in training
	def classify(left_name, right_name):
	"""
	Classifies names using delta term analysis.
	:return:
	A tuple (p_is_duplicate, exact_match_rare_terms, one_side_rare_terms).

	* p_is_duplicate is the score from the log-linear classifier. It's
	probably the most relevant signal.

tomelm / wordpress_jekyll_converter.rb

Last active August 29, 2015 14:24

	require 'date'
	require 'nokogiri'
	require 'rest-client'
	require 'reverse_markdown'

	# Match [caption <stuff>]...[/caption] tags
	# example: http://rubular.com/r/r2FH3QSOpL
	CAPTION_REGEX = /\[caption.\](?=.\[)\|\[\/caption\]/

tomelm / feed_snippet.json

Last active August 29, 2015 14:26

tomelm / gist:7b257c4261e029f4afcfc0e83fce1ee0

Created November 29, 2016 18:46

fusion ruby sample

	ruby\|ruby ⇒ ruby sample.rb lookup --business-id=tropisueño-san-francisco-3
	Found business with id yelp-san-francisco:
	{
	"categories": [
	{
	"alias": "localflavor",
	"title": "Local Flavor"
	},
	{
	"alias": "massmedia",