Skip to content

Instantly share code, notes, and snippets.

@joeldrapper
Created March 27, 2025 11:20
Show Gist options
  • Save joeldrapper/d411523d4a30129c182e9f49724e338e to your computer and use it in GitHub Desktop.
Save joeldrapper/d411523d4a30129c182e9f49724e338e to your computer and use it in GitHub Desktop.
Simple fuzzy index with left weight
class FuzzyIndex
def initialize
@index = Hash.new { |h, k| h[k] = Set.new }
end
def []=(key, value)
trigrams(key).each { @index[it] << [key, value] }
end
def [](query)
results = {}
trigrams(query).each do |trigram|
@index[trigram].each do |pair|
results[pair] ||= 0
results[pair] += 1
end
end
results
.map { |(k, v), score| [score.to_f / k.length, [k, v]] }
.sort_by { |(weight, pair)| -weight }
.take(10)
end
private
def trigrams(word)
padded_normalized_word = "■■#{word.downcase}"
(0..padded_normalized_word.length - 3).map do |index|
padded_normalized_word[index, 3]
end
end
end
@joeldrapper
Copy link
Author

You can explore this algorithm in JavaScript here. https://svelte.dev/playground/8e09f4610bd240d4bb20eed6dadc00fb?version=5.25.3

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment