Skip to content

Instantly share code, notes, and snippets.

View mbruzek's full-sized avatar

Matt Bruzek mbruzek

View GitHub Profile
@mbruzek
mbruzek / find_similar.py
Last active November 19, 2021 19:49
A duplicate expansion finding tool not yet ready for primetime.
#!/usr/bin/env python3
"""
A Python3 tool that uses the NASA-Acronym data to find similar expansions.
This code uses the Levenshtein Distance to calculate the difference between
sequences of characters in the expansions field of the data.
This file relies on thefuzz python package: https://github.com/seatgeek/thefuzz
#!/usr/bin/env python3
"""
A Python3 tool that uses the NASA-Acronym data to find similar expansions.
This code uses the Levenshtein Distance to calculate the difference between
sequences of characters in the expansions field of the data.
This file relies on thefuzz python package: https://github.com/seatgeek/thefuzz