Skip to content

Instantly share code, notes, and snippets.

@html
Created November 3, 2012 12:08
Show Gist options
  • Save html/4007205 to your computer and use it in GitHub Desktop.
Save html/4007205 to your computer and use it in GitHub Desktop.
Transliteration from Cyrillic to Latin in Common Lisp of Russian symbols/Транслитерация с кирилицы в латиницу на Common Lisp русских символов
(defun transliterate-cyr-to-lat-russian (text)
; Taken from http://cl-cookbook.sourceforge.net/strings.html
(defun replace-all (string part replacement &key (test #'char=))
"Returns a new string in which all the occurences of the part
is replaced with replacement."
(with-output-to-string (out)
(loop with part-length = (length part)
for old-pos = 0 then (+ pos part-length)
for pos = (search part string
:start2 old-pos
:test test)
do (write-string string out
:start old-pos
:end (or pos (length string)))
when pos do (write-string replacement out)
while pos)))
(let ((simple-transliteration-ru "абвгдежзийклмнопрстуфыэАБВГДЕЖЗИЙКЛМНОПРСТУФЫЭ")
(simple-transliteration-en "abvgdegziyklmnoprstufieABVGDEGZIYKLMNOPRSTUFIE")
(complex-transliteration-ru-en '(("ё" "yo")
("х" "h")
("ц" "ts")
("ч" "ch")
("ш" "sh")
("щ" "shch")
("ъ" "")
("ь" "")
("ю" "yu")
("я" "ya")
("Ё" "Yo")
("Х" "H")
("Ц" "Ts")
("Ч" "Ch")
("Ш" "Sh")
("Щ" "Shch")
("Ъ" "")
("Ь" "")
("Ю" "Yu")
("Я" "Ya"))))
; Replacing single characters
(loop for char across simple-transliteration-ru for en-char across simple-transliteration-en do
(setf text (substitute en-char char text)))
; Replacing "complex" characters
(loop for (char en-char) in complex-transliteration-ru-en do
(setf text (replace-all text char en-char)))
text))
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment