Last active
June 13, 2019 10:15
-
-
Save rolfen/0def4840f52b21dbf025785f7e6db999 to your computer and use it in GitHub Desktop.
Detects whether a string is (mostly) Arabic (or Farsi, etc.)
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
/* | |
* Tries to detects whether a given string is "Arabic" (or Farsi, Pashto, etc.) | |
* The detection is based on counting "Arabic alphabet" characters | |
* Returns true if the string is mostly made of "Arabic characters", false otherwise | |
* This is how to use on HTML element "el": | |
* detectArabic(el.innerText) | |
*/ | |
function detectArabic(string) { | |
try { | |
arCount = string.match(/[\u0600-\u06FF]/g).length; | |
// results are approximative because whitespace and some ranges of arabic characters are ignored. | |
return((string.length / arCount) > .5); | |
} catch(e) { | |
return(false); | |
} | |
} |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment