Regex to test for presence of Japanese characters
// REFERENCE UNICODE TABLES:
// http://www.rikai.com/library/kanjitables/kanji_codes.unicode.shtml
// http://www.tamasoft.co.jp/en/general-info/unicode.html
//
// TEST EDITOR:
// http://www.gethifi.com/tools/regex
//
// UNICODE RANGE : DESCRIPTION
//
// 3000-303F : punctuation
// 3040-309F : hiragana
// 30A0-30FF : katakana
// FF00-FFEF : Full-width roman + half-width katakana
// 4E00-9FAF : Common and uncommon kanji
//
// Non-Japanese punctuation/formatting characters commonly used in Japanese text
// 2605-2606 : Stars
// 2190-2195 : Arrows
// 203B      : Weird asterisk thing (※, the reference mark)
var regex = /[\u3000-\u303F]|[\u3040-\u309F]|[\u30A0-\u30FF]|[\uFF00-\uFFEF]|[\u4E00-\u9FAF]|[\u2605-\u2606]|[\u2190-\u2195]|\u203B/;
var input = "input string";
if (regex.test(input)) {
    console.log("Japanese characters found");
}
else {
    console.log("No Japanese characters");
}
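As a rough sketch, the same check can be wrapped in a reusable function with the ranges merged into a single character class. The names `JAPANESE_REGEX` and `containsJapanese` are hypothetical and not part of the original gist; the ranges are copied unchanged from the table above.

```javascript
// Hypothetical helper (not in the original gist): the same ranges as the
// table above, merged into one character class instead of alternations.
var JAPANESE_REGEX = /[\u3000-\u303F\u3040-\u309F\u30A0-\u30FF\uFF00-\uFFEF\u4E00-\u9FAF\u2605-\u2606\u2190-\u2195\u203B]/;

// Returns true if at least one character of `text` falls in the listed ranges.
function containsJapanese(text) {
    return JAPANESE_REGEX.test(text);
}

console.log(containsJapanese("input string")); // false
console.log(containsJapanese("こんにちは"));   // true (hiragana)
console.log(containsJapanese("ＡＢＣ"));       // true (full-width roman letters)
console.log(containsJapanese("中文"));         // true (the kanji range is shared with Chinese)
```

Note that the test only says "some character is in these ranges". Because kanji share the CJK ideograph block with Chinese, Chinese text also matches, so this is a presence check rather than a language detector.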
Thank you so much, helped me out big time!
Works like a charm, thanks.
I believe this code is what the kids call "bussin"
Thank you
The g flag is unnecessary and can cause bugs, because it introduces state into the regex that affects subsequent calls to .test
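The statefulness mentioned in this comment can be seen in a small sketch. With the `g` flag, `.test` remembers where the last match ended in the regex's `lastIndex` property and resumes from there on the next call, so repeated calls on the same string alternate between `true` and `false`. The variable names below are illustrative only, and a single hiragana range is used to keep the example short.

```javascript
// With the g flag, the regex object carries lastIndex between calls.
var withG = /[\u3040-\u309F]/g;
var withGResults = [
    withG.test("あ"), // match at index 0, lastIndex becomes 1
    withG.test("あ"), // search starts at index 1, no match, lastIndex resets to 0
    withG.test("あ")  // search starts at index 0 again
];
console.log(withGResults); // [ true, false, true ]

// Without the g flag, every call starts from the beginning of the string.
var withoutG = /[\u3040-\u309F]/;
var withoutGResults = [
    withoutG.test("あ"),
    withoutG.test("あ"),
    withoutG.test("あ")
];
console.log(withoutGResults); // [ true, true, true ]
```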
@aehlke wow, long time no see! I’ll edit it lol
hey @ryanmcgrath it's been a long time indeed! I have this snippet in an older part of my app and hadn't realized it's from you :D hope you're doing well, and thanks for the edit
Thank you