In this gist we provide 2 JSON files.
- joyo2010.json
joyo2010.json
is a parsed JSON file to facilitate the search for Japanese Jōyō kanji readings.- It is retrieved from github.com/cjkvi/cjkvi-tables which is based on the "List of jōyō kanji" from Wikipedia with some modifications. Such as missing "kyu-jitai" ("旧字体", old kanji) that corresponds to the joyo-kanji.
- Currntly, it only supports 2,136 characters of "常用漢字" (Jōyō kanji) defined by the Japanese Ministry of Education in 2010.
- old_to_new_kanjis.json
old_to_new_kanjis.json
is a JSON file that maps the Kyūjitai ("旧字体", old-form kanji) to Shinjitai ("新字体", new-form of kanji).- It is a simple key-value mapping. The key of each element is the "old-kanji" and the value is the corresponding "new-kanji".
- Currently, it supports 5,314 old-kanji characters.
- Note that it contains new-kanjis that are not in Jōyō kanjis.
- https://gist.githubusercontent.com/KEINOS/fb660943484008b7f5297bb627e0e1b1/raw/joyo2010.json
- https://gist.githubusercontent.com/KEINOS/fb660943484008b7f5297bb627e0e1b1/raw/old_to_new_kanjis.json
- Please post a comment below for any issues.
- ikawaha/kagome#276
- Every key and values are string. Cast the type during parse/unmarshal to the object.
{
"<Unicode of the joyo kanji>": {
"joyo_kanji": "<string>",
"kyu_jitai": "<string>",
"yomi": {
"on_yomi": [
"<string>",
"<string>",
...
],
"kun_yomi": [
"<string>",
"<string>",
...
],
"example_yomi": [
"<string>",
"<string>",
...
],
"raw_info": "<string>"
}
}
}
joyo_kanji
: The target kanji.joyo kanji
is the regular-use Chinese characters for Japanese writing.kyu_jitai
: The traditional/old forms of kanji used in Japanese writing. This character is preferable to be replaced tojoyo kanji
.yomi
: The variations of the reading of the kanji.on_yomi
: The "On"(Sino-Japanese/Chinese) reading of the kanji.kun_yomi
: The "Kun"(Japanese) reading of the kanji.example_yomi
: Example or variation of the reading.
raw_info
: The tab-separated line from the original source. It contains the number of strokes of the kanji in the 3rd column as well.
𠮟 = U+20B9F = 0x20B9F = 0d134047
{
"134047": {
"joyo_kanji": "𠮟",
"yomi": {
"on_yomi": [
"シツ"
],
"kun_yomi": [
"しか"
],
"example_yomi": [
"しか-る"
]
},
"raw_info": "𠮟\t\t5\t7S\t2010\tシツ、しか-る"
}
}