Japanese Tokenizer Dictionaries for use with MeCab.
Infrequently (typically less than once a year)
Versions of Unidic offered here are available under the GPL/LGPL/BSD license.IPADic is offered under a unique BSD-like license. See below.
This dataset includes dictionaries for tokenization and morphological analysis of Japanese for use with MeCab. This includes NINJAL's UniDic, a modified smaller version of UniDic for situations that require it, and the legacy IPADic dictionary.
See all datasets managed by Cotonoha.
aws s3 ls s3://cotonoha-dic/ --no-sign-request