Japanese Tokenizer Dictionaries

csv japanese natural language processing

Description

Japanese Tokenizer Dictionaries for use with MeCab.

Update Frequency

Infrequently (typically less than once a year)

License

Versions of Unidic offered here are available under the GPL/LGPL/BSD license.IPADic is offered under a unique BSD-like license. See below.

https://github.com/polm/ipadic-py/blob/master/ipadic/dicdir/COPYING

Documentation

This dataset includes dictionaries for tokenization and morphological analysis of Japanese for use with MeCab. This includes NINJAL's UniDic, a modified smaller version of UniDic for situations that require it, and the legacy IPADic dictionary.

Managed By

Cotonoha

See all datasets managed by Cotonoha.

Contact

polm@cotonoha.io

Usage Examples

Tutorials
Tools & Applications
Publications

Resources on AWS

  • Description
    Dictionary Files
    Resource type
    S3 Bucket
    Amazon Resource Name (ARN)
    arn:aws:s3:::cotonoha-dic
    AWS Region
    ap-northeast-1
    AWS CLI Access (No AWS account required)
    aws s3 ls s3://cotonoha-dic/ --no-sign-request

Edit this dataset entry on GitHub

Home