Skip to content

Latest commit

 

History

History
66 lines (58 loc) · 1.36 KB

README.md

File metadata and controls

66 lines (58 loc) · 1.36 KB

furigana

Generate furigana(振り仮名) from Japanese

It uses MeCab (a Natural Language Toolkit) to split Japanese into words, and superscript it with furigana (振り仮名).

Example:

input

from furigana.furigana import print_html
print_html('澱んだ街角で僕らは出会った')

output

よど ん だ 街角まちかどぼく ら は っ た

input

from furigana.furigana import print_html
print_html('お茶にお煎餅、よく合いますね')

output

ちゃ に お 煎餅せんべい 、 よく い ます ね

Usage

$ python3 furigana.py '活版印刷の流れを汲む出版作業では'

Dependency

See https://pypi.python.org/pypi/mecab-python3/0.7
run below commands on ubuntu

sudo apt-get install libmecab-dev mecab mecab-ipadic-utf8
sudo -H pip3 install .

Conflict with Anaconda Python

Please use Ubuntu's original python3, not to use with Anaconda Python3

Testing

Some unit tests are provided:

sudo -H pip3 install pytest
python3 -m pytest -s tests/test_furigana.py