MARINE : Multi-task leaRnIng-based JapaNese accent Estimation
September 20, 2022 · View on GitHub
marine is a tool kit for building the Japanese accent estimation model proposed in our paper (demo).
For academic use, please cite the following paper (ISCA archive).
@inproceedings{park22b_interspeech,
author={Byeongseon Park and Ryuichi Yamamoto and Kentaro Tachibana},
title={{A Unified Accent Estimation Method Based on Multi-Task Learning for Japanese Text-to-Speech}},
year=2022,
booktitle={Proc. Interspeech 2022},
pages={1931--1935},
doi={10.21437/Interspeech.2022-334}
}
Notice
The model included in this package is trained using JSUT corpus, which is not the same as the dataset in our paper. Therefore, the model's performance is also not equal to the performance introduced in our paper.
Get started
Installation
$ pip install marine
For development
$ pip install -e ".[dev]"
Quick demo
In [1]: from marine.predict import Predictor
In [2]: nodes = [{"surface": "こんにちは", "pos": "感動詞:*:*:*", "pron": "コンニチワ", "c_type": "*", "c_form": "*", "accent_type": 0, "accent_con_type": "-1", "chain_flag": -1}]
In [3]: predictor = Predictor()
In [4]: predictor.predict([nodes])
Out[4]:
{'mora': [['コ', 'ン', 'ニ', 'チ', 'ワ']],
'intonation_phrase_boundary': [[0, 0, 0, 0, 0]],
'accent_phrase_boundary': [[0, 0, 0, 0, 0]],
'accent_status': [[0, 0, 0, 0, 0]]}
In [5]: predictor.predict([nodes], accent_represent_mode="high_low")
Out[5]:
{'mora': [['コ', 'ン', 'ニ', 'チ', 'ワ']],
'intonation_phrase_boundary': [[0, 0, 0, 0, 0]],
'accent_phrase_boundary': [[0, 0, 0, 0, 0]],
'accent_status': [[0, 1, 1, 1, 1]]}
Build model yourself
Coming soon...
LICENSE
- marine: Apache 2.0 license (LICENSE)
- JSUT: CC-BY-SA 4.0 license, etc. (Please check jsut-label/LICENCE.txt)