オオノ トモヒロ
Ohno Tomohiro
大野 誠寛 所属 東京電機大学 未来科学部 情報メディア学科 東京電機大学大学院 未来科学研究科 情報メディア学専攻 東京電機大学大学院 先端科学技術研究科 情報通信メディア工学専攻 職種 准教授 |
|
言語種別 | 英語 |
発行・発表の年月 | 2005/03 |
形態種別 | 学術研究論文 |
査読 | 査読有り |
標題 | Robust Dependency Parsing of Spontaneous Japanese Spoken Language |
執筆形態 | 共著 |
掲載誌名 | IEICE Transactions on Information and Systems |
掲載区分 | 国内 |
巻・号・頁 | E88-D(3),pp.545-552 |
著者・共著者 | Tomohiro Ohno, Shigeki Matsubara, Nobuo Kawaguchi, Yasuyoshi Inagaki |
概要 | Spontaneously spoken Japanese includes a lot of grammatically ill-formed linguistic phenomena such as fillers, hesitations, inversions, and so on, which do not appear in written language. This paper proposes a novel method of robust dependency parsing using a large-scale spoken language corpus, and evaluates the availability and robustness of the method using spontaneously spoken dialogue sentences. By utilizing stochastic information about the appearance of ill-formed phenomena, the method can robustly parse spoken Japanese including fillers, inversions, or dependencies over utterance units. Experimental results reveal that the parsing accuracy reached 87.0%, and we confirmed that it is effective to utilize the location information of a bunsetsu, and the distance information between bunsetsus as stochastic information. |