グラフに基づくクラスタリングによる動詞類義語の獲得(自然言語の論理と感情)

Takeuchi, Koichi; Takahashi, Hideyuki; Kobayashi, Daisuke

Permalink : http://escholarship.lib.okayama-u.ac.jp/49872

ID	49872
フルテキストURL	IEICE_NLC_110_245_13_18.pdf 624 KB
タイトル（別表記）	Extraction of Verb Synonyms Using Graph-Based Clustering
著者	竹内孔一岡山大学大学院自然科学研究科 Kaken ID publons researchmap 高橋秀幸岡山大学大学院自然科学研究科小林大介岡山大学工学部情報工学科
抄録	本研究ではクラスタリングを利用して動詞の類義語を獲得する方法について検討している.先行研究において,同時に2つの要素のクラスタを考慮しながらクラスタリングする同時共起クラスタリングを適用して,ベクトルベースのクラスタリングより精度が高いことを示した.しかし,近年ベクトルベースのクラスタリングでKernel K-meansという非線形境界でクラスタリングするより高度な手法が提案された.そこで,本報告ではKernel K-means法を我々の動詞類語獲得に適用し,先行研究における同じデータで同時共起クラスタリングとの比較を行う.この結果からKernel K-meansでのグラフ-ベクトルの等価性には限界があり,本研究が利用する2部グラフの構造は直接反映できないこと,先行研究の同時共起クラスタリングの方が高い精度で類語を獲得できることを報告する.
抄録（別表記）	This manuscript describes evaluation results of Kernel K-means clustering approach comparing with modified Aizawa's co-clustering approach for verb synonym extraction task. Kernel K-means approach is one of the state-of-the-art vector-based clustering method which can divide vector-spaces with non-linear boundary by incorporating Kernel method. Besides the mathematical framework of Kernel K-means can cover Spectral Graph Clustering. In this manuscript, however, we reveal Aizawa's co-clustering approach overcomes Kernel K-means on the verb synonym extraction task (bi-graph clustering) in Japanese. From this results we discuss that the equivalence between graph-vector space in Kernel K-means approach can be limited, and then Kernel K-means decease their accuracy in our verb synonym extraction.
発行日	2010-10-16
出版物タイトル	電子情報通信学会技術研究報告. NLC, 言語理解とコミュニケーション
巻	110巻
号	245号
出版者	一般社団法人電子情報通信学会
開始ページ	13
終了ページ	18
ISSN	0913-5685
NCID	AN10091225
資料タイプ	テクニカルレポート
言語	日本語
著作権者	一般社団法人電子情報通信学会
論文のバージョン	publisher
査読	有り