首页 | 本学科首页   官方微博 | 高级检索  
   检索      

关系抽取中远监督错误标注消除
引用本文:汝承森,唐晋韬,谢松县,李莎莎,王挺.关系抽取中远监督错误标注消除[J].国防科技大学学报,2018,40(3):148-152.
作者姓名:汝承森  唐晋韬  谢松县  李莎莎  王挺
作者单位:国防科技大学计算机学院
基金项目:国家自然科学基金资助项目(61472436,61532001,61303190)
摘    要:目前远监督方法被广泛应用于关系抽取任务。然而,远监督方法中存在大量错误标注现象,给远监督方法的学习效果带来了很大的影响。提出利用语义Jaccard度量关系短语与依存词间语义相似性的错误标注消除方法。消除错误标注后的训练数据用于训练模型,完成关系抽取。实验结果表明:该方法可以有效消除错误标注,提高关系抽取的性能。

关 键 词:关系抽取  远监督  错误标注  语义相似性
收稿时间:2016/11/25 0:00:00

Reducing wrong labels in distant supervision for relation extraction
RU Chengsen,TANG Jintao,XIE Songxian,LI Shasha and WANG Ting.Reducing wrong labels in distant supervision for relation extraction[J].Journal of National University of Defense Technology,2018,40(3):148-152.
Authors:RU Chengsen  TANG Jintao  XIE Songxian  LI Shasha and WANG Ting
Institution:College of Computer, National University of Defense Technology, Changsha 410073, China,College of Computer, National University of Defense Technology, Changsha 410073, China,College of Computer, National University of Defense Technology, Changsha 410073, China,College of Computer, National University of Defense Technology, Changsha 410073, China and College of Computer, National University of Defense Technology, Changsha 410073, China
Abstract:Distant supervision has been widely used for relation extraction recently. In the distant supervision, many labels may to wrongly marked, which exerts a bad impact on relation extraction. A method to reduce wrong labels was introduced by using the semantic Jaccard to measure semantic similarity between the relation phrases and the dependency terms. The training data after reducing wrong labels was used to train the relation extractors. The experimental results show that the proposed method can effectively reduce wrong labels and improve the relation extraction performance compared with the state-of-art methods.
Keywords:relation extraction  distant supervision  wrong labels  semantic similarity
本文献已被 CNKI 等数据库收录!
点击此处可从《国防科技大学学报》浏览原始摘要信息
点击此处可从《国防科技大学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号