CN108182179A - Method and device for processing natural language - Google Patents
Method and device for processing natural language
- Publication number
- CN108182179A
- Authority
- CN
- China
- Prior art keywords
- word
- named entity
- default
- sequence
- keyword
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
- G06F40/295—Named entity recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/237—Lexical tools
- G06F40/247—Thesauruses; Synonyms
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810085253.1A CN108182179B (zh) | 2018-01-29 | 2018-01-29 | Method and device for processing natural language |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810085253.1A CN108182179B (zh) | 2018-01-29 | 2018-01-29 | Method and device for processing natural language |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108182179A (zh) | 2018-06-19 |
CN108182179B (zh) | 2019-07-30 |
Family
ID=62551616
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810085253.1A Active CN108182179B (zh) | Method and device for processing natural language | 2018-01-29 | 2018-01-29 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108182179B (zh) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
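CN109558584A (zh) * | 2018-10-26 | 2019-04-02 | Ping An Technology (Shenzhen) Co., Ltd. | Enterprise relationship prediction method and apparatus, computer device, and storage medium |
CN109582975A (zh) * | 2019-01-31 | 2019-04-05 | Beijing Jiahe Meikang Information Technology Co., Ltd. | Named entity recognition method and device |
CN109766552A (zh) * | 2019-01-08 | 2019-05-17 | Anhui Taiyue Xiangsheng Software Co., Ltd. | Coreference resolution method and device based on announcement information |
CN110059320A (zh) * | 2019-04-23 | 2019-07-26 | Tencent Technology (Shenzhen) Co., Ltd. | Entity relation extraction method and apparatus, computer device, and storage medium |
CN111859970A (zh) * | 2020-07-23 | 2020-10-30 | Beijing ByteDance Network Technology Co., Ltd. | Method, apparatus, device, and medium for processing information |
CN111859858A (zh) * | 2020-07-22 | 2020-10-30 | Zhizhe Sihai (Beijing) Technology Co., Ltd. | Method and device for extracting relations from text |
CN113128226A (zh) * | 2019-12-31 | 2021-07-16 | Alibaba Group Holding Limited | Named entity recognition method and apparatus, electronic device, and computer storage medium |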
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20120036130A1 (en) * | 2007-12-21 | 2012-02-09 | Marc Noel Light | Systems, methods, software and interfaces for entity extraction and resolution and tagging |
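CN105468583A (zh) * | 2015-12-09 | 2016-04-06 | Baidu Online Network Technology (Beijing) Co., Ltd. | Method and device for obtaining entity relations |
CN106844413A (zh) * | 2016-11-11 | 2017-06-13 | Nanjing Yuanchang Information Technology Co., Ltd. | Entity relation extraction method and device |
CN107247707A (zh) * | 2017-06-27 | 2017-10-13 | Beijing Ultrapower Software Co., Ltd. | Method and device for extracting enterprise association information based on a completion strategy |
CN107392436A (zh) * | 2017-06-27 | 2017-11-24 | Beijing Ultrapower Software Co., Ltd. | Method and device for extracting enterprise association information |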
US20170351749A1 (en) * | 2016-06-03 | 2017-12-07 | Microsoft Technology Licensing, Llc | Relation extraction across sentence boundaries |
- 2018-01-29: CN application CN201810085253.1A, patent CN108182179B (zh), status: Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20120036130A1 (en) * | 2007-12-21 | 2012-02-09 | Marc Noel Light | Systems, methods, software and interfaces for entity extraction and resolution and tagging |
CN105468583A (zh) * | 2015-12-09 | 2016-04-06 | Baidu Online Network Technology (Beijing) Co., Ltd. | Method and device for obtaining entity relations |
US20170351749A1 (en) * | 2016-06-03 | 2017-12-07 | Microsoft Technology Licensing, Llc | Relation extraction across sentence boundaries |
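CN106844413A (zh) * | 2016-11-11 | 2017-06-13 | Nanjing Yuanchang Information Technology Co., Ltd. | Entity relation extraction method and device |
CN107247707A (zh) * | 2017-06-27 | 2017-10-13 | Beijing Ultrapower Software Co., Ltd. | Method and device for extracting enterprise association information based on a completion strategy |
CN107392436A (zh) * | 2017-06-27 | 2017-11-24 | Beijing Ultrapower Software Co., Ltd. | Method and device for extracting enterprise association information |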
Non-Patent Citations (2)
Title |
---|
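Liu Shaoyu et al.: "A survey of entity relation extraction research", Journal of Information Engineering University * |
Li Ying et al.: "Chinese open n-ary entity relation extraction", Computer Science * |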
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
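CN109558584A (zh) * | 2018-10-26 | 2019-04-02 | Ping An Technology (Shenzhen) Co., Ltd. | Enterprise relationship prediction method and apparatus, computer device, and storage medium |
CN109766552A (zh) * | 2019-01-08 | 2019-05-17 | Anhui Taiyue Xiangsheng Software Co., Ltd. | Coreference resolution method and device based on announcement information |
CN109766552B (zh) * | 2019-01-08 | 2023-01-31 | Anhui Taiyue Xiangsheng Software Co., Ltd. | Coreference resolution method and device based on announcement information |
CN109582975A (zh) * | 2019-01-31 | 2019-04-05 | Beijing Jiahe Meikang Information Technology Co., Ltd. | Named entity recognition method and device |
CN109582975B (zh) * | 2019-01-31 | 2023-05-23 | Beijing Jiahe Haisen Health Technology Co., Ltd. | Named entity recognition method and device |
CN110059320A (zh) * | 2019-04-23 | 2019-07-26 | Tencent Technology (Shenzhen) Co., Ltd. | Entity relation extraction method and apparatus, computer device, and storage medium |
CN113128226A (zh) * | 2019-12-31 | 2021-07-16 | Alibaba Group Holding Limited | Named entity recognition method and apparatus, electronic device, and computer storage medium |
CN111859858A (zh) * | 2020-07-22 | 2020-10-30 | Zhizhe Sihai (Beijing) Technology Co., Ltd. | Method and device for extracting relations from text |
CN111859858B (zh) * | 2020-07-22 | 2024-03-01 | Zhizhe Sihai (Beijing) Technology Co., Ltd. | Method and device for extracting relations from text |
CN111859970A (zh) * | 2020-07-23 | 2020-10-30 | Beijing ByteDance Network Technology Co., Ltd. | Method, apparatus, device, and medium for processing information |
CN111859970B (zh) * | 2020-07-23 | 2022-05-17 | Beijing ByteDance Network Technology Co., Ltd. | Method, apparatus, device, and medium for processing information |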
Also Published As
Publication number | Publication date |
---|---|
CN108182179B (zh) | 2019-07-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108182179B (zh) | Method and device for processing natural language | |
Zhu et al. | Multimodal joint attribute prediction and value extraction for e-commerce product | |
CN112783921A (zh) | Database operation method and device | |
CN103678288A (zh) | Method for automatic translation of proper names | |
Firdaus et al. | Incorporating politeness across languages in customer care responses: Towards building a multi-lingual empathetic dialogue agent | |
CN109766552B (zh) | Coreference resolution method and device based on announcement information | |
Garain et al. | JUNLP@ DravidianLangTech-EACL2021: Offensive language identification in Dravidian langauges | |
CN108304383B (zh) | Method and device for extracting formula information from business documents | |
Liu et al. | A crosslingual investigation of conceptualization in 1335 languages | |
Xiong et al. | Improving deep learning method for biomedical named entity recognition by using entity definition information | |
CN107577674A (zh) | Method and device for identifying enterprise names | |
CN114139543A (zh) | Entity linking corpus annotation method and device | |
Peng et al. | A dialogue-based information extraction system for medical insurance assessment | |
Carr | Evidence for the persistence of ancient Beothuk and Maritime Archaic mitochondrial DNA genome lineages among modern Native American peoples | |
Xu et al. | Lexical micro-adaptation for neural machine translation | |
CN110674630B (zh) | Coreference resolution method and apparatus, electronic device, and storage medium | |
Priyadarshani et al. | Statistical machine learning for transliteration: Transliterating names between sinhala, tamil and english | |
Skadina et al. | Towards hybrid neural machine translation for English-Latvian | |
Yang et al. | TANet: Thread-Aware Pretraining for Abstractive Conversational Summarization | |
CN111611779A (zh) | Auxiliary text annotation method, apparatus, device, and storage medium | |
Houssein | 11. Somalia: The Experience of Hawala Receiving Countries | |
Aletras et al. | Proceedings of the Natural Legal Language Processing Workshop 2021 | |
Mun et al. | How do Transformer-Architecture Models Address Polysemy of Korean Adverbial Postpositions? | |
Fosteri | Cross-Lingual and Genre-Supervised Parsing and Tagging for Low-Resource Spoken Data | |
Dinu et al. | Romanian word production: An orthographic approach based on sequence labeling |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
EE01 | Entry into force of recordation of patent licensing contract | Application publication date: 2018-06-19; Assignee: Zhongke Dingfu (Beijing) Science and Technology Development Co., Ltd.; Assignor: Beijing Shenzhou Taiyue Software Co., Ltd.; Contract record no.: X2019990000215; Denomination of invention: Method and device for processing natural language; Granted publication date: 2019-07-30; License type: Exclusive License; Record date: 2019-11-27 |
TR01 | Transfer of patent right | Effective date of registration: 2020-06-29; Address after: 230000 Zone B, 19th Floor, Building A1, 3333 Xiyou Road, Hi-tech Zone, Hefei City, Anhui Province; Patentee after: Dingfu Intelligent Technology Co., Ltd.; Address before: 100089 Room 601, Block A, Wanliu New Building, No. 28 Wanquanzhuang Road, Haidian District, Beijing; Patentee before: Beijing Ultrapower Software Co., Ltd. |