CN106202010B - 基于深度神经网络构建法律文本语法树的方法和装置 - Google Patents
基于深度神经网络构建法律文本语法树的方法和装置 Download PDFInfo
- Publication number
- CN106202010B CN106202010B CN201610546350.7A CN201610546350A CN106202010B CN 106202010 B CN106202010 B CN 106202010B CN 201610546350 A CN201610546350 A CN 201610546350A CN 106202010 B CN106202010 B CN 106202010B
- Authority
- CN
- China
- Prior art keywords
- text
- word
- syntax tree
- training text
- term vector
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 40
- 238000013528 artificial neural network Methods 0.000 title claims abstract description 28
- 238000012549 training Methods 0.000 claims abstract description 57
- 239000013598 vector Substances 0.000 claims abstract description 50
- 238000012545 processing Methods 0.000 claims description 7
- 239000000203 mixture Substances 0.000 claims description 3
- 230000001537 neural effect Effects 0.000 claims description 3
- 230000011218 segmentation Effects 0.000 claims description 3
- 230000009897 systematic effect Effects 0.000 abstract description 4
- 238000005516 engineering process Methods 0.000 description 7
- 230000006870 function Effects 0.000 description 7
- 238000003058 natural language processing Methods 0.000 description 7
- 238000003780 insertion Methods 0.000 description 5
- 230000037431 insertion Effects 0.000 description 5
- 239000011159 matrix material Substances 0.000 description 5
- 230000008569 process Effects 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 3
- 238000004891 communication Methods 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 238000002372 labelling Methods 0.000 description 3
- 230000004913 activation Effects 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 238000013135 deep learning Methods 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 238000010801 machine learning Methods 0.000 description 2
- 210000004218 nerve net Anatomy 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- XCWPUUGSGHNIDZ-UHFFFAOYSA-N Oxypertine Chemical compound C1=2C=C(OC)C(OC)=CC=2NC(C)=C1CCN(CC1)CCN1C1=CC=CC=C1 XCWPUUGSGHNIDZ-UHFFFAOYSA-N 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 229960001948 caffeine Drugs 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000009223 counseling Methods 0.000 description 1
- 238000007418 data mining Methods 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000004880 explosion Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 210000003739 neck Anatomy 0.000 description 1
- 210000005036 nerve Anatomy 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000005192 partition Methods 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 230000035807 sensation Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- RYYVLZVUVIJVGH-UHFFFAOYSA-N trimethylxanthine Natural products CN1C(=O)N(C)C(=O)C2=C1N=CN2C RYYVLZVUVIJVGH-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/151—Transformation
- G06F40/154—Tree transformation for tree-structured or markup documents, e.g. XSLT, XSL-FO or stylesheets
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/151—Transformation
- G06F40/16—Automatic learning of transformation rules, e.g. from examples
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610546350.7A CN106202010B (zh) | 2016-07-12 | 2016-07-12 | 基于深度神经网络构建法律文本语法树的方法和装置 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610546350.7A CN106202010B (zh) | 2016-07-12 | 2016-07-12 | 基于深度神经网络构建法律文本语法树的方法和装置 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106202010A CN106202010A (zh) | 2016-12-07 |
CN106202010B true CN106202010B (zh) | 2019-11-26 |
Family
ID=57477432
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610546350.7A Active CN106202010B (zh) | 2016-07-12 | 2016-07-12 | 基于深度神经网络构建法律文本语法树的方法和装置 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106202010B (zh) |
Families Citing this family (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108268431B (zh) * | 2016-12-30 | 2019-12-03 | 北京国双科技有限公司 | 段落向量化的方法和装置 |
CN106952193A (zh) * | 2017-03-23 | 2017-07-14 | 北京华宇信息技术有限公司 | 一种基于模糊深度信念网络的刑事案件辅助决策方法 |
CN107066560B (zh) * | 2017-03-30 | 2019-12-06 | 东软集团股份有限公司 | 文本分类的方法和装置 |
CN107247613A (zh) * | 2017-04-25 | 2017-10-13 | 北京航天飞行控制中心 | 语句解析方法及语句解析装置 |
CN107301246A (zh) * | 2017-07-14 | 2017-10-27 | 河北工业大学 | 基于超深卷积神经网络结构模型的中文文本分类方法 |
US20190065486A1 (en) * | 2017-08-24 | 2019-02-28 | Microsoft Technology Licensing, Llc | Compression of word embeddings for natural language processing systems |
CN108133436A (zh) * | 2017-11-23 | 2018-06-08 | 科大讯飞股份有限公司 | 自动判案方法及系统 |
CN108021934B (zh) * | 2017-11-23 | 2022-03-04 | 创新先进技术有限公司 | 多要素识别的方法及装置 |
CN108062411A (zh) * | 2017-12-29 | 2018-05-22 | 深圳市智搜信息技术有限公司 | 一种查找电子元器件数据信息的系统及方法 |
CN108170848B (zh) * | 2018-01-18 | 2021-08-13 | 重庆邮电大学 | 一种面向中国移动智能客服的对话场景分类方法 |
CN108491381B (zh) * | 2018-03-13 | 2021-05-14 | 山西大学 | 一种汉语二分结构的句法分析方法 |
CN108920447B (zh) * | 2018-05-07 | 2022-08-05 | 国家计算机网络与信息安全管理中心 | 一种面向特定领域的中文事件抽取方法 |
CN108628834B (zh) * | 2018-05-14 | 2022-04-15 | 国家计算机网络与信息安全管理中心 | 一种基于句法依存关系的词语表示学习方法 |
CN110969018A (zh) * | 2018-09-30 | 2020-04-07 | 北京国双科技有限公司 | 案情描述要素提取方法、机器学习模型获得方法及装置 |
CN109388801B (zh) * | 2018-09-30 | 2023-07-14 | 创新先进技术有限公司 | 相似词集合的确定方法、装置和电子设备 |
CN111143707A (zh) * | 2018-11-05 | 2020-05-12 | 千寻位置网络有限公司 | 播发链路选择方法和装置 |
CN109977401A (zh) * | 2019-03-15 | 2019-07-05 | 上海火商智能科技有限公司 | 一种基于神经网络的语义识别方法 |
CN110046262B (zh) * | 2019-06-10 | 2021-03-12 | 南京擎盾信息科技有限公司 | 一种基于法律专家知识库的上下文推理方法 |
CN112632269A (zh) * | 2019-09-24 | 2021-04-09 | 北京国双科技有限公司 | 一种文档分类模型训练的方法和相关装置 |
CN111859407A (zh) * | 2019-10-16 | 2020-10-30 | 沈阳工业大学 | 基于候选池自收缩机制的文本自动生成隐写方法 |
CN111431540B (zh) * | 2020-04-01 | 2021-10-08 | 西安交通大学 | 一种基于神经网络模型的fpga配置文件算术压缩与解压方法 |
CN111460834B (zh) * | 2020-04-09 | 2023-06-06 | 北京北大软件工程股份有限公司 | 基于lstm网络的法条语义标注方法及装置 |
CN111814452A (zh) * | 2020-07-13 | 2020-10-23 | 四川长虹电器股份有限公司 | 一种影视领域基于神经网络的依存句法分析方法 |
CN112559713B (zh) * | 2020-12-24 | 2023-12-01 | 北京百度网讯科技有限公司 | 文本相关性判断方法及装置、模型、电子设备、可读介质 |
CN116363686B (zh) * | 2023-06-02 | 2023-08-11 | 深圳大学 | 一种在线社交网络视频平台来源检测方法及其相关设备 |
CN117591662B (zh) * | 2024-01-19 | 2024-03-29 | 川投信息产业集团有限公司 | 基于人工智能的数字化企业服务数据挖掘方法及系统 |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2005045695A1 (en) * | 2003-10-27 | 2005-05-19 | Educational Testing Service | Method and system for determining text coherence |
US7484219B2 (en) * | 2002-11-21 | 2009-01-27 | Microsoft Corporation | Synchronizing centralized data store from distributed independent data stores using fixed application programming interfaces |
CN102662931A (zh) * | 2012-04-13 | 2012-09-12 | 厦门大学 | 一种基于协同神经网络的语义角色标注方法 |
CN104008092A (zh) * | 2014-06-10 | 2014-08-27 | 复旦大学 | 一种基于语义空间映射的语义关系表征、聚类及识别的方法和系统 |
CN104021115A (zh) * | 2014-06-13 | 2014-09-03 | 北京理工大学 | 基于神经网络的中文比较句识别方法及装置 |
CN104462066A (zh) * | 2014-12-24 | 2015-03-25 | 北京百度网讯科技有限公司 | 语义角色标注方法及装置 |
-
2016
- 2016-07-12 CN CN201610546350.7A patent/CN106202010B/zh active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7484219B2 (en) * | 2002-11-21 | 2009-01-27 | Microsoft Corporation | Synchronizing centralized data store from distributed independent data stores using fixed application programming interfaces |
WO2005045695A1 (en) * | 2003-10-27 | 2005-05-19 | Educational Testing Service | Method and system for determining text coherence |
CN102662931A (zh) * | 2012-04-13 | 2012-09-12 | 厦门大学 | 一种基于协同神经网络的语义角色标注方法 |
CN104008092A (zh) * | 2014-06-10 | 2014-08-27 | 复旦大学 | 一种基于语义空间映射的语义关系表征、聚类及识别的方法和系统 |
CN104021115A (zh) * | 2014-06-13 | 2014-09-03 | 北京理工大学 | 基于神经网络的中文比较句识别方法及装置 |
CN104462066A (zh) * | 2014-12-24 | 2015-03-25 | 北京百度网讯科技有限公司 | 语义角色标注方法及装置 |
Non-Patent Citations (1)
Title |
---|
采用连续词袋模型(CBOW)的领域术语自动抽取研究;姜霖 等;《现代图书情报技术》;20160225;论文第3节 * |
Also Published As
Publication number | Publication date |
---|---|
CN106202010A (zh) | 2016-12-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106202010B (zh) | 基于深度神经网络构建法律文本语法树的方法和装置 | |
CN111177394B (zh) | 基于句法注意力神经网络的知识图谱关系数据分类方法 | |
CN110162749B (zh) | 信息提取方法、装置、计算机设备及计算机可读存储介质 | |
CN107291693B (zh) | 一种改进词向量模型的语义计算方法 | |
CN106980683B (zh) | 基于深度学习的博客文本摘要生成方法 | |
CN109508377A (zh) | 基于融合模型的文本特征提取方法、装置、聊天机器人和存储介质 | |
CN110516245A (zh) | 细粒度情感分析方法、装置、计算机设备及存储介质 | |
CN109299341A (zh) | 一种基于字典学习的对抗跨模态检索方法和系统 | |
CN109271493A (zh) | 一种语言文本处理方法、装置和存储介质 | |
CN110083710A (zh) | 一种基于循环神经网络与潜变量结构的词语定义生成方法 | |
CN110222163A (zh) | 一种融合cnn与双向lstm的智能问答方法及系统 | |
CN111914067A (zh) | 中文文本匹配方法及系统 | |
CN113704460B (zh) | 一种文本分类方法、装置、电子设备和存储介质 | |
CN108108468A (zh) | 一种基于概念和文本情感的短文本情感分析方法和装置 | |
CN113901191A (zh) | 问答模型的训练方法及装置 | |
CN111597341B (zh) | 一种文档级关系抽取方法、装置、设备及存储介质 | |
CN110084323A (zh) | 端到端语义解析系统及训练方法 | |
CN111145914B (zh) | 一种确定肺癌临床病种库文本实体的方法及装置 | |
CN115438674A (zh) | 实体数据处理、实体链接方法、装置和计算机设备 | |
CN114254645A (zh) | 一种人工智能辅助写作系统 | |
CN114841353A (zh) | 一种融合句法信息的量子语言模型建模系统及其应用 | |
CN114417823A (zh) | 一种基于句法和图卷积网络的方面级情感分析方法及装置 | |
CN114490926A (zh) | 一种相似问题的确定方法、装置、存储介质及终端 | |
CN114282528A (zh) | 一种关键词提取方法、装置、设备及存储介质 | |
Bai et al. | Gated character-aware convolutional neural network for effective automated essay scoring |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CP02 | Change in the address of a patent holder |
Address after: 400000 6-1, 6-2, 6-3, 6-4, building 7, No. 50, Shuangxing Avenue, Biquan street, Bishan District, Chongqing Patentee after: CHONGQING ZHAOGUANG TECHNOLOGY CO.,LTD. Address before: 400000 2-2-1, 109 Fengtian Avenue, tianxingqiao, Shapingba District, Chongqing Patentee before: CHONGQING ZHAOGUANG TECHNOLOGY CO.,LTD. |
|
CP02 | Change in the address of a patent holder | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: Method and device for constructing legal text syntax tree based on deep neural network Effective date of registration: 20221115 Granted publication date: 20191126 Pledgee: Bishan sub branch of Chongqing Three Gorges Bank Co.,Ltd. Pledgor: CHONGQING ZHAOGUANG TECHNOLOGY CO.,LTD. Registration number: Y2022980021313 |
|
PC01 | Cancellation of the registration of the contract for pledge of patent right | ||
PC01 | Cancellation of the registration of the contract for pledge of patent right |
Granted publication date: 20191126 Pledgee: Bishan sub branch of Chongqing Three Gorges Bank Co.,Ltd. Pledgor: CHONGQING ZHAOGUANG TECHNOLOGY CO.,LTD. Registration number: Y2022980021313 |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: Method and device for constructing a legal text syntax tree based on deep neural networks Granted publication date: 20191126 Pledgee: Bishan sub branch of Chongqing Three Gorges Bank Co.,Ltd. Pledgor: CHONGQING ZHAOGUANG TECHNOLOGY CO.,LTD. Registration number: Y2024500000034 |