CN104598530B - A method for domain term extraction - Google Patents
A method for domain term extraction
- Publication number
- CN104598530B (application CN201410831590.2A)
- Authority
- CN
- China
- Prior art keywords
- candidate terms
- morpheme
- probability
- word
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/211—Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/216—Parsing using statistical methods
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Probability & Statistics with Applications (AREA)
- Machine Translation (AREA)
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410831590.2A CN104598530B (zh) | 2014-12-26 | 2014-12-26 | A method for domain term extraction |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410831590.2A CN104598530B (zh) | 2014-12-26 | 2014-12-26 | A method for domain term extraction |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104598530A (zh) | 2015-05-06 |
CN104598530B (zh) | 2018-06-05 |
Family
ID=53124315
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410831590.2A (Active) CN104598530B (zh) | A method for domain term extraction | 2014-12-26 | 2014-12-26 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104598530B (zh) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
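CN105550200A (zh) * | 2015-12-02 | 2016-05-04 | Beijing Information Science and Technology University | A Chinese word segmentation method for patent abstracts |
CN107463548B (zh) * | 2016-06-02 | 2021-04-27 | Alibaba Group Holding Ltd. | Phrase mining method and device |
CN106445921B (zh) * | 2016-09-29 | 2019-05-07 | Beijing Institute of Technology | Chinese text term extraction method using quadratic mutual information |
CN106649277B (zh) * | 2016-12-29 | 2020-07-03 | Yulianwang (Wuhan) Information Technology Co., Ltd. | A dictionary entry method and system |
CN109710947B (zh) * | 2019-01-22 | 2021-09-07 | Fujian Yirong Information Technology Co., Ltd. | Method and device for generating an electric power professional lexicon |
CN114841175A (zh) * | 2022-04-22 | 2022-08-02 | Beijing Baidu Netcom Science and Technology Co., Ltd. | Machine translation method, apparatus, device, and storage medium |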
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
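CN101122919A (zh) * | 2007-09-14 | 2008-02-13 | Institute of Computing Technology, Chinese Academy of Sciences | A professional term extraction method and system |
CN103778243A (zh) * | 2014-02-11 | 2014-05-07 | Beijing Information Science and Technology University | A domain term extraction method |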
Non-Patent Citations (3)
Title |
---|
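A Statistical Corpus-Based Term Extractor; Patrick Pantel et al.; Springer Berlin Heidelberg; 2000-12-31; full text * |
An automatic term recognition method based on weighted voting; You Hongliang et al.; Journal of Chinese Information Processing; 2011-05-31; Vol. 25, No. 3; Section 3.2 * |
Research on recognition of out-of-vocabulary words in specialized domains; Ju Fei; China Master's Theses Full-text Database, Information Science and Technology; 2013-12-15; Vol. 2013, No. S2; abstract, main text p. 17 para. 2, p. 20 para. 3, pp. 36, 40, 41, p. 45 Section 7.1 * |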
Also Published As
Publication number | Publication date |
---|---|
CN104598530A (zh) | 2015-05-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
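CN104572622B (zh) | | A term screening method |
CN104598530B (zh) | | A method for domain term extraction |
CN103123618B (zh) | | Text similarity acquisition method and device |
CN106897559B (zh) | | A method and device for recognizing symptom and sign entities across multiple data sources |
CN103678684B (zh) | | A Chinese word segmentation method based on navigation information retrieval |
CN104636466B (zh) | | An entity attribute extraction method and system for open web pages |
CN104298662B (zh) | | A machine translation method and translation system based on organic compound named entities |
CN106407235B (zh) | | A semantic dictionary construction method based on review data |
CN103324626B (zh) | | A method for building a multi-granularity dictionary, a word segmentation method, and devices therefor |
CN104778256B (zh) | | A fast incremental clustering method for queries in a domain question answering system |
CN107391495B (zh) | | A sentence alignment method for bilingual parallel corpora |
CN108062305B (zh) | | An iteration-based three-step unsupervised Chinese word segmentation method |
CN108845982A (zh) | | A Chinese word segmentation method based on word association features |
CN105956158B (zh) | | Method for automatically extracting new Internet words based on massive microblog text and user information |
CN113705226B (zh) | | Medical text entity annotation method and device |
CN103955450A (zh) | | An automatic new word extraction method |
CN106156013B (zh) | | A two-stage machine translation method that prioritizes fixed-collocation phrases |
CN105912522A (zh) | | Automatic English corpus extraction method and extractor based on constituent analysis |
CN104598441B (zh) | | A method for splitting Chinese sentences by computer |
CN108268669A (zh) | | A key new word discovery method based on multi-dimensional word and sentence features and sentiment analysis |
CN101763403A (zh) | | Query translation method for multilingual information retrieval systems |
CN103336803B (zh) | | A computer generation method for Spring Festival couplets with embedded names |
CN106126497A (zh) | | A method for automatically mining corresponding citing fragments and original content fragments of cited documents |
CN101520775B (zh) | | A Chinese syntactic parsing and decoding method incorporating semantic information |
CN106933799A (zh) | | A Chinese word segmentation method and device for point-of-interest (POI) names |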
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
ASS | Succession or assignment of patent right | Owner name: WUHAN TRANSN INFORMATION TECHNOLOGY CO., LTD.; Free format text: FORMER OWNER: YULIANWANG (WUHAN) INFORMATION TECHNOLOGY CO., LTD.; Effective date: 20150805 |
C41 | Transfer of patent application or patent right or utility model | |
TA01 | Transfer of patent application right | Effective date of registration: 20150805; Address after: 430073 East Lake Hubei Development Zone, Optics Valley Software Park, a phase of the west, South Lake Road South, Optics Valley Software Park, No. 2, No. 5, layer 205, six; Applicant after: Wuhan Transn Information Technology Co., Ltd.; Address before: 430073 East Lake Hubei Development Zone, Optics Valley Software Park, a phase of the west, South Lake Road South, Optics Valley Software Park, No. 2, No. 6, layer 206, six; Applicant before: Language network (Wuhan) Information Technology Co., Ltd. |
CB02 | Change of applicant information | Address after: 430070 East Lake Hubei Development Zone, Optics Valley Software Park, a phase of the west, South Lake Road South, Optics Valley Software Park, No. 2, No. 5, layer 205, six; Applicant after: Language network (Wuhan) Information Technology Co., Ltd.; Address before: 430073 East Lake Hubei Development Zone, Optics Valley Software Park, a phase of the west, South Lake Road South, Optics Valley Software Park, No. 2, No. 5, layer 205, six; Applicant before: Wuhan Transn Information Technology Co., Ltd. |
CB02 | Change of applicant information | |
GR01 | Patent grant | |
GR01 | Patent grant | |