CN102207945B - 基于知识网络的文本标引系统及其方法 - Google Patents
基于知识网络的文本标引系统及其方法 Download PDFInfo
- Publication number
- CN102207945B CN102207945B CN 201010168526 CN201010168526A CN102207945B CN 102207945 B CN102207945 B CN 102207945B CN 201010168526 CN201010168526 CN 201010168526 CN 201010168526 A CN201010168526 A CN 201010168526A CN 102207945 B CN102207945 B CN 102207945B
- Authority
- CN
- China
- Prior art keywords
- text
- word
- knowledge
- knowledge tree
- module
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims abstract description 29
- 238000000605 extraction Methods 0.000 claims abstract description 38
- 239000000284 extract Substances 0.000 claims description 11
- 230000000875 corresponding effect Effects 0.000 claims description 9
- 238000009412 basement excavation Methods 0.000 claims description 6
- 230000003993 interaction Effects 0.000 claims description 4
- 238000000528 statistical test Methods 0.000 claims description 3
- 230000002596 correlated effect Effects 0.000 claims description 2
- 238000012360 testing method Methods 0.000 claims description 2
- 230000000694 effects Effects 0.000 abstract description 5
- 238000002372 labelling Methods 0.000 abstract 1
- 238000000638 solvent extraction Methods 0.000 abstract 1
- 238000005516 engineering process Methods 0.000 description 10
- 239000004575 stone Substances 0.000 description 9
- 230000010365 information processing Effects 0.000 description 6
- 206010028916 Neologism Diseases 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 235000015842 Hesperis Nutrition 0.000 description 2
- 235000012633 Iberis amara Nutrition 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 230000008520 organization Effects 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 230000011218 segmentation Effects 0.000 description 2
- 239000000654 additive Substances 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 239000008358 core component Substances 0.000 description 1
- 238000007418 data mining Methods 0.000 description 1
- 238000009795 derivation Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 230000003203 everyday effect Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000000465 moulding Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 239000011295 pitch Substances 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
Images
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (5)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 201010168526 CN102207945B (zh) | 2010-05-11 | 2010-05-11 | 基于知识网络的文本标引系统及其方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 201010168526 CN102207945B (zh) | 2010-05-11 | 2010-05-11 | 基于知识网络的文本标引系统及其方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102207945A CN102207945A (zh) | 2011-10-05 |
CN102207945B true CN102207945B (zh) | 2013-10-23 |
Family
ID=44696783
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 201010168526 Expired - Fee Related CN102207945B (zh) | 2010-05-11 | 2010-05-11 | 基于知识网络的文本标引系统及其方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102207945B (zh) |
Families Citing this family (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102622451A (zh) * | 2012-04-16 | 2012-08-01 | 上海交通大学 | 电视节目标签自动生成系统 |
CN102819858B (zh) * | 2012-07-30 | 2015-07-01 | 北京中科盘古科技发展有限公司 | 一种动画素材组织和应用的方法 |
CN102855295A (zh) * | 2012-08-14 | 2013-01-02 | 周宇 | 一种基于个人能力发展需求描述的出版标签表达系统 |
CN103685409B (zh) * | 2012-09-18 | 2016-09-28 | 中国科学院声学研究所 | 一种面向自主服务的知识网络及其构建方法 |
CN103049490B (zh) * | 2012-12-05 | 2016-09-07 | 北京海量融通软件技术有限公司 | 知识网络节点间属性生成系统及生成方法 |
CN102999487B (zh) * | 2012-12-24 | 2015-06-24 | 中国科学院自动化研究所 | 一种数字出版资源语义增强描述系统及其方法 |
CN103744837B (zh) * | 2014-01-23 | 2017-01-04 | 北京优捷信达信息科技有限公司 | 基于关键词抽取的多文本对照方法 |
CN104090955A (zh) * | 2014-07-07 | 2014-10-08 | 科大讯飞股份有限公司 | 一种音视频标签自动标注方法及系统 |
CN104376044A (zh) * | 2014-10-16 | 2015-02-25 | 江苏博智软件科技有限公司 | 一种基于信息粒度的信息检索优化方法 |
CN104462063B (zh) * | 2014-12-12 | 2016-08-17 | 武汉大学 | 基于语义位置模型的位置信息结构化提取方法及系统 |
CN106355628B (zh) * | 2015-07-16 | 2019-07-05 | 中国石油化工股份有限公司 | 图文知识点标注方法和装置、图文标注的修正方法和系统 |
CN106649395B (zh) * | 2015-11-03 | 2021-05-25 | 腾讯科技(深圳)有限公司 | 网页更新方法和装置 |
CN105573968A (zh) * | 2015-12-10 | 2016-05-11 | 天津海量信息技术有限公司 | 基于规则的文本标引方法 |
CN108205564B (zh) * | 2016-12-19 | 2021-04-09 | 北大方正集团有限公司 | 知识体系构建方法及系统 |
CN106845798A (zh) * | 2016-12-29 | 2017-06-13 | 兰州大学淮安高新技术研究院 | 一种基于多叉树的跨领域专利预警信息分析方法 |
CN107679084B (zh) * | 2017-08-31 | 2021-09-28 | 平安科技(深圳)有限公司 | 聚类标签生成方法、电子设备及计算机可读存储介质 |
CN111199143A (zh) * | 2018-10-31 | 2020-05-26 | 北大方正集团有限公司 | Word论文的标引方法、装置、设备及存储介质 |
CN109657052B (zh) * | 2018-12-12 | 2023-01-03 | 中国科学院文献情报中心 | 一种论文摘要蕴含细粒度知识元的抽取方法及装置 |
CN110442670B (zh) * | 2019-06-11 | 2023-05-26 | 天津交通职业学院 | 一种基于文本标引的消费者画像生成方法 |
CN110414680A (zh) * | 2019-07-23 | 2019-11-05 | 国家计算机网络与信息安全管理中心 | 基于众包标注的知识加工系统 |
CN112215000B (zh) * | 2020-10-21 | 2022-08-23 | 重庆邮电大学 | 一种基于实体替换的文本分类方法 |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2005008521A1 (de) * | 2003-07-15 | 2005-01-27 | Siemens Aktiengesellschaft | Verfahren zur indizierung von strukturierten dokumenten |
US7734623B2 (en) * | 2006-11-07 | 2010-06-08 | Cycorp, Inc. | Semantics-based method and apparatus for document analysis |
US20090037408A1 (en) * | 2007-08-04 | 2009-02-05 | James Neil Rodgers | Essence based search engine |
CN101692240A (zh) * | 2009-08-14 | 2010-04-07 | 北京中献电子技术开发中心 | 一种基于规则的专利摘要自动抽取和关键词标引方法 |
-
2010
- 2010-05-11 CN CN 201010168526 patent/CN102207945B/zh not_active Expired - Fee Related
Non-Patent Citations (4)
Title |
---|
单永明.汉语文本的篇章结构及其标引算法的研究.《自然语言理解与机器翻译——全国第六届计算语言学联合学术会议论文集》.2001,227-232. |
彭俊.面向阅读的论文主题标引管理系统研究.《中国优秀硕士学位论文全文数据库》.2007, |
汉语文本的篇章结构及其标引算法的研究;单永明;《自然语言理解与机器翻译——全国第六届计算语言学联合学术会议论文集》;20011231;227-232 * |
面向阅读的论文主题标引管理系统研究;彭俊;《中国优秀硕士学位论文全文数据库》;20071016;全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN102207945A (zh) | 2011-10-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102207945B (zh) | 基于知识网络的文本标引系统及其方法 | |
CN101593200B (zh) | 基于关键词频度分析的中文网页分类方法 | |
CN110321925B (zh) | 一种基于语义聚合指纹的文本多粒度相似度比对方法 | |
CN100401300C (zh) | 具有自动分类功能的搜索引擎 | |
CN101079024B (zh) | 一种专业词表动态生成系统和方法 | |
CN102207946B (zh) | 一种知识网络的半自动生成方法 | |
CN104199972A (zh) | 一种基于深度学习的命名实体关系抽取与构建方法 | |
JP5605583B2 (ja) | 検索方法、類似度計算方法、類似度計算及び同一文書照合システムと、そのプログラム | |
CN102184262A (zh) | 基于web的文本分类挖掘系统及方法 | |
CN104199857A (zh) | 一种基于多标签分类的税务文档层次分类方法 | |
CN111104510B (zh) | 一种基于词嵌入的文本分类训练样本扩充方法 | |
CN106202065B (zh) | 一种跨语言话题检测方法及系统 | |
CN103678412A (zh) | 一种文档检索的方法及装置 | |
Ritu et al. | Performance analysis of different word embedding models on bangla language | |
CN107357895B (zh) | 一种基于词袋模型的文本表示的处理方法 | |
Sun et al. | Towards effective short text deep classification | |
Ye et al. | A web services classification method based on GCN | |
Bellaachia et al. | Hg-rank: A hypergraph-based keyphrase extraction for short documents in dynamic genre | |
CN113515632A (zh) | 基于图路径知识萃取的文本分类方法 | |
CN114491062B (zh) | 一种融合知识图谱和主题模型的短文本分类方法 | |
Liu et al. | Internet news headlines classification method based on the n-gram language model | |
Ma et al. | Feature-enriched word embeddings for named entity recognition in open-domain conversations | |
Dang et al. | WordNet-based suffix tree clustering algorithm | |
CN111061939B (zh) | 基于深度学习的科研学术新闻关键字匹配推荐方法 | |
Yang et al. | Web service clustering method based on word vector and biterm topic model |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C56 | Change in the name or address of the patentee | ||
CP03 | Change of name, title or address |
Address after: 300020 Tianjin Heping District, South Road, No. 11 International Building 23 purchase of Wheat Patentee after: TIANJIN HYLANDA INFORMATION TECHNOLOGY CO.,LTD. Address before: 300384 Tianjin City Huayuan Industrial Zone Rong Yuan Road No. 1 North B room 322-323 Patentee before: HYLANDA INFORMATION TECHNOLOGY Co.,Ltd. |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: Knowledge network-based text indexing system and method Effective date of registration: 20161128 Granted publication date: 20131023 Pledgee: Beijing technology intellectual property financing Company limited by guarantee Pledgor: TIANJIN HYLANDA INFORMATION TECHNOLOGY CO.,LTD. Registration number: 2016990001027 |
|
PLDC | Enforcement, change and cancellation of contracts on pledge of patent right or utility model | ||
PC01 | Cancellation of the registration of the contract for pledge of patent right | ||
PC01 | Cancellation of the registration of the contract for pledge of patent right |
Date of cancellation: 20180410 Granted publication date: 20131023 Pledgee: Beijing technology intellectual property financing Company limited by guarantee Pledgor: TIANJIN HYLANDA INFORMATION TECHNOLOGY CO.,LTD. Registration number: 2016990001027 |
|
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20131023 |