CN100435145C - 一种基于句子关系图的多文档摘要方法 - Google Patents
一种基于句子关系图的多文档摘要方法 Download PDFInfo
- Publication number
- CN100435145C CN100435145C CNB2006100725868A CN200610072586A CN100435145C CN 100435145 C CN100435145 C CN 100435145C CN B2006100725868 A CNB2006100725868 A CN B2006100725868A CN 200610072586 A CN200610072586 A CN 200610072586A CN 100435145 C CN100435145 C CN 100435145C
- Authority
- CN
- China
- Prior art keywords
- sentence
- relation
- document
- graph
- sentences
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims abstract description 78
- 239000011159 matrix material Substances 0.000 claims description 52
- 238000009792 diffusion process Methods 0.000 claims description 8
- 230000015572 biosynthetic process Effects 0.000 claims description 4
- 238000013016 damping Methods 0.000 claims description 4
- 238000000354 decomposition reaction Methods 0.000 claims description 3
- 238000012804 iterative process Methods 0.000 claims description 3
- 238000011156 evaluation Methods 0.000 abstract description 17
- 230000000694 effects Effects 0.000 abstract description 15
- 239000000284 extract Substances 0.000 description 6
- 238000012360 testing method Methods 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 5
- 238000000605 extraction Methods 0.000 description 5
- 238000003058 natural language processing Methods 0.000 description 4
- 238000010586 diagram Methods 0.000 description 3
- JEIPFZHSYJVQDO-UHFFFAOYSA-N iron(III) oxide Inorganic materials O=[Fe]O[Fe]=O JEIPFZHSYJVQDO-UHFFFAOYSA-N 0.000 description 3
- 230000000052 comparative effect Effects 0.000 description 2
- 101000911753 Homo sapiens Protein FAM107B Proteins 0.000 description 1
- 102100026983 Protein FAM107B Human genes 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000004836 empirical method Methods 0.000 description 1
- 239000012467 final product Substances 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 235000019988 mead Nutrition 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000010998 test method Methods 0.000 description 1
Images
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Document Processing Apparatus (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (11)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB2006100725868A CN100435145C (zh) | 2006-04-13 | 2006-04-13 | 一种基于句子关系图的多文档摘要方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB2006100725868A CN100435145C (zh) | 2006-04-13 | 2006-04-13 | 一种基于句子关系图的多文档摘要方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1828608A CN1828608A (zh) | 2006-09-06 |
CN100435145C true CN100435145C (zh) | 2008-11-19 |
Family
ID=36947000
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB2006100725868A Expired - Fee Related CN100435145C (zh) | 2006-04-13 | 2006-04-13 | 一种基于句子关系图的多文档摘要方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN100435145C (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111914083A (zh) * | 2019-05-10 | 2020-11-10 | 腾讯科技(深圳)有限公司 | 语句处理方法、装置及存储介质 |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101398814B (zh) * | 2007-09-26 | 2010-08-25 | 北京大学 | 一种同时抽取文档摘要和关键词的方法及系统 |
US9317593B2 (en) * | 2007-10-05 | 2016-04-19 | Fujitsu Limited | Modeling topics using statistical distributions |
CN101231634B (zh) * | 2007-12-29 | 2011-05-04 | 中国科学院计算技术研究所 | 一种多文档自动文摘方法 |
US8402369B2 (en) * | 2008-05-28 | 2013-03-19 | Nec Laboratories America, Inc. | Multiple-document summarization using document clustering |
JP2011227758A (ja) * | 2010-04-21 | 2011-11-10 | Sony Corp | 情報処理装置、情報処理方法及びプログラム |
CN102831119B (zh) * | 2011-06-15 | 2016-08-17 | 日电(中国)有限公司 | 短文本聚类设备及方法 |
CN104298709A (zh) * | 2014-09-05 | 2015-01-21 | 上海中和软件有限公司 | 基于句间关联图的文本主题挖掘方法 |
CN107766419B (zh) * | 2017-09-08 | 2021-08-31 | 广州汪汪信息技术有限公司 | 一种基于阈值去噪的TextRank文档摘要方法及装置 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1341899A (zh) * | 2000-09-07 | 2002-03-27 | 国际商业机器公司 | 为文字文档自动生成摘要的方法 |
US6678676B2 (en) * | 2000-06-09 | 2004-01-13 | Oracle International Corporation | Summary creation |
US6718346B1 (en) * | 2000-08-17 | 2004-04-06 | 3Com Corporation | Generating summary data for a requested time period having a requested start time and end time a plurality of data records |
CN1755696A (zh) * | 2004-09-29 | 2006-04-05 | 株式会社东芝 | 用于创建文档摘要的系统和方法 |
-
2006
- 2006-04-13 CN CNB2006100725868A patent/CN100435145C/zh not_active Expired - Fee Related
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6678676B2 (en) * | 2000-06-09 | 2004-01-13 | Oracle International Corporation | Summary creation |
US6718346B1 (en) * | 2000-08-17 | 2004-04-06 | 3Com Corporation | Generating summary data for a requested time period having a requested start time and end time a plurality of data records |
CN1341899A (zh) * | 2000-09-07 | 2002-03-27 | 国际商业机器公司 | 为文字文档自动生成摘要的方法 |
CN1755696A (zh) * | 2004-09-29 | 2006-04-05 | 株式会社东芝 | 用于创建文档摘要的系统和方法 |
Non-Patent Citations (2)
Title |
---|
一种新的句子相似度度量及其在文本自动摘要中的应用. 张奇,黄萱菁,吴立德.NCIRCS2004第一届全国信息检索与内容安全学术会议论文集. 2004 |
一种新的句子相似度度量及其在文本自动摘要中的应用. 张奇,黄萱菁,吴立德.NCIRCS2004第一届全国信息检索与内容安全学术会议论文集. 2004 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111914083A (zh) * | 2019-05-10 | 2020-11-10 | 腾讯科技(深圳)有限公司 | 语句处理方法、装置及存储介质 |
Also Published As
Publication number | Publication date |
---|---|
CN1828608A (zh) | 2006-09-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN100435145C (zh) | 一种基于句子关系图的多文档摘要方法 | |
CN101398814B (zh) | 一种同时抽取文档摘要和关键词的方法及系统 | |
Medelyan et al. | Mining meaning from Wikipedia | |
CN101446940B (zh) | 为文档集自动生成摘要的方法及装置 | |
Thakkar et al. | Graph-based algorithms for text summarization | |
Sarkar | Bengali text summarization by sentence extraction | |
CN100418093C (zh) | 一种基于簇排列的面向主题或查询的多文档摘要方法 | |
CN111177365A (zh) | 一种基于图模型的无监督自动文摘提取方法 | |
CN100511214C (zh) | 一种对文档集进行批量单文档摘要的方法及系统 | |
CN102622338A (zh) | 一种短文本间语义距离的计算机辅助计算方法 | |
CN109670039A (zh) | 基于三部图和聚类分析的半监督电商评论情感分析方法 | |
CN1158460A (zh) | 一种跨语种语料自动分类与检索方法 | |
CN107526841A (zh) | 一种基于Web的藏文文本自动摘要生成方法 | |
CN101382962A (zh) | 一种考虑概念抽象度的浅层分析自动文档综述方法 | |
CN1916904A (zh) | 一种基于文档扩展的单文档摘要方法 | |
CN115906805A (zh) | 基于词细粒度的长文本摘要生成方法 | |
CN103336803A (zh) | 一种嵌名春联的计算机生成方法 | |
CN101599075A (zh) | 汉语缩略语处理方法和装置 | |
Chen et al. | A query substitution-search result refinement approach for long query web searches | |
Liao et al. | Combining Language Model with Sentiment Analysis for Opinion Retrieval of Blog-Post. | |
Dray et al. | Opinion mining from blogs | |
Schilit et al. | Exploring a digital library through key ideas | |
Ramezani et al. | Automated text summarization: An overview | |
Li et al. | Keyphrase extraction and grouping based on association rules | |
Huang et al. | Learning to find comparable entities on the web |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20220915 Address after: 3007, Hengqin international financial center building, No. 58, Huajin street, Hengqin new area, Zhuhai, Guangdong 519031 Patentee after: New founder holdings development Co.,Ltd. Patentee after: Peking University Patentee after: PEKING University FOUNDER R & D CENTER Address before: 100871, fangzheng building, 298 Fu Cheng Road, Beijing, Haidian District Patentee before: PEKING UNIVERSITY FOUNDER GROUP Co.,Ltd. Patentee before: Peking University Patentee before: PEKING University FOUNDER R & D CENTER |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20230403 Address after: 100871 No. 5, the Summer Palace Road, Beijing, Haidian District Patentee after: Peking University Address before: 3007, Hengqin international financial center building, No. 58, Huajin street, Hengqin new area, Zhuhai, Guangdong 519031 Patentee before: New founder holdings development Co.,Ltd. Patentee before: Peking University Patentee before: PEKING University FOUNDER R & D CENTER |
|
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20081119 |