CN102521377B - 从文档处理系统的文档集合中筛选优质文档的方法及系统 - Google Patents
从文档处理系统的文档集合中筛选优质文档的方法及系统 Download PDFInfo
- Publication number
- CN102521377B CN102521377B CN201110428369.9A CN201110428369A CN102521377B CN 102521377 B CN102521377 B CN 102521377B CN 201110428369 A CN201110428369 A CN 201110428369A CN 102521377 B CN102521377 B CN 102521377B
- Authority
- CN
- China
- Prior art keywords
- document
- documents
- collection
- judged result
- quality
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 53
- 238000012545 processing Methods 0.000 title claims abstract description 29
- 238000012216 screening Methods 0.000 title abstract description 18
- 230000000052 comparative effect Effects 0.000 claims description 8
- 230000003245 working effect Effects 0.000 claims description 2
- 238000002372 labelling Methods 0.000 abstract 1
- 238000012163 sequencing technique Methods 0.000 abstract 1
- 230000002093 peripheral effect Effects 0.000 description 7
- 238000010586 diagram Methods 0.000 description 6
- 238000000605 extraction Methods 0.000 description 6
- 238000004590 computer program Methods 0.000 description 5
- 239000000463 material Substances 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 3
- 230000002349 favourable effect Effects 0.000 description 2
- 230000006399 behavior Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 239000000696 magnetic material Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
Images
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (7)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201110428369.9A CN102521377B (zh) | 2011-12-19 | 2011-12-19 | 从文档处理系统的文档集合中筛选优质文档的方法及系统 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201110428369.9A CN102521377B (zh) | 2011-12-19 | 2011-12-19 | 从文档处理系统的文档集合中筛选优质文档的方法及系统 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102521377A CN102521377A (zh) | 2012-06-27 |
CN102521377B true CN102521377B (zh) | 2014-02-05 |
Family
ID=46292290
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201110428369.9A Active CN102521377B (zh) | 2011-12-19 | 2011-12-19 | 从文档处理系统的文档集合中筛选优质文档的方法及系统 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102521377B (zh) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107463569A (zh) * | 2016-06-02 | 2017-12-12 | 索意互动(北京)信息技术有限公司 | 一种文献分析方法与装置 |
CN109726390B (zh) * | 2018-12-06 | 2023-07-21 | 天津字节跳动科技有限公司 | 文档处理方法、装置、电子设备和存储介质 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101046820A (zh) * | 2006-03-29 | 2007-10-03 | 国际商业机器公司 | 在web爬取过程期间给网站排优先级的系统和方法 |
US20090106221A1 (en) * | 2007-10-18 | 2009-04-23 | Microsoft Corporation | Ranking and Providing Search Results Based In Part On A Number Of Click-Through Features |
US7680812B2 (en) * | 2004-09-16 | 2010-03-16 | Telenor Asa | Method, system, and computer program product for searching for, navigating among, and ranking of documents in a personal web |
-
2011
- 2011-12-19 CN CN201110428369.9A patent/CN102521377B/zh active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7680812B2 (en) * | 2004-09-16 | 2010-03-16 | Telenor Asa | Method, system, and computer program product for searching for, navigating among, and ranking of documents in a personal web |
CN101046820A (zh) * | 2006-03-29 | 2007-10-03 | 国际商业机器公司 | 在web爬取过程期间给网站排优先级的系统和方法 |
US20090106221A1 (en) * | 2007-10-18 | 2009-04-23 | Microsoft Corporation | Ranking and Providing Search Results Based In Part On A Number Of Click-Through Features |
Also Published As
Publication number | Publication date |
---|---|
CN102521377A (zh) | 2012-06-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9171072B2 (en) | System and method for real-time dynamic measurement of best-estimate quality levels while reviewing classified or enriched data | |
CN108509417B (zh) | 标题生成方法及设备、存储介质、服务器 | |
CN109388712A (zh) | 一种基于机器学习的行业分类方法及终端设备 | |
CN108038119A (zh) | 利用新词发现投资标的的方法、装置及存储介质 | |
CN106951925A (zh) | 数据处理方法、装置、服务器及系统 | |
CN104281622A (zh) | 一种社交媒体中的信息推荐方法和装置 | |
CN109299258A (zh) | 一种舆情事件检测方法、装置及设备 | |
CN103150359B (zh) | 微博信息显示方法和装置 | |
CN111859830A (zh) | 一种验证计划及报告的生成方法、装置、设备及存储介质 | |
CN111652468A (zh) | 业务流程的生成方法、装置、存储介质及计算机设备 | |
CN103500158A (zh) | 批注电子文档的方法和装置 | |
CN108829651A (zh) | 一种公文处理的方法、装置、终端设备及存储介质 | |
CN108664471A (zh) | 文字识别纠错方法、装置、设备及计算机可读存储介质 | |
US10073938B2 (en) | Integrated circuit design verification | |
CN108681505A (zh) | 一种基于决策树的测试用例排序方法和装置 | |
CN113434542B (zh) | 数据关系识别方法、装置、电子设备及存储介质 | |
CN102521377B (zh) | 从文档处理系统的文档集合中筛选优质文档的方法及系统 | |
CN109472724A (zh) | 诉讼文书的自动生成方法及系统、电子设备 | |
CN110019556A (zh) | 一种话题新闻获取方法、装置及其设备 | |
CN105786929B (zh) | 一种信息监测方法及装置 | |
CN102651097A (zh) | 一种电子考评系统及电子考评方法 | |
CN107748711A (zh) | 自动优化Storm并行度的方法、终端设备及存储介质 | |
CN114021716A (zh) | 一种模型训练的方法、系统及电子设备 | |
CN110309047B (zh) | 一种测试点生成方法、装置及系统 | |
CN113515577A (zh) | 数据预处理方法及装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20120627 Assignee: Beijing Jingyudian Network Technology Co., Ltd. Assignor: Liu Songtao Contract record no.: 2015990000087 Denomination of invention: Method and system for screening high-quality documents from document collection of document processing system Granted publication date: 20140205 License type: Exclusive License Record date: 20150228 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20151230 Address after: Beijing City 100000 Dongcheng District Avenue No. 80 is International Building room 1106 Patentee after: Beijing Jingyudian Network Technology Co., Ltd. Address before: 100078 Beijing city Fengtai District Fangguyuan a District 17 Building 1 No. 1105 Patentee before: Liu Songtao |