CN103049568B - The method of the document classification to magnanimity document library - Google Patents
The method of the document classification to magnanimity document library Download PDFInfo
- Publication number
- CN103049568B CN103049568B CN201210593096.8A CN201210593096A CN103049568B CN 103049568 B CN103049568 B CN 103049568B CN 201210593096 A CN201210593096 A CN 201210593096A CN 103049568 B CN103049568 B CN 103049568B
- Authority
- CN
- China
- Prior art keywords
- document
- keyword
- word
- category
- term
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 20
- 230000011218 segmentation Effects 0.000 claims description 3
- 230000008878 coupling Effects 0.000 abstract description 9
- 238000010168 coupling process Methods 0.000 abstract description 9
- 238000005859 coupling reaction Methods 0.000 abstract description 9
- 241001076939 Artines Species 0.000 description 2
- 239000012141 concentrate Substances 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
Abstract
Description
Claims (3)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210593096.8A CN103049568B (en) | 2012-12-31 | 2012-12-31 | The method of the document classification to magnanimity document library |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210593096.8A CN103049568B (en) | 2012-12-31 | 2012-12-31 | The method of the document classification to magnanimity document library |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103049568A CN103049568A (en) | 2013-04-17 |
CN103049568B true CN103049568B (en) | 2016-05-18 |
Family
ID=48062208
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201210593096.8A Active CN103049568B (en) | 2012-12-31 | 2012-12-31 | The method of the document classification to magnanimity document library |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103049568B (en) |
Families Citing this family (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104679733B (en) * | 2013-11-26 | 2018-02-23 | 中国移动通信集团公司 | A kind of voice dialogue interpretation method, apparatus and system |
CN103729344B (en) * | 2013-12-30 | 2016-08-31 | 传神联合(北京)信息技术有限公司 | A kind of method of statement mark in document manuscript |
CN103729350B (en) * | 2013-12-30 | 2017-01-04 | 语联网(武汉)信息技术有限公司 | The preprocess method of various dimensions waiting for translating shelves |
CN103714051B (en) * | 2013-12-30 | 2016-05-18 | 传神联合(北京)信息技术有限公司 | A kind of preprocess method of waiting for translating shelves |
CN103955449B (en) * | 2014-04-21 | 2018-03-06 | 安一恒通(北京)科技有限公司 | The method and apparatus for positioning target sample |
CN104615772B (en) * | 2015-02-16 | 2017-11-03 | 重庆大学 | A kind of professional degree analyzing method of text evaluating data for ecommerce |
CN104778371A (en) * | 2015-04-21 | 2015-07-15 | 天脉聚源(北京)传媒科技有限公司 | Method and device for evaluating document content speciality |
WO2017117781A1 (en) * | 2016-01-07 | 2017-07-13 | 马岩 | Network information classification method and system |
CN106484788A (en) * | 2016-09-19 | 2017-03-08 | 合肥清浊信息科技有限公司 | Patent search system based on industry keyword |
CN107798074A (en) * | 2017-09-29 | 2018-03-13 | 汤东澜 | Information processing method and server |
CN108182182B (en) * | 2017-12-27 | 2021-09-10 | 传神语联网网络科技股份有限公司 | Method and device for matching documents in translation database and computer readable storage medium |
CN107992633B (en) * | 2018-01-09 | 2021-07-27 | 国网福建省电力有限公司 | Automatic electronic document classification method and system based on keyword features |
CN108572942A (en) * | 2018-04-20 | 2018-09-25 | 北京深度智耀科技有限公司 | A kind of method and apparatus creating hyperlink |
CN109543023B (en) * | 2018-09-29 | 2020-09-08 | 中国石油化工股份有限公司石油勘探开发研究院 | Document classification method and system based on trie and LCS algorithm |
US11144579B2 (en) * | 2019-02-11 | 2021-10-12 | International Business Machines Corporation | Use of machine learning to characterize reference relationship applied over a citation graph |
CN109871433B (en) * | 2019-02-21 | 2021-07-23 | 北京奇艺世纪科技有限公司 | Method, device, equipment and medium for calculating relevance between document and topic |
CN111274798B (en) * | 2020-01-06 | 2023-08-18 | 北京大米科技有限公司 | Text subject term determining method and device, storage medium and terminal |
CN111782601A (en) * | 2020-06-08 | 2020-10-16 | 北京海泰方圆科技股份有限公司 | Electronic file processing method and device, electronic equipment and machine readable medium |
CN112015884A (en) * | 2020-08-28 | 2020-12-01 | 欧冶云商股份有限公司 | Method and device for extracting keywords of user visiting data and storage medium |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101593200A (en) * | 2009-06-19 | 2009-12-02 | 淮海工学院 | Chinese Web page classification method based on the keyword frequency analysis |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7047236B2 (en) * | 2002-12-31 | 2006-05-16 | International Business Machines Corporation | Method for automatic deduction of rules for matching content to categories |
-
2012
- 2012-12-31 CN CN201210593096.8A patent/CN103049568B/en active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101593200A (en) * | 2009-06-19 | 2009-12-02 | 淮海工学院 | Chinese Web page classification method based on the keyword frequency analysis |
Also Published As
Publication number | Publication date |
---|---|
CN103049568A (en) | 2013-04-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103049568B (en) | The method of the document classification to magnanimity document library | |
US20170161375A1 (en) | Clustering documents based on textual content | |
US11176124B2 (en) | Managing a search | |
CN104679778B (en) | A kind of generation method and device of search result | |
CN109885773B (en) | Personalized article recommendation method, system, medium and equipment | |
US10019515B2 (en) | Attribute-based contexts for sentiment-topic pairs | |
US10579661B2 (en) | System and method for machine learning and classifying data | |
CN103593418B (en) | A kind of distributed motif discovery method and system towards big data | |
CN103823838B (en) | A kind of method of multi-format document typing and comparison | |
CN103258000A (en) | Method and device for clustering high-frequency keywords in webpages | |
CN105022827A (en) | Field subject-oriented Web news dynamic aggregation method | |
CN104199833A (en) | Network search term clustering method and device | |
CN103106245A (en) | Method which is used for classifying translation manuscript in automatic fragmentation mode and based on large-scale term corpus | |
CN110888981B (en) | Title-based document clustering method and device, terminal equipment and medium | |
CN115563313A (en) | Knowledge graph-based document book semantic retrieval system | |
CN109885641A (en) | A kind of method and system of database Chinese Full Text Retrieval | |
JP6047365B2 (en) | SEARCH DEVICE, SEARCH PROGRAM, AND SEARCH METHOD | |
US9256669B2 (en) | Stochastic document clustering using rare features | |
Amato et al. | YFCC100M hybridnet fc6 deep features for content-based image retrieval | |
CN107657067B (en) | Cosine distance-based leading-edge scientific and technological information rapid pushing method and system | |
CN111090743B (en) | Thesis recommendation method and device based on word embedding and multi-value form concept analysis | |
CN103793466A (en) | Image retrieval method and image retrieval device | |
CN110008407B (en) | Information retrieval method and device | |
CN113486148A (en) | PDF file conversion method and device, electronic equipment and computer readable medium | |
CN111639099A (en) | Full-text indexing method and system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CB03 | Change of inventor or designer information |
Inventor after: Jiang Chao Inventor after: Zhang Pi Inventor before: Jiang Chao |
|
COR | Change of bibliographic data | ||
C56 | Change in the name or address of the patentee | ||
CP03 | Change of name, title or address |
Address after: 430070 East Lake Hubei Development Zone, Optics Valley Software Park, a phase of the west, South Lake Road South, Optics Valley Software Park, No. 2, No. 5, layer 205, six Patentee after: Language network (Wuhan) Information Technology Co., Ltd. Address before: 430073 East Lake Hubei Development Zone, Optics Valley Software Park, a phase of the west, South Lake Road South, Optics Valley Software Park, No. 2, No. 5, layer 205, six Patentee before: Wuhan Transn Information Technology Co., Ltd. |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: Method for classifying documents in mass document library Effective date of registration: 20181115 Granted publication date: 20160518 Pledgee: Bank of Communications Co., Ltd. Wuhan Branch of Hubei Free Trade Experimental Zone Pledgor: Language network (Wuhan) Information Technology Co., Ltd. Registration number: 2018420000061 |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
PC01 | Cancellation of the registration of the contract for pledge of patent right |
Date of cancellation: 20200617 Granted publication date: 20160518 Pledgee: Bank of Communications Co.,Ltd. Wuhan Branch of Hubei Free Trade Experimental Zone Pledgor: IOL (WUHAN) INFORMATION TECHNOLOGY Co.,Ltd. Registration number: 2018420000061 |
|
PC01 | Cancellation of the registration of the contract for pledge of patent right |