CN107463569A - A kind of document analysis method and apparatus - Google Patents

A kind of document analysis method and apparatus Download PDF

Info

Publication number
CN107463569A
CN107463569A CN201610390982.9A CN201610390982A CN107463569A CN 107463569 A CN107463569 A CN 107463569A CN 201610390982 A CN201610390982 A CN 201610390982A CN 107463569 A CN107463569 A CN 107463569A
Authority
CN
China
Prior art keywords
sort
document
mark
ranking results
unit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610390982.9A
Other languages
Chinese (zh)
Inventor
裘钢
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suoyi Interactive Beijing Information Technology Co ltd
Original Assignee
Suoyi Interactive Beijing Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suoyi Interactive Beijing Information Technology Co ltd filed Critical Suoyi Interactive Beijing Information Technology Co ltd
Priority to CN201610390982.9A priority Critical patent/CN107463569A/en
Publication of CN107463569A publication Critical patent/CN107463569A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/338Presentation of query results
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a kind of document analysis method and apparatus, this method includes:First sort by is determined according to the selection of user;The document in archives is ranked up according to first sort by, and the document for the most forward predetermined quantity that sorts is marked, to generate the first ranking results, the second sort by is determined according to the selection of user;First ranking results are ranked up according to second sort by, generate the second ranking results, second ranking results carry the mark.The present invention can be formed according to the ranking results after a variety of sort bies, and especially the document that belonged to according to the sequence of a variety of sort bies in the literature borders of most forward predetermined quantity can be marked.

Description

A kind of document analysis method and apparatus
Technical field
The present invention relates to technical field of data processing, more particularly to a kind of document analysis method and apparatus.
Background technology
With the continuous progress of science and technology, technical literature is more and more, especially, reflects the patent of scientific and technical innovation Document is also more and more.In the prior art, there are several searching platforms, the search condition inputted according to user, using the teaching of the invention it is possible to provide Meet the patent document of the search condition, but when showing patent document, current searching platform only can be according to one kind Sort by shown, current sort by have citation times, the degree of correlation and some there is the value of special algorithm Degree.When user needs to consider a variety of foundations to determine the value of patent or determine the patent for needing to pay close attention to, without any inspection Suo Pingtai can provide suitable display methods to solve this problem.
The content of the invention
In view of the above problems, the present invention proposes a kind of document analysis method, and this method includes:
First sort by is determined according to the selection of user;
The document in archives is ranked up according to first sort by, and to the most forward predetermined quantity that sorts Document be marked, to generate the first ranking results,
Second sort by is determined according to the selection of user;
First ranking results are ranked up according to second sort by, generate the second ranking results, it is described Second ranking results carry the mark.
Optionally, this method also includes:Judge to carry the row whether document of the mark belongs in the second ranking results The document of the most forward predetermined quantity of sequence, if it is, retaining mark, otherwise remove mark.
Optionally, this method also includes:
3rd sort by is determined according to the selection of user;
Second ranking results are ranked up according to the 3rd sort by, generate the 3rd ranking results;
Judge the most forward predetermined quantity of the sequence whether document for carrying the mark belongs in the 3rd ranking results Document, if it is, retaining mark, otherwise remove mark.
Optionally, first sort by, the second sort by and/or the 3rd sort by are:Citation times, quilt One in company's quantity of reference, the national quantity being cited, quantity of the same clan, feature degree, patent degree.
Optionally, this method also includes:Extraction carries markd document.
The present invention also provides a kind of document analysis device, including:
First sort by determining unit, for determining the first sort by according to the selection of user;
First sequence indexing unit, for being ranked up according to first sort by the document in archives, and First mark is carried out to the document for the most forward predetermined quantity that sorts, to generate the first ranking results,
Second sort by determining unit, for determining the second sort by according to the selection of user;
Second sequencing unit, for being ranked up according to second sort by first ranking results, generation Second ranking results, second ranking results carry first mark.
Optionally, the device also includes:Judging unit and mark processing unit, the judging unit are used to judge to carry institute State the document for the most forward predetermined quantity of the sequence whether document of mark belongs in the second ranking results;It is if it is, described Processing unit is marked to retain mark, otherwise the mark processing unit removes mark.
Optionally, the device also includes:
3rd sort by determining unit, for determining the 3rd sort by according to the selection of user;
3rd sequencing unit, for being ranked up according to the 3rd sort by second ranking results, generation 3rd ranking results;
The judging unit is additionally operable to judge to carry the sequence whether document of the mark belongs in the 3rd ranking results The document of most forward predetermined quantity, marked if it is, the mark processing unit retains, otherwise the mark processing unit Remove mark.
Optionally, first sort by, the second sort by and/or the 3rd sort by are:Citation times, quilt One in company's quantity of reference, the national quantity being cited, quantity of the same clan, feature degree, patent degree.
Optionally, the device also includes:Document extraction unit, markd document is carried for extracting.
The technical scheme provided in the embodiment of the present application, it can be formed according to the ranking results after a variety of sort bies, especially It is can to enter rower to the document belonged to according to the sequence of a variety of sort bies in the literature borders of most forward predetermined quantity Note, can extract the document for carrying out the mark, so automatically can show or extract important literature (important text Offer the document that can be high value document or there is high attention rate in some respect), carried to be further analyzed For data basis.
Brief description of the drawings
By reading the detailed description of hereafter preferred embodiment, it is various other the advantages of and benefit it is common for this area Technical staff will be clear understanding.Accompanying drawing is only used for showing the purpose of preferred embodiment, and is not considered as to the present invention Limitation.And in whole accompanying drawing, identical part is denoted by the same reference numerals.In the accompanying drawings:
Fig. 1 shows the flow chart according to document analysis method proposed by the invention;
Fig. 2 shows that the sort by of one embodiment of the invention chooses interface;
Fig. 3 shows the second sequence result interface of one embodiment of the invention;
Fig. 4 is shownThe interface of the cancellation mark of one embodiment of the invention;
Fig. 5 shows the structured flowchart of document analysis device proposed by the invention.
Embodiment
The exemplary embodiment of the disclosure is more fully described below with reference to accompanying drawings.Although the disclosure is shown in accompanying drawing Exemplary embodiment, it being understood, however, that may be realized in various forms the disclosure without should be by embodiments set forth here Limited.On the contrary, these embodiments are provided to facilitate a more thoroughly understanding of the present invention, and can be by the scope of the present disclosure Completely it is communicated to those skilled in the art.
The present invention proposes a kind of document analysis method, as shown in figure 1, this method includes:
S1. the first sort by is determined according to the selection of user;
S2. the document in archives is ranked up according to first sort by, and to most forward make a reservation for of sorting The document of quantity is marked, to generate the first ranking results,
S3. the second sort by is determined according to the selection of user;
S4. first ranking results are ranked up according to second sort by, generate the second ranking results, institute State the second ranking results and carry the mark.
Above-mentioned analysis method is carried out based on the archives got.Archives are exactly the set of more documents.Get The mode of archives have it is multiple, such as according to user input search conditions, searching system predetermined field is matched or Person carries out semantic retrieval and provides corresponding archives, and (text includes predetermined format to the text being input to according to user Document number) or cls files, clipbook file or pc files, obtain archives.
In the step S1, items selection first sort by of the user in sort by list, it is preferable that row Sequence is according to the project in list includes the applying date, citation times, by self reference number, (self reference refers to the document being cited Applicant either author with quote document application or author it is identical), by non-self reference number (by non-self reference refer to by Either author differs the applicant of the document of reference with the application of document quoted or author), company's quantity for being cited (quantity of the company belonging to the document of REFER object document), the national quantity being cited are (belonging to the document of REFER object document National quantity), quantity of the same clan, national quantity of the same clan (target literature it is of the same clan belonging to national quantity), feature degree (mesh Mark document independent claims technical characteristic number), patent degree (quantity of the claim of target literature), patent valency Value (economic value of technical scheme described in the target literature calculated according to certain algorithm or rule), patent group value (are pressed The economic value for the technology organized belonging to the target literature calculated according to certain algorithm or rule).The selection of first sort by by User determines according to point of interest, can select any one in above-mentioned project.
In the step S2, the document in archives is ranked up according to first sort by, and to sequence The document of most forward predetermined quantity is marked, most forward predetermined quantity preferably 10%, certainly, also can be according in archives The order of magnitude of document is adjusted flexibly.The mark of progress can be mark or User Defined fixed in system Mark, in general, mark be preferably letter, numeral or icon.If after being marked user think revocation mark or Person removes mark, the mark carried out can be cancelled or gone by the operation of progress " cancel and marking ", methods described Remove.
In the step S3, user can be selected according to the project in above-mentioned sort by list, with carrying out first Sort by is similar, and any one in above-mentioned project may be selected.But, for the effectiveness of sequence, it is typically chosen and first The sort by of sort by different dimensions (reflecting different aspect in other words).The value of target literature has been determined from multi-angle Or identification target literature.
In the application, the first ranking results are resequenced according to the second sort by, generate the second ranking results, The label added in step s 2 is carried in second ranking results.It can be shown according to the first sequence by such technological means Position of the document in the second ranking results according to predetermined quantity most forward when being ranked up, it is true so as to come from two kinds of dimensions Set the goal the value or importance of document.
As the first embodiment, this method may also include:Whether the document for judging to carry the mark belongs to The document of the most forward predetermined quantity that sorts in second ranking results, if it is, retaining mark, otherwise removes mark.Such as This only comes the document of most forward predetermined quantity according to the first sort by and can also come according to the second sort by The document of most forward predetermined quantity could retain mark.As second of preferred embodiment, in the first specific embodiment party On the basis of formula, this method can further comprise:The document in archives is extracted according to the mark.Pass through the technological means energy Enough be ranked up foundation and computing, i.e., comprehensive two kinds of sort bies carry out the mark of target literature, so as to efficiently, automatically Document basis is provided for manual depth's analysis of next step.
As the third embodiment, this method also includes:3rd sort by is determined according to the selection of user;Root Second ranking results are ranked up according to the 3rd sort by, generate the 3rd ranking results.One kind is achieved in that To the mark of the generation in the second ranking results without processing, directly it is ranked up, this mode can completely retain second The mark of sequence, and mainly show and entered according to the 3rd sort by document and the second sequence of most forward predetermined quantity Produce the position of the document of mark.Another implementation, also to judge whether the document for carrying the mark belongs to the 3rd again The document of the most forward predetermined quantity that sorts in ranking results, if it is, retaining mark, otherwise removes mark.This side Formula, sorted by third time, caused mark in the second sequence is further processed, i.e., three sort bies carried out With operation, that is to say, that the document for coming most forward predetermined quantity according to three kinds of sort by sequences is marked, this Kind mode directly reflects the result with operation, and is easy to carry out the extraction of document according to mark.Certainly, in specific implementation process, The method that the application is proposed is not limited to three minor sorts, as needed, can also carry out the 4th minor sort, the 5th row Sequence ... etc..
Scene is embodied as one kind, user needs to analyze the high value patent of invention of company of Haier, then passes through The patent of invention of the first archives, i.e. company of Haier is obtained in retrieval module input search condition ann/haier and na/1, Wherein ann represents applicant's title of standardization, and na represents patent of invention, first archives are imported into analysis module.Such as Fruit target is screening high value patent, as shown in Fig. 2 citation times may be selected as the first sort by, is sorted to first As a result the document for belonging to most forward predetermined quantity in is marked.It is optional from the value of patent from the aspect of innovative height Patent degree is selected as the second sort by, to be ranked up again according to the second sort by the first ranking results, and is judged Whether the document for carrying the mark belongs to the document of predetermined quantity most forward in the second ranking results, if it is, protecting Mark is stayed, otherwise removes mark, the mark of the second final ranking results is as shown in figure 3, mark therein uses change icon Mode, i.e., document form icon be changed to a mark.In general, it is believed that citation times it is high document it is very possible Advanced technology, and be many late comers according to this based on carry out next step research and extension, and the document that patent degree is high It is likely to be that patent has carried out comprehensive protection to it, layout is preferable.In the present embodiment, according to citation times and patent Degree most forward predetermined quantity document it is labeled user is out shown to prominent, so as to by be laid out, technology Patent document that is advanced and having very big market space highlights out.Operated according to the extraction of user, can be by labeled text Offer and extract, using the basis analyzed as next step manual depth.
It is worth special instruction, in order to distinguish the sequence being marked and the sequence without mark, side of the invention Method may include:Receive the beginning label ordering instruction of user's input;In response to the beginning label ordering instruction, selected according to user The sort by selected, it is ranked up and is marked;The end mark ordering instruction of user's input is received, in response to the end Tag sort is instructed, and no longer the mark of ranking results is adjusted and handled.As shown in Fig. 2 as a kind of specific embodiment party Formula, in sort menu, using bingo on as tag align sort sign on, terminate to refer to using bingo off as tag align sort Order.After user triggers tag align sort END instruction, analysis module is no longer modified to mark.Various analyses are carried out afterwards In, the mark done before can be all carried, if user no longer needs mark, the instruction for cancelling mark, analysis can be sent Module is removed according to the instruction to the mark made.
As shown in figure 5, the present invention also provides a kind of document analysis device, including:
First sort by determining unit 10, for determining the first sort by according to the selection of user;
First sequence indexing unit 20, for being ranked up according to first sort by the document in archives, And the first mark is carried out to the document for the most forward predetermined quantity that sorts, to generate the first ranking results,
Second sort by determining unit 30, for determining the second sort by according to the selection of user;
Second sequencing unit 40, it is raw for being ranked up according to second sort by first ranking results Into the second ranking results, second ranking results carry first mark.
Document analytical equipment can belong to a part for server side or high in the clouds or terminal, can be a kind of Browser or a kind of client software in terminal operating.Which kind of either above-mentioned structure setting, document analysis dress Putting can integrate with the retrieval module for obtaining archives, also can be independently.
In order to by the second minor sort, be optimized to the mark in the first ranking results, i.e., to according to the first sequence according to It is marked according to the document that most forward predetermined quantity is come with the second sort by, the device also includes:Judging unit and Processing unit is marked, the judging unit is used to judge to carry the row whether document of the mark belongs in the second ranking results The document of the most forward predetermined quantity of sequence;If it is, the mark processing unit retains mark, otherwise the mark processing is single Member removes mark.
The device is not limited in two minor sorts, and the device also includes:
3rd sort by determining unit, for determining the 3rd sort by according to the selection of user;
3rd sequencing unit, for being ranked up according to the 3rd sort by second ranking results, generation 3rd ranking results;
The judging unit is additionally operable to judge to carry the sequence whether document of the mark belongs in the 3rd ranking results The document of most forward predetermined quantity, marked if it is, the mark processing unit retains, otherwise the mark processing unit Remove mark.
Certainly, in specific implementation process, the method that the application is proposed is not limited to three minor sorts, as needed, also may be used Carry out the 4th minor sort, the 5th minor sort ... etc..As long as needs target literature is analyzed from various dimensions, also according to It is secondary set sequence foundation (dimension considered), also can triggered mark END instruction at any time, i.e., according to triggering the instruction before Foundation be ranked up and be marked, and according to trigger the instruction after foundation sorted merely, with observation before progress The trend of the document of mark, so as to provide the user the neatly technological means of mark and Ordination.
First sort by, the second sort by and/or the 3rd sort by are:Citation times, it is cited One in company's quantity, the national quantity being cited, quantity of the same clan, feature degree, patent degree, certain sort by is also not necessarily limited to Project listed above, as long as the feature being ranked up to document can be arranged to, sort by can be turned into.
Above-mentioned mark is used not only for display highlighting document, it may also be used for extracts emphasis document, it is preferred that the device is also Including:Document extraction unit, markd document is carried for extracting.
By means of the invention it is possible to using most important sort by as the first sort by, and to according to the first sequence according to It is marked according to the document for coming forward predetermined quantity, afterwards when choosing other sort bies, can on the one hand shows The whereabouts of document is marked, on the one hand mark can also be continued to optimize according to other sequences, so as to propose a variety of sort by phases With so that archives are ranked up with display, so as to which protrude the target literature of emphasis showing.Carried in the embodiment of the present application The technical scheme of confession, has at least the following technical effects or advantages:It can be formed according to the ranking results after a variety of sort bies, The document belonged to according to the sequence of a variety of sort bies in the literature borders of most forward predetermined quantity can especially be carried out Mark, can extract the document for carrying out the mark, so can automatically show or to extract important literature (important Document can be high value document or have the document of high attention rate in some respect), to be further analyzed Basis is provided.
Algorithm and display be not inherently related to any certain computer, virtual system or miscellaneous equipment provided herein. Various general-purpose systems can also be used together with teaching based on this.As described above, required by constructing this kind of system Structure be obvious.In addition, the present invention is not also directed to any certain programmed language.It should be understood that it can utilize various Programming language realizes the content of invention described herein, and the description done above to language-specific is to disclose this hair Bright preferred forms.
In the specification that this place provides, numerous specific details are set forth.It is to be appreciated, however, that the implementation of the present invention Example can be put into practice in the case of these no details.In some instances, known method, structure is not been shown in detail And technology, so as not to obscure the understanding of this description.
Similarly, it will be appreciated that in order to simplify the disclosure and help to understand one or more of each inventive aspect, Above in the description to the exemplary embodiment of the present invention, each feature of the invention is grouped together into single implementation sometimes In example, figure or descriptions thereof.However, the method for the disclosure should be construed to reflect following intention:I.e. required guarantor The application claims of shield features more more than the feature being expressly recited in each claim.It is more precisely, such as following Claims reflect as, inventive aspect is all features less than single embodiment disclosed above.Therefore, Thus the claims for following embodiment are expressly incorporated in the embodiment, wherein each claim is in itself Separate embodiments all as the present invention.
Those skilled in the art, which are appreciated that, to be carried out adaptively to the module in the equipment in embodiment Change and they are arranged in one or more equipment different from the embodiment.Can be the module or list in embodiment Member or component be combined into a module or unit or component, and can be divided into addition multiple submodule or subelement or Sub-component.In addition at least some in such feature and/or process or unit exclude each other, it can use any Combination is disclosed to all features disclosed in this specification (including adjoint claim, summary and accompanying drawing) and so to appoint Where all processes or unit of method or equipment are combined.Unless expressly stated otherwise, this specification (including adjoint power Profit requires, summary and accompanying drawing) disclosed in each feature can be by providing the alternative features of identical, equivalent or similar purpose come generation Replace.
In addition, it will be appreciated by those of skill in the art that although some embodiments in this include institute in other embodiments Including some features rather than further feature, but the combination of the feature of different embodiments means to be in the scope of the present invention Within and form different embodiments.For example, in the following claims, embodiment claimed it is any it One mode can use in any combination.
The all parts embodiment of the present invention can be realized with hardware, or to be run on one or more processor Software module realize, or realized with combinations thereof.It will be understood by those of skill in the art that it can use in practice Microprocessor or digital signal processor (DSP) are realized in gateway according to embodiments of the present invention, proxy server, system Some or all parts some or all functions.The present invention is also implemented as being used to perform side as described herein The some or all equipment or program of device (for example, computer program and computer program product) of method.It is such Realizing the program of the present invention can store on a computer-readable medium, or can have the shape of one or more signal Formula.Such signal can be downloaded from internet website and obtained, and either be provided or with any other shape on carrier signal Formula provides.
It should be noted that the present invention will be described rather than limits the invention for above-described embodiment, and ability Field technique personnel can design alternative embodiment without departing from the scope of the appended claims.In the claims, Any reference symbol between bracket should not be configured to limitations on claims.Word "comprising" does not exclude the presence of not Element or step listed in the claims.Word "a" or "an" before element does not exclude the presence of multiple such Element.The present invention can be by means of including the hardware of some different elements and being come by means of properly programmed computer real It is existing.In if the unit claim of equipment for drying is listed, several in these devices can be by same hardware branch To embody.The use of word first, second, and third does not indicate that any order.These words can be explained and run after fame Claim.

Claims (13)

  1. A kind of 1. document analysis method, it is characterised in that this method includes:
    First sort by is determined according to the selection of user;
    The document in archives is ranked up according to first sort by, and to the text for the most forward predetermined quantity that sorts Offer and be marked, to generate the first ranking results,
    Second sort by is determined according to the selection of user;
    First ranking results are ranked up according to second sort by, the second ranking results of generation, described second Ranking results carry the mark.
  2. 2. document analysis method according to claim 1, this method also include:Judge to carry the mark document whether The document of the most forward predetermined quantity that sorts belonged in the second ranking results, if it is, retaining mark, otherwise removes mark Note.
  3. 3. document analysis method according to claim 2, this method also include:
    3rd sort by is determined according to the selection of user;
    Second ranking results are ranked up according to the 3rd sort by, generate the 3rd ranking results.
  4. 4. document analysis method according to claim 3, this method also include:
    Judge the document of the most forward predetermined quantity of the sequence whether document for carrying the mark belongs in the 3rd ranking results, If it is, retaining mark, mark is otherwise removed.
  5. 5. according to the document analysis method described in claim any one of 1-4, first sort by, the second sort by And/or the 3rd sort by be:Citation times, the company's quantity being cited, the national quantity being cited, quantity of the same clan, spy One in sign degree, patent degree.
  6. 6. according to the document analysis method described in claim any one of 1-5, this method also includes:Display carries the mark The first ranking results, the second ranking results and/or the 3rd ranking results or extraction carry markd document.
  7. A kind of 7. document analysis device, it is characterised in that including:
    First sort by determining unit, for determining the first sort by according to the selection of user;
    First sequence indexing unit, for being ranked up according to first sort by the document in archives, and to row The document of the most forward predetermined quantity of sequence carries out the first mark, to generate the first ranking results,
    Second sort by determining unit, for determining the second sort by according to the selection of user;
    Second sequencing unit, for being ranked up according to second sort by first ranking results, generation second Ranking results, second ranking results carry first mark.
  8. 8. document analysis device according to claim 7, the device also include:Judging unit and mark processing unit, institute State judging unit and be used for most forward make a reservation for of sorting for judging whether the document for carrying the mark belongs in the second ranking results The document of quantity;If it is, the mark processing unit retains mark, otherwise the mark processing unit removes mark.
  9. 9. document analysis device according to claim 8, the device also include:
    3rd sort by determining unit, for determining the 3rd sort by according to the selection of user;
    3rd sequencing unit, for being ranked up according to the 3rd sort by second ranking results, generation the 3rd Ranking results.
  10. 10. document analysis device according to claim 9, the judging unit is additionally operable to judge the text for carrying the mark The document of the most forward predetermined quantity of the sequence whether belonged in the 3rd ranking results is offered, if it is, the mark processing is single Member retains mark, and otherwise the mark processing unit removes mark.
  11. 11. according to the document analysis device described in claim any one of 7-10, first sort by, the second sort by And/or the 3rd sort by be:Citation times, the company's quantity being cited, the national quantity being cited, quantity of the same clan, spy One in sign degree, patent degree.
  12. 12. according to the document analysis device described in claim any one of 7-11, the device also includes:Document display unit, use The first ranking results, the second ranking results and/or the 3rd ranking results of the mark are carried in display.
  13. 13. according to the document analysis device described in claim any one of 7-12, the device also includes document extraction unit, is used for Extraction carries the document of the mark.
CN201610390982.9A 2016-06-02 2016-06-02 A kind of document analysis method and apparatus Pending CN107463569A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610390982.9A CN107463569A (en) 2016-06-02 2016-06-02 A kind of document analysis method and apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610390982.9A CN107463569A (en) 2016-06-02 2016-06-02 A kind of document analysis method and apparatus

Publications (1)

Publication Number Publication Date
CN107463569A true CN107463569A (en) 2017-12-12

Family

ID=60545844

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610390982.9A Pending CN107463569A (en) 2016-06-02 2016-06-02 A kind of document analysis method and apparatus

Country Status (1)

Country Link
CN (1) CN107463569A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108846019A (en) * 2018-05-08 2018-11-20 北京市科学技术情报研究所 A kind of paper sort method based on gold reference algorithm
CN109063148A (en) * 2018-08-07 2018-12-21 黑龙江阳光惠远信息技术有限公司 A kind of related patents recommender system and recommended method based on third-party platform

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6654742B1 (en) * 1999-02-12 2003-11-25 International Business Machines Corporation Method and system for document collection final search result by arithmetical operations between search results sorted by multiple ranking metrics
US20050144162A1 (en) * 2003-12-29 2005-06-30 Ping Liang Advanced search, file system, and intelligent assistant agent
CN102521377A (en) * 2011-12-19 2012-06-27 刘松涛 Method and system for screening high-quality documents from document collection of document processing system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6654742B1 (en) * 1999-02-12 2003-11-25 International Business Machines Corporation Method and system for document collection final search result by arithmetical operations between search results sorted by multiple ranking metrics
US20050144162A1 (en) * 2003-12-29 2005-06-30 Ping Liang Advanced search, file system, and intelligent assistant agent
CN102521377A (en) * 2011-12-19 2012-06-27 刘松涛 Method and system for screening high-quality documents from document collection of document processing system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108846019A (en) * 2018-05-08 2018-11-20 北京市科学技术情报研究所 A kind of paper sort method based on gold reference algorithm
CN109063148A (en) * 2018-08-07 2018-12-21 黑龙江阳光惠远信息技术有限公司 A kind of related patents recommender system and recommended method based on third-party platform

Similar Documents

Publication Publication Date Title
Freudling et al. Automated data reduction workflows for astronomy-The ESO Reflex environment
Coe et al. Quantitative content analysis
CN104715064B (en) It is a kind of to realize the method and server that keyword is marked on webpage
CN103678509B (en) Generate the method and device of web page template
JP2006018693A (en) Similar source code extraction program, similar source code extraction device and similar source code extraction method
CN104050286A (en) Method and device for providing search result integration
WO2019074125A1 (en) System, method and program for automating business process that involves web browser operation
CN107678968A (en) Sample extraction method, apparatus, computing device and the storage medium of source code function
CN106599299A (en) Determining method and device of website key words
CN107463566A (en) A kind of document retrieval method and system
CN108062422B (en) Sorting method, intelligent terminal, system and storage medium for paging query
CN107463569A (en) A kind of document analysis method and apparatus
CN105956121A (en) Patent retrieval analysis auxiliary system and auxiliary method thereof
CN103838865B (en) For excavating the method and device of ageing kind of subpage
CN107608965A (en) Extracting method, electronic equipment and the storage medium of books the names of protagonists
CN103678601A (en) Model essay retrieval request processing method and device
CN106951405A (en) Data processing method and device based on typesetting engine
CN104239586B (en) A kind of method and apparatus of processing information material file
JP6394213B2 (en) Search program, search method, and information processing apparatus
CN103838877A (en) Method and device for pushing timeliness information webpage results based on search
KR20040099462A (en) System and method for client-side locale specific numeric format handling in a web environment
KR101589705B1 (en) purchase request book marc data implementation method
CN105573731A (en) Terminal time setting method and apparatus
Monaco Methods for in-sourcing authority control with MarcEdit, SQL, and regular expressions
CN107562753B (en) Index word-based analysis method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20171212

RJ01 Rejection of invention patent application after publication