CN102483749A - Method, system, and apparatus for delivering query results from an electronic document collection - Google Patents

Method, system, and apparatus for delivering query results from an electronic document collection

Info

Publication number
CN102483749A
Authority
CN
China
Prior art keywords
compilation
document
sections
chapters
concise
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2009801613414A
Other languages
Chinese (zh)
Other versions
CN102483749B (en)
Inventor
贾森·雷斯尼克
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
CPA Global FIP LLC
Original Assignee
FoundationIP LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by FoundationIP LLC
Publication of CN102483749A
Application granted
Publication of CN102483749B
Current legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/3331 Query processing
    • G06F16/334 Query execution
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/40 Data acquisition and logging
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F2216/00 Indexing scheme relating to additional aspects of information retrieval not explicitly covered by G06F16/00 and subgroups
    • G06F2216/11 Patent retrieval

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Hardware Design (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A method, system, and article are provided for efficiently and effectively searching an electronic document collection. Each of the documents in the collection is pre-divided into sub-sections. One or more profiles are created, each including a selection of at least one of the sections of the documents in the collection. In addition, a weight is assigned to each of the selected sections in the profile. Based upon the parameters of a query and selection of a profile, select sub-sections of each document are employed to compare query data to the underlying document collection. A compilation of documents is created with data matching the query data, and a relevancy score is computed for each document in the compilation. The relevancy score is then leveraged to sort the documents in a manner to convey relevancy to the query submission.

Description

Method, system, and apparatus for delivering query results from an electronic document collection
Technical Field
The present invention relates to electronic document collections, to submitting queries to such a collection, and to presenting the query results. More specifically, the invention relates to generating search profiles by assigning a weight to each section of the intellectual property documents to be searched, and to presenting the query results returned under at least one search profile according to their relevancy.
Background
Every intellectual property application submitted for examination to any patent office in the world must satisfy certain requirements, including that the claimed subject matter be novel, useful, and non-obvious. To prepare an application properly for examination, it helps to understand the earlier intellectual property documents in the relevant technical field (that is, the prior art), because only one patent can be granted for each invention. The process of identifying the prior art is a search. The search results generally help the drafter of the application to focus on subject matter that is patentable or protectable, and help to formulate a reasonable strategy for achieving the goals of the inventor or the intellectual property owner.
It is known that, before the technological revolution ushered in the current electronic information age, intellectual property searches were performed manually. The searcher reviewed a disclosure, determined the classification of its content under a classification system, and then searched the documents and records within that classification. It is recognized that a searcher intuitively reviews the appropriate sections of an intellectual property document based on the limited scope of the search being performed. With the advent of information technology, manual searching is no longer the most suitable approach, because most granted intellectual property and published applications exist only in electronic form. With intellectual property documents available in electronic format, strategies similar to those used in manual searching can also be applied to searching electronic intellectual property databases.
Different categories of search can be used to obtain different results. For example, a novelty search can be used to determine whether to file an intellectual property application. A product clearance search can be used to determine whether a product falls within the scope of the claims of existing intellectual property. An invalidity search can be used to determine whether the claims of granted intellectual property are valid, and so on. Existing electronic intellectual property search tools do not support these different categories of search. The searcher therefore bears the burden of limiting, according to the scope of the search, the sections of each intellectual property document that need to be reviewed. Because the number of granted intellectual property rights and published pending applications in the databases keeps growing, each search requires reviewing more relevant documents, which increases the searcher's burden.
Searchers therefore need a tool that identifies the results of a submitted query in a way that reduces the workload of evaluating those results, and that takes advantage of the electronic format of intellectual property documents. Such a tool should let the searcher use different sections of the intellectual property documents during a search, so that accurate, relevant, and desired search results are identified more efficiently and effectively.
Summary of the Invention
The present invention comprises a method, system, and article for efficiently and effectively searching a patent document collection.
In one aspect of the invention, a computer-implemented method is provided for assigning relevancy to search results from an electronic document collection. A collection of patent documents is compiled and indexed, each document in the collection having a plurality of sections. Each section of each document in the collection is identified. A search profile is created for the collection. The search profile comprises a selection of at least one identified section from each document in the collection. For each profile, a weight is assigned to each selected section. When a query is submitted to the collection, a search profile is selected, and the query data is compared with the data in the identified, weighted sections of the documents in the collection. A relevancy score is computed for each document in the returned compilation, the compilation being created in response to the submitted query. The documents in the compilation are ranked based on the computed relevancy scores. The results of the compilation are then dynamically restricted based on the ranking. Based on the dynamic restriction applied, a first compilation of the ranked relevant documents is generated.
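The last steps of the method (ranking by relevancy score, then dynamically restricting the compilation) can be sketched as follows. This is a minimal illustration, not the patented procedure: the function name `rank_and_restrict`, the `min_fraction` parameter, and the rule of keeping documents that score at least a fraction of the top score are all assumptions made for the example, since the summary does not say how the restriction is computed.

```python
from typing import Dict, List, Tuple


def rank_and_restrict(
    scores: Dict[str, float], min_fraction: float = 0.5
) -> List[Tuple[str, float]]:
    """Rank documents by relevancy score, highest first, then keep only
    those scoring at least min_fraction of the top score.

    The cutoff rule is a hypothetical stand-in for the "dynamic
    restriction" named in the text.
    """
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    if not ranked:
        return []
    cutoff = ranked[0][1] * min_fraction
    return [(doc_id, score) for doc_id, score in ranked if score >= cutoff]


# Hypothetical relevancy scores for three documents in a returned compilation.
scores = {"doc-a": 9.0, "doc-b": 2.0, "doc-c": 6.0}
first_compilation = rank_and_restrict(scores, min_fraction=0.5)
print(first_compilation)
```

Because the cutoff is derived from the top score of each result set rather than fixed in advance, the restriction adapts to whatever the query returns, which is one plausible reading of "dynamic".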
In another aspect of the invention, a computer system is provided that includes a processor in communication with a storage medium on which an electronic document collection is stored. The electronic document collection is a compilation of intellectual property documents. Each document in the collection has a plurality of sections based on the characteristics of intellectual property documents. A controller compiles and indexes the document collection. The controller is in communication with a document manager, which identifies each section of each document in the collection. In addition, a profile manager creates search profiles for the collection. The profile manager is in communication with the document manager and selects identified sections of each document in the compiled collection for inclusion in a search profile. Besides selecting particular sections for the profile, the profile manager assigns a weight to each selected section in each profile. The weight reflects the importance of the associated section. At query time, a query manager submits a query to the document collection. The query includes selecting at least one search profile and comparing the query data with the data in each document section reflected in that profile. After the query manager submits the query, a compilation of relevant patent documents is generated and returned. Each document in the returned compilation contains a match between the query and data in at least one identified profile section that has an assigned weight, together with a relevancy score. A relevancy navigator is also provided in communication with the query manager; it ranks the documents in the compilation and dynamically restricts the results of the compilation based on the ranking. Based on the dynamic restriction applied, a first compilation of ranked relevant documents is generated.
In yet another aspect of the invention, an article is provided with a computer-readable carrier that includes computer program instructions for assigning relevancy to search results from an electronic document collection held in computer memory. Instructions are provided for compiling and indexing a collection of intellectual property documents. Each patent document in the collection is divided into a plurality of sections. After the collection is indexed, instructions are provided for identifying each section of each document in the collection. Once the sections of the documents are identified, instructions are provided for creating a search profile for the collection. The search profile is a selection of identified sections from each document in the collection. In addition, instructions are provided for assigning a weight to each identified section in the search profile. When a query is submitted to the collection, instructions are provided for selecting at least one search profile and comparing the query data with the data in the identified profile sections of the documents in the collection. Instructions are then provided for computing a relevancy score for each document in the returned compilation and for ranking the documents in the compilation based on the scores. Once the ranking is complete, instructions are provided for dynamically restricting the results of the compilation based on the ranking. Based on the dynamic restriction applied to the compilation, a first compilation of ranked relevant documents is generated and returned.
Other features and advantages of the present invention will become apparent from the following description of the preferred embodiments, taken in conjunction with the accompanying drawings.
Brief Description of the Drawings
The drawings referenced here form part of the specification. Unless expressly stated otherwise, the features shown in the drawings illustrate only some embodiments of the invention, not all embodiments, and no implication to the contrary is otherwise to be made.
Fig. 1 is a flow chart illustrating a process for identifying the sections of a patent document in order to generate one or more profiles.
Fig. 2 is a flow chart illustrating a process for generating secondary weights for one or more profiles.
Fig. 3 is a flow chart illustrating a process for using secondary weights to reflect the position at which a string match occurs within each profile section.
Fig. 4 is a flow chart illustrating a process for generating a secondary profile and assigning weights to the search results returned by a submitted query.
Fig. 5 is a flow chart illustrating a process for applying a secondary profile to a set of query results.
Fig. 6 is a flow chart illustrating a process for sorting query results.
Fig. 7 is a flow chart illustrating a process, according to the preferred embodiment of the invention, for assigning relevancy to the returned and sorted results.
Fig. 8 is a flow chart illustrating a process for dynamically restricting the display of query results from the underlying document collection.
Fig. 9 is a flow chart illustrating a process for using a graphical user interface as a tool to dynamically set restrictions on the query results from the underlying document collection.
Fig. 10 is a block diagram of an embodiment of the graphical user interface.
Fig. 11 is a block diagram of a set of tools for sorting and parsing the query results from the underlying document collection.
Detailed Description
It will be readily understood that the components of the present invention, as generally described and illustrated in the figures, may be arranged and designed in a wide variety of different configurations. Thus, the following detailed description of the embodiments of the apparatus, system, and method of the invention, as presented in the figures, is not intended to limit the scope of the invention as claimed, but merely represents selected embodiments of the invention.
The functional units described in this specification are referred to as managers and controllers. A manager and/or controller may be implemented in programmable hardware devices such as field-programmable gate arrays, programmable array logic, or programmable logic devices. A manager and/or controller may also be implemented in software for execution by various types of processors. An identified manager and/or controller of executable code may, for instance, comprise one or more physical or logical blocks of computer instructions, which may be organized as an object, procedure, function, or other construct. Nevertheless, the executables of an identified manager and/or controller need not be physically located together, but may comprise disparate instructions stored in different locations which, when joined logically together, comprise the manager and/or controller and achieve its stated purpose.
Indeed, a manager and/or controller of executable code may be a single instruction or many instructions, and may even be distributed over several different code segments, among different applications, and across several memory devices. Similarly, operational data may be identified and illustrated within the manager and/or controller, and may be embodied in any suitable form and organized within any suitable type of data structure. The operational data may be collected as a single data set, or may be distributed over different locations including different storage devices, and may exist, at least partially, as electronic signals on a system or network.
Reference throughout this specification to "a select embodiment," "one embodiment," or "an embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the invention. Thus, appearances of the phrases "a select embodiment," "in one embodiment," or "in an embodiment" in various places throughout this specification do not necessarily refer to the same embodiment.
Furthermore, the described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. Numerous specific details are provided in the following description to give a thorough understanding of embodiments of the invention. One skilled in the relevant art will recognize, however, that the invention can be practiced without one or more of the specific details, or with other methods, components, materials, and so on. In other instances, well-known structures, materials, or operations are not shown or described in detail, to avoid obscuring aspects of the invention.
The illustrated embodiments of the invention are best understood by reference to the drawings, in which like parts are designated by like numerals throughout. The following description is intended only by way of example, and simply illustrates certain selected embodiments of devices, systems, and processes that are consistent with the invention as claimed.
Overview
A collection of intellectual property documents is a compilation of granted publications and published applications. A patent document collection is a subset of an intellectual property document collection. Patent documents take the form of granted patents and published applications. The difference between these two categories determines the value of the rights they confer: a granted patent is a property right that can be enforced in court, whereas a published application has not been granted and represents a pending patent right. Each patent document is parsed into a plurality of sections, each containing written words or phrases (also referred to as string data). To search the collection, each document is parsed according to its sections, and a weight is assigned to each parsed section of the intellectual property document. The weight is a numerical measure of the importance of one or more particular sections of the document for the purposes of a query. The selected document sections, together with the weights assigned to them, constitute a search profile. Depending on the scope of the search, the search can be confined to particular sections of the documents, or different weights can be assigned to data matching the query in each section of a document. Query results are displayed in response to a submitted query, and the relevancy of those results can be dynamically restricted. More specifically, the relevancy attached to the results can be dynamically adjusted based on statistical analysis of the query results, on the query results as a whole, and/or on characteristics of the search profile. The creation and selection of a search profile therefore bear directly on the quantification and display of the search results.
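To make the idea of a profile concrete, the sketch below models a search profile as a mapping from section names to weights and computes a relevancy score for one document. The scoring rule (weight multiplied by the number of case-insensitive string matches, summed over sections) is an assumption chosen for illustration; the text only says that a score is computed from weighted sections. The names `SearchProfile` and `relevancy_score`, the section names, and the sample weights are likewise hypothetical.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class SearchProfile:
    """A named selection of document sections with a weight for each."""
    name: str
    section_weights: Dict[str, float]  # section name -> weight; 0 omits it


def relevancy_score(
    document: Dict[str, str], query: str, profile: SearchProfile
) -> float:
    """Assumed scoring rule: sum over the profile's sections of
    weight * (number of case-insensitive occurrences of the query)."""
    needle = query.lower()
    score = 0.0
    for section, weight in profile.section_weights.items():
        text = document.get(section, "").lower()
        score += weight * text.count(needle)
    return score


# Hypothetical profile that stresses the claims and ignores the background.
profile = SearchProfile(
    "claims-focused", {"claims": 3.0, "abstract": 1.0, "background": 0.0}
)
doc = {
    "claims": "A widget comprising a hinge. The widget of claim 1.",
    "abstract": "A widget is disclosed.",
    "background": "Widget designs are old.",
}
print(relevancy_score(doc, "widget", profile))
```

A section with weight 0 contributes nothing, which matches the statement that a search can be confined to particular sections.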
Technical Details
In the following description of the embodiments, reference is made to the accompanying drawings, which form a part of the specification and which show specific embodiments in which the invention may be practiced. Other embodiments may be used, because structural changes may be made without departing from the scope of the invention.
It should be understood that the specification of a granted or published intellectual property document is divided into a plurality of sections. Each section is necessary for filing a complete application, and each serves its own purpose. The sections of an intellectual property document are not discussed in detail here. For purposes of this disclosure, however, the different sections of a patent (as an example of an intellectual property document) need to be identified. In most cases, each patent application includes a title, priority date, abstract, background, summary of the invention, brief description of the drawings (if any), drawings (if any), detailed description, and claims.
Different categories of search are used in the patent field depending on the purpose of the search. For example, an infringement search and/or product clearance search is concerned with the terms in the claims, so it focuses essentially on the claims of the documents in the collection. A validity and/or invalidity search is concerned with any known prior art, so the priority dates of the patent documents need to be identified. When an inventor wants to determine the novelty of an invention before or after filing a patent application, the inventor or the inventor's agent or representative can use a novelty search. This type of search does not emphasize the claims; it focuses instead on the embodiments of the invention. Therefore, as described here, each search assigns weights to the different sections of the patent documents in the collection.
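The relationship between search category and section emphasis can be illustrated with a small lookup table. Everything in this sketch is hypothetical: the weight values, the section names, and the `is_prior_art` date test are assumptions made to show the shape of the idea, not values or rules taken from the patent.

```python
from datetime import date
from typing import Dict, List

# Hypothetical weights per search category; 0.0 omits a section entirely.
PROFILES: Dict[str, Dict[str, float]] = {
    "infringement": {"claims": 1.0, "detailed_description": 0.0},
    "novelty": {"claims": 0.0, "detailed_description": 1.0},
    "invalidity": {"claims": 0.5, "detailed_description": 1.0},
}


def searched_sections(category: str) -> List[str]:
    """Return, in alphabetical order, the sections a category actually searches."""
    return sorted(s for s, w in PROFILES[category].items() if w > 0)


def is_prior_art(priority_date: date, critical_date: date) -> bool:
    """Assumed test for an invalidity search: a document qualifies only if
    its priority date precedes the critical date of the patent at issue."""
    return priority_date < critical_date


print(searched_sections("infringement"))
print(searched_sections("novelty"))
print(is_prior_art(date(2005, 1, 1), date(2009, 8, 1)))
```

Keeping the category-to-weights mapping as data rather than code means a new search category can be added without changing the search logic.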
Fig. 1 is a flow chart (100) illustrating a process for identifying the sections of a patent document in order to generate one or more profiles. Taking the United States Patent and Trademark Office as an example, under current practice each patent application filed with that office includes the following sections: title, background (which includes a statement of the technical field and a description of the prior art), summary of the invention, brief description of the drawings, drawings, detailed description of the preferred embodiment, claims, and abstract. In one embodiment, not all patent documents include drawings; examples are chemical documents and some foreign patents and patent documents. Similarly, in the practice of patent offices in other countries and regions, and in earlier domestic practice, patent documents may have a different number of sections, or the sections may appear in a different order. Therefore, before weights are assigned to one or more sections of the patent documents in the collection for the purposes of a query, the source of each document in the collection, the different sections of the document, and the order in which the sections are organized need to be identified.
First, the collection of patent documents is compiled and indexed (step 102). It is understood in the art that patents and patent publications are made up of a plurality of sections. After the documents are compiled, each section of each patent in the collection is identified (step 104). The variable N_Total is assigned the number of sections in the patent document (step 106). Different profiles are generated to satisfy different search needs. A profile is generated by assigning weights to different combinations of sections of the patent document and/or by omitting one or more sections from consideration during a search, where a section is omitted by assigning it a value of 0. At least one profile is generated so that a profile-based search can be performed. In one embodiment, however, multiple profiles are generated so that a profile can be selected to meet the needs of a specific search. Once the sections of the patent document have been identified at step 106, a counting variable X associated with the profile identifier is initialized to the integer 1 (step 108), and a counting variable N associated with the sections of the patent document is set to the integer 1 (step 110). Starting with section N of the patent document collection, it is determined whether section N is to be part of the profile being generated, profile X (step 112). If the determination at step 112 is positive, section N is added to profile X (step 114). Once section N has been selected, a primary weight is assigned to section N (step 116). The primary weight is a numerical value that indicates the importance of section N to profile X relative to the other sections of the patent document collection, including any previously selected sections and any sections still to be added to or omitted from the profile. After step 116, or if the determination at step 112 is negative, the variable N associated with the sections of the patent document is incremented (step 118). It is then determined whether all identified sections in the compiled and indexed patent document collection have been evaluated for inclusion in or omission from profile X (step 120). If the determination at step 120 is positive, the generation of profile X is complete (step 122). If the determination at step 120 is negative, the process returns to step 112 to consider the remaining sections in the collection for profile X. Next, it is determined whether any other profiles are to be generated for the collection (step 124). If the determination at step 124 is positive, the variable X is incremented (step 126) and the process returns to step 110. If the determination at step 124 is negative, the generation of profiles ends and the value of X is assigned to the variable X_Total (step 128). Thus, one or more profiles can be generated for the patent document collection, each profile assigning weights to one or more identified sections of the patent documents in the collection.
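The nested loop of Fig. 1 (an outer loop over profiles, an inner loop over sections) can be sketched as below. This is an illustrative reading of steps 108 to 128, not the patented implementation: the function `generate_profiles` and the idea of passing one "chooser" callable per profile, which returns a primary weight or `None` to omit a section, are assumptions made for the example.

```python
from typing import Callable, Dict, List, Optional

# A chooser decides, for one profile, whether a section is included
# (returns its primary weight) or omitted (returns None). Hypothetical.
Chooser = Callable[[str], Optional[float]]


def generate_profiles(
    sections: List[str], choosers: List[Chooser]
) -> List[Dict[str, float]]:
    profiles: List[Dict[str, float]] = []
    for choose in choosers:              # loop over X (steps 108, 124, 126)
        profile: Dict[str, float] = {}
        for section in sections:         # loop over N (steps 110, 118, 120)
            weight = choose(section)     # step 112: is the section included?
            if weight is not None:
                profile[section] = weight    # steps 114 and 116
        profiles.append(profile)         # step 122: profile X is complete
    return profiles


sections = ["title", "abstract", "claims"]
claims_only: Chooser = lambda s: 1.0 if s == "claims" else None
broad: Chooser = lambda s: {"title": 0.5, "abstract": 1.0, "claims": 2.0}.get(s)

profiles = generate_profiles(sections, [claims_only, broad])
x_total = len(profiles)                  # step 128: X_Total
print(profiles, x_total)
```

In this sketch an omitted section is simply left out of the profile; assigning it a weight of 0, as the text describes, would have the same effect on a weighted sum.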
As shown in Fig. 1, one or more profiles can be generated to emphasize or de-emphasize the extent to which selected sections of a patent document are used in a search. Fig. 2 is a flow chart (200) illustrating a process for adding additional weight values to each generated profile. More specifically, an additional weight (a secondary weight) can be added to or subtracted from the weight value based on the number of matching strings in the selected sections of each profile. As shown in Fig. 2, the variable X_Total is assigned the number of profiles generated (step 202), and the counting variable X is set to the integer 1 (step 204). Then the variable Y_Total is assigned the number of sections in profile X that have an assigned weight (step 206). To evaluate each section of the profile, the counting variable Y is set to the integer 1 (step 208). It is then determined whether a secondary weight is to be added to section Y of profile X (step 210). If the determination at step 210 is negative, the process jumps to step 230 to evaluate the next section in the profile, if one exists. If the determination at step 210 is positive, a second query determines whether the assigned secondary weight is tiered (step 212). More specifically, each profile can include tiered weight values that depend on the number of data string matches returned during a search with the selected profile. If the determination at step 212 is negative, a minimum threshold is set for the number of data string matches that must be returned for a secondary weight to be assigned to section Y (step 214). After step 214, the secondary weight value is set for section Y of profile X (step 216). The inputs at steps 214 and 216 set the parameters of the secondary weight structure determined at step 212. Thus, a secondary weight value can be set for each profile section, to be applied as a weight to the search results when the match threshold is exceeded.
In addition to setting a single weight value, each selected section of a profile can be configured with tiered sub-weight thresholds. If the determination at step 212 is affirmative, the number of tier thresholds for section Y of profile X is assigned to the variable Z_Total (step 218), and the tier counting variable Z is set to the integer 1 (step 220). Following step 220, the minimum number of matching data strings that must be returned for a sub-weight to be assigned to tier Z of section Y of profile X is set (step 222), and a sub-weight value is then assigned to tier Z of section Y of profile X (step 224). Once a weight value has been assigned to the selected tier Z, the tier counting variable Z is incremented (step 226), and it is determined whether weight values have been assigned to all tiers of section Y of profile X (step 228). If the determination at step 228 is negative, the process returns to step 222. Conversely, if the determination at step 228 is affirmative, or following step 116, the counting variable Y is incremented so that the next selected profile section is evaluated (step 230). It is then determined whether all selected profile sections have been evaluated for the assignment of tiered weight thresholds (step 232). If the determination at step 232 is negative, the process returns to step 210; if it is affirmative, the profile counting variable X is incremented (step 234). After step 234, it is determined whether all generated profiles have been evaluated for the assignment of sub-weights (step 236). If the determination at step 236 is negative, the process returns to step 206; if it is affirmative, the assignment of tiered weight thresholds to the selected sections of the generated profiles ends (step 238). Accordingly, each profile can have tiered sub-weights, so that weight is assigned according to each selected section of the profile and the number of matching strings found in it.
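The tier lookup of steps 218 to 228 can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name `tiered_sub_weight`, the `(min_matches, sub_weight)` pair layout, and the sample values are all assumptions introduced here.

```python
from bisect import bisect_right

def tiered_sub_weight(match_count, tiers):
    """Return the sub-weight of the highest tier whose minimum match
    threshold is met, or 0.0 if no threshold is reached.

    tiers: list of (min_matches, sub_weight) pairs (hypothetical layout).
    """
    ordered = sorted(tiers)                       # ascending by threshold
    thresholds = [minimum for minimum, _ in ordered]
    index = bisect_right(thresholds, match_count)  # tiers satisfied so far
    return ordered[index - 1][1] if index else 0.0

# Hypothetical tiers for one profile section: 1+, 5+, and 10+ matches.
claims_tiers = [(1, 1.0), (5, 2.0), (10, 3.5)]
print(tiered_sub_weight(7, claims_tiers))  # 2.0
```

Keeping the tiers sorted by threshold lets one binary search replace the explicit loop over the counting variable Z.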
As shown in Fig. 2, tiered sub-weights can be applied to the individual sections of each profile, where each sub-weight is based on one or more thresholds for the number of matches between the query string and the data in the document collection being parsed. In another embodiment, shown in Fig. 3, a sub-weight can reflect the position at which a string match occurs within one or more profile sections. Such a sub-weight can be used independently of the sub-weights shown in Fig. 2 or in addition to them. As shown in Fig. 3, the variable X_Total is assigned to represent the number of generated profiles (step 302), and the counting variable X is set to the integer 1 (step 304). Thereafter, the variable Y_Total is assigned to represent the number of sections in profile X that have an assigned weight (step 306), and the counting variable Y is set to the integer 1 (step 308). It is then determined whether sub-weights are to be added to section Y of profile X (step 310). If the determination at step 310 is affirmative, section Y of profile X is divided into a plurality of sub-sections (step 312). The division in step 312 can take different forms. For example, in one embodiment the section is divided into three sub-sections, where the first sub-section is defined as the first sentence, the third sub-section is defined as the last sentence, and the second sub-section is defined as all data between the first and third sub-sections. Similarly, in another embodiment, section Y of profile X can be divided into a plurality of sub-sections whose lengths are proportional to the length of the whole section. Regardless of the method used to determine the number of sub-sections, each section Y of profile X can be divided into two or more sub-sections, where the sub-weight assigned to a sub-section reflects not only the matching strings in section Y of profile X but also the position of those matches within the selected sub-section.
After step 312, the variable Z_Total is assigned the number of sub-sections generated in section Y of profile X (step 314), and the counting variable Z is set to the integer 1 (step 316). A sub-weight is assigned to sub-section Z of section Y of profile X (step 318). Following the assignment in step 318, the counting variable Z is incremented (step 320), and it is determined whether every sub-section in section Y of profile X has been evaluated for the assignment of a sub-weight (step 322). If the determination at step 322 is negative, the process returns to step 318. Conversely, if the determination at step 322 is affirmative, or if the determination at step 310 is negative, the counting variable Y is incremented (step 324). It is then determined whether every section in profile X has been evaluated for the assignment of sub-weights (step 326). If the determination at step 326 is negative, the process returns to step 310. Conversely, if the determination at step 326 is affirmative, the counting variable X is incremented (step 328), and it is determined whether all profiles have been evaluated for the assignment of sub-weights (step 330). If the determination at step 330 is negative, the process returns to step 306; if it is affirmative, the sub-weight assignment process ends. Accordingly, a profile section can be divided into a plurality of sub-sections based on physical position, with sub-weights assigned to one or more of the identified sub-sections.
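The three-way division described for step 312 (first sentence, middle, last sentence) can be illustrated with a short sketch. The sentence splitter, the function names, and the sample weights are assumptions made for illustration; the description does not prescribe how sentences are detected or how positional sub-weights are combined.

```python
import re

def split_into_subsections(section_text):
    """Split a section into [first sentence, middle, last sentence].

    Sentences are detected naively at '.', '!' or '?' followed by
    whitespace (an assumption). Sections with fewer than three
    sentences are returned sentence by sentence.
    """
    parts = re.split(r"(?<=[.!?])\s+", section_text.strip())
    sentences = [part for part in parts if part]
    if len(sentences) < 3:
        return sentences
    return [sentences[0], " ".join(sentences[1:-1]), sentences[-1]]

def positional_score(section_text, query, sub_weights):
    """Sum query matches per sub-section, each scaled by its sub-weight."""
    needle = query.lower()
    subsections = split_into_subsections(section_text)
    return sum(weight * sub.lower().count(needle)
               for sub, weight in zip(subsections, sub_weights))

text = ("A widget is claimed. The widget has a gear. "
        "The gear drives a widget. Widgets are useful.")
print(positional_score(text, "widget", (3.0, 1.0, 2.0)))  # 7.0
```

Here a match in the first sentence counts three times as much as a match in the middle of the section, which is one way to express that position, and not only match count, affects the weight.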
In Figs. 1 to 3, a base profile is generated for retrieving patent documents that contain matching string combinations and for assigning weights to the different sections of a matching document. A secondary profile may also be applied to the compilation of documents that contain matching string combinations. More specifically, before the results are displayed to the searcher, a secondary profile may be used to assign sub-weights to the results based on secondary considerations. To apply the sub-weights, different characteristics of patent documents can be used, including but not limited to the priority date and/or the publication date. In the patent field, the priority date represents the earliest date in a patent family. More specifically, a priority date is established for an invention when a patent application describing that invention is first filed. The publication date of a patent document represents the grant date of an issued patent, and the publication date of a patent publication represents the date on which a pending patent application was published. A secondary profile can be generated using one or all of these recorded dates.
Fig. 4 is a flow chart (400) illustrating a process for generating a secondary profile that assigns weights to the search results returned by a submitted query based on date elements associated with the documents. In one embodiment, the date elements may include but are not limited to the publication date, the filing date, and the foreign priority date. First, the secondary profile, "secondary-profile", is set (step 402). The number of documents returned by the submitted query is assigned to the variable N_Total (step 404), and the counting variable N is set to the integer 1 (step 406). For document N in the returned compilation, its priority date is retrieved (step 408), and the counting variable N is then incremented (step 410). It is then determined whether the secondary-profile element has been retrieved for every document in the returned compilation (step 412). If the determination at step 412 is negative, the process returns to step 408. Conversely, if the determination at step 412 is affirmative, a sorting algorithm is executed on the extracted secondary-profile elements to sort the documents in the search results (step 414). The sorting algorithm can take many different forms, and the invention is therefore not limited to any particular sorting algorithm. Once the documents in the compilation have been sorted, the variable document_OLD is assigned to the document in the compilation with the earliest secondary-profile date (step 416), and the variable document_NEW is assigned to the document in the compilation with the most recent secondary-profile date (step 418). The variable date-range is defined as the difference between document_OLD and document_NEW (step 420), and date-range is divided into a plurality of sections (step 422). The division of date-range in step 422 can take different forms. For example, in one embodiment three sub-sections are formed, where the first sub-section is defined as the documents with dates closest to document_NEW, the third sub-section is defined as the documents with dates closest to document_OLD, and the second sub-section is defined as all documents with dates between the first and third sub-sections. Similarly, in another embodiment, date-range can be divided into a plurality of sections, each containing an equal share of the documents in the compilation. Accordingly, regardless of the method used, a sub-weight can be assigned to each sub-section of the compilation, so that the sub-weight contributes to the relevance of the query results that fall within that sub-section.
After the documents in the query results have been sorted based on at least one secondary data criterion, the variable Z_Total is assigned the number of sections of date-range (step 424), and the counting variable Z is set to the integer 1 (step 426). A weight is assigned to section Z of date-range (step 428), and the variable Z is then incremented (step 430). After step 430, it is determined whether a weight has been assigned to each sub-section (step 432). If the determination at step 432 is negative, the process returns to step 428. Conversely, if the determination at step 432 is affirmative, the assignment of weights to the generated sub-sections ends. Accordingly, to further emphasize secondary elements before the data is displayed, a secondary profile can be generated to assign sub-weights to the result set.
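The embodiment in which date-range is divided into sections holding an equal share of documents can be sketched as below. The function name `weight_by_date_bucket`, the `(doc_id, date)` tuples, and the sample weights are illustrative assumptions; the ordering of weights from newest to oldest is also a choice made here, not something the description fixes.

```python
from datetime import date

def weight_by_date_bucket(docs, weights):
    """Assign a sub-weight to each document by its date bucket.

    docs: list of (doc_id, date) pairs (hypothetical layout).
    weights: one sub-weight per bucket, newest bucket first.
    Buckets hold an equal share of the sorted documents.
    """
    ordered = sorted(docs, key=lambda item: item[1], reverse=True)
    bucket_count = len(weights)
    assigned = {}
    for rank, (doc_id, _) in enumerate(ordered):
        bucket = rank * bucket_count // len(ordered)
        assigned[doc_id] = weights[bucket]
    return assigned

docs = [("A", date(2001, 5, 1)), ("B", date(2008, 3, 9)),
        ("C", date(1999, 1, 15)), ("D", date(2005, 7, 30)),
        ("E", date(2003, 11, 2)), ("F", date(2007, 6, 6))]
print(weight_by_date_bucket(docs, [3, 2, 1]))
```

With six documents and three weights, the two newest documents fall in the first bucket, the next two in the second, and the two oldest in the third.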
The application of secondary elements to the query results of a document collection is not limited to dates. Fig. 5 is a flow chart (500) illustrating the application of a secondary profile to a result set without using any date associated with a patent document. When a search is initiated, one or more document collections are selected for the query (step 502). In one embodiment, the document collection may take the form of an intellectual property document collection. Likewise, in one embodiment, the document collections may be organized by country, for example as collections published by the United States Patent and Trademark Office, the Japan Patent Office, the European Patent Office, and so on. Once a document collection has been selected, a profile is selected for the search (step 504). Embodiments of profiles are described above with reference to Figs. 1 to 3. Upon completion of the selections in steps 502 and 504, the query is opened and submitted against the profile and the selected document collection (step 506). In one embodiment, the query is a string query. The number of documents in the collection in which the query occurs at least once is determined, and this number is assigned to the variable X_Total (step 508), and the counting variable X for matching documents is set to the integer 1 (step 510). In addition, the variable N_Total is assigned the number of sections in the profile selected for the submitted query (step 512), and the counting variable N for the selected profile is set to the integer 1 (step 514). A score is calculated for section N of each document X. In one embodiment, the score is calculated as the product of the number of query matches in section N and the number of points assigned to section N (step 516). In one embodiment, the points assigned to section N represent the importance of that particular section within the collection.
After step 516, the variable N is incremented (step 518), and it is determined whether all sections of the profile have been evaluated (step 520). If the determination at step 520 is negative, the process returns to step 516. Conversely, if the determination at step 520 is affirmative, the variable X is incremented (step 522). It is then determined whether all of the counted documents have been evaluated (step 524). If the determination at step 524 is affirmative, the score evaluation of the returned documents ends (step 526). Conversely, if the determination at step 524 is negative, the process returns to step 514 so that the next document is scored across the profile sections.
Once scores have been assigned for all documents and for the selected profile, an overall score is calculated for each document based on the selected profile against which the query was submitted (step 526). As shown in Fig. 5, each document in the compilation has a score, and this score is a numerical value based on the number of string matches and the associated weights assigned in the profile.
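The per-section score of step 516 (number of query matches multiplied by the points assigned to the section) can be sketched as follows. The dictionary layouts, the function names, and the choice to sum the section scores into an overall score are assumptions made for illustration; the description states only that an overall score is calculated.

```python
def section_scores(document, query, profile):
    """Score each profiled section as (query matches) x (assigned points).

    document: {section_name: text}; profile: {section_name: points}.
    Both layouts are hypothetical. Sections absent from the profile
    are ignored.
    """
    needle = query.lower()
    return {name: document.get(name, "").lower().count(needle) * points
            for name, points in profile.items()}

def overall_score(document, query, profile):
    """Combine the section scores by summing them (an assumption)."""
    return sum(section_scores(document, query, profile).values())

profile = {"title": 5, "abstract": 3, "claims": 4}
doc = {
    "title": "Gear assembly",
    "abstract": "A gear assembly with a gear train.",
    "claims": "1. A gear. 2. The gear of claim 1.",
    "description": "gear gear gear",
}
print(section_scores(doc, "gear", profile))  # {'title': 5, 'abstract': 6, 'claims': 8}
print(overall_score(doc, "gear", profile))   # 19
```

The description section contributes nothing in this example because the sample profile assigns it no points.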
It should be understood that, in conducting a patent search, it is important to determine which search results are more relevant. For example, in addition to serving as a contributing element of a query match, the score is used to assign a rank to each matching document. The rank indicates which returned documents are assessed as more relevant than the other returned documents. The ranking draws on different elements, which can include a rank based on the score and/or a combination of the rank and secondary elements.
Fig. 6 is a flow chart 600 illustrating a process for sorting the documents returned by a query, where the sort is based on the score assigned to each returned document and to each profile section. In Fig. 6, the variable X_Total is assigned the total number of documents returned with at least one match to the submitted query (step 602). The documents are then sorted according to a sorting algorithm (step 604). In one embodiment, the documents can be sorted from the highest score to the lowest score or from the lowest score to the highest score. The sorting algorithm can take many different forms, and the invention is therefore not limited to any particular sorting algorithm. Once the documents have been sorted across the whole collection, they can also be sorted for each profile section selected for the query, producing a sorted set of documents for each section. In one embodiment, the process of sorting the returned documents can be regarded as a ranking of the scoring results. The variable N_Total represents the number of sections in the profile selected for the search (step 606). The section counting variable N is initialized to the integer 1 (step 608), and the document counting variable X is initialized to the integer 1 (step 610). For section N, each document X having at least one query match is sorted from the first document X to the last document X_Total (step 612). Once the sort for section N is complete, the variable N is incremented (step 614), and it is determined whether all sections of the selected profile have been evaluated for sorting (step 616). If the determination at step 616 is negative, the process returns to step 612. Conversely, if the determination at step 616 is affirmative, all documents have been sorted across all sections of the selected profile. Accordingly, the sorting of query results is performed at two levels: the first level covers the query as a whole, and the second level sorts by the selected sections that make up the profile.
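The two-level sort can be sketched in a few lines. The nested dictionary of scores, the function name `rank_documents`, and the use of summed section scores for the whole-query ordering are assumptions introduced for this illustration.

```python
def rank_documents(scores):
    """Sort documents at two levels, highest score first.

    scores: {doc_id: {section_name: score}} (hypothetical layout).
    Returns (overall_order, order_by_section).
    """
    overall = sorted(scores,
                     key=lambda doc: sum(scores[doc].values()),
                     reverse=True)
    sections = sorted({name for per_doc in scores.values() for name in per_doc})
    by_section = {
        name: sorted(scores,
                     key=lambda doc: scores[doc].get(name, 0),
                     reverse=True)
        for name in sections
    }
    return overall, by_section

scores = {
    "D1": {"title": 5, "claims": 8},
    "D2": {"title": 10, "claims": 0},
    "D3": {"title": 0, "claims": 12},
}
overall, by_section = rank_documents(scores)
print(overall)               # ['D1', 'D3', 'D2']
print(by_section["title"])   # ['D2', 'D1', 'D3']
```

A document that leads the whole-query ordering need not lead any single section, which is why the two levels are kept separately.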
Once the document collection has been sorted, different tools can be employed to convey the sorted query results. More specifically, after the query and the sorting of the query results are complete, the data conveyed to the query submitter is based on the relevance of the results to the query as a whole and/or on their relevance with respect to each profile section of the submitted query. Fig. 7 is a flow chart 700 illustrating a process for assigning relevance to the returned and sorted search results. The number of tiers for the returned search results is assigned to the variable T_Total (step 702). In one embodiment, the variable T_Total is a static variable. In another embodiment, however, the variable T_Total can be a dynamic variable. The relevance evaluation can be performed at two levels: the first level is based on all documents in the query results, and the second level is based on each profile of the document collection. The variable X_Total represents all documents returned by the query and sorted (step 704), and X_Total is divided by the number of tiers T_Total to calculate the quantity QS of query results assigned to each tier, denoted T (step 706). To assign query results to tier T, the tier counting variable T is initialized to the integer 1 (step 708), and the counting variable X, which represents the document being assigned to a tier, is initialized to the integer 1 (step 710). After the initialization in steps 708 and 710, document X is assigned to tier T (step 712). Following the assignment in step 712, the variable X is incremented (step 714), and it is determined whether tier T has been filled with query results (step 716). If the determination at step 716 is negative, the process returns to step 712. Conversely, if the determination at step 716 is affirmative, the assignment of query results to tier T is complete. The variable T is then incremented (step 718), and it is determined whether the assignment of query results to all tiers is complete (step 720). If the determination at step 720 is negative, the process returns to step 710. Conversely, if the determination at step 720 is affirmative, the assignment of query results to the generated tiers ends. It should be noted that the query results can be assigned to tiers from the top of the sorted list downward, that is, from most relevant to least relevant, or from the bottom of the sorted list upward, that is, from least relevant to most relevant. Similarly, in one embodiment, inflection points occur in the sorted and ranked results, and adjacent tiers are divided at those inflection points. Accordingly, to emphasize the selected relevance, the sorted documents of the query results are assigned to tiers.
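The fixed-size tier assignment of steps 702 to 720 can be sketched as below. The function name `assign_tiers` is hypothetical, and rounding the per-tier quantity QS up when X_Total is not evenly divisible by T_Total is an assumption; the description does not say how a remainder is handled.

```python
import math

def assign_tiers(sorted_docs, tier_count):
    """Split an already sorted result list into consecutive tiers.

    QS, the number of results per tier, is X_Total / T_Total rounded up
    (the rounding is an assumption). The last tier may hold fewer items.
    """
    if not sorted_docs or tier_count < 1:
        return []
    per_tier = math.ceil(len(sorted_docs) / tier_count)
    return [sorted_docs[start:start + per_tier]
            for start in range(0, len(sorted_docs), per_tier)]

ranked = ["D1", "D2", "D3", "D4", "D5", "D6", "D7"]
print(assign_tiers(ranked, 3))  # [['D1', 'D2', 'D3'], ['D4', 'D5', 'D6'], ['D7']]
```

Passing the list sorted from most to least relevant fills the tiers from the top down; reversing the list first fills them from the bottom up.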
As described above, the query results can be sorted on a coarse basis, without regard to the profile against which the query was submitted. However, tier assignment can also be performed on a profile basis (also referred to as a refined basis). More specifically, with reference to the characteristics of the profile, each profile in the submitted query can be sorted according to the relevance order of the returned documents. To apply this to a profile, the compilation assigned to each profile can also be divided into tiers in the manner shown in Fig. 7. This refined, per-profile tier assignment allows the query results to be conveyed further on the basis of profile characteristics.
As described above, each patent in the query results of a document collection can be parsed so that the results are displayed on the basis of relevance. In one embodiment, the results can be displayed so as to emphasize or de-emphasize the matching data values in designated sections of the returned compilation of intellectual property documents. Similarly, in one embodiment, the display of the search results is limited on the basis of relevance. Where tiers have been assigned to the query results, only selected tiers may be viewable, and the selected tiers can be those considered to contain the more relevant query results. Similarly, where the query results have been scored, a limit can be set so that only results within a specified score range are displayed for the submitted query. The display limits on query results should not be confined to the embodiments described here; other forms can be used to limit the viewable query results to those having a certain relevance score.
In one embodiment, the sorted query results are displayed statically as a compilation of relevant documents. In another embodiment, however, the returned document set can be limited dynamically based on the sort of the returned documents. This dynamic aspect supports changing the relevance criteria to reflect the query results. Fig. 8 is a flow chart 800 illustrating an embodiment that dynamically limits the display of query results. As described above, each document returned by the query is sorted based on a quantitative relevance factor for the submitted query elements (step 802). Based on this numerical data, that is, the quantitative relevance factor assigned to each returned document, a curve-fitting routine is applied to the compilation of returned documents (step 804). The curve-fitting routine computes a theoretical function for the compilation data. More specifically, the curve-fitting routine determines the theoretical function from the raw numerical relevance factors. As a result of the curve-fitting routine, some documents in the compilation lie on or near the curve of the theoretical function (step 806). One or more derivatives of the theoretical function are computed (step 808). To limit the results of the compilation dynamically, the number of derivatives of the theoretical function is selected (step 810). More specifically, to limit the results of the compilation to the relevant documents, the dynamic selection is restricted to the first derivative of the theoretical function of the curve-fitting routine. Similarly, to expand the results of the compilation and obtain a greater number of documents, the dynamic selection extends to the second derivative (or a higher order). Based on the number of derivatives selected, the compilation of documents within the selected derivative range is returned (step 812). Accordingly, the compilation of returned documents is adjusted dynamically based on how closely the documents approximate the theoretical function of the curve-fitting routine.
The dynamic selection process and tool shown in Fig. 8 represent one embodiment for limiting the results of the compilation. In another embodiment, a graphical user interface is employed as a veneer over the source code, so that the user can interact with and modify the sorted compilation of results. Fig. 9 is a flow chart illustrating a process for dynamically limiting the results of the compilation by means of a graphical user interface. As described above, each document returned by the query is sorted based on a quantitative relevance factor for the submitted query elements (step 902). The search results are plotted as a graph (step 904). Different forms of graph can be used. In one embodiment, the graph can be two-dimensional, with the number of returned documents on one axis and the quantitative relevance factor on the other axis. A mechanism is provided on the interface that allows the number of documents to be limited at a selected relevance value (step 906). In one embodiment, a slider is provided on the user interface, and by means of a pointing tool the slider can be moved to any relevance value on the graph (step 908). As the slider moves, the number of relevant documents and the specific documents considered relevant change dynamically. More specifically, the slider acts as a dividing line: all documents whose assigned relevance lies above the slider position are identified as relevant and are returned (step 910), while all documents whose assigned relevance lies below the slider position are not returned (step 912). In one embodiment, all documents whose assigned relevance is at the slider position are identified as relevant and are returned. Conversely, in another embodiment, all documents whose assigned relevance is at the slider position are identified as not relevant and are not returned. Accordingly, the slider of the graphical user interface can be moved to adjust which documents are identified as relevant and returned in the compilation.
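The dividing-line behavior of the slider (steps 910 and 912), including the two alternative treatments of documents exactly at the slider position, can be sketched as follows. The function name, the `include_at_slider` flag, and the sample relevance values are illustrative assumptions; no graphical toolkit is involved.

```python
def filter_by_slider(relevance, slider_value, include_at_slider=True):
    """Return the document ids kept by the slider, most relevant first.

    relevance: {doc_id: relevance_value} (hypothetical layout).
    Documents above the slider are kept, documents below are dropped,
    and include_at_slider selects which embodiment applies to documents
    exactly at the slider position.
    """
    def is_kept(value):
        if include_at_slider:
            return value >= slider_value
        return value > slider_value

    kept = [doc for doc, value in relevance.items() if is_kept(value)]
    return sorted(kept, key=lambda doc: relevance[doc], reverse=True)

relevance = {"D1": 0.9, "D2": 0.5, "D3": 0.2}
print(filter_by_slider(relevance, 0.5))                           # ['D1', 'D2']
print(filter_by_slider(relevance, 0.5, include_at_slider=False))  # ['D1']
```

In an interactive interface, this function would be called again each time the slider moves, so that the displayed compilation tracks the selected relevance value.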
As shown in Fig. 9, a graphical user interface can be employed to provide a tool that facilitates the dynamic selection of relevant documents. Fig. 10 is a block diagram 1000 illustrating an embodiment of the graphical user interface. More specifically, computer system 1000 has a processing unit 1002 that is coupled to memory 1006 through a bus structure 1008. Although only one processing unit 1002 is shown, in one embodiment additional processing units can be provided in an expanded design. The illustrated system 1000 is in communication with a storage medium 1040 configured to store a document collection 1042. In one embodiment, the electronic document collection comprises a compilation of patent documents, including granted patents and published patent applications. Storage medium 1040 is in communication with processing unit 1002. In addition, the illustrated system is in communication with a visual display 1050 for displaying visual data. An input device 1052 is employed to communicate with visual display 1050. Many different forms of input device can be used, including but not limited to a keyboard, a mouse, a trackball, a stylus, and so on. A graphical user interface 1054 is provided on visual display 1050 to convey a graphical display of the compilation of query results, based on the calculated relevance score carried by each individual result and on the number of documents that make up the compilation. In one embodiment, graphical user interface 1054 serves as a veneer over the source code running in processing unit 1002. A graphical mechanism 1060, accessible through the input device, is provided within the graphical user interface to enable dynamic selection of a subset of the query results. In one embodiment, graphical mechanism 1060 takes the form of a slider, and the slider represents a dividing line within the graphical representation of the query results. As graphical mechanism 1060 is moved within the range of the graphical representation, the particular query results that fall within the compilation are modified. In one embodiment, all documents referenced on one side of graphical mechanism 1060 and/or all documents falling on the graphical mechanism are selected for inclusion in the query results, and all documents referenced on the opposite side of graphical mechanism 1060 are excluded. Accordingly, graphical mechanism 1060 of the graphical user interface is a tool for applying dynamic modifications to the compilation of query results.
The processes and/or instructions shown in Figs. 1 to 9 are employed to submit a query to a document collection and to parse the collection in response to the query. However, the invention should not be limited to processes or instruction sets; in one embodiment, it can include hardware elements in communication with the compilation. Fig. 11 is a block diagram 1100 illustrating a set of tools for sorting and parsing query results into one or more tiers based on the search profile against which the query is submitted, including the assignment of weights to the different sections of the intellectual property documents identified by the search profile. As shown, computer system 1102 has a processing unit 1104 that is coupled to memory 1106 by a bus structure 1108. Although only one processing unit 1104 is shown, in one embodiment additional processing units can be provided in an expanded design. The illustrated system 1102 is in communication with a storage medium 1140 configured to store a document collection 1142. In one embodiment, the electronic document collection comprises a compilation of patent documents, including granted patents and published patent applications. Storage medium 1140 is in communication with processing unit 1104. In addition, the illustrated system is in communication with a visual display 1150 for displaying visual data. The components shown and described here support the submission of queries to document collection 1142.
Controller 1160 is provided local to computer system 1102 and is in communication with memory 1106 and processing unit 1104. Controller 1160 is responsible for compiling and indexing document collection 1142. Controller 1160 is in communication with document manager 1162, which is used to identify the sections of each document in the collection. As noted above, in the case of a patent document collection, each patent or published patent application is made up of specific, uniformly formatted sections. However, not all patent document collections share a uniform layout. Document manager 1162 is therefore used to identify the sections of the documents in the collection, and in one embodiment document manager 1162 is also used to identify the display order of the identified sections. Profile manager 1164 is provided in communication with document manager 1162. Profile manager 1164 establishes a search profile for document collection 1142. More specifically, profile manager 1164 facilitates the selection of one or more sections of a document and the assignment of a weight to each selected section. The selected sections are sections identified by document manager 1162 that are to be included in the query. In one embodiment, the weight is a numerical value that represents the importance of matching data in the selected section. Accordingly, the search profile established by profile manager 1164 provides a summary of the sections of the document collection that are relevant to the query.
Query manager 1166 is in communication with profile manager 1164; it is provided local to computer system 1102 and is in communication with memory 1106. Query manager 1166 is responsible for submitting a query to document collection 1142 by way of at least one selected search profile. More specifically, query manager 1166 compares the query data with the data in those sections of document collection 1142 that are identified in the profile and have an assigned weight. Query manager 1166 is in communication with relevance navigator 1168. The relevance navigator ranks the documents of the compilation based on their relevance scores and dynamically limits the results in the compilation based on that ranking. The comparison performed by query manager 1166, in combination with relevance navigator 1168, generates a compilation of relevant patent documents based on the applied dynamic limit. In one embodiment, the compilation is presented on visual display 1150. Similarly, in one embodiment, the compilation can be stored in volatile or persistent memory. To facilitate delivery to the query submitter, the query manager is in communication with a sort manager, so that the query results are delivered according to the ranking of the sorted documents.
In one embodiment, controller 1160, document manager 1162, profile manager 1164, and query manager 1166 can reside in memory 1106 local to computer system 1102. However, the invention is not limited to this embodiment. For example, in one embodiment, the controller, document manager, profile manager, and query manager 1160-1166 can each reside as hardware tools external to local memory 1106, or they can be implemented as a combination of hardware and software. Similarly, in one embodiment, the controller and managers 1160-1166 can reside on a remote system in communication with storage medium 1140. Accordingly, the controller and managers can be implemented as software tools or hardware tools to support the submission of one or more queries to the electronic document collection in order to generate a compilation of relevant patent documents.
In one embodiment, the invention is implemented in software, which includes but is not limited to firmware, resident software, microcode, etc. The invention can take the form of a computer program product accessible from a computer-usable or computer-readable medium that provides program code for use by, or in connection with, a computer or any instruction execution system. For the purposes of this description, a computer-usable or computer-readable medium can be any apparatus that can contain, store, communicate, propagate, or transport the program for use by, or in connection with, the instruction execution system, apparatus, or device.
Embodiments within the scope of the invention also include articles of manufacture comprising a program storage device having program code encoded therein. The program storage device can be any available medium that can be accessed by a general-purpose or special-purpose computer. By way of example, and not limitation, such a program storage device can include RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to store the desired program code and that can be accessed by a general-purpose or special-purpose computer. Combinations of the above should also be included within the scope of the program storage device.
The medium can be an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system (or apparatus or device). Examples of a computer-readable medium include a semiconductor or solid-state memory, magnetic tape, a removable computer diskette, random-access memory (RAM), read-only memory (ROM), a rigid magnetic disk, and an optical disk. Current examples of optical disks include compact disk read-only memory (CD-ROM), compact disk read/write (CD-R/W), and DVD.
A data processing system suitable for storing and/or executing program code includes at least one processor coupled directly or indirectly to memory elements through a system bus. The memory elements can include local memory employed during actual execution of the program code, bulk storage, and cache memory. The cache memory provides temporary storage of at least some program code in order to reduce the number of times code must be retrieved from bulk storage during execution.
Input/output (I/O) devices (including but not limited to keyboards, displays, pointing devices, etc.) can be coupled to the system either directly or through intervening I/O controllers. Network adapters may also be coupled to the system to enable the data processing system to become coupled to other data processing systems, remote printers, or storage devices through intervening private or public networks.
The software implementation can take the form of a computer program product accessible from a computer-usable or computer-readable medium that provides program code for use by, or in connection with, a computer or any instruction execution system.
Advantages over the prior art
It is known in the art that each intellectual property document has sections with a defined outline that satisfy the statutory filing requirements. One or more search profiles are generated to support submitting queries to the document collection. Each profile applies a weight to one or more identified sections of the documents. The weight represents the importance of the identified section and contributes a numerical value to each document returned in the compilation. Not all searches are alike. For example, it is recognized that intellectual property documents in the chemical field, even when they contain drawings, contain only a limited number of them. A query in the chemical field can therefore de-emphasize the drawings and place greater emphasis on the written text. Different searches are submitted to the collection to obtain different results. Accordingly, generating multiple profiles, each of which makes a different selection from the identified sections and assigns different weights to the selected sections, makes it possible to submit queries efficiently and effectively and to produce a compilation of the documents of interest.
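A small sketch may make the effect of field-specific profiles concrete. Everything named here is an assumption for illustration: the `SearchProfile` type, the two example profiles and their weights, and the per-section match counts are hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SearchProfile:
    name: str
    weights: dict  # section name -> weight; sections not listed contribute nothing


# Hypothetical profiles: the chemical profile de-emphasizes drawings
# and places more weight on the written text.
MECHANICAL = SearchProfile("mechanical", {"claims": 1.0, "description": 1.0, "drawings": 2.0})
CHEMICAL = SearchProfile("chemical", {"claims": 2.0, "description": 2.0, "drawings": 0.0})


def score(section_hits, profile):
    """Weighted sum of per-section match counts for one document."""
    return sum(profile.weights.get(section, 0.0) * hits
               for section, hits in section_hits.items())


def rank(hits_by_doc, profile):
    """Document ids in descending score order; ties are broken by id."""
    return sorted(hits_by_doc,
                  key=lambda doc_id: (-score(hits_by_doc[doc_id], profile), doc_id))


# Assumed match counts per section for the same query against two documents.
hits_by_doc = {
    "D1": {"claims": 1, "drawings": 3},
    "D2": {"claims": 3, "drawings": 0},
}
mechanical_order = rank(hits_by_doc, MECHANICAL)
chemical_order = rank(hits_by_doc, CHEMICAL)
```

With these assumed numbers the same query content yields a different ordering under each profile, which is the behavior the paragraph above attributes to selecting a different profile.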
Once the profiles have been generated and at least one profile has been selected for submitting a query, the next step is to display the query results in a manner that parallels the selected profile. In one embodiment, the query produces a compilation of documents, which is then sorted and placed into ranked tiers. This allows relevance to be shown directly when the query results are displayed. In another embodiment, the query results can additionally be delivered based on the sections selected in the profile, wherein a second set of query results is displayed based on the individual sections represented in the profile and on the sorting of the documents within each section. Thus, the profile selection is used both to generate the query results and to display them by relevance in a manner that parallels the selected profile.
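Placing a sorted compilation into ranked tiers can be sketched as below. The description does not state here how tier boundaries are chosen, so this example simply assumes caller-supplied score thresholds; the function name and the sample scores are hypothetical.

```python
def split_into_tiers(ranked, thresholds):
    """Group a ranked compilation into tiers.

    ranked: list of (doc_id, score) pairs in descending score order.
    thresholds: score cut-offs in descending order (an assumption of this sketch);
        tier 0 holds scores >= thresholds[0], tier 1 holds the next band, and the
        final tier holds everything below the last threshold.
    """
    tiers = [[] for _ in range(len(thresholds) + 1)]
    for doc_id, score in ranked:
        # Index of the first threshold the score reaches, else the last tier.
        tier = next((i for i, t in enumerate(thresholds) if score >= t), len(thresholds))
        tiers[tier].append(doc_id)
    return tiers


ranked = [("A", 9.0), ("B", 8.5), ("C", 4.0), ("D", 1.0)]
tiers = split_into_tiers(ranked, thresholds=[8.0, 3.0])
top_tier_only = tiers[0]  # limiting the display to one selected tier
```

Restricting the display to a chosen tier is then a matter of indexing into the returned list.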
Alternative embodiments
It should be appreciated that, although specific embodiments of the invention have been described herein for purposes of illustration, various modifications may be made without departing from the spirit and scope of the invention. In particular, intellectual property documents exist in different forms, including patents, trademarks, and copyrights. Within the category of patent documents, the documents can be further classified, including granted patents, published patent applications, abridgments of specifications, and utility model registrations. Some of these documents may contain the same number of sections in the same order, while others may contain a different number of sections and/or a different order. Independent profiles are generated based on the sections included, without regard to the order in which those sections are arranged in the underlying document.
In addition, although the description specifically addresses electronic document collections relating to intellectual property documents (including granted patents and published patent applications, trademark registrations and applications, and copyright registrations and applications), the invention should not be limited to these specific categories of electronic documents. In one embodiment, the electronic document collection can include any type of document having a plurality of defined sections. This enables the managers to parse a document into its defined sections, to create multiple profiles with respective weights for one or more of the defined sections, and to submit a query to the document collection with a selected profile. As noted above, the profile selected for a query can be modified dynamically. In one embodiment, modifying the query profile while retaining the query content can change both the documents returned in the compilation and the order of relevance in which the documents appear in the compilation. Accordingly, the scope of protection of the invention is limited only by the appended claims and their equivalents.

Claims (35)

1. A computer-implemented method for assigning relevance to search results from an electronic document collection, comprising:
compiling and indexing a collection of intellectual property documents, each document in the collection having a plurality of sections;
identifying each of the sections of each document in the collection;
creating a search profile for the document collection, wherein the search profile includes at least one identified section selected from each document of the compiled collection;
assigning, in the created search profile, a weight to each selected identified section;
at query time, submitting a query to the patent document collection, the query including selecting at least one search profile and comparing query data with the data in each document section in the collection that has an assigned weight in the selected profile;
calculating a relevance score for each document returned in a compilation of documents, the compilation being generated from the submitted query;
ranking the documents in the compilation based on the calculated relevance scores;
dynamically limiting the results of the compilation based on the ranking; and
a first compilation of sorted relevant documents, based on the dynamic limit applied to the compilation.
2. The method of claim 1, further comprising dividing the compilation of documents into ranked tiers based on the sorting of the compilation.
3. The method of claim 2, further comprising limiting the display of query results based on a selection of a desired ranked tier.
4. The method of claim 2, further comprising dividing adjacent tiers at an inflection point occurring in the ranking.
5. The method of claim 1, further comprising applying a secondary sort factor to the first compilation of documents, and ranking the sorted compilation based on the secondary sort factor.
6. The method of claim 5, wherein the secondary sort factor is selected from the group consisting of filing date, publication date, foreign priority date, title listed alphabetically, assignee, and combinations thereof.
7. The method of claim 1, further comprising compiling a second compilation of documents as a subset of the first compilation, and calculating a second relevance score for the subset based on a secondary criterion present in the search profile.
8. The method of claim 7, further comprising sorting each subset of documents separately.
9. The method of claim 8, further comprising dynamically assigning a relevance limit to the sorted subset, and limiting the return of query results based on the assigned relevance limit.
10. The method of claim 1, further comprising generating a graphical representation of the first compilation based on the calculated relevance scores and on the number of documents in the compilation represented at each distinct calculated score.
11. The method of claim 10, further comprising a cursor in the graphical representation for indicating a dividing line, the cursor being used to bound the compilation so as to include all documents plotted on one side of the dividing line and to exclude all documents on the other side of the dividing line.
12. The method of claim 1, further comprising applying a curve-fitting routine to the first compilation, wherein the routine calculates a theoretical function for the data of the first compilation and calculates at least one derivative of the function.
13. The method of claim 12, wherein the step of dynamically limiting the results of the compilation comprises selecting a derivative of the function and returning the data that fall within the range of the selected derivative.
14. A system, comprising:
a processor communicatively coupled to memory and to a storage medium;
a collection of intellectual property documents stored on the storage medium, each document in the collection having a plurality of sections;
a controller, communicatively coupled to the processor, that compiles and indexes the document collection;
a document manager, communicatively coupled to the controller, for identifying each section of each document in the collection;
a profile manager, communicatively coupled to the document manager, that creates a search profile for the document collection, wherein the search profile includes at least one identified section selected from each document of the compiled collection, and the profile manager assigns, in the created search profile, a weight to each selected identified section;
a query manager that submits a query to the document collection at query time, the query including selecting at least one search profile and comparing query data with the data in each document section in the collection that has an assigned weight in the selected profile, the submitted query producing a compilation of relevant documents returned by the query manager, each relevant document having data that match the submitted query in at least one identified profile section with an assigned weight, and having a relevance score;
a relevance navigator, communicatively coupled to the query manager, that ranks the documents in the compilation based on the relevance scores and dynamically limits the results of the compilation based on the ranking; and
a first compilation of sorted relevant documents, based on application of the dynamic limit to the compilation.
15. The system of claim 9, further comprising a secondary sort factor applied to the first compilation of documents, whereby the sorted first compilation is ranked based on the secondary sort factor.
16. The system of claim 15, wherein the secondary sort factor is selected from the group consisting of filing date, publication date, foreign priority date, title listed alphabetically, assignee, and combinations thereof.
17. The system of claim 16, further comprising a second compilation of documents compiled as a subset of the first compilation, and a second relevance score calculated for the subset based on each secondary criterion present in the search profile.
18. The system of claim 17, further comprising a sort manager communicatively coupled to the relevance navigator, the sort manager sorting each compilation based on the secondary relevance score.
19. The system of claim 14, further comprising the first compilation of documents being divided into ranked tiers based on the sorting of the compilation, and the display of query results being limited based on a selection of a ranked tier.
20. The system of claim 19, further comprising the relevance navigator dividing adjacent tiers at an inflection point occurring in the ranking.
21. The system of claim 1, further comprising a graphical representation of the first compilation based on the calculated relevance scores and on the number of documents in the compilation represented at each distinct calculated score.
22. The system of claim 21, further comprising a tool for communicating with the graphical representation, wherein the tool dynamically sets a dividing line for the compilation results so as to include all documents plotted on one side of the dividing line and to exclude all documents on the other side of the dividing line.
23. The system of claim 14, further comprising instructions, running on the processor, for a curve-fitting routine applied to the first compilation, the curve-fitting routine calculating a theoretical function for the data of the first compilation and calculating at least one derivative of the function.
24. The system of claim 23, further comprising the relevance navigator limiting the results of the first compilation to a selected range of the function derivative and returning the data that fall within the selected derivative range.
25. An article for assigning relevance to search results from an electronic patent document collection in computer memory, comprising a computer-readable carrier and a first compilation, wherein:
the computer-readable carrier includes computer program instructions for carrying out the relevance assignment, the instructions comprising:
instructions to compile and index a collection of intellectual property documents, each document in the collection having a plurality of sections;
instructions to identify each of the sections of each document in the collection;
instructions to create a search profile for the document collection, wherein the search profile includes at least one identified section selected from each document of the compiled collection;
instructions to assign, in the created search profile, a weight to each identified and selected section;
instructions to submit a query to the patent document collection at query time, the instructions including selecting at least one search profile and comparing query data with the data in each document section in the collection that has an assigned weight in the selected profile;
instructions to calculate a relevance score for each document returned in a compilation of documents and to rank the documents of the compilation based on the calculated relevance scores, the compilation being generated from the submitted query;
instructions to dynamically limit the results of the compilation based on the ranking; and
the first compilation is a first compilation of sorted relevant documents, wherein the sorting is based on the dynamic limit applied to the compilation of documents.
26. The article of claim 25, further comprising instructions to apply a secondary sort factor to the first compilation of documents and to rank the sorted first compilation based on the secondary sort factor.
27. The article of claim 26, wherein the secondary sort factor is selected from the group consisting of filing date, publication date, foreign priority date, title listed alphabetically, assignee, and combinations thereof.
28. The article of claim 33, further comprising instructions to compile a second compilation of documents as a subset of the first compilation based on a secondary criterion present in the search profile.
29. The article of claim 28, further comprising instructions to calculate a secondary relevance score for the subset.
30. The article of claim 28, further comprising instructions to sort the second compilation of documents based on the secondary relevance score.
31. The article of claim 25, further comprising instructions to divide the compilation of documents into ranked tiers based on the sorting of the compilation and to limit the display of query results based on a selection of a ranked tier.
32. The article of claim 25, further comprising instructions to divide adjacent tiers at an inflection point occurring in the ranking.
33. The article of claim 25, further comprising a graphical representation of the first compilation based on the calculated relevance scores and on the number of documents in the first compilation represented at each distinct calculated score, and instructions for communicating with the graphical representation, wherein the instructions dynamically set a dividing line for the compilation results so as to include all documents plotted on one side of the dividing line and to exclude all documents on the other side of the dividing line.
34. The article of claim 25, further comprising instructions to execute a curve-fitting routine on the first compilation, the curve-fitting routine calculating a theoretical function for the data of the first compilation and calculating at least one derivative of the function.
35. The article of claim 34, further comprising instructions to limit the results of the first compilation to a selected range of the function derivative and to return the data that fall within the selected derivative range.
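Several of the claims above recite a curve-fitting routine that computes a theoretical function for the compilation data and at least one derivative of it, and a limit based on a selected derivative range. The sketch below is one possible reading, offered under explicit assumptions: the fitted function is a least-squares quadratic of score against rank position, the derivative is evaluated analytically, and "falling within the selected derivative range" is interpreted as keeping the rank positions whose fitted slope lies between two bounds. None of these choices is specified by the claims, and the function names are hypothetical.

```python
def _det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def fit_quadratic(xs, ys):
    """Least-squares fit of y = a*x**2 + b*x + c via the normal equations."""
    n = len(xs)
    sx = sum(xs)
    sx2 = sum(x ** 2 for x in xs)
    sx3 = sum(x ** 3 for x in xs)
    sx4 = sum(x ** 4 for x in xs)
    sy = sum(ys)
    sxy = sum(x * y for x, y in zip(xs, ys))
    sx2y = sum(x * x * y for x, y in zip(xs, ys))
    matrix = [[sx4, sx3, sx2], [sx3, sx2, sx], [sx2, sx, n]]
    rhs = [sx2y, sxy, sy]
    det = _det3(matrix)
    if det == 0:
        raise ValueError("need at least three distinct x values")
    coeffs = []
    for col in range(3):  # Cramer's rule: swap one column for the right-hand side
        replaced = [row[:] for row in matrix]
        for r in range(3):
            replaced[r][col] = rhs[r]
        coeffs.append(_det3(replaced) / det)
    return tuple(coeffs)  # (a, b, c)


def limit_by_derivative(scores, lo, hi):
    """Rank positions whose fitted slope f'(x) = 2*a*x + b lies within [lo, hi]."""
    xs = list(range(len(scores)))
    a, b, _ = fit_quadratic(xs, scores)
    return [x for x in xs if lo <= 2 * a * x + b <= hi]


# Assumed scores in rank order; they happen to follow (x - 4)**2 exactly.
scores = [16, 9, 4, 1, 0]
steep_part = limit_by_derivative(scores, lo=-8.5, hi=-5)
```

A quadratic was chosen only because it has a non-constant first derivative and can be fitted with the standard library alone; a real system might fit a different family of functions.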
CN200980161341.4A 2009-07-22 2009-07-22 Method, system, and apparatus for delivering query results from an electronic document collection Active CN102483749B (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/US2009/051432 WO2011011002A1 (en) 2009-07-22 2009-07-22 Method, system, and apparatus for delivering query results from an electronic document collection

Publications (2)

Publication Number Publication Date
CN102483749A true CN102483749A (en) 2012-05-30
CN102483749B CN102483749B (en) 2015-06-17

Family

ID=43499303

Family Applications (1)

Application Number Title Priority Date Filing Date
CN200980161341.4A Active CN102483749B (en) 2009-07-22 2009-07-22 Method, system, and apparatus for delivering query results from an electronic document collection

Country Status (7)

Country Link
EP (1) EP2457182A4 (en)
JP (1) JP5534266B2 (en)
KR (1) KR101481680B1 (en)
CN (1) CN102483749B (en)
AU (1) AU2009350126A1 (en)
CA (1) CA2768901A1 (en)
WO (1) WO2011011002A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104871201A (en) * 2012-10-09 2015-08-26 Ubic股份有限公司 Forensic system, forensic method, and forensic program

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9154942B2 (en) 2008-11-26 2015-10-06 Free Stream Media Corp. Zero configuration communication between a browser and a networked media device
US10880340B2 (en) 2008-11-26 2020-12-29 Free Stream Media Corp. Relevancy improvement through targeting of information based on data gathered from a networked device associated with a security sandbox of a client device
US8180891B1 (en) 2008-11-26 2012-05-15 Free Stream Media Corp. Discovery, access control, and communication with networked services from within a security sandbox
US9519772B2 (en) 2008-11-26 2016-12-13 Free Stream Media Corp. Relevancy improvement through targeting of information based on data gathered from a networked device associated with a security sandbox of a client device
US9026668B2 (en) 2012-05-26 2015-05-05 Free Stream Media Corp. Real-time and retargeted advertising on multiple screens of a user watching television
US10977693B2 (en) 2008-11-26 2021-04-13 Free Stream Media Corp. Association of content identifier of audio-visual data with additional data through capture infrastructure
US10631068B2 (en) 2008-11-26 2020-04-21 Free Stream Media Corp. Content exposure attribution based on renderings of related content across multiple devices
US10334324B2 (en) 2008-11-26 2019-06-25 Free Stream Media Corp. Relevant advertisement generation based on a user operating a client device communicatively coupled with a networked media device
US9386356B2 (en) 2008-11-26 2016-07-05 Free Stream Media Corp. Targeting with television audience data across multiple screens
US9961388B2 (en) 2008-11-26 2018-05-01 David Harrison Exposure of public internet protocol addresses in an advertising exchange server to improve relevancy of advertisements
US10567823B2 (en) 2008-11-26 2020-02-18 Free Stream Media Corp. Relevant advertisement generation based on a user operating a client device communicatively coupled with a networked media device
US10419541B2 (en) 2008-11-26 2019-09-17 Free Stream Media Corp. Remotely control devices over a network without authentication or registration
US9986279B2 (en) 2008-11-26 2018-05-29 Free Stream Media Corp. Discovery, access control, and communication with networked services
US9223769B2 (en) 2011-09-21 2015-12-29 Roman Tsibulevskiy Data processing systems, devices, and methods for content analysis

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH11250070A (en) * 1998-03-05 1999-09-17 Toshiba Corp Similar document retrieval device and its method, and medium for recording program for similar document retrieval
US20090138466A1 (en) * 2007-08-17 2009-05-28 Accupatent, Inc. System and Method for Search

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001325273A (en) * 2000-05-15 2001-11-22 Ricoh Co Ltd Important sentence extraction device, method therefor and storage medium
US7376635B1 (en) * 2000-07-21 2008-05-20 Ford Global Technologies, Llc Theme-based system and method for classifying documents
US6662178B2 (en) * 2001-03-21 2003-12-09 Knowledge Management Objects, Llc Apparatus for and method of searching and organizing intellectual property information utilizing an IP thesaurus
JP3717808B2 (en) * 2001-06-29 2005-11-16 株式会社日立製作所 Information retrieval system
US20040230568A1 (en) * 2002-10-28 2004-11-18 Budzyn Ludomir A. Method of searching information and intellectual property
US8600963B2 (en) * 2003-08-14 2013-12-03 Google Inc. System and method for presenting multiple sets of search results for a single query
US7346839B2 (en) * 2003-09-30 2008-03-18 Google Inc. Information retrieval based on historical data
US20080015968A1 (en) * 2005-10-14 2008-01-17 Leviathan Entertainment, Llc Fee-Based Priority Queuing for Insurance Claim Processing
JP5146108B2 (en) * 2008-05-27 2013-02-20 日本電気株式会社 Document importance calculation system, document importance calculation method, and program
WO2010128967A1 (en) * 2009-05-07 2010-11-11 Cpa Software Limited Method, system, and apparatus for searching an electronic document collection

Also Published As

Publication number Publication date
EP2457182A1 (en) 2012-05-30
JP5534266B2 (en) 2014-06-25
KR20120085731A (en) 2012-08-01
CN102483749B (en) 2015-06-17
AU2009350126A1 (en) 2012-02-23
KR101481680B1 (en) 2015-01-12
EP2457182A4 (en) 2014-01-15
WO2011011002A1 (en) 2011-01-27
CA2768901A1 (en) 2011-01-27
JP2012533817A (en) 2012-12-27

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant