CN109325101A - Method and apparatus for automatically obtaining high-value patents - Google Patents

Method and apparatus for automatically obtaining high-value patents

Info

Publication number
CN109325101A
Authority
CN
China
Prior art keywords
document
value
information
patentee
keyword
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN201811085899.6A
Other languages
Chinese (zh)
Inventor
邓梅
宋国华
黄家旺
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
JIANGSU RAINPAT DATA SERVICE Co Ltd
Original Assignee
JIANGSU RAINPAT DATA SERVICE Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by JIANGSU RAINPAT DATA SERVICE Co Ltd
Priority to CN201811085899.6A
Publication of CN109325101A
Legal status: Withdrawn


Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00 Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10 Services
    • G06Q50/18 Legal services
    • G06Q50/184 Intellectual property management

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Technology Law (AREA)
  • Tourism & Hospitality (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Health & Medical Sciences (AREA)
  • Economics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • Strategic Management (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention provides a method and apparatus for automatically obtaining high-value patents. The method comprises: obtaining a target document; obtaining a first keyword from the target document; obtaining a first target patent database according to the first keyword; obtaining a first document from the first target patent database; obtaining patentee information from the first document, wherein the nature of the patentee is judged from the patentee information; and, when the patentee information satisfies a first predetermined condition, sending a prompt message to the first target patent database, wherein the prompt message is the first document. The invention addresses the technical problems of prior-art high-value patent retrieval, in which the search scope tends to be too broad, noise removal is laborious, and retrieval is inaccurate, or in which keywords are not expanded or generalized to broader terms during retrieval so that relevant patents are missed. It achieves the technical effects of convenient operation, high retrieval efficiency, fast speed, and accurate search results.

Description

Method and apparatus for automatically obtaining high-value patents
Technical field
The present invention relates to the technical field of intellectual property, and in particular to a method and apparatus for automatically obtaining high-value patents.
Background
At present, retrieving high-value patents on a patent search platform generally requires manually entering keywords and combining them with other fields through logical operators such as AND and OR to form a search query. These fields include the patent number, patent title, abstract, international classification number, inventor, applicant, and publication date. The results are then screened layer by layer through computer-based and manual noise removal to finally obtain the high-value patents.
However, in the course of devising the technical solution of the embodiments of the present application, the inventors found that the above technique has at least the following technical problem:
When high-value patents are retrieved in the prior art, the search scope tends to be too broad, noise removal is laborious, and retrieval is inaccurate; or keywords are not expanded or generalized to broader terms during retrieval, so that relevant patents are missed.
Summary of the invention
The embodiments of the present invention provide a method and apparatus for automatically obtaining high-value patents. They address the technical problems of prior-art high-value patent retrieval, in which the search scope tends to be too broad, noise removal is laborious, and retrieval is inaccurate, or in which keywords are not expanded or generalized to broader terms during retrieval so that relevant patents are missed. They achieve the technical effects of convenient operation, high retrieval efficiency, fast speed, and accurate search results.
In view of the above problems, the embodiments of the present application are proposed to provide a method and apparatus for automatically obtaining high-value patents.
In a first aspect, the present invention provides a method for automatically obtaining high-value patents, the method comprising: obtaining a target document; obtaining a first keyword from the target document; obtaining a first target patent database according to the first keyword; obtaining a first document from the first target patent database; obtaining patentee information from the first document, wherein the nature of the patentee is judged from the patentee information; and, when the patentee information satisfies a first predetermined condition, sending a prompt message to the first target patent database, wherein the prompt message is the first document.
Preferably, obtaining the first document from the first target patent database comprises: determining a first value assessment score of a document according to the first target patent database; judging whether the first value assessment score is greater than a first predetermined threshold; and, if the first value assessment score is greater than the first predetermined threshold, obtaining the first document.
Preferably, obtaining the patentee information from the first document comprises: judging, via a first search platform, whether the patentee has a transfer (assignment) history; and, when the patentee's transfer history is greater than a second predetermined threshold, obtaining a first value score of the first document.
Preferably, obtaining the patentee information from the first document further comprises: obtaining, via the first search platform, the number of times the first document has been cited; and obtaining a second value score of the first document according to that citation count.
Preferably, sending the prompt message to the first target patent database when the patentee information satisfies the first predetermined condition comprises: obtaining a first weighted value of the first document according to the patentee's transfer history; obtaining a second weighted value of the first document according to the citation count of the first document; obtaining a second value assessment score of the first document according to the first weighted value, the first value score, the second weighted value and the second value score; judging whether the second value assessment score satisfies the first predetermined condition; and, when the second value assessment score satisfies the first predetermined condition, sending the prompt message to the first target patent database.
Preferably, the method further comprises: obtaining, according to the first target patent database, the number of claims of the first document and the word count of its claims; obtaining a third weighted value of the first document according to its number of claims; obtaining a fourth weighted value of the first document according to the word count of its claims; and determining a third value assessment score of the first document according to the third weighted value and the fourth weighted value.
In a second aspect, the present invention provides an apparatus for automatically obtaining high-value patents, the apparatus comprising:
a first obtaining unit, which obtains a target document;
a second obtaining unit, which obtains a first keyword from the target document;
a third obtaining unit, which obtains a first target patent database according to the first keyword;
a fourth obtaining unit, which obtains a first document from the first target patent database;
a fifth obtaining unit, which obtains patentee information from the first document, wherein the nature of the patentee is judged from the patentee information;
a first processing unit, which, when the patentee information satisfies a first predetermined condition, sends a prompt message to the first target patent database, wherein the prompt message is the first document.
Preferably, the fourth obtaining unit comprises:
a first determining unit, which determines a first value assessment score of a document according to the first target patent database;
a first judging unit, which judges whether the first value assessment score is greater than a first predetermined threshold;
a sixth obtaining unit, which obtains the first document if the first value assessment score is greater than the first predetermined threshold.
Preferably, the fifth obtaining unit further comprises:
a second judging unit, which judges, via a first search platform, whether the patentee has a transfer history;
a seventh obtaining unit, which obtains a first value score of the first document when the patentee's transfer history is greater than a second predetermined threshold.
Preferably, the fifth obtaining unit further comprises:
an eighth obtaining unit, which obtains, via the first search platform, the number of times the first document has been cited;
a ninth obtaining unit, which obtains a second value score of the first document according to the citation count of the first document.
Preferably, the first processing unit further comprises:
a tenth obtaining unit, which obtains a first weighted value of the first document according to the patentee's transfer history;
an eleventh obtaining unit, which obtains a second weighted value of the first document according to the citation count of the first document;
a twelfth obtaining unit, which obtains a second value assessment score of the first document according to the first weighted value, the first value score, the second weighted value and the second value score;
a third judging unit, which judges whether the second value assessment score satisfies the first predetermined condition;
a second processing unit, which sends the prompt message to the first target patent database when the second value assessment score satisfies the first predetermined condition.
Preferably, the apparatus further comprises:
a thirteenth obtaining unit, which obtains, according to the first target patent database, the number of claims of the first document and the word count of its claims;
a fourteenth obtaining unit, which obtains a third weighted value of the first document according to its number of claims;
a fifteenth obtaining unit, which obtains a fourth weighted value of the first document according to the word count of its claims;
a second determining unit, which determines a third value assessment score of the first document according to the third weighted value and the fourth weighted value.
In a third aspect, the present invention provides an apparatus for automatically obtaining high-value patents, comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor, when executing the program, performs the following steps: obtaining a target document; obtaining a first keyword from the target document; obtaining a first target patent database according to the first keyword; obtaining a first document from the first target patent database; obtaining patentee information from the first document, wherein the nature of the patentee is judged from the patentee information; and, when the patentee information satisfies a first predetermined condition, sending a prompt message to the first target patent database, wherein the prompt message is the first document.
One or more of the above technical solutions in the embodiments of the present application have at least one or more of the following technical effects:
1. The embodiments of the present application provide a method and apparatus for automatically obtaining high-value patents, the method comprising: obtaining a target document; obtaining a first keyword from the target document; obtaining a first target patent database according to the first keyword; obtaining a first document from the first target patent database; obtaining patentee information from the first document, wherein the nature of the patentee is judged from the patentee information; and, when the patentee information satisfies a first predetermined condition, sending a prompt message to the first target patent database, wherein the prompt message is the first document. The invention addresses the technical problems of prior-art high-value patent retrieval, in which the search scope tends to be too broad, noise removal is laborious, and retrieval is inaccurate, or in which keywords are not expanded or generalized to broader terms during retrieval so that relevant patents are missed. It achieves the technical effects of convenient operation, high retrieval efficiency, fast speed, and accurate search results.
2. In the embodiments of the present application, obtaining the patentee information from the first document comprises: judging, via a first search platform, whether the patentee has a transfer history; and, when the patentee's transfer history is greater than a second predetermined threshold, obtaining a first value score of the first document. This further achieves the technical effect of automatically retrieving patent transfer information and accurately judging patent value.
3. In the embodiments of the present application, sending the prompt message to the first target patent database when the patentee information satisfies the first predetermined condition comprises: obtaining a first weighted value of the first document according to the patentee's transfer history; obtaining a second weighted value of the first document according to the citation count of the first document; obtaining a second value assessment score of the first document according to the first weighted value, the first value score, the second weighted value and the second value score; judging whether the second value assessment score satisfies the first predetermined condition; and, when it does, sending the prompt message to the first target patent database. This further achieves the technical effect of removing noise effectively and judging patent value comprehensively, thereby improving retrieval efficiency and precision.
The above is only an overview of the technical solution of the present invention. So that the technical means of the invention can be understood more clearly and implemented in accordance with the specification, and so that the above and other objects, features and advantages of the invention are more apparent, specific embodiments of the invention are set out below.
Brief description of the drawings
Fig. 1 is a flow chart of a method for automatically obtaining high-value patents in an embodiment of the present invention;
Fig. 2 is a structural diagram of an apparatus for automatically obtaining high-value patents in an embodiment of the present invention;
Fig. 3 is a structural diagram of another apparatus for automatically obtaining high-value patents in an embodiment of the present invention.
Detailed description of the embodiments
The embodiments of the present invention provide a method and apparatus for automatically obtaining high-value patents. The general idea of the technical solution is as follows: obtaining a target document; obtaining a first keyword from the target document; obtaining a first target patent database according to the first keyword; obtaining a first document from the first target patent database; obtaining patentee information from the first document, wherein the nature of the patentee is judged from the patentee information; and, when the patentee information satisfies a first predetermined condition, sending a prompt message to the first target patent database, wherein the prompt message is the first document. The invention addresses the technical problems of prior-art high-value patent retrieval, in which the search scope tends to be too broad, noise removal is laborious, and retrieval is inaccurate, or in which keywords are not expanded or generalized to broader terms during retrieval so that relevant patents are missed. It achieves the technical effects of convenient operation, high retrieval efficiency, fast speed, and accurate search results.
The technical solution of the present invention is described in detail below with reference to the drawings and specific embodiments. It should be understood that the embodiments of the present application and the specific features in them are a detailed explanation of the technical solution of the application and not a limitation on it; where no conflict arises, the embodiments and the technical features in them may be combined with one another.
Embodiment 1
Fig. 1 is a flow chart of a method for automatically obtaining high-value patents in an embodiment of the present invention. As shown in Fig. 1, the method comprises:
Step 110: obtaining a target document.
Specifically, the user may, according to actual needs, look up one or more documents or patent documents to serve as the base target document; target patents can then be retrieved on the basis of this target document.
Step 120: obtaining a first keyword from the target document.
Specifically, the document to be searched is entered into a retrieval system that extracts keywords automatically. The system analyzes the content of the target retrieval document to obtain a keyword, which serves as the first keyword. The first keyword may be a keyword in the target document, or a word that occurs frequently in information such as the inventive point or the technical field of the document.
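The frequency-based choice of the first keyword can be sketched as follows. This is a minimal illustration, not the patent's implementation: the tokenizer, the `STOP_WORDS` list and the function name `extract_first_keyword` are assumptions, since the source does not say how the content analysis is performed.

```python
import re
from collections import Counter

# Hypothetical stop-word list; the source does not specify one.
STOP_WORDS = {"a", "an", "the", "of", "and", "or", "to", "in", "is", "for", "with", "by"}


def extract_first_keyword(text: str) -> str:
    """Return the most frequent non-stop-word token in the text."""
    tokens = re.findall(r"[a-z]+", text.lower())
    counts = Counter(t for t in tokens if t not in STOP_WORDS)
    if not counts:
        raise ValueError("no candidate keywords found")
    # most_common(1) yields the single highest-count token
    return counts.most_common(1)[0][0]
```

A production system would more plausibly weight terms by field (title, abstract, claims) than by raw counts alone.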
Step 130: obtaining a first target patent database according to the first keyword.
Specifically, a search is run on a search platform or in a database using the first keyword obtained, and the corresponding patent database is obtained from the search. This database may contain all published patent documents, or the patent documents may be filtered manually by status: granted, not granted, or lapsed.
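As a rough sketch of this step, the keyword search and the optional status filter might look like the following. The `PatentRecord` type, its status values and the substring match are all assumptions made for illustration; a real search platform would use its own index and query language.

```python
from dataclasses import dataclass


@dataclass
class PatentRecord:
    number: str
    text: str
    status: str  # assumed values: "granted", "pending", "lapsed"


def build_target_database(records, keyword, allowed_statuses=None):
    """Keep records whose text contains the keyword, optionally filtered by status."""
    keyword = keyword.lower()
    hits = [r for r in records if keyword in r.text.lower()]
    if allowed_statuses is not None:
        hits = [r for r in hits if r.status in allowed_statuses]
    return hits
```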
Step 140: obtaining a first document from the first target patent database.
Further, obtaining the first document from the first target patent database comprises: determining a first value assessment score of a document according to the first target patent database; judging whether the first value assessment score is greater than a first predetermined threshold; and, if the first value assessment score is greater than the first predetermined threshold, obtaining the first document.
Specifically, the number of claims and the word count of each document are determined from the first target patent database; the first weighted value and second weighted value of the document are judged from the number and word count of the claims, and the first value assessment score of the document is then calculated. A threshold is preset according to the user's actual needs, and the retrieved patent documents are assessed against it: if a patent document's assessment score is greater than the preset threshold, the document is retained, that is, it is obtained as a first document; if the score is less than the preset threshold, the document is discarded. For example, if the first predetermined threshold is set to 85, a document whose first value assessment score is greater than 85 is taken as a first document.
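The threshold test in this step can be sketched as below. The scores are taken as given inputs because the source does not state how the first value assessment score is computed; the function name and the dictionary layout are illustrative assumptions, while the default of 85 comes from the example in the text.

```python
def select_first_documents(scores: dict, threshold: float = 85.0) -> list:
    """Return document numbers whose score is strictly greater than the threshold."""
    return [doc for doc, score in scores.items() if score > threshold]
```

Note that the comparison is strict, matching "greater than 85": a document scoring exactly 85 is not retained.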
Step 150: obtaining patentee information from the first document, wherein the nature of the patentee is judged from the patentee information.
Further, obtaining the patentee information from the first document comprises: judging, via a first search platform, whether the patentee has a transfer history; and, when the patentee's transfer history is greater than a second predetermined threshold, obtaining a first value score of the first document. Further, obtaining the patentee information from the first document also comprises: obtaining, via the first search platform, the number of times the first document has been cited; and obtaining a second value score of the first document according to that citation count.
Specifically, each patent document in the obtained patent database is searched to obtain its patentee information and transfer history. A threshold is preset; when the number of times a patent has been transferred is higher than this threshold, the patent is scored to obtain its first value score. The nature of the patent's patentee or applicant and the number of times the patent has been cited are obtained via the search platform, and the second value score of the patent is then judged from the citation count.
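A minimal sketch of the two scores follows. The source only says that a patent is scored once its transfer count exceeds the second threshold and that the second score is derived from the citation count; the capped linear scale and the parameters `points_per_transfer` and `points_per_citation` are assumptions introduced here for illustration.

```python
from typing import Optional


def first_value_score(transfer_count: int, second_threshold: int = 1,
                      points_per_transfer: float = 20.0) -> Optional[float]:
    """Score a patent only when its transfer count exceeds the threshold."""
    if transfer_count <= second_threshold:
        return None  # not scored: transfer history does not exceed the threshold
    # Assumed scale: linear in the transfer count, capped at 100
    return min(100.0, transfer_count * points_per_transfer)


def second_value_score(citation_count: int,
                       points_per_citation: float = 5.0) -> float:
    """Assumed scale: linear in the citation count, capped at 100."""
    return min(100.0, citation_count * points_per_citation)
```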
Step 160: when the patentee information satisfies a first predetermined condition, sending a prompt message to the first target patent database, wherein the prompt message is the first document.
Further, sending the prompt message to the first target patent database when the patentee information satisfies the first predetermined condition comprises: obtaining a first weighted value of the first document according to the patentee's transfer history; obtaining a second weighted value of the first document according to the citation count of the first document; obtaining a second value assessment score of the first document according to the first weighted value, the first value score, the second weighted value and the second value score; judging whether the second value assessment score satisfies the first predetermined condition; and, when it does, sending the prompt message to the first target patent database.
Specifically, the first weighted value is: the transfer history of the first document's patentee × its share of the score; the second weighted value is: the citation count of the first document × its share of the score. The second value assessment score of the first document is obtained from the first weighted value, the first value score, the second weighted value and the second value score. When the second value assessment score of the first document satisfies the condition, the first document is sent to the first target patent database, the document is saved, and the user is notified that the document meets the retrieval requirements. In addition, a second keyword is obtained from the user's search history on the patent search platform, and patents with a high second value assessment score that are related to the second keyword are pushed to the user; the pushed information includes the patent's patentee, abstract, and patentee transfer history.
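The weighted values above translate directly into code; how they are combined with the two value scores does not. The sketch below assumes the combination is a weighted sum, `w1 * s1 + w2 * s2`, and that the first predetermined condition is a minimum score. Both are assumptions, as are the default share ratios.

```python
def weighted_value(count: float, share_ratio: float) -> float:
    """Weighted value as described in the text: count x share of the score."""
    return count * share_ratio


def second_value_assessment(transfer_count: int, citation_count: int,
                            first_score: float, second_score: float,
                            transfer_share: float = 0.6,
                            citation_share: float = 0.4) -> float:
    w1 = weighted_value(transfer_count, transfer_share)
    w2 = weighted_value(citation_count, citation_share)
    # Assumed combination: the source does not give the formula
    return w1 * first_score + w2 * second_score


def meets_first_condition(score: float, minimum: float) -> bool:
    """Assumed form of the first predetermined condition: a minimum score."""
    return score >= minimum
```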
Further, the method also comprises: obtaining, according to the first target patent database, the number of claims of the first document and the word count of its claims; obtaining a third weighted value of the first document according to its number of claims; obtaining a fourth weighted value of the first document according to the word count of its claims; and determining a third value assessment score of the first document according to the third weighted value and the fourth weighted value.
Specifically, the third weighted value is: the number of claims of the first document × its share of the score; the fourth weighted value is: the word count of the claims of the first document × its share of the score. The third value assessment score of the first document is obtained from the third weighted value and the fourth weighted value, and is used to screen the patent documents further so that the user can obtain the target patents more accurately.
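The third value assessment score can be sketched in the same way. The two weighted values follow the text; adding them together is an assumption, since the source says only that the score is determined "according to" both, and the default share ratios are placeholders.

```python
def third_value_assessment(claim_count: int, claim_word_count: int,
                           claim_share: float = 0.5,
                           word_share: float = 0.01) -> float:
    w3 = claim_count * claim_share        # third weighted value
    w4 = claim_word_count * word_share    # fourth weighted value
    # Assumed combination: a simple sum of the two weighted values
    return w3 + w4
```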
Further, the method also comprises: obtaining a first document from a first search database; judging the similarity between the first document and the target retrieval document; and, when the similarity satisfies a first predetermined condition, listing the first search database as the target database. Further, judging the similarity between the first document and the target retrieval document also comprises: performing semantic analysis on the claims of the first document and of the target retrieval document to obtain a first similar paragraph; determining a first ratio of the word count of the first similar paragraph to the word count of the claims of the target retrieval document; judging whether the first ratio is greater than a first predetermined threshold; and, when the first ratio is greater than the first predetermined threshold, obtaining a second similarity between the first document and the target retrieval document.
Specifically, semantic analysis is used to find the most frequent keywords in the content of the document, and likewise the most frequent keywords in the target retrieval document. The two sets of keywords are compared to obtain a similarity, which is the first similarity; if the keywords are identical or synonymous, the first similarity value is large. Beyond comparing keywords, the claims of the two documents are also compared to make the search result more accurate. The process is as follows: semantic analysis is performed on the claims of the document and of the target retrieval document respectively, and paragraphs with highly similar content are identified; the word counts of these similar paragraphs are then compared to obtain the second similarity, which is large if the word counts are also close. Finally, the first similarity and the second similarity are compared, and the larger of the two is taken as the final similarity between the document and the target retrieval document. The similarity value obtained is compared with the similarity preset in the retrieval system to judge whether the retrieved document is close to the target retrieval document, and the content matching the target retrieval document is ultimately obtained by automatic search. Automatic retrieval by the system is more comprehensive and avoids problems such as missed and erroneous results caused by human involvement. This solves the prior-art technical problem that retrieval is performed manually, by searching on titles or keywords and then collating and analyzing the results, which is time-consuming and prone to missed results. It achieves the technical effects of automatic system retrieval, more careful comparison, more accurate results, avoidance of omissions caused by unstable human factors, and improved retrieval efficiency.
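The two-stage comparison described above can be sketched as follows. The source does not define either similarity measure, so this sketch substitutes simple stand-ins: Jaccard overlap of the most frequent tokens for the first similarity, and a word-count ratio of two already-matched paragraphs for the second. Synonym matching, which the text mentions, is not handled here.

```python
import re
from collections import Counter


def top_keywords(text: str, n: int = 5) -> set:
    """The n most frequent tokens, standing in for semantic keyword extraction."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return {word for word, _ in Counter(tokens).most_common(n)}


def first_similarity(doc: str, target: str, n: int = 5) -> float:
    """Assumed measure: Jaccard overlap of the two keyword sets."""
    a, b = top_keywords(doc, n), top_keywords(target, n)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def second_similarity(paragraph_a: str, paragraph_b: str) -> float:
    """Assumed measure: ratio of the shorter word count to the longer one."""
    la, lb = len(paragraph_a.split()), len(paragraph_b.split())
    longest = max(la, lb)
    return min(la, lb) / longest if longest else 0.0


def final_similarity(first: float, second: float) -> float:
    """The larger of the two similarities, as the text describes."""
    return max(first, second)
```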
Further, the method also includes: according to the target retrieval document, obtain expansion word range;From described The first expansion word is obtained according to the first rule in one searching database, wherein first expansion word is in the expansion word range It is interior;The second searching database is obtained according to first expansion word;According to second searching database and first retrieval Database obtains target database.
Specifically, the technical field of the target retrieval document is determined by analysing the meaning of the words in its full text and description. The technical knowledge used in that field is identified from the technical field, which in turn determines the technical tool dictionary. The technical tool dictionary then determines the range of keywords for the core technology in the patent document, that is, the expansion word range. Multiple patent documents are retrieved from the first retrieval database using the first search term and are subjected to semantic analysis, mainly to identify the keywords of the core technology in each patent document, for example from the title of the invention, the technical field and the abstract. Multiple expansion words for searching patents are determined from these keywords. Among the expansion words, those with the same or similar meaning are identified, and the expansion word that recurs most often is taken as the first expansion word. The first expansion word is a near-synonym of the first search term, for example polyethylene and thermoplastic resin. It is then judged whether the first expansion word is within the expansion word range; when it is, the database of patent documents retrieved with the first expansion word is the second retrieval database. The target database of the target retrieval document is obtained as the intersection of the first retrieval database, determined by the first search term, and the second retrieval database, determined by the first expansion word; patent documents retrieved through both databases have high accuracy. A weighted value of the target database is calculated from the first weighted value and the second weighted value, and the accuracy of the target database is determined by that weighted value.
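The selection of the first expansion word and the intersection of the two retrieval databases can be sketched as follows. This is a minimal illustration, not the patented implementation: the function names, the use of plain string sets for the expansion word range and of publication-number sets for the databases are all assumptions made for the example.

```python
from collections import Counter


def pick_first_expansion_word(candidates, expansion_range):
    """Take the most frequently recurring candidate expansion word and
    accept it only if it lies inside the expansion word range.

    `candidates` is a list of expansion words (repeats allowed);
    `expansion_range` is a set of words taken from the technical tool
    dictionary. Both structures are assumptions of this sketch.
    """
    if not candidates:
        return None
    word, _count = Counter(candidates).most_common(1)[0]
    return word if word in expansion_range else None


def build_target_database(first_db, second_db):
    """Target database as the intersection of the two retrieval databases,
    each modelled here as a set of document identifiers."""
    return first_db & second_db


candidates = ["thermoplastic resin", "polyethylene", "thermoplastic resin", "nylon"]
expansion_range = {"thermoplastic resin", "polyethylene"}
first_expansion_word = pick_first_expansion_word(candidates, expansion_range)

first_db = {"CN1", "CN2", "CN3"}    # hypothetical hits for the first search term
second_db = {"CN2", "CN3", "CN4"}   # hypothetical hits for the first expansion word
target_db = build_target_database(first_db, second_db)
```

Modelling the databases as sets makes the intersection step a single operation; a real system would intersect result lists returned by a search backend.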
Further, the method also includes: obtaining, according to the target retrieval document, the technical field of the target retrieval document; obtaining a technical tool dictionary according to the technical field; obtaining a first expansion word according to the technical tool dictionary and the first keyword; obtaining a first comparison document according to the target retrieval document, the first keyword and the first expansion word; obtaining a first document according to the first retrieval database; judging the similarity between the first document and the first comparison document; and, when the similarity meets a first predetermined condition, storing the first document in the target database.
Specifically, analysing the content of the target retrieval document shows which technical field its content belongs to. Once the technical field is determined, highly relevant data can be searched for and invalid information excluded. The technical tool dictionary of that field is then looked up according to the technical field of the target retrieval document. The technical tool dictionary contains all the relevant principal terms, proprietary features, technical terms and the like of the technical field, that is, it comprehensively covers the core content and keywords of the field.
The technical tool dictionary is searched for words related to the first keyword, such as synonyms, words of similar meaning and words denoting things that serve the same function. These are the similar words of the first keyword, and there are usually several. For example, if the keyword is "nail", the technical tool dictionary of the relevant field may yield similar words such as "screw" and "bolt", which are close in meaning or serve the same function. The similar words found are then subjected to semantic analysis to identify multiple expansion words close in meaning to the first keyword. Finally, the number of occurrences of each of these expansion words is counted, and the expansion word that occurs most often is taken as the first expansion word, which is thus the similar word closest to the first keyword.
The first expansion word so obtained is combined with the target retrieval document and the first keyword to search a large database for a first comparison document that meets the conditions. The first comparison document is a document that matches the target retrieval document closely and can serve as a reference document for classification. The first retrieval database is obtained by searching the large database with the first keyword; the documents recalled in it have a certain relevance to the target retrieval document and contain the first keyword. A first document in the first retrieval database is compared with the first comparison document as follows. First, the first document is semantically analysed to obtain its multiple first keywords. Next, the content of the first comparison document is semantically analysed to obtain the multiple second keywords that occur in it. Finally, the first keywords and the second keywords are semantically analysed in turn to determine how similar they are, and this degree of similarity is quantified by calculation into a first similarity value between the first keywords and the second keywords. This value is taken as the similarity between the first document and the first comparison document.
The similarity between the first document and the first comparison document is compared with a first predetermined condition preset in the system, which may be a preset similarity threshold. When the similarity meets the first predetermined condition, the first document belongs to the same technical field as the first comparison document and is closely related to it in content, so the first document is entered into the target database as a target document. When the similarity is below the first predetermined condition, the first document is a non-conforming document; it is not entered into the target database and is deleted.
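The comparison against the first predetermined condition can be sketched as below. The specification does not say how the semantic similarity is computed, so this example substitutes a simple Jaccard overlap between keyword sets as a stand-in; the function names, the 0.5 threshold and the sample documents are all hypothetical.

```python
def keyword_similarity(first_keywords, second_keywords):
    """Jaccard overlap between two keyword collections.

    A stand-in for the semantic similarity described in the text,
    which the specification does not define.
    """
    a, b = set(first_keywords), set(second_keywords)
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def filter_into_target_db(documents, comparison_keywords, threshold=0.5):
    """Keep the identifiers of documents whose keyword similarity to the
    first comparison document meets the (assumed) threshold."""
    return [
        doc_id
        for doc_id, keywords in documents.items()
        if keyword_similarity(keywords, comparison_keywords) >= threshold
    ]


documents = {
    "A": ["nail", "screw", "bolt"],
    "B": ["paint", "resin"],
}
comparison_keywords = ["screw", "bolt", "nail", "washer"]
kept = filter_into_target_db(documents, comparison_keywords)
```

Document "A" shares three of four distinct keywords with the comparison document and is kept; document "B" shares none and is dropped.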
Further, the method also includes: obtaining a first keyword from the document submitted for automatic retrieval; determining a first retrieval database according to the first keyword; determining a first document from the first retrieval database; judging the similarity between the first document and the target retrieval document; and, when the similarity meets a predetermined condition, obtaining a second keyword from the first document, wherein the first keyword and the second keyword belong to the same technical field.
Specifically, the document to be retrieved is entered into a retrieval system that extracts keywords automatically. The system analyses the content of the target retrieval document and obtains a keyword from it as the first keyword. The first keyword may be the subject of the title, a word that occurs frequently in the document, or a word expressing the core function as identified by semantic analysis. After the first keyword is obtained, it is verified. First, the particular technical field described by the target retrieval document is determined, and the technical tool dictionary of that field is looked up accordingly. The technical tool dictionary contains all the relevant principal terms, proprietary features, technical terms and the like of the technical field, that is, it comprehensively covers all the keywords of the field. All keywords of the technical field of the target retrieval document are then looked up in the technical tool dictionary and compared with the first keyword to judge whether the first keyword falls within the keyword range of that field. If the first keyword is within the keyword range, it is an effective keyword; if not, it is an invalid keyword and the search continues until an effective first keyword is found. The first keyword is then used to search a large database of internet documents, and the set of all documents relating to the first keyword forms the first retrieval database. Because the first retrieval database is the set of all documents retrieved after the keyword has been verified, the retrieval is both comprehensive and correct.
The first retrieval database, obtained by searching the large database with the first keyword, contains the recalled related documents, which have a certain relevance to the target retrieval document. Documents are taken from the first retrieval database, and the content of the highly relevant documents is compared with the target retrieval document one by one to judge the degree of similarity between each document and the target retrieval document. The system quantifies this degree of similarity as a specific value.
A similarity threshold is preset in the system, and the similarity obtained is compared with this predetermined condition. When the similarity value between a document in the first retrieval database and the target retrieval document meets the predetermined condition, the document is determined to be an effective document. Once an effective document has been determined, a second keyword is sought in it. The second keyword differs from the first keyword but belongs to the same technical field; both are obtained by analysing the first retrieval database retrieved for the determined technical field.
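The keyword verification and the extraction of second keywords can be sketched as follows, assuming the technical tool dictionary is available as a plain set of terms. The function names and sample data are hypothetical, and membership testing is a simplification of whatever matching the system actually performs.

```python
def is_effective_keyword(keyword, technical_tool_dictionary):
    """A keyword is effective when it falls within the keyword range of
    the technical field, modelled here as set membership."""
    return keyword in technical_tool_dictionary


def extract_second_keywords(document_keywords, first_keyword, technical_tool_dictionary):
    """Keywords of an effective document that differ from the first keyword
    but belong to the same technical field (order preserved, no repeats)."""
    seen = set()
    result = []
    for keyword in document_keywords:
        if keyword == first_keyword or keyword in seen:
            continue
        if keyword in technical_tool_dictionary:
            seen.add(keyword)
            result.append(keyword)
    return result


dictionary = {"nail", "screw", "bolt", "washer"}   # hypothetical field dictionary
doc_keywords = ["nail", "screw", "paint", "bolt", "screw"]
second_keywords = extract_second_keywords(doc_keywords, "nail", dictionary)
```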
Further, the method also includes: determining a first classification number according to the target retrieval document; determining a second classification number according to the first document; judging whether the first classification number and the second classification number are approximate classification numbers; and, when the first classification number and the second classification number are not approximate classification numbers, deleting the first document from the first target database.
Further, judging whether the first classification number and the second classification number are approximate classification numbers comprises: determining, according to the first classification number, a first meaning of the section, class, subclass, main group and subgroup under which the target retrieval document is classified; determining, according to the second classification number, a second meaning of the section, class, subclass, main group and subgroup under which the first document is classified; judging whether the first meaning and the second meaning are semantically similar; and, when the first meaning and the second meaning are not semantically similar, determining that the first classification number and the second classification number are not approximate classification numbers.
Specifically, the technical field of the target retrieval document is first obtained from the target retrieval document, the technical tool dictionary is obtained according to the technical field, and the keyword range is obtained from the dictionary. It is then judged whether the first keyword is within the keyword range. When it is, the first keyword is entered on a patent search website to obtain the first target database, which collects a large number of patent documents containing the first keyword. After the first target database has been obtained, one of the patent documents in it that contain the first keyword is picked arbitrarily as the first document. At the same time, the first classification number is determined from the technical field determined for the target retrieval document, and the selected first document is opened to determine its second classification number. The first classification number and the second classification number are then compared to determine whether they are approximate classification numbers. When they are determined not to be approximate classification numbers, the first document is not semantically close to the target retrieval document, in other words its content is unrelated to that of the target retrieval document, and the first document is deleted from the first target database.
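The classification-number comparison can be sketched as below. The specification judges approximation by the semantic similarity of the meanings attached to each classification level; this example replaces that judgement with a much cruder assumption, namely that two numbers are approximate when they share the same section, class and subclass. The regular expression, function names and sample symbols are illustrative only.

```python
import re

# Matches symbols such as "G06F 16/33":
# section, class, subclass, main group, subgroup.
_CLASSIFICATION = re.compile(r"^([A-H])(\d{2})([A-Z])\s*(\d{1,4})/(\d{2,6})$")


def parse_classification(symbol):
    """Split a classification number into its five hierarchical levels."""
    match = _CLASSIFICATION.match(symbol.strip())
    if match is None:
        raise ValueError(f"not a recognisable classification number: {symbol!r}")
    return match.groups()


def is_approximate(first_number, second_number, levels=3):
    """Assumed rule: approximate when the first `levels` levels agree
    (section, class and subclass by default)."""
    return (
        parse_classification(first_number)[:levels]
        == parse_classification(second_number)[:levels]
    )


def denoise(first_number, database):
    """Drop documents whose classification number is not approximate to
    the one determined for the target retrieval document."""
    return {
        doc_id: number
        for doc_id, number in database.items()
        if is_approximate(first_number, number)
    }


database = {"doc1": "G06F 17/30", "doc2": "A61K 9/00"}  # hypothetical entries
cleaned = denoise("G06F 16/33", database)
```

Raising `levels` to 4 would require the main group to match as well, giving a stricter filter.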
Further, the method also includes: determining a first classification number according to the first document; determining, according to the first classification number, a first meaning of the section, class, subclass, main group and subgroup under which the first document is classified; and performing semantic analysis on the first meaning and the target retrieval document, wherein, when the first meaning is not semantically close to the target retrieval document, the first document is deleted from the first target database.
Specifically, the meanings of the section, class, subclass, main group and subgroup in the classification number of the first document are determined first and used to judge whether the first document is semantically the same as the target retrieval document, thereby denoising the first document.
Further, the method comprises: ranking the classification numbers in the first target database by number of patent documents; obtaining the first classification number, which has the fewest patent documents among the classification numbers; obtaining a first document from the patent documents of the first classification number; judging a first similarity between the first document and the target patent document; and, when the first similarity is less than a predetermined threshold, deleting the patent documents under the first classification number from the first target database.
Specifically, the target patent document is the patent document the user wants to retrieve, and the first target database is the database containing the target patent document. The Q classification numbers of the patent documents in the first target database are determined, where Q is a positive integer, and all patent documents in the first target database are sorted by these Q classification numbers to obtain the number of patent documents under each. The Q classification numbers are ranked in ascending order of patent document count, which yields the first classification number: the one of the Q classification numbers with the fewest patent documents. A first document is retrieved from the patent documents of the first classification number, and the first similarity between the first document and the target patent document is determined by semantically analysing the titles and descriptions of the two documents. When the first similarity is less than the predetermined threshold, the patent documents under the first classification number are deleted from the first target database.
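The pruning of the least populated classification number can be sketched as follows. The database is modelled as a mapping from document identifier to classification number, and the similarity function is supplied by the caller; both choices, together with the names and sample values, are assumptions of this example.

```python
from collections import defaultdict


def prune_smallest_class(database, similarity_to_target, threshold):
    """Remove every document under the least populated classification number
    when a sample document from it is not similar enough to the target.

    `database` maps document id -> classification number.
    `similarity_to_target` is a callable returning a float for a document id;
    how that similarity is computed is left open here.
    """
    by_class = defaultdict(list)
    for doc_id, number in database.items():
        by_class[number].append(doc_id)
    if not by_class:
        return {}

    # Classification number with the fewest patent documents.
    smallest = min(by_class, key=lambda number: len(by_class[number]))
    sample = by_class[smallest][0]

    if similarity_to_target(sample) < threshold:
        return {d: n for d, n in database.items() if n != smallest}
    return dict(database)


database = {"d1": "G06F", "d2": "G06F", "d3": "A61K"}   # hypothetical contents
similarities = {"d1": 0.9, "d2": 0.8, "d3": 0.1}        # hypothetical scores
pruned = prune_smallest_class(database, similarities.get, threshold=0.3)
```

Checking a single sample document keeps the step cheap, at the cost of discarding a whole class on the evidence of one comparison.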
Further, the method comprises: obtaining, according to the first document, the number of claims, the word count of the claims and the word count of the description of the first document; obtaining a first weighted value, a second weighted value and a third weighted value of the first document according to the number of claims, the word count of the claims and the word count of the description, and determining a first value assessment score of the first document; judging whether the first value assessment score is greater than a first predetermined threshold; and, when the first value assessment score is greater than the first predetermined threshold, sending prompt information to the first target patent database, wherein the prompt information is the first document.
Specifically, the number of claims of the patent document and the word counts of its claims and description are obtained automatically by retrieval. The first weighted value of the target patent is determined from its number of claims: first weighted value = number of claims of the target patent × its share of the score. The second weighted value is determined from the word count of the claims: second weighted value = word count of the claims of the target patent × its share of the score. The third weighted value is determined from the word count of the description: third weighted value = word count of the description of the target patent × its share of the score. The first value assessment score of the target patent is obtained from the first, second and third weighted values. A predetermined threshold is set; when the first value assessment score of the target patent is greater than the predetermined threshold, the patent document is determined to be a qualifying document and is sent to the first target patent database. In addition, a second keyword is obtained from the user's search history on the patent search platform, and information on patents that relate to the second keyword and have a high first value assessment score is pushed to the user. The pushed information includes the patentee, the abstract, patent licensing information, litigation information and the like.
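The three weighted values and the resulting score can be sketched as below. The text gives each weighted value as a count multiplied by its share of the score but does not state how the three are combined or what the shares are; this example assumes a plain sum and uses made-up shares and a made-up threshold.

```python
def first_value_assessment_score(claim_count, claim_words, description_words,
                                 shares=(1.0, 0.01, 0.001)):
    """Sum of three weighted values, each a count times its score share.

    The shares and the summation are assumptions of this sketch; the
    specification only says the score is obtained from the three values.
    """
    first_weighted = claim_count * shares[0]
    second_weighted = claim_words * shares[1]
    third_weighted = description_words * shares[2]
    return first_weighted + second_weighted + third_weighted


def is_qualifying(score, predetermined_threshold):
    """A document qualifies when its score exceeds the threshold."""
    return score > predetermined_threshold


score = first_value_assessment_score(claim_count=10, claim_words=1000,
                                     description_words=5000)
qualifies = is_qualifying(score, predetermined_threshold=20)
```

Because the raw counts differ by orders of magnitude, the shares also act as scaling factors; normalising each count first would be an alternative design.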
Embodiment 2
Based on the same inventive concept as the method for automatically obtaining high-value patents in the foregoing embodiment, the present invention also provides a device for automatically obtaining high-value patents. As shown in Fig. 2, the device includes:
A first obtaining unit, which obtains a target document;
A second obtaining unit, which obtains a first keyword according to the target document;
A third obtaining unit, which obtains a first target patent database according to the first keyword;
A fourth obtaining unit, which obtains a first document from the first target patent database;
A fifth obtaining unit, which obtains patentee information according to the first document, wherein the nature of the patentee is judged from the patentee information;
A first processing unit, which sends prompt information to the first target patent database when the patentee information meets a first predetermined condition, wherein the prompt information is the first document.
Further, the fourth obtaining unit includes:
A first determination unit, which determines a first value assessment score of a document according to the first target patent database;
A first judging unit, which judges whether the first value assessment score is greater than a first predetermined threshold;
A sixth obtaining unit, which obtains the first document if the first value assessment score is greater than the first predetermined threshold.
Further, the fifth obtaining unit also includes:
A second judging unit, which judges, according to a first search platform, whether the patentee has a transfer history;
A seventh obtaining unit, which obtains a first value score of the first document when the transfer history of the patentee is greater than a second predetermined threshold.
Further, the fifth obtaining unit also includes:
An eighth obtaining unit, which obtains, through the first search platform, the number of times the first document has been cited;
A ninth obtaining unit, which obtains a second value score of the first document according to the number of times the first document has been cited.
Further, the first processing unit also includes:
A tenth obtaining unit, which obtains a first weighted value of the first document according to the transfer history of the patentee;
An eleventh obtaining unit, which obtains a second weighted value of the first document according to the number of times the first document has been cited;
A twelfth obtaining unit, which obtains a second value assessment score of the first document according to the first weighted value, the first value score, the second weighted value and the second value score;
A third judging unit, which judges whether the second value assessment score meets the first predetermined condition;
A second processing unit, which sends prompt information to the first target patent database when the second value assessment score meets the first predetermined condition.
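How the twelfth obtaining unit, the third judging unit and the second processing unit might fit together can be sketched as below. The specification names the four inputs but not the formula, so this example assumes a weighted sum, and it models the first predetermined condition as a minimum score; the names, weights, scores and threshold are all hypothetical.

```python
def second_value_assessment_score(first_weight, first_value_score,
                                  second_weight, second_value_score):
    """Assumed combination: each value score multiplied by its weighted value,
    then summed. The first pair relates to the patentee's transfer history,
    the second to the number of citations."""
    return first_weight * first_value_score + second_weight * second_value_score


def should_send_prompt(score, minimum_score):
    """Stand-in for the first predetermined condition: score at or above a minimum."""
    return score >= minimum_score


score = second_value_assessment_score(
    first_weight=0.6, first_value_score=80,    # transfer-history component
    second_weight=0.4, second_value_score=50,  # citation component
)
send = should_send_prompt(score, minimum_score=60)
```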
Further, the device also includes:
A thirteenth obtaining unit, which obtains the number of claims and the word count of the claims of the first document according to the first target patent database;
A fourteenth obtaining unit, which obtains a third weighted value of the first document according to the number of claims of the first document;
A fifteenth obtaining unit, which obtains a fourth weighted value of the first document according to the word count of the claims of the first document;
A second determination unit, which determines a third value assessment score of the first document according to the third weighted value and the fourth weighted value.
The variations and specific examples of the method for automatically obtaining high-value patents in Embodiment 1 (Fig. 1) apply equally to the device of this embodiment. From the foregoing detailed description of the method, those skilled in the art can clearly understand how the device of this embodiment is implemented, so for brevity of the specification the details are not repeated here.
Embodiment 3
Based on the same inventive concept as the method for automatically obtaining high-value patents in the foregoing embodiment, the present invention also provides another device for automatically obtaining high-value patents, on which a computer program is stored. When the program is executed by a processor, it implements the steps of any of the methods for automatically obtaining high-value patents described above.
In Fig. 3, a bus architecture is represented by bus 300. Bus 300 may include any number of interconnected buses and bridges, and links together various circuits including one or more processors, represented by processor 302, and memory, represented by memory 304. Bus 300 may also link various other circuits such as peripheral devices, voltage regulators and power management circuits. These are well known in the art and are therefore not described further here. Bus interface 306 provides an interface between bus 300 and receiver 301 and transmitter 303. Receiver 301 and transmitter 303 may be the same element, namely a transceiver, which provides a unit for communicating with various other devices over a transmission medium.
Processor 302 is responsible for managing bus 300 and for general processing, and memory 304 may be used to store data used by processor 302 when performing operations.
The one or more technical solutions in the embodiments of the present application have at least one or more of the following technical effects:
1. The embodiments of the present application provide a method and a device for automatically obtaining high-value patents. The method comprises: obtaining a target document; obtaining a first keyword according to the target document; obtaining a first target patent database according to the first keyword; obtaining a first document from the first target patent database; obtaining patentee information according to the first document, wherein the nature of the patentee is judged from the patentee information; and, when the patentee information meets a first predetermined condition, sending prompt information to the first target patent database, wherein the prompt information is the first document. The invention solves the technical problems of the prior art that, when high-value patents are retrieved, the search range tends to be too broad, the denoising workload is heavy, the retrieval of high-value patents is inaccurate, or results are missed because keywords are not expanded or generalised during retrieval. It achieves the technical effects of convenient operation, high retrieval efficiency, fast speed and accurate search results.
2. In the embodiments of the present application, obtaining patentee information according to the first document comprises: judging, according to a first search platform, whether the patentee has a transfer history; and, when the transfer history of the patentee is greater than a second predetermined threshold, obtaining a first value score of the first document. This further achieves the technical effect of automatically retrieving patent transfer information and accurately judging patent value.
3. In the embodiments of the present application, sending prompt information to the first target patent database when the patentee information meets the first predetermined condition comprises: obtaining a first weighted value of the first document according to the transfer history of the patentee; obtaining a second weighted value of the first document according to the number of times the first document has been cited; obtaining a second value assessment score of the first document according to the first weighted value, the first value score, the second weighted value and the second value score; judging whether the second value assessment score meets the first predetermined condition; and, when the second value assessment score meets the first predetermined condition, sending prompt information to the first target patent database. This further achieves the technical effect of denoising effectively and judging patent value comprehensively, thereby improving retrieval efficiency and precision.
Those skilled in the art should understand that the embodiments of the present invention may be provided as a method, a system or a computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Moreover, the present invention may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to magnetic disk storage, CD-ROM, optical storage and the like) that contain computer-usable program code.
The present invention is described with reference to flowcharts and/or block diagrams of the method, device (system) and computer program product according to the embodiments of the invention. It should be understood that each flow and/or block in the flowcharts and/or block diagrams, and combinations of flows and/or blocks in the flowcharts and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general-purpose computer, a special-purpose computer, an embedded processor or another programmable data processing device to produce a machine, such that the instructions executed by the processor of the computer or other programmable data processing device produce means for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing device to operate in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means that implement the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be loaded onto a computer or other programmable data processing device, so that a series of operational steps is performed on the computer or other programmable device to produce computer-implemented processing, whereby the instructions executed on the computer or other programmable device provide steps for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
Obviously, those skilled in the art can make various changes and modifications to the present invention without departing from its spirit and scope. If these modifications and variations fall within the scope of the claims of the present invention and their technical equivalents, the present invention is intended to include them as well.

Claims (8)

1. A method for automatically obtaining high-value patents, characterized in that the method comprises:
obtaining a target document;
obtaining a first keyword according to the target document;
obtaining a first target patent database according to the first keyword;
obtaining a first document from the first target patent database;
obtaining patentee information according to the first document, wherein the nature of the patentee is judged from the patentee information;
when the patentee information meets a first predetermined condition, sending prompt information to the first target patent database, wherein the prompt information is the first document.
2. The method according to claim 1, characterized in that obtaining the first document from the first target patent database comprises:
determining a first value assessment score of a document according to the first target patent database;
judging whether the first value assessment score is greater than a first predetermined threshold;
if the first value assessment score is greater than the first predetermined threshold, obtaining the first document.
3. The method according to claim 1, characterized in that obtaining patentee information according to the first document comprises:
judging, according to a first search platform, whether the patentee has a transfer history;
when the transfer history of the patentee is greater than a second predetermined threshold, obtaining a first value score of the first document.
4. The method according to claim 3, characterized in that obtaining patentee information according to the first document further comprises:
obtaining, through the first search platform, the number of times the first document has been cited;
obtaining a second value score of the first document according to the number of times the first document has been cited.
5. The method according to claim 4, characterized in that sending prompt information to the first target patent database when the patentee information meets the first predetermined condition comprises:
obtaining a first weighted value of the first document according to the transfer history of the patentee;
obtaining a second weighted value of the first document according to the number of times the first document has been cited;
obtaining a second value assessment score of the first document according to the first weighted value, the first value score, the second weighted value and the second value score;
judging whether the second value assessment score meets the first predetermined condition;
when the second value assessment score meets the first predetermined condition, sending prompt information to the first target patent database.
6. The method according to claim 1, characterized in that the method further comprises:
obtaining the number of claims and the word count of the claims of the first document according to the first target patent database;
obtaining a third weighted value of the first document according to the number of claims of the first document;
obtaining a fourth weighted value of the first document according to the word count of the claims of the first document;
determining a third value assessment score of the first document according to the third weighted value and the fourth weighted value.
7. A device for automatically obtaining high-value patents, characterized in that the device comprises:
a first obtaining unit, which obtains a target document;
a second obtaining unit, which obtains a first keyword according to the target document;
a third obtaining unit, which obtains a first target patent database according to the first keyword;
a fourth obtaining unit, which obtains a first document from the first target patent database;
a fifth obtaining unit, which obtains patentee information according to the first document, wherein the nature of the patentee is judged from the patentee information;
a first processing unit, which sends prompt information to the first target patent database when the patentee information meets a first predetermined condition, wherein the prompt information is the first document.
8. A device for automatically obtaining high-value patents, comprising a memory, a processor, and a computer program stored on the memory and executable on the processor, characterized in that the processor performs the following steps when executing the program:
Obtaining a target document;
Obtaining a first keyword according to the target document;
Obtaining a first target patent database according to the first keyword;
Obtaining a first document from the first target patent database;
Obtaining patentee information according to the first document, wherein the nature of the patentee is judged from the patentee information;
When the patentee information meets a first predetermined condition, sending prompt information to the first target patent database, wherein the prompt information is the first document.
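The steps recited in claims 7 and 8 can be sketched end to end in Python. This is only an illustration under stated assumptions: the keyword extractor (most frequent word of four or more letters), the in-memory list standing in for a patent database, the `patentee_type` field, and the rule that only enterprise patentees satisfy the first predetermined condition are all hypothetical and are not taken from the patent.

```python
from collections import Counter
from dataclasses import dataclass
import re


@dataclass
class PatentRecord:
    """Hypothetical record; field names are illustrative only."""
    number: str
    text: str
    patentee_type: str  # e.g. "enterprise", "university", "individual"


def first_keyword(target_text: str) -> str:
    """Pick the most frequent word of 4+ letters (a stand-in extractor).

    Raises IndexError if the text contains no such word.
    """
    words = re.findall(r"[a-z]{4,}", target_text.lower())
    return Counter(words).most_common(1)[0][0]


def build_target_database(corpus, keyword):
    """Keep the records whose text mentions the keyword."""
    return [r for r in corpus if keyword in r.text.lower()]


def patentee_meets_condition(record, accepted=("enterprise",)):
    """Assumed condition: the patentee is of an accepted type."""
    return record.patentee_type in accepted


def find_prompts(target_text, corpus):
    """Run the claimed steps in order and return the numbers to prompt."""
    keyword = first_keyword(target_text)
    database = build_target_database(corpus, keyword)
    return [r.number for r in database if patentee_meets_condition(r)]


# Invented sample data for the sketch.
corpus = [
    PatentRecord("P-001", "A battery electrode coating", "enterprise"),
    PatentRecord("P-002", "Battery pack cooling plate", "individual"),
    PatentRecord("P-003", "A bicycle frame", "enterprise"),
]
prompts = find_prompts("battery cell with improved battery separator", corpus)
```

Each function corresponds loosely to one obtaining unit of claim 7, which keeps the stages independently replaceable, for example by swapping the keyword extractor for a real search platform query.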
CN201811085899.6A 2018-09-18 2018-09-18 A kind of high value patent automatically obtains method and apparatus Withdrawn CN109325101A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811085899.6A CN109325101A (en) 2018-09-18 2018-09-18 A kind of high value patent automatically obtains method and apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811085899.6A CN109325101A (en) 2018-09-18 2018-09-18 A kind of high value patent automatically obtains method and apparatus

Publications (1)

Publication Number Publication Date
CN109325101A (en) 2019-02-12

Family

ID=65265541

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811085899.6A Withdrawn CN109325101A (en) 2018-09-18 2018-09-18 A kind of high value patent automatically obtains method and apparatus

Country Status (1)

Country Link
CN (1) CN109325101A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114780671A (en) * 2022-03-14 2022-07-22 珠海横琴濠麦科技有限公司 Display method of patent citation relation, computer device and computer readable storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106372225A (en) * 2016-09-07 2017-02-01 知识产权出版社有限责任公司 Information processing device and method based on high-value comparison base
CN108022189A (en) * 2016-11-03 2018-05-11 西安科技大市场创新云服务股份有限公司 A kind of patent of invention value calculation method and apparatus
CN108022031A (en) * 2016-11-03 2018-05-11 西安科技大市场创新云服务股份有限公司 A kind of patent value computational methods and device

Similar Documents

Publication Publication Date Title
Lahitani et al. Cosine similarity to determine similarity measure: Study case in online essay assessment
CN110704621B (en) Text processing method and device, storage medium and electronic equipment
CN111797214A (en) FAQ database-based problem screening method and device, computer equipment and medium
CN113761218B (en) Method, device, equipment and storage medium for entity linking
CN106446071B (en) Information processing apparatus and method
WO2021218322A1 (en) Paragraph search method and apparatus, and electronic device and storage medium
CN105302793A (en) Method for automatically evaluating scientific and technical literature novelty by utilizing computer
KR20180072167A (en) System for extracting similar patents and method thereof
CN107247743A (en) A kind of judicial class case search method and system
CN106227756A (en) A kind of stock index forecasting method based on emotional semantic classification and system
CN114238577B (en) Multi-task learning emotion classification method integrating multi-head attention mechanism
Upendran et al. Application of predictive analytics in intelligent course recommendation
CN109344400A (en) A kind of judgment method and device of document storage
US20170154294A1 (en) Performance evaluation device, control method for performance evaluation device, and control program for performance evaluation device
KR101745874B1 (en) System and method for a learning course automatic generation
CN109189893A (en) A kind of method and apparatus of automatically retrieval
CN109189955A (en) A kind of determination method and apparatus of automatically retrieval keyword
CN109325099A (en) A kind of method and apparatus of automatically retrieval
CN109325101A (en) A kind of high value patent automatically obtains method and apparatus
WO2016009553A1 (en) Intellectual property evaluation system, intellectual property evaluation system control method, and intellectual property evaluation program
Syafrullah et al. Improving term extraction using particle swarm optimization techniques
CN109325100A (en) A kind of high value patent automatically obtains method and apparatus
Gao et al. Text categorization based on improved Rocchio algorithm
CN109284360A (en) A kind of automatic denoising method of patent retrieval and device
CN103279549A (en) Method and device for acquiring target data of target objects

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication

Application publication date: 20190212