CN110232071A - Search method, device and storage medium, the electronic device of drug data - Google Patents

Search method, device and storage medium, the electronic device of drug data Download PDF

Info

Publication number
CN110232071A
CN110232071A CN201910345736.5A CN201910345736A CN110232071A CN 110232071 A CN110232071 A CN 110232071A CN 201910345736 A CN201910345736 A CN 201910345736A CN 110232071 A CN110232071 A CN 110232071A
Authority
CN
China
Prior art keywords
drug
entry
search key
attribute information
drug entry
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910345736.5A
Other languages
Chinese (zh)
Inventor
吴壮伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ping An Technology Shenzhen Co Ltd
Original Assignee
Ping An Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Technology Shenzhen Co Ltd filed Critical Ping An Technology Shenzhen Co Ltd
Priority to CN201910345736.5A priority Critical patent/CN110232071A/en
Publication of CN110232071A publication Critical patent/CN110232071A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462Approximate or statistical queries

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Fuzzy Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention provides a kind of search method of drug data, device and storage mediums, electronic device, wherein the search method of drug data includes: the medicine types label for obtaining input;According to medicine types label, the drug item list of medicine types tag match is retrieved in drug data bank;Obtain the search key of input;Search key is split, multiple search keys are obtained;For the attribute information of each drug entry in drug item list, gap character number that the frequency of occurrence and the adjacent keyword of every two for counting each search key occur in attribute information;It is determined according to statistical result and recommends drug entry.Through the invention, search result inaccuracy when solving the retrieval in the prior art for drug data.

Description

Search method, device and storage medium, the electronic device of drug data
Technical field
The present invention relates to searching fields, are situated between in particular to a kind of search method of drug data, device and storage Matter, electronic device.
Background technique
Medical drugs are many kinds of, to each attributes of each medical drugs (for example, effect, indication, taboo crowd, Adverse reaction etc.) word content that is introduced is also more, it is examined in the database of medical drugs based on keyword The time cost of rope is higher, also, due to the specific properties of medical domain, it is not exactly the same to be frequently present of the identical vocabulary of meaning The case where, if still scanned for using keyword, it will lead to the case where can not searching related drug.
For the above problem present in the relevant technologies, at present it is not yet found that the solution of effect.
Summary of the invention
The embodiment of the invention provides a kind of search method of drug data, device and storage mediums, electronic device, so that Search result inaccuracy when the retrieval in the prior art for drug data is solved the problems, such as less.
According to one embodiment of present invention, a kind of search method of drug data is provided, this method comprises: obtaining defeated The medicine types label entered;According to medicine types label, the drug item of medicine types tag match is retrieved in drug data bank Mesh list;Obtain the search key of input;Search key is split, multiple search keys are obtained;For drug entry column The attribute information of each drug entry in table counts the frequency of occurrence and the adjacent key of every two of each search key The gap character number that word occurs in attribute information;It is determined according to statistical result and recommends drug entry.
Further, it is determined according to statistical result and recommends drug entry, comprising: according to each search key of statistics Frequency of occurrence determines the total degree n that multiple search keys occur in the attribute information of each drug entry;According to statistics The gap character number that the adjacent keyword of every two occurs in attribute information determines in the attribute information of each drug entry The sum of the gap character number that the adjacent search key of any two occurs in multiple search keys t;Based on numerical value n and numerical value T determines recommendation, wherein in the case where numerical value n is bigger, in the case that recommendation is bigger, and numerical value t is smaller, recommendation is got over Greatly;The higher preceding m drug entry of recommendation is obtained, obtains recommending drug entry.
Further, for the attribute information of each drug entry in drug item list, the adjacent pass of every two is counted The gap character number that key word occurs in attribute information, comprising: determine the adjacent retrieval of any two in multiple search keys Crucial combinatorics on words;For two adjacent search key a and b in each combination, in the attribute information of each drug entry Sequentially there is the character pitch number between a and b every time in middle retrieval.
Further, the character pitch sequentially occurred between a and b every time is retrieved in the attribute information of each drug entry Number, comprising: word for word retrieve a from the front to the back in the attribute information of each drug entry;After being matched to a, word for word examine backward Rope a and b, and present interval number of characters is counted;If a is matched to again when word for word retrieving a and b backward, to current Gap character number counts again;If being matched to b when word for word retrieving a and b backward, present interval number of characters is recorded;From current Character rises, and returns to the step of word for word retrieving a from the front to the back, until the attribute information retrieval to drug entry finishes.
Further, adjacent in statistics every two for the attribute information of each drug entry in drug item list Before the gap character number that keyword occurs in attribute information, the frequency of occurrence and record of each search key are counted, In determining multiple search keys after the combination of the adjacent search key of any two, this method further include: for each Drug entry, according to the record of the frequency of occurrence of each search key as a result, determining in the attribute information of corresponding drug entry The search key not occurred;Among the combination of the adjacent search key of any two, deletes the retrieval for including do not occur and close Key combinatorics on words.
Further, according to medicine types label, the drug of medicine types tag match is retrieved in drug data bank Before item list, this method further include: word segmentation processing is executed to the attribute information of each drug entry in drug data bank, Obtain the corresponding multiple participles of each drug entry;For each drug entry, multiple keys are extracted in the multiple participles of correspondence Word;Respectively by multiple keywords input topic model that training obtains in advance of each drug entry, the defeated of topic model is obtained Result out, wherein topic model is used for the theme label according to the output prediction of multiple words of input;By topic model for each Medicine types label of the theme label of multiple keywords output of drug entry as corresponding drug entry.
Further, using topic model for each drug entry multiple keywords export theme label as pair After the medicine types label for answering drug entry, this method further include: newly-increased to each drug entry in drug data bank to belong to Property, wherein the value of the newly-increased attribute of each drug entry is the medicine types label of corresponding drug entry;Based on newly-increased attribute Index is established to the drug entry in drug data bank.
According to another embodiment of the invention, a kind of retrieval device of drug data is provided, which includes: first Module is obtained, for obtaining the medicine types label of input;Retrieval module is used for according to medicine types label, in drug data The drug item list of medicine types tag match is retrieved in library;Second obtains module, for obtaining the search key of input; It splits module and obtains multiple search keys for splitting search key;Statistical module, for being directed to drug item list In each drug entry attribute information, count the frequency of occurrence and the adjacent keyword of every two of each search key The gap character number occurred in attribute information;Determining module recommends drug entry for determining according to statistical result.
Further, it is determined that module includes: the first determination unit, for the appearance according to each search key of statistics Number determines the total degree n that multiple search keys occur in the attribute information of each drug entry;Second determination unit, For the gap character number that the adjacent keyword of every two according to statistics occurs in attribute information, determine in each drug item The sum of the gap character number that the adjacent search key of any two occurs in multiple search keys in purpose attribute information t; Third determination unit, for determining recommendation based on numerical value n and numerical value t, wherein in the case where numerical value n is bigger, recommendation is got over Greatly, and in the case that numerical value t is smaller, recommendation is bigger;Acquiring unit, for obtaining the higher preceding m drug entry of recommendation, It obtains recommending drug entry.
Further, statistical module includes: the 4th determination unit, for determining any two phase in multiple search keys The combination of adjacent search key;First retrieval unit, two adjacent search key a and b for being directed in each combination, The character pitch number sequentially occurred between a and b every time is retrieved in the attribute information of each drug entry.
Further, retrieval unit includes: the second retrieval unit, in the attribute information of each drug entry by preceding A is word for word retrieved backward;First execution unit, for word for word retrieving a and b backward, and to present interval word after being matched to a Symbol number is counted;Second execution unit, if for being matched to a again when word for word retrieving a and b backward, to present interval Number of characters counts again;Third execution unit, if recording present interval for being matched to b when word for word retrieving a and b backward Number of characters;4th execution unit, for from current character, returning to the step of word for word retrieving a from the front to the back, until to drug item The retrieval of purpose attribute information finishes.
Further, statistical module is used to unite for the attribute information of each drug entry in drug item list Before the gap character number that the adjacent keyword of meter every two occurs in attribute information, the appearance of each search key is counted Number simultaneously records, the device further include: the 5th determination unit, for the adjacent inspection of any two in determining multiple search keys After Suo Guanjian combinatorics on words, for each drug entry, according to the record of the frequency of occurrence of each search key as a result, really Surely the search key not occurred in the attribute information of drug entry is corresponded to;Unit is deleted, in the adjacent retrieval of any two Among crucial combinatorics on words, the combination of the search key including not occurring is deleted.
Further, the device further include: word segmentation module, for according to medicine types label, in drug data bank Before the drug item list for retrieving medicine types tag match, to the attribute information of each drug entry in drug data bank Word segmentation processing is executed, the corresponding multiple participles of each drug entry are obtained;Extraction module, for being directed to each drug entry, Multiple keywords are extracted in corresponding multiple participles;Execution module, for respectively inputting multiple keywords of each drug entry The topic model that training obtains in advance, obtains the output result of topic model, wherein topic model is used for according to the multiple of input The theme label of word output prediction;Logic module is exported for multiple keywords by topic model for each drug entry Medicine types label of the theme label as corresponding drug entry.
Further, the device further include: newly-increased module, for topic model to be directed to the multiple of each drug entry After medicine types label of the theme label of keyword output as corresponding drug entry, to each medicine in drug data bank Product entry increases attribute newly, wherein the value of the newly-increased attribute of each drug entry is the medicine types label of corresponding drug entry;It builds Formwork erection block, for establishing index to the drug entry in drug data bank based on newly-increased attribute.
According to still another embodiment of the invention, a kind of storage medium is additionally provided, meter is stored in the storage medium Calculation machine program, wherein the computer program is arranged to execute the step in any of the above-described embodiment of the method when operation.
According to still another embodiment of the invention, a kind of electronic device, including memory and processor are additionally provided, it is described Computer program is stored in memory, the processor is arranged to run the computer program to execute any of the above-described Step in embodiment of the method.
Through the invention, by obtaining the medicine types label inputted;According to medicine types label, in drug data bank Retrieve the drug item list of medicine types tag match;Obtain the search key of input;Search key is split, is obtained more A search key;For the attribute information of each drug entry in drug item list, each search key is counted The gap character number that frequency of occurrence and the adjacent keyword of every two occur in attribute information;It is determined according to statistical result Recommend drug entry, search result inaccuracy when solving the retrieval in the prior art for drug data, by obtaining simultaneously The retrieval for taking medicine types label and search key to obtain user with carrying out various dimensions is intended to, and for search key, performs The processing method for splitting search key, is accurate to the frequency of occurrence of each search key and two neighboring search key, The case where user search keyword and the retrieval of medical drugs term difference are less than hitting target is prevented, has been reached more accurately For the technical effect of user search purpose feedback searching result.
Detailed description of the invention
The drawings described herein are used to provide a further understanding of the present invention, constitutes part of this application, this hair Bright illustrative embodiments and their description are used to explain the present invention, and are not constituted improper limitations of the present invention.In the accompanying drawings:
Fig. 1 is the flow chart of the search method of drug data according to an embodiment of the present invention;
Fig. 2 is the schematic diagram of the retrieval device of drug data according to an embodiment of the present invention.
Specific embodiment
In order to make those skilled in the art more fully understand application scheme, below in conjunction in the embodiment of the present application Attached drawing, the technical scheme in the embodiment of the application is clearly and completely described, it is clear that described embodiment is only The embodiment of the application a part, instead of all the embodiments, in the absence of conflict, embodiment and reality in the application The feature applied in example can be combined with each other.Based on the embodiment in the application, those of ordinary skill in the art are not making wound Every other embodiment obtained under the premise of the property made labour, shall fall within the protection scope of the present application.
It should be noted that the description and claims of this application and term " first " in above-mentioned attached drawing, " Two " etc. be to be used to distinguish similar objects, without being used to describe a particular order or precedence order.It should be understood that using in this way Data be interchangeable under appropriate circumstances, so as to embodiments herein described herein can in addition to illustrating herein or Sequence other than those of description is implemented.In addition, term " includes " and " having " and their any deformation, it is intended that cover Cover it is non-exclusive include, for example, the process, method, system, product or equipment for containing a series of steps or units are not necessarily limited to Step or unit those of is clearly listed, but may include be not clearly listed or for these process, methods, product Or other step or units that equipment is intrinsic.
Embodiment 1
The search method for present embodiments providing a kind of drug data, can be applied to client-side, for example, can run Among the middle terminal device such as similar arithmetic facility of PC, work station, mobile terminal, specifically, client-side can be with It is the function that retrieval is realized by browser or client application.Operating in different arithmetic facilities only is scheme in executing subject On difference, those skilled in the art are contemplated that in nonidentity operation equipment, operation can generate identical technical effect.
The search method of drug data provided in this embodiment, by obtaining medicine types label and search key simultaneously Obtain user retrieval with carrying out various dimensions is intended to, and for search key, performs the processing method for splitting search key, essence The frequency of occurrence for really arriving each search key and two neighboring search key prevents user search keyword and medicine The case where retrieval of drug term difference is less than hitting target has reached more accurately for user search purpose feedback searching result Technical effect.
As shown in Figure 1, the search method of drug data provided in this embodiment includes the following steps:
Step 101, the medicine types label of input is obtained;
Step 102, according to medicine types label, the drug entry of medicine types tag match is retrieved in drug data bank List;
Step 103, the search key of input is obtained;
Step 104, search key is split, multiple search keys are obtained;
Step 105, for the attribute information of each drug entry in drug item list, each search key is counted Frequency of occurrence and the gap character number that occurs in attribute information of the adjacent keyword of every two;
Step 106, it is determined according to statistical result and recommends drug entry.
Drug data bank in the present embodiment is relevant database, for storing multiple drug entries, each drug item Mesh includes multiple attributes, and the attribute value of each attribute is the information of corresponding attribute, for example, for drug A, the category of " effect " attribute Property value be " treatment xxx class disease ", the attribute value of " adverse reaction " attribute is " occasionally having vomiting, dizziness ".It can in drug data bank Multiple drug entries are saved in the form of through table, every data line corresponds to a drug entry, and each column data is corresponding In an attribute.
Drug entry in drug data bank can be indexed by medicine types label.Medicine types label can be One attribute is indexed as inside, can also additionally set up an external index.Based on the index of medicine types label, if The execution side of one or more medicine types labels of user input selection, the present embodiment can be according to the subject classification mark of input Label, retrieve the multiple drug entries (drug item list) being matched in drug data bank.
In addition to needing user to input medicine types label, it is also necessary to which user inputs search key.In order to increase search Keyword is split as multiple keywords, counts drug entry by range, the search key non-medical term for preventing user from inputting There is the number of each keyword in each drug entry in list and the adjacent keyword of every two occurs in drug entry Gap character number, thus based on statistical result determine recommend drug entry.
Recommend drug entry specifically, determining according to statistical result, may include:
Step 11, it according to the frequency of occurrence of each search key of statistics, determines and believes in the attribute of each drug entry The total degree n that multiple search keys occur in breath;
Step 12, the gap character number occurred in attribute information according to the adjacent keyword of the every two of statistics determines The interval word that the adjacent search key of any two occurs in multiple search keys in the attribute information of each drug entry Accord with the sum of number t;
Step 13, recommendation is determined based on numerical value n and numerical value t, wherein in the case where numerical value n is bigger, recommendation is got over Greatly, and in the case that numerical value t is smaller, recommendation is bigger;
Step 14, the higher preceding m drug entry of recommendation is obtained, obtains recommending drug entry.
For example, being split as search key " stream ", " nose " and " tears ", such as if search key includes " rhinorrhea " There are sentence " stuffy nose with watery discharge " in fruit drug entry, then " flowing " the gap character number between " nose " is 1, " nose " and " tears " Between gap character number be 1, for sentence " stuffy nose with watery discharge ", the sum of gap character number is 2.
The function for calculating recommendation can be f (n/t), and recommendation is directly proportional to the size of numerical value n, the size with numerical value t It is inversely proportional, that is, recommendation is directly proportional to the total degree for all keywords occur, with two neighboring keyword in drug entry The sum of gap character number of appearance is inversely proportional, thus, recommendation can show that degree relevant to keyword, drug entry Recommendation is higher, illustrates that the keyword degree of correlation of drug entry and input is higher.
Optionally, in the attribute information for each drug entry in drug item list, the adjacent pass of statistics every two When the gap character number that key word occurs in attribute information, following steps can be used:
Step 21, the combination of the adjacent search key of any two in multiple search keys is determined;
For example, it is split as multiple search keys " stream ", " nose " and " tears " if search key is " rhinorrhea ", Obtain the combination of the adjacent search key of any two: " stream " and " nose " and " nose " and " tears ".
Step 22, for two adjacent search key a and b in each combination, believe in the attribute of each drug entry The character pitch number sequentially occurred between a and b every time is retrieved in breath, that is, being counted for each combination.
Specifically, retrieving the character pitch number sequentially occurred between a and b every time in the attribute information of each drug entry When, include the following steps:
Step 31, a is word for word retrieved from the front to the back in the attribute information of each drug entry;
Step 32, after being matched to a, a and b is word for word retrieved backward, and count to present interval number of characters;
Step 33, if being matched to a again when word for word retrieving a and b backward, present interval number of characters is counted again;
Step 34, if being matched to b when word for word retrieving a and b backward, present interval number of characters is recorded;
Step 35, from current character, the step of word for word retrieving a from the front to the back is returned to, until to the attribute of drug entry Information retrieval finishes.
Optionally, in step 105, it for the attribute information of each drug entry in drug item list, is counting Before the gap character number that the adjacent keyword of every two occurs in attribute information, executes and count going out for each search key Occurrence number simultaneously records.In turn, in determining multiple search keys after the combination of the adjacent search key of any two, for Each drug entry, according to the record of the frequency of occurrence of each search key as a result, determining that the attribute of corresponding drug entry is believed The search key not occurred in breath, among the combination of the adjacent search key of any two, deleting includes the inspection not occurred Suo Guanjian combinatorics on words.
For example, if statistics search key abcd in search key b frequency of occurrence be 0, without count a and b, The sum of combined gap character number of b and c.
Further, according to medicine types label, the drug of medicine types tag match is retrieved in drug data bank Before item list, this method further include: word segmentation processing is executed to the attribute information of each drug entry in drug data bank, Obtain the corresponding multiple participles of each drug entry;For each drug entry, multiple keys are extracted in the multiple participles of correspondence Word;Respectively by multiple keywords input topic model that training obtains in advance of each drug entry, the defeated of topic model is obtained Result out, wherein topic model is used for the theme label according to the output prediction of multiple words of input;By topic model for each Medicine types label of the theme label of multiple keywords output of drug entry as corresponding drug entry.
Optionally, in the theme label for multiple keywords output that topic model is directed to each drug entry as correspondence After the medicine types label of drug entry, in drug data bank newly attribute is increased to each drug entry, wherein each drug The value of the newly-increased attribute of entry is the medicine types label of corresponding drug entry, in turn, based on newly-increased attribute to drug data Drug entry in library establishes index.
It should be noted that step shown in the flowchart of the accompanying drawings can be in such as a group of computer-executable instructions It is executed in computer system, although also, logical order is shown in flow charts, and it in some cases, can be with not The sequence being same as herein executes shown or described step.
Through the above description of the embodiments, those skilled in the art can be understood that according to above-mentioned implementation The method of example can be realized by means of software and necessary general hardware platform, naturally it is also possible to by hardware, but it is very much In the case of the former be more preferably embodiment.Based on this understanding, technical solution of the present invention is substantially in other words to existing The part that technology contributes can be embodied in the form of software products, which is stored in a storage In medium (such as ROM/RAM, magnetic disk, CD), including some instructions are used so that a terminal device (can be mobile phone, calculate Machine, server or network equipment etc.) execute method described in each embodiment of the present invention.
Embodiment 2
A kind of retrieval device of drug data is additionally provided in the present embodiment, and the device is for realizing above-described embodiment 1 And its preferred embodiment, to the term or implementation not being described in detail in this present embodiment, reference can be made to mutually speaking on somebody's behalf in embodiment 1 Bright, the descriptions that have already been made will not be repeated.
Term " module " as used below, can be achieved on the combination of the software and/or hardware of predetermined function.Although Device described in following embodiment is preferably realized with software, but the combined realization of hardware or software and hardware And can be contemplated.
Fig. 2 is the schematic diagram of the retrieval device of drug data according to an embodiment of the present invention, as shown in Fig. 2, the device packet The first acquisition module 10 is included, retrieval module 20, second obtains module 30, splits module 40, statistical module 50 and determining module 60.
First acquisition module 10 is used to obtain the medicine types label of input;Retrieval module 20 is used for according to medicine types mark Label retrieve the drug item list of medicine types tag match in drug data bank;Second acquisition module 30 is defeated for obtaining The search key entered;Module 40 is split for splitting search key, obtains multiple search keys;Statistical module 50 is used for For the attribute information of each drug entry in drug item list, count each search key frequency of occurrence and The gap character number that the adjacent keyword of every two occurs in attribute information;Determining module 60 is used to be determined according to statistical result Recommend drug entry.
Optionally, determining module includes: the first determination unit, for going out occurrence according to each search key of statistics Number determines the total degree n that multiple search keys occur in the attribute information of each drug entry;Second determination unit is used In the gap character number that the adjacent keyword of every two according to statistics occurs in attribute information, determine in each drug entry Attribute information in the sum of the adjacent search key of any two occurs in multiple search keys gap character number t;The Three determination units, for determining recommendation based on numerical value n and numerical value t, wherein in the case where numerical value n is bigger, recommendation is got over Greatly, and in the case that numerical value t is smaller, recommendation is bigger;Acquiring unit, for obtaining the higher preceding m drug entry of recommendation, It obtains recommending drug entry.
Optionally, statistical module includes: the 4th determination unit, and for determining, any two are adjacent in multiple search keys The combination of search key;First retrieval unit, two adjacent search key a and b for being directed in each combination, every The character pitch number sequentially occurred between a and b every time is retrieved in the attribute information of a drug entry.
Optionally, retrieval unit includes: the second retrieval unit, in the attribute information of each drug entry by forward direction A is word for word retrieved afterwards;First execution unit, for word for word retrieving a and b backward, and to present interval character after being matched to a Number is counted;Second execution unit, if for being matched to a again when word for word retrieving a and b backward, to present interval word Symbol number counts again;Third execution unit, if recording present interval word for being matched to b when word for word retrieving a and b backward Accord with number;4th execution unit, for from current character, returning to the step of word for word retrieving a from the front to the back, until to drug entry Attribute information retrieval finish.
Optionally, statistical module is used to count for the attribute information of each drug entry in drug item list Before the gap character number that the adjacent keyword of every two occurs in attribute information, count each search key goes out occurrence It counts and records, the device further include: the 5th determination unit, for the adjacent retrieval of any two in determining multiple search keys After crucial combinatorics on words, for each drug entry, according to the record of the frequency of occurrence of each search key as a result, determining The search key not occurred in the attribute information of corresponding drug entry;Unit is deleted, for closing in the adjacent retrieval of any two Among key combinatorics on words, the combination of the search key including not occurring is deleted.
Optionally, the device further include: word segmentation module, for being examined in drug data bank according to medicine types label Before the drug item list of rope medicine types tag match, the attribute information of each drug entry in drug data bank is held Row word segmentation processing obtains the corresponding multiple participles of each drug entry;Extraction module, for being directed to each drug entry, right It answers and extracts multiple keywords in multiple participles;Execution module inputs multiple keywords of each drug entry for respectively pre- The first topic model that training obtains, obtains the output result of topic model, wherein topic model is used for multiple words according to input Export the theme label of prediction;Logic module, for export topic model for multiple keywords of each drug entry Medicine types label of the theme label as corresponding drug entry.
Optionally, the device further include: newly-increased module, in multiple passes that topic model is directed to each drug entry After medicine types label of the theme label of keyword output as corresponding drug entry, to each drug in drug data bank Entry increases attribute newly, wherein the value of the newly-increased attribute of each drug entry is the medicine types label of corresponding drug entry;It establishes Module, for establishing index to the drug entry in drug data bank based on newly-increased attribute.
It should be noted that above-mentioned modules can be realized by software or hardware, for the latter, Ke Yitong Following manner realization is crossed, but not limited to this: above-mentioned module is respectively positioned in same processor;Alternatively, above-mentioned modules are with any Combined form is located in different processors.
Obviously, those skilled in the art should be understood that each module of the above invention or each step can be with general Computing device realize that they can be concentrated on a single computing device, or be distributed in multiple computing devices and formed Network on, optionally, they can be realized with the program code that computing device can perform, it is thus possible to which they are stored It is performed by computing device in the storage device, and in some cases, it can be to be different from shown in sequence execution herein Out or description the step of, perhaps they are fabricated to each integrated circuit modules or by them multiple modules or Step is fabricated to single integrated circuit module to realize.In this way, the present invention is not limited to any specific hardware and softwares to combine.
Embodiment 3
The embodiments of the present invention also provide a kind of storage medium, computer program is stored in the storage medium, wherein The computer program is arranged to execute the step in any of the above-described embodiment of the method when operation.
Optionally, in the present embodiment, above-mentioned storage medium can include but is not limited to: USB flash disk, read-only memory (Read- OnlyMemory, referred to as ROM), random access memory (Random AccessMemory, referred to as RAM), mobile hard disk, The various media that can store computer program such as magnetic or disk.
Embodiment 4
The embodiments of the present invention also provide a kind of electronic device, including memory and processor, stored in the memory There is computer program, which is arranged to run computer program to execute the step in any of the above-described embodiment of the method Suddenly.
Optionally, above-mentioned electronic device can also include transmission device and input-output equipment, wherein the transmission device It is connected with above-mentioned processor, which connects with above-mentioned processor.
The foregoing is only a preferred embodiment of the present invention, is not intended to restrict the invention, for the skill of this field For art personnel, the invention may be variously modified and varied.It is all within principle of the invention, it is made it is any modification, etc. With replacement, improvement etc., should all be included in the protection scope of the present invention.

Claims (10)

1. a kind of search method of drug data, which is characterized in that the described method includes:
Obtain the medicine types label of input;
According to the medicine types label, the drug entry column of the medicine types tag match is retrieved in drug data bank Table;
Obtain the search key of input;
The search key is split, multiple search keys are obtained;
For the attribute information of each drug entry in the drug item list, going out for each search key is counted The gap character number that occurrence number and the adjacent keyword of every two occur in the attribute information;
It is determined according to statistical result and recommends drug entry.
2. being wrapped the method according to claim 1, wherein described determined according to statistical result recommends drug entry It includes:
According to the frequency of occurrence of each of statistics search key, determine in the attribute information of each drug entry The total degree n that the multiple search key occurs;
According to the gap character number that the adjacent keyword of the every two of statistics occurs in the attribute information, determine in each institute State the interval word that the adjacent search key of any two in multiple search keys described in the attribute information of drug entry occurs Accord with the sum of number t;
Recommendation is determined based on numerical value n and numerical value t, wherein in the case where the numerical value n is bigger, the recommendation is bigger, and In the case that the numerical value t is smaller, the recommendation is bigger;
The higher preceding m drug entry of the recommendation is obtained, the recommendation drug entry is obtained.
3. according to the method described in claim 2, it is characterized in that, each drug in the drug item list The attribute information of entry, the gap character number that the adjacent keyword of statistics every two occurs in the attribute information, comprising:
Determine the combination of the adjacent search key of any two in the multiple search key;
For two adjacent search key a and b in each combination, in the attribute information of each drug entry Sequentially there is the character pitch number between a and b every time in retrieval.
4. according to the method described in claim 3, it is characterized in that, described examine in the attribute information of each drug entry Sequentially there is the character pitch number between a and b every time in rope, comprising:
A is word for word retrieved from the front to the back in the attribute information of each drug entry;
After being matched to a, a and b is word for word retrieved backward, and count to present interval number of characters;
If being matched to a again when word for word retrieving a and b backward, the present interval number of characters is counted again;
If being matched to b when word for word retrieving a and b backward, the present interval number of characters is recorded;
From current character, the step of word for word retrieving a from the front to the back is returned to, until the attribute information to the drug entry is retrieved It finishes.
5. according to the method described in claim 3, it is characterized in that, for each drug entry in the drug item list Attribute information, before the gap character number that the adjacent keyword of statistics every two occurs in the attribute information, statistics The frequency of occurrence and record of each search key, the adjacent retrieval of any two in determining the multiple search key After crucial combinatorics on words, the method also includes:
For each drug entry, corresponded to according to the record of the frequency of occurrence of each search key as a result, determining The search key not occurred in the attribute information of drug entry;
Among the combination of the adjacent search key of any two, the group for the search key not occurred described in including is deleted It closes.
6. the method according to claim 1, wherein according to the medicine types label, in drug data bank Before the drug item list of the middle retrieval medicine types tag match, the method also includes:
Word segmentation processing is executed to the attribute information of each of the drug data bank drug entry, obtains each medicine The corresponding multiple participles of product entry;
For each drug entry, multiple keywords are extracted in the multiple participle of correspondence;
Respectively by multiple keywords input topic model that training obtains in advance of each drug entry, the theme is obtained The output result of model, wherein the topic model is used for the theme label according to the output prediction of multiple words of input;
The theme label that multiple keywords that the topic model is directed to each drug entry are exported is as corresponding drug The medicine types label of entry.
7. according to the method described in claim 6, it is characterized in that, the topic model is directed to each drug entry Multiple keywords output theme label as correspondence drug entry medicine types label after, the method also includes:
In the drug data bank newly attribute is increased to each drug entry, wherein what each drug entry increased newly The value of attribute is the medicine types label of corresponding drug entry;
Index is established to the drug entry in the drug data bank based on the newly-increased attribute.
8. a kind of retrieval device of drug data, which is characterized in that described device includes:
First obtains module, for obtaining the medicine types label of input;
Retrieval module, for retrieving the medicine types tag match in drug data bank according to the medicine types label Drug item list;
Second obtains module, for obtaining the search key of input;
It splits module and obtains multiple search keys for splitting the search key;
Statistical module counts each described for the attribute information for each drug entry in the drug item list The gap character number that the adjacent keyword of frequency of occurrence and every two of search key occurs in the attribute information;
Determining module recommends drug entry for determining according to statistical result.
9. a kind of storage medium, which is characterized in that be stored with computer program in the storage medium, wherein the computer Program is arranged to perform claim when operation and requires method described in 1 to 7 any one.
10. a kind of electronic device, including memory and processor, which is characterized in that be stored with computer journey in the memory Sequence, the processor are arranged to run the computer program in method described in perform claim 1 to 7 any one of requirement.
CN201910345736.5A 2019-04-26 2019-04-26 Search method, device and storage medium, the electronic device of drug data Pending CN110232071A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910345736.5A CN110232071A (en) 2019-04-26 2019-04-26 Search method, device and storage medium, the electronic device of drug data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910345736.5A CN110232071A (en) 2019-04-26 2019-04-26 Search method, device and storage medium, the electronic device of drug data

Publications (1)

Publication Number Publication Date
CN110232071A true CN110232071A (en) 2019-09-13

Family

ID=67860369

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910345736.5A Pending CN110232071A (en) 2019-04-26 2019-04-26 Search method, device and storage medium, the electronic device of drug data

Country Status (1)

Country Link
CN (1) CN110232071A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111198887A (en) * 2019-12-31 2020-05-26 北京左医健康技术有限公司 Medicine indexing method, medicine retrieval method and system
CN112307180A (en) * 2020-10-22 2021-02-02 上海芯翌智能科技有限公司 Rapid retrieval method and device based on label object
CN115359924A (en) * 2022-08-11 2022-11-18 浙江迪谱诊断技术有限公司 Cardiovascular and cerebrovascular medicine scoring method and device, storage medium and electronic equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2010282480A (en) * 2009-06-05 2010-12-16 Nippon Telegr & Teleph Corp <Ntt> Retrieval result ranking method, device and program, and computer-readable recording medium
CN102043812A (en) * 2009-10-13 2011-05-04 北京大学 Method and system for retrieving medical information
CN106024013A (en) * 2016-04-29 2016-10-12 努比亚技术有限公司 Voice data searching method and system
CN106776741A (en) * 2016-11-18 2017-05-31 北京奇虎科技有限公司 A kind of searching method and device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2010282480A (en) * 2009-06-05 2010-12-16 Nippon Telegr & Teleph Corp <Ntt> Retrieval result ranking method, device and program, and computer-readable recording medium
CN102043812A (en) * 2009-10-13 2011-05-04 北京大学 Method and system for retrieving medical information
CN106024013A (en) * 2016-04-29 2016-10-12 努比亚技术有限公司 Voice data searching method and system
CN106776741A (en) * 2016-11-18 2017-05-31 北京奇虎科技有限公司 A kind of searching method and device

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111198887A (en) * 2019-12-31 2020-05-26 北京左医健康技术有限公司 Medicine indexing method, medicine retrieval method and system
CN112307180A (en) * 2020-10-22 2021-02-02 上海芯翌智能科技有限公司 Rapid retrieval method and device based on label object
CN115359924A (en) * 2022-08-11 2022-11-18 浙江迪谱诊断技术有限公司 Cardiovascular and cerebrovascular medicine scoring method and device, storage medium and electronic equipment

Similar Documents

Publication Publication Date Title
US11328128B2 (en) System and method for analysis and navigation of data
US20180336193A1 (en) Artificial Intelligence Based Method and Apparatus for Generating Article
CN109062994A (en) Recommended method, device, computer equipment and storage medium
CN110209805B (en) Text classification method, apparatus, storage medium and computer device
US8725717B2 (en) System and method for identifying topics for short text communications
CN110532451A (en) Search method and device for policy text, storage medium, electronic device
CA3153598A1 (en) Method of and device for predicting video playback integrity
CN109614550A (en) Public sentiment monitoring method, device, computer equipment and storage medium
CN108595494A (en) The acquisition methods and device of reply message
CN108073568A (en) keyword extracting method and device
CN108304375A (en) A kind of information identifying method and its equipment, storage medium, terminal
CN109325146B (en) Video recommendation method and device, storage medium and server
CN107844533A (en) A kind of intelligent Answer System and analysis method
CN108959559A (en) Question and answer are to generation method and device
CN110232071A (en) Search method, device and storage medium, the electronic device of drug data
CN105843796A (en) Microblog emotional tendency analysis method and device
CN107688616A (en) Show unique fact of entity
CN112036187A (en) Context-based video barrage text auditing method and system
CN110955659A (en) Method and system for processing data table
CN113204953A (en) Text matching method and device based on semantic recognition and device readable storage medium
CN110162752A (en) Article sentences weight processing method, device and electronic equipment
CN112989824A (en) Information pushing method and device, electronic equipment and storage medium
CN112948664A (en) Method and system for automatically processing sensitive words
CA3046474A1 (en) Portfolio-based text analytics tool
CN115310436A (en) Document outline extraction method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190913

RJ01 Rejection of invention patent application after publication