CN109657044A - Data retrieval method, data sorting method, device, terminal and storage medium

Info

Publication number
CN109657044A
Authority
CN
China
Prior art keywords
data
retrieved
term
candidate word
matching
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811536639.6A
Other languages
Chinese (zh)
Inventor
高安
陈而淦
刘永刚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Shangyi Heart Technology Co Ltd
Original Assignee
Beijing Shangyi Heart Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Shangyi Heart Technology Co Ltd
Priority to CN201811536639.6A
Publication of CN109657044A
Legal status: Pending

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Embodiments of the present invention provide a data retrieval method, a data sorting method, a device, a terminal and a storage medium. The data retrieval method comprises: obtaining a search term, where the search term is used to retrieve data to be retrieved, the data to be retrieved has attribute information, and the attribute information includes the code, Chinese name, English name, alias, pinyin, search count and popularity of the data to be retrieved; matching the search term against the candidate words in a candidate word list, where the candidate words are determined from the attribute information of the data to be retrieved; and screening the data to be retrieved according to the matching result and a predetermined condition to obtain target data. The embodiments address the technical problem of how to improve data retrieval accuracy, so that search results are highly relevant to what the user expects and the user can quickly obtain the data of interest. This improves the user experience and also achieves a hot-search effect.

Description

Data retrieval method, data sorting method, device, terminal and storage medium
Technical field
The present invention relates to the technical field of data processing, and in particular to a data retrieval method, a data sorting method, a device, a terminal and a storage medium.
Background
As society develops, large amounts of rich data are being produced. When facing such data, data retrieval becomes essential for obtaining the data one is interested in.
The prior art generally retrieves data by data code. However, this approach relies on a single search condition, so the data the user is interested in often fails to appear near the top of the search results. Retrieval accuracy is therefore poor, and the user cannot quickly obtain the data of interest.
In short, the prior art suffers from poor data retrieval accuracy because it relies on a single search condition.
Summary of the invention
An object of the embodiments of the present invention is to provide a data retrieval method that solves the technical problem of how to improve data retrieval accuracy. The embodiments also provide a data sorting method, a device, a terminal and a storage medium.
To achieve the above object, according to a first aspect of the present invention, the following technical solution is provided:
A data retrieval method, comprising:
obtaining a search term, wherein the search term is used to retrieve data to be retrieved, the data to be retrieved has attribute information, and the attribute information includes the code, Chinese name, English name, alias, pinyin, search count and popularity of the data to be retrieved;
matching the search term against the candidate words in a candidate word list, wherein the candidate words are determined from the attribute information of the data to be retrieved;
screening the data to be retrieved according to the matching result and a predetermined condition to obtain target data.
Further, after the step of obtaining the search term, the method also comprises:
normalizing (regularizing) the search term.
Further, the step of matching the search term against the candidate words in the candidate word list specifically comprises:
segmenting the search term;
constructing the candidate word list with a ternary search tree, wherein the key-value pairs stored in the nodes of the ternary search tree are generated from the data to be retrieved and its attribute information;
matching the segmented search term against the candidate words in the candidate word list based on a prefix matching strategy.
To achieve the above object, according to a second aspect of the present invention, the following technical solution is also provided:
A data sorting method, comprising:
obtaining data to be sorted, wherein the data to be sorted is obtained by the data retrieval method according to the first aspect of the present invention;
arranging the data to be sorted according to a predetermined policy to obtain target data, wherein the predetermined policy includes one or more of the following: exact match on the data code, exact match on the data code after leading zeros are removed, exact match on the pinyin initials, prefix match on the data code and data name, historical retrieval count, and data weight.
To achieve the above object, according to a third aspect of the present invention, the following technical solution is also provided:
A data retrieval device, comprising:
a first obtaining module, configured to obtain a search term, wherein the search term is used to retrieve data to be retrieved, the data to be retrieved has attribute information, and the attribute information includes the code, Chinese name, English name, alias, pinyin, search count and popularity of the data to be retrieved;
a matching module, configured to match the search term against the candidate words in a candidate word list, wherein the candidate words are determined from the attribute information of the data to be retrieved;
a screening module, configured to screen the data to be retrieved according to the matching result and a predetermined condition to obtain target data.
Further, the device also comprises:
a normalization (regularization) module, configured to normalize the search term.
Further, the matching module is specifically configured to:
segment the search term;
construct the candidate word list with a ternary search tree, wherein the key-value pairs stored in the nodes of the ternary search tree are generated from the data to be retrieved and its attribute information;
match the segmented search term against the candidate words in the candidate word list based on a prefix matching strategy.
To achieve the above object, according to a fourth aspect of the present invention, the following technical solution is also provided:
A data sorting device, comprising:
a second obtaining module, configured to obtain data to be sorted, wherein the data to be sorted is obtained by the data retrieval device according to the third aspect of the present invention;
an arranging module, configured to arrange the data to be sorted according to a predetermined policy to obtain target data, wherein the predetermined policy includes one or more of the following: exact match on the data code, exact match on the data code after leading zeros are removed, exact match on the pinyin initials, prefix match on the data code and data name, historical retrieval count, and data weight.
To achieve the above object, according to a fifth aspect of the present invention, the following technical solution is also provided:
A terminal, comprising a processor, a communication interface, a memory and a communication bus, wherein the processor, the communication interface and the memory communicate with one another over the communication bus;
the memory is configured to store a computer program;
the processor is configured to implement the method steps of the first or second aspect of the present invention when executing the program stored in the memory.
To achieve the above object, according to a sixth aspect of the present invention, the following technical solution is also provided:
A computer-readable storage medium storing a computer program which, when executed by a processor, implements the method steps of the first or second aspect of the present invention.
Embodiments of the present invention provide a data retrieval method, a data sorting method, a device, a terminal and a storage medium. The data retrieval method comprises: obtaining a search term, where the search term is used to retrieve data to be retrieved, the data to be retrieved has attribute information, and the attribute information includes the code, Chinese name, English name, alias, pinyin, search count and popularity of the data to be retrieved; matching the search term against the candidate words in a candidate word list, where the candidate words are determined from the attribute information of the data to be retrieved; and screening the data to be retrieved according to the matching result and a predetermined condition to obtain target data.
The embodiments match the search term using the attribute information of the data to be retrieved, namely its code, Chinese name, English name, alias, pinyin, search count, popularity and so on. This improves retrieval accuracy, so that search results are highly relevant to what the user expects and the user can quickly obtain the data of interest. In addition, the embodiments take search count and popularity into account through word frequency statistics, which achieves a hot-search effect and further improves the user experience.
So that the technical means of the present invention can be understood more clearly and implemented according to the specification, and so that the above and other objects, features and advantages of the invention are easier to understand, preferred embodiments are described in detail below with reference to the accompanying drawings. Other features and advantages of the present invention are set out in the following description; in part they will be apparent from the description, or may be learned by practicing the invention. The objects and other advantages of the invention can be realized and obtained through the structure particularly pointed out in the specification, the claims and the drawings.
Brief description of the drawings
To explain the technical solutions of the embodiments of the present invention or of the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. The drawings described below show only some embodiments of the invention; a person of ordinary skill in the art can obtain other drawings from them without creative effort.
Fig. 1 is a flow diagram of the data retrieval method according to an embodiment of the present invention;
Fig. 2 is a flow diagram of the data sorting method according to an embodiment of the present invention;
Fig. 3 is a structural diagram of the data retrieval device according to an embodiment of the present invention;
Fig. 4 is a structural diagram of the data sorting device according to an embodiment of the present invention.
Detailed description of the embodiments
Embodiments of the present invention are illustrated below through specific examples, and those skilled in the art can readily understand other advantages and effects of the invention from the content disclosed in this specification. The described embodiments are only some of the embodiments of the invention, not all of them. The invention can also be implemented or applied through other, different embodiments, and the details in this specification can be modified or changed from different viewpoints and for different applications without departing from the spirit of the invention. Where no conflict arises, the following embodiments and the features in them can be combined with one another. All other embodiments obtained by a person of ordinary skill in the art from the embodiments of the invention without creative effort fall within the protection scope of the invention.
Various aspects of embodiments within the scope of the appended claims are described below. The aspects described here can be embodied in a wide variety of forms, and any specific structure and/or function described here is only illustrative. Based on the present invention, a person skilled in the art will understand that an aspect described here can be implemented independently of any other aspect, and that two or more of these aspects can be combined in various ways. For example, a device can be implemented and/or a method practiced using any number of the aspects set forth here. In addition, the device can be implemented and/or the method practiced using other structures and/or functionality in addition to one or more of the aspects set forth here.
The illustrations provided in the following embodiments only show the basic concept of the invention schematically. The drawings show only the components related to the invention rather than the number, shape and size of the components in an actual implementation. In an actual implementation the form, quantity and proportion of each component can change freely, and the component layout may be more complex.
In addition, specific details are provided in the following description for a thorough understanding of the examples. However, those skilled in the art will understand that the aspects can be practiced without these specific details.
As entry points for individual stocks and user-selected watchlists are added, the accuracy and relevance of stock data retrieval can seriously affect the user experience. Existing data retrieval methods generally retrieve data by matching stock names, so the results reflect neither the popularity of the searched stock nor its relevance to the search term. Low-relevance results may even be ranked first while high-relevance results are ranked later.
When retrieving data, the prior art generally matches on data code. Take stock data as an example (the data could equally be other securities data such as futures, or consumption statistics, tourism distribution data, clothing index data, and so on): the prior art usually retrieves stock data by stock code. For example, if "51" is used as the search term, retrieval by stock code returns results such as Yinhua Rili (511880) and Kangmei Pharmaceutical (600518). In fact, however, the user is interested in 51Talk (COE). In other words, data the user is not interested in is ranked near the top of the search results, while the data the user is interested in is ranked further back. Retrieval accuracy is therefore poor, the user cannot quickly obtain the data of interest, and the user experience suffers.
It can be seen that the prior art retrieves data by matching on data code alone and therefore suffers from poor retrieval accuracy.
In view of this, to improve data retrieval accuracy, an embodiment of the present invention provides a data retrieval method. As shown in Fig. 1, the method mainly includes steps S100 to S120:
S100: obtaining a search term. The search term is used to retrieve data to be retrieved; the data to be retrieved has attribute information; the attribute information includes the code, Chinese name, English name, alias, pinyin, search count and popularity of the data to be retrieved.
In this step, the user can enter the search term on a terminal to search for data. Terminals include, but are not limited to, smartphones, computers, tablet computers, smart televisions and wearable devices.
The attribute information of the data to be retrieved includes, but is not limited to, its code, Chinese name, English name, alias, pinyin, search count and popularity. The prior art does not recognize that these attributes can affect the result of data retrieval. The embodiments of the present invention take them into account as factors that influence the search result, which lays the foundation for improved retrieval accuracy.
The popularity of the data to be retrieved may be determined, for example, from the number of news reports, the number of media announcements, the number of times the data is shared, and so on.
By applying the search count and popularity of the data to be retrieved to data retrieval, the embodiments of the present invention can make the search results highly relevant to what the user expects.
After this step, the data retrieval method may also include:
S101: normalizing (regularizing) the search term.
In this step, the search term can be filtered logically by constructing a regular expression, so that only the required content is kept. The regular expression can be obtained by combining predetermined characters and/or character strings.
For example, the regular expression can express converting lowercase English letters to uppercase, converting traditional Chinese characters to simplified Chinese characters, removing word separators, and so on.
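The normalization in S101 can be sketched as follows. This is only an illustration under stated assumptions: the function name `normalize_term`, the set of separator characters, and the three-entry traditional-to-simplified table are hypothetical and do not come from the patent, which does not disclose the actual expressions used.

```python
import re

# Hypothetical, deliberately tiny traditional-to-simplified table.
# A real system would need a complete conversion dictionary.
TRAD_TO_SIMP = {"證": "证", "藥": "药", "業": "业"}

# Characters treated as separators (an assumption, not from the patent).
SEPARATORS = re.compile(r"[\s\-_/.,]+")


def normalize_term(term: str) -> str:
    """Upper-case Latin letters, map traditional characters, drop separators."""
    term = term.upper()
    term = "".join(TRAD_TO_SIMP.get(ch, ch) for ch in term)
    return SEPARATORS.sub("", term)


if __name__ == "__main__":
    print(normalize_term("goog l"))
    print(normalize_term("51-talk"))
```

Normalizing the search term and the candidate words in the same way keeps the later prefix matching insensitive to letter case and stray punctuation.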
S110: matching the search term against the candidate words in the candidate word list. The candidate words are determined from the attribute information of the data to be retrieved.
Specifically, step S110 may include steps S111 to S113:
S111: segmenting the data to be retrieved.
In this step, the data to be retrieved can be segmented with, for example, forward maximum matching, reverse maximum matching, maximum probability segmentation, maximum entropy segmentation, and so on.
For example, if the data to be retrieved is Chinese, one Chinese character is used as the segmentation unit; if it is English, one word is used as the segmentation unit.
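Two of the segmentation options mentioned above can be sketched as follows: splitting into units (one Chinese character or one alphanumeric word) and forward maximum matching. The function names, the regular expression and the `max_len` default are assumptions made for illustration; the patent does not specify them.

```python
import re

# One run of Latin letters/digits, or one CJK character, per unit.
UNIT = re.compile(r"[A-Za-z0-9]+|[\u4e00-\u9fff]")


def split_units(text):
    """Split text into units: each Chinese character, each alphanumeric word."""
    return UNIT.findall(text)


def forward_max_match(text, vocabulary, max_len=4):
    """Greedy left-to-right segmentation that prefers the longest known word."""
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest window first and shrink it until a word is found.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            # A single character is always accepted as a fallback.
            if size == 1 or piece in vocabulary:
                tokens.append(piece)
                i += size
                break
    return tokens
```

Reverse maximum matching works the same way but scans from the end of the string towards the start.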
S112: constructing the candidate word list with a ternary search tree. The key-value pairs stored in the nodes of the ternary search tree are generated from the data to be retrieved and its attribute information.
For example, the value stored in each node of the ternary search tree (also referred to as a prefix tree) is generated from the data to be retrieved, and the key stored in each node is generated from the attribute information of the data to be retrieved.
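A minimal ternary search tree with prefix lookup might look like the sketch below. The class and method names are hypothetical; keys stand for attribute strings (code, name, pinyin and so on) and values stand for the data records they point to, following the key/value description above. It is not the patent's implementation.

```python
class Node:
    __slots__ = ("ch", "lo", "eq", "hi", "values")

    def __init__(self, ch):
        self.ch = ch
        self.lo = self.eq = self.hi = None
        self.values = []  # records whose key ends at this node


class TernarySearchTree:
    def __init__(self):
        self.root = None

    def insert(self, key, value):
        if not key:
            raise ValueError("key must be non-empty")
        self.root = self._insert(self.root, key, 0, value)

    def _insert(self, node, key, i, value):
        ch = key[i]
        if node is None:
            node = Node(ch)
        if ch < node.ch:
            node.lo = self._insert(node.lo, key, i, value)
        elif ch > node.ch:
            node.hi = self._insert(node.hi, key, i, value)
        elif i + 1 < len(key):
            node.eq = self._insert(node.eq, key, i + 1, value)
        else:
            node.values.append(value)
        return node

    def _find(self, key):
        """Return the node reached by the last character of key, or None."""
        node, i = self.root, 0
        while node is not None:
            ch = key[i]
            if ch < node.ch:
                node = node.lo
            elif ch > node.ch:
                node = node.hi
            else:
                i += 1
                if i == len(key):
                    return node
                node = node.eq
        return None

    def with_prefix(self, prefix):
        """Return the values of every key that starts with prefix."""
        if not prefix:
            return []
        node = self._find(prefix)
        if node is None:
            return []
        found = list(node.values)
        self._collect(node.eq, found)
        return found

    def _collect(self, node, found):
        if node is None:
            return
        self._collect(node.lo, found)
        found.extend(node.values)
        self._collect(node.eq, found)
        self._collect(node.hi, found)
```

For example, after inserting the keys "BABA" and "BAC", `with_prefix("BA")` returns the values stored under both keys, while `with_prefix("X")` returns an empty list.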
S113: matching the segmented search term against the candidate words in the candidate word list based on a prefix matching strategy.
Because this step uses a prefix matching strategy, all right prefixes (the tails that remain when leading characters are dropped) are also added to the candidate word list.
For example, with BABA (Alibaba, ALiBaBa) as the search term, the prefixes "BABA", "ALIBABA", "Alibaba" (the Chinese name, 阿里巴巴) and "ALBB" are generated, and the right prefixes "Li Baba" (里巴巴), "Ba Ba" (巴巴), "LBB", "BB" and so on are generated as well. To improve retrieval accuracy, these right prefixes are also added to the candidate word list.
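The right-prefix idea can be sketched as generating the tails of each attribute string and adding them to the candidate list, so that plain prefix matching also finds terms that start in the middle of a name. The function names and the `min_len=2` cut-off are assumptions; the cut-off is chosen only because the example above lists "LBB" and "BB" but no single-character tail.

```python
def right_prefixes(word, min_len=2):
    """Tails of word obtained by dropping leading characters."""
    return [word[i:] for i in range(1, len(word) - min_len + 1)]


def build_candidates(attributes, min_len=2):
    """Each attribute string plus its right prefixes, without duplicates."""
    candidates = []
    for attribute in attributes:
        for word in [attribute] + right_prefixes(attribute, min_len):
            if word not in candidates:
                candidates.append(word)
    return candidates


def prefix_match(term, candidates):
    """Candidate words that start with the search term."""
    return [c for c in candidates if c.startswith(term)]
```

With the attributes "BABA" and "ALBB", the search term "LB" matches the candidate "LBB", which would be missed if only the full attribute strings were indexed.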
S120: screening the data to be retrieved according to the matching result and a predetermined condition to obtain target data.
The predetermined condition can be a condition set by the user, for example market data of securities, category data of securities, music chart data, book sales data, and so on.
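The screening in S120 can be pictured as a filter over the matched records. In the sketch below the record layout (dictionaries with `code` and `category` fields), the function name `screen` and the de-duplication step are all assumptions for illustration; de-duplication is included because one record may be reached through several candidate words.

```python
def screen(matched_records, condition):
    """Drop duplicate records, then keep those that satisfy the condition."""
    seen = set()
    result = []
    for record in matched_records:
        if record["code"] in seen:
            continue  # already reached through another candidate word
        seen.add(record["code"])
        if condition(record):
            result.append(record)
    return result
```

A user-set condition such as "stocks only" can then be passed as `lambda r: r["category"] == "stock"`.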
In summary, unlike the prior art, which uses a single matching factor, the embodiments of the present invention match the search term using the attribute information of the data to be retrieved (including, but not limited to, its code, Chinese name, English name, alias, pinyin, search count and popularity). This improves retrieval accuracy, so that search results are highly relevant to what the user expects and the user can quickly obtain the data of interest. The embodiments also take search count and popularity into account through word frequency statistics, which achieves a hot-search effect and further improves the user experience.
In addition, an embodiment of the present invention provides a data sorting method. As shown in Fig. 2, the data sorting method mainly includes:
S200: obtaining data to be sorted.
The data to be sorted in this step can be obtained by the data retrieval method embodiment described above.
S210: arranging the data to be sorted according to a predetermined policy to obtain target data. The predetermined policy includes one or more of the following: exact match on the data code, exact match on the data code after leading zeros are removed, exact match on the pinyin initials, prefix match on the data code and data name, historical retrieval count, and data weight.
The data weight can be determined from how often or how widely the data is announced, reported on, followed, and so on. For example, if the news popularity of the data and the user attention it receives are high, the data can be given a high weight.
In practice, the items of the predetermined policy can be applied in a predetermined order or in random order to arrange the data to be sorted, and the target data is obtained as the sorted result.
To aid understanding of the present invention, this embodiment is described in detail below with specific examples.
When the user searches for "GOOG" (the search term) and both "GOOG" and "GOOGL" exist, then under the strategy of exact match on the data code, "GOOG" has the higher priority (that is, the higher relevance to the search term) and is ranked before "GOOGL". When the user searches for "5" and the data 00005, 000005, 57000 and 57001 exists, then under the strategy of exact match on the data code after leading zeros are removed, 00005 and 000005 are ranked before 57000 and 57001. When the user searches for "PG" and "PG", "AAPL" and "PGC" exist, then under the strategies of exact match on the data code and exact match on the pinyin initials, "PG" and "AAPL" are ranked before "PGC" (the Chinese name of Apple, 苹果, has the pinyin initials PG). When the user searches for "51" and "COE" (that is, 51Talk) and 510010 exist, then under the strategy of prefix match on the data code and data name, COE is ranked before 510010.
If several pieces of data match the search term equally well (that is, the strategies of exact match on the data code, exact match on the data code after leading zeros are removed, exact match on the pinyin initials, and prefix match on the data code and data name are either all unsatisfied or all satisfied), the historical retrieval count and the data weight usually determine which data is ranked first.
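The ranking behaviour in the examples above can be approximated with a composite sort key. This is a sketch under assumptions: the `Record` fields, the function names, and in particular the fixed order in which the criteria are applied are chosen for illustration, since the text states that the policy items may be applied in a predetermined or random order.

```python
from dataclasses import dataclass


@dataclass
class Record:
    code: str
    name: str = ""
    pinyin_initials: str = ""
    history_count: int = 0   # historical retrieval count
    weight: float = 0.0      # data weight


def rank_key(term, rec):
    """Smaller tuples sort first; False sorts before True."""
    term = term.upper()
    code = rec.code.upper()
    exact_code = code == term
    exact_no_zero = code.lstrip("0") == term.lstrip("0")
    exact_pinyin = rec.pinyin_initials.upper() == term
    prefix = code.startswith(term) or rec.name.upper().startswith(term)
    return (
        not exact_code,
        not exact_no_zero,
        not exact_pinyin,
        not prefix,
        -rec.history_count,  # ties: more retrievals first
        -rec.weight,         # then higher weight first
    )


def rank(term, records):
    """Stable sort of records by relevance to the search term."""
    return sorted(records, key=lambda rec: rank_key(term, rec))
```

With this key, `rank("PG", ...)` over the codes PGC, AAPL (pinyin initials "PG") and PG yields PG, AAPL, PGC. Records that tie on all four matching criteria, such as COE (name "51Talk") and 510010 for the term "51" under this particular key, are ordered by retrieval count and then by weight.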
In summary, this embodiment presents the search results according to a predetermined policy, so that highly relevant results are ranked first and the user sees the most interesting data first, which improves the user experience.
Although the steps of the data retrieval method and data sorting method embodiments are described above in a particular order, those skilled in the art will appreciate that the steps need not be executed in that order. They can also be executed in reverse order, in parallel, interleaved, or in other orders, and those skilled in the art can add further steps to them. Such obvious variants or equivalent replacements also fall within the protection scope of the present invention and are not described further here.
The following are device embodiments of the present invention, which are used to carry out the steps implemented by the method embodiments. For ease of description, only the parts related to the embodiments of the present invention are shown; for specific technical details that are not disclosed, refer to the method embodiments. The functional units in each device embodiment can be integrated in one processing unit, can each exist physically on their own, or two or more units can be integrated in one unit. The integrated unit can be implemented in hardware, or in hardware combined with software functional units.
Based on the same technical concept as the data retrieval method embodiment above, an embodiment of the present invention also provides a data retrieval device. As shown in Fig. 3, the data retrieval device mainly includes a first obtaining module 31, a matching module 32 and a screening module 33. The first obtaining module 31 is configured to obtain a search term, where the search term is used to retrieve data to be retrieved, the data to be retrieved has attribute information, and the attribute information includes the code, Chinese name, English name, alias, pinyin, search count and popularity of the data to be retrieved. The matching module 32 is configured to match the search term against the candidate words in a candidate word list, where the candidate words are determined from the attribute information of the data to be retrieved. The screening module 33 is configured to screen the data to be retrieved according to the matching result and a predetermined condition to obtain target data.
The popularity of the data to be retrieved may be determined, for example, from the number of news reports, the number of media announcements, the number of times the data is shared, and so on. Through the first obtaining module 31, the embodiment applies the search count and popularity of the data to be retrieved to data retrieval, which can make the search results highly relevant to what the user expects.
The predetermined condition can be a condition set by the user, for example market data of securities, category data of securities, music chart data, book sales data, and so on.
In a preferred embodiment, the data retrieval device may also include a normalization (regularization) module, which is configured to normalize the search term.
The normalization module can filter the search term logically by constructing a regular expression, so that only the required content is kept. The regular expression can be obtained by combining predetermined characters and/or character strings.
In a preferred embodiment, the matching module 32 is specifically configured to: segment the search term; construct the candidate word list with a ternary search tree, where the key-value pairs stored in the nodes of the ternary search tree are generated from the data to be retrieved and its attribute information; and match the segmented search term against the candidate words in the candidate word list based on a prefix matching strategy.
The matching module 32 can segment the data to be retrieved with, for example, forward maximum matching, reverse maximum matching, maximum probability segmentation, maximum entropy segmentation, and so on.
For details of the data retrieval device embodiment, refer to the related description of the method embodiment above; they are not repeated here.
In summary, unlike the prior art, which uses a single matching factor, the embodiment uses the first obtaining module 31, the matching module 32 and the screening module 33 to match the search term using the attribute information of the data to be retrieved (including, but not limited to, its code, Chinese name, English name, alias, pinyin, search count and popularity). This improves retrieval accuracy, so that search results are highly relevant to what the user expects and the user can quickly obtain the data of interest. The embodiment also takes search count and popularity into account through word frequency statistics, which achieves a hot-search effect and further improves the user experience.
In addition, an embodiment of the present invention provides a data sorting device. As shown in Fig. 4, the data sorting device mainly includes a second obtaining module 41 and an arranging module 42. The second obtaining module 41 is configured to obtain data to be sorted, where the data to be sorted is obtained by the data retrieval device described above. The arranging module 42 is configured to arrange the data to be sorted according to a predetermined policy to obtain target data, where the predetermined policy includes one or more of the following: exact match on the data code, exact match on the data code after leading zeros are removed, exact match on the pinyin initials, prefix match on the data code and data name, historical retrieval count, and data weight.
For the specific implementation process of this embodiment, the problem it solves and the technical effect it achieves, refer to the related description in the method embodiments above; they are not repeated here.
Using the second obtaining module 41 and the arranging module 42, the embodiment presents the search results according to a predetermined policy, so that highly relevant results are ranked first and the user sees the most interesting data first, which improves the user experience.
Based on technical concept identical with above-mentioned data retrieval method or data reordering method, the embodiment of the present invention is also provided A kind of terminal comprising processor and memory;Wherein: memory is for storing computer program.Processor is deposited for executing When the program stored on reservoir, each technical side described in data retrieval method embodiment or data reordering method embodiment is realized Any method and step in case.
The processor may include one or more processing cores, for example a 4-core processor or an 8-core processor. The processor may be implemented in at least one of the following hardware forms: DSP (Digital Signal Processing), FPGA (Field-Programmable Gate Array) and PLA (Programmable Logic Array). The processor may also include a main processor and a coprocessor. The main processor, also called a CPU (Central Processing Unit), processes data in the awake state; the coprocessor is a low-power processor that processes data in the standby state. In some embodiments, the processor may integrate a GPU (Graphics Processing Unit), which is responsible for rendering and drawing the content to be shown on the display screen. In some embodiments, the processor may further include an AI (Artificial Intelligence) processor for handling computing operations related to machine learning.
The memory may include random access memory (RAM), and may also include non-volatile memory (NVM), for example at least one magnetic disk storage device. Optionally, the memory may also be at least one storage device located remotely from the processor.
In some embodiments, the terminal optionally further includes a peripheral device interface and at least one peripheral device. The processor, the memory and the peripheral device interface may be connected by a bus or signal lines. Each peripheral device may be connected to the peripheral device interface by a bus, a signal line or a circuit board.
For the specific implementation process of this embodiment and the details of the problems it solves, reference may be made to the related description in the foregoing method embodiments; details are not repeated here.
When the processor executes the program stored in the memory, the terminal provided by the embodiment of the present invention matches the search term against the combined attribute information of the data to be retrieved, such as its code, Chinese name, English name, alias, pinyin, search count and popularity. This improves the accuracy of data retrieval, so that the search results are highly relevant to what the user expects and the user can quickly obtain the data of interest. Moreover, because the embodiment takes the search count and popularity into account through word frequency statistics, it achieves the technical effect of hot search and thereby improves the user experience.
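As an illustration only, the bookkeeping behind a search count can be sketched with a simple counter. How the embodiment actually derives popularity from its statistics is not stated in this passage; the class name `SearchStats` and its methods are hypothetical.

```python
from collections import Counter


class SearchStats:
    """Counts how often each record is selected from the search results."""

    def __init__(self):
        self.counts = Counter()

    def record(self, code):
        # Call once each time a user picks the record identified by code.
        self.counts[code] += 1

    def hottest(self, n):
        # Codes of the n most frequently selected records, most frequent first.
        return [code for code, _ in self.counts.most_common(n)]
```

The resulting counts could feed the historical search count used when arranging results, so that frequently selected records surface earlier.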
Based on the same technical concept as the above data retrieval method or data sorting method, an embodiment of the present invention further provides a computer-readable storage medium. A computer program is stored in the computer-readable storage medium, and when the computer program is executed by a processor, it implements any of the method steps in the technical solutions described in the data retrieval method embodiments or the data sorting method embodiments.
The computer-readable storage medium may include, but is not limited to, random access memory (RAM), dynamic random access memory (DRAM), static random access memory (SRAM), read-only memory (ROM), programmable read-only memory (PROM), erasable programmable read-only memory (EPROM), electrically erasable programmable read-only memory (EEPROM), flash memory (for example, NOR flash or NAND flash), content-addressable memory (CAM), polymer memory (for example, ferroelectric polymer memory), phase-change memory, ovonic memory, silicon-oxide-nitride-oxide-silicon (SONOS) memory, a magnetic card or optical card, or any other suitable type of computer-readable storage medium.
For the specific implementation process of this embodiment and the details of the problems it solves, reference may be made to the related description in the foregoing method embodiments; details are not repeated here.
When its program is executed by a processor, the computer-readable storage medium provided by the embodiment of the present invention matches the search term against the combined attribute information of the data to be retrieved, such as its code, Chinese name, English name, alias, pinyin, search count and popularity. This improves the accuracy of data retrieval, so that the search results are highly relevant to what the user expects and the user can quickly obtain the data of interest. Moreover, because the embodiment takes the search count and popularity into account through word frequency statistics, it achieves the technical effect of hot search and thereby improves the user experience.
The basic principles of the present disclosure have been described above in connection with specific embodiments. It should be noted, however, that the merits, advantages and effects mentioned in the present disclosure are merely examples and not limitations, and must not be regarded as prerequisites of every embodiment of the present disclosure. In addition, the specific details disclosed above serve only as examples and as an aid to understanding, not as limitations; they do not restrict the present disclosure to being implemented with those specific details.
It should be noted that, in this document, relational terms such as "first" and "second" are used only to distinguish one entity or operation from another, and do not necessarily require or imply any actual relationship or order between those entities or operations. Moreover, the terms "include", "comprise" and any variants thereof are intended to cover non-exclusive inclusion, so that a process, method, article or device that includes a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to such a process, method, article or device. Without further limitation, an element defined by the phrase "including a ..." does not exclude the presence of additional identical elements in the process, method, article or device that includes the element.
It should also be noted that, in the systems and methods of the present disclosure, the components or steps can be decomposed and/or recombined. Such decompositions and/or recombinations should be regarded as equivalent solutions of the present disclosure.
The embodiments in this specification are described in a related manner: each embodiment focuses on its differences from the other embodiments, and the same or similar parts of the embodiments may be referred to one another. Various changes, substitutions and alterations may be made to the technology described herein without departing from the teachings defined by the appended claims. In addition, the scope of the claims of the present disclosure is not limited to the specific aspects of the processes, machines, manufactures, compositions of matter, means, methods and actions described above. Processes, machines, manufactures, compositions of matter, means, methods or actions that currently exist or are developed later, and that perform substantially the same function or achieve substantially the same result as the corresponding aspects described herein, may be used. Accordingly, the appended claims include within their scope such processes, machines, manufactures, compositions of matter, means, methods or actions.
The above are merely preferred embodiments of the present invention and are not intended to limit the protection scope of the present invention. Any modification, equivalent replacement or improvement made within the spirit and principles of the present invention falls within the protection scope of the present invention.

Claims (10)

1. A data retrieval method, characterized by comprising:
Obtaining a search term, wherein the search term is used to retrieve data to be retrieved, the data to be retrieved has attribute information, and the attribute information includes the code, Chinese name, English name, alias, pinyin, search count and popularity of the data to be retrieved;
Matching the search term against candidate words in a candidate word list, wherein the candidate words are determined according to the attribute information of the data to be retrieved;
Screening the data to be retrieved under a predetermined condition according to the matching result to obtain target data.
2. The data retrieval method according to claim 1, characterized in that, after the step of obtaining the search term, the method further comprises:
Regularizing the search term.
3. The data retrieval method according to claim 1, characterized in that the step of matching the search term against the candidate words in the candidate word list specifically comprises:
Segmenting the search term;
Constructing the candidate word list by means of a ternary search tree, wherein the key-value pairs stored in the nodes of the ternary search tree are generated based on the data to be retrieved and its attribute information;
Matching, based on a prefix matching strategy, the segmented search term against the candidate words in the candidate word list.
4. A data sorting method, characterized by comprising:
Obtaining data to be sorted, wherein the data to be sorted is obtained by the data retrieval method according to any one of claims 1 to 3;
Arranging the data to be sorted according to a predetermined policy to obtain target data, wherein the predetermined policy includes one or more of the following: exact match of the data code, exact match of the data code after leading zeros are removed, exact match of the pinyin initials, prefix match of the data code and data name, historical search count, and data weight.
5. A data retrieval device, characterized by comprising:
A first acquisition module, configured to obtain a search term, wherein the search term is used to retrieve data to be retrieved, the data to be retrieved has attribute information, and the attribute information includes the code, Chinese name, English name, alias, pinyin, search count and popularity of the data to be retrieved;
A matching module, configured to match the search term against candidate words in a candidate word list, wherein the candidate words are determined according to the attribute information of the data to be retrieved;
A screening module, configured to screen the data to be retrieved under a predetermined condition according to the matching result to obtain target data.
6. The data retrieval device according to claim 5, characterized in that the device further comprises:
A regularization module, configured to regularize the search term.
7. The data retrieval device according to claim 5, characterized in that the matching module is specifically configured to:
Segment the search term;
Construct the candidate word list by means of a ternary search tree, wherein the key-value pairs stored in the nodes of the ternary search tree are generated based on the data to be retrieved and its attribute information;
Match, based on a prefix matching strategy, the segmented search term against the candidate words in the candidate word list.
8. A data sorting device, characterized by comprising:
A second acquisition module, configured to obtain data to be sorted, wherein the data to be sorted is obtained by the data retrieval device according to any one of claims 5 to 7;
An arrangement module, configured to arrange the data to be sorted according to a predetermined policy to obtain target data, wherein the predetermined policy includes one or more of the following: exact match of the data code, exact match of the data code after leading zeros are removed, exact match of the pinyin initials, prefix match of the data code and data name, historical search count, and data weight.
9. A terminal, characterized by comprising a processor, a communication interface, a memory and a communication bus, wherein the processor, the communication interface and the memory communicate with one another through the communication bus;
The memory is configured to store a computer program;
The processor is configured to implement, when executing the program stored in the memory, the method steps according to any one of claims 1 to 4.
10. A computer-readable storage medium, characterized in that a computer program is stored in the computer-readable storage medium, and when the computer program is executed by a processor, the method steps according to any one of claims 1 to 4 are implemented.
CN201811536639.6A 2018-12-14 2018-12-14 Data retrieval method, data reordering method, device, terminal and storage medium Pending CN109657044A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811536639.6A CN109657044A (en) 2018-12-14 2018-12-14 Data retrieval method, data reordering method, device, terminal and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811536639.6A CN109657044A (en) 2018-12-14 2018-12-14 Data retrieval method, data reordering method, device, terminal and storage medium

Publications (1)

Publication Number Publication Date
CN109657044A true CN109657044A (en) 2019-04-19

Family

ID=66114283

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811536639.6A Pending CN109657044A (en) 2018-12-14 2018-12-14 Data retrieval method, data reordering method, device, terminal and storage medium

Country Status (1)

Country Link
CN (1) CN109657044A (en)



Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20190419