CN109657044A - Data retrieval method, data reordering method, device, terminal and storage medium - Google Patents
- Publication number
- CN109657044A (application CN201811536639.6A)
- Authority
- CN
- China
- Prior art keywords
- data
- retrieved
- term
- candidate word
- matching
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Embodiments of the invention provide a data retrieval method, a data reordering method, a device, a terminal and a storage medium. The data retrieval method includes: obtaining a search term, where the search term is used to retrieve data to be retrieved, the data to be retrieved has attribute information, and the attribute information includes the code, Chinese name, English name, alias, pinyin, search count and popularity of the data to be retrieved; matching the search term against the candidate words in a candidate word list, where the candidate words are determined from the attribute information of the data to be retrieved; and screening the data to be retrieved according to the matching result and a predetermined condition to obtain target data. The embodiments address the technical problem of how to improve data retrieval accuracy, so that the search results are highly relevant to what the user expects and the user can quickly obtain the data of interest. This improves the user experience and also achieves a hot-search effect.
Description
Technical field
The present invention relates to the technical field of data processing, and in particular to a data retrieval method, a data reordering method, a device, a terminal and a storage medium.
Background
As society develops, large volumes of rich data are produced. Faced with so much data, retrieval becomes essential for anyone who wants to find the data of interest.
In the prior art, data is generally retrieved by data code. However, this approach relies on a single search condition, so the data the user is interested in often cannot be ranked near the top of the search results. Retrieval accuracy is therefore poor, and the user cannot quickly obtain the data of interest.
The prior art thus suffers from poor data retrieval accuracy because it relies on a single search condition.
Summary of the invention
An object of the embodiments of the invention is to provide a data retrieval method that solves the technical problem of how to improve data retrieval accuracy. The embodiments of the invention also provide a data reordering method, a device, a terminal and a storage medium.
To achieve the above object, according to a first aspect of the invention, the following technical solution is provided:
A data retrieval method, comprising:
obtaining a search term, wherein the search term is used to retrieve data to be retrieved, the data to be retrieved has attribute information, and the attribute information includes the code, Chinese name, English name, alias, pinyin, search count and popularity of the data to be retrieved;
matching the search term against the candidate words in a candidate word list, wherein the candidate words are determined from the attribute information of the data to be retrieved; and
screening the data to be retrieved according to the matching result and a predetermined condition to obtain target data.
Further, after the step of obtaining the search term, the method also includes:
regularizing the search term.
Further, the step of matching the search term against the candidate words in the candidate word list specifically includes:
splitting the search term;
constructing the candidate word list with a ternary search tree, wherein the key-value pairs stored in the nodes of the ternary search tree are generated from the data to be retrieved and its attribute information; and
matching the split search term against the candidate words in the candidate word list based on a prefix matching strategy.
To achieve the above object, according to a second aspect of the invention, the following technical solution is also provided:
A data reordering method, comprising:
obtaining data to be sorted, wherein the data to be sorted is obtained by the data retrieval method according to the first aspect of the invention; and
arranging the data to be sorted according to a predetermined policy to obtain target data, wherein the predetermined policy includes one or more of the following: exact match of the data code, exact match of the data code after leading zeros are removed, exact match of the pinyin initials, prefix match of the data code and the data name, historical retrieval count, and data weight.
To achieve the above object, according to a third aspect of the invention, the following technical solution is also provided:
A data retrieval device, comprising:
a first obtaining module, configured to obtain a search term, wherein the search term is used to retrieve data to be retrieved, the data to be retrieved has attribute information, and the attribute information includes the code, Chinese name, English name, alias, pinyin, search count and popularity of the data to be retrieved;
a matching module, configured to match the search term against the candidate words in a candidate word list, wherein the candidate words are determined from the attribute information of the data to be retrieved; and
a screening module, configured to screen the data to be retrieved according to the matching result and a predetermined condition to obtain target data.
Further, the device also includes:
a regularization module, configured to regularize the search term.
Further, the matching module is specifically configured to:
split the search term;
construct the candidate word list with a ternary search tree, wherein the key-value pairs stored in the nodes of the ternary search tree are generated from the data to be retrieved and its attribute information; and
match the split search term against the candidate words in the candidate word list based on a prefix matching strategy.
To achieve the above object, according to a fourth aspect of the invention, the following technical solution is also provided:
A data sorting device, comprising:
a second obtaining module, configured to obtain data to be sorted, wherein the data to be sorted is obtained by the data retrieval device according to the third aspect of the invention; and
an arranging module, configured to arrange the data to be sorted according to a predetermined policy to obtain target data, wherein the predetermined policy includes one or more of the following: exact match of the data code, exact match of the data code after leading zeros are removed, exact match of the pinyin initials, prefix match of the data code and the data name, historical retrieval count, and data weight.
To achieve the above object, according to a fifth aspect of the invention, the following technical solution is also provided:
A terminal, comprising a processor, a communication interface, a memory and a communication bus, wherein the processor, the communication interface and the memory communicate with one another over the communication bus;
the memory is configured to store a computer program; and
the processor is configured to implement the method steps of the first or second aspect of the invention when executing the program stored in the memory.
To achieve the above object, according to a sixth aspect of the invention, the following technical solution is also provided:
A computer-readable storage medium storing a computer program, wherein the computer program, when executed by a processor, implements the method steps of the first or second aspect of the invention.
Embodiments of the invention provide a data retrieval method, a data reordering method, a device, a terminal and a storage medium. The data retrieval method includes: obtaining a search term, where the search term is used to retrieve data to be retrieved, the data to be retrieved has attribute information, and the attribute information includes the code, Chinese name, English name, alias, pinyin, search count and popularity of the data to be retrieved; matching the search term against the candidate words in a candidate word list, where the candidate words are determined from the attribute information of the data to be retrieved; and screening the data to be retrieved according to the matching result and a predetermined condition to obtain target data.
The embodiments match the search term against the attribute information of the data to be retrieved, that is, its code, Chinese name, English name, alias, pinyin, search count, popularity and so on. This improves the accuracy of data retrieval, so that the search results are highly relevant to what the user expects and the user can quickly obtain the data of interest. In addition, by taking search count and popularity into account through word frequency statistics, the embodiments achieve a hot-search effect and thereby improve the user experience.
So that the technical means of the invention can be understood more clearly and implemented according to the specification, and so that the above and other objects, features and advantages of the invention are easier to understand, preferred embodiments are described in detail below with reference to the accompanying drawings. Other features and advantages of the invention will be set forth in the following description, and in part will become apparent from the specification or be learned by practicing the invention. The objects and other advantages of the invention can be realized and obtained through the structure particularly pointed out in the specification, the claims and the drawings.
Brief description of the drawings
To explain the technical solutions of the embodiments of the invention or of the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. The drawings described below obviously show only some embodiments of the invention; those of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a flow diagram of the data retrieval method according to an embodiment of the invention;
Fig. 2 is a flow diagram of the data reordering method according to an embodiment of the invention;
Fig. 3 is a structural diagram of the data retrieval device according to an embodiment of the invention;
Fig. 4 is a structural diagram of the data sorting device according to an embodiment of the invention.
Detailed description of the embodiments
Embodiments of the invention are described below through specific examples, and those skilled in the art can readily understand other advantages and effects of the invention from the content disclosed in this specification. The described embodiments are obviously only some, not all, of the embodiments of the invention. The invention may also be implemented or applied through other, different embodiments, and the details in this specification may be modified or changed from different viewpoints and for different applications without departing from the spirit of the invention. It should be noted that, where no conflict arises, the following embodiments and the features in them may be combined with one another. All other embodiments obtained by those of ordinary skill in the art from the embodiments of the invention without creative effort fall within the scope of protection of the invention.
It should be noted that various aspects of embodiments within the scope of the appended claims are described below. It should be apparent that the aspects described herein may be embodied in a wide variety of forms, and that any specific structure and/or function described herein is merely illustrative. Based on the invention, those skilled in the art will understand that an aspect described herein may be implemented independently of any other aspect, and that two or more of these aspects may be combined in various ways. For example, a device may be implemented and/or a method practiced using any number of the aspects set forth herein. In addition, such a device may be implemented and/or such a method practiced using structures and/or functionality other than one or more of the aspects set forth herein.
It should also be noted that the drawings provided with the following embodiments illustrate the basic concept of the invention only schematically. The drawings show only the components related to the invention rather than the number, shape and size of the components in an actual implementation; in an actual implementation the form, quantity and proportion of each component may vary freely, and the component layout may be more complex.
In addition, specific details are provided in the following description to give a thorough understanding of the examples. However, those skilled in the art will understand that the aspects can be practiced without these specific details.
As the entry point for individual stocks and for adding stocks to a watchlist, stock data retrieval strongly affects the user experience through its accuracy and relevance. Existing data retrieval methods generally retrieve by matching the stock data name, so the results reflect neither how popular the retrieved stock is nor how relevant it is to the search term. Results with low relevance may even be ranked first while highly relevant results are ranked later.
When retrieving data, the prior art generally matches on the data code. Take stock data as an example (the data could equally be other securities data such as futures, consumption statistics, tourism distribution data, dressing index data and so on): the prior art usually retrieves the relevant stock data by stock code. For example, if "51" is used as the search term and retrieval is by stock code, the following results are returned: Yinhua Rili (511880), Kangmei Pharmaceutical (600518) and so on. In fact, however, the user is interested in 51Talk (COE). In other words, data the user is not interested in is ranked near the top of the search results, while the data the user is interested in is ranked further down. This is the defect of poor retrieval accuracy: the user cannot quickly obtain the data of interest, and the user experience suffers.
It can be seen that, because the prior art retrieves data by matching data codes only, it suffers from poor data retrieval accuracy.
In view of this, to improve the accuracy of data retrieval, an embodiment of the invention provides a data retrieval method. As shown in Fig. 1, the data retrieval method mainly includes steps S100 to S120:
S100: Obtain a search term. The search term is used to retrieve data to be retrieved; the data to be retrieved has attribute information; the attribute information includes the code, Chinese name, English name, alias, pinyin, search count and popularity of the data to be retrieved.
In this step, the user can enter the search term on a terminal to perform a data search. The terminal includes, but is not limited to, a smartphone, a computer, a tablet, a smart TV or a wearable device.
The attribute information of the data to be retrieved includes, but is not limited to, the code, Chinese name, English name, alias, pinyin, search count and popularity of the data to be retrieved. The prior art does not recognize that these attributes can influence the result of data retrieval. The embodiments of the invention take the code, Chinese name, English name, alias, pinyin, search count, popularity and other factors that influence the search results into account, which lays the foundation for improving retrieval accuracy.
The popularity of the data to be retrieved can be determined, for example, from the number of news reports, the number of media publications, the number of shares and so on.
By applying the search count and the popularity of the data to be retrieved to data retrieval, the embodiments of the invention can make the search results highly relevant to the result the user expects.
After this step, the data retrieval method may further include:
S101: Regularize the search term.
In this step, a regular expression may be constructed to logically filter the search term and extract the required content. The regular expression may be obtained by combining predetermined characters and/or character strings.
For example, the regularization may convert lowercase English letters to uppercase, convert Traditional Chinese to Simplified Chinese, remove word separators, and so on.
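The regularization of step S101 can be illustrated with a minimal Python sketch. The function name `normalize_term` and the separator set are assumptions made for illustration only; the specification does not give a concrete regular expression. Traditional-to-Simplified conversion is left out because it needs a character mapping table that the specification does not provide.

```python
import re

# Hypothetical separator set: whitespace plus a few common punctuation marks.
# The specification does not list the separators, so this is an assumption.
_SEPARATORS = re.compile(r"[\s\-_./,]+")


def normalize_term(term: str) -> str:
    """Remove separator characters, then convert letters to upper case."""
    return _SEPARATORS.sub("", term).upper()
```

With this sketch, `normalize_term(" baba ")` and `normalize_term("BA-BA")` both yield the same string, so differently typed inputs are matched against the same candidate words.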
S110: Match the search term against the candidate words in the candidate word list. The candidate words are determined from the attribute information of the data to be retrieved.
Specifically, step S110 may include steps S111 to S113:
S111: Split the data to be retrieved.
In this step, the data to be retrieved may be split using, for example, forward maximum matching, reverse maximum matching, maximum probability segmentation or maximum entropy segmentation.
For example, if the data to be retrieved is Chinese data, one Chinese character is used as the segmentation unit; if the data to be retrieved is English data, one word is used as the segmentation unit.
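The unit-level rule just described (one Chinese character per unit, one English word per unit) can be sketched as follows. This is only an illustration of that rule, not of the maximum matching or statistical segmentation methods named above; the function name `split_units` and the choice to treat a run of ASCII letters and digits as one "word" are assumptions.

```python
import re
from typing import List

# One CJK ideograph (basic block U+4E00..U+9FFF) is a unit on its own;
# a run of ASCII letters or digits is treated as one word-level unit.
_UNIT = re.compile(r"[\u4e00-\u9fff]|[A-Za-z0-9]+")


def split_units(text: str) -> List[str]:
    """Split text into per-character Chinese units and per-word English units."""
    return _UNIT.findall(text)
```

For instance, `split_units("阿里巴巴 BABA")` produces four single-character units followed by the single unit `"BABA"`.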
S112: Construct the candidate word list with a ternary search tree. The key-value pairs stored in the nodes of the ternary search tree are generated from the data to be retrieved and its attribute information.
For example, the value stored in each node of the ternary search tree (also referred to as a prefix tree) is generated from the data to be retrieved, and the key stored in each node is generated from the attribute information of the data to be retrieved.
S113: Match the split search term against the candidate words in the candidate word list based on a prefix matching strategy.
Because matching in this step uses a prefix matching strategy, all right prefixes (the strings that remain after leading characters are dropped) are also added to the candidate word list.
For example, for BABA (阿里巴巴, Alibaba), the prefixes "BABA", "ALIBABA", "阿里巴巴" and "ALBB" are generated, together with right prefixes such as "里巴巴", "巴巴", "LBB" and "BB". To improve the accuracy of data retrieval, these right prefixes are also added to the candidate word list.
S120: Screen the data to be retrieved according to the matching result and a predetermined condition to obtain target data.
The predetermined condition may be a condition set by the user, for example market data of securities, type data of securities, music chart data or book sales data.
In summary, unlike the prior art, which uses a single matching factor, the embodiments of the invention match the search term against the attribute information of the data to be retrieved (including but not limited to its code, Chinese name, English name, alias, pinyin, search count and popularity). This improves the accuracy of data retrieval, so that the search results are highly relevant to what the user expects and the user can quickly obtain the data of interest. In addition, by taking search count and popularity into account through word frequency statistics, the embodiments achieve a hot-search effect and thereby improve the user experience.
In addition, an embodiment of the invention also provides a data reordering method. As shown in Fig. 2, the data reordering method mainly includes:
S200: Obtain data to be sorted.
The data to be sorted in this step can be obtained by the data retrieval method embodiment described above.
S210: Arrange the data to be sorted according to a predetermined policy to obtain target data. The predetermined policy includes one or more of the following: exact match of the data code, exact match of the data code after leading zeros are removed, exact match of the pinyin initials, prefix match of the data code and the data name, historical retrieval count, and data weight.
The data weight can be determined from how often or to what degree the data has been published, reported, followed and so on. For example, if the data has high news popularity and high user attention, it can be given a high weight.
In practical applications, the data to be sorted may be arranged according to the items of the predetermined policy, applied in a predetermined order or in random order, to obtain the target data as the ranking result.
To aid understanding of the invention, the present embodiment is described in detail below with specific examples.
When the user searches for "GOOG" as the search term and both "GOOG" and "GOOGL" exist, then under the strategy of exact match of the data code, "GOOG" has the higher priority (that is, the higher relevance to the search term), so "GOOG" is ranked before "GOOGL". When the user searches for "5" and the following data exist: 00005, 000005, 57000 and 57001, then under the strategy of exact match of the data code after leading zeros are removed, 00005 and 000005 are ranked before 57000 and 57001. When the user searches for "PG" and "PG", "AAPL" and "PGC" exist, then under the strategies of exact match of the data code and exact match of the pinyin initials, "PG" and "AAPL" (Apple, whose Chinese name 苹果 has the pinyin initials "PG") are ranked before "PGC". When the user searches for "51" and "COE" (that is, 51Talk) and 510010 exist, then under the strategy of prefix match of the data code and the data name, COE is ranked before 510010.
If the data to be retrieved match the search term to the same degree (that is, exact match of the data code, exact match of the data code after leading zeros are removed, exact match of the pinyin initials, and prefix match of the data code and the data name are either all satisfied or all not satisfied), then the historical retrieval count and the data weight determine which data are ranked first.
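The reordering policy can be sketched as a composite sort key in Python. This is a sketch under stated assumptions: the record fields, the `rank_key` and `reorder` names, and the strict priority order of the criteria are all hypothetical, since the specification allows the policy items to be applied in a predetermined or random order.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Record:
    code: str
    name: str = ""
    pinyin_initials: str = ""
    search_count: int = 0     # historical retrieval count
    weight: float = 0.0       # data weight, e.g. derived from news popularity


def rank_key(term: str, r: Record) -> Tuple:
    """Smaller tuples sort first; False sorts before True."""
    t = term.upper()
    code = r.code.upper()
    exact_code = code == t
    exact_code_no_zeros = code.lstrip("0") == t.lstrip("0")
    exact_initials = r.pinyin_initials.upper() == t
    prefix = code.startswith(t) or r.name.upper().startswith(t)
    return (
        not exact_code,
        not exact_code_no_zeros,
        not exact_initials,
        not prefix,
        -r.search_count,      # tie-breaker: more searches first
        -r.weight,            # tie-breaker: higher weight first
    )


def reorder(term: str, records: List[Record]) -> List[Record]:
    return sorted(records, key=lambda r: rank_key(term, r))
```

In this sketch the first four criteria form the match level and the last two act as tie-breakers, so records that tie on every match criterion are separated only by retrieval count and then weight.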
In summary, by presenting the search results according to the predetermined policy, the present embodiment ranks highly relevant results first so that the user sees the data of greatest interest first, which improves the user experience.
Although the steps of the data retrieval method and data reordering method embodiments are described above in a particular order, those skilled in the art will appreciate that the steps need not be executed in that order; they may also be executed in reverse order, in parallel, interleaved or in other orders. Those skilled in the art may also add further steps to the steps above. Such obvious variations or equivalent substitutions also fall within the scope of protection of the invention and are not described further here.
Device embodiments of the invention follow. They are used to carry out the steps of the method embodiments of the invention. For ease of description, only the parts relevant to the embodiments of the invention are shown; for technical details not disclosed here, refer to the method embodiments. The functional units in each device embodiment may be integrated into one processing unit, may each exist physically on their own, or two or more units may be integrated into one unit. An integrated unit may be implemented in hardware, or in hardware combined with software functional units.
Based on the same technical concept as the data retrieval method embodiment above, an embodiment of the invention also provides a data retrieval device. As shown in Fig. 3, the data retrieval device mainly includes a first obtaining module 31, a matching module 32 and a screening module 33. The first obtaining module 31 is configured to obtain a search term, where the search term is used to retrieve data to be retrieved, the data to be retrieved has attribute information, and the attribute information includes the code, Chinese name, English name, alias, pinyin, search count and popularity of the data to be retrieved. The matching module 32 is configured to match the search term against the candidate words in a candidate word list, where the candidate words are determined from the attribute information of the data to be retrieved. The screening module 33 is configured to screen the data to be retrieved according to the matching result and a predetermined condition to obtain target data.
The popularity of the data to be retrieved can be determined, for example, from the number of news reports, the number of media publications, the number of shares and so on. Through the first obtaining module 31, the embodiment of the invention thus applies the search count and the popularity of the data to be retrieved to data retrieval, which can make the search results highly relevant to the result the user expects.
The predetermined condition may be a condition set by the user, for example market data of securities, type data of securities, music chart data or book sales data.
In a preferred embodiment, the data retrieval device may also include a regularization module, which is configured to regularize the search term.
The regularization module may construct a regular expression to logically filter the search term and extract the required content. The regular expression may be obtained by combining predetermined characters and/or character strings.
In a preferred embodiment, the matching module 32 is specifically configured to: split the search term; construct the candidate word list with a ternary search tree, where the key-value pairs stored in the nodes of the ternary search tree are generated from the data to be retrieved and its attribute information; and match the split search term against the candidate words in the candidate word list based on a prefix matching strategy.
The matching module 32 may split the data to be retrieved using forward maximum matching, reverse maximum matching, maximum probability segmentation, maximum entropy segmentation and so on.
For details of the data retrieval device embodiment, refer to the description of the method embodiment above; they are not repeated here.
In summary, unlike the prior art, which uses a single matching factor, the embodiment of the invention uses the first obtaining module 31, the matching module 32 and the screening module 33 to match the search term against the attribute information of the data to be retrieved (including but not limited to its code, Chinese name, English name, alias, pinyin, search count and popularity). This improves the accuracy of data retrieval, so that the search results are highly relevant to what the user expects and the user can quickly obtain the data of interest. In addition, by taking search count and popularity into account through word frequency statistics, the embodiment achieves a hot-search effect and thereby improves the user experience.
In addition, an embodiment of the invention also provides a data sorting device. As shown in Fig. 4, the data sorting device mainly includes a second obtaining module 41 and an arranging module 42. The second obtaining module 41 is configured to obtain data to be sorted, where the data to be sorted is obtained by the data retrieval device described above. The arranging module 42 is configured to arrange the data to be sorted according to a predetermined policy to obtain target data, where the predetermined policy includes one or more of the following: exact match of the data code, exact match of the data code after leading zeros are removed, exact match of the pinyin initials, prefix match of the data code and the data name, historical retrieval count, and data weight.
For the specific implementation process of this embodiment, the problem it solves and the technical effect it achieves, refer to the related description in the method embodiment above; they are not repeated here.
Using the second obtaining module 41 and the arranging module 42, the embodiment of the invention presents the search results according to the predetermined policy and ranks highly relevant results first so that the user sees the data of greatest interest first, which improves the user experience.
Based on the same technical concept as the data retrieval method or the data reordering method above, an embodiment of the invention also provides a terminal that includes a processor and a memory. The memory is configured to store a computer program. The processor is configured to implement, when executing the program stored in the memory, the method steps of any of the technical solutions described in the data retrieval method embodiment or the data reordering method embodiment.
The processor may include one or more processing cores, for example a 4-core processor or an 8-core processor. The processor may be implemented in at least one hardware form among DSP (Digital Signal Processing), FPGA (Field-Programmable Gate Array), and PLA (Programmable Logic Array). The processor may also include a main processor and a coprocessor: the main processor, also called the CPU (Central Processing Unit), processes data in the awake state, while the coprocessor is a low-power processor that processes data in the standby state. In some embodiments, the processor may be integrated with a GPU (Graphics Processing Unit), which is responsible for rendering and drawing the content to be shown on the display screen. In some embodiments, the processor may also include an AI (Artificial Intelligence) processor, which handles computing operations related to machine learning.
The above memory may include random access memory (RAM), and may also include non-volatile memory (NVM), for example at least one magnetic disk storage device. Optionally, the memory may also be at least one storage device located remotely from the aforementioned processor.
In some embodiments, the terminal optionally further includes a peripheral device interface and at least one peripheral device. The processor, the memory, and the peripheral device interface may be connected by a bus or signal lines. Each peripheral device may be connected to the peripheral device interface by a bus, a signal line, or a circuit board.
For the specific implementation process of this embodiment and the details of the problems it solves, refer to the related description in the foregoing method embodiments; details are not repeated here.
In the terminal provided by the embodiment of the present invention, when the processor executes the program stored in the memory, the search term is matched using the attribute information of the data to be retrieved, such as its code, Chinese name, English name, alias, pinyin, search count, and popularity. This improves the accuracy of data retrieval, so that the search results are highly relevant to what the user expects and the user can quickly obtain the data of interest. Moreover, the embodiment of the present invention takes search count and popularity into account and, through word frequency statistics, achieves the technical effect of hot search, thereby improving the user experience.
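As a rough sketch of the flow described above, the following builds candidate words from record attributes, normalizes the search term, matches by prefix, and uses a running hit count as a simple stand-in for the search-count and popularity signal. The field names, the NFKC-based normalization, and the functions `build_candidates`, `normalize`, and `retrieve` are assumptions introduced for illustration; the source does not specify them.

```python
import unicodedata
from collections import Counter, defaultdict

# Attribute fields that contribute candidate words (assumed names).
FIELDS = ("code", "chinese_name", "english_name", "alias", "pinyin")


def normalize(term):
    """Fold full-width characters to half-width, trim, and lowercase."""
    return unicodedata.normalize("NFKC", term).strip().lower()


def build_candidates(records):
    """Map each candidate word to the codes of the records it came from."""
    index = defaultdict(set)
    for rec in records:
        for field in FIELDS:
            value = rec.get(field)
            if value:
                index[normalize(value)].add(rec["code"])
    return index


def retrieve(term, index, counts):
    """Prefix-match `term` against candidate words and rank hits by frequency."""
    q = normalize(term)
    hits = set()
    if q:
        for word, codes in index.items():
            if word.startswith(q):
                hits |= codes
    counts.update(hits)  # word-frequency style statistic: one more hit per record
    # Most frequently retrieved records first; ties broken by code.
    return sorted(hits, key=lambda c: (-counts[c], c))
```

The linear scan over candidate words is only for brevity; a prefix-oriented structure such as a ternary search tree avoids visiting every candidate.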
Based on the same technical concept as the above data retrieval method or data reordering method, an embodiment of the present invention further provides a computer-readable storage medium. A computer program is stored in the computer-readable storage medium, and when the computer program is executed by a processor, it implements any method step of the technical solutions described in the data retrieval method embodiments or the data reordering method embodiments.
The above computer-readable storage medium may include, but is not limited to, random access memory (RAM), dynamic random access memory (DRAM), static random access memory (SRAM), read-only memory (ROM), programmable read-only memory (PROM), erasable programmable read-only memory (EPROM), electrically erasable programmable read-only memory (EEPROM), flash memory (for example, NOR flash or NAND flash), content-addressable memory (CAM), polymer memory (for example, ferroelectric polymer memory), phase-change memory, ovonic memory, silicon-oxide-nitride-oxide-silicon (SONOS) memory, magnetic or optical cards, or any other suitable type of computer-readable storage medium.
For the specific implementation process of this embodiment and the details of the problems it solves, refer to the related description in the foregoing method embodiments; details are not repeated here.
When the program on the computer-readable storage medium provided by the embodiment of the present invention is executed by a processor, the search term is matched using the attribute information of the data to be retrieved, such as its code, Chinese name, English name, alias, pinyin, search count, and popularity. This improves the accuracy of data retrieval, so that the search results are highly relevant to what the user expects and the user can quickly obtain the data of interest. Moreover, the embodiment of the present invention takes search count and popularity into account and, through word frequency statistics, achieves the technical effect of hot search, thereby improving the user experience.
The basic principles of the present disclosure have been described above in conjunction with specific embodiments. It should be noted, however, that the merits, advantages, effects, and the like mentioned in the present disclosure are merely examples and not limitations, and must not be regarded as prerequisites for every embodiment of the present disclosure. In addition, the specific details disclosed above serve only as examples and as an aid to understanding, not as limitations; they do not restrict the present disclosure to being implemented with those specific details.
It should be noted that, in this document, relational terms such as "first" and "second" are used only to distinguish one entity or operation from another, and do not necessarily require or imply any actual relationship or order between these entities or operations. Moreover, the terms "include", "comprise", or any other variant thereof are intended to cover a non-exclusive inclusion, so that a process, method, article, or device that includes a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to such a process, method, article, or device. Without further limitation, an element defined by the phrase "including a ..." does not exclude the presence of other identical elements in the process, method, article, or device that includes the element.
It should also be noted that, in the systems and methods of the present disclosure, each component or step may be decomposed and/or recombined. Such decompositions and/or recombinations should be regarded as equivalent solutions of the present disclosure.
The embodiments in this specification are described in a related manner; each embodiment focuses on its differences from the other embodiments, and the same or similar parts of the embodiments may be referred to one another. Various changes, substitutions, and alterations to the techniques described herein may be made without departing from the techniques taught by the appended claims. Furthermore, the scope of the claims of the present disclosure is not limited to the specific aspects of the processes, machines, manufacture, compositions of matter, means, methods, and acts described above. Processes, machines, manufacture, compositions of matter, means, methods, or acts, presently existing or later to be developed, that perform substantially the same function or achieve substantially the same result as the corresponding aspects described herein may be utilized. Accordingly, the appended claims include within their scope such processes, machines, manufacture, compositions of matter, means, methods, or acts.
The foregoing describes only preferred embodiments of the present invention and is not intended to limit the protection scope of the present invention. Any modification, equivalent replacement, improvement, and the like made within the spirit and principles of the present invention fall within the protection scope of the present invention.
Claims (10)
1. A data retrieval method, characterized by comprising:
obtaining a search term, wherein the search term is used to retrieve data to be retrieved, the data to be retrieved has attribute information, and the attribute information includes the code, Chinese name, English name, alias, pinyin, search count, and popularity of the data to be retrieved;
matching the search term against candidate words in a candidate word list, wherein the candidate words are determined according to the attribute information of the data to be retrieved; and
screening the data to be retrieved according to the matching result and a predetermined condition to obtain target data.
2. The data retrieval method according to claim 1, characterized in that, after the step of obtaining the search term, the method further comprises:
regularizing the search term.
3. The data retrieval method according to claim 1, characterized in that the step of matching the search term against the candidate words in the candidate word list specifically comprises:
segmenting the search term;
constructing the candidate word list by means of a ternary search tree, wherein the key-value pairs stored in the nodes of the ternary search tree are generated based on the data to be retrieved and its attribute information; and
matching, based on a prefix matching strategy, the segmented search term against the candidate words in the candidate word list.
4. A data reordering method, characterized by comprising:
obtaining data to be processed, wherein the data to be processed is obtained by the data retrieval method according to any one of claims 1 to 3; and
arranging the data to be processed according to a predetermined policy to obtain target data, wherein the predetermined policy includes one or more of the following: exact match on the data code, exact match on the data code after removing leading zeros, exact match on pinyin initials, prefix match on the data code and data name, historical retrieval count, and data weight.
5. A data retrieval device, characterized by comprising:
a first obtaining module, configured to obtain a search term, wherein the search term is used to retrieve data to be retrieved, the data to be retrieved has attribute information, and the attribute information includes the code, Chinese name, English name, alias, pinyin, search count, and popularity of the data to be retrieved;
a matching module, configured to match the search term against candidate words in a candidate word list, wherein the candidate words are determined according to the attribute information of the data to be retrieved; and
a screening module, configured to screen the data to be retrieved according to the matching result and a predetermined condition to obtain target data.
6. The data retrieval device according to claim 5, characterized in that the device further comprises:
a regularization module, configured to regularize the search term.
7. The data retrieval device according to claim 5, characterized in that the matching module is specifically configured to:
segment the search term;
construct the candidate word list by means of a ternary search tree, wherein the key-value pairs stored in the nodes of the ternary search tree are generated based on the data to be retrieved and its attribute information; and
match, based on a prefix matching strategy, the segmented search term against the candidate words in the candidate word list.
8. A data reordering device, characterized by comprising:
a second obtaining module, configured to obtain data to be processed, wherein the data to be processed is obtained by the data retrieval device according to any one of claims 5 to 7; and
an arranging module, configured to arrange the data to be processed according to a predetermined policy to obtain target data, wherein the predetermined policy includes one or more of the following: exact match on the data code, exact match on the data code after removing leading zeros, exact match on pinyin initials, prefix match on the data code and data name, historical retrieval count, and data weight.
9. A terminal, characterized by comprising a processor, a communication interface, a memory, and a communication bus, wherein the processor, the communication interface, and the memory communicate with one another via the communication bus;
the memory is configured to store a computer program; and
the processor is configured to implement the method steps of any one of claims 1 to 4 when executing the program stored in the memory.
10. A computer-readable storage medium, characterized in that a computer program is stored in the computer-readable storage medium, and the computer program, when executed by a processor, implements the method steps of any one of claims 1 to 4.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811536639.6A CN109657044A (en) | 2018-12-14 | 2018-12-14 | Data retrieval method, data reordering method, device, terminal and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811536639.6A CN109657044A (en) | 2018-12-14 | 2018-12-14 | Data retrieval method, data reordering method, device, terminal and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109657044A (en) | 2019-04-19 |
Family
ID=66114283
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811536639.6A Pending CN109657044A (en) | 2018-12-14 | 2018-12-14 | Data retrieval method, data reordering method, device, terminal and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109657044A (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101388012A (en) * | 2007-09-13 | 2009-03-18 | 阿里巴巴集团控股有限公司 | Phonetic check system and method with easy confusion tone recognition |
CN104125505A (en) * | 2014-06-23 | 2014-10-29 | 小米科技有限责任公司 | Television program processing method and device |
CN104268157A (en) * | 2014-09-03 | 2015-01-07 | 乐视网信息技术(北京)股份有限公司 | Device and method for error correction in data search |
CN106970936A (en) * | 2017-02-09 | 2017-07-21 | 阿里巴巴集团控股有限公司 | Data processing method and device, data query method and device |
CN108170852A (en) * | 2018-01-19 | 2018-06-15 | 深圳市富途网络科技有限公司 | A kind of stock searching method of efficiently and accurately |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110246592B (en) * | 2019-06-25 | 2023-07-14 | 山东浪潮智慧医疗科技有限公司 | Mapping method and system for realizing standardization of medical institution heterogeneous data value domain codes |
CN110246592A (en) * | 2019-06-25 | 2019-09-17 | 山东健康医疗大数据有限公司 | Realize the mapping method and system of medical institutions' isomeric data codomain code standardization |
CN110377831B (en) * | 2019-07-25 | 2022-05-17 | 拉扎斯网络科技(上海)有限公司 | Retrieval method, retrieval device, readable storage medium and electronic equipment |
CN110377830A (en) * | 2019-07-25 | 2019-10-25 | 拉扎斯网络科技(上海)有限公司 | Retrieval method, retrieval device, readable storage medium and electronic equipment |
CN110377831A (en) * | 2019-07-25 | 2019-10-25 | 拉扎斯网络科技(上海)有限公司 | Retrieval method, retrieval device, readable storage medium and electronic equipment |
CN110895585B (en) * | 2019-10-18 | 2022-08-23 | 深圳市富途网络科技有限公司 | Stock data acquisition method and device, terminal equipment and storage medium |
CN110895585A (en) * | 2019-10-18 | 2020-03-20 | 深圳市富途网络科技有限公司 | Stock data acquisition method and device, terminal equipment and storage medium |
CN111104375A (en) * | 2019-11-22 | 2020-05-05 | 泰康保险集团股份有限公司 | Authority rule editing method, system, equipment and storage medium |
CN111104375B (en) * | 2019-11-22 | 2023-06-09 | 泰康保险集团股份有限公司 | Nuclear protection rule editing method, system, equipment and storage medium |
CN111143661A (en) * | 2019-12-18 | 2020-05-12 | 深圳易伙科技有限责任公司 | Object-oriented semantic retrieval method and device |
CN113515940A (en) * | 2021-07-14 | 2021-10-19 | 上海芯翌智能科技有限公司 | Method and equipment for text search |
CN113515940B (en) * | 2021-07-14 | 2022-12-13 | 上海芯翌智能科技有限公司 | Method and equipment for text search |
CN113921082A (en) * | 2021-10-27 | 2022-01-11 | 云舟生物科技(广州)有限公司 | Gene search weight adjustment method, computer storage medium, and electronic device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication | Application publication date: 20190419 |