CN110413645A - Data search method, device, terminal and computer readable storage medium - Google Patents

Data search method, device, terminal and computer readable storage medium Download PDF

Info

Publication number
CN110413645A
CN110413645A CN201910533692.9A CN201910533692A CN110413645A CN 110413645 A CN110413645 A CN 110413645A CN 201910533692 A CN201910533692 A CN 201910533692A CN 110413645 A CN110413645 A CN 110413645A
Authority
CN
China
Prior art keywords
data
keyword
search
text information
occurrence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910533692.9A
Other languages
Chinese (zh)
Inventor
杨小彦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ping An Puhui Enterprise Management Co Ltd
Original Assignee
Ping An Puhui Enterprise Management Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Puhui Enterprise Management Co Ltd filed Critical Ping An Puhui Enterprise Management Co Ltd
Priority to CN201910533692.9A priority Critical patent/CN110413645A/en
Publication of CN110413645A publication Critical patent/CN110413645A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • G06F16/24564Applying rules; Deductive queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2468Fuzzy queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/3332Query translation
    • G06F16/3334Selection or weighting of terms from queries, including natural language queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Computational Linguistics (AREA)
  • Fuzzy Systems (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Automation & Control Theory (AREA)
  • Artificial Intelligence (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a kind of data search methods, comprising: obtains the text information for needing to search for, and extracts keyword from text information;File is named with keyword to corresponding according to keyword query;All preset data numbers, composition number set a, wherein data number is corresponding with a unique data, and data number is for searching for data are obtained from file;Corresponding data are searched for according to the data number in number set.The present invention also provides a kind of data serching device, terminal and computer readable storage mediums.Technical solution proposed by the present invention scans for the data in database based on data query, file is scanned for using keyword, since unique data number corresponding with data is stored in the file of keyword name, therefore, it can use operating system and efficient index carried out to data, so as to improve the search efficiency of data search and the accuracy rate of search result.

Description

Data search method, device, terminal and computer readable storage medium
Technical field
The present invention relates to data searching technology field more particularly to a kind of data search method, device, terminal and computers Readable storage medium storing program for executing.
Background technique
Currently, to the data in database scan for inquiry when, developer generally use the like of database into Row fuzzy query, when data volume is larger, due to that must carry out full table scan inquiry, the efficiency of inquiry is very low.Also, Due to it is matched in the way of search of fuzzy matching be complete sentence data search content, qualified object search may be very It is few, so that the data of search may be less accurate.
Therefore, it is existing carry out the mode inefficiency of data search in the database and search result inaccuracy be it is a kind of urgently Problem to be solved.
Summary of the invention
The main purpose of the present invention is to provide a kind of data search method, device, terminal and computer-readable storage mediums Matter, it is intended to which the technology for solving the existing mode inefficiency and search result inaccuracy for carrying out data search in the database is asked Topic.
To achieve the above object, the present invention provides a kind of data search method, and the data search method includes:
The text information for needing to search for is obtained, and extracts keyword from the text information;
According to the keyword query to the corresponding file named with keyword;
All preset data numbers, composition number set a, wherein data number are obtained from the file It is corresponding with a unique data, the data number is for searching for the data;
Corresponding data are searched for according to the data number in the number set.
Preferably, described the step of searching for corresponding data according to the data number numbered in set, includes:
Obtain the frequency of occurrence of each data number in number set;
Data are scanned for by from high to low sequence according to the frequency of occurrence;
The data searched are carried out by the frequency of occurrence from high to low sequence.
Preferably, after described the step of obtaining the frequency of occurrence of each data number in number set, further includes:
Each data number is carried out by frequency of occurrence from high to low sequence;
In the ranking since the highest data number of frequency of occurrence, the data number of predetermined number is extracted;
Corresponding data are searched for according to the data number of predetermined number.
Preferably, described the step of obtaining the frequency of occurrence of each data number in number set, includes:
Obtain the corresponding default weight of keyword;
By the corresponding default weight of the data number obtained from the file named using keyword as weight number;
It is added the corresponding weight number of each data number to obtain corresponding frequency of occurrence.
Preferably, described to obtain the text information for needing to search for, and the step of keyword is extracted from the text information Suddenly include:
The text information for needing to search for is obtained, and the text information is subjected to matching in preset keywords database and is looked into It askes;
If there is the word to match with the character in the text information in preset keywords database, the character is mentioned It is taken as keyword;
If the word not matched with the character in the text information in preset keywords database, generation can not be mentioned Take the prompt information of keyword.
Preferably, described to obtain the text information for needing to search for, and the step of keyword is extracted from the text information Before rapid, further includes:
To one unique data number of each data definition in database;
The keyword and data number in each data are extracted, and the keyword and the data number are closed Connection;
The file named with keyword is established in keywords database;
Data number corresponding with keyword is saved into the file named with keyword.
Preferably, the keyword and data number extracted in each data, and by the keyword and the number The step of being associated according to number include:
The content of each data is segmented;
The frequency of occurrence of all words after obtaining participle;
Frequency of occurrence is greater than the participle of preset times as keyword;
The data number of each data is extracted, and the keyword and the data number are associated.
In addition, the present invention also provides a kind of data serching device, the data serching device includes:
Extraction module, the extraction module is used to obtain the text information for needing to search for, and mentions from the text information Take out keyword;
Enquiry module, the enquiry module are used for according to the keyword query to the corresponding file named with keyword Folder;
Module is obtained, the acquisition module is formed for obtaining all preset data numbers from the file Number set a, wherein data number is corresponding with a unique data, and the data number is for searching for the data;
Search module, described search module are used to search for corresponding data according to the data number in the number set.
The present invention also provides a kind of terminal, including processor, memory and be stored on the memory can be by institute State the data search program of processor execution, wherein when the data search program is executed by the processor, realize institute as above The step of data search method stated.
The present invention also provides a kind of computer readable storage medium, data are stored on the computer readable storage medium Search program, wherein when the data search program is executed by processor, realize the step of data search method as described above Suddenly.
In technical solution of the present invention, the text information for needing to search for is obtained, and keyword is extracted from text information;Root File is named with keyword to corresponding according to keyword query;All preset data numbers, group are obtained from file Gather at number, wherein a data number is corresponding with a unique data, and data number is for searching for data;According to number Data number in set searches for corresponding data.Technical solution proposed by the present invention is based on data query to the number in database According to scanning for, first from needing to extract keyword in the text information searched for, found further according to keyword corresponding with key The file of word name, is finally searched according to unique data number corresponding with each data in file in the database Rope is to corresponding data, that is, when user needs to search for corresponding data by a certain text information, due to corresponding with data Unique data number is stored in the file of keyword name, to find after the file of keyword name, so that it may To obtain wherein preset all data numbers, then corresponding number can directly and accurately be found by data number According to therefore, the application can use operating system and carry out efficient index to data, improve search efficiency and the search of data search As a result accuracy rate.
Detailed description of the invention
Fig. 1 is the hardware structural diagram of terminal involved in the embodiment of the present invention;
Fig. 2 is the flow diagram of data search method first embodiment of the present invention;
Fig. 3 is the text information for needing to search for be obtained in the embodiment of the present invention, and pass is extracted from the text information The process refinement schematic diagram of the step of keyword;
Fig. 4 is the step of searching for corresponding data according to the data number in the number set in the embodiment of the present invention Process refinement schematic diagram;
Fig. 5 is the process refinement signal that the frequency of occurrence of each data number in number set is obtained in the embodiment of the present invention Figure;
Fig. 6 is the flow diagram of data search method second embodiment of the present invention;
Fig. 7 is the flow diagram of data search method 3rd embodiment of the present invention;
Fig. 8 is that keyword and data number in each data are extracted in the embodiment of the present invention, and by the keyword The process refinement schematic diagram for the step of being associated with the data number;
Fig. 9 is the module diagram of data serching device of the present invention.
The embodiments will be further described with reference to the accompanying drawings for the realization, the function and the advantages of the object of the present invention.
Specific embodiment
It should be appreciated that the specific embodiments described herein are merely illustrative of the present invention, it is not intended to limit the present invention.
The present embodiments relate to data search method be mainly used in terminal, which can be PC, portable calculating The equipment that machine, mobile terminal etc. have display and processing function.
Referring to Fig.1, Fig. 1 is terminal structure schematic diagram involved in the embodiment of the present invention.In the embodiment of the present invention, eventually End may include processor 1001 (such as CPU), communication bus 1002, user interface 1003, network interface 1004, memory 1005.Wherein, communication bus 1002 is for realizing the connection communication between these components;User interface 1003 may include display Shield (Display), input unit such as keyboard (Keyboard);Network interface 1004 optionally may include that the wired of standard connects Mouth, wireless interface (such as WI-FI interface);Memory 1005 can be high speed RAM memory, be also possible to stable memory (non-volatile memory), such as magnetic disk storage, memory 1005 optionally can also be independently of aforementioned processor 1001 storage device.
It will be understood by those skilled in the art that hardware configuration shown in Fig. 1 does not constitute the restriction to equipment, can wrap It includes than illustrating more or fewer components, perhaps combines certain components or different component layouts.
With continued reference to Fig. 1, the memory 1005 in Fig. 1 as a kind of computer readable storage medium may include operation system System, network communication module and data search program.
In Fig. 1, network communication module is mainly used for connecting server, carries out data communication with server;And processor 1001 can call the data search program stored in memory 1005, and the step of executing data search method.
Based on the hardware configuration of above-mentioned terminal, each embodiment of data search method of the present invention is proposed.
The present invention provides a kind of data search method.
Referring to Fig. 2, in the first embodiment of the invention, data search method the following steps are included:
Step S100 obtains the text information for needing to search for, and extracts keyword from the text information;
When user needs to scan in the database, the available text information for needing to search for.Text information It can be one or more phrases, can be in short, be also possible to passage, for example, it is desired to which the text information of search can Think " television set of Sony's production ", after terminal gets the text information for needing to search for, can be mentioned from text information Keyword is taken out, only needs to extract " Sony " " television set " two keywords from " television set that Sony produces ".Tool Body, in one embodiment, text information can be carried out by matched mode in keywords database by predetermined keyword library To extract keyword.In another embodiment, Lucene and segmenter can be recycled from text by predetermined keyword library Customized keyword in keywords database is accurately extracted in this information, also, customized when that can not find in text information When keyword, the prompt information that can not extract keyword is generated, to remind user that can not scan for text information.
Specifically, referring to figure 3., Fig. 3 is to obtain to need the text information searched in the embodiment of the present invention, and from the text The process refinement schematic diagram for the step of keyword is extracted in this information, based on the above embodiment, step S100 includes:
Step S110 is obtained and is needed the text information searched for, and by the text information in preset keywords database into Row matching inquiry;
It should be noted that there being keywords database in advance in the terminal, the keyword in keywords database is from database Data in extract, that is, the keyword extracted from each data in database is stored in keywords database.
After getting and needing the text information searched for, text information can be carried out in preset keywords database With inquiry, specifically, word segmentation processing can be carried out to text information, obtain different participles.For example, when text information is " rope The television set of Buddhist nun's production ", the character branched away by participle technique can be " Sony ", " production ", " television set ".It will branch away Character matching inquiry is carried out in keywords database, judge in keywords database with the presence or absence of word with the character match branched away Language.
Step S120 will if there is the word to match with the character in the text information in preset keywords database The character is extracted as keyword;
Specifically, if there is the word to match with the character in text information in keywords database, by text information In character be extracted as keyword.For example, when the text information for needing to search for is " television set of Sony's production ", and work as keyword When having " Sony " " television set " two words in library, " Sony ", " television set " are extracted as keyword.
Step S130, if the word not matched with the character in the text information in preset keywords database, Generate the prompt information that can not extract keyword.
Specifically, if the word to match with the character in text information can not be found in preset keywords database, Data corresponding with the text message are not present in database of descriptions, in this case, terminal, which can be generated, to be extracted The prompt information of keyword, and it is shown in terminal interface, to remind user that can not scan for text information.
Step S200 names file to corresponding according to the keyword query with keyword;
It should be noted that the data stored in database are all defined with a unique data number, it will be in data After keyword extraction comes out, data number corresponding with data is stored to the file of the keyword name extracted In, if two data have identical keyword, the corresponding data number of two data is stored in the same keyword and is ordered In the file of name.Therefore, after extracting the keyword in text information, can use the keyword find it is corresponding with The file of keyword name.
Step S300 obtains all preset data numbers, composition number set, wherein one from the file A data number is corresponding with a unique data, and the data number is for searching for the data;
Specifically, after finding with the file of keyword name, all preset data are obtained from file Number, composition number set a, wherein data number is corresponding with a unique data, and data number is for searching for data.
It should be noted that the keyword extracted from text information may have multiple, can be looked for according to multiple keywords It is multiple to be stored with the data numbers of data in the files of keyword name to multiple files named with keyword, By All Files press from both sides in data number extract after, composition number set.In one embodiment, the data due to having Number is potentially stored in the file of different keyword names, and therefore, the same data number is likely to occur repeatedly.One In kind embodiment, the frequency of occurrence of each data number can have directly been recorded in number set.For example, if Data Data There is keyword in 1 (number 001): India, computer, Dell;The keyword of data 2 (number 002) has: India, Sony, TV Machine;The keyword of data 3 (number 003) has: China, Sony, camera;Then generate with India, computer, Dell, Sony, television set 5 text files of name are stored in each file: India (001,002), computer (001), Dell (001), Sony respectively (002,003), television set (002), Chinese (003), camera (003).When the text information that needs are searched for is " the electricity of Sony's production Depending on machine " when, the keyword extracted is " Sony ", " television set ", and the file found is Sony (002,003), television set (002), wherein data number 002 occurs 2 times, and 003 occurs 1 time.
Step S400 searches for corresponding data according to the data number in the number set.
Every data in database all has a unique data number, after constituting number set, according to volume Number set in data number can find data corresponding with text information.
Specifically, referring to figure 4., Fig. 4 is to be searched in the embodiment of the present invention according to the data number in the number set The process refinement schematic diagram of the step of corresponding data, based on the above embodiment, step S400 includes:
Step S410 obtains the frequency of occurrence of each data number in number set;
After composition number set, the search order of data number or in order to be carried out to the data that inquire in order to obtain Sequence, the available frequency of occurrence for numbering each data in set, in one embodiment, each data number go out occurrence Number just refers to the number that the data number is found in the file named with keyword.
In another embodiment, referring to figure 5., Fig. 5 is that each data in number set are obtained in the embodiment of the present invention The process refinement schematic diagram of the frequency of occurrence of number, based on the above embodiment, step S410 includes:
Step S411 obtains the corresponding default weight of keyword;
Specifically, when user needs to search for the corresponding data of a certain text information in the database, user can basis Specific demand sets the weight of each keyword, that is, can be in terminal after extracting the keyword in text information The prompt information of display interface pop-up setting weight, after terminal receives the input of user, obtains the corresponding default weight of keyword. For example, when needing the text information searched for is " television set and computer of Sony's production ", the keyword extracted be " Sony ", " television set ", " computer ", when user feel oneself mainly think search be data relevant to television set when, can be set to electricity Higher weight is set depending on machine, for example, the weight of setting " television set " is A, the weight of Sony is B, and the weight of computer is C.
Step S412, by the corresponding default weight of the data number obtained from the file named using keyword as power Weight number;
Specifically, after finding with the file of keyword name, by the corresponding default power of the data number in file Recast is weight number.It should be noted that due to a data number there may be it is several it is different with keyword name In file, the corresponding default weight of data number is as weight number, when being provided with different weights to different keywords Afterwards, available several different corresponding weight numbers.For example, when the data number in the file named with " television set " It is 002, with the data number in the file of " Sony " name for 002,003, with the data in the file of " computer " name Number is 003, and therefore, 002 corresponding default weight has A, B, and A, B is corresponding as the weight number of data number 002,003 Default weight have B, C, using B, C as the weight number of data number 003.
Step S413 is added the corresponding weight number of each data number to obtain corresponding frequency of occurrence.
Specifically, after obtaining the corresponding weight number of data number, by the corresponding all weight numbers of data number Addition obtains final frequency of occurrence.It should be noted that in the case where being provided with weight to keyword, each data number Corresponding final frequency of occurrence is the number less than 1, this acquisition is that frequency of occurrence is intended merely to obtain the inquiry of data number Sequentially or in order to be ranked up to the data inquired.
Step S420 scans for data by from high to low sequence according to the frequency of occurrence;
After getting the frequency of occurrence of data number, in one embodiment, it can be pressed according to the height of frequency of occurrence Sequence from high to low scans for the data in database, convenient for improving search efficiency.In another embodiment, in order to Search efficiency is further increased, multi-threaded parallel directly can be carried out to the data in database according to the data number of acquisition and searched Rope.
Step S430 is carried out the data searched from high to low sequence by the frequency of occurrence.
Specifically, it can be ranked up to the data result come is searched out, the corresponding number of the data number more than frequency of occurrence Matching degree according to the data for needing to search for user is higher, therefore, can be by the corresponding data of data number more than frequency of occurrence Front is come, the corresponding data of data number that frequency of occurrence is lacked are come below.That is, by the data searched according to corresponding The frequency of occurrence of data number carries out sequence from high to low.Preferably, data can be carried out from high to low by frequency of occurrence Sequence from top to bottom or from left to right, specific sortord can be set according to the actual situation, for example, can will own The data inquired are set in list from top to bottom by the height of frequency of occurrence.
Further, Fig. 6 is please referred to, Fig. 6 is the flow diagram of data search method second embodiment of the present invention, is based on Above-described embodiment, after step S410 further include:
Step S440 is carried out each data number from high to low sequence by frequency of occurrence;
Specifically, in one embodiment, user only may need to obtain the highest several datas of matching degree, in such feelings Under condition, user can preset the number for needing the data obtained, then by each data number in number set according to occurrence out Number is carried out from high to low sequence.
Step S450 extracts the data number of predetermined number in the ranking since the highest data number of frequency of occurrence;
Data number is being pressed into frequency of occurrence after high to low sequence, in the ranking from the highest data number of frequency of occurrence Start, extracts the data number of predetermined number.For example, it is assumed that data number has 001,002,003,004 4, when according to appearance Number from it is high to low data number is ranked up after the result is that when 001,003,002,004, if predetermined number is 1, The data number then extracted is 001;If predetermined number is 2, the data number extracted is 001,003;If default Number is 3, then the data number extracted is 001,003,002.
Step S460 searches for corresponding data according to the data number of predetermined number.
After data number is extracted, corresponding data are searched for according to the data number of predetermined number, that is, according to default The data number of number can search the data of predetermined number.
In addition, in another embodiment, if all had in the file of the different keywords name inquired same One data number can will be searched directly using the data number as the highest data number of matching degree by the data number The data that rope arrives are as the highest data of matching degree.
In technical solution of the present invention, the text information for needing to search for is obtained, and keyword is extracted from text information;Root File is named with keyword to corresponding according to keyword query;All preset data numbers, group are obtained from file Gather at number, wherein a data number is corresponding with a unique data, and data number is for searching for data;According to number Data number in set searches for corresponding data.Technical solution proposed by the present invention is based on data query to the number in database According to scanning for, first from needing to extract keyword in the text information searched for, found further according to keyword corresponding with key The file of word name, is finally searched according to unique data number corresponding with each data in file in the database Rope is to corresponding data, that is, when user needs to search for corresponding data by a certain text information, due to corresponding with data Unique data number is stored in the file of keyword name, to find after the file of keyword name, so that it may To obtain wherein preset all data numbers, then corresponding number can directly and accurately be found by data number According to therefore, the application can use operating system and carry out efficient index to data, improve search efficiency and the search of data search As a result accuracy rate.
Further, Fig. 7 is please referred to, Fig. 7 is the flow diagram of data search method 3rd embodiment of the present invention, is based on First embodiment, step S100 include: before
Step S500, to one unique data number of each data definition in database;
It specifically, can be first to each data in database when scanning for inquiry to the data in database A unique data number is all defined, can accurately be found in the database according to unique data number corresponding Data.
Step S600, extracts keyword and data number in each data, and by the keyword and the data Number is associated;
After to the complete data number of each data definition in database, can in each data keyword and The data number of each data extracts, and the keyword extracted is associated with data number.It needs to illustrate It is that there may be multiple keywords in a data, and a data only corresponds to a data number, therefore, when a data In extracted multiple keywords in the case where, corresponding with this data data number association of multiple keyword.
Step S700 establishes the file named with keyword in keywords database;
The file named with keyword is established in preset keywords database, that is, be just stored in keywords database from number All keywords extracted according to data each in library and the file named with keyword.
Step S800 saves data number corresponding with keyword into the file named with keyword.
After establishing file, data number corresponding with keyword is saved to the file named with keyword In, after a keyword is corresponding with multiple data numbers, the data number more having is stored in the text named with the keyword In part folder.
Further, Fig. 8 is please referred to, Fig. 8 is the keyword and data extracted in each data in the embodiment of the present invention Number, and the process refinement schematic diagram for the step of keyword and the data number are associated, are implemented based on third Example, step S600 include:
Step S610 segments the content of each data;
Specifically, each data in database can be segmented to obtain single word, for example, using IKanalyzer participle tool segments sentence.
Step S620, the frequency of occurrence of all words after obtaining participle;
After the word of that obtained after segmenting to each data, the frequency of occurrence of each word is obtained.
Frequency of occurrence is greater than the participle of preset times as keyword by step S630;
Specifically, preset times can be arranged in the terminal, preset when the frequency of occurrence of a certain word branched away is greater than When number, which can be extracted to the keyword as the data.
Step S640 extracts the data number of each data, and the keyword and the data number is carried out Association.
Extract the data number of each data, then by the data number extracted and the keyword that extracts into Row association, in order to which data number to be stored in the corresponding file named with keyword.
In addition, in another embodiment, the keyword abstraction (Topic- based on topic model can also be passed through Model), the side such as keyword abstraction (TextRank) of keyword abstraction and word-based graph model based on TF-IDF word frequency statistics Formula extracts the keyword of each data in database.
In addition, please referring to Fig. 9, the present invention also provides a kind of data serching device 10, the data serching device 10 includes:
Extraction module 20, the extraction module are used to obtain the text information for needing to search for, and from the text information Extract keyword;
Enquiry module 30, the enquiry module are used for according to the keyword query to the corresponding text named with keyword Part folder;
Module 40 is obtained, the acquisition module from the file for obtaining all preset data numbers, group Gather at number, wherein a data number is corresponding with a unique data, and the data number is for searching for the data;
Search module 50, described search module are used to search for corresponding number according to the data number in the number set According to.
Further, described search module 50 is also used to:
Obtain the frequency of occurrence of each data number in number set;
Data are scanned for by from high to low sequence according to the frequency of occurrence;
The data searched are carried out by the frequency of occurrence from high to low sequence.
Further, described search module 50 is also used to:
Each data number is carried out by frequency of occurrence from high to low sequence;
In the ranking since the highest data number of frequency of occurrence, the data number of predetermined number is extracted;
Corresponding data are searched for according to the data number of predetermined number.
Further, described search module 50 is also used to:
Obtain the corresponding default weight of keyword;
By the corresponding default weight of the data number obtained from the file named using keyword as weight number;
It is added the corresponding weight number of each data number to obtain corresponding frequency of occurrence.
Further, the extraction module 20 is also used to:
The text information for needing to search for is obtained, and the text information is subjected to matching in preset keywords database and is looked into It askes;
If there is the word to match with the character in the text information in preset keywords database, the character is mentioned It is taken as keyword;
If the word not matched with the character in the text information in preset keywords database, generation can not be mentioned Take the prompt information of keyword.
Further, the data serching device 10 further include:
Definition module, the definition module are used to compile each data definition one unique data in database Number;
Relating module, the relating module are used to extract the keyword and data number in each data, and will be described Keyword is associated with the data number;
Module is named, the name module in keywords database for establishing the file named with keyword;
Preserving module, the preserving module are used to save corresponding with keyword data number to being named with keyword In file.
Further, the relating module is also used to:
The content of each data is segmented;
The frequency of occurrence of all words after obtaining participle;
Frequency of occurrence is greater than the participle of preset times as keyword;
The data number of each data is extracted, and the keyword and the data number are associated.
Wherein, modules are opposite with each step in above-mentioned data search method embodiment in above-mentioned data serching device 10 It answers, function and realization process no longer repeat one by one here.
In addition, the present invention also provides a kind of computer readable storage mediums.
Data search program is stored on computer readable storage medium of the present invention, wherein data search program is processed When device executes, realize such as the step of above-mentioned data search method.
Wherein, data search program, which is performed realized method, can refer to each reality of data search method of the present invention Example is applied, details are not described herein again.
It should be understood by those skilled in the art that, the embodiment of the present invention can provide as method, system or computer program Product.Therefore, complete hardware embodiment, complete software embodiment or reality combining software and hardware aspects can be used in the present invention Apply the form of example.Moreover, it wherein includes the computer of computer usable program code that the present invention, which can be used in one or more, The computer program implemented in usable storage medium (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) produces The form of product.
The present invention be referring to according to the method for the embodiment of the present invention, the process of equipment (system) and computer program product Figure and/or block diagram describe.It should be understood that every one stream in flowchart and/or the block diagram can be realized by computer program instructions The combination of process and/or box in journey and/or box and flowchart and/or the block diagram.It can provide these computer programs Instruct the processor of general purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices to produce A raw machine, so that being generated by the instruction that computer or the processor of other programmable data processing devices execute for real The device for the function of being specified in present one or more flows of the flowchart and/or one or more blocks of the block diagram.
These computer program instructions, which may also be stored in, is able to guide computer or other programmable data processing devices with spy Determine in the computer-readable memory that mode works, so that it includes referring to that instruction stored in the computer readable memory, which generates, Enable the manufacture of device, the command device realize in one box of one or more flows of the flowchart and/or block diagram or The function of being specified in multiple boxes.
These computer program instructions also can be loaded onto a computer or other programmable data processing device, so that counting Series of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, thus in computer or The instruction executed on other programmable devices is provided for realizing in one or more flows of the flowchart and/or block diagram one The step of function of being specified in a box or multiple boxes.
It should be noted that in the claims, any reference symbol between parentheses should not be configured to power The limitation that benefit requires.Word "comprising" does not exclude the presence of component or step not listed in the claims.Before component Word "a" or "an" does not exclude the presence of multiple such components.The present invention can be by means of including several different components It hardware and is realized by means of properly programmed computer.In the unit claims listing several devices, these are filled Several in setting, which can be, to be embodied by the same item of hardware.The use of word first, second, and third is not Indicate any sequence.These words can be construed to title.
Although preferred embodiments of the present invention have been described, it is created once a person skilled in the art knows basic Property concept, then additional changes and modifications may be made to these embodiments.So it includes excellent that the following claims are intended to be interpreted as It selects embodiment and falls into all change and modification of the scope of the invention.
The above description is only a preferred embodiment of the present invention, is not intended to limit the scope of the invention, all at this Under the inventive concept of invention, using equivalent structure transformation made by description of the invention and accompanying drawing content, or directly/use indirectly It is included in other related technical areas in scope of patent protection of the invention.

Claims (10)

1. a kind of data search method, which is characterized in that the data search method includes:
The text information for needing to search for is obtained, and extracts keyword from the text information;
According to the keyword query to the corresponding file named with keyword;
All preset data numbers, composition number set a, wherein data number is corresponding are obtained from the file There is a unique data, the data number is for searching for the data;
Corresponding data are searched for according to the data number in the number set.
2. data search method as described in claim 1, which is characterized in that the data according in the number set are compiled Number search for corresponding data the step of include:
Obtain the frequency of occurrence of each data number in number set;
Data are scanned for by from high to low sequence according to the frequency of occurrence;
The data searched are carried out by the frequency of occurrence from high to low sequence.
3. data search method as claimed in claim 2, which is characterized in that described to obtain each data number in number set After the step of frequency of occurrence, further includes:
Each data number is carried out by frequency of occurrence from high to low sequence;
In the ranking since the highest data number of frequency of occurrence, the data number of predetermined number is extracted;
Corresponding data are searched for according to the data number of predetermined number.
4. data search method as claimed in claim 2, which is characterized in that each data number in the acquisition number set Frequency of occurrence the step of include:
Obtain the corresponding default weight of keyword;
By the corresponding default weight of the data number obtained from the file named using keyword as weight number;
It is added the corresponding weight number of each data number to obtain corresponding frequency of occurrence.
5. data search method as described in claim 1, which is characterized in that the text information for obtaining needs and searching for, and The step of extracting keyword from the text information include:
The text information for needing to search for is obtained, and the text information is subjected to matching inquiry in preset keywords database;
If there is the word to match with the character in the text information in preset keywords database, the character is extracted as Keyword;
If the word not matched with the character in the text information in preset keywords database, generation can not extract pass The prompt information of keyword.
6. data search method according to any one of claims 1 to 5, which is characterized in that described to obtain the text for needing to search for This information, and before the step of extracting keyword in the text information, further includes:
To one unique data number of each data definition in database;
The keyword and data number in each data are extracted, and the keyword and the data number are associated;
The file named with keyword is established in keywords database;
Data number corresponding with keyword is saved into the file named with keyword.
7. data search method as claimed in claim 6, which is characterized in that the keyword extracted in each data and Data number, and the step of keyword is associated with the data number includes:
The content of each data is segmented;
The frequency of occurrence of all words after obtaining participle;
Frequency of occurrence is greater than the participle of preset times as keyword;
The data number of each data is extracted, and the keyword and the data number are associated.
8. a kind of data serching device, which is characterized in that the data serching device includes:
Extraction module, the extraction module is used to obtain the text information for needing to search for, and extracts from the text information Keyword;
Enquiry module, the enquiry module are used for according to the keyword query to the corresponding file named with keyword;
Module is obtained, the acquisition module from the file for obtaining all preset data numbers, composition number Set a, wherein data number is corresponding with a unique data, and the data number is for searching for the data;
Search module, described search module are used to search for corresponding data according to the data number in the number set.
9. a kind of terminal, which is characterized in that including processor, memory and be stored on the memory can be described The data search program that processor executes, wherein when the data search program is executed by the processor, realize as right is wanted The step of data search method described in asking any one of 1 to 7.
10. a kind of computer readable storage medium, which is characterized in that be stored with data on the computer readable storage medium and search Suo Chengxu, wherein when the data search program is executed by processor, realize the number as described in any one of claims 1 to 7 The step of according to searching method.
CN201910533692.9A 2019-06-19 2019-06-19 Data search method, device, terminal and computer readable storage medium Pending CN110413645A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910533692.9A CN110413645A (en) 2019-06-19 2019-06-19 Data search method, device, terminal and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910533692.9A CN110413645A (en) 2019-06-19 2019-06-19 Data search method, device, terminal and computer readable storage medium

Publications (1)

Publication Number Publication Date
CN110413645A true CN110413645A (en) 2019-11-05

Family

ID=68359441

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910533692.9A Pending CN110413645A (en) 2019-06-19 2019-06-19 Data search method, device, terminal and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN110413645A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111291171A (en) * 2020-01-21 2020-06-16 南方电网能源发展研究院有限责任公司 Risk data searching method for critical engineering
CN111797201A (en) * 2020-06-23 2020-10-20 中民筑友建设科技集团有限公司 BIM (building information modeling) model acquisition method, device, equipment and computer readable storage medium
CN112507000A (en) * 2020-12-23 2021-03-16 深圳市普渡科技有限公司 Method and device for configuring target point of robot, electronic device and storage medium
CN114357030A (en) * 2022-01-04 2022-04-15 深圳市智百威科技发展有限公司 Big data storage system and method

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH1021265A (en) * 1996-06-28 1998-01-23 Kokusai Denshin Denwa Co Ltd <Kdd> Data base device
CN102456058A (en) * 2010-11-02 2012-05-16 阿里巴巴集团控股有限公司 Method and device for providing category information
CN108804642A (en) * 2018-06-05 2018-11-13 中国平安人寿保险股份有限公司 Search method, device, computer equipment and storage medium

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH1021265A (en) * 1996-06-28 1998-01-23 Kokusai Denshin Denwa Co Ltd <Kdd> Data base device
CN102456058A (en) * 2010-11-02 2012-05-16 阿里巴巴集团控股有限公司 Method and device for providing category information
CN108804642A (en) * 2018-06-05 2018-11-13 中国平安人寿保险股份有限公司 Search method, device, computer equipment and storage medium

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111291171A (en) * 2020-01-21 2020-06-16 南方电网能源发展研究院有限责任公司 Risk data searching method for critical engineering
CN111291171B (en) * 2020-01-21 2023-05-16 南方电网能源发展研究院有限责任公司 Dangerous engineering risk data searching method
CN111797201A (en) * 2020-06-23 2020-10-20 中民筑友建设科技集团有限公司 BIM (building information modeling) model acquisition method, device, equipment and computer readable storage medium
CN112507000A (en) * 2020-12-23 2021-03-16 深圳市普渡科技有限公司 Method and device for configuring target point of robot, electronic device and storage medium
WO2022134979A1 (en) * 2020-12-23 2022-06-30 深圳市普渡科技有限公司 Method and apparatus for configuring target point for robot, and electronic apparatus and storage medium
CN114357030A (en) * 2022-01-04 2022-04-15 深圳市智百威科技发展有限公司 Big data storage system and method

Similar Documents

Publication Publication Date Title
US10896212B2 (en) System and methods for automating trademark and service mark searches
CN110413645A (en) Data search method, device, terminal and computer readable storage medium
US7418443B2 (en) Question answering system, data search method, and computer program
CN111368042A (en) Intelligent question and answer method and device, computer equipment and computer storage medium
CN109299320A (en) A kind of information interacting method, device, computer equipment and storage medium
CN111898643A (en) Semantic matching method and device
US20130290138A1 (en) Search Method, Apparatus and Server for Online Trading Platform
CN113076423A (en) Data processing method and device and data query method and device
CN102750366A (en) Video search system and method based on natural interactive import and video search server
CN110209781B (en) Text processing method and device and related equipment
CN108345663A (en) A kind of news push method and apparatus
CN112699645A (en) Corpus labeling method, apparatus and device
CN112507139A (en) Knowledge graph-based question-answering method, system, equipment and storage medium
CN109815390A (en) Search method, device, computer equipment and the computer storage medium of multilingual information
CN104391969A (en) User query statement syntactic structure determining method and device
CN110795544B (en) Content searching method, device, equipment and storage medium
CN113220854B (en) Intelligent dialogue method and device for machine reading and understanding
CN106156262A (en) A kind of search information processing method and system
CN106407332B (en) Search method and device based on artificial intelligence
CN112036843A (en) Flow element positioning method, device, equipment and medium based on RPA and AI
CN111651554A (en) Insurance question-answer method and device based on natural language understanding and processing
CN107229675B (en) Question and answer base construction method, method, apparatus of answering and the system of list type knowledge
CN116662495A (en) Question-answering processing method, and method and device for training question-answering processing model
CN111078724A (en) Method, device and equipment for searching test questions in learning system and storage medium
CN1971557A (en) Glossary shared system and method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination