CN114416733A - Data retrieval processing method and device, electronic equipment and storage medium - Google Patents

Data retrieval processing method and device, electronic equipment and storage medium Download PDF

Info

Publication number
CN114416733A
CN114416733A CN202111649812.5A CN202111649812A CN114416733A CN 114416733 A CN114416733 A CN 114416733A CN 202111649812 A CN202111649812 A CN 202111649812A CN 114416733 A CN114416733 A CN 114416733A
Authority
CN
China
Prior art keywords
target
data
retrieval
index table
topic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111649812.5A
Other languages
Chinese (zh)
Inventor
张磊
王健
徐锐
甄青伟
周福可
焦松
马单
徐东明
槐正
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Telecom Corp Ltd
Original Assignee
China Telecom Corp Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Telecom Corp Ltd filed Critical China Telecom Corp Ltd
Priority to CN202111649812.5A priority Critical patent/CN114416733A/en
Publication of CN114416733A publication Critical patent/CN114416733A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiment of the invention provides a data retrieval processing method, a data retrieval processing device, electronic equipment and a storage medium, wherein the method comprises the following steps: for data located in different databases, target data belonging to the same service type may be obtained from the databases, then an initial index table for the target data may be obtained, an index field set for the service type may be included in the initial index table, then a target information entry may be extracted from the target data according to the index field, and the target information entry is added to the initial index table, so as to generate a target index table corresponding to the service type, and in the data retrieval process, a retrieval key corresponding to the data retrieval operation may be determined in response to the data retrieval operation, and a retrieval result corresponding to the retrieval key may be extracted from the target index table.

Description

Data retrieval processing method and device, electronic equipment and storage medium
Technical Field
The present invention relates to the field of computer technologies, and in particular, to a data retrieval processing method, a data retrieval processing apparatus, an electronic device, and a computer-readable storage medium.
Background
In the operation of an IT (Internet Technology ) system, a large amount of data is generated, including service data, operation logs, monitoring data and the like, which are valuable assets of an enterprise, and if the data is fully utilized, not only can the service level of a user be improved, the problem troubleshooting be accelerated, but also information which is very valuable for improving the service quality can be mined.
However, in the actual case of an enterprise, building a data retrieval system often faces several problems:
(1) data dispersion: in the construction of information systems, the construction of a chimney cannot be avoided in the initial stage, all systems are dispersed and independent, data are not communicated with each other, and even isolation is realized on a network layer.
(2) Data standards are inconsistent: even if the data are of the same type, the data standards are different, the quality of some data is not high, and even necessary statistical dimensions are lacked, so that uniform retrieval is not convenient.
(3) Potential safety hazard: because of the above two problems, if unified retrieval is performed, it is necessary to clean and aggregate the data together first, which increases the potential safety hazard of important data leakage.
In view of the above, a data retrieval method capable of solving or partially solving the above problems is required.
Disclosure of Invention
The embodiment of the invention provides a data retrieval processing method, a data retrieval processing device, electronic equipment and a computer readable storage medium, and aims to solve or partially solve the problems of low data retrieval efficiency and low accuracy.
The embodiment of the invention discloses a processing method for data retrieval, which comprises the following steps:
acquiring target data belonging to the same service type from preset different databases;
obtaining an initial index table for the target data, wherein the initial index table comprises index fields;
extracting a target information entry from the target data according to the index field, adding the target information entry to the initial index table, and generating a target index table corresponding to the service type;
and responding to data retrieval operation, determining a retrieval key word corresponding to the data retrieval operation, and extracting a retrieval result corresponding to the retrieval key word from the target index table.
Optionally, the obtaining target data belonging to the same service type from preset different databases includes:
acquiring a service identifier;
extracting target data corresponding to the service identification from different preset databases;
the service identification at least comprises one of a recharging identification, a query identification, a service handling identification, a consultation identification and a complaint identification.
Optionally, the target data includes a preset information entry, the preset information entry includes service information, the extracting a target information entry from the target data according to the index field, and adding the target information entry to the initial index table to generate a target index table corresponding to the service type, including:
extracting a preset information item corresponding to the service information successfully matched with the index field from the target data as a target information item, wherein the target information item comprises the service information corresponding to the index field;
and adding the service information of the target information entry to the initial index table according to the index field to generate a target index table corresponding to the service type.
Optionally, the index field includes a topic keyword, and the determining, in response to a data retrieval operation, a retrieval keyword corresponding to the data retrieval operation and extracting a retrieval result corresponding to the retrieval keyword from the target index table includes:
responding to a data retrieval operation, and determining a search keyword corresponding to the data retrieval operation;
and if the search keyword is not the topic keyword, extracting target data corresponding to the search keyword from the target index table, and displaying a first-level retrieval result page aiming at the target data.
Optionally, the determining, in response to a data retrieval operation, a retrieval key corresponding to the data retrieval operation, and extracting a retrieval result corresponding to the retrieval key from the target index table further includes:
if the search keyword is the topic keyword, determining the topic grade of the search keyword, extracting target data corresponding to the search keyword from the target index table, and displaying a target retrieval result page corresponding to the topic grade.
Optionally, the displaying the target retrieval result page corresponding to the topic grade includes:
if the topic grade corresponding to the topic keyword is a last grade, displaying a last grade retrieval result page corresponding to the topic grade;
if the topic grade corresponding to the topic keyword is a first grade, displaying a first grade retrieval result page corresponding to the first grade;
and if the topic grade corresponding to the topic keyword is not the first grade and not the last grade, displaying a retrieval result page corresponding to the last grade of the topic grade.
Optionally, the method further comprises:
acquiring the duration of a retrieval result page;
and if the duration is greater than or equal to a preset duration threshold, stopping responding to the access operation of the retrieval result page.
The embodiment of the invention also discloses a processing device for data retrieval, which comprises:
the target data acquisition module is used for acquiring target data belonging to the same service type from preset different databases;
an index table obtaining module, configured to obtain an initial index table for the target data, where the initial index table includes an index field;
a target index table generation module, configured to extract a target information entry from the target data according to the index field, add the target information entry to the initial index table, and generate a target index table corresponding to the service type;
and the retrieval result determining module is used for responding to data retrieval operation, determining a retrieval key word corresponding to the data retrieval operation and extracting a retrieval result corresponding to the retrieval key word from the target index table.
Optionally, the target data acquiring module includes:
a service identifier obtaining submodule for obtaining a service identifier;
the target data extraction submodule is used for extracting target data corresponding to the service identifier from different preset databases;
the service identification at least comprises one of a recharging identification, a query identification, a service handling identification, a consultation identification and a complaint identification.
Optionally, the target data includes a preset information entry, the preset information entry includes service information, and the target index table generating module includes:
an information entry determining submodule, configured to extract, from the target data, a preset information entry corresponding to the service information that is successfully matched with the index field, as a target information entry, where the target information entry includes the service information corresponding to the index field;
and the index table generation submodule is used for adding the service information of the target information entry to the initial index table according to the index field and generating a target index table corresponding to the service type.
Optionally, the index field includes a topic keyword, and the search result determining module includes:
the search keyword determining submodule is used for responding to data retrieval operation and determining a search keyword corresponding to the data retrieval operation;
and the first result page display sub-module is used for extracting target data corresponding to the search keyword from the target index table and displaying a first-level search result page aiming at the target data if the search keyword is not the topic keyword.
Optionally, the retrieval result determining module further includes:
and the second result page display sub-module is used for determining the topic grade of the search keyword if the search keyword is the topic keyword, extracting target data corresponding to the search keyword from the target index table, and displaying a target retrieval result page corresponding to the topic grade.
Optionally, the second result page display sub-module is specifically configured to:
if the topic grade corresponding to the topic keyword is a last grade, displaying a last grade retrieval result page corresponding to the topic grade;
if the topic grade corresponding to the topic keyword is a first grade, displaying a first grade retrieval result page corresponding to the first grade;
and if the topic grade corresponding to the topic keyword is not the first grade and not the last grade, displaying a retrieval result page corresponding to the last grade of the topic grade.
Optionally, the method further comprises:
the duration acquisition module is used for acquiring the duration of the retrieval result page;
and the page processing module is used for stopping responding to the access operation of the retrieval result page if the duration is greater than or equal to a preset duration threshold.
The embodiment of the invention also discloses electronic equipment which comprises a processor, a communication interface, a memory and a communication bus, wherein the processor, the communication interface and the memory finish mutual communication through the communication bus;
the memory is used for storing a computer program;
the processor is configured to implement the method according to the embodiment of the present invention when executing the program stored in the memory.
Also disclosed is a computer-readable storage medium having instructions stored thereon, which, when executed by one or more processors, cause the processors to perform a method according to an embodiment of the invention.
The embodiment of the invention has the following advantages:
in the embodiment of the present invention, for data located in different databases, target data belonging to the same service type may be first obtained from the databases, then an initial index table for the target data may be obtained, an index field set for the service type may be included in the initial index table, then a target information entry may be extracted from the target data according to the index field, and the target information entry is added to the initial index table, so as to generate a target index table corresponding to the service type, during data retrieval, a retrieval key corresponding to the data retrieval operation may be determined in response to the data retrieval operation, and a retrieval result corresponding to the retrieval key may be extracted from the target index table, so as to construct an index table corresponding to the service type for the target data of the same service type in different databases, and aggregate the target data through the index field in the index table, on one hand, the method is beneficial to searching the scattered data through the index table, the data surface of data searching is enlarged, the efficiency of data searching is improved, and on the other hand, the accuracy of data searching is improved by searching through the matching relation among the keywords.
Drawings
FIG. 1 is a flow chart illustrating the steps of a data retrieval processing method provided in an embodiment of the present invention;
FIG. 2 is a schematic diagram of a data retrieval architecture provided in an embodiment of the present invention;
FIG. 3 is a schematic diagram of a retrieval process provided in an embodiment of the present invention;
fig. 4 is a block diagram of a processing apparatus for data retrieval according to an embodiment of the present invention;
fig. 5 is a block diagram of an electronic device provided in an embodiment of the invention;
fig. 6 is a schematic diagram of a computer-readable medium provided in an embodiment of the invention.
Detailed Description
In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in further detail below.
In the operation of the IT system, a large amount of data can be generated, including service data, operation logs, monitoring data and the like, which are valuable assets of enterprises, and if the data are fully utilized, the service level of users can be improved, the problem troubleshooting can be accelerated, and information which is very valuable for improving the service quality can be mined. In the process of utilizing data, data retrieval is a very basic requirement, and a highly aggregated entrance is provided, so that system users can conveniently and quickly obtain data required by the users.
However, in the actual case of an enterprise, building a data retrieval system often faces several problems:
(1) data dispersion: in the construction of information systems, the construction of a chimney cannot be avoided in the initial stage, all systems are dispersed and independent, data are not communicated with each other, and even isolation is realized on a network layer.
(2) Data standards are inconsistent: even if the data are of the same type, the data standards are different, the quality of some data is not high, and even necessary statistical dimensions are lacked, so that uniform retrieval is not convenient.
(3) Potential safety hazard: because of the above two problems, if unified retrieval is performed, it is necessary to clean and aggregate the data together first, which increases the potential safety hazard of important data leakage.
For data retrieval, it is not independent of the search engine, and the full-text indexing technology is a key technology of the search engine. The principle of the full text search technology is that a word bank is defined, frequency and position information of each entry are searched in an article, and the corresponding frequency and position information are summarized according to the sequence of the word bank, so that an index which is equivalent to establishing the word bank as a directory for a file is obtained, and under the condition, the position where a word appears can be quickly positioned when the word is searched.
In the case of processing an english document, since english words are separated by spaces, accuracy of english retrieval can be achieved by providing a sufficiently large vocabulary library. However, in the case of chinese, since there is no space as a word-breaking mark, it is difficult to determine a word, and the word used by people is changing continuously, so that it is very costly to maintain an expandable vocabulary library. In view of the above, a data retrieval method that can solve or partially solve the problem of low efficiency and low accuracy of data retrieval in the data retrieval process is needed.
In this regard, one of the core invention points of the embodiments of the present invention is that, for data located in different databases, target data belonging to the same service type may be obtained from the databases, then an initial index table for the target data may be obtained, an index field set for the service type may be included in the initial index table, then a target information entry may be extracted from the target data according to the index field, and the target information entry is added to the initial index table, a target index table corresponding to the service type is generated, data aggregation is implemented, then, in the process of data retrieval, a retrieval key corresponding to the data retrieval operation may be determined in response to the data retrieval operation, and a retrieval result corresponding to the retrieval key may be extracted from the target index table, so as to target data of the same service type in different databases, the index table corresponding to the service type is built, and the target data are aggregated through the index fields in the index table, so that on one hand, the method is beneficial to retrieving the scattered data through the index table, the data surface of data retrieval is enlarged, the efficiency of data retrieval is improved, and on the other hand, the accuracy of data retrieval is improved by retrieving through the matching relation among the keywords.
Referring to fig. 1, a flowchart illustrating steps of a processing method for data retrieval provided in an embodiment of the present invention is shown, which may specifically include the following steps:
step 101, acquiring target data belonging to the same service type from preset different databases;
the databases may be data tables, data systems, terminals, servers, etc. for storing corresponding data, and different databases may store data of the same service type or different types of data. The service type may be matched with the service identifier, and different service identifiers represent different application services, for example, a recharge service, an inquiry service, a service handling service, a consultation service, a complaint service, and the like provided in an application program may respectively correspond to the recharge identifier, the inquiry identifier, the service handling identifier, the consultation identifier, the complaint identifier, and the like, and before retrieving data of a certain service type, the service identifier corresponding to the service type may be obtained first, and then target data corresponding to the service identifier may be extracted from different databases, so that data scattered in each database is aggregated.
For example, the service type may be a recharge service, and for the recharge service, assuming that a database (i), a database (ii), a database (iii), and the like store data corresponding to the recharge service, before retrieving the recharge data, data associated with the recharge service may be extracted from the database (i), the database (ii), and the database (iii) according to the recharge identifier as target data and stored in a target terminal (e.g., a terminal, a server, and the like), so as to aggregate the recharge data dispersed in each database, so as to retrieve the aggregated data.
102, acquiring an initial index table aiming at the target data, wherein the initial index table comprises an index field;
for target data of the same service type extracted from different databases, in order to improve the accuracy of data retrieval, an initial index table for the target data may be obtained, and an index field corresponding to the service type may be included in the initial index table. For example, for the recharging service, the corresponding index fields can be recharging channels, recharging sales products, recharging time, recharging amount and the like, and for the initial index table, taking the recharging service as an example, the initial index table can be shown in table 1 below, so that by constructing the index table corresponding to the service type, not only can the customized design be effectively performed for the service type, but also the accuracy of Chinese retrieval can be effectively improved.
Recharging channel Top-up selling article Recharge time Amount of money to be recharged
TABLE 1
It is understood that, for the index field, the embodiments of the present invention include, but are not limited to, the above examples, for example, for the recharge service, it may further include user identification, recharge location, and the like, which is not limited by the present invention.
103, extracting a target information entry from the target data according to the index field, adding the target information entry to the initial index table, and generating a target index table corresponding to the service type;
in the embodiment of the present invention, the target index table may be generated in an ETL (Extract-Transform-Load) manner, including extracting target data of the same service type from different databases, then processing and converting the extracted data according to service logic, and loading the data into the initial index table to generate the target index table corresponding to the service type, so as to implement aggregation of data distributed in different databases on the one hand, and facilitate improvement of accuracy and adaptability of data retrieval by constructing the index table matched with the service type on the other hand.
In a specific implementation, the extracted target data may include preset information entries corresponding to the service types, each preset information entry may include corresponding service information, and then the preset information entry corresponding to the service information successfully matched with the index field may be extracted from the target data as the target information entry, the target information entry includes the service information corresponding to the index field, and the service information of the target information entry is added to the initial index table according to the index field, so as to generate a target index table corresponding to the service type.
Optionally, the index field may include a "keyword" field, a "value" field, a rating field, and the like, where the "keyword" field may be service description information that is converted by processing, such as a dimension describing a problem, and the like, and taking a recharging service as an example, the "keyword" field after processing may be "12/28 th-of-a-month applet recharging in 2021" and the like; a "value" field, which may be a specific service value, is a measure of the problem, and also takes the mobile phone recharging as an example, the "value" field may be a specific recharging amount, such as 200 yuan; the hierarchical field may include a primary topic, a secondary topic, a tertiary topic, etc., which is used to structure the problem and guide the user to perform an accurate search, still taking the recharging service as an example, the hierarchical field may include a description "order type", "order channel", etc., and the corresponding data is stored as "recharging order", "applet channel", etc., which is not limited by the present invention.
Specifically, the service information included in the preset information entry may be a problem dimension, a problem measure, a problem structure, and the like corresponding to the index field, for example, in a recharge information entry, it may include a recharge order, a specific channel for recharging (such as an applet), a recharge time, a recharge sales item (such as a mobile phone recharge), a recharge amount, and specifically may be: recharging orders, small program channels, 20211228, mobile phone recharging, 200, etc. according to the actual service logic, the preset information items corresponding to the index fields can be extracted according to the matching condition between the index fields and the preset information items, for example, the target information items corresponding to the recharging orders or the target information items corresponding to the query orders can be extracted from the target data, so that the corresponding full-text indexes are established on the key fields in the index table, and the key fields can be the content personalized and customized according to the actual service content, which is beneficial to improving the accuracy and the adaptability of data retrieval.
And 104, responding to data retrieval operation, determining a retrieval key word corresponding to the data retrieval operation, and extracting a retrieval result corresponding to the retrieval key word from the target index table.
When the aggregation of the data and the construction of the index table are completed, the user searches the data, namely, the related data can be searched through the index table. Optionally, for the aggregation of the data, it may be performed in the server, and for the retrieval of the data, it may be performed for the client, that is, the data manager may aggregate the data and construct the corresponding index table according to the above manner, and the user may input the corresponding search condition in the client to perform the data retrieval.
In a specific implementation, the client may respond to a data retrieval operation input by a user, determine a retrieval keyword corresponding to the data retrieval operation, extract a retrieval result corresponding to the retrieval keyword from the target index table, and display target data to complete data retrieval. In the process of searching through the search keyword, hierarchical order searching can be performed through the subject level so as to guide a user to search according to a certain hierarchical order, and the precision of data searching is improved.
In addition, if the search keyword is the topic keyword, determining the topic grade of the search keyword, extracting target data corresponding to the search keyword from the target index table, and displaying a target retrieval result page corresponding to the topic grade. Specifically, for a target retrieval result page corresponding to the topic grade, if the topic grade corresponding to the topic keyword is a last-level grade, displaying a last-level retrieval result page corresponding to the topic grade; if the topic grade corresponding to the topic keyword is a first grade, displaying a first grade retrieval result page corresponding to the first grade; and if the topic grade corresponding to the topic keyword is not the first grade and is not the last grade, displaying a retrieval result page corresponding to the last grade of the topic grade.
For example, in the recharging service, the first-level theme keyword may be a "recharging channel", and the second-level theme keyword is a "recharging consumer product", and the like. Alternatively, the top level may be the highest level and the last level may be the lowest level, for example, in the topic keyword, assuming that the first-level topic is the highest-level topic, the top level may be the first-level, and the second-level topic is the lowest-level topic, the last-level may be the second-level.
In one example, assuming that the topics include a primary topic (top-up channel) and a secondary topic (top-up sales), wherein the primary topic is ranked higher than the secondary topic, the target index table may be as shown in table 2 below:
recharging channel Top-up selling article Recharge time Amount of money to be recharged
Client recharge Wide band 20211212 200
Client recharge Wide band 20211215 100
Client recharge Telephone charge 20211217 50
WeChat applet Telephone charge 20211220 100
Tianmao shop Fixed telephone 20211228 200
TABLE 2
In the above-mentioned search process, the search keyword corresponding to the data search operation input by the user is "200", which is not any topic keyword (client, broadband, etc.), and the search result may be:
client recharge-broadband-202102-200
Tianmao shop-fixed telephone-202107-
If the retrieval keyword is a "client side" and corresponds to a primary topic, a search result page corresponding to the primary topic can be displayed, and the included content can be:
client recharge-broadband-20211212-
Client recharge-broadband-20211215-100
Client recharge-telephone charge-20211217-50
If the retrieval key is 'telephone charge' and the retrieval key corresponds to a secondary topic, a search result page corresponding to the secondary topic can be displayed, and the included content can be:
client recharge-telephone charge-20211217-50
WeChat applet-telephone fee-20211220-
In the above searching process, in order to guide the user to search step by using the theme, when the user does not go in and out the theme key word, the searching result page can jump to the first-level theme result page, and jump to the last level according to the user's further step by step until the user designates the last level of theme (such as broadband, telephone charge, fixed telephone, etc.), and perform data retrieval according to the corresponding retrieval keywords, thereby aiming at the target data of the same service type in different databases, an index table corresponding to the service type is constructed, and target data is aggregated through the index fields in the index table, so that on one hand, the method is beneficial to searching scattered data through the index table, the data surface of data searching is enlarged, the efficiency of data searching is improved, and on the other hand, the accuracy of data searching is improved by searching through the matching relation among keywords.
In addition, in some data retrieval scenarios, since the queried data relates to sensitive information of the user, in order to ensure the security of the user information, corresponding data encryption processing may be performed, for example, by obtaining a duration of a retrieval result page, and if the duration is greater than or equal to a preset duration threshold, stopping responding to an access operation to the retrieval result page. Specifically, the page can be processed in a timestamp encryption mode, the timestamp parameters can be used for determining the last access time of the web, when the last access time exceeds a certain duration of the page, and then the page is accessed, the corresponding access request can be rejected, the system function is prevented from being stolen in a page copying mode, and therefore the safety of user information is effectively guaranteed.
Alternatively, for data encryption, the retrieval result may be encrypted by XML (Extensible Markup Language), and specifically, the DES3 encryption algorithm may be used to encrypt the retrieval result. The database end is only responsible for generating the encrypted result set, and does not provide the key for the web page, so that the key can be generated by the web program according to the generation principle and used for decrypting the result set. For example, the specific generation rule may be: the 14-bit time string (YYYYMMDDHHMMSS) and 16-bit numbers (0-F) are used as basic materials, such as "202010101180700" and "1A 56F45B12F45B 88". The time character string is taken as the first, two character strings are spliced into one character string, then each digit is added with 15 according to the bit operation, and the remainder is obtained according to the 16-system.
In an example, referring to fig. 2, a schematic diagram of a data retrieval architecture provided in an embodiment of the present invention is shown, and for the data retrieval architecture, 4 hierarchies may be included, specifically:
(1) and (4) a service layer: this part is a separate decentralized business system (i.e. database)
(2) A data acquisition layer: the part is developed and realized based on a storage process, and aims to extract service data dispersed in each service system to a target end.
(3) A data preprocessing layer: classifying and segmenting data acquired from a business system in advance to obtain the content of an index field, writing the content into an index table, and waiting for the user to inquire and use.
(4) Viewing the image layer: an interactive interface for a user is provided.
In the process of retrieving data, referring to fig. 3, a schematic diagram of a retrieval process provided in the embodiment of the present invention is shown, where a web may represent a corresponding web interface (web1.1 is a keyword search page, web1.2 is a high-level search page, web2 is a first-level topic summary page, web3 is a second-level topic summary page, web4 is a detail entry page, and web5 is a numerical page of detail entries, etc.), pkg _ search may be used to represent a retrieval node, and the retrieval process may include:
the method comprises the steps that a user inputs corresponding keywords in the web1.1, if the keywords are not any theme keywords, a retrieval result of a primary theme summary page is displayed, then the user can input a secondary theme summary page on the basis of the primary theme summary page and conduct retrieval step by step, and therefore service numerical values of detailed items are displayed finally, and further through a step-by-step retrieval mode, data retrieval precision is effectively guaranteed, and accuracy of the retrieval result is improved. In addition, a user can select an advanced search mode in the web1.1 to perform topic search in an advanced search page of the web2, wherein the topic search mode includes directly appointing a first-level topic for retrieval, or directly appointing a first-level topic and a second-level topic for retrieval and the like, so that the user is guided to perform retrieval according to the hierarchical sequence of the first-level topic, the second-level topic and the like in the search process, and the search precision is improved.
In another example, as the business of an enterprise is continuously expanded, the sources of business data are more and more, and the data volume is also more and more, taking the recharging data as an example, the recharging channel is divided into: a plurality of channels such as a client, a wap, a webcast, a wechat public number, a wechat applet, a tianmao shop and a pay-for-all applet; the recharging method comprises the following steps: bank card recharge, card secret recharge and other modes; the recharging sales items are classified into telephone fee, fixed telephone, broadband and the like.
The hierarchical relationship of these data is clear, but the data is scattered in each business table, and needs to be queried separately when the data is temporarily used. If a report is customized and developed, data needs to be developed for the second time according to the report style every time, and the method is very inflexible.
The framework described by the invention and the retrieval mode related to the above can realize flexible data retrieval. The service layer is recharging data from each system, keywords are extracted from the recharging data, and the keywords are uniformly extracted into an index table. The 'first-level theme' and 'second-level theme' in the 'index table' are set as 'recharging channels' and 'recharging consumables'. The keywords in the index table are similar to those of the client recharging broadband of 2021 year 2 month, and the numerical value field in the index table stores specific recharging amount. The automatic execution time of the extraction program is set to be 2 every morning of the night: 00, extracting the service data of the previous day, organizing into a keyword, and adding the keyword into an index table.
In order to guide a user to search step by using themes, when the user does not input the themes, the page jumps to a first-level theme summary page, jumps step by step after clicking until the user designates the last-level theme, and then carries out data retrieval based on full-text indexes according to keywords.
Because the recharge data relates to user sensitive information, a two-layer encryption scheme is added, and the data display safety is ensured:
(1) and encrypting a timestamp, wherein the timestamp parameter is used for determining the last access time of the web page, and refusing the page to be accessed again for more than 20 minutes, so that the function of the system is prevented from being embezzled by copying the page. This parameter is generated by the database program and passed again to the database for use by the web page.
(2) XML data is encrypted, and a DES3 encryption algorithm is adopted to encrypt a retrieval result. The database side is only responsible for generating the encrypted result set and does not provide keys to the web page. A key is generated by the web program on a generation basis for decrypting the result set. The specific generation rules are similar to: the 14-bit time string (YYYYMMDDHHMMSS) and 16-bit numbers (0-F) are used as basic materials, such as "202010101180700" and "1A 56F45B12F45B 88". The time character string is taken as the first, two character strings are spliced into one character string, then each digit is added with 15 according to the bit operation, and the remainder is obtained according to the 16-system.
It should be noted that, the embodiment of the present invention includes but is not limited to the above examples, and it is understood that, under the guidance of the idea of the embodiment of the present invention, a person skilled in the art may also set the method according to practical requirements, and the present invention is not limited to this.
In the embodiment of the present invention, for data located in different databases, target data belonging to the same service type may be first obtained from the databases, then an initial index table for the target data may be obtained, an index field set for the service type may be included in the initial index table, then a target information entry may be extracted from the target data according to the index field, and the target information entry is added to the initial index table, so as to generate a target index table corresponding to the service type, during data retrieval, a retrieval key corresponding to the data retrieval operation may be determined in response to the data retrieval operation, and a retrieval result corresponding to the retrieval key may be extracted from the target index table, so as to construct an index table corresponding to the service type for the target data of the same service type in different databases, and aggregate the target data through the index field in the index table, on one hand, the method is beneficial to searching the scattered data through the index table, the data surface of data searching is enlarged, the efficiency of data searching is improved, and on the other hand, the accuracy of data searching is improved by searching through the matching relation among the keywords.
It should be noted that, for simplicity of description, the method embodiments are described as a series of acts or combination of acts, but those skilled in the art will recognize that the present invention is not limited by the illustrated order of acts, as some steps may occur in other orders or concurrently in accordance with the embodiments of the present invention. Further, those skilled in the art will appreciate that the embodiments described in the specification are presently preferred and that no particular act is required to implement the invention.
Referring to fig. 4, a block diagram of a processing apparatus for data retrieval according to an embodiment of the present invention is shown, which may specifically include the following modules:
a target data obtaining module 401, configured to obtain target data belonging to the same service type from preset different databases;
an index table obtaining module 402, configured to obtain an initial index table for the target data, where the initial index table includes an index field;
a target index table generating module 403, configured to extract a target information entry from the target data according to the index field, add the target information entry to the initial index table, and generate a target index table corresponding to the service type;
a retrieval result determining module 404, configured to determine, in response to a data retrieval operation, a retrieval key corresponding to the data retrieval operation, and extract a retrieval result corresponding to the retrieval key from the target index table.
In an alternative embodiment, the target data obtaining module 401 includes:
a service identifier obtaining submodule for obtaining a service identifier;
the target data extraction submodule is used for extracting target data corresponding to the service identifier from different preset databases;
the service identification at least comprises one of a recharging identification, a query identification, a service handling identification, a consultation identification and a complaint identification.
In an optional embodiment, the target data includes a preset information entry, the preset information entry includes service information, and the target index table generating module 403 includes:
an information entry determining submodule, configured to extract, from the target data, a preset information entry corresponding to the service information that is successfully matched with the index field, as a target information entry, where the target information entry includes the service information corresponding to the index field;
and the index table generation submodule is used for adding the service information of the target information entry to the initial index table according to the index field and generating a target index table corresponding to the service type.
In an alternative embodiment, the index field includes a topic keyword, and the search result determining module 404 includes:
the search keyword determining submodule is used for responding to data retrieval operation and determining a search keyword corresponding to the data retrieval operation;
and the first result page display sub-module is used for extracting target data corresponding to the search keyword from the target index table and displaying a first-level search result page aiming at the target data if the search keyword is not the topic keyword.
In an alternative embodiment, the retrieval result determining module 404 further includes:
and the second result page display sub-module is used for determining the topic grade of the search keyword if the search keyword is the topic keyword, extracting target data corresponding to the search keyword from the target index table, and displaying a target retrieval result page corresponding to the topic grade.
In an optional embodiment, the second result page display sub-module is specifically configured to:
if the topic grade corresponding to the topic keyword is a last grade, displaying a last grade retrieval result page corresponding to the topic grade;
if the topic grade corresponding to the topic keyword is a first grade, displaying a first grade retrieval result page corresponding to the first grade;
and if the topic grade corresponding to the topic keyword is not the first grade and not the last grade, displaying a retrieval result page corresponding to the last grade of the topic grade.
In an alternative embodiment, further comprising:
the duration acquisition module is used for acquiring the duration of the retrieval result page;
and the page processing module is used for stopping responding to the access operation of the retrieval result page if the duration is greater than or equal to a preset duration threshold.
For the device embodiment, since it is basically similar to the method embodiment, the description is simple, and for the relevant points, refer to the partial description of the method embodiment.
In addition, an embodiment of the present invention further provides an electronic device, as shown in fig. 5, which includes a processor 501, a communication interface 502, a memory 503 and a communication bus 504, where the processor 501, the communication interface 502 and the memory 503 complete mutual communication through the communication bus 504,
a memory 503 for storing a computer program;
the processor 501, when executing the program stored in the memory 503, implements the following steps:
acquiring target data belonging to the same service type from preset different databases;
obtaining an initial index table for the target data, wherein the initial index table comprises index fields;
extracting a target information entry from the target data according to the index field, adding the target information entry to the initial index table, and generating a target index table corresponding to the service type;
and responding to data retrieval operation, determining a retrieval key word corresponding to the data retrieval operation, and extracting a retrieval result corresponding to the retrieval key word from the target index table.
In an optional embodiment, the obtaining target data belonging to the same service type from preset different databases includes:
acquiring a service identifier;
extracting target data corresponding to the service identification from different preset databases;
the service identification at least comprises one of a recharging identification, a query identification, a service handling identification, a consultation identification and a complaint identification.
In an optional embodiment, the extracting the target information entry from the target data according to the index field, adding the target information entry to the initial index table, and generating the target index table corresponding to the service type includes:
extracting a preset information item corresponding to the service information successfully matched with the index field from the target data as a target information item, wherein the target information item comprises the service information corresponding to the index field;
and adding the service information of the target information entry to the initial index table according to the index field to generate a target index table corresponding to the service type.
In an optional embodiment, the index field includes a topic keyword, and the determining, in response to a data retrieval operation, a retrieval keyword corresponding to the data retrieval operation and extracting a retrieval result corresponding to the retrieval keyword from the target index table includes:
responding to a data retrieval operation, and determining a search keyword corresponding to the data retrieval operation;
and if the search keyword is not the topic keyword, extracting target data corresponding to the search keyword from the target index table, and displaying a first-level retrieval result page aiming at the target data.
In an optional embodiment, the determining, in response to a data retrieval operation, a retrieval key corresponding to the data retrieval operation, and extracting a retrieval result corresponding to the retrieval key from the target index table further includes:
if the search keyword is the topic keyword, determining the topic grade of the search keyword, extracting target data corresponding to the search keyword from the target index table, and displaying a target retrieval result page corresponding to the topic grade.
In an optional embodiment, the presenting the target search result page corresponding to the topic rank includes:
if the topic grade corresponding to the topic keyword is a last grade, displaying a last grade retrieval result page corresponding to the topic grade;
if the topic grade corresponding to the topic keyword is a first grade, displaying a first grade retrieval result page corresponding to the first grade;
and if the topic grade corresponding to the topic keyword is not the first grade and not the last grade, displaying a retrieval result page corresponding to the last grade of the topic grade.
In an alternative embodiment, further comprising:
acquiring the duration of a retrieval result page;
and if the duration is greater than or equal to a preset duration threshold, stopping responding to the access operation of the retrieval result page.
The communication bus mentioned in the above terminal may be a Peripheral Component Interconnect (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like. The communication bus may be divided into an address bus, a data bus, a control bus, etc. For ease of illustration, only one thick line is shown, but this does not mean that there is only one bus or one type of bus.
The communication interface is used for communication between the terminal and other equipment.
The Memory may include a Random Access Memory (RAM) or a non-volatile Memory (non-volatile Memory), such as at least one disk Memory. Optionally, the memory may also be at least one memory device located remotely from the processor.
The Processor may be a general-purpose Processor, and includes a Central Processing Unit (CPU), a Network Processor (NP), and the like; the Integrated Circuit may also be a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other Programmable logic device, a discrete Gate or transistor logic device, or a discrete hardware component.
In another embodiment provided by the present invention, as shown in fig. 6, there is further provided a computer-readable storage medium 601, which stores instructions that, when executed on a computer, cause the computer to execute the processing method of data retrieval described in the above embodiment.
In yet another embodiment of the present invention, a computer program product containing instructions is also provided, which when run on a computer causes the computer to execute the processing method for data retrieval described in the above embodiments.
In the above embodiments, the implementation may be wholly or partially realized by software, hardware, firmware, or any combination thereof. When implemented in software, may be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When loaded and executed on a computer, cause the processes or functions described in accordance with the embodiments of the invention to occur, in whole or in part. The computer may be a general purpose computer, a special purpose computer, a network of computers, or other programmable device. The computer instructions may be stored in a computer readable storage medium or transmitted from one computer readable storage medium to another, for example, from one website site, computer, server, or data center to another website site, computer, server, or data center via wired (e.g., coaxial cable, fiber optic, Digital Subscriber Line (DSL)) or wireless (e.g., infrared, wireless, microwave, etc.). The computer-readable storage medium can be any available medium that can be accessed by a computer or a data storage device, such as a server, a data center, etc., that incorporates one or more of the available media. The usable medium may be a magnetic medium (e.g., floppy Disk, hard Disk, magnetic tape), an optical medium (e.g., DVD), or a semiconductor medium (e.g., Solid State Disk (SSD)), among others.
It is noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.
All the embodiments in the present specification are described in a related manner, and the same and similar parts among the embodiments may be referred to each other, and each embodiment focuses on the differences from the other embodiments. In particular, for the system embodiment, since it is substantially similar to the method embodiment, the description is simple, and for the relevant points, reference may be made to the partial description of the method embodiment.
The above description is only for the preferred embodiment of the present invention, and is not intended to limit the scope of the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention shall fall within the protection scope of the present invention.

Claims (10)

1. A processing method for data retrieval, comprising:
acquiring target data belonging to the same service type from preset different databases;
obtaining an initial index table for the target data, wherein the initial index table comprises index fields;
extracting a target information entry from the target data according to the index field, adding the target information entry to the initial index table, and generating a target index table corresponding to the service type;
and responding to data retrieval operation, determining a retrieval key word corresponding to the data retrieval operation, and extracting a retrieval result corresponding to the retrieval key word from the target index table.
2. The method according to claim 1, wherein the obtaining target data belonging to the same service type from different preset databases comprises:
acquiring a service identifier;
extracting target data corresponding to the service identification from different preset databases;
the service identification at least comprises one of a recharging identification, a query identification, a service handling identification, a consultation identification and a complaint identification.
3. The method of claim 1, wherein the target data comprises a preset information entry, the preset information entry comprises service information, and the extracting a target information entry from the target data according to the index field and adding the target information entry to the initial index table generates a target index table corresponding to the service type, including:
extracting a preset information item corresponding to the service information successfully matched with the index field from the target data as a target information item, wherein the target information item comprises the service information corresponding to the index field;
and adding the service information of the target information entry to the initial index table according to the index field to generate a target index table corresponding to the service type.
4. The method of claim 1, wherein the index field comprises a topic keyword, and wherein, in response to a data retrieval operation, determining a retrieval keyword corresponding to the data retrieval operation and extracting a retrieval result corresponding to the retrieval keyword from the target index table comprises:
responding to a data retrieval operation, and determining a search keyword corresponding to the data retrieval operation;
and if the search keyword is not the topic keyword, extracting target data corresponding to the search keyword from the target index table, and displaying a first-level retrieval result page aiming at the target data.
5. The method of claim 4, wherein the determining, in response to a data retrieval operation, a retrieval key corresponding to the data retrieval operation and extracting a retrieval result corresponding to the retrieval key from the target index table further comprises:
if the search keyword is the topic keyword, determining the topic grade of the search keyword, extracting target data corresponding to the search keyword from the target index table, and displaying a target retrieval result page corresponding to the topic grade.
6. The method of claim 5, wherein said presenting a target search result page corresponding to said topic rank comprises:
if the topic grade corresponding to the topic keyword is a last grade, displaying a last grade retrieval result page corresponding to the topic grade;
if the topic grade corresponding to the topic keyword is a first grade, displaying a first grade retrieval result page corresponding to the first grade;
and if the topic grade corresponding to the topic keyword is not the first grade and not the last grade, displaying a retrieval result page corresponding to the last grade of the topic grade.
7. The method of claim 4, 5 or 6, further comprising:
acquiring the duration of a retrieval result page;
and if the duration is greater than or equal to a preset duration threshold, stopping responding to the access operation of the retrieval result page.
8. A processing apparatus for data retrieval, comprising:
the target data acquisition module is used for acquiring target data belonging to the same service type from preset different databases;
an index table obtaining module, configured to obtain an initial index table for the target data, where the initial index table includes an index field;
a target index table generation module, configured to extract a target information entry from the target data according to the index field, add the target information entry to the initial index table, and generate a target index table corresponding to the service type;
and the retrieval result determining module is used for responding to data retrieval operation, determining a retrieval key word corresponding to the data retrieval operation and extracting a retrieval result corresponding to the retrieval key word from the target index table.
9. An electronic device, comprising a processor, a communication interface, a memory and a communication bus, wherein the processor, the communication interface and the memory communicate with each other via the communication bus;
the memory is used for storing a computer program;
the processor, when executing a program stored on the memory, implementing the method of any of claims 1-7.
10. A computer-readable storage medium having stored thereon instructions, which when executed by one or more processors, cause the processors to perform the method of any one of claims 1-7.
CN202111649812.5A 2021-12-29 2021-12-29 Data retrieval processing method and device, electronic equipment and storage medium Pending CN114416733A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111649812.5A CN114416733A (en) 2021-12-29 2021-12-29 Data retrieval processing method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111649812.5A CN114416733A (en) 2021-12-29 2021-12-29 Data retrieval processing method and device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN114416733A true CN114416733A (en) 2022-04-29

Family

ID=81269372

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111649812.5A Pending CN114416733A (en) 2021-12-29 2021-12-29 Data retrieval processing method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN114416733A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116991919A (en) * 2023-09-26 2023-11-03 中国铁塔股份有限公司吉林省分公司 Service data retrieval method combined with platform database and artificial intelligent system
CN117573704A (en) * 2024-01-17 2024-02-20 上海合见工业软件集团有限公司 Method, device, equipment and medium for indexing composite document of EDA software

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116991919A (en) * 2023-09-26 2023-11-03 中国铁塔股份有限公司吉林省分公司 Service data retrieval method combined with platform database and artificial intelligent system
CN116991919B (en) * 2023-09-26 2023-12-08 中国铁塔股份有限公司吉林省分公司 Service data retrieval method combined with platform database and artificial intelligent system
CN117573704A (en) * 2024-01-17 2024-02-20 上海合见工业软件集团有限公司 Method, device, equipment and medium for indexing composite document of EDA software
CN117573704B (en) * 2024-01-17 2024-04-12 上海合见工业软件集团有限公司 Method, device, equipment and medium for indexing composite document of EDA software

Similar Documents

Publication Publication Date Title
US10242016B2 (en) Systems and methods for management of data platforms
US10198460B2 (en) Systems and methods for management of data platforms
US10146878B2 (en) Method and system for creating filters for social data topic creation
CN111008321B (en) Logistic regression recommendation-based method, device, computing equipment and readable storage medium
US8688702B1 (en) Techniques for using dynamic data sources with static search mechanisms
US20160034514A1 (en) Providing search results based on an identified user interest and relevance matching
US20120016863A1 (en) Enriching metadata of categorized documents for search
US8959112B2 (en) Methods for semantics-based citation-pairing information
US9646246B2 (en) System and method for using a statistical classifier to score contact entities
US8732194B2 (en) Systems and methods for generating issue libraries within a document corpus
CN114416733A (en) Data retrieval processing method and device, electronic equipment and storage medium
CN111966866A (en) Data asset management method and device
CN112100396A (en) Data processing method and device
CN108287901A (en) Method and apparatus for generating information
EP3301603A1 (en) Improved search for data loss prevention
WO2009054611A1 (en) System and method for managing information map
US9984108B2 (en) Database joins using uncertain criteria
EP3152678B1 (en) Systems and methods for management of data platforms
CN107291951B (en) Data processing method, device, storage medium and processor
CN111190965A (en) Text data-based ad hoc relationship analysis system and method
Morbidoni et al. Leveraging linked entities to estimate focus time of short texts
CN112836126A (en) Recommendation method and device based on knowledge graph, electronic equipment and storage medium
KR20190109628A (en) Method for providing personalized article contents and apparatus for the same
KR101111497B1 (en) Classifying and searching method for business category information of domain
CN104240107A (en) Community data screening system and method thereof

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination