CN112632122A - Report retrieval method, device, equipment and storage medium based on multiple indexes - Google Patents

Report retrieval method, device, equipment and storage medium based on multiple indexes Download PDF

Info

Publication number
CN112632122A
CN112632122A CN202011500570.9A CN202011500570A CN112632122A CN 112632122 A CN112632122 A CN 112632122A CN 202011500570 A CN202011500570 A CN 202011500570A CN 112632122 A CN112632122 A CN 112632122A
Authority
CN
China
Prior art keywords
report
retrieval
index
synonym
reports
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011500570.9A
Other languages
Chinese (zh)
Inventor
陈忠平
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ping An Puhui Enterprise Management Co Ltd
Original Assignee
Ping An Puhui Enterprise Management Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Puhui Enterprise Management Co Ltd filed Critical Ping An Puhui Enterprise Management Co Ltd
Priority to CN202011500570.9A priority Critical patent/CN112632122A/en
Publication of CN112632122A publication Critical patent/CN112632122A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2468Fuzzy queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/248Presentation of query results
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/247Thesauruses; Synonyms

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Software Systems (AREA)
  • Fuzzy Systems (AREA)
  • Mathematical Physics (AREA)
  • Artificial Intelligence (AREA)
  • Automation & Control Theory (AREA)
  • Health & Medical Sciences (AREA)
  • Probability & Statistics with Applications (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to the field of big data and discloses a report retrieval method, a report retrieval device, report retrieval equipment and a storage medium based on multiple indexes. A report retrieval method based on multiple indexes comprises the following steps: regularly acquiring all reports in a preset system, and constructing an index retrieval text base based on the reports; sequentially adding label marks to the report, and setting weights for the label marks to obtain a label retrieval text library; based on the index retrieval text base and the label retrieval text base, carrying out fuzzy matching on retrieval keywords input by a user to obtain a retrieval target report; and sequencing the retrieval target report, outputting a sequencing result and displaying. According to the method and the device, the required report can be accurately retrieved, the report files with low correlation are filtered, and only the report with high correlation is displayed.

Description

Report retrieval method, device, equipment and storage medium based on multiple indexes
Technical Field
The invention relates to the field of big data, in particular to a report retrieval method, a report retrieval device, report retrieval equipment and a storage medium based on multiple indexes.
Background
In big data analysis, a user needs many steps to extract valid data from the complicated data. When a user extracts, various problems may be encountered, and if a data extraction error exists, deviation of the analysis chart is caused, and incorrect presentation of the chart has various influences on the ring ratio, the same ratio, the budget and the like. The analysis effect is that the analysis is labor-consuming and not high in accuracy, and the larger the data volume of the user is, the higher the probability of the problem is, and the larger the deviation is.
If can be according to the configuration, snatch the data automatically, omit the operation of extraction data, show the chart according to the multidimension degree of selection simultaneously, can realize once taking out the number, multiple analysis show, can convenient and fast obtain required information to follow-up information can be accurately forecasted according to the trend, data accuracy and user experience can all be improved by a wide margin under the operation like this.
In the prior art, the report retrieval and checking scheme adopts retrieval according to the report name, and the scheme has the defects that the required report cannot be accurately retrieved, the report files with lower relevance are filtered, and only the report with higher relevance is displayed.
Disclosure of Invention
The invention mainly aims to solve the problem that the required report cannot be accurately retrieved by the current report retrieval scheme.
The invention provides a report retrieval method based on multiple indexes in a first aspect, which comprises the following steps:
regularly acquiring all reports in a preset system, and constructing an index retrieval text base based on the reports;
sequentially adding label marks to the report, and setting weights for the label marks to obtain a label retrieval text library;
based on the index retrieval text base and the label retrieval text base, carrying out fuzzy matching on retrieval keywords input by a user to obtain a retrieval target report;
and sequencing the retrieval target report, outputting a sequencing result and displaying.
Optionally, in a first implementation manner of the first aspect of the present invention, the obtaining all reports in a preset system at regular time, and the constructing an index retrieval text base based on the reports includes:
based on a preset timing task, regularly calling a preset report export interface to download all reports;
intercepting an index column in the report and corresponding menu information and report ID;
appointing synonyms for each index in the index column in sequence;
and constructing an index retrieval text library based on the index column, the synonym, the menu information and the report ID.
Optionally, in a second implementation manner of the first aspect of the present invention, the retrieving a text base based on the index and the retrieving a text base by the tag, and performing fuzzy matching on a retrieval keyword input by a user to obtain a retrieval target report includes:
acquiring a retrieval keyword input by a user, and carrying out synonym detection on the retrieval keyword;
if the fact that the search keyword does not have the synonym is detected, the search keyword is used for carrying out search independently;
and if the synonym exists in the search keyword is detected, respectively searching the search keyword and the synonym to obtain a search target report.
Optionally, in a third implementation manner of the first aspect of the present invention, if it is detected that the search keyword has a synonym, the search keyword and the synonym are respectively searched, and obtaining a search target report includes:
if the fact that the search keyword has the synonym is detected, performing content matching based on the search keyword or the synonym to obtain a pre-search target report;
and performing database text matching on the pre-retrieval target report based on the retrieval key words or the synonyms to obtain a retrieval target report.
Optionally, in a fourth implementation manner of the first aspect of the present invention, the sorting the retrieval target report, and outputting and displaying a sorting result includes:
sequentially calculating the comprehensive sorting weight corresponding to the retrieval target report;
and setting the display sequence of the target report forms based on the comprehensive sequencing weight, sequencing the retrieval target report forms according to the display sequence, outputting a sequencing result and displaying.
Optionally, in a fifth implementation manner of the first aspect of the present invention, the sequentially calculating the comprehensive ranking weight corresponding to the retrieval target report includes:
obtaining hit keywords corresponding to the retrieval keywords, and calculating the editing distance from the retrieval keywords to the hit keywords;
calculating the similarity between the retrieval keyword and the hit keyword based on the editing distance;
and calculating the comprehensive sequencing weight corresponding to the retrieval target report based on the similarity between the retrieval keyword and the hit keyword and the weight of the label identification.
The second aspect of the present invention provides a report retrieving device based on multiple indexes, including:
the index library construction module is used for acquiring all reports in a preset system at regular time and constructing an index retrieval text library based on the reports;
the tag library construction module is used for sequentially adding tag identifications to the report and setting weights for the tag identifications to obtain a tag retrieval text library;
the report retrieval module is used for carrying out fuzzy matching on retrieval keywords input by a user based on the index retrieval text base and the label retrieval text base to obtain a retrieval target report;
and the report output module is used for sequencing the retrieval target reports, outputting sequencing results and displaying the sequencing results.
Optionally, in a first implementation manner of the second aspect of the present invention, the index library constructing module is specifically configured to:
based on a preset timing task, regularly calling a preset report export interface to download all reports;
intercepting an index column in the report and corresponding menu information and report ID;
appointing synonyms for each index in the index column in sequence;
and constructing an index retrieval text library based on the index column, the synonym, the menu information and the report ID.
Optionally, in a second implementation manner of the second aspect of the present invention, the report retrieval module is specifically configured to:
acquiring a retrieval keyword input by a user, and carrying out synonym detection on the retrieval keyword;
if the fact that the search keyword does not have the synonym is detected, the search keyword is used for carrying out search independently;
and if the synonym exists in the search keyword is detected, respectively searching the search keyword and the synonym to obtain a search target report.
Optionally, in a third implementation manner of the second aspect of the present invention, the report retrieval module is specifically configured to:
if the fact that the search keyword has the synonym is detected, performing content matching based on the search keyword or the synonym to obtain a pre-search target report;
and performing database text matching on the pre-retrieval target report based on the retrieval key words or the synonyms to obtain a retrieval target report.
Optionally, in a fourth implementation manner of the second aspect of the present invention, the report output module is specifically configured to:
sequentially calculating the comprehensive sorting weight corresponding to the retrieval target report;
and setting the display sequence of the target report forms based on the comprehensive sequencing weight, sequencing the retrieval target report forms according to the display sequence, outputting a sequencing result and displaying.
Optionally, in a fifth implementation manner of the second aspect of the present invention, the report output module is specifically configured to:
obtaining hit keywords corresponding to the retrieval keywords, and calculating the editing distance from the retrieval keywords to the hit keywords;
calculating the similarity between the retrieval keyword and the hit keyword based on the editing distance;
and calculating the comprehensive sequencing weight corresponding to the retrieval target report based on the similarity between the retrieval keyword and the hit keyword and the weight of the label identification.
The third aspect of the present invention provides a report retrieving device based on multiple indexes, including: a memory and at least one processor, the memory having instructions stored therein; the at least one processor calls the instructions in the memory to enable the multi-index-based report retrieval device to execute the multi-index-based report retrieval method.
A fourth aspect of the present invention provides a computer-readable storage medium, which stores instructions that, when executed on a computer, cause the computer to execute the above-mentioned multi-index-based report retrieval method.
According to the technical scheme provided by the invention, the index retrieval text base and the label retrieval text base are constructed, then the retrieval keywords input by the user are subjected to fuzzy matching based on the index retrieval text base and the label retrieval text base, and finally, the fuzzy matching results are displayed and sequenced and then displayed to the user, so that the report required by the user is accurately retrieved, the report files with low correlation are filtered out, and only the report with high correlation is displayed.
Drawings
FIG. 1 is a diagram of an embodiment of a report retrieval method based on multiple indexes according to an embodiment of the present invention;
FIG. 2 is a diagram of an embodiment of a report retrieval apparatus based on multiple indexes according to the embodiment of the present invention;
fig. 3 is a diagram of an embodiment of a report retrieval device based on multiple indexes in the embodiment of the present invention.
Detailed Description
The embodiment of the invention provides a report retrieval method, a report retrieval device, report retrieval equipment and a storage medium based on multiple indexes, which can accurately retrieve reports required by users, filter report files with lower relevance and only display the report files with higher relevance.
The terms "first," "second," "third," "fourth," and the like in the description and in the claims, as well as in the drawings, if any, are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It will be appreciated that the data so used may be interchanged under appropriate circumstances such that the embodiments described herein may be practiced otherwise than as specifically illustrated or described herein. Furthermore, the terms "comprises," "comprising," or "having," and any variations thereof, are intended to cover non-exclusive inclusions, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
For convenience of understanding, a detailed process of the embodiment of the present invention is described below, and referring to fig. 1, an embodiment of a report retrieval method based on multiple indexes in the embodiment of the present invention includes:
101. regularly acquiring all reports in a preset system, and constructing an index retrieval text base based on the reports;
setting a timing task, for example, setting a calling time of the timing task as 3 am every day, calling a derivation interface of a report to download all reports when 3 am every day, acquiring an index column in the report to analyze a falling library, recording a corresponding menu and a report ID when analyzing the falling library, and simultaneously specifying a synonym for an index in the index column, wherein the synonym is Chinese and English, and constructing an index retrieval text library by using data of the index column, the synonym, the corresponding menu and the report ID, please refer to a table I. When the report is newly added, modified or deleted, the system can automatically add, modify or delete the MI report to download the detailed report, and the system acquires the report index data according to the latest report in the morning of T +1 day.
Watch 1
Figure BDA0002843413050000051
Figure BDA0002843413050000061
In this embodiment, the foregoing step 101 further includes the following steps:
based on a preset timing task, regularly calling a preset report export interface to download all reports;
intercepting an index column in the report and corresponding menu information and report ID;
appointing synonyms for each index in the index column in sequence;
and constructing an index retrieval text library based on the index column, the synonym, the menu information and the report ID.
In this embodiment, the timed task is an instruction set for executing some operations at a fixed time, for example, a shell script is executed at a fixed time based on a crottab, a python script is executed at a fixed time based on a loop and sleep method, a python script is executed at a fixed time based on a Timer, a python script is executed at a fixed time based on a schedule module, and a python script is executed at an applied schedule based on a task framework. In the embodiment, a method for executing the shell script at regular time based on the crontab is adopted, and firstly, a script main logic statement is written in a text editor, namely, a function which needs to be realized by the script, and the writing is all parameters of a report export interface, including an export path, an export object type, a target path and the like. After the script main logic statement is written, the file is named and stored as a file in an sh format, for example, output.sh, and then the execution authority of the script is given to the script by a chmod command, namely chmod 755output.sh, otherwise, no execution authority exists. And finally, starting the crontab service, and setting the script file output.sh to be executed and the execution time through the editing/etc/crontab file. When the set execution time is reached, the system automatically executes the output.sh script file, and downloads all report data at corresponding addresses according to various export parameters written in the script file.
The downloaded report data is composed of a plurality of columns of data, the report is opened in excel or an index column of each report is screened out through a data interception tool, and the index column is a column of data composed of a plurality of indexes, such as time, size, speed and the like. The indexes are stored in a menu corresponding to each index, one index may correspond to one menu, or a plurality of indexes may correspond to one menu, and the menu information corresponding to each index is archived and recorded. For the report forms, at least one or more report forms may exist in one index, the relationship between the menu and the index is one-to-many, the relationship between the index and the report forms is one-to-many, and the corresponding report form ID under each index is archived. In this embodiment, synonyms need to be manually specified for the index, the index may have multiple synonyms such as chinese and english, and both chinese and english can be retrieved after an english synonym is specified. And finally, establishing a table in a database based on the index column and the report ID of the corresponding archived record, the menu information of the archived record and the synonym, and finishing the construction of the index retrieval text base.
102. Sequentially adding label marks to the report, and setting weights for the label marks to obtain a label retrieval text library;
in this embodiment, in order to make up for the deficiency of the report index downloaded by the program, an artificial addition of a tag identifier and a corresponding weight to the report is added, for example, a tag "note" and a tag "life" are added to the report a, where a weight is set for the tag "note" to be 10 and a weight is set for the tag "life" to be 5, please refer to tables two and three:
watch two
Label (R) Weight of
Note book " 10
'Life' 5
Watch III
Name of report Report ID Report label
Statistics of personal expenses in 2019 00232 Life saving
103. Based on the index retrieval text base and the label retrieval text base, carrying out fuzzy matching on retrieval keywords input by a user to obtain a retrieval target report;
in this embodiment, firstly, synonym detection is performed on a search keyword input by a user, a keyword used for search is determined according to a synonym detection result, for example, the search keyword input by the user is "work", if a corresponding synonym is not searched, the "work" is directly used as the keyword used for search, if a corresponding synonym "work" is searched, the "work" and the "work" are respectively used as the keywords used for search, content matching and database text matching are respectively performed during search, and a search target report is finally determined.
In this embodiment, the step 103 further includes the following steps:
acquiring a search keyword input by a user, and carrying out synonym detection on the search keyword;
if the fact that the search keyword does not have the synonym is detected, the search keyword is used for carrying out search independently;
and if the synonym exists in the search keyword is detected, respectively searching the search keyword and the synonym to obtain a search target report.
After the system acquires the search terms of the user, whether the synonyms exist or not is checked in the index retrieval text base, and the table I shows that the synonyms exist. If no synonym exists, the search word is used for searching alone. If the synonym exists, the search word and the synonym are respectively searched. For example, if the search word input by the user is "order quantity", and the synonym detects that there is a corresponding synonym "salt _ Amount", then the "order quantity" and "salt _ Amount" are retrieved separately.
In this embodiment, the above retrieval process further includes:
performing content matching based on the retrieval keywords or the synonyms to obtain a pre-retrieval target report;
and performing database text matching on the pre-retrieval target report based on the retrieval key words or the synonyms.
Firstly, content matching is carried out on search keywords or synonyms, wherein the content matching is preferably completely matched, for example, the search keywords are 'order quantity', if a report contains the word 'order quantity', the report accords with the search target, if the report only contains 'orders', the report does not accord with the search target, and if the report contains 'orders' and 'quantity', the report also does not accord with the search target. If the complete matching has no result, the fuzzy matching is continued, such as prefix, suffix and case indifference.
And obtaining a batch of pre-retrieval target reports through the content matching, and filtering the pre-retrieval target reports again through database text matching. The database text matching is preferably performed by starting tag matching in a tag text search library, for example, a search term is "order quantity", and if a tag of a report contains an "order", the report meets a search target. If the label matching has no result, index matching is carried out, for example, the search word is 'personal working time', if the index 'working time' exists, report IDs corresponding to the index 'working time' are inquired, the reports corresponding to the report IDs conform to the search target, and the last search target report is the intersection part of the report corresponding to the report IDs and the pre-search target report.
104. And sequencing the retrieval target report, outputting a sequencing result and displaying.
In this example, the ordering rules include:
if the label matching is carried out in the database text matching stage during retrieval, the report forms are sorted according to the weight of the search terms in the label library, the weight calculation formula of the search terms refers to the following formula, and the higher the weight of the search terms is, the higher the sorting is.
The similarity between the search word and the hit word is the correlation between the index and the report
The relevance between the index and the report is the weight set for the label, the relevance between the index and the report is higher when the weight value is larger, and the similarity between the search word and the hit word is calculated according to the following formula:
similarity 1-1/edit distance
The edit Distance (Levenshtein Distance) refers to the minimum number of single character edit operations required to convert one word into another between two words. If they are at greater distances, they are said to be more different. Permitted editing operations include replacing one character with another, inserting one character, and deleting one character. If we have two strings, we can calculate what the cost of getting two identical strings after one of them goes through the above three operations. For example, we have two strings kitten and sitting, we now want to convert kitten into sitting, where kitten to sitten replace k with s, sitten to sitten replace e with i, sittin to sitting add g, where we set the cost we spend every time we go through editing, i.e. change (insert, delete, replace) is all 1, and the edit distance is 3.
And (3) sequencing by a weighting algorithm according to the condition of multi-word search, adding the retrieved weight of each word in a certain report, wherein the higher the weight is, the higher the report sequencing is. For example, the user searches for "AA" and "BB", if the report a has no "AA", and if the report a has "BB" with a weight of 8, the comprehensive retrieval weight of the report a is 0+8 to 8, and if the report B has "AA" with a weight of 4, and if the report B has "BB" with a weight of 8, the comprehensive retrieval weight of the report B is 4+8 to 12, and since the comprehensive retrieval weight of the report B is greater than that of the report a, the report B will be ranked before the report a in the ranking process.
In the embodiment, the retrieval of the report content is supported, the report data can be retrieved according to the specific indexes in the report, and the report with the highest association degree can be displayed to the user; the method supports retrieval according to a plurality of words at the same time, can identify a plurality of index words contained in a sentence, and quickly positions the related report according to the report data which is most close to the user's desire and is matched according to the plurality of words; when a user inputs a search word, index information existing in the current system can be associated, and the user can be guided to input conveniently.
The above description is made on the report retrieval method based on multiple indexes in the embodiment of the present invention, and the following description is made on the report retrieval device based on multiple indexes in the embodiment of the present invention, referring to fig. 2, an embodiment of the report retrieval device based on multiple indexes in the embodiment of the present invention includes:
the index base building module 201 is used for regularly acquiring all reports in a preset system and building an index retrieval text base based on the reports;
the tag library construction module 202 is configured to sequentially add tag identifiers to the report, and set weights for the tag identifiers to obtain a tag retrieval text library;
the report retrieval module 203 is used for performing fuzzy matching on the retrieval keywords input by the user based on the index retrieval text base and the label retrieval text base to obtain a retrieval target report;
and the report output module 204 is used for sequencing the retrieval target reports, outputting the sequencing result and displaying the sequencing result.
Optionally, the index library constructing module 201 may be further specifically configured to:
based on a preset timing task, regularly calling a preset report export interface to download all reports;
intercepting an index column in the report and corresponding menu information and report ID;
appointing synonyms for each index in the index column in sequence;
and constructing an index retrieval text library based on the index column, the synonym, the menu information and the report ID.
Optionally, the report retrieving module 203 may be further specifically configured to:
acquiring a retrieval keyword input by a user, and carrying out synonym detection on the retrieval keyword;
if the fact that the search keyword does not have the synonym is detected, the search keyword is used for carrying out search independently;
and if the synonym exists in the search keyword is detected, respectively searching the search keyword and the synonym to obtain a search target report.
Optionally, the report retrieving module 203 may be further specifically configured to:
if the fact that the search keyword has the synonym is detected, performing content matching based on the search keyword or the synonym to obtain a pre-search target report;
and performing database text matching on the pre-retrieval target report based on the retrieval key words or the synonyms to obtain a retrieval target report.
Optionally, the report output module 204 may be further specifically configured to:
sequentially calculating the comprehensive sorting weight corresponding to the retrieval target report;
and setting the display sequence of the target report forms based on the comprehensive sequencing weight, sequencing the retrieval target report forms according to the display sequence, outputting a sequencing result and displaying.
Optionally, the report output module 204 may be further specifically configured to:
obtaining hit keywords corresponding to the retrieval keywords, and calculating the editing distance from the retrieval keywords to the hit keywords;
calculating the similarity between the retrieval keyword and the hit keyword based on the editing distance;
and calculating the comprehensive sequencing weight corresponding to the retrieval target report based on the similarity between the retrieval keyword and the hit keyword and the weight of the label identification.
In the embodiment of the invention, the modular design ensures that hardware of each part of the report retrieval device based on multiple indexes is concentrated on realizing a certain function, the performance of the hardware is realized to the maximum extent, and meanwhile, the modular design also reduces the coupling among modules of the device, so that the maintenance is more convenient.
Fig. 1 and fig. 2 describe the report retrieving device based on multiple indexes in the embodiment of the present invention in detail from the perspective of the modular functional entity, and the report retrieving device based on multiple indexes in the embodiment of the present invention is described in detail from the perspective of hardware processing.
Fig. 3 is a schematic structural diagram of a multi-index-based report retrieving device 300 according to an embodiment of the present invention, where the multi-index-based report retrieving device 300 may generate relatively large differences due to different configurations or performances, and may include one or more processors (CPUs) 310 (e.g., one or more processors) and a memory 320, and one or more storage media 330 (e.g., one or more mass storage devices) storing applications 333 or data 332. Memory 320 and storage media 330 may be, among other things, transient or persistent storage. The program stored on the storage medium 330 may include one or more modules (not shown), each of which may include a series of instruction operations for the multi-index based report retrieval device 300. Still further, the processor 310 may be configured to communicate with the storage medium 330 to execute a series of instruction operations in the storage medium 330 on the multi-index based report retrieval device 300.
The multi-index based report retrieval device 300 may further include one or more power supplies 340, one or more wired or wireless network interfaces 350, one or more input-output interfaces 360, and/or one or more operating systems 331, such as Windows Server, Mac OS X, Unix, Linux, FreeBSD, and the like. Those skilled in the art will appreciate that the structure of the multi-index based report retrieval device shown in FIG. 3 does not constitute a limitation of the multi-index based report retrieval device, and may include more or less components than those shown, or some components may be combined, or a different arrangement of components.
The invention also provides report retrieval equipment based on multiple indexes, which comprises a memory and a processor, wherein the memory stores computer readable instructions, and the computer readable instructions are executed by the processor, so that the processor executes the steps of the report retrieval method based on multiple indexes in the embodiments.
The present invention also provides a computer-readable storage medium, which may be a non-volatile computer-readable storage medium, and may also be a volatile computer-readable storage medium, where instructions are stored, and when the instructions are executed on a computer, the instructions cause the computer to execute the steps of the multi-index based report retrieval method.
It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described systems, apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: various media capable of storing program codes, such as a usb disk, a removable hard disk, a read-only memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.
The above-mentioned embodiments are only used for illustrating the technical solutions of the present invention, and not for limiting the same; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (10)

1. A report retrieval method based on multiple indexes is characterized in that the report retrieval method based on the multiple indexes comprises the following steps:
regularly acquiring all reports in a preset system, and constructing an index retrieval text base based on the reports;
sequentially adding label marks to the report, and setting weights for the label marks to obtain a label retrieval text library;
based on the index retrieval text base and the label retrieval text base, carrying out fuzzy matching on retrieval keywords input by a user to obtain a retrieval target report;
and sequencing the retrieval target report, outputting a sequencing result and displaying.
2. The report retrieval method based on multiple indexes of claim 1, wherein the periodically obtaining all reports in a preset system, and the constructing an index retrieval text base based on the reports comprises:
based on a preset timing task, regularly calling a preset report export interface to download all reports;
intercepting an index column in the report and corresponding menu information and report ID;
appointing synonyms for each index in the index column in sequence;
and constructing an index retrieval text library based on the index column, the synonym, the menu information and the report ID.
3. The report retrieval method based on multiple indexes of claim 1, wherein the retrieving the text base based on the indexes and the retrieving the text base based on the labels performs fuzzy matching on the retrieval keywords input by the user to obtain the retrieval target report comprises:
acquiring a retrieval keyword input by a user, and carrying out synonym detection on the retrieval keyword;
if the fact that the search keyword does not have the synonym is detected, the search keyword is used for carrying out search independently;
and if the synonym exists in the search keyword is detected, respectively searching the search keyword and the synonym to obtain a search target report.
4. The report retrieval method based on multiple indexes of claim 3, wherein if it is detected that the search keyword has a synonym, the search keyword and the synonym are respectively retrieved to obtain a retrieval target report, comprising:
if the fact that the search keyword has the synonym is detected, performing content matching based on the search keyword or the synonym to obtain a pre-search target report;
and performing database text matching on the pre-retrieval target report based on the retrieval key words or the synonyms to obtain a retrieval target report.
5. The report retrieval method based on multiple indexes according to claim 1, wherein the sorting the retrieval target report, outputting the sorting result and displaying comprises:
sequentially calculating the comprehensive sorting weight corresponding to the retrieval target report;
and setting the display sequence of the target report forms based on the comprehensive sequencing weight, sequencing the retrieval target report forms according to the display sequence, outputting a sequencing result and displaying.
6. The report retrieval method based on multiple indexes of claim 5, wherein the sequentially calculating the comprehensive ranking weight corresponding to the retrieval target report comprises:
obtaining hit keywords corresponding to the retrieval keywords, and calculating the editing distance from the retrieval keywords to the hit keywords;
calculating the similarity between the retrieval keyword and the hit keyword based on the editing distance;
and calculating the comprehensive sequencing weight corresponding to the retrieval target report based on the similarity between the retrieval keyword and the hit keyword and the weight of the label identification.
7. The report form retrieval device based on multiple indexes is characterized by comprising the following components:
the index library construction module is used for acquiring all reports in a preset system at regular time and constructing an index retrieval text library based on the reports;
the tag library construction module is used for sequentially adding tag identifications to the report and setting weights for the tag identifications to obtain a tag retrieval text library;
the report retrieval module is used for carrying out fuzzy matching on retrieval keywords input by a user based on the index retrieval text base and the label retrieval text base to obtain a retrieval target report;
and the report output module is used for sequencing the retrieval target reports, outputting sequencing results and displaying the sequencing results.
8. The report retrieval device based on multiple indexes of claim 7, wherein the index library construction module is specifically configured to:
based on a preset timing task, regularly calling a preset report export interface to download all reports;
intercepting an index column in the report and corresponding menu information and report ID;
appointing synonyms for each index in the index column in sequence;
and constructing an index retrieval text library based on the index column, the synonym, the menu information and the report ID.
9. The report retrieval device based on multiple indexes is characterized by comprising the following components: a memory and at least one processor, the memory having instructions stored therein;
the at least one processor invokes the instructions in the memory to cause the multi-index based report retrieval device to perform the multi-index based report retrieval method according to any of claims 1-6.
10. A computer-readable storage medium having instructions stored thereon, wherein the instructions, when executed by a processor, implement the multi-index based report retrieval method according to any one of claims 1-6.
CN202011500570.9A 2020-12-18 2020-12-18 Report retrieval method, device, equipment and storage medium based on multiple indexes Pending CN112632122A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011500570.9A CN112632122A (en) 2020-12-18 2020-12-18 Report retrieval method, device, equipment and storage medium based on multiple indexes

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011500570.9A CN112632122A (en) 2020-12-18 2020-12-18 Report retrieval method, device, equipment and storage medium based on multiple indexes

Publications (1)

Publication Number Publication Date
CN112632122A true CN112632122A (en) 2021-04-09

Family

ID=75316755

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011500570.9A Pending CN112632122A (en) 2020-12-18 2020-12-18 Report retrieval method, device, equipment and storage medium based on multiple indexes

Country Status (1)

Country Link
CN (1) CN112632122A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113723062A (en) * 2021-08-31 2021-11-30 远光软件股份有限公司 Report label editing method and device, storage medium and terminal equipment
CN113888056A (en) * 2021-11-22 2022-01-04 中国工商银行股份有限公司 Report generation method and device
CN115759014A (en) * 2022-11-22 2023-03-07 北京码牛科技股份有限公司 Dynamic intelligent analysis method and system and electronic equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106776741A (en) * 2016-11-18 2017-05-31 北京奇虎科技有限公司 A kind of searching method and device
CN109902100A (en) * 2019-01-31 2019-06-18 平安科技(深圳)有限公司 Report form inquiring method, device and storage medium
CN110334331A (en) * 2019-05-30 2019-10-15 重庆金融资产交易所有限责任公司 Method, apparatus and computer equipment based on order models screening table
CN110955806A (en) * 2019-11-29 2020-04-03 国家电网有限公司客户服务中心 Character string matching method for Chinese text

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106776741A (en) * 2016-11-18 2017-05-31 北京奇虎科技有限公司 A kind of searching method and device
CN109902100A (en) * 2019-01-31 2019-06-18 平安科技(深圳)有限公司 Report form inquiring method, device and storage medium
CN110334331A (en) * 2019-05-30 2019-10-15 重庆金融资产交易所有限责任公司 Method, apparatus and computer equipment based on order models screening table
CN110955806A (en) * 2019-11-29 2020-04-03 国家电网有限公司客户服务中心 Character string matching method for Chinese text

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113723062A (en) * 2021-08-31 2021-11-30 远光软件股份有限公司 Report label editing method and device, storage medium and terminal equipment
CN113888056A (en) * 2021-11-22 2022-01-04 中国工商银行股份有限公司 Report generation method and device
CN115759014A (en) * 2022-11-22 2023-03-07 北京码牛科技股份有限公司 Dynamic intelligent analysis method and system and electronic equipment

Similar Documents

Publication Publication Date Title
CN112632122A (en) Report retrieval method, device, equipment and storage medium based on multiple indexes
CN109754233B (en) Method and system for intelligently recommending position information
US9418144B2 (en) Similar document detection and electronic discovery
US10387435B2 (en) Computer application query suggestions
CN109344230B (en) Code library file generation, code search, coupling, optimization and migration method
US10346457B2 (en) Platform support clusters from computer application metadata
US20060212441A1 (en) Full text query and search systems and methods of use
US20130311487A1 (en) Semantic search using a single-source semantic model
EP1342177A1 (en) Method for structuring and searching information
US20070016581A1 (en) Category setting support method and apparatus
EP2631815A1 (en) Method and device for ordering search results, method and device for providing information
CN106156111B (en) Patent document retrieval method, device and system
CN112307191A (en) Multi-system interactive log query method, device, equipment and storage medium
CN114722137A (en) Security policy configuration method and device based on sensitive data identification and electronic equipment
CN110674365A (en) Searching method, device, equipment and storage medium
CN110347788A (en) Service inquiry method, apparatus and storage medium, computer equipment
CN108062422B (en) Sorting method, intelligent terminal, system and storage medium for paging query
CN108345694B (en) Document retrieval method and system based on theme database
US10339148B2 (en) Cross-platform computer application query categories
CN111078988B (en) Electric power service information hotspot retrieval method and device and electronic equipment
CN113569132A (en) Information retrieval display method and system
CN110633430A (en) Event discovery method, device, equipment and computer readable storage medium
CN117591624B (en) Test case recommendation method based on semantic index relation
JPH07239861A (en) Document retrieving device
CN117435699A (en) Method, device, equipment and medium for automatically constructing learning2rank training set

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination