CN117093875A - Screening item search quality verification method and device and computer equipment - Google Patents

Screening item search quality verification method and device and computer equipment

Info

Publication number
CN117093875A
Authority
CN
China
Prior art keywords
screening
result
search
target
item combination
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311061703.0A
Other languages
Chinese (zh)
Inventor
周林郁
李俊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qichacha Technology Co ltd
Original Assignee
Qichacha Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qichacha Technology Co ltd
Priority to CN202311061703.0A
Publication of CN117093875A

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00: Pattern recognition
    • G06F18/20: Analysing
    • G06F18/22: Matching criteria, e.g. proximity measures
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90: Details of database functions independent of the retrieved data types
    • G06F16/903: Querying
    • G06F16/9035: Filtering based on additional data, e.g. user or group profiles

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Computation (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The application relates to a search quality verification method, device, computer device, storage medium, and computer program product for screening items. The method comprises: determining, from the historical search record of the user, a plurality of screening item combination modes for the search object; determining a target screening item combination mode based on the number of occurrences of each screening item combination mode in the historical search record; combining the target screening item combination mode with the target parameter value under each screening item it includes to obtain the screening conditions to be verified; searching with the screening conditions to be verified to obtain a first screening result and a second screening result; and determining the search quality of the screening items based on the difference information between the first screening result and the second screening result and on the matching result between the second screening result and the screening conditions to be verified. The method can improve the efficiency of verifying the search quality of screening items.

Description

Screening item search quality verification method and device and computer equipment
Technical Field
The present application relates to the field of computer technology, and in particular to a method, an apparatus, a computer device, a storage medium, and a computer program product for verifying the search quality of screening items.
Background
With the development of computer technology, Elasticsearch (ES), a real-time distributed storage, search, and analytics engine with powerful data retrieval capabilities, has emerged to support data filtering. A data structure in ES enables fast document filtering by representing each document as a set of bits, where each bit indicates whether a particular filtering condition matches the document. When a query is executed, ES converts the query into one or more filters and applies them to all documents.
However, when an ES search engine is used for queries, the number of possible combinations of screening items is very large, and verifying the search quality of each combination one by one consumes a great deal of time. Existing methods for verifying search quality are therefore inefficient.
Disclosure of Invention
In view of this, and to address the technical problem that existing methods for verifying search quality are inefficient, it is necessary to provide a search quality verification method, apparatus, computer device, computer-readable storage medium, and computer program product for screening items.
In a first aspect, the present application provides a search quality verification method for screening items. The method comprises the following steps:
according to the historical search record of the user, determining a plurality of screening item combination modes aiming at the search object;
determining a target screening item combination mode from a plurality of screening item combination modes based on the occurrence times of each screening item combination mode in the historical search record;
combining the target screening item combination mode and target parameter values under each screening item included in the target screening item combination mode to obtain screening conditions to be verified; screening items included in the screening conditions to be verified correspond to the screening items in the target screening item combination mode;
searching through screening conditions to be verified at two different searching time points to obtain a first screening result and a second screening result;
and determining the search quality of screening items included in the screening condition to be verified based on the difference information between the first screening result and the second screening result and the matching result between the second screening result and the screening condition to be verified.
In one embodiment, determining a target screening item combination mode from a plurality of screening item combination modes based on the occurrence times of each screening item combination mode in the historical search record comprises: sorting a plurality of screening item combination modes according to the occurrence times; according to the sorting result, determining a screening item combination mode with the corresponding occurrence number being greater than the occurrence number of other screening item combination modes from a plurality of screening item combination modes as a target screening item combination mode; the other screening item combination modes are screening item combination modes except the target screening item combination mode in the multiple screening item combination modes.
In one embodiment, before combining the target screening item combination mode with the target parameter values under each screening item it includes to obtain the screening condition to be verified, the method further includes: for each screening item, obtaining the number of search results for each parameter value under the screening item; and determining, based on the numbers of search results, a target parameter value from the parameter values under the screening item.
In one embodiment, determining a target parameter value from the parameter values under the screening item based on the number of search results includes: sorting the parameter values under the screening item by number of search results; and, according to the sorting result, determining as the target parameter value the parameter value whose number of search results is greater than that of the other parameter values; the other parameter values are the parameter values under the screening item other than the target parameter value.
In one embodiment, determining the search quality of the screening item included in the screening condition to be verified based on the difference information between the first screening result and the second screening result and the matching result between the second screening result and the screening condition to be verified includes: determining first search quality information based on difference information between the first screening result and the second screening result; determining second search quality information based on a matching result between the second screening result and the screening condition to be verified; and obtaining the search quality of the screening items included in the screening condition to be verified according to the first search quality information and the second search quality information.
In one embodiment, the difference information between the first screening result and the second screening result is determined by: acquiring the variation ratio between the numbers of search results corresponding to the first screening result and the second screening result; and obtaining the difference information between the first screening result and the second screening result based on the variation ratio, the variation ratio threshold corresponding to the first screening result, and the magnitude relationship between the numbers of search results corresponding to the first screening result and the second screening result.
In one embodiment, the variation ratio threshold corresponding to the first screening result is determined by: determining, from a plurality of preset numerical intervals, the target numerical interval corresponding to the number of search results included in the first screening result, where each numerical interval has a corresponding variation ratio threshold; and acquiring the variation ratio threshold corresponding to the target numerical interval as the variation ratio threshold corresponding to the first screening result.
In a second aspect, the application further provides a search quality verification device. The device comprises:
the first data confirmation module is used for determining a plurality of screening item combination modes aiming at the retrieval object according to the historical search record of the user;
the second data confirmation module is used for determining a target screening item combination mode from the plurality of screening item combination modes according to the occurrence times of each screening item combination mode in the historical search record;
the data processing module is used for carrying out combination processing on the target screening item combination mode and target parameter values under each screening item included in the target screening item combination mode to obtain screening conditions to be verified; the screening items included in the screening conditions to be verified correspond to the screening items in the target screening item combination mode;
the data retrieval module is used for retrieving through the screening conditions to be verified at two different searching time points respectively to obtain a first screening result and a second screening result;
and the search quality determining module is used for determining the search quality of the screening items included in the screening condition to be verified based on the difference information between the first screening result and the second screening result and/or the matching result between the second screening result and the screening condition to be verified.
In a third aspect, the present application also provides a computer device. The computer device comprises a memory and a processor, the memory stores a computer program, and the processor executes the computer program to realize the search quality verification of the method according to any one of the embodiments of the application.
In a fourth aspect, the present application also provides a computer-readable storage medium. The computer readable storage medium has stored thereon a computer program which, when executed by a processor, performs search quality verification of the method according to any of the embodiments of the present application.
In a fifth aspect, the present application also provides a computer program product. The computer program product comprises a computer program which, when executed by a processor, implements search quality verification of the method according to any of the embodiments of the application.
The search quality verification method, device, computer device, storage medium, and computer program product for screening items described above have the following beneficial effects. A plurality of screening item combination modes for the search object are determined from the historical search record of the user, and a target screening item combination mode is determined from them based on the number of occurrences of each combination mode in the historical search record; deriving the combination mode from the historical search record improves the timeliness of search quality verification. The target screening item combination mode is then combined with the target parameter value under each screening item it includes to obtain the screening conditions to be verified, which comprise the screening items and their specific parameter values; the screening items included in the screening conditions to be verified correspond to the screening items in the target screening item combination mode. Searches are performed with the screening conditions to be verified at two different search time points to obtain a first screening result and a second screening result. Finally, the search quality of the screening items included in the screening conditions to be verified is determined based on the difference information between the first screening result and the second screening result and the matching result between the second screening result and the screening conditions to be verified. Because the target screening item combination mode is selected by its number of occurrences in the historical search record, rather than every combination being verified one by one, the method improves both the timeliness and the efficiency of search quality verification and meets service requirements.
Drawings
FIG. 1 is an application environment diagram of a search quality verification method of screening items in one embodiment;
FIG. 2 is a flow diagram of a method of search quality verification of screening items in one embodiment;
FIG. 3 is a flow diagram of a search quality verification step for a screening item in one embodiment;
FIG. 4 is a block diagram of a search quality verification device for screening items in one embodiment;
FIG. 5 is an internal structural diagram of a computer device in one embodiment.
Detailed Description
The present application will be described in further detail with reference to the drawings and examples, in order to make the objects, technical solutions and advantages of the present application more apparent. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the application.
The method for verifying the search quality of the screening item provided by the embodiment of the application can be applied to an application environment shown in figure 1. Wherein the terminal 102 communicates with the server 104 via a network. The data storage system may store data that the server 104 needs to process. The data storage system may be integrated on the server 104 or may be located on a cloud or other network server. In the application scenario of the present application, after the server 104 obtains the history search record of the user from the terminal 102, the server 104 obtains a plurality of screening item combination modes for the search object according to the history search record of the user; then, determining a target screening item combination mode from a plurality of screening item combination modes based on the occurrence times of each screening item combination mode in the historical search record; further carrying out combination treatment on the target screening item combination mode and target parameter values under each screening item included in the target screening item combination mode to obtain screening conditions to be verified; the screening items included in the screening conditions to be verified correspond to the screening items in the target screening item combination mode; searching through screening conditions to be verified at two different searching time points to obtain a first screening result and a second screening result; and finally, determining the search quality of screening items included in the screening condition to be verified based on the difference information between the first screening result and the second screening result and the matching result between the second screening result and the screening condition to be verified. 
The terminal 102 may be, but not limited to, various personal computers, notebook computers, smart phones, tablet computers, internet of things devices, and portable wearable devices, where the internet of things devices may be smart speakers, smart televisions, smart air conditioners, smart vehicle devices, and the like. The portable wearable device may be a smart watch, smart bracelet, headset, or the like. The server 104 may be implemented as a stand-alone server or as a server cluster of multiple servers.
In one embodiment, as shown in FIG. 2, a method for verifying the search quality of screening items is provided. The method is described here as applied to the server 104 in FIG. 1 and includes the following steps:
Step 202, determining a plurality of screening item combination modes aiming at the retrieval object according to the historical search record of the user.
Specifically, the server analyzes the historical search record of the user to identify the combinations of screening conditions used in searches, and stores these combinations in the data storage system. For example, when a user searches with screening conditions such as enterprise name, line of business, incumbent personnel, and length of operation, the server recognizes and stores that combination. Determining a plurality of screening item combination modes for the search object from the historical search record thus means that the server learns, from the user's historical search behavior, which screening items are commonly used, and can search with combinations of those screening items.
Step 204, determining a target screening item combination mode from a plurality of screening item combination modes based on the occurrence times of each screening item combination mode in the historical search record.
Specifically, the server counts the occurrence times of each screening item combination mode in the historical search record of the user, further analyzes and compares the occurrence times of different screening item combination modes in the historical search record, and can determine the most common screening item combination mode of the user, namely the target screening item combination mode.
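As an illustration only, the counting in step 204 can be sketched in Python. The log format (one list of screening item names per historical search) and the function name `target_combination` are assumptions made for this sketch and are not specified in the application.

```python
from collections import Counter


def target_combination(search_log):
    """Return (combination, occurrences) for the most frequent combination.

    search_log is assumed to be an iterable of records, each a list of the
    screening item names used in one historical search (hypothetical format).
    """
    # frozenset makes ["industry", "province"] and ["province", "industry"]
    # count as the same combination mode.
    counts = Counter(frozenset(record) for record in search_log if record)
    if not counts:
        raise ValueError("search log contains no screening item combinations")
    return counts.most_common(1)[0]


# Hypothetical log entries, for demonstration only.
log = [
    ["industry", "province"],
    ["province", "industry"],
    ["registered_capital"],
    ["industry", "province", "registration_status"],
]
combo, occurrences = target_combination(log)
print(sorted(combo), occurrences)
```

Treating a combination as an unordered set is a design choice of the sketch; if the order in which screening items are applied matters, a tuple could be used as the key instead.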
Step 206, combining the target screening item combination mode and target parameter values under each screening item included in the target screening item combination mode to obtain screening conditions to be verified; the screening items included in the screening conditions to be verified correspond to the screening items in the target screening item combination mode.
Specifically, the screening condition to be verified includes the most frequently used screening item combination mode and specific parameter values. After determining the target screening item combination mode, the server combines each screening item in that combination mode with its corresponding target parameter value to form the screening condition to be verified.
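A minimal sketch of this combination step follows, assuming the target combination is a set of screening item names and the target parameter values are held in a dictionary keyed by screening item name. Both representations, and the name `build_condition`, are hypothetical.

```python
def build_condition(target_combo, target_values):
    """Pair each screening item in the target combination with its target value."""
    missing = sorted(name for name in target_combo if name not in target_values)
    if missing:
        raise KeyError(f"no target parameter value for: {missing}")
    # Only items in the target combination are kept, so the condition's
    # screening items correspond exactly to the combination mode.
    return {name: target_values[name] for name in sorted(target_combo)}


# Hypothetical values, for demonstration only.
condition = build_condition(
    {"industry", "province"},
    {"industry": "software", "province": "Jiangsu", "registration_status": "active"},
)
print(condition)
```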
Step 208, searching through the screening conditions to be verified at two different searching time points to obtain a first screening result and a second screening result.
Specifically, the server uses the screening conditions to be verified to search at two different searching time points to obtain two screening results.
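The two searches could be orchestrated roughly as below. The `search` callable stands in for whatever client issues the request to the search engine; it, the snapshot fields, and the `sleep` parameter are assumptions of this sketch, not an API described in the application.

```python
import time
from datetime import datetime, timezone


def search_twice(search, condition, interval_seconds, sleep=time.sleep):
    """Run the same condition at two time points and return both snapshots."""

    def snapshot():
        hits = list(search(condition))
        return {
            "condition": dict(condition),
            "requested_at": datetime.now(timezone.utc),
            "hits": hits,
        }

    first = snapshot()
    sleep(interval_seconds)  # wait until the second search time point
    second = snapshot()
    return first, second


# Stand-in search function whose result grows between calls (demo only).
_calls = []


def fake_search(condition):
    _calls.append(condition)
    return [{"id": i} for i in range(len(_calls))]


first, second = search_twice(
    fake_search, {"province": "Jiangsu"}, 0, sleep=lambda seconds: None
)
print(len(first["hits"]), len(second["hits"]))
```

Passing `sleep` as a parameter lets a scheduler or a test replace the blocking wait; in practice the second request would more likely be triggered by a scheduled job than by an in-process sleep.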
Step 210, determining the search quality of the screening item included in the screening condition to be verified based on the difference information between the first screening result and the second screening result and the matching result between the second screening result and the screening condition to be verified.
Specifically, the server can evaluate the influence degree of each screening item on the search result by analyzing the difference information and the matching result, and further determine the search quality of the screening item, so that more accurate search results are provided for users.
In the method for verifying the search quality of screening items described above, the server determines a plurality of screening item combination modes for the search object from the historical search record of the user, and determines a target screening item combination mode from them based on the number of occurrences of each combination mode in the historical search record; deriving the combination mode from the historical search record improves the timeliness of search quality verification. The target screening item combination mode is then combined with the target parameter value under each screening item it includes to obtain the screening conditions to be verified, which comprise the screening items and their specific parameter values; the screening items included in the screening conditions to be verified correspond to the screening items in the target screening item combination mode. Searches are performed with the screening conditions to be verified at two different search time points to obtain a first screening result and a second screening result. Finally, the search quality of the screening items included in the screening conditions to be verified is determined based on the difference information between the first screening result and the second screening result and the matching result between the second screening result and the screening conditions to be verified. Because the target screening item combination mode is selected by its number of occurrences in the historical search record, rather than every combination being verified one by one, the method improves both the timeliness and the efficiency of search quality verification and meets service requirements.
In one embodiment, in step 204, determining a target screening item combination from a plurality of screening item combinations based on the number of occurrences of each screening item combination in the historical search record may include:
Step 204a: sorting the plurality of screening item combination modes according to the occurrence times.
Specifically, the screening items may include the industry to which the search object belongs, provincial region, participants, time of establishment, registration status, registered capital, institution type, and the like.
Step 204b: according to the sorting result, determining a screening item combination mode with the corresponding occurrence number being greater than the occurrence number of other screening item combination modes from a plurality of screening item combination modes as a target screening item combination mode; the other screening item combination modes are screening item combination modes except the target screening item combination mode in the multiple screening item combination modes.
Specifically, because the user population and its search behavior change continuously, the verified screening item combination modes are continuously updated to better meet user needs: the screening item combination modes are re-extracted from the logs on a monthly cycle to update the target screening item combination mode.
In this embodiment, the screening item combination modes are sorted by their number of occurrences in the historical search record to determine the target screening item combination mode, which can improve search accuracy and user satisfaction.
Further, in one embodiment, before combining the target screening item combination mode with the target parameter values under each screening item it includes to obtain the screening condition to be verified, the method further includes: for each screening item, obtaining the number of search results for each parameter value under the screening item; and determining, based on the numbers of search results, a target parameter value from the parameter values under the screening item.
In one embodiment, determining a target parameter value from the parameter values under the screening item based on the number of search results comprises:
Step one: sorting the parameter values under the screening item according to the number of search results.
Specifically, the server writes a request script for the search engine, comprising the request protocol, request method, request headers, and request address. It sets up a list of parameter names that are identical to, or reference, the values in the field-position area of the database table, and likewise a list of parameter values that are identical to, or reference, the values in that area. Parameter values are chosen to be as broad as possible so that each yields a large number of screening results, which prevents the result set from being empty after several parameters are combined.
The parameter values are selected as follows: each enumerated parameter value is spliced with its parameter name to generate a filter function, and the filter functions are requested one at a time, without combining them, while the number of screening results is recorded. The enumerated parameter values are then grouped by parameter name and sorted in descending order of screening count within each group; the parameter value with the largest screening count is selected and added to the parameter value list, following the order of the values, or references to values, in the field-position area of the table.
Step two: according to the sorting result, determining as the target parameter value the parameter value whose number of search results is greater than that of the other parameter values; the other parameter values are the parameter values under the screening item other than the target parameter value.
Specifically, because the underlying data changes over time, the number of screening results fluctuates, so the parameter values must be updated continually to ensure that the selected parameter value is still the one with the most screening results. The screening counts are therefore re-sorted on a monthly cycle, the parameter values are updated, and the target parameter values are re-confirmed.
In this embodiment, the search term can be confirmed more accurately by sorting the parameter values and selecting the target parameter value.
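The selection described in steps one and two can be sketched as follows, assuming the single-parameter requests have already been issued and their screening counts recorded in a nested dictionary. The data layout and the name `pick_target_values` are hypothetical.

```python
def pick_target_values(result_counts):
    """Map each screening item to the parameter value with the most results.

    result_counts is assumed to look like
    {screening_item: {parameter_value: number_of_search_results}}.
    """
    targets = {}
    for name, counts in result_counts.items():
        if counts:  # skip screening items with no recorded values
            # Equivalent to sorting in descending order and taking the first.
            targets[name] = max(counts, key=counts.get)
    return targets


# Hypothetical counts, for demonstration only.
result_counts = {
    "province": {"Jiangsu": 1200, "Tibet": 15, "Guangdong": 3400},
    "registration_status": {"active": 9000, "revoked": 700},
}
targets = pick_target_values(result_counts)
print(targets)
```

Re-running this on freshly recorded counts each month would correspond to the periodic update described above.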
In one embodiment, in step 210, determining a search quality of a screening item included in the screening condition to be verified based on difference information between the first screening result and the second screening result and a matching result between the second screening result and the screening condition to be verified includes:
Step 210a: first search quality information is determined based on difference information between the first screening result and the second screening result.
Specifically, the first screening result includes a parameter identifier, the filter-function request parameters of the screening item combination, the processed screening result, and the request time. The second screening result is obtained as follows: after a preset time interval has elapsed, the filter-function request parameters of the screening item combination recorded in the first screening result are read, the request is sent again, and the returned screening result is recorded. The first screening result and the second screening result are then compared to obtain the first search quality information.
Step 210b: determining second search quality information based on a matching result between the second screening result and the screening condition to be verified.
Specifically, after the second screening result has been written as the latest screening result, the request parameters of the screening item combination corresponding to that latest result are read in turn and matched against the screening conditions to be verified, and the second search quality information is obtained from the matching result.
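One possible reading of the matching in step 210b is a check that every document returned by the second search actually satisfies the screening condition. The sketch below assumes exact-equality conditions and documents represented as dictionaries; range-type screening items such as registered capital would need a different comparison. The name `find_mismatches` is hypothetical.

```python
def find_mismatches(hits, condition):
    """Return the hits that violate at least one screening item."""
    return [
        doc
        for doc in hits
        if any(doc.get(field) != expected for field, expected in condition.items())
    ]


# Hypothetical condition and hits, for demonstration only.
condition = {"province": "Jiangsu", "registration_status": "active"}
hits = [
    {"name": "A", "province": "Jiangsu", "registration_status": "active"},
    {"name": "B", "province": "Zhejiang", "registration_status": "active"},
    {"name": "C", "province": "Jiangsu"},  # field missing entirely
]
bad = find_mismatches(hits, condition)
print([doc["name"] for doc in bad])
```

An empty list would indicate that the second screening result matches the condition; any entries point to documents the screening item should have excluded.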
Step 210c: obtaining the search quality of the screening items included in the screening condition to be verified according to the first search quality information and the second search quality information.
Specifically, the two pieces of search quality information obtained in the preceding steps are compared to verify whether the search quality of the screening items has improved.
Further, in one embodiment, the difference information between the first screening result and the second screening result is determined by: acquiring the variation ratio between the numbers of search results corresponding to the first screening result and the second screening result; and obtaining the difference information between the first screening result and the second screening result based on the variation ratio, the variation ratio threshold corresponding to the first screening result, and the magnitude relationship between the numbers of search results corresponding to the first screening result and the second screening result.
Specifically, the data of a typical company data source grows steadily as records are continuously collected and accumulated, whereas the data synchronized into the search engine may be lost or may fail to synchronize. Of the two cases, more results or fewer results, a decrease in screening results is therefore the more dangerous one. Even when the data is stable, company information may change: for example, if a company's registered capital changes from fifty thousand yuan to thirty thousand yuan, the number of results for a given screening condition can decrease. A threshold, for example 10%, is therefore set for the difference ratio; if the ratio exceeds the threshold, the gap between the screening results is considered too large and an alarm is pushed. However, the ratio depends on both its numerator and its denominator, that is, on the amount of difference and on the count of the first screening result, so when the first screening result is small, a small fluctuation in the amount of difference has a large effect on the ratio. For example, if the first screening result contains ten items and the second contains eight, the difference ratio is |8-10|/10 = 20%, which exceeds the threshold even though the absolute difference is small.
In this embodiment, the change ratio between the numbers of search results corresponding to the first screening result and the second screening result is acquired and evaluated against the change ratio threshold corresponding to the first screening result, so that the difference information between the first screening result and the second screening result is more accurate.
In one embodiment, the change ratio threshold corresponding to the first screening result is determined by: determining, from a plurality of preset numerical intervals, a target numerical interval corresponding to the number of search results included in the first screening result, each numerical interval having a corresponding change ratio threshold; and acquiring the change ratio threshold corresponding to the target numerical interval as the change ratio threshold corresponding to the first screening result.
Specifically, a tiered ratio threshold needs to be set. For example, when the number of first screening results is less than one hundred, the ratio threshold is set to 30%; when the number is more than one hundred and less than ten thousand, the ratio threshold is set to 20%; when the number is greater than ten thousand, the ratio threshold is set to 10%; and so on. The specific thresholds may be adjusted according to the alarm quality. Meanwhile, different priorities are set for the cases of more results and fewer results: an increase is given normal priority with a yellow alarm color, and a decrease is given urgent priority with a red alarm color.
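A minimal sketch of such a tiered threshold with direction-dependent priority is given below. The names TIERS, threshold_for and check_count_change are hypothetical, and the handling of counts of exactly one hundred or exactly ten thousand is an assumption, since the example values above leave those boundaries open.

```python
from typing import Optional, Tuple

# (exclusive upper bound of the first result count, ratio threshold).
# Assumption: a count of exactly 100 falls in the 20% tier and a count of
# exactly 10,000 falls in the 10% tier.
TIERS = [(100, 0.30), (10_000, 0.20), (float("inf"), 0.10)]


def threshold_for(first_count: int) -> float:
    """Return the ratio threshold of the interval containing first_count."""
    for upper_bound, threshold in TIERS:
        if first_count < upper_bound:
            return threshold
    return TIERS[-1][1]


def check_count_change(first_count: int, second_count: int) -> Optional[Tuple[str, str]]:
    """Return (priority, color) when the change exceeds the threshold, else None."""
    if first_count <= 0:
        raise ValueError("first_count must be positive")
    ratio = abs(second_count - first_count) / first_count
    if ratio <= threshold_for(first_count):
        return None
    if second_count < first_count:
        return ("urgent", "red")      # fewer results: the more dangerous case
    return ("normal", "yellow")       # more results
```

Under these assumed tiers, the ten-versus-eight example no longer raises an alarm, because a 20% change is within the 30% threshold that applies to counts below one hundred.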
In this embodiment, change ratio thresholds are configured for different numerical intervals of the screening result count, the change ratio threshold of the target numerical interval is used for the first screening result, and increases and decreases are given different priorities, which can improve the timeliness with which the server obtains the change ratio threshold corresponding to the first screening result.
In one embodiment, to help those skilled in the art understand the embodiments of the present application, the search quality verification method for screening items provided by the present application is described in detail below with reference to fig. 3, taking a company as the search object as an example.
It should be noted that, when the search object is a company, the selectable screening items generally include: industry, province area, number of participants, time of establishment, registration status, registered capital, institution type, and the like. For example, a user who wants to find companies in the software industry in City B with registered capital greater than fifty thousand can construct the screening condition: registered capital > 50000 and province area = Province A, City B and industry = software and information technology services industry. The screening items involved in this screening condition, registered capital + province area + industry, form a screening item combination.
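As an illustration only, a screening condition and its screening item combination might be represented as follows. The field names (registered_capital, province_area, industry), the textual filter syntax and the helper names are assumptions and are not taken from the application.

```python
from typing import List, Tuple

# One condition is (screening item name, comparison operator, parameter value).
Condition = Tuple[str, str, object]


def build_filter(conditions: List[Condition]) -> str:
    """Splice item names, operators and values into one filter expression."""
    parts = []
    for name, op, value in conditions:
        # Numbers are rendered bare, everything else is quoted.
        rendered = str(value) if isinstance(value, (int, float)) else f"'{value}'"
        parts.append(f"{name} {op} {rendered}")
    return " and ".join(parts)


def combination_of(conditions: List[Condition]) -> Tuple[str, ...]:
    """The screening item combination is just the items, without their values."""
    return tuple(name for name, _, _ in conditions)


conditions = [
    ("registered_capital", ">", 50000),
    ("province_area", "=", "Province A City B"),
    ("industry", "=", "software and information technology services industry"),
]
print(build_filter(conditions))
print(combination_of(conditions))
```

Separating the combination from the concrete values is what allows the same combination to be counted across many different user searches.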
In this embodiment, the specific steps of the method for verifying the search quality of the screening item include:
S302, determining a target screening item combination mode for a company.
Specifically, multiple screening item combination modes are determined according to the search records of users: the filter functions passed as input parameters of screening requests are parsed at the interface layer, and the screening item combinations that occur most often and are used most frequently in those filter functions are found through a specified method.
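The counting step can be sketched as follows, assuming each search record has already been parsed into a mapping from screening item name to parameter value. The record layout and the function name top_combinations are hypothetical; the application does not specify the "specified method".

```python
from collections import Counter
from typing import Dict, List, Tuple


def top_combinations(search_records: List[Dict[str, object]], n: int = 1) -> List[Tuple[Tuple[str, ...], int]]:
    """Count how often each screening item combination occurs and return the top n."""
    counts = Counter(
        tuple(sorted(record))          # item names only, order-insensitive
        for record in search_records
        if record                      # ignore searches without any screening item
    )
    return counts.most_common(n)


records = [
    {"industry": "software", "registered_capital": 50000},
    {"registered_capital": 30000, "industry": "retail"},
    {"registration_status": "survival"},
]
print(top_combinations(records, n=1))
```

Sorting the item names makes two searches that use the same items in a different order count as the same combination; a fixed position order would serve the same purpose.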
S304, screening the parameter values under each screening item in the target screening item combination mode to obtain the target parameter value corresponding to each screening item.
Specifically, each parameter value is spliced with its parameter name to generate a filter function, the filter functions are requested one at a time in sequence, and the screening count of each request is recorded. The screening counts of the enumerated parameter values are grouped by parameter name and arranged in descending order, and the parameter value with the largest screening count is selected as the input parameter for the screening item combination mode. According to the position order of the screening items, the corresponding inserted data is (software and information technology service industry, Jiangsu, Suzhou, null, null, 50000, null), and the value of the corresponding parameter identification bit test code is 1100010: a position is marked 1 when the screening item at that position carries a parameter in the current row, and 0 when it does not. The screening items and the corresponding parameter identification bit test code are inserted into a database as one row of data. After the rows are aggregated on the test code field and sorted by count in descending order, the screening item combination modes frequently used by users are obtained, and the top ten thousand screening item combination modes are stored as the first screening result of the screening condition.
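The parameter identification bit code can be sketched as below. The sketch assumes seven positions in the order of the selectable screening items listed earlier, with the province area occupying a single position; the English field names and both function names are hypothetical.

```python
from collections import Counter
from typing import Dict, List, Optional, Tuple

# Assumed fixed position order of the screening items.
FILTER_ITEMS = (
    "industry",
    "province_area",
    "number_of_participants",
    "establishment_time",
    "registration_status",
    "registered_capital",
    "institution_type",
)


def identification_code(params: Dict[str, Optional[object]]) -> str:
    """Mark each position 1 when the item carries a parameter, otherwise 0."""
    return "".join("1" if params.get(name) is not None else "0" for name in FILTER_ITEMS)


def most_used_codes(rows: List[Dict[str, Optional[object]]], n: int) -> List[Tuple[str, int]]:
    """Aggregate rows by their code and return the n most frequent codes."""
    return Counter(identification_code(row) for row in rows).most_common(n)


row = {
    "industry": "software and information technology service industry",
    "province_area": "Jiangsu Suzhou",
    "registered_capital": 50000,
}
print(identification_code(row))
```

Because the code depends only on which positions are filled, rows with different parameter values but the same screening items aggregate into the same group.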
For example, the enumerated parameter values of the registration status include incumbent, survival, emigration, cancellation and suspension. Each enumerated value is spliced with the single screening item, registration status, into a filter function and requested, and the result counts are arranged in descending order: survival (147551077), cancellation (105402557), incumbent (41923187), suspension (18411268), emigration (499086), emigration (313). The parameter value with the largest screening count, here survival, is selected as the parameter value. Because the data is fluid and the screening counts fluctuate, the parameter value needs to be updated continuously to ensure that the selected value remains the one with the largest screening count: the counts are re-requested and re-sorted on a monthly cycle, and the parameter value is updated accordingly.
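A minimal sketch of selecting the target parameter value follows. The callable count_results stands in for the single-item filter request against the search engine and is purely hypothetical; the counts in the usage example are the first four figures quoted above.

```python
from typing import Callable, Iterable, List, Tuple


def rank_parameter_values(
    item: str,
    values: Iterable[str],
    count_results: Callable[[str, str], int],
) -> List[Tuple[str, int]]:
    """Request each value on its own and sort by result count, descending."""
    counted = [(value, count_results(item, value)) for value in values]
    return sorted(counted, key=lambda pair: pair[1], reverse=True)


def target_parameter_value(
    item: str,
    values: Iterable[str],
    count_results: Callable[[str, str], int],
) -> str:
    """Pick the value with the largest result count as the target value."""
    ranked = rank_parameter_values(item, values, count_results)
    if not ranked:
        raise ValueError(f"no parameter values for {item!r}")
    return ranked[0][0]


# Stand-in for the search engine, using counts quoted in the text.
COUNTS = {
    "survival": 147551077,
    "cancellation": 105402557,
    "incumbent": 41923187,
    "suspension": 18411268,
}
fake_count = lambda item, value: COUNTS[value]
print(target_parameter_value("registration_status", COUNTS, fake_count))
```

Re-running the same selection on a monthly schedule, as described above, keeps the chosen value aligned with the current data.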
S306, determining a first screening result.
Specifically, after a request is sent using the filter function, the screening result is written into the first screening result set.
Statistics on the historical search records of users show that clicks on the first five companies in the screening results account for about 80% of the total clicks; that is, the correctness of the first five companies greatly affects the user's experience of screening companies with the search engine, so the first five screening results are an important index for screening item quality verification. Provided that the interface retrieves data correctly, the screening count depends on the data. The data of a data source generally changes smoothly and does not change drastically; a drastic change may be caused by a data collection failure at the data source, a revision of the data source website, or a data cleaning error. If the interface retrieval is correct, it may instead be caused by a failure of ES data synchronization or a failure of the search engine. Setting a reasonable threshold to monitor changes in the screening count allows defects in the data and ES layers to be found in time, so the screening count is also an important index for screening item quality verification. The screening result is then processed: the first five company records and the screening count of the current screening item combination are extracted, where a company record includes the name of the hit company and the hit information. The parameter identifier, the filter function request parameters of the screening item combination, the processed screening result and the request time are together taken as the first screening result.
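One possible shape for a stored screening result is sketched below. The class name ScreeningSnapshot, its field names and the layout of a hit record are assumptions made for illustration; the application only names the pieces of information that are stored.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Dict, List


@dataclass(frozen=True)
class ScreeningSnapshot:
    identification_code: str             # parameter identifier of the combination
    request_params: str                  # filter function request parameters
    top_records: List[Dict[str, str]]    # first five hit company records
    total_count: int                     # screening count of the whole result
    requested_at: datetime               # request time


def make_snapshot(
    code: str,
    request_params: str,
    hits: List[Dict[str, str]],
    total_count: int,
    top_n: int = 5,
) -> ScreeningSnapshot:
    """Keep only the first top_n hits together with the total count."""
    return ScreeningSnapshot(
        identification_code=code,
        request_params=request_params,
        top_records=list(hits[:top_n]),
        total_count=total_count,
        requested_at=datetime.now(timezone.utc),
    )


hits = [{"name": f"Company {i}", "hit_info": "registered capital"} for i in range(8)]
snapshot = make_snapshot("1100010", "registered_capital > 50000", hits, total_count=8)
print(len(snapshot.top_records), snapshot.total_count)
```

Storing the request parameters alongside the result is what lets the same request be replayed later to produce the second screening result.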
S308, determining a second screening result.
Specifically, at a preset frequency, the same filter function as that of the first screening result is obtained and the request step is executed again. A designated time interval is set; at that interval, the filter function request parameters of the screening item combinations are read from the first screening result, the request is sent again, and the screening result is written into the second screening result set. The written information is the same as in step S306, and the screening result from before the designated interval serves as the reference. The designated time interval is generally set to one day and needs to cover the period in which full and incremental data synchronization and the release of tasks by the search engine's developers take place, so as to check that the data synchronization and the program changes have introduced no defects.
S310, determining the search quality according to the first screening result and the second screening result.
Specifically, the two screening results are compared: for each screening condition, the latest screening result is compared with the first screening result set, and the search quality is verified both from the change in the latest screening result and from the consistency between the screening result and the screening condition. The first method mainly examines how the values of the latest screening result have changed compared with the screening result before the designated time interval; the smaller the change, the more stable the screening result. The second method verifies the consistency between the company information in the latest screening result and the screening condition; inconsistent information indicates an error in the screening result. The details of the first five companies in the screening result are compared against the screening condition. A mapping relation between screening conditions and company detail fields is established; for example, if the screening condition is that the registered capital is greater than fifty thousand, the registered capital of the company is obtained through the detail field stored in the mapping relation and compared with fifty thousand. If it is greater, the consistency is verified; otherwise the screening result is wrong. For a failed verification, the preset screening condition, the name of the company inconsistent with the screening condition, the expected value and the actual value are displayed in a red style with urgent priority, pushed to the alarm group and stored in the abnormal change database, and developers check them one by one. Such inconsistencies may be caused by a screening failure in the search engine, among other reasons.
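The consistency check of the second method can be sketched as follows, assuming the mapping relation has already resolved each screening condition to a company detail field. The operator table, the field names and the function name find_mismatches are hypothetical.

```python
import operator
from typing import Dict, List, Tuple

# Assumed set of comparison operators supported by screening conditions.
OPERATORS = {
    ">": operator.gt,
    ">=": operator.ge,
    "<": operator.lt,
    "<=": operator.le,
    "=": operator.eq,
}

# One condition is (detail field, operator, expected value).
Condition = Tuple[str, str, object]


def find_mismatches(
    conditions: List[Condition],
    companies: List[Dict[str, object]],
    top_n: int = 5,
) -> List[Dict[str, object]]:
    """Check the first top_n companies against every condition."""
    mismatches = []
    for company in companies[:top_n]:
        for field, op, expected in conditions:
            actual = company.get(field)
            # A missing detail field is treated as a failed verification.
            if actual is None or not OPERATORS[op](actual, expected):
                mismatches.append(
                    {
                        "company": company.get("name"),
                        "condition": f"{field} {op} {expected}",
                        "actual": actual,
                    }
                )
    return mismatches


conditions = [("registered_capital", ">", 50000)]
companies = [
    {"name": "A", "registered_capital": 80000},
    {"name": "B", "registered_capital": 30000},
]
print(find_mismatches(conditions, companies))
```

Each returned entry carries the condition, the company name and the actual value, which corresponds to the alarm content described above.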
In this embodiment, the application can judge the screening quality of screening items efficiently and in batches, can cover a large share of the complex and massive screening item combination modes used by users, can track and give early warning of abnormal screening results, and can improve the stability and accuracy of the screening results in the search engine.
It should be understood that, although the steps in the flowcharts related to the embodiments described above are shown sequentially as indicated by the arrows, these steps are not necessarily performed in the order indicated by the arrows. Unless explicitly stated herein, the execution of these steps is not strictly limited in order, and the steps may be executed in other orders. Moreover, at least some of the steps in the flowcharts described in the above embodiments may include a plurality of sub-steps or stages, which are not necessarily performed at the same time but may be performed at different times, and which are not necessarily performed sequentially but may be performed in turn or alternately with other steps or with at least some of the sub-steps or stages of other steps.
In an embodiment, as shown in fig. 4, based on the same inventive concept, the embodiment of the present application further provides a search quality verification apparatus for implementing the above-mentioned search quality verification method for screening items. The apparatus comprises: a first data confirmation module 401, a second data confirmation module 402, a data processing module 403, a data retrieval module 404, and a search quality determination module 405, wherein:
The first data confirmation module 401 is configured to determine, according to a historical search record of a user, a plurality of screening item combination manners for a search object.
A second data confirmation module 402, configured to determine a target screening item combination manner from a plurality of screening item combination manners based on the number of occurrences of each screening item combination manner in the historical search record.
The data processing module 403 is configured to perform a combination process on the target screening item combination mode and target parameter values under each screening item included in the target screening item combination mode, so as to obtain a screening condition to be verified; screening items included in the screening conditions to be verified correspond to the screening items in the target screening item combination mode.
The data retrieval module 404 is configured to retrieve through the to-be-verified screening condition at two different search time points, respectively, to obtain a first screening result and a second screening result.
The search quality determining module 405 is configured to determine a search quality of a screening item included in the screening condition to be verified based on difference information between the first screening result and the second screening result and/or a matching result between the second screening result and the screening condition to be verified.
In one embodiment, the first data confirmation module 401 is further configured to sort the multiple screening item combination manners according to the number of occurrences in the history search record; according to the sorting result, determining a screening item combination mode with the corresponding occurrence number being greater than the occurrence number of other screening item combination modes from a plurality of screening item combination modes as a target screening item combination mode; the other screening item combination modes are screening item combination modes except the target screening item combination mode in the multiple screening item combination modes.
In one embodiment, the data processing module 403 further includes a first data processing sub-module, configured to obtain, for each screening item, the search result number of each parameter value under the screening item; and to determine, based on the search result number, a target parameter value from the parameter values under the screening item.
In one embodiment, the data processing module 403 further includes a second data processing sub-module, configured to sort the parameter values under the screening item according to the search result number; and to determine, according to the sorting result, the parameter value whose search result number is larger than that of the other parameter values as the target parameter value; the other parameter values are the parameter values other than the target parameter value among the parameter values under the screening item.
In one embodiment, the search quality determination module 405 further includes a first search quality determination sub-module for determining first search quality information based on difference information between the first screening result and the second screening result; determining second search quality information based on a matching result between the second screening result and the screening condition to be verified; and obtaining the search quality of the screening items included in the screening condition to be verified according to the first search quality information and the second search quality information.
In one embodiment, the search quality determining module 405 further includes a second search quality determining sub-module, configured to obtain the change ratio between the numbers of search results corresponding to the first screening result and the second screening result; and to obtain the difference information between the first screening result and the second screening result based on the change ratio, the change ratio threshold corresponding to the first screening result, and the magnitude relation between the numbers of search results corresponding to the first screening result and the second screening result.
In one embodiment, the search quality determining module 405 further includes a third search quality determining sub-module, configured to determine, from a plurality of preset numerical intervals, a target numerical interval corresponding to the number of search results included in the first screening result, each numerical interval having a corresponding change ratio threshold; and to acquire the change ratio threshold corresponding to the target numerical interval as the change ratio threshold corresponding to the first screening result.
The respective modules in the above-described search quality verification apparatus may be implemented in whole or in part by software, hardware, or a combination thereof. The above modules may be embedded in, or independent of, a processor in the computer device in the form of hardware, or may be stored in a memory in the computer device in the form of software, so that the processor can call and execute the operations corresponding to the above modules.
In one embodiment, a computer device is provided, which may be a server, the internal structure of which may be as shown in fig. 5. The computer device includes a processor, a memory, an Input/Output interface (I/O) and a communication interface. The processor, the memory and the input/output interface are connected through a system bus, and the communication interface is connected to the system bus through the input/output interface. Wherein the processor of the computer device is configured to provide computing and control capabilities. The memory of the computer device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system, computer programs, and a database. The internal memory provides an environment for the operation of the operating system and computer programs in the non-volatile storage media. The database of the computer device is for storing screening item data. The input/output interface of the computer device is used to exchange information between the processor and the external device. The communication interface of the computer device is used for communicating with an external terminal through a network connection. The computer program, when executed by a processor, implements a method of search quality verification of a filter term.
It will be appreciated by those skilled in the art that the structure shown in FIG. 5 is merely a block diagram of some of the structures associated with the present inventive arrangements and is not limiting of the computer device to which the present inventive arrangements may be applied, and that a particular computer device may include more or fewer components than shown, or may combine some of the components, or have a different arrangement of components.
In one embodiment, a computer-readable storage medium is provided, on which a computer program is stored which, when executed by a processor, implements the steps of the method embodiments described above.
In an embodiment, a computer program product is provided, comprising a computer program which, when executed by a processor, implements the steps of the method embodiments described above.
It should be noted that, the user information (including but not limited to user equipment information, user personal information, etc.) and the data (including but not limited to data for analysis, stored data, presented data, etc.) related to the present application are information and data authorized by the user or sufficiently authorized by each party, and the collection, use and processing of the related data need to comply with the related laws and regulations and standards of the related country and region.
Those skilled in the art will appreciate that implementing all or part of the above described methods may be accomplished by way of a computer program stored on a non-transitory computer readable storage medium, which when executed, may comprise the steps of the embodiments of the methods described above. Any reference to memory, database, or other medium used in embodiments provided herein may include at least one of non-volatile and volatile memory. The nonvolatile Memory may include Read-Only Memory (ROM), magnetic tape, floppy disk, flash Memory, optical Memory, high density embedded nonvolatile Memory, resistive random access Memory (ReRAM), magnetic random access Memory (Magnetoresistive Random Access Memory, MRAM), ferroelectric Memory (Ferroelectric Random Access Memory, FRAM), phase change Memory (Phase Change Memory, PCM), graphene Memory, and the like. Volatile memory can include random access memory (Random Access Memory, RAM) or external cache memory, and the like. By way of illustration, and not limitation, RAM can be in the form of a variety of forms, such as static random access memory (Static Random Access Memory, SRAM) or dynamic random access memory (Dynamic Random Access Memory, DRAM), and the like. The databases referred to in the embodiments provided herein may include at least one of a relational database and a non-relational database. The non-relational database may include, but is not limited to, a blockchain-based distributed database, and the like. The processor referred to in the embodiments provided in the present application may be a general-purpose processor, a central processing unit, a graphics processor, a digital signal processor, a programmable logic unit, a data processing logic unit based on quantum computing, or the like, but is not limited thereto.
The technical features of the above embodiments may be arbitrarily combined, and all possible combinations of the technical features in the above embodiments are not described for brevity of description, however, as long as there is no contradiction between the combinations of the technical features, they should be considered as the scope of the description.
The foregoing examples illustrate only a few embodiments of the application and are described in detail herein without thereby limiting the scope of the application. It should be noted that it will be apparent to those skilled in the art that several variations and modifications can be made without departing from the spirit of the application, which are all within the scope of the application. Accordingly, the scope of the application should be assessed as that of the appended claims.

Claims (11)

1. A method of search quality verification of a filter term, the method comprising:
according to the historical search record of the user, determining a plurality of screening item combination modes aiming at the search object;
determining a target screening item combination mode from the plurality of screening item combination modes based on the occurrence times of each screening item combination mode in the historical search record;
combining the target screening item combination mode and target parameter values under each screening item included in the target screening item combination mode to obtain screening conditions to be verified; the screening items included in the screening conditions to be verified correspond to the screening items in the target screening item combination mode;
searching through the screening conditions to be verified at two different searching time points to obtain a first screening result and a second screening result;
and determining the search quality of screening items included in the screening condition to be verified based on the difference information between the first screening result and the second screening result and the matching result between the second screening result and the screening condition to be verified.
2. The method of claim 1, wherein the determining a target screening item combination mode from the plurality of screening item combination modes based on the occurrence times of each screening item combination mode in the historical search record comprises:
sorting the multiple screening item combination modes according to the occurrence times;
according to the sorting result, determining a screening item combination mode with the corresponding occurrence number being greater than the occurrence number of other screening item combination modes from the plurality of screening item combination modes as a target screening item combination mode; the other screening item combination modes are screening item combination modes except the target screening item combination mode in the multiple screening item combination modes.
3. The method according to claim 1, wherein the combining the target screening item combination and the target parameter values under each screening item included in the target screening item combination, before obtaining the screening condition to be verified, further includes:
for each screening item, acquiring the search result number of each parameter value under the screening item;
and determining a target parameter value from each parameter value under the screening item based on the search result number.
4. The method according to claim 3, wherein the determining a target parameter value from each parameter value under the screening item based on the search result number comprises:
sorting the parameter values under the screening items according to the search result number;
according to the sorting result, determining a parameter value of which the corresponding search result number is larger than that of other parameter values from the parameter values, and taking the parameter value as a target parameter value; the other parameter values are parameter values other than the target parameter value among the respective parameter values under the screen item.
5. The method of claim 1, wherein the determining the search quality of the screen item included in the to-be-verified screening condition based on the difference information between the first screening result and the second screening result and the matching result between the second screening result and the to-be-verified screening condition comprises:
determining first search quality information based on difference information between the first screening result and the second screening result;
determining second search quality information based on a matching result between the second screening result and the screening condition to be verified;
and obtaining the search quality of the screening items included in the screening condition to be verified according to the first search quality information and the second search quality information.
6. The method of claim 5, wherein the difference information between the first screening result and the second screening result is determined by:
acquiring the variation ratio of the search result number corresponding to the first screening result and the second screening result;
and obtaining difference information between the first screening result and the second screening result based on the change ratio, a change ratio threshold corresponding to the first screening result and the size relation between the number of search results corresponding to the first screening result and the second screening result.
7. The method of claim 6, wherein the change ratio threshold corresponding to the first screening result is determined by:
determining a target numerical value interval corresponding to the number of search results included in the first screening result from a plurality of preset numerical value intervals; each numerical interval has a corresponding change ratio threshold;
and acquiring the change ratio threshold corresponding to the target numerical value interval as the change ratio threshold corresponding to the first screening result.
8. A search quality verification apparatus, the apparatus comprising:
the first data confirmation module is used for determining a plurality of screening item combination modes aiming at the retrieval object according to the historical search record of the user;
the second data confirmation module is used for determining a target screening item combination mode from the plurality of screening item combination modes according to the occurrence times of each screening item combination mode in the historical search record;
the data processing module is used for carrying out combination processing on the target screening item combination mode and target parameter values under each screening item included in the target screening item combination mode to obtain screening conditions to be verified; the screening items included in the screening conditions to be verified correspond to the screening items in the target screening item combination mode;
the data retrieval module is used for retrieving through the screening conditions to be verified at two different searching time points respectively to obtain a first screening result and a second screening result;
and the search quality determining module is used for determining the search quality of the screening items included in the screening condition to be verified based on the difference information between the first screening result and the second screening result and/or the matching result between the second screening result and the screening condition to be verified.
9. A computer device comprising a memory and a processor, the memory storing a computer program, characterized in that the processor implements the steps of the method of any of claims 1 to 6 when the computer program is executed.
10. A computer readable storage medium, on which a computer program is stored, characterized in that the computer program, when being executed by a processor, implements the steps of the method of any of claims 1 to 6.
11. A computer program product comprising a computer program, characterized in that the computer program, when being executed by a processor, implements the steps of the method of any of claims 1 to 6.
CN202311061703.0A 2023-08-22 2023-08-22 Screening item search quality verification method and device and computer equipment Pending CN117093875A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311061703.0A CN117093875A (en) 2023-08-22 2023-08-22 Screening item search quality verification method and device and computer equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311061703.0A CN117093875A (en) 2023-08-22 2023-08-22 Screening item search quality verification method and device and computer equipment

Publications (1)

Publication Number Publication Date
CN117093875A true CN117093875A (en) 2023-11-21

Family

ID=88769388

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311061703.0A Pending CN117093875A (en) 2023-08-22 2023-08-22 Screening item search quality verification method and device and computer equipment

Country Status (1)

Country Link
CN (1) CN117093875A (en)

Similar Documents

Publication Publication Date Title
CN102314460B (en) Data analysis method and system and servers
KR20150080533A (en) Characterizing data sources in a data storage system
CN112527783A (en) Data quality probing system based on Hadoop
CN107633015A (en) A kind of data processing method, device and equipment
CN111240876A (en) Fault positioning method and device for microservice, storage medium and terminal
US20220229854A1 (en) Constructing ground truth when classifying data
CN115587670A (en) Product quality diagnosis method and device based on index map
CN111913860A (en) Operation behavior analysis method and device
CN113010208A (en) Version information generation method, version information generation device, version information generation equipment and storage medium
CN116561607A (en) Method and device for detecting abnormality of resource interaction data and computer equipment
CN116186116A (en) Asset problem analysis method based on equal protection assessment
CN117093875A (en) Screening item search quality verification method and device and computer equipment
CN114860819A (en) Method, device, equipment and storage medium for constructing business intelligent system
CN115481026A (en) Test case generation method and device, computer equipment and storage medium
CN115293682A (en) Abnormal logistics order monitoring method and related device
CN112148721B (en) Data checking method and device, electronic equipment and storage medium
CN113778996A (en) Large data stream data processing method and device, electronic equipment and storage medium
CN111507397A (en) Abnormal data analysis method and device
CN116795723B (en) Chain unit test processing method and device and computer equipment
CN115629958A (en) Universal field level automatic checking method and device for different service interfaces
CN117743190A (en) Verification method and device for interface data flow playback and computer equipment
CN117312283A (en) Database and table data verification method and device, computer equipment and storage medium
Gorsuch et al. Family Matters: Development of new family interrelationship variables for US IPUMS data projects
CN115795120A (en) User portrait information verification method and device
CN118132562A (en) Data association method, device, storage medium and terminal

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination