WO2022105775A1 - Search processing method and apparatus, model training method and apparatus, and medium and device - Google Patents

Search processing method and apparatus, model training method and apparatus, and medium and device Download PDF

Info

Publication number
WO2022105775A1
WO2022105775A1 PCT/CN2021/131113 CN2021131113W WO2022105775A1 WO 2022105775 A1 WO2022105775 A1 WO 2022105775A1 CN 2021131113 W CN2021131113 W CN 2021131113W WO 2022105775 A1 WO2022105775 A1 WO 2022105775A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
historical
historical search
target
search
Prior art date
Application number
PCT/CN2021/131113
Other languages
French (fr)
Chinese (zh)
Inventor
王鑫宇
张永华
Original Assignee
北京字节跳动网络技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京字节跳动网络技术有限公司 filed Critical 北京字节跳动网络技术有限公司
Publication of WO2022105775A1 publication Critical patent/WO2022105775A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9532Query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9538Presentation of query results
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Definitions

  • the present disclosure relates to the field of search technology, and in particular, to a search processing method, a model training method, an apparatus, a medium, a computer program product, and a device.
  • a search is generally performed according to search information such as search words or search sentences input by a user, so as to obtain corresponding search results.
  • search information such as search words or search sentences input by a user
  • the correlation between the search results and the search information input by the user can reflect whether the search results conform to the user's search intent. Wherein, the higher the correlation between the search result and the search information, it can be represented that the search result is more in line with the user's search intention.
  • the present disclosure provides a search processing method, the method includes: receiving target search information; determining a target search result according to the target search information, and determining the target search result and the target through a correlation determination model The target correlation between the search information; wherein, the correlation determination model is obtained by training in the following way: according to the historical operation behavior information performed by the user on a plurality of historical search results, respectively determine each historical search result and historical search information The correlation between the historical search results and the historical search information with the same target text summary information is set to be the same, and the historical search results are based on the historical search results input by the user.
  • the search information is obtained by searching; the historical search information and the historical search results are used as the input of the model, and the correlation between the historical search results and the historical search information is used as the target output of the model. Training is performed to obtain the correlation determination model.
  • the present disclosure provides a method for training a correlation determination model, the method comprising: determining the correlation between each historical search result and the historical search information according to historical operation behavior information performed by a user on a plurality of historical search results.
  • the degree of correlation between the historical search results with the same target text summary information and the historical search information is set to be the same, and the historical search results are searched according to the historical search information input by the user Obtained; using the historical search information and the historical search results as the input of the model, and the correlation between the historical search results and the historical search information as the target output of the model, the model is trained to The correlation determination model is obtained.
  • the present disclosure provides a search processing device, the device comprising: a receiving module for receiving target search information; a target relevance determination module for determining a target search result according to the target search information, and through correlation
  • the degree determination model determines the target correlation between the target search result and the target search information; wherein, the correlation determination model is obtained by training in the following way: according to the historical operations performed by the user on a plurality of historical search results behavior information, respectively determining the correlation between each historical search result and historical search information, wherein the correlation between the historical search results and the historical search information with the same target text abstract information is set to be the same,
  • the historical search result is obtained by searching according to the historical search information input by the user; the historical search information and the historical search result are used as the input of the model, and the relationship between the historical search result and the historical search information is obtained.
  • the correlation degree is taken as the target output of the model, and the model is trained to obtain the correlation degree determination model.
  • the present disclosure provides a correlation determination model training device, the device includes: a correlation determination module, configured to determine the correlation between each historical search result and each historical search result according to historical operation behavior information performed by a user on a plurality of historical search results.
  • the correlation between historical search information wherein the correlation between the historical search results with the same target text summary information and the historical search information is set to be the same, and the historical search results are based on user input
  • the historical search information is obtained by searching; the training module is used to use the historical search information and the historical search results as the input of the model, and the correlation between the historical search results and the historical search information is used as the model
  • the target output of the model is trained to obtain the correlation determination model.
  • the present disclosure provides a computer-readable medium on which a computer program is stored, and when the program is executed by a processing apparatus, implements the steps of the method provided in the first aspect of the present disclosure.
  • the present disclosure provides a computer-readable medium on which a computer program is stored, and when the program is executed by a processing apparatus, implements the steps of the method provided in the second aspect of the present disclosure.
  • the present disclosure provides an electronic device, comprising: a storage device on which a computer program is stored; and a processing device for executing the computer program in the storage device, so as to implement the computer program provided in the first aspect of the present disclosure. the steps of the method.
  • the present disclosure provides an electronic device, comprising: a storage device on which a computer program is stored; and a processing device for executing the computer program in the storage device, so as to implement the computer program provided in the second aspect of the present disclosure. the steps of the method.
  • the present disclosure provides a computer program product comprising instructions that, when executed by a processing device, implement the steps of the method provided according to the first aspect of the present disclosure or the described methods provided according to the second aspect of the present disclosure. steps of the method.
  • Fig. 1 is a flow chart of a method for training a correlation determination model according to an exemplary embodiment.
  • Fig. 2 is a flow chart of a method for respectively determining the correlation between each historical search result and historical search information, according to an exemplary embodiment.
  • Fig. 3 is a flow chart of a method for determining target text summary information of historical search results according to an exemplary embodiment.
  • Fig. 4 is a flow chart of a method for determining the degree of relevancy between historical search results included in a search result group and historical search information, according to an exemplary embodiment.
  • Fig. 5 is a flowchart of a search processing method according to an exemplary embodiment.
  • Fig. 6 is a block diagram of a search processing apparatus according to an exemplary embodiment.
  • Fig. 7 is a block diagram of an apparatus for training a correlation determination model according to an exemplary embodiment.
  • FIG. 8 is a schematic structural diagram of an electronic device according to an exemplary embodiment.
  • the term “including” and variations thereof are open-ended inclusions, ie, "including but not limited to”.
  • the term “based on” is “based at least in part on.”
  • the term “one embodiment” means “at least one embodiment”; the term “another embodiment” means “at least one additional embodiment”; the term “some embodiments” means “at least some embodiments”. Relevant definitions of other terms will be given in the description below.
  • the correlation between search results and search information is mainly determined by a model, and the training of the model depends on preset training data, which may include the correlation between the marked search results and search information.
  • preset training data which may include the correlation between the marked search results and search information.
  • technicians generally manually mark the correlation between search results and search information.
  • the amount of data required for training is large, and the manual labeling method is inefficient, and the correlation is manually marked.
  • the annotation is affected by the subjective judgment of technicians and is not accurate enough.
  • the training method of the relevance determination model in the embodiment of the present disclosure is introduced, and the relevance determination model can be used to determine the relevance between the search result and the search information.
  • Fig. 1 is a flow chart of a method for training a correlation determination model according to an exemplary embodiment.
  • the method can be applied to an electronic device with processing capability. As shown in Fig. 1 , the method can include S101 and S102.
  • the correlation between each historical search result and the historical search information is determined respectively according to the historical operation behavior information performed by the user on the plurality of historical search results.
  • the historical search result may be obtained by searching according to the historical search information input by the user.
  • the historical search information may include search information input by the user during a historical period (eg, the past week or the past month), and the historical search information may be search words or search sentences input by the user.
  • a plurality of historical search results can be found according to the historical search information.
  • the historical search information input by the user is the name of a singer, and the plurality of historical search results can include a plurality of songs written by the singer.
  • the users can perform operations on historical search results that meet their own search intentions according to their needs.
  • the operations can include operations such as clicking, browsing, and sliding.
  • the operation can also include music or video.
  • the historical operation behavior information performed by the user on the historical search results may include one or more of the following information items: user click behavior information, user browsing behavior information, user playing behavior information, and the like.
  • the historical operation behavior information can be used to represent whether the user has performed an operation on the historical search result.
  • the historical operation behavior information may include one of the above information items, for example, the user clicks on the historical search result, or the user browses the historical search result, and it is considered that the user has performed an operation on the historical search result.
  • the historical operation behavior information may include multiple items of the above information items, and whether the user has performed operations on the historical search results may be determined through the multiple items of information items. For example, only after the user clicks on the historical search result and browses the page corresponding to the historical search result for a first preset period of time, it is considered that the user has performed an operation on the historical search result.
  • first preset duration and second preset duration may be pre-calibrated.
  • the historical operation behavior information performed by the user on the historical search results can reflect the historical search results and historical search information to a certain extent. correlation between.
  • the historical search results that the user performs more operations are more in line with the user's search intent, that is, the correlation with the historical search information is relatively higher.
  • the correlation between each historical search result and the historical search information is determined respectively according to the historical operation behavior information performed by the user on the historical search results, and there is no need for technicians to manually mark the correlation.
  • the operation behavior performed by the user on the historical search results is affected by factors such as the display order of the historical search results.
  • factors such as the display order of the historical search results.
  • the user is generally accustomed to clicking the historical search result with the previous display order, which makes the determination based on the historical operation behavior information.
  • the correlation between historical search results and historical search information is easily affected by other factors such as display order, media file playback volume, and attention.
  • the historical search information input by the user is aaa
  • a plurality of historical search results are searched according to the historical search information, for example, including: user name: aaa1, degree of attention: 2000; user name aaa2, Attention: 200; Username: aaa3, Attention: 20. Since the user name aaa1 has a higher degree of attention and its display order is in the front, the click rate of the user named aaa1 is higher than the other two, and it will be considered that the historical search result with the user name aaa1 has a relationship between the historical search information and the historical search information.
  • the correlation is the highest, the correlation between the historical search results with the user name aaa2 and the historical search information is second, and the correlation between the historical search results with the user name aaa3 and the historical search information is the lowest.
  • the three historical search results have the same degree of matching with the historical search information aaa.
  • the historical search information input by the user is word A
  • word A is the name of a singer
  • the singer has multiple musical works, such as music 1, music 2.
  • Music 1 is displayed in the first order
  • Music 2 is displayed in the last order.
  • the user playback rate of Music 1 is higher
  • the user playback rate of Music 2 is lower.
  • the subject of the creator of both music 1 and music 2 is the singer, and the degree of matching between music 1 and word A is the same as the degree of matching between music 2 and word A.
  • the target text summary information of the historical search result may be the text information of the part associated with the historical search information in the historical search result. If the target text summary information is the same, it can be represented that from the perspective of text, the matching degree between the historical search results and the historical search information is the same.
  • the historical search information and historical search results are used as the input of the model, the correlation between the historical search results and the historical search information is used as the target output of the model, and the model is trained to obtain a correlation determination model.
  • the correlation determination model may be any network model, such as a neural network model, and the present disclosure does not specifically limit the form of the correlation determination model.
  • each historical search result and historical search results are determined.
  • the correlation between information, the correlation can be used as model training data, and the correlation is determined by training.
  • the degree of relevancy between the historical search results and the historical search information the degree of relevancy between the historical search results and the historical search information having the same target text summary information is set to be the same, and it is possible to reduce the order of presentation, etc.
  • the influence of other factors on the user's operation behavior makes the model training data more accurate, so that the trained correlation determination model is more accurate.
  • Fig. 2 is a flowchart of a method for respectively determining the correlation between each historical search result and historical search information according to an exemplary embodiment.
  • S101 may include S201-S203.
  • target text summary information of the historical search result is determined.
  • step S201 may be shown in FIG. 3 , including S2011 to S2014.
  • the preset topic may be a preset tag used to describe historical search results from different dimensions.
  • the preset topics may include user ID, user name, degree of attention, user signature, and the like.
  • the preset subject may include the title of the document material, the author of the document material, the content of the document material, and the like.
  • the preset theme may include the name of the media file, the creator of the media file, the lyrics of the media file, the album to which the media file belongs, and the style of the media file. ,and many more. It is worth noting that, in the following introduction of the present disclosure, historical search information is used as an example for the search information for media files for illustration, but this does not constitute a limitation on the implementation of the present disclosure.
  • the text information belonging to the preset theme in the acquired historical search results may include all or part of the theme content under the preset theme, and if the text information includes part of the theme content under the preset theme, it belongs to the preset theme.
  • the text information of the preset theme is multiple.
  • the theme content under the preset theme is generally shorter, usually the song name of the song, so it belongs to the preset theme. It is assumed that the text information of a topic may include all topic contents under the preset topic.
  • the acquired text information of the creator of the media file, the album to which the media file belongs, and the style of the media file, such as preset themes may also include all the themes under the preset theme.
  • the text information belonging to the preset theme may include part of the theme content under the preset theme, and the part of the theme content may be, for example, the content of the lyrics in the lyrics. In a word. In this way, there are multiple pieces of text information belonging to the preset theme, and each sentence in the lyrics can be regarded as the text information belonging to the preset theme.
  • candidate text summary information is determined.
  • the candidate text summary information may include text information belonging to each preset topic, and text combination information of the text information.
  • the candidate text summary information may include the text combination information of the text information, in which different preset themes can be selected. Assuming that the text information of the topics is combined, it can cover the situation that the user inputs the contents of multiple preset topics, so that the candidate text summary information is more comprehensive.
  • the present disclosure does not specifically limit the manner of text combination, for example, text information may be combined in pairs.
  • the matching degree between the candidate text summary information and the historical search information may be determined by the number of characters matched between the two, wherein the matching may refer to the same or the same.
  • the matching degree between the candidate text summary information and the historical search information can be determined by the following formula (1):
  • M represents the matching degree between the candidate text summary information and the historical search information
  • hit_terms represents the number of characters matched between the candidate text abstract information and the historical search information
  • query_length represents the number of characters of the historical search information
  • doc_length represents the candidate text abstract The number of characters of the message.
  • the candidate text summary information with the highest matching degree with the historical search information is determined as the target text summary information.
  • the historical search results with the same target text abstract information can be determined. For example, if historical search result 1 and historical search result 2 have the same target text summary information, they can be aggregated into a search result group. It should be noted that the present disclosure does not specifically limit the number of historical search results included in the search result group, and the above examples are only for explanation.
  • the correlation between the historical search results included in the search result group and the historical search information is determined according to the historical operation behavior information performed by the user on the historical search results included in the search result group.
  • the correlation between the historical search results and the historical search information may be determined from a historical search behavior performed by the user according to the historical search information.
  • the correlation degree is only determined by one historical search behavior of the user, the reference data is small, and effective data support cannot be provided, which may make the determined correlation degree inaccurate.
  • the present disclosure provides another preferred embodiment of determining the correlation between historical search results and historical search information through multiple historical search behaviors performed by the user according to historical search information. This embodiment may be shown in FIG. 4 , and may include S2031 ⁇ S2033.
  • the historical operation behavior information is to determine the target historical operation behavior information performed by the user on the search result group in this historical search behavior.
  • the multiple historical search behaviors may be initiated by different users.
  • the target historical operation behavior information can be used to represent whether the user has performed an operation on the search result group. For example, in a user's historical search behavior, as long as the user performs an operation on any historical search result in the search result group, the target historical operation behavior information corresponding to the search result group may be recorded as 1.
  • the feature information of the historical operation behavior performed by the user on the search result group is determined.
  • the historical operation behavior feature information may represent the historical click-through rate, historical playback rate, historical browsing rate, etc. of the search result group.
  • the data of X times of historical search behaviors performed by the user according to the historical search information are counted, and the target historical operation behavior information is Y, and Y is less than or equal to X, then the historical operation behavior feature information can be represented by the ratio of Y to X. .
  • the correlation between the historical search results included in the search result group and the historical search information is determined according to the historical operation behavior feature information.
  • the historical operation behavior feature information can be directly used as the correlation between the historical search results included in the search result group and the historical search information.
  • the correlation may also be determined according to the correspondence between the pre-quantized historical operation behavior feature information and the correlation.
  • the correlation between the historical search results and the historical search information can be accurately determined through multiple historical search behaviors performed by the user according to the historical search information.
  • the historical search results with the same target text abstract information are aggregated into a search result group, and the correlation between the historical search results included in the search result group and the historical search information is the same, which can reduce other factors such as the display order.
  • the influence of factors on the user's operation behavior makes the model training data more accurate, so that the trained correlation determination model is more accurate.
  • FIG. 5 is a flowchart of a search processing method according to an exemplary embodiment.
  • the method can be applied to an electronic device with processing capability, such as a terminal or a server, as shown in FIG. As shown in 5, the method may include S501 and S502.
  • target search information is received.
  • the target search information may be information such as search words, search sentences, etc. input by the user.
  • the user and the user who input the historical search information may be the same or different.
  • the target search information may be the same or different from the above-mentioned historical search information. There are no specific restrictions on disclosure.
  • the target search result is determined according to the target search information, and the target correlation degree between the target search result and the target search information is determined by the correlation determination model.
  • the target search results and target search information may be input into the pre-trained correlation determination model to obtain the target correlation between the target search results and the target search information output by the correlation determination model.
  • the correlation determination model may be obtained by training in the following way: according to the historical operation behavior information performed by the user on a plurality of historical search results, the correlation between each historical search result and the historical search information is determined respectively, and the correlation between each historical search result and the historical search information is determined respectively, wherein the target text has the same target text.
  • the correlation between the historical search results of the abstract information and the historical search information is set to be the same, and the historical search results are obtained by searching based on the historical search information input by the user; taking the historical search information and historical search results as the input of the model, The correlation between historical search results and historical search information is used as the target output of the model, and the model is trained to obtain a correlation determination model.
  • the target search result is determined according to the target search information, and the target correlation between the target search result and the target search information is determined by the correlation determination model.
  • the correlation determination model if the historical search results conform to the user's search intent, the user will perform operations on the historical search results.
  • the correlation between the historical search results and the historical search information, the correlation can be used as model training data, and the correlation determination model is obtained by training. In this way, the data required for model training can be quickly obtained without manual labeling of training data, and the problem of inaccurate correlation of manual labeling can be solved.
  • the degree of relevancy between the historical search results and the historical search information when determining the degree of relevancy between the historical search results and the historical search information, the degree of relevancy between the historical search results and the historical search information having the same target text summary information is set to be the same, and it is possible to reduce the order of presentation, etc.
  • the influence of other factors on the user's operation behavior makes the model training data and the trained correlation determination model more accurate, so that the correlation between the target search results determined by the correlation determination model and the target search information is more accurate. It provides an accurate basis for judging whether the target search result conforms to the user's search intent.
  • determining the correlation between each historical search result and the historical search information according to the historical operation behavior information performed by the user on the plurality of historical search results includes: for each historical search result, determining all the historical search results.
  • the target text summary information of the historical search results; the historical search results with the same target text abstract information are aggregated into a search result group; according to the user's implementation of the historical search results included in the search result group the historical operation behavior information, and determine the correlation between the historical search results included in the search result group and the historical search information.
  • the historical operation behavior information performed by the user on the historical search results included in the search result group it is determined that the historical search results included in the search result group are the same as the historical search results included in the search result group.
  • the correlation between the historical search information includes: for each historical search behavior in the multiple historical search behaviors performed by the user according to the historical search information, according to the user's search result group in this historical search behavior, The historical operation behavior information implemented by the historical search results included in the , determine the target historical operation behavior information performed by the user on the search result group in this historical search behavior; The target historical operation behavior information implemented on the search result group respectively, determine the historical operation behavior characteristic information implemented by the user on the search result group; according to the historical operation behavior characteristic information, determine the search result group the correlation between the historical search results and the historical search information included in .
  • the determining the target text summary information of the historical search results includes: acquiring text information belonging to a preset topic in the historical search results, wherein the text information includes information under the preset topic. All or part of the subject content, in the case that the text information includes part of the subject content under the preset subject, there are multiple text information belonging to the preset subject; determine candidate text summary information, wherein, The candidate text summary information includes the text information belonging to each of the preset topics and the text combination information of the text information; respectively determine the relationship between each of the candidate text summary information and the historical search information.
  • Matching degree Determine the candidate text summary information with the highest matching degree with the historical search information as the target text summary information.
  • the search processing method provided by the present disclosure may further include: determining the display order of the target search results according to the target relevance between the target search results and the target search information.
  • multiple target search results can be searched according to the target search information, and the relevancy degrees between the multiple target search results and the target search information may be different from each other.
  • the display order of the target search results with a high degree of relevance to the target search information may be arranged before the display order of the target search results with a low degree of relevance to the target search information. In this way, the user can first browse to the search result more relevant to the target search information input by the user, thereby improving the user experience.
  • FIG. 6 is a block diagram of a search processing apparatus according to an exemplary embodiment.
  • the apparatus 600 may include:
  • a target relevance determination module 602 configured to determine a target search result according to the target search information, and determine the target relevance between the target search result and the target search information through a relevance determination model;
  • the correlation determination model is obtained by training in the following way: according to the historical operation behavior information performed by the user on multiple historical search results, the correlation between each historical search result and the historical search information is determined respectively, wherein there are The correlation between the historical search results of the same target text abstract information and the historical search information is set to be the same, and the historical search results are obtained by searching according to the historical search information input by the user; The historical search information and the historical search results are used as the input of the model, the correlation between the historical search results and the historical search information is used as the target output of the model, and the model is trained to obtain the correlation Determine the model.
  • the apparatus 600 may further include: a display order determination module, configured to determine the display order of the target search results according to the target relevance between the target search results and the target search information.
  • a display order determination module configured to determine the display order of the target search results according to the target relevance between the target search results and the target search information.
  • FIG. 7 is a block diagram of a correlation determination model training apparatus according to an exemplary embodiment.
  • the apparatus 700 may include:
  • Relevance determination module 701 is used to determine the correlation between each historical search result and historical search information according to the historical operation behavior information performed by the user on a plurality of historical search results, wherein the The correlation between the historical search result and the historical search information is set to be the same, and the historical search result is obtained by searching according to the historical search information input by the user;
  • the training module 702 is used for taking the historical search information and the historical search results as the input of the model, and the correlation between the historical search results and the historical search information as the target output of the model, and performing the training on the model. training to obtain the correlation determination model.
  • the relevancy determination module 701 includes: a target text summary information determination submodule for determining the target text summary information of the historical search result for each historical search result; an aggregation submodule , used for aggregating the historical search results with the same target text summary information into a search result group; a first correlation determination sub-module is used for the historical search results included in the search result group according to the user's
  • the implemented historical operation behavior information determines the degree of relevancy between the historical search results included in the search result group and the historical search information.
  • the first relevancy determination sub-module includes: a behavior information determination sub-module, which is used for each historical search behavior in the multiple historical search behaviors performed by the user according to the historical search information, according to the In this historical search behavior, the historical operation behavior information performed by the user on the historical search results included in the search result group determines the target history performed by the user on the search result group in this historical search behavior Operation behavior information; a behavior feature information determination sub-module, configured to determine that the user has performed the search result group according to the target historical operation behavior information respectively performed by the user on the search result group in multiple historical search behaviors the historical operation behavior feature information; the second correlation determination sub-module is used to determine the relationship between the historical search results included in the search result group and the historical search information according to the historical operation behavior feature information the correlation.
  • a behavior information determination sub-module which is used for each historical search behavior in the multiple historical search behaviors performed by the user according to the historical search information, according to the In this historical search behavior, the historical operation behavior information performed by the user on the historical search results included in
  • the target text summary information determination sub-module includes: a text information acquisition sub-module for acquiring text information belonging to preset topics in the historical search results, wherein the text information includes the preset All or part of the theme content under the theme, in the case that the text information includes part of the theme content under the preset theme, there are multiple pieces of text information belonging to the preset theme; module for determining candidate text summary information, wherein the candidate text summary information includes the text information belonging to each of the preset topics and the text combination information of the text information; the matching degree determination sub-module, using for determining the matching degree between each of the candidate text abstract information and the historical search information respectively; the abstract information determination sub-module is used to determine the candidate text abstract with the highest matching degree with the historical search information information, which is determined as the target text summary information.
  • a text information acquisition sub-module for acquiring text information belonging to preset topics in the historical search results, wherein the text information includes the preset All or part of the theme content under the theme, in the case that the text information includes part of
  • the target search result is determined according to the target search information, and the target correlation between the target search result and the target search information is determined by the correlation determination model.
  • the correlation determination model if the historical search results conform to the user's search intent, the user will perform operations on the historical search results.
  • the correlation between the historical search results and the historical search information, the correlation can be used as model training data, and the correlation determination model is obtained by training. In this way, the data required for model training can be quickly obtained without manual labeling of training data, and the problem of inaccurate correlation of manual labeling can be solved.
  • the degree of relevancy between the historical search results and the historical search information when determining the degree of relevancy between the historical search results and the historical search information, the degree of relevancy between the historical search results and the historical search information having the same target text summary information is set to be the same, and it is possible to reduce the order of presentation, etc.
  • the influence of other factors on the user's operation behavior makes the model training data and the trained correlation determination model more accurate, so that the correlation between the target search results determined by the correlation determination model and the target search information is more accurate. It provides an accurate basis for judging whether the target search result conforms to the user's search intent.
  • each module performs operations has been described in detail in the embodiments of the related method, and will not be described in detail here.
  • the division of the above-mentioned modules does not limit the specific implementation manner, and the above-mentioned various modules may be implemented by, for example, software, hardware, or a combination of software and hardware.
  • the above-mentioned modules may be implemented as independent physical entities, or may also be implemented by a single entity (eg, a processor (CPU or DSP, etc.), an integrated circuit, etc.).
  • a processor CPU or DSP, etc.
  • the respective modules are shown as separate modules in the figures, one or more of these modules may also be combined into one module or split into multiple modules.
  • Terminal devices in the embodiments of the present disclosure may include, but are not limited to, such as mobile phones, notebook computers, digital broadcast receivers, PDAs (personal digital assistants), PADs (tablets), PMPs (portable multimedia players), vehicle-mounted terminals (eg, mobile terminals such as in-vehicle navigation terminals), etc., and stationary terminals such as digital TVs, desktop computers, and the like.
  • the electronic device shown in FIG. 8 is only an example, and should not impose any limitation on the function and scope of use of the embodiments of the present disclosure.
  • an electronic device 800 may include a processing device (eg, a central processing unit, a graphics processor, etc.) 801 that may be loaded into random access according to a program stored in a read only memory (ROM) 802 or from a storage device 808 Various appropriate actions and processes are executed by the programs in the memory (RAM) 803 . In the RAM 803, various programs and data required for the operation of the electronic device 800 are also stored.
  • the processing device 801, the ROM 802, and the RAM 803 are connected to each other through a bus 804.
  • An input/output (I/O) interface 805 is also connected to bus 804 .
  • I/O interface 805 input devices 806 including, for example, a touch screen, touchpad, keyboard, mouse, camera, microphone, accelerometer, gyroscope, etc.; including, for example, a liquid crystal display (LCD), speakers, vibration An output device 807 of a computer, etc.; a storage device 808 including, for example, a magnetic tape, a hard disk, etc.; and a communication device 809.
  • Communication means 809 may allow electronic device 800 to communicate wirelessly or by wire with other devices to exchange data. While FIG. 8 shows an electronic device 800 having various means, it should be understood that not all of the illustrated means are required to be implemented or provided. More or fewer devices may alternatively be implemented or provided.
  • Embodiments of the present disclosure also include a computer program product, comprising: instructions that, when executed by a processing device, implement the steps of the search processing method of the embodiments of the present disclosure or the steps of the relevance determination model training method of the embodiments of the present disclosure .
  • embodiments of the present disclosure include a computer program product comprising a computer program carried on a non-transitory computer readable medium, the computer program containing program code for performing the method illustrated in the flowchart.
  • the computer program may be downloaded and installed from the network via the communication device 809, or from the storage device 808, or from the ROM 802.
  • the processing device 801 the above-mentioned functions defined in the methods of the embodiments of the present disclosure are executed.
  • the computer-readable medium mentioned above in the present disclosure may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two.
  • the computer-readable storage medium can be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or a combination of any of the above. More specific examples of computer readable storage media may include, but are not limited to, electrical connections with one or more wires, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable Programmable read only memory (EPROM or flash memory), fiber optics, portable compact disk read only memory (CD-ROM), optical storage devices, magnetic storage devices, or any suitable combination of the foregoing.
  • a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.
  • a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave with computer-readable program code embodied thereon. Such propagated data signals may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing.
  • a computer-readable signal medium can also be any computer-readable medium other than a computer-readable storage medium that can transmit, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device .
  • Program code embodied on a computer readable medium may be transmitted using any suitable medium including, but not limited to, electrical wire, optical fiber cable, RF (radio frequency), etc., or any suitable combination of the foregoing.
  • the client and server can use any currently known or future developed network protocol such as HTTP (HyperText Transfer Protocol) to communicate, and can communicate with digital data in any form or medium Communication (eg, a communication network) interconnects.
  • HTTP HyperText Transfer Protocol
  • Examples of communication networks include local area networks (“LAN”), wide area networks (“WAN”), the Internet (eg, the Internet), and peer-to-peer networks (eg, ad hoc peer-to-peer networks), as well as any currently known or future development network of.
  • the above-mentioned computer-readable medium may be included in the above-mentioned electronic device; or may exist alone without being assembled into the electronic device.
  • the above-mentioned computer-readable medium carries one or more programs, and when the above-mentioned one or more programs are executed by the electronic device, the electronic device: receives target search information; determines a target search result according to the target search information, and passes The relevance determination model determines the target relevance between the target search result and the target search information; wherein, the relevance determination model is obtained by training in the following manner: according to the history of the user's implementation of multiple historical search results Operating behavior information, respectively determining the correlation between each historical search result and historical search information, wherein the correlation between the historical search results and the historical search information with the same target text abstract information is set to be the same , the historical search result is obtained by searching according to the historical search information input by the user; the historical search information and the historical search result are used as the input of the model, and the difference between the historical search result and the historical search information The correlation between the two is used as the target output of the model, and the model is trained to obtain the correlation determination model.
  • the above-mentioned computer-readable medium carries one or more programs, and when the above-mentioned one or more programs are executed by the electronic device, the electronic device: according to the historical operation behavior information performed by the user on multiple historical search results, respectively determining the degree of relevance between each historical search result and historical search information, wherein the degree of relevancy between the historical search results and the historical search information having the same target text summary information is set to be the same, the historical search
  • the result is obtained by searching according to the historical search information input by the user; the historical search information and the historical search results are taken as the input of the model, and the correlation between the historical search results and the historical search information is taken as The target output of the model, and the model is trained to obtain the correlation determination model.
  • Computer program code for performing operations of the present disclosure may be written in one or more programming languages, including but not limited to object-oriented programming languages—such as Java, Smalltalk, C++, and This includes conventional procedural programming languages - such as the "C" language or similar programming languages.
  • the program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server.
  • the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (eg, using an Internet service provider to via Internet connection).
  • LAN local area network
  • WAN wide area network
  • each block in the flowchart or block diagrams may represent a module, segment, or portion of code that contains one or more logical functions for implementing the specified functions executable instructions.
  • the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved.
  • each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations can be implemented in dedicated hardware-based systems that perform the specified functions or operations , or can be implemented in a combination of dedicated hardware and computer instructions.
  • the modules involved in the embodiments of the present disclosure may be implemented in software or hardware. Wherein, the name of the module does not constitute a limitation of the module itself under certain circumstances, for example, the receiving module may also be described as a "target search information receiving module".
  • exemplary types of hardware logic components include: Field Programmable Gate Arrays (FPGAs), Application Specific Integrated Circuits (ASICs), Application Specific Standard Products (ASSPs), Systems on Chips (SOCs), Complex Programmable Logical Devices (CPLDs) and more.
  • FPGAs Field Programmable Gate Arrays
  • ASICs Application Specific Integrated Circuits
  • ASSPs Application Specific Standard Products
  • SOCs Systems on Chips
  • CPLDs Complex Programmable Logical Devices
  • a machine-readable medium may be a tangible medium that may contain or store a program for use by or in connection with the instruction execution system, apparatus or device.
  • the machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium.
  • Machine-readable media may include, but are not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, devices, or devices, or any suitable combination of the foregoing.
  • machine-readable storage media would include one or more wire-based electrical connections, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), fiber optics, compact disk read only memory (CD-ROM), optical storage, magnetic storage, or any suitable combination of the foregoing.
  • RAM random access memory
  • ROM read only memory
  • EPROM or flash memory erasable programmable read only memory
  • CD-ROM compact disk read only memory
  • magnetic storage or any suitable combination of the foregoing.
  • exemplary embodiment 1 provides a search processing method, the method includes: receiving target search information; determining a target search result according to the target search information, and determining by correlation The model determines the target correlation between the target search result and the target search information; wherein, the correlation determination model is obtained by training in the following way: according to the historical operation behavior information performed by the user on a plurality of historical search results , respectively determine the correlation between each historical search result and historical search information, wherein the correlation between the historical search results with the same target text abstract information and the historical search information is set to be the same, the The historical search results are obtained by searching according to the historical search information input by the user; the historical search information and the historical search results are used as the input of the model, and the correlation between the historical search results and the historical search information The degree is used as the target output of the model, and the model is trained to obtain the correlation determination model.
  • the exemplary embodiment 2 provides the method of the exemplary embodiment 1, wherein according to the historical operation behavior information performed by the user on the multiple historical search results, the relationship between each historical search result and each historical search result is determined respectively.
  • the correlation between historical search information includes: for each historical search result, determining the target text summary information of the historical search result; aggregating the historical search results with the same target text abstract information as A search result group; according to the historical operation behavior information performed by the user on the historical search results included in the search result group, determine the historical search results included in the search result group and the historical search results correlation between information.
  • exemplary embodiment 3 provides the method of exemplary embodiment 2 according to the history performed by the user on the historical search results included in the search result group Operation behavior information, determining the correlation between the historical search results included in the search result group and the historical search information, including: for the user according to the historical search information for multiple historical search behaviors. For each historical search behavior, according to the historical operation behavior information performed by the user on the historical search results included in the search result group in the historical search behavior, determine the user's interest in the historical search behavior in this historical search behavior.
  • the target historical operation behavior information implemented by the search result group according to the target historical operation behavior information respectively implemented by the user on the search result group in multiple historical search behaviors, it is determined that the user implements the search result group.
  • the historical operation behavior feature information according to the historical operation behavior feature information, determine the correlation between the historical search results included in the search result group and the historical search information.
  • Exemplary Embodiment 4 provides the method of Exemplary Embodiment 2, and the determining the target text summary information of the historical search result includes: acquiring the historical search result The text information belonging to the preset theme in the text information, wherein the text information includes all the theme content or part of the theme content under the preset theme, in the case that the text information includes part of the theme content under the preset theme , there are multiple pieces of text information belonging to the preset theme; and candidate text summary information is determined, wherein the candidate text summary information includes the text information belonging to each preset theme and the text of the text information combination information; determine the degree of matching between each candidate text summary information and the historical search information respectively; determine the candidate text abstract information with the highest matching degree with the historical search information as the Target text summary information.
  • exemplary embodiment 5 provides the method of exemplary embodiment 4, the historical search information is search information for media files, and accordingly, the preset theme includes media files name of the media file, the creator of the media file, the lyrics of the media file, the album to which the media file belongs, and the style of the media file.
  • Exemplary Embodiment 6 provides the method of any one of Exemplary Embodiments 1 to 5, the method further comprising: searching according to the target The target relevance between the results and the target search information determines the display order of the target search results.
  • exemplary embodiment 7 provides a method for training a relevance determination model, the method comprising: according to historical operation behavior information performed by a user on a plurality of historical search results, respectively determining The correlation between historical search results and historical search information, wherein the correlation between the historical search results and the historical search information with the same target text summary information is set to be the same, and the historical search results are Obtained by searching according to the historical search information input by the user; using the historical search information and the historical search results as the input of the model, and the correlation between the historical search results and the historical search information as the model's target output, the model is trained to obtain the correlation determination model.
  • the exemplary embodiment 8 provides the method of the exemplary embodiment 7, wherein according to the historical operation behavior information performed by the user on the multiple historical search results, the relationship between each historical search result and each historical search result is determined respectively.
  • the correlation between historical search information includes: for each historical search result, determining the target text summary information of the historical search result; aggregating the historical search results with the same target text abstract information as A search result group; according to the historical operation behavior information performed by the user on the historical search results included in the search result group, determine the historical search results included in the search result group and the historical search results correlation between information.
  • Exemplary Embodiment 9 provides the method of Exemplary Embodiment 8 according to the history performed by the user on the historical search results included in the set of search results Operation behavior information, determining the correlation between the historical search results included in the search result group and the historical search information, including: for the user according to the historical search information for multiple historical search behaviors. For each historical search behavior, according to the historical operation behavior information performed by the user on the historical search results included in the search result group in the historical search behavior, determine the user's interest in the historical search behavior in this historical search behavior.
  • the target historical operation behavior information implemented by the search result group according to the target historical operation behavior information respectively implemented by the user on the search result group in multiple historical search behaviors, it is determined that the user implements the search result group.
  • the historical operation behavior feature information according to the historical operation behavior feature information, determine the correlation between the historical search results included in the search result group and the historical search information.
  • Exemplary Embodiment 10 provides the method of Exemplary Embodiment 8, wherein the determining the target text summary information of the historical search result includes: acquiring the historical search result The text information belonging to the preset theme in the text information, wherein the text information includes all the theme content or part of the theme content under the preset theme, in the case that the text information includes part of the theme content under the preset theme , there are multiple pieces of text information belonging to the preset theme; and candidate text summary information is determined, wherein the candidate text summary information includes the text information belonging to each preset theme and the text of the text information combination information; determine the degree of matching between each candidate text summary information and the historical search information respectively; determine the candidate text abstract information with the highest matching degree with the historical search information as the Target text summary information.
  • Exemplary Embodiment 11 provides the method of Exemplary Embodiment 10, the historical search information is search information for media files, and accordingly, the preset theme includes media files name of the media file, the creator of the media file, the lyrics of the media file, the album to which the media file belongs, and the style of the media file.
  • exemplary embodiment 12 provides a search processing apparatus, the apparatus includes: a receiving module for receiving target search information; The target search information determines the target search result, and the target correlation between the target search result and the target search information is determined by a correlation determination model; wherein, the correlation determination model is obtained by training in the following manner: The historical operation behavior information performed by the user on a plurality of historical search results respectively determines the correlation between each historical search result and the historical search information, wherein the historical search results with the same target text summary information and the historical search information The correlations are set to be the same, and the historical search results are obtained by searching according to the historical search information input by the user; taking the historical search information and the historical search results as the input of the model, the The correlation between the historical search results and the historical search information is used as the target output of the model, and the model is trained to obtain the correlation determination model.
  • exemplary embodiment 13 provides an apparatus for training a relevance determination model, the apparatus comprising: a relevance determination module configured to implement a history of a plurality of historical search results according to a user Operating behavior information, respectively determining the correlation between each historical search result and historical search information, wherein the correlation between the historical search results and the historical search information with the same target text abstract information is set to be the same , the historical search results are obtained by searching according to the historical search information input by the user; the training module is used to use the historical search information and the historical search results as the input of the model, and the historical search results are the same as the historical search results.
  • the correlation between the historical search information is used as the target output of the model, and the model is trained to obtain the correlation determination model.
  • Exemplary Embodiment 14 provides a computer-readable medium having stored thereon a computer program that, when executed by a processing apparatus, implements Exemplary Embodiment 1 - Exemplary Embodiment The steps of any one of the methods in 6.
  • Exemplary Embodiment 15 provides a computer-readable medium having stored thereon a computer program that, when executed by a processing apparatus, implements Exemplary Embodiment 7 - Exemplary Embodiment The steps of any one of 11.
  • exemplary embodiment 16 provides an electronic device, comprising: a storage device on which a computer program is stored; and a processing device for executing the computer in the storage device A program to implement the steps of the method of any one of Exemplary Embodiment 1 - Exemplary Embodiment 6.
  • exemplary embodiment 17 provides an electronic device, comprising: a storage device on which a computer program is stored; and a processing device for executing the computer in the storage device A program to implement the steps of the method of any of Exemplary Embodiment 7-Example Embodiment 11.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Mathematical Physics (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present disclosure relates to a search processing method and apparatus, a model training method and apparatus, and a medium and a device. The search processing method comprises: receiving target search information; and determining a target search result according to the target search information, and determining a target degree of relevance between the target search result and the target search information by means of a degree of relevance determination model. The degree of relevance determination model is obtained by means of training in the following manner: according to information of historical operation behavior implemented by a user on a plurality of historical search results, respectively determining a degree of relevance between each historical search result and historical search information, wherein a degree of relevance between a historical search result and historical search information that have identical target text digest information is set to be identical; and training a model by taking the historical search information and the historical search result as an input of the model and taking the degree of relevance between the historical search result and the historical search information as a target output of the model, so as to obtain the degree of relevance determination model. In this way, a determined target degree of relevance is more accurate.

Description

搜索处理方法、模型训练方法、装置、介质及设备Search processing method, model training method, apparatus, medium and equipment
本申请是以申请号为202011303665.1,申请日为2020年11月19日的中国申请为基础,并主张其优先权,该中国申请的公开内容在此作为整体引入本申请中。This application is based on the Chinese application with the application number of 202011303665.1 and the filing date of November 19, 2020, and claims its priority. The disclosure of the Chinese application is hereby incorporated into this application as a whole.
技术领域technical field
本公开涉及搜索技术领域,具体地,涉及一种搜索处理方法、模型训练方法、装置、介质、计算机程序产品及设备。The present disclosure relates to the field of search technology, and in particular, to a search processing method, a model training method, an apparatus, a medium, a computer program product, and a device.
背景技术Background technique
在搜索领域,一般是根据用户输入的搜索词或搜索语句等搜索信息进行搜索,以得到对应的搜索结果。搜索结果与用户输入的搜索信息之间的相关度,可反映搜索结果是否符合用户的搜索意图。其中,搜索结果与搜索信息之间的相关度越高,可表征该搜索结果越符合用户的搜索意图。In the field of search, a search is generally performed according to search information such as search words or search sentences input by a user, so as to obtain corresponding search results. The correlation between the search results and the search information input by the user can reflect whether the search results conform to the user's search intent. Wherein, the higher the correlation between the search result and the search information, it can be represented that the search result is more in line with the user's search intention.
发明内容SUMMARY OF THE INVENTION
提供该发明内容部分以便以简要的形式介绍构思,这些构思将在后面的具体实施方式部分被详细描述。该发明内容部分并不旨在标识要求保护的技术方案的关键特征或必要特征,也不旨在用于限制所要求的保护的技术方案的范围。This Summary is provided to introduce concepts in a simplified form that are described in detail in the Detailed Description section that follows. This summary section is not intended to identify key features or essential features of the claimed technical solution, nor is it intended to be used to limit the scope of the claimed technical solution.
第一方面,本公开提供一种搜索处理方法,所述方法包括:接收目标搜索信息;根据所述目标搜索信息确定目标搜索结果,并通过相关度确定模型确定所述目标搜索结果与所述目标搜索信息之间的目标相关度;其中,所述相关度确定模型是通过如下方式训练得到的:根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,其中,具有相同目标文本摘要信息的所述历史搜索结果与所述历史搜索信息之间的相关度被设定成相同,所述历史搜索结果是根据用户输入的所述历史搜索信息进行搜索得到的;将所述历史搜索信息和所述历史搜索结果作为模型的输入,所述历史搜索结果与所述历史搜索信息之间的相关度作为模型的目标输出,对所述模型进行训练,以得到所述相关度确定模型。In a first aspect, the present disclosure provides a search processing method, the method includes: receiving target search information; determining a target search result according to the target search information, and determining the target search result and the target through a correlation determination model The target correlation between the search information; wherein, the correlation determination model is obtained by training in the following way: according to the historical operation behavior information performed by the user on a plurality of historical search results, respectively determine each historical search result and historical search information The correlation between the historical search results and the historical search information with the same target text summary information is set to be the same, and the historical search results are based on the historical search results input by the user. The search information is obtained by searching; the historical search information and the historical search results are used as the input of the model, and the correlation between the historical search results and the historical search information is used as the target output of the model. Training is performed to obtain the correlation determination model.
第二方面,本公开提供一种相关度确定模型训练方法,所述方法包括:根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,其中,具有相同目标文本摘要信息的所述历史搜索结果与所述历史搜索信息之间的相关度被设定成相同,所述历史搜索结果是根据用户输入的所述历史搜索信息进行搜索得到的;将所述历史搜索信息和所述历史搜索结果作为模型的输入,所述历史搜索结果与所述历史搜索信息之间的相关度作为模型的目标输出,对所述模型进行训练,以得到所述相关度确定模型。In a second aspect, the present disclosure provides a method for training a correlation determination model, the method comprising: determining the correlation between each historical search result and the historical search information according to historical operation behavior information performed by a user on a plurality of historical search results. The degree of correlation between the historical search results with the same target text summary information and the historical search information is set to be the same, and the historical search results are searched according to the historical search information input by the user Obtained; using the historical search information and the historical search results as the input of the model, and the correlation between the historical search results and the historical search information as the target output of the model, the model is trained to The correlation determination model is obtained.
第三方面,本公开提供一种搜索处理装置,所述装置包括:接收模块,用于接收目标搜索信息;目标相关度确定模块,用于根据所述目标搜索信息确定目标搜索结果,并通过相关度确定模型确定所述目标搜索结果与所述目标搜索信息之间的目标相关度;其中,所述相关度确定模型是通过如下方式训练得到的:根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,其中,具有相同目标文本摘要信息的所述历史搜索结果与所述历史搜索信息之间的相关度被设定成相同,所述历史搜索结果是根据用户输入的所述历史搜索信息进行搜索得到的;将所述历史搜索信息和所述历史搜索结果作为模型的输入,所述历史搜索结果与所述历史搜索信息之间的相关度作为模型的目标输出,对所述模型进行训练,以得到所述相关度确定模型。In a third aspect, the present disclosure provides a search processing device, the device comprising: a receiving module for receiving target search information; a target relevance determination module for determining a target search result according to the target search information, and through correlation The degree determination model determines the target correlation between the target search result and the target search information; wherein, the correlation determination model is obtained by training in the following way: according to the historical operations performed by the user on a plurality of historical search results behavior information, respectively determining the correlation between each historical search result and historical search information, wherein the correlation between the historical search results and the historical search information with the same target text abstract information is set to be the same, The historical search result is obtained by searching according to the historical search information input by the user; the historical search information and the historical search result are used as the input of the model, and the relationship between the historical search result and the historical search information is obtained. The correlation degree is taken as the target output of the model, and the model is trained to obtain the correlation degree determination model.
第四方面,本公开提供一种相关度确定模型训练装置,所述装置包括:相关度确定模块,用于根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,其中,具有相同目标文本摘要信息的所述历史搜索结果与所述历史搜索信息之间的相关度被设定成相同,所述历史搜索结果是根据用户输入的所述历史搜索信息进行搜索得到的;训练模块,用于将所述历史搜索信息和所述历史搜索结果作为模型的输入,所述历史搜索结果与所述历史搜索信息之间的相关度作为模型的目标输出,对所述模型进行训练,以得到所述相关度确定模型。In a fourth aspect, the present disclosure provides a correlation determination model training device, the device includes: a correlation determination module, configured to determine the correlation between each historical search result and each historical search result according to historical operation behavior information performed by a user on a plurality of historical search results. The correlation between historical search information, wherein the correlation between the historical search results with the same target text summary information and the historical search information is set to be the same, and the historical search results are based on user input The historical search information is obtained by searching; the training module is used to use the historical search information and the historical search results as the input of the model, and the correlation between the historical search results and the historical search information is used as the model The target output of the model is trained to obtain the correlation determination model.
第五方面,本公开提供一种计算机可读介质,其上存储有计算机程序,该程序被处理装置执行时实现本公开第一方面提供的所述方法的步骤。In a fifth aspect, the present disclosure provides a computer-readable medium on which a computer program is stored, and when the program is executed by a processing apparatus, implements the steps of the method provided in the first aspect of the present disclosure.
第六方面,本公开提供一种计算机可读介质,其上存储有计算机程序,该程序被处理装置执行时实现本公开第二方面提供的所述方法的步骤。In a sixth aspect, the present disclosure provides a computer-readable medium on which a computer program is stored, and when the program is executed by a processing apparatus, implements the steps of the method provided in the second aspect of the present disclosure.
第七方面,本公开提供一种电子设备,包括:存储装置,其上存储有计算机程序;处 理装置,用于执行所述存储装置中的所述计算机程序,以实现本公开第一方面提供的所述方法的步骤。In a seventh aspect, the present disclosure provides an electronic device, comprising: a storage device on which a computer program is stored; and a processing device for executing the computer program in the storage device, so as to implement the computer program provided in the first aspect of the present disclosure. the steps of the method.
第八方面,本公开提供一种电子设备,包括:存储装置,其上存储有计算机程序;处理装置,用于执行所述存储装置中的所述计算机程序,以实现本公开第二方面提供的所述方法的步骤。In an eighth aspect, the present disclosure provides an electronic device, comprising: a storage device on which a computer program is stored; and a processing device for executing the computer program in the storage device, so as to implement the computer program provided in the second aspect of the present disclosure. the steps of the method.
第九方面,本公开提供一种计算机程序产品,包括指令,所述指令在被处理装置执行时实现根据本公开第一方面提供的所述方法的步骤或根据本公开第二方面提供的所述方法的步骤。In a ninth aspect, the present disclosure provides a computer program product comprising instructions that, when executed by a processing device, implement the steps of the method provided according to the first aspect of the present disclosure or the described methods provided according to the second aspect of the present disclosure. steps of the method.
本公开的其他特征和优点将在随后的具体实施方式部分予以详细说明。Other features and advantages of the present disclosure will be described in detail in the detailed description that follows.
附图说明Description of drawings
结合附图并参考以下具体实施方式,本公开各实施例的上述和其他特征、优点及方面将变得更加明显。贯穿附图中,相同或相似的附图标记表示相同或相似的元素。应当理解附图是示意性的,原件和元素不一定按照比例绘制。在附图中:The above and other features, advantages and aspects of various embodiments of the present disclosure will become more apparent when taken in conjunction with the accompanying drawings and with reference to the following detailed description. Throughout the drawings, the same or similar reference numbers refer to the same or similar elements. It should be understood that the drawings are schematic and that the originals and elements are not necessarily drawn to scale. In the attached image:
图1是根据一示例性实施例示出的一种相关度确定模型训练方法的流程图。Fig. 1 is a flow chart of a method for training a correlation determination model according to an exemplary embodiment.
图2是根据一示例性实施例示出的一种分别确定各个历史搜索结果与历史搜索信息之间的相关度的方法的流程图。Fig. 2 is a flow chart of a method for respectively determining the correlation between each historical search result and historical search information, according to an exemplary embodiment.
图3是根据一示例性实施例示出的一种确定历史搜索结果的目标文本摘要信息的方法的流程图。Fig. 3 is a flow chart of a method for determining target text summary information of historical search results according to an exemplary embodiment.
图4是根据一示例性实施例示出的一种确定搜索结果组中所包括的历史搜索结果与历史搜索信息之间的相关度的方法的流程图。Fig. 4 is a flow chart of a method for determining the degree of relevancy between historical search results included in a search result group and historical search information, according to an exemplary embodiment.
图5是根据一示例性实施例示出的一种搜索处理方法的流程图。Fig. 5 is a flowchart of a search processing method according to an exemplary embodiment.
图6是根据一示例性实施例示出的一种搜索处理装置的框图。Fig. 6 is a block diagram of a search processing apparatus according to an exemplary embodiment.
图7是根据一示例性实施例示出的一种相关度确定模型训练装置的框图。Fig. 7 is a block diagram of an apparatus for training a correlation determination model according to an exemplary embodiment.
图8是根据一示例性实施例示出的一种电子设备的结构示意图。FIG. 8 is a schematic structural diagram of an electronic device according to an exemplary embodiment.
具体实施方式Detailed ways
下面将参照附图更详细地描述本公开的实施例。虽然附图中显示了本公开的某些实施 例,然而应当理解的是,本公开可以通过各种形式来实现,而且不应该被解释为限于这里阐述的实施例,相反提供这些实施例是为了更加透彻和完整地理解本公开。应当理解的是,本公开的附图及实施例仅用于示例性作用,并非用于限制本公开的保护范围。Embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While certain embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be construed as limited to the embodiments set forth herein, but rather are provided for the purpose of A more thorough and complete understanding of the present disclosure. It should be understood that the drawings and embodiments of the present disclosure are only for exemplary purposes, and are not intended to limit the protection scope of the present disclosure.
应当理解,本公开的方法实施方式中记载的各个步骤可以按照不同的顺序执行,和/或并行执行。此外,方法实施方式可以包括附加的步骤和/或省略执行示出的步骤。本公开的范围在此方面不受限制。It should be understood that the various steps described in the method embodiments of the present disclosure may be performed in different orders and/or in parallel. Furthermore, method embodiments may include additional steps and/or omit performing the illustrated steps. The scope of the present disclosure is not limited in this regard.
本文使用的术语“包括”及其变形是开放性包括,即“包括但不限于”。术语“基于”是“至少部分地基于”。术语“一个实施例”表示“至少一个实施例”;术语“另一实施例”表示“至少一个另外的实施例”;术语“一些实施例”表示“至少一些实施例”。其他术语的相关定义将在下文描述中给出。As used herein, the term "including" and variations thereof are open-ended inclusions, ie, "including but not limited to". The term "based on" is "based at least in part on." The term "one embodiment" means "at least one embodiment"; the term "another embodiment" means "at least one additional embodiment"; the term "some embodiments" means "at least some embodiments". Relevant definitions of other terms will be given in the description below.
需要注意,本公开中提及的“第一”、“第二”等概念仅用于对不同的装置、模块或单元进行区分,并非用于限定这些装置、模块或单元所执行的功能的顺序或者相互依存关系。It should be noted that concepts such as "first" and "second" mentioned in the present disclosure are only used to distinguish different devices, modules or units, and are not used to limit the order of functions performed by these devices, modules or units or interdependence.
需要注意,本公开中提及的“一个”、“多个”的修饰是示意性而非限制性的,本领域技术人员应当理解,除非在上下文另有明确指出,否则应该理解为“一个或多个”。It should be noted that the modifications of "a" and "a plurality" mentioned in the present disclosure are illustrative rather than restrictive, and those skilled in the art should understand that unless the context clearly indicates otherwise, they should be understood as "one or a plurality of". multiple".
本公开实施方式中的多个装置之间所交互的消息或者信息的名称仅用于说明性的目的,而并不是用于对这些消息或信息的范围进行限制。The names of messages or information exchanged between multiple devices in the embodiments of the present disclosure are only for illustrative purposes, and are not intended to limit the scope of these messages or information.
目前,搜索结果与搜索信息之间的相关度主要通过模型进行确定,模型的训练依赖于预先设置的训练数据,该训练数据可包括已经标注完成的搜索结果与搜索信息之间的相关度。相关技术中,一般是由技术人员对搜索结果与搜索信息之间的相关度进行人工标注,然而训练所需的数据量较大,通过人工标注的方式,效率低下,并且,对相关度进行人工标注受到技术人员主观判断的影响,不够准确。At present, the correlation between search results and search information is mainly determined by a model, and the training of the model depends on preset training data, which may include the correlation between the marked search results and search information. In related technologies, technicians generally manually mark the correlation between search results and search information. However, the amount of data required for training is large, and the manual labeling method is inefficient, and the correlation is manually marked. The annotation is affected by the subjective judgment of technicians and is not accurate enough.
首先介绍本公开实施例中相关度确定模型的训练方法,该相关度确定模型可用于确定搜索结果与搜索信息之间的相关度。First, the training method of the relevance determination model in the embodiment of the present disclosure is introduced, and the relevance determination model can be used to determine the relevance between the search result and the search information.
图1是根据一示例性实施例示出的一种相关度确定模型训练方法的流程图,该方法可应用于具有处理能力的电子设备中,如图1所示,该方法可包括S101和S102。Fig. 1 is a flow chart of a method for training a correlation determination model according to an exemplary embodiment. The method can be applied to an electronic device with processing capability. As shown in Fig. 1 , the method can include S101 and S102.
在S101中,根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度。In S101, the correlation between each historical search result and the historical search information is determined respectively according to the historical operation behavior information performed by the user on the plurality of historical search results.
历史搜索结果可以是根据用户输入的历史搜索信息进行搜索得到的。示例地,历史搜索信息可包括在历史时段(如过去一周或过去一个月)用户输入的搜索信息,该历史搜索信息可以是用户输入的搜索词或搜索语句。其中,根据历史搜索信息可搜索出多条历史搜索结果,例如,用户输入的历史搜索信息为歌手的姓名,多条历史搜索结果可包括该歌手创作的多首歌曲。The historical search result may be obtained by searching according to the historical search information input by the user. For example, the historical search information may include search information input by the user during a historical period (eg, the past week or the past month), and the historical search information may be search words or search sentences input by the user. Among them, a plurality of historical search results can be found according to the historical search information. For example, the historical search information input by the user is the name of a singer, and the plurality of historical search results can include a plurality of songs written by the singer.
用户可根据需求对符合自身搜索意图的历史搜索结果实施操作,该操作例如可包括点击、浏览、滑动等操作,另外,当用户搜索音乐、视频等媒体文件时,该操作还可包括对音乐或视频文件的播放操作。用户对历史搜索结果实施的历史操作行为信息可包括以下信息项中的一项或多项:用户点击行为信息、用户浏览行为信息、用户播放行为信息等。该历史操作行为信息可用于表征用户是否对该历史搜索结果实施了操作。Users can perform operations on historical search results that meet their own search intentions according to their needs. For example, the operations can include operations such as clicking, browsing, and sliding. In addition, when users search for media files such as music and videos, the operation can also include music or video. The playback operation of the video file. The historical operation behavior information performed by the user on the historical search results may include one or more of the following information items: user click behavior information, user browsing behavior information, user playing behavior information, and the like. The historical operation behavior information can be used to represent whether the user has performed an operation on the historical search result.
在一实施例中,历史操作行为信息可包括上述信息项中的一项,例如用户点击了历史搜索结果,或者用户浏览了历史搜索结果,认为用户对历史搜索结果实施了操作。在另一个实施例中,历史操作行为信息可包括上述信息项中的多项,可通过该多项信息项判断用户是否对历史搜索结果实施了操作。例如,用户点击了历史搜索结果,并且在该历史搜索结果对应的页面浏览了第一预设时长,才认为用户对该历史搜索结果实施了操作。或者,用户点击了历史搜索结果,并且将该历史搜索结果对应的音乐或视频文件播放了第二预设时长,才认为用户对该历史搜索结果实施了操作。上述的第一预设时长和第二预设时长可预先标定出。In one embodiment, the historical operation behavior information may include one of the above information items, for example, the user clicks on the historical search result, or the user browses the historical search result, and it is considered that the user has performed an operation on the historical search result. In another embodiment, the historical operation behavior information may include multiple items of the above information items, and whether the user has performed operations on the historical search results may be determined through the multiple items of information items. For example, only after the user clicks on the historical search result and browses the page corresponding to the historical search result for a first preset period of time, it is considered that the user has performed an operation on the historical search result. Alternatively, only after the user clicks on the historical search result and plays the music or video file corresponding to the historical search result for a second preset duration, it is considered that the user has performed an operation on the historical search result. The above-mentioned first preset duration and second preset duration may be pre-calibrated.
其中,在历史搜索结果符合用户的搜索意图的情况下,用户会对历史搜索结果实施操作,因此用户对历史搜索结果实施的历史操作行为信息,可以在一定程度上反映历史搜索结果与历史搜索信息之间的相关度。用户实施操作更多的历史搜索结果更加符合用户的搜索意图,即与历史搜索信息之间的相关度相对更高。本公开中,根据用户对历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,无需技术人员对相关度进行人工标注。Among them, if the historical search results conform to the user's search intention, the user will perform operations on the historical search results. Therefore, the historical operation behavior information performed by the user on the historical search results can reflect the historical search results and historical search information to a certain extent. correlation between. The historical search results that the user performs more operations are more in line with the user's search intent, that is, the correlation with the historical search information is relatively higher. In the present disclosure, the correlation between each historical search result and the historical search information is determined respectively according to the historical operation behavior information performed by the user on the historical search results, and there is no need for technicians to manually mark the correlation.
然而,用户对历史搜索结果实施的操作行为,受到历史搜索结果的展示顺序等因素的影响,示例地,用户一般习惯点击展示顺序在前的历史搜索结果,这就使得在根据历史操作行为信息确定历史搜索结果与历史搜索信息之间的相关度时,容易受到例如展示顺序、媒体文件播放量、关注度等其他因素的影响。However, the operation behavior performed by the user on the historical search results is affected by factors such as the display order of the historical search results. For example, the user is generally accustomed to clicking the historical search result with the previous display order, which makes the determination based on the historical operation behavior information. The correlation between historical search results and historical search information is easily affected by other factors such as display order, media file playback volume, and attention.
示例地,以搜索用户名为例,例如用户输入的历史搜索信息为aaa,根据该历史搜索信息搜索出多条历史搜索结果,例如包括:用户名:aaa1,关注度:2000;用户名aaa2,关注度:200;用户名:aaa3,关注度:20。由于用户名aaa1的关注度更高,且其展示顺序在前,因此用户名为aaa1的用户点击率高于其他两者,则会认为用户名为aaa1的历史搜索结果与历史搜索信息之间的相关度最高,用户名为aaa2的历史搜索结果与历史搜索信息之间的相关度次之,用户名为aaa3的历史搜索结果与历史搜索信息之间的相关度最低。然而,从文本信息的角度,这三个历史搜索结果与历史搜索信息aaa的匹配程度是相同的。Illustratively, taking the search user name as an example, for example, the historical search information input by the user is aaa, and a plurality of historical search results are searched according to the historical search information, for example, including: user name: aaa1, degree of attention: 2000; user name aaa2, Attention: 200; Username: aaa3, Attention: 20. Since the user name aaa1 has a higher degree of attention and its display order is in the front, the click rate of the user named aaa1 is higher than the other two, and it will be considered that the historical search result with the user name aaa1 has a relationship between the historical search information and the historical search information. The correlation is the highest, the correlation between the historical search results with the user name aaa2 and the historical search information is second, and the correlation between the historical search results with the user name aaa3 and the historical search information is the lowest. However, from the perspective of text information, the three historical search results have the same degree of matching with the historical search information aaa.
再示例地,以历史搜索信息为针对媒体文件的搜索信息为例,用户输入的历史搜索信息为词A,词A为歌手的姓名,该歌手的音乐作品有多个,例如包括音乐1、音乐2,在向用户展示历史搜索结果时,可能音乐1的展示顺序在前,音乐2的展示顺序在后,通常情况下音乐1的用户播放率较高,音乐2的用户播放率较低。然而,音乐1和音乐2的创作者这一主题均为该歌手,音乐1与词A之间的匹配程度和音乐2与词A之间的匹配程度是相同的。As another example, taking the historical search information as the search information for media files as an example, the historical search information input by the user is word A, and word A is the name of a singer, and the singer has multiple musical works, such as music 1, music 2. When displaying historical search results to users, it is possible that Music 1 is displayed in the first order, and Music 2 is displayed in the last order. Usually, the user playback rate of Music 1 is higher, and the user playback rate of Music 2 is lower. However, the subject of the creator of both music 1 and music 2 is the singer, and the degree of matching between music 1 and word A is the same as the degree of matching between music 2 and word A.
鉴于此,为了减少诸如展示顺序、播放量等其他因素对用户操作行为的影响,本公开中,在确定历史搜索结果与历史搜索信息之间的相关度时,具有相同目标文本摘要信息的历史搜索结果与历史搜索信息之间的相关度被设定成相同。其中,历史搜索结果的目标文本摘要信息可以是历史搜索结果中与历史搜索信息具有关联的部分的文本信息。如果目标文本摘要信息相同,可表征从文本的角度出发,历史搜索结果与历史搜索信息之间的匹配程度相同。In view of this, in order to reduce the influence of other factors such as presentation order, playback volume and other factors on user operation behavior, in the present disclosure, when determining the correlation between historical search results and historical search information, historical searches with the same target text summary information The degree of correlation between the result and the historical search information is set to be the same. Wherein, the target text summary information of the historical search result may be the text information of the part associated with the historical search information in the historical search result. If the target text summary information is the same, it can be represented that from the perspective of text, the matching degree between the historical search results and the historical search information is the same.
在S102中,将历史搜索信息和历史搜索结果作为模型的输入,历史搜索结果与历史搜索信息之间的相关度作为模型的目标输出,对模型进行训练,以得到相关度确定模型。其中,该相关度确定模型可以是任一种网络模型,例如神经网络模型,本公开对相关度确定模型的形式不做具体限制。In S102, the historical search information and historical search results are used as the input of the model, the correlation between the historical search results and the historical search information is used as the target output of the model, and the model is trained to obtain a correlation determination model. The correlation determination model may be any network model, such as a neural network model, and the present disclosure does not specifically limit the form of the correlation determination model.
通过上述方案,在历史搜索结果符合用户的搜索意图的情况下,用户会对历史搜索结果实施操作,因此根据用户对多个历史搜索结果实施的历史操作行为信息,确定各个历史搜索结果与历史搜索信息之间的相关度,该相关度可作为模型训练数据,训练得到相关度确定模型。这样,无需人工标注训练数据,可快速获得模型训练所需的数据,并且解决人 工标注的相关度不准确的问题。而且,在确定历史搜索结果与历史搜索信息之间的相关度时,具有相同目标文本摘要信息的历史搜索结果与历史搜索信息之间的相关度被设定成相同,能够减小诸如展示顺序等其他因素对用户操作行为的影响,使得模型训练数据更为准确,从而训练出的相关度确定模型更为准确。Through the above solution, if the historical search results conform to the user's search intention, the user will perform operations on the historical search results. Therefore, according to the historical operation behavior information performed by the user on multiple historical search results, each historical search result and historical search results are determined. The correlation between information, the correlation can be used as model training data, and the correlation is determined by training. In this way, the data required for model training can be quickly obtained without manual labeling of training data, and the problem of inaccurate correlation of manual labeling can be solved. Also, when determining the degree of relevancy between the historical search results and the historical search information, the degree of relevancy between the historical search results and the historical search information having the same target text summary information is set to be the same, and it is possible to reduce the order of presentation, etc. The influence of other factors on the user's operation behavior makes the model training data more accurate, so that the trained correlation determination model is more accurate.
图2是根据一示例性实施例示出的一种分别确定各个历史搜索结果与历史搜索信息之间的相关度的方法的流程图,如图2所示,S101可包括S201~S203。Fig. 2 is a flowchart of a method for respectively determining the correlation between each historical search result and historical search information according to an exemplary embodiment. As shown in Fig. 2 , S101 may include S201-S203.
在S201中,针对每一历史搜索结果,确定该历史搜索结果的目标文本摘要信息。In S201, for each historical search result, target text summary information of the historical search result is determined.
在一实施例中,该步骤S201的示例性实施方式可如图3所示,包括S2011~S2014。In an embodiment, an exemplary implementation of this step S201 may be shown in FIG. 3 , including S2011 to S2014.
在S2011中,获取历史搜索结果中属于预设主题的文本信息。其中,预设主题可以是预先设置的标签,用于从不同的维度描述历史搜索结果。In S2011, text information belonging to a preset topic in the historical search results is acquired. The preset topic may be a preset tag used to describe historical search results from different dimensions.
示例地,在历史搜索信息为针对用户名的搜索信息的情况下,预设主题可包括用户ID、用户名、关注度、用户签名等。又示例地,在历史搜索信息为针对文献资料的搜索信息的情况下,预设主题可包括文献资料的标题、文献资料的作者、文献资料的内容等。再示例地,在历史搜索信息为针对媒体文件的搜索信息的情况下,预设主题可包括媒体文件的名称、媒体文件的创作者、媒体文件的歌词、媒体文件所属的专辑、媒体文件的风格,等等。值得说明的是,本公开以下介绍中以历史搜索信息为针对媒体文件的搜索信息为例进行举例说明,但并不构成对本公开实施方式的限制。For example, when the historical search information is search information for user names, the preset topics may include user ID, user name, degree of attention, user signature, and the like. For another example, in the case that the historical search information is the search information for the document material, the preset subject may include the title of the document material, the author of the document material, the content of the document material, and the like. For another example, when the historical search information is search information for media files, the preset theme may include the name of the media file, the creator of the media file, the lyrics of the media file, the album to which the media file belongs, and the style of the media file. ,and many more. It is worth noting that, in the following introduction of the present disclosure, historical search information is used as an example for the search information for media files for illustration, but this does not constitute a limitation on the implementation of the present disclosure.
其中,获取到的历史搜索结果中属于预设主题的文本信息可包括该预设主题下的全部主题内容或部分主题内容,在文本信息包括预设主题下的部分主题内容的情况下,属于该预设主题的文本信息为多个。Wherein, the text information belonging to the preset theme in the acquired historical search results may include all or part of the theme content under the preset theme, and if the text information includes part of the theme content under the preset theme, it belongs to the preset theme. The text information of the preset theme is multiple.
以历史搜索信息为针对媒体文件的搜索信息为例,例如对于媒体文件的名称这一预设主题,该预设主题下的主题内容一般较短,通常为歌曲的歌名,因此,属于该预设主题的文本信息可包括该预设主题下的全部主题内容。此外,媒体文件的创作者、媒体文件所属的专辑、媒体文件的风格这些预设主题,获取到的文本信息也可包括预设主题下的全部主题内容。对于媒体文件的歌词这一预设主题,由于歌词的内容通常较多,因此属于该预设主题的文本信息可包括该预设主题下的部分主题内容,该部分主题内容例如可以为歌词中的一句话。这样,属于该预设主题的文本信息为多个,可将歌词中的每句话作为属于该预设主题的文本信息。Taking the historical search information as the search information for media files as an example, for example, for the preset theme of the name of the media file, the theme content under the preset theme is generally shorter, usually the song name of the song, so it belongs to the preset theme. It is assumed that the text information of a topic may include all topic contents under the preset topic. In addition, the acquired text information of the creator of the media file, the album to which the media file belongs, and the style of the media file, such as preset themes, may also include all the themes under the preset theme. For the preset theme of the lyrics of the media file, since the content of the lyrics is usually large, the text information belonging to the preset theme may include part of the theme content under the preset theme, and the part of the theme content may be, for example, the content of the lyrics in the lyrics. In a word. In this way, there are multiple pieces of text information belonging to the preset theme, and each sentence in the lyrics can be regarded as the text information belonging to the preset theme.
在S2012中,确定候选文本摘要信息。该候选文本摘要信息可包括属于每一预设主题的文本信息、以及文本信息的文本组合信息。In S2012, candidate text summary information is determined. The candidate text summary information may include text information belonging to each preset topic, and text combination information of the text information.
用户在输入历史搜索信息时,可能输入了多种预设主题的内容,例如用户通过歌手加歌名的方式进行搜索,因此候选文本摘要信息可包括文本信息的文本组合信息,其中可以将不同预设主题的文本信息进行组合,能够覆盖用户输入多种预设主题的内容的情况,使得候选文本摘要信息更为全面。对于文本组合的方式,本公开不做具体限制,例如可将文本信息之间两两组合。When the user enters the historical search information, he or she may input content of various preset themes. For example, the user searches by adding a song title by a singer. Therefore, the candidate text summary information may include the text combination information of the text information, in which different preset themes can be selected. Assuming that the text information of the topics is combined, it can cover the situation that the user inputs the contents of multiple preset topics, so that the candidate text summary information is more comprehensive. The present disclosure does not specifically limit the manner of text combination, for example, text information may be combined in pairs.
在S2013中,分别确定每一候选文本摘要信息与历史搜索信息之间的匹配度。In S2013, the matching degree between each candidate text summary information and the historical search information is determined respectively.
示例地,可通过候选文本摘要信息与历史搜索信息之间相匹配的字符数来确定二者之间的匹配度,其中,相匹配可以指的是相同或一致。例如可通过如下公式(1)确定候选文本摘要信息与历史搜索信息之间的匹配度:For example, the matching degree between the candidate text summary information and the historical search information may be determined by the number of characters matched between the two, wherein the matching may refer to the same or the same. For example, the matching degree between the candidate text summary information and the historical search information can be determined by the following formula (1):
Figure PCTCN2021131113-appb-000001
Figure PCTCN2021131113-appb-000001
其中,M表示候选文本摘要信息与历史搜索信息之间的匹配度,hit_terms表示候选文本摘要信息与历史搜索信息之间相匹配的字符数,query_length表示历史搜索信息的字符数,doc_length表示候选文本摘要信息的字符数。Among them, M represents the matching degree between the candidate text summary information and the historical search information, hit_terms represents the number of characters matched between the candidate text abstract information and the historical search information, query_length represents the number of characters of the historical search information, and doc_length represents the candidate text abstract The number of characters of the message.
在S2014中,将与历史搜索信息之间的匹配度最高的候选文本摘要信息,确定为目标文本摘要信息。In S2014, the candidate text summary information with the highest matching degree with the historical search information is determined as the target text summary information.
在S202中,将目标文本摘要信息相同的历史搜索结果聚合为搜索结果组。In S202, the historical search results with the same target text summary information are aggregated into a search result group.
在确定出每一历史搜索结果的目标文本摘要信息后,可确定出目标文本摘要信息相同的历史搜索结果。例如,历史搜索结果1与历史搜索结果2的目标文本摘要信息相同,则可以将二者聚合为搜索结果组。值得说明的是,对于搜索结果组中包括的历史搜索结果的数量,本公开不做具体限制,上述示例仅为解释说明。After the target text abstract information of each historical search result is determined, the historical search results with the same target text abstract information can be determined. For example, if historical search result 1 and historical search result 2 have the same target text summary information, they can be aggregated into a search result group. It should be noted that the present disclosure does not specifically limit the number of historical search results included in the search result group, and the above examples are only for explanation.
在S203中,根据用户对搜索结果组中所包括的历史搜索结果实施的历史操作行为信息,确定搜索结果组中所包括的历史搜索结果与历史搜索信息之间的相关度。In S203, the correlation between the historical search results included in the search result group and the historical search information is determined according to the historical operation behavior information performed by the user on the historical search results included in the search result group.
在一实施例中,可从用户根据历史搜索信息进行的一次历史搜索行为中,确定历史搜索结果与历史搜索信息之间的相关度。然而仅通过用户的一次历史搜索行为确定相关度,参考数据较少,不能提供有效的数据支持,可能使得确定出的相关度不够准确。本公开提供另一种优选实施方式,通过用户根据历史搜索信息进行的多次历史搜索行为确定历史搜 索结果与历史搜索信息之间的相关度,该实施方式可如图4所示,可包括S2031~S2033。In one embodiment, the correlation between the historical search results and the historical search information may be determined from a historical search behavior performed by the user according to the historical search information. However, the correlation degree is only determined by one historical search behavior of the user, the reference data is small, and effective data support cannot be provided, which may make the determined correlation degree inaccurate. The present disclosure provides another preferred embodiment of determining the correlation between historical search results and historical search information through multiple historical search behaviors performed by the user according to historical search information. This embodiment may be shown in FIG. 4 , and may include S2031 ~S2033.
在S2031中,针对用户根据历史搜索信息进行的多次历史搜索行为中的每次历史搜索行为,根据在该次历史搜索行为中、用户对搜索结果组中所包括的历史搜索结果实施的所述历史操作行为信息,确定在该次历史搜索行为中用户对搜索结果组实施的目标历史操作行为信息。In S2031, for each historical search behavior in the multiple historical search behaviors performed by the user according to the historical search information, according to the historical search behavior performed by the user on the historical search results included in the search result group in this historical search behavior The historical operation behavior information is to determine the target historical operation behavior information performed by the user on the search result group in this historical search behavior.
其中,多次历史搜索行为可以是不同的用户发起的。目标历史操作行为信息可用于表征用户是否对该搜索结果组实施了操作。示例地,在用户的一次历史搜索行为中,只要用户对搜索结果组中任一历史搜索结果实施了操作,该搜索结果组对应的目标历史操作行为信息可记为1。The multiple historical search behaviors may be initiated by different users. The target historical operation behavior information can be used to represent whether the user has performed an operation on the search result group. For example, in a user's historical search behavior, as long as the user performs an operation on any historical search result in the search result group, the target historical operation behavior information corresponding to the search result group may be recorded as 1.
在S2032中,根据在多次历史搜索行为中用户分别对搜索结果组实施的目标历史操作行为信息,确定用户对搜索结果组实施的历史操作行为特征信息。In S2032, according to the target historical operation behavior information respectively performed by the user on the search result group in the multiple historical search behaviors, the feature information of the historical operation behavior performed by the user on the search result group is determined.
该历史操作行为特征信息可以表征搜索结果组的历史点击率、历史播放率、历史浏览率等。示例地,例如共统计了用户根据历史搜索信息进行的X次历史搜索行为的数据,目标历史操作行为信息为Y,Y小于或等于X,则可通过Y与X的比值表征历史操作行为特征信息。The historical operation behavior feature information may represent the historical click-through rate, historical playback rate, historical browsing rate, etc. of the search result group. Illustratively, for example, the data of X times of historical search behaviors performed by the user according to the historical search information are counted, and the target historical operation behavior information is Y, and Y is less than or equal to X, then the historical operation behavior feature information can be represented by the ratio of Y to X. .
在S2033中,根据历史操作行为特征信息,确定搜索结果组中所包括的历史搜索结果与历史搜索信息之间的相关度。In S2033, the correlation between the historical search results included in the search result group and the historical search information is determined according to the historical operation behavior feature information.
示例地,可直接将历史操作行为特征信息作为搜索结果组中所包括的历史搜索结果与历史搜索信息之间的相关度。或者,也可根据预先量化的历史操作行为特征信息与相关度之间的对应关系,确定该相关度。For example, the historical operation behavior feature information can be directly used as the correlation between the historical search results included in the search result group and the historical search information. Alternatively, the correlation may also be determined according to the correspondence between the pre-quantized historical operation behavior feature information and the correlation.
在上述方案中,通过用户根据历史搜索信息进行的多次历史搜索行为,可准确确定历史搜索结果与历史搜索信息之间的相关度。并且,将目标文本摘要信息相同的历史搜索结果聚合为搜索结果组,该搜索结果组中所包括的历史搜索结果与历史搜索信息之间的相关度是相同的,能够减小诸如展示顺序等其他因素对用户操作行为的影响,使得模型训练数据更为准确,从而训练出的相关度确定模型更为准确。In the above solution, the correlation between the historical search results and the historical search information can be accurately determined through multiple historical search behaviors performed by the user according to the historical search information. In addition, the historical search results with the same target text abstract information are aggregated into a search result group, and the correlation between the historical search results included in the search result group and the historical search information is the same, which can reduce other factors such as the display order. The influence of factors on the user's operation behavior makes the model training data more accurate, so that the trained correlation determination model is more accurate.
本公开还提供一种搜索处理方法,图5是根据一示例性实施例示出的一种搜索处理方法的流程图,该方法可应用于具有处理能力的电子设备中,如终端或服务器,如图5所示,该方法可包括S501和S502。The present disclosure also provides a search processing method. FIG. 5 is a flowchart of a search processing method according to an exemplary embodiment. The method can be applied to an electronic device with processing capability, such as a terminal or a server, as shown in FIG. As shown in 5, the method may include S501 and S502.
在S501中,接收目标搜索信息。In S501, target search information is received.
该目标搜索信息可以是用户输入的搜索词、搜索语句等信息,该用户与输入历史搜索信息的用户可以相同也可不同,该目标搜索信息与上述的历史搜索信息可以相同,也可以不同,本公开不做具体限制。The target search information may be information such as search words, search sentences, etc. input by the user. The user and the user who input the historical search information may be the same or different. The target search information may be the same or different from the above-mentioned historical search information. There are no specific restrictions on disclosure.
在S502中,根据目标搜索信息确定目标搜索结果,并通过相关度确定模型确定目标搜索结果与目标搜索信息之间的目标相关度。In S502, the target search result is determined according to the target search information, and the target correlation degree between the target search result and the target search information is determined by the correlation determination model.
其中,可将目标搜索结果和目标搜索信息输入到预先训练完成的相关度确定模型中,以得到该相关度确定模型输出的目标搜索结果与目标搜索信息之间的目标相关度。The target search results and target search information may be input into the pre-trained correlation determination model to obtain the target correlation between the target search results and the target search information output by the correlation determination model.
相关度确定模型可以是通过如下方式训练得到的:根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,其中,具有相同目标文本摘要信息的历史搜索结果与历史搜索信息之间的相关度被设定成相同,历史搜索结果是根据用户输入的历史搜索信息进行搜索得到的;将历史搜索信息和历史搜索结果作为模型的输入,历史搜索结果与历史搜索信息之间的相关度作为模型的目标输出,对模型进行训练,以得到相关度确定模型。The correlation determination model may be obtained by training in the following way: according to the historical operation behavior information performed by the user on a plurality of historical search results, the correlation between each historical search result and the historical search information is determined respectively, and the correlation between each historical search result and the historical search information is determined respectively, wherein the target text has the same target text. The correlation between the historical search results of the abstract information and the historical search information is set to be the same, and the historical search results are obtained by searching based on the historical search information input by the user; taking the historical search information and historical search results as the input of the model, The correlation between historical search results and historical search information is used as the target output of the model, and the model is trained to obtain a correlation determination model.
其中,相关度确定模型的训练过程已在上文详细说明,此处不再赘述。The training process of the correlation determination model has been described in detail above, and will not be repeated here.
通过上述技术方案,根据目标搜索信息确定目标搜索结果,并通过相关度确定模型确定目标搜索结果与目标搜索信息之间的目标相关度。在该相关度确定模型的训练阶段,在历史搜索结果符合用户的搜索意图的情况下,用户会对历史搜索结果实施操作,因此根据用户对多个历史搜索结果实施的历史操作行为信息,确定各个历史搜索结果与历史搜索信息之间的相关度,该相关度可作为模型训练数据,训练得到相关度确定模型。这样,无需人工标注训练数据,可快速获得模型训练所需的数据,并且解决人工标注的相关度不准确的问题。而且,在确定历史搜索结果与历史搜索信息之间的相关度时,具有相同目标文本摘要信息的历史搜索结果与历史搜索信息之间的相关度被设定成相同,能够减小诸如展示顺序等其他因素对用户操作行为的影响,使得模型训练数据以及训练出的相关度确定模型更为准确,从而通过相关度确定模型确定出的目标搜索结果与目标搜索信息之间的相关度更准确,为判断该目标搜索结果是否符合用户的搜索意图提供准确的依据。Through the above technical solution, the target search result is determined according to the target search information, and the target correlation between the target search result and the target search information is determined by the correlation determination model. In the training phase of the correlation determination model, if the historical search results conform to the user's search intent, the user will perform operations on the historical search results. The correlation between the historical search results and the historical search information, the correlation can be used as model training data, and the correlation determination model is obtained by training. In this way, the data required for model training can be quickly obtained without manual labeling of training data, and the problem of inaccurate correlation of manual labeling can be solved. Also, when determining the degree of relevancy between the historical search results and the historical search information, the degree of relevancy between the historical search results and the historical search information having the same target text summary information is set to be the same, and it is possible to reduce the order of presentation, etc. The influence of other factors on the user's operation behavior makes the model training data and the trained correlation determination model more accurate, so that the correlation between the target search results determined by the correlation determination model and the target search information is more accurate. It provides an accurate basis for judging whether the target search result conforms to the user's search intent.
可选地,所述根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,包括:针对每一所述历史搜索结果,确定所 述历史搜索结果的所述目标文本摘要信息;将所述目标文本摘要信息相同的所述历史搜索结果聚合为搜索结果组;根据用户对所述搜索结果组中所包括的所述历史搜索结果实施的所述历史操作行为信息,确定所述搜索结果组中所包括的所述历史搜索结果与所述历史搜索信息之间的相关度。Optionally, determining the correlation between each historical search result and the historical search information according to the historical operation behavior information performed by the user on the plurality of historical search results, includes: for each historical search result, determining all the historical search results. The target text summary information of the historical search results; the historical search results with the same target text abstract information are aggregated into a search result group; according to the user's implementation of the historical search results included in the search result group the historical operation behavior information, and determine the correlation between the historical search results included in the search result group and the historical search information.
可选地,所述根据用户对所述搜索结果组中所包括的所述历史搜索结果实施的所述历史操作行为信息,确定所述搜索结果组中所包括的所述历史搜索结果与所述历史搜索信息之间的相关度,包括:针对用户根据所述历史搜索信息进行的多次历史搜索行为中的每次历史搜索行为,根据在该次历史搜索行为中、用户对所述搜索结果组中所包括的所述历史搜索结果实施的所述历史操作行为信息,确定在该次历史搜索行为中用户对所述搜索结果组实施的目标历史操作行为信息;根据在多次历史搜索行为中用户分别对所述搜索结果组实施的所述目标历史操作行为信息,确定所述用户对所述搜索结果组实施的历史操作行为特征信息;根据所述历史操作行为特征信息,确定所述搜索结果组中所包括的所述历史搜索结果与所述历史搜索信息之间的所述相关度。Optionally, according to the historical operation behavior information performed by the user on the historical search results included in the search result group, it is determined that the historical search results included in the search result group are the same as the historical search results included in the search result group. The correlation between the historical search information includes: for each historical search behavior in the multiple historical search behaviors performed by the user according to the historical search information, according to the user's search result group in this historical search behavior, The historical operation behavior information implemented by the historical search results included in the , determine the target historical operation behavior information performed by the user on the search result group in this historical search behavior; The target historical operation behavior information implemented on the search result group respectively, determine the historical operation behavior characteristic information implemented by the user on the search result group; according to the historical operation behavior characteristic information, determine the search result group the correlation between the historical search results and the historical search information included in .
可选地,所述确定所述历史搜索结果的所述目标文本摘要信息,包括:获取所述历史搜索结果中属于预设主题的文本信息,其中,所述文本信息包括所述预设主题下的全部主题内容或部分主题内容,在所述文本信息包括所述预设主题下的部分主题内容的情况下,属于所述预设主题的文本信息为多个;确定候选文本摘要信息,其中,所述候选文本摘要信息包括属于每一所述预设主题的所述文本信息、以及所述文本信息的文本组合信息;分别确定每一所述候选文本摘要信息与所述历史搜索信息之间的匹配度;将与所述历史搜索信息之间的匹配度最高的所述候选文本摘要信息,确定为所述目标文本摘要信息。Optionally, the determining the target text summary information of the historical search results includes: acquiring text information belonging to a preset topic in the historical search results, wherein the text information includes information under the preset topic. All or part of the subject content, in the case that the text information includes part of the subject content under the preset subject, there are multiple text information belonging to the preset subject; determine candidate text summary information, wherein, The candidate text summary information includes the text information belonging to each of the preset topics and the text combination information of the text information; respectively determine the relationship between each of the candidate text summary information and the historical search information. Matching degree: Determine the candidate text summary information with the highest matching degree with the historical search information as the target text summary information.
其中,上述步骤的具体实现方式已在上文相关度确定模型训练方法中详细阐述。The specific implementation of the above steps has been described in detail in the above-mentioned training method for determining the correlation degree.
本公开提供的搜索处理方法还可包括:根据目标搜索结果与目标搜索信息之间的目标相关度,确定目标搜索结果的展示顺序。The search processing method provided by the present disclosure may further include: determining the display order of the target search results according to the target relevance between the target search results and the target search information.
通常情况下,根据目标搜索信息可搜索出多个目标搜索结果,该多个目标搜索结果与目标搜索信息之间的相关度可能各不相同。其中,例如可将与目标搜索信息的相关度高的目标搜索结果的展示顺序排在与目标搜索信息的相关度低的目标搜索结果的展示顺序之前。这样,可以使得用户首先浏览到与其输入的目标搜索信息更为相关的搜索结果,提升用户体验。Under normal circumstances, multiple target search results can be searched according to the target search information, and the relevancy degrees between the multiple target search results and the target search information may be different from each other. Wherein, for example, the display order of the target search results with a high degree of relevance to the target search information may be arranged before the display order of the target search results with a low degree of relevance to the target search information. In this way, the user can first browse to the search result more relevant to the target search information input by the user, thereby improving the user experience.
基于同一发明构思,本公开还提供一种搜索处理装置,图6是根据一示例性实施例示出的一种搜索处理装置的框图,该装置600可包括:Based on the same inventive concept, the present disclosure also provides a search processing apparatus. FIG. 6 is a block diagram of a search processing apparatus according to an exemplary embodiment. The apparatus 600 may include:
接收模块601,用于接收目标搜索信息;a receiving module 601, configured to receive target search information;
目标相关度确定模块602,用于根据所述目标搜索信息确定目标搜索结果,并通过相关度确定模型确定所述目标搜索结果与所述目标搜索信息之间的目标相关度;A target relevance determination module 602, configured to determine a target search result according to the target search information, and determine the target relevance between the target search result and the target search information through a relevance determination model;
其中,所述相关度确定模型是通过如下方式训练得到的:根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,其中,具有相同目标文本摘要信息的所述历史搜索结果与所述历史搜索信息之间的相关度被设定成相同,所述历史搜索结果是根据用户输入的所述历史搜索信息进行搜索得到的;将所述历史搜索信息和所述历史搜索结果作为模型的输入,所述历史搜索结果与所述历史搜索信息之间的相关度作为模型的目标输出,对所述模型进行训练,以得到所述相关度确定模型。Wherein, the correlation determination model is obtained by training in the following way: according to the historical operation behavior information performed by the user on multiple historical search results, the correlation between each historical search result and the historical search information is determined respectively, wherein there are The correlation between the historical search results of the same target text abstract information and the historical search information is set to be the same, and the historical search results are obtained by searching according to the historical search information input by the user; The historical search information and the historical search results are used as the input of the model, the correlation between the historical search results and the historical search information is used as the target output of the model, and the model is trained to obtain the correlation Determine the model.
可选地,所述装置600还可包括:展示顺序确定模块,用于根据所述目标搜索结果与所述目标搜索信息之间的所述目标相关度,确定所述目标搜索结果的展示顺序。Optionally, the apparatus 600 may further include: a display order determination module, configured to determine the display order of the target search results according to the target relevance between the target search results and the target search information.
关于上述实施例中的装置,其中各个模块执行操作的具体方式已经在有关方法的实施例中进行了详细描述,此处将不做详细阐述说明。Regarding the apparatus in the above-mentioned embodiments, the specific manner in which each module performs operations has been described in detail in the embodiments of the related method, and will not be described in detail here.
本公开还提供一种相关度确定模型训练装置,图7是根据一示例性实施例示出的一种相关度确定模型训练装置的框图,该装置700可包括:The present disclosure also provides a correlation determination model training apparatus. FIG. 7 is a block diagram of a correlation determination model training apparatus according to an exemplary embodiment. The apparatus 700 may include:
相关度确定模块701,用于根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,其中,具有相同目标文本摘要信息的所述历史搜索结果与所述历史搜索信息之间的相关度被设定成相同,所述历史搜索结果是根据用户输入的所述历史搜索信息进行搜索得到的; Relevance determination module 701 is used to determine the correlation between each historical search result and historical search information according to the historical operation behavior information performed by the user on a plurality of historical search results, wherein the The correlation between the historical search result and the historical search information is set to be the same, and the historical search result is obtained by searching according to the historical search information input by the user;
训练模块702,用于将所述历史搜索信息和所述历史搜索结果作为模型的输入,所述历史搜索结果与所述历史搜索信息之间的相关度作为模型的目标输出,对所述模型进行训练,以得到所述相关度确定模型。The training module 702 is used for taking the historical search information and the historical search results as the input of the model, and the correlation between the historical search results and the historical search information as the target output of the model, and performing the training on the model. training to obtain the correlation determination model.
可选地,所述相关度确定模块701,包括:目标文本摘要信息确定子模块,用于针对每一所述历史搜索结果,确定所述历史搜索结果的所述目标文本摘要信息;聚合子模块,用于将所述目标文本摘要信息相同的所述历史搜索结果聚合为搜索结果组;第一相关度确 定子模块,用于根据用户对所述搜索结果组中所包括的所述历史搜索结果实施的所述历史操作行为信息,确定所述搜索结果组中所包括的所述历史搜索结果与所述历史搜索信息之间的相关度。Optionally, the relevancy determination module 701 includes: a target text summary information determination submodule for determining the target text summary information of the historical search result for each historical search result; an aggregation submodule , used for aggregating the historical search results with the same target text summary information into a search result group; a first correlation determination sub-module is used for the historical search results included in the search result group according to the user's The implemented historical operation behavior information determines the degree of relevancy between the historical search results included in the search result group and the historical search information.
可选地,所述第一相关度确定子模块,包括:行为信息确定子模块,用于针对用户根据所述历史搜索信息进行的多次历史搜索行为中的每次历史搜索行为,根据在该次历史搜索行为中、用户对所述搜索结果组中所包括的所述历史搜索结果实施的所述历史操作行为信息,确定在该次历史搜索行为中用户对所述搜索结果组实施的目标历史操作行为信息;行为特征信息确定子模块,用于根据在多次历史搜索行为中用户分别对所述搜索结果组实施的所述目标历史操作行为信息,确定所述用户对所述搜索结果组实施的历史操作行为特征信息;第二相关度确定子模块,用于根据所述历史操作行为特征信息,确定所述搜索结果组中所包括的所述历史搜索结果与所述历史搜索信息之间的所述相关度。Optionally, the first relevancy determination sub-module includes: a behavior information determination sub-module, which is used for each historical search behavior in the multiple historical search behaviors performed by the user according to the historical search information, according to the In this historical search behavior, the historical operation behavior information performed by the user on the historical search results included in the search result group determines the target history performed by the user on the search result group in this historical search behavior Operation behavior information; a behavior feature information determination sub-module, configured to determine that the user has performed the search result group according to the target historical operation behavior information respectively performed by the user on the search result group in multiple historical search behaviors the historical operation behavior feature information; the second correlation determination sub-module is used to determine the relationship between the historical search results included in the search result group and the historical search information according to the historical operation behavior feature information the correlation.
可选地,所述目标文本摘要信息确定子模块,包括:文本信息获取子模块,用于获取所述历史搜索结果中属于预设主题的文本信息,其中,所述文本信息包括所述预设主题下的全部主题内容或部分主题内容,在所述文本信息包括所述预设主题下的部分主题内容的情况下,属于所述预设主题的文本信息为多个;候选文本摘要信息确定子模块,用于确定候选文本摘要信息,其中,所述候选文本摘要信息包括属于每一所述预设主题的所述文本信息、以及所述文本信息的文本组合信息;匹配度确定子模块,用于分别确定每一所述候选文本摘要信息与所述历史搜索信息之间的匹配度;摘要信息确定子模块,用于将与所述历史搜索信息之间的匹配度最高的所述候选文本摘要信息,确定为所述目标文本摘要信息。Optionally, the target text summary information determination sub-module includes: a text information acquisition sub-module for acquiring text information belonging to preset topics in the historical search results, wherein the text information includes the preset All or part of the theme content under the theme, in the case that the text information includes part of the theme content under the preset theme, there are multiple pieces of text information belonging to the preset theme; module for determining candidate text summary information, wherein the candidate text summary information includes the text information belonging to each of the preset topics and the text combination information of the text information; the matching degree determination sub-module, using for determining the matching degree between each of the candidate text abstract information and the historical search information respectively; the abstract information determination sub-module is used to determine the candidate text abstract with the highest matching degree with the historical search information information, which is determined as the target text summary information.
通过上述技术方案,根据目标搜索信息确定目标搜索结果,并通过相关度确定模型确定目标搜索结果与目标搜索信息之间的目标相关度。在该相关度确定模型的训练阶段,在历史搜索结果符合用户的搜索意图的情况下,用户会对历史搜索结果实施操作,因此根据用户对多个历史搜索结果实施的历史操作行为信息,确定各个历史搜索结果与历史搜索信息之间的相关度,该相关度可作为模型训练数据,训练得到相关度确定模型。这样,无需人工标注训练数据,可快速获得模型训练所需的数据,并且解决人工标注的相关度不准确的问题。而且,在确定历史搜索结果与历史搜索信息之间的相关度时,具有相同目标文本摘要信息的历史搜索结果与历史搜索信息之间的相关度被设定成相同,能够减小诸如展示顺序等其他因素对用户操作行为的影响,使得模型训练数据以及训练出的相关度确定模型 更为准确,从而通过相关度确定模型确定出的目标搜索结果与目标搜索信息之间的相关度更准确,为判断该目标搜索结果是否符合用户的搜索意图提供准确的依据。Through the above technical solution, the target search result is determined according to the target search information, and the target correlation between the target search result and the target search information is determined by the correlation determination model. In the training phase of the correlation determination model, if the historical search results conform to the user's search intent, the user will perform operations on the historical search results. The correlation between the historical search results and the historical search information, the correlation can be used as model training data, and the correlation determination model is obtained by training. In this way, the data required for model training can be quickly obtained without manual labeling of training data, and the problem of inaccurate correlation of manual labeling can be solved. Also, when determining the degree of relevancy between the historical search results and the historical search information, the degree of relevancy between the historical search results and the historical search information having the same target text summary information is set to be the same, and it is possible to reduce the order of presentation, etc. The influence of other factors on the user's operation behavior makes the model training data and the trained correlation determination model more accurate, so that the correlation between the target search results determined by the correlation determination model and the target search information is more accurate. It provides an accurate basis for judging whether the target search result conforms to the user's search intent.
关于上述实施例中的装置,其中各个模块执行操作的具体方式已经在有关方法的实施例中进行了详细描述,此处将不做详细阐述说明。应注意,上述各个模块的划分并非限制具体的实现方式,上述各个模块例如可以以软件、硬件或者软硬件结合的方式来实现。在实际实现时,上述各个模块可被实现为独立的物理实体,或者也可由单个实体(例如,处理器(CPU或DSP等)、集成电路等)来实现。需要注意的是,尽管图中将各个模块示为分立的模块,但是这些模块中的一个或多个也可以合并为一个模块,或者拆分为多个模块。Regarding the apparatus in the above-mentioned embodiments, the specific manner in which each module performs operations has been described in detail in the embodiments of the related method, and will not be described in detail here. It should be noted that the division of the above-mentioned modules does not limit the specific implementation manner, and the above-mentioned various modules may be implemented by, for example, software, hardware, or a combination of software and hardware. In actual implementation, the above-mentioned modules may be implemented as independent physical entities, or may also be implemented by a single entity (eg, a processor (CPU or DSP, etc.), an integrated circuit, etc.). It should be noted that although the respective modules are shown as separate modules in the figures, one or more of these modules may also be combined into one module or split into multiple modules.
下面参考图8,其示出了适于用来实现本公开实施例的电子设备800的结构示意图。本公开实施例中的终端设备可以包括但不限于诸如移动电话、笔记本电脑、数字广播接收器、PDA(个人数字助理)、PAD(平板电脑)、PMP(便携式多媒体播放器)、车载终端(例如车载导航终端)等等的移动终端以及诸如数字TV、台式计算机等等的固定终端。图8示出的电子设备仅仅是一个示例,不应对本公开实施例的功能和使用范围带来任何限制。Referring next to FIG. 8 , it shows a schematic structural diagram of an electronic device 800 suitable for implementing an embodiment of the present disclosure. Terminal devices in the embodiments of the present disclosure may include, but are not limited to, such as mobile phones, notebook computers, digital broadcast receivers, PDAs (personal digital assistants), PADs (tablets), PMPs (portable multimedia players), vehicle-mounted terminals (eg, mobile terminals such as in-vehicle navigation terminals), etc., and stationary terminals such as digital TVs, desktop computers, and the like. The electronic device shown in FIG. 8 is only an example, and should not impose any limitation on the function and scope of use of the embodiments of the present disclosure.
如图8所示,电子设备800可以包括处理装置(例如中央处理器、图形处理器等)801,其可以根据存储在只读存储器(ROM)802中的程序或者从存储装置808加载到随机访问存储器(RAM)803中的程序而执行各种适当的动作和处理。在RAM 803中,还存储有电子设备800操作所需的各种程序和数据。处理装置801、ROM 802以及RAM 803通过总线804彼此相连。输入/输出(I/O)接口805也连接至总线804。As shown in FIG. 8 , an electronic device 800 may include a processing device (eg, a central processing unit, a graphics processor, etc.) 801 that may be loaded into random access according to a program stored in a read only memory (ROM) 802 or from a storage device 808 Various appropriate actions and processes are executed by the programs in the memory (RAM) 803 . In the RAM 803, various programs and data required for the operation of the electronic device 800 are also stored. The processing device 801, the ROM 802, and the RAM 803 are connected to each other through a bus 804. An input/output (I/O) interface 805 is also connected to bus 804 .
通常,以下装置可以连接至I/O接口805:包括例如触摸屏、触摸板、键盘、鼠标、摄像头、麦克风、加速度计、陀螺仪等的输入装置806;包括例如液晶显示器(LCD)、扬声器、振动器等的输出装置807;包括例如磁带、硬盘等的存储装置808;以及通信装置809。通信装置809可以允许电子设备800与其他设备进行无线或有线通信以交换数据。虽然图8示出了具有各种装置的电子设备800,但是应理解的是,并不要求实施或具备所有示出的装置。可以替代地实施或具备更多或更少的装置。Typically, the following devices may be connected to the I/O interface 805: input devices 806 including, for example, a touch screen, touchpad, keyboard, mouse, camera, microphone, accelerometer, gyroscope, etc.; including, for example, a liquid crystal display (LCD), speakers, vibration An output device 807 of a computer, etc.; a storage device 808 including, for example, a magnetic tape, a hard disk, etc.; and a communication device 809. Communication means 809 may allow electronic device 800 to communicate wirelessly or by wire with other devices to exchange data. While FIG. 8 shows an electronic device 800 having various means, it should be understood that not all of the illustrated means are required to be implemented or provided. More or fewer devices may alternatively be implemented or provided.
特别地,根据本公开的实施例,上文参考流程图描述的过程可以被实现为计算机软件程序。本公开的实施例还包括一种计算机程序产品,包括:指令,该指令在被处理装置执行时实现本公开实施例的搜索处理方法的步骤或本公开实施例的相关度确定模型训练方法的步骤。例如,本公开的实施例包括一种计算机程序产品,其包括承载在非暂态计算机 可读介质上的计算机程序,该计算机程序包含用于执行流程图所示的方法的程序代码。在这样的实施例中,该计算机程序可以通过通信装置809从网络上被下载和安装,或者从存储装置808被安装,或者从ROM 802被安装。在该计算机程序被处理装置801执行时,执行本公开实施例的方法中限定的上述功能。In particular, according to embodiments of the present disclosure, the processes described above with reference to the flowcharts may be implemented as computer software programs. Embodiments of the present disclosure also include a computer program product, comprising: instructions that, when executed by a processing device, implement the steps of the search processing method of the embodiments of the present disclosure or the steps of the relevance determination model training method of the embodiments of the present disclosure . For example, embodiments of the present disclosure include a computer program product comprising a computer program carried on a non-transitory computer readable medium, the computer program containing program code for performing the method illustrated in the flowchart. In such an embodiment, the computer program may be downloaded and installed from the network via the communication device 809, or from the storage device 808, or from the ROM 802. When the computer program is executed by the processing device 801, the above-mentioned functions defined in the methods of the embodiments of the present disclosure are executed.
需要说明的是,本公开上述的计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质或者是上述两者的任意组合。计算机可读存储介质例如可以是——但不限于——电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、光纤、便携式紧凑磁盘只读存储器(CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本公开中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而在本公开中,计算机可读信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读信号介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。计算机可读介质上包含的程序代码可以用任何适当的介质传输,包括但不限于:电线、光缆、RF(射频)等等,或者上述的任意合适的组合。It should be noted that the computer-readable medium mentioned above in the present disclosure may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two. The computer-readable storage medium can be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or a combination of any of the above. More specific examples of computer readable storage media may include, but are not limited to, electrical connections with one or more wires, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable Programmable read only memory (EPROM or flash memory), fiber optics, portable compact disk read only memory (CD-ROM), optical storage devices, magnetic storage devices, or any suitable combination of the foregoing. In this disclosure, a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device. In the present disclosure, however, a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave with computer-readable program code embodied thereon. Such propagated data signals may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing. A computer-readable signal medium can also be any computer-readable medium other than a computer-readable storage medium that can transmit, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device . Program code embodied on a computer readable medium may be transmitted using any suitable medium including, but not limited to, electrical wire, optical fiber cable, RF (radio frequency), etc., or any suitable combination of the foregoing.
在一些实施方式中,客户端、服务器可以利用诸如HTTP(HyperText Transfer Protocol,超文本传输协议)之类的任何当前已知或未来研发的网络协议进行通信,并且可以与任意形式或介质的数字数据通信(例如,通信网络)互连。通信网络的示例包括局域网(“LAN”),广域网(“WAN”),网际网(例如,互联网)以及端对端网络(例如,ad hoc端对端网络),以及任何当前已知或未来研发的网络。In some embodiments, the client and server can use any currently known or future developed network protocol such as HTTP (HyperText Transfer Protocol) to communicate, and can communicate with digital data in any form or medium Communication (eg, a communication network) interconnects. Examples of communication networks include local area networks ("LAN"), wide area networks ("WAN"), the Internet (eg, the Internet), and peer-to-peer networks (eg, ad hoc peer-to-peer networks), as well as any currently known or future development network of.
上述计算机可读介质可以是上述电子设备中所包含的;也可以是单独存在,而未装配入该电子设备中。The above-mentioned computer-readable medium may be included in the above-mentioned electronic device; or may exist alone without being assembled into the electronic device.
上述计算机可读介质承载有一个或者多个程序,当上述一个或者多个程序被该电子设备执行时,使得该电子设备:接收目标搜索信息;根据所述目标搜索信息确定目标搜索结 果,并通过相关度确定模型确定所述目标搜索结果与所述目标搜索信息之间的目标相关度;其中,所述相关度确定模型是通过如下方式训练得到的:根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,其中,具有相同目标文本摘要信息的所述历史搜索结果与所述历史搜索信息之间的相关度被设定成相同,所述历史搜索结果是根据用户输入的所述历史搜索信息进行搜索得到的;将所述历史搜索信息和所述历史搜索结果作为模型的输入,所述历史搜索结果与所述历史搜索信息之间的相关度作为模型的目标输出,对所述模型进行训练,以得到所述相关度确定模型。The above-mentioned computer-readable medium carries one or more programs, and when the above-mentioned one or more programs are executed by the electronic device, the electronic device: receives target search information; determines a target search result according to the target search information, and passes The relevance determination model determines the target relevance between the target search result and the target search information; wherein, the relevance determination model is obtained by training in the following manner: according to the history of the user's implementation of multiple historical search results Operating behavior information, respectively determining the correlation between each historical search result and historical search information, wherein the correlation between the historical search results and the historical search information with the same target text abstract information is set to be the same , the historical search result is obtained by searching according to the historical search information input by the user; the historical search information and the historical search result are used as the input of the model, and the difference between the historical search result and the historical search information The correlation between the two is used as the target output of the model, and the model is trained to obtain the correlation determination model.
或者,上述计算机可读介质承载有一个或者多个程序,当上述一个或者多个程序被该电子设备执行时,使得该电子设备:根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,其中,具有相同目标文本摘要信息的所述历史搜索结果与所述历史搜索信息之间的相关度被设定成相同,所述历史搜索结果是根据用户输入的所述历史搜索信息进行搜索得到的;将所述历史搜索信息和所述历史搜索结果作为模型的输入,所述历史搜索结果与所述历史搜索信息之间的相关度作为模型的目标输出,对所述模型进行训练,以得到所述相关度确定模型。Alternatively, the above-mentioned computer-readable medium carries one or more programs, and when the above-mentioned one or more programs are executed by the electronic device, the electronic device: according to the historical operation behavior information performed by the user on multiple historical search results, respectively determining the degree of relevance between each historical search result and historical search information, wherein the degree of relevancy between the historical search results and the historical search information having the same target text summary information is set to be the same, the historical search The result is obtained by searching according to the historical search information input by the user; the historical search information and the historical search results are taken as the input of the model, and the correlation between the historical search results and the historical search information is taken as The target output of the model, and the model is trained to obtain the correlation determination model.
可以以一种或多种程序设计语言或其组合来编写用于执行本公开的操作的计算机程序代码,上述程序设计语言包括但不限于面向对象的程序设计语言—诸如Java、Smalltalk、C++,还包括常规的过程式程序设计语言——诸如“C”语言或类似的程序设计语言。程序代码可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络——包括局域网(LAN)或广域网(WAN)——连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。Computer program code for performing operations of the present disclosure may be written in one or more programming languages, including but not limited to object-oriented programming languages—such as Java, Smalltalk, C++, and This includes conventional procedural programming languages - such as the "C" language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (eg, using an Internet service provider to via Internet connection).
附图中的流程图和框图,图示了按照本公开各种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,该模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实 际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code that contains one or more logical functions for implementing the specified functions executable instructions. It should also be noted that, in some alternative implementations, the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It is also noted that each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented in dedicated hardware-based systems that perform the specified functions or operations , or can be implemented in a combination of dedicated hardware and computer instructions.
描述于本公开实施例中所涉及到的模块可以通过软件的方式实现,也可以通过硬件的方式来实现。其中,模块的名称在某种情况下并不构成对该模块本身的限定,例如,接收模块还可以被描述为“目标搜索信息接收模块”。The modules involved in the embodiments of the present disclosure may be implemented in software or hardware. Wherein, the name of the module does not constitute a limitation of the module itself under certain circumstances, for example, the receiving module may also be described as a "target search information receiving module".
本文中以上描述的功能可以至少部分地由一个或多个硬件逻辑部件来执行。例如,非限制性地,可以使用的示范类型的硬件逻辑部件包括:现场可编程门阵列(FPGA)、专用集成电路(ASIC)、专用标准产品(ASSP)、片上系统(SOC)、复杂可编程逻辑设备(CPLD)等等。The functions described herein above may be performed, at least in part, by one or more hardware logic components. For example, without limitation, exemplary types of hardware logic components that may be used include: Field Programmable Gate Arrays (FPGAs), Application Specific Integrated Circuits (ASICs), Application Specific Standard Products (ASSPs), Systems on Chips (SOCs), Complex Programmable Logical Devices (CPLDs) and more.
在本公开的上下文中,机器可读介质可以是有形的介质,其可以包含或存储以供指令执行系统、装置或设备使用或与指令执行系统、装置或设备结合地使用的程序。机器可读介质可以是机器可读信号介质或机器可读储存介质。机器可读介质可以包括但不限于电子的、磁性的、光学的、电磁的、红外的、或半导体系统、装置或设备,或者上述内容的任何合适组合。机器可读存储介质的更具体示例会包括基于一个或多个线的电气连接、便携式计算机盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦除可编程只读存储器(EPROM或快闪存储器)、光纤、便捷式紧凑盘只读存储器(CD-ROM)、光学储存设备、磁储存设备、或上述内容的任何合适组合。In the context of the present disclosure, a machine-readable medium may be a tangible medium that may contain or store a program for use by or in connection with the instruction execution system, apparatus or device. The machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. Machine-readable media may include, but are not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, devices, or devices, or any suitable combination of the foregoing. More specific examples of machine-readable storage media would include one or more wire-based electrical connections, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), fiber optics, compact disk read only memory (CD-ROM), optical storage, magnetic storage, or any suitable combination of the foregoing.
根据本公开的一个或多个实施例,示例性实施例1提供了一种搜索处理方法,所述方法包括:接收目标搜索信息;根据所述目标搜索信息确定目标搜索结果,并通过相关度确定模型确定所述目标搜索结果与所述目标搜索信息之间的目标相关度;其中,所述相关度确定模型是通过如下方式训练得到的:根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,其中,具有相同目标文本摘要信息的所述历史搜索结果与所述历史搜索信息之间的相关度被设定成相同,所述历史搜索结果是根据用户输入的所述历史搜索信息进行搜索得到的;将所述历史搜索信息和所述历史搜索结果作为模型的输入,所述历史搜索结果与所述历史搜索信息之间的相关度作为模型的目标输出,对所述模型进行训练,以得到所述相关度确定模型。According to one or more embodiments of the present disclosure, exemplary embodiment 1 provides a search processing method, the method includes: receiving target search information; determining a target search result according to the target search information, and determining by correlation The model determines the target correlation between the target search result and the target search information; wherein, the correlation determination model is obtained by training in the following way: according to the historical operation behavior information performed by the user on a plurality of historical search results , respectively determine the correlation between each historical search result and historical search information, wherein the correlation between the historical search results with the same target text abstract information and the historical search information is set to be the same, the The historical search results are obtained by searching according to the historical search information input by the user; the historical search information and the historical search results are used as the input of the model, and the correlation between the historical search results and the historical search information The degree is used as the target output of the model, and the model is trained to obtain the correlation determination model.
根据本公开的一个或多个实施例,示例性实施例2提供了示例性实施例1的方法,所述根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,包括:针对每一所述历史搜索结果,确定所述历史搜索结果的所述目标文本摘要信息;将所述目标文本摘要信息相同的所述历史搜索结果聚合为搜索结果组;根据用户对所述搜索结果组中所包括的所述历史搜索结果实施的所述历史操作行为信息,确定所述搜索结果组中所包括的所述历史搜索结果与所述历史搜索信息之间的相关度。According to one or more embodiments of the present disclosure, the exemplary embodiment 2 provides the method of the exemplary embodiment 1, wherein according to the historical operation behavior information performed by the user on the multiple historical search results, the relationship between each historical search result and each historical search result is determined respectively. The correlation between historical search information includes: for each historical search result, determining the target text summary information of the historical search result; aggregating the historical search results with the same target text abstract information as A search result group; according to the historical operation behavior information performed by the user on the historical search results included in the search result group, determine the historical search results included in the search result group and the historical search results correlation between information.
根据本公开的一个或多个实施例,示例性实施例3提供了示例性实施例2的方法,所述根据用户对所述搜索结果组中所包括的所述历史搜索结果实施的所述历史操作行为信息,确定所述搜索结果组中所包括的所述历史搜索结果与所述历史搜索信息之间的相关度,包括:针对用户根据所述历史搜索信息进行的多次历史搜索行为中的每次历史搜索行为,根据在该次历史搜索行为中、用户对所述搜索结果组中所包括的所述历史搜索结果实施的所述历史操作行为信息,确定在该次历史搜索行为中用户对所述搜索结果组实施的目标历史操作行为信息;根据在多次历史搜索行为中用户分别对所述搜索结果组实施的所述目标历史操作行为信息,确定所述用户对所述搜索结果组实施的历史操作行为特征信息;根据所述历史操作行为特征信息,确定所述搜索结果组中所包括的所述历史搜索结果与所述历史搜索信息之间的所述相关度。According to one or more embodiments of the present disclosure, exemplary embodiment 3 provides the method of exemplary embodiment 2 according to the history performed by the user on the historical search results included in the search result group Operation behavior information, determining the correlation between the historical search results included in the search result group and the historical search information, including: for the user according to the historical search information for multiple historical search behaviors. For each historical search behavior, according to the historical operation behavior information performed by the user on the historical search results included in the search result group in the historical search behavior, determine the user's interest in the historical search behavior in this historical search behavior. The target historical operation behavior information implemented by the search result group; according to the target historical operation behavior information respectively implemented by the user on the search result group in multiple historical search behaviors, it is determined that the user implements the search result group. The historical operation behavior feature information; according to the historical operation behavior feature information, determine the correlation between the historical search results included in the search result group and the historical search information.
根据本公开的一个或多个实施例,示例性实施例4提供了示例性实施例2的方法,所述确定所述历史搜索结果的所述目标文本摘要信息,包括:获取所述历史搜索结果中属于预设主题的文本信息,其中,所述文本信息包括所述预设主题下的全部主题内容或部分主题内容,在所述文本信息包括所述预设主题下的部分主题内容的情况下,属于所述预设主题的文本信息为多个;确定候选文本摘要信息,其中,所述候选文本摘要信息包括属于每一所述预设主题的所述文本信息、以及所述文本信息的文本组合信息;分别确定每一所述候选文本摘要信息与所述历史搜索信息之间的匹配度;将与所述历史搜索信息之间的匹配度最高的所述候选文本摘要信息,确定为所述目标文本摘要信息。According to one or more embodiments of the present disclosure, Exemplary Embodiment 4 provides the method of Exemplary Embodiment 2, and the determining the target text summary information of the historical search result includes: acquiring the historical search result The text information belonging to the preset theme in the text information, wherein the text information includes all the theme content or part of the theme content under the preset theme, in the case that the text information includes part of the theme content under the preset theme , there are multiple pieces of text information belonging to the preset theme; and candidate text summary information is determined, wherein the candidate text summary information includes the text information belonging to each preset theme and the text of the text information combination information; determine the degree of matching between each candidate text summary information and the historical search information respectively; determine the candidate text abstract information with the highest matching degree with the historical search information as the Target text summary information.
根据本公开的一个或多个实施例,示例性实施例5提供了示例性实施例4的方法,所述历史搜索信息为针对媒体文件的搜索信息,相应地,所述预设主题包括媒体文件的名称、媒体文件的创作者、媒体文件的歌词、媒体文件所属的专辑、媒体文件的风格。According to one or more embodiments of the present disclosure, exemplary embodiment 5 provides the method of exemplary embodiment 4, the historical search information is search information for media files, and accordingly, the preset theme includes media files name of the media file, the creator of the media file, the lyrics of the media file, the album to which the media file belongs, and the style of the media file.
根据本公开的一个或多个实施例,示例性实施例6提供了示例性实施例1至示例性实施例5中任一示例性实施例的方法,所述方法还包括:根据所述目标搜索结果与所述目标搜索信息之间的所述目标相关度,确定所述目标搜索结果的展示顺序。According to one or more embodiments of the present disclosure, Exemplary Embodiment 6 provides the method of any one of Exemplary Embodiments 1 to 5, the method further comprising: searching according to the target The target relevance between the results and the target search information determines the display order of the target search results.
根据本公开的一个或多个实施例,示例性实施例7提供了一种相关度确定模型训练方法,所述方法包括:根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,其中,具有相同目标文本摘要信息的所述历史搜索结果与所述历史搜索信息之间的相关度被设定成相同,所述历史搜索结果是根据用户输入的所述历史搜索信息进行搜索得到的;将所述历史搜索信息和所述历史搜索结果作为模型的输入,所述历史搜索结果与所述历史搜索信息之间的相关度作为模型的目标输出,对所述模型进行训练,以得到所述相关度确定模型。According to one or more embodiments of the present disclosure, exemplary embodiment 7 provides a method for training a relevance determination model, the method comprising: according to historical operation behavior information performed by a user on a plurality of historical search results, respectively determining The correlation between historical search results and historical search information, wherein the correlation between the historical search results and the historical search information with the same target text summary information is set to be the same, and the historical search results are Obtained by searching according to the historical search information input by the user; using the historical search information and the historical search results as the input of the model, and the correlation between the historical search results and the historical search information as the model's target output, the model is trained to obtain the correlation determination model.
根据本公开的一个或多个实施例,示例性实施例8提供了示例性实施例7的方法,所述根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,包括:针对每一所述历史搜索结果,确定所述历史搜索结果的所述目标文本摘要信息;将所述目标文本摘要信息相同的所述历史搜索结果聚合为搜索结果组;根据用户对所述搜索结果组中所包括的所述历史搜索结果实施的所述历史操作行为信息,确定所述搜索结果组中所包括的所述历史搜索结果与所述历史搜索信息之间的相关度。According to one or more embodiments of the present disclosure, the exemplary embodiment 8 provides the method of the exemplary embodiment 7, wherein according to the historical operation behavior information performed by the user on the multiple historical search results, the relationship between each historical search result and each historical search result is determined respectively. The correlation between historical search information includes: for each historical search result, determining the target text summary information of the historical search result; aggregating the historical search results with the same target text abstract information as A search result group; according to the historical operation behavior information performed by the user on the historical search results included in the search result group, determine the historical search results included in the search result group and the historical search results correlation between information.
根据本公开的一个或多个实施例,示例性实施例9提供了示例性实施例8的方法,所述根据用户对所述搜索结果组中所包括的所述历史搜索结果实施的所述历史操作行为信息,确定所述搜索结果组中所包括的所述历史搜索结果与所述历史搜索信息之间的相关度,包括:针对用户根据所述历史搜索信息进行的多次历史搜索行为中的每次历史搜索行为,根据在该次历史搜索行为中、用户对所述搜索结果组中所包括的所述历史搜索结果实施的所述历史操作行为信息,确定在该次历史搜索行为中用户对所述搜索结果组实施的目标历史操作行为信息;根据在多次历史搜索行为中用户分别对所述搜索结果组实施的所述目标历史操作行为信息,确定所述用户对所述搜索结果组实施的历史操作行为特征信息;根据所述历史操作行为特征信息,确定所述搜索结果组中所包括的所述历史搜索结果与所述历史搜索信息之间的所述相关度。According to one or more embodiments of the present disclosure, Exemplary Embodiment 9 provides the method of Exemplary Embodiment 8 according to the history performed by the user on the historical search results included in the set of search results Operation behavior information, determining the correlation between the historical search results included in the search result group and the historical search information, including: for the user according to the historical search information for multiple historical search behaviors. For each historical search behavior, according to the historical operation behavior information performed by the user on the historical search results included in the search result group in the historical search behavior, determine the user's interest in the historical search behavior in this historical search behavior. The target historical operation behavior information implemented by the search result group; according to the target historical operation behavior information respectively implemented by the user on the search result group in multiple historical search behaviors, it is determined that the user implements the search result group. The historical operation behavior feature information; according to the historical operation behavior feature information, determine the correlation between the historical search results included in the search result group and the historical search information.
根据本公开的一个或多个实施例,示例性实施例10提供了示例性实施例8的方法, 所述确定所述历史搜索结果的所述目标文本摘要信息,包括:获取所述历史搜索结果中属于预设主题的文本信息,其中,所述文本信息包括所述预设主题下的全部主题内容或部分主题内容,在所述文本信息包括所述预设主题下的部分主题内容的情况下,属于所述预设主题的文本信息为多个;确定候选文本摘要信息,其中,所述候选文本摘要信息包括属于每一所述预设主题的所述文本信息、以及所述文本信息的文本组合信息;分别确定每一所述候选文本摘要信息与所述历史搜索信息之间的匹配度;将与所述历史搜索信息之间的匹配度最高的所述候选文本摘要信息,确定为所述目标文本摘要信息。According to one or more embodiments of the present disclosure, Exemplary Embodiment 10 provides the method of Exemplary Embodiment 8, wherein the determining the target text summary information of the historical search result includes: acquiring the historical search result The text information belonging to the preset theme in the text information, wherein the text information includes all the theme content or part of the theme content under the preset theme, in the case that the text information includes part of the theme content under the preset theme , there are multiple pieces of text information belonging to the preset theme; and candidate text summary information is determined, wherein the candidate text summary information includes the text information belonging to each preset theme and the text of the text information combination information; determine the degree of matching between each candidate text summary information and the historical search information respectively; determine the candidate text abstract information with the highest matching degree with the historical search information as the Target text summary information.
根据本公开的一个或多个实施例,示例性实施例11提供了示例性实施例10的方法,所述历史搜索信息为针对媒体文件的搜索信息,相应地,所述预设主题包括媒体文件的名称、媒体文件的创作者、媒体文件的歌词、媒体文件所属的专辑、媒体文件的风格。According to one or more embodiments of the present disclosure, Exemplary Embodiment 11 provides the method of Exemplary Embodiment 10, the historical search information is search information for media files, and accordingly, the preset theme includes media files name of the media file, the creator of the media file, the lyrics of the media file, the album to which the media file belongs, and the style of the media file.
根据本公开的一个或多个实施例,示例性实施例12提供了一种搜索处理装置,所述装置包括:接收模块,用于接收目标搜索信息;目标相关度确定模块,用于根据所述目标搜索信息确定目标搜索结果,并通过相关度确定模型确定所述目标搜索结果与所述目标搜索信息之间的目标相关度;其中,所述相关度确定模型是通过如下方式训练得到的:根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,其中,具有相同目标文本摘要信息的所述历史搜索结果与所述历史搜索信息之间的相关度被设定成相同,所述历史搜索结果是根据用户输入的所述历史搜索信息进行搜索得到的;将所述历史搜索信息和所述历史搜索结果作为模型的输入,所述历史搜索结果与所述历史搜索信息之间的相关度作为模型的目标输出,对所述模型进行训练,以得到所述相关度确定模型。According to one or more embodiments of the present disclosure, exemplary embodiment 12 provides a search processing apparatus, the apparatus includes: a receiving module for receiving target search information; The target search information determines the target search result, and the target correlation between the target search result and the target search information is determined by a correlation determination model; wherein, the correlation determination model is obtained by training in the following manner: The historical operation behavior information performed by the user on a plurality of historical search results respectively determines the correlation between each historical search result and the historical search information, wherein the historical search results with the same target text summary information and the historical search information The correlations are set to be the same, and the historical search results are obtained by searching according to the historical search information input by the user; taking the historical search information and the historical search results as the input of the model, the The correlation between the historical search results and the historical search information is used as the target output of the model, and the model is trained to obtain the correlation determination model.
根据本公开的一个或多个实施例,示例性实施例13提供了一种相关度确定模型训练装置,所述装置包括:相关度确定模块,用于根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,其中,具有相同目标文本摘要信息的所述历史搜索结果与所述历史搜索信息之间的相关度被设定成相同,所述历史搜索结果是根据用户输入的所述历史搜索信息进行搜索得到的;训练模块,用于将所述历史搜索信息和所述历史搜索结果作为模型的输入,所述历史搜索结果与所述历史搜索信息之间的相关度作为模型的目标输出,对所述模型进行训练,以得到所述相关度确定模型。According to one or more embodiments of the present disclosure, exemplary embodiment 13 provides an apparatus for training a relevance determination model, the apparatus comprising: a relevance determination module configured to implement a history of a plurality of historical search results according to a user Operating behavior information, respectively determining the correlation between each historical search result and historical search information, wherein the correlation between the historical search results and the historical search information with the same target text abstract information is set to be the same , the historical search results are obtained by searching according to the historical search information input by the user; the training module is used to use the historical search information and the historical search results as the input of the model, and the historical search results are the same as the historical search results. The correlation between the historical search information is used as the target output of the model, and the model is trained to obtain the correlation determination model.
根据本公开的一个或多个实施例,示例性实施例14提供了一种计算机可读介质,其上存储有计算机程序,该程序被处理装置执行时实现示例性实施例1-示例性实施例6中任一项所述方法的步骤。According to one or more embodiments of the present disclosure, Exemplary Embodiment 14 provides a computer-readable medium having stored thereon a computer program that, when executed by a processing apparatus, implements Exemplary Embodiment 1 - Exemplary Embodiment The steps of any one of the methods in 6.
根据本公开的一个或多个实施例,示例性实施例15提供了一种计算机可读介质,其上存储有计算机程序,该程序被处理装置执行时实现示例性实施例7-示例性实施例11中任一项所述方法的步骤。According to one or more embodiments of the present disclosure, Exemplary Embodiment 15 provides a computer-readable medium having stored thereon a computer program that, when executed by a processing apparatus, implements Exemplary Embodiment 7 - Exemplary Embodiment The steps of any one of 11.
根据本公开的一个或多个实施例,示例性实施例16提供了一种电子设备,包括:存储装置,其上存储有计算机程序;处理装置,用于执行所述存储装置中的所述计算机程序,以实现示例性实施例1-示例性实施例6中任一项所述方法的步骤。According to one or more embodiments of the present disclosure, exemplary embodiment 16 provides an electronic device, comprising: a storage device on which a computer program is stored; and a processing device for executing the computer in the storage device A program to implement the steps of the method of any one of Exemplary Embodiment 1 - Exemplary Embodiment 6.
根据本公开的一个或多个实施例,示例性实施例17提供了一种电子设备,包括:存储装置,其上存储有计算机程序;处理装置,用于执行所述存储装置中的所述计算机程序,以实现示例性实施例7-示例性实施例11中任一项所述方法的步骤。According to one or more embodiments of the present disclosure, exemplary embodiment 17 provides an electronic device, comprising: a storage device on which a computer program is stored; and a processing device for executing the computer in the storage device A program to implement the steps of the method of any of Exemplary Embodiment 7-Example Embodiment 11.
以上描述仅为本公开的较佳实施例以及对所运用技术原理的说明。本领域技术人员应当理解,本公开中所涉及的公开范围,并不限于上述技术特征的特定组合而成的技术方案,同时也应涵盖在不脱离上述公开构思的情况下,由上述技术特征或其等同特征进行任意组合而形成的其它技术方案。例如上述特征与本公开中公开的(但不限于)具有类似功能的技术特征进行互相替换而形成的技术方案。The above description is merely a preferred embodiment of the present disclosure and an illustration of the technical principles employed. Those skilled in the art should understand that the scope of the disclosure involved in the present disclosure is not limited to the technical solutions formed by the specific combination of the above-mentioned technical features, and should also cover, without departing from the above-mentioned disclosed concept, the technical solutions formed by the above-mentioned technical features or Other technical solutions formed by any combination of its equivalent features. For example, a technical solution is formed by replacing the above features with the technical features disclosed in the present disclosure (but not limited to) with similar functions.
此外,虽然采用特定次序描绘了各操作,但是这不应当理解为要求这些操作以所示出的特定次序或以顺序次序执行来执行。在一定环境下,多任务和并行处理可能是有利的。同样地,虽然在上面论述中包含了若干具体实现细节,但是这些不应当被解释为对本公开的范围的限制。在单独的实施例的上下文中描述的某些特征还可以组合地实现在单个实施例中。相反地,在单个实施例的上下文中描述的各种特征也可以单独地或以任何合适的子组合的方式实现在多个实施例中。Additionally, although operations are depicted in a particular order, this should not be construed as requiring that the operations be performed in the particular order shown or in a sequential order. Under certain circumstances, multitasking and parallel processing may be advantageous. Likewise, although the above discussion contains several implementation-specific details, these should not be construed as limitations on the scope of the present disclosure. Certain features that are described in the context of separate embodiments can also be implemented in combination in a single embodiment. Conversely, various features that are described in the context of a single embodiment can also be implemented in multiple embodiments separately or in any suitable subcombination.
尽管已经采用特定于结构特征和/或方法逻辑动作的语言描述了本主题,但是应当理解所附权利要求书中所限定的主题未必局限于上面描述的特定特征或动作。相反,上面所描述的特定特征和动作仅仅是实现权利要求书的示例形式。关于上述实施例中的装置,其中各个模块执行操作的具体方式已经在有关该方法的实施例中进行了详细描述,此处将不做详细阐述说明。Although the subject matter has been described in language specific to structural features and/or logical acts of method, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are merely example forms of implementing the claims. Regarding the apparatus in the above-mentioned embodiment, the specific manner in which each module performs operations has been described in detail in the embodiment of the method, and will not be described in detail here.

Claims (25)

  1. 一种搜索处理方法,所述方法包括:A search processing method, the method comprising:
    接收目标搜索信息;receive target search information;
    根据所述目标搜索信息确定目标搜索结果,并通过相关度确定模型确定所述目标搜索结果与所述目标搜索信息之间的目标相关度;Determine the target search result according to the target search information, and determine the target correlation between the target search result and the target search information through a correlation determination model;
    其中,所述相关度确定模型是通过如下方式训练得到的:Wherein, the correlation determination model is obtained by training in the following manner:
    根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,其中,具有相同目标文本摘要信息的所述历史搜索结果与所述历史搜索信息之间的相关度被设定成相同,所述历史搜索结果是根据用户输入的所述历史搜索信息进行搜索得到的;According to the historical operation behavior information performed by the user on multiple historical search results, the correlation between each historical search result and the historical search information is respectively determined, wherein the historical search results with the same target text summary information are the same as the historical search results. The correlation between the information is set to be the same, and the historical search result is obtained by searching according to the historical search information input by the user;
    将所述历史搜索信息和所述历史搜索结果作为模型的输入,所述历史搜索结果与所述历史搜索信息之间的相关度作为模型的目标输出,对所述模型进行训练,以得到所述相关度确定模型。The historical search information and the historical search results are used as the input of the model, and the correlation between the historical search results and the historical search information is used as the target output of the model, and the model is trained to obtain the Correlation determines the model.
  2. 根据权利要求1所述的方法,其中,所述根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,包括:The method according to claim 1, wherein determining the correlation between each historical search result and the historical search information according to the historical operation behavior information performed by the user on a plurality of historical search results, comprising:
    针对每一所述历史搜索结果,确定所述历史搜索结果的所述目标文本摘要信息;For each of the historical search results, determining the target text summary information of the historical search results;
    将所述目标文本摘要信息相同的所述历史搜索结果聚合为搜索结果组;Aggregating the historical search results with the same summary information of the target text into a search result group;
    根据用户对所述搜索结果组中所包括的所述历史搜索结果实施的所述历史操作行为信息,确定所述搜索结果组中所包括的所述历史搜索结果与所述历史搜索信息之间的相关度。Determine the relationship between the historical search results included in the search result group and the historical search information according to the historical operation behavior information performed by the user on the historical search results included in the search result group relativity.
  3. 根据权利要求2所述的方法,其中,所述根据用户对所述搜索结果组中所包括的所述历史搜索结果实施的所述历史操作行为信息,确定所述搜索结果组中所包括的所述历史搜索结果与所述历史搜索信息之间的相关度,包括:The method according to claim 2, wherein, according to the historical operation behavior information performed by the user on the historical search results included in the search result group, the determination of the information included in the search result group The correlation between the historical search results and the historical search information, including:
    针对用户根据所述历史搜索信息进行的多次历史搜索行为中的每次历史搜索行为,根据在该次历史搜索行为中、用户对所述搜索结果组中所包括的所述历史搜索结果实施的所述历史操作行为信息,确定在该次历史搜索行为中用户对所述搜索结果组实施的目标历史操作行为信息;For each historical search behavior among the multiple historical search behaviors performed by the user according to the historical search information, according to the historical search behavior performed by the user on the historical search results included in the search result group in this historical search behavior For the historical operation behavior information, determine the target historical operation behavior information performed by the user on the search result group in this historical search behavior;
    根据在多次历史搜索行为中用户分别对所述搜索结果组实施的所述目标历史操作行为信息,确定所述用户对所述搜索结果组实施的历史操作行为特征信息;According to the target historical operation behavior information respectively performed by the user on the search result group in multiple historical search behaviors, determine the historical operation behavior feature information performed by the user on the search result group;
    根据所述历史操作行为特征信息,确定所述搜索结果组中所包括的所述历史搜索结果与所述历史搜索信息之间的所述相关度。The degree of correlation between the historical search results included in the search result group and the historical search information is determined according to the historical operation behavior feature information.
  4. 根据权利要求2所述的方法,其中,所述确定所述历史搜索结果的所述目标文本摘要信息,包括:The method according to claim 2, wherein the determining the target text summary information of the historical search result comprises:
    获取所述历史搜索结果中属于预设主题的文本信息,其中,所述文本信息包括所述预设主题下的全部主题内容或部分主题内容;Acquiring text information belonging to a preset theme in the historical search results, wherein the text information includes all or part of the theme content under the preset theme;
    确定候选文本摘要信息,其中,所述候选文本摘要信息包括属于每一所述预设主题的所述文本信息、以及所述文本信息的文本组合信息;determining candidate text summary information, wherein the candidate text summary information includes the text information belonging to each of the preset topics and text combination information of the text information;
    分别确定每一所述候选文本摘要信息与所述历史搜索信息之间的匹配度;respectively determining the degree of matching between each of the candidate text abstract information and the historical search information;
    将与所述历史搜索信息之间的匹配度最高的所述候选文本摘要信息,确定为所述目标文本摘要信息。The candidate text summary information with the highest matching degree with the historical search information is determined as the target text summary information.
  5. 根据权利要求4所述的方法,其中,在所述文本信息包括所述预设主题下的部分主题内容的情况下,属于所述预设主题的文本信息为多个。The method according to claim 4, wherein, when the text information includes part of the subject content under the preset subject, there are multiple text information belonging to the preset subject.
  6. 根据权利要求1-5中任一项所述的方法,其中,所述历史搜索信息为针对媒体文件的搜索信息,相应地,所述预设主题包括媒体文件的名称、媒体文件的创作者、媒体文件的歌词、媒体文件所属的专辑、媒体文件的风格。The method according to any one of claims 1-5, wherein the historical search information is search information for media files, and correspondingly, the preset theme includes the name of the media file, the creator of the media file, The lyrics of the media file, the album to which the media file belongs, and the style of the media file.
  7. 根据权利要求1-6中任一项所述的方法,所述方法还包括:The method of any one of claims 1-6, further comprising:
    根据所述目标搜索结果与所述目标搜索信息之间的所述目标相关度,确定所述目标搜索结果的展示顺序。The display order of the target search results is determined according to the target relevance between the target search results and the target search information.
  8. 一种相关度确定模型训练方法,所述方法包括:A method for training a correlation determination model, the method comprising:
    根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,其中,具有相同目标文本摘要信息的所述历史搜索结果与所述历史搜索信息之间的相关度被设定成相同,所述历史搜索结果是根据用户输入的所述历史搜索信息进行搜索得到的;According to the historical operation behavior information performed by the user on multiple historical search results, the correlation between each historical search result and the historical search information is respectively determined, wherein the historical search results with the same target text summary information are the same as the historical search results. The correlation between the information is set to be the same, and the historical search result is obtained by searching according to the historical search information input by the user;
    将所述历史搜索信息和所述历史搜索结果作为模型的输入,所述历史搜索结果与所述历史搜索信息之间的相关度作为模型的目标输出,对所述模型进行训练,以得到所述相关度确定模型。The historical search information and the historical search results are used as the input of the model, and the correlation between the historical search results and the historical search information is used as the target output of the model, and the model is trained to obtain the Correlation determines the model.
  9. 根据权利要求8所述的方法,其中,所述根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,包括:The method according to claim 8, wherein determining the correlation between each historical search result and the historical search information according to the historical operation behavior information performed by the user on a plurality of historical search results, comprising:
    针对每一所述历史搜索结果,确定所述历史搜索结果的所述目标文本摘要信息;For each of the historical search results, determining the target text summary information of the historical search results;
    将所述目标文本摘要信息相同的所述历史搜索结果聚合为搜索结果组;Aggregating the historical search results with the same summary information of the target text into a search result group;
    根据用户对所述搜索结果组中所包括的所述历史搜索结果实施的所述历史操作行为信息,确定所述搜索结果组中所包括的所述历史搜索结果与所述历史搜索信息之间的相关度。Determine the relationship between the historical search results included in the search result group and the historical search information according to the historical operation behavior information performed by the user on the historical search results included in the search result group relativity.
  10. 根据权利要求9所述的方法,其中,所述根据用户对所述搜索结果组中所包括的所述历史搜索结果实施的所述历史操作行为信息,确定所述搜索结果组中所包括的所述历史搜索结果与所述历史搜索信息之间的相关度,包括:The method according to claim 9, wherein, according to the historical operation behavior information performed by the user on the historical search results included in the search result group, the determination of the information included in the search result group The correlation between the historical search results and the historical search information, including:
    针对用户根据所述历史搜索信息进行的多次历史搜索行为中的每次历史搜索行为,根据在该次历史搜索行为中、用户对所述搜索结果组中所包括的所述历史搜索结果实施的所述历史操作行为信息,确定在该次历史搜索行为中用户对所述搜索结果组实施的目标历史操作行为信息;For each historical search behavior among the multiple historical search behaviors performed by the user according to the historical search information, according to the historical search behavior performed by the user on the historical search results included in the search result group in this historical search behavior For the historical operation behavior information, determine the target historical operation behavior information performed by the user on the search result group in this historical search behavior;
    根据在多次历史搜索行为中用户分别对所述搜索结果组实施的所述目标历史操作行为信息,确定所述用户对所述搜索结果组实施的历史操作行为特征信息;According to the target historical operation behavior information respectively performed by the user on the search result group in multiple historical search behaviors, determine the historical operation behavior feature information performed by the user on the search result group;
    根据所述历史操作行为特征信息,确定所述搜索结果组中所包括的所述历史搜索结果与所述历史搜索信息之间的所述相关度。The degree of correlation between the historical search results included in the search result group and the historical search information is determined according to the historical operation behavior feature information.
  11. 根据权利要求9所述的方法,其中,所述确定所述历史搜索结果的所述目标文本摘要信息,包括:The method according to claim 9, wherein the determining the target text summary information of the historical search result comprises:
    获取所述历史搜索结果中属于预设主题的文本信息,其中,所述文本信息包括所述预设主题下的全部主题内容或部分主题内容;Acquiring text information belonging to a preset theme in the historical search results, wherein the text information includes all or part of the theme content under the preset theme;
    确定候选文本摘要信息,其中,所述候选文本摘要信息包括属于每一所述预设主题的所述文本信息、以及所述文本信息的文本组合信息;determining candidate text summary information, wherein the candidate text summary information includes the text information belonging to each of the preset topics and text combination information of the text information;
    分别确定每一所述候选文本摘要信息与所述历史搜索信息之间的匹配度;respectively determining the degree of matching between each of the candidate text abstract information and the historical search information;
    将与所述历史搜索信息之间的匹配度最高的所述候选文本摘要信息,确定为所述目标文本摘要信息。The candidate text summary information with the highest matching degree with the historical search information is determined as the target text summary information.
  12. 根据权利要求11所述的方法,其中,在所述文本信息包括所述预设主题下的部分主题内容的情况下,属于所述预设主题的文本信息为多个。The method according to claim 11, wherein, when the text information includes part of the subject content under the preset subject, there are multiple text information belonging to the preset subject.
  13. 根据权利要求8-12中任一项所述的方法,其中,所述历史搜索信息为针对媒体文件的搜索信息,相应地,所述预设主题包括媒体文件的名称、媒体文件的创作者、媒体文件的歌词、媒体文件所属的专辑、媒体文件的风格。The method according to any one of claims 8-12, wherein the historical search information is search information for media files, and correspondingly, the preset theme includes the name of the media file, the creator of the media file, The lyrics of the media file, the album to which the media file belongs, and the style of the media file.
  14. 一种搜索处理装置,所述装置包括:A search processing device, the device comprising:
    接收模块,用于接收目标搜索信息;a receiving module for receiving target search information;
    目标相关度确定模块,用于根据所述目标搜索信息确定目标搜索结果,并通过相关度确定模型确定所述目标搜索结果与所述目标搜索信息之间的目标相关度;a target relevance determination module, configured to determine a target search result according to the target search information, and determine the target relevance between the target search result and the target search information through a relevance determination model;
    其中,所述相关度确定模型是通过如下方式训练得到的:Wherein, the correlation determination model is obtained by training in the following manner:
    根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,其中,具有相同目标文本摘要信息的所述历史搜索结果与所述历史搜索信息之间的相关度被设定成相同,所述历史搜索结果是根据用户输入的所述历史搜索信息进行搜索得到的;According to the historical operation behavior information performed by the user on multiple historical search results, the correlation between each historical search result and the historical search information is respectively determined, wherein the historical search results with the same target text summary information are the same as the historical search results. The correlation between the information is set to be the same, and the historical search result is obtained by searching according to the historical search information input by the user;
    将所述历史搜索信息和所述历史搜索结果作为模型的输入,所述历史搜索结果与所述历史搜索信息之间的相关度作为模型的目标输出,对所述模型进行训练,以得到所述相关度确定模型。The historical search information and the historical search results are used as the input of the model, and the correlation between the historical search results and the historical search information is used as the target output of the model, and the model is trained to obtain the Correlation determines the model.
  15. 根据权利要求14所述的装置,还包括:The apparatus of claim 14, further comprising:
    展示顺序确定模块,用于根据所述目标搜索结果与所述目标搜索信息之间的所述目标相关度,确定所述目标搜索结果的展示顺序。The display order determination module is configured to determine the display order of the target search results according to the target relevance between the target search results and the target search information.
  16. 一种相关度确定模型训练装置,所述装置包括:A correlation determination model training device, the device comprising:
    相关度确定模块,用于根据用户对多个历史搜索结果实施的历史操作行为信息,分别确定各个历史搜索结果与历史搜索信息之间的相关度,其中,具有相同目标文本摘要信息的所述历史搜索结果与所述历史搜索信息之间的相关度被设定成相同,所述历史搜索结果 是根据用户输入的所述历史搜索信息进行搜索得到的;The correlation determination module is used to determine the correlation between each historical search result and the historical search information according to the historical operation behavior information implemented by the user on a plurality of historical search results, wherein the historical information with the same target text summary information The correlation between the search result and the historical search information is set to be the same, and the historical search result is obtained by searching according to the historical search information input by the user;
    训练模块,用于将所述历史搜索信息和所述历史搜索结果作为模型的输入,所述历史搜索结果与所述历史搜索信息之间的相关度作为模型的目标输出,对所述模型进行训练,以得到所述相关度确定模型。A training module is used to use the historical search information and the historical search results as the input of the model, the correlation between the historical search results and the historical search information as the target output of the model, and train the model , to obtain the correlation determination model.
  17. 根据权利要求16所述的装置,其中,所述相关度确定模块包括:The apparatus of claim 16, wherein the correlation determination module comprises:
    目标文本摘要信息确定子模块,用于针对每一所述历史搜索结果,确定所述历史搜索结果的所述目标文本摘要信息;a target text summary information determination submodule, configured to determine the target text summary information of the historical search result for each of the historical search results;
    聚合子模块,用于将所述目标文本摘要信息相同的所述历史搜索结果聚合为搜索结果组;an aggregation submodule for aggregating the historical search results with the same target text summary information into a search result group;
    第一相关度确定子模块,用于根据用户对所述搜索结果组中所包括的所述历史搜索结果实施的所述历史操作行为信息,确定所述搜索结果组中所包括的所述历史搜索结果与所述历史搜索信息之间的相关度。A first relevance determination submodule, configured to determine the historical search included in the search result group according to the historical operation behavior information performed by the user on the historical search results included in the search result group The correlation between the result and the historical search information.
  18. 根据权利要求17所述的装置,其中,所述第一相关度确定子模块包括:The apparatus according to claim 17, wherein the first correlation determination sub-module comprises:
    行为信息确定子模块,用于针对用户根据所述历史搜索信息进行的多次历史搜索行为中的每次历史搜索行为,根据在该次历史搜索行为中、用户对所述搜索结果组中所包括的所述历史搜索结果实施的所述历史操作行为信息,确定在该次历史搜索行为中用户对所述搜索结果组实施的目标历史操作行为信息;The behavior information determination sub-module is used for each historical search behavior in the multiple historical search behaviors performed by the user according to the historical search information. The historical operation behavior information implemented by the historical search results, determine the target historical operation behavior information implemented by the user on the search result group in this historical search behavior;
    行为特征信息确定子模块,用于根据在多次历史搜索行为中用户分别对所述搜索结果组实施的所述目标历史操作行为信息,确定所述用户对所述搜索结果组实施的历史操作行为特征信息;The behavior feature information determination submodule is used to determine the historical operation behaviors performed by the user on the search result group according to the target historical operation behavior information performed by the user on the search result group in multiple historical search behaviors. characteristic information;
    第二相关度确定子模块,用于根据所述历史操作行为特征信息,确定所述搜索结果组中所包括的所述历史搜索结果与所述历史搜索信息之间的所述相关度。The second correlation degree determination submodule is configured to determine the degree of correlation between the historical search results included in the search result group and the historical search information according to the historical operation behavior feature information.
  19. 根据权利要求17所述的装置,其中,所述目标文本摘要信息确定子模块包括:The apparatus according to claim 17, wherein the target text summary information determination submodule comprises:
    文本信息获取子模块,用于获取所述历史搜索结果中属于预设主题的文本信息,其中,所述文本信息包括所述预设主题下的全部主题内容或部分主题内容;a text information acquisition sub-module, configured to acquire text information belonging to a preset theme in the historical search results, wherein the text information includes all or part of the theme content under the preset theme;
    候选文本摘要信息确定子模块,用于确定候选文本摘要信息,其中,所述候选文本摘要信息包括属于每一所述预设主题的所述文本信息、以及所述文本信息的文本组合信息;a candidate text summary information determination submodule, configured to determine candidate text summary information, wherein the candidate text summary information includes the text information belonging to each of the preset topics and text combination information of the text information;
    匹配度确定子模块,用于分别确定每一所述候选文本摘要信息与所述历史搜索信息之间的匹配度;a matching degree determination sub-module, used for respectively determining the matching degree between each of the candidate text abstract information and the historical search information;
    摘要信息确定子模块,用于将与所述历史搜索信息之间的匹配度最高的所述候选文本摘要信息,确定为所述目标文本摘要信息。The summary information determination submodule is configured to determine the candidate text summary information with the highest matching degree with the historical search information as the target text summary information.
  20. 根据权利要求19所述的装置,其中,在所述文本信息包括所述预设主题下的部分主题内容的情况下,属于所述预设主题的文本信息为多个。The apparatus according to claim 19, wherein in the case that the text information includes part of the subject content under the preset subject, there are multiple text information belonging to the preset subject.
  21. 一种计算机可读介质,其上存储有计算机程序,其中,该程序被处理装置执行时实现权利要求1-7中任一项所述方法的步骤。A computer-readable medium having a computer program stored thereon, wherein the program, when executed by a processing device, implements the steps of the method of any one of claims 1-7.
  22. 一种计算机可读介质,其上存储有计算机程序,其中,该程序被处理装置执行时实现权利要求8-13中任一项所述方法的步骤。A computer-readable medium having stored thereon a computer program, wherein the program, when executed by a processing device, implements the steps of the method of any one of claims 8-13.
  23. 一种电子设备,其中,包括:An electronic device comprising:
    存储装置,其上存储有计算机程序;a storage device on which a computer program is stored;
    处理装置,用于执行所述存储装置中的所述计算机程序,以实现权利要求1-7中任一项所述方法的步骤。A processing device, configured to execute the computer program in the storage device, to implement the steps of the method of any one of claims 1-7.
  24. 一种电子设备,其中,包括:An electronic device comprising:
    存储装置,其上存储有计算机程序;a storage device on which a computer program is stored;
    处理装置,用于执行所述存储装置中的所述计算机程序,以实现权利要求8-13中任一项所述方法的步骤。A processing device, configured to execute the computer program in the storage device, to implement the steps of the method of any one of claims 8-13.
  25. 一种计算机程序产品,包括:指令,A computer program product comprising: instructions,
    所述指令在被处理装置执行时实现权利要求1-7中任一项所述的搜索处理方法的步骤或权利要求8-13中任一项所述的相关度确定模型训练方法的步骤。The instructions, when executed by the processing device, implement the steps of the search processing method according to any one of claims 1-7 or the steps of the correlation determination model training method according to any one of claims 8-13.
PCT/CN2021/131113 2020-11-19 2021-11-17 Search processing method and apparatus, model training method and apparatus, and medium and device WO2022105775A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011303665.1 2020-11-19
CN202011303665.1A CN112364235A (en) 2020-11-19 2020-11-19 Search processing method, model training method, device, medium and equipment

Publications (1)

Publication Number Publication Date
WO2022105775A1 true WO2022105775A1 (en) 2022-05-27

Family

ID=74534025

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/131113 WO2022105775A1 (en) 2020-11-19 2021-11-17 Search processing method and apparatus, model training method and apparatus, and medium and device

Country Status (2)

Country Link
CN (1) CN112364235A (en)
WO (1) WO2022105775A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112364235A (en) * 2020-11-19 2021-02-12 北京字节跳动网络技术有限公司 Search processing method, model training method, device, medium and equipment
CN113127686B (en) * 2021-04-22 2024-02-02 北京爱奇艺科技有限公司 Video searching method, device, equipment and storage medium
CN113806483B (en) * 2021-09-17 2023-09-05 北京百度网讯科技有限公司 Data processing method, device, electronic equipment and computer program product

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190179965A1 (en) * 2017-12-13 2019-06-13 Beijing Baidu Netcom Science And Technology Co., Ltd. Method and apparatus for generating information
CN110147494A (en) * 2019-04-24 2019-08-20 北京三快在线科技有限公司 Information search method, device, storage medium and electronic equipment
CN111061954A (en) * 2019-12-19 2020-04-24 腾讯音乐娱乐科技(深圳)有限公司 Search result sorting method and device and storage medium
CN111177551A (en) * 2019-12-27 2020-05-19 百度在线网络技术(北京)有限公司 Method, device, equipment and computer storage medium for determining search result
CN112364235A (en) * 2020-11-19 2021-02-12 北京字节跳动网络技术有限公司 Search processing method, model training method, device, medium and equipment

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10540357B2 (en) * 2016-03-21 2020-01-21 Ebay Inc. Dynamic topic adaptation for machine translation using user session context
CN106339756B (en) * 2016-08-25 2019-04-02 北京百度网讯科技有限公司 Generation method, searching method and the device of training data

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190179965A1 (en) * 2017-12-13 2019-06-13 Beijing Baidu Netcom Science And Technology Co., Ltd. Method and apparatus for generating information
CN110147494A (en) * 2019-04-24 2019-08-20 北京三快在线科技有限公司 Information search method, device, storage medium and electronic equipment
CN111061954A (en) * 2019-12-19 2020-04-24 腾讯音乐娱乐科技(深圳)有限公司 Search result sorting method and device and storage medium
CN111177551A (en) * 2019-12-27 2020-05-19 百度在线网络技术(北京)有限公司 Method, device, equipment and computer storage medium for determining search result
CN112364235A (en) * 2020-11-19 2021-02-12 北京字节跳动网络技术有限公司 Search processing method, model training method, device, medium and equipment

Also Published As

Publication number Publication date
CN112364235A (en) 2021-02-12

Similar Documents

Publication Publication Date Title
WO2022105775A1 (en) Search processing method and apparatus, model training method and apparatus, and medium and device
CN111414498B (en) Multimedia information recommendation method and device and electronic equipment
WO2022105545A1 (en) Speech synthesis method and apparatus, and readable medium and electronic device
CN108604233B (en) Media consumption context for personalized instant query suggestions
WO2023273596A1 (en) Method and apparatus for determining text correlation, readable medium, and electronic device
WO2023279843A1 (en) Content search method, apparatus and device, and storage medium
US20180285448A1 (en) Producing personalized selection of applications for presentation on web-based interface
WO2023065825A1 (en) Information processing method and apparatus, device, and medium
CN112948540A (en) Information query method and device, electronic equipment and computer readable medium
WO2020151548A1 (en) Method and device for sorting followed pages
CN111400625A (en) Page processing method and device, electronic equipment and computer readable storage medium
WO2022156730A1 (en) Text processing method and apparatus, device, and medium
CN114357325A (en) Content search method, device, equipment and medium
US10241988B2 (en) Prioritizing smart tag creation
CN111767259A (en) Content sharing method and device, readable medium and electronic equipment
US10452710B2 (en) Selecting content items based on received term using topic model
WO2022001846A1 (en) Intention recognition method and apparatus, readable medium, and electronic device
CN111339452A (en) Method, terminal, server and system for displaying search result
WO2022222660A1 (en) Object display method and apparatus, electronic device, and computer readable storage medium
WO2023000782A1 (en) Method and apparatus for acquiring video hotspot, readable medium, and electronic device
CN113221572B (en) Information processing method, device, equipment and medium
CN111782895B (en) Retrieval processing method and device, readable medium and electronic equipment
CN113486212A (en) Search recommendation information generation and display method, device, equipment and storage medium
CN111737571A (en) Searching method and device and electronic equipment
CN110598133A (en) Method, apparatus, electronic device, and computer-readable storage medium for determining an order of search items

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21893922

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 070923)

122 Ep: pct application non-entry in european phase

Ref document number: 21893922

Country of ref document: EP

Kind code of ref document: A1