WO2023020506A1 - Search method with diversified and equalized search results, and computer device - Google Patents

Search method with diversified and equalized search results, and computer device Download PDF

Info

Publication number
WO2023020506A1
WO2023020506A1 PCT/CN2022/112863 CN2022112863W WO2023020506A1 WO 2023020506 A1 WO2023020506 A1 WO 2023020506A1 CN 2022112863 W CN2022112863 W CN 2022112863W WO 2023020506 A1 WO2023020506 A1 WO 2023020506A1
Authority
WO
WIPO (PCT)
Prior art keywords
search
preset data
search results
content
data models
Prior art date
Application number
PCT/CN2022/112863
Other languages
French (fr)
Chinese (zh)
Inventor
包伟
Original Assignee
深圳市世强元件网络有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市世强元件网络有限公司 filed Critical 深圳市世强元件网络有限公司
Publication of WO2023020506A1 publication Critical patent/WO2023020506A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9538Presentation of query results
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9532Query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking

Definitions

  • the invention relates to the field of search, and more specifically, to a search method and computer equipment for diversification and equalization of search results.
  • Search technology is a commonly used technology on the Internet, and users find target content by inputting search content.
  • Most of the existing search technologies only consider the correlation between the search content and the target content. For example, the higher the number of occurrences, the higher the correlation, and they are sorted and displayed according to the high or low correlation.
  • This search method does not consider the variety of target content, resulting in Some types of target content are rarely displayed, while some types of target content are displayed too much, and the search results are not diversified and balanced enough.
  • the technical problem to be solved by the present invention is to provide a search method and computer equipment for diversification and equalization of search results in view of the above-mentioned defects of the prior art.
  • the technical solution adopted by the present invention to solve the technical problem is: to construct a search method for diversification and equalization of search results, comprising the following steps:
  • the preset data model includes content title, content abstract, text, keywords and content type.
  • converting various types of original data models into preset data models in the step S1 includes:
  • the calculation of the total weight value of each of the preset data models in the search results in the step S3 includes: separately calculating the search keywords in the The sub-weight value of the content title, content abstract, text, keywords and content type, and the total weight value is obtained from all the sub-weight values.
  • the score The weight value is positively correlated with the number of occurrences of the search keyword.
  • step S3 after the step S3, it also includes:
  • using the search keyword to retrieve all the preset data models in the step S3 includes:
  • step S33 it further includes: making each group generate a preset number of preset data models.
  • the preset quantity corresponding to each group is positively correlated with the total number of the group.
  • the present invention also provides a computer device, including a memory and a processor, and the processor is communicatively connected to the memory.
  • the memory is used to store computer programs; the processor is used to execute the computer programs stored in the memory to realize the search method for diversification and equalization of search results as described above.
  • the present invention uniformly transforms various types of original data models into preset data models, avoiding the impact of search due to the expression of data types, making Search results are more diverse and balanced.
  • FIG. 1 is a flow chart of a search method for diversification and equalization of search results provided by an embodiment of the present invention
  • FIG. 2 is a flow chart of a search method for diversifying and equalizing search results provided by an embodiment of the present invention.
  • the search method for diversification and equalization of search results in this embodiment includes the following steps:
  • industry thesaurus which includes multiple industry professional vocabularies; convert various types of original data models into preset data models.
  • industry professional vocabulary refers to professional terms used in a certain industry. The professional terms are different from everyday expressions and are proper nouns with exclusive meanings in the industry. Setting up an industry thesaurus is beneficial for scientific word segmentation of search content entered by users, thereby improving search professionalism and accuracy.
  • the original data model is used in the search, that is, the original format of the original data is kept and the search is performed directly. Because various original data models vary widely, various original data models are not on the "same starting line", which will lead to certain types of data in the search results. Too many are displayed, some types are displayed too little, and some types are not displayed at all, and the search results are not diversified and balanced enough. For example, the original data models of news, movies, songs, encyclopedias and variety shows all contain "Andy Lau”. If there are too many "Andy Lau" keywords, the search results will basically be news, and there will be few movies, songs, encyclopedias, and variety shows, especially movies and songs.
  • this embodiment converts various types of original data models into preset data models, and after conversion, all original data models have a unified data model, so that all preset The data models have a "same starting line", and all preset data models have a more balanced probability of being searched when they are retrieved, thus making the search results more diversified and balanced.
  • the industry thesaurus and all converted preset data models are stored on the server.
  • S2 Receive the search content input by the user, and extract search keywords from the search content according to the industry thesaurus. Specifically, the user inputs search content in the search box, and the search content is uploaded to the server through the network, and the server divides the search content into words according to the industry professional vocabulary in the industry thesaurus, and extracts the search keywords corresponding to the search content.
  • the search content is "epson S1C17801 mcu "data booklet”, identify the word segmentation results according to the industry thesaurus: “epson” is the brand word, “S1C17801” is the model word, “mcu” is the category word, “data booklet” is the resource word, then the extracted search keywords They are: “epson”, “S1C17801”, “mcu”, “data booklet”.
  • the basic language structure can be used to extract the search keywords, that is, the subject-verb-object complement Language structure to parse the search content to get search keywords.
  • search keywords to retrieve all preset data models in the institute, calculate the total weight value of each preset data model in the search results, and sort the search results according to the total weight values. Specifically, if the search content contains only one search keyword, use the search keyword to retrieve all preset data models, calculate the total weight value of each preset data model in the search results, and sort the search results according to the total weight value . If the search content contains at least two search keywords, first use one search keyword to retrieve all preset data models to obtain the first search result; then use another search keyword to search in the first search structure to obtain the second Search results; and so on, until all search keywords are searched.
  • the search results are sorted according to the total weight value.
  • the search results are sorted Send it to the user terminal for display. It can be understood that the search result delivered by the server to the user terminal is not a preset data model, but an original data model corresponding to the preset data model.
  • various types of original data models are uniformly transformed into preset data models, so as to avoid affecting the search due to the expression form of the data type, and make the search results more diversified and balanced.
  • the preset data model includes content title, content abstract, text, keywords and content type.
  • the converted preset data model has content title, content abstract, body text, keywords and content type.
  • a song file usually only has song title and artist information, but no content summary and text. At this time, song lyrics can be used as content summary and content text to complete the conversion.
  • various types of original data models are uniformly transformed into preset data models, so as to avoid affecting the search due to the expression form of the data type, and make the search results more diversified and balanced.
  • converting various types of original data models into preset data models in step S1 includes: converting various types of original data models into preset data models and setting the preset data models The weight value of each part of the content, wherein the weight value of the keyword is greater than the weight value of the content title, the weight value of the content title is greater than the weight value of the content abstract, and the weight value of the content abstract is greater than the weight value of the text.
  • the calculation of the total weight value of each preset data model in the search results in step S3 includes: separately calculating the sub-weight values of the search keywords in the content title, content abstract, text, keywords and content types, and all sub-weights value to get the total weight value.
  • all sub-weight values are summed directly to obtain the total weight value.
  • the weighted values are positively correlated with the number of occurrences of search keywords, that is, the search keywords appear in a certain part The more times, the greater the score weight it gets in this part.
  • differences between original data models are balanced through weight configuration and unified preset data models, so that search results are more diversified and balanced.
  • the search method for diversification and equalization of search results further includes: S4, adjusting each preset data model according to the distribution of each type of data model corresponding to the preset data model in the search results.
  • the weighting value for some content so that the types are evenly distributed in the search results.
  • the distribution of each type of data model corresponding to the preset data model in the search results refers to whether each type of data model corresponds to the preset data model in the preset ranking number (the search result shows the home page), if each type of data model If the corresponding preset data models appear in the preset ranking numbers, it means that the existing weight value setting is relatively reasonable; if one or several types of data models corresponding to the preset data models do not appear in the search results of the preset ranking numbers In , it means that the existing weight value setting is unreasonable, and the diversification and balance of search results cannot be realized. It is necessary to adjust the weight value of each part of the preset data model to make the distribution of various types in the search results balanced.
  • the distribution of each type of data model corresponding to the preset data model in the search results refers to the proportion of each type of data model corresponding to the preset data model in the number of preset rankings (the search results display the home page). If each type of data If the proportion of the model corresponding to the preset data model in the preset ranking quantity is balanced, it means that the existing weight value setting is relatively reasonable; if one or several types of data models correspond to the preset data model in the preset ranking quantity If the proportion is too low or too high to realize the diversification and balance of search results, it is necessary to adjust the weight value of each part of the preset data model to balance the distribution of various types in the search results.
  • the weight value of each part of the content of the preset data model is adjusted through the feedback of search results, and the setting of the weight value of each part of the content of the preset data model is continuously optimized, so that the search results are more diversified and balanced.
  • using the search keyword to retrieve all preset data models in step S3 includes:
  • S32 Counting the total number of preset data types in each category, and classifying the categories with the same total number into the same group.
  • the same total number means that the total number is within the same preset number range.
  • some kinds of preset data types are more than 10 million, some kinds of preset data types are between 5 million and 10 million, some kinds of preset data types are between 1 million and 5 million, and some kinds of preset It is assumed that the data type is between 500,000 and 1 million, some types of preset data types are between 100,000 and 500,000, and some types of default data types are below 100,000.
  • Type A and Type B are 6.5 million and 8.5 million respectively, then Type A and Type B form a group; the total number of Type C and Type D is 650,000 and 850,000 respectively, then Type C and Type D are One group; the total number of types E and F is 60,000 and 80,000 respectively, then types E and F form a group.
  • groups are grouped according to the quantity level, and searches are performed in each group separately, so as to ensure that each group has a preset data model output, so that the search results are more diversified and balanced.
  • the computer device in this embodiment includes a memory and a processor, and the processor is communicatively connected to the memory.
  • the memory is used to store computer programs; the processor is used to execute the computer programs stored in the memory to implement the search method for diversification and equalization of search results as in the above embodiments.
  • the computer device is a server.
  • the computer device in this embodiment uniformly transforms various types of original data models into preset data models, so as to avoid affecting the search due to the expression of the data type, and make the search results more diversified and balanced.
  • each embodiment in this specification is described in a progressive manner, each embodiment focuses on the difference from other embodiments, and the same and similar parts of each embodiment can be referred to each other.
  • the description is relatively simple, and for the related information, please refer to the description of the method part.
  • RAM random access memory
  • ROM read-only memory
  • EEPROM electrically programmable ROM
  • EEPROM electrically erasable programmable ROM
  • registers hard disk, removable disk, CD-ROM, or any other Any other known storage medium.

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Mathematical Physics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention relates to a search method with diversified and equalized search results, and a computer device. The method comprises the following steps: S1, establishing an industry lexicon, wherein the industry lexicon comprises a plurality of professional industry vocabularies; and converting various types of original data models into preset data models; S2, receiving search content input by a user, and extracting, according to the industry lexicon, a search keyword from the search content; and S3, retrieving all the preset data models by using the search keyword, calculating the total weight value of each preset data model in search results, and sorting the search results according to the total weight values. By means of the present invention, various types of original data models are uniformly converted into preset data models, so as to avoid affecting a search due to expression forms of data types, such that search results are more diversified and equalized.

Description

一种搜索结果多样化均衡化搜索方法及计算机设备A search method and computer equipment for diversification and equalization of search results 技术领域technical field
本发明涉及搜索领域,更具体地说,涉及一种搜索结果多样化均衡化搜索方法及计算机设备。The invention relates to the field of search, and more specifically, to a search method and computer equipment for diversification and equalization of search results.
背景技术Background technique
搜索技术是互联网常用技术,用户通过输入搜索内容查找目标内容。现有搜索技术中多数仅考虑搜索内容和目标内容的关联性,例如出现次数越高则关联性越高,按照关联性高低进行排序显示,这种搜索方式没有考虑目标内容的种类多样性,导致一些种类的目标内容很少被展示,而一些种类的目标内容又过多被展示,搜索结果不够多样化均衡化。Search technology is a commonly used technology on the Internet, and users find target content by inputting search content. Most of the existing search technologies only consider the correlation between the search content and the target content. For example, the higher the number of occurrences, the higher the correlation, and they are sorted and displayed according to the high or low correlation. This search method does not consider the variety of target content, resulting in Some types of target content are rarely displayed, while some types of target content are displayed too much, and the search results are not diversified and balanced enough.
技术问题technical problem
本发明要解决的技术问题在于,针对现有技术的上述缺陷,提供一种搜索结果多样化均衡化搜索方法及计算机设备。The technical problem to be solved by the present invention is to provide a search method and computer equipment for diversification and equalization of search results in view of the above-mentioned defects of the prior art.
技术解决方案technical solution
本发明解决其技术问题所采用的技术方案是:构造一种搜索结果多样化均衡化搜索方法,包括下述步骤:The technical solution adopted by the present invention to solve the technical problem is: to construct a search method for diversification and equalization of search results, comprising the following steps:
S1、建立行业词库,所述行业词库包括多个行业专业词汇;将各类型原始数据模型转化为预设数据模型;S1. Establish an industry thesaurus, which includes a plurality of industry professional vocabularies; convert various types of original data models into preset data models;
S2、接收用户输入的搜索内容,按照所述行业词库从所述搜索内容中提取搜索关键词;S2. Receive the search content input by the user, and extract search keywords from the search content according to the industry thesaurus;
S3、使用所述搜索关键词检索所所有所述预设数据模型,计算搜索结果中每个所述预设数据模型的总权重值,根据所述总权重值对搜索结果进行排序。S3. Retrieve all the preset data models by using the search keywords, calculate the total weight value of each preset data model in the search results, and sort the search results according to the total weight values.
进一步,在本发明所述的搜索结果多样化均衡化搜索方法中,所述预设数据模型包括内容标题、内容摘要、正文、关键词和内容类型。Further, in the search method for diversifying and equalizing search results according to the present invention, the preset data model includes content title, content abstract, text, keywords and content type.
进一步,在本发明所述的搜索结果多样化均衡化搜索方法中,所述步骤S1中将各类型原始数据模型转化为预设数据模型包括:Further, in the method for diversifying and equalizing search results according to the present invention, converting various types of original data models into preset data models in the step S1 includes:
将各类型原始数据模型转化为预设数据模型并设置所述预设数据模型各部分内容的权重值,其中所述关键词的权重值大于所述内容标题的权重值,所述内容标题的权重值大于所述内容摘要的权重值,所述内容摘要的权重值大于所述正文的权重值。Convert various types of original data models into preset data models and set the weight values of each part of the preset data models, wherein the weight value of the keywords is greater than the weight value of the content title, and the weight value of the content title The value is greater than the weight value of the content abstract, and the weight value of the content abstract is greater than the weight value of the text.
进一步,在本发明所述的搜索结果多样化均衡化搜索方法中,所述步骤S3中计算搜索结果中每个所述预设数据模型的总权重值包括:分别计算所述搜索关键词在所述内容标题、内容摘要、正文、关键词和内容类型的分权重值,由所有所述分权重值得到总权重值。Further, in the method for diversifying and equalizing the search results of the present invention, the calculation of the total weight value of each of the preset data models in the search results in the step S3 includes: separately calculating the search keywords in the The sub-weight value of the content title, content abstract, text, keywords and content type, and the total weight value is obtained from all the sub-weight values.
进一步,在本发明所述的搜索结果多样化均衡化搜索方法中,在计算所述搜索关键词在所述内容标题、内容摘要、正文、关键词和内容类型的分权重值时,所述分权重值与所述搜索关键词出现次数正相关。Further, in the method for diversifying and equalizing the search results of the present invention, when calculating the weighted values of the search keywords in the content title, content abstract, text, keywords and content types, the score The weight value is positively correlated with the number of occurrences of the search keyword.
进一步,在本发明所述的搜索结果多样化均衡化搜索方法中,在所述步骤S3之后还包括:Further, in the method for diversifying and equalizing the search results of the present invention, after the step S3, it also includes:
S4、根据所述搜索结果中各类型数据模型对应预设数据模型的分布情况调整所述预设数据模型各部分内容的权重值,以使所述搜索结果中各类型分布均衡。S4. According to the distribution of each type of data model corresponding to the preset data model in the search result, adjust the weight value of each part of the preset data model, so that the distribution of each type in the search result is balanced.
进一步,在本发明所述的搜索结果多样化均衡化搜索方法中,所述步骤S3中使用所述搜索关键词检索所所有所述预设数据模型包括:Further, in the search method for diversifying and equalizing search results according to the present invention, using the search keyword to retrieve all the preset data models in the step S3 includes:
S31、按照分类标准将所有所述预设数据模型进行分类;S31. Classify all the preset data models according to the classification standard;
S32、统计每种类别中所述预设数据类型的总数,将总数相当的类别划分在同一个组;S32. Counting the total number of the preset data types in each category, and dividing the categories with the same total number into the same group;
S33、使用所述搜索关键词检索每个组中所有所述预设数据模型。S33. Retrieve all the preset data models in each group by using the search keyword.
进一步,在本发明所述的搜索结果多样化均衡化搜索方法中,在所述步骤S33之后还包括:使每个组产生预设数量的预设数据模型。Further, in the search method for diversification and equalization of search results according to the present invention, after the step S33, it further includes: making each group generate a preset number of preset data models.
进一步,在本发明所述的搜索结果多样化均衡化搜索方法中,每个组对应的预设数量与该组的总数正相关。Further, in the search method for diversification and equalization of search results according to the present invention, the preset quantity corresponding to each group is positively correlated with the total number of the group.
另外,本发明还提供一种计算机设备,包括存储器和处理器,所述处理器通信连接所述存储器。所述存储器用于存储计算机程序;所述处理器用于执行所述存储器存储的计算机程序以实现如上述的搜索结果多样化均衡化搜索方法。In addition, the present invention also provides a computer device, including a memory and a processor, and the processor is communicatively connected to the memory. The memory is used to store computer programs; the processor is used to execute the computer programs stored in the memory to realize the search method for diversification and equalization of search results as described above.
有益效果Beneficial effect
实施本发明的一种搜索结果多样化均衡化搜索方法及计算机设备,具有以下有益效果:本发明将各类型原始数据模型统一转化为预设数据模型,避免因数据类型的表现形式影响搜索,使得搜索结果更加多样化均衡化。Implementing a search method and computer equipment for diversification and balance of search results of the present invention has the following beneficial effects: the present invention uniformly transforms various types of original data models into preset data models, avoiding the impact of search due to the expression of data types, making Search results are more diverse and balanced.
附图说明Description of drawings
下面将结合附图及实施例对本发明作进一步说明,附图中:The present invention will be further described below in conjunction with accompanying drawing and embodiment, in the accompanying drawing:
图1是本发明实施例提供的一种搜索结果多样化均衡化搜索方法的流程图;FIG. 1 is a flow chart of a search method for diversification and equalization of search results provided by an embodiment of the present invention;
图2是本发明实施例提供的一种搜索结果多样化均衡化搜索方法的流程图。FIG. 2 is a flow chart of a search method for diversifying and equalizing search results provided by an embodiment of the present invention.
本发明的最佳实施方式BEST MODE FOR CARRYING OUT THE INVENTION
为了对本发明的技术特征、目的和效果有更加清楚的理解,现对照附图详细说明本发明的具体实施方式。In order to have a clearer understanding of the technical features, purposes and effects of the present invention, the specific implementation manners of the present invention will now be described in detail with reference to the accompanying drawings.
在一优选实施例中,参考图1,本实施例的搜索结果多样化均衡化搜索方法包括下述步骤:In a preferred embodiment, with reference to FIG. 1, the search method for diversification and equalization of search results in this embodiment includes the following steps:
S1、建立行业词库,行业词库包括多个行业专业词汇;将各类型原始数据模型转化为预设数据模型。具体的,行业专业词汇是指某一行业中使用的专业术语,该专业术语不同于日常生活用语,是专有名词,在行业中有专属含义。设置行业词库有利于对用户输入的搜索内容进行科学分词,从而提高搜索专业性和准确性。行业词库可为一个或多个,在行业词库为多个时按照内容类型对行业词库进行分类,形成多个不同类别的行业词库,也即每个行业词库包含的行业专业词汇为同一类词汇。S1. Establish an industry thesaurus, which includes multiple industry professional vocabularies; convert various types of original data models into preset data models. Specifically, industry professional vocabulary refers to professional terms used in a certain industry. The professional terms are different from everyday expressions and are proper nouns with exclusive meanings in the industry. Setting up an industry thesaurus is beneficial for scientific word segmentation of search content entered by users, thereby improving search professionalism and accuracy. There can be one or more industry thesauruses. When there are multiple industry thesauruses, the industry thesauruses are classified according to the content type to form multiple industry thesauruses of different categories, that is, the industry professional vocabulary contained in each industry thesaurus for the same vocabulary.
现有技术中在搜索时使用原始数据模型,即保持原始资料原有格式直接进行搜索,因各类原始数据模型千差万别,各类原始数据模型不在“同一起跑线”,会导致搜索结果中某些类型显示过多,有些类型显示过少,甚至有些类型始终得不到显示,搜索结果不够多样化均衡化。例如,新闻、影视、歌曲、百科和综艺等原始数据模型均包含“刘德华”,因新闻、影视、歌曲、百科和综艺在数据类型上有较大差异,在利用词频搜索时,新闻中包含较多“刘德华”关键词,会导致搜索结果中排序考前的基本都是新闻,而很少出现影视、歌曲、百科和综艺等,特别是影视和歌曲,因数据类型的限制,很少能直接出现在首页搜索结果中,导致用户看到的搜索结果较为单一,不够多样化均衡化。为解决因数据模型差异带来的搜索结果不够多样化均衡化问题,本实施例将各类型原始数据模型转化为预设数据模型,转化后所有原始数据模型具有统一数据模型,从而使所有预设数据模型具有“同一起跑线”,在被检索时所有预设数据模型被搜索到的概率更加均衡,从而使得搜索结果更加多样化均衡化。作为选择,行业词库和转化后的所有预设数据模型存储在服务器上。In the existing technology, the original data model is used in the search, that is, the original format of the original data is kept and the search is performed directly. Because various original data models vary widely, various original data models are not on the "same starting line", which will lead to certain types of data in the search results. Too many are displayed, some types are displayed too little, and some types are not displayed at all, and the search results are not diversified and balanced enough. For example, the original data models of news, movies, songs, encyclopedias and variety shows all contain "Andy Lau". If there are too many "Andy Lau" keywords, the search results will basically be news, and there will be few movies, songs, encyclopedias, and variety shows, especially movies and songs. Due to the limitation of data types, it is rare to directly Appearing in the search results on the home page, the search results that users see are relatively single, not diversified and balanced enough. In order to solve the problem of insufficient diversity and balance of search results caused by differences in data models, this embodiment converts various types of original data models into preset data models, and after conversion, all original data models have a unified data model, so that all preset The data models have a "same starting line", and all preset data models have a more balanced probability of being searched when they are retrieved, thus making the search results more diversified and balanced. Alternatively, the industry thesaurus and all converted preset data models are stored on the server.
S2、接收用户输入的搜索内容,按照行业词库从搜索内容中提取搜索关键词。具体的,用户在搜索框中输入搜索内容,搜索内容通过网络上传至服务器,服务器按照行业词库中的行业专业词汇对搜索内容进行分词,提取搜索内容对应的搜索关键词。例如,搜索内容为“epson S1C17801 mcu 数据手册”,按行业词库进行识别得到分词结果:“epson”为厂牌词,“S1C17801”为型号词,“mcu”为品类词,“数据手册”为资源词,则提取的搜索关键词为:“epson”、“S1C17801”、“mcu”、“数据手册”。作为选择,若行业词库未涵盖搜索内容,则可使用基本语言结构提取搜索关键词,即使用主谓宾定状补语言结构来解析搜索内容得到搜索关键词。S2. Receive the search content input by the user, and extract search keywords from the search content according to the industry thesaurus. Specifically, the user inputs search content in the search box, and the search content is uploaded to the server through the network, and the server divides the search content into words according to the industry professional vocabulary in the industry thesaurus, and extracts the search keywords corresponding to the search content. For example, the search content is "epson S1C17801 mcu "data booklet", identify the word segmentation results according to the industry thesaurus: "epson" is the brand word, "S1C17801" is the model word, "mcu" is the category word, "data booklet" is the resource word, then the extracted search keywords They are: "epson", "S1C17801", "mcu", "data booklet". Alternatively, if the industry thesaurus does not cover the search content, the basic language structure can be used to extract the search keywords, that is, the subject-verb-object complement Language structure to parse the search content to get search keywords.
S3、使用搜索关键词检索所所有预设数据模型,计算搜索结果中每个预设数据模型的总权重值,根据总权重值对搜索结果进行排序。具体的,若搜索内容仅包含一个搜索关键词,则使用搜索关键词检索所所有预设数据模型,计算搜索结果中每个预设数据模型的总权重值,根据总权重值对搜索结果进行排序。若搜索内容包含至少两个搜索关键词,则首先使用一个搜索关键词检索所所有预设数据模型,得到第一搜索结果;然后使用另一个搜索关键词在第一搜索结构中搜索,得到第二搜索结果;以此类推,直至所有搜索关键词完成搜索,搜索完成后计算搜索结果中每个预设数据模型的总权重值,根据总权重值对搜索结果进行排序,服务器完成排序后将搜索结果下发至用户终端显示。可以理解的,服务器下发至用户终端的搜索结果并非预设数据模型,而是预设数据模型对应的原始数据模型。S3. Use the search keywords to retrieve all preset data models in the institute, calculate the total weight value of each preset data model in the search results, and sort the search results according to the total weight values. Specifically, if the search content contains only one search keyword, use the search keyword to retrieve all preset data models, calculate the total weight value of each preset data model in the search results, and sort the search results according to the total weight value . If the search content contains at least two search keywords, first use one search keyword to retrieve all preset data models to obtain the first search result; then use another search keyword to search in the first search structure to obtain the second Search results; and so on, until all search keywords are searched. After the search is completed, the total weight value of each preset data model in the search results is calculated, and the search results are sorted according to the total weight value. After the server completes the sorting, the search results are sorted Send it to the user terminal for display. It can be understood that the search result delivered by the server to the user terminal is not a preset data model, but an original data model corresponding to the preset data model.
本实施例将各类型原始数据模型统一转化为预设数据模型,避免因数据类型的表现形式影响搜索,使得搜索结果更加多样化均衡化。In this embodiment, various types of original data models are uniformly transformed into preset data models, so as to avoid affecting the search due to the expression form of the data type, and make the search results more diversified and balanced.
在一些实施例的搜索结果多样化均衡化搜索方法中,预设数据模型包括内容标题、内容摘要、正文、关键词和内容类型,将各类型原始数据模型转化为预设数据模型时,无论原始数据模型是否有内容标题、内容摘要、正文、关键词和内容类型,经转换后的预设数据模型都有内容标题、内容摘要、正文、关键词和内容类型。例如,歌曲文件通常只有歌曲名称和演唱者信息,没有内容摘要和正文,此时可将歌曲歌词作为内容摘要和内容正文,从而完成转化。本实施例将各类型原始数据模型统一转化为预设数据模型,避免因数据类型的表现形式影响搜索,使得搜索结果更加多样化均衡化。In the search method for diversification and balance of search results in some embodiments, the preset data model includes content title, content abstract, text, keywords and content type. When converting various types of original data models into preset data models, regardless of the original Whether the data model has content title, content abstract, body text, keywords and content type, the converted preset data model has content title, content abstract, body text, keywords and content type. For example, a song file usually only has song title and artist information, but no content summary and text. At this time, song lyrics can be used as content summary and content text to complete the conversion. In this embodiment, various types of original data models are uniformly transformed into preset data models, so as to avoid affecting the search due to the expression form of the data type, and make the search results more diversified and balanced.
在一些实施例的搜索结果多样化均衡化搜索方法中,步骤S1中将各类型原始数据模型转化为预设数据模型包括:将各类型原始数据模型转化为预设数据模型并设置预设数据模型各部分内容的权重值,其中关键词的权重值大于内容标题的权重值,内容标题的权重值大于内容摘要的权重值,内容摘要的权重值大于正文的权重值。对应的,步骤S3中计算搜索结果中每个预设数据模型的总权重值包括:分别计算搜索关键词在内容标题、内容摘要、正文、关键词和内容类型的分权重值,由所有分权重值得到总权重值。作为选择,所有分权重值直接求和得到总权重值。另外,在计算搜索关键词在内容标题、内容摘要、正文、关键词和内容类型的分权重值时,分权重值与搜索关键词出现次数正相关,也就是说,搜索关键词在某部分出现的次数越多,其在该部分得到的分权重值就越大。本实施例通过权重配置和统一预设数据模型来平衡原始数据模型之间的差异,使得搜索结果更加多样化均衡化。In the search method for diversifying and equalizing search results in some embodiments, converting various types of original data models into preset data models in step S1 includes: converting various types of original data models into preset data models and setting the preset data models The weight value of each part of the content, wherein the weight value of the keyword is greater than the weight value of the content title, the weight value of the content title is greater than the weight value of the content abstract, and the weight value of the content abstract is greater than the weight value of the text. Correspondingly, the calculation of the total weight value of each preset data model in the search results in step S3 includes: separately calculating the sub-weight values of the search keywords in the content title, content abstract, text, keywords and content types, and all sub-weights value to get the total weight value. Alternatively, all sub-weight values are summed directly to obtain the total weight value. In addition, when calculating the weighted values of search keywords in content titles, content abstracts, texts, keywords, and content types, the weighted values are positively correlated with the number of occurrences of search keywords, that is, the search keywords appear in a certain part The more times, the greater the score weight it gets in this part. In this embodiment, differences between original data models are balanced through weight configuration and unified preset data models, so that search results are more diversified and balanced.
在一些实施例的搜索结果多样化均衡化搜索方法中,参考图2,在步骤S3之后还包括:S4、根据搜索结果中各类型数据模型对应预设数据模型的分布情况调整预设数据模型各部分内容的权重值,以使搜索结果中各类型分布均衡。其中,搜索结果中各类型数据模型对应预设数据模型的分布情况是指每种类型数据模型对应预设数据模型在预设排名数量(搜索结果显示首页)中是否出现,若每种类型数据模型对应预设数据模型在预设排名数量中均有出现,则说明现有权重值设置相对合理;若某一种或几种类型数据模型对应预设数据模型未出现在预设排名数量的搜索结果中,说明现有权重值设置不合理,不能实现搜索结果多样化均衡化,则需要调整预设数据模型各部分内容的权重值,以使搜索结果中各类型分布均衡。In some embodiments, the search method for diversification and equalization of search results, referring to FIG. 2 , after step S3, further includes: S4, adjusting each preset data model according to the distribution of each type of data model corresponding to the preset data model in the search results. The weighting value for some content so that the types are evenly distributed in the search results. Among them, the distribution of each type of data model corresponding to the preset data model in the search results refers to whether each type of data model corresponds to the preset data model in the preset ranking number (the search result shows the home page), if each type of data model If the corresponding preset data models appear in the preset ranking numbers, it means that the existing weight value setting is relatively reasonable; if one or several types of data models corresponding to the preset data models do not appear in the search results of the preset ranking numbers In , it means that the existing weight value setting is unreasonable, and the diversification and balance of search results cannot be realized. It is necessary to adjust the weight value of each part of the preset data model to make the distribution of various types in the search results balanced.
进一步,搜索结果中各类型数据模型对应预设数据模型的分布情况是指每种类型数据模型对应预设数据模型在预设排名数量(搜索结果显示首页)中所占比例,若每种类型数据模型对应预设数据模型在预设排名数量中所占比例均衡,则说明现有权重值设置相对合理;若某一种或几种类型数据模型对应预设数据模型在预设排名数量中所占比例过低或过高,不能实现搜索结果多样化均衡化,则需要调整预设数据模型各部分内容的权重值,以使搜索结果中各类型分布均衡。Furthermore, the distribution of each type of data model corresponding to the preset data model in the search results refers to the proportion of each type of data model corresponding to the preset data model in the number of preset rankings (the search results display the home page). If each type of data If the proportion of the model corresponding to the preset data model in the preset ranking quantity is balanced, it means that the existing weight value setting is relatively reasonable; if one or several types of data models correspond to the preset data model in the preset ranking quantity If the proportion is too low or too high to realize the diversification and balance of search results, it is necessary to adjust the weight value of each part of the preset data model to balance the distribution of various types in the search results.
本实施例通过搜索结果反馈对预设数据模型各部分内容的权重值进行调整,不断优化预设数据模型各部分内容的权重值设置,使得搜索结果更加多样化均衡化。In this embodiment, the weight value of each part of the content of the preset data model is adjusted through the feedback of search results, and the setting of the weight value of each part of the content of the preset data model is continuously optimized, so that the search results are more diversified and balanced.
在一些实施例的搜索结果多样化均衡化搜索方法中,步骤S3中使用搜索关键词检索所所有预设数据模型包括:In the search method for diversification and equalization of search results in some embodiments, using the search keyword to retrieve all preset data models in step S3 includes:
S31、按照分类标准将所有预设数据模型进行分类。分类标准可根据用户需要灵活选择,例如生产厂家类、处理器类、资源类等,又例如新闻类、影视类、歌曲类、百科类和综艺类等。S31. Classify all preset data models according to the classification standard. Classification standards can be flexibly selected according to user needs, such as manufacturers, processors, resources, etc., and news, film and television, songs, encyclopedias, and variety shows, etc.
S32、统计每种类别中预设数据类型的总数,将总数相当的类别划分在同一个组,总数相当是指总数在同一个预设数量范围内。例如,有些种类的预设数据类型有1000万以上,有些种类的预设数据类型在500万至1000万之间,有些种类的预设数据类型在100万至500万之间,有些种类的预设数据类型在50万至100万之间,有些种类的预设数据类型在10万至50万之间,有些种类的预设数据类型在10万以下等。对应的,种类A和种类B的总数分别是650万和850万,则种类A和种类B为一组;种类C和种类D的总数分别是65万和85万,则种类C和种类D为一组;种类E和种类F的总数分别是6万和8万,则种类E和种类F为一组。S32. Counting the total number of preset data types in each category, and classifying the categories with the same total number into the same group. The same total number means that the total number is within the same preset number range. For example, some kinds of preset data types are more than 10 million, some kinds of preset data types are between 5 million and 10 million, some kinds of preset data types are between 1 million and 5 million, and some kinds of preset It is assumed that the data type is between 500,000 and 1 million, some types of preset data types are between 100,000 and 500,000, and some types of default data types are below 100,000. Correspondingly, if the total number of Type A and Type B is 6.5 million and 8.5 million respectively, then Type A and Type B form a group; the total number of Type C and Type D is 650,000 and 850,000 respectively, then Type C and Type D are One group; the total number of types E and F is 60,000 and 80,000 respectively, then types E and F form a group.
S33、使用搜索关键词检索每个组中所有预设数据模型。具体的,分别使用搜索关键词检索每个组中所有预设数据模型,得到搜索关键词在本组中的搜索结果。为使搜索结果中各类型分布均衡,需要保证每个组中都有在预设排名数量(搜索结果显示首页)中,则要求每个组产生预设数量的预设数据模型,且每个组对应的预设数量与该组的总数正相关。也就是说,该组的总数越多,其在预设排名数量(搜索结果显示首页)中所占的数量越多,从而既能保证每种类型均有显示,还能保证总数越多显示的预设数据模型越多,使得搜索结果更加多样化均衡化。S33. Retrieve all preset data models in each group by using a search keyword. Specifically, all the preset data models in each group are retrieved by using the search keywords respectively, and the search results of the search keywords in this group are obtained. In order to balance the distribution of various types in the search results, it is necessary to ensure that each group has a preset number of rankings (the search results display the home page), and each group is required to generate a preset number of preset data models, and each group The corresponding preset quantity is positively related to the total number of the group. That is to say, the more the total number of the group, the more it occupies in the preset number of rankings (the search results display the home page), so that it can not only ensure that each type is displayed, but also ensure that the total number of displayed more The more preset data models, the more diverse and balanced the search results.
本实施例按照数量级别进行分组,分别在每个组中进行检索,确保每个组都有预设数据模型输出,使得搜索结果更加多样化均衡化。In this embodiment, groups are grouped according to the quantity level, and searches are performed in each group separately, so as to ensure that each group has a preset data model output, so that the search results are more diversified and balanced.
在一优选实施例中,本实施例的计算机设备包括存储器和处理器,处理器通信连接存储器。存储器用于存储计算机程序;处理器用于执行存储器存储的计算机程序以实现如上述实施例的搜索结果多样化均衡化搜索方法。作为选择,计算机设备为服务器。本实施例的计算机设备将各类型原始数据模型统一转化为预设数据模型,避免因数据类型的表现形式影响搜索,使得搜索结果更加多样化均衡化。In a preferred embodiment, the computer device in this embodiment includes a memory and a processor, and the processor is communicatively connected to the memory. The memory is used to store computer programs; the processor is used to execute the computer programs stored in the memory to implement the search method for diversification and equalization of search results as in the above embodiments. Alternatively, the computer device is a server. The computer device in this embodiment uniformly transforms various types of original data models into preset data models, so as to avoid affecting the search due to the expression of the data type, and make the search results more diversified and balanced.
本说明书中各个实施例采用递进的方式描述,每个实施例重点说明的都是与其他实施例的不同之处,各个实施例之间相同相似部分互相参见即可。对于实施例公开的装置而言,由于其与实施例公开的方法相对应,所以描述的比较简单,相关之处参见方法部分说明即可。Each embodiment in this specification is described in a progressive manner, each embodiment focuses on the difference from other embodiments, and the same and similar parts of each embodiment can be referred to each other. As for the device disclosed in the embodiment, since it corresponds to the method disclosed in the embodiment, the description is relatively simple, and for the related information, please refer to the description of the method part.
专业人员还可以进一步意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、计算机软件或者二者的结合来实现,为了清楚地说明硬件和软件的可互换性,在上述说明中已经按照功能一般性地描述了各示例的组成及步骤。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本发明的范围。Professionals can further realize that the units and algorithm steps of the examples described in conjunction with the embodiments disclosed herein can be implemented by electronic hardware, computer software or a combination of the two. In order to clearly illustrate the possible For interchangeability, in the above description, the composition and steps of each example have been generally described according to their functions. Whether these functions are executed by hardware or software depends on the specific application and design constraints of the technical solution. Those skilled in the art may use different methods to implement the described functions for each specific application, but such implementation should not be regarded as exceeding the scope of the present invention.
结合本文中所公开的实施例描述的方法或算法的步骤可以直接用硬件、处理器执行的软件模块,或者二者的结合来实施。软件模块可以置于随机存储器(RAM)、内存、只读存储器(ROM)、电可编程ROM、电可擦除可编程ROM、寄存器、硬盘、可移动磁盘、CD-ROM、或技术领域内所公知的任意其它形式的存储介质中。The steps of the methods or algorithms described in connection with the embodiments disclosed herein may be directly implemented by hardware, software modules executed by a processor, or a combination of both. Software modules can be placed in random access memory (RAM), internal memory, read-only memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, hard disk, removable disk, CD-ROM, or any other Any other known storage medium.
以上实施例只为说明本发明的技术构思及特点,其目的在于让熟悉此项技术的人士能够了解本发明的内容并据此实施,并不能限制本发明的保护范围。凡跟本发明权利要求范围所做的均等变化与修饰,均应属于本发明权利要求的涵盖范围。The above embodiments are only to illustrate the technical conception and characteristics of the present invention. The purpose is to enable those skilled in the art to understand the content of the present invention and implement it accordingly, and cannot limit the protection scope of the present invention. All equivalent changes and modifications made in accordance with the scope of the claims of the present invention shall fall within the scope of the claims of the present invention.

Claims (10)

  1. 一种搜索结果多样化均衡化搜索方法,其特征在于,包括下述步骤:A search method for diversification and equalization of search results, characterized in that it comprises the following steps:
    S1、建立行业词库,所述行业词库包括多个行业专业词汇;将各类型原始数据模型转化为预设数据模型;S1. Establish an industry thesaurus, which includes a plurality of industry professional vocabularies; convert various types of original data models into preset data models;
    S2、接收用户输入的搜索内容,按照所述行业词库从所述搜索内容中提取搜索关键词;S2. Receive the search content input by the user, and extract search keywords from the search content according to the industry thesaurus;
    S3、使用所述搜索关键词检索所所有所述预设数据模型,计算搜索结果中每个所述预设数据模型的总权重值,根据所述总权重值对搜索结果进行排序。S3. Use the search keyword to search all the preset data models, calculate the total weight value of each preset data model in the search results, and sort the search results according to the total weight values.
  2. 根据权利要求1所述的搜索结果多样化均衡化搜索方法,其特征在于,所述预设数据模型包括内容标题、内容摘要、正文、关键词和内容类型。The search method for diversification and equalization of search results according to claim 1, wherein the preset data model includes content title, content summary, text, keywords and content type.
  3. 根据权利要求2所述的搜索结果多样化均衡化搜索方法,其特征在于,所述步骤S1中将各类型原始数据模型转化为预设数据模型包括:The search method for diversifying and equalizing search results according to claim 2, wherein converting various types of original data models into preset data models in the step S1 includes:
    将各类型原始数据模型转化为预设数据模型并设置所述预设数据模型各部分内容的权重值,其中所述关键词的权重值大于所述内容标题的权重值,所述内容标题的权重值大于所述内容摘要的权重值,所述内容摘要的权重值大于所述正文的权重值。Convert various types of original data models into preset data models and set the weight values of each part of the preset data models, wherein the weight value of the keywords is greater than the weight value of the content title, and the weight value of the content title The value is greater than the weight value of the content abstract, and the weight value of the content abstract is greater than the weight value of the text.
  4. 根据权利要求3所述的搜索结果多样化均衡化搜索方法,其特征在于,所述步骤S3中计算搜索结果中每个所述预设数据模型的总权重值包括:分别计算所述搜索关键词在所述内容标题、内容摘要、正文、关键词和内容类型的分权重值,由所有所述分权重值得到总权重值。The search method for diversification and equalization of search results according to claim 3, wherein the calculation of the total weight value of each of the preset data models in the search results in the step S3 includes: calculating the search keywords respectively In the sub-weight values of the content title, content abstract, text, keywords and content type, a total weight value is obtained from all the sub-weight values.
  5. 根据权利要求4所述的搜索结果多样化均衡化搜索方法,其特征在于,在计算所述搜索关键词在所述内容标题、内容摘要、正文、关键词和内容类型的分权重值时,所述分权重值与所述搜索关键词出现次数正相关。The search method for diversifying and equalizing search results according to claim 4, wherein when calculating the weighted values of the search keywords in the content title, content abstract, text, keywords and content types, the The score weight value is positively correlated with the number of occurrences of the search keyword.
  6. 根据权利要求3所述的搜索结果多样化均衡化搜索方法,其特征在于,在所述步骤S3之后还包括:The search method for diversification and equalization of search results according to claim 3, further comprising:
    S4、根据所述搜索结果中各类型数据模型对应预设数据模型的分布情况调整所述预设数据模型各部分内容的权重值,以使所述搜索结果中各类型分布均衡。S4. According to the distribution of each type of data model corresponding to the preset data model in the search result, adjust the weight value of each part of the preset data model, so that the distribution of each type in the search result is balanced.
  7. 根据权利要求1所述的搜索结果多样化均衡化搜索方法,其特征在于,所述步骤S3中使用所述搜索关键词检索所所有所述预设数据模型包括:The search method for diversification and equalization of search results according to claim 1, characterized in that, using the search keywords in step S3 to retrieve all the preset data models includes:
    S31、按照分类标准将所有所述预设数据模型进行分类;S31. Classify all the preset data models according to the classification standard;
    S32、统计每种类别中所述预设数据类型的总数,将总数相当的类别划分在同一个组;S32. Counting the total number of the preset data types in each category, and dividing the categories with the same total number into the same group;
    S33、使用所述搜索关键词检索每个组中所有所述预设数据模型。S33. Retrieve all the preset data models in each group by using the search keyword.
  8. 根据权利要求7所述的搜索结果多样化均衡化搜索方法,其特征在于,在所述步骤S33之后还包括:使每个组产生预设数量的预设数据模型。The search method for diversification and equalization of search results according to claim 7, further comprising: making each group generate a preset number of preset data models after the step S33.
  9. 根据权利要求8所述的搜索结果多样化均衡化搜索方法,其特征在于,每个组对应的预设数量与该组的总数正相关。The search method for diversification and equalization of search results according to claim 8, wherein the preset quantity corresponding to each group is positively correlated with the total number of the group.
  10. 一种计算机设备,其特征在于,包括存储器和处理器,所述处理器通信连接所述存储器;A computer device, characterized in that it includes a memory and a processor, and the processor is communicatively connected to the memory;
    所述存储器用于存储计算机程序;The memory is used to store computer programs;
    所述处理器用于执行所述存储器存储的计算机程序以实现如权利要求1至9任一项所述的搜索结果多样化均衡化搜索方法。The processor is configured to execute the computer program stored in the memory to realize the search method for diversification and equalization of search results according to any one of claims 1 to 9.
PCT/CN2022/112863 2021-08-16 2022-08-16 Search method with diversified and equalized search results, and computer device WO2023020506A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110936147.1A CN113779433A (en) 2021-08-16 2021-08-16 Search result diversification and equalization searching method and computer equipment
CN202110936147.1 2021-08-16

Publications (1)

Publication Number Publication Date
WO2023020506A1 true WO2023020506A1 (en) 2023-02-23

Family

ID=78837861

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/112863 WO2023020506A1 (en) 2021-08-16 2022-08-16 Search method with diversified and equalized search results, and computer device

Country Status (2)

Country Link
CN (1) CN113779433A (en)
WO (1) WO2023020506A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113779433A (en) * 2021-08-16 2021-12-10 深圳市世强元件网络有限公司 Search result diversification and equalization searching method and computer equipment
CN116501841B (en) * 2023-06-26 2023-09-08 深圳市唯特视科技有限公司 Fuzzy query method, system and storage medium for data model

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103514299A (en) * 2013-10-18 2014-01-15 北京奇虎科技有限公司 Information searching method and device
CN105653661A (en) * 2015-12-29 2016-06-08 云南电网有限责任公司电力科学研究院 Search result re-ranking method and device
CN105938493A (en) * 2016-04-14 2016-09-14 乐视控股(北京)有限公司 Resource search method and apparatus
US20160328402A1 (en) * 2015-05-06 2016-11-10 App Annie Inc. Keyword Reporting for Mobile Applications
CN113779433A (en) * 2021-08-16 2021-12-10 深圳市世强元件网络有限公司 Search result diversification and equalization searching method and computer equipment

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111177532A (en) * 2019-12-02 2020-05-19 平安资产管理有限责任公司 Vertical search method, device, computer system and readable storage medium
CN111460104B (en) * 2020-04-01 2023-09-22 神思电子技术股份有限公司 Industry self-adaptive intelligent searching method

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103514299A (en) * 2013-10-18 2014-01-15 北京奇虎科技有限公司 Information searching method and device
US20160328402A1 (en) * 2015-05-06 2016-11-10 App Annie Inc. Keyword Reporting for Mobile Applications
CN105653661A (en) * 2015-12-29 2016-06-08 云南电网有限责任公司电力科学研究院 Search result re-ranking method and device
CN105938493A (en) * 2016-04-14 2016-09-14 乐视控股(北京)有限公司 Resource search method and apparatus
CN113779433A (en) * 2021-08-16 2021-12-10 深圳市世强元件网络有限公司 Search result diversification and equalization searching method and computer equipment

Also Published As

Publication number Publication date
CN113779433A (en) 2021-12-10

Similar Documents

Publication Publication Date Title
WO2023020506A1 (en) Search method with diversified and equalized search results, and computer device
CN107368515B (en) Application page recommendation method and system
US7487151B2 (en) Information processing apparatus, information processing method, program for implementing information processing method, information processing system, and method for information processing system
CN112199548A (en) Music audio classification method based on convolution cyclic neural network
CN106407420B (en) Multimedia resource recommendation method and system
CN111125334A (en) Search question-answering system based on pre-training
AU2009257386A1 (en) Searching using patterns of usage
JP5587821B2 (en) Document topic extraction apparatus, method, and program
CN101021852A (en) Video search dispatching system based on content
WO2008145055A1 (en) The method for obtaining restriction word information, optimizing output and the input method system
CN108681564A (en) The determination method, apparatus and computer readable storage medium of keyword and answer
CN100583086C (en) Conversation control system and conversation control method
KR20010113779A (en) Multi-feature combination generation and classification effectiveness evaluation using genetic algorithms
CN110399483A (en) A kind of subject classification method, apparatus, electronic equipment and readable storage medium storing program for executing
US20060026155A1 (en) Information processing apparatus and method, recording medium, and program
CN110166847A (en) Barrage treating method and apparatus
JP2009187522A (en) Content evaluating device
JP5302614B2 (en) Facility related information search database formation method and facility related information search system
CN110287284B (en) Semantic matching method, device and equipment
CN115062135B (en) Patent screening method and electronic equipment
CN111737607A (en) Data processing method, data processing device, electronic equipment and storage medium
JP2003016106A (en) Device for calculating degree of association value
JPH05233719A (en) Between-composite information relevance identifying method
US10937078B1 (en) Dynamic generation of item description content
CN112597295A (en) Abstract extraction method and device, computer equipment and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22857817

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE