CN105955993B - Search result ordering method and device - Google Patents

Search result ordering method and device Download PDF

Info

Publication number
CN105955993B
CN105955993B CN201610245408.4A CN201610245408A CN105955993B CN 105955993 B CN105955993 B CN 105955993B CN 201610245408 A CN201610245408 A CN 201610245408A CN 105955993 B CN105955993 B CN 105955993B
Authority
CN
China
Prior art keywords
word
collocation
search results
words
preset
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201610245408.4A
Other languages
Chinese (zh)
Other versions
CN105955993A (en
Inventor
苏建雷
吴文权
刘占一
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201610245408.4A priority Critical patent/CN105955993B/en
Publication of CN105955993A publication Critical patent/CN105955993A/en
Application granted granted Critical
Publication of CN105955993B publication Critical patent/CN105955993B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Abstract

The invention discloses a search result ordering method and a search result ordering device, wherein the search result ordering method comprises the following steps: receiving an input query sentence, and dividing the query sentence into a plurality of words; acquiring collocation corresponding to the words; obtaining a plurality of search results corresponding to the query statement; and sequencing the plurality of search results based on the BOW model according to the words and the collocation corresponding to the words. According to the search result ordering method and device provided by the embodiment of the invention, the input query sentence is received, the query sentence is divided into a plurality of words, the collocation corresponding to the words is obtained, the plurality of search results corresponding to the query sentence are obtained, and the plurality of search results are ordered based on the BOW model according to the words and the collocation corresponding to the words, so that the ordering of the search results is optimized, the search results more conforming to the intention of the user are preferentially provided, and the use experience of the user is improved.

Description

Search result ordering method and device
Technical Field
The invention relates to the technical field of internet, in particular to a search result ordering method and a search result ordering device.
Background
With the rapid development of science and technology, the internet has gone deep into people's daily life. The user can query the required information through the search engine. How to sort the search results and arrange the information needed by the user in the front of the search results is a subject of current key research.
Currently, the search result sorting work is assisted mainly by semantic similarity between the query statement and the search result before the title. For example: and calculating a word vector of each word of the query statement, and adding the word vectors to obtain the word vector of the query statement. Similarly, a word vector for the title of the search result is calculated. And finally, calculating the semantic similarity between the two word vectors, thereby sequencing the search results.
However, the method does not consider the structural information of the query sentences, which may cause deviation of the final sequencing result, for example, word vectors of two query sentences, namely "wu song died tiger" and "wu song died tiger" are the same, and the accuracy of the sequencing result is reduced.
Disclosure of Invention
The present invention is directed to solving, at least to some extent, one of the technical problems in the related art. Therefore, an object of the present invention is to provide a search result ranking method, which can optimize the ranking of search results, preferentially provide search results that better meet the user's intention, and improve the user experience.
The second purpose of the invention is to provide a search result sorting device.
In order to achieve the above object, an embodiment of a first aspect of the present invention provides a search result ranking method, including: receiving an input query sentence, and dividing the query sentence into a plurality of words; acquiring collocation corresponding to the words; obtaining a plurality of search results corresponding to the query statement; and sequencing the plurality of search results based on the BOW model according to the words and the collocation corresponding to the words.
According to the search result ordering method provided by the embodiment of the invention, the input query sentence is received, the query sentence is divided into a plurality of words, the collocation corresponding to the words is obtained, the plurality of search results corresponding to the query sentence are obtained, and the plurality of search results are ordered based on the BOW model according to the words and the collocation corresponding to the words, so that the ordering of the search results is optimized, the search results which are more in line with the intention of the user are preferentially provided, and the use experience of the user is improved.
An embodiment of a second aspect of the present invention provides a search result sorting apparatus, including: the segmentation module is used for receiving an input query sentence and segmenting the query sentence into a plurality of words; the first acquisition module is used for acquiring collocation corresponding to the word; the second acquisition module is used for acquiring a plurality of search results corresponding to the query statement; and the ordering module is used for ordering the plurality of search results based on the BOW model according to the words and the collocation corresponding to the words.
According to the search result sorting device provided by the embodiment of the invention, the input query sentence is received, the query sentence is divided into a plurality of words, the collocation corresponding to the words is obtained, the plurality of search results corresponding to the query sentence are obtained, and the plurality of search results are sorted based on the BOW model according to the words and the collocation corresponding to the words, so that the sorting of the search results is optimized, the search results which are more in line with the intention of the user are preferentially provided, and the use experience of the user is improved.
Drawings
FIG. 1 is a flow diagram of a search result ranking method according to one embodiment of the invention;
FIG. 2 is a schematic diagram illustrating the effect of obtaining candidate matches;
FIG. 3 is a flow diagram of ranking a plurality of search results based on a BOW model, according to one embodiment of the invention;
FIG. 4 is a first schematic structural diagram of a search result sorting apparatus according to an embodiment of the present invention;
fig. 5 is a schematic structural diagram of a search result sorting apparatus according to an embodiment of the present invention.
Detailed Description
Reference will now be made in detail to embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to the same or similar elements or elements having the same or similar function throughout. The embodiments described below with reference to the drawings are illustrative and intended to be illustrative of the invention and are not to be construed as limiting the invention.
The search result ranking method and apparatus of the embodiments of the present invention are described below with reference to the accompanying drawings.
FIG. 1 is a flow diagram of a search result ranking method according to one embodiment of the invention.
As shown in fig. 1, the search result ranking method may include:
and S1, receiving the input query sentence, and dividing the query sentence into a plurality of words.
For example, the query statement is "how is bluetooth car auto-connect? ", the query sentence can be divided into" in-vehicle "," bluetooth "," how "," automatic "," connect ", and"? ".
And S2, acquiring collocation corresponding to the words.
Specifically, the current word and the word within the preset window length with the current word may be obtained first, then the current word and the word within the preset window length with the current word are combined into a plurality of candidate collocations, and finally the collocation corresponding to the current word is screened out based on the preset collocation dictionary.
Continuing with the above example, assume that the current word is "vehicle mounted" and the preset window length is 5. Then the words within the same window length as "onboard" may include "bluetooth", "how", "auto", and "connect", and the corresponding candidate collocations for "onboard" are "onboard bluetooth", "how onboard", "onboard auto", "onboard connect". Similarly, as shown in fig. 2, words that are within the same window length as "bluetooth" may include "how", "automatic", "connect", and "? "then" bluetooth "corresponds to the candidate collocation of" how bluetooth "," bluetooth is automatic "," bluetooth is connected "," bluetooth? ". And by analogy, scanning each word in the query sentence to obtain the corresponding candidate collocation.
After that, the reasonable collocation can be screened out based on the preset collocation dictionary. The reasonable collocation may be a collocation with high co-occurrence frequency or a collocation with high log-likelihood ratio score. For example: "how on-vehicle", "automatic on-vehicle", "on-vehicle connection", "how bluetooth", "bluetooth automatic", "bluetooth? These candidate matches are all matches with low co-occurrence frequency or matches with low log-likelihood ratio score, so the candidate matches can be filtered out. Finally, the collocation of vehicle-mounted Bluetooth and Bluetooth connection with high rationality is obtained.
And S3, acquiring a plurality of search results corresponding to the query statement.
For example, the inquiry sentence is "how is bluetooth car automatically connected? ", a plurality of search results corresponding thereto may be retrieved based on the query statement.
And S4, sorting the plurality of search results based on the BOW model according to the words and the collocation corresponding to the words.
After obtaining the Words and their corresponding collocations, and the plurality Of search results, the plurality Of search results may be sorted based on a BOW (Bag Of Words) model.
Specifically, as shown in fig. 3, the sorting process can be divided into the following steps:
and S41, calculating the sum of the word vector of the word and the word vector of the collocation corresponding to the word, and taking the sum as the word vector of the query sentence.
For example, the query sentence may include a word 1, a word 2, and a word 3, the collocation corresponding to the word 1 is collocation 1 and collocation 2, the collocation corresponding to the word 2 is collocation 3, collocation 4, and collocation 5, and the collocation corresponding to the word 3 is collocation 6, and word vectors of the word 1, the word 2, the word 3, the collocation 1, the collocation 2, the collocation 3, the collocation 4, the collocation 5, and the collocation 6 may be calculated respectively, and the word vectors are added to obtain a word vector of the query sentence finally. When the word vector of the query sentence is calculated, the matched word vector is added into the input features, the features of the syntactic structure are added, the defect that only the word vector of a word is used as the input features can be effectively overcome, and the semantic discrimination is improved.
And S42, calculating a word vector of a plurality of search results.
In this embodiment, the word vector of the search result is actually the word vector of the title corresponding to the search result. The title corresponding to the search result is composed of a plurality of words, so the word vector of the search result can be calculated by adopting the same method as the method for calculating the word vector of the query sentence.
S43, calculating semantic similarity between the word vector of the query sentence and the word vectors of the plurality of search results.
After the word vector of the query sentence and the word vectors of the search results are calculated, semantic similarity of the word vectors of the query sentence and the word vectors of each search result may be calculated, respectively.
And S44, sorting the plurality of search results according to the semantic similarity.
In this embodiment, the search results may be ranked according to the order of semantic similarity from high to low, so that the search results with higher semantic similarity to the query statement are ranked higher.
The specific process of creating the predetermined collocation dictionary will be described in detail below.
Firstly, a training corpus can be obtained, candidate word pair corpora in the training corpus are extracted, then, the word pair corpora can be screened out from the candidate word pair corpora based on co-occurrence frequency or log likelihood ratio scores, and finally, a preset collocation dictionary is established based on the word pair corpora. For example, assuming that the corpus is "auto bluetooth auto connect", each word can be regarded as a node, and whether there is a syntactic dependency between every two nodes is analyzed. And taking two words with grammar dependency relationship as a candidate word pair corpus. The null word nodes such as "ones", "it" and "ones" may exist in the training corpus, and the null word nodes can be filtered out. After the candidate word pair corpus is selected, the candidate word pair corpus may be screened. The screening method can be divided into two methods. The first method is to screen by co-occurrence frequency, i.e. the number of co-occurrences of two words that constitute the corpus of candidate word pairs. And screening the candidate word pair corpora with the co-occurrence frequency larger than the preset times. The second method is to screen through log likelihood ratio score, that is, screen out the top N candidate word pair corpora with the highest log likelihood ratio score. The log-likelihood ratio score is mainly obtained by calculating collocation frequency, word frequency, corpus size and the like. The collocation frequency refers to the frequency of collocation of two words forming the candidate word pair corpus; the word frequency is the times of respectively showing two word pairs forming the candidate word pair corpus in the corpus; the corpus size is the number of words contained. And finally, establishing a preset collocation dictionary according to the word pair corpus obtained after screening.
According to the search result ordering method provided by the embodiment of the invention, the input query sentence is received, the query sentence is divided into a plurality of words, the collocation corresponding to the words is obtained, the plurality of search results corresponding to the query sentence are obtained, and the plurality of search results are ordered based on the BOW model according to the words and the collocation corresponding to the words, so that the ordering of the search results is optimized, the search results which are more in line with the intention of the user are preferentially provided, and the use experience of the user is improved.
In order to achieve the purpose, the invention further provides a search result sorting device.
Fig. 4 is a first schematic structural diagram of a search result sorting apparatus according to an embodiment of the present invention.
As shown in fig. 4, the search result ranking means may include: a segmentation module 110, a first acquisition module 120, a second acquisition module 130, and a ranking module 140. The first obtaining module 120 may include an obtaining unit 121, a combining unit 122, and a screening unit 123.
The segmentation module 110 is configured to receive an input query sentence and segment the query sentence into a plurality of words. For example, the query statement is "how is bluetooth car auto-connect? ", the query sentence can be divided into" in-vehicle "," bluetooth "," how "," automatic "," connect ", and"? ".
The first obtaining module 120 is used for obtaining the collocation corresponding to the word. Specifically, the obtaining unit 121 may first obtain the current word and the word within the preset window length with the current word, then the combining unit 122 combines the current word and the word within the preset window length with the current word into a plurality of candidate collocations, and finally the screening unit 123 screens out the collocation corresponding to the current word based on the preset collocation dictionary. Continuing with the above example, assume that the current word is "vehicle mounted" and the preset window length is 5. Then the words within the same window length as "onboard" may include "bluetooth", "how", "auto", and "connect", and the corresponding candidate collocations for "onboard" are "onboard bluetooth", "how onboard", "onboard auto", "onboard connect". Similarly, as shown in fig. 2, words that are within the same window length as "bluetooth" may include "how", "automatic", "connect", and "? "then" bluetooth "corresponds to the candidate collocation of" how bluetooth "," bluetooth is automatic "," bluetooth is connected "," bluetooth? ". And by analogy, scanning each word in the query sentence to obtain the corresponding candidate collocation. After that, the reasonable collocation can be screened out based on the preset collocation dictionary. The reasonable collocation may be a collocation with high co-occurrence frequency or a collocation with high log-likelihood ratio score. For example: "how on-vehicle", "automatic on-vehicle", "on-vehicle connection", "how bluetooth", "bluetooth automatic", "bluetooth? These candidate matches are all matches with low co-occurrence frequency or matches with low log-likelihood ratio score, so the candidate matches can be filtered out. Finally, the collocation of vehicle-mounted Bluetooth and Bluetooth connection with high rationality is obtained.
The second obtaining module 130 is configured to obtain a plurality of search results corresponding to the query statement. For example, the inquiry sentence is "how is bluetooth car automatically connected? ", the second obtaining module 130 may retrieve a plurality of search results corresponding to the query statement based on the query statement.
The sorting module 140 is configured to sort the plurality of search results based on the BOW model according to the word and the collocation corresponding to the word. Specifically, the sorting module 140 may first calculate a sum of a word vector of a word and a word vector of a collocation corresponding to the word, and use the sum as the word vector of the query sentence. For example, the query sentence may include a word 1, a word 2, and a word 3, the collocation corresponding to the word 1 is collocation 1 and collocation 2, the collocation corresponding to the word 2 is collocation 3, collocation 4, and collocation 5, and the collocation corresponding to the word 3 is collocation 6, and word vectors of the word 1, the word 2, the word 3, the collocation 1, the collocation 2, the collocation 3, the collocation 4, the collocation 5, and the collocation 6 may be calculated respectively, and the word vectors are added to obtain a word vector of the query sentence finally. When the word vector of the query sentence is calculated, the matched word vector is added into the input features, the features of the syntactic structure are added, the defect that only the word vector of a word is used as the input features can be effectively overcome, and the semantic discrimination is improved.
The ranking module 140 then computes a word vector for the plurality of search results. In this embodiment, the word vector of the search result is actually the word vector of the title corresponding to the search result. The title corresponding to the search result is composed of a plurality of words, so the word vector of the search result can be calculated by adopting the same method as the method for calculating the word vector of the query sentence.
Then, the sorting module 140 calculates semantic similarity between the word vector of the query sentence and the word vectors of the plurality of search results.
Finally, the ranking module 140 may rank the plurality of search results according to semantic similarity. For example, the search results may be ranked in order of high to low semantic similarity, such that search results that are more semantically similar to the query statement are ranked higher.
Furthermore, as shown in fig. 5, the first obtaining module 120 may further include a establishing module 124.
Specifically, the establishing module 124 may first obtain the corpus, extract the candidate word pair corpus in the corpus, then may screen the word pair corpus from the candidate word pair corpus based on the co-occurrence frequency or log-likelihood ratio score, and finally establish the preset collocation dictionary based on the word pair corpus. For example, assuming that the corpus is "auto bluetooth auto connect", each word can be regarded as a node, and whether there is a syntactic dependency between every two nodes is analyzed. And taking two words with grammar dependency relationship as a candidate word pair corpus. The null word nodes such as "ones", "it" and "ones" may exist in the training corpus, and the null word nodes can be filtered out. After the candidate word pair corpus is selected, the candidate word pair corpus may be screened. The screening method can be divided into two methods. The first method is to screen by co-occurrence frequency, i.e. the number of co-occurrences of two words that constitute the corpus of candidate word pairs. And screening the candidate word pair corpora with the co-occurrence frequency larger than the preset times. The second method is to screen through log likelihood ratio score, that is, screen out the top N candidate word pair corpora with the highest log likelihood ratio score. The log-likelihood ratio score is mainly obtained by calculating collocation frequency, word frequency, corpus size and the like. The collocation frequency refers to the frequency of collocation of two words forming the candidate word pair corpus; the word frequency is the times of respectively showing two word pairs forming the candidate word pair corpus in the corpus; the corpus size is the number of words contained. And finally, establishing a preset collocation dictionary according to the word pair corpus obtained after screening.
According to the search result sorting device provided by the embodiment of the invention, the input query sentence is received, the query sentence is divided into a plurality of words, the collocation corresponding to the words is obtained, the plurality of search results corresponding to the query sentence are obtained, and the plurality of search results are sorted based on the BOW model according to the words and the collocation corresponding to the words, so that the sorting of the search results is optimized, the search results which are more in line with the intention of the user are preferentially provided, and the use experience of the user is improved.
Furthermore, the terms "first", "second" and "first" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include at least one such feature. In the description of the present invention, "a plurality" means at least two, e.g., two, three, etc., unless specifically limited otherwise.
In the description herein, references to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above are not necessarily intended to refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. Furthermore, various embodiments or examples and features of different embodiments or examples described in this specification can be combined and combined by one skilled in the art without contradiction.
Although embodiments of the present invention have been shown and described above, it is understood that the above embodiments are exemplary and should not be construed as limiting the present invention, and that variations, modifications, substitutions and alterations can be made to the above embodiments by those of ordinary skill in the art within the scope of the present invention.

Claims (6)

1. A method for ranking search results, comprising the steps of:
receiving an input query sentence, and segmenting the query sentence into a plurality of words;
acquiring collocation corresponding to the word;
obtaining a plurality of search results corresponding to the query statement; and
sorting the plurality of search results based on a bag of words (BOW) model according to the words and the collocation corresponding to the words;
wherein obtaining a collocation corresponding to the word comprises:
acquiring a current word and a word which is within a preset window length with the current word;
combining the current word and the words of the current word within a preset window length into a plurality of candidate collocations;
screening out a collocation corresponding to the current word based on a preset collocation dictionary;
wherein, according to the word and the collocation corresponding to the word, the plurality of search results are sorted based on a BOW model, including:
calculating the sum of the word vector of the word and the word vector of the collocation corresponding to the word as the word vector of the query sentence;
calculating a word vector for the plurality of search results;
calculating semantic similarity of the word vector of the query statement and the word vectors of the plurality of search results;
and sequencing the plurality of search results according to the semantic similarity.
2. The method of claim 1, prior to filtering out a collocation corresponding to the current word based on a preset collocation dictionary, further comprising:
and establishing a preset collocation dictionary.
3. The method of claim 2, wherein establishing a predetermined collocation dictionary comprises:
acquiring a training corpus and extracting a candidate word pair corpus in the training corpus;
and screening out the word pair corpus from the candidate word pair corpus based on the co-occurrence frequency or the log-likelihood ratio score, and establishing a preset collocation dictionary based on the word pair corpus.
4. A search result ranking apparatus, comprising:
the segmentation module is used for receiving an input query sentence and segmenting the query sentence into a plurality of words;
the first obtaining module is used for obtaining the collocation corresponding to the word;
the second acquisition module is used for acquiring a plurality of search results corresponding to the query statement; and
the sorting module is used for sorting the plurality of search results based on a BOW model according to the words and the collocation corresponding to the words;
wherein, the first obtaining module comprises:
the acquiring unit is used for acquiring a current word and a word which is within a preset window length with the current word;
the combination unit is used for combining the current word and the word in a preset window length with the current word into a plurality of candidate collocations;
the screening unit is used for screening out the collocation corresponding to the current word based on a preset collocation dictionary;
wherein the sorting module is configured to:
calculating the sum of the word vector of the word and the word vector of the collocation corresponding to the word as the word vector of the query sentence;
calculating a word vector for the plurality of search results;
calculating semantic similarity of the word vector of the query statement and the word vectors of the plurality of search results;
and sequencing the plurality of search results according to the semantic similarity.
5. The apparatus of claim 4, wherein the first obtaining module further comprises:
and the establishing unit is used for establishing a preset collocation dictionary before the collocation corresponding to the current word is screened out based on the preset collocation dictionary.
6. The apparatus of claim 5, wherein the establishing unit is to:
acquiring a training corpus and extracting a candidate word pair corpus in the training corpus;
and screening out the word pair corpus from the candidate word pair corpus based on the co-occurrence frequency or the log-likelihood ratio score, and establishing a preset collocation dictionary based on the word pair corpus.
CN201610245408.4A 2016-04-19 2016-04-19 Search result ordering method and device Active CN105955993B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610245408.4A CN105955993B (en) 2016-04-19 2016-04-19 Search result ordering method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610245408.4A CN105955993B (en) 2016-04-19 2016-04-19 Search result ordering method and device

Publications (2)

Publication Number Publication Date
CN105955993A CN105955993A (en) 2016-09-21
CN105955993B true CN105955993B (en) 2020-09-25

Family

ID=56917662

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610245408.4A Active CN105955993B (en) 2016-04-19 2016-04-19 Search result ordering method and device

Country Status (1)

Country Link
CN (1) CN105955993B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110019665A (en) * 2017-09-30 2019-07-16 北京国双科技有限公司 Text searching method and device

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101840397A (en) * 2009-03-20 2010-09-22 日电(中国)有限公司 Word sense disambiguation method and system
CN103390004A (en) * 2012-05-11 2013-11-13 北京百度网讯科技有限公司 Determination method and determination device for semantic redundancy and corresponding search method and device

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4587236B2 (en) * 2008-08-26 2010-11-24 Necビッグローブ株式会社 Information search apparatus, information search method, and program
CN102214189B (en) * 2010-04-09 2013-04-24 腾讯科技(深圳)有限公司 Data mining-based word usage knowledge acquisition system and method
CN105117487B (en) * 2015-09-19 2018-11-16 杭州电子科技大学 A kind of books semantic retrieving method based on content structure
CN105279252B (en) * 2015-10-12 2017-12-26 广州神马移动信息科技有限公司 Excavate method, searching method, the search system of related term

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101840397A (en) * 2009-03-20 2010-09-22 日电(中国)有限公司 Word sense disambiguation method and system
CN103390004A (en) * 2012-05-11 2013-11-13 北京百度网讯科技有限公司 Determination method and determination device for semantic redundancy and corresponding search method and device

Also Published As

Publication number Publication date
CN105955993A (en) 2016-09-21

Similar Documents

Publication Publication Date Title
CN104464736B (en) The error correction method and device of speech recognition text
CN106227815B (en) Multi-modal clue personalized application program function recommendation method and system
US11182435B2 (en) Model generation device, text search device, model generation method, text search method, data structure, and program
CN106776564B (en) Semantic recognition method and system based on knowledge graph
CN104317834B (en) A kind of across media sort methods based on deep neural network
US10922322B2 (en) Systems and methods for speech-based searching of content repositories
US20150269175A1 (en) Query Interpretation and Suggestion Generation under Various Constraints
US20150100307A1 (en) Text segmentation with multiple granularity levels
EP3213227A1 (en) Contextual search disambiguation
CN110750704B (en) Method and device for automatically completing query
US20090276378A1 (en) System and Method for Identifying Document Structure and Associated Metainformation and Facilitating Appropriate Processing
CN102999625A (en) Method for realizing semantic extension on retrieval request
CN103049548B (en) FAQ in electronic channel application identifies system and method
CN108038099B (en) Low-frequency keyword identification method based on word clustering
CN110705292B (en) Entity name extraction method based on knowledge base and deep learning
CN106202294A (en) The related news computational methods merged based on key word and topic model and device
CN103927339B (en) Knowledge Reorganizing system and method for knowledge realignment
CN105989058A (en) Chinese news brief generating system and method
CN103593338A (en) Information processing method and device
CN112527955A (en) Data processing method and device
CN105955993B (en) Search result ordering method and device
CN103377224A (en) Method and device for recognizing problem types and method and device for establishing recognition models
CN110413903A (en) Interest point information retrieves device and method in Vehicular navigation system
CN111428011A (en) Word recommendation method, device, equipment and storage medium
CN104008097A (en) Method and device for achieving query understanding

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant