CN104573015B - Information retrieval method and device - Google Patents

Information retrieval method and device Download PDF

Info

Publication number
CN104573015B
CN104573015B CN201510012725.7A CN201510012725A CN104573015B CN 104573015 B CN104573015 B CN 104573015B CN 201510012725 A CN201510012725 A CN 201510012725A CN 104573015 B CN104573015 B CN 104573015B
Authority
CN
China
Prior art keywords
term
word
rows
retrieval result
words
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510012725.7A
Other languages
Chinese (zh)
Other versions
CN104573015A (en
Inventor
马晋
张晓婧
杰艺
张博
刘初
禹贵辉
陈泓光
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201510012725.7A priority Critical patent/CN104573015B/en
Publication of CN104573015A publication Critical patent/CN104573015A/en
Application granted granted Critical
Publication of CN104573015B publication Critical patent/CN104573015B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Abstract

The present invention proposes a kind of information retrieval method and device, which includes receiving term;Retrieval result corresponding with the term is obtained, wherein, record has set of words in the retrieval result, and the set of words includes the word of other form of presentation of the term and the term;Show the retrieval result.This method can increase the quantity of the retrieval result of acquisition, promote retrieval effectiveness.

Description

Information retrieval method and device
Technical field
The present invention relates to search technique field more particularly to a kind of information retrieval methods and device.
Background technology
User can obtain the information needed by search engine, and when using search engine, user can be in search box Middle input term, search engine search and the relevant retrieval result of term, and it is illustrated in search results pages.
When name search paper of the user with author, different users may be searched for different title method for expressing Its title, current search engine can only recall opinion corresponding with author's title for exactly matching of method for expressing of user's search Text so that the retrieval result negligible amounts recalled or even acquisition are less than retrieval result.
The content of the invention
It is contemplated that it solves at least some of the technical problems in related technologies.
For this purpose, an object of the present invention is to provide a kind of information retrieval method, this method can increase the inspection of acquisition The quantity of hitch fruit promotes retrieval effectiveness.
It is another object of the present invention to propose a kind of information indexing device.
In order to achieve the above objectives, the information retrieval method that first aspect present invention embodiment proposes, including:Receive retrieval Word;Retrieval result corresponding with the term is obtained, wherein, record has set of words, the set of words in the retrieval result The word of other form of presentation including the term and the term;Show the retrieval result.
The information retrieval method that first aspect present invention embodiment proposes, by recording different expression shape in retrieval result The word of formula, during any one word that can be recorded in the term of reception is a retrieval result, it is possible to retrieve the retrieval As a result, so as to increase the quantity of the retrieval result of acquisition, retrieval effectiveness is improved.
In order to achieve the above objectives, the information indexing device that second aspect of the present invention embodiment proposes, including:Receiving module, For receiving term;Acquisition module, for obtaining retrieval result corresponding with the term, wherein, the retrieval result Middle record has set of words, and the set of words includes the word of other form of presentation of the term and the term;Displaying Module, for showing the retrieval result.
The information indexing device that second aspect of the present invention embodiment proposes, by recording different expression shape in retrieval result The word of formula, during any one word that can be recorded in the term of reception is a retrieval result, it is possible to retrieve the retrieval As a result, so as to increase the quantity of the retrieval result of acquisition, retrieval effectiveness is improved.
The additional aspect of the present invention and advantage will be set forth in part in the description, and will partly become from the following description It obtains substantially or is recognized by the practice of the present invention.
Description of the drawings
Above-mentioned and/or additional aspect and advantage of the invention will become from the following description of the accompanying drawings of embodiments Substantially and it is readily appreciated that, wherein:
Fig. 1 is the flow diagram for the information retrieval method that one embodiment of the invention proposes;
Fig. 2 is the flow diagram that the evidence of falling number of rows is established in the embodiment of the present invention;
Fig. 3 is the flow diagram that set of words is established in the embodiment of the present invention;
Fig. 4 is the schematic diagram of Inverted List in the embodiment of the present invention;
Fig. 5 is the schematic diagram of the summary part of article in the embodiment of the present invention;
Fig. 6 is a kind of displaying schematic diagram of information retrieval in the embodiment of the present invention;
Fig. 7 is the displaying schematic diagram of another information retrieval in the embodiment of the present invention;
Fig. 8 is the displaying schematic diagram of another information retrieval in the embodiment of the present invention;
Fig. 9 is the structure diagram for the information indexing device that another embodiment of the present invention proposes;
Figure 10 is the structure diagram for the information indexing device that another embodiment of the present invention proposes.
Specific embodiment
The embodiment of the present invention is described below in detail, the example of the embodiment is shown in the drawings, wherein from beginning to end Same or similar label represents same or similar element or has the function of same or like element.Below with reference to attached The embodiment of figure description is exemplary, and is only used for explaining the present invention, and is not considered as limiting the invention.On the contrary, this The embodiment of invention includes falling into all changes in the range of the spirit and intension of attached claims, modification and equivalent Object.
Fig. 1 is the flow diagram for the information retrieval method that one embodiment of the invention proposes, this method can should be arrived and searched During index is held up, this method can include:
S11:Receive term.
Term (query) is referred to as search term, query word etc..
Term can be the word that user is input in search box.It is, of course, understood that term can also be with The input of the other modes such as voice or picture, correspondingly, in retrieval, it can first carry out the side such as speech recognition or image identification Formula is converted to text, then carries out subsequent processing by the way of similar text input.
It is the entitled example of author with term in the embodiment of the present invention, the information retrieval method of the present embodiment can be applied Into these retrieval, the article including the author is retrieved by author's title realization of input.It is understood that the information Search method can also be applied to other field, correspondingly, term can be other titles, for example, trade name etc..
S12:Retrieval result corresponding with the term is obtained, wherein, record has set of words in the retrieval result, institute Predicate set includes the word of other form of presentation of the term and the term.
In the prior art, author's title is recorded in each article, which is typically the title of reference format, For example, author's title is:During Daniel A.Peterson, generally there are following several form of presentation:
Reference format:Daniel A.Peterson
GBT1174 forms:Peterson D A
APA forms:Peterson,Daniel A.
MLA forms:Peterson,D.A.
Surname is in preceding name in rear form:Peterson,Daniel A.
Normalized format:DA Peterson
In the prior art, due to the usual only title of record standard form, then when the title of user's search criterion form, During such as " Daniel A.Peterson ", the article of the author can be accurately hit, still, when retrieving other form of presentation During title, it can not but get accordingly as a result, for example, when user inputs " DA Peterson ", corresponding inspection can not be got Hitch fruit.
And in the present embodiment, corresponding each retrieval result, retrieval result is, for example, specifically article, is not only recorded in article Author's title of reference format also records the title of other form of presentation, for example, for " Daniel A.Peterson " author An article, when preserving article data, not only record " Daniel A.Peterson ", also record " Peterson D A ", " Peterson, Daniel A. ", " Peterson, D.A. ", " Peterson, Daniel A. " and " DA Peterson ".This Sample when title input by user is any one in the multiple titles recorded in an article, can retrieve this article, So as to increase the quantity for the retrieval result recalled.
Optionally, after receiving term, can be obtained corresponding with the term according to the evidence of falling number of rows pre-established Retrieval result, wherein, the evidence of falling number of rows is for showing term and the correspondence of retrieval result.
It is author's title with term, exemplified by retrieval result is article, the evidence of falling number of rows may indicate that author's title and article Correspondence, by the correspondence and author's title of reception, can get corresponding with the author's title received Article.
Referring to Fig. 2, establishing the flow for the evidence of falling number of rows can include:
S21:Establish set of words.
It is the entitled example of author with term referring to Fig. 3, establishing the flow of set of words can include:
S31:Obtain author's original name.
Author's original name can specifically refer to author's title of reference format.
S32:According to author's original name, different types of title is obtained.
Wherein it is possible to obtain different types of title by the way of the excavation of author's alias.
The excavation of author's alias mostlys come from two parts:First, according to the reference formats of various paper specification literary styles, such as The citation criterias such as GB, APA, MLA can generate various alias literary styles, second is that quoting this according on full internet according to author's full name The bibliography of other papers of piece paper excavates author's full name to author's literary style in the quotation of this paper according to the two Corresponding quotation forms the name set of author.
S33:Different types of title is normalized, obtains normalization title.
Wherein it is possible to it is normalized according to preset rules, for example, the generation specification of normalization title is:It is single in name The initial caps of a word, in addition the surname of author.
S34:Form set of words.
It, can be by these title forms set of words when obtaining different types of title and normalization title.
For example, the corresponding set of words of author's title that original name is " Daniel A.Peterson " includes:“Daniel A.Peterson ", " Peterson D A ", " Peterson, Daniel A. ", " Peterson, D.A. ", " Peterson, Daniel A. " and " DA Peterson ".
S22:According to the set of words, positive number of rows evidence is established, the positive number of rows evidence is used to show retrieval result and term Correspondence.
For example, author a has write two articles X and Y, the corresponding different types of titles of author a be (a1, a2, a3, A4), it is a0 to normalize title.
Positive number of rows evidence can include:
X->A0, a1, a2, a3, a4
Y->A0, a1, a2, a3, a4
S23:According to the positive number of rows according to the generation evidence of falling number of rows.
For example, the evidence of falling number of rows includes:
a0->X, Y
a1->X, Y
a2->X, Y
a3->X, Y
a4->X, Y
It is understood that record positive number of rows evidence and number of rows according to when, can be with the mark of physical record article (DocID)。
In addition, the evidence of falling number of rows can be specifically included in inverted index, inverted index is used for having recorded which document includes Some words.Many documents are generally had in collection of document and include some word, each document can recording documents number (DocID), such as there are at the information to the number (TF) and word that word occurs in this document in which position in a document, so Inverted index item (Posting) is referred to as with the relevant information of document, a series of inverted index items comprising this word List structure is formd, here it is the corresponding Inverted Lists of some word.It is the schematic diagram of Inverted List 41, in text referring to Fig. 4 All words and its corresponding Inverted List occurred in shelves set constitute inverted index.It is understood that this implementation In example, the word in Fig. 4 can specifically refer to author's title.
On the other hand, positive number of rows is being obtained after, it can be according to the corresponding set of words of article, by various statements in set of words The title of mode is recorded in article, for example, the title recorded in each article includes:
X->A0, a1, a2, a3, a4
Y->A0, a1, a2, a3, a4
Specifically, author's title can be recorded in the summary part of article, as shown in figure 5, be presented to the user for one The schematic diagram of the summary part of article, summary part 51 can specifically include:Article Titles (title), author's title (author), the abstract of a thesis (abstract), keyword (keyword) etc..
S13:Show the retrieval result.
Optionally, the displaying retrieval result, including:
In the retrieval result, general rise of prices of the stocks and other securities displaying is carried out to the word that form of presentation is preset in the set of words.
Optionally, the word of the default form of presentation is normalized word.
No matter author's title of which kind of form of presentation of user search, the article of the author can be recalled, and can be General rise of prices of the stocks and other securities is shown normalized title by article.
For example, with reference to Fig. 6, when term 61 input by user is " DA Peterson ", author can be got The article of Daniel A.Peterson, and general rise of prices of the stocks and other securities display normalization title 63 in author's title in retrieval result 62, return One assumed name claims to be " DA Peterson ".
In another example referring to Fig. 7, when term 71 input by user is " Daniel A.Peterson ", can get The article of author Daniel A.Peterson, and general rise of prices of the stocks and other securities display normalization title in author's title in retrieval result 72 73, normalization title is " DA Peterson ".
In another example referring to Fig. 8, when term 81 input by user is " when Peterson, D.A. ", can to get author The article of Daniel A.Peterson, and general rise of prices of the stocks and other securities display normalization title 83 in author's title in retrieval result 82, return One assumed name claims to be " DA Peterson ".
It should be noted that since application documents cannot use color picture, the normalization title " DA in Fig. 6-Fig. 8 Peterson " can be specifically that general rise of prices of the stocks and other securities is shown.
In the present embodiment, by recording the word of different expression form in retrieval result, can be in the term of reception During any one word recorded in one retrieval result, it is possible to the retrieval result is retrieved, so as to increase the retrieval of acquisition As a result quantity improves retrieval effectiveness;In addition, the present embodiment is to normalized word in retrieval result by carrying out general rise of prices of the stocks and other securities displaying, The content that can be shown to general rise of prices of the stocks and other securities carries out unification, improves bandwagon effect, promotes user experience.
Fig. 9 is the structure diagram for the information indexing device that another embodiment of the present invention proposes, which includes receiving Module 91, acquisition module 92 and display module 93.
Receiving module 91 is used to receive term;
Term (query) is referred to as search term, query word etc..
Term can be the word that user is input in search box.It is, of course, understood that term can also be with The input of the other modes such as voice or picture, correspondingly, in retrieval, it can first carry out the side such as speech recognition or image identification Formula is converted to text, then carries out subsequent processing by the way of similar text input.
It is the entitled example of author with term in the embodiment of the present invention, the information retrieval method of the present embodiment can be applied Into these retrieval, the article including the author is retrieved by author's title realization of input.It is understood that the information Search method can also be applied to other field, correspondingly, term can be other titles, for example, trade name etc..
Acquisition module 92 is used to obtain retrieval result corresponding with the term, wherein, it is recorded in the retrieval result There is set of words, the set of words includes the word of other form of presentation of the term and the term;
In the prior art, author's title is recorded in each article, which is typically the title of reference format, For example, author's title is:During Daniel A.Peterson, generally there are following several form of presentation:
Reference format:Daniel A.Peterson
GBT1174 forms:Peterson D A
APA forms:Peterson,Daniel A.
MLA forms:Peterson,D.A.
Surname is in preceding name in rear form:Peterson,Daniel A.
Normalized format:DA Peterson
In the prior art, due to the usual only title of record standard form, then when the title of user's search criterion form, During such as " Daniel A.Peterson ", the article of the author can be accurately hit, still, when retrieving other form of presentation During title, it can not but get accordingly as a result, for example, when user inputs " DA Peterson ", corresponding inspection can not be got Hitch fruit.
And in the present embodiment, corresponding each retrieval result, retrieval result is, for example, specifically article, is not only recorded in article Author's title of reference format also records the title of other form of presentation, for example, for " Daniel A.Peterson " author An article, when preserving article data, not only record " Daniel A.Peterson ", also record " Peterson D A ", " Peterson, Daniel A. ", " Peterson, D.A. ", " Peterson, Daniel A. " and " DA Peterson ".This Sample when title input by user is any one in the multiple titles recorded in an article, can retrieve this article, So as to increase the quantity for the retrieval result recalled.
Optionally, the acquisition module 92 is specifically used for:
According to the evidence of falling number of rows pre-established, retrieval result corresponding with the term is obtained, wherein, the number of rows According to for showing the correspondence of term and retrieval result.
It is author's title with term, exemplified by retrieval result is article, the evidence of falling number of rows may indicate that author's title and article Correspondence, by the correspondence and author's title of reception, can get corresponding with the author's title received Article.
Optionally, referring to Figure 10, which further includes:
Module 94 is established, for establishing the evidence of falling number of rows;
The module 94 of establishing includes:
First module 941, for establishing the set of words;
The first module 941 is specifically used for:
Obtain the term;
It is the entitled example of author with term, author's original name can be obtained, author's original name can specifically refers to reference format Author's title.
According to the term, the different types of word of the term is obtained;
Wherein it is possible to obtain different types of title by the way of the excavation of author's alias.
The excavation of author's alias mostlys come from two parts:First, according to the reference formats of various paper specification literary styles, such as The citation criterias such as GB, APA, MLA can generate various alias literary styles, second is that quoting this according on full internet according to author's full name The bibliography of other papers of piece paper excavates author's full name to author's literary style in the quotation of this paper according to the two Corresponding quotation forms the name set of author.
The different types of word is normalized, obtains normalized word;
Wherein it is possible to it is normalized according to preset rules, for example, the generation specification of normalization title is:It is single in name The initial caps of a word, in addition the surname of author.
The set of words is formed according to the different types of word and the normalized word.
It, can be by these title forms set of words when obtaining different types of title and normalization title.
For example, the corresponding set of words of author's title that original name is " Daniel A.Peterson " includes:“Daniel A.Peterson ", " Peterson D A ", " Peterson, Daniel A. ", " Peterson, D.A. ", " Peterson, Daniel A. " and " DA Peterson ".
Second unit 942, for according to the set of words, establishing positive number of rows evidence, the positive number of rows evidence is used to show to retrieve As a result with the correspondence of term;
For example, author a has write two articles X and Y, the corresponding different types of titles of author a be (a1, a2, a3, A4), it is a0 to normalize title.
Positive number of rows evidence can include:
X->A0, a1, a2, a3, a4
Y->A0, a1, a2, a3, a4
Third unit 943, for according to the positive number of rows according to generation the evidence of falling number of rows.
For example, the evidence of falling number of rows includes:
a0->X, Y
a1->X, Y
a2->X, Y
a3->X, Y
a4->X, Y
It is understood that record positive number of rows evidence and number of rows according to when, can be with the mark of physical record article (DocID)。
In addition, the evidence of falling number of rows can be specifically included in inverted index, inverted index is used for having recorded which document includes Some words.Many documents are generally had in collection of document and include some word, each document can recording documents number (DocID), such as there are at the information to the number (TF) and word that word occurs in this document in which position in a document, so Inverted index item (Posting) is referred to as with the relevant information of document, a series of inverted index items comprising this word List structure is formd, here it is the corresponding Inverted Lists of some word.It is the schematic diagram of Inverted List, in document referring to Fig. 4 All words and its corresponding Inverted List occurred in set constitute inverted index.It is understood that the present embodiment In, the word in Fig. 4 can specifically refer to author's title.
Optionally, referring to Figure 10, which further includes:
Logging modle 95, for according to the positive number of rows evidence, the set of words to be recorded in the retrieval result.
Positive number of rows is being obtained after, it can be according to the corresponding set of words of article, by the name of various form of presentation in set of words Title is recorded in article, for example, the title recorded in each article includes:
X->A0, a1, a2, a3, a4
Y->A0, a1, a2, a3, a4
Specifically, author's title can be recorded in the summary part of article, as shown in figure 5, be presented to the user for one The schematic diagram of the summary part of article, summary part 51 can specifically include:Article Titles (title), author's title (author), the abstract of a thesis (abstract), keyword (keyword) etc..
Display module 93, for showing the retrieval result.
Optionally, the display module 93 is specifically used for:
In the retrieval result, general rise of prices of the stocks and other securities displaying is carried out to the word that form of presentation is preset in the set of words.
Optionally, the word of the default form of presentation is normalized word.
No matter author's title of which kind of form of presentation of user search, the article of the author can be recalled, and can be General rise of prices of the stocks and other securities is shown normalized title by article.
For example, with reference to Fig. 6, when term 61 input by user is " DA Peterson ", author can be got The article of Daniel A.Peterson, and general rise of prices of the stocks and other securities display normalization title 63 in author's title in retrieval result 62, return One assumed name claims to be " DA Peterson ".
In another example referring to Fig. 7, when term 71 input by user is " Daniel A.Peterson ", can get The article of author Daniel A.Peterson, and general rise of prices of the stocks and other securities display normalization title in author's title in retrieval result 72 73, normalization title is " DA Peterson ".
In another example referring to Fig. 8, when term 81 input by user is " when Peterson, D.A. ", can to get author The article of Daniel A.Peterson, and general rise of prices of the stocks and other securities display normalization title 83 in author's title in retrieval result 82, return One assumed name claims to be " DA Peterson ".
It should be noted that since application documents cannot use color picture, the normalization title " DA in Fig. 6-Fig. 8 Peterson " can be specifically that general rise of prices of the stocks and other securities is shown.
In the present embodiment, by recording the word of different expression form in retrieval result, can be in the term of reception During any one word recorded in one retrieval result, it is possible to the retrieval result is retrieved, so as to increase the retrieval of acquisition As a result quantity improves retrieval effectiveness;In addition, the present embodiment is to normalized word in retrieval result by carrying out general rise of prices of the stocks and other securities displaying, The content that can be shown to general rise of prices of the stocks and other securities carries out unification, improves bandwagon effect, promotes user experience.
It should be noted that in the description of the present invention, term " first ", " second " etc. are only used for description purpose, without It is understood that indicate or imply relative importance.In addition, in the description of the present invention, unless otherwise indicated, the meaning of " multiple " It is two or more.
Any process described otherwise above or method description are construed as in flow chart or herein, represent to include Module, segment or the portion of the code of the executable instruction of one or more the step of being used to implement specific logical function or process Point, and the scope of the preferred embodiment of the present invention includes other realization, wherein can not press shown or discuss suitable Sequence, including according to involved function by it is basic simultaneously in the way of or in the opposite order, carry out perform function, this should be of the invention Embodiment person of ordinary skill in the field understood.
It should be appreciated that each several part of the present invention can be realized with hardware, software, firmware or combination thereof.Above-mentioned In embodiment, software that multiple steps or method can in memory and by suitable instruction execution system be performed with storage Or firmware is realized.If for example, with hardware come realize in another embodiment, can be under well known in the art Any one of row technology or their combination are realized:With for the logic gates to data-signal realization logic function Discrete logic, have suitable combinational logic gate circuit application-specific integrated circuit, programmable gate array (PGA), scene Programmable gate array (FPGA) etc..
Those skilled in the art are appreciated that realize all or part of step that above-described embodiment method carries Suddenly it is that relevant hardware can be instructed to complete by program, the program can be stored in a kind of computer-readable storage medium In matter, the program upon execution, one or a combination set of the step of including embodiment of the method.
In addition, each functional unit in each embodiment of the present invention can be integrated in a processing module, it can also That unit is individually physically present, can also two or more units be integrated in a module.Above-mentioned integrated mould The form that hardware had both may be employed in block is realized, can also be realized in the form of software function module.The integrated module is such as Fruit is realized in the form of software function module and is independent production marketing or in use, can also be stored in a computer In read/write memory medium.
Storage medium mentioned above can be read-only memory, disk or CD etc..
In the description of this specification, reference term " one embodiment ", " some embodiments ", " example ", " specifically show The description of example " or " some examples " etc. means specific features, structure, material or the spy for combining the embodiment or example description Point is contained at least one embodiment of the present invention or example.In the present specification, schematic expression of the above terms is not Centainly refer to identical embodiment or example.Moreover, particular features, structures, materials, or characteristics described can be any One or more embodiments or example in combine in an appropriate manner.
Although the embodiment of the present invention has been shown and described above, it is to be understood that above-described embodiment is example Property, it is impossible to limitation of the present invention is interpreted as, those of ordinary skill in the art within the scope of the invention can be to above-mentioned Embodiment is changed, changes, replacing and modification.

Claims (12)

1. a kind of information retrieval method, which is characterized in that including:
Receive term;
Retrieval result corresponding with the term is obtained, wherein, record has set of words, the set of words in the retrieval result The word of other form of presentation including the term and the term;
Show the retrieval result;
It is described to obtain retrieval result corresponding with the term, including:
According to the evidence of falling number of rows pre-established, obtain retrieval result corresponding with the term, wherein, the number of rows according to In the correspondence for showing term and retrieval result.
2. according to the method described in claim 1, it is characterized in that, the evidence of falling number of rows that the basis pre-establishes, acquisition and institute Before stating the corresponding retrieval result of term, the method further includes:
The evidence of falling number of rows is established, it is described to establish the evidence of falling number of rows, including:
Establish the set of words;
According to the set of words, positive number of rows evidence is established, the positive number of rows is closed according to for showing that retrieval result is corresponding with term System;
According to the positive number of rows according to the generation evidence of falling number of rows.
3. according to the method described in claim 2, it is characterized in that, described establish positive number of rows after, the method further includes:
According to the positive number of rows evidence, the set of words is recorded in the retrieval result.
4. according to the method described in claim 2, it is characterized in that, described establish the set of words, including:
Obtain the term;
According to the term, the different types of word of the term is obtained;
The different types of word is normalized, obtains normalized word;
The set of words is formed according to the different types of word and the normalized word.
5. according to the method described in claim 1, it is characterized in that, the displaying retrieval result, including:
In the retrieval result, general rise of prices of the stocks and other securities displaying is carried out to the word that form of presentation is preset in the set of words.
6. according to the method described in claim 5, it is characterized in that, the word of the default form of presentation is normalized word.
7. according to claim 1-6 any one of them methods, which is characterized in that the term is author's title.
8. a kind of information indexing device, which is characterized in that including:
Receiving module, for receiving term;
Acquisition module, for obtaining retrieval result corresponding with the term, wherein, record has word set in the retrieval result It closes, the set of words includes the word of other form of presentation of the term and the term;
Display module, for showing the retrieval result;
The acquisition module is specifically used for:
According to the evidence of falling number of rows pre-established, obtain retrieval result corresponding with the term, wherein, the number of rows according to In the correspondence for showing term and retrieval result.
9. device according to claim 8, which is characterized in that further include:
Module is established, for establishing the evidence of falling number of rows;
The module of establishing includes:
First module, for establishing the set of words;
Second unit, for according to the set of words, establishing positive number of rows evidence, the positive number of rows evidence is used to show retrieval result and inspection The correspondence of rope word;
Third unit, for according to the positive number of rows according to generation the evidence of falling number of rows.
10. device according to claim 9, which is characterized in that further include:
Logging modle, for according to the positive number of rows evidence, the set of words to be recorded in the retrieval result.
11. device according to claim 9, which is characterized in that the first module is specifically used for:
Obtain the term;
According to the term, the different types of word of the term is obtained;
The different types of word is normalized, obtains normalized word;
The set of words is formed according to the different types of word and the normalized word.
12. device according to claim 8, which is characterized in that the display module is specifically used for:
In the retrieval result, general rise of prices of the stocks and other securities displaying is carried out to the word that form of presentation is preset in the set of words.
CN201510012725.7A 2015-01-12 2015-01-12 Information retrieval method and device Active CN104573015B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510012725.7A CN104573015B (en) 2015-01-12 2015-01-12 Information retrieval method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510012725.7A CN104573015B (en) 2015-01-12 2015-01-12 Information retrieval method and device

Publications (2)

Publication Number Publication Date
CN104573015A CN104573015A (en) 2015-04-29
CN104573015B true CN104573015B (en) 2018-06-05

Family

ID=53089077

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510012725.7A Active CN104573015B (en) 2015-01-12 2015-01-12 Information retrieval method and device

Country Status (1)

Country Link
CN (1) CN104573015B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105631052A (en) * 2016-03-01 2016-06-01 北京百度网讯科技有限公司 Artificial intelligence based retrieval method and artificial intelligence based retrieval device
CN106484841B (en) * 2016-09-30 2019-09-24 北京奇付通科技有限公司 It is furnished an answer the searching method and device of item based on search result
CN106776805A (en) * 2016-11-22 2017-05-31 百度在线网络技术(北京)有限公司 Periodical information acquisition methods and device based on artificial intelligence

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103886039A (en) * 2014-03-10 2014-06-25 百度在线网络技术(北京)有限公司 Optimization method and device with searching
CN103914552A (en) * 2014-04-14 2014-07-09 百度在线网络技术(北京)有限公司 Method and device for retrieving applications

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102023989B (en) * 2009-09-23 2012-10-10 阿里巴巴集团控股有限公司 Information retrieval method and system thereof

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103886039A (en) * 2014-03-10 2014-06-25 百度在线网络技术(北京)有限公司 Optimization method and device with searching
CN103914552A (en) * 2014-04-14 2014-07-09 百度在线网络技术(北京)有限公司 Method and device for retrieving applications

Also Published As

Publication number Publication date
CN104573015A (en) 2015-04-29

Similar Documents

Publication Publication Date Title
Rain Sentiment analysis in amazon reviews using probabilistic machine learning
US20070143298A1 (en) Browsing items related to email
US20110060739A1 (en) System and method to research documents in online libraries
TWI398786B (en) System, method and computer readable media for generating expertise based search results
CN104809195B (en) The recommended method and device of search result
CN104573015B (en) Information retrieval method and device
CN113778295B (en) Book recommendation method and device, computer equipment and storage medium
CN107992602A (en) Search result methods of exhibiting and device
Hinton et al. We who love to be astonished: experimental women's writing and performance poetics
Sweet et al. Machine learning techniques for brand-influencer matchmaking on the instagram social network
CN113222687A (en) Deep learning-based recommendation method and device
KR20180059112A (en) Apparatus for classifying contents and method for using the same
Belovari Expedited digital appraisal for regular archivists: an MPLP-type approach
US20150286721A1 (en) System and Method for Returning Precise Internet Search Results
Koh Alternative literature in libraries: The unseen zine
Rad et al. A survey on automatic image annotation
TWI537751B (en) Non-volatile computer-readable storage media, system and method for image automatic description
Kern et al. Exploring the influence of tagging motivation on tagging behavior
Alex et al. User-driven text mining of historical text
Ghafari et al. Futuristic analysis of the Second Step Statement of the Islamic Revolution
CN106408320A (en) Advertisement index construction method and apparatus and advertisement retrieval method and system
PARVINI et al. THE PATHOLOGY OF COMPARATIVE LITERATURE JOURNALS IN IRAN (THE CASE STUDY: THE RESEARCH ON COMPARATIVE LITERATURE)
Gržina From a Private Archive to a Public Museum.
Harper Literary archives: the British Library
Shaftel All Good Magazines Go to Heaven.

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant