CN103631784A - Page content retrieval method and system - Google Patents

Info

Publication number
CN103631784A
Authority
CN
China
Prior art keywords
content
pages
keyword
participle
word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201210299109.0A
Other languages
Chinese (zh)
Other versions
CN103631784B (en)
Inventor
付笑冰
刘晓更
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd
Priority to CN201210299109.0A
Publication of CN103631784A
Application granted
Publication of CN103631784B
Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/951 Indexing; Web crawling techniques
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/205 Parsing

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention discloses a page content retrieval method which includes the following steps: obtaining an input keyword, carrying out word segmentation on the keyword, carrying out dual word segmentation on page content, carrying out retrieval in the page content after the dual word segmentation according to the keyword after the word segmentation, and obtaining the page content matched with the keyword. According to the page content retrieval method, the retrieval accuracy can be improved. In addition, the invention further provides a page content retrieval system.

Description

Page content retrieval method and system
Technical field
The present invention relates to information retrieval technology, and in particular to a page content retrieval method and system.
Background
With the development of network technology, browsing page content through a Web browser on a smart TV is becoming increasingly common. Because page content contains a large amount of information, users often need to understand the page content and locate certain key information within it.
In a traditional page content retrieval method, an input keyword is obtained first. The keyword may be entered with a full-keyboard remote control, with a handwriting touch screen or handwriting pad, or by voice. Retrieval is then performed in the page content according to the input keyword to obtain the page content that matches the keyword, and the matching page content is marked in the page.
However, when such a traditional method retrieves in the page content according to the input keyword, it can only match the page content after performing simple word segmentation on the keyword. Many retrieval results tend to be missed, which reduces the accuracy of the retrieval results.
Summary of the invention
In view of this, to address the low accuracy of retrieval results, it is necessary to provide a more accurate page content retrieval method.
A page content retrieval method comprises the following steps:
Obtaining a keyword entered by voice;
Performing word segmentation on the keyword;
Performing dual word segmentation on page content;
Retrieving, according to the segmented keyword, in the page content that has undergone dual word segmentation, to obtain the page content that matches the segmented keyword.
In addition, a more accurate page content retrieval system is also provided.
A page content retrieval system comprises:
A keyword acquisition module, configured to obtain a keyword entered by voice;
A keyword segmentation module, configured to perform word segmentation on the keyword;
A page content segmentation module, configured to perform dual word segmentation on page content;
A retrieval module, configured to retrieve, according to the segmented keyword, in the page content that has undergone dual word segmentation, to obtain the page content that matches the segmented keyword.
In the above page content retrieval method and system, word segmentation is performed on the keyword and dual word segmentation is performed on the page content. Dual word segmentation of the page content allows the segmented keyword to match more page content and reduces the retrieval results that might otherwise be missed, so retrieval accuracy can be improved.
Brief description of the drawings
Fig. 1 is a schematic flowchart of the page content retrieval method in one embodiment;
Fig. 2 is a structural block diagram of the page content retrieval system in one embodiment;
Fig. 3 is a structural block diagram of the keyword acquisition module in Fig. 2;
Fig. 4 is a structural block diagram of the page content retrieval system in another embodiment.
Detailed description
As shown in Fig. 1, in one embodiment, a page content retrieval method comprises the following steps:
Step S102: obtain an input keyword.
When page content is browsed through a Web browser, the browser's search function can be opened by entering a preset control instruction so that key information in the page can be retrieved. Further, a click instruction on the option key and the confirm key of the remote control may be received to trigger entry into a search mode. The search mode may include a content search mode and a link search mode: the retrieval results of the content search mode are text content in the page, and the retrieval results of the link search mode are text content with links in the page.
Further, after the search mode is entered, a click instruction on the option key of the remote control may be received to place the input focus in the search edit box, and the input keyword can be obtained through the search edit box.
Specifically, in step S102, a keyword entered with a full-keyboard remote control, a handwriting touch screen or a handwriting pad may be received, and a keyword entered by voice may also be received. In one embodiment, the voice input mode may be activated after a voice input activation instruction is received. For example, the voice input activation instruction may be a press of the MIC key on the remote control or a long press of the confirm key.
In one embodiment, step S102 proceeds as follows: obtain the input voice information; obtain candidate keywords that match the input voice information; and, when the candidate keywords include homophones, display a candidate keyword option that contains the pinyin of the homophone.
After the voice input mode is activated, prompt information asking the user to speak may be displayed on the screen of the terminal running the Web browser, for example "Please start speaking". The input voice information is then received, and candidate keywords that match it are obtained. The input voice information may be converted into text information (such as phonetic script), and a lookup then yields multiple candidate keywords that match the text information. For example, the input voice information is converted into the pinyin "gaojizhanghao", and the candidate keywords obtained include "senior account number", "senior account" and so on.
Further, the candidate keywords obtained may be displayed in a drop-down list of the search edit box, and the candidate keyword selected in the drop-down list is the input keyword. In this embodiment, when the candidate keywords include homophones, a candidate keyword option containing the pinyin of the homophone is displayed. For example, the candidate keywords that match the input voice information include "senior account number", "senior account" and so on, where "account number" and "account" are homophones; the drop-down list then additionally displays the option "senior [zhanghao]", which can match multiple homophones.
In this embodiment, a candidate keyword containing the pinyin of a homophone can be selected as input, so that more page content matching the keyword can be found during retrieval. This lowers the miss rate of the retrieval results and thereby improves retrieval accuracy.
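The homophone option described above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the Chinese strings are an assumed reconstruction of the examples that the translation renders as "senior account number" and "senior account", the PINYIN table is a hypothetical stand-in for a real pronunciation lexicon, and the function name candidate_options is invented for this sketch.

```python
from collections import defaultdict

# Hypothetical pinyin table covering only the words of this example;
# a real system would take readings from a pronunciation lexicon.
PINYIN = {"高级": "gaoji", "账号": "zhanghao", "帐号": "zhanghao"}


def candidate_options(candidates):
    """Build the drop-down options for a list of candidate keywords.

    Each candidate is a tuple of words, e.g. ("高级", "账号").
    The result lists every candidate as plain text and then adds one
    pinyin option for each position where several candidates differ
    only by words that share the same reading (homophones).
    """
    options = ["".join(words) for words in candidates]
    groups = defaultdict(set)
    for words in candidates:
        for i, word in enumerate(words):
            # Same prefix, same suffix and same reading => homophone slot.
            key = (words[:i], PINYIN.get(word), words[i + 1:])
            groups[key].add(word)
    for (prefix, reading, suffix), variants in groups.items():
        if reading is not None and len(variants) > 1:
            options.append("".join(prefix) + "[" + reading + "]" + "".join(suffix))
    return options


options = candidate_options([("高级", "账号"), ("高级", "帐号")])
```

Selecting the bracketed option would let the later retrieval step accept any word with that reading at that position.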
Step S104: perform word segmentation on the input keyword.
Simple word segmentation may be performed on the input keyword. In one embodiment, in step S104, the input keyword may be segmented according to the words in a preset dictionary and their corresponding priorities.
Specifically, a dictionary is set up in advance. The dictionary stores common words, and each word has a corresponding priority. When the input keyword is segmented, the keyword is split, in character order, into words in the dictionary; further, it may be split into the words with high priority in the dictionary. For example, "senior account number" is segmented as "senior | account number". Further, the input keyword may be segmented using the earliest-occurrence principle: while the keyword is segmented in character order, once a word has been split off, the remaining characters are segmented further. For example, the input keyword "since the latter" is segmented as "since | the latter"; the segmentation "both | then | person" does not occur.
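One way to read the segmentation rule above is a left-to-right greedy pass that, at each position, picks the highest-priority dictionary word starting there and then continues with the remainder. The sketch below follows that reading. The dictionary contents, the priority values, the tie-break on word length and the Chinese strings (an assumed reconstruction of the "since the latter" example) are all assumptions made for illustration.

```python
def segment_keyword(text, dictionary):
    """Greedy left-to-right segmentation.

    dictionary maps word -> priority (a higher number wins). At each
    position the highest-priority dictionary word starting there is
    split off (ties go to the longer word); the rest of the text is
    then segmented the same way. Characters that start no dictionary
    word are emitted on their own.
    """
    max_len = max(map(len, dictionary), default=1)
    tokens, i = [], 0
    while i < len(text):
        stop = min(len(text), i + max_len)
        matches = [text[i:j] for j in range(i + 1, stop + 1) if text[i:j] in dictionary]
        if matches:
            word = max(matches, key=lambda w: (dictionary[w], len(w)))
        else:
            word = text[i]
        tokens.append(word)
        i += len(word)
    return tokens


# Hypothetical dictionary with priorities.
DICTIONARY = {"既然": 5, "然后": 5, "后者": 5, "既": 1, "者": 1, "高级": 3, "账号": 3}

tokens = segment_keyword("既然后者", DICTIONARY)
```

Because the word starting earliest is committed first, the overlapping word in the middle of the string is never produced here, which is consistent with the example in the text.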
Step S106: perform dual word segmentation on the page content.
The page content is the content to be retrieved, and dual word segmentation is performed on it. Dual word segmentation means segmenting the page content into all possible words according to the words in the dictionary, so that adjacent words after segmentation may share overlapping characters. In one embodiment, in step S106, the text content in the page content is segmented into all possible words according to the words in the preset dictionary. For example, if the page content contains "since the latter", the result after segmentation is "[since]-[then]-[the latter]", where the symbol "]-[" indicates that adjacent words overlap.
Performing dual word segmentation on the page content allows the segmented keyword to match more page content and reduces the retrieval results that might otherwise be missed, so retrieval accuracy can be improved.
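Dual word segmentation as described above can be approximated by enumerating every dictionary word that occurs anywhere in the text, overlaps included. The following is a minimal sketch under that assumption; the function name dual_segment, the sample dictionary and the Chinese string (an assumed reconstruction of the "since the latter" example) are illustrative only.

```python
def dual_segment(text, dictionary):
    """Return every dictionary word found in text as (start, word).

    Unlike ordinary segmentation, words are allowed to overlap, so
    the result contains all possible words rather than one partition.
    """
    max_len = max(map(len, dictionary), default=1)
    spans = []
    for i in range(len(text)):
        stop = min(len(text), i + max_len)
        for j in range(i + 1, stop + 1):
            if text[i:j] in dictionary:
                spans.append((i, text[i:j]))
    return spans


# Hypothetical dictionary; three overlapping two-character words.
WORDS = {"既然", "然后", "后者"}

spans = dual_segment("既然后者", WORDS)
```

Here the words starting at offsets 0, 1 and 2 each share a character with their neighbor, which corresponds to the "]-[" notation in the text.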
In one embodiment, before step S106, text content that contains key characters of the keyword may also be selected from the page content. Specifically, after the input keyword is obtained, the key characters in the keyword are further obtained. For example, if the input keyword is "senior account number", the key characters in the keyword are its individual characters, glossed as "height", "level", "account" and "number". In this embodiment, the text content selected from the page content may specifically be the sentences or paragraphs that contain key characters. Preferably, the number of key characters that a selected sentence or paragraph must contain may also be set, for example 1 or 2 of them.
In this embodiment, in step S106, dual word segmentation may be performed on the selected text content. The dual word segmentation method is as described above and is not repeated here. Because the text content containing key characters of the keyword is selected first and only then dual-segmented, less data undergoes dual word segmentation than when the entire page content is segmented, and the scope of the subsequent retrieval is also narrowed, so processing performance and efficiency can be improved.
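The pre-filtering step can be sketched as a pass that keeps only the sentences sharing enough characters with the keyword. The sentence-splitting rule, the parameter name min_hits and the sample page text below are assumptions made for this sketch; the source only says that sentences or paragraphs containing key characters are selected and that the required number of key characters may be configured.

```python
import re


def filter_by_key_characters(page_text, keyword, min_hits=1):
    """Keep the sentences that contain at least min_hits distinct
    characters of the keyword.

    Sentences are split on common Chinese and Western end marks;
    this splitting rule is an assumption of the sketch.
    """
    key_chars = set(keyword) - {" "}
    sentences = [s for s in re.split(r"[。！？.!?\n]+", page_text) if s.strip()]
    return [s for s in sentences if len(key_chars & set(s)) >= min_hits]


# Hypothetical page text with three sentences.
PAGE = "高级账号可以登录。今天天气很好。请输入账号。"

kept = filter_by_key_characters(PAGE, "高级账号")
```

Only the kept sentences would then be passed on to dual word segmentation, which is where the reduction in data volume comes from.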
Step S108: retrieve, according to the segmented keyword, in the page content that has undergone dual word segmentation, to obtain the page content that matches the keyword.
Specifically, the page content that matches the keyword is the page content (that is, text content) whose dual word segmentation contains the segmented keyword. For example, if the input keyword is segmented as "senior | account number", then page content whose dual word segmentation is "[senior]-[rank] | [account number]" matches the input keyword.
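The matching rule in step S108 can be sketched as a check that every segmented keyword word appears among the dual-segmented words of a piece of text. Requiring the words to appear in keyword order is an assumption of this sketch; the source does not state it. The Chinese strings are an assumed reconstruction of the "[senior]-[rank] | [account number]" example, and the function names are illustrative.

```python
def dual_segment(text, dictionary):
    """All dictionary words occurring in text, as (start, word), overlaps allowed."""
    max_len = max(map(len, dictionary), default=1)
    return [
        (i, text[i:j])
        for i in range(len(text))
        for j in range(i + 1, min(len(text), i + max_len) + 1)
        if text[i:j] in dictionary
    ]


def matches(sentence, keyword_tokens, dictionary):
    """True if each keyword token occurs among the dual-segmented words
    of the sentence, at increasing start positions (ordering is an
    assumption of this sketch)."""
    spans = dual_segment(sentence, dictionary)
    pos = -1
    for token in keyword_tokens:
        starts = [start for start, word in spans if word == token and start > pos]
        if not starts:
            return False
        pos = min(starts)
    return True


# Hypothetical dictionary and keyword segmentation.
WORDS = {"高级", "级别", "账号"}
KEYWORD_TOKENS = ["高级", "账号"]

hit = matches("高级别账号", KEYWORD_TOKENS, WORDS)
```

A single-partition segmenter could split the same sentence so that the first keyword word never appears as a token, which is the kind of miss that dual word segmentation is meant to avoid.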
Further, in one embodiment, after the page content matching the keyword is obtained, it may also be marked in the page. Specifically, the retrieved page content may be identified in the page with an underline, and the input keyword contained in the page content may be highlighted. The page content that matches the keyword includes both plain-text page content and page content with links.
When multiple pieces of page content match the keyword, they may also be marked with sequence numbers in the page according to the order in which they appear in the page. For example, the sequence-number marking in the page is "[#1] senior account number ... [#2] high-level account number ...".
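The sequence-number marking can be sketched as inserting a "[#n]" label in front of each matched span, numbered by position in the page. The span representation (start, end), the assumption that spans do not overlap, and the sample page text are choices made for this sketch; underlining and highlighting are left out because they depend on the rendering layer.

```python
def number_matches(page_text, spans):
    """Prefix each matched span with "[#n]", numbering spans in the
    order they appear in the page.

    spans is a list of (start, end) offsets and is assumed to contain
    no overlapping spans.
    """
    out, cursor = [], 0
    for n, (start, end) in enumerate(sorted(spans), start=1):
        out.append(page_text[cursor:start])
        out.append(f"[#{n}]" + page_text[start:end])
        cursor = end
    out.append(page_text[cursor:])
    return "".join(out)


# Hypothetical page text; the spans are given out of order on purpose.
PAGE = "登录高级账号，或升级为高级别账号。"

marked = number_matches(PAGE, [(11, 16), (2, 6)])
```

Sorting the spans before numbering is what ties the sequence numbers to the order of appearance rather than to the order in which matches were found.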
In one embodiment, the above page content retrieval method may further comprise: obtaining input voice information; obtaining a control instruction corresponding to the voice information; and controlling the retrieval of the page content and the page content according to the control instruction.
Specifically, after a click instruction on the MIC key or the confirm key of the remote control is received, a voice navigation mode may be activated, the input voice information is obtained, and the control instruction corresponding to the input voice information is obtained according to a pre-stored correspondence between voice information and control instructions. In one embodiment, the control instruction may be an instruction for selecting among the candidate keywords displayed in the drop-down list of the search edit box. For example, if the voice information obtained is "up" or "down", an instruction for moving the selection up or down among the candidate keywords in the drop-down list is obtained; if the voice information obtained is "OK", the selected candidate keyword is confirmed as the input.
In another embodiment, the control instruction may also be an instruction for switching among the retrieved page content. After the page content matching the keyword has been retrieved, it is marked in the page, and the first retrieved piece of page content in the page can be navigated to. When the control instruction corresponding to the voice information is a switching instruction, switching can be performed among the multiple retrieved pieces of page content. For example, if the voice information obtained is "next" or "back", the next or the previous retrieved piece of page content is selected.
In another embodiment, the control instruction may also be an instruction for moving the page up or down. For example, when the voice information obtained is "up" or "down", the page is moved up or down.
Controlling the retrieval of the page content, or the page content itself, through the voice navigation mode allows page content retrieval to be navigated entirely by voice, without frequent switching between voice input and manual input (input with the remote control), so convenience of operation can be improved.
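The pre-stored correspondence between voice information and control instructions can be sketched as a lookup table. Because the same spoken word ("up" or "down") selects a candidate in one embodiment and scrolls the page in another, this sketch keys the table on a mode as well; the mode names, the instruction names and the English command words are all assumptions introduced for illustration.

```python
# Hypothetical correspondence table: (mode, recognized text) -> instruction.
COMMANDS = {
    ("candidates", "up"): "select_previous_candidate",
    ("candidates", "down"): "select_next_candidate",
    ("candidates", "ok"): "confirm_candidate",
    ("results", "next"): "go_to_next_result",
    ("results", "back"): "go_to_previous_result",
    ("page", "up"): "scroll_page_up",
    ("page", "down"): "scroll_page_down",
}


def control_instruction(mode, voice_text):
    """Look up the control instruction for recognized voice text.

    Returns None when the text has no stored instruction in the
    given mode, so the caller can ignore unrecognized commands.
    """
    return COMMANDS.get((mode, voice_text.strip().lower()))


instruction = control_instruction("results", "Next")
```

Keeping the correspondence in data rather than in branching code makes it straightforward to add further spoken commands, for example one that opens a linked result.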
In one embodiment, the retrieved page content also includes page content with links. After a retrieved piece of page content with a link has been navigated to, the corresponding link may be opened and the corresponding page entered upon receiving a click instruction on the confirm key of the remote control or upon receiving specific voice information.
As shown in Fig. 2, in one embodiment, a page content retrieval system comprises a keyword acquisition module 102, a keyword segmentation module 104, a page content segmentation module 106 and a retrieval module 108, wherein:
The keyword acquisition module 102 is configured to obtain an input keyword.
When page content is browsed through a Web browser, the browser's search function can be opened by entering a preset control instruction so that key information in the page can be retrieved. Further, the keyword acquisition module 102 may receive a click instruction on the option key and the confirm key of the remote control to trigger entry into a search mode. The search mode may include a content search mode and a link search mode: the retrieval results of the content search mode are text content in the page, and the retrieval results of the link search mode are text content with links in the page.
Further, after triggering entry into the search mode, the keyword acquisition module 102 may receive a click instruction on the option key of the remote control to place the input focus in the search edit box, and the input keyword can be obtained through the search edit box.
Specifically, the keyword acquisition module 102 may receive a keyword entered with a full-keyboard remote control, a handwriting touch screen or a handwriting pad, and may also receive a keyword entered by voice. As shown in Fig. 3, in one embodiment, the keyword acquisition module 102 comprises a voice information acquisition module 112, a candidate keyword acquisition module 122 and a candidate keyword display module 132, wherein:
The voice information acquisition module 112 is configured to obtain the input voice information.
The candidate keyword acquisition module 122 is configured to obtain candidate keywords that match the input voice information.
The candidate keyword display module 132 is configured to display, when the candidate keywords include homophones, a candidate keyword option that contains the pinyin of the homophone.
After the voice input mode is activated, prompt information asking the user to speak may be displayed on the screen of the terminal running the Web browser, for example "Please start speaking". The voice information acquisition module 112 then receives the input voice information, and the candidate keyword acquisition module 122 obtains candidate keywords that match it. Specifically, the candidate keyword acquisition module 122 may convert the input voice information into text information (such as phonetic script) and then look up multiple candidate keywords that match the text information.
Further, the candidate keyword display module 132 may display the candidate keywords obtained in a drop-down list of the search edit box, and the candidate keyword selected in the drop-down list is the input keyword. In this embodiment, when the candidate keywords include homophones, the candidate keyword display module 132 displays a candidate keyword option containing the pinyin of the homophone.
In this embodiment, a candidate keyword containing the pinyin of a homophone can be selected as input, so that more page content matching the keyword can be found during retrieval. This lowers the miss rate of the retrieval results and thereby improves retrieval accuracy.
The keyword segmentation module 104 is configured to perform word segmentation on the input keyword.
Simple word segmentation may be performed on the input keyword. In one embodiment, the keyword segmentation module 104 may be configured to segment the input keyword according to the words in a preset dictionary and their corresponding priorities.
Specifically, a dictionary is set up in advance. The dictionary stores common words, and each word has a corresponding priority. When segmenting the input keyword, the keyword segmentation module 104 splits the keyword, in character order, into words in the dictionary; further, it may split the keyword into the words with high priority in the dictionary. For example, "senior account number" is segmented as "senior | account number". Further, the keyword segmentation module 104 may segment the input keyword using the earliest-occurrence principle: while the keyword is segmented in character order, once a word has been split off, the remaining characters are segmented further. For example, the input keyword "since the latter" is segmented as "since | the latter"; the segmentation "both | then | person" does not occur.
The page content segmentation module 106 is configured to perform dual word segmentation on the page content.
The page content is the content to be retrieved, and dual word segmentation is performed on it. Dual word segmentation means segmenting the page content into all possible words according to the words in the dictionary, so that adjacent words after segmentation may share overlapping characters. In one embodiment, the page content segmentation module 106 is configured to segment the text content in the page content into all possible words according to the words in the preset dictionary. For example, if the page content contains "since the latter", the result after segmentation is "[since]-[then]-[the latter]", where the symbol "]-[" indicates that adjacent words overlap.
Performing dual word segmentation on the page content allows the segmented keyword to match more page content and reduces the retrieval results that might otherwise be missed, so retrieval accuracy can be improved.
In one embodiment, the page content retrieval system may further comprise a page content screening module (not shown), configured to select from the page content the text content that contains key characters of the input keyword. Specifically, the page content screening module may further obtain the key characters in the keyword and select the text content in the page content that contains key characters, which may specifically be the sentences or paragraphs that contain key characters.
In this embodiment, the page content segmentation module 106 is configured to perform dual word segmentation on the selected text content. Because the text content containing key characters of the keyword is selected first and only then dual-segmented, less data undergoes dual word segmentation than when the entire page content is segmented, and the scope of the subsequent retrieval is also narrowed, so processing performance and efficiency can be improved.
The retrieval module 108 is configured to retrieve, according to the segmented keyword, in the page content that has undergone dual word segmentation, to obtain the page content that matches the keyword.
Specifically, the page content matching the keyword that the retrieval module 108 obtains is the page content (that is, text content) whose dual word segmentation contains the segmented keyword. For example, if the input keyword is segmented as "senior | account number", then page content whose dual word segmentation is "[senior]-[rank] | [account number]" matches the input keyword.
In one embodiment, as shown in Fig. 4, the page content retrieval system may further comprise a display module 110, configured to mark the obtained page content in the page according to the order in which it appears in the page.
Specifically, the display module 110 may identify the retrieved page content in the page with an underline and highlight the input keyword contained in the page content. The page content that matches the keyword includes both plain-text page content and page content with links.
When multiple pieces of page content match the keyword, the display module 110 may also mark them with sequence numbers in the page according to the order in which they appear in the page. For example, the sequence-number marking in the page is "[#1] senior account number ... [#2] high-level account number ...".
In one embodiment, the page content retrieval system may further comprise a voice control module (not shown), configured to obtain input voice information, obtain a control instruction corresponding to the voice information, and control the retrieval of the page content and the page content according to the control instruction.
Specifically, after a click instruction on the MIC key or the confirm key of the remote control is received, the voice control module may activate a voice navigation mode, obtain the input voice information, and obtain the control instruction corresponding to the input voice information according to a pre-stored correspondence between voice information and control instructions.
In one embodiment, the control instruction may be an instruction for selecting among the candidate keywords displayed in the drop-down list of the search edit box. For example, if the voice information obtained by the voice control module is "up" or "down", an instruction for moving the selection up or down among the candidate keywords in the drop-down list is obtained; if the voice information obtained is "OK", the selected candidate keyword is confirmed as the input.
In another embodiment, the control instruction may also be an instruction for switching among the retrieved page content. After the page content matching the keyword has been retrieved, it is marked in the page, and the first retrieved piece of page content in the page can be navigated to. When the control instruction corresponding to the voice information is a switching instruction, the voice control module can switch among the multiple retrieved pieces of page content. For example, if the voice information obtained is "next" or "back", the next or the previous retrieved piece of page content is selected.
In another embodiment, the control instruction may also be an instruction for moving the page up or down. For example, when the voice information obtained is "up" or "down", the page is moved up or down.
Controlling the retrieval of the page content, or the page content itself, through the voice navigation mode allows page content retrieval to be navigated entirely by voice, without frequent switching between voice input and manual input (input with the remote control), so convenience of operation can be improved.
A person of ordinary skill in the art will understand that all or part of the processes in the methods of the above embodiments can be implemented by a computer program instructing the relevant hardware. The program can be stored in a computer-readable storage medium and, when executed, may include the processes of the embodiments of the above methods. The storage medium may be a magnetic disk, an optical disc, a read-only memory (Read-Only Memory, ROM), a random access memory (Random Access Memory, RAM), or the like.
The above embodiments express only several implementations of the present invention. Their description is relatively specific and detailed, but it should not therefore be construed as limiting the scope of the patent. It should be noted that a person of ordinary skill in the art can make several variations and improvements without departing from the concept of the present invention, all of which fall within the protection scope of the present invention. The protection scope of this patent shall therefore be determined by the appended claims.

Claims (12)

1. a content of pages search method, comprises the following steps:
Obtain the keyword of input;
Described keyword is carried out to participle;
Content of pages is carried out to dual participle;
According to the keyword after described participle, in the described content of pages carrying out after dual participle, retrieve, obtain the content of pages mating with described keyword.
2. content of pages search method according to claim 1, is characterized in that, described in obtain the keyword of input step comprise:
Obtain the voice messaging of input;
Obtain the candidate keywords of mating with described voice messaging;
When described candidate keywords includes unisonance word, show the candidate keywords option of the phonetic that comprises described unisonance word.
3. content of pages search method according to claim 1, is characterized in that, the described step that content of pages is carried out to dual participle is:
According to the vocabulary in the dictionary of prediction, by the content of text participle in described content of pages, be all possible word.
4. content of pages search method according to claim 1, is characterized in that, before described step of content of pages being carried out to dual participle, also comprises:
Filter out the content of text that comprises the key word in described keyword in content of pages;
The described step that content of pages is carried out to dual participle is: the described content of text filtering out is carried out to dual participle.
5. content of pages search method according to claim 1, is characterized in that, after the described step that obtains the content of pages that mates with keyword, also comprises:
Appearance order according to the described content of pages obtaining in the page is carried out mark to the described content of pages obtaining in the page.
6. content of pages search method according to claim 1, is characterized in that, described method also comprises:
Obtain the voice messaging of input;
Obtain the steering order corresponding with described voice messaging;
According to described steering order, the retrieval of content of pages and content of pages are controlled.
7. A page content retrieval system, characterized by comprising:
A keyword acquisition module, configured to obtain an input keyword;
A keyword segmentation module, configured to perform word segmentation on the keyword;
A page content segmentation module, configured to perform dual word segmentation on page content;
A retrieval module, configured to search the page content after the dual word segmentation according to the segmented keyword, and to obtain the page content that matches the keyword.
8. The page content retrieval system according to claim 7, characterized in that the keyword acquisition module comprises:
A voice information acquisition module, configured to obtain input voice information;
A candidate keyword acquisition module, configured to obtain candidate keywords that match the voice information;
A candidate keyword display module, configured to display, when the candidate keywords include homophones, candidate keyword options that include the pinyin of the homophones.
9. The page content retrieval system according to claim 7, characterized in that the page content segmentation module is configured to segment the text content of the page content into all possible words according to the words in a preset dictionary.
10. The page content retrieval system according to claim 7, characterized in that the system further comprises: a page content screening module, configured to select, from the page content, the text content that contains key words of the keyword;
The page content segmentation module is further configured to perform dual word segmentation on the selected text content.
11. The page content retrieval system according to claim 7, characterized in that the system further comprises:
A display module, configured to mark the obtained page content in the page according to the order in which it appears in the page.
12. The page content retrieval system according to claim 7, characterized in that the system further comprises:
A voice control module, configured to obtain input voice information, obtain a control instruction corresponding to the voice information, and control the retrieval of page content and the page content according to the control instruction.
CN201210299109.0A 2012-08-21 2012-08-21 Page content retrieval method and system Active CN103631784B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210299109.0A CN103631784B (en) 2012-08-21 2012-08-21 Page content retrieval method and system

Publications (2)

Publication Number Publication Date
CN103631784A (en) 2014-03-12
CN103631784B (en) 2018-07-20

Family

ID=50212859

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210299109.0A Active CN103631784B (en) 2012-08-21 2012-08-21 Page content retrieval method and system

Country Status (1)

Country Link
CN (1) CN103631784B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104462425A (en) * 2014-12-12 2015-03-25 百度在线网络技术(北京)有限公司 Method and device for displaying search suggestion
CN104537088A (en) * 2014-12-31 2015-04-22 百度在线网络技术(北京)有限公司 Information showing method and device
WO2016165566A1 (en) * 2015-04-13 2016-10-20 腾讯科技(深圳)有限公司 Barrage posting method and mobile terminal
CN108563676A (en) * 2018-03-03 2018-09-21 贵州省气象信息中心 A kind of integrated searching system of meteorological data
CN110782886A (en) * 2018-07-30 2020-02-11 阿里巴巴集团控股有限公司 System, method, television, device and medium for speech processing
CN111460272A (en) * 2019-01-22 2020-07-28 北京国双科技有限公司 Text page sequencing method and related equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1152749A (en) * 1996-01-30 1997-06-25 陈肇雄 Fully automatic system for separating Chinese words from sentences
CN1281191A (en) * 1999-07-19 2001-01-24 松下电器产业株式会社 Information retrieval method and information retrieval device
US20030200211A1 (en) * 1999-02-09 2003-10-23 Katsumi Tada Document retrieval method and document retrieval system
CN101021851A (en) * 2006-02-14 2007-08-22 富士施乐株式会社 Text search device, text search method, recording medium for recording text search program
CN101149758A (en) * 2007-10-18 2008-03-26 中兴通讯股份有限公司 Searching system and searching method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
戴维民: "信息组织", 《高等教育出版社》 *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104462425A (en) * 2014-12-12 2015-03-25 百度在线网络技术(北京)有限公司 Method and device for displaying search suggestion
WO2016090960A1 (en) * 2014-12-12 2016-06-16 百度在线网络技术(北京)有限公司 Method and device for displaying search suggestion
CN104462425B (en) * 2014-12-12 2018-09-07 百度在线网络技术(北京)有限公司 Method and device for displaying search suggestion
CN104537088A (en) * 2014-12-31 2015-04-22 百度在线网络技术(北京)有限公司 Information showing method and device
CN104537088B (en) * 2014-12-31 2018-01-30 百度在线网络技术(北京)有限公司 information display method and device
WO2016165566A1 (en) * 2015-04-13 2016-10-20 腾讯科技(深圳)有限公司 Barrage posting method and mobile terminal
US10491949B2 (en) 2015-04-13 2019-11-26 Tencent Technology (Shenzhen) Company Limited Bullet screen posting method and mobile terminal
CN108563676A (en) * 2018-03-03 2018-09-21 贵州省气象信息中心 A kind of integrated searching system of meteorological data
CN108563676B (en) * 2018-03-03 2021-10-01 贵州省气象信息中心 Integrated retrieval system of meteorological data
CN110782886A (en) * 2018-07-30 2020-02-11 阿里巴巴集团控股有限公司 System, method, television, device and medium for speech processing
CN111460272A (en) * 2019-01-22 2020-07-28 北京国双科技有限公司 Text page sequencing method and related equipment
CN111460272B (en) * 2019-01-22 2024-02-13 北京国双科技有限公司 Text page ordering method and related equipment

Also Published As

Publication number Publication date
CN103631784B (en) 2018-07-20

Similar Documents

Publication Publication Date Title
US10156981B2 (en) User-centric soft keyboard predictive technologies
EP3288024B1 (en) Method and apparatus for executing a user function using voice recognition
US9026428B2 (en) Text/character input system, such as for use with touch screens on mobile phones
KR101122869B1 (en) Annotation management in a pen-based computing system
CN101369216B (en) Words input method and system
CN101576783B (en) User interface, equipment and method for hand input
CN103631784A (en) Page content retrieval method and system
US20100169098A1 (en) System and method of a list commands utility for a speech recognition command system
US20130061139A1 (en) Server-based spell checking on a user device
JP2014102669A (en) Information processor, information processing method and program
US20130060560A1 (en) Server-based spell checking
US20190377779A1 (en) Device, System and Method for Displaying Sectioned Documents
CN101561725B (en) Method and system of fast handwriting input
CN104808806A (en) Chinese character input method and device in accordance with uncertain information
US20140089841A1 (en) Device and method for providing application interface based on writing input
CN102314412A (en) Method and system for recording contextual information and tracing new word context
CN103488752A (en) POI (point of interest) searching method
CN105373236B (en) Word learning method and device
CN105786803A (en) translation method and translation device
KR20140014510A (en) Editing method of text generatied by a speech recognition and terminal thereof
KR20150083961A (en) The method for searching integrated multilingual consonant pattern, for generating a character input unit to input consonants and apparatus thereof
CN103941979A (en) Method and device for inputting characters into mobile device
CN104125334A (en) Information processing method and electronic equipment
CN112558784A (en) Method and device for inputting characters and electronic equipment
CN104731766A (en) Alphabetic writing lexicon establishing method, alphabetic writing lexicon establishing device, inputting method and inputting system

Legal Events

Date Code Title Description
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant