CN101131636A - On-line voice or Pinyin input method - Google Patents

On-line voice or Pinyin input method Download PDF

Info

Publication number
CN101131636A
CN101131636A CNA200610200808XA CN200610200808A CN101131636A CN 101131636 A CN101131636 A CN 101131636A CN A200610200808X A CNA200610200808X A CN A200610200808XA CN 200610200808 A CN200610200808 A CN 200610200808A CN 101131636 A CN101131636 A CN 101131636A
Authority
CN
China
Prior art keywords
voice
literal
phonetic
pinyin
import
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA200610200808XA
Other languages
Chinese (zh)
Inventor
李颖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CNA200610200808XA priority Critical patent/CN101131636A/en
Publication of CN101131636A publication Critical patent/CN101131636A/en
Pending legal-status Critical Current

Links

Landscapes

  • Machine Translation (AREA)

Abstract

This invention is a new method in the information automatic field. In order to invert 'the voice (or Pinyin) method of import' into the affirmation for the minority relating voice literal this method makes the present the voice (or Pinyin) method of import allocated a kind of software with all of relating voice word combination basic unit sequencing in proper form updating sequence and then filter the sequenced result with the selection big priority installation through on line search literal segment source on web after that screen all the related voice web or non-overlapping related literal not only agreeing the import voice (or Pinyin) related voice arrangement nut also anatomizing the presented related voice segment on web. The imported voice segment hipping is stronger the successful possibility is larger for integral screen; the import word numbers of segment are more the screen result segments are less so either of them can not be short of. This invention improves the import efficiency for the voice (or Pinyin); and it also provides the search path for the web voice (or Pinyin).

Description

Online voice (or phonetic) input method
Technical field:
The IT robotization
Background technology:
Have only a spot of phrase phonetically similar word can give tacit consent to (i.e. so-called group word association) in existing voice (or phonetic) input method according to set dictionary in the computer unit input method software.And a large amount of same (closely) sound words relies in artificial affirmation, wastes time and energy.
Purpose benefit: dote on big diction storehouse, phrase and include drawbacks such as variation complete and that the new term that is difficult to follow up grows with each passing day for having overcome in the existing input method independent the setting; The phonetically similar word acquiescence is expanded to all medelling paragraph scopes of voice (or phonetic) input.Significantly improve voice (or phonetic) input efficiency; For online browsing provides voice (or phonetic) search approach.
Summary of the invention:
All relate to the in good time in due form tactic software of sound phrase elementary cell when importing with regard to voice (or phonetic) for existing " voice (or phonetic) input method " configuration is a, make the corresponding tangible transitional information carrier that relates to the sound literal that forms multiple series arrangement content of a kind of invisible paragraph of phonetic entry, and press big preferential the setting with regard to these transitional information carriers and progressively filter by the online literal fragment of on-line search resource, filter out minimum online existing Webpage with (closely) sound literal paragraph or relate to sound literal webpage and website for selecting, make to relate to sound literal transitional information carrier and the in esse sound paragraph formation corresponding expression that relates on the net with (closely) sound word or browsing page.Thereby input method is converted into the affirmation (should re-enter for the suffix portion that may not be identified) that minority is related to the sound literal to the identification of voice.
Described " medelling paragraph " is meant " the habitual use paragraph of people ".For example, the Chinese idiom of Chinese, popular word, onomasticon; The fixedly word of English, phrase, format term etc.In general, the paragraph medelling of input is strong more, and the whole section successful possibility of screening is big more; The paragraph number of words of input is many more, and the The selection result fragment is few more, and both are indispensable.
Described " relating to sound phrase elementary cell " is meant " the minimum composition that can directly collude into phrase of the fixedly written form that definite number arranged corresponding with fixing pronunciation ".For example, the series of Chinese is with (closely) sound Chinese character; Same (closely) sound word-building of the series of English etc.
Described " near sound word " is meant " causing the difference pronunciation phrase elementary cell that man-machine erroneous judgement is disconnected easily ".For example, initial consonant is the identical simple or compound vowel of a Chinese syllable Chinese character of L and N; The different English word-building that comprises voiceless consonant.They are together handled as relating to sound phrase elementary cell in arrangement software, be beneficial to affirmation.
Embodiment:
Online voice (or phonetic) input method has two basic purposes, and (one) support voice (or phonetic) input method is dwindled the unisonance literal greatly and selected scope, and (two) realize internet voice (or phonetic) primary election search.Embodiment is distinguished to some extent.
1, first kind of purposes requires to arrange software all relates to sound phrase elementary cell and are arranged as and comprise ad initio all series arrangement more than two sound unit numbers with regard to it; Second kind of purposes requires to arrange software, and all relate to all series arrangement that sound phrase elementary cell is arranged as the first figure place of all sounds with regard to it.
2, first kind of purposes requirement rank results presses greatly preferentially to be provided with as whole sentence and all participates in automatic on-line search step by step, and a dust has Search Results at the same level just to stop search; Second kind of purposes requires rank results all to participate in the automatic on-line search as whole sentence is disposable.
3, first kind of purposes requires search to point to be (comprise institute's Sorted list literal webpage and website) separately not overlapping institute Sorted list literal; It is all webpage and websites that comprise institute's Sorted list literal that second kind of purposes requires search to point to.
Certainly require the text search mode of existing search engine to do the adaptability change, make it can once hold a plurality of whole sentence search, the repeatedly search of independently having no result; Can search point to each search statement self, result's not overlapping (being used for the input of voice phonetic) or can search point to all website and webpage (being used for voice phonetic searches for) that comprise search statement on the net.
Described " the repeatedly search of independently having no result " is meant " this search end obtains the result and continues the next stage search automatically ", just stops the next stage search in case obtain Search Results.
4, first kind of purposes acquisition Search Results is to be no less than the not overlapping institute of a minority at the same level Sorted list literal; It is many institutes Sorted list literal webpage and website that second kind of purposes obtains Search Results.
5, first kind of purposes manually confirmed between acquiescence or to possible some Search Results automatically with regard to possible unique Search Results again; Second kind of Search Results that purposes obtained directly provides and browses.
Enumerate explanation: to desire voice (or phonetic) input Chinese " huang jing yan zhong e hua " is example, according to convention, this is a comparatively medelling paragraph, but under existing voice (or phonetic) input method condition, this paragraph can only be confirmed one by one, not only bothered but also make mistakes easily in the segmentation input.But if can whole confirm, then the ambiguity unisonance literal probability that accompanies becomes zero on the contrary.For this reason, existing voice (or phonetic) input method is embedded such program, will be in " huang jingyan zhong e hua " paragraph all relate to sound phrase elementary cell " also, yellow, ring ... ", " advance, border, gold, tight ... ", " cigarette, salt, drill, test, sternly ... ", " heavy, swollen ... ", " dislike, strategic point ... ", " change, stroke ... ".(1) be used for the input of voice phonetic the time be arranged as comprise all sequential series more than two bit locations " the sternly swollen strategic pointization in border also; Huang advances salt heavily to be worsened; the environment severe exacerbation; ... ", " the swollen strategic point of gold salt; environment is seriously disliked; also tight cigarette is heavily disliked; ... ", " also the border is tested swollen; Huang advances to drill heavily; environment is serious ... ", " gold cigarette; ring advances salt; environment is tight; Huang is tightly tested ... ", " going back the border; gold; ring is tight; environment ... ", and all rank results preferentially are provided with greatly the not overlapping institute Sorted list literal that automatic on-line is step by step independently had no result and repeatedly searched for the single webpage and website of each rank results according to getting, promptly will " go back the sternly swollen strategic pointization in border " earlier, " Huang advances salt heavily to be worsened ", " environment severe exacerbation ", ... all go up line search, if come to nothing, automatically again with " the swollen strategic point of gold salt ", " environment is seriously disliked ", " also tight cigarette is heavily disliked ", ... all go up line search, successively ..., its Search Results must be that those minimumly online existingly relate to that the sound paragraph is screened comes out and comprise consistent paragraph " environment severe exacerbation " or " environment is serious " or " environment " of actual purpose requirement with the input paragraph, one of " gold " group, here supposition " environment severe exacerbation " do not occur and unique webpage literal " environment is serious " relevant with " environment is serious " (or a plurality of webpage unisonance literal arranged side by side) occurred, just stop search and literal " environment is serious " is sent to required (a plurality of need arranged side by side are manually confirmed) automatically, thereby the last artificial minimum option of confirming to provide is provided.(2) when being used for the phonetic search webpage and website, only be arranged as whole figure place sequential series " also the sternly swollen strategic pointization in border, Huang advance that salt heavily worsens, the environment severe exacerbation ... ", and disposable whole automatic on-line is searched for all webpage and websites, its Search Results must be that those minimumly comprise that online both the webpage and website of Sorted list literal is screened to some extent comes out, as comprise many webpage and websites of " environment severe exacerbation " paragraph, thereby directly search on the realization voice network, raise the efficiency.
For a kind of method that the invention provides substantive examination is exactly an arbitrarily selected stage mode paragraph, with regard to its all phonetically similar word hierarchical sequence manual alignment, make whole sentence internet searching (promptly enclosing double quotation marks made in Great Britain " * ") one by one, its effect comes into plain view.

Claims (2)

1. " online voice (or phonetic) input method " is that a kind of utilization " relate to sound phrase elementary cell arrange software " combination " on-line search network character fragment resource " supports to have now the method that " voice (or phonetic) input method " improves suitable efficient.It is characterized in that:
(1) for embedding with regard to what all related to that sound phrase elementary cell is arranged as the series arrangement literal that comprises all series arrangement literal more than two bit locations or be arranged as the population of cells number in good time, existing " voice (or phonetic) input method " relate to sound phrase elementary cell alignment problem.
(2) rank results is pressed big preferential all webpage and websites of all participating in the online repeatedly search of independently having no result (comprise institute's Sorted list literal single webpage and website) separately not overlapping institute Sorted list literal step by step or comprising institute's Sorted list literal that are provided with as whole sentence.
2. all softwares, hardware are based on the design and the method for making of [claim 1] described feature (1), (2).It is characterized in that: can promote the practical application of " online voice (or phonetic) input method ".
CNA200610200808XA 2006-08-18 2006-08-18 On-line voice or Pinyin input method Pending CN101131636A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA200610200808XA CN101131636A (en) 2006-08-18 2006-08-18 On-line voice or Pinyin input method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA200610200808XA CN101131636A (en) 2006-08-18 2006-08-18 On-line voice or Pinyin input method

Publications (1)

Publication Number Publication Date
CN101131636A true CN101131636A (en) 2008-02-27

Family

ID=39128913

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA200610200808XA Pending CN101131636A (en) 2006-08-18 2006-08-18 On-line voice or Pinyin input method

Country Status (1)

Country Link
CN (1) CN101131636A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103366742A (en) * 2012-03-31 2013-10-23 盛乐信息技术(上海)有限公司 Voice input method and system
CN105117500A (en) * 2015-10-10 2015-12-02 成都携恩科技有限公司 Data query and acquisition method under big data background

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103366742A (en) * 2012-03-31 2013-10-23 盛乐信息技术(上海)有限公司 Voice input method and system
CN103366742B (en) * 2012-03-31 2018-07-31 上海果壳电子有限公司 Pronunciation inputting method and system
CN105117500A (en) * 2015-10-10 2015-12-02 成都携恩科技有限公司 Data query and acquisition method under big data background
CN105117500B (en) * 2015-10-10 2018-07-06 成都携恩科技有限公司 A kind of data query acquisition methods under big data background

Similar Documents

Publication Publication Date Title
Zaidan et al. Arabic dialect identification
Cerruti Regional varieties of Italian in the linguistic repertoire
Mandera et al. Subtlex-pl: subtitle-based word frequency estimates for Polish
Handel What is Sino‐Tibetan? Snapshot of a field and a language family in flux
CN101308492A (en) Information processing apparatus, informaton processing method, program, and recording medium
Bjarnadóttir The database of modern Icelandic inflection (Beygingarlýsing íslensks nútímamáls)
CN103186509B (en) The extensive method and apparatus of asterisk wildcard class template, the extensive method and system of common template
Getman et al. Overview of Linguistic Resources for the TAC KBP 2017 Evaluations: Methodologies and Results.
CN101464856A (en) Alignment method and apparatus for parallel spoken language materials
Hundt et al. The use of the be-passive in academic Englishes: Local versus global usage in an international language
Beider Reapplying the language tree model to the history of Yiddish
Hundt et al. Corpus-based approaches to World Englishes
Collins et al. Grammatical change in the verb phrase in Australian English: A corpus-based study
Auer Reflections on linguistic pluricentricity
Berg The cohesiveness of English and German compounds
Dayter Collocations in non-interpreted and simultaneously interpreted English: a corpus study
CN101131636A (en) On-line voice or Pinyin input method
De Felice et al. CLaSSES: A new digital resource for Latin epigraphy
Duszkin et al. New parallel corpora of Baltic and Slavic languages—Assumptions of corpus construction
Podhorná-Polická RapCor, Francophone Rap Songs Text Corpus.
Vandeweerd et al. J’ai l’impression que: Lexical Bundles in the Dialogues of Beginner French Textbooks
de Souza et al. Development of a brazilian portuguese hotel’s reviews corpus
Chelliah et al. 2.5. Contact and convergence in the Northeast
Sánchez et al. Ordinal analysis of lexical patterns
Dheskali Hedging and boosting in Albanian, British and Italian online news articles in Europe

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination