CN107729321A

CN107729321A - Method for correcting errors in speech recognition results

Info

Publication number: CN107729321A
Application number: CN201710994082.XA
Authority: CN
Inventors: 叶伟
Original assignee: Shanghai Century Network Technology Co Ltd
Current assignee: Shanghai Century Network Technology Co., Ltd.
Priority date: 2017-10-23
Filing date: 2017-10-23
Publication date: 2018-02-23

Abstract

A method for correcting errors in speech recognition results, comprising: preprocessing the speech recognition result; finding the error-prone words in the speech recognition result, or the important words and characters to be corrected obtained by parsing the text semantics; annotating the words and characters to be corrected with pinyin in two modes, the full pinyin spelling and the first letter of each syllable, to obtain the toneless pinyin corresponding to the recognition result to be corrected; determining a best candidate text and a second-best candidate text from the full pinyin spelling using an edit distance algorithm; determining a best candidate text and a second-best candidate text from the pinyin first letters using the edit distance algorithm; merging all best and second-best candidate texts, keeping only one copy of any repeated candidate; and substituting each quasi-candidate text for the text to be corrected, computing the sentence probability after each substitution with an n-gram language model, and choosing the substitution with the highest probability as the final corrected speech recognition result.

Description

Method for correcting errors in speech recognition results

Technical field

The invention belongs to the field of artificial intelligence, and more particularly relates to a method for correcting errors in speech recognition results.

Background art

As speech recognition technology matures, voice interaction is being used ever more widely. Compared with other interaction modes, voice interaction better matches people's daily habits and is also more efficient. At present, voice interaction is widely used in fields such as smart home, industrial control, and assisted driving.

In practical applications, because of factors such as ambient noise and dialect, the speech recognition result produced during voice interaction is often inconsistent with what the user actually said. The error rate is especially high in everyday spoken-language scenarios. The prior art concentrates on improving recognition accuracy and lacks a means of correcting recognition errors, which hinders the wider adoption of speech recognition technology.

Summary of the invention

The invention provides a method for correcting errors in speech recognition results, so that the text produced by speech recognition can be corrected accurately.

A method for correcting errors in speech recognition results comprises the following steps:

S11, preprocessing the speech recognition result;

S12, finding the error-prone words in the speech recognition result, or the important words and characters to be corrected obtained by parsing the text semantics;

S13, annotating the words and characters to be corrected with pinyin in two modes, the full pinyin spelling and the first letter of each syllable, to obtain the pinyin corresponding to the recognition result to be corrected, the pinyin being written without tones;

S14, determining a best candidate text and a second-best candidate text from the full pinyin spelling using an edit distance algorithm;

S15, determining a best candidate text and a second-best candidate text from the pinyin first letters, again using the edit distance algorithm;

S16, merging all best candidate texts and second-best candidate texts, keeping only one copy of any repeated candidate;

S17, substituting each quasi-candidate text for the text to be corrected, computing the sentence probability after each substitution with an n-gram language model, and choosing the substitution with the highest probability as the final corrected speech recognition result.

The preprocessing in step S11 includes text operations such as word segmentation, part-of-speech tagging, stop-word removal, and syntactic analysis.

The invention performs word segmentation, part-of-speech tagging, stop-word removal, and syntactic analysis on the speech recognition result. Verb-object phrases, verbs, nouns, and words that do not appear in the dictionary are taken from the result as the text to be corrected, while the order of each word in the original speech text is preserved. The text to be corrected is segmented, and the pinyin corresponding to each segment is obtained. Candidate words are retrieved from the dictionary according to the pinyin of each segment, and a best candidate word is determined among them. It is then judged whether the best candidate word meets a preset condition; if it does, the original word to be corrected is replaced with the best candidate word. All correction results are merged to give the final corrected speech recognition result.
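The selection of text to be corrected described above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function name `select_words_to_correct`, the part-of-speech tags `"v"`, `"n"`, and `"vo"`, and the toy English tokens are all assumptions made for the example. The input is assumed to be already segmented and tagged by some external tool.

```python
def select_words_to_correct(tagged_tokens, dictionary, target_tags=("v", "n", "vo")):
    """Pick the tokens to be corrected, keeping their original positions.

    tagged_tokens: list of (word, pos_tag) pairs in sentence order.
    dictionary:    set of known words; anything outside it is also selected.
    target_tags:   hypothetical tag names for verbs, nouns, verb-object phrases.
    """
    selected = []
    for index, (word, tag) in enumerate(tagged_tokens):
        # A token qualifies if it has a targeted part of speech
        # or if it does not appear in the dictionary at all.
        if tag in target_tags or word not in dictionary:
            selected.append((index, word))
    return selected


# Toy example with hypothetical tags ("r" pronoun, "u" particle, "x" unknown).
tagged = [("I", "r"), ("watch", "v"), ("the", "u"), ("dramma", "x")]
known_words = {"I", "watch", "the", "drama"}
to_correct = select_words_to_correct(tagged, known_words)
```

Returning the index with each word is one way to preserve the original word order, so that a later replacement can be written back to the right position.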

Brief description of the drawings

The above and other objects, features, and advantages of exemplary embodiments of the invention will become readily understood by reading the following detailed description with reference to the accompanying drawings. In the drawings, several embodiments of the invention are shown by way of example and not by way of limitation, wherein:

Fig. 1 is a schematic flowchart of the method for correcting errors in speech recognition results in an embodiment of the invention.

Detailed description of embodiments

Referring to Fig. 1, the method for the present embodiment includes:

S11: Perform text operations on the speech recognition result, such as word segmentation, part-of-speech tagging, stop-word removal, and syntactic analysis.

S12: Using existing or future techniques, find the error-prone words, or the important words and characters to be corrected obtained by parsing the text semantics. Pay particular attention to verb-object phrases, verbs, nouns, and words in the recognition result that do not appear in the dictionary.

S13: Annotate the words and characters to be corrected with pinyin, obtaining the pinyin corresponding to the recognition result to be corrected; the pinyin is written without tones.

This step covers several cases, described in detail below:

Homophone errors (a wrong character with the same pronunciation): use the full pinyin spelling:

For example, if the recognition result to be corrected is "seeing that three sound three are", the corresponding pinyin after word segmentation is: kan san sheng san shi

Non-standard pronunciation: use the first letter of each syllable:

For example, if the recognition result to be corrected is "seeing that Shan Shan mountains are", the corresponding pinyin after word segmentation is: kan shan shan shan shi, from which the first letter of each syllable can be taken: k s s s s
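The two annotation modes can be illustrated with a short sketch. It assumes the pinyin syllables have already been produced by some character-to-pinyin tool and only shows how tones are dropped and first letters taken; the helper names `strip_tone`, `full_spelling`, and `initials` are hypothetical, as is the choice to write "ü" as "v".

```python
import unicodedata


def strip_tone(syllable: str) -> str:
    """Return a pinyin syllable without tone marks or trailing tone digits."""
    # Decompose accented vowels into base letter + combining marks.
    decomposed = unicodedata.normalize("NFD", syllable.lower())
    # Assumption: keep "ü" distinct from "u" by writing it as "v".
    decomposed = decomposed.replace("u\u0308", "v")
    # Drop the remaining combining marks, which carry the tone.
    plain = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Drop numeric tone suffixes such as the "4" in "kan4".
    return plain.rstrip("012345")


def full_spelling(syllables):
    """Toneless full spelling of each syllable (homophone case)."""
    return [strip_tone(s) for s in syllables]


def initials(syllables):
    """First letter of each toneless syllable (non-standard pronunciation case)."""
    return [s[0] for s in full_spelling(syllables) if s]


spelling = full_spelling(["kàn", "sān", "shēng", "sān", "shì"])
letters = initials(["kan", "shan", "shan", "shan", "shi"])
```

With the two examples from the text, `spelling` holds the syllables kan san sheng san shi and `letters` holds k s s s s. Note that the first letter is taken literally, so "shan" contributes "s" rather than the initial consonant "sh".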

S14: First, determine a best candidate text and a second-best candidate text from the full pinyin spelling, using an edit distance algorithm.

S15: Next, determine a best candidate text and a second-best candidate text from the pinyin first letters, again using the edit distance algorithm.
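Steps S14 and S15 can be sketched as a ranking of known phrases by edit distance to the pinyin key of the text to be corrected. The patent does not state which edit distance variant or candidate source is used, so this sketch assumes the standard Levenshtein distance and a hypothetical `lexicon` that maps each candidate text to its pinyin key (full spelling for S14, first letters for S15). The candidate names are placeholders.

```python
def edit_distance(a, b):
    """Levenshtein distance: insertions, deletions, substitutions cost 1 each."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,             # delete x
                curr[j - 1] + 1,         # insert y
                prev[j - 1] + (x != y),  # substitute, or match at no cost
            ))
        prev = curr
    return prev[-1]


def top_two(query_key, lexicon):
    """Return the best and second-best candidate texts for a pinyin key.

    lexicon: hypothetical mapping of candidate text -> pinyin key.
    Ties are broken alphabetically so the result is deterministic.
    """
    ranked = sorted(
        lexicon,
        key=lambda text: (edit_distance(query_key, lexicon[text]), text),
    )
    return ranked[:2]


# Placeholder candidates keyed by toneless full spelling (as in S14).
spelling_lexicon = {
    "candidate_A": "kansanshengsanshi",
    "candidate_B": "kanshanshengsanshi",
    "candidate_C": "kandianshi",
}
by_spelling = top_two("kansanshengsanshi", spelling_lexicon)
```

The same `top_two` function can be reused for S15 by passing a lexicon keyed on first letters, for example a key such as "kssss".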

S16: Merge all best candidate texts and second-best candidate texts, keeping only one copy of any repeated candidate; the merged items are collectively called quasi-candidate texts.
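A minimal sketch of the merge in S16 follows, assuming each earlier step returns its candidates as a list ordered from best to second best. The function name `merge_candidates` and the placeholder candidate names are hypothetical.

```python
def merge_candidates(*candidate_lists):
    """Merge candidate lists, keeping the first occurrence of each text."""
    seen = set()
    merged = []
    for candidates in candidate_lists:
        for text in candidates:
            if text not in seen:  # a repeated candidate is kept only once
                seen.add(text)
                merged.append(text)
    return merged


# Candidates from the full-spelling pass and the first-letter pass.
quasi_candidates = merge_candidates(
    ["candidate_A", "candidate_B"],
    ["candidate_B", "candidate_C"],
)
```

Keeping insertion order is a design choice of this sketch; the final selection in S17 depends only on the language model score, so any stable order works.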

S17: Substitute each quasi-candidate text for the text to be corrected, compute the sentence probability after each substitution with an n-gram language model, and choose the substitution with the highest probability as the final corrected speech recognition result.
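The scoring in S17 can be sketched with a small bigram model. The patent does not specify the order of the n-gram model, its training data, or its smoothing, so the bigram order, the add-one smoothing, the class name `BigramModel`, and the toy English corpus below are all assumptions made for illustration.

```python
import math
from collections import Counter


class BigramModel:
    """Toy bigram language model with add-one (Laplace) smoothing."""

    def __init__(self, sentences):
        self.context_counts = Counter()
        self.bigram_counts = Counter()
        self.vocab = {"</s>"}
        for tokens in sentences:
            padded = ["<s>"] + list(tokens) + ["</s>"]
            self.vocab.update(tokens)
            self.context_counts.update(padded[:-1])
            self.bigram_counts.update(zip(padded, padded[1:]))

    def log_prob(self, tokens):
        """Log probability of a sentence; unseen bigrams get smoothed mass."""
        padded = ["<s>"] + list(tokens) + ["</s>"]
        size = len(self.vocab) + 1  # one extra slot for unknown words
        return sum(
            math.log((self.bigram_counts[(a, b)] + 1)
                     / (self.context_counts[a] + size))
            for a, b in zip(padded, padded[1:])
        )


def pick_best(tokens, position, candidates, model):
    """Try each candidate at `position` and keep the most probable sentence."""
    def replaced(candidate):
        return tokens[:position] + [candidate] + tokens[position + 1:]
    return max(candidates, key=lambda c: model.log_prob(replaced(c)))


corpus = [["watch", "drama"], ["watch", "drama"], ["watch", "movie"]]
model = BigramModel(corpus)
best = pick_best(["watch", "dramma"], 1, ["drama", "movie", "dram"], model)
```

On this toy corpus the sentence "watch drama" scores highest, so `best` is "drama". Working in log probabilities avoids numeric underflow on longer sentences and does not change which candidate wins.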

It should be noted that although the foregoing describes the spirit and principles of the invention with reference to several embodiments, the invention is not limited to the disclosed embodiments, nor does the division into aspects imply that features of those aspects cannot be combined; the division is merely for convenience of presentation. The invention is intended to cover the various modifications and equivalent arrangements included within the spirit and scope of the appended claims.

Claims

1. A method for correcting errors in speech recognition results, characterized by comprising the following steps:

S11, preprocessing the speech recognition result;

S12, finding the error-prone words in the speech recognition result, or the important words and characters to be corrected obtained by parsing the text semantics;

S13, annotating the words and characters to be corrected with pinyin in two modes, the full pinyin spelling and the first letter of each syllable, to obtain the pinyin corresponding to the recognition result to be corrected, the pinyin being written without tones;

S14, determining a best candidate text and a second-best candidate text from the full pinyin spelling using an edit distance algorithm;

S15, determining a best candidate text and a second-best candidate text from the pinyin first letters, again using the edit distance algorithm;

S17, substituting each quasi-candidate text for the text to be corrected, computing the sentence probability after each substitution with an n-gram language model, and choosing the substitution with the highest probability as the final corrected speech recognition result.

2. The method for correcting errors in speech recognition results according to claim 1, characterized in that the preprocessing in step S11 includes the text operations of word segmentation, part-of-speech tagging, stop-word removal, and syntactic analysis.