CN108091325A

CN108091325A - A kind of speech recognition system and method based on surname

Info

Publication number: CN108091325A
Application number: CN201711440127.5A
Authority: CN
Inventors: 徐东群; 庄永军
Original assignee: Shenzhen Sanbao Innovation And Intelligence Co Ltd
Current assignee: Shenzhen Sanbao Innovation And Intelligence Co Ltd
Priority date: 2017-12-27
Filing date: 2017-12-27
Publication date: 2018-05-29

Abstract

The invention discloses a kind of speech recognition system and method based on surname, including main control unit, voice acquisition unit, Audio Processing Unit, surname model unit and pre-training unit：The main control unit connects voice acquisition unit, Audio Processing Unit, surname model unit and pre-training unit respectively, the present invention is not only simple and convenient, effectively realize the identification of the quick surname Chinese character of high-accuracy, simultaneously confusing a series of processing mode may be designed for surname is a variety of, the performance and discrimination of speech recognition are improved, alleviates system-computed burden.

Description

A kind of speech recognition system and method based on surname

Technical field

The present invention relates to technical field of voice recognition more particularly to a kind of speech recognition systems and method based on surname.

Background technology

Language is that the mankind mutually exchange the most frequently used, most effective, most important and most convenient communication form, and voice is language Acoustics shows, and it is the dream of the mankind all the time to carry out speech exchange with machine.With the rapid development of computer technology, voice The achievement of identification technology also making a breakthrough property, the dream that people is engaged in the dialogue with machine with natural language is progressively close to realizing.Voice The application range of identification technology is extremely wide, is not only related to the every aspect of daily life, and pole is also played in military field Its important role.It is information-intensive society towards intelligent and automation development key technology, makes processing of the people to information It is more convenient with obtaining, so as to improve the work efficiency of people.

The content of the invention

It is an object of the invention to provide a kind of speech recognition system and method based on surname, to solve above-mentioned background skill The problem of being proposed in art.

To achieve the above object, the present invention provides following technical solution：

A kind of speech recognition system based on surname, including main control unit, voice acquisition unit, Audio Processing Unit, surname Model unit and pre-training unit：The main control unit connects voice acquisition unit, Audio Processing Unit, surname model respectively Unit and pre-training unit.

Further technical solution as the present invention：The voice acquisition unit is M6027 microphones.

Further technical solution as the present invention：The Audio Processing Unit be divided into acoustic model characteristic extracting module, Language model characteristic extracting module obscures processing module and list Shuan Xingshi processing modules.

A kind of audio recognition method based on surname, comprises the steps of：

A, user inputs a string of voices；

B, system obtains voice signal, carries out feature extraction to voice by acoustic model, pronunciation sequence is drawn in acoustic model Afterwards, the character string sequence of maximum probability is found out from candidate character sequence using language model；

C, the Chinese character for representing surname is extracted from chinese character string sequence, by the whole Chinese characters and its spelling book of collection, Chinese character can be converted to phonetic, if it is polyphone, only be converted to wherein some phonetic；

If D, the surname is individual character surname, its phonetic is added in into list list, is done for surname phonetic and easily obscures pronunciation Processing and the processing for searching polyphone, and result is all saved in list；

If E, the surname is double word surname, the phonetic of each word in two word surnames is individually taken out to progress and easily obscures pronunciation Processing and lookup multitone, each word can obtain a list, two lists are combined two-by-two, find out these pinyin-groups Close corresponding all surnames；

F, according to obtained phonetic list, the corresponding surname Chinese character of each phonetic can be found, is listed complete expressed by speaker It portion may surname Chinese character；

G, it is final to obtain corresponding surname Chinese character list.

Compared with prior art, the beneficial effects of the invention are as follows：The present invention is not only simple and convenient, effectively realizes high precision The identification of the quick surname Chinese character of rate, at the same for surname it is a variety of it is confusing may design a series of processing mode, improve The performance and discrimination of speech recognition alleviate system-computed burden.

Description of the drawings

Fig. 1 is a kind of structure diagram of the speech recognition system based on surname

Fig. 2 is a kind of flow chart of the audio recognition method based on surname.

Specific embodiment

Below in conjunction with the attached drawing in the embodiment of the present invention, the technical solution in the embodiment of the present invention is carried out clear, complete Site preparation describes, it is clear that described embodiment is only part of the embodiment of the present invention, instead of all the embodiments.It is based on Embodiment in the present invention, those of ordinary skill in the art are obtained every other without making creative work Embodiment belongs to the scope of protection of the invention.

Please refer to Fig.1-2, in the embodiment of the present invention, a kind of speech recognition system based on surname is obtained including voice Unit, Audio Processing Unit, surname model unit and pre-training unit.

Voice acquisition unit：The unit is responsible for adopting the order progress voice signal that user sends using M6027 microphones Sample, and user voice signal is sent to Audio Processing Unit and is handled, speech recognition is passed to, is converted to chinese character String.

Audio Processing Unit：Unit is divided into acoustic model characteristic extracting module, language model characteristic extracting module, obscures place Manage module and list Shuan Xingshi processing modules.

Wherein acoustic model characteristic extracting module is to carry out feature extraction to voice signal using acoustic model, and voice is turned The output of acoustics expression is changed to, it is the probability for belonging to some delimiter symbol to provide voice；

Language model characteristic extracting module is after acoustic model provides pronunciation sequence, and probability is found out from candidate character sequence most Big character string sequence；

Obscure there are four types of processing form in processing module, one is that flat tongue consonant and cacuminal are handled, and for the phonetic of surname, judgement is No, with z, s, either if c beginnings are started with z, s or c, continue to judge to whether there is h in phonetic, if in the presence of removing；If no In the presence of then in second position of phonetic plus h；Two are handled for pre-nasal sound and rear nasal sound, for rear nasal sound：Alphabetical g generally goes out Either whether ending is judged comprising ang, eng or ing in surname phonetic for the beginning of present phonetic, if comprising removing phonetic The g of ending, for pre-nasal sound：An, en and in are generally present in the ending of phonetic, judge to whether there is an, en in surname phonetic Or in, if in the presence of whether last position for continuing to judge phonetic is g, if it is not, then adding g in the ending of phonetic.Three be nose Sound n and lateral l processing, whether the beginning for judging surname phonetic is n, if n, then the n of beginning is changed to l.Judge opening for phonetic Whether head is l, and n is changed to if l, the then l started；Four be multitone word processing, and surname Chinese character is being converted to the process of phonetic, if The surname that user says is polyphone, and it does not read according to correct phonetic of the polyphone in surname, and system is according to 35 The correct surname pronunciation of polyphone and the list of other pronunciations, judge institute's input Pinyin whether in other pronunciations, if at other In pronunciation, then correct surname pronunciation is taken out.The list of the pronunciation of correct surname and other pronunciations of 35 polyphones is by being system All polyphones in surname are found out in the list of existing whole Chinese character and its phonetic, wherein there are 28 multitones in individual character surname Word has 7 polyphones in double word surname.

The phonetic of individual character surname wherein for Dan Xingshi, is first added in list list, for surname by single Shuan Xingshi processing modules Family name's phonetic does the processing for easily obscuring pronunciation and the processing for searching polyphone, and result is all saved in list.And for The phonetic of each word in two word surnames is individually taken out the processing for carrying out easily obscuring pronunciation and searches multitone, each word by Shuan Xingshi A list can be all obtained, two lists are combined two-by-two, these pinyin combinations is found out and corresponds to all surnames.

Surname model unit：The whole Chinese characters and its spelling book of collection, will cover Chinese character as much as possible, and with this Build surname model.

Pre-training unit：The recognition mode of the surname of training extraction in advance is carried out by the structure surname model of system constructing, Carry out the extraction of surname Chinese character.

The present invention operation principle be：Its workflow is as shown in Figure 2：

1. user inputs a string of voices.

2. system obtains voice signal, feature extraction is carried out to voice by acoustic model, pronunciation is drawn in acoustic model After sequence, the character string sequence of maximum probability is found out from candidate character sequence using language model.

3. extracting the Chinese character for representing surname from chinese character string sequence, pass through whole Chinese characters of collection and its phonetic word Chinese character can be converted to phonetic by allusion quotation, if it is polyphone, only be converted to wherein some phonetic.

4. if the surname is individual character surname, its phonetic is added in into list list, is done for surname phonetic and easily obscures hair The processing of sound and the processing for searching polyphone, and result is all saved in list.

5. if the surname is double word surname, the phonetic of each word in two word surnames is individually taken out to progress and easily obscures hair The processing of sound and lookup multitone, each word can obtain a list, two lists are combined two-by-two, find out these spellings The corresponding all surnames of sound combination.

6. according to obtained phonetic list, the corresponding surname Chinese character of each phonetic can be found, is listed expressed by speaker All may surname Chinese character.

7. final obtain corresponding surname Chinese character list.

Claims

1. a kind of speech recognition system based on surname, including main control unit, voice acquisition unit, Audio Processing Unit, surname Family name's model unit and pre-training unit：It is characterized in that, the main control unit connects voice acquisition unit, speech processes respectively Unit, surname model unit and pre-training unit.

2. a kind of speech recognition system based on surname according to claim 1, which is characterized in that the voice obtains single Member is M6027 microphones.

A kind of 3. speech recognition system based on surname according to claim 1, which is characterized in that the speech processes list Member is divided into acoustic model characteristic extracting module, language model characteristic extracting module, obscures processing module and list Shuan Xingshi processing moulds Block.

4. a kind of audio recognition method based on surname, which is characterized in that comprise the steps of：

A, user inputs a string of voices；

G, it is final to obtain corresponding surname Chinese character list.