CN108091325A - A kind of speech recognition system and method based on surname - Google Patents

A kind of speech recognition system and method based on surname Download PDF

Info

Publication number
CN108091325A
CN108091325A CN201711440127.5A CN201711440127A CN108091325A CN 108091325 A CN108091325 A CN 108091325A CN 201711440127 A CN201711440127 A CN 201711440127A CN 108091325 A CN108091325 A CN 108091325A
Authority
CN
China
Prior art keywords
surname
phonetic
unit
list
chinese character
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711440127.5A
Other languages
Chinese (zh)
Inventor
徐东群
庄永军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Sanbao Innovation And Intelligence Co Ltd
Original Assignee
Shenzhen Sanbao Innovation And Intelligence Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Sanbao Innovation And Intelligence Co Ltd filed Critical Shenzhen Sanbao Innovation And Intelligence Co Ltd
Priority to CN201711440127.5A priority Critical patent/CN108091325A/en
Publication of CN108091325A publication Critical patent/CN108091325A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L2015/086Recognition of spelled words

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Document Processing Apparatus (AREA)
  • Machine Translation (AREA)

Abstract

The invention discloses a kind of speech recognition system and method based on surname, including main control unit, voice acquisition unit, Audio Processing Unit, surname model unit and pre-training unit:The main control unit connects voice acquisition unit, Audio Processing Unit, surname model unit and pre-training unit respectively, the present invention is not only simple and convenient, effectively realize the identification of the quick surname Chinese character of high-accuracy, simultaneously confusing a series of processing mode may be designed for surname is a variety of, the performance and discrimination of speech recognition are improved, alleviates system-computed burden.

Description

A kind of speech recognition system and method based on surname
Technical field
The present invention relates to technical field of voice recognition more particularly to a kind of speech recognition systems and method based on surname.
Background technology
Language is that the mankind mutually exchange the most frequently used, most effective, most important and most convenient communication form, and voice is language Acoustics shows, and it is the dream of the mankind all the time to carry out speech exchange with machine.With the rapid development of computer technology, voice The achievement of identification technology also making a breakthrough property, the dream that people is engaged in the dialogue with machine with natural language is progressively close to realizing.Voice The application range of identification technology is extremely wide, is not only related to the every aspect of daily life, and pole is also played in military field Its important role.It is information-intensive society towards intelligent and automation development key technology, makes processing of the people to information It is more convenient with obtaining, so as to improve the work efficiency of people.
The content of the invention
It is an object of the invention to provide a kind of speech recognition system and method based on surname, to solve above-mentioned background skill The problem of being proposed in art.
To achieve the above object, the present invention provides following technical solution:
A kind of speech recognition system based on surname, including main control unit, voice acquisition unit, Audio Processing Unit, surname Model unit and pre-training unit:The main control unit connects voice acquisition unit, Audio Processing Unit, surname model respectively Unit and pre-training unit.
Further technical solution as the present invention:The voice acquisition unit is M6027 microphones.
Further technical solution as the present invention:The Audio Processing Unit be divided into acoustic model characteristic extracting module, Language model characteristic extracting module obscures processing module and list Shuan Xingshi processing modules.
A kind of audio recognition method based on surname, comprises the steps of:
A, user inputs a string of voices;
B, system obtains voice signal, carries out feature extraction to voice by acoustic model, pronunciation sequence is drawn in acoustic model Afterwards, the character string sequence of maximum probability is found out from candidate character sequence using language model;
C, the Chinese character for representing surname is extracted from chinese character string sequence, by the whole Chinese characters and its spelling book of collection, Chinese character can be converted to phonetic, if it is polyphone, only be converted to wherein some phonetic;
If D, the surname is individual character surname, its phonetic is added in into list list, is done for surname phonetic and easily obscures pronunciation Processing and the processing for searching polyphone, and result is all saved in list;
If E, the surname is double word surname, the phonetic of each word in two word surnames is individually taken out to progress and easily obscures pronunciation Processing and lookup multitone, each word can obtain a list, two lists are combined two-by-two, find out these pinyin-groups Close corresponding all surnames;
F, according to obtained phonetic list, the corresponding surname Chinese character of each phonetic can be found, is listed complete expressed by speaker It portion may surname Chinese character;
G, it is final to obtain corresponding surname Chinese character list.
Compared with prior art, the beneficial effects of the invention are as follows:The present invention is not only simple and convenient, effectively realizes high precision The identification of the quick surname Chinese character of rate, at the same for surname it is a variety of it is confusing may design a series of processing mode, improve The performance and discrimination of speech recognition alleviate system-computed burden.
Description of the drawings
Fig. 1 is a kind of structure diagram of the speech recognition system based on surname
Fig. 2 is a kind of flow chart of the audio recognition method based on surname.
Specific embodiment
Below in conjunction with the attached drawing in the embodiment of the present invention, the technical solution in the embodiment of the present invention is carried out clear, complete Site preparation describes, it is clear that described embodiment is only part of the embodiment of the present invention, instead of all the embodiments.It is based on Embodiment in the present invention, those of ordinary skill in the art are obtained every other without making creative work Embodiment belongs to the scope of protection of the invention.
Please refer to Fig.1-2, in the embodiment of the present invention, a kind of speech recognition system based on surname is obtained including voice Unit, Audio Processing Unit, surname model unit and pre-training unit.
Voice acquisition unit:The unit is responsible for adopting the order progress voice signal that user sends using M6027 microphones Sample, and user voice signal is sent to Audio Processing Unit and is handled, speech recognition is passed to, is converted to chinese character String.
Audio Processing Unit:Unit is divided into acoustic model characteristic extracting module, language model characteristic extracting module, obscures place Manage module and list Shuan Xingshi processing modules.
Wherein acoustic model characteristic extracting module is to carry out feature extraction to voice signal using acoustic model, and voice is turned The output of acoustics expression is changed to, it is the probability for belonging to some delimiter symbol to provide voice;
Language model characteristic extracting module is after acoustic model provides pronunciation sequence, and probability is found out from candidate character sequence most Big character string sequence;
Obscure there are four types of processing form in processing module, one is that flat tongue consonant and cacuminal are handled, and for the phonetic of surname, judgement is No, with z, s, either if c beginnings are started with z, s or c, continue to judge to whether there is h in phonetic, if in the presence of removing;If no In the presence of then in second position of phonetic plus h;Two are handled for pre-nasal sound and rear nasal sound, for rear nasal sound:Alphabetical g generally goes out Either whether ending is judged comprising ang, eng or ing in surname phonetic for the beginning of present phonetic, if comprising removing phonetic The g of ending, for pre-nasal sound:An, en and in are generally present in the ending of phonetic, judge to whether there is an, en in surname phonetic Or in, if in the presence of whether last position for continuing to judge phonetic is g, if it is not, then adding g in the ending of phonetic.Three be nose Sound n and lateral l processing, whether the beginning for judging surname phonetic is n, if n, then the n of beginning is changed to l.Judge opening for phonetic Whether head is l, and n is changed to if l, the then l started;Four be multitone word processing, and surname Chinese character is being converted to the process of phonetic, if The surname that user says is polyphone, and it does not read according to correct phonetic of the polyphone in surname, and system is according to 35 The correct surname pronunciation of polyphone and the list of other pronunciations, judge institute's input Pinyin whether in other pronunciations, if at other In pronunciation, then correct surname pronunciation is taken out.The list of the pronunciation of correct surname and other pronunciations of 35 polyphones is by being system All polyphones in surname are found out in the list of existing whole Chinese character and its phonetic, wherein there are 28 multitones in individual character surname Word has 7 polyphones in double word surname.
The phonetic of individual character surname wherein for Dan Xingshi, is first added in list list, for surname by single Shuan Xingshi processing modules Family name's phonetic does the processing for easily obscuring pronunciation and the processing for searching polyphone, and result is all saved in list.And for The phonetic of each word in two word surnames is individually taken out the processing for carrying out easily obscuring pronunciation and searches multitone, each word by Shuan Xingshi A list can be all obtained, two lists are combined two-by-two, these pinyin combinations is found out and corresponds to all surnames.
Surname model unit:The whole Chinese characters and its spelling book of collection, will cover Chinese character as much as possible, and with this Build surname model.
Pre-training unit:The recognition mode of the surname of training extraction in advance is carried out by the structure surname model of system constructing, Carry out the extraction of surname Chinese character.
The present invention operation principle be:Its workflow is as shown in Figure 2:
1. user inputs a string of voices.
2. system obtains voice signal, feature extraction is carried out to voice by acoustic model, pronunciation is drawn in acoustic model After sequence, the character string sequence of maximum probability is found out from candidate character sequence using language model.
3. extracting the Chinese character for representing surname from chinese character string sequence, pass through whole Chinese characters of collection and its phonetic word Chinese character can be converted to phonetic by allusion quotation, if it is polyphone, only be converted to wherein some phonetic.
4. if the surname is individual character surname, its phonetic is added in into list list, is done for surname phonetic and easily obscures hair The processing of sound and the processing for searching polyphone, and result is all saved in list.
5. if the surname is double word surname, the phonetic of each word in two word surnames is individually taken out to progress and easily obscures hair The processing of sound and lookup multitone, each word can obtain a list, two lists are combined two-by-two, find out these spellings The corresponding all surnames of sound combination.
6. according to obtained phonetic list, the corresponding surname Chinese character of each phonetic can be found, is listed expressed by speaker All may surname Chinese character.
7. final obtain corresponding surname Chinese character list.

Claims (4)

1. a kind of speech recognition system based on surname, including main control unit, voice acquisition unit, Audio Processing Unit, surname Family name's model unit and pre-training unit:It is characterized in that, the main control unit connects voice acquisition unit, speech processes respectively Unit, surname model unit and pre-training unit.
2. a kind of speech recognition system based on surname according to claim 1, which is characterized in that the voice obtains single Member is M6027 microphones.
A kind of 3. speech recognition system based on surname according to claim 1, which is characterized in that the speech processes list Member is divided into acoustic model characteristic extracting module, language model characteristic extracting module, obscures processing module and list Shuan Xingshi processing moulds Block.
4. a kind of audio recognition method based on surname, which is characterized in that comprise the steps of:
A, user inputs a string of voices;
B, system obtains voice signal, carries out feature extraction to voice by acoustic model, pronunciation sequence is drawn in acoustic model Afterwards, the character string sequence of maximum probability is found out from candidate character sequence using language model;
C, the Chinese character for representing surname is extracted from chinese character string sequence, by the whole Chinese characters and its spelling book of collection, Chinese character can be converted to phonetic, if it is polyphone, only be converted to wherein some phonetic;
If D, the surname is individual character surname, its phonetic is added in into list list, is done for surname phonetic and easily obscures pronunciation Processing and the processing for searching polyphone, and result is all saved in list;
If E, the surname is double word surname, the phonetic of each word in two word surnames is individually taken out to progress and easily obscures pronunciation Processing and lookup multitone, each word can obtain a list, two lists are combined two-by-two, find out these pinyin-groups Close corresponding all surnames;
F, according to obtained phonetic list, the corresponding surname Chinese character of each phonetic can be found, is listed complete expressed by speaker It portion may surname Chinese character;
G, it is final to obtain corresponding surname Chinese character list.
CN201711440127.5A 2017-12-27 2017-12-27 A kind of speech recognition system and method based on surname Pending CN108091325A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711440127.5A CN108091325A (en) 2017-12-27 2017-12-27 A kind of speech recognition system and method based on surname

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711440127.5A CN108091325A (en) 2017-12-27 2017-12-27 A kind of speech recognition system and method based on surname

Publications (1)

Publication Number Publication Date
CN108091325A true CN108091325A (en) 2018-05-29

Family

ID=62178515

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711440127.5A Pending CN108091325A (en) 2017-12-27 2017-12-27 A kind of speech recognition system and method based on surname

Country Status (1)

Country Link
CN (1) CN108091325A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110097880A (en) * 2019-04-20 2019-08-06 广东小天才科技有限公司 A kind of answer determination method and device based on speech recognition

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1342942A (en) * 2000-09-08 2002-04-03 百度在线网络技术(北京)有限公司 Computer recognizing and indexing method of Chinese names
CN103578467A (en) * 2013-10-18 2014-02-12 威盛电子股份有限公司 Acoustic model building method, voice recognition method and electronic device
CN104331148A (en) * 2014-09-23 2015-02-04 普强信息技术(北京)有限公司 Voice user interface information interaction method
CN105988989A (en) * 2015-02-26 2016-10-05 阿里巴巴集团控股有限公司 Chinese surname recognition method and device, as well as server
CN106447346A (en) * 2016-08-29 2017-02-22 北京中电普华信息技术有限公司 Method and system for construction of intelligent electric power customer service system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1342942A (en) * 2000-09-08 2002-04-03 百度在线网络技术(北京)有限公司 Computer recognizing and indexing method of Chinese names
CN103578467A (en) * 2013-10-18 2014-02-12 威盛电子股份有限公司 Acoustic model building method, voice recognition method and electronic device
CN104331148A (en) * 2014-09-23 2015-02-04 普强信息技术(北京)有限公司 Voice user interface information interaction method
CN105988989A (en) * 2015-02-26 2016-10-05 阿里巴巴集团控股有限公司 Chinese surname recognition method and device, as well as server
CN106447346A (en) * 2016-08-29 2017-02-22 北京中电普华信息技术有限公司 Method and system for construction of intelligent electric power customer service system

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110097880A (en) * 2019-04-20 2019-08-06 广东小天才科技有限公司 A kind of answer determination method and device based on speech recognition

Similar Documents

Publication Publication Date Title
CN105957518B (en) A kind of method of Mongol large vocabulary continuous speech recognition
CN109829058B (en) Classification recognition method for improving dialect recognition accuracy based on multitask learning
CN110675854B (en) Chinese and English mixed speech recognition method and device
CN108847241A (en) It is method, electronic equipment and the storage medium of text by meeting speech recognition
CN104575497B (en) A kind of acoustic model method for building up and the tone decoding method based on the model
CN104078044A (en) Mobile terminal and sound recording search method and device of mobile terminal
WO2003010754A1 (en) Speech input search system
CN104485107B (en) Audio recognition method, speech recognition system and the speech recognition apparatus of title
CN109256150A (en) Speech emotion recognition system and method based on machine learning
CN110277088B (en) Intelligent voice recognition method, intelligent voice recognition device and computer readable storage medium
CN103632663B (en) A kind of method of Mongol phonetic synthesis front-end processing based on HMM
CN102063900A (en) Speech recognition method and system for overcoming confusing pronunciation
CN112818680B (en) Corpus processing method and device, electronic equipment and computer readable storage medium
CN103219007A (en) Voice recognition method and voice recognition device
CN104199825A (en) Information inquiry method and system
CN109102800A (en) A kind of method and apparatus that the determining lyrics show data
CN111192572A (en) Semantic recognition method, device and system
CN111933116B (en) Speech recognition model training method, system, mobile terminal and storage medium
CN110390929A (en) Chinese and English civil aviaton land sky call acoustic model construction method based on CDNN-HMM
CN108091325A (en) A kind of speech recognition system and method based on surname
CN112489634A (en) Language acoustic model training method and device, electronic equipment and computer medium
CN107562907A (en) A kind of intelligent lawyer's expert system and case answering device
CN109213988A (en) Barrage subject distillation method, medium, equipment and system based on N-gram model
CN104424942A (en) Method for improving character speed input accuracy
CN109523992A (en) Tibetan dialect speech processing system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180529

RJ01 Rejection of invention patent application after publication