CN103761967A

CN103761967A - Embedded speech recognition system

Info

Publication number: CN103761967A
Application number: CN201410007573.7A
Authority: CN
Inventors: 钱平; 张英振; 李玉
Original assignee: Shanghai Institute of Technology
Current assignee: Shanghai Institute of Technology
Priority date: 2014-01-08
Filing date: 2014-01-08
Publication date: 2014-04-30

Abstract

The invention relates to an embedded speech recognition system which comprises a master control chip, a speech processing chip, a synchronous dynamic random access memory (SDRAM) data exchange buffer chip, a speech signal collection device, a loudspeaker, a control execution module and a power module, wherein the speech signal collection device collects and transmits speech information to the speech processing chip to perform preprocessing, terminal detection and feature extraction, the speech processing chip obtains and sends feature data to the SDRAM data exchange buffer chip to achieve temporary storage and to the master control chip which obtains features and outputs a control signal after matching features with speech features in a memory feature base, the control signal is sent to the control execution module, and the power module supplies power to the chips. The control chip is applied to speech recognition to replace a personal computer (PC) in the traditional speech recognition system, therefore, the embedded speech recognition system has the advantages of small size, low cost and power consumption, flexible installation and the like, and an embedded real-time control system is utilized to manage task distribution to ensure real-time performance of recognition.

Description

A kind of built-in speech recognition system

Technical field

The present invention relates to a kind of speech recognition technology, particularly a kind of built-in speech recognition system.

Background technology

Built-in speech recognition system refers to that the various advanced persons' of application microprocessor realizes speech recognition technology at plate level or chip-scale software or hardware.The speech recognition system of built-in speech recognition system and PC is compared, although travelling speed and memory size have certain limitation, but it has the advantages such as volume is little, low in energy consumption, reliability is high, cost is little, flexible for installation, is specially adapted to the fields such as Smart Home, robot and consumer electronics.

Summary of the invention

The present invention be directed to built-in speech recognition system and than PC, be more suitable for the problem of household consumption, propose a kind of built-in speech recognition system, for being operated in the speech recognition system of plate level, realize the speech identifying function of efficiently and accurately.

Technical scheme of the present invention is: a kind of built-in speech recognition system, comprise main control chip, pronounciation processing chip, SDRAM exchanges data buffer chip, speech signal collection device, loudspeaker, control execution module, power module, speech signal collection device gathers transmission of speech information and carries out pre-service to pronounciation processing chip, end-point detection and feature extraction, pronounciation processing chip obtains characteristic, and to send into SDRAM exchanges data buffer chip temporary, pronounciation processing chip obtains characteristic and sends into main control chip, main control chip obtains feature and mates rear output control signal with phonetic feature in memory features storehouse, control signal is sent control execution module, power module is given each chip power supply.

Described system also comprises program download jtag interface and auxiliary circuit, and auxiliary circuit comprises the LED light of gain adjusting circuit and status recognition.

The described system integration is on a pcb board, and main control chip is connected by SPI interface with the sound identification module being comprised of pronounciation processing chip, SDRAM and peripheral auxiliary circuits.

Described pronounciation processing chip adopts MFCC algorithm to carry out feature extraction, and phonetic feature is through single order MFCC Differential Characteristics, and the single order of energy feature and energy, second order difference characteristic processing, and described main control chip adopts CHMM algorithm to carry out characteristic matching.

Beneficial effect of the present invention is: a kind of built-in speech recognition system of the present invention, control chip is applied to the PC that replaces legacy speech recognition systems in speech recognition, and make it have the advantages such as volume is little, cost is low, low in energy consumption and flexible for installation.The feature extraction algorithm of selecting and Feature Correspondence Algorithm are applicable to embedded platform of the present invention preferably, guarantee the accuracy of identification, and apply embedded real-time control system management role and distribute, and guarantee the real-time of identification.

Accompanying drawing explanation

Fig. 1 is built-in speech recognition system structural representation of the present invention;

Fig. 2 is built-in speech recognition system process flow diagram of the present invention.

Embodiment

Built-in speech recognition system structural representation as shown in Figure 1: system comprises main control chip, pronounciation processing chip, SDRAM exchanges data buffer chip, microphone, loudspeaker, control execution module, auxiliary circuit, power module, jtag interface.

Main control chip: mainly complete the phonic signal character contrast in phonic signal character to be identified and pattern base and control and carry out as core controller, can coordinate speech recognition, and the data that identification is obtained be converted to control command send control execution module complete control carry out.Wherein all programs are all take embedded real-time operating system as basic environment, realize by standard C language.

Pronounciation processing chip: as the core of identification module, mainly complete pre-service, end-point detection and feature extraction to voice signal.

Power module: for each chip provides required normal working voltage.

Microphone: speech signal collection device.

Loudspeaker: as the interactive tool of built-in speech recognition system, carry out the feedback that voice message and identification, control complete.

Auxiliary circuit: comprise gain adjusting circuit and LED light, be mainly used in the indication of gain-adjusted and status recognition.

Control execution module: after voice signal is identified, main control chip is transferred to execution module by steering order, drive control object action.

The program download interface of jtag interface: STM32.

The system integration, on a pcb board, is made it have to the advantages such as volume is little, low in energy consumption, reliability is high, cost is little, flexible for installation.Main control chip is the STM32F407VGT6 chip that ST Microelectronics produces, internal memory phonetic feature storehouse.Power circuit by input voltage voltage stabilizing to after each module required voltage value, for each module independently-powered.Main control chip is connected by SPI interface with the sound identification module being comprised of pronounciation processing chip, SDRAM and peripheral auxiliary circuits, and applies embedded real-time control system management role and distribute and control and carry out, and meets the demand of system real time.

Process flow diagram as shown in Figure 2, in dormant state, reduces power consumption when system wait triggers, and when carrying out button or password triggering, system is recovered normal operating conditions, carries out hardware and program initialization; Speech signal collection device carries out pre-service, end-point detection and feature extraction by the transmission of speech information collecting to pronounciation processing chip, will obtain characteristic and be temporarily stored in SDRAM; Main control chip will obtain feature and mate with phonetic feature in feature database, if characteristic matching is correct, will convert corresponding steering order to and will complete control action, otherwise wait for new phonetic order, no matter whether identify successfully, all give voice feedback information.

Described system adopts MFCC algorithm to carry out feature extraction, in order to improve accuracy of identification, phonetic feature has increased single order MFCC Differential Characteristics, and the single order of energy feature and energy, second order difference feature, adopt CHMM algorithm to carry out characteristic matching, with DTW, DHMM scheduling algorithm is compared, although CHMM algorithm operation quantity and memory consumption are larger, but it is applicable to unspecified person identification, arithmetic accuracy is high, identification accurately, the main control chip of native system is 32 chips simultaneously, there is flash memory on the sheet of high performance computing power and 1M byte, meet selected algorithm requirement.

Claims

1. a built-in speech recognition system, it is characterized in that, comprise main control chip, pronounciation processing chip, SDRAM exchanges data buffer chip, speech signal collection device, loudspeaker, control execution module, power module, speech signal collection device gathers transmission of speech information and carries out pre-service to pronounciation processing chip, end-point detection and feature extraction, pronounciation processing chip obtains characteristic, and to send into SDRAM exchanges data buffer chip temporary, pronounciation processing chip obtains characteristic and sends into main control chip, main control chip obtains feature and mates rear output control signal with phonetic feature in memory features storehouse, control signal is sent control execution module, power module is given each chip power supply.

2. built-in speech recognition system according to claim 1, is characterized in that, described system also comprises that program downloads jtag interface and auxiliary circuit, and auxiliary circuit comprises the LED light of gain adjusting circuit and status recognition.

3. built-in speech recognition system according to claim 2, is characterized in that, the described system integration is on a pcb board, and main control chip is connected by SPI interface with the sound identification module being comprised of pronounciation processing chip, SDRAM and peripheral auxiliary circuits.

4. built-in speech recognition system according to claim 1, it is characterized in that, described pronounciation processing chip adopts MFCC algorithm to carry out feature extraction, phonetic feature is through single order MFCC Differential Characteristics, and the single order of energy feature and energy, second order difference characteristic processing, described main control chip adopts CHMM algorithm to carry out characteristic matching.