CN104657104A

CN104657104A - PC (personal computer)-oriented embedded non-specific voice communication system

Info

Publication number: CN104657104A
Application number: CN201510030838.XA
Authority: CN
Inventors: 吴振英
Original assignee: Suzhou Vocational Institute of Industrial Technology
Current assignee: Suzhou Vocational Institute of Industrial Technology
Priority date: 2015-01-22
Filing date: 2015-01-22
Publication date: 2015-05-27

Abstract

The invention discloses the design of an embedded non-specific voice signal and computer communication circuit. The circuit comprises a microprocessor STM32F407VG, a voice identification chip LD3320, a USB (universal serial bus) cable and the like. In the whole system, the STM32F407VG of a Cortex-M4 serves as a main control chip, a MuC/OS-III operation system is transplanted to perform task management, and the USB cable is transplanted to serve as human-computer interface standard equipment; the voice identification chip is externally connected with and provided with an audio player and an audio collector and is connected with the microprocessor SPI (Serial Peripheral Interface) through communication; the USB cable is connected to a PC. According to the scheme provided by the invention, control and operation on the computer are finished without a keyboard and a mouse and are performed only by giving a voice command by a user, the circuit has the advantages of high stability, high voice identification rate, high noise resistance, simple structure, convenience in use and the like, the cost can be effectively reduced, and the circuit can be widely applied in multiple fields, such as service robot intelligent space, intelligent home furnishing and consumer electronics.

Description

A kind of embedded nonspecific voice communication system towards PC

Technical field

The present invention relates to a kind of nonspecific voice signal identification circuit, specifically relate to a kind of Circuits System linked up by embedded nonspecific voice signal and computer.

Background technology

Embedded technology, as the core technology in intelligence epoch 21 century, has more and more played pillar effect in science and technology and sphere of life.At present, the control of computing machine has been come by keyboard and mouse, and along with the development of science and technology, people need a kind of mode of more convenient, more natural, hommization more and computing machine to carry out alternately.Speech recognition technology is one of important greatly development in science and technology technology of areas of information technology ten, and it is a cross discipline, just progressively becomes the gordian technique of man-machine interaction in infotech, and its application has become one and had emulative emerging hi-tech industry.The speech recognition technology of current main flow is the basic theories of Corpus--based Method pattern-recognition, and statistical model training is due to algorithm complexity, and operand is large, and power consumption is high, high in cost of production shortcoming, limits its utilization in actual applications.And embedded speech man-machine interaction due to its real-time good, many advantages such as stability is high have become the heat subject of research at present, but there is no comparative maturity, and the exploitativeness scheme that design complexity is low, power consumption is less is come out.

Summary of the invention

In view of above-mentioned the deficiencies in the prior art, the object of the invention is to propose a kind of embedded nonspecific voice communication system towards PC, with simple circuit design, the feature such as discrimination is high, real-time is high, good stability proposes the technical solution of PC interactive voice.

Above-mentioned purpose of the present invention, its technical solution be achieved is: a kind of embedded nonspecific voice communication system towards PC, it is characterized in that: described communication system is by microprocessor, voice recognition chip LD3320, USB cable and auxiliary distribution road composition, wherein said microprocessor is the STM32F407VG main control chip of Cortex-M4 kernel and transplanting has μ C/OS-III operating system, be equipped with audio player outside described voice recognition chip LD3320 to communicate with audio collection device and with microprocessor SPI and be connected, described USB cable connects microprocessor to PC, and voice communication comprises step:

I, general initialization, general initialization is exactly the initialization that speech recognition and speech play are all suitable for.The inner integrated PLL of LD3320, correctly configuring PLL according to clock frequency is the guarantee that speech recognition ADC samples and speech play .DA exports, and only needs to revise this macro definition of CLK_IN in code;

II, some parameters of initialization speech recognition, this parameter mainly comprise speech detection is set sensitivity, the time of initiating speech, the background noise time, sensitivity is not more high better, and the possibility of the higher false triggering of sensitivity is larger, therefore will arrange a suitable value according to actual environment.The initiating speech time is that decision-making is once that real voice start when the voice of chip detection to how long, and the background noise time is when chip detection is to the end being judged as voice after how long voice do not input.

III, to be write direct unspecific identification phrase towards microprocessor by phonetic, each identifies that phrase comprises a phrase ID and a corresponding PC action command, corresponding virtually on PC becomes a personal-machine interface keyboard;

IV, speech recognition is started, audio collection device receives outside nonspecific voice, identify voice by voice recognition chip LD3320 and interrupted to microprocessor application by recognition result, microcontroller interrupt reads out recognition result and the selected PC action command corresponding with phrase ID, respond action to the instruction of PC output action by PC by USB cable.

Further, described microprocessor is the MCU that maximum operation frequency reaches 168MHz.

Further, described voice recognition chip LD3320 is the speech recognition device being built-in with nonspecific speech recognition DSP algorithm.

Apply nonspecific voice communication system of the present invention, its remarkable advantage is presented as: without the need to being completed control and the operation of computing machine by keyboard and mouse, only need control by people's order of sounding and operate computing machine, the advantages such as this circuit has good stability, phonetic recognization rate is high, anti-noise jamming ability is strong, structure is simple and easy to use, effectively can reduce costs, and multiple fields such as service robot intelligent space, Smart Home and consumption electronic product can be widely used in.

Accompanying drawing explanation

Fig. 1 is circuit general diagram of the present invention.

Fig. 2 is the circuit connection diagram of voice recognition chip LD3320 in communication system of the present invention.

Embodiment

Below just accompanying drawing in conjunction with the embodiments, is described in further detail technical solution of the present invention, and to make novelty of the present invention, practicality is easier to understand.

The present invention innovates and proposes a kind of by embedded nonspecific voice signal and the mutual ditch circuit passband of computer, and this main circuit will comprise master control and speech recognition two large divisions.As shown in Figure 1 from concrete structure: its structure is made up of auxiliary distribution roads such as microprocessor STM32F407VG, voice recognition chip LD3320, USB cable and other house dogs, wherein microprocessor is the STM32F407VG main control chip (calling MCU in the following text) of Cortex-M4 kernel, and transplant μ COS-III operating system as task management, transplanted USB as HID standard device, MCU selects SPI to communicate with voice recognition chip LD3320.The maximum operation frequency of this MCU reaches 168MHz, and processing speed is fast; μ COS-III is the third generation micro controller system of micrium company, and it is a brand-new operating system, is widely used in various product at home and abroad, and main control chip is transplanted μ COS-III operating system, as management and the scheduling of task.

In communication system as of the present invention in Fig. 2 voice recognition chip LD3320 circuit connection diagram shown in, LD3320 adopts parallel mode directly to connect with MCU, general employing 1k Ω resistance pull-up, reset signal and interruption return signal are directly connected with MCU and adopt the pull-up resistor of 3.3k Ω, backup system steady operation, LD3320 and processor adopt same external clock, figure below is active crystal oscillator, upper right side is the interface of microphone and earphone, and lower right row's pin extracts and is connected in respective pin.Concerning LD3320, reset signal is sent by MCU, and look-at-me is sent by LD3320, and MCU is responsible for reception.Be equipped with audio player outside voice recognition chip LD3320 to communicate with audio collection device and with microprocessor SPI and be connected, USB cable connects microprocessor to PC, voice recognition chip LD3320 is built-in with the DSP algorithm of nonspecific speech recognition, can dynamically edit identification item list, without the need to other additional device plug-in, one chip can complete speech recognition, and directly support the speech play of mp3 data, voice recognition chip detects phonetic entry and identifies voice, recognition result interrupts to MCU application, MCU interruption reads out recognition result, and start corresponding identification mission, to the operation that PC is correlated with.

Above-mentioned voice are linked up and are comprised step:

I, general initialization, general initialization is exactly the initialization that speech recognition and speech play are all suitable for.The inner integrated PLL of LD3320, correctly configuring PLL according to clock frequency is the guarantee that speech recognition ADC samples and speech play .DA exports, and we only need to revise this macro definition of CLK_IN in code.

III, to be write direct unspecific identification phrase towards microprocessor by phonetic, each identifies that phrase comprises a phrase ID and a corresponding PC action command, corresponding virtually on PC becomes a personal-machine interface keyboard.

Under normal circumstances, as long as each identification repeats step I to IV; If systems stay is operated in speech identifying function and does not reset, so only need to start Exactly-once step IV when identifying at every turn, thus can save time, improve the response speed of speech recognition.

The actual excellent effect of the technical program is understood further below from the communication experiment of communication system of the present invention under the various occasion of reality.Under the environment of two different background noises such as family's (quiet environment) and market (noisy environment), by the ditch circuit passband be formed by connecting by above scheme framework, and after arranging the parameters such as rational speech detection sensitivity, voice initial time, background noise time to this ditch circuit passband in step II, the embedded nonspecific voice that can carry out towards PC are linked up.Allow adult and child send acoustic control according to the phonetic order preset to this ditch circuit passband, observe and record the actual operation situation (number of times is set to 15 times) of PC here, result arranges (discrimination is identification number of times and the ratio of total degree) as shown in the table: as can be seen here, communication system of the present invention is not only practical in actual applications, and efficiency is remarkable.

To sum up, the embedded nonspecific voice communication system of application the present invention, without the need to being completed control and the operation of computing machine by keyboard and mouse, only need control by people's order of sounding and operate computing machine, the advantages such as this circuit has good stability, phonetic recognization rate is high, anti-noise jamming ability is strong, structure is simple and easy to use, effectively can reduce costs, and multiple fields such as service robot intelligent space, Smart Home and consumption electronic product can be widely used in.

In addition to the implementation, the present invention can also have other embodiment, and all employings are equal to the technical scheme of replacement or equivalent transformation formation, all drop within the present invention's scope required for protection.

Claims

1. the embedded nonspecific voice communication system towards PC, it is characterized in that: described communication system is made up of microprocessor, voice recognition chip LD3320, USB cable and auxiliary distribution road, wherein said microprocessor is the STM32F407VG main control chip of Cortex-M4 kernel and transplanting has μ C/OS-III operating system, be equipped with audio player outside described voice recognition chip LD3320 to communicate with audio collection device and with microprocessor SPI and be connected, described USB cable connects microprocessor to PC, and voice communication comprises step:

I, voice recognition chip LD3320 is carried out to the general initialization of speech recognition and speech play, the inner integrated PLL of voice recognition chip LD3320, correctly configures PLL according to clock frequency by this macro definition of CLK_IN in amendment code and samples and speech play .DA output to ensure speech recognition ADC;

II, the parameter of initialization speech recognition, described parameter comprise speech detection sensitivity, judge the initiating speech time that voice start and judge the background noise time that voice terminate;

2., according to claim 1 towards the embedded nonspecific voice communication system of PC, it is characterized in that: described microprocessor is the MCU that maximum operation frequency reaches 168MHz.

3. according to claim 1 towards the embedded nonspecific voice communication system of PC, it is characterized in that: described voice recognition chip LD3320 is the speech recognition device being built-in with nonspecific speech recognition DSP algorithm.