CN104657104A - PC (personal computer)-oriented embedded non-specific voice communication system - Google Patents

PC (personal computer)-oriented embedded non-specific voice communication system Download PDF

Info

Publication number
CN104657104A
CN104657104A CN201510030838.XA CN201510030838A CN104657104A CN 104657104 A CN104657104 A CN 104657104A CN 201510030838 A CN201510030838 A CN 201510030838A CN 104657104 A CN104657104 A CN 104657104A
Authority
CN
China
Prior art keywords
voice
microprocessor
speech
chip
recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510030838.XA
Other languages
Chinese (zh)
Inventor
吴振英
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Vocational Institute of Industrial Technology
Original Assignee
Suzhou Vocational Institute of Industrial Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Vocational Institute of Industrial Technology filed Critical Suzhou Vocational Institute of Industrial Technology
Priority to CN201510030838.XA priority Critical patent/CN104657104A/en
Publication of CN104657104A publication Critical patent/CN104657104A/en
Pending legal-status Critical Current

Links

Abstract

The invention discloses the design of an embedded non-specific voice signal and computer communication circuit. The circuit comprises a microprocessor STM32F407VG, a voice identification chip LD3320, a USB (universal serial bus) cable and the like. In the whole system, the STM32F407VG of a Cortex-M4 serves as a main control chip, a MuC/OS-III operation system is transplanted to perform task management, and the USB cable is transplanted to serve as human-computer interface standard equipment; the voice identification chip is externally connected with and provided with an audio player and an audio collector and is connected with the microprocessor SPI (Serial Peripheral Interface) through communication; the USB cable is connected to a PC. According to the scheme provided by the invention, control and operation on the computer are finished without a keyboard and a mouse and are performed only by giving a voice command by a user, the circuit has the advantages of high stability, high voice identification rate, high noise resistance, simple structure, convenience in use and the like, the cost can be effectively reduced, and the circuit can be widely applied in multiple fields, such as service robot intelligent space, intelligent home furnishing and consumer electronics.

Description

A kind of embedded nonspecific voice communication system towards PC
Technical field
The present invention relates to a kind of nonspecific voice signal identification circuit, specifically relate to a kind of Circuits System linked up by embedded nonspecific voice signal and computer.
Background technology
Embedded technology, as the core technology in intelligence epoch 21 century, has more and more played pillar effect in science and technology and sphere of life.At present, the control of computing machine has been come by keyboard and mouse, and along with the development of science and technology, people need a kind of mode of more convenient, more natural, hommization more and computing machine to carry out alternately.Speech recognition technology is one of important greatly development in science and technology technology of areas of information technology ten, and it is a cross discipline, just progressively becomes the gordian technique of man-machine interaction in infotech, and its application has become one and had emulative emerging hi-tech industry.The speech recognition technology of current main flow is the basic theories of Corpus--based Method pattern-recognition, and statistical model training is due to algorithm complexity, and operand is large, and power consumption is high, high in cost of production shortcoming, limits its utilization in actual applications.And embedded speech man-machine interaction due to its real-time good, many advantages such as stability is high have become the heat subject of research at present, but there is no comparative maturity, and the exploitativeness scheme that design complexity is low, power consumption is less is come out.
Summary of the invention
In view of above-mentioned the deficiencies in the prior art, the object of the invention is to propose a kind of embedded nonspecific voice communication system towards PC, with simple circuit design, the feature such as discrimination is high, real-time is high, good stability proposes the technical solution of PC interactive voice.
Above-mentioned purpose of the present invention, its technical solution be achieved is: a kind of embedded nonspecific voice communication system towards PC, it is characterized in that: described communication system is by microprocessor, voice recognition chip LD3320, USB cable and auxiliary distribution road composition, wherein said microprocessor is the STM32F407VG main control chip of Cortex-M4 kernel and transplanting has μ C/OS-III operating system, be equipped with audio player outside described voice recognition chip LD3320 to communicate with audio collection device and with microprocessor SPI and be connected, described USB cable connects microprocessor to PC, and voice communication comprises step:
I, general initialization, general initialization is exactly the initialization that speech recognition and speech play are all suitable for.The inner integrated PLL of LD3320, correctly configuring PLL according to clock frequency is the guarantee that speech recognition ADC samples and speech play .DA exports, and only needs to revise this macro definition of CLK_IN in code;
II, some parameters of initialization speech recognition, this parameter mainly comprise speech detection is set sensitivity, the time of initiating speech, the background noise time, sensitivity is not more high better, and the possibility of the higher false triggering of sensitivity is larger, therefore will arrange a suitable value according to actual environment.The initiating speech time is that decision-making is once that real voice start when the voice of chip detection to how long, and the background noise time is when chip detection is to the end being judged as voice after how long voice do not input.
III, to be write direct unspecific identification phrase towards microprocessor by phonetic, each identifies that phrase comprises a phrase ID and a corresponding PC action command, corresponding virtually on PC becomes a personal-machine interface keyboard;
IV, speech recognition is started, audio collection device receives outside nonspecific voice, identify voice by voice recognition chip LD3320 and interrupted to microprocessor application by recognition result, microcontroller interrupt reads out recognition result and the selected PC action command corresponding with phrase ID, respond action to the instruction of PC output action by PC by USB cable.
Further, described microprocessor is the MCU that maximum operation frequency reaches 168MHz.
Further, described voice recognition chip LD3320 is the speech recognition device being built-in with nonspecific speech recognition DSP algorithm.
Apply nonspecific voice communication system of the present invention, its remarkable advantage is presented as: without the need to being completed control and the operation of computing machine by keyboard and mouse, only need control by people's order of sounding and operate computing machine, the advantages such as this circuit has good stability, phonetic recognization rate is high, anti-noise jamming ability is strong, structure is simple and easy to use, effectively can reduce costs, and multiple fields such as service robot intelligent space, Smart Home and consumption electronic product can be widely used in.
Accompanying drawing explanation
Fig. 1 is circuit general diagram of the present invention.
Fig. 2 is the circuit connection diagram of voice recognition chip LD3320 in communication system of the present invention.
Embodiment
Below just accompanying drawing in conjunction with the embodiments, is described in further detail technical solution of the present invention, and to make novelty of the present invention, practicality is easier to understand.
The present invention innovates and proposes a kind of by embedded nonspecific voice signal and the mutual ditch circuit passband of computer, and this main circuit will comprise master control and speech recognition two large divisions.As shown in Figure 1 from concrete structure: its structure is made up of auxiliary distribution roads such as microprocessor STM32F407VG, voice recognition chip LD3320, USB cable and other house dogs, wherein microprocessor is the STM32F407VG main control chip (calling MCU in the following text) of Cortex-M4 kernel, and transplant μ COS-III operating system as task management, transplanted USB as HID standard device, MCU selects SPI to communicate with voice recognition chip LD3320.The maximum operation frequency of this MCU reaches 168MHz, and processing speed is fast; μ COS-III is the third generation micro controller system of micrium company, and it is a brand-new operating system, is widely used in various product at home and abroad, and main control chip is transplanted μ COS-III operating system, as management and the scheduling of task.
In communication system as of the present invention in Fig. 2 voice recognition chip LD3320 circuit connection diagram shown in, LD3320 adopts parallel mode directly to connect with MCU, general employing 1k Ω resistance pull-up, reset signal and interruption return signal are directly connected with MCU and adopt the pull-up resistor of 3.3k Ω, backup system steady operation, LD3320 and processor adopt same external clock, figure below is active crystal oscillator, upper right side is the interface of microphone and earphone, and lower right row's pin extracts and is connected in respective pin.Concerning LD3320, reset signal is sent by MCU, and look-at-me is sent by LD3320, and MCU is responsible for reception.Be equipped with audio player outside voice recognition chip LD3320 to communicate with audio collection device and with microprocessor SPI and be connected, USB cable connects microprocessor to PC, voice recognition chip LD3320 is built-in with the DSP algorithm of nonspecific speech recognition, can dynamically edit identification item list, without the need to other additional device plug-in, one chip can complete speech recognition, and directly support the speech play of mp3 data, voice recognition chip detects phonetic entry and identifies voice, recognition result interrupts to MCU application, MCU interruption reads out recognition result, and start corresponding identification mission, to the operation that PC is correlated with.
Above-mentioned voice are linked up and are comprised step:
I, general initialization, general initialization is exactly the initialization that speech recognition and speech play are all suitable for.The inner integrated PLL of LD3320, correctly configuring PLL according to clock frequency is the guarantee that speech recognition ADC samples and speech play .DA exports, and we only need to revise this macro definition of CLK_IN in code.
II, some parameters of initialization speech recognition, this parameter mainly comprise speech detection is set sensitivity, the time of initiating speech, the background noise time, sensitivity is not more high better, and the possibility of the higher false triggering of sensitivity is larger, therefore will arrange a suitable value according to actual environment.The initiating speech time is that decision-making is once that real voice start when the voice of chip detection to how long, and the background noise time is when chip detection is to the end being judged as voice after how long voice do not input.
III, to be write direct unspecific identification phrase towards microprocessor by phonetic, each identifies that phrase comprises a phrase ID and a corresponding PC action command, corresponding virtually on PC becomes a personal-machine interface keyboard.
IV, speech recognition is started, audio collection device receives outside nonspecific voice, identify voice by voice recognition chip LD3320 and interrupted to microprocessor application by recognition result, microcontroller interrupt reads out recognition result and the selected PC action command corresponding with phrase ID, respond action to the instruction of PC output action by PC by USB cable.
Under normal circumstances, as long as each identification repeats step I to IV; If systems stay is operated in speech identifying function and does not reset, so only need to start Exactly-once step IV when identifying at every turn, thus can save time, improve the response speed of speech recognition.
The actual excellent effect of the technical program is understood further below from the communication experiment of communication system of the present invention under the various occasion of reality.Under the environment of two different background noises such as family's (quiet environment) and market (noisy environment), by the ditch circuit passband be formed by connecting by above scheme framework, and after arranging the parameters such as rational speech detection sensitivity, voice initial time, background noise time to this ditch circuit passband in step II, the embedded nonspecific voice that can carry out towards PC are linked up.Allow adult and child send acoustic control according to the phonetic order preset to this ditch circuit passband, observe and record the actual operation situation (number of times is set to 15 times) of PC here, result arranges (discrimination is identification number of times and the ratio of total degree) as shown in the table: as can be seen here, communication system of the present invention is not only practical in actual applications, and efficiency is remarkable.
To sum up, the embedded nonspecific voice communication system of application the present invention, without the need to being completed control and the operation of computing machine by keyboard and mouse, only need control by people's order of sounding and operate computing machine, the advantages such as this circuit has good stability, phonetic recognization rate is high, anti-noise jamming ability is strong, structure is simple and easy to use, effectively can reduce costs, and multiple fields such as service robot intelligent space, Smart Home and consumption electronic product can be widely used in.
In addition to the implementation, the present invention can also have other embodiment, and all employings are equal to the technical scheme of replacement or equivalent transformation formation, all drop within the present invention's scope required for protection.

Claims (3)

1. the embedded nonspecific voice communication system towards PC, it is characterized in that: described communication system is made up of microprocessor, voice recognition chip LD3320, USB cable and auxiliary distribution road, wherein said microprocessor is the STM32F407VG main control chip of Cortex-M4 kernel and transplanting has μ C/OS-III operating system, be equipped with audio player outside described voice recognition chip LD3320 to communicate with audio collection device and with microprocessor SPI and be connected, described USB cable connects microprocessor to PC, and voice communication comprises step:
I, voice recognition chip LD3320 is carried out to the general initialization of speech recognition and speech play, the inner integrated PLL of voice recognition chip LD3320, correctly configures PLL according to clock frequency by this macro definition of CLK_IN in amendment code and samples and speech play .DA output to ensure speech recognition ADC;
II, the parameter of initialization speech recognition, described parameter comprise speech detection sensitivity, judge the initiating speech time that voice start and judge the background noise time that voice terminate;
III, to be write direct unspecific identification phrase towards microprocessor by phonetic, each identifies that phrase comprises a phrase ID and a corresponding PC action command, corresponding virtually on PC becomes a personal-machine interface keyboard;
IV, speech recognition is started, audio collection device receives outside nonspecific voice, identify voice by voice recognition chip LD3320 and interrupted to microprocessor application by recognition result, microcontroller interrupt reads out recognition result and the selected PC action command corresponding with phrase ID, respond action to the instruction of PC output action by PC by USB cable.
2., according to claim 1 towards the embedded nonspecific voice communication system of PC, it is characterized in that: described microprocessor is the MCU that maximum operation frequency reaches 168MHz.
3. according to claim 1 towards the embedded nonspecific voice communication system of PC, it is characterized in that: described voice recognition chip LD3320 is the speech recognition device being built-in with nonspecific speech recognition DSP algorithm.
CN201510030838.XA 2015-01-22 2015-01-22 PC (personal computer)-oriented embedded non-specific voice communication system Pending CN104657104A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510030838.XA CN104657104A (en) 2015-01-22 2015-01-22 PC (personal computer)-oriented embedded non-specific voice communication system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510030838.XA CN104657104A (en) 2015-01-22 2015-01-22 PC (personal computer)-oriented embedded non-specific voice communication system

Publications (1)

Publication Number Publication Date
CN104657104A true CN104657104A (en) 2015-05-27

Family

ID=53248298

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510030838.XA Pending CN104657104A (en) 2015-01-22 2015-01-22 PC (personal computer)-oriented embedded non-specific voice communication system

Country Status (1)

Country Link
CN (1) CN104657104A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107667318A (en) * 2015-06-25 2018-02-06 英特尔公司 Dialog interface technology for system control
CN108170480A (en) * 2017-12-25 2018-06-15 北京康拓科技有限公司 A kind of startup method based on u-boot guiding μ C/OS operating systems
US10448762B2 (en) 2017-09-15 2019-10-22 Kohler Co. Mirror
US10663938B2 (en) 2017-09-15 2020-05-26 Kohler Co. Power operation of intelligent devices
US10887125B2 (en) 2017-09-15 2021-01-05 Kohler Co. Bathroom speaker
US11099540B2 (en) 2017-09-15 2021-08-24 Kohler Co. User identity in household appliances
US11921794B2 (en) 2017-09-15 2024-03-05 Kohler Co. Feedback for water consuming appliance

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN201286140Y (en) * 2008-11-10 2009-08-05 天津三星电子有限公司 Video camera set by voice recognition system
CN201845546U (en) * 2010-11-18 2011-05-25 西安龙飞软件有限公司 Device capable of controlling mobile phone through speech
CN201936600U (en) * 2011-02-28 2011-08-17 山东大学 Non-specific person voice recognition and voice synthesis device based on special voice chip

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN201286140Y (en) * 2008-11-10 2009-08-05 天津三星电子有限公司 Video camera set by voice recognition system
CN201845546U (en) * 2010-11-18 2011-05-25 西安龙飞软件有限公司 Device capable of controlling mobile phone through speech
CN201936600U (en) * 2011-02-28 2011-08-17 山东大学 Non-specific person voice recognition and voice synthesis device based on special voice chip

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
LAUNCHER: "基于Stm32,LD3320的非特定语音识别USB HID Keyboard实现", 《HTTP://BLOG.SINA.COM.CN/S/BLOG_52E8BAA40101NIK6.HTML》 *
ST公司: "STM32F405xx & STM32F407xx Datasheet", 《PRODUCTION DATASHEET》 *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107667318A (en) * 2015-06-25 2018-02-06 英特尔公司 Dialog interface technology for system control
CN107667318B (en) * 2015-06-25 2022-02-15 英特尔公司 Conversational interface techniques for system control
US10448762B2 (en) 2017-09-15 2019-10-22 Kohler Co. Mirror
US10663938B2 (en) 2017-09-15 2020-05-26 Kohler Co. Power operation of intelligent devices
US10887125B2 (en) 2017-09-15 2021-01-05 Kohler Co. Bathroom speaker
US11099540B2 (en) 2017-09-15 2021-08-24 Kohler Co. User identity in household appliances
US11314215B2 (en) 2017-09-15 2022-04-26 Kohler Co. Apparatus controlling bathroom appliance lighting based on user identity
US11892811B2 (en) 2017-09-15 2024-02-06 Kohler Co. Geographic analysis of water conditions
US11921794B2 (en) 2017-09-15 2024-03-05 Kohler Co. Feedback for water consuming appliance
US11949533B2 (en) 2017-09-15 2024-04-02 Kohler Co. Sink device
CN108170480A (en) * 2017-12-25 2018-06-15 北京康拓科技有限公司 A kind of startup method based on u-boot guiding μ C/OS operating systems

Similar Documents

Publication Publication Date Title
CN104657104A (en) PC (personal computer)-oriented embedded non-specific voice communication system
CN102855872B (en) Based on terminal and the mutual household electric appliance control method of internet voice and system
CN103093755B (en) Based on terminal and mutual network household electric appliance control method and the system of internet voice
CN102855874B (en) Method and system for controlling household appliance on basis of voice interaction of internet
CN105745615A (en) Always-on audio control for mobile device
CN104516477B (en) Into the technology of low power state
CN202041916U (en) Sound control mouse
CN201936600U (en) Non-specific person voice recognition and voice synthesis device based on special voice chip
CN103002097A (en) Key induction device, key induction method and mobile terminal
CN104155890A (en) Speech recognition control switch system
CN204480661U (en) Phonetic controller
CN101789238B (en) Music rhythm extracting system based on MCU hardware platform and method thereof
CN204808301U (en) Terminal
CN104078042B (en) A kind of electronic equipment and a kind of method of information processing
CN206258698U (en) The voice that can record is switched with button coordinated signals
CN103970706A (en) External expansion system of mobile device based on FT311D interface chip
CN203588493U (en) Intelligent man-machine interaction early education machine
CN107945794A (en) A kind of Application on Voiceprint Recognition and the device of control
CN108597516A (en) A kind of speech recognition system for intelligent sound customer service robot
CN104658534A (en) Voice recognition system
CN213781581U (en) Voice recognition control system
CN202871118U (en) Wireless learning machine for foreign languages
CN107863107A (en) A kind of speech recognition system
CN220796302U (en) Audio playing circuit
CN203950424U (en) A kind of ammeter concentrator with voice interactive function

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20150527

WD01 Invention patent application deemed withdrawn after publication