CN209571226U - Speech recognition device and system - Google Patents

Info

Publication number
CN209571226U
Authority
CN
China
Prior art keywords
speech recognition
voice
module
electrically connected
interface
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201822144428.XU
Other languages
English (en)
Inventor
高炳海
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
LENKENG TECHNOLOGY Co Ltd
Original Assignee
LENKENG TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by LENKENG TECHNOLOGY Co Ltd
Priority to CN201822144428.XU (CN209571226U)
Priority to US16/427,335 (US20200202851A1)
Application granted
Publication of CN209571226U
Legal status: Expired - Fee Related
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F13/00 Interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
    • G06F13/38 Information transfer, e.g. on bus
    • G06F13/42 Bus transfer protocol, e.g. handshake; Synchronisation
    • G06F13/4204 Bus transfer protocol, e.g. handshake; Synchronisation on a parallel bus
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F13/00 Interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
    • G06F13/38 Information transfer, e.g. on bus
    • G06F13/40 Bus structure
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/20 Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/28 Constructional details of speech recognition systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise
    • G10L21/0232 Processing in the frequency domain
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G10L25/84 Detection of presence or absence of voice signals for discriminating voice from noise
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00 Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/20 Cameras or camera modules comprising electronic image sensors; Control thereof for generating image signals from infrared radiation only
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00 Details of television systems
    • H04N5/30 Transforming light or analogous information into electric information
    • H04N5/33 Transforming infrared radiation
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00 Details of transducers, loudspeakers or microphones
    • H04R1/20 Arrangements for obtaining desired frequency or directional characteristics
    • H04R1/32 Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
    • H04R1/40 Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
    • H04R1/406 Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers microphones
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F2213/00 Indexing scheme relating to interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
    • G06F2213/0004 Parallel ports, e.g. centronics
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223 Execution procedure of a spoken command
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00 Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • General Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Otolaryngology (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Biomedical Technology (AREA)
  • Computer Hardware Design (AREA)
  • Computing Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Quality & Reliability (AREA)
  • Machine Translation (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

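The utility model discloses a speech recognition device and system. The device comprises a voice acquisition module, a voice noise reduction module electrically connected to the voice acquisition module, a speech recognition module electrically connected to the voice noise reduction module, an interface conversion module electrically connected to the speech recognition module, and a parallel pin header interface electrically connected to the interface conversion module. The voice acquisition module captures speech; the voice noise reduction module reduces noise in the captured speech; the speech recognition module recognizes the noise-reduced speech and converts it into a control instruction; the interface conversion module converts the control instruction into a digital signal; the parallel pin header interface plugs into an electronic device, and the digital signal is sent to the electronic device through the parallel pin header interface. The speech recognition device and system provided by the utility model require no cloud-based recognition and respond quickly to simple operating instructions.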

Description

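Speech recognition device and system
Technical field
The utility model relates to the field of speech recognition, and in particular to a speech recognition device and system.
Background
Existing speech recognition devices generally rely on a cloud server to perform recognition, so the response time to a voice instruction is constrained by the usage scenario, which hinders quick response to simple operating instructions.
Summary of the utility model
To overcome the shortcomings of the prior art, the first object of the utility model is to provide a speech recognition device that responds quickly to simple operating instructions.
The second object of the utility model is to provide a speech recognition system that responds quickly to simple operating instructions.
The first object of the utility model is achieved by the following technical solution: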
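A speech recognition device comprises: a voice acquisition module, a voice noise reduction module electrically connected to the voice acquisition module, a speech recognition module electrically connected to the voice noise reduction module, an interface conversion module electrically connected to the speech recognition module, and a parallel pin header interface electrically connected to the interface conversion module. The voice acquisition module is configured to capture speech; the voice noise reduction module is configured to reduce noise in the captured speech; the speech recognition module is configured to recognize the noise-reduced speech and convert it into a control instruction; the interface conversion module is configured to convert the control instruction into a digital signal; the parallel pin header interface plugs into an electronic device, and the digital signal is sent to the electronic device through the parallel pin header interface.
Further, the parallel pin header interface is an 8-pin header interface for transmitting binary control signals.
Further, one of the pins of the parallel pin header interface is electrically connected to the power supply of the electronic device so as to power the speech recognition device.
Further, the voice acquisition module comprises a single microphone and dual microphones.
Further, the device also comprises a voice prompt module electrically connected to the speech recognition module; the voice prompt module is configured to emit a prompt voice when the speech recognition module recognizes speech successfully.
Further, the device also comprises a display module electrically connected to the speech recognition module; the display module is configured to display the working state of the speech recognition module and to display a prompt message when speech recognition succeeds.
Further, the device also comprises a setting key electrically connected to the speech recognition module; the setting key is configured to switch the speech recognition type of the speech recognition module.
Further, the device also comprises an infrared sensing module electrically connected to the voice acquisition module; the infrared sensing module is configured to turn on the voice acquisition module when it senses a human body.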
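The second object of the utility model is achieved by the following technical solution:
A speech recognition system comprises an electronic device and the speech recognition device described above, the electronic device being provided with a socket corresponding to the parallel pin header interface.
Further, the electronic device is a smart speaker.
Compared with the prior art, the utility model has the following beneficial effect: the voice acquisition module captures speech, the voice noise reduction module reduces noise in the captured speech, the speech recognition module recognizes the noise-reduced speech and converts it into a control instruction, and the interface conversion module converts the control instruction into a digital signal that is sent to the electronic device through the parallel pin header, so that no cloud-based recognition is needed and simple operating instructions receive a quick response.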
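Brief description of the drawings
Fig. 1 is a schematic diagram of the speech recognition device provided by an embodiment of the utility model;
Fig. 2 is a schematic diagram of the speech recognition system provided by an embodiment of the utility model.
Detailed description
The utility model is further described below with reference to the drawings and specific embodiments. It should be noted that, provided they do not conflict, the embodiments or technical features described below may be combined in any way to form new embodiments.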
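As shown in Fig. 1, the speech recognition device 1 provided by an embodiment of the utility model comprises: a voice acquisition module 11, a voice noise reduction module 12 electrically connected to the voice acquisition module 11, a speech recognition module 13 electrically connected to the voice noise reduction module 12, an interface conversion module 14 electrically connected to the speech recognition module 13, and a parallel pin header interface 15 electrically connected to the interface conversion module 14. The voice acquisition module 11 captures speech; the voice noise reduction module 12 reduces noise in the captured speech; the speech recognition module 13 recognizes the noise-reduced speech and converts it into a control instruction; the interface conversion module 14 converts the control instruction into a digital signal; the parallel pin header interface 15 plugs into the electronic device 2, and the digital signal is sent to the electronic device 2 through the parallel pin header interface 15. Simple instructions can therefore be answered quickly without connecting to a server over a network.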
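The signal chain of modules 11 to 15 can be pictured as a short software pipeline. The following is only a minimal sketch under stated assumptions: the source describes hardware modules and gives no algorithms, so the noise gate, the `COMMANDS` table, the `transcribe` callback and the 7-bit word width (an 8-pin header with one pin assumed reserved for power) are all hypothetical stand-ins.

```python
from typing import Callable, List, Optional, Sequence

# Hypothetical table: recognized phrase -> control instruction code.
COMMANDS = {"volume up": 1, "volume down": 2, "pause": 3}


def denoise(samples: Sequence[float], threshold: float = 0.05) -> List[float]:
    """Toy noise gate standing in for the noise reduction module (12)."""
    return [s if abs(s) >= threshold else 0.0 for s in samples]


def recognize(samples: Sequence[float],
              transcribe: Callable[[Sequence[float]], str]) -> Optional[int]:
    """Stand-in for the recognition module (13): speech -> control instruction.

    `transcribe` is a hypothetical local recognizer supplied by the caller.
    """
    return COMMANDS.get(transcribe(samples))


def convert(instruction: int, width: int = 7) -> str:
    """Stand-in for the interface conversion module (14): instruction -> bits."""
    return format(instruction, "0{}b".format(width))


def run_pipeline(samples: Sequence[float],
                 transcribe: Callable[[Sequence[float]], str],
                 send: Callable[[str], None]) -> bool:
    """Chain the stages; `send` models the parallel pin header interface (15)."""
    instruction = recognize(denoise(samples), transcribe)
    if instruction is None:
        return False  # nothing recognized, so nothing reaches the host
    send(convert(instruction))
    return True
```

Every stage runs locally, which mirrors the point of the paragraph above: no step in the chain needs a network round trip.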
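In a preferred embodiment, the voice acquisition module 11 comprises a single microphone and dual microphones, and the parallel pin header interface 15 is an 8-pin header interface for transmitting binary control signals. One of the pins of the parallel pin header interface 15 is electrically connected to the power supply of the electronic device 2 so as to power the speech recognition device 1, so no separate power supply is needed for the speech recognition device 1.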
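To illustrate how a binary control signal might be laid out on such a header, here is a hedged sketch. The source only states that the header has 8 pins and that one pin carries power; the pin numbering, the choice of pin 8 for power, the use of the remaining seven pins as data lines and the bit order are all assumptions made for illustration.

```python
from typing import Dict

POWER_PIN = 8                       # assumed: pin 8 is fed by the host's supply
DATA_PINS = (1, 2, 3, 4, 5, 6, 7)   # assumed: pin 1 is the least significant bit


def encode(command: int) -> Dict[int, int]:
    """Map a command code to logic levels (0 or 1) on the assumed data pins."""
    if not 0 <= command < 2 ** len(DATA_PINS):
        raise ValueError("command does not fit on the data pins")
    return {pin: (command >> bit) & 1 for bit, pin in enumerate(DATA_PINS)}


def decode(levels: Dict[int, int]) -> int:
    """Host-side inverse of encode(): read the pin levels back into a code."""
    return sum(levels[pin] << bit for bit, pin in enumerate(DATA_PINS))
```

Under these assumptions seven data lines distinguish at most 128 command codes, which is consistent with the stated goal of handling simple operating instructions.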
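In a preferred embodiment, the speech recognition device 1 further comprises a voice prompt module 16 electrically connected to the speech recognition module 13; the voice prompt module 16 emits a prompt voice when the speech recognition module 13 recognizes speech successfully. The speech recognition device 1 further comprises a display module 17 electrically connected to the speech recognition module 13; the display module 17 displays the working state of the speech recognition module 13 and displays a prompt message when speech recognition succeeds. The speech recognition device 1 further comprises a setting key 18 electrically connected to the speech recognition module 13; the setting key 18 switches the speech recognition type of the speech recognition module 13. For example, the speech recognition device 1 holds databases for Chinese, Mandarin, dialects, English and so on; the speech recognition module 13 switches to a different recognition type according to the setting key 18 and recognizes speech using the corresponding database.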
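The setting-key behaviour can be sketched as a selector that steps through the available databases. This is an illustration only: the source does not say that the key cycles in a fixed order, and the class name, the two example databases and their phrase tables are hypothetical.

```python
from typing import Dict, Optional

# Illustrative subset of the recognition types named in the text.
# The phrases and instruction codes are hypothetical.
EXAMPLE_DATABASES: Dict[str, Dict[str, int]] = {
    "Mandarin": {"暂停": 3},
    "English": {"pause": 3},
}


class RecognitionTypeSelector:
    """Tracks which database the recognition module (13) currently uses."""

    def __init__(self, databases: Dict[str, Dict[str, int]]) -> None:
        if not databases:
            raise ValueError("at least one database is required")
        self._databases = databases
        self._types = list(databases)  # dicts keep insertion order
        self._index = 0

    @property
    def current(self) -> str:
        return self._types[self._index]

    def press(self) -> str:
        """Model one press of the setting key (18): advance to the next type."""
        self._index = (self._index + 1) % len(self._types)
        return self.current

    def lookup(self, phrase: str) -> Optional[int]:
        """Look a phrase up in the currently selected database only."""
        return self._databases[self.current].get(phrase)
```

A phrase is matched only against the selected database, so an English command is ignored while the selector is set to Mandarin.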
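In a preferred embodiment, the speech recognition device 1 further comprises an infrared sensing module 19 electrically connected to the voice acquisition module 11; the infrared sensing module 19 turns on the voice acquisition module 11 when it senses a human body, so that speech can be recognized in real time while the power consumption of the speech recognition device is reduced.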
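The infrared gating can be modelled as a flag that decides whether the microphone is read at all. This is a minimal sketch: the source only states that acquisition is turned on when a human body is sensed, so switching acquisition off again when nobody is detected, as well as the class and method names, are assumptions.

```python
from typing import Callable, List, Optional


class InfraredGate:
    """Enables the acquisition module (11) only while a person is sensed."""

    def __init__(self) -> None:
        self.acquisition_enabled = False  # microphone idle until presence is seen

    def on_sensor(self, human_detected: bool) -> None:
        """Update the gate from the infrared sensing module (19).

        Disabling on absence is an assumption, not stated in the source.
        """
        self.acquisition_enabled = human_detected

    def capture(self, read_microphone: Callable[[], List[float]]) -> Optional[List[float]]:
        """Read audio only when enabled, leaving the microphone untouched otherwise."""
        if not self.acquisition_enabled:
            return None
        return read_microphone()
```

Because `read_microphone` is never called while the gate is closed, the downstream noise reduction and recognition stages also stay idle, which is where the power saving would come from.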
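As shown in Fig. 2, the speech recognition system provided by an embodiment of the utility model comprises the electronic device 2 and the speech recognition device 1 described above; the electronic device 2 is provided with a socket 21 corresponding to the parallel pin header interface 15. Here the electronic device 2 is a smart speaker.
In the speech recognition device 1 and system provided by the utility model, the voice acquisition module 11 captures speech, the voice noise reduction module 12 reduces noise in the captured speech, the speech recognition module 13 recognizes the noise-reduced speech and converts it into a control instruction, and the interface conversion module 14 converts the control instruction into a digital signal that is sent to the electronic device 2 through the parallel pin header 15, so that no cloud-based recognition is needed and simple operating instructions receive a quick response.
The above embodiments are only preferred embodiments of the utility model and do not limit its scope of protection; any insubstantial change or substitution made by a person skilled in the art on the basis of the utility model falls within the scope of protection claimed by the utility model.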

Claims (10)

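1. A speech recognition device, characterized by comprising: a voice acquisition module, a voice noise reduction module electrically connected to the voice acquisition module, a speech recognition module electrically connected to the voice noise reduction module, an interface conversion module electrically connected to the speech recognition module, and a parallel pin header interface electrically connected to the interface conversion module; the voice acquisition module is configured to capture speech, the voice noise reduction module is configured to reduce noise in the captured speech, the speech recognition module is configured to recognize the noise-reduced speech and convert it into a control instruction, the interface conversion module is configured to convert the control instruction into a digital signal, the parallel pin header interface plugs into an electronic device, and the digital signal is sent to the electronic device through the parallel pin header interface.
2. The speech recognition device according to claim 1, characterized in that the parallel pin header interface is an 8-pin header interface for transmitting binary control signals.
3. The speech recognition device according to claim 2, characterized in that one of the pins of the parallel pin header interface is electrically connected to the power supply of the electronic device so as to power the speech recognition device.
4. The speech recognition device according to claim 1, characterized in that the voice acquisition module comprises a single microphone and dual microphones.
5. The speech recognition device according to claim 1, characterized by further comprising a voice prompt module electrically connected to the speech recognition module, the voice prompt module being configured to emit a prompt voice when the speech recognition module recognizes speech successfully.
6. The speech recognition device according to claim 5, characterized by further comprising a display module electrically connected to the speech recognition module, the display module being configured to display the working state of the speech recognition module and to display a prompt message when speech recognition succeeds.
7. The speech recognition device according to claim 1, characterized by further comprising a setting key electrically connected to the speech recognition module, the setting key being configured to switch the speech recognition type of the speech recognition module.
8. The speech recognition device according to claim 1, characterized by further comprising an infrared sensing module electrically connected to the voice acquisition module, the infrared sensing module being configured to turn on the voice acquisition module when it senses a human body.
9. A speech recognition system, characterized by comprising an electronic device and the speech recognition device according to any one of claims 1 to 8, the electronic device being provided with a socket corresponding to the parallel pin header interface.
10. The speech recognition system according to claim 9, characterized in that the electronic device is a smart speaker.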
CN201822144428.XU 2018-12-20 2018-12-20 Speech recognition device and system Expired - Fee Related CN209571226U (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201822144428.XU CN209571226U (zh) 2018-12-20 2018-12-20 Speech recognition device and system
US16/427,335 US20200202851A1 (en) 2018-12-20 2019-05-30 Speech recognition device and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201822144428.XU CN209571226U (zh) 2018-12-20 2018-12-20 Speech recognition device and system

Publications (1)

Publication Number Publication Date
CN209571226U (zh) 2019-11-01

Family

ID=68330437

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201822144428.XU Expired - Fee Related CN209571226U (zh) 2018-12-20 2018-12-20 Speech recognition device and system

Country Status (2)

Country Link
US (1) US20200202851A1 (zh)
CN (1) CN209571226U (zh)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11437031B2 (en) * 2019-07-30 2022-09-06 Qualcomm Incorporated Activating speech recognition based on hand patterns detected using plurality of filters
KR20230059029A (ko) * 2021-10-25 2023-05-03 Samsung Electronics Co., Ltd. Electronic device and operating method thereof

Also Published As

Publication number Publication date
US20200202851A1 (en) 2020-06-25

Similar Documents

Publication Publication Date Title
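CN105957514B (zh) Portable communication device for deaf-mute people
CN108000526B (zh) Dialogue interaction method and system for an intelligent robot
CN106920548A (zh) Voice control device, voice control system and voice control method
CN108009490A (zh) Chatbot system based on emotion recognition and judgment method of the system
CN102298694A (zh) Human-computer interaction recognition system applied to remote information services
CN209571226U (zh) Speech recognition device and system
CN102737629A (zh) Embedded speech emotion recognition method and device
CN105807925A (zh) Lip-reading recognition system and method based on flexible electronic skin
CN109101663A (zh) Internet-based robot dialogue system
CN107919117A (zh) Active voice assistant based on face recognition
CN109036430A (zh) Voice control terminal
CN102890931A (zh) Method for improving the speech recognition rate
CN111384778A (zh) Intelligent operation and maintenance system for power distribution network equipment
CN204379948U (zh) Intelligent multi-dimensional interactive shouting catharsis device
CN207654512U (zh) Intelligent interactive catharsis instrument
CN105916069A (zh) Smart microphone capable of converting speech into text in real time
CN206892866U (zh) Intelligent dialogue device with scenario analysis function
CN208538474U (zh) Speech recognition system
CN216387797U (zh) Voice-controlled vehicle storage and retrieval device for parking equipment
CN207950303U (zh) Pressure-based voice psychological interaction guidance system device
CN210575088U (zh) Speech recognition home appliance control device
CN209543926U (zh) Lamp device for voice-controlled outdoor lighting
CN208985692U (zh) Voice control terminal
CN205670785U (zh) Landline telephone capable of voice dialing
CN112399020A (zh) Intelligent voice customer service system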

Legal Events

Date Code Title Description
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20191101