GB201414086D0 - Speech recognition system - Google Patents
Speech recognition systemInfo
- Publication number
- GB201414086D0 GB201414086D0 GBGB1414086.7A GB201414086A GB201414086D0 GB 201414086 D0 GB201414086 D0 GB 201414086D0 GB 201414086 A GB201414086 A GB 201414086A GB 201414086 D0 GB201414086 D0 GB 201414086D0
- Authority
- GB
- United Kingdom
- Prior art keywords
- speech recognition
- recognition system
- speech
- recognition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/227—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of the speaker; Human-factor methodology
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Circuit For Audible Band Transducer (AREA)
- General Health & Medical Sciences (AREA)
- Otolaryngology (AREA)
- Signal Processing (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/974,399 US9847082B2 (en) | 2013-08-23 | 2013-08-23 | System for modifying speech recognition and beamforming using a depth image |
Publications (3)
Publication Number | Publication Date |
---|---|
GB201414086D0 true GB201414086D0 (en) | 2014-09-24 |
GB2518512A GB2518512A (en) | 2015-03-25 |
GB2518512B GB2518512B (en) | 2017-09-13 |
Family
ID=51629515
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
GB1414086.7A Active GB2518512B (en) | 2013-08-23 | 2014-08-08 | Speech recognition system |
Country Status (2)
Country | Link |
---|---|
US (1) | US9847082B2 (en) |
GB (1) | GB2518512B (en) |
Families Citing this family (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP6433903B2 (en) * | 2013-08-29 | 2018-12-05 | パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカPanasonic Intellectual Property Corporation of America | Speech recognition method and speech recognition apparatus |
CN106797413B (en) * | 2014-09-30 | 2019-09-27 | 惠普发展公司,有限责任合伙企业 | Sound is adjusted |
US9754607B2 (en) * | 2015-08-26 | 2017-09-05 | Apple Inc. | Acoustic scene interpretation systems and related methods |
US10079031B2 (en) * | 2015-09-23 | 2018-09-18 | Marvell World Trade Ltd. | Residual noise suppression |
US9900685B2 (en) * | 2016-03-24 | 2018-02-20 | Intel Corporation | Creating an audio envelope based on angular information |
JP6703460B2 (en) * | 2016-08-25 | 2020-06-03 | 本田技研工業株式会社 | Audio processing device, audio processing method, and audio processing program |
WO2018168427A1 (en) * | 2017-03-13 | 2018-09-20 | ソニー株式会社 | Learning device, learning method, speech synthesizer, and speech synthesis method |
CN107135443B (en) * | 2017-03-29 | 2020-06-23 | 联想(北京)有限公司 | Signal processing method and electronic equipment |
US10339929B2 (en) | 2017-06-27 | 2019-07-02 | Google Llc | Speech recognition using acoustic features in conjunction with distance information |
JP6927308B2 (en) * | 2017-07-26 | 2021-08-25 | 日本電気株式会社 | Voice control device and its control method |
US20190129027A1 (en) * | 2017-11-02 | 2019-05-02 | Fluke Corporation | Multi-modal acoustic imaging tool |
US10783882B2 (en) | 2018-01-03 | 2020-09-22 | International Business Machines Corporation | Acoustic change detection for robust automatic speech recognition based on a variance between distance dependent GMM models |
CN108470568B (en) * | 2018-01-22 | 2021-03-23 | 科大讯飞股份有限公司 | Intelligent device control method and device, storage medium and electronic device |
US10679621B1 (en) * | 2018-03-21 | 2020-06-09 | Amazon Technologies, Inc. | Speech processing optimizations based on microphone array |
CN108616790B (en) * | 2018-04-24 | 2021-01-26 | 京东方科技集团股份有限公司 | Pickup playback circuit and system, and pickup playback switching method |
US11011162B2 (en) | 2018-06-01 | 2021-05-18 | Soundhound, Inc. | Custom acoustic models |
US11222652B2 (en) | 2019-07-19 | 2022-01-11 | Apple Inc. | Learning-based distance estimation |
US12126971B2 (en) * | 2020-12-23 | 2024-10-22 | Intel Corporation | Acoustic signal processing adaptive to user-to-microphone distances |
Family Cites Families (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6243683B1 (en) | 1998-12-29 | 2001-06-05 | Intel Corporation | Video control of speech recognition |
US6449593B1 (en) * | 2000-01-13 | 2002-09-10 | Nokia Mobile Phones Ltd. | Method and system for tracking human speakers |
JP2003131683A (en) * | 2001-10-22 | 2003-05-09 | Sony Corp | Device and method for voice recognition, and program and recording medium |
KR100519781B1 (en) * | 2004-02-18 | 2005-10-07 | 삼성전자주식회사 | Object tracking method and apparatus |
KR101415026B1 (en) * | 2007-11-19 | 2014-07-04 | 삼성전자주식회사 | Method and apparatus for acquiring the multi-channel sound with a microphone array |
US8233077B2 (en) * | 2007-12-27 | 2012-07-31 | Qualcomm Incorporated | Method and apparatus with depth map generation |
US9445193B2 (en) * | 2008-07-31 | 2016-09-13 | Nokia Technologies Oy | Electronic device directional audio capture |
JP5434231B2 (en) * | 2009-04-24 | 2014-03-05 | ソニー株式会社 | Image information processing apparatus, imaging apparatus, image information processing method, and program |
US8174932B2 (en) * | 2009-06-11 | 2012-05-08 | Hewlett-Packard Development Company, L.P. | Multimodal object localization |
US20110091055A1 (en) * | 2009-10-19 | 2011-04-21 | Broadcom Corporation | Loudspeaker localization techniques |
US8676581B2 (en) * | 2010-01-22 | 2014-03-18 | Microsoft Corporation | Speech recognition analysis via identification information |
GB2477793A (en) * | 2010-02-15 | 2011-08-17 | Sony Corp | A method of creating a stereoscopic image in a client device |
US8635066B2 (en) | 2010-04-14 | 2014-01-21 | T-Mobile Usa, Inc. | Camera-assisted noise cancellation and speech recognition |
US8296151B2 (en) * | 2010-06-18 | 2012-10-23 | Microsoft Corporation | Compound gesture-speech commands |
KR101750338B1 (en) | 2010-09-13 | 2017-06-23 | 삼성전자주식회사 | Method and apparatus for microphone Beamforming |
US9496841B2 (en) * | 2010-10-21 | 2016-11-15 | Nokia Technologies Oy | Recording level adjustment using a distance to a sound source |
US9491441B2 (en) * | 2011-08-30 | 2016-11-08 | Microsoft Technology Licensing, Llc | Method to extend laser depth map range |
EP2766901B1 (en) | 2011-10-17 | 2016-09-21 | Nuance Communications, Inc. | Speech signal enhancement using visual information |
TWI489326B (en) * | 2012-06-05 | 2015-06-21 | Wistron Corp | Operating area determination method and system |
US9092394B2 (en) * | 2012-06-15 | 2015-07-28 | Honda Motor Co., Ltd. | Depth based context identification |
US9258644B2 (en) | 2012-07-27 | 2016-02-09 | Nokia Technologies Oy | Method and apparatus for microphone beamforming |
WO2014080074A1 (en) * | 2012-11-20 | 2014-05-30 | Nokia Corporation | Spatial audio enhancement apparatus |
US9236050B2 (en) * | 2013-03-14 | 2016-01-12 | Vocollect Inc. | System and method for improving speech recognition accuracy in a work environment |
EP2974273A4 (en) * | 2013-03-15 | 2018-01-10 | Jibo, Inc. | Apparatus and methods for providing a persistent companion device |
US9280972B2 (en) | 2013-05-10 | 2016-03-08 | Microsoft Technology Licensing, Llc | Speech to text conversion |
KR102150013B1 (en) * | 2013-06-11 | 2020-08-31 | 삼성전자주식회사 | Beamforming method and apparatus for sound signal |
CN104427291B (en) * | 2013-08-19 | 2018-09-28 | 华为技术有限公司 | A kind of image processing method and equipment |
US20150123890A1 (en) * | 2013-11-04 | 2015-05-07 | Microsoft Corporation | Two hand natural user input |
-
2013
- 2013-08-23 US US13/974,399 patent/US9847082B2/en active Active
-
2014
- 2014-08-08 GB GB1414086.7A patent/GB2518512B/en active Active
Also Published As
Publication number | Publication date |
---|---|
GB2518512B (en) | 2017-09-13 |
US9847082B2 (en) | 2017-12-19 |
US20150058003A1 (en) | 2015-02-26 |
GB2518512A (en) | 2015-03-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
GB2515527B (en) | Speech Recognition | |
GB2518512B (en) | Speech recognition system | |
HK1215614A1 (en) | Speech transaction processing | |
GB201600614D0 (en) | Utilizing voice biometrics | |
GB2524222B (en) | Activating speech processing | |
GB2584264B (en) | Processing received speech data | |
EP3022733A4 (en) | Multi-level speech recognition | |
HK1206862A1 (en) | Method for voice recognition and system thereof | |
EP3042342A4 (en) | Pattern recognition system | |
GB201600616D0 (en) | Utilizing voice biometrics | |
EP2959474A4 (en) | Hybrid performance scaling or speech recognition | |
GB201506046D0 (en) | Speech recognition | |
ZA201505388B (en) | Character recognition method | |
GB201317910D0 (en) | Speech processing | |
GB2520048B (en) | Speech processing system | |
GB201322979D0 (en) | Co-talker nulling for automatic speech recognition systems | |
GB201600613D0 (en) | Utilizing voice biometrics | |
EP2950291A4 (en) | Road environment recognition system | |
GB201307513D0 (en) | Secure voice transactions | |
ZA201600072B (en) | Guide system | |
GB2553683B (en) | Speech recognition | |
GB201311375D0 (en) | Speech Recognition | |
GB2515528B (en) | Speech Recognition | |
GB2531964B (en) | Speech recognition | |
GB201300715D0 (en) | Speech coding |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
732E | Amendments to the register in respect of changes of name or changes affecting rights (sect. 32/1977) |
Free format text: REGISTERED BETWEEN 20201126 AND 20201202 |