GB201414086D0 - Speech recognition system - Google Patents

Speech recognition system

Info

Publication number
GB201414086D0
Authority
GB
United Kingdom
Prior art keywords
speech recognition
recognition system
speech
recognition
Prior art date
Legal status
Granted
Application number
GBGB1414086.7A
Other versions
GB2518512B (en)
GB2518512A (en)
Current Assignee
Honeywell International Inc
Original Assignee
Honeywell International Inc
Priority date
Filing date
Publication date
Application filed by Honeywell International Inc
Publication of GB201414086D0
Publication of GB2518512A
Application granted
Publication of GB2518512B
Current legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/20 Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226 Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226 Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
    • G10L2015/227 Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of the speaker; Human-factor methodology
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161 Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166 Microphone arrays; Beamforming

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • General Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Signal Processing (AREA)

Applications Claiming Priority (1)

Application Number Publication Number Priority Date Filing Date Title
US13/974,399 US9847082B2 (en) 2013-08-23 2013-08-23 System for modifying speech recognition and beamforming using a depth image

Publications (3)

Publication Number Publication Date
GB201414086D0 (en) 2014-09-24
GB2518512A (en) 2015-03-25
GB2518512B (en) 2017-09-13

Family

ID=51629515

Family Applications (1)

Application Number Status Publication Number Priority Date Filing Date Title
GB1414086.7A Active GB2518512B (en) 2013-08-23 2014-08-08 Speech recognition system

Country Status (2)

Country Link
US (1) US9847082B2 (en)
GB (1) GB2518512B (en)

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6433903B2 (en) * 2013-08-29 2018-12-05 Panasonic Intellectual Property Corporation of America Speech recognition method and speech recognition apparatus
CN106797413B (en) * 2014-09-30 2019-09-27 惠普发展公司,有限责任合伙企业 Sound is adjusted
US9754607B2 (en) * 2015-08-26 2017-09-05 Apple Inc. Acoustic scene interpretation systems and related methods
US10079031B2 (en) * 2015-09-23 2018-09-18 Marvell World Trade Ltd. Residual noise suppression
US9900685B2 (en) * 2016-03-24 2018-02-20 Intel Corporation Creating an audio envelope based on angular information
JP6703460B2 (en) * 2016-08-25 2020-06-03 本田技研工業株式会社 Audio processing device, audio processing method, and audio processing program
WO2018168427A1 (en) * 2017-03-13 2018-09-20 ソニー株式会社 Learning device, learning method, speech synthesizer, and speech synthesis method
CN107135443B (en) * 2017-03-29 2020-06-23 联想(北京)有限公司 Signal processing method and electronic equipment
US10339929B2 (en) 2017-06-27 2019-07-02 Google Llc Speech recognition using acoustic features in conjunction with distance information
JP6927308B2 (en) * 2017-07-26 2021-08-25 日本電気株式会社 Voice control device and its control method
US20190129027A1 (en) * 2017-11-02 2019-05-02 Fluke Corporation Multi-modal acoustic imaging tool
US10783882B2 (en) 2018-01-03 2020-09-22 International Business Machines Corporation Acoustic change detection for robust automatic speech recognition based on a variance between distance dependent GMM models
CN108470568B (en) * 2018-01-22 2021-03-23 科大讯飞股份有限公司 Intelligent device control method and device, storage medium and electronic device
US10679621B1 (en) * 2018-03-21 2020-06-09 Amazon Technologies, Inc. Speech processing optimizations based on microphone array
CN108616790B (en) * 2018-04-24 2021-01-26 京东方科技集团股份有限公司 Pickup playback circuit and system, and pickup playback switching method
US11011162B2 (en) 2018-06-01 2021-05-18 Soundhound, Inc. Custom acoustic models
US11222652B2 (en) 2019-07-19 2022-01-11 Apple Inc. Learning-based distance estimation
US12126971B2 (en) * 2020-12-23 2024-10-22 Intel Corporation Acoustic signal processing adaptive to user-to-microphone distances

Family Cites Families (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6243683B1 (en) 1998-12-29 2001-06-05 Intel Corporation Video control of speech recognition
US6449593B1 (en) * 2000-01-13 2002-09-10 Nokia Mobile Phones Ltd. Method and system for tracking human speakers
JP2003131683A (en) * 2001-10-22 2003-05-09 Sony Corp Device and method for voice recognition, and program and recording medium
KR100519781B1 (en) * 2004-02-18 2005-10-07 삼성전자주식회사 Object tracking method and apparatus
KR101415026B1 (en) * 2007-11-19 2014-07-04 삼성전자주식회사 Method and apparatus for acquiring the multi-channel sound with a microphone array
US8233077B2 (en) * 2007-12-27 2012-07-31 Qualcomm Incorporated Method and apparatus with depth map generation
US9445193B2 (en) * 2008-07-31 2016-09-13 Nokia Technologies Oy Electronic device directional audio capture
JP5434231B2 (en) * 2009-04-24 2014-03-05 ソニー株式会社 Image information processing apparatus, imaging apparatus, image information processing method, and program
US8174932B2 (en) * 2009-06-11 2012-05-08 Hewlett-Packard Development Company, L.P. Multimodal object localization
US20110091055A1 (en) * 2009-10-19 2011-04-21 Broadcom Corporation Loudspeaker localization techniques
US8676581B2 (en) * 2010-01-22 2014-03-18 Microsoft Corporation Speech recognition analysis via identification information
GB2477793A (en) * 2010-02-15 2011-08-17 Sony Corp A method of creating a stereoscopic image in a client device
US8635066B2 (en) 2010-04-14 2014-01-21 T-Mobile Usa, Inc. Camera-assisted noise cancellation and speech recognition
US8296151B2 (en) * 2010-06-18 2012-10-23 Microsoft Corporation Compound gesture-speech commands
KR101750338B1 (en) 2010-09-13 2017-06-23 삼성전자주식회사 Method and apparatus for microphone Beamforming
US9496841B2 (en) * 2010-10-21 2016-11-15 Nokia Technologies Oy Recording level adjustment using a distance to a sound source
US9491441B2 (en) * 2011-08-30 2016-11-08 Microsoft Technology Licensing, Llc Method to extend laser depth map range
EP2766901B1 (en) 2011-10-17 2016-09-21 Nuance Communications, Inc. Speech signal enhancement using visual information
TWI489326B (en) * 2012-06-05 2015-06-21 Wistron Corp Operating area determination method and system
US9092394B2 (en) * 2012-06-15 2015-07-28 Honda Motor Co., Ltd. Depth based context identification
US9258644B2 (en) 2012-07-27 2016-02-09 Nokia Technologies Oy Method and apparatus for microphone beamforming
WO2014080074A1 (en) * 2012-11-20 2014-05-30 Nokia Corporation Spatial audio enhancement apparatus
US9236050B2 (en) * 2013-03-14 2016-01-12 Vocollect Inc. System and method for improving speech recognition accuracy in a work environment
EP2974273A4 (en) * 2013-03-15 2018-01-10 Jibo, Inc. Apparatus and methods for providing a persistent companion device
US9280972B2 (en) 2013-05-10 2016-03-08 Microsoft Technology Licensing, Llc Speech to text conversion
KR102150013B1 (en) * 2013-06-11 2020-08-31 삼성전자주식회사 Beamforming method and apparatus for sound signal
CN104427291B (en) * 2013-08-19 2018-09-28 华为技术有限公司 A kind of image processing method and equipment
US20150123890A1 (en) * 2013-11-04 2015-05-07 Microsoft Corporation Two hand natural user input

Also Published As

Publication number Publication date
GB2518512B (en) 2017-09-13
US9847082B2 (en) 2017-12-19
US20150058003A1 (en) 2015-02-26
GB2518512A (en) 2015-03-25

Similar Documents

Publication Publication Date Title
GB2515527B (en) Speech Recognition
GB2518512B (en) Speech recognition system
HK1215614A1 (en) Speech transaction processing
GB201600614D0 (en) Utilizing voice biometrics
GB2524222B (en) Activating speech processing
GB2584264B (en) Processing received speech data
EP3022733A4 (en) Multi-level speech recognition
HK1206862A1 (en) Method for voice recognition and system thereof
EP3042342A4 (en) Pattern recognition system
GB201600616D0 (en) Utilizing voice biometrics
EP2959474A4 (en) Hybrid performance scaling or speech recognition
GB201506046D0 (en) Speech recognition
ZA201505388B (en) Character recognition method
GB201317910D0 (en) Speech processing
GB2520048B (en) Speech processing system
GB201322979D0 (en) Co-talker nulling for automatic speech recognition systems
GB201600613D0 (en) Utilizing voice biometrics
EP2950291A4 (en) Road environment recognition system
GB201307513D0 (en) Secure voice transactions
ZA201600072B (en) Guide system
GB2553683B (en) Speech recognition
GB201311375D0 (en) Speech Recognition
GB2515528B (en) Speech Recognition
GB2531964B (en) Speech recognition
GB201300715D0 (en) Speech coding

Legal Events

Date Code Title Description
732E Amendments to the register in respect of changes of name or changes affecting rights (sect. 32/1977)

Free format text: REGISTERED BETWEEN 20201126 AND 20201202