BR0206910A - Método e aparelho para a reconstrução da fala em um sistema de reconhecimento de fala distribuìdo - Google Patents

Método e aparelho para a reconstrução da fala em um sistema de reconhecimento de fala distribuìdo

Info

Publication number
BR0206910A
BR0206910A BR0206910-5A BR0206910A BR0206910A BR 0206910 A BR0206910 A BR 0206910A BR 0206910 A BR0206910 A BR 0206910A BR 0206910 A BR0206910 A BR 0206910A
Authority
BR
Brazil
Prior art keywords
data
speech
encoded
recognition system
reconstruction
Prior art date
Application number
BR0206910-5A
Other languages
English (en)
Inventor
William M Kushner
Jeffrey Meunier
Mark A Jasiuk
Tenkasi V Ramabadran
Original Assignee
Motorola Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Motorola Inc filed Critical Motorola Inc
Publication of BR0206910A publication Critical patent/BR0206910A/pt

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/093Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters using sinusoidal excitation models
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Telephonic Communication Services (AREA)

Abstract

"MéTODO E APARELHO PARA A RECONSTRUçãO DA FALA EM UM SISTEMA DE RECONHECIMENTO DE FALA DISTRIBUìDO". Em um sistema de reconhecimento de fala distribuído (20) que compreende um primeiro dispositivo de comunicação (22) que recebe uma entrada de fala (34), codifica dados representativos da entrada de fala (36, 38), e transmite os dados codificados (42) e um segundo dispositivo de comunicação localizado remotamente (26) que recebe os dados codificados (44) e compara os dados codificados com um conjunto de dados conhecido, o segundo dispositivo (26) incluindo um processador (92) com um programa que controla o processador (92) para operar de acordo com um método de reconstrução da entrada de fala incluindo a etapa (44) de receber dados codificados incluindo dados espectrais codificados e dados de energia codificados. O método inclui ainda a etapa (46, 48) de decodificar os dados espectrais codificados e os dados de energia codificados para determinar os dados espectrais e os dados de energia. O método também inclui a etapa (50, 52) de combinar os dados espectrais e os dados de energia para reconstruir a entrada de fala.
BR0206910-5A 2001-02-02 2002-01-18 Método e aparelho para a reconstrução da fala em um sistema de reconhecimento de fala distribuìdo BR0206910A (pt)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/775,951 US6633839B2 (en) 2001-02-02 2001-02-02 Method and apparatus for speech reconstruction in a distributed speech recognition system
PCT/US2002/001481 WO2002062120A2 (en) 2001-02-02 2002-01-18 Method and apparatus for speech reconstruction in a distributed speech recognition system

Publications (1)

Publication Number Publication Date
BR0206910A true BR0206910A (pt) 2004-12-14

Family

ID=25106035

Family Applications (1)

Application Number Title Priority Date Filing Date
BR0206910-5A BR0206910A (pt) 2001-02-02 2002-01-18 Método e aparelho para a reconstrução da fala em um sistema de reconhecimento de fala distribuìdo

Country Status (6)

Country Link
US (1) US6633839B2 (pt)
EP (2) EP2945154A1 (pt)
CN (1) CN1327405C (pt)
AU (1) AU2002243594A1 (pt)
BR (1) BR0206910A (pt)
WO (1) WO2002062120A2 (pt)

Families Citing this family (61)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6006174A (en) * 1990-10-03 1999-12-21 Interdigital Technology Coporation Multiple impulse excitation speech encoder and decoder
US7392185B2 (en) 1999-11-12 2008-06-24 Phoenix Solutions, Inc. Speech based learning/training system using semantic decoding
US9076448B2 (en) * 1999-11-12 2015-07-07 Nuance Communications, Inc. Distributed real time speech recognition system
US7050977B1 (en) 1999-11-12 2006-05-23 Phoenix Solutions, Inc. Speech-enabled server for internet website and method
US7725307B2 (en) 1999-11-12 2010-05-25 Phoenix Solutions, Inc. Query engine for processing voice based queries including semantic decoding
GB2363236B (en) * 2000-06-05 2002-06-12 Motorola Inc Method and apparatus for mitigating the effect of transmission errors in a distributed speech recognition process and system
US7047196B2 (en) 2000-06-08 2006-05-16 Agiletv Corporation System and method of voice recognition near a wireline node of a network supporting cable television and/or video delivery
US20030004720A1 (en) * 2001-01-30 2003-01-02 Harinath Garudadri System and method for computing and transmitting parameters in a distributed voice recognition system
US8095370B2 (en) * 2001-02-16 2012-01-10 Agiletv Corporation Dual compression voice recordation non-repudiation system
US7941313B2 (en) * 2001-05-17 2011-05-10 Qualcomm Incorporated System and method for transmitting speech activity information ahead of speech features in a distributed voice recognition system
US7366712B2 (en) * 2001-05-31 2008-04-29 Intel Corporation Information retrieval center gateway
US7203643B2 (en) * 2001-06-14 2007-04-10 Qualcomm Incorporated Method and apparatus for transmitting speech activity in distributed voice recognition systems
US7353176B1 (en) * 2001-12-20 2008-04-01 Ianywhere Solutions, Inc. Actuation system for an agent oriented architecture
US20030139929A1 (en) * 2002-01-24 2003-07-24 Liang He Data transmission system and method for DSR application over GPRS
US7062444B2 (en) * 2002-01-24 2006-06-13 Intel Corporation Architecture for DSR client and server development platform
US7024353B2 (en) * 2002-08-09 2006-04-04 Motorola, Inc. Distributed speech recognition with back-end voice activity detection apparatus and method
DE10252070B4 (de) * 2002-11-08 2010-07-15 Palm, Inc. (n.d.Ges. d. Staates Delaware), Sunnyvale Kommunikationsendgerät mit parametrierter Bandbreitenerweiterung und Verfahren zur Bandbreitenerweiterung dafür
US7027979B2 (en) 2003-01-14 2006-04-11 Motorola, Inc. Method and apparatus for speech reconstruction within a distributed speech recognition system
US20040148160A1 (en) * 2003-01-23 2004-07-29 Tenkasi Ramabadran Method and apparatus for noise suppression within a distributed speech recognition system
US7305339B2 (en) * 2003-04-01 2007-12-04 International Business Machines Corporation Restoration of high-order Mel Frequency Cepstral Coefficients
US20050071158A1 (en) * 2003-09-25 2005-03-31 Vocollect, Inc. Apparatus and method for detecting user speech
US7496387B2 (en) * 2003-09-25 2009-02-24 Vocollect, Inc. Wireless headset for use in speech recognition environment
US7386443B1 (en) * 2004-01-09 2008-06-10 At&T Corp. System and method for mobile automatic speech recognition
CN101019171B (zh) * 2004-07-23 2011-08-10 意大利电信股份公司 用于生成向量码本的方法、用于压缩数据的方法及装置、以及分布式语音识别系统
BRPI0517246A (pt) * 2004-10-28 2008-10-07 Matsushita Electric Ind Co Ltd aparelho de codificação escalável, aparelho de decodificação escalável e métodos para os mesmos
KR20060066416A (ko) * 2004-12-13 2006-06-16 한국전자통신연구원 음성 코덱을 이용한 후두 원격 진단 서비스 장치 및 그 방법
US7697827B2 (en) 2005-10-17 2010-04-13 Konicek Jeffrey C User-friendlier interfaces for a camera
US8417185B2 (en) 2005-12-16 2013-04-09 Vocollect, Inc. Wireless headset and method for robust voice data communication
US7783488B2 (en) * 2005-12-19 2010-08-24 Nuance Communications, Inc. Remote tracing and debugging of automatic speech recognition servers by speech reconstruction from cepstra and pitch information
US7773767B2 (en) 2006-02-06 2010-08-10 Vocollect, Inc. Headset terminal with rear stability strap
US7885419B2 (en) 2006-02-06 2011-02-08 Vocollect, Inc. Headset terminal with speech functionality
US8271285B2 (en) * 2007-08-02 2012-09-18 International Business Machines Corporation Using speaker identification and verification speech processing technologies to activate and deactivate a payment card
US8306817B2 (en) * 2008-01-08 2012-11-06 Microsoft Corporation Speech recognition with non-linear noise reduction on Mel-frequency cepstra
USD605629S1 (en) 2008-09-29 2009-12-08 Vocollect, Inc. Headset
US20100174539A1 (en) * 2009-01-06 2010-07-08 Qualcomm Incorporated Method and apparatus for vector quantization codebook search
US8160287B2 (en) 2009-05-22 2012-04-17 Vocollect, Inc. Headset with adjustable headband
US8438659B2 (en) 2009-11-05 2013-05-07 Vocollect, Inc. Portable computing device and headset interface
US9082408B2 (en) * 2011-06-13 2015-07-14 Mmodal Ip Llc Speech recognition using loosely coupled components
US8583425B2 (en) * 2011-06-21 2013-11-12 Genband Us Llc Methods, systems, and computer readable media for fricatives and high frequencies detection
US9710768B2 (en) 2011-09-23 2017-07-18 Elwha Llc Acquiring and transmitting event related tasks and subtasks to interface devices
US9437213B2 (en) * 2012-03-05 2016-09-06 Malaspina Labs (Barbados) Inc. Voice signal enhancement
US20130325449A1 (en) 2012-05-31 2013-12-05 Elwha Llc Speech recognition adaptation systems based on adaptation data
US10395672B2 (en) 2012-05-31 2019-08-27 Elwha Llc Methods and systems for managing adaptation data
US9305565B2 (en) 2012-05-31 2016-04-05 Elwha Llc Methods and systems for speech adaptation data
US8843371B2 (en) * 2012-05-31 2014-09-23 Elwha Llc Speech recognition adaptation systems based on adaptation data
US9495966B2 (en) 2012-05-31 2016-11-15 Elwha Llc Speech recognition adaptation systems based on adaptation data
US10431235B2 (en) 2012-05-31 2019-10-01 Elwha Llc Methods and systems for speech adaptation data
US9093069B2 (en) * 2012-11-05 2015-07-28 Nuance Communications, Inc. Privacy-sensitive speech model creation via aggregation of multiple user models
MX370086B (es) 2013-01-08 2019-11-29 Dolby Int Ab Prediccion basada en modelo en un banco de filtros de muestreo critico.
US9449602B2 (en) * 2013-12-03 2016-09-20 Google Inc. Dual uplink pre-processing paths for machine and human listening
TWI506583B (zh) * 2013-12-10 2015-11-01 國立中央大學 分析系統及其方法
US10354422B2 (en) * 2013-12-10 2019-07-16 National Central University Diagram building system and method for a signal data decomposition and analysis
CN107112026A (zh) 2014-10-20 2017-08-29 奥迪马科斯公司 用于智能语音识别和处理的系统、方法和装置
US9817817B2 (en) * 2016-03-17 2017-11-14 International Business Machines Corporation Detection and labeling of conversational actions
US10789534B2 (en) 2016-07-29 2020-09-29 International Business Machines Corporation Measuring mutual understanding in human-computer conversation
US10373630B2 (en) * 2017-03-31 2019-08-06 Intel Corporation Systems and methods for energy efficient and low power distributed automatic speech recognition on wearable devices
CN108766450B (zh) * 2018-04-16 2023-02-17 杭州电子科技大学 一种基于谐波冲激分解的语音转换方法
WO2020053871A1 (en) * 2018-09-13 2020-03-19 Telefonaktiebolaget Lm Ericsson (Publ) Automated plan synthesis and action dispatch
US11151979B2 (en) 2019-08-23 2021-10-19 Tencent America LLC Duration informed attention network (DURIAN) for audio-visual synthesis
CN113066472B (zh) * 2019-12-13 2024-05-31 科大讯飞股份有限公司 合成语音处理方法及相关装置
CN113823089A (zh) * 2021-09-19 2021-12-21 广州丹雅科技有限公司 交通量检测方法、装置、电子设备及可读存储介质

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5247579A (en) 1990-12-05 1993-09-21 Digital Voice Systems, Inc. Methods for speech transmission
US5734789A (en) 1992-06-01 1998-03-31 Hughes Electronics Voiced, unvoiced or noise modes in a CELP vocoder
ZA948426B (en) * 1993-12-22 1995-06-30 Qualcomm Inc Distributed voice recognition system
US5625749A (en) * 1994-08-22 1997-04-29 Massachusetts Institute Of Technology Segment-based apparatus and method for speech recognition by analyzing multiple speech unit frames and modeling both temporal and spatial correlation
US5751903A (en) 1994-12-19 1998-05-12 Hughes Electronics Low rate multi-mode CELP codec that encodes line SPECTRAL frequencies utilizing an offset
US5749073A (en) * 1996-03-15 1998-05-05 Interval Research Corporation System for automatically morphing audio information
US6278970B1 (en) * 1996-03-29 2001-08-21 British Telecommunications Plc Speech transformation using log energy and orthogonal matrix
JP3687181B2 (ja) * 1996-04-15 2005-08-24 ソニー株式会社 有声音/無声音判定方法及び装置、並びに音声符号化方法
US5822729A (en) * 1996-06-05 1998-10-13 Massachusetts Institute Of Technology Feature-based speech recognizer having probabilistic linguistic processor providing word matching based on the entire space of feature vectors
US5918223A (en) 1996-07-22 1999-06-29 Muscle Fish Method and article of manufacture for content-based analysis, storage, retrieval, and segmentation of audio information
US6314392B1 (en) * 1996-09-20 2001-11-06 Digital Equipment Corporation Method and apparatus for clustering-based signal segmentation
US5890111A (en) * 1996-12-24 1999-03-30 Technology Research Association Of Medical Welfare Apparatus Enhancement of esophageal speech by injection noise rejection
US5924065A (en) * 1997-06-16 1999-07-13 Digital Equipment Corporation Environmently compensated speech processing
US6173260B1 (en) 1997-10-29 2001-01-09 Interval Research Corporation System and method for automatic classification of speech based upon affective content
GB2342828A (en) * 1998-10-13 2000-04-19 Nokia Mobile Phones Ltd Speech parameter compression; distributed speech recognition
US6199041B1 (en) 1998-11-20 2001-03-06 International Business Machines Corporation System and method for sampling rate transformation in speech recognition
US6182036B1 (en) * 1999-02-23 2001-01-30 Motorola, Inc. Method of extracting features in a voice recognition system
US6377916B1 (en) 1999-11-29 2002-04-23 Digital Voice Systems, Inc. Multiband harmonic transform coder

Also Published As

Publication number Publication date
AU2002243594A1 (en) 2002-08-19
EP1395978B1 (en) 2015-06-24
EP2945154A1 (en) 2015-11-18
WO2002062120A3 (en) 2003-12-18
EP1395978A4 (en) 2005-09-21
EP1395978A2 (en) 2004-03-10
WO2002062120A2 (en) 2002-08-15
US6633839B2 (en) 2003-10-14
CN1327405C (zh) 2007-07-18
CN1552059A (zh) 2004-12-01
US20020147579A1 (en) 2002-10-10

Similar Documents

Publication Publication Date Title
BR0206910A (pt) Método e aparelho para a reconstrução da fala em um sistema de reconhecimento de fala distribuìdo
WO1999051033A3 (en) Method and device for modifying data in an encoded data stream
DE69903421D1 (de) Authentifizierungsverfahren eines persönlichen kodes eines chipkartenbenützers
BR0115897A (pt) Método e sistema de transferência segura de arquivos
CA2973512A1 (en) Voice recognition system and method of robot system
BR0214042A (pt) Método para pré-processar um dicionário de pronúncia para compressão em um dispositivo de processamento de dados, dispositivo eletrônico para converter uma entrada de uma cadeia de textos em uma seqüência de unidades de fonemas, dispositivo eletrônico configurado para converter a entrada de informação de voz em uma seqüência de unidades de caracter, sistema compreendendo o primeiro e o segundo dispositivos eletrônicos, e, programa de computador
DE60125397D1 (de) Sprachunabhängige stimmbasierte benutzeroberfläche
BR9606800B1 (pt) método e aparelho para detecção e desvio de sintetização de voz tipo "tandem".
BR0112478A (pt) Método e sistema para facilitar uma transação sem fio
BR0208692A (pt) Método e aparelho para autenticação utilizando tecnologia sim de acesso múltiplo remoto
DE60109956D1 (de) Vorrichtung und verfahren zur telefonie-basierten spracherkennung für das bereitstellen von informationen zum sortieren von poststücken und paketen.
WO2004068817A3 (fr) Procede et systeme dynamique de securisation d'un reseau de communication au moyen d'agents portables
BRPI0604482A (pt) método de transmissão de dados de usuário e controlador de rede de rádio
DK1389372T3 (da) Testslöjfer til kanal-codecs
Ahn et al. The Effect of Syntactic Complexity on Sentence Repetition Performance and Intelligibility between Specific Language Impairment and Normal Children.
BR0213756A (pt) Sistema e método para configurar remotamente um rádio, e, estrutura de dados em um perfil de usuário localizado em um banco de dados
BR0000301A (pt) Método de verificação da identidade de uma pessoa
WO2002049004A3 (de) Verfahren und anordnung zur spracherkennung für ein kleingerät
NO20052142L (no) Fremgangsmate og anordning for a muliggjore elektroniske transaksjoner
SE0201366L (sv) Framställning av en frekvenskod ur en transformerad bild som representerar ett fingeravtryck att användas vid kontroll av en persons identitet
DK1425588T3 (da) Bestemmelse af den aktive parathormon-aktivitet i en pröve
Descatha Exemple de la création d'un comité scientifique au sein de la Commission internationale de la santé au travail (ICOH): anticipation et Réponse d'Urgence en Santé au Travail (EPROH)
Drozd LANGUAGE AND LINGUISTICS IN POSTMODERN DISCOURSE
Wang et al. Dialogue act analysis of spoken Chinese based on neural networks
Clements et al. Automatic recognition of speech in stressful environments

Legal Events

Date Code Title Description
B25D Requested change of name of applicant approved

Owner name: MOTOROLA SOLUTIONS, INC. (US)

B25A Requested transfer of rights approved

Owner name: MOTOROLA MOBILITY, INC. (US)

B25G Requested change of headquarter approved

Owner name: MOTOROLA MOBILITY, INC. (US)

B25E Requested change of name of applicant rejected

Owner name: MOTOROLA MOBILITY, INC. (US)

Free format text: INDEFERIDA A ALTERACAO DE NOME SOLICITADA ATRAVES DA PETICAO NO 020130025965-RJ, DE 28/03/2013, UMA VEZ QUE NAO FOI PAGA A RESPECTIVA TAXA DE RETRIBUICAO.

B08F Application dismissed because of non-payment of annual fees [chapter 8.6 patent gazette]

Free format text: REFERENTE A 14A ANUIDADE.

B25D Requested change of name of applicant approved

Owner name: MOTOROLA MOBILITY, LLC (US)

B08K Patent lapsed as no evidence of payment of the annual fee has been furnished to inpi [chapter 8.11 patent gazette]

Free format text: EM VIRTUDE DO ARQUIVAMENTO PUBLICADO NA RPI 2344 DE 08-12-2015 E CONSIDERANDO AUSENCIA DE MANIFESTACAO DENTRO DOS PRAZOS LEGAIS, INFORMO QUE CABE SER MANTIDO O ARQUIVAMENTO DO PEDIDO DE PATENTE, CONFORME O DISPOSTO NO ARTIGO 12, DA RESOLUCAO 113/2013.