BRPI0406952A - Class quantization for distributed speech recognition - Google Patents

Class quantization for distributed speech recognition

Info

Publication number
BRPI0406952A
Authority
BR
Brazil
Prior art keywords
class
pitch
frame
codeword
quantization
Prior art date
Application number
BR0406952-8A
Other languages
English (en)
Inventor
Tenkasi V Ramabadran
Alexander Sorin
Original Assignee
Motorola Inc
IBM
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Motorola Inc, IBM
Publication of BRPI0406952A
Publication of BRPI0406952B1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/72 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for transmitting results of analysis
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032 Quantisation or dequantisation of spectral components
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90 Pitch determination of speech signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93 Discriminating between voiced and unvoiced parts of speech signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/28 Constructional details of speech recognition systems
    • G10L15/30 Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93 Discriminating between voiced and unvoiced parts of speech signals
    • G10L2025/935 Mixed voiced class; Transitions

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Telephonic Communication Services (AREA)

Abstract

"CLASS QUANTIZATION FOR DISTRIBUTED SPEECH RECOGNITION". A system, method, and computer-readable medium for quantizing class information and pitch information of audio are disclosed. The method, performed on an information processing system, includes receiving audio and capturing a frame of the audio. The method further includes determining a pitch of the frame and calculating a codeword representing the pitch of the frame, wherein a first codeword value indicates an indefinite pitch. The method further includes determining a class of the frame, wherein the class is any one of at least two classes indicating an indefinite pitch and at least one class indicating a definite pitch. The method further includes calculating a codeword representing the class of the frame, wherein the codeword length is the maximum of the minimum number of bits required to represent the at least two classes indicating an indefinite pitch and the minimum number of bits required to represent the at least one class indicating a definite pitch.
BRPI0406952-8A 2003-02-07 2004-02-05 “Class information quantization for distributed speech recognition” BRPI0406952B1 (pt)
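
The codeword-length rule stated in the abstract can be illustrated with a short sketch. It is not the patented implementation: the class names, the choice of 0 as the reserved "indefinite pitch" codeword, and the helper names `min_bits`, `class_codeword_length`, and `encode_frame` are all assumptions made for this example. The sketch also assumes that a decoder would use the pitch codeword to tell which group of classes the class codeword refers to, which is why the class codeword only needs enough bits for the larger of the two groups.

```python
from typing import Tuple

# Hypothetical class sets, assumed for illustration only.
INDEFINITE_PITCH_CLASSES = ["non-speech", "unvoiced"]
DEFINITE_PITCH_CLASSES = ["mixed-voiced", "fully-voiced"]

# Assumption: pitch codeword 0 is the value reserved for "indefinite pitch".
INDEFINITE_PITCH_CODEWORD = 0


def min_bits(count: int) -> int:
    """Minimum number of bits needed to distinguish `count` alternatives."""
    if count < 1:
        raise ValueError("count must be at least 1")
    # (count - 1).bit_length() equals ceil(log2(count)); it is 0 for count == 1.
    return (count - 1).bit_length()


def class_codeword_length(num_indefinite: int, num_definite: int) -> int:
    """Length rule from the abstract: the larger of the two per-group minimums."""
    if num_indefinite < 2 or num_definite < 1:
        raise ValueError("need >= 2 indefinite-pitch and >= 1 definite-pitch classes")
    return max(min_bits(num_indefinite), min_bits(num_definite))


def encode_frame(frame_class: str, pitch_index: int = 0) -> Tuple[int, int]:
    """Return (pitch_codeword, class_codeword) for one frame.

    The class codeword is the index of the class within its own group;
    the group is implied by whether the pitch codeword is the reserved value.
    """
    if frame_class in INDEFINITE_PITCH_CLASSES:
        return INDEFINITE_PITCH_CODEWORD, INDEFINITE_PITCH_CLASSES.index(frame_class)
    if frame_class in DEFINITE_PITCH_CLASSES:
        if pitch_index == INDEFINITE_PITCH_CODEWORD:
            raise ValueError("definite-pitch frames need a non-reserved pitch codeword")
        return pitch_index, DEFINITE_PITCH_CLASSES.index(frame_class)
    raise ValueError("unknown class: " + frame_class)
```

Under these assumptions, two indefinite-pitch classes and two definite-pitch classes give `class_codeword_length(2, 2) == 1`, so a single bit is enough for the class even though four classes exist in total.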

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US10/360,582 US6961696B2 (en) 2003-02-07 2003-02-07 Class quantization for distributed speech recognition
US10/360,582 2003-02-07
PCT/US2004/003419 WO2004072948A2 (en) 2003-02-07 2004-02-05 Class quantization for distributed speech recognition

Publications (2)

Publication Number Publication Date
BRPI0406952A (pt) 2006-01-03
BRPI0406952B1 (pt) 2018-02-27

Family

ID=32824044

Family Applications (1)

Application Number Title Priority Date Filing Date
BRPI0406952-8A BRPI0406952B1 (pt) 2003-02-07 2004-02-05 “Class information quantization for distributed speech recognition”

Country Status (8)

Country Link
US (1) US6961696B2 (pt)
EP (1) EP1595249B1 (pt)
KR (1) KR100763325B1 (pt)
CN (1) CN101160380B (pt)
BR (1) BRPI0406952B1 (pt)
RU (1) RU2348019C2 (pt)
TW (1) TWI326447B (pt)
WO (1) WO2004072948A2 (pt)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7783488B2 (en) * 2005-12-19 2010-08-24 Nuance Communications, Inc. Remote tracing and debugging of automatic speech recognition servers by speech reconstruction from cepstra and pitch information
CN102256372B (zh) * 2010-05-17 2016-06-22 中兴通讯股份有限公司 MTC terminal access method and system, and MTC terminal
US9883312B2 (en) 2013-05-29 2018-01-30 Qualcomm Incorporated Transformed higher order ambisonics audio data
US9466305B2 (en) 2013-05-29 2016-10-11 Qualcomm Incorporated Performing positional analysis to code spherical harmonic coefficients
US9489955B2 (en) 2014-01-30 2016-11-08 Qualcomm Incorporated Indicating frame parameter reusability for coding vectors
US9922656B2 (en) 2014-01-30 2018-03-20 Qualcomm Incorporated Transitioning of ambient higher-order ambisonic coefficients
US9620137B2 (en) 2014-05-16 2017-04-11 Qualcomm Incorporated Determining between scalar and vector quantization in higher order ambisonic coefficients
US10770087B2 (en) 2014-05-16 2020-09-08 Qualcomm Incorporated Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals
US9852737B2 (en) 2014-05-16 2017-12-26 Qualcomm Incorporated Coding vectors decomposed from higher-order ambisonics audio signals
US9747910B2 (en) 2014-09-26 2017-08-29 Qualcomm Incorporated Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework
RU2701120C1 (ru) * 2018-05-14 2019-09-24 Федеральное государственное казенное военное образовательное учреждение высшего образования "Военный учебно-научный центр Военно-Морского Флота "Военно-морская академия имени Адмирала флота Советского Союза Н.Г. Кузнецова" Device for processing a speech signal

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5680508A (en) * 1991-05-03 1997-10-21 Itt Corporation Enhancement of speech coding in background noise for low-rate speech coder
US5233660A (en) * 1991-09-10 1993-08-03 At&T Bell Laboratories Method and apparatus for low-delay celp speech coding and decoding
AU684872B2 (en) * 1994-03-10 1998-01-08 Cable And Wireless Plc Communication system
US5732389A (en) * 1995-06-07 1998-03-24 Lucent Technologies Inc. Voiced/unvoiced classification of speech for excitation codebook selection in celp speech decoding during frame erasures
US5699485A (en) * 1995-06-07 1997-12-16 Lucent Technologies Inc. Pitch delay modification during frame erasures
SE512613C2 (sv) * 1996-12-30 2000-04-10 Ericsson Telefon Ab L M Method and means for information handling
US6058205A (en) * 1997-01-09 2000-05-02 International Business Machines Corporation System and method for partitioning the feature space of a classifier in a pattern classification system
JP3011678B2 (ja) * 1997-07-09 2000-02-21 株式会社精研 Scrubbing brush
US5924066A (en) * 1997-09-26 1999-07-13 U S West, Inc. System and method for classifying a speech signal
US6038535A (en) * 1998-03-23 2000-03-14 Motorola, Inc. Speech classifier and method using delay elements
GB9811019D0 (en) * 1998-05-21 1998-07-22 Univ Surrey Speech coders
US6377915B1 (en) * 1999-03-17 2002-04-23 Yrp Advanced Mobile Communication Systems Research Laboratories Co., Ltd. Speech decoding using mix ratio table
US6377916B1 (en) * 1999-11-29 2002-04-23 Digital Voice Systems, Inc. Multiband harmonic transform coder
US20020016161A1 (en) * 2000-02-10 2002-02-07 Telefonaktiebolaget Lm Ericsson (Publ) Method and apparatus for compression of speech encoded parameters
US6934756B2 (en) * 2000-11-01 2005-08-23 International Business Machines Corporation Conversational networking via transport, coding and control conversational protocols
US6915256B2 (en) * 2003-02-07 2005-07-05 Motorola, Inc. Pitch quantization for distributed speech recognition
KR20060068278A (ko) * 2004-12-16 2006-06-21 한국전자통신연구원 Method and apparatus for quantizing mel-cepstrum coefficients in a distributed speech recognition system

Also Published As

Publication number Publication date
BRPI0406952B1 (pt) 2018-02-27
KR20050097928A (ko) 2005-10-10
RU2005127871A (ru) 2006-01-20
TW200501055A (en) 2005-01-01
EP1595249A4 (en) 2007-06-20
EP1595249B1 (en) 2017-07-12
RU2348019C2 (ru) 2009-02-27
WO2004072948A2 (en) 2004-08-26
CN101160380B (zh) 2011-09-21
WO2004072948A3 (en) 2004-12-16
US20040158461A1 (en) 2004-08-12
TWI326447B (en) 2010-06-21
EP1595249A2 (en) 2005-11-16
CN101160380A (zh) 2008-04-09
KR100763325B1 (ko) 2007-10-05
US6961696B2 (en) 2005-11-01

Similar Documents

Publication Publication Date Title
BRPI0406952A (pt) Class quantization for distributed speech recognition
Priva Not so fast: Fast speech correlates with lower lexical and structural information
BRPI0412184A (pt) Rendering advertisements with documents having one or more topics using user topic interest information
EP1229547A3 (en) System and method for thematically analyzing and annotating an audio-visual sequence
BR0107718A (pt) Method and system for providing a personalized media list
IL172518A0 (en) System and method for configuring voice readers using semantic analysis
EP1759273A4 (en) EVALUATION OF SHARES INVOLVING CAPTURED INFORMATION AND THE ELECTRONIC CONTENT CORRESPONDING TO THE RETURNED DOCUMENTS
ATE325384T1 (de) Systems and methods for integrity certification and verification of content consumption environments
BRPI0410320A (pt) Method and apparatus for representing image granularity by one or more parameters
ATE375553T1 (de) Detection of out-of-memory conditions and graceful shutdown
BR0205150A (pt) Methods and arrangements for embedding and for detecting a watermark in an information signal, device for processing multimedia content, information signal having an embedded watermark, storage medium, and device for transmitting an information signal
Lai et al. A corpus study of the prosody of polysyllabic words in Mandarin Chinese
Allen et al. A linguistic ‘time capsule’: the Newcastle Electronic Corpus of Tyneside English
Gutkin et al. Developing an open-source corpus of yoruba speech
TW200632643A (en) System and method for data analysis
Tang et al. Mutual intelligibility and similarity of Chinese dialects: Predicting judgments from objective measures
BR0017086A (pt) Process for calculating a perceptual distance between a data signal and a first representation of the data signal, compression system, and data compression process
DE60336188D1 (de) Data filtering management device
BR0206446A (pt) Method and arrangement for adjusting a supplementary data signal to be embedded in an information signal, device for embedding a supplementary data signal in an information signal, information signal having a supplementary data signal embedded therein, and storage medium
BRPI0406956A (pt) Pitch quantization for distributed speech recognition
van Son et al. Perisegmental speech improves consonant and vowel identification
DE50008116D1 (de) Anonymization method
Van der Spuy The morphology of the Zulu locative
WO2004109471A3 (en) System and method for voice activating web pages
BR0307046A (pt) System and method for providing multiple interpretations of document content

Legal Events

Date Code Title Description
B25D Requested change of name of applicant approved

Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION (US)

B25A Requested transfer of rights approved

Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION (US)

B25G Requested change of headquarters approved

Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION (US)

B25E Requested change of name of applicant rejected

Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION (US)

Free format text: The change of name of the second applicant, requested through petition no. 020130025782-RJ of 28/03/2013, was rejected because the corresponding official fee was not paid.

B25D Requested change of name of applicant approved

Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION (US)

B25A Requested transfer of rights approved

Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION (US)

B15K Others concerning applications: alteration of classification

Ipc: G10L 25/72 (2013.01), G10L 25/90 (2013.01), G10L 2

B06A Notification to applicant to reply to the report for non-patentability or inadequacy of the application [chapter 6.1 patent gazette]
B09A Decision: intention to grant [chapter 9.1 patent gazette]
B16A Patent or certificate of addition of invention granted