BRPI0406952A - Class quantization for distributed speech recognition - Google Patents
Class quantization for distributed speech recognition
- Publication number
- BRPI0406952A
- Authority
- BR
- Brazil
- Prior art keywords
- class
- tone
- frame
- codeword
- quantification
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/72—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for transmitting results of analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/90—Pitch determination of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
- G10L2025/935—Mixed voiced class; Transitions
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Mobile Radio Communication Systems (AREA)
- Telephonic Communication Services (AREA)
Abstract
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/360,582 US6961696B2 (en) | 2003-02-07 | 2003-02-07 | Class quantization for distributed speech recognition |
US10/360,582 | 2003-02-07 | ||
PCT/US2004/003419 WO2004072948A2 (en) | 2003-02-07 | 2004-02-05 | Class quantization for distributed speech recognition |
Publications (2)
Publication Number | Publication Date |
---|---|
BRPI0406952A (pt) | 2006-01-03 |
BRPI0406952B1 (pt) | 2018-02-27 |
Family
ID=32824044
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
BRPI0406952-8A BRPI0406952B1 (pt) | Quantization of class information for distributed speech recognition | 2003-02-07 | 2004-02-05 |
Country Status (8)
Country | Link |
---|---|
US (1) | US6961696B2 (pt) |
EP (1) | EP1595249B1 (pt) |
KR (1) | KR100763325B1 (pt) |
CN (1) | CN101160380B (pt) |
BR (1) | BRPI0406952B1 (pt) |
RU (1) | RU2348019C2 (pt) |
TW (1) | TWI326447B (pt) |
WO (1) | WO2004072948A2 (pt) |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7783488B2 (en) * | 2005-12-19 | 2010-08-24 | Nuance Communications, Inc. | Remote tracing and debugging of automatic speech recognition servers by speech reconstruction from cepstra and pitch information |
CN102256372B (zh) * | 2010-05-17 | 2016-06-22 | 中兴通讯股份有限公司 | MTC terminal access method and system, and MTC terminal |
US9883312B2 (en) | 2013-05-29 | 2018-01-30 | Qualcomm Incorporated | Transformed higher order ambisonics audio data |
US9466305B2 (en) | 2013-05-29 | 2016-10-11 | Qualcomm Incorporated | Performing positional analysis to code spherical harmonic coefficients |
US9489955B2 (en) | 2014-01-30 | 2016-11-08 | Qualcomm Incorporated | Indicating frame parameter reusability for coding vectors |
US9922656B2 (en) | 2014-01-30 | 2018-03-20 | Qualcomm Incorporated | Transitioning of ambient higher-order ambisonic coefficients |
US9620137B2 (en) | 2014-05-16 | 2017-04-11 | Qualcomm Incorporated | Determining between scalar and vector quantization in higher order ambisonic coefficients |
US10770087B2 (en) | 2014-05-16 | 2020-09-08 | Qualcomm Incorporated | Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals |
US9852737B2 (en) | 2014-05-16 | 2017-12-26 | Qualcomm Incorporated | Coding vectors decomposed from higher-order ambisonics audio signals |
US9747910B2 (en) | 2014-09-26 | 2017-08-29 | Qualcomm Incorporated | Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework |
RU2701120C1 (ru) * | 2018-05-14 | 2019-09-24 | Федеральное государственное казенное военное образовательное учреждение высшего образования "Военный учебно-научный центр Военно-Морского Флота "Военно-морская академия имени Адмирала флота Советского Союза Н.Г. Кузнецова" | Device for processing a speech signal |
Family Cites Families (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5680508A (en) * | 1991-05-03 | 1997-10-21 | Itt Corporation | Enhancement of speech coding in background noise for low-rate speech coder |
US5233660A (en) * | 1991-09-10 | 1993-08-03 | At&T Bell Laboratories | Method and apparatus for low-delay celp speech coding and decoding |
AU684872B2 (en) * | 1994-03-10 | 1998-01-08 | Cable And Wireless Plc | Communication system |
US5732389A (en) * | 1995-06-07 | 1998-03-24 | Lucent Technologies Inc. | Voiced/unvoiced classification of speech for excitation codebook selection in celp speech decoding during frame erasures |
US5699485A (en) * | 1995-06-07 | 1997-12-16 | Lucent Technologies Inc. | Pitch delay modification during frame erasures |
SE512613C2 (sv) * | 1996-12-30 | 2000-04-10 | Ericsson Telefon Ab L M | Method and means for information handling |
US6058205A (en) * | 1997-01-09 | 2000-05-02 | International Business Machines Corporation | System and method for partitioning the feature space of a classifier in a pattern classification system |
JP3011678B2 (ja) * | 1997-07-09 | 2000-02-21 | 株式会社精研 | Scrubbing brush |
US5924066A (en) * | 1997-09-26 | 1999-07-13 | U S West, Inc. | System and method for classifying a speech signal |
US6038535A (en) * | 1998-03-23 | 2000-03-14 | Motorola, Inc. | Speech classifier and method using delay elements |
GB9811019D0 (en) * | 1998-05-21 | 1998-07-22 | Univ Surrey | Speech coders |
US6377915B1 (en) * | 1999-03-17 | 2002-04-23 | Yrp Advanced Mobile Communication Systems Research Laboratories Co., Ltd. | Speech decoding using mix ratio table |
US6377916B1 (en) * | 1999-11-29 | 2002-04-23 | Digital Voice Systems, Inc. | Multiband harmonic transform coder |
US20020016161A1 (en) * | 2000-02-10 | 2002-02-07 | Telefonaktiebolaget Lm Ericsson (Publ) | Method and apparatus for compression of speech encoded parameters |
US6934756B2 (en) * | 2000-11-01 | 2005-08-23 | International Business Machines Corporation | Conversational networking via transport, coding and control conversational protocols |
US6915256B2 (en) * | 2003-02-07 | 2005-07-05 | Motorola, Inc. | Pitch quantization for distributed speech recognition |
KR20060068278A (ko) * | 2004-12-16 | 2006-06-21 | 한국전자통신연구원 | Method and apparatus for quantizing mel-cepstrum coefficients in a distributed speech recognition system |
-
2003
- 2003-02-07 US: application US10/360,582, publication US6961696B2 (en), status: Expired - Lifetime (not active)
-
2004
- 2004-02-05 KR: application KR1020057012452A, publication KR100763325B1 (ko), status: IP Right Grant (active)
- 2004-02-05 CN: application CN2004800036671A, publication CN101160380B (zh), status: Expired - Lifetime (not active)
- 2004-02-05 WO: application PCT/US2004/003419, publication WO2004072948A2 (en), status: Application Filing (active)
- 2004-02-05 BR: application BRPI0406952-8A, publication BRPI0406952B1 (pt), status: IP Right Grant (active)
- 2004-02-05 EP: application EP04708622.8A, publication EP1595249B1 (en), status: Expired - Lifetime (not active)
- 2004-02-05 RU: application RU2005127871/09A, publication RU2348019C2 (ru), status: active
- 2004-02-06 TW: application TW093102827A, publication TWI326447B (zh), status: IP Right Cessation (not active)
Also Published As
Publication number | Publication date |
---|---|
BRPI0406952B1 (pt) | 2018-02-27 |
KR20050097928A (ko) | 2005-10-10 |
RU2005127871A (ru) | 2006-01-20 |
TW200501055A (en) | 2005-01-01 |
EP1595249A4 (en) | 2007-06-20 |
EP1595249B1 (en) | 2017-07-12 |
RU2348019C2 (ru) | 2009-02-27 |
WO2004072948A2 (en) | 2004-08-26 |
CN101160380B (zh) | 2011-09-21 |
WO2004072948A3 (en) | 2004-12-16 |
US20040158461A1 (en) | 2004-08-12 |
TWI326447B (en) | 2010-06-21 |
EP1595249A2 (en) | 2005-11-16 |
CN101160380A (zh) | 2008-04-09 |
KR100763325B1 (ko) | 2007-10-05 |
US6961696B2 (en) | 2005-11-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
BRPI0406952A (pt) | Class quantization for distributed speech recognition | |
Priva | Not so fast: Fast speech correlates with lower lexical and structural information | |
BRPI0412184A (pt) | Rendering advertisements with documents having one or more topics using user topic interest information | |
EP1229547A3 (en) | System and method for thematically analyzing and annotating an audio-visual sequence | |
BR0107718A (pt) | Method and system for providing a personalized media list | |
IL172518A0 (en) | System and method for configuring voice readers using semantic analysis | |
EP1759273A4 (en) | EVALUATION OF SHARES INVOLVING CAPTURED INFORMATION AND THE ELECTRONIC CONTENT CORRESPONDING TO THE RETURNED DOCUMENTS | |
ATE325384T1 (de) | Systems and methods for integrity certification and verification of content consumption environments | |
BRPI0410320A (pt) | Method and apparatus for representing image granularity by one or more parameters | |
ATE375553T1 (de) | Detection of memory shortage and graceful shutdown | |
BR0205150A (pt) | Methods and arrangements for embedding and for detecting a watermark in an information signal, device for processing multimedia content, information signal having an embedded watermark, storage medium, and device for transmitting an information signal | |
Lai et al. | A corpus study of the prosody of polysyllabic words in Mandarin Chinese | |
Allen et al. | A linguistic ‘time capsule’: the Newcastle Electronic Corpus of Tyneside English | |
Gutkin et al. | Developing an open-source corpus of yoruba speech | |
TW200632643A (en) | System and method for data analysis | |
Tang et al. | Mutual intelligibility and similarity of Chinese dialects: Predicting judgments from objective measures | |
BR0017086A (pt) | Method for calculating a perceptual distance between a data signal and a first representation of the data signal, compression system, and data compression method | |
DE60336188D1 (de) | Data filtering management device | |
BR0206446A (pt) | Method and arrangement for adjusting a supplemental data signal to be embedded in an information signal, device for embedding a supplemental data signal in an information signal, information signal having a supplemental data signal embedded therein, and storage medium | |
BRPI0406956A (pt) | Pitch quantization for distributed speech recognition | |
van Son et al. | Perisegmental speech improves consonant and vowel identification | |
DE50008116D1 (de) | Anonymization method | |
Van der Spuy | The morphology of the Zulu locative | |
WO2004109471A3 (en) | System and method for voice activating web pages | |
BR0307046A (pt) | System and method for providing multiple interpretations of document content |
Legal Events
Code | Title | Description |
---|---|---|
B25D | Requested change of name of applicant approved | Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION (US) |
B25A | Requested transfer of rights approved | Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION (US) |
B25G | Requested change of headquarter approved | Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION (US) |
B25E | Requested change of name of applicant rejected | Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION (US). Free format text: THE CHANGE OF NAME OF THE SECOND APPLICANT, REQUESTED THROUGH PETITION NO. 020130025782-RJ OF 28/03/2013, IS REJECTED BECAUSE THE CORRESPONDING FEE WAS NOT PAID. |
B25D | Requested change of name of applicant approved | Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION (US) |
B25A | Requested transfer of rights approved | Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION (US) |
B15K | Others concerning applications: alteration of classification | Ipc: G10L 25/72 (2013.01), G10L 25/90 (2013.01), G10L 2 |
B06A | Notification to applicant to reply to the report for non-patentability or inadequacy of the application [chapter 6.1 patent gazette] | |
B09A | Decision: intention to grant [chapter 9.1 patent gazette] | |
B16A | Patent or certificate of addition of invention granted | |