MX346334B - Método y sistema para el reconocimiento automático del habla. - Google Patents
Método y sistema para el reconocimiento automático del habla.Info
- Publication number
- MX346334B MX346334B MX2015009813A MX2015009813A MX346334B MX 346334 B MX346334 B MX 346334B MX 2015009813 A MX2015009813 A MX 2015009813A MX 2015009813 A MX2015009813 A MX 2015009813A MX 346334 B MX346334 B MX 346334B
- Authority
- MX
- Mexico
- Prior art keywords
- network
- sub
- token
- classification
- primary
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/193—Formal grammars, e.g. finite state automata, context free grammars or word networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/083—Recognition networks
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Machine Translation (AREA)
- Telephonic Communication Services (AREA)
Abstract
Se proporciona un método para reconocer el habla que incluye generar una red de decodificación que incluye una sub-red primaria y una sub-red de clasificación; la sub-red primaria incluye un nodo de clasificación que corresponde a la sub-red de clasificación; la sub-red de clasificación corresponde a un grupo de palabras raras; se recibe una entrada del habla y se decodifica representando una autenticación en la sub-red primaria y pasando la autenticación a través de la red primaria; cuando la autenticación alcanza el nodo de clasificación, el método incluye transferir la autenticación a la sub-red de clasificación y pasar la autenticación a través de la sub-red de clasificación; cuando la autenticación alcanza un nodo de aceptación de la sub-red de clasificación, el método incluye devolver un resultado del autenticación que pasa a través de la subred de clasificación a la sub-red primaria; el resultado incluye una o más palabras en el grupo de palabras raras; una cadena que corresponde a la entrada de habla es producida que incluye la una o más palabras.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310037464.5A CN103971686B (zh) | 2013-01-30 | 2013-01-30 | 自动语音识别方法和系统 |
PCT/CN2013/087816 WO2014117577A1 (en) | 2013-01-30 | 2013-11-26 | Method and system for automatic speech recognition |
Publications (2)
Publication Number | Publication Date |
---|---|
MX2015009813A MX2015009813A (es) | 2015-10-29 |
MX346334B true MX346334B (es) | 2017-03-14 |
Family
ID=51241104
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
MX2015009813A MX346334B (es) | 2013-01-30 | 2013-11-26 | Método y sistema para el reconocimiento automático del habla. |
Country Status (7)
Country | Link |
---|---|
US (1) | US9472190B2 (es) |
CN (1) | CN103971686B (es) |
AR (1) | AR094605A1 (es) |
CA (1) | CA2898265C (es) |
MX (1) | MX346334B (es) |
SG (1) | SG11201505405TA (es) |
WO (1) | WO2014117577A1 (es) |
Families Citing this family (35)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP6315980B2 (ja) * | 2013-12-24 | 2018-04-25 | 株式会社東芝 | デコーダ、デコード方法およびプログラム |
US9530404B2 (en) * | 2014-10-06 | 2016-12-27 | Intel Corporation | System and method of automatic speech recognition using on-the-fly word lattice generation with word histories |
KR102267405B1 (ko) * | 2014-11-21 | 2021-06-22 | 삼성전자주식회사 | 음성 인식 장치 및 음성 인식 장치의 제어 방법 |
US20170031896A1 (en) * | 2015-07-28 | 2017-02-02 | Xerox Corporation | Robust reversible finite-state approach to contextual generation and semantic parsing |
CN106940998B (zh) * | 2015-12-31 | 2021-04-16 | 阿里巴巴集团控股有限公司 | 一种设定操作的执行方法及装置 |
CN105869624B (zh) * | 2016-03-29 | 2019-05-10 | 腾讯科技(深圳)有限公司 | 数字语音识别中语音解码网络的构建方法及装置 |
CN106128454A (zh) * | 2016-07-08 | 2016-11-16 | 成都之达科技有限公司 | 基于车联网的语音信号匹配方法 |
CN106202045B (zh) * | 2016-07-08 | 2019-04-02 | 成都之达科技有限公司 | 基于车联网的专项语音识别方法 |
CN106356054A (zh) * | 2016-11-23 | 2017-01-25 | 广西大学 | 一种基于语音识别的农产品信息采集方法和系统 |
CN108288467B (zh) * | 2017-06-07 | 2020-07-14 | 腾讯科技(深圳)有限公司 | 一种语音识别方法、装置及语音识别引擎 |
US10832658B2 (en) * | 2017-11-15 | 2020-11-10 | International Business Machines Corporation | Quantized dialog language model for dialog systems |
CN108597517B (zh) * | 2018-03-08 | 2020-06-05 | 深圳市声扬科技有限公司 | 标点符号添加方法、装置、计算机设备和存储介质 |
CN108597497B (zh) * | 2018-04-03 | 2020-09-08 | 中译语通科技股份有限公司 | 一种字幕语音精准同步系统及方法、信息数据处理终端 |
US11886473B2 (en) | 2018-04-20 | 2024-01-30 | Meta Platforms, Inc. | Intent identification for agent matching by assistant systems |
US10782986B2 (en) | 2018-04-20 | 2020-09-22 | Facebook, Inc. | Assisting users with personalized and contextual communication content |
US11307880B2 (en) | 2018-04-20 | 2022-04-19 | Meta Platforms, Inc. | Assisting users with personalized and contextual communication content |
US11676220B2 (en) | 2018-04-20 | 2023-06-13 | Meta Platforms, Inc. | Processing multimodal user input for assistant systems |
US11715042B1 (en) | 2018-04-20 | 2023-08-01 | Meta Platforms Technologies, Llc | Interpretability of deep reinforcement learning models in assistant systems |
CN108694939B (zh) * | 2018-05-23 | 2020-11-03 | 广州视源电子科技股份有限公司 | 语音搜索优化方法、装置和系统 |
DE102018208932A1 (de) | 2018-06-06 | 2019-12-12 | Glatt Gesellschaft Mit Beschränkter Haftung | Anströmboden für einen Fluidisierungsapparat |
CN110689881B (zh) * | 2018-06-20 | 2022-07-12 | 深圳市北科瑞声科技股份有限公司 | 语音识别方法、装置、计算机设备和存储介质 |
CN108989349B (zh) * | 2018-08-31 | 2022-11-29 | 平安科技(深圳)有限公司 | 用户账号解锁方法、装置、计算机设备及存储介质 |
CN111354347B (zh) * | 2018-12-21 | 2023-08-15 | 中国科学院声学研究所 | 一种基于自适应热词权重的语音识别方法及系统 |
CN110322884B (zh) * | 2019-07-09 | 2021-12-07 | 科大讯飞股份有限公司 | 一种解码网络的插词方法、装置、设备及存储介质 |
CN110610700B (zh) * | 2019-10-16 | 2022-01-14 | 科大讯飞股份有限公司 | 解码网络构建方法、语音识别方法、装置、设备及存储介质 |
CN111128172B (zh) * | 2019-12-31 | 2022-12-16 | 达闼机器人股份有限公司 | 一种语音识别方法、电子设备和存储介质 |
CN111477217B (zh) * | 2020-04-08 | 2023-10-10 | 北京声智科技有限公司 | 一种命令词识别方法及装置 |
CN111787169B (zh) * | 2020-07-13 | 2021-06-15 | 南京硅基智能科技有限公司 | 一种用于移动式人机协作呼叫机器人的三方通话终端 |
CN111862958B (zh) * | 2020-08-07 | 2024-04-02 | 广州视琨电子科技有限公司 | 发音插入错误检测方法、装置、电子设备及存储介质 |
CN112233664B (zh) | 2020-10-15 | 2021-11-09 | 北京百度网讯科技有限公司 | 语义预测网络的训练方法、装置、设备以及存储介质 |
CN112820277B (zh) * | 2021-01-06 | 2023-08-25 | 网易(杭州)网络有限公司 | 语音识别服务定制方法、介质、装置和计算设备 |
CN114937450B (zh) * | 2021-02-05 | 2024-07-09 | 清华大学 | 一种语音关键词识别方法及系统 |
CN113450803B (zh) * | 2021-06-09 | 2024-03-19 | 上海明略人工智能(集团)有限公司 | 会议录音转写方法、系统、计算机设备和可读存储介质 |
CN114299945A (zh) * | 2021-12-15 | 2022-04-08 | 北京声智科技有限公司 | 语音信号的识别方法、装置、电子设备、存储介质及产品 |
CN114898754B (zh) * | 2022-07-07 | 2022-09-30 | 北京百度网讯科技有限公司 | 解码图生成、语音识别方法、装置、电子设备及存储介质 |
Family Cites Families (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6032111A (en) * | 1997-06-23 | 2000-02-29 | At&T Corp. | Method and apparatus for compiling context-dependent rewrite rules and input strings |
DE19754957A1 (de) * | 1997-12-11 | 1999-06-17 | Daimler Chrysler Ag | Verfahren zur Spracherkennung |
US6574597B1 (en) * | 1998-05-08 | 2003-06-03 | At&T Corp. | Fully expanded context-dependent networks for speech recognition |
US6185535B1 (en) * | 1998-10-16 | 2001-02-06 | Telefonaktiebolaget Lm Ericsson (Publ) | Voice control of a user interface to service applications |
US7881936B2 (en) * | 1998-12-04 | 2011-02-01 | Tegic Communications, Inc. | Multimodal disambiguation of speech recognition |
US20020032564A1 (en) * | 2000-04-19 | 2002-03-14 | Farzad Ehsani | Phrase-based dialogue modeling with particular application to creating a recognition grammar for a voice-controlled user interface |
US6587844B1 (en) * | 2000-02-01 | 2003-07-01 | At&T Corp. | System and methods for optimizing networks of weighted unweighted directed graphs |
US20040205671A1 (en) * | 2000-09-13 | 2004-10-14 | Tatsuya Sukehiro | Natural-language processing system |
JP4215418B2 (ja) * | 2001-08-24 | 2009-01-28 | インターナショナル・ビジネス・マシーンズ・コーポレーション | 単語予測方法、音声認識方法、その方法を用いた音声認識装置及びプログラム |
US7398209B2 (en) * | 2002-06-03 | 2008-07-08 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
GB2409750B (en) * | 2004-01-05 | 2006-03-15 | Toshiba Res Europ Ltd | Speech recognition system and technique |
US7620549B2 (en) * | 2005-08-10 | 2009-11-17 | Voicebox Technologies, Inc. | System and method of supporting adaptive misrecognition in conversational speech |
US7949529B2 (en) * | 2005-08-29 | 2011-05-24 | Voicebox Technologies, Inc. | Mobile systems and methods of supporting natural language human-machine interactions |
US8195462B2 (en) * | 2006-02-16 | 2012-06-05 | At&T Intellectual Property Ii, L.P. | System and method for providing large vocabulary speech processing based on fixed-point arithmetic |
US7877256B2 (en) * | 2006-02-17 | 2011-01-25 | Microsoft Corporation | Time synchronous decoding for long-span hidden trajectory model |
CN101751924A (zh) * | 2009-12-10 | 2010-06-23 | 清华大学 | 嵌入式平台大词汇量语音命令词的识别方法 |
US8914286B1 (en) * | 2011-04-14 | 2014-12-16 | Canyon IP Holdings, LLC | Speech recognition with hierarchical networks |
CN102376305B (zh) * | 2011-11-29 | 2013-06-19 | 安徽科大讯飞信息科技股份有限公司 | 语音识别方法及系统 |
CN102592595B (zh) * | 2012-03-19 | 2013-05-29 | 安徽科大讯飞信息科技股份有限公司 | 语音识别方法及系统 |
US8374865B1 (en) * | 2012-04-26 | 2013-02-12 | Google Inc. | Sampling training data for an automatic speech recognition system based on a benchmark classification distribution |
EP2893435B1 (en) * | 2012-09-07 | 2019-05-08 | Carnegie Mellon University | Methods for hybrid gpu/cpu data processing |
US9123333B2 (en) * | 2012-09-12 | 2015-09-01 | Google Inc. | Minimum bayesian risk methods for automatic speech recognition |
US8972243B1 (en) * | 2012-11-20 | 2015-03-03 | Amazon Technologies, Inc. | Parse information encoding in a finite state transducer |
US9594744B2 (en) * | 2012-11-28 | 2017-03-14 | Google Inc. | Speech transcription including written text |
CN103065630B (zh) * | 2012-12-28 | 2015-01-07 | 科大讯飞股份有限公司 | 用户个性化信息语音识别方法及系统 |
-
2013
- 2013-01-30 CN CN201310037464.5A patent/CN103971686B/zh active Active
- 2013-11-26 MX MX2015009813A patent/MX346334B/es active IP Right Grant
- 2013-11-26 SG SG11201505405TA patent/SG11201505405TA/en unknown
- 2013-11-26 CA CA2898265A patent/CA2898265C/en active Active
- 2013-11-26 WO PCT/CN2013/087816 patent/WO2014117577A1/en active Application Filing
-
2014
- 2014-01-28 AR ARP140100257A patent/AR094605A1/es active IP Right Grant
- 2014-04-28 US US14/263,958 patent/US9472190B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN103971686A (zh) | 2014-08-06 |
US20140236591A1 (en) | 2014-08-21 |
CA2898265A1 (en) | 2014-08-07 |
US9472190B2 (en) | 2016-10-18 |
AR094605A1 (es) | 2015-08-12 |
CA2898265C (en) | 2017-08-01 |
WO2014117577A1 (en) | 2014-08-07 |
CN103971686B (zh) | 2015-06-10 |
MX2015009813A (es) | 2015-10-29 |
SG11201505405TA (en) | 2015-08-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
MX2015009813A (es) | Metodo y sistema para el reconocimiento automatico del habla. | |
AR125774A2 (es) | Procesador de datos de audio para decodificadores de audio y/o renderizadores y método para procesar datos de audio | |
MX2018001498A (es) | Control de una nube de dispositivos. | |
WO2014144395A3 (en) | User training by intelligent digital assistant | |
MX2017015597A (es) | Provision de elementos de mensaje aumentados en hilos de comunicacion electronica. | |
PH12019501745A1 (en) | Service data processing method and device, and service processing method and device | |
CL2018001771A1 (es) | Tecnologías de red | |
BR112017006053A2 (pt) | método de comunicação e aparelho | |
MX2017002593A (es) | Transformacion de secuencias de eventos. | |
MX2017001004A (es) | Dimensionamiento y posicionamiento adaptativos de ventanas de aplicacion. | |
WO2014121234A3 (en) | Method and apparatus for contextual text to speech conversion | |
MX350012B (es) | Plantillas de búsqueda por el cliente para redes sociales en línea. | |
PH12015000372A1 (en) | Conversion of documents of different types to a uniform and an editable or a searchable format | |
BR112016024153A2 (pt) | método e sistema para implementar uma carteira digital sem fio | |
CL2017002238A1 (es) | Método de tratamiento con tradipitant | |
BR112016030050A2 (pt) | modulação de pulso assíncrono para codificação de sinal baseada em limite | |
MY185366A (en) | Audio information processing method and device | |
AR098687A1 (es) | Sistema y método para tratar una formación subterránea con una composición de desvío | |
WO2018212584A3 (ko) | 딥 뉴럴 네트워크를 이용하여 문장이 속하는 클래스를 분류하는 방법 및 장치 | |
PH12017501843A1 (en) | Systems and methods for customized load control | |
MX2017012465A (es) | Dispositivo de transmision, metodo de transmision, dispositivo de recepcion, y metodo de recepcion. | |
SA520411109B1 (ar) | نظم، وأجهزة، وطرق لفصل الماء في أسفل البئر | |
BR112017001950A2 (pt) | sistema e método de monitoramento configuráveis | |
TW201613308A (en) | Method and gateway for controlling external device and device connected to gateway | |
MX2018008126A (es) | Aparato de transmision, metodo de transmision, aparato de recepcion, y metodo de recepcion. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FG | Grant or registration |