MX346334B - Method and system for automatic speech recognition. - Google Patents

Method and system for automatic speech recognition.

Info

Publication number
MX346334B
Authority
MX
Mexico
Prior art keywords
network
sub
token
classification
primary
Prior art date
Application number
MX2015009813A
Other languages
English (en)
Other versions
MX2015009813A (es)
Inventor
Xiang Zhang
Li Lu
Shuai Yue
Feng Rao
Bo Chen
Dadong Xie
Original Assignee
Tencent Tech Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2013-01-30
Filing date
2013-11-26
Publication date
2017-03-14
Application filed by Tencent Tech Shenzhen Co Ltd
Publication of MX2015009813A
Publication of MX346334B

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/08: Speech classification or search
    • G10L15/18: Speech classification or search using natural language modelling
    • G10L15/183: Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19: Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • G10L15/193: Formal grammars, e.g. finite state automata, context free grammars or word networks
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/08: Speech classification or search
    • G10L15/083: Recognition networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Machine Translation (AREA)
  • Telephonic Communication Services (AREA)

Abstract

A method of recognizing speech is provided that includes generating a decoding network that includes a primary sub-network and a classification sub-network. The primary sub-network includes a classification node that corresponds to the classification sub-network, and the classification sub-network corresponds to a group of rare words. A speech input is received and decoded by instantiating a token in the primary sub-network and passing the token through the primary sub-network. When the token reaches the classification node, the method includes transferring the token to the classification sub-network and passing the token through the classification sub-network. When the token reaches an accept node of the classification sub-network, the method includes returning a result of the token passing through the classification sub-network to the primary sub-network; the result includes one or more words in the group of rare words. A string corresponding to the speech input is output that includes the one or more words.
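
The token flow described in the abstract can be illustrated with a small sketch. This is not the patented implementation: it assumes the speech input has already been reduced to a sequence of word symbols (no acoustic scoring or pruning), and it models the classification node as an arc whose label names a classification sub-network. The names Network, Token, pass_tokens, decode, the "<RARE>" label, and the example words are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Network:
    """A tiny word graph: arcs[state] is a list of (label, next_state)."""
    arcs: dict
    start: int
    accept: set


@dataclass(frozen=True)
class Token:
    state: int          # current node in the network being traversed
    pos: int            # number of input symbols consumed so far
    words: tuple = ()   # words collected along this token's path


def pass_tokens(net, symbols, pos, subnets):
    """Pass tokens through `net` starting at input position `pos`.

    Returns a list of (end_pos, words) pairs, one for each token that
    reaches an accept node of `net`.
    """
    results = []
    stack = [Token(net.start, pos)]
    while stack:
        tok = stack.pop()
        if tok.state in net.accept:
            results.append((tok.pos, tok.words))
        for label, nxt in net.arcs.get(tok.state, []):
            if label in subnets:
                # Classification node (assumed encoding): transfer the token
                # to the classification sub-network, then resume in this
                # network with the words that sub-network returned.
                for end, sub_words in pass_tokens(subnets[label], symbols, tok.pos, subnets):
                    stack.append(Token(nxt, end, tok.words + sub_words))
            elif tok.pos < len(symbols) and symbols[tok.pos] == label:
                # Ordinary arc: consume one matching input symbol.
                stack.append(Token(nxt, tok.pos + 1, tok.words + (label,)))
    return results


def decode(primary, symbols, subnets):
    """Return the output string if some token consumes the whole input."""
    for end, words in pass_tokens(primary, symbols, 0, subnets):
        if end == len(symbols):
            return " ".join(words)
    return None


# Hypothetical example: "call <RARE>", where <RARE> expands to rare words.
primary = Network(arcs={0: [("call", 1)], 1: [("<RARE>", 2)]}, start=0, accept={2})
rare = Network(arcs={0: [("zebulon", 1), ("quillon", 1)]}, start=0, accept={1})
subnets = {"<RARE>": rare}

print(decode(primary, ["call", "zebulon"], subnets))  # prints: call zebulon
```

Keeping the rare words in a separate sub-network means the primary sub-network only needs one classification entry for the whole group, which is the structural idea the abstract describes; a real decoder would additionally score and prune tokens against acoustic and language models.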
MX2015009813A 2013-01-30 2013-11-26 Method and system for automatic speech recognition. MX346334B (es)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201310037464.5A CN103971686B (zh) 2013-01-30 2013-01-30 自动语音识别方法和系统
PCT/CN2013/087816 WO2014117577A1 (en) 2013-01-30 2013-11-26 Method and system for automatic speech recognition

Publications (2)

Publication Number Publication Date
MX2015009813A (es) 2015-10-29
MX346334B (es) 2017-03-14

Family

ID=51241104

Family Applications (1)

Application Number Publication Priority Date Filing Date Title
MX2015009813A MX346334B (es) 2013-01-30 2013-11-26 Method and system for automatic speech recognition.

Country Status (7)

Country Link
US (1) US9472190B2 (es)
CN (1) CN103971686B (es)
AR (1) AR094605A1 (es)
CA (1) CA2898265C (es)
MX (1) MX346334B (es)
SG (1) SG11201505405TA (es)
WO (1) WO2014117577A1 (es)

Families Citing this family (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6315980B2 (ja) * 2013-12-24 2018-04-25 株式会社東芝 デコーダ、デコード方法およびプログラム
US9530404B2 (en) * 2014-10-06 2016-12-27 Intel Corporation System and method of automatic speech recognition using on-the-fly word lattice generation with word histories
KR102267405B1 (ko) * 2014-11-21 2021-06-22 삼성전자주식회사 음성 인식 장치 및 음성 인식 장치의 제어 방법
US20170031896A1 (en) * 2015-07-28 2017-02-02 Xerox Corporation Robust reversible finite-state approach to contextual generation and semantic parsing
CN106940998B (zh) * 2015-12-31 2021-04-16 阿里巴巴集团控股有限公司 一种设定操作的执行方法及装置
CN105869624B (zh) * 2016-03-29 2019-05-10 腾讯科技(深圳)有限公司 数字语音识别中语音解码网络的构建方法及装置
CN106128454A (zh) * 2016-07-08 2016-11-16 成都之达科技有限公司 基于车联网的语音信号匹配方法
CN106202045B (zh) * 2016-07-08 2019-04-02 成都之达科技有限公司 基于车联网的专项语音识别方法
CN106356054A (zh) * 2016-11-23 2017-01-25 广西大学 一种基于语音识别的农产品信息采集方法和系统
CN108288467B (zh) * 2017-06-07 2020-07-14 腾讯科技(深圳)有限公司 一种语音识别方法、装置及语音识别引擎
US10832658B2 (en) * 2017-11-15 2020-11-10 International Business Machines Corporation Quantized dialog language model for dialog systems
CN108597517B (zh) * 2018-03-08 2020-06-05 深圳市声扬科技有限公司 标点符号添加方法、装置、计算机设备和存储介质
CN108597497B (zh) * 2018-04-03 2020-09-08 中译语通科技股份有限公司 一种字幕语音精准同步系统及方法、信息数据处理终端
US11886473B2 (en) 2018-04-20 2024-01-30 Meta Platforms, Inc. Intent identification for agent matching by assistant systems
US10782986B2 (en) 2018-04-20 2020-09-22 Facebook, Inc. Assisting users with personalized and contextual communication content
US11307880B2 (en) 2018-04-20 2022-04-19 Meta Platforms, Inc. Assisting users with personalized and contextual communication content
US11676220B2 (en) 2018-04-20 2023-06-13 Meta Platforms, Inc. Processing multimodal user input for assistant systems
US11715042B1 (en) 2018-04-20 2023-08-01 Meta Platforms Technologies, Llc Interpretability of deep reinforcement learning models in assistant systems
CN108694939B (zh) * 2018-05-23 2020-11-03 广州视源电子科技股份有限公司 语音搜索优化方法、装置和系统
DE102018208932A1 (de) 2018-06-06 2019-12-12 Glatt Gesellschaft Mit Beschränkter Haftung Anströmboden für einen Fluidisierungsapparat
CN110689881B (zh) * 2018-06-20 2022-07-12 深圳市北科瑞声科技股份有限公司 语音识别方法、装置、计算机设备和存储介质
CN108989349B (zh) * 2018-08-31 2022-11-29 平安科技(深圳)有限公司 用户账号解锁方法、装置、计算机设备及存储介质
CN111354347B (zh) * 2018-12-21 2023-08-15 中国科学院声学研究所 一种基于自适应热词权重的语音识别方法及系统
CN110322884B (zh) * 2019-07-09 2021-12-07 科大讯飞股份有限公司 一种解码网络的插词方法、装置、设备及存储介质
CN110610700B (zh) * 2019-10-16 2022-01-14 科大讯飞股份有限公司 解码网络构建方法、语音识别方法、装置、设备及存储介质
CN111128172B (zh) * 2019-12-31 2022-12-16 达闼机器人股份有限公司 一种语音识别方法、电子设备和存储介质
CN111477217B (zh) * 2020-04-08 2023-10-10 北京声智科技有限公司 一种命令词识别方法及装置
CN111787169B (zh) * 2020-07-13 2021-06-15 南京硅基智能科技有限公司 一种用于移动式人机协作呼叫机器人的三方通话终端
CN111862958B (zh) * 2020-08-07 2024-04-02 广州视琨电子科技有限公司 发音插入错误检测方法、装置、电子设备及存储介质
CN112233664B (zh) 2020-10-15 2021-11-09 北京百度网讯科技有限公司 语义预测网络的训练方法、装置、设备以及存储介质
CN112820277B (zh) * 2021-01-06 2023-08-25 网易(杭州)网络有限公司 语音识别服务定制方法、介质、装置和计算设备
CN114937450B (zh) * 2021-02-05 2024-07-09 清华大学 一种语音关键词识别方法及系统
CN113450803B (zh) * 2021-06-09 2024-03-19 上海明略人工智能(集团)有限公司 会议录音转写方法、系统、计算机设备和可读存储介质
CN114299945A (zh) * 2021-12-15 2022-04-08 北京声智科技有限公司 语音信号的识别方法、装置、电子设备、存储介质及产品
CN114898754B (zh) * 2022-07-07 2022-09-30 北京百度网讯科技有限公司 解码图生成、语音识别方法、装置、电子设备及存储介质

Family Cites Families (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6032111A (en) * 1997-06-23 2000-02-29 At&T Corp. Method and apparatus for compiling context-dependent rewrite rules and input strings
DE19754957A1 (de) * 1997-12-11 1999-06-17 Daimler Chrysler Ag Verfahren zur Spracherkennung
US6574597B1 (en) * 1998-05-08 2003-06-03 At&T Corp. Fully expanded context-dependent networks for speech recognition
US6185535B1 (en) * 1998-10-16 2001-02-06 Telefonaktiebolaget Lm Ericsson (Publ) Voice control of a user interface to service applications
US7881936B2 (en) * 1998-12-04 2011-02-01 Tegic Communications, Inc. Multimodal disambiguation of speech recognition
US20020032564A1 (en) * 2000-04-19 2002-03-14 Farzad Ehsani Phrase-based dialogue modeling with particular application to creating a recognition grammar for a voice-controlled user interface
US6587844B1 (en) * 2000-02-01 2003-07-01 At&T Corp. System and methods for optimizing networks of weighted unweighted directed graphs
US20040205671A1 (en) * 2000-09-13 2004-10-14 Tatsuya Sukehiro Natural-language processing system
JP4215418B2 (ja) * 2001-08-24 2009-01-28 インターナショナル・ビジネス・マシーンズ・コーポレーション 単語予測方法、音声認識方法、その方法を用いた音声認識装置及びプログラム
US7398209B2 (en) * 2002-06-03 2008-07-08 Voicebox Technologies, Inc. Systems and methods for responding to natural language speech utterance
GB2409750B (en) * 2004-01-05 2006-03-15 Toshiba Res Europ Ltd Speech recognition system and technique
US7620549B2 (en) * 2005-08-10 2009-11-17 Voicebox Technologies, Inc. System and method of supporting adaptive misrecognition in conversational speech
US7949529B2 (en) * 2005-08-29 2011-05-24 Voicebox Technologies, Inc. Mobile systems and methods of supporting natural language human-machine interactions
US8195462B2 (en) * 2006-02-16 2012-06-05 At&T Intellectual Property Ii, L.P. System and method for providing large vocabulary speech processing based on fixed-point arithmetic
US7877256B2 (en) * 2006-02-17 2011-01-25 Microsoft Corporation Time synchronous decoding for long-span hidden trajectory model
CN101751924A (zh) * 2009-12-10 2010-06-23 清华大学 嵌入式平台大词汇量语音命令词的识别方法
US8914286B1 (en) * 2011-04-14 2014-12-16 Canyon IP Holdings, LLC Speech recognition with hierarchical networks
CN102376305B (zh) * 2011-11-29 2013-06-19 安徽科大讯飞信息科技股份有限公司 语音识别方法及系统
CN102592595B (zh) * 2012-03-19 2013-05-29 安徽科大讯飞信息科技股份有限公司 语音识别方法及系统
US8374865B1 (en) * 2012-04-26 2013-02-12 Google Inc. Sampling training data for an automatic speech recognition system based on a benchmark classification distribution
EP2893435B1 (en) * 2012-09-07 2019-05-08 Carnegie Mellon University Methods for hybrid gpu/cpu data processing
US9123333B2 (en) * 2012-09-12 2015-09-01 Google Inc. Minimum bayesian risk methods for automatic speech recognition
US8972243B1 (en) * 2012-11-20 2015-03-03 Amazon Technologies, Inc. Parse information encoding in a finite state transducer
US9594744B2 (en) * 2012-11-28 2017-03-14 Google Inc. Speech transcription including written text
CN103065630B (zh) * 2012-12-28 2015-01-07 科大讯飞股份有限公司 用户个性化信息语音识别方法及系统

Also Published As

Publication number Publication date
CN103971686A (zh) 2014-08-06
US20140236591A1 (en) 2014-08-21
CA2898265A1 (en) 2014-08-07
US9472190B2 (en) 2016-10-18
AR094605A1 (es) 2015-08-12
CA2898265C (en) 2017-08-01
WO2014117577A1 (en) 2014-08-07
CN103971686B (zh) 2015-06-10
MX2015009813A (es) 2015-10-29
SG11201505405TA (en) 2015-08-28

Similar Documents

Publication Publication Date Title
MX2015009813A (es) Metodo y sistema para el reconocimiento automatico del habla.
AR125774A2 (es) Procesador de datos de audio para decodificadores de audio y/o renderizadores y método para procesar datos de audio
MX2018001498A (es) Control de una nube de dispositivos.
WO2014144395A3 (en) User training by intelligent digital assistant
MX2017015597A (es) Provision de elementos de mensaje aumentados en hilos de comunicacion electronica.
PH12019501745A1 (en) Service data processing method and device, and service processing method and device
CL2018001771A1 (es) Tecnologías de red
BR112017006053A2 (pt) método de comunicação e aparelho
MX2017002593A (es) Transformacion de secuencias de eventos.
MX2017001004A (es) Dimensionamiento y posicionamiento adaptativos de ventanas de aplicacion.
WO2014121234A3 (en) Method and apparatus for contextual text to speech conversion
MX350012B (es) Plantillas de búsqueda por el cliente para redes sociales en línea.
PH12015000372A1 (en) Conversion of documents of different types to a uniform and an editable or a searchable format
BR112016024153A2 (pt) método e sistema para implementar uma carteira digital sem fio
CL2017002238A1 (es) Método de tratamiento con tradipitant
BR112016030050A2 (pt) modulação de pulso assíncrono para codificação de sinal baseada em limite
MY185366A (en) Audio information processing method and device
AR098687A1 (es) Sistema y método para tratar una formación subterránea con una composición de desvío
WO2018212584A3 (ko) 딥 뉴럴 네트워크를 이용하여 문장이 속하는 클래스를 분류하는 방법 및 장치
PH12017501843A1 (en) Systems and methods for customized load control
MX2017012465A (es) Dispositivo de transmision, metodo de transmision, dispositivo de recepcion, y metodo de recepcion.
SA520411109B1 (ar) نظم، وأجهزة، وطرق لفصل الماء في أسفل البئر
BR112017001950A2 (pt) sistema e método de monitoramento configuráveis
TW201613308A (en) Method and gateway for controlling external device and device connected to gateway
MX2018008126A (es) Aparato de transmision, metodo de transmision, aparato de recepcion, y metodo de recepcion.

Legal Events

Date Code Title Description
FG Grant or registration