CN112037792B - 一种语音识别方法、装置、电子设备及存储介质 - Google Patents
一种语音识别方法、装置、电子设备及存储介质 Download PDFInfo
- Publication number
- CN112037792B CN112037792B CN202010842909.7A CN202010842909A CN112037792B CN 112037792 B CN112037792 B CN 112037792B CN 202010842909 A CN202010842909 A CN 202010842909A CN 112037792 B CN112037792 B CN 112037792B
- Authority
- CN
- China
- Prior art keywords
- industry
- text information
- word
- communication range
- preset
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 77
- 239000013598 vector Substances 0.000 claims abstract description 267
- 230000006854 communication Effects 0.000 claims abstract description 207
- 238000004891 communication Methods 0.000 claims abstract description 203
- 238000012512 characterization method Methods 0.000 claims abstract description 125
- 238000000605 extraction Methods 0.000 claims description 9
- 238000005516 engineering process Methods 0.000 claims description 7
- 238000001914 filtration Methods 0.000 claims description 6
- 238000012549 training Methods 0.000 claims description 5
- 230000008030 elimination Effects 0.000 claims 1
- 238000003379 elimination reaction Methods 0.000 claims 1
- 230000000875 corresponding effect Effects 0.000 description 66
- 230000008569 process Effects 0.000 description 11
- 230000006870 function Effects 0.000 description 9
- 238000004590 computer program Methods 0.000 description 8
- 238000010586 diagram Methods 0.000 description 8
- 230000003287 optical effect Effects 0.000 description 6
- 230000009286 beneficial effect Effects 0.000 description 5
- 238000012545 processing Methods 0.000 description 5
- 101150054987 ChAT gene Proteins 0.000 description 4
- 101100203187 Mus musculus Sh2d3c gene Proteins 0.000 description 4
- 238000003058 natural language processing Methods 0.000 description 4
- 230000003993 interaction Effects 0.000 description 3
- 230000002596 correlated effect Effects 0.000 description 2
- 239000013307 optical fiber Substances 0.000 description 2
- 230000000644 propagated effect Effects 0.000 description 2
- 239000004065 semiconductor Substances 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Probability & Statistics with Applications (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (14)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010842909.7A CN112037792B (zh) | 2020-08-20 | 2020-08-20 | 一种语音识别方法、装置、电子设备及存储介质 |
PCT/CN2021/112754 WO2022037526A1 (zh) | 2020-08-20 | 2021-08-16 | 一种语音识别方法、装置、电子设备及存储介质 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010842909.7A CN112037792B (zh) | 2020-08-20 | 2020-08-20 | 一种语音识别方法、装置、电子设备及存储介质 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112037792A CN112037792A (zh) | 2020-12-04 |
CN112037792B true CN112037792B (zh) | 2022-06-17 |
Family
ID=73579934
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010842909.7A Active CN112037792B (zh) | 2020-08-20 | 2020-08-20 | 一种语音识别方法、装置、电子设备及存储介质 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN112037792B (zh) |
WO (1) | WO2022037526A1 (zh) |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111161739B (zh) * | 2019-12-28 | 2023-01-17 | 科大讯飞股份有限公司 | 语音识别方法及相关产品 |
CN112037792B (zh) * | 2020-08-20 | 2022-06-17 | 北京字节跳动网络技术有限公司 | 一种语音识别方法、装置、电子设备及存储介质 |
CN112509567B (zh) * | 2020-12-25 | 2024-05-10 | 阿波罗智联(北京)科技有限公司 | 语音数据处理的方法、装置、设备、存储介质及程序产品 |
CN113077789A (zh) * | 2021-03-29 | 2021-07-06 | 南北联合信息科技有限公司 | 语音实时转化方法、系统、计算机设备及存储介质 |
CN113241070B (zh) * | 2021-04-28 | 2024-02-27 | 北京字跳网络技术有限公司 | 热词召回及更新方法、装置、存储介质和热词系统 |
CN113377904B (zh) * | 2021-06-04 | 2024-05-10 | 百度在线网络技术(北京)有限公司 | 行业动作识别方法、装置、电子设备及存储介质 |
CN114996506A (zh) * | 2022-05-24 | 2022-09-02 | 腾讯科技(深圳)有限公司 | 语料生成方法、装置、电子设备和计算机可读存储介质 |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1999003092A2 (en) * | 1997-07-07 | 1999-01-21 | Motorola Inc. | Modular speech recognition system and method |
CN103700370A (zh) * | 2013-12-04 | 2014-04-02 | 北京中科模识科技有限公司 | 一种广播电视语音识别系统方法及系统 |
WO2018049960A1 (zh) * | 2016-09-14 | 2018-03-22 | 厦门幻世网络科技有限公司 | 一种为文本信息匹配资源的方法及装置 |
CN109727598A (zh) * | 2018-12-28 | 2019-05-07 | 浙江省公众信息产业有限公司 | 大噪音语境下的意图识别方法 |
WO2019171128A1 (en) * | 2018-03-06 | 2019-09-12 | Yogesh Chunilal Rathod | In-media and with controls advertisement, ephemeral, actionable and multi page photo filters on photo, automated integration of external contents, automated feed scrolling, template based advertisement post and actions and reaction controls on recognized objects in photo or video |
CN111274783A (zh) * | 2020-01-14 | 2020-06-12 | 广州供电局有限公司 | 一种基于语义相似分析的围串标智能识别方法 |
Family Cites Families (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2058800B1 (en) * | 2007-10-24 | 2010-09-01 | Harman Becker Automotive Systems GmbH | Method and system for recognizing speech for searching a database |
CN102867511A (zh) * | 2011-07-04 | 2013-01-09 | 余喆 | 自然语音识别方法和装置 |
CN103514170B (zh) * | 2012-06-20 | 2017-03-29 | 中国移动通信集团安徽有限公司 | 一种语音识别的文本分类方法和装置 |
CN103236260B (zh) * | 2013-03-29 | 2015-08-12 | 京东方科技集团股份有限公司 | 语音识别系统 |
CN106683662A (zh) * | 2015-11-10 | 2017-05-17 | 中国电信股份有限公司 | 一种语音识别方法和装置 |
US20180143970A1 (en) * | 2016-11-18 | 2018-05-24 | Microsoft Technology Licensing, Llc | Contextual dictionary for transcription |
KR101965880B1 (ko) * | 2017-03-30 | 2019-04-04 | 엘지전자 주식회사 | 음성 인식 방법 |
CN108847241B (zh) * | 2018-06-07 | 2022-09-13 | 平安科技(深圳)有限公司 | 将会议语音识别为文本的方法、电子设备及存储介质 |
CN109190125A (zh) * | 2018-09-14 | 2019-01-11 | 广州达美智能科技有限公司 | 医学语言文本的处理方法、装置和存储介质 |
KR20200074349A (ko) * | 2018-12-14 | 2020-06-25 | 삼성전자주식회사 | 음성을 인식하기 위한 방법 및 장치 |
CN109410923B (zh) * | 2018-12-26 | 2022-06-10 | 中国联合网络通信集团有限公司 | 语音识别方法、装置、系统及存储介质 |
CN110544477A (zh) * | 2019-09-29 | 2019-12-06 | 北京声智科技有限公司 | 一种语音识别方法、装置、设备及介质 |
CN112037792B (zh) * | 2020-08-20 | 2022-06-17 | 北京字节跳动网络技术有限公司 | 一种语音识别方法、装置、电子设备及存储介质 |
-
2020
- 2020-08-20 CN CN202010842909.7A patent/CN112037792B/zh active Active
-
2021
- 2021-08-16 WO PCT/CN2021/112754 patent/WO2022037526A1/zh active Application Filing
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1999003092A2 (en) * | 1997-07-07 | 1999-01-21 | Motorola Inc. | Modular speech recognition system and method |
CN103700370A (zh) * | 2013-12-04 | 2014-04-02 | 北京中科模识科技有限公司 | 一种广播电视语音识别系统方法及系统 |
WO2018049960A1 (zh) * | 2016-09-14 | 2018-03-22 | 厦门幻世网络科技有限公司 | 一种为文本信息匹配资源的方法及装置 |
WO2019171128A1 (en) * | 2018-03-06 | 2019-09-12 | Yogesh Chunilal Rathod | In-media and with controls advertisement, ephemeral, actionable and multi page photo filters on photo, automated integration of external contents, automated feed scrolling, template based advertisement post and actions and reaction controls on recognized objects in photo or video |
CN109727598A (zh) * | 2018-12-28 | 2019-05-07 | 浙江省公众信息产业有限公司 | 大噪音语境下的意图识别方法 |
CN111274783A (zh) * | 2020-01-14 | 2020-06-12 | 广州供电局有限公司 | 一种基于语义相似分析的围串标智能识别方法 |
Non-Patent Citations (1)
Title |
---|
面向未来的交互信息技术――听觉视觉双模态语音识别(AVSR)(下);徐彦君等;《电子商务》;19990201(第02期);全文 * |
Also Published As
Publication number | Publication date |
---|---|
WO2022037526A1 (zh) | 2022-02-24 |
CN112037792A (zh) | 2020-12-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112037792B (zh) | 一种语音识别方法、装置、电子设备及存储介质 | |
CN113505198B (zh) | 关键词驱动的生成式对话回复方法、装置及电子设备 | |
CN110990598B (zh) | 资源检索方法、装置、电子设备及计算机可读存储介质 | |
CN113257218B (zh) | 语音合成方法、装置、电子设备和存储介质 | |
CN112906381B (zh) | 对话归属的识别方法、装置、可读介质和电子设备 | |
CN113889113A (zh) | 分句方法、装置、存储介质及电子设备 | |
CN111883117A (zh) | 语音唤醒方法及装置 | |
CN111625649A (zh) | 文本处理方法、装置、电子设备及介质 | |
CN111681661B (zh) | 语音识别的方法、装置、电子设备和计算机可读介质 | |
CN112883968A (zh) | 图像字符识别方法、装置、介质及电子设备 | |
CN112182255A (zh) | 用于存储媒体文件和用于检索媒体文件的方法和装置 | |
CN111078849A (zh) | 用于输出信息的方法和装置 | |
CN115967833A (zh) | 视频生成方法、装置、设备计存储介质 | |
CN113053362A (zh) | 语音识别的方法、装置、设备和计算机可读介质 | |
CN112242143B (zh) | 一种语音交互方法、装置、终端设备及存储介质 | |
CN113643706B (zh) | 语音识别方法、装置、电子设备及存储介质 | |
CN111460214B (zh) | 分类模型训练方法、音频分类方法、装置、介质及设备 | |
CN113033190B (zh) | 字幕生成方法、装置、介质及电子设备 | |
CN114881008A (zh) | 一种文本生成方法、装置、电子设备及介质 | |
CN111666449B (zh) | 视频检索方法、装置、电子设备和计算机可读介质 | |
CN113420723A (zh) | 获取视频热点的方法、装置、可读介质和电子设备 | |
CN113986958A (zh) | 文本信息的转换方法、装置、可读介质和电子设备 | |
CN112562733A (zh) | 媒体数据处理方法及装置、存储介质、计算机设备 | |
CN111292766B (zh) | 用于生成语音样本的方法、装置、电子设备和介质 | |
CN117034959A (zh) | 数据处理方法、装置、电子设备及存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CP03 | Change of name, title or address |
Address after: 100041 B-0035, 2 floor, 3 building, 30 Shixing street, Shijingshan District, Beijing. Patentee after: Douyin Vision Co.,Ltd. Country or region after: China Address before: 100041 B-0035, 2 floor, 3 building, 30 Shixing street, Shijingshan District, Beijing. Patentee before: Tiktok vision (Beijing) Co.,Ltd. Country or region before: China Address after: 100041 B-0035, 2 floor, 3 building, 30 Shixing street, Shijingshan District, Beijing. Patentee after: Tiktok vision (Beijing) Co.,Ltd. Country or region after: China Address before: 100041 B-0035, 2 floor, 3 building, 30 Shixing street, Shijingshan District, Beijing. Patentee before: BEIJING BYTEDANCE NETWORK TECHNOLOGY Co.,Ltd. Country or region before: China |
|
TR01 | Transfer of patent right |
Effective date of registration: 20240506 Address after: Room 201-2031, floor 2, building 1, building 2 and building 3, qinchunjiayuan, Xisanqi, Haidian District, Beijing 100096 Patentee after: Beijing Feishu Technology Co.,Ltd. Country or region after: China Address before: 100041 B-0035, 2 floor, 3 building, 30 Shixing street, Shijingshan District, Beijing. Patentee before: Douyin Vision Co.,Ltd. Country or region before: China |