CN108415898A - Word graph rescoring method and system for deep learning language models - Google Patents
Word graph rescoring method and system for deep learning language models
- Publication number
- CN108415898A
- Authority
- CN
- China
- Prior art keywords
- word
- node
- word sequence
- score
- extension
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
- G06F16/353—Clustering; Classification into predefined classes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
- G06F16/355—Class or cluster creation or modification
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Machine Translation (AREA)
Abstract
Description
Language model | eval2000 | eval_rt03s |
---|---|---|
3-gram | 107.18 | 96.18 |
4-gram | 76.28 | 62.45 |
LSTM | 58.73 | 44.99 |
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810054749.2A CN108415898B (zh) | 2018-01-19 | 2018-01-19 | Word graph rescoring method and system for deep learning language models |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810054749.2A CN108415898B (zh) | 2018-01-19 | 2018-01-19 | Word graph rescoring method and system for deep learning language models |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108415898A (zh) | 2018-08-17 |
CN108415898B (zh) | 2021-09-24 |
Family
ID=63125790
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810054749.2A, Active, CN108415898B (zh) | 2018-01-19 | 2018-01-19 | Word graph rescoring method and system for deep learning language models |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108415898B (zh) |
Patent Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7219058B1 (en) * | 2000-10-13 | 2007-05-15 | At&T Corp. | System and method for processing speech recognition results |
CN1499484A (zh) * | 2002-11-06 | 2004-05-26 | 北京天朗语音科技有限公司 | Chinese continuous speech recognition system |
US20080312921A1 (en) * | 2003-11-28 | 2008-12-18 | Axelrod Scott E | Speech recognition utilizing multitude of speech features |
US20070239432A1 (en) * | 2006-03-30 | 2007-10-11 | Microsoft Corporation | Common word graph based multimodal input |
CN101647021A (zh) * | 2007-04-13 | 2010-02-10 | 麻省理工学院 | Speech data retrieval apparatus, speech data retrieval method, speech data retrieval program, and computer-usable medium containing the speech data retrieval program |
CN101740024A (zh) * | 2008-11-19 | 2010-06-16 | 中国科学院自动化研究所 | Automatic spoken-language fluency evaluation method based on generalized fluency |
CN101645270A (zh) * | 2008-12-12 | 2010-02-10 | 中国科学院声学研究所 | Bidirectional speech recognition processing system and method |
US20120290302A1 (en) * | 2011-05-10 | 2012-11-15 | Yang Jyh-Her | Chinese speech recognition system and method |
CN106803422A (zh) * | 2015-11-26 | 2017-06-06 | 中国科学院声学研究所 | Language model re-evaluation method based on a long short-term memory network |
CN106856092A (zh) * | 2015-12-09 | 2017-06-16 | 中国科学院声学研究所 | Chinese speech keyword retrieval method based on a feed-forward neural network language model |
CN105513589A (zh) * | 2015-12-18 | 2016-04-20 | 百度在线网络技术(北京)有限公司 | Speech recognition method and apparatus |
CN105681920A (zh) * | 2015-12-30 | 2016-06-15 | 深圳市鹰硕音频科技有限公司 | Online teaching method and system with speech recognition function |
Non-Patent Citations (6)
Title |
---|
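ZHEHUAI CHEN et al.: "A UNIFIED CONFIDENCE MEASURE FRAMEWORK USING AUXILIARY NORMA", 《CONFERENCE: INTERNATIONAL CONFERENCE ON INTELLIGENT SCIENCE AND BIG DATA ENGINEERING》 * |
SHANKAR KUMAR et al.: "LATTICE RESCORING STRATEGIES FOR LONG SHORT TERM MEMORY LANGUAGE MODELS IN SPEECH RECOGNITION", 《HTTPS://ARXIV.ORG/ABS/1711.05448V1》 * |
尹明明: "Research on decoding techniques for continuous speech recognition", 《China Excellent Master's Theses Full-text Database, Information Science and Technology Series》 * |
左玲云 et al.: "Research on a rescoring method based on LSTM-DNN language models in conversational telephone speech recognition", 《Journal of Chongqing University of Posts and Telecommunications》 * |
张剑: "Research on recurrent neural network language model techniques in continuous speech recognition", 《China Excellent Master's Theses Full-text Database, Information Science and Technology Series》 * |
郭宇弘 et al.: "Dynamic-matching word graph generation algorithm based on weighted finite-state machines", 《Journal of Electronics & Information Technology》 * |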
Cited By (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
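CN109710087A (zh) * | 2018-12-28 | 2019-05-03 | 北京金山安全软件有限公司 | Input method model generation method and apparatus |
CN111667833A (zh) * | 2019-03-07 | 2020-09-15 | 国际商业机器公司 | Conversation-based speech recognition |
CN111667833B (zh) * | 2019-03-07 | 2023-09-22 | 国际商业机器公司 | Conversation-based speech recognition |
CN112071310B (zh) * | 2019-06-11 | 2024-05-07 | 北京地平线机器人技术研发有限公司 | Speech recognition method and apparatus, electronic device, and storage medium |
CN112071310A (zh) * | 2019-06-11 | 2020-12-11 | 北京地平线机器人技术研发有限公司 | Speech recognition method and apparatus, electronic device, and storage medium |
CN110516050A (zh) * | 2019-07-15 | 2019-11-29 | 上海文思海辉金信软件有限公司 | Method for constructing multi-path training scenarios based on a knowledge graph |
CN110797026A (zh) * | 2019-09-17 | 2020-02-14 | 腾讯科技(深圳)有限公司 | Speech recognition method, apparatus, and storage medium |
WO2021136029A1 (zh) * | 2019-12-31 | 2021-07-08 | 百果园技术(新加坡)有限公司 | Rescoring model training method and apparatus, and speech recognition method and apparatus |
CN111145733A (zh) * | 2020-01-03 | 2020-05-12 | 深圳追一科技有限公司 | Speech recognition method, apparatus, computer device, and computer-readable storage medium |
CN111145733B (zh) * | 2020-01-03 | 2023-02-28 | 深圳追一科技有限公司 | Speech recognition method, apparatus, computer device, and computer-readable storage medium |
CN111274801A (zh) * | 2020-02-25 | 2020-06-12 | 苏州跃盟信息科技有限公司 | Word segmentation method and apparatus |
CN111916058A (zh) * | 2020-06-24 | 2020-11-10 | 西安交通大学 | Speech recognition method and system based on incremental word graph rescoring |
CN111998869A (zh) * | 2020-09-29 | 2020-11-27 | 北京嘀嘀无限科技发展有限公司 | Route generation method, apparatus, electronic device, and computer-readable storage medium |
CN112102815B (zh) * | 2020-11-13 | 2021-07-13 | 深圳追一科技有限公司 | Speech recognition method, apparatus, computer device, and storage medium |
CN112102815A (zh) * | 2020-11-13 | 2020-12-18 | 深圳追一科技有限公司 | Speech recognition method, apparatus, computer device, and storage medium |
CN112885336A (zh) * | 2021-01-29 | 2021-06-01 | 深圳前海微众银行股份有限公司 | Training and recognition methods and apparatus for a speech recognition system, and electronic device |
CN112885336B (zh) * | 2021-01-29 | 2024-02-02 | 深圳前海微众银行股份有限公司 | Training and recognition methods and apparatus for a speech recognition system, and electronic device |
CN113487024A (zh) * | 2021-06-29 | 2021-10-08 | 任立椋 | Alternating sequence generation model training method and method for extracting a graph from text |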
Also Published As
Publication number | Publication date |
---|---|
CN108415898B (zh) | 2021-09-24 |
Similar Documents
Publication | Title |
---|---|
CN108415898A (zh) | Word graph rescoring method and system for deep learning language models |
US10032463B1 (en) | Speech processing with learned representation of user interaction history |
CN106683661B (zh) | Speech-based role separation method and apparatus |
US20050159952A1 (en) | Pattern matching for large vocabulary speech recognition with packed distribution and localized trellis access |
KR100486735B1 (ko) | Method of constructing an optimal-partition classification neural network, and automatic labeling method and apparatus using the optimal-partition classification neural network |
CN110263162A (zh) | Convolutional neural network, text classification method using it, and text classification apparatus |
CN104751228A (zh) | Method and system for constructing a deep neural network |
WO2005103951A1 (en) | Tree index based method for accessing automatic directory |
US20200311147A1 (en) | Sentence recommendation method and apparatus based on associated points of interest |
CN111079899A (zh) | Neural network model compression method, system, device, and medium |
CN1156820C (zh) | Recognition system using a lexical tree |
CN108388561A (zh) | Neural network machine translation method and apparatus |
WO2021040842A1 (en) | Optimizing a keyword spotting system |
JP7209330B2 (ja) | Classifier, trained model, and learning method |
CN108389575A (zh) | Audio data recognition method and system |
CN104751227A (zh) | Method and system for constructing a deep neural network |
CN109036471A (zh) | Voice endpoint detection method and device |
CN110047462B (zh) | Speech synthesis method, apparatus, and electronic device |
US7269597B2 (en) | Chart-ahead method for decision tree construction |
US6789063B1 (en) | Acoustic modeling using a two-level decision tree in a speech recognition system |
JP3176210B2 (ja) | Speech recognition method and speech recognition apparatus |
JP2012018403A (ja) | Pattern recognition method and apparatus, pattern recognition program, and recording medium therefor |
De Souza et al. | Real-time music tracking based on a weightless neural network |
CN110110294A (zh) | Dynamic reverse decoding method, apparatus, and readable storage medium |
CN112017641A (zh) | Speech processing method, apparatus, and storage medium |
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
TA01 | Transfer of patent application right | Effective date of registration: 20200622. Address after: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu. Applicant after: AI SPEECH Co.,Ltd.; Shanghai Jiaotong University Intellectual Property Management Co.,Ltd. Address before: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu. Applicant before: AI SPEECH Co.,Ltd.; SHANGHAI JIAO TONG University |
TA01 | Transfer of patent application right | Effective date of registration: 20201027. Address after: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu. Applicant after: AI SPEECH Co.,Ltd. Address before: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu. Applicant before: AI SPEECH Co.,Ltd.; Shanghai Jiaotong University Intellectual Property Management Co.,Ltd. |
CB02 | Change of applicant information | Address after: 215123 building 14, Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou City, Jiangsu Province. Applicant after: Sipic Technology Co.,Ltd. Address before: 215123 building 14, Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou City, Jiangsu Province. Applicant before: AI SPEECH Co.,Ltd. |
GR01 | Patent grant | |
PE01 | Entry into force of the registration of the contract for pledge of patent right | Denomination of invention: A Word Graph Rescoring Method and System for Deep Learning Language Models. Effective date of registration: 20230726. Granted publication date: 20210924. Pledgee: CITIC Bank Limited by Share Ltd. Suzhou branch. Pledgor: Sipic Technology Co.,Ltd. Registration number: Y2023980049433 |