CN104112447B - 提高统计语言模型准确度的方法及系统 - Google Patents
提高统计语言模型准确度的方法及系统 Download PDFInfo
- Publication number
- CN104112447B CN104112447B CN201410366038.0A CN201410366038A CN104112447B CN 104112447 B CN104112447 B CN 104112447B CN 201410366038 A CN201410366038 A CN 201410366038A CN 104112447 B CN104112447 B CN 104112447B
- Authority
- CN
- China
- Prior art keywords
- language model
- msub
- mrow
- language
- training set
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 25
- 238000012549 training Methods 0.000 claims abstract description 68
- 230000006870 function Effects 0.000 claims abstract description 22
- 239000000463 material Substances 0.000 claims description 35
- 230000000717 retained effect Effects 0.000 claims description 7
- 230000014509 gene expression Effects 0.000 claims description 5
- 238000005457 optimization Methods 0.000 claims description 5
- 235000003140 Panax quinquefolius Nutrition 0.000 claims description 2
- 241000208340 Araliaceae Species 0.000 claims 1
- 235000005035 Panax pseudoginseng ssp. pseudoginseng Nutrition 0.000 claims 1
- 235000008434 ginseng Nutrition 0.000 claims 1
- 238000009499 grossing Methods 0.000 description 6
- 239000000203 mixture Substances 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 238000013179 statistical model Methods 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 241000208125 Nicotiana Species 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- XCWPUUGSGHNIDZ-UHFFFAOYSA-N Oxypertine Chemical compound C1=2C=C(OC)C(OC)=CC=2NC(C)=C1CCN(CC1)CCN1C1=CC=CC=C1 XCWPUUGSGHNIDZ-UHFFFAOYSA-N 0.000 description 1
- 240000005373 Panax quinquefolius Species 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000005194 fractionation Methods 0.000 description 1
- 238000003058 natural language processing Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Landscapes
- Machine Translation (AREA)
Abstract
Description
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410366038.0A CN104112447B (zh) | 2014-07-28 | 2014-07-28 | 提高统计语言模型准确度的方法及系统 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410366038.0A CN104112447B (zh) | 2014-07-28 | 2014-07-28 | 提高统计语言模型准确度的方法及系统 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104112447A CN104112447A (zh) | 2014-10-22 |
CN104112447B true CN104112447B (zh) | 2017-08-25 |
Family
ID=51709208
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410366038.0A Active CN104112447B (zh) | 2014-07-28 | 2014-07-28 | 提高统计语言模型准确度的方法及系统 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104112447B (zh) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101833547A (zh) * | 2009-03-09 | 2010-09-15 | 三星电子(中国)研发中心 | 基于个人语料库进行短语级预测输入的方法 |
CN102509549A (zh) * | 2011-09-28 | 2012-06-20 | 盛乐信息技术(上海)有限公司 | 语言模型训练方法及系统 |
CN103294817A (zh) * | 2013-06-13 | 2013-09-11 | 华东师范大学 | 一种基于类别分布概率的文本特征抽取方法 |
CN103870447A (zh) * | 2014-03-11 | 2014-06-18 | 北京优捷信达信息科技有限公司 | 一种基于隐含狄利克雷模型的关键词抽取方法 |
CN103885938A (zh) * | 2014-04-14 | 2014-06-25 | 东南大学 | 基于用户反馈的行业拼写错误检查方法 |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20120284308A1 (en) * | 2011-05-02 | 2012-11-08 | Vistaprint Technologies Limited | Statistical spell checker |
-
2014
- 2014-07-28 CN CN201410366038.0A patent/CN104112447B/zh active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101833547A (zh) * | 2009-03-09 | 2010-09-15 | 三星电子(中国)研发中心 | 基于个人语料库进行短语级预测输入的方法 |
CN102509549A (zh) * | 2011-09-28 | 2012-06-20 | 盛乐信息技术(上海)有限公司 | 语言模型训练方法及系统 |
CN103294817A (zh) * | 2013-06-13 | 2013-09-11 | 华东师范大学 | 一种基于类别分布概率的文本特征抽取方法 |
CN103870447A (zh) * | 2014-03-11 | 2014-06-18 | 北京优捷信达信息科技有限公司 | 一种基于隐含狄利克雷模型的关键词抽取方法 |
CN103885938A (zh) * | 2014-04-14 | 2014-06-25 | 东南大学 | 基于用户反馈的行业拼写错误检查方法 |
Also Published As
Publication number | Publication date |
---|---|
CN104112447A (zh) | 2014-10-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6972265B2 (ja) | ポインタセンチネル混合アーキテクチャ | |
CN108363790B (zh) | 用于对评论进行评估的方法、装置、设备和存储介质 | |
CN110704621B (zh) | 文本处理方法、装置及存储介质和电子设备 | |
CN106815252A (zh) | 一种搜索方法和设备 | |
CN110377740A (zh) | 情感极性分析方法、装置、电子设备及存储介质 | |
CN111221962B (zh) | 一种基于新词扩展与复杂句式扩展的文本情感分析方法 | |
CN106095834A (zh) | 基于话题的智能对话方法及系统 | |
CN107480143A (zh) | 基于上下文相关性的对话话题分割方法和系统 | |
US11803731B2 (en) | Neural architecture search with weight sharing | |
CN109829162A (zh) | 一种文本分词方法及装置 | |
CN106445915B (zh) | 一种新词发现方法及装置 | |
CN108733644B (zh) | 一种文本情感分析方法、计算机可读存储介质及终端设备 | |
CN104965821B (zh) | 一种数据标注方法及装置 | |
US11645447B2 (en) | Encoding textual information for text analysis | |
WO2022183923A1 (zh) | 短语生成方法、装置和计算机可读存储介质 | |
CN105335375B (zh) | 主题挖掘方法和装置 | |
CN105488098A (zh) | 一种基于领域差异性的新词提取方法 | |
CN105740354A (zh) | 自适应潜在狄利克雷模型选择的方法及装置 | |
CN110347833B (zh) | 一种多轮对话的分类方法 | |
CN110765758A (zh) | 一种同义句生成模型的生成方法、装置及介质 | |
US20120191740A1 (en) | Document Comparison | |
CN104112447B (zh) | 提高统计语言模型准确度的方法及系统 | |
CN109670171B (zh) | 一种基于词对非对称共现的词向量表示学习方法 | |
CN104166712A (zh) | 科技文献检索方法及系统 | |
JP2010128598A (ja) | 文書検索装置及び方法及びプログラム及びプログラムを記録した記録媒体 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20170707 Address after: 230088, Hefei province high tech Zone, 2800 innovation Avenue, 288 innovation industry park, H2 building, room two, Anhui Applicant after: Anhui Puji Information Technology Co.,Ltd. Address before: Wangjiang Road high tech Development Zone Hefei city Anhui province 230088 No. 666 Applicant before: IFLYTEK Co.,Ltd. |
|
TA01 | Transfer of patent application right | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CP01 | Change in the name or title of a patent holder |
Address after: 230088, Hefei province high tech Zone, 2800 innovation Avenue, 288 innovation industry park, H2 building, room two, Anhui Patentee after: ANHUI IFLYTEK MEDICAL INFORMATION TECHNOLOGY CO.,LTD. Address before: 230088, Hefei province high tech Zone, 2800 innovation Avenue, 288 innovation industry park, H2 building, room two, Anhui Patentee before: Anhui Puji Information Technology Co.,Ltd. |
|
CP01 | Change in the name or title of a patent holder | ||
CP03 | Change of name, title or address |
Address after: 230088 floor 23-24, building A5, No. 666, Wangjiang West Road, high tech Zone, Hefei, Anhui Province Patentee after: Anhui Xunfei Medical Co.,Ltd. Address before: Room 288, H2 / F, phase II, innovation industrial park, 2800 innovation Avenue, high tech Zone, Hefei City, Anhui Province, 230088 Patentee before: ANHUI IFLYTEK MEDICAL INFORMATION TECHNOLOGY CO.,LTD. |
|
CP03 | Change of name, title or address | ||
CP01 | Change in the name or title of a patent holder |
Address after: 230088 floor 23-24, building A5, No. 666, Wangjiang West Road, high tech Zone, Hefei, Anhui Province Patentee after: IFLYTEK Medical Technology Co.,Ltd. Address before: 230088 floor 23-24, building A5, No. 666, Wangjiang West Road, high tech Zone, Hefei, Anhui Province Patentee before: Anhui Xunfei Medical Co.,Ltd. |
|
CP01 | Change in the name or title of a patent holder |