CN101739393A - 汉语文本智能分词法 - Google Patents
汉语文本智能分词法 Download PDFInfo
- Publication number
- CN101739393A CN101739393A CN200810203059A CN200810203059A CN101739393A CN 101739393 A CN101739393 A CN 101739393A CN 200810203059 A CN200810203059 A CN 200810203059A CN 200810203059 A CN200810203059 A CN 200810203059A CN 101739393 A CN101739393 A CN 101739393A
- Authority
- CN
- China
- Prior art keywords
- chinese
- speech
- syllable
- word
- dictionary
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 86
- 238000005520 cutting process Methods 0.000 claims description 58
- 230000011218 segmentation Effects 0.000 claims description 38
- 230000008878 coupling Effects 0.000 claims description 20
- 238000010168 coupling process Methods 0.000 claims description 20
- 238000005859 coupling reaction Methods 0.000 claims description 20
- 150000001875 compounds Chemical class 0.000 claims description 12
- 230000008569 process Effects 0.000 claims description 11
- 239000000203 mixture Substances 0.000 claims description 7
- 239000002245 particle Substances 0.000 claims description 5
- 238000007689 inspection Methods 0.000 claims description 4
- 238000007792 addition Methods 0.000 claims description 3
- 238000012217 deletion Methods 0.000 claims description 3
- 230000037430 deletion Effects 0.000 claims description 3
- 230000036651 mood Effects 0.000 claims description 3
- 230000000750 progressive effect Effects 0.000 claims description 3
- 206010028916 Neologism Diseases 0.000 claims description 2
- 230000000295 complement effect Effects 0.000 claims description 2
- 239000002131 composite material Substances 0.000 claims description 2
- 238000012986 modification Methods 0.000 claims description 2
- 230000004048 modification Effects 0.000 claims description 2
- 239000003607 modifier Substances 0.000 claims description 2
- 230000015572 biosynthetic process Effects 0.000 abstract description 6
- 238000006243 chemical reaction Methods 0.000 abstract description 4
- 230000010365 information processing Effects 0.000 abstract description 4
- 238000013519 translation Methods 0.000 abstract description 4
- 238000003786 synthesis reaction Methods 0.000 abstract description 3
- 230000002146 bilateral effect Effects 0.000 abstract 1
- 238000010276 construction Methods 0.000 abstract 1
- 238000005192 partition Methods 0.000 description 20
- 210000004556 brain Anatomy 0.000 description 6
- 238000011161 development Methods 0.000 description 6
- 230000018109 developmental process Effects 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- 238000011160 research Methods 0.000 description 6
- 206010011469 Crying Diseases 0.000 description 3
- 230000000052 comparative effect Effects 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 238000009413 insulation Methods 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 206010011224 Cough Diseases 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 235000009508 confectionery Nutrition 0.000 description 2
- 230000004069 differentiation Effects 0.000 description 2
- 238000012913 prioritisation Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 230000033764 rhythmic process Effects 0.000 description 2
- GVGLGOZIDCSQPN-PVHGPHFFSA-N Heroin Chemical compound O([C@H]1[C@H](C=C[C@H]23)OC(C)=O)C4=C5[C@@]12CCN(C)[C@@H]3CC5=CC=C4OC(C)=O GVGLGOZIDCSQPN-PVHGPHFFSA-N 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 230000001364 causal effect Effects 0.000 description 1
- ZPUCINDJVBIVPJ-LJISPDSOSA-N cocaine Chemical compound O([C@H]1C[C@@H]2CC[C@@H](N2C)[C@H]1C(=O)OC)C(=O)C1=CC=CC=C1 ZPUCINDJVBIVPJ-LJISPDSOSA-N 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000009916 joint effect Effects 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000013011 mating Effects 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 208000014451 palmoplantar keratoderma and congenital alopecia 2 Diseases 0.000 description 1
- 230000000630 rising effect Effects 0.000 description 1
- 238000005096 rolling process Methods 0.000 description 1
- 210000000697 sensory organ Anatomy 0.000 description 1
- 230000026676 system process Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000007306 turnover Effects 0.000 description 1
Landscapes
- Document Processing Apparatus (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2008102030595A CN101739393B (zh) | 2008-11-20 | 2008-11-20 | 汉语文本智能分词法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2008102030595A CN101739393B (zh) | 2008-11-20 | 2008-11-20 | 汉语文本智能分词法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101739393A true CN101739393A (zh) | 2010-06-16 |
CN101739393B CN101739393B (zh) | 2012-07-04 |
Family
ID=42462887
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2008102030595A Expired - Fee Related CN101739393B (zh) | 2008-11-20 | 2008-11-20 | 汉语文本智能分词法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101739393B (zh) |
Cited By (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102541843A (zh) * | 2010-12-22 | 2012-07-04 | 陈本东 | 一种用于提高机器翻译质量的装置和方法 |
CN102819524A (zh) * | 2011-09-08 | 2012-12-12 | 金蝶软件(中国)有限公司 | 基于关键字的字符序列分割方法及装置 |
CN102902660A (zh) * | 2011-07-26 | 2013-01-30 | 苗玉水 | 汉语语音码全拼和简拼汉语全息信息处理方法 |
CN102982020A (zh) * | 2012-12-17 | 2013-03-20 | 杭州也要买电子商务有限公司 | 一种搜索系统中的中文分词方法 |
CN103279190A (zh) * | 2013-06-16 | 2013-09-04 | 江苏华音信息科技有限公司 | 汉语文本调用计算机程序运行的装置 |
CN104235998A (zh) * | 2013-06-13 | 2014-12-24 | 上海能感物联网有限公司 | 非特定人汉语语音遥控智能空调机的方法 |
CN105204445A (zh) * | 2014-06-08 | 2015-12-30 | 上海能感物联网有限公司 | 外语自然语文本现场控制汽车驾驶的控制器装置 |
CN105303643A (zh) * | 2014-06-08 | 2016-02-03 | 苗码信息科技(上海)股份有限公司 | 汉语文本现场自动导航并驾驶汽车的方法 |
CN105302082A (zh) * | 2014-06-08 | 2016-02-03 | 上海能感物联网有限公司 | 非特定人外语语音现场自动导航并驾驶汽车的控制器装置 |
CN105302079A (zh) * | 2014-06-08 | 2016-02-03 | 上海能感物联网有限公司 | 汉语语音现场控制汽车驾驶的控制器装置 |
CN105893353A (zh) * | 2016-04-20 | 2016-08-24 | 广东万丈金数信息技术股份有限公司 | 分词方法和分词系统 |
CN106292349A (zh) * | 2015-05-28 | 2017-01-04 | 上海能感物联网有限公司 | 汉语文本遥控舵机的方法 |
CN106547813A (zh) * | 2016-09-22 | 2017-03-29 | 苏州小璐机器人有限公司 | 一种提升机器人语音问答能力的方法 |
CN107291684A (zh) * | 2016-04-12 | 2017-10-24 | 华为技术有限公司 | 语言文本的分词方法和系统 |
CN108198484A (zh) * | 2018-01-31 | 2018-06-22 | 李勤骞 | 英语时态学习系统及对应时态学习方法 |
CN108647208A (zh) * | 2018-05-09 | 2018-10-12 | 上海应用技术大学 | 一种基于中文的新型分词方法 |
CN110909537A (zh) * | 2019-11-19 | 2020-03-24 | 曲英洲 | 现代汉语成分分析的一种人工智能方法 |
CN110941715A (zh) * | 2019-10-23 | 2020-03-31 | 北京精英系统科技有限公司 | 一种实体对象分类判断的方法 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101000764B (zh) * | 2006-12-18 | 2011-05-18 | 黑龙江大学 | 基于韵律结构的语音合成文本处理方法 |
CN101004737A (zh) * | 2007-01-24 | 2007-07-25 | 贵阳易特软件有限公司 | 基于关键词的个性化文档处理系统 |
CN101158969B (zh) * | 2007-11-23 | 2010-06-02 | 腾讯科技(深圳)有限公司 | 一种整句生成方法及装置 |
-
2008
- 2008-11-20 CN CN2008102030595A patent/CN101739393B/zh not_active Expired - Fee Related
Cited By (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102541843A (zh) * | 2010-12-22 | 2012-07-04 | 陈本东 | 一种用于提高机器翻译质量的装置和方法 |
CN102541843B (zh) * | 2010-12-22 | 2017-09-01 | 陈本东 | 一种用于提高机器翻译质量的装置和方法 |
CN102902660A (zh) * | 2011-07-26 | 2013-01-30 | 苗玉水 | 汉语语音码全拼和简拼汉语全息信息处理方法 |
CN102902660B (zh) * | 2011-07-26 | 2016-04-20 | 青海汉拉信息科技股份有限公司 | 汉语语音码全拼和混拼汉语全息信息处理方法 |
CN102819524B (zh) * | 2011-09-08 | 2015-06-03 | 金蝶软件(中国)有限公司 | 基于关键字的字符序列分割方法及装置 |
CN102819524A (zh) * | 2011-09-08 | 2012-12-12 | 金蝶软件(中国)有限公司 | 基于关键字的字符序列分割方法及装置 |
CN102982020A (zh) * | 2012-12-17 | 2013-03-20 | 杭州也要买电子商务有限公司 | 一种搜索系统中的中文分词方法 |
CN104235998A (zh) * | 2013-06-13 | 2014-12-24 | 上海能感物联网有限公司 | 非特定人汉语语音遥控智能空调机的方法 |
CN103279190A (zh) * | 2013-06-16 | 2013-09-04 | 江苏华音信息科技有限公司 | 汉语文本调用计算机程序运行的装置 |
CN103279190B (zh) * | 2013-06-16 | 2016-01-13 | 青海汉拉信息科技股份有限公司 | 汉语文本调用计算机程序运行的装置 |
CN105204445A (zh) * | 2014-06-08 | 2015-12-30 | 上海能感物联网有限公司 | 外语自然语文本现场控制汽车驾驶的控制器装置 |
CN105303643A (zh) * | 2014-06-08 | 2016-02-03 | 苗码信息科技(上海)股份有限公司 | 汉语文本现场自动导航并驾驶汽车的方法 |
CN105302082A (zh) * | 2014-06-08 | 2016-02-03 | 上海能感物联网有限公司 | 非特定人外语语音现场自动导航并驾驶汽车的控制器装置 |
CN105302079A (zh) * | 2014-06-08 | 2016-02-03 | 上海能感物联网有限公司 | 汉语语音现场控制汽车驾驶的控制器装置 |
CN106292349A (zh) * | 2015-05-28 | 2017-01-04 | 上海能感物联网有限公司 | 汉语文本遥控舵机的方法 |
CN107291684A (zh) * | 2016-04-12 | 2017-10-24 | 华为技术有限公司 | 语言文本的分词方法和系统 |
CN105893353A (zh) * | 2016-04-20 | 2016-08-24 | 广东万丈金数信息技术股份有限公司 | 分词方法和分词系统 |
CN105893353B (zh) * | 2016-04-20 | 2018-10-26 | 广东万丈金数信息技术股份有限公司 | 分词方法和分词系统 |
CN106547813A (zh) * | 2016-09-22 | 2017-03-29 | 苏州小璐机器人有限公司 | 一种提升机器人语音问答能力的方法 |
CN108198484A (zh) * | 2018-01-31 | 2018-06-22 | 李勤骞 | 英语时态学习系统及对应时态学习方法 |
CN108198484B (zh) * | 2018-01-31 | 2019-11-05 | 李勤骞 | 英语时态学习系统及对应时态学习方法 |
CN108647208A (zh) * | 2018-05-09 | 2018-10-12 | 上海应用技术大学 | 一种基于中文的新型分词方法 |
CN110941715A (zh) * | 2019-10-23 | 2020-03-31 | 北京精英系统科技有限公司 | 一种实体对象分类判断的方法 |
CN110909537A (zh) * | 2019-11-19 | 2020-03-24 | 曲英洲 | 现代汉语成分分析的一种人工智能方法 |
Also Published As
Publication number | Publication date |
---|---|
CN101739393B (zh) | 2012-07-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101739393B (zh) | 汉语文本智能分词法 | |
CN101131689B (zh) | 汉语外语句型转换双向机器翻译方法 | |
CN102902660B (zh) | 汉语语音码全拼和混拼汉语全息信息处理方法 | |
CN101118541B (zh) | 汉语语音码汉语语音识别方法 | |
Sandler | The medium and the message: Prosodic interpretation of linguistic content in Israeli Sign Language | |
CN102479208B (zh) | 汉语语音码多样网页信息搜索转换翻译方法 | |
Young et al. | Speech synthesis from concept: a method for speech output from information systems | |
CN105957518A (zh) | 一种蒙古语大词汇量连续语音识别的方法 | |
CN104756100A (zh) | 意图估计装置以及意图估计方法 | |
CN105404621A (zh) | 一种用于盲人读取汉字的方法及系统 | |
CN111489746A (zh) | 一种基于bert的电网调度语音识别语言模型构建方法 | |
Zhang | Syntax-phonology interface: argumentation from tone sandhi in Chinese dialects | |
CN114757184A (zh) | 实现航空领域知识问答的方法和系统 | |
Cardenas et al. | A morphological analyzer for Shipibo-konibo | |
Jiang et al. | Braille to print translations for Chinese | |
Arivazhagan et al. | Labeling the semantic roles of commas | |
Mrini et al. | Building the moroccan darija wordnet (mdw) using bilingual resources | |
Cao et al. | Syntactic and lexical constraint in prosodic segmentation and grouping | |
Shibatani et al. | Handbook of Japanese syntax | |
Sainz | Literacy acquisition in Spanish | |
Keenan | Large vocabulary syntactic analysis for text recognition | |
Newell et al. | Minimalism and the syntax-phonology interface | |
Mao et al. | Speech synthesis of Chinese Braille with limited training data | |
Iunn et al. | Modeling Taiwanese Southern-Min Tone Sandhi Using Rule-Based Methods | |
Ralli et al. | Greek compounds: a challenging case for the parsing techniques of PC-KIMMO v. 2 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: JIANGSU TEAISI INTELLIGENT TECHNOLOGY CO., LTD. Free format text: FORMER OWNER: MIAO YUSHUI Effective date: 20141028 |
|
C41 | Transfer of patent application or patent right or utility model | ||
COR | Change of bibliographic data |
Free format text: CORRECT: ADDRESS; FROM: 200093 YANGPU, SHANGHAI TO: 215427 SUZHOU, JIANGSU PROVINCE |
|
TR01 | Transfer of patent right |
Effective date of registration: 20141028 Address after: 215427 Suzhou Province, Taicang City, Huang Jing Town, unity street, No. 83, No. Patentee after: Jiangsu special Ace smart Polytron Technologies Inc Address before: 200093 Shanghai city Yangpu District Kongjiang village 44 room 105 Patentee before: Miao Yushui |
|
DD01 | Delivery of document by public notice | ||
DD01 | Delivery of document by public notice |
Addressee: Patent director of Jiangsu teaisi Intelligent Technology Co.,Ltd. Document name: payment instructions |
|
DD01 | Delivery of document by public notice | ||
DD01 | Delivery of document by public notice |
Addressee: Patent of Jiangsu teaisi Intelligent Technology Co.,Ltd. The person in charge Document name: Notice of termination of patent right |
|
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20120704 Termination date: 20201120 |