CN105824800B - 一种中文真词错误自动校对方法 - Google Patents
一种中文真词错误自动校对方法 Download PDFInfo
- Publication number
- CN105824800B CN105824800B CN201610145237.8A CN201610145237A CN105824800B CN 105824800 B CN105824800 B CN 105824800B CN 201610145237 A CN201610145237 A CN 201610145237A CN 105824800 B CN105824800 B CN 105824800B
- Authority
- CN
- China
- Prior art keywords
- word
- ternary
- true
- confusable
- synonym
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000007476 Maximum Likelihood Methods 0.000 claims description 5
- 238000000034 method Methods 0.000 abstract description 10
- 238000012937 correction Methods 0.000 abstract description 4
- 230000001915 proofreading effect Effects 0.000 description 10
- 238000002474 experimental method Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 2
- 238000003058 natural language processing Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 238000013473 artificial intelligence Methods 0.000 description 1
- 230000019771 cognition Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/232—Orthographic correction, e.g. spell checking or vowelisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (9)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610145237.8A CN105824800B (zh) | 2016-03-15 | 2016-03-15 | 一种中文真词错误自动校对方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610145237.8A CN105824800B (zh) | 2016-03-15 | 2016-03-15 | 一种中文真词错误自动校对方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105824800A CN105824800A (zh) | 2016-08-03 |
CN105824800B true CN105824800B (zh) | 2018-06-26 |
Family
ID=56987260
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610145237.8A Active CN105824800B (zh) | 2016-03-15 | 2016-03-15 | 一种中文真词错误自动校对方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105824800B (zh) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107577668A (zh) * | 2017-09-15 | 2018-01-12 | 电子科技大学 | 基于语义的社交媒体非规范词纠正方法 |
CN107729318B (zh) * | 2017-10-17 | 2021-04-20 | 语联网(武汉)信息技术有限公司 | 一种自动更正部分文字的方法-由中文词性判断 |
CN111259654B (zh) * | 2018-11-30 | 2023-09-15 | 北京嘀嘀无限科技发展有限公司 | 一种文本检错方法及装置 |
CN110716674B (zh) * | 2019-08-28 | 2021-07-13 | 云知声智能科技股份有限公司 | 一种电子病历缺陷定位方法和系统 |
CN110532572A (zh) * | 2019-09-12 | 2019-12-03 | 四川长虹电器股份有限公司 | 基于tan树形朴素贝叶斯的拼写检查方法 |
CN111428478B (zh) * | 2020-03-20 | 2023-08-15 | 北京百度网讯科技有限公司 | 一种词条同义判别的寻证方法、装置、设备和存储介质 |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101369265A (zh) * | 2008-01-14 | 2009-02-18 | 北京百问百答网络技术有限公司 | 一种自动生成问题的语义模板的方法和系统 |
CN102207946A (zh) * | 2010-06-29 | 2011-10-05 | 天津海量信息技术有限公司 | 一种知识网络的半自动生成方法 |
CN102956231A (zh) * | 2011-08-23 | 2013-03-06 | 上海交通大学 | 基于半自动校正的语音关键信息记录装置及方法 |
CN103020045A (zh) * | 2012-12-11 | 2013-04-03 | 中国科学院自动化研究所 | 一种基于谓词论元结构的统计机器翻译方法 |
CN103324621A (zh) * | 2012-03-21 | 2013-09-25 | 北京百度网讯科技有限公司 | 一种泰语文本拼写纠正方法及装置 |
CN104965819A (zh) * | 2015-07-12 | 2015-10-07 | 大连理工大学 | 一种基于句法词向量的生物医学事件触发词识别方法 |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7941437B2 (en) * | 2007-08-24 | 2011-05-10 | Symantec Corporation | Bayesian surety check to reduce false positives in filtering of content in non-trained languages |
-
2016
- 2016-03-15 CN CN201610145237.8A patent/CN105824800B/zh active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101369265A (zh) * | 2008-01-14 | 2009-02-18 | 北京百问百答网络技术有限公司 | 一种自动生成问题的语义模板的方法和系统 |
CN102207946A (zh) * | 2010-06-29 | 2011-10-05 | 天津海量信息技术有限公司 | 一种知识网络的半自动生成方法 |
CN102956231A (zh) * | 2011-08-23 | 2013-03-06 | 上海交通大学 | 基于半自动校正的语音关键信息记录装置及方法 |
CN103324621A (zh) * | 2012-03-21 | 2013-09-25 | 北京百度网讯科技有限公司 | 一种泰语文本拼写纠正方法及装置 |
CN103020045A (zh) * | 2012-12-11 | 2013-04-03 | 中国科学院自动化研究所 | 一种基于谓词论元结构的统计机器翻译方法 |
CN104965819A (zh) * | 2015-07-12 | 2015-10-07 | 大连理工大学 | 一种基于句法词向量的生物医学事件触发词识别方法 |
Non-Patent Citations (4)
Title |
---|
NN型复合结构的语义关系识别及相似度计算;张韬文,陆汝占;《计算机应用与软件》;20110331;第28卷(第3期);5-8 * |
中文篇章级句间语义关系识别;张牧宇 等;《中文信息学报》;20131130;第27卷(第6期);51-57 * |
机器可读词典中词汇属性信息的获取;宋孜攀,陆汝占;《计算机工程与应用》;20090211;第45卷(第5期);138-141 * |
领域问答系统中的文本错误自动发现方法;刘亮亮 等;《中文信息学报》;20130531;第27卷(第3期);77-83 * |
Also Published As
Publication number | Publication date |
---|---|
CN105824800A (zh) | 2016-08-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105824800B (zh) | 一种中文真词错误自动校对方法 | |
CN105045778B (zh) | 一种汉语同音词错误自动校对方法 | |
Ling et al. | Latent predictor networks for code generation | |
Chollampatt et al. | Neural network translation models for grammatical error correction | |
CN108304445B (zh) | 一种文本摘要生成方法和装置 | |
CN106096664B (zh) | 一种基于社交网络数据的情感分析方法 | |
US20150278195A1 (en) | Text data sentiment analysis method | |
Shaalan et al. | Arabic word generation and modelling for spell checking. | |
US9646512B2 (en) | System and method for automated teaching of languages based on frequency of syntactic models | |
CN103970765A (zh) | 一种改错模型训练方法、装置和文本改错方法、装置 | |
KR101633556B1 (ko) | 문법 오류 수정 장치 및 이를 이용한 문법 오류 수정 방법 | |
Janssen | NeoTag: a POS Tagger for Grammatical Neologism Detection. | |
US11593557B2 (en) | Domain-specific grammar correction system, server and method for academic text | |
US20070282596A1 (en) | Generating grammatical elements in natural language sentences | |
US20160217122A1 (en) | Apparatus for generating self-learning alignment-based alignment corpus, method therefor, apparatus for analyzing destructne expression morpheme by using alignment corpus, and morpheme analysis method therefor | |
CN103995853A (zh) | 基于关键句的多语言情感数据处理分类方法及系统 | |
Zhang et al. | HANSpeller++: A unified framework for Chinese spelling correction | |
CN104899335A (zh) | 一种对网络舆情信息进行情感分类的方法 | |
CN107577663A (zh) | 一种关键短语抽取方法和装置 | |
Khalifa et al. | Morphological analysis and disambiguation for Gulf Arabic: The interplay between resources and methods | |
US10515148B2 (en) | Arabic spell checking error model | |
Sagcan et al. | Toponym recognition in social media for estimating the location of events | |
Shrestha | Codeswitching detection via lexical features in conditional random fields | |
Formiga Fanals et al. | Improving English to Spanish out-of-domain translations by morphology generalization and generation | |
CN105183807A (zh) | 一种基于结构句法的情绪原因事件识别方法及系统 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
EE01 | Entry into force of recordation of patent licensing contract | ||
EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20160803 Assignee: JIANGSU KEDA HUIFENG SCIENCE AND TECHNOLOGY Co.,Ltd. Assignor: JIANGSU University OF SCIENCE AND TECHNOLOGY Contract record no.: X2020980007325 Denomination of invention: An automatic correction method for Chinese true word errors Granted publication date: 20180626 License type: Common License Record date: 20201029 |
|
EC01 | Cancellation of recordation of patent licensing contract | ||
EC01 | Cancellation of recordation of patent licensing contract |
Assignee: JIANGSU KEDA HUIFENG SCIENCE AND TECHNOLOGY Co.,Ltd. Assignor: JIANGSU University OF SCIENCE AND TECHNOLOGY Contract record no.: X2020980007325 Date of cancellation: 20201223 |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20231012 Address after: 215600, 2nd Floor, Building 1, No. 3 Xingyuan Road, Nanfeng Town, Zhangjiagang City, Suzhou City, Jiangsu Province Patentee after: Suzhou Dingyi Intelligent Technology Co.,Ltd. Address before: 212003, No. 2, Mengxi Road, Zhenjiang, Jiangsu Patentee before: JIANGSU University OF SCIENCE AND TECHNOLOGY |