CN104991889A - Fuzzy word segmentation based non-multi-character word error automatic proofreading method - Google Patents
Fuzzy word segmentation based non-multi-character word error automatic proofreading method Download PDFInfo
- Publication number
- CN104991889A CN104991889A CN201510361877.8A CN201510361877A CN104991889A CN 104991889 A CN104991889 A CN 104991889A CN 201510361877 A CN201510361877 A CN 201510361877A CN 104991889 A CN104991889 A CN 104991889A
- Authority
- CN
- China
- Prior art keywords
- word
- character
- fuzzy
- participle
- similarity
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Landscapes
- Document Processing Apparatus (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (9)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510361877.8A CN104991889B (en) | 2015-06-26 | 2015-06-26 | A kind of non-multi-character word error auto-collation based on fuzzy participle |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510361877.8A CN104991889B (en) | 2015-06-26 | 2015-06-26 | A kind of non-multi-character word error auto-collation based on fuzzy participle |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104991889A true CN104991889A (en) | 2015-10-21 |
CN104991889B CN104991889B (en) | 2018-02-02 |
Family
ID=54303705
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510361877.8A Active CN104991889B (en) | 2015-06-26 | 2015-06-26 | A kind of non-multi-character word error auto-collation based on fuzzy participle |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104991889B (en) |
Cited By (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105512110A (en) * | 2015-12-15 | 2016-04-20 | 江苏科技大学 | Wrong word knowledge base construction method based on fuzzy matching and statistics |
CN105573979A (en) * | 2015-12-10 | 2016-05-11 | 江苏科技大学 | Chinese character confusion set based wrong word knowledge generation method |
CN106527757A (en) * | 2016-10-28 | 2017-03-22 | 上海智臻智能网络科技股份有限公司 | Input error correction method and apparatus |
CN106528532A (en) * | 2016-11-07 | 2017-03-22 | 上海智臻智能网络科技股份有限公司 | Text error correction method and device and terminal |
CN106547741A (en) * | 2016-11-21 | 2017-03-29 | 江苏科技大学 | A kind of Chinese language text auto-collation based on collocation |
CN106598939A (en) * | 2016-10-21 | 2017-04-26 | 北京三快在线科技有限公司 | Method and device for text error correction, server and storage medium |
CN106610953A (en) * | 2016-09-30 | 2017-05-03 | 四川用联信息技术有限公司 | Method for solving text similarity based on Gini index |
CN108572998A (en) * | 2017-03-14 | 2018-09-25 | 北京橙鑫数据科技有限公司 | A kind of data search method and device for electronic card data |
CN108717412A (en) * | 2018-06-12 | 2018-10-30 | 北京览群智数据科技有限责任公司 | Chinese check and correction error correction method based on Chinese word segmentation and system |
CN108766437A (en) * | 2018-05-31 | 2018-11-06 | 平安科技(深圳)有限公司 | Audio recognition method, device, computer equipment and storage medium |
CN109492202A (en) * | 2018-11-12 | 2019-03-19 | 浙江大学山东工业技术研究院 | A kind of Chinese error correction of coding and decoded model based on phonetic |
CN109558596A (en) * | 2018-12-14 | 2019-04-02 | 平安城市建设科技(深圳)有限公司 | Recognition methods, device, terminal and computer readable storage medium |
CN109657738A (en) * | 2018-10-25 | 2019-04-19 | 平安科技(深圳)有限公司 | Character identifying method, device, equipment and storage medium |
CN110020005A (en) * | 2019-03-28 | 2019-07-16 | 云知声(上海)智能科技有限公司 | Symptom matching process in main suit and present illness history in a kind of case history |
CN111209748A (en) * | 2019-12-16 | 2020-05-29 | 合肥讯飞数码科技有限公司 | Wrong-recognized word recognition method, related equipment and readable storage medium |
CN112765318A (en) * | 2021-01-20 | 2021-05-07 | 阅尔基因技术(苏州)有限公司 | Natural language processing method and system for infertility clinical phenotype information |
CN112954387A (en) * | 2021-01-26 | 2021-06-11 | 广州欢网科技有限责任公司 | Method, system and readable storage medium for updating and optimizing television program list |
CN113033193A (en) * | 2021-01-20 | 2021-06-25 | 山谷网安科技股份有限公司 | C + + language-based mixed Chinese text word segmentation method |
CN114091436A (en) * | 2022-01-21 | 2022-02-25 | 万商云集(成都)科技股份有限公司 | Sensitive word detection method based on decision tree and variant recognition |
CN114781371A (en) * | 2022-04-07 | 2022-07-22 | 山东新一代信息产业技术研究院有限公司 | Chinese word segmentation method based on statistics and dictionary |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1514387A (en) * | 2002-12-31 | 2004-07-21 | 中国科学院计算技术研究所 | Sound distinguishing method in speech sound inquiry |
CN102393850A (en) * | 2011-07-22 | 2012-03-28 | 镇江诺尼基智能技术有限公司 | Chinese character pattern cognition similarity computing method |
-
2015
- 2015-06-26 CN CN201510361877.8A patent/CN104991889B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1514387A (en) * | 2002-12-31 | 2004-07-21 | 中国科学院计算技术研究所 | Sound distinguishing method in speech sound inquiry |
CN102393850A (en) * | 2011-07-22 | 2012-03-28 | 镇江诺尼基智能技术有限公司 | Chinese character pattern cognition similarity computing method |
Non-Patent Citations (7)
Title |
---|
刘亮亮 等: "领域问答系统中的文本错误自动发现方法", 《中文信息学报》 * |
张仰森 等: "基于规则与统计相结合的中文文本自动查错模型与算法", 《中文信息学报》 * |
张华平 等: "基于N-最短路径方法的中文词语粗分模型", 《中文信息学报》 * |
张磊 等: "基于快速模糊词匹配算法的中文自动校对方法", 《PROCEEDINGS OF THE 3RD WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION》 * |
施恒利 等: "汉字种子混淆集的构建方法研究", 《计算机科学》 * |
施恒利: "汉字种子混淆集的构建方法研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
王思力 等: "双数组Trie树算法优化及其应用研究", 《中文信息学报》 * |
Cited By (30)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105573979B (en) * | 2015-12-10 | 2018-05-22 | 江苏科技大学 | A kind of wrongly written character word knowledge generation method that collection is obscured based on Chinese character |
CN105573979A (en) * | 2015-12-10 | 2016-05-11 | 江苏科技大学 | Chinese character confusion set based wrong word knowledge generation method |
CN105512110A (en) * | 2015-12-15 | 2016-04-20 | 江苏科技大学 | Wrong word knowledge base construction method based on fuzzy matching and statistics |
CN105512110B (en) * | 2015-12-15 | 2018-04-06 | 江苏科技大学 | A kind of wrongly written character word construction of knowledge base method based on fuzzy matching with statistics |
CN106610953A (en) * | 2016-09-30 | 2017-05-03 | 四川用联信息技术有限公司 | Method for solving text similarity based on Gini index |
CN106598939B (en) * | 2016-10-21 | 2019-09-17 | 北京三快在线科技有限公司 | A kind of text error correction method and device, server, storage medium |
CN106598939A (en) * | 2016-10-21 | 2017-04-26 | 北京三快在线科技有限公司 | Method and device for text error correction, server and storage medium |
CN106527757A (en) * | 2016-10-28 | 2017-03-22 | 上海智臻智能网络科技股份有限公司 | Input error correction method and apparatus |
CN106528532B (en) * | 2016-11-07 | 2019-03-12 | 上海智臻智能网络科技股份有限公司 | Text error correction method, device and terminal |
CN106528532A (en) * | 2016-11-07 | 2017-03-22 | 上海智臻智能网络科技股份有限公司 | Text error correction method and device and terminal |
CN106547741A (en) * | 2016-11-21 | 2017-03-29 | 江苏科技大学 | A kind of Chinese language text auto-collation based on collocation |
CN108572998A (en) * | 2017-03-14 | 2018-09-25 | 北京橙鑫数据科技有限公司 | A kind of data search method and device for electronic card data |
CN108766437A (en) * | 2018-05-31 | 2018-11-06 | 平安科技(深圳)有限公司 | Audio recognition method, device, computer equipment and storage medium |
CN108717412A (en) * | 2018-06-12 | 2018-10-30 | 北京览群智数据科技有限责任公司 | Chinese check and correction error correction method based on Chinese word segmentation and system |
CN109657738A (en) * | 2018-10-25 | 2019-04-19 | 平安科技(深圳)有限公司 | Character identifying method, device, equipment and storage medium |
CN109657738B (en) * | 2018-10-25 | 2024-04-30 | 平安科技(深圳)有限公司 | Character recognition method, device, equipment and storage medium |
WO2020082562A1 (en) * | 2018-10-25 | 2020-04-30 | 平安科技(深圳)有限公司 | Symbol identification method, apparatus, device, and storage medium |
CN109492202B (en) * | 2018-11-12 | 2022-12-27 | 浙江大学山东工业技术研究院 | Chinese error correction method based on pinyin coding and decoding model |
CN109492202A (en) * | 2018-11-12 | 2019-03-19 | 浙江大学山东工业技术研究院 | A kind of Chinese error correction of coding and decoded model based on phonetic |
CN109558596A (en) * | 2018-12-14 | 2019-04-02 | 平安城市建设科技(深圳)有限公司 | Recognition methods, device, terminal and computer readable storage medium |
CN110020005B (en) * | 2019-03-28 | 2021-03-26 | 云知声(上海)智能科技有限公司 | Method for matching main complaints in medical records with symptoms in current medical history |
CN110020005A (en) * | 2019-03-28 | 2019-07-16 | 云知声(上海)智能科技有限公司 | Symptom matching process in main suit and present illness history in a kind of case history |
CN111209748A (en) * | 2019-12-16 | 2020-05-29 | 合肥讯飞数码科技有限公司 | Wrong-recognized word recognition method, related equipment and readable storage medium |
CN111209748B (en) * | 2019-12-16 | 2023-10-24 | 合肥讯飞数码科技有限公司 | Error word recognition method, related device and readable storage medium |
CN112765318A (en) * | 2021-01-20 | 2021-05-07 | 阅尔基因技术(苏州)有限公司 | Natural language processing method and system for infertility clinical phenotype information |
CN113033193A (en) * | 2021-01-20 | 2021-06-25 | 山谷网安科技股份有限公司 | C + + language-based mixed Chinese text word segmentation method |
CN113033193B (en) * | 2021-01-20 | 2024-04-16 | 山谷网安科技股份有限公司 | Mixed Chinese text word segmentation method based on C++ language |
CN112954387A (en) * | 2021-01-26 | 2021-06-11 | 广州欢网科技有限责任公司 | Method, system and readable storage medium for updating and optimizing television program list |
CN114091436A (en) * | 2022-01-21 | 2022-02-25 | 万商云集(成都)科技股份有限公司 | Sensitive word detection method based on decision tree and variant recognition |
CN114781371A (en) * | 2022-04-07 | 2022-07-22 | 山东新一代信息产业技术研究院有限公司 | Chinese word segmentation method based on statistics and dictionary |
Also Published As
Publication number | Publication date |
---|---|
CN104991889B (en) | 2018-02-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104991889A (en) | Fuzzy word segmentation based non-multi-character word error automatic proofreading method | |
CN105045778A (en) | Chinese homonym error auto-proofreading method | |
Ling et al. | Latent predictor networks for code generation | |
CN112016304A (en) | Text error correction method and device, electronic equipment and storage medium | |
CN112801010A (en) | Visual rich document information extraction method for actual OCR scene | |
CN106528526B (en) | A kind of Chinese address semanteme marking method based on Bayes's segmentation methods | |
CN105279149A (en) | Chinese text automatic correction method | |
CN108519974A (en) | English composition automatic detection of syntax error and analysis method | |
CN103020022A (en) | Chinese unregistered word recognition system and method based on improvement information entropy characteristics | |
CN105512110A (en) | Wrong word knowledge base construction method based on fuzzy matching and statistics | |
CN110276069A (en) | A kind of Chinese braille mistake automatic testing method, system and storage medium | |
CN112364623A (en) | Bi-LSTM-CRF-based three-in-one word notation Chinese lexical analysis method | |
CN111444706A (en) | Referee document text error correction method and system based on deep learning | |
CN100543735C (en) | File similarity measure method based on file structure | |
CN104699797A (en) | Webpage data structured analytic method and device | |
CN106610937A (en) | Information theory-based Chinese automatic word segmentation method | |
CN105824800A (en) | Automatic Chinese real word error proofreading method | |
CN107832297A (en) | A kind of field sentiment dictionary construction method of Feature Oriented word granularity | |
CN110222338A (en) | A kind of mechanism name entity recognition method | |
CN111428501A (en) | Named entity recognition method, recognition system and computer readable storage medium | |
CN110705261B (en) | Chinese text word segmentation method and system thereof | |
CN106528863A (en) | Training and technology of CRF recognizer and method for extracting attribute name relation pairs of CRF recognizer | |
CN103714053B (en) | Japanese verb identification method for machine translation | |
CN104572618A (en) | Question-answering system semantic-based similarity analyzing method, system and application | |
CN108763218A (en) | A kind of video display retrieval entity recognition method based on CRF |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
EE01 | Entry into force of recordation of patent licensing contract | ||
EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20151021 Assignee: JIANGSU KEDA HUIFENG SCIENCE AND TECHNOLOGY Co.,Ltd. Assignor: JIANGSU University OF SCIENCE AND TECHNOLOGY Contract record no.: X2020980007325 Denomination of invention: An automatic proofreading method for non multi word errors based on fuzzy segmentation Granted publication date: 20180202 License type: Common License Record date: 20201029 |
|
EC01 | Cancellation of recordation of patent licensing contract | ||
EC01 | Cancellation of recordation of patent licensing contract |
Assignee: JIANGSU KEDA HUIFENG SCIENCE AND TECHNOLOGY Co.,Ltd. Assignor: JIANGSU University OF SCIENCE AND TECHNOLOGY Contract record no.: X2020980007325 Date of cancellation: 20201223 |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20221222 Address after: Room 02A-084, Building C (Second Floor), No. 28, Xinxi Road, Haidian District, Beijing 100085 Patentee after: Jingchuang United (Beijing) Intellectual Property Service Co.,Ltd. Address before: 212003, No. 2, Mengxi Road, Zhenjiang, Jiangsu Patentee before: JIANGSU University OF SCIENCE AND TECHNOLOGY Effective date of registration: 20221222 Address after: Room 606-609, Compound Office Complex Building, No. 757, Dongfeng East Road, Yuexiu District, Guangzhou, Guangdong Province, 510699 Patentee after: China Southern Power Grid Internet Service Co.,Ltd. Address before: Room 02A-084, Building C (Second Floor), No. 28, Xinxi Road, Haidian District, Beijing 100085 Patentee before: Jingchuang United (Beijing) Intellectual Property Service Co.,Ltd. |