CN109460552B - 基于规则和语料库的汉语语病自动检测方法及设备 - Google Patents
基于规则和语料库的汉语语病自动检测方法及设备 Download PDFInfo
- Publication number
- CN109460552B CN109460552B CN201811268613.8A CN201811268613A CN109460552B CN 109460552 B CN109460552 B CN 109460552B CN 201811268613 A CN201811268613 A CN 201811268613A CN 109460552 B CN109460552 B CN 109460552B
- Authority
- CN
- China
- Prior art keywords
- word
- corpus
- words
- node
- character
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/253—Grammatical analysis; Style critique
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A90/00—Technologies having an indirect contribution to adaptation to climate change
- Y02A90/10—Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201811268613.8A CN109460552B (zh) | 2018-10-29 | 2018-10-29 | 基于规则和语料库的汉语语病自动检测方法及设备 |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201811268613.8A CN109460552B (zh) | 2018-10-29 | 2018-10-29 | 基于规则和语料库的汉语语病自动检测方法及设备 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN109460552A CN109460552A (zh) | 2019-03-12 |
| CN109460552B true CN109460552B (zh) | 2023-04-18 |
Family
ID=65608694
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201811268613.8A Active CN109460552B (zh) | 2018-10-29 | 2018-10-29 | 基于规则和语料库的汉语语病自动检测方法及设备 |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN109460552B (zh) |
Families Citing this family (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN110765274B (zh) * | 2019-10-10 | 2023-10-24 | 东华大学 | 语音输入甲状腺超声异常描述自动生成超声报告的方法 |
| CN110781665B (zh) * | 2019-10-29 | 2023-04-07 | 腾讯科技(深圳)有限公司 | 纠错对的质量评估方法、装置、设备及存储介质 |
| CN113128226B (zh) * | 2019-12-31 | 2024-09-27 | 阿里巴巴集团控股有限公司 | 命名实体识别方法、装置、电子设备及计算机存储介质 |
| CN111428469B (zh) * | 2020-02-27 | 2023-06-16 | 宋继华 | 面向句式结构图解分析的交互式标注方法和系统 |
| CN115066679B (zh) * | 2020-03-25 | 2024-02-20 | 苏州七星天专利运营管理有限责任公司 | 一种提取专业领域内的自造术语的方法及系统 |
| CN111553155B (zh) * | 2020-04-29 | 2023-05-09 | 上海交通大学 | 基于语义结构的口令分词系统及方法 |
| CN112241445B (zh) * | 2020-10-26 | 2023-11-07 | 竹间智能科技(上海)有限公司 | 一种标注方法及装置、电子设备、存储介质 |
| CN112650843A (zh) * | 2020-12-23 | 2021-04-13 | 平安银行股份有限公司 | 问答对知识库的构建方法、装置、设备及存储介质 |
| CN115587599B (zh) * | 2022-09-16 | 2023-07-14 | 粤港澳大湾区数字经济研究院(福田) | 一种机器翻译语料的质量检测方法及装置 |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH0981568A (ja) * | 1995-09-11 | 1997-03-28 | Matsushita Electric Ind Co Ltd | 機械翻訳用の中国語生成装置 |
| CN102541837A (zh) * | 2010-12-22 | 2012-07-04 | 张家港市赫图阿拉信息技术有限公司 | 一种校正输入中文拼写的方法 |
Family Cites Families (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4994966A (en) * | 1988-03-31 | 1991-02-19 | Emerson & Stern Associates, Inc. | System and method for natural language parsing by initiating processing prior to entry of complete sentences |
| CN1116342A (zh) * | 1994-07-08 | 1996-02-07 | 唐武 | 一种中文自动校对方法及其系统 |
| CN102789504A (zh) * | 2012-07-19 | 2012-11-21 | 姜赢 | 一种基于xml规则的中文语法校正方法与系统 |
| CN103500160B (zh) * | 2013-10-18 | 2016-07-06 | 大连理工大学 | 一种基于滑动语义串匹配的句法分析方法 |
| CN104391837A (zh) * | 2014-11-19 | 2015-03-04 | 熊玮 | 一种基于格语义的智能语法分析方法 |
| CN105279149A (zh) * | 2015-10-21 | 2016-01-27 | 上海应用技术学院 | 一种中文文本自动校正方法 |
| CN106598951B (zh) * | 2016-12-23 | 2019-08-16 | 北京金山办公软件股份有限公司 | 一种依存结构树库获取方法及系统 |
| CN106844348B (zh) * | 2017-02-13 | 2020-01-17 | 哈尔滨工业大学 | 一种汉语句子功能成分分析方法 |
-
2018
- 2018-10-29 CN CN201811268613.8A patent/CN109460552B/zh active Active
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH0981568A (ja) * | 1995-09-11 | 1997-03-28 | Matsushita Electric Ind Co Ltd | 機械翻訳用の中国語生成装置 |
| CN102541837A (zh) * | 2010-12-22 | 2012-07-04 | 张家港市赫图阿拉信息技术有限公司 | 一种校正输入中文拼写的方法 |
Also Published As
| Publication number | Publication date |
|---|---|
| CN109460552A (zh) | 2019-03-12 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN109460552B (zh) | 基于规则和语料库的汉语语病自动检测方法及设备 | |
| Joty et al. | Combining intra-and multi-sentential rhetorical parsing for document-level discourse analysis | |
| JP5356197B2 (ja) | 単語意味関係抽出装置 | |
| JP5362353B2 (ja) | 文書中のコロケーション誤りを処理すること | |
| Jabbar et al. | An improved Urdu stemming algorithm for text mining based on multi-step hybrid approach | |
| Chang et al. | Error diagnosis of Chinese sentences using inductive learning algorithm and decomposition-based testing mechanism | |
| Van Der Goot et al. | Lexical normalization for code-switched data and its effect on POS tagging | |
| Huo et al. | ARCLIN: automated API mention resolution for unformatted texts | |
| Özateş et al. | A hybrid deep dependency parsing approach enhanced with rules and morphology: A case study for Turkish | |
| RU61442U1 (ru) | Система автоматизированного упорядочения неструктурированного информационного потока входных данных | |
| Uchimoto et al. | Morphological analysis of the Corpus of Spontaneous Japanese | |
| Duran et al. | Some issues on the normalization of a corpus of products reviews in Portuguese | |
| Masanti et al. | Novel benchmark data set for automatic error detection and correction | |
| CN115034209A (zh) | 文本分析方法、装置、电子设备以及存储介质 | |
| Boulaknadel et al. | Amazighe Named Entity Recognition using a A rule based approach | |
| Elsaid et al. | Abstractive arabic text summarization based on mt5 and arabart transformers | |
| Trye et al. | A hybrid architecture for labelling bilingual māori-english tweets | |
| Chakraborty et al. | Syntactic category based assamese question pattern extraction using n-grams | |
| Salam et al. | Developing the bangladeshi national corpus-a balanced and representative bangla corpus | |
| Shekhar et al. | Computational linguistic retrieval framework using negative bootstrapping for retrieving transliteration variants | |
| Gupta et al. | Identification and extraction of multiword expressions from Hindi & Urdu language in natural language processing | |
| Sankaravelayuthan et al. | English to Tamil machine translation system using parallel corpus | |
| Ljunglöf et al. | Assessing the quality of Språkbanken’s annotations | |
| Born | Applications of natural language processing to archaeological decipherment: A survey of proto-Elamite | |
| Zitouni et al. | Cross-language information propagation for arabic mention detection |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant | ||
| OL01 | Intention to license declared | ||
| OL01 | Intention to license declared | ||
| EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20190312 Contract record no.: X2026980004264 Denomination of invention: Automatic detection method and equipment for Chinese language disorders based on rules and corpora Granted publication date: 20230418 License type: Open License Record date: 20260402 |
|
| EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20190312 Contract record no.: X2026980004733 Denomination of invention: Automatic detection method and equipment for Chinese language disorders based on rules and corpora Granted publication date: 20230418 License type: Open License Record date: 20260410 Application publication date: 20190312 Contract record no.: X2026980004736 Denomination of invention: Automatic detection method and equipment for Chinese language disorders based on rules and corpora Granted publication date: 20230418 License type: Open License Record date: 20260410 |
|
| EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20190312 Contract record no.: X2026980005441 Denomination of invention: Automatic detection method and equipment for Chinese language disorders based on rules and corpora Granted publication date: 20230418 License type: Open License Record date: 20260415 |
































