CN105512106B - A method for automatic recognition of Chinese separable words - Google Patents
- Publication number
- CN105512106B
- Authority
- CN
- China
- Prior art keywords
- clutch
- word
- candidate
- mode
- dis
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/284—Lexical analysis, e.g. tokenisation or collocates
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (4)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510907012.7A CN105512106B (zh) | 2015-12-09 | 2015-12-09 | A method for automatic recognition of Chinese separable words |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510907012.7A CN105512106B (zh) | 2015-12-09 | 2015-12-09 | A method for automatic recognition of Chinese separable words |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105512106A (zh) | 2016-04-20 |
CN105512106B (zh) | 2018-04-06 |
Family
ID=55720099
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510907012.7A Active CN105512106B (zh) | A method for automatic recognition of Chinese separable words | 2015-12-09 | 2015-12-09 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105512106B (zh) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
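CN1910574A (zh) * | 2004-01-06 | 2007-02-07 | 李仁燮 | Automatic translator, method thereof, and recording medium for writing the method |
CN1991819A (zh) * | 2005-12-30 | 2007-07-04 | France Telecom R&D Beijing Co., Ltd. | Language morphological analyzer |
CN102135956A (zh) * | 2011-05-06 | 2011-07-27 | Institute of Software, Chinese Academy of Sciences | A Tibetan word segmentation method based on word-position tagging |
CN104375986A (zh) * | 2014-12-02 | 2015-02-25 | Jiangsu University of Science and Technology | A method for automatically acquiring Chinese reduplicated words |
CN104778256A (zh) * | 2015-04-20 | 2015-07-15 | Jiangsu University of Science and Technology | A fast, incremental clustering method for queries in a domain question-answering system |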
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4476609B2 (ja) * | 2003-12-10 | 2010-06-09 | Toshiba Corporation | Chinese language analysis device, Chinese language analysis method, and Chinese language analysis program |
US20090313205A1 (en) * | 2008-06-03 | 2009-12-17 | Justsystems Corporation | Table structure analyzing apparatus, table structure analyzing method, and table structure analyzing program |
- 2015-12-09: CN application CN201510907012.7A filed; patent CN105512106B, status Active
Non-Patent Citations (5)
Title |
---|
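A Magnetic Stimulation Examination of Orthographic Neighborhood Effects in Visual Word Recognition; Michal Lavidor, Vincent Walsh; Journal of Cognitive Neuroscience; 20030430; Vol. 15, No. 3; pp. 354-363 * |
Evaluating a split processing model of visual word recognition: Effects of orthographic neighborhood size; Michal Lavidor et al.; Brain and Language; 20040430; pp. 312-320 * |
Determining legal separated forms of separable trigger words based on dependency parsing; Xiao Sheng et al.; Computer Engineering and Applications; 20131226; Vol. 50, No. 10; pp. 11-17 * |
A corpus-based analysis of the forms of separable words in Modern Chinese; Ren Haibo, Wang Gang; Linguistic Sciences; 20051130; Vol. 4, No. 6; pp. 75-87 * |
Automatic acquisition of Chinese word collocations; Wang Suge; Journal of Chinese Information Processing; 20061130; Vol. 20, No. 6; pp. 31-37 * |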
Also Published As
Publication number | Publication date |
---|---|
CN105512106A (zh) | 2016-04-20 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
US11475209B2 (en) | Device, system, and method for extracting named entities from sectioned documents | |
CN109416705B (zh) | Utilizing information available in a corpus for data parsing and prediction | |
CN102227724B (zh) | Machine learning for transliteration | |
Gupta et al. | Named entity recognition for Punjabi language text summarization | |
US9195646B2 (en) | Training data generation apparatus, characteristic expression extraction system, training data generation method, and computer-readable storage medium | |
US20130246048A1 (en) | Text proofreading apparatus and text proofreading method | |
CN103678684A (zh) | A Chinese word segmentation method based on navigation information retrieval | |
Hussain et al. | Using linguistic knowledge to classify non-functional requirements in SRS documents | |
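CN102737013A (zh) | Device and method for identifying sentence sentiment based on dependency relations | |
KR20150037924A (ko) | Information classification technique based on product recognition | |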
Siddiqui et al. | Extraction and visualization of the chain of narrators from hadiths using named entity recognition and classification | |
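CN105023028A (zh) | Arabic optical letter recognition method based on HMM and decision trees | |
CN104346326A (zh) | Method and device for determining the emotion features of emotional text | |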
US8880391B2 (en) | Natural language processing apparatus, natural language processing method, natural language processing program, and computer-readable recording medium storing natural language processing program | |
Barriere et al. | May I Check Again?--A simple but efficient way to generate and use contextual dictionaries for Named Entity Recognition. Application to French Legal Texts | |
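CN109086266A (zh) | A method for error detection and proofreading of visually similar characters in text | |
CN109472020B (zh) | A feature-alignment Chinese word segmentation method | |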
Wankhede et al. | Data preprocessing for efficient sentimental analysis | |
Sankaran et al. | Error detection in highly inflectional languages | |
Charoenpornsawat et al. | Automatic sentence break disambiguation for Thai | |
Ghaeini | Intrinsic author identification using modified weighted knn | |
CN105512106B (zh) | A method for automatic recognition of Chinese separable words | |
Sreejith et al. | N-gram based algorithm for distinguishing between Hindi and Sanskrit texts | |
CN105183807A (zh) | Method and system for identifying emotion cause events based on structural syntax | |
Goyal | Named entity recognition for south asian languages |
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
GR01 | Patent grant | |
EE01 | Entry into force of recordation of patent licensing contract | Application publication date: 20160420; Assignee: JIANGSU KEDA HUIFENG SCIENCE AND TECHNOLOGY Co.,Ltd.; Assignor: JIANGSU University OF SCIENCE AND TECHNOLOGY; Contract record no.: X2020980007325; Denomination of invention: A method of Chinese word recognition; Granted publication date: 20180406; License type: Common License; Record date: 20201029 |
EC01 | Cancellation of recordation of patent licensing contract | Assignee: JIANGSU KEDA HUIFENG SCIENCE AND TECHNOLOGY Co.,Ltd.; Assignor: JIANGSU University OF SCIENCE AND TECHNOLOGY; Contract record no.: X2020980007325; Date of cancellation: 20201223 |