CN103049434A - 一种变形词证认系统及证认方法 - Google Patents
一种变形词证认系统及证认方法 Download PDFInfo
- Publication number
- CN103049434A CN103049434A CN2012105378031A CN201210537803A CN103049434A CN 103049434 A CN103049434 A CN 103049434A CN 2012105378031 A CN2012105378031 A CN 2012105378031A CN 201210537803 A CN201210537803 A CN 201210537803A CN 103049434 A CN103049434 A CN 103049434A
- Authority
- CN
- China
- Prior art keywords
- word
- deformed
- words
- module
- original
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 24
- 238000001914 filtration Methods 0.000 claims description 3
- 238000001514 detection method Methods 0.000 abstract description 17
- 238000010586 diagram Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 238000012937 correction Methods 0.000 description 2
- 238000000691 measurement method Methods 0.000 description 2
- 238000003058 natural language processing Methods 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000001915 proofreading effect Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000011524 similarity measure Methods 0.000 description 1
- 239000013598 vector Substances 0.000 description 1
Images
Landscapes
- Machine Translation (AREA)
Abstract
Description
Claims (7)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210537803.1A CN103049434B (zh) | 2012-12-12 | 2012-12-12 | 一种变形词证认系统及证认方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210537803.1A CN103049434B (zh) | 2012-12-12 | 2012-12-12 | 一种变形词证认系统及证认方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103049434A true CN103049434A (zh) | 2013-04-17 |
CN103049434B CN103049434B (zh) | 2016-08-17 |
Family
ID=48062078
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201210537803.1A Active CN103049434B (zh) | 2012-12-12 | 2012-12-12 | 一种变形词证认系统及证认方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103049434B (zh) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104615588A (zh) * | 2014-12-25 | 2015-05-13 | 上海科阅信息技术有限公司 | 一种计算机校验汉语同音错别字的方法 |
CN112001170A (zh) * | 2020-05-29 | 2020-11-27 | 中国人民大学 | 一种识别经过变形的敏感词的方法和系统 |
CN112700764A (zh) * | 2021-03-19 | 2021-04-23 | 北京沃丰时代数据科技有限公司 | 热词语音识别方法、装置、电子设备及存储介质 |
CN117312864A (zh) * | 2023-11-30 | 2023-12-29 | 国家计算机网络与信息安全管理中心 | 基于多模态信息的变形词生成模型的训练方法及装置 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1228566A (zh) * | 1998-03-11 | 1999-09-15 | 英业达股份有限公司 | 不连续短语的匹配翻译装置和方法 |
US20040236566A1 (en) * | 2003-05-20 | 2004-11-25 | Simske Steven J. | System and method for identifying special word usage in a document |
US20060143564A1 (en) * | 2000-12-29 | 2006-06-29 | International Business Machines Corporation | Automated spell analysis |
CN101727440A (zh) * | 2008-10-24 | 2010-06-09 | 北大方正集团有限公司 | 一种敏感词校对的方法及系统 |
-
2012
- 2012-12-12 CN CN201210537803.1A patent/CN103049434B/zh active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1228566A (zh) * | 1998-03-11 | 1999-09-15 | 英业达股份有限公司 | 不连续短语的匹配翻译装置和方法 |
US20060143564A1 (en) * | 2000-12-29 | 2006-06-29 | International Business Machines Corporation | Automated spell analysis |
US20040236566A1 (en) * | 2003-05-20 | 2004-11-25 | Simske Steven J. | System and method for identifying special word usage in a document |
CN101727440A (zh) * | 2008-10-24 | 2010-06-09 | 北大方正集团有限公司 | 一种敏感词校对的方法及系统 |
Non-Patent Citations (1)
Title |
---|
于歌: "搜索引擎中自动分类关键技术研究", 《中国优秀硕士论文全文数据库》 * |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104615588A (zh) * | 2014-12-25 | 2015-05-13 | 上海科阅信息技术有限公司 | 一种计算机校验汉语同音错别字的方法 |
CN104615588B (zh) * | 2014-12-25 | 2019-06-28 | 上海科阅信息技术有限公司 | 一种计算机校验汉语同音错别字的方法 |
CN112001170A (zh) * | 2020-05-29 | 2020-11-27 | 中国人民大学 | 一种识别经过变形的敏感词的方法和系统 |
CN112001170B (zh) * | 2020-05-29 | 2023-05-09 | 中国人民大学 | 一种识别经过变形的敏感词的方法和系统 |
CN112700764A (zh) * | 2021-03-19 | 2021-04-23 | 北京沃丰时代数据科技有限公司 | 热词语音识别方法、装置、电子设备及存储介质 |
CN117312864A (zh) * | 2023-11-30 | 2023-12-29 | 国家计算机网络与信息安全管理中心 | 基于多模态信息的变形词生成模型的训练方法及装置 |
Also Published As
Publication number | Publication date |
---|---|
CN103049434B (zh) | 2016-08-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP5144940B2 (ja) | 目次抽出におけるロバスト性向上 | |
RU2474870C1 (ru) | Способ автоматизированного анализа текстовых документов | |
RU2491622C1 (ru) | Способ классификации документов по категориям | |
Yerra et al. | A sentence-based copy detection approach for web documents | |
CN104850574A (zh) | 一种面向文本信息的敏感词过滤方法 | |
CN102662937A (zh) | 自动翻译系统及其自动翻译方法 | |
CN103049434B (zh) | 一种变形词证认系统及证认方法 | |
Wibowo et al. | Comparison between fingerprint and winnowing algorithm to detect plagiarism fraud on Bahasa Indonesia documents | |
CN105164676A (zh) | 查询特征和问题 | |
CN111985244A (zh) | 一种针对文档内容的洗稿检测方法及装置 | |
Karimzadeh | Performance evaluation measures for toponym resolution | |
KR100788440B1 (ko) | 도용 패턴에 기반한 복사 감지시스템 | |
CN113901783B (zh) | 面向领域的文档查重方法及系统 | |
Uthayamoorthy et al. | Ddspell-a data driven spell checker and suggestion generator for the tamil language | |
CN107871078A (zh) | 非结构化文本中提取漏洞信息的方法 | |
JP2011008784A (ja) | ローマ字変換を用いる日本語自動推薦システムおよび方法 | |
CN113642327A (zh) | 一种标准知识库的构建方法及装置 | |
JP2003281165A (ja) | 文書要約方法及びシステム | |
US11640501B2 (en) | Method and device for verifying the author of a short message | |
CN116542246A (zh) | 基于关键词质检文本的方法、装置和电子设备 | |
Fenogenova et al. | A general method applicable to the search for anglicisms in russian social network texts | |
US20110106849A1 (en) | New case generation device, new case generation method, and new case generation program | |
Zayed et al. | Named entity recognition of persons’ names in Arabic tweets | |
KR20150111587A (ko) | 디비피디아를 활용한 uri 스포팅 시스템 및 방법 | |
KR101634681B1 (ko) | 검사문서 내 인용구문 탐색 방법 및 프로그램 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: System and method for identifying anagrams Effective date of registration: 20161128 Granted publication date: 20160817 Pledgee: Beijing technology intellectual property financing Company limited by guarantee Pledgor: Beijing Hylanda Software Technology Co., Ltd. Registration number: 2016990001028 |
|
PLDC | Enforcement, change and cancellation of contracts on pledge of patent right or utility model | ||
PC01 | Cancellation of the registration of the contract for pledge of patent right | ||
PC01 | Cancellation of the registration of the contract for pledge of patent right |
Date of cancellation: 20180410 Granted publication date: 20160817 Pledgee: Beijing technology intellectual property financing Company limited by guarantee Pledgor: Beijing Hylanda Software Technology Co., Ltd. Registration number: 2016990001028 |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20180806 Address after: Room 301, No. 19, Standard Office Building, Eco-tech Park, No. 2018 Zhongtian Avenue, Zhongtian Eco-city, Tianjin, 300000 (TG 017) Patentee after: Tianjin Haina media big data technology development Co. Ltd. Address before: 100080 Beijing Haidian District West Wudaokou Zijin digital garden 3 building 11 floor 1108 room. Patentee before: Beijing Hylanda Software Technology Co., Ltd. |