RU2725777C1 - Способ и устройство для сопоставления имен - Google Patents

Способ и устройство для сопоставления имен Download PDF

Info

Publication number
RU2725777C1
RU2725777C1 RU2019119526A RU2019119526A RU2725777C1 RU 2725777 C1 RU2725777 C1 RU 2725777C1 RU 2019119526 A RU2019119526 A RU 2019119526A RU 2019119526 A RU2019119526 A RU 2019119526A RU 2725777 C1 RU2725777 C1 RU 2725777C1
Authority
RU
Russia
Prior art keywords
name
matched
names
standard
matching
Prior art date
Application number
RU2019119526A
Other languages
English (en)
Russian (ru)
Inventor
Цинцин СУНЬ
Original Assignee
Алибаба Груп Холдинг Лимитед
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Алибаба Груп Холдинг Лимитед filed Critical Алибаба Груп Холдинг Лимитед
Application granted granted Critical
Publication of RU2725777C1 publication Critical patent/RU2725777C1/ru

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/3332Query translation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/90335Query processing
    • G06F16/90344Query processing by using string matching techniques
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2468Fuzzy queries
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3343Query execution using phonetics
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/247Thesauruses; Synonyms
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Fuzzy Systems (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Probability & Statistics with Applications (AREA)
  • Artificial Intelligence (AREA)
  • Automation & Control Theory (AREA)
  • Acoustics & Sound (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)
  • Alarm Systems (AREA)
  • Stored Programmes (AREA)
RU2019119526A 2016-11-25 2017-11-17 Способ и устройство для сопоставления имен RU2725777C1 (ru)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CN201611055619.8A CN108108373B (zh) 2016-11-25 2016-11-25 一种名称匹配方法及装置
CN201611055619.8 2016-11-25
PCT/CN2017/111604 WO2018095281A1 (zh) 2016-11-25 2017-11-17 一种名称匹配方法及装置

Publications (1)

Publication Number Publication Date
RU2725777C1 true RU2725777C1 (ru) 2020-07-06

Family

ID=62196168

Family Applications (1)

Application Number Title Priority Date Filing Date
RU2019119526A RU2725777C1 (ru) 2016-11-25 2017-11-17 Способ и устройство для сопоставления имен

Country Status (14)

Country Link
US (1) US10726028B2 (https=)
EP (1) EP3547164A4 (https=)
JP (1) JP6860668B2 (https=)
KR (1) KR102151367B1 (https=)
CN (1) CN108108373B (https=)
AU (1) AU2017364745C1 (https=)
BR (1) BR112019010669B1 (https=)
CA (1) CA3044847A1 (https=)
MX (1) MX384762B (https=)
PH (1) PH12019501163B1 (https=)
RU (1) RU2725777C1 (https=)
TW (1) TWI724237B (https=)
WO (1) WO2018095281A1 (https=)
ZA (1) ZA201904091B (https=)

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108962232B (zh) * 2018-07-16 2021-01-01 上海小蚁科技有限公司 语音识别方法及装置、存储介质、终端
CN109408561A (zh) * 2018-10-17 2019-03-01 杭州骑轻尘信息技术有限公司 业务名称匹配方法及装置
CN109189809B (zh) * 2018-10-17 2020-01-03 北京金堤科技有限公司 一种股东名称关联匹配的方法和装置
CN109472029B (zh) * 2018-11-09 2023-04-07 天津开心生活科技有限公司 药品名称处理方法与装置
CN109471960B (zh) * 2018-11-13 2020-10-13 深圳市景旺电子股份有限公司 智能识别pcb资料工具层名的方法及装置
CN109840316A (zh) * 2018-12-21 2019-06-04 上海诺悦智能科技有限公司 一种客户信息制裁名单匹配系统
GB201902772D0 (en) * 2019-03-01 2019-04-17 Palantir Technologies Inc Fuzzy searching 7 applications thereof
CN110909532B (zh) * 2019-10-31 2021-06-11 银联智惠信息服务(上海)有限公司 用户名称匹配方法、装置、计算机设备和存储介质
CN111092758A (zh) * 2019-12-06 2020-05-01 上海上讯信息技术股份有限公司 降低告警及恢复误报的方法、装置及电子设备
US12079282B2 (en) * 2020-03-12 2024-09-03 Oracle International Corporation Name matching engine boosted by machine learning
CN111563139B (zh) * 2020-07-15 2020-10-23 平安国际智慧城市科技股份有限公司 Ocr识别发票药品名的校验方法、装置及计算机设备
CN113268986B (zh) * 2021-05-24 2024-05-24 交通银行股份有限公司 一种基于模糊匹配算法的单位名称匹配、查找方法及装置
US20230039689A1 (en) * 2021-08-05 2023-02-09 Ebay Inc. Automatic Synonyms, Abbreviations, and Acronyms Detection
CN113822049B (zh) * 2021-09-29 2023-08-25 平安银行股份有限公司 基于人工智能的地址审核方法、装置、设备及存储介质
WO2023132029A1 (ja) * 2022-01-06 2023-07-13 日本電気株式会社 情報処理装置、情報処理方法及びプログラム
CN114595379B (zh) * 2022-01-17 2025-09-19 国投智能(厦门)信息股份有限公司 一种数据标准的智能推荐方法及装置
KR102693782B1 (ko) * 2022-05-26 2024-08-08 주식회사 카카오게임즈 닉네임 간 유사도를 이용하여 다중 접속계정을 탐지하기 위한 방법 및 장치
US12282486B2 (en) 2022-04-29 2025-04-22 Oracle International Corporation Address matching from single string to address matching score
CN114880430B (zh) * 2022-05-10 2023-07-18 马上消费金融股份有限公司 名称处理方法及装置
JP2024094499A (ja) * 2022-12-28 2024-07-10 富士通株式会社 対訳コーパス生成プログラム、対訳コーパス生成方法および情報処理装置
CN116244421A (zh) * 2023-03-03 2023-06-09 广联达科技股份有限公司 项目名称匹配的方法、装置、设备及可读存储介质

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040024760A1 (en) * 2002-07-31 2004-02-05 Phonetic Research Ltd. System, method and computer program product for matching textual strings using language-biased normalisation, phonetic representation and correlation functions
US20050084152A1 (en) * 2003-10-16 2005-04-21 Sybase, Inc. System and methodology for name searches
US20080091674A1 (en) * 2006-10-13 2008-04-17 Thomas Bradley Allen Method, apparatus and article for assigning a similarity measure to names
RU2419858C2 (ru) * 2004-10-05 2011-05-27 Майкрософт Корпорейшн Система, способ и интерфейс для обеспечения персонализированного поиска и доступа к информации
US20120016663A1 (en) * 1998-03-25 2012-01-19 International Business Machines Corporation Identifying related names

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7313513B2 (en) * 2002-05-13 2007-12-25 Wordrake Llc Method for editing and enhancing readability of authored documents
US8700568B2 (en) * 2006-02-17 2014-04-15 Google Inc. Entity normalization via name normalization
JP2010519655A (ja) * 2007-02-26 2010-06-03 ベイシス テクノロジー コーポレーション 名前照合システムの名前インデックス付け
CN101727464B (zh) * 2008-10-29 2012-08-08 北京搜狗科技发展有限公司 获取别称匹配对的方法及装置
US20110055234A1 (en) * 2009-09-02 2011-03-03 Nokia Corporation Method and apparatus for combining contact lists
TWI443529B (zh) 2010-04-01 2014-07-01 Inst Information Industry 自動化領域名詞建置方法及系統,及其電腦程式產品
US9424556B2 (en) * 2010-10-14 2016-08-23 Nokia Technologies Oy Method and apparatus for linking multiple contact identifiers of an individual
US8468167B2 (en) * 2010-10-25 2013-06-18 Corelogic, Inc. Automatic data validation and correction
US8364692B1 (en) * 2011-08-11 2013-01-29 International Business Machines Corporation Identifying non-distinct names in a set of names
US9275339B2 (en) * 2012-04-24 2016-03-01 Raytheon Company System and method for probabilistic name matching
US9229926B2 (en) 2012-12-03 2016-01-05 International Business Machines Corporation Determining similarity of unfielded names using feature assignments
CN103167056B (zh) * 2013-01-31 2016-03-02 中国科学院计算机网络信息中心 一种基于自动审核的域名注册方法
CN103970798B (zh) * 2013-02-04 2019-05-28 商业对象软件有限公司 数据的搜索和匹配
US10089302B2 (en) 2013-02-26 2018-10-02 International Business Machines Corporation Native-script and cross-script chinese name matching
CN103177122B (zh) * 2013-04-15 2017-04-26 天津理工大学 一种基于同义词的个人桌面文件搜索方法
CN103425739B (zh) * 2013-07-09 2016-09-14 国云科技股份有限公司 一种字符串匹配方法
US9691075B1 (en) * 2014-03-14 2017-06-27 Wal-Mart Stores, Inc. Name comparison
CN104331475B (zh) * 2014-11-04 2018-03-23 郑州悉知信息科技股份有限公司 一种信息检测方法及装置
US9535903B2 (en) * 2015-04-13 2017-01-03 International Business Machines Corporation Scoring unfielded personal names without prior parsing
CN104765858A (zh) * 2015-04-21 2015-07-08 北京航天长峰科技工业集团有限公司上海分公司 公安用同义词库的构建方法及获得的公安用同义词库
CN104820713B (zh) 2015-05-19 2018-02-27 苏州中炎工业科技有限公司 一种基于用户历史数据获得工业产品名称同义词的方法
CN105843950A (zh) * 2016-04-12 2016-08-10 乐视控股(北京)有限公司 敏感词过滤方法及装置

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120016663A1 (en) * 1998-03-25 2012-01-19 International Business Machines Corporation Identifying related names
US20040024760A1 (en) * 2002-07-31 2004-02-05 Phonetic Research Ltd. System, method and computer program product for matching textual strings using language-biased normalisation, phonetic representation and correlation functions
US20050084152A1 (en) * 2003-10-16 2005-04-21 Sybase, Inc. System and methodology for name searches
RU2419858C2 (ru) * 2004-10-05 2011-05-27 Майкрософт Корпорейшн Система, способ и интерфейс для обеспечения персонализированного поиска и доступа к информации
US20080091674A1 (en) * 2006-10-13 2008-04-17 Thomas Bradley Allen Method, apparatus and article for assigning a similarity measure to names

Also Published As

Publication number Publication date
EP3547164A1 (en) 2019-10-02
KR20190084319A (ko) 2019-07-16
US20190251085A1 (en) 2019-08-15
EP3547164A4 (en) 2019-10-16
JP2020501255A (ja) 2020-01-16
AU2017364745A1 (en) 2019-06-20
BR112019010669A2 (pt) 2019-10-01
MX2019006027A (es) 2019-08-14
ZA201904091B (en) 2021-05-26
TWI724237B (zh) 2021-04-11
PH12019501163A1 (en) 2020-02-24
JP6860668B2 (ja) 2021-04-21
CN108108373B (zh) 2020-09-25
MX384762B (es) 2025-03-14
WO2018095281A1 (zh) 2018-05-31
BR112019010669B1 (pt) 2021-12-07
CN108108373A (zh) 2018-06-01
PH12019501163B1 (en) 2023-10-13
KR102151367B1 (ko) 2020-09-03
CA3044847A1 (en) 2018-05-31
AU2017364745B2 (en) 2020-04-09
AU2017364745C1 (en) 2020-09-10
TW201820179A (zh) 2018-06-01
US10726028B2 (en) 2020-07-28

Similar Documents

Publication Publication Date Title
RU2725777C1 (ru) Способ и устройство для сопоставления имен
CN109388801B (zh) 相似词集合的确定方法、装置和电子设备
EP3554000B1 (en) Validation code based verification method and device
JP2020524314A (ja) 危険アドレス識別方法及び機器、並びに電子装置
US8639496B2 (en) System and method for identifying phrases in text
CN107329964B (zh) 一种文本处理方法及装置
US11341190B2 (en) Name matching using enhanced name keys
US20220215170A1 (en) Framework for chinese text error identification and correction
CN110879832A (zh) 目标文本检测方法、模型训练方法、装置及设备
CN110046621A (zh) 证件识别方法及装置
US20160078072A1 (en) Term variant discernment system and method therefor
US10936814B2 (en) Responsive spell checking for web forms
Rychalska et al. How much should you ask? On the question structure in QA systems.
OA19238A (en) Name matching method and apparatus.
US11704481B1 (en) K-anonymity guarantee in text anonymization using word embeddings
US20240153500A1 (en) Data processing method, apparatus, and device
CN110991173B (zh) 一种分词方法及系统
CN120429398A (zh) 一种数据处理的方法、装置及电子设备
RU2684578C2 (ru) Языконезависимая технология исправления опечаток, с возможностью верификации результата
CN118917414A (zh) 一种任务执行方法、装置、电子设备及存储介质
CN119760049A (zh) 基于自动化幻觉检测与大模型的文本生成方法及装置
CN115222262A (zh) 数据处理方法、装置及设备
HK1247317A1 (zh) 一种文本分析方法及装置
HK1248352A1 (zh) 词向量处理方法、装置以及电子设备
WO2014190714A1 (en) Method and apparatus for word counting

Legal Events

Date Code Title Description
PC41 Official registration of the transfer of exclusive right

Effective date: 20210311

PC41 Official registration of the transfer of exclusive right

Effective date: 20210420