CN108108373B - 一种名称匹配方法及装置 - Google Patents

一种名称匹配方法及装置 Download PDF

Info

Publication number
CN108108373B
CN108108373B CN201611055619.8A CN201611055619A CN108108373B CN 108108373 B CN108108373 B CN 108108373B CN 201611055619 A CN201611055619 A CN 201611055619A CN 108108373 B CN108108373 B CN 108108373B
Authority
CN
China
Prior art keywords
name
matched
matching
names
detection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201611055619.8A
Other languages
English (en)
Chinese (zh)
Other versions
CN108108373A (zh
Inventor
孙清清
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Advanced New Technologies Co Ltd
Advantageous New Technologies Co Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201611055619.8A priority Critical patent/CN108108373B/zh
Priority to TW106131720A priority patent/TWI724237B/zh
Priority to JP2019528581A priority patent/JP6860668B2/ja
Priority to CA3044847A priority patent/CA3044847A1/en
Priority to BR112019010669-3A priority patent/BR112019010669B1/pt
Priority to RU2019119526A priority patent/RU2725777C1/ru
Priority to PCT/CN2017/111604 priority patent/WO2018095281A1/zh
Priority to EP17874581.6A priority patent/EP3547164A4/en
Priority to MX2019006027A priority patent/MX384762B/es
Priority to AU2017364745A priority patent/AU2017364745C1/en
Priority to KR1020197018218A priority patent/KR102151367B1/ko
Publication of CN108108373A publication Critical patent/CN108108373A/zh
Priority to US16/397,792 priority patent/US10726028B2/en
Priority to PH12019501163A priority patent/PH12019501163B1/en
Priority to ZA2019/04091A priority patent/ZA201904091B/en
Application granted granted Critical
Publication of CN108108373B publication Critical patent/CN108108373B/zh
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2468Fuzzy queries
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/3332Query translation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/90335Query processing
    • G06F16/90344Query processing by using string matching techniques
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3343Query execution using phonetics
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/247Thesauruses; Synonyms
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Fuzzy Systems (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Probability & Statistics with Applications (AREA)
  • Artificial Intelligence (AREA)
  • Automation & Control Theory (AREA)
  • Acoustics & Sound (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)
  • Alarm Systems (AREA)
  • Stored Programmes (AREA)
CN201611055619.8A 2016-11-25 2016-11-25 一种名称匹配方法及装置 Expired - Fee Related CN108108373B (zh)

Priority Applications (14)

Application Number Priority Date Filing Date Title
CN201611055619.8A CN108108373B (zh) 2016-11-25 2016-11-25 一种名称匹配方法及装置
TW106131720A TWI724237B (zh) 2016-11-25 2017-09-15 名稱匹配方法及裝置
AU2017364745A AU2017364745C1 (en) 2016-11-25 2017-11-17 Name matching method and apparatus
BR112019010669-3A BR112019010669B1 (pt) 2016-11-25 2017-11-17 Método implementado por computador para a correspondência de nomes, meio de armazenamento legível por computador não transitório e sistema implementado por computador
RU2019119526A RU2725777C1 (ru) 2016-11-25 2017-11-17 Способ и устройство для сопоставления имен
PCT/CN2017/111604 WO2018095281A1 (zh) 2016-11-25 2017-11-17 一种名称匹配方法及装置
EP17874581.6A EP3547164A4 (en) 2016-11-25 2017-11-17 NAME COMPENSATION PROCESS AND DEVICE
MX2019006027A MX384762B (es) 2016-11-25 2017-11-17 Método y aparato para comparar nombres.
JP2019528581A JP6860668B2 (ja) 2016-11-25 2017-11-17 名前マッチング方法および装置
KR1020197018218A KR102151367B1 (ko) 2016-11-25 2017-11-17 이름들을 매칭시키기 위한 방법 및 장치
CA3044847A CA3044847A1 (en) 2016-11-25 2017-11-17 Method and apparatus for matching names
US16/397,792 US10726028B2 (en) 2016-11-25 2019-04-29 Method and apparatus for matching names
PH12019501163A PH12019501163B1 (en) 2016-11-25 2019-05-24 Method and apparatus for matching names
ZA2019/04091A ZA201904091B (en) 2016-11-25 2019-06-24 Name matching method and apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611055619.8A CN108108373B (zh) 2016-11-25 2016-11-25 一种名称匹配方法及装置

Publications (2)

Publication Number Publication Date
CN108108373A CN108108373A (zh) 2018-06-01
CN108108373B true CN108108373B (zh) 2020-09-25

Family

ID=62196168

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611055619.8A Expired - Fee Related CN108108373B (zh) 2016-11-25 2016-11-25 一种名称匹配方法及装置

Country Status (14)

Country Link
US (1) US10726028B2 (https=)
EP (1) EP3547164A4 (https=)
JP (1) JP6860668B2 (https=)
KR (1) KR102151367B1 (https=)
CN (1) CN108108373B (https=)
AU (1) AU2017364745C1 (https=)
BR (1) BR112019010669B1 (https=)
CA (1) CA3044847A1 (https=)
MX (1) MX384762B (https=)
PH (1) PH12019501163B1 (https=)
RU (1) RU2725777C1 (https=)
TW (1) TWI724237B (https=)
WO (1) WO2018095281A1 (https=)
ZA (1) ZA201904091B (https=)

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108962232B (zh) * 2018-07-16 2021-01-01 上海小蚁科技有限公司 语音识别方法及装置、存储介质、终端
CN109408561A (zh) * 2018-10-17 2019-03-01 杭州骑轻尘信息技术有限公司 业务名称匹配方法及装置
CN109189809B (zh) * 2018-10-17 2020-01-03 北京金堤科技有限公司 一种股东名称关联匹配的方法和装置
CN109472029B (zh) * 2018-11-09 2023-04-07 天津开心生活科技有限公司 药品名称处理方法与装置
CN109471960B (zh) * 2018-11-13 2020-10-13 深圳市景旺电子股份有限公司 智能识别pcb资料工具层名的方法及装置
CN109840316A (zh) * 2018-12-21 2019-06-04 上海诺悦智能科技有限公司 一种客户信息制裁名单匹配系统
GB201902772D0 (en) * 2019-03-01 2019-04-17 Palantir Technologies Inc Fuzzy searching 7 applications thereof
CN110909532B (zh) * 2019-10-31 2021-06-11 银联智惠信息服务(上海)有限公司 用户名称匹配方法、装置、计算机设备和存储介质
CN111092758A (zh) * 2019-12-06 2020-05-01 上海上讯信息技术股份有限公司 降低告警及恢复误报的方法、装置及电子设备
US12079282B2 (en) * 2020-03-12 2024-09-03 Oracle International Corporation Name matching engine boosted by machine learning
CN111563139B (zh) * 2020-07-15 2020-10-23 平安国际智慧城市科技股份有限公司 Ocr识别发票药品名的校验方法、装置及计算机设备
CN113268986B (zh) * 2021-05-24 2024-05-24 交通银行股份有限公司 一种基于模糊匹配算法的单位名称匹配、查找方法及装置
US20230039689A1 (en) * 2021-08-05 2023-02-09 Ebay Inc. Automatic Synonyms, Abbreviations, and Acronyms Detection
CN113822049B (zh) * 2021-09-29 2023-08-25 平安银行股份有限公司 基于人工智能的地址审核方法、装置、设备及存储介质
WO2023132029A1 (ja) * 2022-01-06 2023-07-13 日本電気株式会社 情報処理装置、情報処理方法及びプログラム
CN114595379B (zh) * 2022-01-17 2025-09-19 国投智能(厦门)信息股份有限公司 一种数据标准的智能推荐方法及装置
KR102693782B1 (ko) * 2022-05-26 2024-08-08 주식회사 카카오게임즈 닉네임 간 유사도를 이용하여 다중 접속계정을 탐지하기 위한 방법 및 장치
US12282486B2 (en) 2022-04-29 2025-04-22 Oracle International Corporation Address matching from single string to address matching score
CN114880430B (zh) * 2022-05-10 2023-07-18 马上消费金融股份有限公司 名称处理方法及装置
JP2024094499A (ja) * 2022-12-28 2024-07-10 富士通株式会社 対訳コーパス生成プログラム、対訳コーパス生成方法および情報処理装置
CN116244421A (zh) * 2023-03-03 2023-06-09 广联达科技股份有限公司 项目名称匹配的方法、装置、设备及可读存储介质

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103167056A (zh) * 2013-01-31 2013-06-19 中国科学院计算机网络信息中心 一种基于自动审核的域名注册方法
CN103177122A (zh) * 2013-04-15 2013-06-26 天津理工大学 一种基于同义词的个人文件搜索方法

Family Cites Families (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8812300B2 (en) * 1998-03-25 2014-08-19 International Business Machines Corporation Identifying related names
US7313513B2 (en) * 2002-05-13 2007-12-25 Wordrake Llc Method for editing and enhancing readability of authored documents
US20040024760A1 (en) * 2002-07-31 2004-02-05 Phonetic Research Ltd. System, method and computer program product for matching textual strings using language-biased normalisation, phonetic representation and correlation functions
US8423563B2 (en) * 2003-10-16 2013-04-16 Sybase, Inc. System and methodology for name searches
US20060074883A1 (en) 2004-10-05 2006-04-06 Microsoft Corporation Systems, methods, and interfaces for providing personalized search and information access
US8700568B2 (en) * 2006-02-17 2014-04-15 Google Inc. Entity normalization via name normalization
US9026514B2 (en) * 2006-10-13 2015-05-05 International Business Machines Corporation Method, apparatus and article for assigning a similarity measure to names
JP2010519655A (ja) * 2007-02-26 2010-06-03 ベイシス テクノロジー コーポレーション 名前照合システムの名前インデックス付け
CN101727464B (zh) * 2008-10-29 2012-08-08 北京搜狗科技发展有限公司 获取别称匹配对的方法及装置
US20110055234A1 (en) * 2009-09-02 2011-03-03 Nokia Corporation Method and apparatus for combining contact lists
TWI443529B (zh) 2010-04-01 2014-07-01 Inst Information Industry 自動化領域名詞建置方法及系統,及其電腦程式產品
US9424556B2 (en) * 2010-10-14 2016-08-23 Nokia Technologies Oy Method and apparatus for linking multiple contact identifiers of an individual
US8468167B2 (en) * 2010-10-25 2013-06-18 Corelogic, Inc. Automatic data validation and correction
US8364692B1 (en) * 2011-08-11 2013-01-29 International Business Machines Corporation Identifying non-distinct names in a set of names
US9275339B2 (en) * 2012-04-24 2016-03-01 Raytheon Company System and method for probabilistic name matching
US9229926B2 (en) 2012-12-03 2016-01-05 International Business Machines Corporation Determining similarity of unfielded names using feature assignments
CN103970798B (zh) * 2013-02-04 2019-05-28 商业对象软件有限公司 数据的搜索和匹配
US10089302B2 (en) 2013-02-26 2018-10-02 International Business Machines Corporation Native-script and cross-script chinese name matching
CN103425739B (zh) * 2013-07-09 2016-09-14 国云科技股份有限公司 一种字符串匹配方法
US9691075B1 (en) * 2014-03-14 2017-06-27 Wal-Mart Stores, Inc. Name comparison
CN104331475B (zh) * 2014-11-04 2018-03-23 郑州悉知信息科技股份有限公司 一种信息检测方法及装置
US9535903B2 (en) * 2015-04-13 2017-01-03 International Business Machines Corporation Scoring unfielded personal names without prior parsing
CN104765858A (zh) * 2015-04-21 2015-07-08 北京航天长峰科技工业集团有限公司上海分公司 公安用同义词库的构建方法及获得的公安用同义词库
CN104820713B (zh) 2015-05-19 2018-02-27 苏州中炎工业科技有限公司 一种基于用户历史数据获得工业产品名称同义词的方法
CN105843950A (zh) * 2016-04-12 2016-08-10 乐视控股(北京)有限公司 敏感词过滤方法及装置

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103167056A (zh) * 2013-01-31 2013-06-19 中国科学院计算机网络信息中心 一种基于自动审核的域名注册方法
CN103177122A (zh) * 2013-04-15 2013-06-26 天津理工大学 一种基于同义词的个人文件搜索方法

Also Published As

Publication number Publication date
EP3547164A1 (en) 2019-10-02
KR20190084319A (ko) 2019-07-16
US20190251085A1 (en) 2019-08-15
EP3547164A4 (en) 2019-10-16
JP2020501255A (ja) 2020-01-16
AU2017364745A1 (en) 2019-06-20
BR112019010669A2 (pt) 2019-10-01
MX2019006027A (es) 2019-08-14
ZA201904091B (en) 2021-05-26
TWI724237B (zh) 2021-04-11
PH12019501163A1 (en) 2020-02-24
JP6860668B2 (ja) 2021-04-21
MX384762B (es) 2025-03-14
WO2018095281A1 (zh) 2018-05-31
BR112019010669B1 (pt) 2021-12-07
CN108108373A (zh) 2018-06-01
PH12019501163B1 (en) 2023-10-13
KR102151367B1 (ko) 2020-09-03
CA3044847A1 (en) 2018-05-31
AU2017364745B2 (en) 2020-04-09
RU2725777C1 (ru) 2020-07-06
AU2017364745C1 (en) 2020-09-10
TW201820179A (zh) 2018-06-01
US10726028B2 (en) 2020-07-28

Similar Documents

Publication Publication Date Title
CN108108373B (zh) 一种名称匹配方法及装置
TWI685761B (zh) 詞向量處理方法及裝置
US10394956B2 (en) Methods, devices, and systems for constructing intelligent knowledge base
CN107402945B (zh) 词库生成方法及装置、短文本检测方法及装置
JP2020510852A (ja) 音声機能制御方法および装置
CN107784110B (zh) 一种索引建立方法及装置
CN108875743B (zh) 一种文本识别方法及装置
US20180157646A1 (en) Command transformation method and system
CN110032727A (zh) 风险识别方法及装置
CN109101489A (zh) 一种文本自动摘要方法、装置及一种电子设备
CN107329964B (zh) 一种文本处理方法及装置
CN109492401B (zh) 一种内容载体风险检测方法、装置、设备及介质
CN107491484B (zh) 一种数据匹配方法、装置及设备
CN107544753B (zh) 数据处理方法、装置及服务器
CN110046621A (zh) 证件识别方法及装置
WO2024244255A1 (zh) 同义词挖掘
CN110059312A (zh) 短语挖掘方法、装置和电子设备
CN116186231A (zh) 一种回复文本的生成方法、装置、存储介质及电子设备
CN115017905A (zh) 一种模型训练和信息推荐的方法及装置
CN109614082B (zh) 一种针对数据查询脚本的翻译方法、装置及设备
CN115423485B (zh) 数据处理方法、装置及设备
CN115658891B (zh) 一种意图识别的方法、装置、存储介质及电子设备
CN107391591B (zh) 数据处理方法、装置及服务器
CN115222262A (zh) 数据处理方法、装置及设备
WO2025139937A1 (zh) 一种关键词的扩展方法、装置和存储介质

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20201016

Address after: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman Islands

Patentee after: Innovative advanced technology Co.,Ltd.

Address before: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman Islands

Patentee before: Advanced innovation technology Co.,Ltd.

Effective date of registration: 20201016

Address after: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman Islands

Patentee after: Advanced innovation technology Co.,Ltd.

Address before: A four-storey 847 mailbox in Grand Cayman Capital Building, British Cayman Islands

Patentee before: Alibaba Group Holding Ltd.

CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20200925