TWI724237B - 名稱匹配方法及裝置 - Google Patents

名稱匹配方法及裝置 Download PDF

Info

Publication number
TWI724237B
TWI724237B TW106131720A TW106131720A TWI724237B TW I724237 B TWI724237 B TW I724237B TW 106131720 A TW106131720 A TW 106131720A TW 106131720 A TW106131720 A TW 106131720A TW I724237 B TWI724237 B TW I724237B
Authority
TW
Taiwan
Prior art keywords
name
matched
matching
standard
detection
Prior art date
Application number
TW106131720A
Other languages
English (en)
Chinese (zh)
Other versions
TW201820179A (zh
Inventor
孫清清
Original Assignee
開曼群島商創新先進技術有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 開曼群島商創新先進技術有限公司 filed Critical 開曼群島商創新先進技術有限公司
Publication of TW201820179A publication Critical patent/TW201820179A/zh
Application granted granted Critical
Publication of TWI724237B publication Critical patent/TWI724237B/zh

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2468Fuzzy queries
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/3332Query translation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3343Query execution using phonetics
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/90335Query processing
    • G06F16/90344Query processing by using string matching techniques
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/247Thesauruses; Synonyms
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Software Systems (AREA)
  • Fuzzy Systems (AREA)
  • Mathematical Physics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Probability & Statistics with Applications (AREA)
  • Artificial Intelligence (AREA)
  • Automation & Control Theory (AREA)
  • Acoustics & Sound (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)
  • Alarm Systems (AREA)
  • Stored Programmes (AREA)
TW106131720A 2016-11-25 2017-09-15 名稱匹配方法及裝置 TWI724237B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CN201611055619.8A CN108108373B (zh) 2016-11-25 2016-11-25 一种名称匹配方法及装置
CN201611055619.8 2016-11-25
??201611055619.8 2016-11-25

Publications (2)

Publication Number Publication Date
TW201820179A TW201820179A (zh) 2018-06-01
TWI724237B true TWI724237B (zh) 2021-04-11

Family

ID=62196168

Family Applications (1)

Application Number Title Priority Date Filing Date
TW106131720A TWI724237B (zh) 2016-11-25 2017-09-15 名稱匹配方法及裝置

Country Status (14)

Country Link
US (1) US10726028B2 (enExample)
EP (1) EP3547164A4 (enExample)
JP (1) JP6860668B2 (enExample)
KR (1) KR102151367B1 (enExample)
CN (1) CN108108373B (enExample)
AU (1) AU2017364745C1 (enExample)
BR (1) BR112019010669B1 (enExample)
CA (1) CA3044847A1 (enExample)
MX (1) MX384762B (enExample)
PH (1) PH12019501163B1 (enExample)
RU (1) RU2725777C1 (enExample)
TW (1) TWI724237B (enExample)
WO (1) WO2018095281A1 (enExample)
ZA (1) ZA201904091B (enExample)

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108962232B (zh) * 2018-07-16 2021-01-01 上海小蚁科技有限公司 语音识别方法及装置、存储介质、终端
CN109189809B (zh) * 2018-10-17 2020-01-03 北京金堤科技有限公司 一种股东名称关联匹配的方法和装置
CN109408561A (zh) * 2018-10-17 2019-03-01 杭州骑轻尘信息技术有限公司 业务名称匹配方法及装置
CN109472029B (zh) * 2018-11-09 2023-04-07 天津开心生活科技有限公司 药品名称处理方法与装置
CN109471960B (zh) * 2018-11-13 2020-10-13 深圳市景旺电子股份有限公司 智能识别pcb资料工具层名的方法及装置
CN109840316A (zh) * 2018-12-21 2019-06-04 上海诺悦智能科技有限公司 一种客户信息制裁名单匹配系统
GB201902772D0 (en) 2019-03-01 2019-04-17 Palantir Technologies Inc Fuzzy searching 7 applications thereof
CN110909532B (zh) * 2019-10-31 2021-06-11 银联智惠信息服务(上海)有限公司 用户名称匹配方法、装置、计算机设备和存储介质
CN111092758A (zh) * 2019-12-06 2020-05-01 上海上讯信息技术股份有限公司 降低告警及恢复误报的方法、装置及电子设备
US12079282B2 (en) * 2020-03-12 2024-09-03 Oracle International Corporation Name matching engine boosted by machine learning
CN111563139B (zh) * 2020-07-15 2020-10-23 平安国际智慧城市科技股份有限公司 Ocr识别发票药品名的校验方法、装置及计算机设备
CN113268986B (zh) * 2021-05-24 2024-05-24 交通银行股份有限公司 一种基于模糊匹配算法的单位名称匹配、查找方法及装置
US20230039689A1 (en) * 2021-08-05 2023-02-09 Ebay Inc. Automatic Synonyms, Abbreviations, and Acronyms Detection
CN113822049B (zh) * 2021-09-29 2023-08-25 平安银行股份有限公司 基于人工智能的地址审核方法、装置、设备及存储介质
WO2023132029A1 (ja) * 2022-01-06 2023-07-13 日本電気株式会社 情報処理装置、情報処理方法及びプログラム
CN114595379B (zh) * 2022-01-17 2025-09-19 国投智能(厦门)信息股份有限公司 一种数据标准的智能推荐方法及装置
KR102693782B1 (ko) * 2022-05-26 2024-08-08 주식회사 카카오게임즈 닉네임 간 유사도를 이용하여 다중 접속계정을 탐지하기 위한 방법 및 장치
US12282486B2 (en) 2022-04-29 2025-04-22 Oracle International Corporation Address matching from single string to address matching score
CN114880430B (zh) * 2022-05-10 2023-07-18 马上消费金融股份有限公司 名称处理方法及装置
CN116244421A (zh) * 2023-03-03 2023-06-09 广联达科技股份有限公司 项目名称匹配的方法、装置、设备及可读存储介质

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040024760A1 (en) * 2002-07-31 2004-02-05 Phonetic Research Ltd. System, method and computer program product for matching textual strings using language-biased normalisation, phonetic representation and correlation functions
CN101727464A (zh) * 2008-10-29 2010-06-09 北京搜狗科技发展有限公司 获取别称匹配对的方法及装置
US20130282645A1 (en) * 2012-04-24 2013-10-24 Raytheon Company System and method for probabilistic name matching

Family Cites Families (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8812300B2 (en) * 1998-03-25 2014-08-19 International Business Machines Corporation Identifying related names
US7313513B2 (en) * 2002-05-13 2007-12-25 Wordrake Llc Method for editing and enhancing readability of authored documents
US8423563B2 (en) * 2003-10-16 2013-04-16 Sybase, Inc. System and methodology for name searches
US20060074883A1 (en) * 2004-10-05 2006-04-06 Microsoft Corporation Systems, methods, and interfaces for providing personalized search and information access
US8700568B2 (en) * 2006-02-17 2014-04-15 Google Inc. Entity normalization via name normalization
US9026514B2 (en) * 2006-10-13 2015-05-05 International Business Machines Corporation Method, apparatus and article for assigning a similarity measure to names
JP2010519655A (ja) * 2007-02-26 2010-06-03 ベイシス テクノロジー コーポレーション 名前照合システムの名前インデックス付け
US20110055234A1 (en) * 2009-09-02 2011-03-03 Nokia Corporation Method and apparatus for combining contact lists
TWI443529B (zh) 2010-04-01 2014-07-01 Inst Information Industry 自動化領域名詞建置方法及系統,及其電腦程式產品
US9424556B2 (en) * 2010-10-14 2016-08-23 Nokia Technologies Oy Method and apparatus for linking multiple contact identifiers of an individual
US8468167B2 (en) * 2010-10-25 2013-06-18 Corelogic, Inc. Automatic data validation and correction
US8364692B1 (en) * 2011-08-11 2013-01-29 International Business Machines Corporation Identifying non-distinct names in a set of names
US9229926B2 (en) 2012-12-03 2016-01-05 International Business Machines Corporation Determining similarity of unfielded names using feature assignments
CN103167056B (zh) * 2013-01-31 2016-03-02 中国科学院计算机网络信息中心 一种基于自动审核的域名注册方法
CN103970798B (zh) * 2013-02-04 2019-05-28 商业对象软件有限公司 数据的搜索和匹配
US10089302B2 (en) 2013-02-26 2018-10-02 International Business Machines Corporation Native-script and cross-script chinese name matching
CN103177122B (zh) * 2013-04-15 2017-04-26 天津理工大学 一种基于同义词的个人桌面文件搜索方法
CN103425739B (zh) * 2013-07-09 2016-09-14 国云科技股份有限公司 一种字符串匹配方法
US9691075B1 (en) * 2014-03-14 2017-06-27 Wal-Mart Stores, Inc. Name comparison
CN104331475B (zh) * 2014-11-04 2018-03-23 郑州悉知信息科技股份有限公司 一种信息检测方法及装置
US9535903B2 (en) * 2015-04-13 2017-01-03 International Business Machines Corporation Scoring unfielded personal names without prior parsing
CN104765858A (zh) * 2015-04-21 2015-07-08 北京航天长峰科技工业集团有限公司上海分公司 公安用同义词库的构建方法及获得的公安用同义词库
CN104820713B (zh) 2015-05-19 2018-02-27 苏州中炎工业科技有限公司 一种基于用户历史数据获得工业产品名称同义词的方法
CN105843950A (zh) * 2016-04-12 2016-08-10 乐视控股(北京)有限公司 敏感词过滤方法及装置

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040024760A1 (en) * 2002-07-31 2004-02-05 Phonetic Research Ltd. System, method and computer program product for matching textual strings using language-biased normalisation, phonetic representation and correlation functions
CN101727464A (zh) * 2008-10-29 2010-06-09 北京搜狗科技发展有限公司 获取别称匹配对的方法及装置
CN101727464B (zh) 2008-10-29 2012-08-08 北京搜狗科技发展有限公司 获取别称匹配对的方法及装置
US20130282645A1 (en) * 2012-04-24 2013-10-24 Raytheon Company System and method for probabilistic name matching

Also Published As

Publication number Publication date
MX2019006027A (es) 2019-08-14
BR112019010669A2 (pt) 2019-10-01
EP3547164A4 (en) 2019-10-16
ZA201904091B (en) 2021-05-26
KR102151367B1 (ko) 2020-09-03
AU2017364745B2 (en) 2020-04-09
BR112019010669B1 (pt) 2021-12-07
AU2017364745A1 (en) 2019-06-20
JP2020501255A (ja) 2020-01-16
US10726028B2 (en) 2020-07-28
PH12019501163A1 (en) 2020-02-24
CA3044847A1 (en) 2018-05-31
JP6860668B2 (ja) 2021-04-21
AU2017364745C1 (en) 2020-09-10
MX384762B (es) 2025-03-14
RU2725777C1 (ru) 2020-07-06
PH12019501163B1 (en) 2023-10-13
KR20190084319A (ko) 2019-07-16
US20190251085A1 (en) 2019-08-15
WO2018095281A1 (zh) 2018-05-31
TW201820179A (zh) 2018-06-01
EP3547164A1 (en) 2019-10-02
CN108108373B (zh) 2020-09-25
CN108108373A (zh) 2018-06-01

Similar Documents

Publication Publication Date Title
TWI724237B (zh) 名稱匹配方法及裝置
CN109388801B (zh) 相似词集合的确定方法、装置和电子设备
TWI685761B (zh) 詞向量處理方法及裝置
US10394956B2 (en) Methods, devices, and systems for constructing intelligent knowledge base
US20210326357A1 (en) Data processing methods, apparatuses, and devices
WO2019154162A1 (zh) 一种风控规则生成方法和装置
CN107784110B (zh) 一种索引建立方法及装置
WO2017063538A1 (zh) 挖掘相关词的方法、搜索方法、搜索系统
US10970339B2 (en) Generating a knowledge graph using a search index
US20180157646A1 (en) Command transformation method and system
WO2021143299A1 (zh) 语义纠错方法、电子设备及存储介质
US9110986B2 (en) System and method for using a combination of semantic and statistical processing of input strings or other data content
US20180173694A1 (en) Methods and computer systems for named entity verification, named entity verification model training, and phrase expansion
CN107402945B (zh) 词库生成方法及装置、短文本检测方法及装置
CN110276009B (zh) 一种联想词的推荐方法、装置、电子设备及存储介质
CN110427492B (zh) 生成关键词库的方法、装置和电子设备
US20150039290A1 (en) Knowledge-rich automatic term disambiguation
CN107329964B (zh) 一种文本处理方法及装置
WO2024244255A1 (zh) 同义词挖掘
CN109190115B (zh) 一种文本匹配方法、装置、服务器及存储介质
CN115658891B (zh) 一种意图识别的方法、装置、存储介质及电子设备
Kang An Effect of Semantic Relatedness on Entity Disambiguation: Using Korean Wikipedia
OA19238A (en) Name matching method and apparatus.
CN104809192A (zh) 提取输入法候选项的方法以及装置
WO2025139937A1 (zh) 一种关键词的扩展方法、装置和存储介质