MX2016005489A - Metodo y aparato para determinar similitud y terminal. - Google Patents
Metodo y aparato para determinar similitud y terminal.Info
- Publication number
- MX2016005489A MX2016005489A MX2016005489A MX2016005489A MX2016005489A MX 2016005489 A MX2016005489 A MX 2016005489A MX 2016005489 A MX2016005489 A MX 2016005489A MX 2016005489 A MX2016005489 A MX 2016005489A MX 2016005489 A MX2016005489 A MX 2016005489A
- Authority
- MX
- Mexico
- Prior art keywords
- string
- sequence
- similarity
- editing distance
- determination method
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/903—Querying
- G06F16/90335—Query processing
- G06F16/90344—Query processing by using string matching techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/194—Calculation of difference between files
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/26—Techniques for post-processing, e.g. correcting the recognition result
- G06V30/262—Techniques for post-processing, e.g. correcting the recognition result using context analysis, e.g. lexical, syntactic or semantic context
- G06V30/274—Syntactic or semantic context, e.g. balancing
Abstract
La presente divulgación se refiere a un método y un aparato para determinar la similitud y una terminal, y pertenece al campo del procesamiento del lenguaje natural; el método incluye: ejecutar segmentación de palabras en una primera secuencia de caracteres y una segunda secuencia de caracteres, respectivamente, para obtener una primera secuencia y una segunda secuencia, cada una incluyendo al menos un palabra; determinar una distancia de edición entre la primera secuencia de caracteres y la segunda secuencia de caracteres de acuerdo con un algoritmo de distancia de edición predefinido, la primera secuencia y la segunda secuencia; y determinar una similitud entre la primera secuencia de caracteres y la segunda secuencia de caracteres de acuerdo con la distancia de edición y la información sobre operaciones para convertir la primera secuencia en la segunda secuencia; la segmentación de palabras se ejecuta en la primera secuencia de caracteres y la segunda secuencia de caracteres para obtener una primera secuencia y una segunda secuencia respectivamente, de esta manera, la distancia de edición se determina con base en palabras en la secuencia de caracteres en lugar de caracteres en la secuencia de caracteres; además, cada palabra en la secuencia de caracteres puede incluir al menos un carácter, de manera que se determina una similitud de acuerdo con la distancia de edición en combinación con una correlación entre caracteres en la secuencia de caracteres, permitiendo que la similitud determinada sea más precisa.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510882468.2A CN105446957B (zh) | 2015-12-03 | 2015-12-03 | 相似性确定方法、装置及终端 |
PCT/CN2015/099523 WO2017092122A1 (zh) | 2015-12-03 | 2015-12-29 | 相似性确定方法、装置及终端 |
Publications (2)
Publication Number | Publication Date |
---|---|
MX2016005489A true MX2016005489A (es) | 2017-11-30 |
MX365897B MX365897B (es) | 2019-06-19 |
Family
ID=55557172
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
MX2016005489A MX365897B (es) | 2015-12-03 | 2015-12-29 | Método y aparato para determinar similitud y terminal. |
Country Status (8)
Country | Link |
---|---|
US (1) | US10089301B2 (es) |
EP (1) | EP3179379A1 (es) |
JP (1) | JP6321306B2 (es) |
KR (1) | KR101782923B1 (es) |
CN (1) | CN105446957B (es) |
MX (1) | MX365897B (es) |
RU (1) | RU2664002C2 (es) |
WO (1) | WO2017092122A1 (es) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10296788B1 (en) * | 2016-12-19 | 2019-05-21 | Matrox Electronic Systems Ltd. | Method and system for processing candidate strings detected in an image to identify a match of a model string in the image |
US10853457B2 (en) * | 2018-02-06 | 2020-12-01 | Didi Research America, Llc | System and method for program security protection |
US10515149B2 (en) | 2018-03-30 | 2019-12-24 | BlackBoiler, LLC | Method and system for suggesting revisions to an electronic document |
WO2020061910A1 (zh) * | 2018-09-27 | 2020-04-02 | 北京字节跳动网络技术有限公司 | 用于生成信息的方法和装置 |
SG10201904554TA (en) * | 2019-05-21 | 2019-09-27 | Alibaba Group Holding Ltd | Methods and devices for quantifying text similarity |
CN110750615B (zh) * | 2019-09-30 | 2020-07-24 | 贝壳找房(北京)科技有限公司 | 文本重复性判定方法和装置、电子设备和存储介质 |
CN110909161B (zh) * | 2019-11-12 | 2022-04-08 | 西安电子科技大学 | 基于密度聚类和视觉相似度的英文单词分类方法 |
CN111352549B (zh) * | 2020-02-25 | 2022-01-07 | 腾讯科技(深圳)有限公司 | 一种数据对象展示方法、装置、设备及存储介质 |
US11776529B2 (en) * | 2020-04-28 | 2023-10-03 | Samsung Electronics Co., Ltd. | Method and apparatus with speech processing |
KR20210132855A (ko) * | 2020-04-28 | 2021-11-05 | 삼성전자주식회사 | 음성 처리 방법 및 장치 |
CN111967270B (zh) * | 2020-08-16 | 2023-11-21 | 云知声智能科技股份有限公司 | 一种基于字符与语义融合的方法和设备 |
CA3203926A1 (en) | 2021-01-04 | 2022-07-07 | Liam Roshan Dunan EMMART | Editing parameters |
CN112597313B (zh) * | 2021-03-03 | 2021-06-29 | 北京沃丰时代数据科技有限公司 | 短文本聚类方法、装置、电子设备及存储介质 |
KR102517661B1 (ko) * | 2022-07-15 | 2023-04-04 | 주식회사 액션파워 | 텍스트 정보에서 타겟 단어에 대응하는 단어를 식별하는 방법 |
CN116564414B (zh) * | 2023-07-07 | 2024-03-26 | 腾讯科技(深圳)有限公司 | 分子序列的比对方法、装置、电子设备、存储介质及产品 |
Family Cites Families (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5757959A (en) * | 1995-04-05 | 1998-05-26 | Panasonic Technologies, Inc. | System and method for handwriting matching using edit distance computation in a systolic array processor |
NO983175L (no) | 1998-07-10 | 2000-01-11 | Fast Search & Transfer Asa | Soekesystem for gjenfinning av data |
JP2001291060A (ja) * | 2000-04-04 | 2001-10-19 | Toshiba Corp | 単語列照合装置および単語列照合方法 |
US7107204B1 (en) * | 2000-04-24 | 2006-09-12 | Microsoft Corporation | Computer-aided writing system and method with cross-language writing wizard |
US6810376B1 (en) * | 2000-07-11 | 2004-10-26 | Nusuara Technologies Sdn Bhd | System and methods for determining semantic similarity of sentences |
US7734565B2 (en) * | 2003-01-18 | 2010-06-08 | Yahoo! Inc. | Query string matching method and apparatus |
EP1668541A1 (en) * | 2003-09-30 | 2006-06-14 | British Telecommunications Public Limited Company | Information retrieval |
JP2005352888A (ja) * | 2004-06-11 | 2005-12-22 | Hitachi Ltd | 表記揺れ対応辞書作成システム |
US8077984B2 (en) * | 2008-01-04 | 2011-12-13 | Xerox Corporation | Method for computing similarity between text spans using factored word sequence kernels |
US8775441B2 (en) | 2008-01-16 | 2014-07-08 | Ab Initio Technology Llc | Managing an archive for approximate string matching |
US8812493B2 (en) * | 2008-04-11 | 2014-08-19 | Microsoft Corporation | Search results ranking using editing distance and document information |
US8170969B2 (en) * | 2008-08-13 | 2012-05-01 | Siemens Aktiengesellschaft | Automated computation of semantic similarity of pairs of named entity phrases using electronic document corpora as background knowledge |
US8219583B2 (en) * | 2008-11-10 | 2012-07-10 | Nbcuniversal Media, Llc | Methods and systems for mining websites |
US8290989B2 (en) * | 2008-11-12 | 2012-10-16 | Sap Ag | Data model optimization |
CN101751430A (zh) * | 2008-12-12 | 2010-06-23 | 汉王科技股份有限公司 | 电子词典模糊检索方法 |
CN101561813B (zh) * | 2009-05-27 | 2010-09-29 | 东北大学 | 一种Web环境下的字符串相似度的分析方法 |
CN101957828B (zh) | 2009-07-20 | 2013-03-06 | 阿里巴巴集团控股有限公司 | 一种对搜索结果进行排序的方法和装置 |
CN102622338B (zh) * | 2012-02-24 | 2014-02-26 | 北京工业大学 | 一种短文本间语义距离的计算机辅助计算方法 |
DE112013006764T5 (de) * | 2013-03-04 | 2015-11-19 | Mitsubishi Electric Corporation | Suchvorrichtung |
CN103399907A (zh) * | 2013-07-31 | 2013-11-20 | 深圳市华傲数据技术有限公司 | 一种基于编辑距离计算中文字符串相似度的方法及装置 |
US20150051896A1 (en) * | 2013-08-14 | 2015-02-19 | National Research Council Of Canada | Method and apparatus to construct program for assisting in reviewing |
JP6143638B2 (ja) * | 2013-10-17 | 2017-06-07 | 株式会社日立ソリューションズ東日本 | データ処理装置およびデータ処理方法 |
US9430463B2 (en) * | 2014-05-30 | 2016-08-30 | Apple Inc. | Exemplar-based natural language processing |
US9672206B2 (en) * | 2015-06-01 | 2017-06-06 | Information Extraction Systems, Inc. | Apparatus, system and method for application-specific and customizable semantic similarity measurement |
-
2015
- 2015-12-03 CN CN201510882468.2A patent/CN105446957B/zh active Active
- 2015-12-29 KR KR1020167006741A patent/KR101782923B1/ko active IP Right Grant
- 2015-12-29 JP JP2017553299A patent/JP6321306B2/ja active Active
- 2015-12-29 MX MX2016005489A patent/MX365897B/es active IP Right Grant
- 2015-12-29 RU RU2016118758A patent/RU2664002C2/ru active
- 2015-12-29 WO PCT/CN2015/099523 patent/WO2017092122A1/zh active Application Filing
-
2016
- 2016-09-26 EP EP16190672.2A patent/EP3179379A1/en not_active Ceased
- 2016-11-10 US US15/348,697 patent/US10089301B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN105446957B (zh) | 2018-07-20 |
KR101782923B1 (ko) | 2017-09-28 |
JP2018501597A (ja) | 2018-01-18 |
EP3179379A1 (en) | 2017-06-14 |
JP6321306B2 (ja) | 2018-05-09 |
US20170161260A1 (en) | 2017-06-08 |
US10089301B2 (en) | 2018-10-02 |
WO2017092122A1 (zh) | 2017-06-08 |
RU2664002C2 (ru) | 2018-08-14 |
CN105446957A (zh) | 2016-03-30 |
RU2016118758A (ru) | 2017-11-20 |
MX365897B (es) | 2019-06-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
MX365897B (es) | Método y aparato para determinar similitud y terminal. | |
CO2017007032A2 (es) | Actualización de modelos de clasificador de entendimiento de lenguaje para un asistente digital personal basándose en externalización masiva | |
MY195917A (en) | Blockchain-Based Data Processing Method And Device | |
WO2018038385A3 (ko) | 음성 인식 방법 및 이를 수행하는 전자 장치 | |
MX2018008104A (es) | Identificacion de entidades utilizando un modelo de aprendizaje profundo. | |
EP4242892A3 (en) | Code pointer authentication for hardware flow control | |
EP3153978A1 (en) | Address search method and device | |
WO2014210548A3 (en) | Extracting card data using card art | |
SG11201803636RA (en) | Service processing method and apparatus | |
AU2017408800A1 (en) | Method and system of mining information, electronic device and readable storable medium | |
WO2013181116A3 (en) | Method and apparatus of recommending candidate terms based on geographical location | |
MX2016005225A (es) | Metodo y aparato de reconocimiento de huellas dactilares. | |
MY182121A (en) | Electronic payment service processing method and device, and electronic payment method and device | |
MX2016003768A (es) | Metodo y dispositivo para conectar equipo externo. | |
MY185366A (en) | Audio information processing method and device | |
AU2014205024A8 (en) | Methods and apparatus for identifying concepts corresponding to input information | |
MX2017015383A (es) | Sistema y metodo para la oferta de paquetes de funcionalidades con base en un analisis de sitios web editados y sus usos. | |
GB2559709A (en) | Translation of natural language into user interface actions | |
PH12018501577A1 (en) | Risk control method and device | |
US20170154056A1 (en) | Matching image searching method, image searching method and devices | |
SG10201907046QA (en) | Method and apparatus for assigning device fingerprints to internet devices | |
SG10201901587VA (en) | Application testing | |
PH12019500429A1 (en) | Verification method and device | |
SG10201907393WA (en) | Position information providing method and device | |
SG11201909119YA (en) | Search method and apparatus and non-temporary computer-readable storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FG | Grant or registration |