CN104035949B - 一种基于局部敏感哈希改进算法的相似性数据检索方法 - Google Patents
一种基于局部敏感哈希改进算法的相似性数据检索方法 Download PDFInfo
- Publication number
- CN104035949B CN104035949B CN201310664350.3A CN201310664350A CN104035949B CN 104035949 B CN104035949 B CN 104035949B CN 201310664350 A CN201310664350 A CN 201310664350A CN 104035949 B CN104035949 B CN 104035949B
- Authority
- CN
- China
- Prior art keywords
- hash
- data
- hash table
- function
- similarity
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/903—Querying
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/901—Indexing; Data structures therefor; Storage structures
- G06F16/9014—Indexing; Data structures therefor; Storage structures hash tables
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Computational Linguistics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (3)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310664350.3A CN104035949B (zh) | 2013-12-10 | 2013-12-10 | 一种基于局部敏感哈希改进算法的相似性数据检索方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310664350.3A CN104035949B (zh) | 2013-12-10 | 2013-12-10 | 一种基于局部敏感哈希改进算法的相似性数据检索方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104035949A CN104035949A (zh) | 2014-09-10 |
CN104035949B true CN104035949B (zh) | 2017-05-10 |
Family
ID=51466720
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310664350.3A Expired - Fee Related CN104035949B (zh) | 2013-12-10 | 2013-12-10 | 一种基于局部敏感哈希改进算法的相似性数据检索方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104035949B (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2019165546A1 (en) * | 2018-03-01 | 2019-09-06 | Huawei Technologies Canada Co., Ltd. | Layered locality sensitive hashing (lsh) partition indexing for big data applications |
Families Citing this family (35)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104516946B (zh) * | 2014-11-27 | 2017-08-04 | 宁波大学 | 一种基于高维数据过滤器的近似成员查询方法 |
CN105989078B (zh) * | 2015-02-11 | 2019-05-07 | 烟台中科网络技术研究所 | 一种结构化对等网络构建索引的方法、检索方法、装置及系统 |
CN104731882B (zh) * | 2015-03-11 | 2018-05-25 | 北京航空航天大学 | 一种基于哈希编码加权排序的自适应查询方法 |
CN104778234A (zh) * | 2015-03-31 | 2015-07-15 | 南京邮电大学 | 基于局部敏感哈希技术的多标记文件近邻查询方法 |
CN104866471B (zh) * | 2015-06-05 | 2017-09-19 | 南开大学 | 一种基于局部敏感哈希策略的实例匹配方法 |
CN105095435A (zh) * | 2015-07-23 | 2015-11-25 | 北京京东尚科信息技术有限公司 | 一种图像高维特征的相似比较方法及装置 |
CN105183792B (zh) * | 2015-08-21 | 2017-05-24 | 东南大学 | 一种基于局部敏感哈希的分布式快速文本分类方法 |
CN110175258B (zh) * | 2016-02-05 | 2024-01-23 | 大连大学 | 建立基于位置敏感哈希索引的移动感知数据查询方法 |
CN105760469B (zh) * | 2016-02-05 | 2019-05-31 | 大连大学 | 云计算环境下基于倒排lsh的高维近似图象检索方法 |
US10778707B1 (en) | 2016-05-12 | 2020-09-15 | Amazon Technologies, Inc. | Outlier detection for streaming data using locality sensitive hashing |
CN106570141B (zh) * | 2016-11-04 | 2020-05-19 | 中国科学院自动化研究所 | 近似重复图像检测方法 |
US10496706B2 (en) | 2017-04-17 | 2019-12-03 | International Business Machines Corporation | Matching strings in a large relational database |
CN107515937B (zh) * | 2017-08-29 | 2020-10-27 | 千寻位置网络有限公司 | 差分账户的归类方法及系统、服务终端、存储器 |
CN107656989B (zh) * | 2017-09-13 | 2019-09-13 | 华中科技大学 | 云存储系统中基于数据分布感知的近邻查询方法 |
CN109697641A (zh) * | 2017-10-20 | 2019-04-30 | 北京京东尚科信息技术有限公司 | 计算商品相似度的方法和装置 |
CN107729557A (zh) * | 2017-11-08 | 2018-02-23 | 北京大学 | 一种编目信息的分类、检索方法和装置 |
US10949467B2 (en) * | 2018-03-01 | 2021-03-16 | Huawei Technologies Canada Co., Ltd. | Random draw forest index structure for searching large scale unstructured data |
CN108959441A (zh) * | 2018-06-13 | 2018-12-07 | 新华智云科技有限公司 | 一种基于局部敏感哈希的近相似快速查找方法 |
CN109189964A (zh) * | 2018-07-20 | 2019-01-11 | 杭州电子科技大学 | 基于局部敏感哈希索引和图像路标的场景识别方法 |
CN109213874A (zh) * | 2018-08-30 | 2019-01-15 | 福建师范大学 | 一种wmsn区块链的多媒体混合数据近似近邻二元查询方法 |
CN110889422A (zh) * | 2018-09-10 | 2020-03-17 | 百度在线网络技术(北京)有限公司 | 同行车辆的判断方法、装置、设备及计算机可读介质 |
CN109445703B (zh) * | 2018-10-26 | 2019-10-25 | 黄淮学院 | 一种基于块级数据去重的Delta压缩存储组件 |
CN111294728A (zh) * | 2018-12-06 | 2020-06-16 | 西安光启未来技术研究院 | 同行分析方法及装置 |
CN109766341B (zh) * | 2018-12-27 | 2022-04-22 | 厦门市美亚柏科信息股份有限公司 | 一种建立哈希映射的方法、装置、存储介质 |
CN110134714B (zh) * | 2019-05-22 | 2021-04-20 | 东北大学 | 适用于大数据迭代计算的分布式计算框架缓存索引方法 |
WO2020252639A1 (zh) * | 2019-06-17 | 2020-12-24 | 深圳市欢太科技有限公司 | 内容推送方法及相关产品 |
CN110543622A (zh) * | 2019-08-02 | 2019-12-06 | 北京三快在线科技有限公司 | 文本相似度检测方法、装置、电子设备及可读存储介质 |
CN110502629B (zh) * | 2019-08-27 | 2020-09-11 | 桂林电子科技大学 | 一种基于lsh的过滤验证字符串相似性连接方法 |
CN110795469B (zh) * | 2019-10-11 | 2022-02-22 | 安徽工业大学 | 基于Spark的高维序列数据相似性查询方法及系统 |
CN111241106B (zh) * | 2020-01-15 | 2023-08-29 | 平安科技(深圳)有限公司 | 近似数据处理方法、装置、介质及电子设备 |
CN111352834B (zh) * | 2020-02-25 | 2023-06-09 | 江苏大学 | 一种基于局部敏感哈希的自适应随机测试方法 |
CN112559170B (zh) * | 2020-11-30 | 2022-09-20 | 河海大学 | 一种边缘计算环境下缓存数据的近似匹配方法 |
CN112699676B (zh) * | 2020-12-31 | 2024-04-12 | 中国农业银行股份有限公司 | 一种地址相似关系生成方法及装置 |
CN113515450A (zh) * | 2021-05-20 | 2021-10-19 | 广东工业大学 | 一种环境异常检测方法和系统 |
CN114332742B (zh) * | 2022-03-08 | 2022-06-03 | 西安科技大学 | 一种基于深度神经网络的异常视频大数据清洗方法 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8185561B1 (en) * | 2005-08-15 | 2012-05-22 | Google Inc. | Scalable user clustering based on set similarity |
CN103336963A (zh) * | 2013-07-08 | 2013-10-02 | 天脉聚源(北京)传媒科技有限公司 | 一种图像特征提取的方法及装置 |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100070509A1 (en) * | 2008-08-15 | 2010-03-18 | Kai Li | System And Method For High-Dimensional Similarity Search |
-
2013
- 2013-12-10 CN CN201310664350.3A patent/CN104035949B/zh not_active Expired - Fee Related
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8185561B1 (en) * | 2005-08-15 | 2012-05-22 | Google Inc. | Scalable user clustering based on set similarity |
CN103336963A (zh) * | 2013-07-08 | 2013-10-02 | 天脉聚源(北京)传媒科技有限公司 | 一种图像特征提取的方法及装置 |
Non-Patent Citations (1)
Title |
---|
一种重复视频的快速检测算法;刘大伟等;《小型微型计算机系统》;20130630;第34卷(第6期);第1400-1404页 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2019165546A1 (en) * | 2018-03-01 | 2019-09-06 | Huawei Technologies Canada Co., Ltd. | Layered locality sensitive hashing (lsh) partition indexing for big data applications |
Also Published As
Publication number | Publication date |
---|---|
CN104035949A (zh) | 2014-09-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104035949B (zh) | 一种基于局部敏感哈希改进算法的相似性数据检索方法 | |
Zheng et al. | Packing and padding: Coupled multi-index for accurate image retrieval | |
US11048966B2 (en) | Method and device for comparing similarities of high dimensional features of images | |
CN101866366B (zh) | 一种基于内容的图像格式中文文档检索方法 | |
Chen et al. | Ranking consistency for image matching and object retrieval | |
CN104081435A (zh) | 一种基于级联二值编码的图像匹配方法 | |
Zheng et al. | Visual phraselet: Refining spatial constraints for large scale image search | |
CN104199827A (zh) | 基于局部敏感哈希的大规模多媒体数据的高维索引方法 | |
CN102254015A (zh) | 基于视觉词组的图像检索方法 | |
CN103617217A (zh) | 一种基于层次索引的图像检索方法及系统 | |
CN102609441A (zh) | 基于分布熵的局部敏感哈希高维索引方法 | |
CN107180079B (zh) | 基于卷积神经网络以及树与哈希结合索引的图像检索方法 | |
CN106570165A (zh) | 一种基于内容的视频检索方法及装置 | |
CN110334290B (zh) | 一种基于MF-Octree的时空数据快速检索方法 | |
Dong et al. | Mining data correlation from multi-faceted sensor data in the Internet of Things | |
Liu et al. | Visual reranking with improved image graph | |
Zhu et al. | SVS-JOIN: efficient spatial visual similarity join for geo-multimedia | |
CN106557533B (zh) | 一种单目标多图像联合检索的方法和装置 | |
CN104978729A (zh) | 一种基于数据感知的图像哈希方法 | |
Zhou et al. | Accurate querying of frequent subgraphs in power grid graph data | |
Zhou et al. | Large scale nearest neighbors search based on neighborhood graph | |
Mu et al. | Coordinate Discrete Optimization for Efficient Cross-View Image Retrieval. | |
Buaba et al. | Locality sensitive hashing for satellite images using texture feature vectors | |
Kong et al. | Coarse2Fine: Two-layer fusion for image retrieval | |
Li et al. | Partial-duplicate clustering and visual pattern discovery on web scale image database |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20181126 Address after: Room 1108, 11th floor, 23 Zhichun Road, Haidian District, Beijing, 100089 Patentee after: BEIJING ZHIXIN FUTURE INFORMATION TECHNOLOGY Co.,Ltd. Address before: Room 610, No. 999 Shanxi Road, Changning District, Shanghai 200000 Patentee before: Shanghai Airlines Intellectual Property Services Ltd. Effective date of registration: 20181126 Address after: Room 610, No. 999 Shanxi Road, Changning District, Shanghai 200000 Patentee after: Shanghai Airlines Intellectual Property Services Ltd. Address before: Room 2310, Building 2, Wuzhong Science and Technology Pioneering Park, 70 Zhongshan East Road, Mudu Town, Wuzhong District, Suzhou City, Jiangsu Province Patentee before: Nanjing University of Information Science and Technology |
|
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20170510 Termination date: 20211210 |