CN114707007A - Image-text retrieval method, apparatus, and computer storage medium - Google Patents
Image-text retrieval method, apparatus, and computer storage medium
- Publication number
- CN114707007A (application CN202210635337.4A)
- Authority
- CN
- China
- Prior art keywords
- image
- text
- retrieval
- sample
- label
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Concepts
Concept | Type | Sections | Count |
---|---|---|---|
method | Methods | title, claims, abstract, description | 61 |
training | Methods | claims, abstract, description | 46 |
screening | Methods | claims, abstract, description | 36 |
mapping | Methods | claims, abstract, description | 18 |
sample | Substances | claims, description | 118 |
function | Effects | claims, description | 25 |
computer program | Methods | claims, description | 10 |
process | Effects | claims, description | 6 |
construction | Methods | claims, description | 5 |
glass | Substances | description | 9 |
overflow downdraw method | Methods | description | 8 |
calculation | Methods | description | 5 |
fusion | Effects | description | 4 |
diagram | Methods | description | 3 |
evaluation | Methods | description | 3 |
improvement | Effects | description | 3 |
algorithm | Methods | description | 2 |
engineering process | Methods | description | 2 |
experiment | Methods | description | 2 |
modification | Methods | description | 2 |
modification | Effects | description | 2 |
processing | Methods | description | 2 |
research | Methods | description | 2 |
verification | Methods | description | 2 |
acceleration | Effects | description | 1 |
design | Methods | description | 1 |
displacement reaction | Methods | description | 1 |
effects | Effects | description | 1 |
interaction | Effects | description | 1 |
progressive effect | Effects | description | 1 |
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/41—Indexing; Data structures therefor; Storage structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/31—Indexing; Data structures therefor; Storage structures
- G06F16/316—Indexing structures
- G06F16/319—Inverted lists
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
- G06F16/353—Clustering; Classification into predefined classes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/43—Querying
- G06F16/432—Query formulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/43—Querying
- G06F16/432—Query formulation
- G06F16/434—Query formulation using image data, e.g. images, photos, pictures taken by a user
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/45—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/48—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/48—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/483—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/51—Indexing; Data structures therefor; Storage structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/55—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/58—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/583—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/58—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/5866—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, manually generated location and time information
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Library & Information Science (AREA)
- Software Systems (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Key | Value |
---|---|
1 | man, hat, glasses |
2 | dog, stick, glass |
… | … |
18 | man, cup, glasses |

Key | Value |
---|---|
man | 1, 9, 18 |
glasses | 6, 11, 18 |
… | … |
hat | 1, 4, 6 |
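
The two key/value tables above have the shape of a forward index (record ID to labels) and an inverted index (label to record IDs), which fits the G06F16/319 (inverted lists) classification listed for this patent. As a minimal sketch, and not the patent's stated procedure, the Python below builds an inverted index from the three rows visible in the first table and looks up the IDs that carry every query label. The function names, the English label strings, and the use of set intersection are assumptions made for illustration. The rows hidden behind "…" are not filled in, so the result is not expected to reproduce the second table.

```python
from collections import defaultdict

# Forward index: record ID -> labels. Only the three rows visible in the
# first table are used; rows elided by "…" are deliberately left out.
forward_index = {
    1: ["man", "hat", "glasses"],
    2: ["dog", "stick", "glass"],
    18: ["man", "cup", "glasses"],
}


def build_inverted_index(forward):
    """Map each label to the sorted list of record IDs that carry it."""
    inverted = defaultdict(set)
    for record_id, labels in forward.items():
        for label in labels:
            inverted[label].add(record_id)
    return {label: sorted(ids) for label, ids in inverted.items()}


def candidates(inverted, query_labels):
    """Return the IDs that carry every query label (assumed: intersection)."""
    id_sets = [set(inverted.get(label, ())) for label in query_labels]
    if not id_sets:
        return []
    return sorted(set.intersection(*id_sets))


inverted_index = build_inverted_index(forward_index)
print(inverted_index["man"])                          # [1, 18]
print(candidates(inverted_index, ["man", "hat"]))     # [1]
```

Keeping each posting list sorted makes the output deterministic. A union of the posting lists instead of an intersection would give a looser candidate set; which of the two the patent uses is not stated in this excerpt.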
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210635337.4A CN114707007B (zh) | 2022-06-07 | 2022-06-07 | Image-text retrieval method, apparatus, and computer storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210635337.4A CN114707007B (zh) | 2022-06-07 | 2022-06-07 | Image-text retrieval method, apparatus, and computer storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114707007A (zh) | 2022-07-05 |
CN114707007B (zh) | 2022-08-30 |
Family
ID=82177858
Family Applications (1)
Application Number | Status | Priority Date | Filing Date | Title |
---|---|---|---|---|
CN202210635337.4A CN114707007B (zh) | Active | 2022-06-07 | 2022-06-07 | Image-text retrieval method, apparatus, and computer storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114707007B (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
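CN116049459A (zh) * | 2023-03-30 | 2023-05-02 | 浪潮电子信息产业股份有限公司 | Cross-modal mutual retrieval method, apparatus, server, and storage medium |
CN116844168A (zh) * | 2023-06-30 | 2023-10-03 | 北京百度网讯科技有限公司 | Method for determining text, and training method and apparatus for a deep learning model |
WO2024041479A1 (zh) * | 2022-08-22 | 2024-02-29 | 华为技术有限公司 | Data processing method and apparatus |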
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102023989A (zh) * | 2009-09-23 | 2011-04-20 | 阿里巴巴集团控股有限公司 | Information retrieval method and system |
CN103678694A (zh) * | 2013-12-26 | 2014-03-26 | 乐视网信息技术(北京)股份有限公司 | Method and system for building an inverted index file for video resources |
CN108895987A (zh) * | 2018-07-17 | 2018-11-27 | 苏州大学 | Method for measuring lens radius of curvature based on composite vortex beam interference |
US10614366B1 (en) * | 2006-01-31 | 2020-04-07 | The Research Foundation for the State University o | System and method for multimedia ranking and multi-modal image retrieval using probabilistic semantic models and expectation-maximization (EM) learning |
CN111030952A (zh) * | 2019-12-25 | 2020-04-17 | 内蒙古大学 | Beamspace channel estimation method and system for millimeter-wave systems |
CN111680173A (zh) * | 2020-05-31 | 2020-09-18 | 西南电子技术研究所(中国电子科技集团公司第十研究所) | CMR model for unified retrieval of cross-media information |
CN112148831A (zh) * | 2020-11-26 | 2020-12-29 | 广州华多网络科技有限公司 | Image-text hybrid retrieval method, apparatus, storage medium, and computer device |
CN114201621A (zh) * | 2021-11-24 | 2022-03-18 | 人民网股份有限公司 | Cross-modal retrieval model construction and retrieval method based on image-text co-attention |
- 2022-06-07: CN application CN202210635337.4A, patent CN114707007B (zh), active
Non-Patent Citations (3)
Title |
---|
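Zhiqiang Yuan et al.: "A Lightweight Multi-Scale Crossmodal Text-Image Retrieval Method in Remote Sensing", IEEE Transactions on Geoscience and Remote Sensing * |
王洋: "Multimodal Image Retrieval Technology", China Doctoral Dissertations Full-text Database, Information Science and Technology * |
董丽丽 et al.: "Deep-Learning-Based Retrieval of Overlapping Regions in Large-Scale Semantic Text", Journal of Jilin University (Engineering and Technology Edition) * |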
Also Published As
Publication number | Publication date |
---|---|
CN114707007B (zh) | 2022-08-30 |
Similar Documents
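Publication | Title | Publication Date |
---|---|---|
CN114707007B (zh) | Image-text retrieval method, apparatus, and computer storage medium | |
CN108363790B (zh) | Method, apparatus, device, and storage medium for evaluating comments | |
KR102288249B1 (ko) | Information processing method, terminal, and computer storage medium | |
CN110727779A (zh) | Question answering method and system based on multi-model fusion | |
CN108932342A (zh) | Semantic matching method, model learning method, and server | |
CN110674252A (zh) | High-precision semantic search system for the judicial domain | |
CN109597493B (zh) | Emoticon recommendation method and apparatus | |
CN112836487B (zh) | Automatic comment method, apparatus, computer device, and storage medium | |
CN108846138B (zh) | Method, apparatus, and medium for constructing a question classification model that incorporates answer information | |
CN111291172B (zh) | Method and apparatus for processing text | |
CN112270188A (zh) | Question-based analysis path recommendation method, system, and storage medium | |
CN110990532A (zh) | Method and apparatus for processing text | |
CN111125457A (zh) | Deep cross-modal hash retrieval method and apparatus | |
CN115455171B (zh) | Text-video mutual retrieval and model training method, apparatus, device, and medium | |
CN109522396B (zh) | Knowledge processing method and system for the national defense science and technology domain | |
CN112597285A (zh) | Knowledge-graph-based human-computer interaction method and system | |
CN113742488A (zh) | Embedded knowledge graph completion method and apparatus based on multi-task learning | |
CN113946698A (zh) | Cross-media retrieval method and system fusing multi-granularity data and nearest-neighbor data | |
CN117648429A (zh) | Question answering method and system based on a multimodal adaptive retrieval-augmented large model | |
CN117574898A (zh) | Method and system for updating a domain knowledge graph based on power grid equipment | |
CN110659392B (zh) | Retrieval method and apparatus, and storage medium | |
CN117291192B (zh) | Method and system for semantic understanding and analysis of government-affairs text | |
CN114491079A (zh) | Knowledge graph construction and query method, apparatus, device, and medium | |
CN112528003B (zh) | Multiple-choice question answering method based on semantic ranking and knowledge correction | |
CN112712056A (zh) | Video semantic analysis method, apparatus, storage medium, and electronic device | |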
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
GR01 | Patent grant | |
TR01 | Transfer of patent right | Effective date of registration: 2024-01-20. Address after: Room 1505, 15th Floor, West Building of Wanda Plaza, No. 188 Shihu West Road, Changqiao Street, Wuzhong District, Suzhou City, Jiangsu Province, 215000 (Suzhou University National University Science and Technology Park Wuzhong Branch). Patentee after: Suzhou Zhongyao Intelligent System Co.,Ltd. Country or region after: China. Address before: No. 188, Shihu West Road, Wuzhong District, Suzhou City, Jiangsu Province. Patentee before: Soochow University. Country or region before: China |