CN112528989A - 一种图像语义细粒度的描述生成方法 - Google Patents
一种图像语义细粒度的描述生成方法 Download PDFInfo
- Publication number
- CN112528989A CN112528989A CN202011387365.6A CN202011387365A CN112528989A CN 112528989 A CN112528989 A CN 112528989A CN 202011387365 A CN202011387365 A CN 202011387365A CN 112528989 A CN112528989 A CN 112528989A
- Authority
- CN
- China
- Prior art keywords
- image
- region
- description
- semantic
- lstm
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/25—Determination of region of interest [ROI] or a volume of interest [VOI]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2415—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/047—Probabilistic or stochastic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/048—Activation functions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/049—Temporal neural networks, e.g. delay elements, oscillating neurons or pulsed inputs
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Software Systems (AREA)
- Mathematical Physics (AREA)
- Computing Systems (AREA)
- Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Probability & Statistics with Applications (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Multimedia (AREA)
- Image Analysis (AREA)
Abstract
Description
Claims (7)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011387365.6A CN112528989B (zh) | 2020-12-01 | 2020-12-01 | 一种图像语义细粒度的描述生成方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011387365.6A CN112528989B (zh) | 2020-12-01 | 2020-12-01 | 一种图像语义细粒度的描述生成方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112528989A true CN112528989A (zh) | 2021-03-19 |
CN112528989B CN112528989B (zh) | 2022-10-18 |
Family
ID=74996036
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011387365.6A Active CN112528989B (zh) | 2020-12-01 | 2020-12-01 | 一种图像语义细粒度的描述生成方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112528989B (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114037831A (zh) * | 2021-07-20 | 2022-02-11 | 星汉智能科技股份有限公司 | 图像深度密集描述方法、系统及存储介质 |
CN114417891A (zh) * | 2022-01-22 | 2022-04-29 | 平安科技(深圳)有限公司 | 基于粗糙语义的回复语句确定方法、装置及电子设备 |
Citations (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20170147910A1 (en) * | 2015-10-02 | 2017-05-25 | Baidu Usa Llc | Systems and methods for fast novel visual concept learning from sentence descriptions of images |
CN107680109A (zh) * | 2017-09-15 | 2018-02-09 | 盐城禅图智能科技有限公司 | 一种引用逆注意力与像素相似度学习的图像语义分割方法 |
WO2018094296A1 (en) * | 2016-11-18 | 2018-05-24 | Salesforce.Com, Inc. | Sentinel long short-term memory |
CN109086357A (zh) * | 2018-07-18 | 2018-12-25 | 深圳大学 | 基于变分自动编码器的情感分类方法、装置、设备及介质 |
CN109726696A (zh) * | 2019-01-03 | 2019-05-07 | 电子科技大学 | 基于推敲注意力机制的图像描述生成系统及方法 |
CN110033008A (zh) * | 2019-04-29 | 2019-07-19 | 同济大学 | 一种基于模态变换与文本归纳的图像描述生成方法 |
CN110168573A (zh) * | 2016-11-18 | 2019-08-23 | 易享信息技术有限公司 | 用于图像标注的空间注意力模型 |
CN110188779A (zh) * | 2019-06-03 | 2019-08-30 | 中国矿业大学 | 一种图像语义描述的生成方法 |
CN110390363A (zh) * | 2019-07-29 | 2019-10-29 | 上海海事大学 | 一种图像描述方法 |
CN110458282A (zh) * | 2019-08-06 | 2019-11-15 | 齐鲁工业大学 | 一种融合多角度多模态的图像描述生成方法及系统 |
CN110472642A (zh) * | 2019-08-19 | 2019-11-19 | 齐鲁工业大学 | 基于多级注意力的细粒度图像描述方法及系统 |
CN110674850A (zh) * | 2019-09-03 | 2020-01-10 | 武汉大学 | 一种基于注意力机制的图像描述生成方法 |
WO2020081314A1 (en) * | 2018-10-15 | 2020-04-23 | Ancestry.Com Operations Inc. | Image captioning with weakly-supervised attention penalty |
CN111160467A (zh) * | 2019-05-31 | 2020-05-15 | 北京理工大学 | 一种基于条件随机场和内部语义注意力的图像描述方法 |
CN111310676A (zh) * | 2020-02-21 | 2020-06-19 | 重庆邮电大学 | 基于CNN-LSTM和attention的视频动作识别方法 |
CN111462282A (zh) * | 2020-04-02 | 2020-07-28 | 哈尔滨工程大学 | 一种场景图生成方法 |
CN111612103A (zh) * | 2020-06-23 | 2020-09-01 | 中国人民解放军国防科技大学 | 结合抽象语义表示的图像描述生成方法、系统及介质 |
CN111859005A (zh) * | 2020-07-01 | 2020-10-30 | 江西理工大学 | 一种跨层多模型特征融合与基于卷积解码的图像描述方法 |
-
2020
- 2020-12-01 CN CN202011387365.6A patent/CN112528989B/zh active Active
Patent Citations (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20170147910A1 (en) * | 2015-10-02 | 2017-05-25 | Baidu Usa Llc | Systems and methods for fast novel visual concept learning from sentence descriptions of images |
US20200117854A1 (en) * | 2016-11-18 | 2020-04-16 | Salesforce.Com, Inc. | Adaptive Attention Model for Image Captioning |
WO2018094296A1 (en) * | 2016-11-18 | 2018-05-24 | Salesforce.Com, Inc. | Sentinel long short-term memory |
CN110168573A (zh) * | 2016-11-18 | 2019-08-23 | 易享信息技术有限公司 | 用于图像标注的空间注意力模型 |
CN107680109A (zh) * | 2017-09-15 | 2018-02-09 | 盐城禅图智能科技有限公司 | 一种引用逆注意力与像素相似度学习的图像语义分割方法 |
CN109086357A (zh) * | 2018-07-18 | 2018-12-25 | 深圳大学 | 基于变分自动编码器的情感分类方法、装置、设备及介质 |
WO2020081314A1 (en) * | 2018-10-15 | 2020-04-23 | Ancestry.Com Operations Inc. | Image captioning with weakly-supervised attention penalty |
CN109726696A (zh) * | 2019-01-03 | 2019-05-07 | 电子科技大学 | 基于推敲注意力机制的图像描述生成系统及方法 |
CN110033008A (zh) * | 2019-04-29 | 2019-07-19 | 同济大学 | 一种基于模态变换与文本归纳的图像描述生成方法 |
CN111160467A (zh) * | 2019-05-31 | 2020-05-15 | 北京理工大学 | 一种基于条件随机场和内部语义注意力的图像描述方法 |
CN110188779A (zh) * | 2019-06-03 | 2019-08-30 | 中国矿业大学 | 一种图像语义描述的生成方法 |
CN110390363A (zh) * | 2019-07-29 | 2019-10-29 | 上海海事大学 | 一种图像描述方法 |
CN110458282A (zh) * | 2019-08-06 | 2019-11-15 | 齐鲁工业大学 | 一种融合多角度多模态的图像描述生成方法及系统 |
CN110472642A (zh) * | 2019-08-19 | 2019-11-19 | 齐鲁工业大学 | 基于多级注意力的细粒度图像描述方法及系统 |
CN110674850A (zh) * | 2019-09-03 | 2020-01-10 | 武汉大学 | 一种基于注意力机制的图像描述生成方法 |
CN111310676A (zh) * | 2020-02-21 | 2020-06-19 | 重庆邮电大学 | 基于CNN-LSTM和attention的视频动作识别方法 |
CN111462282A (zh) * | 2020-04-02 | 2020-07-28 | 哈尔滨工程大学 | 一种场景图生成方法 |
CN111612103A (zh) * | 2020-06-23 | 2020-09-01 | 中国人民解放军国防科技大学 | 结合抽象语义表示的图像描述生成方法、系统及介质 |
CN111859005A (zh) * | 2020-07-01 | 2020-10-30 | 江西理工大学 | 一种跨层多模型特征融合与基于卷积解码的图像描述方法 |
Non-Patent Citations (10)
Title |
---|
HARTATIK等: "Captioning Image Using Convolutional Neural Network (CNN) and Long-Short Term Memory (LSTM)", 《IEEE》 * |
LUN HUANG等: "Attention on Attention for Image Captioning", 《IEEE》 * |
PENG, YUQING等: "Image caption model of double LSTM with scene factors", 《IMAGE AND VISION COMPUTING》 * |
PETER ANDERSON等: "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering", 《IEEE》 * |
WANG, CHENG等: "Image captioning with deep bidirectional LSTMs and multi-task learning", 《ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS》 * |
张家硕等: "基于双向注意力机制的图像描述生成", 《中文信息学报》 * |
武文博等: "基于深度卷积与全局特征的图像密集字幕描述", 《信号处理》 * |
汤跃: "基于深度学习的图像语义细粒度描述方法研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
王俊豪等: "通过细粒度的语义特征与Transformer丰富图像描述", 《华东师范大学学报(自然科学版)》 * |
赵小虎等: "基于全局-局部特征和自适应注意力机制的图像语义描述算法", 《浙江大学学报(工学版)》 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114037831A (zh) * | 2021-07-20 | 2022-02-11 | 星汉智能科技股份有限公司 | 图像深度密集描述方法、系统及存储介质 |
CN114417891A (zh) * | 2022-01-22 | 2022-04-29 | 平安科技(深圳)有限公司 | 基于粗糙语义的回复语句确定方法、装置及电子设备 |
CN114417891B (zh) * | 2022-01-22 | 2023-05-09 | 平安科技(深圳)有限公司 | 基于粗糙语义的回复语句确定方法、装置及电子设备 |
Also Published As
Publication number | Publication date |
---|---|
CN112528989B (zh) | 2022-10-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110502749B (zh) | 一种基于双层注意力机制与双向gru的文本关系抽取方法 | |
CN113254599B (zh) | 一种基于半监督学习的多标签微博文本分类方法 | |
CN107943784B (zh) | 基于生成对抗网络的关系抽取方法 | |
Xie et al. | Attention-based dense LSTM for speech emotion recognition | |
CN113011186B (zh) | 命名实体识别方法、装置、设备及计算机可读存储介质 | |
CN109815485B (zh) | 一种微博短文本情感极性识别的方法、装置及存储介质 | |
Li et al. | Vision-language intelligence: Tasks, representation learning, and large models | |
CN111581970B (zh) | 一种网络语境的文本识别方法、装置及存储介质 | |
CN112528989B (zh) | 一种图像语义细粒度的描述生成方法 | |
CN111949824A (zh) | 基于语义对齐的视觉问答方法和系统、存储介质 | |
CN113051887A (zh) | 一种公告信息元素抽取方法、系统及装置 | |
CN110968725A (zh) | 图像内容描述信息生成方法、电子设备及存储介质 | |
Agrawal et al. | Image Caption Generator Using Attention Mechanism | |
CN113761377B (zh) | 基于注意力机制多特征融合的虚假信息检测方法、装置、电子设备及存储介质 | |
CN115759119A (zh) | 一种金融文本情感分析方法、系统、介质和设备 | |
Toshevska et al. | Exploration into deep learning text generation architectures for dense image captioning | |
CN113792143B (zh) | 一种基于胶囊网络的多语言情感分类方法、装置、设备及存储介质 | |
CN114417872A (zh) | 一种合同文本命名实体识别方法及系统 | |
Cho et al. | Design of image generation system for DCGAN-based kids' book text | |
Rafi et al. | A linear sub-structure with co-variance shift for image captioning | |
El-Gayar | Automatic Generation of Image Caption Based on Semantic Relation using Deep Visual Attention Prediction | |
CN113129399A (zh) | 纹样生成 | |
Hammad et al. | Characterizing the impact of using features extracted from pre-trained models on the quality of video captioning sequence-to-sequence models | |
Xie et al. | Enhancing multimodal deep representation learning by fixed model reuse | |
CN111801673A (zh) | 应用程序的介绍方法、移动终端及服务器 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20230829 Address after: 200120 building C, No.888, Huanhu West 2nd Road, Lingang New District, Pudong New Area, Shanghai Patentee after: Shanghai Kailing Technology Co.,Ltd. Address before: 830000, Room 17A, Building 17, Block A, Times Square Community, No. 59 Guangming Road, Tianshan District, Urumqi, Xinjiang Uygur Autonomous Region BD00244 Patentee before: Urumqi Bangbangjun Technology Co.,Ltd. Effective date of registration: 20230829 Address after: 830000, Room 17A, Building 17, Block A, Times Square Community, No. 59 Guangming Road, Tianshan District, Urumqi, Xinjiang Uygur Autonomous Region BD00244 Patentee after: Urumqi Bangbangjun Technology Co.,Ltd. Address before: 400065 Chongwen Road, Nanshan Street, Nanan District, Chongqing Patentee before: CHONGQING University OF POSTS AND TELECOMMUNICATIONS |
|
TR01 | Transfer of patent right |