CN117725247B - 一种基于检索及分割增强的扩散图像生成方法及系统 - Google Patents
一种基于检索及分割增强的扩散图像生成方法及系统 Download PDFInfo
- Publication number
- CN117725247B CN117725247B CN202410172400.4A CN202410172400A CN117725247B CN 117725247 B CN117725247 B CN 117725247B CN 202410172400 A CN202410172400 A CN 202410172400A CN 117725247 B CN117725247 B CN 117725247B
- Authority
- CN
- China
- Prior art keywords
- image
- vector
- model
- text
- encoder
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000009792 diffusion process Methods 0.000 title claims abstract description 106
- 230000011218 segmentation Effects 0.000 title claims abstract description 83
- 238000000034 method Methods 0.000 title claims abstract description 66
- 239000013598 vector Substances 0.000 claims abstract description 221
- 238000012549 training Methods 0.000 claims abstract description 84
- 230000004927 fusion Effects 0.000 claims abstract description 39
- 230000008569 process Effects 0.000 claims abstract description 27
- 238000010276 construction Methods 0.000 claims abstract description 26
- 230000007246 mechanism Effects 0.000 claims description 29
- 238000004590 computer program Methods 0.000 claims description 9
- 230000006870 function Effects 0.000 claims description 7
- 230000005540 biological transmission Effects 0.000 claims description 6
- 238000004364 calculation method Methods 0.000 claims description 6
- 239000011159 matrix material Substances 0.000 claims description 6
- 239000000470 constituent Substances 0.000 claims description 4
- 238000005457 optimization Methods 0.000 claims description 4
- 238000012935 Averaging Methods 0.000 claims description 3
- 238000005516 engineering process Methods 0.000 abstract description 5
- 230000000007 visual effect Effects 0.000 abstract description 4
- 238000004422 calculation algorithm Methods 0.000 description 24
- 238000010586 diagram Methods 0.000 description 10
- 230000000750 progressive effect Effects 0.000 description 7
- 238000013528 artificial neural network Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 238000010606 normalization Methods 0.000 description 2
- 238000007670 refining Methods 0.000 description 2
- 238000013473 artificial intelligence Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
Classifications
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Image Processing (AREA)
Abstract
Description
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202410172400.4A CN117725247B (zh) | 2024-02-07 | 2024-02-07 | 一种基于检索及分割增强的扩散图像生成方法及系统 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202410172400.4A CN117725247B (zh) | 2024-02-07 | 2024-02-07 | 一种基于检索及分割增强的扩散图像生成方法及系统 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN117725247A CN117725247A (zh) | 2024-03-19 |
CN117725247B true CN117725247B (zh) | 2024-04-26 |
Family
ID=90210990
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202410172400.4A Active CN117725247B (zh) | 2024-02-07 | 2024-02-07 | 一种基于检索及分割增强的扩散图像生成方法及系统 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117725247B (zh) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN118247440B (zh) * | 2024-05-27 | 2024-09-06 | 广东朝野科技有限公司 | 一种电视机外壳3d模型构建方法及系统 |
CN118365887B (zh) * | 2024-06-18 | 2024-09-10 | 广东电网有限责任公司 | 一种开放词汇输电线路设备图像分割方法及装置 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116630482A (zh) * | 2023-07-26 | 2023-08-22 | 拓尔思信息技术股份有限公司 | 一种基于多模态检索与轮廓引导的图像生成方法 |
CN116883530A (zh) * | 2023-07-06 | 2023-10-13 | 中山大学 | 一种基于细粒度语义奖励的文本到图像生成方法 |
CN117351325A (zh) * | 2023-12-06 | 2024-01-05 | 浙江省建筑设计研究院 | 一种模型训练方法、建筑效果图生成方法、设备及介质 |
CN117521672A (zh) * | 2023-12-22 | 2024-02-06 | 湖南大学 | 一种基于扩散模型的长文本生成连续图片的方法 |
-
2024
- 2024-02-07 CN CN202410172400.4A patent/CN117725247B/zh active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116883530A (zh) * | 2023-07-06 | 2023-10-13 | 中山大学 | 一种基于细粒度语义奖励的文本到图像生成方法 |
CN116630482A (zh) * | 2023-07-26 | 2023-08-22 | 拓尔思信息技术股份有限公司 | 一种基于多模态检索与轮廓引导的图像生成方法 |
CN117351325A (zh) * | 2023-12-06 | 2024-01-05 | 浙江省建筑设计研究院 | 一种模型训练方法、建筑效果图生成方法、设备及介质 |
CN117521672A (zh) * | 2023-12-22 | 2024-02-06 | 湖南大学 | 一种基于扩散模型的长文本生成连续图片的方法 |
Non-Patent Citations (2)
Title |
---|
Rapid Diffusion: Building Domain-Specifc Text-to-Image Synthesizers with Fast Inference Speed;Bingyan Liu 等;《Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics》;20230712;全文 * |
基于扩散模型的多模态引导图像合成系统;何文睿 等;《北京信息科技大学学报》;20231231;第38卷(第6期);全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN117725247A (zh) | 2024-03-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN117725247B (zh) | 一种基于检索及分割增强的扩散图像生成方法及系统 | |
CN111832501B (zh) | 一种面向卫星在轨应用的遥感影像文本智能描述方法 | |
CN111967277B (zh) | 基于多模态机器翻译模型的翻译方法 | |
CN114860893B (zh) | 基于多模态数据融合与强化学习的智能决策方法及装置 | |
CN115797495B (zh) | 一种句子-字符语义空间融合感知的文本生成图像的方法 | |
CN110110331B (zh) | 文本生成方法、装置、介质和计算设备 | |
Huang et al. | Turbo learning for captionbot and drawingbot | |
CN117058673A (zh) | 文本生成图像模型训练方法、系统以及文本生成图像方法、系统 | |
CN115129839A (zh) | 基于图感知的视觉对话答案生成方法及装置 | |
CN117437317A (zh) | 图像生成方法、装置、电子设备、存储介质和程序产品 | |
CN114626529B (zh) | 一种自然语言推理微调方法、系统、装置及存储介质 | |
CN115587924A (zh) | 一种基于循环生成对抗网络的自适应掩膜引导的图像模态转换方法 | |
CN117541668A (zh) | 虚拟角色的生成方法、装置、设备及存储介质 | |
CN117576248B (zh) | 基于姿态引导的图像生成方法和装置 | |
CN116980541B (zh) | 视频编辑方法、装置、电子设备以及存储介质 | |
CN114169408A (zh) | 一种基于多模态注意力机制的情感分类方法 | |
CN116975347A (zh) | 图像生成模型训练方法及相关装置 | |
Weerakoon et al. | SoftSkip: Empowering Multi-Modal Dynamic Pruning for Single-Stage Referring Comprehension | |
CN117034133A (zh) | 一种数据处理方法、装置、设备和介质 | |
CN110969187B (zh) | 一种图谱迁移的语义分析方法 | |
CN113392249A (zh) | 图文信息分类方法、图文分类模型训练方法、介质及设备 | |
Meira et al. | Generating Synthetic Faces for Data Augmentation with StyleGAN2-ADA. | |
CN114494774B (zh) | 一种图像分类方法、装置、电子设备及存储介质 | |
Jin et al. | A Simple and Effective Baseline for Attentional Generative Adversarial Networks | |
US20240169662A1 (en) | Latent Pose Queries for Machine-Learned Image View Synthesis |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CP03 | Change of name, title or address |
Address after: No. 401-1, 4th floor, podium, building 3 and 4, No. 11, Changchun Bridge Road, Haidian District, Beijing 100089 Patentee after: Beijing Xinghe Zhiyuan Technology Co.,Ltd. Country or region after: China Patentee after: Zhiguagua (Tianjin) Big Data Technology Co.,Ltd. Address before: No. 401-1, 4th floor, podium, building 3 and 4, No. 11, Changchun Bridge Road, Haidian District, Beijing 100089 Patentee before: Beijing Zhiguagua Technology Co.,Ltd. Country or region before: China Patentee before: Zhiguagua (Tianjin) Big Data Technology Co.,Ltd. |
|
CP03 | Change of name, title or address | ||
TR01 | Transfer of patent right |
Effective date of registration: 20240508 Address after: No. 401-1, 4th floor, podium, building 3 and 4, No. 11, Changchun Bridge Road, Haidian District, Beijing 100089 Patentee after: Beijing Xinghe Zhiyuan Technology Co.,Ltd. Country or region after: China Address before: No. 401-1, 4th floor, podium, building 3 and 4, No. 11, Changchun Bridge Road, Haidian District, Beijing 100089 Patentee before: Beijing Xinghe Zhiyuan Technology Co.,Ltd. Country or region before: China Patentee before: Zhiguagua (Tianjin) Big Data Technology Co.,Ltd. |
|
TR01 | Transfer of patent right |