CN113343937B - 一种基于深度卷积和注意力机制的唇语识别方法 - Google Patents
一种基于深度卷积和注意力机制的唇语识别方法 Download PDFInfo
- Publication number
- CN113343937B CN113343937B CN202110801803.7A CN202110801803A CN113343937B CN 113343937 B CN113343937 B CN 113343937B CN 202110801803 A CN202110801803 A CN 202110801803A CN 113343937 B CN113343937 B CN 113343937B
- Authority
- CN
- China
- Prior art keywords
- convolution
- lip
- layer
- inputting
- term
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- Image Analysis (AREA)
Abstract
Description
Claims (3)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110801803.7A CN113343937B (zh) | 2021-07-15 | 2021-07-15 | 一种基于深度卷积和注意力机制的唇语识别方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110801803.7A CN113343937B (zh) | 2021-07-15 | 2021-07-15 | 一种基于深度卷积和注意力机制的唇语识别方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113343937A CN113343937A (zh) | 2021-09-03 |
CN113343937B true CN113343937B (zh) | 2022-09-02 |
Family
ID=77479823
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110801803.7A Active CN113343937B (zh) | 2021-07-15 | 2021-07-15 | 一种基于深度卷积和注意力机制的唇语识别方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113343937B (zh) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113989933B (zh) * | 2021-10-29 | 2024-04-16 | 国网江苏省电力有限公司苏州供电分公司 | 一种在线行为识别模型训练、检测方法及系统 |
CN113837147B (zh) * | 2021-10-29 | 2022-08-05 | 山东省人工智能研究院 | 一种基于transformer的假视频检测方法 |
CN114581811B (zh) * | 2022-01-12 | 2023-04-18 | 北京云辰信通科技有限公司 | 基于时空注意力机制的视觉语言识别方法和相关设备 |
CN114494791B (zh) * | 2022-04-06 | 2022-07-08 | 之江实验室 | 一种基于注意力选择的transformer运算精简方法及装置 |
CN116580440B (zh) * | 2023-05-24 | 2024-01-26 | 北华航天工业学院 | 基于视觉transformer的轻量级唇语识别方法 |
CN117392672B (zh) * | 2023-12-11 | 2024-03-19 | 季华实验室 | 流式细胞分类模型的获取方法、分类方法及相关设备 |
Family Cites Families (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10467274B1 (en) * | 2016-11-10 | 2019-11-05 | Snap Inc. | Deep reinforcement learning-based captioning with embedding reward |
DE112019000049T5 (de) * | 2018-02-18 | 2020-01-23 | Nvidia Corporation | Für autonomes fahren geeignete objekterfassung und erfassungssicherheit |
CN109858412A (zh) * | 2019-01-18 | 2019-06-07 | 东北大学 | 一种基于混合卷积神经网络的唇语识别方法 |
US11210554B2 (en) * | 2019-03-21 | 2021-12-28 | Illumina, Inc. | Artificial intelligence-based generation of sequencing metadata |
CN110188637A (zh) * | 2019-05-17 | 2019-08-30 | 西安电子科技大学 | 一种基于深度学习的行为识别技术方法 |
CN111178157A (zh) * | 2019-12-10 | 2020-05-19 | 浙江大学 | 一种基于音调的级联序列到序列模型的中文唇语识别方法 |
CN111339908B (zh) * | 2020-02-24 | 2023-08-15 | 青岛科技大学 | 基于多模态信息融合与决策优化的组群行为识别方法 |
CN111401250A (zh) * | 2020-03-17 | 2020-07-10 | 东北大学 | 一种基于混合卷积神经网络的中文唇语识别方法及装置 |
CN111753704B (zh) * | 2020-06-19 | 2022-08-26 | 南京邮电大学 | 一种基于视频人物唇读识别的时序集中预测方法 |
CN112330713B (zh) * | 2020-11-26 | 2023-12-19 | 南京工程学院 | 基于唇语识别的重度听障患者言语理解度的改进方法 |
CN112784798B (zh) * | 2021-02-01 | 2022-11-08 | 东南大学 | 一种基于特征-时间注意力机制的多模态情感识别方法 |
CN112861791B (zh) * | 2021-03-11 | 2022-08-23 | 河北工业大学 | 一种结合图神经网络和多特征融合的唇语识别方法 |
CN113033452B (zh) * | 2021-04-06 | 2022-09-16 | 合肥工业大学 | 融合通道注意力和选择性特征融合机制的唇语识别方法 |
-
2021
- 2021-07-15 CN CN202110801803.7A patent/CN113343937B/zh active Active
Also Published As
Publication number | Publication date |
---|---|
CN113343937A (zh) | 2021-09-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113343937B (zh) | 一种基于深度卷积和注意力机制的唇语识别方法 | |
CN108875807B (zh) | 一种基于多注意力多尺度的图像描述方法 | |
CN103605972B (zh) | 一种基于分块深度神经网络的非限制环境人脸验证方法 | |
CN109815826B (zh) | 人脸属性模型的生成方法及装置 | |
CN110633683B (zh) | 结合DenseNet和resBi-LSTM的中文句子级唇语识别方法 | |
Hao et al. | A survey of research on lipreading technology | |
CN110378208B (zh) | 一种基于深度残差网络的行为识别方法 | |
CN116580440B (zh) | 基于视觉transformer的轻量级唇语识别方法 | |
CN112307995A (zh) | 一种基于特征解耦学习的半监督行人重识别方法 | |
CN111259785B (zh) | 基于时间偏移残差网络的唇语识别方法 | |
CN113627266A (zh) | 基于Transformer时空建模的视频行人重识别方法 | |
US11908222B1 (en) | Occluded pedestrian re-identification method based on pose estimation and background suppression | |
CN114360067A (zh) | 一种基于深度学习的动态手势识别方法 | |
CN116665695B (zh) | 虚拟对象口型驱动方法、相关装置和介质 | |
CN114694255B (zh) | 基于通道注意力与时间卷积网络的句子级唇语识别方法 | |
CN111539445B (zh) | 一种半监督特征融合的对象分类方法及系统 | |
CN115035508A (zh) | 基于主题引导的Transformer的遥感图像字幕生成方法 | |
CN115601562A (zh) | 一种使用多尺度特征提取的锦鲤鱼检测与识别方法 | |
CN112906520A (zh) | 一种基于姿态编码的动作识别方法及装置 | |
CN115984485A (zh) | 一种基于自然文本描述的高保真三维人脸模型生成方法 | |
CN114040126A (zh) | 一种文字驱动的人物播报视频生成方法及装置 | |
CN117238019A (zh) | 基于时空相对变换的视频人脸表情类别识别方法和系统 | |
CN116884412A (zh) | 一种基于混合三维残差门控循环单元的唇语识别方法 | |
CN115690917B (zh) | 一种基于外观和运动智能关注的行人动作识别方法 | |
CN111488797A (zh) | 一种行人再识别方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB03 | Change of inventor or designer information |
Inventor after: Yuan Quanbo Inventor after: Wang Huijuan Inventor after: Pu Gangqiang Inventor before: Wang Huijuan Inventor before: Pu Gangqiang Inventor before: Yuan Quanbo |
|
CB03 | Change of inventor or designer information | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20230928 Address after: A07, 1st Floor, Office Building, No. 85 Huizhi Road, Longhe Economic Development Zone, Anci District, Langfang City, Hebei Province, 065000 Patentee after: Zhengji Taichuan Technology (Langfang) Co.,Ltd. Address before: 065099 No. 133 Aimin East Road, Langfang City, Hebei Province Patentee before: NORTH CHINA INSTITUTE OF AEROSPACE ENGINEERING |
|
TR01 | Transfer of patent right |