JP7207568B2 - Output method, output program, and output device - Google Patents
Output method, output program, and output device
- Publication number
- JP7207568B2 (application JP2021555729A)
- Authority
- JP
- Japan
- Prior art keywords
- vector
- modal
- output device
- modal information
- information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
- G06F18/254—Fusion techniques of classification results, e.g. of results related to same input data
- G06F18/256—Fusion techniques of classification results, e.g. of results related to same input data of results relating to different input data, e.g. multimodal recognition
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0499—Feedforward networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/09—Supervised learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/80—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
- G06V10/806—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/048—Activation functions
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Software Systems (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Computing Systems (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Medical Informatics (AREA)
- Mathematical Physics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Computational Linguistics (AREA)
- Databases & Information Systems (AREA)
- Biomedical Technology (AREA)
- Multimedia (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Evolutionary Biology (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Image Analysis (AREA)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/JP2019/044770, WO2021095212A1 (ja) | 2019-11-14 | 2019-11-14 | Output method, output program, and output device |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JPWO2021095212A1 | 2021-05-20 |
| JPWO2021095212A5 | 2022-04-06 |
| JP7207568B2 (ja) | 2023-01-18 |
Family
ID=75911528
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2021555729A (Active), JP7207568B2 (ja) | 2019-11-14 | 2019-11-14 | Output method, output program, and output device |
Country Status (3)
| Country | Link |
|---|---|
| US (1) | US20220237421A1 |
| JP (1) | JP7207568B2 |
| WO (1) | WO2021095212A1 |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPWO2024184975A1 * | 2023-03-03 | 2024-09-12 | | |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CA2670861A1 (en) * | 2006-11-28 | 2008-06-05 | Calgary Scientific Inc. | Texture-based multi-dimensional medical image registration |
| WO2016100816A1 (en) * | 2014-12-19 | 2016-06-23 | United Technologies Corporation | Sensor data fusion for prognostics and health monitoring |
| US11210560B2 (en) * | 2019-10-02 | 2021-12-28 | Mitsubishi Electric Research Laboratories, Inc. | Multi-modal dense correspondence imaging system |
2019
- 2019-11-14: WO, application PCT/JP2019/044770, publication WO2021095212A1 (ja), status: Ceased
- 2019-11-14: JP, application JP2021555729A, publication JP7207568B2 (ja), status: Active
2022
- 2022-04-12: US, application US17/719,211, publication US20220237421A1 (en), status: Pending
Non-Patent Citations (2)
| Title |
|---|
| LU, Jiasen, et al., ViLBERT: Pretraining task-agnostic visiolinguistic representations for vision-and-language tasks, arXiv.org [online], 2019-08-06, pp. 1-11, [retrieved 2019-12-13], Internet: <URL:https://arxiv.org/pdf/1908.02265v1.pdf> |
| NGUYEN, Duy-Kien, et al., Improved fusion of visual and language representations by dense symmetric co-attention for visual qu, [online], 2018, pp. 6087-6096, http://openaccess.thecvf.com/content_cvpr_2018/html/Nguyen_Improved_Fusion_of_CVPR_2018_paper.html, [retrieved 2019-12-13], Internet: <URL:http://openaccess.thecvf.com/content_cvpr_2018/html/Nguye
Also Published As
| Publication number | Publication date |
|---|---|
| WO2021095212A1 (ja) | 2021-05-20 |
| US20220237421A1 (en) | 2022-07-28 |
| JPWO2021095212A1 | 2021-05-20 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Tao et al. | | End-to-end audiovisual speech recognition system with multitask learning |
| US10621991B2 (en) | | Joint neural network for speaker recognition |
| US20220237263A1 (en) | | Method for outputting, computer-readable recording medium storing output program, and output device |
| US9342576B2 (en) | | Information processing device, information processing terminal, information processing method, and program |
| CN112100337B (zh) | | Emotion recognition method and device in interactive dialogue |
| CN108510982B (zh) | | Audio event detection method, device, and computer-readable storage medium |
| CN118737121B (zh) | | Accompanying audio generation method, related device, and medium |
| JPWO2021095211A5 | | |
| WO2022160749A1 (zh) | | Role separation method for a speech processing device, and speech processing device thereof |
| CN110970056A (zh) | | Method for separating sound sources from video |
| WO2021028236A1 (en) | | Systems and methods for sound conversion |
| CN118210383B (zh) | | Information input system |
| Chelali | | Bimodal fusion of visual and speech data for audiovisual speaker recognition in noisy environment |
| CN115658920A (zh) | | Method and device for constructing a knowledge graph |
| JP7207568B2 (ja) | | Output method, output program, and output device |
| Xu et al. | | Audio-visual wake word spotting system for MISP challenge 2021 |
| CN109905381A (zh) | | Self-service interview method, related device, and storage medium |
| WO2021095213A1 (ja) | | Learning method, learning program, and learning device |
| CN114974253B (zh) | | Natural language interpretation method and device based on character profiles, and storage medium |
| JPWO2021095212A5 | | |
| CN120472883A (zh) | | Conference speech recognition method, device, electronic equipment, and storage medium |
| KR102562387B1 (ko) | | Training method for an image feature extraction and synthesis system |
| CN115881101B (zh) | | Training method, device, and processing equipment for a speech recognition model |
| Wong et al. | | A new multi-purpose audio-visual UNMC-VIER database with multiple variabilities |
| CN115273803B (zh) | | Model training method and device, speech synthesis method, equipment, and storage medium |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523; Effective date: 2022-01-18 |
| | A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621; Effective date: 2022-01-18 |
| | TRDD | Decision of grant or rejection written | |
| | A01 | Written decision to grant a patent or to grant a registration (utility model) | Free format text: JAPANESE INTERMEDIATE CODE: A01; Effective date: 2022-12-06 |
| | A61 | First payment of annual fees (during grant procedure) | Free format text: JAPANESE INTERMEDIATE CODE: A61; Effective date: 2022-12-19 |
| | R150 | Certificate of patent or registration of utility model | Ref document number: 7207568; Country of ref document: JP; Free format text: JAPANESE INTERMEDIATE CODE: R150 |