JP7207568B2 - 出力方法、出力プログラム、および出力装置 - Google Patents

出力方法、出力プログラム、および出力装置 Download PDF

Info

Publication number
JP7207568B2
JP7207568B2 JP2021555729A JP2021555729A JP7207568B2 JP 7207568 B2 JP7207568 B2 JP 7207568B2 JP 2021555729 A JP2021555729 A JP 2021555729A JP 2021555729 A JP2021555729 A JP 2021555729A JP 7207568 B2 JP7207568 B2 JP 7207568B2
Authority
JP
Japan
Prior art keywords
vector
modal
output device
modal information
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2021555729A
Other languages
English (en)
Japanese (ja)
Other versions
JPWO2021095212A5 (https=
JPWO2021095212A1 (https=
Inventor
萌 山田
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujitsu Ltd filed Critical Fujitsu Ltd
Publication of JPWO2021095212A1 publication Critical patent/JPWO2021095212A1/ja
Publication of JPWO2021095212A5 publication Critical patent/JPWO2021095212A5/ja
Application granted granted Critical
Publication of JP7207568B2 publication Critical patent/JP7207568B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • G06F18/254Fusion techniques of classification results, e.g. of results related to same input data
    • G06F18/256Fusion techniques of classification results, e.g. of results related to same input data of results relating to different input data, e.g. multimodal recognition
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0499Feedforward networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/09Supervised learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/80Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
    • G06V10/806Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/048Activation functions

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Software Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Medical Informatics (AREA)
  • Mathematical Physics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Computational Linguistics (AREA)
  • Databases & Information Systems (AREA)
  • Biomedical Technology (AREA)
  • Multimedia (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Image Analysis (AREA)
JP2021555729A 2019-11-14 2019-11-14 出力方法、出力プログラム、および出力装置 Active JP7207568B2 (ja)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2019/044770 WO2021095212A1 (ja) 2019-11-14 2019-11-14 出力方法、出力プログラム、および出力装置

Publications (3)

Publication Number Publication Date
JPWO2021095212A1 JPWO2021095212A1 (https=) 2021-05-20
JPWO2021095212A5 JPWO2021095212A5 (https=) 2022-04-06
JP7207568B2 true JP7207568B2 (ja) 2023-01-18

Family

ID=75911528

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2021555729A Active JP7207568B2 (ja) 2019-11-14 2019-11-14 出力方法、出力プログラム、および出力装置

Country Status (3)

Country Link
US (1) US20220237421A1 (https=)
JP (1) JP7207568B2 (https=)
WO (1) WO2021095212A1 (https=)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPWO2024184975A1 (https=) * 2023-03-03 2024-09-12

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2670861A1 (en) * 2006-11-28 2008-06-05 Calgary Scientific Inc. Texture-based multi-dimensional medical image registration
WO2016100816A1 (en) * 2014-12-19 2016-06-23 United Technologies Corporation Sensor data fusion for prognostics and health monitoring
US11210560B2 (en) * 2019-10-02 2021-12-28 Mitsubishi Electric Research Laboratories, Inc. Multi-modal dense correspondence imaging system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
LU, Jiasen, et al.,ViLBERT: Pretraining task-agnostic visiolinguistic representations for vision-and-language tasks,arXiv.org [online],2019年08月06日,pp.1-11,[検索日 2019.12.13], インターネット:<URL:https://arxiv.org/pdf/1908.02265v1.pdf>
NGUYEN, Duy-Kien, et al.,Improved fusion of visual and language representations by dense symmetric co-attention for visual qu,[online],2018年,pp.6087-6096,http://openaccess.thecvf.com/content_cvpr_2018/html/Nguyen_Improved_Fusion_of_CVPR_2018_paper.html,[検索日 2019.12.13], インターネット:<URL:http://openaccess.thecvf.com/content_cvpr_2018/html/Nguye

Also Published As

Publication number Publication date
WO2021095212A1 (ja) 2021-05-20
US20220237421A1 (en) 2022-07-28
JPWO2021095212A1 (https=) 2021-05-20

Similar Documents

Publication Publication Date Title
Tao et al. End-to-end audiovisual speech recognition system with multitask learning
US10621991B2 (en) Joint neural network for speaker recognition
US20220237263A1 (en) Method for outputting, computer-readable recording medium storing output program, and output device
US9342576B2 (en) Information processing device, information processing terminal, information processing method, and program
CN112100337B (zh) 交互对话中的情绪识别方法及装置
CN108510982B (zh) 音频事件检测方法、装置及计算机可读存储介质
CN118737121B (zh) 伴随音频生成方法、相关装置和介质
JPWO2021095211A5 (https=)
WO2022160749A1 (zh) 一种用于语音处理装置的角色分离方法及其语音处理装置
CN110970056A (zh) 一种从视频中分离音源的方法
WO2021028236A1 (en) Systems and methods for sound conversion
CN118210383B (zh) 一种信息输入系统
Chelali Bimodal fusion of visual and speech data for audiovisual speaker recognition in noisy environment
CN115658920A (zh) 一种知识图谱的构建方法及装置
JP7207568B2 (ja) 出力方法、出力プログラム、および出力装置
Xu et al. Audio-visual wake word spotting system for MISP challenge 2021
CN109905381A (zh) 自助面试方法、相关装置和存储介质
WO2021095213A1 (ja) 学習方法、学習プログラム、および学習装置
CN114974253B (zh) 一种基于人物画像的自然语言解释方法、装置及存储介质
JPWO2021095212A5 (https=)
CN120472883A (zh) 会议语音识别方法、装置、电子设备及存储介质
KR102562387B1 (ko) 이미지의 특징 추출 및 합성 시스템의 학습 방법
CN115881101B (zh) 一种语音识别模型的训练方法、装置以及处理设备
Wong et al. A new multi-purpose audio-visual UNMC-VIER database with multiple variabilities
CN115273803B (zh) 模型训练方法和装置、语音合成方法、设备和存储介质

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20220118

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20220118

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20221206

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20221219

R150 Certificate of patent or registration of utility model

Ref document number: 7207568

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150