EP4123591A3 - Verfahren zur erkennung von mensch-objekt-interaktionen, neuronales netzwerk und zugehöriges lernverfahren, vorrichtung, und medium - Google Patents
Verfahren zur erkennung von mensch-objekt-interaktionen, neuronales netzwerk und zugehöriges lernverfahren, vorrichtung, und medium Download PDFInfo
- Publication number
- EP4123591A3 EP4123591A3 EP22204584.1A EP22204584A EP4123591A3 EP 4123591 A3 EP4123591 A3 EP 4123591A3 EP 22204584 A EP22204584 A EP 22204584A EP 4123591 A3 EP4123591 A3 EP 4123591A3
- Authority
- EP
- European Patent Office
- Prior art keywords
- features
- interaction
- image
- human
- neural network
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 230000003993 interaction Effects 0.000 title abstract 9
- 238000001514 detection method Methods 0.000 title abstract 3
- 238000000034 method Methods 0.000 title abstract 3
- 238000013528 artificial neural network Methods 0.000 title abstract 2
- 238000012549 training Methods 0.000 title abstract 2
- 230000033001 locomotion Effects 0.000 abstract 5
- 238000005516 engineering process Methods 0.000 abstract 2
- 238000000605 extraction Methods 0.000 abstract 2
- 238000013473 artificial intelligence Methods 0.000 abstract 1
- 238000013135 deep learning Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/50—Context or environment of the image
- G06V20/52—Surveillance or monitoring of activities, e.g. for recognising suspicious objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
- G06N3/0455—Auto-encoder networks; Encoder-decoder networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/80—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
- G06V10/806—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/46—Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/20—Movements or behaviour, e.g. gesture recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/09—Supervised learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V2201/00—Indexing scheme relating to image or video recognition or understanding
- G06V2201/07—Target detection
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Evolutionary Computation (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Data Mining & Analysis (AREA)
- Software Systems (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Computational Linguistics (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Mathematical Physics (AREA)
- Psychiatry (AREA)
- Human Computer Interaction (AREA)
- Social Psychology (AREA)
- Databases & Information Systems (AREA)
- Medical Informatics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Evolutionary Biology (AREA)
- Image Analysis (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111272717.8A CN114005177B (zh) | 2021-10-29 | 2021-10-29 | 人物交互检测方法、神经网络及其训练方法、设备和介质 |
Publications (2)
Publication Number | Publication Date |
---|---|
EP4123591A2 EP4123591A2 (de) | 2023-01-25 |
EP4123591A3 true EP4123591A3 (de) | 2023-04-12 |
Family
ID=79925291
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP22204584.1A Withdrawn EP4123591A3 (de) | 2021-10-29 | 2022-10-28 | Verfahren zur erkennung von mensch-objekt-interaktionen, neuronales netzwerk und zugehöriges lernverfahren, vorrichtung, und medium |
Country Status (3)
Country | Link |
---|---|
US (1) | US20230047628A1 (de) |
EP (1) | EP4123591A3 (de) |
CN (1) | CN114005177B (de) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114663915B (zh) * | 2022-03-04 | 2024-04-05 | 西安交通大学 | 基于Transformer模型的图像人-物交互定位方法及系统 |
US11869212B1 (en) * | 2023-02-07 | 2024-01-09 | Deeping Source Inc. | Method for training video object detection model using training dataset and learning device using the same |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20190180090A1 (en) * | 2017-12-07 | 2019-06-13 | Futurewei Technologies, Inc. | Activity detection by joint human and object detection and tracking |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110070107B (zh) * | 2019-03-26 | 2020-12-25 | 华为技术有限公司 | 物体识别方法及装置 |
CN110580500B (zh) * | 2019-08-20 | 2023-04-18 | 天津大学 | 一种面向人物交互的网络权重生成少样本图像分类方法 |
CN111325141B (zh) * | 2020-02-18 | 2024-03-26 | 上海商汤临港智能科技有限公司 | 交互关系识别方法、装置、设备及存储介质 |
CN112004157B (zh) * | 2020-08-11 | 2022-06-21 | 海信电子科技(武汉)有限公司 | 一种多轮语音交互方法及显示设备 |
CN112749758B (zh) * | 2021-01-21 | 2023-08-11 | 北京百度网讯科技有限公司 | 图像处理方法、神经网络的训练方法、装置、设备和介质 |
CN113112525B (zh) * | 2021-04-27 | 2023-09-01 | 北京百度网讯科技有限公司 | 目标跟踪方法、网络模型及其训练方法、设备和介质 |
-
2021
- 2021-10-29 CN CN202111272717.8A patent/CN114005177B/zh active Active
-
2022
- 2022-10-28 US US17/976,668 patent/US20230047628A1/en active Pending
- 2022-10-28 EP EP22204584.1A patent/EP4123591A3/de not_active Withdrawn
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20190180090A1 (en) * | 2017-12-07 | 2019-06-13 | Futurewei Technologies, Inc. | Activity detection by joint human and object detection and tracking |
Non-Patent Citations (2)
Title |
---|
K AKILA ET AL: "Discriminative human action recognition using HOI descriptor and key poses", INTERNATIONAL CONFERENCE ON SCIENCE ENGINEERING AND MANAGEMENT RESEARCH (ICSEMR 2014), 1 November 2014 (2014-11-01), pages 1 - 6, XP055617080, DOI: 10.1109/ICSEMR.2014.7043656 * |
PREST A ET AL: "Weakly Supervised Learning of Interactions between Humans and Objects", IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, IEEE COMPUTER SOCIETY, USA, vol. 34, no. 3, 1 March 2012 (2012-03-01), pages 601 - 614, XP011490647, ISSN: 0162-8828, DOI: 10.1109/TPAMI.2011.158 * |
Also Published As
Publication number | Publication date |
---|---|
EP4123591A2 (de) | 2023-01-25 |
US20230047628A1 (en) | 2023-02-16 |
CN114005177B (zh) | 2023-09-19 |
CN114005177A (zh) | 2022-02-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP4123591A3 (de) | Verfahren zur erkennung von mensch-objekt-interaktionen, neuronales netzwerk und zugehöriges lernverfahren, vorrichtung, und medium | |
CN109583342B (zh) | 基于迁移学习的人脸活体检测方法 | |
CN109165552B (zh) | 一种基于人体关键点的姿态识别方法、系统及存储器 | |
EP3933686A3 (de) | Videoverarbeitungsverfahren, gerät, elektronische vorrichtung, speichermedium und programmprodukt | |
EP4105895A3 (de) | Verfahren zur erkennung von interaktionen zwischen mensch und objekt, neuronales netzwerk und trainingsverfahren dafür, vorrichtung und medium | |
EP4075395A3 (de) | Verfahren und vorrichtung zum trainieren eines anti-spoofing-modells, verfahren und vorrichtung zum durchführen von anti-spoofing und vorrichtung | |
EP4123592A3 (de) | Verfahren zur erkennung von mensch-objekt-interaktionen, neuronales netzwerk und zugehöriges lernverfahren, vorrichtung, und medium | |
EP3905123A3 (de) | Verfahren und vorrichtung zur identifizierung eines menschlichen körpers, elektronische vorrichtung, speichermedium und programmprodukt | |
Hartanto et al. | Android based real-time static Indonesian sign language recognition system prototype | |
CN105957095B (zh) | 一种基于灰度图像的Spiking角点检测方法 | |
CN103336947A (zh) | 基于显著性和结构性的红外运动小目标识别方法 | |
Sharma et al. | Recognition of single handed sign language gestures using contour tracing descriptor | |
CN103593648B (zh) | 一个面向开放环境的人脸识别方法 | |
Jabnoun et al. | Object recognition for blind people based on features extraction | |
CN104301585A (zh) | 一种运动场景中特定种类目标实时检测方法 | |
Balasuriya et al. | Learning platform for visually impaired children through artificial intelligence and computer vision | |
Hasan et al. | Hand sign language recognition for Bangla alphabet based on Freeman Chain Code and ANN | |
Pankajakshan et al. | Sign language recognition system | |
Hachaj et al. | Real-time recognition of selected karate techniques using GDL approach | |
Amaliya et al. | Study on hand keypoint framework for sign language recognition | |
Mosayyebi et al. | Gender recognition in masked facial images using EfficientNet and transfer learning approach | |
Ruiz-Santaquiteria et al. | Improving handgun detection through a combination of visual features and body pose-based data | |
Agrawal et al. | A Tutor for the hearing impaired (developed using Automatic Gesture Recognition) | |
Nakjai et al. | Thai finger spelling localization and classification under complex background using a YOLO-based deep learning | |
Davydov et al. | Real-time Ukrainian sign language recognition system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE |
|
17P | Request for examination filed |
Effective date: 20221028 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC ME MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
PUAL | Search report despatched |
Free format text: ORIGINAL CODE: 0009013 |
|
AK | Designated contracting states |
Kind code of ref document: A3 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC ME MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: G06V 40/20 20220101ALI20230308BHEP Ipc: G06V 20/52 20220101ALI20230308BHEP Ipc: G06V 10/82 20220101ALI20230308BHEP Ipc: G06V 10/80 20220101ALI20230308BHEP Ipc: G06V 10/62 20220101AFI20230308BHEP |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
18D | Application deemed to be withdrawn |
Effective date: 20231013 |