EP4123591A3 - Method for detecting human-object interactions, neural network and associated training method, device, and medium - Google Patents

Method for detecting human-object interactions, neural network and associated training method, device, and medium

Info

Publication number
EP4123591A3
Authority
EP
European Patent Office
Prior art keywords
features
interaction
image
human
neural network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP22204584.1A
Other languages
English (en)
French (fr)
Other versions
EP4123591A2 (de)
Inventor
Desen Zhou
Jian Wang
Hao Sun
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd
Publication of EP4123591A2
Publication of EP4123591A3
Legal status: Withdrawn

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/25 Fusion techniques
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/50 Context or environment of the image
    • G06V20/52 Surveillance or monitoring of activities, e.g. for recognising suspicious objects
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G06N3/0455 Auto-encoder networks; Encoder-decoder networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/0464 Convolutional networks [CNN, ConvNet]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77 Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/80 Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
    • G06V10/806 Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82 Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/40 Scenes; Scene-specific elements in video content
    • G06V20/46 Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/20 Movements or behaviour, e.g. gesture recognition
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/09 Supervised learning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00 Indexing scheme relating to image or video recognition or understanding
    • G06V2201/07 Target detection

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Data Mining & Analysis (AREA)
  • Software Systems (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Mathematical Physics (AREA)
  • Psychiatry (AREA)
  • Human Computer Interaction (AREA)
  • Social Psychology (AREA)
  • Databases & Information Systems (AREA)
  • Medical Informatics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Image Analysis (AREA)
EP22204584.1A 2021-10-29 2022-10-28 Method for detecting human-object interactions, neural network and associated training method, device, and medium Withdrawn EP4123591A3 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111272717.8A CN114005177B (zh) 2021-10-29 2021-10-29 Human-object interaction detection method, neural network and training method therefor, device, and medium

Publications (2)

Publication Number Publication Date
EP4123591A2 (de) 2023-01-25
EP4123591A3 (de) 2023-04-12

Family

ID=79925291

Family Applications (1)

Application Number Title Priority Date Filing Date
EP22204584.1A Withdrawn EP4123591A3 (de) 2021-10-29 2022-10-28 Method for detecting human-object interactions, neural network and associated training method, device, and medium

Country Status (3)

Country Link
US (1) US20230047628A1 (de)
EP (1) EP4123591A3 (de)
CN (1) CN114005177B (de)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114663915B (zh) * 2022-03-04 2024-04-05 Xi'an Jiaotong University Image human-object interaction localization method and system based on a Transformer model
US11869212B1 (en) * 2023-02-07 2024-01-09 Deeping Source Inc. Method for training video object detection model using training dataset and learning device using the same

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190180090A1 (en) * 2017-12-07 2019-06-13 Futurewei Technologies, Inc. Activity detection by joint human and object detection and tracking

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
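CN110070107B (zh) * 2019-03-26 2020-12-25 Huawei Technologies Co Ltd Object recognition method and device
CN110580500B (zh) * 2019-08-20 2023-04-18 Tianjin University Few-shot image classification method with network weight generation for human-object interaction
CN111325141B (zh) * 2020-02-18 2024-03-26 Shanghai SenseTime Lingang Intelligent Technology Co Ltd Interaction relationship recognition method, apparatus, device, and storage medium
CN112004157B (zh) * 2020-08-11 2022-06-21 Hisense Electronic Technology (Wuhan) Co Ltd Multi-round voice interaction method and display device
CN112749758B (zh) * 2021-01-21 2023-08-11 Beijing Baidu Netcom Science and Technology Co Ltd Image processing method, neural network training method, apparatus, device, and medium
CN113112525B (zh) * 2021-04-27 2023-09-01 Beijing Baidu Netcom Science and Technology Co Ltd Target tracking method, network model and training method therefor, device, and medium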

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
K AKILA ET AL: "Discriminative human action recognition using HOI descriptor and key poses", INTERNATIONAL CONFERENCE ON SCIENCE ENGINEERING AND MANAGEMENT RESEARCH (ICSEMR 2014), 1 November 2014 (2014-11-01), pages 1 - 6, XP055617080, DOI: 10.1109/ICSEMR.2014.7043656 *
PREST A ET AL: "Weakly Supervised Learning of Interactions between Humans and Objects", IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, IEEE COMPUTER SOCIETY, USA, vol. 34, no. 3, 1 March 2012 (2012-03-01), pages 601 - 614, XP011490647, ISSN: 0162-8828, DOI: 10.1109/TPAMI.2011.158 *

Also Published As

Publication number Publication date
EP4123591A2 (de) 2023-01-25
US20230047628A1 (en) 2023-02-16
CN114005177B (zh) 2023-09-19
CN114005177A (zh) 2022-02-01

Similar Documents

Publication Publication Date Title
EP4123591A3 (de) Method for detecting human-object interactions, neural network and associated training method, device, and medium
CN109583342B (zh) Face liveness detection method based on transfer learning
CN109165552B (zh) Posture recognition method, system, and memory based on human body key points
EP3933686A3 (de) Video processing method, apparatus, electronic device, storage medium, and program product
EP4105895A3 (de) Method for detecting interactions between human and object, neural network and training method therefor, device, and medium
EP4075395A3 (de) Method and apparatus for training an anti-spoofing model, method and apparatus for performing anti-spoofing, and device
EP4123592A3 (de) Method for detecting human-object interactions, neural network and associated training method, device, and medium
EP3905123A3 (de) Method and apparatus for identifying a human body, electronic device, storage medium, and program product
Hartanto et al. Android based real-time static Indonesian sign language recognition system prototype
CN105957095B (zh) Spiking corner detection method based on grayscale images
CN103336947A (zh) Infrared moving small target recognition method based on saliency and structure
Sharma et al. Recognition of single handed sign language gestures using contour tracing descriptor
CN103593648B (zh) Face recognition method for open environments
Jabnoun et al. Object recognition for blind people based on features extraction
CN104301585A (zh) Real-time detection method for targets of a specific category in moving scenes
Balasuriya et al. Learning platform for visually impaired children through artificial intelligence and computer vision
Hasan et al. Hand sign language recognition for Bangla alphabet based on Freeman Chain Code and ANN
Pankajakshan et al. Sign language recognition system
Hachaj et al. Real-time recognition of selected karate techniques using GDL approach
Amaliya et al. Study on hand keypoint framework for sign language recognition
Mosayyebi et al. Gender recognition in masked facial images using EfficientNet and transfer learning approach
Ruiz-Santaquiteria et al. Improving handgun detection through a combination of visual features and body pose-based data
Agrawal et al. A Tutor for the hearing impaired (developed using Automatic Gesture Recognition)
Nakjai et al. Thai finger spelling localization and classification under complex background using a YOLO-based deep learning
Davydov et al. Real-time Ukrainian sign language recognition system

Legal Events

Date Code Title Description
PUAI Public reference made under Article 153(3) EPC to a published international application that has entered the European phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an EP patent application or granted EP patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20221028

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC ME MK MT NL NO PL PT RO RS SE SI SK SM TR

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC ME MK MT NL NO PL PT RO RS SE SI SK SM TR

RIC1 Information provided on IPC code assigned before grant

Ipc: G06V 40/20 20220101ALI20230308BHEP

Ipc: G06V 20/52 20220101ALI20230308BHEP

Ipc: G06V 10/82 20220101ALI20230308BHEP

Ipc: G06V 10/80 20220101ALI20230308BHEP

Ipc: G06V 10/62 20220101AFI20230308BHEP

STAA Information on the status of an EP patent application or granted EP patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20231013