JP2022546153A - Motion recognition method, apparatus, computer device, and storage medium - Google Patents

Motion recognition method, apparatus, computer device, and storage medium

Info

Publication number
JP2022546153A
Authority
JP
Japan
Prior art keywords
motion detection
feature
image
target object
motion
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2021565729A
Other languages
English (en)
Japanese (ja)
Inventor
フェイ ワン
チェン チエン
Original Assignee
シャンハイ センスタイム リンガン インテリジェント テクノロジー カンパニー リミテッド
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2020-07-31
Filing date
2021-04-16
Publication date
2022-11-04
Application filed by シャンハイ センスタイム リンガン インテリジェント テクノロジー カンパニー リミテッド
Publication of JP2022546153A
Current legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/20 Movements or behaviour, e.g. gesture recognition
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00 Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10 Services
    • G06Q50/20 Education
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/20 Image preprocessing
    • G06V10/25 Determination of region of interest [ROI] or a volume of interest [VOI]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/40 Extraction of image or video features
    • G06V10/48 Extraction of image or video features by mapping characteristic values of the pattern into a parameter space, e.g. Hough transformation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82 Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/40 Scenes; Scene-specific elements in video content
    • G06V20/41 Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/40 Scenes; Scene-specific elements in video content
    • G06V20/46 Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Business, Economics & Management (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Tourism & Hospitality (AREA)
  • Evolutionary Computation (AREA)
  • Human Computer Interaction (AREA)
  • Social Psychology (AREA)
  • Software Systems (AREA)
  • Psychiatry (AREA)
  • General Business, Economics & Management (AREA)
  • Artificial Intelligence (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • Educational Technology (AREA)
  • Human Resources & Organizations (AREA)
  • Computing Systems (AREA)
  • Databases & Information Systems (AREA)
  • Medical Informatics (AREA)
  • Educational Administration (AREA)
  • Computational Linguistics (AREA)
  • Image Analysis (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)
  • Hardware Redundancy (AREA)
JP2021565729A 2020-07-31 2021-04-16 Motion recognition method, apparatus, computer device, and storage medium Pending JP2022546153A (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CN202010755553.3 2020-07-31
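CN202010755553.3A CN111881854A (zh) 2020-07-31 2020-07-31 Motion recognition method, apparatus, computer device, and storage medium
PCT/CN2021/087693 WO2022021948A1 (zh) 2020-07-31 2021-04-16 Motion recognition method, apparatus, computer device, and storage medium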

Publications (1)

Publication Number Publication Date
JP2022546153A (ja) 2022-11-04

Family

ID=73204793

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2021565729A Pending JP2022546153A (ja) Motion recognition method, apparatus, computer device, and storage medium

Country Status (5)

Country Link
JP (1) JP2022546153A (zh)
KR (1) KR20220122735A (zh)
CN (1) CN111881854A (zh)
TW (1) TWI776566B (zh)
WO (1) WO2022021948A1 (zh)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
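CN111881854A (zh) * 2020-07-31 2020-11-03 上海商汤临港智能科技有限公司 Motion recognition method, apparatus, computer device, and storage medium
CN113469056A (zh) * 2021-07-02 2021-10-01 上海商汤智能科技有限公司 Behavior recognition method and apparatus, electronic device, and computer-readable storage medium
CN115841140B (zh) * 2022-04-20 2023-08-11 北京爱芯科技有限公司 Max-unpooling operation method and apparatus, electronic device, and storage medium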

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
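WO2018105062A1 (ja) * 2016-12-07 2018-06-14 オリンパス株式会社 Image processing device and image processing method
CN108681695A (zh) * 2018-04-26 2018-10-19 北京市商汤科技开发有限公司 Video action recognition method and apparatus, electronic device, and storage medium
WO2019220622A1 (ja) * 2018-05-18 2019-11-21 日本電気株式会社 Image processing device, system, method, and non-transitory computer-readable medium storing a program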

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
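JP7482040B2 (ja) * 2018-06-14 2024-05-13 マジック リープ, インコーポレイテッド Augmented reality deep gesture network
CN108875674B (zh) * 2018-06-29 2021-11-16 东南大学 Driver behavior recognition method based on a multi-column fusion convolutional neural network
CN109726803B (zh) * 2019-01-10 2021-06-29 广州小狗机器人技术有限公司 Pooling method, image processing method, and apparatus
CN111435422B (zh) * 2019-01-11 2024-03-08 商汤集团有限公司 Action recognition method, control method and apparatus, electronic device, and storage medium
CN109919008A (zh) * 2019-01-23 2019-06-21 平安科技(深圳)有限公司 Moving object detection method, apparatus, computer device, and storage medium
CN110879993B (zh) * 2019-11-29 2023-03-14 北京市商汤科技开发有限公司 Neural network training method, and method and apparatus for performing a face recognition task
CN111310616B (zh) * 2020-02-03 2023-11-28 北京市商汤科技开发有限公司 Image processing method and apparatus, electronic device, and storage medium
CN111401144B (zh) * 2020-02-26 2023-04-07 华南理工大学 Escalator passenger behavior recognition method based on video surveillance
CN111160491B (zh) * 2020-04-03 2020-09-01 北京精诊医疗科技有限公司 Pooling method and pooling model in a convolutional neural network
CN111881854A (zh) * 2020-07-31 2020-11-03 上海商汤临港智能科技有限公司 Motion recognition method, apparatus, computer device, and storage medium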

Also Published As

Publication number Publication date
KR20220122735A (ko) 2022-09-02
CN111881854A (zh) 2020-11-03
TWI776566B (zh) 2022-09-01
WO2022021948A1 (zh) 2022-02-03
TW202207075A (zh) 2022-02-16

Similar Documents

Publication Publication Date Title
US11455807B2 (en) Training neural networks for vehicle re-identification
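CN111797893B (zh) Neural network training method, image classification system, and related device
CN111523621B (zh) Image recognition method and apparatus, computer device, and storage medium
JP2022546153A (ja) Motion recognition method, apparatus, computer device, and storage medium
CN110827129B (zh) Commodity recommendation method and apparatus
CN110033023B (zh) Image data processing method and system based on picture book recognition
KR102548732B1 (ko) Neural network learning method and apparatus applying the same
CN111783902A (zh) Data augmentation and service processing method and apparatus, computer device, and storage medium
CN111340105A (zh) Image classification model training method, image classification method, apparatus, and computing device
CN112257808B (zh) Ensemble co-training method and apparatus for zero-shot classification, and terminal device
CN112926462B (zh) Training method and apparatus, action recognition method and apparatus, and electronic device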
Faria et al. Towards the development of affective facial expression recognition for human-robot interaction
CN112232506A (zh) Network model training method, image object recognition method, apparatus, and electronic device
CN113837257A (zh) Object detection method and apparatus
Abed et al. KeyFrame extraction based on face quality measurement and convolutional neural network for efficient face recognition in videos
Ponce-López et al. Non-verbal communication analysis in victim–offender mediations
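CN117671800A (zh) Occlusion-oriented human pose estimation method and apparatus, and electronic device
CN116955543A (zh) Coherence evaluation model training and coherence evaluation method, apparatus, and device
CN114170439A (zh) Posture recognition method and apparatus, storage medium, and electronic device
CN111881855A (zh) Image processing method, apparatus, computer device, and storage medium
CN114912540A (zh) Transfer learning method, apparatus, device, and storage medium
CN114281933A (zh) Text processing method, apparatus, computer device, and storage medium
CN113822291A (zh) Image processing method, apparatus, device, and storage medium
CN113822871A (zh) Object detection method based on a dynamic detection head, apparatus, storage medium, and device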
Yang et al. Video system for human attribute analysis using compact convolutional neural network

Legal Events

Date Code Title Description
A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20211104

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20230117

A02 Decision of refusal

Free format text: JAPANESE INTERMEDIATE CODE: A02

Effective date: 20230808