JP2022546153A - Action recognition method, apparatus, computer device, and storage medium - Google Patents
Action recognition method, apparatus, computer device, and storage medium
- Publication number
- JP2022546153A
- Authority
- JP
- Japan
- Prior art keywords
- motion detection
- feature
- image
- target object
- motion
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 90
- 230000009471 action Effects 0.000 title claims description 47
- 230000033001 locomotion Effects 0.000 claims abstract description 270
- 238000001514 detection method Methods 0.000 claims abstract description 231
- 238000012545 processing Methods 0.000 claims abstract description 46
- 238000011176 pooling Methods 0.000 claims description 72
- 230000008569 process Effects 0.000 claims description 33
- 238000000605 extraction Methods 0.000 claims description 31
- 238000004590 computer program Methods 0.000 claims description 14
- 238000013507 mapping Methods 0.000 claims description 6
- 230000006399 behavior Effects 0.000 claims description 3
- 230000000875 corresponding effect Effects 0.000 description 68
- 230000004044 response Effects 0.000 description 12
- 238000013528 artificial neural network Methods 0.000 description 9
- 238000001994 activation Methods 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- 238000010586 diagram Methods 0.000 description 5
- 238000003384 imaging method Methods 0.000 description 5
- 239000013598 vector Substances 0.000 description 5
- 230000004913 activation Effects 0.000 description 4
- 238000004891 communication Methods 0.000 description 3
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 238000003062 neural network model Methods 0.000 description 3
- 238000013459 approach Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 238000013527 convolutional neural network Methods 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000012549 training Methods 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- 230000003213 activating effect Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000008921 facial expression Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/20—Movements or behaviour, e.g. gesture recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/20—Education
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/25—Determination of region of interest [ROI] or a volume of interest [VOI]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/48—Extraction of image or video features by mapping characteristic values of the pattern into a parameter space, e.g. Hough transformation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/41—Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/46—Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Business, Economics & Management (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Tourism & Hospitality (AREA)
- Evolutionary Computation (AREA)
- Human Computer Interaction (AREA)
- Social Psychology (AREA)
- Software Systems (AREA)
- Psychiatry (AREA)
- General Business, Economics & Management (AREA)
- Artificial Intelligence (AREA)
- Marketing (AREA)
- Primary Health Care (AREA)
- Strategic Management (AREA)
- Economics (AREA)
- Educational Technology (AREA)
- Human Resources & Organizations (AREA)
- Computing Systems (AREA)
- Databases & Information Systems (AREA)
- Medical Informatics (AREA)
- Educational Administration (AREA)
- Computational Linguistics (AREA)
- Image Analysis (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
- Hardware Redundancy (AREA)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
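CN202010755553.3 | 2020-07-31 | | |
CN202010755553.3A CN111881854A (zh) | 2020-07-31 | 2020-07-31 | Action recognition method, apparatus, computer device, and storage medium |
PCT/CN2021/087693 WO2022021948A1 (zh) | 2020-07-31 | 2021-04-16 | Action recognition method, apparatus, computer device, and storage medium |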
Publications (1)
Publication Number | Publication Date |
---|---|
JP2022546153A (ja) | 2022-11-04 |
Family
ID=73204793
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2021565729A (pending) JP2022546153A (ja) | 2020-07-31 | 2021-04-16 | Action recognition method, apparatus, computer device, and storage medium |
Country Status (5)
Country | Link |
---|---|
JP (1) | JP2022546153A (zh) |
KR (1) | KR20220122735A (zh) |
CN (1) | CN111881854A (zh) |
TW (1) | TWI776566B (zh) |
WO (1) | WO2022021948A1 (zh) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
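CN111881854A (zh) * | 2020-07-31 | 2020-11-03 | 上海商汤临港智能科技有限公司 | Action recognition method, apparatus, computer device, and storage medium |
CN113469056A (zh) * | 2021-07-02 | 2021-10-01 | 上海商汤智能科技有限公司 | Behavior recognition method, apparatus, electronic device, and computer-readable storage medium |
CN115841140B (zh) * | 2022-04-20 | 2023-08-11 | 北京爱芯科技有限公司 | Inverse max-pooling operation method, apparatus, electronic device, and storage medium |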
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
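WO2018105062A1 (ja) * | 2016-12-07 | 2018-06-14 | オリンパス株式会社 | Image processing device and image processing method |
CN108681695A (zh) * | 2018-04-26 | 2018-10-19 | 北京市商汤科技开发有限公司 | Video action recognition method and apparatus, electronic device, and storage medium |
WO2019220622A1 (ja) * | 2018-05-18 | 2019-11-21 | 日本電気株式会社 | Image processing device, system, method, and non-transitory computer-readable medium storing a program |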
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
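JP7482040B2 (ja) * | 2018-06-14 | 2024-05-13 | マジック リープ, インコーポレイテッド | Augmented reality deep gesture network |
CN108875674B (zh) * | 2018-06-29 | 2021-11-16 | 东南大学 | Driver behavior recognition method based on a multi-column fusion convolutional neural network |
CN109726803B (zh) * | 2019-01-10 | 2021-06-29 | 广州小狗机器人技术有限公司 | Pooling method, image processing method, and apparatus |
CN111435422B (zh) * | 2019-01-11 | 2024-03-08 | 商汤集团有限公司 | Action recognition method, control method and apparatus, electronic device, and storage medium |
CN109919008A (zh) * | 2019-01-23 | 2019-06-21 | 平安科技(深圳)有限公司 | Moving target detection method, apparatus, computer device, and storage medium |
CN110879993B (zh) * | 2019-11-29 | 2023-03-14 | 北京市商汤科技开发有限公司 | Neural network training method, and method and apparatus for executing a face recognition task |
CN111310616B (zh) * | 2020-02-03 | 2023-11-28 | 北京市商汤科技开发有限公司 | Image processing method and apparatus, electronic device, and storage medium |
CN111401144B (zh) * | 2020-02-26 | 2023-04-07 | 华南理工大学 | Escalator passenger behavior recognition method based on video surveillance |
CN111160491B (zh) * | 2020-04-03 | 2020-09-01 | 北京精诊医疗科技有限公司 | Pooling method and pooling model in a convolutional neural network |
CN111881854A (zh) * | 2020-07-31 | 2020-11-03 | 上海商汤临港智能科技有限公司 | Action recognition method, apparatus, computer device, and storage medium |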
- 2020
  - 2020-07-31: CN application CN202010755553.3A, published as CN111881854A (zh), status: withdrawn (not active)
- 2021
  - 2021-04-16: JP application JP2021565729A, published as JP2022546153A (ja), status: pending (active)
  - 2021-04-16: WO application PCT/CN2021/087693, published as WO2022021948A1 (zh), status: application filing (active)
  - 2021-04-16: KR application KR1020227026434A, published as KR20220122735A (ko), status: unknown
  - 2021-06-28: TW application TW110123621A, published as TWI776566B (zh), status: active
Also Published As
Publication number | Publication date |
---|---|
KR20220122735A (ko) | 2022-09-02 |
CN111881854A (zh) | 2020-11-03 |
TWI776566B (zh) | 2022-09-01 |
WO2022021948A1 (zh) | 2022-02-03 |
TW202207075A (zh) | 2022-02-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
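US11455807B2 (en) | | Training neural networks for vehicle re-identification |
CN111797893B (zh) | | Neural network training method, image classification system, and related device |
CN111523621B (zh) | | Image recognition method, apparatus, computer device, and storage medium |
JP2022546153A (ja) | | Action recognition method, apparatus, computer device, and storage medium |
CN110827129B (zh) | | Product recommendation method and apparatus |
CN110033023B (zh) | | Image data processing method and system based on picture book recognition |
KR102548732B1 (ko) | | Neural network training method and apparatus applying the same |
CN111783902A (zh) | | Data augmentation and service processing methods, apparatus, computer device, and storage medium |
CN111340105A (zh) | | Image classification model training method, image classification method, apparatus, and computing device |
CN112257808B (zh) | | Ensemble co-training method, apparatus, and terminal device for zero-shot classification |
CN112926462B (zh) | | Training method and apparatus, action recognition method and apparatus, and electronic device |
Faria et al. | | Towards the development of affective facial expression recognition for human-robot interaction |
CN112232506A (zh) | | Network model training method, image target recognition method, apparatus, and electronic device |
CN113837257A (zh) | | Target detection method and apparatus |
Abed et al. | | KeyFrame extraction based on face quality measurement and convolutional neural network for efficient face recognition in videos |
Ponce-López et al. | | Non-verbal communication analysis in victim–offender mediations |
CN117671800A (zh) | | Occlusion-oriented human pose estimation method, apparatus, and electronic device |
CN116955543A (zh) | | Coherence evaluation model training and coherence evaluation method, apparatus, and device |
CN114170439A (zh) | | Pose recognition method, apparatus, storage medium, and electronic device |
CN111881855A (zh) | | Image processing method, apparatus, computer device, and storage medium |
CN114912540A (zh) | | Transfer learning method, apparatus, device, and storage medium |
CN114281933A (zh) | | Text processing method, apparatus, computer device, and storage medium |
CN113822291A (zh) | | Image processing method, apparatus, device, and storage medium |
CN113822871A (zh) | | Target detection method, apparatus, storage medium, and device based on a dynamic detection head |
Yang et al. | | Video system for human attribute analysis using compact convolutional neural network |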
Legal Events
Date | Code | Title | Description |
---|---|---|---|
2021-11-04 | A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621 |
2023-01-17 | A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131 |
2023-08-08 | A02 | Decision of refusal | Free format text: JAPANESE INTERMEDIATE CODE: A02 |