CN116863509B - Method for detecting human-shaped outline and recognizing gesture by using improved polar mask - Google Patents
Info
- Publication number: CN116863509B
- Application number: CN202311119512.5A
- Authority: CN (China)
- Prior art keywords: human, humanoid, polar, model, mask
- Prior art date
- Legal status: Active (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Concepts
Name | Type | Sections | Count |
---|---|---|---|
method | Methods | title, claims, abstract, description | 64 |
training | Methods | claims, abstract, description | 37 |
segmentation | Effects | claims, abstract, description | 35 |
process | Effects | claims, abstract, description | 15 |
mechanism | Effects | claims, abstract, description | 14 |
design | Methods | claims, abstract, description | 10 |
transfer learning | Methods | claims, abstract, description | 7 |
fusion process | Methods | claims, abstract, description | 4 |
function | Effects | claims, description | 20 |
activation | Effects | claims, description | 18 |
compression | Effects | claims, description | 13 |
compression | Methods | claims, description | 13 |
excitation | Effects | claims, description | 13 |
fusion | Effects | claims, description | 11 |
detection | Methods | claims, description | 9 |
computer program | Methods | claims, description | 6 |
neuron | Anatomy | claims, description | 6 |
extraction | Methods | claims, description | 5 |
calculation | Methods | claims, description | 4 |
natural language processing | Methods | claims, description | 3 |
diagram | Methods | claims, description | 2 |
jumping | Effects | claims, description | 2 |
technology | Methods | description | 5 |
processing | Methods | description | 5 |
effect | Effects | description | 4 |
migration | Effects | description | 4 |
migration | Methods | description | 4 |
monitoring process | Methods | description | 4 |
behavior | Effects | description | 2 |
beneficial effect | Effects | description | 2 |
benefit | Effects | description | 2 |
algorithm | Methods | description | 2 |
development | Methods | description | 2 |
developmental process | Effects | description | 2 |
improvement | Effects | description | 2 |
analysis | Methods | description | 1 |
artificial intelligence | Methods | description | 1 |
augmentative effect | Effects | description | 1 |
chemical reaction | Methods | description | 1 |
deep learning model | Methods | description | 1 |
health | Effects | description | 1 |
image segmentation | Methods | description | 1 |
imaging method | Methods | description | 1 |
interaction | Effects | description | 1 |
labelling | Methods | description | 1 |
modification | Methods | description | 1 |
modification | Effects | description | 1 |
testing method | Methods | description | 1 |
visual effect | Effects | description | 1 |
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/096—Transfer learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/26—Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/44—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/80—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
- G06V10/806—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/20—Movements or behaviour, e.g. gesture recognition
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02T—CLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
- Y02T10/00—Road transport of goods or passengers
- Y02T10/10—Internal combustion engine [ICE] based vehicles
- Y02T10/40—Engine management systems
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Evolutionary Computation (AREA)
- Health & Medical Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Health & Medical Sciences (AREA)
- Computing Systems (AREA)
- Software Systems (AREA)
- Artificial Intelligence (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Human Computer Interaction (AREA)
- Medical Informatics (AREA)
- Mathematical Physics (AREA)
- Life Sciences & Earth Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Social Psychology (AREA)
- Psychiatry (AREA)
- Image Analysis (AREA)
Abstract
Description
Claims (5)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311119512.5A CN116863509B (en) | 2023-09-01 | 2023-09-01 | Method for detecting human-shaped outline and recognizing gesture by using improved polar mask |
Publications (2)
Publication Number | Publication Date |
---|---|
CN116863509A (en) | 2023-10-10 |
CN116863509B (en) | 2024-02-20 |
Family
ID=88219371
Family Applications (1)
Application Number | Status | Publication | Priority Date | Filing Date | Title |
---|---|---|---|---|---|
CN202311119512.5A | Active | CN116863509B (en) | 2023-09-01 | 2023-09-01 | Method for detecting human-shaped outline and recognizing gesture by using improved polar mask |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116863509B (en) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113223027A (en) * | 2021-04-13 | 2021-08-06 | 山东师范大学 | Immature persimmon segmentation method and system based on PolarMask |
CN116188785A (en) * | 2023-05-04 | 2023-05-30 | 福建环宇通信息科技股份公司 | Polar mask old man contour segmentation method using weak labels |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20220309275A1 (en) * | 2021-03-29 | 2022-09-29 | Hewlett-Packard Development Company, L.P. | Extraction of segmentation masks for documents within captured image |
Non-Patent Citations (1)
Title |
---|
Cell image instance segmentation based on PolarMask using weak labels; Binbin Tong et al.; Computer Methods and Programs in Biomedicine; Vol. 231; pp. 1-10 * |
Also Published As
Publication number | Publication date |
---|---|
CN116863509A (en) | 2023-10-10 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Dvornik et al. | On the importance of visual context for data augmentation in scene understanding | |
Wu et al. | Object detection based on RGC mask R‐CNN | |
CN109977918B (en) | Target detection positioning optimization method based on unsupervised domain adaptation | |
Yin et al. | FD-SSD: An improved SSD object detection algorithm based on feature fusion and dilated convolution | |
US9202144B2 (en) | Regionlets with shift invariant neural patterns for object detection | |
CN110738207A (en) | character detection method for fusing character area edge information in character image | |
CN110991444B (en) | License plate recognition method and device for complex scene | |
CN112651438A (en) | Multi-class image classification method and device, terminal equipment and storage medium | |
CN109478239A (en) | The method and object detection systems of object in detection image | |
CN110782420A (en) | Small target feature representation enhancement method based on deep learning | |
CN109508675B (en) | Pedestrian detection method for complex scene | |
CN113177560A (en) | Universal lightweight deep learning vehicle detection method | |
CN112036260B (en) | Expression recognition method and system for multi-scale sub-block aggregation in natural environment | |
CN114758288A (en) | Power distribution network engineering safety control detection method and device | |
CN111414916A (en) | Method and device for extracting and generating text content in image and readable storage medium | |
CN112528845A (en) | Physical circuit diagram identification method based on deep learning and application thereof | |
CN106874913A (en) | A kind of vegetable detection method | |
CN112906520A (en) | Gesture coding-based action recognition method and device | |
CN117079095A (en) | Deep learning-based high-altitude parabolic detection method, system, medium and equipment | |
CN114764941A (en) | Expression recognition method and device and electronic equipment | |
CN116188785A (en) | Polar mask old man contour segmentation method using weak labels | |
CN116863509B (en) | Method for detecting human-shaped outline and recognizing gesture by using improved polar mask | |
CN114511877A (en) | Behavior recognition method and device, storage medium and terminal | |
CN113610015A (en) | Attitude estimation method, device and medium based on end-to-end rapid ladder network | |
CN113158870A (en) | Countermeasure type training method, system and medium for 2D multi-person attitude estimation network |
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
TA01 | Transfer of patent application right | Effective date of registration: 2023-12-12. Address after: 362000 North China Road, Dongcheng, Fengze District, Quanzhou City, Fujian Province, 269. Applicant after: HUAQIAO University; FUJIAN HUANYUTONG TECHNOLOGY CO.,LTD. Address before: 362000, 7th Floor, Office Building, Haixi Electronic Information Industry Development Base, Keji Road, High tech Industrial Park (formerly Xunmei Industrial Zone), Fengze District, Quanzhou City, Fujian Province. Applicant before: FUJIAN HUANYUTONG TECHNOLOGY CO.,LTD. |
GR01 | Patent grant | |
CB03 | Change of inventor or designer information | Inventor after: Wen Tingxi, Tong Binbin, Hou Qingfei, Chen Yuping, Xie Jianhua, Zeng Huanqiang. Inventor before: Wen Tingxi, Tong Binbin, Hou Qingfei, Chen Yuping, Xie Jianhua, Zeng Huanqiang. |