SG11201909887RA - Methods and apparatuses for recognizing video and training, electronic device and medium - Google Patents
Methods and apparatuses for recognizing video and training, electronic device and mediumInfo
- Publication number
- SG11201909887RA SG11201909887RA SG11201909887RA SG11201909887RA SG 11201909887R A SG11201909887R A SG 11201909887RA SG 11201909887R A SG11201909887R A SG 11201909887RA SG 11201909887R A SG11201909887R A SG 11201909887RA
- Authority
- SG
- Singapore
- Prior art keywords
- video
- key frame
- frame
- features
- key
- Prior art date
Links
- 238000000034 method Methods 0.000 title abstract 3
- 230000004927 fusion Effects 0.000 abstract 5
- 238000001514 detection method Methods 0.000 abstract 3
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/41—Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/48—Matching video sequences
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
- G06F18/253—Fusion techniques of extracted features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/50—Image enhancement or restoration using two or more images, e.g. averaging or subtraction
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/46—Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/49—Segmenting video sequences, i.e. computational techniques such as parsing or cutting the sequence, low-level clustering or determining units such as shots or scenes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/222—Studio circuitry; Studio devices; Studio equipment
- H04N5/262—Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
- H04N5/265—Mixing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20212—Image combination
- G06T2207/20221—Image fusion; Image merging
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Data Mining & Analysis (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- General Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Software Systems (AREA)
- Signal Processing (AREA)
- Computing Systems (AREA)
- Image Analysis (AREA)
Abstract
METHODS AND APPARATUSES FOR RECOGNIZING VIDEO AND TRAINING, ELECTRONIC DEVICE AND MEDIUM 5 A method and an apparatus for recognizing and training a video, an electronic device and a storage medium include: extracting features of a first key frame in a video; performing fusion on the features of the first key frame and fusion features of a second key frame in the video to obtain fusion features of the first key frame, where a detection sequence of the second key frame in the video precedes that of the first key 10 frame; and performing detection on the first key frame according to the fusion features of the first key frame to obtain an object detection result of the first key frame. Through iterative multi-frame feature fusion, information contained in shared features of these key frames in the video can be enhanced, thereby improving frame recognition accuracy and video recognition efficiency. 15 [Figure 1]
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711329718.5A CN108229336B (en) | 2017-12-13 | 2017-12-13 | Video recognition and training method and apparatus, electronic device, program, and medium |
PCT/CN2018/110500 WO2019114405A1 (en) | 2017-12-13 | 2018-10-16 | Video recognition and training method and apparatus, electronic device and medium |
Publications (1)
Publication Number | Publication Date |
---|---|
SG11201909887RA true SG11201909887RA (en) | 2019-11-28 |
Family
ID=62652263
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
SG11201909887R SG11201909887RA (en) | 2017-12-13 | 2018-10-16 | Methods and apparatuses for recognizing video and training, electronic device and medium |
Country Status (6)
Country | Link |
---|---|
US (1) | US10909380B2 (en) |
JP (1) | JP6837158B2 (en) |
KR (1) | KR102365521B1 (en) |
CN (2) | CN108229336B (en) |
SG (1) | SG11201909887RA (en) |
WO (1) | WO2019114405A1 (en) |
Families Citing this family (45)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108229336B (en) * | 2017-12-13 | 2021-06-04 | 北京市商汤科技开发有限公司 | Video recognition and training method and apparatus, electronic device, program, and medium |
CN108810620B (en) * | 2018-07-18 | 2021-08-17 | 腾讯科技(深圳)有限公司 | Method, device, equipment and storage medium for identifying key time points in video |
CN109344703B (en) * | 2018-08-24 | 2021-06-25 | 深圳市商汤科技有限公司 | Object detection method and device, electronic equipment and storage medium |
CN109389086B (en) * | 2018-10-09 | 2021-03-05 | 北京科技大学 | Method and system for detecting unmanned aerial vehicle image target |
CN111353597B (en) * | 2018-12-24 | 2023-12-05 | 杭州海康威视数字技术股份有限公司 | Target detection neural network training method and device |
CN111383245B (en) * | 2018-12-29 | 2023-09-22 | 北京地平线机器人技术研发有限公司 | Video detection method, video detection device and electronic equipment |
CN109886951A (en) * | 2019-02-22 | 2019-06-14 | 北京旷视科技有限公司 | Method for processing video frequency, device and electronic equipment |
CN111754544B (en) * | 2019-03-29 | 2023-09-05 | 杭州海康威视数字技术股份有限公司 | Video frame fusion method and device and electronic equipment |
CN109977912B (en) * | 2019-04-08 | 2021-04-16 | 北京环境特性研究所 | Video human body key point detection method and device, computer equipment and storage medium |
CN110060264B (en) * | 2019-04-30 | 2021-03-23 | 北京市商汤科技开发有限公司 | Neural network training method, video frame processing method, device and system |
CN110427800B (en) * | 2019-06-17 | 2024-09-10 | 平安科技(深圳)有限公司 | Video object acceleration detection method, device, server and storage medium |
CN110149482B (en) * | 2019-06-28 | 2021-02-02 | Oppo广东移动通信有限公司 | Focusing method, focusing device, electronic equipment and computer readable storage medium |
CN112199978B (en) * | 2019-07-08 | 2024-07-26 | 北京地平线机器人技术研发有限公司 | Video object detection method and device, storage medium and electronic equipment |
CN110503076B (en) * | 2019-08-29 | 2023-06-30 | 腾讯科技(深圳)有限公司 | Video classification method, device, equipment and medium based on artificial intelligence |
CN110751022B (en) * | 2019-09-03 | 2023-08-22 | 平安科技(深圳)有限公司 | Urban pet activity track monitoring method based on image recognition and related equipment |
CN110738108A (en) * | 2019-09-09 | 2020-01-31 | 北京地平线信息技术有限公司 | Target object detection method, target object detection device, storage medium and electronic equipment |
CN110807379B (en) * | 2019-10-21 | 2024-08-27 | 腾讯科技(深圳)有限公司 | Semantic recognition method, semantic recognition device and computer storage medium |
CN110751646A (en) * | 2019-10-28 | 2020-02-04 | 支付宝(杭州)信息技术有限公司 | Method and device for identifying damage by using multiple image frames in vehicle video |
CN110933429B (en) * | 2019-11-13 | 2021-11-12 | 南京邮电大学 | Video compression sensing and reconstruction method and device based on deep neural network |
CN110909655A (en) * | 2019-11-18 | 2020-03-24 | 上海眼控科技股份有限公司 | Method and equipment for identifying video event |
CN110841287B (en) * | 2019-11-22 | 2023-09-26 | 腾讯科技(深圳)有限公司 | Video processing method, apparatus, computer readable storage medium and computer device |
CN112862828B (en) * | 2019-11-26 | 2022-11-18 | 华为技术有限公司 | Semantic segmentation method, model training method and device |
CN111062395B (en) * | 2019-11-27 | 2020-12-18 | 北京理工大学 | Real-time video semantic segmentation method |
CN111629262B (en) * | 2020-05-08 | 2022-04-12 | Oppo广东移动通信有限公司 | Video image processing method and device, electronic equipment and storage medium |
CN111582185B (en) * | 2020-05-11 | 2023-06-30 | 北京百度网讯科技有限公司 | Method and device for recognizing images |
CN111652081B (en) * | 2020-05-13 | 2022-08-05 | 电子科技大学 | Video semantic segmentation method based on optical flow feature fusion |
CN111881726B (en) * | 2020-06-15 | 2022-11-25 | 马上消费金融股份有限公司 | Living body detection method and device and storage medium |
CN111783784A (en) * | 2020-06-30 | 2020-10-16 | 创新奇智(合肥)科技有限公司 | Method and device for detecting building cavity, electronic equipment and storage medium |
CN111860400B (en) * | 2020-07-28 | 2024-06-07 | 平安科技(深圳)有限公司 | Face enhancement recognition method, device, equipment and storage medium |
CN112036446B (en) * | 2020-08-06 | 2023-12-12 | 汇纳科技股份有限公司 | Method, system, medium and device for fusing target identification features |
CN112085097A (en) * | 2020-09-09 | 2020-12-15 | 北京市商汤科技开发有限公司 | Image processing method and device, electronic equipment and storage medium |
CN112115299B (en) * | 2020-09-17 | 2024-08-13 | 北京百度网讯科技有限公司 | Video searching method, video searching device, video recommending method, electronic equipment and storage medium |
CN112241470B (en) * | 2020-09-24 | 2024-02-02 | 北京影谱科技股份有限公司 | Video classification method and system |
CN112435653B (en) * | 2020-10-14 | 2024-07-30 | 北京地平线机器人技术研发有限公司 | Voice recognition method and device and electronic equipment |
CN112418104B (en) * | 2020-11-24 | 2024-08-02 | 深圳云天励飞技术股份有限公司 | Pedestrian tracking method and related equipment |
CN112528786B (en) * | 2020-11-30 | 2023-10-31 | 北京百度网讯科技有限公司 | Vehicle tracking method and device and electronic equipment |
CN112766215B (en) * | 2021-01-29 | 2024-08-09 | 北京字跳网络技术有限公司 | Face image processing method and device, electronic equipment and storage medium |
CN112561912B (en) * | 2021-02-20 | 2021-06-01 | 四川大学 | Medical image lymph node detection method based on priori knowledge |
CN113011371A (en) * | 2021-03-31 | 2021-06-22 | 北京市商汤科技开发有限公司 | Target detection method, device, equipment and storage medium |
US20220383509A1 (en) * | 2021-05-21 | 2022-12-01 | Honda Motor Co., Ltd. | System and method for learning temporally consistent video synthesis using fake optical flow |
CN113674189B (en) * | 2021-08-17 | 2024-09-20 | Oppo广东移动通信有限公司 | Image processing method, apparatus, electronic device, and computer-readable storage medium |
CN113963287A (en) * | 2021-09-15 | 2022-01-21 | 北京百度网讯科技有限公司 | Scoring model obtaining and video identifying method, device and storage medium |
CN114120166B (en) * | 2021-10-14 | 2023-09-22 | 北京百度网讯科技有限公司 | Video question-answering method and device, electronic equipment and storage medium |
CN114528923B (en) * | 2022-01-25 | 2023-09-26 | 山东浪潮科学研究院有限公司 | Video target detection method, device, equipment and medium based on time domain context |
CN115115822B (en) * | 2022-06-30 | 2023-10-31 | 小米汽车科技有限公司 | Vehicle-end image processing method and device, vehicle, storage medium and chip |
Family Cites Families (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH07181024A (en) * | 1993-12-24 | 1995-07-18 | Canon Inc | Method and apparatus for measuring three-dimensional profile |
JP4181473B2 (en) * | 2003-10-15 | 2008-11-12 | 日本放送協会 | Video object trajectory synthesis apparatus, method and program thereof |
US8021160B2 (en) * | 2006-07-22 | 2011-09-20 | Industrial Technology Research Institute | Learning assessment method and device using a virtual tutor |
US8135221B2 (en) | 2009-10-07 | 2012-03-13 | Eastman Kodak Company | Video concept classification using audio-visual atoms |
CN101673404B (en) * | 2009-10-19 | 2015-03-04 | 北京中星微电子有限公司 | Target detection method and device |
CN102014295B (en) * | 2010-11-19 | 2012-11-28 | 嘉兴学院 | Network sensitive video detection method |
CN102682302B (en) * | 2012-03-12 | 2014-03-26 | 浙江工业大学 | Human body posture identification method based on multi-characteristic fusion of key frame |
US8989503B2 (en) * | 2012-08-03 | 2015-03-24 | Kodak Alaris Inc. | Identifying scene boundaries using group sparsity analysis |
US9129399B2 (en) * | 2013-03-11 | 2015-09-08 | Adobe Systems Incorporated | Optical flow with nearest neighbor field fusion |
US9892745B2 (en) * | 2013-08-23 | 2018-02-13 | At&T Intellectual Property I, L.P. | Augmented multi-tier classifier for multi-modal voice activity detection |
WO2015038749A1 (en) * | 2013-09-13 | 2015-03-19 | Arris Enterprises, Inc. | Content based video content segmentation |
US10262426B2 (en) * | 2014-10-31 | 2019-04-16 | Fyusion, Inc. | System and method for infinite smoothing of image sequences |
KR20160099289A (en) * | 2015-02-12 | 2016-08-22 | 대전대학교 산학협력단 | Method and system for video search using convergence of global feature and region feature of image |
CN105005772B (en) * | 2015-07-20 | 2018-06-12 | 北京大学 | A kind of video scene detection method |
KR102444712B1 (en) * | 2016-01-12 | 2022-09-20 | 한국전자통신연구원 | System for automatically re-creating a personal media with Multi-modality feature and method thereof |
US9805255B2 (en) * | 2016-01-29 | 2017-10-31 | Conduent Business Services, Llc | Temporal fusion of multimodal data from multiple data acquisition systems to automatically recognize and classify an action |
US20170277955A1 (en) * | 2016-03-23 | 2017-09-28 | Le Holdings (Beijing) Co., Ltd. | Video identification method and system |
BR102016007265B1 (en) * | 2016-04-01 | 2022-11-16 | Samsung Eletrônica da Amazônia Ltda. | MULTIMODAL AND REAL-TIME METHOD FOR FILTERING SENSITIVE CONTENT |
JP6609505B2 (en) * | 2016-04-06 | 2019-11-20 | Kddi株式会社 | Image composition apparatus and program |
CN106599907B (en) * | 2016-11-29 | 2019-11-29 | 北京航空航天大学 | The dynamic scene classification method and device of multiple features fusion |
CN107392917B (en) * | 2017-06-09 | 2021-09-28 | 深圳大学 | Video significance detection method and system based on space-time constraint |
CN107463881A (en) * | 2017-07-07 | 2017-12-12 | 中山大学 | A kind of character image searching method based on depth enhancing study |
CN107463949B (en) * | 2017-07-14 | 2020-02-21 | 北京协同创新研究院 | Video action classification processing method and device |
CN108229336B (en) * | 2017-12-13 | 2021-06-04 | 北京市商汤科技开发有限公司 | Video recognition and training method and apparatus, electronic device, program, and medium |
-
2017
- 2017-12-13 CN CN201711329718.5A patent/CN108229336B/en active Active
-
2018
- 2018-10-16 WO PCT/CN2018/110500 patent/WO2019114405A1/en active Application Filing
- 2018-10-16 SG SG11201909887R patent/SG11201909887RA/en unknown
- 2018-10-16 JP JP2019553919A patent/JP6837158B2/en active Active
- 2018-10-16 CN CN201880018915.1A patent/CN110546645B/en active Active
- 2018-10-16 KR KR1020197029255A patent/KR102365521B1/en active IP Right Grant
-
2019
- 2019-05-14 US US16/411,342 patent/US10909380B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
WO2019114405A1 (en) | 2019-06-20 |
CN110546645B (en) | 2023-09-19 |
CN108229336A (en) | 2018-06-29 |
US20190266409A1 (en) | 2019-08-29 |
KR102365521B1 (en) | 2022-02-21 |
CN110546645A (en) | 2019-12-06 |
JP2020512647A (en) | 2020-04-23 |
JP6837158B2 (en) | 2021-03-03 |
US10909380B2 (en) | 2021-02-02 |
CN108229336B (en) | 2021-06-04 |
KR20190126366A (en) | 2019-11-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
SG11201909887RA (en) | Methods and apparatuses for recognizing video and training, electronic device and medium | |
PH12019501009A1 (en) | Face liveness detection method and apparatus, and electronic device | |
SG11201901766YA (en) | Electronic device, method and system of identity verification and computer readable storage medium | |
SG11201900263SA (en) | Method, device and server for recognizing characters of claim document, and storage medium | |
PH12018501058A1 (en) | Order clustering and malicious information combating method and apparatus | |
SG11201909139TA (en) | Methods and apparatuses for recognizing dynamic gesture, and control methods and apparatuses using gesture interaction | |
SG11201913865PA (en) | Method and apparatus for recognizing sequence in image, electronic device, and storage medium | |
WO2019133928A8 (en) | Hierarchical, parallel models for extracting in real time high-value information from data streams and system and method for creation of same | |
SG11202105174XA (en) | Text sequence recognition method and apparatus, electronic device, and storage medium | |
SG11201809816YA (en) | Vehicle identification method and apparatus | |
SG11202002078UA (en) | Method and apparatus for training semantic segmentation model, computer device, and storage medium | |
MX2019004994A (en) | Method and apparatus for verifying documents and identity. | |
MX2016003724A (en) | Picture scene determining method and apparatus, and server. | |
SG11201809210VA (en) | Face image data collection method, apparatus, terminal device and storage medium | |
EP4116940A3 (en) | Method and apparatus for processing image, electronic device and storage medium | |
MY182985A (en) | Keyframe scheduling method and apparatus, electronic device, program and medium | |
MX2017011632A (en) | System for distributing metadata embedded in video. | |
MX2016003774A (en) | Fingerprint recognition method and device. | |
HK1175358A2 (en) | Apparatus and method for recognizing content using audio signal | |
EP3364337A3 (en) | Persistent feature descriptors for video | |
SG11202103969XA (en) | Attribute value recovery method and device, storage medium, and electronic device | |
SG11201909071UA (en) | Image processing methods and apparatuses, computer readable storage media and eletronic devices | |
EP3822842A3 (en) | Method and apparatus for generating semantic representation model, electronic device, and storage medium | |
MX2021009164A (en) | Pet food recommendation devices and methods. | |
HK1100586A1 (en) | Apparatus and method for handwriting recognition |