CN110443161A - Monitoring method based on artificial intelligence under a kind of scene towards bank - Google Patents

Monitoring method based on artificial intelligence under a kind of scene towards bank Download PDF

Info

Publication number
CN110443161A
CN110443161A CN201910652598.5A CN201910652598A CN110443161A CN 110443161 A CN110443161 A CN 110443161A CN 201910652598 A CN201910652598 A CN 201910652598A CN 110443161 A CN110443161 A CN 110443161A
Authority
CN
China
Prior art keywords
voice
early warning
video
suspected target
target
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910652598.5A
Other languages
Chinese (zh)
Other versions
CN110443161B (en
Inventor
何金保
安鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ningbo University of Technology
Original Assignee
Ningbo University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ningbo University of Technology filed Critical Ningbo University of Technology
Priority to CN201910652598.5A priority Critical patent/CN110443161B/en
Publication of CN110443161A publication Critical patent/CN110443161A/en
Application granted granted Critical
Publication of CN110443161B publication Critical patent/CN110443161B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2413Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
    • G06F18/24147Distances to closest patterns, e.g. nearest neighbour classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Multimedia (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Acoustics & Sound (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Closed-Circuit Television Systems (AREA)
  • Alarm Systems (AREA)
  • Image Analysis (AREA)

Abstract

The present invention provides the monitoring methods based on artificial intelligence under a kind of scene towards bank, in order to improve early warning accuracy, by the way of video image and voice two-stage early warning, video image carries out early warning to suspected target first, then it is directed to suspected target, using speech Separation technology, further confirm that whether suspected target needs early warning.During speech Separation, the optimization object function weighted using space encoding is conducive to the neighbouring relations on seeker's object space, is weighted by space encoding, the reliability of voice flow separation can be improved when coding by the way of Gray code.The present invention realizes simply, meets the needs of practical application.

Description

Monitoring method based on artificial intelligence under a kind of scene towards bank
Technical field
The present invention relates to the monitoring methods based on artificial intelligence under a kind of scene towards bank.
Background technique
Bank be manage deposit, make loans, exchange, the business such as savings, undertake the financial institution of credit intermediary, be state key Safety precaution unit, with scale diversification, financial services equipment is numerous, it is complicated to enter and leave personnel, it is wide etc. to manage coverage Feature.But the miscellaneous criminal activity in recent years, within the scope of financial industry is commonplace.
The Activity recognition of people is always an important research direction of computer vision field, different in detection and identification video Chang Hangwei has become a challenging hot research problem at present.It is traditional that rely primarily on security work person artificial The random emergency event of monitor full time and suspicious event, need a large amount of manpower.Due to the template in video monitoring system Classifier has no idea to construct all people's body posture, so, only by video image detect potential threat have compared with It is big difficult.
Summary of the invention
The shortcomings that in view of the prior art, the present invention propose the monitoring side based on artificial intelligence under a kind of scene towards bank Method, this method are applied under bank's scene, are equipped with video monitoring camera and sound pick-up, according to video and voice signal into Row two-stage early warning, it is characterised in that:
Level-one video early warning: according to video image, suspected target is extracted, the specific steps are as follows:
1. detecting the human body in video based on the Background difference of gauss hybrid models, image background is removed;
2. extracting the target interbehavior feature of video using convolutional neural networks ALexNet for target, behavior is obtained Characteristic probability value;
3. two hidden-layer network area partial objectives for normal behaviour and abnormal behaviour are utilized, to suspected target early warning;
Second level phonetic warning: being directed to suspected target, extracts speaker's voice, the specific steps are as follows:
1. mixing voice is resolved into time frequency unit;
2. marking by pitch tracking and time frequency unit, the pitch contour and corresponding voice flow of input signal are obtained;
3. extracting frequency cepstral coefficient (GFCC) eigenmatrix of mixing voice;
4. dividing region according to video image, encoded using Gray code, the optimization aim of design space coding weighting Function, objective function form are as follows:
Wherein, L is voice flow quantity to be extracted, GrIt is gray encoding, g is class vector, and F is GFCC feature square Battle array, WkIt (g) is that kth ties up component in class vector g, F value range is Wk(g), NUMk(g) and VkIt (g) is that kth is tieed up in g respectively The element number and mean value of component GFCC eigenmatrix, C are the mean value of GFCC eigenmatrix, (*)TRepresenting matrix transposition;
5. the voice flow of cluster seeking majorized function maximum value combines;
6. according to voice flow, early warning.
In conclusion the present invention forecasts accuracy to improve alert under bank's scene, using video image and voice two The mode of grade early warning, during voice flow separation, the optimization object function weighted using space encoding uses lattice when coding The mode of thunder code is conducive to the neighbouring relations on seeker's object space, is weighted by space encoding, and voice flow separation can be improved Reliability.Moreover, the present invention does not need to be trained voice data collection acquisition priori knowledge during speech Separation, it is real It is now simple, high reliablity.
Detailed description of the invention
Fig. 1 is the flow chart of the embodiment of the present invention.
Specific embodiment
Illustrate embodiments of the present invention below by way of specific specific example, those skilled in the art can be by this specification Disclosed content is implemented easily.
The present invention proposes that the monitoring method based on artificial intelligence under a kind of scene towards bank, this method are applied in bank Under scene, video monitoring camera and sound pick-up are installed, two-stage early warning is carried out according to video and voice signal, the method is as follows:
Level-one video early warning: according to video image, extracting suspected target, using Ubuntu platform, by OpenCV and The library TensorFlow, the specific steps are as follows:
1. detecting the human body in video based on the Background difference of gauss hybrid models, image background is removed, is utilized The background subtraction of BackgroundSubtrctorMOG2 function realization video image;
2. extracting the target interbehavior feature of video using convolutional neural networks ALexNet for target, behavior is obtained Characteristic probability value.ALexNet includes that 5 convolutional layers and 3 full articulamentums, the output of the last one full articulamentum are sent to In softmax layers, behavioural characteristic probability value is the value of float type.Neural network convolution operation utilizes function conv_2d () It realizes.
3. two hidden-layer network area partial objectives for normal behaviour and abnormal behaviour are utilized, to suspected target early warning.It utilizes TensorFlow deep learning platform realizes two hidden-layer network, completes the realization of two hidden-layer network and training pattern.
Second level phonetic warning: being directed to suspected target, extracts speaker's voice, the specific steps are as follows:
1. mixing voice is resolved into time frequency unit.By 64 Gammatone Superimposed Filters at bandpass filter group, The centre frequency equidistantly distributed of each filter, the frequency coverage of entire filter group are 50Hz~5000Hz.Then, with 40ms is frame length, 20ms is that frame moves, and accordingly does time domain sub-frame processing to the filtering of each frequency channel.
2. marking by pitch tracking and time frequency unit, the pitch contour and corresponding voice flow of input signal are obtained.Base Sound tracking uses viterbi algorithm, and fundamental tone observation probability is calculated by the significance of every frame candidate fundamental frequency, fundamental tone transition probability Pitch variation rate by counting voice data set obtains, and probability is the observation probability of first frame in each voiced segments.Base Sound tracking carries out in each voiced segments, finds out an optimal fundamental tone sequence.It is marked, is obtained by pitch tracking and time frequency unit The pitch contour of input signal and voice flow while correspondence.Wherein, while voice flow is indicated with two-value mask, and 1 represents correspondence Time frequency unit is labeled, and 0 indicates unmarked.
3. extracting frequency cepstral coefficient (GFCC) eigenmatrix of mixing voice.Language while by two-value mask and correspondence Sound flows through filter feature unit, obtains by the unit of 1 label, the unit not being labeled is removed.For each frame, by acquisition It is converted by the unit of 1 label by discrete cosine transform operation, ultimately forms the GFCC eigenmatrix of voice signal.
4. dividing region according to video image, encoded using Gray code, the optimization aim of design space coding weighting Function, objective function form are as follows:
Wherein, L is voice flow quantity to be extracted, GrIt is gray encoding, g is class vector, and F is GFCC feature square Battle array, WkIt (g) is that kth ties up component in class vector g, F value range is Wk(g), NUMk(g) and VkIt (g) is that kth is tieed up in g respectively The element number and mean value of component GFCC eigenmatrix, C are the mean value of GFCC eigenmatrix, (*)TRepresenting matrix transposition.
Voice flow quantity L to be extracted is determined according to Gray code adjacency on geometry number, former during gray encoding Then upper each personage corresponds to an individual Gray code, and it is reasonable that this requires image-region to divide.
5. by the method for exhaustion, the voice flow combination of cluster seeking majorized function maximum value.System starts first to choose at random L unit in choosing while voice flow, is assigned in L classification, is then ranked up to the voice flow unit not being selected.
6. according to voice flow, early warning.
In conclusion the present invention forecasts accuracy to improve alert under bank's scene, using video image and voice two The mode of grade early warning, during voice flow separation, the optimization object function weighted using space encoding uses lattice when coding The mode of thunder code is conducive to the neighbouring relations on seeker's object space, is weighted by space encoding, and voice flow separation can be improved Reliability.Moreover, the present invention does not need to be trained voice data collection acquisition priori knowledge during speech Separation, it is real It is now simple, high reliablity.The present invention effectively overcomes various shortcoming in the prior art and has height application value.

Claims (1)

1. the monitoring method based on artificial intelligence under a kind of scene towards bank, this method is applied under bank's scene, installation There are video monitoring camera and sound pick-up, two-stage early warning carried out according to video and voice signal, it is characterised in that:
Level-one video early warning: according to video image, suspected target is extracted, the specific steps are as follows:
1. detecting the human body in video based on the Background difference of gauss hybrid models, image background is removed;
2. extracting the target interbehavior feature of video using convolutional neural networks ALexNet for target, behavioural characteristic is obtained Probability value;
3. two hidden-layer network area partial objectives for normal behaviour and abnormal behaviour are utilized, to suspected target early warning;
Second level phonetic warning: being directed to suspected target, extracts speaker's voice, the specific steps are as follows:
1. mixing voice is resolved into time frequency unit;
2. marking by pitch tracking and time frequency unit, the pitch contour and corresponding voice flow of input signal are obtained;
3. extracting the frequency cepstral coefficient eigenmatrix of mixing voice;
4. dividing region according to video image, encoded using Gray code, the optimization object function of design space coding weighting, Objective function form are as follows:
Wherein, L is voice flow quantity to be extracted, GrIt is gray encoding, g is class vector, and F is frequency cepstral coefficient feature Matrix, WkIt (g) is that kth ties up component in class vector g, F value range is Wk(g), NUMk(g) and VkIt (g) is that kth is tieed up in g respectively The element number and mean value of component frequencies cepstrum coefficient eigenmatrix, C are the mean value of frequency cepstral coefficient eigenmatrix, (*)T Representing matrix transposition;
5. the voice flow of cluster seeking majorized function maximum value combines;
6. according to voice flow, early warning.
CN201910652598.5A 2019-07-19 2019-07-19 Monitoring method based on artificial intelligence in banking scene Active CN110443161B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910652598.5A CN110443161B (en) 2019-07-19 2019-07-19 Monitoring method based on artificial intelligence in banking scene

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910652598.5A CN110443161B (en) 2019-07-19 2019-07-19 Monitoring method based on artificial intelligence in banking scene

Publications (2)

Publication Number Publication Date
CN110443161A true CN110443161A (en) 2019-11-12
CN110443161B CN110443161B (en) 2023-08-29

Family

ID=68429779

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910652598.5A Active CN110443161B (en) 2019-07-19 2019-07-19 Monitoring method based on artificial intelligence in banking scene

Country Status (1)

Country Link
CN (1) CN110443161B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116475905A (en) * 2023-05-05 2023-07-25 浙江闽立电动工具有限公司 Control system and method for angle grinder

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102902972A (en) * 2012-09-14 2013-01-30 成都国科海博计算机系统有限公司 Human behavior characteristic extraction method and system and abnormal behavior detection method and system
US8704668B1 (en) * 2005-04-20 2014-04-22 Trevor Darrell System for monitoring and alerting based on animal behavior in designated environments
CN105426820A (en) * 2015-11-03 2016-03-23 中原智慧城市设计研究院有限公司 Multi-person abnormal behavior detection method based on security monitoring video data
CN106295470A (en) * 2015-05-21 2017-01-04 北京文安智能技术股份有限公司 A kind of bank self-help service area early-warning monitoring method, Apparatus and system
CN106373430A (en) * 2016-08-26 2017-02-01 华南理工大学 Intersection pass early warning method based on computer vision
CN108052859A (en) * 2017-10-31 2018-05-18 深圳大学 A kind of anomaly detection method, system and device based on cluster Optical-flow Feature
US20180357414A1 (en) * 2017-06-07 2018-12-13 International Business Machines Corporation Cognitive learning to counter security threats for kinematic actions in robots
CN109447048A (en) * 2018-12-25 2019-03-08 苏州闪驰数控系统集成有限公司 A kind of artificial intelligence early warning system

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8704668B1 (en) * 2005-04-20 2014-04-22 Trevor Darrell System for monitoring and alerting based on animal behavior in designated environments
CN102902972A (en) * 2012-09-14 2013-01-30 成都国科海博计算机系统有限公司 Human behavior characteristic extraction method and system and abnormal behavior detection method and system
CN106295470A (en) * 2015-05-21 2017-01-04 北京文安智能技术股份有限公司 A kind of bank self-help service area early-warning monitoring method, Apparatus and system
CN105426820A (en) * 2015-11-03 2016-03-23 中原智慧城市设计研究院有限公司 Multi-person abnormal behavior detection method based on security monitoring video data
CN106373430A (en) * 2016-08-26 2017-02-01 华南理工大学 Intersection pass early warning method based on computer vision
US20180357414A1 (en) * 2017-06-07 2018-12-13 International Business Machines Corporation Cognitive learning to counter security threats for kinematic actions in robots
CN108052859A (en) * 2017-10-31 2018-05-18 深圳大学 A kind of anomaly detection method, system and device based on cluster Optical-flow Feature
CN109447048A (en) * 2018-12-25 2019-03-08 苏州闪驰数控系统集成有限公司 A kind of artificial intelligence early warning system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
AMIRA BEN MABROUK.ET.: "Abnormal behavior recognition for intelligent video surveillance systems: A review", vol. 91, pages 480 - 491, XP085229786, DOI: 10.1016/j.eswa.2017.09.029 *
冯亚闯: "视频中的异常事件检测算法研究", no. 4, pages 136 - 10 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116475905A (en) * 2023-05-05 2023-07-25 浙江闽立电动工具有限公司 Control system and method for angle grinder
CN116475905B (en) * 2023-05-05 2024-01-09 浙江闽立电动工具有限公司 Control system and method for angle grinder

Also Published As

Publication number Publication date
CN110443161B (en) 2023-08-29

Similar Documents

Publication Publication Date Title
CN108709633B (en) Distributed optical fiber vibration sensing intelligent safety monitoring method based on deep learning
CN105022835B (en) A kind of intelligent perception big data public safety recognition methods and system
CN106778595B (en) Method for detecting abnormal behaviors in crowd based on Gaussian mixture model
CN105095856B (en) Face identification method is blocked based on mask
CN110070530A (en) A kind of powerline ice-covering detection method based on deep neural network
CN111339883A (en) Method for identifying and detecting abnormal behaviors in transformer substation based on artificial intelligence in complex scene
CN107491749B (en) Method for detecting global and local abnormal behaviors in crowd scene
CN113707175B (en) Acoustic event detection system based on feature decomposition classifier and adaptive post-processing
CN105574489A (en) Layered stack based violent group behavior detection method
CN110728252A (en) Face detection method applied to regional personnel motion trail monitoring
CN117671887B (en) Intelligent security early warning management method and system based on big data
CN117079351B (en) Method and system for analyzing personnel behaviors in key areas
CN110443161A (en) Monitoring method based on artificial intelligence under a kind of scene towards bank
CN108256551A (en) A kind of vehicle checking method based on region convolutional neural networks
CN117079197A (en) Intelligent building site management method and system
CN115240142B (en) Outdoor key place crowd abnormal behavior early warning system and method based on cross media
CN117037264A (en) Prison personnel abnormal behavior identification method based on target and key point detection
CN115171006B (en) Detection method for automatically identifying person entering electric power dangerous area based on deep learning
CN114358667B (en) Scenic spot risk prediction model construction method based on RBF (radial basis function) network learning
CN115393802A (en) Railway scene unusual invasion target identification method based on small sample learning
Nyajowi et al. CNN real-time detection of vandalism using a hybrid-LSTM deep learning neural networks
CN112329743B (en) Abnormal body temperature monitoring method, device and medium in epidemic situation environment
CN116401290B (en) Personnel security inspection method based on metal carrying capacity data
CN113177513B (en) Method, device, equipment and storage medium for detecting wearing of safety helmet
CN113743406B (en) Deep learning-based personnel detection method for production safety

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant