CN111784947A - Active early warning method, system and equipment based on image and voiceprint - Google Patents

Active early warning method, system and equipment based on image and voiceprint Download PDF

Info

Publication number
CN111784947A
CN111784947A CN202010661932.6A CN202010661932A CN111784947A CN 111784947 A CN111784947 A CN 111784947A CN 202010661932 A CN202010661932 A CN 202010661932A CN 111784947 A CN111784947 A CN 111784947A
Authority
CN
China
Prior art keywords
information
voiceprint
image
dangerous
early warning
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010661932.6A
Other languages
Chinese (zh)
Inventor
李旭滨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Maosheng Intelligent Technology Co ltd
Original Assignee
Shanghai Maosheng Intelligent Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Maosheng Intelligent Technology Co ltd filed Critical Shanghai Maosheng Intelligent Technology Co ltd
Priority to CN202010661932.6A priority Critical patent/CN111784947A/en
Publication of CN111784947A publication Critical patent/CN111784947A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G08SIGNALLING
    • G08BSIGNALLING OR CALLING SYSTEMS; ORDER TELEGRAPHS; ALARM SYSTEMS
    • G08B21/00Alarms responsive to a single specified undesired or abnormal condition and not otherwise provided for
    • G08B21/02Alarms for ensuring the safety of persons
    • G08B21/0202Child monitoring systems using a transmitter-receiver system carried by the parent and the child
    • G08B21/0205Specific application combined with child monitoring using a transmitter-receiver system
    • G08B21/0208Combination with audio or video communication, e.g. combination with "baby phone" function
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/20Movements or behaviour, e.g. gesture recognition
    • GPHYSICS
    • G08SIGNALLING
    • G08BSIGNALLING OR CALLING SYSTEMS; ORDER TELEGRAPHS; ALARM SYSTEMS
    • G08B21/00Alarms responsive to a single specified undesired or abnormal condition and not otherwise provided for
    • G08B21/02Alarms for ensuring the safety of persons
    • G08B21/0202Child monitoring systems using a transmitter-receiver system carried by the parent and the child
    • G08B21/0233System arrangements with pre-alarms, e.g. when a first distance is exceeded
    • GPHYSICS
    • G08SIGNALLING
    • G08BSIGNALLING OR CALLING SYSTEMS; ORDER TELEGRAPHS; ALARM SYSTEMS
    • G08B3/00Audible signalling systems; Audible personal calling systems
    • G08B3/10Audible signalling systems; Audible personal calling systems using electric transmission; using electromagnetic transmission
    • G08B3/1008Personal calling arrangements or devices, i.e. paging systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/02Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Child & Adolescent Psychology (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Business, Economics & Management (AREA)
  • Emergency Management (AREA)
  • Social Psychology (AREA)
  • Psychiatry (AREA)
  • Theoretical Computer Science (AREA)
  • Electromagnetism (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Acoustics & Sound (AREA)
  • Alarm Systems (AREA)

Abstract

The application relates to an active early warning method, system and device based on images and voiceprints, wherein the active early warning method comprises the steps of identifying image information and voiceprint information; sending an early warning call instruction under the condition that the image information comprises a preset dangerous behavior and/or the voiceprint information comprises a preset voiceprint characteristic; and forming early warning calling information based on the early warning calling instruction, wherein the early warning calling information comprises dangerous event information, dangerous event information occurrence time, dangerous event information occurrence place, dangerous personnel information and dangerous personnel real-time position. Through the method and the device, the problems that personnel escape monitoring shooting, the time delay is high, monitoring omission occurs are solved, and the technical effects of image and voiceprint dual automatic identification and automatic calling based on early warning calling information are achieved.

Description

Active early warning method, system and equipment based on image and voiceprint
Technical Field
The application relates to the technical field of monitoring and early warning, in particular to an active early warning method, system and device based on images and voiceprints.
Background
In order to improve the responsiveness of illegal criminal events occurring in public areas or people flow dense areas, multiple paths of image monitoring devices are usually arranged in the public areas and the people flow dense areas to acquire real-time image information of the public areas and the people flow dense areas, and real-time monitoring is performed through a large monitoring screen of a monitoring center, so that under the condition that the illegal criminal events occur, notifications are timely sent to security personnel near the places where the illegal criminal events occur, and the security personnel can arrive at related places, so that related problems are solved.
However, the above monitoring method has certain drawbacks:
1) the image monitoring equipment has a certain blind area, and the information of the criminal or dangerous personnel can shield the face by wearing a hat or a mask so as to prevent the image monitoring equipment from acquiring the image information;
2) the real-time image information transmitted by the image monitoring equipment judges the illegal criminal behaviors and informs security personnel nearby the relevant scheduling places of the illegal criminal behaviors, and when the security personnel arrive at the relevant places, the illegal criminal events are generated or even completed, so that innocent personnel are infringed;
3) the number of the image monitoring devices is large, and monitoring personnel cannot observe real-time image information transmitted by each image monitoring device at the same time, so that the problems of omission, neglect and the like are caused.
Therefore, no effective solution is provided for the problems of people in the related art such as avoidance of monitoring shooting, high time delay, missing of monitoring and the like.
Disclosure of Invention
The embodiment of the application provides an active early warning method, system and device based on images and voiceprints, and aims to at least solve the problems that personnel in related technologies escape monitoring shooting, time delay is high, monitoring omission occurs and the like.
In a first aspect, an embodiment of the present application provides an active early warning method based on an image and a voiceprint, including:
identifying image information and voiceprint information;
sending an early warning call instruction under the condition that the image information comprises a preset dangerous behavior and/or the voiceprint information comprises a preset voiceprint characteristic;
and forming early warning calling information based on the early warning calling instruction, wherein the early warning calling information comprises dangerous event information, dangerous event information occurrence time, dangerous event information occurrence place, dangerous personnel information and dangerous personnel real-time position.
In some of these embodiments, after forming the early warning call information, the method further comprises:
sending the early warning call information to at least one call terminal within a preset range, wherein the preset range is a range which takes the dangerous event information occurrence place and/or the real-time position of the dangerous personnel as the center and takes a preset distance as the radius;
acquiring alarm information of at least one calling terminal corresponding to the early warning calling information, wherein the alarm information comprises alarm personnel and the real-time positions of the alarm personnel;
and generating dynamic path information based on the dangerous event information occurrence place and/or the real-time positions of the dangerous personnel and the real-time positions of the police personnel.
In some of these embodiments, first dynamic path information is generated based on the location of the hazardous event information occurrence and the real-time location of the police officer.
In some of these embodiments, second dynamic path information is generated based on the real-time location of the hazardous person and the real-time location of the police officer.
In some of these embodiments, identifying the image information comprises:
inputting image information into a behavior model to generate behavior information, wherein the behavior information comprises personnel information and action information;
and under the condition that the behavior information is preset dangerous behavior and/or the personnel information is preset dangerous personnel information, continuously acquiring a plurality of image information including the personnel information in a preset time period to generate image stream information.
In some of these embodiments, identifying the voiceprint information comprises:
extracting voiceprint characteristics of the voiceprint information;
inputting the voiceprint features into a voiceprint library;
and under the condition that the voiceprint features are matched with a preset voiceprint feature in the voiceprint library, associating the voiceprint information with attribute information corresponding to the preset voiceprint features, wherein the attribute information comprises dangerous personnel information and dangerous behavior information.
In some embodiments, the voiceprint library is updated based on the voiceprint feature if the voiceprint feature matches a predetermined voiceprint feature in the voiceprint library.
In some of these embodiments, the voiceprint feature is at least one of: energy characteristics, harmonic noise ratio characteristics, mel-frequency cepstrum coefficient characteristics.
In a second aspect, an embodiment of the present application provides an active early warning system based on an image and a voiceprint, including:
the image acquisition equipment is used for acquiring image information of different positions;
the voiceprint acquisition equipment is used for acquiring voiceprint information of different positions;
the image identification unit is connected with the image acquisition devices and used for identifying the image information transmitted by the image acquisition devices so as to generate image identification information, wherein the image identification information is used for indicating whether the image information comprises preset dangerous behaviors;
the voiceprint recognition unit is connected with the voiceprint acquisition devices and used for recognizing the voiceprint information transmitted by the voiceprint acquisition devices so as to generate voiceprint recognition information, wherein the voiceprint recognition information is used for indicating whether the voiceprint information comprises preset voiceprint characteristics;
an instruction unit connected with the image recognition unit and the voiceprint recognition unit and used for generating an early warning call instruction under the condition that the image information comprises the preset dangerous behavior and/or the voiceprint information comprises preset voiceprint characteristics
The call server is connected with the instruction unit and used for forming early warning call information based on the early warning call instruction, wherein the early warning call information comprises dangerous event information, dangerous event information occurrence time, dangerous event information occurrence place, dangerous personnel information and dangerous personnel real-time position;
and the calling terminals are connected with the calling server and used for receiving the early warning calling information transmitted by the calling server.
In some of these embodiments, further comprising:
the position calculation unit is connected with the call server and is used for generating a preset range by taking the dangerous event information occurrence place and/or the real-time position of the dangerous person as a center and taking a preset distance as a radius;
the call server is used for sending the early warning call information to at least one call terminal in the preset range.
In some of these embodiments, further comprising:
the monitoring unit is respectively connected with the image acquisition equipment, the voiceprint acquisition equipment, the image identification unit, the voiceprint identification unit, the instruction unit, the call server and the call terminal, is used for monitoring a plurality of image information and a plurality of voiceprint information transmitted by the image acquisition equipment, and is used for acquiring the image identification information transmitted by the image identification unit, acquiring the voiceprint identification information transmitted by the voiceprint identification unit, acquiring the early warning call instruction transmitted by the instruction unit, acquiring the early warning call information transmitted by the call server, and is used for transmitting the early warning call information to the call terminal.
In a third aspect, an embodiment of the present application provides a computer device, including:
at least one processor;
and a memory communicatively coupled to the at least one processor;
wherein the memory stores a computer program executable by the at least one processor, the computer program being executable by the at least one processor to cause the at least one processor to perform the image and voiceprint based active pre-warning method of the first aspect.
In a fourth aspect, the present application provides a computer-readable storage medium, which stores a computer program, and when the computer program is executed by a processor, the computer program implements the active warning method based on image and voiceprint according to the first aspect.
Compared with the related art, the active early warning method, the active early warning system and the active early warning equipment based on the images and the voiceprints, which are provided by the embodiment of the application, are realized by identifying the image information and the voiceprint information; sending an early warning call instruction under the condition that the image information comprises a preset dangerous behavior and/or the voiceprint information comprises a preset voiceprint characteristic; and forming early warning calling information based on the early warning calling instruction, wherein the early warning calling information comprises dangerous event information, dangerous event information occurrence time, dangerous event information occurrence place, dangerous personnel information and dangerous personnel real-time position, the problems that personnel escape from monitoring shooting, the time delay is high and monitoring omission occurs are solved, and the technical effects of image and voiceprint dual automatic identification and automatic calling based on the early warning calling information are realized.
Drawings
The accompanying drawings, which are included to provide a further understanding of the application and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the application and together with the description serve to explain the application and not to limit the application. In the drawings:
fig. 1 is a block diagram (one) of the structure of an active warning system according to an embodiment of the present application;
fig. 2 is a block diagram of the structure of an active warning system according to an embodiment of the present application (ii);
fig. 3 is a block diagram (iii) of the structure of the active warning system according to the embodiment of the present application;
fig. 4 is a flowchart (one) of an active warning method according to an embodiment of the present application;
fig. 5 is a flowchart of an active warning method according to an embodiment of the present application (ii);
fig. 6 is a flowchart (iii) of an active warning method according to an embodiment of the present application;
fig. 7 is a flowchart (iv) of an active warning method according to an embodiment of the present application.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the present application will be described and illustrated below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments provided in the present application without any inventive step are within the scope of protection of the present application.
It is obvious that the drawings in the following description are only examples or embodiments of the present application, and that it is also possible for a person skilled in the art to apply the present application to other similar contexts on the basis of these drawings without inventive effort. Moreover, it should be appreciated that in the development of any such actual implementation, as in any engineering or design project, numerous implementation-specific decisions must be made to achieve the developers' specific goals, such as compliance with system-related and business-related constraints, which may vary from one implementation to another.
Reference in the specification to "an embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one embodiment of the specification. The appearances of the phrase in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments. Those of ordinary skill in the art will explicitly and implicitly appreciate that the embodiments described herein may be combined with other embodiments without conflict.
Unless defined otherwise, technical or scientific terms referred to herein shall have the ordinary meaning as understood by those of ordinary skill in the art to which this application belongs. Reference to "a," "an," "the," and similar words throughout this application are not to be construed as limiting in number, and may refer to the singular or the plural. The present application is directed to the use of the terms "including," "comprising," "having," and any variations thereof, which are intended to cover non-exclusive inclusions; for example, a process, method, system, article, or apparatus that comprises a list of steps or modules (elements) is not limited to the listed steps or elements, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus. Reference to "connected," "coupled," and the like in this application is not intended to be limited to physical or mechanical connections, but may include electrical connections, whether direct or indirect. The term "plurality" as referred to herein means two or more. "and/or" describes an association relationship of associated objects, meaning that three relationships may exist, for example, "A and/or B" may mean: a exists alone, A and B exist simultaneously, and B exists alone. The character "/" generally indicates that the former and latter associated objects are in an "or" relationship. Reference herein to the terms "first," "second," "third," and the like, are merely to distinguish similar objects and do not denote a particular ordering for the objects.
Fig. 1 is a block diagram (a) of an active warning system according to an embodiment of the present application. As shown in fig. 1, the active warning system 100 includes a plurality of image capturing devices 110, a plurality of voiceprint capturing devices 120, an image recognition unit 130, a voiceprint recognition unit 140, an instruction unit 150, a call server 160, and a plurality of call terminals 170. The image recognition unit 130 is in communication connection with the image acquisition devices 110, the voiceprint recognition unit 140 is in communication connection with the voiceprint acquisition devices 120, the instruction unit 150 is in communication connection with the image recognition unit 130, the voiceprint recognition unit 140 and the call server 160, and the call server 160 is in communication connection with the call terminals 170.
The image capturing devices 110 are installed at different positions and used for acquiring image information of different positions in real time. In some embodiments, the image capturing device 110 may be a surveillance camera or a snapshot device.
The plurality of voiceprint acquisition devices 120 are installed at different positions and are used for acquiring voiceprint information of different positions in real time.
In some embodiments, image capture device 110 and voiceprint capture device 120 may be installed simultaneously in the same location, and only image capture device 110 or voiceprint capture device 120 may be installed. In the active early warning system, the problem that the image capturing device 110 has a monitoring blind area can be solved by installing the voiceprint capturing device 120.
The image recognition unit 130 is communicatively connected to the image capturing devices 110, and is configured to recognize image information transmitted by the image capturing devices 110 to generate image recognition information, where the image recognition information is used to indicate whether the image information includes a preset dangerous behavior.
In some embodiments, the image recognition unit 130 may be a separate image recognition server or may be a cloud platform capable of image recognition.
The voiceprint recognition unit 140 is communicatively connected to the voiceprint acquisition devices 120, and is configured to recognize the voiceprint information transmitted by the voiceprint acquisition devices 120 to generate voiceprint recognition information, where the voiceprint recognition information is used to indicate whether the voiceprint information includes preset voiceprint features.
In some embodiments, the voiceprint recognition unit 140 may be a separate voiceprint recognition server, or may be a cloud platform capable of voiceprint recognition.
The instruction unit 150 acquires the image identification information transmitted by the image recognition unit 130 and acquires the voiceprint identification information transmitted by the voiceprint recognition unit 140, and generates an early warning call instruction in the case where the image information includes a preset dangerous behavior and/or the voiceprint information includes a preset voiceprint feature.
In some embodiments, instruction unit 150 may be integrated with image recognition unit 130, and instruction unit 150 may be integrated with voiceprint recognition unit 140. Namely, two instruction units 150 are integrated with the image recognition unit 130 and the voiceprint recognition unit 140, respectively.
The call server 160 is configured to obtain the early warning call instruction transmitted by the instruction unit 150, and form early warning call information based on the early warning call instruction, where the early warning call information includes dangerous event information, dangerous event information occurrence time, dangerous event information occurrence location, dangerous person information, and dangerous person real-time location.
In some embodiments, the instruction unit 150 and the call server 160 may be integrated.
The call server 160 sends the early warning call information to the plurality of call terminals 170 so that security personnel equipped with the call terminals 170 can go to the dangerous event occurrence place or the real-time position of dangerous personnel according to the early warning call information.
In some embodiments, the call terminal 170 may be a smart phone, a smart tablet, an intercom, or the like.
The call terminal 170 is further configured to generate alarm information corresponding to the early warning call information, where the alarm information includes an alarm person and a real-time position of the alarm person.
The communication connection is connected through a network, and may be a wired network connection or a wireless network connection. In some of these embodiments, the network may include a public network (e.g., the internet), a private network (e.g., a Local Area Network (LAN), a Wide Area Network (WAN), etc.), a wireless network (e.g., an 802.11 network, a Wi-Fi network, etc.), a cellular network (e.g., a 4G network, a 5G network, etc.), a frame relay network, a Virtual Private Network (VPN), a satellite network, a router, hub, switch, server, etc., or any combination thereof. By way of example only, the network may include a cable network, a wireline network, a fiber optic network, a telecommunications network, an intranet, a Wireless Local Area Network (WLAN), a Metropolitan Area Network (MAN), a Public Switched Telephone Network (PSTN), the like, or any combination thereof. In some embodiments, the network may include one or more network access points. For example, the network may include wired and/or wireless network access points, such as base stations and/or internet exchange points, through which various devices of the active warning system 100 may connect to the network to exchange information and/or data.
In the related technology, the problems of monitoring blind areas, high time delay, monitoring omission and the like easily occur due to manual monitoring by workers. The image information and the voiceprint information can be automatically identified through the image identification unit and the voiceprint identification unit, so that an early warning call instruction and early warning call information are automatically formed, early warning call information is actively sent to a plurality of call terminals through the call server, and the problems of monitoring blind areas, time delay, monitoring omission and the like are solved.
Fig. 2 is a block diagram of a structure of an active warning system according to an embodiment of the present application (ii). As shown in fig. 2, the active warning system 100 further includes a location calculating unit 180, and the location calculating unit 180 is communicatively connected to the call server 160.
The position calculating unit 180 is configured to extract a dangerous event occurrence location and a real-time position of a dangerous person in the early warning call information, and generate a preset range by using the dangerous event occurrence location and/or the real-time position of the dangerous person as a center and using a preset distance as a radius.
Based on the preset range, the call server 160 sends the early warning call information to a number of call terminals 170 within the preset range.
In this embodiment, by setting the preset range, the transmission amount of the call server 160 is reduced, so that security personnel equipped with the call terminal 170 in the preset range can quickly go to the dangerous event occurrence location and/or the real-time position of the dangerous personnel, and the dangerous event can be killed or stopped.
Fig. 3 is a block diagram (iii) of the structure of the active warning system according to the embodiment of the present application. As shown in fig. 3, the active warning system 100 further includes a monitoring unit 190, and the monitoring unit 190 is in communication connection with the image capturing devices 110, the voiceprint capturing devices 120, the image recognition unit 130, the voiceprint recognition unit 140, the instruction unit 150, the call server 160, and the call terminals 170, respectively.
The monitoring unit 190 is configured to monitor image information transmitted by the image capturing devices 110 and voiceprint information transmitted by the voiceprint capturing devices 120, and is configured to acquire image identification information transmitted by the image identifying unit 130, acquire voiceprint identification information transmitted by the voiceprint identifying unit 140, acquire an early warning call instruction transmitted by the instruction unit 150, acquire early warning call information transmitted by the call server 160, and transmit early warning call information to the call terminals 170.
In some embodiments, the monitoring unit 190 is a monitoring center or a monitoring large screen.
In the embodiment, the problems of monitoring blind areas, timeliness delay, monitoring omission and the like are solved through manual identification and automatic identification, and the technical effects of no monitoring blind area, strong timeliness and no monitoring omission are achieved.
In addition, the active warning system 100 further includes a path generating unit, which is communicatively connected to the call server 160, the plurality of call terminals 170, and the monitoring unit 190, and is configured to generate dynamic path information based on the warning call information and the warning information.
In some embodiments, the path generating unit generates the first dynamic path information based on the dangerous event occurrence location and the real-time position of the police officer, and sends the first dynamic path information to the monitoring unit 190 for display.
In some embodiments, the path generating unit generates the second dynamic path information based on the real-time location of the dangerous person and the real-time location of the police officer, and sends the second dynamic path information to the monitoring unit 190 for display.
Through the path generation unit of this embodiment, can show dangerous personnel and the real-time positional information who gives police personnel in real time in the monitoring unit, the monitoring staff of being convenient for knows relevant condition in real time.
In some embodiments, in the case that the image identification information indicates that the image information includes a preset dangerous behavior, based on dangerous person information in the early warning call information, the image acquisition devices 110 continuously acquire the image information including the dangerous person information for a preset time period and generate image stream information.
Specifically, video information of dangerous personnel in a certain time period is acquired in a face tracking mode, so that security personnel and monitoring personnel can master the whereabouts of the dangerous personnel in real time.
Fig. 4 is a flowchart (one) of an active warning method according to an embodiment of the present application. As shown in fig. 4, the active warning method includes:
step S402, identifying image information and voiceprint information;
step S404, sending an early warning call instruction under the condition that the image information comprises a preset dangerous behavior and/or the voiceprint information comprises a preset voiceprint characteristic;
step S406, forming early warning calling information based on the early warning calling instruction, wherein the early warning calling information comprises dangerous event information, dangerous event occurrence time, dangerous event occurrence place, dangerous personnel information and dangerous personnel real-time position.
In some embodiments, the image recognition information is obtained by recognizing image information transmitted by the image capturing device, and is used for indicating whether the image information includes preset dangerous behaviors or not. The image identification information may be presented in a text form or an image form. When the image is presented in a text form, the image identification information is that the preset dangerous behavior is included or the preset dangerous behavior is not included; when the image is presented in the form of an image, the image identification information is original image information (i.e. does not include the preset dangerous behavior) or is marked on the basis of the original image information (i.e. includes the preset dangerous behavior). Specifically, labeling on the basis of the original image information includes enclosing dangerous people, enclosing victims, and the like.
In some embodiments, the voiceprint identification information is obtained by identifying voiceprint information transmitted by the voiceprint acquisition device, and is used for indicating whether the voiceprint information comprises preset voiceprint characteristics. The voiceprint identification information can be presented in a text form or a voiceprint form. When the voice print identification information is presented in a text form, the voice print identification information comprises a preset voice print characteristic or does not comprise the preset voice print characteristic; when the voice print is presented in the form of voice print, the voice print identification information is original voice print information (i.e. does not include the preset voice print characteristics) or is marked on the basis of the original voice print information (i.e. includes the preset voice print characteristics). Specifically, the labeling includes circling out key voiceprint features and the like on the basis of the original voiceprint information.
In the related technology, the problems of monitoring blind areas, high time delay, monitoring omission and the like easily occur due to manual monitoring by workers. By automatically identifying the image information and the voiceprint information, the method and the system can automatically form the early warning call instruction and the early warning call information, and actively send the early warning call information to a plurality of call terminals through the call server, so that the problems of monitoring blind areas, time delay, monitoring omission and the like are solved.
Fig. 5 is a flowchart of an active warning method according to an embodiment of the present application (ii). As shown in fig. 5, after the formation of the early warning call information, the method further includes:
step S502, early warning calling information is sent to at least one calling terminal within a preset range, wherein the preset range is a range which takes the occurrence place of a dangerous event and/or the real-time position of dangerous personnel as the center and takes a preset distance as the radius;
step S504, acquiring alarm information of at least one calling terminal corresponding to the early warning calling information, wherein the alarm information comprises alarm personnel and the real-time positions of the alarm personnel;
and step S506, generating dynamic path information based on the dangerous event occurrence place and/or the real-time positions of dangerous personnel and the real-time positions of police personnel.
Wherein the preset distance is 200-1000 m.
In some embodiments, in extreme cases, such as when there is no calling terminal within the preset range, the coverage area of the preset range is increased by at least 200m based on the original preset distance, so as to find the calling terminal closest to the dangerous event occurrence location and/or the dangerous person real-time location.
In some embodiments, the real-time location of the dangerous person may be dynamically changed, so that the preset range may be continuously updated, thereby ensuring that the distance between at least one security personnel equipped with the call terminal and the real-time location of the dangerous person is minimized.
In some embodiments, after receiving the warning call information, the calling terminal determines whether to generate warning information based on its own state. If the security personnel equipped with the calling terminal is processing other affairs, the security personnel will not give an alarm after the calling terminal receives the early warning calling information, therefore, the alarm information will not be generated.
In some embodiments, first dynamic path information is generated based on the dangerous event occurrence place and the real-time position of the police officer; and generating second dynamic path information based on the real-time position of the dangerous personnel and the real-time position of the police personnel. Through the first dynamic path information and the second dynamic path information, at least one security personnel provided with the calling terminal can be ensured to go to the dangerous event occurrence place, and at least one security personnel provided with the calling terminal can track the dangerous personnel.
In the related art, when a dangerous behavior occurs, a monitoring worker generally makes a global call so that all call terminals receive the warning information, which results in low accuracy of transmitting the warning information and occupies a large amount of network bandwidth. The early warning calling information is sent to the calling terminal within the preset range through the embodiment, so that the transmission accuracy can be improved, the occupation of network bandwidth is reduced, and security personnel equipped with the calling terminal and closest to the dangerous event occurrence place or the real-time position of dangerous personnel can timely and quickly go to the dangerous event occurrence place or the dangerous personnel real-time position.
Fig. 6 is a flowchart (iii) of an active warning method according to an embodiment of the present application. As shown in fig. 6, the recognition image information includes:
step S602, inputting image information into a behavior model to generate behavior information, wherein the behavior information comprises personnel information and action information;
step S604, continuously acquiring a plurality of image information including the person information in a preset time period to generate image stream information under the condition that the behavior information is a preset dangerous behavior and/or the person information is a preset dangerous person.
The personnel information comprises normal personnel information and wanted personnel information, and the action information comprises actions such as fighting, wrestling and the like.
In some embodiments, the behavior model is constructed based on a deep neural network model, and is trained by inputting a large amount of image information including dangerous behaviors, image information not including dangerous behaviors and related personnel information (such as wanted person image information) so as to improve the identification accuracy.
In some of these embodiments, the predetermined period of time is between 5min and 30 min. By means of the image flow information of 5-30 min, dangerous behaviors or dangerous personnel can be continuously monitored, and real-time acquisition of specific conditions is facilitated.
Fig. 7 is a flowchart (iv) of an active warning method according to an embodiment of the present application. As shown in fig. 7, identifying voiceprint information includes:
step S702, extracting the voiceprint characteristics of the voiceprint information;
step S704, inputting the voiceprint characteristics into a voiceprint library;
step S706, under the condition that the voiceprint features are matched with a preset voiceprint feature in the voiceprint library, associating the voiceprint information with attribute information corresponding to the preset voiceprint features, wherein the attribute information comprises dangerous personnel information and dangerous behavior information.
The voiceprint library stores voiceprint characteristics of a plurality of dangerous persons, voiceprint characteristics of dangerous behaviors and voiceprint characteristics of victims, including but not limited to voiceprint characteristics of shouting, calling for help, crying and the like.
In some of these embodiments, the search alignment is performed in a voiceprint library based on the input voiceprint characteristics. If the retrieval result returned by the voiceprint library is none or null, the voiceprint feature is indicated to be a normal voiceprint feature; and if the retrieval result returned by the voiceprint library is not empty, selecting the preset voiceprint features with the highest similarity or the highest matching degree, and thus judging the condition of the voiceprint information.
In some of these embodiments, the voiceprint features include at least one of energy features, harmonic-to-noise ratio features, mel-frequency cepstral coefficient features.
Specifically, according to different application environments, only the energy feature of the voiceprint information may be extracted, only the harmonic-to-noise ratio feature of the voiceprint information may be extracted, and only the mel-frequency cepstrum coefficient feature of the voiceprint information may be extracted.
Through the voiceprint feature of this embodiment is drawed to voiceprint information, can judge and give attribute information to the voiceprint information who acquires fast, can assist discernment image information, and then fix a position dangerous action, dangerous personnel more accurately fast, have the problem that monitoring blind area, ageing delay are high among the correlation technique solved.
In addition, the active early warning method of the embodiment of the application can be realized by computer equipment. Components of the computer device may include, but are not limited to, a processor and a memory storing computer program instructions.
In some embodiments, the processor may include a Central Processing Unit (CPU), or A Specific Integrated Circuit (ASIC), or may be configured to implement one or more integrated circuits of embodiments of the present application.
In some embodiments, the memory may include mass storage for data or instructions. By way of example, and not limitation, memory may include a hard disk drive (hard disk drive, abbreviated HDD), a floppy disk drive, a Solid State Drive (SSD), flash memory, an optical disk, a magneto-optical disk, tape, or a Universal Serial Bus (USB) drive or a combination of two or more of these. The memory may include removable or non-removable (or fixed) media, where appropriate. The memory may be internal or external to the data processing apparatus, where appropriate. In a particular embodiment, the memory is a Non-Volatile (Non-Volatile) memory. In particular embodiments, the memory includes Read-only memory (ROM) and Random Access Memory (RAM). The ROM may be mask-programmed ROM, Programmable ROM (PROM), Erasable PROM (EPROM), Electrically Erasable PROM (EEPROM), electrically rewritable ROM (EAROM), or FLASH Memory (FLASH), or a combination of two or more of these, where appropriate. The RAM may be a static Random-Access Memory (SRAM) or a dynamic Random-Access Memory (DRAM), where the DRAM may be a fast page mode dynamic Random-Access Memory (FPMDRAM), an extended data output dynamic Random-Access Memory (EDODRAM), a synchronous dynamic Random-Access Memory (SDRAM), and the like.
The memory may be used to store or cache various data files for processing and/or communication use, as well as possibly computer program instructions for execution by the processor.
The processor reads and executes the computer program instructions stored in the memory to implement any one of the active warning methods in the above embodiments.
In some of these embodiments, the computer device may also include a communication interface and a bus. The processor, the memory and the communication interface are connected through a bus and complete mutual communication.
The communication interface is used for realizing communication among modules, devices, units and/or equipment in the embodiment of the application. The communication interface may also be implemented with other components such as: the data communication is carried out among external equipment, image/data acquisition equipment, a database, external storage, an image/data processing workstation and the like.
A bus comprises hardware, software, or both that couple components of a computer device to one another. Buses include, but are not limited to, at least one of the following: data bus (DataBus), address bus (AddressBus), control bus (ControlBus), expansion bus (expansion bus), and local bus (LocalBus). By way of example and not limitation, a bus may include an Accelerated Graphics Port (AGP) or other graphics bus, an Enhanced Industry Standard Architecture (EISA) bus, a front-side bus (FSB), a HyperTransport (HT) Interconnect, an ISA (ISA) bus, an InfiniBand (InfiniBand) Interconnect, a Low Pin Count (LPC) bus, a memory bus, a MicroChannel Architecture (MCA) bus, a PCI (peripheral component Interconnect) bus, a PCI-Express (PCI-X) bus, a Serial Advanced Technology Attachment (SATA) bus, a video bus, or a combination of two or more of these suitable electronic buses. A bus may include one or more buses, where appropriate. Although specific buses are described and shown in the embodiments of the application, any suitable buses or interconnects are contemplated by the application.
The computer device may execute the active warning method in the embodiment of the present application.
In addition, in combination with the active early warning method in the foregoing embodiments, the embodiments of the present application may provide a computer-readable storage medium to implement. The computer readable storage medium having stored thereon computer program instructions; the computer program instructions, when executed by a processor, implement any of the active warning methods in the above embodiments.
The technical features of the embodiments described above may be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the embodiments described above are not described, but should be considered as being within the scope of the present specification as long as there is no contradiction between the combinations of the technical features.
The above-mentioned embodiments only express several embodiments of the present application, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the concept of the present application, which falls within the scope of protection of the present application. Therefore, the protection scope of the present patent shall be subject to the appended claims.

Claims (10)

1. An active early warning method based on images and voiceprints is characterized by comprising the following steps:
identifying image information and voiceprint information;
sending an early warning call instruction under the condition that the image information comprises a preset dangerous behavior and/or the voiceprint information comprises a preset voiceprint characteristic;
and forming early warning calling information based on the early warning calling instruction, wherein the early warning calling information comprises dangerous event information, dangerous event information occurrence time, dangerous event information occurrence place, dangerous personnel information and dangerous personnel real-time position.
2. The active warning method based on image and voiceprint of claim 1 wherein after forming the warning call information, the method further comprises:
sending the early warning call information to at least one call terminal within a preset range, wherein the preset range is a range which takes the dangerous event information occurrence place and/or the real-time position of the dangerous personnel as the center and takes a preset distance as the radius;
acquiring alarm information of at least one calling terminal corresponding to the early warning calling information, wherein the alarm information comprises alarm personnel and the real-time positions of the alarm personnel;
and generating dynamic path information based on the dangerous event information occurrence place and/or the real-time positions of the dangerous personnel and the real-time positions of the police personnel.
3. The active pre-warning method based on image and voiceprint of claim 1 wherein identifying image information comprises:
inputting image information into a behavior model to generate behavior information, wherein the behavior information comprises personnel information and action information;
and under the condition that the behavior information is preset dangerous behavior and/or the personnel information is preset dangerous personnel information, continuously acquiring a plurality of image information including the personnel information in a preset time period to generate image stream information.
4. The active pre-warning method based on image and voiceprint of claim 1 wherein identifying voiceprint information comprises:
extracting voiceprint characteristics of the voiceprint information;
inputting the voiceprint features into a voiceprint library;
and under the condition that the voiceprint features are matched with a preset voiceprint feature in the voiceprint library, associating the voiceprint information with attribute information corresponding to the preset voiceprint features, wherein the attribute information comprises dangerous personnel information and dangerous behavior information.
5. The active pre-warning method based on image and voiceprint according to claim 4, wherein the voiceprint characteristic is at least one of: energy characteristics, harmonic noise ratio characteristics, mel-frequency cepstrum coefficient characteristics.
6. An active early warning system based on images and voiceprints, comprising:
the image acquisition equipment is used for acquiring image information of different positions;
the voiceprint acquisition equipment is used for acquiring voiceprint information of different positions;
the image identification unit is connected with the image acquisition devices and used for identifying the image information transmitted by the image acquisition devices so as to generate image identification information, wherein the image identification information is used for indicating whether the image information comprises preset dangerous behaviors;
the voiceprint recognition unit is connected with the voiceprint acquisition devices and used for recognizing the voiceprint information transmitted by the voiceprint acquisition devices so as to generate voiceprint recognition information, wherein the voiceprint recognition information is used for indicating whether the voiceprint information comprises preset voiceprint characteristics;
an instruction unit connected with the image recognition unit and the voiceprint recognition unit and used for generating an early warning call instruction under the condition that the image information comprises the preset dangerous behavior and/or the voiceprint information comprises preset voiceprint characteristics
The call server is connected with the instruction unit and used for forming early warning call information based on the early warning call instruction, wherein the early warning call information comprises dangerous event information, dangerous event information occurrence time, dangerous event information occurrence place, dangerous personnel information and dangerous personnel real-time position;
and the calling terminals are connected with the calling server and used for receiving the early warning calling information transmitted by the calling server.
7. The active image and voiceprint based warning system of claim 6 further comprising:
the position calculation unit is connected with the call server and is used for generating a preset range by taking the dangerous event information occurrence place and/or the real-time position of the dangerous person as a center and taking a preset distance as a radius;
the call server is used for sending the early warning call information to at least one call terminal in the preset range.
8. The active image and voiceprint based warning system of claim 6 further comprising:
the monitoring unit is respectively connected with the image acquisition equipment, the voiceprint acquisition equipment, the image identification unit, the voiceprint identification unit, the instruction unit, the call server and the call terminal, is used for monitoring a plurality of image information and a plurality of voiceprint information transmitted by the image acquisition equipment, and is used for acquiring the image identification information transmitted by the image identification unit, acquiring the voiceprint identification information transmitted by the voiceprint identification unit, acquiring the early warning call instruction transmitted by the instruction unit, acquiring the early warning call information transmitted by the call server, and is used for transmitting the early warning call information to the call terminal.
9. A computer device, comprising:
at least one processor;
and a memory communicatively coupled to the at least one processor;
wherein the memory stores a computer program executable by the at least one processor to cause the at least one processor to perform the image and voiceprint based active pre-warning method of any one of claims 1 to 5.
10. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program which, when executed by a processor, implements the image and voiceprint based active pre-warning method according to any one of claims 1 to 5.
CN202010661932.6A 2020-07-10 2020-07-10 Active early warning method, system and equipment based on image and voiceprint Pending CN111784947A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010661932.6A CN111784947A (en) 2020-07-10 2020-07-10 Active early warning method, system and equipment based on image and voiceprint

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010661932.6A CN111784947A (en) 2020-07-10 2020-07-10 Active early warning method, system and equipment based on image and voiceprint

Publications (1)

Publication Number Publication Date
CN111784947A true CN111784947A (en) 2020-10-16

Family

ID=72767177

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010661932.6A Pending CN111784947A (en) 2020-07-10 2020-07-10 Active early warning method, system and equipment based on image and voiceprint

Country Status (1)

Country Link
CN (1) CN111784947A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112907809A (en) * 2021-01-29 2021-06-04 深圳市兴海物联科技有限公司 Management method, system, equipment and computer storage medium
CN114845026A (en) * 2022-05-02 2022-08-02 北京万合恒安科技有限公司 Dynamic monitoring communication device based on big data and use method thereof

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102622818A (en) * 2011-01-26 2012-08-01 北京海鑫智圣技术有限公司 All-directional intelligent monitoring method for bank ATMs
CN105679313A (en) * 2016-04-15 2016-06-15 福建新恒通智能科技有限公司 Audio recognition alarm system and method
CN106713868A (en) * 2017-01-03 2017-05-24 捷开通讯(深圳)有限公司 Random target monitoring method and system
CN108010289A (en) * 2017-12-28 2018-05-08 深圳市永达电子信息股份有限公司 A kind of internet alarm method and system based on Application on Voiceprint Recognition
CN108257362A (en) * 2018-01-11 2018-07-06 广州广大声像灯光科技有限公司 Interactive electronic security police guard method, system and device based on GIS
CN109598885A (en) * 2018-12-21 2019-04-09 广东中安金狮科创有限公司 Monitoring system and its alarm method
CN110675585A (en) * 2019-09-23 2020-01-10 北京华毅东方展览有限公司 Exhibition safety control system
CN110830771A (en) * 2019-11-11 2020-02-21 广州国音智能科技有限公司 Intelligent monitoring method, device, equipment and computer readable storage medium
CN110992609A (en) * 2019-11-11 2020-04-10 云知声智能科技股份有限公司 Automatic distress system and method based on voiceprint detection

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102622818A (en) * 2011-01-26 2012-08-01 北京海鑫智圣技术有限公司 All-directional intelligent monitoring method for bank ATMs
CN105679313A (en) * 2016-04-15 2016-06-15 福建新恒通智能科技有限公司 Audio recognition alarm system and method
CN106713868A (en) * 2017-01-03 2017-05-24 捷开通讯(深圳)有限公司 Random target monitoring method and system
CN108010289A (en) * 2017-12-28 2018-05-08 深圳市永达电子信息股份有限公司 A kind of internet alarm method and system based on Application on Voiceprint Recognition
CN108257362A (en) * 2018-01-11 2018-07-06 广州广大声像灯光科技有限公司 Interactive electronic security police guard method, system and device based on GIS
CN109598885A (en) * 2018-12-21 2019-04-09 广东中安金狮科创有限公司 Monitoring system and its alarm method
CN110675585A (en) * 2019-09-23 2020-01-10 北京华毅东方展览有限公司 Exhibition safety control system
CN110830771A (en) * 2019-11-11 2020-02-21 广州国音智能科技有限公司 Intelligent monitoring method, device, equipment and computer readable storage medium
CN110992609A (en) * 2019-11-11 2020-04-10 云知声智能科技股份有限公司 Automatic distress system and method based on voiceprint detection

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112907809A (en) * 2021-01-29 2021-06-04 深圳市兴海物联科技有限公司 Management method, system, equipment and computer storage medium
CN114845026A (en) * 2022-05-02 2022-08-02 北京万合恒安科技有限公司 Dynamic monitoring communication device based on big data and use method thereof

Similar Documents

Publication Publication Date Title
CN107278369B (en) Personnel searching method, device and communication system
CN109583278B (en) Face recognition alarm method, device and system and computer equipment
CN109815818B (en) Target person tracking method, system and related device
CN109672853B (en) Early warning method, device and equipment based on video monitoring and computer storage medium
CN105847769A (en) Construction site behavior analysis system and analysis method
CN110555918B (en) Attendance management method and attendance management equipment
US9942520B2 (en) Interactive and targeted monitoring service
JP2023174753A (en) Video sharing system, video sharing method, and program
WO2018146938A1 (en) Person detection system
CN106503666A (en) A kind of method for safety monitoring, device and electronic equipment
CN111784947A (en) Active early warning method, system and equipment based on image and voiceprint
CA2983316C (en) Monitoring system having personal information protection function and method thereof
CN107959748A (en) Automatic alarm method and device
JP2010191620A (en) Method and system for detecting suspicious person
WO2020042637A1 (en) Supervision method, apparatus and system, cloud server and storage medium
KR20160074208A (en) System and method for providing safety service using beacon signals
KR20220063280A (en) Crowd Overcrowding Prediction Method and Apparatus
CN115880631A (en) Power distribution station fault identification system, method and medium
CN109214326A (en) A kind of information processing method, device and system
CN109800656B (en) Positioning method and related product
CN111540020A (en) Method and device for determining target behavior, storage medium and electronic device
CN117238120B (en) Security monitoring method, device, equipment and medium
US11100784B2 (en) Method and system for detecting and notifying actionable events during surveillance
KR102367584B1 (en) Automatic video surveillance system using skeleton video analysis technique
KR20190127101A (en) Security service system and method based on cloud

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20201016

RJ01 Rejection of invention patent application after publication