CN108766439A - Monitoring method and device based on voiceprint recognition - Google Patents

Publication number
CN108766439A
CN108766439A CN201810394740.6A CN201810394740A CN108766439A CN 108766439 A CN108766439 A CN 108766439A CN 201810394740 A CN201810394740 A CN 201810394740A CN 108766439 A CN108766439 A CN 108766439A
Authority
CN
China
Prior art keywords
audio
voiceprint
enrollment
preset
application
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810394740.6A
Other languages
Chinese (zh)
Inventor
吴松海
陈昊亮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou National Sound Technology Co Ltd
Original Assignee
Guangzhou National Sound Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou National Sound Technology Co Ltd
Priority to CN201810394740.6A
Publication of CN108766439A
Legal status: Pending

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00: Speaker identification or verification techniques
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/08: Speech classification or search
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00: Speaker identification or verification techniques
    • G10L17/02: Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/08: Speech classification or search
    • G10L2015/088: Word spotting

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Signal Processing (AREA)
  • Emergency Alarm Devices (AREA)

Abstract

An embodiment of the invention discloses a monitoring method and device based on voiceprint recognition. It addresses the technical problem that existing monitoring technology generally relies on cameras, which cannot capture images normally once they are deliberately blocked and whose footage is easily limited by viewing angle and lighting conditions, resulting in incomplete monitoring. The method of the embodiment comprises: S1, obtaining monitored audio; S2, performing speech recognition on the monitored audio and, when the monitored audio contains a preset keyword, executing step S3; S3, performing voiceprint recognition on the monitored audio and comparing the first voiceprint corresponding to the monitored audio with the second voiceprints in a preset voiceprint library; if an identical voiceprint is matched, sending location information to an early-warning platform and responding to the early-warning platform.

Description

Monitoring method and device based on voiceprint recognition
Technical Field
The present invention relates to the field of monitoring technology, and in particular to a monitoring method and device based on voiceprint recognition.
Background
As cameras and face recognition technology have matured, they have been deployed in scenarios such as city blocks and indoor spaces, enabling real-time monitoring, area-wide surveillance deployment, target tracking, public security, and other practical applications.
Existing monitoring technology generally relies on cameras. A camera cannot capture images normally once it is deliberately blocked, and its footage is easily limited by viewing angle and lighting conditions, which leads to the technical problem of incomplete monitoring.
Summary of the Invention
The present invention provides a monitoring method and device based on voiceprint recognition, which solve the technical problem that existing monitoring technology generally relies on cameras, which cannot capture images normally once they are deliberately blocked and whose footage is easily limited by viewing angle and lighting conditions, resulting in incomplete monitoring.
The present invention provides a monitoring method based on voiceprint recognition, comprising:
S1: obtaining monitored audio;
S2: performing speech recognition on the monitored audio and, when the monitored audio contains a preset keyword, executing step S3;
S3: performing voiceprint recognition on the monitored audio and comparing the first voiceprint corresponding to the monitored audio with the second voiceprints in a preset voiceprint library; if an identical voiceprint is matched, sending location information to an early-warning platform and responding to the early-warning platform.
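The control flow of steps S1 to S3 can be sketched as follows. This is a minimal illustration only: the patent does not name any speech recognition or voiceprint engine, so `transcribe`, `extract_voiceprint`, and `find_match` are hypothetical callables supplied by the caller, and the function name `monitor_once` is likewise an assumption.

```python
from typing import Callable, Optional, Sequence


def monitor_once(
    audio: bytes,
    transcribe: Callable[[bytes], str],
    extract_voiceprint: Callable[[bytes], Sequence[float]],
    find_match: Callable[[Sequence[float]], Optional[str]],
    keywords: Sequence[str],
) -> Optional[str]:
    """Run steps S2 and S3 on one segment of monitored audio (the S1 output).

    Returns the identifier of the matched speaker, or None when the audio
    contains no preset keyword or no voiceprint in the library matches.
    All three callables are hypothetical stand-ins for real engines.
    """
    text = transcribe(audio)  # S2: speech recognition
    if not any(keyword in text for keyword in keywords):
        return None  # S2: no preset keyword, so S3 is skipped
    voiceprint = extract_voiceprint(audio)  # S3: extract the first voiceprint
    return find_match(voiceprint)  # S3: compare against the preset library
```

A caller would send location information to the early-warning platform whenever the function returns a speaker identifier; that side effect is left out so the sketch stays free of network assumptions.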
Optionally, before step S1 the method further comprises:
S01: obtaining enrollment audio;
S02: extracting the second voiceprint from the enrollment audio and saving it to the preset voiceprint library.
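Steps S01 and S02 amount to building the preset voiceprint library. A minimal sketch, assuming the library is an in-memory dictionary keyed by a speaker identifier and that `extract_voiceprint` is a hypothetical extractor supplied by the caller (the patent specifies neither):

```python
from typing import Callable, Dict, List


def enroll(
    speaker_id: str,
    enrollment_audio: bytes,
    extract_voiceprint: Callable[[bytes], List[float]],
    library: Dict[str, List[float]],
) -> None:
    """S01/S02: extract the second voiceprint and save it to the library."""
    # The dictionary stands in for the preset voiceprint library; a real
    # deployment would presumably use persistent storage.
    library[speaker_id] = extract_voiceprint(enrollment_audio)
```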
Optionally, after step S01 and before step S02, the method further comprises:
Performing speech quality detection on the enrollment audio, including:
Calculating the first signal-to-noise ratio, the first average energy value, and the first effective speech duration of the enrollment audio;
Comparing, in turn, the first signal-to-noise ratio, the first average energy value, and the first effective speech duration of the enrollment audio with their corresponding first preset thresholds; if all three are above their corresponding first preset thresholds, determining that the speech quality of the enrollment audio is acceptable and executing the next step; otherwise, prompting the user to re-record the audio and returning to the step of obtaining enrollment audio.
Optionally, before calculating the first signal-to-noise ratio, the first average energy value, and the first effective speech duration of the enrollment audio, the method further comprises:
Determining the content type of the enrollment audio, where the content types include random digits, random phrases, random long sentences, and fixed phrases;
Determining the first preset threshold corresponding to the first effective speech duration according to the content type of the enrollment audio.
Optionally, step S3 specifically comprises:
Performing voiceprint recognition on the monitored audio and extracting the first voiceprint from the monitored audio;
Comparing the first voiceprint from the monitored audio with the second voiceprints in the preset voiceprint library to obtain a matching value;
Judging whether the matching value is higher than a preset matching threshold and, when it is, sending location information to the early-warning platform and responding to the early-warning platform.
Optionally, when the matching value is lower than the preset matching threshold, the first voiceprint from the monitored audio is added to the preset voiceprint library, and the early-warning platform is responded to.
The present invention provides a monitoring device based on voiceprint recognition, comprising:
A first acquisition unit, configured to obtain monitored audio;
A speech recognition unit, configured to perform speech recognition on the monitored audio and, when the monitored audio contains a preset keyword, to hand over to the voiceprint comparison unit;
A voiceprint comparison unit, configured to perform voiceprint recognition on the monitored audio and to compare the first voiceprint corresponding to the monitored audio with the second voiceprints in a preset voiceprint library; if an identical voiceprint is matched, location information is sent to an early-warning platform and the early-warning platform is responded to.
Optionally, the monitoring device based on voiceprint recognition provided by the invention further comprises:
A second acquisition unit, configured to obtain enrollment audio;
A voiceprint extraction unit, configured to extract the second voiceprint from the enrollment audio and save it to the preset voiceprint library.
Optionally, the monitoring device based on voiceprint recognition provided by the invention further comprises:
A speech quality detection unit, configured to perform speech quality detection on the enrollment audio;
The speech quality detection unit comprises:
A calculation subunit, configured to calculate the first signal-to-noise ratio, the first average energy value, and the first effective speech duration of the enrollment audio;
A comparison subunit, configured to compare, in turn, the first signal-to-noise ratio, the first average energy value, and the first effective speech duration of the enrollment audio with their corresponding first preset thresholds; if all three are above their corresponding first preset thresholds, the speech quality of the enrollment audio is determined to be acceptable and the next step is executed; otherwise, the user is prompted to re-record the audio and the device returns to obtaining enrollment audio.
Optionally, the speech quality detection unit further comprises:
A judgment subunit, configured to determine the content type of the enrollment audio, where the content types include random digits, random phrases, random long sentences, and fixed phrases;
A threshold determination subunit, configured to determine the first preset threshold corresponding to the first effective speech duration according to the content type of the enrollment audio.
As can be seen from the above technical solutions, the present invention has the following advantages:
The present invention provides a monitoring method based on voiceprint recognition, comprising: S1, obtaining monitored audio; S2, performing speech recognition on the monitored audio and, when the monitored audio contains a preset keyword, executing step S3; S3, performing voiceprint recognition on the monitored audio and comparing the first voiceprint corresponding to the monitored audio with the second voiceprints in a preset voiceprint library; if an identical voiceprint is matched, sending location information to an early-warning platform and responding to the early-warning platform.
In the present invention, monitored audio is obtained and checked for preset keywords. If a preset keyword is detected, voiceprint recognition is performed on the monitored audio, and the recognized first voiceprint is compared with the second voiceprints in the preset voiceprint library to judge whether the speaker is a tracked target. This solves the technical problem that existing monitoring technology generally relies on cameras, which cannot capture images normally once they are deliberately blocked and whose footage is easily limited by viewing angle and lighting conditions, resulting in incomplete monitoring.
Brief Description of the Drawings
To explain the technical solutions of the embodiments of the present invention or of the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. The drawings described below show only some embodiments of the present invention; those of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a schematic flowchart of one embodiment of the monitoring method based on voiceprint recognition provided by the invention;
Fig. 2 is a schematic flowchart of another embodiment of the monitoring method based on voiceprint recognition provided by the invention;
Fig. 3 is a schematic structural diagram of one embodiment of the monitoring device based on voiceprint recognition provided by the invention;
Fig. 4 is a schematic structural diagram of another embodiment of the monitoring device based on voiceprint recognition provided by the invention.
Detailed Description
Embodiments of the present invention provide a monitoring method and device based on voiceprint recognition, which solve the technical problem that existing monitoring technology generally relies on cameras, which cannot capture images normally once they are deliberately blocked and whose footage is easily limited by viewing angle and lighting conditions, resulting in incomplete monitoring.
To make the purpose, features, and advantages of the invention clearer and easier to understand, the technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings. The embodiments described below are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by those of ordinary skill in the art from these embodiments without creative effort fall within the scope of protection of the present invention.
Referring to Fig. 1, the present invention provides a monitoring method based on voiceprint recognition, comprising:
101. Obtaining monitored audio;
102. Performing speech recognition on the monitored audio and, when the monitored audio contains a preset keyword, executing step 103;
103. Performing voiceprint recognition on the monitored audio and comparing the first voiceprint corresponding to the monitored audio with the second voiceprints in the preset voiceprint library; if an identical voiceprint is matched, sending location information to the early-warning platform and responding to the early-warning platform.
In this embodiment of the present invention, monitored audio is obtained and checked for preset keywords. If a preset keyword is detected, voiceprint recognition is performed on the monitored audio, and the recognized first voiceprint is compared with the second voiceprints in the preset voiceprint library to judge whether the speaker is a tracked target. This solves the technical problem that existing monitoring technology generally relies on cameras, which cannot capture images normally once they are deliberately blocked and whose footage is easily limited by viewing angle and lighting conditions, resulting in incomplete monitoring.
The above describes one embodiment of the monitoring method based on voiceprint recognition provided by the invention. Another embodiment of the method is described below.
Referring to Fig. 2, the present invention provides a monitoring method based on voiceprint recognition, comprising:
201. Obtaining enrollment audio;
It should be noted that, before the preset voiceprint library is built, the audio to be enrolled is obtained first.
202. Performing speech quality detection on the enrollment audio, including:
2021. Determining the content type of the enrollment audio, where the content types include random digits, random phrases, random long sentences, and fixed phrases;
It should be noted that the content type of the enrollment audio is determined, and that the content types include random digits, random phrases, random long sentences, and fixed phrases.
2022. Determining the first preset threshold corresponding to the first effective speech duration according to the content type of the enrollment audio;
It should be noted that the first preset threshold corresponding to the first effective speech duration depends on the content type of the enrollment audio: for random digits the threshold is 1.2 seconds; for a random phrase it is 1.8 seconds; for a random long sentence it is 16 seconds; for a fixed phrase it is 0.8 seconds.
2023. Calculating the first signal-to-noise ratio, the first average energy value, and the first effective speech duration of the enrollment audio;
It should be noted that the first signal-to-noise ratio, the first average energy value, and the first effective speech duration of the enrollment audio are calculated.
2024. Comparing, in turn, the first signal-to-noise ratio, the first average energy value, and the first effective speech duration of the enrollment audio with their corresponding first preset thresholds; if all three are above their corresponding first preset thresholds, determining that the speech quality of the enrollment audio is acceptable and executing the next step; otherwise, prompting the user to re-record the audio and returning to the step of obtaining enrollment audio;
It should be noted that the first signal-to-noise ratio, the first average energy value, and the first effective speech duration of the enrollment audio are compared in turn with their corresponding first preset thresholds. If all three are above their corresponding first preset thresholds, the speech quality of the enrollment audio is determined to be acceptable and the next step is executed; otherwise, the user is prompted to re-record the audio and the method returns to obtaining enrollment audio. The first preset threshold corresponding to the first signal-to-noise ratio is 10 decibels, the first preset threshold corresponding to the first average energy value is [1000, 30000], and the first preset threshold corresponding to the first effective speech duration was determined in the previous step.
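The threshold selection and quality check of steps 2022 and 2024 can be sketched as below, using the values stated in this embodiment. Two points are assumptions rather than statements from the patent: the average energy threshold is given as the interval [1000, 30000], which this sketch reads as a range the value must fall within, and the three metrics are taken as already computed because the patent does not describe how they are calculated. All names are hypothetical.

```python
from dataclasses import dataclass

# Minimum effective speech duration in seconds, per content type (step 2022).
MIN_SPEECH_SECONDS = {
    "random_digits": 1.2,
    "random_phrase": 1.8,
    "random_long_sentence": 16.0,
    "fixed_phrase": 0.8,
}
MIN_SNR_DB = 10.0                  # first preset threshold for the SNR
ENERGY_RANGE = (1000.0, 30000.0)   # assumed to be an inclusive range


@dataclass
class QualityMetrics:
    snr_db: float          # first signal-to-noise ratio, in decibels
    mean_energy: float     # first average energy value
    speech_seconds: float  # first effective speech duration


def is_quality_acceptable(metrics: QualityMetrics, content_type: str) -> bool:
    """Step 2024: return True when the enrollment audio passes all checks."""
    if content_type not in MIN_SPEECH_SECONDS:
        raise ValueError(f"unknown content type: {content_type!r}")
    low, high = ENERGY_RANGE
    return (
        metrics.snr_db > MIN_SNR_DB
        and low <= metrics.mean_energy <= high
        and metrics.speech_seconds > MIN_SPEECH_SECONDS[content_type]
    )
```

When the function returns False, the flow described above prompts the user to re-record and returns to step 201.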
203. Extracting the second voiceprint from the enrollment audio and saving it to the preset voiceprint library;
It should be noted that, once the speech quality of the enrollment audio has been determined to be acceptable, the second voiceprint is extracted from the enrollment audio and saved to the preset voiceprint library.
204. Obtaining monitored audio;
It should be noted that the monitored audio is obtained during monitoring.
205. Performing speech recognition on the monitored audio and, when the monitored audio contains a preset keyword, executing step 206;
It should be noted that speech recognition is performed on the monitored audio to judge whether it contains a preset keyword; if so, step 206 is executed. The preset keywords are set by the user.
206. Performing voiceprint recognition on the monitored audio and extracting the first voiceprint from the monitored audio;
It should be noted that voiceprint recognition is performed on the monitored audio in which a preset keyword is present, and the first voiceprint is extracted from the monitored audio.
207. Comparing the first voiceprint from the monitored audio with the second voiceprints in the preset voiceprint library to obtain a matching value;
It should be noted that the first voiceprint from the monitored audio is compared with the second voiceprints in the preset voiceprint library. The preset voiceprint library contains the second voiceprint of at least one enrolled user, so at least one matching value is obtained.
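Step 207 yields one matching value per enrolled voiceprint. The patent does not say how the matching value is computed; the sketch below assumes voiceprints are fixed-length numeric vectors and uses cosine similarity as the score, which is a common choice but not one the source confirms. The function names are hypothetical.

```python
import math
from typing import Dict, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def matching_values(
    first_voiceprint: Sequence[float],
    library: Dict[str, Sequence[float]],
) -> Dict[str, float]:
    """Step 207: score the first voiceprint against every second voiceprint."""
    return {
        speaker_id: cosine_similarity(first_voiceprint, second_voiceprint)
        for speaker_id, second_voiceprint in library.items()
    }
```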
208. Judging whether the matching value is higher than the preset matching threshold and, when it is, sending location information to the early-warning platform and responding to the early-warning platform;
It should be noted that judging whether the obtained matching value is higher than the preset matching threshold amounts to judging whether the monitored audio contains the voiceprint of a user enrolled in the preset voiceprint library; if so, location information is sent to the early-warning platform and the early-warning platform is responded to.
209. When the matching value is lower than the preset matching threshold, adding the first voiceprint from the monitored audio to the preset voiceprint library and responding to the early-warning platform;
It should be noted that a matching value lower than the preset matching threshold indicates that no related second voiceprint is stored in the preset voiceprint library. Because the monitored audio nevertheless contains a preset keyword, the first voiceprint corresponding to the monitored audio is saved to the preset voiceprint library, and the early-warning platform is responded to.
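The branch between steps 208 and 209 can be sketched as follows. The patent does not say what happens when the matching value equals the threshold; this sketch treats equality as "not matched", which is an assumption. The function name, the `new_id` parameter, and the returned tuple are hypothetical, and the early-warning platform calls are left to the caller.

```python
from typing import Dict, List, Optional, Tuple


def decide(
    first_voiceprint: List[float],
    values: Dict[str, float],
    library: Dict[str, List[float]],
    threshold: float,
    new_id: str,
) -> Tuple[str, bool]:
    """Return (speaker_id, send_location) for one keyword-bearing audio.

    Step 208: the best matching value exceeds the threshold, so the known
    speaker is reported and location information should be sent.
    Step 209: otherwise the unknown voiceprint is stored under new_id and
    only the early-warning response is triggered.
    """
    best_id: Optional[str] = max(values, key=values.get, default=None)
    if best_id is not None and values[best_id] > threshold:
        return best_id, True
    library[new_id] = list(first_voiceprint)
    return new_id, False
```

In both branches the caller responds to the early-warning platform; the boolean only tells it whether location information accompanies the response.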
The above describes another embodiment of the monitoring method based on voiceprint recognition provided by the invention. One embodiment of the monitoring device based on voiceprint recognition provided by the invention is described below.
Referring to Fig. 3, the present invention provides one embodiment of a monitoring device based on voiceprint recognition, comprising:
A first acquisition unit 301, configured to obtain monitored audio;
A speech recognition unit 302, configured to perform speech recognition on the monitored audio and, when the monitored audio contains a preset keyword, to hand over to the voiceprint comparison unit 303;
A voiceprint comparison unit 303, configured to perform voiceprint recognition on the monitored audio and to compare the first voiceprint corresponding to the monitored audio with the second voiceprints in the preset voiceprint library; if an identical voiceprint is matched, location information is sent to the early-warning platform and the early-warning platform is responded to.
The above describes one embodiment of the monitoring device based on voiceprint recognition provided by the invention. Another embodiment of the device is described below.
Referring to Fig. 4, the present invention provides another embodiment of a monitoring device based on voiceprint recognition, comprising:
A second acquisition unit 401, configured to obtain enrollment audio;
A speech quality detection unit 402, configured to perform speech quality detection on the enrollment audio;
The speech quality detection unit 402 comprises:
A judgment subunit 4021, configured to determine the content type of the enrollment audio, where the content types include random digits, random phrases, random long sentences, and fixed phrases;
A threshold determination subunit 4022, configured to determine the first preset threshold corresponding to the first effective speech duration according to the content type of the enrollment audio;
A calculation subunit 4023, configured to calculate the first signal-to-noise ratio, the first average energy value, and the first effective speech duration of the enrollment audio;
A comparison subunit 4024, configured to compare, in turn, the first signal-to-noise ratio, the first average energy value, and the first effective speech duration of the enrollment audio with their corresponding first preset thresholds; if all three are above their corresponding first preset thresholds, the speech quality of the enrollment audio is determined to be acceptable and the next step is executed; otherwise, the user is prompted to re-record the audio and the device returns to obtaining enrollment audio;
A voiceprint extraction unit 403, configured to extract the second voiceprint from the enrollment audio and save it to the preset voiceprint library;
A first acquisition unit 404, configured to obtain monitored audio;
A speech recognition unit 405, configured to perform speech recognition on the monitored audio and, when the monitored audio contains a preset keyword, to hand over to the voiceprint comparison unit 406;
A voiceprint comparison unit 406, configured to perform voiceprint recognition on the monitored audio and to compare the first voiceprint corresponding to the monitored audio with the second voiceprints in the preset voiceprint library; if an identical voiceprint is matched, location information is sent to the early-warning platform and the early-warning platform is responded to;
The voiceprint comparison unit 406 specifically comprises:
An extraction subunit 4061, configured to perform voiceprint recognition on the monitored audio and extract the first voiceprint from the monitored audio;
A comparison subunit 4062, configured to compare the first voiceprint from the monitored audio with the second voiceprints in the preset voiceprint library to obtain a matching value;
A matching subunit 4063, configured to judge whether the matching value is higher than the preset matching threshold and, when it is, to send location information to the early-warning platform and respond to the early-warning platform;
The matching subunit 4063 is further configured, when the matching value is lower than the preset matching threshold, to add the first voiceprint from the monitored audio to the preset voiceprint library and respond to the early-warning platform.
Those skilled in the art will clearly understand that, for convenience and brevity of description, the specific working processes of the system, device, and units described above correspond to the processes in the foregoing method embodiments and are not repeated here.
In the several embodiments provided in this application, it should be understood that the disclosed system, device, and method may be implemented in other ways. For example, the device embodiments described above are merely illustrative. The division into units is only a division by logical function, and other divisions are possible in an actual implementation: multiple units or components may be combined or integrated into another system, or some features may be omitted or not executed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be indirect coupling or communication connection through some interfaces, devices, or units, and may be electrical, mechanical, or of another form.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of an embodiment.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.
If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a computer-readable storage medium. On this understanding, the technical solution of the present invention in essence, or the part of it that contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to execute all or some of the steps of the methods of the embodiments of the present invention. The storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, read-only memory (ROM), random access memory (RAM), a magnetic disk, or an optical disc.
The above embodiments only illustrate the technical solutions of the present invention and do not limit them. Although the invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that they may still modify the technical solutions recorded in the foregoing embodiments or replace some of their technical features with equivalents, and that such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the spirit and scope of the technical solutions of the embodiments of the present invention.

Claims (10)

1. a kind of monitoring method based on Application on Voiceprint Recognition, which is characterized in that including:
The audio that S1, acquisition listen to;
S2, the audio progress speech recognition listened to is held when the audio listened to includes preset keyword Row step S3;
S3, Application on Voiceprint Recognition is carried out to the audio listened to, and by corresponding first vocal print of the audio listened to and in advance The second vocal print set in vocal print library is compared, if being matched to identical vocal print, sends location information to early warning platform and sound Answer the early warning platform.
2. The monitoring method based on voiceprint recognition according to claim 1, characterized in that, before step S1, the method further comprises:
S01, obtaining enrolled audio;
S02, extracting a second voiceprint from the enrolled audio and saving it to the preset voiceprint library.
3. The monitoring method based on voiceprint recognition according to claim 2, characterized in that, after step S01 and before step S02, the method further comprises:
performing voice quality detection on the enrolled audio, including:
calculating a first signal-to-noise ratio, a first average energy value, and a first effective speech duration of the enrolled audio;
comparing, in turn, the first signal-to-noise ratio, the first average energy value, and the first effective speech duration of the enrolled audio with the corresponding first preset thresholds; if the first signal-to-noise ratio, the first average energy value, and the first effective speech duration are all above the corresponding first preset thresholds, determining that the voice quality of the enrolled audio is qualified and executing the next step; otherwise, prompting the user to record the audio again and returning to the step of obtaining enrolled audio.
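The quality check in this claim can be illustrated with a small sketch. The claim does not define how the three quantities are computed, so everything below is an assumption made for illustration: audio is a list of float samples, frames whose mean squared amplitude reaches `speech_energy_floor` are counted as speech and the rest as noise, the signal-to-noise ratio is the ratio of mean speech-frame energy to mean noise-frame energy in decibels, and the function names and default values are hypothetical.

```python
import math
from typing import List, Sequence, Tuple


def frame_energies(samples: Sequence[float], frame_len: int) -> List[float]:
    """Mean squared amplitude of each full, non-overlapping frame."""
    return [
        sum(s * s for s in samples[i:i + frame_len]) / frame_len
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]


def quality_metrics(
    samples: Sequence[float],
    sample_rate: int,
    frame_len: int = 160,               # assumed frame size
    speech_energy_floor: float = 1e-3,  # assumed speech/noise boundary
) -> Tuple[float, float, float]:
    """Return (snr_db, mean_energy, effective_speech_seconds)."""
    energies = frame_energies(samples, frame_len)
    speech = [e for e in energies if e >= speech_energy_floor]
    noise = [e for e in energies if e < speech_energy_floor]

    mean_energy = sum(energies) / len(energies) if energies else 0.0
    speech_seconds = len(speech) * frame_len / sample_rate

    if not speech:
        snr_db = float("-inf")          # no speech detected at all
    else:
        noise_power = sum(noise) / len(noise) if noise else 0.0
        speech_power = sum(speech) / len(speech)
        snr_db = (
            float("inf") if noise_power == 0.0
            else 10.0 * math.log10(speech_power / noise_power)
        )
    return snr_db, mean_energy, speech_seconds


def passes_quality(
    metrics: Tuple[float, float, float],
    thresholds: Tuple[float, float, float],
) -> bool:
    # The claim requires every metric to be above its own threshold.
    return all(m > t for m, t in zip(metrics, thresholds))
```

For example, `passes_quality(quality_metrics(samples, 8000), (20.0, 0.01, 0.3))` would accept a recording only if it exceeds 20 dB, a mean energy of 0.01, and 0.3 s of detected speech; these threshold values are placeholders, not figures from the application.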
4. The monitoring method based on voiceprint recognition according to claim 3, characterized in that, before calculating the first signal-to-noise ratio, the first average energy value, and the first effective speech duration of the enrolled audio, the method further comprises:
determining the content type of the enrolled audio, the content type including random digits, random phrases, random long sentences, and fixed phrases;
determining, according to the content type of the enrolled audio, the first preset threshold corresponding to the first effective speech duration.
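One simple way to realize a content-dependent duration threshold is a lookup table keyed by content type. The sketch below is an assumption for illustration only: the enum `ContentType`, the table `MIN_SPEECH_SECONDS`, and every numeric value in it are hypothetical, since the claim names the four content types but gives no threshold values and does not say how the content type is detected.

```python
from enum import Enum


class ContentType(Enum):
    RANDOM_DIGITS = "random_digits"
    RANDOM_PHRASE = "random_phrase"
    RANDOM_LONG_SENTENCE = "random_long_sentence"
    FIXED_PHRASE = "fixed_phrase"


# Hypothetical minimum effective speech durations, in seconds.
# Longer prompts are assumed to need more detected speech to be accepted.
MIN_SPEECH_SECONDS = {
    ContentType.RANDOM_DIGITS: 1.5,
    ContentType.FIXED_PHRASE: 1.5,
    ContentType.RANDOM_PHRASE: 2.0,
    ContentType.RANDOM_LONG_SENTENCE: 4.0,
}


def duration_threshold(content_type: ContentType) -> float:
    """Return the effective-speech-duration threshold for a content type."""
    return MIN_SPEECH_SECONDS[content_type]
```

Keeping the thresholds in a table lets them be tuned per content type without touching the quality-check logic.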
5. The monitoring method based on voiceprint recognition according to claim 1, characterized in that step S3 specifically comprises:
performing voiceprint recognition on the monitored audio and extracting the first voiceprint from the monitored audio;
comparing the first voiceprint in the monitored audio with the second voiceprints in the preset voiceprint library to obtain a matching value;
judging whether the matching value is higher than a preset matching threshold, and, when the matching value is determined to be higher than the preset matching threshold, sending location information to the early-warning platform and responding to the early-warning platform.
6. The monitoring method based on voiceprint recognition according to claim 5, characterized in that, when the matching value is lower than the preset matching threshold, the first voiceprint in the monitored audio is added to the preset voiceprint library and the early-warning platform is responded to.
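The threshold decision in claims 5 and 6 can be sketched as below. The claims do not say how the matching value is computed, so this sketch assumes voiceprints are fixed-length numeric vectors and uses cosine similarity as the matching value; `cosine_similarity`, `compare_and_act`, and the callback `send_location` are hypothetical names. The claims also leave the case of a matching value exactly equal to the threshold unspecified; here it is treated like the lower case.

```python
import math
from typing import Callable, List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Assumed matching value: cosine of the angle between two voiceprints."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def compare_and_act(
    first: List[float],
    library: List[List[float]],
    threshold: float,
    send_location: Callable[[], None],  # hypothetical platform call
) -> Tuple[str, float]:
    # Best matching value of the first voiceprint against every stored
    # second voiceprint; -1.0 (the cosine minimum) if the library is empty.
    best = max((cosine_similarity(first, second) for second in library),
               default=-1.0)
    if best > threshold:
        # Claim 5: matching value above the threshold, so report location.
        send_location()
        return "alert", best
    # Claim 6: otherwise store the unmatched voiceprint in the library.
    library.append(first)
    return "stored", best
```

With a library containing `[1.0, 0.0, 0.0]` and a threshold of 0.8, the vector `[0.9, 0.1, 0.0]` triggers the alert path, while `[0.0, 1.0, 0.0]` is stored as a new entry.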
7. A monitoring device based on voiceprint recognition, characterized by comprising:
a first acquisition unit, configured to obtain monitored audio;
a speech recognition unit, configured to perform speech recognition on the monitored audio and to pass control to the voiceprint comparison unit when the monitored audio contains a preset keyword;
a voiceprint comparison unit, configured to perform voiceprint recognition on the monitored audio, compare a first voiceprint corresponding to the monitored audio with second voiceprints in a preset voiceprint library, and, if an identical voiceprint is matched, send location information to an early-warning platform and respond to the early-warning platform.
8. The monitoring device based on voiceprint recognition according to claim 7, characterized by further comprising:
a second acquisition unit, configured to obtain enrolled audio;
a voiceprint extraction unit, configured to extract a second voiceprint from the enrolled audio and save it to the preset voiceprint library.
9. The monitoring device based on voiceprint recognition according to claim 8, characterized by further comprising:
a voice quality detection unit, configured to perform voice quality detection on the enrolled audio;
the voice quality detection unit comprises:
a calculation subunit, configured to calculate a first signal-to-noise ratio, a first average energy value, and a first effective speech duration of the enrolled audio;
a comparison subunit, configured to compare, in turn, the first signal-to-noise ratio, the first average energy value, and the first effective speech duration of the enrolled audio with the corresponding first preset thresholds; if the first signal-to-noise ratio, the first average energy value, and the first effective speech duration are all above the corresponding first preset thresholds, to determine that the voice quality of the enrolled audio is qualified and execute the next step; otherwise, to prompt the user to record the audio again and return to obtaining enrolled audio.
10. The monitoring device based on voiceprint recognition according to claim 9, characterized in that the voice quality detection unit further comprises:
a judgment subunit, configured to determine the content type of the enrolled audio, the content type including random digits, random phrases, random long sentences, and fixed phrases;
a threshold determination subunit, configured to determine, according to the content type of the enrolled audio, the first preset threshold corresponding to the first effective speech duration.
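As a rough illustration of how the enrollment-side units of claims 8 to 10 could cooperate, the sketch below wires them together as callables and implements the "prompt the user and reacquire" loop. All names (`EnrollmentDevice`, `acquire`, `quality_ok`, `extract_voiceprint`, `prompt_retry`) are hypothetical, and the `max_attempts` limit is an added assumption; the claims do not bound the number of retries.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional, Sequence

Audio = Sequence[float]


@dataclass
class EnrollmentDevice:
    acquire: Callable[[], Audio]                        # second acquisition unit
    quality_ok: Callable[[Audio], bool]                 # voice quality detection unit
    extract_voiceprint: Callable[[Audio], List[float]]  # voiceprint extraction unit
    prompt_retry: Callable[[], None]                    # asks the user to record again
    library: List[List[float]] = field(default_factory=list)
    max_attempts: int = 3                               # assumption, not in the claims

    def enroll(self) -> Optional[List[float]]:
        """Acquire audio until it passes the quality check, then store its voiceprint."""
        for _ in range(self.max_attempts):
            audio = self.acquire()
            if self.quality_ok(audio):
                voiceprint = self.extract_voiceprint(audio)
                self.library.append(voiceprint)
                return voiceprint
            # Quality check failed: prompt and go back to acquisition.
            self.prompt_retry()
        return None
```

Passing the units in as callables keeps the sketch independent of any particular recorder, quality metric, or voiceprint model.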
CN201810394740.6A 2018-04-27 2018-04-27 A kind of monitoring method and device based on Application on Voiceprint Recognition Pending CN108766439A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810394740.6A CN108766439A (en) 2018-04-27 2018-04-27 A kind of monitoring method and device based on Application on Voiceprint Recognition

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810394740.6A CN108766439A (en) 2018-04-27 2018-04-27 A kind of monitoring method and device based on Application on Voiceprint Recognition

Publications (1)

Publication Number Publication Date
CN108766439A true CN108766439A (en) 2018-11-06

Family

ID=64012425

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810394740.6A Pending CN108766439A (en) 2018-04-27 2018-04-27 A kind of monitoring method and device based on Application on Voiceprint Recognition

Country Status (1)

Country Link
CN (1) CN108766439A (en)

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101833843A (en) * 2009-03-13 2010-09-15 新奥特硅谷视频技术有限责任公司 Monitoring system based on voiceprint authentication
CN103646481A (en) * 2013-11-27 2014-03-19 大连创达技术交易市场有限公司 Ward calling system
CN104065836A (en) * 2014-05-30 2014-09-24 小米科技有限责任公司 Method and device for monitoring calls
CN104269016A (en) * 2014-09-22 2015-01-07 北京奇艺世纪科技有限公司 Alarm method and device
US20150131788A1 (en) * 2010-09-07 2015-05-14 Securus Technologies Multi-party conversation analyzer & logger
CN104954429A (en) * 2015-04-26 2015-09-30 安徽味唯网络科技有限公司 Method of automatic help seeking system in danger
CN105321514A (en) * 2014-05-28 2016-02-10 西安中兴新软件有限责任公司 Alarm method and terminal
CN105679313A (en) * 2016-04-15 2016-06-15 福建新恒通智能科技有限公司 Audio recognition alarm system and method
CN107886958A (en) * 2017-11-10 2018-04-06 广州势必可赢网络科技有限公司 Express cabinet pickup method and device based on voiceprint

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111275909A (en) * 2018-12-04 2020-06-12 阿里巴巴集团控股有限公司 Security early warning method and device
CN111275909B (en) * 2018-12-04 2021-12-28 阿里巴巴集团控股有限公司 Security early warning method and device
CN109754811A (en) * 2018-12-10 2019-05-14 平安科技(深圳)有限公司 Sound-source follow-up method, apparatus, equipment and storage medium based on biological characteristic
CN109754811B (en) * 2018-12-10 2023-06-02 平安科技(深圳)有限公司 Sound source tracking method, device, equipment and storage medium based on biological characteristics
CN109616125A (en) * 2018-12-13 2019-04-12 苏州思必驰信息科技有限公司 Monitoring method and system based on Application on Voiceprint Recognition
CN109634554A (en) * 2018-12-18 2019-04-16 三星电子(中国)研发中心 Method and apparatus for output information
CN109448722A (en) * 2018-12-28 2019-03-08 合肥凯捷技术有限公司 A kind of speech analysis system
CN109817224A (en) * 2019-02-22 2019-05-28 深圳云游四海信息科技有限公司 A kind of voice sensitive word monitor system and method
CN110827829A (en) * 2019-10-24 2020-02-21 秒针信息技术有限公司 Passenger flow analysis method and system based on voice recognition
CN110830771A (en) * 2019-11-11 2020-02-21 广州国音智能科技有限公司 Intelligent monitoring method, device, equipment and computer readable storage medium
WO2021133504A1 (en) * 2019-12-23 2021-07-01 Motorola Solutions, Inc. Using a sensor hub to generate a tracking profile for tracking an object
US11188775B2 (en) 2019-12-23 2021-11-30 Motorola Solutions, Inc. Using a sensor hub to generate a tracking profile for tracking an object
CN111128199A (en) * 2019-12-27 2020-05-08 中国人民解放军陆军工程大学 Sensitive speaker monitoring and recording control method and system based on deep learning
CN111768789A (en) * 2020-08-03 2020-10-13 上海依图信息技术有限公司 Electronic equipment and method, device and medium for determining identity of voice sender thereof
CN111768789B (en) * 2020-08-03 2024-02-23 上海依图信息技术有限公司 Electronic equipment, and method, device and medium for determining identity of voice generator of electronic equipment
CN112992154A (en) * 2021-05-08 2021-06-18 北京远鉴信息技术有限公司 Voice identity determination method and system based on enhanced voiceprint library
CN113593581A (en) * 2021-07-12 2021-11-02 西安讯飞超脑信息科技有限公司 Voiceprint distinguishing method and device, computer equipment and storage medium
CN113593581B (en) * 2021-07-12 2024-04-19 西安讯飞超脑信息科技有限公司 Voiceprint discrimination method, voiceprint discrimination device, computer device and storage medium
WO2024197594A1 (en) * 2023-03-28 2024-10-03 京东方科技集团股份有限公司 Audio monitoring method, system and device, and computer storage medium

Similar Documents

Publication Publication Date Title
CN108766439A (en) A kind of monitoring method and device based on Application on Voiceprint Recognition
CN109769099B (en) Method and device for detecting abnormality of call person
CN105139858B (en) A kind of information processing method and electronic equipment
CN105869645B (en) Voice data processing method and device
CN108039176A (en) Voiceprint authentication method and device for preventing recording attack and access control system
US9142210B2 (en) Method and device for speaker recognition
CN105976815A (en) Vehicle voice recognition method and vehicle voice recognition device
CN108564948A (en) A kind of audio recognition method and electronic equipment
CN109346055A (en) Active denoising method, device, earphone and computer storage medium
CN108615537A (en) A kind of multichannel way of recording, apparatus and system
Kiktova et al. Gun type recognition from gunshot audio recordings
CN110767214A (en) Speech recognition method and device and speech recognition system
CN111816185A (en) Method and device for identifying speaker in mixed voice
CN107871236A (en) Electronic equipment voiceprint payment method and device
CN106330915A (en) Voice verification processing method and device
WO2001024163A1 (en) Speaker verification using hidden markov models with clusters and score thresholding
JP2008145988A (en) Noise detecting device and noise detecting method
CN107680592A (en) A kind of mobile terminal sound recognition methods and mobile terminal and storage medium
CN110599751A (en) Danger alarm method and device, computer equipment and storage medium
CN106971715A (en) A kind of speech recognition equipment applied to robot
CN113270112A (en) Electronic camouflage voice automatic distinguishing and restoring method and system
CN110060682B (en) Sound box control method and device
CN108806693A (en) Method and device for preventing Internet from refreshing list
US20230274377A1 (en) An end-to-end proctoring system and method for conducting a secure online examination
CN116013255A (en) High-security voice recognition method, system, equipment and medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20181106