CN109389993A - A kind of data under voice method, apparatus, equipment and storage medium - Google Patents
A kind of data under voice method, apparatus, equipment and storage medium Download PDFInfo
- Publication number
- CN109389993A CN109389993A CN201811533747.8A CN201811533747A CN109389993A CN 109389993 A CN109389993 A CN 109389993A CN 201811533747 A CN201811533747 A CN 201811533747A CN 109389993 A CN109389993 A CN 109389993A
- Authority
- CN
- China
- Prior art keywords
- voice
- parameter threshold
- data
- metrics
- testing result
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/84—Detection of presence or absence of voice signals for discriminating voice from noise
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L2025/783—Detection of presence or absence of voice signals based on threshold decision
Abstract
The invention discloses a kind of data under voice methods, method includes the following steps: receiving the data under voice triggering command of user;Acquire the tone testing data of user;Voice activity detection is carried out to tone testing data, obtains the testing result of each voice metrics;Adjust the corresponding parameter threshold of the unacceptable voice metrics of testing result;The step of repeating the tone testing data of acquisition user, until saving parameter threshold adjusted when the testing result of each voice metrics passes through;During acquiring the voice sample data of user, voice activity detection is carried out to each voice sample data using parameter threshold adjusted.Using technical solution provided by the embodiment of the present invention, the best recording result of terminal is reached by adjusting parameter threshold, guarantees that effective voice sample data can be collected by different terminals.The invention also discloses a kind of data under voice device, equipment and storage mediums, have relevant art effect.
Description
Technical field
The present invention relates to data acquisition technology field, more particularly to a kind of data under voice method, apparatus, equipment and
Storage medium.
Background technique
With the raising of the degree of concern to information security, sound groove recognition technology in e is had gradually developed.In voice print database
Process of construction in, in the training process of Application on Voiceprint Recognition model, require to acquire a large amount of voice sample data.
Currently, being mostly the acquisition for carrying out voice sample data by terminals such as mobile phone, tablet computer, laptops.For
Effective voice sample data can be collected, screening and Weeding after acquisition to invalid voice is avoided, is adopting
Will increase voice activity detection, i.e. VAD during collection voice sample data, full name is Voice Activity Detection,
Also known as speech terminals detection, speech endpoint detection refer to the presence or absence that voice is detected in noise circumstance.Difference is mobile eventually
End uses identical voice activity detection standard.
But because the microphone sound quality of different terminals is different, the audio frequency effect for recording out is also different, using same
Voice activity detection standard detects voice sample data, for the terminal having, may be such that a large amount of speech samples
Data are removed, and are increased collecting work amount, are influenced collecting efficiency.
Summary of the invention
The object of the present invention is to provide a kind of data under voice method, apparatus, equipment and storage mediums, to reduce acquisition
Workload improves collecting efficiency.
In order to solve the above technical problems, the invention provides the following technical scheme:
A kind of data under voice method, comprising:
Receive the data under voice triggering command of user;
Acquire the tone testing data of the user;
Voice activity detection is carried out to the tone testing data, obtains the testing result of each voice metrics;
Adjust the corresponding parameter threshold of the unacceptable voice metrics of testing result;
The step of repeating the tone testing data of the acquisition user, until the detection knot of each voice metrics
When fruit passes through, parameter threshold adjusted is saved;
During acquiring the voice sample data of the user, using parameter threshold adjusted to each voice sample
Notebook data carries out voice activity detection.
In a kind of specific embodiment of the invention, voice activity detection is carried out to the tone testing data described
Before, further includes:
Determine whether the corresponding parameter threshold of each voice metrics is default value;
If it is not, then the corresponding parameter threshold of each voice metrics is adjusted to default value.
In a kind of specific embodiment of the invention, it is described receive user data under voice triggering command it
Afterwards, before the tone testing data of the acquisition user, further includes:
Export test prompts information.
In a kind of specific embodiment of the invention, the voice metrics include energy value, signal-to-noise ratio and effective voice.
In a kind of specific embodiment of the invention, the corresponding ginseng of the unacceptable voice metrics of the adjustment testing result
Number threshold value, comprising:
If the unacceptable voice metrics of testing result are energy value, reduce energy parameter threshold value.
In a kind of specific embodiment of the invention, the corresponding ginseng of the unacceptable voice metrics of the adjustment testing result
Number threshold value, comprising:
If the unacceptable voice metrics of testing result are signal-to-noise ratio, reduce signal-to-noise ratio parameter threshold.
In a kind of specific embodiment of the invention, the corresponding ginseng of the unacceptable voice metrics of the adjustment testing result
Number threshold value, comprising:
If the unacceptable voice metrics of testing result are effective voice, reduce effective voice parameter threshold.
A kind of data under voice device, comprising:
Triggering command receiving module, for receiving the data under voice triggering command of user;
Test data collection module, for acquiring the tone testing data of the user;
Testing result obtains module, for carrying out voice activity detection to the tone testing data, obtains each voice
The testing result of index;
Parameter threshold adjusts module, for adjusting the corresponding parameter threshold of the unacceptable voice metrics of testing result;
Module is repeated, for the step of repeating the tone testing data of the acquisition user, until each
When the testing result of a voice metrics passes through, trigger parameter threshold value preserving module;
The parameter threshold preserving module, for saving parameter threshold adjusted;
Sample data acquisition module, for during acquiring the voice sample data of the user, after adjustment
Parameter threshold to each voice sample data carry out voice activity detection.
A kind of data under voice equipment, comprising:
Memory, for storing computer program;
Processor realizes the step of data under voice method described in any of the above-described when for executing the computer program
Suddenly.
A kind of computer readable storage medium is stored with computer program on the computer readable storage medium, described
The step of data under voice method described in any of the above-described is realized when computer program is executed by processor.
Using technical solution provided by the embodiment of the present invention, in the data under voice triggering command for receiving user
When, can tone testing data to collected user carry out voice activity detection, obtain the detection knot of each voice metrics
Fruit, and the corresponding parameter threshold of the unacceptable voice metrics of testing result is adjusted, acquisition, detecting step are repeated, until each
When the testing result of a voice metrics passes through, parameter threshold adjusted is saved, is acquiring the voice sample data of user
In the process, voice activity detection is carried out to each voice sample data using parameter threshold adjusted.By adjusting parameter threshold
Value reaches the best recording result of terminal, guarantees that effective voice sample data can be collected by different terminals.
Detailed description of the invention
In order to more clearly explain the embodiment of the invention or the technical proposal in the existing technology, to embodiment or will show below
There is attached drawing needed in technical description to be briefly described, it should be apparent that, the accompanying drawings in the following description is only this
Some embodiments of invention for those of ordinary skill in the art without creative efforts, can be with
It obtains other drawings based on these drawings.
Fig. 1 is a kind of implementation flow chart of data under voice method in the embodiment of the present invention;
Fig. 2 is a kind of structural schematic diagram of data under voice device in the embodiment of the present invention;
Fig. 3 is a kind of structural schematic diagram of data under voice equipment in the embodiment of the present invention.
Specific embodiment
In order to enable those skilled in the art to better understand the solution of the present invention, with reference to the accompanying drawings and detailed description
The present invention is described in further detail.Obviously, described embodiments are only a part of the embodiments of the present invention, rather than
Whole embodiments.Based on the embodiments of the present invention, those of ordinary skill in the art are not making creative work premise
Under every other embodiment obtained, shall fall within the protection scope of the present invention.
Core of the invention is to provide a kind of data under voice method, this method can be applied to mobile phone, tablet computer,
The terminals such as desktop computer, laptop, terminal can acquire user in the data under voice triggering command for receiving user
Tone testing data, to tone testing data carry out voice activity detection, by testing result adjust relevant parameter threshold value, and
During acquiring the voice sample data of user, it is living that voice is carried out to voice sample data using parameter threshold adjusted
Dynamic detection, the best recording result of terminal is reached by adjusting parameter threshold, guarantees to collect effectively by different terminals
Voice sample data.
Specifically, terminal can carry out above-mentioned processing operation by the parameter threshold to voice activity detection automatically, can also lead to
The a certain application program crossed in terminal carries out above-mentioned processing operation, for example carries out above-mentioned processing operation by recording application program.
For convenience of description, the explanation of step each in the embodiment of the present invention is accordingly described in terminal point.
It is shown in Figure 1, it is a kind of implementation flow chart of data under voice method provided by the embodiment of the present invention, it should
Method may comprise steps of:
S110: the data under voice triggering command of user is received.
In practical applications, user can carry out the recording of voice data by terminal.When detecting that user enters voice
When activity detection parameter configures the page, the data under voice triggering command for receiving user can be considered as, show that user will
The recording of using terminal progress voice data.
After the data under voice triggering command for receiving user, the operation of step S120 can be continued to execute.
S120: the tone testing data of user are acquired.
After receiving the data under voice triggering command of user, the tone testing data of user can be acquired.
Specifically, test prompts information can be exported first, prompt user with normal word speed, sound size, quiet
One section of word is read in environment.After user completes voice data recording, using the voice data as tone testing data.
S130: voice activity detection is carried out to tone testing data, obtains the testing result of each voice metrics.
After the tone testing data for acquiring user, tone testing data can be carried out with voice activity detection, it is specific to detect
Method is the prior art, and the embodiment of the present invention repeats no more this.
The voice metrics of voice activity detection include energy value, signal-to-noise ratio and effective voice.
Energy value is the sound size quantified;
Signal-to-noise ratio is the energy ratio for measuring voice signal and noise signal in a segment of audio, the i.e. clean level of voice;
Effective voice is the phonological component spoken in a segment of audio comprising people.
After carrying out voice activity detection to tone testing data, the testing result of each voice metrics can be obtained.Detection
It as a result include passing through or not passing through.
S140: the corresponding parameter threshold of the adjustment unacceptable voice metrics of testing result.
After carrying out voice activity detection to tone testing data, the testing result of each voice metrics can get.For every
A voice metrics show the corresponding parameter threshold setting of the voice metrics if the testing result of the voice metrics is to pass through
It is relatively reasonable, if the testing result of the voice metrics is not pass through, the corresponding parameter threshold of the adjustable voice metrics.
The initial value of the corresponding parameter threshold of each voice metrics can be the preset value with higher evaluation criteria.
Specifically, can reduce energy parameter threshold value if the unacceptable voice metrics of testing result are energy value.
Tone testing data are one section that collected user is read with normal word speed, sound size in quiet environment
Words, if determining that the corresponding voice metrics detection of energy value does not pass through by voice activity detection, then showing the voice metrics pair
The energy parameter threshold value answered may be arranged larger, can reduce energy parameter threshold value.Specifically, prompt information can be exported,
Prompt user manually adjusts, or is spaced automatic reduce according to setting.If present energy parameter threshold is 10,2 are divided between setting,
Then energy parameter threshold can be reduced to 8.
If the unacceptable voice metrics of testing result are signal-to-noise ratio, signal-to-noise ratio parameter threshold can reduce.
As described, tone testing data be collected user with normal word speed, sound size quiet
One section of word that environment is read determine the corresponding voice metrics inspection of signal-to-noise ratio if carrying out voice activity detection to tone testing data
Survey does not pass through, then it is larger to show that the corresponding signal-to-noise ratio parameter threshold of the voice metrics may be arranged, and can reduce signal-to-noise ratio ginseng
Number threshold value.Specifically, prompt information can be exported, user is prompted to manually adjust, or is spaced automatic reduce according to setting.Such as work as
Preceding signal-to-noise ratio parameter threshold is 10, and 2 are divided between setting, then signal-to-noise ratio parameter threshold can be reduced to 8.
If the unacceptable voice metrics of testing result are effective voice, effective voice parameter threshold can reduce.
As described, tone testing data be collected user with normal word speed, sound size quiet
One section of word that environment is read determine the corresponding voice metrics of effective voice if carrying out voice activity detection to tone testing data
Detection does not pass through, then it is larger to show that the corresponding effective voice parameter threshold of the voice metrics may be arranged, and can reduce effectively
Voice parameter threshold.Specifically, prompt information can be exported, user is prompted to manually adjust, or subtracted automatically according to setting interval
It is small.If the currently active voice parameter threshold is 10 seconds, it is divided between setting 2 seconds, then effective voice parameter threshold can be reduced to 8
Second.
After adjusting the corresponding parameter threshold of the unacceptable voice metrics of testing result, can repeat step S120 and its
It operates below.
S150: when the testing result of each voice metrics passes through, parameter threshold adjusted is saved.
Voice activity detection is carried out to tone testing data, the testing result of each voice metrics is obtained, if there is inspection
The unacceptable voice metrics of result to be surveyed, then adjust the corresponding parameter threshold of the voice metrics, repeated acquisition, detection operate, if
The testing result of each voice metrics passes through, then can save the parameter threshold after final adjustment.
S160: during acquiring the voice sample data of user, using parameter threshold adjusted to each voice
Sample data carries out voice activity detection.
By the operation of above-mentioned several steps, the corresponding parameter threshold of each voice metrics of voice activity detection is carried out
After corresponding adjustment, the testing result of each voice metrics can be made equal when carrying out voice activity detection to tone testing data
Pass through.
During acquiring the voice sample data of user, parameter threshold adjusted can be used to each voice sample
Notebook data carries out voice activity detection, to screen and reject invalid voice data.
It can in the data under voice triggering command for receiving user using method provided by the embodiment of the present invention
Voice activity detection is carried out with the tone testing data to collected user, obtains the testing result of each voice metrics, and
The corresponding parameter threshold of the unacceptable voice metrics of testing result is adjusted, acquisition, detecting step are repeated, until each voice
When the testing result of index passes through, parameter threshold adjusted is saved, during acquiring the voice sample data of user,
Voice activity detection is carried out to each voice sample data using parameter threshold adjusted.Reach eventually by adjusting parameter threshold
The best recording result at end guarantees that effective voice sample data can be collected by different terminals.
In one embodiment of the invention, before carrying out voice activity detection to tone testing data, this method is also
It may comprise steps of:
Determine whether the corresponding parameter threshold of each voice metrics is default value;
If it is not, then the corresponding parameter threshold of each voice metrics is adjusted to default value.
In embodiments of the present invention, in the data under voice triggering command for receiving user, voice can be first determined
Whether the corresponding parameter threshold of each voice metrics of activity detection is default value, which can be based on to compared with multiple terminals
Microphone location detection setting an empirical value, can also be it is corresponding with present terminal model it is preset one warp
Test value.
If it is, can directly be executed, if it is not, then can the step of carrying out voice activity detection to tone testing data
The corresponding parameter threshold of each voice metrics is first adjusted to default value, then executes and speech activity is carried out to tone testing data
The step of detection.Percent of pass is detected to improve.
The embodiment of the present invention can carry out the adjustment of voice activity detection parameter threshold to different terminals, and end is effectively ensured
The voice for recording out is held to meet acquisition standard.
Corresponding to above method embodiment, the embodiment of the invention also provides a kind of data under voice devices, hereafter
A kind of data under voice device of description can correspond to each other reference with a kind of above-described data under voice method.
Shown in Figure 2, which comprises the following modules:
Triggering command receiving module 210, for receiving the data under voice triggering command of user;
Test data collection module 220, for acquiring the tone testing data of user;
Testing result obtains module 230, for carrying out voice activity detection to tone testing data, obtains each voice and refers to
Target testing result;
Parameter threshold adjusts module 240, for adjusting the corresponding parameter threshold of the unacceptable voice metrics of testing result;
Module 250 is repeated, for the step of repeating the tone testing data of acquisition user, until each voice
When the testing result of index passes through, trigger parameter threshold value preserving module 260;
Parameter threshold preserving module 260, for saving parameter threshold adjusted;
Sample data acquisition module 270, for during acquiring the voice sample data of user, use to be adjusted
Parameter threshold carries out voice activity detection to each voice sample data.
It can in the data under voice triggering command for receiving user using device provided by the embodiment of the present invention
Voice activity detection is carried out with the tone testing data to collected user, obtains the testing result of each voice metrics, and
The corresponding parameter threshold of the unacceptable voice metrics of testing result is adjusted, acquisition, detecting step are repeated, until each voice
When the testing result of index passes through, parameter threshold adjusted is saved, during acquiring the voice sample data of user,
Voice activity detection is carried out to each voice sample data using parameter threshold adjusted.Reach eventually by adjusting parameter threshold
The best recording result at end guarantees that effective voice sample data can be collected by different terminals.
In a kind of specific embodiment of the invention, further includes default value setup module, is used for:
Before carrying out voice activity detection to tone testing data, determine that the corresponding parameter threshold of each voice metrics is
No is default value;
If it is not, then the corresponding parameter threshold of each voice metrics is adjusted to default value.
In a kind of specific embodiment of the invention, further includes prompt information output module, is used for:
After the data under voice triggering command for receiving user, before the tone testing data of acquisition user, output
Test prompts information.
In a kind of specific embodiment of the invention, voice metrics include energy value, signal-to-noise ratio and effective voice.
In a kind of specific embodiment of the invention, parameter threshold adjusts module 240, is specifically used for:
If the unacceptable voice metrics of testing result are energy value, reduce energy parameter threshold value.
In a kind of specific embodiment of the invention, parameter threshold adjusts module 240, is specifically used for:
If the unacceptable voice metrics of testing result are signal-to-noise ratio, reduce signal-to-noise ratio parameter threshold.
In a kind of specific embodiment of the invention, parameter threshold adjusts module 240, is specifically used for:
If the unacceptable voice metrics of testing result are effective voice, reduce effective voice parameter threshold.
Corresponding to above method embodiment, the embodiment of the invention also provides a kind of data under voice equipment, such as Fig. 3
It is shown, comprising:
Memory 310, for storing computer program;
Processor 320, the step of above-mentioned data under voice method is realized when for executing computer program.
Corresponding to above method embodiment, the embodiment of the invention also provides a kind of computer readable storage medium, meters
It is stored with computer program on calculation machine readable storage medium storing program for executing, realizes that above-mentioned voice data is adopted when computer program is executed by processor
The step of set method.
Each embodiment in this specification is described in a progressive manner, the highlights of each of the examples are with it is other
The difference of embodiment, same or similar part may refer to each other between each embodiment.
Professional further appreciates that, unit described in conjunction with the examples disclosed in the embodiments of the present disclosure
And algorithm steps, can be realized with electronic hardware, computer software, or a combination of the two, in order to clearly demonstrate hardware and
The interchangeability of software generally describes each exemplary composition and step according to function in the above description.These
Function is implemented in hardware or software actually, the specific application and design constraint depending on technical solution.Profession
Technical staff can use different methods to achieve the described function each specific application, but this realization is not answered
Think beyond the scope of this invention.
The step of method described in conjunction with the examples disclosed in this document or algorithm, can directly be held with hardware, processor
The combination of capable software module or the two is implemented.Software module can be placed in random access memory (RAM), memory, read-only deposit
Reservoir (ROM), electrically programmable ROM, electrically erasable ROM, register, hard disk, moveable magnetic disc, CD-ROM or technology
In any other form of storage medium well known in field.
Used herein a specific example illustrates the principle and implementation of the invention, and above embodiments are said
It is bright to be merely used to help understand technical solution of the present invention and its core concept.It should be pointed out that for the common of the art
, without departing from the principle of the present invention, can be with several improvements and modifications are made to the present invention for technical staff, these
Improvement and modification are also fallen within the protection scope of the claims of the present invention.
Claims (10)
1. a kind of data under voice method characterized by comprising
Receive the data under voice triggering command of user;
Acquire the tone testing data of the user;
Voice activity detection is carried out to the tone testing data, obtains the testing result of each voice metrics;
Adjust the corresponding parameter threshold of the unacceptable voice metrics of testing result;
The step of repeating the tone testing data of the acquisition user, until the testing result of each voice metrics is equal
By when, save parameter threshold adjusted;
During acquiring the voice sample data of the user, using parameter threshold adjusted to each speech samples number
According to progress voice activity detection.
2. the method according to claim 1, wherein carrying out speech activity to the tone testing data described
Before detection, further includes:
Determine whether the corresponding parameter threshold of each voice metrics is default value;
If it is not, then the corresponding parameter threshold of each voice metrics is adjusted to default value.
3. the method according to claim 1, wherein in the data under voice triggering command for receiving user
Later, before the tone testing data of the acquisition user, further includes:
Export test prompts information.
4. according to claim 1 to described in any item methods among 3, which is characterized in that the voice metrics include energy value,
Signal-to-noise ratio and effective voice.
5. according to the method described in claim 4, it is characterized in that, the unacceptable voice metrics of the adjustment testing result are corresponding
Parameter threshold, comprising:
If the unacceptable voice metrics of testing result are energy value, reduce energy parameter threshold value.
6. according to the method described in claim 4, it is characterized in that, the unacceptable voice metrics of the adjustment testing result are corresponding
Parameter threshold, comprising:
If the unacceptable voice metrics of testing result are signal-to-noise ratio, reduce signal-to-noise ratio parameter threshold.
7. according to the method described in claim 4, it is characterized in that, the unacceptable voice metrics of the adjustment testing result are corresponding
Parameter threshold, comprising:
If the unacceptable voice metrics of testing result are effective voice, reduce effective voice parameter threshold.
8. a kind of data under voice device characterized by comprising
Triggering command receiving module, for receiving the data under voice triggering command of user;
Test data collection module, for acquiring the tone testing data of the user;
Testing result obtains module, for carrying out voice activity detection to the tone testing data, obtains each voice metrics
Testing result;
Parameter threshold adjusts module, for adjusting the corresponding parameter threshold of the unacceptable voice metrics of testing result;
Module is repeated, for the step of repeating the tone testing data of the acquisition user, until each language
When the testing result of sound index passes through, trigger parameter threshold value preserving module;
The parameter threshold preserving module, for saving parameter threshold adjusted;
Sample data acquisition module, for using ginseng adjusted during acquiring the voice sample data of the user
Number threshold value carries out voice activity detection to each voice sample data.
9. a kind of data under voice equipment characterized by comprising
Memory, for storing computer program;
Processor realizes the data under voice side as described in any one of claim 1 to 7 when for executing the computer program
The step of method.
10. a kind of computer readable storage medium, which is characterized in that be stored with computer on the computer readable storage medium
Program realizes the data under voice method as described in any one of claim 1 to 7 when the computer program is executed by processor
The step of.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811533747.8A CN109389993A (en) | 2018-12-14 | 2018-12-14 | A kind of data under voice method, apparatus, equipment and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811533747.8A CN109389993A (en) | 2018-12-14 | 2018-12-14 | A kind of data under voice method, apparatus, equipment and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109389993A true CN109389993A (en) | 2019-02-26 |
Family
ID=65429288
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811533747.8A Pending CN109389993A (en) | 2018-12-14 | 2018-12-14 | A kind of data under voice method, apparatus, equipment and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109389993A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110473525A (en) * | 2019-09-16 | 2019-11-19 | 百度在线网络技术(北京)有限公司 | The method and apparatus for obtaining voice training sample |
CN110910283A (en) * | 2019-10-18 | 2020-03-24 | 平安科技(深圳)有限公司 | Method, device, equipment and storage medium for generating legal document |
CN112017698A (en) * | 2020-10-30 | 2020-12-01 | 北京淇瑀信息科技有限公司 | Method and device for optimizing manual recording adopted by voice robot and electronic equipment |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101393744A (en) * | 2007-09-19 | 2009-03-25 | 华为技术有限公司 | Method for regulating threshold and detection module |
CN103730118A (en) * | 2012-10-11 | 2014-04-16 | 百度在线网络技术(北京)有限公司 | Voice signal collecting method and mobile terminal |
US20160203834A1 (en) * | 2013-05-21 | 2016-07-14 | Speech Morphing Systems, Inc. | Method and apparatus for exemplary segment classification |
CN107305774A (en) * | 2016-04-22 | 2017-10-31 | 腾讯科技(深圳)有限公司 | Speech detection method and device |
CN108877776A (en) * | 2018-06-06 | 2018-11-23 | 平安科技(深圳)有限公司 | Sound end detecting method, device, computer equipment and storage medium |
CN108986822A (en) * | 2018-08-31 | 2018-12-11 | 出门问问信息科技有限公司 | Audio recognition method, device, electronic equipment and non-transient computer storage medium |
-
2018
- 2018-12-14 CN CN201811533747.8A patent/CN109389993A/en active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101393744A (en) * | 2007-09-19 | 2009-03-25 | 华为技术有限公司 | Method for regulating threshold and detection module |
CN103730118A (en) * | 2012-10-11 | 2014-04-16 | 百度在线网络技术(北京)有限公司 | Voice signal collecting method and mobile terminal |
US20160203834A1 (en) * | 2013-05-21 | 2016-07-14 | Speech Morphing Systems, Inc. | Method and apparatus for exemplary segment classification |
CN107305774A (en) * | 2016-04-22 | 2017-10-31 | 腾讯科技(深圳)有限公司 | Speech detection method and device |
CN108877776A (en) * | 2018-06-06 | 2018-11-23 | 平安科技(深圳)有限公司 | Sound end detecting method, device, computer equipment and storage medium |
CN108986822A (en) * | 2018-08-31 | 2018-12-11 | 出门问问信息科技有限公司 | Audio recognition method, device, electronic equipment and non-transient computer storage medium |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110473525A (en) * | 2019-09-16 | 2019-11-19 | 百度在线网络技术(北京)有限公司 | The method and apparatus for obtaining voice training sample |
CN110473525B (en) * | 2019-09-16 | 2022-04-05 | 百度在线网络技术(北京)有限公司 | Method and device for acquiring voice training sample |
CN110910283A (en) * | 2019-10-18 | 2020-03-24 | 平安科技(深圳)有限公司 | Method, device, equipment and storage medium for generating legal document |
CN112017698A (en) * | 2020-10-30 | 2020-12-01 | 北京淇瑀信息科技有限公司 | Method and device for optimizing manual recording adopted by voice robot and electronic equipment |
CN112017698B (en) * | 2020-10-30 | 2021-01-29 | 北京淇瑀信息科技有限公司 | Method and device for optimizing manual recording adopted by voice robot and electronic equipment |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106486131B (en) | A kind of method and device of speech de-noising | |
CN103578468B (en) | The method of adjustment and electronic equipment of a kind of confidence coefficient threshold of voice recognition | |
US10242677B2 (en) | Speaker dependent voiced sound pattern detection thresholds | |
CN108766454A (en) | A kind of voice noise suppressing method and device | |
CN109389993A (en) | A kind of data under voice method, apparatus, equipment and storage medium | |
CN110517670A (en) | Promote the method and apparatus for waking up performance | |
CN110021307A (en) | Audio method of calibration, device, storage medium and electronic equipment | |
EP2905780A1 (en) | Voiced sound pattern detection | |
US20150081287A1 (en) | Adaptive noise reduction for high noise environments | |
CN105118522A (en) | Noise detection method and device | |
WO2020037555A1 (en) | Method, device, apparatus, and system for evaluating microphone array consistency | |
CN110473539A (en) | Promote the method and apparatus that voice wakes up performance | |
CN108152788A (en) | Sound-source follow-up method, sound-source follow-up equipment and computer readable storage medium | |
US20060100866A1 (en) | Influencing automatic speech recognition signal-to-noise levels | |
CN109545242A (en) | A kind of audio data processing method, system, device and readable storage medium storing program for executing | |
CN105308679A (en) | Method and system for identifying location associated with voice command to control home appliance | |
CN110265064A (en) | Audio sonic boom detection method, device and storage medium | |
CN108648765A (en) | A kind of method, apparatus and terminal of voice abnormality detection | |
CN110232933A (en) | Audio-frequency detection, device, storage medium and electronic equipment | |
CN111540342B (en) | Energy threshold adjusting method, device, equipment and medium | |
US20170296081A1 (en) | Frame based spike detection module | |
KR101762723B1 (en) | Method and apparatus for detecting correctness of pitch period | |
CN110600048A (en) | Audio verification method and device, storage medium and electronic equipment | |
CN109994129A (en) | Speech processing system, method and apparatus | |
CN110689885A (en) | Machine-synthesized speech recognition method, device, storage medium and electronic equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190226 |