US12431160B2 - Voice signal detection method, terminal device and storage medium - Google Patents
Voice signal detection method, terminal device and storage mediumInfo
- Publication number
- US12431160B2 US12431160B2 US18/044,954 US202018044954A US12431160B2 US 12431160 B2 US12431160 B2 US 12431160B2 US 202018044954 A US202018044954 A US 202018044954A US 12431160 B2 US12431160 B2 US 12431160B2
- Authority
- US
- United States
- Prior art keywords
- acquiring
- frequency domain
- signal
- spectral energy
- preset
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active, expires
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/09—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being zero crossing rates
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D30/00—Reducing energy consumption in communication networks
- Y02D30/70—Reducing energy consumption in communication networks in wireless communication networks
Definitions
- a main purpose of the present application is to provide a voice signal detection method, a terminal device and a storage medium, and is intended to simplify the recognition for the voice.
- the present application provides a voice signal detection method, which is applied to a terminal device, and the voice signal detection method includes the following steps:
- the step of acquiring a time domain feature in the time domain signal includes:
- the step of acquiring a short-term zero-crossing rate of the time domain signal includes:
- the step of acquiring a pitch period of the time domain signal includes:
- the step of acquiring a spectral center of gravity of the frequency domain signal includes:
- the step of acquiring a frequency domain feature in the frequency domain signal further includes:
- the step of acquiring a spectral energy ratio of the frequency domain signal further includes:
- the present application further provides a computer readable storage medium, which is characterized in that, the computer readable storage medium stores a voice signal detection program thereon, and when the voice signal detection program is executed by a processor, the steps of the voice signal detection method as described above are implemented.
- the voice signal detection method, the terminal device and the storage medium proposed in this application receives a time domain signal detected by a bone conduction sensor in the terminal device, acquires a time domain feature in the time domain signal, converts the time domain signal into a frequency domain signal, acquires a frequency domain feature in the frequency domain signal, and when the time domain feature satisfies a first preset condition and the frequency domain feature satisfies a second preset condition, it is determined that the bone conduction sensor detects the voice signal, so that the voice detection can be performed according to the time domain signal detected by the bone conduction sensor, without being combined with the signal detected by the microphone, therefore the voice detection is simpler, in the meanwhile, the cost is lower since only the bone conduction sensor is combined in the recognition for the voice.
- the terminal device may be a wearable device worn on the head, such as a headset, glasses, and a VR device and the like.
- the terminal device in this embodiment includes a memory 110 , a processor 130 , and a bone conduction sensor 120 .
- the memory 110 can store a voice signal detection program.
- the terminal device in this embodiment is a headset
- the terminal device may further include a microphone, and the microphone is connected to the processor 130 .
- Step S 10 receiving a time domain signal detected by a bone conduction sensor in the terminal device, and acquiring a time domain feature in the time domain signal.
- Bone conduction is a method of sound conduction, that is, converting sound into mechanical vibrations of different frequencies, and transmitting sound waves through the human skull, bone labyrinth, inner ear lymph, auger, and auditory center. Compared to the conventional sound conduction method that generates sound waves through the diaphragm, bone conduction eliminates many steps of transmitting sound waves, and can achieve clear sound reproduction in noisy environments, and sound waves may not affect others because they diffuse in the air.
- the step of acquiring a time domain feature in the time-domain signal includes: acquiring the short-term zero-crossing rate of the time domain signal; and acquiring the pitch period of the time domain signal.
- the time domain feature includes the short-term zero-crossing rate and the pitch period.
- the corresponding step of acquiring a short-term zero-crossing rate of the time domain signal includes:
- the voice signal detection method includes:
- the first preset condition includes that the short-term zero-crossing rate is greater than a preset short-term zero-crossing rate and the pitch period is greater than a first preset pitch period or less than a second preset pitch period.
- the preset short-term zero-crossing rate may be 0.6
- the first preset pitch period may be 94
- the second preset pitch period may be 8.
- the second preset condition includes that the spectral center of gravity is greater than a preset spectral center of gravity.
- the preset spectral center of gravity may be 3.
- the frequency domain feature may further include at least one of the logarithmic spectral energy and the spectral energy ratio.
- the step of acquiring the frequency domain feature in the frequency domain signal further includes:
- the step of acquiring a spectral energy ratio of the frequency domain signal includes:
- E L ⁇ i - 1 2 ⁇ 4 ⁇ ⁇ " ⁇ [LeftBracketingBar]” Y ⁇ ( k ) ⁇ " ⁇ [RightBracketingBar]” 2 , wherein E L is the first spectral energy.
- the calculation formula of the second spectral energy corresponding to the second preset frequency band may be:
- the calculation formula of the spectral energy ratio is:
- E ratio E L E H , E ratio is the spectral energy ratio.
- the step of acquiring a logarithmic spectral energy of the frequency domain signal includes:
- the 128 KHZ bandwidth of the frequency domain signal is divided into 128 sub-bands.
- 1-24 sub-bands is taken as the third preset frequency band.
- the calculation formula of the corresponding logarithmic spectral energy is:
- the microphone in the terminal device can be turned on when the bone conduction sensor detects the voice signal. It is also possible to perform other preset operations when a voice signal is detected, and the preset operations may be set according to requirements.
- the bone conduction sensor when any of the time domain features does not satisfy the first preset condition or any of the frequency domain features does not satisfy the second preset condition, it is determined that the bone conduction sensor does not detect a voice signal.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Circuit For Audible Band Transducer (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
Abstract
Description
-
- receiving a time domain signal detected by a bone conduction sensor in the terminal device, and acquiring a time domain feature in the time domain signal;
- converting the time domain signal into a frequency domain signal, and acquiring a frequency domain feature in the frequency domain signal; and
- when the time domain feature satisfies a first preset condition and the frequency domain feature satisfies a second preset condition, determining that a voice signal has been detected by the bone conduction sensor.
-
- acquiring a short-term zero-crossing rate of the time domain signal; and
- acquiring a pitch period of the time domain signal, the time domain feature comprises the short-term zero-crossing rate and the pitch period, and the first preset condition includes that the short-term zero-crossing rate is greater than a preset short-term zero-crossing rate and the pitch period is greater than a first preset pitch period and smaller than a second preset pitch period, and
- the step of acquiring a frequency domain feature in the frequency domain signal includes:
- acquiring a spectral center of gravity of the frequency domain signal, the frequency domain feature includes the spectral center of gravity, and the second preset condition includes that the spectral center of gravity is greater than a preset spectral center of gravity.
-
- acquiring difference values between sign functions of adjacent sampling points in the time domain signal, wherein parameters of the sign functions are sampling signals of the sampling points respectively; and
- obtaining the short-term zero-crossing rate by summing up absolute values of each of the difference values.
-
- sampling the time domain signal in accordance with a preset period to obtain a sampling signal, and acquiring a reference signal after a preset time interval for the sampling signal; and
- acquiring a similarity between the sampling signal and the reference signal, and determining the pitch period according to the similarity.
-
- acquiring frequency and spectral energy of each of sampling points in the frequency domain signal, and calculating product of the frequency and the spectral energy of each of the sampling points;
- summing up products of corresponding sampling points to obtain a first sum value, and summing up spectral energies of the sampling points to obtain a second sum value; and
- acquiring a ratio of the first sum to the second sum to obtain the spectral center of gravity.
-
- acquiring a logarithmic spectral energy of the frequency domain signal;
- acquiring a spectral energy ratio of the frequency domain signal, the frequency domain feature further includes the logarithmic spectral energy and the spectral energy ratio of the frequency domain signal, and the second preset condition further includes that, the logarithmic spectral energy is less than a preset logarithmic spectral energy, and the spectral energy ratio is less than a preset spectral energy ratio.
-
- acquiring amplitudes of sub-frequency domain microphone signals of sub-bands of a frequency domain microphone signal in a first preset frequency band, and determining a first spectral energy according to the amplitudes;
- acquiring amplitudes of sub-frequency domain microphone signals of sub-bands of a frequency domain microphone signal in a second preset frequency band, and determining a second spectral energy according to the amplitudes, the highest frequency in the first preset frequency band is less than the lowest frequency in the second preset frequency band; and
- acquiring a ratio of the first spectral energy to the second spectral energy, and obtaining the spectral energy ratio according to the ratio of the first spectral energy to the second spectral energy.
-
- acquiring amplitudes of sub-frequency domain microphone signals of sub-bands of a frequency domain microphone signal in a third preset frequency band, and determining a third spectral energy according to the amplitudes; and
- obtaining the logarithmic spectral energy by taking logarithm of the third spectral energy.
- in addition, in order to achieve the above purpose, the present application further provides a terminal device, the terminal device includes a memory, a processor, and a voice signal detection program stored on the memory and executable by the processor, and when the voice signal detection program is executed by the processor, the voice signal detection method as described above is implemented.
-
- receiving a time domain signal detected by a bone conduction sensor in the terminal device, and acquiring a time domain feature in the time domain signal;
- converting the time domain signal into a frequency domain signal, and acquiring a frequency domain feature in the frequency domain signal; and
- when the time domain feature satisfies a first preset condition and the frequency domain feature satisfies a second preset condition, determining that a voice signal has been detected by the bone conduction sensor.
-
- acquiring difference values between sign functions of adjacent individual sampling points in the time domain signal, a parameter of the sign function is a sampling signal of the sampling point; and
- summing up absolute values of the difference values to obtain the short-term zero-crossing rate.
wherein sgn is a sign function, and the value of sgn can refer to the formula:
wherein x(m) is the sampling signal obtained by sampling, and Zn is the short-time zero-crossing rate.
-
- sampling the time domain signal in accordance to a preset period to obtain a sampling signal, and acquiring a reference signal after a preset time interval for the sampling signal; and
- acquiring a similarity between the sampling signal and the reference signal, and determining the pitch period according to the similarity. It can be understood that the maximum similarity among the similarities may be taken as the pitch period.
wherein Rm is the similarity, the formula of the pitch period is
Pitch=max{Rm}, wherein Pitch is the pitch period.
-
- Step S20: converting the time domain signal into a frequency domain signal, and acquiring a frequency domain feature in the frequency domain signal.
-
- acquiring frequencies and spectral energies of sampling points in the frequency domain signal, and acquiring product of the frequency and the spectral energy of each of the sampling points;
- summing up the corresponding product of each of the sampling points to obtain a first sum value, and summing up the spectral energy of each of the sampling points to obtain a second sum value; and
- acquiring a ratio of the first sum to the second sum to obtain the spectral center of gravity.
wherein brightness is the spectral center of gravity, N is the number of sampling points, N=128, f (k) is the frequency of the sampling point, E(k) is the spectral energy, and the calculation formula of the spectral energy is: E(k)=|Y(k)|2, wherein Y(k) is the amplitude of the frequency domain signal.
-
- Step S30: when the time domain feature satisfies a first preset condition and the frequency domain feature satisfies a second preset condition, determining that a voice signal has been detected by the bone conduction sensor.
-
- acquiring a logarithmic spectral energy of the frequency domain signal; and/or
- acquiring a spectral energy ratio of the frequency domain signal, the frequency domain feature further includes the logarithmic spectral energy and the spectral energy ratio of the frequency domain signal, and the second preset condition further includes at least one of conditions that the logarithmic spectral energy is less than a preset logarithmic spectral energy and the spectral energy ratio is less than a preset spectral energy ratio.
-
- acquiring amplitudes of sub-frequency domain microphone signals of sub-bands of a frequency domain microphone signal in a first preset frequency band, and determining a first spectral energy according to the amplitudes;
- acquiring amplitudes of sub-frequency domain microphone signals of sub-bands of a frequency domain microphone signal in a second preset frequency band, and determining a second spectral energy according to the amplitudes, a highest frequency in the first preset frequency band is less than a lowest frequency in the second preset frequency band; and
- acquiring a ratio of the first spectral energy to the second spectral energy, and obtaining the spectral energy ratio according to the ratio of the first spectral energy to the second spectral energy.
wherein EL is the first spectral energy.
The calculation formula of the second spectral energy corresponding to the second preset frequency band may be:
wherein EH is the second spectral energy, and Y(k) is the amplitude of the frequency domain signal.
The calculation formula of the spectral energy ratio is:
Eratio is the spectral energy ratio.
-
- acquiring amplitudes of sub-frequency domain microphone signals of sub-bands of a frequency domain microphone signal in a third preset frequency band, and determining a third spectral energy according to the amplitudes; and
- obtaining the logarithmic spectral energy by taking logarithm of the third spectral energy.
wherein Y(k) is the amplitude of the frequency domain signal and Eg is the logarithmic spectral energy.
Claims (9)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010953527.1 | 2020-09-10 | ||
| CN202010953527.1A CN112017639B (en) | 2020-09-10 | 2020-09-10 | Voice signal detection method, terminal equipment and storage medium |
| PCT/CN2020/124896 WO2022052246A1 (en) | 2020-09-10 | 2020-10-29 | Voice signal detection method, terminal device and storage medium |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| US20230360666A1 US20230360666A1 (en) | 2023-11-09 |
| US12431160B2 true US12431160B2 (en) | 2025-09-30 |
Family
ID=73522552
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US18/044,954 Active 2041-08-02 US12431160B2 (en) | 2020-09-10 | 2020-10-29 | Voice signal detection method, terminal device and storage medium |
Country Status (3)
| Country | Link |
|---|---|
| US (1) | US12431160B2 (en) |
| CN (1) | CN112017639B (en) |
| WO (1) | WO2022052246A1 (en) |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112951243A (en) * | 2021-02-07 | 2021-06-11 | 深圳市汇顶科技股份有限公司 | Voice awakening method, device, chip, electronic equipment and storage medium |
| CN113470694A (en) * | 2021-04-25 | 2021-10-01 | 重庆市科源能源技术发展有限公司 | Remote listening monitoring method, device and system for hydraulic turbine set |
| CN113709645B (en) * | 2021-09-02 | 2025-04-15 | 声佗医疗科技(上海)有限公司 | Hearing aid and its intraoral device, external device, control method and control device, and storage device |
| CN115290133A (en) * | 2022-06-30 | 2022-11-04 | 苏州经贸职业技术学院 | Method and system for monitoring track structure at joint of light rail platform |
Citations (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH08265887A (en) | 1995-03-23 | 1996-10-11 | Mitsubishi Electric Corp | Bone conduction microphone and bone conduction earphone microphone |
| US20050069162A1 (en) | 2003-09-23 | 2005-03-31 | Simon Haykin | Binaural adaptive hearing aid |
| CN101399039A (en) | 2007-09-30 | 2009-04-01 | 华为技术有限公司 | Method and device for determining non-noise audio signal classification |
| CN101601088A (en) | 2007-09-11 | 2009-12-09 | 松下电器产业株式会社 | Sound judgment means, sound detection device and sound determination methods |
| CN102314884A (en) | 2011-08-16 | 2012-01-11 | 捷思锐科技(北京)有限公司 | Voice-activation detecting method and device |
| US8315854B2 (en) | 2006-01-26 | 2012-11-20 | Samsung Electronics Co., Ltd. | Method and apparatus for detecting pitch by using spectral auto-correlation |
| CN104144377A (en) | 2013-05-09 | 2014-11-12 | Dsp集团有限公司 | Low power activation of voice activated device |
| CN106714023A (en) | 2016-12-27 | 2017-05-24 | 广东小天才科技有限公司 | Bone conduction earphone-based voice awakening method and system and bone conduction earphone |
| US10535364B1 (en) * | 2016-09-08 | 2020-01-14 | Amazon Technologies, Inc. | Voice activity detection using air conduction and bone conduction microphones |
| US20200184996A1 (en) * | 2018-12-10 | 2020-06-11 | Cirrus Logic International Semiconductor Ltd. | Methods and systems for speech detection |
Family Cites Families (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP2562751B1 (en) * | 2011-08-22 | 2014-06-11 | Svox AG | Temporal interpolation of adjacent spectra |
| CN111599345B (en) * | 2020-04-03 | 2023-02-10 | 厦门快商通科技股份有限公司 | Speech recognition algorithm evaluation method, system, mobile terminal and storage medium |
-
2020
- 2020-09-10 CN CN202010953527.1A patent/CN112017639B/en active Active
- 2020-10-29 US US18/044,954 patent/US12431160B2/en active Active
- 2020-10-29 WO PCT/CN2020/124896 patent/WO2022052246A1/en not_active Ceased
Patent Citations (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH08265887A (en) | 1995-03-23 | 1996-10-11 | Mitsubishi Electric Corp | Bone conduction microphone and bone conduction earphone microphone |
| US20050069162A1 (en) | 2003-09-23 | 2005-03-31 | Simon Haykin | Binaural adaptive hearing aid |
| US8315854B2 (en) | 2006-01-26 | 2012-11-20 | Samsung Electronics Co., Ltd. | Method and apparatus for detecting pitch by using spectral auto-correlation |
| CN101601088A (en) | 2007-09-11 | 2009-12-09 | 松下电器产业株式会社 | Sound judgment means, sound detection device and sound determination methods |
| CN101399039A (en) | 2007-09-30 | 2009-04-01 | 华为技术有限公司 | Method and device for determining non-noise audio signal classification |
| CN102314884A (en) | 2011-08-16 | 2012-01-11 | 捷思锐科技(北京)有限公司 | Voice-activation detecting method and device |
| CN104144377A (en) | 2013-05-09 | 2014-11-12 | Dsp集团有限公司 | Low power activation of voice activated device |
| US10535364B1 (en) * | 2016-09-08 | 2020-01-14 | Amazon Technologies, Inc. | Voice activity detection using air conduction and bone conduction microphones |
| CN106714023A (en) | 2016-12-27 | 2017-05-24 | 广东小天才科技有限公司 | Bone conduction earphone-based voice awakening method and system and bone conduction earphone |
| US20200184996A1 (en) * | 2018-12-10 | 2020-06-11 | Cirrus Logic International Semiconductor Ltd. | Methods and systems for speech detection |
Non-Patent Citations (1)
| Title |
|---|
| International Search Report from International Application No. PCT/CN2020/124896 mailed Jun. 9, 2021. |
Also Published As
| Publication number | Publication date |
|---|---|
| US20230360666A1 (en) | 2023-11-09 |
| CN112017639B (en) | 2023-11-07 |
| WO2022052246A1 (en) | 2022-03-17 |
| CN112017639A (en) | 2020-12-01 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US12431160B2 (en) | Voice signal detection method, terminal device and storage medium | |
| US11677879B2 (en) | Howl detection in conference systems | |
| CN112951259B (en) | Audio noise reduction method and device, electronic equipment and computer readable storage medium | |
| Ma et al. | Efficient voice activity detection algorithm using long-term spectral flatness measure | |
| EP4604119A1 (en) | Transient signal encoding method and device, decoding method and device, and processing system | |
| US10074384B2 (en) | State estimating apparatus, state estimating method, and state estimating computer program | |
| US9183846B2 (en) | Method and device for adaptively adjusting sound effect | |
| EP2083417B1 (en) | Sound processing device and program | |
| CN100490314C (en) | Audio signal processing for speech communication | |
| CN101821971A (en) | Systems and methods for noisy activity detection | |
| CN111640411B (en) | Audio synthesis method, device and computer readable storage medium | |
| US8364475B2 (en) | Voice processing apparatus and voice processing method for changing accoustic feature quantity of received voice signal | |
| CN111768800B (en) | Voice signal processing method, device and storage medium | |
| CN106024010B (en) | A kind of voice signal dynamic feature extraction method based on formant curve | |
| US20060100866A1 (en) | Influencing automatic speech recognition signal-to-noise levels | |
| US8423357B2 (en) | System and method for biometric acoustic noise reduction | |
| JPWO2006011405A1 (en) | Digital filtering method, digital filter device, digital filter program, computer-readable recording medium, and recorded device | |
| CN106024017A (en) | Voice detection method and device | |
| US12462826B2 (en) | Adapting sibilance detection based on detecting specific sounds in an audio signal | |
| JP6268916B2 (en) | Abnormal conversation detection apparatus, abnormal conversation detection method, and abnormal conversation detection computer program | |
| KR101414233B1 (en) | Apparatus and method for improving intelligibility of speech signal | |
| CN114429763A (en) | Real-time voice tone style conversion technology | |
| CN111755028A (en) | Near-field remote controller voice endpoint detection method and system based on fundamental tone characteristics | |
| JP6197367B2 (en) | Communication device and masking sound generation program | |
| Dai et al. | An improved model of masking effects for robust speech recognition system |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: GOERTEK INC., CHINA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:CHEN, GUOMING;REEL/FRAME:062951/0361 Effective date: 20230307 |
|
| FEPP | Fee payment procedure |
Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT RECEIVED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED |
|
| STCF | Information on status: patent grant |
Free format text: PATENTED CASE |