CN111128192A - Voice recognition noise reduction method, system, mobile terminal and storage medium - Google Patents
Voice recognition noise reduction method, system, mobile terminal and storage medium Download PDFInfo
- Publication number
- CN111128192A CN111128192A CN201911424022.XA CN201911424022A CN111128192A CN 111128192 A CN111128192 A CN 111128192A CN 201911424022 A CN201911424022 A CN 201911424022A CN 111128192 A CN111128192 A CN 111128192A
- Authority
- CN
- China
- Prior art keywords
- voice
- audio information
- noise
- information
- noise reduction
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000009467 reduction Effects 0.000 title claims abstract description 51
- 238000000034 method Methods 0.000 title claims abstract description 38
- 238000003860 storage Methods 0.000 title claims abstract description 16
- 230000007613 environmental effect Effects 0.000 claims abstract description 31
- 238000012545 processing Methods 0.000 claims abstract description 11
- 230000009471 action Effects 0.000 claims description 36
- 238000004590 computer program Methods 0.000 claims description 12
- 238000001228 spectrum Methods 0.000 claims description 10
- 238000013135 deep learning Methods 0.000 claims description 5
- 238000004519 manufacturing process Methods 0.000 description 11
- 238000013461 design Methods 0.000 description 6
- 230000008569 process Effects 0.000 description 4
- 230000001174 ascending effect Effects 0.000 description 3
- 238000005452 bending Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 238000012549 training Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 238000003062 neural network model Methods 0.000 description 2
- 238000013527 convolutional neural network Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 238000009434 installation Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Manipulator (AREA)
Abstract
The invention provides a voice recognition noise reduction method, a system, a mobile terminal and a storage medium, wherein the method comprises the following steps: when a voice instruction sent by a user is received, audio information containing the voice instruction and environment audio information of the current environment are obtained; acquiring operation information of the steering engine, and inquiring noise data in a noise database according to the operation information and the environmental audio information to obtain target noise data matched with the operation information and the environmental audio information; comparing the target noise data with the voice audio information; and performing noise reduction processing on the voice audio information according to the comparison result, and performing voice recognition on the voice audio information subjected to noise reduction. According to the invention, the target audio data is analyzed, the data of the target noise data characteristic is removed, and relatively clean noise-free audio data is obtained, so that the interference of environmental noise or noise of a steering engine product is reduced, and the accuracy of voice recognition is improved.
Description
Technical Field
The invention belongs to the technical field of voice recognition, and particularly relates to a voice recognition noise reduction method, a voice recognition noise reduction system, a mobile terminal and a storage medium.
Background
With the development of human-computer interaction technology, robots are expected to have human-like perception capabilities and to cooperate with humans. In order to achieve the aim, some researchers use a voice technology to enable the robot to understand human language so as to directly control the robot to correspondingly execute the operation in a voice mode, so that the control of the robot is simplified, and the control efficiency of the robot is improved.
The noise that can inevitably produce among the motion state of current robot, noise for example steering wheel, fan or motor produced, and because of the adapter is closer to the robot, consequently user's language information is compared to these noises and is acquireed more easily, and then has leaded to the speech recognition accuracy of robot low down.
Disclosure of Invention
The embodiment of the invention aims to provide a voice recognition noise reduction method, a voice recognition noise reduction system, a mobile terminal and a storage medium, and aims to solve the problem that the voice recognition accuracy is low due to noise emitted by a robot in the motion process in the use process of the existing robot.
The embodiment of the invention is realized in such a way that a speech recognition noise reduction method comprises the following steps:
when a voice instruction sent by a user is received, audio information containing the voice instruction and environment audio information of the current environment are obtained;
acquiring operation information of a steering engine, and inquiring noise data in a noise database according to the operation information and the environmental audio information to obtain target noise data matched with the operation information and the environmental audio information;
comparing the target noise data with the voice audio information stored in the audio information;
and performing noise reduction processing on the voice audio information according to the comparison result, and performing voice recognition on the voice audio information subjected to noise reduction.
Further, before the step of receiving the voice command sent by the user, the method further includes:
according to a locally pre-stored sounding state and an operation instruction, sequentially controlling the steering engine to execute different operations, and acquiring audio data generated when the steering engine executes various operations to obtain sample audio;
and performing deep learning according to the sample audio to obtain the noise database.
Furthermore, the controlling the robot to perform different operations in sequence according to the locally pre-stored sound production state and the operation instruction includes:
acquiring the volume value in the sounding state, and acquiring the running number and the operation action in the operation command and the rotation angle of a steering engine of the operation action;
and controlling a corresponding sounding unit to sound according to the volume value, and controlling the corresponding steering engine to execute the operation action according to the operation number.
Further, the querying noise data in a noise database according to the operation information and the environmental audio information includes:
acquiring sounding numbers and operation actions stored in the operation information and rotation angles of steering engines of the operation actions, and acquiring volume values stored in the environment audio information;
and matching the sounding numbers, the operation actions, the rotation angles of steering engines of the operation actions and the volume values with the noise database to obtain the target noise data.
Furthermore, the comparing the target noise data with the voice audio information stored in the audio information includes:
and carrying out frequency spectrum comparison, power spectrum comparison and cepstrum comparison on the target noise data and the voice audio information to obtain a comparison result.
Further, before the step of receiving the voice command sent by the user, the method further includes:
when a control instruction sent by a user is received, sound collection is carried out on the current environment within a first preset time to obtain the environment audio information, wherein the control instruction is used for triggering voice control aiming at the robot;
and sending a voice acquisition prompt after the sound acquisition of the current environment is finished, and continuously performing voice acquisition until the voice acquisition state meets the stop condition, and stopping the voice acquisition to obtain the voice audio information.
Further, after the step of continuing to perform voice capturing, the method further comprises:
when the voice acquisition time is judged to be larger than a time threshold value, judging that the voice acquisition state meets the stop condition; or
When an acquisition stopping instruction is received, judging that the voice acquisition state meets the stopping condition; or
And when the volume of the current environment is continuously smaller than a volume threshold value within a second preset time, judging that the voice acquisition state meets the stopping condition and the corresponding sound generating unit generates sound.
Another object of an embodiment of the present invention is to provide a speech recognition noise reduction system, including:
the information acquisition module is used for acquiring audio information containing a voice instruction and environmental audio information of the current environment when the voice instruction sent by a user is received;
the noise query module is used for acquiring the operation information of the steering engine, and querying noise data in a noise database according to the operation information and the environmental audio information to obtain target noise data matched with the operation information and the environmental audio information;
the audio comparison module is used for comparing the target noise data with the voice audio information stored in the audio information;
and the audio denoising module is used for performing denoising processing on the voice audio information according to the comparison result and performing voice recognition on the denoised voice audio information.
Another object of an embodiment of the present invention is to provide a mobile terminal, including a storage device and a processor, where the storage device is used to store a computer program, and the processor runs the computer program to make the mobile terminal execute the above-mentioned speech recognition noise reduction method.
Another object of an embodiment of the present invention is to provide a storage medium, which stores a computer program used in the above-mentioned mobile terminal, wherein the computer program, when executed by a processor, implements the steps of the above-mentioned speech recognition noise reduction method.
According to the embodiment of the invention, the target audio data is analyzed, the data of the target noise data characteristic is removed, and relatively clean noise-free audio data is obtained, so that the interference of environmental noise or noise of a steering engine product is reduced, and the accuracy of voice recognition is improved.
Drawings
FIG. 1 is a flow chart of a method for noise reduction in speech recognition according to a first embodiment of the present invention;
FIG. 2 is a flowchart of a method for noise reduction in speech recognition according to a second embodiment of the present invention;
FIG. 3 is a schematic structural diagram of a speech recognition noise reduction system according to a third embodiment of the present invention;
fig. 4 is a schematic structural diagram of a mobile terminal according to a fourth embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
The embodiment of the invention aims to solve the problem that the existing robot has low voice recognition accuracy due to the noise generated in the motion process.
In order to explain the technical means of the present invention, the following description will be given by way of specific examples.
Example one
Referring to fig. 1, a flowchart of a speech recognition noise reduction method according to a first embodiment of the present invention is shown, which includes the steps of:
step S10, when receiving a voice instruction sent by a user, acquiring audio information containing the voice instruction and environmental audio information of the current environment;
the voice audio information is obtained after the current mobile device collects voice sent by a user.
Specifically, the environmental audio information in this step may be obtained based on the usage status of the robot or the collection of the current environmental volume value, for example:
when the robot is judged to be in an unvoiced state (all sounding units on the robot are in the unvoiced state) and the volume value of the current environment is smaller than a first preset volume value, judging that the environment audio information is in a quiet environment;
when the robot is judged to be in a sounding state (any sounding unit on the robot is in the sounding state) and the volume value of the current environment is smaller than a first preset volume value, judging that the environment audio information is an audio environment played by the robot;
when the robot is judged to be in a sounding state (any sounding unit on the robot is in the sounding state) and the volume value of the current environment is greater than a first preset volume value, the environment audio information is judged to be the human-computer interaction environment.
Step S20, acquiring operation information of the steering engine, and inquiring noise data in a noise database according to the operation information and the environmental audio information to obtain target noise data matched with the operation information and the environmental audio information;
the operation information is obtained by sequentially inquiring the operation state of each steering engine on the robot through a steering engine code table prestored locally, the installation position and the current operation state of each steering engine are sequentially stored in the operation information, and the operation state comprises states of ascending, descending, bending or contracting and the like.
Preferably, in this embodiment, the operation information may be further divided into actions such as stretching hands, stretching legs, raising head, nodding head, or twisting waist according to the operation state of the steering engine on the robot.
Specifically, in the step, the acquired environmental audio information and the operating states of all the steering engines are matched with the noise database to query target noise data, wherein the target noise data are noise data obtained by overlapping the current environmental noise and the steering engine operating noise.
Step S30, comparing the target noise data with the voice audio information stored in the audio information;
the repeated audio information between the target noise data and the voice audio information can be inquired by adopting a frequency spectrum comparison mode, a power spectrum comparison mode or a cepstrum comparison mode, and preferably, the accuracy of subsequent noise reduction processing is effectively improved by a design of comparing the target noise data with the voice audio information stored in the audio information.
Step S40, carrying out noise reduction processing on the voice audio information according to the comparison result, and carrying out voice recognition on the voice audio information after noise reduction;
specifically, when the target noise data is removed by the passive noise reduction method, the noise reduction circuit may be used to perform noise reduction, that is, the noise reduction circuit is subjected to parameter setting by using the target noise data, and the voice audio information is input to the noise reduction circuit after the parameter setting, so as to remove the target noise data.
Preferably, in the step, when the target noise data is removed by active noise reduction, the target noise data is removed by performing parameter setting on a filter according to the target noise data and inputting the voice audio information to the filter after the parameter setting.
In the embodiment, the design of inquiring the target noise data according to the operation information and the environment audio information is used for acquiring the noise data caused by the current environment factors and the operation of the steering engine, and the target audio data is analyzed to remove the data of the characteristics of the target noise data and obtain relatively clean noise-free audio data, so that the interference of the noise of the environment noise or the steering engine product is reduced, and the accuracy of voice recognition is improved.
Example two
Referring to fig. 2, a flowchart of a speech recognition noise reduction method according to a second embodiment of the present invention is shown, which includes the steps of:
step S11, sequentially controlling the steering engine to execute different operations according to a locally pre-stored sound production state and an operation instruction, and collecting audio data generated when the steering engine executes various operations to obtain sample audio;
specifically, in this step, according to the locally pre-stored sound production state and the operation instruction, the step of sequentially controlling the steering engine to execute different operations includes:
step S111, obtaining the volume value in the sounding state, and obtaining the running number and the operation action in the operation command and the rotation angle of the steering engine of the operation action;
the volume value, the running number and the operation action corresponding to the number can be set according to requirements, for example, when the volume value can be set to 0 decibel, 40 decibel, 50 decibel or 60 decibel, the volume value is used for controlling the sound volume of the sound generating unit on the robot.
Preferably, in this step, the operation action may be a state of ascending, descending, bending, or contracting, and in this embodiment, the operation action may further be divided into a hand stretching, a leg stretching, a head raising, a head nodding, or a waist twisting and the like according to an operation state of a steering engine on the robot, the operation number is a unique number corresponding to each steering engine, and the operation number may be stored in a manner of a number or a character number, for example, a number 1 steering engine, a number 2 steering engine, a number 3 steering engine, a left hand steering engine, a right hand steering engine, a left leg steering engine, a head steering engine, and the like.
Step S112, controlling a corresponding sound production unit to produce sound according to the volume value, and controlling a corresponding steering engine to execute the operation action according to the operation number;
the corresponding sounding unit is controlled to sound according to the volume value, the steering engine is controlled to execute the design of the operation action according to the operation number, noise data generated by environmental factors and steering engine working factors in different environments and different states are collected, the diversity of noise collection is effectively improved, and the accuracy of follow-up voice noise is improved.
Continuing to refer to fig. 2, in step S21, performing deep learning according to the sample audio to obtain the noise database;
and extracting audio features in the sample audio by using an in-depth learning method, training a neural network model based on the audio features, and defining the neural network model as noise to obtain the noise database.
Specifically, in this step, deep learning training may be performed by using a convolutional neural network model, a deep belief network model, or a stacked self-coding network model, and iterative operation may be implemented by using an algorithm of back propagation, stochastic gradient descent, or learning rate attenuation in the model training process.
Step S31, when a control instruction sent by a user is received, sound collection is carried out on the current environment within a first preset time so as to obtain the environment audio information;
the control instruction is used for triggering voice control for the robot, that is, when the control instruction is received, it is determined that the user needs to perform voice control on the robot, preferably, the first preset time may be set according to a requirement, for example, 0.5s, 1s, or 2s, and specifically, in this step, when the control instruction is received, a sound pickup arranged on the robot is correspondingly controlled to perform sound volume collection, so as to obtain the environmental audio information.
Step S41, when sound collection of the current environment is finished, sending out a voice collection prompt, and continuously collecting voice until the voice collection state meets the stop condition, stopping voice collection to obtain the voice audio information;
wherein, the judgment parameter in the stop condition can be set according to the requirement; specifically, in this step, after the step of continuously performing voice acquisition, the method further includes:
when the voice acquisition time is judged to be larger than a time threshold value, judging that the voice acquisition state meets the stop condition; or
When an acquisition stopping instruction is received, judging that the voice acquisition state meets the stopping condition; or
When the volume of the current environment is continuously smaller than a volume threshold value within a second preset time, judging that the voice acquisition state meets the stop condition;
wherein, this time threshold and second preset time all can set up according to the demand.
Step S51, acquiring operation information of the steering engine, and inquiring noise data in a noise database according to the operation information and the environmental audio information to obtain target noise data matched with the operation information and the environmental audio information;
specifically, in this step, the querying noise data in a noise database according to the operation information and the environmental audio information includes:
acquiring sounding numbers and operation actions stored in the operation information and rotation angles of steering engines of the operation actions, and acquiring volume values stored in the environment audio information;
and matching the sounding numbers, the operation actions, the rotation angles of steering engines of the operation actions and the volume values with the noise database to obtain the target noise data.
Step S61, carrying out frequency spectrum comparison, power spectrum comparison and cepstrum comparison on the target noise data and the voice audio information to obtain a comparison result;
step S71, carrying out noise reduction processing on the voice audio information according to the comparison result, and carrying out voice recognition on the voice audio information after noise reduction;
in the embodiment, the design of inquiring the target noise data according to the operation information and the environment audio information is used for acquiring the noise data caused by the environment factors and the operation of the steering engine at present, and the design of comparing the target noise data with the voice audio information stored in the audio information is used for accurately deleting the noise in the voice audio information, so that the interference of the environment noise or the operation noise of the steering engine on the voice recognition is prevented, and the accuracy of the voice recognition is improved.
EXAMPLE III
Referring to fig. 3, a schematic structural diagram of a speech recognition noise reduction system 100 according to a third embodiment of the present invention is shown, including: the device comprises an information acquisition module 10, a noise query module 11, an audio comparison module 12 and an audio denoising module 13, wherein:
the information obtaining module 10 is configured to, when a voice instruction sent by a user is received, obtain audio information including the voice instruction and environmental audio information of a current environment where the user is located.
Wherein, the information obtaining module 10 is further configured to: the system comprises a control instruction and a voice acquisition module, wherein the control instruction is used for acquiring sound of the current environment within a first preset time to obtain environment audio information when receiving a control instruction sent by a user, and the control instruction is used for triggering voice control aiming at the robot; and sending a voice acquisition prompt after the sound acquisition of the current environment is finished, and continuously performing voice acquisition until the voice acquisition state meets the stop condition, and stopping the voice acquisition to obtain the voice audio information.
Further, the information obtaining module 10 is further configured to: the voice acquisition state is judged to meet the stop condition when the voice acquisition time is judged to be larger than a time threshold; or when receiving an acquisition stopping instruction, judging that the voice acquisition state meets the stopping condition; or when the volume of the current environment is continuously smaller than a volume threshold value within a second preset time, judging that the voice acquisition state meets the stop condition.
And the noise query module 11 is used for acquiring the operation information of the steering engine, and querying noise data in a noise database according to the operation information and the environment audio information to obtain target noise data matched with the operation information and the environment audio information.
Wherein the noise querying module 11 is further configured to: acquiring sounding numbers and operation actions stored in the operation information and rotation angles of steering engines of the operation actions, and acquiring volume values stored in the environment audio information; and matching the sounding numbers, the operation actions, the rotation angles of steering engines of the operation actions and the volume values with the noise database to obtain the target noise data.
And the audio comparison module 12 is configured to compare the target noise data with the voice audio information stored in the audio information.
Wherein, the audio comparison module 12 is further configured to: and carrying out frequency spectrum comparison, power spectrum comparison and cepstrum comparison on the target noise data and the voice audio information to obtain a comparison result.
And the audio denoising module 13 is configured to perform denoising processing on the voice audio information according to the comparison result, and perform voice recognition on the denoised voice audio information.
In this embodiment, the speech recognition noise reduction system 100 further includes:
the audio acquisition module 14 is used for sequentially controlling the steering engine to execute different operations according to a locally pre-stored sound production state and an operation instruction, and acquiring audio data generated when the steering engine executes various operations to obtain a sample audio; and performing deep learning according to the sample audio to obtain the noise database.
Preferably, the audio acquisition module 14 is further configured to: acquiring the volume value in the sounding state, and acquiring the running number and the operation action in the operation command and the rotation angle of a steering engine of the operation action; according to the volume value control corresponds the sound production unit and carries out the sound production, and according to operation serial number control corresponds the steering wheel is carried out the operation action corresponds the sound production unit and carries out the sound production, wherein, this volume value, operation serial number and the operation action that this serial number corresponds all can set up according to the demand, for example can set up to 0 decibel, 40 decibels, 50 decibels or 60 decibels etc. as the volume value, this volume value is used for controlling the sound production volume of sound production unit on this robot
Preferably, in the module, the operation action may be a state of ascending, descending, bending, or contracting, in this embodiment, the operation action may further be divided into a hand stretching, a leg stretching, a head raising, a head nodding, or a waist twisting, according to an operation state of the steering engine on the robot, the operation number is a unique number corresponding to each steering engine, and the operation number may be stored in a manner of a number or a character number, for example, a No. 1 steering engine, a No. 2 steering engine, a No. 3 steering engine, a left-hand steering engine, a right-hand steering engine, a left-leg steering engine, a head steering engine, or the like.
In the embodiment, the design of inquiring the target noise data according to the operation information and the environment audio information is used for acquiring the noise data caused by the current environment factors and the operation of the steering engine, and the target audio data is analyzed to remove the data of the characteristics of the target noise data and obtain relatively clean noise-free audio data, so that the interference of the noise of the environment noise or the steering engine product is reduced, and the accuracy of voice recognition is improved.
Example four
Referring to fig. 4, a mobile terminal 101 according to a fourth embodiment of the present invention includes a storage device and a processor, where the storage device is used to store a computer program, and the processor runs the computer program to make the mobile terminal 101 execute the above-mentioned voice recognition noise reduction method, and the mobile terminal 101 may be a robot.
The present embodiment also provides a storage medium on which a computer program used in the above-mentioned mobile terminal 101 is stored, which when executed, includes the steps of:
when a voice instruction sent by a user is received, audio information containing the voice instruction and environment audio information of the current environment are obtained;
acquiring operation information of a steering engine, and inquiring noise data in a noise database according to the operation information and the environmental audio information to obtain target noise data matched with the operation information and the environmental audio information;
comparing the target noise data with the voice audio information stored in the audio information;
and performing noise reduction processing on the voice audio information according to the comparison result, and performing voice recognition on the voice audio information subjected to noise reduction. The storage medium, such as: ROM/RAM, magnetic disk, optical disk, etc.
It will be apparent to those skilled in the art that, for convenience and brevity of description, only the above-mentioned division of the functional units and modules is used as an example, in practical applications, the above-mentioned function distribution may be performed by different functional units or modules according to needs, that is, the internal structure of the storage device is divided into different functional units or modules to perform all or part of the above-mentioned functions. Each functional unit and module in the embodiments may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit, and the integrated unit may be implemented in a form of hardware, or may be implemented in a form of software functional unit. In addition, specific names of the functional units and modules are only for convenience of distinguishing from each other, and are not used for limiting the protection scope of the present application.
Those skilled in the art will appreciate that the component structures shown in fig. 3 are not intended to be limiting of the speech recognition noise reduction system of the present invention and may include more or fewer components than those shown, or some components in combination, or a different arrangement of components, and that the speech recognition noise reduction methods of fig. 1-2 may be implemented using more or fewer components than those shown in fig. 3, or some components in combination, or a different arrangement of components. The units, modules, etc. referred to herein are a series of computer programs that can be executed by a processor (not shown) of the current speech recognition noise reduction system and that can perform specific functions, and all of them can be stored in a storage device (not shown) of the current speech recognition noise reduction system.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents and improvements made within the spirit and principle of the present invention are intended to be included within the scope of the present invention.
Claims (10)
1. A method for speech recognition noise reduction, the method comprising:
when a voice instruction sent by a user is received, audio information containing the voice instruction and environment audio information of the current environment are obtained;
acquiring operation information of a steering engine, and inquiring noise data in a noise database according to the operation information and the environmental audio information to obtain target noise data matched with the operation information and the environmental audio information;
comparing the target noise data with the voice audio information stored in the audio information;
and performing noise reduction processing on the voice audio information according to the comparison result, and performing voice recognition on the voice audio information subjected to noise reduction.
2. The speech recognition noise reduction method of claim 1, wherein the step of receiving a speech instruction sent by a user is preceded by the method further comprising:
according to a locally pre-stored sounding state and an operation instruction, sequentially controlling the steering engine to execute different operations, and acquiring audio data generated when the steering engine executes various operations to obtain sample audio;
and performing deep learning according to the sample audio to obtain the noise database.
3. The method of speech recognition noise reduction according to claim 2, wherein the sequentially controlling the steering engine to perform different operations comprises:
acquiring the volume value in the sounding state, and acquiring the running number and the operation action in the operation command and the rotation angle of a steering engine of the operation action;
and controlling a corresponding sounding unit to sound according to the volume value, and controlling the corresponding steering engine to execute the operation action according to the operation number.
4. The method of speech recognition noise reduction according to claim 1, wherein the querying noise data in a noise database based on the operational information and the environmental audio information comprises:
acquiring sounding numbers and operation actions stored in the operation information and rotation angles of steering engines of the operation actions, and acquiring volume values stored in the environment audio information;
and matching the sounding numbers, the operation actions, the rotation angles of steering engines of the operation actions and the volume values with the noise database to obtain the target noise data.
5. The method of reducing noise in speech recognition according to claim 1, wherein comparing the target noise data with speech audio information stored in the audio information comprises:
and carrying out frequency spectrum comparison, power spectrum comparison and cepstrum comparison on the target noise data and the voice audio information to obtain a comparison result.
6. The speech recognition noise reduction method of claim 1, wherein the step of receiving a speech instruction sent by a user is preceded by the method further comprising:
when a control instruction sent by a user is received, sound collection is carried out on the current environment within a first preset time to obtain the environment audio information, wherein the control instruction is used for triggering voice control aiming at the robot;
and sending a voice acquisition prompt after the sound acquisition of the current environment is finished, and continuously performing voice acquisition until the voice acquisition state meets the stop condition, and stopping the voice acquisition to obtain the voice audio information.
7. The speech recognition noise reduction method of claim 6, wherein after the step of continuing speech acquisition, the method further comprises:
when the voice acquisition time is judged to be larger than a time threshold value, judging that the voice acquisition state meets the stop condition; or
When an acquisition stopping instruction is received, judging that the voice acquisition state meets the stopping condition; or
And when the volume of the current environment is continuously smaller than a volume threshold value within a second preset time, judging that the voice acquisition state meets the stop condition.
8. A speech recognition noise reduction system, the system comprising:
the information acquisition module is used for acquiring audio information containing a voice instruction and environmental audio information of the current environment when the voice instruction sent by a user is received;
the noise query module is used for acquiring the operation information of the steering engine, and querying noise data in a noise database according to the operation information and the environmental audio information to obtain target noise data matched with the operation information and the environmental audio information;
the audio comparison module is used for comparing the target noise data with the voice audio information stored in the audio information;
and the audio denoising module is used for performing denoising processing on the voice audio information according to the comparison result and performing voice recognition on the denoised voice audio information.
9. A mobile terminal, characterized by comprising a storage device for storing a computer program and a processor for executing the computer program to make the mobile terminal execute the speech recognition noise reduction method according to any one of claims 1 to 7.
10. A storage medium, characterized in that it stores a computer program for use in a mobile terminal according to claim 9, which computer program, when executed by a processor, implements the steps of the speech recognition noise reduction method according to any one of claims 1 to 7.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911424022.XA CN111128192A (en) | 2019-12-31 | 2019-12-31 | Voice recognition noise reduction method, system, mobile terminal and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911424022.XA CN111128192A (en) | 2019-12-31 | 2019-12-31 | Voice recognition noise reduction method, system, mobile terminal and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111128192A true CN111128192A (en) | 2020-05-08 |
Family
ID=70507168
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201911424022.XA Pending CN111128192A (en) | 2019-12-31 | 2019-12-31 | Voice recognition noise reduction method, system, mobile terminal and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111128192A (en) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111899733A (en) * | 2020-07-02 | 2020-11-06 | 北京如影智能科技有限公司 | Method and device for determining volume |
CN112056763A (en) * | 2020-09-08 | 2020-12-11 | 深圳创维-Rgb电子有限公司 | Intelligent tea table, voice instruction recognition method and storage medium |
CN112509594A (en) * | 2020-06-22 | 2021-03-16 | 中兴通讯股份有限公司 | Terminal, sound production method, storage medium and electronic device |
CN112735457A (en) * | 2020-12-11 | 2021-04-30 | 中国汽车技术研究中心有限公司 | Voice denoising method and system |
CN114400001A (en) * | 2021-12-20 | 2022-04-26 | 上海华兴数字科技有限公司 | Voice interaction method and system for operation machine and operation machine |
CN114913845A (en) * | 2021-02-09 | 2022-08-16 | 北京小米移动软件有限公司 | Speech recognition method, and training method and device of speech recognition model |
CN115031266A (en) * | 2022-05-10 | 2022-09-09 | 海信(山东)厨卫有限公司 | Range hood and voice recognition method thereof |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070288242A1 (en) * | 2006-06-12 | 2007-12-13 | Lockheed Martin Corporation | Speech recognition and control system, program product, and related methods |
CN106952654A (en) * | 2017-04-24 | 2017-07-14 | 北京奇虎科技有限公司 | Robot noise-reduction method, device and robot |
CN107610698A (en) * | 2017-08-28 | 2018-01-19 | 深圳市金立通信设备有限公司 | A kind of method for realizing Voice command, robot and computer-readable recording medium |
WO2019235134A1 (en) * | 2018-06-07 | 2019-12-12 | ソニー株式会社 | Information generation device, information processing system, information processing method, and program |
-
2019
- 2019-12-31 CN CN201911424022.XA patent/CN111128192A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070288242A1 (en) * | 2006-06-12 | 2007-12-13 | Lockheed Martin Corporation | Speech recognition and control system, program product, and related methods |
CN106952654A (en) * | 2017-04-24 | 2017-07-14 | 北京奇虎科技有限公司 | Robot noise-reduction method, device and robot |
CN107610698A (en) * | 2017-08-28 | 2018-01-19 | 深圳市金立通信设备有限公司 | A kind of method for realizing Voice command, robot and computer-readable recording medium |
WO2019235134A1 (en) * | 2018-06-07 | 2019-12-12 | ソニー株式会社 | Information generation device, information processing system, information processing method, and program |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112509594A (en) * | 2020-06-22 | 2021-03-16 | 中兴通讯股份有限公司 | Terminal, sound production method, storage medium and electronic device |
CN111899733A (en) * | 2020-07-02 | 2020-11-06 | 北京如影智能科技有限公司 | Method and device for determining volume |
CN112056763A (en) * | 2020-09-08 | 2020-12-11 | 深圳创维-Rgb电子有限公司 | Intelligent tea table, voice instruction recognition method and storage medium |
CN112735457A (en) * | 2020-12-11 | 2021-04-30 | 中国汽车技术研究中心有限公司 | Voice denoising method and system |
CN112735457B (en) * | 2020-12-11 | 2022-05-17 | 中国汽车技术研究中心有限公司 | Voice denoising method and system |
CN114913845A (en) * | 2021-02-09 | 2022-08-16 | 北京小米移动软件有限公司 | Speech recognition method, and training method and device of speech recognition model |
CN114913845B (en) * | 2021-02-09 | 2024-05-24 | 北京小米移动软件有限公司 | Speech recognition method, training method and device of speech recognition model |
CN114400001A (en) * | 2021-12-20 | 2022-04-26 | 上海华兴数字科技有限公司 | Voice interaction method and system for operation machine and operation machine |
CN115031266A (en) * | 2022-05-10 | 2022-09-09 | 海信(山东)厨卫有限公司 | Range hood and voice recognition method thereof |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111128192A (en) | Voice recognition noise reduction method, system, mobile terminal and storage medium | |
CN110853663B (en) | Speech enhancement method based on artificial intelligence, server and storage medium | |
US10068566B2 (en) | Method and system for considering information about an expected response when performing speech recognition | |
CN1295672C (en) | Pattern recognition | |
US10902853B2 (en) | Electronic device and voice command identification method thereof | |
WO2020014899A1 (en) | Voice control method, central control device, and storage medium | |
US20060178886A1 (en) | Methods and systems for considering information about an expected response when performing speech recognition | |
CN110942779A (en) | Noise processing method, device and system | |
CN106439967B (en) | A kind of method and system improving kitchen ventilator phonetic recognization rate | |
CN114863927B (en) | Numerical control machine tool interaction control method and system based on voice recognition | |
CN102855873A (en) | Electronic equipment and method used for controlling same | |
CN113129927A (en) | Voice emotion recognition method, device, equipment and storage medium | |
CN208284230U (en) | A kind of speech recognition equipment, speech recognition system and smart machine | |
CN111681649B (en) | Speech recognition method, interaction system and achievement management system comprising system | |
CN109346099B (en) | Iterative denoising method and chip based on voice recognition | |
CN113593565A (en) | Intelligent home device management and control method and system | |
Loh et al. | Speech recognition interactive system for vehicle | |
CN115547352A (en) | Electronic device, method, apparatus and medium for processing noise thereof | |
CN109410928B (en) | Denoising method and chip based on voice recognition | |
CN109602333B (en) | Voice denoising method and chip based on cleaning robot | |
CN110265053B (en) | Signal noise reduction control method and device and electronic equipment | |
CN111599364A (en) | Voice recognition noise reduction method, system, mobile terminal and storage medium | |
KR102236437B1 (en) | Method and apparatus for mental workload assessment of multitasking user using physiological data and simulated data | |
CN113506360A (en) | Virtual character expression driving method and system | |
CN112656309A (en) | Function execution method and device of sweeper, readable storage medium and electronic equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20200508 |
|
RJ01 | Rejection of invention patent application after publication |