CN110827792B - Voice broadcasting method and device - Google Patents

Voice broadcasting method and device Download PDF

Info

Publication number
CN110827792B
CN110827792B CN201911116654.XA CN201911116654A CN110827792B CN 110827792 B CN110827792 B CN 110827792B CN 201911116654 A CN201911116654 A CN 201911116654A CN 110827792 B CN110827792 B CN 110827792B
Authority
CN
China
Prior art keywords
voice
broadcasted
word
sensitive
text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201911116654.XA
Other languages
Chinese (zh)
Other versions
CN110827792A (en
Inventor
宋夏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Shiyuan Electronics Thecnology Co Ltd
Guangzhou Shikun Electronic Technology Co Ltd
Original Assignee
Guangzhou Shiyuan Electronics Thecnology Co Ltd
Guangzhou Shikun Electronic Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou Shiyuan Electronics Thecnology Co Ltd, Guangzhou Shikun Electronic Technology Co Ltd filed Critical Guangzhou Shiyuan Electronics Thecnology Co Ltd
Priority to CN201911116654.XA priority Critical patent/CN110827792B/en
Publication of CN110827792A publication Critical patent/CN110827792A/en
Application granted granted Critical
Publication of CN110827792B publication Critical patent/CN110827792B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/225Feedback of the input speech
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The application provides a voice broadcasting method and device. The method comprises the following steps: the method comprises the steps of obtaining a text to be broadcasted by voice, determining whether the text to be broadcasted by voice contains sensitive words according to a sensitive word database, wherein the sensitive words are words with the same or similar pronunciation as a preset voice recognition command, and if the text to be broadcasted by voice contains the sensitive words, adjusting parameters of a voice recognition algorithm according to the sensitive words when the voice to be broadcasted corresponding to the text to be broadcasted by voice is broadcasted. Therefore, the probability of triggering voice misrecognition can be reduced, and the user experience is improved.

Description

Voice broadcasting method and device
Technical Field
The present application relates to the field of communications technologies, and in particular, to a voice broadcast method and apparatus.
Background
At present, many intelligent household electrical appliances all possess speech recognition and voice broadcast's function, in order to prevent that the voice broadcast content of self from triggering the speech recognition of self by mistake, among the prior art, usually through gathering the audio signal of loudspeaker broadcast as reference signal, use echo cancellation algorithm, "subtract" the audio signal of loudspeaker broadcast in the audio signal of microphone collection. However, since the indoor environments of the household appliances are different, and the structural positions of the speaker and the microphone of different household appliances are different, the echo cancellation algorithm cannot ensure that the audio signal played by the speaker is completely cancelled from the audio signal acquired by the microphone, so that the voice misrecognition is occasionally triggered, and the user experience is influenced.
Disclosure of Invention
The application provides a voice broadcasting method and device, which can reduce the probability of triggering voice misrecognition and improve user experience.
In a first aspect, the present application provides a voice broadcast method, including:
acquiring a text to be broadcasted in voice;
determining whether the text to be broadcasted in the voice contains a sensitive word according to a sensitive word database, wherein the sensitive word is a word with pronunciation the same as or similar to that of a preset voice recognition command;
and if the text to be broadcasted contains the sensitive words, adjusting parameters of a voice recognition algorithm according to the sensitive words when the voice to be broadcasted corresponding to the text to be broadcasted is broadcasted.
Optionally, adjusting parameters of a speech recognition algorithm according to the sensitive words when the speech to be broadcasted corresponding to the text to be broadcasted is broadcasted includes:
synthesizing the text to be broadcasted into the voice to be broadcasted, and extracting the time point of the occurrence of the sensitive words according to the voice to be broadcasted; playing the voice to be broadcasted, and adjusting the recognition threshold value of the voice recognition command corresponding to the sensitive word when the time point of the sensitive word is reached;
alternatively, the first and second electrodes may be,
acquiring voice synthesis parameters of the sensitive words from the sensitive word database, and adjusting the voice synthesis parameters of the sensitive words when synthesizing the text to be voice broadcast into the voice to be broadcast; playing the voice to be broadcasted;
alternatively, the first and second electrodes may be,
acquiring voice synthesis parameters of the sensitive words from the sensitive word database, adjusting the voice synthesis parameters of the sensitive words when synthesizing the text to be broadcasted into voice to be broadcasted, and extracting time points of the sensitive words according to the voice to be broadcasted; and playing the voice to be broadcasted, and adjusting the recognition threshold value of the voice recognition command corresponding to the sensitive word when the time point of the sensitive word is reached.
Optionally, before determining whether the text to be broadcasted by voice includes a sensitive word according to the sensitive word database, the method further includes:
and determining that the audio length of the text to be broadcasted after being synthesized into voice is larger than a preset threshold value.
Optionally, the method further includes:
if the audio length is smaller than the preset threshold value, closing a voice recognition algorithm, synthesizing the text to be broadcasted into voice to be broadcasted, and then broadcasting, and starting the voice recognition algorithm after the voice to be broadcasted is broadcasted.
Optionally, the method further includes:
monitoring whether voice recognition is triggered or not in the process of playing the voice to be broadcasted;
if the fact that the voice recognition is triggered is monitored, recording a playing word which triggers the voice recognition and a command word which is triggered by the playing word;
determining whether the playing word triggers voice recognition according to the playing word and the command word;
and if the fact that the played words trigger voice recognition is determined, storing the played words and the voice synthesis parameters of the played words into the sensitive word database.
Optionally, the determining whether the playing word triggers voice recognition according to the playing word and the command word includes:
synthesizing the played words into voice;
inputting the voice synthesized by the played words into the voice recognition algorithm, and calculating the matching scores of the played words and the command words through the voice recognition algorithm;
if the matching score is larger than a preset value, determining that the played word triggers voice recognition;
and if the matching score is smaller than a preset value, determining that the speech recognition is not triggered by the played word.
Optionally, the method further includes:
and if the playing words exist in the data sensitive word database, adjusting the voice synthesis parameters corresponding to the playing words stored in the sensitive word database.
In a second aspect, the present application provides a device for preventing voice broadcast from causing voice misrecognition, comprising:
the acquisition module is used for acquiring a text to be broadcasted in a voice mode;
the first determining module is used for determining whether the text to be broadcasted by voice contains a sensitive word according to a sensitive word database, wherein the sensitive word is a word with pronunciation the same as or similar to that of a preset voice recognition command;
and the processing module is used for determining that the text to be broadcasted contains the sensitive words in the determining module, and adjusting parameters of a voice recognition algorithm according to the sensitive words when the voice to be broadcasted corresponding to the text to be broadcasted is broadcasted.
Optionally, the processing module is configured to:
synthesizing the text to be broadcasted into the voice to be broadcasted, and extracting the time point of the occurrence of the sensitive words according to the voice to be broadcasted; playing the voice to be broadcasted, and adjusting the recognition threshold value of the voice recognition command corresponding to the sensitive word when the time point of the sensitive word is reached;
alternatively, the first and second electrodes may be,
acquiring voice synthesis parameters of the sensitive words from the sensitive word database, and adjusting the voice synthesis parameters of the sensitive words when synthesizing the text to be broadcasted into the voice to be broadcasted; playing the voice to be broadcasted;
alternatively, the first and second electrodes may be,
acquiring voice synthesis parameters of the sensitive words from the sensitive word database, adjusting the voice synthesis parameters of the sensitive words when synthesizing the text to be broadcasted into voice to be broadcasted, and extracting time points of the sensitive words according to the voice to be broadcasted; and playing the voice to be broadcasted, and adjusting the recognition threshold value of the voice recognition command corresponding to the sensitive word when the time point of the sensitive word is reached.
Optionally, the apparatus further comprises:
and the second determining module is used for determining that the audio length of the text to be broadcasted after being synthesized into voice is larger than a preset threshold value before the first determining module determines whether the text to be broadcasted contains the sensitive words according to the sensitive word database.
Optionally, the processing module is further configured to:
and when the second determining module determines that the audio length is smaller than the preset threshold value, closing a voice recognition algorithm, synthesizing the text to be broadcasted into voice to be broadcasted and then broadcasting the voice, and starting the voice recognition algorithm after the voice to be broadcasted is broadcasted.
Optionally, the apparatus further comprises:
the monitoring module is used for monitoring whether voice recognition is triggered or not in the process of playing the voice to be broadcasted;
the processing module is further configured to: if the monitoring module monitors that voice recognition is triggered, recording a playing word which triggers the voice recognition and a command word which triggers the playing word;
a third determining module, configured to determine whether voice recognition is triggered by the playing word according to the playing word and the command word;
the processing module is further configured to: and if the third determining module determines that the played word triggers voice recognition, storing the played word and the voice synthesis parameter of the played word into the sensitive word database.
Optionally, the third determining module is configured to:
synthesizing the played words into voice;
inputting the voice synthesized by the played word into the voice recognition algorithm, and calculating the matching score of the played word and the command word through the voice recognition algorithm;
if the matching score is larger than a preset value, determining that the played word triggers voice recognition;
and if the matching score is smaller than a preset value, determining that the speech recognition is not triggered by the played word.
Optionally, the processing module is further configured to:
and if the playing words exist in the data sensitive word database, adjusting the voice synthesis parameters corresponding to the playing words stored in the sensitive word database.
According to the voice broadcasting method and the voice broadcasting device, the text to be broadcasted is obtained, whether the text to be broadcasted contains the sensitive words or not is determined according to the sensitive word database, if the text to be broadcasted contains the sensitive words is determined, the parameters of the voice recognition algorithm are adjusted according to the sensitive words when the voice to be broadcasted corresponding to the text to be broadcasted is broadcasted, so that the probability of triggering voice misrecognition can be reduced, and the user experience is improved.
Drawings
In order to more clearly illustrate the technical solutions in the present application or the prior art, the drawings needed to be used in the description of the embodiments or the prior art will be briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present application, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without inventive exercise.
Fig. 1 is a flowchart of an embodiment of a voice broadcast method provided in the present application;
fig. 2 is a flowchart of an embodiment of a voice broadcast method provided in the present application;
fig. 3 is a schematic structural diagram of an embodiment of a voice broadcast device provided in the present application;
fig. 4 is a schematic structural diagram of an embodiment of a voice broadcast device provided in the present application;
fig. 5 is a schematic structural diagram of an embodiment of a voice broadcast device provided in the present application;
fig. 6 is a schematic diagram of a hardware structure of an electronic device provided in the present application.
Detailed Description
To make the purpose, technical solutions and advantages of the present application clearer, the technical solutions in the present application will be clearly and completely described below with reference to the drawings in the present application, and it is obvious that the described embodiments are some, but not all embodiments of the present application. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
In the existing intelligent household appliance, in order to prevent the voice broadcast content of the intelligent household appliance from triggering the voice recognition of the intelligent household appliance by mistake, the audio signal played by a loudspeaker is generally collected to be used as a reference signal, an echo cancellation algorithm is used, the audio signal played by the loudspeaker is subtracted from the audio signal collected by a microphone, but the echo cancellation algorithm cannot ensure that the audio signal played by the loudspeaker is completely cancelled from the audio signal collected by the microphone, so that the voice recognition by mistake can be triggered occasionally, and the user experience can be influenced. In order to solve the problem, the application provides a voice broadcasting method and device, whether a text to be subjected to voice broadcasting contains a sensitive word is determined according to a sensitive word database, if the text to be subjected to voice broadcasting contains the sensitive word is determined, parameters of a voice recognition algorithm are adjusted according to the sensitive word when voice to be broadcasted corresponding to the text to be subjected to voice broadcasting is broadcasted, so that the probability of triggering voice misrecognition can be reduced, and user experience is improved. The following describes a specific implementation process of the voice broadcast method according to the embodiment of the present application in detail by using specific embodiments with reference to the accompanying drawings.
Fig. 1 is a flowchart of an embodiment of a voice broadcast method provided in the present application, where an execution subject in the present embodiment may be an intelligent appliance, and as shown in fig. 1, the method of the present embodiment may include:
s101, obtaining a text to be subjected to voice broadcast.
In particular text, i.e. text content.
S102, determining whether the text to be subjected to voice broadcast contains a sensitive word according to the sensitive word database, wherein the sensitive word is a word with the same or similar pronunciation as the preset voice recognition command.
Specifically, the sensitive word refers to a word having the same or similar pronunciation as a preset voice recognition command, for example, the voice recognition command word "turn on light" is preset in the smart home appliance, and if the word having the same or similar pronunciation as the "turn on light" is "turn on light" for example, the word "turn on light" is a sensitive word. The sensitive word database can be determined according to a preset voice recognition command word list of the product in the development stage of the household appliance product, for example, words with similar pronunciations to the words such as 'turn on light' and 'turn on light' can be stored in the sensitive word database. If the voice misrecognition is triggered, the words which trigger the voice misrecognition can be stored in a sensitive word database.
Wherein, S102 may specifically be: and searching words which are the same as the words in the text to be broadcasted in voice from the sensitive word database.
S103, if the text to be broadcasted contains the sensitive words, adjusting parameters of a voice recognition algorithm according to the sensitive words when the voice to be broadcasted corresponding to the text to be broadcasted is broadcasted.
Specifically, if it is determined that the text to be subjected to voice broadcast contains the sensitive word, the parameters of the voice recognition algorithm are adjusted according to the sensitive word when the voice to be broadcast corresponding to the text to be subjected to voice broadcast is played, and the embodiment has three implementable modes:
the method comprises the steps of synthesizing a text to be broadcasted into a voice to be broadcasted, extracting a time point of occurrence of a sensitive word according to the voice to be broadcasted, then broadcasting the voice to be broadcasted, and adjusting a recognition threshold value of a voice recognition command corresponding to the sensitive word when the time point of occurrence of the sensitive word is reached.
Specifically, a Text-to-Speech (TTS) Speech synthesis technology may be used to synthesize a Text to be broadcasted into a Speech to be broadcasted, and when the Speech to be broadcasted is broadcasted and a time point at which a sensitive word appears is reached, a recognition threshold of a Speech recognition command corresponding to the sensitive word is adjusted, and if the sensitive word "turn on light" is to be broadcasted, the recognition threshold of the Speech recognition command word "turn on light" in a Speech recognition algorithm is synchronously increased, so as to prevent the sensitive word from being falsely triggered for Speech recognition.
And secondly, acquiring the voice synthesis parameters of the sensitive words from the sensitive word database, adjusting the voice synthesis parameters of the sensitive words when synthesizing the text to be broadcasted into the voice to be broadcasted, and then broadcasting the voice to be broadcasted.
The speech synthesis parameters include volume, speech rate and pitch. The sensitive word database stores the sensitive words and the speech synthesis parameters of the sensitive words, and when synthesizing the text to be broadcasted into the speech to be broadcasted, the speech synthesis parameters of the sensitive words are adjusted, such as reducing the volume, increasing the speed of speech, increasing or decreasing the pitch, etc., and the adjustment method can be various, such as multiplying the fixed proportionality coefficient each time, for example, adjusting the volume to 95% of the original volume, increasing the speed of speech to 105% of the original volume, etc. Thus, the probability of triggering speech misrecognition can be reduced.
Acquiring voice synthesis parameters of the sensitive words from the sensitive word database, adjusting the voice synthesis parameters of the sensitive words when synthesizing the text to be broadcasted into the voice to be broadcasted, and extracting the time points of the appearance of the sensitive words according to the voice to be broadcasted; and then playing the voice to be broadcasted, and adjusting the recognition threshold value of the voice recognition command corresponding to the sensitive word when the time point of the occurrence of the sensitive word is reached.
In the third mode, the first mode and the second mode are executed simultaneously, the voice synthesis parameters of the sensitive words are adjusted when the text to be broadcasted is synthesized into the voice to be broadcasted, and the recognition threshold value of the voice recognition command corresponding to the sensitive words is adjusted when the time point of the appearance of the sensitive words is reached when the voice to be broadcasted is broadcasted. Thereby further reducing the probability of triggering speech misrecognition.
Further, before determining whether the text to be voice-broadcasted includes the sensitive word according to the sensitive word database in S102, the method of this embodiment may further include:
and S104, determining that the audio length of the text to be subjected to voice broadcast after being synthesized into voice is larger than a preset threshold value. The preset threshold is, for example, 2 seconds or another value, that is, if it is determined that the audio length after the text to be broadcasted is synthesized into the voice is greater than the preset threshold, S102 is executed next.
And S105, if the audio length is determined to be smaller than the preset threshold value, closing the voice recognition algorithm, synthesizing the text to be broadcasted into the voice to be broadcasted, then broadcasting, and starting the voice recognition algorithm after the voice to be broadcasted is broadcasted. In the embodiment, when the audio length is determined to be smaller than the preset threshold, the voice recognition algorithm is closed, so that the voice recognition error caused by playing the audio can be prevented. And the voice recognition algorithm is directly closed for the short voice, so that the voice recognition error caused by playing the audio can be prevented.
Through the method, the probability of triggering the voice misrecognition in the voice broadcasting process can be reduced, and if the voice misrecognition is triggered in the broadcasting process, further, the method of the embodiment can also comprise the following steps:
and S106, monitoring whether voice recognition is triggered or not in the process of playing the voice to be broadcasted.
And S107, if the voice recognition is triggered, recording the playing words which trigger the voice recognition and command words which trigger the playing words, such as recording 'large light on' and 'light on'.
And S108, determining whether the playing word triggers voice recognition according to the playing word and the command word.
Wherein, S108 may specifically be: synthesizing the played words into voice;
inputting the voice synthesized by the played words into a voice recognition algorithm, and calculating matching scores of the played words and the command words through the voice recognition algorithm;
if the matching score is larger than a preset value, determining that the speech recognition is triggered by the played word;
and if the matching score is smaller than the preset value, determining that the speech recognition is not triggered by the played word.
And S109, if the fact that the played word triggers the voice recognition is determined, storing the played word and the voice synthesis parameter of the played word into a sensitive word database.
If the playing words exist in the data sensitive word database, adjusting the voice synthesis parameters corresponding to the playing words stored in the sensitive word database. For example, the volume is decreased, the speech rate is increased, the pitch is increased or decreased, etc., and the adjustment method may be many, for example, multiplying by a fixed scale factor each time, such as adjusting the volume to 95% of the original volume, increasing the speech rate to 105% of the original volume, etc., thereby decreasing the probability that the sensitive word is misrecognized by the speech.
According to the voice broadcasting method provided by the embodiment, the text to be subjected to voice broadcasting is obtained, whether the text to be subjected to voice broadcasting contains the sensitive words or not is determined according to the sensitive word database, and if the text to be subjected to voice broadcasting contains the sensitive words, the parameters of the voice recognition algorithm are adjusted according to the sensitive words when the voice to be broadcasted corresponding to the text to be subjected to voice broadcasting is broadcasted, so that the probability of triggering voice misrecognition can be reduced, and the user experience is improved.
The following describes the technical solution of the embodiment of the method shown in fig. 1 in detail by using a specific embodiment.
Fig. 2 is a flowchart of an embodiment of a voice broadcast method provided in the present application, and as shown in fig. 2, the method of the present embodiment may include:
s201, obtaining a text to be broadcasted in voice.
S202, determining whether the audio length of the text to be subjected to voice broadcast after being synthesized into voice is larger than a preset threshold value. If not, S203 is executed, and if yes, S204 is executed.
S203, closing the voice recognition algorithm, synthesizing the text to be broadcasted into voice to be broadcasted, and then broadcasting, and starting the voice recognition algorithm after the voice to be broadcasted is broadcasted.
And S204, determining whether the text to be broadcasted in the voice contains the sensitive words according to the sensitive word database, wherein the sensitive words are words with pronunciation the same as or similar to that of a preset voice recognition command.
And S205, if the text to be broadcasted contains the sensitive words, adjusting parameters of a voice recognition algorithm according to the sensitive words when the voice to be broadcasted corresponding to the text to be broadcasted is broadcasted.
Specifically, the parameters of the voice recognition algorithm are adjusted according to the sensitive words when the voice to be broadcasted corresponding to the text to be broadcasted is broadcasted, and three implementable modes are provided:
the method comprises the steps of synthesizing a text to be broadcasted into a voice to be broadcasted, extracting a time point of occurrence of a sensitive word according to the voice to be broadcasted, then broadcasting the voice to be broadcasted, and adjusting a recognition threshold value of a voice recognition command corresponding to the sensitive word when the time point of occurrence of the sensitive word is reached. The sensitive word can be prevented from being triggered by mistake in speech recognition.
And secondly, acquiring the voice synthesis parameters of the sensitive words from the sensitive word database, adjusting the voice synthesis parameters of the sensitive words when synthesizing the text to be broadcasted into the voice to be broadcasted, and then broadcasting the voice to be broadcasted. Thus, the probability of triggering speech misrecognition can be reduced.
Acquiring voice synthesis parameters of the sensitive words from the sensitive word database, adjusting the voice synthesis parameters of the sensitive words when synthesizing the text to be broadcasted into the voice to be broadcasted, and extracting the time points of the appearance of the sensitive words according to the voice to be broadcasted;
and playing the voice to be broadcasted, and adjusting the recognition threshold value of the voice recognition command corresponding to the sensitive word when the time point of the occurrence of the sensitive word is reached. In the third mode, the first mode and the second mode are executed simultaneously, so that the probability of triggering the voice misrecognition can be further reduced.
For the detailed description of the above three embodiments, reference may be made to the description in the embodiment shown in fig. 1, and details are not repeated here.
And S206, monitoring that voice recognition is triggered in the process of playing the voice to be broadcasted.
And S207, recording the playing words which trigger the voice recognition and the command words triggered by the playing words, such as recording 'large light on' and 'light on'.
And S208, determining that the playing words trigger voice recognition according to the playing words and the command words.
Wherein, S208 may specifically be: synthesizing the played words into voice;
inputting the voice synthesized by the played words into a voice recognition algorithm, and calculating matching scores of the played words and the command words through the voice recognition algorithm;
if the matching score is larger than a preset value, determining that the speech recognition is triggered by the played word;
and if the matching score is smaller than the preset value, determining that the speech recognition is not triggered by the played word.
S209, if the fact that the played words trigger voice recognition is determined, storing the played words and voice synthesis parameters of the played words into a sensitive word database.
And if the playing words exist in the data sensitive word database, adjusting the speech synthesis parameters corresponding to the playing words stored in the sensitive word database. For example, the volume is decreased, the speech rate is increased, the pitch is increased or decreased, etc., and the adjustment method may be many, for example, multiplying by a fixed scale factor each time, such as adjusting the volume to 95% of the original volume, increasing the speech rate to 105% of the original volume, etc., thereby decreasing the probability that the sensitive word is misrecognized by the speech.
Fig. 3 is a schematic structural diagram of an embodiment of a voice broadcast device provided in the present application, and as shown in fig. 3, the device of the present embodiment may include: an acquisition module 11, a first determination module 12 and a processing module 13, wherein,
the acquisition module 11 is used for acquiring a text to be broadcasted by voice;
the first determining module 12 is configured to determine whether a text to be voice broadcast contains a sensitive word according to the sensitive word database, where the sensitive word is a word having a pronunciation the same as or similar to that of a preset voice recognition command;
the processing module 13 is configured to determine, at the determining module, that the text to be subjected to voice broadcast includes a sensitive word, and adjust a parameter of a voice recognition algorithm according to the sensitive word when the voice to be broadcast corresponding to the text to be subjected to voice broadcast is played.
Further, the processing module 13 is configured to:
synthesizing a text to be broadcasted into a voice to be broadcasted, and extracting a time point of occurrence of a sensitive word according to the voice to be broadcasted; playing the voice to be broadcasted, and adjusting the recognition threshold value of the voice recognition command corresponding to the sensitive words when the time point of the occurrence of the sensitive words is reached;
alternatively, the first and second electrodes may be,
acquiring voice synthesis parameters of the sensitive words from the sensitive word database, and adjusting the voice synthesis parameters of the sensitive words when synthesizing the text to be broadcasted into the voice to be broadcasted; playing the voice to be broadcasted;
alternatively, the first and second electrodes may be,
acquiring voice synthesis parameters of the sensitive words from a sensitive word database, adjusting the voice synthesis parameters of the sensitive words when synthesizing a text to be broadcasted into voice to be broadcasted, and extracting time points of the occurrence of the sensitive words according to the voice to be broadcasted; and playing the voice to be broadcasted, and adjusting the recognition threshold value of the voice recognition command corresponding to the sensitive word when the time point of the occurrence of the sensitive word is reached.
The apparatus provided in the embodiment of the present application may implement the method embodiment, and specific implementation principles and technical effects thereof may be referred to the method embodiment, which is not described herein again.
Fig. 4 is a schematic structural diagram of an embodiment of a voice broadcast device provided in the present application, and as shown in fig. 4, the device of the present embodiment may further include, on the basis of the device shown in fig. 3: the second determining module 14 is configured to determine that an audio length of a text to be subjected to voice broadcast after being synthesized into voice is greater than a preset threshold before the first determining module 12 determines whether the text to be subjected to voice broadcast includes a sensitive word according to the sensitive word database.
Optionally, the processing module 13 is further configured to:
when the second determining module 14 determines that the audio length is smaller than the preset threshold, the voice recognition algorithm is turned off, the text to be broadcasted is synthesized into the voice to be broadcasted and then broadcasted, and the voice recognition algorithm is turned on after the voice to be broadcasted is broadcasted.
The apparatus provided in the embodiment of the present application may implement the method embodiment, and specific implementation principles and technical effects thereof may be referred to the method embodiment, which is not described herein again.
Fig. 5 is a schematic structural diagram of an embodiment of a voice broadcasting device provided in the present application, and as shown in fig. 5, the device of the present embodiment may further include, on the basis of the device shown in fig. 3: the device comprises a monitoring module 15 and a third determining module 16, wherein the monitoring module 15 is used for monitoring whether voice recognition is triggered or not in the process of playing the voice to be broadcasted;
the processing module 13 is further configured to: if the monitoring module 15 monitors that the voice recognition is triggered, recording a playing word which triggers the voice recognition and a command word which triggers the playing word;
the third determining module 16 is configured to determine whether the playing word triggers voice recognition according to the playing word and the command word;
the processing module 13 is further configured to: if the third determining module 16 determines that the played word triggers speech recognition, the played word and the speech synthesis parameter of the played word are stored in the sensitive word database.
Further, the third determining module 16 is configured to:
synthesizing the played words into voice;
inputting the voice synthesized by the played words into a voice recognition algorithm, and calculating matching scores of the played words and the command words through the voice recognition algorithm;
if the matching score is larger than a preset value, determining that the speech recognition is triggered by the played word;
and if the matching score is smaller than the preset value, determining that the speech recognition is not triggered by the played word.
Further, the processing module 13 is further configured to:
and if the playing words exist in the data sensitive word database, adjusting the speech synthesis parameters corresponding to the playing words stored in the sensitive word database.
The apparatus provided in the embodiment of the present application may implement the method embodiment, and specific implementation principles and technical effects thereof may be referred to the method embodiment, which is not described herein again.
Fig. 6 is a schematic diagram of a hardware structure of an electronic device provided in the present application. As shown in fig. 6, the electronic device 60 of the present embodiment, may include: a memory 61 and a processor 62;
a memory 61 for storing a computer program;
and a processor 62 for executing the computer program stored in the memory to implement the voice broadcasting method in the above embodiments. Reference may be made in particular to the description relating to the method embodiments described above.
Alternatively, the memory 61 may be separate or integrated with the processor 62.
When the memory 61 is a device separate from the processor 62, the electronic device 60 may further include:
a bus 63 for connecting the memory 61 and the processor 62.
Optionally, this embodiment further includes: a communication interface 64, the communication interface 64 being connectable to the processor 62 via a bus 63. Processor 62 may control communication interface 63 to perform the above-described receiving and transmitting functions of electronic device 60.
The electronic device provided by this embodiment can be used to execute the above method, and its implementation manner and technical effect are similar, and this embodiment is not described herein again.
The present application also provides a computer-readable storage medium, on which a computer program is stored, and when the computer program is executed by a processor, the voice broadcasting method in the above embodiment is implemented.
In the several embodiments provided in the present application, it should be understood that the disclosed apparatus and method may be implemented in other ways. For example, the above-described embodiments of the apparatus are merely illustrative, and for example, the division of modules is only one logical division, and other divisions may be realized in practice, for example, a plurality of modules may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or modules, and may be in an electrical, mechanical or other form.
Modules described as separate parts may or may not be physically separate, and parts displayed as modules may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment.
In addition, functional modules in the embodiments of the present application may be integrated into one processing unit, or each module may exist alone physically, or two or more modules are integrated into one unit. The unit formed by the modules can be realized in a hardware form, and can also be realized in a form of hardware and a software functional unit.
The integrated module implemented in the form of a software functional module may be stored in a computer-readable storage medium. The software functional module is stored in a storage medium and includes several instructions to enable a computer device (which may be a personal computer, a server, or a network device) or a processor (processor) to execute some steps of the methods according to the embodiments of the present application.
It should be understood that the Processor may be a Central Processing Unit (CPU), other general purpose Processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), etc. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like. The steps of a method disclosed in connection with the present invention may be embodied directly in a hardware processor, or in a combination of the hardware and software modules within the processor.
The memory may comprise a high-speed RAM memory, and may further comprise a non-volatile storage NVM, such as at least one disk memory, and may also be a usb disk, a removable hard disk, a read-only memory, a magnetic or optical disk, etc.
The bus may be an Industry Standard Architecture (ISA) bus, a Peripheral Component Interconnect (PCI) bus, an Extended ISA (EISA) bus, or the like. The bus may be divided into an address bus, a data bus, a control bus, etc. For ease of illustration, the buses in the figures of the present application are not limited to only one bus or one type of bus.
The computer-readable storage medium may be implemented by any type or combination of volatile or non-volatile memory devices, such as Static Random Access Memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic or optical disks. A storage media may be any available media that can be accessed by a general purpose or special purpose computer.
Those of ordinary skill in the art will understand that: all or a portion of the steps of implementing the above-described method embodiments may be performed by hardware associated with program instructions. The program may be stored in a computer-readable storage medium. When executed, the program performs steps comprising the method embodiments described above; and the aforementioned storage medium includes: various media that can store program codes, such as ROM, RAM, magnetic or optical disks.
Finally, it should be noted that: the above embodiments are only used for illustrating the technical solutions of the present application, and not for limiting the same; although the present application has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; and the modifications or the substitutions do not make the essence of the corresponding technical solutions depart from the scope of the technical solutions of the embodiments of the present application.

Claims (12)

1. A voice broadcast method, comprising:
acquiring a text to be broadcasted in voice;
determining that the audio length of the text to be broadcasted after being synthesized into voice is larger than a preset threshold;
determining whether a text to be subjected to voice broadcast contains a sensitive word according to a sensitive word database, wherein the sensitive word is a word with the same or similar pronunciation as a preset voice recognition command;
if the text to be broadcasted contains the sensitive words, adjusting parameters of a voice recognition algorithm according to the sensitive words when the voice to be broadcasted corresponding to the text to be broadcasted is broadcasted;
if the audio length is smaller than the preset threshold value, closing a voice recognition algorithm, synthesizing the text to be broadcasted into voice to be broadcasted, and then broadcasting, and starting the voice recognition algorithm after the voice to be broadcasted is broadcasted.
2. The method according to claim 1, wherein the adjusting parameters of a speech recognition algorithm according to the sensitive words when playing the speech to be broadcasted corresponding to the text to be broadcasted comprises:
synthesizing the text to be broadcasted into the voice to be broadcasted, and extracting the time point of the occurrence of the sensitive words according to the voice to be broadcasted; playing the voice to be broadcasted, and adjusting the recognition threshold value of the voice recognition command corresponding to the sensitive word when the time point of the sensitive word is reached;
alternatively, the first and second electrodes may be,
acquiring voice synthesis parameters of the sensitive words from the sensitive word database, and adjusting the voice synthesis parameters of the sensitive words when synthesizing the text to be voice broadcast into the voice to be broadcast; playing the voice to be broadcasted;
alternatively, the first and second electrodes may be,
acquiring voice synthesis parameters of the sensitive words from the sensitive word database, adjusting the voice synthesis parameters of the sensitive words when synthesizing the text to be broadcasted into voice to be broadcasted, and extracting time points of the sensitive words according to the voice to be broadcasted; and playing the voice to be broadcasted, and adjusting the recognition threshold value of the voice recognition command corresponding to the sensitive word when the time point of the sensitive word is reached.
3. The method of claim 1, further comprising:
monitoring whether voice recognition is triggered or not in the process of playing the voice to be broadcasted;
if the fact that the voice recognition is triggered is monitored, recording a playing word which triggers the voice recognition and a command word which is triggered by the playing word;
determining whether the playing word triggers voice recognition according to the playing word and the command word;
and if the fact that the played words trigger voice recognition is determined, storing the played words and the voice synthesis parameters of the played words into the sensitive word database.
4. The method of claim 3, wherein the determining whether the played word triggers speech recognition based on the played word and the command word comprises:
synthesizing the played words into voice;
inputting the voice synthesized by the played word into the voice recognition algorithm, and calculating the matching score of the played word and the command word through the voice recognition algorithm;
if the matching score is larger than a preset value, determining that the played word triggers voice recognition;
and if the matching score is smaller than a preset value, determining that the speech recognition is not triggered by the played word.
5. The method according to claim 3 or 4, characterized in that the method further comprises:
and if the playing words exist in the data sensitive word database, adjusting the voice synthesis parameters corresponding to the playing words stored in the sensitive word database.
6. A voice broadcast device, comprising:
the acquisition module is used for acquiring a text to be broadcasted in voice;
the first determining module is used for determining whether the text to be broadcasted by voice contains a sensitive word according to a sensitive word database, wherein the sensitive word is a word with pronunciation the same as or similar to that of a preset voice recognition command;
the processing module is used for determining that the text to be broadcasted contains the sensitive words in the determining module, and adjusting parameters of a voice recognition algorithm according to the sensitive words when the voice to be broadcasted corresponding to the text to be broadcasted is broadcasted;
the second determining module is used for determining that the audio length of the text to be broadcasted after being synthesized into voice is larger than a preset threshold value before the first determining module determines whether the text to be broadcasted contains the sensitive words according to the sensitive word database;
the processing module is further configured to:
and when the second determining module determines that the audio length is smaller than the preset threshold value, closing a voice recognition algorithm, synthesizing the text to be broadcasted into voice to be broadcasted and then broadcasting the voice, and starting the voice recognition algorithm after the voice to be broadcasted is broadcasted.
7. The apparatus of claim 6, wherein the processing module is configured to:
synthesizing the text to be broadcasted into the voice to be broadcasted, and extracting the time point of the occurrence of the sensitive words according to the voice to be broadcasted; playing the voice to be broadcasted, and adjusting the recognition threshold value of the voice recognition command corresponding to the sensitive word when the time point of the sensitive word is reached;
alternatively, the first and second electrodes may be,
acquiring voice synthesis parameters of the sensitive words from the sensitive word database, and adjusting the voice synthesis parameters of the sensitive words when synthesizing the text to be broadcasted into the voice to be broadcasted; playing the voice to be broadcasted;
alternatively, the first and second electrodes may be,
acquiring voice synthesis parameters of the sensitive words from the sensitive word database, adjusting the voice synthesis parameters of the sensitive words when synthesizing the text to be broadcasted into voice to be broadcasted, and extracting time points of the sensitive words according to the voice to be broadcasted; and playing the voice to be broadcasted, and adjusting the recognition threshold value of the voice recognition command corresponding to the sensitive word when the time point of the sensitive word is reached.
8. The apparatus of claim 6, further comprising:
the monitoring module is used for monitoring whether voice recognition is triggered or not in the process of playing the voice to be broadcasted;
the processing module is further configured to: if the monitoring module monitors that voice recognition is triggered, recording a playing word which triggers the voice recognition and a command word which triggers the playing word;
a third determining module, configured to determine whether voice recognition is triggered by the playing word according to the playing word and the command word;
the processing module is further configured to: and if the third determining module determines that the played word triggers voice recognition, storing the played word and the voice synthesis parameter of the played word into the sensitive word database.
9. The apparatus of claim 8, wherein the third determining module is configured to:
synthesizing the played words into voice;
inputting the voice synthesized by the played word into the voice recognition algorithm, and calculating the matching score of the played word and the command word through the voice recognition algorithm;
if the matching score is larger than a preset value, determining that the played word triggers voice recognition;
and if the matching score is smaller than a preset value, determining that the speech recognition is not triggered by the played word.
10. The apparatus of claim 8 or 9, wherein the processing module is further configured to:
if the played word exists in the data sensitive word database, adjusting the voice synthesis parameters corresponding to the played word stored in the sensitive word database.
11. A computer-readable storage medium on which a computer program is stored, the computer program being characterized by implementing the voice broadcasting method according to any one of claims 1 to 5 when executed by a processor.
12. An electronic device, comprising:
a processor; and
a memory for storing executable instructions of the processor;
wherein the processor is configured to perform the voice broadcasting method of any one of claims 1 to 5 via execution of the executable instructions.
CN201911116654.XA 2019-11-15 2019-11-15 Voice broadcasting method and device Active CN110827792B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911116654.XA CN110827792B (en) 2019-11-15 2019-11-15 Voice broadcasting method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911116654.XA CN110827792B (en) 2019-11-15 2019-11-15 Voice broadcasting method and device

Publications (2)

Publication Number Publication Date
CN110827792A CN110827792A (en) 2020-02-21
CN110827792B true CN110827792B (en) 2022-06-03

Family

ID=69555418

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911116654.XA Active CN110827792B (en) 2019-11-15 2019-11-15 Voice broadcasting method and device

Country Status (1)

Country Link
CN (1) CN110827792B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112435668A (en) * 2020-11-06 2021-03-02 联想(北京)有限公司 Voice recognition method, device and storage medium
CN112542168B (en) * 2020-12-08 2024-06-11 维沃移动通信有限公司 Voice control method and device
CN116072123B (en) * 2023-03-06 2023-06-23 南昌航天广信科技有限责任公司 Broadcast information playing method and device, readable storage medium and electronic equipment

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103208283A (en) * 2012-01-11 2013-07-17 三星电子株式会社 Method and apparatus for executing a user function by using voice recognition
CN105227656A (en) * 2015-09-28 2016-01-06 百度在线网络技术(北京)有限公司 Based on information-pushing method and the device of speech recognition
CN106409294A (en) * 2016-10-18 2017-02-15 广州视源电子科技股份有限公司 Method and device for preventing voice command from being recognized by mistake
CN106611597A (en) * 2016-12-02 2017-05-03 百度在线网络技术(北京)有限公司 Voice wakeup method and voice wakeup device based on artificial intelligence
CN107393526A (en) * 2017-07-19 2017-11-24 腾讯科技(深圳)有限公司 Speech silence detection method, device, computer equipment and storage medium
CN108831459A (en) * 2018-05-30 2018-11-16 出门问问信息科技有限公司 Audio recognition method and device
CN108831477A (en) * 2018-06-14 2018-11-16 出门问问信息科技有限公司 A kind of audio recognition method, device, equipment and storage medium
JP2019086599A (en) * 2017-11-03 2019-06-06 アルパイン株式会社 Voice recognition device
CN109887507A (en) * 2019-04-22 2019-06-14 成都启英泰伦科技有限公司 A method of reducing comparable speech order word false recognition rate

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101614756B1 (en) * 2014-08-22 2016-04-27 현대자동차주식회사 Apparatus of voice recognition, vehicle and having the same, method of controlling the vehicle
US20170337922A1 (en) * 2016-05-19 2017-11-23 Julia Komissarchik System and methods for modifying user pronunciation to achieve better recognition results

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103208283A (en) * 2012-01-11 2013-07-17 三星电子株式会社 Method and apparatus for executing a user function by using voice recognition
CN105227656A (en) * 2015-09-28 2016-01-06 百度在线网络技术(北京)有限公司 Based on information-pushing method and the device of speech recognition
CN106409294A (en) * 2016-10-18 2017-02-15 广州视源电子科技股份有限公司 Method and device for preventing voice command from being recognized by mistake
CN106611597A (en) * 2016-12-02 2017-05-03 百度在线网络技术(北京)有限公司 Voice wakeup method and voice wakeup device based on artificial intelligence
CN107393526A (en) * 2017-07-19 2017-11-24 腾讯科技(深圳)有限公司 Speech silence detection method, device, computer equipment and storage medium
JP2019086599A (en) * 2017-11-03 2019-06-06 アルパイン株式会社 Voice recognition device
CN108831459A (en) * 2018-05-30 2018-11-16 出门问问信息科技有限公司 Audio recognition method and device
CN108831477A (en) * 2018-06-14 2018-11-16 出门问问信息科技有限公司 A kind of audio recognition method, device, equipment and storage medium
CN109887507A (en) * 2019-04-22 2019-06-14 成都启英泰伦科技有限公司 A method of reducing comparable speech order word false recognition rate

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Robust speech recognition for similar pronunciation phrases using MMSE under noise environments;Masumi Watanabe;《2013 13th International Symposium on Communications and Information Technologies (ISCIT)》;20131024;802-807 *
基于语音控制的智能电动汽车研究;敖勤;《通信电源技术》;20191025;26-27 *

Also Published As

Publication number Publication date
CN110827792A (en) 2020-02-21

Similar Documents

Publication Publication Date Title
CN110827792B (en) Voice broadcasting method and device
US9196247B2 (en) Voice recognition method and voice recognition apparatus
WO2017031846A1 (en) Noise elimination and voice recognition method, apparatus and device, and non-volatile computer storage medium
CN108335700B (en) Voice adjusting method and device, voice interaction equipment and storage medium
CN111161728B (en) Awakening method, awakening device, awakening equipment and awakening medium of intelligent equipment
US9374651B2 (en) Sensitivity calibration method and audio device
CN111968644B (en) Intelligent device awakening method and device and electronic device
JP6587742B2 (en) Sound mixing processing method and apparatus, apparatus, and storage medium
CN110782891A (en) Audio processing method and device, computing equipment and storage medium
CN113707183B (en) Audio processing method and device in video
CN111386566A (en) Device control method, cloud device, intelligent device, computer medium and device
CN107948854B (en) Operation audio generation method and device, terminal and computer readable medium
CN108320757A (en) Distribution information reminding method, device, intelligent sound box and storage medium
CN112017622B (en) Audio data alignment method, device, equipment and storage medium
WO2019041871A1 (en) Voice object recognition method and device
CN111540357A (en) Voice processing method, device, terminal, server and storage medium
EP4254400A1 (en) Method and device for determining user intent
CN111145748A (en) Audio recognition confidence determining method, device, equipment and storage medium
CN105895098A (en) Play control method and device
CN112509556B (en) Voice awakening method and device
CN112151025A (en) Volume adjusting method, device, equipment and storage medium
JP6044490B2 (en) Information processing apparatus, speech speed data generation method, and program
CN112781185A (en) Air conditioner, control method thereof, and computer-readable storage medium
CN110853633A (en) Awakening method and device
CN112118511A (en) Earphone noise reduction method and device, earphone and computer readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant