WO2019033987A1 - Procédé et appareil d'invite, support d'informations et terminal - Google Patents
Procédé et appareil d'invite, support d'informations et terminal Download PDFInfo
- Publication number
- WO2019033987A1 WO2019033987A1 PCT/CN2018/099662 CN2018099662W WO2019033987A1 WO 2019033987 A1 WO2019033987 A1 WO 2019033987A1 CN 2018099662 W CN2018099662 W CN 2018099662W WO 2019033987 A1 WO2019033987 A1 WO 2019033987A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sound
- preset
- user
- analysis result
- call
- Prior art date
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M19/00—Current supply arrangements for telephone systems
- H04M19/02—Current supply arrangements for telephone systems providing ringing current or supervisory tones, e.g. dialling tone or busy tone
- H04M19/04—Current supply arrangements for telephone systems providing ringing current or supervisory tones, e.g. dialling tone or busy tone the ringing-current being generated at the substations
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/72—Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
- H04M1/725—Cordless telephones
Definitions
- the embodiments of the present application relate to the field of terminal technologies, for example, to a prompting method, device, storage medium, and terminal.
- call or recording functions such as mobile phones, tablets, music players, and voice recorders.
- the terminal implements a call or recording function through an integrated microphone (microphone, mic), also known as a microphone.
- a microphone is an energy conversion device that converts a sound signal into an electrical signal. When the vibration of the sound is transmitted to the diaphragm of the microphone, the magnet inside changes a current signal, and the current signal is processed by the sound processing circuit. Transfer to the opposite end of the call or store it for call or recording.
- the embodiment of the present application provides a prompting method, device, storage medium, and terminal, which can automatically prompt the user to adjust the call or recording state during a call or recording process.
- the embodiment of the present application provides a prompting method, including:
- the user is prompted to adjust the call state or the recording state.
- the embodiment of the present application provides a prompting apparatus, including:
- the audio signal collecting module is configured to control the microphone to collect the audio signal when the preset call event is detected or the preset recording event is triggered;
- An acoustic sound analysis module configured to analyze an acoustic sound level corresponding to the audio signal
- the prompt module is set to prompt the user to adjust the call state or the recording state according to the analysis result.
- the embodiment of the present application provides a computer readable storage medium, where the computer program stores a computer program, and when the program is executed by the processor, the prompting method as described in the embodiment of the present application is implemented.
- the embodiment of the present application provides a terminal, including a microphone, a memory, a processor, and a computer program stored in the memory and executable by the processor, and the processor executes the computer program to implement the embodiment of the present application. Prompt method.
- the embodiment of the present application further provides a computer program product, the computer program product comprising a computer program stored on a non-transitory computer readable storage medium, the computer program comprising program instructions, when the program instructions are executed by a computer
- the computer is caused to perform any of the above methods.
- the prompting solution provided in the embodiment of the present application analyzes the sound and sound level corresponding to the audio signal collected by the microphone during the call or recording process, and prompts the user to adjust the call state or the recording state according to the analysis result.
- the user can promptly and accurately prompt the user to adjust the call state or the recording state, and can maintain a good call effect or recording effect.
- FIG. 1 is a schematic flow chart of a prompting method provided by an embodiment.
- FIG. 2 is a schematic structural diagram of an audio processing hardware system of a smart phone according to an embodiment.
- FIG. 3 is a block diagram of an audio system architecture provided by an embodiment.
- FIG. 4 is a schematic flow chart of another prompting method provided by an embodiment.
- FIG. 5 is a schematic flow chart of another prompting method provided by an embodiment.
- FIG. 6 is a schematic flow chart of another prompting method provided by an embodiment.
- FIG. 7 is a structural block diagram of a prompting apparatus according to an embodiment.
- FIG. 8 is a schematic structural diagram of a terminal according to an embodiment.
- FIG. 1 is a schematic flowchart diagram of a schematic diagram of a prompting method according to an embodiment of the present disclosure.
- the method may be performed by a prompting apparatus, where the apparatus may be implemented by software and/or hardware, and generally Integrated in the terminal.
- the method includes:
- Step 110 When detecting a preset call event or a preset recording event is triggered, the control microphone collects an audio signal.
- the terminal in the embodiment of the present application may include a device configured with a microphone, such as a mobile phone, a tablet computer, a music player, and a voice recorder.
- the microphone can be either built-in or external.
- a microphone (referred to as a microphone, also known as a microphone or a microphone) is an energy conversion device that converts a sound signal into an electrical signal. When the vibration of the sound is transmitted to the diaphragm of the microphone, the magnet inside changes a current signal. The current signal is processed by the sound processing circuit and transmitted to the opposite end of the call or stored, thereby implementing a call or recording.
- the embodiment of the present application does not limit the type, the number, and the location of the microphone. For example, for a mobile phone, it may be at least one electret microphone disposed on the lower side of the mobile phone.
- the preset call event may be a call event having the prompt function in the embodiment of the present application
- the preset recording event may be a recording event having the prompt function in the embodiment of the present application.
- a call event when a call event is detected when the prompt function is in an open state, it may be determined that a preset call event is detected or a recording event is detected when the prompt function is turned on, and it may be determined that the preset recording event is detected.
- the call event is, for example, a phone call or a voice chat call
- the recording event is, for example, to start recording.
- a smart phone is taken as an example to briefly introduce the audio processing hardware system and system architecture.
- the audio processing circuit is generally in the main control board.
- the location of the audio processing circuit may be different due to the different designs of different mobile phones.
- the audio processing circuit of the smartphone includes an audio signal processing circuit, a baseband signal processing circuit, an audio power amplifier, a headphone signal amplifier, an earpiece, a speaker, a microphone, and a headphone interface. Among them, the audio signal processing circuit is the core of the entire audio processing circuit.
- the audio processing circuit is composed of a receiving audio circuit, a sending circuit, a headphone talking circuit, and the like, including analog/digital (A/D) conversion, digital/analog (A/D) conversion, digital voice signal processing, and analog audio amplification of analog audio. Circuits, etc.
- the local microphone is first called to convert the sound acoustic signal of the sound into an analog audio signal, amplified by an analog audio amplifier circuit, and subjected to A/D conversion by an internal multimode converter to obtain a digital audio signal;
- the digital audio signal is sent to the baseband processor for speech encoding, channel encoding, etc.; a series of processing such as encryption and interleaving is performed again; finally, the digital narrowband modulation module sent to the baseband processor is modulated to generate the transmitted baseband signal.
- the RF circuit is modulated into a transmitting intermediate frequency and sent to the other party.
- the mechanical sound wave signal of the sound is first converted into an analog audio signal by a microphone, amplified by an analog audio amplifying circuit, and digital audio signal is obtained after A/D conversion, according to a preset audio format. Coding and storage.
- the audio signal collected by the microphone in the embodiment of the present application may be the above-mentioned analog audio signal converted from the mechanical sound wave signal, or may be an amplified analog audio signal, or may be an A/D converted digital audio signal.
- the embodiment of the present application is not limited.
- the audio system architecture provided in this embodiment includes a user space, a kernel space, and a hardware system.
- the user space includes an application layer, an application framework layer, and a hardware abstraction layer (HAL), and the kernel space includes a driver layer.
- the application layer is the top layer of the audio system. You can write an application to perform corresponding logical operations, such as detecting an application that triggers a recording event, presetting standard audio conditions, and issuing audio playback commands.
- the application framework layer includes an audio control interface and a standardized plug-in module to provide an audio playback form control interface, as well as a speaker volume control interface.
- the application framework layer provides two classes, AudioTrack and AudioRecorder, as well as the AudioManager, AudioService, and AudioSystem classes.
- a system runtime layer (Libraries) is also included between the application framework layer and the hardware abstraction layer.
- libraries Many classes in the framework layer are actually just “mediations” for applications that use Android library files.
- upper-level applications are generally written in Java, they require the most direct support of the Java interface, which is one of the meanings of the framework layer.
- intermediary they don't really implement specific functions, or only implement some of them, but focus on the library. For example, the above AudioTrack, AudioRecorder, MediaPlayer and MediaRecorder can find the corresponding class in the library. This part of the code is placed in the frameworks/av/media/libmedia of the project, mostly written in C++.
- the hardware abstraction layer of audio is mainly divided into two parts, namely AudioFlinger and AudioPolicyService. In fact, the latter is not a real device, just a virtual device to allow manufacturers to easily customize their own strategies. Depending on the product, audio devices vary widely. In the audio architecture of Android Android, these problems are solved by the audio.primary of the HAL layer, etc., without the need to modify the upper layer implementation on a large scale.
- the hardware abstraction layer is the transition from the application framework layer to the driver layer to achieve compatibility with the underlying hardware.
- the driver layer controls the audio codec according to the characteristics of the audio codec to ensure that the audio codec can work normally, and the audio data obtained by the audio codec is provided to the system layer. In the embodiment of the present application, when the audio signal is collected, the main class involved is the above-mentioned AudioRecorder class.
- Step 120 Analyze the acoustic sound level corresponding to the audio signal.
- the three main properties of the sound are volume, pitch, and tone.
- the volume is also called loudness or sound intensity, which refers to the subjective feeling of the human ear on the strength of the sound heard.
- the objective evaluation scale is the amplitude of the sound. Amplitude refers to the maximum distance from the original position during the vibration of the object.
- the loudness of the sound heard by the human ear is related to the amplitude of the sound source. Generally, the louder the louder the stronger the amplitude.
- the sound source as the user's mouth as an example
- the user can hear the sound of his own voice during the call, but cannot know whether the volume of the current speech is loud or small for the person at the opposite end of the call, if the sound is louder than Small, the other party may not be able to hear, if the sound is loud, it may make the other party feel deafening; the user can also hear the sound of his own voice when recording, but can not know the size of his own voice recorded by the terminal, if the sound is small It may not be audible when playing a recording. If the sound is loud, the listener may feel uncomfortable while playing the recording.
- the sound level corresponding to the audio signal is analyzed to know whether the sound level corresponding to the audio signal collected by the terminal is appropriate.
- the preset time period may be an analysis unit, and the sound sound level in each preset time period may be analyzed, and the sound sound level corresponding to the current preset time period may be recorded as the current sound sound level.
- the preset time period may be a preset time length that is timed forward from the current time. In order to ensure real-time performance, the preset time length corresponding to the preset time period may be set shorter, for example, within 0.5 seconds of the current time as the starting point.
- Step 130 prompt the user to adjust the call state or the recording state according to the analysis result.
- factors related to the call state or the recording state can generally be determined by the user.
- factors related to the state of the call or the state of the recording may include the size of the voice of the user's own voice or the intensity of the voice of other sound sources that the user wishes to record, and may also include a sound source (such as a user's mouth or other sound source) and a microphone.
- the relative distance may also include the orientation of the microphone and the like.
- the target sound level information of the audio signal that satisfies the requirement may be determined by the system by default or according to actual conditions, and the target sound level of the audio signal that satisfies the requirement may be determined by the user in advance according to the demand.
- the current sound level is compared with the target sound level, and then the user is prompted to make corresponding adjustments. For example, during the recording process, it is judged that the current sound level is smaller than the target sound level, and the user may be prompted to increase the intensity of the voice, or to bring the microphone to the mouth and the like.
- the user when the user is prompted according to the analysis result, the user may be reminded in the form of a text, a voice, or an indicator.
- the description text of the adjustment mode may be displayed on the display screen; the prompt voice of the adjustment mode may be played, and the prompt voice may be filtered out in the audio signal collected by the microphone; different adjustment modes may be represented by different colors of the indicator light, Or different flashing frequencies with different indicators to indicate different adjustment methods, and so on.
- the prompting method provided by the embodiment of the present invention analyzes the sound and sound degree corresponding to the audio signal collected by the microphone during the call or recording process, and prompts the user to adjust the call state or the recording state according to the analysis result.
- the user can promptly and accurately prompt the user to adjust the call state or the recording state, and can maintain a good call effect or recording effect.
- the prompting the user to adjust the call state or the recording state according to the analysis result comprises: prompting the user to adjust the relative distance between the microphone and the sound source and/or the sounding state of the sound source according to the analysis result.
- the analyzing the acoustic sound level corresponding to the audio signal may include: analyzing amplitude information of the audio signal to obtain an acoustic sound analysis result.
- the objective evaluation scale of the acoustic sound is the amplitude of the sound, and the audio signal is converted from the mechanical acoustic signal of the sound, so the amplitude information can be used to analyze the acoustic sound of the audio signal. The larger the amplitude, the louder the sound, that is, the higher the sound energy value.
- the preset time period may be an analysis unit, and the amplitude of the audio signal is collected at a preset sampling frequency within a preset time period to obtain a plurality of amplitude values, and the average value of the amplitude in the preset time period is used as the current sound level. Optimization here converts the analysis of acoustic sound into analysis of amplitude information, which simplifies the analysis process and speeds up the analysis.
- analyzing the amplitude information of the audio signal to obtain an acoustic sound analysis result further comprising: extracting amplitude information of the corresponding human voice in the audio signal; and performing amplitude information of the corresponding human voice Analyze to obtain the results of the acoustic sound analysis.
- the vocal is the subject in the audio signal, and other ambient sounds can be regarded as interference sounds; in addition, during recording, the user can select the vocal recording mode, in which case the vocal is also an audio signal. In the main body, other environmental sounds can be regarded as interference sounds.
- the sound source that is, the user's mouth
- the sound source after extracting the amplitude information corresponding to the human voice in the audio signal
- prompting the user to adjust the relative distance between the microphone and the sound source according to the analysis result may include: prompting the user when the current sound intensity analysis result includes the current sound sound level being less than the first preset loudness threshold.
- the relative distance between the microphone and the sound source is reduced; the method further includes: prompting the user to compare the relative distance between the microphone and the sound source when the current sound intensity analysis result includes the current sound sound level being greater than the second preset loudness threshold turn up.
- the first preset loudness threshold and the second preset loudness threshold may be the same or different, and the value may be a preset fixed value or a dynamically adjusted change value according to actual conditions.
- the current acoustic sound when the current acoustic sound is less than the first preset loudness threshold, it may be stated that the current sound level in the audio signal is small, the opposite end of the call may not be able to hear the words spoken by the local user, or the recorded file may be caused during subsequent playback. Others cannot hear the sound of the recording at this time. Therefore, the user can be prompted to reduce the relative distance between the microphone and the sound source to improve the sound level in the audio signal collected next.
- the prompt when the current sound level is greater than or equal to the first preset loudness threshold, the prompt may not be performed. The advantage of this setting is that it can be applied to an application scenario where the upper limit of the acoustic sound is not required, and the prompting efficiency is ensured.
- the current sound level in the audio signal may be large, and the call to the opposite end may cause the other party to feel deafening, or the recorded file may cause subsequent playback.
- Excessive sound affects listening, so the user can be prompted to increase the relative distance between the microphone and the sound source to reduce the sound level in the audio signal collected next.
- no prompt may be made. The advantage of this setting is that it can be applied to an application scenario where the lower limit of the acoustic sound is not required, and the prompting efficiency is ensured.
- the above two prompt modes may be combined, that is, when the current sound level is small and when the sound is large, the reminder is performed, and the current sound level is greater than or equal to the first preset loudness threshold and less than or equal to the second preset.
- the loudness threshold it indicates that the sound level in the audio signal is appropriate, and no prompt is given.
- prompting the user to adjust the sounding condition of the sound source according to the analysis result may include: prompting the user to sound source when the sound sound intensity analysis result includes the current sound sound level being less than the third preset loudness threshold.
- the vocal intensity is increased; the method further includes: prompting the user to reduce the vocal intensity of the sound source when the current sound intensity analysis result includes the current sound loudness being greater than the fourth preset loudness threshold.
- the third preset loudness threshold and the fourth preset loudness threshold may be the same or different.
- the third preset loudness threshold and the first preset loudness threshold may be the same or different.
- the fourth preset loudness threshold and the second preset loudness threshold may be the same or different.
- the value of the third preset loudness threshold and the fourth preset loudness threshold may be a preset fixed value, or may be a dynamically adjusted change value according to actual conditions.
- the current sound level is less than the third preset loudness threshold
- the opposite end of the call may not be able to hear the words spoken by the local user, or the recorded file may be caused during subsequent playback. Others cannot hear the sound of the recording at this time. Therefore, the user can be prompted to reduce the relative distance between the microphone and the sound source to improve the sound level in the audio signal collected next.
- the current sound level is greater than or equal to the third preset loudness threshold, no prompt may be made.
- the advantage of this setting is that it can be applied to an application scenario where the upper limit of the acoustic sound is not required, and the prompting efficiency is ensured.
- the current sound level when the current sound level is greater than the fourth preset loudness threshold, it may indicate that the current sound level in the audio signal is large, and the call to the opposite end may cause the other party to feel deafening, or the recording file may cause subsequent playback. Excessive sound affects listening, so the user can be prompted to increase the relative distance between the microphone and the sound source to reduce the sound level in the audio signal collected next.
- the current acoustic level is less than or equal to the fourth preset loudness threshold, no prompt may be made.
- the advantage of this setting is that it can be applied to an application scenario where the lower limit of the acoustic sound is not required, and the prompting efficiency is ensured.
- the above two adjustment modes may be combined, that is, when the current sound level is small and when the sound is large, the reminder is performed, and the current sound level is greater than or equal to the third preset loudness threshold and less than or equal to the fourth preset.
- the loudness threshold When the loudness threshold is used, it indicates that the sound level in the audio signal is appropriate, and no prompt is given.
- the advantage of this setting is that it can be applied to the application scenarios that require the upper and lower limits of the acoustic sound level to ensure the accuracy of the prompts.
- the user may also be prompted to adjust the relative distance between the microphone and the sound source and the sounding of the sound source.
- the current acoustic sound analysis result includes the current acoustic soundness being less than the fifth preset loudness threshold, prompting the user to reduce the relative distance between the microphone and the sound source and increase the sound intensity of the sound source; and/or And when the current acoustic sound analysis result includes the current sound loudness greater than the sixth preset loudness threshold, prompting the user to increase the relative distance between the microphone and the sound source and to reduce the sound intensity of the sound source.
- the user may be more specifically prompted based on the difference (eg, the difference) between the current acoustic level and the corresponding preset loudness threshold.
- different difference values correspond to different adjustment values, and the correspondence relationship may be determined in advance according to a simulation or the like.
- the difference between the current acoustic sound level and the first preset loudness threshold is a
- the corresponding relative distance adjustment value is 5 cm, prompting the user to reduce the relative distance between the microphone and the sound source by 5 cm.
- the advantage of this setting is that the prompts to the user are more clear, and the user can quickly adjust the call state or the recording state to the ideal state.
- the prompting the user to adjust the relative distance between the microphone and the sound source according to the analysis result may further include: acquiring an attribute of the call peer contact Information mode and/or profile mode information of the call end; determining a corresponding first preset loudness threshold and/or second preset loudness threshold according to the attribute information and/or the scene mode information.
- the advantage of this setting is that the first preset loudness threshold and/or the second preset loudness threshold can be determined more accurately according to actual conditions, thereby improving the call effect.
- the attribute information may include age (or age group), or include whether it is an elderly person or a child or the like.
- obtaining the attribute information of the call peer contact may include: obtaining the note information of the call peer contact in the address book, extracting the attribute information from the note information, and may also include contacting the call end.
- the human voice performs speech recognition, and the corresponding attribute information is determined according to the recognition result.
- the other party is an elderly person, the hearing may be poor, and the required sound level should be larger, so the first preset loudness threshold and/or the second preset loudness threshold may be set higher.
- the scene mode information of the call end may include mode information such as a silent mode, a conference mode, a normal mode, and an outdoor mode.
- mode information such as a silent mode, a conference mode, a normal mode, and an outdoor mode.
- Users usually set the corresponding scene according to their environment. For example, in a relatively quiet environment such as class or meeting, you may choose silent mode or conference mode. In a noisy environment outside, you may choose outdoor mode.
- the corresponding relationship between the different preset modes and the first preset loudness threshold and/or the second preset loudness threshold may be preset, and in the actual call, the acquired context mode information may be determined.
- the acquiring the scenario mode information of the call peer may include: sending a scenario mode information acquisition request to the call peer end, and receiving the scenario mode information that the call peer end requests the feedback according to the scenario mode information acquisition request.
- the prompting the user to adjust the utterance of the sound source according to the analysis result further comprising: obtaining attribute information and/or a call of the call peer contact The context mode information of the peer end; determining a corresponding third preset loudness threshold and/or fourth preset loudness threshold according to the attribute information and/or the scene mode information.
- the triggered event is a preset call event
- the prompting the user to adjust the utterance of the sound source according to the analysis result further comprising: obtaining attribute information and/or a call of the call peer contact The context mode information of the peer end; determining a corresponding third preset loudness threshold and/or fourth preset loudness threshold according to the attribute information and/or the scene mode information.
- FIG. 4 is a schematic flowchart of another prompting method according to an embodiment of the present application, where the method is applicable to a recording scenario, including:
- Step 401 When it is detected that the preset recording event is triggered, the control microphone collects the audio signal.
- Step 402 Perform real-time analysis on amplitude information of the collected audio signal to obtain an acoustic sound analysis result.
- Step 403 Determine whether the current sound level is less than the preset loudness threshold A. If the current sound level is less than the preset loudness threshold A, perform step 404; if the current sound level is greater than or equal to the preset loudness threshold A, perform step 405.
- the terminal analyzes the amplitude information of the audio signal collected by the microphone in real time, and the current acoustic sound level also changes continuously as the analysis progresses.
- Step 404 prompting the user to reduce the relative distance between the microphone and the sound source or prompting the user to increase the sound intensity of the sound source.
- step 405 it is determined whether the current sound level is greater than the preset loudness threshold B. If the current sound level is greater than the preset loudness threshold B, step 406 is performed; if the current sound level is less than or equal to the preset loudness threshold B, step 407 is performed.
- Step 406 prompting the user to increase the relative distance between the microphone and the sound source or prompting the user to reduce the sound intensity of the sound source.
- Step 407 Determine whether a recording pause or recording stop command is received, and if the recording pause or recording stop command is received, the process ends; if the recording pause or recording stop command is not received, the process returns to step 403.
- the prompting method provided by the embodiment of the present application can accurately prompt the user to adjust the relative distance between the microphone and the sound source or the sounding intensity of the sound source during the recording process, and can maintain a good recording effect.
- FIG. 5 is a schematic flowchart of another prompting method according to an embodiment of the present disclosure. The method is applicable to a call scenario, and includes:
- Step 501 When it is detected that the preset call event is triggered, the control microphone collects the audio signal.
- Step 502 Analyze amplitude information of the collected audio signal in real time to obtain an acoustic sound analysis result.
- step 503 it is determined whether the current sound level is less than the preset loudness threshold C. If the current sound level is less than the preset loudness threshold C, step 504 is performed; if the current sound level is greater than or equal to the preset loudness threshold C, step 505 is performed.
- the terminal analyzes the amplitude information of the audio signal collected by the microphone in real time, and the current acoustic sound level also changes continuously as the analysis progresses.
- Step 504 prompting the user to reduce the relative distance between the microphone and the sound source or prompting the user to increase the sound intensity of the sound source.
- the sound source during the call is the user's mouth
- the adjustment of the sound intensity of the sound source means that the user adjusts the size of his own voice.
- the user when the call is a call made through a mobile phone, the user can be prompted in a voice form, which can be more easily perceived by the user.
- the voice prompt is played through the earpiece to prompt the user to reduce the relative distance between the microphone and the sound source or to prompt the user to increase the sound intensity of the sound source. For example, play "Please be closer to the microphone" or "Please increase the volume of the voice" through the handset.
- Step 505 Determine whether the current sound level is greater than the preset loudness threshold D. If the current sound level is greater than the preset loudness threshold D, perform step 506; if the current sound level is less than or equal to the preset loudness threshold D, perform step 507.
- Step 506 prompting the user to increase the relative distance between the microphone and the sound source or prompting the user to reduce the sound intensity of the sound source.
- Step 507 Determine whether a call end command is received. If the call end command is received, the process ends. If the call end command is not received, the process returns to step 503.
- the prompting method provided by the embodiment of the present application can accurately prompt the user to adjust the relative distance of the microphone and the mouth or the size of the speaking voice during the call, and can maintain a good call effect.
- FIG. 6 is a schematic flowchart of another prompting method according to an embodiment of the present disclosure. The method is applicable to a call scenario, and includes:
- Step 601 When it is detected that the preset call event is triggered, the control microphone collects the audio signal.
- Step 602 Extract amplitude information of the corresponding human voice in the audio signal in real time, and analyze amplitude information of the corresponding human voice to obtain an acoustic sound analysis result.
- Step 603 Acquire context mode information of the opposite end of the call, and determine a corresponding preset loudness threshold E and a preset loudness threshold F according to the scene mode information.
- the attribute information of the opposite end of the call may also be determined. For example, when the scene mode is the outdoor mode, the attribute information is young, the preset loudness threshold is determined as E 1 ; when the scene mode is the outdoor mode, and the attribute information is the elderly, the preset loudness threshold is determined as E 2 ; 2 is greater than E 1 .
- step 604 it is determined whether the current sound level is less than the preset loudness threshold E. If the current sound level is less than the preset loudness threshold E, step 605 is performed; if the current sound level is greater than or equal to the preset loudness threshold E, step 606 is performed.
- the current sound level corresponds to the loudness of the user's speech sound.
- Step 605 prompting the user to reduce the relative distance between the microphone and the sound source or prompting the user to increase the sound intensity of the sound source.
- Step 606 Determine whether the current sound level is greater than the preset loudness threshold F. If the current sound level is greater than the preset loudness threshold F, perform step 607; if the current sound level is less than or equal to the preset loudness threshold F, perform step 608.
- Step 607 prompt the user to increase the relative distance between the microphone and the sound source or prompt the user to reduce the sound intensity of the sound source.
- Step 608 Determine whether a call end command is received. If the call end command is received, the process ends; if the call end command is not received, the process returns to step 604.
- the prompting method provided by the embodiment of the present invention can promptly prompt the user to adjust the relative distance between the microphone and the mouth or the size of the speaking voice according to the scene mode information of the calling party during the call, and can improve the call effect more specifically.
- FIG. 7 is a structural block diagram of a prompting apparatus according to an embodiment of the present disclosure.
- the apparatus may be implemented by software and/or hardware, and is generally integrated in a terminal, and may prompt the user by executing a prompting method.
- the device includes:
- the audio signal collection module 701 is configured to control the microphone to collect an audio signal when a preset call event is detected or a preset recording event is triggered;
- the acoustic sound analysis module 702 is configured to analyze the sound intensity corresponding to the audio signal
- the prompting module 703 is configured to prompt the user to adjust the call state or the recording state according to the analysis result.
- the prompting device provided by the embodiment of the present invention can promptly and promptly prompt the user to adjust the call state or the recording state during the call or recording process, and can maintain a good call effect or a recording effect.
- the prompting module 703 is configured to:
- the user is prompted to adjust the relative distance of the microphone and the sound source and/or the sounding of the sound source according to the analysis result.
- the acoustic sound analysis module 702 is configured to:
- the amplitude information of the audio signal is analyzed to obtain an acoustic sound analysis result.
- the acoustic sound analysis module 702 is configured to:
- the amplitude information of the corresponding human voice is analyzed to obtain an acoustic sound analysis result.
- the prompting module 703 is configured to include:
- the sound intensity analysis result includes that the current sound level is less than the first preset loudness threshold, prompting the user to reduce the relative distance between the microphone and the sound source; and/or,
- the user is prompted to increase the relative distance between the microphone and the sound source.
- the prompting module 703 further includes:
- the first obtaining submodule is configured to obtain attribute information of the call peer contact and/or context mode information of the call peer end;
- the first determining submodule is configured to determine a corresponding first preset loudness threshold and/or second preset loudness threshold according to the attribute information and/or the scene mode information.
- the prompting module 703 is configured to:
- the sound intensity analysis result includes that the current sound level is less than the third preset loudness threshold, prompting the user to increase the sound intensity of the sound source; and/or,
- the user is prompted to reduce the sound intensity of the sound source.
- the prompting module 703 further includes:
- the second obtaining submodule is configured to obtain attribute information of the call peer contact and/or context mode information of the call peer end;
- the second determining submodule is configured to determine a corresponding third preset loudness threshold and/or fourth preset loudness threshold according to the attribute information and/or the scene mode information.
- the embodiment of the present application further provides a storage medium including computer executable instructions for executing a prompting method when executed by a computer processor, the method comprising:
- the user is prompted to adjust the call state or the recording state.
- Storage media any different type of storage device or storage device.
- the term "storage medium” is intended to include: a mounting medium such as a CD-ROM, a floppy disk or a tape device; a computer system memory or a random access memory (RAM) such as a dynamic random access memory (dynamic random access memory). DRAM), display data random access memory (DDR RAM), static random access memory (SRAM), extended data output random access memory (EDO RAM) , Rambus RAM, etc.; non-volatile memory, such as flash memory, magnetic media (such as hard disk or optical storage); registers or other similar types of memory components, and the like.
- the storage medium may also include other types of memory or a combination thereof.
- the storage medium may be located in a first computer system in which the program is executed, or may be located in a different second computer system, the second computer system being coupled to the first computer system via a network, such as the Internet.
- the second computer system can provide program instructions to the first computer for execution.
- the term "storage medium" can include at least two storage mediums that can reside in different locations (eg, in different computer systems connected through a network).
- the storage medium may store program instructions (eg, implemented as a computer program) executable by the at least one processor.
- the storage medium containing the computer executable instructions provided by the embodiment of the present application is not limited to the prompt operation as described above, and may also be related to the prompting method provided by any embodiment of the present application. operating.
- FIG. 8 is a schematic structural diagram of a terminal according to an embodiment of the present disclosure.
- the terminal may be, for example, a mobile terminal.
- the terminal may include a casing (not shown), a memory 801, and a central processing unit (CPU) 802 (also referred to as a processor, below).
- CPU central processing unit
- the terminal may include a casing (not shown), a memory 801, and a central processing unit (CPU) 802 (also referred to as a processor, below).
- CPU central processing unit
- circuit board not shown
- a power supply circuit not shown
- microphone 813 Referred to as a microphone 813.
- the circuit board is disposed inside a space enclosed by the casing; the CPU 802 and the memory 801 are disposed on the circuit board; and the power circuit is configured to supply power to a plurality of circuits or devices of the terminal
- the memory 801 is configured to store executable program code; the CPU 802 runs a computer program corresponding to the executable program code by reading executable program code stored in the memory 801 to implement the following steps:
- the user is prompted to adjust the call state or the recording state.
- the terminal further includes: a peripheral interface 803, a radio frequency (RF) circuit 805, an audio circuit 806, a speaker 811, a power management chip 808, other input/output (I/O) subsystem input/control devices, and a touch screen. 812, other input/control devices 810 and external port 804, these components communicate via at least one communication bus or signal line 807.
- RF radio frequency
- I/O input/output
- the illustrated terminal 800 is merely one example of a terminal, and that the terminal 800 may have more or fewer components than those shown in the figures, may combine at least two components, or may have different Component configuration.
- the various components shown in the figures can be implemented in hardware, software, or a combination of hardware and software, including at least one signal processing and/or application specific integrated circuit.
- the terminal uses a mobile phone as an example.
- the memory 801 can be accessed by the CPU 802, the peripheral interface 803, etc., and the memory 801 can include a high speed random access memory, and can also include a non-volatile memory, such as one or more magnetic disk storage devices, flash memory devices. Or other volatile solid-state storage devices.
- a non-volatile memory such as one or more magnetic disk storage devices, flash memory devices. Or other volatile solid-state storage devices.
- Peripheral interface 803, which can connect the input and output peripherals of the device to CPU 802 and memory 801.
- the I/O subsystem 809 can connect input and output peripherals on the device, such as touch screen 812 and other input/control devices 810, to peripheral interface 803.
- the I/O subsystem 809 can include a display controller 8091 and at least one input controller 8092 configured to control other input/control devices 810.
- at least one input controller 8092 receives electrical signals from other input/control devices 810 or transmits electrical signals to other input/control devices 810, and other input/control devices 810 may include physical buttons (press buttons, rocker buttons, etc.), Dial, slide switch, joystick, click wheel.
- the input controller 8092 can be connected to any of the following: a keyboard, an infrared port, a USB interface, and a pointing device such as a mouse.
- the touch screen 812 is an input interface and an output interface between the user terminal and the user, and displays the visual output to the user.
- the visual output may include graphics, text, icons, videos, and the like.
- Display controller 8091 in I/O subsystem 809 receives an electrical signal from touch screen 812 or an electrical signal to touch screen 812.
- the touch screen 812 detects the contact on the touch screen, and the display controller 8091 converts the detected contact into an interaction with the user interface object displayed on the touch screen 812, that is, realizes human-computer interaction, and the user interface object displayed on the touch screen 812 may be running.
- the icon of the game, the icon of the network to the corresponding network, and the like.
- the device may also include a light mouse, which is a touch sensitive surface that does not display a visual output, or an extension of a touch sensitive surface formed by the touch screen.
- the RF circuit 805 is configured to establish communication between the mobile phone and the wireless network (ie, the network side) to implement data reception and transmission between the mobile phone and the wireless network. For example, sending and receiving short messages, emails, and the like.
- the RF circuit 805 receives and transmits an RF signal, also referred to as an electromagnetic signal, and the RF circuit 805 converts the electrical signal into an electromagnetic signal or converts the electromagnetic signal into an electrical signal and communicates with the communication network and other devices through the electromagnetic signal.
- the RF circuit 805 can include known circuitry configured to perform these functions including, but not limited to, an antenna system, an RF transceiver, at least one amplifier, a tuner, at least one oscillator, a digital signal processor, a codec (COder- DECoder, CODEC) Chipset, Subscriber Identity Module (SIM), etc.
- an antenna system an RF transceiver, at least one amplifier, a tuner, at least one oscillator, a digital signal processor, a codec (COder- DECoder, CODEC) Chipset, Subscriber Identity Module (SIM), etc.
- CODEC codec
- SIM Subscriber Identity Module
- the audio circuit 806 is arranged to receive audio data from the peripheral interface 803, convert the audio data into an electrical signal, and transmit the electrical signal to the speaker 811.
- the speaker 811 is arranged to restore the voice signal received by the mobile phone from the wireless network through the RF circuit 805 to sound and play the sound to the user.
- the power management chip 808 is configured to provide power and power management for the hardware connected to the CPU 802, the I/O subsystem, and the peripheral interface.
- the terminal provided by the embodiment of the present invention can promptly and promptly prompt the user to adjust the call state or the recording state during the call or recording process, and can maintain a good call effect or a recording effect.
- the embodiment of the present application further provides a computer program product, the computer program product comprising a computer program stored on a non-transitory computer readable storage medium, the computer program comprising program instructions, when the program instructions are executed by a computer
- the computer is caused to perform any of the above-described prompting methods.
- the prompting device, the storage medium and the terminal provided in the above embodiments can perform the prompting method provided by any embodiment of the present application, and have the corresponding functional modules and beneficial effects of executing the method.
- the prompting method provided by any embodiment of the present application can perform the prompting method provided by any embodiment of the present application, and have the corresponding functional modules and beneficial effects of executing the method.
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Computer Networks & Wireless Communication (AREA)
- Telephone Function (AREA)
Abstract
L'invention concerne un procédé et un appareil d'invite, un support d'informations et un terminal. Le procédé consiste à : lorsqu'il est détecté qu'un événement d'appel prédéfini ou un événement d'enregistrement vocal prédéfini est déclenché, commander un microphone pour acquérir un signal audio ; analyser l'intensité sonore correspondant au signal audio ; et inviter, en fonction du résultat d'analyse, l'utilisateur à ajuster l'état d'appel ou l'état d'enregistrement vocal.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710711506.7A CN107580113B (zh) | 2017-08-18 | 2017-08-18 | 提示方法、装置、存储介质及终端 |
CN201710711506.7 | 2017-08-18 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2019033987A1 true WO2019033987A1 (fr) | 2019-02-21 |
Family
ID=61033820
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2018/099662 WO2019033987A1 (fr) | 2017-08-18 | 2018-08-09 | Procédé et appareil d'invite, support d'informations et terminal |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN107580113B (fr) |
WO (1) | WO2019033987A1 (fr) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111479193A (zh) * | 2020-03-18 | 2020-07-31 | 深圳传音制造有限公司 | 一种振动发声装置及方法 |
CN112000411A (zh) * | 2020-08-18 | 2020-11-27 | 青岛海信移动通信技术股份有限公司 | 移动终端及其录音通道占用信息的显示方法 |
Families Citing this family (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107580113B (zh) * | 2017-08-18 | 2019-09-24 | Oppo广东移动通信有限公司 | 提示方法、装置、存储介质及终端 |
CN108391206A (zh) * | 2018-03-30 | 2018-08-10 | 广东欧珀移动通信有限公司 | 信号处理方法、装置、终端、耳机及可读存储介质 |
CN108668017A (zh) * | 2018-04-19 | 2018-10-16 | Oppo广东移动通信有限公司 | 音量提示方法、模式切换方法、装置、终端及存储介质 |
CN109005281A (zh) * | 2018-06-27 | 2018-12-14 | 努比亚技术有限公司 | 一种通话音量调节方法、移动终端及计算机可读存储介质 |
CN108882094B (zh) * | 2018-07-27 | 2020-03-13 | 歌尔科技有限公司 | 一种反馈降噪耳机及其反馈电路 |
CN110365852B (zh) * | 2019-08-13 | 2021-08-27 | 上饶市中科院云计算中心大数据研究院 | 一种语音引导录音的方法和装置 |
CN112533081B (zh) * | 2019-09-19 | 2023-04-18 | 成都鼎桥通信技术有限公司 | 录音处理方法、设备及存储介质 |
CN110931047A (zh) * | 2019-12-06 | 2020-03-27 | 广州国音智能科技有限公司 | 语音数据采集方法、装置、采集终端及可读存储介质 |
CN111210839A (zh) * | 2019-12-31 | 2020-05-29 | 秒针信息技术有限公司 | 对录音设备的检测方法及装置 |
CN111641794B (zh) * | 2020-05-25 | 2023-03-28 | 维沃移动通信有限公司 | 声音信号采集方法和电子设备 |
CN111863001A (zh) * | 2020-06-17 | 2020-10-30 | 广州华燎电气科技有限公司 | 一种多方通话系统中抑制背景噪声的方法 |
CN113014844A (zh) * | 2021-02-08 | 2021-06-22 | Oppo广东移动通信有限公司 | 一种音频处理方法、装置、存储介质及电子设备 |
CN115208976B (zh) * | 2021-04-13 | 2024-10-18 | 深圳市万普拉斯科技有限公司 | 通话通道切换的处理方法、装置、通话设备和存储介质 |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1815555A (zh) * | 2005-02-04 | 2006-08-09 | 光宝科技股份有限公司 | 电子收音装置及其音量提醒方法 |
CN101197871A (zh) * | 2006-12-07 | 2008-06-11 | 乐金电子(中国)研究开发中心有限公司 | 自动语音测量和提示装置 |
CN101419795A (zh) * | 2008-12-03 | 2009-04-29 | 李伟 | 音频信号检测方法及装置、以及辅助口语考试系统 |
CN103685673A (zh) * | 2012-09-20 | 2014-03-26 | 索尼公司 | 信号处理设备和存储介质 |
CN104954579A (zh) * | 2015-06-30 | 2015-09-30 | 深圳市鸿宇顺科技有限公司 | 一种带通话分贝提醒功能的通讯设备 |
CN105120063A (zh) * | 2015-07-13 | 2015-12-02 | 联想(北京)有限公司 | 一种输入语音的音量提示方法及电子设备 |
CN107580113A (zh) * | 2017-08-18 | 2018-01-12 | 广东欧珀移动通信有限公司 | 提示方法、装置、存储介质及终端 |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TW201234828A (en) * | 2011-02-15 | 2012-08-16 | Wistron Corp | Mobile communication device and mehtod for adjusting volume thereof |
JP6071054B2 (ja) * | 2013-03-12 | 2017-02-01 | 株式会社リコー | 通信装置、及び音声入出力ユニットの制御方法 |
CN104424073A (zh) * | 2013-08-21 | 2015-03-18 | 联想(北京)有限公司 | 一种信息处理的方法及电子设备 |
KR102287943B1 (ko) * | 2014-10-14 | 2021-08-09 | 삼성전자주식회사 | 전자 기기, 상기 전자 기기의 음량 조절 방법 및 상기 전자 기기의 제어 방법 |
CN105405441B (zh) * | 2015-10-20 | 2019-06-18 | 北京云知声信息技术有限公司 | 一种语音信息的反馈方法及装置 |
CN107040661A (zh) * | 2017-04-18 | 2017-08-11 | 广东欧珀移动通信有限公司 | 移动终端播放音量的控制方法、装置及移动终端 |
-
2017
- 2017-08-18 CN CN201710711506.7A patent/CN107580113B/zh active Active
-
2018
- 2018-08-09 WO PCT/CN2018/099662 patent/WO2019033987A1/fr active Application Filing
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1815555A (zh) * | 2005-02-04 | 2006-08-09 | 光宝科技股份有限公司 | 电子收音装置及其音量提醒方法 |
CN101197871A (zh) * | 2006-12-07 | 2008-06-11 | 乐金电子(中国)研究开发中心有限公司 | 自动语音测量和提示装置 |
CN101419795A (zh) * | 2008-12-03 | 2009-04-29 | 李伟 | 音频信号检测方法及装置、以及辅助口语考试系统 |
CN103685673A (zh) * | 2012-09-20 | 2014-03-26 | 索尼公司 | 信号处理设备和存储介质 |
CN104954579A (zh) * | 2015-06-30 | 2015-09-30 | 深圳市鸿宇顺科技有限公司 | 一种带通话分贝提醒功能的通讯设备 |
CN105120063A (zh) * | 2015-07-13 | 2015-12-02 | 联想(北京)有限公司 | 一种输入语音的音量提示方法及电子设备 |
CN107580113A (zh) * | 2017-08-18 | 2018-01-12 | 广东欧珀移动通信有限公司 | 提示方法、装置、存储介质及终端 |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111479193A (zh) * | 2020-03-18 | 2020-07-31 | 深圳传音制造有限公司 | 一种振动发声装置及方法 |
CN112000411A (zh) * | 2020-08-18 | 2020-11-27 | 青岛海信移动通信技术股份有限公司 | 移动终端及其录音通道占用信息的显示方法 |
CN112000411B (zh) * | 2020-08-18 | 2023-07-07 | 青岛海信移动通信技术有限公司 | 移动终端及其录音通道占用信息的显示方法 |
Also Published As
Publication number | Publication date |
---|---|
CN107580113A (zh) | 2018-01-12 |
CN107580113B (zh) | 2019-09-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11251763B2 (en) | Audio signal adjustment method, storage medium, and terminal | |
WO2019033987A1 (fr) | Procédé et appareil d'invite, support d'informations et terminal | |
CN107509153B (zh) | 声音播放器件的检测方法、装置、存储介质及终端 | |
JP2019117623A (ja) | 音声対話方法、装置、デバイス及び記憶媒体 | |
US20170126192A1 (en) | Method, device, and computer-readable medium for adjusting volume | |
WO2018059030A1 (fr) | Procédé de réglage de volume et terminal | |
US20180152163A1 (en) | Noise control method and device | |
US11201598B2 (en) | Volume adjusting method and mobile terminal | |
CN107371102B (zh) | 音频播放音量的控制方法、装置及存储介质和移动终端 | |
WO2019174492A1 (fr) | Procédé de détection de données d'appel vocal, dispositif, support de stockage et terminal mobile | |
US12069451B2 (en) | User hearing protection method, apparatus, and electronic device | |
CN107483731A (zh) | 音频播放音量的控制方法、装置及存储介质和移动终端 | |
CN109067965B (zh) | 翻译方法、翻译装置、可穿戴装置及存储介质 | |
WO2019033921A1 (fr) | Procédé et appareil de réglage de volume, dispositif terminal et support de stockage | |
EP4203447A1 (fr) | Procédé de traitement de son et appareil associé | |
WO2019033940A1 (fr) | Procédé et appareil de réglage de volume, dispositif terminal et support d'informations | |
CN107707721B (zh) | 移动终端的录音方法、装置、存储介质及移动终端 | |
KR102110515B1 (ko) | 음향 광고 또는 음향 데이터를 재생하는 보청기 | |
WO2019061292A1 (fr) | Procédé de réduction du bruit pour borne et borne | |
WO2019033442A1 (fr) | Procédé et appareil de commande de volume de terminal mobile, support d'informations et terminal mobile | |
WO2018058331A1 (fr) | Procédé et appareil de commande de volume | |
CN116132869A (zh) | 耳机音量的调节方法、耳机及存储介质 | |
WO2023245715A1 (fr) | Procédé et appareil de réglage de volume d'appel, dispositif d'appel et support | |
CN106293607B (zh) | 自动切换音频输出模式的方法及系统 | |
CN111556406B (zh) | 音频处理方法、音频处理装置及耳机 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 18846349 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 18846349 Country of ref document: EP Kind code of ref document: A1 |