CN112769979B - Voice call method and device based on terminal, computer equipment and storage medium - Google Patents

Voice call method and device based on terminal, computer equipment and storage medium Download PDF

Info

Publication number
CN112769979B
CN112769979B CN201911064300.5A CN201911064300A CN112769979B CN 112769979 B CN112769979 B CN 112769979B CN 201911064300 A CN201911064300 A CN 201911064300A CN 112769979 B CN112769979 B CN 112769979B
Authority
CN
China
Prior art keywords
signal
screen
voice
terminal
target
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201911064300.5A
Other languages
Chinese (zh)
Other versions
CN112769979A (en
Inventor
彭功良
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Oneplus Technology Shenzhen Co Ltd
Original Assignee
Oneplus Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Oneplus Technology Shenzhen Co Ltd filed Critical Oneplus Technology Shenzhen Co Ltd
Priority to CN201911064300.5A priority Critical patent/CN112769979B/en
Publication of CN112769979A publication Critical patent/CN112769979A/en
Application granted granted Critical
Publication of CN112769979B publication Critical patent/CN112769979B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/02Constructional features of telephone sets
    • H04M1/0202Portable telephone sets, e.g. cordless phones, mobile phones or bar type handsets
    • H04M1/026Details of the structure or mounting of specific components
    • H04M1/0266Details of the structure or mounting of specific components for a display module assembly
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/02Constructional features of telephone sets
    • H04M1/03Constructional features of telephone transmitters or receivers, e.g. telephone hand-sets
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/20Arrangements for obtaining desired frequency or directional characteristics
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/04Circuits for transducers, loudspeakers or microphones for correcting frequency response
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Abstract

The application provides a voice call method, a voice call device, computer equipment and a storage medium based on a terminal. The terminal comprises a first screen and more than one voice acquisition component arranged on different sides of the first screen; the method comprises the following steps: acquiring voice signals of target members acquired by each voice acquisition component; distinguishing a plurality of voice signals into a main signal and an auxiliary signal according to the signal strength of the voice signals; determining an auxiliary signal with a signal strength difference from the main signal being greater than a first threshold value as a noise signal; correcting the main signal according to the noise signal to obtain a target signal; and sending the target signal to the other party member participating in the voice call. By adopting the method, the voice quality in the voice call process can be improved.

Description

Voice call method and device based on terminal, computer equipment and storage medium
Technical Field
The present disclosure relates to the field of intelligent terminals, and in particular, to a voice call method, apparatus, computer device and storage medium based on a terminal.
Background
With the continuous development of computer technology, increasingly diversified intelligent terminals enter lives of people, such as intelligent mobile phones. For the instant communication equipment installed at the intelligent terminal and the call equipment built-in the terminal, in the process of realizing voice call, voice collection is mainly realized through a microphone arranged at the bottom of the terminal, and voice playing is mainly realized through a receiver arranged at the top of the terminal.
Currently, voice call technology based on a terminal is mainly implemented through an earpiece and a microphone provided on the terminal. This technique has specific requirements on the posture of the handheld terminal during the call, resulting in the quality of the voice call being affected by the posture of the handheld terminal.
Disclosure of Invention
In view of the foregoing, it is desirable to provide a terminal-based voice call method, apparatus, computer device, and storage medium that can improve the quality of voice calls.
A voice call method based on a terminal, wherein the terminal comprises a first screen and more than one voice acquisition component arranged on different sides of the first screen; the method comprises the following steps:
acquiring voice signals of target members acquired by each voice acquisition component;
Distinguishing a plurality of voice signals into a main signal and an auxiliary signal according to the signal strength of the voice signals;
determining an auxiliary signal with a signal strength difference from the main signal being greater than a first threshold value as a noise signal;
correcting the main signal according to the noise signal to obtain a target signal;
and sending the target signal to the other party member participating in the voice call.
In one embodiment, the voice signal is acquired in a target period; the target period includes a preset number of time points; the determining the secondary signal having a difference from the signal strength of the primary signal greater than a first threshold as a noise signal comprises:
determining a signal strength difference of each auxiliary signal relative to the main signal at each time point within the target period;
calculating the similarity of the corresponding auxiliary signal and the main signal according to the number of time points when the signal intensity difference is larger than a first threshold value and the preset number;
and determining the auxiliary signal with the similarity smaller than a second threshold value as a noise signal.
In one embodiment, said correcting the main signal according to the noise signal includes:
determining a correction factor according to the signal mildness of all the noise signals;
And adjusting the signal intensity of the main signal based on the correction factor to obtain a target signal.
In one embodiment, said determining a correction factor based on signal strengths of all of said noise signals comprises:
calculating the average intensity of the signal intensities of all the noise signals;
the average intensity is determined as a correction factor.
In one embodiment, the terminal further comprises a second screen and more than one voice acquisition component arranged on different sides of the second screen; the step of acquiring the voice signals of the target members acquired by each voice acquisition component comprises the following steps:
acquiring the magnetic field intensity between the first screen and the second screen;
judging the use state of the terminal according to the magnetic field intensity;
determining a main screen according to the first screen or the second screen according to the using state;
and acquiring voice signals of the target members acquired by each voice acquisition component corresponding to the main screen.
In one embodiment, the determining the usage status of the terminal according to the magnetic field strength includes:
when the magnetic field strength is larger than a third threshold value, judging that the terminal is in a folded state;
And when the magnetic field strength is smaller than or equal to a third threshold value, judging that the terminal is in an unfolding state.
In one embodiment, said determining the first screen or the second screen to be the main screen according to the usage status includes:
when the using state of the terminal is a folding state, acquiring sensor data;
and determining the first screen or the second screen facing the target member as a main screen according to the sensor data.
In one embodiment, said determining the first screen or the second screen to be the main screen according to the usage status includes:
and when the using state of the terminal is an unfolding state, determining at least one of the first screen and the second screen as a main screen.
In one embodiment, the terminal further comprises a voice playing component arranged on the first screen and a voice playing component arranged on the second screen; the method further comprises the steps of:
and playing the voice signal from the opposite party member based on the voice playing component on the main screen.
A terminal-based voice call apparatus, the terminal comprising a first screen and more than one voice acquisition component arranged on different sides of the first screen; the device comprises:
The signal acquisition module is used for acquiring the voice signal of the target member acquired by each voice acquisition component;
the signal correction module is used for distinguishing a plurality of voice signals into a main signal and an auxiliary signal according to the signal intensity of the voice signals; determining an auxiliary signal with a signal strength difference from the main signal being greater than a first threshold value as a noise signal; correcting the main signal according to the noise signal to obtain a target signal;
and the signal sending module is used for sending the target signal to the other party member participating in the voice call.
In one embodiment, the terminal further comprises a second screen and more than one voice acquisition component arranged on different sides of the second screen; the signal acquisition module is further configured to: acquiring the magnetic field intensity between the first screen and the second screen; judging the use state of the terminal according to the magnetic field intensity; determining a main screen according to the first screen or the second screen according to the using state; and acquiring the voice signals of the target members acquired by each voice acquisition component on the main screen.
A computer device comprising a memory storing a computer program and a processor implementing the steps of the above-described terminal-based voice communication method when the computer program is executed.
A computer readable storage medium having stored thereon a computer program which when executed by a processor implements the steps of the above-described terminal-based voice communication method.
According to the voice communication method, the device, the computer equipment and the storage medium based on the terminal, the voice signals of the target members are collected by the voice collection assemblies arranged on different sides of the first screen, and the signal intensity of the voice signals collected by the different voice collection assemblies is greatly different. The main signal is corrected by the one or more noise signals obtained through screening, so that the target signal with higher voice call quality can be acquired under any hand-held terminal posture in the voice call process.
Drawings
FIG. 1 is an application scenario diagram of a terminal-based voice call method in one embodiment;
FIG. 2 is a flow chart of a method of terminal-based voice call in one embodiment;
FIG. 3 is a schematic diagram of a voice call based terminal according to an embodiment;
fig. 4 is a schematic structural diagram of a terminal based on a voice call according to another embodiment;
FIG. 5 is a block diagram of a terminal-based voice call apparatus in one embodiment;
fig. 6 is an internal structural diagram of a computer device in one embodiment.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the present application will be further described in detail with reference to the accompanying drawings and examples. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the present application.
The voice call method based on the terminal can be applied to an application environment shown in fig. 1. The voice call method based on the terminal is applied to a voice call system based on the terminal. The terminal-based voice call system includes an audio processor 102, a voice acquisition component 104, and a voice playback component 106. The audio processor 102 is configured to perform signal processing on the voice signal, including processing the voice signal collected by the voice collection component and the voice signal of the counterpart; the audio processor 102 may be a Field programmable gate array (Field-Programmable Gate Array, FPGA) processor. The voice acquisition component 104 is configured to acquire a voice signal of a target member in a voice communication process, and the voice acquisition component may be a microphone; the voice acquisition components 104 are disposed on different sides of a first screen of a terminal that includes a plurality of voice components 104; each voice component 104 is interconnected with the audio processor 102. The voice playing component 106 is used for playing the voice of the opposite party in the voice communication process, and the voice playing component 106 can be a receiver, a screen sounding driver and the like; the screen generating driver is driven by an exciter arranged on the screen and generates driving force with a direction under the driving of an audio signal, so that the screen is driven to vibrate and sound. The voice playing component 106 is arranged at any position of the first screen of the terminal, and the terminal comprises at least one voice playing component 106; the voice playing component 106 is interconnected with the audio processor 102. In the voice call based on the terminal, the audio processor 102 acquires the voice signal of the target member acquired by each voice acquisition component 104, determines to obtain a main signal and a plurality of auxiliary signals according to the signal intensity of each voice signal, further determines the noise signal in the plurality of auxiliary signals according to the difference value of the signal intensity of each auxiliary signal and the main signal, and corrects the main signal by the obtained one or more noise signals to obtain the target signal to be sent to the counterpart member. The terminal 102 may be, but is not limited to, various personal computers, notebook computers, smart phones, tablet computers, and portable wearable devices, among others.
In one embodiment, as shown in fig. 2, a voice call method based on a terminal is provided, and the method is applied to the audio processor in fig. 1 for illustration, and includes the following steps:
step S202, acquiring voice signals of target members acquired by each voice acquisition component.
The target member refers to any party in the answering party or the calling party in the voice call process. The voice signals are acquired in a target period, wherein the target period comprises a preset number of time points, and each time point corresponds to the signal intensity of the voice signals.
Specifically, the terminal comprises a first screen, a plurality of voice acquisition components are arranged on different sides of the first screen, voice signals of target members are acquired based on the plurality of voice acquisition components in the voice communication process, and the acquired voice signals are transmitted to the audio processor; the collected voice signals may also be stored in a voice collection component. The audio processor acquires the voice signals of the target members acquired in the target period from each voice acquisition component. The voice signal comprises the voice of a target member in the voice communication process and the environmental noise; the environmental noise includes stationary noise and non-stationary noise. The steady-state noise means that the variation amplitude of the signal intensity of the noise at the corresponding time point does not exceed the preset variation amplitude along with the time variation; such as wind sounds; the unsteady noise means that the variation amplitude of the signal intensity of the noise at the corresponding time point exceeds the preset variation amplitude along with the time variation; such as voice signals of non-target members.
Step S204, the plurality of voice signals are distinguished into a main signal and an auxiliary signal according to the signal strength of the voice signals.
Wherein, the signal intensity represents the intensity of the voice signal, and the signal intensity can be represented by frequency amplitude.
Specifically, based on a plurality of collected voice signals, the audio processor screens the voice signal corresponding to the strongest voice signal strength according to the signal strength of each voice signal, and judges that the voice signal is a main signal and the other voice signals are auxiliary signals. The main signals are used for voice signals collected by the voice collecting component closest to the target member in voice communication, and only one main signal is used. The auxiliary signal refers to a voice signal except the main signal among the voice signals collected by all the voice collecting components. At each time point, the primary signal has a stronger signal strength than the secondary signal.
In one embodiment, the audio processor determines a signal strength corresponding to each time point of each voice signal in a target time period, where the target time period includes a start time point and an end time point, selects the start time point as a current time point, determines a signal strength corresponding to the current time point in each voice signal, and selects a voice signal corresponding to a maximum signal strength as a target voice signal; and taking the time point of the next sequence as the current time point, further determining the corresponding signal intensity of each voice signal at the current time point, selecting a target voice signal corresponding to the maximum signal intensity, and if the target voice signals corresponding to the maximum signal intensity determined by a plurality of continuous time points are all the same voice signal, judging that the target voice signal is a main signal and the other voice signals are auxiliary signals.
In step S206, the auxiliary signal having a difference from the signal intensity of the main signal greater than the first threshold is determined as the noise signal.
The first threshold is used for judging a signal intensity index of the noise signal according to the difference value of the signal intensity and is used for measuring whether the difference value of the signal intensity accords with the noise signal. The first threshold is much greater than the signal strength of the noise signal.
Specifically, based on the determined main signal and at least one auxiliary signal, the audio processor calculates a difference value of signal intensity corresponding to each time point of each auxiliary signal and the main signal in a target period to obtain a preset number of difference values of signal intensity; comparing the difference value of the signal intensity with a first threshold value, calculating the duty ratio of the number of time points corresponding to the difference value larger than the first threshold value in all preset numbers, further determining the similarity between the auxiliary signal and the main signal based on the duty ratio, and judging whether the auxiliary signal is a noise signal according to the similarity. The larger the difference value of the signal intensities is, the larger the difference exists between the signal intensities corresponding to the main signal and the auxiliary signal at the same time point, which means that the lower the similarity between the main signal and the auxiliary signal is, the lower the similarity is, the more the auxiliary signal tends to be a noise signal, and when the similarity is lower than a preset threshold value, the auxiliary signal is judged to be the noise signal.
In one embodiment, determining the secondary signal having a difference from the signal strength of the primary signal greater than a first threshold as the noise signal comprises: determining a signal strength difference of each auxiliary signal relative to the main signal at each time point within the target period; calculating the similarity of the corresponding auxiliary signal and the main signal according to the time points and the preset quantity when the signal intensity difference is larger than the first threshold value; and determining the auxiliary signal with the similarity smaller than the second threshold value as a noise signal.
The second threshold is used for measuring the similarity between the auxiliary signal and the main signal. Specifically, based on the obtained main signal and one or more auxiliary signals, the audio processor calculates the signal intensity difference between each time point of each auxiliary signal and the main signal in the target period, determines the number of time points with the signal intensity difference larger than a first threshold value, calculates the duty ratio of the number and the preset number, namely, the duty ratio of the similarity between the corresponding auxiliary signal and the main signal is obtained, further compares the duty ratio with a second threshold value, and judges the auxiliary signal corresponding to the duty ratio smaller than the second threshold value as a noise signal. In this embodiment, by calculating the signal intensity difference between the auxiliary signal and the main signal at each time point, determining the similarity between the auxiliary signal and the main signal according to the time point of the signal intensity difference conforming to the first threshold and all time points in the target period, further determining that the auxiliary signal is a noise signal according to the similarity, simply and conveniently determining the noise signal from a plurality of auxiliary signals, and providing a basis for the correction of the subsequent main signal.
Step S208, the main signal is corrected according to the noise signal, and the target signal is obtained.
Specifically, the audio processor corrects the main signal for each noise signal based on the determined noise signal and the main signal, respectively, to obtain the target signal. The specific correction method may be that, for each time point of the target time period, the audio processor subtracts the signal intensity corresponding to the main signal at each time point from the signal intensity corresponding to all noise signals at the same time point, so as to obtain the corrected target signal.
In one embodiment, correcting the primary signal based on the noise signal to obtain the target signal includes: determining a correction factor according to the signal intensity of all noise signals; and adjusting the signal intensity of the main signal based on the correction factor to obtain a target signal.
The correction factor may be an average intensity of signal intensities of all noise signals, or may be a sum of signal intensities obtained by adding signal intensities of all noise signals.
In one embodiment, the determining the correction factor according to the signal strengths of all the noise signals includes: calculating the average intensity of the signal intensities of all the noise signals; the average intensity is determined as a correction factor.
Specifically, for each time point of the target period, the audio processor calculates the average signal strength of all the auxiliary signals corresponding to each time point, and further subtracts the signal strength of the main signal corresponding to each time point from the average signal strength of the auxiliary signals to obtain the corrected target signal.
The audio processor may also calculate a sum of signal intensities of all the auxiliary signals corresponding to each time point, and further subtract the sum of signal intensities of the main signal and the auxiliary signal corresponding to each time point, so as to obtain a corrected target signal.
Step S210, the target signal is sent to the counterpart member participating in the voice call.
Specifically, based on the target signal obtained after correction, the audio processor transmits the target signal to the other party member participating in the voice call through the communication network system built in the terminal; the communication network system comprises LTE, GSM, CDMA, WCDMA, TD-SCDMA or 5G. The opposite party member can be a calling party in the voice call process or an answering party.
As shown in fig. 3, a schematic structural diagram of a terminal based on voice call is provided. As can be seen from the figure, the terminal 30 comprises a first screen 300, where the voice capturing components are arranged at different positions, such as a microphone 302 at the top of the first screen and a microphone 304 at the bottom of the first screen, as well as a microphone 3022 at the top of the terminal and a microphone 3042 at the bottom of the terminal. And a screen sound driver 306 is provided on the first screen. In the voice call process, it is assumed that the user holds the bottom of the terminal to answer the call, and the ear and mouth of the user are close to the bottom of the terminal and far from the top of the terminal. The audio processor acquires the voice signals of the target members collected by each microphone, distinguishes the main signals from the auxiliary signals according to the signal intensity of the voice signals, can judge that the voice signals collected by the microphones arranged at the bottom of the terminal are the main signals at the moment, and further judges whether the auxiliary signals are noise signals or not, if yes, the main signals are corrected according to the noise signals to obtain corrected target signals, and the audio processor sends the target signals to the other members participating in voice communication. And the screen sounding driver arranged on the first screen generates driving force with a direction under the driving of the audio signal transmitted by the opposite party member, and the voice of the opposite party member is played through driving the screen to vibrate.
In the above embodiment, the voice signals of the target member collected by the plurality of voice collecting components disposed at different sides of the first screen have a large difference in signal strength of the voice signals collected by the different voice collecting components, so as to avoid the influence of the gesture of the handheld terminal on the voice quality in the call process. The main signal is corrected by the one or more noise signals obtained through screening, so that the target signal with higher voice call quality can be acquired under any hand-held terminal posture in the voice call process.
In one embodiment, the terminal further comprises a second screen and more than one voice acquisition component arranged on different sides of the second screen; the terminal also comprises a first voice playing component arranged on the first screen and a second voice playing component arranged on the second screen; the step of acquiring the voice signals of the target members acquired by each voice acquisition component comprises the following steps: acquiring the magnetic field intensity between the first screen and the second screen; judging the use state of the terminal according to the magnetic field intensity; determining a main screen according to the first screen or the second screen according to the using state;
Specifically, the permanent magnet and the switch sensor are respectively arranged at symmetrical side edges of the first screen and the second screen, the magnetic field intensity is generated according to the distance between the permanent magnet and the switch sensor, the audio processor acquires the generated magnetic field intensity in real time or in preset time, the use state of the first screen and the second screen is judged according to the magnetic field intensity, and then the main screen is determined according to the use state.
In one embodiment, the terminal further comprises a permanent magnet and a switch sensor respectively arranged at symmetrical sides of the first screen and the second screen; the method further comprises the following steps: the magnetic field strength is determined based on the distance between the permanent magnet and the switch sensor. Namely, when the permanent magnet and the switch sensor are close to each other, the magnetic field intensity tends to increase; when the permanent magnet and the switch sensor are far away, the magnetic field intensity tends to decrease.
Specifically, the permanent magnet and the switch sensor are respectively arranged at symmetrical sides of the first screen and the second screen, wherein the switch sensor can be a Hall sensor. For example, a permanent magnet may be provided on the side of the first screen and a switch sensor may be provided on the symmetrical side of the second screen; the switch sensor may be provided on the side of the first screen, and the permanent magnet may be provided on the symmetrical side of the second screen. The magnitude of the magnetic field strength is determined based on the distance between the permanent magnet and the switch sensor.
In one embodiment, the determining the usage status of the terminal according to the magnetic field strength includes: when the magnetic field intensity is larger than a third threshold value, judging that the terminal is in a folded state; and when the magnetic field strength is smaller than or equal to the third threshold value, judging that the terminal is in the unfolding state.
Specifically, the audio processor acquires the magnetic field intensity, and judges the states of the first screen and the second screen according to the magnetic field intensity and the third threshold value. Namely, when the magnetic field intensity is larger than a third threshold value, judging that the first screen and the second screen are in a folded state; when the magnetic field strength is lower than the third threshold value, the first screen and the second screen are judged to be in an unfolding state. The third threshold value is used for judging the state of the terminal according to the magnetic field intensity.
The magnetic field intensity of the terminal is detected in real time through the switch sensor, the detected magnetic field intensity is transmitted to the audio processor, and the audio processor judges the states of the first screen and the second screen according to the magnetic field intensity and the third threshold value. When the permanent magnet gradually approaches the switch sensor, the magnetic field intensity detected by the switch sensor gradually increases from 0; once the distance between the permanent magnet and the switch sensor reaches a minimum threshold value, that is, when the permanent magnet and the switch sensor are mutually attached, the magnetic field intensity detected by the switch sensor reaches a maximum value. When the permanent magnet is gradually far away from the switch sensor, the magnetic field intensity detected by the switch sensor gradually decreases from the maximum value to 0.
The audio processor compares the received magnetic field strength with a third threshold value, and when the magnetic field strength is larger than the third threshold value, the audio processor judges that the first screen and the second screen are in a folded state; conversely, when the magnetic field strength is lower than the third threshold value, the first screen and the second screen are judged to be in the unfolded state. Therefore, the states of the first screen and the second screen can be determined according to the above principle. The third threshold is typically chosen to be close to the maximum value of the magnetic field strength, e.g. Max, which may be 95% Max.
In one embodiment, the switch sensor detects the magnetic field strength and converts the magnetic field strength into a voltage or current, and the converted voltage or current is transmitted to the audio processor. The audio processor judges states of the first screen and the second screen according to the received voltage or current and a third threshold value.
In one embodiment, the determining the first screen or the second screen according to the usage state includes: when the using state of the terminal is a folding state, acquiring sensor data; and determining the first screen or the second screen facing the target member as a main screen according to the sensor data.
The sensor data is used for judging a home screen facing the target member in the voice call process, namely, determining the home screen from the first screen and the second screen in the voice call process. The home screen refers to a terminal screen facing the target member. The sensor data comprises data acquired by a distance sensor, a photosensitive sensor or a gesture sensor; wherein the distance sensor may be an infrared distance sensor or an ultrasonic distance sensor. The use state includes a folded state and an unfolded state.
Specifically, when the use state of the terminal is a folding state, the distance parameters of the terminal and the target member are monitored in real time through a distance sensor arranged on the terminal, the audio processor acquires the distance parameters acquired by the sensor, and the first screen or the second screen facing the target member is determined to be a main screen according to the distance parameters. Further, after determining the home screen, the audio processor acquires the voice signal of the target member acquired by each voice acquisition component corresponding to the home screen.
In one embodiment, determining the first screen or the second screen according to the usage status includes: and when the using state of the terminal is an unfolding state, determining at least one of the first screen and the second screen as a main screen.
In one embodiment, when the distance sensor is an ultrasonic ranging sensor, determining the first screen or the second screen toward the target member as a home screen according to the sensor data includes: and after the ultrasonic signal reaches a target member or an obstacle, the ultrasonic signal is received by the sound playing component of the terminal, the distance parameter between the terminal and the target member is judged according to the transmitted ultrasonic signal and the reflected ultrasonic signal, and a terminal screen facing the target member is determined to be a main screen according to the distance parameter.
In this embodiment, the terminal based on the voice call is a dual-screen terminal, the terminal includes two screens, a plurality of voice acquisition components are disposed on different sides of each screen, and a sensor is disposed on the terminal for determining a home screen where the terminal is located in the process of the voice call. When the terminal is in a folding state, judging a main screen of the terminal facing the target member in the voice call process through sensor data acquired by a sensor arranged on the terminal, and further acquiring a voice signal of the target member acquired by a voice acquisition component corresponding to the main screen. According to the embodiment, the main screen is determined through the sensor data aiming at the double-screen terminal in the folding state, and then the voice signal of the target member is collected through the voice collecting component corresponding to the main screen, so that the voice of the target member can be normally collected in the voice call process, and high-quality voice call can be realized no matter whether the target member faces any screen of the terminal.
In one embodiment, the terminal further comprises a voice playing component arranged on the first screen and a voice playing component arranged on the second screen; the method further comprises the steps of: and playing the voice signal from the opposite party member based on the voice playing component on the main screen.
Specifically, during a voice call, the voice of the counterpart member is played through a first voice playing component provided on the first screen and a second voice playing component provided on the second screen. But the strategies of selecting the playing components for playing the voice of the opposite member are different due to the different states of the first screen and the second screen. Based on the screen state determined by the magnetic field intensity detected by the switch sensor, the corresponding voice playing component is determined to play the voice of the opposite member. When the first screen and the second screen are in a folding state, playing the voice of the opposite party member through a voice playing component arranged on the main screen; when the first screen and the second screen are in an unfolding state, the voice of the opposite party member is played through voice playing components respectively arranged on the first screen and the second screen. By arranging the voice playing components on different screens of the double-screen terminal, the voice of the opposite party member can be normally played no matter the terminal screen is in a folded state or an unfolded state, and high-quality voice communication is realized in the voice communication process.
In this embodiment, the magnetic field strength detected by the switch sensor is determined by the motion states of the permanent magnet and the switch sensor, and the use state of the terminal can be determined according to the magnetic field strength, when the magnetic field strength is greater than the third threshold, the first screen and the second screen are in a folded state, otherwise, when the magnetic field strength is lower than the third threshold, the first screen and the second screen are in an unfolded state, so that the state determination of the dual-screen terminal is simply and conveniently realized.
In one embodiment, as shown in fig. 4, another schematic structural diagram of a terminal based on voice call is provided. The terminal 30 further comprises a second screen 400, where the voice capturing components are arranged at different positions of the terminal, such as a microphone 402 arranged at the top of the second screen and a microphone 404 arranged at the bottom of the second screen, and a microphone 4022 arranged at the top of the terminal and a microphone 4042 arranged at the bottom of the terminal; and a screen sound driver 406 is provided on the second screen. The terminal further includes a permanent magnet 408 disposed at the left side of the first screen and a switch sensor 410 disposed at the right side of the second screen, respectively. The terminal also includes a distance sensor 412, which may be located anywhere on the terminal.
In the voice call process, as the terminal is provided with two screens, aiming at the double-screen terminal, the state of the first screen and the second screen is judged to be in a folded state or an unfolded state, namely, the magnetic field intensity detected by the switch sensor is obtained through the motion states of the permanent magnet and the switch sensor, when the magnetic field intensity is larger than a third threshold value, the first screen and the second screen are judged to be in the folded state, otherwise, when the magnetic field intensity is lower than the third threshold value, the first screen and the second screen are judged to be in the unfolded state.
After determining the state of the terminal screen, detecting a main screen facing the target member in the voice call process through a distance sensor; and the voice signal of the target member is collected through the voice collection component corresponding to the main screen, the main signal and the auxiliary signal are distinguished according to the signal intensity of the voice signal, whether the auxiliary signal is a noise signal is further judged, if yes, the main signal is corrected according to the noise signal, the corrected target signal is obtained, and the audio processor sends the target signal to the other member participating in the voice call. Further, according to the states of the first screen and the second screen, a voice playing component for playing the voice of the opposite party member is determined, and then the voice is played according to the determined voice playing component.
In this embodiment, the terminal includes two screens, and a plurality of voice acquisition components are disposed on different sides of each screen, and a distance sensor is disposed on the terminal for determining a home screen facing the target member in the voice call process. In the voice call process of the double-screen terminal, the states of the first screen and the second screen are judged according to the magnetic field intensity detected by the switch sensor, and further, the voice of the opposite party member is played by adopting the corresponding voice playing component according to different screen states. Meanwhile, after the state of the screen is determined, the main screen facing the target member is further judged according to the distance sensor, the voice signal of the target member is collected through the voice collection component corresponding to the main screen, no specific requirement is made on the gesture of the target member handheld terminal in the voice communication process, the dual-screen terminal can be held at will, high-quality voice communication can be realized, the voice quality of the played voice signal is not influenced by the gesture of the handheld terminal, and meanwhile, the voice quality of the collected voice signal is not influenced by the gesture of the handheld terminal.
It should be understood that, although the steps in the flowcharts of fig. 2-4 are shown in order as indicated by the arrows, these steps are not necessarily performed in order as indicated by the arrows. The steps are not strictly limited to the order of execution unless explicitly recited herein, and the steps may be executed in other orders. Moreover, at least some of the steps in fig. 2-4 may include multiple sub-steps or stages that are not necessarily performed at the same time, but may be performed at different times, nor do the order in which the sub-steps or stages are performed necessarily occur sequentially, but may be performed alternately or alternately with at least a portion of the sub-steps or stages of other steps or steps.
In one embodiment, as shown in fig. 5, a terminal-based voice call apparatus 500 is provided, the terminal including a first screen and more than one voice acquisition component disposed on different sides of the first screen; the device comprises: a signal acquisition module 502, a signal correction module 504, and a signal transmission module 506, wherein:
the signal acquisition module 502 is configured to acquire the voice signal of the target member acquired by each voice acquisition component.
A signal correction module 504, configured to distinguish the plurality of voice signals from the main signal and the auxiliary signal according to signal strength of the voice signals; determining an auxiliary signal with a signal strength difference from the main signal being greater than a first threshold value as a noise signal; and correcting the main signal according to the noise signal to obtain a target signal.
The signal sending module 506 is configured to send the target signal to a partner member participating in the voice call.
In one embodiment, the signal correction module is further configured to determine a signal strength difference of each auxiliary signal relative to the main signal at each time point in the target period; calculating the similarity of the corresponding auxiliary signal and the main signal according to the time points and the preset quantity when the signal intensity difference is larger than the first threshold value; and determining the auxiliary signal with the similarity smaller than the second threshold value as a noise signal.
In one embodiment, the signal correction module is further configured to determine a correction factor according to signal strengths of all noise signals; and adjusting the signal intensity of the main signal based on the correction factor to obtain a target signal.
In one embodiment, the terminal further comprises a second screen and more than one voice acquisition component arranged on different sides of the second screen; the terminal also comprises a first voice playing component arranged on the first screen and a second voice playing component arranged on the second screen; the signal acquisition module is also used for: acquiring sensor data; determining a main screen from the first screen and the second screen according to the sensor data; and acquiring the voice signals of the target members acquired by each voice acquisition component arranged on the main screen.
In one embodiment, the apparatus further comprises a screen state determination module for acquiring a magnetic field strength; when the magnetic field intensity is larger than a third threshold value, judging that the first screen and the second screen are in a folded state; when the magnetic field strength is lower than the third threshold value, the first screen and the second screen are judged to be in an unfolding state.
In one embodiment, the terminal further comprises a permanent magnet and a switch sensor respectively arranged at symmetrical sides of the first screen and the second screen; the device also comprises a magnetic field strength analysis module, wherein the magnetic field strength analysis module is used for increasing the magnetic field strength when the permanent magnet is close to the switch sensor; when the permanent magnet and the switch sensor are far away, the magnetic field intensity tends to decrease.
In one embodiment, the terminal further comprises a first voice playing component arranged on the first screen and a second voice playing component arranged on the second screen; the device also comprises a voice playing module, a voice playing module and a voice processing module, wherein the voice playing module is used for playing the voice of the opposite party member through a voice playing component arranged on the main screen when the first screen and the second screen are in a folding state; when the first screen and the second screen are in an unfolding state, the voice of the opposite party member is played through voice playing components respectively arranged on the first screen and the second screen.
In this embodiment, through the voice signals of the target member collected by the plurality of voice collecting components disposed at different sides of the first screen, because there is a large difference in signal strength of the voice signals collected by the different voice collecting components, in order to avoid the voice quality in the conversation process from being affected by the gesture of the handheld terminal, the scheme determines a main signal and a plurality of auxiliary signals according to the signal strength of the voice signals, and further determines a noise signal in the plurality of auxiliary signals according to the difference in signal strength of each auxiliary signal and the main signal, so that the obtained one or more noise signals correct the main signal to obtain the target signal. The main signal is corrected by the one or more noise signals obtained through screening, so that the target signal with higher voice call quality can be acquired under any hand-held terminal posture in the voice call process.
For specific limitations on the terminal-based voice call apparatus, reference may be made to the above limitations on the terminal-based voice call method, and no further description is given here. The various modules in the terminal-based voice call apparatus described above may be implemented in whole or in part by software, hardware, or a combination thereof. The above modules may be embedded in hardware or may be independent of a processor in the computer device, or may be stored in software in a memory in the computer device, so that the processor may call and execute operations corresponding to the above modules.
In one embodiment, a computer device is provided, which may be a terminal, and the internal structure of which may be as shown in fig. 6. The computer device includes a processor, a memory, a network interface, a display screen, and an input device connected by a system bus. Wherein the processor of the computer device is configured to provide computing and control capabilities. The memory of the computer device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and a computer program. The internal memory provides an environment for the operation of the operating system and computer programs in the non-volatile storage media. The network interface of the computer device is used for communicating with an external terminal through a network connection. The computer program, when executed by a processor, implements a terminal-based voice call method. The display screen of the computer equipment can be a liquid crystal display screen or an electronic ink display screen, and the input device of the computer equipment can be a touch layer covered on the display screen, can also be keys, a track ball or a touch pad arranged on the shell of the computer equipment, and can also be an external keyboard, a touch pad or a mouse and the like.
It will be appreciated by those skilled in the art that the structure shown in fig. 6 is merely a block diagram of some of the structures associated with the present application and is not limiting of the computer device to which the present application may be applied, and that a particular computer device may include more or fewer components than shown, or may combine certain components, or have a different arrangement of components.
A computer device comprising a memory and a processor, the memory storing a computer program, characterized in that the processor, when executing the computer program, implements the steps of the terminal-based voice call method provided in any one of the embodiments of the present application.
A computer readable storage medium having stored thereon a computer program which when executed by a processor implements the steps of the terminal-based voice call method provided in any one of the embodiments of the present application.
Those skilled in the art will appreciate that implementing all or part of the above-described methods in accordance with the embodiments may be accomplished by way of a computer program stored on a non-transitory computer readable storage medium, which when executed may comprise the steps of the embodiments of the methods described above. Any reference to memory, storage, database, or other medium used in the various embodiments provided herein may include non-volatile and/or volatile memory. The nonvolatile memory can include Read Only Memory (ROM), programmable ROM (PROM), electrically Programmable ROM (EPROM), electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double Data Rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous Link DRAM (SLDRAM), memory bus direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), among others.
The technical features of the above embodiments may be arbitrarily combined, and all possible combinations of the technical features in the above embodiments are not described for brevity of description, however, as long as there is no contradiction between the combinations of the technical features, they should be considered as the scope of the description.
The foregoing examples represent only a few embodiments of the present application, which are described in more detail and are not to be construed as limiting the scope of the invention. It should be noted that it would be apparent to those skilled in the art that various modifications and improvements could be made without departing from the spirit of the present application, which would be within the scope of the present application. Accordingly, the scope of protection of the present application is to be determined by the claims appended hereto.

Claims (13)

1. A voice call method based on a terminal, wherein the terminal comprises a first screen and more than one voice acquisition component arranged on different sides of the first screen; the method comprises the following steps:
acquiring voice signals of target members acquired by each voice acquisition component; the voice signals are acquired in a target period, the target period comprises a preset number of time points, and each time point corresponds to the signal intensity of the voice signals;
If the target voice signals corresponding to the maximum signal intensities determined by a plurality of continuous time points are the same voice signal, judging that the target voice signals are main signals and the other voice signals are auxiliary signals;
determining an auxiliary signal with a signal strength difference from the main signal being greater than a first threshold value as a noise signal;
determining a correction factor according to the signal intensity of all the noise signals, and adjusting the signal intensity of the main signal based on the correction factor to obtain a target signal;
and sending the target signal to the other party member participating in the voice call.
2. The method of claim 1, wherein determining the secondary signal having a difference from the signal strength of the primary signal greater than a first threshold as the noise signal comprises:
determining a signal strength difference of each auxiliary signal relative to the main signal at each time point within the target period;
calculating the similarity of the corresponding auxiliary signal and the main signal according to the data of the time points with the signal intensity difference larger than the first threshold value and the preset quantity;
and determining the auxiliary signal with the similarity smaller than a second threshold value as a noise signal.
3. The method of claim 1, wherein the correction factor is a sum of signal strengths obtained by adding signal strengths of all of the noise signals.
4. The method of claim 1, wherein said determining a correction factor based on signal strengths of all of said noise signals comprises:
calculating the average intensity of the signal intensities of all the noise signals;
the average intensity is determined as a correction factor.
5. The method of claim 1, wherein the terminal further comprises a second screen and more than one voice acquisition component disposed on different sides of the second screen; the step of acquiring the voice signals of the target members acquired by each voice acquisition component comprises the following steps:
acquiring the magnetic field intensity between the first screen and the second screen;
judging the use state of the terminal according to the magnetic field intensity;
determining a main screen according to the first screen or the second screen according to the using state;
and acquiring the voice signals of the target members acquired by each voice acquisition component on the main screen.
6. The method of claim 5, wherein said determining the usage status of the terminal based on the magnetic field strength comprises:
when the magnetic field strength is larger than a third threshold value, judging that the terminal is in a folded state;
And when the magnetic field strength is smaller than or equal to a third threshold value, judging that the terminal is in an unfolding state.
7. The method of claim 5, wherein the determining the first screen or the second screen to be the home screen according to the usage status comprises:
when the using state of the terminal is a folding state, acquiring sensor data;
and determining the first screen or the second screen facing the target member as a main screen according to the sensor data.
8. The method of claim 5, wherein the determining the first screen or the second screen to be the home screen according to the usage status comprises:
and when the using state of the terminal is an unfolding state, determining at least one of the first screen and the second screen as a main screen.
9. The method of claim 5, wherein the terminal further comprises a voice playing component disposed on the first screen and a voice playing component disposed on the second screen; the method further comprises the steps of:
and playing the voice signal from the opposite party member based on the voice playing component on the main screen.
10. A terminal-based voice call apparatus, wherein the terminal comprises a first screen and more than one voice acquisition component arranged on different sides of the first screen; the device comprises:
the signal acquisition module is used for acquiring the voice signal of the target member acquired by each voice acquisition component; the voice signals are acquired in a target period, the target period comprises a preset number of time points, and each time point corresponds to the signal intensity of the voice signals;
the signal correction module is used for judging that the target voice signal is a main signal and the other voice signals are auxiliary signals if the target voice signals corresponding to the maximum signal intensities determined by a plurality of continuous time points are the same voice signal; determining an auxiliary signal with a signal strength difference from the main signal being greater than a first threshold value as a noise signal; determining a correction factor according to the signal intensity of all the noise signals, and adjusting the signal intensity of the main signal based on the correction factor to obtain a target signal;
and the signal sending module is used for sending the target signal to the other party member participating in the voice call.
11. The apparatus of claim 10, wherein the terminal further comprises a second screen and more than one voice acquisition component disposed on different sides of the second screen; the signal acquisition module is further configured to: acquiring the magnetic field intensity between the first screen and the second screen; judging the use state of the terminal according to the magnetic field intensity; determining a main screen according to the first screen or the second screen according to the using state; and acquiring the voice signals of the target members acquired by each voice acquisition component on the main screen.
12. A computer device comprising a memory and a processor, the memory storing a computer program, characterized in that the processor implements the steps of the method of any one of claims 1 to 9 when the computer program is executed.
13. A computer readable storage medium, on which a computer program is stored, characterized in that the computer program, when being executed by a processor, implements the steps of the method of any of claims 1 to 9.
CN201911064300.5A 2019-11-04 2019-11-04 Voice call method and device based on terminal, computer equipment and storage medium Active CN112769979B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911064300.5A CN112769979B (en) 2019-11-04 2019-11-04 Voice call method and device based on terminal, computer equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911064300.5A CN112769979B (en) 2019-11-04 2019-11-04 Voice call method and device based on terminal, computer equipment and storage medium

Publications (2)

Publication Number Publication Date
CN112769979A CN112769979A (en) 2021-05-07
CN112769979B true CN112769979B (en) 2023-05-05

Family

ID=75692267

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911064300.5A Active CN112769979B (en) 2019-11-04 2019-11-04 Voice call method and device based on terminal, computer equipment and storage medium

Country Status (1)

Country Link
CN (1) CN112769979B (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108737615A (en) * 2018-06-27 2018-11-02 努比亚技术有限公司 microphone reception method, mobile terminal and computer readable storage medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104092802A (en) * 2014-05-27 2014-10-08 中兴通讯股份有限公司 Method and system for de-noising audio signal
CN107393548B (en) * 2017-07-05 2021-05-07 海信视像科技股份有限公司 Method and device for processing voice information collected by multiple voice assistant devices
CN107742523B (en) * 2017-11-16 2022-01-07 Oppo广东移动通信有限公司 Voice signal processing method and device and mobile terminal
CN109739465B (en) * 2018-12-21 2021-01-29 维沃移动通信有限公司 Audio output method and mobile terminal

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108737615A (en) * 2018-06-27 2018-11-02 努比亚技术有限公司 microphone reception method, mobile terminal and computer readable storage medium

Also Published As

Publication number Publication date
CN112769979A (en) 2021-05-07

Similar Documents

Publication Publication Date Title
US9792602B2 (en) Apparatus and method for providing emotion expression service in mobile communication terminal
US20210217433A1 (en) Voice processing method and apparatus, and device
CN107742523B (en) Voice signal processing method and device and mobile terminal
CN108076226B (en) Method for adjusting call quality, mobile terminal and storage medium
KR101933642B1 (en) Method and apparatus for triggering execution of operation instruction
RU2656693C2 (en) Event prompting method and device
CN108665889B (en) Voice signal endpoint detection method, device, equipment and storage medium
CN109819489B (en) Terminal calling method and terminal
KR20200063095A (en) Multi-beam selection method and apparatus
CN109739465B (en) Audio output method and mobile terminal
EP3021207A1 (en) Portable electronic device including touch-sensitive display and method of controlling audio output
US20160286031A1 (en) Voice call establishing method and apparatus
RU2665300C2 (en) Method and device for processing point reporting of touch screen
CN110830644A (en) Display screen control method and device, electronic equipment and storage medium
CN106713656B (en) Shooting method and mobile terminal
US11025764B2 (en) Method and system for reducing current noise of touch terminal
CN109561255B (en) Terminal photographing method and device and storage medium
CN105188027A (en) Nearby user searching method and device
CN108668018B (en) Mobile terminal, volume control method and related product
CN112769979B (en) Voice call method and device based on terminal, computer equipment and storage medium
CN107454265B (en) Method and device for recording call information based on call mode change
CN115378302A (en) Control method, device and equipment of linear motor and readable storage medium
CN105611044A (en) Conversation control method and device and terminal
KR102155555B1 (en) Method for providing a hearing aid compatibility and an electronic device thereof
CN108650595B (en) Hole plugging processing method and device, electronic equipment and computer readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant