CN112769979A - Voice call method and device based on terminal, computer equipment and storage medium - Google Patents

Voice call method and device based on terminal, computer equipment and storage medium Download PDF

Info

Publication number
CN112769979A
CN112769979A CN201911064300.5A CN201911064300A CN112769979A CN 112769979 A CN112769979 A CN 112769979A CN 201911064300 A CN201911064300 A CN 201911064300A CN 112769979 A CN112769979 A CN 112769979A
Authority
CN
China
Prior art keywords
signal
screen
voice
terminal
main
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201911064300.5A
Other languages
Chinese (zh)
Other versions
CN112769979B (en
Inventor
彭功良
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Oneplus Technology Shenzhen Co Ltd
Original Assignee
Oneplus Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Oneplus Technology Shenzhen Co Ltd filed Critical Oneplus Technology Shenzhen Co Ltd
Priority to CN201911064300.5A priority Critical patent/CN112769979B/en
Publication of CN112769979A publication Critical patent/CN112769979A/en
Application granted granted Critical
Publication of CN112769979B publication Critical patent/CN112769979B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/02Constructional features of telephone sets
    • H04M1/0202Portable telephone sets, e.g. cordless phones, mobile phones or bar type handsets
    • H04M1/026Details of the structure or mounting of specific components
    • H04M1/0266Details of the structure or mounting of specific components for a display module assembly
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/02Constructional features of telephone sets
    • H04M1/03Constructional features of telephone transmitters or receivers, e.g. telephone hand-sets
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/20Arrangements for obtaining desired frequency or directional characteristics
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/04Circuits for transducers, loudspeakers or microphones for correcting frequency response
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Telephone Function (AREA)

Abstract

The application provides a voice call method and device based on a terminal, computer equipment and a storage medium. The terminal comprises a first screen and more than one voice acquisition assembly arranged on different sides of the first screen; the method comprises the following steps: acquiring a voice signal of a target member acquired by each voice acquisition component; distinguishing a plurality of voice signals into a main signal and an auxiliary signal according to the signal intensity of the voice signals; determining an auxiliary signal having a signal intensity difference with the main signal greater than a first threshold as a noise signal; correcting the main signal according to the noise signal to obtain a target signal; and sending the target signal to the other party participating in the voice call. The method can improve the voice quality in the voice call process.

Description

Voice call method and device based on terminal, computer equipment and storage medium
Technical Field
The present application relates to the field of intelligent terminal technologies, and in particular, to a terminal-based voice call method, apparatus, computer device, and storage medium.
Background
With the continuous development of computer technology, more and more diversified intelligent terminals enter the lives of people, such as smart phones. For instant messaging equipment installed on an intelligent terminal and call equipment built in the terminal, in the process of realizing voice call, voice collection is mainly realized through a microphone arranged at the bottom of the terminal, and voice playing is mainly realized through a receiver arranged at the top of the terminal.
Currently, the terminal-based voice call technology is mainly implemented by an earphone and a microphone disposed on the terminal. The technology has specific requirements on the posture of the handheld terminal in the call process, so that the quality of the voice call is influenced by the posture of the handheld terminal.
Disclosure of Invention
In view of the foregoing, it is desirable to provide a terminal-based voice call method, apparatus, computer device, and storage medium capable of improving voice call quality.
A voice call method based on a terminal is provided, wherein the terminal comprises a first screen and more than one voice acquisition assembly arranged on different sides of the first screen; the method comprises the following steps:
acquiring a voice signal of a target member acquired by each voice acquisition component;
distinguishing a plurality of voice signals into a main signal and an auxiliary signal according to the signal intensity of the voice signals;
determining an auxiliary signal having a signal intensity difference with the main signal greater than a first threshold as a noise signal;
correcting the main signal according to the noise signal to obtain a target signal;
and sending the target signal to the other party participating in the voice call.
In one embodiment, the speech signal is acquired during a target time period; the target time period comprises a preset number of time points; determining the auxiliary signal having a signal strength difference greater than a first threshold from the main signal as a noise signal includes:
determining a signal strength difference of each secondary signal with respect to the primary signal at each time point within the target period;
calculating the similarity between the corresponding auxiliary signal and the main signal according to the number of the time points with the signal intensity difference larger than a first threshold and the preset number;
and determining the auxiliary signal with the similarity smaller than a second threshold value as a noise signal.
In one embodiment, said modifying said main signal according to said noise signal to obtain a target signal comprises:
determining a correction factor according to the signal lightness of all the noise signals;
and adjusting the signal intensity of the main signal based on the correction factor to obtain a target signal.
In one embodiment, said determining a correction factor based on signal strengths of all of said noise signals comprises:
calculating an average intensity of signal intensities of all the noise signals;
determining the average intensity as a correction factor.
In one embodiment, the terminal further comprises a second screen and more than one voice acquisition assembly arranged on different sides of the second screen; the acquiring the voice signal of the target member acquired by each voice acquisition component comprises:
acquiring the magnetic field intensity between the first screen and the second screen;
judging the use state of the terminal according to the magnetic field intensity;
determining the first screen or the second screen as a main screen according to the use state;
and acquiring the voice signal of the target member acquired by each corresponding voice acquisition component on the main screen.
In one embodiment, the determining the use state of the terminal according to the magnetic field strength includes:
when the magnetic field intensity is larger than a third threshold value, determining that the terminal is in a folded state;
and when the magnetic field intensity is smaller than or equal to a third threshold value, determining that the terminal is in an unfolding state.
In one embodiment, the determining the first screen or the second screen as a main screen according to the use status includes:
when the use state of the terminal is a folding state, acquiring sensor data;
determining the first screen or the second screen toward the target member as a main screen according to the sensor data.
In one embodiment, the determining the first screen or the second screen as a main screen according to the use status includes:
and when the using state of the terminal is the unfolding state, determining at least one of the first screen and the second screen as a main screen.
In one embodiment, the terminal further comprises a voice playing component arranged on the first screen and a voice playing component arranged on the second screen; the method further comprises the following steps:
and playing the voice signal from the other party member based on the voice playing component on the main screen.
A voice call device based on a terminal comprises a first screen and more than one voice acquisition assembly, wherein the more than one voice acquisition assembly is arranged on different sides of the first screen; the device comprises:
the signal acquisition module is used for acquiring the voice signal of the target member acquired by each voice acquisition component;
the signal correction module is used for distinguishing a plurality of voice signals into a main signal and an auxiliary signal according to the signal intensity of the voice signals; determining an auxiliary signal having a signal intensity difference with the main signal greater than a first threshold as a noise signal; correcting the main signal according to the noise signal to obtain a target signal;
and the signal sending module is used for sending the target signal to the other party participating in the voice call.
In one embodiment, the terminal further comprises a second screen and more than one voice acquisition assembly arranged on different sides of the second screen; the signal acquisition module is further configured to: acquiring the magnetic field intensity between the first screen and the second screen; judging the use state of the terminal according to the magnetic field intensity; determining the first screen or the second screen as a main screen according to the use state; and acquiring the voice signal of the target member acquired by each voice acquisition component on the main screen.
A computer device comprising a memory storing a computer program and a processor implementing the steps of the terminal-based voice communication method described above when executing the computer program.
A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the above-mentioned terminal-based voice communication method.
According to the voice communication method based on the terminal, the device, the computer equipment and the storage medium, the voice signals of the target member are acquired by the voice acquisition assemblies arranged on different sides of the first screen, the signal intensity of the voice signals acquired by the different voice acquisition assemblies is greatly different, so that the voice quality in the conversation process is prevented from being influenced by the posture of the handheld terminal, a main signal and a plurality of auxiliary signals are judged according to the signal intensity of the voice signals, then the noise signals in the auxiliary signals are determined according to the difference value of the signal intensity of each auxiliary signal and the signal intensity of the main signal, and the main signal is corrected by one or more obtained noise signals to obtain the target signal. The main signal is corrected through one or more noise signals obtained through screening, so that a target signal with higher voice call quality can be acquired under any hand-held terminal posture in the voice call process.
Drawings
Fig. 1 is an application scenario diagram of a terminal-based voice call method in an embodiment;
FIG. 2 is a flow chart illustrating a method for voice over terminal in one embodiment;
FIG. 3 is a diagram illustrating a structure of a terminal based on a voice call in an embodiment;
fig. 4 is a schematic structural diagram of a terminal based on a voice call in another embodiment;
FIG. 5 is a block diagram of a terminal-based voice communicator in one embodiment;
FIG. 6 is a diagram illustrating an internal structure of a computer device according to an embodiment.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the present application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.
The terminal-based voice call method provided by the application can be applied to the application environment shown in fig. 1. The terminal-based voice call method is applied to a terminal-based voice call system. The terminal-based voice call system includes an audio processor 102, a voice capture component 104, and a voice playback component 106. The audio processor 102 is configured to perform signal processing on a voice signal, including processing a voice signal acquired by the voice acquisition component and a voice signal of a member of an opposite party; the audio processor 102 may be a Field-Programmable Gate Array (FPGA) processor. The voice acquisition component 104 is used for acquiring a voice signal of a target member in a voice call process, and may be a microphone; the voice acquisition components 104 are arranged on different sides of a first screen of the terminal, and the terminal comprises a plurality of voice components 104; each speech component 104 is interconnected with the audio processor 102. The voice playing component 106 is used for playing the voice of the other member in the voice call process, and the voice playing component 106 can be a receiver, a screen sounding driver and the like; the screen generating driver is driven by an exciter arranged on the screen to generate a driving force with a direction under the driving of an audio signal, so that the screen is driven to vibrate and generate sound. The voice playing component 106 is arranged at any position of a first screen of the terminal, and the terminal comprises at least one voice playing component 106; the voice playback component 106 is interconnected with the audio processor 102. In the voice call based on the terminal, the audio processor 102 acquires the voice signal of the target member acquired by each voice acquisition component 104, determines to obtain a main signal and a plurality of auxiliary signals according to the signal strength of each voice signal, determines noise signals in the plurality of auxiliary signals according to the difference value of the signal strength of each auxiliary signal and the main signal, corrects the main signal by using one or more obtained noise signals, and obtains the target signal to be sent to the other member. The terminal 102 may be, but is not limited to, various personal computers, notebook computers, smart phones, tablet computers, and portable wearable devices.
In one embodiment, as shown in fig. 2, a terminal-based voice call method is provided, which is described by taking the method as an example applied to the audio processor in fig. 1, and includes the following steps:
step S202, acquiring the voice signal of the target member acquired by each voice acquisition component.
The target member refers to any one of the answering party or the calling party in the voice call process. The speech signal is acquired during a target period, wherein the target period includes a preset number of time points, each time point corresponding to a signal strength of the speech signal.
Specifically, the terminal comprises a first screen, a plurality of voice acquisition components are arranged on different sides of the first screen, and in the voice call process, voice signals of a target member are acquired based on the plurality of voice acquisition components and transmitted to an audio processor; the collected voice signals may also be stored in a voice collection component. The audio processor acquires the voice signals of the target members acquired in the target time period from each voice acquisition component. The voice signal comprises voice of a target member and environmental noise in the voice call process; the ambient noise includes stationary noise and non-stationary noise. The steady-state noise means that the change amplitude of the signal intensity of the noise at the corresponding time point does not exceed the preset change amplitude along with the change of time; such as wind noise; the unsteady noise is that the change amplitude of the signal intensity of the noise at the corresponding time point exceeds the preset change amplitude along with the time change; such as the speech signal of a non-target member.
Step S204, distinguishing the main signal and the auxiliary signal from the plurality of voice signals according to the signal intensity of the voice signals.
Wherein, the signal strength represents the strength of the voice signal, and the signal strength can be represented by frequency amplitude.
Specifically, based on the collected multiple voice signals, the audio processor screens the voice signal corresponding to the strongest voice signal strength according to the signal strength of each voice signal, and determines that the voice signal is a main signal and the other voice signals are auxiliary signals. The main signal is used for the voice signal collected by the voice collection component closest to the target member in the voice call, and only one main signal is used. The auxiliary signal is a voice signal excluding the main signal from the voice signals collected by all the voice collecting components. At each time point, the signal strength of the primary signal is strong compared to the secondary signal.
In one embodiment, the audio processor determines a signal strength corresponding to each time point of each voice signal in a target time period, wherein the target time period comprises a starting time point and an ending time point, the starting time point is selected as a current time point, the signal strength corresponding to the current time point in each voice signal is determined, and the voice signal corresponding to the maximum signal strength is selected as a target voice signal; and taking the time point of the next sequence as the current time point, further determining the corresponding signal intensity of the current time point in each voice signal, selecting a target voice signal corresponding to the maximum signal intensity, and if the target voice signals corresponding to the maximum signal intensities determined at a plurality of continuous time points are the same voice signal, judging that the target voice signal is a main signal and the other voice signals are auxiliary signals.
In step S206, the auxiliary signal having a signal intensity difference from the main signal greater than the first threshold is determined as a noise signal.
The first threshold is used for judging the signal intensity index of the noise signal according to the difference of the signal intensity and is used for measuring whether the difference of the signal intensity accords with the noise signal or not. The first threshold is much greater than the signal strength of the noise signal.
Specifically, based on the determined main signal and at least one auxiliary signal, the audio processor calculates a difference value of signal intensity corresponding to each auxiliary signal and each main signal at each time point in a target time period to obtain a difference value of a preset number of signal intensities; and comparing the difference of the signal intensity with a first threshold, calculating the ratio of the number of the time points corresponding to the difference greater than the first threshold to all preset numbers, further determining the similarity between the auxiliary signal and the main signal based on the ratio, and judging whether the auxiliary signal is a noise signal according to the similarity. The larger the difference of the signal intensity is, the larger the difference of the signal intensity corresponding to the main signal and the auxiliary signal at the same time point exists, which indicates that the lower the similarity between the main signal and the auxiliary signal is, the lower the similarity is, the more the auxiliary signal tends to be a noise signal, and when the similarity is lower than a preset threshold, the auxiliary signal is determined to be a noise signal.
In one embodiment, determining the secondary signal having a signal strength that differs from the primary signal by more than a first threshold as the noise signal comprises: determining a signal strength difference of each secondary signal with respect to the primary signal at each time point within a target period; calculating the similarity between the corresponding auxiliary signal and the main signal according to the time point and the preset number of the signal intensity differences larger than the first threshold; and determining the auxiliary signal with the similarity smaller than a second threshold value as a noise signal.
Wherein the second threshold is used for measuring the similarity between the auxiliary signal and the main signal. Specifically, based on the obtained main signal and one or more auxiliary signals, the audio processor calculates the signal intensity difference between each auxiliary signal and the main signal at each time point in the target time period, determines the number of time points with the signal intensity difference larger than a first threshold, calculates the ratio of the number to a preset number to obtain the ratio of the similarity between the corresponding auxiliary signal and the main signal, further compares the ratio with a second threshold, and determines the auxiliary signal with the ratio smaller than the second threshold as the noise signal. In the embodiment, the signal intensity difference between the auxiliary signal and the main signal at each time point is calculated, the similarity between the auxiliary signal and the main signal is determined according to the time point of the signal intensity difference meeting the first threshold and all the time points in the target time period, and the auxiliary signal is further judged to be a noise signal according to the similarity, so that the noise signal is simply and conveniently judged from the auxiliary signals, and a basis is provided for the correction of the subsequent main signal.
And S208, correcting the main signal according to the noise signal to obtain a target signal.
Specifically, the audio processor corrects each noise signal to the main signal based on the determined noise signal and main signal to obtain the target signal. The specific correction method may be that, for each time point of the target time period, the audio processor subtracts the signal intensity of the main signal corresponding to each time point from the signal intensity of all the noise signals corresponding to the same time point to obtain a corrected target signal.
In one embodiment, modifying the main signal according to the noise signal to obtain the target signal comprises: determining a correction factor according to the signal intensity of all the noise signals; and adjusting the signal intensity of the main signal based on the correction factor to obtain the target signal.
The correction factor may be an average intensity of signal intensities of all noise signals, or may be a sum of signal intensities obtained by adding signal intensities of all noise signals.
In one embodiment, the determining the correction factor according to the signal strength of all the noise signals includes: calculating an average intensity of signal intensities of all the noise signals; determining the average intensity as a correction factor.
Specifically, for each time point of the target time period, the audio processor calculates an average signal intensity of all the auxiliary signals corresponding to each time point, and further subtracts the signal intensity of the main signal corresponding to each time point from the average signal intensity of the auxiliary signals to obtain the corrected target signal.
The audio processor may also calculate the signal intensity sum of all the auxiliary signals corresponding to each time point, and further subtract the signal intensity sum of the main signal corresponding to each time point from the signal intensity sum of the auxiliary signal to obtain the corrected target signal.
Step S210, sending the target signal to the other party participating in the voice call.
Specifically, based on the target signal obtained after correction, the audio processor sends the target signal to the other party member participating in the voice call through a communication network system built in the terminal; the communication network system comprises LTE, GSM, CDMA2000, WCDMA, TD-SCDMA or 5G. The other party member can be a calling party in the voice communication process or an answering party.
As shown in fig. 3, a schematic structural diagram of a terminal based on a voice call is provided. As can be seen, the terminal 30 comprises a first screen 300, and speech acquisition components are arranged at different positions of the terminal, such as a microphone 302 arranged at the top of the first screen and a microphone 304 arranged at the bottom of the first screen, and a microphone 3022 arranged at the top of the terminal and a microphone 3042 arranged at the bottom of the terminal. And a screen-sound driver 306 is provided on the first screen. During a voice call, it is assumed that the user holds the bottom of the terminal to receive the call, and both the ear and mouth of the user are close to the bottom of the terminal and far from the top of the terminal. The audio processor acquires the voice signals of the target members collected by each microphone, distinguishes main signals and auxiliary signals according to the signal intensity of the voice signals, can judge whether the voice signals collected by the microphones arranged at the bottom of the terminal are main signals at the moment, judges whether the voice signals collected by the microphones arranged at the top of the terminal are auxiliary signals, further judges whether the auxiliary signals are noise signals, corrects the main signals according to the noise signals if the auxiliary signals are noise signals to obtain corrected target signals, and sends the target signals to the members of the other party participating in voice communication. And the screen sound-producing driver arranged on the first screen is used for generating driving force with direction under the driving of the audio signal transmitted by the other member, and the voice of the other member is played by driving the screen to vibrate.
In the above embodiment, the voice signals of the target member collected by the plurality of voice collecting assemblies arranged on different sides of the first screen are used, because the signal strength of the voice signals collected by the different voice collecting assemblies is greatly different, in order to avoid the influence of the gesture of the handheld terminal on the voice quality in the conversation process, the main signal and the plurality of auxiliary signals are judged according to the signal strength of the voice signals, and then the noise signals in the plurality of auxiliary signals are determined according to the difference value of the signal strength of each auxiliary signal and the main signal, so that the obtained one or more noise signals modify the main signal to obtain the target signal. The main signal is corrected through one or more noise signals obtained through screening, so that a target signal with higher voice call quality can be acquired under any hand-held terminal posture in the voice call process.
In one embodiment, the terminal further comprises a second screen and more than one voice acquisition assembly arranged on different sides of the second screen; the terminal also comprises a first voice playing component arranged on the first screen and a second voice playing component arranged on the second screen; acquiring the voice signal of the target member acquired by each voice acquisition component comprises the following steps: acquiring the magnetic field intensity between the first screen and the second screen; judging the use state of the terminal according to the magnetic field intensity; determining the first screen or the second screen as a main screen according to the use state;
specifically, set up permanent magnet and the switch sensor in first screen and second screen symmetry avris respectively, according to permanent magnet and switch sensor's distance generation magnetic field intensity, audio processor acquires the magnetic field intensity that produces in real time or at the time of predetermineeing, judges the user state that first screen and second screen were located according to this magnetic field intensity, and then confirms the main screen according to the user state.
In one embodiment, the terminal further comprises a permanent magnet and a switch sensor which are respectively arranged at the symmetrical sides of the first screen and the second screen; the method further comprises the following steps: the magnetic field strength is determined based on the distance between the permanent magnet and the switch sensor. When the permanent magnet and the switch sensor are close to each other, the magnetic field intensity tends to increase; when the permanent magnet and the switch sensor are far away, the magnetic field intensity tends to decrease.
Specifically, the permanent magnet and the switch sensor are respectively arranged at the symmetrical sides of the first screen and the second screen, wherein the switch sensor can be a Hall sensor. For example, a permanent magnet may be disposed on the side of the first screen, and a switch sensor may be disposed on the symmetrical side of the second screen; the switch sensor may be provided on the side of the first screen, and the permanent magnet may be provided on the side of the second screen. And determining the magnitude of the magnetic field intensity according to the distance between the permanent magnet and the switch sensor.
In one embodiment, the determining the use state of the terminal according to the magnetic field strength includes: when the magnetic field intensity is larger than a third threshold value, judging that the terminal is in a folded state; and when the magnetic field intensity is less than or equal to a third threshold value, judging that the terminal is in the unfolding state.
Specifically, the audio processor acquires the magnetic field intensity, and determines the states of the first screen and the second screen according to the magnetic field intensity and a third threshold value. When the magnetic field intensity is larger than a third threshold value, the first screen and the second screen are judged to be in a folded state; and when the magnetic field intensity is lower than a third threshold value, judging that the first screen and the second screen are in the unfolded state. The third threshold is used for judging the state of the terminal according to the magnetic field intensity.
The magnetic field intensity of the terminal is detected in real time through the switch sensor, the detected magnetic field intensity is transmitted to the audio processor, and the audio processor judges the states of the first screen and the second screen according to the magnetic field intensity and a third threshold value. When the permanent magnet is gradually close to the switch sensor, the magnetic field intensity detected by the switch sensor is gradually increased from 0; once the distance between the permanent magnet and the switch sensor reaches a minimum critical value, namely the permanent magnet and the switch sensor are mutually attached, the magnetic field intensity detected by the switch sensor reaches a maximum value at the moment. When the permanent magnet is gradually spaced away from the switching sensor, the magnetic field intensity detected by the switching sensor gradually decreases from the maximum value to 0.
The audio processor compares the received magnetic field intensity with a third threshold value, and when the magnetic field intensity is greater than the third threshold value, the first screen and the second screen are judged to be in folded states; conversely, when the magnetic field strength is lower than the third threshold value, the first screen and the second screen are judged to be in the unfolded state. Therefore, the states of the first screen and the second screen can be determined according to the above principle. The third threshold is typically chosen to be close to the maximum value of the magnetic field strength, for example Max, which may be 95% × Max.
In one embodiment, the switch sensor detects the magnetic field strength and converts the magnetic field strength into a voltage or current, which is transmitted to the audio processor. The audio processor determines the states of the first screen and the second screen according to the received voltage or current and a third threshold.
In one embodiment, the determining the first screen or the second screen as a main screen according to the usage status includes: when the use state of the terminal is a folding state, acquiring sensor data; determining the first screen or the second screen toward the target member as a main screen according to the sensor data.
Wherein the sensor data is used to determine a home screen for the target member during the voice call, i.e., the home screen is determined from the first screen and the second screen during the voice call. The home screen refers to a terminal screen facing the target member. The sensor data comprises data collected by a distance sensor, a light sensitive sensor or a posture sensor; wherein the distance sensor may be an infrared distance measuring sensor or an ultrasonic distance measuring sensor. The use state includes a folded state and an unfolded state.
Specifically, when the using state of the terminal is the folding state, the distance parameter between the terminal and the target member is monitored in real time through the distance sensor arranged on the terminal, the audio processor acquires the distance parameter collected by the sensor, and the first screen or the second screen facing the target member is determined to be the main screen according to the distance parameter. Further, after determining the main screen, the audio processor acquires the voice signal of the target member acquired by each voice acquisition component corresponding to the main screen.
In one embodiment, determining the first screen or the second screen as a main screen according to the usage status includes: and when the using state of the terminal is the unfolding state, determining at least one of the first screen and the second screen as a main screen.
In one embodiment, when the distance sensor is an ultrasonic ranging sensor, determining the first screen or the second screen facing the target member as a main screen according to the sensor data includes: the ultrasonic signal is sent through the sound playing component arranged on the terminal, when the ultrasonic signal reaches a target member or an obstacle, the reflected ultrasonic signal is received through the sound playing component of the terminal, the distance parameter between the terminal and the target member is judged according to the sent ultrasonic signal and the reflected ultrasonic signal, and the terminal screen facing the target member is determined to be a main screen according to the distance parameter.
In this embodiment, the terminal based on the voice call is a dual-screen terminal, the terminal includes two screens, and a plurality of voice acquisition components are arranged on different sides of each screen, and a sensor is arranged on the terminal for determining a main screen where the terminal is located in the voice call process. When the terminal is in fold condition, through the sensor data that the sensor that sets up on the terminal gathered, judge the main screen that the terminal faced the target member at voice conversation in-process, and then obtain the speech signal of the target member that the pronunciation collection subassembly that the main screen corresponds gathered. This embodiment is in under fold condition to double screen terminal, confirms main screen through sensor data, and then gathers the speech signal of target member through the pronunciation collection subassembly that main screen corresponds for also can be normal collection target member's pronunciation at the voice conversation in-process, no matter the target member is to arbitrary screen at terminal, can both realize high-quality voice conversation.
In one embodiment, the terminal further comprises a voice playing component arranged on the first screen and a voice playing component arranged on the second screen; the method further comprises the following steps: and playing the voice signal from the other party member based on the voice playing component on the main screen.
Specifically, in the voice call process, the voice of the other member is played through a first voice playing component arranged on a first screen and a second voice playing component arranged on a second screen. However, since the first screen and the second screen are in different states, the strategy for selecting the playing component for playing the voice of the counterpart member is also different. And determining the corresponding voice playing component to play the voice of the other member based on the screen state determined by the magnetic field intensity detected by the switch sensor. When the first screen and the second screen are in folded states, the voice of the other member is played through the voice playing component arranged on the main screen; when the first screen and the second screen are in the unfolded state, the voice of the other member is played through the voice playing components respectively arranged on the first screen and the second screen. The voice playing component is arranged on different screens of the double-screen terminal, so that the voice of the other member can be normally played no matter the terminal screen is in a folded state or an unfolded state, and high-quality voice call is realized in the voice call process.
In this embodiment, through the motion state of permanent magnet and switch sensor, the magnetic field intensity that judges switch sensor and detect can judge the user state at terminal according to magnetic field intensity, and when magnetic field intensity was greater than the third threshold value, first screen and second screen were in fold condition, otherwise, when magnetic field intensity was less than the third threshold value, first screen and second screen were in the state of expanding, simple and convenient realization the state judgement at double screen terminal.
In one embodiment, as shown in fig. 4, a schematic structural diagram of another voice call based terminal is provided. The terminal 30 further comprises a second screen 400, and voice collecting components are arranged at different positions of the terminal, such as a microphone 402 arranged at the top of the second screen and a microphone 404 arranged at the bottom of the second screen, or a microphone 4022 arranged at the top of the terminal and a microphone 4042 arranged at the bottom of the terminal; and a screen-sound driver 406 is provided on the second screen. The terminal further includes permanent magnets 408 disposed on the left side of the first screen and switch sensors 410 disposed on the right side of the second screen, respectively. The terminal also includes a distance sensor 412, which may be located anywhere on the terminal.
In the voice call process, the terminal is provided with two screens, and aiming at the double-screen terminal, the state of the first screen and the second screen is judged to be a folded state or an unfolded state, namely, the magnetic field intensity detected by the switch sensor is obtained through the motion states of the permanent magnet and the switch sensor, when the magnetic field intensity is greater than a third threshold value, the first screen and the second screen are judged to be in the folded state, otherwise, when the magnetic field intensity is lower than the third threshold value, the first screen and the second screen are judged to be in the unfolded state.
After the state of the terminal screen is determined, detecting a main screen facing a target member in the voice call process through a distance sensor; and then gather the voice signal of target member through the pronunciation collection component that the main screen corresponds, distinguish main signal and supplementary signal according to voice signal's signal strength, further judge whether supplementary signal is the noise signal, if, revise main signal according to the noise signal, obtain the target signal after the revision, audio processor sends the target signal to participating in the other side member of voice conversation. And further, determining a voice playing component for playing the voice of the other member according to the states of the first screen and the second screen, and playing the voice according to the determined voice playing component.
In the embodiment, the terminal comprises two screens, a plurality of voice acquisition components are arranged on different sides of each screen, and a distance sensor is arranged on the terminal and used for judging a main screen facing a target member in the voice call process. In the voice call process of the double-screen terminal, the states of the first screen and the second screen are judged according to the magnetic field intensity detected by the switch sensor, and further, the voice of the other member is played by adopting the corresponding voice playing component according to different screen states. Meanwhile, after the state of the screen is determined, the main screen facing the target member is further judged according to the distance sensor, the voice signal of the target member is collected through the voice collecting assembly corresponding to the main screen, no specific requirement is required for the posture of the target member for holding the terminal in the voice call process, the double-screen terminal can be held at will, high-quality voice call can be achieved, the voice quality of the played voice signal is not affected by the posture of the holding terminal, and meanwhile, the voice quality of the collected voice signal is not affected by the posture of the holding terminal.
It should be understood that although the various steps in the flow charts of fig. 2-4 are shown in order as indicated by the arrows, the steps are not necessarily performed in order as indicated by the arrows. The steps are not performed in the exact order shown and described, and may be performed in other orders, unless explicitly stated otherwise. Moreover, at least some of the steps in fig. 2-4 may include multiple sub-steps or multiple stages that are not necessarily performed at the same time, but may be performed at different times, and the order of performance of the sub-steps or stages is not necessarily sequential, but may be performed in turn or alternating with other steps or at least some of the sub-steps or stages of other steps.
In one embodiment, as shown in fig. 5, there is provided a terminal-based voice call apparatus 500, the terminal including a first screen and more than one voice collecting component disposed at different sides of the first screen; the device comprises: a signal acquisition module 502, a signal modification module 504, and a signal transmission module 506, wherein:
a signal obtaining module 502, configured to obtain the voice signal of the target member collected by each voice collecting component.
A signal modification module 504, configured to perform module, configured to distinguish the multiple voice signals into a main signal and an auxiliary signal according to the signal strength of the voice signals; determining an auxiliary signal having a signal intensity difference with the main signal greater than a first threshold as a noise signal; and correcting the main signal according to the noise signal to obtain a target signal.
And a signal sending module 506, configured to send the target signal to the other party participating in the voice call.
In one embodiment, the signal modification module is further configured to determine a signal strength difference of each secondary signal with respect to the primary signal at each time point in the target period; calculating the similarity between the corresponding auxiliary signal and the main signal according to the time point and the preset number of the signal intensity differences larger than the first threshold; and determining the auxiliary signal with the similarity smaller than a second threshold value as a noise signal.
In one embodiment, the signal modification module is further configured to determine a modification factor according to signal strengths of all noise signals; and adjusting the signal intensity of the main signal based on the correction factor to obtain the target signal.
In one embodiment, the terminal further comprises a second screen and more than one voice acquisition assembly arranged on different sides of the second screen; the terminal also comprises a first voice playing component arranged on the first screen and a second voice playing component arranged on the second screen; the signal acquisition module is further configured to: acquiring sensor data; determining a main screen from the first screen and the second screen according to the sensor data; the voice signal of the target member collected by each voice collection component set on the main screen is acquired.
In one embodiment, the device further comprises a screen state judgment module for acquiring the magnetic field intensity; when the magnetic field intensity is larger than a third threshold value, the first screen and the second screen are judged to be in a folded state; and when the magnetic field intensity is lower than a third threshold value, judging that the first screen and the second screen are in the unfolded state.
In one embodiment, the terminal further comprises a permanent magnet and a switch sensor which are respectively arranged at the symmetrical sides of the first screen and the second screen; the device also comprises a magnetic field intensity analysis module, a switch sensor and a control module, wherein the magnetic field intensity is in an increasing trend when the permanent magnet and the switch sensor are close to each other; when the permanent magnet and the switch sensor are far away, the magnetic field intensity tends to decrease.
In one embodiment, the terminal further comprises a first voice playing component arranged on the first screen and a second voice playing component arranged on the second screen; the device also comprises a voice playing module, a voice playing module and a voice playing module, wherein the voice playing module is used for playing the voice of the other member through the voice playing component arranged on the main screen when the first screen and the second screen are in the folded state; when the first screen and the second screen are in the unfolded state, the voice of the other member is played through the voice playing components respectively arranged on the first screen and the second screen.
In this embodiment, through the voice signal of the target member that a plurality of voice acquisition components that set up on different avris of first screen gathered, because there is great difference in the signal strength of the voice signal that different voice acquisition components gathered, in order to avoid the voice quality in the conversation process to receive the influence of handheld terminal posture, this scheme judges a main signal and a plurality of supplementary signal according to the signal strength of voice signal, and then confirms the noise signal in a plurality of supplementary signals according to the difference of each supplementary signal and the signal strength of main signal, revise the main signal with one or more noise signal that will obtain with this, obtain the target signal. The main signal is corrected through one or more noise signals obtained through screening, so that a target signal with higher voice call quality can be acquired under any hand-held terminal posture in the voice call process.
For specific limitations of the terminal-based voice call apparatus, reference may be made to the above limitations of the terminal-based voice call method, which will not be described herein again. Each module in the terminal-based voice call apparatus may be wholly or partially implemented by software, hardware, or a combination thereof. The modules can be embedded in a hardware form or independent from a processor in the computer device, and can also be stored in a memory in the computer device in a software form, so that the processor can call and execute operations corresponding to the modules.
In one embodiment, a computer device is provided, which may be a terminal, and its internal structure diagram may be as shown in fig. 6. The computer device includes a processor, a memory, a network interface, a display screen, and an input device connected by a system bus. Wherein the processor of the computer device is configured to provide computing and control capabilities. The memory of the computer device comprises a nonvolatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and a computer program. The internal memory provides an environment for the operation of an operating system and computer programs in the non-volatile storage medium. The network interface of the computer device is used for communicating with an external terminal through a network connection. The computer program is executed by a processor to implement a terminal-based voice call method. The display screen of the computer equipment can be a liquid crystal display screen or an electronic ink display screen, and the input device of the computer equipment can be a touch layer covered on the display screen, a key, a track ball or a touch pad arranged on the shell of the computer equipment, an external keyboard, a touch pad or a mouse and the like.
Those skilled in the art will appreciate that the architecture shown in fig. 6 is merely a block diagram of some of the structures associated with the disclosed aspects and is not intended to limit the computing devices to which the disclosed aspects apply, as particular computing devices may include more or less components than those shown, or may combine certain components, or have a different arrangement of components.
A computer device comprising a memory and a processor, the memory storing a computer program, wherein the processor when executing the computer program implements the steps of the terminal-based voice call method provided in any one of the embodiments of the present application.
A computer-readable storage medium having stored thereon a computer program which, when executed by a processor, performs the steps of the terminal-based voice call method provided in any one of the embodiments of the present application.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by hardware related to instructions of a computer program, which can be stored in a non-volatile computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. Any reference to memory, storage, database, or other medium used in the embodiments provided herein may include non-volatile and/or volatile memory, among others. Non-volatile memory can include read-only memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Data Rate SDRAM (DDRSDRAM), Enhanced SDRAM (ESDRAM), Synchronous Link DRAM (SLDRAM), Rambus Direct RAM (RDRAM), direct bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).
The technical features of the above embodiments can be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the above embodiments are not described, but should be considered as the scope of the present specification as long as there is no contradiction between the combinations of the technical features.
The above examples only express several embodiments of the present application, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the concept of the present application, which falls within the scope of protection of the present application. Therefore, the protection scope of the present patent shall be subject to the appended claims.

Claims (13)

1. A voice call method based on a terminal is provided, wherein the terminal comprises a first screen and more than one voice acquisition assembly arranged on different sides of the first screen; the method comprises the following steps:
acquiring a voice signal of a target member acquired by each voice acquisition component;
distinguishing a plurality of voice signals into a main signal and an auxiliary signal according to the signal intensity of the voice signals;
determining an auxiliary signal having a signal intensity difference with the main signal greater than a first threshold as a noise signal;
correcting the main signal according to the noise signal to obtain a target signal;
and sending the target signal to the other party participating in the voice call.
2. The method of claim 1, wherein the speech signal is acquired during a target time period; the target time period comprises a preset number of time points; determining the auxiliary signal having a signal strength difference greater than a first threshold from the main signal as a noise signal includes:
determining a signal strength difference of each secondary signal with respect to the primary signal at each time point within the target period;
calculating the similarity between the corresponding auxiliary signal and the main signal according to the data of the time points with the signal intensity difference larger than a first threshold and the preset number;
and determining the auxiliary signal with the similarity smaller than a second threshold value as a noise signal.
3. The method of claim 1, wherein said modifying the main signal based on the noise signal to obtain a target signal comprises:
determining a correction factor according to the signal intensity of all the noise signals;
and adjusting the signal intensity of the main signal based on the correction factor to obtain a target signal.
4. The method of claim 1, wherein determining a correction factor based on the signal strengths of all of the noise signals comprises:
calculating an average intensity of signal intensities of all the noise signals;
determining the average intensity as a correction factor.
5. The method according to claim 1, wherein the terminal further comprises a second screen and more than one voice capture component disposed on different sides of the second screen; the acquiring the voice signal of the target member acquired by each voice acquisition component comprises:
acquiring the magnetic field intensity between the first screen and the second screen;
judging the use state of the terminal according to the magnetic field intensity;
determining the first screen or the second screen as a main screen according to the use state;
and acquiring the voice signal of the target member acquired by each voice acquisition component on the main screen.
6. The method according to claim 5, wherein the determining the use state of the terminal according to the magnetic field strength comprises:
when the magnetic field intensity is larger than a third threshold value, determining that the terminal is in a folded state;
and when the magnetic field intensity is smaller than or equal to a third threshold value, determining that the terminal is in an unfolding state.
7. The method of claim 5, wherein the determining the first screen or the second screen as a home screen according to the usage status comprises:
when the use state of the terminal is a folding state, acquiring sensor data;
determining the first screen or the second screen toward the target member as a main screen according to the sensor data.
8. The method of claim 5, wherein the determining the first screen or the second screen as a home screen according to the usage status comprises:
and when the using state of the terminal is the unfolding state, determining at least one of the first screen and the second screen as a main screen.
9. The method according to claim 5, wherein the terminal further comprises a voice playing component disposed on the first screen and a voice playing component disposed on the second screen; the method further comprises the following steps:
and playing the voice signal from the other party member based on the voice playing component on the main screen.
10. A voice call device based on a terminal is characterized in that the terminal comprises a first screen and more than one voice acquisition assembly arranged on different sides of the first screen; the device comprises:
the signal acquisition module is used for acquiring the voice signal of the target member acquired by each voice acquisition component;
the signal correction module is used for distinguishing a plurality of voice signals into a main signal and an auxiliary signal according to the signal intensity of the voice signals; determining an auxiliary signal having a signal intensity difference with the main signal greater than a first threshold as a noise signal; correcting the main signal according to the noise signal to obtain a target signal;
and the signal sending module is used for sending the target signal to the other party participating in the voice call.
11. The device of claim 10, wherein the terminal further comprises a second screen and more than one voice capture component disposed on different sides of the second screen; the signal acquisition module is further configured to: acquiring the magnetic field intensity between the first screen and the second screen; judging the use state of the terminal according to the magnetic field intensity; determining the first screen or the second screen as a main screen according to the use state; and acquiring the voice signal of the target member acquired by each voice acquisition component on the main screen.
12. A computer device comprising a memory and a processor, the memory storing a computer program, wherein the processor implements the steps of the method of any one of claims 1 to 9 when executing the computer program.
13. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the method of any one of claims 1 to 9.
CN201911064300.5A 2019-11-04 2019-11-04 Voice call method and device based on terminal, computer equipment and storage medium Active CN112769979B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911064300.5A CN112769979B (en) 2019-11-04 2019-11-04 Voice call method and device based on terminal, computer equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911064300.5A CN112769979B (en) 2019-11-04 2019-11-04 Voice call method and device based on terminal, computer equipment and storage medium

Publications (2)

Publication Number Publication Date
CN112769979A true CN112769979A (en) 2021-05-07
CN112769979B CN112769979B (en) 2023-05-05

Family

ID=75692267

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911064300.5A Active CN112769979B (en) 2019-11-04 2019-11-04 Voice call method and device based on terminal, computer equipment and storage medium

Country Status (1)

Country Link
CN (1) CN112769979B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104092802A (en) * 2014-05-27 2014-10-08 中兴通讯股份有限公司 Method and system for de-noising audio signal
CN107393548A (en) * 2017-07-05 2017-11-24 青岛海信电器股份有限公司 The processing method and processing device of the voice messaging of multiple voice assistant equipment collections
CN107742523A (en) * 2017-11-16 2018-02-27 广东欧珀移动通信有限公司 Audio signal processing method, device and mobile terminal
CN108737615A (en) * 2018-06-27 2018-11-02 努比亚技术有限公司 microphone reception method, mobile terminal and computer readable storage medium
CN109739465A (en) * 2018-12-21 2019-05-10 维沃移动通信有限公司 Audio-frequency inputting method and mobile terminal

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104092802A (en) * 2014-05-27 2014-10-08 中兴通讯股份有限公司 Method and system for de-noising audio signal
CN107393548A (en) * 2017-07-05 2017-11-24 青岛海信电器股份有限公司 The processing method and processing device of the voice messaging of multiple voice assistant equipment collections
CN107742523A (en) * 2017-11-16 2018-02-27 广东欧珀移动通信有限公司 Audio signal processing method, device and mobile terminal
CN108737615A (en) * 2018-06-27 2018-11-02 努比亚技术有限公司 microphone reception method, mobile terminal and computer readable storage medium
CN109739465A (en) * 2018-12-21 2019-05-10 维沃移动通信有限公司 Audio-frequency inputting method and mobile terminal

Also Published As

Publication number Publication date
CN112769979B (en) 2023-05-05

Similar Documents

Publication Publication Date Title
CN107742523B (en) Voice signal processing method and device and mobile terminal
CN107817939B (en) Image processing method and mobile terminal
CN108664190B (en) Page display method, device, mobile terminal and storage medium
CN109982228B (en) Microphone fault detection method and mobile terminal
CN107369459B (en) A kind of audio-frequency processing method and mobile terminal
CN111554321B (en) Noise reduction model training method and device, electronic equipment and storage medium
CN109739465B (en) Audio output method and mobile terminal
CN108494030B (en) Wireless charging method, terminal and transmitting terminal equipment
CN106101929A (en) A kind of protection human ear audition method and mobile terminal
CN110730260B (en) Control method of electronic equipment and electronic equipment
CN102662554B (en) Information processing apparatus and code input mode switching method thereof
CN104424073A (en) Information processing method and electronic equipment
CN108448673B (en) Charging method, mobile terminal and charger
CN111124108B (en) Model training method, gesture control method, device, medium and electronic equipment
CN108196815B (en) Method for adjusting call sound and mobile terminal
CN108665889B (en) Voice signal endpoint detection method, device, equipment and storage medium
CN110830644B (en) Display screen control method and device, electronic equipment and storage medium
CN110390953B (en) Method, device, terminal and storage medium for detecting howling voice signal
CN111477243B (en) Audio signal processing method and electronic equipment
KR20200063095A (en) Multi-beam selection method and apparatus
CN108881721B (en) Display method and terminal
CN106331509A (en) Photographing method and mobile terminal
CN108769299A (en) Screen control method, device and mobile terminal
CN104898839A (en) Terminal device and control method thereof
CN108712563A (en) call control method, device and mobile terminal

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant