CN110010162A

CN110010162A - A kind of song recordings method repairs sound method and electronic equipment

Info

Publication number: CN110010162A
Application number: CN201910152655.3A
Authority: CN
Inventors: 陶凯; 尹明婕; 李延奇; 徐德著; 戴恒杰; 缪海波
Original assignee: Huawei Technologies Co Ltd
Current assignee: Huawei Technologies Co Ltd
Priority date: 2019-02-28
Filing date: 2019-02-28
Publication date: 2019-07-12
Also published as: EP3905246A1; EP3905246A4; US20220130360A1; JP2022522456A; JP7222112B2; WO2020173391A1

Abstract

Embodiments herein provides a kind of song recordings method, repairs sound method and electronic equipment, is related to Audiotechnica field, can add corresponding singing skills in the song that user records, to improve the recording result and usage experience when user is applied using K song class.This method comprises: electronic equipment shows the first interface, it include the recording button for recording the first song in the first interface；The operation for recording button is selected in response to user, electronic equipment obtains the accompaniment of first song and the characteristic information of the dry sound of original singer, and the characteristic information of the dry sound of original singer is for reflecting singing characteristics of the original singer when singing first song；Electronic equipment starts to play the accompaniment of first song and starts to record the dry sound of user that user sings；During recording user's dry sound, electronic equipment shows tutorial message according to the characteristic information of the dry sound of the original singer in second contact surface, and the tutorial message is used for one or more in ventilation when user being instructed to sing or trill.

Description

Song recording method, sound repairing method and electronic equipment

Technical Field

The present application relates to the field of audio technologies, and in particular, to a song recording method, a sound modification method, and an electronic device.

Background

For users who like singing, the K-song application has become a popular application installed on electronic devices such as mobile phones and tablet computers. Illustratively, a user can select a song to be recorded after opening the K song application, and then the K song application can load the accompaniment of the song, and call a microphone and other pickup components on a mobile phone to record the audio of the song sung by the user, so that the singing voice of the user is blended into the accompaniment provided by the K song application.

Currently, some karaoke applications provide a sound modifying function for songs recorded by a user in order to improve the singing experience of the user using the karaoke applications. For example, the karaoke application may compare the song a recorded by the user with the original singing audio of the song a, and modify the intonation and rhythm of the singing voice recorded by the user to the intonation and rhythm of the original singing audio, thereby modifying the intonation and rhythm problems that occur when the user records songs.

However, in addition to the problems of accuracy and rhythm, the ordinary users generally do not have professional vocal training, so that the users with insufficient singing skills still cannot record songs well.

Disclosure of Invention

The application provides a song recording method, a sound modifying method and electronic equipment, which can guide a user to use corresponding singing skills when the user records a song, so that the recording effect and the use experience when the user uses Karaoke applications are improved.

In order to achieve the purpose, the technical scheme is as follows:

in a first aspect, the present application provides a song recording method, including: the method comprises the steps that the electronic equipment displays a first interface, wherein the first interface comprises a recording button for recording a first song; if the fact that the user selects the recording button is detected, the electronic equipment can obtain the accompaniment of the first song and the feature information of the original singing dry sound, and the feature information of the original singing dry sound can be used for reflecting the singing feature of the original singing when the first song is singed; furthermore, the electronic equipment can start playing the accompaniment of the first song and start recording the user voice of singing of the user; in the process of recording the dry voice of the user, the electronic equipment can display guide information in the second interface according to the characteristic information of the original dry voice, wherein the guide information is used for guiding one or more items of ventilation or vibrato when the user sings. That is to say, when the user records a song, the electronic device may display the guidance information according to the singing characteristics of the original singing, and guide the user to perform one or more singing techniques such as ventilation or vibrato during singing, so as to improve the recording effect and the use experience of the user for recording the song.

In a possible implementation manner, the feature information of the original vocal stem includes a ventilation position of the original vocal stem; at this time, the guidance information may include a ventilation prompt; then, the electronic device displays guidance information in the second interface according to the feature information of the original vocal stem, including: the electronic equipment can display the ventilation prompt corresponding to the ventilation position of the original singing stem sound in the second interface according to the ventilation position of the original singing stem sound, so that a user is helped to control the change of the breath when singing, more obvious and disordered ventilation sound in the recorded song is reduced, and the listening quality of the recorded song is improved.

Illustratively, the second interface may include lyrics of the first song; at this time, the electronic device displays a ventilation prompt corresponding to the ventilation position of the original vocal stem in a second interface according to the ventilation position of the original vocal stem, and the method includes: the electronic device displays the ventilation prompt at a first position of the lyrics, wherein the first position is the same as the ventilation position of the original singing dry sound.

Or, the second interface may include a singer, where the singer is configured to display a note of each word in the lyrics being recorded; at this time, the electronic device displays a ventilation prompt corresponding to the ventilation position of the original vocal stem in a second interface according to the ventilation position of the original vocal stem, and the method includes: the electronic device displays the ventilation prompt at a second position of the singing machine, and the second position is the same as the ventilation position of the original singing dry sound.

In a possible implementation manner, the feature information of the original vocal stem includes a vibrato feature of the original vocal stem; at the moment, the corresponding guide information comprises a trill prompt; then, the electronic device displays guidance information in the second interface according to the feature information of the original vocal stem, including: and the electronic equipment displays a trill prompt corresponding to the trill characteristic of the original singing stem sound in the second interface according to the trill characteristic of the original singing stem sound. Therefore, when the user records the song, the user can sing by using the vibrato according to the vibrato position prompted in the lyric, so that the hierarchy sense and the singing skill of the song when the user sings are improved, and the listening quality of the recorded song is improved.

Illustratively, the vibrato feature includes a vibrato position; the second interface comprises lyrics of the first song; at this time, the electronic device displays a trill prompt corresponding to the trill feature of the original vocal stem in the second interface according to the trill feature of the original vocal stem, and the method comprises the following steps: the electronic equipment displays a trill prompt at a third position of the lyrics, wherein the third position is the same as the trill position in the original singing dry sound.

Or, the trill feature comprises a trill position; the second interface comprises a singer, and the singer is used for displaying the musical notes of each word in the lyrics being recorded; at this time, the electronic device displays a trill prompt corresponding to the trill feature of the original vocal stem in the second interface according to the trill feature of the original vocal stem, and the method comprises the following steps: the electronic equipment displays a trill prompt at the fourth position of the singing device, and the fourth position is the same as the trill position in the original singing stem.

For example, the characteristic information of the original vocal stem sound may further include fundamental frequency information of the original vocal stem sound; the electronic equipment can generate intonation prompts according to the fundamental frequency information of the original vocal stem and display the intonation prompts in the second interface to guide the intonation of the user during singing.

In a possible implementation manner, during recording of the user voice, the method further includes: the electronic equipment extracts the characteristic information of the user stem voice from the user stem voice, and the characteristic information of the user stem voice is used for reflecting the singing characteristic of the user when singing the first song; and the electronic equipment displays the evaluation result in the second interface by comparing the characteristic information in the user dry sound with the characteristic information of the original singing dry sound.

It can be seen that, in the process of recording a song by a user, the electronic device can perform real-time evaluation on one or more singing skills such as intonation, vibrato, ventilation and the like according to the characteristic information of the original singing stem, and compared with the evaluation of the singing level of the user solely according to the intonation, the evaluation method and the electronic device further consider the factors such as ventilation and vibrato of the user when evaluating the singing of the user, so that the obtained evaluation result is more accurate, rich and objective. And the electronic equipment can feed back the evaluation result of singing of the user to the user in real time in the second interface, so that the user can know the current singing effect in time and can adjust the singing mode in time in the subsequent recording process.

In one possible implementation manner, the characteristic information in the user stem includes a ventilation position of the user stem, and the characteristic information of the original vocal stem includes a ventilation position of the original vocal stem; the electronic device displays the evaluation result in the second interface by comparing the feature information in the user stem voice with the feature information of the original vocal stem voice, and the evaluation result comprises the following steps: the electronic device displays the evaluation result of the ventilation skill in the second interface by comparing the ventilation position of the dry sound of the user with the ventilation position of the original dry sound.

Illustratively, the electronic device displays the result of the evaluation of the ventilatory skill in the second interface by comparing the ventilatory position of the vocal stem of the user with the ventilatory position of the vocal stem of the original, including: if the ventilation position of the dry sound of the user is the same as the ventilation position of the original dry sound, the electronic equipment displays a prompt of accurate ventilation in a second interface; and if the ventilation position of the dry sound of the user is different from the ventilation position of the original dry sound, the electronic equipment displays a ventilation error prompt in a second interface.

In a possible implementation manner, the feature information in the user stem includes a vibrato feature of the user stem, and the feature information of the original vocal stem includes a vibrato feature of the original vocal stem; the electronic device displays the evaluation result in the second interface by comparing the feature information in the user stem voice with the feature information of the original vocal stem voice, and the evaluation result comprises the following steps: the electronic device displays the assessment result of the vibrato skill in the second interface by comparing the vibrato characteristic of the user's stem and the vibrato characteristic of the original vocal stem.

Illustratively, the electronic device displays the assessment result of the vibrato skill in the second interface by comparing the vibrato characteristic of the user's stem sound and the vibrato characteristic of the original vocal stem sound, including: if the trill characteristics of the dry sound of the user are matched with the trill characteristics of the original dry sound, the electronic equipment displays accurate trill prompts in a second interface; and if the trill characteristics of the dry sound of the user are not matched with the trill characteristics of the original dry sound, the electronic equipment displays a trill error prompt in the second interface.

In one possible implementation manner, the electronic device displays the evaluation result in the second interface, and the method includes: the electronic equipment can display the evaluation result in the lyrics of the second interface; alternatively, the electronic device may also display the evaluation result in the singer of the second interface.

In a possible implementation manner, after the electronic device displays the guidance information in the second interface according to the feature information of the original vocal stem, the method further includes: after the first song is recorded, the electronic equipment corrects singing problems appearing in the user dry sound according to the characteristic information of the original dry sound, wherein the singing problems comprise one or more of ventilation problems or vibrato problems. Therefore, a common user who does not have professional vocal music training can record the singing with better listening feeling quality, and therefore the recording effect and the use experience of the user in the process of recording the songs are improved.

In a second aspect, the present application provides a method for modifying sound, including: the method comprises the steps that the electronic equipment displays a first interface, wherein the first interface comprises a recording button for recording a first song; if the fact that the user selects the recording button is detected, the electronic equipment can obtain the accompaniment of the first song and the characteristic information of the original singing dry sound, and the characteristic information of the original singing dry sound is used for reflecting the singing characteristics of the original singing when the first song is singed; furthermore, the electronic equipment can start playing the accompaniment of the first song and start recording the user voice of singing of the user; after the first song is recorded, the electronic equipment can correct singing problems appearing in the recorded user dry sound according to the characteristic information of the original dry sound, wherein the singing problems comprise one or more of ventilation problems or vibrato problems. That is, the electronic device may correct singing skills problems such as ventilation, tremolo, etc. in the user's recorded song. Therefore, a common user who does not have professional vocal music training can record the singing with better listening feeling quality, and therefore the recording effect and the use experience of the user for recording the songs are improved.

In one possible implementation, the singing question is a ventilation question; the electronic equipment corrects the singing problem appearing in the user stem according to the characteristic information of the original singing stem, and the method comprises the following steps: the electronic equipment extracts ventilation sound in the user dry sound from the user dry sound; the electronic equipment reduces the energy and frequency response of ventilation sound in the dry sound of the user, thereby weakening the ventilation problem in the dry sound of the user and improving the listening quality of the recorded song.

In a possible implementation manner, the singing problem is a vibrato problem, wherein the electronic device corrects the singing problem occurring in the user stem according to the feature information of the original singing stem, and the singing problem includes: the electronic equipment extracts the trill characteristics of the user dry sound from the user dry sound; if the trill characteristics of the dry sound of the user do not match with the trill characteristics in the characteristic information of the original dry sound, for example, one of the trill position, the tremor amplitude, the tremor frequency or the stationary trend item in the dry sound of the user does not match with the original dry sound, the electronic device can correct the trill characteristics in the dry sound of the user by using the trill characteristics in the original dry sound so as to correct the trill problem occurring in the dry sound of the user. For example, the electronic device may synthesize new vibrato in the user's stem using the vibration amplitude, vibration frequency and stationary trend items of the corresponding vibrato in the original vocal stem, thereby improving the vibrato effect appearing in the user's stem.

Of course, the singing problem may also be a intonation problem, and the electronic device may perform a tone-down process on a place with too high tone in the user's stem voice, and perform a tone-up process on a place with too low tone in the user's stem voice, so that the tone of the user's stem voice is the same as or close to the tone of the original singing stem voice.

In a possible implementation manner, after the recording of the first song is finished, the method further includes: the electronic equipment displays a second interface, wherein the second interface comprises a sound repairing switch; the electronic equipment corrects the singing problem appearing in the user stem according to the characteristic information of the original singing stem, and the method comprises the following steps: if the tone modification switch is detected to be turned on by the user, the electronic equipment can modify the singing problem appearing in the voice of the user according to the feature information of the original singing voice.

In a possible implementation manner, after the recording of the first song is finished, the method further includes: the electronic equipment displays a second interface, wherein the second interface comprises a sound modification option, and the sound modification option comprises an option for modifying tone, an option for modifying ventilation sound and an option for modifying vibrato; the electronic equipment corrects the singing problem appearing in the user stem according to the characteristic information of the original singing stem, and the method comprises the following steps: and responding to the voice modification option selected by the user, and modifying the singing problem corresponding to the voice modification option in the user voice by the electronic equipment according to the characteristic information of the original singing voice.

In a possible implementation manner, after the recording of the first song is finished, the method further includes: the electronic equipment displays a second interface, wherein the second interface comprises a single sentence finishing option; in response to the user selecting the single sentence refinement option, the electronic device may display a third interface including N (N > 1) sentences of lyrics in the first song; at this time, the electronic device corrects the singing problem appearing in the user vocal according to the feature information of the original vocal stem, and the method specifically comprises the following steps: if the user selects the first lyric in the N lyrics, the electronic equipment can correct the singing problem in the user's dry voice corresponding to the first lyric according to the characteristic information of the first lyric sung in the original singing dry voice. Therefore, the user can correct the singing problem in the recorded voice stem of the user sentence by sentence, and the song recorded by the user is corrected to have a more satisfactory singing effect.

In addition, when the electronic equipment displays the third interface, the position of the singing problem can be marked in the N words of the lyrics of the third interface. Thus, the user can intuitively see the specific problems of singing each lyric.

In a third aspect, the present application provides an electronic device, comprising: a touchscreen, one or more processors, one or more memories, one or more microphones, and one or more computer programs; wherein the processor is coupled to the touch screen, the memory and the microphone, the one or more computer programs are stored in the memory, and when the electronic device is operated, the processor executes the one or more computer programs stored in the memory, so that the electronic device executes the song recording method or the sound modification method.

In a fourth aspect, the present application provides a computer storage medium comprising computer instructions which, when run on an electronic device, cause the electronic device to perform the song recording method or the sound modifying method according to any one of the first aspect.

In a fifth aspect, the present application provides a computer program product, which when run on an electronic device, causes the electronic device to perform the song recording method or the sound modification method according to any one of the first aspect.

In a sixth aspect, the present application provides a Graphical User Interface (GUI) stored in an electronic device, the electronic device comprising a touch screen, a memory, and a processor configured to execute one or more computer programs stored in the memory, the GUI comprising: the method comprises the steps that a first GUI displayed on a touch screen comprises a recording button used for recording a first song; and displaying a second GUI on the touch screen in response to the touch event aiming at the recording button, wherein the second GUI is a recording interface for recording the first song, and the second GUI comprises guidance information for guiding the user to sing the first song, and the guidance information can comprise a ventilation prompt and/or a vibrato prompt.

For example, the second GUI may further include a result of evaluating a ventilatory and/or vibrato skill when the user sings.

After the second GUI is displayed on the touch screen, the method further includes: when the first song is recorded, displaying a third GUI on the touch screen, wherein the third GUI is a sound repairing interface of the first song and comprises a sound repairing switch; when the sound correcting switch is turned on by a user, the electronic device can correct ventilation problems and/or vibrato problems occurring in the dry sound of the user.

It is to be understood that the electronic device of the third aspect, the computer storage medium of the fourth aspect, the computer program product of the fifth aspect, and the GUI of the sixth aspect are all configured to perform the corresponding methods provided above, and therefore, the beneficial effects achieved by the electronic device of the third aspect, the computer storage medium of the fourth aspect, the computer program product of the fifth aspect, and the GUI of the sixth aspect may refer to the beneficial effects of the corresponding methods provided above, and are not repeated herein.

Drawings

Fig. 1 is a first schematic structural diagram of an electronic device according to an embodiment of the present disclosure;

fig. 2 is a schematic diagram illustrating an architecture of an operating system in an electronic device according to an embodiment of the present disclosure;

fig. 3 is a schematic flowchart of a song recording method and a sound modification method according to an embodiment of the present application;

fig. 4 is a first scene schematic diagram of a song recording method according to an embodiment of the present application;

fig. 5 is a second scenario diagram of a song recording method according to an embodiment of the present application;

fig. 6 is a third schematic view of a scene of a song recording method according to an embodiment of the present application;

fig. 7 is a fourth scene schematic diagram of a song recording method according to an embodiment of the present application;

fig. 8 is a scene schematic diagram five of a song recording method according to an embodiment of the present application;

fig. 9 is a sixth schematic view of a scene of a song recording method according to an embodiment of the present application;

fig. 10 is a seventh schematic view of a scene of a song recording method according to an embodiment of the present application;

fig. 11 is a scene schematic diagram eight of a song recording method according to an embodiment of the present application;

fig. 12 is a scene schematic diagram nine of a song recording method according to an embodiment of the present application;

fig. 13 is a first scenario diagram of a sound modifying method according to an embodiment of the present application;

fig. 14 is a second scenario diagram of a sound modifying method according to an embodiment of the present application;

fig. 15 is a third scenario schematic diagram of a sound modifying method according to an embodiment of the present application;

fig. 16 is a fourth scene schematic diagram of a sound modifying method according to an embodiment of the present application;

fig. 17 is a schematic structural diagram of an electronic device according to an embodiment of the present application.

Detailed Description

Embodiments of the present embodiment will be described in detail below with reference to the accompanying drawings.

For example, the song recording method and the sound modification method provided in the embodiments of the present application may be applied to electronic devices such as a mobile phone, a tablet computer, a notebook computer, an ultra-mobile personal computer (UMPC), a handheld computer, a netbook, a Personal Digital Assistant (PDA), a wearable electronic device, and a virtual reality device, and the embodiments of the present application do not limit the present application in any way.

Fig. 1 shows a schematic structural diagram of an electronic device 100.

The electronic device 100 may include a processor 110, an external memory interface 120, an internal memory 121, a Universal Serial Bus (USB) interface 130, a charging management module 140, a power management module 141, a battery 142, an antenna 1, an antenna 2, a mobile communication module 150, a wireless communication module 160, an audio module 170, a speaker 170A, a receiver 170B, a microphone 170C, an earphone interface 170D, a sensor module 180, a key 190, a motor 191, an indicator 192, a camera 193, a display screen 194, a Subscriber Identification Module (SIM) card interface 195, and the like. The sensor module 180 may include a pressure sensor 180A, a gyroscope sensor 180B, an air pressure sensor 180C, a magnetic sensor 180D, an acceleration sensor 180E, a distance sensor 180F, a proximity light sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, an ambient light sensor 180L, a bone conduction sensor 180M, and the like.

It is to be understood that the illustrated structure of the embodiment of the present invention does not specifically limit the electronic device 100. In other embodiments of the present application, electronic device 100 may include more or fewer components than shown, or some components may be combined, some components may be split, or a different arrangement of components. The illustrated components may be implemented in hardware, software, or a combination of software and hardware.

Processor 110 may include one or more processing units, such as: the processor 110 may include an Application Processor (AP), a modem processor, a Graphics Processor (GPU), an Image Signal Processor (ISP), a controller, a video codec, a Digital Signal Processor (DSP), a baseband processor, and/or a neural-Network Processing Unit (NPU), etc. The different processing units may be separate devices or may be integrated into one or more processors.

The controller can generate an operation control signal according to the instruction operation code and the timing signal to complete the control of instruction fetching and instruction execution.

A memory may also be provided in processor 110 for storing instructions and data. In some embodiments, the memory in the processor 110 is a cache memory. The memory may hold instructions or data that have just been used or recycled by the processor 110. If the processor 110 needs to reuse the instruction or data, it can be called directly from the memory. Avoiding repeated accesses reduces the latency of the processor 110, thereby increasing the efficiency of the system.

In some embodiments, processor 110 may include one or more interfaces. The interface may include an integrated circuit (I2C) interface, an integrated circuit built-in audio (I2S) interface, a Pulse Code Modulation (PCM) interface, a universal asynchronous receiver/transmitter (UART) interface, a Mobile Industry Processor Interface (MIPI), a general-purpose input/output (GPIO) interface, a Subscriber Identity Module (SIM) interface, and/or a Universal Serial Bus (USB) interface, etc.

The I2C interface is a bi-directional synchronous serial bus that includes a serial data line (SDA) and a Serial Clock Line (SCL). In some embodiments, processor 110 may include multiple sets of I2C buses. The processor 110 may be coupled to the touch sensor 180K, the charger, the flash, the camera 193, etc. through different I2C bus interfaces, respectively. For example: the processor 110 may be coupled to the touch sensor 180K via an I2C interface, such that the processor 110 and the touch sensor 180K communicate via an I2C bus interface to implement the touch functionality of the electronic device 100.

The I2S interface may be used for audio communication. In some embodiments, processor 110 may include multiple sets of I2S buses. The processor 110 may be coupled to the audio module 170 via an I2S bus to enable communication between the processor 110 and the audio module 170. In some embodiments, the audio module 170 may communicate audio signals to the wireless communication module 160 via the I2S interface, enabling answering of calls via a bluetooth headset.

The PCM interface may also be used for audio communication, sampling, quantizing and encoding analog signals. In some embodiments, the audio module 170 and the wireless communication module 160 may be coupled by a PCM bus interface. In some embodiments, the audio module 170 may also transmit audio signals to the wireless communication module 160 through the PCM interface, so as to implement a function of answering a call through a bluetooth headset. Both the I2S interface and the PCM interface may be used for audio communication.

The UART interface is a universal serial data bus used for asynchronous communications. The bus may be a bidirectional communication bus. It converts the data to be transmitted between serial communication and parallel communication. In some embodiments, a UART interface is generally used to connect the processor 110 with the wireless communication module 160. For example: the processor 110 communicates with a bluetooth module in the wireless communication module 160 through a UART interface to implement a bluetooth function. In some embodiments, the audio module 170 may transmit the audio signal to the wireless communication module 160 through a UART interface, so as to realize the function of playing music through a bluetooth headset.

MIPI interfaces may be used to connect processor 110 with peripheral devices such as display screen 194, camera 193, and the like. The MIPI interface includes a Camera Serial Interface (CSI), a display screen serial interface (DSI), and the like. In some embodiments, processor 110 and camera 193 communicate through a CSI interface to implement the capture functionality of electronic device 100. The processor 110 and the display screen 194 communicate through the DSI interface to implement the display function of the electronic device 100.

The GPIO interface may be configured by software. The GPIO interface may be configured as a control signal and may also be configured as a data signal. In some embodiments, a GPIO interface may be used to connect the processor 110 with the camera 193, the display 194, the wireless communication module 160, the audio module 170, the sensor module 180, and the like. The GPIO interface may also be configured as an I2C interface, an I2S interface, a UART interface, a MIPI interface, and the like.

It should be understood that the connection relationship between the modules according to the embodiment of the present invention is only illustrative, and is not limited to the structure of the electronic device 100. In other embodiments of the present application, the electronic device 100 may also adopt different interface connection manners or a combination of multiple interface connection manners in the above embodiments.

The charging management module 140 is configured to receive charging input from a charger. The charger may be a wireless charger or a wired charger. In some wired charging embodiments, the charging management module 140 may receive charging input from a wired charger via the USB interface 130. In some wireless charging embodiments, the charging management module 140 may receive a wireless charging input through a wireless charging coil of the electronic device 100. The charging management module 140 may also supply power to the electronic device through the power management module 141 while charging the battery 142.

The power management module 141 is used to connect the battery 142, the charging management module 140 and the processor 110. The power management module 141 receives input from the battery 142 and/or the charge management module 140, and supplies power to the processor 110, the internal memory 121, the display 194, the camera 193, the wireless communication module 160, and the like. The power management module 141 may also be used to monitor parameters such as battery capacity, battery cycle count, battery state of health (leakage, impedance), etc. In some other embodiments, the power management module 141 may also be disposed in the processor 110. In other embodiments, the power management module 141 and the charging management module 140 may be disposed in the same device.

The wireless communication function of the electronic device 100 may be implemented by the antenna 1, the antenna 2, the mobile communication module 150, the wireless communication module 160, a modem processor, a baseband processor, and the like.

The antennas 1 and 2 are used for transmitting and receiving electromagnetic wave signals. Each antenna in the electronic device 100 may be used to cover a single or multiple communication bands. Different antennas can also be multiplexed to improve the utilization of the antennas. For example: the antenna 1 may be multiplexed as a diversity antenna of a wireless local area network. In other embodiments, the antenna may be used in conjunction with a tuning switch.

The mobile communication module 150 may provide a solution including 2G/3G/4G/5G wireless communication applied to the electronic device 100. The mobile communication module 150 may include one or more filters, switches, power amplifiers, Low Noise Amplifiers (LNAs), and the like. The mobile communication module 150 may receive the electromagnetic wave from the antenna 1, filter, amplify, etc. the received electromagnetic wave, and transmit the electromagnetic wave to the modem processor for demodulation. The mobile communication module 150 may also amplify the signal modulated by the modem processor, and convert the signal into electromagnetic wave through the antenna 1 to radiate the electromagnetic wave. In some embodiments, at least some of the functional modules of the mobile communication module 150 may be disposed in the processor 110. In some embodiments, at least some of the functional modules of the mobile communication module 150 may be disposed in the same device as at least some of the modules of the processor 110.

The modem processor may include a modulator and a demodulator. The modulator is used for modulating a low-frequency baseband signal to be transmitted into a medium-high frequency signal. The demodulator is used for demodulating the received electromagnetic wave signal into a low-frequency baseband signal. The demodulator then passes the demodulated low frequency baseband signal to a baseband processor for processing. The low frequency baseband signal is processed by the baseband processor and then transferred to the application processor. The application processor outputs a sound signal through an audio device (not limited to the speaker 170A, the receiver 170B, etc.) or displays an image or video through the display screen 194. In some embodiments, the modem processor may be a stand-alone device. In other embodiments, the modem processor may be provided in the same device as the mobile communication module 150 or other functional modules, independent of the processor 110.

The wireless communication module 160 may provide a solution for wireless communication applied to the electronic device 100, including Wireless Local Area Networks (WLANs) (e.g., wireless fidelity (Wi-Fi) networks), Bluetooth (BT), Global Navigation Satellite System (GNSS), Frequency Modulation (FM), Near Field Communication (NFC), Infrared (IR), and the like. The wireless communication module 160 may be one or more devices that integrate one or more communication processing modules. The wireless communication module 160 receives electromagnetic waves via the antenna 2, performs frequency modulation and filtering processing on electromagnetic wave signals, and transmits the processed signals to the processor 110. The wireless communication module 160 may also receive a signal to be transmitted from the processor 110, perform frequency modulation and amplification on the signal, and convert the signal into electromagnetic waves through the antenna 2 to radiate the electromagnetic waves.

In some embodiments, antenna 1 of electronic device 100 is coupled to mobile communication module 150 and antenna 2 is coupled to wireless communication module 160 so that electronic device 100 can communicate with networks and other devices through wireless communication techniques. The wireless communication technology may include global system for mobile communications (GSM), General Packet Radio Service (GPRS), Code Division Multiple Access (CDMA), Wideband Code Division Multiple Access (WCDMA), time-division code division multiple access (time-division code division multiple access, TD-SCDMA), Long Term Evolution (LTE), LTE, BT, GNSS, WLAN, NFC, FM, and/or IR technologies, etc. The GNSS may include a Global Positioning System (GPS), a global navigation satellite system (GLONASS), a beidou satellite navigation system (BDS), a quasi-zenith satellite system (QZSS), and/or a Satellite Based Augmentation System (SBAS).

The electronic device 100 implements display functions via the GPU, the display screen 194, and the application processor. The GPU is a microprocessor for image processing, and is connected to the display screen 194 and an application processor. The GPU is used to perform mathematical and geometric calculations for graphics rendering. The processor 110 may include one or more GPUs that execute program instructions to generate or alter display information.

The display screen 194 is used to display images, video, and the like. The display screen 194 includes a display panel. The display panel may be a Liquid Crystal Display (LCD), an organic light-emitting diode (OLED), an active-matrix organic light-emitting diode (active-matrix organic light-emitting diode, AMOLED), a flexible light-emitting diode (FLED), a miniature, a Micro-oeld, a quantum dot light-emitting diode (QLED), or the like. In some embodiments, the electronic device 100 may include 1 or N display screens 194, with N being a positive integer greater than 1.

The electronic device 100 may implement a shooting function through the ISP, the camera 193, the video codec, the GPU, the display 194, the application processor, and the like.

The ISP is used to process the data fed back by the camera 193. For example, when a photo is taken, the shutter is opened, light is transmitted to the camera photosensitive element through the lens, the optical signal is converted into an electrical signal, and the camera photosensitive element transmits the electrical signal to the ISP for processing and converting into an image visible to naked eyes. The ISP can also carry out algorithm optimization on the noise, brightness and skin color of the image. The ISP can also optimize parameters such as exposure, color temperature and the like of a shooting scene. In some embodiments, the ISP may be provided in camera 193.

The camera 193 is used to capture still images or video. The object generates an optical image through the lens and projects the optical image to the photosensitive element. The photosensitive element may be a Charge Coupled Device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor. The light sensing element converts the optical signal into an electrical signal, which is then passed to the ISP where it is converted into a digital image signal. And the ISP outputs the digital image signal to the DSP for processing. The DSP converts the digital image signal into image signal in standard RGB, YUV and other formats. In some embodiments, the electronic device 100 may include 1 or N cameras 193, N being a positive integer greater than 1.

The digital signal processor is used for processing digital signals, and can process digital image signals and other digital signals. For example, when the electronic device 100 selects a frequency bin, the digital signal processor is used to perform fourier transform or the like on the frequency bin energy.

Video codecs are used to compress or decompress digital video. The electronic device 100 may support one or more video codecs. In this way, the electronic device 100 may play or record video in a variety of encoding formats, such as: moving Picture Experts Group (MPEG) 1, MPEG2, MPEG3, MPEG4, and the like.

The NPU is a neural-network (NN) computing processor that processes input information quickly by using a biological neural network structure, for example, by using a transfer mode between neurons of a human brain, and can also learn by itself continuously. Applications such as intelligent recognition of the electronic device 100 can be realized through the NPU, for example: image recognition, face recognition, speech recognition, text understanding, and the like.

The external memory interface 120 may be used to connect an external memory card, such as a Micro SD card, to extend the memory capability of the electronic device 100. The external memory card communicates with the processor 110 through the external memory interface 120 to implement a data storage function. For example, files such as music, video, etc. are saved in an external memory card.

Internal memory 121 may be used to store one or more computer programs, including instructions. The processor 110 may execute the above-mentioned instructions stored in the internal memory 121, so as to enable the electronic device 100 to execute the method for intelligent contact recommendation provided in some embodiments of the present application, and various functional applications and data processing. The internal memory 121 may include a program storage area and a data storage area. Wherein, the storage program area can store an operating system; the storage area may also store one or more application programs (e.g., gallery, contacts, etc.), etc. The storage data area may store data (such as photos, contacts, etc.) created during use of the electronic device 101, and the like. In addition, the internal memory 121 may include a high-speed random access memory, and may also include a nonvolatile memory, such as one or more magnetic disk storage devices, flash memory devices, Universal Flash Storage (UFS), and the like. In other embodiments, the processor 110 may cause the electronic device 100 to execute the method of intelligently recommending numbers provided in the embodiments of the present application, and various functional applications and data processing by executing instructions stored in the internal memory 121 and/or instructions stored in a memory provided in the processor.

The electronic device 100 may implement audio functions via the audio module 170, the speaker 170A, the receiver 170B, the microphone 170C, the headphone interface 170D, and the application processor. Such as music playing, recording, etc.

The audio module 170 is used to convert digital audio information into an analog audio signal output and also to convert an analog audio input into a digital audio signal. The audio module 170 may also be used to encode and decode audio signals. In some embodiments, the audio module 170 may be disposed in the processor 110, or some functional modules of the audio module 170 may be disposed in the processor 110.

The speaker 170A, also called a "horn", is used to convert the audio electrical signal into an acoustic signal. The electronic apparatus 100 can listen to music through the speaker 170A or listen to a handsfree call.

The receiver 170B, also called "earpiece", is used to convert the electrical audio signal into an acoustic signal. When the electronic apparatus 100 receives a call or voice information, it can receive voice by placing the receiver 170B close to the ear of the person.

The microphone 170C, also referred to as a "microphone," is used to convert sound signals into electrical signals. When making a call or transmitting voice information, the user can input a voice signal to the microphone 170C by speaking the user's mouth near the microphone 170C. The electronic device 100 may be provided with one or more microphones 170C. In other embodiments, the electronic device 100 may be provided with two microphones 170C to achieve a noise reduction function in addition to collecting sound signals. In other embodiments, the electronic device 100 may further include three, four or more microphones 170C to collect sound signals, reduce noise, identify sound sources, perform directional recording, and so on.

In the embodiment of the present application, when the electronic device 100 runs the karaoke application, the microphone 170C may be used to collect the stem voice of the user when the user sings a song, that is, the pure voice without music. And, the audio module 170 may correct the collected user voice stem using a corresponding sound correction algorithm.

For example, if a user recorded song a has a clear or disordered ventilation sound in the user speech, the audio module 170 may weaken the ventilation sound at a corresponding position in the user speech, thereby improving the listening quality of the song sound.

For another example, if a vibrato occurs in the original dry sound of song a, the audio module 170 may detect whether the vibrato is also used in the corresponding location in the user's dry sound of the user recording song a. If the user uses a vibrato with a poor effect, the audio module 170 may correct the vibrato at a corresponding position in the user's dry voice, thereby improving the listening quality of the singing voice.

For another example, the audio module 170 may identify a position where a pitch occurs in the user's dry voice by comparing the user's dry voice of song a with the original dry voice, and then the audio module 170 may perform corresponding pitch up or pitch down on the position where the pitch occurs in the user's dry voice, thereby repairing the intonation problem of the user's dry voice. The specific method for the electronic device 100 to correct the user noise collected by the microphone 170C will be described in detail in the following embodiments, and therefore, details are not described herein.

It can be seen that, in the embodiment of the present application, the electronic device may not only calibrate the intonation of the song recorded by the user, but also correct the problems of singing skills such as ventilation, vibrato and the like in the song recorded by the user. Therefore, a common user who does not have professional vocal music training can record the singing with better listening feeling quality, and therefore the recording effect and the use experience of the user when the user uses the Karaoke applications are improved.

The headphone interface 170D is used to connect a wired headphone. The headset interface 170D may be the USB interface 130, or may be a 3.5mm open mobile electronic device platform (OMTP) standard interface, a cellular telecommunications industry association (cellular telecommunications industry association of the USA, CTIA) standard interface.

The sensors 180 may include a pressure sensor 180A, a gyroscope sensor 180B, an air pressure sensor 180C, a magnetic sensor 180D, an acceleration sensor 180E, a distance sensor 180F, a proximity light sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, an ambient light sensor 180L, a bone conduction sensor 180M, and the like.

The keys 190 include a power-on key, a volume key, and the like. The keys 190 may be mechanical keys. Or may be touch keys. The electronic apparatus 100 may receive a key input, and generate a key signal input related to user setting and function control of the electronic apparatus 100.

The motor 191 may generate a vibration cue. The motor 191 may be used for incoming call vibration cues, as well as for touch vibration feedback. For example, touch operations applied to different applications (e.g., photographing, audio playing, etc.) may correspond to different vibration feedback effects. The motor 191 may also respond to different vibration feedback effects for touch operations applied to different areas of the display screen 194. Different application scenes (such as time reminding, receiving information, alarm clock, game and the like) can also correspond to different vibration feedback effects. The touch vibration feedback effect may also support customization.

Indicator 192 may be an indicator light that may be used to indicate a state of charge, a change in charge, or a message, missed call, notification, etc.

The SIM card interface 195 is used to connect a SIM card. The SIM card can be brought into and out of contact with the electronic apparatus 100 by being inserted into the SIM card interface 195 or being pulled out of the SIM card interface 195. The electronic device 100 may support 1 or N SIM card interfaces, N being a positive integer greater than 1. The SIM card interface 195 may support a Nano SIM card, a Micro SIM card, a SIM card, etc. The same SIM card interface 195 can be inserted with multiple cards at the same time. The types of the plurality of cards may be the same or different. The SIM card interface 195 may also be compatible with different types of SIM cards. The SIM card interface 195 may also be compatible with external memory cards. The electronic device 100 interacts with the network through the SIM card to implement functions such as communication and data communication. In some embodiments, the electronic device 100 employs esims, namely: an embedded SIM card. The eSIM card can be embedded in the electronic device 100 and cannot be separated from the electronic device 100.

The software system of the electronic device 100 may adopt a layered architecture, an event-driven architecture, a micro-core architecture, a micro-service architecture, or a cloud architecture. The embodiment of the present application takes an Android system with a layered architecture as an example, and exemplarily illustrates a software structure of the electronic device 100.

Fig. 2 is a block diagram of a software structure of the electronic device 100 according to the embodiment of the present application.

The layered architecture divides the software into several layers, each layer having a clear role and division of labor. The layers communicate with each other through a software interface. In some embodiments, the Android system is divided into four layers, an application layer, an application framework layer, an Android runtime (Android runtime) and system library, and a kernel layer from top to bottom.

The application layer may include a series of application packages.

As shown in fig. 2, the application package may include APP (application) such as call, contact, camera, gallery, calendar, map, navigation, bluetooth, music, video, short message, etc.

In the embodiment of the application, karaoke applications, such as a song playing APP, a national karaoke APP and the like, can be installed in the application program layer. The user can record songs by using the K song application and integrate own singing voice into the accompaniment provided by the K song application.

The application framework layer provides an Application Programming Interface (API) and a programming framework for the application programs of the application layer. The application framework layer includes a number of predefined functions.

As shown in FIG. 2, the application framework layers may include a window manager, content provider, view system, resource manager, notification manager, and the like.

Wherein, the window manager is used for managing the window program. The window manager can obtain the size of the display screen, judge whether a status bar exists, lock the screen, intercept the screen and the like. The content provider is used to store and retrieve data and make it accessible to applications. The data may include video, images, audio, calls made and received, browsing history and bookmarks, phone books, etc. The view system includes visual controls such as controls to display text, controls to display pictures, and the like. The view system may be used to build applications. The display interface may be composed of one or more views. For example, the display interface including the short message notification icon may include a view for displaying text and a view for displaying pictures. The resource manager provides various resources for the application, such as localized strings, icons, pictures, layout files, video files, and the like. The notification manager enables the application to display notification information in the status bar, can be used to convey notification-type messages, can disappear automatically after a short dwell, and does not require user interaction. Such as a notification manager used to inform download completion, message alerts, etc. The notification manager may also be a notification that appears in the form of a chart or scroll bar text at the top status bar of the system, such as a notification of a background running application, or a notification that appears on the screen in the form of a dialog window. For example, prompting text information in the status bar, sounding a prompt tone, vibrating the electronic device, flashing an indicator light, etc.

The system library may include a plurality of functional modules. For example: surface managers (surface managers), Media Libraries (Media Libraries), three-dimensional graphics processing Libraries (e.g., OpenGL ES), 2D graphics engines (e.g., SGL), and the like.

The surface manager is used to manage the display subsystem and provide fusion of 2D and 3D layers for multiple applications. The media library supports a variety of commonly used audio, video format playback and recording, and still image files, among others. The media library may support a variety of audio-video encoding formats, such as MPEG4, h.264, MP3, AAC, AMR, JPG, PNG, and the like. The three-dimensional graphic processing library is used for realizing three-dimensional graphic drawing, image rendering, synthesis, layer processing and the like. The 2D graphics engine is a drawing engine for 2D drawing.

In the embodiment of the present application, as also shown in fig. 2, the related sound modifying algorithm provided in the embodiment of the present application may be arranged in a system library in the form of a Dynamic Link Library (DLL). For example, a dynamic library of algorithms for the tonal modification algorithm may be stored under the system/lib directory. When the K song application is operated on an application program layer, the K song application can call an algorithm dynamic library of a sound modification algorithm to modify the problems of intonation, vibrato or ventilation sound and the like in the dry sound of the user, and the sound modification function is realized.

The Android Runtime comprises a core library and a virtual machine. The Android runtime is responsible for scheduling and managing an Android system.

The core library comprises two parts: one part is a function which needs to be called by java language, and the other part is a core library of android.

The application layer and the application framework layer run in a virtual machine. And executing java files of the application program layer and the application program framework layer into a binary file by the virtual machine. The virtual machine is used for performing the functions of object life cycle management, stack management, thread management, safety and exception management, garbage collection and the like.

The kernel layer is a layer between hardware and software. The kernel layer at least comprises a display driver, a camera driver, an audio driver, a sensor driver and the like, and the embodiment of the application does not limit the display driver, the camera driver, the audio driver, the sensor driver and the like.

A song recording method and a sound modification method provided in the embodiments of the present application will be described in detail below with reference to the accompanying drawings by taking a mobile phone as an example of an electronic device. As shown in fig. 3, the method comprises the following steps S301-S306.

S301, when the K song application is operated, the mobile phone receives the operation that the user selects to record the song A.

The karaoke applications are application software capable of providing song requesting, song recording and voice modifying functions for the user, for example, the karaoke applications can be national karaoke APPs, song bars APPs and the like, and the embodiment of the application does not limit the functions.

If the mobile phone detects an operation of opening the K song application by the user, as shown in fig. 4, the mobile phone may start to run the K song application and display a first interface 401 of the K song application. The user may search for songs that are preferred by the user for recording in the first interface 401. Alternatively, the user may manually search for and select songs in the search field 402 that need to be recorded. Certainly, the user may also select a song to be recorded in other interfaces of the song-by-song application, which is not limited in this embodiment of the present application.

Still referring to fig. 4, if the user wants to record the song "yew", the user can click the record button 403 of the song "yew" in the first interface 401, and at this time, the mobile phone can receive an operation of the user selecting to record the song "yew". In response to the user selecting the operation of recording the song "red bean", the mobile phone may continue to perform the following steps S302-S306 to complete the recording and sound modification of the song.

S302, the mobile phone acquires the characteristic information of the accompaniment and the original singing dry sound of the song A.

Still taking the example of the user recording the song "red bean", after detecting that the user clicks the recording button 403 of the song "red bean", the mobile phone can download the accompaniment of the song "red bean" from the server. Of course, if the mobile phone stores the accompaniment of the song "red bean" this time, the mobile phone can obtain the accompaniment from the local memory. In general, the parts of songs other than human voice may be called accompaniment, and some accompaniment may also include harmony voice and background voice.

In the embodiment of the application, after detecting that the user clicks the recording button 403 of the song "red bean", the mobile phone may further download the feature information of the original vocal stem of the song "red bean" from the server. Wherein, the characteristic information can reflect the singing characteristics of the original singing (such as Wangfei) when singing the song 'Hongdou'. For example, the feature information may include one or more items of fundamental frequency information of the vocal stem, ventilation position, and trill feature. The trill features may include one or more of trill position, fundamental frequency stability information of the trill, trill frequency, and trill amplitude.

As shown in fig. 5, the server may extract a variation curve f (t) of the fundamental frequency-time in units of frames from the original vocal stem of the song "red bean" in advance, where the variation curve f (t) is the fundamental frequency information of the song "red bean". As also shown in fig. 5, the server may also detect the position of the original vocal ventilation in the original vocal stem, and use the timing of the original vocal ventilation as a feature information. Similarly, as also shown in fig. 5, the server may also detect the location of the vibrato in the original vocal stem and use the time when the vibrato occurs as a feature information.

Typically, trills exhibit periodic pitch changes. Three time-varying characteristics of the tremor amplitude a (t), the tremor frequency Φ (t), and the stationary trend term v (t) can be extracted from the fundamental frequency curve with the vibrato characteristics. Taking the original singing stem of the third sentence of the song "hong bean" as an example to generate the vibrato 1, the fundamental frequency curve f3(t) corresponding to the vibrato 1 is v (t) + a (t) cos (Φ (t)), and t belongs to [8s, 10s ]. Where v (t) is the fundamental frequency stationary information of the vibrato 1, Φ (t) is the vibrato frequency of the vibrato 1, and a (t) is the vibrato amplitude of the vibrato 1. The server can send fundamental frequency stationary information, vibrato frequency and vibrato amplitude of vibrato at each position in the song red bean as characteristic information to the mobile phone.

For example, each time a new song is added to the song library in the server, the server can extract corresponding feature information according to the original vocal stem sound of the new song, and the extracted feature information is stored in the server. Subsequently, if the user selects to record a certain song in the mobile phone, the mobile phone can acquire the accompaniment of the song from the server, and can also acquire the characteristic information of the original vocal stem corresponding to the song from the server. The obtained characteristic information of the original vocal stem can be used for guiding the user to record the song and can also be used as a basis for modifying the song recorded by the user.

And S303, when the recording interface of the K song application is displayed, the mobile phone plays the accompaniment of the song A and records the user voice of singing of the user.

For example, the user records the song "red bean", and after the mobile phone acquires the accompaniment of the song "red bean", the mobile phone can start playing the accompaniment of the song "red bean". Meanwhile, the mobile phone can open the microphone to start recording the user voice of the user singing the song red bean.

As shown in fig. 6, during the process of recording the song "red bean", the mobile phone may display a recording interface 601 of the song "red bean". The recording interface 601 may display lyrics 602 of the song "red bean", and the position of the lyrics currently being recorded may be marked in the lyrics 602 in real time, so that the user may record the song as prompted by the lyrics 602.

S304, when recording the song A, the mobile phone prompts the user to adjust the singing modes such as intonation, ventilation or vibrato and the like according to the characteristic information of the original singing dry sound.

For example, when the user records a song using the K song application, the mobile phone may instruct the user to sing the song when displaying the lyric 602 according to the feature information obtained in step S302, for example, instruct the user to sing modes such as intonation, ventilation, vibrato, and the like.

For example, the user records a song "red bean", and the characteristic information of the original dry singing sound acquired by the mobile phone includes information such as a ventilation position and a vibrato position. Then, as shown in fig. 7, when the lyric 602 of the song "red bean" is displayed in real time, the mobile phone may mark the ventilation position 701 (i.e. ventilation prompt) at the corresponding position of the lyric 602 according to the ventilation position recorded in the feature information of the original vocal stem. Therefore, when the user records the song red bean, the user can exchange air according to the air exchange position 701 prompted in the lyric 602, so that the user is helped to control the change of the breath when singing, and the more obvious and disordered air exchange sound in the recorded song is reduced, and the auditory quality of the recorded song is improved.

For another example, as shown in fig. 7, when the lyric 602 of the song "red bean" is displayed in real time, the mobile phone may further mark a vibrato position 702 (i.e., a vibrato prompt) at a corresponding position of the lyric 602 according to the vibrato position recorded in the feature information of the original vocal stem. Thus, when recording the song red bean, the user can sing by using the vibrato according to the vibrato position 702 prompted in the lyric 602, thereby improving the hierarchy sense and the singing skill of the song when the user sings, and improving the listening quality of the recorded song.

In other embodiments, the mobile phone may instruct the user to sing in the lyrics displayed in real time, and may further include a singer in the recording interface, where the mobile phone may instruct the user to adjust the singing modes such as intonation, ventilation, vibrato, and the like.

Still taking the example of the user recording the song "red bean", as shown in fig. 8, the mobile phone may set a singer 801 in the recording interface 601 when displaying the recording interface 601. The musical notes for each word of a lyric currently being recorded may be displayed in the singer 801. For example, the mobile phone can display the tone line 802 (i.e. intonation cue) of each note in real time in the singer 801 according to the fundamental frequency information of the original vocal stem. The higher the pitch of the current note, the higher the position of the pitch line 802. For another example, the mobile phone may display a tremolo note 803 (i.e., a tremolo cue) in the singer 801 according to the tremolo position of the original vocal stem. The tremor note 803 may be represented by a wavy line or a line segment of a particular color. The tremolo note 803 appears at the same position as the tremolo in the original vocal stem. For another example, if a ventilation sound occurs in the original vocal stem corresponding to the lyric of the sentence, the mobile phone may display the ventilation symbol 804 (i.e., ventilation prompt) in the singer 801 according to the ventilation position in the original vocal stem. The air mover 804 may be represented by a graphic of a specified shape, such as a triangle, circle, or the like. The ventilation symbol 804 appears at the same position as the position where the ventilation sound appears in the original vocal stem sound.

Thus, when recording a song, the user can adjust the tone of the human voice according to the tone line displayed in real time in the singer 801, can also sing at the corresponding position by using the vibrato according to the vibrato symbol displayed in real time in the singer 801, and can also replace air at the corresponding position according to the air changing symbol displayed in real time in the singer 801, thereby guiding the singing skills such as tone, vibrato, air changing and the like when the user sings in the recording process and improving the recording quality of the song.

It can be understood that, when a user records a song, besides instructing the user to sing in the lyrics displayed on the recording interface and the singing device, a person skilled in the art can also instruct the user to sing a better song when the user records a song by using the characteristic information of the original dry singing sound according to actual experience or actual application scenarios, which is not limited in any way by the embodiment of the present application.

S305, when recording the song A, the mobile phone evaluates the singing of the user by comparing the characteristic information of the original singing dry sound with the characteristic information of the user dry sound.

For example, the user records the song "yew", and when recording the song "yew", the mobile phone can also evaluate the current singing of the user in real time according to the characteristic information of the original singing noise. For example, the mobile phone may evaluate the singing of the user at a granularity of words, phrases or sentences in the lyrics. As shown in fig. 9, after the mobile phone collects the user speech of the sentence "i will believe that all have the end" the user sings, the mobile phone can extract the feature information of the user speech. Similar to the feature information of the original dry sound, the mobile phone can extract the fundamental frequency information of the user dry sound, and then detect one or more feature information of the ventilation position, the vibrato position, the fundamental frequency stability information of the vibrato, the vibrato frequency, the vibrato amplitude and the like of the user based on the fundamental frequency information of the user dry sound.

Then, the mobile phone can determine the evaluation result of singing skills such as intonation, vibrato, ventilation and the like of the user by comparing the characteristic information 1 when the user sings the lyric of which i believe that all the lyrics have the end with the characteristic information 2 when the user sings the lyric of which i believe that all the lyrics have the end.

For example, if the difference between the fundamental frequency information of the user's voice stem and the fundamental frequency information of the original voice stem is smaller than the threshold, it indicates that the intonation of the user playing the sentence matches the original voice, as shown in (a) of fig. 10, the mobile phone may display a prompt 1001 of "intonation arrival" in the recording interface 601, to prompt the user that the intonation is correct when playing the sentence. If the location of the trill in the user's dry sound is the same as the location of the trill in the original dry sound, and the difference between the trill characteristics (i.e., the information on the frequency, amplitude, and fundamental frequency of the trill) of the trill in the user's dry sound and the corresponding trill characteristics in the original dry sound is less than the threshold, it is indicated that the trill generated when the user plays the sentence matches the original sound, as shown in fig. 10 (b), the mobile phone may display a prompt 1002 "after trill day" in the recording interface 601. If the position of the ventilation sound in the user stem sound is the same as the position of the ventilation sound in the original vocal stem sound, it indicates that the adjustment of the breath when the user plays the sentence matches the original vocal, as shown in (c) of fig. 10, the mobile phone may display a prompt 1003 of "ventilation get" in the recording interface 601.

Besides prompting the current evaluation result of the user in real time through characters, the mobile phone can also show the current evaluation result to the user in real time through modes of color, vibration or special effect and the like.

For example, as shown in fig. 11 (a), if the vibrato used when the user sings a lyric of "i will believe that all have the end" does not coincide with the vibrato used when the user sings the original lyric, the mobile phone may mark the "end" 1101 as a different color or a different font or the like from other lyrics when displaying the "end" 1101 of the lyric corresponding to the vibrato. For another example, as shown in fig. 11 (b), if the ventilation position when the user plays the album does not match the ventilation position when the user played the album, the mobile phone may mark a ventilation error flag 1102 when displaying the lyrics corresponding to the ventilation position.

Still alternatively, the mobile phone may also present the evaluation result determined by the mobile phone to the user in the singer 801 of the recording interface 601. For example, as shown in fig. 12 (a), when the user sings a "word," the mobile phone can display not only the tone line 1201 of the original singing of the "word" in real time in the singer 801, but also the tone line 1202 of the "word" being sung by the user this time. If the fundamental frequency of the word 'with' is sung by the user is lower than the fundamental frequency of the word 'with' as the original singing, which indicates that the tone of the word 'with' is sung by the user is lower than the tone of the word 'with' as the original singing, the tone line 1202 displayed by the mobile phone is lower than the tone line 1201. Accordingly, if it is detected that the user sings a "with" word at a higher pitch than the original one, the handset may display a higher pitch line 1202 than the pitch line 1201. In addition, if it is detected that the user sings the "with" word at the same pitch as the original singing the "with" word, the pitch line 1202 in the singer 801 coincides with the pitch line 1201. In this way, the user can know the tone level of the current singing in the singing device 801 during the real-time singing.

For another example, as shown in fig. 12 (b), the cellular phone may display a vibrato symbol 1203 of an original singing vibrato 1 in the singer 801. After detecting the user's voice stem when the user sings the trill 1, the mobile phone can compare whether the position, the amplitude of vibration A (t), the frequency of vibration phi (t) and the steady trend item V (t) of the trill when the user sings the trill 1 are matched with the position, the amplitude of vibration A (t), the frequency of vibration phi (t) and the steady trend item V (t) of the trill when the user sings the trill 1. If the two are not matched, it indicates that the user has not achieved the preset trill effect when singing trill 1 this time, the mobile phone can display the trill symbol 1203 as the display effect of dotted line, etc. Correspondingly, if the two are matched, which indicates that the preset trill effect is achieved when the user performs the trill 1 this time, as shown in (b) of fig. 12, the mobile phone may thicken or light the trill symbol 1203, so that the user can know the current trill singing effect in the singing device 801 in the real-time singing process.

It can be understood that, a person skilled in the art may also evaluate the user stem voice sung by the user in real time when recording a song according to the feature information of the original voice stem voice according to the actual experience or the actual application scenario, and show the evaluation results of the ventilation skill, the vibrato skill, the intonation accuracy, and the like to the user, for example, the mobile phone may score the user stem voice according to the feature information of the original voice stem voice and display the scoring result to the user, which is not limited in this embodiment of the present application.

It can be seen that in the process of recording a song by a user, the embodiment of the application can evaluate one or more singing skills such as intonation, vibrato, ventilation and the like of the user in real time according to the characteristic information of the original singing stem. Moreover, the mobile phone can feed back the evaluation result of singing of the user to the user in real time in the recording process, so that the user can know the current singing effect in time, and the user can adjust the singing mode in time in the subsequent recording process.

And S306, after the song A is recorded, the mobile phone corrects the user voice according to the feature information of the original voice and the feature information of the user voice.

When the mobile phone finishes playing the accompaniment of the song A or detects that the user inputs the operation of finishing recording, the mobile phone can close the microphone and does not continue to collect the user noise emitted by the user. And after song a recording is finished, as shown in fig. 13, the mobile phone can display a sound modification interface 1301 for the K song applications. A mute switch 1302 may be included in mute interface 1301. If it is detected that the user turns on the mute switch 1302, the mobile phone can mute the singing problem (for example, one or more of the singing problems such as intonation, vibrato, or ventilation sound) appearing in the user's voice collected when the user records the song a according to the characteristic information of the original singing voice of the song a.

Illustratively, still taking the example of the user recording the song "yew", after the user records the song "yew" in the recording interface, the mobile phone may display a sound modification interface 1301 as shown in fig. 13. If the user is detected to turn on the sound modification switch 1302 in the sound modification interface 1301, the mobile phone can modify the intonation, ventilation sound and vibrato in the collected user's dry sound according to the feature information of the original singing dry sound.

For example, the mobile phone can compare the fundamental frequency information of the original vocal stem with the fundamental frequency information of the song "red bean" recorded by the user at this time, so as to correct the tone accuracy problem appearing in the vocal stem of the user at this time. For example, if the pitch of the 10 th to 20 th seconds in the user's stem is lower than the pitch of the 10 th to 20 th seconds in the original vocal stem, the handset may perform an up-shift process on the pitch of the part of the user's stem so that the pitch of the part of the user's stem is the same as or close to the pitch of the original vocal stem. Accordingly, if the pitch of the 1 st 12 th-1 st 27 th second in the user stem is higher than the pitch of the 1 st 12 th-1 st 27 th second in the original vocal stem, the mobile phone can perform pitch reduction processing on the pitch of the part of the user stem, so that the pitch of the part of the user stem is the same as or close to the pitch of the original vocal stem.

For another example, the mobile phone may extract the position of the ventilation sound from the user's voice of the song "red bean" recorded by the user this time. For example, the handset may detect the presence of ventilation sounds at both 14 th and 78 th seconds of the user's dry voice using a preset ventilation detection model. Furthermore, the mobile phone can reduce the energy and frequency response of the dry sound of the user at the 14 th and 78 th seconds, thereby weakening the ventilation sound appearing at the 14 th and 78 th seconds to improve the listening quality of the recorded song.

For another example, the mobile phone may compare the trill feature of the original dry sound with the trill feature of the song "red bean" recorded by the user this time, so as to correct the trill with poor trill effect in the dry sound of the user this time. For example, the handset may detect the location of the presence of a vibrato in the user's dry sound using a preset vibrato detection model. For example, the tremolo 1 is detected in the 15 th second of the dry sound of the user, if the tremolo 1 vibration amplitude A1(t), the vibration frequency phi 1(t) and the steady trend item V1(t) are matched with the tremolo 2 vibration amplitude A2(t), the vibration frequency phi 2(t) and the steady trend item V2(t) in the 15 th second of the original sound dry sound, which shows that the original singing effect of the tremolo 1 sung by the user is basically consistent, the mobile phone does not need to modify the tremolo 1. Accordingly, if the tremolo amplitude a1(t), the tremolo frequency Φ 1(t), and the stationary tendency item V1(t) of the tremolo 1 do not match the tremolo amplitude a2(t), the tremolo frequency Φ 2(t), and the stationary tendency item V2(t) of the tremolo 2 at the 15 th second in the original vocal stem, the cellular phone can synthesize a new tremolo in the 15 th second user stem using the tremolo amplitude a2(t), the tremolo frequency Φ 2(t), and the stationary tendency item V2(t) of the tremolo 2, thereby improving the tremolo sound effect occurring in the user stem.

In some embodiments, as shown in fig. 14, after detecting that the user turns on the sound modification switch 1302 in the sound modification interface 1301, the mobile phone may further display a sound modification option 1401 for modifying the tone, a sound modification option 1402 for modifying the ventilation sound, and a sound modification option 1403 for modifying the vibrato sound. In this way, the user can manually select and repair the intonation, ventilation or vibrato problem in the recorded user's dry voice in the sound repairing interface 1301.

In other embodiments, after the mobile phone collects the user's voice of the user recording the song "red bean", the problems of intonation, ventilation, vibrato and the like in the user's voice can be automatically corrected, so that the listening quality of the song during the subsequent playing and recording of the song is improved. For example, as also shown in fig. 14, a play button 1404 is provided in the sound correction interface 1301. If the user clicks the play button 1404, the mobile phone can automatically correct the problems of intonation, ventilation sound, vibrato and the like in the dry sound of the user according to the characteristic information of the original singing dry sound, and the corrected dry sound of the user is blended into the accompaniment of the song red bean to be played.

In some embodiments, as shown in fig. 15 (a), a single sentence refinement option 1501 may also be set in the voicing interface 1301 of the K song-like application. If a user click on option 1501 is detected, the handset may display a refinement interface 1502 as shown in fig. 15 (b). In the refinement interface 1502, the mobile phone can display each lyric sung by the user in the song "red bean", and use each lyric as an option for the user to select. Taking the example that the user selects the lyric 1503 that i believe that all have the end, after the user selects the lyric 1503, the mobile phone can play the recorded part of the song corresponding to the lyric 1503. Also provided in the manicure interface 1301 are a intonation button 1504, a vibrato button 1505, and a ventilation button 1506. The user can manually select in the refinement interface 1502 whether to repair one or more singing problems such as intonation, vibrato, or ventilation occurring in the user's voice corresponding to the lyrics 1503. If the user clicks the trill button 1525 and the intonation button 1504 after selecting the lyric 1503, the mobile phone can correct the intonation and trill problems in the user stem corresponding to the lyric 1503, and blend the corrected user stem into the corresponding accompaniment for playing. Therefore, the user can correct the singing problem in the recorded voice stem of the user sentence by sentence, and the user can compare the recording effect before and after correction sentence by sentence, so that the song recorded by the user is corrected to be more satisfactory singing effect for the user.

In other embodiments, as shown in fig. 16 (a), a single sentence editing option 1601 may also be provided in the dubbing interface 1301 of the K song application. If the user is detected to click on the option 1601, as shown in fig. 16 (b), the cellular phone may display an editing interface 1602. The mobile phone can display each lyric sung by the user in the song "red bean" in the editing interface 1602, and when each lyric is displayed, the mobile phone can prompt the user at the corresponding position of the lyric to sung the problem of sung when the user sung the lyric, such as inaccurate tone, large ventilation sound, unstable vibrato, lack of vibrato, and the like. Thus, the user can intuitively see the specific problems of singing each lyric.

In addition, as also shown in (b) in fig. 16, an edit button 1603 corresponding to each sentence of lyrics may also be provided in the edit interface 1602. If it is detected that the user clicks the edit button 1603 corresponding to a certain lyric, the mobile phone can play the accompaniment corresponding to the lyric, and open the microphone to collect the user's dry voice of the user singing the lyric again. After the mobile phone collects the user voice of the user singing the lyric again, the user voice recorded again can be used for replacing the corresponding user voice in the recorded song red bean. Since the mobile phone already prompts the user for a specific singing problem when singing the lyrics when the editing interface 1602 displays the lyrics, the user can adjust the singing mode in time to correct the singing problem occurring when recording the lyrics last time when singing the lyrics again, thereby improving the singing and recording quality.

As shown in fig. 17, an embodiment of the present application discloses an electronic device, including: a touch screen 1701, the touch screen 1701 comprising a touch sensitive surface 1706 and a display screen 1707; one or more processors 1702; a memory 1703; one or more microphones 1708; and one or more computer programs 1704. The various devices described above may be connected by one or more communication buses 1705. Wherein the one or more computer programs 1704 are stored in the memory 1703 and configured to be executed by the one or more processors 1702, the one or more computer programs 1704 including instructions that may be used to perform the steps of the embodiments described above.

For example, the processor 1702 may be specifically the processor 110 shown in fig. 1, the memory 1703 may be specifically the internal memory 121 and/or the external memory 120 shown in fig. 1, the display screen 1707 may be specifically the display screen 194 shown in fig. 1, the microphone 1708 may be specifically the microphone 170C shown in fig. 1, and the touch-sensitive surface 1706 may be specifically the touch sensor 180K in the sensor module 180 shown in fig. 1, which is not limited in this embodiment of the present invention.

In some embodiments, the present application also provides a Graphical User Interface (GUI) that may be stored in an electronic device. Illustratively, the electronic device may be the electronic device shown in fig. 1 or fig. 17.

Illustratively, the graphical user interface includes: a first GUI displayed on the touch screen, as shown in fig. 4, the first GUI may be an interface 401 of a song-K application, and the first GUI includes a record button 403 for recording a first song; in response to the touch event for the record button, the electronic device may display a second GUI on the touch screen, as shown in fig. 7 or 8, which may be a recording interface 601 for the user to record the first song. Instructional information directing the user to sing the first song may be included in the second GUI, the instructional information including ventilation prompts and/or vibrato prompts. For example, as shown in FIG. 7, recording interface 601 may display lyrics 602 including a ventilation prompt 701 and a vibrato prompt 702. For another example, as shown in fig. 8, a tone cue 802, a ventilation cue 804, and a tremolo cue 803 are displayed in the singer 801 on the recording interface 601. The guide information in the second GUI is generated by the electronic device based on the characteristic information of the original vocal stem.

In some embodiments, the second GUI may further include a result of evaluating a ventilatory and/or vibrato skill when the user sings. For example, as shown in (a) to (c) of fig. 10, the recording interface 601 includes an evaluation result 1001 of the user's ventilatory skills, an evaluation result 1002 of the user's vibrato skills, and an evaluation result 1003 of the user's intonation. For another example, as shown in (a) - (b) of fig. 12, the result 1202 of evaluating the intonation of the user, the result 1203 of evaluating the vibrato skill of the user, and the like may be displayed in the singer of the recording interface 601, which is not limited in this embodiment.

In some embodiments, after the recording of the first song is finished, the electronic device may display a third GUI on the touch screen, as shown in fig. 13 or 14, where the third GUI may be a sound modification interface 1301 for the first song, and the third GUI includes a sound modification switch 1302; when the user turns on the tone modification switch 1302, the electronic device can modify singing problems such as ventilation problems and/or vibrato problems occurring in the user's dry voice.

Through the above description of the embodiments, it is clear to those skilled in the art that, for convenience and simplicity of description, the foregoing division of the functional modules is merely used as an example, and in practical applications, the above function distribution may be completed by different functional modules according to needs, that is, the internal structure of the device may be divided into different functional modules to complete all or part of the above described functions. For the specific working processes of the system, the apparatus and the unit described above, reference may be made to the corresponding processes in the foregoing method embodiments, and details are not described here again.

Each functional unit in the embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.

The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solutions of the embodiments of the present application may be essentially implemented or make a contribution to the prior art, or all or part of the technical solutions may be implemented in the form of a software product stored in a storage medium and including several instructions for causing a computer device (which may be a personal computer, a server, or a network device) or a processor to execute all or part of the steps of the methods described in the embodiments of the present application. And the aforementioned storage medium includes: flash memory, removable hard drive, read only memory, random access memory, magnetic or optical disk, and the like.

The above description is only a specific implementation of the embodiments of the present application, but the scope of the embodiments of the present application is not limited thereto, and any changes or substitutions within the technical scope disclosed in the embodiments of the present application should be covered by the scope of the embodiments of the present application. Therefore, the protection scope of the embodiments of the present application shall be subject to the protection scope of the claims.

Claims

1. A method for song recording, comprising:

the method comprises the steps that electronic equipment displays a first interface, wherein the first interface comprises a recording button used for recording a first song;

responding to the operation of selecting the recording button by a user, the electronic equipment acquires the accompaniment of the first song and the characteristic information of the original singing dry sound, wherein the characteristic information of the original singing dry sound is used for reflecting the singing characteristic of the original singing when the first song is singed;

the electronic equipment starts to play the accompaniment of the first song and records the user voice of singing of the user;

in the process of recording the dry voice of the user, the electronic equipment displays guide information in a second interface according to the characteristic information of the original dry voice, wherein the guide information is used for guiding one or more items of ventilation or vibrato when the user sings.

2. The method of claim 1, wherein the characteristic information of the original vocal stem comprises ventilation position of the original vocal stem; the guidance information comprises ventilation prompts;

wherein, the electronic device displays guidance information in the second interface according to the feature information of the original vocal stem, and the method comprises the following steps:

and the electronic equipment displays a ventilation prompt corresponding to the ventilation position of the original vocal stem sound in the second interface according to the ventilation position of the original vocal stem sound.

3. The method of claim 2, wherein lyrics of the first song are included in the second interface;

wherein, the electronic device displays a ventilation prompt corresponding to the ventilation position of the original vocal stem sound in the second interface according to the ventilation position of the original vocal stem sound, and the method comprises the following steps:

and the electronic equipment displays the ventilation prompt at a first position of the lyrics, wherein the first position is the same as the ventilation position of the original singing dry sound.

4. The method of claim 2, wherein the second interface includes a singer for displaying a note for each word of the lyrics being recorded;

and the electronic equipment displays the ventilation prompt at a second position of the singing device, wherein the second position is the same as the ventilation position of the original singing stem sound.

5. The method according to claim 1, wherein the feature information of the original vocal stem comprises a vibrato feature of the original vocal stem; the guidance information comprises a trill prompt;

and the electronic equipment displays a trill prompt corresponding to the trill characteristic of the original singing stem sound in the second interface according to the trill characteristic of the original singing stem sound.

6. The method of claim 5, wherein the vibrato features include a vibrato location; the second interface comprises lyrics of the first song;

the electronic device displays a trill prompt corresponding to the trill feature of the original vocal stem in the second interface according to the trill feature of the original vocal stem, and the method comprises the following steps:

and the electronic equipment displays the trill prompt at a third position of the lyrics, wherein the third position is the same as the trill position in the original singing dry sound.

7. The method of claim 5, wherein the vibrato features include a vibrato location; the second interface comprises a singer, and the singer is used for displaying the musical notes of each word in the lyrics being recorded;

and the electronic equipment displays the trill prompt at a fourth position of the singing device, wherein the fourth position is the same as the trill position in the original singing stem sound.

8. The method of any of claims 1-7, wherein during recording of user dry sound, the method further comprises:

the electronic equipment extracts feature information of the user stem from the user stem, wherein the feature information of the user stem is used for reflecting singing features of a user when singing the first song;

and the electronic equipment displays an evaluation result in the second interface by comparing the characteristic information in the user stem voice with the characteristic information of the original vocal stem voice.

9. The method of claim 8, wherein the characteristic information in the user vocal stem comprises a ventilation position of the user vocal stem, and the characteristic information of the original vocal stem comprises a ventilation position of the original vocal stem;

wherein the electronic device displays the evaluation result in the second interface by comparing the feature information in the user vocal stem with the feature information of the original vocal stem, and the evaluation result includes:

and the electronic equipment displays the evaluation result of the ventilation skill in the second interface by comparing the ventilation position of the dry sound of the user with the ventilation position of the original singing dry sound.

10. The method of claim 9, wherein the electronic device displays the result of the evaluation of ventilatory skills in the second interface by comparing the ventilatory position of the user's stem voice with the ventilatory position of the original vocal stem voice, comprising:

if the ventilation position of the user dry sound is the same as the ventilation position of the original singing dry sound, the electronic equipment displays a prompt of accurate ventilation in the second interface;

and if the ventilation position of the user stem sound is different from the ventilation position of the original singing stem sound, the electronic equipment displays a ventilation error prompt in the second interface.

11. The method according to claim 8, wherein the characteristic information in the user's dry voice comprises a vibrato characteristic of the user's dry voice, and the characteristic information of the original dry voice comprises a vibrato characteristic of the original dry voice;

and the electronic equipment displays the evaluation result of the trill skill in the second interface by comparing the trill characteristic of the dry sound of the user with the trill characteristic of the original dry sound.

12. The method of claim 11, wherein the electronic device displays the assessment of vibrato skills in the second interface by comparing the vibrato features of the user's stem and the original vocal stem, comprising:

if the trill characteristics of the dry sound of the user are matched with the trill characteristics of the original dry sound, the electronic equipment displays a prompt for accurate trill in the second interface;

and if the trill characteristic of the dry sound of the user is not matched with the trill characteristic of the original dry sound, the electronic equipment displays a trill error prompt in the second interface.

13. The method of any of claims 8-12, wherein the electronic device displays the assessment results in the second interface, comprising:

the electronic equipment displays the evaluation result in the lyrics of the second interface; or,

and the electronic equipment displays the evaluation result in a singer of the second interface, wherein the singer is used for displaying the musical notes of each word in the lyrics being recorded.

14. The method according to any one of claims 1-13, further comprising, after the electronic device displays guidance information in the second interface according to the feature information of the original vocal stem sound:

and after the first song is recorded, the electronic equipment corrects the singing problem in the user stem sound according to the characteristic information of the original singing stem sound, wherein the singing problem comprises one or more of a ventilation problem or a vibrato problem.

15. A method of modifying sound, comprising:

16. The method of claim 15, wherein the singing question is a ventilation question;

the electronic equipment corrects the singing problem appearing in the user stem according to the feature information of the original singing stem, and the method comprises the following steps:

the electronic equipment extracts ventilation sound in the user dry sound from the user dry sound;

the electronic device reduces the energy and frequency response of ventilation sound in the user's dry sound to correct ventilation problems that occur in the user's dry sound.

17. The method of claim 15, wherein the singing question is a vibrato question,

the electronic equipment extracts the trill characteristics of the user dry sound from the user dry sound;

and if the trill feature of the user dry sound is not matched with the trill feature in the feature information of the original singing dry sound, the electronic equipment corrects the trill feature in the user dry sound by using the trill feature in the original singing dry sound so as to correct the trill problem in the user dry sound.

18. The method of any of claims 15-17, wherein after the end of the recording of the first song, the method further comprises:

the electronic equipment displays a second interface, wherein the second interface comprises a sound repairing switch;

and if the fact that the user turns on the tone modifying switch is detected, the electronic equipment modifies the singing problem appearing in the user stem sound according to the feature information of the original singing stem sound.

19. The method of any of claims 15-17, wherein after the end of the recording of the first song, the method further comprises:

the electronic equipment displays a second interface, wherein the second interface comprises a sound modification option, and the sound modification option comprises a tone correction option, a ventilation sound modification option and a vibrato modification option;

and responding to a voice modifying option selected by a user, and modifying the singing problem corresponding to the voice modifying option in the user voice by the electronic equipment according to the characteristic information of the original singing voice.

20. The method of any of claims 15-17, wherein after the end of the recording of the first song, the method further comprises:

the electronic equipment displays a second interface, wherein the second interface comprises a single sentence finishing option;

responding to the option of the single sentence refinement selected by the user, and displaying a third interface by the electronic equipment, wherein the third interface comprises N sentences of lyrics in the first song, and N is more than 1;

and if the user selects the first lyric in the N lyrics, the electronic equipment corrects the singing problem in the user stem corresponding to the first lyric according to the characteristic information of the original singing stem.

21. The method of claim 20, wherein the electronic device displays a third interface comprising:

and when the electronic equipment displays the N words of the lyrics in the third interface, marking the position where the singing problem appears.

22. An electronic device, comprising:

a touch screen comprising a touch sensitive surface and a display screen;

one or more processors;

one or more memories;

one or more microphones;

and one or more computer programs, wherein the one or more computer programs are stored in the one or more memories, the one or more computer programs comprising instructions which, when executed by the electronic device, cause the electronic device to perform the song recording method of any one of claims 1-14, or the sound modifying method of any one of claims 15-21.

23. A computer-readable storage medium having instructions stored thereon, which, when run on an electronic device, cause the electronic device to perform the song recording method of any one of claims 1-14, or the sound modification method of any one of claims 15-21.

24. A computer program product comprising instructions for causing an electronic device to perform the method of recording songs according to any one of claims 1 to 14 or the method of modifying sounds according to any one of claims 15 to 21 when the computer program product is run on the electronic device.

25. A graphical user interface, GUI, stored in an electronic device comprising a touch screen, a memory, a processor for executing one or more computer programs stored in the memory, the graphical user interface comprising:

a first GUI displayed on the touch screen, wherein the first GUI comprises a recording button for recording a first song;

and responding to a touch event aiming at the recording button, and displaying a second GUI on the touch screen, wherein the second GUI is a recording interface for recording the first song, the second GUI comprises guidance information for guiding a user to sing the first song, and the guidance information comprises a ventilation prompt and/or a vibrato prompt.

26. The GUI of claim 25, further comprising results of an assessment of ventilatory and/or vibrato skills while a user is singing.

27. The GUI of claim 25 or 26, further comprising, after displaying the second GUI on the touch screen:

after the first song is recorded, displaying a third GUI on the touch screen, wherein the third GUI is a sound repairing interface of the first song and comprises a sound repairing switch; when the sound correcting switch is turned on, the electronic equipment corrects the ventilation problem and/or the vibrato problem occurring in the dry sound of the user.