CN113204673A - Audio processing method, device, terminal and computer readable storage medium - Google Patents

Audio processing method, device, terminal and computer readable storage medium

Info

Publication number
CN113204673A
Authority
CN
China
Prior art keywords
audio
pitch
target
character
pitch adjustment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110470416.XA
Other languages
Chinese (zh)
Inventor
张昆
马小坤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Dajia Internet Information Technology Co Ltd
Original Assignee
Beijing Dajia Internet Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Dajia Internet Information Technology Co Ltd filed Critical Beijing Dajia Internet Information Technology Co Ltd
Priority to CN202110470416.XA priority Critical patent/CN113204673A/en
Publication of CN113204673A publication Critical patent/CN113204673A/en
Priority to PCT/CN2021/136890 priority patent/WO2022227589A1/en
Pending legal-status Critical Current

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70 Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/783 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/7844 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using original textual content or text extracted from visual content or transcript of audio data
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70 Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/73 Querying
    • G06F16/738 Presentation of query results
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048 Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484 Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Reverberation, Karaoke And Other Acoustics (AREA)

Abstract

The disclosure relates to an audio processing method, an audio processing device, a terminal, and a computer-readable storage medium, and belongs to the technical field of audio processing. By providing a pitch adjustment interface, the pitch of an audio segment can be adjusted while the lyrics of the part being adjusted are displayed visually, so that no professional is needed to make the adjustment by reading a music score. The learning cost is low, the operation is quick and convenient, and audio processing efficiency is improved.

Description

Audio processing method, device, terminal and computer readable storage medium
Technical Field
The present disclosure relates to the field of audio processing technologies, and in particular, to an audio processing method, an audio processing device, a terminal, and a computer-readable storage medium.
Background
With the continuous development of computer technology, people have begun to record their own singing on electronic devices and to share the recordings on various platforms, such as streaming media, karaoke, or social platforms, in order to show off their singing or to sing karaoke with others.
At present, the accompaniment used when recording a song is produced in advance by a professional, which is costly, and the professional produces the accompaniment by adjusting a music score, so the operation is complex and audio processing efficiency is low.
Disclosure of Invention
The present disclosure provides an audio processing method, apparatus, terminal, and computer-readable storage medium, which can simplify the operation of an audio processing process and improve audio processing efficiency. The technical scheme of the disclosure is as follows:
According to a first aspect of embodiments of the present disclosure, there is provided an audio processing method, the method including:
displaying, in a pitch adjustment interface of a first audio, characters in lyrics of the first audio and pitch adjustment controls of the characters, wherein a pitch adjustment control is used for adjusting the pitch of the audio segment corresponding to its character in the first audio;
in response to an adjustment operation on the pitch adjustment control corresponding to a target character, determining a target pitch of a target audio segment, wherein the target audio segment is the audio segment corresponding to the target character;
and generating a second audio based on the adjusted target audio segment.
By providing the pitch adjustment interface, the pitch of the corresponding audio segment can be adjusted while the lyrics of the part being adjusted are displayed visually, so that no professional is needed to make the adjustment by reading a music score. The learning cost is low, the operation is quick and convenient, and audio processing efficiency is improved.
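The three steps of the first aspect can be illustrated with a minimal sketch. It models the audio symbolically rather than as a waveform, and everything in it is an assumption made for illustration: the `Segment` record, the `adjust_pitch` function, and the use of MIDI note numbers for pitch are hypothetical and are not taken from the disclosure.

```python
from dataclasses import dataclass, replace
from typing import List


@dataclass(frozen=True)
class Segment:
    """One lyric character and the audio segment it maps to (hypothetical model)."""
    char: str
    start_ms: int
    end_ms: int
    pitch: int  # assumed here to be a MIDI note number


def adjust_pitch(first_audio: List[Segment], index: int, target_pitch: int) -> List[Segment]:
    """Return a 'second audio': a copy of the first with one segment re-pitched."""
    second_audio = list(first_audio)
    second_audio[index] = replace(first_audio[index], pitch=target_pitch)
    return second_audio


first_audio = [Segment("la", 0, 400, 60), Segment("li", 400, 900, 62)]
# The user drags the control of the second character up to pitch 64.
second_audio = adjust_pitch(first_audio, 1, 64)
```

The first audio is left untouched and a new sequence is returned, which mirrors the distinction the method draws between the first audio and the generated second audio. Re-synthesizing the actual sound at the new pitch is outside the scope of this sketch.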
In some embodiments, the displaying the character in the lyric of the first audio and the pitch adjustment control of the character in the pitch adjustment interface of the first audio includes:
displaying a part of characters in the lyric in a character display area of the pitch adjustment interface;
and displaying, in the region corresponding to the target character among the displayed characters, a pitch adjustment control of the target character based on the pitch of the target audio segment.
The pitch adjustment interface is divided into a character display area and regions corresponding to the individual characters, so that the characters are displayed in the character display area and each pitch adjustment control is displayed in the region corresponding to its character. This improves the display effect, makes the interface easier to use, and improves the user experience.
In some embodiments, the displaying, in the region corresponding to the target character in the portion of characters, a pitch adjustment control for the target character based on the pitch of the target audio segment includes:
and displaying a pitch adjustment control of the target character in a region corresponding to the target character, wherein the display position of the pitch adjustment control corresponds to the pitch of the target audio fragment.
Because the display position of the pitch adjustment control of the target character is determined by the pitch of the target audio segment, the user can read the pitch of the target audio segment from the display position of the pitch adjustment control, which makes the interface easier to use and improves the user experience.
In some embodiments, the method further comprises: displaying a bar graph corresponding to the target character below the pitch adjustment control of the target character, wherein the height of the bar graph corresponds to the pitch of the target audio segment.
By displaying, below the pitch adjustment control, a bar graph whose height corresponds to the pitch of the target audio segment, the forms of display available during audio processing are enriched.
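One way to make the display position and the bar height correspond to the pitch is a linear mapping, sketched below. The disclosure does not specify the mapping; the function name `control_layout`, the pixel units, the top-left screen origin, and the pitch bounds are all assumptions for illustration.

```python
from typing import Tuple


def control_layout(pitch: int, min_pitch: int, max_pitch: int, area_height: int) -> Tuple[int, int]:
    """Return (control_y, bar_height) in pixels for one character's region.

    Assumes y grows downward from the top of the region, so a higher
    pitch gives a smaller control_y and a taller bar underneath it.
    """
    if max_pitch <= min_pitch:
        raise ValueError("max_pitch must exceed min_pitch")
    fraction = (pitch - min_pitch) / (max_pitch - min_pitch)
    bar_height = round(fraction * area_height)
    control_y = area_height - bar_height
    return control_y, bar_height


middle = control_layout(60, 48, 72, 240)
highest = control_layout(72, 48, 72, 240)
lowest = control_layout(48, 48, 72, 240)
```

With this choice the control always sits on top of the bar, so the two visual cues cannot disagree.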
In some embodiments, before the displaying the partial character of the lyric in the character display area of the pitch adjustment interface, the method further comprises:
obtaining lyrics of the first audio, wherein the lyrics comprise a plurality of characters and time tags corresponding to the characters;
and acquiring, from the music file of the first audio, the pitch corresponding to the time tag of the target character, and then performing the step of displaying, in the region corresponding to the target character among the displayed characters, the pitch adjustment control of the target character based on the pitch of the target audio segment.
The lyrics of the first audio and the pitches corresponding to the characters in the first audio are obtained, the characters of the lyrics are displayed in the pitch adjustment interface, and the pitch adjustment controls corresponding to the characters are displayed in the interface based on those pitches. The user can then adjust the pitch of the audio segment corresponding to each character through the pitch adjustment interface, and thereby process the first audio.
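The lookup from a character's time tag to a pitch can be sketched as follows. The format of the music file is not given in the disclosure, so this example assumes it has already been parsed into a list of `(start_ms, end_ms, pitch)` notes sorted by start time; the `Note` alias and the `pitch_at` helper are hypothetical.

```python
from bisect import bisect_right
from typing import List, Optional, Tuple

Note = Tuple[int, int, int]  # (start_ms, end_ms, pitch), sorted by start_ms


def pitch_at(notes: List[Note], time_ms: int) -> Optional[int]:
    """Return the pitch of the note sounding at time_ms, or None if there is none."""
    starts = [note[0] for note in notes]
    i = bisect_right(starts, time_ms) - 1  # last note starting at or before time_ms
    if i >= 0 and notes[i][0] <= time_ms < notes[i][1]:
        return notes[i][2]
    return None


notes = [(0, 400, 60), (400, 900, 62), (1000, 1500, 64)]
lyrics = [("la", 0), ("li", 400), ("lu", 1000)]  # (character, time tag in ms)
char_pitches = [(char, pitch_at(notes, tag)) for char, tag in lyrics]
```

Returning `None` for a time tag that falls in a gap between notes leaves it to the caller to decide how such a character should be displayed.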
In some embodiments, the displaying the partial character of the lyric in the character display area of the pitch adjustment interface includes:
displaying the part of characters of the target lyrics in the lyrics in a character display area of the pitch adjustment interface;
and displaying the remaining characters of the target lyric in the character display area in response to a sliding operation on the displayed characters.
Because the character display area may not be capable of completely displaying all characters included in the target lyric, the method displays part of characters first, and sets a sliding function in the area for displaying the characters, so that a user can display the remaining characters in the target lyric in the character display area by performing sliding operation on the displayed characters, and further adjust the pitch of the audio segment corresponding to the remaining characters, thereby realizing the processing of the audio segment corresponding to the target lyric.
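The partial display with sliding can be pictured as a window over the characters of the target lyric. This is only a sketch: `visible_chars`, the character-count `capacity`, and the clamping of the scroll offset are assumptions, since the disclosure does not say how the sliding operation is translated into a display offset.

```python
def visible_chars(line: str, offset: int, capacity: int) -> str:
    """Return the characters shown when the display area holds `capacity` characters.

    `offset` is the index of the first visible character; it is clamped so
    that sliding past either end simply stops at that end.
    """
    max_offset = max(0, len(line) - capacity)
    offset = min(max(offset, 0), max_offset)
    return line[offset:offset + capacity]


initial = visible_chars("twinkle", 0, 4)
after_slide = visible_chars("twinkle", 3, 4)
```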
In some embodiments, the determining a target pitch of the target audio segment in response to the adjusting operation of the pitch-adjustment control corresponding to the target character includes:
responding to the sliding operation of the pitch adjustment control corresponding to the target character, and determining the pitch corresponding to the target position of the sliding operation as the target pitch;
or,
and in response to the triggering operation in the region corresponding to the target character, determining the pitch corresponding to the triggered target position as the target pitch.
The adjustment operation of the pitch adjustment control is provided as two selectable operation forms, namely sliding operation and triggering operation, so that the selectable operation forms of the user are increased, and the flexibility of the operation process of the user is improved.
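Both operation forms reduce to the same question: which pitch corresponds to the target position? A minimal sketch, assuming a linear vertical mapping with the y axis growing downward, is given below. The names `y_to_pitch`, `on_slide_end`, and `on_tap` and the pixel coordinates are hypothetical and do not come from the disclosure.

```python
def y_to_pitch(y: float, min_pitch: int, max_pitch: int, area_height: int) -> int:
    """Map a vertical position inside the character's region to the nearest pitch."""
    fraction = 1.0 - (y / area_height)  # top of the region = highest pitch
    return round(min_pitch + fraction * (max_pitch - min_pitch))


def on_slide_end(release_y: float) -> int:
    """Sliding operation: the target position is where the control is released."""
    return y_to_pitch(release_y, 48, 72, 240)


def on_tap(tap_y: float) -> int:
    """Triggering operation: the target position is where the region was tapped."""
    return y_to_pitch(tap_y, 48, 72, 240)


slide_target = on_slide_end(120)
tap_target = on_tap(0)
```

Positions outside the region are not handled in this sketch; range limiting is a separate concern.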
In some embodiments, the method further comprises:
and responding to the adjustment operation of the pitch adjustment control corresponding to the target character, and playing the target audio clip based on the pitch corresponding to the target position of the adjustment operation.
And when the adjustment operation is finished, playing the target audio clip based on the adjusted pitch corresponding to the target position, so that the user can know the effect of the target audio clip in time, and the user can further process the target audio clip.
In some embodiments, the method further comprises:
displaying a first solfège name in the region corresponding to the target character in the pitch adjustment interface of the first audio, wherein the first solfège name corresponds to the pitch of the target audio segment;
and if the first solfège name is different from a second solfège name, updating the first solfège name displayed in the region corresponding to the target character to the second solfège name, wherein the second solfège name corresponds to the pitch at the target position of the adjustment operation.
By displaying, in the region corresponding to the target character, the solfège name that corresponds to the pitch of the target audio segment, and updating the displayed solfège name in real time as the user adjusts, the user can judge the pitch of the target audio segment directly from the displayed solfège name. This improves the display effect and thereby the user experience.
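The name shown for a pitch and the update-only-if-different rule can be sketched as below. The disclosure does not define how a pitch maps to a displayed name; this example assumes MIDI note numbers, a tonic of 60, and the chromatic solfège syllables, and the function names are hypothetical.

```python
from typing import Tuple

# Chromatic solfège syllables, ascending from the tonic (an assumed convention).
SOLFEGE = ["do", "di", "re", "ri", "mi", "fa", "fi", "sol", "si", "la", "li", "ti"]


def solfege_name(pitch: int, tonic: int = 60) -> str:
    """Return the syllable for a pitch relative to the tonic, ignoring octave."""
    return SOLFEGE[(pitch - tonic) % 12]


def updated_label(displayed: str, new_pitch: int, tonic: int = 60) -> Tuple[str, bool]:
    """Return (label_to_show, changed) after an adjustment to new_pitch."""
    candidate = solfege_name(new_pitch, tonic)
    return candidate, candidate != displayed


step_up = updated_label("do", 62)      # a whole step up changes the label
octave_up = updated_label("do", 72)    # an octave up keeps the same syllable
```

Under these assumptions an octave shift leaves the label unchanged, which is a case where the first and second names are the same and no update is needed.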
In some embodiments, after displaying the character in the lyric of the first audio and the pitch adjustment control of the character in the pitch adjustment interface of the first audio, the method further comprises:
and responding to the adjustment operation of the pitch adjustment control corresponding to the target character, and displaying a pitch adjustment range.
By displaying the available pitch adjustment range when the user operates the pitch adjustment control, the user can adjust within the displayed range, which reduces out-of-range adjustments.
In some embodiments, after the pitch adjustment range is displayed in response to the adjustment operation on the pitch adjustment control corresponding to the target character, the method further comprises:
and responding to the adjustment operation exceeding the pitch adjustment range, and displaying prompt information for prompting that the adjustment operation exceeds the pitch adjustment range.
When the user's adjustment operation exceeds the pitch adjustment range, the prompt information notifies the user promptly, so that the user can correct the out-of-range adjustment in time.
In some embodiments, the pitch adjustment range includes a minimum adjusted pitch and a maximum adjusted pitch;
after the pitch adjustment range is displayed in response to the adjustment operation of the pitch adjustment control corresponding to the character, the method further includes:
if the pitch corresponding to the target position of the adjustment operation is smaller than the minimum adjustment pitch, determining the minimum adjustment pitch as the target pitch;
and if the pitch corresponding to the target position of the adjustment operation is larger than the maximum adjustment pitch, determining the maximum adjustment pitch as the target pitch.
When the adjustment operation of the user exceeds the pitch adjustment range, the minimum adjustment pitch or the maximum adjustment pitch is directly determined as the target pitch according to the condition of the adjustment operation, so that the user can continuously adjust on the basis of the minimum adjustment pitch or the maximum adjustment pitch without readjusting from the most original pitch, the operation cost of the user is reduced, and the efficiency of the audio processing process is improved.
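The out-of-range handling amounts to clamping the requested pitch to the adjustment range. In the sketch below, the function name `resolve_target_pitch` and the returned flag, which a caller could use to decide whether to show the prompt information, are illustrative assumptions.

```python
from typing import Tuple


def resolve_target_pitch(requested: int, min_pitch: int, max_pitch: int) -> Tuple[int, bool]:
    """Return (target_pitch, out_of_range) for a requested pitch."""
    if requested < min_pitch:
        return min_pitch, True   # below the range: use the minimum adjusted pitch
    if requested > max_pitch:
        return max_pitch, True   # above the range: use the maximum adjusted pitch
    return requested, False


too_low = resolve_target_pitch(40, 48, 72)
too_high = resolve_target_pitch(80, 48, 72)
in_range = resolve_target_pitch(60, 48, 72)
```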
In some embodiments, the pitch adjustment interface further comprises a first lyric switching control for switching to a previous sentence of lyrics and a second lyric switching control for switching to a next sentence of lyrics;
the method further comprises the following steps:
responding to the triggering operation of the first lyric switching control, and switching the currently displayed lyrics of the pitch adjustment interface into the lyrics of the previous sentence;
and responding to the triggering operation of the second lyric switching control, and switching the currently displayed lyrics of the pitch adjustment interface into the next lyric.
The first lyric switching control and the second lyric switching control are provided in the pitch adjustment interface, so that a user can switch the currently displayed lyrics through the two lyric switching controls, other lyrics are processed, and further all lyrics in the first audio are processed.
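The two switching controls can be modeled as moving an index over the sentences of the lyrics. The `LyricPager` class is hypothetical, and staying on the first or last sentence when there is nothing further to switch to is an assumption; the disclosure does not describe the boundary behavior.

```python
from typing import List


class LyricPager:
    """Tracks which sentence of lyrics the pitch adjustment interface shows."""

    def __init__(self, lines: List[str]) -> None:
        if not lines:
            raise ValueError("lyrics must contain at least one line")
        self.lines = lines
        self.index = 0

    def previous(self) -> str:
        """First switching control: move to the previous sentence, if any."""
        self.index = max(0, self.index - 1)
        return self.lines[self.index]

    def next(self) -> str:
        """Second switching control: move to the next sentence, if any."""
        self.index = min(len(self.lines) - 1, self.index + 1)
        return self.lines[self.index]


pager = LyricPager(["line one", "line two", "line three"])
```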
In some embodiments, the pitch adjustment interface further comprises a play control;
the method further comprises the following steps:
and in response to the triggering operation of the playing control, playing a first audio segment based on a target pitch of the first audio segment, wherein the first audio segment corresponds to the lyrics currently displayed by the pitch adjustment interface.
By providing the playing control in the pitch adjustment interface, a user can play the first audio segment through the playing control based on the pitch of the first audio segment corresponding to the currently displayed lyrics, so that the user can preview the audio segment after each sentence of lyrics is adjusted, and further perform subsequent processing based on the preview result.
In some embodiments, the method further comprises:
highlighting the currently played character while playing the first audio clip;
and/or,
while playing the first audio clip, highlighting a pitch adjustment control corresponding to the currently played character.
In the process of playing the first audio segment, the currently played character or the pitch adjustment control corresponding to the currently played character is highlighted, or both are highlighted, so that a user can clearly know which character is played currently, and further processing is performed based on the effect of the played audio.
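Deciding which character to highlight during playback can be sketched as a lookup of the playback position in the characters' time spans. The span representation, the function names, and the bracket notation used to stand in for highlighting are assumptions for illustration only.

```python
from typing import List, Optional, Tuple


def playing_index(spans: List[Tuple[int, int]], position_ms: int) -> Optional[int]:
    """Return the index of the character whose [start, end) span contains position_ms."""
    for i, (start_ms, end_ms) in enumerate(spans):
        if start_ms <= position_ms < end_ms:
            return i
    return None


def render(chars: List[str], spans: List[Tuple[int, int]], position_ms: int) -> str:
    """Render the line as text, wrapping the currently played character in brackets."""
    active = playing_index(spans, position_ms)
    return " ".join(f"[{c}]" if i == active else c for i, c in enumerate(chars))


chars = ["la", "li", "lu"]
spans = [(0, 400), (400, 900), (1000, 1500)]
during_second = render(chars, spans, 450)
in_gap = render(chars, spans, 950)
```

The same index could equally be used to highlight the pitch adjustment control of the character, or both the character and its control.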
According to a second aspect of embodiments of the present disclosure, there is provided an audio processing apparatus, the apparatus comprising:
a display unit configured to display, in a pitch adjustment interface of a first audio, characters in lyrics of the first audio and pitch adjustment controls of the characters, wherein a pitch adjustment control is used for adjusting the pitch of the audio segment corresponding to its character in the first audio;
a determining unit configured to determine, in response to an adjustment operation on the pitch adjustment control corresponding to a target character, a target pitch of a target audio segment, wherein the target audio segment is the audio segment corresponding to the target character;
and a generating unit configured to generate a second audio based on the adjusted target audio segment.
In some embodiments, the display unit is configured to display a part of the characters in the lyrics in a character display area of the pitch adjustment interface;
the display unit is further configured to display, in the region corresponding to the target character among the displayed characters, a pitch adjustment control of the target character based on the pitch of the target audio segment.
In some embodiments, the display unit is configured to perform displaying a pitch adjustment control of the target character in a region corresponding to the target character, a display position of the pitch adjustment control corresponding to a pitch of the target audio segment.
In some embodiments, the display unit is further configured to perform displaying a bar graph corresponding to the target character below the pitch adjustment control of the target character, a height of the bar graph corresponding to a pitch of the target audio piece.
In some embodiments, the apparatus further comprises:
an obtaining unit configured to perform obtaining lyrics of the first audio, the lyrics including a plurality of characters and time tags corresponding to the characters;
the obtaining unit is further configured to acquire, from the music file of the first audio, the pitch corresponding to the time tag of the target character, and then to perform the step of displaying, in the region corresponding to the target character among the displayed characters, the pitch adjustment control of the target character based on the pitch of the target audio segment.
In some embodiments, the display unit is configured to perform displaying the part of the characters of the target lyric in the lyric in a character display area of the pitch adjustment interface; and displaying the remaining characters of the target lyric in the character display area in response to a sliding operation on the displayed characters.
In some embodiments, the determining unit is configured to perform, in response to a sliding operation on a pitch adjustment control corresponding to the target character, determining a pitch corresponding to a target position of the sliding operation as the target pitch;
the determining unit is also configured to execute the step of determining the pitch corresponding to the triggered target position as the target pitch in response to the triggering operation in the region corresponding to the target character.
In some embodiments, the apparatus further comprises:
a first playing unit configured to play, in response to an adjustment operation on the pitch adjustment control corresponding to the target character, the target audio segment based on the pitch corresponding to the target position of the adjustment operation.
In some embodiments, the display unit is further configured to display a first solfège name in the region corresponding to the target character in the pitch adjustment interface of the first audio, the first solfège name corresponding to the pitch of the target audio segment;
the apparatus further comprises:
an updating unit configured to update, if the first solfège name is different from a second solfège name, the first solfège name displayed in the region corresponding to the target character to the second solfège name, wherein the second solfège name corresponds to the pitch at the target position of the adjustment operation.
In some embodiments, the display unit is further configured to display a pitch adjustment range in response to an adjustment operation on the pitch adjustment control corresponding to the target character.
In some embodiments, the display unit is further configured to perform, in response to the adjustment operation exceeding the pitch adjustment range, displaying a prompt message for prompting the adjustment operation exceeding the pitch adjustment range.
In some embodiments, the pitch adjustment range includes a minimum adjusted pitch and a maximum adjusted pitch;
the determining unit is further configured to determine the minimum adjusted pitch as the target pitch if the pitch corresponding to the target position of the adjusting operation is smaller than the minimum adjusted pitch;
the determining unit is further configured to determine the maximum adjusted pitch as the target pitch if the pitch corresponding to the target position of the adjusting operation is larger than the maximum adjusted pitch.
In some embodiments, the pitch adjustment interface further comprises a first lyric switching control for switching to a previous sentence of lyrics and a second lyric switching control for switching to a next sentence of lyrics;
the device also includes:
a switching unit configured to switch, in response to a triggering operation on the first lyric switching control, the lyrics currently displayed in the pitch adjustment interface to the previous sentence of lyrics;
the switching unit is further configured to switch, in response to a triggering operation on the second lyric switching control, the lyrics currently displayed in the pitch adjustment interface to the next sentence of lyrics.
In some embodiments, the pitch adjustment interface further comprises a play control;
the device also includes:
and the second playing unit is configured to execute playing of the first audio segment based on the target pitch of the first audio segment in response to the triggering operation of the playing control, wherein the first audio segment corresponds to the currently displayed lyrics of the pitch adjustment interface.
In some embodiments, the display unit is further configured to perform highlighting the currently played character while playing the first audio clip;
the display unit is also configured to highlight the pitch adjustment control corresponding to the currently played character when the first audio segment is played.
According to a third aspect of the embodiments of the present disclosure, there is provided a terminal, including:
one or more processors;
a memory for storing the processor executable program code;
wherein the processor is configured to execute the program code to implement the audio processing method described above.
According to a fourth aspect of embodiments of the present disclosure, there is provided a computer-readable storage medium in which program code is provided that, when executed by a processor of a terminal, enables the terminal to perform the above-described audio processing method.
According to a fifth aspect of embodiments of the present disclosure, there is provided a computer program product comprising a computer program which, when executed by a processor, implements the audio processing method described above.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the disclosure.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present disclosure and, together with the description, serve to explain the principles of the disclosure and are not to be construed as limiting the disclosure.
FIG. 1 is a schematic diagram of an implementation environment of an audio processing method according to an example embodiment;
FIG. 2 is a flow diagram illustrating a method of audio processing according to an exemplary embodiment;
FIG. 3 is a flow diagram illustrating a method of audio processing according to an exemplary embodiment;
FIG. 4 is an interface diagram illustrating a pitch adjustment interface in accordance with an exemplary embodiment;
FIG. 5 is a block diagram illustrating an audio processing device according to an exemplary embodiment;
FIG. 6 is a block diagram illustrating a terminal 600 according to an exemplary embodiment.
Detailed Description
In order to make the technical solutions of the present disclosure better understood by those of ordinary skill in the art, the technical solutions in the embodiments of the present disclosure will be clearly and completely described below with reference to the accompanying drawings.
It should be noted that the terms "first," "second," and the like in the description and claims of the present disclosure and in the above-described drawings are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used is interchangeable under appropriate circumstances such that the embodiments of the disclosure described herein are capable of operation in sequences other than those illustrated or otherwise described herein. The implementations described in the exemplary embodiments below are not intended to represent all implementations consistent with the present disclosure. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the present disclosure, as detailed in the appended claims.
The data (e.g., audio) involved in the present disclosure is data authorized by the user or fully authorized by all parties.
FIG. 1 is a schematic diagram of an implementation environment of an audio processing method according to an embodiment of the present disclosure. As shown in FIG. 1, the implementation environment includes a terminal 101 and a server 102.
The terminal 101 is at least one of a smartphone, a smart watch, a desktop computer, a laptop computer, a virtual reality terminal, an augmented reality terminal, a wireless terminal, and the like. The terminal 101 has a communication function and can be connected to the server 102 directly or indirectly through wired or wireless communication. The terminal 101 refers generally to one of a plurality of terminals, and the embodiments of the present disclosure are illustrated with the terminal 101 only. Those skilled in the art will appreciate that the number of terminals may be greater or fewer in other possible implementations. Various types of applications are installed and run on the terminal 101, for example, audio applications (e.g., a karaoke application, an audio playback application, an audio editing application, etc.). The terminal 101 obtains audio information (such as audio name, audio author, audio creation time, etc.) of a plurality of audios from the server 102 and displays it, so that the user can select an audio of interest based on the audio information. In response to a triggering operation by the user on any piece of audio information, the terminal 101 sends an audio acquisition request carrying an audio identifier to the server 102 and receives the first audio returned by the server 102. The pitch of the audio segments included in the first audio is then adjusted on the terminal 101 to obtain a second audio, so that the user can sing along with the second audio that the user has adjusted. Optionally, the user can also upload the second audio to the server 102 through the terminal 101, so that other users can acquire the second audio from the server 102.
The server 102 is an independent physical server, or a server cluster or distributed file system formed by a plurality of physical servers, or a cloud server providing basic cloud computing services such as cloud services, cloud databases, cloud computing, cloud functions, cloud storage, web services, cloud communication, middleware services, domain name services, security services, a CDN (Content Delivery Network), and big data and artificial intelligence platforms. Optionally, the server 102 and the terminal 101 are connected directly or indirectly through wired or wireless communication, which is not limited in the embodiments of the present disclosure. The server 102 is associated with an audio database that stores a plurality of audios and the audio information of those audios. In response to receiving an audio information acquisition request from the terminal 101, the server 102 acquires the audio information of a plurality of audios from the audio database and sends it to the terminal 101, so that the terminal 101 displays the received audio information. On receiving an audio acquisition request from the terminal 101, the server 102 acquires the audio corresponding to the audio identifier carried by the request from the audio database and returns it to the terminal 101. Optionally, the server 102 may also receive audio uploaded by the terminal 101 and store it in the audio database, so that when another terminal requests the audio uploaded by the terminal 101, the corresponding audio is retrieved from the audio database and sent to that terminal. Optionally, the number of servers 102 may be greater or fewer, which is not limited by the embodiments of the present disclosure. Of course, the server 102 may also include other functional servers to provide more comprehensive and diverse services.
Fig. 2 is a flowchart illustrating an audio processing method according to an exemplary embodiment, which is performed by a terminal as illustrated in fig. 2, and includes the steps of:
In step 201, the terminal displays characters in lyrics of a first audio and a pitch adjustment control of the characters in a pitch adjustment interface of the first audio, wherein the pitch adjustment control is used for adjusting the pitch of an audio segment corresponding to the characters in the first audio.
In step 202, the terminal determines a target pitch of a target audio segment in response to an adjustment operation on a pitch adjustment control corresponding to a target character, where the target audio segment is an audio segment corresponding to the target character.
In step 203, the terminal generates a second audio based on the adjusted target audio segment.
According to this technical scheme, a pitch adjustment interface is provided in which the lyrics of the part being adjusted are displayed visually while the pitch of the corresponding audio segment is adjusted. The adjustment no longer requires a professional to read a musical score, so the learning cost is low, the operation is convenient and fast, and the audio processing efficiency is improved.
In some embodiments, the displaying the character in the lyric of the first audio and the pitch adjustment control of the character in the pitch adjustment interface of the first audio includes:
displaying a part of characters in the lyric in a character display area of the pitch adjustment interface;
and displaying a pitch adjustment control of the target character in the area corresponding to the target character in the partial character based on the pitch of the target audio fragment.
In some embodiments, the displaying, in the region corresponding to the target character in the portion of characters, a pitch adjustment control for the target character based on the pitch of the target audio segment includes:
and displaying a pitch adjustment control of the target character in a region corresponding to the target character, wherein the display position of the pitch adjustment control corresponds to the pitch of the target audio fragment.
In some embodiments, the method further comprises: displaying a bar graph corresponding to the target character below the pitch adjustment control of the target character, wherein the height of the bar graph corresponds to the pitch of the target audio segment.
In some embodiments, before the displaying the partial character of the lyric in the character display area of the pitch adjustment interface, the method further comprises:
obtaining lyrics of the first audio, wherein the lyrics comprise a plurality of characters and time labels corresponding to the characters;
and acquiring, from the music score file of the first audio, a pitch corresponding to the time tag of the target character, and then performing the step of displaying, in the region corresponding to the target character in the partial characters, a pitch adjustment control of the target character based on the pitch of the target audio segment.
In some embodiments, the displaying the partial character of the lyric in the character display area of the pitch adjustment interface includes:
displaying the part of characters of the target lyrics in the lyrics in a character display area of the pitch adjustment interface;
and displaying the remaining characters of the target lyric in the character display area in response to a sliding operation on the displayed characters.
In some embodiments, the determining a target pitch of the target audio segment in response to the adjusting operation of the pitch-adjustment control corresponding to the target character includes:
responding to the sliding operation of the pitch adjustment control corresponding to the target character, and determining the pitch corresponding to the target position of the sliding operation as the target pitch;
or,
and in response to the triggering operation in the region corresponding to the target character, determining the pitch corresponding to the triggered target position as the target pitch.
In some embodiments, the method further comprises:
and responding to the adjustment operation of the pitch adjustment control corresponding to the target character, and playing the target audio clip based on the pitch corresponding to the target position of the adjustment operation.
In some embodiments, the method further comprises:
displaying a first sol-fa name in a corresponding area of the target character in a pitch adjustment interface of a first audio, wherein the first sol-fa name corresponds to the pitch of the target audio segment;
and if the first sol-fa name is different from a second sol-fa name, updating the first sol-fa name displayed in the corresponding area of the target character to the second sol-fa name, wherein the second sol-fa name corresponds to the pitch corresponding to the target position of the adjustment operation.
In some embodiments, after displaying the character in the lyric of the first audio and the pitch adjustment control of the character in the pitch adjustment interface of the first audio, the method further comprises:
and responding to the adjustment operation of the pitch adjustment control corresponding to the target character, and displaying a pitch adjustment range.
In some embodiments, after the pitch adjustment range is displayed in response to the adjustment operation on the pitch adjustment control corresponding to the target character, the method further comprises:
and responding to the adjustment operation exceeding the pitch adjustment range, and displaying prompt information for prompting that the adjustment operation exceeds the pitch adjustment range.
In some embodiments, the pitch adjustment range includes a minimum adjusted pitch and a maximum adjusted pitch;
after the pitch adjustment range is displayed in response to the adjustment operation of the pitch adjustment control corresponding to the character, the method further includes:
if the pitch corresponding to the target position of the adjustment operation is smaller than the minimum adjustment pitch, determining the minimum adjustment pitch as the target pitch;
and if the pitch corresponding to the target position of the adjustment operation is larger than the maximum adjustment pitch, determining the maximum adjustment pitch as the target pitch.
In some embodiments, the pitch adjustment interface further comprises a first lyric switching control for switching to a previous sentence of lyrics and a second lyric switching control for switching to a next sentence of lyrics;
the method further comprises the following steps:
responding to the triggering operation of the first lyric switching control, and switching the currently displayed lyrics of the pitch adjustment interface into the lyrics of the previous sentence;
and responding to the triggering operation of the second lyric switching control, and switching the currently displayed lyrics of the pitch adjustment interface into the next lyric.
In some embodiments, the pitch adjustment interface further comprises a play control;
the method further comprises the following steps:
and in response to the triggering operation of the playing control, playing a first audio segment based on a target pitch of the first audio segment, wherein the first audio segment corresponds to the lyrics currently displayed by the pitch adjustment interface.
In some embodiments, the method further comprises:
highlighting the currently played character while playing the first audio clip;
and/or,
while playing the first audio clip, highlighting a pitch adjustment control corresponding to the currently played character.
Fig. 2 shows the basic flow of the present disclosure. The scheme provided by the present disclosure is further explained below based on a specific implementation. Fig. 3 is a flowchart of an audio processing method according to an exemplary embodiment. As shown in fig. 3, the audio processing method is executed by a terminal and includes the following steps:
In step 301, the terminal obtains lyrics of a first audio, where the lyrics include a plurality of characters and time tags corresponding to the characters.
Wherein, the first audio is the audio produced in advance by the professional. The first audio and the lyric file and the music score file corresponding to the first audio are stored in an audio database associated with the server, when a user triggers an audio acquisition request on the terminal, the terminal sends the audio acquisition request to the server to acquire the first audio and the lyric file and the music score file corresponding to the first audio, and the audio acquisition request carries an audio identifier of the first audio. The server responds to a received audio acquisition request of the terminal, acquires a first audio corresponding to the audio identifier, a lyric file and a music score file corresponding to the first audio from an audio database, and further sends the acquired first audio and the lyric file and the music score file corresponding to the first audio to the terminal.
In some embodiments, the terminal obtains the lyrics of the first audio from a lyrics file of the first audio. The acquired lyrics of the first audio comprise a plurality of characters, and each character is provided with a corresponding time tag.
In step 302, the terminal obtains a pitch corresponding to the time tag of the target character from the music score file of the first audio.
The target character is any character in the lyrics.
In some embodiments, the terminal obtains, from the music score file of the first audio, a pitch corresponding to each audio segment in the first audio, where each audio segment corresponds to one character and is provided with a corresponding time tag. The terminal then determines, in combination with the time tag of each character obtained in step 301, the pitch corresponding to each character, and thereby obtains the pitch corresponding to the target character.
When the pitch corresponding to each character is determined based on the time label corresponding to each audio clip and the time label corresponding to each character, the audio clip and the character with the consistent time labels are determined, and then the pitch of the audio clip is determined to be the pitch corresponding to the character with the consistent time label.
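The time-tag matching described above can be sketched as follows. This is a minimal illustration only: the `LyricChar` and `ScoreNote` types, the millisecond time tags, and the integer pitch values are assumptions made for the example and are not defined by the disclosure.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class LyricChar:
    text: str
    time_tag_ms: int  # assumed unit: milliseconds


@dataclass
class ScoreNote:
    time_tag_ms: int
    pitch: int  # assumed representation: integer note number


def pitch_per_character(chars: List[LyricChar],
                        notes: List[ScoreNote]) -> List[Optional[int]]:
    """Give each character the pitch of the audio segment whose time tag matches."""
    pitch_by_tag = {note.time_tag_ms: note.pitch for note in notes}
    # A character without a matching segment gets None.
    return [pitch_by_tag.get(char.time_tag_ms) for char in chars]


chars = [LyricChar("i", 0), LyricChar("will", 400), LyricChar("month", 800)]
notes = [ScoreNote(0, 60), ScoreNote(400, 62), ScoreNote(800, 64)]
pitches = pitch_per_character(chars, notes)
```

A dictionary keyed by time tag keeps the lookup simple; how the actual lyric file and music score file encode their time tags is not stated in the source.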
In step 303, the terminal displays a part of characters in the lyric in a character display area of a pitch adjustment interface of the first audio.
In some embodiments, the pitch adjustment interface includes a character display area for displaying characters in the lyrics. For example, referring to fig. 4, fig. 4 is an interface diagram illustrating a pitch adjustment interface according to an exemplary embodiment, in the pitch adjustment interface shown in fig. 4, a region 401 is the character display region.
It should be noted that, since the size of the character display area is limited, it may not be possible to display all characters in the lyrics in the character display area at the same time. In some embodiments, therefore, only a part of the characters in the lyrics is displayed in the character display area of the pitch adjustment interface, and a sliding function is provided in the character display area. The user can perform a sliding operation on the characters displayed in the character display area, and the terminal displays the remaining characters in the lyrics in the character display area in response to the sliding operation.
Still taking the interface diagram of the pitch adjustment interface shown in fig. 4 as an example, only the five characters "i", "will", "month", "bright", and "send" are displayed in the area 401, which is the character display area, rather than all characters in the lyrics. The user can perform a sliding operation on the five displayed characters, so that the terminal displays more characters in the area 401.
Because the character display area may not be able to display all characters included in the lyrics, the terminal displays part of the characters first and provides a sliding function in the area in which the characters are displayed. The user can then bring the remaining characters into the character display area by sliding the displayed characters and adjust the pitch of the audio segments corresponding to the remaining characters, so that the whole audio can be processed.
Optionally, the pitch adjustment interface further includes a lyric display area, where the lyric display area is used to display the lyrics to which the characters displayed in the character display area belong.
In some embodiments, if the pitch adjustment interface includes a lyric display area, the terminal displays a target lyric in the lyric display area, where the target lyric is any one sentence of the lyrics, and displays a part of the characters of the target lyric in the character display area of the pitch adjustment interface. In addition, if the user performs a sliding operation on the characters displayed in the character display area, the terminal displays the remaining characters of the target lyric in the character display area in response to the sliding operation.
Still taking the interface schematic diagram of the pitch adjustment interface shown in fig. 4 as an example, the region 402 is the lyric display region, and the lyrics displayed in the lyric display region are the lyrics to which the characters displayed in the character display region belong. That is, the five characters "i", "will", "month", "bright" and "send" displayed in the character display region are all characters of the lyric "i send the moon to you" displayed in the lyric display region. If the user performs a sliding operation on the characters displayed in the character display area, the terminal can, in response to the sliding operation, display the two remaining characters "give" and "you" of "i send the moon to you". Alternatively, the terminal, in response to the sliding operation, displays the five characters "month", "bright", "send", "give" and "you" of "i send the moon to you", so that the number of characters displayed in the character display area each time reaches the maximum number that the area can display, which improves the display effect.
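The two sliding behaviours in the fig. 4 example (showing only the leftover characters, or keeping the character display area full) can be sketched as below. The function name `next_window` and the `keep_full` flag are hypothetical; the window size of five is taken from the example, and the English glosses stand in for the lyric characters.

```python
from typing import List, Tuple


def next_window(chars: List[str], start: int, size: int,
                keep_full: bool = False) -> Tuple[int, List[str]]:
    """Return (new_start, visible_chars) after one forward sliding operation."""
    new_start = start + size
    if new_start >= len(chars):
        # Nothing left to reveal: keep the current window.
        new_start = start
    elif keep_full:
        # Pull the window back so that it still holds `size` characters.
        new_start = min(new_start, max(len(chars) - size, 0))
    return new_start, chars[new_start:new_start + size]


line = ["i", "will", "month", "bright", "send", "give", "you"]
remainder_only = next_window(line, 0, 5)               # shows "give", "you"
full_window = next_window(line, 0, 5, keep_full=True)  # shows the last five characters
```

With `keep_full=True` the area always shows its maximum number of characters, which corresponds to the second alternative in the example.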
Optionally, the pitch adjustment interface further includes a first lyric switching control and a second lyric switching control, the first lyric switching control is used to switch to a lyric of a previous sentence, and the second lyric switching control is used to switch to a lyric of a next sentence. For example, in a lyric display area of the pitch adjustment interface, a first lyric switching control and a second lyric switching control are arranged.
The first lyric switching control and the second lyric switching control are provided in the pitch adjustment interface, so that a user can switch the currently displayed lyrics through the two lyric switching controls, other lyrics are processed, and further all lyrics in the first audio are processed.
Still taking the interface schematic diagram of the pitch adjustment interface shown in fig. 4 as an example, in the pitch adjustment interface shown in fig. 4, in the area 402 as the lyric display area, a first lyric switching control 411 and a second lyric switching control 412 are arranged.
In some embodiments, if the user wants to process the audio segment corresponding to the previous lyric of the currently displayed target lyric, the user triggers the first lyric switching control, and the terminal switches the lyrics currently displayed in the pitch adjustment interface to the previous lyric in response to the triggering operation on the first lyric switching control. For example, in response to the user's triggering operation on the first lyric switching control, the terminal switches the target lyric currently displayed in the lyric display area of the pitch adjustment interface to the previous lyric of the target lyric.
Correspondingly, if the currently displayed lyrics are switched to the previous lyric, the characters displayed in the character display area are also switched to a part of the characters in the previous lyric.
In other embodiments, if the user wants to process the audio segment corresponding to the next lyric of the currently displayed target lyric, the user triggers the second lyric switching control, and the terminal switches the lyrics currently displayed in the pitch adjustment interface to the next lyric in response to the triggering operation on the second lyric switching control. For example, in response to the user's triggering operation on the second lyric switching control, the terminal switches the target lyric currently displayed in the lyric display area of the pitch adjustment interface to the next lyric of the target lyric.
Correspondingly, if the currently displayed lyrics are switched to the next lyric, the characters displayed in the character display area are also switched to a part of the characters in the next lyric.
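A minimal sketch of the lyric switching behaviour follows. The `LyricPager` class is hypothetical, and staying on the first or last lyric when there is nothing further to switch to is an assumption; the disclosure does not say what happens at either end.

```python
from typing import List


class LyricPager:
    """Tracks which lyric line is shown and which of its characters are visible."""

    def __init__(self, lines: List[List[str]], window_size: int = 5):
        if not lines:
            raise ValueError("at least one lyric line is required")
        self.lines = lines
        self.window_size = window_size
        self.index = 0

    def visible_characters(self) -> List[str]:
        # Only the first `window_size` characters of the current line are shown.
        return self.lines[self.index][: self.window_size]

    def previous(self) -> List[str]:
        # First lyric switching control: go to the previous line (assumed clamped).
        self.index = max(self.index - 1, 0)
        return self.visible_characters()

    def next(self) -> List[str]:
        # Second lyric switching control: go to the next line (assumed clamped).
        self.index = min(self.index + 1, len(self.lines) - 1)
        return self.visible_characters()


pager = LyricPager([["a", "b", "c"], ["d", "e", "f", "g", "h", "i"]], window_size=5)
```

Switching lines resets the visible characters to the start of the new line, so the character display area always follows the lyric display area.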
In step 304, the terminal displays a pitch adjustment control of the target character in an area corresponding to the target character in the part of characters based on the pitch of the target audio segment, wherein the display position of the pitch adjustment control corresponds to the pitch of the target audio segment, and the pitch adjustment control is used for adjusting the pitch of the audio segment corresponding to the target character in the first audio.
In some embodiments, the pitch adjustment interface includes a control display area, which is used to display the pitch adjustment control corresponding to each displayed character based on the pitch of the audio clip corresponding to that character. The control display area is further divided into a plurality of sub-areas, each of which displays one pitch adjustment control and corresponds to one target character. For example, still taking the interface schematic diagram of the pitch adjustment interface shown in fig. 4 as an example, the region 403 is the control display region, and the controls displayed in the control display region correspond one to one to the characters displayed in the character display region.
Different functional areas are divided in the pitch adjustment interface, so that lyrics are displayed in a lyric display area, characters are displayed in a character display area, and a pitch adjustment control is displayed in an area corresponding to the characters, so that the display effect is improved, the use by a user is facilitated, and the user experience is improved.
Still taking the interface schematic diagram of the pitch adjustment interface shown in fig. 4 as an example, in the pitch adjustment interface shown in fig. 4, only five characters of "i", "will", "month", "bright", and "send" are displayed in the region 401 as the character display region, and the pitch adjustment controls displayed in the control display region also only include five pitch adjustment controls corresponding to the five characters of "i", "will", "month", "bright", and "send", that is, the pitch adjustment control 413 corresponding to the character "i", the pitch adjustment control 414 corresponding to the character "will", the pitch adjustment control 415 corresponding to the character "month", the pitch adjustment control 416 corresponding to the character "bright", and the pitch adjustment control 417 corresponding to the character "send".
It should be noted that, in step 303, if a part of characters in the lyrics is displayed in the character display area of the pitch adjustment interface, the pitch adjustment control displayed in the control display area is only the pitch adjustment control corresponding to the part of characters. And if the user performs sliding operation on the characters displayed in the character display area, so that the characters displayed in the character display area are updated, the display position of the pitch adjustment control displayed in the pitch adjustment interface is also correspondingly changed into the position corresponding to the pitch of the audio clip corresponding to the updated displayed characters.
In addition, if the user triggers the first lyric switching control or the second lyric switching control of the pitch adjustment interface, the display position of the pitch adjustment control in the pitch adjustment interface is also updated along with the switching of the lyrics, and the updated display position of the pitch adjustment control corresponds to the pitch of the audio segment corresponding to the part of the characters displayed from the switched lyrics.
In some embodiments, the terminal displays a bar graph corresponding to the target character below the pitch adjustment control of the target character, the height of the bar graph corresponding to the pitch of the target audio piece.
Still taking the interface schematic diagram of the pitch adjustment interface shown in fig. 4 as an example, a corresponding bar graph is displayed below each pitch adjustment control, and the height of each bar graph corresponds to the pitch of the audio segment corresponding to the respective character.
By displaying a bar graph with a height corresponding to the pitch of the target audio piece below the pitch adjustment control, the display form in the audio processing process can be enriched.
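One way to make the control position and bar height "correspond to" the pitch is a linear mapping, sketched below. The linear form, the pixel units, the integer pitch values, and the screen convention with y growing downward are all assumptions for illustration; the disclosure only states that position and height correspond to the pitch.

```python
def bar_height(pitch: int, min_pitch: int, max_pitch: int,
               area_height: float) -> float:
    """Height of the bar for `pitch`, scaled linearly into the control sub-area."""
    span = max_pitch - min_pitch
    if span <= 0:
        raise ValueError("max_pitch must be greater than min_pitch")
    return (pitch - min_pitch) / span * area_height


def control_y(pitch: int, min_pitch: int, max_pitch: int,
              area_height: float) -> float:
    """Vertical position of the control, which sits on top of its bar.

    Assumes y = 0 is the top edge of the sub-area and y grows downward.
    """
    return area_height - bar_height(pitch, min_pitch, max_pitch, area_height)


middle_height = bar_height(66, 60, 72, 240.0)
top_position = control_y(72, 60, 72, 240.0)
```

Because the control is placed at the top of the bar, the two visual cues stay consistent whenever the pitch changes.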
In some embodiments, the terminal displays a first sol-fa name in the area corresponding to the target character in the pitch adjustment interface of the first audio, the first sol-fa name corresponding to the pitch of the target audio segment. A sol-fa name is a name used when singing a melody to make the score easier to sing; commonly used sol-fa names include "do", "re", "mi", "fa", "sol", "la" and "si".
Still taking the interface schematic diagram of the pitch adjustment interface shown in fig. 4 as an example, the sol-fa name corresponding to the pitch of the audio segment corresponding to each character is displayed at the bottom of the bar graph located below each pitch adjustment control.
By displaying the sol-fa name corresponding to the pitch of the target audio segment in the area corresponding to the target character, the user can directly judge the pitch of the target audio segment from the displayed sol-fa name, which improves the display effect and the user experience.
It should be noted that, the sequence indicated by the step numbers of the step 301 to the step 304 does not limit the execution sequence of the step 301 to the step 304, in other embodiments, after the terminal executes the step 301, the terminal executes the step 303, then executes the step 302, and after the step 302 is executed, the terminal executes the step 304, or executes the step 301 to the step 304 in other sequences, which is not limited in the embodiment of the present disclosure.
In step 305, the terminal determines a target pitch of a target audio segment in response to the adjustment operation of the pitch adjustment control corresponding to the target character, where the target audio segment is an audio segment corresponding to the target character.
Optionally, the user performs the adjustment operation by sliding the pitch adjustment control, by performing a triggering operation in the region corresponding to the target character, or in other optional manners, which is not limited in this disclosure.
In some embodiments, the user adjusts the pitch of the target audio segment by sliding the pitch adjustment control corresponding to the target character, and the terminal determines the pitch corresponding to the target position of the sliding operation as the target pitch in response to the sliding operation on the pitch adjustment control corresponding to the target character.
In other embodiments, the user adjusts the pitch of the target audio segment by performing a trigger operation in the region corresponding to the target character, and the terminal determines the pitch corresponding to the triggered target position as the target pitch in response to the trigger operation in the region corresponding to the target character.
The adjustment operation of the pitch adjustment control is provided as two selectable operation forms, namely sliding operation and triggering operation, so that the selectable operation forms of the user are increased, and the flexibility of the operation process of the user is improved.
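Both operation forms end with a target position inside the region of the target character, so a single function can turn that position into a pitch. The sketch below assumes a linear layout in pixels with y = 0 at the top of the region, and rounds to the nearest integer pitch; none of these details is specified in the disclosure, and `pitch_at_position` is a hypothetical name.

```python
def pitch_at_position(y: float, area_height: float,
                      min_pitch: int, max_pitch: int) -> int:
    """Pitch for the end point of a slide, or for the point of a tap.

    y = 0 is assumed to be the top of the region, so a smaller y means a
    higher pitch. Positions outside the region yield pitches outside
    [min_pitch, max_pitch]; range handling is left to the caller.
    """
    if area_height <= 0:
        raise ValueError("area_height must be positive")
    fraction = 1.0 - y / area_height
    return min_pitch + round(fraction * (max_pitch - min_pitch))


slide_target = pitch_at_position(120.0, 240.0, 60, 72)  # slide released mid-way
tap_target = pitch_at_position(0.0, 240.0, 60, 72)      # tap at the top edge
```

Using the same conversion for both forms means a tap and a slide that end at the same position always select the same pitch.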
When the user adjusts the pitch adjustment control, the terminal displays the pitch adjustment range in response to the adjustment operation on the pitch adjustment control corresponding to the target character, so that the user can adjust the position of the pitch adjustment control based on the displayed range. The pitch adjustment range consists of the pitches of the whole tones and semitones corresponding to the seven sol-fa names "do", "re", "mi", "fa", "sol", "la" and "si", that is, the pitch adjustment range includes 14 selectable pitches.
By displaying the operable pitch adjustment range when the user performs an adjustment operation on the pitch adjustment control, the user performs the adjustment operation based on the displayed pitch adjustment range, and the occurrence of the over-range adjustment situation is reduced.
In the process of adjusting the pitch adjustment control by the user, if the adjustment operation of the user exceeds the pitch adjustment range, the terminal responds to the fact that the adjustment operation exceeds the pitch adjustment range, and prompt information is displayed and used for prompting that the adjustment operation exceeds the pitch adjustment range.
When the adjustment operation of the user exceeds the pitch adjustment range, the user is prompted in time through the prompt information, so that the user can timely process the adjustment out-of-range condition.
Wherein the pitch adjustment range includes a minimum adjusted pitch and a maximum adjusted pitch. In some embodiments, if the pitch corresponding to the target position of the adjustment operation is smaller than the minimum adjusted pitch, the minimum adjusted pitch is determined as the target pitch. In other embodiments, if the pitch corresponding to the target position of the adjustment operation is greater than the maximum adjusted pitch, the maximum adjusted pitch is determined as the target pitch.
When the adjustment operation of the user exceeds the pitch adjustment range, the minimum adjustment pitch or the maximum adjustment pitch is directly determined as the target pitch according to the condition of the adjustment operation, so that the user can continuously adjust on the basis of the minimum adjustment pitch or the maximum adjustment pitch without readjusting from the most original pitch, the operation cost of the user is reduced, and the efficiency of the audio processing process is improved.
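The out-of-range handling described above amounts to clamping the requested pitch and raising a prompt. A minimal sketch, assuming integer pitch values; the function name and the prompt strings are hypothetical.

```python
from typing import Optional, Tuple


def resolve_target_pitch(requested: int, min_pitch: int,
                         max_pitch: int) -> Tuple[int, Optional[str]]:
    """Return (target_pitch, prompt); prompt is None when the request is in range."""
    if requested < min_pitch:
        # Below the range: the minimum adjusted pitch becomes the target pitch.
        return min_pitch, "adjustment is below the pitch adjustment range"
    if requested > max_pitch:
        # Above the range: the maximum adjusted pitch becomes the target pitch.
        return max_pitch, "adjustment is above the pitch adjustment range"
    return requested, None


too_low = resolve_target_pitch(55, 60, 72)
in_range = resolve_target_pitch(65, 60, 72)
too_high = resolve_target_pitch(80, 60, 72)
```

Returning the clamped pitch together with the prompt lets the interface keep the control at the range boundary while still informing the user.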
Optionally, each time the user performs an adjustment operation on the pitch adjustment control, the terminal may respond to the adjustment operation on the pitch adjustment control corresponding to the target character, and play the target audio clip based on the pitch corresponding to the target position of the adjustment operation. That is, each time the user performs an adjustment operation on the pitch adjustment control, the terminal can automatically play the target audio clip based on the pitch corresponding to the target position of the adjustment operation.
The duration of playing the target audio segment is 0.3 seconds(s), or the duration of playing the target audio segment is other values, which is not limited in the embodiment of the present disclosure.
And when the adjustment operation is finished, playing the target audio clip based on the adjusted pitch corresponding to the target position, so that the user can know the effect of the target audio clip in time, and the user can further process the target audio clip.
In addition, while the user adjusts the pitch adjustment control corresponding to the target character, the sol-fa name corresponding to the pitch of the target audio segment may change. For convenience of explanation, the sol-fa name corresponding to the pitch at the target position of the adjustment operation is referred to as the second sol-fa name. If the first sol-fa name currently displayed in the region corresponding to the target character is different from the second sol-fa name, the terminal updates the first sol-fa name displayed in the region corresponding to the target character to the second sol-fa name.
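The update of the displayed sol-fa name (do, re, mi, and so on) can be sketched as a compare-and-replace step. The disclosure does not define how a pitch value maps to a sol-fa name, so the twelve-entry table below, with integer pitches in semitone steps and pitch 60 treated as "do", is purely a hypothetical stand-in.

```python
from typing import Tuple

# Hypothetical mapping over one octave; not taken from the disclosure.
SOLFA_NAMES = ["do", "do#", "re", "re#", "mi", "fa",
               "fa#", "sol", "sol#", "la", "la#", "si"]


def solfa_name(pitch: int) -> str:
    """Sol-fa name for an integer pitch, assuming semitone steps and 60 -> 'do'."""
    return SOLFA_NAMES[pitch % 12]


def updated_label(displayed_name: str, new_pitch: int) -> Tuple[str, bool]:
    """Return (name_to_display, changed) for the pitch at the target position."""
    second_name = solfa_name(new_pitch)
    if second_name != displayed_name:
        return second_name, True   # the display must be refreshed
    return displayed_name, False   # same name: nothing to redraw


changed = updated_label("do", 62)
unchanged = updated_label("do", 72)
```

The boolean result lets the interface skip a redraw when the adjusted pitch still carries the same name.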
In some embodiments, the pitch adjustment interface further includes a play control, and the user can perform trial listening on the first audio segment corresponding to the lyrics currently displayed by the pitch adjustment interface by triggering the play control. And the terminal responds to the triggering operation of the playing control and plays the first audio fragment based on the target pitch of the first audio fragment, wherein the first audio fragment corresponds to the lyrics currently displayed on the pitch adjustment interface.
Still taking the interface schematic diagram of the pitch adjustment interface shown in fig. 4 as an example, in the pitch adjustment interface shown in fig. 4, the play control is provided as a play button 418, and the user can play the adjusted first audio clip by triggering the play button 418.
By providing the playing control in the pitch adjustment interface, a user can play the first audio segment through the playing control based on the pitch of the first audio segment corresponding to the currently displayed lyrics, so that the user can preview the audio segment after each sentence of lyrics is adjusted, and further perform subsequent processing based on the preview result.
When the first audio clip is played, the terminal highlights the currently played character; or, when the first audio clip is played, the terminal highlights a pitch adjustment control corresponding to the currently played character; or, when the first audio segment is played, the terminal highlights the currently played character and the pitch adjustment control corresponding to the currently played character, which is not limited in the embodiment of the present disclosure.
In the process of playing the first audio segment, the currently played character or the pitch adjustment control corresponding to the currently played character is highlighted, or both are highlighted, so that a user can clearly know which character is played currently, and further processing is performed based on the effect of the played audio.
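Finding which character to highlight during playback can be done by comparing the playback position with the characters' time tags. The sketch below assumes sorted time tags in milliseconds that mark the start of each character's audio segment; the function name is hypothetical.

```python
import bisect
from typing import List, Optional


def current_char_index(time_tags_ms: List[int],
                       position_ms: int) -> Optional[int]:
    """Index of the character being played at `position_ms`, or None before the first.

    `time_tags_ms` must be sorted in ascending order.
    """
    # bisect_right counts the tags that are <= position_ms.
    index = bisect.bisect_right(time_tags_ms, position_ms) - 1
    return index if index >= 0 else None


tags = [0, 400, 800]
highlighted = current_char_index(tags, 450)
```

The same index can be used to highlight the character, its pitch adjustment control, or both.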
In step 306, the terminal generates a second audio based on the adjusted target audio segment.
In some embodiments, the pitch adjustment interface further includes a saving control, and the user can store the pitch corresponding to the audio segment corresponding to each character in the lyric by triggering the saving control, and further update the music score file of the first audio based on the stored pitch to obtain an updated music score file, so as to generate a second audio based on the updated music score file, where the second audio is the audio obtained by the user by adjusting the pitch of the audio segment.
Still taking the interface schematic of the pitch-adjustment interface shown in fig. 4 as an example, in the pitch-adjustment interface shown in fig. 4, the save control is provided as a save button 419.
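The save step can be sketched as follows. This is only an illustration under stated assumptions: the disclosure does not specify the format of the music score file, so the `Note` record, the semitone-numbered pitch, and the `apply_adjustments` helper are all hypothetical.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Note:
    start: float  # time tag of the character, in seconds (assumed layout)
    pitch: int    # assumed semitone-numbered pitch (MIDI-style)


def apply_adjustments(score, stored):
    """Return an updated score.

    `stored` maps a character index to the pitch saved for it; characters
    without a stored pitch keep their original pitch.
    """
    return [replace(note, pitch=stored.get(i, note.pitch))
            for i, note in enumerate(score)]


# Hypothetical score with three characters; the user raised the second one.
score = [Note(0.0, 60), Note(0.5, 62), Note(1.2, 64)]
updated = apply_adjustments(score, {1: 65})
```

Returning a new list instead of editing in place leaves the original score available, for example if the user discards the adjustment.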
According to the scheme provided by the embodiments of the present disclosure, the pitch adjustment interface visually displays the lyrics being adjusted while the pitch of the corresponding audio segment is adjusted, so no professional is needed to adjust the audio by looking at the spectrum; the learning cost is low, the operation is convenient and fast, and audio processing efficiency is improved. The user can adjust the pitch of an audio segment directly on the terminal, which makes audio processing more convenient, and can audition the adjustment effect in real time, which improves the user experience. In addition, the pitch of the audio segment corresponding to each character can be adjusted individually, which makes the audio processing more flexible, so that the adjusted audio matches the user's personal musical taste.
Fig. 5 is a block diagram illustrating an audio processing device according to an example embodiment. Referring to fig. 5, the apparatus includes:
a display unit 501 configured to display, in a pitch adjustment interface of a first audio, a character in lyrics of the first audio and a pitch adjustment control of the character, where the pitch adjustment control is used to adjust a pitch of an audio segment corresponding to the character in the first audio;
a determining unit 502 configured to determine, in response to an adjustment operation on the pitch adjustment control corresponding to a target character, a target pitch of a target audio segment, where the target audio segment is the audio segment corresponding to the target character;
a generating unit 503 configured to generate a second audio based on the adjusted target audio segment.
The device provided by the embodiment of the disclosure can adjust the pitch of the corresponding audio segment while visually displaying the lyrics of the adjusted part by providing a pitch adjustment interface, does not need a professional to look at the spectrum for adjustment, has low learning cost, is convenient and fast, and improves the efficiency of audio processing.
In some embodiments, the display unit 501 is configured to display part of the characters in the lyrics in a character display area of the pitch adjustment interface;
the display unit 501 is further configured to display, in the region corresponding to the target character among the displayed characters, a pitch adjustment control of the target character based on the pitch of the target audio segment.
In some embodiments, the display unit 501 is configured to execute displaying a pitch adjustment control of the target character in a region corresponding to the target character, where a display position of the pitch adjustment control corresponds to a pitch of the target audio piece.
In some embodiments, the display unit 501 is further configured to perform displaying a bar graph corresponding to the target character below the pitch adjustment control of the target character, the height of the bar graph corresponding to the pitch of the target audio piece.
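One way to make the display position of the control and the height of the bar graph correspond to the pitch is a linear mapping, sketched below. The linear scale, the pixel units, and the function names are assumptions for illustration; the disclosure only states that position and height correspond to the pitch.

```python
def bar_height(pitch, min_pitch, max_pitch, area_height):
    """Map a pitch linearly onto a bar height in pixels."""
    if max_pitch <= min_pitch:
        raise ValueError("max_pitch must exceed min_pitch")
    fraction = (pitch - min_pitch) / (max_pitch - min_pitch)
    return fraction * area_height


def control_y(pitch, min_pitch, max_pitch, area_height):
    """Screen y of the control, assuming y grows downward.

    The control sits on top of the bar, so a higher pitch yields a
    smaller y value.
    """
    return area_height - bar_height(pitch, min_pitch, max_pitch, area_height)
```

For example, with an assumed displayable range of 48 to 72 and a 240-pixel area, a pitch of 60 yields a bar that fills half of the area.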
In some embodiments, the apparatus further comprises:
an obtaining unit configured to obtain the lyrics of the first audio, the lyrics including a plurality of characters and the time tags corresponding to the characters;
the obtaining unit is further configured to obtain, from the music score file of the first audio, the pitch corresponding to the time tag of the target character, after which the step of displaying, in the region corresponding to the target character among the displayed characters, the pitch adjustment control of the target character based on the pitch of the target audio segment is performed.
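The lookup of a character's pitch by its time tag might look like the sketch below. The score is modeled as hypothetical `(start, end, pitch)` tuples, since the actual layout of the music score file is not given in the disclosure.

```python
def pitch_for_character(time_tag, score):
    """Return the pitch of the note whose interval contains `time_tag`.

    `score` is a list of (start, end, pitch) tuples with half-open
    intervals [start, end). Returns None if no note covers the tag.
    """
    for start, end, pitch in score:
        if start <= time_tag < end:
            return pitch
    return None


# Hypothetical score entries and a character tagged at 0.5 seconds.
score = [(0.0, 0.5, 60), (0.5, 1.2, 62)]
pitch = pitch_for_character(0.5, score)
```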
In some embodiments, the display unit 501 is configured to display part of the characters of the target lyric in the lyrics in a character display area of the pitch adjustment interface, and to display the remaining characters of the target lyric in the character display area in response to a sliding operation on the displayed characters.
In some embodiments, the determining unit 502 is configured to perform, in response to a sliding operation on a pitch adjustment control corresponding to the target character, determining a pitch corresponding to a target position of the sliding operation as the target pitch;
the determining unit 502 is further configured to perform a determination of a pitch corresponding to the triggered target position as the target pitch in response to the triggering operation in the region corresponding to the target character.
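Both the sliding operation and the triggering (tap) operation end at a target position, which then has to be converted into a pitch. A minimal sketch of that conversion follows, assuming a vertical layout with y equal to 0 at the top of the character's region and pitches quantized to whole steps; these details and the name `pitch_at` are assumptions.

```python
def pitch_at(y, min_pitch, max_pitch, area_height):
    """Convert a touch y coordinate (0 at the top) to the nearest pitch."""
    if area_height <= 0:
        raise ValueError("area_height must be positive")
    fraction = 1.0 - y / area_height  # 1.0 at the top, 0.0 at the bottom
    return round(min_pitch + fraction * (max_pitch - min_pitch))
```

Rounding to the nearest whole step means small finger movements do not change the target pitch until the next step is reached.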
In some embodiments, the apparatus further comprises:
a first playing unit configured to play, in response to an adjustment operation on the pitch adjustment control corresponding to the target character, the target audio segment based on the pitch corresponding to the target position of the adjustment operation.
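To audition a segment at a new pitch, a playback engine typically needs the frequency ratio between the original and the target pitch. The sketch below assumes pitches are counted in equal-tempered semitones, which the disclosure does not state; `shift_ratio` is a hypothetical helper, and the actual pitch-shifting method used by the terminal is not described in the source.

```python
def shift_ratio(original_pitch, target_pitch):
    """Frequency ratio for moving between two pitches given in semitones.

    Each equal-tempered semitone multiplies the frequency by 2 ** (1 / 12),
    so twelve semitones (one octave) double it.
    """
    return 2.0 ** ((target_pitch - original_pitch) / 12.0)


ratio = shift_ratio(60, 67)  # seven semitones up
```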
In some embodiments, the display unit 501 is further configured to display a first solfège name in the region corresponding to the target character in the pitch adjustment interface of the first audio, the first solfège name corresponding to the pitch of the target audio segment;
the device also includes:
an updating unit configured to update the first solfège name displayed in the region corresponding to the target character to a second solfège name if the first solfège name differs from the second solfège name, where the second solfège name corresponds to the pitch at the target position of the adjustment operation.
In some embodiments, the display unit 501 is further configured to display a pitch adjustment range in response to an adjustment operation on the pitch adjustment control corresponding to the target character.
In some embodiments, the display unit 501 is further configured to perform displaying a prompt message for prompting that the adjustment operation exceeds the pitch adjustment range in response to the adjustment operation exceeding the pitch adjustment range.
In some embodiments, the pitch adjustment range includes a minimum adjusted pitch and a maximum adjusted pitch;
the determining unit 502 is further configured to determine the minimum adjusted pitch as the target pitch if the pitch corresponding to the target position of the adjusting operation is smaller than the minimum adjusted pitch;
the determining unit 502 is further configured to determine the maximum adjusted pitch as the target pitch if the pitch corresponding to the target position of the adjusting operation is larger than the maximum adjusted pitch.
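The range handling described above amounts to clamping the requested pitch to the adjustment range and noting whether the range was exceeded so that a prompt can be shown. A minimal sketch, with the hypothetical name `clamp_pitch`:

```python
def clamp_pitch(requested, min_pitch, max_pitch):
    """Clamp `requested` to [min_pitch, max_pitch].

    Returns (target_pitch, exceeded), where `exceeded` is True when the
    request fell outside the range and a prompt should be displayed.
    """
    target = max(min_pitch, min(requested, max_pitch))
    return target, target != requested
```

For instance, with an assumed range of 48 to 72, a request of 80 yields the maximum adjusted pitch 72 together with a flag indicating that the range was exceeded.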
In some embodiments, the pitch adjustment interface further comprises a first lyric switching control for switching to a previous sentence of lyrics and a second lyric switching control for switching to a next sentence of lyrics;
the device also includes:
a switching unit configured to switch, in response to a triggering operation on the first lyric switching control, the lyrics currently displayed in the pitch adjustment interface to the previous sentence of lyrics;
the switching unit is further configured to switch, in response to a triggering operation on the second lyric switching control, the lyrics currently displayed in the pitch adjustment interface to the next sentence of lyrics.
In some embodiments, the pitch adjustment interface further comprises a play control;
the device also includes:
a second playing unit configured to play, in response to a triggering operation on the play control, a first audio segment based on the target pitch of the first audio segment, where the first audio segment corresponds to the lyrics currently displayed in the pitch adjustment interface.
In some embodiments, the display unit 501 is further configured to highlight the currently played character when the first audio segment is played;
the display unit 501 is further configured to highlight the pitch adjustment control corresponding to the currently played character when the first audio clip is played.
It should be noted that when the audio processing apparatus provided in the foregoing embodiment processes audio, the division into the functional modules described above is only an example. In practical applications, the functions may be allocated to different functional modules as needed; that is, the internal structure of the device may be divided into different functional modules to complete all or part of the functions described above. In addition, the audio processing apparatus and the audio processing method provided by the above embodiments belong to the same concept; the specific implementation process is described in the method embodiments and is not repeated here.
Fig. 6 shows a block diagram of a terminal 600 according to an exemplary embodiment of the present disclosure. The terminal 600 may be a smartphone, a tablet computer, an MP3 (Moving Picture Experts Group Audio Layer III) player, an MP4 (Moving Picture Experts Group Audio Layer IV) player, a notebook computer, or a desktop computer. The terminal 600 may also be referred to by other names, such as user equipment, portable terminal, laptop terminal, or desktop terminal.
In general, the terminal 600 includes: a processor 601 and a memory 602.
The processor 601 may include one or more processing cores, for example a 4-core processor or an 8-core processor. The processor 601 may be implemented in at least one of the following hardware forms: a DSP (Digital Signal Processor), an FPGA (Field-Programmable Gate Array), or a PLA (Programmable Logic Array). The processor 601 may also include a main processor and a coprocessor, where the main processor, also called a CPU (Central Processing Unit), processes data in the awake state, and the coprocessor is a low-power processor that processes data in the standby state. In some embodiments, the processor 601 may be integrated with a GPU (Graphics Processing Unit), which is responsible for rendering and drawing the content that the display screen needs to display. In some embodiments, the processor 601 may also include an AI (Artificial Intelligence) processor for processing computational operations related to machine learning.
The memory 602 may include one or more computer-readable storage media, which may be non-transitory. The memory 602 may also include high-speed random access memory as well as non-volatile memory, such as one or more magnetic disk storage devices or flash memory storage devices. In some embodiments, a non-transitory computer-readable storage medium in the memory 602 is used to store at least one piece of program code, which is executed by the processor 601 to implement the audio processing method provided by the method embodiments of the present disclosure.
In some embodiments, the terminal 600 may further optionally include a peripheral interface 603 and at least one peripheral. The processor 601, the memory 602, and the peripheral interface 603 may be connected by buses or signal lines. Each peripheral may be connected to the peripheral interface 603 via a bus, a signal line, or a circuit board. Specifically, the peripherals include at least one of a radio frequency circuit 604, a display screen 605, a camera assembly 606, an audio circuit 607, a positioning component 608, and a power supply 609.
The peripheral interface 603 may be used to connect at least one peripheral related to I/O (Input/Output) to the processor 601 and the memory 602. In some embodiments, the processor 601, memory 602, and peripheral interface 603 are integrated on the same chip or circuit board; in some other embodiments, any one or two of the processor 601, the memory 602, and the peripheral interface 603 may be implemented on a separate chip or circuit board, which is not limited in this embodiment.
The radio frequency circuit 604 is used for receiving and transmitting RF (Radio Frequency) signals, also called electromagnetic signals. The radio frequency circuit 604 communicates with communication networks and other communication devices via electromagnetic signals. It converts an electrical signal into an electromagnetic signal for transmission, or converts a received electromagnetic signal into an electrical signal. Optionally, the radio frequency circuit 604 comprises an antenna system, an RF transceiver, one or more amplifiers, a tuner, an oscillator, a digital signal processor, a codec chipset, a subscriber identity module card, and so forth. The radio frequency circuit 604 may communicate with other terminals via at least one wireless communication protocol, including but not limited to metropolitan area networks, mobile communication networks of various generations (2G, 3G, 4G, and 5G), wireless local area networks, and/or WiFi (Wireless Fidelity) networks. In some embodiments, the radio frequency circuit 604 may also include NFC (Near Field Communication) related circuits, which is not limited by this disclosure.
The display screen 605 is used to display a UI (User Interface). The UI may include graphics, text, icons, video, and any combination thereof. When the display screen 605 is a touch display screen, it can also capture touch signals on or above its surface. A touch signal may be input to the processor 601 as a control signal for processing. In this case, the display screen 605 may also provide virtual buttons and/or a virtual keyboard, also referred to as soft buttons and/or a soft keyboard. In some embodiments, there may be one display screen 605, disposed on the front panel of the terminal 600; in other embodiments, there may be at least two display screens 605, disposed on different surfaces of the terminal 600 or in a folded design; in still other embodiments, the display screen 605 may be a flexible display screen disposed on a curved or folded surface of the terminal 600. The display screen 605 may even be given a non-rectangular, irregular shape, that is, a specially shaped screen. The display screen 605 may be made of materials such as LCD (Liquid Crystal Display) or OLED (Organic Light-Emitting Diode).
The camera assembly 606 is used to capture images or video. Optionally, the camera assembly 606 includes a front camera and a rear camera. Generally, the front camera is disposed on the front panel of the terminal, and the rear camera is disposed on the rear surface of the terminal. In some embodiments, there are at least two rear cameras, each being any one of a main camera, a depth-of-field camera, a wide-angle camera, and a telephoto camera, so that the main camera and the depth-of-field camera can be fused to realize a background blurring function, and the main camera and the wide-angle camera can be fused to realize panoramic shooting, VR (Virtual Reality) shooting, or other fused shooting functions. In some embodiments, the camera assembly 606 may also include a flash. The flash may be a single-color-temperature flash or a dual-color-temperature flash. A dual-color-temperature flash combines a warm-light flash and a cold-light flash and can be used for light compensation at different color temperatures.
The audio circuit 607 may include a microphone and a speaker. The microphone collects sound waves from the user and the environment, converts them into electrical signals, and inputs the signals to the processor 601 for processing or to the radio frequency circuit 604 for voice communication. For stereo sound collection or noise reduction, a plurality of microphones may be provided at different parts of the terminal 600. The microphone may also be an array microphone or an omnidirectional pickup microphone. The speaker converts electrical signals from the processor 601 or the radio frequency circuit 604 into sound waves. The speaker may be a conventional thin-film speaker or a piezoelectric ceramic speaker. A piezoelectric ceramic speaker can convert an electrical signal not only into sound waves audible to humans but also into sound waves inaudible to humans, for purposes such as distance measurement. In some embodiments, the audio circuit 607 may also include a headphone jack.
The positioning component 608 is used to determine the current geographic location of the terminal 600 to implement navigation or LBS (Location Based Service). The positioning component 608 may be based on the GPS (Global Positioning System) of the United States, the BeiDou system of China, the GLONASS system of Russia, or the Galileo system of the European Union.
The power supply 609 is used to supply power to the various components in the terminal 600. The power supply 609 may be alternating current, direct current, a disposable battery, or a rechargeable battery. When the power supply 609 includes a rechargeable battery, the rechargeable battery may support wired or wireless charging and may also support fast-charging technology.
In some embodiments, the terminal 600 also includes one or more sensors 610. The one or more sensors 610 include, but are not limited to: acceleration sensor 611, gyro sensor 612, pressure sensor 613, fingerprint sensor 614, optical sensor 615, and proximity sensor 616.
The acceleration sensor 611 may detect the magnitude of acceleration in three coordinate axes of the coordinate system established with the terminal 600. For example, the acceleration sensor 611 may be used to detect components of the gravitational acceleration in three coordinate axes. The processor 601 may control the display screen 605 to display the user interface in a landscape view or a portrait view according to the gravitational acceleration signal collected by the acceleration sensor 611. The acceleration sensor 611 may also be used for acquisition of motion data of a game or a user.
The gyro sensor 612 may detect a body direction and a rotation angle of the terminal 600, and the gyro sensor 612 and the acceleration sensor 611 may cooperate to acquire a 3D motion of the user on the terminal 600. The processor 601 may implement the following functions according to the data collected by the gyro sensor 612: motion sensing (such as changing the UI according to a user's tilting operation), image stabilization at the time of photographing, game control, and inertial navigation.
The pressure sensor 613 may be disposed on the side frame of the terminal 600 and/or beneath the display screen 605. When the pressure sensor 613 is disposed on the side frame of the terminal 600, it can detect the user's grip signal on the terminal 600, and the processor 601 performs left-hand/right-hand recognition or shortcut operations according to the grip signal collected by the pressure sensor 613. When the pressure sensor 613 is disposed beneath the display screen 605, the processor 601 controls the operable controls on the UI according to the user's pressure operation on the display screen 605. The operable controls include at least one of a button control, a scroll bar control, an icon control, and a menu control.
The fingerprint sensor 614 is used to collect the user's fingerprint, and either the processor 601 identifies the user according to the fingerprint collected by the fingerprint sensor 614, or the fingerprint sensor 614 itself identifies the user according to the collected fingerprint. Upon identifying that the user's identity is a trusted identity, the processor 601 authorizes the user to perform relevant sensitive operations, including unlocking the screen, viewing encrypted information, downloading software, making payments, and changing settings. The fingerprint sensor 614 may be disposed on the front, back, or side of the terminal 600. When a physical button or a vendor logo is provided on the terminal 600, the fingerprint sensor 614 may be integrated with the physical button or the vendor logo.
The optical sensor 615 is used to collect the ambient light intensity. In one embodiment, processor 601 may control the display brightness of display screen 605 based on the ambient light intensity collected by optical sensor 615. Specifically, when the ambient light intensity is high, the display brightness of the display screen 605 is increased; when the ambient light intensity is low, the display brightness of the display screen 605 is adjusted down. In another embodiment, the processor 601 may also dynamically adjust the shooting parameters of the camera assembly 606 according to the ambient light intensity collected by the optical sensor 615.
The proximity sensor 616, also known as a distance sensor, is typically disposed on the front panel of the terminal 600. The proximity sensor 616 is used to collect the distance between the user and the front surface of the terminal 600. In one embodiment, when the proximity sensor 616 detects that the distance between the user and the front surface of the terminal 600 gradually decreases, the processor 601 controls the display screen 605 to switch from the screen-on state to the screen-off state; when the proximity sensor 616 detects that the distance gradually increases, the processor 601 controls the display screen 605 to switch from the screen-off state to the screen-on state.
Those skilled in the art will appreciate that the configuration shown in fig. 6 does not limit the terminal 600, which may include more or fewer components than those shown, combine some components, or use a different arrangement of components.
In an exemplary embodiment, a computer-readable storage medium comprising program code is also provided, for example the memory 602 comprising program code, where the program code is executable by the processor 601 of the terminal 600 to perform the above-described audio processing method. Optionally, the computer-readable storage medium may be a ROM (Read-Only Memory), a RAM (Random Access Memory), a CD-ROM (Compact Disc Read-Only Memory), a magnetic tape, a floppy disk, an optical data storage device, or the like.
In an exemplary embodiment, a computer program product is also provided, comprising a computer program, which when executed by the processor 601 of the terminal 600, implements the audio processing method described above.
In some embodiments, the computer program according to the embodiments of the present disclosure may be deployed to be executed on one terminal, on a plurality of terminals located at one site, or on a plurality of terminals distributed across a plurality of sites and interconnected by a communication network; the plurality of terminals distributed across the plurality of sites and interconnected by the communication network may constitute a blockchain system.
Other embodiments of the disclosure will be apparent to those skilled in the art from consideration of the specification and practice of the disclosure disclosed herein. This disclosure is intended to cover any variations, uses, or adaptations of the disclosure following, in general, the principles of the disclosure and including such departures from the present disclosure as come within known or customary practice within the art to which the disclosure pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the disclosure being indicated by the following claims.
It will be understood that the present disclosure is not limited to the precise arrangements described above and shown in the drawings and that various modifications and changes may be made without departing from the scope thereof. The scope of the present disclosure is limited only by the appended claims.

Claims (10)

1. A method of audio processing, the method comprising:
displaying characters in lyrics of a first audio and a pitch adjustment control of the characters in a pitch adjustment interface of the first audio, wherein the pitch adjustment control is used for adjusting the pitch of an audio fragment corresponding to the characters in the first audio;
responding to an adjustment operation of a pitch adjustment control corresponding to a target character, and determining a target pitch of a target audio fragment, wherein the target audio fragment is an audio fragment corresponding to the target character;
and generating second audio based on the adjusted target audio segment.
2. The audio processing method of claim 1, wherein the displaying the characters in the lyrics of the first audio and the pitch adjustment controls of the characters in the pitch adjustment interface of the first audio comprises:
displaying a part of characters in the lyrics in a character display area of the pitch adjustment interface;
and displaying, in the area corresponding to the target character among the displayed part of characters, a pitch adjustment control of the target character based on the pitch of the target audio fragment.
3. The audio processing method according to claim 2, wherein the displaying, in the area corresponding to the target character among the displayed part of characters, a pitch adjustment control of the target character based on the pitch of the target audio fragment comprises:
displaying the pitch adjustment control of the target character in the area corresponding to the target character, wherein the display position of the pitch adjustment control corresponds to the pitch of the target audio fragment.
4. The audio processing method of claim 3, further comprising: displaying a bar graph corresponding to the target character below a pitch adjustment control of the target character, a height of the bar graph corresponding to a pitch of the target audio piece.
5. The audio processing method of claim 1, wherein the method further comprises:
in response to an adjustment operation on the pitch adjustment control corresponding to the target character, playing the target audio clip based on the pitch corresponding to the target position of the adjustment operation.
6. The audio processing method of claim 1, wherein the pitch adjustment interface further comprises a first lyric switching control for switching to a previous sentence of lyrics and a second lyric switching control for switching to a next sentence of lyrics;
the method further comprises the following steps:
in response to a triggering operation on the first lyric switching control, switching the lyrics currently displayed in the pitch adjustment interface to the previous sentence of lyrics;
and in response to a triggering operation on the second lyric switching control, switching the lyrics currently displayed in the pitch adjustment interface to the next sentence of lyrics.
7. The audio processing method of claim 1, wherein the pitch adjustment interface further comprises a play control;
the method further comprises the following steps:
in response to a triggering operation of the playing control, playing a first audio segment based on a target pitch of the first audio segment, wherein the first audio segment corresponds to the lyrics currently displayed by the pitch adjustment interface.
8. An audio processing apparatus, characterized in that the apparatus comprises:
a display unit configured to display, in a pitch adjustment interface of a first audio, a character in lyrics of the first audio and a pitch adjustment control of the character, the pitch adjustment control being used to adjust the pitch of the audio segment corresponding to the character in the first audio;
a determining unit configured to determine, in response to an adjustment operation on the pitch adjustment control corresponding to a target character, a target pitch of a target audio segment, wherein the target audio segment is the audio segment corresponding to the target character;
a generating unit configured to generate second audio based on the adjusted target audio segment.
9. A terminal, characterized in that the terminal comprises:
one or more processors;
a memory for storing program code executable by the processor;
wherein the processor is configured to execute the program code to implement the audio processing method of any of claims 1 to 7.
10. A computer-readable storage medium, characterized in that program code in the computer-readable storage medium, when executed by a processor of a terminal, enables the terminal to perform the audio processing method of any of claims 1 to 7.
CN202110470416.XA 2021-04-28 2021-04-28 Audio processing method, device, terminal and computer readable storage medium Pending CN113204673A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202110470416.XA CN113204673A (en) 2021-04-28 2021-04-28 Audio processing method, device, terminal and computer readable storage medium
PCT/CN2021/136890 WO2022227589A1 (en) 2021-04-28 2021-12-09 Audio processing method and apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110470416.XA CN113204673A (en) 2021-04-28 2021-04-28 Audio processing method, device, terminal and computer readable storage medium

Publications (1)

Publication Number Publication Date
CN113204673A CN113204673A (en) 2021-08-03

Family

ID=77029463

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110470416.XA Pending CN113204673A (en) 2021-04-28 2021-04-28 Audio processing method, device, terminal and computer readable storage medium

Country Status (2)

Country Link
CN (1) CN113204673A (en)
WO (1) WO2022227589A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022227589A1 (en) * 2021-04-28 2022-11-03 北京达佳互联信息技术有限公司 Audio processing method and apparatus

Also Published As

Publication number Publication date
WO2022227589A1 (en) 2022-11-03

Similar Documents

Publication Publication Date Title
CN108008930B (en) Method and device for determining K song score
CN109033335B (en) Audio recording method, device, terminal and storage medium
CN108538302B (en) Method and apparatus for synthesizing audio
CN109168073B (en) Method and device for displaying cover of live broadcast room
CN108965757B (en) Video recording method, device, terminal and storage medium
CN109327608B (en) Song sharing method, terminal, server and system
CN110688082B (en) Method, device, equipment and storage medium for determining adjustment proportion information of volume
CN111031386B (en) Video dubbing method and device based on voice synthesis, computer equipment and medium
CN109144346B (en) Song sharing method and device and storage medium
CN109192218B (en) Method and apparatus for audio processing
CN109346111B (en) Data processing method, device, terminal and storage medium
CN111061405B (en) Method, device and equipment for recording song audio and storage medium
CN110139143B (en) Virtual article display method, device, computer equipment and storage medium
CN111753125A (en) Song audio frequency display method and device
CN110266982B (en) Method and system for providing songs while recording video
CN110996167A (en) Method and device for adding subtitles in video
CN111083526A (en) Video transition method and device, computer equipment and storage medium
CN111081277B (en) Audio evaluation method, device, equipment and storage medium
CN110808021A (en) Audio playing method, device, terminal and storage medium
CN113204672B (en) Resource display method, device, computer equipment and medium
CN113963707A (en) Audio processing method, device, equipment and storage medium
CN112118482A (en) Audio file playing method and device, terminal and storage medium
CN109003627B (en) Method, device, terminal and storage medium for determining audio score
CN109036463B (en) Method, device and storage medium for acquiring difficulty information of songs
WO2022227589A1 (en) Audio processing method and apparatus

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination