CN114387994A - Audio data acquisition method and device - Google Patents

Audio data acquisition method and device Download PDF

Info

Publication number
CN114387994A
CN114387994A CN202111616161.XA CN202111616161A CN114387994A CN 114387994 A CN114387994 A CN 114387994A CN 202111616161 A CN202111616161 A CN 202111616161A CN 114387994 A CN114387994 A CN 114387994A
Authority
CN
China
Prior art keywords
audio
audio clip
clip
current audio
current
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111616161.XA
Other languages
Chinese (zh)
Inventor
马小坤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Dajia Internet Information Technology Co Ltd
Original Assignee
Beijing Dajia Internet Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Dajia Internet Information Technology Co Ltd filed Critical Beijing Dajia Internet Information Technology Co Ltd
Priority to CN202111616161.XA priority Critical patent/CN114387994A/en
Publication of CN114387994A publication Critical patent/CN114387994A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/165Management of the audio stream, e.g. setting of volume, audio stream path

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Reverberation, Karaoke And Other Acoustics (AREA)

Abstract

The present disclosure relates to the field of computer technologies, and in particular, to an audio data acquisition method and apparatus. The audio data acquisition method comprises the following steps: in the audio data acquisition process, if the current audio clip is determined not to meet the audio quality condition, sending prompt information aiming at the current audio clip, wherein the prompt information is used for prompting to acquire the current audio clip again; if a confirmation instruction aiming at the prompt message is received, acquiring the reacquiring duration information corresponding to the current audio clip; and acquiring the audio clip which is acquired again aiming at the current audio clip based on the re-acquisition duration information so as to acquire the audio data meeting the audio quality condition. By adopting the method and the device, the user experience can be improved while the convenience of audio data acquisition is improved.

Description

Audio data acquisition method and device
Technical Field
The present disclosure relates to the field of computer technologies, and in particular, to an audio data acquisition method and apparatus.
Background
At present, in the application of karaoke, a user can sing through a vocal accompaniment function of a terminal, and audio data generated by singing of the user can be recorded through the terminal. However, when a user is dissatisfied with a certain piece of audio data and needs to sing again, the whole song can only be recorded from the beginning, so that the song recording steps are more, the convenience of song recording is poor, and the user experience is poor.
Disclosure of Invention
The present disclosure provides an audio data acquisition method, apparatus, and system to at least solve the problem of low convenience of audio data acquisition in the related art. The technical scheme of the disclosure is as follows:
according to a first aspect of the embodiments of the present disclosure, there is provided an audio data acquisition method, including:
in the audio data acquisition process, if the current audio clip is determined not to meet the audio quality condition, sending prompt information aiming at the current audio clip, wherein the prompt information is used for prompting to acquire the current audio clip again;
if a confirmation instruction aiming at the prompt message is received, acquiring the reacquiring duration information corresponding to the current audio clip;
and acquiring the audio clip which is acquired again aiming at the current audio clip based on the re-acquisition duration information so as to acquire the audio data meeting the audio quality condition.
Optionally, before the sending out the prompt information for the current audio segment, the method further includes:
the method comprises the steps of obtaining a selection instruction input aiming at an audio quality condition set, and determining an audio quality condition corresponding to an audio data acquisition process based on the selection instruction.
Optionally, after the acquiring the audio segment recaptured for the current audio segment based on the recapture duration information, the method further includes:
and if the reacquired audio clip is determined to meet the audio quality condition, replacing the current audio clip with the reacquired audio clip.
Optionally, after acquiring the audio clip re-acquired for the current audio clip based on the re-acquisition duration information, the method further includes:
if an audition instruction aiming at the reacquired audio clip is received, playing the reacquired audio clip;
and if a confirmation instruction aiming at the re-collected audio clip is received, replacing the current audio clip with the re-collected audio clip.
Optionally, if it is determined that the current audio segment does not satisfy the audio quality condition, sending a prompt message for the current audio segment, where the prompt message includes:
acquiring a current audio clip;
acquiring an actual singing parameter and a target singing parameter corresponding to the current audio clip, wherein the target singing parameter is a singing parameter stored in a Musical Instrument Digital Interface (MIDI) file;
and if the coincidence degree of the actual singing parameter and the target singing parameter is lower than the coincidence degree threshold value, sending out prompt information aiming at the current audio clip.
Optionally, if a confirmation instruction for the prompt information is received, acquiring the reacquisition duration information corresponding to the current audio clip includes:
if a confirmation instruction aiming at the prompt information is received, displaying audio acquisition time information corresponding to the current audio clip;
acquiring a selection instruction input aiming at the audio acquisition time information;
and acquiring the reacquisition duration information corresponding to the current audio clip based on the selection instruction.
Optionally, the displaying of the audio capture time information corresponding to the current audio clip includes:
generating a floating window on a display interface, and displaying audio acquisition time information corresponding to the current audio clip in the floating window;
or
And displaying the audio acquisition time information corresponding to the current audio clip on a sub-display interface of the display interface.
According to a second aspect of the embodiments of the present disclosure, there is provided an audio data acquisition apparatus including:
the quality judgment unit is configured to execute in the audio data acquisition process, if the current audio clip is determined not to meet the audio quality condition, send prompt information aiming at the current audio clip, wherein the prompt information is used for prompting to acquire the current audio clip again;
the information acquisition unit is configured to execute that if a confirmation instruction aiming at the prompt information is received, the re-acquisition duration information corresponding to the current audio clip is acquired;
an audio acquisition unit configured to perform acquisition of an audio clip re-acquired for the current audio clip based on the re-acquisition duration information to acquire audio data satisfying the audio quality condition.
Optionally, the apparatus further includes a condition selection unit, configured to perform, before the issuing of the prompt information for the current audio segment, specifically, to perform:
the method comprises the steps of obtaining a selection instruction input aiming at an audio quality condition set, and determining an audio quality condition corresponding to an audio data acquisition process based on the selection instruction.
Optionally, the apparatus further includes an audio replacing unit, configured to perform, after acquiring the audio segment re-acquired for the current audio segment based on the re-acquisition duration information, specifically, perform:
and if the reacquired audio clip is determined to meet the audio quality condition, replacing the current audio clip with the reacquired audio clip.
Optionally, the apparatus further includes an audio listening unit and an audio confirmation unit, configured to perform, after acquiring the audio segment re-acquired for the current audio segment based on the re-acquisition duration information:
the audio audition unit is configured to execute playing the reacquired audio clip if an audition instruction for the reacquired audio clip is received;
the audio confirming unit is configured to execute replacing the current audio clip with the re-collected audio clip if a confirming instruction for the re-collected audio clip is received.
Optionally, the quality determining unit includes a segment obtaining subunit, a parameter obtaining subunit, and an information prompting subunit, and is configured to execute, if it is determined that the current audio segment does not satisfy the audio quality condition, when the prompting information for the current audio segment is sent out:
the segment acquiring subunit is configured to perform acquiring a current audio segment;
the parameter acquiring subunit is configured to execute acquiring an actual singing parameter and a target singing parameter corresponding to the current audio clip, where the target singing parameter is a singing parameter stored in a Musical Instrument Digital Interface (MIDI) file;
the information prompting subunit is configured to execute sending a prompting message for the current audio clip if the coincidence degree of the actual singing parameter and the target singing parameter is lower than a coincidence degree threshold.
Optionally, the information obtaining unit includes an information displaying subunit, an instruction obtaining subunit, and a duration obtaining subunit, and is configured to execute, if a confirmation instruction for the prompt information is received, when the reacquisition duration information corresponding to the current audio clip is obtained:
the information display subunit is configured to display audio acquisition time information corresponding to the current audio clip if a confirmation instruction for the prompt information is received;
the instruction acquisition subunit is configured to execute acquisition of a selection instruction input for the audio acquisition time information;
the time length obtaining subunit is configured to execute, based on the selection instruction, obtaining the re-acquisition time length information corresponding to the current audio clip.
Optionally, the information presentation subunit, when being configured to perform presentation of the audio capture time information corresponding to the current audio clip, is specifically configured to perform:
generating a floating window on a display interface, and displaying audio acquisition time information corresponding to the current audio clip in the floating window;
or
And displaying the audio acquisition time information corresponding to the current audio clip on a sub-display interface of the display interface.
According to a third aspect of the embodiments of the present disclosure, there is provided a terminal, including:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein the content of the first and second substances,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of any one of the preceding aspects.
According to a fourth aspect of the present application, there is provided a non-transitory computer readable storage medium having stored thereon computer instructions for causing the computer to perform the method of any one of the preceding aspects.
According to a fifth aspect of the present application, there is provided a computer program product comprising a computer program which, when executed by a processor, implements the method of any of the preceding aspects.
The technical scheme provided by the embodiment of the disclosure at least brings the following beneficial effects:
in some or related embodiments, in the audio data collection process, if it is determined that the current audio segment does not satisfy the audio quality condition, prompt information for the current audio segment is issued, where the prompt information is used to prompt to re-collect the current audio segment; if a confirmation instruction aiming at the prompt message is received, acquiring the reacquiring duration information corresponding to the current audio clip; and acquiring the audio clip which is acquired again aiming at the current audio clip based on the re-acquisition duration information so as to acquire the audio data meeting the audio quality condition. Therefore, when the terminal detects that the audio frequency segment does not meet the requirement, the user can sing the segment which does not meet the requirement again without starting to sing all audio frequencies from the beginning, so that the steps of acquiring the audio data are reduced, the convenience of acquiring the audio data can be improved, the efficiency of acquiring the audio data is improved, and the use experience of the user can be improved.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the disclosure.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present disclosure and, together with the description, serve to explain the principles of the disclosure and are not to be construed as limiting the disclosure.
FIG. 1 is a background schematic diagram illustrating a method of audio data acquisition according to an exemplary embodiment;
FIG. 2 is a diagram illustrating a background architecture of a method of audio data acquisition, according to an exemplary embodiment;
FIG. 3 is a flow chart illustrating a method of audio data acquisition according to an exemplary embodiment;
FIG. 4 is a flow chart illustrating a method of audio data acquisition according to an exemplary embodiment;
FIG. 5 is a pictorial illustration of a presentation of a prompt message, according to an exemplary embodiment;
FIG. 6 is a schematic flow diagram illustrating selection of reacquisition duration information in accordance with an exemplary embodiment;
FIG. 7 is a schematic flow diagram illustrating the reacquisition of an audio clip in accordance with an exemplary embodiment;
FIG. 8 is a flow chart illustrating a method of audio data acquisition according to an exemplary embodiment;
FIG. 9 is a schematic illustration of a presentation of an audition page shown in accordance with an exemplary embodiment;
FIG. 10 is a block diagram illustrating an audio data acquisition device in accordance with an exemplary embodiment;
FIG. 11 is a block diagram illustrating an audio data acquisition device in accordance with an exemplary embodiment;
FIG. 12 is a block diagram illustrating an audio data acquisition device in accordance with an exemplary embodiment;
FIG. 13 is a block diagram illustrating an audio data acquisition device in accordance with an exemplary embodiment;
FIG. 14 is a block diagram illustrating an audio data acquisition device according to an exemplary embodiment;
FIG. 15 is a block diagram illustrating an audio data acquisition device in accordance with an exemplary embodiment;
fig. 16 is a block diagram illustrating a terminal according to an example embodiment.
Detailed Description
In order to make the technical solutions of the present disclosure better understood by those of ordinary skill in the art, the technical solutions in the embodiments of the present disclosure will be clearly and completely described below with reference to the accompanying drawings.
It should be noted that the terms "first," "second," and the like in the description and claims of the present disclosure and in the above-described drawings are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used is interchangeable under appropriate circumstances such that the embodiments of the disclosure described herein are capable of operation in sequences other than those illustrated or otherwise described herein. The implementations described in the exemplary embodiments below are not intended to represent all implementations consistent with the present disclosure. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the present disclosure, as detailed in the appended claims.
With the development of scientific technology, terminal technology is mature day by day, and convenience of production and life of users is improved. In the terminal application scenario, for example, a user can sing through a vocal accompaniment function of the terminal, and audio data generated by the user singing can be recorded through the terminal.
According to some embodiments, fig. 1 is a background schematic diagram illustrating a method of audio data acquisition according to an example embodiment. As shown in fig. 1, when a user needs to sing, the user can click on the terminal's singing application. When the terminal detects that the user clicks on the application, the terminal may display a singing interface on the display. The singing interface includes, but is not limited to, song name, lyrics, category button, start button, finish button, etc. When the user starts singing, the user can click the start button. When the terminal detects that the user clicks the start button, the terminal can start playing the accompaniment and record the audio data sung by the user.
In some embodiments, when a user records the audio data being sung by using the singing application program, if the user is dissatisfied with a certain section of recorded audio data and needs to sing again, the audio data of the whole song can only be recorded from the beginning, so that the user needs to spend a lot of time to repeatedly sing the selected song during recording, which affects the recording efficiency and further affects the user experience.
According to some embodiments, fig. 2 is a schematic diagram illustrating a background architecture of an audio data acquisition method according to an example embodiment. As shown in fig. 2, the terminal 11 sets a singing application. When the terminal 11 records the audio data sung by the user through the audio collecting module, the recorded audio data may be stored to the server 13 through the network 12. When the user needs to listen to the recorded audio data 13 on trial, the terminal may acquire and play the audio data stored in the server 13.
It is readily understood that the terminal includes, but is not limited to: wearable devices, handheld devices, personal computers, tablet computers, in-vehicle devices, smart phones, computing devices or other processing devices connected to a wireless modem, and the like. The terminal devices in different networks may be called different names, for example: user equipment, access terminal, subscriber unit, subscriber station, mobile station, remote terminal, mobile device, user terminal, wireless communication device, user agent or user equipment, cellular telephone, cordless telephone, Personal Digital Assistant (PDA), terminal equipment in a 5th generation mobile network or future evolution network, and the like. The terminal can be installed with an operating system, which is an operating system capable of running in the terminal, is a program for managing and controlling terminal hardware and terminal applications, and is an indispensable system application in the terminal. The operating system includes, but is not limited to, Android, IOS, Windows Phone (WP), and Ubuntu mobile operating system.
According to some embodiments, the terminal 11 may be connected to the server 13 via the network 12. The network 12 is arranged to provide a communication link between the terminal 11 and the server 13. Network 12 may include various types of connections, such as wire, wireless communication links, or fiber optic cables, to name a few. It should be understood that the number of terminals 11, networks 12 and servers 13 in fig. 2 is merely illustrative. There may be any number of terminals, networks and servers, as desired for the reality. For example, the server 13 may be a server cluster composed of a plurality of servers. The user can use the terminal 11 to interact with the server 13 via the network 12 for listening on trial to recorded audio data or the like.
Fig. 3 is a flowchart illustrating an audio data acquisition method according to an exemplary embodiment, for example, as shown in fig. 3, the audio data acquisition method may be applied to a multimedia scene, and the method may be implemented by a computer program, and may be executed on a device including the audio data acquisition method, and includes the following steps:
in step S11, in the audio data collection process, if it is determined that the current audio clip does not satisfy the audio quality condition, a prompt message for the current audio clip is issued;
according to some embodiments, the audio data refers to audio data received by the terminal through the collection module during the process of singing by the user. For example, the collection module may be a bluetooth headset or a microphone. The audio data does not refer to a fixed audio data. For example, when a song selected by the user changes, the audio data may also transmit the change. The audio data may also be sent changes when the singing user changes.
According to some embodiments, the current audio clip refers to an audio clip being captured by the terminal during the process of singing by the user. The audio segment does not refer specifically to a fixed segment. For example, when the content of the audio data changes, the audio piece may also change. When the terminal acquires the audio clip in a changed manner, the audio clip can be changed.
In some embodiments, the manner in which the terminal acquires the audio clip includes, but is not limited to, acquiring by time length, acquiring by audio content, and the like. When the terminal acquires the mode modification instruction for the acquisition mode, the acquisition mode of the audio clip is changed correspondingly. For example, the terminal may acquire audio clips at a period of 10 s. The terminal can also obtain the audio clip by taking the whole sentence of lyrics as a condition. The terminal can also obtain the audio clips by taking a set number of syllables as a period.
According to some embodiments, the audio quality condition refers to a condition for determining whether the quality of the acquired audio satisfies a user's demand. The audio quality condition does not refer to a certain fixed condition. When the terminal acquires the condition modification instruction for the audio quality condition, the audio quality condition changes correspondingly.
According to some embodiments, the prompt information refers to information for prompting a user that the current audio clip does not satisfy the audio quality condition and needs to be re-acquired. The hint information does not refer to a fixed information. When the terminal acquires the information modification instruction aiming at the prompt information, the prompt information changes correspondingly. For example, when the content of the presentation information changes, the presentation information also changes, and when the presentation mode of the presentation information changes, the presentation information also changes.
It should be noted that, when the user sings, the terminal may collect the received audio data. In the process of collecting audio data, the terminal can judge whether the current audio clip meets the audio quality condition. When the terminal judges that the current audio clip does not meet the audio quality condition, the terminal can send prompt information aiming at the current audio clip, so that the terminal can prompt a user to acquire the current audio clip again.
In step S12, if a confirmation instruction for the prompt information is received, acquiring the reacquisition duration information corresponding to the current audio clip;
according to some embodiments, the confirmation instruction refers to an instruction issued when the user confirms to re-collect the current audio clip when the user receives the prompt message issued by the terminal. The confirmation instruction does not specifically refer to a fixed confirmation instruction. The confirmation instructions include, but are not limited to, click confirmation instructions, voice confirmation instructions, and the like. When the terminal acquires the modification instruction aiming at the confirmation instruction, the confirmation instruction is changed correspondingly. For example, when the user clicks the confirmation button, the terminal may re-capture the current audio clip. Alternatively, the terminal may also recapture the current audio clip when the user says "confirm recapture" voice information.
In some embodiments, the re-acquiring the duration information refers to duration information acquired by the terminal according to the current audio clip when the terminal determines that the current audio clip does not satisfy the audio quality condition. The reacquisition duration information does not refer to a fixed information. For example, when the duration of the current audio piece changes, the reacquisition duration information also changes. When the manner of acquiring the reacquired time length information according to the current audio clip changes, the reacquired time length information also changes.
It is easy to understand that when the terminal prompts the user to re-acquire the current audio clip, the terminal may obtain a confirmation instruction issued by the user to confirm to re-acquire the current audio clip. When the terminal acquires the confirmation instruction, the terminal can acquire the reacquisition duration information corresponding to the current audio clip.
In step S13, based on the re-acquisition duration information, the audio clip re-acquired for the current audio clip is acquired to acquire audio data satisfying the audio quality condition.
According to some embodiments, the re-collected audio clip refers to audio data collected by the terminal when the user re-sings based on the re-collection duration information. The recaptured audio segment does not refer specifically to a fixed segment. For example, when the reacquisition duration information changes, the reacquisition audio piece may change. When the content of the current audio clip changes, the reacquired audio clip also changes.
In some embodiments, the audio data satisfying the audio quality condition refers to the audio data when all audio segments in the audio data received by the terminal satisfy the audio quality condition. The audio data does not refer to a fixed audio data. For example, when a song sung by the user changes, the audio data also changes. When a user singing a song changes, the audio data also changes.
It is easy to understand that, when the terminal acquires the reacquisition duration information corresponding to the current audio clip, the terminal may acquire the audio clip reacquired for the current audio clip based on the reacquisition duration information until all the audio clips satisfy the audio quality condition. Therefore, the terminal can acquire the audio data meeting the audio quality condition.
In some or related embodiments, in the audio data acquisition process, if it is determined that the current audio segment does not satisfy the audio quality condition, prompt information for the current audio segment is sent out, and the prompt information is used for prompting to acquire the current audio segment again; if a confirmation instruction aiming at the prompt message is received, acquiring the reacquiring duration information corresponding to the current audio clip; and acquiring the audio clip which is acquired again aiming at the current audio clip based on the re-acquisition duration information so as to acquire the audio data meeting the audio quality condition. Therefore, when the terminal detects that the audio frequency segment does not meet the requirement, the user can sing the segment which does not meet the requirement again without starting to sing all audio frequencies from the beginning, the convenience of audio data acquisition can be improved, the efficiency of audio data acquisition is improved, and the use experience of the user can be improved.
FIG. 4 is a flow chart illustrating a method of audio data acquisition according to an exemplary embodiment. As shown in fig. 4, the audio data acquisition method may be applied to a multimedia scene, for example, and may be implemented by a computer program and may be run on a device including the audio data acquisition method, and the audio data acquisition method includes the following steps:
in step S21, a selection instruction input for the audio quality condition set is acquired, and an audio quality condition corresponding to the audio data acquisition process is determined based on the selection instruction;
the specific process is as described above, and is not described herein again.
According to some embodiments, when a user singing a song changes, the audio quality condition required by the user may also change. Thus, at least two audio quality conditions are set in the terminal for the user to select. These at least two audio quality conditions are put into the same set, resulting in a set of audio quality conditions. The set of audio quality conditions does not refer to a fixed set. For example, when the number of audio quality conditions changes, the set of audio quality conditions also changes. When the type of audio quality condition changes, the set of audio quality conditions also changes.
In some embodiments, the audio quality condition may be, for example, a set distance between audio data sampling points of the actual singing parameter and the target singing parameter. The audio quality condition may also be, for example, a set distance between the actual singing parameter and the target singing parameter voicing curve. The audio quality condition may also be, for example, a set distance between the actual singing parameter and the fundamental frequency sequence of the target singing parameter. The audio quality condition may also be, for example, a threshold of coincidence of the actual singing parameter and the target singing parameter.
In some embodiments, the actual singing parameters refer to parameters of a received audio clip collected by the terminal when the user sings a song. The actual singing parameter is not specifically referred to as a fixed parameter. For example, when a user singing a song changes, the actual singing parameters may also change. When the song selected by the user changes, the actual singing parameters may also change. The actual singing parameters include, but are not limited to, the frequency of the sound and the time corresponding to the frequency, the voicing curve, the fundamental frequency sequence, etc.
In some embodiments, the target singing parameter refers to a singing parameter stored in the terminal. The target singing parameter is not particularly limited to a fixed parameter. When the terminal acquires a parameter modification instruction for the target singing parameter, the target singing parameter changes correspondingly. For example, when a song selected by the user changes, the target singing parameter may also change. When the current audio segment corresponding to the target singing parameter changes, the target singing parameter can also change. The target singing parameters include, but are not limited to, the frequency of the sound and the time corresponding to the frequency, the sound level curve, the fundamental frequency sequence, etc.
For example, the set of audio quality conditions may include that the distance between the audio data sampling point of the actual singing parameter of 3 consecutive syllables and the target singing parameter exceeds 1mm, the distance between the audio data sampling point of the actual singing parameter of 4 consecutive syllables and the target singing parameter exceeds 3mm, and the distance between the data sampling point of the actual singing parameter of 5 consecutive syllables and the target singing parameter exceeds 5 mm. The audio quality condition set can further comprise that the distance between the acoustic flat curve of the target singing parameter and the actual singing parameter of the continuous 3 syllables exceeds 1mm, the distance between the acoustic flat curve of the target singing parameter and the actual singing parameter of the continuous 4 syllables exceeds 3mm, and the distance between the acoustic flat curve of the target singing parameter and the actual singing parameter of the continuous 5 syllables exceeds 5 mm. The set of audio quality conditions may further comprise that the distance between the actual singing parameter of consecutive 3 syllables and the fundamental frequency sequence of the target singing parameter exceeds 1mm, the distance between the actual singing parameter of consecutive 4 syllables and the fundamental frequency sequence of the target singing parameter exceeds 3mm, and the distance between the actual singing parameter of consecutive 5 syllables and the fundamental frequency sequence of the target singing parameter exceeds 5 mm. The audio quality condition set may further include that the coincidence degree of the actual singing parameter and the target singing parameter is higher than 90%, the coincidence degree of the actual singing parameter and the target singing parameter is higher than 85%, and the coincidence degree of the actual singing parameter and the target singing parameter is higher than 80%.
In some embodiments, the selection instruction input for the set of audio quality conditions refers to an instruction issued when a user sings a song to select a desired audio quality condition from the set of audio quality conditions. The select instruction does not refer to a fixed select instruction. The selection instruction includes, but is not limited to, a click confirmation instruction, a voice confirmation instruction, and the like. When the terminal acquires the modification instruction aiming at the selection instruction, the selection instruction is changed correspondingly. For example, when the user clicks a key corresponding to the audio quality condition, the terminal may acquire a selection instruction corresponding to the audio quality condition. Alternatively, when the user speaks the voice information of "select first condition", the terminal may also acquire a selection instruction corresponding to the audio quality condition.
It is easily understood that when the user sings a song, the user may select a desired audio quality condition among the set of audio quality conditions and transmit a selection instruction. When the terminal acquires the selection instruction input for the audio quality condition set, the terminal may determine the audio quality condition corresponding to the audio data acquisition process based on the selection instruction.
In step S22, in the audio data collection process, if it is determined that the current audio clip does not satisfy the audio quality condition, a prompt message for the current audio clip is issued;
the specific process is as described above, and is not described herein again.
According to some embodiments, when a user sings a song, the terminal may acquire a current audio clip and acquire an actual singing parameter and a target singing parameter corresponding to the current audio clip. When the terminal acquires the actual singing parameter and the target singing parameter corresponding to the current audio clip, the terminal can judge whether the coincidence degree of the actual singing parameter and the target singing parameter is lower than the coincidence degree threshold value. If the terminal determines that the coincidence degree of the actual singing parameter and the target singing parameter is lower than the coincidence degree threshold, the terminal may send a prompt message for the current audio clip, and an exemplary schematic diagram of the terminal interface may be as shown in fig. 5. Therefore, the terminal can judge whether the current audio clip meets the audio quality condition based on the contact ratio, and the accuracy of judging the current audio clip can be improved.
For example, when the user sings and selects the coincidence threshold of 80%, the terminal acquires an audio clip of 10 s. And, the level curve corresponding to the 10s audio segment and the target level curve are acquired. The terminal judges that the coincidence degree of the sound level curves of the actual singing parameters and the target singing parameters is 70% and is less than the coincidence degree threshold of 80%. At this time, the terminal may issue a prompt message for the 10s audio clip.
In some embodiments, the manner in which the terminal obtains the target singing parameters includes, but is not limited to, obtaining from the singing parameters stored in a Musical Instrument Digital Interface (MIDI) file, obtaining from a cloud server, and obtaining from an internal storage module.
It is easily understood that the terminal may collect received audio data when the user sings a song. In the process of collecting audio data, the terminal can judge whether the current audio clip meets the audio quality condition. When the terminal judges that the current audio clip does not meet the audio quality condition, the terminal can send prompt information aiming at the current audio clip, so that the terminal can prompt a user to acquire the current audio clip again.
In step S23, if a confirmation instruction for the prompt information is received, audio acquisition time information corresponding to the current audio clip is displayed;
the specific process is as described above, and is not described herein again.
According to some embodiments, the manner in which the terminal presents the audio capture time information corresponding to the current audio clip includes, but is not limited to, full screen presentation, cross screen presentation, half-window presentation at the beginning, floating window presentation, and the like. The presentation mode of the audio acquisition time information includes but is not limited to a curve mode, a digital mode and the like. Therefore, the display mode of the audio acquisition time information can be enriched, and the convenience of use of the user can be improved.
In some embodiments, when the terminal displays the audio capture time information corresponding to the current audio clip through the floating window, the terminal may generate the floating window on the display interface and display the audio capture time information corresponding to the current audio clip in the floating window.
In some embodiments, floating a window refers to a window that can float over a single page or multiple pages. The floating window is not specific to a fixed floating window. For example, when the current audio clip changes, the content of the floating window may also change. When the audio acquisition time information changes, the content of the floating window may also change. For example, a user may move the floating window and adjust the size of the floating window.
In some embodiments, when the terminal displays the audio capture time information corresponding to the current audio clip through the bottom half window, the terminal may display the audio capture time information corresponding to the current audio clip on a sub-display interface of the display interface.
In some embodiments, when the sub-display interface cannot display all the audio capture time information corresponding to the current audio clip, the terminal may set a scroll bar, and display all the audio capture time information corresponding to the current audio clip through the scroll bar. And displaying all audio acquisition time information corresponding to the current audio clip by adjusting the size of the sub-display interface.
It is easy to understand that when the terminal receives the confirmation instruction for the prompt information, the terminal may present the audio capture time information corresponding to the current audio clip.
In step S24, a selection instruction input for the audio capture time information is acquired;
according to some embodiments, the audio capture time information refers to audio capture time information corresponding to the current audio clip. The audio capture time information does not refer to a fixed information. For example, when the duration of the current audio segment changes, the audio capture time information also changes. When the mode that the terminal acquires the audio acquisition time information according to the current audio clip changes, the reacquiring duration information also changes.
For example, the audio capture time information may be equal to the duration information of the current audio piece. The audio capture time information may also be greater than the duration information of the current audio clip. The audio acquisition time information may also be duration information of the entire piece of audio data including the current audio clip.
According to some embodiments, the selection instruction input for the audio acquisition time information refers to an instruction issued by selecting the re-acquisition duration information when the user acquires the audio acquisition time information corresponding to the current audio clip displayed by the terminal. The select instruction does not refer to a fixed select instruction. The selection instruction includes, but is not limited to, a click selection instruction, a voice selection instruction, a drag selection instruction, and the like. When the terminal acquires the modification instruction aiming at the selection instruction, the selection instruction is changed correspondingly.
For example, when the user controls to change the size of the selection box and drag the selection box to any one region in the audio capturing time information, the terminal may acquire the re-capturing time length information corresponding to the selection box, as shown in fig. 6. Or when the user clicks any position of the display page of the audio acquisition time information, the terminal determines that the clicking position is the starting position of the re-acquisition time length information, and when the user clicks at another position for the second time, the terminal determines that the position clicked for the second time is the ending position of the re-acquisition time length information.
It is easy to understand that the user can select the re-acquisition duration information and issue a selection instruction based on the audio acquisition time information corresponding to the current audio clip displayed by the terminal. The terminal may acquire a selection instruction input for the audio capture time information.
In step S25, based on the selection instruction, acquiring reacquisition duration information corresponding to the current audio clip;
it is easy to understand that, when the terminal acquires the selection instruction input for the audio capture time information, the terminal may acquire the re-capture duration information corresponding to the current audio clip based on the selection instruction. For example, when the user drags the selection box to the 5th to 7 th syllables of the audio capturing time information, the terminal may acquire the re-capturing time period information corresponding to the 5th to 7 th syllables.
In step S26, based on the reacquisition duration information, acquiring an audio clip reacquired for the current audio clip;
the specific process is as described above, and is not described herein again.
According to some embodiments, when the user selects the re-acquisition duration information and sends a selection instruction based on the audio acquisition time information corresponding to the current audio clip displayed by the terminal, the terminal may acquire the acquisition instruction. Based on the acquisition instruction, the terminal can expand the sing-over fragment page. Thus, the terminal can acquire the audio clip recaptured for the current audio clip based on the chorus clip page.
In some embodiments, the acquisition instruction refers to an instruction issued when the user selects the re-acquisition duration information and issues a selection instruction, and the user sings the audio clip corresponding to the selected re-acquisition duration information. The acquisition instruction does not refer to a fixed acquisition instruction. The capture instructions include, but are not limited to, click capture instructions, voice capture instructions, and the like. When the terminal acquires the modification instruction aiming at the acquisition instruction, the acquisition instruction is changed correspondingly. For example, when the user clicks the sing-over segment button, the terminal may expand the sing-over segment page, as shown in fig. 6. When the user speaks the "chong segment," the terminal may also expand the chong segment page.
In some embodiments, the chorus fragment page does not refer specifically to a FixedPage. For example, when the reacquisition duration information changes, the singing segment page changes. The display mode of the re-singing segment page includes, but is not limited to, full-screen display, horizontal-screen display, half-window display at the beginning, floating-window display and the like. The content of the re-singing segment page includes, but is not limited to, lyrics corresponding to the audio capture time information, time corresponding to the audio capture time information, a start button, a pause button, and a finish button. When the user clicks the completion button, the terminal may acquire the audio clip re-captured for the current audio clip, as shown in fig. 7.
It is easy to understand that, when the terminal acquires the re-acquisition duration information corresponding to the current audio clip, the terminal may acquire the audio clip re-acquired for the current audio clip based on the re-acquisition duration information.
In step S27, if it is determined that the re-captured audio clip satisfies the audio quality condition, the current audio clip is replaced with the re-captured audio clip.
According to some embodiments, when the terminal replaces the current audio clip with the re-collected audio clip, the audio clip that needs to be re-collected is deleted based on the re-collection duration information. At the same time, the recaptured audio clip is placed in the deleted location of the audio clip that needs to be recaptured. And selecting the position with the weakest audio waveform at the splicing position for seamless splicing. Ensure the alignment of similar waveforms and form a seamless auditory sensation.
It is easily understood that when the terminal determines that the re-captured audio clip satisfies the audio quality condition, the terminal may replace the current audio clip with the re-captured audio clip. Until all audio segments satisfy the audio quality condition. Therefore, the terminal can acquire the audio data meeting the audio quality condition.
In some or related embodiments, a selection instruction input for the set of audio quality conditions is obtained, and an audio quality condition corresponding to the audio data acquisition process is determined based on the selection instruction; therefore, the corresponding audio quality condition can be determined based on the selection of the user, and the convenience of the user can be improved. In the audio data acquisition process, if the current audio clip is determined not to meet the audio quality condition, sending prompt information aiming at the current audio clip; therefore, in the process of singing by a user, prompt information aiming at the current audio clip can be sent out, and the convenience of audio data acquisition is improved. If a confirmation instruction aiming at the prompt information is received, displaying audio acquisition time information corresponding to the current audio clip; acquiring a selection instruction input aiming at the audio acquisition time information; acquiring the reacquiring duration information corresponding to the current audio clip based on the selection instruction; therefore, the re-acquisition duration information corresponding to the current audio clip needing to be sung can be determined based on the selection of the user, and the use convenience of the user is improved. Finally, acquiring the audio clip which is acquired again aiming at the current audio clip based on the acquisition time length information; if the reacquired audio clip is determined to meet the audio quality condition, replacing the current audio clip with the reacquired audio clip; therefore, the user can sing the segments which do not meet the requirements again without the need of the user to sing all the audio from the beginning, the convenience of audio data acquisition can be improved, the efficiency of audio data acquisition can be improved, and the use experience of the user can be improved.
FIG. 8 is a flow chart illustrating a method of audio data acquisition according to an exemplary embodiment. As shown in fig. 8, the audio data acquisition method may be applied to a multimedia scene, for example, and may be implemented by a computer program and may be run on a device including the audio data acquisition method, and the audio data acquisition method includes the following steps:
in step S31, in the audio data collection process, if it is determined that the current audio clip does not satisfy the audio quality condition, a prompt message for the current audio clip is issued;
the specific process is as described above, and is not described herein again.
In step S32, if a confirmation instruction for the prompt information is received, acquiring the reacquisition duration information corresponding to the current audio clip;
the specific process is as described above, and is not described herein again.
In step S33, based on the reacquisition duration information, acquiring an audio clip reacquired for the current audio clip;
the specific process is as described above, and is not described herein again.
In step S34, if a listening trial instruction for the newly acquired audio clip is received, playing the newly acquired audio clip;
according to some embodiments, the audition instructions refer to instructions issued when the user needs to audition the re-captured audio segment for the current audio segment. The audition instruction does not refer to a fixed audition instruction. The audition instructions include, but are not limited to, click audition instructions, voice audition instructions, and the like. When the terminal acquires the modification instruction aiming at the audition instruction, the audition instruction changes correspondingly. For example, when the user clicks an audition key, the terminal may audite the re-captured audio clip for the current audio clip. Alternatively, when the user says "listen on trial", the terminal may also listen on trial for the newly captured audio piece for the current audio piece.
In some embodiments, when the terminal receives the audition instructions for the re-captured audio segment, the terminal may present a segment audition page. The section listening trial page does not refer to a fixed page. For example, when the reacquired audio clip changes, the clip listening page changes. The presentation mode of the clip listening page includes, but is not limited to, full screen presentation, horizontal screen presentation, half-window presentation at the beginning, floating window presentation, and the like. The content of the segment listening trial page includes, but is not limited to, lyrics corresponding to the re-captured audio segment, time corresponding to the re-captured audio segment, start key, pause key, re-recording key, and overlay key. When the user clicks the overlay key, the terminal may acquire a newly-acquired audio clip for the current audio clip and replace the current audio clip with the newly-acquired audio clip, as shown in fig. 9. When the user clicks the re-record key, the terminal may present the sing-all page to re-capture the audio clip again.
In some embodiments, while the terminal plays the recaptured audio clip, the terminal may, for example, only play the recaptured audio clip. Audio larger than the re-captured audio clip may also be played to ensure a seamless listening experience.
It is easy to understand that when the terminal acquires the audio clip recaptured for the current audio clip, the user may issue a listening trial instruction for the audio clip recaptured for the current audio clip. When the terminal receives the audition instruction aiming at the recaptured audio clip, the terminal can play the recaptured audio clip.
In step S35, if a confirmation instruction for the recaptured audio piece is received, the current audio piece is replaced with the recaptured audio piece.
The specific process is as described above, and is not described herein again.
According to some embodiments, the confirmation instruction for the recaptured audio clip refers to an instruction issued to the terminal when the user determines that the recaptured audio clip satisfies the requirements while the user is listening to the recaptured audio clip on trial. The confirmation instruction does not specifically refer to a fixed confirmation instruction. The confirmation instructions include, but are not limited to, click confirmation instructions, voice confirmation instructions, and the like. When the terminal acquires the modification instruction aiming at the confirmation instruction, the confirmation instruction is changed correspondingly. For example, when the user clicks the overlay key, the terminal may replace the current audio clip with the re-captured audio clip. Alternatively, when the user says "override," the terminal may also replace the current audio clip with the re-captured audio clip.
In some or related embodiments, in the audio data collection process, if it is determined that the current audio clip does not satisfy the audio quality condition, a prompt message for the current audio clip is issued; therefore, in the process of singing by a user, prompt information aiming at the current audio clip can be sent out, and the convenience of audio data acquisition is improved. If a confirmation instruction aiming at the prompt message is received, acquiring the reacquiring duration information corresponding to the current audio clip; acquiring a newly acquired audio clip for the current audio clip based on the newly acquired duration information; therefore, the re-acquisition duration information corresponding to the current audio clip needing to be sung can be determined based on the selection of the user, and the use convenience of the user is improved. Finally, if an audition instruction aiming at the audio clip which is collected again is received, the audio clip which is collected again is played; if a confirmation instruction aiming at the re-collected audio clip is received, replacing the current audio clip with the re-collected audio clip; therefore, the user can listen to the newly collected audio clip in an audition mode, the convenience of audio data acquisition can be improved, the efficiency of audio data acquisition is improved, and the use experience of the user can be improved.
FIG. 10 is a block diagram illustrating an apparatus for audio data acquisition, according to an example embodiment. Referring to fig. 10, the audio data acquisition apparatus 1000 includes a quality determination unit 1001, an information acquisition unit 1002, and an audio capture unit 1003.
A quality determination unit 1001 configured to, during audio data acquisition, if it is determined that the current audio segment does not satisfy the audio quality condition, send out prompt information for the current audio segment, where the prompt information is used to prompt to reacquire the current audio segment;
the information acquisition unit 1002 is configured to execute, if a confirmation instruction for the prompt information is received, acquiring reacquisition duration information corresponding to the current audio clip;
an audio acquisition unit 1003 configured to perform acquiring, based on the re-acquisition duration information, an audio clip re-acquired for the current audio clip to acquire audio data satisfying the audio quality condition.
According to some embodiments, fig. 11 is a block diagram illustrating an apparatus for audio data acquisition, according to an example embodiment. Referring to fig. 11, the audio data acquisition apparatus 1000 further includes a condition selection unit 1004 configured to perform, before issuing prompt information for a current audio clip, specifically:
the selection instruction input for the audio quality condition set is acquired, and the audio quality condition corresponding to the audio data acquisition process is determined based on the selection instruction.
According to some embodiments, fig. 12 is a block diagram illustrating an apparatus for audio data acquisition, according to an example embodiment. Referring to fig. 12, the audio data acquiring apparatus 1000 further includes an audio replacing unit 1005 configured to, after acquiring the audio clip recaptured for the current audio clip based on the recapture duration information, specifically configured to perform:
and if the reacquired audio clip is determined to meet the audio quality condition, replacing the current audio clip with the reacquired audio clip.
According to some embodiments, fig. 13 is a block diagram illustrating an apparatus for audio data acquisition, according to an example embodiment. Referring to fig. 13, the audio data acquisition apparatus 1000 further includes an audio listening trial unit 1006 and an audio confirmation unit 1007, and is configured to perform, after acquiring an audio section re-acquired for the current audio section based on the re-acquisition duration information:
an audio listening trial unit 1006 configured to execute playing the reacquired audio clip if an audio listening trial instruction for the reacquired audio clip is received;
an audio confirmation unit 1007 configured to execute replacing the current audio clip with the newly captured audio clip if a confirmation instruction for the newly captured audio clip is received.
According to some embodiments, fig. 14 is a block diagram illustrating an apparatus for audio data acquisition, according to an example embodiment. Referring to fig. 14, the quality determination unit 1001 includes a section acquisition sub-unit 1011, a parameter acquisition sub-unit 1021, and an information presentation sub-unit 1031, and is configured to perform, when it is determined that the current audio section does not satisfy the audio quality condition, issuing presentation information for the current audio section:
a section acquiring subunit 1011 configured to perform acquiring a current audio section;
a parameter obtaining subunit 1021, configured to perform obtaining of an actual singing parameter and a target singing parameter corresponding to the current audio segment, where the target singing parameter is a singing parameter stored in a musical instrument digital interface MIDI file;
and an information prompt subunit 1031 configured to execute issuing prompt information for the current audio segment if the coincidence degree of the actual singing parameter and the target singing parameter is lower than the coincidence degree threshold.
According to some embodiments, fig. 15 is a block diagram illustrating an apparatus for audio data acquisition, according to an example embodiment. Referring to fig. 15, the information obtaining unit 1002 includes an information presentation subunit 1012, an instruction obtaining subunit 1022, and a duration obtaining subunit 1032, and is configured to execute, when receiving a confirmation instruction for the prompt information, obtaining the reacquisition duration information corresponding to the current audio clip:
an information presentation subunit 1012 configured to perform, if a confirmation instruction for the prompt information is received, presenting audio acquisition time information corresponding to the current audio clip;
an instruction acquisition subunit 1022 configured to execute acquisition of a selection instruction input for the audio capture time information;
and a duration obtaining subunit 1032 configured to execute obtaining, based on the selection instruction, the re-acquisition duration information corresponding to the current audio piece.
According to some embodiments, the information presentation subunit 1012, when being configured to perform presentation of audio capture time information corresponding to the current audio piece, is specifically configured to perform:
generating a floating window on a display interface, and displaying audio acquisition time information corresponding to the current audio clip in the floating window;
or
And displaying the audio acquisition time information corresponding to the current audio clip on a sub-display interface of the display interface.
With regard to the apparatus in the above-described embodiment, the specific manner in which each module performs the operation has been described in detail in the embodiment related to the method, and will not be elaborated here.
In some or related embodiments, the quality determination unit determines whether the current audio segment meets the audio quality condition during the audio data acquisition process, and if it is determined that the current audio segment does not meet the audio quality condition, a prompt message for the current audio segment is sent out, and the prompt message is used for prompting to acquire the current audio segment again; if the information acquisition unit receives a confirmation instruction aiming at the prompt information, acquiring the reacquiring duration information corresponding to the current audio clip; the audio acquisition unit acquires an audio clip acquired again for the current audio clip based on the re-acquisition duration information to acquire audio data meeting the audio quality condition. Therefore, when the terminal detects that the audio frequency segment does not meet the requirement, the user can sing the segment which does not meet the requirement again without starting to sing all audio frequencies from the beginning, the convenience of audio data acquisition can be improved, the efficiency of audio data acquisition is improved, and the use experience of the user can be improved.
Referring to fig. 16, a block diagram of a terminal is shown in accordance with an example embodiment. As shown in fig. 16, the terminal 1600 may include: at least one processor 1601, at least one network interface 1604, a user interface 1603, memory 1605, at least one communication bus 1602.
Wherein a communication bus 1602 is used to enable connective communication between these components.
Where the user interface 1603 may include speakers and a display screen, the optional user interface 1603 may also include standard wired, wireless interfaces.
The network interface 1604 may optionally comprise a standard wired interface, a wireless interface (e.g., WI-FI interface), among others.
Processor 1601 may include one or more processing cores, among other things. The processor 1601 interfaces various components throughout the terminal 1600 using various interfaces and lines to perform various functions and process data of the terminal 1600 by executing or executing instructions, programs, code sets, or instruction sets stored in the memory 1605, as well as invoking data stored in the memory 1605. Optionally, the processor 1601 may be implemented in at least one hardware form of Digital Signal Processing (DSP), Field-Programmable Gate Array (FPGA), and Programmable Logic Array (PLA). The processor 1601 may integrate one or more of a Central Processing Unit (CPU), a Graphics Processing Unit (GPU), a modem, and the like. Wherein, the CPU mainly processes an operating system, a user interface, an application program and the like; the GPU is used for rendering and drawing the content required to be displayed by the display screen; the modem is used to handle wireless communications. It is to be understood that the modem may not be integrated into the processor 1601, but may be implemented by a single chip.
The Memory 1605 may include a Random Access Memory (RAM) or a Read-Only Memory (Read-Only Memory). Optionally, the memory 1605 includes a non-transitory computer-readable storage medium. The memory 1605 may be used to store instructions, programs, code sets, or instruction sets. The memory 1605 may include a stored program area and a stored data area, wherein the stored program area may store instructions for implementing an operating system, instructions for at least one function (such as a touch function, a sound playing function, an image playing function, etc.), instructions for implementing the various method embodiments described above, and the like; the storage data area may store data and the like referred to in the above respective method embodiments. The memory 1605 may alternatively be at least one memory device located remotely from the processor 1601 as previously described. As shown in fig. 16, a memory 1605, which is a kind of computer storage medium, may include therein an operating system, a network communication module, a user interface module, and an application program for audio data acquisition.
In the terminal 1600 shown in fig. 16, the user interface 1603 is mainly used to provide an input interface for a user to obtain data input by the user; and the processor 1601 may be used to invoke an application for audio data retrieval stored in the memory 1605 and specifically perform the steps in the method embodiments of fig. 3-10.
Correspondingly, the embodiment of the application also provides a computer readable storage medium storing the computer program. The computer-readable storage medium stores a computer program, and the computer program, when executed by one or more processors, causes the one or more processors to perform the steps in the method embodiments of fig. 3-9.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
In a typical configuration, a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
The memory may include forms of volatile memory in a computer readable medium, Random Access Memory (RAM) and/or non-volatile memory, such as Read Only Memory (ROM) or flash memory (flash RAM). Memory is an example of a computer-readable medium.
Computer-readable media, including both non-transitory and non-transitory, removable and non-removable media, may implement information storage by any method or technology. The information may be computer readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), Static Random Access Memory (SRAM), Dynamic Random Access Memory (DRAM), other types of Random Access Memory (RAM), Read Only Memory (ROM), Electrically Erasable Programmable Read Only Memory (EEPROM), flash memory or other memory technology, compact disc read only memory (CD-ROM), Digital Versatile Discs (DVD) or other optical storage, magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information that can be accessed by a computing device. As defined herein, a computer readable medium does not include a transitory computer readable medium such as a modulated data signal and a carrier wave.
It is noted that, in this document, relational terms such as "first" and "second," and the like, may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.
The foregoing are merely exemplary embodiments of the present application and are presented to enable those skilled in the art to understand and practice the present application. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the application. Thus, the present application is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.
Other embodiments of the disclosure will be apparent to those skilled in the art from consideration of the specification and practice of the disclosure disclosed herein. This application is intended to cover any variations, uses, or adaptations of the disclosure following, in general, the principles of the disclosure and including such departures from the present disclosure as come within known or customary practice within the art to which the disclosure pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the disclosure being indicated by the following claims.
It will be understood that the present disclosure is not limited to the precise arrangements described above and shown in the drawings and that various modifications and changes may be made without departing from the scope thereof. The scope of the present disclosure is limited only by the appended claims.

Claims (10)

1. An audio data acquisition method, comprising:
in the audio data acquisition process, if the current audio clip is determined not to meet the audio quality condition, sending prompt information aiming at the current audio clip, wherein the prompt information is used for prompting to acquire the current audio clip again;
if a confirmation instruction aiming at the prompt message is received, acquiring the reacquiring duration information corresponding to the current audio clip;
and acquiring the audio clip which is acquired again aiming at the current audio clip based on the re-acquisition duration information so as to acquire the audio data meeting the audio quality condition.
2. The method of claim 1, further comprising, prior to said issuing the cue information for the current audio segment:
the method comprises the steps of obtaining a selection instruction input aiming at an audio quality condition set, and determining an audio quality condition corresponding to an audio data acquisition process based on the selection instruction.
3. The method of claim 1, further comprising, after the obtaining of the audio clip recaptured for the current audio clip based on the recapture duration information:
and if the reacquired audio clip is determined to meet the audio quality condition, replacing the current audio clip with the reacquired audio clip.
4. The method of claim 1, wherein after obtaining the audio clip recaptured for the current audio clip based on the recapture duration information, further comprising:
if an audition instruction aiming at the reacquired audio clip is received, playing the reacquired audio clip;
and if a confirmation instruction aiming at the re-collected audio clip is received, replacing the current audio clip with the re-collected audio clip.
5. The method of claim 1, wherein issuing a prompt for the current audio segment if it is determined that the current audio segment does not satisfy the audio quality condition comprises:
acquiring a current audio clip;
acquiring an actual singing parameter and a target singing parameter corresponding to the current audio clip, wherein the target singing parameter is a singing parameter stored in a Musical Instrument Digital Interface (MIDI) file;
and if the coincidence degree of the actual singing parameter and the target singing parameter is lower than the coincidence degree threshold value, sending out prompt information aiming at the current audio clip.
6. The method according to claim 1, wherein the obtaining of the reacquisition duration information corresponding to the current audio clip if the confirmation instruction for the prompt information is received comprises:
if a confirmation instruction aiming at the prompt information is received, displaying audio acquisition time information corresponding to the current audio clip;
acquiring a selection instruction input aiming at the audio acquisition time information;
and acquiring the reacquisition duration information corresponding to the current audio clip based on the selection instruction.
7. The method of claim 6, wherein presenting audio capture time information corresponding to the current audio clip comprises:
generating a floating window on a display interface, and displaying audio acquisition time information corresponding to the current audio clip in the floating window;
or
And displaying the audio acquisition time information corresponding to the current audio clip on a sub-display interface of the display interface.
8. An audio data acquisition apparatus, comprising:
the quality judgment unit is configured to execute in the audio data acquisition process, if the current audio clip is determined not to meet the audio quality condition, send prompt information aiming at the current audio clip, wherein the prompt information is used for prompting to acquire the current audio clip again;
the information acquisition unit is configured to execute that if a confirmation instruction aiming at the prompt information is received, the re-acquisition duration information corresponding to the current audio clip is acquired;
an audio acquisition unit configured to perform acquisition of an audio clip re-acquired for the current audio clip based on the re-acquisition duration information to acquire audio data satisfying the audio quality condition.
9. A terminal, comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; it is characterized in that the preparation method is characterized in that,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of any one of claims 1-7.
10. A non-transitory computer readable storage medium having stored thereon computer instructions for causing the computer to perform the method of any one of claims 1-7.
CN202111616161.XA 2021-12-27 2021-12-27 Audio data acquisition method and device Pending CN114387994A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111616161.XA CN114387994A (en) 2021-12-27 2021-12-27 Audio data acquisition method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111616161.XA CN114387994A (en) 2021-12-27 2021-12-27 Audio data acquisition method and device

Publications (1)

Publication Number Publication Date
CN114387994A true CN114387994A (en) 2022-04-22

Family

ID=81197475

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111616161.XA Pending CN114387994A (en) 2021-12-27 2021-12-27 Audio data acquisition method and device

Country Status (1)

Country Link
CN (1) CN114387994A (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104392731A (en) * 2014-11-30 2015-03-04 陆俊 Singing practicing method and system
CN110010162A (en) * 2019-02-28 2019-07-12 华为技术有限公司 A kind of song recordings method repairs sound method and electronic equipment
CN110601962A (en) * 2019-09-11 2019-12-20 腾讯科技(深圳)有限公司 Message prompting method, device, terminal and storage medium
CN111061405A (en) * 2019-12-13 2020-04-24 广州酷狗计算机科技有限公司 Method, device and equipment for recording song audio and storage medium
CN112309351A (en) * 2019-07-31 2021-02-02 武汉Tcl集团工业研究院有限公司 Song generation method and device, intelligent terminal and storage medium
CN113707113A (en) * 2021-08-24 2021-11-26 北京达佳互联信息技术有限公司 Method and device for modifying singing voice of user and electronic equipment

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104392731A (en) * 2014-11-30 2015-03-04 陆俊 Singing practicing method and system
CN110010162A (en) * 2019-02-28 2019-07-12 华为技术有限公司 A kind of song recordings method repairs sound method and electronic equipment
CN112309351A (en) * 2019-07-31 2021-02-02 武汉Tcl集团工业研究院有限公司 Song generation method and device, intelligent terminal and storage medium
CN110601962A (en) * 2019-09-11 2019-12-20 腾讯科技(深圳)有限公司 Message prompting method, device, terminal and storage medium
CN111061405A (en) * 2019-12-13 2020-04-24 广州酷狗计算机科技有限公司 Method, device and equipment for recording song audio and storage medium
CN113707113A (en) * 2021-08-24 2021-11-26 北京达佳互联信息技术有限公司 Method and device for modifying singing voice of user and electronic equipment

Similar Documents

Publication Publication Date Title
US7973230B2 (en) Methods and systems for providing real-time feedback for karaoke
JP5318095B2 (en) System and method for automatically beat-mixing a plurality of songs using an electronic device
CN109272975B (en) Automatic adjustment method and device for singing accompaniment and KTV jukebox
US10062367B1 (en) Vocal effects control system
JPH09244900A (en) Communication karaoke sing-along machine, host computer for communication karaoke, and communication karaoke system
JP2016157225A (en) Voice search apparatus, voice search method, and program
CN110324718A (en) Audio-video generation method, device, electronic equipment and readable medium
CN110718239A (en) Audio processing method and device, electronic equipment and storage medium
WO2017028686A1 (en) Information processing method, terminal device and computer storage medium
JP5986387B2 (en) Information processing program, information processing apparatus, lyrics display method, and communication system
CN114387994A (en) Audio data acquisition method and device
WO2018211748A1 (en) Information processing device and information processing method
JP5902119B2 (en) Karaoke device, karaoke program, and recording medium
KR20180012397A (en) Management system and method for digital sound source, device and method of playing digital sound source
JP2013213907A (en) Evaluation apparatus
CN102822887A (en) Mixing data delivery server
CN102044238B (en) Music reproducing system
KR102078336B1 (en) Karaoke player to support natural transition between original sound sources and midi sound sources, and operating method thereof
CN2909452Y (en) Electronic musical instrument for playback received musice
JP6316099B2 (en) Karaoke equipment
JP2013122561A (en) Information processing program, communication system, information processing device, and method for drawing lyric telop
CN101902527A (en) Recording method and device with mobile terminal
JP5234950B2 (en) Singing recording system
JP5551983B2 (en) Karaoke performance control system
JP4546431B2 (en) Song data providing device, song data providing method, song data extraction program

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination