CN112243151A - Audio playing control method, device, equipment and medium - Google Patents

Publication number
CN112243151A
Authority
CN
China
Prior art keywords
audio
segment
data
volume
volume loudness
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910641673.8A
Other languages
Chinese (zh)
Inventor
张彪
钟雨
黄杰锋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H04N21/4402 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439 Processing of audio elementary streams
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80 Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83 Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845 Structuring of content, e.g. decomposing content into time segments
    • H04N21/8456 Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments

Abstract

The application discloses an audio playing control method, device, equipment and medium, applied to the technical field of audio and video processing, for solving the prior-art problem of abrupt volume changes when audio and video are played. The method comprises: acquiring each audio segment of the audio to be played and the volume loudness data of each audio segment; smoothing the volume loudness data of each audio segment with the standard volume loudness range as a reference to obtain target volume loudness data of each audio segment; and playing each audio segment according to its target volume loudness data. Because the volume loudness data of each audio segment is smoothed against the standard volume loudness range before the audio is played, and playback follows the smoothed data, the volume loudness of every played audio tends toward the same loudness range, the listening experience offered to the user is approximately the same across audios, and abrupt volume changes are avoided as far as possible.

Description

Audio playing control method, device, equipment and medium
Technical Field
The present application relates to the field of audio and video processing technologies, and in particular, to an audio playing control method, apparatus, device, and medium.
Background
With the continuous development of modern science and technology, audio and video are no longer played only through devices such as radios and televisions; they can also be played through an audio and video playing client.
At present, the volume of different audios and videos is inconsistent because of differences in recording environment, recording mode and the like. The volume is therefore likely to change abruptly while an audio and video playing client plays them, which degrades the user's listening experience.
Disclosure of Invention
The embodiments of the application provide an audio playing control method, device, equipment and medium for solving the prior-art problem that the volume may change abruptly while a client plays audio and video.
The technical scheme provided by the embodiment of the application is as follows:
In one aspect, an embodiment of the present application provides an audio playing control method, including:
acquiring each audio segment of audio to be played;
acquiring volume loudness data of each audio segment, and smoothing the volume loudness data of each audio segment with a standard volume loudness range as a reference to obtain target volume loudness data of each audio segment; and
playing each audio segment according to the target volume loudness data of each audio segment.
In one possible implementation, acquiring each audio segment of the audio to be played includes:
acquiring encoded audio data according to the audio identifier of the audio;
decoding the encoded audio data to obtain the audio; and
determining the playing duration of the audio, and dividing the audio into segments of equal duration to obtain each audio segment of the audio.
In one possible implementation, acquiring each audio segment of the audio to be played includes:
acquiring encoded audio data according to the audio identifier of the audio;
determining the data size of the encoded audio data, and dividing the encoded audio data into segments of equal data size to obtain each encoded audio data segment; and
decoding each encoded audio data segment to obtain each audio segment of the audio.
In a possible implementation, before the volume loudness data of each audio segment is smoothed with the standard volume loudness range as a reference, the method further includes:
determining, based on the volume loudness data of each audio segment, that the volume loudness of each audio segment is not within the standard volume loudness range.
In a possible implementation, the audio playing control method provided in an embodiment of the present application further includes:
when the volume loudness of any audio segment is determined to be within the standard volume loudness range, playing that audio segment based on its own volume loudness data.
In a possible implementation, smoothing the volume loudness data of each audio segment with the standard volume loudness range as a reference to obtain target volume loudness data of each audio segment includes:
acquiring an initialization parameter set of a preset algorithm based on the audio identifier of the audio, where the initialization parameter set is a set of initialization parameters configured for the audio such that the average volume loudness of the audio falls within the standard volume loudness range; and
initializing the preset algorithm based on the initialization parameter set, calling the initialized preset algorithm, and smoothing the volume loudness data of each audio segment to obtain the target volume loudness data of each audio segment.
In a possible implementation, the audio playing control method provided in an embodiment of the present application further includes:
reporting the audio identifier of the audio and the target volume loudness data of each audio segment to a server.
In another aspect, an embodiment of the present application provides a volume adjustment device, including:
a segmentation processing unit, configured to acquire each audio segment of the audio to be played;
a loudness equalization unit, configured to acquire volume loudness data of each audio segment, and to smooth the volume loudness data of each audio segment with the standard volume loudness range as a reference to obtain target volume loudness data of each audio segment; and
an audio playing unit, configured to play each audio segment according to the target volume loudness data of each audio segment.
In a possible implementation, when acquiring each audio segment of the audio to be played, the segmentation processing unit is specifically configured to perform:
acquiring encoded audio data according to the audio identifier of the audio;
decoding the encoded audio data to obtain the audio; and
determining the playing duration of the audio, and dividing the audio into segments of equal duration to obtain each audio segment of the audio.
In a possible implementation, when acquiring each audio segment of the audio to be played, the segmentation processing unit is specifically configured to perform:
acquiring encoded audio data according to the audio identifier of the audio;
determining the data size of the encoded audio data, and dividing the encoded audio data into segments of equal data size to obtain each encoded audio data segment; and
decoding each encoded audio data segment to obtain each audio segment of the audio.
In a possible implementation, the volume adjustment device provided in an embodiment of the present application further includes:
a loudness detection unit, configured to determine, based on the volume loudness data of each audio segment and before the loudness equalization unit smooths that data with the standard volume loudness range as a reference, that the volume loudness of each audio segment is not within the standard volume loudness range.
In a possible implementation, the audio playing unit is further configured to:
when the loudness detection unit determines that the volume loudness of any audio segment is within the standard volume loudness range, play that audio segment based on its own volume loudness data.
In a possible implementation, when smoothing the volume loudness data of each audio segment with the standard volume loudness range as a reference to obtain the target volume loudness data of each audio segment, the loudness equalization unit is specifically configured to perform:
acquiring an initialization parameter set of a preset algorithm based on the audio identifier of the audio, where the initialization parameter set is a set of initialization parameters configured for the audio such that the average volume loudness of the audio falls within the standard volume loudness range; and
initializing the preset algorithm based on the initialization parameter set, calling the initialized preset algorithm, and smoothing the volume loudness data of each audio segment to obtain the target volume loudness data of each audio segment.
In a possible implementation, the volume adjustment device provided in an embodiment of the present application further includes:
a data reporting unit, configured to report the audio identifier of the audio and the target volume loudness data of each audio segment to the server.
In another aspect, an embodiment of the present application provides audio playing control equipment, including a memory, a processor, and a computer program that is stored on the memory and can run on the processor, where the processor, when executing the computer program, implements the audio playing control method provided by the embodiment of the application.
In yet another aspect, an embodiment of the present application further provides a computer-readable storage medium storing computer instructions which, when executed by a processor, implement the audio playing control method provided in the embodiment of the present application.
The beneficial effects of the embodiment of the application are as follows:
In the embodiment of the application, before each audio is played, the volume loudness data of each audio segment of the audio is smoothed with the standard volume loudness range as a reference, and the audio is played according to the smoothed volume loudness data. The average volume loudness of each played audio therefore tends toward the standard volume loudness range, so that the listening experience offered to the user is approximately the same and abrupt volume changes during audio playing are avoided as far as possible. Moreover, because each audio segment of the audio is processed separately, the playing of each audio segment and the smoothing of the volume loudness data of each audio segment can be executed in parallel. The volume loudness of the audio is thus processed in real time, improving the user's listening experience while preserving the playing efficiency of the audio.
Drawings
Fig. 1 is a schematic diagram of an application scenario of an audio playing control method according to an embodiment of the present application;
Fig. 2 is a schematic flowchart of an audio playing control method according to an embodiment of the present application;
Fig. 3A is a schematic diagram of one manner of audio segmentation according to an embodiment of the present application;
Fig. 3B is a schematic diagram of another manner of audio segmentation according to an embodiment of the present application;
Fig. 4A is a schematic diagram of a single playing effect according to an embodiment of the present application;
Fig. 4B is a schematic diagram of an overall playing effect according to an embodiment of the present application;
Fig. 5 is a schematic flowchart of a specific example of the audio playing control method according to an embodiment of the present application;
Fig. 6 is a schematic diagram of the functional structure of an audio playing control device according to an embodiment of the present application;
Fig. 7 is a schematic diagram of the hardware structure of audio playing control equipment according to an embodiment of the present application.
Detailed Description
In order to make the present application better understood by those skilled in the art, technical terms mentioned in the present application will first be explained.
1. Audio, in the present application, is either a pure audio file or the audio portion of a video file.
2. The audio identifier, in the present application, is the identifier of the audio file when the audio is a pure audio file, and is the video identifier of the video file when the audio is the audio portion of a video file.
3. The client is an application that can be installed on a terminal device such as a mobile phone, a computer, a Personal Digital Assistant (PDA), a smart television, and a vehicle-mounted device, can play audio and/or video, and supports user interaction, for example: short video players, music players, video players, etc.
4. The server is background running equipment which is used for managing audio resources and/or video resources and providing various services such as database service, computing service and the like for the client.
5. The volume loudness data of the audio segment is a parameter that characterizes the loudness level of the volume of the audio segment.
6. The EBU R128 algorithm, short for the European Broadcasting Union (EBU) R128 algorithm, is a smoothing algorithm. It smooths volume loudness data with reference to a standard volume loudness range, thereby applying gain or attenuation to the volume loudness. During smoothing it may also apply processing such as linear interpolation and true-peak limiting, so that abrupt changes in volume loudness are smoothed while the volume loudness is kept from exceeding the maximum true peak.
To make the purpose, technical solutions and advantages of the present application clearer, the technical solutions in the embodiments of the present application are described below clearly and completely with reference to the accompanying drawings. The described embodiments are obviously only some, not all, of the embodiments of the present application. All other embodiments that a person skilled in the art can derive from the embodiments given herein without creative effort fall within the protection scope of the present application.
Currently, when the volume changes abruptly while a client plays audio, becoming suddenly louder or quieter, the user usually has to adjust the volume manually. To reduce such manual adjustment, a conventional solution is for the client to acquire the volume of the audio to be played, determine the ratio between the volume of the audio and a standard volume, adjust the system volume of the terminal device according to that ratio, and play the audio at the adjusted system volume. In practice, the applicant found that the ratio between the audio volume and the standard volume has no necessary relationship with the system volume of the terminal device. Adjusting the system volume according to that ratio therefore balances the volume poorly, and the volume may still rise or fall suddenly while the client plays audio, which degrades the user's listening experience.
For this purpose, in the embodiment of the present application, referring to Fig. 1, a client 101 may be installed on a terminal device 102, and the terminal device 102 is communicatively connected to a server 103 through the internet. A user may record, edit and publish audio through the client 101, and the client 101 may send the audio published by the user to the server 103 for storage and management. The user may also play audio through the client 101. When the client 101 receives a play instruction initiated by the user, it may acquire each audio segment of the audio to be played according to the audio identifier carried by the play instruction, acquire the volume loudness data of each audio segment, smooth that data with the standard volume loudness range as a reference to obtain the target volume loudness data of each audio segment, and play each audio segment according to its target volume loudness data. Because the volume loudness data of each audio segment is smoothed against the standard volume loudness range before the audio is played, and playback follows the smoothed data, the average volume loudness of each played audio tends toward the standard volume loudness range, the listening experience offered to the user is approximately the same, and abrupt volume changes during audio playing are avoided as far as possible. In addition, because each audio segment is processed separately, the playing of each audio segment and the smoothing of its volume loudness data can be executed in parallel, so the volume loudness of the audio is processed in real time and the user's listening experience is improved while the playing efficiency of the audio is preserved.
After introducing the application scenario and the design concept of the embodiment of the present application, the following describes a technical solution provided by the embodiment of the present application.
Referring to fig. 2, a flow of an audio playing control method provided in an embodiment of the present application is as follows:
Step 201: The client acquires each audio segment of the audio to be played.
In practical application, when the client receives a play instruction initiated by a user, the client may first obtain audio coded data according to an audio identifier carried by the play instruction, and then obtain each audio segment of the audio to be played according to the audio coded data.
Specifically, the client may clear its cache on a user trigger or on a timer, so the encoded audio data is not guaranteed to be available locally. When acquiring the encoded audio data, the client may therefore first determine, according to the audio identifier carried by the play instruction, whether the encoded audio data corresponding to the audio identifier is cached in the local cache area. If it is, the client may read the encoded audio data directly from the local cache area; if it is not, the client may acquire the encoded audio data corresponding to the audio identifier from the server. Of course, the client may also acquire the encoded audio data corresponding to the audio identifier directly from the server; the specific implementation is not limited in this application.
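The cache-first lookup described above can be sketched as follows. This is a minimal illustration rather than the applicant's implementation: the class name `EncodedAudioCache`, the `fetch_from_server` callback, and the choice to store server results in the cache after fetching are all assumptions made for the example.

```python
from typing import Callable, Dict


class EncodedAudioCache:
    """Hypothetical local cache area keyed by audio identifier."""

    def __init__(self, fetch_from_server: Callable[[str], bytes]) -> None:
        self._store: Dict[str, bytes] = {}
        # Hypothetical callback that stands in for the request to the server.
        self._fetch_from_server = fetch_from_server

    def get(self, audio_id: str) -> bytes:
        # Read from the local cache area when the data is present.
        if audio_id in self._store:
            return self._store[audio_id]
        # Otherwise fall back to the server. Caching the result here is an
        # assumption; the text only says the cache may be consulted first.
        data = self._fetch_from_server(audio_id)
        self._store[audio_id] = data
        return data

    def clear(self) -> None:
        # Models the user-triggered or timer-triggered cache clearing.
        self._store.clear()
```

After `clear()` is called, the next `get()` for the same identifier goes back to the server, which is why the lookup cannot assume a cache hit.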
Specifically, the client may acquire each audio segment of the audio to be played from the encoded audio data in, but not limited to, the following modes:
Mode 1: Referring to Fig. 3A, the client may decode the encoded audio data to obtain the audio to be played and, after determining the playing duration of the audio, divide the audio into segments of equal duration to obtain each audio segment of the audio.
Mode 2: Referring to Fig. 3B, the client may determine the data size of the encoded audio data, divide the encoded audio data into segments of equal data size to obtain each encoded audio data segment, and decode each encoded audio data segment to obtain each audio segment of the audio.
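The two segmentation modes can be sketched as follows. The function names, the fixed `segment_seconds` parameter and the `segment_count` parameter are assumptions for illustration. The sketch also ignores codec frame boundaries, which a real player would normally have to respect when splitting encoded data.

```python
from typing import List, Sequence


def split_by_duration(samples: Sequence[float], sample_rate: int,
                      segment_seconds: float) -> List[Sequence[float]]:
    """Mode 1: split decoded samples into segments of equal duration.

    The last segment may be shorter when the audio length is not an
    exact multiple of the segment duration.
    """
    step = max(1, int(sample_rate * segment_seconds))
    return [samples[i:i + step] for i in range(0, len(samples), step)]


def split_by_size(encoded: bytes, segment_count: int) -> List[bytes]:
    """Mode 2: split encoded data into segments of (nearly) equal size."""
    if segment_count <= 0:
        raise ValueError("segment_count must be positive")
    base, extra = divmod(len(encoded), segment_count)
    segments: List[bytes] = []
    start = 0
    for i in range(segment_count):
        # Spread the remainder over the first `extra` segments.
        end = start + base + (1 if i < extra else 0)
        segments.append(encoded[start:end])
        start = end
    return segments
```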
Step 202: The client acquires the volume loudness data of each audio segment, and smooths the volume loudness data of each audio segment with the standard volume loudness range as a reference to obtain the target volume loudness data of each audio segment.
In the embodiment of the application, to reduce resource waste, before smoothing the volume loudness data of each audio segment against the standard volume loudness range, the client may first detect, according to the volume loudness data of each audio segment, whether the volume loudness of that audio segment is within the standard volume loudness range. When the volume loudness of an audio segment is detected to be within the standard volume loudness range, the client plays that audio segment directly based on its own volume loudness data and does not smooth it.
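The range check can be sketched as follows. The source does not say how volume loudness data is measured or what the standard range is, so both are assumptions here: loudness is approximated by the mean-square level in dB relative to full scale, which is only a stand-in for a perceptual loudness measure, and `STANDARD_RANGE` holds illustrative values.

```python
import math
from typing import Sequence, Tuple

# Assumed standard volume loudness range in dB; the values are illustrative.
STANDARD_RANGE: Tuple[float, float] = (-24.0, -22.0)


def segment_loudness_db(samples: Sequence[float],
                        floor_db: float = -100.0) -> float:
    """Mean-square level of a segment in dB (a rough loudness stand-in)."""
    if not samples:
        return floor_db
    mean_square = sum(s * s for s in samples) / len(samples)
    if mean_square <= 0.0:
        return floor_db
    return max(floor_db, 10.0 * math.log10(mean_square))


def needs_smoothing(loudness_db: float,
                    standard_range: Tuple[float, float] = STANDARD_RANGE) -> bool:
    """Return True only when the loudness lies outside the standard range."""
    low, high = standard_range
    return not (low <= loudness_db <= high)
```

A segment for which `needs_smoothing` returns False would be played with its original loudness data, which is the resource saving the paragraph above describes.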
In practical application, when detecting that the volume loudness of each audio segment is not within the standard volume loudness range, the client can perform smoothing processing on the volume loudness data of each audio segment by taking the standard volume loudness range as a reference, so as to obtain target volume loudness data of each audio segment. In the embodiment of the present application, when the client performs smoothing processing on the volume loudness data of each audio segment by using the standard volume loudness range as a reference, the following manner may be adopted, but is not limited to:
Firstly, an initialization parameter set of a preset algorithm is acquired based on the audio identifier of the audio, where the initialization parameter set is a set of initialization parameters configured for the audio such that the average volume loudness of the audio falls within the standard volume loudness range.
In this embodiment of the present application, the configuration of the initialization parameter set may be performed by the client when it is idle. In this case, the client may store the configured initialization parameter set in the local cache area and report it to the server. Alternatively, to reduce resource consumption on the client, the configuration may be performed by the server when the server first receives the audio sent by the client. In this case, the server may issue the configured initialization parameter set to the client, and the client stores the initialization parameter set in the local cache area on receiving it. The embodiment of the present application is described only by taking the case in which the server performs the configuration as an example.
Specifically, the client may clear the cache based on user triggering or timing triggering, and based on this, when the client acquires the initialization parameter set based on the audio identifier of the audio, it may first determine whether the initialization parameter set corresponding to the audio identifier is cached in the local cache region according to the audio identifier of the audio, and if it is determined that the initialization parameter set corresponding to the audio identifier is cached in the local cache region, the initialization parameter set may be directly read from the local cache region; if the initialization parameter set corresponding to the audio identifier is determined not to be cached in the local cache region, the initialization parameter set corresponding to the audio identifier can be obtained from the server; of course, the client may also directly obtain the initialization parameter set corresponding to the audio identifier from the server, and the specific implementation manner is not limited in this application.
Then, the preset algorithm is initialized based on the initialization parameter set. In the embodiment of the present application, the EBU R128 algorithm is used merely as an example of the preset algorithm.
Finally, the initialized preset algorithm is called to smooth the volume loudness data of each audio segment, obtaining the target volume loudness data of each audio segment.
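The smoothing step can be illustrated with a deliberately simplified sketch. It is not an implementation of EBU R128: it only shows the two ideas named in the definition of the algorithm, namely a gain that moves a segment toward the standard range and a linear interpolation of that gain across the segment, followed by a hard clamp that stands in for true-peak limiting. All function names and parameters are hypothetical.

```python
from typing import List, Sequence


def gain_to_range(loudness_db: float, low: float, high: float) -> float:
    """Gain in dB that moves a loudness value to the nearest edge of [low, high]."""
    if loudness_db < low:
        return low - loudness_db   # positive gain for quiet segments
    if loudness_db > high:
        return high - loudness_db  # negative gain (attenuation) for loud ones
    return 0.0


def apply_smoothed_gain(samples: Sequence[float], start_gain_db: float,
                        end_gain_db: float,
                        peak_limit: float = 1.0) -> List[float]:
    """Ramp the gain linearly across the segment, then clamp to the peak limit.

    start_gain_db would typically be the gain of the previous segment, so
    the gain changes gradually instead of jumping at the segment boundary.
    """
    n = len(samples)
    out: List[float] = []
    for i, s in enumerate(samples):
        t = i / (n - 1) if n > 1 else 1.0
        gain_db = start_gain_db + t * (end_gain_db - start_gain_db)
        value = s * 10.0 ** (gain_db / 20.0)
        # Hard clamp: a crude stand-in for a true-peak limiter.
        out.append(max(-peak_limit, min(peak_limit, value)))
    return out
```

Ramping from the previous segment's gain to the current one is a design choice of this sketch; it keeps the gain continuous at segment boundaries, which is the property that avoids an audible jump.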
In practical application, after the client obtains the target volume loudness data of each audio segment, it may report the audio identifier of the audio and the target volume loudness data of each audio segment to the server. When the audio is played again later, if the target volume loudness data of its audio segments is not cached in the local cache area, that data can be acquired from the server according to the audio identifier of the audio, and the volume loudness data of each audio segment of the audio does not need to be smoothed again, which further reduces resource waste.
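The reporting and reuse described above can be sketched as a three-level lookup: local cache area, then server, then recomputation followed by a report. `LoudnessStore` is a hypothetical in-memory stand-in for the server side, and `target_loudness_for` and its parameters are assumptions made for the example.

```python
from typing import Callable, Dict, List, Optional


class LoudnessStore:
    """Hypothetical in-memory stand-in for the server-side store."""

    def __init__(self) -> None:
        self._targets: Dict[str, List[float]] = {}

    def report(self, audio_id: str, targets: List[float]) -> None:
        self._targets[audio_id] = list(targets)

    def lookup(self, audio_id: str) -> Optional[List[float]]:
        return self._targets.get(audio_id)


def target_loudness_for(audio_id: str,
                        local_cache: Dict[str, List[float]],
                        server: LoudnessStore,
                        compute: Callable[[], List[float]]) -> List[float]:
    """Return per-segment target loudness data, smoothing only when needed."""
    cached = local_cache.get(audio_id)
    if cached is not None:
        return cached
    remote = server.lookup(audio_id)
    if remote is not None:
        local_cache[audio_id] = remote
        return remote
    # Nothing stored anywhere: run the smoothing and report the result.
    targets = compute()
    local_cache[audio_id] = targets
    server.report(audio_id, targets)
    return targets
```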
Step 203: The client plays each audio segment according to the target volume loudness data of each audio segment.
In the embodiment of the application, the client may execute the smoothing of the volume loudness data of each audio segment and the playing of each audio segment one after the other. To process the volume loudness of the audio in real time, the client may instead execute the two at the same time; for example, the client may use a first process to smooth the volume loudness data of each audio segment and a second process to play each audio segment. Executing the playing of each audio segment in parallel with the smoothing of the volume loudness data of each audio segment processes the volume loudness of the audio in real time, which preserves the playing efficiency of the audio, avoids abrupt volume changes during audio playing as far as possible, and improves the user's listening experience. The embodiment of the present application is described only by taking the case in which step 202 and step 203 are performed simultaneously as an example.
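The parallel execution of smoothing and playing can be sketched as a producer and a consumer connected by a bounded queue. The text speaks of a first process and a second process; this sketch uses threads instead to stay short, and the `smooth` and `play` callables and the `max_buffered` parameter are hypothetical.

```python
import queue
import threading
from typing import Callable, Iterable, List


def play_while_smoothing(segments: Iterable[List[float]],
                         smooth: Callable[[List[float]], List[float]],
                         play: Callable[[List[float]], None],
                         max_buffered: int = 4) -> None:
    """Smooth segments in a worker thread while the caller's thread plays them."""
    ready: queue.Queue = queue.Queue(maxsize=max_buffered)
    done = object()  # sentinel marking the end of the stream

    def smoother() -> None:
        try:
            for segment in segments:
                # Blocks when the buffer is full, so smoothing never runs
                # more than max_buffered segments ahead of playback.
                ready.put(smooth(segment))
        finally:
            ready.put(done)

    worker = threading.Thread(target=smoother, daemon=True)
    worker.start()
    while True:
        item = ready.get()
        if item is done:
            break
        play(item)  # segments are played in their original order
    worker.join()
```

The bounded queue is a design choice of the sketch: playback can start as soon as the first segment has been smoothed, and memory use stays limited for long audio.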
It is worth mentioning that, as shown in Fig. 4A and Fig. 4B, when audio is played with the audio playing control method provided by the embodiment of the present application, audio with a low average volume loudness is smoothly amplified and audio with a high average volume loudness is smoothly attenuated. The average volume loudness of each played audio thus tends toward the standard volume loudness range, the listening experience offered to the user is approximately the same, abrupt volume changes during audio playing are avoided as far as possible, and the user is given a more immersive experience.
The following describes the audio playing control method provided by the embodiment of the present application in further detail by using "the client is a short video player, and the audio is an audio portion in the short video" as a specific application scenario. Referring to fig. 5, a specific process of the audio playing control method provided in the embodiment of the present application is as follows:
step 501: the short video player receives a play request initiated by a user, wherein the play request at least carries a short video identifier requested to be played by the user.
Step 502: the short video player judges whether short video coding data corresponding to the short video identifier is cached in a local cache region; if yes, go to step 503; if not, go to step 504.
Step 503: the short video player reads the short video coding data from the local cache region and proceeds to step 508.
Step 504: the short video player carries the short video identifier in a short video acquisition request and sends the request to the server.
Step 505: the server receives a short video acquisition request sent by the short video player, and acquires short video coding data of a corresponding short video based on a short video identifier carried by the short video acquisition request.
Step 506: the server carries the obtained short video coding data in a short video acquisition response and returns the response to the short video player.
Step 507: the short video player receives the short video acquisition response returned by the server, obtains the short video coding data from the response, and proceeds to step 508.
Step 508: the short video player segments the video coding data and the audio coding data contained in the short video coding data into segments of equal data volume, obtaining each video coding data segment and each audio coding data segment.
Step 509: the short video player decodes each video coding data segment respectively to obtain each video segment, and decodes each audio coding data segment respectively to obtain each audio segment.
Step 510: the short video player judges whether an initialization parameter set corresponding to the short video identifier is cached in a local cache region; if yes, go to step 511; if not, go to step 512.
Step 511: the short video player reads the initialization parameter set from the local cache region and proceeds to step 516.
Step 512: the short video player carries the short video identifier in an initialization parameter acquisition request and sends the request to the server.
Step 513: the server receives the initialization parameter acquisition request sent by the short video player, and acquires the initialization parameter set of the corresponding short video based on the short video identifier carried by the request.
Step 514: the server carries the obtained initialization parameter set in an initialization parameter acquisition response and returns the response to the short video player.
Step 515: the short video player receives the initialization parameter acquisition response returned by the server, obtains the initialization parameter set from the response, and proceeds to step 516.
Step 516: the short video player initializes the EBU R128 algorithm based on the initialization parameter set and performs step 517 and step 518 simultaneously.
Step 517: using a first process, the short video player does not smooth the audio segments whose volume loudness is within the standard range and takes the volume loudness data of each such segment as its target volume loudness data; for each audio segment whose volume loudness is not within the standard range, it calls the initialized EBU R128 algorithm to smooth the volume loudness data of the segment and obtain its target volume loudness data. The standard loudness range may be, but is not limited to, 14-17 LUFS.
Step 518: while playing each video segment, the short video player uses a second process to synchronously play each audio segment according to the target volume loudness data of that audio segment.
Step 519: the short video player reports the short video identifier of the short video and the target volume loudness data of each audio segment to the server.
Step 520: when the short video player receives the play request initiated by the user for the short video again, the steps 502 to 509 are executed to obtain the video segments and audio segments corresponding to the short video, and then step 521 is executed.
Step 521: the short video player judges whether target volume loudness data of each audio segment corresponding to the short video is cached in the local cache region, if so, step 522 is executed; if not, go to step 523.
Step 522: the short video player reads the target volume loudness data of each audio segment corresponding to the short video from the local cache region and proceeds to step 527.
Step 523: the short video player carries the short video identifier of the short video in a target loudness acquisition request and sends the request to the server.
Step 524: the server receives the target loudness acquisition request sent by the short video player, and acquires the target volume loudness data of each audio segment of the corresponding short video based on the short video identifier carried by the request.
Step 525: the server carries the obtained target volume loudness data of each audio segment in a target loudness acquisition response and returns the response to the short video player.
Step 526: the short video player receives the target loudness acquisition response returned by the server, and obtains the target volume loudness data of each audio segment of the short video from the response.
Step 527: while playing each video segment, the short video player synchronously plays each audio segment according to the target volume loudness data of that audio segment.
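The lookups in this flow (steps 502 to 507, 510 to 515, and 521 to 526) share one pattern: check the local cache region first and ask the server only on a miss. The sketch below illustrates that pattern for the target volume loudness data. The function name, the dictionary used as the cache, and the `fetch_from_server` callable are hypothetical, and writing the server result back into the cache is an assumption that the source does not state.

```python
from typing import Callable, Dict, List


def get_target_loudness(
    video_id: str,
    cache: Dict[str, List[float]],
    fetch_from_server: Callable[[str], List[float]],
) -> List[float]:
    """Return per-segment target loudness, preferring the local cache."""
    if video_id in cache:
        # Cache hit: read the data locally (step 522).
        return cache[video_id]
    # Cache miss: request the data from the server (steps 523 to 526).
    data = fetch_from_server(video_id)
    # Assumption: keep the result so later plays can skip the request.
    cache[video_id] = data
    return data
```

On a replay of the same short video, the second call returns the cached list without contacting the server, so playback can start from previously computed target loudness data.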
Based on the foregoing embodiments, an embodiment of the present application provides a volume adjustment device. Referring to fig. 6, the volume adjustment device 600 provided in the embodiment of the present application at least includes:
a segmentation processing unit 601, configured to obtain each audio segment of an audio to be played;
a loudness equalization unit 602, configured to obtain the volume loudness data of each audio segment, and to smooth the volume loudness data of each audio segment with the standard volume loudness range as a reference, so as to obtain target volume loudness data of each audio segment;
and an audio playing unit 603, configured to play each audio segment according to the target volume loudness data of each audio segment.
In one possible implementation, when acquiring each audio segment of the audio to be played, the segmentation processing unit 601 is specifically configured to:
acquiring audio coding data according to the audio identification of the audio;
decoding the audio coded data to obtain audio;
and determining the playing duration of the audio, and segmenting the audio into segments of equal duration to obtain each audio segment of the audio.
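Equal-duration segmentation of decoded audio can be sketched as slicing a sample sequence into fixed-length windows. The function name and parameters are hypothetical, the samples are assumed to be a flat mono sequence, and letting the final segment be shorter than the others is a choice made for this sketch rather than something the source specifies.

```python
def split_by_duration(samples, sample_rate, segment_seconds):
    """Split decoded samples into consecutive segments of equal duration.

    The last segment may be shorter when the audio length is not an exact
    multiple of the segment duration.
    """
    if sample_rate <= 0 or segment_seconds <= 0:
        raise ValueError("sample_rate and segment_seconds must be positive")
    step = int(sample_rate * segment_seconds)
    if step == 0:
        raise ValueError("segment duration is shorter than one sample")
    return [samples[i:i + step] for i in range(0, len(samples), step)]
```

For example, ten samples at a sample rate of 4 with one-second segments give segments of 4, 4, and 2 samples.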
In one possible implementation, when acquiring each audio segment of the audio to be played, the segmentation processing unit 601 is specifically configured to:
acquiring audio coding data according to the audio identification of the audio;
determining the data volume of the audio coding data, and segmenting the audio coding data into segments of equal data volume to obtain each audio coding data segment;
and respectively decoding each audio coding data segment to obtain each audio segment of the audio.
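Segmenting by equal data volume operates on the encoded bytes before decoding. The sketch below only shows the byte-level split; the function name and `chunk_size` parameter are hypothetical, and it deliberately ignores codec frame boundaries, which a real player would most likely have to respect for each chunk to be decodable on its own.

```python
from typing import List


def split_by_size(encoded: bytes, chunk_size: int) -> List[bytes]:
    """Split encoded data into consecutive chunks of chunk_size bytes.

    The last chunk may be smaller than chunk_size.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    return [encoded[i:i + chunk_size] for i in range(0, len(encoded), chunk_size)]
```

Concatenating the returned chunks in order reproduces the original encoded data, so no bytes are lost or duplicated by the split.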
In a possible implementation manner, the volume adjustment device 600 provided in the embodiment of the present application further includes:
a loudness detection unit 604, configured to determine, based on the volume loudness data of each audio segment, that the volume loudness of each audio segment is not within the standard volume loudness range before the loudness equalization unit 602 smooths the volume loudness data of each audio segment with the standard volume loudness range as a reference.
In a possible implementation, the audio playing unit 603 is further configured to:
when the loudness detection unit 604 determines that the volume loudness of any audio segment is within the standard volume loudness range, play that audio segment based on its volume loudness data.
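The detection step amounts to a per-segment range check: segments already inside the standard range keep their measured loudness as the target, and only the others are passed to the smoothing routine. In this sketch the function name, the bounds, and the `smooth` callable are hypothetical placeholders.

```python
from typing import Callable, List


def target_loudness(
    segment_loudness: List[float],
    low: float,
    high: float,
    smooth: Callable[[float], float],
) -> List[float]:
    """Return target loudness per segment, smoothing only out-of-range ones."""
    targets = []
    for loudness in segment_loudness:
        if low <= loudness <= high:
            # In range: play the segment with its own loudness data.
            targets.append(loudness)
        else:
            # Out of range: delegate to the smoothing routine.
            targets.append(smooth(loudness))
    return targets
```

Skipping in-range segments avoids unnecessary processing and leaves audio that already sounds right untouched.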
In a possible implementation manner, when the standard volume loudness range is used as a reference, and the volume loudness data of each audio segment is respectively smoothed to obtain the target volume loudness data of each audio segment, the loudness equalization unit 602 is specifically configured to:
acquiring an initialization parameter set of a set algorithm based on an audio identifier of an audio, wherein the initialization parameter set is a set of initialization parameters which are configured for the audio and enable the average volume loudness of the audio to be within a standard volume loudness range;
and initializing the set algorithm based on the initialization parameter set, calling the initialized set algorithm, and respectively smoothing the volume loudness data of each audio segment to obtain the target volume loudness data of each audio segment.
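The "initialize, then call per segment" structure can be illustrated with a small stateful smoother. This is not the EBU R128 algorithm and not the algorithm of the application: it is a simple gain limiter chosen for illustration, and the class name and the parameter keys `target_loudness` and `max_step_db` are hypothetical. It moves the applied gain toward the value that would hit the target, but never changes the gain by more than a fixed step between consecutive segments, which is one way to avoid abrupt volume changes.

```python
class GainSmoother:
    """Illustrative smoother configured from an initialization parameter set."""

    def __init__(self, params):
        # Hypothetical parameters: the loudness to aim for and the largest
        # allowed change in gain (dB) between consecutive segments.
        self.target = params["target_loudness"]
        self.max_step = params["max_step_db"]
        self.prev_gain = 0.0

    def process(self, loudness):
        """Return the loudness of the segment after the smoothed gain."""
        desired_gain = self.target - loudness
        change = desired_gain - self.prev_gain
        # Limit how fast the gain may move from one segment to the next.
        change = max(-self.max_step, min(self.max_step, change))
        self.prev_gain += change
        return loudness + self.prev_gain
```

Usage would look like `smoother = GainSmoother({"target_loudness": -16.0, "max_step_db": 3.0})` followed by one `smoother.process(...)` call per audio segment in playback order.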
In a possible implementation manner, the volume adjustment device 600 provided in the embodiment of the present application further includes:
a data reporting unit 605, configured to report the audio identifier of the audio and the target volume loudness data of each audio segment to the server.
It should be noted that, because the principle by which the volume adjustment device 600 provided in the embodiment of the present application solves the technical problem is similar to that of the audio playing control method provided in the embodiment of the present application, the implementation of the volume adjustment device 600 can refer to the implementation of the audio playing control method, and repeated details are omitted.
After the audio playing control method and apparatus provided in the embodiment of the present application are introduced, a brief description is provided next for the audio playing control device provided in the embodiment of the present application.
Referring to fig. 7, an audio playback control apparatus 700 provided in the embodiment of the present application at least includes: a processor 71, a memory 72, and a computer program stored on the memory 72 and executable on the processor 71, wherein the processor 71 implements the audio playing control method provided by the embodiment of the present application when executing the computer program.
It should be noted that the audio playback control apparatus 700 shown in fig. 7 is only an example, and should not impose any limitation on the functions and the scope of use of the embodiments of the present application.
The audio playback control apparatus 700 provided by the embodiment of the present application may further include a bus 73 connecting the different components (including the processor 71 and the memory 72). Bus 73 represents one or more of any of several types of bus structures, including a memory bus, a peripheral bus, a local bus, and so forth.
The memory 72 may include readable media in the form of volatile memory, such as Random Access Memory (RAM) 721 and/or cache memory 722, and may further include Read-Only Memory (ROM) 723.
The memory 72 may also include a program tool 725 having a set (at least one) of program modules 724, the program modules 724 including, but not limited to: an operating system, one or more application programs, other program modules, and program data, each of which, or some combination thereof, may include an implementation of a network environment.
The audio playback control apparatus 700 may also communicate with one or more external devices 74 (e.g., a keyboard, a remote control, etc.), with one or more devices that enable a user to interact with the audio playback control apparatus 700 (e.g., a cell phone, a computer, etc.), and/or with any device that enables the audio playback control apparatus 700 to communicate with one or more other audio playback control apparatuses 700 (e.g., a router, a modem, etc.). This communication may take place via an Input/Output (I/O) interface 75. The audio playback control apparatus 700 may also communicate with one or more networks (e.g., a Local Area Network (LAN), a Wide Area Network (WAN), and/or a public network such as the Internet) via the network adapter 76. As shown in fig. 7, the network adapter 76 communicates with the other modules of the audio playback control apparatus 700 via the bus 73. It should be understood that, although not shown in fig. 7, other hardware and/or software modules may be used in conjunction with the audio playback control apparatus 700, including but not limited to: microcode, device drivers, redundant processors, external disk drive arrays, Redundant Array of Independent Disks (RAID) subsystems, tape drives, and data backup storage subsystems.
The following describes a computer-readable storage medium provided by embodiments of the present application. The embodiment of the application provides a computer-readable storage medium, which stores computer instructions, and the computer instructions, when executed by a processor, implement the audio playing control method provided by the embodiment of the application. Specifically, the executable program may be built in the audio playback control apparatus 700, so that the audio playback control apparatus 700 may implement the audio playback control method provided in the embodiment of the present application by executing the built-in executable program.
In addition, the audio playing control method provided by the embodiment of the present application can also be implemented as a program product that includes program code; when the program product is run on the audio playback control apparatus 700, the program code causes the audio playback control apparatus 700 to execute the audio playing control method provided by the embodiment of the present application.
The program product provided by the embodiments of the present application may be any combination of one or more readable media, where the readable media may be a readable signal medium or a readable storage medium, and the readable storage medium may be, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination thereof, and in particular, more specific examples (a non-exhaustive list) of the readable storage medium include: an electrical connection having one or more wires, a portable disk, a hard disk, a RAM, a ROM, an Erasable Programmable Read-Only Memory (EPROM), an optical fiber, a portable Compact disk Read-Only Memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
The program product provided by the embodiment of the present application may take the form of a CD-ROM that contains program code and may run on a computing device. However, the program product provided by the embodiments of the present application is not limited thereto; in the embodiments of the present application, the readable storage medium may be any tangible medium that can contain or store a program for use by or in connection with an instruction execution system, apparatus, or device.
It should be noted that although several units or sub-units of the apparatus are mentioned in the above detailed description, such division is merely exemplary and not mandatory. Indeed, according to embodiments of the present application, the features and functions of two or more units described above may be embodied in one unit. Conversely, the features and functions of one unit described above may be further divided among and embodied by a plurality of units.
Further, while the operations of the methods of the present application are depicted in the drawings in a particular order, this does not require or imply that these operations must be performed in this particular order, or that all of the illustrated operations must be performed, to achieve desirable results. Additionally or alternatively, certain steps may be omitted, multiple steps combined into one step execution, and/or one step broken down into multiple step executions.
While the preferred embodiments of the present application have been described, additional variations and modifications in those embodiments may occur to those skilled in the art once they learn of the basic inventive concepts. Therefore, it is intended that the appended claims be interpreted as including preferred embodiments and all alterations and modifications as fall within the scope of the application.
It will be apparent to those skilled in the art that various changes and modifications may be made in the embodiments of the present application without departing from the spirit and scope of the embodiments of the present application. Thus, if such modifications and variations of the embodiments of the present application fall within the scope of the claims of the present application and their equivalents, the present application is also intended to encompass such modifications and variations.

Claims (10)

1. An audio playback control method, comprising:
acquiring each audio segment of audio to be played;
acquiring the volume loudness data of each audio segment, and respectively smoothing the volume loudness data of each audio segment by taking a standard volume loudness range as a reference to obtain target volume loudness data of each audio segment;
and playing each audio segment according to the target volume loudness data of each audio segment.
2. The audio playback control method of claim 1, wherein obtaining each audio segment of the audio to be played comprises:
acquiring audio coding data according to the audio identifier of the audio;
decoding the audio coded data to obtain the audio;
and determining the playing duration of the audio, and segmenting the audio into segments of equal duration to obtain each audio segment of the audio.
3. The audio playback control method of claim 1, wherein obtaining each audio segment of the audio to be played comprises:
acquiring audio coding data according to the audio identifier of the audio;
determining the data volume of the audio coding data, and segmenting the audio coding data into segments of equal data volume to obtain each audio coding data segment;
and respectively decoding each audio coding data segment to obtain each audio segment of the audio.
4. The audio playback control method as claimed in claim 1, 2 or 3, wherein before the smoothing processing is performed on the volume loudness data of each audio segment with reference to a standard volume loudness range, the method further comprises:
and determining that the volume loudness of each audio segment is not in the standard volume loudness range based on the volume loudness data of each audio segment.
5. The audio playback control method of claim 4, further comprising:
and when the volume loudness of any audio segment is determined to be in the standard volume loudness range, playing the audio segment based on the volume loudness data of the audio segment.
6. The audio playing control method of claim 1, wherein the step of smoothing the volume loudness data of each audio segment with reference to a standard volume loudness range to obtain the target volume loudness data of each audio segment comprises:
acquiring an initialization parameter set of a set algorithm based on the audio identifier of the audio, wherein the initialization parameter set is a set of initialization parameters which are configured for the audio and enable the average volume loudness of the audio to be within the standard volume loudness range;
and initializing the setting algorithm based on the initialization parameter set, calling the initialized setting algorithm, and respectively smoothing the volume loudness data of each audio segment to obtain the target volume loudness data of each audio segment.
7. The audio playback control method as claimed in claim 1, 2, 3 or 6, further comprising:
and reporting the audio identifier of the audio and the target volume loudness data of each audio segment to a server.
8. An audio playback control apparatus, comprising:
the segmentation processing unit is used for acquiring each audio segment of the audio to be played;
the loudness equalization unit is used for acquiring the volume loudness data of each audio segment, and smoothing the volume loudness data of each audio segment by taking a standard volume loudness range as a reference to obtain target volume loudness data of each audio segment;
and the audio playing unit is used for playing each audio segment according to the target volume loudness data of each audio segment.
9. An audio playback control apparatus, comprising: memory, processor and computer program stored on the memory and executable on the processor, characterized in that the processor implements the audio playback control method according to any of claims 1 to 7 when executing the computer program.
10. A computer-readable storage medium storing computer instructions which, when executed by a processor, implement the audio playback control method according to any one of claims 1 to 7.
CN201910641673.8A 2019-07-16 2019-07-16 Audio playing control method, device, equipment and medium Pending CN112243151A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910641673.8A CN112243151A (en) 2019-07-16 2019-07-16 Audio playing control method, device, equipment and medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910641673.8A CN112243151A (en) 2019-07-16 2019-07-16 Audio playing control method, device, equipment and medium

Publications (1)

Publication Number Publication Date
CN112243151A true CN112243151A (en) 2021-01-19

Family

ID=74167263

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910641673.8A Pending CN112243151A (en) 2019-07-16 2019-07-16 Audio playing control method, device, equipment and medium

Country Status (1)

Country Link
CN (1) CN112243151A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114449413A (en) * 2022-02-16 2022-05-06 深圳万兴软件有限公司 Method, device and equipment for controlling loudness of audio signal and storage medium
CN114449413B (en) * 2022-02-16 2023-12-22 深圳万兴软件有限公司 Method, device, equipment and storage medium for controlling loudness of audio signal

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20210119