CN110620793A - Method, device and medium for improving audio quality - Google Patents

Method, device and medium for improving audio quality Download PDF

Info

Publication number
CN110620793A
CN110620793A CN201911049703.2A CN201911049703A CN110620793A CN 110620793 A CN110620793 A CN 110620793A CN 201911049703 A CN201911049703 A CN 201911049703A CN 110620793 A CN110620793 A CN 110620793A
Authority
CN
China
Prior art keywords
ssd cache
time delay
cache size
jitter
threshold value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201911049703.2A
Other languages
Chinese (zh)
Other versions
CN110620793B (en
Inventor
张新秀
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Wave Intelligent Technology Co Ltd
Original Assignee
Suzhou Wave Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Wave Intelligent Technology Co Ltd filed Critical Suzhou Wave Intelligent Technology Co Ltd
Priority to CN201911049703.2A priority Critical patent/CN110620793B/en
Publication of CN110620793A publication Critical patent/CN110620793A/en
Application granted granted Critical
Publication of CN110620793B publication Critical patent/CN110620793B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/75Media network packet handling
    • H04L65/764Media network packet handling at the destination 
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems
    • H04N7/155Conference systems involving storage of or access to video conference sessions

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention discloses a method, equipment and a readable storage medium for improving audio quality, wherein the method comprises the following steps: receiving audio data and storing the audio data into a CPU (central processing unit) for caching; judging whether the CPU cache has a residual space; responding to the CPU cache that no residual space exists, storing the audio data into the SSD cache; judging whether the total voice flow stored in the SSD cache is smaller than a first threshold value of the network link for transmitting voice; responding to the fact that the total voice flow stored in the SSD cache is smaller than a first threshold value of the network link for transmitting voice, and judging whether the time delay is smaller than a second threshold value; and in response to the latency being not less than the second threshold, reducing the SSD cache size to reduce the latency. The method, the equipment and the medium for improving the audio quality design a multi-level buffer, dynamically adjust the size of a secondary buffer according to the voice time delay, the flow of uplink and downlink voice and the compression condition of the voice, ensure the best overall performance of voice jitter quality and voice time delay, and further improve the audio quality to the maximum extent.

Description

Method, device and medium for improving audio quality
Technical Field
The present invention relates to the field of signal transmission, and more particularly, to a method, device and readable medium for improving audio quality.
Background
In a video conferencing system, audio quality is very critical. There is a possibility of delay in the transmission of voice data in a network environment, and the delay of the network may change at different time intervals, and such changes in the network delay may cause jitter, which may cause distortion and discontinuity in audio quality. Therefore, the video conference system needs to perform anti-jitter processing of audio. The traditional audio anti-jitter technology is to establish a buffer area for storing data with a sufficient time, so that the slowest data packet can arrive in time before audio restoration, thereby eliminating the adverse effect of delay. Therefore, although the conventional voice anti-jitter method solves the problem of audio jitter elimination, a fixed processing delay time is increased, and the delay time cannot be adjusted along with the jitter change of the data of the network.
Disclosure of Invention
In view of this, embodiments of the present invention provide a method, an apparatus, and a medium for improving audio quality, which design a multi-level buffer, and dynamically adjust the size of a secondary buffer according to a voice delay, a flow of uplink and downlink voice, and a compression condition of the voice, so as to ensure that the voice jitter quality and the overall performance of the voice delay are the best, thereby improving the audio quality to the greatest extent.
Based on the above object, an aspect of the embodiments of the present invention provides a method for improving audio quality, including the following steps: receiving audio data and storing the audio data into a CPU (central processing unit) for caching; judging whether the CPU cache has a residual space or not; in response to the CPU cache not having the remaining space, storing audio data in an SSD cache; judging whether the total voice flow stored in the SSD cache is smaller than a first threshold value of the network link for transmitting voice; responding to the fact that the total voice flow stored in the SSD cache is smaller than a first threshold value of voice transmission of a network link, and judging whether time delay is smaller than a second threshold value; and in response to the latency being not less than a second threshold, reducing the SSD cache size to reduce the latency.
In some embodiments, further comprising: responding to the fact that the total voice flow stored in the SSD cache is not smaller than a first threshold value of network link voice transmission, and judging whether time delay is smaller than a second threshold value or not; and in response to the latency being less than a second threshold, increasing the SSD cache size to reduce jitter.
In some embodiments, further comprising: and responding to the time delay not less than a second threshold value, adjusting the SSD cache size, and determining the optimal SSD cache size based on the time delay and the jitter corresponding to each SSD cache size.
In some embodiments, determining the optimal SSD cache size based on the latency and jitter for each SSD cache size comprises: and respectively distributing a first weight and a second weight to the time delay and the jitter, calculating the numerical value of the audio quality corresponding to each SSD cache size based on the first weight and the second weight, and determining the SSD cache size corresponding to the minimum numerical value as the optimal SSD cache size.
In some embodiments, the calculating the value of the audio quality corresponding to each SSD cache size based on the first weight and the second weight includes: respectively taking the time delay and the jitter corresponding to the SSD cache size before adjustment as an initial time delay and an initial jitter; subtracting the initial time delay from the time delay corresponding to each adjusted SSD cache size to obtain a time delay component, and subtracting the initial jitter from the jitter corresponding to each adjusted SSD cache size to obtain a jitter component; and adding the product of the first weight and the delay component to the product of the second weight and the jitter component to obtain a value of the audio quality.
In another aspect of the embodiments of the present invention, there is also provided a computer device, including: at least one processor; and a memory storing computer instructions executable on the processor, the instructions being executable by the processor to perform the steps of: receiving audio data and storing the audio data into a CPU (central processing unit) for caching; judging whether the CPU cache has a residual space or not; in response to the CPU cache not having the remaining space, storing audio data in an SSD cache; judging whether the total voice flow stored in the SSD cache is smaller than a first threshold value of the network link for transmitting voice; responding to the fact that the total voice flow stored in the SSD cache is smaller than a first threshold value of voice transmission of a network link, and judging whether time delay is smaller than a second threshold value; and in response to the latency being not less than a second threshold, reducing the SSD cache size to reduce the latency.
In some embodiments, the steps further comprise: responding to the fact that the total voice flow stored in the SSD cache is not smaller than a first threshold value of network link voice transmission, and judging whether time delay is smaller than a second threshold value or not; and in response to the latency being less than a second threshold, increasing the SSD cache size to reduce jitter.
In some embodiments, the steps further comprise: and responding to the time delay not less than a second threshold value, adjusting the SSD cache size, and determining the optimal SSD cache size based on the time delay and the jitter corresponding to each SSD cache size.
In some embodiments, determining the optimal SSD cache size based on the latency and jitter for each SSD cache size comprises: and respectively distributing a first weight and a second weight to the time delay and the jitter, calculating the numerical value of the audio quality corresponding to each SSD cache size based on the first weight and the second weight, and determining the SSD cache size corresponding to the minimum numerical value as the optimal SSD cache size.
In a further aspect of the embodiments of the present invention, a computer-readable storage medium is also provided, in which a computer program for implementing the above method steps is stored when the computer program is executed by a processor.
The invention has the following beneficial technical effects: by designing the multi-level cache and dynamically adjusting the size of the secondary cache according to the voice time delay, the flow of uplink and downlink voice and the compression condition of the voice, the voice jitter quality and the voice time delay overall performance are ensured to be the best, and the audio quality is improved to the greatest extent.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other embodiments can be obtained by using the drawings without creative efforts.
Fig. 1 is a schematic diagram of an embodiment of a method for improving audio quality according to the present invention;
FIG. 2 is a flow chart of an embodiment of a method for improving audio quality provided by the present invention;
fig. 3 is a schematic hardware structure diagram of an embodiment of the method for improving audio quality according to the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the following embodiments of the present invention are described in further detail with reference to the accompanying drawings.
It should be noted that all expressions using "first" and "second" in the embodiments of the present invention are used for distinguishing two entities with the same name but different names or different parameters, and it should be noted that "first" and "second" are merely for convenience of description and should not be construed as limitations of the embodiments of the present invention, and they are not described in any more detail in the following embodiments.
In view of the above object, a first aspect of the embodiments of the present invention proposes an embodiment of a method for improving audio quality. Fig. 1 is a schematic diagram illustrating an embodiment of the method for improving audio quality provided by the present invention. As shown in fig. 1, the embodiment of the present invention includes the following steps:
s1, receiving audio data and storing the audio data into a CPU cache;
s2, judging whether the CPU cache has a residual space;
s3, responding to the CPU cache that no residual space exists, storing the audio data into the SSD cache;
s4, judging whether the total voice flow stored in the SSD buffer is smaller than a first threshold value of the network link for transmitting voice;
s5, responding to the fact that the total voice flow stored in the SSD cache is smaller than a first threshold value of the network link for transmitting voice, and judging whether the time delay is smaller than a second threshold value; and
and S6, responding to the time delay not less than the second threshold value, reducing the SSD cache size to reduce the time delay.
In this embodiment, a malllanox low-latency network card may be selected, and an LSI RAID card supporting the CacheCade technology and other hardware may be selected, thereby further reducing latency. The cachede technology of LSI RAID cards is employed to configure SSD storage as a cache for traditional mechanical storage.
And receiving audio data and storing the audio data in a CPU cache. The CPU cache is used as a first-level cache, and when audio data are received, the audio data are firstly transmitted to the CPU cache.
And judging whether the CPU cache has a residual space, and storing the audio data into the SSD cache in response to the CPU cache having no residual space. If the CPU cache space is full, the data can be transmitted to the second-level cache SSD cache.
Judging whether the total voice flow stored in the SSD cache is smaller than a first threshold value of the network link for transmitting voice, responding to the fact that the total voice flow stored in the SSD cache is smaller than the first threshold value of the network link for transmitting voice, judging whether the time delay is smaller than a second threshold value, responding to the fact that the time delay is not smaller than the second threshold value, and reducing the SSD cache size to reduce the time delay. Assuming that a threshold value of a network link for transmitting voice is P, considering network instability, a first threshold value may be set to 2P/3, counting total uplink and downlink voice flow as F, a voice compression ratio as c, and a time delay as T, the total voice flow stored in the SSD buffer is F/c, if F/c <2P/3, determining whether the time delay is less than a second threshold value, which may be 300ms, and when T >300ms, reducing the SSD buffer to reduce the time delay until T <300 ms.
In some embodiments, further comprising: responding to the fact that the total voice flow stored in the SSD cache is not smaller than a first threshold value of network link voice transmission, and judging whether time delay is smaller than a second threshold value or not; and in response to the latency being less than a second threshold, increasing the SSD cache size to reduce jitter.
In some embodiments, further comprising: and responding to the time delay not less than a second threshold value, adjusting the SSD cache size, and determining the optimal SSD cache size based on the time delay and the jitter corresponding to each SSD cache size.
In some embodiments, determining the optimal SSD cache size based on the latency and jitter for each SSD cache size comprises: and respectively distributing a first weight and a second weight to the time delay and the jitter, calculating the numerical value of the audio quality corresponding to each SSD cache size based on the first weight and the second weight, and determining the SSD cache size corresponding to the minimum numerical value as the optimal SSD cache size. The sum of the first weight and the second weight is 1, and specific values of the first weight and the second weight can be set according to experience or big data, and in the embodiment, the first weight and the second weight are both set to be 50%.
In some embodiments, the calculating the value of the audio quality corresponding to each SSD cache size based on the first weight and the second weight includes: respectively taking the time delay and the jitter corresponding to the SSD cache size before adjustment as an initial time delay and an initial jitter; subtracting the initial time delay from the time delay corresponding to each adjusted SSD cache size to obtain a time delay component, and subtracting the initial jitter from the jitter corresponding to each adjusted SSD cache size to obtain a jitter component; and adding the product of the first weight and the delay component to the product of the second weight and the jitter component to obtain a value of the audio quality. For example, the SSD buffer size before adjustment is 2G, the initial latency is 500ms, the initial jitter is quantized to a specific value, for example, 500, the SSD buffer size after adjustment may be 1G, 1.5G, 2.5G, 3G, and the like, and may be increased or decreased equidistantly, the latency corresponding to 1G of the SSD buffer size is 450ms, the jitter value is 550, the latency component is-50, the jitter component is 50, and the audio quality corresponding to 1GSSD buffer size is 50% (-50) + 50% ("0); a SSD buffer size of 1.5G corresponds to a delay of 480ms, a jitter value of 510, and an audio quality value of 50% (-20) + 50% × 10 ═ -5, and since-5 is less than 0, the performance of the SSD buffer size of 1.5G is better than that of 1G.
Fig. 2 is a flow chart of an embodiment of the method for improving audio quality provided by the present invention. As shown in fig. 2, beginning at block 101 and proceeding to block 102, audio data is received and stored in a CPU buffer; then, the process proceeds to a block 103, whether the CPU cache has a residual space is judged, if yes, the process proceeds to a block 104, and if not, the process returns to the block 103; block 104, storing the audio data in an SSD cache; then, the flow proceeds to a block 105, and whether the total voice flow stored in the SSD cache is smaller than a first threshold for transmitting voice over the network link is determined, if yes, the flow proceeds to a block 106, and if not, the flow is directly ended; block 106, determine whether the latency is less than the second threshold, if yes, end directly, if no, proceed to block 107, reduce SSD cache size to reduce latency, then proceed to block 108 and end.
It should be particularly noted that the steps in the embodiments of the method for improving audio quality may be mutually intersected, replaced, added, or deleted, and therefore, these reasonable permutation and combination transformations should also fall within the scope of the present invention, and should not limit the scope of the present invention to the embodiments.
In view of the above object, a second aspect of the embodiments of the present invention provides a computer device, including: at least one processor; and a memory storing computer instructions executable on the processor, the instructions being executable by the processor to perform the steps of: s1, receiving audio data and storing the audio data into a CPU cache; s2, judging whether the CPU cache has a residual space; s3, responding to the CPU cache that no residual space exists, storing the audio data into the SSD cache; s4, judging whether the total voice flow stored in the SSD buffer is smaller than a first threshold value of the network link for transmitting voice; s5, responding to the fact that the total voice flow stored in the SSD cache is smaller than a first threshold value of the network link for transmitting voice, and judging whether the time delay is smaller than a second threshold value; and S6, in response to the latency not being less than the second threshold, reducing the SSD cache size to reduce the latency.
In some embodiments, further comprising: responding to the fact that the total voice flow stored in the SSD cache is not smaller than a first threshold value of network link voice transmission, and judging whether time delay is smaller than a second threshold value or not; and in response to the latency being less than a second threshold, increasing the SSD cache size to reduce jitter.
In some embodiments, further comprising: and responding to the time delay not less than a second threshold value, adjusting the SSD cache size, and determining the optimal SSD cache size based on the time delay and the jitter corresponding to each SSD cache size.
In some embodiments, determining the optimal SSD cache size based on the latency and jitter for each SSD cache size comprises: and respectively distributing a first weight and a second weight to the time delay and the jitter, calculating the numerical value of the audio quality corresponding to each SSD cache size based on the first weight and the second weight, and determining the SSD cache size corresponding to the minimum numerical value as the optimal SSD cache size.
In some embodiments, the calculating the value of the audio quality corresponding to each SSD cache size based on the first weight and the second weight includes: respectively taking the time delay and the jitter corresponding to the SSD cache size before adjustment as an initial time delay and an initial jitter; subtracting the initial time delay from the time delay corresponding to each adjusted SSD cache size to obtain a time delay component, and subtracting the initial jitter from the jitter corresponding to each adjusted SSD cache size to obtain a jitter component; and adding the product of the first weight and the delay component to the product of the second weight and the jitter component to obtain a value of the audio quality.
Fig. 3 is a schematic hardware structure diagram of an embodiment of the method for improving audio quality according to the present invention.
Taking the apparatus shown in fig. 3 as an example, the apparatus includes a processor 301 and a memory 302, and may further include: an input device 303 and an output device 304.
The processor 301, the memory 302, the input device 303 and the output device 304 may be connected by a bus or other means, and fig. 3 illustrates the connection by a bus as an example.
The memory 302, which is a non-volatile computer-readable storage medium, may be used to store non-volatile software programs, non-volatile computer-executable programs, and modules, such as program instructions/modules corresponding to the method for improving audio quality in the embodiments of the present application. The processor 301 executes various functional applications of the server and data processing by running the nonvolatile software programs, instructions and modules stored in the memory 302, that is, implements the method for improving audio quality of the above-described method embodiment.
The memory 302 may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required for at least one function; the storage data area may store data created according to the use of the method of improving audio quality, and the like. Further, the memory 302 may include high speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid state storage device. In some embodiments, memory 302 optionally includes memory located remotely from processor 301, which may be connected to a local module via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The input device 303 may receive information such as a user name and a password that are input. The output means 304 may comprise a display device such as a display screen.
Program instructions/modules corresponding to one or more methods of improving audio quality are stored in the memory 302 and, when executed by the processor 301, perform the methods of improving audio quality in any of the method embodiments described above.
Any embodiment of a computer device implementing the method for improving audio quality as described above may achieve the same or similar effects as any of the preceding method embodiments corresponding thereto.
The invention also provides a computer readable storage medium storing a computer program which, when executed by a processor, performs the method as above.
Finally, it should be noted that, as one of ordinary skill in the art can appreciate that all or part of the processes of the methods of the above embodiments can be implemented by a computer program to instruct related hardware, and the program of the method for improving audio quality can be stored in a computer readable storage medium, and when executed, the program can include the processes of the embodiments of the methods as described above. The storage medium of the program may be a magnetic disk, an optical disk, a Read Only Memory (ROM), a Random Access Memory (RAM), or the like. The embodiments of the computer program may achieve the same or similar effects as any of the above-described method embodiments.
Furthermore, the methods disclosed according to embodiments of the present invention may also be implemented as a computer program executed by a processor, which may be stored in a computer-readable storage medium. Which when executed by a processor performs the above-described functions defined in the methods disclosed in embodiments of the invention.
Further, the above method steps and system elements may also be implemented using a controller and a computer readable storage medium for storing a computer program for causing the controller to implement the functions of the above steps or elements.
Further, it should be appreciated that the computer-readable storage media (e.g., memory) herein can be either volatile memory or nonvolatile memory, or can include both volatile and nonvolatile memory. By way of example, and not limitation, nonvolatile memory can include Read Only Memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM), which can act as external cache memory. By way of example and not limitation, RAM is available in a variety of forms such as synchronous RAM (DRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Data Rate SDRAM (DDRSDRAM), Enhanced SDRAM (ESDRAM), Synchronous Link DRAM (SLDRAM), and Direct Rambus RAM (DRRAM). The storage devices of the disclosed aspects are intended to comprise, without being limited to, these and other suitable types of memory.
Those of skill would further appreciate that the various illustrative logical blocks, modules, circuits, and algorithm steps described in connection with the disclosure herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as software or hardware depends upon the particular application and design constraints imposed on the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the disclosed embodiments of the present invention.
The various illustrative logical blocks, modules, and circuits described in connection with the disclosure herein may be implemented or performed with the following components designed to perform the functions herein: a general purpose processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination of these components. A general purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a DSP, and/or any other such configuration.
The steps of a method or algorithm described in connection with the disclosure herein may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. A software module may reside in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art. An exemplary storage medium is coupled to the processor such the processor can read information from, and write information to, the storage medium. In the alternative, the storage medium may be integral to the processor. The processor and the storage medium may reside in an ASIC. The ASIC may reside in a user terminal. In the alternative, the processor and the storage medium may reside as discrete components in a user terminal.
In one or more exemplary designs, the functions may be implemented in hardware, software, firmware, or any combination thereof. If implemented in software, the functions may be stored on or transmitted over as one or more instructions or code on a computer-readable medium. Computer-readable media includes both computer storage media and communication media including any medium that facilitates transfer of a computer program from one place to another. A storage media may be any available media that can be accessed by a general purpose or special purpose computer. By way of example, and not limitation, such computer-readable media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a general-purpose or special-purpose computer, or a general-purpose or special-purpose processor. Also, any connection is properly termed a computer-readable medium. For example, if the software is transmitted from a website, server, or other remote source using a coaxial cable, fiber optic cable, twisted pair, Digital Subscriber Line (DSL), or wireless technologies such as infrared, radio, and microwave, then the coaxial cable, fiber optic cable, twisted pair, DSL, or wireless technologies such as infrared, radio, and microwave are included in the definition of medium. Disk and disc, as used herein, includes Compact Disc (CD), laser disc, optical disc, Digital Versatile Disc (DVD), floppy disk, blu-ray disc where disks usually reproduce data magnetically, while discs reproduce data optically with lasers. Combinations of the above should also be included within the scope of computer-readable media.
The foregoing is an exemplary embodiment of the present disclosure, but it should be noted that various changes and modifications could be made herein without departing from the scope of the present disclosure as defined by the appended claims. The functions, steps and/or actions of the method claims in accordance with the disclosed embodiments described herein need not be performed in any particular order. Furthermore, although elements of the disclosed embodiments of the invention may be described or claimed in the singular, the plural is contemplated unless limitation to the singular is explicitly stated.
It should be understood that, as used herein, the singular forms "a", "an" and "the" are intended to include the plural forms as well, unless the context clearly supports the exception. It should also be understood that "and/or" as used herein is meant to include any and all possible combinations of one or more of the associated listed items.
The numbers of the embodiments disclosed in the embodiments of the present invention are merely for description, and do not represent the merits of the embodiments.
It will be understood by those skilled in the art that all or part of the steps for implementing the above embodiments may be implemented by hardware, or may be implemented by a program instructing relevant hardware, and the program may be stored in a computer-readable storage medium, and the above-mentioned storage medium may be a read-only memory, a magnetic disk or an optical disk, etc.
Those of ordinary skill in the art will understand that: the discussion of any embodiment above is meant to be exemplary only, and is not intended to intimate that the scope of the disclosure, including the claims, of embodiments of the invention is limited to these examples; within the idea of an embodiment of the invention, also technical features in the above embodiment or in different embodiments may be combined and there are many other variations of the different aspects of the embodiments of the invention as described above, which are not provided in detail for the sake of brevity. Therefore, any omissions, modifications, substitutions, improvements, and the like that may be made without departing from the spirit and principles of the embodiments of the present invention are intended to be included within the scope of the embodiments of the present invention.

Claims (10)

1. A method for improving audio quality, comprising the steps of:
receiving audio data and storing the audio data into a CPU (central processing unit) for caching;
judging whether the CPU cache has a residual space or not;
in response to the CPU cache not having the remaining space, storing audio data in an SSD cache;
judging whether the total voice flow stored in the SSD cache is smaller than a first threshold value of the network link for transmitting voice;
responding to the fact that the total voice flow stored in the SSD cache is smaller than a first threshold value of voice transmission of a network link, and judging whether time delay is smaller than a second threshold value; and
in response to the latency being not less than a second threshold, reducing the SSD cache size to reduce latency.
2. The method of claim 1, further comprising:
responding to the fact that the total voice flow stored in the SSD cache is not smaller than a first threshold value of network link voice transmission, and judging whether time delay is smaller than a second threshold value or not; and
in response to the latency being less than a second threshold, increasing the SSD cache size to reduce jitter.
3. The method of claim 2, further comprising:
and responding to the time delay not less than a second threshold value, adjusting the SSD cache size, and determining the optimal SSD cache size based on the time delay and the jitter corresponding to each SSD cache size.
4. The method of claim 3, wherein determining the optimal SSD cache size based on latency and jitter for each SSD cache size comprises:
and respectively distributing a first weight and a second weight to the time delay and the jitter, calculating the numerical value of the audio quality corresponding to each SSD cache size based on the first weight and the second weight, and determining the SSD cache size corresponding to the minimum numerical value as the optimal SSD cache size.
5. The method of claim 4, wherein calculating the value of the audio quality for each SSD cache size based on the first weight and the second weight comprises:
respectively taking the time delay and the jitter corresponding to the SSD cache size before adjustment as an initial time delay and an initial jitter;
subtracting the initial time delay from the time delay corresponding to each adjusted SSD cache size to obtain a time delay component, and subtracting the initial jitter from the jitter corresponding to each adjusted SSD cache size to obtain a jitter component; and
the product of the first weight and the delay component is added to the product of the second weight and the jitter component to obtain a value for the audio quality.
6. A computer device, comprising:
at least one processor; and
a memory storing computer instructions executable on the processor, the instructions when executed by the processor implementing the steps of:
receiving audio data and storing the audio data into a CPU (central processing unit) for caching;
judging whether the CPU cache has a residual space or not;
in response to the CPU cache not having the remaining space, storing audio data in an SSD cache;
judging whether the total voice flow stored in the SSD cache is smaller than a first threshold value of the network link for transmitting voice;
responding to the fact that the total voice flow stored in the SSD cache is smaller than a first threshold value of voice transmission of a network link, and judging whether time delay is smaller than a second threshold value; and
in response to the latency being not less than a second threshold, reducing the SSD cache size to reduce latency.
7. The computer device of claim 6, wherein the steps further comprise:
responding to the fact that the total voice flow stored in the SSD cache is not smaller than a first threshold value of network link voice transmission, and judging whether time delay is smaller than a second threshold value or not; and
in response to the latency being less than a second threshold, increasing the SSD cache size to reduce jitter.
8. The computer device of claim 7, wherein the steps further comprise:
and responding to the time delay not less than a second threshold value, adjusting the SSD cache size, and determining the optimal SSD cache size based on the time delay and the jitter corresponding to each SSD cache size.
9. The computer device of claim 8, wherein determining an optimal SSD cache size based on latency and jitter for each SSD cache size comprises:
and respectively distributing a first weight and a second weight to the time delay and the jitter, calculating the numerical value of the audio quality corresponding to each SSD cache size based on the first weight and the second weight, and determining the SSD cache size corresponding to the minimum numerical value as the optimal SSD cache size.
10. A computer-readable storage medium, in which a computer program is stored which, when being executed by a processor, carries out the steps of the method according to any one of claims 1 to 5.
CN201911049703.2A 2019-10-31 2019-10-31 Method, device and medium for improving audio quality Active CN110620793B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911049703.2A CN110620793B (en) 2019-10-31 2019-10-31 Method, device and medium for improving audio quality

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911049703.2A CN110620793B (en) 2019-10-31 2019-10-31 Method, device and medium for improving audio quality

Publications (2)

Publication Number Publication Date
CN110620793A true CN110620793A (en) 2019-12-27
CN110620793B CN110620793B (en) 2022-03-15

Family

ID=68927154

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911049703.2A Active CN110620793B (en) 2019-10-31 2019-10-31 Method, device and medium for improving audio quality

Country Status (1)

Country Link
CN (1) CN110620793B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111402914A (en) * 2020-03-11 2020-07-10 广州虎牙科技有限公司 Noise elimination method, device, electronic equipment and storage medium
CN111402958A (en) * 2020-03-13 2020-07-10 苏州浪潮智能科技有限公司 Method, system, equipment and medium for establishing gene comparison table
CN112350958A (en) * 2020-10-30 2021-02-09 深圳Tcl新技术有限公司 Voice data dynamic uploading method, Internet of things equipment and storage medium
CN112468841A (en) * 2020-11-26 2021-03-09 Oppo广东移动通信有限公司 Audio transmission method and device, intelligent equipment and computer readable storage medium
CN112822502A (en) * 2020-12-28 2021-05-18 阿里巴巴(中国)有限公司 Live broadcast jitter removal intelligent caching and live broadcast method, equipment and storage medium
CN113382081A (en) * 2021-06-28 2021-09-10 阿波罗智联(北京)科技有限公司 Time delay estimation adjusting method, device, equipment and storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060092918A1 (en) * 2004-11-04 2006-05-04 Alexander Talalai Audio receiver having adaptive buffer delay
CN102325261A (en) * 2011-09-14 2012-01-18 上海交通大学 Synchronization method for eliminating jitter of inter-video video data of stereo video collecting and synthetizing system
CN102387409A (en) * 2011-12-07 2012-03-21 江苏远为科技有限公司 Method for video playing and based on stream media system
CN102571561A (en) * 2011-12-16 2012-07-11 瑞斯康达科技发展股份有限公司 PWE3 device and method for reading and writing data in jitter buffer of device
CN103402136A (en) * 2013-07-29 2013-11-20 重庆大学 Self-adaptive cache adjustment control method and device and self-adaptive player
CN103533451A (en) * 2013-09-30 2014-01-22 广州华多网络科技有限公司 Method and system for regulating jitter buffer
CN103685070A (en) * 2013-12-18 2014-03-26 广州华多网络科技有限公司 Method and device for adjusting jitter buffer

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060092918A1 (en) * 2004-11-04 2006-05-04 Alexander Talalai Audio receiver having adaptive buffer delay
CN102325261A (en) * 2011-09-14 2012-01-18 上海交通大学 Synchronization method for eliminating jitter of inter-video video data of stereo video collecting and synthetizing system
CN102387409A (en) * 2011-12-07 2012-03-21 江苏远为科技有限公司 Method for video playing and based on stream media system
CN102571561A (en) * 2011-12-16 2012-07-11 瑞斯康达科技发展股份有限公司 PWE3 device and method for reading and writing data in jitter buffer of device
CN103402136A (en) * 2013-07-29 2013-11-20 重庆大学 Self-adaptive cache adjustment control method and device and self-adaptive player
CN103533451A (en) * 2013-09-30 2014-01-22 广州华多网络科技有限公司 Method and system for regulating jitter buffer
CN103685070A (en) * 2013-12-18 2014-03-26 广州华多网络科技有限公司 Method and device for adjusting jitter buffer

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111402914A (en) * 2020-03-11 2020-07-10 广州虎牙科技有限公司 Noise elimination method, device, electronic equipment and storage medium
CN111402958A (en) * 2020-03-13 2020-07-10 苏州浪潮智能科技有限公司 Method, system, equipment and medium for establishing gene comparison table
CN111402958B (en) * 2020-03-13 2022-05-17 苏州浪潮智能科技有限公司 Method, system, equipment and medium for establishing gene comparison table
CN112350958A (en) * 2020-10-30 2021-02-09 深圳Tcl新技术有限公司 Voice data dynamic uploading method, Internet of things equipment and storage medium
CN112350958B (en) * 2020-10-30 2024-01-23 深圳Tcl新技术有限公司 Voice data dynamic uploading method, internet of things equipment and storage medium
CN112468841A (en) * 2020-11-26 2021-03-09 Oppo广东移动通信有限公司 Audio transmission method and device, intelligent equipment and computer readable storage medium
CN112822502A (en) * 2020-12-28 2021-05-18 阿里巴巴(中国)有限公司 Live broadcast jitter removal intelligent caching and live broadcast method, equipment and storage medium
CN112822502B (en) * 2020-12-28 2022-06-07 阿里巴巴(中国)有限公司 Live broadcast jitter removal intelligent caching and live broadcast method, equipment and storage medium
CN113382081A (en) * 2021-06-28 2021-09-10 阿波罗智联(北京)科技有限公司 Time delay estimation adjusting method, device, equipment and storage medium
CN113382081B (en) * 2021-06-28 2023-04-07 阿波罗智联(北京)科技有限公司 Time delay estimation adjusting method, device, equipment and storage medium

Also Published As

Publication number Publication date
CN110620793B (en) 2022-03-15

Similar Documents

Publication Publication Date Title
CN110620793B (en) Method, device and medium for improving audio quality
CN111107017A (en) Method, equipment and storage medium for processing switch message congestion
CN110781129B (en) Resource scheduling method, device and medium in FPGA heterogeneous accelerator card cluster
CN111708484A (en) Method, system, device and medium for controlling data brushing speed
CN111708482B (en) Method, system, device and medium for limiting bandwidth in distributed storage
CN111737616B (en) Page caching method, device, equipment and medium based on B/S architecture
CN111708486B (en) Method, system, equipment and medium for balanced optimization of main placement group
US20150244827A1 (en) Data Transmission
CN109446093B (en) Method and device for testing expansion platform interface
CN112785408A (en) Account checking method and device based on Hash
CN110413354B (en) Method, equipment and storage medium for realizing interaction with third-party platform
CN111258856A (en) Method, system, equipment and medium for monitoring running state of solid state disk
CN110225423B (en) Method, apparatus and medium for adaptive optical signal rate for switch system
US20160295416A1 (en) Method and apparatus for performing a message integrity check
CN111309264B (en) Method, system, device and medium for making directory quota compatible with snapshot
WO2021129742A1 (en) Network data recovery method and device, and mobile terminal
CN116505954B (en) Huffman coding method, system, device and medium
CN107562790B (en) Method and system for realizing batch warehousing of data processing
CN111722959B (en) Method, system, equipment and medium for expanding storage pool
CN111787268B (en) Audio signal processing method and device, electronic equipment and storage medium
CN111736985B (en) Method, system, equipment and medium for accelerating log to fall into disk
CN111177066A (en) Method, device and medium for improving efficiency of accessing off-chip memory
CN110780865B (en) Method, equipment and medium for right-side alignment display of command line
CN114916029A (en) Method, apparatus and medium for triggering user equipment policy update
CN111090432A (en) Interface processing method and device and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant