CN114121050A - Audio playing method and device, electronic equipment and storage medium - Google Patents
Audio playing method and device, electronic equipment and storage medium Download PDFInfo
- Publication number
- CN114121050A CN114121050A CN202111451000.XA CN202111451000A CN114121050A CN 114121050 A CN114121050 A CN 114121050A CN 202111451000 A CN202111451000 A CN 202111451000A CN 114121050 A CN114121050 A CN 114121050A
- Authority
- CN
- China
- Prior art keywords
- audio
- played
- abnormal
- segment
- abnormal segment
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 34
- 230000002159 abnormal effect Effects 0.000 claims abstract description 121
- 230000004044 response Effects 0.000 claims abstract description 16
- 238000004590 computer program Methods 0.000 claims description 10
- 238000011038 discontinuous diafiltration by volume reduction Methods 0.000 claims description 7
- 238000012545 processing Methods 0.000 description 12
- 238000010586 diagram Methods 0.000 description 9
- 230000006870 function Effects 0.000 description 8
- 238000005457 optimization Methods 0.000 description 7
- 238000004891 communication Methods 0.000 description 6
- 230000003287 optical effect Effects 0.000 description 4
- 239000012634 fragment Substances 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 210000005069 ears Anatomy 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 239000000835 fiber Substances 0.000 description 2
- 230000000644 propagated effect Effects 0.000 description 2
- 239000004065 semiconductor Substances 0.000 description 2
- 238000012549 training Methods 0.000 description 2
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 239000013307 optical fiber Substances 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 230000001755 vocal effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/10—Digital recording or reproducing
- G11B20/10009—Improvement or modification of read or write signals
- G11B20/10481—Improvement or modification of read or write signals optimisation methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/165—Management of the audio stream, e.g. setting of volume, audio stream path
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
The embodiment of the application discloses an audio playing method and device, electronic equipment and a storage medium. One embodiment of the method comprises: acquiring audio to be played; determining whether the audio to be played comprises an abnormal segment; in response to the fact that the audio to be played comprises the abnormal segment, optimizing the abnormal segment in the audio to be played to obtain an optimized audio; and playing the optimized audio. The method and the device optimize the abnormal segments in the audio to be played and improve the user experience.
Description
Technical Field
The embodiment of the application relates to the technical field of computers, in particular to an audio playing method, an audio playing device, electronic equipment and a storage medium.
Background
Along with the development of the intelligent terminal, the entertainment functions of the intelligent terminal are more and more abundant, and a user can use the intelligent terminal to perform entertainment activities such as listening to music, watching videos or playing games, but the user often receives the interference of some abnormal sound segments, such as noise segments, in the process, and the experience is very poor.
Disclosure of Invention
The embodiment of the application provides an audio playing method, an audio playing device, electronic equipment and a storage medium.
In a first aspect, some embodiments of the present application provide an audio playing method, including: acquiring audio to be played; determining whether the audio to be played comprises an abnormal segment; in response to the fact that the audio to be played comprises the abnormal segment, optimizing the abnormal segment in the audio to be played to obtain an optimized audio; and playing the optimized audio.
In some embodiments, determining whether the audio to be played includes an abnormal segment includes: determining whether the audio to be played can be played through the earphone; and responding to the determination that the audio to be played is played through the earphone, and determining whether the audio to be played comprises an abnormal segment.
In some embodiments, determining whether the audio to be played includes an abnormal segment includes: acquiring a voiceprint to be played by audio; and determining whether the acquired voiceprints comprise abnormal voiceprints or not through a pre-established abnormal voiceprint recognition library.
In some embodiments, the abnormal voiceprint recognition library comprises a recognition library established via the steps of: and carrying out statistical storage on the voiceprints of the common noise and the preset abnormal sound.
In some embodiments, optimizing abnormal segments included in the audio to be played to obtain an optimized audio includes: and carrying out volume reduction on the abnormal segment included in the audio to be played and/or replacing the abnormal segment by using preset audio content.
In a second aspect, some embodiments of the present application provide an audio playback apparatus, including: an acquisition unit configured to acquire audio to be played; a determination unit configured to determine whether an abnormal section is included in the audio to be played; the optimizing unit is configured to respond to the fact that the abnormal segments are determined to be included in the audio to be played, and optimize the abnormal segments included in the audio to be played to obtain optimized audio; a playback unit configured to play the optimized audio.
In some embodiments, the determining unit comprises: a first determining subunit configured to determine whether audio to be played will be played through the headphones; and the second determining subunit is configured to determine whether the abnormal segment is included in the audio to be played in response to determining that the audio to be played is to be played through the earphone.
In some embodiments, the determining unit comprises: an obtaining subunit configured to obtain a voiceprint to be played by audio; and the identifying subunit is configured to determine whether the acquired voiceprints comprise abnormal voiceprints through a pre-established abnormal voiceprint identifying library.
In some embodiments, the apparatus further comprises an abnormal voiceprint recognition library creation unit configured to: and carrying out statistical storage on the voiceprints of the common noise and the preset abnormal sound.
In some embodiments, the optimization unit is further configured to: and carrying out volume reduction on the abnormal segment included in the audio to be played and/or replacing the abnormal segment by using preset audio content.
In a third aspect, some embodiments of the present application provide an apparatus comprising: one or more processors; a storage device, on which one or more programs are stored, which, when executed by the one or more processors, cause the one or more processors to implement the method as described above in the first aspect.
In a fourth aspect, some embodiments of the present application provide a computer readable medium having stored thereon a computer program which, when executed by a processor, implements the method as described above in the first aspect.
According to the audio playing method, the audio playing device, the electronic equipment and the storage medium, the audio to be played is obtained; determining whether the audio to be played comprises an abnormal segment; in response to the fact that the audio to be played comprises the abnormal segment, optimizing the abnormal segment in the audio to be played to obtain an optimized audio; the optimized audio is played, the abnormal segment in the audio to be played is optimized, and the user experience is improved.
Drawings
Other features, objects and advantages of the present application will become more apparent upon reading of the following detailed description of non-limiting embodiments thereof, made with reference to the accompanying drawings in which:
FIG. 1 is a diagram of an exemplary system architecture to which some of the present application may be applied;
FIG. 2 is a flow diagram of one embodiment of an audio playback method according to the present application;
FIG. 3 is a schematic block diagram of an embodiment of an audio playback device according to the present application;
FIG. 4 is a block diagram of a computer system suitable for use in implementing a server or terminal of some embodiments of the present application.
Detailed Description
The present application will be described in further detail with reference to the following drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the relevant invention and not restrictive of the invention. It should be noted that, for convenience of description, only the portions related to the related invention are shown in the drawings.
It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict. The present application will be described in detail below with reference to the embodiments with reference to the attached drawings.
Fig. 1 shows an exemplary system architecture 100 to which embodiments of the audio playback method or audio playback apparatus of the present application may be applied.
As shown in fig. 1, the system architecture 100 may include terminal devices 101, 102, 103, a network 104, and a server 105. The network 104 serves as a medium for providing communication links between the terminal devices 101, 102, 103 and the server 105. Network 104 may include various connection types, such as wired, wireless communication links, or fiber optic cables, to name a few.
The user may use the terminal devices 101, 102, 103 to interact with the server 105 via the network 104 to receive or send messages or the like. Various client applications, such as music playing applications, video playing applications, e-commerce applications, game applications, etc., may be installed on the terminal devices 101, 102, 103.
The terminal apparatuses 101, 102, and 103 may be hardware or software. When the terminal devices 101, 102, 103 are hardware, they may be various electronic devices with audio playing function, including but not limited to smart phones, tablet computers, laptop portable computers, desktop computers, and the like. When the terminal apparatuses 101, 102, 103 are software, they can be installed in the electronic apparatuses listed above. It may be implemented as multiple pieces of software or software modules, or as a single piece of software or software module. And is not particularly limited herein.
The server 105 may be a server providing various services, for example, a background server providing support for applications installed on the terminal devices 101, 102, and 103, and the server 105 may obtain audio to be played; determining whether the audio to be played comprises an abnormal segment; in response to the fact that the audio to be played comprises the abnormal segment, optimizing the abnormal segment in the audio to be played to obtain an optimized audio; and playing the optimized audio.
It should be noted that the audio playing method provided in the embodiment of the present application may be executed by the server 105, or may be executed by the terminal devices 101, 102, and 103, and accordingly, the audio playing apparatus may be disposed in the server 105, or may be disposed in the terminal devices 101, 102, and 103.
The server may be hardware or software. When the server is hardware, it may be implemented as a distributed server cluster formed by multiple servers, or may be implemented as a single server. When the server is software, it may be implemented as multiple pieces of software or software modules (e.g., to provide distributed services), or as a single piece of software or software module. And is not particularly limited herein.
It should be understood that the number of terminal devices, networks, and servers in fig. 1 is merely illustrative. There may be any number of terminal devices, networks, and servers, as desired for implementation.
With continued reference to FIG. 2, a flow 200 of one embodiment of an audio playback method according to the present application is shown. The audio playing method comprises the following steps:
In this embodiment, an audio playing method execution main body (for example, the server or the terminal shown in fig. 1) may obtain the audio to be played in response to receiving an audio playing instruction or in response to some playing setting. The audio to be played may include audio to be played while the user is engaged in an entertainment activity such as listening to music, watching video, or playing a game.
In this embodiment, the executing entity may determine whether the audio to be played acquired in step 201 includes an abnormal segment, where the abnormal segment is an abnormal sound segment, and may include various dissonant sound segments, such as sounds with too large volume, and segments of annoying sounds such as glaring or unintelligent words, for example. The abnormal fragments can be detected by a preset abnormal voiceprint recognition library or a pre-trained abnormal voiceprint recognition model, and since the voiceprints also belong to the category of images, the abnormal fragments can be obtained by training by referring to a training method of a common image recognition model. In addition, since the voiceprint can be regarded as a waveform, the abnormal voiceprint can be identified by setting upper and lower limits of the waveform and the like.
In some optional implementations of this embodiment, determining whether the audio to be played includes an abnormal segment includes: determining whether the audio to be played can be played through the earphone; and responding to the determination that the audio to be played is played through the earphone, and determining whether the audio to be played comprises an abnormal segment. In this implementation manner, the execution main body may determine whether the audio to be played will be played through the earphone by detecting whether the terminal is connected to the earphone and/or by determining whether the user wears the earphone through hardware devices such as the earphone.
Because for the situation of playing audio through the loudspeaker, the influence of hearing the abnormal segment is more direct when the user wears the earphone, and the damage to the hearing ability or the experience of the user is larger, therefore, in the implementation manner, before determining whether the abnormal segment is included in the audio to be played, it is determined whether the audio to be played can be played through the earphone, and after determining that the audio to be played can be played through the earphone, it is determined whether the abnormal segment is included in the audio to be played, so that the detection of the abnormal segment is more accurate, and the audio playing efficiency is further improved.
In some optional implementations of this embodiment, determining whether the audio to be played includes an abnormal segment includes: acquiring a voiceprint to be played by audio; and determining whether the acquired voiceprints comprise abnormal voiceprints or not through a pre-established abnormal voiceprint recognition library. Voiceprint (Voiceprint) is the spectrum of sound waves carrying verbal information displayed with an electro-acoustic instrument. In this implementation manner, the abnormal voiceprint recognition library may store information that can determine the abnormal voiceprint, such as the abnormal voiceprint itself and/or the characteristics of the abnormal voiceprint, and the execution main body may determine whether a voiceprint fragment in the voiceprint to be played matches with information in the abnormal voiceprint recognition library that is established in advance. According to the implementation mode, the abnormal voiceprint recognition library is established in advance, so that whether the acquired voiceprints comprise the abnormal voiceprints can be determined more comprehensively and in an individualized mode.
In some optional implementations of this embodiment, the abnormal voiceprint recognition library comprises a recognition library established via: and carrying out statistical storage on the voiceprints of the common noise and the preset abnormal sound. In this implementation manner, the voiceprints of the common noise and the preset abnormal sound can be directly stored, and the common characteristics of the voiceprints of the common noise and the preset abnormal sound can be counted and stored to obtain the abnormal voiceprint recognition library. The preset abnormal sound may include an abnormal sound set by a user and/or an application provider.
In this embodiment, the executing entity may perform, in response to determining that the audio to be played includes the abnormal segment in step 202, optimization processing on the abnormal segment included in the audio to be played to obtain an optimized audio. The optimization processing may include volume reduction and/or replacement using preset audio content, or may perform more personalized optimization processing according to a difference of the abnormal segment or a difference of the user, for example, volume reduction processing is performed on an abnormal segment with too large volume, tone reduction processing is performed on an abnormal segment with sharp ears, deletion processing is performed on an abnormal segment repeated for many times, and replacement processing is performed on an abnormal segment of the inexplicable language.
In some optional implementation manners of this embodiment, optimizing an abnormal segment included in the audio to be played to obtain an optimized audio includes: and carrying out volume reduction on the abnormal segment included in the audio to be played and/or replacing the abnormal segment by using preset audio content. The execution main body can reduce the volume to a range comfortable for human ears, and the preset audio content can comprise common audio such as 'drop' sound or the like or can use audio content set by a user. According to the implementation mode, the abnormal segment can be optimized rapidly by reducing the volume and/or replacing the preset audio content, and the audio playing efficiency is further improved.
And step 204, playing the optimized audio.
In this embodiment, the execution subject may play the audio obtained by optimizing the abnormal segment included in the audio to be played in step 203.
The method provided by the above embodiment of the present application obtains the audio to be played; determining whether the audio to be played comprises an abnormal segment; in response to the fact that the audio to be played comprises the abnormal segment, optimizing the abnormal segment in the audio to be played to obtain an optimized audio; the optimized audio is played, the abnormal segment in the audio to be played is optimized, and the user experience is improved.
With further reference to fig. 3, as an implementation of the methods shown in the above-mentioned figures, the present application provides an embodiment of an audio playing apparatus, which corresponds to the embodiment of the method shown in fig. 2, and which can be applied to various electronic devices.
As shown in fig. 3, the audio playing device 300 of the present embodiment includes: an acquisition unit 301, a first determination unit 302, a second determination unit 303, and a first generation unit 304. The acquisition unit is configured to acquire audio to be played; a determination unit configured to determine whether an abnormal section is included in the audio to be played; the optimizing unit is configured to respond to the fact that the abnormal segments are determined to be included in the audio to be played, and optimize the abnormal segments included in the audio to be played to obtain optimized audio; a playback unit configured to play the optimized audio.
In this embodiment, the specific processing of the acquiring unit 301, the determining unit 302, the optimizing unit 303 and the playing unit 304 of the audio playing apparatus 300 may refer to step 201, step 202, step 203 and step 204 in the corresponding embodiment of fig. 2.
In some optional implementations of this embodiment, the determining unit includes: a first determining subunit configured to determine whether audio to be played will be played through the headphones; and the second determining subunit is configured to determine whether the abnormal segment is included in the audio to be played in response to determining that the audio to be played is to be played through the earphone.
In some optional implementations of this embodiment, the determining unit includes: an obtaining subunit configured to obtain a voiceprint to be played by audio; and the identifying subunit is configured to determine whether the acquired voiceprints comprise abnormal voiceprints through a pre-established abnormal voiceprint identifying library.
In some optional implementations of this embodiment, the apparatus further includes an abnormal voiceprint recognition library establishing unit configured to: and carrying out statistical storage on the voiceprints of the common noise and the preset abnormal sound.
In some optional implementations of this embodiment, the optimization unit is further configured to: and carrying out volume reduction on the abnormal segment included in the audio to be played and/or replacing the abnormal segment by using preset audio content.
The device provided by the above embodiment of the present application obtains the audio to be played; determining whether the audio to be played comprises an abnormal segment; in response to the fact that the audio to be played comprises the abnormal segment, optimizing the abnormal segment in the audio to be played to obtain an optimized audio; the optimized audio is played, the abnormal segment in the audio to be played is optimized, and the user experience is improved.
Referring now to FIG. 4, a block diagram of a computer system 400 suitable for use in implementing a server or terminal of an embodiment of the present application is shown. The server or the terminal shown in fig. 4 is only an example, and should not bring any limitation to the functions and the scope of use of the embodiments of the present application.
As shown in fig. 4, the computer system 400 includes a Central Processing Unit (CPU)401 that can perform various appropriate actions and processes in accordance with a program stored in a Read Only Memory (ROM)402 or a program loaded from a storage section 408 into a Random Access Memory (RAM) 403. In the RAM 403, various programs and data necessary for the operation of the system 400 are also stored. The CPU 401, ROM 402, and RAM 403 are connected to each other via a bus 404. An input/output (I/O) interface 405 is also connected to bus 404.
The following components may be connected to the I/O interface 405: an input section 406 including a keyboard, a mouse, and the like; an output section 407 including a display device such as a Cathode Ray Tube (CRT), a Liquid Crystal Display (LCD), and the like, and a speaker; a storage section 408 including a hard disk and the like; and a communication section 409 including a network interface card such as a LAN card, a modem, or the like. The communication section 409 performs communication processing via a network such as the internet. A driver 410 is also connected to the I/O interface 405 as needed. A removable medium 411 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like is mounted on the drive 410 as necessary, so that a computer program read out therefrom is mounted into the storage section 408 as necessary.
In particular, according to an embodiment of the present disclosure, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product comprising a computer program embodied on a computer readable medium, the computer program comprising program code for performing the method illustrated in the flow chart. In such an embodiment, the computer program may be downloaded and installed from a network through the communication section 409, and/or installed from the removable medium 411. The computer program performs the above-described functions defined in the method of the present application when executed by a Central Processing Unit (CPU) 401. It should be noted that the computer readable medium described herein can be a computer readable signal medium or a computer readable medium or any combination of the two. A computer readable medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples of the computer readable medium may include, but are not limited to: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the present application, a computer readable medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. In this application, however, a computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may also be any computer readable medium that is not a computer readable medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to: wireless, wire, fiber optic cable, RF, etc., or any suitable combination of the foregoing.
Computer program code for carrying out operations for aspects of the present application may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C + +, and conventional procedural programming languages, such as the C language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any type of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider).
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present application. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
The units described in the embodiments of the present application may be implemented by software or hardware. The described units may also be provided in a processor, and may be described as: a processor includes an acquisition unit, a determination unit, an optimization unit, and a playback unit. Where the names of the units do not in some cases constitute a limitation on the units themselves, for example, the obtaining unit may also be described as "configured to obtain the unit to be audio played".
As another aspect, the present application also provides a computer-readable medium, which may be contained in the apparatus described in the above embodiments; or may be present separately and not assembled into the device. The computer readable medium carries one or more programs which, when executed by the apparatus, cause the apparatus to: acquiring audio to be played; determining whether the audio to be played comprises an abnormal segment; in response to the fact that the audio to be played comprises the abnormal segment, optimizing the abnormal segment in the audio to be played to obtain an optimized audio; and playing the optimized audio.
The above description is only a preferred embodiment of the application and is illustrative of the principles of the technology employed. It will be appreciated by those skilled in the art that the scope of the invention herein disclosed is not limited to the particular combination of features described above, but also encompasses other arrangements formed by any combination of the above features or their equivalents without departing from the spirit of the invention. For example, the above features may be replaced with (but not limited to) features having similar functions disclosed in the present application.
Claims (10)
1. An audio playback method, comprising:
acquiring to-be-played audio;
determining whether the to-be-audio playing comprises an abnormal segment;
in response to the fact that the abnormal segment is included in the to-be-played audio, optimizing the abnormal segment included in the to-be-played audio to obtain an optimized audio;
and playing the optimized audio.
2. The method of claim 1, wherein the determining whether the to-be-audio-played includes an abnormal segment comprises:
determining whether the audio to be played can be played through an earphone;
and in response to determining that the to-be-played audio can be played through the earphone, determining whether the to-be-played audio comprises an abnormal segment.
3. The method of claim 1, wherein the determining whether the to-be-audio-played includes an abnormal segment comprises:
acquiring the voiceprint to be played by the audio;
and determining whether the acquired voiceprints comprise abnormal voiceprints or not through a pre-established abnormal voiceprint recognition library.
4. The method of claim 1, wherein the abnormal voiceprint recognition library comprises a recognition library established via:
and carrying out statistical storage on the voiceprints of the common noise and the preset abnormal sound.
5. The method according to any one of claims 1 to 4, wherein the optimizing the abnormal segment included in the to-be-played audio to obtain an optimized audio includes:
and carrying out volume reduction on the abnormal segment included in the audio playing to be processed and/or replacing the abnormal segment by using preset audio content.
6. An audio playback apparatus comprising:
an acquisition unit configured to acquire a to-be-audio-played;
a determining unit configured to determine whether an abnormal segment is included in the to-be-audio-played;
the optimizing unit is configured to respond to the fact that the abnormal segment is determined to be included in the to-be-audio playing, and optimize the abnormal segment included in the to-be-audio playing to obtain an optimized audio;
a playback unit configured to play the optimized audio.
7. The apparatus of claim 6, wherein the determining unit comprises:
a first determining subunit configured to determine whether the to-be-audio-played will be played through a headphone;
and the second determining subunit is configured to determine whether the to-be-audio-played audio includes an abnormal segment or not in response to determining that the to-be-audio-played audio is to be played through the earphone.
8. The apparatus of claim 6, wherein the determining unit comprises:
an obtaining subunit, configured to obtain the voiceprint to be played by the audio;
and the identifying subunit is configured to determine whether the acquired voiceprints comprise abnormal voiceprints through a pre-established abnormal voiceprint identifying library.
9. An electronic device, comprising:
one or more processors;
a storage device having one or more programs stored thereon;
the one or more programs, when executed by the one or more processors, cause the one or more processors to implement the method recited in any of claims 1-5.
10. A computer-readable medium, on which a computer program is stored which, when being executed by a processor, carries out the method according to any one of claims 1-5.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111451000.XA CN114121050B (en) | 2021-11-30 | 2021-11-30 | Audio playing method, device, electronic equipment and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111451000.XA CN114121050B (en) | 2021-11-30 | 2021-11-30 | Audio playing method, device, electronic equipment and storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114121050A true CN114121050A (en) | 2022-03-01 |
CN114121050B CN114121050B (en) | 2024-09-03 |
Family
ID=80369085
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111451000.XA Active CN114121050B (en) | 2021-11-30 | 2021-11-30 | Audio playing method, device, electronic equipment and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114121050B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115394316A (en) * | 2022-08-23 | 2022-11-25 | 汉桑(南京)科技股份有限公司 | Audio processing method, system, device and storage medium |
Citations (33)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020110356A1 (en) * | 1999-12-16 | 2002-08-15 | Sony Corporation | Audio signal processing method, audio signal processing apparatus, Hi-Fi video apparatus, digital video apparatus and 8 mm video apparatus |
CN101695134A (en) * | 2009-10-15 | 2010-04-14 | 中兴通讯股份有限公司 | Terminal, system and method for improving play performance of terminal in weak signal environment |
US20120185418A1 (en) * | 2009-04-24 | 2012-07-19 | Thales | System and method for detecting abnormal audio events |
WO2017028704A1 (en) * | 2015-08-18 | 2017-02-23 | 阿里巴巴集团控股有限公司 | Method and device for providing accompaniment music |
CN106686226A (en) * | 2016-12-21 | 2017-05-17 | 惠州Tcl移动通信有限公司 | Method and system for playing audio of terminal |
CN106973168A (en) * | 2017-05-04 | 2017-07-21 | 广东欧珀移动通信有限公司 | Speech playing method, device and computer equipment |
CN107086039A (en) * | 2017-05-25 | 2017-08-22 | 北京小鱼在家科技有限公司 | A kind of acoustic signal processing method and device |
CN107256139A (en) * | 2017-05-08 | 2017-10-17 | 深圳市科迈爱康科技有限公司 | Method of adjustment, terminal and the computer-readable recording medium of audio volume |
CN107493500A (en) * | 2017-08-03 | 2017-12-19 | 北京小米移动软件有限公司 | Multimedia resource player method and device |
CN107566890A (en) * | 2017-09-15 | 2018-01-09 | 深圳国微技术有限公司 | Handle audio stream broadcasting abnormal method, apparatus, computer installation and computer-readable recording medium |
CN107728990A (en) * | 2017-09-30 | 2018-02-23 | 努比亚技术有限公司 | A kind of audio frequency playing method, mobile terminal and computer-readable recording medium |
JP2018120319A (en) * | 2017-01-24 | 2018-08-02 | 株式会社ディーアンドエムホールディングス | Audio system, audio player, remote controller, and computer readable program |
CN108495164A (en) * | 2018-04-09 | 2018-09-04 | 珠海全志科技股份有限公司 | Audio-visual synchronization processing method and processing device, computer installation and storage medium |
CN108986830A (en) * | 2018-08-28 | 2018-12-11 | 安徽淘云科技有限公司 | A kind of audio corpus screening technique and device |
CN109375894A (en) * | 2018-11-29 | 2019-02-22 | 努比亚技术有限公司 | Earpiece volume based reminding method, device, mobile terminal and readable storage medium storing program for executing |
CN109451349A (en) * | 2018-10-31 | 2019-03-08 | 维沃移动通信有限公司 | A kind of video broadcasting method, device and mobile terminal |
CN109672961A (en) * | 2018-12-14 | 2019-04-23 | 歌尔科技有限公司 | A kind of volume adjusting method, equipment and storage medium |
CN109817227A (en) * | 2018-12-06 | 2019-05-28 | 洛阳语音云创新研究院 | A kind of the abnormal sound monitoring method and system of farm |
TWI662544B (en) * | 2018-05-28 | 2019-06-11 | 塞席爾商元鼎音訊股份有限公司 | Method for detecting ambient noise to change the playing voice frequency and sound playing device thereof |
CN109951729A (en) * | 2019-03-22 | 2019-06-28 | 百度在线网络技术(北京)有限公司 | Method and apparatus for handling data |
CN110148402A (en) * | 2019-05-07 | 2019-08-20 | 平安科技(深圳)有限公司 | Method of speech processing, device, computer equipment and storage medium |
CN110489076A (en) * | 2019-08-22 | 2019-11-22 | 百度在线网络技术(北京)有限公司 | Ambient sound monitoring method, device and electronic equipment |
WO2020103070A1 (en) * | 2018-11-22 | 2020-05-28 | 深圳市欢太科技有限公司 | Method and apparatus for processing application program, and electronic device |
WO2020107290A1 (en) * | 2018-11-28 | 2020-06-04 | 深圳市欢太科技有限公司 | Audio output control method and apparatus, computer readable storage medium, and electronic device |
CN111294642A (en) * | 2018-12-10 | 2020-06-16 | 杭州海康威视数字技术股份有限公司 | Video stream playing method and device |
WO2020124541A1 (en) * | 2018-12-21 | 2020-06-25 | 深圳市欢太科技有限公司 | Audio processing method and apparatus, computer readable storage medium, and electronic device |
CN111669625A (en) * | 2020-06-12 | 2020-09-15 | 北京字节跳动网络技术有限公司 | Processing method, device and equipment for shot file and storage medium |
CN112233696A (en) * | 2020-10-14 | 2021-01-15 | 李小红 | Oil field pumping unit abnormal sound detection and reporting system based on artificial intelligence and big data |
CN112399302A (en) * | 2020-11-25 | 2021-02-23 | 维沃移动通信有限公司 | Audio playing method and device of wearable audio playing device |
CN112860213A (en) * | 2021-03-09 | 2021-05-28 | 腾讯科技(深圳)有限公司 | Audio processing method, storage medium and electronic equipment |
CN112948636A (en) * | 2021-03-24 | 2021-06-11 | 黑龙江省能嘉教育科技有限公司 | Regional education cloud resource sharing system and method |
CN113205815A (en) * | 2021-04-28 | 2021-08-03 | 维沃移动通信有限公司 | Voice processing method and electronic equipment |
CN113421578A (en) * | 2021-06-02 | 2021-09-21 | 广州小鹏智慧出行科技有限公司 | Audio processing method and device, electronic equipment and storage medium |
-
2021
- 2021-11-30 CN CN202111451000.XA patent/CN114121050B/en active Active
Patent Citations (35)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020110356A1 (en) * | 1999-12-16 | 2002-08-15 | Sony Corporation | Audio signal processing method, audio signal processing apparatus, Hi-Fi video apparatus, digital video apparatus and 8 mm video apparatus |
US20120185418A1 (en) * | 2009-04-24 | 2012-07-19 | Thales | System and method for detecting abnormal audio events |
CN101695134A (en) * | 2009-10-15 | 2010-04-14 | 中兴通讯股份有限公司 | Terminal, system and method for improving play performance of terminal in weak signal environment |
WO2017028704A1 (en) * | 2015-08-18 | 2017-02-23 | 阿里巴巴集团控股有限公司 | Method and device for providing accompaniment music |
CN106686226A (en) * | 2016-12-21 | 2017-05-17 | 惠州Tcl移动通信有限公司 | Method and system for playing audio of terminal |
JP2018120319A (en) * | 2017-01-24 | 2018-08-02 | 株式会社ディーアンドエムホールディングス | Audio system, audio player, remote controller, and computer readable program |
CN106973168A (en) * | 2017-05-04 | 2017-07-21 | 广东欧珀移动通信有限公司 | Speech playing method, device and computer equipment |
CN107256139A (en) * | 2017-05-08 | 2017-10-17 | 深圳市科迈爱康科技有限公司 | Method of adjustment, terminal and the computer-readable recording medium of audio volume |
CN107086039A (en) * | 2017-05-25 | 2017-08-22 | 北京小鱼在家科技有限公司 | A kind of acoustic signal processing method and device |
CN107493500A (en) * | 2017-08-03 | 2017-12-19 | 北京小米移动软件有限公司 | Multimedia resource player method and device |
CN107566890A (en) * | 2017-09-15 | 2018-01-09 | 深圳国微技术有限公司 | Handle audio stream broadcasting abnormal method, apparatus, computer installation and computer-readable recording medium |
CN107728990A (en) * | 2017-09-30 | 2018-02-23 | 努比亚技术有限公司 | A kind of audio frequency playing method, mobile terminal and computer-readable recording medium |
CN108495164A (en) * | 2018-04-09 | 2018-09-04 | 珠海全志科技股份有限公司 | Audio-visual synchronization processing method and processing device, computer installation and storage medium |
TWI662544B (en) * | 2018-05-28 | 2019-06-11 | 塞席爾商元鼎音訊股份有限公司 | Method for detecting ambient noise to change the playing voice frequency and sound playing device thereof |
CN108986830A (en) * | 2018-08-28 | 2018-12-11 | 安徽淘云科技有限公司 | A kind of audio corpus screening technique and device |
CN109451349A (en) * | 2018-10-31 | 2019-03-08 | 维沃移动通信有限公司 | A kind of video broadcasting method, device and mobile terminal |
CN113039516A (en) * | 2018-11-22 | 2021-06-25 | 深圳市欢太科技有限公司 | Method and device for processing application program and electronic equipment |
WO2020103070A1 (en) * | 2018-11-22 | 2020-05-28 | 深圳市欢太科技有限公司 | Method and apparatus for processing application program, and electronic device |
WO2020107290A1 (en) * | 2018-11-28 | 2020-06-04 | 深圳市欢太科技有限公司 | Audio output control method and apparatus, computer readable storage medium, and electronic device |
CN109375894A (en) * | 2018-11-29 | 2019-02-22 | 努比亚技术有限公司 | Earpiece volume based reminding method, device, mobile terminal and readable storage medium storing program for executing |
CN109817227A (en) * | 2018-12-06 | 2019-05-28 | 洛阳语音云创新研究院 | A kind of the abnormal sound monitoring method and system of farm |
CN111294642A (en) * | 2018-12-10 | 2020-06-16 | 杭州海康威视数字技术股份有限公司 | Video stream playing method and device |
CN109672961A (en) * | 2018-12-14 | 2019-04-23 | 歌尔科技有限公司 | A kind of volume adjusting method, equipment and storage medium |
WO2020124541A1 (en) * | 2018-12-21 | 2020-06-25 | 深圳市欢太科技有限公司 | Audio processing method and apparatus, computer readable storage medium, and electronic device |
CN113168303A (en) * | 2018-12-21 | 2021-07-23 | 深圳市欢太科技有限公司 | Audio processing method and device, computer readable storage medium and electronic equipment |
CN109951729A (en) * | 2019-03-22 | 2019-06-28 | 百度在线网络技术(北京)有限公司 | Method and apparatus for handling data |
CN110148402A (en) * | 2019-05-07 | 2019-08-20 | 平安科技(深圳)有限公司 | Method of speech processing, device, computer equipment and storage medium |
CN110489076A (en) * | 2019-08-22 | 2019-11-22 | 百度在线网络技术(北京)有限公司 | Ambient sound monitoring method, device and electronic equipment |
CN111669625A (en) * | 2020-06-12 | 2020-09-15 | 北京字节跳动网络技术有限公司 | Processing method, device and equipment for shot file and storage medium |
CN112233696A (en) * | 2020-10-14 | 2021-01-15 | 李小红 | Oil field pumping unit abnormal sound detection and reporting system based on artificial intelligence and big data |
CN112399302A (en) * | 2020-11-25 | 2021-02-23 | 维沃移动通信有限公司 | Audio playing method and device of wearable audio playing device |
CN112860213A (en) * | 2021-03-09 | 2021-05-28 | 腾讯科技(深圳)有限公司 | Audio processing method, storage medium and electronic equipment |
CN112948636A (en) * | 2021-03-24 | 2021-06-11 | 黑龙江省能嘉教育科技有限公司 | Regional education cloud resource sharing system and method |
CN113205815A (en) * | 2021-04-28 | 2021-08-03 | 维沃移动通信有限公司 | Voice processing method and electronic equipment |
CN113421578A (en) * | 2021-06-02 | 2021-09-21 | 广州小鹏智慧出行科技有限公司 | Audio processing method and device, electronic equipment and storage medium |
Non-Patent Citations (2)
Title |
---|
张雅琪;: "基于用户、环境及信源特征的音频用户体验优化", 信息与电脑(理论版), no. 07, 31 July 2013 (2013-07-31) * |
彭小光;吕连新;吴洁;单雪松;: "基于声纹识别的广告监测系统研究与建设", 广播与电视技术, no. 09, 30 September 2020 (2020-09-30) * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115394316A (en) * | 2022-08-23 | 2022-11-25 | 汉桑(南京)科技股份有限公司 | Audio processing method, system, device and storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN114121050B (en) | 2024-09-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9131298B2 (en) | Constrained dynamic amplitude panning in collaborative sound systems | |
CN112306448A (en) | Method, apparatus, device and medium for adjusting output audio according to environmental noise | |
CN114073057B (en) | Method and system for server-side rendering of audio using client-side audio parameters | |
WO2022033534A1 (en) | Method for generating target video, apparatus, server, and medium | |
JP2020149038A (en) | Method and apparatus for waking up device | |
US11113092B2 (en) | Global HRTF repository | |
CN114121050B (en) | Audio playing method, device, electronic equipment and storage medium | |
CN114845212A (en) | Volume optimization method and device, electronic equipment and readable storage medium | |
CN111045634B (en) | Audio processing method and device | |
CN112309352A (en) | Audio information processing method, apparatus, device and medium | |
CN112307161B (en) | Method and apparatus for playing audio | |
CN114470774A (en) | Game sound effect processing method and device, storage medium and electronic equipment | |
CN111147655B (en) | Model generation method and device | |
CN109375892B (en) | Method and apparatus for playing audio | |
CN111145770B (en) | Audio processing method and device | |
CN111145792B (en) | Audio processing method and device | |
CN113779372A (en) | User group portrait establishing method and device | |
CN109445873B (en) | Method and device for displaying setting interface | |
CN111048107B (en) | Audio processing method and device | |
CN111145769A (en) | Audio processing method and device | |
CN111145776B (en) | Audio processing method and device | |
CN111045635B (en) | Audio processing method and device | |
CN111145793B (en) | Audio processing method and device | |
CN111210837B (en) | Audio processing method and device | |
CN111048108B (en) | Audio processing method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |