WO2022227655A1 - Procédé et appareil de lecture de sons, dispositif électronique et support de stockage lisible - Google Patents

Procédé et appareil de lecture de sons, dispositif électronique et support de stockage lisible Download PDF

Info

Publication number
WO2022227655A1
WO2022227655A1 PCT/CN2021/141559 CN2021141559W WO2022227655A1 WO 2022227655 A1 WO2022227655 A1 WO 2022227655A1 CN 2021141559 W CN2021141559 W CN 2021141559W WO 2022227655 A1 WO2022227655 A1 WO 2022227655A1
Authority
WO
WIPO (PCT)
Prior art keywords
sound
real
time
playback
playing
Prior art date
Application number
PCT/CN2021/141559
Other languages
English (en)
Chinese (zh)
Inventor
赵祥军
Original Assignee
歌尔股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 歌尔股份有限公司 filed Critical 歌尔股份有限公司
Publication of WO2022227655A1 publication Critical patent/WO2022227655A1/fr

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/10Earpieces; Attachments therefor ; Earphones; Monophonic headphones
    • H04R1/1083Reduction of ambient noise
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/10Earpieces; Attachments therefor ; Earphones; Monophonic headphones
    • H04R1/1091Details not provided for in groups H04R1/1008 - H04R1/1083

Definitions

  • the present invention relates to the technical field of audio processing, and in particular, to a sound playback method, device, electronic device and readable storage medium.
  • Using an electronic device with a noise reduction function can eliminate the interference of ambient sounds, but at the same time, the user cannot hear the ambient sound while reducing the noise.
  • the purpose of the present invention is to provide a sound playback method, device, electronic equipment and readable storage medium, which can collect real-time environmental sound during the normal use of electronic equipment with audio playback function, and match the real-time environmental sound with the playback sign.
  • playing the real-time ambient sound or turning off the audio playback function enables the user not to miss important ambient sounds when using the electronic device.
  • the present invention provides the following technical solutions:
  • a sound playback method, applied to an electronic device with playback function includes:
  • the playback flag If the sound content matches the playback flag, the real-time ambient sound is played or the audio playback function is turned off.
  • the playback mark is a semantic mark or a scene mark
  • content detection is performed on the real-time environmental sound to obtain the sound content, including:
  • the semantic recognition result or the scene recognition result is determined as the sound content.
  • the real-time ambient sound is enhanced and played.
  • the electronic device includes at least two collection ends arranged at intervals, and the collection of real-time ambient sound includes:
  • the sound content is matched with the playback sign, then compare the audio sizes of the sound content collected by the two described collection terminals respectively, and play the real-time environment collected by the collection terminal corresponding to the high-frequency audio after the comparison is completed. sound.
  • the electronic device includes two playback terminals corresponding to the two acquisition terminals, and if the sound content matches the playback flag, the high audio frequency is played by the playback terminal corresponding to the acquisition terminal corresponding to the high audio frequency. Corresponding to the real-time ambient sound collected by the collecting terminal.
  • the method further includes:
  • the method further includes:
  • the semantic recognition result includes a preset conversation end marker, stop playing the real-time ambient sound.
  • a sound playback device applied to electronic equipment with playback function, includes:
  • Sound acquisition module used to collect real-time ambient sound
  • a speech recognition module for performing content detection on the real-time environmental sound to obtain the sound content
  • a playback control module configured to play the real-time ambient sound or disable the audio playback function if the sound content matches the playback flag.
  • An electronic device comprising:
  • the acquisition terminal is used to collect real-time ambient sound
  • a playback end for playing a sound, the sound including the real-time environmental sound
  • the processor is configured to implement the steps of the above-mentioned sound playing method when executing the computer program.
  • a readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, implements the steps of the above sound playback method.
  • an electronic device with a playback function collects real-time environmental sound; performs content detection on the real-time environmental sound to obtain the sound content; if the sound content matches the playback flag, the real-time environmental sound is played or the audio playback is turned off Function.
  • a play flag can be set in the electronic device in advance.
  • Electronic equipment captures real-time ambient sound. Then, content detection is performed on the real-time ambient sound to obtain the corresponding sound content.
  • the real-time ambient sound can be played or the audio playback function can be turned off. If the real-time ambient sound is played, the user of the electronic device can hear the played real-time ambient sound, avoiding the wrong and important ambient sound; if the audio playback function is turned off, the user of the electronic device can play the audio without the interference of the audio playback. to hear the ambient sound directly.
  • the electronic device collects and recognizes the ambient sound and plays the ambient sound under certain conditions or directly turns off the audio playback function, so that the earphone wearer can use the earphone normally. Ambient sounds corresponding to play marks can also be heard.
  • the embodiments of the present invention also provide a sound playing device, an electronic device, and a readable storage medium corresponding to the above-mentioned sound playing method, which have the above-mentioned technical effects, and are not repeated here.
  • Fig. 1 is the implementation flow chart of a kind of sound playing method in the embodiment of the present invention
  • FIG. 2 is a schematic structural diagram of a sound playback device according to an embodiment of the present invention.
  • FIG. 3 is a schematic structural diagram of an electronic device in an embodiment of the present invention.
  • the sound playback method provided by the present invention can be applied to an electronic device with a sound playback function, and the electronic device can be specifically a wearable device or a non-wearable device.
  • the sound playback method can be applied to headphones, wearable devices such as VR, AR, Bluetooth glasses, etc.; in addition, it can also be applied to non-wearable devices that require active noise reduction, such as home theaters, car audio, etc. other scenes.
  • FIG. 1 is a flowchart of a method for playing a sound in an embodiment of the present invention.
  • the method can be applied to an earphone, and the method includes the following steps:
  • the earphone When the earphone is worn by the user, due to the structure of the earphone itself and the special function of sound insulation and noise reduction, the wearer of the earphone cannot hear or hear the ambient sound clearly. In order to prevent the earphone wearer from erroneously making an important environmental sound, in this embodiment, the earphone can collect and wait for an opportunity to play the environmental sound, so that the earphone wearer does not miss the important environmental sound.
  • the specific model and type of the earphone are not limited. That is to say, the earphones can be wired earphones, wireless earphones (such as ordinary Bluetooth earphones or TWS earphones), ordinary earphones without special noise reduction function, or earphones with special noise reduction function (such as active noise cancelling headphones).
  • the sound playback method provided by the embodiment of the present invention is applied to the earphone that can isolate the ambient sound more, and the effect is better.
  • the headset can detect its own working state or the state of the sensors on the headset to determine whether it is in a wearing state. For example, if the headset detects that music or a call is currently being played, it is determined that it is currently in a wearing state; or it is determined that it is in a wearing state by detecting the signals collected by sensors related to wearing detection in the headset, such as a Hall sensor and a pressure sensor.
  • the device that collects the sound of the implementation link may be a device on an earphone, such as a microphone on the earphone, or a device not on the earphone, such as a host or earphone corresponding to a communication connection with the earphone. microphone.
  • the microphone on the earphone is used for real-time environmental sound collection
  • the microphone in the current earphone can be multiplexed, for example, the sound collected by the active noise reduction MIC (microphone) in the earphone can be directly used for multiplexing.
  • only one microphone can be used for acquisition or two microphones can be used for acquisition.
  • S102 Perform content detection on real-time ambient sound to obtain sound content.
  • the sound content of the real-time ambient sound can be detected.
  • the sound content may be concrete semantic content in the sound, or a corresponding scene situation, or a special sound (such as a siren).
  • Content detection for real-time ambient sound can be performed according to actual application requirements. For example, if the semantic content needs to be detected, the content detection of the real-time environmental sound refers to the semantic recognition of the real-time environmental sound.
  • the following takes the specific implementation of the separate semantic recognition and the separate scene recognition as an example to describe how to perform content detection on the real-time environmental sound to obtain the sound content, and to describe the distance.
  • the playback sign is a semantic sign
  • step S102 performs content detection on the real-time environmental sound to obtain the sound content, which may specifically include:
  • Step 1 Perform semantic recognition on the real-time environmental sound to obtain a semantic recognition result
  • Step 2 Determining the semantic recognition result as sound content.
  • the implementation environment sound can be input into the semantic recognition module in the earphone for recognition, and the speech recognition result in the real-time environmental sound can be obtained, for example, the text content corresponding to the speech sound of the person talking is recognized.
  • the real-time environmental sound can also be transmitted to the host computer or mobile phone connected to the headset, and then use the trained semantic recognition model or recognition algorithm to semantically recognize the real-time environmental sound on these devices, and then obtain the semantic recognition result.
  • the speech recognition result is used as the sound content of the real-time ambient sound.
  • model and technology to be used to perform semantic recognition on real-time environmental sound reference may be made to relevant solutions for semantic recognition, which will not be repeated here.
  • the playback flag is a scene flag
  • step S102 performs content detection on the real-time environmental sound to obtain the sound content, which may specifically include:
  • Step 1 Perform scene recognition on the real-time environmental sound to obtain a scene recognition result
  • Step 2 Determine the scene recognition result as the sound content.
  • the scene recognition module that implements the ambient sound input into the earphone can be used for identification, such as identifying the current indoor scene, outdoor scene, or some special scene (such as a warning scene of whistle and alarm sound), and get Scene recognition results in real-time ambient sound.
  • identification such as identifying the current indoor scene, outdoor scene, or some special scene (such as a warning scene of whistle and alarm sound)
  • Scene recognition results in real-time ambient sound.
  • devices such as the host or mobile phone connected to the headset, and then use the trained scene recognition model or recognition algorithm to perform scene recognition on the real-time environmental sound on these devices, and then obtain the scene recognition result.
  • the scene recognition result is used as the sound content of the real-time ambient sound.
  • model and technology to be used for scene recognition of real-time environmental sound reference may be made to the relevant scheme of scene recognition, which will not be repeated here.
  • the above-mentioned specific embodiment 1 and specific embodiment 2 can be used in combination, which can solve the problem of not being able to distinguish danger signals at the first time when wearing headphones, resulting in the problem of endangering personal safety, and wearing headphones with others.
  • the user When the user is talking, they cannot hear and answer the first time, causing the user to have social misunderstandings.
  • the content detection performed can detect the left real-time ambient sound and the right real-time ambient sound respectively.
  • the left sound content and the right sound content are obtained respectively.
  • only the sound content corresponding to the left real-time ambient sound or the right real-time ambient sound matches the playback flag, only the corresponding one-sided real-time ambient sound is played, so as to keep one earphone and continue to play the original sound content, such as Music or call sound.
  • the real-time ambient sound is enhanced for playback.
  • enhanced playback refers to highlighting the real-time environmental sound, such as playing the real-time environmental sound only after increasing the volume, or playing it in a loop, or playing the real-time environmental sound after shielding the current audio sound (such as the sound of music being played). sound.
  • the collected real-time environmental sounds include danger signals such as car roaring, wind whistling, and car roaring (corresponding to the environmental signs that need to play real-time environmental sounds), pause the working state of a certain headset, and when the dangerous sound of a certain headset
  • danger signals such as car roaring, wind whistling, and car roaring
  • one or more playback flags may be preset, and when there are multiple playback flags, the types of playback flags may be different.
  • the play flags that can be set include semantic flags and scene flags.
  • the semantic mark can be a preset wearer's name, or other titles.
  • the scene flag may specifically be a scene in which the wearer needs to hear the ambient sound, such as an outdoor scene, a traffic condition scene, or the like.
  • the sound content can be compared with the play mark. If any play mark is included in the sound content, it is determined that the sound content matches the play mark, and the real-time ambient sound can be played or closed. Audio playback function. Specifically, if the electronic device itself does not have a noise reduction function, that is, after the audio playback function is turned off, the user can clearly hear the ambient sound, then the audio playback can be directly turned off when it is determined that the sound content matches the playback logo. Function. If the electronic device itself has a certain noise reduction function, even if the audio playback function is turned off, it is difficult for the user to hear the external sound. In order to ensure that the user does not miss the important environmental sound, the real-time environmental sound can be played.
  • the real-time ambient sound can be played only in one earphone for use, or the real-time ambient sound can be simultaneously played in two earphones.
  • the real-time ambient sound is played in two headphones at the same time, the currently final playing sound content can be directly interrupted, and the real-time ambient sound can be played instead; After adding a prompt tone on the basis of the currently playing sound, the real-time environmental sound can be superimposed and played to remind the wearer to pay attention.
  • the power of the earphone can also be checked, and then the state of the earphone is adjusted based on the circuit detection result, thereby ensuring the normal operation of the earphone. Specifically, when the power of one earphone is too lower than another, for example, the power difference exceeds 10%.
  • the host mobile phone, tablet, computer
  • the stored headset state information is switched, and the switched state is returned to the two headsets.
  • the master-slave headset status can be switched, which can avoid that when the power of the master headset with low power is insufficient, it will cause the other headset to suspend work at the same time, avoiding the pause of music playback and HFP (Hands-free Profile, Bluetooth transmission control, That is, the Bluetooth device can control the phone, such as answering, hanging up, rejecting, voice dialing, etc. The rejection and voice dialing depend on whether the Bluetooth headset and the phone support it. It is a settable mode for the Bluetooth headset to enter a high-fidelity call) 's suspension. It can avoid the problem that HFP transmission is interrupted and music playback is suspended when any one of the TWS earphones is insufficient in power, causing inconvenience to users.
  • HFP High-free Profile
  • an electronic device with a playback function collects real-time environmental sound; performs content detection on the real-time environmental sound to obtain the sound content; if the sound content matches the playback flag, the real-time environmental sound is played or the audio playback is turned off Function.
  • a play flag can be set in the electronic device in advance.
  • Electronic equipment captures real-time ambient sound. Then, content detection is performed on the real-time ambient sound to obtain the corresponding sound content.
  • the real-time ambient sound can be played or the audio playback function can be turned off. If the real-time ambient sound is played, the user of the electronic device can hear the played real-time ambient sound, avoiding the wrong and important ambient sound; if the audio playback function is turned off, the user of the electronic device can play the audio without the interference of the audio playback. to hear the ambient sound directly.
  • the electronic device collects and recognizes the ambient sound and plays the ambient sound under certain conditions or directly turns off the audio playback function, so that the earphone wearer can use the earphone normally. Ambient sounds corresponding to play marks can also be heard.
  • the embodiments of the present invention also provide corresponding improvement solutions.
  • the same steps or corresponding steps in the above-mentioned embodiments can be referred to each other, and corresponding beneficial effects can also be referred to each other, which will not be repeated in the preferred/improved embodiments herein.
  • the electronic device in order to better simulate the real real-time ambient sound, that is, in order to present the stereo effect (binaural effect), the electronic device may specifically include at least two acquisition ends of the spacer device.
  • the real-time ambient sound can be captured on both the left and right sides of the wearer, so that when the real-time ambient sound is played, the corresponding real-time ambient sound can be played in both earphones.
  • the real-time ambient sound includes the left real-time ambient sound and the right-side real-time ambient sound
  • the above step S101 may specifically include: using at least two acquisition terminals to collect the corresponding real-time ambient sound respectively; If the signs match, compare the audio sizes of the sound content collected by the two collection terminals respectively, and play the real-time ambient sound collected by the collection terminal corresponding to the high-frequency audio after the comparison is completed.
  • a sound direction judgment terminal can be specially set, and then the corresponding direction playback terminal can be opened according to the direction.
  • the playback terminal on the other side is used to play the sound of the high-frequency acquisition terminal, and the near terminal is paused, or the transparent mode is turned on or played, so that the effect of transmitting the remote environmental sound can be achieved.
  • the left microphone is used to collect the left real-time ambient sound
  • the right microphone is used to collect the right real-time ambient sound.
  • the collected left real-time ambient sound and right real-time ambient sound are both real-time ambient sounds.
  • the left earphone will play the left real-time ambient sound
  • the right earphone will play the real-time ambient sound, which can be close to the effect of listening to the ambient sound normally without wearing the earphone, which is helpful for the wearer to quickly locate the sound.
  • Source direction quick response.
  • the electronic device includes two playing ends corresponding to the two collecting ends. If the sound content matches the playing mark, the high-frequency audio corresponding to the playing end corresponding to the collecting end is used to play the high-frequency sound.
  • the audio corresponds to the real-time ambient sound collected by the collecting terminal.
  • step S103 if the sound content matches the playback flag, the real-time ambient sound is played, which may specifically include:
  • Step 1 If the sound content matches the playback sign, compare the audio size of the left real-time ambient sound and the right real-time ambient sound, and determine the high-frequency side from the left and right sides;
  • Step 2 Play the corresponding real-time ambient sound by using the earphone corresponding to the high-frequency side.
  • the left side is determined to be the high-frequency side; if the audio of the left real-time ambient sound is smaller than that of the right real-time ambient sound, then the right The side is the high-frequency side; if the audio of the left real-time ambient sound is the same as the right-side real-time ambient sound, the left or right side can be determined as the high-frequency side, and of course, both the left and right sides can be determined. for the high-frequency side.
  • the earphone corresponding to the high-frequency side can be directly used to play real-time ambient sound. Specifically, if the left side is the high audio side, the left earphone is used to play the left real-time ambient sound, and if the right side is the high audio side, the right earphone is used to play the right real-time ambient sound. In this way, the corresponding real-time ambient sound can be played only on the louder side, the binaural effect of the wearer can be basically maintained, and at the same time, another earphone can be used to continue playing the original sound.
  • the ambient sound playback exit operation may also be performed.
  • Ambient sound playback and exit operations including:
  • Step 1 Collect motion data
  • Step 2 using the motion data to identify the current motion state
  • Step 3 Stop playing the real-time ambient sound when the motion state change between the current motion state and the preset motion state satisfies the preset external sound stopping condition.
  • motion data can be collected by using motion sensors inside or outside the electronic device.
  • the motion sensor may specifically be a motion sensor integrated in the earphone, or may be a motion sensor worn by the earphone wearer alone.
  • the motion sensor may specifically be motion data capable of collecting subtle movements of the wearer, such as head rotation, or may be motion data of home appliances capable of collecting the wearer's range of motion, such as a treadmill.
  • the specific model and type of the motion sensor are not limited.
  • the motion data is acquired for identification, so as to obtain the current motion state. Specifically, how to identify the motion data can be identified according to the specific principle and usage scheme strategy of the specific motion sensor, which will not be repeated here.
  • the preset motion state may specifically be the motion state collected last time, or may specifically be a state corresponding to a set standard static posture (such as a standing state). Of course, you can also customize the settings according to actual needs, which will not be listed here.
  • the preset external sound stopping conditions may be specifically that the user's head rotation angle changes at +75 degrees or -75 degrees, and the head travels a distance of 2 meters away, the conversation is deemed to be over, and the real-time ambient sound is stopped; Conversely, it can be determined that the conversation is continuing, and the real-time ambient sound can continue to be played.
  • the ambient sound playback exit operation can also be performed, specifically including:
  • Step 1 Perform semantic recognition on the real-time environmental sound to obtain a semantic recognition result
  • Step 2 If the semantic recognition result includes a preset conversation end sign, stop playing the real-time ambient sound.
  • semantic recognition of the real-time environmental sound can be performed to obtain a semantic recognition result.
  • the semantic recognition result includes a preset conversation end sign (such as the preset words such as goodbye, Bye, bye, etc.)
  • the real-time ambient sound can be stopped immediately, or after a period of time (such as 5 seconds), the playback will be stopped. Real-time ambient sound and reverts to normal music playback or call sound playback.
  • a preset conversation end sign such as the preset words such as goodbye, Bye, bye, etc.
  • the state of playing the ambient sound can be automatically exited based on the user's dialogue content, and the state of the earphone can be freely switched according to the actual application situation.
  • the corresponding processing operation is to turn off the audio playback function.
  • the audio can be turned on. playback function.
  • the embodiments of the present invention also provide a sound playback device applied to an electronic device with a playback function.
  • the sound playback device described below and the sound playback method described above may refer to each other correspondingly.
  • the device includes the following modules:
  • the speech recognition module 102 is used to perform content detection on the real-time environmental sound to obtain the sound content
  • the playback control module 103 is configured to play the real-time ambient sound or disable the audio playback function if the sound content matches the playback flag.
  • the device provided by the embodiment of the present invention is applied to collect real-time environmental sound; content detection is performed on the real-time environmental sound to obtain the sound content; if the sound content matches the playback flag, the real-time environmental sound is played or the audio playback function is turned off.
  • a play flag can be set in the electronic device in advance.
  • Electronic equipment captures real-time ambient sound. Then, content detection is performed on the real-time ambient sound to obtain the corresponding sound content.
  • the real-time ambient sound can be played or the audio playback function can be turned off. If the real-time ambient sound is played, the user of the electronic device can hear the played real-time ambient sound, avoiding the wrong and important ambient sound; if the audio playback function is turned off, the user of the electronic device can play the audio without the interference of the audio playback. to hear the ambient sound directly. That is to say, the device and the electronic device can enable the wearer of the earphone to use the earphone normally while still using the earphone. Ambient sounds corresponding to play marks can be heard.
  • the playback sign is a semantic sign or a scene sign
  • the speech recognition module 102 is specifically configured to perform semantic recognition on real-time environmental sounds to obtain a semantic recognition result; or, perform scene recognition on real-time environmental sounds , get the scene recognition result;
  • the semantic recognition result or the scene recognition result is determined as the sound content.
  • the playback control module 103 is specifically configured to perform enhanced playback of the real-time ambient sound if the sound content matches the danger signal in the playback flag.
  • the electronic device includes at least two collection ends arranged at intervals, and the sound collection module 101 is specifically configured to use the at least two collection ends to collect corresponding real-time environmental sounds respectively;
  • the playback control module 103 is specifically configured to, if the sound content matches the playback flag, compare the audio sizes of the sound content collected by the two collection terminals respectively, and play the audio data collected by the collection terminal corresponding to the high-frequency audio after the comparison is completed. Real-time ambient sound.
  • the electronic device includes two playback terminals corresponding to the two acquisition terminals.
  • the playback control module 103 is specifically configured to use high-frequency audio to correspond to the acquisition terminal if the sound content matches the playback flag.
  • the corresponding playback end plays the real-time ambient sound collected by the corresponding acquisition end of the high-frequency audio. It also includes: after playing the real-time environmental sound, a state recovery module for collecting motion data; using the motion data to identify the current motion state; the motion state change between the current motion state and the preset motion state satisfies the preset external sound stop broadcasting condition, stop playing the real-time ambient sound.
  • a state recovery module which is specifically configured to perform semantic recognition on the real-time environmental sound after playing the real-time environmental sound, and obtain a semantic recognition result; if the semantic recognition result includes a preset conversation End the sign to stop playing the real-time ambient sound.
  • the embodiments of the present invention further provide an electronic device, and an electronic device described below and a sound playback method described above may refer to each other correspondingly.
  • the electronic device includes:
  • the collection terminal 301 is used to collect real-time ambient sound
  • the playback terminal 302 is used for playing sound, and the sound includes real-time ambient sound;
  • the processor 304 is configured to implement the steps of the sound playing method of the above method embodiments when executing the computer program.
  • the collecting end may be a device capable of collecting sound, such as a microphone, and the playing end may be a device having a sound playback function, such as a speaker.
  • the steps in the sound playing method described above can be implemented by the structure of the electronic device.
  • the embodiments of the present invention further provide a readable storage medium, and a readable storage medium described below and a sound playback method described above may refer to each other correspondingly.
  • a readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, implements the steps of the sound playing method in the above method embodiment.
  • the readable storage medium may specifically be a USB flash drive, a removable hard disk, a read-only memory (Read-Only Memory, ROM), a random access memory (Random Access Memory, RAM), a magnetic disk, or an optical disk, etc. Readable storage medium.
  • the steps of a method or algorithm described in connection with the embodiments disclosed herein may be directly implemented in hardware, a software module executed by a processor, or a combination of the two.
  • the software module can be placed in random access memory (RAM), internal memory, read only memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, hard disk, removable disk, CD-ROM, or any other in the technical field. in any other known form of storage medium.

Abstract

Certains modes de réalisation de la présente demande concernent un procédé et un appareil de lecture de sons, un dispositif électronique et un support de stockage lisible. Le procédé comprend : par un dispositif électronique qui a une fonction de lecture audio, la collecte d'un son environnemental en temps réel ; la réalisation d'une détection de contenu sur le son environnemental en temps réel pour obtenir un contenu sonore ; et si le contenu sonore correspond à une marque de lecture, la lecture du son environnemental en temps réel, ou la désactivation de la fonction de lecture audio. Dans le procédé, un dispositif électronique collecte et reconnaît un son environnemental et lit le son environnemental lorsqu'une condition spécifique est satisfaite, ou désactive une fonction de lecture audio de sorte qu'un utilisateur du dispositif électronique peut entendre le son environnemental correspondant à une marque de lecture lorsqu'il utilise le dispositif électronique.
PCT/CN2021/141559 2021-04-29 2021-12-27 Procédé et appareil de lecture de sons, dispositif électronique et support de stockage lisible WO2022227655A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110475845.6A CN113194383A (zh) 2021-04-29 2021-04-29 一种声音播放方法、装置、电子设备及可读存储介质
CN202110475845.6 2021-04-29

Publications (1)

Publication Number Publication Date
WO2022227655A1 true WO2022227655A1 (fr) 2022-11-03

Family

ID=76980813

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/141559 WO2022227655A1 (fr) 2021-04-29 2021-12-27 Procédé et appareil de lecture de sons, dispositif électronique et support de stockage lisible

Country Status (2)

Country Link
CN (1) CN113194383A (fr)
WO (1) WO2022227655A1 (fr)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113194383A (zh) * 2021-04-29 2021-07-30 歌尔科技有限公司 一种声音播放方法、装置、电子设备及可读存储介质
CN113766383A (zh) * 2021-09-08 2021-12-07 度小满科技(北京)有限公司 一种控制耳机静音的方法和装置
CN116055932B (zh) * 2022-08-12 2023-09-15 荣耀终端有限公司 一种耳机主副耳切换方法

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090138507A1 (en) * 2007-11-27 2009-05-28 International Business Machines Corporation Automated playback control for audio devices using environmental cues as indicators for automatically pausing audio playback
CN101840700A (zh) * 2010-04-28 2010-09-22 宇龙计算机通信科技(深圳)有限公司 基于移动终端的声音识别方法及移动终端
CN107438209A (zh) * 2016-05-27 2017-12-05 易音特电子株式会社 具有助听特征的有源降噪耳机设备
CN110689882A (zh) * 2018-07-04 2020-01-14 上海博泰悦臻网络技术服务有限公司 车辆及其播放设备和多媒体播放自动控制方法
CN113194383A (zh) * 2021-04-29 2021-07-30 歌尔科技有限公司 一种声音播放方法、装置、电子设备及可读存储介质

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103796125A (zh) * 2013-11-21 2014-05-14 广州视源电子科技股份有限公司 一种基于耳机播放的声音调节方法
CN106162413B (zh) * 2016-09-07 2019-11-19 合肥中感微电子有限公司 具体环境声音提醒模式的耳机装置
CN106601272B (zh) * 2016-11-24 2019-09-17 歌尔股份有限公司 耳机及其语音识别方法
CN107799117A (zh) * 2017-10-18 2018-03-13 倬韵科技(深圳)有限公司 识别关键信息以控制音频输出的方法、装置及音频设备
CN108391206A (zh) * 2018-03-30 2018-08-10 广东欧珀移动通信有限公司 信号处理方法、装置、终端、耳机及可读存储介质
CN110691300B (zh) * 2019-09-12 2022-07-19 连尚(新昌)网络科技有限公司 音频播放设备及用于提供信息的方法
CN110719545B (zh) * 2019-09-12 2022-11-08 连尚(新昌)网络科技有限公司 音频播放设备及用于播放音频的方法
CN111491236A (zh) * 2020-04-23 2020-08-04 歌尔科技有限公司 一种主动降噪耳机及其唤醒方法、装置及可读存储介质

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090138507A1 (en) * 2007-11-27 2009-05-28 International Business Machines Corporation Automated playback control for audio devices using environmental cues as indicators for automatically pausing audio playback
CN101840700A (zh) * 2010-04-28 2010-09-22 宇龙计算机通信科技(深圳)有限公司 基于移动终端的声音识别方法及移动终端
CN107438209A (zh) * 2016-05-27 2017-12-05 易音特电子株式会社 具有助听特征的有源降噪耳机设备
CN110689882A (zh) * 2018-07-04 2020-01-14 上海博泰悦臻网络技术服务有限公司 车辆及其播放设备和多媒体播放自动控制方法
CN113194383A (zh) * 2021-04-29 2021-07-30 歌尔科技有限公司 一种声音播放方法、装置、电子设备及可读存储介质

Also Published As

Publication number Publication date
CN113194383A (zh) 2021-07-30

Similar Documents

Publication Publication Date Title
WO2022227655A1 (fr) Procédé et appareil de lecture de sons, dispositif électronique et support de stockage lisible
US11294619B2 (en) Earphone software and hardware
CN105814913B (zh) 对名字敏感的收听装置
CN106162413B (zh) 具体环境声音提醒模式的耳机装置
KR102513461B1 (ko) 헤드폰 시스템
JP2017211640A (ja) 補聴器機能の能動騷音除去ヘッドセット装置
CN108551604B (zh) 一种降噪方法、降噪装置及降噪耳机
US20080089530A1 (en) Method and system for automatically muting headphones
EP2961195A2 (fr) Système et appareil ne-pas-déranger
WO2020019820A1 (fr) Procédé de détection d'obstruction de trou de microphone et produit associé
CN106210960A (zh) 具有本地通话情况确认模式的耳机装置
US11822367B2 (en) Method and system for adjusting sound playback to account for speech detection
WO2010131144A1 (fr) Procédé et appareil destinés à fournir des informations sur la source d'un son par l'intermédiaire d'un circuit audio
CN106170108A (zh) 具有分贝提醒模式的耳机装置
WO2020019857A1 (fr) Procédé de détection de blocage de trou de microphone et produit connexe
JP2023542968A (ja) 定位されたフィードバックによる聴力増強及びウェアラブルシステム
CN103106060A (zh) 计算机音量调节方法
CN104469587A (zh) 耳机
KR101693483B1 (ko) 헤드셋의 하울링 및 에코 제거 방법 및 컴퓨터 프로그램
WO2015030642A1 (fr) Réduction de volume pour un dispositif électronique
KR101693482B1 (ko) 하울링 및 에코 제거 기능을 갖는 헤드셋
WO2020019822A1 (fr) Procédé de détection de blocage de trou de microphone et produit associé
TWI665662B (zh) 可攜式電子裝置及音訊播放方法
CN105657605A (zh) 一种安全智能音响
TWI839798B (zh) 具有環境音感知功能的耳機、音頻系統及相關的方法

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21939114

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21939114

Country of ref document: EP

Kind code of ref document: A1