WO2022228377A1 - 录音方法、装置、电子设备和可读存储介质 - Google Patents

录音方法、装置、电子设备和可读存储介质 Download PDF

Info

Publication number
WO2022228377A1
WO2022228377A1 PCT/CN2022/088952 CN2022088952W WO2022228377A1 WO 2022228377 A1 WO2022228377 A1 WO 2022228377A1 CN 2022088952 W CN2022088952 W CN 2022088952W WO 2022228377 A1 WO2022228377 A1 WO 2022228377A1
Authority
WO
WIPO (PCT)
Prior art keywords
audio data
target
recording
application
input
Prior art date
Application number
PCT/CN2022/088952
Other languages
English (en)
French (fr)
Inventor
曹璟毅
Original Assignee
维沃移动通信(杭州)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 维沃移动通信(杭州)有限公司 filed Critical 维沃移动通信(杭州)有限公司
Publication of WO2022228377A1 publication Critical patent/WO2022228377A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/10Digital recording or reproducing
    • G11B20/10527Audio or video recording; Data buffering arrangements
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/10Digital recording or reproducing
    • G11B20/10527Audio or video recording; Data buffering arrangements
    • G11B2020/10537Audio or video recording
    • G11B2020/10546Audio or video recording specifically adapted for audio data

Definitions

  • the present application belongs to the technical field of recording, and specifically relates to a recording method, apparatus, electronic device and readable storage medium.
  • users can send and receive voice messages through social applications or play audio data through multimedia applications.
  • users can send and receive voice messages through social applications or play audio data through multimedia applications.
  • a user wants to play the audio data or voice confidence in the above application program, he needs to enter the corresponding application program, so as to play the audio content that the user wants to listen to.
  • the purpose of the embodiments of the present application is to provide a recording method, apparatus, electronic device, and readable storage medium, which can solve the problem that the audio data in the application cannot be played or stored in other ways, and the playback mode is single.
  • an embodiment of the present application provides a recording method, the method comprising:
  • the first recording file includes at least target audio data.
  • an embodiment of the present application provides a recording device, the device comprising:
  • a receiving module configured to receive the first input of the user when the audio data is collected by the first application
  • a processing module for playing the target audio data in the target application program in response to the first input received by the receiving module, and obtaining the first recording file through the first application program;
  • the first recording file includes at least target audio data.
  • an embodiment of the present application provides an electronic device, the electronic device includes a processor, a memory, and a program or instruction stored in the memory and executable on the processor.
  • the program or instruction is executed by the processor, the The steps of the method of the first aspect.
  • an embodiment of the present application provides a readable storage medium, where a program or an instruction is stored on the readable storage medium, and when the program or instruction is executed by a processor, the steps of the method of the first aspect are implemented.
  • an embodiment of the present application provides a chip, where the chip includes a processor and a communication interface, the communication interface is coupled to the processor, and the processor is configured to run programs or instructions to implement the method of the first aspect.
  • the electronic device when the electronic device collects audio data through the first application, the electronic device receives the user's first input (the first input is used to play the target audio data in the target application) Then, in response to the first input, play the target audio data, and obtain a first recording file (including at least the audio content of the target audio data) through the first application program.
  • a first application for example, an application with a recording function such as a voice recorder, a memo, etc.
  • the electronic device in response to the user's first input, the electronic device switches to another target application (for example: storage or The target audio data is played in the multimedia application program and social application program that plays the audio data.
  • the electronic device continues to record until the first recording file is obtained after the recording is completed.
  • FIG. 1 is a schematic diagram of a recording method according to an embodiment of the present application.
  • FIG 3 is the second schematic diagram of the operation of the recording method provided by the embodiment of the present application.
  • FIG. 5 is the fourth schematic diagram of the operation of the recording method provided by the embodiment of the present application.
  • FIG. 6 is the fifth schematic diagram of the operation of the recording method provided by the embodiment of the present application.
  • FIG. 7 is a sixth schematic diagram of the operation of the recording method provided by the embodiment of the present application.
  • FIG. 8 is a schematic structural diagram of a recording device provided by an embodiment of the present application.
  • FIG. 9 is one of the hardware schematic diagrams of the electronic device provided by the embodiment of the present application.
  • FIG. 10 is the second schematic diagram of the hardware of the electronic device provided by the embodiment of the present application.
  • first, second and the like in the description and claims of the present application are used to distinguish similar objects, and are not used to describe a specific order or sequence. It is to be understood that the data so used are interchangeable under appropriate circumstances so that the embodiments of the present application can be practiced in sequences other than those illustrated or described herein, and distinguish between “first”, “second”, etc.
  • the objects are usually of one type, and the number of objects is not limited.
  • the first object may be one or more than one.
  • “and/or” in the description and the claims indicates at least one of the connected objects, and the character “/" generally indicates that the contextual objects are in an "or” relationship.
  • identifiers in the embodiments of the present application are used to indicate words, symbols, images, etc. of information, and controls or other containers may be used as carriers for displaying information, including but not limited to text identifiers, symbol identifiers, and image identifiers.
  • an embodiment of the present application provides a recording method, and the method includes the following steps 101 and 102 .
  • Step 101 In the case of collecting audio data through the first application, the recording device receives the first input from the user.
  • the above-mentioned first application program is an application program with a recording function, such as an application program such as a voice recorder and a memo.
  • the recording device before the recording device receives the first input from the user, the recording device has already controlled the first application to be in the audio data collection state, so as to facilitate the playback of the target application when the user receives the first input. After the first input of the target audio data, it can respond to the first request in time, so that the recording device can completely collect the target audio data.
  • the recording device in the initial transition period between the time point when the above-mentioned recording device starts recording audio through the first application and the time point when the recording device starts to play the target audio data, if the recording device is recording to When the voice content of the user's speech is not recognized in the audio data, the recording device can automatically cut the audio data in the initial transition period to reduce the space occupied by the recording file and prevent the user from playing a long time in the subsequent playback process. Blank content for the period.
  • the above-mentioned first input is the input that the user triggers the recording apparatus to enter the target application program and trigger the playback of the target audio data.
  • the above-mentioned first input may be: the user's input on the screen of the recording device, or a specific gesture input by the user, which may be specifically determined according to actual usage requirements, which is not limited in this embodiment of the present invention.
  • the recording device responds to the user's input of a virtual play button of the target audio data (for example, a file in audio format only or a file in audio and video format) in the target application program through a touch device such as a finger or a stylus, or , in response to the user's input of voice information in a certain social application, the recording device will play the audio or video file, or play the voice information.
  • a virtual play button of the target audio data for example, a file in audio format only or a file in audio and video format
  • the recording device will play the audio or video file, or play the voice information.
  • Step 102 The recording device plays the target audio data in the target application program in response to the first input, and obtains the first recording file through the first application program.
  • the above-mentioned first recording file includes at least target audio data.
  • the above-mentioned recording device plays the target audio data in response to the first input, and the first application program records the target audio data as the first recording file.
  • the first application is in the background running state, and the first application starts to perform audio collection on the target audio data in the target application running in the foreground.
  • the recording device uses the cross-application recording function, the first application running in the background needs to obtain the authorization of the target application.
  • the descriptions are made on the premise that the first application program obtains the authorization of the target application program.
  • the recording function of the above-mentioned first application has been enabled, and the first application is running in the background.
  • the above-mentioned first input includes: a first sub-input for switching applications and a second sub-input for playing target audio data.
  • the recording device switches the current display interface of the recording device to the running interface of the target application by responding to the first sub-input, so as to collect audio data across applications, and at the same time, the recording device can respond to the second sub-input. , so as to find the target audio data in the target application program interface, and play the target audio data.
  • the user executes the first sub-input of switching the target application, and the recording device switches the display interface to the running interface of the target application in response to the first sub-input, so as to select audio data to be collected across applications.
  • the user finds the target audio data in the target application program interface by executing the second sub-input, and the recording device plays the target audio data in response to the second sub-input.
  • the above-mentioned first recording file may include all the audio content of the target audio data, and may also include part of the audio content of the target audio data.
  • the recording device can form the first recording file by performing the operation of ending the recording on the first application of the recording device.
  • the above-mentioned first recording file may further include environmental audio data recorded by the recording device.
  • the above-mentioned ambient audio data includes: the user's voice content, the soundtrack played by the user using other devices in order to match the scene of the audio content of the target audio data, and the like.
  • the first recording file may further include a target segment identifier, where the target segment identifier is used to indicate the target content of the first recording file.
  • the target content may be a summary of the stated content, a chorus part of a song, an interval paragraph between two target audio data, and the like.
  • the target segment identifier can be differentiated and displayed on the progress bar when playing the first recording file, so that the user can directly play the target content indicated by the target segment identifier.
  • the above-mentioned recording device may display the suspension indicating the first application in the running interface of the target application. controls.
  • a floating control 201 is displayed in the running interface of the target application. At this time, the floating control 201 displays that audio is being recorded, and the recording device can receive the first input from the user.
  • the way of playing the target audio data includes at least one of the following:
  • Mode 1 The recording device uses the earpiece to play the target audio data.
  • a general recording device in order to reduce the noise reduction effect during a voice call, a general recording device generally configures an earpiece microphone at the position of the earpiece. If the recording device detects that the earpiece microphone is configured, it can automatically use the earpiece to play the target audio data. Since the sound of the earpiece is lower than the playback volume of the external speaker, when the content of the target audio data to be played involves privacy, the privacy leakage caused by the playback of the target audio data by the external speaker can be significantly reduced.
  • the recording device when the recording device detects that the earpiece microphone is configured, in order to improve the file quality of the first recording file and reduce the noise in the first recording file, the recording device can control the first application to use the earpiece microphone to perform recording. Audio data collection.
  • Mode 2 The recording device plays the target audio data at the target volume, and compensates the volume for the corresponding audio content in the first recording file.
  • the target volume may be lower than the volume set by the user for playing audio data.
  • the volume for playing media files set by the user is volume 8
  • the target volume is suitable for playing at a volume lower than 8, so as to avoid leakage of private content in the target audio data.
  • the volume of the corresponding recording segment in the first recording file may be low, and the recording device may convert the audio content of the played target audio data into the first recording file.
  • the recording device may convert the audio content of the played target audio data into the first recording file.
  • the recording device is equivalent to expanding the target in the first recording file.
  • the volume of the voice content corresponding to the audio data 1 is not compensated for other target audio data played at normal volume, thereby reducing the possibility of leakage of private information and ensuring the file quality of the first recording file.
  • the recording apparatus may directly use Mode 2 to play the target audio data, and may also use Mode 2 to play the target audio data only when it is detected that the recording apparatus is not configured with an earpiece microphone.
  • the recording device may also use the playback methods of Mode 1 and Mode 2 in combination when it is detected that the recording device has been configured with an earpiece microphone.
  • Mode 3 The recording device plays the target audio data in mute, and acquires the audio stream in the target audio data.
  • the recording device uses the mode of silent playback to convert the target audio data into a data that can be stored by the first application program. Audio data format to avoid privacy leakage caused by audio content involving private content when playing the target audio data.
  • the above-mentioned recording apparatus can preferentially use the above-mentioned playback method.
  • the above target application is a social target application. Since the audio data stored in the social target application generally has strong privacy, when the user plays the target audio data in it, the recording device can record the target application. If it is a social target application, use the above method 1, method 2 and/or method 3 to play the target audio data.
  • the recording device is in the process of audio collection, and the floating control displays that audio is being recorded.
  • the recording device recognizes that the social target application is running in the foreground, and in response to the user's operation of playing the voice of "Zhang San", the recording device plays the voice of "Zhang San” in mode 3, and displays the voice of "Zhang San” in the interface of the target application.
  • the recording device is in the process of audio collection, and the floating control displays that audio is being recorded.
  • the recording device recognizes that the social target application is running in the foreground, and in response to the user's operation of playing the voice of "Li Si", the recording device plays the voice of "Zhang San” in the mode 1 combined with the playback mode 2, and in The interface of the target application shows: Recording privacy protection: The volume of the external playback has been intelligently adjusted; Recording privacy protection: Listening to the earpiece playing; Prompting the user to record the target audio data through the combination of privacy playback mode 1 and mode 2 , to protect user privacy.
  • step 102 is described by taking Mode 1, Mode 2 and/or Mode 3 playing the target audio data as an example, which does not limit the embodiments of the present application, and the recording device can also use the above three methods. Play the target audio data in other ways.
  • the recording device can record audio across applications, and further record the target audio data that cannot be exported in the target application as a first recording file, so that it is convenient for users to play the recorded first recording file by playing In order to obtain the audio content of the target audio data, the form diversity of the target audio data played by the recording device is improved.
  • the target audio data that cannot be exported in the target application program can be stored as the first recording file, and the user can share the audio content in the target audio data to other users by sharing the first recording file. , which breaks the target application's use of the audio data stored in it.
  • the recording device can play a plurality of first audio data, so that the user can splicing the audio content in the plurality of target audio data into a first audio data.
  • a recording file can be
  • the recording method provided by this embodiment of the present application may further include step 103 .
  • Step 103 During the process of switching from the first audio data to the second audio data for playback, the recording device identifies the audio content in the environmental audio data within the target time period.
  • the target period is: the period between the time point when the first audio data ends playing and the time point when the second audio data starts playing.
  • the above-mentioned first audio data and the second audio data may be audio data stored in the same target application program, and the user may play the first audio data through the first input, and then continue to play the second audio data in the same application program. audio data.
  • the floating control displays the recorded segment 1
  • the target application displays the playback interface of the second audio data.
  • the recording device plays the second audio data, and the recording segment 2 is displayed in the floating control in (b) in FIG. 4 .
  • the time period between when the first audio data is played and when the user clicks the play button in (a) in FIG. 4 is the target time period.
  • the first audio data and the second audio data may also be audio data stored in different target applications, and the user may switch to the second target application after playing the first audio data in the first target application.
  • the second audio data is played in the program.
  • the recording device still collects audio data through the first application program.
  • the user can freely choose the number of clips recorded in the first recording file, that is, the audio content of multiple target audio data can be simultaneously recorded at one time. in the same target application.
  • the time period between the time point when the first audio data ends playing and the time point when the second audio data starts playing may be referred to as a target time period.
  • the target period includes the interval period between any two target audio data played in the same target application, and the target period may also include the recording device switching to the second application after playing the first audio data in the first target application. Interval period for playing the second audio data.
  • the recording apparatus can identify the audio content of the environmental audio data within the target time period, and use the audio content to mark the first recording file, so as to perform personalized processing on the first recording file.
  • the recording apparatus may set the target segment identifier of the target content in the first recording file according to the identified audio content in the environmental audio data. For example, the recording device recognizes that the audio content in the environmental audio data within the target time period is "If you want to listen to dry stuff, please start listening from here.” The recording device will process the first recording file according to the audio content, so that the first recording file is playing The target segment identifier is displayed in the corresponding part of the progress bar at the time, so that the user can directly play the target content indicated by the target segment identifier.
  • the recording device continues to record the environmental audio data, the user can speak in the target time period, and the user's speech is recorded as a transition between playing the two target audio data. in the first recording file.
  • the audio recording device uses the recognized audio content in the environmental audio data to mark the first audio recording file, which can facilitate the user to perform personalized processing for the first audio recording file.
  • the recording method provided in this embodiment of the present application may further include step 104 .
  • Step 104 The recording device adds a target tag to the first recording file.
  • the target tag is determined based on the target audio data or the audio content in the ambient audio data.
  • the above-mentioned recording device may add a target tag to the first recording file after recording the first recording file, so as to facilitate the user to classify the first recording file, so that subsequent users can quickly search for the first recording file that they want to play.
  • a recording file may be added.
  • the above-mentioned target tag may be determined based on the content of the target audio data. For example, when the recording device plays the target audio data, the recording device performs speech recognition on the content of the target audio data through the speech recognition device. The recording device extracts a plurality of keywords according to the recognition result of the speech content of the target audio data by the speech recognition device, so that the user can set a label for the first recording file.
  • the above-mentioned target tag may be determined based on the content of the ambient audio data.
  • the recording device can perform speech recognition on the voice content within the target time period through the voice recognition device, and the recording device extracts a plurality of keywords from the recognition result of the voice content within the target time period for the user to set labels on the first recording file .
  • the speech recognition device in the recording device may perform speech recognition on the environmental audio data.
  • the speech recognition device may include an acoustic model, a dictionary module, a language model and a decoding module.
  • the recording device may perform feature extraction processing on the environmental audio data, and then input the extracted features into the acoustic model, the dictionary module and the language model to obtain A plurality of probability values, so that the recording device can perform speech recognition on the environmental audio data according to the plurality of probability values and the decoding module.
  • the recording device can add a custom label to the first recording file in the target application program interface, or can switch back to the first application program to add a custom label to the first recording file.
  • the recording device switches to the running interface of the first application, and the suspension control is switched to end recording and Ready for the next recording.
  • the recording device pops up and displays the control "Add Label” for the user to select a custom label for the first recording file.
  • the multiple keywords "sweet time”, “student times”, and “high three and ninth class” extracted from the recognition results of the voice content in the target period are added to the first recording file according to the user's input of the above keywords.
  • Custom tags i.e. target tags above).
  • the recording device can use the user's voice content in the target period as a tag keyword, so that the user can add a custom label to the first recording file, so that the user can add a custom label to the first recording file.
  • the classification is performed so that subsequent users can quickly search for the first recording file that they want to play.
  • the recording method provided in this embodiment of the present application may further include step 105 .
  • Step 105 The recording device displays the filter function control, and processes the first recording file in response to the user's selection of the filter type.
  • the recording device displays a filter function control, and the filter function control includes an “AI smart setting” control and a “manual selection” control. If the user inputs an "AI smart setting” control, the recording device automatically matches a sound filter for the first recording file according to the audio content of the target audio data in response to the operation.
  • a sound filter is configured for the first recording file, so that when the first recording file is played, it sounds more layered, more three-dimensional and has sound texture.
  • the above-mentioned process of processing the first recording file according to the filter type may be performed after the recording of the first recording file is completed, and the recording device switches back to the first application program.
  • the recording method provided by this embodiment of the present application may further include steps 106 and 107 .
  • Step 106 The recording device displays a first target identifier, where the first target identifier is used to indicate the playback order of the N target audio data.
  • N is a positive integer.
  • the recording device in the process of recording the first recording file by the recording device, the recording device will play N pieces of target audio data in succession, then the above-mentioned recording device may display the first target identification, so that the user can understand the recording process.
  • the first target identifier is used to indicate the recording sequence of the N target audio data.
  • the above-mentioned first target identifier may be displayed in a floating control indicating the first application.
  • Step 107 When starting to play the i-th target audio data, the recording device updates the first target identifier once.
  • i is a positive integer, i ⁇ N.
  • the recording apparatus can continuously update the first target identifier, so as to facilitate the user to understand the recording process.
  • the first target identifier is updated and displayed from segment 0 to “segment 1”, until the user clicks to play the second target audio data.
  • the recording device starts to play the next target audio data.
  • the recording apparatus updates and displays the first target identifier from "segment 1" to "segment 2".
  • the display form of the above-mentioned first target identification may be Chinese characters combined with numbers, or the first target identification may only be displayed in the form of numbers.
  • the recording device can display the segment sequence number of the target audio data, which is helpful for prompting the user of the quantity of the target audio data that has been recorded in the first recording file, helping the user to know the recording process at any time, so as to control the recording process.
  • the file duration of the first recording file is helpful for prompting the user of the quantity of the target audio data that has been recorded in the first recording file, helping the user to know the recording process at any time, so as to control the recording process.
  • the recording method provided in this embodiment of the present application may further include step 107 , step 108 , and step 109 .
  • Step 107 The recording device receives the second input.
  • the above-mentioned second input is an input for the user to trigger the recording apparatus to use the first application to start collecting audio data.
  • Step 108 The recording device displays the second target identifier in response to the second input.
  • the second target identifier is used to indicate: record audio in the application.
  • the recording device displays a second target identification, and the second target identification is used to indicate that the recording application is in the recording mode.
  • the audio recording function that is, the cross-application recording function in this embodiment of the present application.
  • the display of the recording device may also display other signs, and the other signs may be used to indicate “recording” or “recording to text”.
  • Step 109 the recording device receives a third input of the second target identifier by the user, and in response to the third input, the recording device controls the first application to start audio data collection.
  • the recording device only after the user triggers the second target identifier, the recording device enters the function of cross-application recording, and can prepare to start recording the target audio data in the target application.
  • the recording device in response to the third input, displays in the operation interface of the first application "The suspension control has been turned on. After switching to the application that needs to be recorded, click the suspension microphone. to start recording".
  • the recording device does not enable the cross-application recording function, that is, the recording device does not enable the function of recording the target audio data in the target application program through the first reference program.
  • the recording device provides the user with multiple recording scenarios using the first application, and the cross-application recording function is enabled only when the user chooses to use the cross-application recording, which can meet the different recording needs of the user.
  • the execution subject may be a recording device, or a control module in the recording device for executing the recording method.
  • the recording device provided by the embodiment of the present application is described by taking the recording method performed by the recording device as an example.
  • an embodiment of the present application provides a recording apparatus 800 .
  • the recording device includes a receiving module 801 and a processing module 802 .
  • a receiving module 801 configured to receive a first input from a user in the case of collecting audio data through a first application
  • the processing module 802 is configured to play the target audio data in the target application program in response to the first input received by the receiving module 801, and obtain the first recording file through the first application program; wherein, the first recording file includes at least the target audio data.
  • the processing module 802 is further configured to identify the audio content in the environmental audio data within the target time period during the process of switching from the first audio data to the second audio data for playback; wherein, The target period is: the period between the time point when the first audio data ends playing and the time point when the second audio data starts playing.
  • the processing module 802 is further configured to add a target tag to the first audio recording file, wherein the target tag is determined based on the target audio data or audio content in the environmental audio data.
  • the number of target audio data is N
  • the recording device 800 further includes: a display module 803, configured to display a first target identifier, and the first target identifier is used to indicate the number of the N target audio data. Play order.
  • the processing module 802 is further configured to update the first target identifier when starting to play the i-th target audio data.
  • the processing module 803 is further configured to use the earpiece to play the target audio data; or play the target audio data at the target volume, and compensate the volume for the corresponding audio content in the first recording file; or play it in silence target audio data, and get the audio stream in the target audio data.
  • the receiving module 801 is further configured to receive a second input from the user, and the display module 803 is further configured to display a second target identifier in response to the second input, and the second target identifier is used to indicate : record audio in the application; the receiving module 801 is further configured to receive a third input from the user to the second target identifier; the processing module 802 is further configured to control the first application to start audio data collection in response to the third input.
  • An embodiment of the present application provides a recording device.
  • the recording device collects audio data through a first application
  • the recording device receives a first input from a user (the first input is used to play the target audio in the target application). data), in response to the first input, play the target audio data, and obtain a first recording file (including at least the audio content of the target audio data) through the first application program.
  • the recording device records through the first application (for example, an application with a recording function such as a recorder, a memo, etc.)
  • the recording device switches to other target applications (for example: storage or
  • the target audio data is played in the multimedia application program and social application program that plays the audio data.
  • the recording device continues to record until the first recording file is obtained after the recording is completed.
  • the recording device can realize cross-application recording audio, and then can record the target audio data that cannot be exported in the target application program as the first recording file, which is convenient for the user to obtain the audio of the target audio data by playing the recorded first recording file. content, which improves the form diversity of the target audio data played by the recording device.
  • the recording device in this embodiment of the present application may be a device, or may be a component, an integrated circuit, or a chip in a terminal.
  • the apparatus may be a mobile electronic device or a non-mobile electronic device.
  • the mobile electronic device may be a mobile phone, a tablet computer, a notebook computer, a palmtop computer, an in-vehicle electronic device, a wearable device, an ultra-mobile personal computer (UMPC), a netbook, or a personal digital assistant (personal digital assistant).
  • UMPC ultra-mobile personal computer
  • PDA personal digital assistant
  • non-mobile electronic devices can be servers, network attached storage (Network Attached Storage, NAS), personal computer (personal computer, PC), television (television, TV), teller machine or self-service machine, etc., this application Examples are not specifically limited.
  • the recording device in the embodiment of the present application may be a device having an operating system.
  • the operating system may be an Android (Android) operating system, an ios operating system, or other possible operating systems, which are not specifically limited in the embodiments of the present application.
  • the recording apparatus provided in the embodiment of the present application can implement each process implemented by the method embodiments in FIG. 1 to FIG. 7 , and to avoid repetition, details are not repeated here.
  • an embodiment of the present application further provides an electronic device 900, including a processor 901, a memory 902, a program or instruction stored in the memory 902 and executable on the processor 901, the program Or, when the instruction is executed by the processor 901, each process of the above-mentioned recording method embodiment can be implemented, and the same technical effect can be achieved. To avoid repetition, details are not repeated here.
  • the electronic devices in the embodiments of the present application include the above-mentioned mobile electronic devices and non-mobile electronic devices.
  • FIG. 10 is a schematic diagram of a hardware structure of an electronic device implementing an embodiment of the present application.
  • the electronic device 1000 includes but is not limited to: a radio frequency unit 1001, a network module 1002, an audio output unit 1003, an input unit 1004, a sensor 1005, a display unit 1006, a user input unit 107, an interface unit 1008, a memory 1009, and a processor 1010, etc. part.
  • the electronic device 1000 may also include a power source (such as a battery) for supplying power to various components, and the power source may be logically connected to the processor 1010 through a power management system, so that the power management system can manage charging, discharging, and power functions. consumption management and other functions.
  • a power source such as a battery
  • the power management system can manage charging, discharging, and power functions. consumption management and other functions.
  • the structure of the electronic device shown in FIG. 10 does not constitute a limitation on the electronic device, and the electronic device may include more or less components than the one shown, or combine some components, or arrange different components, which will not be repeated here. .
  • the user input unit 1007 is configured to receive the first input of the user in the case of collecting audio data through the first application program.
  • the processor 1010 is configured to play the target audio data in the target application program in response to the first input received by the user input unit 1007, and obtain the first recording file through the first application program.
  • the first recording file includes at least the audio content of the target audio data.
  • the target audio data includes first audio data and second audio data
  • the processor 1010 is specifically configured to, in the process of switching from the first audio data to the second audio data for playback, identify the ambient audio in the target time period.
  • the audio content in the data; wherein, the target period is: the period between the time point when the first audio data ends playing and the time point when the second audio data starts playing.
  • the processor 1010 is further configured to add a target tag to the first recording file; wherein, the target tag is determined based on the target audio data or the audio content in the ambient audio data.
  • the first recording file includes N pieces of target audio data
  • the display unit 1006 is used to display the first target mark, and the first target mark is used to indicate the recording sequence of the N pieces of target audio data
  • the processor 1010 is also used for When starting to play the ith target audio data, the first target identifier is updated once, where N and i are positive integers, and i ⁇ N.
  • the processor 1010 is specifically further configured to: use the earpiece to play the target audio data; or play the target audio data at the target volume, and compensate the volume for the corresponding audio content in the first recording file; or play the target audio data muted, and Get the audio stream in the target audio data.
  • the user input unit 1007 is further configured to receive a second input from the user, and the display unit 1006 is further configured to display a second target identifier in response to the second input, and the second target identifier is used to indicate: in the recording application
  • the user input unit 1007 is further configured to receive the user's third input of the second target identifier; the processor 1010 is further configured to control the first application to start audio data collection in response to the third input.
  • the electronic device when the electronic device collects audio data through the first application, the electronic device receives the user's first input (the first input is used to play the target audio data in the target application) Then, in response to the first input, the target audio data is played, and a first recording file (including at least the audio content of the target audio data) is obtained through the first application program.
  • a first application for example, an application with a recording function such as a voice recorder, a memo, etc.
  • the electronic device in response to the user's first input, the electronic device switches to another target application (for example: storage or The target audio data is played in the multimedia application program and social application program that plays the audio data.
  • the electronic device continues to record until the first recording file is obtained after the recording is completed.
  • the electronic device can realize cross-application recording audio, and then can record the target audio data that cannot be exported in the target application program as the first recording file, so that the user can obtain the audio of the target audio data by playing the recorded first recording file. content, which improves the form diversity of the target audio data played by the electronic device.
  • the input unit 1004 may include a graphics processor (Graphics Processing Unit, GPU) 10041 and a microphone 10042. Such as camera) to obtain still pictures or video image data for processing.
  • the display unit 1006 may include a display panel 10061, which may be configured in the form of a liquid crystal display, an organic light emitting diode, or the like.
  • the user input unit 1007 includes a touch panel 10071 and other input devices 10072 .
  • the touch panel 10071 is also called a touch screen.
  • the touch panel 10071 may include two parts, a touch detection device and a touch controller.
  • Other input devices 10072 may include, but are not limited to, physical keyboards, function keys (such as volume control keys, switch keys, etc.), trackballs, mice, and joysticks, which will not be repeated here.
  • Memory 1009 may be used to store software programs as well as various data, including but not limited to application programs and operating systems.
  • the processor 1010 may integrate an application processor and a modem processor, wherein the application processor mainly processes the operating system, user interface, and application programs, and the like, and the modem processor mainly processes wireless communication. It can be understood that, the above-mentioned modulation and demodulation processor may not be integrated into the processor 1010.
  • the embodiments of the present application further provide a readable storage medium, where a program or an instruction is stored on the readable storage medium, and when the program or instruction is executed by a processor, each process of the above-mentioned recording method embodiment can be realized, and the same technical effect can be achieved , in order to avoid repetition, it will not be repeated here.
  • the processor is the processor in the electronic device in the above embodiment.
  • the readable storage medium includes a computer-readable storage medium, such as a computer read-only memory (Read-Only Memory, ROM), a random access memory (Random Access Memory, RAM), a magnetic disk or an optical disk, and the like.
  • An embodiment of the present application further provides a chip, the chip includes a processor and a communication interface, the communication interface and the processor are coupled, and the processor is used for running a program or an instruction to implement the various processes of the above recording method embodiments, and can achieve the same technology The effect, in order to avoid repetition, is not repeated here.
  • the chip mentioned in the embodiments of the present application may also be referred to as a system-on-chip, a system-on-chip, a system-on-a-chip, or a system-on-a-chip, or the like.

Abstract

本申请公开了一种录音方法、装置、电子设备和可读存储介质,该方法包括:在通过第一应用程序进行音频数据采集的情况下,接收用户的第一输入;响应于第一输入,播放目标应用程序中的目标音频数据,并通过第一应用程序得到第一录音文件;其中,第一录音文件至少包括目标音频数据。

Description

录音方法、装置、电子设备和可读存储介质
相关申请的交叉引用
本申请主张在2021年04月26日在中国提交的中国专利申请号202110455989.5的优先权,其全部内容通过引用包含于此。
技术领域
本申请属于录音技术领域,具体涉及一种录音方法、装置、电子设备和可读存储介质。
背景技术
随着通信技术的高速发展,电子设备的应用越来越广泛,用户对电子设备的性能要求也越来越高。
目前,用户可以通过社交类应用程序收发语音信息或者通过多媒体应用程序播放音频数据。通常,用户想要播放上述应用程序中的音频数据或者语音信心时,需要进入到相应的应用程序内,从而播放用户希望收听的音频内容。
然而,由于语音信息或者音频数据通常无法直接从上述应用程序中导出,用户希望收听上述应用程序中的音频内容时,需要通过上述播放过程,导致电子设备播放音频数据的受约束程度较高,播放应用程序内的音频数据的方式比较单一。
发明内容
本申请实施例的目的是提供一种录音方法、装置、电子设备和可读存储介质,能够解决应用程序内的音频数据无法通过其他方式播放或者存储,播放方式单一的问题。
第一方面,本申请实施例提供了一种录音方法,该方法包括:
在通过第一应用程序进行音频数据采集的情况下,接收用户的第一输入;
响应于第一输入,播放目标应用程序中的目标音频数据,并通过第一应用程序得到第一录音文件;
其中,第一录音文件至少包括目标音频数据。
第二方面,本申请实施例提供了一种录音装置,该装置包括:
接收模块,用于在通过第一应用程序进行音频数据采集的情况下,接收用户的第一输入;
处理模块,用于响应于接收模块接收到的第一输入,播放目标应用程序中的目标音频数据,并通过第一应用程序得到第一录音文件;
其中,第一录音文件至少包括目标音频数据。
第三方面,本申请实施例提供了一种电子设备,该电子设备包括处理器、存储器 及存储在存储器上并可在处理器上运行的程序或指令,程序或指令被处理器执行时实现如第一方面的方法的步骤。
第四方面,本申请实施例提供了一种可读存储介质,可读存储介质上存储程序或指令,程序或指令被处理器执行时实现如第一方面的方法的步骤。
第五方面,本申请实施例提供了一种芯片,芯片包括处理器和通信接口,通信接口和处理器耦合,处理器用于运行程序或指令,实现如第一方面的方法。
在本申请实施例中,在电子设备通过第一应用程序进行音频数据采集的情况下,电子设备在接收到用户的第一输入(该第一输入用于播放目标应用程序中的目标音频数据)后,响应于该第一输入,播放该目标音频数据,并通过该第一应用程序得到第一录音文件(至少包括该目标音频数据的音频内容)。如此,在电子设备通过第一应用程序(例如:录音机、备忘录等具有录音功能的应用程序)录音的情况下,响应于用户的第一输入,电子设备切换至其他目标应用程序(例如:存储或者播放音频数据的多媒体应用程序和社交应用程序)中播放目标音频数据,在此过程中,电子设备持续进行录音,直至录音完成得到第一录音文件。
附图说明
图1为本申请实施例提供的一种录音方法的示意图;
图2为本申请实施例提供的录音方法的操作示意图之一;
图3为本申请实施例提供的录音方法的操作示意图之二;
图4为本申请实施例提供的录音方法的操作示意图之三;
图5为本申请实施例提供的录音方法的操作示意图之四;
图6为本申请实施例提供的录音方法的操作示意图之五;
图7为本申请实施例提供的录音方法的操作示意图之六;
图8为本申请实施例提供的录音装置的结构示意图;
图9为本申请实施例提供的电子设备的硬件示意图之一;
图10为本申请实施例提供的电子设备的硬件示意图之二。
具体实施方式
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员获得的所有其他实施例,都属于本申请保护的范围。
本申请的说明书和权利要求书中的术语“第一”、“第二”等是用于区别类似的对象,而不用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便本申请的实施例能够以除了在这里图示或描述的那些以外的顺序实施,且“第一”、“第二”等所区分的对象通常为一类,并不限定对象的个数,例如第一对象可以是一个,也可以是多个。此外,说明书以及权利要求中“和/或”表示所连接对象的至少其中之一, 字符“/”,一般表示前后关联对象是一种“或”的关系。
需要说明的是,本申请实施例中的标识用于指示信息的文字、符号、图像等,可以以控件或者其他容器作为显示信息的载体,包括但不限于文字标识、符号标识、图像标识。
下面结合附图,通过具体的实施例及其应用场景对本申请实施例提供的录音方法进行详细地说明。
如图1所示,本申请实施例提供一种录音方法,该方法包括下述的步骤101和步骤102。
步骤101、在通过第一应用程序进行音频数据采集的情况下,录音装置接收用户的第一输入。
本申请实施例中,上述第一应用程序为具有录音功能的应用程序,如录音机、备忘录等应用程序。
需要说明的是,本申请实施例中,在录音装置接收用户的第一输入之前,录音装置就已经控制第一应用程序处于音频数据采集状态,以便于在接收到用户用于播放目标应用程序中的目标音频数据的第一输入以后,能够及时响应该第一请求,以使录音装置可以完整的采集目标音频数据。
可选地,本申请实施例中,上述录音装置开始通过第一应用程序录制音频的时间点、到录音装置开始播放目标音频数据的时间点之间的初始过渡时段中,若录音装置在录制到的音频数据中未识别到用户讲话的语音内容时,录音装置可以自动将初始过度时段内的音频数据进行剪切,以减小录音文件的占用空间,也避免用户在后续播放过程中播放较长时段的空白内容。
示例性的,上述第一输入为用户触发录音装置通过进入目标应用程序并触发播放目标音频数据的输入。在一种示例中,上述第一输入可以为:用户在录音装置的屏幕上的输入,或者,用户输入的特定手势,具体的可以根据实际使用需求确定,本发明实施例不作限定。通常,在录音装置响应于用户通过手指或者触控笔等触控装置对目标应用程序中的目标音频数据(例如仅为音频格式的文件或音视频格式的文件)的虚拟播放按键的输入,或者,响应于用户对某个社交类应用程序中的语音信息的输入,录音装置会播放该音频或者视频文件,或者播放该语音信息。
步骤102、录音装置响应于第一输入,播放目标应用程序中的目标音频数据,并通过第一应用程序得到第一录音文件。
其中,上述第一录音文件至少包括目标音频数据。
本申请实施例中,上述录音装置响应于第一输入,播放目标音频数据,第一应用程序录制该目标音频数据为第一录音文件。
需要说明的是,在录音装置接收到第一输入后,第一应用程序处于后台运行状态,第一应用程序开始对处于前台运行的目标应用程序中的目标音频数据进行音频采集。录音装置使用跨应用录音的功能,需要使处于后台运行的第一应用程序获得目标应用程序的授权。本申请实施例中,均以第一应用程序获得目标应用程序的授权的前提下进行的说明。
示例性地,在本申请实施例中,上述第一应用程序的录音功能已经开启,第一应用程序处于后台运行过程中。
示例性的,上述第一输入包括:用于切换应用程序的第一子输入和用于播放目标音频数据的第二子输入。进一步的,录音装置通过响应于第一子输入,从而将录音装置当前的显示界面切换为目标应用程序的运行界面,以实现跨应用采集音频数据,同时,录音装置可以通过响应于第二子输入,从而在在该目标应用程序界面内找到目标音频数据,并播放该目标音频数据。
用户通过执行切换目标应用程序的第一子输入,录音装置响应于第一子输入将显示界面切换至目标应用程序的运行界面,实现跨应用选择需要采集的音频数据。用户通过执行第二子输入在该目标应用程序界面内找到目标音频数据,录音装置响应于该第二子输入播放该目标音频数据。
可以理解的是,上述第一录音文件中可以包括目标音频数据的全部音频内容,也可以包括目标音频数据的部分音频内容。当用户播放上述目标音频数据之后,即便目标音频数据未全部播放完而用户希望能够结束录音时,可以通过对录音装置的第一应用程序进行结束录制的操作,录音装置即形成第一录音文件。
可选地,上述第一录音文件中还可以包括录音装置录制的环境音频数据。示例性的,上述环境音频数据包括:用户的语音内容、用户为了配合目标音频数据的音频内容的场景而使用其他设备播放的配乐等。
可选地,上述第一录音文件中还可以包括目标片段标识,该目标片段标识用于指示第一录音文件的目标内容。例如:目标内容可以为陈述内容的总结、歌曲的副歌部分、两个目标音频数据中间的间隔段落等。该目标片段标识可以区别显示于播放第一录音文件时的进度条上,以便于用户可以直接播放该目标片段标识所指示的目标内容。
可选地,为了提示用户录音装置处于音频数据采集的状态,也便于用户对第一应用程序的录音功能进行操作,上述录音装置可以在目标应用程序的运行界面内显示指示第一应用程序的悬浮控件。示例性的,如图2所示,在目标应用程序的运行界面内,显示悬浮控件201。此时,悬浮控件201中显示正在录制音频,录音装置可以接收用户的第一输入。
可选地,播放目标音频数据的方式包括以下至少一项:
方式1、录音装置使用听筒播放目标音频数据。
需要说明的是,一般的录音装置为了降低语音通话过程中的降噪效果,一般会在听筒的位置配置听筒麦克风。如果录音装置检测到配置了听筒麦克风,则可以自动使用听筒播放目标音频数据。由于听筒的声音相较于外放扬声器的播放音量更低,在播放的目标音频数据的内容涉及隐私时,能够显著降低外放扬声器播放目标音频数据造成的隐私泄露。
示例性的,当录装置在检测到配置了听筒麦克风的情况下,为了能够提高第一录音文件的文件质量,减少第一录音文件中的噪声,录音装置可以控制第一应用程序使用听筒麦克风进行音频数据采集。
方式2、录音装置以目标音量播放目标音频数据,并对第一录音文件中相应的音频内容补偿音量。
在上述方式中,上述目标音量可以低于用户设定的播放音频数据的音量。示例性的,用户设定的播放媒体类文件的音量为音量8,目标音量适宜以较低于8的音量进行播放,以避免目标音频数据中的隐私性的内容泄露。
另外,录音装置以目标音量播放目标音频数据之后,可能会导致第一录音文件中相应的录音片段的音量较小,录音装置可以在将播放的目标音频数据的音频内容转化为第一录音文件的过程中进行音量放大补偿。示例性的,用户播放了多个目标音频数据,只有在播放目标音频数据1时是以目标音量播放的,则在形成第一录音文件的过程中,录音装置等效扩大第一录音文件中目标音频数据1对应的语音内容的音量,而对其他正常音量播放的目标音频数据不予补偿,从而降低了隐私信息泄露的可能,也保证了第一录音文件的文件质量。
示例性的,录音装置可以直接使用方式2播放目标音频数据,也可以在检测录音装置未配置听筒麦克风的情况下,才使用方式2播放目标音频数据。录音装置也可以在检测录音装置已配置听筒麦克风的情况下,结合使用方式1和方式2的播放方式。
方式3、录音装置静音播放目标音频数据,并获取目标音频数据中的音频流。
需要说明的是,如果目标应用程序支持第一应用程序进行音频流数据捕获,则在播放该目标音频数据时,录音装置使用静音播放的方式,将目标音频数据转化为第一应用程序可以存储的音频数据格式,以避免造成播放目标音频数据时的音频内容涉及隐私内容而造成的隐私泄露。
如此,上述录音装置在目标应用程序开放读取音频流数据时,可以优先选用上述播放方式。
可以理解的是,上述目标应用程序为社交类目标应用程序,由于社交类目标应用程序中存储的音频数据一般隐私性较强,在用户播放其中的目标音频数据时,录音装置可以对目标应用程序的类型进行识别,如果是社交类目标应用程序则使用上述方式1、方式2和/或方式3播放目标音频数据。
示例性的,如图2所示,录音装置处于音频采集的过程中,悬浮控件显示正在录制音频。此时,录音装置识别到社交类目标应用程序正在前台运行,录音装置响应于用户播放“张三”的语音的操作,以方式3播放“张三”的语音,并在目标应用程序的界面中显示:录音隐私保护,静音录制中,以提示用户虽然没有播放目标音频数据,但是仍在录音过程中,并且录音装置通过静音录制的方式保护用户隐私。
示例性的,如图3所示,录音装置处于音频采集的过程中,悬浮控件显示正在录制音频。此时,录音装置识别到社交类目标应用程序正在前台运行,录音装置响应于用户播放“李四”的语音的操作,以方式1结合方式2的播放方式播放“张三”的语音,并在目标应用程序的界面中显示:录音隐私保护:已智能调节外放音量;录音隐私保护:听过听筒 播放中;以提示用户录音装置通过隐私播放方式1和方式2结合使用的方式录制目标音频数据,以保护用户隐私。
需要说明的是,上述步骤102是以通过方式1、方式2和/或方式3播放目标音频数据为例进行说明的,其并不对本申请实施例形成限定,录音装置也可以通过上述三种方式之外的其他方式播放目标音频数据。
本申请实施例提供的录音方法,录音装置可以实现跨应用程序录制音频,进而可以将目标应用程序中的无法导出的目标音频数据录制为第一录音文件,方便用户通过播放录制的第一录音文件以获得目标音频数据的音频内容,提高了录音装置播放目标音频数据的形式多样性。此外,通过上述方法可以将存储在目标应用程序中无法导出的目标音频数据作为第一录音文件进行存储,用户可以通过分享该第一录音文件的方式将目标音频数据中的音频内容分享给其他用户,突破了目标应用程序对其中存储的音频数据的使用限制。
可选地,在目标音频数据包括第一音频数据和第二音频数据的情况下,录音装置可以播放多个第一音频数据,以供用户将多个目标音频数据中的音频内容拼接为一个第一录音文件。
示例性地,在上述步骤102之后,本申请实施例提供的录音方法还可以包括步骤103。
步骤103、录音装置在从第一音频数据切换至第二音频数据进行播放的过程中,识别目标时段内的环境音频数据中的音频内容。
其中,目标时段为:第一音频数据的结束播放的时间点至第二音频数据的开始播放的时间点间的时段。
示例性的,上述第一音频数据和第二音频数据可以是同一个目标应用程序中存储的音频数据,用户可以通过第一输入播放第一音频数据之后,再同一个应用程序中继续播放第二音频数据。如图4中的(a)所示,在录音装置已经播放第一音频数据之后,悬浮控件中显示已经录制片段1,目标应用程序显示的是第二音频数据的播放界面,此时,如用户点击图4中的(a)中的播放键时,录音装置播放该第二音频数据,并且在图4中的(b)中的悬浮控件中显示正在录制片段2。在播放完成第一音频数据,到用户点击图4中的(a)中的播放键之间的时段即为目标时段。
示例性的,第一音频数据和第二音频数据也可以是不同的目标应用程序中存储的音频数据,用户可以在第一目标应用程序中播放完第一音频数据之后,切换至第二目标应用程序中播放第二音频数据。在切换应用程序的过程中,录音装置仍然通过第一应用程序进行音频数据采集。
可以理解的是,用户可以自由选择第一录音文件中录制的片段数量,即可以一次同时录制多个目标音频数据的音频内容,多个目标音频数据可以来源于不同的目标应用程序,也可以来源于同一个目标应用程序。
为了便于描述,第一音频数据的结束播放的时间点至第二音频数据的开始播放的时间点间的时段,可以称为目标时段。目标时段包括在同一个目标应用程序中播放的任意两个 目标音频数据的间隔时段,目标时段也可以包括录音装置在第一目标应用程序中播放第一音频数据之后,切换至第二应用程序中播放第二音频数据的间隔时段。
可以理解的是,在目标时段中,由于录音装置保持音频数据采集状态,并且在目标时段内未播放目标音频数据,用户可以在目标时段内说话,以作为播放两个目标音频数据中间承上启下的转场。如此,录音装置可以识别目标时段内的环境音频数据的音频内容,并利用该音频内容标记第一录音文件,以对第一录音文件进行个性化处理。
示例性的,录音装置可以根据识别到的环境音频数据中的音频内容,设置第一录音文件中目标内容的目标片段标识。举例说明,录音设备识别目标时段内环境音频数据中的音频内容为“想听干货的请从这里开始听”,录音装置将根据该音频内容处理第一录音文件,以使第一录音文件在播放时的进度条的相应部分中显示目标片段标识,以便于用户可以直接播放该目标片段标识所指示的目标内容。
如此,录音装置在切换播放第一音频数据和第二音频数据的过程中,持续录制环境音频数据,用户可以在目标时段内讲话,用户的讲话作为播放两个目标音频数据之间的转场录制在第一录音文件中。录音装置将识别到的环境音频数据中的音频内容,用于标记第一录音文件,能够便于用户为第一录音文件进行者个性化处理。
可选地,在录音装置录制完成第一录音文件之后,示例性地,在上述步骤103之后,本申请实施例提供的录音方法还可以包括步骤104。
步骤104、录音装置为第一录音文件添加目标标签。
其中,目标标签是基于目标音频数据或者环境音频数据中的音频内容确定的。
本申请实施例中,上述录音装置在录制完成第一录音文件之后可以为第一录音文件添加目标标签,以便于用户对第一录音文件进行归类,便于后续用户能够快速搜索到希望播放的第一录音文件。
进一步可选地,上述目标标签可以基于目标音频数据的内容确定。例如:如录音装置通过播放目标音频数据时,录音装置通过语音识别装置对目标音频数据的内容进行语音识别。录音装置根据语音识别装置对目标音频数据的语音内容的识别结果中提取出多个关键词,以供用户对第一录音文件设置标签。
进一步可选地,上述目标标签可以基于环境音频数据的内容确定。例如:录音装置可以通过语音识别装置对上述目标时段内的语音内容进行语音识别,录音装置在目标时段内语音内容的识别结果中提取出多个关键词,以供用户对第一录音文件设置标签。
可选地,本申请实施例中,上述语音识别过程可以通过录音装置中的语音识别装置进行对环境音频数据的语音识别。该语音识别装置可以包括声学模型、字典模块、语言模型和解码模块,录音装置可以通过对环境音频数据进行特征提取处理,再将提取后的特征输入声学模型、字典模块和语言模型中,以得到多个概率值,从而录音装置可以根据该多个概率值和解码模块,对环境音频数据进行语音识别。
进一步可选地,录音装置在录制完成第一录音文件之后,可以在目标应用程序界面为 第一录音文件添加自定义标签,也可以切换回第一应用程序内为第一录音文件添加自定义标签。
示例性的,如图5所示,在音频数据录制完成后,用户通过对悬浮控件中的录制完成功能键进行输入,录音装置切换至第一应用程序的运行界面,悬浮控件切换为结束录制并准备进行下一次录制的状态。录音装置得到第一录音文件后,弹出显示供用户选择对第一录音文件的自定义标签的控件“添加标签”,如果用户对“添加标签”控件进行输入,则录音装置进一步显示根据录音装置对目标时段内语音内容的识别结果中提取出的多个关键词“甜蜜时光”、“学生时代”、“高三九班”,根据用户对上述多个关键词的输入,为第一录音文件添加自定义标签(即上述目标标签)。
如此,录音装置在录制完成第一录音文件之后,录音装置可以根据用户在目标时段的语音内容作为标签关键词,以供用户为第一录音文件添加自定义标签,以便于用户对第一录音文件进行归类,便于后续用户能够快速搜索到希望播放的第一录音文件。
可选地,在录音装置录制完成第一录音文件之后,在上述步骤103之后,本申请实施例提供的录音方法还可以包括步骤105。
步骤105、录音装置显示滤镜功能控件,并响应于用户对滤镜种类的选择,处理第一录音文件。
示例性的,如图6所示,在第一录音文件录制完成之后,录音装置显示滤镜功能控件,滤镜功能控件中包括“AI智能设置”控件和“手动选择”控件。如果用户对“AI智能设置”控件进行输入,则录音装置响应于该操作,根据目标音频数据的音频内容,自动的为第一录音文件匹配声音滤镜。如果用户对“手动选择”控件进行输入,则录音装置响应于该操作,继续弹出“人生柔和”、“歌唱美化”、“回声模式”等种类的滤镜供用户选择,并根据用户对上述滤镜种类的选择结果,为第一录音文件配置声音滤镜,从而使第一录音文件在播放时,听起来更有层次、更加立体且具有声音质感。
示例性的,上述根据滤镜种类处理第一录音文件的过程可以是在第一录音文件录制完成之后,录音装置切换回第一应用程序中进行的。
可选地,在上述第一录音文件中包括N个目标音频数据的情况下,在上述步骤101之后,本申请实施例提供的录音方法还可以包括步骤106和步骤107。
步骤106、录音装置显示第一目标标识,第一目标标识用于指示N个目标音频数据的播放顺序。
其中,N为正整数。
本申请实施例中,在录音装置录制第一录音文件的过程中,该录音装置将相继播放N个目标音频数据,则上述录音装置上可以显示第一目标标识,以便于用户了解录制进程,该第一目标标识用于指示N个目标音频数据的录制顺序。
本申请实施例中,上述第一目标标识可以显示在指示第一应用程序的悬浮控件中。
步骤107、开始播放第i个目标音频数据时,录音装置更新一次第一目标标识。
其中,i为正整数,i≤N。
可以理解的是,录音装置在播放N个目标音频数据的过程中,可以不断第更新第一目标标识,以便于用户了解录音进程。示例性的,如图4中的(a)所示,如果录音装置开始录制第1个目标音频数据,则第一目标标识从片段0更新显示为“片段1”,直至当用户点击播放第2个目标音频数据按键时,录音装置开始播放下一个目标音频数据。如图4中的(b)所示,此时,录音装置响应于用户的点击操作,将该第一目标标识从“片段1”更新显示为“片段2”。
示例性的,上述第一目标标识的显示形式可以是汉字结合数字,也可以仅以数字的方式显示第一目标标识。
如此,录音装置在录制目标音频数据的过程中,可以显示目标音频数据的片段序号,有助于提示用户第一录音文件中已经录制的目标音频数据的数量,帮助用户随时了解录制进程,以控制第一录音文件的文件时长。
可选地,在上述步骤101之前,本申请实施例提供的录音方法还可以包括步骤107、步骤108和步骤109。
步骤107、录音装置接收第二输入。
示例性的,上述第二输入为用户触发录音装置使用第一应用程序开始进行音频数据采集的输入。
步骤108、录音装置响应于第二输入,显示第二目标标识。
其中,第二目标标识用于指示:录制应用程序中的音频。
可以理解的,用户在使用第一应用程序进行音频数据采集时,需要向录音装置发出录音指令,该录音指令通常情况下是用过用户触发第一应用程序中的录音按键的第二输入产生的。示例性的,如图7中的(a)所示,当用户通过第二输入触发第一应用程序的录音按键后,录音装置显示第二目标标识,第二目标标识用于指示录制应用程序中的音频的录音功能,也就是本申请实施例中的跨应用录音的功能。
示例性的,当用户触发第一应用程序的录音按键后,录音装置显示还可以显示其他标识,其他标识可以用于指示“录音”或者“录音转文字”。
步骤109、录音装置接收用户对第二目标标识的第三输入,录音装置响应于第三输入,控制第一应用程序开始音频数据采集。
本申请实施例中,只有在用户触发第二目标标识之后,录音装置才进入跨应用录音的功能,可以准备开始录制目标应用程序内的目标音频数据。
示例性的,如图7中的(b)所示,录音装置响应于第三输入,在第一应用程序的操作界面内显示“悬浮控件已开启,切换到需要录音的应用后,点击悬浮麦克风即可开始录音”。而当用户触发其他标识时,录音装置不启用跨应用录音功能,也即录音装置不启用通过第一引用程序录制目标应用程序中的目标音频数据的功能。
如此,录音装置提供用户多个使用第一应用程序进行录音的场景,在用户选择使用跨 应用录音时,才启用跨应用录音的功能,能够满足用户的不同录音需求。
需要说明的是,本申请实施例提供的录音方法,执行主体可以为录音装置,或者该录音装置中的用于执行录音方法的控制模块。本申请实施例中以录音装置执行录音方法为例,说明本申请实施例提供的录音装置。
如图8所示,本申请实施例提供一种录音装置800。该录音装置包括接收模块801、处理模块802。
接收模块801,用于在通过第一应用程序进行音频数据采集的情况下,接收用户的第一输入;
处理模块802,用于响应于接收模块801接收到的第一输入,播放目标应用程序中的目标音频数据,并通过第一应用程序得到第一录音文件;其中,第一录音文件至少包括目标音频数据。
可选地,本申请实施例中,处理模块802,还用于在从第一音频数据切换至第二音频数据进行播放的过程中,识别目标时段内的环境音频数据中的音频内容;其中,目标时段为:第一音频数据的结束播放的时间点至第二音频数据的开始播放的时间点间的时段。
可选地,本申请实施例中,处理模块802,还用于为第一录音文件添加目标标签;其中,目标标签是基于目标音频数据或者环境音频数据中的音频内容确定的。
可选地,本申请实施例中,目标音频数据的数量为N个,录音装置800还包括:显示模块803,用于显示第一目标标识,第一目标标识用于指示N个目标音频数据的播放顺序。处理模块802,还用于开始播放第i个目标音频数据时,更新第一目标标识。
可选地,本申请实施例中,处理模块803,还用于使用听筒播放目标音频数据;或以目标音量播放目标音频数据,并对第一录音文件中相应的音频内容补偿音量;或静音播放目标音频数据,并获取目标音频数据中的音频流。
可选地,本申请实施例中,接收模块801,还用于接收用户的第二输入,显示模块803,还用于响应于第二输入,显示第二目标标识,第二目标标识用于指示:录制应用程序中的音频;接收模块801,还用于接收用户对第二目标标识的第三输入;处理模块802,还用于响应于第三输入,控制第一应用程序开始音频数据采集。
本申请实施例提供一种录音装置,录音装置通过第一应用程序进行音频数据采集的情况下,录音装置在接收到用户的第一输入(该第一输入用于播放目标应用程序中的目标音频数据)后,响应于该第一输入,播放目标音频数据,并通过该第一应用程序得到第一录音文件(至少包括该目标音频数据的音频内容)。如此,在录音装置通过第一应用程序(例如:录音机、备忘录等具有录音功能的应用程序)录音的情况下,响应于用户的第一输入,录音装置切换至其他目标应用程序(例如:存储或者播放音频数据的多媒体应用程序和社交应用程序)中播放目标音频数据,在此过程中,录音装置持续进行录音,直至录音完成得到第一录音文件。如此,录音装置可以实现跨应用程序录制音频,进而可以将目标应用程序中的无法导出的目标音频数据录制为第一录音文件,方便用户通过播放录制的第一录 音文件以获得目标音频数据的音频内容,提高了录音装置播放目标音频数据的形式多样性。
本申请实施例中的录音装置可以是装置,也可以是终端中的部件、集成电路、或芯片。该装置可以是移动电子设备,也可以为非移动电子设备。示例性的,移动电子设备可以为手机、平板电脑、笔记本电脑、掌上电脑、车载电子设备、可穿戴设备、超级移动个人计算机(ultra-mobile personal computer,UMPC)、上网本或者个人数字助理(personal digital assistant,PDA)等,非移动电子设备可以为服务器、网络附属存储器(Network Attached Storage,NAS)、个人计算机(personal computer,PC)、电视机(television,TV)、柜员机或者自助机等,本申请实施例不作具体限定。
本申请实施例中的录音装置可以为具有操作系统的装置。该操作系统可以为安卓(Android)操作系统,可以为ios操作系统,还可以为其他可能的操作系统,本申请实施例不作具体限定。
本申请实施例提供的录音装置能够实现图1至图7的方法实施例实现的各个过程,为避免重复,这里不再赘述。
可选地,如图9所示,本申请实施例还提供一种电子设备900,包括处理器901,存储器902,存储在存储器902上并可在处理器901上运行的程序或指令,该程序或指令被处理器901执行时实现上述录音方法实施例的各个过程,且能达到相同的技术效果,为避免重复,这里不再赘述。
需要说明的是,本申请实施例中的电子设备包括上述的移动电子设备和非移动电子设备。
图10为实现本申请实施例的一种电子设备的硬件结构示意图。
该电子设备1000包括但不限于:射频单元1001、网络模块1002、音频输出单元1003、输入单元1004、传感器1005、显示单元1006、用户输入单元107、接口单元1008、存储器1009、以及处理器1010等部件。
本领域技术人员可以理解,电子设备1000还可以包括给各个部件供电的电源(比如电池),电源可以通过电源管理系统与处理器1010逻辑相连,从而通过电源管理系统实现管理充电、放电、以及功耗管理等功能。图10中示出的电子设备结构并不构成对电子设备的限定,电子设备可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置,在此不再赘述。
其中,用户输入单元1007,用于在通过第一应用程序进行音频数据采集的情况下,接收用户的第一输入。处理器1010,用于响应于用户输入单元1007接收到的第一输入,播放目标应用程序中的目标音频数据,并通过第一应用程序得到第一录音文件。其中,第一录音文件至少包括目标音频数据的音频内容。
可选地,目标音频数据包括第一音频数据和第二音频数据,处理器1010具体用于,在从第一音频数据切换至第二音频数据进行播放的过程中,识别目标时段内的环境音频数 据中的音频内容;其中,目标时段为:第一音频数据的结束播放的时间点至第二音频数据的开始播放的时间点间的时段。
可选地,处理器1010还用于为第一录音文件添加目标标签;其中,目标标签是基于目标音频数据或者环境音频数据中的音频内容确定的。
可选地,第一录音文件中包括N个目标音频数据,显示单元1006用于显示第一目标标识,第一目标标识用于指示N个目标音频数据的录制顺序;处理器1010,还用于开始播放第i个目标音频数据时,则更新一次第一目标标识,其中,N和i为正整数,i≤N。
可选地,处理器1010具体还用于:使用听筒播放目标音频数据;或以目标音量播放目标音频数据,并对第一录音文件中相应的音频内容补偿音量;或静音播放目标音频数据,并获取目标音频数据中的音频流。
可选地,用户输入单元1007,还用于接收用户的第二输入,显示单元1006,还用于响应于第二输入,显示第二目标标识,第二目标标识用于指示:录制应用程序中的音频;用户输入单元1007,还用于接收用户对第二目标标识的第三输入;处理器1010,还用于响应于第三输入,控制第一应用程序开始音频数据采集。
在本申请实施例中,在电子设备通过第一应用程序进行音频数据采集的情况下,电子设备在接收到用户的第一输入(该第一输入用于播放目标应用程序中的目标音频数据)后,响应于该第一输入,播放目标音频数据,并通过该第一应用程序得到第一录音文件(至少包括该目标音频数据的音频内容)。如此,在电子设备通过第一应用程序(例如:录音机、备忘录等具有录音功能的应用程序)录音的情况下,响应于用户的第一输入,电子设备切换至其他目标应用程序(例如:存储或者播放音频数据的多媒体应用程序和社交应用程序)中播放目标音频数据,在此过程中,电子设备持续进行录音,直至录音完成得到第一录音文件。如此,电子设备可以实现跨应用程序录制音频,进而可以将目标应用程序中的无法导出的目标音频数据录制为第一录音文件,方便用户通过播放录制的第一录音文件以获得目标音频数据的音频内容,提高了电子设备播放目标音频数据的形式多样性。
应理解的是,本申请实施例中,输入单元1004可以包括图形处理器(Graphics Processing Unit,GPU)10041和麦克风10042,图形处理器10041对在视频捕获模式或图像捕获模式中由图像捕获装置(如摄像头)获得的静态图片或视频的图像数据进行处理。显示单元1006可包括显示面板10061,可以采用液晶显示器、有机发光二极管等形式来配置显示面板10061。用户输入单元1007包括触控面板10071以及其他输入设备10072。触控面板10071,也称为触摸屏。触控面板10071可包括触摸检测装置和触摸控制器两个部分。其他输入设备10072可以包括但不限于物理键盘、功能键(比如音量控制按键、开关按键等)、轨迹球、鼠标、操作杆,在此不再赘述。存储器1009可用于存储软件程序以及各种数据,包括但不限于应用程序和操作系统。处理器1010可集成应用处理器和调制解调处理器,其中,应用处理器主要处理操作系统、用户界面和应用程序等,调制解调处理器主要处理无线通信。可以理解的是,上述调制解调处理器也可以不集成到处理器1010中。
本申请实施例还提供一种可读存储介质,可读存储介质上存储有程序或指令,该程序或指令被处理器执行时实现上述录音方法实施例的各个过程,且能达到相同的技术效果,为避免重复,这里不再赘述。
其中,处理器为上述实施例中的电子设备中的处理器。可读存储介质,包括计算机可读存储介质,如计算机只读存储器(Read-Only Memory,ROM)、随机存取存储器(Random Access Memory,RAM)、磁碟或者光盘等。
本申请实施例另提供了一种芯片,芯片包括处理器和通信接口,通信接口和处理器耦合,处理器用于运行程序或指令,实现上述录音方法实施例的各个过程,且能达到相同的技术效果,为避免重复,这里不再赘述。
应理解,本申请实施例提到的芯片还可以称为系统级芯片、系统芯片、芯片系统或片上系统芯片等。
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者装置不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者装置所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者装置中还存在另外的相同要素。此外,需要指出的是,本申请实施方式中的方法和装置的范围不限按示出或讨论的顺序来执行功能,还可包括根据所涉及的功能按基本同时的方式或按相反的顺序来执行功能,例如,可以按不同于所描述的次序来执行所描述的方法,并且还可以添加、省去、或组合各种步骤。另外,参照某些示例所描述的特征可在其他示例中被组合。
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以计算机软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端(可以是手机,计算机,服务器,或者网络设备等)执行本申请各个实施例的方法。
上面结合附图对本申请的实施例进行了描述,但是本申请并不局限于上述的具体实施方式,上述的具体实施方式仅仅是示意性的,而不是限制性的,本领域的普通技术人员在本申请的启示下,在不脱离本申请宗旨和权利要求所保护的范围情况下,还可做出很多形式,均属于本申请的保护之内。

Claims (16)

  1. 一种录音方法,所述方法包括:
    在通过第一应用程序进行音频数据采集的情况下,接收用户的第一输入;
    响应于所述第一输入,播放目标应用程序中的目标音频数据,并通过所述第一应用程序得到第一录音文件;
    其中,所述第一录音文件至少包括所述目标音频数据。
  2. 根据权利要求1所述的方法,其中,所述目标音频数据包括第一音频数据和第二音频数据;
    所述接收用户的第一输入之后,所述方法还包括:
    在从所述第一音频数据切换至所述第二音频数据进行播放的过程中,识别目标时段内的环境音频数据中的音频内容;
    其中,所述目标时段为:所述第一音频数据的结束播放的时间点至所述第二音频数据的开始播放的时间点间的时段。
  3. 根据权利要求2所述的方法,其中,所述通过所述第一应用程序得到第一录音文件之后,所述方法还包括:
    为所述第一录音文件添加目标标签;
    其中,所述目标标签是基于所述目标音频数据或者所述环境音频数据中的音频内容确定的。
  4. 根据权利要求1所述的方法,其中,所述第一录音文件中包括N个目标音频数据;
    所述接收用户的第一输入之后,所述方法还包括:
    显示第一目标标识,所述第一目标标识用于指示N个所述目标音频数据的播放顺序;
    开始播放第i个所述目标音频数据时,更新一次所述第一目标标识;
    其中,N和i为正整数,i≤N。
  5. 根据权利要求1至4中任一项所述的方法,其中,所述播放所述目标音频数据,包括:
    使用听筒播放所述目标音频数据;或,
    以目标音量播放所述目标音频数据,并对所述第一录音文件中相应的音频内容补 偿音量;或,
    静音播放所述目标音频数据,并获取所述目标音频数据中的音频流。
  6. 根据权利要求1至4中任一项所述的方法,其中,所述接收第一输入之前,所述方法还包括:
    接收用户的第二输入;
    响应于所述第二输入,显示第二目标标识,所述第二目标标识用于指示:录制应用程序中的音频;
    接收用户对所述第二目标标识的第三输入;
    响应于所述第三输入,控制所述第一应用程序开始音频数据采集。
  7. 一种录音装置,所述装置包括:
    接收模块,用于在通过第一应用程序进行音频数据采集的情况下,接收用户的第一输入;
    处理模块,用于响应于所述接收模块接收到的所述第一输入,播放目标应用程序中的目标音频数据,并通过所述第一应用程序得到第一录音文件;
    其中,所述第一录音文件至少包括所述目标音频数据。
  8. 根据权利要求7所述的装置,其中,所述目标音频数据包括第一音频数据和第二音频数据,所述装置还包括:
    所述处理模块,还具体用于在从所述第一音频数据切换至所述第二音频数据进行播放的过程中,识别目标时段内的环境音频数据中的音频内容;
    其中,所述目标时段为:所述第一音频数据的结束播放的时间点至所述第二音频数据的开始播放的时间点间的时段。
  9. 根据权利要求8所述的装置,其中,所述装置还包括:
    所述处理模块,还具体用于为所述第一录音文件添加目标标签;
    其中,所述目标标签是基于所述目标音频数据或者所述环境音频数据中的音频内容确定的。
  10. 根据权利要求7所述的装置,其中,所述第一录音文件中包括N个目标音频数据,所述装置还包括:
    显示模块,用于显示第一目标标识,所述第一目标标识用于指示N个所述目标音频数据的播放顺序;
    所述处理模块,还具体用于开始播放第i个所述目标音频数据时,更新一次所述第一目标标识;
    其中,N和i为正整数,i≤N。
  11. 根据权利要求7至10中任一项所述的装置,其中,所述处理模块具体用于:
    使用听筒播放所述目标音频数据;或,
    以目标音量播放所述目标音频数据,并对所述第一录音文件中相应的音频内容补偿音量;或,
    静音播放所述目标音频数据,并获取所述目标音频数据中的音频流。
  12. 根据权利要求7至10中任一项所述的装置,其中,所述装置还包括:
    所述接收模块,还用于接收第二输入;
    所述处理模块,还用于响应于所述第二输入,显示第二目标标识,所述第二目标标识用于指示:录制应用程序中的音频;
    所述接收模块,还用于接收用户对所述第二目标标识的第三输入;
    所述处理模块,还用于响应于所述第三输入,控制所述第一应用程序开始音频数据采集。
  13. 一种电子设备,包括处理器,存储器及存储在所述存储器上并可在所述处理器上运行的程序或指令,所述程序或指令被所述处理器执行时实现如权利要求1至6中任一项所述的录音方法的步骤。
  14. 一种可读存储介质,所述可读存储介质上存储程序或指令,所述程序或指令被处理器执行时实现如权利要求1至6中任一项所述的录音方法的步骤。
  15. 一种计算机程序产品,所述程序产品被至少一个处理器执行以实现如权利要求1至6中任一项所述的录音方法。
  16. 一种电子设备,包括所述电子设备被配置成用于执行如权利要求1至6中任一项所述的录音方法。
PCT/CN2022/088952 2021-04-26 2022-04-25 录音方法、装置、电子设备和可读存储介质 WO2022228377A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110455989.5A CN113241097A (zh) 2021-04-26 2021-04-26 录音方法、装置、电子设备和可读存储介质
CN202110455989.5 2021-04-26

Publications (1)

Publication Number Publication Date
WO2022228377A1 true WO2022228377A1 (zh) 2022-11-03

Family

ID=77129308

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/088952 WO2022228377A1 (zh) 2021-04-26 2022-04-25 录音方法、装置、电子设备和可读存储介质

Country Status (2)

Country Link
CN (1) CN113241097A (zh)
WO (1) WO2022228377A1 (zh)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113241097A (zh) * 2021-04-26 2021-08-10 维沃移动通信(杭州)有限公司 录音方法、装置、电子设备和可读存储介质
CN113889154B (zh) * 2021-09-30 2024-03-19 广州维梦科技有限公司 声音录制方法、终端、存储介质

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050159957A1 (en) * 2001-09-05 2005-07-21 Voice Signal Technologies, Inc. Combined speech recognition and sound recording
CN1822189A (zh) * 2006-03-02 2006-08-23 无敌科技(西安)有限公司 一种数字录音文件的内容识别方法
CN101833980A (zh) * 2009-03-12 2010-09-15 新奥特硅谷视频技术有限责任公司 一种基于语音识别的法庭庭审音频文件实时标引系统
CN111381798A (zh) * 2018-12-28 2020-07-07 广州市百果园信息技术有限公司 音频处理方法、装置、终端和存储介质
CN111512370A (zh) * 2017-12-29 2020-08-07 瑞欧威尔公司 在录制的同时对视频作语音标记
CN113241097A (zh) * 2021-04-26 2021-08-10 维沃移动通信(杭州)有限公司 录音方法、装置、电子设备和可读存储介质

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105630141A (zh) * 2014-10-31 2016-06-01 小米科技有限责任公司 信息浏览提醒方法及装置
CN107393568A (zh) * 2017-08-16 2017-11-24 广东小天才科技有限公司 一种多媒体文件的录制方法、系统及终端设备
CN108831513B (zh) * 2018-06-19 2021-01-01 广州酷狗计算机科技有限公司 录制音频数据的方法、终端、服务器和系统
CN112243064B (zh) * 2020-10-19 2022-03-04 维沃移动通信(深圳)有限公司 音频处理方法及装置

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050159957A1 (en) * 2001-09-05 2005-07-21 Voice Signal Technologies, Inc. Combined speech recognition and sound recording
CN1822189A (zh) * 2006-03-02 2006-08-23 无敌科技(西安)有限公司 一种数字录音文件的内容识别方法
CN101833980A (zh) * 2009-03-12 2010-09-15 新奥特硅谷视频技术有限责任公司 一种基于语音识别的法庭庭审音频文件实时标引系统
CN111512370A (zh) * 2017-12-29 2020-08-07 瑞欧威尔公司 在录制的同时对视频作语音标记
CN111381798A (zh) * 2018-12-28 2020-07-07 广州市百果园信息技术有限公司 音频处理方法、装置、终端和存储介质
CN113241097A (zh) * 2021-04-26 2021-08-10 维沃移动通信(杭州)有限公司 录音方法、装置、电子设备和可读存储介质

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
ANONYMOUS: "How to forward someone else's voice on WeChat? In fact, the method is very simple, but unfortunately many people do not know", 12 May 2020 (2020-05-12), pages 1 - 7, XP055982583, Retrieved from the Internet <URL:https://www.163.com/dy/article/FCEQUVB60544AQTF.html> [retrieved on 20221117] *
ANONYMOUS: "It turns out that the voice received by WeChat can be forwarded to others in this way! Don't you know now? it's a pity!", 17 October 2019 (2019-10-17), pages 1 - 13, XP055982579, Retrieved from the Internet <URL:https://page.om.qq.com/page/OqHg_uCfqvsl4D-pypwdBKFw0> [retrieved on 20221117] *
ANONYMOUS: "WeChat voice can not be forwarded to friends? It is your operation that is wrong! This is the correct forwarding method", 30 October 2019 (2019-10-30), pages 1 - 6, XP055982574, Retrieved from the Internet <URL:https://cloud.tencent.com/developer/news/464858> [retrieved on 20221117] *

Also Published As

Publication number Publication date
CN113241097A (zh) 2021-08-10

Similar Documents

Publication Publication Date Title
US10782856B2 (en) Method and device for displaying application function information, and terminal device
RU2666966C2 (ru) Способ и прибор управления для воспроизведения аудио
CN110634483B (zh) 人机交互方法、装置、电子设备及存储介质
WO2022228377A1 (zh) 录音方法、装置、电子设备和可读存储介质
US8893052B2 (en) System and method for controlling mobile terminal application using gesture
US9542949B2 (en) Satisfying specified intent(s) based on multimodal request(s)
CN107832434A (zh) 基于语音交互生成多媒体播放列表的方法和装置
WO2022022536A1 (zh) 音频播放方法、音频播放装置和电子设备
US20140164371A1 (en) Extraction of media portions in association with correlated input
WO2020238938A1 (zh) 信息输入方法及移动终端
KR20140091236A (ko) 전자 기기 및 전자 기기의 제어 방법
WO2021147785A1 (zh) 思维导图显示方法及电子设备
CN105827516A (zh) 消息处理方法和装置
US20230015943A1 (en) Scratchpad creation method and electronic device
US20140163956A1 (en) Message composition of media portions in association with correlated text
WO2022068721A1 (zh) 截屏方法、装置及电子设备
CN108174270A (zh) 数据处理方法、装置、存储介质及电子设备
US20210405767A1 (en) Input Method Candidate Content Recommendation Method and Electronic Device
WO2021179869A1 (zh) 音频播放方法、装置、存储介质及终端
CN114020197A (zh) 跨应用的消息的处理方法、电子设备及可读存储介质
WO2021163884A1 (zh) 视频精彩瞬间的录屏方法、装置及可读存储介质
WO2022143888A1 (zh) 音频处理方法、装置及电子设备
CN113055529B (zh) 录音控制方法和录音控制装置
WO2022213986A1 (zh) 语音识别的方法、装置、电子设备和可读存储介质
CN112837668B (zh) 一种语音处理方法、装置和用于处理语音的装置

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22794847

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE