WO2019029494A1 - 移动终端、存储器及录音文件的编辑方法 - Google Patents

移动终端、存储器及录音文件的编辑方法 Download PDF

Info

Publication number
WO2019029494A1
WO2019029494A1 PCT/CN2018/099029 CN2018099029W WO2019029494A1 WO 2019029494 A1 WO2019029494 A1 WO 2019029494A1 CN 2018099029 W CN2018099029 W CN 2018099029W WO 2019029494 A1 WO2019029494 A1 WO 2019029494A1
Authority
WO
WIPO (PCT)
Prior art keywords
segment
mark
user
segmentation
time
Prior art date
Application number
PCT/CN2018/099029
Other languages
English (en)
French (fr)
Inventor
邹章锋
邹永军
陈开�
冯邦全
Original Assignee
捷开通讯(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 捷开通讯(深圳)有限公司 filed Critical 捷开通讯(深圳)有限公司
Publication of WO2019029494A1 publication Critical patent/WO2019029494A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/02Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
    • G11B27/031Electronic editing of digitised analogue information signals, e.g. audio or video signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/10Digital recording or reproducing
    • G11B20/10527Audio or video recording; Data buffering arrangements
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L2025/783Detection of presence or absence of voice signals based on threshold decision
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/10Digital recording or reproducing
    • G11B20/10527Audio or video recording; Data buffering arrangements
    • G11B2020/10537Audio or video recording
    • G11B2020/10546Audio or video recording specifically adapted for audio data

Definitions

  • the present application relates to the field of electronic device technologies, and in particular, to a mobile terminal, a memory, and a method for editing a recorded file.
  • the mobile terminal When the mobile terminal performs a recording operation, the user sometimes stops talking during the recording process, so that there is a silent area in the obtained recording file. In order to obtain continuous recording files, sometimes the user wants to delete these silent areas, or when the user After recording a sentence, I found that it was not recorded correctly. I hope to delete this sentence and then re-record this sentence. At present, to solve such problems, it is necessary to edit the entire recording file with a relatively professional audio editing software after the end of the recording, which is inconvenient to operate on the mobile terminal.
  • the present application provides a method for editing a mobile terminal, a memory, and a recording file, which can quickly edit a recording file, and is more convenient for a user to operate on the mobile terminal.
  • the specific technical solution proposed by the present application is: providing an editing method of a recording file, and the editing method includes:
  • detecting the ambient sound source and marking according to the size of the decibel value of the ambient sound source includes:
  • the segmentation mark of the time when the decibel value is greater than the sound threshold is the first segmentation mark
  • the segmentation mark at the time when the decibel value is smaller than the sound threshold is the second segmentation flag.
  • the segment before the time when the user initiates the pause or end instruction is marked as the first segment mark
  • the segment mark at the time when the user initiates the pause or end instruction is the second segment mark
  • the segmentation of the time at which the user initiates the pause or end instruction is marked as the first segmentation marker.
  • editing multiple segment files includes:
  • the adjacent two segment marks on the time axis are sequentially the segment file between the first segment mark and the second segment mark as a voice region, and the adjacent two segment markers on the time axis are sequentially ranked as the second segment. a segment file between the segment mark and the first segment mark as a silent region;
  • the ambient sound source includes a signal detected when the user does not speak and a signal detected when the user speaks; wherein the decibel value of the signal detected by the user when speaking is greater than the decibel value of the signal detected when the user does not speak.
  • the editing method further includes: displaying the plurality of segment files.
  • the recording file is segmented according to the plurality of segment marks, and obtaining the plurality of segment files includes:
  • a plurality of segment marks divide the recording file into a plurality of segment files, and a segment file is formed between each adjacent two segment markers.
  • the application also provides a memory in which a plurality of instructions are stored, the instructions being adapted to be loaded and executed by the processor:
  • the instructions are also adapted to be loaded and executed by the processor:
  • the segmentation mark of the time when the decibel value is greater than the sound threshold is the first segmentation mark
  • the segmentation mark at the time when the decibel value is smaller than the sound threshold is the second segmentation flag.
  • the instructions are also adapted to be loaded and executed by the processor:
  • the segment mark at the time when the user initiates the pause or end instruction is the second segment mark
  • the segmentation of the time at which the user initiates the pause or end instruction is marked as the first segmentation marker.
  • the instructions are also suitable for loading and execution by the processor:
  • the adjacent two segment marks on the time axis are sequentially the segment file between the first segment mark and the second segment mark as a voice region, and the adjacent two segment markers on the time axis are sequentially ranked as the second segment. a segment file between the segment mark and the first segment mark as a silent region;
  • the ambient sound source includes a signal detected when the user does not speak and a signal detected when the user speaks, wherein the decibel value of the signal detected by the user when speaking is greater than the decibel value of the signal detected when the user does not speak.
  • instructions are also adapted to be loaded and executed by the processor:
  • a plurality of segment marks divide the recording file into a plurality of segment files, and a segment file is formed between each adjacent two segment markers.
  • the application also provides a mobile terminal, including:
  • processor adapted to implement a plurality of instructions
  • a memory adapted to store a plurality of instructions, the plurality of instructions being adapted to be loaded and executed by the processor:
  • the editing method of the recording file proposed by the present application obtains a recording file by detecting an ambient sound source and performing segmentation marking according to the size of the decibel value of the ambient sound source, and then segmenting the recording file according to the segmentation mark to obtain a plurality of points. Segment file; finally edit multiple segment files. Segmentation is performed by segmentation marks, and then the segment file is edited, so that the recording file can be quickly edited, which is more convenient for the user to operate on the mobile terminal.
  • 1 is a flow chart of a method of editing a sound recording file
  • FIG. 2 is a schematic diagram of a mobile terminal.
  • the method for editing a recorded file includes the following steps:
  • S2 Detecting an ambient sound source and marking according to the size of the decibel value of the ambient sound source to obtain a recording file having a plurality of segment marks.
  • the ambient sound source includes a signal detected when the user does not speak and a signal detected when the user speaks, wherein the decibel value of the signal detected by the user when speaking is greater than the decibel value of the signal detected when the user does not speak. Therefore, the recording file can be marked by the size of the decibel value of the ambient sound source to obtain a recording file having a plurality of segment marks.
  • step S4 the plurality of segment marks divide the recording file into a plurality of segment files, and a segment file is formed between each adjacent two segment markers.
  • step S2 if it is detected that the decibel value of the ambient sound source is greater than the sound threshold, the segmentation mark of the time when the decibel value is greater than the sound threshold is the first segmentation flag; if the decibel value of the ambient sound source is detected to be less than The sound threshold, the segmentation of the time when the decibel value is less than the sound threshold is the second segmentation flag. Wherein the first segment mark and the second segment mark alternately appear.
  • step S2 if the segment before the time when the user initiates the pause or end instruction is marked as the first segmentation flag, the segmentation of the time when the user initiates the pause or end instruction is marked as the second segmentation flag, if the user The segment before the moment when the pause or end instruction is initiated is marked as the second segment marker, and the segment at the moment when the user initiates the pause or end instruction is marked as the first segment marker.
  • step S5 includes:
  • the adjacent two segment marks on the time axis are sequentially the segment file between the first segment mark and the second segment mark as a voice region, and the adjacent two segment markers on the time axis are sequentially
  • the segment file between the two segment marks and the first segment mark serves as a silent zone, where the voice zone refers to a recording file in a time period in which the user is in a speaking state, and the silent zone refers to a time in which the user is in a non-talking state.
  • the recording file in the segment is sequentially the segment file between the first segment mark and the second segment mark as a voice region, and the adjacent two segment markers on the time axis are sequentially
  • the segment file between the two segment marks and the first segment mark serves as a silent zone, where the voice zone refers to a recording file in a time period in which the user is in a speaking state, and the silent zone refers to a time in which the user is in a non-talking state.
  • the recording file in the segment is sequentially the segment file between
  • the two adjacent segment marks on the time axis are, in order, the first segment mark, and the second segment mark refers to the first segment mark at the moment before and the second segment mark at the time adjacent thereto.
  • the first segment mark at the moment indicates that the user just shifts from the non-speaking state to the speaking state
  • the second segment marker at the time indicates the moment when the user just transitions from the talking state to the non-speaking state, therefore,
  • the two adjacent segment marks on the time axis are the first segment mark and the segment file between the second segment marks is the voice region.
  • the adjacent two segment marks on the time axis are in turn The segment file between the two segment marks and the first segment marks is a silent zone.
  • step S52 the user can select to audition the voice zone and the silence zone according to the need, or the user can select to delete all the silence zones when the user wants to delete the silence zone to obtain a continuous recording file, and then the remaining voice zone is performed.
  • Splicing to obtain a continuous recording file the user wants to delete a certain voice zone, re-record, and then insert the recording into the location of the deleted voice zone and splicing with other voice zones or video zones to obtain the user.
  • Required recording file The technique of splicing a recording file is a technique well known in the art and will not be described here.
  • the editing method in this embodiment further includes: displaying a plurality of segment files before the editing of the plurality of segment files, which is more convenient for the user to perform operations.
  • the content corresponding to the segment file can be directly obtained during editing, so that it can be deleted without being auditioned, thereby speeding up
  • the editing speed is more convenient for the user to operate.
  • the segment mark at the time when the user initiates the pause or end instruction is the second segment mark
  • the segment before the time when the user initiates the pause or end instruction is marked as the second segment mark
  • the segment at the time when the user initiates the pause or end instruction is marked as the first segment mark
  • the user wants to delete all the silent areas if the segment before the time when the user initiates the pause or end instruction is marked as the first segment mark, it means that the last segment file is a silent zone, and if the user still initiates the user at this time. The time of the pause or end instruction is marked as the first segment mark. When the silence zone is deleted, the last segment file will not be deleted. Similarly, if the user initiates the pause or end instruction If the segment is marked as the second segmentation mark, it means that the last segment file is a voice zone. If the time when the user initiates the pause or end instruction is still marked as the second segmentation mark, when the voice zone is auditioned, The last segment file will not be used as a speech area.
  • the embodiment further provides a mobile terminal, which includes a processor 1 and a memory 2.
  • the processor 1 is adapted to implement a plurality of instructions, the memory 2 being adapted to store a plurality of instructions, wherein the plurality of instructions are adapted to be loaded and executed by the processor 1:
  • the instructions are further adapted to be loaded and executed by the processor 1:
  • the segmentation mark of the time when the decibel value is greater than the sound threshold is the first segmentation mark
  • the segmentation mark at the time when the decibel value is smaller than the sound threshold is the second segmentation flag.
  • the instructions are further adapted to be loaded and executed by the processor 1:
  • the segment mark at the time when the user initiates the pause or end instruction is the second segment mark
  • the segmentation of the time at which the user initiates the pause or end instruction is marked as the first segmentation marker.
  • the instructions are further adapted to be loaded and executed by the processor 1:
  • the adjacent two segment marks on the time axis are sequentially the segment file between the first segment mark and the second segment mark as a voice region, and the adjacent two segment markers on the time axis are sequentially ranked as the second segment. a segment file between the segment mark and the first segment mark as a silent region;
  • the instructions are also adapted to be loaded by the processor 1 and to perform display of a plurality of segmented files.
  • the processor 1 includes a mode switching module 11, a marking module 12, a segmentation module 13, and an editing module 14.
  • the mode switching module 11 is configured to switch the mobile terminal to the recording mode according to the user's opening instruction and to switch the mobile terminal to the editing mode according to the user's pause or end instruction.
  • the marking module 12 is configured to detect an ambient sound source and mark according to the size of the decibel value of the ambient sound source to obtain a recording file having a plurality of segment marks.
  • the segmentation module 13 is configured to segment the recording file according to the plurality of segmentation marks to obtain a plurality of segmentation files.
  • the editing module 14 is used to edit a plurality of segment files.
  • the editing module 14 is further configured to use the segmentation file between the first segment marker and the second segment marker as the voice region in the adjacent two segment markers on the time axis, and the adjacent two segments on the time axis.
  • the mark is in turn the second segment mark, the segment file between the first segment marks as a silent area, and the voice area and the silence area are respectively auditioned or deleted.

Abstract

本申请提供一种移动终端、存储器及录音文件的编辑方法,所述编辑方法包括:根据用户的开启指令进入录音模式;检测环境声源并根据所述环境声源的分贝值的大小进行标记,获得具有多个分段标记的录音文件;根据用户的暂停或结束指令进入编辑模式;根据所述多个分段标记对所述录音文件进行分段,获得多个分段文件;对所述多个分段文件进行编辑。本申请提出的录音文件的编辑方法通过分段标记进行分段,然后对分段文件进行编辑,从而实现快速对录音文件进行编辑,更便于用户在移动终端上进行操作。

Description

移动终端、存储器及录音文件的编辑方法 技术领域
本申请涉及电子设备技术领域,尤其涉及一种移动终端、存储器及录音文件的编辑方法。
背景技术
移动终端在进行录音操作时,用户有时候在录音过程中停止说话,从而使得在获得的录音文件中有一段静默区,为了能够获得连续的录音文件,有时用户希望删除这些静默区,或者当用户录完一句话后,发现录得不对,希望将这段话进行删除后重新录这一句话。目前要解决这类问题,需要在录音结束之后,对整个录音文件采用比较专业的音频编辑软件进行编辑,在移动终端上不便于操作。
发明内容
为了解决现有技术的不足,本申请提供一种移动终端、存储器及录音文件的编辑方法,能够快速对录音文件进行编辑,更便于用户在移动终端上进行操作。
本申请提出的具体技术方案为:提供一种录音文件的编辑方法,编辑方法包括:
根据用户的开启指令进入录音模式;
检测环境声源并根据环境声源的分贝值的大小进行标记,获得具有多个分段标记的录音文件;
根据用户的暂停或结束指令进入编辑模式;
根据多个分段标记对录音文件进行分段,获得多个分段文件;
对多个分段文件进行编辑。
为了加快编辑速度,更便于用户操作,进一步地,检测环境声源并根据环境声源的分贝值的大小进行标记包括:
若检测到环境声源的分贝值大于声音阈值,则分贝值大于声音阈值的时刻的分段标记为第一分段标记;
若检测到环境声源的分贝值小于声音阈值,则分贝值小于声音阈值的时刻的分段标记为第二分段标记。
为了避免误操作,进一步地,若在用户发起暂停或结束指令的时刻之前的分段标记为第一分段标记,则用户发起暂停或结束指令的时刻的分段标记为第二分段标记;
若在用户发起暂停或结束指令的时刻之前的分段标记为第二分段标记,则用户发起暂停或结束指令的时刻的分段标记为第一分段标记。
为了进一步加快编辑速度,更便于用户操作,对多个分段文件进行编辑包括:
将时间轴上相邻两个分段标记依次为第一分段标记、第二分段标记之间的分段文件作为语音区,将时间轴上相邻两个分段标记依次为第二分段标记、第一分段标记之间的分段文件作为静默区;
对语音区、静默区分别进行试听或删除。
其中,环境声源包括用户不说话时检测到的信号和用户说话时检测到的信号;其中,用户说话时检测到的信号的分贝值大于用户不说话时检测到的信号的分贝值。
进一步地,对多个分段文件进行编辑之前,编辑方法还包括:对多个分段文件进行显示。
进一步地,根据多个分段标记对录音文件进行分段,获得多个分段文件包括:
多个分段标记将录音文件分割成多个分段文件,每相邻两个分段标记之间形成一个分段文件。
本申请还提供了一种存储器,存储器中存储有多条指令,指令适于由处理器加载并执行:
根据用户的开启指令进入录音模式;
检测环境声源并根据环境声源的分贝值的大小进行标记,获得具有多个分段标记的录音文件;
根据用户的暂停或结束指令进入编辑模式;
根据多个分段标记对录音文件进行分段,获得多个分段文件;
对多个分段文件进行编辑。
为了加快编辑速度,更便于用户操作,进一步地,指令还适于由处理器加载并执行:
若检测到环境声源的分贝值大于声音阈值,则分贝值大于声音阈值的时刻的分段标记为第一分段标记;
若检测到环境声源的分贝值小于声音阈值,则分贝值小于声音阈值的时刻的分段标记为第二分段标记。
为了避免误操作,进一步地,指令还适于由处理器加载并执行:
若在用户发起暂停或结束指令的时刻之前的分段标记为第一分段标记,则用户发起暂停或结束指令的时刻的分段标记为第二分段标记;
若在用户发起暂停或结束指令的时刻之前的分段标记为第二分段标记,则用户发起暂停或结束指令的时刻的分段标记为第一分段标记。
为了进一步加快编辑速度,更便于用户操作,指令还适于由处理器加载并执行:
将时间轴上相邻两个分段标记依次为第一分段标记、第二分段标记之间的分段文件作为语音区,将时间轴上相邻两个分段标记依次为第二分段标记、第一分段标记之间的分段文件作为静默区;
对语音区、静默区分别进行试听或删除。
其中,环境声源包括用户不说话时检测到的信号和用户说话时检测到的信号,其中,用户说话时检测到的信号的分贝值大于用户不说话时检测到的信号的分贝值。
进一步地,指令还适于由处理器加载并执行:
多个分段标记将录音文件分割成多个分段文件,每相邻两个分段标记之间形成一个分段文件。
本申请还提供了一种移动终端,包括:
处理器,适于实现多条指令;
存储器,适于存储多条指令,多条指令适于由处理器加载并执行:
根据用户的开启指令进入录音模式;
检测环境声源并根据环境声源的分贝值的大小进行标记,获得具有多个分段标记的录音文件;
根据用户的暂停或结束指令进入编辑模式;
根据多个分段标记对录音文件进行分段,获得多个分段文件;
对多个分段文件进行编辑。
本申请提出的录音文件的编辑方法通过检测环境声源并根据环境声源的分贝值的大小进行分段标记来获得录音文件,然后再根据分段标记对录音文件进行分段,获得多个分段文件;最后对多个分段文件进行编辑。通过分段标记进行分段,然后对分段文件进行编辑,从而实现快速对录音文件进行编辑,更便于用户在移动终端上进行操作。
附图说明
下面结合附图,通过对本申请的具体实施方式详细描述,将使本申请的技术方案及其它有益效果显而易见。
图1为录音文件的编辑方法的流程图;
图2为移动终端的示意图。
具体实施方式
以下,将参照附图来详细描述本申请的实施例。然而,可以以许多不同的形式来实施本申请,并且本申请不应该被解释为限制于这里阐述的具体实施例。相反,提供这些实施例是为了解释本申请的原理及其实际应用,从而使本领域的其他技术人员能够理解本申请的各种实施例和适合于特定预期应用的各种修改。在附图中,相同的标号将始终被用于表示相同的元件。
参照图1,本实施例提供的录音文件的编辑方法包括步骤:
S1、根据用户的开启指令进入录音模式。
S2、检测环境声源并根据环境声源的分贝值的大小进行标记,获得具有多个分段标记的录音文件。
在步骤S2中,环境声源包括用户不说话时检测到的信号和用户说话时检测到的信号,其中,用户说话时检测到的信号的分贝值大于用户不说话时检测到的信号的分贝值,因此,通过环境声源的分贝值的大小就可以对录音文件进行标记,获得具有多个分段标记的录音文件。
S3、根据用户的暂停或结束指令进入编辑模式,在录音过程中,用户可以发起暂停指令或者结束指令进入编辑模式来对获得的录音文件进行编辑。
S4、根据多个分段标记对录音文件进行分段,获得多个分段文件。
在步骤S4中,多个分段标记将录音文件分割成多个分段文件,每相邻两个分段标记之间形成一个分段文件。
S5、对多个分段文件进行编辑。
具体地,在步骤S2中,若检测到环境声源的分贝值大于声音阈值,则分贝值大于声音阈值的时刻的分段标记为第一分段标记;若检测到环境声源的分贝值小于声音阈值,则分贝值小于声音阈值的时刻的分段标记为第二分段标记。其中,第一分段标记和第二分段标记交替出现。
在步骤S2中,若在用户发起暂停或结束指令的时刻之前的分段标记为第一分段标记,则用户发起暂停或结束指令的时刻的分段标记为第二分段标记,若在用户发起暂停或结束指令的时刻之前的分段标记为第二分段标记,则用户发起暂停或结束指令的时刻的分段标记为第一分段标记。
具体地,步骤S5包括:
S51、将时间轴上相邻两个分段标记依次为第一分段标记、第二分段标记之间的分段文件作为语音区,将时间轴上相邻两个分段标记依次为第二分段标记、第一分段标记之间的分段文件作为静默区,这里语音区指的是用户处于说话状态的时间段内的录音文件,静默区指的是用户处于不说话状态的时间段内的录音文件。
时间轴上相邻两个分段标记依次为第一分段标记、第二分段标记指的是时刻在前的第一分段标记与和其相邻的时刻在后的第二分段标记,时刻 在前的第一分段标记表示用户刚好从处于不说话状态转换为说话状态的时刻,时刻在后的第二分段标记表示用户刚好从说话状态转换为不说话状态的时刻,因此,时间轴上相邻两个分段标记依次为第一分段标记、第二分段标记之间的分段文件即为语音区,同理,时间轴上相邻两个分段标记依次为第二分段标记、第一分段标记之间的分段文件即为静默区。
S52、对语音区、静默区分别进行试听或删除。
在步骤S52中,用户可以根据需要选择对语音区、静默区进行试听,或者用户想删掉静默区获得连续的录音文件时可以选择将所有的静默区进行删除,然后将剩下的语音区进行拼接获得连续的录音文件,用户想将某段语音区删掉,重新进行录音,然后再将该录音插入到删掉的语音区所在的位置并与其它语音区或视频区进行拼接,获得符合用户要求的录音文件。对录音文件进行拼接的技术为本领域所熟知的技术,这里不再赘述。
本实施例中的编辑方法在对多个分段文件进行编辑之前还包括:对多个分段文件进行显示,可以更便于用户进行操作。
本实施例通过将多个分段文件划分为语音区或静默区,在编辑的时候可以直接的获取该分段文件对应的内容,从而在不需要试听的情况下便可以对其进行删除,加快了编辑速度,更便于用户操作。
此外,本实施例中,若在用户发起暂停或结束指令的时刻之前的分段标记为第一分段标记,则用户发起暂停或结束指令的时刻的分段标记为第二分段标记,若在用户发起暂停或结束指令的时刻之前的分段标记为第二分段标记,则用户发起暂停或结束指令的时刻的分段标记为第一分段标记,这样在对分段内文件进行删除时,可以避免对最后一个分段文件进行误操作。
例如,用户想将静默区全部删除,如果在用户发起暂停或结束指令的时刻之前的分段标记为第一分段标记,则表示最后一个分段文件为静默区,如果此时仍将用户发起暂停或结束指令的时刻标记为第一分段标记,在对静默区进行删除的时候,最后一个分段文件将不会被删除,同样的,如果在用户发起暂停或结束指令的时刻之前的分段标记为第二分段标记,则表示最后一个分段文件为语音区,如果此时仍将用户发起暂停或结束指令的时刻标记为第二分段标记,在对语音区进行试听的时候,最后一个分段文件将不会被作为语音区。
参照图2,本实施例还提供了一种移动终端,其包括处理器1和存储器2。处理器1适于实现多条指令,存储器2适于存储多条指令,其中,多条指令适于由处理器1加载并执行:
根据用户的开启指令进入录音模式;
检测环境声源并根据环境声源的分贝值的大小进行标记,获得具有多个分段标记的录音文件;
根据用户的暂停或结束指令进入编辑模式;
根据多个分段标记对录音文件进行分段,获得多个分段文件;
对多个分段文件进行编辑。
本实施例中,指令还适于由处理器1加载并执行:
若检测到环境声源的分贝值大于声音阈值,则分贝值大于声音阈值的时刻的分段标记为第一分段标记;
若检测到环境声源的分贝值小于声音阈值,则分贝值小于声音阈值的时刻的分段标记为第二分段标记。
本实施例中,指令还适于由处理器1加载并执行:
若在用户发起暂停或结束指令的时刻之前的分段标记为第一分段标记,则用户发起暂停或结束指令的时刻的分段标记为第二分段标记;
若在用户发起暂停或结束指令的时刻之前的分段标记为第二分段标记,则用户发起暂停或结束指令的时刻的分段标记为第一分段标记。
此外,本实施例中,指令还适于由处理器1加载并执行:
将时间轴上相邻两个分段标记依次为第一分段标记、第二分段标记之间的分段文件作为语音区,将时间轴上相邻两个分段标记依次为第二分段标记、第一分段标记之间的分段文件作为静默区;
对语音区、静默区分别进行试听或删除。
为了更便于用户操作,指令还适于由处理器1加载并执行对多个分段文件进行显示。
具体地,处理器1包括模式切换模块11、标记模块12、分段模块13及编辑模块14。模式切换模块11用于根据用户的开启指令将移动终端切换为录 音模式以及用于根据用户的暂停或结束指令将移动终端切换为编辑模式。标记模块12用于检测环境声源并根据环境声源的分贝值的大小进行标记,获得具有多个分段标记的录音文件。分段模块13用于根据多个分段标记对录音文件进行分段,获得多个分段文件。编辑模块14用于对多个分段文件进行编辑。
编辑模块14还用于将时间轴上相邻两个分段标记依次为第一分段标记、第二分段标记之间的分段文件作为语音区,将时间轴上相邻两个分段标记依次为第二分段标记、第一分段标记之间的分段文件作为静默区以及对语音区、静默区分别进行试听或删除。
以上所述仅是本申请的具体实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本申请原理的前提下,还可以做出若干改进和润饰,这些改进和润饰也应视为本申请的保护范围。

Claims (20)

  1. 一种录音文件的编辑方法,其中,包括:
    根据用户的开启指令进入录音模式;
    检测环境声源并根据所述环境声源的分贝值的大小进行标记,获得具有多个分段标记的录音文件;
    根据用户的暂停或结束指令进入编辑模式;
    根据所述多个分段标记对所述录音文件进行分段,获得多个分段文件;
    对所述多个分段文件进行编辑。
  2. 根据权利要求1所述的编辑方法,其中,检测环境声源并根据所述环境声源的分贝值的大小进行标记,获得具有多个分段标记的录音文件包括:
    若检测到所述环境声源的分贝值大于声音阈值,则分贝值大于声音阈值的时刻的分段标记为第一分段标记;
    若检测到所述环境声源的分贝值小于所述声音阈值,则分贝值小于所述声音阈值的时刻的分段标记为第二分段标记。
  3. 根据权利要求2所述的编辑方法,其中,
    若在用户发起暂停或结束指令的时刻之前的分段标记为第一分段标记,则用户发起暂停或结束指令的时刻的分段标记为第二分段标记;
    若在用户发起暂停或结束指令的时刻之前的分段标记为第二分段标记,则用户发起暂停或结束指令的时刻的分段标记为第一分段标记。
  4. 根据权利要求3所述的编辑方法,其中,对所述多个分段文件进行编辑包括:
    将时间轴上相邻两个分段标记依次为第一分段标记、第二分段标记之间的分段文件作为语音区,将时间轴上相邻两个分段标记依次为第二分段标记、第一分段标记之间的分段文件作为静默区;
    对所述语音区、所述静默区分别进行试听或删除。
  5. 根据权利要求2所述的编辑方法,其中,所述环境声源包括用户不说话时检测到的信号和用户说话时检测到的信号,其中,用户说话时检测到的信号的分贝值大于用户不说话时检测到的信号的分贝值。
  6. 根据权利要求1所述的编辑方法,其中,对所述多个分段文件进行编辑 之前,所述编辑方法还包括:对所述多个分段文件进行显示。
  7. 根据权利要求1所述的编辑方法,其中,根据多个分段标记对录音文件进行分段,获得多个分段文件包括:
    多个分段标记将录音文件分割成多个分段文件,每相邻两个分段标记之间形成一个分段文件。
  8. 一种存储器,其中,所述存储器中存储有多条指令,所述指令适于由处理器加载并执行:
    根据用户的开启指令进入录音模式;
    检测环境声源并根据所述环境声源的分贝值的大小进行标记,获得具有多个分段标记的录音文件;
    根据用户的暂停或结束指令进入编辑模式;
    根据所述多个分段标记对所述录音文件进行分段,获得多个分段文件;
    对所述多个分段文件进行编辑。
  9. 根据权利要求8所述的存储器,其中,所述指令还适于由所述处理器加载并执行:
    若检测到所述环境声源的分贝值大于声音阈值,则分贝值大于声音阈值的时刻的分段标记为第一分段标记;
    若检测到所述环境声源的分贝值小于所述声音阈值,则分贝值小于所述声音阈值的时刻的分段标记为第二分段标记。
  10. 根据权利要求9所述的存储器,其中,所述指令还适于由所述处理器加载并执行:
    若在用户发起暂停或结束指令的时刻之前的分段标记为第一分段标记,则用户发起暂停或结束指令的时刻的分段标记为第二分段标记;
    若在用户发起暂停或结束指令的时刻之前的分段标记为第二分段标记,则用户发起暂停或结束指令的时刻的分段标记为第一分段标记。
  11. 根据权利要求10所述的存储器,其中,所述指令还适于由所述处理器加载并执行:
    将时间轴上相邻两个分段标记依次为第一分段标记、第二分段标记之间的分段文件作为语音区,将时间轴上相邻两个分段标记依次为第二分段标记、第一分段标记之间的分段文件作为静默区;
    对所述语音区、所述静默区分别进行试听或删除。
  12. 根据权利要求9所述的存储器,其中,所述环境声源包括用户不说话时检测到的信号和用户说话时检测到的信号,其中,用户说话时检测到的信号的分贝值大于用户不说话时检测到的信号的分贝值。
  13. 根据权利要求8所述的存储器,其中,所述指令还适于由所述处理器加载并执行:
    多个分段标记将录音文件分割成多个分段文件,每相邻两个分段标记之间形成一个分段文件。
  14. 一种移动终端,其中,包括:
    处理器,适于实现多条指令;
    存储器,适于存储所述多条指令,所述多条指令适于由所述处理器加载并执行:
    根据用户的开启指令进入录音模式;
    检测环境声源并根据所述环境声源的分贝值的大小进行标记,获得具有多个分段标记的录音文件;
    根据用户的暂停或结束指令进入编辑模式;
    根据所述多个分段标记对所述录音文件进行分段,获得多个分段文件;
    对所述多个分段文件进行编辑。
  15. 根据权利要求14所述的移动终端,其中,所述指令还适于由所述处理器加载并执行:
    若检测到所述环境声源的分贝值大于声音阈值,则分贝值大于声音阈值的时刻的分段标记为第一分段标记;
    若检测到所述环境声源的分贝值小于所述声音阈值,则分贝值小于所述声音阈值的时刻的分段标记为第二分段标记。
  16. 根据权利要求15所述的移动终端,其中,所述指令还适于由所述处理器加载并执行:
    若在用户发起暂停或结束指令的时刻之前的分段标记为第一分段标记,则用户发起暂停或结束指令的时刻的分段标记为第二分段标记;
    若在用户发起暂停或结束指令的时刻之前的分段标记为第二分段标记,则用户发起暂停或结束指令的时刻的分段标记为第一分段标记。
  17. 根据权利要求16所述的移动终端,其中,所述指令还适于由所述处理器加载并执行:
    将时间轴上相邻两个分段标记依次为第一分段标记、第二分段标记之间的分段文件作为语音区,将时间轴上相邻两个分段标记依次为第二分段标记、第一分段标记之间的分段文件作为静默区;
    对所述语音区、所述静默区分别进行试听或删除。
  18. 根据权利要求15所述的移动终端,其中,所述环境声源包括用户不说话时检测到的信号和用户说话时检测到的信号,其中,用户说话时检测到的信号的分贝值大于用户不说话时检测到的信号的分贝值。
  19. 根据权利要求14所述的移动终端,其中,所述指令还适于由所述处理器加载并执行:
    多个分段标记将录音文件分割成多个分段文件,每相邻两个分段标记之间形成一个分段文件。
  20. 根据权利要求14所述的移动终端,其中,所述指令还适于由所述处理器加载并执行:
    对所述多个分段文件进行编辑之前,将所述多个分段文件在所述移动终端中进行显示。
PCT/CN2018/099029 2017-08-07 2018-08-06 移动终端、存储器及录音文件的编辑方法 WO2019029494A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710664935.3 2017-08-07
CN201710664935.3A CN107481743A (zh) 2017-08-07 2017-08-07 移动终端、存储器及录音文件的编辑方法

Publications (1)

Publication Number Publication Date
WO2019029494A1 true WO2019029494A1 (zh) 2019-02-14

Family

ID=60597721

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/099029 WO2019029494A1 (zh) 2017-08-07 2018-08-06 移动终端、存储器及录音文件的编辑方法

Country Status (2)

Country Link
CN (1) CN107481743A (zh)
WO (1) WO2019029494A1 (zh)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107481743A (zh) * 2017-08-07 2017-12-15 捷开通讯(深圳)有限公司 移动终端、存储器及录音文件的编辑方法
CN108124059B (zh) * 2017-12-21 2020-03-03 维沃移动通信有限公司 一种录音方法及移动终端
CN108363765B (zh) * 2018-02-06 2020-12-08 深圳市鹰硕技术有限公司 音频段落识别方法以及装置
CN111328418A (zh) * 2018-03-29 2020-06-23 华为技术有限公司 自动识别音频中不同人声的方法
CN108419124B (zh) * 2018-05-08 2020-11-17 北京酷我科技有限公司 一种音频处理方法
CN111445929A (zh) * 2020-03-12 2020-07-24 维沃移动通信有限公司 一种语音信息处理方法及电子设备
CN111933176B (zh) * 2020-09-22 2020-12-22 成都启英泰伦科技有限公司 一种批量定位语音内容的方法及装置
CN112732139A (zh) * 2021-01-12 2021-04-30 Oppo广东移动通信有限公司 录音处理方法、装置、移动终端及存储介质
CN112887480B (zh) * 2021-01-22 2022-07-29 维沃移动通信有限公司 音频信号处理方法、装置、电子设备和可读存储介质

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104092809A (zh) * 2014-07-24 2014-10-08 广东欧珀移动通信有限公司 通话录音方法、通话录音播放方法及其相应装置
CN104157301A (zh) * 2014-07-25 2014-11-19 广州三星通信技术研究有限公司 删除语音信息空白片段的方法、装置和终端
CN105895102A (zh) * 2015-11-15 2016-08-24 乐视移动智能信息技术(北京)有限公司 录音编辑方法及录音装置
US20160379684A1 (en) * 2015-06-25 2016-12-29 Intel Corporation Techniques to Save or Delete a Video Clip
CN107481743A (zh) * 2017-08-07 2017-12-15 捷开通讯(深圳)有限公司 移动终端、存储器及录音文件的编辑方法

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104092809A (zh) * 2014-07-24 2014-10-08 广东欧珀移动通信有限公司 通话录音方法、通话录音播放方法及其相应装置
CN104157301A (zh) * 2014-07-25 2014-11-19 广州三星通信技术研究有限公司 删除语音信息空白片段的方法、装置和终端
US20160379684A1 (en) * 2015-06-25 2016-12-29 Intel Corporation Techniques to Save or Delete a Video Clip
CN105895102A (zh) * 2015-11-15 2016-08-24 乐视移动智能信息技术(北京)有限公司 录音编辑方法及录音装置
CN107481743A (zh) * 2017-08-07 2017-12-15 捷开通讯(深圳)有限公司 移动终端、存储器及录音文件的编辑方法

Also Published As

Publication number Publication date
CN107481743A (zh) 2017-12-15

Similar Documents

Publication Publication Date Title
WO2019029494A1 (zh) 移动终端、存储器及录音文件的编辑方法
US20180286459A1 (en) Audio processing
JP6242773B2 (ja) 会議情報蓄積装置、方法およびプログラム
US20140006948A1 (en) Method and mobile phone for capturing audio file or video file
CN104409087B (zh) 歌曲文件播放方法和系统
WO2016197708A1 (zh) 一种录音方法及终端
KR20180090294A (ko) 오디오 파일 재 녹음 방법, 장치 및 저장매체
CN111527746A (zh) 用于将音乐链接到摄影的电子设备及其控制方法
WO2022217944A1 (zh) 字幕与音源的绑定方法及装置
WO2022001579A1 (zh) 音频处理方法、装置、设备及存储介质
JP2005044409A (ja) 情報再生装置、情報再生方法および情報再生プログラム
US20200244809A1 (en) Method of automatically playing a voice message, and smart phone and computer program product implementing the same
KR20230125284A (ko) 오디오 처리 방법, 장치 및 전자기기
JP2014142501A (ja) テキスト再生装置、方法、及びプログラム
JP2005221565A (ja) 音声データファイル格納方法および録音処理装置
KR20170005590A (ko) 음성 통화 녹음 방법 및 이를 수행하는 단말
KR101964359B1 (ko) 딥러닝용 오디오 데이터 생성방법 및 장치
JP2005107617A (ja) 音声データ検索装置。
WO2019061192A1 (zh) 音频处理方法及相关产品
JP2005107617A5 (zh)
JP2015025842A (ja) 音声再生装置
JP4779954B2 (ja) 音声データ処理装置、方法及びプログラム
JP4973431B2 (ja) 音声再生プログラム及び音声再生装置
JP2010008938A (ja) ボイスレコーダー、及び音声録音方法
JP3704968B2 (ja) マルチメディア編集装置

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18844641

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18844641

Country of ref document: EP

Kind code of ref document: A1