CN106782506A

CN106782506A - A kind of method that recorded audio is divided into section

Info

Publication number: CN106782506A
Application number: CN201611037945.6A
Authority: CN
Inventors: 张悦
Original assignee: Language Network (wuhan) Information Technology Co Ltd
Current assignee: Language Network (wuhan) Information Technology Co Ltd
Priority date: 2016-11-23
Filing date: 2016-11-23
Publication date: 2017-05-31

Abstract

Recorded audio is divided into the method for section the invention discloses a kind of, it is characterized in that comprising the following steps：Recorded audio data are obtained and traveled through, phonological component and mute part is obtained；At setting pause；Several nodes are formed according to time division, node serial number is set；Section is formed between two adjacent nodes；Node is modified；It is described to be whether decision node belongs at pause to the method that node is modified, if node is not belonging at pause, then at the supreme pause of knot adjustment；If node belongs at pause, continue to correct next node until terminating；The time of the mute part is the time difference between two adjacent phonological components.Advantage is:1. the audio segmentation of Large Copacity facilitates storage to take into some sections；2. the node for splitting formation during segmentation belongs at pause（Usually sentence tail or section tail）, it is to avoid audio loss, enhance Consumer's Experience.

Description

A kind of method that recorded audio is divided into section

Technical field

The invention belongs to the present invention relates to field of audio processing, more particularly to a kind of side that recorded audio is divided into section Method.

Background technology

With continuing to develop for Internet technology, the multi-medium data such as image, video, audio has been increasingly becoming internet letter Main information medium form in breath process field.Wherein, voice data occupies critically important position.Original audio data is in itself It is that a kind of non-semantic symbol is represented and non-structured binary stream.Often capacity is very for the recorded audio formed in convention Greatly, the time is very long, and person for recording has many people, and that user needs is often wherein a bit of, or someone audio, The audio segmentation Large Copacity is at this time accomplished by into some sections, facilitates storage to take, shape is often split during segmentation Into node at be not a tail or section tail（It is defined as at pause）, audio loss can be caused, while also resulting in user's body Test bad.

The content of the invention

The technical problems to be solved by the invention be audio segmentation formed node at be not so as to cause audio at pause Loss and the bad problem of Consumer's Experience, and the method for improving this problem will be exactly adjusted to pause at spliting node.

In order to solve the above technical problems, the invention provides a kind of method that recorded audio is divided into section, it is characterized in that Comprise the following steps：

Recorded audio data are obtained and traveled through, phonological component and mute part is obtained；

At setting pause；

Several nodes are formed according to time division, node serial number is set；

Section is formed between two adjacent nodes；

Node is modified；

It is described to be whether decision node belongs at pause to the method that node is modified, if node is not belonging at pause, that At the supreme pause of knot adjustment；

If node belongs at pause, continue to correct next node until terminating；

The time of the mute part is the time difference between two adjacent phonological components.

Further, the method at the setting pause is that the average mute time of the Time Calculation according to mute part will be big It is judged as at pause in the mute part of the threshold value of average mute time.

Further, the step of Time Calculation according to mute part average mute time is to obtain mute part Total duration, and mute part quantity, calculated divided by the quantity of mute part with the total duration of mute part average Jing Yin Time.

Further, the method at the setting pause is the median of the time for taking mute part and is set as at pause.

Further, the method at the setting pause is to record the sample of recorded audio according to custom word speed by person for recording, The sample of the recorded audio is including at a pause, will be set as the pause of recorded audio at the pause of the sample of recorded audio Place.

Further, it is described amendment node method also include node before and/or node after character whether with node label Tag match in storehouse, the node label storehouse be store some sentences section start or word label that section terminates language material Storehouse.

Further, whether the method also character including decision node of the amendment node changes tag match with personage, It is to be used to the personage's differentiation identifier distinguished according to what the sound of people was differently formed in recording that the personage changes label.

Using above-mentioned technical proposal, following effect is can reach：

1. the audio segmentation of Large Copacity facilitates storage to take into some sections；

2. the node for splitting formation during segmentation belongs at pause（Usually sentence tail or section tail）, it is to avoid audio damage Lose, enhance Consumer's Experience.

Brief description of the drawings

Accompanying drawing described herein is used for providing a further understanding of the present invention, constitutes the part of the application, this hair Bright schematic description and description does not constitute inappropriate limitation of the present invention, in the accompanying drawings for explaining the present invention：

Fig. 1 shows a kind of schematic flow sheet of the method that recorded audio is divided into section.

Specific embodiment

Technical scheme is further described in detail with reference to the accompanying drawings and detailed description.

In order to solve the above technical problems, as shown in figure 1, the invention provides a kind of side that recorded audio is divided into section Method, it is characterized in that comprising the following steps：

At setting pause；

Section is formed between two adjacent nodes；

Node is modified；

If node belongs at pause, continue to correct next node until all node processings are terminated；

It should also be appreciated by one skilled in the art that the foregoing is only the preferred embodiments of the present invention, it is not used to The limitation present invention, for a person skilled in the art, the present invention can have various modifications and variations.It is all in essence of the invention Within god and principle, any modification, equivalent substitution and improvements made etc. should be included within the scope of the present invention.

Claims

1. it is a kind of that recorded audio is divided into the method for section, it is characterized in that comprising the following steps：

At setting pause；

Section is formed between two adjacent nodes；

Node is modified；

If node belongs at pause, continue to correct next node until terminating；

2. the method that recorded audio is divided into section according to claim 1, it is characterized in that the side at the setting pause Method is that the average mute time of the Time Calculation according to mute part, the mute part that will be greater than the threshold value of average mute time is sentenced Break as at pause.

3. the method that recorded audio is divided into section according to claim 2, it is characterized in that described according to mute part The step of Time Calculation average mute time is the quantity for obtaining the total duration of mute part, and mute part, uses Jing Yin portion The total duration divided calculates average mute time divided by the quantity of mute part.

4. the method that recorded audio is divided into section according to claim 1, it is characterized in that the side at the setting pause Method is the median of the time for taking mute part and is set as at pause.

5. the method that recorded audio is divided into section according to claim 1, it is characterized in that the side at the setting pause Method is to record the sample of recorded audio according to custom word speed by person for recording, and the sample of the recorded audio is included at a pause, To be set as at the pause of recorded audio at the pause of the sample of recorded audio.

6. the method that recorded audio is divided into section according to claim 1, it is characterized in that the method for the amendment node Also include node before and/or node after character whether with node label storehouse in tag match, the node label storehouse is to deposit Stored up some sentences section start or word label that section terminates corpus.

7. the method that recorded audio is divided into section according to claim 1, it is characterized in that the method for the amendment node Also whether the character including decision node changes tag match with personage, and it is according to people in recording that the personage changes label What sound was differently formed is used to the personage's differentiation identifier distinguished.