CN105047203A - Audio processing method, device and terminal - Google Patents

Audio processing method, device and terminal

Info

Publication number
CN105047203A
Authority
CN
China
Prior art keywords
temporal characteristics
simple sentence
paragraph
character simple
numerical value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510271769.1A
Other languages
Chinese (zh)
Other versions
CN105047203B (en
Inventor
赵伟峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Kugou Computer Technology Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201510271769.1A priority Critical patent/CN105047203B/en
Publication of CN105047203A publication Critical patent/CN105047203A/en
Priority to EP16799218.9A priority patent/EP3340238B1/en
Priority to PCT/CN2016/081999 priority patent/WO2016188329A1/en
Priority to US15/576,198 priority patent/US20180158469A1/en
Priority to JP2018513709A priority patent/JP6586514B2/en
Application granted granted Critical
Publication of CN105047203B publication Critical patent/CN105047203B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Landscapes

  • Studio Circuits (AREA)

Abstract

The embodiments of the invention provide an audio processing method, device and terminal. The method comprises: obtaining a subtitle file corresponding to a target audio file, wherein the subtitle file consists of at least one character sentence; building a time characteristic sequence according to the time intervals between the at least one character sentence, wherein the time characteristic sequence comprises at least one time characteristic element; adjusting the value of each time characteristic element in the time characteristic sequence according to a preset total number of paragraphs; determining paragraph change times according to the value of at least one time characteristic element in the adjusted time characteristic sequence; and dividing the target audio file into the preset total number of paragraphs according to the paragraph change times. The method divides the target audio file into paragraphs according to the time intervals between the character sentences in the corresponding subtitle file, which improves the efficiency of paragraph division and the intelligence of audio processing.

Description

Audio processing method, device and terminal
Technical field
The present invention relates to the field of Internet technology, specifically to the field of audio processing technology, and in particular to an audio processing method, device and terminal.
Background art
With the development of Internet technology, Internet audio libraries contain a large number of audio files such as songs and song clips, and applications built on Internet audio are increasingly common, for example karaoke (K-song) systems and song listening systems. Many application scenarios require an audio file to be divided into paragraphs. For example, implementing segmented chorus singing in a karaoke system usually requires dividing the song into paragraphs; likewise, listening to a selected clip in a song listening system usually requires dividing the song into paragraphs. At present, audio files are usually divided into paragraphs manually. This segmentation is inefficient and cannot meet users' needs for audio files, which reduces the intelligence of audio processing.
Summary of the invention
The embodiments of the present invention provide an audio processing method, device and terminal that divide a target audio file into paragraphs based on the time intervals between the character sentences in the subtitle file corresponding to the audio file, improving segmentation efficiency and the intelligence of audio processing.
A first aspect of the embodiments of the present invention provides an audio processing method, which may comprise:
obtaining a subtitle file corresponding to a target audio file, the subtitle file consisting of at least one character sentence in sequence;
building a time characteristic sequence according to the time intervals between the at least one character sentence, the time characteristic sequence comprising at least one time characteristic element;
adjusting the value of each time characteristic element in the time characteristic sequence according to a preset total number of paragraphs;
determining paragraph change times according to the value of at least one time characteristic element in the adjusted time characteristic sequence;
dividing the target audio file into the preset total number of paragraphs according to the paragraph change times.
A second aspect of the embodiments of the present invention provides an audio processing device, which may comprise:
an acquiring unit, configured to obtain a subtitle file corresponding to a target audio file, the subtitle file consisting of at least one character sentence in sequence;
a construction unit, configured to build a time characteristic sequence according to the time intervals between the at least one character sentence, the time characteristic sequence comprising at least one time characteristic element;
an adjustment unit, configured to adjust the value of each time characteristic element in the time characteristic sequence according to a preset total number of paragraphs;
a determining unit, configured to determine paragraph change times according to the value of at least one time characteristic element in the adjusted time characteristic sequence;
a segmenting unit, configured to divide the target audio file into the preset total number of paragraphs according to the paragraph change times.
A third aspect of the embodiments of the present invention provides a terminal, which may comprise the audio processing device provided in the second aspect.
Implementing the embodiments of the present invention has the following beneficial effects:
In the embodiments of the present invention, a time characteristic sequence can be built according to the time intervals between the at least one character sentence in the subtitle file corresponding to the target audio file; the value of each time characteristic element in the sequence is adjusted according to the preset total number of paragraphs; paragraph change times are determined according to the value of at least one time characteristic element in the adjusted sequence; and the target audio file is then divided into the preset total number of paragraphs according to the paragraph change times. This process exploits the fact that character sentences in different subtitle paragraphs are separated by longer time intervals, so the target audio file is divided into paragraphs based on the time intervals between the character sentences in the subtitle file, which can improve segmentation efficiency and the intelligence of audio processing.
Brief description of the drawings
To illustrate the technical solutions in the embodiments of the present invention or in the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. The drawings described below show only some embodiments of the present invention; a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a flowchart of an audio processing method provided by an embodiment of the present invention;
Fig. 2 is a flowchart of another audio processing method provided by an embodiment of the present invention;
Fig. 3 is a schematic structural diagram of an audio processing device provided by an embodiment of the present invention;
Fig. 4 is a schematic structural diagram of an embodiment of the construction unit shown in Fig. 3;
Fig. 5 is a schematic structural diagram of an embodiment of the adjustment unit shown in Fig. 3;
Fig. 6 is a schematic structural diagram of an embodiment of the determining unit shown in Fig. 3.
Detailed description of the embodiments
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in the embodiments. The described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative effort fall within the scope of protection of the present invention.
In the embodiments of the present invention, an audio file may include but is not limited to files such as songs and song clips. A subtitle file may include but is not limited to files such as lyrics and lyric fragments. One audio file may correspond to one subtitle file. A subtitle file may consist of at least one character sentence arranged in sequence. Taking song A as an example, the subtitle file corresponding to song A can be expressed as follows:
[641,770],[641,20]a1[661,60]a2[721,170]a3[891,200]a4[1091,70]a5[1161,180]a6[1341,20]a7[1361,50]a8
[1541,180],[1541,20]b1[1561,50]b2[1611,20]b3[1631,30]b4[1661,0]b5[1661,10]b6[1671,20]b7[1701,30]b8
[1871,730],[1871,60]c1[1931,100]c2[2031,110]c3[2141,200]c4[2341,70]c5[2411,60]c6[2471,50]c7[2521,80]c8
……
In the subtitle file for song A above, "a1a2a3a4a5a6a7a8", "b1b2b3b4b5b6b7b8" and "c1c2c3c4c5c6c7c8" each denote a character sentence. The "[]" before each character sentence describes the time attributes of that sentence, usually in ms. For example, [641,770] describes the time attributes of the character sentence "a1a2a3a4a5a6a7a8": "641" is the start time of the sentence and "770" is its duration. Supposing song A lasts 5 minutes in total, the character sentence "a1a2a3a4a5a6a7a8" starts being sung at 641 ms and finishes after lasting 770 ms. Within each character sentence, the "[]" before each character describes the time attributes of that character, usually in ms. For example, [641,20] describes the time attributes of the character "a1": "641" is its start time and "20" is its duration. The order of the character sentences in the subtitle file is determined by their start times. For example, in the subtitle file for song A, "a1a2a3a4a5a6a7a8" is the first character sentence, "b1b2b3b4b5b6b7b8" is the second, "c1c2c3c4c5c6c7c8" is the third, and so on. The character sentences "a1a2a3a4a5a6a7a8" and "b1b2b3b4b5b6b7b8" are preceding character sentences of "c1c2c3c4c5c6c7c8"; the character sentences "b1b2b3b4b5b6b7b8" and "c1c2c3c4c5c6c7c8" are following character sentences of "a1a2a3a4a5a6a7a8"; and so on. Further, "a1a2a3a4a5a6a7a8" is the adjacent preceding character sentence of "b1b2b3b4b5b6b7b8", and "b1b2b3b4b5b6b7b8" is the adjacent following character sentence of "a1a2a3a4a5a6a7a8", and so on.
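The line format described above can be read with a few lines of code. The following is a minimal sketch, assuming each line has exactly the shape shown in the song A example (a sentence-level "[start,duration]" tag, a comma, then per-character tags); the names `Sentence`, `parse_sentence` and `_TAG` are hypothetical and do not come from the patent.

```python
import re
from dataclasses import dataclass


@dataclass
class Sentence:
    """Hypothetical record for one character sentence (times in ms)."""
    start_time: int
    end_time: int
    text: str


# Matches one "[start,duration]" tag, as used in the song A example.
_TAG = re.compile(r"\[(\d+),(\d+)\]")


def parse_sentence(line: str) -> Sentence:
    """Parse one subtitle line such as '[641,770],[641,20]a1[661,60]a2'."""
    head = _TAG.match(line)
    if head is None:
        raise ValueError(f"missing sentence time tag: {line!r}")
    start, duration = int(head.group(1)), int(head.group(2))
    # What follows the sentence-level tag and its comma is per-character data.
    body = line[head.end():].lstrip(",")
    # Removing the per-character tags leaves only the sentence text.
    text = _TAG.sub("", body)
    # The end time is derived as start + duration (an assumption of this sketch).
    return Sentence(start, start + duration, text)


s = parse_sentence("[641,770],[641,20]a1[661,60]a2[721,170]a3")
print(s)
```

Deriving the end time as start plus duration matches the description of "641" as the start time and "770" as the duration; a real subtitle file may store the end time directly.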
An audio file can be divided into multiple audio paragraphs, and there is usually a longer pause, that is, a longer time interval, between audio paragraphs. Correspondingly, a subtitle file can be divided into multiple subtitle paragraphs with longer time intervals between them; in other words, the character sentences on either side of a subtitle paragraph boundary are separated by a longer time interval. The embodiments of the present invention exploit this time-interval characteristic to divide the target audio file into paragraphs based on the time intervals between the character sentences in the subtitle file.
Based on the above description, the audio processing method provided by the embodiments of the present invention is described in detail below with reference to Figs. 1 and 2.
Referring to Fig. 1, which is a flowchart of an audio processing method provided by an embodiment of the present invention, the method may comprise the following steps S101 to S105.
S101, obtain the subtitle file corresponding to the target audio file, the subtitle file consisting of at least one character sentence in sequence.
One audio file corresponds to one subtitle file. The subtitle file comprises at least one character sentence and the key information of each character sentence; the key information of a character sentence comprises an identifier (ID), a start time (start_time) and an end time (end_time). Usually, an Internet audio library stores multiple audio files, the attributes of each audio file, and the subtitle file corresponding to each audio file, where the attributes of an audio file may include but are not limited to the audio features of the audio file and the identifier of the audio file. In this step, the subtitle file corresponding to the target audio file can be obtained from the Internet audio library. Specific ways of obtaining it may include but are not limited to: looking up the subtitle file corresponding to the target audio file in the Internet audio library according to the identifier of the target audio file, and obtaining the subtitle file found; or extracting the audio features of the target audio file and matching them against the audio features of the audio files in the Internet audio library, thereby locating the target audio file in the library and obtaining the corresponding subtitle file.
In this embodiment, suppose the target audio file is song A, and the structure of the subtitle file corresponding to song A is as in the example above. Suppose the subtitle file consists of N character sentences in sequence (N is a positive integer), denoted p(0) to p(N-1). Then p(0) denotes the first character sentence "a1a2a3a4a5a6a7a8", p(1) denotes the second character sentence "b1b2b3b4b5b6b7b8", p(2) denotes the third character sentence "c1c2c3c4c5c6c7c8", and so on, with p(N-1) denoting the N-th character sentence.
S102, build a time characteristic sequence according to the time intervals between the at least one character sentence, the time characteristic sequence comprising at least one time characteristic element.
The time characteristic sequence reflects the size of the time intervals between the at least one character sentence. In this step, the time intervals between the character sentences are calculated first: the interval between p(1) and p(0) is p(1).start_time - p(0).end_time; the interval between p(2) and p(1) is p(2).start_time - p(1).end_time; and so on, up to the interval between p(N-1) and p(N-2), which is p(N-1).start_time - p(N-2).end_time. The time characteristic sequence is then built according to the number and order of the character sentences and the calculated time intervals.
Following the example in this embodiment, suppose the time characteristic sequence is denoted t(n). The sequence t(n) then comprises N time characteristic elements in total, namely t(0), t(1), ..., t(N-1). The value of t(0) can be set to 0; the value of t(1) represents the time interval between p(1) and p(0); the value of t(2) represents the time interval between p(2) and p(1); and so on, with the value of t(N-1) representing the time interval between p(N-1) and p(N-2).
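The construction of t(n) in step S102 can be sketched as follows. This is an illustrative sketch only: the function name `build_time_sequence` and the representation of each character sentence as a `(start_time, end_time)` tuple in ms are assumptions, and the sample values are the three sentence-level tags of the song A example with each end time computed as start plus duration.

```python
from typing import List, Tuple


def build_time_sequence(sentences: List[Tuple[int, int]]) -> List[int]:
    """Build t(n) from (start_time, end_time) pairs ordered by start time.

    t(0) is set to 0; for n >= 1, t(n) is the gap between the start of
    sentence n and the end of sentence n-1.
    """
    t = [0] * len(sentences)
    for n in range(1, len(sentences)):
        t[n] = sentences[n][0] - sentences[n - 1][1]
    return t


# p(0), p(1), p(2) from the song A example: [641,770], [1541,180], [1871,730].
sentences = [(641, 1411), (1541, 1721), (1871, 2601)]
t = build_time_sequence(sentences)
print(t)  # [0, 130, 150]
```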
S103, adjust the value of each time characteristic element in the time characteristic sequence according to the preset total number of paragraphs.
The preset total number of paragraphs can be set according to the user's actual segmentation needs for the target audio file. Suppose M (M is a positive integer and M>1) denotes the preset total number of paragraphs. The purpose of adjusting the value of each time characteristic element in t(n) according to M is to make the adjusted sequence t(n) yield exactly the turning points corresponding to M subtitle paragraphs, thereby meeting the actual segmentation needs for the target audio file.
S104, determine the paragraph change times according to the value of at least one time characteristic element in the adjusted time characteristic sequence.
The values of the time characteristic elements in the adjusted sequence t(n) reflect the turning points corresponding to the M subtitle paragraphs. In this step, the start and end times of the M subtitle paragraphs can therefore be obtained from the subtitle file according to the value of at least one time characteristic element in the adjusted sequence.
S105, divide the target audio file into the preset total number of paragraphs according to the paragraph change times.
Because the audio file and the subtitle file correspond to each other, the target audio file can be divided accordingly, based on the obtained start and end times of the M subtitle paragraphs, to obtain M audio paragraphs.
In the embodiments of the present invention, a time characteristic sequence can be built according to the time intervals between the at least one character sentence in the subtitle file corresponding to the target audio file; the value of each time characteristic element in the sequence is adjusted according to the preset total number of paragraphs; paragraph change times are determined according to the value of at least one time characteristic element in the adjusted sequence; and the target audio file is then divided into the preset total number of paragraphs according to the paragraph change times. This process exploits the fact that character sentences in different subtitle paragraphs are separated by longer time intervals, so the target audio file is divided into paragraphs based on the time intervals between the character sentences in the subtitle file, which can improve segmentation efficiency and the intelligence of audio processing.
Referring to Fig. 2, which is a flowchart of another audio processing method provided by an embodiment of the present invention, the method may comprise the following steps S201 to S211.
S201, obtain the subtitle file corresponding to the target audio file, the subtitle file consisting of at least one character sentence in sequence.
In this embodiment, suppose the target audio file is song A, and the structure of the subtitle file corresponding to song A is as in the example above. Suppose the subtitle file consists of N character sentences in sequence (N is a positive integer), denoted p(0) to p(N-1). Then p(0) denotes the first character sentence "a1a2a3a4a5a6a7a8", p(1) denotes the second character sentence "b1b2b3b4b5b6b7b8", p(2) denotes the third character sentence "c1c2c3c4c5c6c7c8", and so on, with p(N-1) denoting the N-th character sentence.
For step S201 of this embodiment, refer to step S101 of the embodiment shown in Fig. 1; details are not repeated here.
S202, determine the number of time characteristic elements for building the time characteristic sequence according to the number of the at least one character sentence.
The subtitle file consists of N character sentences in sequence (N is a positive integer), that is, the number of character sentences is N. This step therefore determines that the number of time characteristic elements in the time characteristic sequence is also N, that is, the length of the sequence is N. Suppose the time characteristic sequence is denoted t(n); the sequence t(n) then comprises N time characteristic elements in total, namely t(0), t(1), ..., t(N-1).
S203, determine the index of each time characteristic element for building the time characteristic sequence according to the order of each character sentence among the at least one character sentence.
The N character sentences of the subtitle file are ordered p(0), p(1), ..., p(N-1). Suppose that in the time characteristic sequence t(n), t(0) corresponds to p(0), t(1) corresponds to p(1), and so on, with t(N-1) corresponding to p(N-1). Then in t(n), the index of t(0) is 1, that is, it is the first time characteristic element; the index of t(1) is 2, that is, it is the second time characteristic element; and so on, with the index of t(N-1) being N, that is, it is the N-th time characteristic element.
S204, for any target character sentence among the at least one character sentence, set the time interval between the target character sentence and its adjacent preceding character sentence as the value of the time characteristic element corresponding to the target character sentence.
The specific processing of step S204 may comprise the following steps s11 and s12:
s11, calculate the time interval between each character sentence and its adjacent preceding character sentence: the interval between p(1) and p(0) is p(1).start_time - p(0).end_time; the interval between p(2) and p(1) is p(2).start_time - p(1).end_time; and so on, up to the interval between p(N-1) and p(N-2), which is p(N-1).start_time - p(N-2).end_time.
s12, set each calculated time interval as the value of the corresponding time characteristic element; thus t(0)=0, t(1)=p(1).start_time - p(0).end_time, t(2)=p(2).start_time - p(1).end_time, and so on, up to t(N-1)=p(N-1).start_time - p(N-2).end_time.
S205, build the time characteristic sequence according to the number, indexes and values of the time characteristic elements.
The time characteristic sequence built is t(n), which consists of the N time characteristic elements t(0), t(1), ..., t(N-1) in sequence, with values t(0)=0, t(1)=p(1).start_time - p(0).end_time, t(2)=p(2).start_time - p(1).end_time, and so on, up to t(N-1)=p(N-1).start_time - p(N-2).end_time.
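The element values set in steps S204 and S205 can be written compactly as a piecewise definition. The block below only restates the text in LaTeX notation (the `cases` environment assumes the amsmath package); the symbols t, p and N are those of the description.

```latex
t(n) =
\begin{cases}
0, & n = 0,\\
p(n).\mathrm{start\_time} - p(n-1).\mathrm{end\_time}, & 1 \le n \le N-1.
\end{cases}
```

As a worked instance with the song A example, p(0) is tagged [641,770] and therefore ends at 641 + 770 = 1411 ms, while p(1) starts at 1541 ms, so t(1) = 1541 - 1411 = 130 ms.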
Steps S202 to S205 of this embodiment may be a specific refinement of step S102 of the embodiment shown in Fig. 1.
S206, search the time characteristic sequence for the time characteristic elements with the largest values, their number being the preset total number of paragraphs minus 1.
Suppose M (M is a positive integer and M>1) denotes the preset total number of paragraphs; this step searches t(n) for the time characteristic elements with the M-1 largest values.
S207, adjust the values of the time characteristic elements found to a target value, and adjust the values of the other time characteristic elements in the time characteristic sequence to a reference value. The target value and the reference value can be set according to actual needs; in this embodiment, the target value may be set to 1 and the reference value to 0.
The specific processing of steps S206 and S207 may be as follows: first traverse the values of the time characteristic elements in t(n) and find the element with the largest value; after excluding the element found, traverse the values of the time characteristic elements in t(n) again and find the element with the largest value among the remainder; repeat this traversal until M-1 largest values have been found. Finally, adjust the M-1 largest values found in t(n) to 1 and all other values to 0.
Steps S206 and S207 of this embodiment may be a specific refinement of step S103 of the embodiment shown in Fig. 1. Because M subtitle paragraphs correspond to exactly M-1 paragraph turning points, steps S206 and S207 make the adjusted sequence t(n) yield exactly the M-1 paragraph turning points corresponding to the M subtitle paragraphs, thereby meeting the actual segmentation needs for the target audio file.
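Steps S206 and S207 amount to marking the M-1 largest gaps. The sketch below uses a single sort instead of the repeated traversal described in the text, which selects the same elements; the function name `mark_turning_points`, the tie-breaking rule (the earlier element wins) and the sample gap values are assumptions, since the description does not say how equal values are handled.

```python
from typing import List


def mark_turning_points(t: List[int], m: int,
                        target: int = 1, reference: int = 0) -> List[int]:
    """Set the m-1 largest elements of t to `target` and the rest to `reference`."""
    if m < 2 or m - 1 > len(t):
        raise ValueError("need 1 < m and m - 1 <= len(t)")
    # Positions ordered by value, largest first. sorted() is stable, so equal
    # values keep their original order (an assumption of this sketch).
    by_value = sorted(range(len(t)), key=lambda i: t[i], reverse=True)
    chosen = set(by_value[: m - 1])
    return [target if i in chosen else reference for i in range(len(t))]


# Hypothetical gap values in ms; m = 3 paragraphs means 2 turning points.
t = [0, 130, 150, 900, 120, 1500, 90]
adjusted = mark_turning_points(t, m=3)
print(adjusted)  # [0, 0, 0, 1, 0, 1, 0]
```

Sorting costs O(N log N), whereas the repeated traversal in the text costs O(N·M); for song-length subtitle files either is negligible.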
S208, obtain, from the adjusted time characteristic sequence, the target indexes corresponding to the time characteristic elements whose value is the target value. This step obtains the target indexes of the elements whose value is 1, that is, the indexes of the M-1 time characteristic elements found.
S209, locate the character sentences at which paragraphs turn in the subtitle file according to the target indexes.
Suppose one of the target indexes is 5. The character sentence at which a paragraph turns is then the 5th character sentence in the subtitle file; that is, the 5th character sentence is the starting position of a subtitle paragraph, and the 1st to 4th character sentences form one subtitle paragraph. In the same way, the character sentences of all M-1 paragraph turns can be located.
S210, read the paragraph change times from the subtitle file according to the character sentences at which paragraphs turn.
Because the subtitle file records the key information of each character sentence, including its start time and end time, this step can read the paragraph change times from the subtitle file. Following the example in this embodiment, the 1st to 4th character sentences form one subtitle paragraph, so the paragraph change times read are the end time of the 4th character sentence and the start time of the 5th character sentence.
Steps S208 to S210 of this embodiment may be a specific refinement of step S104 of the embodiment shown in Fig. 1. The start and end times of the M subtitle paragraphs can be obtained through steps S208 to S210.
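Steps S208 to S210 can be sketched as a scan over the adjusted sequence. In this sketch the function name `paragraph_change_times` is hypothetical, sentences are `(start_time, end_time)` tuples in ms, and the timings after the third sentence are invented for illustration. Note that Python lists are zero-based, so list position 4 is the element the description calls index 5.

```python
from typing import List, Tuple


def paragraph_change_times(adjusted: List[int],
                           sentences: List[Tuple[int, int]],
                           target: int = 1) -> List[Tuple[int, int]]:
    """For each marked element, return (end of previous sentence, start of this one)."""
    changes = []
    for i, value in enumerate(adjusted):
        # Position 0 has no preceding sentence, so it cannot be a turning point.
        if value == target and i > 0:
            changes.append((sentences[i - 1][1], sentences[i][0]))
    return changes


# Six sentences; the 5th one (list position 4) starts a new paragraph.
sentences = [(641, 1411), (1541, 1721), (1871, 2601),
             (2700, 3400), (5200, 6000), (6100, 6900)]
adjusted = [0, 0, 0, 0, 1, 0]
changes = paragraph_change_times(adjusted, sentences)
print(changes)  # [(3400, 5200)]
```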
S211, divide the target audio file into the preset total number of paragraphs according to the paragraph change times.
Because the audio file and the subtitle file correspond to each other, the target audio file can be divided accordingly, based on the obtained start and end times of the M subtitle paragraphs, to obtain M audio paragraphs.
For step S211 of this embodiment, refer to step S105 of the embodiment shown in Fig. 1; details are not repeated here.
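One way to turn the located turning sentences into M paragraph time spans is sketched below. It assumes each paragraph runs from the start time of its first character sentence to the end time of its last one; the description leaves the exact cut position inside the gap open, so this is one possible choice rather than the patented rule. The name `paragraph_spans` and the sample timings after the third sentence are hypothetical, and cutting the actual audio samples is outside the scope of the sketch.

```python
from typing import List, Tuple


def paragraph_spans(sentences: List[Tuple[int, int]],
                    turning_positions: List[int]) -> List[Tuple[int, int]]:
    """Return one (start_ms, end_ms) span per paragraph.

    turning_positions holds zero-based positions of the sentences that
    start a new paragraph (M-1 of them for M paragraphs).
    """
    if any(pos <= 0 or pos >= len(sentences) for pos in turning_positions):
        raise ValueError("turning positions must lie in 1..len(sentences)-1")
    starts = [0] + sorted(turning_positions)
    ends = starts[1:] + [len(sentences)]
    # Paragraph k covers sentences starts[k] .. ends[k]-1.
    return [(sentences[a][0], sentences[b - 1][1]) for a, b in zip(starts, ends)]


sentences = [(641, 1411), (1541, 1721), (1871, 2601),
             (2700, 3400), (5200, 6000), (6100, 6900)]
spans = paragraph_spans(sentences, [4])
print(spans)  # [(641, 3400), (5200, 6900)]
```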
In the embodiment of the present invention, temporal characteristics sequence can be built according to the time interval between at least one the character simple sentence in subtitle file corresponding to target audio file, according to the numerical value of each temporal characteristics element in the described temporal characteristics sequence of default paragraph sum adjustment, and according to the numerical value determination paragraph transformation period of at least one temporal characteristics element in the described temporal characteristics sequence after adjustment, then be the paragraph of described default paragraph sum by described target audio Divide File according to described paragraph transformation period, this audio processing process utilizes the time interval feature of the character simple sentence between captions paragraph, realize dividing the paragraph of target audio file based on the time interval between the character simple sentence in subtitle file, staging treating efficiency can be promoted, promote the intelligent of audio frequency process.
The structure and function of the audio processing device provided by the embodiments of the present invention are described in detail below with reference to Figs. 3 to 6. It should be noted that the devices shown in Figs. 3 to 6 can run in a terminal to perform the methods shown in Figs. 1 and 2.
Referring to Fig. 3, which is a schematic structural diagram of an audio processing device provided by an embodiment of the present invention, the device may include: an acquiring unit 101, a construction unit 102, an adjustment unit 103, a determining unit 104 and a segmenting unit 105.
Acquiring unit 101 is configured to obtain the subtitle file corresponding to a target audio file, where the subtitle file consists of at least one character sentence arranged in order.
One audio file corresponds to one subtitle file. The subtitle file includes at least one character sentence and the key information of each character sentence; the key information of a character sentence includes an identifier (ID), a start time (start_time) and an end time (end_time). Typically, an Internet audio library stores multiple audio files, the attributes of each audio file, and the subtitle file corresponding to each audio file, where the attributes of an audio file may include, but are not limited to, the audio features of the audio file, the identifier of the audio file, and so on. The acquiring unit 101 can obtain the subtitle file corresponding to the target audio file from the Internet audio library. Specific ways of obtaining it may include, but are not limited to: searching the Internet audio library for the subtitle file corresponding to the target audio file according to the identifier of the target audio file, and obtaining the subtitle file found; or extracting the audio features of the target audio file and matching them against the audio features of the audio files in the Internet audio library, thereby locating the target audio file in the library and obtaining the corresponding subtitle file.
In this embodiment of the present invention, assume that the target audio file is song A; for the structure of the subtitle file corresponding to song A, see the example shown in this embodiment. Assume that the subtitle file consists of N character sentences in order (N is a positive integer), denoted p(0) to p(N-1). Then p(0) denotes the first character sentence "a1a2a3a4a5a6a7a8", p(1) denotes the second character sentence "b1b2b3b4b5b6b7b8", p(2) denotes the third character sentence "c1c2c3c4c5c6c7c8", and so on, with p(N-1) denoting the N-th character sentence.
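The subtitle structure described above can be sketched in Python as follows. This is only an illustrative sketch: the class name CharacterSentence, the helper load_sentences, the millisecond time unit and the sample timings are assumptions and do not come from the specification, which only names the fields ID, start_time and end_time.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CharacterSentence:
    """One character sentence p(i) together with its key information."""
    id: int
    start_time: int  # assumed unit: milliseconds
    end_time: int    # assumed unit: milliseconds
    text: str = ""


def load_sentences(rows):
    """Build the ordered list p(0)..p(N-1) from (id, start, end, text) rows."""
    sentences = [CharacterSentence(*row) for row in rows]
    # The subtitle file already lists sentences in order; sorting by start
    # time merely enforces that ordering for this sketch.
    return sorted(sentences, key=lambda s: s.start_time)


# Hypothetical timings for the first three sentences of song A.
rows = [
    (1, 0, 4000, "a1a2a3a4a5a6a7a8"),
    (2, 4500, 8000, "b1b2b3b4b5b6b7b8"),
    (3, 8300, 12000, "c1c2c3c4c5c6c7c8"),
]
p = load_sentences(rows)
```

With these assumed timings, p[1].start_time - p[0].end_time gives the interval between the second and first character sentences.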
Construction unit 102 is configured to build a time feature sequence according to the time intervals between the at least one character sentence, where the time feature sequence includes at least one time feature element.
The time feature sequence reflects the size of the time intervals between the character sentences. The construction unit 102 first calculates the time intervals between the character sentences: the interval between p(1) and p(0) is p(1).start_time - p(0).end_time; the interval between p(2) and p(1) is p(2).start_time - p(1).end_time; and so on, up to the interval between p(N-1) and p(N-2), which is p(N-1).start_time - p(N-2).end_time. The construction unit 102 then builds the time feature sequence from the number and order of the character sentences and the calculated time intervals.
Following the example in this embodiment, assume that t(n) denotes the time feature sequence. The constructed sequence t(n) then contains N time feature elements in total, namely t(0), t(1), ..., t(N-1). The value of t(0) can be set to 0; the value of t(1) represents the time interval between p(1) and p(0); the value of t(2) represents the time interval between p(2) and p(1); and so on, with the value of t(N-1) representing the time interval between p(N-1) and p(N-2).
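A minimal Python sketch of this construction is given below. The function name build_time_feature_sequence, the representation of each character sentence as a (start_time, end_time) pair, and the sample timings are assumptions made for illustration.

```python
def build_time_feature_sequence(sentences):
    """Return t(0)..t(N-1) for sentences given as (start_time, end_time) pairs."""
    if not sentences:
        return []
    t = [0]  # t(0) has no preceding sentence, so it is set to 0
    for prev, cur in zip(sentences, sentences[1:]):
        # t(n) = p(n).start_time - p(n-1).end_time
        t.append(cur[0] - prev[1])
    return t


# Hypothetical timings (milliseconds) for four character sentences.
p = [(0, 4000), (4500, 8000), (8300, 12000), (15000, 19000)]
t = build_time_feature_sequence(p)
print(t)  # [0, 500, 300, 3000]
```

In this made-up example the large value t(3) = 3000 marks a long pause before the fourth sentence, which is the kind of gap the later steps treat as a candidate paragraph turn.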
Adjustment unit 103 is configured to adjust the value of each time feature element in the time feature sequence according to the preset total number of paragraphs.
The preset total number of paragraphs can be set according to the user's actual segmentation requirements for the target audio file. Assume that M (M is a positive integer and M>1) denotes the preset total number of paragraphs. The purpose of having the adjustment unit 103 adjust the value of each time feature element in the time feature sequence t(n) according to M is that the turning points corresponding to exactly M subtitle paragraphs can be extracted from the adjusted sequence t(n), thereby meeting the actual segmentation requirement for the target audio file.
Determining unit 104 is configured to determine the paragraph change times according to the values of at least one time feature element in the adjusted time feature sequence.
The values of the time feature elements in the adjusted time feature sequence t(n) reflect the turning points corresponding to the M subtitle paragraphs, so the determining unit 104 can obtain the start and end times of the M subtitle paragraphs from the subtitle file according to the values of at least one time feature element in the adjusted sequence.
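One way to derive the start and end times of the M subtitle paragraphs from an adjusted sequence can be sketched as follows. The function name subtitle_paragraph_times, the (start_time, end_time) pair representation, the marker value 1 and the sample data are assumptions for illustration only.

```python
def subtitle_paragraph_times(adjusted, sentences, target=1):
    """Return one (start_time, end_time) pair per subtitle paragraph.

    adjusted:  adjusted time feature sequence, one value per sentence
    sentences: (start_time, end_time) pairs in subtitle order
    """
    if not sentences:
        return []
    # Every element equal to `target` marks a sentence that opens a new paragraph.
    starts = [0] + [i for i, v in enumerate(adjusted) if v == target and i > 0]
    bounds = starts + [len(sentences)]
    return [(sentences[a][0], sentences[b - 1][1])
            for a, b in zip(bounds, bounds[1:])]


# Hypothetical data: six sentences, turning points before sentences 4 and 6.
sentences = [(0, 4000), (4500, 8000), (8300, 12000),
             (15000, 19000), (19400, 23000), (26000, 30000)]
adjusted = [0, 0, 0, 1, 0, 1]
paragraphs = subtitle_paragraph_times(adjusted, sentences)
print(paragraphs)  # [(0, 12000), (15000, 23000), (26000, 30000)]
```

Two marked elements produce three paragraphs, matching the relation between M-1 turning points and M paragraphs.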
Segmenting unit 105 is configured to divide the target audio file into the preset total number of paragraphs according to the paragraph change times.
Because the audio file and the subtitle file correspond to each other, the segmenting unit 105 can divide the target audio file according to the start and end times of the M subtitle paragraphs obtained, yielding M audio paragraphs.
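The division itself can be sketched as slicing decoded audio samples by the paragraph times. The specification does not say how the audio is represented, so the mono sample list, the sample rate, the millisecond unit and the function name split_audio below are all assumptions.

```python
def split_audio(samples, sample_rate, paragraph_times_ms):
    """Cut a mono sample list into one slice per (start_ms, end_ms) paragraph."""
    paragraphs = []
    for start_ms, end_ms in paragraph_times_ms:
        lo = start_ms * sample_rate // 1000  # first sample of the paragraph
        hi = end_ms * sample_rate // 1000    # one past the last sample
        paragraphs.append(samples[lo:hi])
    return paragraphs


# 8 seconds of placeholder "audio" at an assumed 1000 samples per second.
samples = list(range(8000))
parts = split_audio(samples, 1000, [(0, 3000), (3500, 8000)])
print(len(parts), len(parts[0]), len(parts[1]))  # 2 3000 4500
```

A real implementation would decode and re-encode the audio file with a suitable audio library; the slicing arithmetic is the only part this sketch aims to show.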
Referring to Fig. 4, which is a schematic structural diagram of an embodiment of the construction unit shown in Fig. 3, the construction unit 102 may include: a quantity determining unit 1001, an index determining unit 1002, a value setting unit 1003 and a sequence construction unit 1004.
Quantity determining unit 1001 is configured to determine, according to the number of the at least one character sentence, the number of time feature elements used to build the time feature sequence.
The subtitle file consists of N character sentences in order (N is a positive integer); that is, the number of the at least one character sentence is N. The quantity determining unit 1001 can therefore determine that the number of time feature elements in the time feature sequence is also N, i.e. the length of the time feature sequence is N. Assume that t(n) denotes the time feature sequence; the constructed sequence t(n) then contains N time feature elements in total, namely t(0), t(1), ..., t(N-1).
Index determining unit 1002 is configured to determine, according to the order of each character sentence among the at least one character sentence, the index of each time feature element used to build the time feature sequence.
The N character sentences of the subtitle file are arranged in the order p(0), p(1), ..., p(N-1). Assume that in the time feature sequence t(n), t(0) corresponds to p(0), t(1) corresponds to p(1), and so on, with t(N-1) corresponding to p(N-1). Then, in t(n), the index of t(0) is 1, i.e. the first time feature element; the index of t(1) is 2, i.e. the second time feature element; and so on, the index of t(N-1) being N, i.e. the N-th time feature element.
Value setting unit 1003 is configured to, for any target character sentence among the at least one character sentence, set the time interval between the target character sentence and the character sentence immediately preceding it as the value of the time feature element corresponding to the target character sentence.
The specific processing of the value setting unit 1003 may include the following steps A and B:
A. Calculate the time interval between each character sentence and the character sentence immediately preceding it. Here, the interval between p(1) and p(0) is p(1).start_time - p(0).end_time; the interval between p(2) and p(1) is p(2).start_time - p(1).end_time; and so on, up to the interval between p(N-1) and p(N-2), which is p(N-1).start_time - p(N-2).end_time.
B. Set each calculated time interval as the value of the corresponding time feature element. Thus t(0)=0, t(1)=p(1).start_time-p(0).end_time, t(2)=p(2).start_time-p(1).end_time, and so on, up to t(N-1)=p(N-1).start_time-p(N-2).end_time.
Sequence construction unit 1004 is configured to build the time feature sequence according to the number, indexes and values of the time feature elements used to build it.
The constructed time feature sequence is t(n), which consists of the N time feature elements t(0), t(1), ..., t(N-1) in order. The values of the time feature elements in t(n) are t(0)=0, t(1)=p(1).start_time-p(0).end_time, t(2)=p(2).start_time-p(1).end_time, and so on, up to t(N-1)=p(N-1).start_time-p(N-2).end_time.
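The element values listed above can be summarized in a single piecewise definition. The LaTeX below only restates the values given in the text, using the same symbols t(n), p(n), start_time and end_time; the compact notation itself is added here for readability.

```latex
t(n) =
\begin{cases}
0, & n = 0, \\
p(n).\mathrm{start\_time} - p(n-1).\mathrm{end\_time}, & 1 \le n \le N-1 .
\end{cases}
```

Under the indexing convention of this embodiment, the element t(n) carries index n+1, so t(0) has index 1 and t(N-1) has index N.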
Referring to Fig. 5, which is a schematic structural diagram of an embodiment of the adjustment unit shown in Fig. 3, the adjustment unit 103 may include: an element search unit 2001 and a value adjustment unit 2002.
Element search unit 2001 is configured to search the time feature sequence for the time feature elements holding the largest values, the number of elements searched for being the preset number of paragraphs minus 1.
Assume that M (M is a positive integer and M>1) denotes the preset total number of paragraphs. The element search unit 2001 then needs to find, in the time feature sequence t(n), the time feature elements holding the M-1 largest values.
Value adjustment unit 2002 is configured to adjust the values of the time feature elements found to a target value, and to adjust the values of the other time feature elements in the time feature sequence, i.e. those not found, to a reference value. The target value and the reference value can be set according to actual needs; in this embodiment of the present invention, the target value may be set to 1 and the reference value to 0.
The specific processing of the element search unit 2001 and the value adjustment unit 2002 may be as follows. First, the element search unit 2001 traverses the values of the time feature elements in the time feature sequence t(n) and finds the time feature element holding the largest value. After excluding the element found, it traverses the values of the time feature elements in t(n) again and finds the element holding the largest remaining value. This traversal is repeated until M-1 largest values have been found. Finally, the value adjustment unit 2002 adjusts the M-1 largest values found in t(n) to 1 and all other values to 0.
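The repeated traversal described above can be sketched in Python as follows. The function name adjust_sequence and the sample values are assumptions; the text does not say how equal values are handled, so the sketch simply takes the earliest element in case of a tie.

```python
def adjust_sequence(t, m, target=1, reference=0):
    """Set the M-1 largest values of t to `target` and all others to `reference`."""
    remaining = list(range(len(t)))  # positions not yet excluded
    found = set()
    for _ in range(min(m - 1, len(t))):
        # One traversal over the non-excluded elements to find the current maximum.
        best = max(remaining, key=lambda i: t[i])
        found.add(best)
        remaining.remove(best)  # exclude it before the next traversal
    return [target if i in found else reference for i in range(len(t))]


t = [0, 500, 300, 3000, 400, 2500, 350]  # hypothetical intervals in ms
print(adjust_sequence(t, 3))  # [0, 0, 0, 1, 0, 1, 0]
```

With M = 3, the two largest intervals (3000 and 2500) are marked with 1, which yields two turning points and therefore three paragraphs. Sorting the positions once would give the same result with fewer passes; the loop form is kept here only because it mirrors the traversal described in the text.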
Referring to Fig. 6, which is a schematic structural diagram of an embodiment of the determining unit shown in Fig. 3, the determining unit 104 may include: a target index acquiring unit 3001, a positioning unit 3002 and a time reading unit 3003.
Target index acquiring unit 3001 is configured to obtain, from the adjusted time feature sequence, the target indexes corresponding to the time feature elements whose value is the target value.
Following the example of the embodiment shown in Fig. 5, the target index acquiring unit 3001 needs to obtain the target indexes corresponding to the time feature elements whose value is 1, that is, the indexes of the M-1 time feature elements found.
Positioning unit 3002 is configured to locate, in the subtitle file, the character sentences at which paragraphs turn, according to the target indexes.
Assume that one of the target indexes is 5. The positioning unit 3002 can then determine that the character sentence at a paragraph turn in the subtitle file is the 5th character sentence; that is, the 5th character sentence is the starting position of a subtitle paragraph, so the 1st to 4th character sentences in the subtitle file form one subtitle paragraph. In the same way, the character sentences at all M-1 paragraph turns can be located.
Time reading unit 3003 is configured to read the paragraph change times from the subtitle file according to the character sentences at which paragraphs turn.
Because the subtitle file records the key information of each character sentence, including its start time and end time, the time reading unit 3003 can read the paragraph change times from the subtitle file. Following the example in this embodiment, the 1st to 4th character sentences form one subtitle paragraph, so the paragraph change times read are the end time of the 4th character sentence and the start time of the 5th character sentence.
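The three sub-steps (obtaining target indexes, locating the turning sentence, reading the times) can be sketched together in Python. The function name paragraph_change_times, the (start_time, end_time) pair representation and the sample timings are assumptions for illustration.

```python
def paragraph_change_times(adjusted, sentences, target=1):
    """Return (end of previous sentence, start of turning sentence) per turning point.

    adjusted:  adjusted time feature sequence, one value per sentence
    sentences: (start_time, end_time) pairs in subtitle order
    """
    changes = []
    for i, value in enumerate(adjusted):
        if value == target and i > 0:
            # Position i (0-based) is target index i+1 in the text's 1-based
            # numbering; that sentence opens a new subtitle paragraph.
            changes.append((sentences[i - 1][1], sentences[i][0]))
    return changes


# Hypothetical timings; the long pause lies before the 5th sentence.
sentences = [(0, 4000), (4500, 8000), (8300, 12000),
             (12400, 16000), (19000, 23000), (23500, 27000)]
adjusted = [0, 0, 0, 0, 1, 0]  # target index 5
print(paragraph_change_times(adjusted, sentences))  # [(16000, 19000)]
```

The single pair returned is the end time of the 4th character sentence and the start time of the 5th, as in the example in the text.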
An embodiment of the present invention also discloses a terminal, which may be a device such as a PC (Personal Computer), a notebook computer, a mobile phone, a PAD (tablet computer), a vehicle-mounted terminal or a smart wearable device. The terminal may include an audio processing device; for the structure and function of this device, refer to the descriptions of the embodiments shown in Figs. 3 to 6 above, which are not repeated here.
A person of ordinary skill in the art will understand that all or part of the procedures in the methods of the above embodiments can be completed by a computer program instructing the relevant hardware. The program can be stored in a computer-readable storage medium and, when executed, may include the procedures of the embodiments of the methods described above. The storage medium may be a magnetic disk, an optical disc, a read-only memory (Read-Only Memory, ROM), a random access memory (Random Access Memory, RAM), or the like.
The above discloses only preferred embodiments of the present invention, which of course cannot be used to limit the scope of rights of the present invention; equivalent variations made according to the claims of the present invention therefore still fall within the scope covered by the present invention.

Claims (11)

1. An audio processing method, characterized by comprising:
obtaining the subtitle file corresponding to a target audio file, the subtitle file consisting of at least one character sentence in order;
building a time feature sequence according to the time intervals between the at least one character sentence, the time feature sequence comprising at least one time feature element;
adjusting the value of each time feature element in the time feature sequence according to a preset total number of paragraphs;
determining paragraph change times according to the values of at least one time feature element in the adjusted time feature sequence;
dividing the target audio file into the preset total number of paragraphs according to the paragraph change times.
2. The method according to claim 1, characterized in that building the time feature sequence according to the time intervals between the at least one character sentence comprises:
determining, according to the number of the at least one character sentence, the number of time feature elements used to build the time feature sequence;
determining, according to the order of each character sentence among the at least one character sentence, the index of each time feature element used to build the time feature sequence;
for any target character sentence among the at least one character sentence, setting the time interval between the target character sentence and the character sentence immediately preceding it as the value of the time feature element corresponding to the target character sentence;
building the time feature sequence according to the number, indexes and values of the time feature elements used to build it.
3. The method according to claim 2, characterized in that adjusting the value of each time feature element in the time feature sequence according to the preset total number of paragraphs comprises:
searching the time feature sequence for the time feature elements holding the largest values, the number of elements searched for being the preset number of paragraphs minus 1;
adjusting the values of the time feature elements found to a target value, and adjusting the values of the other time feature elements in the time feature sequence, other than those found, to a reference value.
4. The method according to claim 3, characterized in that determining the paragraph change times according to the values of at least one time feature element in the adjusted time feature sequence comprises:
obtaining, from the adjusted time feature sequence, the target indexes corresponding to the time feature elements whose value is the target value;
locating, in the subtitle file, the character sentences at which paragraphs turn, according to the target indexes;
reading the paragraph change times from the subtitle file according to the character sentences at which paragraphs turn.
5. The method according to any one of claims 1 to 4, characterized in that the subtitle file comprises at least one character sentence and the key information of each character sentence;
the key information of a character sentence comprises: an identifier, a start time and an end time.
6. An audio processing device, characterized by comprising:
an acquiring unit, configured to obtain the subtitle file corresponding to a target audio file, the subtitle file consisting of at least one character sentence in order;
a construction unit, configured to build a time feature sequence according to the time intervals between the at least one character sentence, the time feature sequence comprising at least one time feature element;
an adjustment unit, configured to adjust the value of each time feature element in the time feature sequence according to a preset total number of paragraphs;
a determining unit, configured to determine paragraph change times according to the values of at least one time feature element in the adjusted time feature sequence;
a segmenting unit, configured to divide the target audio file into the preset total number of paragraphs according to the paragraph change times.
7. The device according to claim 6, characterized in that the construction unit comprises:
a quantity determining unit, configured to determine, according to the number of the at least one character sentence, the number of time feature elements used to build the time feature sequence;
an index determining unit, configured to determine, according to the order of each character sentence among the at least one character sentence, the index of each time feature element used to build the time feature sequence;
a value setting unit, configured to, for any target character sentence among the at least one character sentence, set the time interval between the target character sentence and the character sentence immediately preceding it as the value of the time feature element corresponding to the target character sentence;
a sequence construction unit, configured to build the time feature sequence according to the number, indexes and values of the time feature elements used to build it.
8. The device according to claim 7, characterized in that the adjustment unit comprises:
an element search unit, configured to search the time feature sequence for the time feature elements holding the largest values, the number of elements searched for being the preset number of paragraphs minus 1;
a value adjustment unit, configured to adjust the values of the time feature elements found to a target value, and to adjust the values of the other time feature elements in the time feature sequence, other than those found, to a reference value.
9. The device according to claim 8, characterized in that the determining unit comprises:
a target index acquiring unit, configured to obtain, from the adjusted time feature sequence, the target indexes corresponding to the time feature elements whose value is the target value;
a positioning unit, configured to locate, in the subtitle file, the character sentences at which paragraphs turn, according to the target indexes;
a time reading unit, configured to read the paragraph change times from the subtitle file according to the character sentences at which paragraphs turn.
10. The device according to any one of claims 6 to 9, characterized in that the subtitle file comprises at least one character sentence and the key information of each character sentence;
the key information of a character sentence comprises: an identifier, a start time and an end time.
11. A terminal, characterized by comprising the audio processing device according to any one of claims 6 to 10.
CN201510271769.1A 2015-05-25 2015-05-25 Audio processing method, device and terminal Active CN105047203B (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
CN201510271769.1A CN105047203B (en) 2015-05-25 2015-05-25 Audio processing method, device and terminal
EP16799218.9A EP3340238B1 (en) 2015-05-25 2016-05-13 Method and device for audio processing
PCT/CN2016/081999 WO2016188329A1 (en) 2015-05-25 2016-05-13 Audio processing method and apparatus, and terminal
US15/576,198 US20180158469A1 (en) 2015-05-25 2016-05-13 Audio processing method and apparatus, and terminal
JP2018513709A JP6586514B2 (en) 2015-05-25 2016-05-13 Audio processing method, apparatus and terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510271769.1A CN105047203B (en) 2015-05-25 2015-05-25 Audio processing method, device and terminal

Publications (2)

Publication Number Publication Date
CN105047203A true CN105047203A (en) 2015-11-11
CN105047203B CN105047203B (en) 2019-09-10

Family

ID=54453690

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510271769.1A Active CN105047203B (en) 2015-05-25 2015-05-25 Audio processing method, device and terminal

Country Status (1)

Country Link
CN (1) CN105047203B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104978961A (en) * 2015-05-25 2015-10-14 腾讯科技(深圳)有限公司 Audio processing method, device and terminal
WO2016188329A1 (en) * 2015-05-25 2016-12-01 广州酷狗计算机科技有限公司 Audio processing method and apparatus, and terminal
CN107562760A (en) * 2016-06-30 2018-01-09 科大讯飞股份有限公司 Voice data processing method and device
CN108630240A (en) * 2017-03-23 2018-10-09 北京小唱科技有限公司 Chorus method and device
CN110008378A (en) * 2019-01-28 2019-07-12 平安科技(深圳)有限公司 Corpus collection method, device, equipment and storage medium based on artificial intelligence
CN110400580A (en) * 2019-08-30 2019-11-01 北京百度网讯科技有限公司 Audio processing method, device, equipment and medium
CN110895654A (en) * 2018-09-07 2020-03-20 台达电子工业股份有限公司 Segmentation method, segmentation system and non-transitory computer readable medium

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016188329A1 (en) * 2015-05-25 2016-12-01 广州酷狗计算机科技有限公司 Audio processing method and apparatus, and terminal
CN104978961B (en) * 2015-05-25 2019-10-15 广州酷狗计算机科技有限公司 Audio processing method, device and terminal
CN104978961A (en) * 2015-05-25 2015-10-14 腾讯科技(深圳)有限公司 Audio processing method, device and terminal
CN107562760A (en) * 2016-06-30 2018-01-09 科大讯飞股份有限公司 Voice data processing method and device
CN107562760B (en) * 2016-06-30 2020-11-17 科大讯飞股份有限公司 Voice data processing method and device
CN108630240B (en) * 2017-03-23 2020-05-26 北京小唱科技有限公司 Chorus method and apparatus
CN108630240A (en) * 2017-03-23 2018-10-09 北京小唱科技有限公司 Chorus method and device
CN110895654A (en) * 2018-09-07 2020-03-20 台达电子工业股份有限公司 Segmentation method, segmentation system and non-transitory computer readable medium
WO2020155750A1 (en) * 2019-01-28 2020-08-06 平安科技(深圳)有限公司 Artificial intelligence-based corpus collecting method, apparatus, device, and storage medium
CN110008378A (en) * 2019-01-28 2019-07-12 平安科技(深圳)有限公司 Corpus collection method, device, equipment and storage medium based on artificial intelligence
CN110008378B (en) * 2019-01-28 2024-03-19 平安科技(深圳)有限公司 Corpus collection method, device, equipment and storage medium based on artificial intelligence
CN110400580A (en) * 2019-08-30 2019-11-01 北京百度网讯科技有限公司 Audio processing method, device, equipment and medium
CN110400580B (en) * 2019-08-30 2022-06-17 北京百度网讯科技有限公司 Audio processing method, apparatus, device and medium

Also Published As

Publication number Publication date
CN105047203B (en) 2019-09-10

Similar Documents

Publication Publication Date Title
CN105047203A (en) Audio processing method, device and terminal
EP3480819B1 (en) Audio data processing method and apparatus
CN107452372B (en) Training method and device of far-field speech recognition model
CN104282322B (en) Mobile terminal, and method and apparatus thereof for identifying song climax parts
WO2018045988A1 (en) Method and device for generating digital music score file of song, and storage medium
CN105023559A (en) Karaoke processing method and system
CN105161116B (en) Method and device for determining the climax segment of a multimedia file
CN106055659B (en) Lyric data matching method and equipment thereof
CN104978973A (en) Audio processing method and device
EP3340238B1 (en) Method and device for audio processing
CN110688518A (en) Rhythm point determining method, device, equipment and storage medium
CN104978961A (en) Audio processing method, device and terminal
CN110019922B (en) Audio climax identification method and device
CN105975568A (en) Audio processing method and apparatus
CN104090883A (en) Playing control processing method and playing control processing device for audio file
CN110599989B (en) Audio processing method, device and storage medium
CN103903625A (en) Audio sound mixing method and device
CN104978377A (en) Multimedia data processing method, multimedia data processing device and terminal
CN111210850B (en) Lyric alignment method and related product
EP3159895A1 (en) Method and apparatus for editing audio files
CN105047202A (en) Audio processing method, device and terminal
CN104091594A (en) Audio classifying method and device
CN104143340B (en) Audio assessment method and device
CN104157296B (en) Audio assessment method and device
CN104091591A (en) Audio processing method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20161226

Address after: Room 1301, Self-numbered Building 2, No. 16 Keyun Road, Tianhe District, Guangzhou 510000

Applicant after: Guangzhou Kugou Inc.

Address before: Room 403, East Block, Building 2, SEG Science Park, Zhenxing Road, Futian District, Shenzhen, Guangdong 518000

Applicant before: Tencent Technology (Shenzhen) Co., Ltd.

C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: Self-numbered 1-17, No. 315 Huangpu Avenue, Guangzhou, Guangdong 510000

Applicant after: Guangzhou KuGou Networks Co., Ltd.

Address before: Room 1301, Self-numbered Building 2, No. 16 Keyun Road, Tianhe District, Guangzhou 510000

Applicant before: Guangzhou KuGou Networks Co., Ltd.

GR01 Patent grant