CN107103917A - Music rhythm detection method and its system - Google Patents
- Publication number: CN107103917A
- Application number: CN201710159699.XA
- Authority: CN (China)
- Prior art keywords: audio frame; energy; current audio frame; frame; current
- Legal status: Granted (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/81—Detection of presence or absence of voice signals for discriminating voice from music
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L2025/783—Detection of presence or absence of voice signals based on threshold decision
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Auxiliary Devices For Music (AREA)
- Electrophonic Musical Instruments (AREA)
Abstract
The invention discloses a music rhythm detection method and a system thereof. The method includes: obtaining audio data of music; sequentially obtaining an audio frame from the audio data as the current audio frame, taking the difference between the spectral energy sums of the current audio frame and the previous audio frame as the energy difference of the current audio frame, and storing the energy difference; determining the energy threshold corresponding to the current audio frame; obtaining the energy differences of the current audio frame and of the two or more consecutive audio frames immediately preceding it, giving the energy differences of three or more audio frames; and, if a peak exists among the energy differences of the three or more audio frames and the peak is greater than the energy threshold corresponding to the current audio frame, marking the audio frame corresponding to the peak as a rhythm point. The invention can detect the positions of rhythm points in music accurately and quickly, and is highly adaptable.
Description
Technical Field
The invention relates to the technical field of audio data processing, in particular to a music rhythm detection method and a system thereof.
Background
At present, lighting on most stages is controlled manually through a DMX console, which is very labor-intensive. Audio-controlled lighting technology mainly replaces this manual work by using software to detect the rhythm of a song in real time and control the lights accordingly. Existing audio-controlled lighting technology, however, has low detection accuracy and poor adaptability to song rhythm, so intelligent interaction between music and lighting is difficult to achieve.
Chinese patent publication No. CN201210477064.1 discloses a music rhythm detection method and a detection apparatus, in which the music rhythm detection method includes: acquiring an audio signal of the detected music; calculating a cross-correlation function between the audio signal of the detected music and the audio signal of a preset music rhythm model; obtaining the number of preset music rhythm models contained in the detected music according to the type of the detected music and the frequency with which peak positions of the cross-correlation function curve appear in the detected music; and comparing the number of preset music rhythm models contained in the detected music with preset rhythm information to determine the rhythm of the detected music. The reference document pre-establishes a music rhythm model, obtains its cross-correlation function with the detected audio signal, and from that obtains the rhythm of the detected audio. This detection method requires audio data to be obtained in advance and a music rhythm model to be established, which adds detection steps and complexity and is inconvenient in practical application.
Disclosure of Invention
The technical problem to be solved by the invention is to provide a music rhythm detection method and a system thereof that improve detection efficiency and accuracy.
To solve this technical problem, the invention adopts the following technical solution: a music rhythm detection method, comprising:
acquiring audio data of music;
sequentially acquiring an audio frame from the audio data as the current audio frame, taking the difference between the spectral energy sums of the current audio frame and the previous audio frame as the energy difference of the current audio frame, and storing the energy difference;
determining an energy threshold corresponding to the current audio frame;
acquiring the energy differences of the current audio frame and of the two or more consecutive audio frames immediately preceding it, to obtain the energy differences of three or more audio frames;
and if a peak exists among the energy differences of the three or more audio frames and the peak is greater than the energy threshold corresponding to the current audio frame, marking the audio frame corresponding to the peak as a rhythm point.
The invention also relates to a music rhythm detection system, comprising:
the first acquisition module is used for acquiring audio data of music;
the second obtaining module is used for sequentially obtaining an audio frame from the audio data as the current audio frame, taking the difference between the spectral energy sums of the current audio frame and the previous audio frame as the energy difference of the current audio frame, and storing the energy difference;
the determining module is used for determining an energy threshold corresponding to the current audio frame;
the third acquisition module is used for acquiring the energy differences of the current audio frame and of the two or more consecutive audio frames immediately preceding it, to obtain the energy differences of three or more audio frames;
and the marking module is used for marking the audio frame corresponding to the peak as a rhythm point if a peak exists among the energy differences of the three or more audio frames and the peak is greater than the energy threshold corresponding to the current audio frame.
The beneficial effects of the invention are as follows. Rhythm points in the audio data are detected from the energy differences between audio frames, which can be done in real time and with high accuracy. Rhythm points are determined by analyzing and comparing the energy differences of several adjacent audio frames, without establishing a model, which improves detection efficiency. The energy threshold is adjusted adaptively according to the energy differences of the audio frames already processed, so that it better matches the audio data currently being processed and the detected rhythm points are neither too few nor too many. This further improves detection accuracy and makes the method suitable for rhythm detection in many types of music, with strong adaptability and robustness.
Drawings
FIG. 1 is a flowchart of the music rhythm detection method according to the present invention;
FIG. 2 is a flowchart of the method according to the first embodiment of the present invention;
FIG. 3 is a flowchart of step S2 according to an embodiment of the present invention;
FIG. 4 is a flowchart of step S3 according to an embodiment of the present invention;
FIG. 5 is a schematic diagram of the music rhythm detection system according to the present invention;
FIG. 6 is a schematic structural diagram of the system according to the third embodiment of the present invention.
Description of reference numerals:
1. a first acquisition module; 2. a second acquisition module; 3. a determination module; 4. a third obtaining module;
5. a marking module; 6. a continued-execution module; 7. a control module;
21. a first acquisition unit; 22. a first obtaining unit; 23. a fourth calculation unit; 24. a second acquisition unit; 25. a second obtaining unit; 26. a fifth calculation unit; 27. a third obtaining unit;
31. a first setting unit; 32. a first calculation unit; 33. a second setting unit;
321. a second calculation unit; 322. a third calculation unit.
Detailed Description
In order to explain technical contents, objects and effects of the present invention in detail, the following detailed description is given with reference to the accompanying drawings in conjunction with the embodiments.
The key concept of the invention is to determine rhythm points from the energy differences while determining the energy threshold corresponding to each audio frame in real time.
Referring to FIG. 1, a music rhythm detection method includes:
acquiring audio data of music;
sequentially acquiring an audio frame from the audio data as the current audio frame, taking the difference between the spectral energy sums of the current audio frame and the previous audio frame as the energy difference of the current audio frame, and storing the energy difference;
determining an energy threshold corresponding to the current audio frame;
acquiring the energy differences of the current audio frame and of the two or more consecutive audio frames immediately preceding it, to obtain the energy differences of three or more audio frames;
and if a peak exists among the energy differences of the three or more audio frames and the peak is greater than the energy threshold corresponding to the current audio frame, marking the audio frame corresponding to the peak as a rhythm point.
From the above description, the beneficial effects of the present invention are: the rhythm of the audio data can be detected in real time, and the detection result is high in accuracy and efficiency.
Further, "determining an energy threshold corresponding to the current audio frame" specifically includes:
if the sequence number of the sequentially acquired current audio frame is less than or equal to a preset frame number N, setting the energy threshold corresponding to the current audio frame to a preset first energy threshold;
if the sequence number of the sequentially acquired current audio frame is a natural-number multiple of the preset frame number N, calculating a second energy threshold from the energy difference of each audio frame in a first audio frame group, wherein the first audio frame group comprises the current audio frame and the N-1 consecutive audio frames immediately preceding it;
and setting the energy threshold corresponding to the N consecutive audio frames immediately following the current audio frame to the second energy threshold.
Further, "if the sequence number of the sequentially acquired current audio frame is a natural-number multiple of the preset frame number N, calculating a second energy threshold from the energy difference of each audio frame in a first audio frame group" specifically includes:
if the sequence number of the sequentially acquired current audio frame is a natural-number multiple of the preset frame number N, calculating the average value and the median of the energy differences of the audio frames in the first audio frame group;
and calculating the second energy threshold from the average value and the median.
Further, "calculating the second energy threshold from the average value and the median" specifically includes:
calculating the second energy threshold according to the formula: second energy threshold $= \alpha \times \mathrm{mean} + \beta \times \mathrm{median} + \gamma$, wherein mean is the average value, median is the median, $\alpha$ is the weight corresponding to the average value, $\beta$ is the weight corresponding to the median, and $\gamma$ is a preset constant.
As can be seen from the above description, the energy threshold corresponding to the audio frames is updated in real time from the energy differences of a fixed number of audio frames, which gives strong robustness and strong adaptability to rhythm detection in different types of songs.
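The threshold formula above can be sketched in a few lines of Python. This is a minimal illustration only: the function name and the default weights `alpha`, `beta`, and `gamma` are hypothetical, since the text does not give concrete values for them.

```python
from statistics import mean, median


def second_energy_threshold(diffs, alpha=0.5, beta=0.5, gamma=0.0):
    """Weighted combination of the mean and median of a group of energy differences.

    alpha, beta and gamma are placeholder values; the source text only says
    they are weights and a preset constant.
    """
    if not diffs:
        raise ValueError("need at least one energy difference")
    return alpha * mean(diffs) + beta * median(diffs) + gamma
```

Combining the mean with the median makes the threshold less sensitive to a few very large energy jumps than the mean alone would be, which is one plausible reason for using both.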
Further, "if a peak exists among the energy differences of the three or more audio frames and the peak is greater than the energy threshold corresponding to the current audio frame, marking the audio frame corresponding to the peak as a rhythm point" specifically includes: when the number of energy differences obtained is three, if the energy differences of the three audio frames satisfy $D_{n-2} < D_{n-1}$ and $D_{n-1} > D_n$, and $D_{n-1}$ is also greater than the energy threshold corresponding to the current audio frame, marking the audio frame corresponding to $D_{n-1}$ as a rhythm point; wherein $D_n$ is the energy difference of the current audio frame, $D_{n-1}$ is the energy difference of the audio frame immediately preceding the current audio frame, and $D_{n-2}$ is the energy difference of the audio frame two frames before the current audio frame.
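For the three-value case, the peak condition reduces to a single boolean test. The sketch below assumes the three energy differences and the threshold are already available as numbers; the function and parameter names are illustrative and do not come from the source.

```python
def is_rhythm_peak(d_prev2, d_prev1, d_curr, threshold):
    """Return True when d_prev1 is a strict local peak that exceeds the threshold.

    d_prev2, d_prev1, d_curr stand for the energy differences of frames
    n-2, n-1 and n; threshold is the energy threshold of the current frame n.
    """
    return d_prev2 < d_prev1 and d_prev1 > d_curr and d_prev1 > threshold
```

Because the marked frame is n-1 rather than n, a rhythm point is only confirmed one frame after it occurs.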
Further, the "sequentially obtaining an audio frame from the audio data as a current audio frame, taking a difference between a sum of spectral energies of the current audio frame and a previous audio frame as an energy difference of the current audio frame, and storing the energy difference" specifically includes:
acquiring a first audio frame of the audio data according to a preset frame length;
performing a Fourier transform on the first audio frame to obtain the frequency spectrum of the first audio frame;
calculating to obtain the sum of the spectral energy of the frequency spectrum of the first audio frame in a preset frequency band;
acquiring a next audio frame of the audio data according to a preset frame length to serve as a current audio frame;
performing Fourier transform on the current audio frame to obtain a frequency spectrum of the current audio frame;
calculating to obtain the sum of the spectral energy of the frequency spectrum of the current audio frame in a preset frequency band;
and subtracting the spectral energy sum of the previous audio frame from the spectral energy sum of the current audio frame to obtain the energy difference of the current audio frame, and storing the energy difference.
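The framing, transform, band-sum, and subtraction steps above can be sketched as follows. Several choices here are assumptions not stated in the text: frames do not overlap, no window function is applied, "spectral energy" is taken as the squared magnitude of each frequency bin, and a naive discrete Fourier transform stands in for whatever fast Fourier transform routine a real implementation would use.

```python
import cmath
import math


def band_energy_sum(frame, sample_rate, f_lo, f_hi):
    """Sum |X[k]|^2 over DFT bins whose frequency lies in [f_lo, f_hi]."""
    n = len(frame)
    total = 0.0
    for k in range(n // 2 + 1):  # non-negative frequencies only
        freq = k * sample_rate / n
        if f_lo <= freq <= f_hi:
            # Naive O(n) evaluation of one DFT bin; an FFT would replace this in practice.
            x_k = sum(s * cmath.exp(-2j * math.pi * k * i / n)
                      for i, s in enumerate(frame))
            total += abs(x_k) ** 2
    return total


def energy_differences(samples, frame_len, sample_rate, f_lo, f_hi):
    """Return the differences S_n - S_(n-1) for frames 2, 3, ... of the signal."""
    sums = [band_energy_sum(samples[i:i + frame_len], sample_rate, f_lo, f_hi)
            for i in range(0, len(samples) - frame_len + 1, frame_len)]
    return [cur - prev for prev, cur in zip(sums, sums[1:])]
```

A silent frame followed by a frame containing a single impulse yields a large positive difference, and the return to silence yields an equally large negative one, which is the up-then-down shape the peak test looks for.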
Further, after the step of marking the audio frame corresponding to the peak as a rhythm point, the method further includes:
returning to the step of acquiring the next audio frame of the audio data according to the preset frame length as the current audio frame, and continuing execution from there.
Further, after the step of marking the audio frame corresponding to the peak as a rhythm point, the method further includes:
and controlling the linkage of external equipment according to the rhythm point, or displaying the audio data according to the rhythm point and the frequency spectrum energy of the corresponding audio frame.
According to the above description, the rhythm point is detected and then applied to the control of external equipment, such as the control of stage lighting, so that the intelligent interaction effect of music and lighting can be realized; and displaying the audio characteristics corresponding to the rhythm points, so that the user can visually see the rhythm change corresponding to the audio data.
Referring to FIG. 5, the present invention further provides a music rhythm detection system, comprising:
the first acquisition module is used for acquiring audio data of music;
the second obtaining module is used for sequentially obtaining an audio frame from the audio data as the current audio frame, taking the difference between the spectral energy sums of the current audio frame and the previous audio frame as the energy difference of the current audio frame, and storing the energy difference;
the determining module is used for determining an energy threshold corresponding to the current audio frame;
the third acquisition module is used for acquiring the energy differences of the current audio frame and of the two or more consecutive audio frames immediately preceding it, to obtain the energy differences of three or more audio frames;
and the marking module is used for marking the audio frame corresponding to the peak as a rhythm point if a peak exists among the energy differences of the three or more audio frames and the peak is greater than the energy threshold corresponding to the current audio frame.
Further, the determining module includes:
the first setting unit is used for setting the energy threshold corresponding to the current audio frame as a preset first energy threshold if the sequence number corresponding to the current audio frame acquired in sequence is less than or equal to a preset frame number N;
the first calculating unit is used for calculating to obtain a second energy threshold value according to the energy difference value of each audio frame in a first audio frame group if the sequence number corresponding to the current audio frame acquired in sequence is a natural number multiple of the preset frame number N, wherein the first audio frame group comprises the current audio frame and the previous continuous N-1 audio frames adjacent to the current audio frame;
and the second setting unit is used for setting the energy threshold corresponding to the N consecutive audio frames immediately following the current audio frame to the second energy threshold.
Further, the first calculation unit includes:
the second calculation unit is used for calculating the average value and the median of the energy differences of the audio frames in the first audio frame group if the sequence number of the sequentially acquired current audio frame is a natural-number multiple of the preset frame number N;
and the third calculating unit is used for calculating the second energy threshold from the average value and the median.
Further, the third calculating unit is specifically configured to calculate the second energy threshold according to the formula: second energy threshold $= \alpha \times \mathrm{mean} + \beta \times \mathrm{median} + \gamma$, where mean is the average value, median is the median, $\alpha$ is the weight corresponding to the average value, $\beta$ is the weight corresponding to the median, and $\gamma$ is a preset constant.
Further, the marking module is specifically configured to, when the number of energy differences obtained is three, mark the audio frame corresponding to $D_{n-1}$ as a rhythm point if the energy differences of the three audio frames satisfy $D_{n-2} < D_{n-1}$ and $D_{n-1} > D_n$, and $D_{n-1}$ is also greater than the energy threshold corresponding to the current audio frame; wherein $D_n$ is the energy difference of the current audio frame, $D_{n-1}$ is the energy difference of the audio frame immediately preceding the current audio frame, and $D_{n-2}$ is the energy difference of the audio frame two frames before the current audio frame.
Further, the second obtaining module includes:
the first acquisition unit is used for acquiring a first audio frame of the audio data according to a preset frame length;
the first obtaining unit is used for carrying out Fourier transform on the first audio frame to obtain the frequency spectrum of the first audio frame;
the fourth calculating unit is used for calculating and obtaining the sum of the spectral energy of the frequency spectrum of the first audio frame in a preset frequency band;
the second acquisition unit is used for acquiring the next audio frame of the audio data as the current audio frame according to the preset frame length;
the second obtaining unit is used for carrying out Fourier transform on the current audio frame to obtain the frequency spectrum of the current audio frame;
the fifth calculating unit is used for calculating and obtaining the sum of the spectrum energy of the frequency spectrum of the current audio frame in a preset frequency band;
and the third obtaining unit is used for subtracting the spectral energy sum of the previous audio frame from the spectral energy sum of the current audio frame to obtain the energy difference of the current audio frame, and storing the energy difference.
Further, the system also includes:
the continued-execution module is used for continuing to execute the step of acquiring the next audio frame of the audio data according to the preset frame length as the current audio frame.
Further, the system also includes:
the control module is used for controlling the linkage of external equipment according to the rhythm point, or displaying the audio data according to the rhythm point and the spectral energy of the corresponding audio frame.
Example one
Referring to FIG. 2, a first embodiment of the present invention is a music rhythm detection method comprising the following steps:
S1: acquiring audio data of music; further, after the audio data is acquired, it is normalized. The music includes songs and accompaniment.
S2: sequentially acquiring an audio frame from the audio data as the current audio frame, taking the difference between the spectral energy sums of the current audio frame and the previous audio frame as the energy difference $D_n$ of the current audio frame, and storing the energy difference $D_n$;
S3: determining the energy threshold corresponding to the current audio frame; in this embodiment, the energy threshold is a preset empirical value and is the same for every audio frame.
S4: acquiring the energy differences of the current audio frame and of the two or more consecutive audio frames immediately preceding it, to obtain the energy differences of three or more audio frames;
S5: judging whether a peak exists among the energy differences of the three or more audio frames and whether the peak is greater than the energy threshold corresponding to the current audio frame; if so, executing step S6. For example, when the energy differences of three audio frames are acquired, it is determined whether the following condition is satisfied: $D_{n-2} < D_{n-1}$ and $D_{n-1} > D_n$, and at the same time $D_{n-1}$ is greater than the energy threshold corresponding to the current audio frame, where $D_{n-1}$ is the energy difference of the audio frame immediately preceding the current audio frame and $D_{n-2}$ is the energy difference of the audio frame two frames before the current audio frame.
S6: marking the audio frame corresponding to the peak as a rhythm point, then returning to step S2 until the audio data has been fully processed.
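Steps S2 to S6 with a fixed threshold can be sketched as one streaming loop. The sketch assumes the per-frame spectral energy sums have already been computed and are passed in as a plain sequence, and it ignores the energy difference of the first frame, which the text permits. The function name is hypothetical.

```python
from collections import deque


def detect_rhythm_points(energy_sums, threshold):
    """Return the 1-based frame numbers marked as rhythm points.

    energy_sums[i] is the spectral energy sum of frame i + 1; threshold is the
    fixed empirical threshold of the first embodiment.
    """
    window = deque(maxlen=3)  # holds the three most recent energy differences
    rhythm_points = []
    prev_sum = None
    for n, s_n in enumerate(energy_sums, start=1):
        if prev_sum is not None:
            window.append(s_n - prev_sum)  # energy difference of frame n
        prev_sum = s_n
        if len(window) == 3:
            d2, d1, d0 = window  # differences of frames n-2, n-1, n
            if d2 < d1 > d0 and d1 > threshold:
                rhythm_points.append(n - 1)  # the peak belongs to frame n-1
    return rhythm_points
```

For example, `detect_rhythm_points([1, 1, 6, 2, 2, 9, 3], 3)` marks frames 3 and 6, the two frames where the energy sum jumps sharply and then falls back.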
As shown in FIG. 3, step S2 includes the following steps:
S201: acquiring a first audio frame of the audio data according to a preset frame length; further, the frame length is set according to the sampling rate, for example a frame length of 1024 for a sampling rate of 44.1 kHz.
S202: performing a Fourier transform on the first audio frame to obtain the frequency spectrum of the first audio frame;
S203: calculating the sum of the spectral energy of the frequency spectrum of the first audio frame within a preset frequency band; the frequency band can be selected as required, such as the low-frequency, mid-frequency, or high-frequency part, or the full band. The spectral energy sum is denoted $S_n$, where the subscript n is the frame number, counted from 1, i.e. the sequence number of the audio frame.
S204: acquiring the next audio frame of the audio data according to the preset frame length as the current audio frame;
S205: performing a Fourier transform on the current audio frame to obtain the frequency spectrum of the current audio frame;
S206: calculating the sum of the spectral energy of the frequency spectrum of the current audio frame within the preset frequency band;
S207: subtracting the spectral energy sum of the previous audio frame from the spectral energy sum of the current audio frame to obtain the energy difference of the current audio frame, and storing the energy difference. Specifically, the energy difference of the current audio frame relative to the previous audio frame is calculated according to the formula $D_n = S_n - S_{n-1}$. The energy difference of the first audio frame can be ignored, or the spectral energy sum of the first audio frame can be used directly as its energy difference.
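The example values in step S201 fix the time and frequency resolution of the detector. The short calculation below assumes the frame length of 1024 is counted in samples and that consecutive frames do not overlap; neither point is stated explicitly in the text.

```python
sample_rate_hz = 44_100   # sampling rate from the example in step S201
frame_len = 1024          # frame length, assumed to be in samples

# Duration of one frame: about 23.2 ms, so a rhythm point is located to within
# roughly one fortieth of a second.
frame_duration_ms = 1000 * frame_len / sample_rate_hz

# Spacing between adjacent frequency bins of the Fourier transform: about 43.1 Hz,
# which bounds how finely a preset frequency band can be delimited.
bin_width_hz = sample_rate_hz / frame_len
```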
Further, after step S6, the process returns to step S204.
Preferably, after step S6, the method further includes: and controlling the linkage of external equipment according to the rhythm point, or displaying the audio data according to the rhythm point and the frequency spectrum energy of the corresponding audio frame.
Controlling the linkage of external equipment according to the rhythm points includes controlling lighting according to the rhythm points, specifically: a. flashing a light of one color at each rhythm point; b. flashing one type of lamp (e.g., a spotlight) at each rhythm point; c. flashing lights of two or more colors in sequence at each rhythm point; d. flashing several types of lamps together at each rhythm point; e. at each rhythm point, combining the flashing lights with water-spray, air-jet, scream, or applause effects, so that the lighting display changes in step with the rhythm points.
Displaying the audio data according to the rhythm points and the spectral energy of the corresponding audio frames may take one of the following forms:
displaying the audio data as an electrocardiogram-style trace, wherein the change in potential of the trace represents the change in the spectral energy sum of the audio frames corresponding to the rhythm points, and the speed of the potential change represents how quickly the rhythm points occur;
or displaying the audio data as a bar graph whose bars move left and right, wherein the length of a bar represents the spectral energy sum of the audio frame corresponding to the rhythm point, and the moving speed of the bar represents how quickly the rhythm points occur;
or displaying the audio data as a bar graph whose bars rise and fall, wherein the height to which a bar rises represents the spectral energy sum of the audio frame corresponding to the rhythm point, and the speed of the rise and fall represents how quickly the rhythm points occur.
In the embodiment, the rhythm point in the audio data is detected according to the energy difference value between the audio frames, which can be carried out in real time and has higher accuracy; and a model is not required to be established, and the rhythm points are determined by analyzing and comparing the energy difference values of a plurality of adjacent audio frames, so that the detection efficiency is improved. After the rhythm point is detected, the rhythm point is applied to the control of external equipment, such as the control of stage lighting, so that the intelligent interaction effect of music and lighting can be realized; and displaying the audio characteristics corresponding to the rhythm points, so that the user can visually see the rhythm change corresponding to the audio data.
Example two
This embodiment is a further development of the first embodiment, and the same points are not described again, except that in step S3, the energy threshold is not fixed.
As shown in fig. 4, step S3 includes the following steps:
s301: and judging whether the sequence number corresponding to the current audio frame acquired in sequence is less than or equal to a preset frame number N, if so, executing the step S302, and if not, executing the step S303.
S302: setting an energy threshold corresponding to a current audio frame as a preset first energy threshold;
S303: judging whether the sequence number of the sequentially acquired current audio frame is a natural number multiple of the preset frame number N; if so, executing step S304. Specifically, the sequence number of the sequentially acquired current audio frame is obtained, and it is judged whether the sequence number is exactly divisible by the preset frame number N.
S304: calculating a second energy threshold according to the energy difference value of each audio frame in a first audio frame group, wherein the first audio frame group comprises the current audio frame and the N-1 consecutive audio frames immediately preceding it. Specifically, the energy difference values of the current audio frame and of the N-1 consecutive audio frames immediately preceding it are acquired; the average value and the median of these energy difference values are calculated; and the second energy threshold is calculated from the average value and the median according to the formula $T = \alpha \times \mathrm{mean} + \beta \times \mathrm{median} + \gamma$, where $T$ is the second energy threshold, mean is the average value, median is the median, $\alpha$ is the weight corresponding to the average value, $\beta$ is the weight corresponding to the median, and $\gamma$ is a preset constant.
S305: setting the energy threshold corresponding to the N consecutive audio frames immediately following the current audio frame as the second energy threshold.
For example, assuming that the preset frame number N is 150, the energy thresholds corresponding to the first 150 frames of the audio data, i.e., the 1st frame to the 150th frame, are all the preset first energy threshold. When the 150th frame is traversed, its sequence number is also a natural number multiple of the preset frame number N, so the energy difference values of the 150th frame and the 149 consecutive frames immediately preceding it, i.e., the energy difference values of the 1st frame to the 150th frame, are acquired; the average value and the median of these 150 energy difference values are calculated; the second energy threshold is then obtained by weighted calculation and is used as the energy threshold for the 150 frames immediately following the 150th frame, i.e., the 151st frame to the 300th frame. Similarly, when the 300th frame is traversed, the energy threshold corresponding to the 301st frame to the 450th frame is calculated from the energy difference values of the 151st frame to the 300th frame, and so on.
The energy ranges of different songs differ, and the energy ranges of different sections of the same song may also deviate from one another, so a single uniform threshold adapts poorly: if the threshold is too large, rhythm points are missed, and if it is too small, redundant rhythm points are detected. This embodiment provides a threshold self-adaptation method that adjusts the energy threshold according to the energy difference values of the audio frames already processed, so that the energy threshold better matches the audio data currently being processed. This avoids detecting too few or too many rhythm points, further improves detection accuracy, and makes the method applicable to rhythm detection for various types of music, with strong adaptability and robustness.
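The threshold self-adaptation of steps S301 to S305 can be sketched in Python as follows. This is only an illustrative sketch under stated assumptions: the function name `threshold_schedule`, the list-based interface, and the default weights `alpha`, `beta`, and `gamma` are hypothetical and are not values given in this embodiment.

```python
from statistics import mean, median


def threshold_schedule(diffs, n, first_threshold, alpha=0.5, beta=0.5, gamma=0.0):
    """Return one energy threshold per frame, following steps S301 to S305.

    diffs[i] is the energy difference value of frame i + 1 (frames are
    numbered from 1). The weights and first_threshold are hypothetical
    illustration values, not values taken from the patent.
    """
    thresholds = []
    current = first_threshold  # S302: frames 1..N use the preset first threshold
    for seq in range(1, len(diffs) + 1):
        thresholds.append(current)
        if seq % n == 0:  # S303: sequence number is a natural number multiple of N
            block = diffs[seq - n:seq]  # S304: current frame and the N-1 frames before it
            # S305: the new value is applied to the N frames that follow
            current = alpha * mean(block) + beta * median(block) + gamma
    return thresholds
```

With N = 150, this reproduces the schedule of the example above: frames 1 to 150 receive the first threshold, and frames 151 to 300 receive the value computed from the energy difference values of frames 1 to 150.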
Example three
Referring to fig. 6, the present embodiment is a music tempo detection system corresponding to the above embodiment, including:
a first obtaining module 1, configured to obtain audio data of music;
a second obtaining module 2, configured to sequentially obtain an audio frame from the audio data as a current audio frame, use a difference between a sum of spectral energies of the current audio frame and a previous audio frame as an energy difference of the current audio frame, and store the energy difference;
a determining module 3, configured to determine an energy threshold corresponding to a current audio frame;
a third obtaining module 4, configured to obtain an energy difference value between a current audio frame and two or more previous consecutive audio frames adjacent to the current audio frame, so as to obtain energy difference values of the three or more audio frames;
and the marking module 5 is configured to mark the audio frame corresponding to the peak as a rhythm point if the peak exists in the energy difference values of the more than three audio frames and the peak is greater than the energy threshold corresponding to the current audio frame.
Further, the determining module 3 includes:
a first setting unit 31, configured to set an energy threshold corresponding to the current audio frame as a preset first energy threshold if a sequence number corresponding to the sequentially acquired current audio frame is less than or equal to a preset frame number N;
the first calculating unit 32 is configured to calculate a second energy threshold according to an energy difference value of each audio frame in a first audio frame group if a sequence number corresponding to a current audio frame obtained in sequence is a natural number multiple of a preset frame number N, where the first audio frame group includes the current audio frame and N-1 consecutive audio frames before the current audio frame and adjacent to the current audio frame;
a second setting unit 33, configured to set an energy threshold corresponding to N consecutive audio frames after the current audio frame and adjacent to the current audio frame as the second energy threshold.
Further, the first calculation unit 32 includes:
a second calculating unit 321, configured to calculate, if the sequence number corresponding to the sequentially obtained current audio frame is a natural number multiple of the preset frame number N, an average value and a median value of energy difference values according to the energy difference value of each audio frame in the first audio frame group;
and a third calculating unit 322, configured to calculate a second energy threshold according to the average value and the median.
Further, the third calculating unit 322 is specifically configured to calculate the second energy threshold according to the formula $T = \alpha \times \mathrm{mean} + \beta \times \mathrm{median} + \gamma$, where $T$ is the second energy threshold, mean is the average value, median is the median, $\alpha$ is the weight corresponding to the average value, $\beta$ is the weight corresponding to the median, and $\gamma$ is a preset constant.
Further, the marking module 5 is specifically configured to, when the number of acquired energy difference values is three, mark the audio frame corresponding to $D_{n-1}$ as a rhythm point if the energy difference values of the three audio frames satisfy $D_{n-2} < D_{n-1}$ and $D_{n-1} > D_n$, and also $D_{n-1} > T_n$; where $T_n$ is the energy threshold corresponding to the current audio frame, $D_n$ is the energy difference value of the current audio frame, $D_{n-1}$ is the energy difference value of the frame immediately preceding the current audio frame, and $D_{n-2}$ is the energy difference value of the frame two frames before the current audio frame.
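The three-frame peak test performed by the marking module can be illustrated with a short Python sketch. The function name `mark_rhythm_points` and the convention that list index i holds the values for frame i + 1 are assumptions made for illustration only.

```python
def mark_rhythm_points(diffs, thresholds):
    """Return the 1-based sequence numbers of frames marked as rhythm points.

    diffs[i] and thresholds[i] are the energy difference value and the
    energy threshold of frame i + 1 (hypothetical indexing convention).
    """
    points = []
    for i in range(2, len(diffs)):
        d_prev2, d_prev1, d_cur = diffs[i - 2], diffs[i - 1], diffs[i]
        # The middle value is a local peak of the three energy difference values
        is_peak = d_prev2 < d_prev1 and d_prev1 > d_cur
        # The peak is compared with the threshold of the current frame n
        if is_peak and d_prev1 > thresholds[i]:
            points.append(i)  # frame n-1 has sequence number (i + 1) - 1 = i
    return points
```

Because the test needs the value of the current frame to confirm a peak at the preceding frame, a rhythm point is reported one frame after it occurs.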
Further, the second obtaining module 2 includes:
a first obtaining unit 21, configured to obtain a first audio frame of the audio data according to a preset frame length;
a first obtaining unit 22, configured to perform fourier transform on the first audio frame to obtain a frequency spectrum of the first audio frame;
a fourth calculating unit 23, configured to calculate a sum of spectral energies of the frequency spectrum of the first audio frame in a preset frequency band;
a second obtaining unit 24, configured to obtain a next audio frame of the audio data according to a preset frame length, as a current audio frame;
a second obtaining unit 25, configured to perform fourier transform on the current audio frame to obtain a frequency spectrum of the current audio frame;
a fifth calculating unit 26, configured to calculate a sum of spectral energies of the frequency spectrum of the current audio frame in a preset frequency band;
a third obtaining unit 27, configured to subtract the sum of the spectral energy of the previous audio frame from the sum of the spectral energy of the current audio frame to obtain an energy difference value of the current audio frame, and store the energy difference value.
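The computation carried out by units 21 to 27 can be sketched as follows. This is a minimal sketch under several assumptions that the text does not specify: frames are taken without overlap or windowing, spectral energy is taken as the squared magnitude of each frequency bin, and a naive discrete Fourier transform from the standard library stands in for a fast Fourier transform. The function names are hypothetical.

```python
import cmath
import math


def band_energy(frame, sample_rate, low_hz, high_hz):
    """Sum |X[k]|^2 over DFT bins whose frequency lies in [low_hz, high_hz]."""
    n = len(frame)
    total = 0.0
    for k in range(n // 2 + 1):
        freq = k * sample_rate / n
        if low_hz <= freq <= high_hz:
            # Naive DFT of a single bin; an FFT would be used in practice
            x_k = sum(s * cmath.exp(-2j * math.pi * k * t / n)
                      for t, s in enumerate(frame))
            total += abs(x_k) ** 2
    return total


def energy_differences(samples, frame_len, sample_rate, low_hz, high_hz):
    """Return E[n] - E[n-1] for each frame n >= 2 of non-overlapping frames."""
    energies = [
        band_energy(samples[start:start + frame_len], sample_rate, low_hz, high_hz)
        for start in range(0, len(samples) - frame_len + 1, frame_len)
    ]
    # The first frame has no predecessor, so the result starts at frame 2
    return [cur - prev for prev, cur in zip(energies, energies[1:])]
```

A positive energy difference value indicates that the energy in the preset frequency band rose relative to the previous frame, which is the kind of change the later peak test looks for.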
Further, the system also includes:
and a continuous execution module 6, configured to continuously execute the step of acquiring a next audio frame of the audio data according to the preset frame length as a current audio frame.
Further, the system also includes:
and the control module 7 is used for controlling the linkage of external equipment according to the rhythm point, or displaying the audio data according to the rhythm point and the frequency spectrum energy of the corresponding audio frame.
In summary, the music tempo detection method and system provided by the invention can detect the tempo point in the audio data according to the energy difference between the audio frames, and can be performed in real time, thereby having higher accuracy; the energy threshold value is adaptively adjusted according to the energy difference value of the processed audio frame, so that the energy threshold value is more matched with the currently processed audio data, the detected rhythm points are prevented from being too few or too many, the detection accuracy is further improved, the method is suitable for rhythm detection of various types of music, and the method has strong adaptability and strong robustness; after the rhythm point is detected, the rhythm point is applied to the control of external equipment, such as the control of stage lighting, so that the intelligent interaction effect of music and lighting can be realized; and displaying the audio characteristics corresponding to the rhythm points, so that the user can visually see the rhythm change corresponding to the audio data.
The above description is only an embodiment of the present invention, and not intended to limit the scope of the present invention, and all equivalent changes made by using the contents of the present specification and the drawings, or applied directly or indirectly to the related technical fields, are included in the scope of the present invention.
Claims (12)
1. A music tempo detection method, comprising:
acquiring audio data of music;
sequentially acquiring an audio frame from the audio data as a current audio frame, taking the difference value of the spectral energy sum of the current audio frame and the previous audio frame as the energy difference value of the current audio frame, and storing the energy difference value;
determining an energy threshold corresponding to the current audio frame;
acquiring energy difference values of a current audio frame and more than two adjacent continuous audio frames before the current audio frame to obtain energy difference values of more than three audio frames;
and if the energy difference value of the more than three audio frames has a peak value, and the peak value is larger than the energy threshold value corresponding to the current audio frame, marking the audio frame corresponding to the peak value as a rhythm point.
2. The method according to claim 1, wherein the "determining the energy threshold corresponding to the current audio frame" specifically comprises:
if the sequence number corresponding to the current audio frame acquired in sequence is less than or equal to the preset frame number N, setting the energy threshold corresponding to the current audio frame as a preset first energy threshold;
if the sequence number corresponding to the current audio frame obtained in sequence is a natural number multiple of the preset frame number N, calculating to obtain a second energy threshold value according to the energy difference value of each audio frame in a first audio frame group, wherein the first audio frame group comprises the current audio frame and the previous continuous N-1 audio frames adjacent to the current audio frame;
and setting the energy threshold corresponding to the continuous N audio frames adjacent to the current audio frame after the current audio frame as the second energy threshold.
3. The method for detecting a music tempo according to claim 2, wherein said "if the sequence number corresponding to the current audio frame obtained in sequence is a natural number multiple of the preset number of frames N, then calculating a second energy threshold according to the energy difference of each audio frame in the first audio frame group" specifically comprises:
if the sequence number corresponding to the current audio frame acquired in sequence is a natural number multiple of the preset frame number N, calculating the average value and the median of the energy difference values according to the energy difference value of each audio frame in the first audio frame group;
and calculating to obtain a second energy threshold according to the average value and the median.
4. The music tempo detection method according to claim 3, wherein said "calculating a second energy threshold value according to the average value and the median" specifically comprises:
and calculating the second energy threshold according to the formula $T = \alpha \times \mathrm{mean} + \beta \times \mathrm{median} + \gamma$, wherein $T$ is the second energy threshold, mean is the average value, median is the median, $\alpha$ is the weight corresponding to the average value, $\beta$ is the weight corresponding to the median, and $\gamma$ is a preset constant.
5. The method according to claim 1, wherein "if the energy difference values of the more than three audio frames have a peak value, and the peak value is larger than the energy threshold corresponding to the current audio frame, marking the audio frame corresponding to the peak value as a rhythm point" specifically comprises: when the number of acquired energy difference values is three, if the energy difference values of the three audio frames satisfy $D_{n-2} < D_{n-1}$ and $D_{n-1} > D_n$, and also $D_{n-1} > T_n$, marking the audio frame corresponding to $D_{n-1}$ as a rhythm point; wherein $T_n$ is the energy threshold corresponding to the current audio frame, $D_n$ is the energy difference value of the current audio frame, $D_{n-1}$ is the energy difference value of the frame immediately preceding the current audio frame, and $D_{n-2}$ is the energy difference value of the frame two frames before the current audio frame.
6. The method for detecting a music tempo according to claim 1, wherein said sequentially obtaining an audio frame from the audio data as a current audio frame, and using a difference between a spectral energy sum of the current audio frame and a previous audio frame as an energy difference of the current audio frame and storing the energy difference specifically comprises:
acquiring a first audio frame of the audio data according to a preset frame length;
performing Fourier transform on the first audio frame to obtain the frequency spectrum of the first audio frame;
calculating to obtain the sum of the spectral energy of the frequency spectrum of the first audio frame in a preset frequency band;
acquiring a next audio frame of the audio data according to a preset frame length to serve as a current audio frame;
performing Fourier transform on the current audio frame to obtain a frequency spectrum of the current audio frame;
calculating to obtain the sum of the spectral energy of the frequency spectrum of the current audio frame in a preset frequency band;
and subtracting the sum of the spectral energy of the last audio frame from the sum of the spectral energy of the current audio frame to obtain an energy difference value of the current audio frame, and storing the energy difference value.
7. The method according to claim 6, wherein after said marking the audio frame corresponding to the peak as a tempo point, further comprising:
and continuously executing the step of obtaining the next audio frame of the audio data according to the preset frame length as the current audio frame.
8. The method for detecting music tempo according to claim 1, wherein after said marking the audio frame corresponding to said peak as a tempo point, further comprising:
and controlling the linkage of external equipment according to the rhythm point, or displaying the audio data according to the rhythm point and the frequency spectrum energy of the corresponding audio frame.
9. A music tempo detection system characterized by comprising:
the first acquisition module is used for acquiring audio data of music;
a second obtaining module, configured to sequentially obtain an audio frame from the audio data as a current audio frame, use a difference between a sum of spectral energies of the current audio frame and a previous audio frame as an energy difference of the current audio frame, and store the energy difference;
the determining module is used for determining an energy threshold corresponding to the current audio frame;
the third acquisition module is used for acquiring the energy difference value of the current audio frame and more than two adjacent continuous audio frames before the current audio frame to obtain the energy difference value of more than three audio frames;
and the marking module is used for marking the audio frame corresponding to the peak value as a rhythm point if the peak value exists in the energy difference values of the more than three audio frames and the peak value is larger than the energy threshold value corresponding to the current audio frame.
10. The music tempo detection system according to claim 9, wherein said determination module comprises:
the first setting unit is used for setting the energy threshold corresponding to the current audio frame as a preset first energy threshold if the sequence number corresponding to the current audio frame acquired in sequence is less than or equal to a preset frame number N;
the first calculating unit is used for calculating to obtain a second energy threshold value according to the energy difference value of each audio frame in a first audio frame group if the sequence number corresponding to the current audio frame acquired in sequence is a natural number multiple of the preset frame number N, wherein the first audio frame group comprises the current audio frame and the previous continuous N-1 audio frames adjacent to the current audio frame;
and the second setting unit is used for setting the energy threshold corresponding to the continuous N audio frames adjacent to the current audio frame after the current audio frame as the second energy threshold.
11. The music tempo detection system according to claim 10, wherein said first calculation unit comprises:
the second calculation unit is used for calculating to obtain the average value and the median value of the energy difference values according to the energy difference values of all the audio frames in the first audio frame group if the sequence numbers corresponding to the current audio frames acquired in sequence are natural number times of the preset frame number N;
and the third calculating unit is used for calculating to obtain a second energy threshold according to the average value and the median.
12. The music tempo detection system according to claim 9, further comprising:
and the control module is used for controlling the linkage of external equipment according to the rhythm point or displaying the audio data according to the rhythm point and the frequency spectrum energy of the corresponding audio frame.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710159699.XA CN107103917B (en) | 2017-03-17 | 2017-03-17 | Music rhythm detection method and system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107103917A true CN107103917A (en) | 2017-08-29 |
CN107103917B CN107103917B (en) | 2020-05-05 |
Family
ID=59675459
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710159699.XA Active CN107103917B (en) | 2017-03-17 | 2017-03-17 | Music rhythm detection method and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107103917B (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5614687A (en) * | 1995-02-20 | 1997-03-25 | Pioneer Electronic Corporation | Apparatus for detecting the number of beats |
CN1484116A (en) * | 2002-09-20 | 2004-03-24 | 红芯有限责任公司 | Beat number detector |
US20050217461A1 (en) * | 2004-03-31 | 2005-10-06 | Chun-Yi Wang | Method for music analysis |
CN106098081A (en) * | 2016-06-01 | 2016-11-09 | 腾讯科技(深圳)有限公司 | The acoustic fidelity identification method of audio files and device |
Cited By (38)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108111909A (en) * | 2017-12-15 | 2018-06-01 | 广州市百果园信息技术有限公司 | Method of video image processing and computer storage media, terminal |
CN108281157A (en) * | 2017-12-28 | 2018-07-13 | 广州市百果园信息技术有限公司 | The detection method of drum beat and computer storage media, terminal in music |
CN108322802A (en) * | 2017-12-29 | 2018-07-24 | 广州市百果园信息技术有限公司 | Stick picture disposing method, computer readable storage medium and the terminal of video image |
CN108259984A (en) * | 2017-12-29 | 2018-07-06 | 广州市百果园信息技术有限公司 | Method of video image processing, computer readable storage medium and terminal |
CN108259925A (en) * | 2017-12-29 | 2018-07-06 | 广州市百果园信息技术有限公司 | Music gifts processing method, storage medium and terminal in net cast |
CN108259983A (en) * | 2017-12-29 | 2018-07-06 | 广州市百果园信息技术有限公司 | A kind of method of video image processing, computer readable storage medium and terminal |
CN108319657A (en) * | 2018-01-04 | 2018-07-24 | 广州市百果园信息技术有限公司 | Detect method, storage medium and the terminal of strong rhythm point |
CN108319657B (en) * | 2018-01-04 | 2022-02-01 | 广州市百果园信息技术有限公司 | Method for detecting strong rhythm point, storage medium and terminal |
CN108038863A (en) * | 2018-01-29 | 2018-05-15 | 歌尔科技有限公司 | Image partition method and device |
CN108038863B (en) * | 2018-01-29 | 2021-02-19 | 歌尔科技有限公司 | Image segmentation method and device |
CN111630590A (en) * | 2018-02-14 | 2020-09-04 | 字节跳动有限公司 | Method for generating music data |
CN111630590B (en) * | 2018-02-14 | 2024-05-14 | 字节跳动有限公司 | Method for generating music data |
US11887566B2 (en) | 2018-02-14 | 2024-01-30 | Bytedance Inc. | Method of generating music data |
CN110265057A (en) * | 2019-07-10 | 2019-09-20 | 腾讯科技(深圳)有限公司 | Generate multimedia method and device, electronic equipment, storage medium |
CN110265057B (en) * | 2019-07-10 | 2024-04-26 | 腾讯科技(深圳)有限公司 | Method and device for generating multimedia, electronic equipment and storage medium |
CN110688518A (en) * | 2019-10-12 | 2020-01-14 | 广州酷狗计算机科技有限公司 | Rhythm point determining method, device, equipment and storage medium |
CN110688518B (en) * | 2019-10-12 | 2024-05-24 | 广州酷狗计算机科技有限公司 | Determination method, device, equipment and storage medium for rhythm point |
CN110753238A (en) * | 2019-10-29 | 2020-02-04 | 北京字节跳动网络技术有限公司 | Video processing method, device, terminal and storage medium |
CN110890083B (en) * | 2019-10-31 | 2022-09-02 | 北京达佳互联信息技术有限公司 | Audio data processing method and device, electronic equipment and storage medium |
CN110890083A (en) * | 2019-10-31 | 2020-03-17 | 北京达佳互联信息技术有限公司 | Audio data processing method and device, electronic equipment and storage medium |
CN110853677A (en) * | 2019-11-20 | 2020-02-28 | 北京雷石天地电子技术有限公司 | Drumbeat beat recognition method and device for songs, terminal and non-transitory computer readable storage medium |
CN110853677B (en) * | 2019-11-20 | 2022-04-26 | 北京雷石天地电子技术有限公司 | Drumbeat beat recognition method and device for songs, terminal and non-transitory computer readable storage medium |
CN111081271B (en) * | 2019-11-29 | 2022-09-06 | 福建星网视易信息系统有限公司 | Music rhythm detection method based on frequency domain and time domain and storage medium |
CN111081271A (en) * | 2019-11-29 | 2020-04-28 | 福建星网视易信息系统有限公司 | Music rhythm detection method based on frequency domain and time domain and storage medium |
US12033605B2 (en) | 2019-12-20 | 2024-07-09 | Netease (Hangzhou) Network Co., Ltd. | Rhythm point detection method and apparatus and electronic device |
CN111128100A (en) * | 2019-12-20 | 2020-05-08 | 网易(杭州)网络有限公司 | Rhythm point detection method and device and electronic equipment |
CN111128100B (en) * | 2019-12-20 | 2021-04-20 | 网易(杭州)网络有限公司 | Rhythm point detection method and device and electronic equipment |
CN111105769A (en) * | 2019-12-26 | 2020-05-05 | 广州酷狗计算机科技有限公司 | Method, device, equipment and storage medium for detecting intermediate frequency rhythm point of audio |
CN111105769B (en) * | 2019-12-26 | 2023-01-10 | 广州酷狗计算机科技有限公司 | Method, device, equipment and storage medium for detecting intermediate frequency rhythm point of audio |
CN112261767A (en) * | 2020-10-23 | 2021-01-22 | 深圳市智岩科技有限公司 | Method, device and system for controlling lamp effect and storage medium |
CN112435687A (en) * | 2020-11-25 | 2021-03-02 | 腾讯科技(深圳)有限公司 | Audio detection method and device, computer equipment and readable storage medium |
CN112866732A (en) * | 2020-12-30 | 2021-05-28 | 广州方硅信息技术有限公司 | Music broadcasting method and device, equipment and medium thereof |
CN112866770B (en) * | 2020-12-31 | 2023-12-05 | 北京奇艺世纪科技有限公司 | Equipment control method and device, electronic equipment and storage medium |
CN112866770A (en) * | 2020-12-31 | 2021-05-28 | 北京奇艺世纪科技有限公司 | Equipment control method and device, electronic equipment and storage medium |
CN113192531B (en) * | 2021-05-28 | 2024-04-16 | 腾讯音乐娱乐科技(深圳)有限公司 | Method, terminal and storage medium for detecting whether audio is pure audio |
CN113192531A (en) * | 2021-05-28 | 2021-07-30 | 腾讯音乐娱乐科技(深圳)有限公司 | Method, terminal and storage medium for detecting whether audio is pure music audio |
CN113948106A (en) * | 2021-08-31 | 2022-01-18 | 深圳市智岩科技有限公司 | Audio rhythm detection method, light effect control system, device, equipment and medium |
WO2023093333A1 (en) * | 2021-11-25 | 2023-06-01 | 歌尔股份有限公司 | Vibration signal generation method and apparatus, electronic device, and storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN107103917B (en) | 2020-05-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107103917B (en) | Music rhythm detection method and system | |
CN107889323B (en) | Control method and device for light display | |
US10366121B2 (en) | Apparatus, method, and computer-readable medium for cue point generation | |
CN104978962B (en) | Singing search method and system | |
US7183479B2 (en) | Beat analysis of musical signals | |
US8706274B2 (en) | Information processing apparatus, information processing method, and program | |
US4314379A (en) | Apparatus for discrimination of myoelectric potential patterns | |
CN113727501B (en) | Sound-based light dynamic control method, device, system and storage medium | |
CN111081271B (en) | Music rhythm detection method based on frequency domain and time domain and storage medium | |
CN106211502A (en) | A kind of method and system of audio frequency control light | |
CN102903357A (en) | Method, device and system for extracting chorus of song | |
RU2010102675A (en) | METHOD FOR ELECTRONIC ANALYSIS OF DIALOGUE AND SYSTEM FOR IMPLEMENTING THIS METHOD | |
KR20030070178A (en) | Method and system for real-time music/speech discrimination in digital audio signals | |
US10492276B2 (en) | Lighting control device, lighting control method, and lighting control program | |
CN112261767A (en) | Method, device and system for controlling lamp effect and storage medium | |
CN102543052B (en) | A kind of method and apparatus analyzing music BPM | |
CN110933406B (en) | Objective evaluation method for short video music matching quality | |
Jensen et al. | Real-time beat estimationusing feature extraction | |
WO2010047444A1 (en) | Music-and-fountain drive device and method, and a music-and-fountain scenario-producing device and method therefor | |
CN108492807B (en) | Method and device for displaying sound modification state | |
KR100911689B1 (en) | A method and an apparatus for detecting realtime music beat period | |
Vinutha et al. | Reliable tempo detection for structural segmentation in sarod concerts | |
JP2009265493A (en) | Sound processing device and program | |
CN113539296B (en) | Audio climax detection algorithm based on sound intensity, storage medium and device | |
CN116437539B (en) | Multi-degree-of-freedom stage-design light control method and system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||