CN113012713A

CN113012713A - Music genre classification method based on logistic regression algorithm in machine learning

Info

Publication number: CN113012713A
Application number: CN202110229785.XA
Authority: CN
Inventors: 蔡向东; 梁佳雨
Original assignee: Harbin University of Science and Technology
Current assignee: Harbin University of Science and Technology
Priority date: 2021-03-02
Filing date: 2021-03-02
Publication date: 2021-06-22

Abstract

A music genre classification method based on a logistic regression algorithm in machine learning belongs to the technical field of data processing. The invention provides a method for solving manual classification, improving the accuracy and speed of music genre classification and providing great convenience for users to retrieve and distinguish various types of music genres. The method comprises the steps of collecting preset music data of various types, carrying out format conversion processing on the collected music data, extracting characteristic parameters of the music data from the music data by utilizing a Mel cepstrum coefficient, training and modeling extracted characteristic parameter vectors of the music data by utilizing a logistic regression algorithm in machine learning, and predicting classification of music genres by utilizing a trained model.

Description

Music genre classification method based on logistic regression algorithm in machine learning

Technical Field

The invention relates to the technical field of data processing, in particular to a music genre classification method based on a logistic regression algorithm in machine learning

Background

At present, with the popularization of computers and the rapid development of networks, a large amount of music resources are transmitted to users through the networks, and different users have different preference degrees for different types of music. Users have a need to find related music types that meet their preferences, so the classification of music genres becomes one of the popular research directions in the field of music information retrieval.

Music is extremely diversified and huge data volume, and music information retrieval becomes very difficult. The classification is a necessary means for managing music data, the traditional classification is music retrieval based on manually labeled text, and a large amount of human resources and time are consumed, so that the method adopts a logistic regression algorithm in machine learning to perform supervised learning modeling on music characteristic parameter vectors to obtain corresponding genre classification, avoids manual classification, improves the accuracy and speed of music genre classification, and provides great convenience for users to retrieve a certain type of music.

Disclosure of Invention

The invention aims to solve the problem of music genre classification in huge music data volume, and provides a music genre classification method based on a logistic regression algorithm in machine learning.

1. The invention mainly comprises the following steps:

step 101: collecting preset music data of various types, and carrying out format conversion processing on the collected music data;

step 102: extracting characteristic parameters of the music data by utilizing a Mel cepstrum coefficient;

step 103: and training and modeling the extracted music data characteristic parameter vectors by using a logistic regression algorithm in machine learning, and predicting the classification of music genres by using the trained models.

Further, the format conversion processing of the collected music data includes the following steps:

the format of music data is converted into a lossless waveform audio format, each point of real sound waves is sampled according to fixed frequency, and the conversion from real signals to analog signals is realized; the sampling frequency needs to satisfy the nyquist sampling law to ensure the original signal to be restored, and meanwhile, in order to improve the signal-to-noise ratio, the lossless waveform audio data needs to be preprocessed.

Further, the step of extracting the characteristic parameters of the music data by using mel frequency cepstrum coefficients for the music data comprises the following steps:

extracting characteristic parameters of the music data by utilizing the Mel cepstrum coefficient, and performing discrete Fourier transform and Mel cepstrum on the preprocessed data to obtain music characteristics.

Further, training and modeling the extracted music data characteristic parameter vector by using a logistic regression algorithm in machine learning, and predicting the classification of music genres by using a trained model comprises the following steps:

converting the characteristic parameters of the music data into vectors, normalizing the characteristics, writing a weight function by using a logistic regression algorithm in machine learning, and iterating by using a gradient descent algorithm by using a cross entropy loss function according to an activation function. And determining the model weight according to the minimized cross entropy loss function, and performing classification prediction.

The invention has the advantages that: the music genre classification method based on the machine learning algorithm is reasonable in setting, easy to operate, simple and understandable in applied algorithm, high in training running speed, and capable of greatly reducing manual intervention and improving the efficiency and accuracy of genre classification by combining the machine learning algorithm and the audio data.

Drawings

FIG. 1 is a flowchart of music genre classification steps based on logistic regression algorithm in machine learning according to the present invention

Detailed Description

In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention. The first embodiment is as follows: the following describes the present embodiment with reference to fig. 1, and the music genre classification method based on the logistic regression algorithm in machine learning according to the present embodiment mainly includes:

first, for each type of music data that is collected and preset, format conversion processing is performed on the collected music data.

Secondly, extracting characteristic parameters of the music data by utilizing the Mel cepstrum coefficient.

And finally, training and modeling the extracted music data characteristic parameter vectors by using a logistic regression algorithm in machine learning, and predicting the classification of music genres by using a trained model.

The second embodiment: the specific embodiment example of classifying genres by using music characteristic parameters based on a logistic regression algorithm in machine learning mainly includes:

Claims

1. A music genre classification method based on a logistic regression algorithm in machine learning. The method extracts characteristic parameters from various audios, performs characteristic engineering, constructs a music genre classification model by using a logistic regression algorithm, can classify different types of music, and mainly comprises the following steps:

step 1: collecting preset music data of various types, and carrying out format conversion processing on the collected music data;

step 2: extracting characteristic parameters of the music data by utilizing a Mel cepstrum coefficient;

and step 3: and training and modeling the extracted music data characteristic parameter vectors by using a logistic regression algorithm in machine learning, and predicting the classification of music genres by using the trained models.

2. The method for classifying music genres based on machine learning according to claim 1, wherein the step 1, music data format processing in a sample for collecting music of a preset genre, further comprises: the format of music data is converted into a lossless waveform audio format, each point of real sound waves is sampled according to fixed frequency, and the conversion from real signals to analog signals is realized; the sampling frequency needs to satisfy the Nyquist sampling law, and the original signal is ensured to be restored; meanwhile, in order to improve the signal-to-noise ratio, the lossless waveform audio data needs to be preprocessed, and the preprocessing is approximately expressed by the following formula:

y(t)＝x(t)-α*x(t-1)

3. the method for classifying music genre based on machine learning according to claim 1, wherein said step 2 extracts feature parameters of music data by using mel frequency cepstrum coefficients, and said method further comprises: performing discrete Fourier transform and Mel cepstrum on the preprocessed data to obtain music characteristics; mel cepstral coefficients are cepstral parameters extracted in the Mel-scale frequency domain, the Mel-scale describes the non-linear characteristic of human ear frequency, and the common frequency scale is converted into Mel-frequency scale, wherein the mapping relationship is as follows:

Mel(f)＝2595*lg(1+f/700)

by this mapping relationship, the sensitivity of the human ear to frequency becomes linear under the Mel scale.

4. The method for classifying music genres according to claim 1, wherein in step 3, the extracted music data feature parameters are trained by using a machine learning algorithm to obtain classifications of music genres, and the method further comprises: converting the characteristic parameters of the music data into vectors, carrying out normalization processing on the characteristics, writing a weight function by utilizing a logistic regression algorithm in machine learning, and adopting a cross entropy loss function according to an activation function:

L(Y,P(Y|X))＝-logP(Y|X)

and (4) iterating by using a gradient descent algorithm, determining the weight of the model according to the minimized cross entropy loss function, and performing classified prediction. Collecting data from an audio file, selecting extracted features and models according to the characteristics of the collected data, intercepting a part of data to train a classifier, and finally adjusting and finally determining the parameters of the classifier according to the test result.