KR20120109763A - Apparatus and method for analyzing information of polyphonic sound source using neural computer - Google Patents
Apparatus and method for analyzing information of polyphonic sound source using neural computer Download PDFInfo
- Publication number
- KR20120109763A KR20120109763A KR1020110027308A KR20110027308A KR20120109763A KR 20120109763 A KR20120109763 A KR 20120109763A KR 1020110027308 A KR1020110027308 A KR 1020110027308A KR 20110027308 A KR20110027308 A KR 20110027308A KR 20120109763 A KR20120109763 A KR 20120109763A
- Authority
- KR
- South Korea
- Prior art keywords
- music
- harmony
- neural network
- sound source
- voice
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Computing Systems (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Life Sciences & Earth Sciences (AREA)
- Molecular Biology (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- Auxiliary Devices For Music (AREA)
- Electrophonic Musical Instruments (AREA)
Abstract
Description
The present invention relates to music information retrieval. More particularly, the present invention analyzes music information on multiple sound sources (hereinafter referred to as music) including a single source sound source (monophonic) to generate a note. The present invention relates to an apparatus and method for analyzing music information of multiple sound sources using neural network computing.
In general, music information analysis refers to an analysis of digitized music data through algorithms to obtain information on elements (pitch, chord, voice, etc.) that make up music. Most MIR techniques have been mainly studied on key analysis, chord analysis, and source partitioning through rule-based algorithms, but it is very difficult to commercialize them because of their low accuracy when applied to off-sample music. It is.
In addition, conventional music information analysis systems are often configured for a single purpose only, and there is no system for analyzing overall information on music.
Therefore, the problem to be solved by the present invention in order to solve the above-described problems, by applying a neural network (Neural-network) technology to machine (computer) by itself to improve the accuracy of music information analysis, the overall information (music) It is to provide an apparatus and method for analyzing music information of multiple sound sources using neural network computing, which can analyze voice-multiplexed sound as a single chord-extraction, rhythm, pitch, and chord.
In addition, another technical problem to be solved by the present invention is to provide an apparatus and method for analyzing music information of multiple sound sources using neural network computing, which enables composition of music scores from unspecified music.
Problems to be solved by the present invention are not limited to the above-mentioned problems, and other problems not mentioned will be clearly understood by those skilled in the art from the following description.
According to an aspect of the present invention, there is provided an apparatus and method for analyzing music information of multiple sound sources using neural network computing, comprising: a first process of performing neural network learning for voice extraction; A second process of neural network learning for chord pattern recognition; A third step of extracting a polyphonic into a monophonic; A fourth step of analyzing a pitch of each extracted voice; A fifth step of analyzing the chord from the multiple chord sound source; And a sixth step of reconstructing the analyzed data into the music score file.
Specific details of other embodiments are included in the detailed description and the drawings.
According to the apparatus and method for analyzing music information of multiple sound sources using neural network computing according to an embodiment of the present invention as described above, one or more of the following effects exist.
First, since the basic information for composing music using music can be obtained, it can be used in various applications.
Second, the rhythm information may be applied to a rhythm action game or the chord information may be applied to a performance program.
Third, it can be used in music education programs using the generated sheet music.
The effects of the present invention are not limited to the effects mentioned above, and other effects not mentioned can be clearly understood by those skilled in the art from the description of the claims.
1 is a block diagram of an apparatus for analyzing music information according to an embodiment of the present invention.
Advantages and features of the present invention, and methods of achieving the same will become apparent with reference to the embodiments described below in detail in conjunction with the accompanying drawings. However, it is to be understood that the present invention is not limited to the disclosed embodiments, but may be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. It is intended that the disclosure of the present invention be limited only by the terms of the appended claims.
Also, the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. In the present specification, the singular form includes plural forms unless otherwise specified in the specification. As used herein, "comprises" and / or "comprising" does not exclude the presence or addition of components other than the mentioned components. Unless otherwise defined, all terms (including technical and scientific terms) used in the present specification may be used in a sense that can be commonly understood by those skilled in the art.
Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings in order to describe the present invention in more detail. Like reference numerals refer to like elements throughout.
1 is a block diagram illustrating an apparatus for analyzing music information of the present invention, wherein the apparatus for analyzing music information of the present invention is mounted on a terminal capable of driving a software, for example, a PC, a PDA, a mobile phone, or the like in hardware or software. .
The music information analyzer is composed of voice extractors (1, 3, 4, 5), chord extractors (2, 6, 7, 8, 9), and an output unit (10, 11) composed of scores using analyzed data. It consists of
Each neural network must be trained using sample data to determine neural network coefficients. This requires more learning for higher accuracy.
When a music file (MOV / MP3 / MP4, etc.) is input, the music information analyzer first determines a block size of PCM data through bit analysis (1), and then finds a key used for each music section (2).
After each voice is normalized (3) with data using the C-major as the main key, each voice is extracted using the learned neural network (4).
The pitch of the extracted single chord voice is analyzed (5).
In the chord extraction section, autocorrelation coefficient data is determined from a given multiple chord sound source (6), and a pitch class profile is constructed (7).
Obtaining the PCP Data Configured as Key Key Values The continuous chord (9) data is generated using the comparison with the reference PCP (8) and the neural network.
The single
As described above, the present invention has been described with reference to the embodiments shown in the drawings, but it is only for the purpose of describing the present invention, and those skilled in the art to which the present invention pertains various modifications or equivalents from the detailed description of the invention. It will be appreciated that one embodiment is possible. Therefore, the true scope of the present invention should be determined by the technical spirit of the claims.
Claims (1)
A second process of neural network learning for chord pattern recognition;
A third step of extracting a polyphonic into a monophonic;
A fourth step of analyzing a pitch of each extracted voice;
A fifth step of analyzing the chord from the multiple chord sound source;
And a sixth process of reconstructing the analyzed data into the music score file, wherein the music information analysis method of the multiple sound sources using neural network computing.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020110027308A KR20120109763A (en) | 2011-03-28 | 2011-03-28 | Apparatus and method for analyzing information of polyphonic sound source using neural computer |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020110027308A KR20120109763A (en) | 2011-03-28 | 2011-03-28 | Apparatus and method for analyzing information of polyphonic sound source using neural computer |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20120109763A true KR20120109763A (en) | 2012-10-09 |
Family
ID=47280836
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020110027308A KR20120109763A (en) | 2011-03-28 | 2011-03-28 | Apparatus and method for analyzing information of polyphonic sound source using neural computer |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR20120109763A (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107103908A (en) * | 2017-05-02 | 2017-08-29 | 大连民族大学 | The application of many pitch estimation methods of polyphony and pseudo- bispectrum in multitone height estimation |
WO2020181782A1 (en) * | 2019-03-12 | 2020-09-17 | 腾讯音乐娱乐科技(深圳)有限公司 | Audio data processing method and device, and computer storage medium |
CN112017621A (en) * | 2020-08-04 | 2020-12-01 | 河海大学常州校区 | LSTM multi-track music generation method based on alignment harmony relationship |
KR20210059301A (en) | 2019-11-15 | 2021-05-25 | 김병헌 | Music Search System and Music Search Method Using the Same |
-
2011
- 2011-03-28 KR KR1020110027308A patent/KR20120109763A/en not_active Application Discontinuation
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107103908A (en) * | 2017-05-02 | 2017-08-29 | 大连民族大学 | The application of many pitch estimation methods of polyphony and pseudo- bispectrum in multitone height estimation |
WO2020181782A1 (en) * | 2019-03-12 | 2020-09-17 | 腾讯音乐娱乐科技(深圳)有限公司 | Audio data processing method and device, and computer storage medium |
KR20210059301A (en) | 2019-11-15 | 2021-05-25 | 김병헌 | Music Search System and Music Search Method Using the Same |
CN112017621A (en) * | 2020-08-04 | 2020-12-01 | 河海大学常州校区 | LSTM multi-track music generation method based on alignment harmony relationship |
CN112017621B (en) * | 2020-08-04 | 2024-05-28 | 河海大学常州校区 | LSTM multi-track music generation method based on alignment and sound relation |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP3723080A1 (en) | Music classification method and beat point detection method, storage device and computer device | |
KR100671505B1 (en) | Method for classifying a music genre and recognizing a musical instrument signal using bayes decision rule | |
JP6732296B2 (en) | Audio information processing method and device | |
CN108630193A (en) | Audio recognition method and device | |
CN109308912B (en) | Music style recognition method, device, computer equipment and storage medium | |
US20210366488A1 (en) | Speaker Identification Method and Apparatus in Multi-person Speech | |
Zapata et al. | Multi-feature beat tracking | |
Kaleem et al. | Pathological speech signal analysis and classification using empirical mode decomposition | |
CN106997769B (en) | Trill recognition method and device | |
CN109801646A (en) | Voice endpoint detection method and device based on fusion features | |
KR20120109763A (en) | Apparatus and method for analyzing information of polyphonic sound source using neural computer | |
Zhang et al. | Speech emotion recognition using combination of features | |
CN112671985A (en) | Agent quality inspection method, device, equipment and storage medium based on deep learning | |
CN108021635A (en) | The definite method, apparatus and storage medium of a kind of audio similarity | |
CN110782915A (en) | Waveform music component separation method based on deep learning | |
CN112309372A (en) | Tone-based intention identification method, device, equipment and storage medium | |
Lu et al. | Metric learning based data augmentation for environmental sound classification | |
CN104731891A (en) | Method for extracting mass data in ETL (extract transform load) | |
Wiem et al. | Unsupervised single channel speech separation based on optimized subspace separation | |
Lopatka et al. | Acceleration of decision making in sound event recognition employing supercomputing cluster | |
Südholt et al. | Pruning deep neural network models of guitar distortion effects | |
CN111477248B (en) | Audio noise detection method and device | |
CN105006231A (en) | Distributed large population speaker recognition method based on fuzzy clustering decision tree | |
CN110910905B (en) | Mute point detection method and device, storage medium and electronic equipment | |
CN111508530A (en) | Speech emotion recognition method, device and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WITN | Withdrawal due to no request for examination |