KR102028164B1

KR102028164B1 - producing system for partial sound data made by meaningful units and producing method for partial sound data made by meaningful units using the same

Info

Publication number: KR102028164B1
Application number: KR1020170083339A
Authority: KR
Inventors: 김준호; 윤창호; 박종원
Original assignee: (주)주스
Priority date: 2017-06-30
Filing date: 2017-06-30
Publication date: 2019-10-08
Also published as: KR20190002978A

Abstract

The present invention relates to a semantic unit partial sound source generation system that generates a plurality of short partial sound sources by cutting a single piece of sound source data at points where the cut sounds natural to the user.
To this end, the present invention provides a semantic unit partial sound source generation system for generating a plurality of meaningful partial sound sources from sound source data, the system including: an rms extraction module for calculating the total rms of the entire sound source data loaded into the system; a storage unit for storing an rms threshold value, a cut threshold value, a cell time, and a basic time; a cell generation module for dividing the sound source data according to the cell time to generate a plurality of cells; and a control unit.
The rms extraction module calculates a cell rms for each of the plurality of cells, and the control unit compares each cell rms with the rms threshold value and classifies a cell as a divided cell if its cell rms is smaller than the rms threshold value. When the number of consecutive divided cells is equal to or greater than the cut threshold value, the index value of one of the consecutive divided cells is stored as a cut index, and the portion of the sound source data up to the first cut index stored after the basic time is extracted and generated as a semantic unit partial sound source.
Therefore, according to the present invention, the user can listen to the main parts of the music in a short time, which is advantageous for music promotion, and users can enjoy the important parts of the music naturally without a sense of incongruity, which increases their satisfaction.

Description

Producing system for partial sound data made by meaningful units and producing method for partial sound data made by meaningful units using the same

The present invention relates to a semantic unit partial sound source generation system and a method of generating a semantic unit partial sound source using the same, and specifically to a semantic unit partial sound source generation system that generates a plurality of short partial sound sources by cutting a sound source at points where the cut sounds natural to the user.

Now that the era of listening to music on physical storage media such as CDs or LPs has passed, many people listen to music using sound source data files such as MP3 files. These sound source data files can be downloaded at any time via a terminal such as a smartphone or a PC and listened to immediately, which makes them convenient to use.

Meanwhile, such sound source data files may be downloaded and used personally, but they are also used as background music on blogs or social networking services. Because users move between web pages quickly, the page is often changed before the music reaches its climax unless the user is there specifically to listen to music. As a result, the music is not promoted properly, and users often hear only the opening of the music and miss its important parts.

Korean Patent No. 10-1504522 describes a method in which a database for music retrieval is created and stored, a plurality of melodies are divided and stored based on the points at which the lyrics break, and a melody similar to a query received from the user is detected among the stored melodies.

However, this method cannot be applied to music without lyrics because lyrics must be included in the sound source, and it is not a completely automated method because the node information and lyrics information must be stored in advance.

The present invention has been made to solve the above problems, and an object of the present invention is to generate a plurality of semantic unit partial sound sources so that the user can naturally listen to the important parts of the music without a sense of incongruity, regardless of whether the music has lyrics.

Another object is to provide a system that generates a plurality of semantic unit partial sound sources by analyzing the sound source data itself, without additional data input.

To achieve the above objects, the present invention provides a semantic unit partial sound source generation system for generating a plurality of meaningful partial sound sources using sound source data, the system including:

an rms extraction module for calculating the total rms of the entire sound source data using the sound source data loaded into the semantic unit partial sound source generation system; a storage unit for storing an rms threshold value, a cut threshold value, a cell time, and a basic time; a cell generation module for dividing the sound source data according to the cell time to generate a plurality of cells; and a control unit, wherein the rms extraction module calculates a cell rms for each of the plurality of cells,

the control unit compares the cell rms with the rms threshold value, classifies a cell as a divided cell if its cell rms is smaller than the rms threshold value, and, when the number of consecutive divided cells is equal to or greater than the cut threshold value, stores the index value of one of the consecutive divided cells as a cut index, and

the portion of the sound source data up to the first cut index stored after the basic time is extracted to generate a semantic unit partial sound source.

The semantic unit partial sound source generation system preferably further includes a finishing module, and the finishing module preferably fades in the front part of the semantic unit partial sound source and fades out the rear part of the semantic unit partial sound source.

According to the present invention, there is also provided a method of generating a semantic unit partial sound source, the method including:

loading a sound source data file into the semantic unit partial sound source generation system; storing an rms threshold value, a cut threshold value, a cell time, and a basic time in a storage unit; calculating, by an rms extraction module, the total rms of the sound source data; generating, by a cell generation module, a plurality of cells by dividing the sound source data according to the cell time stored in the storage unit; calculating, by the rms extraction module, a cell rms for each cell;

comparing, by a control unit, the cell rms with the rms threshold value, classifying the corresponding cell as a divided cell if its cell rms is smaller than the rms threshold value, and, when the number of consecutive divided cells is equal to or greater than the cut threshold value, storing the index value of one of the consecutive divided cells as a cut index; and

extracting, from the sound source data, the portion up to the first cut index stored after the basic time to generate a semantic unit partial sound source.

According to the present invention, the user can listen to the main parts of the music in a short time, which is advantageous for music promotion, and users can enjoy the important parts of the music naturally without a sense of incongruity, which increases their satisfaction.

FIG. 1 is a block diagram showing the configuration of a semantic unit partial sound source generation system according to the present invention;
FIG. 2 is an illustration showing normalized rms energy (volume) over time (ms);
FIG. 3 is an exemplary diagram illustrating the rms threshold and cell division;
FIG. 4 is an explanatory diagram for explaining the fade in process performed by the finishing module;
FIG. 5 is an exemplary view showing an example of a web server using the semantic unit partial sound source generation system according to the present invention;
FIG. 6 is a flowchart illustrating a method of generating a semantic unit partial sound source according to the present invention.

The configuration and operation of the embodiments of the present invention will be described in detail with reference to the accompanying drawings.

Referring to FIG. 1, the semantic unit partial sound source generation system 100 according to the present invention includes an rms extraction module 110, a cell generation module 130, a storage unit 150, a control unit 170, and a finishing module 190. The semantic unit partial sound source generation system 100 may be provided in the form of a server.

When the sound source data file is loaded into the server, the rms extraction module 110 calculates the total rms of the sound source data. The total rms of the sound source data means an average energy value of the whole sound source data, and is calculated as follows.
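The formula itself is not reproduced in this text. Assuming the standard root-mean-square definition, which is consistent with the symbols defined next, it would read as follows (the name x_rms is an assumption introduced here for illustration):

```latex
x_{\mathrm{rms}} = \sqrt{\frac{1}{n}\left(x_1^2 + x_2^2 + \cdots + x_n^2\right)}
```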

Here, x_n denotes the amplitude of a sampled sound signal and n denotes the number of samples.

When the total rms is calculated, the rms extraction module 110 stores the value of 20% of the total rms as the rms threshold value in the storage unit 150.

For example, if the normalized total rms shown in FIG. 2 and FIG. 3 is calculated as 0.8, the rms extraction module stores 0.16, which is 20% of the total rms, in the storage unit as the rms threshold. In this embodiment, the critical ratio applied to the total rms to obtain the rms threshold is set to 20%.
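The threshold calculation in this example can be sketched as follows. This is a minimal illustration and not the patented implementation: it assumes the total rms is the standard root mean square of the samples, and the function names total_rms and rms_threshold are hypothetical.

```python
import math

RMS_THRESHOLD_RATIO = 0.20  # 20% critical ratio used in this embodiment


def total_rms(samples):
    """Root mean square of a sequence of amplitude samples (assumed definition)."""
    if not samples:
        raise ValueError("samples must not be empty")
    return math.sqrt(sum(x * x for x in samples) / len(samples))


def rms_threshold(total, ratio=RMS_THRESHOLD_RATIO):
    """rms threshold as a fixed fraction of the total rms."""
    return total * ratio


# Synthetic signal whose amplitude is always +/-0.8, so its total rms is 0.8.
samples = [0.8, -0.8] * 100
total = total_rms(samples)
threshold = rms_threshold(total)
print(round(total, 6), round(threshold, 6))
```

With a total rms of 0.8, the threshold comes out as 0.16, matching the figure given in the text.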

Meanwhile, cell time is stored in the storage unit, and the cell generation module 130 generates a plurality of cells by dividing sound source data according to cell time.

In this embodiment, the cell time is set to 100 ms, and the cell generation module 130 divides the sound source data into a plurality of cells in units of 100 ms.

The rms extraction module 110 calculates cell rms for each cell. The cell rms may be calculated in the same manner as the total rms, and the calculated cell rms may be stored in the storage unit.
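The cell division and per-cell rms described above might look like the following sketch. The sample rate, the treatment of a shorter final cell, and the function names are assumptions made for illustration; the text only specifies the 100 ms cell time and that the cell rms is computed in the same way as the total rms.

```python
import math

CELL_TIME_MS = 100  # cell time used in this embodiment


def split_into_cells(samples, sample_rate, cell_time_ms=CELL_TIME_MS):
    """Split samples into consecutive cells of cell_time_ms each.

    The final cell may be shorter than the others (an assumption; the
    text does not say how a trailing remainder is handled).
    """
    cell_len = sample_rate * cell_time_ms // 1000
    if cell_len <= 0:
        raise ValueError("cell time is too short for this sample rate")
    return [samples[i:i + cell_len] for i in range(0, len(samples), cell_len)]


def rms(samples):
    """Root mean square of one cell, computed like the total rms."""
    return math.sqrt(sum(x * x for x in samples) / len(samples))


sample_rate = 1000  # hypothetical low rate that keeps the example small
samples = [0.5] * 100 + [0.0] * 100 + [1.0] * 50
cells = split_into_cells(samples, sample_rate)
cell_rms = [rms(cell) for cell in cells]
```

At 1000 samples per second each 100 ms cell holds 100 samples, so the 250 samples here yield three cells, the last one half-length.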

The control unit 170 compares each calculated cell rms with the rms threshold, classifies a cell as a divided cell when its cell rms is smaller than the rms threshold, stores the divided cells, and counts how many divided cells occur consecutively.

In addition, the storage unit 150 stores the cut threshold value, and when the number of consecutive divided cells is equal to or greater than the cut threshold value, the control unit designates the index value of one of the consecutive divided cells as the cut index. In this case, the control unit preferably designates the index value of the divided cell located in the middle of the consecutive divided cells as the cut index and stores it in the storage unit.

In the present embodiment, the cut threshold value is set to 5, and when five or more consecutive divided cells are found, the control unit designates the index value of one of those divided cells as the cut index and stores it in the storage unit.
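The run detection described above can be sketched as follows. This is an illustrative reading only: the function name find_cut_indexes is hypothetical, and picking the middle cell by integer division is one possible interpretation of "located in the middle".

```python
CUT_THRESHOLD = 5  # cut threshold value used in this embodiment


def find_cut_indexes(cell_rms, rms_threshold, cut_threshold=CUT_THRESHOLD):
    """Return the middle index of every run of >= cut_threshold divided cells.

    A divided cell is a cell whose rms is smaller than rms_threshold.
    """
    cut_indexes = []
    run_start = None
    # The trailing sentinel closes a run that extends to the last cell.
    for i, value in enumerate(list(cell_rms) + [float("inf")]):
        if value < rms_threshold:
            if run_start is None:
                run_start = i
        elif run_start is not None:
            run_len = i - run_start
            if run_len >= cut_threshold:
                cut_indexes.append(run_start + run_len // 2)
            run_start = None
    return cut_indexes


# Cells 3-7 (5 quiet cells) and 16-21 (6 quiet cells) qualify;
# cells 12-13 (2 quiet cells) form a run that is too short.
cell_rms = [0.5] * 3 + [0.05] * 5 + [0.6] * 4 + [0.1] * 2 + [0.7] * 2 + [0.0] * 6
cuts = find_cut_indexes(cell_rms, rms_threshold=0.16)
print(cuts)  # prints [5, 19]
```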

The control unit may change the cut threshold value based on the total playing time of the sound source data. Specifically, if the total playing time is 600 seconds, the total playing time is divided by 60 seconds, which is the reference partial sound source time, giving 10. The range from 8 (this result minus 2) to 10 then becomes the cut index valid range.

The controller increases the cut threshold when the number of cut indexes previously specified is larger than the cut index valid range, and decreases the cut threshold when the number of cut indexes previously specified is smaller than the cut index valid range. Therefore, the semantic unit partial sound source generation system can generate a semantic unit partial sound source having an appropriate play time.
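A small sketch of this adjustment rule, under stated assumptions: the text does not say by how much the cut threshold changes, so the step of 1 is hypothetical, as are the function names and the lower limit of 1 on the threshold.

```python
REFERENCE_PART_SECONDS = 60  # reference partial sound source time
RANGE_WIDTH = 2              # valid range spans (quotient - 2) to quotient


def cut_index_valid_range(total_seconds):
    """Valid range for the number of cut indexes, e.g. 600 s -> (8, 10)."""
    upper = int(total_seconds // REFERENCE_PART_SECONDS)
    lower = max(upper - RANGE_WIDTH, 0)
    return lower, upper


def adjust_cut_threshold(cut_threshold, num_cut_indexes, total_seconds, step=1):
    """Raise the threshold when there are too many cut indexes, lower it when too few.

    The step size is an assumption; the text only gives the direction.
    """
    lower, upper = cut_index_valid_range(total_seconds)
    if num_cut_indexes > upper:
        return cut_threshold + step
    if num_cut_indexes < lower:
        return max(cut_threshold - step, 1)
    return cut_threshold


print(cut_index_valid_range(600))        # prints (8, 10)
print(adjust_cut_threshold(5, 14, 600))  # prints 6
```

Raising the cut threshold demands longer quiet runs and therefore yields fewer cut indexes, which is why the direction of each adjustment follows the rule in the text.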

Meanwhile, the rms threshold may be set again according to the number of split cells. That is, the control unit calculates the ratio of the divided cells in the total number of cells and resets the rms threshold value when not in a predetermined range (for example, 15% to 25%).

Specifically, if the ratio of divided cells is 12%, it is below the 15% lower bound of the reference range, so the rms threshold is adjusted. The 3 percentage point difference between the 15% lower bound and the actual divided cell ratio of 12% is added to the 20% used to calculate the rms threshold in the previous stage, so that 23% of the total rms is calculated as the new rms threshold.

Based on the calculated new rms threshold value, the controller 170 classifies and stores newly divided cells. That is, the controller performs a new calculation to increase the ratio of the rms threshold when the ratio of the split cells is less than the reference range and to decrease the ratio of the rms threshold when the ratio of the split cells is greater than the reference range.
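The ratio adjustment can be sketched as below. The worked example in the text covers only the case where the divided cell ratio falls below the range; lowering the ratio by the overshoot above the upper bound is a symmetric assumption made here, and the function name is hypothetical.

```python
REFERENCE_RANGE = (0.15, 0.25)  # example reference range for the divided cell ratio


def adjust_threshold_ratio(current_ratio, divided_ratio, reference_range=REFERENCE_RANGE):
    """Return the new fraction of total rms to use as the rms threshold."""
    low, high = reference_range
    if divided_ratio < low:
        # Too few divided cells: raise the ratio by the shortfall.
        return current_ratio + (low - divided_ratio)
    if divided_ratio > high:
        # Too many divided cells: lower the ratio by the overshoot (assumed).
        return current_ratio - (divided_ratio - high)
    return current_ratio


new_ratio = adjust_threshold_ratio(0.20, 0.12)
print(round(new_ratio, 4))  # prints 0.23
```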

Meanwhile, the storage unit stores the basic time, and the control unit finds the first cut index designated after the basic time has elapsed and generates a semantic unit partial sound source from the beginning of the sound source data to the position of that cut index.

From that cut index onward, the process of finding the first cut index designated after the basic time has elapsed is repeated to generate a plurality of semantic unit partial sound sources. In the present embodiment, the basic time is set to 25 seconds, and the control unit generates each semantic unit partial sound source based on the first cut index designated after 25 seconds.
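One way to read this repeated extraction is sketched below. It assumes that a cut index maps to the start time of its cell (index times cell time) and that "after the basic time" means strictly more than the basic time since the previous cut; both are interpretations, and segment_boundaries is a hypothetical name.

```python
CELL_TIME_MS = 100      # cell time used in this embodiment
BASIC_TIME_MS = 25_000  # basic time of 25 seconds used in this embodiment


def segment_boundaries(cut_indexes, cell_time_ms=CELL_TIME_MS, basic_time_ms=BASIC_TIME_MS):
    """Return (start_ms, end_ms) pairs for the semantic unit partial sound sources.

    Each segment ends at the first cut index lying more than basic_time_ms
    after the segment start; the next segment starts at that cut.
    """
    segments = []
    start_ms = 0
    for index in sorted(cut_indexes):
        cut_ms = index * cell_time_ms
        if cut_ms - start_ms > basic_time_ms:
            segments.append((start_ms, cut_ms))
            start_ms = cut_ms
    return segments


# Cuts at 10 s, 26 s, 40 s, 53 s and 90 s.
segments = segment_boundaries([100, 260, 400, 530, 900])
print(segments)  # prints [(0, 26000), (26000, 53000), (53000, 90000)]
```

The cuts at 10 s and 40 s are skipped because they fall within 25 seconds of the preceding segment start.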

When the semantic unit partial sound source is generated, the fade in and fade out processing is performed for each semantic unit partial sound source by the finishing module 190.

The fade in means gradually increasing the volume values, and the fade out means gradually decreasing the volume values.

Referring to FIG. 4, in the present invention, the fade in is performed within a preset fade in time, and in this embodiment, the fade in time is set to 1.4 seconds.

When the fade in is performed, the finishing module 190 ignores the divided cells, whose rms is smaller than the rms threshold, and fades in only the cells whose rms is equal to or greater than the rms threshold.

In addition, the fade in is performed by adjusting the volume of each cell relative to the rms threshold, which prevents the volume value from becoming smaller than the rms threshold. Furthermore, since the divided cells are all excluded from the fade in and the fade out, both can be performed at a very high speed.

That is, when a cell whose rms is equal to or greater than the rms threshold appears along the time axis, its sound source data is adjusted by subtracting from the volume of the cell 50% of the difference between the cell volume and the rms threshold, and the following cells are adjusted in the same manner. At this time, it is preferable to reduce the proportion subtracted from each cell to 40%, 30%, 20%, 10%, and so on along the time axis. The fade out is performed in the same way as the fade in.
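The per-cell fade in can be sketched as a list of gain factors, one per cell. This is an interpretation, not the patented implementation: it treats the cell rms as the cell volume, applies the 50%, 40%, 30%, 20%, 10% sequence only to cells at or above the threshold, and does not model the 1.4 second fade in window. The name fade_in_gains is hypothetical.

```python
FADE_RATIOS = (0.5, 0.4, 0.3, 0.2, 0.1)  # proportions named in the text


def fade_in_gains(cell_rms, rms_threshold, ratios=FADE_RATIOS):
    """Return a gain per cell; multiplying a cell's samples by it applies the fade in.

    Divided cells (rms below the threshold) keep a gain of 1.0 and do not
    consume a step of the ratio sequence.
    """
    gains = [1.0] * len(cell_rms)
    steps = iter(ratios)
    for i, volume in enumerate(cell_rms):
        if volume < rms_threshold or volume == 0:
            continue  # divided (or silent) cell: ignored by the fade in
        ratio = next(steps, None)
        if ratio is None:
            break  # ratio sequence exhausted: the rest plays at full volume
        # Pull the volume toward the threshold, never below it.
        target = volume - ratio * (volume - rms_threshold)
        gains[i] = target / volume
    return gains


cell_rms = [0.1, 0.56, 0.56]
gains = fade_in_gains(cell_rms, rms_threshold=0.16)
```

Because each target volume is the threshold plus a non-negative share of the excess, a faded cell never drops below the rms threshold, which is the property the text describes.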

With the semantic unit partial sound source generation system according to the present invention, background music can be transmitted while the user is viewing a web page, and a semantic unit partial sound source containing a repeated chorus or climax, rather than the beginning of the music, can be generated automatically and streamed.

In addition, when another page is transmitted at the user's request, the background music may be changed and other music may be transmitted.

Referring to FIG. 5, the server may transmit background music along with a web page, and the background music transmitted is a semantic unit partial sound source according to the present invention. Preferably, the semantic unit partial sound source has a play time of about 1 minute.

A plurality of semantic unit partial sound sources generated from one song correspond to respective number buttons. Therefore, the user can listen to each semantic unit partial sound source by pressing the corresponding number button and can hear the main parts of a piece of music in a short time, which is an effective means of promotion for the music provider and lets the user quickly appreciate the overall composition of the music.

Although the present invention has been described above with reference to its embodiments, those skilled in the art will appreciate that the present invention may be variously modified and changed without departing from the spirit and scope of the present invention as set forth in the claims below.

100: semantic unit partial sound source generation system
110: rms extraction module
130: cell generation module
150: storage unit
170: control unit
190: finishing module

Claims

A semantic unit partial sound source generation system for generating a plurality of meaningful partial sound sources using sound source data, the system comprising:
an rms extraction module for calculating the total rms of the entire sound source data using the sound source data loaded into the semantic unit partial sound source generation system;
a storage unit for storing an rms threshold value, a cut threshold value, a cell time, and a basic time;
a cell generation module for dividing the sound source data according to the cell time to generate a plurality of cells; and
a control unit,
wherein the rms extraction module calculates a cell rms for each of the plurality of cells,
wherein the control unit compares the cell rms with the rms threshold value, classifies a cell as a divided cell when its cell rms is smaller than the rms threshold value, and, when the number of consecutive divided cells is equal to or greater than the cut threshold value, stores the index value of one of the consecutive divided cells as a cut index, and
wherein the portion of the sound source data up to the first cut index stored after the basic time is extracted to generate a semantic unit partial sound source.

The system of claim 1,
wherein the semantic unit partial sound source generation system further includes a finishing module, and the finishing module fades in the front portion of the semantic unit partial sound source and fades out the rear portion of the semantic unit partial sound source.

Loading a sound source data file into the semantic unit partial sound source generation system;
storing an rms threshold value, a cut threshold value, a cell time, and a basic time in a storage unit;
calculating, by an rms extraction module, the total rms of the sound source data;
generating, by a cell generation module, a plurality of cells by dividing the sound source data according to the cell time stored in the storage unit;
calculating, by the rms extraction module, a cell rms for each cell;
comparing, by a control unit, the cell rms with the rms threshold value, classifying the corresponding cell as a divided cell if its cell rms is smaller than the rms threshold value, and, when the number of consecutive divided cells is equal to or greater than the cut threshold value, storing the index value of one of the consecutive divided cells as a cut index; and
extracting, from the sound source data, the portion up to the first cut index to generate a semantic unit partial sound source.