KR102028164B1 - producing system for partial sound data made by meaningful units and producing method for partial sound data made by meaningful units using the same - Google Patents

producing system for partial sound data made by meaningful units and producing method for partial sound data made by meaningful units using the same Download PDF

Info

Publication number
KR102028164B1
KR102028164B1 KR1020170083339A KR20170083339A KR102028164B1 KR 102028164 B1 KR102028164 B1 KR 102028164B1 KR 1020170083339 A KR1020170083339 A KR 1020170083339A KR 20170083339 A KR20170083339 A KR 20170083339A KR 102028164 B1 KR102028164 B1 KR 102028164B1
Authority
KR
South Korea
Prior art keywords
sound source
rms
cell
source data
partial sound
Prior art date
Application number
KR1020170083339A
Other languages
Korean (ko)
Other versions
KR20190002978A (en
Inventor
김준호
윤창호
박종원
Original Assignee
(주)주스
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by (주)주스 filed Critical (주)주스
Priority to KR1020170083339A priority Critical patent/KR102028164B1/en
Publication of KR20190002978A publication Critical patent/KR20190002978A/en
Application granted granted Critical
Publication of KR102028164B1 publication Critical patent/KR102028164B1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Library & Information Science (AREA)
  • Theoretical Computer Science (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Reverberation, Karaoke And Other Acoustics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention relates to a semantic unit partial sound source generation system that generates a plurality of short partial sound sources by cutting off portions of one sound source data that are easy for a user to hear.
To this end, the present invention in the semantic unit partial sound source generation system for generating a plurality of meaningful partial sound sources using the sound source data,
An rms extraction module for calculating total rms of the entire sound source data using the sound source data loaded into the semantic unit partial sound source generation system; a storage unit for storing the rms threshold value, cut threshold value, cell time, and basic time; A cell generation module for dividing the sound source data according to the cell time to generate a plurality of cells; The rms extraction module calculates cell rms for each of the plurality of cells, and the control unit compares the cell rms and rms threshold values, and if the cell rms is smaller than the rms threshold value, the cell is classified as a divided cell. When more than the cut threshold value in succession, the index value of one of the consecutive divided cells is stored as a cut index, and the portion stored as the first cut index after the basic time among the sound source data is generated as a semantic unit partial sound source. Provides a semantic unit partial sound source generation system.
Therefore, according to the present invention, the user can listen to the main part of the music for a short time is advantageous to the music promotion and users can enjoy the important part of the music naturally without dissimilarity, and the satisfaction is increased.

Description

Producing system for partial sound data made by meaningful units and producing method for partial sound data made by meaningful units using the same}

The present invention relates to a semantic unit partial sound source generation system and a method of generating a semantic unit partial sound source using the same, and specifically, a semantic unit partial sound source generation system that generates a plurality of short partial sound sources by cutting off a portion of a sound source that is easy for a user to hear. It is about.

After the age of listening to music using physical storage media such as CD or LP, many people are listening to music using music data files such as MP3. These sound source data files can be downloaded at any time via a terminal such as a smartphone or a PC, and can be directly listened to, so that they can be conveniently used.

On the other hand, such sound source data files may be personally downloaded and used, but are also used as background music on a blog or SNS online. At this time, since users change and move web pages in a short time, they are often changed before the music reaches its peak unless it is for the purpose of listening to music. Because of this, they do not properly promote their music, and only the front pole part is heard, and users often hear only the front part of the music, so they do not listen to the important part, so many cases occur.

According to Korean Patent No. 10-1504522, a database for searching for music is created and stored, and a plurality of melodies are divided and stored based on a point to be cut off from the lyrics, and the user is divided among the stored melodies. A method for detecting a melody similar to a query received from the present invention is described.

However, this method cannot be applied to music without lyrics because lyrics must be included in the sound source, and it is not a completely automated method because the node information and lyrics information must be stored in advance.

The present invention has been made to solve the above problems, and to generate a plurality of semantic unit partial sound sources so that the user can naturally listen to the important parts of the music without feeling heterogeneous, regardless of whether the music has lyrics or not.

In addition, to provide a system for generating a plurality of semantic unit partial sound sources by analyzing the sound source data by themselves without additional data input.

For the above purpose, the present invention provides a semantic unit partial sound source generation system for generating a plurality of meaningful partial sound sources using sound source data,

An rms extraction module for calculating total rms of all sound source data using sound source data loaded into the semantic unit partial sound source generation system; a storage unit for storing the rms threshold value, cut threshold value, cell time, and basic time; A cell generation module for dividing the sound source data according to the cell time to generate a plurality of cells; The rms extraction module calculates cell rms for each of the plurality of cells,

The control unit compares the cell rms and rms thresholds, and if the cell rms is less than the rms threshold, classifies the cell as a divided cell. When the divided cells are continuously equal to or greater than the cut threshold, one of the consecutive divided cells is determined. Save the index value as cut index,

It provides a semantic unit sound source generation system for extracting the portion stored as the first cut index after the basic time of the sound source data to generate a semantic unit sound source.

The semantic unit partial sound source generation system may further include a finishing module, and the finishing module may process the front part of the semantic unit partial sound source and the fade out process of the rear part of the semantic unit partial sound source. desirable.

According to the present invention,

Loading a sound source data file into the semantic unit partial sound source generating system; storing the rms threshold value, the cut threshold value, the cell time and the basic time in the storage; calculating total rms of the sound source data by an rms extraction module; Generating a plurality of cells by dividing the sound source data according to cell time stored in a storage unit by a cell generation module; calculating cell rms for each cell by the rms extraction module;

The control unit compares the cell rms and rms thresholds, and if the cell rms is smaller than the rms threshold, classifies the corresponding cell as a divided cell. When the divided cells are continuously equal to or greater than the cut threshold, one of the consecutive divided cells is determined. Save the index value as cut index,

The method of claim 1, further comprising extracting a portion stored as the first cut index after the basic time from the sound source data to generate a semantic unit partial sound source.

According to the present invention, the user can listen to the main part of the music for a short time is advantageous to the music promotion and users can enjoy the important part of the music naturally without dissimilarity, the satisfaction is increased.

1 is a block diagram showing the configuration of a semantic unit partial sound source generation system according to the present invention;
2 is an illustration showing normalized rms energy (volume) over time (ms);
3 is an exemplary diagram illustrating rms threshold and cell division;
4 is an explanatory diagram for explaining a fade in process by the finishing module;
5 is an exemplary view showing an example of a web server using the semantic unit partial sound source generation system according to the present invention;
6 is a flowchart illustrating a method of generating a partial sound source of a semantic unit according to the present invention.

The configuration and operation of the embodiments of the present invention will be described in detail with reference to the accompanying drawings.

Referring to FIG. 1, the semantic unit partial sound source generation system 100 according to the present invention includes an rms extraction module 110, a cell generation module 130, a storage unit 150, a control unit 170, and a finishing module 190. It is configured to include). The semantic unit partial sound source generation system 100 may be provided in a server form.

When the sound source data file is loaded into the server, the rms extraction module 110 calculates the total rms of the sound source data. The total rms of the sound source data means an average energy value of the whole sound source data, and is calculated as follows.

Figure 112017062985912-pat00001

Here, x n means the amplitude of the sampled sound signal and n means the number of sampled data.

When the total rms is calculated, the rms extraction module 110 stores the value of 20% of the total rms as the rms threshold value in the storage unit 150.

For example, if the total rms normalized in FIG. 2 and FIG. 3 is calculated as 0.8, the rms extraction module stores 0.16, which is 20% of the total rms, as an rms threshold and stores it in the storage unit. In this embodiment, 20%, which is the critical ratio of total rms, is set as the rms threshold.

Meanwhile, cell time is stored in the storage unit, and the cell generation module 130 generates a plurality of cells by dividing sound source data according to cell time.

In this embodiment, the cell time is set to 100 ms, and the cell generation module 130 divides the sound source data into a plurality of cells in units of 100 ms.

The rms extraction module 110 calculates cell rms for each cell. The cell rms may be calculated in the same manner as the total rms, and the calculated cell rms may be stored in the storage unit.

The controller 170 compares the calculated plurality of cell rms and rms thresholds, and when the cell rms is smaller than the rms threshold, classifies the cell into divided cells and stores the divided cells. It counts how many consecutive cells there are.

In addition, the storage unit 150 stores the cut threshold value, and when the number of the divided cells continuously exceeds the cut threshold value, the controller designates one index value of the consecutive divided cells as the cut index. In this case, the controller preferably designates the index value of the divided cell located in the middle of the consecutive divided cells as a cut index and stores the index in the storage unit.

In the present embodiment, the cut threshold value is set to 5, and when the consecutive divided cells are extracted five or more times, the control unit designates the index value of the divided cells among them as the cut index and stores them in the storage unit.

The controller may change the cut threshold value based on the total playing time of the sound source data. Specifically, if the total playing time is 600 seconds, the total playing time is divided by 60 seconds which is the reference partial sound source time. Then, the resultant range of 10 to 10 minus 2 to 8 becomes the cut index valid range.

The controller increases the cut threshold when the number of cut indexes previously specified is larger than the cut index valid range, and decreases the cut threshold when the number of cut indexes previously specified is smaller than the cut index valid range. Therefore, the semantic unit partial sound source generation system can generate a semantic unit partial sound source having an appropriate play time.

Meanwhile, the rms threshold may be set again according to the number of split cells. That is, the control unit calculates the ratio of the divided cells in the total number of cells and resets the rms threshold value when not in a predetermined range (for example, 15% to 25%).

Specifically, if the ratio of split cells is 12%, it is not 15%, so the rms threshold is adjusted.The 3% p difference between the reference range 15% and the actual split cell 12% is rms threshold at all stages. In addition to the 20% used to calculate the value, 23% of the total rms is calculated as the rms threshold.

Based on the calculated new rms threshold value, the controller 170 classifies and stores newly divided cells. That is, the controller performs a new calculation to increase the ratio of the rms threshold when the ratio of the split cells is less than the reference range and to decrease the ratio of the rms threshold when the ratio of the split cells is greater than the reference range.

Meanwhile, the storage unit stores the basic time, and the controller extracts the portion designated as the first cut index by exceeding the basic time, and generates the semantic unit sound source from the beginning to the portion designated as the cut index.

After the cut index, the process of extracting the portion designated as the cut index by exceeding the basic time is repeated to generate a plurality of semantic unit sound sources. In the present embodiment, the basic time is set to 25 seconds, and the control unit generates a partial unit sound source based on the first cut index specified after 25 seconds.

When the semantic unit partial sound source is generated, the fade in and fade out processing is performed for each semantic unit partial sound source by the finishing module 190.

The fade in means increasing the volume values gradually, and the fade out means increasing the volume values.

Referring to FIG. 4, in the present invention, the fade in is performed within a preset fade in time, and in this embodiment, the fade in time is set to 1.4 seconds.

When the fade in is performed, the finishing module 190 ignores the divided cells smaller than the rms threshold based on the rms threshold and fades in only the cells that are greater than the rms threshold.

In addition, the fade-in is performed by adjusting the volume of each cell based on the rms threshold, thereby preventing the volume value from becoming smaller than the rms threshold. In addition, since the divided cells are all excluded from the fading in or the fading out, the fading in and the fading out can be performed at a very high speed.

That is, when a cell with an rms threshold value or more comes out according to the time axis, the sound source data is adjusted by subtracting 50% of the difference of the cell volume minus the rms threshold value from the volume of the cell again, and then changing the sound source data in the same manner. Let's do it. At this time, it is preferable to reduce the range of the volume of each cell to 40%, 30%, 20%, 10%, etc. along the time axis. Fade-out is done in the same way as fade-in.

According to the semantic unit sound source generation system according to the present invention, the user can transmit the background music when the user is viewing a web page, and at this time, the stream is automatically generated by generating a repetitive chorus or climax rather than the beginning of the music. Can give

In addition, when the user transmits another page at the request of the user, the background music may be changed and other music may be transmitted.

Referring to FIG. 5, the server may transmit background music along with a web page, and transmit a partial unit sound source according to the present invention. Preferably, the semantic unit partial sound source has a play time of about 1 minute.

A plurality of semantic unit partial sound sources generated based on one song correspond to each numeric button. Therefore, the user can listen to the partial sound source of each semantic unit by pressing the number button, and can listen to the main part of one music at a quick time, which is an effective means of promotion for the music provider, and the user can quickly configure the whole composition of a music in a short time. You can appreciate it.

Although the above has been described with reference to the embodiments of the present invention, those skilled in the art may variously modify and modify the present invention without departing from the spirit and scope of the present invention as set forth in the claims below. It will be appreciated that it can be changed.

100: semantic unit partial sound source generation system 110: rms extraction module
130: cell generation module 150: storage unit
170: control unit 190: finishing module

Claims (3)

A semantic unit partial sound source generation system for generating a plurality of meaningful partial sound sources using sound source data,
An rms extraction module for calculating total rms of the entire sound source data using the sound source data loaded into the semantic unit partial sound source generation system;
a storage unit for storing the rms threshold value, cut threshold value, cell time, and basic time;
A cell generation module for dividing the sound source data according to the cell time to generate a plurality of cells;
The rms extraction module calculates cell rms for each of the plurality of cells,
The control unit compares the cell rms and rms thresholds, and when the cell rms is smaller than the rms threshold, classifies the cell into divided cells. When the divided cells are continuously equal to or greater than the cut threshold, one of the consecutive divided cells is determined. Save the index value as cut index,
A semantic unit partial sound source generation system for extracting a portion stored as the first cut index after the basic time among the sound source data to generate a semantic unit partial sound source.
The method of claim 1,
The semantic unit partial sound source generation system further includes a finishing module, wherein the finishing module fades in the front portion of the semantic unit partial sound source and fades out the rear portion of the semantic unit partial sound source. The unit sound source generation system.
Loading a sound source data file into the semantic unit partial sound source generation system;
storing the rms threshold value, the cut threshold value, the cell time and the basic time in the storage;
calculating total rms of the sound source data by an rms extraction module;
Generating a plurality of cells by dividing the sound source data according to cell time stored in a storage unit by a cell generation module;
calculating cell rms for each cell by the rms extraction module;
The control unit compares the cell rms and rms thresholds, and if the cell rms is smaller than the rms threshold, classifies the corresponding cell as a divided cell. When the divided cells are continuously equal to or greater than the cut threshold, one of the consecutive divided cells is determined. Save the index value as cut index,
And extracting a portion stored as the first cut index from the sound source data as the first cut index to generate a semantic unit partial sound source.
KR1020170083339A 2017-06-30 2017-06-30 producing system for partial sound data made by meaningful units and producing method for partial sound data made by meaningful units using the same KR102028164B1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1020170083339A KR102028164B1 (en) 2017-06-30 2017-06-30 producing system for partial sound data made by meaningful units and producing method for partial sound data made by meaningful units using the same

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020170083339A KR102028164B1 (en) 2017-06-30 2017-06-30 producing system for partial sound data made by meaningful units and producing method for partial sound data made by meaningful units using the same

Publications (2)

Publication Number Publication Date
KR20190002978A KR20190002978A (en) 2019-01-09
KR102028164B1 true KR102028164B1 (en) 2019-10-08

Family

ID=65017176

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020170083339A KR102028164B1 (en) 2017-06-30 2017-06-30 producing system for partial sound data made by meaningful units and producing method for partial sound data made by meaningful units using the same

Country Status (1)

Country Link
KR (1) KR102028164B1 (en)

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101002028B1 (en) * 2008-09-04 2010-12-16 고려대학교 산학협력단 System and Method of voice activity detection using microphone and temporal-spatial information, and Recording medium using it

Also Published As

Publication number Publication date
KR20190002978A (en) 2019-01-09

Similar Documents

Publication Publication Date Title
US20210005222A1 (en) Looping audio-visual file generation based on audio and video analysis
EP2659485B1 (en) Semantic audio track mixer
US8831953B2 (en) Systems and methods for filtering objectionable content
US9326082B2 (en) Song transition effects for browsing
CN105161116B (en) The determination method and device of multimedia file climax segment
CN108268530B (en) Lyric score generation method and related device
US20120101606A1 (en) Information processing apparatus, content data reconfiguring method and program
EP3215961A1 (en) A system and method of classifying, comparing and ordering songs in a playlist to smooth the overall playback and listening experience
JP2014006480A (en) Information processing apparatus, information processing method, and program
CN104715760A (en) KTV song matching analyzing method and system
CN105718486B (en) Online humming retrieval method and system
CN105047203A (en) Audio processing method, device and terminal
JP2003177784A (en) Method and device for extracting sound turning point, method and device for sound reproducing, sound reproducing system, sound delivery system, information providing device, sound signal editing device, recording medium for sound turning point extraction method program, recording medium for sound reproducing method program, recording medium for sound signal editing method program, sound turning point extraction method program, sound reproducing method program, and sound signal editing method program
KR20190108027A (en) Method, system and non-transitory computer-readable recording medium for generating music associated with a video
EP3552200B1 (en) Audio variations editing using tempo-range metadata.
CN104978377A (en) Multimedia data processing method, multimedia data processing device and terminal
CN110797001B (en) Method and device for generating voice audio of electronic book and readable storage medium
KR102028164B1 (en) producing system for partial sound data made by meaningful units and producing method for partial sound data made by meaningful units using the same
CN103871433A (en) Control method and electronic device
CN108022604A (en) The method and apparatus of amended record audio content
CN104426915A (en) Method, server and system for realizing online music subsection downloading
KR20070048484A (en) Apparatus and method for classification of signal features of music files, and apparatus and method for automatic-making playing list using the same
US10847129B2 (en) Data format
CN106448710B (en) A kind of calibration method and music player devices of music play parameters
KR20010096297A (en) System of advertisement by union of digital sound and advertisement and thereof method

Legal Events

Date Code Title Description
A201 Request for examination
E701 Decision to grant or registration of patent right