KR102028164B1 - producing system for partial sound data made by meaningful units and producing method for partial sound data made by meaningful units using the same - Google Patents
- Publication number
- KR102028164B1
- Authority
- KR
- South Korea
- Prior art keywords
- sound source
- rms
- cell
- source data
- partial sound
- Prior art date
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/68—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/683—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Library & Information Science (AREA)
- Theoretical Computer Science (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Reverberation, Karaoke And Other Acoustics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The present invention relates to a semantic unit partial sound source generation system that generates a plurality of short partial sound sources by cutting one piece of sound source data at portions that are comfortable for a user to listen to.
To this end, the present invention provides a semantic unit partial sound source generation system that generates a plurality of meaningful partial sound sources from sound source data, the system comprising:
An rms extraction module that calculates the total rms of the entire sound source data loaded into the system; a storage unit that stores an rms threshold, a cut threshold, a cell time, and a basic time; and a cell generation module that divides the sound source data according to the cell time to generate a plurality of cells. The rms extraction module calculates a cell rms for each of the plurality of cells, and a control unit compares each cell rms with the rms threshold and classifies the cell as a divided cell if its cell rms is smaller than the rms threshold. When the number of consecutive divided cells is equal to or greater than the cut threshold, the index value of one of those consecutive divided cells is stored as a cut index, and the sound source data is cut at the first cut index stored after the basic time to generate a semantic unit partial sound source.
Therefore, according to the present invention, users can hear the main parts of a piece of music in a short time, which benefits music promotion, and can enjoy the important parts of the music naturally, without any sense of incongruity, which increases their satisfaction.
Description
The present invention relates to a semantic unit partial sound source generation system and a method of generating a semantic unit partial sound source using the same. Specifically, it relates to a system that generates a plurality of short partial sound sources by cutting a sound source at portions that are comfortable for a user to listen to.
Now that the era of listening to music on physical storage media such as CDs and LPs has passed, many people listen to music using sound source data files such as MP3s. These files can be downloaded at any time through a terminal such as a smartphone or a PC and played immediately, which makes them convenient to use.
Such sound source data files may be downloaded and used personally, but they are also used online as background music on blogs and social networking services. Because users move between web pages quickly, the page often changes before the music reaches its climax, unless the user is there specifically to listen to music. As a result, only the opening portion is heard: the music is not properly promoted, and users frequently miss its important parts.
Korean Patent No. 10-1504522 describes a method in which a database for music search is built by dividing and storing a plurality of melodies at the points where the lyrics break, and a melody similar to a query received from the user is detected among the stored melodies.
However, this method cannot be applied to music without lyrics, because the sound source must include lyrics, and it is not fully automated, because the node information and lyric information must be stored in advance.
The present invention has been made to solve the above problems. Its object is to generate a plurality of semantic unit partial sound sources so that the user can listen to the important parts of the music naturally, without any sense of incongruity, regardless of whether the music has lyrics.
A further object is to provide a system that generates a plurality of semantic unit partial sound sources by analyzing the sound source data itself, without any additional data input.
To this end, the present invention provides a semantic unit partial sound source generation system for generating a plurality of meaningful partial sound sources from sound source data, comprising:
An rms extraction module that calculates the total rms of the entire sound source data loaded into the system; a storage unit that stores an rms threshold, a cut threshold, a cell time, and a basic time; and a cell generation module that divides the sound source data according to the cell time to generate a plurality of cells, wherein the rms extraction module calculates a cell rms for each of the plurality of cells,
The control unit compares each cell rms with the rms threshold and classifies the cell as a divided cell if its cell rms is smaller than the rms threshold; when the number of consecutive divided cells is equal to or greater than the cut threshold, it stores the index value of one of those consecutive divided cells as a cut index,
And the system cuts the sound source data at the first cut index stored after the basic time to generate a semantic unit partial sound source.
The semantic unit partial sound source generation system may further include a finishing module, and the finishing module preferably fades in the front part of the semantic unit partial sound source and fades out the rear part of the semantic unit partial sound source.
The present invention also provides a method of generating a semantic unit partial sound source, comprising:
Loading a sound source data file into the semantic unit partial sound source generation system; storing the rms threshold, the cut threshold, the cell time, and the basic time in the storage unit; calculating the total rms of the sound source data by the rms extraction module; generating a plurality of cells by the cell generation module, which divides the sound source data according to the cell time stored in the storage unit; calculating a cell rms for each cell by the rms extraction module;
Comparing, by the control unit, each cell rms with the rms threshold, classifying the cell as a divided cell if its cell rms is smaller than the rms threshold, and, when the number of consecutive divided cells is equal to or greater than the cut threshold, storing the index value of one of those consecutive divided cells as a cut index;
And extracting, from the sound source data, the portion stored as the first cut index to generate a semantic unit partial sound source.
According to the present invention, users can hear the main parts of a piece of music in a short time, which benefits music promotion, and can enjoy the important parts of the music naturally, without any sense of incongruity, which increases their satisfaction.
FIG. 1 is a block diagram showing the configuration of a semantic unit partial sound source generation system according to the present invention;
FIG. 2 is an illustration showing normalized rms energy (volume) over time (ms);
FIG. 3 is an exemplary diagram illustrating the rms threshold and cell division;
FIG. 4 is an explanatory diagram of the fade in process performed by the finishing module;
FIG. 5 is an exemplary view showing a web server that uses the semantic unit partial sound source generation system according to the present invention;
FIG. 6 is a flowchart illustrating a method of generating a semantic unit partial sound source according to the present invention.
The configuration and operation of the embodiments of the present invention will be described in detail with reference to the accompanying drawings.
Referring to FIG. 1, the semantic unit partial sound
When the sound source data file is loaded into the server, the
Here, x n means the amplitude of the sampled sound signal and n means the number of sampled data.
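The equation that this sentence refers to did not survive extraction. Assuming the standard root-mean-square definition, with x_i denoting the i-th sampled amplitude and n the number of samples (the index i is introduced here only for clarity and is not in the original), the total rms would read:

```latex
\mathrm{rms}_{\mathrm{total}} = \sqrt{\frac{1}{n}\sum_{i=1}^{n} x_i^{2}}
```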
When the total rms is calculated, the
For example, if the normalized total rms in FIGS. 2 and 3 is calculated as 0.8, the rms extraction module stores 0.16, which is 20% of the total rms, in the storage unit as the rms threshold. In this embodiment, the critical ratio applied to the total rms to obtain the rms threshold is set to 20%.
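The arithmetic in this example can be sketched as follows. This is a minimal illustration only: it assumes the standard root-mean-square definition, and the function names `total_rms` and `rms_threshold` and the toy sample values are hypothetical, not taken from the patent.

```python
import math

def total_rms(samples):
    """Root mean square of all amplitude samples (standard definition, assumed)."""
    if not samples:
        raise ValueError("samples must not be empty")
    return math.sqrt(sum(x * x for x in samples) / len(samples))

def rms_threshold(total, critical_ratio=0.20):
    """rms threshold as a fixed share of the total rms (20% in this embodiment)."""
    return total * critical_ratio

# Toy signal (hypothetical) whose rms is 0.8, matching the example in the text.
samples = [0.8, -0.8, 0.8, -0.8]
total = total_rms(samples)
threshold = rms_threshold(total)
print(round(total, 2), round(threshold, 2))
```

With a total rms of 0.8, the threshold comes out at about 0.16, as in the embodiment.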
Meanwhile, cell time is stored in the storage unit, and the
In this embodiment, the cell time is set to 100 ms, and the
The
The
In addition, the
In the present embodiment, the cut threshold is set to 5: when five or more consecutive divided cells are found, the control unit designates the index value of one of those divided cells as the cut index and stores it in the storage unit.
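The cell division and cut index detection described in this embodiment can be sketched as below. This is a sketch under stated assumptions, not the patented implementation: the function names are hypothetical, and because the text only says that the index of "one of" the consecutive divided cells is stored, the middle cell of each run is chosen here as an assumption.

```python
import math

def cell_rms_values(samples, sample_rate, cell_ms=100):
    """Split samples into fixed-length cells and return the rms of each cell."""
    cell_len = max(1, sample_rate * cell_ms // 1000)
    cells = [samples[i:i + cell_len] for i in range(0, len(samples), cell_len)]
    return [math.sqrt(sum(x * x for x in c) / len(c)) for c in cells]

def find_cut_indexes(cell_rms, rms_threshold, cut_threshold=5):
    """Return one cell index per run of >= cut_threshold consecutive divided cells."""
    cut_indexes = []
    run_start = None
    # A sentinel "loud" value closes a run that reaches the end of the list.
    for i, value in enumerate(list(cell_rms) + [float("inf")]):
        if value < rms_threshold:           # divided cell
            if run_start is None:
                run_start = i
        elif run_start is not None:
            run_len = i - run_start
            if run_len >= cut_threshold:
                # Assumption: store the middle cell of the run as the cut index.
                cut_indexes.append(run_start + run_len // 2)
            run_start = None
    return cut_indexes

# Hypothetical cell rms values: a quiet run of 5 cells, then a quiet run of 4.
example = [0.5] * 3 + [0.1] * 5 + [0.5] * 2 + [0.1] * 4 + [0.5]
cuts = find_cut_indexes(example, rms_threshold=0.16, cut_threshold=5)
```

Only the first quiet run is long enough to qualify, so a single cut index is produced for it; the four-cell run is ignored.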
The controller may change the cut threshold value based on the total playing time of the sound source data. Specifically, if the total playing time is 600 seconds, the total playing time is divided by 60 seconds which is the reference partial sound source time. Then, the resultant range of 10 to 10
The controller increases the cut threshold when the number of cut indexes previously specified is larger than the cut index valid range, and decreases the cut threshold when the number of cut indexes previously specified is smaller than the cut index valid range. Therefore, the semantic unit partial sound source generation system can generate a semantic unit partial sound source having an appropriate play time.
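The adjustment loop described here can be sketched as follows. The valid range itself is passed in as a parameter because its derivation is not fully recoverable from the text; `valid_range`, `max_steps`, and both function names are hypothetical, and the one-step-at-a-time update is an assumption.

```python
def count_cut_indexes(cell_rms, rms_threshold, cut_threshold):
    """Count runs of at least cut_threshold consecutive cells below rms_threshold."""
    count, run = 0, 0
    for value in cell_rms:
        if value < rms_threshold:
            run += 1
        else:
            if run >= cut_threshold:
                count += 1
            run = 0
    if run >= cut_threshold:     # a run that reaches the end of the data
        count += 1
    return count

def tune_cut_threshold(cell_rms, rms_threshold, cut_threshold, valid_range, max_steps=50):
    """Raise or lower cut_threshold until the cut index count fits valid_range."""
    low, high = valid_range
    for _ in range(max_steps):
        n = count_cut_indexes(cell_rms, rms_threshold, cut_threshold)
        if n > high:
            cut_threshold += 1   # too many cuts: require longer quiet runs
        elif n < low and cut_threshold > 1:
            cut_threshold -= 1   # too few cuts: accept shorter quiet runs
        else:
            break
    return cut_threshold

# Hypothetical data: quiet runs of 2, 3, and 6 cells separated by loud cells.
cells = [0.5] + [0.1] * 2 + [0.5] + [0.1] * 3 + [0.5] + [0.1] * 6 + [0.5]
tuned = tune_cut_threshold(cells, 0.16, cut_threshold=5, valid_range=(2, 3))
```

The `max_steps` bound is there because the count can jump past the valid range in a single step, in which case the loop would otherwise oscillate.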
Meanwhile, the rms threshold may be reset according to the number of divided cells. That is, the control unit calculates the ratio of divided cells to the total number of cells and resets the rms threshold when that ratio falls outside a predetermined range (for example, 15% to 25%).
Specifically, if the ratio of divided cells is 12%, it is below the 15% lower bound of the reference range, so the rms threshold is adjusted. The 3 percentage point difference between the 15% lower bound and the actual 12% is added to the 20% used to calculate the rms threshold in the previous step, so that 23% of the total rms is calculated as the new rms threshold.
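The adjustment in this example can be sketched as a small function. The text only describes the case where the ratio falls below the range; the mirrored case for a ratio above the range is an assumption, and the function name `adjust_threshold_ratio` is hypothetical.

```python
def adjust_threshold_ratio(divided_ratio, current_ratio=0.20, low=0.15, high=0.25):
    """Return the share of total rms to use as the new rms threshold."""
    if divided_ratio < low:
        # Described in the text: add the shortfall to the current ratio.
        return current_ratio + (low - divided_ratio)
    if divided_ratio > high:
        # Assumption (not stated in the text): mirror the rule above.
        return current_ratio - (divided_ratio - high)
    return current_ratio

new_ratio = adjust_threshold_ratio(0.12)   # 12% of cells are divided cells
new_threshold = 0.8 * new_ratio            # total rms of 0.8, as in the earlier example
print(round(new_ratio, 2), round(new_threshold, 3))
```

Raising the threshold moves more cells below it, which pushes the divided cell ratio back toward the target range.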
Based on the calculated new rms threshold value, the
Meanwhile, the storage unit stores the basic time. The control unit finds the first portion designated as a cut index after the basic time has elapsed and generates a semantic unit partial sound source running from the beginning of the sound source to that cut index.
Starting from that cut index, the same process of finding the first cut index after the basic time has elapsed is repeated to generate a plurality of semantic unit partial sound sources. In the present embodiment, the basic time is set to 25 seconds, and the control unit generates each semantic unit partial sound source based on the first cut index found after 25 seconds.
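The repeated extraction can be sketched as follows, working in cell indexes. This is an illustration under assumptions: the function name is hypothetical, "exceeding the basic time" is read as strictly greater than, and any audio remaining after the last usable cut index is simply not emitted because the text does not say how it is handled.

```python
def split_by_basic_time(cut_indexes, cell_ms=100, basic_ms=25_000):
    """Return (start_cell, end_cell) pairs, each ending at the first cut index
    that lies more than basic_ms after the start of the segment."""
    segments = []
    start = 0
    for idx in sorted(cut_indexes):
        if (idx - start) * cell_ms > basic_ms:
            segments.append((start, idx))
            start = idx          # the next segment begins at this cut index
    return segments

# Hypothetical cut indexes at 10 s, 26 s, 30 s, 52 s, and 80 s (100 ms cells).
segments = split_by_basic_time([100, 260, 300, 520, 800])
```

The cut indexes at 10 s and 30 s are skipped because they fall within 25 seconds of the start of their segment.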
When the semantic unit partial sound source is generated, the fade in and fade out processing is performed for each semantic unit partial sound source by the
Fade in means gradually increasing the volume, and fade out means gradually decreasing the volume.
Referring to FIG. 4, in the present invention, the fade in is performed within a preset fade in time, and in this embodiment, the fade in time is set to 1.4 seconds.
When the fade in is performed, the
In addition, the fade in is performed by adjusting the volume of each cell relative to the rms threshold, which prevents the volume from falling below the rms threshold. Moreover, since all divided cells are excluded from the fade in and the fade out, both can be performed very quickly.
That is, when a cell whose volume is equal to or greater than the rms threshold appears along the time axis, 50% of the difference between the cell volume and the rms threshold is subtracted from the volume of that cell and the sound source data is changed accordingly; subsequent cells are then adjusted in the same manner. The proportion by which each cell's volume is reduced is preferably decreased along the time axis to 40%, 30%, 20%, 10%, and so on. Fade out is performed in the same way as fade in.
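A sketch of this fade in on per-cell volume values is given below. It is illustrative only and rests on several assumptions: the function name and parameters are hypothetical, a divided cell does not consume a reduction step, and cells are left unchanged once the listed reduction steps are used up. The 1.4 second fade in time and 100 ms cell time are the values given for this embodiment.

```python
def fade_in_volumes(cell_volumes, rms_threshold, cell_ms=100, fade_ms=1400,
                    reductions=(0.5, 0.4, 0.3, 0.2, 0.1)):
    """Return cell volumes with the fade in applied to the leading fade_ms."""
    n_fade = min(len(cell_volumes), fade_ms // cell_ms)
    out = list(cell_volumes)
    step = 0
    for i in range(n_fade):
        if step >= len(reductions):
            break                # assumption: no further change after the last step
        if out[i] < rms_threshold:
            continue             # divided cells are excluded from the fade
        # Subtract a share of (volume - threshold); the result stays >= threshold.
        out[i] -= reductions[step] * (out[i] - rms_threshold)
        step += 1
    return out

# Hypothetical cell volumes; the second cell is a divided cell and is skipped.
faded = fade_in_volumes([0.8, 0.1, 0.8, 0.8], rms_threshold=0.16)
```

Because each adjustment removes only a fraction of the gap between the cell volume and the rms threshold, no adjusted cell can drop below the threshold.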
With the semantic unit partial sound source generation system according to the present invention, background music can be transmitted while the user is viewing a web page, and a stream that starts at a repeated chorus or climax, rather than at the beginning of the music, can be provided automatically.
In addition, when another page is transmitted at the user's request, the background music may be changed and different music may be transmitted.
Referring to FIG. 5, the server may transmit background music along with a web page, and the background music it transmits may be a semantic unit partial sound source according to the present invention. Preferably, the semantic unit partial sound source has a play time of about 1 minute.
The plurality of semantic unit partial sound sources generated from one song correspond to respective number buttons. The user can therefore listen to each semantic unit partial sound source by pressing its number button and hear the main parts of a piece of music quickly. This is an effective means of promotion for the music provider, and it lets the user appreciate the overall composition of a piece of music in a short time.
Although the present invention has been described above with reference to its embodiments, those skilled in the art will appreciate that it may be modified and changed in various ways without departing from the spirit and scope of the invention as set forth in the claims below.
100: semantic unit partial sound source generation system
110: rms extraction module
130: cell generation module
150: storage unit
170: control unit
190: finishing module
Claims (3)
An rms extraction module for calculating total rms of the entire sound source data using the sound source data loaded into the semantic unit partial sound source generation system;
a storage unit for storing the rms threshold value, cut threshold value, cell time, and basic time;
A cell generation module for dividing the sound source data according to the cell time to generate a plurality of cells;
The rms extraction module calculates cell rms for each of the plurality of cells,
The control unit compares each cell rms with the rms threshold and, when the cell rms is smaller than the rms threshold, classifies the cell as a divided cell; when the number of consecutive divided cells is equal to or greater than the cut threshold, it stores the index value of one of those consecutive divided cells as a cut index,
A semantic unit partial sound source generation system that extracts the portion stored as the first cut index after the basic time from the sound source data to generate a semantic unit partial sound source.
The semantic unit partial sound source generation system, further including a finishing module, wherein the finishing module fades in the front portion of the semantic unit partial sound source and fades out the rear portion of the semantic unit partial sound source.
storing the rms threshold value, the cut threshold value, the cell time and the basic time in the storage;
calculating total rms of the sound source data by an rms extraction module;
Generating a plurality of cells by dividing the sound source data according to cell time stored in a storage unit by a cell generation module;
calculating cell rms for each cell by the rms extraction module;
The control unit compares each cell rms with the rms threshold and, if the cell rms is smaller than the rms threshold, classifies the corresponding cell as a divided cell; when the number of consecutive divided cells is equal to or greater than the cut threshold, it stores the index value of one of those consecutive divided cells as a cut index,
And extracting, from the sound source data, the portion stored as the first cut index to generate a semantic unit partial sound source.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020170083339A KR102028164B1 (en) | 2017-06-30 | 2017-06-30 | producing system for partial sound data made by meaningful units and producing method for partial sound data made by meaningful units using the same |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020170083339A KR102028164B1 (en) | 2017-06-30 | 2017-06-30 | producing system for partial sound data made by meaningful units and producing method for partial sound data made by meaningful units using the same |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20190002978A (en) | 2019-01-09 |
KR102028164B1 (en) | 2019-10-08 |
Family
ID=65017176
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020170083339A KR102028164B1 (en) | 2017-06-30 | 2017-06-30 | producing system for partial sound data made by meaningful units and producing method for partial sound data made by meaningful units using the same |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR102028164B1 (en) |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101002028B1 (en) * | 2008-09-04 | 2010-12-16 | 고려대학교 산학협력단 | System and Method of voice activity detection using microphone and temporal-spatial information, and Recording medium using it |
- 2017-06-30: KR application KR1020170083339A, patent KR102028164B1 (en), active, IP Right Grant
Also Published As
Publication number | Publication date |
---|---|
KR20190002978A (en) | 2019-01-09 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination | ||
E701 | Decision to grant or registration of patent right |