WO2017098743A1 - Service management system, computer readable storage medium, and service management method - Google Patents


Info

Publication number
WO2017098743A1
WO2017098743A1 (PCT/JP2016/067122, JP2016067122W)
Authority
WO
WIPO (PCT)
Prior art keywords
sound
unit
music
volume
reproduction
Prior art date
Application number
PCT/JP2016/067122
Other languages
French (fr)
Japanese (ja)
Inventor
Kenichiro Hori (健一郎 堀)
Original Assignee
Kenichiro Hori (健一郎 堀)
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from JP2015239691A external-priority patent/JP2016110154A/en
Application filed by Kenichiro Hori (健一郎 堀)
Publication of WO2017098743A1 publication Critical patent/WO2017098743A1/en

Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06Q - INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00 - Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10 - Services
    • G - PHYSICS
    • G10 - MUSICAL INSTRUMENTS; ACOUSTICS
    • G10K - SOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K15/00 - Acoustics not otherwise provided for
    • G10K15/02 - Synthesis of acoustic waves
    • G - PHYSICS
    • G10 - MUSICAL INSTRUMENTS; ACOUSTICS
    • G10K - SOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K15/00 - Acoustics not otherwise provided for
    • G10K15/04 - Sound-producing devices
    • G - PHYSICS
    • G10 - MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L - SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 - Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 - Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272 - Voice signal separating

Definitions

  • the present invention relates to a service management system, a computer-readable storage medium, and a service management method for managing a service that provides music.
  • a karaoke service that plays accompaniment sounds is known (see, for example, Patent Document 1).
  • the sound source used in a karaoke service is not the music master, which contains mixed sound data including accompaniment sounds and vocals, but data newly created for karaoke that consists only of accompaniment sounds.
  • a service management system for solving the above problem treats a sound that includes an adjustment target sound and sounds other than the adjustment target sound as a mixed sound, and includes: a master storage unit storing a master file containing data for reproducing the mixed sound; a resource storage unit storing resources for providing sound adjustment content to a user terminal; a playback record storage unit storing the number of times the sound adjustment content has been played by the user terminal; and a control unit connected to the user terminal.
  • the control unit specifies the range of the adjustment target sound in the master file stored in the master storage unit, generates, based on the specified range, a mask file for individually adjusting the volume of the adjustment target sound in the mixed sound, provides the sound adjustment content including the mask file to the user terminal, stores the number of playbacks in the playback record storage unit, and calculates a distribution amount to be distributed to each distribution target person of the adjustment target sound based on the resources stored in the resource storage unit and the number of playbacks stored in the playback record storage unit.
  • a computer-readable storage medium for solving the above problem stores a program for performing service management using a service management system that treats a sound including an adjustment target sound and sounds other than the adjustment target sound as a mixed sound, and that includes: a master storage unit storing a master file containing data for reproducing the mixed sound; a resource storage unit storing resources for providing sound adjustment content to a user terminal; a playback record storage unit storing the number of times the sound adjustment content has been played by the user terminal; and a control unit connected to the user terminal.
  • when executing the program, the control unit specifies the range of the adjustment target sound in the master file stored in the master storage unit, generates, based on the specified range, a mask file for individually adjusting the volume of the adjustment target sound in the mixed sound, provides the sound adjustment content including the mask file to the user terminal, stores the number of playbacks in the playback record storage unit, and calculates a distribution amount to be distributed to each distribution target person of the adjustment target sound based on the resources stored in the resource storage unit and the number of playbacks stored in the playback record storage unit.
  • a service management method for solving the above problem performs service management using a service management system that treats a sound including an adjustment target sound and sounds other than the adjustment target sound as a mixed sound, and that includes: a master storage unit storing a master file containing data for reproducing the mixed sound; a resource storage unit storing resources for providing sound adjustment content to a user terminal; a playback record storage unit storing the number of times the sound adjustment content has been played by the user terminal; and a control unit connected to the user terminal.
  • in the method, the control unit specifies the range of the adjustment target sound in the master file stored in the master storage unit, generates, based on the specified range, a mask file for individually adjusting the volume of the adjustment target sound in the mixed sound, provides the sound adjustment content including the mask file to the user terminal, stores the number of playbacks in the playback record storage unit, and calculates a distribution amount to be distributed to each distribution target person of the adjustment target sound based on the resources stored in the resource storage unit and the number of playbacks stored in the playback record storage unit.
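The distribution calculation described in the claims above can be sketched as follows. This is a minimal illustration rather than the patented implementation; the function name and the pro-rata rule (resources divided among songs by stored playback counts, then split among payees by per-payee distribution ratios) are assumptions based on the claim text.

```python
# Hypothetical sketch: resources (e.g. collected usage fees) are split among
# the distribution target persons of each song based on stored playback
# counts and per-payee distribution ratios. Names and structure are
# illustrative assumptions, not the actual system.

def calculate_distribution(resources, playback_counts, distribution_ratios):
    """Return {(song_code, payee): amount} for the given resource pool.

    resources           -- total funds to distribute
    playback_counts     -- {song_code: number_of_playbacks}
    distribution_ratios -- {song_code: {payee: ratio}}
    """
    total_plays = sum(playback_counts.values())
    amounts = {}
    for song, plays in playback_counts.items():
        # Each song's share of the pool is proportional to its playback count.
        song_share = resources * plays / total_plays if total_plays else 0
        for payee, ratio in distribution_ratios.get(song, {}).items():
            amounts[(song, payee)] = song_share * ratio
    return amounts

counts = {"S001": 300, "S002": 100}
ratios = {
    "S001": {"record_company": 0.6, "performer": 0.4},
    "S002": {"record_company": 1.0},
}
result = calculate_distribution(10000, counts, ratios)
# S001's 7500 share splits 60/40 between record company and performer.
```

The same shape would apply when the distribution condition is a ratio of the charging fee rather than of a pooled resource; only the `resources` input changes.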
  • brief description of the drawings: a configuration diagram showing a first embodiment of the service management system; a block diagram of the data stored in the master storage unit; a graph showing the range of a mask file in the mask generation process; a flowchart showing the procedure of the synchronized lyrics process; figures showing the procedure of processing in group playback; a figure showing the procedure of processing by the service management method; and a graph showing the range in fixed mode.
  • the service management system provides a master karaoke service that is an example of a content providing service.
  • the service management system uses a music master provided by a record company or the like, and provides a mixed sound in which the volume of vocals is individually adjusted with respect to the volume of other sounds.
  • Vocal is a singing voice sung by a singer who is an example of a performer, and is an example of a sound to be adjusted.
  • the music master disc contains data for playing mixed sounds including vocals and other sounds.
  • the music master may be a music master file, or may be a box format file including a music master file and a video master file, such as a promotion video.
  • the content server 10 is a computer system and provides a master file.
  • the master file is content that provides data for reproducing the mixed sound.
  • the content is, for example, a digital file such as a promotion video created in the mp4 format.
  • the person-in-charge terminal 30 is a computer terminal and is used for editing data for reproducing music.
  • the editing of music is adjustment of the timing at which telops such as lyrics are displayed and vocal mask adjustment.
  • the person-in-charge terminal 30 includes a control unit, an output unit, and an input unit.
  • the output unit is configured by a display, for example, and outputs various information.
  • the input unit is configured by a keyboard and a pointing device, and acquires an instruction input by a person in charge.
  • the user terminal 40 is a computer terminal and is used by a user who uses a karaoke service.
  • the user terminal 40 is either a business (BtoB) device or a consumer (BtoC) device.
  • a business device is a computer terminal installed in a karaoke box or the like; consumer devices are smartphones and personal computers owned by individual users.
  • the user terminal 40 downloads from the management server 20 a karaoke file, including a music file and a video file, as an example of sound adjustment content, that is, content for adjusting the volume of the adjustment target sound.
  • the karaoke file includes a vocal mask file as an example of a mask file.
  • the user terminal 40 includes a music playback unit 41 and a vocal volume adjustment unit 42.
  • the music playback unit 41 plays back music and video using the downloaded music file and video file.
  • the vocal volume adjustment unit 42 performs a process of individually adjusting the volume of the vocal in the mixed sound based on the vocal mask file.
  • the process of adjusting the volume of the vocal includes a process of deleting the vocal.
  • the vocal mask file is a file for specifying the vocal range included in the music.
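The individual adjustment described above could look like the following sketch: for each time index, the gain of the frequency bins inside the masked vocal range is scaled while other sounds are left untouched. The data layout (frame index plus a low/high frequency pair) is an assumption for illustration, not the actual vocal mask file format.

```python
# Illustrative sketch of a user terminal applying a vocal mask file to adjust
# the vocal volume individually. The spectrogram/mask representation is an
# assumption; the real file only associates time indexes with vocal ranges.
import numpy as np

def apply_vocal_mask(spectrogram, freqs, mask_entries, vocal_gain):
    """spectrogram  -- magnitude array of shape (frames, bins)
    freqs        -- center frequency (Hz) of each bin
    mask_entries -- list of (frame_index, low_hz, high_hz) from the mask file
    vocal_gain   -- 0.0 deletes the vocal, 1.0 leaves it unchanged
    """
    out = spectrogram.copy()
    for frame, low, high in mask_entries:
        bins = (freqs >= low) & (freqs <= high)  # bins inside the vocal range
        out[frame, bins] *= vocal_gain
    return out

freqs = np.array([100.0, 300.0, 1000.0, 3000.0])
spec = np.ones((2, 4))
masked = apply_vocal_mask(spec, freqs, [(0, 200.0, 2000.0)], vocal_gain=0.0)
# frame 0 keeps only the 100 Hz and 3000 Hz bins; frame 1 is unchanged.
```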
  • the management server 20 is a computer system and is used for providing a karaoke file by a karaoke service provider.
  • the management server 20 includes a control unit 21, a master storage unit 22, a lyrics information storage unit 23, a music storage unit 24, a reproduction record storage unit 25, a resource storage unit 26, a distribution management information storage unit 27, and a distribution record information storage unit 28.
  • the control unit 21 includes control means including a CPU, a RAM, a ROM, and the like, and performs each process of a management stage, a vocal processing stage, a lyrics management stage, an editing stage, a reproduction management stage, and a distribution management stage.
  • the control unit 21 functions as a management unit 210, a vocal processing unit 211, a lyrics management unit 212, an editor 213, a reproduction management unit 214, and a distribution management unit 215 by executing a service management program.
  • the management unit 210 executes processing for storing the master file acquired from the content server 10 in the master storage unit 22.
  • the vocal processing unit 211 executes a vocal mask file generation process for identifying the vocal range and adjusting the vocal volume.
  • the lyrics management unit 212 executes a process of adding lyrics to the file so that the timing at which the lyrics are displayed matches the timing at which the vocals are reproduced.
  • the editor 213 executes processing for adjusting the vocal mask and processing for supporting adjustment of the addition position of the lyrics.
  • the editor 213 is connected to the person-in-charge terminal 30 used by the person in charge of editing.
  • the person in charge of the editor uses the person-in-charge terminal 30 to confirm the setting of the vocal mask and the position where the lyrics are added.
  • the editing person in charge adjusts the setting position of the vocal mask and the addition position of the lyrics using the person-in-charge terminal 30.
  • the setting position of the vocal mask is a sound range masked by the vocal mask and a reproduction time for which the vocal mask is set.
  • the playback management unit 214 executes processing for managing the playback status of the music on the user terminal 40.
  • the reproduction management unit 214 holds information for calculating the reproduction determination time.
  • the reproduction determination time is a time for determining whether or not the music has been reproduced.
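A minimal sketch of the playback-determination rule just described: a song counts as "played" only if it was reproduced for at least the reproduction determination time. The specific threshold rule (a fraction of the performance time) is an assumption for illustration; the patent only says the time is used to determine whether the music has been reproduced.

```python
# Hypothetical check used by the playback management unit: playback is
# counted only if it lasted at least the reproduction determination time.
# The 50%-of-performance-time default is an illustrative assumption.

def was_played(played_seconds, performance_seconds, threshold_ratio=0.5):
    """Return True if playback lasted at least the determination time."""
    determination_time = performance_seconds * threshold_ratio
    return played_seconds >= determination_time

# A 240-second song played for 90 seconds does not count; 150 seconds does.
```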
  • the distribution management unit 215 executes a process of distributing the resources to each distribution target person according to the reproduction status of the music.
  • the distribution target person is, for example, a copyright management organization (for example, JASRAC (registered trademark)), a performer, or a record company.
  • the distribution management unit 215 holds data regarding a distribution rate for determining a distribution amount for each party. Next, the configuration of information stored in each storage unit will be described.
  • the master storage unit 22 stores a master file 220 such as a promotion video.
  • the management server 20 acquires the master file 220 from the content server 10 and stores it.
  • the master file 220 includes data related to music codes, attributes, and compressed moving images.
  • the music code data area stores data relating to an identifier for specifying a music.
  • the attribute data area stores data related to the attributes of music.
  • the attribute of the music is information relating to the record company, performer, composer, songwriter, and the like.
  • the compressed moving image data area stores compressed data for music and video.
  • the compressed data is created in the mp4 format, for example.
  • the lyrics information storage unit 23 stores a lyrics file 230 related to lyrics.
  • the lyrics file 230 is stored when the lyrics information is registered in the management server 20.
  • the lyric file 230 includes data related to music codes and lyrics.
  • the music code data area stores data relating to an identifier for specifying a music.
  • the lyric data area stores lyric information related to lyric text and ruby. Lyric text and ruby indicated by the lyric information function as a telop corresponding to vocals.
  • the music storage unit 24 stores music management data 240 to be provided to the user terminal 40.
  • the music management data 240 is stored when karaoke data is generated from the master music.
  • the music management data 240 is associated with a music code and includes an HLS file 241, an XML file 242, a vocal mask file 243, and a music management record 244.
  • the song code data area stores data relating to an identifier for specifying a song.
  • the HLS file 241 includes data obtained by transcoding the compressed video of the master disc.
  • the HLS file 241 includes data according to the HD format for high speed lines and data according to the SD format for low speed lines.
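Because the HLS file contains an HD stream for high-speed lines and an SD stream for low-speed lines, a user terminal can pick a variant from its measured bandwidth. The following sketch shows that selection; the bandwidth threshold is an assumption, not a value from the patent.

```python
# Hedged sketch of HLS variant selection for the HLS file 241.
# The 3000 kbps cutoff is an illustrative assumption.

HD_MIN_KBPS = 3000  # assumed minimum bandwidth for the HD variant

def select_hls_variant(bandwidth_kbps):
    """Return which stream of the HLS file 241 to request."""
    return "HD" if bandwidth_kbps >= HD_MIN_KBPS else "SD"
```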
  • the XML file 242 stores song attribute information and lyrics synchronization information indicating lyrics associated with a time index that is a reproduction time.
  • the XML file 242 includes music composition information in which each of A melody, B melody, and C melody, which are musical composition elements, is associated with a time index.
  • the XML file 242 includes chapter information in which each chapter in the medley music is associated with a time index.
  • the XML file 242 includes the following elements:
    - lyrics: the lyrics file version is specified.
    - header: music meta information and style definitions are stored.
    - song_meta: meta information of the music is stored.
    - style: style definitions and references are stored.
    - lyrics_track: the lyrics body is stored.
    - verse: lyric breaks are stored.
    - page: lyrics for one page and their display/erase timing are defined.
    - line: lyric definitions for one line are stored.
    - ruby: ruby definitions are stored.
    - word: lyric text and wipe processing timing are defined.
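The element hierarchy listed above can be illustrated by building a minimal document with the standard library. The element names follow the description; the attribute names (`start`, `end`, `wipe_start`, `wipe_end`) are illustrative assumptions, since the patent does not specify them.

```python
# Sketch of the XML file 242 layout: lyrics > header (song_meta, style) and
# lyrics > lyrics_track > verse > page > line > word (with ruby). Attribute
# names for timing are assumptions for illustration.
import xml.etree.ElementTree as ET

lyrics = ET.Element("lyrics", version="1.0")
header = ET.SubElement(lyrics, "header")
ET.SubElement(header, "song_meta", title="example song")
ET.SubElement(header, "style", id="default")

track = ET.SubElement(lyrics, "lyrics_track")
verse = ET.SubElement(track, "verse")
page = ET.SubElement(verse, "page", start="12.0", end="18.5")  # display/erase timing
line = ET.SubElement(page, "line")
word = ET.SubElement(line, "word", wipe_start="12.0", wipe_end="13.2")
word.text = "lyric text"
ET.SubElement(word, "ruby").text = "ruby text"

xml_bytes = ET.tostring(lyrics)
```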
  • the vocal mask file 243 stores the vocal range by associating the time index with the vocal range.
  • the vocal mask file 243 is used for specifying a vocal line.
  • the music management record 244 is data relating to the performer, voice quality, tone, performance time, video usage, medley attribute, status, scene, and sound range.
  • the performer data area stores data relating to an identifier, such as a singer name, for specifying a performer.
  • the voice quality data area stores data related to the feature amount of the singer's voice.
  • the feature amount for the singer's voice is, for example, a statistical value calculated from the main melody.
  • the tone data area stores data related to the tempo and rhythm of the music.
  • the data relating to the tempo and rhythm of the music is, for example, a feature value that is a statistical value calculated from the music.
  • the scene data area stores information for identifying, for example, the season associated with the song, the scenery such as the sea, mountains, and sky associated with the song, and the emotions such as joy, sadness, and resentment associated with the song.
  • the range data area stores data related to the range required for the vocal of the music.
  • the performance time data area stores data related to the performance time, which is the time required to reproduce the music from the beginning to the end.
  • the video use data area stores a flag identifying whether the music master uses video in the music HLS file 241.
  • the medley attribute data area stores information for identifying whether or not the music is medley.
  • the medley attribute data area stores the music code of the music constituting the medley.
  • the status data area stores a flag for specifying the creation status of the music HLS file 241 and XML file 242.
  • the status data area stores a “before editing” flag for a file immediately after registration, and a “provided” flag for a file after editing.
  • the reproduction record storage unit 25 stores reproduction record data 250 relating to the use record of the karaoke file.
  • the reproduction result data 250 is stored when the user terminal 40 uses a karaoke file.
  • the reproduction performance data 250 includes a user performance record 251 and a music performance record 252.
  • the user performance record 251 includes data related to the playback date/time, user code, and music code.
  • the reproduction date / time data area stores data related to the date and time when the music was reproduced in the user terminal 40.
  • the user code data area stores data relating to an identifier for specifying the user who has played the music.
  • the music code data area stores data relating to an identifier for specifying the reproduced music.
  • the music performance record 252 includes data related to the music code and the number of playbacks.
  • the music code data area stores data relating to an identifier for specifying the reproduced music.
  • the reproduction frequency data area stores data relating to the total number of times that a music piece has been reproduced.
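The relation between the two records above can be sketched as an aggregation: the per-song totals of the music performance record 252 are derived from the individual user performance records 251. The tuple/dict representation below is an assumption for illustration.

```python
# Sketch: deriving the music performance record 252 (total playback count
# per song) from user performance records 251. Record fields follow the
# description; the in-memory representation is an illustrative assumption.
from collections import Counter

def aggregate_playbacks(user_records):
    """user_records: iterable of (playback_datetime, user_code, music_code)."""
    totals = Counter(music_code for _, _, music_code in user_records)
    return dict(totals)  # music_code -> total number of playbacks

records = [
    ("2016-06-01 20:00", "U01", "S001"),
    ("2016-06-01 21:30", "U02", "S001"),
    ("2016-06-02 19:15", "U01", "S002"),
]
totals = aggregate_playbacks(records)
# totals == {"S001": 2, "S002": 1}
```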
  • the resource storage unit 26 stores a resource management file 260 relating to a karaoke usage fee acquired from the user.
  • karaoke usage fees are an example of the resources.
  • the resource management file 260 is stored when use registration of the karaoke service is performed.
  • the resource management file 260 includes data relating to a user code, a contract form, and an information fee.
  • the user code data area stores data relating to an identifier for identifying a user who pays a usage fee.
  • the contract form data area stores a flag for specifying a contract form with the user.
  • the contract form with a user is, for example, a consumer contract form such as per-song billing or flat-rate payment (unlimited singing).
  • the contract form with a user may also be a business contract form, for example flat-rate payment per user terminal 40.
  • the information fee data area stores data related to the information fee, which is the amount to be paid when using the service, based on the contract form with the user.
  • the distribution management information storage unit 27 stores a distribution management file 270 for specifying the distribution of the funds for each distribution target person.
  • the distribution management file 270 is stored when payment conditions with each distribution target person are registered.
  • the distribution management file 270 includes data relating to music codes, payees, and distribution conditions.
  • the song code data area stores data relating to an identifier for specifying a song.
  • the payee data area stores data related to payees of distributions related to music.
  • the payee of the distribution money is, for example, a player or a record company that is an example of a distribution target person.
  • the distribution condition data area stores data relating to a calculation method of distribution paid to the payee.
  • a ratio to the charging fee is stored as a payment condition.
  • a method for calculating the distribution money based on the distribution source is stored as the payment condition.
  • the distribution record information storage unit 28 stores a distribution record record 280 related to the distribution record for the distribution destination.
  • the distribution record 280 is stored when payment is made.
  • the distribution record 280 includes data relating to distribution date, distribution destination, and distribution amount.
  • the distribution date data area stores data relating to the date of distribution of the funds.
  • the distribution destination data area stores data relating to an identifier for specifying the distribution destination of the distribution money.
  • the identifier for specifying the distribution destination is data for specifying the performer or record company as the distribution destination.
  • the distribution amount data area stores data related to the amount to be distributed to the distribution destination.
  • the master karaoke data generation process will be described with reference to FIG.
  • the control unit 21 executes a process of acquiring a promotion video that is an example of a master file (step S1-1). Specifically, the management unit 210 acquires a promotion video from the content server 10 as an mp4 file. Further, the management unit 210 acquires attribute information of the promotion video. Then, the management unit 210 registers the acquired promotion video and attribute information in the master disk storage unit 22.
  • the control unit 21 executes transcoding processing (step S1-2). Specifically, the management unit 210 converts the mp4 file registered in the master storage unit 22 into an HLS file. The management unit 210 converts the mp4 file into an HD file used for a high-speed line and an SD file used for a low-speed line. Then, the management unit 210 registers the generated HLS file in the music storage unit 24 in association with the music code.
  • the control unit 21 executes main melody estimation processing (step S1-3). Specifically, the vocal processing unit 211 converts the mp4 file into a predetermined file format. Then, the vocal processing unit 211 estimates a main melody, such as a vocal line, in the music.
  • the control unit 21 executes a vocal mask file creation process (step S1-4). Specifically, the vocal processing unit 211 identifies the vocal sound range based on the estimated main melody.
  • the vocal processing unit 211 creates a vocal mask file that stores the specified vocal range.
  • the vocal processing unit 211 stores the created vocal mask file in the music storage unit 24 in association with the music code.
  • the control unit 21 executes a voice quality registration process (step S1-5). Specifically, the vocal processing unit 211 calculates a feature quantity of the voice quality of the vocal included in the music. Then, the vocal processing unit 211 registers the calculated voice quality in the music management record 244 stored in the music storage unit 24.
  • the control unit 21 executes a lyrics allocation process (step S1-6). Specifically, the lyrics management unit 212 acquires, from the lyrics information storage unit 23, the lyrics file 230 in which the song code of the processing target song is stored.
  • the lyrics management unit 212 associates the reproduction time corresponding to the vocal range specified by the vocal processing unit 211 with the lyrics, and generates an XML file 242 that is a lyrics synchronization file in which lyrics are assigned. Then, the lyrics management unit 212 registers the XML file 242 in the music storage unit 24.
  • the control unit 21 executes an editing process (step S1-7). Specifically, the management unit 210 stores the "before editing" flag in the music management record 244 after the voice quality registration process and the lyrics allocation process are completed. Then, the editor 213 provides the created HLS file 241, XML file 242, and vocal mask file 243 to the person-in-charge terminal 30.
  • the person-in-charge terminal 30 reproduces the karaoke file according to the operation of the person in charge of editing, who confirms whether the volume of the vocal sound changes as specified by the vocal mask and whether the vocal sound and the lyrics are synchronized.
  • the editor 213 corrects the XML file 242 and the vocal mask file 243 according to the operation of the editor.
  • the editor 213 that has acquired the editing end information from the person-in-charge terminal 30 stores the “provided” flag in the music management record 244.
  • the vocal processing unit 211 acquires singer information stored in the attribute data area of the master file 220 (step S2-1).
  • the vocal processing unit 211 acquires singer information of each singer.
  • the vocal processing unit 211 identifies the tone of the music to be processed.
  • the vocal processing unit 211 extracts, from the music storage unit 24, the music management record 244 in which the processing target singer is stored in the singer data area.
  • the vocal processing unit 211 specifies a music similar to the music tone of the music to be processed as a similar music (step S2-2).
  • the control unit 21 executes processing for specifying the voice quality of the processing target singer (step S2-3).
  • the voice quality identification process uses the following two methods.
  • the control unit 21 executes a process for specifying voice quality based on similar music (step S3-1). Specifically, the vocal processing unit 211 acquires the vocal range, which is the voice quality of the singer, stored in the music management record 244 of the similar music.
  • the control unit 21 executes a process for specifying the voice quality from the singing location (step S3-2). Specifically, the vocal processing unit 211 identifies a singing area in which only the sound of the processing target singer is included among the sounds in the HLS file 241. Then, the vocal processing unit 211 specifies the vocal sound range in the identified singing area.
  • the control unit 21 executes a first mask generation process (step S2-4).
  • the control unit 21 generates a vocal mask file corresponding to the vocal range through the entire song using the specified vocal range.
  • the control unit 21 determines whether or not the vocal mask is sufficient (step S2-5). Specifically, the vocal processing unit 211 plays the song with the vocal volume reduced by the generated vocal mask and confirms whether the singer's singing voice can still be recognized.
  • the vocal processing unit 211 also confirms whether the main melody of the music is distorted by the vocal mask. If the singing voice cannot be recognized and the main melody is not distorted, the vocal processing unit 211 determines that the vocal mask is sufficient.
  • the control unit 21 ends the vocal mask file generation processing for the processing target singer.
  • if the vocal mask is not sufficient, the control unit 21 executes a second mask generation process (step S2-6): it specifies a vocal range for each vocal phrase and generates a vocal mask file corresponding to each vocal range.
  • the control unit 21 then executes the process for determining whether the vocal mask is sufficient (step S2-7).
  • if the vocal mask is sufficient, the control unit 21 ends the vocal mask file generation processing for the processing target singer.
  • otherwise, the control unit 21 executes a third mask generation process (step S2-8).
  • the control unit 21 estimates a vocal sound range for each predetermined time interval, and generates a vocal mask file that follows the dynamically specified vocal sound range.
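The three-stage escalation described in steps S2-4 through S2-8 (a whole-song mask, then a per-phrase mask, then a per-time-step mask that follows the vocal range dynamically) can be sketched as follows. The range-estimation inputs and the sufficiency callback are stand-ins for the actual main-melody analysis, not the patented implementation.

```python
# Sketch of the three-stage mask generation flow. Each mask is a list of
# (start_time, end_time, vocal_range) entries; the representations are
# illustrative assumptions.

def whole_song_mask(vocal_range, duration):
    # First mask: one fixed range over the entire song.
    return [(0.0, duration, vocal_range)]

def per_phrase_mask(phrases):
    # Second mask: one range per continuous singing phrase.
    return [(start, end, rng) for start, end, rng in phrases]

def per_time_step_mask(ranges_per_step, step):
    # Third mask: the range is re-estimated every time step.
    return [(i * step, (i + 1) * step, rng)
            for i, rng in enumerate(ranges_per_step)]

def generate_mask(is_sufficient, vocal_range, duration, phrases,
                  ranges_per_step, step):
    # Try each strategy in order, keeping the first sufficient mask.
    for candidate in (whole_song_mask(vocal_range, duration),
                      per_phrase_mask(phrases),
                      per_time_step_mask(ranges_per_step, step)):
        if is_sufficient(candidate):
            return candidate
    return candidate  # fall back to the most detailed mask

mask = generate_mask(lambda m: len(m) > 1, (200, 800), 240.0,
                     [(10.0, 20.0, (200, 700)), (30.0, 40.0, (250, 750))],
                     [(200, 700), (250, 750)], 5.0)
# the whole-song mask has one entry and fails the (toy) sufficiency check,
# so the per-phrase mask with two entries is returned.
```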
  • the control unit 21 repeats the above processing for all singers included in the music to be processed.
  • the first mask generation process will be described with reference to FIG.
  • the vocal processing unit 211 estimates the vocal sound range of the music from the voice quality of the singer specified based on similar music or the singing location (step S4-1).
  • the vocal processing unit 211 generates a vocal mask file for masking a single vocal range (step S4-2).
  • a vocal mask 501 for masking a single vocal range is generated throughout the music.
  • in the graph, the horizontal axis indicates time and the vertical axis indicates sound range; the upper thick line indicates the guitar sound range and the lower broken line indicates the drum sound range.
  • the vocal processing unit 211 identifies a playback time zone in which the vocal is included based on the vocal line.
  • the vocal processing unit 211 identifies a single group of continuous singing as a phrase (step S5-1).
  • the vocal processing unit 211 identifies a vocal range for each phrase based on the vocal line (step S5-2).
  • the vocal processing unit 211 generates a vocal mask file for masking the vocal range estimated for each phrase (step S5-3).
  • vocal masks 511 and 512 created for each phrase are generated.
  • in the graph, the horizontal axis indicates time and the vertical axis indicates sound range; the upper thick line indicates the guitar sound range and the lower broken line indicates the drum sound range.
  • the vocal processing unit 211 divides the music into time steps of a predetermined length (step S6-1). The control unit 21 then sequentially selects the time step to be processed and repeats the following processing.
  • the control unit 21 executes a process for specifying a vocal range included in the specified time step (step S6-2). Next, the control unit 21 executes a process of generating a vocal mask file that masks the specified vocal range (step S6-3).
  • vocal masks 521 and 522 created for each time step are generated.
  • the mask area dynamically changes with respect to the vocal sound area.
  • the horizontal axis indicates time, and the vertical axis indicates the sound range; the upper thick line indicates the guitar sound range, and the lower broken line indicates the drum sound range.
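The per-time-step mask generation described above (steps S6-1 to S6-3) can be sketched as follows. This is a minimal illustration, not the disclosed implementation: the function names, the step length, and the `vocal_line` callback that reports the vocal pitches at a given time are all assumptions.

```python
def generate_time_step_masks(vocal_line, song_length, time_step=2.0):
    """Divide the song into fixed time steps and build one mask per step.

    vocal_line: hypothetical callback mapping a time (seconds) to the list
    of vocal pitches (Hz) active at that time, empty when no vocal.
    Returns a list of (start, end, low_hz, high_hz) mask entries.
    """
    masks = []
    t = 0.0
    while t < song_length:
        end = min(t + time_step, song_length)
        # Collect the vocal pitches observed inside this time step (step S6-2).
        pitches = [p for u in frange(t, end, 0.1) for p in vocal_line(u)]
        if pitches:
            # Mask only the range actually used in this step (step S6-3),
            # so the mask region follows the vocal dynamically.
            masks.append((t, end, min(pitches), max(pitches)))
        t = end
    return masks

def frange(start, stop, step):
    # Simple float range helper for sampling within a time step.
    while start < stop:
        yield start
        start += step
```

A time step with no vocal produces no mask entry, which matches the dynamically changing mask region shown for the third mask generation process.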
  • the time series graph 600 shows an example including a female vocal and a male vocal.
  • a time series graph 601 shows an example in which the volume of the female vocal is individually lowered using the vocal mask file for the female vocal.
  • the time series graph 602 shows an example in which the volume of the male vocal is individually lowered using the vocal mask file of the male vocal.
  • a time series graph 603 shows an example in which the volume of both vocals is individually lowered using both female and male vocal mask files.
  • the lyrics management unit 212 extracts the lyrics file 230 in which the song code of the processing target song is stored from the lyrics information storage unit 23, and acquires the lyrics phrase to be assigned to the song (step S7-1).
  • the lyrics management unit 212 estimates a singing voice section that includes a singing voice and a non-singing voice section that does not include a singing voice based on the vocal line. Then, the lyrics management unit 212 uses the singing voice section to specify the start time that is the start position of the phrase and the end time that is the end position (step S7-2).
  • the lyrics management unit 212 generates a description in which characters and ruby are assigned to the start position and end position of the phrase in the XML file (step S7-3). Then, the control unit 21 determines whether all the lyrics have been assigned (step S7-4).
  • When the control unit 21 determines that unassigned lyrics remain ("NO" in step S7-4), it repeats the processing from the step of acquiring the lyrics to be assigned (step S7-1). On the other hand, when it determines that all the lyrics have been assigned ("YES" in step S7-4), the control unit 21 ends the synchronized lyrics process.
  • the XML file 700 shown in FIG. 19 is generated by the above processing.
  • the description 701 includes song metadata relating to lyrics, composition, arrangement, etc. stored in the attribute data area of the master file 220.
  • the description 702 includes music metadata relating to the tempo stored in the attribute data area of the master file 220.
  • the description 703 includes a description for wiping “August” with “hachigatsu” added as ruby over a predetermined period.
  • the description 704 includes a description for wiping “sun” with “taiyo” added as ruby over a predetermined period.
  • the description 705 includes a description for wiping “NO” over a predetermined period
  • the description 706 includes a description for wiping “NO” over a predetermined period
  • the description 707 includes a description for changing a page to be displayed on one screen during karaoke playback.
  • an XML file for displaying lyrics in accordance with the progress of the music is generated.
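The synchronized lyrics processing (steps S7-1 to S7-3) produces an XML file such as the XML file 700. The following sketch shows one way such a file could be assembled; the tag and attribute names (`karaoke`, `phrase`, `ruby`, `start`, `end`) are illustrative assumptions, not the actual schema.

```python
import xml.etree.ElementTree as ET

def build_lyrics_xml(metadata, phrases):
    """phrases: list of dicts with 'text', 'ruby', 'start', 'end' (seconds)."""
    root = ET.Element("karaoke")
    meta = ET.SubElement(root, "meta")
    for key, value in metadata.items():            # song metadata (descriptions 701, 702)
        ET.SubElement(meta, key).text = str(value)
    for p in phrases:                              # wiped phrases (descriptions 703-706)
        el = ET.SubElement(root, "phrase",
                           start=f"{p['start']:.2f}", end=f"{p['end']:.2f}")
        el.text = p["text"]
        if p.get("ruby"):                          # ruby reading, when present
            el.set("ruby", p["ruby"])
    return ET.tostring(root, encoding="unicode")

xml_text = build_lyrics_xml(
    {"tempo": 120},
    [{"text": "August", "ruby": "hachigatsu", "start": 12.0, "end": 14.5}],
)
```

Each phrase element carries its start and end time so the player can wipe the characters in step with the music.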
  • the playback determination process will be described with reference to FIG. This process is performed when a karaoke file is reproduced on the user terminal 40.
  • the reproduction management unit 214 provides a karaoke file to the user terminal 40 and instructs the user terminal 40 to reproduce the karaoke file (step S8-1).
  • the playback management unit 214 acquires the playback status from the user terminal 40, and determines whether or not playback has ended (step S8-2). The playback management unit 214 determines that playback has ended when the entire melody of the music has finished or when the music is stopped partway through.
  • the playback management unit 214 calculates the playback time from the start time of the music to the time when the playback end is detected (step S8-3).
  • the reproduction management unit 214 uses the music storage unit 24 to extract a music management record 244 in which the music code of the reproduced music is stored. Then, the reproduction management unit 214 acquires the length of the song that is the performance time from the extracted song management record 244 (step S8-4).
  • the reproduction management unit 214 specifies a reproduction determination time based on the performance time (step S8-5).
  • for a song whose performance time is less than 60 seconds, the playback management unit 214 uses 2/3 of the performance time, which is the performance reference criterion, as the playback determination time.
  • the playback management unit 214 uses 40 seconds as a playback determination time for a musical piece having a performance time of 60 seconds or longer.
  • the playback management unit 214 executes processing for determining whether or not the playback determination time has elapsed (step S8-6).
  • When the playback time has passed the playback determination time ("YES" in step S8-6), the playback management unit 214 generates a user result record 251 and registers it in the playback result storage unit 25. Furthermore, the playback management unit 214 increments the number of reproductions in the music record 252 for this music (step S8-7).
  • for a song played for the first time, the playback management unit 214 newly generates a music record 252 and registers it in the playback record storage unit 25. In this case, the playback management unit 214 sets the number of reproductions in the music record 252 to an initial value of "1".
  • On the other hand, when the playback time has not passed the playback determination time ("NO" in step S8-6), the playback management unit 214 skips the playback count processing (step S8-7) and ends the playback determination process.
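The determination rule of steps S8-5 and S8-6 can be expressed compactly. In this sketch the function names, and the use of `>=` for "has passed", are assumptions:

```python
def playback_determination_time(performance_time):
    """Step S8-5 rule: 2/3 of the performance time for songs shorter
    than 60 seconds, otherwise a flat 40 seconds."""
    if performance_time < 60:
        return performance_time * 2 / 3
    return 40

def count_as_played(playback_time, performance_time):
    # Step S8-6: the play is counted only when the playback time has
    # passed the playback determination time.
    return playback_time >= playback_determination_time(performance_time)
```

For example, a 45-second song is counted after 30 seconds, while a 3-minute song is counted after 40 seconds.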
  • the distribution management process is executed every predetermined period such as one month.
  • the distribution management unit 215 specifies a piece of music for individual charging in the distribution conditions of the distribution management file 270 stored in the distribution management information storage unit 27 (step S9-1).
  • the distribution management unit 215 calculates the distribution money based on the individual rules stored in the distribution management file 270 (step S9-2).
  • the distribution management unit 215 identifies a medley song based on the medley attribute stored in the song management record 244 of the song storage unit 24 (step S9-3).
  • the distribution management unit 215 obtains the music code of the medley constituent music that is an individual music constituting the medley music from the medley attribute data area in the music management record 244 of the medley music (step S9-4).
  • the distribution management unit 215 calculates the number of distributions by dividing the number of times the medley music has been played by the number of medley constituent songs. Then, the distribution management unit 215 adds the calculated number of distributions to the number of reproductions in the music record 252 in which the music code of each medley constituent song is stored (step S9-5).
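The medley handling of steps S9-4 and S9-5 amounts to dividing the medley's play count evenly among its constituent songs. A minimal sketch, with assumed function and parameter names:

```python
def distribute_medley_plays(play_counts, medley_plays, constituent_codes):
    """Split the medley's play count evenly among its constituent songs
    and add the share to each song's play count (steps S9-4, S9-5).

    play_counts: dict mapping song code -> number of reproductions.
    """
    share = medley_plays / len(constituent_codes)
    for code in constituent_codes:
        play_counts[code] = play_counts.get(code, 0) + share
    return play_counts
```

A medley played 6 times over 3 constituent songs adds 2 reproductions to each constituent song's record.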
  • the control unit 21 executes a distribution process of the fixed-amount music for customers (step S9-6). Specifically, the distribution management unit 215 distributes the resources separately for individual charging and for all-you-can-sing.
  • the control unit 21 executes a distribution process of the fixed-amount music for business (step S9-7). Specifically, the distribution management unit 215 distributes the resources according to the monthly information fee for each terminal.
  • a fixed amount music distribution process will be described with reference to FIG.
  • a process target period is determined, and the process is performed separately for the customer and the business.
  • the distribution process for the customer and the distribution process for the business differ only in the resources, and the processing procedure is the same.
  • the control unit 21 executes an information fee total acquisition process (step S10-1). Specifically, the distribution management unit 215 acquires the total amount of the information fee for fixed-amount payment from the resource storage unit 26. The distribution management unit 215 calculates the totals separately for customers and for business.
  • the control unit 21 executes a process for calculating the total number of reproductions (step S10-2). Specifically, the distribution management unit 215 specifies the music codes of the songs for which individual charging is not stored in the distribution management file 270 stored in the distribution management information storage unit 27. Next, the distribution management unit 215 obtains the music records 252 in which the specified music codes are stored from the reproduction record storage unit 25. Then, the distribution management unit 215 sums the numbers of reproductions stored in the acquired music records 252.
  • the control unit 21 repeats the following processes for each song.
  • control unit 21 executes a process for obtaining the number of reproductions (step S10-3).
  • the distribution management unit 215 acquires the music record 252 in which the music code to be processed is stored from the reproduction record storage unit 25.
  • control unit 21 executes a process for calculating the reproduction ratio (step S10-4). Specifically, the distribution management unit 215 calculates the reproduction ratio by dividing the number of reproductions of the music to be processed by the total number of reproductions.
  • the control unit 21 executes a process for calculating the distribution money (step S10-5). Specifically, the distribution management unit 215 acquires the resources for the processing target period from the resource storage unit 26. Here, for the customer-oriented process, the distribution management unit 215 uses the total information fee for "all-you-can-sing". On the other hand, for the business-oriented process, the distribution management unit 215 uses the total information fee for each terminal. Then, the distribution management unit 215 calculates the distribution amount by multiplying the total information fee by the reproduction ratio of the music to be processed.
  • control unit 21 executes a calculation process for the copyright management organization in the distribution (step S10-6). Specifically, the distribution management unit 215 calculates the distribution amount for the copyright management organization by multiplying the distribution money by the distribution rate of the copyright management organization. Then, the distribution management unit 215 generates a distribution record record 280 and stores it in the distribution record information storage unit 28.
  • control unit 21 executes a calculation process for the performer in the distribution money (step S10-7). Specifically, the distribution management unit 215 calculates the distribution amount for the performer by multiplying the distribution amount by the distribution rate for the song. Then, the distribution management unit 215 generates a distribution record record 280 and stores it in the distribution record information storage unit 28.
  • control unit 21 executes a process for determining whether or not the processing target uses video (step S10-8). Specifically, the distribution management unit 215 confirms whether or not the video is included in the HLS file 241 stored in the music storage unit 24. When the use flag is set in the video use data area of the music management record 244, the distribution management unit 215 determines that the processing target uses video.
  • the control unit 21 executes calculation processing for the video in the distribution (step S10-9). Specifically, the distribution management unit 215 calculates the distribution amount for the video by multiplying the distribution by the distribution rate for the video. Then, the distribution management unit 215 generates a distribution record record 280 and stores it in the distribution record information storage unit 28.
  • On the other hand, when the processing target does not use video, the control unit 21 skips the calculation processing for the video in the distribution (step S10-9) and ends the fixed-amount music distribution process.
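The fixed-amount distribution loop (steps S10-2 through S10-9) can be summarized as: compute each song's reproduction ratio, multiply it by the total information fee, and split the result by the distribution rates. The sketch below assumes hypothetical rate names and a placeholder video-use check:

```python
def fixed_amount_distribution(total_fee, play_counts, rates):
    """Sketch of steps S10-2 to S10-9 for one resource pool.

    play_counts: dict mapping song code -> number of reproductions.
    rates: assumed names for the distribution rates of the copyright
    management organization, the performer, and video use.
    """
    total_plays = sum(play_counts.values())        # step S10-2
    results = {}
    for code, plays in play_counts.items():
        ratio = plays / total_plays                # step S10-4
        amount = total_fee * ratio                 # step S10-5
        results[code] = {
            "copyright_org": amount * rates["copyright_org"],  # step S10-6
            "performer": amount * rates["performer"],          # step S10-7
        }
        if rates.get("video") and uses_video(code):            # step S10-8
            results[code]["video"] = amount * rates["video"]   # step S10-9
    return results

def uses_video(code):
    # Placeholder for checking the video-use flag in the music
    # management record 244; always False in this sketch.
    return False
```

With a 1000-unit fee and plays of 3 and 1, the songs receive 750 and 250 units before the per-party rates are applied.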
  • the following effects can be obtained.
  (1) Since a karaoke file can be generated using the master music, the service user can sing with the same accompaniment as the performer.
  (2) Even when a plurality of vocals are included, the volume of the vocal can be adjusted for each singer because a vocal mask file is generated for each singer.
  (3) Since the second mask generation process is executed when the vocal mask is determined to be insufficient, distortion of the music due to the vocal mask can be suppressed. Furthermore, since the third mask generation process is executed, a precise vocal mask can be generated.
  • Since the reproduction count process is executed when the reproduction time has passed the reproduction determination time, the appropriate number of reproductions can be counted.
  • Since the distribution calculation process is executed based on the individual rules, the distribution can be calculated based on the contract with each distribution target person.
  • the voice quality specifying process (step S2-3) performed by the control unit 21 may be either the voice quality specifying process using a similar music piece (step S3-1) or the voice quality specifying process using a solo location (step S3-2); either one is acceptable.
  • When the voice quality is specified by both methods, the control unit 21 generates a vocal mask file using the voice quality specified based on a predetermined rule.
  • the predetermined rule is, for example, to specify a range including both vocal ranges as a voice quality.
  • the distribution conditions included in the distribution management file 270 may be set based on the difficulty of specifying the vocal sound range. For example, when a vocal mask file is generated by the first mask process, a lower distribution rate is set for the performer than when a vocal mask file is generated by the second mask process. Furthermore, when the vocal mask file is generated by the second mask process, a lower distribution ratio is set for the performer than when the vocal mask file is generated by the third mask process. Thereby, the distribution rate can be changed according to the difficulty of generating the vocal mask file.
  • the method for determining whether or not the music has been reproduced may be, for example, determining whether or not the position of the chorus in the music has been reproduced.
  • the control unit 21 specifies the position of the chorus based on a specific pattern in the process of estimating the main melody (step S1-3). Then, the specified chorus position is stored in the music management record 244 in association with the time index. The playback management unit 214 then determines whether or not the music has been played based on the playback time and the position of the chorus. Even if the playback time has not passed the playback determination time, the playback management unit 214 determines that playback has been performed when the chorus has been played. In addition, when the users constituting a single group each use the karaoke service with their own user terminals 40, the number of reproductions may be adjusted.
  • the management server 20 includes a group information storage unit.
  • the group information storage unit stores data related to the user code of each user terminal 40 used as a group with respect to the group management code.
  • the management server 20 acquires the designation of a song that uses the karaoke service
  • the management server 20 stores the parent device information in association with the user code of the user terminal 40 for which the song is designated. Then, the service management system executes a group reproduction management process shown in FIG.
  • the control unit 21 executes a process for registering a group (step S11-1). Specifically, when a plurality of users use the karaoke service using their own user terminals 40, each user terminal 40 accesses the management server 20.
  • the reproduction management unit 214 assigns a group management code to the group.
  • the reproduction management unit 214 stores the user code associated with the group management code in the group information storage unit for the user terminal 40 logged in using the group management code (group registration).
  • In the group registration, it is assumed that the user terminals 40a to 40c log in using the same group management code.
  • control unit 21 executes a process of accepting music designation (step S11-2). Specifically, the reproduction management unit 214 obtains the music code of the music to be used from any user terminal 40 registered as a group. For example, it is assumed that a music designation is received from the user terminal 40b.
  • control unit 21 executes a process for specifying the designated person as the parent device (step S11-3). Specifically, the reproduction management unit 214 registers the user terminal 40b as a parent device for the previously specified music in the user terminals 40a to 40c to which the group management code is assigned.
  • the control unit 21 executes a process for instructing synchronization (step S11-4). Specifically, the reproduction management unit 214 transmits a synchronization instruction to the user terminals 40a to 40c registered in the group. In this case, the reproduction management unit 214 acquires the music download status at each of the user terminals 40a to 40c registered in the group. Then, the reproduction management unit 214 instructs each user terminal 40 to wait until downloading of all the user terminals 40a to 40c is completed.
  • control unit 21 executes a reproduction process (step S11-5). Specifically, when the download management information is acquired in all the user terminals 40a to 40c registered in the group, the playback management unit 214 instructs to start playback of the music at the same time.
  • control unit 21 executes a process for storing the playback results (step S11-6). Specifically, the reproduction management unit 214 registers the user performance record 251 using the user code of the user terminal 40b designated as the parent device.
  • the playback management unit 214 may use the playback of music at the same place as a condition.
  • location information is acquired from each of the user terminals 40a to 40c.
  • As the location information, for example, position information from a GPS function provided in the user terminals 40a to 40c can be used. Accordingly, the control unit 21 can determine that the service is being used at the same place at the same time.
  • the lyrics synchronization status may be confirmed based on information from the user.
  • the management server 20 includes a correction information storage unit, and executes lyrics synchronization adjustment processing.
  • the correction information storage unit stores a request management record relating to a correction request acquired from the user.
  • the request management record stores data related to the user code, the reproduction time, the score, and the correspondence result for the music code.
  • the user code data area stores data relating to an identifier for specifying a user who has made a correction request.
  • the reproduction time data area stores data relating to a time index for specifying a portion desired to be corrected in the music.
  • the score data area stores data related to the score given to the correction request.
  • the response result data area stores a flag for specifying the response result for the correction request. Either the “correction” flag when the correction is performed or the “correction unnecessary” flag when the correction is not performed is stored.
  • the lyrics management unit 212 stores data related to the correction confirmation reference point for confirming whether correction is necessary.
  • the lyrics management unit 212 stores weighting information to be given to the score based on the user attribute.
  • the weighting information stores a weighting value to be given to the correction ratio corrected based on the correction request.
  • the control unit 21 executes a process of acquiring user correction information (step S12-1). Specifically, when it is determined that there is a deviation in the synchronization state of lyrics during reproduction on the user terminal 40, the service user inputs a correction request on the user terminal 40. In this case, the user terminal 40 transmits a correction request to the management server 20. This correction request includes data relating to the music code and the playback time at which the correction request was input. The lyrics management unit 212 receives a correction request from the user terminal 40.
  • the control unit 21 executes processing for storing correction information (step S12-2).
  • the lyrics management unit 212 generates a request management record for the acquired correction request, and stores it in the correction information storage unit.
  • the request management record includes data related to the music code, user code, and playback time.
  • the control unit 21 executes a process of assigning points weighted according to user attributes (step S12-3). Specifically, the lyrics management unit 212 extracts, from the correction information storage unit, the records in which the processed flag is stored among the request management records in which the user code is stored. Then, the lyrics management unit 212 calculates the correction ratio, that is, the number of records in which the correction flag is stored relative to the number of requests. Then, the lyrics management unit 212 assigns a score weighted according to the correction ratio to this correction request, and stores it in the request management record.
  • the control unit 21 executes a process for determining whether or not there is the same correction as the correction request (step S12-4). Specifically, the lyrics management unit 212 extracts from the correction information storage unit a correction management record in which the same music code as the correction request is stored and no flag is stored in the corresponding result data area. Further, the lyrics management unit 212 extracts a record in which the same playback time zone as the correction request is stored from the extracted correction management records.
  • the control unit 21 executes the total processing of the points (step S12-5). Specifically, the lyrics management unit 212 sums up the points stored in the extracted correction management records, and calculates it as a total score.
  • When there is no identical correction ("NO" in step S12-4), the control unit 21 skips the score totaling process (step S12-5).
  • control unit 21 executes processing for determining whether or not the total score exceeds the reference point (step S12-6). Specifically, the lyrics management unit 212 compares the calculated total score with the correction confirmation reference point. If the total score does not exceed the reference point (“NO” in step S12-6), the control unit 21 ends the lyrics synchronization adjustment process.
  • On the other hand, when the total score exceeds the reference point ("YES" in step S12-6), the control unit 21 executes a process for proposing correction (step S12-7).
  • the lyrics management unit 212 transmits a confirmation request to the person-in-charge terminal 30.
  • the confirmation request includes information related to the music code and the reproduction time stored in the request management record.
  • control unit 21 executes a correction determination process (step S12-8).
  • the person-in-charge terminal 30 that has acquired the confirmation request displays the content of the confirmation request.
  • the editor in charge reproduces the music and confirms the synchronization state of the lyrics.
  • When the editor in charge determines that the lyrics are not synchronized properly, the editor modifies the XML file.
  • the lyrics management unit 212 stores a corrected flag in all correction management records related to the same correction request.
  • When the editor in charge determines that the synchronization of the lyrics is appropriate, the editor inputs "correction unnecessary" from the person-in-charge terminal 30.
  • In this case, the lyrics management unit 212 stores the "correction unnecessary" flag in all correction management records related to the same correction request.
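The scoring logic of the lyrics synchronization adjustment process (steps S12-3, S12-5, and S12-6) can be sketched as follows; the weighting thresholds and function names are illustrative assumptions, not part of the disclosure:

```python
def weighted_score(base_score, correction_ratio, weights):
    """Step S12-3 sketch: weight a correction request's score by the
    user's past correction ratio (corrected requests / total requests).

    weights: dict mapping a minimum ratio to a weighting factor; the
    thresholds and factors used here are illustrative assumptions.
    """
    factor = 1.0
    for threshold, value in sorted(weights.items()):
        if correction_ratio >= threshold:
            factor = value
    return base_score * factor

def should_propose_correction(scores, reference_point):
    # Steps S12-5 and S12-6: total the scores of matching requests and
    # compare the total with the correction confirmation reference point.
    return sum(scores) > reference_point
```

A user whose past requests were usually accepted gets a higher weighting factor, so fewer concurring requests are needed before a correction is proposed to the editor in charge.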
  • the provided content may be background music in which the volume of the vocal is individually adjusted or video in which the volume of the vocal is individually adjusted.
  • the service management system according to the second embodiment differs from the first embodiment and its modifications in that the degree to which the vocal volume is individually changed can be set. Therefore, the following description focuses on the differences from the first embodiment and its modifications, and descriptions of overlapping structures are omitted.
  • the user terminal 40 includes a control unit 41A, a communication unit 42A, an input unit 43, a data storage unit 44, a volume detection unit 45, and a music output unit 46.
  • the control unit 41A includes a CPU, DSP, ROM, RAM, and the like. The control unit 41A controls the processes executed by each unit, such as the input unit 43, the communication unit 42A, the data storage unit 44, the volume detection unit 45, and the music output unit 46.
  • the input unit 43 is, for example, a remote controller or a volume switch that provides operation buttons.
  • the input unit 43 inputs various types of information to the control unit 41A in a format that can be processed by the control unit 41A.
  • the information input by the input unit 43 includes a designated mode, a designated mask degree, and a designated period.
  • the degree of masking indicates the degree to which vocals are individually masked during the period when the music is being played based on the music data.
  • the degree of masking is the degree to which the volume of the vocal is relatively lowered with respect to the volume of the music that is an example of the mixed sound.
  • When the vocal is not masked, the vocal volume relative to the volume of the music is the master volume, which is equal to the volume at which the vocal is played from the master file.
  • the volume of the vocal when masked is a support volume smaller than the master volume.
  • the designated mask degree is a mask degree input by the user of the karaoke service to the user terminal 40.
  • the designated mask degree can be input one by one with respect to the vocals included in the music before the music using the music data is reproduced. For example, when the music is a solo music, it is possible to input one designated mask degree for the vocal. When the music is a duet music, it is possible to input a designated mask degree for each of the two types of vocals.
  • the designation mode is a reproduction format that is input to the user terminal 40 by the user of the karaoke service.
  • the designated mode can be input one by one with respect to the reproduction of the music before the music using the music data is reproduced.
  • the designation mode is, for example, any one of a fixed mode, a first variation mode, and a second variation mode.
  • In the fixed mode, the control unit 41A adopts a fixed value as the masking degree, and masks the vocal with that fixed value when reproducing the song from the song data.
  • In the first variation mode, the control unit 41A employs a variation value as the masking degree, and masks the vocal based on the variation value when reproducing the song from the song data.
  • the variation value in the first variation mode changes according to the detected sound volume of the performance sound that is the sound volume detected by the sound volume detector 45.
  • the variation value in the first variation mode changes, for example, so that the support volume becomes lower as the detected volume becomes larger than the reference volume, and becomes higher as the detected volume becomes smaller than the reference volume.
  • the reference volume may be a volume obtained by masking the master volume at a certain rate, or may be a certain volume.
  • In the second variation mode, the control unit 41A adopts a variation value as the mask degree, and masks the vocal with the variation value when reproducing the song from the song data.
  • the variation value in the second variation mode changes so that the total amount of the detected volume and the support volume matches the master volume.
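The three designation modes can be contrasted with a small sketch. The exact formulas for the two variation modes are assumptions consistent with the description above: the first variation mode lowers the support volume as the detected volume rises above the reference volume, and the second variation mode keeps the total of the detected volume and the support volume equal to the master volume.

```python
def support_volume(mode, master_volume, detected_volume,
                   mask_degree=0.5, reference_volume=None):
    """Illustrative support-volume calculation for the three modes.
    Volumes are abstract linear units; the clamping and the linear
    formulas are assumptions of this sketch."""
    if mode == "fixed":
        # Fixed mode: a constant mask degree throughout playback.
        return master_volume * (1 - mask_degree)
    if mode == "first_variation":
        if reference_volume is None:
            reference_volume = master_volume * (1 - mask_degree)
        # Support volume moves opposite to the detected performance volume.
        vol = master_volume * (1 - mask_degree) - (detected_volume - reference_volume)
        return min(max(vol, 0), master_volume)
    if mode == "second_variation":
        # Detected volume + support volume together match the master volume.
        return min(max(master_volume - detected_volume, 0), master_volume)
    raise ValueError(mode)
```

For instance, in the second variation mode a singer producing 60 units against a master volume of 100 leaves a support volume of 40, so the combined level tracks the master.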
  • the input unit 43 inputs, to the control unit 41A, an application period that is a period in which the designated mode is applied during a period in which music is played, in a format that can be processed by the control unit 41A.
  • the application period is, for example, at least one of the A melody, B melody, C melody, chorus, and chapter that compose the music.
  • the control unit 41A controls transmission of data executed by the communication unit 42A and reception of data executed by the communication unit 42A.
  • the data storage unit 44 includes a storage area for storing various data input by the input unit 43 to the control unit 41A and various data such as a karaoke file received by the communication unit 42A.
  • the data storage unit 44 stores a music playback program executed by the control unit 41A to change the volume of the vocal and various data used by the control unit 41A to change the volume of the vocal.
  • the data stored in the data storage unit 44 includes a standard mask degree as a default mask degree.
  • the data stored in the data storage unit 44 includes a standard period as a default application period.
  • the standard mask degree is adopted by the control unit 41A in place of the designated mask degree when no designated mask degree has been input for the reproduction of the music.
  • the standard period is adopted in place of the designated period in the control unit 41A when the designated period is not input for the reproduction of the music.
  • the data stored in the data storage unit 44 includes data related to an identifier for the control unit 41A to specify the designated mode.
  • the control unit 41A controls the storage of data by the data storage unit 44 and the reading of data stored in the data storage unit 44.
  • the control unit 41A reads the music reproduction program stored in the data storage unit 44, and generates a music reproduction signal for reproducing the music from the music data according to the read music reproduction program.
  • the control unit 41A inputs the generated music reproduction signal to the music output unit 46, and causes the music output unit 46 to perform reproduction based on the music reproduction signal.
  • the control unit 41A and the music output unit 46 function as a volume setting unit and a playback processing unit by executing a music playback program.
  • the volume detector 45 detects the volume of the performance sound input to the user terminal 40 as a detected volume.
  • the volume detection unit 45 detects, for example, the volume of a singing voice input to a microphone by a user who plays vocals as a detected volume.
  • the control unit 41A includes a volume setting unit 411 and a reproduction signal generation unit 412.
  • the volume setting unit 411 executes a process for changing the volume of the vocal using the music data and the vocal mask included in the karaoke file, and the designated mask degree and the standard mask degree stored in the data storage unit 44.
  • the vocal mask is data for specifying a vocal line, which is a range including the vocal scale, within the range of the reproduced sound.
  • the vocal mask includes a vocal range as data, and the vocal range is defined with respect to a time index that is a reproduction time of a song.
  • the sound included in the vocal line is specified based on the vocal mask, and the volume of the specified sound is changed based on the mask degree, thereby changing the support volume.
  • the volume setting unit 411 sets the designated mask degree as the volume setting when the designated mask degree is input in the fixed mode or the first variation mode. On the other hand, the volume setting unit 411 sets the standard mask degree as the volume setting when the designated mask degree is not input.
  • the settings in the first variation mode and the second variation mode are auxiliary settings using a varying mask degree.
  • the volume setting unit 411 sets the designated period as the volume setting when the designated period is input in the fixed mode, the first variation mode, or the second variation mode. On the other hand, the volume setting unit 411 sets the standard period when the designated period is not input.
  • in the fixed mode, the volume setting unit 411 executes the process for changing the vocal volume as follows.
  • the volume setting unit 411 refers to music data, vocal mask, designated mask degree, and designated period.
  • the volume setting unit 411 sets the designated masking degree to the masking degree adopted by the reproduction signal generating unit 412 so that the volume of the vocal played from the music data becomes the support volume masked by the designated masking degree.
  • the sound volume setting unit 411 sets the designated period as a period during which the reproduction signal generation unit 412 adopts the designated mask degree.
  • in the first variation mode, the volume setting unit 411 executes the process for changing the vocal volume as follows.
  • the volume setting unit 411 refers to music data, vocal mask, designated mask degree, and designated period.
  • the volume setting unit 411 sets the specified mask degree to the mask degree adopted by the reproduction signal generation unit 412 so that the vocal volume reproduced from the music data becomes the support volume when vocal reproduction is started from the music data. Set.
  • the volume setting unit 411 decreases the masking degree adopted by the reproduction signal generation unit 412 so that the support volume is increased when the volume detected by the volume detection unit 45 is decreased.
  • the volume setting unit 411 increases the masking degree adopted by the reproduction signal generation unit 412 so that the support volume is lowered when the volume detected by the volume detection unit 45 is increased.
  • the volume setting unit 411 sets a designated period as a period during which the reproduction signal generation unit 412 adopts these mask degrees.
  • in the second variation mode, the volume setting unit 411 executes the process for changing the vocal volume as follows.
  • the volume setting unit 411 sequentially performs adjustment of the mask degree adopted by the reproduction signal generation unit 412 with reference to the music data, vocal mask, and specified period. At this time, the volume setting unit 411 adjusts the mask degree so that the total volume of the volume detected by the volume detection unit 45 and the support volume becomes the master volume. Further, the volume setting unit 411 sets a designated period as a period in which the reproduction signal generation unit 412 adopts such a mask degree.
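The three volume-setting behaviors described above can be sketched as follows. This is a minimal illustration only: the function names, the linear volume model, and mask degrees in the range [0, 1] are assumptions for exposition, not details taken from the embodiment.

```python
def support_volume_fixed(master_volume: float, mask_degree: float) -> float:
    """Fixed mode: the vocal is simply attenuated by a fixed mask degree."""
    return master_volume * (1.0 - mask_degree)

def mask_degree_first_variation(mask_degree: float, detected: float,
                                reference: float, step: float = 0.1) -> float:
    """First variation mode: lower the mask degree (raising the support
    volume) when the detected volume falls below the reference volume,
    and raise it when the detected volume exceeds the reference."""
    if detected < reference:
        mask_degree -= step
    elif detected > reference:
        mask_degree += step
    return min(max(mask_degree, 0.0), 1.0)

def mask_degree_second_variation(master_volume: float, detected: float) -> float:
    """Second variation mode: choose the mask degree so that the sum of
    the detected volume and the support volume equals the master volume."""
    support = max(master_volume - detected, 0.0)
    return 1.0 - support / master_volume if master_volume > 0 else 1.0
```

In this sketch the second variation mode derives the mask degree directly from the gap between the detected volume and the master volume, which is one plausible way to realize the "total volume becomes the master volume" behavior.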
  • the reproduction signal generation unit 412 generates a music reproduction signal that is a signal for reproducing music from the music data downloaded from the management server 20.
  • the reproduction signal generation unit 412 generates a video reproduction signal that is a signal for reproducing a video from the video data downloaded from the management server 20.
  • the reproduction signal generation unit 412 uses the music data and the vocal mask, and identifies the vocal from the music reproduced from the music data. In other words, the reproduction signal generation unit 412 generates a signal for reproducing the music from the music data, and specifies a signal corresponding to vocal among the generated signals. Further, the reproduction signal generation unit 412 employs the mask degree set by the volume setting unit 411. Then, the reproduction signal generation unit 412 processes a signal for reproducing the music so that the signal specified as vocal becomes a signal masked with the mask degree, and uses it as a music reproduction signal. Note that the reproduction signal generation unit 412 adopts the period set by the volume setting unit 411 as the period in which the masking degree is adopted, and executes the above processing only during the set period.
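The masking performed by the reproduction signal generation unit 412 can be illustrated with a toy signal model: components whose pitch lies inside the vocal range given by the vocal mask, within the application period, are attenuated by the mask degree. The tuple-based data layout and all names here are assumptions for illustration.

```python
def generate_reproduction_signal(components, vocal_mask, mask_degree, period):
    """components: list of (time_index, pitch, amplitude) tuples.
    vocal_mask: dict mapping time_index -> (low_pitch, high_pitch).
    period: (start, end) application period of the mask degree."""
    start, end = period
    out = []
    for t, pitch, amp in components:
        lo_hi = vocal_mask.get(t)
        in_period = start <= t <= end
        if in_period and lo_hi and lo_hi[0] <= pitch <= lo_hi[1]:
            amp *= (1.0 - mask_degree)   # masked vocal -> support volume
        out.append((t, pitch, amp))
    return out
```

Components outside the vocal range, or outside the application period, pass through unchanged, matching the description that the processing is executed only during the set period.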
  • the volume setting unit 411 reads the music data included in the karaoke file provided from the management server 20 from the data storage unit 44 (step S13-1).
  • the volume setting unit 411 reads the vocal mask included in the karaoke file provided from the management server 20 from the data storage unit 44 (step S13-2).
  • the volume setting unit 411 reads the designation mode input from the input unit 43 for the current music reproduction from the data storage unit 44, and sets the read designation mode in the reproduction signal generation unit 412 (step S13-3).
  • the volume setting unit 411 sets the designated period in the reproduction signal generating unit 412 as the adoption period of the mask degree adopted for the current music reproduction.
  • the volume setting unit 411 sets the standard period in the reproduction signal generation unit 412 as the adoption period of the mask degree employed for the current music reproduction.
  • the volume setting unit 411 sets the designated mask degree in the reproduction signal generation unit 412 as the mask degree adopted for the current music reproduction.
  • the volume setting unit 411 sets the standard mask degree in the reproduction signal generation unit 412 as the mask degree adopted for the current music reproduction.
  • the control unit 41A determines whether or not the designated mode adopted for the current music reproduction is the fixed mode (step S13-4). When it is determined that the designated mode adopted for the current music reproduction is the first variation mode or the second variation mode ("NO" in step S13-4), the control unit 41A starts the reproduction of the music in the first variation mode or the second variation mode that is the designated mode (step S13-5).
  • when it is determined in step S13-4 that the designated mode adopted for the current music reproduction is the fixed mode ("YES" in step S13-4), the control unit 41A starts the reproduction of the music in the fixed mode that is the designated mode (step S13-15).
  • in the reproduction of the music in the first variation mode or the second variation mode, every time the reproduction time reaches the designated period, the control unit 41A repeats the following processing based on the volume detected by the volume detection unit 45 within the designated period.
  • when playing the music in the first variation mode, every time the volume detection unit 45 detects the volume, the volume setting unit 411 of the control unit 41A lowers the mask degree adopted by the reproduction signal generation unit 412 so as to increase the support volume when the detected volume falls below the reference volume.
  • the volume setting unit 411 increases the masking degree adopted by the reproduction signal generation unit 412 so that the support volume is lowered when the volume detected by the volume detection unit 45 is higher than the reference volume.
  • when playing the music in the second variation mode, every time the volume detection unit 45 detects the volume, the volume setting unit 411 adjusts the mask degree so that the total of the volume detected by the volume detection unit 45 and the support volume becomes the master volume (step S13-6).
  • the reproduction signal generation unit 412 generates a music reproduction signal for generating a reproduction sound at a new vocal volume by using the mask degree reflecting the detection result of this time (step S13-7).
  • in the reproduction of the music in the fixed mode, every time the reproduction time reaches the designated period, the control unit 41A sets the standard mask degree so that the vocal volume is masked with the current mask degree (step S13-17). Then, the reproduction signal generation unit 412 generates a music reproduction signal for generating a reproduction sound at the new vocal volume using the mask degree set this time (step S13-18).
  • the solid line indicates the transition of the detected volume, which is the volume detected by the volume detection unit 45, with respect to the playback time.
  • the broken line indicates the transition of the support volume, which is the volume of the vocal in the music reproduced by the user terminal 40, with respect to the playback time.
  • the two-dot chain line indicates the transition of the master volume, which is the volume of the unmasked vocal, with respect to the playback time.
  • FIG. 28 shows the transition of the volume of each sound in the fixed mode.
  • FIG. 29 shows the transition of the volume of each sound in the first variation mode.
  • FIG. 30 shows the transition of the volume of each sound in the second variation mode.
  • the detected volume changes so as to follow the change in the master volume.
  • in the period T of the playback time, for example, the user cannot perform the vocal because the user's voice range deviates from the vocal range required for the vocal, or because the user's voice gives out. Accordingly, the detected volume falls to a level at which it is hardly detected.
  • the support volume masked at a fixed mask degree is smaller than the master volume, and changes throughout the playback time so as to follow the change in the master volume.
  • the music is composed of the support sound reproduced by the user terminal 40 and the other sounds, and therefore the playing of the music is assisted by the user terminal 40.
  • the detected volume also changes so as to follow the change in the master volume as the reproduction time elapses.
  • when the reproduction time is within the period T, the detected volume falls to a level at which it is hardly detected.
  • the support volume based on the varying mask degree changes following the change in the difference dV1 between the detected volume and the master volume, with the support volume based on the fixed mask degree as a reference.
  • the fixed mask degree is the designated mask degree when the designated mask degree has been input, and is otherwise the standard mask degree.
  • the support volume generated with the varying mask degree is higher as the volume of the singing voice is lower than the master volume. Further, the support volume generated with the varying mask degree is lower as the volume of the singing voice is higher than the master volume.
  • the control unit 41A uses the volume obtained by masking the master volume with the fixed mask degree as a reference, varies it by the volume obtained by multiplying the difference dV1 between the detected volume and the master volume by a predetermined value α of 1 or less, and sets the total of these as the support volume.
  • the support volume obtained in this way may be higher or lower than the volume obtained by masking the master volume with a fixed degree of masking.
  • the predetermined value α may be input from the input unit 43 together with the designated mode, the designated mask degree, the designated period, and the like, or may be stored in advance in the data storage unit 44.
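The support-volume rule described above can be sketched numerically. The linear volume model, the default value of alpha, and the function name are assumptions for illustration; the rule itself (reference volume from the fixed mask degree, varied by α·dV1) follows the description.

```python
def support_volume_first_variation(master: float, detected: float,
                                   fixed_mask_degree: float,
                                   alpha: float = 0.5) -> float:
    """Reference = master masked with the fixed mask degree; this is varied
    by the difference dV1 (master - detected) multiplied by alpha <= 1."""
    reference = master * (1.0 - fixed_mask_degree)
    dv1 = master - detected          # positive when the singer is quiet
    return max(reference + alpha * dv1, 0.0)
```

As the description notes, the resulting support volume can be either higher or lower than the volume obtained from the fixed mask degree alone, depending on the sign of dV1.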
  • the music is composed of the support sound generated by the user terminal 40 and the other sounds, and consequently it becomes possible to assist the playing of the music.
  • the support sound is generated at a volume sufficiently lower than the support volume generated with the fixed mask degree. Therefore, it is possible to prevent the support sound reproduced by the user terminal 40 from interfering with the vocal performance by the user.
  • the detected volume also changes so as to follow the change in the master volume as the reproduction time elapses.
  • when the reproduction time is within the period T, the detected volume falls to a level at which it is hardly detected.
  • the support volume based on the varying mask degree is reproduced when the detected volume is smaller than the master volume, at a level corresponding to the difference dV2 between the detected volume and the master volume, and changes following the change in that difference dV2.
  • the support volume based on the varying mask degree increases in the period T to the same level as the master volume.
  • such a support volume is sufficiently suppressed during a period in which the user can sufficiently play vocals.
  • the music is composed of the support sound generated by the user terminal 40 and the other sounds, and consequently the playing of the music can be assisted. In addition, the vocal reproduction by the user terminal 40 is further prevented from hindering the vocal performance by the user.
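The second-variation behavior just described can be sketched directly: the support volume fills the gap dV2 between the detected volume and the master volume, so that the total tracks the master volume. The linear volume model and names are assumptions for illustration.

```python
def support_volume_second_variation(master: float, detected: float) -> float:
    """Support volume at a level corresponding to dV2 = master - detected;
    zero while the user sings at (or above) the master volume."""
    dv2 = master - detected
    return max(dv2, 0.0)

def total_volume(master: float, detected: float) -> float:
    """Detected volume plus support volume; tracks the master volume."""
    return detected + support_volume_second_variation(master, detected)
```

This also shows why the support volume is "sufficiently suppressed" while the user can sing: dV2 is near zero, so almost no support sound is generated.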
  • in the first variation mode and the second variation mode, for example, in a duet music including a female vocal and a male vocal, both vocal masks are used, and even if the detected volume of either one falls to a level at which it is hardly detected, that part can be assisted by the support volume.
  • the following effects can be obtained. (1) Even when the user cannot perform the vocal, the generation of the support sound suppresses a situation in which only the accompaniment sound flows in the space. In addition, since the target sound such as the vocal included in the master music is adopted as the support sound, it is possible to prevent a person listening to the reproduced sound together with the user from feeling uncomfortable with the support sound.
  • the third embodiment differs from the music playback device of the second embodiment in that the volume settings up to the previous reproduction are handled as a history.
  • the following description focuses on the points different from the configurations described in the second embodiment; components having the same functions as those described in the second embodiment are denoted by the same reference numerals, and description thereof is omitted.
  • the control unit 41A transmits setting candidates via the communication unit 42A each time a reproduction of music using the music data is executed.
  • the management unit 210 of the management server 20 functions as a history management unit, and executes a process of storing the setting candidates received from the user terminal 40 in the reproduction result storage unit 25.
  • the playback record storage unit 25 includes the playback date and time, the user code, the music code, and the setting candidate in the playback record data 250.
  • the setting candidates included in the playback record data 250 include data relating to an identifier for specifying the designated mode in the current playback. Further, the setting candidates included in the playback record data 250 include the mask degree used for the reproduction of the support volume, such as the designated mask degree or the standard mask degree. In addition, the setting candidates included in the playback record data 250 include the application period used for the reproduction of the support volume, such as the designated period or the standard period.
  • when the control unit 21 of the management server 20 provides a karaoke file to the user terminal 40, the control unit 21 refers to the playback record data 250 stored in the playback record storage unit 25. The control unit 21 determines whether or not there is a setting candidate corresponding to the user code and the music code related to the current reproduction.
  • when such setting candidates exist, the control unit 21 of the management server 20 includes all the setting candidates corresponding to the user code and the music code related to the current reproduction in the karaoke file, and transmits the file to the user terminal 40.
  • the control unit 21 of the management server 20 transmits the karaoke file including no setting candidate to the user terminal 40.
  • the user terminal 40 includes a history analysis unit 413 for analyzing the setting candidates received from the management server 20.
  • the history analysis unit 413 handles each setting candidate received from the management server 20 as a setting history by the user, and generates a volume setting for the current reproduction from the setting history by the user according to a predetermined algorithm.
  • the history analysis unit 413 extracts the setting that has been adopted most frequently from a plurality of setting candidates, and sets the extracted setting as the current setting. For example, the history analysis unit 413 extracts the designated mode that has been adopted most frequently from the setting candidates, and sets the extracted designated mode as the current designated mode.
  • the history analysis unit 413 determines whether or not there is a continuous designation mode in order from the setting candidate with the latest reproduction date and time, and sets the latest continuous designation mode as the current designation mode. Further, the history analysis unit 413 uses, for example, the mask degrees included in the setting candidates, calculates the average value or mode value thereof as a representative value, and sets the calculated representative value as the current designated mask degree.
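One of the analysis strategies described for the history analysis unit 413 (most frequently adopted mode, and the average of past mask degrees as a representative value) can be sketched as follows. The dict-based candidate record is an assumption for illustration; the embodiment also mentions alternatives such as the latest consecutive mode or the mode value of the mask degrees.

```python
from collections import Counter

def analyze_history(candidates):
    """candidates: list of dicts with 'mode' and 'mask_degree' keys,
    ordered from oldest to latest playback. Returns the setting for the
    current reproduction, or None when there is no history."""
    if not candidates:
        return None
    current_mode = Counter(c["mode"] for c in candidates).most_common(1)[0][0]
    degrees = [c["mask_degree"] for c in candidates]
    representative = sum(degrees) / len(degrees)   # average as representative
    return {"mode": current_mode, "mask_degree": representative}
```

The returned values would then be set provisionally by the volume setting unit 411 and confirmed (or changed) by the user, as in steps S14-4 and S13-3.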
  • the control unit 41A reads music data and a vocal mask from the provided karaoke file (step S13-1). At this time, the control unit 41A determines whether or not setting candidates are included in the karaoke file (step S13-2).
  • the volume setting unit 411 sets the fixed mode as the designated mode adopted by the reproduction signal generating unit 412. Further, the volume setting unit 411 sets the standard mask degree as the mask degree adopted by the reproduction signal generation unit 412 and sets the standard period as the application period adopted by the reproduction signal generation unit 412 (step S13-3).
  • the history analysis unit 413 of the user terminal 40 generates the designated mode, the mask degree, and the application period from the read setting candidates (step S14-3).
  • the volume setting unit 411 temporarily sets the designation mode, the masking degree, and the designation period adopted by the reproduction signal generation unit 412 according to the analysis result by the history analysis unit 413. Further, the volume setting unit 411 prompts the user to confirm the setting in the current reproduction by outputting the temporarily set content to the display unit or the like (step S14-4).
  • the volume setting unit 411 permits the change of the designated mode, the degree of masking, and the designated period by an external operation on the input unit 43 of the user terminal 40. Then, the volume setting unit 411 sets the confirmed contents in the designation mode, the masking degree, and the application period adopted by the reproduction signal generation unit 412 (step S13-3).
  • control unit 41A reproduces the music from the music data included in the karaoke file according to the volume setting by the volume setting unit 411. Then, the control unit 41A transmits the volume setting for the current reproduction to the management server 20 as the next setting candidate (step S14-5).
  • if the music to be played is the same, the volume required as the support sound volume for the current playback and its application period are often the same volume and the same application period as one of the settings made up to the previous playback.
  • since the volume setting unit 411 sets the current volume using the volume settings performed up to the previous playback, the load on the user of inputting a new setting is reduced.
  • the user terminal 40 is not limited to a terminal connected to the management server 20 via a network, and may be a device that is not connected to the network, for example.
  • the user terminal 40 may include the vocal mask generation function of the management server 20. That is, the user terminal may include a functional unit corresponding to the vocal processing unit 211 of the management server 20, and may execute the processing of specifying the vocal range from the music data included in the master file and generating the vocal mask for changing the vocal volume.
  • the reproduction of the music performed by the user terminal 40 is not limited to the process using the music data stored in the music storage unit 24 or the vocal mask.
  • the user terminal 40 may acquire, for example, a music streaming file for reproducing the mixed sound and a masking streaming file for masking the target sound in the mixed sound from the management server 20, and may stream the music using these acquired files.
  • the user terminal 40 in the first embodiment may also have a similar function for performing such streaming playback.
  • the adjustment target sound may be, for example, a sound that is played by two or more musical instruments that constitute an orchestra and are different from each other.
  • the target sound may be included separately in two or more sound ranges associated with the common time index.
  • the user terminal 40 may use a plurality of vocal masks for masking different sound ranges, and a masking degree for each vocal mask.
  • the user terminal 40 may use one vocal mask defined so as to mask different sound ranges, together with one mask degree for each sound range. In this way, if two or more sound ranges include mutually different adjustment target sounds, it is possible to assist the playing of a mixed sound generated from a plurality of sounds each having a different scale and from the other reproduced sounds.
  • a plurality of vocal masks may be adopted when playing one piece of music.
  • each vocal mask and the application period in which it is adopted are associated with each other one by one in the user terminal 40, based on an operation input to the input unit 43 or the like.
  • a dynamic vocal mask such as the third vocal mask is used in an application period in which high accuracy of the mask by the vocal mask is required.
  • a static vocal mask such as the first vocal mask is used in an application period in which the accuracy of the mask by the vocal mask is not required.
  • the user terminal 40 may further include a data processing unit that generates a vocal mask from the music data stored in the data storage unit 44 according to a predetermined algorithm and causes the data storage unit 44 to store the generated vocal mask.
  • the data processing unit includes, for example, a CPU, a ROM, and a RAM, stores an algorithm for generating a vocal mask, and generates a vocal mask from music data input from the input unit 43 according to a predetermined algorithm.
  • the reproduction processing unit receives the processing result of the data processing unit, reads out the music data and the vocal mask from the data storage unit 44, and reproduces the music from the music data using the vocal mask and the mask degree.
  • the application period is not limited to the A melody, the B melody, the chorus, or a chapter; for example, it may be determined in advance as a period in which the vocal range is higher than a predetermined scale, or a period in which the vocal range is lower than a predetermined scale. In setting such a period, first, the vocal range for determining the application period is determined, and the time indexes in which the vocal scale is included in that range are set as the application period.
  • the human voice range is divided into, for example, a high range and a low range.
  • a user whose voice range does not cover the high range requires the support sound in the high range, and a user whose voice range does not cover the low range requires the support sound in the low range.
  • since the application period is determined by the vocal range, the support sound is generated only during the application period in which the support sound is actually required, so that the load can be reduced.
  • if the auxiliary setting in the first variation mode or the second variation mode is performed in such an application period, assistance can be performed at a volume preferable as the support volume even in the application period in which the support sound is actually required.
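Determining an application period from the vocal range, as described above, amounts to collecting the time indexes whose vocal pitch lies above (or below) a predetermined scale. The list-of-tuples data layout and the function name below are assumptions for illustration.

```python
def application_periods(vocal_line, threshold, above=True):
    """vocal_line: list of (time_index, pitch) pairs of the vocal scale.
    Returns the time indexes in which the vocal lies in the targeted
    range (above or below the predetermined scale)."""
    if above:
        return [t for t, pitch in vocal_line if pitch >= threshold]
    return [t for t, pitch in vocal_line if pitch <= threshold]
```

A user who cannot reach the high range would use `above=True` with a high threshold, so that the support sound is generated only at those time indexes.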
  • the user terminal 40 may further include a sound range specifying unit 47 that specifies the sound range of the sound played by the performer from the sound input by the performer to the microphone.
  • the processing for specifying the sound range performed by the sound range specifying unit 47 may be performed before the music is reproduced, or may be sequentially performed during the reproduction of the music.
  • when the sound range specifying unit 47 specifies the range of the sound played by the performer in advance, the volume setting unit 411 may include a mode specifying unit 414 for performing assistance based on the designated mode in sound ranges other than the range specified in advance. The mode specifying unit 414 specifies, for example, among the time indexes associated with the adjustment target sound based on the music data, the time indexes associated with sound ranges other than the sound range specified by the sound range specifying unit 47.
  • when the sound range specifying unit 47 sequentially specifies the sound range of the sound played by the performer during the performance, the volume setting unit 411 may include a mode specifying unit 414 for performing assistance based on the designated mode, based on the sequentially specified sound range.
  • the mode specifying unit 414 repeats, during playback, the determination of whether or not the difference between the performer's range specified by the range specifying unit 47 and the range of the target sound based on the music file is within a predetermined range. When the difference between the performer's range specified by the range specifying unit 47 and the range of the target sound based on the music file is outside the predetermined range, the mode specifying unit 414 sets the mask degree based on the designated mode at that time.
  • the predetermined range, which is the allowable range of the difference between the performer's range specified by the range specifying unit 47 and the range of the adjustment target sound based on the music file, may be input from the input unit 43 in the same manner as the designated mask degree and the like, or may be stored in advance in the data storage unit 44.
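The check repeated by the mode specifying unit 414 can be sketched as follows. Modeling a range as a (low, high) pitch pair and comparing endpoint differences against a single allowed difference are assumptions for illustration, not the embodiment's exact criterion.

```python
def needs_assistance(performer_range, target_range, allowed_diff):
    """performer_range, target_range: (low, high) pitch pairs.
    Returns True when the difference between the performer's range and the
    range of the adjustment target sound exceeds the allowable range, i.e.
    when assistance based on the designated mode should be enabled."""
    lo_diff = abs(performer_range[0] - target_range[0])
    hi_diff = abs(performer_range[1] - target_range[1])
    return max(lo_diff, hi_diff) > allowed_diff
```

During playback this check would run each time the performer's range is re-specified, enabling the designated-mode mask degree only while the mismatch persists.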
  • the range specifying unit 47 specifies the range of the sound played by the performer in advance, and the control unit 41A transmits the specified performer's range to the control unit 21 via the communication of the communication unit 42A.
  • the control unit 21 refers to the data related to the sound range in each piece of music management data 240, and specifies the music management data 240 in which the sound range indicated by the data related to the sound range includes the player's sound range.
  • the control unit 21 transmits the specified music management data 240 to the communication unit 42A as music management data 240 of the recommended music.
  • the control unit 41A may cause the display unit to output a list of recommended songs based on the song management data 240 of the recommended songs received by the communication unit 42A. According to the recommendation of such music, it is possible to recommend to the performer a music suitable for the sound range played by the performer.
  • control unit 41A may transmit to the control unit 21 information related to a scene requested by the performer via communication of the communication unit 42A in addition to the range of the sound played by the performer.
  • the control unit 21 refers to the data relating to the scene in each piece of music management data 240, and specifies the music management data 240 whose scene is included in the scenes requested by the performer.
  • the control unit 21 transmits the specified music management data 240 to the communication unit 42A as music management data 240 of the recommended music. According to the recommendation of such music, it is possible to recommend to the performer a music that is suitable for the sound range played by the performer and suitable for the scene that the performer desires.
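The recommendation flow described above can be sketched as a filter over music management records: following the description, records whose sound range includes the performer's range are selected, optionally narrowed by a requested scene. The dict-based record layout and function name are assumptions for illustration.

```python
def recommend(music_records, performer_range, scene=None):
    """music_records: list of dicts with 'code', 'range' (low, high) and
    'scenes' (list of strings). Returns the codes of recommended music
    whose range includes the performer's range and, when given, whose
    scenes include the requested scene."""
    result = []
    for rec in music_records:
        lo, hi = rec["range"]
        if lo <= performer_range[0] and performer_range[1] <= hi:
            if scene is None or scene in rec["scenes"]:
                result.append(rec["code"])
    return result
```

The management server would transmit the matching music management data 240 to the user terminal, which then displays the list of recommended songs.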
  • the playback determination method executed by the playback determination process is not limited to the method based on the playback determination time.
  • the reproduction determination may be performed based on the position of the chorus in the music.
  • the control unit 21 of the management server 20 specifies the chorus position based on the specific pattern in the main melody estimation process (step S1-3).
  • the specified chorus position is stored in the music management data 240.
  • the playback management unit 214 performs playback determination based on the time index and the chorus position.
  • even if the playback determination time has not elapsed, if the chorus has been played back, it is determined that the music has been played back.
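The chorus-based playback determination can be sketched as a simple predicate: a reproduction counts either when the playback determination time has elapsed, or when playback has passed the stored chorus position even if the determination time has not. Names and the time units are assumptions for illustration.

```python
def counts_as_playback(elapsed, determination_time, chorus_position):
    """elapsed: seconds played so far; determination_time: the playback
    determination time; chorus_position: start time of the chorus stored
    in the music management data 240."""
    return elapsed >= determination_time or elapsed >= chorus_position
```

The playback management unit 214 would evaluate this against the time index during reproduction before incrementing the playback count.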
  • the billing method may be changed according to the set mask degree.
  • a rule for calculating the distribution money based on the volume of the vocal output according to the mask degree of the mask file is stored in the distribution management file 270.
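One plausible form of such a rule is sketched below: each counted playback is weighted by the vocal volume actually output (modeled here as 1 minus the mask degree), and the funds are divided in proportion to the weighted totals. The weighting formula, the data layout, and the function name are assumptions for illustration, not the stored rule itself.

```python
def distribution_amounts(funds, playbacks):
    """playbacks: dict mapping a distribution target person to the list of
    mask degrees used in each counted playback of that person's music.
    Returns the distribution amount per target, in proportion to the
    total output vocal volume."""
    weights = {who: sum(1.0 - d for d in degrees)
               for who, degrees in playbacks.items()}
    total = sum(weights.values())
    if total == 0:
        return {who: 0.0 for who in playbacks}
    return {who: funds * w / total for who, w in weights.items()}
```

Under this sketch, a playback at full mask degree (vocal fully suppressed) contributes nothing, while an unmasked playback contributes a full share.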
  • Music management data, 241 ... HLS file, 242 ... XML file, 243 ... vocal mask, 244 ... music management record, 250 ... playback record data, 251 ... user performance record, 252 ... music performance record, 26 ... stock memory, 411 ... volume setting unit, 412 ... reproduction signal generation unit, 42A ... communication unit, 500, 510, 520, 600, 601, 602, 603 ... scale transition graph, 501 ... first vocal mask, 511 ... second vocal mask, 521 ... third vocal mask, 700 ... XML file.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Business, Economics & Management (AREA)
  • Acoustics & Sound (AREA)
  • Tourism & Hospitality (AREA)
  • General Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Resources & Organizations (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • Theoretical Computer Science (AREA)
  • Primary Health Care (AREA)
  • Marketing (AREA)
  • Economics (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Reverberation, Karaoke And Other Acoustics (AREA)

Abstract

This service management system is provided with a control unit that: specifies the range of sounds to be modified in a master file stored in a master storage unit; creates a mask file for adjusting the volume of the sounds individually within a mix of sounds on the basis of the specified range; provides a user terminal with sound modified content including the mask file and stores a number of playbacks into a playback result storage unit; and, on the basis of funds stored in a fund storage unit and the number of playbacks recorded in the playback result storage unit, calculates payments to be distributed to eligible parties for the modified sound.

Description

Service management system, computer-readable storage medium, and service management method
The present invention relates to a service management system, a computer-readable storage medium, and a service management method for managing a service that provides music.
 As an example of a service that provides music, a karaoke service that plays accompaniment sounds is known (see, for example, Patent Document 1). The sound source used in such a karaoke service is not the music master, which holds mixed-sound data containing both the accompaniment and the vocals, but data newly created for karaoke that consists of the accompaniment alone.
JP 2002-99290 A
 However, because the sound newly created for karaoke differs from the sound based on the original music master, it is difficult with the above technique to satisfy users who expect the sound of the master. Moreover, since the performers and others who produced the master receive no share of the revenue from the content providing service, it is also difficult to spread music through such a service.
 These problems are not limited to karaoke services that play accompaniment sounds; they are common to any technique that reproduces mixed sound from music data, such as concert and theater performances.
 A service management system for solving the above problem includes: a master storage unit storing a master file containing data for reproducing a mixed sound, the mixed sound including an adjustment target sound whose volume is to be adjusted and other sounds; a resource storage unit storing the funds of a service that provides sound adjustment content to a user terminal; a playback result storage unit storing the number of times the sound adjustment content is played by the user terminal; and a control unit connected to the user terminal. The control unit specifies the range of the adjustment target sound in the master file stored in the master storage unit; generates, based on the specified range, a mask file for individually adjusting the volume of the adjustment target sound within the mixed sound; provides the sound adjustment content including the mask file to the user terminal; stores the playback count in the playback result storage unit; and calculates, based on the funds stored in the resource storage unit and the playback count stored in the playback result storage unit, the amount to be distributed to each distribution recipient for the adjustment target sound.
 A computer-readable storage medium for solving the above problem stores a program for performing service management using a service management system that includes the master storage unit, the resource storage unit, the playback result storage unit, and the control unit connected to the user terminal described above. When the program is executed, the control unit functions to specify the range of the adjustment target sound in the master file stored in the master storage unit; generate, based on the specified range, a mask file for individually adjusting the volume of the adjustment target sound within the mixed sound; provide the sound adjustment content including the mask file to the user terminal; store the playback count in the playback result storage unit; and calculate, based on the funds stored in the resource storage unit and the playback count stored in the playback result storage unit, the amount to be distributed to each distribution recipient for the adjustment target sound.
 A service management method for solving the above problem performs service management using a service management system that includes the same master storage unit, resource storage unit, playback result storage unit, and control unit connected to the user terminal. In the method, the control unit specifies the range of the adjustment target sound in the master file stored in the master storage unit; generates, based on the specified range, a mask file for individually adjusting the volume of the adjustment target sound within the mixed sound; provides the sound adjustment content including the mask file to the user terminal; stores the playback count in the playback result storage unit; and calculates, based on the funds stored in the resource storage unit and the playback count stored in the playback result storage unit, the amount to be distributed to each distribution recipient for the adjustment target sound.
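The flow recited above (specify the range, generate a mask, provide the content, count playbacks, distribute the funds) can be sketched in outline. This is a minimal illustration only; all names and data shapes are hypothetical, and in the described system specifying the range involves analyzing the master file rather than reading a stored value.

```python
# Minimal sketch of the claimed control flow. All names and data
# shapes here are hypothetical illustrations, not the actual
# implementation.

def specify_range(master_file):
    """Specify the range of the adjustment target sound (stub)."""
    # The real system analyzes the master file; here we simply
    # assume the range is stored alongside it.
    return master_file["vocal_range"]

def generate_mask(sound_range):
    """Generate a mask entry from the specified range."""
    low, high = sound_range
    return {"low": low, "high": high}

def distribute(funds, play_counts):
    """Split the funds across songs in proportion to play counts."""
    total = sum(play_counts.values())
    return {song: funds * n / total for song, n in play_counts.items()}

master = {"vocal_range": (200.0, 4000.0)}     # hypothetical master file
mask = generate_mask(specify_range(master))   # mask file for the vocal
shares = distribute(10000, {"S001": 3, "S002": 1})  # funds x play counts
```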
  • Block diagram showing a first embodiment of the service management system.
  • Diagram of the data stored in the master storage unit.
  • Diagram of the data stored in the lyrics information storage unit.
  • Diagram of the data stored in the music storage unit.
  • Diagram of the data stored in the playback result storage unit.
  • Diagram of the data stored in the resource storage unit.
  • Diagram of the data stored in the distribution management information storage unit.
  • Diagram of the data stored in the distribution result information storage unit.
  • Diagram showing a processing procedure of the service management method.
  • Diagram showing a processing procedure of the service management method.
  • Diagram showing the procedure of a first mask generation process.
  • Graph showing the range of a mask file.
  • Diagram showing the procedure of a second mask generation process.
  • Graph showing the range of a mask file.
  • Diagram showing the procedure of a third mask generation process.
  • Graph showing the range of a mask file.
  • Graph showing the range of a mask file.
  • Flowchart showing the procedure of synchronized lyrics processing.
  • Diagram showing the structure of an XML file.
  • Diagram showing the procedure of a playback determination process.
  • Diagram showing the procedure of distribution management processing.
  • Diagram showing the procedure of distribution processing for flat-rate songs.
  • Diagram explaining the calculation of play counts for group playback.
  • Diagram showing the processing procedure in group playback management.
  • Diagram showing the processing procedure in lyrics synchronization adjustment.
  • Block diagram showing a second embodiment of the service management system.
  • Diagram showing a processing procedure of the service management method.
  • Graph showing the range in the fixed mode.
  • Graph showing the range in a first variation mode.
  • Graph showing the range in a second variation mode.
  • Block diagram showing a third embodiment of the service management system.
  • Diagram showing a processing procedure of the service management method.
  • Block diagram showing the structure of a modified user terminal.
 (First embodiment)
 Hereinafter, a first embodiment of the service management system will be described. The service management system provides a master karaoke service, which is an example of a content providing service. Using a music master provided by a record company or the like, the system provides a mixed sound in which the volume of the vocal is adjusted individually relative to the other sounds. The vocal is the singing voice of a singer, who is an example of a performer, and is an example of the adjustment target sound.
 The music master contains data for reproducing a mixed sound that includes vocals and other sounds. It may be a music master file, or a box-format file containing both a music master file and a video master file, such as a promotion video.
 As shown in FIG. 1, the content server 10 is a computer system that provides master files. A master file is content that provides the data for reproducing the mixed sound, for example a digital file such as a promotion video created in the mp4 format.
 The person-in-charge terminal 30 is a computer terminal used to edit the data for reproducing music, namely adjusting the timing at which telops such as lyrics are displayed and adjusting the vocal mask. The terminal 30 includes a control unit, an output unit, and an input unit. The output unit, for example a display, outputs various information. The input unit, composed of a keyboard and a pointing device, receives instructions entered by the person in charge.
 The user terminal 40 is a computer terminal used by users of the karaoke service. It is either a business (BtoB) device, such as a computer terminal installed in a karaoke box, or a consumer (BtoC) device, such as a smartphone or personal computer owned by an individual user.
 The user terminal 40 downloads from the management server 20 a karaoke file containing a music file and a video file, as an example of sound adjustment content, that is, content for adjusting the volume of the adjustment target sound. The karaoke file includes a vocal mask file as an example of a mask file.
 The user terminal 40 includes a music playback unit 41 and a vocal volume adjustment unit 42. The music playback unit 41 plays music and video using the downloaded music and video files. The vocal volume adjustment unit 42 individually adjusts the volume of the vocal within the mixed sound based on the vocal mask file; this includes muting the vocal entirely. The vocal mask file is a file for specifying the vocal range contained in the music.
 The management server 20 is a computer system used by the karaoke service provider to provide karaoke files. It includes a control unit 21, a master storage unit 22, a lyrics information storage unit 23, a music storage unit 24, a playback result storage unit 25, a resource storage unit 26, a distribution management information storage unit 27, and a distribution result information storage unit 28.
 The control unit 21 includes control means composed of a CPU, RAM, ROM, and the like, and performs the processing of a management stage, a vocal processing stage, a lyrics management stage, an editor stage, a playback management stage, and a distribution management stage. By executing the service management program, it functions as a management unit 210, a vocal processing unit 211, a lyrics management unit 212, an editor 213, a playback management unit 214, and a distribution management unit 215.
 The management unit 210 stores master files acquired from the content server 10 in the master storage unit 22.
 The vocal processing unit 211 specifies the vocal range and generates the vocal mask file used to adjust the vocal volume.
 The lyrics management unit 212 adds lyrics to the file so that the timing at which the lyrics are displayed matches the timing at which the vocals are played.
 The editor 213 executes processing for adjusting the vocal mask and for supporting adjustment of the positions at which lyrics are added. The editor 213 is connected to the person-in-charge terminal 30 used by the person in charge of editing, who uses the terminal to check the vocal mask settings and the lyric positions, and to adjust them when they are not appropriate. The vocal mask settings are the range masked by the vocal mask and the playback times for which the mask is set.
 The playback management unit 214 manages the playback status of songs on the user terminal 40. It holds information for calculating the playback determination time, which is the time used to determine whether a song has been played.
 The distribution management unit 215 distributes the funds to each distribution recipient according to the playback status of the music. Distribution recipients are, for example, copyright management organizations (such as JASRAC (registered trademark)), performers, and record companies. The distribution management unit 215 holds data on the distribution rates used to determine each party's share.
 Next, the information stored in each storage unit will be described.
 As shown in FIG. 2, the master storage unit 22 stores master files 220 such as promotion videos. When the management server 20 acquires a master file 220 from the content server 10, it stores that file. A master file 220 contains data on the song code, attributes, and compressed video.
 The song code data area stores an identifier for the song. The attribute data area stores data on the song's attributes, that is, information on the record company, performer, composer, lyricist, and so on. The compressed video data area stores compressed music and video data, created in the mp4 format, for example.
 As shown in FIG. 3, the lyrics information storage unit 23 stores lyrics files 230. A lyrics file 230 is stored when lyrics information is registered in the management server 20, and contains data on the song code and the lyrics.
 The song code data area stores an identifier for the song. The lyrics data area stores lyrics information such as lyric text and ruby annotations, which function as telops corresponding to the vocals.
 As shown in FIG. 4, the music storage unit 24 stores music management data 240 to be provided to user terminals 40. The music management data 240 is stored when karaoke data is generated from a music master.
 The music management data 240 is associated with a song code and includes an HLS file 241, an XML file 242, a vocal mask file 243, and a music management record 244. The song code data area stores an identifier for the song.
 The HLS file 241 contains data obtained by transcoding the compressed video of the master. It includes data in the HD format for high-speed lines and data in the SD format for low-speed lines.
 The XML file 242 stores the song's attribute information and lyrics synchronization information that associates lyrics with time indexes, that is, playback times. It includes song structure information associating each structural element of the song (the A melody, B melody, and C melody) with a time index, and chapter information associating each chapter of a medley with a time index. Specifically, the XML file 242 contains the following elements:
 ・lyrics: specifies the version of the lyrics file.
 ・header: stores the song's meta information and style definitions.
 ・song_meta: stores the song's meta information.
 ・style: stores style definitions and references.
 ・lyrics_track: stores the body of the lyrics.
 ・verse: stores a break in the lyrics.
 ・page: defines one page of lyrics and its display and erase timing.
 ・line: stores the lyrics definition for one line.
 ・ruby: stores ruby definitions.
 ・word: defines lyric text and wipe processing timing.
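The element hierarchy listed above can be illustrated by generating a small fragment. This is a hypothetical sketch of the described structure; the tag names follow the element list, while the attribute names (version, show, hide, wipe_start, wipe_end) are illustrative assumptions, not the actual schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical sketch of the lyrics XML described above. Tag names
# follow the element list; attribute names are illustrative only.
lyrics = ET.Element("lyrics", version="1.0")
header = ET.SubElement(lyrics, "header")
ET.SubElement(header, "song_meta", title="Example Song")
ET.SubElement(header, "style", id="default")
track = ET.SubElement(lyrics, "lyrics_track")
verse = ET.SubElement(track, "verse")
page = ET.SubElement(verse, "page", show="12.0", hide="18.5")
line = ET.SubElement(page, "line")
word = ET.SubElement(line, "word", wipe_start="12.3", wipe_end="13.1")
word.text = "example"                  # lyric text for one word
ET.SubElement(word, "ruby").text = "eg"  # ruby annotation for the word

xml_text = ET.tostring(lyrics, encoding="unicode")
```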
 The vocal mask file 243 stores the vocal range in association with time indexes, and is used to identify the vocal line.
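One way to picture the vocal mask file 243 is as a sequence of time-indexed range entries that are looked up at playback time, with a gain applied to the masked range. A minimal sketch, with hypothetical field names:

```python
# Hypothetical sketch of the vocal mask file 243: each entry pairs a
# time index (seconds) with the vocal range active at that time.
# Field names are illustrative assumptions only.
vocal_mask = [
    {"time": 12.0, "low_hz": 200.0, "high_hz": 1200.0},
    {"time": 12.5, "low_hz": 220.0, "high_hz": 1100.0},
]

def mask_at(mask, t):
    """Return the mask entry in effect at playback time t, or None
    before the first entry (entries are in time order)."""
    current = None
    for entry in mask:
        if entry["time"] <= t:
            current = entry
        else:
            break
    return current

def clamp_gain(volume):
    """Gain applied to the masked range: 0.0 mutes the vocal, and
    values between 0.0 and 1.0 leave it audible as a guide."""
    return max(0.0, min(volume, 1.0))
```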
 Adjusting the volume of the vocal range means using the vocal mask file 243 to remove the vocal from the mixed sound or to individually lower its volume. For example, a user can lower the vocal volume and use the remaining vocal as a guide.
 The music management record 244 holds data on the performer, voice quality, tone, performance time, video usage, medley attribute, status, scene, and range.
 The performer data area stores an identifier, such as a singer's name, for identifying a performer.
 The voice quality data area stores feature data for the singer's voice, for example statistical values calculated from the main melody.
 The tone data area stores data on the tempo and rhythm of the song, for example feature values that are statistical values calculated from the song.
 The scene data area stores information for identifying, for example, the season associated with the song, scenery such as the sea, mountains, or sky evoked by the song, and emotions such as joy, sadness, or resentment it evokes.
 The range data area stores data on the vocal range the song requires.
 The performance time data area stores the performance time, that is, the time required to play the song from beginning to end.
 The video usage data area stores a flag indicating whether the music master uses video in the song's HLS file 241.
 The medley attribute data area stores information identifying whether the song is a medley. If it is, the area stores the song codes of the songs that make up the medley.
 The status data area stores a flag specifying the creation status of the song's HLS file 241 and XML file 242: a "pre-editing" flag for files immediately after registration, and an "available" flag for edited files.
 As shown in FIG. 5, the playback result storage unit 25 stores playback result data 250 on the usage of karaoke files. The data is recorded whenever the user terminal 40 uses a karaoke file, and comprises user result records 251 and song result records 252.
 A user result record 251 contains data on the playback date and time, user code, individual user code, and song code.
 The playback date/time data area stores the date and time at which the song was played on the user terminal 40.
 The user code data area stores an identifier for the user who played the song.
 The individual user code data area stores an identifier for the individual who played the song.
 The song code data area stores an identifier for the song that was played.
 A song result record 252 contains data on the song code and play count.
 The song code data area stores an identifier for the song that was played.
 The play count data area stores the total number of times the song has been played.
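The relationship between the two record types can be sketched as a simple aggregation: each playback appends a user result record, and the song result records hold the per-song totals. A hypothetical illustration with made-up codes:

```python
from collections import Counter

# Hypothetical user result records 251; codes and timestamps are
# illustrative only.
user_records = [
    {"played_at": "2016-06-01 20:15", "user": "U01", "song": "S001"},
    {"played_at": "2016-06-01 20:21", "user": "U02", "song": "S001"},
    {"played_at": "2016-06-02 19:05", "user": "U01", "song": "S002"},
]

# Aggregate into song result records 252 (song code -> total plays).
song_play_counts = Counter(record["song"] for record in user_records)
```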
 As shown in FIG. 6, the resource storage unit 26 stores a resource management file 260 on the karaoke usage fees collected from users; these fees are an example of the funds. The file is recorded when a user registers for the karaoke service and contains data on the user code, contract form, and information fee.
 The user code data area stores an identifier for the user who pays the usage fee.
 The contract form data area stores a flag specifying the contract form with the user: for example, a consumer contract such as per-song billing or flat-rate (all-you-can-sing) payment, or a business contract such as a flat-rate payment per user terminal 40.
 The information fee data area stores the amount paid when the service is used, determined by the contract form with the user.
 As shown in FIG. 7, the distribution management information storage unit 27 stores a distribution management file 270 for determining each recipient's share of the funds. The file is recorded when payment conditions are registered with each recipient and contains data on the song code, payee, and distribution conditions.
 The song code data area stores an identifier for the song.
 The payee data area stores data on the payees of distributions for the song, for example performers and record companies, which are examples of distribution recipients.
 The distribution condition data area stores data on how the distribution paid to each payee is calculated. When the contract form is per-song billing, it stores a percentage of the billed fee as the payment condition. When the contract form is flat-rate payment, it stores a method of calculating the distribution from the distributable funds.
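The two distribution conditions can be sketched as follows; the rates and amounts are illustrative assumptions only, not values from the specification.

```python
# Hypothetical sketch of the two distribution conditions described
# above; rates and amounts are illustrative assumptions.

def per_song_distribution(fee, rate):
    """Per-song billing: the payee receives a fixed percentage of
    the fee billed for each play."""
    return fee * rate

def flat_rate_distribution(funds, plays, total_plays):
    """Flat-rate contract: the distributable funds are prorated by
    each song's share of the total play count."""
    return funds * plays / total_plays
```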
 As shown in FIG. 8, the distribution result information storage unit 28 stores distribution result records 280 on the distributions made to each recipient. A record is stored when a payment is made and contains data on the distribution date, distribution destination, and distribution amount.
 The distribution date data area stores the date on which the funds were distributed.
 The distribution destination data area stores an identifier specifying the recipient of the distribution, such as a performer or record company.
 The distribution amount data area stores the amount distributed to the recipient.
Next, operations performed by the service management system will be described.
(Master karaoke data generation process)
The master karaoke data generation process will be described with reference to FIG.
First, the control unit 21 executes a process of acquiring a promotion video, which is an example of a master file (step S1-1).
Specifically, the management unit 210 acquires the promotion video from the content server 10 as an mp4 file, together with the video's attribute information, and registers both in the master storage unit 22.
Next, the control unit 21 executes a transcoding process (step S1-2). Specifically, the management unit 210 converts the mp4 file registered in the master storage unit 22 into HLS files: an HD version for high-speed lines and an SD version for low-speed lines. The management unit 210 then registers the generated HLS files in the music storage unit 24 in association with the song code.
The control unit 21 also executes a main melody estimation process (step S1-3). Specifically, the vocal processing unit 211 converts the mp4 file into a predetermined file format and estimates the main melody of the song, such as the vocal line.
Next, the control unit 21 executes a vocal mask file creation process (step S1-4). Specifically, the vocal processing unit 211 identifies the vocal range based on the estimated main melody.
The vocal processing unit 211 also creates a vocal mask file that stores the identified vocal range, and stores it in the music storage unit 24 in association with the song code.
Next, the control unit 21 executes a voice quality registration process (step S1-5). Specifically, the vocal processing unit 211 calculates feature quantities of the voice quality of the vocals contained in the song, and registers the calculated voice quality in the music management record 244 stored in the music storage unit 24.
The control unit 21 also executes a lyrics allocation process (step S1-6). Specifically, the lyrics management unit 212 acquires, from the lyrics information storage unit 23, the lyrics file 230 in which the song code of the song being processed is stored.
The lyrics management unit 212 associates the lyrics with the playback times corresponding to the vocal range identified by the vocal processing unit 211, generates the XML file 242, a lyrics synchronization file in which the lyrics are allocated, and registers it in the music storage unit 24.
Next, the control unit 21 executes an editing process (step S1-7). Specifically, once the voice quality registration and lyrics allocation processes have finished, the management unit 210 stores a "before editing" flag in the music management record 244. The editor 213 then provides the created HLS file 241, XML file 242, and vocal mask file 243 to the person-in-charge terminal 30.
The person-in-charge terminal 30 plays the karaoke file according to the operations of the person in charge of editing and checks, for example, whether the vocal volume is changed by the vocal mask and whether the vocals and the lyrics are synchronized.
If the vocal adjustment by the vocal mask is insufficient, or if the vocals and the lyrics are out of sync, the editor 213 corrects the XML file 242 and the vocal mask file 243 according to the operations of the person in charge. When the editor 213 receives editing-complete information from the person-in-charge terminal 30, it stores an "available for provision" flag in the music management record 244.
(Vocal mask file generation process)
The vocal mask file generation process will be described with reference to FIG.
First, the vocal processing unit 211 acquires singer information stored in the attribute data area of the master file 220 (step S2-1).
When the song being processed is a solo song, the vocal processing unit 211 acquires the singer information of that singer. When the song being processed is a duet, the vocal processing unit 211 acquires the singer information of each singer.
Next, the vocal processing unit 211 identifies the tone of the song being processed. The vocal processing unit 211 extracts from the music storage unit 24 the music management records 244 whose singer data area stores the singer being processed, and from among them identifies songs whose tone is similar to that of the song being processed as similar songs (step S2-2).
Next, the control unit 21 executes a process for identifying the voice quality of the singer being processed (step S2-3). This identification uses the following two methods.
First, the control unit 21 executes a process that identifies the voice quality from similar songs (step S3-1). Specifically, the vocal processing unit 211 acquires the vocal range, that is, the singer's voice quality, stored in the music management records 244 of the similar songs.
The control unit 21 also executes a process that identifies the voice quality from solo passages (step S3-2). Specifically, the vocal processing unit 211 identifies, within the audio contained in the HLS file 241, solo regions that contain only the voice of the singer being processed, and identifies the vocal range within those regions.
Next, the control unit 21 executes a first mask generation process (step S2-4). Using the identified vocal range, the control unit 21 generates a vocal mask file that covers that vocal range throughout the entire song.
Next, the control unit 21 determines whether the vocal mask is sufficient (step S2-5). Specifically, the vocal processing unit 211 plays the song with the vocal volume lowered by the generated vocal mask and checks whether the singer's singing voice can still be recognized.
The vocal processing unit 211 also checks whether the main melody of the song is distorted between before and after application of the vocal mask. If the singing voice can no longer be recognized and the main melody is not distorted, the vocal processing unit 211 determines that the vocal mask is sufficient.
If the vocal processing unit 211 determines that the vocal mask is sufficient ("YES" in step S2-5), the control unit 21 ends the vocal mask file generation process for the singer being processed.
If the vocal processing unit 211 determines that the vocal mask is insufficient ("NO" in step S2-5), the control unit 21 executes a second mask generation process (step S2-6). In the second mask generation process, the control unit 21 identifies a vocal range for each vocal phrase and generates a vocal mask file corresponding to each of these ranges.
Next, as in step S2-5, the control unit 21 determines whether the vocal mask is sufficient (step S2-7).
If the vocal processing unit 211 determines that the vocal mask is sufficient ("YES" in step S2-7), the control unit 21 ends the vocal mask file generation process for the singer being processed.
If the vocal processing unit 211 determines that the vocal mask is insufficient ("NO" in step S2-7), the control unit 21 executes a third mask generation process (step S2-8). In the third mask generation process, the control unit 21 estimates the vocal range at each predetermined time interval and generates a vocal mask file that follows the dynamically identified vocal range. The control unit 21 repeats the above processing for every singer included in the song being processed.
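The three-stage fallback above (steps S2-4 through S2-8) can be sketched as follows. This is a hypothetical illustration: the per-granularity vocal ranges are static stub data, and `is_sufficient` stands in for the audibility and distortion check of steps S2-5 and S2-7.

```python
# Hypothetical sketch of the mask-generation fallback (steps S2-4 to
# S2-8). Real mask generators would analyze the audio; here the
# per-granularity vocal ranges are supplied as stub data.

def generate_vocal_mask(song, is_sufficient):
    """Try progressively finer masks until the sufficiency check passes."""
    # First mask: one range for the whole song (step S2-4).
    mask = {"granularity": "song", "ranges": [song["vocal_range"]]}
    if is_sufficient(mask):                       # step S2-5
        return mask
    # Second mask: one range per phrase (step S2-6).
    mask = {"granularity": "phrase",
            "ranges": [p["vocal_range"] for p in song["phrases"]]}
    if is_sufficient(mask):                       # step S2-7
        return mask
    # Third mask: one range per fixed time step (step S2-8),
    # used as the final fallback.
    return {"granularity": "timestep",
            "ranges": [t["vocal_range"] for t in song["timesteps"]]}

song = {
    "vocal_range": (200, 1000),
    "phrases": [{"vocal_range": (200, 900)}, {"vocal_range": (250, 1000)}],
    "timesteps": [{"vocal_range": (200 + 10 * i, 900 + 10 * i)}
                  for i in range(4)],
}

# Suppose the whole-song mask fails the check but the per-phrase mask passes.
mask = generate_vocal_mask(song, lambda m: m["granularity"] == "phrase")
print(mask["granularity"])  # phrase
```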
<First mask generation process>
The first mask generation process will be described with reference to FIG.
The vocal processing unit 211 estimates that the singer's voice quality, identified from similar songs or solo passages, is the vocal range of this song (step S4-1).
Next, the vocal processing unit 211 generates a vocal mask file for masking this single vocal range (step S4-2).
As shown in the time-series graph 500 in FIG. 12, a vocal mask 501 that masks a single vocal range is generated across the entire song. In the graph, the horizontal axis indicates time and the vertical axis indicates pitch range; the upper thick line shows the range of the guitar part and the lower broken line shows the range of the drum part.
<Second Mask Generation Process>
Next, the second mask generation process will be described.
As shown in FIG. 13, the vocal processing unit 211 identifies, based on the vocal line, the playback time zones that contain vocals, and identifies each contiguous region of singing as a phrase (step S5-1).
The vocal processing unit 211 then identifies a vocal range for each phrase based on the vocal line (step S5-2).
Next, the vocal processing unit 211 generates a vocal mask file for masking the vocal range estimated for each phrase (step S5-3).
As shown in the time-series graph 510 in FIG. 14, vocal masks 511 and 512 are generated, one per phrase. As before, the horizontal axis indicates time and the vertical axis indicates pitch range; the upper thick line shows the range of the guitar part and the lower broken line shows the range of the drum part.
<Third mask generation process>
Next, the third mask generation process will be described with reference to FIG.
First, the vocal processing unit 211 divides the song into time steps of a predetermined length (step S6-1). The control unit 21 then selects the time steps to be processed in sequence and repeats the following processing.
The control unit 21 identifies the vocal range contained in the selected time step (step S6-2), and then generates a vocal mask file that masks the identified vocal range (step S6-3).
As shown in the time-series graph 520 in FIG. 16, vocal masks 521 and 522 are generated, one per time step; their mask regions change dynamically to follow the vocal range. As before, the horizontal axis indicates time and the vertical axis indicates pitch range; the upper thick line shows the range of the guitar part and the lower broken line shows the range of the drum part.
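As a concrete illustration, a per-time-step mask of this kind can be represented as one (low, high) frequency range per step and applied by attenuating the matching region of a time-frequency grid. The bin layout and the fixed attenuation factor below are illustrative assumptions, not details taken from the embodiment.

```python
import numpy as np

# Illustrative sketch only: apply a per-time-step vocal mask to a toy
# time-frequency magnitude grid by attenuating the masked band.

N_STEPS, N_BINS = 4, 8        # time steps x frequency bins
HZ_PER_BIN = 250              # assumed width of each frequency bin

spec = np.ones((N_STEPS, N_BINS))   # flat toy "spectrogram"

# One (low_hz, high_hz) vocal range per time step, as the third
# mask generation process would produce.
mask_ranges = [(250, 750), (500, 1000), (500, 1250), (250, 500)]

masked = spec.copy()
for t, (lo, hi) in enumerate(mask_ranges):
    lo_bin, hi_bin = lo // HZ_PER_BIN, hi // HZ_PER_BIN
    masked[t, lo_bin:hi_bin] *= 0.1   # attenuate the vocal band only

print(masked[0])  # step 0: bins 1 and 2 attenuated, the rest untouched
```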
(Duet handling process)
Next, processing for duet songs will be described with reference to FIG. 17.
The time-series graph 600 shows an example containing a female vocal and a male vocal. Graph 601 shows the female vocal's volume lowered individually using the female vocal mask file; graph 602 shows the male vocal's volume lowered individually using the male vocal mask file; and graph 603 shows both vocals' volumes lowered individually using both the female and male vocal mask files.
(Synchronized lyrics processing)
Next, synchronized lyrics processing will be described with reference to FIG.
First, the lyrics management unit 212 extracts from the lyrics information storage unit 23 the lyrics file 230 in which the song code of the song being processed is stored, and acquires the lyric phrases to be allocated to the song (step S7-1).
Next, based on the vocal line, the lyrics management unit 212 estimates the singing sections, which contain a singing voice, and the non-singing sections, which do not. Using the singing sections, the lyrics management unit 212 identifies the start time and end time of the phrase (step S7-2).
Next, the lyrics management unit 212 generates, in the XML file, a description that assigns the characters and their ruby annotations to the start and end positions of the phrase (step S7-3).
The control unit 21 then determines whether all the lyrics have been allocated (step S7-4).
If the control unit 21 determines that unallocated lyrics remain ("NO" in step S7-4), it repeats the processing from the identification of the lyrics to be allocated (step S7-1) onward.
If it determines that all the lyrics have been allocated ("YES" in step S7-4), the control unit 21 ends the synchronized lyrics process.
The XML file 700 shown in FIG. 19 is generated by the above processing. Description 701 contains song metadata on the lyricist, composer, arranger, and so on stored in the attribute data area of the master file 220. Description 702 contains song metadata on the tempo stored in the same attribute data area.
Description 703 is for wiping "8月" (August), annotated with the ruby "はちがつ", over a predetermined period. Description 704 is for wiping "太陽" (sun), annotated with the ruby "たいよう", over a predetermined period.
Description 705 is for wiping "の" over a predetermined period, and description 706 is for wiping "ように" over a predetermined period. Description 707 changes the page displayed on a single screen during karaoke playback.
In this way, an XML file for displaying the lyrics in step with the progress of the song is generated.
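The synchronized-lyrics flow can be illustrated with a small sketch that emits one wipe entry per lyric fragment, using the start and end times estimated from the vocal line. The tag and attribute names here are invented for illustration; the actual schema of the XML file 700 is not reproduced in the text.

```python
# Hypothetical sketch of step S7-3: emit one wipe entry per lyric
# fragment. Tag and attribute names are made up for illustration.

fragments = [
    {"text": "8月",  "ruby": "はちがつ", "start": 12.0, "end": 13.2},
    {"text": "太陽", "ruby": "たいよう", "start": 13.2, "end": 14.0},
    {"text": "の",   "ruby": None,       "start": 14.0, "end": 14.3},
]

def wipe_entry(f):
    # Ruby (reading) annotations are optional, as for "の" above.
    ruby = f' ruby="{f["ruby"]}"' if f["ruby"] else ""
    return (f'<wipe start="{f["start"]:.1f}" end="{f["end"]:.1f}"'
            f'{ruby}>{f["text"]}</wipe>')

xml_body = "\n".join(wipe_entry(f) for f in fragments)
print(xml_body)
```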
(Playback determination process)
The playback determination process will be described with reference to FIG. 20. This process is performed when a karaoke file is played on the user terminal 40.
First, the playback management unit 214 provides a karaoke file to the user terminal 40 and instructs it to play the file (step S8-1).
Next, the playback management unit 214 acquires the playback status from the user terminal 40 and determines whether playback has ended (step S8-2). Playback is judged to have ended either when the entire melody of the song has finished or when the song was stopped partway through.
Next, the playback management unit 214 calculates the playback time from the start of the song to the time at which the end of playback was detected (step S8-3).
The playback management unit 214 also extracts from the music storage unit 24 the music management record 244 in which the song code of the played song is stored, and acquires from it the length of the song, that is, its performance time (step S8-4).
Next, the playback management unit 214 determines the playback determination time based on the performance time (step S8-5). For a song whose performance time is less than 60 seconds, the playback management unit 214 uses 2/3 of the performance time, the criterion ratio, as the playback determination time. For a song whose performance time is 60 seconds or longer, it uses 40 seconds.
Next, the playback management unit 214 determines whether the playback determination time has elapsed (step S8-6).
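The rule in step S8-5 can be written out as a small helper. This is a sketch under the thresholds stated above (2/3 of the performance time for songs under 60 seconds, a flat 40 seconds otherwise); the function names are hypothetical.

```python
# Sketch of steps S8-5 and S8-6: decide whether a playback counts.

def playback_threshold(duration_sec: float) -> float:
    """Playback determination time for a song of the given length."""
    if duration_sec < 60:
        return duration_sec * 2 / 3   # criterion ratio for short songs
    return 40.0                       # flat threshold for longer songs

def counts_as_play(played_sec: float, duration_sec: float) -> bool:
    """True once the playback determination time has elapsed."""
    return played_sec >= playback_threshold(duration_sec)

print(counts_as_play(35, 45))    # 35 s of a 45 s song: threshold is 30 s
print(counts_as_play(35, 180))   # 35 s of a 180 s song: threshold is 40 s
```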
If the playback time has passed the playback determination time ("YES" in step S8-6), the playback management unit 214 generates a user result record 251 and registers it in the playback record storage unit 25. It also increments the play count in the song result record 252 for the song (step S8-7).
For a song played for the first time, the playback management unit 214 generates a new song result record 252, registers it in the playback record storage unit 25, and sets its play count to the initial value of 1.
If the playback time has not passed the playback determination time ("NO" in step S8-6), the playback management unit 214 skips the play counting process (step S8-7) and ends the playback determination process.
(Distribution management processing)
Next, the distribution management process will be described with reference to FIG. 21. The distribution management process is executed at predetermined intervals, for example once a month.
First, the distribution management unit 215 identifies, from the distribution conditions in the distribution management file 270 stored in the distribution management information storage unit 27, the songs subject to individual billing (step S9-1).
Next, the distribution management unit 215 calculates the distributions for these songs based on the individual rules stored in the distribution management file 270 (step S9-2).
The distribution management unit 215 also identifies medley songs based on the medley attribute stored in the music management records 244 of the music storage unit 24 (step S9-3).
Next, the distribution management unit 215 acquires, from the medley attribute data area of each medley's music management record 244, the song codes of the medley's constituent songs, that is, the individual songs that make up the medley (step S9-4).
Next, the distribution management unit 215 calculates a distributed count by dividing the medley's play count by the number of constituent songs, and adds this distributed count to the play count in the song result record 252 of each constituent song (step S9-5).
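Steps S9-4 and S9-5 amount to splitting a medley's play count evenly among its constituent songs. The sketch below is illustrative; the text does not specify how fractional counts are rounded, so they are kept as-is here.

```python
# Hypothetical sketch of steps S9-4 and S9-5: credit each constituent
# song with an even share of the medley's play count.

play_counts = {"A": 100.0, "B": 50.0, "C": 0.0, "M": 30.0}  # "M" is a medley
medley_constituents = {"M": ["A", "B", "C"]}

for medley, parts in medley_constituents.items():
    share = play_counts[medley] / len(parts)   # 30 plays / 3 songs = 10 each
    for code in parts:
        play_counts[code] += share

print(play_counts)  # A: 110.0, B: 60.0, C: 10.0
```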
Next, the control unit 21 executes a distribution process for customer flat-rate songs (step S9-6). Specifically, the distribution management unit 215 distributes the funds separately for individual billing and for the all-you-can-sing plan.
Next, the control unit 21 executes a distribution process for business flat-rate songs (step S9-7). Specifically, the distribution management unit 215 distributes the funds according to the monthly information fee of each terminal.
(Flat-rate song distribution process)
Next, the flat-rate song distribution process will be described with reference to FIG. 22. In this process, a target period is set and the process is executed separately for customers and for businesses. The customer-facing and business-facing distribution processes differ only in their funds; the procedure is otherwise the same.
First, the control unit 21 executes a process of acquiring the total information fees (step S10-1). Specifically, the distribution management unit 215 acquires the total flat-rate information fees from the resource storage unit 26, calculating the customer and business totals separately.
Next, the control unit 21 executes a process of calculating the total play count (step S10-2). Specifically, the distribution management unit 215 identifies, in the distribution management file 270 stored in the distribution management information storage unit 27, the song codes of the songs for which individual billing is not stored. The distribution management unit 215 then acquires from the playback record storage unit 25 the song result records 252 in which those song codes are stored, and sums the play counts stored in them.
Next, the control unit 21 repeats the following processing for each song.
First, the control unit 21 executes a process of acquiring the play count (step S10-3). Specifically, the distribution management unit 215 acquires from the playback record storage unit 25 the song result record 252 in which the song code of the song being processed is stored.
Next, the control unit 21 executes a process of calculating the play ratio (step S10-4). Specifically, the distribution management unit 215 calculates the play ratio by dividing the play count of the song being processed by the total play count.
Next, the control unit 21 executes a process of calculating the distribution (step S10-5). Specifically, the distribution management unit 215 acquires the funds for the target period from the resource storage unit 26. For customer-facing processing, the distribution management unit 215 calculates the total information fees for the all-you-can-sing plan; for business-facing processing, it calculates the total information fees per terminal. The distribution management unit 215 then calculates the distribution amount by multiplying the total information fees by the play ratio of the song being processed.
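Steps S10-2 through S10-5 reduce to a pro-rata split of the total information fees by play count. A minimal sketch with illustrative figures:

```python
# Sketch of the flat-rate split: each song receives
# total_fee * (its plays / total plays). Figures are illustrative.

total_fee = 1_000_000                       # total flat-rate information fees
plays = {"song1": 600, "song2": 300, "song3": 100}

total_plays = sum(plays.values())              # step S10-2
payouts = {code: total_fee * n / total_plays   # steps S10-4 and S10-5
           for code, n in plays.items()}

print(payouts)  # song1: 600000.0, song2: 300000.0, song3: 100000.0
```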
 Next, the control unit 21 executes a process of calculating the copyright management organization's share of the distribution amount (step S10-6). Specifically, the distribution management unit 215 calculates the amount to be distributed to the copyright management organization by multiplying the distribution amount by the copyright management organization's distribution rate. The distribution management unit 215 then generates a distribution result record 280 and stores it in the distribution result information storage unit 28.
 Next, the control unit 21 executes a process of calculating the performer's share of the distribution amount (step S10-7). Specifically, the distribution management unit 215 calculates the amount to be distributed to the performer by multiplying the distribution amount by the distribution rate for the music piece. The distribution management unit 215 then generates a distribution result record 280 and stores it in the distribution result information storage unit 28.
 Next, the control unit 21 executes a process of determining whether the processing target uses video (step S10-8). Specifically, the distribution management unit 215 checks whether the HLS file 241 stored in the music storage unit 24 contains video. When the use flag is set in the video use data area of the music management record 244, the distribution management unit 215 determines that the processing target uses video.
 When the processing target uses video ("YES" in step S10-8), the control unit 21 executes a process of calculating the video share of the distribution amount (step S10-9). Specifically, the distribution management unit 215 calculates the amount to be distributed for the video by multiplying the distribution amount by the distribution rate for video. The distribution management unit 215 then generates a distribution result record 280 and stores it in the distribution result information storage unit 28.
 On the other hand, when the processing target does not use video ("NO" in step S10-8), the control unit 21 skips the calculation of the video share (step S10-9) and ends the distribution process for flat-rate music.
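The share calculations of steps S10-6 to S10-9 can be sketched as follows. This is a minimal illustration only: the function name, the rate values, and the tuple representation of a distribution result record are assumptions, not details taken from the embodiment.

```python
# Minimal sketch of the share-calculation steps S10-6 to S10-9.
# Rates and record structure are illustrative assumptions.

def calculate_shares(distribution_amount, rates, uses_video):
    """Split a distribution amount into per-party shares.

    rates: mapping of party name -> distribution rate (0.0 to 1.0).
    uses_video: True when the video use flag is set (step S10-8).
    """
    records = []
    # Step S10-6: copyright management organization's share.
    records.append(("copyright_org", distribution_amount * rates["copyright_org"]))
    # Step S10-7: performer's share.
    records.append(("performer", distribution_amount * rates["performer"]))
    # Steps S10-8/S10-9: video share only when the use flag is set.
    if uses_video:
        records.append(("video", distribution_amount * rates["video"]))
    return records

shares = calculate_shares(1000, {"copyright_org": 0.1, "performer": 0.5, "video": 0.2}, True)
# Each tuple corresponds to one distribution result record 280.
```

When the video use flag is not set, the video entry is simply omitted, mirroring the "NO" branch of step S10-8.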
 As described above, the first embodiment provides the following effects.
 (1) Since a karaoke file can be generated from the master recording, the service user can sing to the same accompaniment as the performer.
 (2) Even when a music piece contains a plurality of vocals, a vocal mask file is generated for each singer, so the vocal volume can be adjusted for each singer.
 (3) Since the second mask generation process is executed when the vocal mask is determined to be insufficient, distortion of the music caused by the vocal mask can be suppressed. Furthermore, since the third mask generation process is executed, a precise vocal mask can be generated.
 (4) Since the playback count process is executed when the playback time exceeds the playback determination time, the number of playbacks can be counted accurately.
 (5) Since the distribution amount is calculated based on individual rules, it can be calculated according to the contract with each distribution recipient.
 (6) Even for varied medley songs containing a plurality of music pieces, the distribution amount can be distributed with the constituent pieces taken into account.
 (7) Even when a flat-rate fee system is applied, the distribution amount can be calculated according to the playback status. In addition, since the video share of the distribution amount is also calculated, funds can be distributed for video as well.
 The first embodiment described above may be modified as follows.
 - The voice quality specification process performed by the control unit 21 (step S2-3) may be either the voice quality specification based on similar music pieces (step S3-1) or the voice quality specification based on solo passages (step S3-2) alone. When the voice quality is specified by both methods, the control unit 21 generates the vocal mask file using a voice quality determined according to a predetermined rule. The predetermined rule is, for example, to specify, as the voice quality, a range that covers both vocal ranges.
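The predetermined rule above, taking a range that covers both specified vocal ranges, can be sketched as follows. Representing a vocal range as a (low, high) pair of MIDI note numbers is an illustrative assumption, not the embodiment's data format.

```python
# Hypothetical sketch of the predetermined rule: when both methods
# (steps S3-1 and S3-2) yield a vocal range, use the range that
# covers both. Ranges are modeled as (low, high) MIDI note numbers,
# an illustrative assumption only.

def merge_vocal_ranges(range_a, range_b):
    """Return the smallest range covering both input ranges."""
    low = min(range_a[0], range_b[0])
    high = max(range_a[1], range_b[1])
    return (low, high)

# e.g. a range from similar music pieces and one from a solo passage:
merged = merge_vocal_ranges((55, 72), (57, 76))
# merged == (55, 76)
```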
 - The distribution conditions contained in the distribution management file 270 may be set based on the difficulty of specifying the vocal range. For example, when a vocal mask file is generated by the first mask process, a lower distribution rate is set for the performer than when a vocal mask file is generated by the second mask process. Furthermore, when a vocal mask file is generated by the second mask process, a lower distribution rate is set for the performer than when a vocal mask file is generated by the third mask process. In this way, the distribution rate can be changed according to the difficulty of generating the vocal mask file.
 - The method of determining whether a music piece has been played may be, for example, determining whether the chorus position of the piece has been played. In this case, in the process of estimating the main melody (step S1-3), the control unit 21 specifies the chorus position based on a specific pattern. The specified chorus position is then stored in the music management record 244 in association with the time index. The playback management unit 214 then determines whether the piece has been played based on the playback time and the chorus position. Even when the playback time has not reached the playback determination time, the playback management unit 214 counts the piece as played if the chorus has been played.
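The modified playback judgment above can be sketched as a simple predicate. The function name and the use of seconds for the thresholds are assumptions for illustration.

```python
# Minimal sketch of the modified playback judgment: count a playback
# either when the playback time reaches the playback determination
# time, or when the chorus position has been played.

def is_counted_as_played(play_time, determination_time, chorus_played):
    """Return True when this playback should be counted."""
    if play_time >= determination_time:
        return True          # ordinary rule: enough of the piece played
    return chorus_played     # modified rule: a played chorus also counts

# A 40-second playback falls short of a 60-second threshold,
# but still counts when the chorus was reached:
counted = is_counted_as_played(40, 60, chorus_played=True)
```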
 - When the users forming a single group each use the karaoke service on their own user terminals 40, the playback count may be adjusted as follows.
 As shown in FIG. 23, when a plurality of user terminals 40a, 40b, and 40c play the same music piece at the same time, the playback count is adjusted. For example, the management server 20 includes a group information storage unit. The group information storage unit stores, for each group management code, data on the user codes of the user terminals 40 used as a group. When the management server 20 receives the designation of a music piece for which the karaoke service is to be used, it stores parent device information in association with the user code of the user terminal 40 that designated the piece. The service management system then executes the group playback management process shown in FIG. 24.
 (Group playback management process)
 First, the control unit 21 executes a process of registering a group (step S11-1). Specifically, when a plurality of users use the karaoke service on their own user terminals 40, each user terminal 40 accesses the management server 20. The playback management unit 214 assigns a group management code to the group. For each user terminal 40 that logs in using the group management code, the playback management unit 214 stores the user code in the group information storage unit in association with the group management code (group registration). Here, it is assumed that the user terminals 40a to 40c log in using the same group management code.
 Next, the control unit 21 executes a process of accepting a music piece designation (step S11-2). Specifically, the playback management unit 214 obtains, from any group-registered user terminal 40, the music code of the piece the user wants to use. For example, assume that a designation is received from the user terminal 40b.
 Next, the control unit 21 executes a process of specifying the designating terminal as the parent device (step S11-3). Specifically, among the user terminals 40a to 40c assigned the group management code, the playback management unit 214 registers the user terminal 40b as the parent device for the piece designated first.
 Next, the control unit 21 executes a process of instructing synchronization (step S11-4). Specifically, the playback management unit 214 transmits a synchronization instruction to the group-registered user terminals 40a to 40c. In this case, the playback management unit 214 obtains the download status of the piece at each group-registered user terminal 40a to 40c, and instructs each user terminal 40 to wait until the downloads of all the user terminals 40a to 40c are complete.
 Next, the control unit 21 executes a playback process (step S11-5). Specifically, when the playback management unit 214 has received download completion information from all the group-registered user terminals 40a to 40c, it instructs them to start playing the piece simultaneously.
 Next, the control unit 21 executes a process of storing the playback result (step S11-6). Specifically, the playback management unit 214 registers a user performance record 251 using the user code of the user terminal 40b designated as the parent device.
 Also, in the case of group registration, the playback management unit 214 may make playback of the piece at the same location a condition. In this case, location information is obtained from each of the user terminals 40a to 40c; for example, position information from the GPS function of the user terminals 40a to 40c can be used. This allows the control unit 21 to determine that the service is being used at the same place at the same time.
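The flow of steps S11-1 to S11-6 can be sketched as follows. The class and attribute names are illustrative assumptions; in the embodiment this state lives in the group information storage unit of the management server 20.

```python
# Minimal sketch of the group playback management process
# (steps S11-1 to S11-6). Names are illustrative assumptions.

class GroupPlaybackManager:
    def __init__(self):
        self.members = {}     # group code -> list of user codes
        self.parent = {}      # (group code, music code) -> parent user code
        self.downloaded = {}  # (group code, music code) -> user codes done

    def register(self, group_code, user_code):
        """Step S11-1: group registration at login."""
        self.members.setdefault(group_code, []).append(user_code)

    def designate(self, group_code, music_code, user_code):
        """Steps S11-2/S11-3: the designating terminal becomes the parent."""
        self.parent[(group_code, music_code)] = user_code
        self.downloaded[(group_code, music_code)] = set()

    def report_download(self, group_code, music_code, user_code):
        """Step S11-4: terminals wait until every download is complete.
        Returns True when simultaneous playback may start (step S11-5)."""
        done = self.downloaded[(group_code, music_code)]
        done.add(user_code)
        return done == set(self.members[group_code])

    def playback_record_user(self, group_code, music_code):
        """Step S11-6: the result is recorded under the parent's user code."""
        return self.parent[(group_code, music_code)]
```

A location check (GPS positions within some distance of each other) could be added as a further condition before `report_download` allows playback, per the modification above.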
 - The lyrics synchronization status may be confirmed based on information from users. Here, the management server 20 includes a correction information storage unit and executes a lyrics synchronization adjustment process. The correction information storage unit stores request management records concerning correction requests obtained from users. A request management record stores, for a music code, data on the user code, the playback time, the score, and the handling result.
 The user code data area stores data on an identifier for specifying the user who made the correction request.
 The playback time data area stores data on a time index for specifying the part of the piece for which correction is desired.
 The score data area stores data on the score given to the correction request.
 The handling result data area stores a flag specifying how the correction request was handled: either a "corrected" flag when a correction was made, or a "correction unnecessary" flag when no correction was made.
 Furthermore, the lyrics management unit 212 stores data on a correction confirmation reference score used to decide whether a correction is necessary. The lyrics management unit 212 also stores weighting information to be applied to the score based on user attributes. The weighting information stores the weighting value to be applied according to the correction ratio, that is, the proportion of a user's correction requests that actually resulted in corrections.
 Next, the lyrics synchronization adjustment process will be described with reference to FIG. 25.
 (Lyrics synchronization adjustment process)
 First, the control unit 21 executes a process of obtaining correction information from a user (step S12-1). Specifically, when the service user judges during playback on the user terminal 40 that the lyrics are out of synchronization, the service user inputs a correction request on the user terminal 40. The user terminal 40 then transmits the correction request to the management server 20. The correction request includes the music code and data on the playback time at which the correction request was input. The lyrics management unit 212 receives the correction request from the user terminal 40.
 Next, the control unit 21 executes a process of storing the correction information (step S12-2). Specifically, the lyrics management unit 212 generates a request management record for the received correction request and stores it in the correction information storage unit. The request management record includes data on the music code, the user code, and the playback time.
 Next, the control unit 21 executes a process of assigning a score weighted according to the user attributes (step S12-3). Specifically, the lyrics management unit 212 extracts, from the correction information storage unit, the request management records that store this user's user code and that have the processed flag set. The lyrics management unit 212 then calculates the correction ratio, that is, the number of records with the "corrected" flag relative to the number of requests. The lyrics management unit 212 then assigns a score weighted according to this correction ratio to the current correction request and stores it in the request management record.
 Next, the control unit 21 executes a process of determining whether there are other requests for the same correction (step S12-4). Specifically, the lyrics management unit 212 extracts, from the correction information storage unit, the correction management records that store the same music code as the correction request and that have no flag stored in the handling result data area. From the extracted records, the lyrics management unit 212 further extracts those that store the same playback time range as the correction request.
 When it is determined that there are other requests for the same correction ("YES" in step S12-4), the control unit 21 executes a score totaling process (step S12-5). Specifically, the lyrics management unit 212 sums the scores stored in the extracted correction management records and calculates the result as the total score.
 On the other hand, when it is determined that there is no other request for the same correction ("NO" in step S12-4), the control unit 21 skips the score totaling process (step S12-5).
 Next, the control unit 21 executes a process of determining whether the total score exceeds the reference score (step S12-6). Specifically, the lyrics management unit 212 compares the calculated total score with the correction confirmation reference score.
 When the total score does not exceed the reference score ("NO" in step S12-6), the control unit 21 ends the lyrics synchronization adjustment process.
 On the other hand, when the total score exceeds the reference score ("YES" in step S12-6), the control unit 21 executes a process of proposing a correction (step S12-7). Specifically, the lyrics management unit 212 transmits a confirmation request to the person-in-charge terminal 30. The confirmation request includes the music code and playback time information stored in the request management records.
 Next, the control unit 21 executes a correction judgment process (step S12-8). The person-in-charge terminal 30 that received the confirmation request displays its contents. The editor in charge then plays the music piece and checks the synchronization status of the lyrics.
 When the editor in charge judges that the lyrics synchronization is not appropriate, the editor corrects the XML file. The lyrics management unit 212 also stores the "corrected" flag in all correction management records related to the same correction request.
 On the other hand, when the editor in charge judges that the lyrics synchronization is appropriate, the editor inputs "correction unnecessary" from the person-in-charge terminal 30. The lyrics management unit 212 then stores the "correction unnecessary" flag in all correction management records related to the same correction request.
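The scoring logic of steps S12-3 to S12-6 can be sketched as follows. The base score, the linear weighting formula, and the reference value are illustrative assumptions; the embodiment only specifies that scores are weighted by the user's correction ratio and totaled against a reference score.

```python
# Minimal sketch of steps S12-3 to S12-6: weight each user's score by
# that user's past correction ratio, total the scores of matching
# requests, and propose a correction when the total exceeds the
# correction confirmation reference score. Values are assumptions.

BASE_SCORE = 10
REFERENCE_SCORE = 25

def weighted_score(corrected_count, request_count):
    """Step S12-3: weight the score by the user's correction ratio."""
    if request_count == 0:
        return BASE_SCORE  # no history: use the unweighted base score
    ratio = corrected_count / request_count
    return BASE_SCORE * (1 + ratio)

def should_propose_correction(scores):
    """Steps S12-5/S12-6: total the scores of requests for the same
    correction and compare against the reference score."""
    return sum(scores) > REFERENCE_SCORE

# Two users whose past requests were often accepted push the total
# over the reference score:
s1 = weighted_score(8, 10)   # approx. 18.0
s2 = weighted_score(5, 10)   # approx. 15.0
propose = should_propose_correction([s1, s2])
```

Under this weighting, requests from users whose past reports frequently led to actual corrections reach the confirmation threshold with fewer reports.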
 - The provided content may be background music in which the vocal volume is individually adjusted, or video in which the vocal volume is individually adjusted.
 (Second embodiment)
 A second embodiment of the service management system will now be described. The service management system of the second embodiment differs from the first embodiment and its modifications in that the degree to which the vocal volume is individually changed can be set. Accordingly, the following description focuses on the differences from the first embodiment and its modifications, and omits the description of overlapping configurations.
 As shown in FIG. 26, the user terminal 40 includes a control unit 41A, a communication unit 42A, an input unit 43, a data storage unit 44, a volume detection unit 45, and a music output unit 46. The control unit 41A is composed of a CPU, a DSP, a ROM, a RAM, and the like. The control unit 41A controls the processes executed by the other units, such as the input unit 43, the communication unit 42A, the data storage unit 44, the volume detection unit 45, and the music output unit 46.
 The input unit 43 is, for example, a remote controller having operation buttons, a volume switch, or the like. The input unit 43 inputs various kinds of information to the control unit 41A in a format that the control unit 41A can process. The information input by the input unit 43 includes a designated mode, a designated mask degree, and a designated period.
 The mask degree indicates the degree to which a vocal is individually masked during the period in which the piece is being played based on the music data. In other words, the mask degree is the degree to which the vocal volume is lowered relative to the volume of the music piece, which is an example of a mixed sound.
 Of the vocal volume relative to the volume of the piece, the vocal volume when not masked is the master volume, which equals the volume played from the master file. In contrast, the vocal volume when masked is the support volume, which is lower than the master volume.
 The designated mask degree is a mask degree that the user of the karaoke service inputs into the user terminal 40. Before a piece is played from the music data, one designated mask degree can be input for each vocal contained in the piece. For example, when the piece is a solo piece, one designated mask degree can be input for the vocal. When the piece is a duet, one designated mask degree can be input for each of the two vocals.
 The designated mode is a playback format that the user of the karaoke service inputs into the user terminal 40. Before a piece is played from the music data, one designated mode can be input for the playback of that piece. The designated mode is, for example, any one of a fixed mode, a first variation mode, and a second variation mode.
 In the fixed mode, the control unit 41A adopts a fixed value as the mask degree and masks the vocal with that fixed value when playing the piece from the music data.
 In the first variation mode, the control unit 41A adopts a variable value as the mask degree and masks the vocal based on that variable value when playing the piece from the music data. The variable value in the first variation mode changes according to the detected volume of the performance sound, that is, the volume detected by the volume detection unit 45.
 Specifically, the variable value in the first variation mode changes, for example, so that the support volume becomes lower the more the detected volume exceeds a reference volume, and so that the support volume becomes higher the more the detected volume falls below the reference volume. The reference volume may be a volume obtained by masking the master volume at a fixed rate, or may be a fixed volume.
 In the second variation mode, the control unit 41A adopts a variable value as the mask degree and masks the vocal with that variable value when playing the piece from the music data. The variable value in the second variation mode changes so that the sum of the detected volume and the support volume matches the master volume.
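The three designated modes can be sketched numerically as follows. Modeling volumes as linear amplitudes, the mask degree as a fraction in [0, 1] of the master volume that is removed, and the `gain` sensitivity of the first variation mode are all illustrative assumptions, not the embodiment's representation.

```python
# Minimal sketch of the three designated modes. The numeric model
# (linear amplitudes, mask degree in [0, 1], assumed gain) is
# illustrative only.

def support_volume_fixed(master_volume, mask_degree):
    """Fixed mode: the vocal is masked by a fixed degree."""
    return master_volume * (1.0 - mask_degree)

def support_volume_first_variation(master_volume, detected_volume,
                                   reference_volume, gain=0.5):
    """First variation mode: the support volume falls as the detected
    volume (the user's singing) rises above the reference volume, and
    rises as it falls below it. `gain` is an assumed sensitivity."""
    delta = detected_volume - reference_volume
    volume = reference_volume - gain * delta
    return min(max(volume, 0.0), master_volume)  # clamp to a valid range

def support_volume_second_variation(master_volume, detected_volume):
    """Second variation mode: detected volume plus support volume is
    kept equal to the master volume."""
    return max(master_volume - detected_volume, 0.0)
```

For example, in the second variation mode a detected volume of 0.7 against a master volume of 1.0 yields a support volume of about 0.3, so the combined singing and vocal track stay near the master volume.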
 The input unit 43 inputs to the control unit 41A, in a format that the control unit 41A can process, an application period, which is the period during which the designated mode is applied within the period in which the piece is played. The application period is, for example, at least one of the A melody, the B melody, the C melody, the chorus, and the chapters that make up the piece.
 The communication unit 42A transmits karaoke file requests to the management server 20 and notifies the management server 20 of karaoke file reception. The control unit 41A controls the transmission of data and the reception of data executed by the communication unit 42A.
 The data storage unit 44 has a storage area for storing various data that the input unit 43 inputs to the control unit 41A and various data such as the karaoke files received by the communication unit 42A. The data storage unit 44 stores the music playback program that the control unit 41A executes to change the vocal volume, and the various data that the control unit 41A uses to change the vocal volume.
 The data stored in the data storage unit 44 includes a standard mask degree as the default mask degree and a standard period as the default application period.
 The standard mask degree is adopted by the control unit 41A in place of the designated mask degree when no designated mask degree has been input for the playback of a piece.
 The standard period is adopted by the control unit 41A in place of the designated period when no designated period has been input for the playback of a piece. The data stored in the data storage unit 44 also includes data on an identifier with which the control unit 41A specifies the designated mode.
 The control unit 41A controls the storage of data by the data storage unit 44 and the reading of stored data from the data storage unit 44.
 The control unit 41A reads the music playback program stored in the data storage unit 44 and, in accordance with the read program, generates a music playback signal for playing the piece from the music data.
 The control unit 41A inputs the generated music playback signal to the music output unit 46 and causes the music output unit 46 to perform playback based on the signal. By executing the music playback program, the control unit 41A and the music output unit 46 function as a volume setting unit and a playback processing unit.
 The volume detection unit 45 detects, as the detected volume, the volume of the performance sound input to the user terminal 40. For example, the volume detection unit 45 detects, as the detected volume, the volume of the singing voice that the user performing the vocal inputs into the microphone.
 The control unit 41A includes a volume setting unit 411 and a playback signal generation unit 412.
 The volume setting unit 411 executes a process for changing the vocal volume using the music data and vocal mask contained in the karaoke file, together with the designated mask degree and standard mask degree stored in the data storage unit 44.
 The vocal mask is data for specifying the vocal line, that is, the range containing the vocal scale, within the range of the played sound. The vocal mask contains the vocal range as data, and the vocal range is defined with respect to the time index, that is, the playback time of the piece. The sounds belonging to the vocal line are specified based on the vocal mask, and the volume of the specified sounds is changed based on the mask degree, which changes the support volume.
 In the fixed mode and the first variation mode, the volume setting unit 411 sets the designated mask degree as the volume setting when a designated mask degree has been input. When no designated mask degree has been input, the volume setting unit 411 sets the standard mask degree as the volume setting. Among the volume settings executed by the volume setting unit 411, the settings in the first variation mode and the second variation mode are auxiliary settings that use a varying mask degree.
 In the fixed mode, the first variation mode, and the second variation mode, the volume setting unit 411 sets the designated period as the volume setting when a designated period has been input. When no designated period has been input, the volume setting unit 411 sets the standard period.
 For example, when the setting format executed by the user terminal 40 is the fixed mode and both a designated mask degree and a designated period have been input, the volume setting unit 411 executes the setting process for changing the vocal volume as follows.
 That is, the volume setting unit 411 refers to the music data, the vocal mask, the designated mask degree, and the designated period. The volume setting unit 411 sets the designated mask degree as the mask degree adopted by the playback signal generation unit 412 so that the vocal volume played from the music data becomes the support volume masked by the designated mask degree. The volume setting unit 411 then sets the designated period as the period during which the playback signal generation unit 412 adopts the designated mask degree.
 For example, when the setting format executed by the user terminal 40 is the first variation mode and both a designated mask degree and a designated period have been input, the volume setting unit 411 executes the setting process for changing the vocal volume as follows.
 That is, the volume setting unit 411 refers to the music data, the vocal mask, the designated mask degree, and the designated period. When vocal reproduction from the music data starts, the volume setting unit 411 sets the designated mask degree as the mask degree adopted by the reproduction signal generation unit 412, so that the vocal volume reproduced from the music data becomes the support volume.
 After vocal reproduction has started, the volume setting unit 411 lowers the mask degree adopted by the reproduction signal generation unit 412 so that the support volume rises when the volume detected by the volume detection unit 45 falls. Conversely, the volume setting unit 411 raises the mask degree adopted by the reproduction signal generation unit 412 so that the support volume falls when the detected volume rises. The volume setting unit 411 sets the designated period as the period during which the reproduction signal generation unit 412 adopts these mask degrees.
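The feedback rule of the first variation mode can be illustrated with a short sketch. This is not part of the embodiment: the function name, the 0-to-1 encoding of the mask degree, and the fixed adjustment step are assumptions made for illustration only.

```python
def adjust_mask_degree_first_mode(mask_degree, detected_volume, reference_volume,
                                  step=0.1):
    """Lower the mask degree (raising the support volume) when the detected
    volume falls below the reference volume; raise it (lowering the support
    volume) when the detected volume rises above the reference volume.
    The mask degree is encoded here as 0.0 (vocal unmasked) to 1.0 (fully masked)."""
    if detected_volume < reference_volume:
        mask_degree -= step   # singer too quiet: unmask the vocal slightly
    elif detected_volume > reference_volume:
        mask_degree += step   # singer loud enough: mask the vocal slightly more
    return max(0.0, min(1.0, mask_degree))  # keep the degree in its valid range
```

Repeating this adjustment each time the volume detection unit produces a reading yields the gradual tracking behavior described above.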
 For example, when the setting format executed by the user terminal 40 is the second variation mode and a designated period has been input, the volume setting unit 411 executes the setting process for changing the vocal volume as follows.
 That is, the volume setting unit 411 refers to the music data, the vocal mask, and the designated period, and sequentially adjusts the mask degree adopted by the reproduction signal generation unit 412. In doing so, the volume setting unit 411 adjusts the mask degree so that the sum of the volume detected by the volume detection unit 45 and the support volume equals the master volume. The volume setting unit 411 also sets the designated period as the period during which the reproduction signal generation unit 412 adopts such mask degrees.
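The second variation mode keeps the sum of the detected volume and the support volume equal to the master volume, which can be sketched as follows. This is illustrative only; the clamping to zero, reflecting that the terminal can only add sound, is an assumption.

```python
def support_volume_second_mode(detected_volume, master_volume):
    """Choose the support volume so that detected volume + support volume
    equals the master volume; never negative, since reproduction can only
    add sound, not subtract the singer's voice."""
    return max(0.0, master_volume - detected_volume)
```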
 The reproduction signal generation unit 412 generates, from the music data downloaded from the management server 20, a music reproduction signal for reproducing the music. The reproduction signal generation unit 412 also generates, from the video data downloaded from the management server 20, a video reproduction signal for reproducing the video.
 Using the music data and the vocal mask, the reproduction signal generation unit 412 identifies the vocal within the music reproduced from the music data. That is, the reproduction signal generation unit 412 generates a signal for reproducing the music from the music data and identifies, within the generated signal, the portion corresponding to the vocal. The reproduction signal generation unit 412 adopts the mask degree set by the volume setting unit 411. It then processes the signal for reproducing the music so that the portion identified as the vocal becomes a signal masked by that mask degree, and uses the result as the music reproduction signal. As the period during which the mask degree is adopted, the reproduction signal generation unit 412 uses the period set by the volume setting unit 411, and executes the above processing only during that period.
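The masking performed by the reproduction signal generation unit 412 can be pictured as attenuating only the samples identified as vocal, and only inside the applied period. A minimal sketch under assumed representations: the vocal mask as a per-sample boolean list and the mask degree as a linear gain reduction, neither of which is specified by the embodiment.

```python
def apply_vocal_mask(samples, vocal_mask, mask_degree, period):
    """Attenuate by the mask degree the samples marked as vocal, but only
    inside the applied period.

    samples     -- list of floats (the music reproduction signal)
    vocal_mask  -- list of bools, True where a sample belongs to the vocal
    mask_degree -- 0.0 (vocal untouched) .. 1.0 (vocal fully removed)
    period      -- (start, end) sample indices where the mask is adopted
    """
    start, end = period
    out = []
    for i, (sample, is_vocal) in enumerate(zip(samples, vocal_mask)):
        if is_vocal and start <= i < end:
            sample *= (1.0 - mask_degree)  # the attenuated vocal is the support sound
        out.append(sample)
    return out
```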
 As shown in FIG. 27, first, the volume setting unit 411 reads from the data storage unit 44 the music data included in the karaoke file provided by the management server 20 (step S13-1).
 Next, the volume setting unit 411 reads from the data storage unit 44 the vocal mask included in the karaoke file provided by the management server 20 (step S13-2).
 Next, the volume setting unit 411 reads from the data storage unit 44 the designated mode input through the input unit 43 for the current music reproduction, and sets the read designated mode in the reproduction signal generation unit 412 (step S13-3).
 When a designated period has been input, the volume setting unit 411 sets the designated period in the reproduction signal generation unit 412 as the period during which the mask degree is adopted for the current music reproduction. When no designated period has been input, the volume setting unit 411 sets the standard period in the reproduction signal generation unit 412 instead.
 When a designated mask degree has been input, the volume setting unit 411 sets the designated mask degree in the reproduction signal generation unit 412 as the mask degree adopted for the current music reproduction. When no designated mask degree has been input, the volume setting unit 411 sets the standard mask degree in the reproduction signal generation unit 412 instead.
 Next, the control unit 41A determines whether the designated mode adopted for the current music reproduction is the fixed mode (step S13-4).
 When the designated mode adopted for the current music reproduction is determined to be the first variation mode or the second variation mode ("NO" in step S13-4), the control unit 41A starts reproducing the music in the designated mode, that is, in the first variation mode or the second variation mode (step S13-5).
 On the other hand, when the designated mode adopted for the current music reproduction is determined to be the fixed mode ("YES" in step S13-4), the control unit 41A starts reproducing the music in the fixed mode, which is the designated mode (step S13-15).
 Next, during reproduction of the music in the first variation mode or the second variation mode, every time the reproduction time reaches the designated period, the control unit 41A repeats the following processing within the designated period, based on the volume detected by the volume detection unit 45.
 That is, during reproduction of the music in the first variation mode, every time the volume detection unit 45 detects the volume, the volume setting unit 411 of the control unit 41A lowers the mask degree adopted by the reproduction signal generation unit 412 so that the support volume rises when the detected volume falls below the reference volume. Conversely, the volume setting unit 411 raises the mask degree adopted by the reproduction signal generation unit 412 so that the support volume falls when the detected volume rises above the reference volume.
 During reproduction of the music in the second variation mode, every time the volume detection unit 45 detects the volume, the volume setting unit 411 adjusts the mask degree so that the sum of the detected volume and the support volume equals the master volume (step S13-6).
 The reproduction signal generation unit 412 then generates, using the mask degree reflecting the current detection result, a music reproduction signal for producing the vocal at its new volume (step S13-7).
 During reproduction of the music in the fixed mode, every time the reproduction time reaches the designated period, the control unit 41A sets the standard mask degree so that the vocal volume is masked by the current mask degree (step S13-17). The reproduction signal generation unit 412 then generates, using the mask degree set this time, a music reproduction signal for producing the vocal at its new volume (step S13-18).
 With reference to FIGS. 28 to 30, examples of the music reproduction results in the fixed mode, the first variation mode, and the second variation mode are described.
 In each of FIGS. 28 to 30, the solid line shows the transition, over the reproduction time, of the detected volume, that is, the volume detected by the volume detection unit 45. The broken line shows the transition of the support volume, that is, the volume of the vocal within the music reproduced by the user terminal 40. The two-dot chain line shows the transition of the master volume, that is, the volume of the unmasked vocal.
 FIG. 28 shows the transition of the volume of each sound in the fixed mode. FIG. 29 shows the transition of the volume of each sound in the first variation mode. FIG. 30 shows the transition of the volume of each sound in the second variation mode.
 As FIG. 28 shows, as the reproduction time elapses, the detected volume changes so as to follow changes in the master volume. However, during the period T of the reproduction time, the user becomes unable to sing the vocal, for example because the user's voice range falls outside the range required for the vocal or because the user's voice gives out, and the detected volume consequently drops to a level at which it is barely detected.
 Here, the support volume masked by a fixed mask degree is lower than the master volume and changes throughout the reproduction time so as to follow changes in the master volume. As a result, during the period T in which the user should be singing the vocal, even if the user cannot, the music is made up of the support sound reproduced by the user terminal 40 and the other sounds. The user terminal 40 thus assists in performing the music.
 As FIG. 29 shows, here too the detected volume changes so as to follow changes in the master volume as the reproduction time elapses, while during the period T of the reproduction time the detected volume drops to a level at which it is barely detected.
 Here, the support volume based on the varying mask degree changes, relative to the support volume based on the fixed mask degree, following changes in the difference dV1 between the detected volume and the master volume. When a designated mask degree has been input, the fixed mask degree is the designated mask degree; when no designated mask degree has been input, the fixed mask degree is the standard mask degree.
 The support volume generated with the varying mask degree is higher the further the volume of the singing voice falls below the master volume, and lower the further the volume of the singing voice rises above the master volume.
 That is, the control unit 41A takes the master volume masked by the fixed mask degree as a reference volume, takes the difference dV1 between the detected volume and the master volume multiplied by a predetermined value α of 1 or less as a variation component, and uses the sum of the two as the support volume.
 The support volume obtained in this way is sometimes higher and sometimes lower than the master volume masked by the fixed mask degree. The predetermined value α may be input through the input unit 43 together with the designated mode, the designated mask degree, the designated period, and so on, or may be stored in advance in the data storage unit 44 like the standard mask degree.
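The support-volume calculation above can be written out as a sketch. The sign convention for dV1 (taken here as the master volume minus the detected volume, so that the support volume rises as the singing voice falls) and the treatment of the mask degree as a linear attenuation are assumptions made for illustration; `alpha` stands for the predetermined value α of 1 or less.

```python
def support_volume_first_mode(master_volume, detected_volume,
                              fixed_mask_degree, alpha=0.5):
    """Support volume = (master volume masked by the fixed mask degree)
    + alpha * dV1. The first term is the reference volume; the second is
    the variation component."""
    reference = master_volume * (1.0 - fixed_mask_degree)  # linear masking, assumed
    dv1 = master_volume - detected_volume  # sign convention assumed
    return reference + alpha * dv1
```

With master_volume = 1.0 and fixed_mask_degree = 0.5, the support volume equals the reference 0.5 when the singer matches the master, rises toward 1.0 as the detected volume drops to zero, and falls below 0.5 when the singer exceeds the master, matching the behavior shown in FIG. 29.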
 As a result, as described above, even when the user cannot sing the vocal, the music can be made up of the support sound generated by the user terminal 40 and the other sounds, thereby assisting the performance of the music.
 Moreover, during periods in which the user can sing the vocal sufficiently, the support sound is generated at a volume well below the support volume that the fixed mask degree would produce. Reproduction of the support sound by the user terminal 40 is therefore kept from interfering with the user's vocal performance.
 As FIG. 30 shows, here too the detected volume changes so as to follow changes in the master volume as the reproduction time elapses, while during the period T of the reproduction time the detected volume drops to a level at which it is barely detected.
 Here, the support sound based on the varying mask degree is reproduced when the detected volume is lower than the master volume; its level is set to a value corresponding to the difference dV2 between the detected volume and the master volume, and it changes so as to follow changes in dV2.
 During the period T, the support volume based on the varying mask degree rises to the same level as the master volume. Conversely, during periods in which the user can sing the vocal sufficiently, the support volume is kept sufficiently low.
 As a result, as described above, even when the user cannot sing the vocal, the music can be made up of the support sound generated by the user terminal 40 and the other sounds, thereby assisting the performance of the music. Moreover, reproduction of the vocal by the user terminal 40 is further kept from interfering with the user's vocal performance.
 In the first variation mode or the second variation mode, for example, in a duet containing a female vocal and a male vocal, the vocal masks for both can be used, so that even when the detected volume of one part drops to a barely detectable level, that part can be assisted by the support volume.
 Accordingly, there is no need to separately generate a vocal mask that masks only the female vocal in order to reproduce only the male vocal, or a vocal mask that masks only the male vocal in order to reproduce only the female vocal.
 As described above, the second embodiment provides the following effects.
 (1) Even when the user cannot sing the vocal, the generation of the support sound keeps the accompaniment from being the only sound heard in the space. Moreover, since a target sound such as the vocal contained in the master recording is adopted as the support sound, listeners who hear the reproduced sound together with the user are kept from finding the support sound unnatural.
 (2) Since the varying mask degree is generated so as to reduce the gap between the detected volume and the support volume generated with the fixed mask degree, the music produced by the sound reproduced from the music data together with the user's singing voice is harmonized as a piece of music.
 (3) Since the support volume varies so that the sum of the detected volume and the support volume equals the master volume, the music produced by the sound reproduced from the music data together with the user's singing voice is harmonized still further as a piece of music.
 (Third embodiment)
 With reference to FIGS. 31 and 32, a third embodiment of the service management system is described. The third embodiment differs from the music reproduction device of the second embodiment in that the volume settings made up to the previous reproduction are handled as a history. The following description focuses mainly on the differences from the second embodiment; configurations having the same functions as those described in the second embodiment are given the same reference numerals, and their description is omitted.
 The communication unit 42A transmits to the management server 20, as a setting candidate for subsequent reproductions, the volume setting executed by the volume setting unit 411 in the current reproduction using the music data. The control unit 41A has the communication unit 42A transmit a setting candidate each time music reproduction using the music data is executed.
 The management unit 210 of the management server 20 functions as a history management unit and executes a process of storing the setting candidates received from the user terminal 40 in the reproduction record storage unit 25. The reproduction record storage unit 25 includes the reproduction date and time, the user code, the music code, and the setting candidate in the reproduction record data 250.
 A setting candidate included in the reproduction record data 250 contains data on an identifier for specifying the designated mode of the corresponding reproduction. It also contains the mask degree used for reproducing the support volume, whether a designated mask degree or the standard mask degree, and the application period used for reproducing the support volume, whether a designated period or the standard period.
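One entry of the reproduction record data 250 could be modeled as below. The field names and types are illustrative assumptions, not taken from the embodiment.

```python
from dataclasses import dataclass
from typing import Tuple

@dataclass
class ReproductionRecord:
    """One entry of the reproduction record data 250 (names assumed)."""
    played_at: str                   # reproduction date and time
    user_code: str
    music_code: str
    designated_mode: str             # identifier of the mode used for the reproduction
    mask_degree: float               # designated or standard mask degree actually used
    applied_period: Tuple[int, int]  # designated or standard period actually used
```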
 When providing a karaoke file to the user terminal 40, the control unit 21 of the management server 20 refers to the reproduction record data 250 stored in the reproduction record storage unit 25. The control unit 21 determines whether setting candidates corresponding to the user code and the music code involved in the current reproduction exist.
 When it is determined that setting candidates are stored in the reproduction record storage unit 25, the control unit 21 of the management server 20 includes in the karaoke file all setting candidates corresponding to the user code and the music code involved in the current reproduction and transmits the file to the user terminal 40. On the other hand, when it is determined that no setting candidates are stored in the reproduction record storage unit 25, the control unit 21 includes in the karaoke file an indication that there are no setting candidates and transmits the file to the user terminal 40.
 The user terminal 40 includes a history analysis unit 413 for analyzing the setting candidates received from the management server 20. The history analysis unit 413 treats each setting candidate received from the management server 20 as part of the user's setting history and, according to a predetermined algorithm, generates the volume setting for the current reproduction from that history.
 For example, the history analysis unit 413 extracts from the setting candidates the setting that has been adopted most often, and uses the extracted setting as the current setting. For example, the history analysis unit 413 extracts the designated mode adopted most often from the setting candidates and sets the extracted designated mode as the current designated mode.
 The history analysis unit 413 may also, for example, check for a run of identical designated modes starting from the setting candidate with the most recent reproduction date and time, and set the most recent consecutively adopted designated mode as the current designated mode. The history analysis unit 413 may also, for example, take the mask degrees included in the setting candidates, calculate their mean or mode as a representative value, and set the calculated representative value as the current designated mask degree.
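The history analysis described above could be sketched as follows. The dictionary keys are invented for the sketch, and the choice of the mean (rather than the statistical mode) for the mask degree is one of the alternatives the text mentions, not a fixed rule.

```python
from statistics import mean, mode

def settings_from_history(candidates):
    """Derive the current settings from past setting candidates: the most
    frequently adopted designated mode, and the mean of the past mask
    degrees as a representative value."""
    modes = [c["designated_mode"] for c in candidates]
    degrees = [c["mask_degree"] for c in candidates]
    return {"designated_mode": mode(modes),  # most frequently adopted mode
            "mask_degree": mean(degrees)}    # representative mask degree
```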
 As shown in FIG. 32, first, the control unit 41A reads the music data and the vocal mask from the provided karaoke file (step S13-1). At this point, the control unit 41A determines whether the karaoke file contains setting candidates (step S13-2).
 When the karaoke file contains no setting candidates ("NO" in step S13-2), the volume setting unit 411 sets the fixed mode as the designated mode adopted by the reproduction signal generation unit 412. The volume setting unit 411 also sets the standard mask degree as the mask degree adopted by the reproduction signal generation unit 412, and the standard period as the application period adopted by the reproduction signal generation unit 412 (step S13-3).
 On the other hand, when the karaoke file contains setting candidates ("YES" in step S13-2), the history analysis unit 413 of the user terminal 40 generates from the read setting candidates the designated mode, mask degree, and application period to adopt for the current music reproduction (step S14-3).
 The volume setting unit 411 provisionally sets the designated mode, mask degree, and designated period to be adopted by the reproduction signal generation unit 412 according to the result of the analysis by the history analysis unit 413. The volume setting unit 411 also prompts the user to confirm the settings for the current reproduction by outputting the provisionally set content to the display unit or the like (step S14-4).
 At this point, the volume setting unit 411 permits the designated mode, mask degree, and designated period to be changed by an external operation on the input unit 43 of the user terminal 40. The volume setting unit 411 then sets the confirmed content as the designated mode, mask degree, and application period adopted by the reproduction signal generation unit 412 (step S13-3).
 Thereafter, as in the second embodiment, the control unit 41A reproduces the music from the music data included in the karaoke file according to the volume setting made by the volume setting unit 411. The control unit 41A then transmits the volume setting of the current reproduction to the management server 20 as a setting candidate for the next reproduction (step S14-5).
 As described above, the third embodiment provides the following effect.
 (1) If the music to be reproduced is the same, the support-sound volume and application period required for the current reproduction are often the same as one of the settings made up to the previous reproduction. In this respect, since the volume setting unit 411 makes the current volume setting using the volume settings made up to the previous reproduction, the burden on the user of entering new settings is reduced.
 The second and third embodiments described above may be modified as follows.
 ・The user terminal 40 is not limited to a terminal connected to the management server 20 via a network and may be, for example, a device that is not connected to a network. In that case, the user terminal 40 may include the vocal mask generation function of the management server 20. That is, the user terminal may include a functional unit corresponding to the vocal processing unit 211 of the management server 20, specify the vocal range from the music data included in the master file, and execute the process of generating a vocal mask for changing the vocal volume.
 ・The reproduction of music by the user terminal 40 is not limited to processing that uses the music data stored in the music storage unit 24 and the vocal mask. The user terminal 40 may, for example, acquire from the management server 20 a music streaming file for reproducing the mixed sound and a mask streaming file for masking the target sound within the mixed sound, and stream the music using these acquired files. The user terminal 40 of the first embodiment may also have a similar function for performing such streaming reproduction.
 ・The adjustment target sounds may be, for example, sounds played by two or more mutually different instruments making up an orchestra. In this way, target sounds may be contained separately in two or more sound ranges, each associated with a common time index. In that case, the user terminal 40 may use a plurality of vocal masks, each masking a different sound range, with one mask degree per vocal mask. Alternatively, the user terminal 40 may use a single vocal mask defined so as to mask the mutually different sound ranges, with one mask degree per sound range. With a configuration in which two or more sound ranges contain separate adjustment target sounds, it is thus possible to assist the performance of a mixed sound produced by a plurality of externally performed sounds, each having a different scale, together with the other reproduced sounds.
 ・A plurality of vocal masks, each masking a mutually different sound range, may be employed in the reproduction of a single piece of music. In this case, each vocal mask and the application period during which it is employed are associated with each other, one application period per vocal mask, in the user terminal 40 on the basis of an operation input to the input unit 43 or the like.
 With this form of vocal-mask use, a dynamic vocal mask such as the third vocal mask is employed, for example, in application periods that demand high masking accuracy, while a static vocal mask such as the first vocal mask is employed in application periods that do not. Compared with a configuration in which only dynamic vocal masks such as the third vocal mask are used, this makes it possible to reduce the storage capacity required to store the vocal masks and to lessen the effort required to generate them.
 ・The user terminal 40 may further include a data processing unit that generates a vocal mask from the music data stored in the data storage unit 44 according to a predetermined algorithm and stores the generated vocal mask in the data storage unit 44. In this case, the data processing unit includes, for example, a CPU, a ROM, and a RAM, stores the algorithm for generating a vocal mask, and generates a vocal mask according to the predetermined algorithm from the music data input through the input unit 43. Upon receiving the result of the data processing unit's processing, the reproduction processing unit reads the music data and the vocal mask from the data storage unit 44 and reproduces the music from the music data using the vocal mask and the masking degree.
 ・The application period is not limited to an A melody, a B melody, a chorus, or a chapter; it may instead be predetermined as, for example, a period in which the vocal is above a predetermined pitch within its range, or a period in which the vocal is below a predetermined pitch. To set such a period, a vocal range for defining the application period is first determined, and the time indices at which the vocal pitch falls within that range are set as the application period.
 The human voice range divides into, for example, a high range and a low range; a user whose voice does not reach the high range needs a support sound in the high range, and a user whose voice does not reach the low range needs a support sound in the low range. In this respect, if the application period is defined by the vocal range, the support sound is generated only during the application periods in which it is genuinely needed, so the load of the volume control that masks the support sound with the masking degree can be reduced. Furthermore, if auxiliary setting in the first variation mode or the second variation mode is performed during such application periods, assistance can be provided at a volume even more preferable as the support volume during the application periods in which the support sound is genuinely needed.
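The period-setting procedure described above can be sketched as follows. The `(time_index, pitch)` pair representation, MIDI-style note numbers, and the helper name `application_periods` are assumptions for illustration.

```python
def application_periods(vocal_track, low, high):
    """vocal_track: list of (time_index, pitch) pairs, pitch as a
    MIDI-style note number. Returns the time indices at which the
    vocal lies within [low, high], i.e. the application period."""
    return [t for t, pitch in vocal_track if low <= pitch <= high]

track = [(0, 60), (1, 72), (2, 74), (3, 65), (4, 76)]
# application period for a user who needs support in the high range
high_range = application_periods(track, 70, 127)
```

The same helper, called with a low-range bound instead, yields the application period for a user who needs support in the low range.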
 ・As shown in FIG. 33, the user terminal 40 may further include a sound range specification unit 47 that identifies, from the sound a performer inputs to a microphone, the range of the sounds that performer produces. The processing by which the sound range specification unit 47 identifies the range may be performed before the music is reproduced, or may be performed successively during reproduction.
 For example, the sound range specification unit 47 may identify the performer's range in advance, and the volume setting unit 411 may include a mode specification unit 414 for providing the assistance based on the designated mode in ranges other than the range identified in advance.
 The mode specification unit 414, for example, identifies on the basis of the music data, among the time indices associated with the adjustment target sound, those time indices associated with ranges other than the range identified by the sound range specification unit 47. The mode specification unit 414 then sets the periods in which the designated masking degree or the standard masking degree is employed to the identified time indices. With such reproduction, the periods in which the adjustment target sound is reproduced at the support volume can be confined to the ranges the performer cannot perform.
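A minimal sketch of this time-index selection, under the same assumed `(time_index, pitch)` representation; the name `support_periods` is illustrative only.

```python
def support_periods(target_track, performer_low, performer_high):
    """target_track: list of (time_index, pitch) pairs for the
    adjustment target sound. Returns the time indices at which the
    target sound lies outside the performer's previously identified
    range, i.e. where the support volume should apply."""
    return [t for t, pitch in target_track
            if not performer_low <= pitch <= performer_high]

melody = [(0, 62), (1, 69), (2, 78), (3, 80), (4, 64)]
periods = support_periods(melody, 60, 72)  # performer's measured range
```

The designated or standard masking degree would then be applied only during `periods`, leaving the rest of the piece fully masked for the performer to sing or play.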
 For example, the sound range specification unit 47 may successively identify the performer's range during the performance, and the volume setting unit 411 may include a mode specification unit 414 for providing the assistance based on the designated mode on the basis of the successively identified range. This mode specification unit 414, for example, repeatedly determines during reproduction whether the difference between the performer's range successively identified by the sound range specification unit 47 and the range of the target sound based on the music file is within a predetermined range. When that difference falls outside the predetermined range, the mode specification unit 414 sets the masking degree based on the designated mode in effect at that time.
 Note that the predetermined range, which is the allowable range of the difference between the performer's range identified by the sound range specification unit 47 and the range of the adjustment target sound based on the music file, may be input through the input unit 43 in the same manner as the designated masking degree, or may be stored in advance in the data storage unit 44. With such reproduction, the adjustment target sound is reproduced as the support sound when the divergence between the range of the sounds the performer produces and the range of the adjustment target sound is large. The performer can thereby be made aware that the range of the sounds the performer is producing has departed from the range the music genuinely calls for. Moreover, because the target sound is reproduced in the range the music genuinely calls for, the sense of incongruity caused by the divergence can be suppressed for listeners appreciating the mixed sound.
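The per-frame decision repeated during playback might look like the following sketch. The function name, the pitch-difference formulation, and the convention that a degree of 1.0 means full masking are assumptions for illustration.

```python
def mask_degree_for_frame(performer_pitch, target_pitch, tolerance,
                          designated_degree, standard_degree):
    """Repeated for each frame during reproduction: if the performer
    has diverged from the target by more than the allowed tolerance,
    fall back to the designated mode's masking degree so the support
    sound becomes audible; otherwise keep the standard (full) mask."""
    if abs(performer_pitch - target_pitch) > tolerance:
        return designated_degree
    return standard_degree

degrees = [mask_degree_for_frame(p, t, tolerance=3,
                                 designated_degree=0.2, standard_degree=1.0)
           for p, t in [(60, 60), (61, 65), (70, 60)]]
```

Here `tolerance` plays the role of the predetermined range above, whether input through the input unit 43 or stored in advance in the data storage unit 44.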
 For example, the sound range specification unit 47 may identify the performer's range in advance, and the control unit 41A may transmit the identified performer's range to the control unit 21 through communication by the communication unit 42A. In this case, the control unit 21 refers to the range-related data in each set of music management data 240 and identifies the music management data 240 whose range-related data indicates a range containing the performer's range. The control unit 21 then transmits the identified music management data 240 to the communication unit 42A as music management data 240 of recommended songs. The control unit 41A may then cause a display unit or the like to output a list of the recommended songs on the basis of the music management data 240 of the recommended songs received by the communication unit 42A. Such song recommendation makes it possible to recommend to the performer songs suited to the range the performer can produce.
 Note that the control unit 41A may transmit to the control unit 21, in addition to the range of the sounds the performer produces, information on the scene or setting the performer desires, through communication by the communication unit 42A. The control unit 21 then refers to the scene-related data in each set of music management data 240 and identifies the music management data 240 in which a scene included in the scene the performer desires is stored. The control unit 21 also transmits the identified music management data 240 to the communication unit 42A as music management data 240 of recommended songs. Such song recommendation makes it possible to recommend to the performer songs that are suited both to the range the performer can produce and to the scene the performer desires.
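The server-side filtering described in the two paragraphs above can be sketched as follows. The dict-based record layout and the field names `low`, `high`, and `scenes` are assumptions standing in for the range-related and scene-related data in the music management data 240.

```python
def recommend(music_records, performer_low, performer_high, scene=None):
    """music_records: list of dicts with 'title', 'low', 'high', and
    optionally 'scenes'. A record is recommended when its range
    contains the performer's range and, if a scene is given, the
    desired scene matches."""
    hits = []
    for rec in music_records:
        if rec["low"] <= performer_low and performer_high <= rec["high"]:
            if scene is None or scene in rec.get("scenes", ()):
                hits.append(rec["title"])
    return hits

catalog = [
    {"title": "Song A", "low": 55, "high": 79, "scenes": ("wedding",)},
    {"title": "Song B", "low": 62, "high": 70, "scenes": ("party",)},
    {"title": "Song C", "low": 50, "high": 84, "scenes": ("party",)},
]
names = recommend(catalog, 58, 74, scene="party")
```

Omitting the `scene` argument yields the range-only recommendation of the preceding paragraph.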
 ・The reproduction determination method executed by the reproduction determination process is not limited to the method based on the reproduction determination time. For example, the reproduction determination may be made on the basis of the position of the chorus in the music. In this case, the control unit 21 of the management server 20 identifies the chorus position on the basis of a specific pattern in the main melody estimation process (step S1-3), and the identified chorus position is stored in the music management data 240. The reproduction management unit 214 then makes the reproduction determination on the basis of the time index and the chorus position. Here, even if the reproduction determination time has not elapsed, the content is determined to have been reproduced once the chorus has been played.
 ・The billing method may be varied according to the masking degree that has been set. In this case, a rule for calculating the distribution money on the basis of the vocal volume output according to the masking degree of the mask file is stored in advance in the distribution management file 270.
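The play-count-proportional split recited in claim 2 can be illustrated with the following sketch. The flooring of each share to a whole currency unit is an assumption for illustration; the actual rounding rule would be part of the stored distribution rule.

```python
def distribution_amounts(fund, play_counts):
    """Claim-2-style split of the service fund: each sound adjustment
    content receives fund * (its play count / total play count),
    floored here to whole currency units for illustration."""
    total = sum(play_counts.values())
    return {cid: fund * n // total for cid, n in play_counts.items()}

amounts = distribution_amounts(10000, {"song1": 3, "song2": 5, "song3": 2})
```

A masking-degree-dependent billing rule, as in the paragraph above, could be layered on by weighting each play count before this proportional split.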
 DESCRIPTION OF REFERENCE SIGNS: 10 ... content server; 20 ... management server; 21, 41, 41A ... control unit; 22 ... master storage unit; 23 ... lyric information storage unit; 24 ... music storage unit; 25 ... reproduction record storage unit; 30 ... staff terminal; 40, 40a, 40b, 40c ... user terminal; 41 ... control unit; 42 ... communication unit; 43 ... input unit; 44 ... data storage unit; 45 ... volume detection unit; 46 ... music output unit; 211 ... management unit; 212 ... vocal processing unit; 213 ... lyric management unit; 214 ... editor; 215 ... reproduction management unit; 220 ... master file; 230 ... lyric file; 240 ... music management data; 241 ... HLS file; 242 ... XML file; 243 ... vocal mask; 244 ... music management record; 250 ... reproduction record data; 251 ... user record; 252 ... music record; 26 ... resource storage unit; 411 ... volume setting unit; 412 ... reproduction signal generation unit; 42A ... communication unit; 500, 510, 520, 600, 601, 602, 603 ... scale transition graph; 501 ... first vocal mask; 511 ... second vocal mask; 521 ... third vocal mask; 700 ... XML file.

Claims (9)

  1.  A service management system comprising:
     a master storage unit that stores a master file containing data for reproducing a mixed sound, the mixed sound being a sound that includes an adjustment target sound subject to volume adjustment and other sounds;
     a resource storage unit that stores resources of a service that provides sound adjustment content to a user terminal;
     a reproduction record storage unit that stores a number of reproductions of the sound adjustment content by the user terminal; and
     a control unit connected to the user terminal, wherein
     the control unit
     identifies a range of the adjustment target sound in the master file stored in the master storage unit,
     generates, based on the identified range, a mask file for individually adjusting the volume of the adjustment target sound within the mixed sound,
     provides the sound adjustment content including the mask file to the user terminal and stores the number of reproductions in the reproduction record storage unit, and
     calculates, based on the resources stored in the resource storage unit and the number of reproductions stored in the reproduction record storage unit, a distribution amount to be distributed to a distribution recipient of the adjustment target sound.
  2.  The service management system according to claim 1, wherein the control unit
     provides a plurality of the sound adjustment contents,
     calculates a total number of reproductions, which is the sum of the numbers of reproductions of the individual sound adjustment contents provided by the control unit, and
     calculates the distribution amount of each sound adjustment content based on the ratio of its number of reproductions to the total number of reproductions.
  3.  The service management system according to claim 1 or 2, wherein the control unit
     provides the sound adjustment content to a plurality of the user terminals, and
     adjusts the number of reproductions of the sound adjustment content when the sound adjustment contents provided to the user terminals are identical to one another and the user terminals have synchronized their reproduction of the sound adjustment content with one another.
  4.  The service management system according to any one of claims 1 to 3, wherein the mixed sound is a medley composed of portions of master sound sources, and the control unit
     identifies the master sound sources constituting the medley, and
     distributes the number of reproductions among the sound adjustment contents of the master sound sources.
  5.  The service management system according to any one of claims 1 to 4, further comprising the user terminal, wherein the user terminal comprises:
     a volume setting unit that performs a volume setting; and
     a reproduction processing unit that reproduces the mixed sound, based on the setting by the volume setting unit, using the music data for reproducing the mixed sound and the mask file, such that the volume of the adjustment target sound changes to follow the setting of the volume setting unit.
  6.  The service management system according to claim 5, wherein the volume setting unit stores its setting as a setting candidate each time the sound adjustment content is reproduced, and uses the stored setting candidates to perform the setting for the current reproduction.
  7.  The service management system according to claim 6, further comprising a history management unit that manages a history of the volume settings for each user and transmits the history to the user terminal, wherein the user terminal comprises:
     a communication unit that receives from the history management unit the history, up to the previous use, managed for the current user, and transmits information on the setting performed this time to the history management unit;
     the volume setting unit, which performs the current setting using the history received by the communication unit; and
     a reproduction processing unit that reproduces the mixed sound based on the current setting such that the volume of the adjustment target sound changes to follow the current setting.
  8.  A computer-readable storage medium storing a program for performing service management using a service management system comprising: a master storage unit that stores a master file containing data for reproducing a mixed sound, the mixed sound being a sound that includes an adjustment target sound subject to volume adjustment and other sounds; a resource storage unit that stores resources of a service that provides sound adjustment content to a user terminal; a reproduction record storage unit that stores a number of reproductions of the sound adjustment content by the user terminal; and a control unit connected to the user terminal, wherein, when the program is executed, the control unit functions to
     identify a range of the adjustment target sound in the master file stored in the master storage unit,
     generate, based on the identified range, a mask file for individually adjusting the volume of the adjustment target sound within the mixed sound,
     provide the sound adjustment content including the mask file to the user terminal and store the number of reproductions in the reproduction record storage unit, and
     calculate, based on the resources stored in the resource storage unit and the number of reproductions stored in the reproduction record storage unit, a distribution amount to be distributed to a distribution recipient of the adjustment target sound.
  9.  A service management method using a service management system comprising: a master storage unit that stores a master file containing data for reproducing a mixed sound, the mixed sound being a sound that includes an adjustment target sound subject to volume adjustment and other sounds; a resource storage unit that stores resources of a service that provides sound adjustment content to a user terminal; a reproduction record storage unit that stores a number of reproductions of the sound adjustment content by the user terminal; and a control unit connected to the user terminal, the method comprising, by the control unit:
     identifying a range of the adjustment target sound in the master file stored in the master storage unit;
     generating, based on the identified range, a mask file for individually adjusting the volume of the adjustment target sound within the mixed sound;
     providing the sound adjustment content including the mask file to the user terminal and storing the number of reproductions in the reproduction record storage unit; and
     calculating, based on the resources stored in the resource storage unit and the number of reproductions stored in the reproduction record storage unit, a distribution amount to be distributed to a distribution recipient of the adjustment target sound.
PCT/JP2016/067122 2015-12-08 2016-06-08 Service management system, computer readable storage medium, and service management method WO2017098743A1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
JP2015-239691 2015-12-08
JP2015239691A JP2016110154A (en) 2014-12-09 2015-12-08 Service management system, service management method, and service management program
JP2016-035617 2016-02-26
JP2016035617 2016-02-26

Publications (1)

Publication Number Publication Date
WO2017098743A1 true WO2017098743A1 (en) 2017-06-15

Family

ID=59012969

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2016/067122 WO2017098743A1 (en) 2015-12-08 2016-06-08 Service management system, computer readable storage medium, and service management method

Country Status (2)

Country Link
JP (1) JP2017156749A (en)
WO (1) WO2017098743A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002108357A (en) * 2000-09-29 2002-04-10 Casio Comput Co Ltd Downloading system, information processor, and recording medium
JP2005077485A (en) * 2003-08-28 2005-03-24 National Institute Of Advanced Industrial & Technology Duet and chorus karaoke control system at multiple places
JP2009266310A (en) * 2008-04-25 2009-11-12 Kenwood Corp Recording device and recording method


Also Published As

Publication number Publication date
JP2017156749A (en) 2017-09-07


Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16872643

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16872643

Country of ref document: EP

Kind code of ref document: A1