WO2018155353A1

WO2018155353A1 - Generation method, generation device, reproduction method, and reproduction system

Info

Publication number: WO2018155353A1
Application number: PCT/JP2018/005615
Authority: WO
Inventors: 旭谷口; 敦宏辻; 幸　裕弘; 坂井　剛; 羊佑塩田; 浩充森下
Original assignee: パナソニックＩｐマネジメント株式会社
Priority date: 2017-02-21
Filing date: 2018-02-19
Publication date: 2018-08-30

Abstract

Provided is a generation method for generating content using a computer, the method comprising: acquiring sound data representing a specific sound (S11); accepting input of control information that includes maximum sound volume information representing the maximum volume of the specific sound represented by the acquired sound data, the control information being used to prohibit the specific sound from being outputted from a reproduction device (100) at a sound volume exceeding a predetermined sound volume set in the reproduction device (100) (S12); and generating content by associating the acquired sound data with the control information the input of which has been accepted (S13).

Description

GENERATION METHOD, GENERATION DEVICE, REPRODUCTION METHOD, AND REPRODUCTION SYSTEM

The present disclosure relates to a generation method and a generation device for generating content, a playback method and a playback system for playing back content.

Patent Document 1 discloses a video distribution device and a video reproduction device in VOD (Video On Demand) distribution.

Japanese Patent Laying-Open No. 2015-222861

The present disclosure provides a generation method and the like that can reduce discomfort given to the user by the playback device.

The generation method according to the present disclosure is a generation method for generating content using a computer, acquiring sound data indicating a predetermined sound, and setting the predetermined sound indicated by the acquired sound data to a playback device Input of control information including maximum volume information indicating the maximum volume of the predetermined sound, which is control information used for prohibiting output by the playback device at a volume exceeding a set volume that is set Content is generated by associating the received and acquired sound data with the control information that has received the input.

These general or specific aspects may be realized by a system, an apparatus, an integrated circuit, a computer program, or a recording medium such as a computer-readable CD-ROM. The system, the apparatus, the integrated circuit, and the computer program And any combination of recording media.

The method according to the present disclosure can reduce discomfort given to the user by the playback device.

FIG. 1 is a schematic diagram of a reproduction system according to an embodiment. FIG. 2 is a block diagram illustrating an example of a hardware configuration of the playback device. FIG. 3 is a block diagram illustrating an example of the hardware configuration of the server. FIG. 4 is a block diagram illustrating an example of a hardware configuration of the generation apparatus. FIG. 5 is a block diagram illustrating an example of a functional configuration of the reproduction system according to the embodiment. FIG. 6 is a diagram illustrating an example of a UI displayed on the display of the generation apparatus according to the embodiment. FIG. 7 is a diagram illustrating an example of a content configuration. FIG. 8 is a diagram showing a temporal change in the playback time of the volume of the content. FIG. 9 is a diagram showing a temporal change in the playback time of the playback volume output when the content is played back by the playback device. FIG. 10 is a diagram illustrating a temporal change in the reproduction time of the reproduction volume output when the third reproduction control is performed. FIG. 11 is a diagram illustrating a temporal change in the reproduction time of the reproduction volume output when the fourth reproduction control is performed. FIG. 12 is a flowchart illustrating an example of a generation method by the generation device according to the embodiment. FIG. 13 is a flowchart illustrating an example of a reproduction method by the reproduction apparatus according to the embodiment. FIG. 14 is a flowchart illustrating an example of details of the reproduction processing by the reproduction unit of the reproduction apparatus according to the embodiment. FIG. 15 is a flowchart illustrating another example of the details of the reproduction process performed by the reproduction unit of the reproduction apparatus according to the embodiment.

Hereinafter, embodiments will be described in detail with reference to the drawings as appropriate. However, more detailed description than necessary may be omitted. For example, detailed descriptions of already well-known matters and repeated descriptions for substantially the same configuration may be omitted. This is to avoid the following description from becoming unnecessarily redundant and to facilitate understanding by those skilled in the art.

In addition, the inventor provides the accompanying drawings and the following description in order for those skilled in the art to fully understand the present disclosure, and is not intended to limit the claimed subject matter. .

(Embodiment)
Hereinafter, embodiments will be described with reference to FIGS.

[1-1. Constitution]
FIG. 1 is a schematic diagram of a reproduction system according to an embodiment.

Specifically, in FIG. 1, a playback device 100, a server 200, a communication network 300, and a generation device 400 are shown. For example, the playback system 1 includes the playback device 100 and the server 200 among these components. The playback system 1 may further include a generation device 400. In the playback system 1, a plurality of playback devices 100 may be connected to the communication network 300. In the reproduction system 1, a plurality of generation devices 400 may be connected to the communication network 300.

The playback system 1 is a system for providing a first user with content configured by a combination of independent video content and sound content from the server 200 to the playback device 100. One playback device 100 may correspond to one first user or a plurality of first users. When the reproduction system 1 includes a plurality of reproduction apparatuses 100, a plurality of first users may correspond to each of the plurality of reproduction apparatuses 100 in a one-to-one correspondence or a one-to-many correspondence. Also good. Further, the plurality of playback devices 100 may correspond to one first user. Similarly, one second user may correspond to one generation device 400, or a plurality of second users may correspond to the one generation device 400. When the reproduction system 1 includes a plurality of generation devices 400, each of the plurality of generation devices 400 may correspond to a plurality of second users on a one-to-one basis or on a one-to-many basis. Also good. Further, the plurality of generation devices 400 may correspond to one second user. For example, video content or sound content is provided to the server 200 via the generation device 400 from a second user such as a content creator.

Hereinafter, the configuration of the playback system 1 for performing the playback process will be described in detail.

Next, the hardware configuration of the playback apparatus 100 will be described with reference to FIG.

FIG. 2 is a block diagram showing an example of the hardware configuration of the playback device.

As shown in FIG. 2, the playback device 100 includes a CPU 101 (Central Processing Unit), a main memory 102, a storage 103, a communication IF (Interface) 104, a display 105, and a speaker 106 as hardware configurations. Prepare.

The CPU 101 is a processor that executes a control program stored in the storage 103 or the like.

The main memory 102 is a volatile storage area used as a work area used when the CPU 101 executes a control program.

The storage 103 is a non-volatile storage area that holds a control program, content, and the like.

The communication IF 104 is a communication interface that communicates with the server 200 via the communication network 300. The communication IF 104 is, for example, a wired LAN interface. The communication IF 104 may be a wireless LAN interface. Further, the communication IF 104 is not limited to a LAN interface, and may be any communication interface as long as it can establish a communication connection with the communication network 300.

The display 105 is a display device that displays a processing result in the CPU 101. The display 105 displays, for example, video obtained by playing video content. The display 105 is, for example, a liquid crystal display or an organic EL display.

Speaker 106 outputs the processing result in CPU 101. The speaker 106 outputs, for example, sound or music obtained by playing sound content.

The hardware configuration of the server 200 will be described with reference to FIG.

FIG. 3 is a block diagram showing an example of the hardware configuration of the server.

As shown in FIG. 3, the server 200 includes a CPU 201 (Central Processing Unit), a main memory 202, a storage 203, and a communication IF (Interface) 204 as hardware configurations.

The CPU 201 is a processor that executes a control program stored in the storage 203 or the like.

The main memory 202 is a volatile storage area used as a work area used when the CPU 201 executes a control program.

The storage 203 is a non-volatile storage area that holds a control program, content, and the like.

The communication IF 204 is a communication interface that communicates with the playback device 100 or the generation device 400 via the communication network 300. The communication IF 204 is, for example, a wired LAN interface. Note that the communication IF 204 may be a wireless LAN interface. The communication IF 204 is not limited to a LAN interface, and may be any communication interface as long as it can establish a communication connection with the communication network 300.

The hardware configuration of the generation device 400 will be described with reference to FIG.

FIG. 4 is a block diagram illustrating an example of a hardware configuration of the generation apparatus.

As illustrated in FIG. 4, the generation apparatus 400 includes a CPU 401 (Central Processing Unit), a main memory 402, a storage 403, a communication IF (Interface) 404, an input IF (Interface) 405, as hardware configurations. And a display 406.

The CPU 401 is a processor that executes a control program stored in the storage 403 or the like.

The main memory 402 is a volatile storage area used as a work area used when the CPU 401 executes a control program.

The storage 403 is a non-volatile storage area that holds a control program, content, and the like.

The communication IF 404 is a communication interface that communicates with the server 200 via the communication network 300. The communication IF 404 is, for example, a wired LAN interface. Note that the communication IF 404 may be a wireless LAN interface. The communication IF 404 is not limited to a LAN interface, and may be any communication interface as long as it can establish a communication connection with the communication network 300.

The input IF 405 is an input device such as a numeric keypad, a keyboard, and a mouse.

The display 406 is a display device that displays a processing result in the CPU 401, for example. The display 406 displays, for example, a UI (User Interface) for receiving input from the input IF 405. The display 406 is, for example, a liquid crystal display or an organic EL display.

Next, the functional configuration of the playback system 1 will be described with reference to FIG.

FIG. 5 is a block diagram illustrating an example of a functional configuration of the reproduction system according to the embodiment.

First, the functional configuration of the generation apparatus 400 will be described.

The generation apparatus 400 includes a database (DB) 410, an acquisition unit 420, an input reception unit 430, a generation unit 440, and a communication unit 450.

The database 410 stores video data that is a source of video content or sound data that is a source of sound content. The database 410 is realized by the storage 403, for example.

The acquisition unit 420 acquires sound data indicating a predetermined sound from the database 410 in response to the input by the second user received by the input reception unit 430. The acquisition unit 420 may acquire video data from the database 410 according to the input by the second user received by the input reception unit 430. Note that the acquisition unit 420 is not limited to acquiring sound data or video data from the database 410, but may be acquired from another information processing apparatus via the communication network 300 using the communication unit 450. Alternatively, it may be acquired directly from another information processing apparatus connected by wire or wireless. Other information processing apparatuses in this case are, for example, PCs (Personal Computers), servers, smartphones, tablet terminals, video cameras, digital cameras, IC recorders, and the like. The acquisition unit 420 is realized by the CPU 401, the main memory 402, and the storage 403, for example.

The input reception unit 430 receives an input by the second user. Specifically, the input receiving unit 430 receives an input for the second user to generate content from video data or sound data stored in the database 410. The input receiving unit 430 receives input of content control information as input for generating content.

The content control information received by the input receiving unit 430 includes, for example, a predetermined sound indicated by the sound data acquired by the acquisition unit 420 at a volume that exceeds a set volume set in the playback apparatus 100. Is used for prohibiting the output of the sound, and includes maximum volume information indicating the maximum volume of a predetermined sound.

The content control information received by the input receiving unit 430 may further include, for example, attribute information indicating whether or not the adjustment of the volume of the sound data is permitted. The control information in this case is information that causes the playback device 100 to perform the following playback control when the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume. In this case, in the reproduction control, when the attribute information included in the control information indicates that the volume adjustment is permitted, the volume of the predetermined sound of the sound data associated with the control information is reduced to a setting volume or less. This is the second reproduction control for outputting a predetermined sound. Further, in the reproduction control, when the attribute information included in the control information indicates that the adjustment of the volume is not permitted, the first reproduction that prohibits the reproduction apparatus 100 from reproducing the sound data associated with the control information. Control. When the reproduction apparatus 100 reproduces content according to this control information, if the attribute information indicates that the volume adjustment is permitted, the reproduction apparatus 100 performs the second reproduction control, and the attribute information does not permit the volume adjustment. When shown, the first reproduction control is performed. In this way, it is possible to cause the playback apparatus 100 to selectively switch between the first playback control and the second playback control according to the attribute information set by the second user.

Note that the second reproduction control may include third reproduction control and fourth reproduction control. That is, the third regeneration control may be performed instead of the second regeneration control, or the fourth control may be performed.

In addition, the content control information received by the input receiving unit 430 includes (i) allowing adjustment of the overall volume of the sound data, (ii) allowing adjustment of the volume of a part of the sound data, and (iii) sound. It may further include attribute information indicating that the adjustment of the volume of the data is not permitted. The control information in this case is information that causes the playback device 100 to perform the following playback control when the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume. In this case, when the attribute information included in the control information indicates that the adjustment of the overall volume of the sound data is permitted, the reproduction control is performed at a predetermined sound volume of the sound data associated with the control information. Third reproduction control for outputting a predetermined sound in a state where the average volume indicated by the average volume information included in the information is reduced until the maximum volume indicated by the maximum volume information included in the control information is equal to or lower than the set volume. It is. In addition, in the reproduction control, when the attribute information included in the control information permits the adjustment of the volume of a part of the sound data, the reproduction control is performed on the part of the sound data associated with the control information that exceeds the set volume of the predetermined sound. This is the fourth reproduction control for outputting a predetermined sound in a state where the volume is lowered below the set volume. Further, in the reproduction control, when the attribute information included in the control information indicates that the adjustment of the volume is not permitted, the first reproduction that prohibits the reproduction apparatus 100 from reproducing the sound data associated with the control information. Control.

For example, when the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume, the sound data associated with the control information is received by the playback device 100 when the content reception control information received by the input reception unit 430 exceeds the set volume. It may be information for prohibiting reproduction. When the content is reproduced according to the control information, the reproduction device 100 performs the first reproduction control not to reproduce the content whose maximum volume exceeds the set volume. For this reason, it can suppress that the predetermined | prescribed sound contained in a content is output by the said reproducing | regenerating apparatus 100 with the volume exceeding the setting volume set to the reproducing | regenerating apparatus 100. FIG.

The content control information received by the input receiving unit 430 is, for example, a predetermined volume of sound data associated with the control information when the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume. The information may be information for causing the playback device 100 to perform playback control for outputting a predetermined sound in a state where the volume of the sound is lowered below a set volume. When reproducing the content according to the control information, the reproducing device 100 performs second reproduction control for reproducing the content by reducing the volume of the content whose maximum volume exceeds the set volume to be equal to or lower than the set volume. For this reason, it can suppress that the predetermined | prescribed sound contained in a content is output by the said reproducing | regenerating apparatus 100 with the volume exceeding the setting volume set to the reproducing | regenerating apparatus 100. FIG.

Also, the content control information received by the input receiving unit 430 may further include, for example, average volume information indicating the average volume of a predetermined sound of the sound data. The control information in this case is the control information at a predetermined sound volume of the sound data associated with the control information when the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume. Even if it is information for causing the playback apparatus 100 to perform playback control for outputting a predetermined sound in a state where the average volume indicated by the average volume information included in the information is reduced until the maximum volume is lower than or equal to the set volume. Good. When reproducing the content according to the control information, the reproducing device 100 performs third reproduction control for reproducing the content by reducing the average volume of the content until the maximum volume becomes equal to or lower than the set volume. For this reason, it can suppress that the predetermined | prescribed sound contained in a content is output by the said reproducing | regenerating apparatus 100 with the volume exceeding the setting volume set to the reproducing | regenerating apparatus 100. FIG.

The content control information received by the input receiving unit 430 is, for example, a predetermined sound of sound data associated with the control information when the maximum volume indicated by the maximum volume information in the control information exceeds the set volume. This is information for causing the playback apparatus 100 to perform playback control for outputting a predetermined sound in a state where the volume of the portion exceeding the set volume in is reduced below the set volume. This control information is information for reproducing the content without adjusting the volume of a portion of the sound data associated with the control information that does not exceed the set volume of the predetermined sound. When the content is reproduced according to the control information, the reproduction device 100 performs fourth reproduction control for reproducing the content by reducing the volume of the portion of the predetermined sound that exceeds the set volume below the set volume. . For this reason, it can suppress that the predetermined | prescribed sound contained in a content is output by the said reproducing | regenerating apparatus 100 with the volume exceeding the setting volume set to the reproducing | regenerating apparatus 100. FIG.

The content control information may include, for example, content metadata (that is, attribute information) in addition to the information described above. One set of metadata exists for one content, and includes information on reproduction time, author, ambient level, video ambient level, or sound ambient level, and content genre. Details of the ambient degree, the video ambient degree, and the sound ambient degree will be described later.

The playback time is information indicating the length of time when the content is played back.

The author is information indicating the author of the content, and includes information including the author's name and contact information.

The ambient degree is an ambient degree associated with the content.

The video ambient degree is the ambient degree associated with the video part included in the content.

The sound ambient degree is an ambient degree associated with a sound part included in the content.

Thus, the ambient degree of content and the like can be set by metadata.

Metadata is created in a predetermined format. The index is obtained by analyzing the metadata according to the metadata format. The index is an index associated with the content, and is an index expressed by a continuous value. An example of the index is an estimated index that indicates the degree of attention the user is directed to the content being played back. More specifically, the index is an index that is an index having a smaller value as the degree of attention directed to the content being played by the user is greater, or the user is directed to the content being played. As the degree of attention directed is greater, an index having a larger value may be employed. Here, the former is also referred to as an ambient level and the latter is also referred to as a conscious level. As the degree of attention directed by the user increases, for example, it is more likely to continue watching the screen on which the video is displayed from the beginning to the end of the playback time of the content, and concentrate on viewing the output sound. It can be said that it is suitable.

The index may include brightness, saturation, hue, or the like that is an index related to the color of the video included in the content being played back, or volume or frequency distribution that is an index of the sound included in the content being played back Etc. may be included. Further, the index may include an index calculated by a predetermined calculation method from the plurality of indexes.

Hereinafter, the explanation will be made using the ambient degree as an index, but the same explanation can be established by using the consciousness degree and other indices. The ambient degree is an index expressed as a continuous value from 0 to 100, for example. When the degree of ambient is 0, it means that the degree of attention estimated to be directed by the user is the largest, and when the degree of ambient is 100, the degree of attention estimated to be directed by the user is the smallest. Then.

The ambient degree associated with the content can be calculated from the video ambient degree that is the ambient degree associated with the video part of the content and the sound ambient degree that is the ambient degree associated with the sound part of the content. The video ambient degree is an example of a video index. The sound ambient degree is an example of a sound index.

The video ambient degree may be calculated based on, for example, the brightness, saturation or hue of the video of the content, or the scene change mode. More specifically, it is calculated as follows.

・ The higher the brightness of the content video, the lower the ambient degree is calculated.

・ The higher the saturation of the content video, the lower the ambient degree is calculated.

Based on the color of the content video, the higher the warm color such as red, orange or yellow, the higher the ambient, the higher the cold color such as blue or purple, the lower the ambient Calculated.

・ The lower the degree of ambientity, the more scene changes in the video.

-As a mode of video switching at the time of a scene change, when switching from one scene to the next scene, the more the image gradually changes like fade out, fade in or cross fade, the more A high degree of ambient is calculated. When switching from one scene to the next, the more frequently the images are switched, the lower the degree of ambient is calculated.

In addition, the sound ambient degree may be calculated based on, for example, the volume of the sound of the content, the frequency distribution of the sound, or the change in volume. More specifically, it is calculated as follows.

・ The lower the degree of ambient, the higher the volume of the content sound.

-Regarding the frequency distribution of the sound of the content, the higher the sound in the high sound range (for example, about 1 kHz to 20 kHz) or the low sound range (for example, about 20 Hz to 200 Hz), the higher the ambient degree is calculated, and the medium sound range (for example, about 200 Hz to 1 kHz) ), The lower the degree of ambient is calculated.

・ The steeper change in volume results in a lower ambient level.

Note that, as a method of calculating the content ambient degree from the video ambient degree and the sound ambient degree, any method can be adopted, but for example, an average or a weighted average can be used. For example, when the weighted average weight is in the range from 0 to 1 and the video ambient degree weight is α, the ambient degree of the content is expressed as (Equation 1) below.

Ambient degree of content = α x (Video ambient degree) + (1-α) x (Sound ambient degree) (Formula 1)

Here, the weighting of the video ambient degree and the sound ambient is determined as follows, for example.

(1) Increasing the weight of the sound ambient level Generally, in order to prevent a person from intentionally paying attention to the video presented by the playback device 100 or the like, the eyes are meditated or the eyes or body It is only relatively easy to change the direction. On the other hand, in order to prevent a person from paying attention to the sound presented by the playback device 100 or the like, there is a method of closing the ear, but it is not so easy, and the ear is temporarily blocked. Even so, it is difficult to completely eliminate the sound felt by the user. Therefore, the user can intentionally turn away the attention regarding the video portion of the content regardless of the degree of video ambient, but the degree of attention does not have to be close to the degree of sound ambient regarding the sound portion of the content. I do not get.

Therefore, it is effective to make the weight of the sound ambient degree heavier than the weight of the video ambient degree, that is, to make α smaller than 0.5. In this way, in the degree of ambient that is linked to the content, by making the contribution of the degree of attention directed by the person relative to the sound relatively large, the attention that the user directs the behavior of the ambient degree that is linked to the content. It is possible to get close to the sense of the degree.

(2) When increasing the weight of the video ambient degree It has been stated that it is relatively easy for humans not to pay attention to the video presented by the playback device 100, but the size of the display 105 is large. This makes it difficult to distract from the video presented by the playback device 100.

Therefore, it is effective to increase the weight of the video ambient degree as the size of the display 105 on which the content is assumed to be displayed is larger. For example, when a threshold value is set for the dimension of the display 105 that is assumed to display the content, and the content is assumed to be displayed by the display 105 having a dimension that exceeds the threshold, the weight of the video ambient degree is set to sound. It is effective to make it heavier than the weight of the ambient degree, that is, to make α larger than 0.5. This threshold value can be about 50 inches or 70 inches in the length of the diagonal line of the display 105, for example.

In this way, in the index associated with the content, the contribution of the degree of attention directed by the person to the video is relatively increased, so that the behavior of the ambient degree associated with the content is noticed by the user. You can get close to a sense of degree.

Note that α may be changed by an input from the operator of the playback system 1, the provider of the content, or the user. In this way, the operator of the playback system 1 can flexibly change the weight of the video ambient level and the sound ambient level. As a result, there is an advantage that it is possible to specify more flexible content suitable for the user's sense.

The video ambient level and the sound ambient level may be classified into a plurality of ranks according to the magnitude of the ambient level. In this case, the plurality of ranges of ambient degrees that define the plurality of ranks of the video ambient degree and the plurality of ranges of ambient degrees that define the plurality of ranks of the sound ambient degree do not have to coincide with each other. For example, the video ambient degree may be classified as rank A in the range of 0 to 20, and the sound ambient degree may be classified as rank A in the range of 0 to 30. That is, the video ambient degree and the sound ambient degree may be classified into a plurality of ranks within the same rank or different ambient degree ranges.

Also, the video ambient degree and the sound ambient degree may be normalized so that the minimum value and the maximum value coincide.

There can be a variety of content, but it is part of the environment, such as paintings on the wall or parts of wallpaper, floor or ceiling that are not often watched by users It may be content. Note that the content may be content that is assumed to be watched in order to acquire information on news or culture or to obtain entertainment.

Next, a UI for receiving input by the input receiving unit 430 will be described with reference to FIG.

FIG. 6 is a diagram illustrating an example of a UI displayed on the display of the generation apparatus according to the embodiment.

The input reception unit 430 displays the UI 431 on the display 406 and receives an input to the UI 431 by the input IF 405. The UI 431 receives a UI 432 for receiving a selection of a sound data file, a UI 433 for receiving a maximum volume setting, a UI 434 for receiving an average volume setting, and input of information indicating whether or not volume adjustment is permitted. It includes a UI 435 for accepting and a UI 436 for accepting input of a character string indicating the author. Note that the input reception unit 430 does not have to display all of the UIs 432 to 436 on the display 406. By displaying at least the UI 432 and the UI 433, information indicating the sound data file and maximum volume information indicating the maximum volume are displayed. Can be accepted. Further, the input receiving unit 430 may receive input of information indicating a sound data file and maximum volume information indicating the maximum volume without displaying a UI.

In the UI 432, the second user can select a sound data file stored in the storage 403 of the generation apparatus 400, for example, by pressing a reference button. The file shown in FIG. 6 is an example, and is not limited to a flac file, but may be another audio file such as an aac file, a wav file, or an mp3 file.

In UI433, the maximum volume can be set by moving the slider knob to the left or right. Instead of the UI 433, an input of a numerical value indicating the maximum volume may be accepted.

In UI434, the average volume can be set by moving the slider knob to the left or right. Instead of the UI 434, an input of a numerical value indicating the average sound volume may be accepted.

In UI435, by selecting a radio button (option button), permission or non-permission of volume adjustment can be set. Note that the UI 435 is a UI for setting permission or disapproval of volume adjustment, but (i) allows the entire volume of the sound data, (ii) allows a part of the sound data, and ( iii) It is good also as UI which sets one of not permitting volume adjustment.

The UI 436 can accept a character string input in a text box as an author. Note that a user name set in advance in the generation device 400 may be automatically input as the author.

Note that the input receiving unit 430 is realized by the input IF 405 and the display 406, for example.

The generating unit 440 generates content by associating the sound data acquired by the acquiring unit 420 with the control information that has received the input. The generation unit 440 generates content C10 as illustrated in FIG. 7 by receiving an input to the UI 431 illustrated in FIG. 6, for example. That is, the generation unit 440 generates the content C10 by associating the sound data C11 selected by the UI 432 with the control information C12 received by the UI 433 to UI 436. Note that the reproduction time is obtained from, for example, information indicating the reproduction time included in the sound data C11 by analyzing the sound data C11. The ambient degree is calculated by analyzing the sound data C11 by the method described above, for example. The generation unit 440 is realized by, for example, the CPU 401, the main memory 402, and the storage 403.

The communication unit 450 transmits the content generated by the generation unit 440 to the server 200 via the communication network 300. Note that the communication unit 450 may transmit the content to the playback device 100 via the communication network 300. The communication unit 450 is realized by, for example, the CPU 401, the main memory 402, the storage 403, and the communication IF 404.

The functional configuration of the playback device 100 will be described.

The playback apparatus 100 includes a communication unit 110 and a playback unit 130. The playback device 100 may further include a content DB (Database) 120.

The communication unit 110 acquires content from the server 200 via the communication network 300. The content is, for example, content including sound data indicating a predetermined sound, and is video content or sound content. That is, the content is content in which sound is output from the speaker 106 of the playback device 100 when played back by the playback device 100. The communication unit 110 may acquire one content from the server 200 or may acquire a plurality of contents. The communication unit 110 is realized by the CPU 101, the main memory 102, the storage 103, and the communication IF 104, for example.

The content DB 120 stores content acquired by the communication unit 110. The content DB 120 is realized by the storage 103, for example. Note that the content stored in the content DB 120 is not limited to the content acquired by the communication unit 110 but may be content stored in advance, or stored in advance with the content acquired by the communication unit 110. May be mixed with existing content. Note that the content DB 120 stores content in advance, for example, by storing content generated by the generation device 400 before factory shipment.

The playback unit 130 plays back the content acquired by the communication unit 110. Note that the reproduction unit 130 may perform streaming reproduction of the content acquired by the communication unit 110, or may read and reproduce the content from the content DB 120. The reproduction unit 130 reproduces sound data included in the content according to control information included in the content. When the content includes video data, the playback unit 130 may play sound data together with the video data.

The playback unit 130 uses, for example, the maximum volume information included in the control information included in the content acquired by the communication unit 110, and the predetermined sound of the sound data included in the content exceeds the preset volume. Playback control that does not output at volume is performed. Note that the preset volume may be set by the first user or may be set as an initial state at the time of factory shipment or the like.

Specifically, when the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume, the reproduction unit 130 performs the first reproduction control that does not reproduce the sound data associated with the control information. You may go. Further, when the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume, the playback unit 130 reduces the volume of the predetermined sound of the sound data associated with the control information to be equal to or lower than the set volume. In this state, the second reproduction control for outputting a predetermined sound may be performed. In addition, when the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume, the playback unit 130 indicates that the control information is set at a predetermined sound volume of the sound data associated with the control information. The third reproduction control for outputting a predetermined sound may be performed in a state where the average volume indicated by the included average volume information is lowered until the maximum volume is equal to or lower than the set volume. In addition, when the maximum volume indicated by the maximum volume information in the control information exceeds the set volume, the playback unit 130 is a part of the sound data associated with the control information that exceeds the set volume in the predetermined sound. The fourth playback control for outputting a predetermined sound may be performed in a state where the volume is lowered to a set volume or less.

Further, the playback unit 130 includes attribute information indicating whether or not the control information permits adjustment of the volume of the sound data when the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume. If included, the first reproduction control and the second reproduction control may be selectively performed according to the attribute information. Specifically, when the attribute information included in the control information indicates that the volume adjustment is permitted, the playback unit 130 performs the second playback control, and the attribute information included in the control information permits the volume adjustment. In the case of indicating not, the first reproduction control may be performed.

Further, when the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume, the playback unit 130 allows the control information to (i) adjust the overall volume of the sound data. A first reproduction control according to the attribute information, when including attribute information indicating that the adjustment of the volume of a part of the sound data is permitted; and (iii) the adjustment of the volume of the sound data is not permitted. The third reproduction control and the fourth reproduction control may be selectively performed. Specifically, when the attribute information included in the control information indicates that the adjustment of the overall volume of the sound data is permitted, the playback unit 130 performs the third playback control, and the attribute information included in the control information When the adjustment of the volume of a part of the sound data is permitted, the fourth reproduction control is performed. When the attribute information included in the control information indicates that the adjustment of the volume is not permitted, the first reproduction control is performed.

Here, the third reproduction control and the fourth reproduction control will be described with reference to FIGS.

FIG. 8 is a diagram showing a temporal change in the playback time of the volume of the content. FIG. 9 is a diagram showing a temporal change in the playback time of the playback volume output when the content is played back by the playback device. FIG. 10 is a diagram illustrating a temporal change in the reproduction time of the reproduction volume output when the third reproduction control is performed. FIG. 11 is a diagram illustrating a temporal change in the reproduction time of the reproduction volume output when the fourth reproduction control is performed. Note that the content shown in FIGS. 8 to 11 is an example, and the volume and playback volume of the content are examples.

As shown in FIG. 8, the volume of the content becomes the maximum volume Vol _{MAX at} time t1. Further, the average volume Vol _{AVG of the} content is indicated by a one-dot chain line in FIG.

As shown in FIG. 9, the playback unit 130 of the playback apparatus 100 has a content with a volume in which the average volume Vol _AVG of the content matches the adjusted volume, which is the volume adjusted by the first user using a remote controller or the like. Reproduction control for outputting a predetermined sound included in the sound from the speaker 106 is performed. In this case, if the playback unit 130 does not adjust the volume in the sound data of the content, the playback volume output from the speaker 106 may be larger than the set volume. In other words, when the content is played back so that the average volume Vol _AVG of the content matches the adjusted volume, the playback unit 130 may output a sound having a volume higher than the set volume from the speaker 106.

Therefore, for example, as shown in FIG. 10, the playback unit 130 sets the maximum volume of the output playback volume by lowering the average volume Vol _AVG of the content below the adjustment volume as the third playback control. You may perform the reproduction | regeneration control to which it decreases until it becomes below the volume.

Further, for example, as shown in FIG. 11, the playback unit 130 performs predetermined playback as the fourth playback control in a state in which the volume of the part where the playback volume of the content exceeds the set volume is reduced below the set volume. Playback control for outputting sound may be performed.

The reproduction unit 130 is realized by, for example, the CPU 101, the main memory 102, the storage 103, the display 105, and the speaker 106.

Next, the functional configuration of the server 200 will be described.

The server 200 includes a database 210, a comparison unit 220, a generation unit 230, and a communication unit 240.

The database 210 includes a video content DB (Database) 211 and a sound content DB (Database) 212. The video content DB 211 stores a plurality of independent video contents. The video content DB 211 stores control information corresponding to each of the plurality of video contents together with the plurality of video contents. The sound content DB 212 stores a plurality of independent sound contents. The sound content DB 212 stores control information corresponding to each of the plurality of sound contents together with the plurality of sound contents. The video content DB 211 stores video content acquired from the generation apparatus 400 by the communication unit 240 via the communication network 300. Similarly, the sound content DB 212 stores sound content acquired from the generation device 400 by the communication unit 240 via the communication network 300. Each of the video content DB 211 and the sound content DB 212 is realized by the storage 203, for example.

Note that the server 200 may calculate the ambient degree using the above method using at least one of the content stored in the database 210 and the control information. When the degree of ambient is calculated in this way, the control information may not include the degree of ambient.

The comparison unit 220 compares the video attribute information included in each of the plurality of video contents with the sound attribute information included in each of the plurality of sound contents. For example, when the genre of the video content matches the genre of the sound content, the comparison unit 220 determines that they are similar to each other. The genre may include the author of the content and the date (or month, year) when the content was created. For example, the comparison unit 220 compares the video ambient degree and the sound ambient degree using a predetermined method, and determines whether or not they are similar. When the rank to which the video ambient degree of the video content belongs and the rank to which the sound ambient degree of the sound content belong are the same among the plurality of ranks classified according to the magnitude of the ambient degree, the comparison unit 220 It is determined that the video content and the sound content are similar to each other. The comparison unit 220 calculates the video ambient degree from the metadata included in the video attribute information using the above method, and calculates the sound ambient degree from the metadata included in the sound attribute information using the above method. It may be calculated. The comparison unit 220 is realized by, for example, the CPU 201, the main memory 202, and the storage 203.

The generation unit 230 generates a plurality of contents composed of video content and sound content having attribute information similar to each other according to the comparison result by the comparison unit 220. That is, the generation unit 230 generates a plurality of contents composed of combinations of video content and sound content similar to each other. The generation unit 230 is realized by the CPU 201, the main memory 202, and the storage 203, for example.

The communication unit 240 transmits two or more contents among the plurality of contents generated by the generation unit 230 to the playback device 100 via the communication network 300. When the communication unit 240 receives a content acquisition request from the playback device 100, the communication unit 240 may transmit the content corresponding to the acquisition request to the playback device 100. The communication unit 240 is realized by the communication IF 204, for example.

Note that the server 200 does not necessarily have the comparison unit 220 and the generation unit 230. That is, the server 200 acquires video content or sound content from the generation device 400 via the communication network 300, stores the video content or sound content in the database 210, and stores the stored video content or sound content via the communication network 300. Any configuration can be used as long as it can be transmitted to.

[1-2. Operation]
Next, the operation of the reproduction system 1 will be described.

FIG. 12 is a flowchart illustrating an example of a generation method by the generation device according to the embodiment.

The acquisition unit 420 acquires sound data indicating a predetermined sound (S11).

The input receiving unit 430 receives input of control information (S12). The details of the control information received by the input receiving unit 430 are as described above.

The generating unit 440 generates content by associating the sound data acquired by the acquiring unit 420 with the control information received by the input receiving unit 430 (S13).

The communication unit 450 transmits the content generated by the generation unit 440 to the server 200 or the playback device 100 via the communication network 300 (S14).

FIG. 13 is a flowchart showing an example of a reproduction method by the reproduction apparatus according to the embodiment.

The communication unit 110 acquires content from the server 200 or the generation device 400 via the communication network 300 (S21).

The reproduction unit 130 reproduces the content acquired by the communication unit 110 according to the control information included in the content (S22). Details of the reproduction process performed by the reproduction unit 130 will be described later.

FIG. 14 is a flowchart showing an example of the details of the reproduction processing by the reproduction unit of the reproduction apparatus according to the embodiment.

The playback unit 130 determines whether or not the maximum volume indicated by the maximum volume information included in the control information included in the content acquired by the communication unit 110 exceeds the set volume (S31).

When determining that the maximum volume exceeds the set volume (Yes in S31), the reproducing unit 130 determines whether or not the attribute information included in the control information indicates that the adjustment of the volume of the sound data is permitted ( S32).

When the attribute information indicates that the adjustment of the volume of the sound data is permitted (Yes in S32), the playback unit 130 performs the second playback control in which the maximum volume of the sound data is reduced below the set volume and played back ( S33).

On the other hand, when the attribute information indicates that the adjustment of the volume of the sound data is not permitted (No in S32), the reproducing unit 130 performs the first reproduction control that does not reproduce the content (S34).

When the maximum volume does not exceed the set volume (No in S31), the playback unit 130 plays the content as it is without adjusting the volume (S35).

In the above reproduction process, steps S32 and S34 may not be performed. That is, when it is determined that the maximum volume exceeds the set volume, the second reproduction control in step S33 may be performed without confirming the attribute information of the control information.

In the above reproduction processing, the attribute information is information indicating whether or not the adjustment of the volume of the sound data is permitted. However, as described above, the attribute information includes (i) the overall volume of the sound data. It may be information indicating any one of permitting adjustment, (ii) permitting adjustment of the volume of a part of the sound data, and (iii) not permitting adjustment of the volume of the sound data. The reproduction process in this case is, for example, the process shown in FIG.

FIG. 15 is a flowchart illustrating another example of the details of the reproduction process performed by the reproduction unit of the reproduction apparatus according to the embodiment.

Note that this reproduction process is different in that steps S36 to S38 are performed instead of step S33 in the reproduction process described with reference to FIG. Therefore, steps S36 to S38 will be described.

When the attribute information indicates that the volume adjustment of the sound data is permitted (Yes in S32), the reproducing unit 130 further indicates that the attribute information only permits the volume adjustment of a part of the sound data, It is determined whether or not the volume adjustment of the entire data is permitted (S36).

When the attribute information indicates that only the volume adjustment of part of the sound data is permitted (Yes in S36), the playback unit 130 adjusts the volume of the part where the maximum volume exceeds the set volume and plays back the fourth volume. The reproduction control is performed (S37).

When the attribute information indicates that the volume control of the entire sound data is permitted (No in S36), the playback unit 130 adjusts the average volume of the sound data and decreases until the maximum volume of the sound data becomes equal to or lower than the set volume. Then, the third reproduction control for reproduction is performed (S38).

[1-3. Effect etc.]
According to the generation method according to the present embodiment, the content is used to prohibit a predetermined sound from being output by the playback apparatus 100 at a volume that exceeds the set volume set in the playback apparatus 100. Control information including maximum volume information indicating the maximum volume of a predetermined sound. For this reason, when the reproducing apparatus 100 reproduces the content, it is possible to reduce the output of the content by the reproducing apparatus 100 at a volume exceeding the set volume. Therefore, when the playback device 100 plays back content, it is possible to reduce discomfort that the playback device 100 gives to the user.

Thus, for example, in the generation method, even when content with a high degree of ambient is generated, since the control information as described above is included, the playback device 100 outputs a predetermined sound. However, the output of the predetermined sound exceeding the set sound volume can be reduced. That is, the playback apparatus 100 can reduce the output of content sound at a large volume that is not suitable for ambient content.

[1-4. Modified example]
[1-4-1. Modification 1]
In the above embodiment, the content is attribute information included in the control information, which is attribute information indicating whether or not to allow adjustment of the volume of the sound data, or (i) adjustment of the overall volume of the sound data is permitted. , (Ii) includes attribute information indicating that the adjustment of the volume of a part of the sound data is permitted and (iii) the adjustment of the volume of the sound data is not permitted. However, the present invention is not limited to this. In place of the attribute information, attribute information indicating that the first reproduction control is performed, attribute information indicating that the second reproduction control is performed, attribute information indicating that the third reproduction control is performed, Any one of the attribute information indicating that the fourth reproduction control is performed may be included. When reproducing the content including the attribute information, the reproduction device 100 reproduces the reproduction control indicated by the attribute information, that is, the first reproduction control, the second reproduction control, the third reproduction control, and the fourth reproduction. Perform any one of the controls.

[1-4-2. Modification 2]
When reproducing the content, the reproducing apparatus 100 in the above embodiment may display an image related to the ambient degree together with the content. The image may include at least one of an image indicating the ambient degree of the content and an image indicating the range of the ambient degree received by a receiving unit such as a remote controller (not shown).

By displaying an image related to the ambient degree together with the content on the display 105, the user visually recognizes the image together with the content being reproduced. The user can recognize the ambient level of the currently reproduced content by visually recognizing an image indicating the ambient level. Further, the user can recognize the range of the ambient degree designated by the user by visually recognizing the image indicating the range of the ambient degree. By recognizing these, for example, the user can instruct the playback device 100 to change the specified ambient degree higher or lower than the current degree through the reception unit.

Note that, instead of presenting an image relating to the ambient degree, or together, a sound relating to the ambient degree may be output by the speaker 106, and the same effect as described above can be obtained.

[1-5. Other effects]
Further, according to the control method of the playback device shown in the present embodiment and this modification, the playback device specifies the index associated with the content within the range of the index, and thereby the content to be played back Can be specified. At that time, the user need not recall the search key. The user can specify the content to be played back by the playback device simply by specifying the rough value of the index associated with the content within the range. In this way, the playback device enables more flexible content specification. Also, since flexible content specification is possible, the problem of increase in processing load and power consumption of the playback device when determination of content reflecting the user's intention fails can be avoided.

Also, the playback device enables more flexible content specification by using, as a specific index, an estimated index that indicates the degree of attention that the user directs to the content being played back.

In addition, the playback device, the server, or the generation device calculates an index associated with the content based on the degree of attention directed by the user to each of the video and the sound included in the content. As a result, the content index can be calculated in consideration of the video and sound included in the content.

Also, the playback device, server, or generation device calculates an index associated with the content by a weighted average obtained by increasing the weight of the sound index of the video index and the sound index. In general, it is relatively easy for a person not to pay attention to the video presented by the playback device, but it is not easy to intentionally not pay attention to the sound. Absent. In other words, it is difficult to intentionally turn away from the sound presented by the playback device. Therefore, in the index linked to the content, the contribution of the degree of attention directed by the person to the sound is relatively increased, so that the index used for specifying the content can be adapted to the sense of the degree of attention directed by the user. Index.

Also, the playback device, server, or generation device calculates an index associated with the content by a weighted average obtained by increasing the weight of the video index among the video index and the sound index. In general, when the size of a display screen for displaying content is large, it is difficult for the user to distract from the video. In such a case, in the index associated with the content, the index of the index used for specifying the content is set with respect to the degree of the attention directed by the user by relatively increasing the contribution of the degree of attention directed by the person to the video. It can be an indicator that matches the sense of

Also, the playback device, server, or generation device can calculate the video index by specifically using the brightness, saturation, hue, or scene change mode of the video included in the content.

Also, the playback device, server, or generation device can calculate the sound index by specifically using the volume, frequency distribution, or volume change mode included in the content.

Also, the playback device, server, or generation device can cause the user to recognize the content index by presenting the index associated with the content along with the content being played back to the user. Then, it is possible to cause the user to make a determination as to whether or not the content that the user wants to present on the playback apparatus is compatible with the index range designated by the user.

In addition, when the playback device, the server, or the generation device plays back both video content and sound content, the index of the video content and sound content to be played back may be included in the range specified by the user. it can. Thus, the user can play both the video content and the sound content that are estimated to have the same level of attention by the playback device.

Also, the playback device can cause the content provider to recognize the index associated with the content by presenting the index when the content is stored in the server in advance.

Also, the playback device can make the content provider recognize the adjusted content index after adjusting the content. The content provider recognizes the index of the adjusted content, confirms the result of the adjustment made to the content provided by itself, and determines whether to store it in the server based on the result Can take action.

(Other embodiments)
In each of the above embodiments, each component is realized by executing a software program suitable for each component, but may be configured by dedicated hardware. Each component may be realized by a program execution unit such as a CPU or a processor reading and executing a software program recorded on a recording medium such as a hard disk or a semiconductor memory. Here, the software that realizes the reproduction method of each of the above embodiments is the following program.

That is, this program is a generation method for generating content on a computer using a computer, acquires sound data indicating a predetermined sound, and the predetermined sound indicated by the acquired sound data is reproduced by a playback device Control information used for prohibiting output by the playback device at a volume exceeding the set volume set in the control information, including control information including maximum volume information indicating the maximum volume of the predetermined sound. A generation method for generating content by receiving an input and associating the acquired sound data with the control information having received the input is executed.

Further, this program is a reproduction method by a reproduction system including a generation device that generates content and a reproduction device that acquires the content generated by the generation device and reproduces the acquired content. Then, the generation device acquires sound data indicating a predetermined sound, and the predetermined sound indicated by the acquired sound data has a volume exceeding a set volume set in the reproduction device. Control information used for prohibiting the output of the sound, and receiving the input of control information including maximum volume information indicating the maximum volume of the predetermined sound, the acquired sound data, and the received control Content is generated by associating with information, and the playback device acquires the content via a communication network. Playback that does not output the predetermined sound of the sound data included in the content at a volume exceeding the preset volume using the maximum volume information included in the control information included in the acquired content Let the method run.

As described above, the generation method, the generation device, the reproduction method, and the reproduction system according to one or more aspects of the present invention have been described based on the embodiment. However, the present invention is not limited to this embodiment. Absent. Unless it deviates from the gist of the present invention, one or more of the present invention may be applied to various modifications that can be conceived by those skilled in the art, or forms constructed by combining components in different embodiments. It may be included within the scope of the embodiments.

For example, in the above-described embodiment, the sound ambient degree is described based on the volume of the sound of the content, the frequency distribution of the sound, or the change of the volume. However, the present invention is not limited to this. Among the sound frequency characteristics, the approximation with the so-called “1 / f fluctuation” characteristic, the number of overtone components, the regularity of the timbre waveform (frequency of several Hz or less) Area) and the like.

Note that the sound ambient level is an index at the research stage compared to the video ambient level, but the mid-range sound around 200 Hz is equivalent to vocals and human speech, and is likely to be heard by humans. I know it. Therefore, it is considered that the degree of attention directed by the user increases, and the degree of consciousness increases (the degree of ambient decreases).

Human beings live while listening to sounds in a wide band that exist in nature (not artificially processed), but the brain always processes these wide band sounds unconsciously. The human brain discriminates unusual sounds using clues such as overtone structure changes and subtle delays, and the degree of attention increases in order to detect danger. That is, it is considered that the degree of consciousness increases (the degree of ambient decreases).

In addition, the human brain tries to understand what is different from nature by unknowingly complementing it, so when listening to sounds that are different from the natural world, it will use brain resources, increasing the degree of consciousness (the degree of ambient is increased). It is thought that). Therefore, music that is composed to increase the degree of user's attention is not only highly conscious (low ambient), but also sounds that exist in the natural world, such as river buzz, can be recorded in a recording environment (such as a microphone or Depending on the performance of the recording device, the degree of ambient may be reduced.

The present disclosure can be applied to a generation method that can reduce discomfort given to the user by the playback device.

1 playback system 100 playback device 101 CPU
102 Main memory 103 Storage 104 Communication IF
105 Display 106 Speaker 110 Communication Unit 120 Content DB
130 playback unit 200 server 201 CPU
202 Main memory 203 Storage 204 Communication IF
210 Database 211 Video content DB
212 Sound content DB
220 Comparison Unit 230 Generation Unit 240 Communication Unit 300 Communication Network 400 Generation Device 401 CPU
402 Main memory 403 Storage 404 Communication IF
405 Input IF
406 Display 410 Database 420 Acquisition unit 430 Input reception unit 431-436 UI
440 Generation unit 450 Communication unit C10 Content C11 Sound data C12 Control information

Claims

A generation method for generating content using a computer,
Acquire sound data indicating a given sound,
Control information used for prohibiting the predetermined sound indicated by the acquired sound data from being output by the playback device at a volume exceeding a set volume set in the playback device; Accepts input of control information including maximum volume information indicating the maximum volume of a given sound,
A generation method for generating content by associating the acquired sound data with the control information that has received the input.
In accepting the input, accepting an input of control information further including attribute information indicating whether or not to allow adjustment of the volume of the sound data;
The control information is
When the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume,
(I) When the attribute information included in the control information indicates that the adjustment of the volume is permitted, the volume of the predetermined sound of the sound data associated with the control information is reduced to the set volume or less. In the state, the playback device performs playback control to output the predetermined sound,
(Ii) Information for prohibiting reproduction of the sound data associated with the control information by the reproduction device when the attribute information included in the control information indicates that the volume adjustment is not permitted. The generation method according to claim 1.
In accepting the input, (i) allow adjustment of the overall volume of the sound data, (ii) allow adjustment of the volume of a part of the sound data, and (iii) adjust the volume of the sound data. And receiving control information further including attribute information indicating that the sound data is not permitted, and input of control information further including average volume information indicating an average volume of the predetermined sound of the sound data,
The control information is
When the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume,
(I) When the attribute information included in the control information indicates that the adjustment of the overall volume of the sound data is permitted, the control is performed at the predetermined sound volume of the sound data associated with the control information. The predetermined sound is output in a state where the average volume indicated by the average volume information included in the information is reduced until the maximum volume indicated by the maximum volume information included in the control information is equal to or lower than the set volume. Causing the playback device to perform playback control,
(Ii) When the attribute information included in the control information permits adjustment of the volume of a part of the sound data, the attribute information exceeds the set volume in the predetermined sound of the sound data associated with the control information In the state where the volume of the part is lowered below the set volume, the playback apparatus is caused to perform playback control for outputting the predetermined sound,
(Iii) Information for prohibiting reproduction of the sound data associated with the control information by the reproduction device when the attribute information included in the control information indicates that the volume adjustment is not permitted. The generation method according to claim 1.
The control information indicates that, when the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume, the sound data associated with the control information is reproduced by the reproduction device. The generation method according to claim 1, wherein the generation method is information for prohibition.
When the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume, the control information sets the volume of the predetermined sound of the sound data associated with the control information. The generation method according to claim 1, wherein the generation method is information for causing the reproduction apparatus to perform reproduction control for outputting the predetermined sound in a state where the volume is reduced to a volume or less.
In receiving the input, receiving input of control information further including average volume information indicating an average volume of the predetermined sound of the sound data;
When the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume, the control information is the volume of the predetermined sound of the sound data associated with the control information. Causing the playback apparatus to perform playback control for outputting the predetermined sound in a state where the average volume indicated by the average volume information included in the control information is reduced until the maximum volume is equal to or lower than the set volume. The generation method according to claim 5.
When the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume, the control information indicates the set volume in the predetermined sound of the sound data associated with the control information. The generation method according to claim 5, wherein the information is information for causing the reproduction apparatus to perform reproduction control for outputting the predetermined sound in a state in which the volume of the exceeding portion is reduced below the set volume.
A generation device for generating content,
An acquisition unit for acquiring sound data indicating a predetermined sound;
Control information used for prohibiting the predetermined sound indicated by the sound data acquired by the acquisition unit from being output by the playback device at a volume exceeding the set volume set in the playback device. An input receiving unit that receives input of control information including maximum volume information indicating the maximum volume of the predetermined sound;
A generation device comprising: a generation unit that generates content by associating the sound data acquired by the acquisition unit with the control information received by the input reception unit.
A playback method by a playback system comprising: a generation device that generates content; and a playback device that acquires the content generated by the generation device and plays back the acquired content,
In the generator,
Acquire sound data indicating a given sound,
Control information used to prohibit the predetermined sound indicated by the acquired sound data from being output by the playback device at a volume exceeding a set volume set in the playback device; Receiving input of control information including maximum volume information indicating the maximum volume of the predetermined sound;
A content is generated by associating the acquired sound data with the received control information,
In the playback device,
Acquiring the content via a communication network;
Using the maximum volume information included in the control information included in the acquired content, the predetermined sound of the sound data included in the content is not output at a volume exceeding the preset volume. Method.
A playback system comprising: a generation device that generates content; and a playback device that acquires the content generated by the generation device and plays back the acquired content,
The generator is
An acquisition unit for acquiring sound data indicating a predetermined sound;
Control used to prohibit the predetermined sound indicated by the sound data acquired by the acquisition unit from being output by the playback device at a volume that exceeds a set volume set in the playback device An input receiving unit that receives input of control information including maximum volume information indicating the maximum volume of the predetermined sound,
A generation unit that generates content by associating the sound data acquired by the acquisition unit with the control information received by the input reception unit;
The playback device
An acquisition unit for acquiring the content via a communication network;
Using the maximum volume information included in the control information included in the content acquired by the acquisition unit, the predetermined sound of the sound data included in the content has exceeded the preset volume set in advance And a playback unit that does not output at a volume.