WO2018155353A1 - Procédé et dispositif de génération, procédé et système de reproduction - Google Patents

Procédé et dispositif de génération, procédé et système de reproduction Download PDF

Info

Publication number
WO2018155353A1
WO2018155353A1 PCT/JP2018/005615 JP2018005615W WO2018155353A1 WO 2018155353 A1 WO2018155353 A1 WO 2018155353A1 JP 2018005615 W JP2018005615 W JP 2018005615W WO 2018155353 A1 WO2018155353 A1 WO 2018155353A1
Authority
WO
WIPO (PCT)
Prior art keywords
volume
control information
content
sound
sound data
Prior art date
Application number
PCT/JP2018/005615
Other languages
English (en)
Japanese (ja)
Inventor
旭 谷口
敦宏 辻
幸 裕弘
坂井 剛
羊佑 塩田
浩充 森下
Original Assignee
パナソニックIpマネジメント株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from JP2017189864A external-priority patent/JP2020065096A/ja
Application filed by パナソニックIpマネジメント株式会社 filed Critical パナソニックIpマネジメント株式会社
Publication of WO2018155353A1 publication Critical patent/WO2018155353A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K15/00Acoustics not otherwise provided for
    • G10K15/02Synthesis of acoustic waves
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K15/00Acoustics not otherwise provided for
    • G10K15/04Sound-producing devices
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/10Digital recording or reproducing
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85Assembly of content; Generation of multimedia applications
    • H04N21/854Content authoring
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones

Definitions

  • the present disclosure relates to a generation method and a generation device for generating content, a playback method and a playback system for playing back content.
  • Patent Document 1 discloses a video distribution device and a video reproduction device in VOD (Video On Demand) distribution.
  • the present disclosure provides a generation method and the like that can reduce discomfort given to the user by the playback device.
  • the generation method is a generation method for generating content using a computer, acquiring sound data indicating a predetermined sound, and setting the predetermined sound indicated by the acquired sound data to a playback device Input of control information including maximum volume information indicating the maximum volume of the predetermined sound, which is control information used for prohibiting output by the playback device at a volume exceeding a set volume that is set Content is generated by associating the received and acquired sound data with the control information that has received the input.
  • the method according to the present disclosure can reduce discomfort given to the user by the playback device.
  • FIG. 1 is a schematic diagram of a reproduction system according to an embodiment.
  • FIG. 2 is a block diagram illustrating an example of a hardware configuration of the playback device.
  • FIG. 3 is a block diagram illustrating an example of the hardware configuration of the server.
  • FIG. 4 is a block diagram illustrating an example of a hardware configuration of the generation apparatus.
  • FIG. 5 is a block diagram illustrating an example of a functional configuration of the reproduction system according to the embodiment.
  • FIG. 6 is a diagram illustrating an example of a UI displayed on the display of the generation apparatus according to the embodiment.
  • FIG. 7 is a diagram illustrating an example of a content configuration.
  • FIG. 8 is a diagram showing a temporal change in the playback time of the volume of the content.
  • FIG. 1 is a schematic diagram of a reproduction system according to an embodiment.
  • FIG. 2 is a block diagram illustrating an example of a hardware configuration of the playback device.
  • FIG. 3 is a block diagram illustrating an example of
  • FIG. 9 is a diagram showing a temporal change in the playback time of the playback volume output when the content is played back by the playback device.
  • FIG. 10 is a diagram illustrating a temporal change in the reproduction time of the reproduction volume output when the third reproduction control is performed.
  • FIG. 11 is a diagram illustrating a temporal change in the reproduction time of the reproduction volume output when the fourth reproduction control is performed.
  • FIG. 12 is a flowchart illustrating an example of a generation method by the generation device according to the embodiment.
  • FIG. 13 is a flowchart illustrating an example of a reproduction method by the reproduction apparatus according to the embodiment.
  • FIG. 14 is a flowchart illustrating an example of details of the reproduction processing by the reproduction unit of the reproduction apparatus according to the embodiment.
  • FIG. 15 is a flowchart illustrating another example of the details of the reproduction process performed by the reproduction unit of the reproduction apparatus according to the embodiment.
  • FIG. 1 is a schematic diagram of a reproduction system according to an embodiment.
  • a playback device 100, a server 200, a communication network 300, and a generation device 400 are shown.
  • the playback system 1 includes the playback device 100 and the server 200 among these components.
  • the playback system 1 may further include a generation device 400.
  • a plurality of playback devices 100 may be connected to the communication network 300.
  • a plurality of generation devices 400 may be connected to the communication network 300.
  • the playback system 1 is a system for providing a first user with content configured by a combination of independent video content and sound content from the server 200 to the playback device 100.
  • One playback device 100 may correspond to one first user or a plurality of first users.
  • the reproduction system 1 includes a plurality of reproduction apparatuses 100
  • a plurality of first users may correspond to each of the plurality of reproduction apparatuses 100 in a one-to-one correspondence or a one-to-many correspondence. Also good.
  • the plurality of playback devices 100 may correspond to one first user.
  • one second user may correspond to one generation device 400, or a plurality of second users may correspond to the one generation device 400.
  • each of the plurality of generation devices 400 may correspond to a plurality of second users on a one-to-one basis or on a one-to-many basis. Also good. Further, the plurality of generation devices 400 may correspond to one second user. For example, video content or sound content is provided to the server 200 via the generation device 400 from a second user such as a content creator.
  • FIG. 2 is a block diagram showing an example of the hardware configuration of the playback device.
  • the playback device 100 includes a CPU 101 (Central Processing Unit), a main memory 102, a storage 103, a communication IF (Interface) 104, a display 105, and a speaker 106 as hardware configurations.
  • a CPU 101 Central Processing Unit
  • main memory 102 main memory
  • main memory 102 main memory
  • storage 103 storage
  • communication IF (Interface) 104 communication IF
  • display 105 display
  • speaker 106 speaker
  • the CPU 101 is a processor that executes a control program stored in the storage 103 or the like.
  • the main memory 102 is a volatile storage area used as a work area used when the CPU 101 executes a control program.
  • the storage 103 is a non-volatile storage area that holds a control program, content, and the like.
  • the communication IF 104 is a communication interface that communicates with the server 200 via the communication network 300.
  • the communication IF 104 is, for example, a wired LAN interface.
  • the communication IF 104 may be a wireless LAN interface.
  • the communication IF 104 is not limited to a LAN interface, and may be any communication interface as long as it can establish a communication connection with the communication network 300.
  • the display 105 is a display device that displays a processing result in the CPU 101.
  • the display 105 displays, for example, video obtained by playing video content.
  • the display 105 is, for example, a liquid crystal display or an organic EL display.
  • Speaker 106 outputs the processing result in CPU 101.
  • the speaker 106 outputs, for example, sound or music obtained by playing sound content.
  • the hardware configuration of the server 200 will be described with reference to FIG.
  • FIG. 3 is a block diagram showing an example of the hardware configuration of the server.
  • the server 200 includes a CPU 201 (Central Processing Unit), a main memory 202, a storage 203, and a communication IF (Interface) 204 as hardware configurations.
  • CPU 201 Central Processing Unit
  • main memory 202 main memory
  • storage 203 main memory
  • communication IF Interface
  • the CPU 201 is a processor that executes a control program stored in the storage 203 or the like.
  • the main memory 202 is a volatile storage area used as a work area used when the CPU 201 executes a control program.
  • the storage 203 is a non-volatile storage area that holds a control program, content, and the like.
  • the communication IF 204 is a communication interface that communicates with the playback device 100 or the generation device 400 via the communication network 300.
  • the communication IF 204 is, for example, a wired LAN interface.
  • the communication IF 204 may be a wireless LAN interface.
  • the communication IF 204 is not limited to a LAN interface, and may be any communication interface as long as it can establish a communication connection with the communication network 300.
  • the hardware configuration of the generation device 400 will be described with reference to FIG.
  • FIG. 4 is a block diagram illustrating an example of a hardware configuration of the generation apparatus.
  • the generation apparatus 400 includes a CPU 401 (Central Processing Unit), a main memory 402, a storage 403, a communication IF (Interface) 404, an input IF (Interface) 405, as hardware configurations. And a display 406.
  • a CPU 401 Central Processing Unit
  • main memory 402 main memory
  • storage 403 main memory
  • communication IF (Interface) 404 main memory
  • input IF (Interface) 405 input IF (Interface) 405
  • the CPU 401 is a processor that executes a control program stored in the storage 403 or the like.
  • the main memory 402 is a volatile storage area used as a work area used when the CPU 401 executes a control program.
  • the storage 403 is a non-volatile storage area that holds a control program, content, and the like.
  • the communication IF 404 is a communication interface that communicates with the server 200 via the communication network 300.
  • the communication IF 404 is, for example, a wired LAN interface.
  • the communication IF 404 may be a wireless LAN interface.
  • the communication IF 404 is not limited to a LAN interface, and may be any communication interface as long as it can establish a communication connection with the communication network 300.
  • the input IF 405 is an input device such as a numeric keypad, a keyboard, and a mouse.
  • the display 406 is a display device that displays a processing result in the CPU 401, for example.
  • the display 406 displays, for example, a UI (User Interface) for receiving input from the input IF 405.
  • the display 406 is, for example, a liquid crystal display or an organic EL display.
  • FIG. 5 is a block diagram illustrating an example of a functional configuration of the reproduction system according to the embodiment.
  • the generation apparatus 400 includes a database (DB) 410, an acquisition unit 420, an input reception unit 430, a generation unit 440, and a communication unit 450.
  • DB database
  • the database 410 stores video data that is a source of video content or sound data that is a source of sound content.
  • the database 410 is realized by the storage 403, for example.
  • the acquisition unit 420 acquires sound data indicating a predetermined sound from the database 410 in response to the input by the second user received by the input reception unit 430.
  • the acquisition unit 420 may acquire video data from the database 410 according to the input by the second user received by the input reception unit 430.
  • the acquisition unit 420 is not limited to acquiring sound data or video data from the database 410, but may be acquired from another information processing apparatus via the communication network 300 using the communication unit 450. Alternatively, it may be acquired directly from another information processing apparatus connected by wire or wireless. Other information processing apparatuses in this case are, for example, PCs (Personal Computers), servers, smartphones, tablet terminals, video cameras, digital cameras, IC recorders, and the like.
  • the acquisition unit 420 is realized by the CPU 401, the main memory 402, and the storage 403, for example.
  • the input reception unit 430 receives an input by the second user. Specifically, the input receiving unit 430 receives an input for the second user to generate content from video data or sound data stored in the database 410. The input receiving unit 430 receives input of content control information as input for generating content.
  • the content control information received by the input receiving unit 430 includes, for example, a predetermined sound indicated by the sound data acquired by the acquisition unit 420 at a volume that exceeds a set volume set in the playback apparatus 100. Is used for prohibiting the output of the sound, and includes maximum volume information indicating the maximum volume of a predetermined sound.
  • the content control information received by the input receiving unit 430 may further include, for example, attribute information indicating whether or not the adjustment of the volume of the sound data is permitted.
  • the control information in this case is information that causes the playback device 100 to perform the following playback control when the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume.
  • the reproduction control when the attribute information included in the control information indicates that the volume adjustment is permitted, the volume of the predetermined sound of the sound data associated with the control information is reduced to a setting volume or less. This is the second reproduction control for outputting a predetermined sound.
  • the reproduction control when the attribute information included in the control information indicates that the adjustment of the volume is not permitted, the first reproduction that prohibits the reproduction apparatus 100 from reproducing the sound data associated with the control information. Control.
  • the reproduction apparatus 100 reproduces content according to this control information, if the attribute information indicates that the volume adjustment is permitted, the reproduction apparatus 100 performs the second reproduction control, and the attribute information does not permit the volume adjustment.
  • the first reproduction control is performed. In this way, it is possible to cause the playback apparatus 100 to selectively switch between the first playback control and the second playback control according to the attribute information set by the second user.
  • the second reproduction control may include third reproduction control and fourth reproduction control. That is, the third regeneration control may be performed instead of the second regeneration control, or the fourth control may be performed.
  • the content control information received by the input receiving unit 430 includes (i) allowing adjustment of the overall volume of the sound data, (ii) allowing adjustment of the volume of a part of the sound data, and (iii) sound. It may further include attribute information indicating that the adjustment of the volume of the data is not permitted.
  • the control information in this case is information that causes the playback device 100 to perform the following playback control when the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume. In this case, when the attribute information included in the control information indicates that the adjustment of the overall volume of the sound data is permitted, the reproduction control is performed at a predetermined sound volume of the sound data associated with the control information.
  • Third reproduction control for outputting a predetermined sound in a state where the average volume indicated by the average volume information included in the information is reduced until the maximum volume indicated by the maximum volume information included in the control information is equal to or lower than the set volume. It is.
  • the reproduction control when the attribute information included in the control information permits the adjustment of the volume of a part of the sound data, the reproduction control is performed on the part of the sound data associated with the control information that exceeds the set volume of the predetermined sound.
  • This is the fourth reproduction control for outputting a predetermined sound in a state where the volume is lowered below the set volume.
  • the attribute information included in the control information indicates that the adjustment of the volume is not permitted, the first reproduction that prohibits the reproduction apparatus 100 from reproducing the sound data associated with the control information. Control.
  • the sound data associated with the control information is received by the playback device 100 when the content reception control information received by the input reception unit 430 exceeds the set volume. It may be information for prohibiting reproduction.
  • the reproduction device 100 performs the first reproduction control not to reproduce the content whose maximum volume exceeds the set volume. For this reason, it can suppress that the predetermined
  • the content control information received by the input receiving unit 430 is, for example, a predetermined volume of sound data associated with the control information when the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume.
  • the information may be information for causing the playback device 100 to perform playback control for outputting a predetermined sound in a state where the volume of the sound is lowered below a set volume.
  • the reproducing device 100 performs second reproduction control for reproducing the content by reducing the volume of the content whose maximum volume exceeds the set volume to be equal to or lower than the set volume. For this reason, it can suppress that the predetermined
  • the content control information received by the input receiving unit 430 may further include, for example, average volume information indicating the average volume of a predetermined sound of the sound data.
  • the control information in this case is the control information at a predetermined sound volume of the sound data associated with the control information when the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume. Even if it is information for causing the playback apparatus 100 to perform playback control for outputting a predetermined sound in a state where the average volume indicated by the average volume information included in the information is reduced until the maximum volume is lower than or equal to the set volume. Good.
  • the reproducing device 100 When reproducing the content according to the control information, the reproducing device 100 performs third reproduction control for reproducing the content by reducing the average volume of the content until the maximum volume becomes equal to or lower than the set volume. For this reason, it can suppress that the predetermined
  • the content control information received by the input receiving unit 430 is, for example, a predetermined sound of sound data associated with the control information when the maximum volume indicated by the maximum volume information in the control information exceeds the set volume. This is information for causing the playback apparatus 100 to perform playback control for outputting a predetermined sound in a state where the volume of the portion exceeding the set volume in is reduced below the set volume.
  • This control information is information for reproducing the content without adjusting the volume of a portion of the sound data associated with the control information that does not exceed the set volume of the predetermined sound.
  • the reproduction device 100 performs fourth reproduction control for reproducing the content by reducing the volume of the portion of the predetermined sound that exceeds the set volume below the set volume. . For this reason, it can suppress that the predetermined
  • the content control information may include, for example, content metadata (that is, attribute information) in addition to the information described above.
  • content metadata that is, attribute information
  • One set of metadata exists for one content, and includes information on reproduction time, author, ambient level, video ambient level, or sound ambient level, and content genre. Details of the ambient degree, the video ambient degree, and the sound ambient degree will be described later.
  • the playback time is information indicating the length of time when the content is played back.
  • the author is information indicating the author of the content, and includes information including the author's name and contact information.
  • the ambient degree is an ambient degree associated with the content.
  • the video ambient degree is the ambient degree associated with the video part included in the content.
  • the sound ambient degree is an ambient degree associated with a sound part included in the content.
  • the ambient degree of content and the like can be set by metadata.
  • Metadata is created in a predetermined format.
  • the index is obtained by analyzing the metadata according to the metadata format.
  • the index is an index associated with the content, and is an index expressed by a continuous value.
  • An example of the index is an estimated index that indicates the degree of attention the user is directed to the content being played back. More specifically, the index is an index that is an index having a smaller value as the degree of attention directed to the content being played by the user is greater, or the user is directed to the content being played. As the degree of attention directed is greater, an index having a larger value may be employed.
  • the former is also referred to as an ambient level and the latter is also referred to as a conscious level.
  • the degree of attention directed by the user increases, for example, it is more likely to continue watching the screen on which the video is displayed from the beginning to the end of the playback time of the content, and concentrate on viewing the output sound. It can be said that it is suitable.
  • the index may include brightness, saturation, hue, or the like that is an index related to the color of the video included in the content being played back, or volume or frequency distribution that is an index of the sound included in the content being played back Etc. may be included. Further, the index may include an index calculated by a predetermined calculation method from the plurality of indexes.
  • the ambient degree is an index expressed as a continuous value from 0 to 100, for example.
  • the degree of ambient is 0, it means that the degree of attention estimated to be directed by the user is the largest, and when the degree of ambient is 100, the degree of attention estimated to be directed by the user is the smallest. Then.
  • the ambient degree associated with the content can be calculated from the video ambient degree that is the ambient degree associated with the video part of the content and the sound ambient degree that is the ambient degree associated with the sound part of the content.
  • the video ambient degree is an example of a video index.
  • the sound ambient degree is an example of a sound index.
  • the video ambient degree may be calculated based on, for example, the brightness, saturation or hue of the video of the content, or the scene change mode. More specifically, it is calculated as follows.
  • the sound ambient degree may be calculated based on, for example, the volume of the sound of the content, the frequency distribution of the sound, or the change in volume. More specifically, it is calculated as follows.
  • any method can be adopted, but for example, an average or a weighted average can be used.
  • the weighted average weight is in the range from 0 to 1 and the video ambient degree weight is ⁇
  • the ambient degree of the content is expressed as (Equation 1) below.
  • Ambient degree of content ⁇ x (Video ambient degree) + (1- ⁇ ) x (Sound ambient degree) (Formula 1)
  • the weighting of the video ambient degree and the sound ambient is determined as follows, for example.
  • the weight of the video ambient degree is set to sound. It is effective to make it heavier than the weight of the ambient degree, that is, to make ⁇ larger than 0.5.
  • This threshold value can be about 50 inches or 70 inches in the length of the diagonal line of the display 105, for example.
  • may be changed by an input from the operator of the playback system 1, the provider of the content, or the user.
  • the operator of the playback system 1 can flexibly change the weight of the video ambient level and the sound ambient level. As a result, there is an advantage that it is possible to specify more flexible content suitable for the user's sense.
  • the video ambient level and the sound ambient level may be classified into a plurality of ranks according to the magnitude of the ambient level.
  • the plurality of ranges of ambient degrees that define the plurality of ranks of the video ambient degree and the plurality of ranges of ambient degrees that define the plurality of ranks of the sound ambient degree do not have to coincide with each other.
  • the video ambient degree may be classified as rank A in the range of 0 to 20
  • the sound ambient degree may be classified as rank A in the range of 0 to 30. That is, the video ambient degree and the sound ambient degree may be classified into a plurality of ranks within the same rank or different ambient degree ranges.
  • the video ambient degree and the sound ambient degree may be normalized so that the minimum value and the maximum value coincide.
  • content There can be a variety of content, but it is part of the environment, such as paintings on the wall or parts of wallpaper, floor or ceiling that are not often watched by users It may be content. Note that the content may be content that is assumed to be watched in order to acquire information on news or culture or to obtain entertainment.
  • FIG. 6 is a diagram illustrating an example of a UI displayed on the display of the generation apparatus according to the embodiment.
  • the input reception unit 430 displays the UI 431 on the display 406 and receives an input to the UI 431 by the input IF 405.
  • the UI 431 receives a UI 432 for receiving a selection of a sound data file, a UI 433 for receiving a maximum volume setting, a UI 434 for receiving an average volume setting, and input of information indicating whether or not volume adjustment is permitted. It includes a UI 435 for accepting and a UI 436 for accepting input of a character string indicating the author. Note that the input reception unit 430 does not have to display all of the UIs 432 to 436 on the display 406.
  • the input receiving unit 430 may receive input of information indicating a sound data file and maximum volume information indicating the maximum volume without displaying a UI.
  • the second user can select a sound data file stored in the storage 403 of the generation apparatus 400, for example, by pressing a reference button.
  • the file shown in FIG. 6 is an example, and is not limited to a flac file, but may be another audio file such as an aac file, a wav file, or an mp3 file.
  • the maximum volume can be set by moving the slider knob to the left or right. Instead of the UI 433, an input of a numerical value indicating the maximum volume may be accepted.
  • the average volume can be set by moving the slider knob to the left or right.
  • an input of a numerical value indicating the average sound volume may be accepted.
  • UI435 by selecting a radio button (option button), permission or non-permission of volume adjustment can be set.
  • UI 435 is a UI for setting permission or disapproval of volume adjustment, but (i) allows the entire volume of the sound data, (ii) allows a part of the sound data, and ( iii) It is good also as UI which sets one of not permitting volume adjustment.
  • the UI 436 can accept a character string input in a text box as an author. Note that a user name set in advance in the generation device 400 may be automatically input as the author.
  • the input receiving unit 430 is realized by the input IF 405 and the display 406, for example.
  • the generating unit 440 generates content by associating the sound data acquired by the acquiring unit 420 with the control information that has received the input.
  • the generation unit 440 generates content C10 as illustrated in FIG. 7 by receiving an input to the UI 431 illustrated in FIG. 6, for example. That is, the generation unit 440 generates the content C10 by associating the sound data C11 selected by the UI 432 with the control information C12 received by the UI 433 to UI 436.
  • the reproduction time is obtained from, for example, information indicating the reproduction time included in the sound data C11 by analyzing the sound data C11.
  • the ambient degree is calculated by analyzing the sound data C11 by the method described above, for example.
  • the generation unit 440 is realized by, for example, the CPU 401, the main memory 402, and the storage 403.
  • the communication unit 450 transmits the content generated by the generation unit 440 to the server 200 via the communication network 300. Note that the communication unit 450 may transmit the content to the playback device 100 via the communication network 300.
  • the communication unit 450 is realized by, for example, the CPU 401, the main memory 402, the storage 403, and the communication IF 404.
  • the functional configuration of the playback device 100 will be described.
  • the playback apparatus 100 includes a communication unit 110 and a playback unit 130.
  • the playback device 100 may further include a content DB (Database) 120.
  • the communication unit 110 acquires content from the server 200 via the communication network 300.
  • the content is, for example, content including sound data indicating a predetermined sound, and is video content or sound content. That is, the content is content in which sound is output from the speaker 106 of the playback device 100 when played back by the playback device 100.
  • the communication unit 110 may acquire one content from the server 200 or may acquire a plurality of contents.
  • the communication unit 110 is realized by the CPU 101, the main memory 102, the storage 103, and the communication IF 104, for example.
  • the content DB 120 stores content acquired by the communication unit 110.
  • the content DB 120 is realized by the storage 103, for example.
  • the content stored in the content DB 120 is not limited to the content acquired by the communication unit 110 but may be content stored in advance, or stored in advance with the content acquired by the communication unit 110. May be mixed with existing content.
  • the content DB 120 stores content in advance, for example, by storing content generated by the generation device 400 before factory shipment.
  • the playback unit 130 plays back the content acquired by the communication unit 110.
  • the reproduction unit 130 may perform streaming reproduction of the content acquired by the communication unit 110, or may read and reproduce the content from the content DB 120.
  • the reproduction unit 130 reproduces sound data included in the content according to control information included in the content.
  • the playback unit 130 may play sound data together with the video data.
  • the playback unit 130 uses, for example, the maximum volume information included in the control information included in the content acquired by the communication unit 110, and the predetermined sound of the sound data included in the content exceeds the preset volume. Playback control that does not output at volume is performed.
  • the preset volume may be set by the first user or may be set as an initial state at the time of factory shipment or the like.
  • the reproduction unit 130 when the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume, the reproduction unit 130 performs the first reproduction control that does not reproduce the sound data associated with the control information. You may go. Further, when the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume, the playback unit 130 reduces the volume of the predetermined sound of the sound data associated with the control information to be equal to or lower than the set volume. In this state, the second reproduction control for outputting a predetermined sound may be performed. In addition, when the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume, the playback unit 130 indicates that the control information is set at a predetermined sound volume of the sound data associated with the control information.
  • the third reproduction control for outputting a predetermined sound may be performed in a state where the average volume indicated by the included average volume information is lowered until the maximum volume is equal to or lower than the set volume.
  • the playback unit 130 is a part of the sound data associated with the control information that exceeds the set volume in the predetermined sound.
  • the fourth playback control for outputting a predetermined sound may be performed in a state where the volume is lowered to a set volume or less.
  • the playback unit 130 includes attribute information indicating whether or not the control information permits adjustment of the volume of the sound data when the maximum volume indicated by the maximum volume information included in the control information exceeds the set volume. If included, the first reproduction control and the second reproduction control may be selectively performed according to the attribute information. Specifically, when the attribute information included in the control information indicates that the volume adjustment is permitted, the playback unit 130 performs the second playback control, and the attribute information included in the control information permits the volume adjustment. In the case of indicating not, the first reproduction control may be performed.
  • the playback unit 130 allows the control information to (i) adjust the overall volume of the sound data.
  • a first reproduction control according to the attribute information when including attribute information indicating that the adjustment of the volume of a part of the sound data is permitted; and (iii) the adjustment of the volume of the sound data is not permitted.
  • the third reproduction control and the fourth reproduction control may be selectively performed. Specifically, when the attribute information included in the control information indicates that the adjustment of the overall volume of the sound data is permitted, the playback unit 130 performs the third playback control, and the attribute information included in the control information When the adjustment of the volume of a part of the sound data is permitted, the fourth reproduction control is performed. When the attribute information included in the control information indicates that the adjustment of the volume is not permitted, the first reproduction control is performed.
  • FIG. 8 is a diagram showing a temporal change in the playback time of the volume of the content.
  • FIG. 9 is a diagram showing a temporal change in the playback time of the playback volume output when the content is played back by the playback device.
  • FIG. 10 is a diagram illustrating a temporal change in the reproduction time of the reproduction volume output when the third reproduction control is performed.
  • FIG. 11 is a diagram illustrating a temporal change in the reproduction time of the reproduction volume output when the fourth reproduction control is performed. Note that the content shown in FIGS. 8 to 11 is an example, and the volume and playback volume of the content are examples.
  • the volume of the content becomes the maximum volume Vol MAX at time t1. Further, the average volume Vol AVG of the content is indicated by a one-dot chain line in FIG.
  • the playback unit 130 of the playback apparatus 100 has a content with a volume in which the average volume Vol AVG of the content matches the adjusted volume, which is the volume adjusted by the first user using a remote controller or the like. Reproduction control for outputting a predetermined sound included in the sound from the speaker 106 is performed. In this case, if the playback unit 130 does not adjust the volume in the sound data of the content, the playback volume output from the speaker 106 may be larger than the set volume. In other words, when the content is played back so that the average volume Vol AVG of the content matches the adjusted volume, the playback unit 130 may output a sound having a volume higher than the set volume from the speaker 106.
  • the playback unit 130 sets the maximum volume of the output playback volume by lowering the average volume Vol AVG of the content below the adjustment volume as the third playback control. You may perform the reproduction
  • the playback unit 130 performs predetermined playback as the fourth playback control in a state in which the volume of the part where the playback volume of the content exceeds the set volume is reduced below the set volume. Playback control for outputting sound may be performed.
  • the reproduction unit 130 is realized by, for example, the CPU 101, the main memory 102, the storage 103, the display 105, and the speaker 106.
  • the server 200 includes a database 210, a comparison unit 220, a generation unit 230, and a communication unit 240.
  • the database 210 includes a video content DB (Database) 211 and a sound content DB (Database) 212.
  • the video content DB 211 stores a plurality of independent video contents.
  • the video content DB 211 stores control information corresponding to each of the plurality of video contents together with the plurality of video contents.
  • the sound content DB 212 stores a plurality of independent sound contents.
  • the sound content DB 212 stores control information corresponding to each of the plurality of sound contents together with the plurality of sound contents.
  • the video content DB 211 stores video content acquired from the generation apparatus 400 by the communication unit 240 via the communication network 300.
  • the sound content DB 212 stores sound content acquired from the generation device 400 by the communication unit 240 via the communication network 300.
  • Each of the video content DB 211 and the sound content DB 212 is realized by the storage 203, for example.
  • the server 200 may calculate the ambient degree using the above method using at least one of the content stored in the database 210 and the control information.
  • the control information may not include the degree of ambient.
  • the comparison unit 220 compares the video attribute information included in each of the plurality of video contents with the sound attribute information included in each of the plurality of sound contents. For example, when the genre of the video content matches the genre of the sound content, the comparison unit 220 determines that they are similar to each other.
  • the genre may include the author of the content and the date (or month, year) when the content was created.
  • the comparison unit 220 compares the video ambient degree and the sound ambient degree using a predetermined method, and determines whether or not they are similar.
  • the comparison unit 220 calculates the video ambient degree from the metadata included in the video attribute information using the above method, and calculates the sound ambient degree from the metadata included in the sound attribute information using the above method. It may be calculated.
  • the comparison unit 220 is realized by, for example, the CPU 201, the main memory 202, and the storage 203.
  • the generation unit 230 generates a plurality of contents composed of video content and sound content having attribute information similar to each other according to the comparison result by the comparison unit 220. That is, the generation unit 230 generates a plurality of contents composed of combinations of video content and sound content similar to each other.
  • the generation unit 230 is realized by the CPU 201, the main memory 202, and the storage 203, for example.
  • the communication unit 240 transmits two or more contents among the plurality of contents generated by the generation unit 230 to the playback device 100 via the communication network 300.
  • the communication unit 240 may transmit the content corresponding to the acquisition request to the playback device 100.
  • the communication unit 240 is realized by the communication IF 204, for example.
  • the server 200 does not necessarily have the comparison unit 220 and the generation unit 230. That is, the server 200 acquires video content or sound content from the generation device 400 via the communication network 300, stores the video content or sound content in the database 210, and stores the stored video content or sound content via the communication network 300. Any configuration can be used as long as it can be transmitted to.
  • FIG. 12 is a flowchart illustrating an example of a generation method by the generation device according to the embodiment.
  • the acquisition unit 420 acquires sound data indicating a predetermined sound (S11).
  • the input receiving unit 430 receives input of control information (S12). The details of the control information received by the input receiving unit 430 are as described above.
  • the generating unit 440 generates content by associating the sound data acquired by the acquiring unit 420 with the control information received by the input receiving unit 430 (S13).
  • the communication unit 450 transmits the content generated by the generation unit 440 to the server 200 or the playback device 100 via the communication network 300 (S14).
  • FIG. 13 is a flowchart showing an example of a reproduction method by the reproduction apparatus according to the embodiment.
  • the communication unit 110 acquires content from the server 200 or the generation device 400 via the communication network 300 (S21).
  • the reproduction unit 130 reproduces the content acquired by the communication unit 110 according to the control information included in the content (S22). Details of the reproduction process performed by the reproduction unit 130 will be described later.
  • FIG. 14 is a flowchart showing an example of the details of the reproduction processing by the reproduction unit of the reproduction apparatus according to the embodiment.
  • the playback unit 130 determines whether or not the maximum volume indicated by the maximum volume information included in the control information included in the content acquired by the communication unit 110 exceeds the set volume (S31).
  • the reproducing unit 130 determines whether or not the attribute information included in the control information indicates that the adjustment of the volume of the sound data is permitted ( S32).
  • the playback unit 130 performs the second playback control in which the maximum volume of the sound data is reduced below the set volume and played back ( S33).
  • the reproducing unit 130 performs the first reproduction control that does not reproduce the content (S34).
  • the playback unit 130 plays the content as it is without adjusting the volume (S35).
  • steps S32 and S34 may not be performed. That is, when it is determined that the maximum volume exceeds the set volume, the second reproduction control in step S33 may be performed without confirming the attribute information of the control information.
  • the attribute information is information indicating whether or not the adjustment of the volume of the sound data is permitted.
  • the attribute information includes (i) the overall volume of the sound data. It may be information indicating any one of permitting adjustment, (ii) permitting adjustment of the volume of a part of the sound data, and (iii) not permitting adjustment of the volume of the sound data.
  • the reproduction process in this case is, for example, the process shown in FIG.
  • FIG. 15 is a flowchart illustrating another example of the details of the reproduction process performed by the reproduction unit of the reproduction apparatus according to the embodiment.
  • the reproducing unit 130 further indicates that the attribute information only permits the volume adjustment of a part of the sound data, It is determined whether or not the volume adjustment of the entire data is permitted (S36).
  • the playback unit 130 adjusts the volume of the part where the maximum volume exceeds the set volume and plays back the fourth volume.
  • the reproduction control is performed (S37).
  • the playback unit 130 adjusts the average volume of the sound data and decreases until the maximum volume of the sound data becomes equal to or lower than the set volume. Then, the third reproduction control for reproduction is performed (S38).
  • the content is used to prohibit a predetermined sound from being output by the playback apparatus 100 at a volume that exceeds the set volume set in the playback apparatus 100.
  • Control information including maximum volume information indicating the maximum volume of a predetermined sound. For this reason, when the reproducing apparatus 100 reproduces the content, it is possible to reduce the output of the content by the reproducing apparatus 100 at a volume exceeding the set volume. Therefore, when the playback device 100 plays back content, it is possible to reduce discomfort that the playback device 100 gives to the user.
  • the playback device 100 outputs a predetermined sound.
  • the output of the predetermined sound exceeding the set sound volume can be reduced. That is, the playback apparatus 100 can reduce the output of content sound at a large volume that is not suitable for ambient content.
  • the content is attribute information included in the control information, which is attribute information indicating whether or not to allow adjustment of the volume of the sound data, or (i) adjustment of the overall volume of the sound data is permitted.
  • (Ii) includes attribute information indicating that the adjustment of the volume of a part of the sound data is permitted and (iii) the adjustment of the volume of the sound data is not permitted.
  • the present invention is not limited to this.
  • attribute information indicating that the first reproduction control is performed
  • attribute information indicating that the second reproduction control is performed
  • attribute information indicating that the third reproduction control is performed
  • Any one of the attribute information indicating that the fourth reproduction control is performed may be included.
  • the reproduction device 100 reproduces the reproduction control indicated by the attribute information, that is, the first reproduction control, the second reproduction control, the third reproduction control, and the fourth reproduction. Perform any one of the controls.
  • the reproducing apparatus 100 in the above embodiment may display an image related to the ambient degree together with the content.
  • the image may include at least one of an image indicating the ambient degree of the content and an image indicating the range of the ambient degree received by a receiving unit such as a remote controller (not shown).
  • the user By displaying an image related to the ambient degree together with the content on the display 105, the user visually recognizes the image together with the content being reproduced.
  • the user can recognize the ambient level of the currently reproduced content by visually recognizing an image indicating the ambient level. Further, the user can recognize the range of the ambient degree designated by the user by visually recognizing the image indicating the range of the ambient degree. By recognizing these, for example, the user can instruct the playback device 100 to change the specified ambient degree higher or lower than the current degree through the reception unit.
  • a sound relating to the ambient degree may be output by the speaker 106, and the same effect as described above can be obtained.
  • the playback device specifies the index associated with the content within the range of the index, and thereby the content to be played back Can be specified. At that time, the user need not recall the search key. The user can specify the content to be played back by the playback device simply by specifying the rough value of the index associated with the content within the range. In this way, the playback device enables more flexible content specification. Also, since flexible content specification is possible, the problem of increase in processing load and power consumption of the playback device when determination of content reflecting the user's intention fails can be avoided.
  • the playback device enables more flexible content specification by using, as a specific index, an estimated index that indicates the degree of attention that the user directs to the content being played back.
  • the playback device, the server, or the generation device calculates an index associated with the content based on the degree of attention directed by the user to each of the video and the sound included in the content.
  • the content index can be calculated in consideration of the video and sound included in the content.
  • the playback device, server, or generation device calculates an index associated with the content by a weighted average obtained by increasing the weight of the sound index of the video index and the sound index.
  • the playback device, server, or generation device calculates an index associated with the content by a weighted average obtained by increasing the weight of the video index among the video index and the sound index.
  • the index associated with the content the index of the index used for specifying the content is set with respect to the degree of the attention directed by the user by relatively increasing the contribution of the degree of attention directed by the person to the video. It can be an indicator that matches the sense of
  • the playback device, server, or generation device can calculate the video index by specifically using the brightness, saturation, hue, or scene change mode of the video included in the content.
  • the playback device, server, or generation device can calculate the sound index by specifically using the volume, frequency distribution, or volume change mode included in the content.
  • the playback device, server, or generation device can cause the user to recognize the content index by presenting the index associated with the content along with the content being played back to the user. Then, it is possible to cause the user to make a determination as to whether or not the content that the user wants to present on the playback apparatus is compatible with the index range designated by the user.
  • the playback device, the server, or the generation device plays back both video content and sound content
  • the index of the video content and sound content to be played back may be included in the range specified by the user. it can.
  • the user can play both the video content and the sound content that are estimated to have the same level of attention by the playback device.
  • the playback device can cause the content provider to recognize the index associated with the content by presenting the index when the content is stored in the server in advance.
  • the playback device can make the content provider recognize the adjusted content index after adjusting the content.
  • the content provider recognizes the index of the adjusted content, confirms the result of the adjustment made to the content provided by itself, and determines whether to store it in the server based on the result Can take action.
  • each component is realized by executing a software program suitable for each component, but may be configured by dedicated hardware.
  • Each component may be realized by a program execution unit such as a CPU or a processor reading and executing a software program recorded on a recording medium such as a hard disk or a semiconductor memory.
  • the software that realizes the reproduction method of each of the above embodiments is the following program.
  • this program is a generation method for generating content on a computer using a computer, acquires sound data indicating a predetermined sound, and the predetermined sound indicated by the acquired sound data is reproduced by a playback device Control information used for prohibiting output by the playback device at a volume exceeding the set volume set in the control information, including control information including maximum volume information indicating the maximum volume of the predetermined sound.
  • a generation method for generating content by receiving an input and associating the acquired sound data with the control information having received the input is executed.
  • this program is a reproduction method by a reproduction system including a generation device that generates content and a reproduction device that acquires the content generated by the generation device and reproduces the acquired content. Then, the generation device acquires sound data indicating a predetermined sound, and the predetermined sound indicated by the acquired sound data has a volume exceeding a set volume set in the reproduction device. Control information used for prohibiting the output of the sound, and receiving the input of control information including maximum volume information indicating the maximum volume of the predetermined sound, the acquired sound data, and the received control Content is generated by associating with information, and the playback device acquires the content via a communication network. Playback that does not output the predetermined sound of the sound data included in the content at a volume exceeding the preset volume using the maximum volume information included in the control information included in the acquired content Let the method run.
  • the generation method, the generation device, the reproduction method, and the reproduction system according to one or more aspects of the present invention have been described based on the embodiment.
  • the present invention is not limited to this embodiment. Absent. Unless it deviates from the gist of the present invention, one or more of the present invention may be applied to various modifications that can be conceived by those skilled in the art, or forms constructed by combining components in different embodiments. It may be included within the scope of the embodiments.
  • the sound ambient degree is described based on the volume of the sound of the content, the frequency distribution of the sound, or the change of the volume.
  • the present invention is not limited to this.
  • the sound frequency characteristics the approximation with the so-called “1 / f fluctuation” characteristic, the number of overtone components, the regularity of the timbre waveform (frequency of several Hz or less) Area) and the like.
  • the sound ambient level is an index at the research stage compared to the video ambient level, but the mid-range sound around 200 Hz is equivalent to vocals and human speech, and is likely to be heard by humans. I know it. Therefore, it is considered that the degree of attention directed by the user increases, and the degree of consciousness increases (the degree of ambient decreases).
  • the human brain tries to understand what is different from nature by unknowingly complementing it, so when listening to sounds that are different from the natural world, it will use brain resources, increasing the degree of consciousness (the degree of ambient is increased). It is thought that). Therefore, music that is composed to increase the degree of user's attention is not only highly conscious (low ambient), but also sounds that exist in the natural world, such as river buzz, can be recorded in a recording environment (such as a microphone or Depending on the performance of the recording device, the degree of ambient may be reduced.
  • a recording environment such as a microphone or Depending on the performance of the recording device, the degree of ambient may be reduced.
  • the present disclosure can be applied to a generation method that can reduce discomfort given to the user by the playback device.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Computer Security & Cryptography (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

L'invention concerne un procédé de génération qui permet de générer un contenu à l'aide d'un ordinateur, le procédé consistant : à acquérir des données sonores représentant un son particulier (S11); à accepter une entrée d'informations de commande qui comprennent des informations de volume sonore maximal représentant le volume maximal du son particulier représenté par les données sonores acquises, les informations de commande étant utilisées pour empêcher le son particulier d'être émis par un dispositif de reproduction (100) à un volume sonore dépassant un volume sonore prédéterminé établi dans le dispositif de reproduction (100) (S12); à générer un contenu en associant les données sonores acquises aux informations de commande dont l'entrée a été acceptée (S13).
PCT/JP2018/005615 2017-02-21 2018-02-19 Procédé et dispositif de génération, procédé et système de reproduction WO2018155353A1 (fr)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201762461359P 2017-02-21 2017-02-21
US62/461359 2017-02-21
JP2017-189864 2017-09-29
JP2017189864A JP2020065096A (ja) 2017-02-21 2017-09-29 生成方法、生成装置、再生方法および再生システム

Publications (1)

Publication Number Publication Date
WO2018155353A1 true WO2018155353A1 (fr) 2018-08-30

Family

ID=63253773

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2018/005615 WO2018155353A1 (fr) 2017-02-21 2018-02-19 Procédé et dispositif de génération, procédé et système de reproduction

Country Status (1)

Country Link
WO (1) WO2018155353A1 (fr)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2013207323A (ja) * 2012-03-27 2013-10-07 Funai Electric Co Ltd 音声信号出力機器および音声出力システム
JP2016082473A (ja) * 2014-10-20 2016-05-16 三菱電機株式会社 映像再生装置および映像再生方法

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2013207323A (ja) * 2012-03-27 2013-10-07 Funai Electric Co Ltd 音声信号出力機器および音声出力システム
JP2016082473A (ja) * 2014-10-20 2016-05-16 三菱電機株式会社 映像再生装置および映像再生方法

Similar Documents

Publication Publication Date Title
US10966044B2 (en) System and method for playing media
US11075609B2 (en) Transforming audio content for subjective fidelity
US20190018644A1 (en) Soundsharing capabilities application
JP4913038B2 (ja) 音声レベル制御
KR101251626B1 (ko) 스마트 기기를 이용한 음향기기의 특성에 대한 보상 서비스 제공 방법
US7725203B2 (en) Enhancing perceptions of the sensory content of audio and audio-visual media
US20110066438A1 (en) Contextual voiceover
CN110580141B (zh) 移动终端
JP2011130279A (ja) コンテンツ提供サーバ、コンテンツ再生装置、コンテンツ提供方法、コンテンツ再生方法、プログラムおよびコンテンツ提供システム
US20190261121A1 (en) Method Of Editing Audio Signals Using Separated Objects And Associated Apparatus
US8265935B2 (en) Method and system for media processing extensions (MPX) for audio and video setting preferences
US10091581B2 (en) Audio preferences for media content players
US20110110534A1 (en) Adjustable voice output based on device status
WO2018155353A1 (fr) Procédé et dispositif de génération, procédé et système de reproduction
US10656901B2 (en) Automatic audio level adjustment during media item presentation
US20200081681A1 (en) Mulitple master music playback
WO2018155352A1 (fr) Procédé de commande de dispositif électronique, dispositif électronique, système de commande de dispositif électronique et programme
JP2020065096A (ja) 生成方法、生成装置、再生方法および再生システム
WO2018155351A1 (fr) Procédé de reproduction, système de reproduction et appareil de reproduction
KR20110008505A (ko) 사용자 개개인의 청력에 맞추어 오디오 기기의 음질을 제어하는 장치 및 방법
US20090192636A1 (en) Media Modeling
JP2020065099A (ja) 再生方法、再生システム、および、再生装置
JP2020065098A (ja) 電子機器の制御方法、電子機器、電子機器の制御システム、及び、プログラム
US20120117373A1 (en) Method for controlling a second modality based on a first modality
EP2083422A1 (fr) Modélisation de média

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18756904

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18756904

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: JP