WO2020059047A1 - Evaluation system, server device, terminal device, information processing method, and information processing program - Google Patents


Info

Publication number
WO2020059047A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
sound
content
evaluation
video
Prior art date
Application number
PCT/JP2018/034671
Other languages
French (fr)
Japanese (ja)
Inventor
大久 谷川
英士 福田
久美 余川
和俊 木下
吉田 真人
渉 鈴木
Original Assignee
楽天株式会社 (Rakuten, Inc.)
Priority date
Filing date
Publication date
Application filed by 楽天株式会社 (Rakuten, Inc.)
Priority to PCT/JP2018/034671
Priority to JP2019503504A (patent JP6543429B1)
Publication of WO2020059047A1

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00: Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40: Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43: Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/442: Monitoring of processes or resources, e.g. detecting the failure of a recording device, monitoring the downstream bandwidth, the number of times a movie has been viewed, the storage space available from the internal hard disk
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00: Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40: Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47: End-user applications

Definitions

  • the present invention relates to a technical field of an evaluation system for allowing a user to view content composed of at least one of video and sound and input evaluation information for the content.
  • Patent Literature 1 discloses a technique for ascertaining the taste of a viewer of a television broadcast.
  • the remote control of the set-top box is provided with a notification button for notifying the viewer's preference for people, objects, music, and the like on the air.
  • When the notification button is pressed, the set-top box stores the viewing log as a viewing event and transmits the viewing log to the broadcasting device at predetermined time intervals.
  • the broadcasting device counts viewing events that match the conditions set by the program creator or the like.
  • In the technique of Patent Literature 1, a dedicated remote control is used with a television receiver or set-top box that receives a television broadcast signal and displays the video of a television program, and the evaluation information is transmitted to the broadcast station or program creator via the television receiver or set-top box. It is therefore assured that the user actually viewed the program when entering the evaluation information, so the reliability of the input evaluation information is ensured.
  • However, consider a highly versatile system in which a general-purpose terminal device, such as a mobile phone or tablet computer that can also be used for other purposes, is used for inputting evaluation information, and each user's evaluation information for the content is collected. The issue is how to ensure the reliability of the evaluation information. The reason is that such a terminal device cannot be dedicated to evaluation input for the television receiver, so the user can easily input evaluation information regardless of whether the user is actually watching the program on the television receiver.
  • Patent Literature 2 discloses that a portable communication device acquires the sound output from the speaker of a television receiver with a built-in microphone and stores it as sound data, acquires broadcast program data with a built-in tuner and extracts its audio data, and stores data indicating the viewing status of the broadcast program based on a comparison of the two. However, the technique of Patent Literature 2 evaluates the viewing situation of a broadcast program by a viewer; it does not aim to evaluate the broadcast program itself.
  • It is an object of the present invention, made in view of the above points, to provide an evaluation system, a server device, a terminal device, an information processing method, and an information processing program that can ensure the reliability of evaluation information on content even when the content is evaluated using a general-purpose terminal device.
  • One embodiment of the present invention is an evaluation system including a terminal device and a server device connected to the terminal device via a network. The terminal device includes: input means by which a user inputs evaluation information for content composed of at least one of video and sound; detection means for detecting the video or sound output by an output device that outputs at least one of the video and the sound constituting the content; and transmitting means for transmitting, to the server device, the input evaluation information and detection information indicating the detected video or sound. The server device includes: acquisition means for acquiring content information indicating the video or the sound constituting the content; receiving means for receiving the evaluation information and the detection information from the terminal device; comparing means for comparing the acquired content information with the received detection information; and selecting means for selecting, when the comparison result shows a predetermined match between the content information and the detection information, the received evaluation information to be used for evaluating the content.
  • According to this configuration, whether the user is actually watching or listening to the content to be evaluated can be estimated based on the comparison between the detection information indicating the video or sound detected by the terminal device and the content information indicating the video or sound of the content acquired by the server device. Therefore, evaluation information that is presumed to have been input while the user watched or listened to the content can be preferentially used, and the reliability of the evaluation information can be ensured.
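The selection decision described above can be sketched as follows. This is an illustrative sketch, not the claimed implementation: the feature-vector representation, the `match_score` similarity measure, and the 0.8 threshold are all assumptions introduced for the example.

```python
# Illustrative sketch of the server-side selection step: an evaluation is
# kept only if the detection information sent with it sufficiently matches
# the content information held by the server. The similarity measure and
# threshold are assumptions for illustration.

def match_score(detected: list[float], reference: list[float]) -> float:
    """Normalized similarity between two equal-length feature vectors (0..1)."""
    if len(detected) != len(reference):
        return 0.0
    diff = sum(abs(d - r) for d, r in zip(detected, reference))
    scale = sum(abs(r) for r in reference) or 1.0
    return max(0.0, 1.0 - diff / scale)

def select_evaluations(received, content_features, threshold=0.8):
    """Keep only evaluations whose detection info matches the content."""
    return [ev for ev, detected in received
            if match_score(detected, content_features) >= threshold]

# A terminal that detected the actual broadcast sound is kept; one that
# sent unrelated detection information is discarded.
content = [0.2, 0.9, 0.4, 0.7]
received = [
    ("interesting", [0.2, 0.9, 0.4, 0.7]),   # watching the program
    ("boring",      [5.0, 0.1, 3.2, 0.0]),   # not watching
]
print(select_evaluations(received, content))  # ['interesting']
```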
  • In one embodiment, the content is content to be broadcast, and during the broadcast of the content, the evaluation information and the detection information are transmitted from each of a plurality of terminal devices, as the terminal device, to the server device.
  • In this evaluation system, the server device further includes timing information transmitting means for transmitting, to at least one terminal device of the plurality of terminal devices, timing information indicating a transmission timing of at least one of the evaluation information and the detection information that differs from a transmission timing by at least one other terminal device of the plurality of terminal devices. The terminal device further includes timing information receiving means for receiving the timing information from the server device, and the transmitting means transmits at least one of the evaluation information and the detection information in accordance with the received timing information.
  • According to this, the transmission timings of at least one of the evaluation information and the detection information are distributed among the plurality of terminal devices. Therefore, the processing load of the server device can be distributed in the time axis direction.
  • One embodiment of the present invention is further characterized in that the timing information transmitting means determines the transmission timing of the at least one other terminal device to fall within an interval between transmission timings of the at least one terminal device.
  • According to this, since the transmission timing of the at least one terminal device and the transmission timing of the at least one other terminal device differ from each other, the number of pieces of information received by the server device per unit time is equalized, and the processing load on the server device can be further distributed.
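The timing-distribution idea above can be illustrated with a small sketch. The 10-second reporting period and the even-spacing policy are assumptions for illustration; the embodiment only requires that different terminals be assigned differing transmission timings so that their transmissions interleave.

```python
# Sketch of distributing transmission timings: each terminal is assigned an
# offset within a common reporting period so that transmissions from
# different terminals interleave instead of arriving simultaneously.
# The 10-second period is an illustrative assumption.

PERIOD = 10.0  # seconds between transmissions by one terminal (assumed)

def assign_offsets(num_terminals: int, period: float = PERIOD) -> list[float]:
    """Spread the terminals' transmission offsets evenly across one period."""
    return [period * i / num_terminals for i in range(num_terminals)]

# Four terminals report every 10 s, staggered 2.5 s apart, so the server
# receives roughly the same number of messages per unit time.
offsets = assign_offsets(4)
print(offsets)  # [0.0, 2.5, 5.0, 7.5]
```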
  • In one embodiment, the detection means and the transmitting means repeatedly detect the video or the sound at predetermined time intervals and transmit the detection information, and the selecting means performs the selection based at least on a comparison result between the content information and, among the detection information repeatedly received by the receiving means at the predetermined time intervals, the detection information indicating the video or sound detected by the terminal device at the time closest to the input time of the evaluation information, before or after that input time.
  • According to this, the server device determines whether to use the evaluation information for evaluating the content by using the detection information indicating the video or sound detected at a time close to the time when the user input the evaluation information. Therefore, even in a mode in which the terminal device periodically detects video or sound and transmits the detection information, the accuracy of estimating whether the user was watching or listening to the content at the time of inputting the evaluation information can be increased.
  • In one embodiment, the selecting means further performs the selection based on comparison results between the content information and two or more pieces of detection information, among the detection information periodically received by the receiving means, each indicating video or sound detected at a time relatively close to the input time of the evaluation information.
  • According to this, two or more pieces of detection information indicating video or sound detected at times close to the time when the user input the evaluation information are used. Therefore, the accuracy of estimating whether the user was watching or listening to the content at the time the evaluation information was input can be further increased.
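Picking the detection information detected closest to the evaluation's input time (one piece, or the few closest pieces), as described above, can be sketched as follows; the timestamp representation is an assumption for the example.

```python
# Sketch of choosing, from periodically received detection information, the
# k pieces detected closest (before or after) to the evaluation's input
# time. Timestamps are seconds since the start of the broadcast (assumed).

def nearest_detections(detections, input_time, k=1):
    """detections: list of (detected_at, info); return the k closest pieces."""
    return sorted(detections, key=lambda d: abs(d[0] - input_time))[:k]

# Detection info arrives every 5 seconds; the evaluation was input at 11.2 s.
detections = [(0.0, "a"), (5.0, "b"), (10.0, "c"), (15.0, "d")]
print(nearest_detections(detections, 11.2))        # [(10.0, 'c')]
print(nearest_detections(detections, 11.2, k=2))   # [(10.0, 'c'), (15.0, 'd')]
```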
  • In one embodiment, the detection means and the transmitting means detect the video or the sound when the evaluation information is input and transmit the detection information to the server device together with the evaluation information, and when there is a predetermined match between the content information and that detection information, the selecting means selects the evaluation information received together with the detection information to be used for evaluating the content.
  • the server device determines whether or not to use the evaluation information for evaluating the content, using the detection information indicating the video or the sound detected when the user inputs the evaluation information. Therefore, it is possible to increase the estimation accuracy of whether or not the user is watching or listening to the content at the time when the evaluation information is input.
  • In one embodiment, the selecting means further specifies, as the input time of the evaluation information received together with the detection information, the output time by the output device of the video or sound indicated by the portion of the content information determined to match the detection information.
  • According to this, the server device specifies, as the input time of the evaluation information, the output time of the video or sound in the content that matches the video or sound detected when the user input the evaluation information. Therefore, it is possible to appropriately specify for which scene of the content the evaluation information was input, and thus the content can be appropriately evaluated.
  • In one embodiment, the acquisition means acquires, as the content information, specific information time-series data converted from characteristic information time-series data composed of a time series of characteristic information indicating characteristics of the video or the sound constituting the content. The specific information time-series data is composed of a time series of specific information, the corresponding characteristic information can be specified based on each piece of specific information, and the information amount of each piece of specific information is less than the information amount of the corresponding characteristic information. The terminal device further includes: characteristic information acquiring means for acquiring the characteristic information time-series data in advance; extracting means for extracting characteristic information indicating a characteristic of the detected video or sound; and generating means for generating, when the acquired characteristic information time-series data includes characteristic information whose degree of coincidence with the extracted characteristic information exceeds a predetermined value, specific information that specifies that characteristic information and whose information amount is smaller than the information amount of the characteristic information. The transmitting means transmits the generated specific information as the detection information, and the selecting means specifies, as the input time, the output time corresponding to the specific information that matches the specific information received as the detection information, among the specific information included in the specific information time-series data.
  • the terminal device compares the characteristic information time-series data of the content acquired before the output of the content with the characteristic information extracted from the detected video or sound.
  • the terminal device generates specific information having a smaller information amount from the characteristic information when there is characteristic information in the characteristic information time-series data in which the degree of coincidence with the extracted characteristic information exceeds a predetermined value.
  • The terminal device transmits the specific information to the server device as the detection information.
  • the server device compares the specific information time-series data converted from the characteristic information time-series data with the specific information received from the terminal device.
  • the server device acquires, as the input time of the evaluation information, the output time of the video or sound corresponding to the specific information that matches the received specific information, from the specific information time-series data. Therefore, since the amount of detection information is reduced, the communication load on the terminal device and the server device can be reduced.
  • the specific information is a hash value of the characteristic information.
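The hash-based scheme above can be sketched as follows. SHA-256 truncated to 16 hex characters is an illustrative choice of specific information, and the byte-string features and time values are placeholders, not the actual characteristic information of the embodiment.

```python
# Sketch of the hash-based scheme: the server converts each (output_time,
# characteristic_info) pair into (hash, output_time), the terminal sends
# only the hash of the matched characteristic information, and the server
# looks up the output time. A truncated SHA-256 digest stands in for the
# "specific information" with a smaller information amount.
import hashlib

def specific_info(characteristic: bytes) -> str:
    """Short hash identifying a piece of characteristic information."""
    return hashlib.sha256(characteristic).hexdigest()[:16]

# Server side: specific information time-series data derived in advance.
characteristic_series = [(0.0, b"feat-A"), (5.0, b"feat-B"), (10.0, b"feat-C")]
specific_series = {specific_info(c): t for t, c in characteristic_series}

# Terminal side: the detected sound matched characteristic info b"feat-B",
# so only its hash is transmitted as the detection information.
detection = specific_info(b"feat-B")

# Server side: the matching hash yields the output time, which is used as
# the input time of the evaluation information.
print(specific_series[detection])  # 5.0
```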
  • One embodiment of the present invention is a server device including: acquisition means for acquiring content information indicating the video or the sound constituting a content composed of at least one of video and sound; receiving means for receiving, from a terminal device into which evaluation information for the content is input by a user and which detects the video or sound output by an output device that outputs at least one of the video and the sound constituting the content, the evaluation information and detection information indicating the detected video or sound; comparing means for comparing the acquired content information with the received detection information; and selecting means for selecting, when the comparison result by the comparing means shows a predetermined match between the content information and the detection information, the received evaluation information to be used for evaluating the content.
  • One embodiment of the present invention is a terminal device including: input means by which a user inputs evaluation information for content composed of at least one of video and sound; detection means for detecting the video or sound output by an output device that outputs at least one of the video and the sound constituting the content; and transmitting means for transmitting the input evaluation information and detection information indicating the detected video or sound to a server device that selects, when a comparison result between the detection information and content information indicating at least one of the video and the sound constituting the content shows a predetermined match between them, the evaluation information to be used for evaluating the content.
  • One embodiment of the present invention is an information processing method in an evaluation system including a terminal device and a server device connected to the terminal device via a network, in which the server device acquires content information indicating the video or the sound constituting a content composed of at least one of video and sound, the terminal device receives input of evaluation information for the content by a user, detects the video or sound output by an output device, and stores the input evaluation information and detection information indicating the detected video or sound, the terminal device transmits the evaluation information and the detection information to the server device, and the server device compares the acquired content information with the received detection information and, when there is a predetermined match between them, selects the received evaluation information to be used for evaluating the content.
  • One embodiment of the present invention is an information processing method executed by a computer of a server device, including: an acquisition step of acquiring content information indicating the video or the sound constituting a content composed of at least one of video and sound; a receiving step of receiving, from a terminal device into which evaluation information for the content is input by a user and which detects the video or sound output by an output device that outputs at least one of the video and the sound constituting the content, the evaluation information and detection information indicating the detected video or sound; a comparing step of comparing the acquired content information with the received detection information; and a selecting step of selecting, when the comparison result shows a predetermined match between the content information and the detection information, the received evaluation information to be used for evaluating the content.
  • One aspect of the present invention is an information processing program that causes a computer of a server device to function as: acquisition means for acquiring content information indicating the video or the sound constituting a content composed of at least one of video and sound; receiving means for receiving, from a terminal device into which evaluation information for the content is input by a user and which detects the video or sound output by an output device that outputs at least one of the video and the sound constituting the content, the evaluation information and detection information indicating the detected video or sound; comparing means for comparing the acquired content information with the received detection information; and selecting means for selecting, when the comparison result by the comparing means shows a predetermined match between the content information and the detection information, the received evaluation information to be used for evaluating the content.
  • One embodiment of the present invention is an information processing program that causes a computer of a terminal device to function as: evaluation information acquiring means for acquiring evaluation information for content composed of at least one of video and sound, input by a user to input means provided in the terminal device; detection information acquiring means for acquiring, from detection means for detecting the video or sound output by an output device that outputs at least one of the video and the sound constituting the content, detection information indicating the detected video or sound; and transmitting means for transmitting the acquired evaluation information and the acquired detection information to a server device that selects, when a comparison result between the detection information and content information indicating at least one of the video and the sound constituting the content shows a predetermined match between them, the evaluation information to be used for evaluating the content.
  • the reliability of the evaluation information for a content can be ensured even when the content is evaluated using a general-purpose terminal device.
  • FIG. 2B is a diagram illustrating an example of functional blocks of the system control unit 11 of the server 1 according to one embodiment.
  • FIG. 3A is a block diagram illustrating an example of a schematic configuration of the user terminal 2 according to one embodiment.
  • FIG. 3B is a diagram illustrating an example of functional blocks of the system control unit 21 of the user terminal 2 according to one embodiment.
  • FIG. 9 is a diagram illustrating an example of information transmission timings by a plurality of user terminals 2.
  • FIG. 9 is a diagram illustrating an example of a generated report.
  • FIG. 9 is a flowchart illustrating an example of a terminal process executed by the system control unit 21 of the user terminal 2.
  • FIG. 5 is a flowchart illustrating an example of a server process executed by the system control unit 11 of the server 1.
  • FIG. 4A is a diagram illustrating an example of a processing outline in the program evaluation system S.
  • the content that can be evaluated in the present invention includes at least one of a video (especially a moving image) and a sound.
  • Broadcasting, on-demand distribution, and the like are examples of content distribution modes. Examples of the broadcasting form include terrestrial digital television broadcasting, satellite broadcasting, cable television, radio broadcasting, Internet broadcasting, and the like. Examples of the form of on-demand distribution include satellite broadcasting, cable television, and the Internet.
  • the embodiment described below is an embodiment in which the present invention is applied to a system for evaluating a program in digital terrestrial television broadcasting.
  • FIG. 1 is a diagram illustrating an example of a schematic configuration of a program evaluation system S according to the present embodiment.
  • the program evaluation system S includes a server 1 and one or a plurality of user terminals 2.
  • the server 1 and each user terminal 2 can mutually transmit and receive data via the network NW using, for example, TCP/IP as a communication protocol.
  • the network NW is constructed by, for example, the Internet, a dedicated communication line (for example, a CATV (Community Antenna Television) line), a mobile communication network (including base stations and the like), a gateway, and the like.
  • the server 1 is a server device for counting or analyzing user evaluations of television programs broadcast by the broadcasting station 3. In order to determine the reliability of the evaluation by each user, the server 1 acquires information indicating the sound of the program to be evaluated, as program sound information, during or before the broadcast of the program. During the broadcast of the program to be evaluated, the server 1 receives, from each user terminal 2, evaluation information indicating an evaluation by the user. Further, the server 1 receives, from each user terminal 2, detected sound information indicating the sound detected by that user terminal 2. Then, the server 1 determines whether or not to use the evaluation information from the user terminal 2 for evaluating the program, based on the detected sound information.
  • Each user terminal 2 is used by a user who has registered as a member in the program evaluation system S.
  • While a user is viewing a program to be evaluated, the user terminal 2 used by that user detects the sound output from the television receiver 4 and transmits the detected sound information to the server 1.
  • each user terminal 2 transmits evaluation information indicating the input evaluation to the server 1.
  • There is no particular upper limit on the number of times a user can input evaluation information per unit time while viewing a program. For example, a user may input evaluation information at intervals of several seconds or several hundred milliseconds.
  • the user terminal 2 is preferably a portable computer, but may be a stationary computer. Examples of the user terminal 2 include a mobile information terminal such as a smartphone and a tablet computer, a mobile phone, a PDA (Personal Digital Assistant), and a personal computer.
  • FIG. 2A is a block diagram illustrating an example of a schematic configuration of the server 1 according to the present embodiment.
  • the server 1 includes a system control unit 11, a system bus 12, an input / output interface 13, a storage unit 14, and a communication unit 15.
  • the system control unit 11 and the input / output interface 13 are connected via a system bus 12.
  • the system control unit 11 includes a CPU (Central Processing Unit) 11a, a ROM (Read Only Memory) 11b, a RAM (Random Access Memory) 11c, and the like.
  • the CPU 11a is an example of a processor.
  • the present invention can be applied to various processors different from the CPU.
  • the storage unit 14, the ROM 11b, and the RAM 11c are each an example of a memory.
  • the present invention is applicable to various memories different from a hard disk, a ROM, and a RAM.
  • the input / output interface 13 performs an interface process between the storage unit 14 and the communication unit 15 and the system control unit 11.
  • the storage unit 14 is configured by, for example, a hard disk drive or the like.
  • the storage unit 14 stores program sound information acquired from the broadcast station 3.
  • the storage unit 14 stores the evaluation information received from each user terminal 2.
  • the storage unit 14 stores a user database.
  • the user database stores information on the users registered as members in the program evaluation system S. For example, in the user database, user attributes such as user ID, name, date of birth, gender, and occupation are stored in association with each user.
  • the user ID is information for identifying a user.
  • the storage unit 14 stores various programs such as an operating system and a server program.
  • the server program is a program for acquiring program information, receiving evaluation information and detected sound information, determining whether to use evaluation information, and the like.
  • the server program may be obtained from another server device or the like via the network NW, or may be recorded on a recording medium such as a magnetic tape, an optical disk, or a memory card and read via a drive device.
  • the communication unit 15 connects to the network NW and controls a communication state with each user terminal 2.
  • FIG. 3A is a block diagram illustrating an example of a schematic configuration of the user terminal 2 according to the present embodiment.
  • the user terminal 2 includes a system control unit 21, a system bus 22, an input / output interface 23, a storage unit 24, a communication unit 25, an operation input unit 26, a display unit 27, A microphone 28 and a camera 29 are provided.
  • the system control unit 21 and the input / output interface 23 are connected via a system bus 22.
  • the system control unit 21 includes a CPU 21a, a ROM 21b, a RAM 21c, and the like.
  • the input / output interface 23 performs an interface process between the storage unit 24 to the camera 29 and the system control unit 21.
  • the storage unit 24 includes, for example, a flash memory, a hard disk drive, and the like.
  • the storage unit 24 stores various programs such as an operating system, a web browser, and a program evaluation application.
  • the program evaluation application is a program for performing processing for using the program evaluation system S.
  • the program evaluation application may be obtained from a server device such as the server 1 via the network NW, or may be recorded on a recording medium such as an optical disk or a memory card and read via a drive device. Note that the program evaluation application may be a web application.
  • the communication unit 25 connects to the network NW and controls a communication state with the server 1.
  • the operation input unit 26 receives an operation performed by the user, and outputs a signal corresponding to the operation content to the system control unit 21.
  • Examples of the operation input unit 26 include a touch panel, a button, a switch, a key, a keyboard, a mouse, and the like.
  • the operation input unit 26 functions as a unit for inputting evaluation information for a broadcast program by a user.
  • the display unit 27 displays information such as images and characters under the control of the system control unit 21.
  • Examples of the display unit 27 include a liquid crystal display and an organic EL (Electro Luminescence) display.
  • the microphone 28 is a device that converts a sound wave into an audio signal and outputs the audio signal to the system control unit 21.
  • the microphone 28 functions as a unit that detects a sound of a broadcast program output from the television receiver 4.
  • the camera 29 captures a still image or a moving image.
  • the camera 29 is configured by, for example, a CCD (Charge-Coupled Device) image sensor or the like.
  • FIG. 3B is a diagram illustrating an example of functional blocks of the system control unit 21 of the user terminal 2 according to the present embodiment.
  • the system control unit 21 causes the CPU 21a to read and execute various codes included in the program evaluation application, thereby functioning as an evaluation information acquisition unit 211, a detection information acquisition unit 212, an evaluation information/detection information transmission unit 213, and the like.
  • FIG. 4A is a diagram showing an example of a processing outline in the program evaluation system S.
  • the evaluation information acquisition unit 211 acquires, from the operation input unit 26, the evaluation information input by the user for the broadcast program. As shown in FIG. 4A, the user normally inputs evaluation information to the user terminal 2 while viewing a broadcast program to be evaluated on the television receiver 4.
  • FIG. 5 is a diagram showing an example of an input screen for inputting evaluation information for a program.
  • the evaluation information acquisition unit 211 causes the display unit 27 of the user terminal 2 to display the input screen shown in FIG.
  • the evaluation information acquisition unit 211 may perform display control so that the input screen can be displayed only in the broadcast time slot of the program to be evaluated.
  • the input screen may include, for example, evaluation buttons 101 and 102, a comment input area 103, a comment transmission button 104, and the like.
  • the evaluation button 101 is a button that is pressed when the user feels that the program is boring.
  • the evaluation information obtaining unit 211 obtains evaluation information indicating “boring”.
  • the evaluation button 102 is a button that is pressed when the user feels that the program is interesting.
  • the evaluation information obtaining unit 211 obtains evaluation information indicating “interesting”.
  • the comment input area 103 is an area for inputting a comment on the program.
  • the comment transmission button 104 is a button for transmitting a comment input to the comment input area 103.
  • the evaluation information acquisition unit 211 acquires evaluation information including a character string of the input comment.
  • the evaluation of the program is not limited to “interesting”, “boring”, and comments. For example, it may be possible to input “interesting: XX%” by touching an indicator bar displayed on the screen; that is, the degree and tendency of the evaluation may be input visually. In this case, if the indicator bar is selected near its center, the evaluation is input as “interesting: 50%”.
  • Various other forms of evaluation can be input as evaluation information.
  • the detection information acquisition unit 212 acquires detection sound information indicating a sound detected by the microphone 28. As shown in FIG. 4A, when the user inputs the evaluation information while watching the broadcast program to be evaluated, the sound of the program is detected by the microphone 28.
  • the detection information acquisition unit 212 may generate the detection sound information by, for example, converting an audio signal output from the microphone 28.
  • the detection information acquisition unit 212 may analyze the audio signal and extract, as detected sound information, waveform information indicating the characteristics of the sound waveform.
  • the detection information acquisition unit 212 may, for example, define a plurality of amplitude bands, sample the waveform of the audio signal at predetermined sampling intervals, and identify the amplitude band into which each sample falls.
  • the detection information acquisition unit 212 may count the number of waveform samples in each amplitude band and generate the array of these sample counts as the waveform information. Alternatively, the detection information acquisition unit 212 may analyze the audio signal and extract a feature amount of the detected sound from it, for example using a discrete Fourier transform, and acquire this feature amount as the detected sound information. Alternatively, the audio signal may be converted into audio data in a format such as MP3 (MPEG-1 Audio Layer-3) and acquired as the detected sound information.
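The amplitude-band counting described above can be sketched as follows. This is a minimal illustration, not the patent's exact procedure: the band edges, full-scale value, and function names are assumptions for the example.

```python
# Sketch: turning raw audio samples into "waveform information" --
# counts of samples per amplitude band (band layout is an assumption).

def waveform_info(samples, num_bands=8, full_scale=1.0):
    """Count how many samples fall into each of `num_bands` equal-width
    amplitude bands spanning [-full_scale, +full_scale]."""
    counts = [0] * num_bands
    band_width = 2 * full_scale / num_bands
    for s in samples:
        # Clamp to full scale, then map the amplitude to a band index.
        s = max(-full_scale, min(full_scale - 1e-12, s))
        counts[int((s + full_scale) // band_width)] += 1
    return counts

# Example: a tiny synthetic signal in place of real microphone output.
signal = [0.0, 0.5, -0.5, 0.9, -0.9, 0.1]
info = waveform_info(signal, num_bands=4)
```

The resulting array of counts is compact enough to transmit periodically, which matches the later description of repeated transmission at fixed intervals.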
  • the detection information acquisition unit 212 controls on / off of sound detection by the microphone 28.
  • the detection information acquiring unit 212 may acquire the detected sound information by turning on the sound detection at all times during the broadcast of the evaluation target program.
  • the detection information acquisition unit 212 may cause the microphone 28 to repeatedly detect sound at predetermined time intervals.
  • the sound detection interval may be, for example, 1 second, 5 seconds, 10 seconds, 30 seconds, 1 minute, or the like.
  • by shortening the detection interval, content evaluation, which will be described later, can be performed in a timely manner in accordance with changes in the content, and feedback to content creation and editing can be performed accurately.
  • the detection information acquisition unit 212 causes the microphone 28 to continue detecting the sound for a shorter time than the detection interval every time the sound is detected. Alternatively, the detection information acquisition unit 212 may cause the microphone 28 to detect a sound at the timing when the evaluation information is input by the user.
  • the evaluation information / detection information transmitting unit 213 transmits to the server 1 the evaluation information obtained by the evaluation information obtaining unit 211 and the detected sound information obtained by the detection information obtaining unit 212.
  • the evaluation information / detection information transmitting unit 213 may transmit at least one of the evaluation information and the detected sound information collectively after the end of the broadcast of the program to be evaluated.
  • the evaluation information / detection information transmitting unit 213 may transmit the evaluation information and the detected sound information while the program to be evaluated is being broadcast.
  • the evaluation information / detection information transmitting unit 213 may transmit the evaluation information at the timing when the evaluation information is input, and may repeatedly transmit the detection sound information at predetermined time intervals.
  • the evaluation information / detection information transmitting unit 213 may transmit the detected sound information every time the detection is performed. By transmitting the detected sound information to the server 1 periodically, the server 1 can constantly grasp the user's viewing status of the program during the broadcast of the evaluation target program.
  • the evaluation information / detection information transmitting unit 213 may repeatedly transmit the evaluation information together with the detected sound information at predetermined time intervals. In this case, transmission of the evaluation information and the detected sound information is deferred until the next periodic transmission timing after the evaluation information is input.
  • the evaluation information / detection information transmitting unit 213 may transmit the evaluation information together with the detection information at the timing when the evaluation information is input. Although details will be described later, the evaluation information / detection information transmitting unit 213 may transmit the detection sound information at a timing according to the timing information transmitted from the server 1.
  • FIG. 2B is a diagram illustrating an example of functional blocks of the system control unit 11 of the server 1 according to the present embodiment.
  • the system control unit 11 causes the CPU 11a to read and execute various codes included in the server program, thereby functioning as a program information acquisition unit 111, an evaluation information / detection information receiving unit 112, a comparing unit 113, an evaluation information use determining unit 114, and the like.
  • the program information acquisition unit 111 acquires content information indicating at least one of a video and a sound, which constitutes the content of the broadcast program to be evaluated.
  • the program information acquisition unit 111 acquires program sound information indicating the sound of a broadcast program as content information.
  • the server 1 may receive the audio data of the program from the broadcast station 3 via the network NW before the broadcast of the program starts.
  • audio data of a program may be recorded on a recording medium and loaded into the server 1 via a drive device.
  • a tuner (not shown) may receive a broadcast signal transmitted from the broadcast station 3 while a program is being broadcast, and the server 1 may acquire audio data extracted from the broadcast signal by the tuner in real time.
  • the program information acquisition unit 111 may cause the storage unit 14 to store the audio data of the program as the program sound information. Alternatively, the program information acquisition unit 111 may extract the waveform information or feature amount of the sound of the program from the audio data at predetermined time intervals, similarly to the detection information acquisition unit 212 of the user terminal 2. Then, the program information acquisition unit 111 may store the time series of the waveform information or feature amounts as a database serving as the program sound information. In the program sound information, each piece of waveform information or each feature amount may be associated with the time at which the corresponding sound is broadcast in the program to be evaluated. This broadcast time may be an absolute time or a relative time from the broadcast start time.
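One way to picture the database described above is a mapping from relative broadcast time to waveform information, plus a nearest-time lookup. This is an illustrative sketch only; the function names, interval, and data layout are assumptions, not taken from the patent.

```python
# Sketch: program sound information as a time series keyed by relative
# broadcast time (seconds from broadcast start), with nearest-time lookup.

def build_program_sound_db(per_interval_info, interval_sec=10):
    """per_interval_info: waveform-information arrays in broadcast order.
    Returns {seconds_from_broadcast_start: waveform_info}."""
    return {i * interval_sec: info for i, info in enumerate(per_interval_info)}

def nearest_entry(db, t):
    """Waveform information broadcast closest to relative time t (seconds)."""
    key = min(db, key=lambda k: abs(k - t))
    return db[key]

db = build_program_sound_db([[1, 2], [3, 4], [5, 6]], interval_sec=10)
```

A lookup like `nearest_entry(db, 12)` corresponds to restricting the comparison to waveform information broadcast near the detection time, as described later for the comparing unit 113.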
  • the evaluation information / detection information receiving unit 112 receives the evaluation information and the detection sound information from each user terminal 2. As described above, the evaluation information and the detected sound information may be transmitted together, or may be transmitted at different timings.
  • the evaluation information / detection information receiving unit 112 may determine, for each user terminal 2, the transmission timing of at least one of the evaluation information and the detected sound information so that the transmission timing used by one user terminal 2 differs from the transmission timing used by at least one other user terminal 2 among the plurality of user terminals 2. Then, the server 1 may transmit timing information indicating the determined timing to each of the plurality of user terminals 2.
  • the evaluation information / detection information receiving unit 112 may set the transmission timing of at least one other user terminal 2 within the interval between the transmission timings of a given user terminal 2 among the plurality of user terminals 2. Thereby, the number of pieces of detection information received by the server device per unit time is made uniform. In this case, the server 1 causes each user terminal 2 to transmit the detected sound information periodically.
  • the transmission cycle of the information (at least one of the evaluation information and the detected sound information) by each user terminal 2 is P seconds, and the transmission timing of the information is distributed to N transmission timings.
  • the start time of the i-th transmission cycle from the start of program broadcasting is set to Ti.
  • the information transmission timings in the i-th cycle are, for example, Ti + 0 seconds, Ti + 1 × P / N seconds, Ti + 2 × P / N seconds, ..., Ti + (N − 1) × P / N seconds. The values 0, 1 × P / N, 2 × P / N, ..., (N − 1) × P / N are offsets from the cycle start time.
  • each user terminal 2 may notify the server 1 when an operation for starting evaluation of a program is performed by a user.
  • the evaluation information / detection information receiving unit 112 may determine one transmission timing from among the plurality of transmission timings, either cyclically according to the order of notification or at random. Since the transmission timings of the plurality of user terminals 2 only need to be dispersed as a whole, it is not a problem if the transmission timings of some user terminals 2 overlap.
  • the evaluation information / detection information receiving unit 112 transmits timing information indicating the determined transmission timing to the user terminal 2 that has transmitted the notification.
  • the evaluation information / detection information transmitting unit 213 of the user terminal 2 transmits information according to the timing indicated in the timing information.
  • the timing information may include, for example, the start time of a transmission cycle (for example, 0 seconds of every minute) and an offset (for example, 0 seconds, 20 seconds, 40 seconds, and the like).
  • FIG. 6 is a diagram illustrating an example of information transmission timings by a plurality of user terminals 2.
  • P is 60 seconds and N is 3.
  • the user terminal 2-1 receives timing information indicating “0 seconds of every minute” from the server 1.
  • the user terminal 2-2 receives timing information indicating “20 seconds after 0 seconds of every minute” from the server 1.
  • the user terminal 2-3 receives timing information indicating “40 seconds after 0 seconds of every minute” from the server 1. It is assumed that a certain transmission cycle starts at time T. In this case, the user terminal 2-1 transmits information during the period from time T to T + 20 seconds.
  • the user terminal 2-2 transmits information between times T + 20 and T + 40 seconds.
  • the user terminal 2-3 transmits information between times T + 40 and T + 60 seconds. Further, the user terminal 2-1 transmits information between times T + 60 and T + 80 seconds.
  • the user terminal 2-2 transmits information between times T + 80 and T + 100 seconds.
  • the user terminal 2-3 transmits information between times T + 100 and T + 120 seconds.
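The slot scheme in the FIG. 6 example (P = 60 seconds, N = 3) can be sketched as follows. Function and variable names are illustrative assumptions, not identifiers from the patent.

```python
# Sketch of the offset scheme: with cycle length P seconds and N slots,
# the terminal assigned `slot` (0 <= slot < N) transmits at offset
# slot * P / N within each cycle.

def transmission_times(t_start, P, N, slot, cycles=3):
    """Times at which a terminal assigned `slot` transmits, for a few cycles."""
    return [t_start + i * P + slot * P / N for i in range(cycles)]

# P = 60 s, N = 3, cycle starting at time 0:
t0 = transmission_times(0, 60, 3, slot=0)   # slot 0: 0, 60, 120
t1 = transmission_times(0, 60, 3, slot=1)   # slot 1: 20, 80, 140
```

With the three terminals assigned slots 0, 1, and 2, the server receives detected sound information roughly every P / N = 20 seconds, which is the load-smoothing effect described above.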
  • the comparing unit 113 compares the program sound information acquired by the program information acquisition unit 111 with the detected sound information received by the evaluation information / detection information receiving unit 112. For example, when the program sound information and the detected sound information are waveform information, the comparing unit 113 may compare the waveform information of the sound detected by the user terminal 2, indicated by the detected sound information, with the time series of waveform information of the program sound indicated by the program sound information, and calculate the degree of coincidence between them. For example, the degree of coincidence may be calculated based on the matches and mismatches of the number of samples in each amplitude band.
  • the comparing unit 113 may compare the detected sound information with only the waveform information, out of the time series of waveform information indicated in the program sound information, that was broadcast within a predetermined time before and after the time when the sound was detected at the user terminal 2. The same applies to the case where the program sound information and the detected sound information are feature amounts.
  • the coincidence in the case of the feature amount may be, for example, a cosine similarity.
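The two coincidence measures mentioned above can be sketched as follows: a per-band sample-count match ratio for waveform information, and cosine similarity for feature amounts. The exact formulas here are illustrative choices, not the patent's definitions.

```python
# Sketch of two coincidence measures (formulas are assumptions).
import math

def band_match_degree(counts_a, counts_b):
    """Fraction of samples that agree per amplitude band (0.0 .. 1.0)."""
    agree = sum(min(a, b) for a, b in zip(counts_a, counts_b))
    total = max(sum(counts_a), sum(counts_b)) or 1
    return agree / total

def cosine_similarity(u, v):
    """Cosine similarity between two feature-amount vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0
```

Either measure yields a value that can be compared against a threshold, which is how the evaluation information use determining unit 114 is described as deciding whether the detected sound matches the program sound.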
  • the program evaluation application may be programmed so that the user can select a program to be evaluated using the user terminal 2.
  • the comparison unit 113 compares the program sound information of the selected program with the detected sound information, assuming that the user is viewing the selected program.
  • the comparing unit 113 may compare the program sound information of each of the plurality of programs with the detected sound information, and determine that the program with the highest degree of coincidence is the program being watched by the user.
  • the evaluation information use determining unit 114 determines whether to use the evaluation information received by the evaluation information / detection information receiving unit 112 for program evaluation. For example, as shown in FIG. 4A, when it is determined that the sound detected by the user terminal 2 matches a part of the sound of the program, the evaluation information use determining unit 114 determines that the evaluation information is to be used for the evaluation; when it is determined that they do not match, it may determine that the evaluation information is not to be used for the evaluation.
  • the evaluation information use determining unit 114 determines whether the program sound information has a portion that is determined to match the detected sound information. For example, when the program sound information and the detected sound information are waveform information and the program sound information includes waveform information whose degree of coincidence with the waveform information of the sound detected by the user terminal 2 exceeds a threshold, the evaluation information use determining unit 114 may determine that the detected sound matches the sound of the program. In the present embodiment, accurate voice recognition is not required; if the consistency of the sound waveforms can be determined, it is possible to determine whether the user is actually watching the program.
  • in that case as well, the processing of the evaluation information use determining unit 114 is basically the same. The microphone 28 of the user terminal 2 may detect the user's voice, environmental sounds, and the like together with the sound of the program; therefore, the threshold may be set somewhat low.
  • the evaluation information use determining unit 114 may determine that the detected sound matches the sound of the program when all of the degrees of coincidence exceed the threshold, or when the average value of the degrees of coincidence exceeds the threshold and their standard deviation is smaller than a predetermined value. Note that, in a situation where the microphone 28 of the user terminal 2 detects the user's voice, environmental sounds, or the like, preprocessing may be performed to remove them as noise.
  • the evaluation information use determining unit 114 thus determines whether the detected sound matches or differs from the sound of the program.
  • the match determination can be performed at predetermined time intervals. By shortening the interval, it is possible to accurately determine in real time whether or not the user is watching the program when the user inputs the evaluation information.
  • a user's interest in a program may change by the second as the program progresses, and if the program is not interesting, the user may stop watching or switch to another program partway through. Second-by-second evaluation information for the program is therefore important, and this approach secures the reliability of real-time evaluation information.
  • the evaluation information use determining unit 114 may determine whether to use the evaluation information for evaluating the program based on a comparison between the sound detected by the microphone 28 at, or near, the time when the evaluation information was input by the user and the sound of the program. While the program to be evaluated is being broadcast, whether the user is watching the program may change. The evaluation information use determining unit 114 makes a determination according to such changes in the viewing situation.
  • FIG. 4B is a diagram illustrating an example of a user action that may occur when a method of determining whether to use evaluation information for evaluating a program is performed. As shown in FIG. 4B, a certain user inputs the evaluation information, for example, five minutes after the start of the broadcast of the program.
  • the sound detected around this time coincided with the sound of the program, so this evaluation information is used for evaluation. Thereafter, the user leaves the room where the television receiver 4 is located and inputs evaluation information 30 minutes after the broadcast of the program started. The sound detected at this time did not match the sound of the program, so this evaluation information is not used for evaluation. Thereafter, the user returns to the room and inputs evaluation information 50 minutes after the broadcast of the program started. The sound detected around this time coincided with the sound of the program, so this evaluation information is used for evaluation.
  • the evaluation information use determining unit 114 may determine whether to use the evaluation information for program evaluation based on a comparison between the detected sound information and the program sound information.
  • when the detected sound information is received periodically, the evaluation information use determining unit 114 may make this determination based on the result of comparing the program sound information with the detected sound information indicating the sound detected by the microphone 28 of the user terminal 2 at the time closest to the input time of the evaluation information, before or after that input time.
  • the user terminal 2 may transmit the input time of the evaluation information to the server 1 together with the evaluation information, or may transmit the time at which the evaluation information is transmitted to the server 1 as the input time of the evaluation information. Alternatively, the time at which the server 1 receives the evaluation information from the user terminal 2 may be used as the input time of the evaluation information. Further, the user terminal 2 may transmit the time at which the sound is detected by the microphone 28 to the server 1 together with the detected sound information, or the time at which the detected sound information is transmitted to the server 1 as the time at which the sound is detected. May be sent. Alternatively, the time at which the server 1 receives the detected sound information from the user terminal 2 may be used as the time at which the sound is detected.
  • the evaluation information use determining unit 114 may determine whether to use the evaluation information for evaluating the program based on the results of comparing the program sound information with two or more pieces of detected sound information, among the detected sound information repeatedly received at predetermined time intervals, that indicate sounds detected by the microphone 28 at times relatively close to the input time of the evaluation information. That is, two or more pieces of detected sound information are used to determine whether to use the evaluation information for evaluation. This makes it possible to increase the accuracy of determining whether the user is watching the program to be evaluated.
  • the evaluation information use determining unit 114 may use a predetermined number of pieces of detected sound information whose detection times precede the input time of the evaluation information, in order of closeness to that input time, or may use the detected sound information whose detection times fall within a predetermined time before the input time. Alternatively, it may use a predetermined number of pieces of detected sound information whose detection times are closest to the input time on either side of it, or may use the detected sound information whose detection times fall within a predetermined time before or after the input time.
  • the evaluation information use determining unit 114 may, for example, determine that the evaluation information is to be used for evaluation when the degrees of coincidence for all of the two or more pieces of detected sound information exceed the threshold, or when the average value of the degrees of coincidence exceeds the threshold and their standard deviation is smaller than a predetermined value.
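The multi-sample decision rule just described can be sketched as follows. The threshold and spread limit are illustrative assumptions; the patent does not fix their values.

```python
# Sketch of the decision rule: accept the evaluation as valid either when
# every coincidence degree exceeds the threshold, or when their mean exceeds
# it and their spread is small. Threshold values are assumptions.
import statistics

def use_for_evaluation(degrees, threshold=0.7, max_stdev=0.15):
    """degrees: coincidence degrees for two or more detected-sound samples."""
    if all(d > threshold for d in degrees):
        return True
    mean = statistics.mean(degrees)
    stdev = statistics.pstdev(degrees)
    return mean > threshold and stdev < max_stdev
```

The second branch tolerates one noisy sample (for example, a moment when the user spoke over the program) as long as the samples agree on average and do not vary too much.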
  • in the above description, whether to use the evaluation information for evaluating the program is determined based on only the sound of the program.
  • the evaluation information use determining unit 114 may instead determine whether to use the evaluation information based on only the video of the program, or based on both the video and the sound.
  • the user points the lens of the camera 29 of the user terminal 2 to the television receiver 4.
  • the detection information acquisition unit 212 of the user terminal 2 causes the camera 29 to detect an image, for example, periodically or when evaluation information is input.
  • the detection information obtaining unit 212 may generate the detected video information by extracting the feature amount of the video from the video data output from the camera 29, for example.
  • the feature amount may be extracted using an algorithm such as SIFT (Scale-Invariant Feature Transform) or SURF (Speeded-Up Robust Features).
  • the evaluation information / detection information transmitting unit 213 transmits the detected video information to the server 1.
  • the program information acquisition unit 111 of the server 1 may cause a tuner to receive a broadcast signal from the broadcast station or acquire video data via the network NW, extract a feature amount of the video of the program, and generate program video information.
  • the comparing unit 113 compares the program video information with the detected video information, and the evaluation information use determining unit 114 determines whether to use the evaluation information for evaluating the program based on the comparison.
  • the details and modifications when using video may be the same as when using sound.
  • the evaluation information use determination unit 114 may determine that the evaluation information is used for evaluating the program, for example, when it is determined that the video matches and the sound matches.
  • the evaluation unit 115 executes a process of evaluating the program based on the evaluation information that the evaluation information use determining unit 114 has determined to use, among the evaluation information received by the evaluation information / detection information receiving unit 112. For example, the evaluation unit 115 counts and analyzes the evaluation information. The evaluation unit 115 may calculate the totals for each evaluation item such as “boring” and “interesting”, and the number of evaluations input at each time from the start to the end of the broadcast of the program. In addition, the evaluation unit 115 may count the total number of evaluations overall or the number of evaluations input at each time.
  • the evaluation unit 115 may generate information indicating the transition of the number of evaluations, information indicating a ranking of the times at which the number of evaluations was large, and the like. In addition, the evaluation unit 115 may count the number of users who participated in the evaluation of the program, and the number of users for whom the evaluation information use determining unit 114 determined to use at least one piece of evaluation information (that is, the number of users who made at least one effective evaluation). In addition, the evaluation unit 115 may generate a distribution of the attributes of the users who made effective evaluations. Further, the evaluation unit 115 may generate a list of comments. The evaluation unit 115 may generate a report as the program evaluation result.
  • the server 1 transmits the generated report via the network NW, for example, in response to a request from a terminal device (not shown) in the broadcasting station 3.
  • the process of evaluating a program may be executed by, for example, a terminal device used by a manager of the program evaluation system S.
  • the evaluation unit 115 may give a privilege such as points to the ID of a user who has performed an evaluation. As a result, a large amount of evaluation information can be collected by leveraging users' motivation, and the reliability of the evaluation result can be increased by enlarging the population.
  • the server 1 acquires program sound information in advance and stores it in the storage unit 14. It is assumed that the user terminal 2 periodically detects a sound and transmits detected sound information to the server 1. The server 1 uses the detected sound information received before the evaluation information reception time and at the time closest to the reception time, and determines whether or not to use the evaluation information for program evaluation. It is assumed that waveform information is used as the program sound information and the detected sound information.
  • FIG. 8 is a flowchart illustrating an example of a terminal process executed by the system control unit 21 of the user terminal 2.
  • the user activates the program evaluation application and performs a program survey start operation.
  • the system control unit 21 executes terminal processing according to the program evaluation application.
  • the evaluation information / detection information transmitting unit 213 transmits a survey start notification to the server 1 together with the user ID of the user who uses the user terminal 2 (step S1).
  • the evaluation information / detection information transmission unit 213 receives the timing information from the server 1 and stores it in the RAM 21c (Step S2).
  • the detection information acquisition unit 212 determines whether the sound detection timing has arrived based on the current time (step S3). For example, the detection information acquisition unit 212 determines the detection timing so as to be in time for the transmission timing, based on the transmission timing of the detected sound information indicated by the timing information, the duration of sound detection, and the like. If the detection information acquisition unit 212 determines that the detection timing has arrived (step S3: YES), the process proceeds to step S4. In step S4, the detection information acquisition unit 212 causes the microphone 28 to detect a sound. Next, the detection information acquisition unit 212 extracts waveform information from the audio signal output from the microphone 28 as detected sound information (step S5).
  • in step S3, when the detection information acquisition unit 212 determines that the detection timing has not arrived (step S3: NO), it proceeds with the process to step S6.
  • in step S6, the evaluation information / detection information transmitting unit 213 determines, based on the current time, whether the transmission timing indicated by the timing information has arrived. When it determines that the transmission timing has arrived (step S6: YES), the evaluation information / detection information transmitting unit 213 advances the process to step S7.
  • in step S7, the evaluation information / detection information transmitting unit 213 transmits the detected sound information stored in the RAM 21c to the server 1 together with the user ID.
  • in step S6, when the evaluation information / detection information transmitting unit 213 determines that the transmission timing has not arrived (step S6: NO), the process proceeds to step S8.
  • in step S8, the evaluation information acquisition unit 211 determines whether evaluation information has been input, based on a signal from the operation input unit 26. When determining that the evaluation information has been input (step S8: YES), the evaluation information acquisition unit 211 advances the process to step S9. In step S9, the evaluation information acquisition unit 211 transmits the input evaluation information to the server 1 together with the user ID.
  • when step S5, S7, or S9 is completed, or when it is determined in step S8 that the evaluation information has not been input (step S8: NO), the system control unit 21 determines whether the broadcast end time of the program to be evaluated has arrived (step S10). If it determines that the end time has not arrived (step S10: NO), the system control unit 21 returns the process to step S3. On the other hand, when it determines that the end time has arrived (step S10: YES), the system control unit 21 ends the terminal processing.
  • FIG. 9 is a flowchart illustrating an example of a server process executed by the system control unit 11 of the server 1.
  • the system control unit 11 starts the server processing, for example, a predetermined time before the start of the broadcast of the program to be evaluated, according to the server program.
  • the evaluation information / detection information receiving unit 112 determines whether a survey start notification has been received from any of the user terminals 2 (step S21). When the evaluation information / detection information receiving unit 112 determines that the survey start notification has been received (step S21: YES), the process proceeds to step S22. In step S22, the evaluation information / detection information receiving unit 112 stores a viewing flag set to FALSE in the RAM 11c in association with the user ID received together with the survey start notification. The viewing flag is information indicating whether the user is viewing the program to be evaluated. Next, the evaluation information / detection information receiving unit 112 determines one of a plurality of predetermined transmission timings, for example at random (step S23). The evaluation information / detection information receiving unit 112 then transmits timing information indicating the determined transmission timing to the user terminal 2 that transmitted the survey start notification (step S24).
  • When the evaluation information / detection information receiving unit 112 determines in step S21 that the survey start notification has not been received (step S21: NO), the process proceeds to step S25.
  • In step S25, the evaluation information / detection information receiving unit 112 determines whether the detected sound information has been received from any of the user terminals 2. If it determines that the detected sound information has been received (step S25: YES), the process proceeds to step S26.
  • In step S26, the comparing unit 113 calculates the degree of coincidence between the waveform information indicated in the received detected sound information and each piece of waveform information included in the program information. The comparing unit 113 then determines whether any piece of waveform information included in the program information has a calculated degree of coincidence exceeding a threshold. When the comparing unit 113 determines that there is waveform information whose degree of coincidence exceeds the threshold (step S26: YES), the viewing flag associated with the user ID received together with the detected sound information is set to TRUE (step S27). When the comparing unit 113 determines that there is no waveform information whose degree of coincidence exceeds the threshold (step S26: NO), the viewing flag associated with the user ID received together with the detected sound information is set to FALSE (step S28).
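The patent leaves the degree-of-coincidence calculation of step S26 unspecified. One minimal sketch, assuming each piece of waveform information is a short sequence of audio samples and using normalized correlation as a hypothetical coincidence measure (both the representation and the `coincidence`/`viewing_flag` names are illustrative, not from the source):

```python
def coincidence(a, b):
    """Normalized correlation between two equal-length sample sequences.

    Returns a value in [-1, 1]; 1.0 means identical waveform shape.
    (Hypothetical measure -- the patent does not fix a formula.)
    """
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    da = [x - mean_a for x in a]
    db = [x - mean_b for x in b]
    num = sum(x * y for x, y in zip(da, db))
    den = (sum(x * x for x in da) * sum(y * y for y in db)) ** 0.5
    return num / den if den else 0.0


def viewing_flag(detected, program_waveforms, threshold=0.9):
    """Step S26: TRUE if any program waveform matches the detected sound."""
    return any(coincidence(detected, w) > threshold for w in program_waveforms)
```

A threshold near 1.0 makes the match strict; a practical system would compare noise-robust acoustic fingerprints rather than raw samples.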
  • When the evaluation information / detection information receiving unit 112 determines in step S25 that the detected sound information has not been received (step S25: NO), the process proceeds to step S29.
  • In step S29, the evaluation information / detection information receiving unit 112 determines whether evaluation information has been received from any of the user terminals 2. If it determines that the evaluation information has been received (step S29: YES), the process proceeds to step S30. In step S30, the evaluation information use determining unit 114 determines whether the viewing flag associated with the user ID received along with the evaluation information is TRUE. When it determines that the viewing flag is TRUE (step S30: YES), the process proceeds to step S31. In step S31, the evaluation information use determining unit 114 sets the input time of the received evaluation information to the reception time of the evaluation information. The evaluation information use determining unit 114 then stores the evaluation information, the input time, and the user ID in the storage unit 14 in association with each other (step S32). On the other hand, when it determines that the viewing flag is FALSE (step S30: NO), the evaluation information use determining unit 114 discards the received evaluation information (step S33).
  • When step S24, S27, S28, S32, or S33 is completed, or when it is determined in step S29 that the evaluation information has not been received (step S29: NO), the evaluation unit 115 determines whether the broadcast end time of the program to be evaluated has arrived (step S34). If it determines that the end time has not arrived (step S34: NO), the evaluation unit 115 advances the processing to step S21. On the other hand, when it determines that the end time has arrived (step S34: YES), the evaluation unit 115 executes a program evaluation process using the evaluation information (step S35).
  • the evaluation unit 115 counts and analyzes the evaluation information stored in the storage unit 14.
  • For example, the evaluation unit 115 may use the input time to calculate the transition of the number of evaluations for each evaluation item, or to rank the times at which the number of evaluations was highest.
  • the evaluation unit 115 generates a report indicating the evaluation result, and causes the storage unit 14 to store the report.
  • the evaluation unit 115 ends the server processing.
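The tallying in the program evaluation process (step S35) is only loosely described. A sketch under assumed data shapes (`records` as (evaluation_item, input_time_seconds) pairs and the one-minute time bucket are both illustrative assumptions):

```python
from collections import Counter


def evaluate_program(records, bucket=60):
    """Program evaluation process, sketched: count evaluations per item
    and per time bucket, and rank the buckets by activity (most active
    first), mirroring the per-item transitions and time ranking above."""
    per_item = Counter(item for item, _ in records)
    per_bucket = Counter(int(t // bucket) for _, t in records)
    ranking = [b for b, _ in per_bucket.most_common()]
    return per_item, per_bucket, ranking
```

In the patented system, the resulting report would be stored in the storage unit 14 rather than returned to a caller.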
  • the user terminal 2 detects the sound output by the television receiver 4 that outputs the video and the sound constituting the program. Also, the user terminal 2 transmits the evaluation information input by the user and the detected sound information indicating the detected sound to the server 1.
  • The server 1 acquires program sound information indicating the sound that constitutes the program, and receives the evaluation information and the detected sound information from the user terminal 2. The server 1 compares the acquired program sound information with the received detected sound information. If the server 1 determines from the comparison result that there is a predetermined match between the program sound information and the received detected sound information, the server 1 selects the received evaluation information to be used for evaluating the program. It is thus possible to estimate whether the user is actually watching the program to be evaluated, so that only evaluation information for which the user is estimated to be watching the program is used for evaluating the program. This ensures the reliability of the evaluation information.
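The selection step summarized above reduces to a single filtering operation; the `matches` predicate and the (evaluation, detected sound) message shape below are assumptions, since the patent leaves the concrete comparison open:

```python
def select_evaluations(messages, program_sound_info, matches):
    """Keep only evaluation information whose accompanying detected sound
    information shows the predetermined match with the program sound
    information; everything else is excluded from the evaluation."""
    return [ev for ev, detected in messages if matches(detected, program_sound_info)]


# Toy predicate for illustration: "detected snippet occurs in the program sound".
msgs = [("liked scene", "abba"), ("great", "zzzz")]
kept = select_evaluations(msgs, "xxabbayy", lambda d, p: d in p)
```

Only the first message survives here, because its detected snippet appears in the program sound while the second does not.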
  • The content may be a broadcast program.
  • The server 1 may determine the transmission timing for each of the plurality of user terminals 2 so that the transmission timing of at least one of the evaluation information and the detected sound information by at least one user terminal 2 among the plurality of user terminals 2 during the broadcast of the program differs from the transmission timing by at least one other user terminal 2 among the plurality of user terminals 2.
  • the server 1 may transmit timing information indicating the determined transmission timing to each of the plurality of user terminals 2.
  • Each user terminal 2 may receive timing information from the server 1. Further, each user terminal 2 may transmit at least one of the evaluation information and the detected sound information according to the transmission timing indicated in the received timing information.
  • In this case, the transmission timings of at least one of the evaluation information and the detected sound information are distributed over a plurality of points in time. Therefore, the processing load of the server 1 can be prevented from being concentrated at a single point in time, and the load can be distributed.
  • the server 1 may determine the transmission timing of at least one other user terminal 2 during the interval of the transmission timing of at least one user terminal 2 among the plurality of user terminals 2. In this case, the number of pieces of detection information received by the server 1 per unit time is made uniform, so that the processing load on the server device can be further dispersed.
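The interleaved-timing idea above can be sketched as follows; the even spacing and the 60-second `period` are illustrative choices, since the patent only requires that the timings differ (and, in the refinement, fall inside each other's intervals):

```python
def assign_timings(user_ids, period=60.0):
    """Spread each terminal's periodic transmission offset evenly over one
    period so the server never receives all reports at the same instant.

    Terminal i gets offset i * period / n; it then reports at
    offset, offset + period, offset + 2 * period, and so on.
    """
    n = len(user_ids)
    return {uid: i * period / n for i, uid in enumerate(user_ids)}
```

With four terminals and a 60-second period, the offsets land at 0, 15, 30, and 45 seconds, which makes the number of reports received per unit time uniform, as the text above describes.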
  • the user terminal 2 may repeatedly detect the sound at predetermined time intervals and transmit the detected sound information.
  • The server 1 may determine whether to use the evaluation information for evaluating the program based at least on the result of comparing, with the program sound information, the piece of detected sound information that, among the pieces of detected sound information repeatedly received at predetermined time intervals, indicates a sound detected by the user terminal 2 at the time closest to the input time of the evaluation information, before or after that input time. In this case, even in a mode in which the user terminal 2 periodically performs sound detection and transmission of the detected sound information, it is possible to increase the accuracy of estimating whether the user is watching the program at the time when the evaluation information is input.
  • the server 1 compares the program sound information with two or more pieces of detected sound information indicating sounds detected at a time relatively close to the input time of the evaluation information among the detected sound information received periodically. Based on the result, it may be determined whether to use the evaluation information for evaluating the program. In this case, the accuracy of estimating whether or not the user is watching the program at the time when the evaluation information is input can be further increased.
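The nearest-in-time selection described above might look like the following sketch; the (detection_time, sound_info) tuple shape, the `matches` predicate, and the function names are all assumptions for illustration:

```python
def nearest_detections(detections, input_time, k=1):
    """Pick the k detected-sound records whose detection times lie closest
    to the evaluation input time, before or after it."""
    return sorted(detections, key=lambda d: abs(d[0] - input_time))[:k]


def use_evaluation(detections, input_time, program_sound_info, matches, k=1):
    """Use the evaluation only if every one of the k nearest detections
    matches the program sound information (k=1 for the single-nearest
    variant, k>=2 for the stricter two-or-more variant above)."""
    near = nearest_detections(detections, input_time, k)
    return bool(near) and all(matches(s, program_sound_info) for _, s in near)
```

Raising `k` trades recall for the higher estimation accuracy the text mentions: a single coincidental match is no longer enough.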
  • the user terminal 2 may detect a sound when the evaluation information is input.
  • the server 1 may determine that the evaluation information received together with the detected sound information is used for evaluating the program. In this case, the accuracy of estimating whether the user is watching the program at the time when the evaluation information is input can be improved.
  • the user terminal 2 detects sound at the timing when the evaluation information is input, and acquires the detected sound information.
  • the transmission timing of the evaluation information and the detected sound information may be a point in time when the detected sound information is detected, or may be a regular period.
  • the server 1 specifies the time at which the evaluation information was input to the user terminal 2 based on the detected sound information.
  • The evaluation information use determining unit 114 determines that the evaluation information received together with the detected sound information from the user terminal 2 is used for evaluating the program.
  • The evaluation information use determining unit 114 may specify, as the input time of the evaluation information, the time at which the television receiver 4 outputs (broadcasts), in the program to be evaluated, the sound indicated by the portion determined to match the detected sound information received together with the evaluation information. This time may be an absolute time or a relative time from the broadcast start time. As a result, the program can be evaluated using a highly accurate input time.
  • This input time specifying method is also effective for on-demand distribution in which the distribution time zone is not predetermined.
  • As the detected sound information, the feature amount of a sound may be used, but feature amount specifying information capable of specifying the feature amount may be used instead.
  • This feature amount specifying information has a smaller information amount than the feature amount itself.
  • The feature amount specifying information is basically different for each feature amount in at least one program. When there are a plurality of programs to be evaluated, it is preferable that the feature amount specifying information also differs between the programs.
  • The feature amount specifying information may be, for example, a hash value indicating a summary of the feature amount, or identification information given to the feature amount based on a predetermined criterion.
  • The feature amount specifying information may include the broadcast time of the sound corresponding to the feature amount.
  • Similarly, instead of waveform information, information that can specify the waveform information and that has a smaller information amount than the waveform information may be used.
  • FIG. 10 is a diagram illustrating an example of a processing outline in the program evaluation system S.
  • The program information acquisition unit 111 generates feature amount time-series data representing the time series of the feature amounts of the sound of the program from the audio data of the program acquired via the network NW. In other words, the audio data of the program is divided into a plurality of pieces, and those pieces, arranged in chronological order, are each converted into a feature amount and stored in the storage unit 14. Further, the program information acquisition unit 111 converts the generated feature amount time-series data into specific information time-series data.
  • the program information acquisition unit 111 generates a hash value of each feature amount of the feature amount time-series data using a predetermined hash function. Then, the program information acquisition unit 111 generates specific information time-series data composed of a time series of hash values as program sound information and causes the storage unit 14 to store the information.
  • each hash value may be associated with a time at which a sound corresponding to the hash value is broadcast in the program to be evaluated.
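The conversion from feature amount time-series data to specific information (hash) time-series data can be sketched with Python's standard `hashlib`; representing a feature amount as a tuple of numbers and truncating SHA-256 to 16 hex digits are both assumptions for illustration, not details from the source:

```python
import hashlib


def to_program_sound_info(feature_series):
    """Convert feature amount time-series data into specific information
    time-series data: one short hash per feature amount, paired with the
    broadcast time of the corresponding sound (as in FIG. 10)."""
    info = []
    for broadcast_time, feature in feature_series:
        digest = hashlib.sha256(repr(feature).encode()).hexdigest()[:16]
        info.append((broadcast_time, digest))
    return info
```

Because the hash is deterministic, a terminal using the same function on a matched feature amount produces a value the server can look up directly, while each hash is far smaller than the feature amount it summarizes.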
  • the user terminal 2 acquires the feature amount time-series data as program sound feature amount information in advance. For example, the user operates the user terminal 2 before the broadcast of the program to be evaluated starts, and preliminarily enters a program survey. Upon receiving the entry notification from the user terminal 2, the server 1 transmits the program sound feature information. The user terminal 2 causes the storage unit 24 to store the program sound feature information.
  • the detection information acquisition unit 212 causes the microphone 28 to detect a sound, and receives an audio signal from the microphone 28.
  • the detection information acquisition unit 212 extracts a feature amount from the audio signal, compares the feature amount with each feature amount in the program sound feature amount information, and calculates a degree of coincidence.
  • The detection information acquisition unit 212 generates the hash value of the feature amount using the same hash function as that used in the server 1 (for example, a hash function included in the program evaluation application).
  • the detection information acquisition unit 212 acquires this hash value as detection sound information.
  • the evaluation information / detection information transmitting unit 213 transmits a hash value as detected sound information to the server 1 together with the evaluation information.
  • The comparison unit 113 of the server 1 compares the detected sound information received from the user terminal 2 with each hash value in the time series of hash values serving as the program sound information. As a result of the comparison, when there is a hash value in the time series that matches the detected sound information, the evaluation information use determining unit 114 determines that the evaluation information is used for evaluating the program. The evaluation information use determining unit 114 specifies, as the input time of the evaluation information, the time at which the sound corresponding to the hash value matching the detected sound information is broadcast in the program to be evaluated.
  • the accuracy of the input time used for the evaluation is increased, and the communication amount between the server 1 and the user terminal 2 during the broadcast of the program can be reduced.
  • This reduction in the communication amount is realized by reducing the information amount of the detected sound information, replacing the feature amount with the feature amount specifying information. Further, when the sound detected by the microphone 28 does not match the sound of the program, the user terminal 2 does not need to transmit either the detected sound information or the evaluation information to the server 1, so the number of communications is also reduced.
  • In the example of FIG. 10, the user terminal 2 has determined that the feature amount 102 in the program sound feature amount information matches the detected sound information, and therefore transmits the hash value of the feature amount 102 to the server 1. On the server 1 side, the hash value 102 originally generated from the feature amount 102 in the program sound information matches the hash value received from the user terminal 2. Therefore, the time T102 associated with the hash value 102 is specified as the input time of the evaluation information.
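The server-side lookup in this example (feature amount 102 → hash → time T102) reduces to a search over the hash time series. In the sketch below, `short_hash` stands in for whatever hash function the application would actually share between server and terminal; the names and shapes are illustrative:

```python
import hashlib


def short_hash(feature):
    """Illustrative feature amount specifying information: a truncated
    SHA-256 of the feature amount's textual form."""
    return hashlib.sha256(repr(feature).encode()).hexdigest()[:16]


def input_time_from_hash(program_sound_info, received_hash):
    """Server-side lookup of FIG. 10: return the broadcast time whose hash
    matches the hash received from the terminal, or None when there is no
    match (in which case the evaluation information would be discarded)."""
    for broadcast_time, h in program_sound_info:
        if h == received_hash:
            return broadcast_time
    return None
```

The terminal transmits `short_hash(matched_feature)`; the server maps it back to a broadcast time and records that time as the input time of the accompanying evaluation information.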
  • The operation of the program evaluation system S will be described with reference to FIGS. 11 and 12.
  • the user terminal 2 transmits the detected sound information to the server 1 together with the evaluation information at the timing when the evaluation information is input.
  • the user terminal 2 previously acquires the program sound feature amount information and stores it in the storage unit 24.
  • FIG. 11 is a flowchart illustrating an example of a terminal process executed by the system control unit 21 of the user terminal 2. In FIG. 11, the same processes as those in FIG. 8 are denoted by the same reference numerals.
  • the evaluation information acquisition unit 211 determines whether the evaluation information has been input (step S8). When it is determined that the evaluation information has been input (step S8: YES), the evaluation information acquisition unit 211 proceeds to step S4. In step S4, the detection information acquisition unit 212 causes the microphone 28 to detect a sound. Next, the detection information acquisition unit 212 extracts a feature amount from the audio signal output from the microphone 28 (Step S41). Next, the detection information acquisition unit 212 compares the generated feature amount with each feature amount in the program sound feature amount information. The evaluation information acquisition unit 211 determines whether or not a feature amount whose degree of coincidence with the generated feature amount exceeds a threshold exists in the program sound feature amount information (step S42).
  • When it is determined in step S42 that there is a feature amount whose degree of coincidence exceeds the threshold (step S42: YES), the process proceeds to step S43. In step S43, the detection information acquisition unit 212 generates the hash value of the feature amount having the highest degree of coincidence among the feature amounts in the program sound feature amount information whose degree of coincidence with the generated feature amount exceeds the threshold.
  • the evaluation information / detection information transmitting unit 213 transmits the evaluation information and the generated hash value to the server 1 together with the user ID (Step S44).
  • When step S44 is completed, when it is determined in step S8 that the evaluation information has not been input (step S8: NO), or when it is determined in step S42 that there is no feature amount whose degree of coincidence exceeds the threshold (step S42: NO), the system control unit 21 determines whether the broadcast end time of the program to be evaluated has arrived (step S10). If it determines that the end time has not arrived (step S10: NO), the system control unit 21 advances the processing to step S8. On the other hand, when it determines that the end time has arrived (step S10: YES), the system control unit 21 ends the terminal processing.
  • FIG. 12 is a flowchart illustrating an example of a server process executed by the system control unit 11 of the server 1.
  • the same processes as those in FIG. 9 are denoted by the same reference numerals.
  • The evaluation information / detection information receiving unit 112 determines whether evaluation information and a hash value have been received from any of the user terminals 2 (step S51). When it determines that the evaluation information and the hash value have been received (step S51: YES), the process proceeds to step S52. In step S52, the comparison unit 113 determines whether a hash value that matches the received hash value exists in the program sound information. If it is determined that there is a hash value that matches the received hash value (step S52: YES), the process proceeds to step S53.
  • step S53 the evaluation information use determining unit 114 sets the input time of the received evaluation information to the broadcast time associated with the hash value that matches the received hash value.
  • the evaluation information use determining unit 114 stores the evaluation information, the input time, and the user ID in the storage unit 14 in association with each other (Step S32).
  • Otherwise (step S52: NO), the evaluation information use determining unit 114 discards the received evaluation information (step S33).
  • When step S32 or S33 is completed, or when it is determined in step S51 that the evaluation information and the hash value have not been received (step S51: NO), the evaluation unit 115 determines whether the broadcast end time of the program to be evaluated has arrived (step S34). If the evaluation unit 115 determines that the end time has not arrived (step S34: NO), the processing proceeds to step S51. On the other hand, when the evaluation unit 115 determines that the end time has arrived (step S34: YES), it executes a program evaluation process using the evaluation information (step S35) and ends the server process.
  • As described above, the server 1 specifies, as the input time of the evaluation information received together with the detected sound information, the time at which the television receiver 4 outputs the sound indicated by the portion of the program sound information determined to match the detected sound information. It is therefore possible to appropriately identify the scene of the program for which the evaluation information was input, so that the program can be appropriately evaluated.
  • the server 1 may acquire, as the program sound information, the time series of the feature amount specifying information for specifying each of the feature amounts in the time series of the feature amounts of the sounds constituting the program content.
  • The user terminal 2 may acquire in advance program sound feature amount information indicating the time series of the sound feature amounts. Further, when the program sound feature amount information includes a feature amount whose degree of coincidence with the feature amount of the detected sound exceeds a threshold, the user terminal 2 may transmit, as the detected sound information, the feature amount specifying information for specifying that feature amount.
  • The server 1 may specify, as the input time of the evaluation information, the output time of the sound corresponding to the feature amount specifying information in the program sound information that matches the feature amount specifying information received from the user terminal 2. In this case, since the information amount of the detected sound information is reduced, the communication load on the user terminal 2 and the server 1 can be reduced.
  • 1 server 2 user terminal 11 system control unit 12 system bus 13 input / output interface 14 storage unit 15 communication unit 111 program information acquisition unit 112 evaluation information / detection information reception unit 113 comparison unit 114 evaluation information use determination unit 115 evaluation unit 21 system control unit 22 system bus 23 input / output interface 24 storage unit 25 communication unit 26 operation input unit 27 display unit 28 microphone 29 camera 211 evaluation information acquisition unit 212 detection information acquisition unit 213 evaluation information / detection information transmission unit NW network

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Databases & Information Systems (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)

Abstract

The purpose of the present invention is to ensure the reliability of content evaluation information. The terminal device detects a video or a sound output from an output device that outputs a video and/or a sound constituting content. The terminal device transmits evaluation information input by a user regarding the content, together with detection information indicating the detected video or sound, to the server device. The server device acquires content information indicating the video or the sound constituting the content, receives the evaluation information and the detection information from the terminal device, and compares the acquired content information with the received detection information. When the comparison result shows a prescribed degree of matching between the content information and the detection information, the server device selects the received evaluation information to be used for evaluating the content.

Description

Evaluation system, server device, terminal device, information processing method, and information processing program
The present invention relates to the technical field of an evaluation system that allows a user to view content composed of at least one of video and sound and to input evaluation information for that content.
Conventionally, systems are known that enable a viewer of content such as a television broadcast program to input evaluation information for that content, and that tally and analyze the evaluation information from each viewer. For example, when a predetermined button is pressed on the remote control of a television receiver while a program is being viewed, options relating to the program are displayed on the screen of the television receiver. When one of the color buttons on the remote control is pressed, the option corresponding to the pressed button is selected, and the selection result is transmitted from the television receiver to the broadcast station. Patent Literature 1 discloses a technique for ascertaining the tastes of television broadcast viewers. Specifically, the remote control of a set-top box is provided with a notification button for signaling the viewer's preference for a person, object, music, or the like currently on the air. When the notification button is pressed, the set-top box records this as a viewing event in a viewing log and transmits the viewing log to the broadcasting device at predetermined time intervals. The broadcasting device counts the viewing events that match conditions set by the program creator or the like.
In the above-described technology, a dedicated remote control is used with a television receiver or set-top box that receives the television broadcast signal and displays the video of the television program, and the evaluation information is transmitted to the broadcast station or program creator via the television receiver or set-top box. It is therefore guaranteed that the user actually viewed the program when entering the evaluation information, so the reliability of the input evaluation information is ensured.
However, when a general-purpose terminal device such as a mobile phone or tablet computer is used for inputting the evaluation information, and the evaluation information of each user for the content is tallied and analyzed by a highly versatile system that can also serve other purposes, the issue becomes how to ensure the reliability of the evaluation information. The reason is that, with such a terminal device, the device cannot be dedicated to evaluation input for the television receiver, so the user's act of entering evaluation information can easily occur independently of actually watching the program on the television receiver.
Patent Literature 2 discloses that a portable communication device acquires, with a built-in microphone, the sound output from the speaker of a television receiver and stores the sound data, acquires broadcast program data with a built-in tuner and extracts its audio data, and stores data indicating the viewing status of the broadcast program based on a comparison between the two sets of audio data. However, while the technique disclosed in Patent Literature 2 attempts to evaluate a viewer's viewing status of a broadcast program, it does not aim to evaluate the broadcast program itself.
JP 2000-333154 A JP 2017-060060 A
The present invention has been made in view of the above points, and aims to provide an evaluation system, server device, terminal device, information processing method, and information processing program capable of ensuring the reliability of evaluation information for content even when the content is evaluated using a general-purpose terminal device.
To solve the above problem, one embodiment of the present invention is an evaluation system including a terminal device and a server device connected to the terminal device via a network, wherein the terminal device comprises: input means by which a user inputs evaluation information for content composed of at least one of video and sound; detection means for detecting the video or sound output by an output device that outputs at least one of the video and the sound constituting the content; and transmission means for transmitting the input evaluation information and detection information indicating the detected video or sound to the server device. The server device comprises: acquisition means for acquiring content information indicating the video or the sound constituting the content; reception means for receiving the evaluation information and the detection information from the terminal device; comparison means for comparing the acquired content information with the received detection information; and selection means for selecting the received evaluation information to be used for evaluating the content when the comparison result shows a predetermined match between the content information and the detection information.
According to this invention, based on the comparison between the detection information indicating the video or sound detected by the terminal device and the content information indicating the video or sound of the content acquired by the server device, it is possible to estimate whether the user is actually watching or listening to the content to be evaluated. Therefore, evaluation information presumed to have been input while the user was watching or listening to the content can be preferentially used for evaluating the content, ensuring the reliability of the evaluation information.
 本発明の一の形態は、さらに、前記コンテンツは、放送されるコンテンツであり、前記コンテンツの放送中に、前記端末装置として複数の端末装置それぞれから前記サーバ装置へ前記評価情報及び前記検出情報が送信され、前記サーバ装置は、前記複数の端末装置のうち少なくとも一の端末装置による前記評価情報及び前記検出情報の少なくとも何れか一方の送信タイミングが、前記複数の端末装置のうち他の少なくとも一の端末装置による送信タイミングと相違するように、前記複数の端末装置それぞれについて、前記送信タイミングを決定するタイミング決定手段と、前記決定された送信タイミングを示すタイミング情報を、前記複数の端末装置それぞれに送信するタイミング情報送信手段と、を更に備え、前記端末装置は、前記サーバ装置から前記タイミング情報を受信するタイミング情報受信手段を更に備え、前記送信手段は、前記受信されたタイミング情報に従って、前記評価情報及び前記検出情報の少なくとも何れか一方を送信することを特徴とする。 In one aspect of the present invention, the content is broadcast content, and during the broadcast of the content, the evaluation information and the detection information are transmitted to the server device from each of a plurality of terminal devices serving as the terminal device. The server device further comprises: timing determination means for determining, for each of the plurality of terminal devices, a transmission timing such that the transmission timing of at least one of the evaluation information and the detection information by at least one of the plurality of terminal devices differs from the transmission timing by at least one other of the plurality of terminal devices; and timing information transmission means for transmitting timing information indicating the determined transmission timing to each of the plurality of terminal devices. The terminal device further comprises timing information reception means for receiving the timing information from the server device, and the transmission means transmits at least one of the evaluation information and the detection information in accordance with the received timing information.
 この発明によれば、コンテンツの放送中に複数の端末装置それぞれからサーバ装置へ評価情報及び検出情報が送信される態様であっても、評価情報及び検出情報の少なくとも何れか一方の送信タイミングが複数に分散される。従って、サーバ装置の処理負荷を時間軸方向に分散させることができる。 According to this aspect, even when the evaluation information and the detection information are transmitted from each of a plurality of terminal devices to the server device during the broadcast of the content, the transmission timings of at least one of the evaluation information and the detection information are spread out, so the processing load of the server device can be distributed along the time axis.
 本発明の一の形態は、さらに、前記タイミング情報送信手段は、前記少なくとも一の端末装置による送信タイミングの間隔の間に、前記他の少なくとも一の端末装置による送信タイミングを決定することを特徴とする。 In one aspect of the present invention, the timing information transmission means determines the transmission timing of the at least one other terminal device so that it falls within an interval between transmission timings of the at least one terminal device.
 この発明によれば、少なくとも一の端末装置による送信タイミングと他の少なくとも一の端末装置による送信タイミングとがずれるので、単位時間当たりにサーバ装置が受信する情報の数が均一化され、サーバ装置の処理負荷を更に分散させることができる。 According to this aspect, since the transmission timing of at least one terminal device is offset from that of at least one other terminal device, the number of pieces of information the server device receives per unit time is evened out, and the processing load of the server device can be distributed further.
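 The publication leaves the concrete scheduling method open. One hypothetical way to realize the staggered transmission timings described above is for the server to assign each terminal an evenly spaced offset within a reporting period, so that each terminal's timing falls inside the interval between any other terminal's consecutive transmissions:

```python
def assign_send_offsets(terminal_ids, period_s=60.0):
    """Spread the terminals' reporting times evenly across one
    reporting period so the server's receive load is flat over time.
    Terminal i sends at offset i*step and then every period_s seconds."""
    step = period_s / len(terminal_ids)
    return {tid: round(i * step, 3) for i, tid in enumerate(terminal_ids)}

offsets = assign_send_offsets(["t1", "t2", "t3", "t4"], period_s=60.0)
print(offsets)  # {'t1': 0.0, 't2': 15.0, 't3': 30.0, 't4': 45.0}
```

 The per-terminal offset would be delivered as the "timing information"; each terminal then transmits its evaluation and/or detection information according to the received offset. Even spacing is one choice among many; any assignment that makes the timings differ satisfies the claim language.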
 本発明の一の形態は、さらに、前記検出手段及び前記送信手段は、所定時間間隔を置いて繰り返し前記映像又は前記音を検出して前記検出情報を送信し、前記選択手段は、前記受信手段により所定時間間隔を置いて繰り返し受信される前記検出情報のうち、前記評価情報の入力時刻以前又は以後で前記入力時刻に最も近い時刻に前記端末装置により検出される映像又は音を示す検出情報と前記コンテンツ情報との比較結果に少なくとも基づいて、前記選択を行うことを特徴とする。 In one aspect of the present invention, the detection means and the transmission means repeatedly detect the video or the sound at predetermined time intervals and transmit the detection information, and the selection means performs the selection based at least on a comparison between the content information and, among the detection information repeatedly received by the reception means at the predetermined time intervals, the detection information indicating the video or sound detected by the terminal device at the time closest to the input time of the evaluation information, either before or after that input time.
 この発明によれば、サーバ装置は、ユーザが評価情報を入力した時刻に近い時刻に検出された映像又は音を示す検出情報を用いて、評価情報をコンテンツの評価に用いるか否かを判定する。そのため、端末装置が映像又は音の検出及び検出情報の送信を定期的に実行する態様においても、評価情報を入力した時点においてユーザがコンテンツを観ているか否か又は聴いているか否かの推定精度を高めることができる。 According to this aspect, the server device uses the detection information indicating the video or sound detected at a time close to the time at which the user input the evaluation information to determine whether to use that evaluation information for evaluating the content. Therefore, even in a mode in which the terminal device periodically detects the video or sound and transmits the detection information, the accuracy of estimating whether the user was watching or listening to the content at the time the evaluation information was input can be increased.
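 As a hypothetical sketch of the selection rule above (the timestamps and fingerprint strings are invented for illustration), the server can pick, from the periodically received detection information, the entry detected closest in time to the evaluation's input time:

```python
def nearest_detection(detections, input_time):
    """detections: list of (detected_at, detection_info) tuples received
    at a fixed interval. Return the entry detected closest to the time
    the evaluation information was input (before or after it)."""
    return min(detections, key=lambda d: abs(d[0] - input_time))

# Detection information received every 10 seconds; evaluation input at t=12.5.
detections = [(0.0, "fpA"), (10.0, "fpB"), (20.0, "fpC")]
print(nearest_detection(detections, input_time=12.5))  # (10.0, 'fpB')
```

 The selected entry ("fpB") would then be compared against the content information to decide whether the evaluation is used.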
 本発明の一の形態は、さらに、前記選択手段は、前記受信手段により定期的に受信される前記検出情報のうち、前記評価情報の入力時刻から相対的に近い時刻に検出された映像又は音をそれぞれ示す2以上の検出情報と前記コンテンツ情報との比較結果に基づいて、前記選択を行うことを特徴とする。 In one aspect of the present invention, the selection means performs the selection based on a comparison between the content information and two or more pieces of detection information, among the detection information periodically received by the reception means, each indicating a video or sound detected at a time relatively close to the input time of the evaluation information.
 この発明によれば、ユーザが評価情報を入力した時刻に近い時刻に検出された映像又は音を示す2以上の検出情報が用いられる。そのため、評価情報を入力した時点においてユーザがコンテンツを観ているか否か又は聴いているか否かの推定精度を更に高めることができる。 According to this aspect, two or more pieces of detection information indicating video or sound detected at times close to the time at which the user input the evaluation information are used, which further increases the accuracy of estimating whether the user was watching or listening to the content when the evaluation information was input.
 本発明の一の形態は、さらに、前記検出手段及び前記送信手段は、前記評価情報が入力されたときに前記映像又は前記音を検出して、前記評価情報とともに前記検出情報をサーバ装置へ送信し、前記選択手段は、前記コンテンツ情報が、前記検出情報と一致すると判定される部分を有する場合、前記検出情報とともに受信された前記評価情報を、前記コンテンツの評価に用いるよう選択することを特徴とする。 In one aspect of the present invention, the detection means and the transmission means detect the video or the sound when the evaluation information is input and transmit the detection information to the server device together with the evaluation information, and the selection means selects the evaluation information received together with the detection information to be used for evaluating the content when the content information has a portion determined to match the detection information.
 この発明によれば、サーバ装置は、ユーザが評価情報を入力した時点で検出された映像又は音を示す検出情報を用いて、その評価情報をコンテンツの評価に用いるか否かを判定する。そのため、評価情報を入力した時点においてユーザがコンテンツを観ているか否か又は聴いているか否かの推定精度を高めることができる。 According to this aspect, the server device uses the detection information indicating the video or sound detected at the moment the user input the evaluation information to determine whether to use that evaluation information for evaluating the content. This increases the accuracy of estimating whether the user was watching or listening to the content at the time the evaluation information was input.
 本発明の一の形態は、さらに、前記選択手段は、前記コンテンツ情報のうち、前記検出情報と一致すると判定された前記部分により示される映像又は音の前記出力装置による出力時刻を、前記検出情報とともに受信された前記評価情報の入力時刻として特定することを特徴とする。 In one aspect of the present invention, the selection means identifies the time at which the output device output the video or sound indicated by the portion of the content information determined to match the detection information, as the input time of the evaluation information received together with the detection information.
 この発明によれば、サーバ装置は、ユーザが評価情報を入力した時点で検出された映像又は音と一致する、コンテンツにおける映像又は音の出力時刻を、評価情報の入力時刻として特定する。そのためコンテンツの何れの場面に対して入力された評価情報であるかを適切に特定することが可能であるので、コンテンツに対して適切な評価を行うことができる。 According to this aspect, the server device identifies, as the input time of the evaluation information, the output time of the video or sound in the content that matches the video or sound detected when the user input the evaluation information. This makes it possible to correctly determine which scene of the content the evaluation information was input for, so the content can be evaluated appropriately.
 本発明の一の形態は、さらに、前記取得手段は、前記コンテンツを構成する前記映像又は前記音の特徴を示す特徴情報の時系列で構成される特徴情報時系列データから変換された特定情報の時系列で構成される特定情報時系列データであって、各前記特定情報に基づいて対応する前記特徴情報が特定可能であり、且つ各前記特定情報の情報量は、対応する前記特徴情報の情報量よりも少ない特定情報時系列データを、前記コンテンツ情報として取得し、前記端末装置は、前記特徴情報時系列データを事前に取得する特徴情報時系列データ取得手段と、前記検出された映像又は音の特徴を示す特徴情報を抽出する抽出手段と、前記取得された特徴情報時系列データに含まれる前記特徴情報のうち、前記抽出された特徴情報との間の一致度が所定値を超える特徴情報を特定する特定情報であって、該特徴情報の情報量よりも少ない情報量の特定情報を生成する生成手段と、を更に備え、前記送信手段は、前記生成された特定情報を、前記検出情報として送信し、前記選択手段は、前記特徴情報時系列データに含まれる前記特定情報のうち、前記検出情報として受信された前記特定情報と一致する特定情報に対応する前記出力時刻を、前記入力時刻として特定することを特徴とする。 In one aspect of the present invention, the acquisition means acquires, as the content information, specific-information time-series data converted from feature-information time-series data composed of a time series of feature information indicating features of the video or the sound constituting the content, the specific-information time-series data being composed of a time series of specific information, wherein the corresponding feature information can be identified based on each piece of specific information, and the information amount of each piece of specific information is smaller than that of the corresponding feature information. The terminal device further comprises: feature-information time-series data acquisition means for acquiring the feature-information time-series data in advance; extraction means for extracting feature information indicating a feature of the detected video or sound; and generation means for generating specific information that identifies, among the feature information included in the acquired feature-information time-series data, feature information whose degree of match with the extracted feature information exceeds a predetermined value, the specific information having an information amount smaller than that of the feature information. The transmission means transmits the generated specific information as the detection information, and the selection means identifies, as the input time, the output time corresponding to the specific information, among the specific information included in the specific-information time-series data, that matches the specific information received as the detection information.
 この発明によれば、端末装置が、コンテンツの出力前に取得しておいたそのコンテンツの特徴情報時系列データと、検出された映像又は音から抽出された特徴情報とを比較する。端末装置は、特徴情報時系列データの中に、抽出された特徴情報との間の一致度が所定値を超える特徴情報が存在する場合、この特徴情報から情報量がより少ない特定情報を生成して、この特定情報を検出情報としてサーバ装置へ送信する。サーバ装置は、特徴情報時系列データから変換された特定情報時系列データと、端末装置から受信された特定情報を比較する。サーバ装置は、特定情報時系列データのうち、受信された特定情報と一致する特定情報に対応する映像又は音の出力装置による出力時刻を、評価情報の入力時刻として取得する。従って、検出情報の情報量が削減されるので、端末装置及びサーバ装置の通信負荷を削減することができる。 According to this aspect, the terminal device compares the feature-information time-series data of the content, acquired before the content is output, with the feature information extracted from the detected video or sound. When the feature-information time-series data contains feature information whose degree of match with the extracted feature information exceeds a predetermined value, the terminal device generates, from that feature information, specific information with a smaller information amount and transmits this specific information to the server device as the detection information. The server device compares the specific-information time-series data converted from the feature-information time-series data with the specific information received from the terminal device, and acquires, as the input time of the evaluation information, the time at which the output device output the video or sound corresponding to the specific information, within the specific-information time-series data, that matches the received specific information. Since the information amount of the detection information is reduced, the communication load on the terminal device and the server device can be reduced.
 本発明の一の形態は、さらに、前記特定情報は、前記特徴情報のハッシュ値であることを特徴とする。 In one aspect of the present invention, the specific information is a hash value of the feature information.
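 A minimal sketch of the hash-based scheme in the preceding paragraphs, with plain strings standing in for the (much larger) feature information and `hashlib` producing the low-volume specific information; the actual audio/video feature extraction and fuzzy matching are outside this illustration:

```python
import hashlib

def to_specific(feature):
    """Hypothetical 'specific information': a short hash of the
    (much larger) feature information."""
    return hashlib.sha256(feature.encode()).hexdigest()[:8]

# Server side: specific-information time-series data converted in advance
# from the feature-information time series of (output_time, feature) pairs.
feature_series = [(0.0, "feat-A"), (5.0, "feat-B"), (10.0, "feat-C")]
specific_series = [(t, to_specific(f)) for t, f in feature_series]

def input_time_for(received_specific):
    """Output time of the matching specific information, treated as the
    input time of the evaluation information sent along with it."""
    for t, s in specific_series:
        if s == received_specific:
            return t
    return None  # no match: the evaluation is not tied to the content

# Terminal side: the detected sound best matched "feat-B" in the
# pre-fetched feature series, so only its small hash is transmitted.
print(input_time_for(to_specific("feat-B")))  # 5.0
```

 Only the 8-character hash crosses the network instead of the full feature information, which is the communication-load reduction the publication describes.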
 本発明の一の形態は、映像及び音の少なくとも何れか一方で構成されるコンテンツを構成する、前記映像又は前記音を示すコンテンツ情報を取得する取得手段と、前記コンテンツに対する評価情報がユーザにより入力され、且つ、前記コンテンツを構成する、前記映像及び前記音の少なくとも何れか一方を出力する出力装置により出力された映像又は音を検出する端末装置から、前記評価情報と、前記検出された映像又は音を示す検出情報と、を受信する受信手段と、前記取得されたコンテンツ情報と前記受信された検出情報とを比較する比較手段と、前記比較手段による比較結果により、前記コンテンツ情報と前記検出情報との間に所定の一致がある場合、前記受信された評価情報を前記コンテンツの評価に用いるよう選択する選択手段と、を備えることを特徴とする。 One aspect of the present invention comprises: acquisition means for acquiring content information indicating the video or the sound constituting content composed of at least one of video and sound; reception means for receiving, from a terminal device into which a user inputs evaluation information for the content and which detects the video or sound output by an output device that outputs at least one of the video and the sound constituting the content, the evaluation information and detection information indicating the detected video or sound; comparison means for comparing the acquired content information with the received detection information; and selection means for selecting the received evaluation information to be used for evaluating the content when the comparison result by the comparison means shows a predetermined match between the content information and the detection information.
 本発明の一の形態は、映像及び音の少なくとも何れか一方で構成されるコンテンツに対する評価情報がユーザにより入力される入力手段と、前記コンテンツを構成する、前記映像及び前記音の少なくとも何れか一方を出力する出力装置により出力された映像又は音を検出する検出手段と、前記入力された評価情報と、前記検出された映像又は音を示す検出情報とを、前記コンテンツを構成する、前記映像及び前記音の少なくとも何れか一方を示すコンテンツ情報と、前記検出情報と、の比較結果により、前記コンテンツ情報と前記検出情報との間に所定の一致がある場合、前記評価情報を前記コンテンツの評価に用いるよう選択するサーバ装置へ送信する送信手段と、を備えることを特徴とする。 One aspect of the present invention comprises: input means with which a user inputs evaluation information for content composed of at least one of video and sound; detection means for detecting the video or sound output by an output device that outputs at least one of the video and the sound constituting the content; and transmission means for transmitting the input evaluation information and detection information indicating the detected video or sound to a server device that selects the evaluation information to be used for evaluating the content when a comparison result between the detection information and content information indicating at least one of the video and the sound constituting the content shows a predetermined match between the content information and the detection information.
 本発明の一の形態は、端末装置と、前記端末装置とネットワークを介して接続されるサーバ装置と、を含む評価システムにおける情報処理方法において、前記サーバ装置が、映像及び音の少なくとも何れか一方で構成されるコンテンツを構成する、前記映像又は前記音を示すコンテンツ情報を取得する取得ステップと、前記端末装置が、前記端末装置が備える入力手段にユーザにより入力された、前記コンテンツに対する評価情報を取得する評価情報取得ステップと、前記端末装置が、前記コンテンツを構成する、前記映像及び前記音の少なくとも何れか一方を出力する出力装置により出力された映像又は音を検出する検出ステップと、前記端末装置が、前記取得された評価情報と、前記検出された映像又は音を示す検出情報と、を前記サーバ装置へ送信する送信ステップと、前記サーバ装置が、前記端末装置から前記評価情報及び前記検出情報を受信する受信ステップと、前記サーバ装置が、前記取得されたコンテンツ情報と前記受信された検出情報とを比較する比較ステップと、前記サーバ装置が、前記比較ステップによる比較結果により、前記コンテンツ情報と前記検出情報との間に所定の一致がある場合、前記受信された評価情報を前記コンテンツの評価に用いるよう選択する選択ステップと、を含むことを特徴とする。 One aspect of the present invention is an information processing method in an evaluation system including a terminal device and a server device connected to the terminal device via a network, the method including: an acquisition step in which the server device acquires content information indicating the video or the sound constituting content composed of at least one of video and sound; an evaluation information acquisition step in which the terminal device acquires evaluation information for the content input by a user to input means provided in the terminal device; a detection step in which the terminal device detects the video or sound output by an output device that outputs at least one of the video and the sound constituting the content; a transmission step in which the terminal device transmits the acquired evaluation information and detection information indicating the detected video or sound to the server device; a reception step in which the server device receives the evaluation information and the detection information from the terminal device; a comparison step in which the server device compares the acquired content information with the received detection information; and a selection step in which the server device selects the received evaluation information to be used for evaluating the content when the comparison result of the comparison step shows a predetermined match between the content information and the detection information.
 本発明の一の形態は、サーバ装置のコンピュータにより実行される情報処理方法において、映像及び音の少なくとも何れか一方で構成されるコンテンツを構成する、前記映像又は前記音を示すコンテンツ情報を取得する取得ステップと、前記コンテンツに対する評価情報がユーザにより入力され、且つ、前記コンテンツを構成する、前記映像及び前記音の少なくとも何れか一方を出力する出力装置により出力された映像又は音を検出する端末装置から、前記評価情報と、前記検出された映像又は音を示す検出情報と、を受信する受信ステップと、前記取得されたコンテンツ情報と前記受信された検出情報とを比較する比較ステップと、前記比較ステップによる比較結果により、前記コンテンツ情報と前記検出情報との間に所定の一致がある場合、前記受信された評価情報を前記コンテンツの評価に用いるよう選択する選択ステップと、を含むことを特徴とする。 One aspect of the present invention is an information processing method executed by a computer of a server device, the method including: an acquisition step of acquiring content information indicating the video or the sound constituting content composed of at least one of video and sound; a reception step of receiving, from a terminal device into which a user inputs evaluation information for the content and which detects the video or sound output by an output device that outputs at least one of the video and the sound constituting the content, the evaluation information and detection information indicating the detected video or sound; a comparison step of comparing the acquired content information with the received detection information; and a selection step of selecting the received evaluation information to be used for evaluating the content when the comparison result of the comparison step shows a predetermined match between the content information and the detection information.
 本発明の一の形態は、端末装置のコンピュータにより実行される情報処理方法において、前記端末装置が備える入力手段にユーザにより入力された、映像及び音の少なくとも何れか一方で構成されるコンテンツに対する評価情報を取得する評価情報取得ステップと、前記コンテンツを構成する、前記映像及び前記音の少なくとも何れか一方を出力する出力装置により出力された映像又は音を検出する検出ステップと、前記取得された評価情報と、前記検出された映像又は音を示す検出情報とを、前記コンテンツを構成する、前記映像及び前記音の少なくとも何れか一方を示すコンテンツ情報と、前記検出情報と、の比較結果により、前記コンテンツ情報と前記検出情報との間に所定の一致がある場合、前記評価情報を前記コンテンツの評価に用いるよう選択するサーバ装置へ送信する送信ステップと、を含むことを特徴とする。 One aspect of the present invention is an information processing method executed by a computer of a terminal device, the method including: an evaluation information acquisition step of acquiring evaluation information, input by a user to input means provided in the terminal device, for content composed of at least one of video and sound; a detection step of detecting the video or sound output by an output device that outputs at least one of the video and the sound constituting the content; and a transmission step of transmitting the acquired evaluation information and detection information indicating the detected video or sound to a server device that selects the evaluation information to be used for evaluating the content when a comparison result between the detection information and content information indicating at least one of the video and the sound constituting the content shows a predetermined match between the content information and the detection information.
 本発明の一の形態は、サーバ装置のコンピュータを、映像及び音の少なくとも何れか一方で構成されるコンテンツを構成する、前記映像又は前記音を示すコンテンツ情報を取得する取得手段と、前記コンテンツに対する評価情報がユーザにより入力され、且つ、前記コンテンツを構成する、前記映像及び前記音の少なくとも何れか一方を出力する出力装置により出力された映像又は音を検出する端末装置から、前記評価情報と、前記検出された映像又は音を示す検出情報と、を受信する受信手段と、前記取得されたコンテンツ情報と前記受信された検出情報とを比較する比較手段と、前記比較手段による比較結果により、前記コンテンツ情報と前記検出情報との間に所定の一致がある場合、前記受信された評価情報を前記コンテンツの評価に用いるよう選択する選択手段と、として機能させることを特徴とする。 One aspect of the present invention causes a computer of a server device to function as: acquisition means for acquiring content information indicating the video or the sound constituting content composed of at least one of video and sound; reception means for receiving, from a terminal device into which a user inputs evaluation information for the content and which detects the video or sound output by an output device that outputs at least one of the video and the sound constituting the content, the evaluation information and detection information indicating the detected video or sound; comparison means for comparing the acquired content information with the received detection information; and selection means for selecting the received evaluation information to be used for evaluating the content when the comparison result by the comparison means shows a predetermined match between the content information and the detection information.
 本発明の一の形態は、端末装置のコンピュータを、前記端末装置が備える入力手段にユーザにより入力された、映像及び音の少なくとも何れか一方で構成されるコンテンツに対する評価情報を取得する評価情報取得手段と、前記コンテンツを構成する、前記映像及び前記音の少なくとも何れか一方を出力する出力装置により出力された映像又は音を検出する検出手段から、前記検出された映像又は音を示す検出情報を取得する検出情報取得手段と、前記取得された評価情報と、前記取得された検出情報とを、前記コンテンツを構成する、前記映像及び前記音の少なくとも何れか一方を示すコンテンツ情報と、前記検出情報と、の比較結果により、前記コンテンツ情報と前記検出情報との間に所定の一致がある場合、前記評価情報を前記コンテンツの評価に用いるよう選択するサーバ装置へ送信する送信手段と、として機能させることを特徴とする。 One aspect of the present invention causes a computer of a terminal device to function as: evaluation information acquisition means for acquiring evaluation information, input by a user to input means provided in the terminal device, for content composed of at least one of video and sound; detection information acquisition means for acquiring detection information indicating the detected video or sound from detection means that detects the video or sound output by an output device that outputs at least one of the video and the sound constituting the content; and transmission means for transmitting the acquired evaluation information and the acquired detection information to a server device that selects the evaluation information to be used for evaluating the content when a comparison result between the detection information and content information indicating at least one of the video and the sound constituting the content shows a predetermined match between the content information and the detection information.
 本発明によれば、汎用の端末装置を用いてコンテンツを評価する場合においても、コンテンツに対する評価情報の信頼性を確保することができる。 According to the present invention, the reliability of evaluation information for content can be ensured even when the content is evaluated using a general-purpose terminal device.
一実施形態に係る番組評価システムSの概要構成の一例を示す図である。A diagram showing an example of the schematic configuration of a program evaluation system S according to an embodiment.
一実施形態に係るサーバ1の概要構成の一例を示すブロック図である。A block diagram showing an example of the schematic configuration of a server 1 according to an embodiment.
一実施形態に係るサーバ1のシステム制御部11の機能ブロックの一例を示す図である。A diagram showing an example of the functional blocks of a system control unit 11 of the server 1 according to an embodiment.
一実施形態に係るユーザ端末2の概要構成の一例を示すブロック図である。A block diagram showing an example of the schematic configuration of a user terminal 2 according to an embodiment.
一実施形態に係るユーザ端末2のシステム制御部21の機能ブロックの一例を示す図である。A diagram showing an example of the functional blocks of a system control unit 21 of the user terminal 2 according to an embodiment.
番組評価システムSにおける処理概要の一例を示す図である。A diagram showing an example of an overview of the processing in the program evaluation system S.
評価情報を番組の評価に用いるか否かを決定する方法の一例を示す図である。A diagram showing an example of a method of determining whether to use evaluation information for evaluating a program.
番組に対する評価情報を入力するための入力画面の一例を示す図である。A diagram showing an example of an input screen for inputting evaluation information for a program.
複数のユーザ端末2による情報の送信タイミングの一例を示す図である。A diagram showing an example of information transmission timings of a plurality of user terminals 2.
生成されたレポートの一例を示す図である。A diagram showing an example of a generated report.
ユーザ端末2のシステム制御部21により実行される端末処理の一例を示すフローチャートである。A flowchart showing an example of terminal processing executed by the system control unit 21 of the user terminal 2.
サーバ1のシステム制御部11により実行されるサーバ処理の一例を示すフローチャートである。A flowchart showing an example of server processing executed by the system control unit 11 of the server 1.
番組評価システムSにおける処理概要の一例を示す図である。A diagram showing an example of an overview of the processing in the program evaluation system S.
ユーザ端末2のシステム制御部21により実行される端末処理の一例を示すフローチャートである。A flowchart showing an example of terminal processing executed by the system control unit 21 of the user terminal 2.
サーバ1のシステム制御部11により実行されるサーバ処理の一例を示すフローチャートである。A flowchart showing an example of server processing executed by the system control unit 11 of the server 1.
 以下、図面を参照して本発明の実施形態について詳細に説明する。本発明において評価対象となり得るコンテンツは、映像(特に動画)及び音の少なくとも何れか一方で構成される。コンテンツの配信形態の例としては、放送、オンデマンド配信等が挙げられる。放送形態の例として、地上デジタルテレビ放送、衛星放送、ケーブルテレビ、ラジオ放送、インターネット放送等が挙げられる。オンデマンド配信の形態の例として、衛星放送、ケーブルテレビ、インターネット等が挙げられる。以下に説明する実施形態は、地上デジタルテレビ放送における番組を評価するためのシステムに対して本発明を適用した場合の実施形態である。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. Content that can be evaluated in the present invention is composed of at least one of video (in particular, moving images) and sound. Examples of content distribution forms include broadcasting and on-demand distribution. Examples of broadcasting forms include terrestrial digital television broadcasting, satellite broadcasting, cable television, radio broadcasting, and Internet broadcasting. Examples of on-demand distribution forms include satellite broadcasting, cable television, and the Internet. The embodiment described below applies the present invention to a system for evaluating programs in terrestrial digital television broadcasting.
[1.第1実施形態] [1. First Embodiment]
[1-1.番組評価システムの構成] [1-1. Configuration of Program Evaluation System]
 先ず、本実施形態に係る番組評価システムSの構成について、図1を用いて説明する。図1は、本実施形態に係る番組評価システムSの概要構成の一例を示す図である。 First, the configuration of the program evaluation system S according to the present embodiment will be described with reference to FIG. FIG. 1 is a diagram illustrating an example of a schematic configuration of a program evaluation system S according to the present embodiment.
 図1に示すように、番組評価システムSは、サーバ1と、1又は複数のユーザ端末2と、を含んで構成されている。サーバ1と各ユーザ端末2とは、ネットワークNWを介して、例えば、通信プロトコルにTCP/IP等を用いて相互にデータの送受信が可能になっている。なお、ネットワークNWは、例えば、インターネット、専用通信回線(例えば、CATV(Community Antenna Television)回線)、移動体通信網(基地局等を含む)、及びゲートウェイ等により構築されている。 As shown in FIG. 1, the program evaluation system S includes a server 1 and one or more user terminals 2. The server 1 and each user terminal 2 can exchange data with each other via a network NW, using, for example, TCP/IP as the communication protocol. The network NW is built from, for example, the Internet, dedicated communication lines (for example, CATV (Community Antenna Television) lines), mobile communication networks (including base stations and the like), gateways, and the like.
 サーバ1は、放送局3により放送されるテレビ番組に対するユーザの評価を集計又は分析するためのサーバ装置である。サーバ1は、各ユーザによる評価の信頼性を判定するために、評価対象の番組の音を示す情報を、番組音情報として、その番組の放送中に又は放送前に取得する。評価対象の番組の放送中に、サーバ1は、各ユーザ端末2から、ユーザによる評価を示す評価情報を受信する。また、サーバ1は、各ユーザ端末2から、ユーザ端末2により検出された音を示す音声データを、検出音情報として受信する。そして、サーバ1は、検出音情報に基づいて、ユーザ端末2からの評価情報を、番組の評価に用いるか否かを判定する。 The server 1 is a server device for aggregating or analyzing users' evaluations of television programs broadcast by a broadcasting station 3. To determine the reliability of each user's evaluation, the server 1 acquires information indicating the sound of a program to be evaluated, as program sound information, during or before the broadcast of the program. During the broadcast of the program to be evaluated, the server 1 receives, from each user terminal 2, evaluation information indicating the user's evaluation. The server 1 also receives, from each user terminal 2, audio data indicating the sound detected by the user terminal 2 as detected sound information. The server 1 then determines, based on the detected sound information, whether to use the evaluation information from the user terminal 2 for evaluating the program.
 各ユーザ端末2は、番組評価システムSに会員登録しているユーザにより利用される。各ユーザが、それぞれのテレビ受像機4にて評価対象番組を視聴しているとき、そのユーザが利用するユーザ端末2は、テレビ受像機4から出力される音を検出して、検出音情報をサーバ1へ送信する。また、各ユーザ端末2は、ユーザにより番組に対する評価が入力されると、入力された評価を示す評価情報をサーバ1へ送信する。番組の視聴中、単位時間当たりにユーザが評価情報を入力することができる回数に特に上限は設けられていない。例えば、ユーザは数秒間隔又は数百ミリ秒間隔で評価情報を入力してもよい。ユーザ端末2は、携帯可能なコンピュータであることが望ましいが、据え置き型のコンピュータであってもよい。ユーザ端末2の例として、スマートフォン、タブレット式コンピュータ等の携帯情報端末、携帯電話機、PDA(Personal Digital Assistant)、パーソナルコンピュータ等が挙げられる。 Each user terminal 2 is used by a user who has registered as a member of the program evaluation system S. While a user is watching a program to be evaluated on his or her television receiver 4, the user terminal 2 used by that user detects the sound output from the television receiver 4 and transmits detected sound information to the server 1. When the user inputs an evaluation of the program, the user terminal 2 transmits evaluation information indicating the input evaluation to the server 1. No particular upper limit is placed on the number of times a user can input evaluation information per unit time while viewing a program; for example, a user may input evaluation information at intervals of a few seconds or a few hundred milliseconds. The user terminal 2 is preferably a portable computer, but may be a stationary computer. Examples of the user terminal 2 include portable information terminals such as smartphones and tablet computers, mobile phones, PDAs (Personal Digital Assistants), and personal computers.
[1-2.サーバの構成] [1-2. Server configuration]
 次に、サーバ1の構成について、図2Aを用いて説明する。図2Aは、本実施形態に係るサーバ1の概要構成の一例を示すブロック図である。図2Aに示すように、サーバ1は、システム制御部11と、システムバス12と、入出力インターフェース13と、記憶部14と、通信部15と、を備えている。システム制御部11と入出力インターフェース13とは、システムバス12を介して接続されている。 Next, the configuration of the server 1 will be described with reference to FIG. 2A. FIG. 2A is a block diagram illustrating an example of a schematic configuration of the server 1 according to the present embodiment. As shown in FIG. 2A, the server 1 includes a system control unit 11, a system bus 12, an input / output interface 13, a storage unit 14, and a communication unit 15. The system control unit 11 and the input / output interface 13 are connected via a system bus 12.
 システム制御部11は、CPU(Central Processing Unit)11a、ROM(Read Only Memory)11b、RAM(Random Access Memory)11c等により構成されている。CPU11aは、プロセッサの一例である。なお、本発明は、CPUと異なる様々なプロセッサに対しても適用可能である。記憶部14、ROM11b及びRAM11cは、それぞれメモリの一例である。なお、本発明は、ハードディスク、ROM及びRAMと異なる様々なメモリに対しても適用可能である。 The system control unit 11 includes a CPU (Central Processing Unit) 11a, a ROM (Read Only Memory) 11b, a RAM (Random Access Memory) 11c, and the like. The CPU 11a is an example of a processor. The present invention can be applied to various processors different from the CPU. The storage unit 14, the ROM 11b, and the RAM 11c are each an example of a memory. The present invention is applicable to various memories different from a hard disk, a ROM, and a RAM.
 入出力インターフェース13は、記憶部14及び通信部15とシステム制御部11との間のインターフェース処理を行う。 The input / output interface 13 performs an interface process between the storage unit 14 and the communication unit 15 and the system control unit 11.
 記憶部14は、例えば、ハードディスクドライブ等により構成されている。この記憶部14には、放送局3から取得された番組音情報が記憶される。また、記憶部14には、各ユーザ端末2から受信された評価情報が記憶される。また、記憶部14には、ユーザデータベースが記憶されている。ユーザデータベースには、番組評価システムSに会員登録されているユーザに関する情報が記憶される。例えば、ユーザデータベースには、ユーザID、氏名、生年月日、性別、職業等のユーザの属性が、ユーザごとに関連付けて記憶される。ユーザIDは、ユーザを識別する情報である。更に、記憶部14には、オペレーティングシステム、サーバプログラム等の各種プログラムが記憶されている。サーバプログラムは、番組情報の取得、評価情報及び検出音情報の受信、評価情報を利用するか否かの判定等を実行するためのプログラムである。サーバプログラムは、例えば、他のサーバ装置等からネットワークNWを介して取得されるようにしてもよいし、磁気テープ、光ディスク、メモリカード等の記録媒体に記録されてドライブ装置を介して読み込まれるようにしてもよい。 The storage unit 14 is composed of, for example, a hard disk drive. The storage unit 14 stores the program sound information acquired from the broadcasting station 3 and the evaluation information received from each user terminal 2. The storage unit 14 also stores a user database, which stores information on the users registered as members of the program evaluation system S. For example, the user database stores user attributes such as a user ID, name, date of birth, gender, and occupation in association with each user; the user ID is information identifying a user. The storage unit 14 further stores various programs such as an operating system and a server program. The server program is a program for acquiring program information, receiving evaluation information and detected sound information, determining whether to use the evaluation information, and so on. The server program may, for example, be acquired from another server device or the like via the network NW, or may be recorded on a recording medium such as a magnetic tape, an optical disk, or a memory card and read via a drive device.
 通信部15は、ネットワークNWに接続して、各ユーザ端末2との間の通信状態を制御する。 The communication unit 15 connects to the network NW and controls a communication state with each user terminal 2.
[1-3.ユーザ端末の構成] [1-3. Configuration of User Terminal]
 次に、ユーザ端末2の構成について、図3Aを用いて説明する。図3Aは、本実施形態に係るユーザ端末2の概要構成の一例を示すブロック図である。図3Aに示すように、ユーザ端末2は、システム制御部21と、システムバス22と、入出力インターフェース23と、記憶部24と、通信部25と、操作入力部26と、表示部27と、マイク28と、カメラ29とを備えている。システム制御部21と入出力インターフェース23とは、システムバス22を介して接続されている。 Next, the configuration of the user terminal 2 will be described with reference to FIG. 3A. FIG. 3A is a block diagram illustrating an example of a schematic configuration of the user terminal 2 according to the present embodiment. As shown in FIG. 3A, the user terminal 2 includes a system control unit 21, a system bus 22, an input / output interface 23, a storage unit 24, a communication unit 25, an operation input unit 26, a display unit 27, a microphone 28, and a camera 29. The system control unit 21 and the input / output interface 23 are connected via the system bus 22.
 システム制御部21は、CPU21a、ROM21b、RAM21c等により構成されている。 The system control unit 21 includes a CPU 21a, a ROM 21b, a RAM 21c, and the like.
 入出力インターフェース23は、記憶部24~カメラ29とシステム制御部21との間のインターフェース処理を行う。 The input / output interface 23 performs interface processing between the system control unit 21 and the components from the storage unit 24 through the camera 29.
 記憶部24は、例えば、フラッシュメモリ、ハードディスクドライブ等により構成されている。この記憶部24には、オペレーティングシステム、ウェブブラウザ、番組評価アプリケーション等の各種プログラムが記憶される。番組評価アプリケーションは、番組評価システムSを利用するための処理を行うためのプログラムである。番組評価アプリケーションは、例えば、サーバ1等のサーバ装置からネットワークNWを介して取得されるようにしてもよいし、光ディスク、メモリカード等の記録媒体に記録されてドライブ装置を介して読み込まれるようにしてもよい。なお、番組評価アプリケーションは、ウェブアプリケーションであってもよい。 The storage unit 24 includes, for example, a flash memory, a hard disk drive, and the like. The storage unit 24 stores various programs such as an operating system, a web browser, and a program evaluation application. The program evaluation application is a program for performing processing for using the program evaluation system S. The program evaluation application may be obtained from a server device such as the server 1 via the network NW, or may be recorded on a recording medium such as an optical disk or a memory card and read via a drive device. Note that the program evaluation application may be a web application.
 通信部25は、ネットワークNWに接続して、サーバ1との間の通信状態を制御する。 The communication unit 25 connects to the network NW and controls a communication state with the server 1.
 操作入力部26は、ユーザによる操作を受け付け、操作内容に対応する信号をシステム制御部21に出力する。操作入力部26の例として、タッチパネル、ボタン、スイッチ、キー、キーボード、マウス等が挙げられる。操作入力部26は、放送番組に対する評価情報がユーザにより入力される手段として機能する。 The operation input unit 26 receives an operation performed by the user, and outputs a signal corresponding to the operation content to the system control unit 21. Examples of the operation input unit 26 include a touch panel, a button, a switch, a key, a keyboard, a mouse, and the like. The operation input unit 26 functions as a unit for inputting evaluation information for a broadcast program by a user.
 表示部27は、システム制御部21の制御により、画像、文字等の情報を表示する。表示部27の例として、液晶ディスプレイ、有機EL(Electro Luminescence)ディスプレイ等が挙げられる。 The display unit 27 displays information such as images and characters under the control of the system control unit 21. Examples of the display unit 27 include a liquid crystal display and an organic EL (Electro Luminescence) display.
 マイク28は、音波を音声信号に変換して、音声信号をシステム制御部21へ出力するデバイスである。マイク28は、テレビ受像機4から出力された放送番組の音を検出する手段として機能する。 The microphone 28 is a device that converts a sound wave into an audio signal and outputs the audio signal to the system control unit 21. The microphone 28 functions as a unit that detects a sound of a broadcast program output from the television receiver 4.
 カメラ29は、静止画又は動画を撮影する。カメラ29は、例えばCCD(Charge Coupled Device)イメージセンサ等により構成されている。 The camera 29 captures a still image or a moving image. The camera 29 includes, for example, a CCD (Charge-Coupled Device) image sensor or the like.
[1-4.機能概要] [1-4. Functional overview]
 次に、図2B、図3B乃至図6を用いて、サーバ1のシステム制御部11、及びユーザ端末2のシステム制御部21の機能概要を説明する。 Next, an outline of functions of the system control unit 11 of the server 1 and the system control unit 21 of the user terminal 2 will be described with reference to FIGS. 2B and 3B to 6.
 図3Bは、本実施形態に係るユーザ端末2のシステム制御部21の機能ブロックの一例を示す図である。システム制御部21は、CPU21aが、番組評価アプリケーションに含まれる各種コードを読み出し実行することにより、図3Bに示すように、評価情報取得部211、検出情報取得部212、評価情報・検出情報送信部213等として機能する。 FIG. 3B is a diagram illustrating an example of functional blocks of the system control unit 21 of the user terminal 2 according to the present embodiment. As shown in FIG. 3B, when the CPU 21a reads and executes various code included in the program evaluation application, the system control unit 21 functions as an evaluation information acquisition unit 211, a detection information acquisition unit 212, an evaluation information / detection information transmission unit 213, and the like.
 図4Aは、番組評価システムSにおける処理概要の一例を示す図である。評価情報取得部211は、放送番組に対してユーザにより入力された評価情報を操作入力部26から取得する。図4Aに示すように、ユーザは、通常であればテレビ受像機4により評価対象の放送番組を視聴しながら、ユーザ端末2に評価情報を入力する。 FIG. 4A is a diagram showing an example of a processing outline in the program evaluation system S. The evaluation information acquisition unit 211 acquires, from the operation input unit 26, the evaluation information input by the user for the broadcast program. As shown in FIG. 4A, the user normally inputs evaluation information to the user terminal 2 while viewing a broadcast program to be evaluated on the television receiver 4.
 図5は、番組に対する評価情報を入力するための入力画面の一例を示す図である。番組評価アプリケーションを起動してユーザが番組調査の開始操作を行うことにより、評価情報取得部211は、ユーザ端末2の表示部27に、図5に示す入力画面を表示させる。評価情報取得部211は、例えば、評価対象の番組の放送時間帯にのみ入力画面が表示可能となるように、表示制御を行ってもよい。入力画面は、例えば評価ボタン101及び102、コメント入力領域103、コメント送信ボタン104等により構成されてもよい。評価ボタン101は、番組がつまらないとユーザが感じたときに押されるボタンである。評価ボタン101が押されると、評価情報取得部211は、「つまらない」を示す評価情報を取得する。評価ボタン102は、番組がおもしろいとユーザが感じたときに押されるボタンである。評価ボタン102が押されると、評価情報取得部211は、「おもしろい」を示す評価情報を取得する。コメント入力領域103は、番組に対するコメントを入力するためのボタンである。コメント送信ボタン104は、コメント入力領域103に入力されたコメントを送信するためのボタンである。コメント送信ボタン104が押されると、評価情報取得部211は、入力されたコメントの文字列を含む評価情報を取得する。なお、番組に対する評価は、「おもしろい」、「つまらない」及びコメントに限定されるものではない。例えば、画面に表示されたインジケータバーのタッチ操作により「面白さ○○%」を入力できるものであってもよい。つまり、評価の程度や傾向を可視的に入力できるものであってもよい。その場合、インジケータバーを中央付近として選択するならば「面白さ50%」として評価入力されることになる。その他様々な形態の評価が評価情報として入力可能である。 FIG. 5 is a diagram showing an example of an input screen for inputting evaluation information for a program. When the program evaluation application is activated and the user performs a program survey start operation, the evaluation information acquisition unit 211 causes the display unit 27 of the user terminal 2 to display the input screen shown in FIG. For example, the evaluation information acquisition unit 211 may perform display control so that the input screen can be displayed only in the broadcast time slot of the program to be evaluated. The input screen may include, for example, evaluation buttons 101 and 102, a comment input area 103, a comment transmission button 104, and the like. The evaluation button 101 is a button that is pressed when the user feels that the program is boring. When the evaluation button 101 is pressed, the evaluation information obtaining unit 211 obtains evaluation information indicating “boring”. The evaluation button 102 is a button that is pressed when the user feels that the program is interesting. When the evaluation button 102 is pressed, the evaluation information obtaining unit 211 obtains evaluation information indicating “interesting”. 
The comment input area 103 is an area for inputting a comment on the program. The comment transmission button 104 is a button for transmitting the comment input to the comment input area 103. When the comment transmission button 104 is pressed, the evaluation information acquisition unit 211 acquires evaluation information including the character string of the input comment. The evaluation of a program is not limited to “interesting”, “boring”, and comments. For example, it may be possible to input “interesting: XX%” by touching an indicator bar displayed on the screen. That is, the degree and tendency of the evaluation may be input visually. In this case, if the indicator bar is selected near the center, “interesting: 50%” is input as the evaluation. Various other forms of evaluation can be input as evaluation information.
 検出情報取得部212は、マイク28により検出された音を示す検出音情報を取得する。図4Aに示すように、ユーザが評価対象の放送番組を視聴しながら評価情報を入力する場合には、マイク28により番組の音が検出されることになる。検出情報取得部212は、例えばマイク28から出力された音声信号を変換することにより、検出音情報を生成してもよい。例えば、検出情報取得部212は、音声信号を解析することにより、音の波形の特徴を示す波形情報を、検出音情報として音声信号から抽出してもよい。例えば、検出情報取得部212は、複数の振幅帯域を定義し、所定のサンプリング間隔ごとに、音声信号の波形がサンプリングされる振幅帯域を特定してもよい。検出情報取得部212は、振幅帯域ごとの波形のサンプル数をカウントして、これらサンプル数の配列を波形情報として生成してもよい。或いは、検出情報取得部212は、音声信号を解析することにより、検出された音の特徴量を音声信号から抽出してもよい。例えば、離散フーリエ変換等を用いて特徴量が抽出される。 The detection information acquisition unit 212 acquires detected sound information indicating the sound detected by the microphone 28. As shown in FIG. 4A, when the user inputs the evaluation information while watching the broadcast program to be evaluated, the sound of the program is detected by the microphone 28. The detection information acquisition unit 212 may generate the detected sound information by, for example, converting the audio signal output from the microphone 28. For example, by analyzing the audio signal, the detection information acquisition unit 212 may extract, as the detected sound information, waveform information indicating the characteristics of the sound waveform from the audio signal. For example, the detection information acquisition unit 212 may define a plurality of amplitude bands and specify, at predetermined sampling intervals, the amplitude band in which the waveform of the audio signal is sampled. The detection information acquisition unit 212 may count the number of waveform samples for each amplitude band, and generate the array of these sample counts as the waveform information. Alternatively, the detection information acquisition unit 212 may extract a feature amount of the detected sound from the audio signal by analyzing the audio signal. For example, the feature amount is extracted using a discrete Fourier transform or the like.
The detection information acquisition unit 212 acquires this feature amount as detection sound information. Alternatively, an audio signal may be converted into audio data in a format such as MP3 (MPEG-1 Audio Layer-3) and acquired as detected sound information.
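The amplitude-band counting described above can be sketched as follows. This is a minimal illustration, not the claimed implementation: the number of bands, the sampling step, the normalized signal range, and the function name `waveform_info` are assumptions introduced here.

```python
# Minimal sketch of amplitude-band waveform information (assumed details):
# the signal is normalized to [-1.0, 1.0], divided into equal-width amplitude
# bands, and every `step`-th sample is assigned to the band containing it.

def waveform_info(samples, num_bands=4, step=2):
    """Count, per amplitude band, how many sampled points fall into it."""
    counts = [0] * num_bands
    band_width = 2.0 / num_bands
    for x in samples[::step]:
        # Map an amplitude in [-1, 1] to a band index in [0, num_bands - 1].
        band = min(int((x + 1.0) / band_width), num_bands - 1)
        counts[band] += 1
    return counts

# The resulting array of per-band sample counts serves as the waveform
# information compared later against the program sound information.
fingerprint = waveform_info([0.0, 0.5, -0.9, 0.9, 0.1, -0.2], num_bands=4, step=1)
```

Two signals with similar waveforms yield similar count arrays, which is sufficient for the coincidence check described later without any speech recognition.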
 検出情報取得部212は、マイク28による音検出のオン/オフを制御する。例えば、検出情報取得部212は、評価対象の番組の放送中、常時音検出をオンにさせて検出音情報を取得してもよい。しかしながら、処理負荷、消費電力、ネットワーク負荷等を考慮すると、所定のタイミングにのみ検出を行うことが望ましい。例えば、検出情報取得部212は、所定時間間隔を置いて繰り返しマイク28により音を検出させてもよい。音の検出間隔は、例えば1秒、5秒、10秒、30秒、1分等であってもよい。なお、音の検出間隔を短く設定するに従い、後述するコンテンツ評価をコンテンツの変化に応じてタイムリーに行うことができるようになり、コンテンツの作成や編集へのフィードバックを正確に行うことができるようになる。検出情報取得部212は、1回の音の検出ごとに、検出間隔よりも短い時間の間、マイク28による音の検出を継続させる。或いは、検出情報取得部212は、ユーザにより評価情報が入力されたタイミングで、マイク28により音を検出させてもよい。 The detection information acquisition unit 212 controls on / off of sound detection by the microphone 28. For example, the detection information acquisition unit 212 may acquire the detected sound information by keeping sound detection on at all times during the broadcast of the program to be evaluated. However, in consideration of processing load, power consumption, network load, and the like, it is desirable to perform detection only at predetermined timings. For example, the detection information acquisition unit 212 may cause the microphone 28 to repeatedly detect sound at predetermined time intervals. The sound detection interval may be, for example, 1 second, 5 seconds, 10 seconds, 30 seconds, 1 minute, or the like. Note that, as the sound detection interval is set shorter, the content evaluation described later can be performed in a more timely manner in accordance with changes in the content, and feedback to content creation and editing can be given more accurately. The detection information acquisition unit 212 causes the microphone 28 to continue detecting sound for a time shorter than the detection interval each time sound is detected. Alternatively, the detection information acquisition unit 212 may cause the microphone 28 to detect sound at the timing when the evaluation information is input by the user.
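The duty-cycled detection above (detect briefly, then stay off until the next interval) can be sketched as a simple schedule. The interval and capture-window lengths are illustrative assumptions, and `capture_schedule` is a hypothetical helper name.

```python
# Sketch of duty-cycled sound detection: the microphone is turned on once per
# detection interval and kept on for a capture window shorter than the
# interval, reducing processing load, power consumption, and network load.

def capture_schedule(broadcast_len, interval=30.0, window=5.0):
    """Return (on_time, off_time) pairs, in seconds from broadcast start,
    at which the microphone captures sound during the broadcast."""
    schedule = []
    t = 0.0
    while t < broadcast_len:
        schedule.append((t, min(t + window, broadcast_len)))
        t += interval
    return schedule
```

For a 60-second stretch with a 30-second interval and 5-second window, this yields captures at 0–5 s and 30–35 s.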
 評価情報・検出情報送信部213は、評価情報取得部211により取得された評価情報と、検出情報取得部212により取得された検出音情報を、サーバ1へ送信する。評価情報・検出情報送信部213は、例えば評価対象の番組の放送終了後に、評価情報及び検出音情報の少なくとも何れか一方をまとめて送信してもよい。一方で、評価情報・検出情報送信部213は、評価対象の番組の放送中に、評価情報及び検出音情報を送信してもよい。例えば、評価情報・検出情報送信部213は、評価情報が入力されたタイミングでこの評価情報を送信する一方で、検出音情報は所定時間間隔を置いて繰り返し送信してもよい。例えば、マイク28により所定時間間隔を置いて繰り返し音の検出が行われる場合、評価情報・検出情報送信部213は、検出が行われるたびに検出音情報を送信してもよい。 The evaluation information / detection information transmitting unit 213 transmits, to the server 1, the evaluation information acquired by the evaluation information acquisition unit 211 and the detected sound information acquired by the detection information acquisition unit 212. The evaluation information / detection information transmitting unit 213 may transmit at least one of the evaluation information and the detected sound information collectively, for example, after the broadcast of the program to be evaluated ends. On the other hand, the evaluation information / detection information transmitting unit 213 may transmit the evaluation information and the detected sound information while the program to be evaluated is being broadcast. For example, the evaluation information / detection information transmitting unit 213 may transmit the evaluation information at the timing when it is input, while transmitting the detected sound information repeatedly at predetermined time intervals. For example, when sound detection by the microphone 28 is repeated at predetermined time intervals, the evaluation information / detection information transmitting unit 213 may transmit the detected sound information every time the detection is performed.
By transmitting the detected sound information to the server 1 periodically, the server 1 can constantly grasp the user's viewing status of the program during the broadcast of the evaluation target program. When sound is detected by the microphone 28 at the timing when the evaluation information is input, the evaluation information / detection information transmitting unit 213 may repeatedly transmit the evaluation information together with the detected sound information at predetermined time intervals. In this case, the transmission of the evaluation information and the detected sound information is reserved until the periodic transmission timing comes after the input of the evaluation information. Alternatively, the evaluation information / detection information transmitting unit 213 may transmit the evaluation information together with the detection information at the timing when the evaluation information is input. Although details will be described later, the evaluation information / detection information transmitting unit 213 may transmit the detection sound information at a timing according to the timing information transmitted from the server 1.
 図2Bは、本実施形態に係るサーバ1のシステム制御部11の機能ブロックの一例を示す図である。システム制御部11は、CPU11aが、サーバプログラムに含まれる各種コードを読み出し実行することにより、図2Bに示すように、番組情報取得部111、評価情報・検出情報受信部112、比較部113、評価情報利用決定部114、評価部115等として機能する。 FIG. 2B is a diagram illustrating an example of functional blocks of the system control unit 11 of the server 1 according to the present embodiment. As shown in FIG. 2B, when the CPU 11a reads and executes various code included in the server program, the system control unit 11 functions as a program information acquisition unit 111, an evaluation information / detection information receiving unit 112, a comparing unit 113, an evaluation information use determining unit 114, an evaluation unit 115, and the like.
 番組情報取得部111は、評価対象の放送番組のコンテンツを構成する、映像及び音の少なくとも何れか一方を示すコンテンツ情報を取得する。本実施形態において、番組情報取得部111は、放送番組の音を示す番組音情報を、コンテンツ情報として取得する。例えば、評価対象の番組が事前に収録されている場合、サーバ1は、番組の放送開始前に、番組の音声データを、ネットワークNWを介して放送局3から受信してもよい。或いは、番組の音声データが記録媒体に記録されて、ドライブ装置を介してサーバ1にロードされてもよい。或いは、番組の放送中に、放送局3から発信された放送信号を図示せぬチューナが受信して、チューナにより放送信号から抽出された音声データをリアルタイムでサーバ1が取得してもよい。番組情報取得部111は、番組の音声データを番組音情報として記憶部14に記憶させてもよい。或いは、番組情報取得部111は、ユーザ端末2の検出情報取得部212と同様に、音声データから番組の音の波形情報又は特徴量を所定時間間隔で抽出してもよい。そして、番組情報取得部111は、波形情報又は特徴量の時系列で構成される時系列データを、番組音情報としてデータベース化してもよい。この番組音情報において各波形情報又は特徴量は、評価対象の番組においてこの波形情報又は特徴量に対応する音が放送される時刻と関連付けられてもよい。この放送時刻は、絶対的な時刻であってもよいし、放送開始時刻からの相対的な時刻であってもよい。 The program information acquisition unit 111 acquires content information indicating at least one of a video and a sound, which constitutes the content of the broadcast program to be evaluated. In the present embodiment, the program information acquisition unit 111 acquires program sound information indicating the sound of a broadcast program as content information. For example, when a program to be evaluated is recorded in advance, the server 1 may receive the audio data of the program from the broadcast station 3 via the network NW before the broadcast of the program starts. Alternatively, audio data of a program may be recorded on a recording medium and loaded into the server 1 via a drive device. Alternatively, a tuner (not shown) may receive a broadcast signal transmitted from the broadcast station 3 while a program is being broadcast, and the server 1 may acquire audio data extracted from the broadcast signal by the tuner in real time. The program information acquisition unit 111 may cause the storage unit 14 to store the audio data of the program as program sound information. 
Alternatively, the program information acquisition unit 111 may extract the waveform information or feature amount of the sound of the program from the audio data at predetermined time intervals, similarly to the detection information acquisition unit 212 of the user terminal 2. Then, the program information acquisition unit 111 may convert the time series data composed of the time series of the waveform information or the feature amount into a database as the program sound information. In the program sound information, each waveform information or feature amount may be associated with a time at which a sound corresponding to the waveform information or the feature amount is broadcasted in the program to be evaluated. This broadcast time may be an absolute time or a relative time from the broadcast start time.
 評価情報・検出情報受信部112は、各ユーザ端末2から、評価情報及び検出音情報を受信する。上述したように、評価情報と検出音情報はともに送信されてきてもよいし、別々のタイミングで送信されてきてもよい。 The evaluation information / detection information receiving unit 112 receives the evaluation information and the detected sound information from each user terminal 2. As described above, the evaluation information and the detected sound information may be transmitted together, or may be transmitted at different timings.
 ここで、複数のユーザ端末2から一斉に評価情報や検出音情報が送信されてくると、サーバ1の処理負荷や通信負荷、更にはネットワーク負荷が増大する可能性がある。そこで、評価情報・検出情報受信部112は、複数のユーザ端末2のうち少なくとも一のユーザ端末2による評価情報及び検出音情報の少なくとも何れか一方の送信タイミングが、複数のユーザ端末2のうち他の少なくとも一のユーザ端末2による評価情報及び検出音情報の少なくとも何れか一方の送信タイミングと相違するように、各ユーザ端末2について、評価情報及び検出音情報の少なくとも何れか一方の送信タイミングを決定してもよい。そして、サーバ1は、決定されたタイミングを示すタイミング情報を、複数のユーザ端末2それぞれに送信してもよい。例えば、評価情報・検出情報受信部112は、複数のユーザ端末2における少なくとも一のユーザ端末2による送信タイミングの間隔の間に、他の少なくとも一のユーザ端末2による送信タイミングを決定してもよい。これにより、単位時間当たりにサーバ装置が受信する検出情報の数が均一化される。この場合、サーバ1は、各ユーザ端末2により定期的に検出音情報を送信させることになる。 Here, if the evaluation information and the detected sound information are transmitted from a plurality of user terminals 2 all at once, the processing load and communication load of the server 1, and further the network load, may increase. Therefore, the evaluation information / detection information receiving unit 112 may determine, for each user terminal 2, the transmission timing of at least one of the evaluation information and the detected sound information such that the transmission timing for at least one of the plurality of user terminals 2 differs from the transmission timing for at least one other of the plurality of user terminals 2. Then, the server 1 may transmit timing information indicating the determined timing to each of the plurality of user terminals 2. For example, the evaluation information / detection information receiving unit 112 may set the transmission timing of at least one other user terminal 2 within the interval between the transmission timings of at least one user terminal 2. Thereby, the number of pieces of detected sound information received by the server 1 per unit time is equalized. In this case, the server 1 causes each user terminal 2 to periodically transmit the detected sound information.
 例えば、各ユーザ端末2による情報(評価情報及び検出音情報の少なくとも何れか一方)の送信周期をP秒とし、情報の送信タイミングをN個の送信タイミングに分散させるとする。また、番組放送開始からi番目の送信周期の開始時刻をTiとする。この場合、i番目の周期における情報の送信タイミングは、例えばTi+0秒、Ti+1×P/N秒、Ti+2×P/N秒、・・・Ti+(N-1)×P/N秒となる。0、1×P/N、2×P/N、・・・(N-1)×P/Nは、それぞれ開始時刻からのオフセットである。各ユーザ端末2は、例えば、番組の評価を開始するための操作がユーザにより行われたときに、サーバ1に対して通知を行ってもよい。評価情報・検出情報受信部112は、ユーザ端末2からの通知に応答して、複数の送信タイミングの中から一の送信タイミングを、その順序に従って循環的に又はランダムに決定してもよい。複数のユーザ端末2による送信タイミングが全体的として分散されればよいので、一部のユーザ端末2同士で送信タイミングが重なることは問題ない。評価情報・検出情報受信部112は、決定した送信タイミングを示すタイミング情報を、通知を送信してきたユーザ端末2へ送信する。ユーザ端末2の評価情報・検出情報送信部213は、タイミング情報に示されるタイミングに従って、情報を送信する。タイミング情報は、例えば送信周期の開始時刻(例えば毎分0秒等)、及びオフセット(例えば0秒、20秒、40秒等)等を含んでもよい。 For example, suppose that the transmission cycle of the information (at least one of the evaluation information and the detected sound information) by each user terminal 2 is P seconds, and the transmission timing of the information is distributed to N transmission timings. Also, the start time of the i-th transmission cycle from the start of program broadcasting is set to Ti. In this case, the information transmission timing in the i-th cycle is, for example, Ti + 0 seconds, Ti + 1 × P / N seconds, Ti + 2 × P / N seconds,... Ti + (N−1) × P / N seconds. 0, 1 × P / N, 2 × P / N,... (N−1) × P / N are offsets from the start time. For example, each user terminal 2 may notify the server 1 when an operation for starting evaluation of a program is performed by a user. In response to the notification from the user terminal 2, the evaluation information / detection information receiving unit 112 may determine one transmission timing from among a plurality of transmission timings cyclically or randomly according to the order. Since the transmission timings of the plurality of user terminals 2 need only be dispersed as a whole, there is no problem that the transmission timings of some of the user terminals 2 overlap. 
The evaluation information / detection information receiving unit 112 transmits timing information indicating the determined transmission timing to the user terminal 2 that has transmitted the notification. The evaluation information / detection information transmitting unit 213 of the user terminal 2 transmits information according to the timing indicated in the timing information. The timing information may include, for example, a start time of a transmission cycle (for example, 0 seconds per minute), an offset (for example, 0 seconds, 20 seconds, 40 seconds, and the like).
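The slot assignment described above (period P, N slots, offsets k × P / N, assigned cyclically as terminals report in) can be sketched as follows. The cyclic assignment and the function names are one possible implementation introduced here for illustration, not the claimed one.

```python
# Sketch of the staggered transmission schedule: with period P seconds and N
# slots, the k-th terminal to notify the server is cyclically assigned the
# offset (k mod N) * P / N within each period.

def assign_offset(terminal_index, period=60.0, num_slots=3):
    """Cyclically assign a send offset (seconds) to a terminal."""
    slot = terminal_index % num_slots
    return slot * period / num_slots

def send_times(terminal_index, start_time, cycles, period=60.0, num_slots=3):
    """Transmission times for the first `cycles` periods after `start_time`."""
    offset = assign_offset(terminal_index, period, num_slots)
    return [start_time + i * period + offset for i in range(cycles)]

# With P = 60 s and N = 3, successive terminals send 20 s apart, so the
# server receives a roughly uniform number of reports per unit time.
```

Overlapping assignments (e.g. terminals 0 and 3 sharing a slot) are harmless, since only the overall spread of timings matters.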
 図6は、複数のユーザ端末2による情報の送信タイミングの一例を示す図である。この例では、Pを60秒とし、Nを3としている。図6において、ユーザ端末2-1は、サーバ1から「毎分0秒時」を示すタイミング情報を受信する。ユーザ端末2-2は、サーバ1から「毎分0秒から20秒経過後」を示すタイミング情報を受信する。ユーザ端末2-3は、サーバ1から「毎分0秒から40秒経過後」を示すタイミング情報を受信する。時刻Tで或る送信周期が開始するとする。この場合、ユーザ端末2-1は、時刻T~T+20秒の間に情報を送信する。ユーザ端末2-2は、時刻T+20~T+40秒の間に情報を送信する。ユーザ端末2-3は、時刻T+40~T+60秒の間に情報を送信する。更に、ユーザ端末2-1は、時刻T+60~T+80秒の間に情報を送信する。ユーザ端末2-2は、時刻T+80~T+100秒の間に情報を送信する。ユーザ端末2-3は、時刻T+100~T+120秒の間に情報を送信する。 FIG. 6 is a diagram illustrating an example of information transmission timings by a plurality of user terminals 2. In this example, P is 60 seconds and N is 3. In FIG. 6, the user terminal 2-1 receives timing information indicating “0 seconds every minute” from the server 1. The user terminal 2-2 receives timing information indicating “after 20 seconds from 0 seconds every minute” from the server 1. The user terminal 2-3 receives timing information indicating “after 40 seconds from 0 seconds every minute” from the server 1. It is assumed that a certain transmission cycle starts at time T. In this case, the user terminal 2-1 transmits information during the period from time T to T + 20 seconds. The user terminal 2-2 transmits information between times T + 20 and T + 40 seconds. The user terminal 2-3 transmits information between times T + 40 and T + 60 seconds. Further, the user terminal 2-1 transmits information between times T + 60 and T + 80 seconds. The user terminal 2-2 transmits information between times T + 80 and T + 100 seconds. The user terminal 2-3 transmits information between times T + 100 and T + 120 seconds.
 比較部113は、番組情報取得部111により取得された番組音情報と、評価情報・検出情報受信部112により受信された検出音情報とを比較する。例えば番組音情報及び検出音情報が波形情報である場合、比較部113は、検出音情報に示される、ユーザ端末2により検出された音の波形情報又は特徴量と、番組音情報に示される番組の音の波形情報の時系列の各波形情報とを比較して、検出された音の波形情報と番組の音の波形情報との一致度を算出してもよい。例えば、振幅帯域ごとのサンプル数の一致及び不一致等に基づいて、一致度が算出されてもよい。比較部113は、番組音情報に示される波形情報の時系列のうち、ユーザ端末2において音が検出された時刻から前後所定時間内に放送される波形情報のみを、検出音情報と比較してもよい。番組音情報及び検出音情報が特徴量である場合も基本的には同様である。特徴量の場合の一致度は、例えばコサイン類似度等であってもよい。 The comparing unit 113 compares the program sound information acquired by the program information acquiring unit 111 with the detected sound information received by the evaluation information / detection information receiving unit 112. For example, when the program sound information and the detected sound information are waveform information, the comparing unit 113 may compare the waveform information of the sound detected by the user terminal 2, indicated by the detected sound information, with each piece of waveform information in the time series of the waveform information of the sound of the program indicated by the program sound information, and calculate the degree of coincidence between the waveform information of the detected sound and the waveform information of the sound of the program. For example, the degree of coincidence may be calculated based on the coincidence and non-coincidence of the number of samples for each amplitude band. The comparing unit 113 may compare, with the detected sound information, only the waveform information broadcast within a predetermined time before and after the time when the sound was detected by the user terminal 2, out of the time series of the waveform information indicated in the program sound information. The same basically applies when the program sound information and the detected sound information are feature amounts. The degree of coincidence in the case of feature amounts may be, for example, a cosine similarity.
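The comparison step above can be sketched as follows for the feature-amount case: cosine similarity between the detected-sound feature vector and each program feature vector whose broadcast time lies within a window around the detection time. The window width and the function names are illustrative assumptions.

```python
import math

# Sketch of the comparison by the comparing unit: cosine similarity between a
# detected-sound feature vector and each program feature vector broadcast
# within a window around the detection time (window width is an assumption).

def cosine_similarity(a, b):
    """Cosine similarity between two feature vectors; 0.0 if either is zero."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def best_match(detected, program_series, detect_time, window=10.0):
    """program_series: list of (broadcast_time, feature_vector) pairs.
    Return the highest similarity among program features broadcast within
    `window` seconds of the detection time."""
    candidates = [
        cosine_similarity(detected, feat)
        for t, feat in program_series
        if abs(t - detect_time) <= window
    ]
    return max(candidates, default=0.0)
```

Restricting the comparison to the time window keeps the search small and tolerates clock skew between the terminal and the broadcast.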
 評価対象の番組が複数存在し、それら複数の番組が同時間帯に放送される場合がある。この場合、例えばユーザが評価を行う番組を、ユーザ端末2を用いて選択可能なように、番組評価アプリケーションがプログラムされてもよい。比較部113は、選択された番組をユーザが視聴していると仮定し、選択された番組の番組音情報と、検出音情報とを比較する。或いは、比較部113は、複数の番組の番組音情報それぞれと、検出音情報とを比較して、一致度が最も高い番組を、ユーザが視聴している番組であると判定してもよい。 There may be a plurality of programs to be evaluated, and these programs may be broadcast in the same time slot. In this case, for example, the program evaluation application may be programmed so that the user can select the program to be evaluated using the user terminal 2. The comparing unit 113 compares the program sound information of the selected program with the detected sound information, assuming that the user is viewing the selected program. Alternatively, the comparing unit 113 may compare the program sound information of each of the plurality of programs with the detected sound information, and determine that the program with the highest degree of coincidence is the program being watched by the user.
 評価情報利用決定部114は、比較部113による比較結果により、番組音情報と検出音情報との間に所定の一致がある場合、評価情報・検出情報受信部112により受信された評価情報を、番組の評価に用いるよう選択する。例えば、評価情報利用決定部114は、図4Aに示すように、ユーザ端末2により検出された音と、番組の一部の音とが一致すると判定した場合には、評価情報を評価に用いると決定し、一致しないと判定した場合には、評価情報を評価に用いないと決定してもよい。ユーザ端末2により検出された音と、番組の一部の音とが一致する場合、ユーザは、評価対象の番組を視聴して評価を行っている可能性が高いので、評価情報の信頼性が高い。一方、それらが一致しない場合、ユーザは、評価対象の番組を視聴している可能性が低いので、評価情報の信頼性が低い。具体的に、評価情報利用決定部114は、番組音情報が、検出音情報と一致すると判定される部分を有するか否かを判定する。例えば番組音情報及び検出音情報が波形情報である場合、評価情報利用決定部114は、番組音情報に含まれる波形情報のうち、ユーザ端末2により検出された音の波形情報との一致度が所定の閾値を超える波形情報が存在するか否かを判定する。評価情報利用決定部114は、そのような波形情報が存在する場合、検出された音と番組の音とが一致すると判定してもよい。本実施形態においては、正確な音声認識は不要であり、音声波形の一致性の判定を行うことができれば、ユーザが実際に番組を視聴しているか否かの判定が可能である。番組音情報及び検出音情報が特徴量である場合も評価情報利用決定部114の処理は基本的には同様である。ユーザ端末2のマイク28により、番組の音とともに、ユーザ等の話し声や環境音等が検出される可能性がある。従って、閾値は、低めに設定されてもよい。この場合であっても、例えば所定数以上のタイミングでそれぞれ検出された音と番組の音との一致度を考慮することで、一致判定の正確度を高めることは可能である。例えば、評価情報利用決定部114は、全ての一致度が閾値を超える場合に、検出された音と番組の音とが一致すると判定してもよいし、一致度の平均値が閾値を超え且つ一致度の標準偏差が所定値未満である場合に、検出された音と番組の音とが一致すると判定してもよい。なお、ユーザ端末2のマイク28により、ユーザ等の話し声や環境音等が検出される状況においては、前処理を行いノイズとして除去してもよい。ユーザ端末2が、所定時間間隔を置いて繰り返して、マイク28による音の検出を実行して検出音情報を送信する場合、評価情報利用決定部114は、番組の音と検出された音との一致判定を所定時間間隔で実行することができる。この間隔を短くすることに従い、ユーザが評価情報を入力したときにユーザが番組を視聴しているか否かをリアルタイムに且つ的確に判定することができる。番組に対するユーザの興味は番組の進行に従って秒単位で変化し得るものであり、番組が面白くなければユーザはその番組の途中でもその番組の視聴を止め又は別の番組を視聴する。従って、番組に対する秒単位での評価情報は重要である。そのリアルタイムの評価情報の信頼性を確保することができる。 Based on the comparison result by the comparing unit 113, when there is a predetermined match between the program sound information and the detected sound information, the evaluation information use determining unit 114 selects the evaluation information received by the evaluation information / detection information receiving unit 112 to be used for evaluating the program. For example, as shown in FIG. 4A, when it is determined that the sound detected by the user terminal 2 matches a part of the sound of the program, the evaluation information use determining unit 114 may determine to use the evaluation information for the evaluation. 
When it is determined that they do not match, the evaluation information use determining unit 114 may determine not to use the evaluation information for the evaluation. If the sound detected by the user terminal 2 matches a part of the sound of the program, the user is highly likely to be viewing and evaluating the program to be evaluated, so the reliability of the evaluation information is high. On the other hand, if they do not match, it is unlikely that the user is viewing the program to be evaluated, so the reliability of the evaluation information is low. Specifically, the evaluation information use determining unit 114 determines whether or not the program sound information has a portion that is determined to match the detected sound information. For example, when the program sound information and the detected sound information are waveform information, the evaluation information use determining unit 114 determines whether or not, among the waveform information included in the program sound information, there exists waveform information whose degree of coincidence with the waveform information of the sound detected by the user terminal 2 exceeds a predetermined threshold. When such waveform information exists, the evaluation information use determining unit 114 may determine that the detected sound matches the sound of the program. In the present embodiment, accurate speech recognition is not required; as long as the consistency of the sound waveforms can be determined, it is possible to determine whether the user is actually watching the program. When the program sound information and the detected sound information are feature amounts, the processing of the evaluation information use determining unit 114 is basically the same. The microphone 28 of the user terminal 2 may detect the voice of the user, environmental sounds, and the like together with the sound of the program. Therefore, the threshold may be set lower. 
Even in this case, the accuracy of the match determination can be increased by, for example, considering the degrees of coincidence between the sounds detected at a predetermined number of timings or more and the sound of the program. For example, the evaluation information use determining unit 114 may determine that the detected sound matches the sound of the program when all the degrees of coincidence exceed the threshold, or when the average of the degrees of coincidence exceeds the threshold and the standard deviation of the degrees of coincidence is less than a predetermined value. Note that, in a situation where the microphone 28 of the user terminal 2 detects the voice of the user, environmental sounds, or the like, preprocessing may be performed to remove them as noise. When the user terminal 2 repeatedly performs sound detection with the microphone 28 at predetermined time intervals and transmits the detected sound information, the evaluation information use determining unit 114 can perform the match determination between the sound of the program and the detected sound at those intervals. As this interval is shortened, whether or not the user is watching the program when the user inputs the evaluation information can be determined accurately and in real time. A user's interest in a program may change by the second as the program progresses; if the program is not interesting, the user may stop watching it partway through or switch to another program. Therefore, evaluation information for a program on a per-second basis is important, and the reliability of such real-time evaluation information can be secured.
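The decision rule described above, combining the average and the standard deviation of the per-detection degrees of coincidence, can be sketched as follows. The threshold values and the function name `is_watching` are illustrative assumptions, not values stated in the description.

```python
import statistics

# Sketch of the match decision: the detected sound is judged to match the
# program when the mean coincidence over several detections exceeds a
# threshold and the standard deviation stays below a limit (both thresholds
# are illustrative assumptions).

def is_watching(similarities, min_mean=0.6, max_stdev=0.2):
    """Decide from per-detection coincidence scores whether the user is
    likely watching the program."""
    if len(similarities) < 2:
        # Too few detections for a spread estimate; fall back to mean only.
        return bool(similarities) and similarities[0] >= min_mean
    mean = statistics.mean(similarities)
    stdev = statistics.stdev(similarities)
    return mean >= min_mean and stdev <= max_stdev
```

The standard-deviation condition rejects cases where a single high score (e.g. the program briefly audible from another room) inflates the mean, which matters when the threshold is set low to tolerate voices and environmental noise picked up by the microphone.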
The evaluation information use determining unit 114 may determine whether to use a piece of evaluation information for evaluating the program based on a comparison between the sound of the program and the sound detected by the microphone 28 at, or near, the time when that evaluation information was input by the user. While the program to be evaluated is being broadcast, whether or not the user is watching the program may change, and the evaluation information use determining unit 114 makes its determination according to this change in the viewing situation. FIG. 4B is a diagram illustrating an example of user behavior that may occur when this method of determining whether to use evaluation information is applied. As shown in FIG. 4B, a certain user inputs evaluation information, for example, five minutes after the broadcast of the program has started. The sound detected by the microphone 28 around that time matches the sound of the program, so this evaluation information is used for the evaluation. Thereafter, the user leaves the room where the television receiver 4 is located and inputs evaluation information 30 minutes after the broadcast started. The sound detected around that time does not match the sound of the program, so this evaluation information is not used for the evaluation. The user then returns to the room and inputs evaluation information 50 minutes after the broadcast started. The sound detected around that time again matches the sound of the program, so this evaluation information is used for the evaluation.
When the user terminal 2 transmits the detected sound information to the server 1 together with the evaluation information, the evaluation information use determining unit 114 may determine whether to use the evaluation information for evaluating the program based on a comparison between that detected sound information and the program sound information. When the user terminal 2 repeatedly transmits detected sound information at predetermined time intervals while transmitting evaluation information to the server 1 each time it is input, the evaluation information use determining unit 114 may determine whether to use the evaluation information based at least on the result of comparing the program sound information with the detected sound information, among the periodically received detected sound information, that indicates the sound detected by the microphone 28 of the user terminal 2 at the time, before or after the input time of the evaluation information, closest to that input time. That is, in deciding whether to use the evaluation information for the evaluation, at least the detected sound information from the time closest to the input time, before or after it, is used.
Note that, when the time span in which the evaluation information is input and the time span in which the sound is detected at least partially overlap, it may be determined whether or not to use that evaluation information for evaluating the program.
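Selecting the detected-sound record whose detection time is closest to the evaluation's input time, and checking for overlapping time spans, might look like this; the record layout with a `detected_at` field is an assumption made for illustration.

```python
def nearest_detection(detections, input_time):
    """Pick the record whose detection time is closest to the input time of
    the evaluation information, whether before or after it."""
    return min(detections, key=lambda d: abs(d["detected_at"] - input_time))

def spans_overlap(eval_start, eval_end, det_start, det_end):
    """True when the evaluation-input span and the sound-detection span
    overlap at least partially."""
    return eval_start <= det_end and det_start <= eval_end
```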
The user terminal 2 may transmit the input time of the evaluation information to the server 1 together with the evaluation information, or may transmit the time at which the evaluation information is transmitted to the server 1 as its input time. Alternatively, the server 1 may use the time at which it receives the evaluation information from the user terminal 2 as the input time of the evaluation information. Similarly, the user terminal 2 may transmit the time at which the sound was detected by the microphone 28 to the server 1 together with the detected sound information, or may transmit the time at which the detected sound information is transmitted as the detection time. Alternatively, the server 1 may use the time at which it receives the detected sound information from the user terminal 2 as the time at which the sound was detected.
The evaluation information use determining unit 114 may determine whether to use the evaluation information for evaluating the program based on the results of comparing the program sound information with two or more pieces of detected sound information, among the detected sound information repeatedly received at predetermined time intervals, that indicate sounds detected by the microphone 28 at times relatively close to the input time of the evaluation information. That is, two or more pieces of detected sound information are used to decide whether to use the evaluation information for the evaluation, which makes it possible to increase the accuracy of determining whether the user is watching the program to be evaluated. For example, the evaluation information use determining unit 114 may use a predetermined number of pieces of detected sound information whose detection times are closest to the input time, either before or after the input time of the evaluation information, or may use the pieces of detected sound information whose detection times fall within a predetermined time of the input time. Alternatively, the evaluation information use determining unit 114 may apply either of these selections both before and after the input time. The evaluation information use determining unit 114 may then determine that the evaluation information is to be used for the evaluation when, for example, the degrees of coincidence for all of the two or more pieces of detected sound information exceed the threshold, or when the average of the degrees of coincidence exceeds the threshold and their standard deviation is smaller than a predetermined value.
In the present embodiment, whether to use the evaluation information for evaluating the program is determined based only on the sound of the program. However, the evaluation information use determining unit 114 may make this determination based only on the video of the program, or based on both the video and the sound. When video is used, the user points the lens of the camera 29 of the user terminal 2 at the television receiver 4. The detection information acquisition unit 212 of the user terminal 2 causes the camera 29 to capture video, for example, periodically or when evaluation information is input. The detection information acquisition unit 212 may generate the detected video information by, for example, extracting a feature amount of the video from the video data output from the camera 29. For example, the feature amount may be extracted using an algorithm such as SIFT (Scale-Invariant Feature Transform) or SURF (Speeded-Up Robust Features). The evaluation information/detection information transmitting unit 213 transmits the detected video information to the server 1. The program information acquisition unit 111 of the server 1 may receive a broadcast signal from the broadcast station with a tuner, or acquire video data via the network NW, extract a feature amount of the video of the program, and generate program video information. The comparing unit 113 compares the program video information with the detected video information, and the evaluation information use determining unit 114 determines whether to use the evaluation information for evaluating the program based on this comparison. The details and modifications for the case of using video may be the same as for the case of using sound. When both video and sound are used, the evaluation information use determining unit 114 may determine that the evaluation information is to be used for evaluating the program when, for example, it determines that both the video and the sound match.
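Comparing video feature amounts follows the same pattern as comparing sound. In practice, SIFT or SURF descriptors would be produced by an image-processing library; the sketch below stands in with plain feature vectors and cosine similarity, and every name and threshold here is a hypothetical placeholder rather than the embodiment's actual method.

```python
import math

def cosine_similarity(u, v):
    """Cosine similarity between two feature vectors (e.g. frame descriptors)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def video_matches(program_features, detected_feature, threshold=0.9):
    """True if any stored program-frame feature is close enough to the
    feature extracted from the camera image."""
    return any(cosine_similarity(f, detected_feature) > threshold
               for f in program_features)

def use_for_evaluation(video_ok, sound_ok):
    """When both video and sound are compared, require both to match."""
    return video_ok and sound_ok
```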
The evaluation unit 115 executes a process of evaluating the program based on the evaluation information that the evaluation information use determining unit 114 has determined to use, among the evaluation information received by the evaluation information/detection information receiving unit 112. For example, the evaluation unit 115 aggregates and analyzes the evaluation information. The evaluation unit 115 may calculate the cumulative total of evaluations for each item such as "boring" and "interesting", and the number of evaluations input at each time from the start to the end of the broadcast of the program. The evaluation unit 115 may also count the overall cumulative total of evaluations and the number of evaluations input at each time, and may generate information indicating the transition of the number of evaluations, information indicating a ranking of the times with the most evaluations, and the like. In addition, the evaluation unit 115 may count the number of users who participated in the evaluation of the program, and the number of users for whom at least one piece of evaluation information was determined by the evaluation information use determining unit 114 to be used for the evaluation (that is, the number of users who made at least one valid evaluation). The evaluation unit 115 may also generate a distribution of the attributes of the users who made valid evaluations, and may generate a list of comments. The evaluation unit 115 may generate a report as the evaluation result of the program. FIG. 7 is a diagram illustrating an example of the generated report. The format of the report may be, for example, HTML (HyperText Markup Language) or PDF (Portable Document Format). The server 1 transmits the generated report via the network NW, for example, in response to a request from a terminal device (not shown) at the broadcasting station 3. The process of evaluating the program may also be executed by, for example, a terminal device used by an administrator of the program evaluation system S. In addition, the evaluation unit 115 may grant a privilege such as points to the ID of a user who has made an evaluation. This motivates users so that a large amount of evaluation information can be collected, and enlarging the population increases the reliability of the evaluation result.
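The aggregation performed by the evaluation unit 115 (per-item totals, per-time counts, and a ranking of the busiest times) can be sketched as below; the record layout and the minute-level bucketing are assumptions made for illustration.

```python
from collections import Counter

def summarize(evaluations):
    """Tally stored evaluations: totals per item, counts per minute of the
    broadcast, and a ranking of the minutes with the most evaluations.

    Each evaluation is assumed to be a dict like
    {"item": "interesting", "input_time": seconds_from_broadcast_start}.
    """
    per_item = Counter(e["item"] for e in evaluations)
    per_minute = Counter(e["input_time"] // 60 for e in evaluations)
    ranking = [minute for minute, _ in per_minute.most_common()]
    return {"per_item": dict(per_item),
            "per_minute": dict(per_minute),
            "ranking": ranking}
```

A report generator would then render this summary as HTML or PDF, as described above.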
[1-5. Operation of the program evaluation system]
Next, the operation of the program evaluation system S will be described with reference to FIGS. 8 and 9. In the operation example described below, it is assumed that the server 1 acquires the program sound information in advance and stores it in the storage unit 14, and that the user terminal 2 periodically detects sound and transmits detected sound information to the server 1. The server 1 determines whether to use a piece of evaluation information for evaluating the program using the detected sound information received at the time, at or before the reception time of the evaluation information, closest to that reception time. Waveform information is used as the program sound information and the detected sound information.
FIG. 8 is a flowchart illustrating an example of the terminal processing executed by the system control unit 21 of the user terminal 2. For example, the user launches the program evaluation application and performs an operation to start a program survey. In response, the system control unit 21 executes the terminal processing according to the program evaluation application.
First, the evaluation information/detection information transmitting unit 213 transmits a survey start notification to the server 1 together with the user ID of the user using the user terminal 2 (step S1). Next, the evaluation information/detection information transmitting unit 213 receives timing information from the server 1 and stores it in the RAM 21c (step S2).
Next, the detection information acquisition unit 212 determines, based on the current time, whether the sound detection timing has arrived (step S3). For example, the detection information acquisition unit 212 determines the detection timing so as to be in time for the transmission timing, based on the transmission timing of the detected sound information indicated by the timing information, the duration for which sound detection is continued, and so on. When the detection information acquisition unit 212 determines that the detection timing has arrived (step S3: YES), the process proceeds to step S4. In step S4, the detection information acquisition unit 212 causes the microphone 28 to detect sound. Next, the detection information acquisition unit 212 extracts waveform information from the audio signal output from the microphone 28 as the detected sound information (step S5).
When the detection information acquisition unit 212 determines in step S3 that the detection timing has not arrived (step S3: NO), the process proceeds to step S6. In step S6, the evaluation information/detection information transmitting unit 213 determines, based on the current time, whether the transmission timing indicated by the timing information has arrived. When the evaluation information/detection information transmitting unit 213 determines that the transmission timing has arrived (step S6: YES), the process proceeds to step S7. In step S7, the evaluation information/detection information transmitting unit 213 transmits the detected sound information stored in the RAM 21c to the server 1 together with the user ID.
When the evaluation information/detection information transmitting unit 213 determines in step S6 that the transmission timing has not arrived (step S6: NO), the process proceeds to step S8. In step S8, the evaluation information acquisition unit 211 determines, based on a signal from the operation input unit 26, whether evaluation information has been input. When the evaluation information acquisition unit 211 determines that evaluation information has been input (step S8: YES), the process proceeds to step S9. In step S9, the evaluation information acquisition unit 211 transmits the input evaluation information to the server 1 together with the user ID.
When step S5, S7 or S9 has been completed, or when it is determined in step S8 that no evaluation information has been input (step S8: NO), the system control unit 21 determines whether the broadcast end time of the program to be evaluated has arrived (step S10). When the system control unit 21 determines that the end time has not arrived (step S10: NO), the process proceeds to step S3. On the other hand, when the system control unit 21 determines that the end time has arrived (step S10: YES), it ends the terminal processing.
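One pass of the terminal loop of FIG. 8 (steps S3 through S9) might be sketched as below. The callbacks for detecting sound and sending to the server are injected so the control flow stays visible; this single-threaded shape is an illustrative simplification, not the embodiment's actual implementation, and all names are hypothetical.

```python
def terminal_step(now, detect_due, send_due, pending_eval, state,
                  detect_sound, send_to_server):
    """One iteration of the FIG. 8 loop: check the detection timing (S3-S5),
    then the transmission timing (S6-S7), then evaluation input (S8-S9)."""
    if detect_due(now):                       # S3 -> S4/S5: detect and buffer
        state["buffer"].append(detect_sound())
        return "detected"
    if send_due(now):                         # S6 -> S7: flush buffer to server
        send_to_server(state["buffer"])
        state["buffer"] = []
        return "sent"
    if pending_eval is not None:              # S8 -> S9: forward evaluation
        send_to_server([pending_eval])
        return "evaluated"
    return "idle"                             # fall through to the S10 check
```

The caller repeats this step until the program's broadcast end time arrives (step S10).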
FIG. 9 is a flowchart illustrating an example of the server processing executed by the system control unit 11 of the server 1. The system control unit 11 starts the server processing according to the server program, for example, a predetermined time before the start of the broadcast of the program to be evaluated.
First, the evaluation information/detection information receiving unit 112 determines whether a survey start notification has been received from any user terminal 2 (step S21). When the evaluation information/detection information receiving unit 112 determines that a survey start notification has been received (step S21: YES), the process proceeds to step S22. In step S22, the evaluation information/detection information receiving unit 112 stores a viewing flag set to FALSE in the RAM 11c in association with the user ID received together with the survey start notification. The viewing flag is information indicating whether the user is viewing the program to be evaluated. Next, the evaluation information/detection information receiving unit 112 selects one of a plurality of predetermined transmission timings, for example, at random (step S23). The evaluation information/detection information receiving unit 112 transmits timing information indicating the selected transmission timing to the user terminal 2 that sent the survey start notification (step S24).
When the evaluation information/detection information receiving unit 112 determines in step S21 that no survey start notification has been received (step S21: NO), the process proceeds to step S25. In step S25, the evaluation information/detection information receiving unit 112 determines whether detected sound information has been received from any user terminal 2. When the evaluation information/detection information receiving unit 112 determines that detected sound information has been received (step S25: YES), the process proceeds to step S26. In step S26, the comparing unit 113 calculates the degree of coincidence between the waveform information indicated by the received detected sound information and each piece of waveform information included in the program sound information, and determines whether any piece of that waveform information has a calculated degree of coincidence exceeding the threshold. When the comparing unit 113 determines that there is waveform information whose degree of coincidence exceeds the threshold (step S26: YES), it sets the viewing flag associated with the user ID received together with the detected sound information to TRUE (step S27). On the other hand, when the comparing unit 113 determines that there is no waveform information whose degree of coincidence exceeds the threshold (step S26: NO), it sets the viewing flag associated with the user ID received together with the detected sound information to FALSE (step S28).
When the evaluation information/detection information receiving unit 112 determines in step S25 that no detected sound information has been received (step S25: NO), the process proceeds to step S29. In step S29, the evaluation information/detection information receiving unit 112 determines whether evaluation information has been received from any user terminal 2. When the evaluation information/detection information receiving unit 112 determines that evaluation information has been received (step S29: YES), the process proceeds to step S30. In step S30, the evaluation information use determining unit 114 determines whether the viewing flag associated with the user ID received together with the evaluation information is TRUE. When the evaluation information use determining unit 114 determines that the viewing flag is TRUE (step S30: YES), the process proceeds to step S31. In step S31, the evaluation information use determining unit 114 sets the input time of the received evaluation information to the reception time of that evaluation information. Next, the evaluation information use determining unit 114 stores the evaluation information, the input time, and the user ID in the storage unit 14 in association with one another (step S32). On the other hand, when the evaluation information use determining unit 114 determines that the viewing flag is FALSE (step S30: NO), it discards the received evaluation information (step S33).
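The viewing-flag bookkeeping of steps S25 through S33 can be condensed into the following sketch, where `matches` abstracts the waveform comparison of step S26; the class and method names are hypothetical.

```python
class EvaluationServer:
    """Minimal sketch of the FIG. 9 bookkeeping (steps S25-S33).

    `matches` is the sound-comparison predicate and is kept abstract here."""

    def __init__(self, matches):
        self.matches = matches
        self.viewing = {}       # user_id -> viewing flag (TRUE/FALSE)
        self.accepted = []      # evaluations kept for the evaluation process

    def on_detected_sound(self, user_id, detected):
        # S26-S28: set the viewing flag from the comparison result.
        self.viewing[user_id] = self.matches(detected)

    def on_evaluation(self, user_id, evaluation, received_at):
        # S30-S33: keep the evaluation only if the flag is TRUE;
        # the reception time stands in for the input time (S31).
        if self.viewing.get(user_id, False):
            self.accepted.append((user_id, evaluation, received_at))
        # otherwise the evaluation is discarded (S33)
```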
When step S24, S27, S28, S32 or S33 has been completed, or when it is determined in step S29 that no evaluation information has been received (step S29: NO), the evaluation unit 115 determines whether the broadcast end time of the program to be evaluated has arrived (step S34). When the evaluation unit 115 determines that the end time has not arrived (step S34: NO), the process proceeds to step S21. On the other hand, when the evaluation unit 115 determines that the end time has arrived (step S34: YES), it executes the program evaluation process using the evaluation information (step S35). The evaluation unit 115 aggregates and analyzes the evaluation information stored in the storage unit 14. For example, the evaluation unit 115 may use the input times to calculate the transition of the number of evaluations for each evaluation item, or to determine a ranking of the times with the most evaluations. The evaluation unit 115 generates a report indicating the evaluation result and stores it in the storage unit 14. When the evaluation process is completed, the evaluation unit 115 ends the server processing.
As described above, according to the present embodiment, the user terminal 2 detects the sound output by the television receiver 4, which outputs the video and sound constituting the program, and transmits the evaluation information input by the user and the detected sound information indicating the detected sound to the server 1. The server 1 acquires program sound information indicating the sound constituting the program, receives the evaluation information and the detected sound information from the user terminal 2, and compares the acquired program sound information with the received detected sound information. When this comparison shows a predetermined match between the program sound information and the received detected sound information, the server 1 selects the received evaluation information to be used for evaluating the program. This makes it possible to estimate whether the user is actually viewing the program to be evaluated, so that only evaluation information for which the user is estimated to be viewing the program is used for the evaluation. The reliability of the evaluation information can therefore be ensured.
The content may be the content of a broadcast program. The server 1 may determine the transmission timing for each of the plurality of user terminals 2 such that, during the broadcast of the program, the transmission timing of the evaluation information and/or the detected sound information by at least one of the user terminals 2 differs from that of at least one other user terminal 2, and may transmit timing information indicating the determined transmission timing to each of the user terminals 2. Each user terminal 2 may receive the timing information from the server 1 and transmit the evaluation information and/or the detected sound information according to the transmission timing indicated by the received timing information. In this case, even when detected sound information is transmitted from each of the plurality of user terminals 2 to the server 1 during the broadcast of the program, the transmission timings of the evaluation information and/or the detected sound information are spread over multiple points in time. This prevents the processing load of the server 1 from being concentrated at a single point in time and distributes the load.
The server 1 may also set the transmission timing of at least one other user terminal 2 within the interval between the transmission timings of at least one of the plurality of user terminals 2. In this case, the number of pieces of detection information the server 1 receives per unit time is evened out, so the processing load of the server device can be further distributed.
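Interleaving the per-terminal transmission timings so that the server's load spreads evenly over a reporting cycle could be done, for example, by even spacing; the cycle length and this particular spacing rule are assumptions for illustration (the embodiment also allows, for example, random assignment).

```python
def assign_timings(user_ids, cycle=60):
    """Spread each terminal's transmission offset (in seconds) evenly over
    one reporting cycle, so no two terminals report at the same instant."""
    step = cycle / max(len(user_ids), 1)
    return {uid: round(i * step, 3) for i, uid in enumerate(user_ids)}
```

Each offset would be sent to its terminal as the timing information of step S24; the terminal then transmits at that offset within every cycle.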
 また、ユーザ端末2が、所定時間間隔を置いて繰り返し、音を検出して検出音情報を送信してもよい。サーバ1は、所定時間間隔を置いて繰り返し受信される検出音情報のうち、評価情報の入力時刻以前又は以後で入力時刻に最も近い時刻にユーザ端末2により検出される音を示す検出音情報と番組音情報との比較結果に少なくとも基づいて、評価情報を番組の評価に用いるか否かを決定してもよい。この場合、ユーザ端末2が音の検出及び検出音情報の送信を定期的に実行する態様においても、評価情報を入力した時点においてユーザが番組を視聴しているか否かの推定精度を高めることができる。 The user terminal 2 may also repeatedly detect sound at predetermined time intervals and transmit the detected sound information. The server 1 may then determine whether to use the evaluation information for evaluating the program based at least on a comparison, with the program sound information, of the detected sound information indicating the sound detected by the user terminal 2 at the time closest to the input time of the evaluation information, either before or after that input time, among the detected sound information repeatedly received at the predetermined time intervals. In this case, even in a mode in which the user terminal 2 periodically performs sound detection and transmission of the detected sound information, the accuracy of estimating whether the user is watching the program at the time the evaluation information is input can be increased.
 また、サーバ1が、定期的に受信される検出音情報のうち、評価情報の入力時刻から相対的に近い時刻に検出された音をそれぞれ示す2以上の検出音情報と番組音情報との比較結果に基づいて、評価情報を番組の評価に用いるか否かを決定してもよい。この場合、評価情報を入力した時点においてユーザが番組を視聴しているか否かの推定精度を更に高めることができる。 The server 1 may also determine whether to use the evaluation information for evaluating the program based on a comparison between the program sound information and two or more pieces of detected sound information, each indicating a sound detected at a time relatively close to the input time of the evaluation information, among the periodically received detected sound information. In this case, the accuracy of estimating whether the user is watching the program at the time the evaluation information is input can be further increased.
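The nearest-in-time selection described above can be sketched as follows. The record layout (detection time paired with sound information) is hypothetical; the patent does not fix a data structure:

```python
def nearest_detections(detections, input_time, k=1):
    """Return the k detected-sound records whose detection time is
    closest to the evaluation input time, whether before or after it.
    Records are (detect_time, sound_info) tuples (illustrative)."""
    return sorted(detections, key=lambda d: abs(d[0] - input_time))[:k]

dets = [(10, "a"), (20, "b"), (30, "c"), (40, "d")]
# k=1 picks the single closest detection; k>=2 corresponds to
# comparing two or more detections near the input time.
```

The server would then compare only the returned detection(s) against the program sound information instead of the whole received series.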
 また、ユーザ端末2が、評価情報が入力されたときに音を検出してもよい。サーバ1は、番組音情報が、検出音情報と一致すると判定される部分を有する場合、検出音情報とともに受信された評価情報を、番組の評価に用いると決定してもよい。この場合、評価情報を入力した時点においてユーザが番組を視聴しているか否かの推定精度を高めることができる。 The user terminal 2 may detect a sound when the evaluation information is input. When the program sound information includes a portion determined to match the detected sound information, the server 1 may determine that the evaluation information received together with the detected sound information is used for evaluating the program. In this case, the accuracy of estimating whether the user is watching the program at the time when the evaluation information is input can be improved.
[2.第2実施形態] [2. Second Embodiment]
[2-1.機能概要] [2-1. Functional overview]
 次に、図10を用いて、第2実施形態におけるサーバ1のシステム制御部11、及びユーザ端末2のシステム制御部21の機能概要を説明する。本実施形態において、ユーザ端末2は、評価情報が入力されたタイミングで音を検出して検出音情報を取得する。評価情報及び検出音情報の送信タイミングは、検出音情報が検出された時点であってもよいし、定期的であってもよい。 Next, an overview of the functions of the system control unit 11 of the server 1 and the system control unit 21 of the user terminal 2 in the second embodiment will be given with reference to FIG. 10. In the present embodiment, the user terminal 2 detects sound at the timing at which the evaluation information is input and acquires the detected sound information. The evaluation information and the detected sound information may be transmitted at the point in time at which the sound is detected, or periodically.
 サーバ1は、検出音情報に基づいて、ユーザ端末2に評価情報が入力された時刻を特定する。評価情報利用決定部114は、番組音情報が、検出音情報と一致すると判定される部分を有する場合、ユーザ端末2からその検出音情報とともに受信された評価情報を番組の評価に用いると決定する。このとき、評価情報利用決定部114は、評価対象の番組において検出情報と一致すると判定された部分により示される音のテレビ受像機4による出力時刻(放送時刻)を、その検出音情報とともに受信された評価情報の入力時刻として特定してもよい。この時刻は、絶対的な時刻であってもよいし、放送開始時刻からの相対的な時刻であってもよい。これにより、正確度の高い入力時刻を用いて、番組の評価を行うことが可能である。この入力時刻特定方法は、配信時間帯が予め定められていないオンデマンド配信にも有効である。 The server 1 specifies the time at which the evaluation information was input to the user terminal 2 based on the detected sound information. When the program sound information has a portion determined to match the detected sound information, the evaluation information use determining unit 114 determines that the evaluation information received from the user terminal 2 together with that detected sound information is to be used for evaluating the program. At this time, the evaluation information use determining unit 114 may specify the time at which the television receiver 4 output (broadcast) the sound indicated by the matching portion of the program to be evaluated as the input time of the evaluation information received together with the detected sound information. This time may be an absolute time or a time relative to the broadcast start time. This makes it possible to evaluate the program using a highly accurate input time. This method of specifying the input time is also effective for on-demand distribution, in which the distribution time slot is not predetermined.
 サーバ1における一致判定には、例えば音の特徴量が用いられてもよいが、この特徴量を特定することが可能な特徴量特定情報が用いられてもよい。この特徴量特定情報は、特徴量よりも情報量が少ない情報である。この特徴量特定情報は、少なくとも一の番組において、基本的に特徴量ごとに異なるものである。評価の対象となる番組が複数存在する場合、番組間においても特徴量特定情報が異なることが好ましい。特徴量特定情報は、例えば特徴量の要約を示すハッシュ値であってもよいし、所定の基準に基づいて特徴量に付与される識別情報であってもよい。或いは、特徴量特定情報は、特徴量に対応する音の放送時刻を含んでもよい。なお、特徴量及び特徴量特定情報の代わりに、波形情報及びこの波形情報を特定することが可能な情報であって且つ波形情報よりも情報量が少ない情報が用いられてもよい。 For the match determination in the server 1, a feature amount of the sound may be used, for example, but feature amount specifying information capable of identifying that feature amount may be used instead. The feature amount specifying information has a smaller information amount than the feature amount itself. Within at least one program, the feature amount specifying information basically differs for each feature amount. When there are a plurality of programs to be evaluated, it is preferable that the feature amount specifying information also differ between programs. The feature amount specifying information may be, for example, a hash value summarizing the feature amount, or identification information assigned to the feature amount based on a predetermined criterion. Alternatively, the feature amount specifying information may include the broadcast time of the sound corresponding to the feature amount. Instead of the feature amount and the feature amount specifying information, waveform information, together with information that can identify the waveform information and has a smaller information amount than the waveform information, may be used.
 以降においては、特徴量特定情報として特徴量のハッシュ値を用いた場合について説明する。図10は、番組評価システムSにおける処理概要の一例を示す図である。番組情報取得部111は、評価対象の番組の放送が開始される以前に、ネットワークNWを介して取得された番組の音声データから、番組の音の特徴量の時系列で構成される特徴量時系列データを生成する。言い換えれば、番組の音声データを複数に区分し時系列に並べたものを各々特徴量に変換した状態で記憶部24に記憶しておく。また、番組情報取得部111は、生成された特徴量時系列データを特定情報時系列データに変換する。例えば、番組情報取得部111は、特徴量時系列データの各特徴量のハッシュ値を、所定のハッシュ関数により生成する。そして、番組情報取得部111は、ハッシュ値の時系列で構成される特定情報時系列データを番組音情報として生成して記憶部14に記憶させる。この番組音情報において各ハッシュ値は、評価対象の番組においてこのハッシュ値に対応する音が放送される時刻と関連付けられてもよい。 The following describes the case in which a hash value of the feature amount is used as the feature amount specifying information. FIG. 10 is a diagram illustrating an example of the processing flow in the program evaluation system S. Before the broadcast of the program to be evaluated starts, the program information acquisition unit 111 generates, from the audio data of the program acquired via the network NW, feature amount time-series data consisting of a time series of feature amounts of the program's sound. In other words, the audio data of the program is divided into a plurality of segments arranged in chronological order, each of which is converted into a feature amount and stored in the storage unit 24. The program information acquisition unit 111 then converts the generated feature amount time-series data into specific information time-series data. For example, the program information acquisition unit 111 generates a hash value of each feature amount in the feature amount time-series data using a predetermined hash function. The program information acquisition unit 111 then generates the specific information time-series data, consisting of the time series of hash values, as the program sound information and stores it in the storage unit 14. In this program sound information, each hash value may be associated with the time at which the sound corresponding to that hash value is broadcast in the program to be evaluated.
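The conversion of the feature amount time series into a hash-value time series can be sketched as follows. A truncated SHA-256 digest stands in for the "predetermined hash function", which the embodiment leaves unspecified; the pair layout is illustrative:

```python
import hashlib

def to_hash_series(feature_series):
    """Convert (broadcast_time, feature_bytes) pairs into
    (broadcast_time, hash_value) pairs. Each hash identifies its
    feature amount while being far smaller than the feature itself
    (sketch; the patent does not fix the hash function)."""
    return [(t, hashlib.sha256(feat).hexdigest()[:16])
            for t, feat in feature_series]

# Program sound information: hash time series with associated broadcast times.
program_sound_info = to_hash_series([(0.0, b"feature-100"), (5.0, b"feature-101")])
```

The server stores only this compact hash series; the broadcast time attached to each hash is what later lets the input time of an evaluation be recovered.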
 ユーザ端末2は、事前に、特徴量時系列データを、番組音特徴量情報として取得する。例えば、ユーザは、評価対象の番組の放送が開始されるよりも前にユーザ端末2を操作して、番組調査に予めエントリーする。サーバ1は、ユーザ端末2からのエントリーの通知を受信すると、番組音特徴量情報を送信する。ユーザ端末2は、番組音特徴量情報を記憶部24に記憶させる。 The user terminal 2 acquires the feature amount time-series data in advance as the program sound feature amount information. For example, the user operates the user terminal 2 before the broadcast of the program to be evaluated starts and enters the program survey in advance. Upon receiving the entry notification from the user terminal 2, the server 1 transmits the program sound feature amount information. The user terminal 2 stores the program sound feature amount information in the storage unit 24.
 評価対象の番組の放送中に、ユーザは、評価情報をユーザ端末2に入力する。このとき、検出情報取得部212は、マイク28により音を検出させて、マイク28から音声信号を受信する。検出情報取得部212は、音声信号から特徴量を抽出して、この特徴量と番組音特徴量情報における各特徴量とを比較して、一致度を算出する。検出情報取得部212は、番組音特徴量情報の中に、一致度が所定の閾値を超える特徴量が存在する場合、その特徴量のハッシュ値を、サーバ1で用いられるハッシュ関数と同一のハッシュ関数(例えば番組評価アプリケーションに含まれている。)により生成する。検出情報取得部212は、このハッシュ値を、検出音情報として取得する。評価情報・検出情報送信部213は、検出音情報としてのハッシュ値を、評価情報とともにサーバ1へ送信する。 During the broadcast of the program to be evaluated, the user inputs evaluation information into the user terminal 2. At this time, the detection information acquisition unit 212 causes the microphone 28 to detect sound and receives an audio signal from the microphone 28. The detection information acquisition unit 212 extracts a feature amount from the audio signal, compares it with each feature amount in the program sound feature amount information, and calculates a degree of coincidence. When the program sound feature amount information contains a feature amount whose degree of coincidence exceeds a predetermined threshold, the detection information acquisition unit 212 generates the hash value of that feature amount using the same hash function as the one used by the server 1 (included, for example, in the program evaluation application). The detection information acquisition unit 212 acquires this hash value as the detected sound information. The evaluation information / detection information transmitting unit 213 transmits the hash value, as the detected sound information, to the server 1 together with the evaluation information.
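The terminal-side matching step can be sketched as follows. Cosine similarity is used here purely as an illustrative "degree of coincidence"; the embodiment does not specify a metric, feature representation, or threshold:

```python
def match_detected_feature(detected, program_feats, threshold):
    """Compare a detected feature vector against each program feature
    and return the index of the best match whose similarity exceeds
    the threshold, or None if no feature matches (sketch only)."""
    def cosine(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        na = sum(x * x for x in a) ** 0.5
        nb = sum(x * x for x in b) ** 0.5
        return dot / (na * nb) if na and nb else 0.0

    scored = [(cosine(detected, f), i) for i, f in enumerate(program_feats)]
    best_score, best_i = max(scored)
    return best_i if best_score > threshold else None
```

When the function returns an index, the terminal would hash the matching program feature and send that hash with the evaluation information; when it returns None, nothing is sent, which is the communication-count saving noted below.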
 サーバ1の比較部113は、ユーザ端末2から受信された検出音情報と、番組音情報としてのハッシュ値の時系列の各ハッシュ値とを比較する。比較の結果、評価情報・検出情報送信部213は、ハッシュ値の時系列中に検出音情報と一致するハッシュ値が存在する場合、評価情報を番組の評価に用いると決定する。評価情報・検出情報送信部213は、評価対象の番組において、検出音情報と一致するハッシュ値に対応する音が放送される時刻を、評価情報の入力時刻として特定する。ハッシュ値等の特徴量特定情報を用いることにより、評価に用いられる入力時刻の正確度が高まるとともに、番組の放送中におけるサーバ1とユーザ端末2との間の通信量を削減することができる。この通信量の削減は、検出音情報が特徴量から特徴量特定情報に置き換えられることによる検出音情報の情報量の削減により実現される。また、マイク28により検出された音と番組の音とが一致しなかった場合、ユーザ端末2は検出音情報及び評価情報の何れもサーバ1へ送信しなくてもよいので、通信回数が削減される。 The comparison unit 113 of the server 1 compares the detected sound information received from the user terminal 2 with each hash value in the time series of hash values serving as the program sound information. As a result of the comparison, when a hash value matching the detected sound information exists in the time series of hash values, it is determined that the evaluation information is to be used for evaluating the program, and the time at which the sound corresponding to the matching hash value is broadcast in the program to be evaluated is specified as the input time of the evaluation information. Using feature amount specifying information such as hash values increases the accuracy of the input time used for the evaluation and reduces the amount of communication between the server 1 and the user terminal 2 during the broadcast of the program. This reduction is achieved because replacing the feature amount with the feature amount specifying information reduces the information amount of the detected sound information. Furthermore, when the sound detected by the microphone 28 does not match the sound of the program, the user terminal 2 need not transmit either the detected sound information or the evaluation information to the server 1, which reduces the number of communications.
 例えば、図10において、ユーザ端末2は、番組音特徴量情報中の特徴量102が検出音情報と一致すると判定した。そこで、ユーザ端末2は、特徴量102のハッシュ値をサーバ1へ送信する。サーバ1側では、番組音情報中において元々特徴量102から生成されたハッシュ値102と、ユーザ端末2から受信されたハッシュ値とが一致する。従って、ハッシュ値102に対応付けられた時刻T102が、評価情報の入力時刻となる。 For example, in FIG. 10, the user terminal 2 has determined that the feature amount 102 in the program sound feature amount information matches the detected sound information. Therefore, the user terminal 2 transmits the hash value of the feature amount 102 to the server 1. On the server 1 side, the hash value 102 originally generated from the feature amount 102 in the program sound information matches the hash value received from the user terminal 2. Therefore, the time T102 associated with the hash value 102 is the input time of the evaluation information.
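The server-side decision in this example can be sketched as follows (hypothetical names and hash strings; the matching hash's associated broadcast time becomes the input time, as with hash value 102 and time T102 above):

```python
def resolve_input_time(program_hash_series, received_hash):
    """Search the program's hash time series for the hash received
    from the terminal. If found, accept the evaluation and take the
    broadcast time of the matching hash as the input time of the
    evaluation information; otherwise reject it (sketch only)."""
    for broadcast_time, h in program_hash_series:
        if h == received_hash:
            return True, broadcast_time   # use evaluation; input time known
    return False, None                    # discard evaluation

program_hash_series = [(0.0, "h100"), (5.0, "h101"), (10.0, "h102")]
```

A receipt of "h102" would thus be accepted with input time 10.0, while an unknown hash would cause the evaluation to be discarded.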
[2-2.番組評価システムの動作] [2-2. Operation of the program evaluation system]
 次に、番組評価システムSの動作について、図11及び12を用いて説明する。以下に説明する動作例において、ユーザ端末2は、評価情報が入力されたタイミングで、検出音情報を評価情報とともにサーバ1へ送信するものとする。また、ユーザ端末2は、番組音特徴量情報を予め取得して記憶部24に記憶しているものとする。 Next, the operation of the program evaluation system S will be described with reference to FIGS. 11 and 12. In the operation example described below, it is assumed that the user terminal 2 transmits the detected sound information, together with the evaluation information, to the server 1 at the timing at which the evaluation information is input. It is also assumed that the user terminal 2 has acquired the program sound feature amount information in advance and stored it in the storage unit 24.
 図11は、ユーザ端末2のシステム制御部21により実行される端末処理の一例を示すフローチャートである。図11において、図8と同様の処理については同様の符号が付されている。 FIG. 11 is a flowchart illustrating an example of the terminal processing executed by the system control unit 21 of the user terminal 2. In FIG. 11, the same processes as those in FIG. 8 are denoted by the same reference numerals.
 先ず、評価情報取得部211は、評価情報が入力されたか否かを判定する(ステップS8)。評価情報取得部211は、評価情報が入力されたと判定した場合には(ステップS8:YES)、処理をステップS4に進める。ステップS4において、検出情報取得部212は、マイク28により音を検出させる。次いで、検出情報取得部212は、マイク28から出力される音声信号から特徴量を抽出する(ステップS41)。次いで、検出情報取得部212は、生成された特徴量と、番組音特徴量情報中における各特徴量とを比較する。評価情報取得部211は、生成された特徴量との間の一致度が閾値を超える特徴量が番組音特徴量情報中に存在するか否かを判定する(ステップS42)。検出情報取得部212は、一致度が閾値を超える特徴量が存在すると判定した場合には(ステップS42:YES)、処理をステップS43に進める。ステップS43において、検出情報取得部212は、番組音特徴量情報中において、生成された特徴量の一致度が閾値を超える特徴量のうち、一致度が最大である特徴量のハッシュ値を生成する。次いで、評価情報・検出情報送信部213は、評価情報と生成されたハッシュ値とをユーザIDとともにサーバ1へ送信する(ステップS44)。 First, the evaluation information acquisition unit 211 determines whether evaluation information has been input (step S8). When it determines that evaluation information has been input (step S8: YES), the evaluation information acquisition unit 211 advances the processing to step S4. In step S4, the detection information acquisition unit 212 causes the microphone 28 to detect sound. Next, the detection information acquisition unit 212 extracts a feature amount from the audio signal output from the microphone 28 (step S41). The detection information acquisition unit 212 then compares the extracted feature amount with each feature amount in the program sound feature amount information, and determines whether the program sound feature amount information contains a feature amount whose degree of coincidence with the extracted feature amount exceeds the threshold (step S42). When it determines that such a feature amount exists (step S42: YES), the detection information acquisition unit 212 advances the processing to step S43. In step S43, the detection information acquisition unit 212 generates the hash value of the feature amount with the highest degree of coincidence among the feature amounts in the program sound feature amount information whose degree of coincidence exceeds the threshold. Next, the evaluation information / detection information transmitting unit 213 transmits the evaluation information and the generated hash value, together with the user ID, to the server 1 (step S44).
 ステップS44を終えたとき、ステップS8において評価情報が入力されていないと判定されたとき(ステップS8:NO)、又はステップS42において一致度が閾値を超える特徴量が存在しないと判定されたとき(ステップS42:NO)、システム制御部21は、評価対象の番組の放送終了時刻が到来したか否かを判定する(ステップS10)。システム制御部21は、終了時刻が到来していないと判定した場合には(ステップS10:NO)、処理をステップS8に進める。一方、システム制御部21は、終了時刻が到来したと判定した場合には(ステップS10:YES)、端末処理を終了させる。 When step S44 has been completed, when it is determined in step S8 that no evaluation information has been input (step S8: NO), or when it is determined in step S42 that no feature amount whose degree of coincidence exceeds the threshold exists (step S42: NO), the system control unit 21 determines whether the broadcast end time of the program to be evaluated has arrived (step S10). When it determines that the end time has not arrived (step S10: NO), the system control unit 21 advances the processing to step S8. On the other hand, when it determines that the end time has arrived (step S10: YES), the system control unit 21 ends the terminal processing.
 図12は、サーバ1のシステム制御部11により実行されるサーバ処理の一例を示すフローチャートである。図12において、図9と同様の処理については同様の符号が付されている。 FIG. 12 is a flowchart illustrating an example of a server process executed by the system control unit 11 of the server 1. In FIG. 12, the same processes as those in FIG. 9 are denoted by the same reference numerals.
 先ず、評価情報・検出情報受信部112は、何れかのユーザ端末2から評価情報及びハッシュ値を受信したか否かを判定する(ステップS51)。評価情報・検出情報受信部112は、評価情報及びハッシュ値を受信したと判定した場合には(ステップS51:YES)、処理をステップS52に進める。ステップS52において、比較部52は、受信されたハッシュ値と一致するハッシュ値が番組音情報中に存在するか否かを判定する。評価情報・検出情報受信部112は、受信されたハッシュ値と一致するハッシュ値が存在すると判定した場合には(ステップS52:YES)、処理をステップS53に進める。ステップS53において、評価情報利用決定部114は、受信された評価情報の入力時刻を、受信されたハッシュ値と一致するハッシュ値に対応付けられた放送時刻に設定する。次いで、評価情報利用決定部114は、評価情報、入力時刻及びユーザIDを関連付けて記憶部14に記憶させる(ステップS32)。一方、評価情報利用決定部114は、受信されたハッシュ値と一致するハッシュ値が存在しないと判定した場合には(ステップS52:NO)、受信した評価情報を破棄する(ステップS33)。 First, the evaluation information / detection information receiving unit 112 determines whether evaluation information and a hash value have been received from any of the user terminals 2 (step S51). When the evaluation information / detection information receiving unit 112 determines that the evaluation information and the hash value have been received (step S51: YES), the process proceeds to step S52. In step S52, the comparison unit 52 determines whether a hash value that matches the received hash value exists in the program sound information. If the evaluation information / detection information receiving unit 112 determines that there is a hash value that matches the received hash value (step S52: YES), the process proceeds to step S53. In step S53, the evaluation information use determining unit 114 sets the input time of the received evaluation information to the broadcast time associated with the hash value that matches the received hash value. Next, the evaluation information use determining unit 114 stores the evaluation information, the input time, and the user ID in the storage unit 14 in association with each other (Step S32). On the other hand, when it is determined that there is no hash value that matches the received hash value (step S52: NO), the evaluation information use determining unit 114 discards the received evaluation information (step S33).
 ステップS32若しくはS33を終えたとき、又はステップS51において評価情報及びハッシュ値が受信されなかったと判定されたとき(ステップS51:NO)、評価部115は、評価対象の番組の放送終了時刻が到来したか否かを判定する(ステップS34)。評価部115は、終了時刻が到来していないと判定した場合には(ステップS34:NO)、処理をステップS51に進める。一方、評価部115は、終了時刻が到来したと判定した場合には(ステップS34:YES)、評価情報を用いて番組の評価処理を実行して(ステップS35)、サーバ処理を終了させる。 When step S32 or S33 has been completed, or when it is determined in step S51 that no evaluation information and hash value have been received (step S51: NO), the evaluation unit 115 determines whether the broadcast end time of the program to be evaluated has arrived (step S34). When it determines that the end time has not arrived (step S34: NO), the evaluation unit 115 advances the processing to step S51. On the other hand, when it determines that the end time has arrived (step S34: YES), the evaluation unit 115 executes the program evaluation processing using the evaluation information (step S35) and ends the server processing.
 以上説明したように、本実施形態によれば、サーバ1が、番組音情報のうち、検出音情報と一致すると判定された部分により示される音のテレビ受像機4による出力時刻を、検出音情報とともに受信された評価情報の入力時刻として特定する。従って、番組の何れの場面に対して入力された評価情報であるかを適切に特定することが可能であるので、番組に対して適切な評価を行うことができる。 As described above, according to the present embodiment, the server 1 specifies the time at which the television receiver 4 output the sound indicated by the portion of the program sound information determined to match the detected sound information as the input time of the evaluation information received together with that detected sound information. It is therefore possible to appropriately identify which scene of the program the evaluation information was input for, so that the program can be evaluated appropriately.
 また、サーバ1が、番組のコンテンツを構成する音の特徴量の時系列における特徴量それぞれを特定する特徴量特定情報の時系列を、番組音情報として取得してもよい。ユーザ端末2は、音の特徴量の時系列を示す番組音特徴量情報を事前に取得してもよい。また、ユーザ端末2が、番組音特徴量情報の中に、検出された音の特徴量との間の一致度が閾値を超える特徴量が存在する場合、その特徴量を特定する特徴量特定情報を、検出音情報として送信してもよい。サーバ1は、番組音情報のうち、ユーザ端末2から受信された特徴量特定情報と一致する特徴量特定情報に対応する音のテレビ受像機4による出力時刻を、評価情報の入力時刻として特定してもよい。この場合、検出音情報の情報量が削減されるので、ユーザ端末2及びサーバ1の通信負荷を削減することができる。 The server 1 may also acquire, as the program sound information, a time series of feature amount specifying information, each item of which identifies a feature amount in the time series of feature amounts of the sounds constituting the program content. The user terminal 2 may acquire, in advance, program sound feature amount information indicating the time series of sound feature amounts. When the program sound feature amount information contains a feature amount whose degree of coincidence with the feature amount of the detected sound exceeds a threshold, the user terminal 2 may transmit the feature amount specifying information identifying that feature amount as the detected sound information. The server 1 may then specify, as the input time of the evaluation information, the output time by the television receiver 4 of the sound corresponding to the feature amount specifying information in the program sound information that matches the feature amount specifying information received from the user terminal 2. In this case, the information amount of the detected sound information is reduced, so the communication load on the user terminal 2 and the server 1 can be reduced.
1 サーバ
2 ユーザ端末
11 システム制御部
12 システムバス
13 入出力インターフェース
14 記憶部
15 通信部
111 番組情報取得部
112 評価情報・検出情報受信部
113 比較部
114 評価情報利用決定部
115 評価部
21 システム制御部
22 システムバス
23 入出力インターフェース
24 記憶部
25 通信部
26 操作入力部
27 表示部
28 マイク
29 カメラ
211 評価情報取得部
212 検出情報取得部
213 評価情報・検出情報送信部
NW ネットワーク
1 Server
2 User terminal
11 System control unit
12 System bus
13 Input/output interface
14 Storage unit
15 Communication unit
111 Program information acquisition unit
112 Evaluation information / detection information receiving unit
113 Comparison unit
114 Evaluation information use determining unit
115 Evaluation unit
21 System control unit
22 System bus
23 Input/output interface
24 Storage unit
25 Communication unit
26 Operation input unit
27 Display unit
28 Microphone
29 Camera
211 Evaluation information acquisition unit
212 Detection information acquisition unit
213 Evaluation information / detection information transmitting unit
NW Network

Claims (16)

  1.  端末装置と、前記端末装置とネットワークを介して接続されるサーバ装置と、を含む評価システムにおいて、
     前記端末装置は、
     映像及び音の少なくとも何れか一方で構成されるコンテンツに対する評価情報がユーザにより入力される入力手段と、
     前記コンテンツを構成する、前記映像及び前記音の少なくとも何れか一方を出力する出力装置により出力された映像又は音を検出する検出手段と、
     前記入力された評価情報と、前記検出された映像又は音を示す検出情報と、を前記サーバ装置へ送信する送信手段と、
     を備え、
     前記サーバ装置は、
     前記コンテンツを構成する、前記映像又は前記音を示すコンテンツ情報を取得する取得手段と、
     前記端末装置から前記評価情報及び前記検出情報を受信する受信手段と、
     前記取得されたコンテンツ情報と前記受信された検出情報とを比較する比較手段と、
     前記比較手段による比較結果により、前記コンテンツ情報と前記検出情報との間に所定の一致がある場合、前記受信された評価情報を前記コンテンツの評価に用いるよう選択する選択手段と、
     を備えることを特徴とする評価システム。
    An evaluation system including a terminal device and a server device connected to the terminal device via a network, wherein
    the terminal device comprises:
    input means by which a user inputs evaluation information for content composed of at least one of video and sound;
    detection means that detects video or sound output by an output device that outputs at least one of the video and the sound constituting the content; and
    transmission means that transmits the input evaluation information and detection information indicating the detected video or sound to the server device, and
    the server device comprises:
    acquisition means that acquires content information indicating the video or the sound constituting the content;
    receiving means that receives the evaluation information and the detection information from the terminal device;
    comparison means that compares the acquired content information with the received detection information; and
    selection means that, when the comparison result by the comparison means indicates a predetermined match between the content information and the detection information, selects the received evaluation information to be used for evaluating the content.
  2.  前記コンテンツは、放送されるコンテンツであり、
     前記コンテンツの放送中に、前記端末装置として複数の端末装置それぞれから前記サーバ装置へ前記評価情報及び前記検出情報が送信され、
     前記サーバ装置は、
     前記複数の端末装置のうち少なくとも一の端末装置による前記評価情報及び前記検出情報の少なくとも何れか一方の送信タイミングが、前記複数の端末装置のうち他の少なくとも一の端末装置による送信タイミングと相違するように、前記複数の端末装置それぞれについて、前記送信タイミングを決定するタイミング決定手段と、
     前記決定された送信タイミングを示すタイミング情報を、前記複数の端末装置それぞれに送信するタイミング情報送信手段と、
     を更に備え、
     前記端末装置は、
     前記サーバ装置から前記タイミング情報を受信するタイミング情報受信手段を更に備え、
     前記送信手段は、前記受信されたタイミング情報に従って、前記評価情報及び前記検出情報の少なくとも何れか一方を送信することを特徴とする請求項1に記載の評価システム。
    2. The evaluation system according to claim 1, wherein
    the content is broadcast content,
    during the broadcast of the content, the evaluation information and the detection information are transmitted to the server device from each of a plurality of terminal devices serving as the terminal device,
    the server device further comprises:
    timing determination means that determines a transmission timing for each of the plurality of terminal devices such that the timing at which at least one of the plurality of terminal devices transmits at least one of the evaluation information and the detection information differs from the transmission timing of at least one other of the plurality of terminal devices; and
    timing information transmission means that transmits timing information indicating the determined transmission timing to each of the plurality of terminal devices,
    the terminal device further comprises timing information receiving means that receives the timing information from the server device, and
    the transmission means transmits at least one of the evaluation information and the detection information according to the received timing information.
  3.  前記タイミング情報送信手段は、前記少なくとも一の端末装置による送信タイミングの間隔の間に、前記他の少なくとも一の端末装置による送信タイミングを決定することを特徴とする請求項2に記載の評価システム。 3. The evaluation system according to claim 2, wherein the timing information transmitting unit determines a transmission timing by the at least one other terminal device during an interval between transmission timings by the at least one terminal device.
  4.  前記検出手段及び前記送信手段は、所定時間間隔を置いて繰り返し前記映像又は前記音を検出して前記検出情報を送信し、
     前記選択手段は、前記受信手段により前記所定時間間隔を置いて繰り返し受信される前記検出情報のうち、前記評価情報の入力時刻以前又は以後で前記入力時刻に最も近い時刻に前記端末装置により検出される映像又は音を示す検出情報と前記コンテンツ情報との比較結果に少なくとも基づいて、前記選択を行うことを特徴とする請求項1乃至3の何れか1項に記載の評価システム。
    4. The evaluation system according to any one of claims 1 to 3, wherein
    the detection means and the transmission means repeatedly detect the video or the sound at predetermined time intervals and transmit the detection information, and
    the selection means performs the selection based at least on a comparison result between the content information and the detection information indicating the video or sound detected by the terminal device at the time, either before or after the input time of the evaluation information, closest to that input time, among the detection information repeatedly received by the receiving means at the predetermined time intervals.
  5.  前記選択手段は、前記受信手段により所定時間間隔を置いて繰り返し受信される前記検出情報のうち、前記評価情報の入力時刻から相対的に近い時刻に検出された映像又は音をそれぞれ示す2以上の検出情報と前記コンテンツ情報との比較結果に基づいて、前記選択を行うことを特徴とする請求項4に記載の評価システム。 5. The evaluation system according to claim 4, wherein the selection means performs the selection based on a comparison result between the content information and two or more pieces of detection information, each indicating video or sound detected at a time relatively close to the input time of the evaluation information, among the detection information repeatedly received by the receiving means at the predetermined time intervals.
  6.  前記検出手段及び前記送信手段は、前記評価情報が入力されたときに前記映像又は前記音を検出して、前記評価情報とともに前記検出情報をサーバ装置へ送信し、
     前記選択手段は、前記コンテンツ情報が、前記検出情報と一致すると判定される部分を有する場合、前記検出情報とともに受信された前記評価情報を、前記コンテンツの評価に用いるよう選択することを特徴とする請求項1乃至3の何れか1項に記載の評価システム。
    6. The evaluation system according to any one of claims 1 to 3, wherein
    the detection means and the transmission means detect the video or the sound when the evaluation information is input, and transmit the detection information together with the evaluation information to the server device, and
    the selection means, when the content information has a portion determined to match the detection information, selects the evaluation information received together with the detection information to be used for evaluating the content.
  7.  前記選択手段は、前記コンテンツ情報のうち、前記検出情報と一致すると判定された前記部分により示される映像又は音の前記出力装置による出力時刻を、前記検出情報とともに受信された前記評価情報の入力時刻として特定することを特徴とする請求項6に記載の評価システム。 7. The evaluation system according to claim 6, wherein the selection means specifies the time at which the output device output the video or sound indicated by the portion of the content information determined to match the detection information as the input time of the evaluation information received together with the detection information.
  8.  前記取得手段は、前記コンテンツを構成する前記映像又は前記音の特徴を示す特徴情報の時系列で構成される特徴情報時系列データから変換された特定情報の時系列で構成される特定情報時系列データであって、各前記特定情報に基づいて対応する前記特徴情報が特定可能であり、且つ各前記特定情報の情報量は、対応する前記特徴情報の情報量よりも少ない特定情報時系列データを、前記コンテンツ情報として取得し、
     前記端末装置は、
     前記特徴情報時系列データを事前に取得する特徴情報時系列データ取得手段と、
     前記検出された映像又は音の特徴を示す特徴情報を抽出する抽出手段と、
     前記取得された特徴情報時系列データに含まれる前記特徴情報のうち、前記抽出された特徴情報との間の一致度が所定値を超える特徴情報を特定する特定情報であって、該特徴情報の情報量よりも少ない情報量の特定情報を生成する生成手段と、
     を更に備え、
     前記送信手段は、前記生成された特定情報を、前記検出情報として送信し、
     前記選択手段は、前記特徴情報時系列データに含まれる前記特定情報のうち、前記検出情報として受信された前記特定情報と一致する特定情報に対応する前記出力時刻を、前記入力時刻として特定することを特徴とする請求項7に記載の評価システム。
    8. The evaluation system according to claim 7, wherein
    the acquisition means acquires, as the content information, specific information time-series data consisting of a time series of specific information converted from feature information time-series data consisting of a time series of feature information indicating features of the video or the sound constituting the content, each piece of the specific information allowing the corresponding feature information to be identified and having an information amount smaller than that of the corresponding feature information,
    the terminal device further comprises:
    feature information time-series data acquisition means that acquires the feature information time-series data in advance;
    extraction means that extracts feature information indicating a feature of the detected video or sound; and
    generation means that generates specific information identifying, among the feature information included in the acquired feature information time-series data, feature information whose degree of coincidence with the extracted feature information exceeds a predetermined value, the generated specific information having an information amount smaller than that of the feature information,
    the transmission means transmits the generated specific information as the detection information, and
    the selection means specifies, as the input time, the output time corresponding to the piece of specific information, among the specific information included in the feature information time-series data, that matches the specific information received as the detection information.
  9.  The evaluation system according to claim 8, wherein the specific information is a hash value of the feature information.
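Claims 8 and 9 together describe a fingerprint scheme: the terminal matches a detected feature against a pre-obtained feature time series and transmits only a compact hash of the best match, which the server resolves to an output time. A hypothetical sketch (toy feature format and coincidence measure; the claims leave both open):

```python
import hashlib

def fingerprint(feature: bytes) -> str:
    # The hash shrinks the feature to a fixed, small identifier (claim 9),
    # so the transmitted detection information is smaller than the feature.
    return hashlib.sha256(feature).hexdigest()[:16]

# Server side: specific-information time series derived in advance from
# the content's feature-information time series (claim 8).
feature_series = [(0.0, b"feat-0"), (5.0, b"feat-1"), (10.0, b"feat-2")]
specific_series = {fingerprint(f): t for t, f in feature_series}

def best_match_fingerprint(detected: bytes, series) -> str:
    """Terminal side: find the pre-obtained feature with the highest
    coincidence to the detected one, and return only its hash."""
    def coincidence(a: bytes, b: bytes) -> float:
        # Toy measure: fraction of equal bytes; a real system would
        # compare spectral or visual features.
        return sum(x == y for x, y in zip(a, b)) / max(len(a), len(b))
    _, feat = max(series, key=lambda tf: coincidence(detected, tf[1]))
    return fingerprint(feat)

detection = best_match_fingerprint(b"feat-1", feature_series)
print(specific_series[detection])  # 5.0 — input time of the evaluation
```

Transmitting the 16-character hash instead of the raw feature is what realizes the claim's "smaller data size" requirement.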
  10.  A server device comprising:
     acquisition means for acquiring content information indicating the video or the sound constituting content composed of at least one of video and sound;
     reception means for receiving, from a terminal device into which a user inputs evaluation information for the content and which detects video or sound output by an output device that outputs at least one of the video and the sound constituting the content, the evaluation information and detection information indicating the detected video or sound;
     comparison means for comparing the acquired content information with the received detection information; and
     selection means for selecting the received evaluation information for use in evaluating the content when the comparison result by the comparison means indicates a predetermined match between the content information and the detection information.
  11.  A terminal device comprising:
     input means by which a user inputs evaluation information for content composed of at least one of video and sound;
     detection means for detecting video or sound output by an output device that outputs at least one of the video and the sound constituting the content; and
     transmission means for transmitting the input evaluation information and detection information indicating the detected video or sound to a server device that, when a comparison between content information indicating at least one of the video and the sound constituting the content and the detection information indicates a predetermined match between them, selects the evaluation information for use in evaluating the content.
  12.  An information processing method in an evaluation system including a terminal device and a server device connected to the terminal device via a network, the method comprising:
     an acquisition step in which the server device acquires content information indicating the video or the sound constituting content composed of at least one of video and sound;
     an evaluation information acquisition step in which the terminal device acquires evaluation information for the content input by a user to input means of the terminal device;
     a detection step in which the terminal device detects video or sound output by an output device that outputs at least one of the video and the sound constituting the content;
     a transmission step in which the terminal device transmits the acquired evaluation information and detection information indicating the detected video or sound to the server device;
     a reception step in which the server device receives the evaluation information and the detection information from the terminal device;
     a comparison step in which the server device compares the acquired content information with the received detection information; and
     a selection step in which the server device selects the received evaluation information for use in evaluating the content when the result of the comparison step indicates a predetermined match between the content information and the detection information.
  13.  An information processing method executed by a computer of a server device, the method comprising:
     an acquisition step of acquiring content information indicating the video or the sound constituting content composed of at least one of video and sound;
     a reception step of receiving, from a terminal device into which a user inputs evaluation information for the content and which detects video or sound output by an output device that outputs at least one of the video and the sound constituting the content, the evaluation information and detection information indicating the detected video or sound;
     a comparison step of comparing the acquired content information with the received detection information; and
     a selection step of selecting the received evaluation information for use in evaluating the content when the result of the comparison step indicates a predetermined match between the content information and the detection information.
  14.  An information processing method executed by a computer of a terminal device, the method comprising:
     an evaluation information acquisition step of acquiring evaluation information, input by a user to input means of the terminal device, for content composed of at least one of video and sound;
     a detection step of detecting video or sound output by an output device that outputs at least one of the video and the sound constituting the content; and
     a transmission step of transmitting the acquired evaluation information and detection information indicating the detected video or sound to a server device that, when a comparison between content information indicating at least one of the video and the sound constituting the content and the detection information indicates a predetermined match between them, selects the evaluation information for use in evaluating the content.
  15.  An information processing program causing a computer of a server device to function as:
     acquisition means for acquiring content information indicating the video or the sound constituting content composed of at least one of video and sound;
     reception means for receiving, from a terminal device into which a user inputs evaluation information for the content and which detects video or sound output by an output device that outputs at least one of the video and the sound constituting the content, the evaluation information and detection information indicating the detected video or sound;
     comparison means for comparing the acquired content information with the received detection information; and
     selection means for selecting the received evaluation information for use in evaluating the content when the comparison result by the comparison means indicates a predetermined match between the content information and the detection information.
  16.  An information processing program causing a computer of a terminal device to function as:
     evaluation information acquisition means for acquiring evaluation information, input by a user to input means of the terminal device, for content composed of at least one of video and sound;
     detection information acquisition means for acquiring detection information indicating the detected video or sound from detection means that detects video or sound output by an output device that outputs at least one of the video and the sound constituting the content; and
     transmission means for transmitting the acquired evaluation information and the acquired detection information to a server device that, when a comparison between content information indicating at least one of the video and the sound constituting the content and the detection information indicates a predetermined match between them, selects the evaluation information for use in evaluating the content.
PCT/JP2018/034671 2018-09-19 2018-09-19 Evaluation system, server device, terminal device, information processing method, and information processing program WO2020059047A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/JP2018/034671 WO2020059047A1 (en) 2018-09-19 2018-09-19 Evaluation system, server device, terminal device, information processing method, and information processing program
JP2019503504A JP6543429B1 (en) 2018-09-19 2018-09-19 Evaluation system, server device, terminal device, information processing method, and information processing program

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2018/034671 WO2020059047A1 (en) 2018-09-19 2018-09-19 Evaluation system, server device, terminal device, information processing method, and information processing program

Publications (1)

Publication Number Publication Date
WO2020059047A1 true WO2020059047A1 (en) 2020-03-26

Family

ID=67212255

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2018/034671 WO2020059047A1 (en) 2018-09-19 2018-09-19 Evaluation system, server device, terminal device, information processing method, and information processing program

Country Status (2)

Country Link
JP (1) JP6543429B1 (en)
WO (1) WO2020059047A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002271824A (en) * 2001-03-13 2002-09-20 Ricoh Co Ltd Program evaluation acquisition system
WO2012043735A1 (en) * 2010-09-30 2012-04-05 楽天株式会社 Server device for collecting survey responses
JP2013197716A (en) * 2012-03-16 2013-09-30 Nec Corp Transmission control device, feedback information transmitter, cooperative transmission system, transmission control method and feedback information transmission system and program

Also Published As

Publication number Publication date
JPWO2020059047A1 (en) 2020-12-17
JP6543429B1 (en) 2019-07-10

Similar Documents

Publication Publication Date Title
US11659220B2 (en) System and method for surveying broadcasting ratings
KR101818986B1 (en) Method, device, and system for obtaining information based on audio input
US9942711B2 (en) Apparatus and method for determining co-location of services using a device that generates an audio signal
CN105229629B (en) For estimating the method to the user interest of media content, electronic equipment and medium
US8578415B2 (en) Information providing method, content display terminal, portable terminal, server device, information providing system and program
US10587921B2 (en) Viewer rating calculation server, method for calculating viewer rating, and viewer rating calculation remote apparatus
KR102166423B1 (en) Display device, server and method of controlling the display device
US20140075465A1 (en) Time varying evaluation of multimedia content
KR20150104358A (en) Server apparatus and method for transmitting finger print and broadcasting receiving apparatus
CN111405363B (en) Method and device for identifying current user of set top box in home network
JP6433615B1 (en) Viewing record analysis apparatus, viewing record analysis method, and viewing record analysis program
US20180260483A1 (en) Information processing apparatus, information processing method, and program
US20150026744A1 (en) Display system, display apparatus, display method, and program
US20130132996A1 (en) System and method for displaying product information about advertisement on digital television, and recording medium thereof
WO2020059047A1 (en) Evaluation system, server device, terminal device, information processing method, and information processing program
JPWO2010119834A1 (en) Content URL notification system
CN112235592B (en) Live broadcast method, live broadcast processing method, device and computer equipment
JP6567715B2 (en) Information processing apparatus, information processing method, and program
JP2020166791A (en) Advertisement distribution system
JP2020048079A (en) Viewing record analyzer and viewing record analyzing method
JP2017092664A (en) Program analysis device and program analysis method
JP2019220826A (en) View recording analyzer, view recording analytical method, and view recording analytical method
CN112153469A (en) Multimedia resource playing method, device, terminal, server and storage medium
JP2020166788A (en) Advertisement distribution system

Legal Events

Date Code Title Description
ENP Entry into the national phase

Ref document number: 2019503504

Country of ref document: JP

Kind code of ref document: A

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18934012

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18934012

Country of ref document: EP

Kind code of ref document: A1