WO2017038480A1

WO2017038480A1 - Content processing device, content processing method, and program

Info

Publication number: WO2017038480A1
Application number: PCT/JP2016/074104
Authority: WO
Inventors: 真一長尾
Original assignee: ソニーセミコンダクタソリューションズ株式会社
Priority date: 2015-09-01
Filing date: 2016-08-18
Publication date: 2017-03-09
Also published as: JP2017050677A

Abstract

This technology relates to a content processing device, a content processing method, and a program that can prevent a user viewing content from feeling uncomfortable. A timing control unit controls the output timing at which content is output according to a detection result of the user's line of sight. This technology is applicable, for example, to a television receiver (TV) that displays content.

Description

Content processing apparatus, content processing method, and program

The present technology relates to a content processing device, a content processing method, and a program, and in particular to a content processing device, a content processing method, and a program that can, for example, prevent a user who views content from feeling uncomfortable.

For example, a content processing apparatus that processes MPEG-encoded content may skip the output of a target frame, which is the frame to be output from among the video frames included in the content, or may output the target frame repeatedly.

Here, skipping the output of the target frame is also called skipping, and repeating the output of the target frame is also called repeating.

For example, Patent Document 1 proposes an MPEG decoder that displays a video frame at an appropriate timing by detecting a shift in the output timing of the video frame with respect to the system clock and performing skipping or repeating.

Patent Document 1: JP 2007-235986

When video frame output is skipped or repeated, the video displayed on the display device shows jerky, awkward motion, which makes the user who views the video feel uncomfortable.

The present technology has been made in view of such a situation, and can prevent a user who views content from feeling uncomfortable.

The content processing apparatus of the present technology is a content processing apparatus including a timing control unit that controls an output timing for outputting content according to a detection result of a user's line of sight. The program of the present technology is a program for causing a computer to function as such a content processing apparatus.

The content processing method of the present technology is a content processing method including controlling an output timing of outputting content according to a detection result of a user's line of sight.

In the content processing apparatus, the content processing method, and the program of the present technology, the output timing for outputting the content is controlled according to the detection result of the user's line of sight.

The content processing device may be an independent device, or may be an internal block constituting one device.

Further, the program can be provided by being transmitted through a transmission medium or by being recorded on a recording medium.

According to the present technology, it is possible to prevent the user who views the content from feeling uncomfortable.

It should be noted that the effects described here are not necessarily limiting, and the effect obtained may be any of the effects described in the present disclosure.

FIG. 1 is a block diagram illustrating a configuration example of an embodiment of a content processing apparatus to which the present technology is applied.

FIG. 2 is a flowchart for explaining an example of processing of the timing control unit 24 when skip and repeat control is performed so as to prevent overflow and underflow of the video ES buffer 13.

FIG. 3 is a flowchart for explaining an example of processing of the timing control unit 24 when skip and repeat control is performed so that the difference between the output times of the video and audio to be output simultaneously becomes small.

FIG. 18 is a block diagram illustrating a configuration example of an embodiment of a computer to which the present technology is applied.

FIG. 1 is a block diagram illustrating a configuration example of an embodiment of a content processing apparatus to which the present technology is applied.

The content processing apparatus of FIG. 1 can be applied to a display device capable of displaying video, such as a TV (television receiver), a glasses-type wearable device, or a smartphone.

In FIG. 1, the content processing apparatus includes a TS (Transport Stream) buffer 11, a DEMUX 12, a video ES (Elementary Stream) buffer 13, an audio ES buffer 14, a caption buffer 15, a video decoder 16, an audio decoder 17, a caption decoder 18, buffers 19, 20, and 21, a line-of-sight detection unit 22, a buffer accumulation amount monitoring unit 23, and a timing control unit 24.

The TS buffer 11 is supplied with a content stream including video, audio, and, as needed, captions from, for example, a tuner, a stream server on a network such as the Internet, or a playback device that plays back storage media (for example, a Blu-ray (registered trademark) disc, a DVD (Digital Versatile Disc), a hard disk, or a memory card).

The TS buffer 11 temporarily stores the content stream supplied thereto and supplies it to the DEMUX 12.

Here, for example, an MPEG (Moving Picture Experts Group) stream can be adopted as the content stream. Examples of MPEG streams include an MPEG1 stream, MPEG2-PS (Program Stream), and MPEG2-TS.

The DEMUX 12 demultiplexes the content stream from the TS buffer 11 into video, audio, caption, etc. (data) as necessary. Further, the DEMUX 12 supplies video, audio, and caption obtained by demultiplexing to the video ES buffer 13, audio ES buffer 14, and caption buffer 15, respectively.

That is, the DEMUX 12 includes a demultiplexing unit 31, a video PES (Packetized Elementary Stream) buffer 32, an audio PES buffer 33, a caption PES buffer 34, PES header detection units 35, 36, and 37, and a time information extraction unit 38.

The demultiplexing unit 31 separates video, audio, captions, and the like from the content stream supplied from the TS buffer 11 to the DEMUX 12.

When the content stream is, for example, an MPEG1 stream or MPEG2-PS, the content stream is separated into a video PES, an audio PES, and the like. When the content stream is, for example, MPEG2-TS, the content stream is separated into a video PES, an audio PES, a caption PES, a section, and the like.

The video PES, audio PES, and caption PES separated from the content stream in the demultiplexing unit 31 are supplied to the video PES buffer 32, the audio PES buffer 33, and the caption PES buffer 34, respectively.

The video PES buffer 32 temporarily stores the video PES supplied from the demultiplexing unit 31. The audio PES buffer 33 temporarily stores the audio PES supplied from the demultiplexing unit 31. The caption PES buffer 34 temporarily stores the caption PES supplied from the demultiplexing unit 31.

The PES header detection unit 35 sequentially reads video PES from the video PES buffer 32 and analyzes the PES header of the video PES to extract necessary information. Further, the PES header detection unit 35 extracts the video ES that is data of the video PES from the video PES read from the video PES buffer 32 and supplies the video ES to the video ES buffer 13.

The PES header detection unit 36 sequentially reads out the audio PES from the audio PES buffer 33 and analyzes the PES header of the audio PES, thereby extracting necessary information. Further, the PES header detection unit 36 extracts the audio ES that is data of the audio PES from the audio PES read from the audio PES buffer 33 and supplies the audio ES to the audio ES buffer 14.

The PES header detection unit 37 sequentially reads the caption PES from the caption PES buffer 34 and analyzes the PES header of the caption PES to extract necessary information. Further, the PES header detection unit 37 extracts a caption, which is the data of the caption PES, from the caption PES read from the caption PES buffer 34 and supplies the caption to the caption buffer 15.

The time information extraction unit 38 extracts time information (time stamps such as the PCR (Program Clock Reference) and the PTS (Presentation Time Stamp)) for outputting (displaying) video, audio, and captions from the information extracted from the PES headers by the PES header detection units 35 to 37, and supplies the time information to the timing control unit 24.
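As a rough illustration of how a PTS can be read out of a PES header, the following Python sketch parses the fixed part of an MPEG-2 PES packet header and decodes the 33-bit PTS field. The function names `encode_pts`, `decode_pts`, and `parse_pes` are hypothetical, and the sketch assumes a video or audio PES packet that carries the optional header fields. It is not presented as the implementation of the PES header detection units 35 to 37 or the time information extraction unit 38.

```python
def encode_pts(pts: int) -> bytes:
    """Pack a 33-bit PTS into the 5-byte PES field (PTS-only prefix '0010')."""
    return bytes([
        0x20 | (((pts >> 30) & 0x07) << 1) | 1,  # prefix, PTS[32..30], marker
        (pts >> 22) & 0xFF,                      # PTS[29..22]
        (((pts >> 15) & 0x7F) << 1) | 1,         # PTS[21..15], marker
        (pts >> 7) & 0xFF,                       # PTS[14..7]
        ((pts & 0x7F) << 1) | 1,                 # PTS[6..0], marker
    ])


def decode_pts(field: bytes) -> int:
    """Inverse of encode_pts: recover the 33-bit PTS (90 kHz ticks)."""
    return (
        (((field[0] >> 1) & 0x07) << 30)
        | (field[1] << 22)
        | ((field[2] >> 1) << 15)
        | (field[3] << 7)
        | (field[4] >> 1)
    )


def parse_pes(packet: bytes):
    """Return (pts or None, elementary-stream payload) for one PES packet."""
    if packet[:3] != b"\x00\x00\x01":
        raise ValueError("missing PES start code prefix")
    has_pts = bool(packet[7] & 0x80)   # upper bit of PTS_DTS_flags
    header_len = packet[8]             # PES_header_data_length
    pts = decode_pts(packet[9:14]) if has_pts else None
    return pts, packet[9 + header_len:]


# Hypothetical video PES packet: stream_id 0xE0, PTS of 1 second (90000 ticks).
payload = b"ES-bytes"
header = b"\x00\x00\x01\xE0" + (3 + 5 + len(payload)).to_bytes(2, "big")
packet = header + b"\x80\x80\x05" + encode_pts(90000) + payload
pts, es = parse_pes(packet)
```

Here `pts` corresponds to the time information handed to the timing control unit 24, and `es` to the elementary stream data handed to the ES buffer.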

The video ES buffer 13 functions as a video buffer that stores the video ES supplied from the PES header detection unit 35. The audio ES buffer 14 stores the audio ES supplied from the PES header detection unit 36. The caption buffer 15 stores the caption supplied from the PES header detection unit 37.

The video decoder 16 reads and decodes the video ES stored in the video ES buffer 13 and supplies a video frame obtained by the decoding to the buffer 19.

The audio decoder 17 reads out and decodes the audio ES stored in the audio ES buffer 14 and supplies the audio obtained by the decoding to the buffer 20.

The caption decoder 18 reads and decodes the caption stored in the caption buffer 15, and supplies the caption obtained by the decoding to the buffer 21.

The buffer 19 stores video frames supplied from the video decoder 16. The video frames stored in the buffer 19 are read out under the control of the timing control unit 24, supplied to a display device (not shown), and displayed.

The buffer 20 stores the audio supplied from the audio decoder 17. The audio stored in the buffer 20 is read according to the control of the timing control unit 24, supplied to an audio output device (not shown) such as a speaker or headphones, and output as sound.

The buffer 21 stores the caption supplied from the caption decoder 18. The caption stored in the buffer 21 is read according to the control of the timing control unit 24, supplied to a display device on which video is displayed, and displayed.

The line-of-sight detection unit 22 detects the line of sight of a user who views a video or the like displayed on the display device, and supplies (notifies) detection result information representing the detection result of the line of sight to the timing control unit 24.

The line-of-sight detection unit 22 can be configured from, for example, a camera that captures an image of the user and an image processing device that detects the user's line of sight by performing image processing on the image captured by the camera and identifies whether the line of sight is directed to the display device on which the video is displayed.

When the content processing apparatus of FIG. 1 is applied to, for example, a smartphone, a camera mounted on the smartphone can be used as the camera constituting the line-of-sight detection unit 22. Furthermore, when the content processing apparatus of FIG. 1 is applied to, for example, a glasses-type wearable device that is equipped with a sensor that detects the user's line of sight, that sensor can be used as the line-of-sight detection unit 22.

Note that if the timing control unit 24 can acquire the detection result information by some other method, it is not necessary to provide the line-of-sight detection unit 22 in the content processing apparatus of FIG. 1.

That is, for example, when the content processing apparatus of FIG. 1 is applied to a TV, and the detection result information is obtained by photographing the user with a surveillance camera installed in the room where the TV is placed and is provided to the timing control unit 24, it is not necessary to provide the line-of-sight detection unit 22 in the content processing apparatus.

Here, the detection result information that the line-of-sight detection unit 22 supplies to the timing control unit 24 indicates, for example, whether the user's line of sight is directed to the display device on which the video is displayed, that is, whether the user is watching the video displayed on the display device.

The buffer accumulation amount monitoring unit 23 monitors the buffer accumulation amount (video ES) of the video ES buffer 13 and supplies it to the timing control unit 24.

The timing control unit 24 controls an output timing for outputting (reading) video from the buffer 19, an output timing for outputting audio from the buffer 20, and an output timing for outputting captions from the buffer 21.

That is, the timing control unit 24 controls the output timing according to the time information supplied from the time information extraction unit 38. Furthermore, the timing control unit 24 controls the output timing according to the detection result information supplied from the line-of-sight detection unit 22 and the buffer accumulation amount supplied from the buffer accumulation amount monitoring unit 23 as necessary.

For example, the timing control unit 24 uses the video time information supplied from the time information extraction unit 38 to identify, among the video frames stored in the buffer 19, the frame with the earliest output time represented by the time information (the frame that precedes the others in time) as the frame of interest to be displayed next.
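The selection of the frame of interest can be sketched in Python as a minimum over output times. The `Frame` class and the `frame_of_interest` function are hypothetical names introduced only for illustration, and the sketch ignores time stamp wraparound.

```python
from dataclasses import dataclass


@dataclass
class Frame:
    pts: int      # output time from the time information (90 kHz ticks)
    label: str    # stand-in for the decoded picture data


def frame_of_interest(buffer_19):
    """Pick the stored frame with the earliest output time, or None if empty."""
    return min(buffer_19, key=lambda f: f.pts, default=None)


# Decoded frames may sit in the buffer out of presentation order.
buffer_19 = [Frame(6000, "B"), Frame(3000, "A"), Frame(9000, "C")]
target = frame_of_interest(buffer_19)
```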

The timing control unit 24 controls the output timing of the frame of interest according to the detection result information from the line-of-sight detection unit 22 and the buffer accumulation amount from the buffer accumulation amount monitoring unit 23.

As control of the output timing of the frame of interest by the timing control unit 24, adjustment of the time at which the frame of interest is output from the buffer 19, and skip and repeat control of output of the frame of interest are performed.

In skipping the output of the target frame, the output of the target frame is skipped, and the frame to be output immediately after the target frame is output from the buffer 19 as a new target frame.

Alternatively, in skipping the output of the target frame, one new frame is generated by combining a plurality of frames, namely the target frame and one or more frames to be output after the target frame, and that one frame is output from the buffer 19 as a new frame of interest.

In the present embodiment, as described above, there are two methods of skipping the output of the target frame: a method of simply skipping the output of the target frame, and a method of generating a new target frame from a plurality of frames including the target frame.

In the repeat of the output of the target frame, after the frame specified as the target frame is output from the buffer 19, the frame is again set as the target frame and output from the buffer 19.
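The two skip methods and the repeat operation can be sketched as operations on a queue of decoded frames. In this Python sketch the function names are hypothetical, a frame is modeled as a flat list of pixel values, and per-pixel averaging stands in for "combining"; the source does not specify how frames are combined.

```python
from collections import deque


def simple_skip(frames: deque):
    """Drop the frame of interest and output the frame that follows it."""
    frames.popleft()
    return frames.popleft()


def combining_skip(frames: deque, count: int = 2):
    """Merge `count` frames into one by per-pixel averaging and output it."""
    merged = [frames.popleft() for _ in range(count)]
    return [sum(px) // count for px in zip(*merged)]


def repeat(frames: deque):
    """Output the frame of interest but keep it queued so it is output again."""
    return frames[0]


frames = deque([[0, 0], [10, 20], [30, 40], [50, 60]])
first = repeat(frames)           # [0, 0] is output and stays in the queue
second = simple_skip(frames)     # [0, 0] is dropped, [10, 20] is output
third = combining_skip(frames)   # [30, 40] and [50, 60] are averaged
```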

The timing control unit 24 controls skip and repeat according to the detection result information from the line-of-sight detection unit 22 and the buffer accumulation amount from the buffer accumulation amount monitoring unit 23.

That is, when the buffer accumulation amount is equal to or greater than (or larger than) the skip threshold for skipping, the timing control unit 24 performs skip control. In this case, since the output of the frame is skipped when the buffer accumulation amount is equal to or greater than the skip threshold, the overflow of the video ES buffer 13 can be prevented.

Further, when the buffer accumulation amount is equal to or less than (or less than) the repeat threshold value for performing repeat, the timing control unit 24 performs repeat control. In this case, since the output of the frame is repeated when the buffer accumulation amount is equal to or less than the repeat threshold, underflow of the video ES buffer 13 can be prevented.

Here, the skip threshold and the repeat threshold satisfy the relationship skip threshold > repeat threshold.
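The threshold comparison described above can be sketched as follows. The function name `decide` and the representation of the buffer accumulation amount as a fraction of buffer capacity are assumptions made for illustration, as are the example values 0.9 and 0.1.

```python
def decide(fill: float, skip_threshold: float, repeat_threshold: float) -> str:
    """Return 'skip', 'repeat', or 'output' for one frame of interest.

    `fill` and both thresholds are fractions of the video ES buffer capacity.
    """
    if not skip_threshold > repeat_threshold:
        raise ValueError("skip threshold must exceed repeat threshold")
    if fill <= repeat_threshold:
        return "repeat"   # accumulation is low: repeat to guard against underflow
    if fill >= skip_threshold:
        return "skip"     # accumulation is high: skip to guard against overflow
    return "output"       # between the thresholds: output normally


action = decide(0.95, skip_threshold=0.9, repeat_threshold=0.1)
```

Requiring the skip threshold to exceed the repeat threshold leaves a band of accumulation amounts in which frames are output without either correction.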

As described above, the timing control unit 24 performs skip and repeat control according to the buffer accumulation amount. However, when skip or repeat is performed, the video displayed on the display device becomes a jerky motion video. Therefore, the user who views the video may feel uncomfortable.

Therefore, the timing control unit 24 performs skip and repeat control according to the detection result information from the line-of-sight detection unit 22 so that skip or repeat is more readily performed while the user is not watching the video displayed on the display device.

Specifically, for example, when the detection result information indicates that the user is watching the video, the timing control unit 24 sets the skip threshold value and the repeat threshold value to default values.

On the other hand, when the detection result information indicates that the user is not watching the video, the timing control unit 24 sets the skip threshold to a value smaller than the default value and sets the repeat threshold to a value larger than the default value.

As described above, when the detection result information indicates that the user does not watch the video, the skip threshold is set smaller than the default value, and the repeat threshold is set larger than the default value. Thus, when the user is not watching the video, it becomes easy to skip or repeat.

As a result, in a situation where skip or repeat would be likely to occur in the near future if the skip threshold and the repeat threshold were left at their default values, the skip or repeat is performed while the user is not watching the video.

This makes it possible to prevent the user from feeling uncomfortable by watching the video with awkward movements when skipping or repeating is performed when the user is watching the video.

That is, it is possible to prevent the user from feeling uncomfortable by controlling the output timing of outputting the content (video) according to the detection result of the user's line of sight.

Here, for example, for content that includes video with a frame rate that does not match the display rate of the display device that displays the video, or content that includes video that is output at an output rate that does not match the video transmission rate, overflow and underflow of the video ES buffer 13 are likely to occur, and skipping and repeating are therefore likely to be performed.

Therefore, controlling the output timing of outputting the content (video) according to the detection result of the user's line of sight is particularly useful for the content as described above.
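To give a feel for how often a rate mismatch forces a skip, the following Python sketch works through one example. The rates (60 frames per second of content shown on a display running at 60000/1001 Hz, about 59.94 Hz) are assumed values chosen for illustration and do not come from the source.

```python
from fractions import Fraction

content_fps = Fraction(60)            # assumed content frame rate
display_hz = Fraction(60000, 1001)    # assumed display rate (about 59.94 Hz)

# Frames arriving each second that the display has no slot for.
surplus_per_second = content_fps - display_hz

# Average time between skips needed to keep the buffer from growing.
seconds_per_skip = 1 / surplus_per_second
```

Under these assumed rates, `seconds_per_skip` is 1001/60, so roughly one frame has to be skipped every 16.7 seconds. If the rates are reversed, the same interval applies to repeats.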

In addition, for example, for content in which video and audio that should be output simultaneously are output out of sync (including content in which the video and audio gradually drift apart during playback (decoding)), the timing control unit 24 can control the video output timing, that is, perform skip and repeat control, in accordance with the time information from the time information extraction unit 38 so that the difference between the output times of the video and audio to be output simultaneously is reduced.

Even when skip and repeat are performed in this way for content in which the video and audio that should be output simultaneously are output out of sync, the user feels uncomfortable on seeing video that has become jerky because of the skip or repeat.

Therefore, for content in which the video and audio that should be output simultaneously are output out of sync as well, controlling the output timing for outputting the content according to the detection result of the user's line of sight can prevent the user from feeling uncomfortable.

Note that when the difference between the output times of the video and audio to be output simultaneously is equal to or greater than (or greater than) a predetermined threshold, skip and repeat control can be performed so that the difference becomes smaller, thereby bringing the difference below the predetermined threshold.

The predetermined threshold may be a value that satisfies a content stream standard (for example, MPEG) or a value that does not.

In the content processing apparatus configured as described above, the TS buffer 11 temporarily stores the content stream supplied thereto and supplies it to the DEMUX 12.

The DEMUX 12 demultiplexes the content stream from the TS buffer 11 into video, audio, caption, etc. (data) as necessary. Further, the DEMUX 12 supplies the video ES, the audio ES, and the caption obtained by the demultiplexing to the video ES buffer 13, the audio ES buffer 14, and the caption buffer 15, respectively, and stores them.

The DEMUX 12 (the time information extracting unit 38) extracts time information from the content stream from the TS buffer 11 and supplies the time information to the timing control unit 24.

The video decoder 16 reads and decodes the video ES stored in the video ES buffer 13, and supplies the video frame obtained by the decoding to the buffer 19 for storage.

The audio decoder 17 reads and decodes the audio ES stored in the audio ES buffer 14 and supplies the audio obtained by the decoding to the buffer 20 for storage.

The caption decoder 18 reads and decodes the caption stored in the caption buffer 15, and supplies the caption obtained by the decoding to the buffer 21 for storage.

The video frames stored in the buffer 19 and the captions stored in the buffer 21 are read according to the control of the timing control unit 24, supplied to the display device, and displayed.

The audio stored in the buffer 20 is read according to the control of the timing control unit 24, supplied to the audio output device, and emitted.

On the other hand, the line-of-sight detection unit 22 detects the user's line of sight, and supplies detection result information representing the detection result of the line of sight to the timing control unit 24.

Also, the buffer accumulation amount monitoring unit 23 monitors the buffer accumulation amount of the video ES buffer 13 and supplies it to the timing control unit 24.

The timing control unit 24 controls the output timing of the video, audio, and captions from the buffers 19 to 21, respectively, according to the time information supplied from the DEMUX 12 (the time information extraction unit 38), the detection result information supplied from the line-of-sight detection unit 22, and the buffer accumulation amount supplied from the buffer accumulation amount monitoring unit 23.

The output timing control by the timing control unit 24 includes skip and repeat control.

Skip and repeat control can be performed either so as to prevent overflow and underflow of the video ES buffer 13 or so as to reduce the difference between the output times of the video and audio to be output simultaneously. The control can also be performed so as to achieve both of these aims together.

Hereinafter, the processing of the timing control unit 24 will be described for the case where skip and repeat control is performed so as to prevent overflow and underflow of the video ES buffer 13, and for the case where it is performed so as to reduce the difference between the output times of the video and audio to be output simultaneously.

Here, in the content processing apparatus shown in FIG. 1, a plurality of each of the video ES buffer 13 through the caption buffer 15 and of the video PES buffer 32 through the caption PES buffer 34 can be provided.

FIG. 2 is a flowchart for explaining an example of processing of the timing control unit 24 when skip and repeat control is performed so as to prevent overflow and underflow of the video ES buffer 13.

In step S11, the timing control unit 24 acquires the buffer accumulation amount of the video ES buffer 13 supplied from the buffer accumulation amount monitoring unit 23, and the process proceeds to step S12.

In step S12, the timing control unit 24 determines whether or not there is a notification that the user is not watching the video, that is, whether or not the detection result information supplied from the line-of-sight detection unit 22 indicates that the user is not watching the video.

If it is determined in step S12 that the detection result information does not indicate that the user does not watch the video, that is, if the user is watching the video, the process proceeds to step S13. In step S13, the timing control unit 24 sets each of the skip threshold and the repeat threshold to default values, and the process proceeds to step S14.

Here, in FIG. 2, the default value of the skip threshold is, for example, 90% of the storage capacity of the video ES buffer 13 (hereinafter also referred to as the buffer capacity), and the default value of the repeat threshold is, for example, 10% of the buffer capacity.

In step S14, the timing control unit 24 determines whether or not the buffer accumulation amount is equal to or less than the repeat threshold value.

When it is determined in step S14 that the buffer accumulation amount is equal to or less than the repeat threshold, that is, in this case, when the buffer accumulation amount is 10% or less of the buffer capacity, which is the default value of the repeat threshold, and underflow of the video ES buffer 13 may occur, the process proceeds to step S15.

In step S15, the timing control unit 24 performs repeat control of the output of the frame of interest from the buffer 19, and the process ends.

The frame of interest is repeatedly output from the buffer 19 by the repeat control of the output of the frame of interest. Thereby, the underflow of the video ES buffer 13 is prevented.

If it is determined in step S14 that the buffer accumulation amount is not equal to or less than the repeat threshold, the process proceeds to step S16, and the timing control unit 24 determines whether the buffer accumulation amount is equal to or greater than the skip threshold.

When it is determined in step S16 that the buffer accumulation amount is equal to or greater than the skip threshold, that is, in this case, when the buffer accumulation amount is 90% or more of the buffer capacity, which is the default value of the skip threshold, and overflow of the video ES buffer 13 may occur, the process proceeds to step S17.

In step S17, the timing control unit 24 performs control of skipping output of the frame of interest from the buffer 19, and the process ends.

By controlling the skip of the output of the target frame, for example, the output of the target frame from the buffer 19 is skipped, and the frame to be displayed next to the target frame is output. Thereby, overflow of the video ES buffer 13 is prevented.

When it is determined in step S16 that the buffer accumulation amount is not equal to or greater than the skip threshold, that is, in this case, when the buffer accumulation amount is neither 90% or more of the buffer capacity, which is the default value of the skip threshold, nor 10% or less of the buffer capacity, which is the default value of the repeat threshold, and overflow and underflow of the video ES buffer 13 are therefore unlikely, the process proceeds to step S18.

In step S18, the timing control unit 24 outputs the frame of interest from the buffer 19, and the process ends.

On the other hand, if it is determined in step S12 that the detection result information indicates that the user is not watching the video, the process proceeds to step S19, and the timing control unit 24 sets each of the skip threshold and the repeat threshold so that skip and repeat are more readily performed.

That is, the timing control unit 24 sets the skip threshold to a value smaller than the default value of 90% of the buffer capacity, for example, 80% of the buffer capacity. Furthermore, the timing control unit 24 sets the repeat threshold to a value larger than the default value of 10% of the buffer capacity, for example, 20% of the buffer capacity.

After each of the skip threshold and the repeat threshold is set in step S19 so that skip and repeat are more readily performed, the process proceeds to step S14, and the same processing as described above is performed.

In this case, however, because the user is not watching the video, the skip threshold and the repeat threshold have been set so that skip and repeat are more readily performed.

Therefore, it becomes easy to perform the repeat control of the output of the target frame in step S15 and the skip control of the output of the target frame in step S17.

As a result, in a situation where skip or repeat would be likely to occur in the near future if the skip threshold and the repeat threshold were left at their default values, the skip or repeat is performed ahead of time, provided that the user is not watching the video.

Therefore, since the user does not notice the awkward movement of the video caused by skipping or repeating, the user can be prevented from feeling uncomfortable by watching the video with the awkward movement.

Note that the processing according to the flowchart of FIG. 2 is repeated.
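One pass through the FIG. 2 flow can be sketched in Python as below. The function name `fig2_pass` and the constant names are hypothetical, the buffer accumulation amount is assumed to be expressed as a fraction of the buffer capacity, and the threshold values are the example values given in the text. This is a minimal sketch of the decision order, not a definitive implementation of the timing control unit 24.

```python
DEFAULT_SKIP, DEFAULT_REPEAT = 0.90, 0.10   # example defaults given for FIG. 2
RELAXED_SKIP, RELAXED_REPEAT = 0.80, 0.20   # example values given for step S19


def fig2_pass(fill: float, user_watching: bool) -> str:
    """One pass of the FIG. 2 flow; returns 'repeat', 'skip', or 'output'."""
    # S11: `fill` is the buffer accumulation amount as a fraction of capacity.
    if user_watching:                        # S12 -> S13: default thresholds
        skip_th, repeat_th = DEFAULT_SKIP, DEFAULT_REPEAT
    else:                                    # S12 -> S19: relaxed thresholds
        skip_th, repeat_th = RELAXED_SKIP, RELAXED_REPEAT
    if fill <= repeat_th:                    # S14 -> S15
        return "repeat"
    if fill >= skip_th:                      # S16 -> S17
        return "skip"
    return "output"                          # S18


watching = fig2_pass(0.85, user_watching=True)
not_watching = fig2_pass(0.85, user_watching=False)
```

With the buffer 85% full, the frame is output normally while the user is watching, but is skipped while the user is looking away, which is the "ahead of time" behavior described above.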

FIG. 3 is a flowchart illustrating an example of processing of the timing control unit 24 when the skip and repeat control is performed so that the difference between the output times of the video and audio to be output simultaneously becomes small.

In step S31, the timing control unit 24 calculates the difference between the output (display) time indicated by the time information of the frame of interest stored in the buffer 19 and the output time indicated by the time information of the audio frame of interest stored in the buffer 20 (hereinafter also referred to as the output time difference), and the process proceeds to step S32.

Here, the audio frame of interest means the audio frame to be output next among the audio frames stored in the buffer 20 (the audio frame with the earliest output time indicated by the time information).

In step S32, the timing control unit 24 determines whether the output time difference is equal to or greater than a predetermined threshold value.

If it is determined in step S32 that the output time difference is equal to or greater than the predetermined threshold, the process proceeds to step S33. In step S33, the timing control unit 24 increments by 1 the detection count unsync_count, a counter variable holding the number of consecutive times the output time difference has been detected to be equal to or greater than the predetermined threshold, and the process proceeds to step S35.

If it is determined in step S32 that the output time difference is not equal to or greater than the predetermined threshold, the process proceeds to step S34. In step S34, the timing control unit 24 resets the detection count unsync_count to 0, and the process proceeds to step S35.

In step S35, as in step S12 of FIG. 2, the timing control unit 24 determines whether or not there is a notification that the user is not watching the video, that is, whether or not the detection result information supplied from the line-of-sight detection unit 22 indicates that the user is not watching the video.

If it is determined in step S35 that the detection result information does not indicate that the user is not watching the video, that is, if the user is watching the video, the process proceeds to step S36. In step S36, the timing control unit 24 sets the threshold resync_count for the detection count unsync_count to a default value, and the process proceeds to step S37.

In step S37, the timing control unit 24 determines whether or not the number of detections unsync_count is greater than the threshold value resync_count.

If it is determined in step S37 that the detection count unsync_count is greater than the threshold resync_count, that is, if the output time difference has been equal to or greater than the predetermined threshold for more than resync_count consecutive detections, the process proceeds to step S38.

In step S38, the timing control unit 24 performs AV synchronization control and ends the process.

Here, in the AV synchronization control, for example, output skip and repeat control of the frame of interest are performed so that the output time difference becomes smaller than a predetermined threshold.

That is, when the output time of the target frame is ahead of the output time of the target audio frame, repeat control of the target frame is performed in order to delay the output time of the target frame.

Further, when the output time of the target frame is delayed from the output time of the target audio frame, skip control of the target frame is performed in order to advance the output time of the target frame.
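The choice between repeat and skip described above can be illustrated with a minimal sketch. It assumes that output times are expressed in seconds and that an earlier (smaller) output time means "ahead"; the names SyncAction and choose_sync_action are hypothetical and do not appear in this specification.

```python
from enum import Enum


class SyncAction(Enum):
    OUTPUT = "output"   # output the frame normally
    REPEAT = "repeat"   # output the same frame again to delay the video
    SKIP = "skip"       # drop the frame to advance the video


def choose_sync_action(video_time: float, audio_time: float,
                       threshold: float) -> SyncAction:
    """Pick the action that reduces the video/audio output time difference.

    video_time and audio_time are the output times (assumed to be in
    seconds) of the target video frame and the target audio frame.
    """
    diff = video_time - audio_time
    if abs(diff) < threshold:
        # Difference is already smaller than the predetermined threshold.
        return SyncAction.OUTPUT
    if diff < 0:
        # Video is ahead of the audio: repeat to delay the video.
        return SyncAction.REPEAT
    # Video is delayed relative to the audio: skip to advance the video.
    return SyncAction.SKIP
```

This sketch covers only the direction of the correction; whether AV synchronization control is started at all is decided by the comparison in step S37.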

On the other hand, if it is determined in step S37 that the detection count unsync_count is not greater than the threshold resync_count, that is, if the output time difference is not equal to or greater than the predetermined threshold, or if it is but this has not occurred more than resync_count times in succession, the process proceeds to step S39.

In step S39, the timing control unit 24 outputs the frame of interest from the buffer 19, and the process ends.

On the other hand, if it is determined in step S35 that the detection result information indicates that the user is not watching the video, the process proceeds to step S40, and the timing control unit 24 sets the threshold resync_count so that skip and repeat are more easily performed.

That is, the timing control unit 24 sets the threshold resync_count to, for example, 0.8 times the default value.

After the threshold resync_count is set in step S40 so that skip and repeat are easily performed, the process proceeds to step S37, and the same processing as described above is performed.

However, in this case, because the user is not watching the video, the threshold resync_count has been set so that skip and repeat are easily performed.

Therefore, it becomes easy to perform output repeat and skip control of the frame of interest in step S38.

As a result, when skip or repeat would be likely to occur in the near future with the threshold resync_count at its default value, the skip or repeat is performed in advance while the user is not watching the video.

Therefore, it is possible to prevent the user from feeling uncomfortable by seeing the awkward movement of the video caused by skipping or repeating.

Note that the processing according to the flowchart of FIG. 3 is repeated.
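One pass through steps S32 to S40 can be sketched as follows. This is only an illustration under stated assumptions: the default value of resync_count (10 here) is hypothetical because the text gives no value, the 0.8 factor is the example given for step S40, and the class name TimingControl and its method are invented for this sketch. The computation of the output time difference itself is assumed to have been done beforehand.

```python
from dataclasses import dataclass

DEFAULT_RESYNC_COUNT = 10    # hypothetical default; the text gives no value
NOT_WATCHING_FACTOR = 0.8    # "for example, 0.8 times" the default value


@dataclass
class TimingControl:
    diff_threshold: float    # predetermined threshold for the output time difference
    unsync_count: int = 0    # consecutive detections of a large difference

    def step(self, output_time_diff: float, user_not_watching: bool) -> str:
        # S32 to S34: count consecutive detections, reset on a small difference.
        if abs(output_time_diff) >= self.diff_threshold:
            self.unsync_count += 1
        else:
            self.unsync_count = 0

        # S35, S36, S40: choose resync_count from the line-of-sight result.
        if user_not_watching:
            resync_count = DEFAULT_RESYNC_COUNT * NOT_WATCHING_FACTOR
        else:
            resync_count = DEFAULT_RESYNC_COUNT

        # S37 to S39: resynchronize only after enough consecutive detections.
        if self.unsync_count > resync_count:
            return "av_sync"   # S38: skip or repeat the frame of interest
        return "output"        # S39: output the frame of interest from the buffer
```

With these assumed values, AV synchronization control starts on the 9th consecutive detection while the user is not watching the video, and on the 11th while the user is watching.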

Here, in FIG. 3, the threshold resync_count for the detection count unsync_count is set depending on whether the user is watching the video, and skip and repeat control is performed when the detection count unsync_count is greater than the threshold resync_count, that is, when the output time difference has been equal to or greater than the predetermined threshold more than resync_count times in succession. However, skip and repeat control can also be performed simply according to the magnitude relationship between the output time difference and the predetermined threshold.

That is, a predetermined threshold value for the output time difference is set depending on whether the user is watching the video, and when the output time difference is equal to or greater than the predetermined threshold value, skip and repeat control can be performed. In this case, when the user is watching the video, the predetermined threshold is set to a default value, and when the user is not watching the video, the predetermined threshold is set to a value smaller than the default value.

As described above, even when the predetermined threshold for the output time difference is set depending on whether the user is watching the video, skip and repeat control can be made easier to perform when the user is not watching the video, as in the case of FIG. 3 in which the threshold resync_count for the detection count unsync_count is set depending on whether the user is watching the video.
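The simpler variation, in which the threshold for the output time difference itself depends on whether the user is watching, might look like the following sketch. The default threshold of 0.100 seconds and the 0.8 factor are assumptions for illustration; the text only states that the threshold is set to a value smaller than the default when the user is not watching. The function name needs_av_sync is hypothetical.

```python
DEFAULT_DIFF_THRESHOLD = 0.100   # seconds; hypothetical default value
NOT_WATCHING_FACTOR = 0.8        # hypothetical; text only says "smaller than the default"


def needs_av_sync(output_time_diff: float, user_watching: bool) -> bool:
    """Return True when skip or repeat control should be performed."""
    threshold = DEFAULT_DIFF_THRESHOLD
    if not user_watching:
        # A smaller threshold makes skip and repeat easier to trigger.
        threshold *= NOT_WATCHING_FACTOR
    return abs(output_time_diff) >= threshold
```

Under these assumed values, a difference of 0.09 seconds is tolerated while the user is watching but triggers skip or repeat control while the user is looking away.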

In this embodiment, an MPEG stream is used as the content stream. However, the content stream is not limited to the MPEG stream.

Next, a series of processing of the timing control unit 24 can be performed by hardware or can be performed by software. When a series of processing is performed by software, a program constituting the software is installed in a computer such as a microcomputer.

Therefore, FIG. 4 shows a configuration example of an embodiment of a computer in which a program for executing the series of processes described above is installed.

The program can be recorded in advance in a hard disk 105 or a ROM 103 as a recording medium built in the computer.

Alternatively, the program can be stored (recorded) in the removable recording medium 111. Such a removable recording medium 111 can be provided as so-called package software. Here, examples of the removable recording medium 111 include a flexible disk, a CD-ROM (Compact Disc Read Only Memory), an MO (Magneto Optical) disc, a DVD (Digital Versatile Disc), a magnetic disc, and a semiconductor memory.

The program can be installed on the computer from the removable recording medium 111 as described above, or can be downloaded to the computer via a communication network or a broadcast network and installed on the built-in hard disk 105. That is, the program can be transferred from a download site to the computer wirelessly via an artificial satellite for digital satellite broadcasting, or transferred to the computer by wire via a network such as a LAN (Local Area Network) or the Internet.

The computer includes a CPU (Central Processing Unit) 102, and an input/output interface 110 is connected to the CPU 102 via the bus 101.

When an instruction is input via the input/output interface 110 by the user operating the input unit 107, the CPU 102 executes a program stored in the ROM (Read Only Memory) 103 accordingly. Alternatively, the CPU 102 loads a program stored in the hard disk 105 into a RAM (Random Access Memory) 104 and executes it.

Thereby, the CPU 102 performs processing according to the flowchart described above or processing performed by the configuration of the block diagram described above. Then, as necessary, the CPU 102 outputs the processing result from the output unit 106, transmits it from the communication unit 108, or records it in the hard disk 105, for example, via the input/output interface 110.

Note that the input unit 107 includes a keyboard, a mouse, a microphone, and the like. The output unit 106 includes an LCD (Liquid Crystal Display), a speaker, and the like.

Here, in the present specification, the processing performed by the computer according to the program does not necessarily have to be performed in chronological order in the order described as the flowchart. That is, the processing performed by the computer according to the program includes processing executed in parallel or individually (for example, parallel processing or object processing).

Further, the program may be processed by one computer (processor), or may be distributedly processed by a plurality of computers. Furthermore, the program may be transferred to a remote computer and executed.

Furthermore, in this specification, the system means a set of a plurality of components (devices, modules (parts), etc.), and it does not matter whether all the components are in the same housing. Accordingly, a plurality of devices housed in separate housings and connected via a network and a single device housing a plurality of modules in one housing are all systems.

Note that the embodiments of the present technology are not limited to the above-described embodiments, and various modifications can be made without departing from the gist of the present technology.

For example, the present technology can take a cloud computing configuration in which one function is shared by a plurality of devices via a network and is jointly processed.

Further, each step described in the above flowchart can be executed by one device or can be shared by a plurality of devices.

Further, when a plurality of processes are included in one step, the plurality of processes included in the one step can be executed by being shared by a plurality of apparatuses in addition to being executed by one apparatus.

Further, the effects described in the present specification are merely examples and are not limited, and other effects may be obtained.

Note that the present technology can also take the following configurations.

<1>
A content processing apparatus including a timing control unit that controls output timing of outputting content according to a detection result of a user's line of sight.
<2>
The content includes a video,
The content processing apparatus according to <1>, wherein the timing control unit performs skip or repeat control of the video output as control of the output timing.
<3>
The content processing apparatus according to <2>, wherein the timing control unit controls the skip or repeat according to the detection result of the user's line of sight so that the skip or repeat is easily performed when the user is not watching the video.
<4>
The video is output after being stored in a video buffer,
The timing control unit performs the skip control when the accumulation amount of the video stored in the video buffer is equal to or greater than a skip threshold for performing the skip, and performs the repeat control when the accumulation amount of the video stored in the video buffer is equal to or less than a repeat threshold for performing the repeat,
The content processing device according to <3>, wherein when the user is not watching the video, the skip threshold is decreased and the repeat threshold is increased.
<5>
The content further includes audio,
The timing control unit performs the skip or repeat control so that the difference between the output times becomes small when the difference between the output times of the video and the audio that are to be output simultaneously is equal to or greater than a predetermined threshold,
The content processing apparatus according to <3>, wherein when the user does not watch the video, the skip or repeat is controlled so that the skip or repeat is easily performed.
<6>
The content processing apparatus according to any one of <2> to <5>, wherein, in the video skip, the output of the video is skipped, or one video is generated from a plurality of videos.
<7>
The content processing apparatus according to any one of <1> to <6>, further including a line-of-sight detection unit that detects a user's line of sight.
<8>
The content processing apparatus according to any one of <2> to <4>, wherein the video is a video having a frame rate that does not match the display rate of a display device that displays the video, or a video that is output at an output rate that does not match the transmission rate of the video.
<9>
The content processing apparatus according to <5>, wherein the content is content in which the video and the audio that are to be output simultaneously are output shifted in time relative to each other.
<10>
A content processing method including controlling output timing of outputting content according to a detection result of a user's line of sight.
<11>
A program for causing a computer to function as a timing control unit that controls output timing of outputting content in accordance with a detection result of a user's line of sight.
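As an illustration of configuration <4>, the buffer-based decision can be sketched as below. The threshold values, the unit (number of buffered frames), and the fixed margin by which the thresholds are shifted are all assumptions made for this sketch, as is the function name buffer_action; the configuration only states that the skip threshold is decreased and the repeat threshold is increased when the user is not watching the video.

```python
def buffer_action(frames_buffered: int, user_watching: bool,
                  skip_threshold: int = 8, repeat_threshold: int = 2,
                  margin: int = 1) -> str:
    """Decide between skip, repeat, and normal output from buffer occupancy.

    All default values are hypothetical and chosen only for illustration.
    """
    if not user_watching:
        # Make both skip and repeat easier to trigger.
        skip_threshold -= margin
        repeat_threshold += margin
    if frames_buffered >= skip_threshold:
        return "skip"      # buffer is filling up: drop a frame
    if frames_buffered <= repeat_threshold:
        return "repeat"    # buffer is running low: output a frame again
    return "output"
```

With these assumed values, a buffer holding 7 frames leads to normal output while the user is watching, but to a skip while the user is looking away.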

11 TS buffer, 12 DEMUX, 13 video ES buffer, 14 audio ES buffer, 15 caption buffer, 16 video decoder, 17 audio decoder, 18 caption decoder, 19 to 21 buffer, 22 line-of-sight detection unit, 23 buffer accumulation amount monitoring unit, 24 timing control unit, 31 demultiplexing unit, 32 video PES buffer, 33 audio PES buffer, 34 caption PES buffer, 35 to 37 PES header detection unit, 38 time information extraction unit, 101 bus, 102 CPU, 103 ROM, 104 RAM, 105 hard disk, 106 output unit, 107 input unit, 108 communication unit, 109 drive, 110 input/output interface, 111 removable recording medium

Claims

A content processing apparatus including a timing control unit that controls output timing of outputting content according to a detection result of a user's line of sight.
The content includes a video,
The content processing apparatus according to claim 1, wherein the timing control unit performs skip or repeat control of the video output as control of the output timing.
The content processing apparatus according to claim 2, wherein the timing control unit controls the skip or repeat according to the detection result of the user's line of sight so that the skip or repeat is easily performed when the user is not watching the video.
The video is output after being stored in a video buffer,
The timing control unit performs the skip control when the accumulation amount of the video stored in the video buffer is equal to or greater than a skip threshold for performing the skip, and performs the repeat control when the accumulation amount of the video stored in the video buffer is equal to or less than a repeat threshold for performing the repeat,
The content processing apparatus according to claim 3, wherein when the user does not watch the video, the skip threshold is decreased and the repeat threshold is increased.
The content further includes audio,
The timing control unit performs the skip or repeat control so that the difference between the output times becomes small when the difference between the output times of the video and the audio that are to be output simultaneously is equal to or greater than a predetermined threshold,
The content processing apparatus according to claim 3, wherein when the user does not watch the video, the skip or repeat is controlled so that the skip or repeat is easily performed.
The content processing apparatus according to claim 2, wherein in the video skip, the output of the video is skipped or one video is generated from the plurality of videos.
The content processing apparatus according to claim 1, further comprising: a line-of-sight detection unit that detects a user's line of sight.
The content processing apparatus, wherein the video is a video having a frame rate that does not match a display rate of a display device that displays the video, or a video that is output at an output rate that does not match a transmission rate of the video.
The content processing apparatus according to claim 5, wherein the content is content in which the video and the audio that are to be output simultaneously are output shifted in time relative to each other.
A content processing method including controlling output timing of outputting content according to a detection result of a user's line of sight.
A program for causing a computer to function as a timing control unit that controls output timing of outputting content in accordance with a detection result of a user's line of sight.