WO2007013407A1 - Digest generation device, digest generation method, recording medium containing a digest generation program, and integrated circuit used in digest generation device - Google Patents


Info

Publication number
WO2007013407A1
WO2007013407A1 (PCT/JP2006/314589, JP2006314589W)
Authority
WO
WIPO (PCT)
Prior art keywords
digest
section
time
specific section
candidate
Prior art date
Application number
PCT/JP2006/314589
Other languages
French (fr)
Japanese (ja)
Inventor
Takashi Kawamura
Meiko Maeda
Kazuhiro Kuroyama
Original Assignee
Matsushita Electric Industrial Co., Ltd.
Priority date
Filing date
Publication date
Application filed by Matsushita Electric Industrial Co., Ltd.
Priority to JP2007528453A (published as JPWO2007013407A1)
Priority to US11/994,827 (published as US20090226144A1)
Publication of WO2007013407A1

Classifications

    • G PHYSICS
    • G11 INFORMATION STORAGE
    • G11B INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B 27/00 Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B 27/10 Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B 27/19 Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
    • G11B 27/28 Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information signals recorded by the same method as the main recording
    • G11B 27/102 Programmed access in sequence to addressed parts of tracks of operating record carriers
    • G11B 27/105 Programmed access in sequence to addressed parts of tracks of operating discs
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04H BROADCAST COMMUNICATION
    • H04H 60/00 Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
    • H04H 60/35 Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users
    • H04H 60/37 Arrangements for identifying segments of broadcast information, e.g. scenes or extracting programme ID
    • H04H 60/375 Commercial
    • H04H 60/56 Arrangements characterised by components specially adapted for monitoring, identification or recognition covered by groups H04H60/29-H04H60/54
    • H04H 60/58 Monitoring, identification or recognition of audio
    • H04H 60/59 Monitoring, identification or recognition of video
    • H04H 60/61 Arrangements for services using the result of monitoring, identification or recognition covered by groups H04H60/29-H04H60/54
    • H04H 60/65 Arrangements for using the result on users' side
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 5/00 Details of television systems
    • H04N 5/76 Television signal recording
    • H04N 5/765 Interface circuits between an apparatus for recording and another apparatus
    • H04N 5/775 Interface circuits between a recording apparatus and a television receiver
    • H04N 5/78 Television signal recording using magnetic recording
    • H04N 5/781 Television signal recording using magnetic recording on disks or drums
    • G11B 2220/00 Record carriers by type
    • G11B 2220/20 Disc-shaped record carriers
    • G11B 2220/25 Disc-shaped record carriers characterised in that the disc is based on a specific recording technology
    • G11B 2220/2508 Magnetic discs
    • G11B 2220/2516 Hard disks

Definitions

  • Digest generation device, digest generation method, recording medium storing a digest generation program, and integrated circuit used in a digest generation device
  • The present invention relates to the generation of digest scenes, and more specifically to generating digest scenes by calculating video and audio feature quantities from television broadcasts and the like and using them to determine important scenes.
  • Digest generation apparatuses calculate feature quantities of video and audio, such as television broadcasts, and determine important scenes using these quantities.
  • Conventionally, the following method is generally used to generate a digest: feature quantities are calculated for one program from the video and audio signals once recorded on the recording medium, CM sections are detected based on those feature quantities, and time information such as a playlist for digest playback is created.
  • FIG. 14 shows an example of the configuration of a digest generation device that generates a digest excluding the CM section.
  • a receiving unit 101 receives a broadcast radio wave and demodulates it into an audio video signal (hereinafter referred to as an AV signal).
  • The mass storage medium 102 is a medium for recording the received AV signal; an HDD or the like corresponds to this.
  • The feature quantity extraction unit 103 calculates, from the AV signal stored in the mass storage medium 102, a feature quantity required for digest generation (hereinafter referred to as a digest feature quantity) and a feature quantity required for CM detection (hereinafter referred to as a CM feature quantity).
  • CM feature quantities may include scene-change detection results based on luminance information and silence information of the audio.
  • the CM detection unit 104 detects a CM section (start time and end time information) based on the calculated CM feature value, and outputs it to the digest detection unit 105.
  • The CM section is detected, for example, by detecting video scene changes from the luminance information of the video.
  • The digest detection unit 105 detects digest scenes excluding the CM sections, based on the digest feature quantity and the CM section information output from the CM detection unit 104.
  • The detected digest scenes (start time and end time information) are output to the playback control unit 106 as digest information.
  • For example, a method that identifies slow-motion (replay) scenes from the motion vectors of the video and detects the few cuts preceding them as exciting scenes (for example, Patent Document 1),
  • a method that detects scenes in which the voice power information takes a locally large value as exciting scenes (for example, Patent Document 2), and
  • a method that detects important scenes by combining text information given to a program with features of the video and audio signals (for example, Patent Document 3) are used.
  • the playback control unit 106 reads an AV signal from the large-capacity storage medium 102 and performs digest playback based on the digest information.
  • In this way, digest information consisting of digest scenes in the sections excluding the CM sections can be created and used for digest playback.
  • FIG. 15 shows a digest generation device that detects digest scene candidates in real time while calculating feature quantities in parallel with the recording process, stores them together with CM feature quantities in a large-capacity storage means, and, at playback time, detects the CM sections and generates correct digest information by excluding candidates included in a CM section.
  • the receiving unit 101 records the received AV signal on the large-capacity storage medium 102 and outputs the AV signal to the feature amount extracting unit 103 as well.
  • the feature quantity extraction unit 103 calculates a CM feature quantity and stores it in the mass storage medium 102.
  • The feature quantity extraction unit 103 outputs the digest feature quantity, such as the voice power level, to the digest detection unit 105.
  • The digest detection unit 105 analyzes the digest feature quantity and detects, for example, a scene whose voice power level is equal to or higher than a predetermined threshold as a digest scene candidate. The digest detection unit 105 then stores the detected scenes in the mass storage medium 102 as digest candidate information. In other words, scenes that are digest candidates are detected in parallel with the program recording, and the digest candidate information (time information) and the CM feature quantity are recorded in the mass storage medium 102.
  • the CM detection unit 104 reads the CM feature amount from the large-capacity storage medium 102 and detects a CM section. CM detecting section 104 then outputs the detection result as CM section information to CM section removing section 107.
  • The CM section removing unit 107 deletes, from the digest candidate information read from the large-capacity storage medium 102, the portions corresponding to the CM sections, and creates the digest information.
  • That is, scenes whose voice power level is equal to or higher than a predetermined value are provisionally detected, including those in CM sections, and recorded as digest candidate information.
  • Then, the entire recorded program is analyzed to detect the CM sections, the CM sections are subtracted from the digest candidates, and the digest sections are extracted.
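  • The subtraction described above, removing digest candidates that fall in a CM section, can be sketched as follows (illustrative Python only, not part of the disclosure; the (start, end)-tuple interval representation and the rule of dropping any overlapping candidate whole are assumptions):

```python
def subtract_cm_sections(candidates, cm_sections):
    """Remove digest-candidate intervals that overlap a CM interval.

    Intervals are (start, end) tuples in seconds. A candidate that
    overlaps a CM section at all is dropped whole (an assumed rule).
    """
    result = []
    for cs, ce in candidates:
        # Two intervals overlap iff each starts before the other ends.
        if any(cs < me and ms < ce for ms, me in cm_sections):
            continue  # falls in a CM section -> exclude
        result.append((cs, ce))
    return result

# Hypothetical times: the candidate at 130-140 s lies inside
# the CM section 120-180 s and is removed.
digest = subtract_cm_sections([(10, 20), (130, 140), (300, 310)],
                              [(120, 180)])
```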
  • Patent Document 1: JP 2004-128550 A
  • Patent Document 2: JP 10-039890 A
  • Patent Document 3: JP 2001-119649 A
  • the digest generation apparatus as described above has the following problems.
  • The second method calculates feature quantities and detects digest-candidate scene information during recording, so compared with the first method it reduces the time needed for feature-quantity calculation at the time of a playback instruction. However, because the start and end of a CM section cannot be determined in real time, CM section detection is still performed after recording ends (for example, when playback is instructed), so a processing wait remains.
  • Therefore, an object of the present invention is to provide a digest generation apparatus that has no processing wait for generating digest information of a program after recording of the program ends.
  • the present invention employs the following configuration.
  • A first aspect is a digest generation device that generates digest scene information on a program while the broadcast signal of the broadcast program is received and recorded on a recording medium, and includes a feature amount calculation unit, a specific section end detection unit, and a digest scene information creation unit.
  • The feature amount calculation unit calculates, every time a broadcast signal of a predetermined unit time is received, at least one type of feature amount indicating a feature related to at least one of video and audio included in the received broadcast signal for that unit time.
  • The specific section end detection unit determines, every time the feature amount is calculated, whether or not a predetermined time point included in the signal portion of the received broadcast signal for which the feature amount has already been calculated is the start or end of a specific section, thereby detecting the start or end of the specific section.
  • The digest scene information creation unit determines, every time the feature amount is calculated, whether or not the broadcast signal in the sections of the program excluding the specific section is a digest scene, based on the feature amount, and generates digest scene information.
  • In a second aspect, the digest scene information creation unit includes a digest section detection unit that detects digest candidate sections in the received broadcast signal by determining, each time the feature amount is calculated, whether or not the content included in the broadcast signal for the unit time is a digest scene, based on the feature amount.
  • Each time the specific section end detection unit detects a pair of the start and end of a specific section, the digest scene information creation unit determines whether or not the specific section from that start to that end overlaps a digest candidate section, and generates, as digest scene information, information indicating the digest candidate sections detected by the digest section detection unit excluding those that overlap the specific section.
  • In a third aspect, the digest scene information creation unit includes a temporary storage unit that stores the calculated feature amounts for a predetermined time back from the latest calculation time point.
  • The digest scene information creation unit determines whether or not the time point of each feature amount stored in the temporary storage unit is included between the start and end of the specific section detected by the specific section end detection unit, and only when it is not included, detects digest-scene content from the content included in the broadcast signal for the unit time and generates digest scene information.
  • In a fourth aspect, the feature amount calculation unit calculates first and second feature amounts.
  • The specific section end detection unit detects the specific section based on the first feature amount.
  • The digest section detection unit detects digest candidate sections based on the second feature amount.
  • In a fifth aspect, the specific section end detection unit includes a specific section candidate detection unit that detects, as a specific section candidate, a section consisting only of feature amounts satisfying a predetermined condition, and a specific section determination unit that determines the start or end of the specific section based on the time differences between specific section candidates in the program.
  • In a sixth aspect, every time a specific section candidate is detected, the specific section determination unit determines whether or not an already detected specific section candidate exists a predetermined time before the last-detected candidate; if it exists, that earlier time point is determined to be the start of the specific section and the last-detected specific section candidate the end of the specific section.
  • In a seventh aspect, the specific section determination unit includes a determination unit that determines, every time a specific section candidate is detected, whether or not an already detected specific section candidate exists a predetermined second time before the last-detected candidate;
  • an addition unit that adds points to both the candidate determined to exist and the last-detected candidate;
  • a start determination unit that, when a target candidate having a score equal to or greater than a predetermined value is detected, determines whether or not a specific section candidate whose score is equal to or greater than the predetermined value exists a predetermined third time before the target candidate, and if not, sets the target candidate as the start of the specific section; and
  • an end determination unit that, every time the predetermined third time elapses after detection of a target candidate having a score equal to or greater than the predetermined value, determines whether or not a specific section candidate whose score is equal to or greater than the predetermined value exists, and if not, sets the target candidate as the end of the specific section.
  • In an eighth aspect, the feature amount calculation unit calculates the power level of the audio signal as the feature amount, and the specific section candidate detection unit detects silent sections whose power level is equal to or less than a predetermined value as specific section candidates.
  • In a ninth aspect, the feature amount calculation unit calculates luminance information based on the video signal as the feature amount, and the specific section candidate detection unit detects scene change points at which the amount of change in the luminance information is equal to or greater than a predetermined value as specific section candidates.
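  • Scene-change detection of this kind can be sketched as follows (illustrative Python only; the per-frame mean-luminance input and the threshold of 40 are assumptions, the patent leaves the value unspecified):

```python
def scene_change_points(luminance, threshold=40.0):
    """Return frame indices where the mean luminance jumps by more
    than `threshold` (an assumed value) relative to the previous frame,
    i.e. the specific-section candidates of the ninth aspect."""
    return [i for i in range(1, len(luminance))
            if abs(luminance[i] - luminance[i - 1]) > threshold]

# Hypothetical mean-luminance series: cuts occur at frames 2 and 4.
cuts = scene_change_points([100, 102, 30, 32, 200])
```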
  • A tenth aspect is a digest generation method for generating digest scene information on a program while the broadcast signal of the broadcast program is received and recorded on a recording medium, and includes a feature amount calculation step, a specific section end detection step, and a digest scene information creation step.
  • In the feature amount calculation step, every time a broadcast signal of a predetermined unit time is received, at least one type of feature amount indicating a feature related to at least one of video and audio included in the broadcast signal is calculated from the received broadcast signal for the unit time.
  • In the specific section end detection step, every time the feature amount is calculated, it is determined whether or not a predetermined time point included in the signal portion of the received broadcast signal for which the feature amount has already been calculated is the start or end of the specific section, whereby the start or end of the specific section is detected.
  • the digest scene information creation step determines whether or not the broadcast signal for a section excluding the specific section of the entire section of the program is a digest scene every time the feature amount is calculated. Digest scene information is generated.
  • The digest scene information creation step includes a digest section detection step of detecting digest candidate sections in the received broadcast signal by determining, each time the feature amount is calculated for the broadcast signal for the unit time, whether or not the content included in the broadcast signal for the unit time is a digest scene, based on the feature amount.
  • In the digest scene information creation step, every time a pair of the start and end of a specific section is detected in the specific section end detection step, it is determined whether or not the specific section from that start to that end overlaps a digest candidate section, and information indicating the digest candidate sections detected in the digest section detection step excluding those that overlap the specific section is generated as digest scene information.
  • The digest scene information creation step includes a temporary storage step of storing the calculated feature amounts for a predetermined time back from the latest calculation time point.
  • In the digest scene information creation step, every time the feature amount is calculated, it is determined whether or not the time point of each feature amount stored in the temporary storage step is included between the start and end of the specific section detected in the specific section end detection step, and only when it is not included, digest scene information is generated by detecting digest-scene content from the content included in the broadcast signal for the unit time.
  • A thirteenth aspect is a recording medium storing a digest generation program to be executed by a computer of a digest generation device that generates digest scene information on a program while the broadcast signal of the broadcast program is received and recorded on a recording medium.
  • The recording medium stores a program comprising a feature amount calculation step, a specific section end detection step, and a digest scene information creation step.
  • The feature amount calculation step is a process of calculating, every time a broadcast signal of a predetermined unit time is received, at least one type of feature amount indicating a feature related to at least one of video and audio included in the broadcast signal, from the received broadcast signal for the unit time.
  • The specific section end detection step is a process of detecting the start or end of a specific section by determining, every time the feature amount is calculated, whether or not a predetermined time point included in the signal portion of the received broadcast signal for which the feature amount has already been calculated is the start or end of the specific section.
  • The digest scene information creation step is a process of generating digest scene information by determining, every time the feature amount is calculated, whether or not the broadcast signal in the sections of the program excluding the specific section is a digest scene, based on the feature amount.
  • The digest scene information creation step includes a digest section detection step of detecting digest candidate sections in the received broadcast signal by determining, each time the feature amount is calculated for the broadcast signal for the unit time, whether or not the content included in the broadcast signal for the unit time is a digest scene, based on the feature amount.
  • In the digest scene information creation step, every time a pair of the start and end of a specific section is detected in the specific section end detection step, it is determined whether or not the specific section from that start to that end overlaps a digest candidate section, and information indicating the digest candidate sections detected in the digest section detection step excluding those that overlap the specific section is generated as digest scene information.
  • The digest scene information creation step includes a temporary storage step of storing the calculated feature amounts for a predetermined time back from the latest calculation time point.
  • In the digest scene information creation step, every time the feature amount is calculated, it is determined whether or not the time point of each feature amount stored in the temporary storage step is included between the start and end of the specific section detected in the specific section end detection step, and only when it is not included, digest scene information is generated by detecting digest-scene content from the content included in the broadcast signal for the unit time.
  • A sixteenth aspect is an integrated circuit used in a digest generation device that generates digest scene information on a program while the broadcast signal of the broadcast program is received and recorded on a recording medium, and includes a feature amount calculation unit, a specific section end detection unit, and a digest scene information creation unit.
  • The feature amount calculation unit calculates, every time a broadcast signal of a predetermined unit time is received, at least one type of feature amount indicating a feature related to at least one of video and audio included in the received broadcast signal for that unit time.
  • The specific section end detection unit detects the start or end of a specific section by determining, every time the feature amount is calculated, whether or not a predetermined time point included in the signal portion of the received broadcast signal for which the feature amount has already been calculated is the start or end of the specific section.
  • The digest scene information creation unit generates digest scene information by determining, every time the feature amount is calculated, whether or not the broadcast signal in the sections of the program excluding the specific section is a digest scene, based on the feature amount.
  • The digest scene information creation unit includes a digest section detection unit that detects digest candidate sections in the received broadcast signal by determining, each time the feature amount is calculated for the broadcast signal for the unit time, whether or not the content included in the broadcast signal for the unit time is a digest scene, based on the feature amount. In addition, every time the specific section end detection unit detects a pair of the start and end of a specific section, the digest scene information creation unit determines whether or not the specific section from that start to that end overlaps a digest candidate section, and generates information indicating the digest candidate sections detected by the digest section detection unit excluding those that overlap the specific section.
  • The digest scene information creation unit includes a temporary storage unit that stores the calculated feature amounts for a predetermined time back from the latest calculation time point.
  • The digest scene information creation unit determines whether or not the time point of each feature amount stored in the temporary storage unit is included between the start and end of the specific section detected by the specific section end detection unit, and only when it is not included, generates digest scene information by detecting digest-scene content from the content included in the broadcast signal for the unit time.
  • According to the present invention, digest scene information excluding the specific section can be generated in parallel with the recording of the program.
  • Here, the specific section is, for example, a CM section.
  • the same effect as the first invention can be obtained.
  • The specific section is determined based on the time interval between specific section candidates, whereby the specific section can be determined more accurately.
  • The specific section candidates are scored based on a predetermined time interval, which makes it possible to evaluate how likely each candidate is to be the start or end of the specific section. Furthermore, because only specific section candidates with high scores are taken as the start or end of the specific section, a specific section candidate that merely happened to occur in the program can be prevented from being erroneously determined to be the start or end. As a result, digest scene information excluding the specific sections can be created more accurately.
  • A silent section is used as the specific section candidate. This makes it possible to detect a specific section by using the property that a specific section, such as a CM section, begins and ends with a silent section.
  • A scene change point at which the luminance information changes greatly is used as a specific section candidate. This allows the transition from the program to a specific section, where the luminance information changes greatly, to be treated as a specific section candidate, and as a result the specific section can be determined more accurately.
  • FIG. 1 is a block diagram showing a configuration of a digest generation apparatus 10 according to the first embodiment.
  • FIG. 2 is a diagram showing an example of data used in the present invention.
  • FIG. 3 is a flowchart showing digest scene list generation processing.
  • FIG. 4 is a flowchart showing details of the silent section detection process shown in step S4 of FIG. 3.
  • FIG. 5 is a flowchart showing details of the point evaluation process shown in step S16 of FIG. 4.
  • FIG. 6 is a flowchart showing details of the candidate section detection process shown in step S5 of FIG.
  • FIG. 7 is a flowchart showing details of the CM section determination processing shown in step S6 of FIG.
  • FIG. 8 is a diagram showing an example of CM section determination in the CM section determination processing.
  • FIG. 9 is a flowchart showing details of the digest scene list output process shown in step S7 of FIG. 3.
  • FIG. 10 is a block diagram showing a configuration of a digest generation apparatus 10 according to the second embodiment.
  • FIG. 11 is a diagram showing an example of data used in the present invention.
  • FIG. 12 is a flowchart showing a digest scene list generation process according to the second embodiment.
  • FIG. 13 is a flowchart showing details of the silent section detection process shown in step S66 of FIG. 12.
  • FIG. 14 is a block diagram showing a configuration of a conventional recording / reproducing apparatus.
  • FIG. 15 is a block diagram showing a configuration of a conventional recording / reproducing apparatus.
  • the present invention creates a digest scene list indicating the position of the digest scene in parallel with the recording of the program.
  • A scene in which the voice power level takes a locally large value, that is, an exciting scene, is employed as the digest scene. For this reason, scenes whose voice power level is equal to or higher than a predetermined value are extracted as digest candidate sections.
  • Sections whose voice power level is equal to or less than a predetermined value are extracted as silent sections, and sections in which silent sections appear at a predetermined interval (for example, every 15 seconds) are extracted as CM sections.
  • Then, by excluding the information corresponding to the CM sections from the information on the digest candidate sections, a digest scene list indicating the digest scenes in the program is created. In the present embodiment, the description assumes that the length of one CM section is 60 seconds at the maximum.
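  • The CM-section rule just described, silent sections recurring at the predetermined interval, bounded by the 60-second maximum, can be sketched as follows (illustrative Python only; the 0.5 s matching tolerance and the requirement of at least two linked silences are assumptions):

```python
def detect_cm_sections(silences, unit=15.0, tol=0.5, max_len=60.0):
    """Group silent time points whose spacing is roughly `unit`
    seconds into CM sections, each capped at `max_len` seconds.
    `silences` is a sorted list of times (s); returns (start, end)."""
    sections, run = [], [silences[0]] if silences else []
    for t in silences[1:]:
        # Extend the run if this silence sits ~unit after the last one
        # and the run would not exceed the maximum CM-section length.
        if abs(t - run[-1] - unit) <= tol and t - run[0] <= max_len:
            run.append(t)
        else:
            if len(run) >= 2:  # at least two linked silences (assumed)
                sections.append((run[0], run[-1]))
            run = [t]
    if len(run) >= 2:
        sections.append((run[0], run[-1]))
    return sections

# Hypothetical silences: those at 100-145 s form a 15 s grid and
# are grouped into one CM section; isolated silences are ignored.
cms = detect_cm_sections([5.0, 100.0, 115.0, 130.0, 145.0, 300.0])
```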
  • FIG. 1 is a block diagram showing a configuration of a digest generation apparatus according to the first embodiment of the present invention.
  • a digest generation device 10 includes a receiving unit 11, a feature quantity calculation unit 12, a silent section detection unit 13, a candidate section detection unit 14, a CM section determination unit 15, a digest list creation unit 16, a large-capacity recording medium 17, and a playback control unit 18.
  • the receiving unit 11 receives the broadcast radio wave and demodulates it into an image signal and an audio signal (hereinafter referred to as AV signal). In addition, the reception unit 11 outputs the demodulated AV signal to the feature amount calculation unit 12, the large-capacity recording medium 17, and the reproduction control unit 18.
  • the feature amount calculation unit 12 analyzes the AV signal to calculate a feature amount, and outputs the feature amount to the silent section detection unit 13 and the candidate section detection unit 14.
  • the feature value is used to determine the CM section and digest scene in the program.
  • since the CM section is determined based on the occurrence interval of silent sections as described above, the feature quantity for determining the CM section is an audio feature quantity such as the power level of the audio signal.
  • as feature quantities for determining a digest scene, for example, video feature quantities such as luminance information and motion vectors of the video signal, and audio feature quantities such as the power level and spectrum of the audio signal are applicable. In the present embodiment, the description assumes that the power level of the audio signal is used as the feature quantity for determining both the CM section and the digest scene.
  • the silent section detector 13 detects a silent section in the program based on the feature amount, and generates silent section information 24. Further, the silent section detection unit 13 outputs the silent section information 24 to the CM section determination unit 15.
  • Candidate section detection unit 14 detects a section (hereinafter referred to as a candidate section) that is a digest scene candidate in the program based on the feature amount, and generates candidate section information 25. Further, the candidate section detection unit 14 outputs the candidate section information 25 to the digest list creation unit 16.
  • the CM section determination unit 15 determines the CM section by looking at the time interval of the silent section based on the silent section information 24. Then, the CM section determination unit 15 outputs the determined CM section as CM section information 27 to the digest list creation unit 16.
  • based on the candidate section information 25 and the CM section information 27, the digest list creation unit 16 creates a digest scene list 28, which is information indicating the positions of the digest scenes. The digest list creation unit 16 then outputs the digest scene list 28 to the large-capacity recording medium 17 and the reproduction control unit 18.
  • the large-capacity recording medium 17 is a medium for recording the AV signal and the digest scene list 28, and is realized by a DVD, an HDD, or the like.
  • the reproduction control unit 18 performs reproduction control such as reproduction of the received AV signal and reproduction of the AV signal recorded on the large-capacity recording medium 17 and output to the monitor.
  • the feature quantity calculation unit 12, the silent section detection unit 13, the candidate section detection unit 14, the CM section determination unit 15, and the digest list creation unit 16 illustrated in FIG. 1 may typically be realized as an LSI, which is an integrated circuit.
  • the feature quantity calculation unit 12, the silent section detection unit 13, the candidate section detection unit 14, the CM section determination unit 15, and the digest list creation unit 16 may each be implemented as an individual chip, or a single chip may include some or all of them. Further, the method of circuit integration is not limited to LSI and may be realized by a dedicated circuit or a general-purpose processor.
  • comparison feature quantity information 21 (FIG. 2 (A)) is used to detect silent sections and the like, and holds the time information 211 of the immediately preceding frame and the immediately preceding feature value 212, in which the audio power level calculated by the feature quantity calculation unit 12 for that frame is stored.
  • Silence start edge information 22 (Fig. 2 (B)) has a silence start edge time, and is used to detect a silence interval.
  • Candidate start edge information 23 (Fig. 2 (C)) has a candidate start edge time, and is used to detect a candidate section.
  • the silent section information 24 (FIG. 2 (D)) stores the detection result of the silent section by the silent section detector 13.
  • the silent section information 24 consists of a set of the section number 241, the score 242, the start time 243, and the end time 244.
  • the section number 241 is a number for identifying each silent section.
  • the score 242 is a value that evaluates how likely the silent section is to be an end of a CM section. The higher the score, the higher the possibility that the silent section is an end of a CM section. Conversely, a low score indicates that the silent section merely happens to appear during the program (that is, it is not an end of a CM section).
  • the start time 243 and end time 244 are time information indicating the start time and end time of the silent section.
  • Candidate section information 25 (Fig. 2 (E)) stores the detection results of candidate sections by candidate section detector 14.
  • Candidate section information 25 consists of a set of candidate number 251, start time 252 and end time 253.
  • Candidate number 251 is a number for identifying each candidate section.
  • the start time 252 and end time 253 are time information indicating the start time and end time of the candidate section.
  • temporary CM start edge information 26 (FIG. 2 (F)) holds a temporary CM start edge time used by the CM section determination unit 15 to detect CM sections; the start time of a silent section that may be the start edge of a CM section is stored here.
  • CM section information 27 (FIG. 2 (G)) stores information on the CM sections detected by the CM section determination unit 15.
  • CM section information 27 consists of a set of the CM number 271, the CM start time 272, and the CM end time 273.
  • CM number 271 is a number for identifying each CM section.
  • CM start time 272 and CM end time 273 are time information indicating the start time and end time of the CM section.
  • the digest scene list 28 (FIG. 2 (H)) is a file indicating the time information of the sections that become digest scenes in the program. It consists of a set of the digest number 281, the digest start time 282, and the digest end time 283.
  • the digest number 281 is a number for identifying each digest section.
  • the digest start time 282 and the digest end time 283 are time information indicating the start time and end time of the digest section.
  • FIG. 3 is a flowchart showing the detailed operation of the digest scene list creation process according to the first embodiment.
  • the process shown in Fig. 3 is started by a recording instruction from the user.
  • the process shown in FIG. 3 is repeated with one frame as the processing unit.
  • the digest generation device 10 determines whether or not the end of recording has been instructed (step S1). As a result, when the end of recording is instructed (YES in step S1), the digest scene list creation process is terminated. On the other hand, when the end of recording is not instructed (NO in step S1), the feature quantity calculation unit 12 acquires a signal for one frame from the receiving unit 11 (step S2). Next, the feature quantity calculation unit 12 analyzes the acquired signal and calculates the audio power level (feature quantity) (step S3).
  • FIG. 4 is a flowchart showing details of the silent section detection process shown in step S4.
  • the silent section detection unit 13 determines whether or not the power level of the audio signal calculated in step S3 is equal to or less than a predetermined threshold (step S11). As a result, if it is equal to or less than the predetermined threshold (YES in step S11), the silent section detection unit 13 refers to the immediately preceding feature value 212, in which the feature value of the previous frame is stored, and determines whether or not that value is also equal to or less than the predetermined threshold (step S12).
  • that is, the silent section detection unit 13 determines the change in the audio power level between the current frame and the previous frame. As a result, if the previous value is not equal to or less than the predetermined threshold (NO in step S12), the silent section detection unit 13 stores the time information of the current frame in the silence start edge information 22 (step S13). It should be noted that immediately after the start of processing, nothing is stored in the immediately preceding feature value 212 yet; in this case, the processing proceeds on the assumption that the value is not equal to or less than the predetermined threshold. On the other hand, if it is equal to or less than the predetermined threshold (YES in step S12), the silent section is continuing, so the silent section detection process is terminated.
  • if the power level calculated in step S3 is not equal to or less than the predetermined threshold (NO in step S11), the silent section detection unit 13 refers to the immediately preceding feature value 212 and determines whether or not the power level stored there is equal to or less than the predetermined threshold (step S14). As a result, if it is equal to or less than the predetermined threshold (YES in step S14), the silent section that had been continuing ended at the previous frame.
  • therefore, the section from the silence start time stored in the silence start edge information 22 to the time indicated by the time information 211 of the previous frame is output to the silent section information 24 as one silent section (step S15).
  • next, the silent section detection unit 13 performs a point evaluation process (step S16), described later, on the silent section output in step S15.
  • in the point evaluation process of step S16, it is determined whether or not the times 15 seconds, 30 seconds, and 60 seconds before the last detected silent section fall within silent sections, and if so, one point is added to the information of each of the silent sections concerned.
  • This makes it possible to increase the score for silent sections that are considered to be the beginning or end of any CM.
  • this uses the property that both ends of a CM section are silent sections and that the length of one CM is 15 seconds, 30 seconds, or 60 seconds. By assigning points, the process evaluates how likely each silent section is to be an end of a CM section; as a result, it is possible to distinguish between silent sections that occur occasionally during a program and silent sections that indicate CM boundaries.
  • first, the silent section detection unit 13 acquires the start time 243 of the silent section stored last in the silent section information 24. Then, the silent section detection unit 13 searches the silent section information 24 to determine whether or not there is a silent section at the time 15 seconds before that time (step S21). As a result, if such a silent section is found (YES in step S21), the silent section detection unit 13 adds 1 to the score 242 of each of the last stored silent section and the silent section found in step S21 (step S22). On the other hand, if no silent section 15 seconds before can be found (NO in step S21), the silent section detection unit 13 proceeds to step S23 without performing step S22.
  • next, the silent section detection unit 13 determines whether or not the time 30 seconds before falls within a silent section, as in step S21 (step S23). As a result, if such a section is found (YES in step S23), the silent section detection unit 13 adds 1 to the score 242 of each of the last stored silent section and the silent section found this time (step S24). On the other hand, if no silent section 30 seconds before can be found (NO in step S23), the silent section detection unit 13 proceeds to step S25 without performing step S24. In step S25, the silent section detection unit 13 determines whether or not there is a silent section 60 seconds before, as in steps S21 and S23.
  • if such a section is found, the silent section detection unit 13 adds 1 to the score 242 as in steps S22 and S24. This completes the point evaluation process of step S16.
  • in the above description, the silent section information 24 is searched based on the start time 243 of each silent section. However, the present invention is not limited to this; the search may instead use the end time 244 of the silent section, or any time point within the silent section, as the reference.
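The point evaluation of steps S21 to S25 can be sketched roughly as follows. This is illustrative only; the matching tolerance and the data layout are assumptions, not from the patent.

```python
# Hypothetical sketch of the point evaluation process (steps S21-S25).
# When a new silent section is stored, the times 15, 30, and 60 seconds
# before its start are checked; each matching earlier silent section and
# the new section both gain one point.

TOL = 0.5  # seconds; assumed matching tolerance, not specified in the text

def evaluate_points(silent_sections):
    """silent_sections: chronological list of dicts with 'start' and 'score'."""
    last = silent_sections[-1]
    for offset in (15, 30, 60):
        target = last["start"] - offset
        for s in silent_sections[:-1]:
            if abs(s["start"] - target) <= TOL:
                s["score"] += 1      # point for the earlier silent section
                last["score"] += 1   # point for the newly detected one
                break

secs = [{"start": 0.0, "score": 0}, {"start": 15.0, "score": 0}]
evaluate_points(secs)
print(secs)  # both sections now carry one point each
```

A silent section that lines up with others at CM-like intervals accumulates points each time, which is why CM boundaries end up with high scores.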
  • next, the candidate section detection process of step S5 is described. This process detects a section whose audio power level is equal to or higher than a predetermined threshold as a digest scene candidate section.
  • FIG. 6 is a flowchart showing details of the candidate section detection process shown in step S5.
  • first, the candidate section detection unit 14 determines whether or not the audio power level calculated in step S3 is equal to or higher than a predetermined threshold (step S31). As a result, if it is equal to or higher than the predetermined threshold (YES in step S31), the candidate section detection unit 14 then determines whether or not the immediately preceding feature value 212 is also equal to or higher than the predetermined threshold (step S32). If it is not (NO in step S32), the candidate section detection unit 14 stores the time information of the frame acquired in step S2 (the frame currently being processed) in the candidate start edge information 23 (step S33). Immediately after the start of processing, nothing is stored yet in the immediately preceding feature value 212; in this case, the processing proceeds on the assumption that the value is not equal to or higher than the predetermined threshold. On the other hand, if it is equal to or higher than the predetermined threshold (YES in step S32), the candidate section is continuing, and the candidate section detection unit 14 advances the process to step S36.
  • if the audio power level calculated in step S3 is not equal to or higher than the predetermined threshold (NO in step S31), the candidate section detection unit 14 refers to the immediately preceding feature value 212 and determines whether or not the power level stored there is equal to or higher than the predetermined threshold (step S34). As a result, if it is equal to or higher than the predetermined threshold (YES in step S34), the candidate section that had been continuing ended at the previous frame. Therefore, the section from the candidate start time stored in the candidate start edge information 23 to the time indicated by the time information 211 of the previous frame is output to the candidate section information 25 as one candidate section (step S35).
  • on the other hand, if the value of the immediately preceding feature value 212 is not equal to or higher than the predetermined threshold (NO in step S34), a non-candidate section is continuing, so the candidate section detection unit 14 advances the process to step S36. It should be noted that immediately after the start of processing, nothing is stored in the immediately preceding feature value 212, so the processing proceeds on the assumption that the value is not equal to or higher than the predetermined threshold.
  • in step S36, the candidate section detection unit 14 stores the audio power level acquired in step S3 in the immediately preceding feature value 212 (step S36). This completes the candidate section detection process.
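The per-frame edge detection of FIG. 6 can be sketched as a batch version over a list of frames. This is illustrative only; the names and sample values are assumptions, not from the patent.

```python
# Minimal sketch of the candidate section detection of FIG. 6: a candidate
# section opens when the audio power level rises to or above the threshold
# and closes when it falls below it again.

def detect_candidate_sections(frames, threshold):
    """frames: chronological list of (time, power_level). Returns (start, end) pairs."""
    sections = []
    start = None          # corresponds to the candidate start edge information 23
    prev_time = None      # corresponds to the time information 211 of the previous frame
    for t, level in frames:
        if level >= threshold and start is None:
            start = t     # rising edge: remember the start time (step S33)
        elif level < threshold and start is not None:
            sections.append((start, prev_time))  # falling edge: close section (step S35)
            start = None
        prev_time = t
    return sections

frames = [(0, 1), (1, 9), (2, 10), (3, 2), (4, 8), (5, 3)]
print(detect_candidate_sections(frames, 8))  # [(1, 2), (4, 4)]
```

The silent section detection of FIG. 4 is the same state machine with the comparison inverted (level at or below the threshold opens a section).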
  • FIG. 7 is a flowchart showing details of the CM section determination process shown in step S6.
  • first, the CM section determination unit 15 searches the silent section information 24 and determines whether or not there is a silent section with a score 242 equal to or greater than a predetermined value (for example, three points) at the time 60 seconds before the current frame (step S41). That is, it is determined whether or not the time 60 seconds before falls within such a silent section.
  • the reason for searching for a silent section 60 seconds before is that in this embodiment the length of one CM section is assumed to be 60 seconds at the maximum. Accordingly, if the length of one CM section were assumed to be 30 seconds at the maximum, the search time would be 30 seconds.
  • if no such silent section exists (NO in step S41), the CM section determination unit 15 advances the process to step S46 described later.
  • if such a silent section is found (YES in step S41), the CM section determination unit 15 determines whether or not there is data in the provisional CM start edge information 26 (step S42). As a result, if there is no data in the provisional CM start edge information 26 (NO in step S42), the CM section determination unit 15 outputs the time information of the found silent section to the provisional CM start edge information 26 (step S49). On the other hand, if data already exists (YES in step S42), the CM section determination unit 15 acquires the provisional start time from the provisional CM start edge information 26 and outputs it to the CM section information 27 as the CM start time 272 associated with a CM number 271.
  • in addition, the end time of the silent section found in step S41 (that is, the silent section 60 seconds before) is output to the CM section information 27 as the CM end time 273 (step S43).
  • the CM section determination unit 15 sets the D list creation flag, which is a flag for creating a digest scene list, which will be described later, to ON (step S44).
  • the CM section determination unit 15 outputs the end time of the silent section information 60 seconds before as the start time of the provisional CM start end information 26 (step S45).
  • next, the CM section determination unit 15 determines whether or not 120 seconds have elapsed from the time stored in the provisional CM start edge information 26 (step S46). In other words, if no silent section with a score 242 equal to or greater than the predetermined value appears within 120 seconds after a silent section that may be the start of a CM is found, that silent section is judged not to be the start of a CM.
  • the reason the criterion is 120 seconds is that in this embodiment one CM section is assumed to be 60 seconds at the maximum. In other words, even after a start candidate for a CM section is found and a silent section is found 60 seconds later, an additional 60 seconds are required to determine whether or not that silent section is the end of the CM section.
  • if 120 seconds or more have elapsed as a result of the determination in step S46 (YES in step S46), the CM section determination unit 15 clears the provisional CM start edge information 26 (step S47). Subsequently, the CM section determination unit 15 sets the D list creation flag to ON (step S48). On the other hand, if 120 seconds or more have not elapsed (NO in step S46), the process is terminated as it is. This is the end of the CM section determination process.
  • points A to G are silent sections appearing at 15-second intervals, each corresponding to an end of a CM section.
  • in FIG. 8, point A is set as the temporary CM start point at the time of point E (60 seconds). At the time of point F (75 seconds), the section from point A to point B is confirmed as a CM section, and the time information of that section is output to the CM section information 27.
  • at the same time, point B becomes the new provisional CM start point. Similarly, the section from point B to point C is confirmed as a CM section and output to the CM section information 27, and point C in turn becomes the new provisional CM start point.
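The chaining of provisional CM start points illustrated by points A to G can be sketched in a simplified batch form. This sketch assumes the silent sections and their scores are already available and omits the 60-second detection delay of the actual per-frame process; all names are illustrative.

```python
# Simplified sketch of CM-section chaining: each high-score silent section
# that follows the provisional start within the 120-second limit confirms a
# CM section, and then becomes the next provisional start itself.

def chain_cm_sections(silent_times, scores, min_score=3, timeout=120):
    """silent_times: chronological times (s) of silent sections; scores: time -> score."""
    cm_sections = []
    provisional = None  # plays the role of the temporary CM start edge information 26
    for t in silent_times:
        if scores.get(t, 0) < min_score:
            continue  # low-score silent sections are ignored (step S41)
        if provisional is not None and t - provisional <= timeout:
            cm_sections.append((provisional, t))  # confirm provisional..t as a CM section
        provisional = t  # the confirmed end becomes the new provisional start
    return cm_sections

# Points A-G at 15-second intervals, all with score 3:
times = [0, 15, 30, 45, 60, 75, 90]
print(chain_cm_sections(times, {t: 3 for t in times}))
```

Run on the A-G example, this yields the successive 15-second CM sections A-B, B-C, and so on, matching the behaviour described above.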
  • FIG. 9 is a flowchart showing details of the digest scene list output process performed in step S7 above.
  • the digest list creation unit 16 determines whether or not the D list creation flag is on (step S51). As a result, if it is not on (NO in step S51), the digest list creation unit 16 ends the process as it is. On the other hand, if it is on (YES in step S51), the digest list creation unit 16 determines whether or not a new candidate section has been added to the candidate section information 25 since the digest scene list output process has been performed previously. (Step S52).
  • as a result, if no candidate section has been newly added (NO in step S52), the digest list creation unit 16 ends the digest scene list output process as it is. On the other hand, if a candidate section has been newly added since the digest scene list output process was last performed (YES in step S52), the digest list creation unit 16 acquires the information of one of the newly added candidate sections (step S53).
  • the digest list creation unit 16 determines whether or not the candidate section is included in the CM section with reference to the CM section information 27 (step S54). As a result, if it is not within the CM section (NO in step S54), the digest list creation unit 16 outputs information on the candidate section to the digest scene list 28 (step S55). On the other hand, if it is within the CM section (YES in step S54), the process proceeds to step S56. In other words, if the candidate section is also a CM section, the candidate section is not used as a digest scene.
  • next, the digest list creation unit 16 determines whether or not the above-described determination process has been performed for all of the newly added candidate sections (step S56). As a result, if an unprocessed added candidate section still remains (NO in step S56), the digest list creation unit 16 returns to step S53 and repeats the process. On the other hand, when all the added candidate sections have been processed, the digest list creation unit 16 sets the D list creation flag to OFF (step S57) and ends the digest scene list output process. This is the end of the digest scene list creation process according to the first embodiment.
  • as described above, in the first embodiment, digest candidate sections whose audio power level is equal to or higher than a predetermined value are extracted, and those corresponding to CM sections are excluded from the digest candidate sections. As a result, a digest scene list in which only the digest scenes within the program sections are extracted can be created in parallel with recording.
  • in the above description, the silent section detection unit 13 performs the silent section detection process. However, the present invention is not limited to this; the CM section determination unit 15 may detect the silent sections prior to the CM section determination process.
  • the digest scene detection is not limited to the above-described method using the audio power level. For example, a method limited to a specific program genre such as sports may be used, in which a replay (slow-motion) scene is identified and the last few cuts before it are detected as an exciting scene; a method that detects important scenes by combining text information attached to the program with video/audio signal features is also possible. Of course, any method may be used as long as it detects digest scenes; the method is not limited to these examples.
  • likewise, the detection of the CM section is not limited to the method using the audio power level as described above. For example, scene change points of the video may be detected from the luminance information of the video, and the CM section may be determined based on their occurrence interval. In this case, the luminance information of the video may be used as the feature quantity.
  • the above-described digest list may be used to catch up and reproduce the program during program recording.
  • the user instructs catch-up reproduction.
  • when catch-up reproduction is instructed, the playback control unit 18 determines whether or not two minutes have elapsed since the start of recording. If two minutes or more have elapsed, only the digest scenes are played back using the digest list generated by the above-described processing. On the other hand, if two minutes have not yet elapsed, the playback control unit 18 performs fast playback (for example, playback at 1.5 times normal speed). After that, if the fast playback catches up with the actual broadcast, the fast playback may be stopped and the output switched to the real-time broadcast.
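The two-minute branching described above amounts to a simple decision. The sketch below is illustrative only; the mode names are made up, and the 2-minute threshold and 1.5x speed are the values given in the text.

```python
# Hypothetical sketch of the catch-up reproduction branch: within the first
# two minutes of recording, fast playback at 1.5x is used; after that,
# digest-scene playback based on the list built so far.

def choose_playback_mode(seconds_since_recording_start):
    if seconds_since_recording_start >= 120:
        return "digest"     # play back only the listed digest scenes
    return "fast_1.5x"      # fast playback until it catches up with live

print(choose_playback_mode(60))   # fast_1.5x
print(choose_playback_mode(300))  # digest
```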
  • subsequent playback may be left to the user's instruction.
  • the digest scenes may be played back normally, or playback may be performed with some scenes thinned out.
  • the playback control unit 18 plays back the digest scenes so as to finish within 10 minutes, based on the digest scene list created at that time. Viewing after the digest scenes have been played back waits for the user's instruction.
  • alternatively, the portion of the program broadcast during the 10-minute digest scene playback may itself be played back in thinned-out form in response to a user instruction.
  • the playback control unit 18 ends the playback process in response to a user instruction.
  • the digest scene list is generated in parallel with the recording, so that digest playback can be performed at any timing during recording.
  • the digest scene information is created by subtracting the CM section from the digest candidate section.
  • the section to be subtracted from the digest candidate section is not limited to the CM section.
  • a section where a still image is displayed may be detected and subtracted.
  • for example, when a program contains material that cannot be broadcast, the program is edited before broadcast so that a still image (for example, one displaying "Cannot be displayed") is shown instead. Therefore, a feature quantity of a still image (for example, a motion vector of 0 in the video) is detected, and a still image section in which the still image is continuously displayed is detected.
  • the digest scene information may be created by subtracting the still image section (that is, the broadcast-prohibited section) from the digest candidate section. If a section having a predetermined feature, such as a CM section or a still image section, is detected as a specific section and the specific section is subtracted from the digest candidate section, a digest list in which only digest scenes are appropriately extracted can be generated.
  • FIG. 10 is a block diagram showing a configuration of the digest generation device 30 according to the second exemplary embodiment of the present invention.
  • the feature quantity calculation unit 12 associates the calculated feature quantity with the time information and stores them in the temporary storage unit 31 as the temporarily accumulated feature quantity 36.
  • the temporary storage unit 31 has a capacity to hold frame feature values and time information for a predetermined time.
  • the digest list creation unit 32 detects a digest scene from a section other than the CM section based on the feature amount stored in the CM section information 27 and the temporary storage unit 31, and creates the digest scene list 28. Except for these, the digest generation device 30 according to the present embodiment basically has the same configuration as that of the first embodiment described above. Therefore, the same portions are denoted by the same reference numerals, and detailed description thereof is omitted.
  • in the second embodiment, the temporarily accumulated feature quantity 36, the information 37 immediately before the digest, and the digest start edge information 38 are used.
  • the temporarily accumulated feature quantity 36 is used for detecting a digest scene, and has time information 361 and a feature quantity 362.
  • the time information 361 stores frame time information.
  • the feature quantity 362 stores the feature quantity (in this embodiment, the audio power level) calculated by the feature quantity calculation unit 12 and used for digest scene detection.
  • Information immediately before digest 37 (FIG. 11 (B)) is also used for detecting a digest scene, and has time information 371 immediately before digest and feature amount 372 immediately before digest.
  • the time information immediately before the digest 371 stores the time information related to the frame immediately before the current frame to be processed.
  • the feature value 372 immediately before the digest stores the feature value (audio power level) of the immediately preceding frame.
  • the digest start end information 38 (FIG. 11C) has a digest start end time and is used to detect a digest scene.
  • FIG. 12 is a flowchart showing the detailed operation of the digest scene list creation process according to the second embodiment.
  • the processing of steps S61 and S62 is the same as the processing of steps S1 and S2 described with reference to FIG. 3 in the first embodiment, and detailed description thereof is therefore omitted here.
  • the feature quantity calculation process of step S63 is the same as the process of step S3 described with reference to FIG. 3 in the first embodiment, except that the calculated feature quantity is also output to the temporary storage unit 31; detailed description is therefore omitted.
  • the silent section detection process of step S64 is the same as the process of step S4 described with reference to FIG. 4 in the first embodiment, except that the feature quantity (audio power level) calculated in step S63 is stored in the immediately preceding feature value 212 at the end of the process; detailed description is therefore omitted.
  • in step S65, the CM section determination unit 15 performs the CM section determination process and creates the CM section information. Since the operation in step S65 is the same as the process of step S6 described with reference to FIG. 7 in the first embodiment, detailed description thereof is omitted.
  • next, in step S66, the digest list creation unit 32 performs the digest list output process.
  • FIG. 13 is a flowchart showing details of the digest list output process shown in step S66.
  • the digest list creation unit 32 determines whether or not 120 seconds of frame feature values have been stored in the temporarily accumulated feature quantity 36 (step S71).
  • this is because the maximum length of a CM section is assumed to be 60 seconds; for example, when a 60-second CM section occurs at the beginning of the program, it takes up to 120 seconds to determine the CM section. Therefore, this process is not performed for at least 120 seconds from the start of the program.
  • if 120 seconds of feature values have not yet been accumulated as a result of the determination in step S71 (NO in step S71), the digest list output process ends. On the other hand, if they have been accumulated (YES in step S71), the digest list creation unit 32 acquires the oldest time information 361 and feature quantity 362 from the temporarily accumulated feature quantity 36 (step S72).
  • next, the digest list creation unit 32 refers to the CM section information and determines whether or not the time indicated by the time information 361 acquired in step S72 falls within a CM section (step S73). If it falls within a CM section (YES in step S73), the digest list creation unit 32 ends the digest list output process. On the other hand, if it does not (NO in step S73), the digest list creation unit 32 determines whether or not the value of the feature quantity 362 is equal to or greater than a predetermined value (step S74).
  • if it is equal to or greater than the predetermined value (YES in step S74), the digest list creation unit 32 determines whether or not the feature value 372 immediately before the digest is also equal to or greater than the predetermined value (step S75). That is, the change in the audio power level between the frame acquired in step S72 and the frame immediately before it is determined. As a result, if the feature value 372 immediately before the digest is not equal to or greater than the predetermined value (NO in step S75), the time information of the frame is saved in the digest start edge information 38 (step S76). At the time of the first pass, nothing is yet stored in the feature value 372 immediately before the digest.
• if the result of the determination in step S75 is that the feature quantity 372 immediately before the digest is greater than or equal to the predetermined value (YES in step S75), the digest list creation unit 32 proceeds to step S77 without performing the process of step S76.
• if the result of the determination in step S74 is that the value of the feature quantity 362 is not greater than or equal to the predetermined value (NO in step S74), the digest list creation unit 32 determines whether or not the feature quantity 372 immediately before the digest is greater than or equal to the predetermined value (step S78). If it is not (NO in step S78), the digest list creation unit 32 ends the digest list generation process.
• if, in step S78, the feature quantity 372 immediately before the digest is greater than or equal to the predetermined value (YES in step S78), the digest scene that had been continuing ended at the previous frame. Therefore, the section from the digest start time indicated by the digest start end information 38 to the time indicated by the time information 371 immediately before the digest is output to the digest scene list 28 as one digest section (step S79).
• finally, the digest list creation unit 32 saves the audio power level of the frame as the feature quantity 372 immediately before the digest (step S77). This completes the digest scene list creation process according to the second embodiment.
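The per-frame flow of steps S71 through S79 above can be sketched as follows. This is an illustrative sketch only: the class name, the one-feature-value-per-second timing, and the numeric threshold are assumptions, while the 120-second buffer and the step structure follow the description.

```python
from collections import deque

class DigestListCreator:
    """Sketch of the per-frame digest list update (steps S71-S79).

    Assumptions (illustrative): one feature value arrives per second,
    CM sections are supplied as (start, end) second pairs, and a frame
    belongs to a digest scene when its audio power level >= THRESHOLD.
    """
    BUFFER_SECONDS = 120   # delay needed before a CM judgment is settled
    THRESHOLD = 0.5        # illustrative audio power threshold

    def __init__(self):
        self.buffer = deque()       # temporarily stored feature quantities
        self.prev_power = None      # feature quantity "immediately before"
        self.digest_start = None    # digest start end information
        self.digest_list = []       # resulting (start, end) digest sections

    @staticmethod
    def in_cm(t, cm_sections):
        return any(s <= t < e for s, e in cm_sections)

    def on_frame(self, t, power, cm_sections):
        self.buffer.append((t, power))
        if len(self.buffer) < self.BUFFER_SECONDS:          # S71: wait 120 s
            return
        ft, fp = self.buffer.popleft()                      # S72: oldest frame
        if self.in_cm(ft, cm_sections):                     # S73: skip CM frames
            return
        prev_high = (self.prev_power is not None
                     and self.prev_power >= self.THRESHOLD)
        if fp >= self.THRESHOLD:                            # S74
            if not prev_high:                               # S75 -> S76
                self.digest_start = ft                      # digest scene begins
        elif prev_high:                                     # S78 -> S79
            self.digest_list.append((self.digest_start, ft))
        self.prev_power = fp                                # S77
```

Feeding 300 one-second frames whose power is high only from 130 s to 139 s yields a single digest section (130, 140) once the 120-second delay has elapsed; if (130, 140) is judged to be a CM section, no digest section is produced.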
• as described above, the CM section can be detected in parallel with the recording of the program, and digest scenes can be detected from the program sections other than the CM sections. This eliminates the need to perform a separate digest scene list generation process after the recording of the program is completed, removing that processing time and providing a comfortable viewing environment for the user.
• each of the above-described embodiments may be provided in the form of a recording medium storing a program to be executed by a computer.
• in that case, the digest generation program stored in the recording medium is read, and the digest generation device (more precisely, a control unit, not shown) may perform the processes shown in FIGS.
• a digest generation device, a digest generation method, a recording medium storing a digest generation program, and an integrated circuit used in the digest generation device according to the present invention can generate digest scene information while recording a program, and are therefore useful for applications such as HDD recorders and DVD recorders.

Abstract

A feature amount calculation unit (12) calculates a feature amount from a received AV signal. A soundless section detection unit (13) detects a section where the audio power level is below a predetermined value as a soundless section. A candidate section detection unit (14) detects a section where the audio power level is above a predetermined value as a digest scene candidate section. A CM section judgment unit (15) judges CM sections according to the time intervals between the soundless sections. A digest list creation unit (16) deletes the sections corresponding to the judged CM sections from the digest candidate sections, thereby generating digest scene information for the program sections excluding the CM sections.
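The pipeline the abstract describes can be sketched as follows, assuming one audio power sample per second. The function names, both thresholds, and the 15/30-second CM spacing rule (taken from the background discussion of standard CM lengths) are illustrative assumptions.

```python
def detect_silent_points(power, threshold=0.1):
    """Soundless section detection: times whose audio power level is
    below a predetermined value."""
    return [t for t, p in enumerate(power) if p < threshold]

def judge_cm_sections(silent_points, spacings=(15, 30)):
    """CM section judgment: a span between two soundless points is
    judged to be a CM when their time interval matches a standard CM
    length (15 or 30 seconds)."""
    sections = []
    for i, s in enumerate(silent_points):
        for e in silent_points[i + 1:]:
            if e - s in spacings:
                sections.append((s, e))
    return sections

def detect_candidates(power, threshold=0.8):
    """Candidate section detection: times whose audio power level is
    above a predetermined value are digest scene candidates."""
    return [t for t, p in enumerate(power) if p >= threshold]

def digest_scene_info(power):
    """Delete candidates inside judged CM sections, leaving digest
    scene information for the program sections excluding the CMs."""
    cms = judge_cm_sections(detect_silent_points(power))
    in_cm = lambda t: any(s <= t <= e for s, e in cms)
    return [t for t in detect_candidates(power) if not in_cm(t)]
```

For example, with silence at 0 s and 15 s and loud audio at 5 s and 40 s, the span (0, 15) is judged to be a CM, so only the candidate at 40 s survives.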

Description

Digest generation device, digest generation method, recording medium storing a digest generation program, and integrated circuit used in a digest generation device
Technical Field
[0001] The present invention relates to the generation of digest scenes, and more particularly to digest scene generation in which video and audio feature quantities are calculated from television broadcasts and the like and used to determine specific important scenes.
Background Art
[0002] Conventionally, there are digest (summary) generation devices that calculate feature quantities of video and audio from television broadcasts and the like and use them to determine important scenes. In such devices, the following scheme is commonly used for digest generation: the video and audio feature quantities of one program are first calculated from an AV signal that has already been recorded on a recording medium, CM sections are detected based on those feature quantities, and time information such as a playlist for digest playback is calculated from the sections other than the CM sections.
[0003] The configuration of a conventional digest generation device that employs this scheme will be described with reference to FIG. 14, which shows an example configuration of a digest generation device that generates a digest excluding CM sections. In FIG. 14, a receiving unit 101 receives a broadcast radio wave and demodulates it into an audio/video signal (hereinafter, AV signal). A large-capacity storage medium 102, such as an HDD, records the received AV signal. A feature quantity extraction unit 103 calculates, from the AV signal stored in the large-capacity storage medium 102, the feature quantities required for digest generation (hereinafter, digest feature quantities) and the feature quantities required for CM detection (hereinafter, CM feature quantities). Here, possible digest feature quantities include scene change detection results based on motion vectors or luminance information, audio power, and text information attached to the program; possible CM feature quantities include scene change detection results based on luminance information and information on silent audio sections. A CM detection unit 104 detects CM sections (time information of their start and end) based on the calculated CM feature quantities and outputs them to a digest detection unit 105. As the CM section detection method, either a video scene change is detected from the luminance information of the video and a span is judged to be a CM section if the detected time interval is a fixed length (15 or 30 seconds), or silent audio parts are detected and their time intervals are examined in the same way. The digest detection unit 105 detects digest scenes outside the CM sections based on the digest feature quantities and the CM section information output from the CM detection unit 104, and outputs the detected digest scenes (time information of their start and end) to a playback control unit 106 as digest information. Known digest scene detection methods include, for sports broadcasts and the like, identifying slow-motion scenes (repeated slow-motion scenes) from the motion vectors of the video and detecting the few cuts immediately before them as exciting scenes (for example, Patent Document 1); detecting scenes where the audio power information takes a locally large value as exciting scenes (for example, Patent Document 2); and detecting important scenes by combining text information attached to the program with feature quantities of the video and audio signals (for example, Patent Document 3). The playback control unit 106 reads the AV signal from the large-capacity storage medium 102 and performs digest playback based on the digest information. With such a configuration, when a user views a recorded program, that is, when the AV signal stored in the large-capacity storage medium 102 is played back, digest scene information can be created from the program sections excluding the CM sections and digest playback can be performed.
There is also a scheme in which feature quantities are calculated in parallel with the recording of a program and stored on the recording medium. FIG. 19 shows an example configuration of a digest generation device that detects digest scene candidates in real time while calculating feature quantities in parallel with the recording process, stores them together with the CM feature quantities in a large-capacity storage means, and at playback time detects the CM sections and excludes from the digest scene candidates those contained in the CM sections, thereby generating correct digest information. In FIG. 19, the receiving unit 101 records the received AV signal on the large-capacity storage medium 102 and also outputs the AV signal to the feature quantity extraction unit 103. The feature quantity extraction unit 103 calculates the CM feature quantities and stores them in the large-capacity storage medium 102; at the same time, it outputs the digest feature quantities, such as the audio power level, to the digest detection unit 105. The digest detection unit 105 analyzes the digest feature quantities, detects, for example, scenes whose audio power level is equal to or higher than a predetermined threshold as digest scene candidates, and stores the detected scenes in the large-capacity storage medium 102 as digest candidate information. In other words, scenes that are digest candidates are detected in parallel with the program recording, and the digest candidate information (time information) and the CM feature quantities are recorded in the large-capacity storage medium 102. As for CM detection, since the start and end of a CM section cannot be identified in real time, only the CM feature quantities needed for the later detection process are recorded here. Then, when the recorded program is played back at the user's instruction, the CM detection unit 104 reads the CM feature quantities from the large-capacity storage medium 102, detects the CM sections, and outputs the detection results to a CM section removal unit 107 as CM section information. The CM section removal unit 107 deletes the portions corresponding to the CM sections from the digest candidate information read from the large-capacity storage medium 102 and creates the digest information. That is, at recording time, scenes whose audio power level is, for example, equal to or higher than a predetermined value are provisionally detected, CM sections included, and recorded as digest candidate information; after recording ends, for example when a playback start instruction is received, the feature quantities of the entire recorded program are analyzed to detect the CM sections, and the digest sections within the program sections are extracted by subtracting the CM sections from the digest candidates.
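The CM section removal performed by the CM section removal unit 107, subtracting the judged CM sections from the digest candidate sections recorded during recording, reduces to interval subtraction. A minimal sketch, with illustrative names:

```python
def remove_cm_sections(candidates, cm_sections):
    """Subtract CM sections from digest candidate sections.

    Both arguments are lists of (start, end) time pairs; the result
    keeps only the parts of each candidate lying outside every CM
    section. Names and representation are illustrative assumptions.
    """
    result = []
    for start, end in candidates:
        pieces = [(start, end)]
        for cs, ce in cm_sections:
            next_pieces = []
            for s, e in pieces:
                if ce <= s or e <= cs:      # no overlap: keep whole piece
                    next_pieces.append((s, e))
                else:                        # clip out the CM overlap
                    if s < cs:
                        next_pieces.append((s, cs))
                    if ce < e:
                        next_pieces.append((ce, e))
            pieces = next_pieces
        result.extend(pieces)
    return result
```

For example, a candidate section (10, 50) with a CM judged at (20, 30) is split into (10, 20) and (30, 50), while a candidate lying entirely inside a CM section is removed.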
Patent Document 1: JP 2004-128550 A
Patent Document 2: JP H10-039890 A
Patent Document 3: JP 2001-119649 A
Disclosure of the Invention
Problems to Be Solved by the Invention
However, the digest generation devices described above have the following problems. In the first scheme, the processes of feature quantity calculation, CM section detection, digest scene detection, and digest information creation are performed after recording ends, for example at the timing when the user instructs digest playback to start. Consequently, after the digest playback start instruction, a wait for the above processing occurs before playback actually starts. The second scheme calculates feature quantities and detects digest candidate scene information during recording, so it reduces the time spent on the feature quantity calculation that the first scheme performs at playback instruction time. However, since the start and end of a CM section cannot be determined in real time, CM section detection is still performed after recording ends (for example, when a playback start instruction is given). Therefore, even with this scheme, a processing wait for digest information creation occurs. In particular, consumer devices such as typical DVD recorders are generally equipped with CPUs having only about one-tenth the performance of a personal computer, so the processing wait becomes longer, and the discomfort caused by the wait gives the user a bad impression, such as poor usability.
[0006] Therefore, an object of the present invention is to provide a digest generation device that has no processing wait for generating the digest information of a program after the recording of the program ends.

Means for Solving the Problems
[0007] In order to achieve the above object, the present invention employs the following configurations.
[0008] A first aspect is a digest generation device that generates digest scene information about a broadcast program when the broadcast signal of the program is received and recorded on a recording medium, comprising a feature amount calculation unit, a specific section end detection unit, and a digest scene information creation unit. Each time a broadcast signal of a predetermined unit time is received, the feature amount calculation unit calculates, from the received unit time's worth of the broadcast signal, at least one type of feature amount indicating a feature of at least one of the video and the audio included in the broadcast signal. The specific section end detection unit detects a time point that becomes the start or end of a specific section by determining, each time a feature amount is calculated, whether or not a predetermined time point included in the signal portion of the received broadcast signal for which feature amounts have already been calculated is the start or end of a specific section. Each time a feature amount is calculated, the digest scene information creation unit determines, based on the feature amount, whether or not the broadcast signal in the sections of the entire program excluding the specific sections is a digest scene, and generates digest scene information.
[0009] In a second aspect based on the first aspect, the digest scene information creation unit includes a digest section detection unit that detects digest candidate sections in the received AV signal by determining, each time a feature amount is calculated for a unit time's worth of the AV signal, whether or not the content included in that unit time's worth of the AV signal is a digest scene, based on the feature amount. Further, each time a pair of the start and end of a specific section is detected by the specific section end detection unit, the digest scene information creation unit determines whether or not the specific section from that start to that end overlaps a digest candidate section, and generates, as the digest scene information, information indicating the digest candidate sections detected by the digest section detection unit excluding those that overlap the specific section.
[0010] In a third aspect based on the first aspect, the digest scene information creation unit includes a temporary storage unit that stores the calculated feature amounts for a predetermined time back from the latest calculation time. Each time a feature amount is calculated, the digest scene information creation unit determines whether or not the time point of the feature amount stored in the temporary storage unit falls between the start and the end of a specific section detected by the specific section end detection unit, and only when it does not, detects the content that is a digest scene among the content included in the unit time's worth of the broadcast signal and generates the digest scene information.
[0011] In a fourth aspect based on the second aspect, the feature amount calculation unit calculates first and second feature amounts, the specific section end detection unit determines the start or end of a specific section based on the first feature amount, and the digest section detection unit detects the digest candidate sections based on the second feature amount.
[0012] In a fifth aspect based on the first aspect, the specific section end detection unit includes a specific section candidate detection unit that, when a feature amount satisfies a predetermined condition, detects the section consisting only of feature amounts satisfying that condition as a specific section candidate, and a specific section determination unit that detects candidates for the start or end of a specific section based on the time differences between specific section candidates within the program.
[0013] In a sixth aspect based on the fifth aspect, each time a specific section candidate is detected, if the time point a predetermined time before the detected specific section candidate is included in an already-detected specific section candidate, the specific section determination unit detects that time point a predetermined time before as the start of a specific section and the newly detected specific section candidate as the end of the specific section.
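The sixth aspect's rule can be sketched as follows, assuming specific section candidates are detected as single integer time points (for example, silent points) and taking 15 seconds as the predetermined time; both assumptions are illustrative.

```python
PREDETERMINED = 15  # seconds; e.g. a standard CM length (assumption)

def on_candidate_detected(t, detected, sections):
    """When a specific section candidate is detected at time t, check
    the time point PREDETERMINED seconds earlier. If it was already
    detected as a candidate, record it as the start and t as the end
    of a specific (CM) section. Integer second times are assumed."""
    if t - PREDETERMINED in detected:
        sections.append((t - PREDETERMINED, t))
    detected.add(t)
```

With candidates arriving at 0 s, 15 s, and 40 s, only the pair (0, 15) matches the predetermined spacing and is recorded as a specific section.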
[0014] In a seventh aspect based on the fifth aspect, the specific section determination unit comprises: a determination unit that, each time a specific section candidate is detected, determines whether or not an already-detected specific section candidate exists at the time point a predetermined first time before, or a predetermined second time before, the most recently detected specific section candidate; an addition unit that, when the determination unit determines that such a specific section candidate exists, adds points to each of the specific section candidate determined to exist and the most recently detected specific section candidate; a start determination unit that, each time a predetermined third time elapses after a target candidate whose points are equal to or greater than a predetermined value is detected, determines whether or not a specific section candidate whose points are equal to or greater than that predetermined value exists at the time point the third time before the target candidate, and, if none exists, sets the target candidate as the start of a specific section; and an end determination unit that, each time the predetermined third time elapses after a target candidate whose points are equal to or greater than the predetermined value is detected, determines whether or not a specific section candidate whose points are equal to or greater than that predetermined value exists at the time point at which the third time has elapsed, and, if none exists, sets the target candidate as the end of the specific section.
[0015] In an eighth aspect based on the fifth aspect, the feature amount calculation unit calculates the audio power level of the audio signal as the feature amount, and the specific section candidate detection unit detects silent sections whose power level is equal to or below a predetermined value as the specific section candidates.
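Under the eighth aspect, candidate detection reduces to comparing each audio power sample against the predetermined value. A sketch, assuming one sample per second and an illustrative threshold:

```python
def detect_silent_candidates(power_levels, threshold=0.1):
    """Eighth aspect: times whose audio power level is at or below the
    predetermined value are specific section candidates (silence).
    The 0.1 threshold and per-second sampling are assumptions."""
    return [t for t, p in enumerate(power_levels) if p <= threshold]
```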
[0016] In a ninth aspect based on the fifth aspect, the feature amount calculation unit calculates luminance information based on the video signal as the feature amount, and the specific section candidate detection unit detects scene change points where the amount of change in the luminance information is equal to or greater than a predetermined value as the specific section candidates.
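Likewise, the ninth aspect's scene change test compares the luminance change between consecutive frames against the predetermined value. A sketch, where the per-frame average luminance values and the threshold are illustrative assumptions:

```python
def detect_scene_changes(luma, threshold=30.0):
    """Ninth aspect: frame i is a scene change point (specific section
    candidate) when the change in luminance information from the
    previous frame is at or above the predetermined value."""
    return [i for i in range(1, len(luma))
            if abs(luma[i] - luma[i - 1]) >= threshold]
```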
[0017] A tenth aspect is a digest generation method for generating digest scene information about a broadcast program when the broadcast signal of the program is received and recorded on a recording medium, comprising a feature amount calculation step, a specific section end detection step, and a digest scene information creation step. In the feature amount calculation step, each time a broadcast signal of a predetermined unit time is received, at least one type of feature amount indicating a feature of at least one of the video and the audio included in the broadcast signal is calculated from the received unit time's worth of the broadcast signal. In the specific section end detection step, a time point that becomes the start or end of a specific section is detected by determining, each time a feature amount is calculated, whether or not a predetermined time point included in the signal portion of the received broadcast signal for which feature amounts have already been calculated is the start or end of a specific section. In the digest scene information creation step, each time a feature amount is calculated, whether or not the broadcast signal in the sections of the entire program excluding the specific sections is a digest scene is determined based on the feature amount, and digest scene information is generated.
[0018] In an eleventh aspect based on the tenth aspect, the digest scene information creation step includes a digest section detection step of detecting digest candidate sections in the received broadcast signal by determining, each time a feature amount is calculated for a unit time's worth of the broadcast signal, whether or not the content included in that unit time's worth of the broadcast signal is a digest scene, based on the feature amount. Further, in the digest scene information creation step, each time a pair of the start and end of a specific section is detected by the specific section end detection step, whether or not the specific section from that start to that end overlaps a digest candidate section is determined, and information indicating the digest candidate sections detected by the digest section detection step excluding those that overlap the specific section is generated as the digest scene information.
[0019] In a twelfth aspect based on the tenth aspect, the digest scene information creation step includes a temporary storage step of storing the calculated feature amounts for a predetermined time back from the latest calculation time. Further, each time a feature amount is calculated, in the digest scene information creation step it is determined whether or not the time point of the feature amount stored by the temporary storage step falls between the start and the end of a specific section detected by the specific section end detection step, and only when it does not, the content that is a digest scene among the content included in the unit time's worth of the AV signal is detected and the digest scene information is generated.
[0020] A thirteenth aspect is a recording medium storing a digest generation program to be executed by the computer of a digest generation device that generates digest scene information about a broadcast program when the broadcast signal of the program is received and recorded on a recording medium, the program comprising a feature amount calculation step, a specific section end detection step, and a digest scene information creation step. The feature amount calculation step is a process of calculating, each time a broadcast signal of a predetermined unit time is received, at least one type of feature amount indicating a feature of at least one of the video and the audio included in the broadcast signal from the received unit time's worth of the broadcast signal. The specific section end detection step is a process of detecting a time point that becomes the start or end of a specific section by determining, each time a feature amount is calculated, whether or not a predetermined time point included in the signal portion of the received broadcast signal for which feature amounts have already been calculated is the start or end of a specific section. The digest scene information creation step is a process of determining, each time a feature amount is calculated and based on that feature amount, whether or not the broadcast signal in the sections of the entire program excluding the specific sections is a digest scene, and generating digest scene information.
[0021] In a fourteenth aspect based on the thirteenth aspect, the digest scene information creation step includes a digest section detection step of detecting digest candidate sections in the received broadcast signal by determining, each time a feature amount is calculated for a unit time's worth of the broadcast signal, whether or not the content included in that unit time's worth of the broadcast signal is a digest scene, based on the feature amount. Further, in the digest scene information creation step, each time a pair of the start and end of a specific section is detected by the specific section end detection step, whether or not the specific section from that start to that end overlaps a digest candidate section is determined, and information indicating the digest candidate sections detected by the digest section detection step excluding those that overlap the specific section is generated as the digest scene information.
[0022] In a fifteenth aspect according to the thirteenth aspect, the digest scene information creation step includes a temporary storage step of storing the calculated feature amounts going back a predetermined time from the most recent calculation time. Further, each time a feature amount is calculated, the digest scene information creation step determines whether the time point corresponding to a feature amount stored by the temporary storage step falls between the start and the end of a specific section detected by the specific section end detection step, and only when it does not, detects the content that is a digest scene among the content contained in the unit time of the AV signal and generates the digest scene information.
[0023] A sixteenth aspect is an integrated circuit used in a digest generation device that generates digest scene information about a broadcast program while the broadcast signal of that program is received and recorded on a recording medium, and comprises a feature amount calculation unit, a specific section end detection unit, and a digest scene information creation unit. Each time a broadcast signal of a predetermined unit time is received, the feature amount calculation unit calculates, from the received unit-time broadcast signal, at least one type of feature amount indicating a characteristic of at least one of the video and the audio contained in that signal. The specific section end detection unit detects a time point that is the start or the end of a specific section by determining, each time a feature amount is calculated, whether a predetermined time point included in the portion of the received broadcast signal for which feature amounts have already been calculated is the start or the end of a specific section. Each time a feature amount is calculated, the digest scene information creation unit determines, based on that feature amount, whether the broadcast signal in the sections of the program excluding the specific sections is a digest scene, and generates the digest scene information.
[0024] In a seventeenth aspect according to the sixteenth aspect, the digest scene information creation unit includes a digest section detection unit that detects digest candidate sections in the received broadcast signal by determining, each time a feature amount is calculated for a unit time of the broadcast signal, whether the content contained in that unit time of the broadcast signal is a digest scene based on that feature amount. Further, each time a pair consisting of the start and the end of a specific section is detected by the specific section end detection unit, the digest scene information creation unit determines whether the specific section from that start to that end overlaps any digest candidate section, and generates, as the digest scene information, information indicating the digest candidate sections detected by the digest section detection unit excluding those that overlap the specific section.
[0025] In an eighteenth aspect according to the sixteenth aspect, the digest scene information creation unit includes a temporary storage unit that stores the calculated feature amounts going back a predetermined time from the most recent calculation time. Further, each time a feature amount is calculated, the digest scene information creation unit determines whether the time point corresponding to a feature amount stored in the temporary storage unit falls between the start and the end of a specific section detected by the specific section end detection unit, and only when it does not, detects the content that is a digest scene among the content contained in the unit time of the AV signal and generates the digest scene information.
Effects of the Invention
[0026] According to the first invention described above, a specific section (for example, a CM section) can be detected while the program is being recorded, so that digest scene information excluding the specific sections can be generated in parallel with recording the program. This eliminates the processing wait for generating the digest scene information after recording ends, providing the user with a comfortable digest playback operation. Furthermore, even when chasing playback is performed while the program is still being recorded, digest playback can be carried out up to a point close to the current recording position, providing a still more convenient playback environment. [0027] According to the second and third inventions described above, the same effects as the first invention can be obtained.
[0028] According to the fourth invention described above, two types of feature amounts can be used. A feature amount suited to detecting the specific sections and a feature amount suited to detecting the digest sections can therefore each be employed, so that the specific sections and the digest sections can be detected more accurately.
[0029] According to the fifth and sixth inventions described above, the specific sections are determined based on the time intervals between specific section candidates, so that the specific sections can be determined more accurately.
[0030] According to the seventh invention described above, each specific section candidate is scored based on predetermined time intervals, so that its likelihood of being the start or the end of a specific section can be evaluated. Furthermore, since a specific section candidate with a high score is taken as the start or the end of a specific section, a specific section candidate that merely happens to occur within the program can be prevented from being erroneously determined to be the start or the end of a specific section. As a result, digest scene information that excludes the specific sections can be created more accurately.
[0031] According to the eighth invention described above, silent sections are taken as specific section candidates. This allows accurate detection of specific sections, such as CM sections, by exploiting the property that such sections begin and end with silent sections.
[0032] According to the ninth invention described above, scene change points at which the luminance information changes greatly are taken as specific section candidates. The transition from the program, where the luminance information changes greatly, to a specific section can therefore be treated as a specific section candidate, and as a result the specific sections can be determined more accurately.
[0033] According to the tenth to eighteenth inventions described above, the same effects as the first invention can be obtained.
Brief Description of the Drawings
[0034] [Fig. 1] Fig. 1 is a block diagram showing the configuration of a digest generation device 10 according to the first embodiment.
[Fig. 2] Fig. 2 is a diagram showing an example of data used in the present invention.
[Fig. 3] Fig. 3 is a flowchart showing the digest scene list generation process.
[Fig. 4] Fig. 4 is a flowchart showing details of the silent section detection process shown in step S4 of Fig. 3.
[Fig. 5] Fig. 5 is a flowchart showing details of the point evaluation process shown in step S16 of Fig. 4.
[Fig. 6] Fig. 6 is a flowchart showing details of the candidate section detection process shown in step S5 of Fig. 3.
[Fig. 7] Fig. 7 is a flowchart showing details of the CM section determination process shown in step S6 of Fig. 3.
[Fig. 8] Fig. 8 is a diagram showing an example of CM section determination in the CM section determination process.
[Fig. 9] Fig. 9 is a flowchart showing details of the digest scene list output process shown in step S7 of Fig. 3.
[Fig. 10] Fig. 10 is a block diagram showing the configuration of a digest generation device 10 according to the second embodiment.
[Fig. 11] Fig. 11 is a diagram showing an example of data used in the present invention.
[Fig. 12] Fig. 12 is a flowchart showing the digest scene list generation process according to the second embodiment.
[Fig. 13] Fig. 13 is a flowchart showing details of the silent section detection process shown in step S66 of Fig. 12.
[Fig. 14] Fig. 14 is a block diagram showing the configuration of a conventional recording/playback device.
[Fig. 15] Fig. 15 is a block diagram showing the configuration of a conventional recording/playback device.
Description of Reference Numerals
10, 30 Digest generation device
11 Receiving unit
12 Feature amount calculation unit
13 Silent section detection unit
14 Candidate section detection unit
15 CM section determination unit
16, 32 Digest list creation unit
17 Large-capacity recording medium
18 Playback control unit
21 Comparison feature amount information
22 Silence start information
23 Candidate start information
24 Silent section information
25 Candidate section information
26 Provisional CM start information
27 CM section information
28 Digest scene list
31 Temporary storage unit
36 Temporarily stored feature amounts
37 Immediately-before-digest information
38 Digest start information
BEST MODE FOR CARRYING OUT THE INVENTION
[0036] The present invention creates a digest scene list, indicating the positions of digest scenes, in parallel with recording a program. In the present embodiment described below, a scene in which the audio power level takes a locally large value, that is, an exciting scene, is adopted as a digest scene. Accordingly, scenes whose audio power level is at or above a predetermined value are extracted as digest candidate sections. Meanwhile, sections whose audio power level is at or below a predetermined value are extracted as silent sections, and sections in which such silent sections appear at predetermined intervals (for example, 15-second intervals) are extracted as CM sections. This is because a CM section begins and ends with a silent section and has a fixed length, so a portion in which silent sections appear at regular intervals can be considered to be a CM section. Then, each time one CM section is extracted, the information corresponding to that CM section is removed from the digest candidate section information, thereby creating a digest scene list indicating the digest scenes within the program sections. In the present embodiment, the length of one CM section is assumed to be at most 60 seconds.
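The final step described above, removing each newly detected CM section from the digest candidates, can be sketched as follows. This is an illustrative simplification, not code from the patent: the function name and the `(start, end)` tuple representation are assumptions, and, following the overlap rule used here, any candidate that overlaps the CM section is dropped entirely.

```python
def exclude_cm(candidates, cm):
    """Drop every digest candidate section that overlaps the given CM
    section. Sections are (start, end) pairs in seconds."""
    cm_start, cm_end = cm
    # Keep only the candidates that lie entirely outside the CM section.
    return [(s, e) for (s, e) in candidates
            if e <= cm_start or s >= cm_end]

# The first candidate overlaps the 120-135 s CM section and is dropped.
print(exclude_cm([(100.0, 130.0), (200.0, 210.0)], (120.0, 135.0)))
# -> [(200.0, 210.0)]
```

Running this each time a CM section is confirmed keeps the digest scene list up to date while recording continues.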
[0037] (First Embodiment)
Fig. 1 is a block diagram showing the configuration of the digest generation device according to the first embodiment of the present invention. In Fig. 1, the digest generation device 10 comprises a receiving unit 11, a feature amount calculation unit 12, a silent section detection unit 13, a candidate section detection unit 14, a CM section determination unit 15, a digest list creation unit 16, a large-capacity recording medium 17, and a playback control unit 18.
[0038] The receiving unit 11 receives broadcast radio waves and demodulates them into a video signal and an audio signal (hereinafter, AV signal). The receiving unit 11 outputs the demodulated AV signal to the feature amount calculation unit 12, the large-capacity recording medium 17, and the playback control unit 18.
[0039] The feature amount calculation unit 12 analyzes the AV signal to calculate feature amounts and outputs them to the silent section detection unit 13 and the candidate section detection unit 14. Here, a feature amount is a quantity used to identify CM sections and digest scenes within the program. Since CM sections are determined from the occurrence intervals of silent sections as described above, audio feature amounts such as the power level or power spectrum of the audio signal are suitable for determining CM sections. Feature amounts for determining digest scenes include, for example, video feature amounts such as luminance information and motion vectors of the video signal, and audio feature amounts such as the power level or power spectrum of the audio signal. In the present embodiment, the audio signal power level is used as the feature amount for determining both the CM sections and the digest scenes.
[0040] The silent section detection unit 13 detects silent sections in the program based on the feature amount and generates silent section information 24, which it outputs to the CM section determination unit 15.
[0041] The candidate section detection unit 14 detects sections that are candidates for digest scenes in the program (hereinafter, candidate sections) based on the feature amount and generates candidate section information 25, which it outputs to the digest list creation unit 16.
[0042] The CM section determination unit 15 determines CM sections by examining the time intervals between silent sections based on the silent section information 24, and outputs the determined CM sections to the digest list creation unit 16 as CM section information 27.
[0043] Based on the candidate section information 25 and the CM section information 27, the digest list creation unit 16 creates a digest scene list 28, which is information indicating the positions of the digest scenes, and outputs it to the large-capacity recording medium 17 and the playback control unit 18.
[0044] The large-capacity recording medium 17 is a medium for recording the AV signal and the digest scene list 28, and is realized by a DVD, an HDD, or the like.
[0045] The playback control unit 18 performs playback control, such as playing back the received AV signal or the AV signal recorded on the large-capacity recording medium 17 and outputting it to a monitor.
[0046] The feature amount calculation unit 12, the silent section detection unit 13, the candidate section detection unit 14, the CM section determination unit 15, and the digest list creation unit 16 shown in Fig. 1 are typically realized as an LSI, which is an integrated circuit. These units may each be put on a separate chip, or some or all of them may be integrated on a single chip. The method of circuit integration is not limited to LSI; they may also be realized by a dedicated circuit or a general-purpose processor.
[0047] Next, the various data used in the present embodiment will be described with reference to Fig. 2. The data described below are stored in a temporary storage unit (not shown) realized by, for example, a semiconductor memory. In Fig. 2, the comparison feature amount information 21 (Fig. 2(A)) is used to detect the silent sections and the like, and holds time information 211 for the immediately preceding frame and an immediately preceding feature amount 212 storing the audio power level value calculated by the feature amount calculation unit 12.
[0048] The silence start information 22 (Fig. 2(B)) holds a silence start time and is used to detect silent sections.
[0049] The candidate start information 23 (Fig. 2(C)) holds a candidate start time and is used to detect candidate sections.
[0050] The silent section information 24 (Fig. 2(D)) stores the silent section detection results produced by the silent section detection unit 13. The silent section information 24 consists of sets of a section number 241, a score 242, a start time 243, and an end time 244. The section number 241 identifies each silent section. The score 242 is a value evaluating how likely the silent section is to be an end of a CM section: the higher the score, the more likely the silent section is an end of a CM section; conversely, a low score means the silent section is likely one that merely happens to occur within the program (that is, not an end of a CM section). The start time 243 and the end time 244 are time information indicating the start and end times of the silent section.
[0051] The candidate section information 25 (Fig. 2(E)) stores the candidate section detection results produced by the candidate section detection unit 14. The candidate section information 25 consists of sets of a candidate number 251, a start time 252, and an end time 253. The candidate number 251 identifies each candidate section. The start time 252 and the end time 253 are time information indicating the start and end times of the candidate section.
[0052] The provisional CM start information 26 (Fig. 2(F)) holds a provisional CM start time used by the CM section determination unit 15 to detect CM sections; it stores the start time of a silent section that may be the start of a CM section.
[0053] The CM section information 27 (Fig. 2(G)) stores information on the CM sections detected by the CM section determination unit 15. The CM section information 27 consists of sets of a CM number 271, a CM start time 272, and a CM end time 273. The CM number 271 identifies each CM section. The CM start time 272 and the CM end time 273 are time information indicating the start and end times of the CM section.
[0054] The digest scene list 28 (Fig. 2(H)) is a file indicating the time information of the sections that are digest scenes within the program. It consists of sets of a digest number 281, a digest start time 282, and a digest end time 283. The digest number 281 identifies each digest section. The digest start time 282 and the digest end time 283 are time information indicating the start and end times of the digest section.
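The record layouts of Fig. 2(D) through 2(H) can be summarized as simple record types. The following sketch is purely illustrative: the class and field names are English renderings of the reference numerals, not identifiers defined by the patent.

```python
from dataclasses import dataclass

@dataclass
class SilentSection:      # silent section information 24 (Fig. 2(D))
    number: int           # section number 241
    score: int            # score 242: likelihood of being a CM-section end
    start: float          # start time 243 (seconds)
    end: float            # end time 244 (seconds)

@dataclass
class CandidateSection:   # candidate section information 25 (Fig. 2(E))
    number: int           # candidate number 251
    start: float          # start time 252
    end: float            # end time 253

@dataclass
class CmSection:          # CM section information 27 (Fig. 2(G))
    number: int           # CM number 271
    start: float          # CM start time 272
    end: float            # CM end time 273

@dataclass
class DigestEntry:        # digest scene list 28 (Fig. 2(H))
    number: int           # digest number 281
    start: float          # digest start time 282
    end: float            # digest end time 283
```

Each detection unit appends records of its own type; the digest list creation unit consumes `CandidateSection` and `CmSection` records and emits `DigestEntry` records.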
[0055] The detailed operation of the digest scene list creation process performed by the digest generation device 10 will now be described with reference to Figs. 3 to 9. Fig. 3 is a flowchart showing the detailed operation of the digest scene list creation process according to the first embodiment. The process shown in Fig. 3 is started by a recording instruction from the user, and its scan time is assumed to be one frame.
[0056] In Fig. 3, the digest generation device 10 first determines whether an instruction to end recording has been given (step S1). If it has (YES in step S1), the digest scene list creation process ends. Otherwise (NO in step S1), the feature amount calculation unit 12 acquires one frame's worth of signal from the receiving unit 11 (step S2). Next, the feature amount calculation unit 12 analyzes the acquired signal and calculates the audio power level (feature amount) (step S3).
[0057] Next, the silent section detection unit 13 performs the silent section detection process to detect silent sections (step S4). Fig. 4 is a flowchart showing the details of the silent section detection process shown in step S4. In Fig. 4, the silent section detection unit 13 first determines whether the audio signal power level calculated in step S3 is at or below a predetermined threshold (step S11). If it is (YES in step S11), the silent section detection unit 13 refers to the immediately preceding feature amount 212, which stores the feature amount of the previous frame, and determines whether that value is at or below the predetermined threshold (step S12); in other words, it examines the change in audio power level between the current frame and the previous frame. If the previous value is not at or below the threshold (NO in step S12), the silent section detection unit 13 stores the time information of the current frame in the silence start information 22 (step S13). Immediately after the process starts, nothing is yet stored in the immediately preceding feature amount 212; in that case, the process proceeds as if the value were not at or below the threshold. On the other hand, if the previous value is also at or below the threshold (YES in step S12), a silent section is continuing, so the silent section detection process simply ends.
[0058] On the other hand, if the result of step S11 is that the audio signal power level calculated in step S3 is not at or below the predetermined threshold (NO in step S11), the silent section detection unit 13 refers to the immediately preceding feature amount 212 and determines whether the power level stored there is at or below the predetermined threshold (step S14). If it is (YES in step S14), the silent section that had been continuing ended at the previous frame, so the silent section detection unit 13 outputs the section from the silence start time in the silence start information 22 to the time information 211 of the previous frame to the silent section information 24 as one silent section (step S15). Next, the silent section detection unit 13 performs the point evaluation process described below (step S16) on the silent section output in step S15. [0059] On the other hand, if the result of the determination in step S14 is that the power level of the immediately preceding feature amount 212 is not at or below the predetermined threshold (NO in step S14), a non-silent section is continuing, so the silent section detection unit 13 ends the process. Immediately after the process starts, nothing is yet stored in the immediately preceding feature amount 212; in that case as well, the process proceeds as if the value were not at or below the threshold. This completes the silent section detection process.
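The per-frame logic of steps S11 through S15 amounts to a two-state edge detector on the audio power level: a section opens on a loud-to-quiet edge and closes at the frame before a quiet-to-loud edge. A minimal sketch, assuming a hypothetical function name and an arbitrary threshold value:

```python
SILENCE_THRESHOLD = 0.05  # illustrative value; the patent leaves it unspecified

def detect_silent_sections(frames):
    """frames: iterable of (time, power) pairs, one per video frame.
    Returns the completed silent sections as (start, end) time pairs,
    mirroring steps S11-S15 of Fig. 4."""
    sections = []
    prev = None           # (time, power) of the previous frame (info 21)
    silence_start = None  # silence start information 22
    for time, power in frames:
        if power <= SILENCE_THRESHOLD:
            # S11 YES: record a start only on the loud->quiet edge (S12/S13).
            # At process start (prev is None) the edge test is treated as YES.
            if prev is None or prev[1] > SILENCE_THRESHOLD:
                silence_start = time
        else:
            # S11 NO: on the quiet->loud edge, the silent section ended
            # at the previous frame (S14/S15).
            if prev is not None and prev[1] <= SILENCE_THRESHOLD:
                sections.append((silence_start, prev[0]))
        prev = (time, power)
    return sections

frames = [(0.0, 0.9), (0.1, 0.01), (0.2, 0.02), (0.3, 0.8), (0.4, 0.7)]
print(detect_silent_sections(frames))  # -> [(0.1, 0.2)]
```

In the device itself this runs incrementally, one frame per scan, rather than over a complete list; the state held across calls corresponds to the comparison feature amount information 21 and the silence start information 22.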
[0060] Next, the details of the point evaluation process in step S16 will be described with reference to Fig. 5. This process determines whether the time points 15, 30, and 60 seconds before the most recently detected silent section fall within silent sections, and if so, adds one point to the information of each such silent section. In this way, silent sections that are likely to be the start or the end of some CM receive higher scores. That is, the process exploits the properties that both ends of a CM section are silent sections and that one CM section is 15, 30, or 60 seconds long, and evaluates, by scoring, how likely each silent section occurring in the program is to be an end of a CM section. As a result, it becomes possible to distinguish silent sections that merely happen to occur in the program from silent sections that mark CM boundaries.
[0061] In FIG. 5, the silent section detection unit 13 first acquires the start time 243 of the silent section most recently stored in the silent section information 24. The silent section detection unit 13 then searches the silent section information 24 to determine whether there is a silent section 15 seconds before that time (step S21). If such a silent section is found (YES in step S21), the silent section detection unit 13 adds 1 to the score 242 of both the most recently stored silent section and the silent section found in step S21 (step S22). If no silent section 15 seconds earlier can be found (NO in step S21), the silent section detection unit 13 skips step S22 and proceeds to step S23. Next, as in step S21, the silent section detection unit 13 determines whether there is a silent section 30 seconds earlier (step S23). If one is found (YES in step S23), the silent section detection unit 13 adds 1 to the score 242 of both the most recently stored silent section and the silent section found this time (step S24). If no silent section 30 seconds earlier can be found (NO in step S23), the silent section detection unit 13 skips step S24 and proceeds to step S25. In step S25, as in steps S21 and S23, the silent section detection unit 13 determines whether there is a silent section 60 seconds earlier, and if so, adds 1 to the scores 242 as in steps S22 and S24. This completes the point evaluation process of step S16. Although the above description searches the silent section information 24 using the start time 243 of each silent section as the reference, this is not a limitation; the end time 244 of the silent section, or any point within the silent section, may be used as the reference instead.
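As an illustration only, and not part of the disclosed embodiment, the point evaluation of [0060]–[0061] can be sketched in Python as follows. The data layout (silent sections as dictionaries with `start` and `score` fields) and the half-second matching tolerance are assumptions of this sketch.

```python
# Hypothetical sketch of the point evaluation of FIG. 5 (step S16).
# A silent section is modeled as {"start": seconds, "score": points};
# these names and the 0.5 s matching tolerance are illustrative only.

def evaluate_points(silent_sections, tolerance=0.5):
    """Add one point to the newest silent section and to any earlier one
    whose start lies 15, 30, or 60 seconds before it (steps S21-S25)."""
    if not silent_sections:
        return
    newest = silent_sections[-1]
    for offset in (15, 30, 60):
        target = newest["start"] - offset
        for section in silent_sections[:-1]:
            if abs(section["start"] - target) <= tolerance:
                section["score"] += 1   # earlier section: likely CM boundary
                newest["score"] += 1    # newest section: likely CM boundary
                break
```

A silent section that pairs with others at CM-typical spacings thus accumulates a high score, while an isolated silence within the program body stays at zero.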
[0062] Returning to FIG. 3, after step S4, the candidate section detection unit 14 performs candidate section detection processing (step S5). This processing detects sections in which the audio power level is at or above a predetermined threshold as digest scene candidate sections.
[0063] FIG. 6 is a flowchart showing the details of the candidate section detection processing of step S5. In FIG. 6, the candidate section detection unit 14 first determines whether the power level of the audio signal extracted in step S3 is at or above a predetermined threshold (step S31). If it is (YES in step S31), the candidate section detection unit 14 then determines whether the immediately preceding feature 212 is at or above the predetermined threshold (step S32). If it is not (NO in step S32), the candidate section detection unit 14 stores the time information of the frame acquired in step S2 (the frame currently being processed) into the candidate start information 23 (step S33). Immediately after processing begins, nothing is yet stored in the immediately preceding feature 212; in that case, processing proceeds as if the value were below the threshold. If the immediately preceding feature 212 is at or above the threshold (YES in step S32), a candidate section is already in progress, so the candidate section detection unit 14 proceeds to step S36.
[0064] If, on the other hand, the power level of the audio signal calculated in step S3 is below the predetermined threshold (NO in step S31), the candidate section detection unit 14 refers to the immediately preceding feature 212 and determines whether the power level stored there is at or above the predetermined threshold (step S34). If it is (YES in step S34), the candidate section that had been in progress ended at the previous frame, so the candidate section detection unit 14 outputs the section from the candidate start time stored in the candidate start information 23 up to the time information 211 of the previous frame to the candidate section information 25 as one candidate section (step S35).

[0065] If the value of the immediately preceding feature 212 is below the predetermined threshold (NO in step S34), a non-candidate section is continuing, so the candidate section detection unit 14 proceeds to step S36. Immediately after processing begins, nothing is yet stored in the immediately preceding feature 212, so processing proceeds as if the value were below the threshold. In step S36, the candidate section detection unit 14 stores the power level of the audio signal acquired in step S3 into the immediately preceding feature 212 (step S36). This completes the candidate section detection processing.
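A minimal sketch of the candidate-section state machine of FIG. 6 follows, assuming frames arrive as (time, power) pairs; the offline list interface is a simplification of the per-frame streaming described above, and the variable names are illustrative.

```python
# Hypothetical sketch of candidate section detection (FIG. 6, step S5).
# Frames are (time, power_level) pairs; threshold and interface are assumed.

def detect_candidates(frames, threshold):
    candidates = []
    start = None          # plays the role of candidate start information 23
    prev_time = None
    prev_power = None     # plays the role of the immediately preceding feature 212
    for time, power in frames:
        if power >= threshold:
            if prev_power is None or prev_power < threshold:
                start = time                           # S33: section begins here
        elif prev_power is not None and prev_power >= threshold:
            candidates.append((start, prev_time))      # S35: ended at previous frame
        prev_time, prev_power = time, power            # S36: remember this frame
    return candidates
```

As in the streaming version, a candidate section still open when input ends is not emitted; it would be closed by a later below-threshold frame.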
[0066] Returning to FIG. 3, after step S5, the CM section determination unit 15 performs CM section determination processing (step S6). FIG. 7 is a flowchart showing the details of the CM section determination processing of step S6. In FIG. 7, the CM section determination unit 15 first searches the silent section information 24 and determines whether a silent section whose score 242 is at or above a predetermined value (for example, 3 points) exists at the point in time 60 seconds before the current frame (step S41). In other words, it determines whether the point 60 seconds earlier fell within a silent section. The search point is set 60 seconds back because this embodiment assumes that a single CM section is at most 60 seconds long; if a single CM section is assumed to be at most 30 seconds long, the search point should be set 30 seconds back. If the point 60 seconds earlier is not a silent section (NO in step S41), the CM section determination unit 15 proceeds to step S46, described later.
[0067] If the result of step S41 is a silent section (YES in step S41), the CM section determination unit 15 determines whether data exists in the provisional CM start information 26 (step S42). If no data exists (NO in step S42), the CM section determination unit 15 outputs the time information of the found silent section to the provisional CM start information 26 (step S49). If data already exists (YES in step S42), the CM section determination unit 15 acquires the provisional start time from the provisional CM start information 26 and outputs it to the CM section information 27 as the CM start time 272, associated with the CM number 271. It also outputs the end time of the silent section found in step S41 (that is, the silent section 60 seconds earlier) to the CM section information 27 as the CM end time 273 (step S43).

[0068] Next, the CM section determination unit 15 sets the D-list creation flag, a flag that triggers creation of the digest scene list described later, to ON (step S44). The CM section determination unit 15 then outputs the end time of that silent section 60 seconds earlier as the start time of the provisional CM start information 26 (step S45).
[0069] Next, the CM section determination unit 15 determines whether 120 seconds or more have elapsed since the time in the provisional CM start information 26 (step S46). That is, if no silent section whose score 242 is at or above the predetermined value appears for 120 seconds after a silent section that might be a CM start is found, that silent section is judged not to be the start of a CM. The criterion is 120 seconds because this embodiment assumes that a single CM section is at most 60 seconds long: even if a start candidate for a CM section is found and a silent section is then found 60 seconds later, a further 60 seconds are needed to confirm whether that silent section is the end of the CM section.
[0070] If 120 seconds or more have elapsed (YES in step S46), the CM section determination unit 15 clears the provisional CM start information 26 (step S47) and then sets the D-list creation flag to ON (step S48). If 120 seconds have not yet elapsed (NO in step S46), the processing ends as it is. This completes the CM section determination processing.
[0071] Here, the CM section determination processing is explained further with reference to FIG. 8. In FIG. 8, points A to G are silent sections and are the boundaries of CM sections spaced 15 seconds apart. According to the processing described above, at point E (60 seconds) in FIG. 8, point A becomes the provisional CM start. Then, at point F (75 seconds), the section from point A to point B is confirmed as a CM section, and the time information of that section is output to the CM section information 27; at the same time, point B becomes the new provisional CM start. Then, at point G, the section from point B to point C is confirmed as a CM section and output to the CM section information, and point C in turn becomes the provisional CM start. Thus, according to the processing described above, accurate CM sections can be determined in parallel with program recording, albeit with some delay.
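The pairing behavior illustrated with points A to G can be approximated offline as follows. This sketch collapses the 60-second lookback and the 120-second timeout of FIG. 7 into a single spacing test between consecutive high-scoring boundaries, so it is a simplification, not the disclosed streaming algorithm.

```python
# Simplified offline approximation of CM section confirmation (FIG. 7 / FIG. 8).
# `boundaries` are times (seconds) of high-scoring silent sections; max_cm
# mirrors the assumed maximum CM length of 60 seconds.

def confirm_cm_sections(boundaries, max_cm=60):
    sections = []
    provisional = None                          # provisional CM start information 26
    for b in sorted(boundaries):
        if provisional is not None and b - provisional <= max_cm:
            sections.append((provisional, b))   # S43: section confirmed
        provisional = b                         # S45/S49: new provisional start
    return sections
```

For boundaries every 15 seconds (points A to G of FIG. 8) this yields the sections A–B, B–C, and so on, while an isolated boundary far from the others pairs with nothing, as in the 120-second timeout case.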
[0072] Returning to FIG. 3, after step S6, the digest list creation unit 16 performs digest scene list output processing (step S7). FIG. 9 is a flowchart showing the details of the digest scene list output processing of step S7. In FIG. 9, the digest list creation unit 16 first determines whether the D-list creation flag is ON (step S51). If it is not (NO in step S51), the digest list creation unit 16 ends the processing as it is. If it is ON (YES in step S51), the digest list creation unit 16 determines whether new candidate sections have been added to the candidate section information 25 since the digest scene list output processing was last performed (step S52). If no candidate section has been added (NO in step S52), the digest list creation unit 16 ends the digest scene list creation processing as it is. If candidate sections have been newly added since the last digest scene list output processing (YES in step S52), the digest list creation unit 16 acquires the information of one of the added candidate sections (step S53). Next, the digest list creation unit 16 refers to the CM section information 27 and determines whether that candidate section is contained in a CM section (step S54). If it is not within a CM section (NO in step S54), the digest list creation unit 16 outputs the information of that candidate section to the digest scene list 28 (step S55). If it is within a CM section (YES in step S54), the processing proceeds to step S56. In other words, the sections are sorted so that a candidate section that is also a CM section is not adopted as a digest scene.
[0073] Next, the digest list creation unit 16 determines whether the above sorting has been performed for all of the added candidate sections (step S56). If unprocessed added candidate sections remain (NO in step S56), the digest list creation unit 16 returns to step S53 and repeats the processing. When all of the added candidate sections have been processed, the digest list creation unit 16 sets the D-list creation flag to OFF (step S57) and ends the digest scene list output processing. This completes the digest scene list creation processing according to the first embodiment.
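The sorting of steps S53 to S55 amounts to filtering the candidate sections against the confirmed CM sections. A minimal sketch, representing both kinds of sections as assumed (start, end) tuples in seconds:

```python
# Hypothetical sketch of the digest scene list output filtering (FIG. 9).
# Candidates and CM sections are (start, end) tuples in seconds.

def build_digest_list(candidates, cm_sections):
    def in_cm(start, end):
        # S54: is the candidate contained in some confirmed CM section?
        return any(s <= start and end <= e for s, e in cm_sections)
    # S55: only candidates outside every CM section become digest scenes
    return [c for c in candidates if not in_cm(*c)]
```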
[0074] Thus, in the first embodiment, in parallel with recording a program, digest candidate sections whose audio power level is at or above a predetermined value are simply extracted, and those that fall within CM sections are subtracted from them. A digest scene list containing only the digest scenes within the program sections can thereby be created in parallel with recording. This eliminates the need for separate digest scene list generation processing after recording ends, and provides the user with a comfortable viewing environment free of the processing wait that such generation would otherwise require.
[0075] In the embodiment described above, the silent section detection processing is performed by the silent section detection unit 13, but this is not a limitation; the CM section determination unit 15 may instead detect silent sections prior to the CM section determination processing.
[0076] Digest scene detection is likewise not limited to the method using the audio power level described above. For example, limited to sports as a specific program genre, slow-motion scenes (repeated slow-motion replays) may be identified from the motion vectors of the video, and the few cuts immediately preceding them detected as exciting scenes; alternatively, text information attached to the program may be combined with feature values of the video and audio signals to detect important scenes. Of course, the method is not limited to these digest scene detection schemes, and any method that detects digest scenes may be used. Similarly, CM section detection is not limited to the method using the audio power level described above; for example, scene change points of the video may be detected from the luminance information of the video, and CM sections determined based on the intervals at which those points occur. In that case, the luminance information of the video may be used as the feature value.
[0077] The digest list described above may also be used to perform catch-up playback of a program while it is being recorded. In this case, the user instructs catch-up playback. Upon receiving this instruction, the playback control unit 18 determines whether two minutes or more have elapsed since recording started; if so, it plays back only the digest scenes, using the digest list generated by the processing described above. If two minutes have not yet elapsed, the playback control unit 18 performs quick playback (for example, playback at 1.5 times normal speed). If the quick playback then catches up with the live broadcast, the quick playback may be stopped and the output switched to the real-time broadcast. After playback of the digest scenes ends, subsequent playback may be left to the user's instructions; for example, normal playback may be performed, or playback may be thinned out. Suppose, for example, that 30 minutes into a 60-minute program the user instructs catch-up playback with a request to "play the digest scenes in 10 minutes." In this case, the playback control unit 18 plays back the digest scenes so as to finish in 10 minutes, based on the digest scene list created up to that point. Viewing after the digest scene playback ends is left to the user, and the playback control unit 18 waits for the user's instructions. That is, since 40 minutes of the program have elapsed by the time the digest scene playback ends, the 10 minutes of the program broadcast during the digest playback may, at the user's instruction, be played back thinned out or played back quickly. Of course, the user may simply watch the live broadcast without playing back that 10-minute portion; in that case, the playback control unit 18 ends the playback processing upon receiving the user's instruction. Thus, according to the present embodiment, since the digest scene list is generated in parallel with recording, digest playback can be performed at any point during recording.
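The catch-up behavior of [0077], quick playback until enough digest material exists and then digest playback trimmed to a requested duration, might be sketched as follows. The two-minute threshold is the one stated above, while the greedy scene selection is purely an assumption of this sketch; the embodiment does not specify how scenes are chosen to fit the requested time.

```python
# Hypothetical sketch of the catch-up playback decision ([0077]).

def choose_playback_mode(seconds_since_start, threshold=120):
    """Digest playback once recording has run long enough, else 1.5x quick playback."""
    return "digest" if seconds_since_start >= threshold else "quick_1.5x"

def fit_to_budget(digest_scenes, budget):
    """Greedily keep (start, end) digest scenes until the time budget (seconds)
    is spent -- one plausible way to honor a request such as
    'play the digest scenes in 10 minutes'."""
    chosen, used = [], 0.0
    for start, end in digest_scenes:
        length = end - start
        if used + length > budget:
            break
        chosen.append((start, end))
        used += length
    return chosen
```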
[0078] In the embodiment described above, the digest scene information is created by subtracting the CM sections from the digest candidate sections. However, the sections subtracted from the digest candidate sections are not limited to CM sections. For example, sections in which a still image is displayed may be detected and subtracted. When a program is rebroadcast, for instance, some scenes may not be broadcastable for licensing or portrait-rights reasons. In such cases, the program is edited before broadcast so that a still image (displaying, for example, "This scene cannot be shown") appears in place of each scene that cannot be broadcast, and is then aired. Accordingly, a feature value of the still image (for example, a video motion vector of 0) is detected, and still-image sections in which a still image is continuously displayed are identified. Digest scene information may then be created by subtracting those still-image sections (that is, broadcast-prohibited sections) from the digest candidate sections. By detecting sections having predetermined features, such as CM sections and still-image sections, as specific sections and subtracting those specific sections from the digest candidate sections, a digest list from which only the digest scenes are appropriately extracted can be generated.
[0079] (Second Embodiment)
Next, a second embodiment of the present invention will be described with reference to FIGS. 10 to 13. In the first embodiment described above, digest scene candidate sections are detected as they occur. In the second embodiment, by contrast, candidate sections are not detected; instead, the feature values needed for digest scene detection are accumulated for a predetermined time, and at predetermined timing, digest scenes are detected from the accumulated feature values outside the CM sections. FIG. 10 is a block diagram showing the configuration of a digest generation device 30 according to the second embodiment of the present invention. In FIG. 10, the feature calculation unit 12 associates the calculated feature values with time information and stores them in the temporary storage unit 31 as the temporarily accumulated features 36. The temporary storage unit 31 has sufficient capacity to hold the feature values and time information of a predetermined time's worth of frames; in this embodiment, it is assumed to hold two minutes' worth of frame information. The temporary storage unit 31 operates as a ring buffer, overwriting the oldest data in order. The digest list creation unit 32 detects digest scenes from sections other than CM sections, based on the CM section information 27 and the feature values stored in the temporary storage unit 31, and creates the digest scene list 28. Apart from these points, the digest generation device 30 according to this embodiment has basically the same configuration as the first embodiment described above; the same parts are therefore given the same reference numerals, and detailed description of them is omitted.
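The temporary storage unit 31 described above behaves like a fixed-capacity ring buffer. A minimal stand-in follows; the class name and interface are invented for illustration and are not part of the disclosure.

```python
from collections import deque

# Hypothetical stand-in for the temporary storage unit 31: holds at most
# `capacity` (time, feature) entries and silently overwrites the oldest.

class TemporaryStore:
    def __init__(self, capacity):
        self.buf = deque(maxlen=capacity)   # deque drops the oldest automatically

    def push(self, time, feature):
        self.buf.append((time, feature))

    def oldest(self):
        # Corresponds to fetching the oldest time information 361 / feature 362
        return self.buf[0] if self.buf else None
```

For two minutes of frames at, say, 30 frames per second, the capacity would be 2 × 60 × 30 entries.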
[0080] Next, the data used in the second embodiment will be described with reference to FIG. 11. In addition to the data used in the first embodiment, the second embodiment uses the temporarily accumulated features 36, the immediately-preceding digest information 37, and the digest start information 38. The temporarily accumulated features 36 are used for digest scene detection and comprise time information 361 and a feature value 362. The time information 361 stores the time information of a frame. The feature value 362 stores the feature value calculated by the feature calculation unit 12 for digest scene detection (in this embodiment, the audio power level). The immediately-preceding digest information 37 (FIG. 11(B)) is also used for digest scene detection and comprises immediately-preceding digest time information 371 and an immediately-preceding digest feature value 372. The immediately-preceding digest time information 371 stores the time information of the frame one before the frame currently being processed. The immediately-preceding digest feature value 372 stores the feature value of the frame one before the frame currently being processed. The digest start information 38 (FIG. 11(C)) holds a digest start time and is used to detect digest scenes.
[0081] The digest scene list creation processing according to the second embodiment of the present invention will now be described with reference to FIGS. 12 and 13. FIG. 12 is a flowchart showing the detailed operation of the digest scene list creation processing according to the second embodiment. In FIG. 12, the processing of steps S61 and S62 is the same as that of steps S1 and S2 described with reference to FIG. 3 in the first embodiment, so detailed description is omitted here. The feature calculation processing of step S63 is likewise the same as that of step S3 described with reference to FIG. 3 in the first embodiment, except that the calculated feature value is output to the temporary storage unit 31, so detailed description is omitted. The silent section detection processing of step S64 is also the same as that of step S4 described with reference to FIG. 4 in the first embodiment, except that at the end of the processing, the feature value calculated in step S63 (the power level of the audio signal) is stored in the immediately preceding feature 212, so detailed description is omitted.
[0082] After step S64, the CM section determination unit 15 performs CM section determination processing and creates the CM section information (step S65). This operation is also the same as the processing of step S6 described with reference to FIG. 7 in the first embodiment, so detailed description is omitted.
[0083] When the processing of step S65 is complete, the digest list creation unit 32 performs digest list output processing (step S66). FIG. 13 is a flowchart showing the details of the digest list output processing of step S66. In FIG. 13, the digest list creation unit 32 first determines whether 120 seconds' worth of frame feature values have accumulated in the temporarily accumulated features 36 (step S71). Since this embodiment assumes a maximum CM section length of 60 seconds, if, for example, a 60-second CM section occurs at the beginning of the program, up to 120 seconds are needed before that CM section can be confirmed; this check therefore prevents the processing from being performed for at least the first 120 seconds after the program starts. If 120 seconds' worth has not yet accumulated (NO in step S71), the digest list output processing ends. If it has (YES in step S71), the digest list creation unit 32 acquires the oldest time information 361 and feature value 362 from the temporarily accumulated features 36 (step S72).
[0084] Next, the digest list creation unit 32 refers to the CM section information and determines whether the time indicated by the time information 361 acquired in step S72 falls within a CM section (step S73). If it does (YES in step S73), the digest list creation unit 32 ends the digest list generation processing. If it does not (NO in step S73), the digest list creation unit 32 determines whether the value of the feature value 362 is at or above a predetermined value (step S74). If it is (YES in step S74), the digest list creation unit 32 determines whether the immediately-preceding digest feature value 372 is at or above the predetermined value (step S75). That is, it judges the change in audio power level between the frame acquired in step S72 and the frame immediately before it. If the immediately-preceding digest feature value 372 is below the predetermined value (NO in step S75), the time information of the frame is saved in the digest start information 38 (step S76). On the first pass, nothing is yet stored in the immediately-preceding digest feature value 372; in that case, processing proceeds as if the value were below the predetermined value. If the immediately-preceding digest feature value 372 is at or above the predetermined value (YES in step S75), the digest list creation unit 32 proceeds to step S77 without performing step S76.
[0085] If, on the other hand, the determination in step S74 shows that the feature value 362 is below the predetermined value (NO in step S74), the digest list creation unit 32 next determines whether the immediately preceding digest feature value 372 is equal to or greater than the predetermined value (step S78). If it is not (NO in step S78), the digest list creation unit 32 ends the digest list generation process. If it is (YES in step S78), the digest scene that had been continuing ended at the previous frame, so the section from the digest start time indicated by the digest start point information 38 to the immediately preceding digest time information 371 is output to the digest scene list 28 as one digest section (step S79).
[0086] Next, the digest list creation unit 32 saves the audio power level of the current frame as the immediately preceding digest feature value 372 (step S77). This completes the digest scene list creation process according to the second embodiment.
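Steps S71 through S79 above amount to a small per-frame state machine: outside CM sections, a rising edge of the audio feature value across a threshold opens a candidate digest section, and a falling edge closes it at the previous frame. The following Python sketch illustrates that logic. The function and variable names, the fixed threshold, and the simple list-based buffer are illustrative assumptions for exposition, not the patent's actual implementation.

```python
def process_oldest_frame(buffer, cm_sections, state, threshold=0.5,
                         min_buffered=120.0, digest_list=None):
    """One pass of the digest list output process (steps S71-S79).

    buffer       -- list of (time, feature) pairs, oldest first
                    (the temporarily stored feature values 36)
    cm_sections  -- list of (start, end) CM sections already confirmed
    state        -- dict holding the previous frame's level and time
                    (feature value 372) and the candidate start time
                    (start point information 38)
    """
    if digest_list is None:
        digest_list = []

    # S71: do nothing until 120 seconds of frames are buffered, since a
    # 60-second CM at the head of the program needs up to 120 seconds
    # before it can be confirmed as a CM section.
    if buffer[-1][0] - buffer[0][0] < min_buffered:
        return digest_list

    # S72: take the oldest buffered frame.
    t, feature = buffer.pop(0)

    # S73: frames inside a confirmed CM section are never digest scenes.
    if any(start <= t < end for start, end in cm_sections):
        return digest_list

    prev = state.get('prev_feature')  # None on the very first pass
    if feature >= threshold:
        # S74 YES / S75 NO: a rising edge (previous frame below the
        # threshold, or first pass) marks a candidate digest start (S76).
        if prev is None or prev < threshold:
            state['digest_start'] = t
    elif prev is not None and prev >= threshold:
        # S78 YES: a falling edge closes the digest section at the
        # previous frame (S79).
        digest_list.append((state['digest_start'], state['prev_time']))

    # S77: remember this frame's level and time for the next pass.
    # (The flowchart skips S77 when both frames are quiet; storing the
    # level anyway is equivalent for the threshold comparison.)
    state['prev_feature'] = feature
    state['prev_time'] = t
    return digest_list
```

Driven once per newly calculated frame, this closes each digest section at the previous frame's time as in step S79, while the buffer gate reproduces the 120-second delay of step S71.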
[0087] As described above, the second embodiment can detect CM sections in parallel with the recording of a program, and can detect digest scenes from the program sections other than the CM sections. This removes the need to run a separate digest scene list generation process after recording ends, so the user is provided with a comfortable viewing environment with no waiting time for that generation process.
[0088] Note that each of the embodiments described above may be provided in the form of a recording medium storing a program to be executed by a computer. In that case, the digest generation program stored on the recording medium is read, and the digest generation device (more precisely, a control unit not shown) executes the processes shown in FIGS. 3 and 12.
Industrial Applicability
[0089] The digest generation device, digest generation method, recording medium storing a digest generation program, and integrated circuit used in a digest generation device according to the present invention can generate digest scene information while recording a program, and are useful in applications such as HDD recorders and DVD recorders.

Claims

[1] A digest generation device for generating digest scene information concerning a broadcast program when the broadcast signal of the program is received and recorded on a recording medium, the device comprising:
a feature value calculation unit that, each time a predetermined unit time of the broadcast signal is received, calculates, from the received unit time of the broadcast signal, at least one type of feature value indicating a characteristic of at least one of video and audio contained in the broadcast signal;
a specific section end detection unit that detects time points that are the start or the end of a specific section by determining, each time a feature value is calculated, whether a predetermined time point contained in the portion of the received broadcast signal for which feature values have already been calculated is the start or the end of a specific section; and
a digest scene information creation unit that, each time a feature value is calculated, determines, based on the feature value, whether the broadcast signal in the sections of the program other than the specific section is a digest scene, and generates digest scene information.
[2] The digest generation device according to claim 1, wherein
the digest scene information creation unit includes a digest section detection unit that detects digest candidate sections in the received broadcast signal by determining, each time a feature value is calculated for a unit time of the broadcast signal, whether the content contained in that unit time of the broadcast signal is a digest scene based on the feature value, and
each time the specific section end detection unit detects a pair of the start and the end of a specific section, the digest scene information creation unit determines whether the specific section from that start to that end overlaps the digest candidate sections, and generates, as the digest scene information, information indicating the digest candidate sections detected by the digest section detection unit excluding any digest candidate section that overlaps the specific section.
[3] The digest generation device according to claim 1, wherein
the digest scene information creation unit includes a temporary storage unit that stores the calculated feature values from the most recent calculation time point back for a predetermined length of time, and
each time a feature value is calculated, the digest scene information creation unit determines whether the time point of the feature value stored in the temporary storage unit falls between the start and the end of a specific section detected by the specific section end detection unit, and, only when it does not, detects the content that is a digest scene among the content contained in the unit time of the broadcast signal and generates the digest scene information.
[4] The digest generation device according to claim 2, wherein
the feature value calculation unit calculates first and second feature values,
the specific section end detection unit determines the start or the end of a specific section based on the first feature value, and
the digest section detection unit detects the digest candidate sections based on the second feature value.
[5] The digest generation device according to claim 1, wherein the specific section end detection unit includes:
a specific section candidate detection unit that, when the feature value satisfies a predetermined condition, detects a section containing only feature values satisfying the condition as a specific section candidate; and
a specific section determination unit that detects candidates for the start or the end of a specific section based on the time differences between the specific section candidates within the program.
[6] The digest generation device according to claim 5, wherein, each time a specific section candidate is detected, if the time point a predetermined time before the detected specific section candidate is contained in an already detected specific section candidate, the specific section determination unit detects that time point the predetermined time before as the start of a specific section and the detected specific section candidate as the end of the specific section.
[7] The digest generation device according to claim 5, wherein the specific section determination unit includes:
a determination unit that, each time a specific section candidate is detected, determines whether an already detected specific section candidate exists at the time point a predetermined first time before the last detected specific section candidate, or at the time point a predetermined second time before the last detected specific section candidate;
an addition unit that, when the determination unit determines that such a specific section candidate exists, adds points to each of the specific section candidate determined to exist and the last detected specific section candidate;
a start determination unit that, each time a predetermined third time elapses after a target candidate whose points are equal to or greater than a predetermined value is detected, determines whether a specific section candidate whose points are equal to or greater than the predetermined value exists at the time point the third time before the target candidate and, if none exists, sets the target candidate as the start of a specific section; and
an end determination unit that, each time the predetermined third time elapses after a target candidate whose points are equal to or greater than the predetermined value is detected, determines whether a specific section candidate whose points are equal to or greater than the predetermined value exists at the time point at which the third time has elapsed and, if none exists, sets the target candidate as the end of the specific section.
[8] The digest generation device according to claim 5, wherein
the feature value calculation unit calculates the audio power level of the audio signal as the feature value, and
the specific section candidate detection unit detects, as the specific section candidates, silent sections in which the power level is equal to or less than a predetermined value.
[9] The digest generation device according to claim 5, wherein
the feature value calculation unit calculates luminance information based on the video signal as the feature value, and
the specific section candidate detection unit detects, as the specific section candidates, scene change points at which the amount of change in the luminance information is equal to or greater than a predetermined value.
[10] A digest generation method for generating digest scene information concerning a broadcast program when the broadcast signal of the program is received and recorded on a recording medium, the method comprising:
a feature value calculation step of, each time a predetermined unit time of the broadcast signal is received, calculating, from the received unit time of the broadcast signal, at least one type of feature value indicating a characteristic of at least one of video and audio contained in the broadcast signal;
a specific section end detection step of detecting time points that are the start or the end of a specific section by determining, each time a feature value is calculated, whether a predetermined time point contained in the portion of the received broadcast signal for which feature values have already been calculated is the start or the end of a specific section; and
a digest scene information creation step of, each time a feature value is calculated, determining, based on the feature value, whether the broadcast signal in the sections of the program other than the specific section is a digest scene, and generating digest scene information.
[11] The digest generation method according to claim 10, wherein
the digest scene information creation step includes a digest section detection step of detecting digest candidate sections in the received broadcast signal by determining, each time a feature value is calculated for a unit time of the broadcast signal, whether the content contained in that unit time of the broadcast signal is a digest scene based on the feature value, and
each time a pair of the start and the end of a specific section is detected in the specific section end detection step, the digest scene information creation step determines whether the specific section from that start to that end overlaps the digest candidate sections, and generates, as the digest scene information, information indicating the digest candidate sections detected in the digest section detection step excluding any digest candidate section that overlaps the specific section.
[12] The digest generation method according to claim 10, wherein
the digest scene information creation step includes a temporary storage step of storing the calculated feature values from the most recent calculation time point back for a predetermined length of time, and
each time a feature value is calculated, the digest scene information creation step determines whether the time point of the feature value stored in the temporary storage step falls between the start and the end of a specific section detected in the specific section end detection step, and, only when it does not, detects the content that is a digest scene among the content contained in the unit time of the AV signal and generates the digest scene information.
[13] A recording medium storing a digest generation program to be executed by a computer of a digest generation device that generates digest scene information concerning a broadcast program when the broadcast signal of the program is received and recorded on a recording medium, the program causing the computer to execute:
a feature value calculation step of, each time a predetermined unit time of the broadcast signal is received, calculating, from the received unit time of the broadcast signal, at least one type of feature value indicating a characteristic of at least one of video and audio contained in the broadcast signal;
a specific section end detection step of detecting time points that are the start or the end of a specific section by determining, each time a feature value is calculated, whether a predetermined time point contained in the portion of the received broadcast signal for which feature values have already been calculated is the start or the end of a specific section; and
a digest scene information creation step of, each time a feature value is calculated, determining, based on the feature value, whether the broadcast signal in the sections of the program other than the specific section is a digest scene, and generating digest scene information.
[14] The recording medium according to claim 13, wherein
the digest scene information creation step includes a digest section detection step of detecting digest candidate sections in the received broadcast signal by determining, each time a feature value is calculated for a unit time of the broadcast signal, whether the content contained in that unit time of the broadcast signal is a digest scene based on the feature value, and
each time a pair of the start and the end of a specific section is detected in the specific section end detection step, the digest scene information creation step determines whether the specific section from that start to that end overlaps the digest candidate sections, and generates, as the digest scene information, information indicating the digest candidate sections detected in the digest section detection step excluding any digest candidate section that overlaps the specific section.
[15] The recording medium according to claim 13, wherein
the digest scene information creation step includes a temporary storage step of storing the calculated feature values from the most recent calculation time point back for a predetermined length of time, and
each time a feature value is calculated, the digest scene information creation step determines whether the time point of the feature value stored in the temporary storage step falls between the start and the end of a specific section detected in the specific section end detection step, and, only when it does not, detects the content that is a digest scene among the content contained in the unit time of the AV signal and generates the digest scene information.
[16] An integrated circuit used in a digest generation device that generates digest scene information concerning a broadcast program when the broadcast signal of the program is received and recorded on a recording medium, the integrated circuit comprising:
a feature value calculation unit that, each time a predetermined unit time of the broadcast signal is received, calculates, from the received unit time of the broadcast signal, at least one type of feature value indicating a characteristic of at least one of video and audio contained in the broadcast signal;
a specific section end detection unit that detects time points that are the start or the end of a specific section by determining, each time a feature value is calculated, whether a predetermined time point contained in the portion of the received broadcast signal for which feature values have already been calculated is the start or the end of a specific section; and
a digest scene information creation unit that, each time a feature value is calculated, determines, based on the feature value, whether the broadcast signal in the sections of the program other than the specific section is a digest scene, and generates digest scene information.
[17] The integrated circuit according to claim 16, wherein
the digest scene information creation unit includes a digest section detection unit that detects digest candidate sections in the received broadcast signal by determining, each time a feature value is calculated for a unit time of the broadcast signal, whether the content contained in that unit time of the broadcast signal is a digest scene based on the feature value, and
each time the specific section end detection unit detects a pair of the start and the end of a specific section, the digest scene information creation unit determines whether the specific section from that start to that end overlaps the digest candidate sections, and generates, as the digest scene information, information indicating the digest candidate sections detected by the digest section detection unit excluding any digest candidate section that overlaps the specific section.
[18] The integrated circuit according to claim 16, wherein
the digest scene information creation unit includes a temporary storage unit that stores the calculated feature values from the most recent calculation time point back for a predetermined length of time, and
each time a feature value is calculated, the digest scene information creation unit determines whether the time point of the feature value stored in the temporary storage unit falls between the start and the end of a specific section detected by the specific section end detection unit, and, only when it does not, detects the content that is a digest scene among the content contained in the unit time of the AV signal and generates the digest scene information.
PCT/JP2006/314589 2005-07-27 2006-07-24 Digest generation device, digest generation method, recording medium containing a digest generation program, and integrated circuit used in digest generation device WO2007013407A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP2007528453A JPWO2007013407A1 (en) 2005-07-27 2006-07-24 Digest generating apparatus, digest generating method, recording medium storing digest generating program, and integrated circuit used for digest generating apparatus
US11/994,827 US20090226144A1 (en) 2005-07-27 2006-07-24 Digest generation device, digest generation method, recording medium storing digest generation program thereon and integrated circuit used for digest generation device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2005217724 2005-07-27
JP2005-217724 2005-07-27

Publications (1)

Publication Number Publication Date
WO2007013407A1 true WO2007013407A1 (en) 2007-02-01

Family

ID=37683303

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2006/314589 WO2007013407A1 (en) 2005-07-27 2006-07-24 Digest generation device, digest generation method, recording medium containing a digest generation program, and integrated circuit used in digest generation device

Country Status (4)

Country Link
US (1) US20090226144A1 (en)
JP (1) JPWO2007013407A1 (en)
CN (1) CN101228786A (en)
WO (1) WO2007013407A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2157580A1 (en) * 2008-08-22 2010-02-24 Panasonic Corporation Video editing system
JP2016090774A (en) * 2014-11-04 2016-05-23 ソニー株式会社 Information processing device, information processing method and program
JP2019020743A (en) * 2018-10-04 2019-02-07 ソニー株式会社 Information processing device

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9832022B1 (en) * 2015-02-26 2017-11-28 Altera Corporation Systems and methods for performing reverse order cryptographic operations on data streams

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH1032776A (en) * 1996-07-18 1998-02-03 Matsushita Electric Ind Co Ltd Video display method and recording/reproducing device
JP2001177804A (en) * 1999-12-20 2001-06-29 Toshiba Corp Image recording and reproducing device
JP2005175710A (en) * 2003-12-09 2005-06-30 Sony Corp Digital recording and reproducing apparatus and digital recording and reproducing method

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH09312827A (en) * 1996-05-22 1997-12-02 Sony Corp Recording and reproducing device
US6160950A (en) * 1996-07-18 2000-12-12 Matsushita Electric Industrial Co., Ltd. Method and apparatus for automatically generating a digest of a program
JPH10224722A (en) * 1997-02-07 1998-08-21 Sony Corp Commercial scene detector and its detection method
EP0977172A4 (en) * 1997-03-19 2000-12-27 Hitachi Ltd Method and device for detecting starting and ending points of sound section in video
JP4178629B2 (en) * 1998-11-30 2008-11-12 ソニー株式会社 Information processing apparatus and method, and recording medium
US7155735B1 (en) * 1999-10-08 2006-12-26 Vulcan Patents Llc System and method for the broadcast dissemination of time-ordered data
JP3632646B2 (en) * 2001-11-09 2005-03-23 日本電気株式会社 Communication system, communication terminal, server, and frame transmission control program
US7703044B2 (en) * 2001-11-19 2010-04-20 Ricoh Company, Ltd. Techniques for generating a static representation for time-based media information
US7206494B2 (en) * 2002-05-09 2007-04-17 Thomson Licensing Detection rules for a digital video recorder
US7260308B2 (en) * 2002-05-09 2007-08-21 Thomson Licensing Content identification in a digital video recorder
JP2004265477A (en) * 2003-02-28 2004-09-24 Canon Inc Regeneration apparatus
US20050001842A1 (en) * 2003-05-23 2005-01-06 Woojin Park Method, system and computer program product for predicting an output motion from a database of motion data
US7260035B2 (en) * 2003-06-20 2007-08-21 Matsushita Electric Industrial Co., Ltd. Recording/playback device
EP1708101B1 (en) * 2004-01-14 2014-06-25 Mitsubishi Denki Kabushiki Kaisha Summarizing reproduction device and summarizing reproduction method
JP2005229156A (en) * 2004-02-10 2005-08-25 Funai Electric Co Ltd Decoding and recording device
US20050226601A1 (en) * 2004-04-08 2005-10-13 Alon Cohen Device, system and method for synchronizing an effect to a media presentation
WO2005109905A2 (en) * 2004-04-30 2005-11-17 Vulcan Inc. Time-based graphical user interface for television program information
JP2006050531A (en) * 2004-06-30 2006-02-16 Matsushita Electric Ind Co Ltd Information recording apparatus
US20060059510A1 (en) * 2004-09-13 2006-03-16 Huang Jau H System and method for embedding scene change information in a video bitstream

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH1032776A (en) * 1996-07-18 1998-02-03 Matsushita Electric Ind Co Ltd Video display method and recording/reproducing device
JP2001177804A (en) * 1999-12-20 2001-06-29 Toshiba Corp Image recording and reproducing device
JP2005175710A (en) * 2003-12-09 2005-06-30 Sony Corp Digital recording and reproducing apparatus and digital recording and reproducing method

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2157580A1 (en) * 2008-08-22 2010-02-24 Panasonic Corporation Video editing system
JP2016090774A (en) * 2014-11-04 2016-05-23 ソニー株式会社 Information processing device, information processing method and program
JP2019020743A (en) * 2018-10-04 2019-02-07 ソニー株式会社 Information processing device

Also Published As

Publication number Publication date
CN101228786A (en) 2008-07-23
JPWO2007013407A1 (en) 2009-02-05
US20090226144A1 (en) 2009-09-10

Similar Documents

Publication Publication Date Title
JP4757876B2 (en) Digest creation device and program thereof
US7941031B2 (en) Video processing apparatus, IC circuit for video processing apparatus, video processing method, and video processing program
JP3891111B2 (en) Acoustic signal processing apparatus and method, signal recording apparatus and method, and program
JP3744464B2 (en) Signal recording / reproducing apparatus and method, signal reproducing apparatus and method, program, and recording medium
JP4387408B2 (en) AV content processing apparatus, AV content processing method, AV content processing program, and integrated circuit used for AV content processing apparatus
WO2007132566A1 (en) Video reproduction device, video reproduction method, and video reproduction program
US7149365B2 (en) Image information summary apparatus, image information summary method and image information summary processing program
JP2007266653A (en) Commercial detection apparatus and video playback apparatus
WO2007013407A1 (en) Digest generation device, digest generation method, recording medium containing a digest generation program, and integrated circuit used in digest generation device
JP3879122B2 (en) Disk device, disk recording method, disk reproducing method, recording medium, and program
US8234278B2 (en) Information processing device, information processing method, and program therefor
US20130101271A1 (en) Video processing apparatus and method
JP5002227B2 (en) Playback device
JP2006270233A (en) Method for processing signal, and device for recording/reproducing signal
WO2007145281A1 (en) Video reproducing device, video reproducing method, and video reproducing program
JP2007189448A (en) Video storing and reproducing device
JP2006115224A (en) Video recorder
JP2007082091A (en) Apparatus and method for setting delimiter information to video signal
JP5682167B2 (en) Video / audio recording / reproducing apparatus and video / audio recording / reproducing method
JP5560999B2 (en) Video / audio recording / reproducing apparatus and video / audio recording / reproducing method
JP6164445B2 (en) Chapter setting device
JP2002041095A (en) Compressed audio signal reproducing device
US20090136202A1 (en) Recording/playback device and method, program, and recording medium
JP2009194598A (en) Information processor and method, program, and recording medium
KR20040102962A (en) Apparatus for generating highlight stream in PVR and method for the same

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase
    Ref document number: 200680027069.7; Country of ref document: CN
121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase
    Ref document number: 2007528453; Country of ref document: JP
WWE Wipo information: entry into national phase
    Ref document number: 11994827; Country of ref document: US
NENP Non-entry into the national phase
    Ref country code: DE
122 Ep: pct application non-entry in european phase
    Ref document number: 06781501; Country of ref document: EP; Kind code of ref document: A1