WO2021240652A1 - Information processing device, control method, and storage medium - Google Patents

Information processing device, control method, and storage medium Download PDF

Info

Publication number
WO2021240652A1
WO2021240652A1 PCT/JP2020/020772 JP2020020772W WO2021240652A1 WO 2021240652 A1 WO2021240652 A1 WO 2021240652A1 JP 2020020772 W JP2020020772 W JP 2020020772W WO 2021240652 A1 WO2021240652 A1 WO 2021240652A1
Authority
WO
WIPO (PCT)
Prior art keywords
pair
data
digest
relevance
digest candidate
Prior art date
Application number
PCT/JP2020/020772
Other languages
French (fr)
Japanese (ja)
Inventor
はるな 渡辺
克 菊池
壮馬 白石
悠 鍋藤
Original Assignee
日本電気株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 日本電気株式会社 filed Critical 日本電気株式会社
Priority to US17/926,731 priority Critical patent/US20230205815A1/en
Priority to PCT/JP2020/020772 priority patent/WO2021240652A1/en
Priority to JP2022527325A priority patent/JP7420243B2/en
Publication of WO2021240652A1 publication Critical patent/WO2021240652A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/73Querying
    • G06F16/738Presentation of query results
    • G06F16/739Presentation of query results in form of a video summary, e.g. the video summary being a video sequence, a composite still image or having synthesized frames
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/75Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/65Clustering; Classification
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/76Television signal recording
    • H04N5/91Television signal processing therefor

Definitions

  • the present disclosure relates to technical fields of information processing devices, control methods, and storage media that perform processing related to digest generation.
  • Patent Document 1 discloses a method of confirming and producing highlights from a video stream of a sporting event on the ground.
  • An object of the present disclosure is to provide an information processing device, a control method, and a storage medium capable of generating information suitable for digest generation in consideration of the above problems.
  • One aspect of the information processing apparatus is a pair determining means for determining a pair of data including at least one of video data or sound data, wherein at least one of the data is a first digest candidate which is a digest candidate.
  • An information processing apparatus comprising, and a relevance calculation means for calculating a relevance degree indicating the degree of probability that the pair is simultaneously included in the digest.
  • One aspect of the control method is to determine by a computer the pair of data including at least one of video data or sound data, and the pair in which at least one data is a first digest candidate, which is a digest candidate. It is a control method for calculating the degree of association indicating the degree of probability that the pair is included in the digest at the same time.
  • One aspect of the storage medium is a pair of data including at least one of video data or sound data, and a pair determining means for determining the pair in which at least one data is a first digest candidate which is a digest candidate.
  • a storage medium in which a program for operating a computer as a relevance calculation means for calculating a relevance indicating the degree of probability that the pair is simultaneously included in the digest is stored.
  • the configuration of the digest candidate selection system in the first embodiment is shown.
  • the hardware configuration of the information processing device is shown.
  • This is an example of a functional block of an information processing device. It is a figure which shows the outline of the selection process of the 1st digest candidate common to the 1st selection example and the 2nd selection example. It is a figure which shows the outline of the selection process of the 2nd digest candidate which concerns on the 1st selection example after the selection of the 1st digest candidate. It is a figure which shows the outline of the selection process of the 2nd digest candidate which concerns on the 2nd selection example after the selection of the 1st digest candidate. It is a schematic block diagram of the learning system which generates the relevance inferior information. An example of the functional block configuration of the learning device is shown.
  • System Configuration Figure 1 shows the configuration of the digest candidate selection system 100 according to the first embodiment.
  • the digest candidate selection system 100 preferably selects video data as a digest candidate from video data (also referred to as “material data”) as a material.
  • the digest candidate selection system 100 mainly includes an information processing device 1, an input device 2, an output device 3, and a storage device 4.
  • the information processing device 1 performs data communication with the input device 2 and the output device 3 via a communication network or by direct communication by radio or wire.
  • the information processing device 1 refers to the relevance inferior information D2 and the importance inferior information D3 also stored in the storage device 4 from the material data D1 stored in the storage device 4, and is a video data that is a candidate for a digest. To select. Then, the information processing device 1 generates an output signal "S1" related to the above selection result, and supplies the generated output signal S1 to the output device.
  • the input device 2 is an arbitrary user interface that accepts user input, and corresponds to, for example, a button, a keyboard, a mouse, a touch panel, a voice input device, and the like.
  • the input device 2 supplies the input signal "S2" generated based on the user input to the information processing device 1.
  • the output device 3 is, for example, a display device such as a display or a projector, and a sound output device such as a speaker, and performs a predetermined display or sound output based on the output signal S1 supplied from the information processing device 1.
  • the storage device 4 is a memory for storing various information necessary for processing of the information processing device 1.
  • the storage device 4 stores, for example, the material data D1, the relevance inferior information D2, and the importance inferrer information D3.
  • the material data D1 is video data to be edited in the generation of the digest.
  • the video data corresponding to the section having a predetermined reproduction time length extracted from the material data D1 is also referred to as “section data”.
  • Each section data includes a predetermined number of time-series images, which is one or more.
  • the information processing apparatus 1 calculates the importance and generates a pair for calculating the relevance of the section data obtained by dividing the material data D1 for each unit interval.
  • Relevance inference device information D2 is information about an inference device (also referred to as “relevance inference device”) that infers the relevance to a pair of interval data (also referred to as “inference target pair Ptag").
  • the degree of relevance is an index showing the relevance in terms of whether or not the inferred pair Ptag is simultaneously included in the digest, in other words, the degree of probability (or validity) that the inferred pair Ptag is simultaneously included in the digest. It is an index showing.
  • the relevance inferior is learned in advance to infer these relevances when a predetermined number (one or more) of image pairs corresponding to the pair of interval data are input, and the relevance inferior information D2. Contains the parameters of the learned relevance inferior.
  • Importance inferrer information D3 is information about an inferior (also referred to as "importance inferior") that infers importance to interval data.
  • the above-mentioned importance is an index as a reference for determining whether the section in the material data D1 corresponding to the video data input in the generation of the digest is an important section or a non-important section.
  • the importance inferior is learned in advance so as to infer the importance of the target section when a predetermined number (one or more) of images constituting the section data are input, and the importance inferior information D3 is used. Includes learned importance inferrer parameters.
  • the learning model of the relevance inferior and the importance inferior may be a learning model based on arbitrary machine learning such as a neural network or a support vector machine, respectively.
  • a model of the relevance inferior and the importance inferior is a neural network such as a convolutional neural network
  • the relevance inferior information D2 and the importance inferior information D3 are a layer structure, a neuron structure of each layer, and the like. Includes various parameters such as the number and size of filters in each layer and the weight of each element of each filter.
  • the storage device 4 may be an external storage device such as a hard disk connected to or built in the information processing device 1, or may be a storage medium such as a flash memory. Further, the storage device 4 may be a server device that performs data communication with the information processing device 1. Further, the storage device 4 may be composed of a plurality of devices. In this case, the storage device 4 may store the material data D1, the relevance inferior information D2, and the importance inferior information D3 in a distributed manner.
  • the configuration of the digest candidate selection system 100 described above is an example, and various changes may be made to the configuration.
  • the input device 2 and the output device 3 may be integrally configured.
  • the input device 2 and the output device 3 may be configured as a tablet-type terminal integrated with the information processing device 1.
  • the information processing device 1 may be composed of a plurality of devices. In this case, the plurality of devices constituting the information processing device 1 exchange information necessary for executing the pre-assigned process among the plurality of devices.
  • FIG. 2 shows the hardware configuration of the information processing device 1.
  • the information processing apparatus 1 includes a processor 11, a memory 12, and an interface 13 as hardware.
  • the processor 11, the memory 12, and the interface 13 are connected via the data bus 19.
  • the processor 11 executes a predetermined process by executing the program stored in the memory 12.
  • the processor 11 is a processor such as a CPU (Central Processing Unit), a GPU (Graphics Processing Unit), and a quantum processor.
  • the memory 12 is composed of various volatile memories such as RAM (Random Access Memory) and ROM (Read Only Memory) and non-volatile memory. Further, the memory 12 stores a program executed by the information processing apparatus 1. Further, the memory 12 is used as a working memory and temporarily stores information and the like acquired from the storage device 4. The memory 12 may function as the storage device 4. Similarly, the storage device 4 may function as the memory 12 of the information processing device 1. The program executed by the information processing apparatus 1 may be stored in a storage medium other than the memory 12.
  • the interface 13 is an interface for electrically connecting the information processing device 1 and another device.
  • the interface for connecting the information processing device 1 and another device may be a communication interface such as a network adapter for transmitting / receiving data to / from another device based on the control of the processor 11 by wire or wirelessly. good.
  • the information processing apparatus 1 and the other apparatus may be connected by a cable or the like.
  • the interface 13 includes a hardware interface compliant with USB (Universal Serial Bus), SATA (Serial AT Atchment), etc. for exchanging data with other devices.
  • USB Universal Serial Bus
  • SATA Serial AT Atchment
  • the hardware configuration of the information processing device 1 is not limited to the configuration shown in FIG.
  • the information processing device 1 may include at least one of an input device 2 and an output device 3.
  • the information processing apparatus 1 selects a provisional digest candidate (also referred to as a “first digest candidate”) from the section data of the material data D1, and finally outputs the digest based on the first digest candidate.
  • a provisional digest candidate also referred to as a “first digest candidate”
  • Select candidates also called "second digest candidates"
  • FIG. 3 is an example of a functional block of the processor 11 of the information processing device 1.
  • the processor 11 of the information processing device 1 includes a first digest candidate selection unit 14, a pair determination unit 15, a relevance calculation unit 16, a second digest candidate selection unit 17, and an output control unit 18.
  • the blocks in which data is exchanged are connected by a solid line, but the combination of blocks in which data is exchanged is not limited to FIG. The same applies to the figures of other functional blocks described later.
  • the first digest candidate selection unit 14 calculates the importance of each of the section data constituting the material data D1, and selects the first digest candidate from the section data constituting the material data D1 based on the calculated importance. do.
  • the section data is data in which the material data D1 is divided by a section having a unit time length, and is data including one or more images for a predetermined number of sheets.
  • the first digest candidate selection unit 14 configures the importance inference device by referring to the importance inference device information D3, and sequentially inputs the section data extracted from the material data D1 into the importance inference device. Then, the importance corresponding to all the section data is acquired.
  • the first digest candidate selection unit 14 selects the section data whose importance is equal to or higher than the preset threshold value as the first digest candidate.
  • the section data that is not the first digest candidate is also referred to as a “non-first digest candidate”.
  • the first digest candidate selection unit 14 supplies information regarding the first digest candidate (also referred to as “first digest candidate information Idc1”) to the pair determination unit 15.
  • the pair determination unit 15 determines a plurality of inference target pairs Ptag that combines two section data extracted from the material data D1 based on the first digest candidate information Idc1 determined by the first digest candidate selection unit 14. In this case, in the first example, the pair determination unit 15 determines an inference target pair Ptag that combines arbitrary two interval data that are candidates for the first digest. In the second example, the pair determination unit 15 determines an inference target pair Ptag that arbitrarily combines the section data that is the first digest candidate and the section data that is the non-first digest candidate. Then, the pair determination unit 15 supplies the determined inference target pair Ptag to the relevance calculation unit 16.
  • the pair determination unit 15 may limit the inference target pair Ptag by the method described below in order to reduce the overall processing load of the information processing apparatus 1. For example, the pair determination unit 15 may determine two section data in which the difference between the corresponding reproduction times is within a predetermined time difference as the inference target pair Ptag. In another example, the pair determination unit 15 may determine the inference target pair Ptag by targeting only the interval data extracted from the material data D1 at predetermined time intervals. In yet another example, the pair determination unit 15 applies an arbitrary clustering method to the interval data to classify the interval data, and determines the inference target pair Ptag with only the interval data belonging to a predetermined class as the target of combination. You may.
  • the relevance calculation unit 16 calculates the relevance degree for each of the inference target pair Ptag supplied from the pair determination unit 15.
  • the relevance calculation unit 16 configures the relevance inference device by referring to the relevance inference device information D2, and sequentially inputs the inference target pair Ptag acquired from the pair determination unit 15 into the relevance inference device. Then, the degree of relevance to each of the inference target pair Ptag is calculated. Then, the relevance calculation unit 16 supplies the calculated relevance information (also referred to as “relevance information Ia”) to the second digest candidate selection unit 17.
  • the second digest candidate selection unit 17 selects the second digest candidate based on the relevance information Ia supplied from the relevance calculation unit 16.
  • the second digest candidate selection unit 17 supplies information regarding the selected second digest candidate (also referred to as “second digest candidate information Idc2”) to the output control unit 18.
  • the second digest candidate information Idc2 may be the section data itself that is the second digest candidate, and is the time information (information indicating the reproduction time in the material data D1) of the section data that is the second digest candidate. You may.
  • the second digest candidate selection unit 17 selects the first digest candidate constituting the inference target pair Ptag whose relevance is equal to or higher than a predetermined threshold value. Select as a second digest candidate.
  • the second digest candidate selection unit 17 can suitably determine the second digest candidate narrowed down from the first digest candidate based on the degree of relevance.
  • the second digest candidate selection unit 17 is the non-first digest candidate of the inference target pair Ptag whose relevance is equal to or higher than the threshold value. Is added to the second digest candidate.
  • the second digest candidate selection unit 17 can suitably incorporate the non-first digest candidate, which has a high degree of relevance to the first digest candidate, into the second digest candidate.
  • the output control unit 18 performs output control based on the second digest candidate information Idc2 supplied from the second digest candidate selection unit 17.
  • the output control unit 18 generates an output signal S1 related to the second digest candidate information Idc2, and transmits the generated output signal S1 to the output device 3 via the interface 13.
  • the output control unit 18 transmits, for example, an output signal S1 for reproducing the section data as the second digest candidate to the output device 3, so that the section data as the second digest candidate is transmitted to the output device 3. You may regenerate it.
  • the output control unit 18 can make the viewer confirm the suitability of the second digest candidate as a digest.
  • the output control unit 18 stores the second digest candidate information Idc2 in the storage device 4 via the interface 13.
  • the output control unit 18 transmits the second digest candidate information Idc2 to the external device that performs the final digest generation process via the interface 13.
  • the processor 11 stores, for example, each component of the first digest candidate selection unit 14, the pair determination unit 15, the relevance calculation unit 16, the second digest candidate selection unit 17, and the output control unit 18 described in FIG. This can be achieved by executing a program stored in the device 4 or the memory 12. Further, each component may be realized by recording a necessary program in an arbitrary non-volatile storage medium and installing it as needed. It should be noted that each of these components is not limited to being realized by software by a program, and may be realized by any combination of hardware, firmware, and software. Further, each of these components may be realized by using a user-programmable integrated circuit such as an FPGA (field-programmable gate array) or a microcomputer. In this case, this integrated circuit may be used to realize a program composed of each of the above components. As described above, each component may be realized by any controller including hardware other than the processor. The above is the same in other embodiments described later.
  • FIG. 4 is a diagram showing an outline of the selection process of the first digest candidate common to the first selection example and the second selection example.
  • the first digest candidate selection unit 14 extracts a plurality of section data having a unit time length from the material data D1 and refers to the importance inferrer information D3 for the extracted section. Enter the data sequentially. As a result, the first digest candidate selection unit 14 calculates the importance of each of the section data. Then, the first digest candidate selection unit 14 considers the section data having an importance of more than a predetermined threshold value as the first digest candidate, and the section data having an importance less than the threshold value as a non-first digest candidate. ..
  • FIG. 5 is a diagram showing an outline of the selection process of the second digest candidate according to the first selection example after the selection of the first digest candidate.
  • the pair determination unit 15 determines the inference target pair Ptag that combines the section data that are the first digest candidates. In this case, the pair determination unit 15 may determine all combinations of pairs of the first digest candidates as the inference target pair Ptag, or determine a part of the pair of the first digest candidates as the inference target pair Ptag. May be good.
  • the relevance calculation unit 16 inputs each of the inference target pairs Ptag determined by the pair determination unit 15 into the relevance inference device configured with reference to the relevance inference device information D2. As a result, the relevance calculation unit 16 calculates the relevance of each of the inference target pair Ptag. Then, the second digest candidate selection unit 17 selects the section data constituting the inference target pair Ptag whose calculated relevance degree is equal to or higher than a predetermined threshold value as the second digest candidate. As a result, the second digest candidate selection unit 17 can suitably narrow down only the section data having a high degree of relevance as the second digest candidate from the section data that becomes the first digest candidate.
  • the pair determination unit 15 selects only the pair of the section data of the first digest candidate whose reproduction time is within a predetermined time difference as the inference target pair Ptag.
  • the pair determination unit 15 determines the inference target pair Ptag only for the first digest candidate corresponding to the section data selected from the material data D1 at regular time (for example, 2 seconds) intervals.
  • the pair determination unit 15 clusters the section data that is the first digest candidate, and determines the inference target pair Ptag only for the section data included in the specific class.
  • the pair determination unit 15 performs, for example, a predetermined feature amount extraction process for each section data, and determines which of the preset classes belongs to (that is, the class) based on the extracted feature amount. Identification). In another example, the pair determination unit 15 may perform clustering based on the user input to the input device 2.
  • FIG. 6 is a diagram showing an outline of the selection process of the second digest candidate according to the second selection example after the selection of the first digest candidate.
  • the pair determination unit 15 determines an inference target pair Ptag that is taken out from each of the first digest candidate and the non-first digest candidate and combined. In this case, the pair determination unit 15 may select all combinations of pairs of the first digest candidate and the non-first digest candidate as the inference target pair Ptag, and select some combinations thereof as the inference target pair Ptag. You may.
  • the relevance calculation unit 16 inputs each of the inference target pairs Ptag determined by the pair determination unit 15 into the relevance inference device configured with reference to the relevance inference device information D2. As a result, the relevance calculation unit 16 calculates the relevance of each of the inference target pair Ptag. Then, the second digest candidate selection unit 17 sets the non-first digest candidate constituting the inference target pair Ptag whose calculated relevance is equal to or higher than a predetermined threshold value and all the first digest candidates as the second digest candidate. Select. In this case, the second digest candidate selection unit 17 can incorporate the non-first digest candidate having a high degree of relevance to the first digest candidate into the second digest candidate. As a result, the scenes around the important scenes necessary for understanding the story can be suitably incorporated into the digest candidates.
  • the pair determination unit 15 selects only the pair of the section data between the first digest candidate and the non-first digest candidate whose reproduction time is within a predetermined time difference as the inference target pair Ptag.
  • the pair determination unit 15 targets only the first digest candidate and the non-first digest candidate corresponding to the section data selected from the material data D1 at regular time (for example, 2 seconds) intervals, and the pair is inferred.
  • Select Ptag the pair determination unit 15 clusters the first digest candidate and the non-first digest candidate, and the inference target pair targets only the first digest candidate and the non-first digest candidate included in the specific class. Select Ptag.
  • FIG. 7 is a schematic configuration diagram of a learning system that generates the relevance inferior information D2.
  • the learning system has a learning device 6 that can refer to the learning data D4.
  • the learning device 6 has the same configuration as that of the information processing device 1 shown in FIG. 2, for example, and mainly has a processor 21, a memory 22, and an interface 23.
  • the learning device 6 may be an information processing device 1 or any device other than the information processing device 1.
  • the learning data D4 includes learning material data which is learning material data and a label indicating whether the learning material data is important or non-important for each unit interval.
  • a digest (learning digest) is generated in advance from the learning material data by manual work, and important labels are attached to the section data of the learning material data adopted as the digest, and the section data other than the digest is assigned. Is labeled as non-important.
  • the section data of the learning material data with the important label that is, the learning digest
  • the section data of the learning material data with the non-important label is "non-important”. Called "data”.
  • FIG. 8 shows an example of the functional block configuration of the learning device 6.
  • the learning device 6 mainly has a pair determination unit 61 and a learning unit 62.
  • the pair determination unit 61 and the learning unit 62 are realized by, for example, the processor 21.
  • the pair determination unit 61 refers to the learning data D4, determines the inference target pair Ptag for learning from the interval data of the learning material data, and generates a correct label for these pairs. A specific example of the processing of the pair determination unit 61 will be described later.
  • the learning unit 62 learns the relevance inference device based on the combination of the inference target pair Ptag for learning determined by the pair determination unit 61 and the correct answer label.
  • the learning device 6 has the output of the relevance inference device when the inference target pair Ptag for learning is input to the relevance inference device, and the correct answer label corresponding to the input inference target pair Ptag for learning. Determine the parameters of the relevance inferior so that the error (loss) is minimized.
  • the algorithm for determining the above parameters to minimize the loss may be any learning algorithm used in machine learning such as gradient descent or backpropagation.
  • the learning unit 62 may further perform a process of learning the importance inference device with reference to the learning data D4 and generating the importance inference device information D3.
  • the pair determination unit 61 determines all two combinations of the important data and the non-important data constituting the learning material data as the inference target pair Ptag for learning. Then, the pair determination unit 61 attaches a correct answer label indicating that it is a correct example to the inference target pair Ptag for learning that is a pair of important data, and the inference target pair Ptag for learning that is another pair. On the other hand, a correct answer label indicating that it is a negative example is attached.
  • the "other pair” refers to a pair of non-important data and a pair of important data and non-important data.
  • the learning unit 62 considers that, for example, in the case of a correct answer label indicating that it is a negative example, the degree of relevance that becomes a correct answer is the lowest value, and in the case of a correct answer label indicating that it is a correct example, it is regarded as a correct answer.
  • the degree of relevance is considered to be the maximum value, and the relevance inferior is learned.
  • the learning device 6 can suitably learn the relevance inferior so that the higher the probability that the pair of input interval data is included in the digest at the same time, the higher the relevance is output. ..
  • the pair determination unit 61 determines arbitrary two interval data (including important data and non-important data) of the learning material data as the inference target pair Ptag for learning. Then, the learning unit 62 determines a value corresponding to the difference in reproduction time of the two section data constituting the inference target pair Ptag for learning as the correct label for the inference target pair Ptag for learning.
  • the "value according to the difference in reproduction time" is, for example, a number closer to 1 (for example, the maximum value of the degree of relevance) as the reproduction time of the two section data is closer, and 0 (for example, the lowest value of the degree of relevance) as the distance is farther. It is a value normalized according to the range of the degree of relevance so that the numbers are close to each other.
  • the learning device 6 can suitably learn the relevance degree inferior so that the pair of section data having a deep connection as a story outputs a higher relevance degree.
  • the information processing apparatus 1 applies the relevance inferior learned in the second learning example to, for example, the second selection example shown in FIG. 7, so that the information processing apparatus 1 is non-first, which is close in time to the first digest candidate.
  • the digest candidate can be suitably incorporated as the second digest candidate.
  • the information processing apparatus 1 can suitably select the peripheral scenes of the important scenes as the second digest candidate, and can suitably support the generation of the digest in which the story is easy to understand.
  • FIG. 9 is an example of a flowchart showing a procedure of processing executed by the information processing apparatus 1 in the first embodiment.
  • the information processing apparatus 1 executes the processing of the flowchart shown in FIG. 9, for example, when a user input instructing the start of the processing is detected.
  • the first digest candidate selection unit 14 of the information processing device 1 acquires the material data D1 from the storage device 4 via the interface 13 (step S11).
  • the first digest candidate selection unit 14 acquires the material data D1 corresponding to the contents specified by the user input or the like. ..
  • the first digest candidate selection unit 14 selects the first digest candidate from the section data constituting the material data (step S12).
  • the first digest candidate selection unit 14 calculates the importance of each section data by inputting the section data into the importance inferior configured with reference to the importance inferior information D3, and the calculated importance.
  • the section data that is the first digest candidate is selected based on the degree.
  • the pair determination unit 15 generates an inference target pair Ptag including the first digest candidate (step S13).
  • the pair determination unit 15 may generate an inference target pair Ptag in which the first digest candidates are paired according to the first selection example described above, and the first digest candidate and the non-first digest candidate may be generated according to the second selection example.
  • An inference target pair Ptag that pairs with one digest candidate may be generated.
  • the relevance calculation unit 16 calculates the relevance of each of the inference target pair Ptag generated in step S13 (step S14). In this case, the relevance calculation unit 16 calculates the relevance of each of the inference target pair Ptag by sequentially inputting the inference target pair Ptag into the relevance inferior configured by referring to the relevance inferior information D2. do.
  • the second digest candidate selection unit 17 selects the second digest candidate (step S15).
  • the second digest candidate selection unit 17 selects, for example, the first digest candidate which is the inference target pair Ptag having a high degree of relevance among the first digest candidates as the second digest candidate according to the first selection example described above. ..
  • the second digest candidate selection unit 17 selects a non-first digest candidate having a high degree of relevance to the first digest candidate as the second digest candidate together with the first digest candidate according to the second selection example.
  • the output control unit 18 outputs information regarding the second digest candidate (step S16).
  • the output control unit 18 may supply the information regarding the second digest candidate to an external device such as the storage device 4, or may output the information to the output device 3.
  • FIG. 10 is an example of a flowchart showing a procedure of processing executed by the learning device 6 in the first embodiment.
  • the learning device 6 executes the processing of the flowchart shown in FIG. 10, for example, when a user input instructing the start of the processing is detected.
  • the pair determination unit 61 of the learning device 6 acquires learning material data from the learning data D4 (step S21).
  • the pair determination unit 61 acquires the learning material data corresponding to the contents specified by the user input or the like.
  • the pair determination unit 61 generates an inference target pair Ptag for learning (step S22).
  • the pair determination unit 61 generates, for example, a pair of interval data extracted from the learning material data as an inference target pair Ptag according to either the first learning example or the second learning example described above.
  • the pair determination unit 61 determines the correct label for the inference target pair Ptag for learning generated in step S22 (step S23). In this case, the pair determination unit 61 may determine the correct answer label based on whether or not it is a pair of important data according to the first learning example, and according to the second learning example, the section that becomes the inference target pair Ptag for learning. A value corresponding to the difference in reproduction time between the data may be determined as the correct answer label.
  • the learning unit 62 learns the relevance inference device based on the inference target pair Ptag for learning and the correct answer label (step S24). Then, the learning device 6 generates the parameters of the relevance inferior obtained by learning as the relevance inferior information D2.
  • the generated relevance inferior information D2 may be immediately stored in the storage device 4 by data communication between the storage device 4 and the learning device 6, and is stored in the storage device 4 via a removable storage medium. You may.
  • the pair determination unit 15 may combine the interval data extracted from different material data as the inference target pair Ptag instead of combining the two interval data extracted from the same material data as the inference target pair Ptag.
  • the first digest candidate selection unit 14 selects the first digest candidate from the second material data different from the material data D1.
  • the material data D1 and the second material data may be, for example, data taken at the same place (for example, a sports venue) by different cameras in the same time zone.
  • the second material data may be labeled in advance to identify important sections and non-important sections.
  • the first digest candidate selection unit 14 selects the section data labeled with the important section as the first digest candidate.
  • the pair determination unit 15 determines a pair of the first digest candidate extracted from the second material data and the section data extracted from the material data D1 as the inference target pair Ptag.
  • the second digest candidate selection unit 17 is, for example, the section data of the material data D1 constituting the inference target pair Ptag whose relevance degree is equal to or higher than a predetermined value, and the first digest candidate extracted from the second material data. Is selected as the second digest candidate.
  • the information processing apparatus 1 can suitably select a digest candidate from a plurality of material data.
  • the information processing apparatus 1 may calculate and output the degree of relevance to the inference target pair Ptag specified by the user input.
  • FIG. 11 is an example of a functional block diagram of the information processing apparatus 1A according to the modified example 2.
  • the processor 11 of the information processing device 1A has a pair determination unit 15A, a relevance calculation unit 16A, and an output control unit 18A.
  • the same components as those in the above-described embodiment will be appropriately designated by the same reference numerals, and the description thereof will be omitted.
  • the pair determination unit 15A determines the inference target pair Ptag based on the input signal S2 received from the input device 2 via the interface 13. For example, the pair determination unit 15A determines two section data of the material data D1 designated based on the input signal S2 as the inference target pair Ptag.
  • the user of the information processing apparatus 1A designates the section data to be the digest candidate as one section data of the inference target pair Ptag, and determines whether or not it is suitable as the digest candidate as the other section data. Specify the section data to be.
  • the pair determination unit 15 may accept inputs for designating section data to be the inference target pair Ptag from different material data, and determine these section data as the inference target pair Ptag. Further, the pair determination unit 15A may determine a plurality of inference target pairs Ptag based on the input signal S2.
  • the relevance calculation unit 16A calculates and calculates the relevance of the inference target pair Ptag determined by the pair determination unit 15A based on the relevance inferior configured by referring to the relevance inferior information D2. Relevance information Ia regarding the degree is supplied to the output control unit 18A. Then, the output control unit 18A outputs based on the relevance information Ia. In this case, for example, the output control unit 18A supplies the output device 3 with an output signal S1 for displaying the relevance of the inference target pair Ptag.
  • the output control unit 18A may output only the information regarding the relevance of the inference target pair Ptags corresponding to the upper predetermined number having the highest relevance. May output only the information about the relevance of the inferred pair Ptag in which is equal to or greater than a predetermined threshold.
  • the above-mentioned predetermined number may be set to any number of 1 or more.
  • FIG. 12 is an example of a flowchart executed by the information processing apparatus 1A in the modification 2.
  • the pair determination unit 15A of the information processing apparatus 1A accepts a user input for designating the inference target pair Ptag (step S31).
  • the pair determination unit 15A may display the reproduction screen of the material data D1 including the seek bar or the like, and specify the section data corresponding to an arbitrary reproduction time as the inference target pair Ptag.
  • the relevance calculation unit 16A calculates the relevance of the inference target pair Ptag specified in step S31 (step S32).
  • the output control unit 18A outputs based on the degree of relevance calculated in step S32 (step S33).
  • the information processing apparatus 1A according to the modification 2 can suitably calculate and output the degree of relevance to the inference target pair Ptag specified by the user input.
  • the relevance calculation unit 16 may calculate the relevance using the sound data.
  • the relevance calculation unit 16 calculates the relevance with reference to the relevance inferior information D2 based on the video data and the sound data of the two section data constituting the inference target pair Ptag. do.
  • the parameter of the relevance inferior learned in advance so that the relevance is output when a pair of interval data including video data and sound data is input is stored in advance as the relevance inferior information D2.
  • the feature amount of the sound data may be input to the relevance inferior instead of directly inputting the sound data. In this case, after a predetermined feature amount extraction process or the like is performed on the sound data, the extracted feature amount is input to the relevance inferior.
  • the first digest candidate selection unit 14 may calculate the importance of each section data using sound data in addition to the video data. good.
  • the relevance calculation unit 16 may calculate the relevance by referring to the relevance inferior information D2 based only on the sound data included in the two section data constituting the inference target pair Ptag. good.
  • the parameters of the relevance inferior learned in advance so that the relevance is output when the pair of sound data is input are stored in the storage device 4 in advance as the relevance inferior information D2.
  • the information processing apparatus 1 can suitably calculate the relevance of the inference target pair Ptag using at least one of the video data and the sound data.
  • the information processing apparatus 1 does not have to include at least one of the first digest candidate selection unit 14 and the second digest candidate selection unit 17.
  • the pair determination unit 15 considers the section data of the important section as the first digest candidate, and the pair Ptag to be inferred. May be determined.
  • the output control unit 18 may perform a predetermined output based on the relevance information Ia output by the relevance calculation unit 16. In this case, the output control unit 18A may output information about the inference target pair Ptag having the highest relevance for the upper predetermined number, and output only the information about the inference target pair Ptag whose relevance is equal to or higher than the predetermined threshold. You may.
  • the above-mentioned predetermined number may be set to any number of 1 or more.
  • the above-mentioned "information about the inference target pair Ptag" may be the section data itself constituting the inference target pair Ptag, and the time information of the section data constituting the inference target pair Ptag (information indicating the reproduction time in the material data D1). ) May be.
  • FIG. 13 is a functional block diagram of the information processing apparatus 1X according to the second embodiment.
  • the information processing apparatus 1X mainly includes a pair determination unit 15X and a relevance calculation unit 16X.
  • the pair determining means 15X determines a pair of data including at least one of video data or sound data, and at least one of the data is a candidate for the first digest.
  • the "video data” may be composed of one image or may include a plurality of images.
  • the above-mentioned “data” and “pair” can be “interval data” and “inference target pair Ptag” in the first embodiment (including modification, the same applies hereinafter), respectively.
  • the pair determination means 15X can be the pair determination unit 15 or the pair determination unit 15A in the first embodiment.
  • the relevance calculation means 16X calculates the relevance degree indicating the degree of probability that the pair determined by the pair determination means 15X is included in the digest at the same time.
  • the relevance calculation means 16X can be the relevance calculation unit 16 or the relevance calculation unit 16A in the first embodiment.
  • FIG. 14 is an example of a flowchart executed by the information processing apparatus 1X in the second embodiment.
  • the pair determining means 15X determines a pair of data including at least one of video data or sound data, and at least one of the data is a candidate for the first digest (step S41).
  • the relevance calculation means 16X calculates the relevance degree indicating the degree of probability that the pair determined by the pair determination means 15X is simultaneously included in the digest (step S42).
  • the information processing apparatus 1X can suitably calculate the degree of relevance as an index for determining whether or not the paired data should be included in the digest at the same time.
  • Non-temporary computer-readable media include various types of tangible storage media.
  • Examples of non-temporary computer-readable media include magnetic storage media (eg, flexible disks, magnetic tapes, hard disk drives), magneto-optical storage media (eg, magneto-optical disks), CD-ROMs (ReadOnlyMemory), CD-Rs, Includes CD-R / W, semiconductor memory (eg, mask ROM, PROM (Programmable ROM), EPROM (Erasable PROM), flash ROM, RAM (RandomAccessMemory)).
  • the program may also be supplied to the computer by various types of temporary computer readable medium.
  • temporary computer-readable media include electrical, optical, and electromagnetic waves.
  • the temporary computer-readable medium can supply the program to the computer via a wired communication path such as an electric wire and an optical fiber, or a wireless communication path.
  • Appendix 2 The information according to Appendix 1, wherein the pair determining means determines the pair based on the section data constituting the section of the material data when the material data used as the material in the generation of the digest is divided into a plurality of sections. Processing equipment.
  • Appendix 3 The information processing apparatus according to Appendix 2, further comprising a second digest candidate selection means for selecting the section data to be the second digest candidate based on the degree of relevance.
  • the pair determination means determines the two section data as the first digest candidate as the pair.
  • the information processing apparatus according to Appendix 3, wherein the second digest candidate selection means selects the first digest candidate to be the second digest candidate based on the degree of relevance.
  • the pair determination means determines the first digest candidate and the non-first digest candidate, which is the section data that is not the first digest candidate, as the pair.
  • the information processing apparatus according to Appendix 3, wherein the second digest candidate selection means selects the non-first digest candidate to be the second digest candidate based on the degree of relevance.
  • the relevance calculation means uses a pair of section data of the learning digest created from the learning material data as a positive example, and a pair of section data of the learning material data other than the positive example.
  • the information processing apparatus according to any one of Supplementary note 1 to 9, which calculates the degree of relevance based on the relevance inferior learned as a negative example.
  • the relevance calculation means reproduces a pair of data when it is assumed that the pair of data is extracted from the same material data when a pair of data including at least one of video data or sound data is input.
  • the information processing apparatus according to any one of Supplementary note 1 to 9, which calculates the degree of relevance based on a relevance inferior learned to output information corresponding to a time difference.
  • a storage medium in which a program for operating a computer as a relevance calculation means for calculating a relevance indicating the degree of probability that the pair is simultaneously included in the digest is stored.

Abstract

An information processing device 1X primarily comprises a pair determination means 15X and a relevance calculating means 16X. The pair determination means 15X determines a pair of pieces of data that will serve as a first digest candidate in which at least one of the pieces of data is a digest candidate, the pair of pieces of data including image data and/or sound data. The relevance calculating means 16X calculates a relevance indicating the level of likelihood that the pair determined by the pair determination means 15X will be simultaneously included in a digest.

Description

情報処理装置、制御方法及び記憶媒体Information processing equipment, control method and storage medium
 本開示は、ダイジェストの生成に関する処理を行う情報処理装置、制御方法及び記憶媒体の技術分野に関する。 The present disclosure relates to technical fields of information processing devices, control methods, and storage media that perform processing related to digest generation.
 素材となる映像データを編集してダイジェストを生成する技術が存在する。例えば、特許文献1には、グランドでのスポーツイベントの映像ストリームからハイライトを確認して製作する方法が開示されている。 There is a technology to generate a digest by editing the video data that is the material. For example, Patent Document 1 discloses a method of confirming and producing highlights from a video stream of a sporting event on the ground.
特表2019-522948号公報Special Table 2019-522948 Gazette
 素材となる映像に対して重要度を算出し、その重要度に基づいてダイジェスト生成を自動生成する場合、関連性が低いシーン同士を組み合わせた結果、ストーリーの理解が困難なダイジェストが生成される可能性があった。 When the importance is calculated for the video as the material and the digest generation is automatically generated based on the importance, it is possible to generate a digest that makes it difficult to understand the story as a result of combining scenes with low relevance. There was sex.
 本開示の目的は、上記の課題を勘案し、ダイジェスト生成に好適な情報を生成することが可能な情報処理装置、制御方法及び記憶媒体を提供することである。 An object of the present disclosure is to provide an information processing device, a control method, and a storage medium capable of generating information suitable for digest generation in consideration of the above problems.
 情報処理装置の一の態様は、映像データ又は音データの少なくとも一方を含むデータのペアであって、少なくとも一方のデータがダイジェストの候補である第1ダイジェスト候補となる前記ペアを決定するペア決定手段と、前記ペアが同時に前記ダイジェストに含まれる蓋然性の度合いを示す関連度を算出する関連度算出手段と、を有する情報処理装置である。 One aspect of the information processing apparatus is a pair determining means for determining a pair of data including at least one of video data or sound data, wherein at least one of the data is a first digest candidate which is a digest candidate. An information processing apparatus comprising, and a relevance calculation means for calculating a relevance degree indicating the degree of probability that the pair is simultaneously included in the digest.
 制御方法の一の態様は、コンピュータにより、映像データ又は音データの少なくとも一方を含むデータのペアであって、少なくとも一方のデータがダイジェストの候補である第1ダイジェスト候補となる前記ペアを決定し、前記ペアが同時に前記ダイジェストに含まれる蓋然性の度合いを示す関連度を算出する、制御方法である。
One aspect of the control method is to determine by a computer the pair of data including at least one of video data or sound data, and the pair in which at least one data is a first digest candidate, which is a digest candidate. It is a control method for calculating the degree of association indicating the degree of probability that the pair is included in the digest at the same time.
 記憶媒体の一の態様は、映像データ又は音データの少なくとも一方を含むデータのペアであって、少なくとも一方のデータがダイジェストの候補である第1ダイジェスト候補となる前記ペアを決定するペア決定手段と、前記ペアが同時に前記ダイジェストに含まれる蓋然性の度合いを示す関連度を算出する関連度算出手段としてコンピュータを機能させるプログラムが格納された記憶媒体である。 One aspect of the storage medium is a pair of data including at least one of video data or sound data, and a pair determining means for determining the pair in which at least one data is a first digest candidate which is a digest candidate. , A storage medium in which a program for operating a computer as a relevance calculation means for calculating a relevance indicating the degree of probability that the pair is simultaneously included in the digest is stored.
 本開示によれば、ダイジェスト生成に好適な情報を生成することができる。 According to the present disclosure, it is possible to generate information suitable for digest generation.
第1実施形態におけるダイジェスト候補選定システムの構成を示す。The configuration of the digest candidate selection system in the first embodiment is shown. 情報処理装置のハードウェア構成を示す。The hardware configuration of the information processing device is shown. 情報処理装置の機能ブロックの一例である。This is an example of a functional block of an information processing device. 第1選定例及び第2選定例に共通する第1ダイジェスト候補の選定処理の概要を示す図である。It is a figure which shows the outline of the selection process of the 1st digest candidate common to the 1st selection example and the 2nd selection example. 第1ダイジェスト候補の選定後における第1選定例に係る第2ダイジェスト候補の選定処理の概要を示す図である。It is a figure which shows the outline of the selection process of the 2nd digest candidate which concerns on the 1st selection example after the selection of the 1st digest candidate. 第1ダイジェスト候補の選定後における第2選定例に係る第2ダイジェスト候補の選定処理の概要を示す図である。It is a figure which shows the outline of the selection process of the 2nd digest candidate which concerns on the 2nd selection example after the selection of the 1st digest candidate. 関連度推論器情報を生成する学習システムの概略構成図である。It is a schematic block diagram of the learning system which generates the relevance inferior information. 学習装置の機能ブロック構成の一例を示す。An example of the functional block configuration of the learning device is shown. 第1実施形態において情報処理装置が実行する処理の手順を示すフローチャートの一例である。This is an example of a flowchart showing a procedure of processing executed by the information processing apparatus in the first embodiment. 第1実施形態において学習装置が実行する処理の手順を示すフローチャートの一例である。This is an example of a flowchart showing a procedure of processing executed by the learning device in the first embodiment. 変形例2に係る情報処理装置の機能ブロック図の一例である。It is an example of the functional block diagram of the information processing apparatus which concerns on modification 2. 変形例2において情報処理装置が実行する処理の手順を示すフローチャートの一例である。This is an example of a flowchart showing a procedure of processing executed by the information processing apparatus in the second modification. 第2実施形態における情報処理装置の機能ブロック図である。It is a functional block diagram of the information processing apparatus in 2nd Embodiment. 第2実施形態において情報処理装置が実行するフローチャートの一例である。This is an example of a flowchart executed by the information processing apparatus in the second embodiment.
 以下、図面を参照しながら、情報処理装置、制御方法及び記憶媒体の実施形態について説明する。 Hereinafter, embodiments of an information processing device, a control method, and a storage medium will be described with reference to the drawings.
 <第1実施形態>
 (1)システム構成
 図1は、第1実施形態に係るダイジェスト候補選定システム100の構成を示す。ダイジェスト候補選定システム100は、素材となる映像データ(「素材データ」とも呼ぶ。)からダイジェストの候補となる映像データを好適に選定する。ダイジェスト候補選定システム100は、主に、情報処理装置1と、入力装置2と、出力装置3と、記憶装置4とを備える。
<First Embodiment>
(1) System Configuration Figure 1 shows the configuration of the digest candidate selection system 100 according to the first embodiment. The digest candidate selection system 100 preferably selects video data as a digest candidate from video data (also referred to as “material data”) as a material. The digest candidate selection system 100 mainly includes an information processing device 1, an input device 2, an output device 3, and a storage device 4.
 情報処理装置1は、通信網を介し、又は、無線若しくは有線による直接通信により、入力装置2、及び出力装置3とデータ通信を行う。情報処理装置1は、記憶装置4に記憶された素材データD1から、同じく記憶装置4に記憶された関連度推論器情報D2及び重要度推論器情報D3を参照し、ダイジェストの候補となる映像データを選定する。そして、情報処理装置1は、上述の選定結果に関する出力信号「S1」を生成し、生成した出力信号S1を出力装置に供給する。 The information processing device 1 performs data communication with the input device 2 and the output device 3 via a communication network or by direct communication by radio or wire. The information processing device 1 refers to the relevance inferior information D2 and the importance inferior information D3 also stored in the storage device 4 from the material data D1 stored in the storage device 4, and is a video data that is a candidate for a digest. To select. Then, the information processing device 1 generates an output signal "S1" related to the above selection result, and supplies the generated output signal S1 to the output device.
 入力装置2は、ユーザ入力を受け付ける任意のユーザインターフェースであり、例えば、ボタン、キーボード、マウス、タッチパネル、音声入力装置などが該当する。入力装置2は、ユーザ入力に基づき生成した入力信号「S2」を、情報処理装置1へ供給する。出力装置3は、例えば、ディスプレイ、プロジェクタ等の表示装置、及び、スピーカ等の音出力装置であり、情報処理装置1から供給される出力信号S1に基づき、所定の表示又は音出力を行う。 The input device 2 is an arbitrary user interface that accepts user input, and corresponds to, for example, a button, a keyboard, a mouse, a touch panel, a voice input device, and the like. The input device 2 supplies the input signal "S2" generated based on the user input to the information processing device 1. The output device 3 is, for example, a display device such as a display or a projector, and a sound output device such as a speaker, and performs a predetermined display or sound output based on the output signal S1 supplied from the information processing device 1.
 記憶装置4は、情報処理装置1の処理に必要な各種情報を記憶するメモリである。記憶装置4は、例えば、素材データD1と、関連度推論器情報D2と、重要度推論器情報D3とを記憶する。 The storage device 4 is a memory for storing various information necessary for processing of the information processing device 1. The storage device 4 stores, for example, the material data D1, the relevance inferior information D2, and the importance inferrer information D3.
 素材データD1は、ダイジェストの生成において編集される対象となる映像データである。以後では、素材データD1から抽出される、所定の再生時間長の区間に対応する映像データを、「区間データ」とも呼ぶ。各区間データは、1枚以上となる所定枚数の時系列の画像を含む。第1実施形態では、情報処理装置1は、素材データD1を単位区間毎に分割した区間データを対象として、重要度の算出や関連度を算出するペアの生成を行う。 The material data D1 is video data to be edited in the generation of the digest. Hereinafter, the video data corresponding to the section having a predetermined reproduction time length extracted from the material data D1 is also referred to as “section data”. Each section data includes a predetermined number of time-series images, which is one or more. In the first embodiment, the information processing apparatus 1 calculates the importance and generates a pair for calculating the relevance of the section data obtained by dividing the material data D1 for each unit interval.
 関連度推論器情報D2は、区間データのペア(「推論対象ペアPtag」とも呼ぶ。)に対して関連度を推論する推論器(「関連度推論器」とも呼ぶ。)に関する情報である。関連度は、推論対象ペアPtagが同時にダイジェストに含まれるか否かという観点での関連性を示す指標であり、言い換えると、推論対象ペアPtagが同時にダイジェストに含まれる蓋然性(又は妥当性)の度合いを示す指標である。関連度推論器は、区間データのペアに相当する所定枚数(1枚以上)の画像のペアが入力された場合に、これらの関連度を推論するように予め学習され、関連度推論器情報D2には、学習された関連度推論器のパラメータが含まれる。 Relevance inference device information D2 is information about an inference device (also referred to as "relevance inference device") that infers the relevance to a pair of interval data (also referred to as "inference target pair Ptag"). The degree of relevance is an index showing the relevance in terms of whether or not the inferred pair Ptag is simultaneously included in the digest, in other words, the degree of probability (or validity) that the inferred pair Ptag is simultaneously included in the digest. It is an index showing. The relevance inferior is learned in advance to infer these relevances when a predetermined number (one or more) of image pairs corresponding to the pair of interval data are input, and the relevance inferior information D2. Contains the parameters of the learned relevance inferior.
 重要度推論器情報D3は、区間データに対して重要度を推論する推論器(「重要度推論器」とも呼ぶ。)に関する情報である。上述の重要度は、ダイジェストの生成において入力された映像データに該当する素材データD1における区間が重要区間であるか又は非重要区間であるかを判定するための基準となる指標である。重要度推論器は、区間データを構成する所定枚数(1枚以上)の画像が入力された場合に、対象の区間の重要度を推論するように予め学習され、重要度推論器情報D3には、学習された重要度推論器のパラメータが含まれる。 Importance inferrer information D3 is information about an inferior (also referred to as "importance inferior") that infers importance to interval data. The above-mentioned importance is an index as a reference for determining whether the section in the material data D1 corresponding to the video data input in the generation of the digest is an important section or a non-important section. The importance inferior is learned in advance so as to infer the importance of the target section when a predetermined number (one or more) of images constituting the section data are input, and the importance inferior information D3 is used. Includes learned importance inferrer parameters.
 関連度推論器及び重要度推論器の学習モデルは、それぞれ、ニューラルネットワーク又はサポートベクターマシンなどの任意の機械学習に基づく学習モデルであってもよい。例えば、上述の関連度推論器及び重要度推論器のモデルが畳み込みニューラルネットワークなどのニューラルネットワークである場合、関連度推論器情報D2及び重要度推論器情報D3は、層構造、各層のニューロン構造、各層におけるフィルタ数及びフィルタサイズ、並びに各フィルタの各要素の重みなどの各種パラメータを含む。 The learning model of the relevance inferior and the importance inferior may be a learning model based on arbitrary machine learning such as a neural network or a support vector machine, respectively. For example, when the above-mentioned model of the relevance inferior and the importance inferior is a neural network such as a convolutional neural network, the relevance inferior information D2 and the importance inferior information D3 are a layer structure, a neuron structure of each layer, and the like. Includes various parameters such as the number and size of filters in each layer and the weight of each element of each filter.
 なお、記憶装置4は、情報処理装置1に接続又は内蔵されたハードディスクなどの外部記憶装置であってもよく、フラッシュメモリなどの記憶媒体であってもよい。また、記憶装置4は、情報処理装置1とデータ通信を行うサーバ装置であってもよい。また、記憶装置4は、複数の装置から構成されてもよい。この場合、記憶装置4は、素材データD1、関連度推論器情報D2及び重要度推論器情報D3を分散して記憶してもよい。 The storage device 4 may be an external storage device such as a hard disk connected to or built in the information processing device 1, or may be a storage medium such as a flash memory. Further, the storage device 4 may be a server device that performs data communication with the information processing device 1. Further, the storage device 4 may be composed of a plurality of devices. In this case, the storage device 4 may store the material data D1, the relevance inferior information D2, and the importance inferior information D3 in a distributed manner.
 以上において説明したダイジェスト候補選定システム100の構成は一例であり、当該構成に種々の変更が行われてもよい。例えば、入力装置2及び出力装置3は、一体となって構成されてもよい。この場合、入力装置2及び出力装置3は、情報処理装置1と一体となるタブレット型端末として構成されてもよい。また、情報処理装置1は、複数の装置から構成されてもよい。この場合、情報処理装置1を構成する複数の装置は、予め割り当てられた処理を実行するために必要な情報の授受を、これらの複数の装置間において行う。 The configuration of the digest candidate selection system 100 described above is an example, and various changes may be made to the configuration. For example, the input device 2 and the output device 3 may be integrally configured. In this case, the input device 2 and the output device 3 may be configured as a tablet-type terminal integrated with the information processing device 1. Further, the information processing device 1 may be composed of a plurality of devices. In this case, the plurality of devices constituting the information processing device 1 exchange information necessary for executing the pre-assigned process among the plurality of devices.
 (2)情報処理装置のハードウェア構成
 図2は、情報処理装置1のハードウェア構成を示す。情報処理装置1は、ハードウェアとして、プロセッサ11と、メモリ12と、インターフェース13とを含む。プロセッサ11、メモリ12及びインターフェース13は、データバス19を介して接続されている。
(2) Hardware Configuration of Information Processing Device FIG. 2 shows the hardware configuration of the information processing device 1. The information processing apparatus 1 includes a processor 11, a memory 12, and an interface 13 as hardware. The processor 11, the memory 12, and the interface 13 are connected via the data bus 19.
 プロセッサ11は、メモリ12に記憶されているプログラムを実行することにより、所定の処理を実行する。プロセッサ11は、CPU(Central Processing Unit)、GPU(Graphics Processing Unit)、量子プロセッサなどのプロセッサである。 The processor 11 executes a predetermined process by executing the program stored in the memory 12. The processor 11 is a processor such as a CPU (Central Processing Unit), a GPU (Graphics Processing Unit), and a quantum processor.
 メモリ12は、RAM(Random Access Memory)、ROM(Read Only Memory)などの各種の揮発性メモリ及び不揮発性メモリにより構成される。また、メモリ12には、情報処理装置1が実行するプログラムが記憶される。また、メモリ12は、作業メモリとして使用され、記憶装置4から取得した情報等を一時的に記憶する。なお、メモリ12は、記憶装置4として機能してもよい。同様に、記憶装置4は、情報処理装置1のメモリ12として機能してもよい。なお、情報処理装置1が実行するプログラムは、メモリ12以外の記憶媒体に記憶されてもよい。 The memory 12 is composed of various volatile memories such as RAM (Random Access Memory) and ROM (Read Only Memory) and non-volatile memory. Further, the memory 12 stores a program executed by the information processing apparatus 1. Further, the memory 12 is used as a working memory and temporarily stores information and the like acquired from the storage device 4. The memory 12 may function as the storage device 4. Similarly, the storage device 4 may function as the memory 12 of the information processing device 1. The program executed by the information processing apparatus 1 may be stored in a storage medium other than the memory 12.
 インターフェース13は、情報処理装置1と他の装置とを電気的に接続するためのインターフェースである。例えば、情報処理装置1と他の装置とを接続するためのインターフェースは、プロセッサ11の制御に基づき他の装置とデータの送受信を有線又は無線により行うためのネットワークアダプタなどの通信インターフェースであってもよい。他の例では、情報処理装置1と他の装置とはケーブル等により接続されてもよい。この場合、インターフェース13は、他の装置とデータの授受を行うためのUSB(Universal Serial Bus)、SATA(Serial AT Attachment)などに準拠したハードウェアインターフェースを含む。 The interface 13 is an interface for electrically connecting the information processing device 1 and another device. For example, the interface for connecting the information processing device 1 and another device may be a communication interface such as a network adapter for transmitting / receiving data to / from another device based on the control of the processor 11 by wire or wirelessly. good. In another example, the information processing apparatus 1 and the other apparatus may be connected by a cable or the like. In this case, the interface 13 includes a hardware interface compliant with USB (Universal Serial Bus), SATA (Serial AT Atchment), etc. for exchanging data with other devices.
 なお、情報処理装置1のハードウェア構成は、図2に示す構成に限定されない。例えば、情報処理装置1は、入力装置2又は出力装置3の少なくとも一方を含んでもよい。 The hardware configuration of the information processing device 1 is not limited to the configuration shown in FIG. For example, the information processing device 1 may include at least one of an input device 2 and an output device 3.
 (3)機能ブロック
 次に、情報処理装置1の機能ブロックについて説明する。ここでは、情報処理装置1は、素材データD1の区間データから暫定的なダイジェスト候補(「第1ダイジェスト候補」とも呼ぶ。)を選定し、第1ダイジェスト候補を基準として最終的に出力すべきダイジェスト候補(「第2ダイジェスト候補」とも呼ぶ。)を選定する。
(3) Functional block Next, the functional block of the information processing apparatus 1 will be described. Here, the information processing apparatus 1 selects a provisional digest candidate (also referred to as a “first digest candidate”) from the section data of the material data D1, and finally outputs the digest based on the first digest candidate. Select candidates (also called "second digest candidates").
 図3は、情報処理装置1のプロセッサ11の機能ブロックの一例である。情報処理装置1のプロセッサ11は、機能的には、第1ダイジェスト候補選定部14と、ペア決定部15と、関連度算出部16と、第2ダイジェスト候補選定部17と、出力制御部18とを有する。なお、図3では、データの授受が行われるブロック同士を実線により結んでいるが、データの授受が行われるブロックの組合せは図3に限定されない。後述する他の機能ブロックの図においても同様である。 FIG. 3 is an example of a functional block of the processor 11 of the information processing device 1. Functionally, the processor 11 of the information processing device 1 includes a first digest candidate selection unit 14, a pair determination unit 15, a relevance calculation unit 16, a second digest candidate selection unit 17, and an output control unit 18. Has. In FIG. 3, the blocks in which data is exchanged are connected by a solid line, but the combination of blocks in which data is exchanged is not limited to FIG. The same applies to the figures of other functional blocks described later.
 第1ダイジェスト候補選定部14は、素材データD1を構成する区間データの各々に対して重要度を算出し、算出した重要度に基づき、素材データD1を構成する区間データから第1ダイジェスト候補を選定する。ここで、区間データは、素材データD1を単位時間長の区間により区切ったデータであり、1枚以上の所定枚数分の画像を含むデータとなる。そして、第1ダイジェスト候補選定部14は、例えば、重要度推論器情報D3を参照することで重要度推論器を構成し、素材データD1から抽出した区間データを重要度推論器に順次入力することで、全ての区間データにそれぞれ対応する重要度を取得する。そして、第1ダイジェスト候補選定部14は、重要度が予め設定された閾値以上となる区間データを、第1ダイジェスト候補として選定する。以後では、第1ダイジェスト候補ではない区間データを、「非第1ダイジェスト候補」とも呼ぶ。第1ダイジェスト候補選定部14は、第1ダイジェスト候補に関する情報(「第1ダイジェスト候補情報Idc1」とも呼ぶ。)をペア決定部15に供給する。 The first digest candidate selection unit 14 calculates the importance of each of the section data constituting the material data D1, and selects the first digest candidate from the section data constituting the material data D1 based on the calculated importance. do. Here, the section data is data in which the material data D1 is divided by a section having a unit time length, and is data including one or more images for a predetermined number of sheets. Then, the first digest candidate selection unit 14 configures the importance inference device by referring to the importance inference device information D3, and sequentially inputs the section data extracted from the material data D1 into the importance inference device. Then, the importance corresponding to all the section data is acquired. Then, the first digest candidate selection unit 14 selects the section data whose importance is equal to or higher than the preset threshold value as the first digest candidate. Hereinafter, the section data that is not the first digest candidate is also referred to as a “non-first digest candidate”. The first digest candidate selection unit 14 supplies information regarding the first digest candidate (also referred to as “first digest candidate information Idc1”) to the pair determination unit 15.
 ペア決定部15は、第1ダイジェスト候補選定部14が決定した第1ダイジェスト候補情報Idc1に基づき、素材データD1から抽出された2つの区間データを組み合わせた複数の推論対象ペアPtagを決定する。この場合、第1の例では、ペア決定部15は、第1ダイジェスト候補となる任意の2つの区間データを組み合わせた推論対象ペアPtagを決定する。第2の例では、ペア決定部15は、第1ダイジェスト候補となる区間データと、非第1ダイジェスト候補となる区間データとを任意に組み合わせた推論対象ペアPtagを決定する。そして、ペア決定部15は、決定した推論対象ペアPtagを関連度算出部16に供給する。 The pair determination unit 15 determines a plurality of inference target pairs Ptag that combines two section data extracted from the material data D1 based on the first digest candidate information Idc1 determined by the first digest candidate selection unit 14. In this case, in the first example, the pair determination unit 15 determines an inference target pair Ptag that combines arbitrary two interval data that are candidates for the first digest. In the second example, the pair determination unit 15 determines an inference target pair Ptag that arbitrarily combines the section data that is the first digest candidate and the section data that is the non-first digest candidate. Then, the pair determination unit 15 supplies the determined inference target pair Ptag to the relevance calculation unit 16.
 なお、ペア決定部15は、情報処理装置1の全体の処理負荷低減のため、推論対象ペアPtagを以下に述べる方法により限定してもよい。例えば、ペア決定部15は、対応する再生時刻の差が所定時間差以内となる2つの区間データを、推論対象ペアPtagとして決定してもよい。他の例では、ペア決定部15は、所定の時間間隔毎に素材データD1から抽出された区間データのみを組み合わせの対象として、推論対象ペアPtagを決定してもよい。さらに別の例では、ペア決定部15は、区間データに対して任意のクラスタリング手法を適用してクラス分けを行い、所定のクラスに属する区間データのみを組み合わせの対象として、推論対象ペアPtagを決定してもよい。 The pair determination unit 15 may limit the inference target pair Ptag by the method described below in order to reduce the overall processing load of the information processing apparatus 1. For example, the pair determination unit 15 may determine two section data in which the difference between the corresponding reproduction times is within a predetermined time difference as the inference target pair Ptag. In another example, the pair determination unit 15 may determine the inference target pair Ptag by targeting only the interval data extracted from the material data D1 at predetermined time intervals. In yet another example, the pair determination unit 15 applies an arbitrary clustering method to the interval data to classify the interval data, and determines the inference target pair Ptag with only the interval data belonging to a predetermined class as the target of combination. You may.
 関連度算出部16は、ペア決定部15から供給される推論対象ペアPtagの各々に対して関連度を算出する。この場合、関連度算出部16は、関連度推論器情報D2を参照することで関連度推論器を構成し、ペア決定部15から取得した推論対象ペアPtagを関連度推論器に順次入力することで、推論対象ペアPtagの各々に対する関連度を算出する。そして、関連度算出部16は、算出した関連度を示す情報(「関連度情報Ia」とも呼ぶ。)を、第2ダイジェスト候補選定部17に供給する。 The relevance calculation unit 16 calculates the relevance degree for each of the inference target pair Ptag supplied from the pair determination unit 15. In this case, the relevance calculation unit 16 configures the relevance inference device by referring to the relevance inference device information D2, and sequentially inputs the inference target pair Ptag acquired from the pair determination unit 15 into the relevance inference device. Then, the degree of relevance to each of the inference target pair Ptag is calculated. Then, the relevance calculation unit 16 supplies the calculated relevance information (also referred to as “relevance information Ia”) to the second digest candidate selection unit 17.
 第2ダイジェスト候補選定部17は、関連度算出部16から供給される関連度情報Iaに基づき、第2ダイジェスト候補を選定する。第2ダイジェスト候補選定部17は、選定した第2ダイジェスト候補に関する情報(「第2ダイジェスト候補情報Idc2」とも呼ぶ。)を出力制御部18へ供給する。ここで、第2ダイジェスト候補情報Idc2は、第2ダイジェスト候補となる区間データそのものであってもよく、第2ダイジェスト候補となる区間データの時間情報(素材データD1における再生時刻を示す情報)であってもよい。 The second digest candidate selection unit 17 selects the second digest candidate based on the relevance information Ia supplied from the relevance calculation unit 16. The second digest candidate selection unit 17 supplies information regarding the selected second digest candidate (also referred to as “second digest candidate information Idc2”) to the output control unit 18. Here, the second digest candidate information Idc2 may be the section data itself that is the second digest candidate, and is the time information (information indicating the reproduction time in the material data D1) of the section data that is the second digest candidate. You may.
 ここで、第1ダイジェスト候補から推論対象ペアPtagが選定された場合、第2ダイジェスト候補選定部17は、関連度が予め定めた閾値以上となる推論対象ペアPtagを構成する第1ダイジェスト候補を、第2ダイジェスト候補として選定する。これにより、第2ダイジェスト候補選定部17は、第1ダイジェスト候補から関連度に基づき絞り込んだ第2ダイジェスト候補を好適に決定することができる。一方、第1ダイジェスト候補と非第1ダイジェスト候補との組み合わせを推論対象ペアPtagとした場合、第2ダイジェスト候補選定部17は、関連度が閾値以上となる推論対象ペアPtagの非第1ダイジェスト候補を、第2ダイジェスト候補に加える。この場合、第2ダイジェスト候補選定部17は、第1ダイジェスト候補と関連度が高い非第1ダイジェスト候補を、第2ダイジェスト候補に好適に取り込むことができる。 Here, when the inference target pair Ptag is selected from the first digest candidate, the second digest candidate selection unit 17 selects the first digest candidate constituting the inference target pair Ptag whose relevance is equal to or higher than a predetermined threshold value. Select as a second digest candidate. As a result, the second digest candidate selection unit 17 can suitably determine the second digest candidate narrowed down from the first digest candidate based on the degree of relevance. On the other hand, when the combination of the first digest candidate and the non-first digest candidate is the inference target pair Ptag, the second digest candidate selection unit 17 is the non-first digest candidate of the inference target pair Ptag whose relevance is equal to or higher than the threshold value. Is added to the second digest candidate. In this case, the second digest candidate selection unit 17 can suitably incorporate the non-first digest candidate, which has a high degree of relevance to the first digest candidate, into the second digest candidate.
 出力制御部18は、第2ダイジェスト候補選定部17から供給される第2ダイジェスト候補情報Idc2に基づく出力制御を行う。第1の例では、出力制御部18は、第2ダイジェスト候補情報Idc2に関する出力信号S1を生成し、生成した出力信号S1を、インターフェース13を介して出力装置3に送信する。この場合、出力制御部18は、例えば、第2ダイジェスト候補となる区間データを再生するための出力信号S1を出力装置3に送信することで、第2ダイジェスト候補となる区間データを出力装置3に再生させてもよい。これにより、出力制御部18は、第2ダイジェスト候補のダイジェストとしての適否を閲覧者に確認させることができる。第2の例では、出力制御部18は、第2ダイジェスト候補情報Idc2を、インターフェース13を介して記憶装置4に記憶する。第3の例では、出力制御部18は、第2ダイジェスト候補情報Idc2を、最終的なダイジェストの生成処理を行う外部装置にインターフェース13を介して送信する。 The output control unit 18 performs output control based on the second digest candidate information Idc2 supplied from the second digest candidate selection unit 17. In the first example, the output control unit 18 generates an output signal S1 related to the second digest candidate information Idc2, and transmits the generated output signal S1 to the output device 3 via the interface 13. In this case, the output control unit 18 transmits, for example, an output signal S1 for reproducing the section data as the second digest candidate to the output device 3, so that the section data as the second digest candidate is transmitted to the output device 3. You may regenerate it. As a result, the output control unit 18 can make the viewer confirm the suitability of the second digest candidate as a digest. In the second example, the output control unit 18 stores the second digest candidate information Idc2 in the storage device 4 via the interface 13. In the third example, the output control unit 18 transmits the second digest candidate information Idc2 to the external device that performs the final digest generation process via the interface 13.
 なお、図3において説明した第1ダイジェスト候補選定部14、ペア決定部15、関連度算出部16、第2ダイジェスト候補選定部17及び出力制御部18の各構成要素は、例えば、プロセッサ11が記憶装置4又はメモリ12に格納されたプログラムを実行することによって実現できる。また、必要なプログラムを任意の不揮発性記憶媒体に記録しておき、必要に応じてインストールすることで、各構成要素を実現するようにしてもよい。なお、これらの各構成要素は、プログラムによるソフトウェアで実現することに限ることなく、ハードウェア、ファームウェア、及びソフトウェアのうちのいずれかの組み合わせ等により実現してもよい。また、これらの各構成要素は、例えばFPGA(field-programmable gate array)又はマイコン等の、ユーザがプログラミング可能な集積回路を用いて実現してもよい。この場合、この集積回路を用いて、上記の各構成要素から構成されるプログラムを実現してもよい。このように、各構成要素は、プロセッサ以外のハードウェアを含む任意のコントローラにより実現されてもよい。以上のことは、後述する他の実施の形態においても同様である。 The processor 11 stores, for example, each component of the first digest candidate selection unit 14, the pair determination unit 15, the relevance calculation unit 16, the second digest candidate selection unit 17, and the output control unit 18 described in FIG. This can be achieved by executing a program stored in the device 4 or the memory 12. Further, each component may be realized by recording a necessary program in an arbitrary non-volatile storage medium and installing it as needed. It should be noted that each of these components is not limited to being realized by software by a program, and may be realized by any combination of hardware, firmware, and software. Further, each of these components may be realized by using a user-programmable integrated circuit such as an FPGA (field-programmable gate array) or a microcomputer. In this case, this integrated circuit may be used to realize a program composed of each of the above components. As described above, each component may be realized by any controller including hardware other than the processor. The above is the same in other embodiments described later.
 (4)具体例
 次に、第2ダイジェスト候補の選定に関する具体例について説明する。以後では、第1ダイジェスト候補から第2ダイジェスト候補を選定する第1選定例と、第1ダイジェスト候補に加えて第1ダイジェスト候補と関連が高い非第1ダイジェスト候補を第2ダイジェスト候補として選定する第2選定例について説明する。
(4) Specific Examples Next, specific examples regarding the selection of the second digest candidate will be described. Hereinafter, the first selection example in which the second digest candidate is selected from the first digest candidate, and the non-first digest candidate which is highly related to the first digest candidate in addition to the first digest candidate are selected as the second digest candidate. 2 An example of selection will be described.
 図4は、第1選定例及び第2選定例に共通する第1ダイジェスト候補の選定処理の概要を示す図である。 FIG. 4 is a diagram showing an outline of the selection process of the first digest candidate common to the first selection example and the second selection example.
 まず、第1ダイジェスト候補選定部14は、素材データD1から単位時間長の複数の区間データを抽出し、重要度推論器情報D3を参照することで構成した重要度推論器に対し、抽出した区間データを順次入力する。これにより、第1ダイジェスト候補選定部14は、区間データの各々の重要度を算出する。そして、第1ダイジェスト候補選定部14は、所定の閾値以上の重要度となる区間データを、第1ダイジェスト候補とみなし、当該閾値未満の重要度となる区間データを、非第1ダイジェスト候補とみなす。 First, the first digest candidate selection unit 14 extracts a plurality of section data having a unit time length from the material data D1 and refers to the importance inferrer information D3 for the extracted section. Enter the data sequentially. As a result, the first digest candidate selection unit 14 calculates the importance of each of the section data. Then, the first digest candidate selection unit 14 considers the section data having an importance of more than a predetermined threshold value as the first digest candidate, and the section data having an importance less than the threshold value as a non-first digest candidate. ..
 図5は、第1ダイジェスト候補の選定後における第1選定例に係る第2ダイジェスト候補の選定処理の概要を示す図である。 FIG. 5 is a diagram showing an outline of the selection process of the second digest candidate according to the first selection example after the selection of the first digest candidate.
 第1選定例では、ペア決定部15は、第1ダイジェスト候補となる区間データ同士を組み合わせた推論対象ペアPtagを決定する。この場合、ペア決定部15は、第1ダイジェスト候補同士のペアの全組合せを推論対象ペアPtagとして決定してもよく、第1ダイジェスト候補同士のペアの一部を推論対象ペアPtagとして決定してもよい。 In the first selection example, the pair determination unit 15 determines the inference target pair Ptag that combines the section data that are the first digest candidates. In this case, the pair determination unit 15 may determine all combinations of pairs of the first digest candidates as the inference target pair Ptag, or determine a part of the pair of the first digest candidates as the inference target pair Ptag. May be good.
 そして、関連度算出部16は、ペア決定部15が決定した推論対象ペアPtagの各々を、関連度推論器情報D2を参照して構成した関連度推論器に入力する。これにより、関連度算出部16は、推論対象ペアPtagの各々の関連度を算出する。そして、第2ダイジェスト候補選定部17は、算出した関連度が所定の閾値以上となる推論対象ペアPtagを構成する区間データを、第2ダイジェスト候補として選定する。これにより、第2ダイジェスト候補選定部17は、第1ダイジェスト候補となる区間データから、関連度が高い区間データのみを第2ダイジェスト候補として好適に絞り込むことができる。 Then, the relevance calculation unit 16 inputs each of the inference target pairs Ptag determined by the pair determination unit 15 into the relevance inference device configured with reference to the relevance inference device information D2. As a result, the relevance calculation unit 16 calculates the relevance of each of the inference target pair Ptag. Then, the second digest candidate selection unit 17 selects the section data constituting the inference target pair Ptag whose calculated relevance degree is equal to or higher than a predetermined threshold value as the second digest candidate. As a result, the second digest candidate selection unit 17 can suitably narrow down only the section data having a high degree of relevance as the second digest candidate from the section data that becomes the first digest candidate.
 ここで、第1ダイジェスト候補同士のペアの一部を推論対象ペアPtagとして決定する方法について補足説明する。第1決定方法では、ペア決定部15は、再生時刻が所定時間差以内となる第1ダイジェスト候補の区間データのペアのみを、推論対象ペアPtagとして選定する。第2決定方法では、ペア決定部15は、一定の時間(例えば2秒)間隔により素材データD1から選定した区間データに該当する第1ダイジェスト候補のみを対象として推論対象ペアPtagを決定する。第3選定方法では、ペア決定部15は、第1ダイジェスト候補となる区間データのクラスタリングを行い、特定のクラスに含まれる区間データのみを対象として推論対象ペアPtagを決定する。この場合、ペア決定部15は、例えば、所定の特徴量抽出処理を各区間データに対して行い、抽出した特徴量に基づき、予め設定したクラスのうちいずれのクラスに属するかの判定(即ちクラスの識別)を行う。他の例では、ペア決定部15は、入力装置2へのユーザ入力に基づきクラスタリングを行ってもよい。 Here, a supplementary explanation will be given on a method of determining a part of the pair of the first digest candidates as the inference target pair Ptag. In the first determination method, the pair determination unit 15 selects only the pair of the section data of the first digest candidate whose reproduction time is within a predetermined time difference as the inference target pair Ptag. In the second determination method, the pair determination unit 15 determines the inference target pair Ptag only for the first digest candidate corresponding to the section data selected from the material data D1 at regular time (for example, 2 seconds) intervals. In the third selection method, the pair determination unit 15 clusters the section data that is the first digest candidate, and determines the inference target pair Ptag only for the section data included in the specific class. In this case, the pair determination unit 15 performs, for example, a predetermined feature amount extraction process for each section data, and determines which of the preset classes belongs to (that is, the class) based on the extracted feature amount. Identification). In another example, the pair determination unit 15 may perform clustering based on the user input to the input device 2.
 図6は、第1ダイジェスト候補の選定後における第2選定例に係る第2ダイジェスト候補の選定処理の概要を示す図である。 FIG. 6 is a diagram showing an outline of the selection process of the second digest candidate according to the second selection example after the selection of the first digest candidate.
 第2選定例では、ペア決定部15は、第1ダイジェスト候補と非第1ダイジェスト候補とから夫々1つ取り出して組み合わせた推論対象ペアPtagを決定する。この場合、ペア決定部15は、第1ダイジェスト候補と非第1ダイジェスト候補のペアの全組合せを推論対象ペアPtagとして選定してもよく、これらの一部の組合せを推論対象ペアPtagとして選定してもよい。 In the second selection example, the pair determination unit 15 determines an inference target pair Ptag that is taken out from each of the first digest candidate and the non-first digest candidate and combined. In this case, the pair determination unit 15 may select all combinations of pairs of the first digest candidate and the non-first digest candidate as the inference target pair Ptag, and select some combinations thereof as the inference target pair Ptag. You may.
 そして、関連度算出部16は、ペア決定部15が決定した推論対象ペアPtagの各々を、関連度推論器情報D2を参照して構成した関連度推論器に入力する。これにより、関連度算出部16は、推論対象ペアPtagの各々の関連度を算出する。そして、第2ダイジェスト候補選定部17は、算出した関連度が所定の閾値以上となる推論対象ペアPtagを構成する非第1ダイジェスト候補と、全ての第1ダイジェスト候補とを、第2ダイジェスト候補として選定する。この場合、第2ダイジェスト候補選定部17は、第1ダイジェスト候補と関連度が高い非第1ダイジェスト候補を、第2ダイジェスト候補に取り込むことができる。これにより、ストーリー理解のために必要な重要シーンの周辺のシーンをダイジェスト候補に好適に取り込むことができる。 Then, the relevance calculation unit 16 inputs each of the inference target pairs Ptag determined by the pair determination unit 15 into the relevance inference device configured with reference to the relevance inference device information D2. As a result, the relevance calculation unit 16 calculates the relevance of each of the inference target pair Ptag. Then, the second digest candidate selection unit 17 sets the non-first digest candidate constituting the inference target pair Ptag whose calculated relevance is equal to or higher than a predetermined threshold value and all the first digest candidates as the second digest candidate. Select. In this case, the second digest candidate selection unit 17 can incorporate the non-first digest candidate having a high degree of relevance to the first digest candidate into the second digest candidate. As a result, the scenes around the important scenes necessary for understanding the story can be suitably incorporated into the digest candidates.
 ここで、第1ダイジェスト候補と非第1ダイジェスト候補とのペアの一部の組合せを推論対象ペアPtagとして選定する方法について補足説明する。第1選定方法では、ペア決定部15は、再生時刻が所定時間差以内となる第1ダイジェスト候補と非第1ダイジェスト候補との区間データのペアのみを、推論対象ペアPtagとして選定する。第2選定方法では、ペア決定部15は、一定の時間(例えば2秒)間隔により素材データD1から選定した区間データに該当する第1ダイジェスト候補及び非第1ダイジェスト候補のみを対象として推論対象ペアPtagを選定する。第3選定方法では、ペア決定部15は、第1ダイジェスト候補及び非第1ダイジェスト候補のクラスタリングを行い、特定のクラスに含まれる第1ダイジェスト候補及び非第1ダイジェスト候補のみを対象として推論対象ペアPtagを選定する。 Here, a supplementary explanation will be given on a method of selecting a partial combination of a pair of a first digest candidate and a non-first digest candidate as an inference target pair Ptag. In the first selection method, the pair determination unit 15 selects only the pair of the section data between the first digest candidate and the non-first digest candidate whose reproduction time is within a predetermined time difference as the inference target pair Ptag. In the second selection method, the pair determination unit 15 targets only the first digest candidate and the non-first digest candidate corresponding to the section data selected from the material data D1 at regular time (for example, 2 seconds) intervals, and the pair is inferred. Select Ptag. In the third selection method, the pair determination unit 15 clusters the first digest candidate and the non-first digest candidate, and the inference target pair targets only the first digest candidate and the non-first digest candidate included in the specific class. Select Ptag.
 (5)関連度推論器の学習
 次に、関連度推論器の学習による関連度推論器情報D2の生成について説明する。図7は、関連度推論器情報D2を生成する学習システムの概略構成図である。上記学習システムは、学習データD4を参照可能な学習装置6を有する。
(5) Learning of the relevance inferior device Next, the generation of the relevance inferior information D2 by learning the relevance inferior device will be described. FIG. 7 is a schematic configuration diagram of a learning system that generates the relevance inferior information D2. The learning system has a learning device 6 that can refer to the learning data D4.
 学習装置6は、例えば図2に示す情報処理装置1の構成と同一構成を有し、主に、プロセッサ21と、メモリ22と、インターフェース23とを有している。学習装置6は、情報処理装置1であってもよく、情報処理装置1以外の任意の装置であってもよい。 The learning device 6 has the same configuration as that of the information processing device 1 shown in FIG. 2, for example, and mainly has a processor 21, a memory 22, and an interface 23. The learning device 6 may be an information processing device 1 or any device other than the information processing device 1.
 学習データD4は、学習用の素材データである学習用素材データと、単位区間ごとに学習用素材データが重要か非重要かを示すラベルとを含む。ここでは、学習用素材データからダイジェスト(学習用ダイジェスト)を人手の作業により予め生成し、ダイジェストとして採用された学習用素材データの区間データには重要のラベルが付され、ダイジェスト以外の区間データには非重要のラベルが付されている。以後では、重要のラベルが付された学習用素材データ(即ち学習用ダイジェスト)の区間データを「重要データ」と呼び、非重要のラベルが付された学習用素材データの区間データを「非重要データ」と呼ぶ。 The learning data D4 includes learning material data which is learning material data and a label indicating whether the learning material data is important or non-important for each unit interval. Here, a digest (learning digest) is generated in advance from the learning material data by manual work, and important labels are attached to the section data of the learning material data adopted as the digest, and the section data other than the digest is assigned. Is labeled as non-important. Hereinafter, the section data of the learning material data with the important label (that is, the learning digest) is referred to as "important data", and the section data of the learning material data with the non-important label is "non-important". Called "data".
 図8は、学習装置6の機能ブロック構成の一例を示す。学習装置6は、機能的には、主に、ペア決定部61と、学習部62とを有する。ペア決定部61及び学習部62は、例えば、プロセッサ21により実現される。 FIG. 8 shows an example of the functional block configuration of the learning device 6. Functionally, the learning device 6 mainly has a pair determination unit 61 and a learning unit 62. The pair determination unit 61 and the learning unit 62 are realized by, for example, the processor 21.
 ペア決定部61は、学習データD4を参照し、学習用素材データの区間データから学習用の推論対象ペアPtagを決定し、これらのペアに対する正解ラベルを生成する。ペア決定部61の処理の具体例については後述する。 The pair determination unit 61 refers to the learning data D4, determines the inference target pair Ptag for learning from the interval data of the learning material data, and generates a correct label for these pairs. A specific example of the processing of the pair determination unit 61 will be described later.
 学習部62は、ペア決定部61が決定した学習用の推論対象ペアPtagと、正解ラベルとの組合せに基づき、関連度推論器の学習を行う。この場合、学習装置6は、学習用の推論対象ペアPtagを関連度推論器に入力した場合の関連度推論器の出力と、入力された学習用の推論対象ペアPtagに対応する正解ラベルとの誤差(損失)が最小となるように、関連度推論器のパラメータを決定する。損失を最小化するように上述のパラメータを決定するアルゴリズムは、勾配降下法や誤差逆伝播法などの機械学習において用いられる任意の学習アルゴリズムであってもよい。なお、学習部62は、学習データD4を参照して重要度推論器の学習を行い、重要度推論器情報D3を生成する処理をさらに行ってもよい。 The learning unit 62 learns the relevance inference device based on the combination of the inference target pair Ptag for learning determined by the pair determination unit 61 and the correct answer label. In this case, the learning device 6 has the output of the relevance inference device when the inference target pair Ptag for learning is input to the relevance inference device, and the correct answer label corresponding to the input inference target pair Ptag for learning. Determine the parameters of the relevance inferior so that the error (loss) is minimized. The algorithm for determining the above parameters to minimize the loss may be any learning algorithm used in machine learning such as gradient descent or backpropagation. The learning unit 62 may further perform a process of learning the importance inference device with reference to the learning data D4 and generating the importance inference device information D3.
 次に、ペア決定部61の処理の具体例(第1学習例及び第2学習例)について説明する。 Next, a specific example of the processing of the pair determination unit 61 (first learning example and second learning example) will be described.
 第1学習例では、ペア決定部61は、学習用素材データを構成する重要データ及び非重要データの全ての2つの組合せを学習用の推論対象ペアPtagとして決定する。そして、ペア決定部61は、重要データ同士のペアとなる学習用の推論対象ペアPtagに対し、正例であることを示す正解ラベルを付し、その他のペアとなる学習用の推論対象ペアPtagに対し、負例であることを示す正解ラベルを付す。なお、「その他のペア」とは、非重要データ同士のペア、及び、重要データと非重要データのペアを指す。その後、学習部62は、例えば、負例であることを示す正解ラベルの場合には正解となる関連度が最低値であるとみなし、正例であることを示す正解ラベルの場合には正解となる関連度が最大値であるとみなし、関連度推論器の学習を行う。 In the first learning example, the pair determination unit 61 determines all two combinations of the important data and the non-important data constituting the learning material data as the inference target pair Ptag for learning. Then, the pair determination unit 61 attaches a correct answer label indicating that it is a correct example to the inference target pair Ptag for learning that is a pair of important data, and the inference target pair Ptag for learning that is another pair. On the other hand, a correct answer label indicating that it is a negative example is attached. The "other pair" refers to a pair of non-important data and a pair of important data and non-important data. After that, the learning unit 62 considers that, for example, in the case of a correct answer label indicating that it is a negative example, the degree of relevance that becomes a correct answer is the lowest value, and in the case of a correct answer label indicating that it is a correct example, it is regarded as a correct answer. The degree of relevance is considered to be the maximum value, and the relevance inferior is learned.
 第1学習例によれば、学習装置6は、入力される区間データのペアが同時にダイジェストに含まれる蓋然性が高いほど高い関連度を出力するように関連度推論器を好適に学習することができる。 According to the first learning example, the learning device 6 can suitably learn the relevance inferior so that the higher the probability that the pair of input interval data is included in the digest at the same time, the higher the relevance is output. ..
 第2学習例では、ペア決定部61は、学習用素材データの任意の2つの区間データ(重要データ及び非重要データを含む)を学習用の推論対象ペアPtagとして決定する。そして、学習部62は、学習用の推論対象ペアPtagを構成する2つの区間データの再生時刻の差に応じた値を、対象の学習用の推論対象ペアPtagに対する正解ラベルとして決定する。「再生時刻の差に応じた値」は、例えば、2つの区間データの再生時刻が近いほど1(例えば関連度の最大値)に近い数字となり、遠いほど0(例えば関連度の最低値)に近い数字となるように、関連度の値域に応じて正規化された値となる。 In the second learning example, the pair determination unit 61 determines arbitrary two interval data (including important data and non-important data) of the learning material data as the inference target pair Ptag for learning. Then, the learning unit 62 determines a value corresponding to the difference in reproduction time of the two section data constituting the inference target pair Ptag for learning as the correct label for the inference target pair Ptag for learning. The "value according to the difference in reproduction time" is, for example, a number closer to 1 (for example, the maximum value of the degree of relevance) as the reproduction time of the two section data is closer, and 0 (for example, the lowest value of the degree of relevance) as the distance is farther. It is a value normalized according to the range of the degree of relevance so that the numbers are close to each other.
 第2学習例によれば、学習装置6は、ストーリーとして繋がりが深い区間データのペアほど高い関連度を出力するように関連度推論器を好適に学習することができる。 According to the second learning example, the learning device 6 can suitably learn the relevance degree inferior so that the pair of section data having a deep connection as a story outputs a higher relevance degree.
 なお、情報処理装置1は、第2学習例で学習された関連度推論器を、例えば図7に示した第2選定例に適用することで、第1ダイジェスト候補に時間的に近い非第1ダイジェスト候補を、第2ダイジェスト候補として好適に取り込むことができる。これにより、情報処理装置1は、重要シーンの周辺シーンを好適に第2ダイジェスト候補として選定し、ストーリーの理解がしやすいダイジェストの生成を好適に支援することができる。 The information processing apparatus 1 applies the relevance inferior learned in the second learning example to, for example, the second selection example shown in FIG. 7, so that the information processing apparatus 1 is non-first, which is close in time to the first digest candidate. The digest candidate can be suitably incorporated as the second digest candidate. As a result, the information processing apparatus 1 can suitably select the peripheral scenes of the important scenes as the second digest candidate, and can suitably support the generation of the digest in which the story is easy to understand.
 (6)処理フロー
 図9は、第1実施形態において情報処理装置1が実行する処理の手順を示すフローチャートの一例である。情報処理装置1は、図9に示すフローチャートの処理を、例えば、処理の開始を指示するユーザ入力を検知した場合等に実行する。
(6) Processing Flow FIG. 9 is an example of a flowchart showing a procedure of processing executed by the information processing apparatus 1 in the first embodiment. The information processing apparatus 1 executes the processing of the flowchart shown in FIG. 9, for example, when a user input instructing the start of the processing is detected.
 まず、情報処理装置1の第1ダイジェスト候補選定部14は、インターフェース13を介して素材データD1を記憶装置4から取得する(ステップS11)。なお、記憶装置4に複数のコンテンツに相当する素材データD1が記憶されている場合には、第1ダイジェスト候補選定部14は、ユーザ入力等により指定されたコンテンツに対応する素材データD1を取得する。 First, the first digest candidate selection unit 14 of the information processing device 1 acquires the material data D1 from the storage device 4 via the interface 13 (step S11). When the material data D1 corresponding to a plurality of contents is stored in the storage device 4, the first digest candidate selection unit 14 acquires the material data D1 corresponding to the contents specified by the user input or the like. ..
 次に、第1ダイジェスト候補選定部14は、素材データを構成する区間データから第1ダイジェスト候補を選定する(ステップS12)。この場合、第1ダイジェスト候補選定部14は、重要度推論器情報D3を参照して構成した重要度推論器に区間データを入力することで、各区間データの重要度を算出し、算出した重要度に基づいて、第1ダイジェスト候補となる区間データを選定する。 Next, the first digest candidate selection unit 14 selects the first digest candidate from the section data constituting the material data (step S12). In this case, the first digest candidate selection unit 14 calculates the importance of each section data by inputting the section data into the importance inferior configured with reference to the importance inferior information D3, and the calculated importance. The section data that is the first digest candidate is selected based on the degree.
 次に、ペア決定部15は、第1ダイジェスト候補を含む推論対象ペアPtagを生成する(ステップS13)。この場合、ペア決定部15は、上述した第1選定例に従い、第1ダイジェスト候補同士をペアとする推論対象ペアPtagを生成してもよく、第2選定例に従い、第1ダイジェスト候補と非第1ダイジェスト候補とをペアとする推論対象ペアPtagを生成してもよい。 Next, the pair determination unit 15 generates an inference target pair Ptag including the first digest candidate (step S13). In this case, the pair determination unit 15 may generate an inference target pair Ptag in which the first digest candidates are paired according to the first selection example described above, and the first digest candidate and the non-first digest candidate may be generated according to the second selection example. An inference target pair Ptag that pairs with one digest candidate may be generated.
 次に、関連度算出部16は、ステップS13で生成された推論対象ペアPtagの各々の関連度を算出する(ステップS14)。この場合、関連度算出部16は、関連度推論器情報D2を参照することで構成した関連度推論器に推論対象ペアPtagを順次入力することで、推論対象ペアPtagの各々の関連度を算出する。 Next, the relevance calculation unit 16 calculates the relevance of each of the inference target pair Ptag generated in step S13 (step S14). In this case, the relevance calculation unit 16 calculates the relevance of each of the inference target pair Ptag by sequentially inputting the inference target pair Ptag into the relevance inferior configured by referring to the relevance inferior information D2. do.
 次に、第2ダイジェスト候補選定部17は、第2ダイジェスト候補の選定を行う(ステップS15)。この場合、第2ダイジェスト候補選定部17は、例えば、上述した第1選定例に従い、第1ダイジェスト候補のうち関連度が高い推論対象ペアPtagとなる第1ダイジェスト候補を第2ダイジェスト候補として選定する。他の例では、第2ダイジェスト候補選定部17は、第2選定例に従い、第1ダイジェスト候補と関連度が高い非第1ダイジェスト候補を、第1ダイジェスト候補と共に第2ダイジェスト候補として選定する。 Next, the second digest candidate selection unit 17 selects the second digest candidate (step S15). In this case, the second digest candidate selection unit 17 selects, for example, the first digest candidate which is the inference target pair Ptag having a high degree of relevance among the first digest candidates as the second digest candidate according to the first selection example described above. .. In another example, the second digest candidate selection unit 17 selects a non-first digest candidate having a high degree of relevance to the first digest candidate as the second digest candidate together with the first digest candidate according to the second selection example.
 次に、出力制御部18は、第2ダイジェスト候補に関する情報の出力を行う(ステップS16)。この場合、出力制御部18は、上述したように、第2ダイジェスト候補に関する情報を記憶装置4などの外部装置に供給してもよく、出力装置3に出力させてもよい。 Next, the output control unit 18 outputs information regarding the second digest candidate (step S16). In this case, as described above, the output control unit 18 may supply the information regarding the second digest candidate to an external device such as the storage device 4, or may output the information to the output device 3.
 図10は、第1実施形態において学習装置6が実行する処理の手順を示すフローチャートの一例である。学習装置6は、図10に示すフローチャートの処理を、例えば、処理の開始を指示するユーザ入力を検知した場合等に実行する。 FIG. 10 is an example of a flowchart showing a procedure of processing executed by the learning device 6 in the first embodiment. The learning device 6 executes the processing of the flowchart shown in FIG. 10, for example, when a user input instructing the start of the processing is detected.
 まず、学習装置6のペア決定部61は、学習データD4から学習用素材データを取得する(ステップS21)。なお、学習データD4に複数のコンテンツに相当する学習用素材データが含まれている場合には、ペア決定部61は、ユーザ入力等により指定されたコンテンツに対応する学習用素材データを取得する。 First, the pair determination unit 61 of the learning device 6 acquires learning material data from the learning data D4 (step S21). When the learning data D4 includes learning material data corresponding to a plurality of contents, the pair determination unit 61 acquires the learning material data corresponding to the contents specified by the user input or the like.
 次に、ペア決定部61は、学習用の推論対象ペアPtagを生成する(ステップS22)。この場合、ペア決定部61は、例えば、上述した第1学習例又は第2学習例のいずれかに従い、学習用素材データから抽出した区間データのペアを、推論対象ペアPtagとして生成する。 Next, the pair determination unit 61 generates an inference target pair Ptag for learning (step S22). In this case, the pair determination unit 61 generates, for example, a pair of interval data extracted from the learning material data as an inference target pair Ptag according to either the first learning example or the second learning example described above.
 さらに、ペア決定部61は、ステップS22で生成された学習用の推論対象ペアPtagに対する正解ラベルを決定する(ステップS23)。この場合、ペア決定部61は、第1学習例に従い、重要データ同士のペアか否かに基づき正解ラベルを決定してもよく、第2学習例に従い、学習用の推論対象ペアPtagとなる区間データ同士の再生時刻の差に応じた値を、正解ラベルとして決定してもよい。 Further, the pair determination unit 61 determines the correct label for the inference target pair Ptag for learning generated in step S22 (step S23). In this case, the pair determination unit 61 may determine the correct answer label based on whether or not it is a pair of important data according to the first learning example, and according to the second learning example, the section that becomes the inference target pair Ptag for learning. A value corresponding to the difference in reproduction time between the data may be determined as the correct answer label.
 そして、学習部62は、学習用の推論対象ペアPtagと、正解ラベルとに基づき、関連度推論器の学習を行う(ステップS24)。そして、学習装置6は、学習により得られた関連度推論器のパラメータを、関連度推論器情報D2として生成する。なお、生成された関連度推論器情報D2は、記憶装置4と学習装置6とのデータ通信により直ちに記憶装置4に記憶されてもよく、着脱可能な記憶媒体を介して記憶装置4に記憶されてもよい。 Then, the learning unit 62 learns the relevance inference device based on the inference target pair Ptag for learning and the correct answer label (step S24). Then, the learning device 6 generates the parameters of the relevance inferior obtained by learning as the relevance inferior information D2. The generated relevance inferior information D2 may be immediately stored in the storage device 4 by data communication between the storage device 4 and the learning device 6, and is stored in the storage device 4 via a removable storage medium. You may.
 (7)変形例
 次に、上記実施形態に好適な各変形例について説明する。以下の変形例は任意に組み合わせて上述の実施形態に適用してもよい。
(7) Modifications Next, each modification suitable for the above embodiment will be described. The following modifications may be applied to the above-described embodiment in any combination.
 (変形例1)
 ペア決定部15は、同一の素材データから抽出した2つの区間データを推論対象ペアPtagとして組み合わせる代わりに、異なる素材データから夫々抽出した区間データを推論対象ペアPtagとして組み合わせてもよい。
(Modification 1)
The pair determination unit 15 may combine the interval data extracted from different material data as the inference target pair Ptag instead of combining the two interval data extracted from the same material data as the inference target pair Ptag.
 例えば、この場合、第1ダイジェスト候補選定部14は、素材データD1とは異なる第2素材データから第1ダイジェスト候補を選定する。この場合、素材データD1と第2素材データとは、例えば、同一の場所(例えばスポーツ会場)を同一の時間帯において異なるカメラにより撮影したデータであってもよい。なお、第2素材データには、重要区間と非重要区間とを識別するラベルが予め付されていてもよい。この場合、第1ダイジェスト候補選定部14は、重要区間のラベルが付された区間データを、第1ダイジェスト候補として選定する。 For example, in this case, the first digest candidate selection unit 14 selects the first digest candidate from the second material data different from the material data D1. In this case, the material data D1 and the second material data may be, for example, data taken at the same place (for example, a sports venue) by different cameras in the same time zone. The second material data may be labeled in advance to identify important sections and non-important sections. In this case, the first digest candidate selection unit 14 selects the section data labeled with the important section as the first digest candidate.
 そして、ペア決定部15は、第2素材データから抽出された第1ダイジェスト候補と、素材データD1から抽出した区間データとの組を推論対象ペアPtagとして決定する。この場合、第2ダイジェスト候補選定部17は、例えば、関連度が所定値以上となった推論対象ペアPtagを構成する素材データD1の区間データと、第2素材データから抽出された第1ダイジェスト候補とを、第2ダイジェスト候補として選定する。この態様によれば、情報処理装置1は、複数の素材データからダイジェスト候補を好適に選定することができる。 Then, the pair determination unit 15 determines a pair of the first digest candidate extracted from the second material data and the section data extracted from the material data D1 as the inference target pair Ptag. In this case, the second digest candidate selection unit 17 is, for example, the section data of the material data D1 constituting the inference target pair Ptag whose relevance degree is equal to or higher than a predetermined value, and the first digest candidate extracted from the second material data. Is selected as the second digest candidate. According to this aspect, the information processing apparatus 1 can suitably select a digest candidate from a plurality of material data.
 (変形例2)
 情報処理装置1は、ユーザ入力により指定された推論対象ペアPtagに対する関連度の算出及び出力を行ってもよい。
(Modification 2)
The information processing apparatus 1 may calculate and output the degree of relevance to the inference target pair Ptag specified by the user input.
 図11は、変形例2に係る情報処理装置1Aの機能ブロック図の一例である。情報処理装置1Aのプロセッサ11は、ペア決定部15Aと、関連度算出部16Aと、出力制御部18Aとを有する。以後では、上述した実施形態と同一構成要素については適宜同一の符号を付し、その説明を省略する。 FIG. 11 is an example of a functional block diagram of the information processing apparatus 1A according to the modified example 2. The processor 11 of the information processing device 1A has a pair determination unit 15A, a relevance calculation unit 16A, and an output control unit 18A. Hereinafter, the same components as those in the above-described embodiment will be appropriately designated by the same reference numerals, and the description thereof will be omitted.
 ペア決定部15Aは、インターフェース13を介して入力装置2から受信する入力信号S2に基づき、推論対象ペアPtagを決定する。例えば、ペア決定部15Aは、入力信号S2に基づき指定された素材データD1の2つの区間データを推論対象ペアPtagとして決定する。 The pair determination unit 15A determines the inference target pair Ptag based on the input signal S2 received from the input device 2 via the interface 13. For example, the pair determination unit 15A determines two section data of the material data D1 designated based on the input signal S2 as the inference target pair Ptag.
 この場合、例えば、情報処理装置1Aのユーザは、推論対象ペアPtagの一方の区間データとして、ダイジェスト候補となる区間データを指定し、他方の区間データとして、ダイジェスト候補として相応しいか否かの判定対象となる区間データを指定する。なお、ペア決定部15は、異なる素材データから夫々推論対象ペアPtagとする区間データを指定する入力を受け付け、これらの区間データを推論対象ペアPtagとして決定してもよい。また、ペア決定部15Aは、入力信号S2に基づき複数の推論対象ペアPtagを決定してもよい。 In this case, for example, the user of the information processing apparatus 1A designates the section data to be the digest candidate as one section data of the inference target pair Ptag, and determines whether or not it is suitable as the digest candidate as the other section data. Specify the section data to be. The pair determination unit 15 may accept inputs for designating section data to be the inference target pair Ptag from different material data, and determine these section data as the inference target pair Ptag. Further, the pair determination unit 15A may determine a plurality of inference target pairs Ptag based on the input signal S2.
 そして、関連度算出部16Aは、関連度推論器情報D2を参照することで構成した関連度推論器に基づき、ペア決定部15Aが決定した推論対象ペアPtagの関連度を算出し、算出した関連度に関する関連度情報Iaを出力制御部18Aへ供給する。そして、出力制御部18Aは、関連度情報Iaに基づく出力を行う。この場合、例えば、出力制御部18Aは、出力装置3に、推論対象ペアPtagの関連度を表示するための出力信号S1を供給する。 Then, the relevance calculation unit 16A calculates and calculates the relevance of the inference target pair Ptag determined by the pair determination unit 15A based on the relevance inferior configured by referring to the relevance inferior information D2. Relevance information Ia regarding the degree is supplied to the output control unit 18A. Then, the output control unit 18A outputs based on the relevance information Ia. In this case, for example, the output control unit 18A supplies the output device 3 with an output signal S1 for displaying the relevance of the inference target pair Ptag.
 なお、出力制御部18Aは、推論対象ペアPtagが複数個存在する場合には、最も関連度が高い上位所定個数分の推論対象ペアPtagの関連度に関する情報のみを出力してもよく、関連度が所定の閾値以上となる推論対象ペアPtagの関連度に関する情報のみを出力してもよい。上述の所定個数は、1以上の任意の数に設定されてもよい。 When the output control unit 18A has a plurality of inference target pair Ptags, the output control unit 18A may output only the information regarding the relevance of the inference target pair Ptags corresponding to the upper predetermined number having the highest relevance. May output only the information about the relevance of the inferred pair Ptag in which is equal to or greater than a predetermined threshold. The above-mentioned predetermined number may be set to any number of 1 or more.
 図12は、変形例2において情報処理装置1Aが実行するフローチャートの一例である。まず、情報処理装置1Aのペア決定部15Aは、推論対象ペアPtagを指定するユーザ入力を受け付ける(ステップS31)。この場合、ペア決定部15Aは、例えば、シークバーなどを含む素材データD1の再生画面を表示し、任意の再生時刻に対応する区間データを推論対象ペアPtagとして指定させてもよい。次に、関連度算出部16Aは、ステップS31で指定された推論対象ペアPtagの関連度を算出する(ステップS32)。そして、出力制御部18Aは、ステップS32で算出された関連度に基づく出力を行う(ステップS33)。 FIG. 12 is an example of a flowchart executed by the information processing apparatus 1A in the modification 2. First, the pair determination unit 15A of the information processing apparatus 1A accepts a user input for designating the inference target pair Ptag (step S31). In this case, the pair determination unit 15A may display the reproduction screen of the material data D1 including the seek bar or the like, and specify the section data corresponding to an arbitrary reproduction time as the inference target pair Ptag. Next, the relevance calculation unit 16A calculates the relevance of the inference target pair Ptag specified in step S31 (step S32). Then, the output control unit 18A outputs based on the degree of relevance calculated in step S32 (step S33).
 このように、変形例2に係る情報処理装置1Aは、ユーザ入力により指定された推論対象ペアPtagに対する関連度の算出及び出力を好適に行うことができる。 As described above, the information processing apparatus 1A according to the modification 2 can suitably calculate and output the degree of relevance to the inference target pair Ptag specified by the user input.
 (変形例3)
 素材データD1に映像データに加えて音データが含まれている場合、関連度算出部16は、音データを用いて関連度の算出を行ってもよい。
(Modification 3)
When the material data D1 includes sound data in addition to the video data, the relevance calculation unit 16 may calculate the relevance using the sound data.
 この場合、第1の例では、関連度算出部16は、推論対象ペアPtagを構成する2つの区間データの映像データ及び音データに基づき、関連度推論器情報D2を参照して関連度を算出する。この場合、映像データ及び音データを含む区間データのペアが入力された場合に関連度を出力されるように予め学習された関連度推論器のパラメータが関連度推論器情報D2として予め記憶装置4に記憶されている。なお、関連度推論器には、音データが直接入力される代わりに音データの特徴量が入力されてもよい。この場合、音データに対して所定の特徴量抽出処理などが行われた後、抽出された特徴量が関連度推論器に入力される。また、素材データD1の各区間データの重要度を算出する場合も同様に、第1ダイジェスト候補選定部14は、映像データに加えて音データを用いて各区間データの重要度を算出してもよい。 In this case, in the first example, the relevance calculation unit 16 calculates the relevance with reference to the relevance inferior information D2 based on the video data and the sound data of the two section data constituting the inference target pair Ptag. do. In this case, the parameter of the relevance inferior learned in advance so that the relevance is output when a pair of interval data including video data and sound data is input is stored in advance as the relevance inferior information D2. Is remembered in. The feature amount of the sound data may be input to the relevance inferior instead of directly inputting the sound data. In this case, after a predetermined feature amount extraction process or the like is performed on the sound data, the extracted feature amount is input to the relevance inferior. Similarly, when calculating the importance of each section data of the material data D1, the first digest candidate selection unit 14 may calculate the importance of each section data using sound data in addition to the video data. good.
 第2の例では、関連度算出部16は、推論対象ペアPtagを構成する2つの区間データに含まれる音データのみに基づき、関連度推論器情報D2を参照して関連度を算出してもよい。この場合、音データのペアが入力された場合に関連度を出力されるように予め学習された関連度推論器のパラメータが関連度推論器情報D2として予め記憶装置4に記憶されている。 In the second example, the relevance calculation unit 16 may calculate the relevance by referring to the relevance inferior information D2 based only on the sound data included in the two section data constituting the inference target pair Ptag. good. In this case, the parameters of the relevance inferior learned in advance so that the relevance is output when the pair of sound data is input are stored in the storage device 4 in advance as the relevance inferior information D2.
 このように、情報処理装置1は、映像データ又は音データの少なくとも一方を用い、推論対象ペアPtagの関連度を好適に算出することができる。 As described above, the information processing apparatus 1 can suitably calculate the relevance of the inference target pair Ptag using at least one of the video data and the sound data.
 (変形例4)
 図3の機能ブロックにおいて、情報処理装置1は、第1ダイジェスト候補選定部14又は第2ダイジェスト候補選定部17の少なくも一方を備えなくともよい。
(Modification example 4)
In the functional block of FIG. 3, the information processing apparatus 1 does not have to include at least one of the first digest candidate selection unit 14 and the second digest candidate selection unit 17.
 例えば、重要区間と非重要区間とを識別するラベルが予め素材データD1に付加されている場合には、ペア決定部15は、重要区間の区間データを第1ダイジェスト候補とみなし、推論対象ペアPtagを決定してもよい。他の例では、出力制御部18は、関連度算出部16が出力する関連度情報Iaに基づき所定の出力を行ってもよい。この場合、出力制御部18Aは、最も関連度が高い上位所定個数分の推論対象ペアPtagに関する情報を出力してもよく、関連度が所定の閾値以上となる推論対象ペアPtagに関する情報のみを出力してもよい。上述の所定個数は、1以上の任意の数に設定されてもよい。上述の「推論対象ペアPtagに関する情報」は、推論対象ペアPtagを構成する区間データそのものであってもよく、推論対象ペアPtagを構成する区間データの時間情報(素材データD1における再生時刻を示す情報)であってもよい。 For example, when a label that distinguishes an important section and a non-important section is added to the material data D1 in advance, the pair determination unit 15 considers the section data of the important section as the first digest candidate, and the pair Ptag to be inferred. May be determined. In another example, the output control unit 18 may perform a predetermined output based on the relevance information Ia output by the relevance calculation unit 16. In this case, the output control unit 18A may output information about the inference target pair Ptag having the highest relevance for the upper predetermined number, and output only the information about the inference target pair Ptag whose relevance is equal to or higher than the predetermined threshold. You may. The above-mentioned predetermined number may be set to any number of 1 or more. The above-mentioned "information about the inference target pair Ptag" may be the section data itself constituting the inference target pair Ptag, and the time information of the section data constituting the inference target pair Ptag (information indicating the reproduction time in the material data D1). ) May be.
 <第2実施形態>
 図13は、第2実施形態における情報処理装置1Xの機能ブロック図である。情報処理装置1Xは、主に、ペア決定手段15Xと、関連度算出手段16Xとを有する。
<Second Embodiment>
FIG. 13 is a functional block diagram of the information processing apparatus 1X according to the second embodiment. The information processing apparatus 1X mainly includes a pair determination unit 15X and a relevance calculation unit 16X.
 ペア決定手段15Xは、映像データ又は音データの少なくとも一方を含むデータのペアであって、少なくとも一方のデータがダイジェストの候補である第1ダイジェスト候補となるペアを決定する。ここで、「映像データ」は、1枚の画像から構成されてもよく、複数枚の画像を含んでもよい。上記の「データ」及び「ペア」は、夫々、第1実施形態(変形例を含む、以下同じ)における「区間データ」及び「推論対象ペアPtag」とすることができる。また、ペア決定手段15Xは、第1実施形態におけるペア決定部15又はペア決定部15Aとすることができる。 The pair determining means 15X determines a pair of data including at least one of video data or sound data, and at least one of the data is a candidate for the first digest. Here, the "video data" may be composed of one image or may include a plurality of images. The above-mentioned "data" and "pair" can be "interval data" and "inference target pair Ptag" in the first embodiment (including modification, the same applies hereinafter), respectively. Further, the pair determination means 15X can be the pair determination unit 15 or the pair determination unit 15A in the first embodiment.
 関連度算出手段16Xは、ペア決定手段15Xが決定したペアが同時にダイジェストに含まれる蓋然性の度合いを示す関連度を算出する。関連度算出手段16Xは、第1実施形態における関連度算出部16又は関連度算出部16Aとすることができる。 The relevance calculation means 16X calculates the relevance degree indicating the degree of probability that the pair determined by the pair determination means 15X is included in the digest at the same time. The relevance calculation means 16X can be the relevance calculation unit 16 or the relevance calculation unit 16A in the first embodiment.
 図14は、第2実施形態において情報処理装置1Xが実行するフローチャートの一例である。まず、ペア決定手段15Xは、映像データ又は音データの少なくとも一方を含むデータのペアであって、少なくとも一方のデータがダイジェストの候補である第1ダイジェスト候補となるペアを決定する(ステップS41)。関連度算出手段16Xは、ペア決定手段15Xが決定したペアが同時にダイジェストに含まれる蓋然性の度合いを示す関連度を算出する(ステップS42)。 FIG. 14 is an example of a flowchart executed by the information processing apparatus 1X in the second embodiment. First, the pair determining means 15X determines a pair of data including at least one of video data or sound data, and at least one of the data is a candidate for the first digest (step S41). The relevance calculation means 16X calculates the relevance degree indicating the degree of probability that the pair determined by the pair determination means 15X is simultaneously included in the digest (step S42).
 第2実施形態に係る情報処理装置1Xは、ペアとなるデータが同時にダイジェストに含まれるべきか否か判定するための指標となる関連度を好適に算出することができる。 The information processing apparatus 1X according to the second embodiment can suitably calculate the degree of relevance as an index for determining whether or not the paired data should be included in the digest at the same time.
 なお、上述した各実施形態において、プログラムは、様々なタイプの非一時的なコンピュータ可読媒体(non-transitory computer readable medium)を用いて格納され、コンピュータであるプロセッサ等に供給することができる。非一時的なコンピュータ可読媒体は、様々なタイプの実体のある記憶媒体(tangible storage medium)を含む。非一時的なコンピュータ可読媒体の例は、磁気記憶媒体(例えばフレキシブルディスク、磁気テープ、ハードディスクドライブ)、光磁気記憶媒体(例えば光磁気ディスク)、CD-ROM(Read Only Memory)、CD-R、CD-R/W、半導体メモリ(例えば、マスクROM、PROM(Programmable ROM)、EPROM(Erasable PROM)、フラッシュROM、RAM(Random Access Memory))を含む。また、プログラムは、様々なタイプの一時的なコンピュータ可読媒体(transitory computer readable medium)によってコンピュータに供給されてもよい。一時的なコンピュータ可読媒体の例は、電気信号、光信号、及び電磁波を含む。一時的なコンピュータ可読媒体は、電線及び光ファイバ等の有線通信路、又は無線通信路を介して、プログラムをコンピュータに供給できる。 In each of the above-described embodiments, the program is stored using various types of non-transitory computer readable medium and can be supplied to a processor or the like which is a computer. Non-temporary computer-readable media include various types of tangible storage media. Examples of non-temporary computer-readable media include magnetic storage media (eg, flexible disks, magnetic tapes, hard disk drives), magneto-optical storage media (eg, magneto-optical disks), CD-ROMs (ReadOnlyMemory), CD-Rs, Includes CD-R / W, semiconductor memory (eg, mask ROM, PROM (Programmable ROM), EPROM (Erasable PROM), flash ROM, RAM (RandomAccessMemory)). The program may also be supplied to the computer by various types of temporary computer readable medium. Examples of temporary computer-readable media include electrical, optical, and electromagnetic waves. The temporary computer-readable medium can supply the program to the computer via a wired communication path such as an electric wire and an optical fiber, or a wireless communication path.
 その他、上記の各実施形態の一部又は全部は、以下の付記のようにも記載され得るが以下には限られない。 Other than that, a part or all of each of the above embodiments may be described as in the following appendix, but is not limited to the following.
[付記1]
 映像データ又は音データの少なくとも一方を含むデータのペアであって、少なくとも一方のデータがダイジェストの候補である第1ダイジェスト候補となる前記ペアを決定するペア決定手段と、
 前記ペアが同時に前記ダイジェストに含まれる蓋然性の度合いを示す関連度を算出する関連度算出手段と、
を有する情報処理装置。
[Appendix 1]
A pair determining means for determining a pair of data including at least one of video data or sound data, wherein at least one of the data is a first digest candidate which is a digest candidate.
A relevance calculation means for calculating the relevance indicating the degree of probability that the pair is simultaneously included in the digest, and
Information processing device with.
[付記2]
 前記ペア決定手段は、前記ダイジェストの生成において素材となる素材データを複数の区間に分割した場合の前記素材データの区間を構成する区間データに基づき、前記ペアを決定する、付記1に記載の情報処理装置。
[Appendix 2]
The information according to Appendix 1, wherein the pair determining means determines the pair based on the section data constituting the section of the material data when the material data used as the material in the generation of the digest is divided into a plurality of sections. Processing equipment.
[付記3]
 前記関連度に基づき、第2ダイジェスト候補となる前記区間データを選定する第2ダイジェスト候補選定手段をさらに有する、付記2に記載の情報処理装置。
[Appendix 3]
The information processing apparatus according to Appendix 2, further comprising a second digest candidate selection means for selecting the section data to be the second digest candidate based on the degree of relevance.
[付記4]
 前記ペア決定手段は、前記第1ダイジェスト候補となる2つの前記区間データを前記ペアとして決定し、
 前記第2ダイジェスト候補選定手段は、前記関連度に基づき、前記第2ダイジェスト候補となる前記第1ダイジェスト候補を選定する、付記3に記載の情報処理装置。
[Appendix 4]
The pair determination means determines the two section data as the first digest candidate as the pair.
The information processing apparatus according to Appendix 3, wherein the second digest candidate selection means selects the first digest candidate to be the second digest candidate based on the degree of relevance.
[付記5]
前記ペア決定手段は、前記第1ダイジェスト候補と、前記第1ダイジェスト候補ではない前記区間データである非第1ダイジェスト候補とを前記ペアとして決定し、
 前記第2ダイジェスト候補選定手段は、前記関連度に基づき、前記第2ダイジェスト候補となる前記非第1ダイジェスト候補を選定する、付記3に記載の情報処理装置。
[Appendix 5]
The pair determination means determines the first digest candidate and the non-first digest candidate, which is the section data that is not the first digest candidate, as the pair.
The information processing apparatus according to Appendix 3, wherein the second digest candidate selection means selects the non-first digest candidate to be the second digest candidate based on the degree of relevance.
[付記6]
 前記区間データの各々に対して算出した重要度に基づき、前記区間データから前記第1ダイジェスト候補を選定する第1ダイジェスト候補選定手段をさらに有する、付記2~5のいずれか一項に記載の情報処理装置。
[Appendix 6]
The information according to any one of Supplementary Provisions 2 to 5, further comprising a first digest candidate selection means for selecting the first digest candidate from the section data based on the importance calculated for each of the section data. Processing equipment.
[付記7]
 前記ペア決定手段は、再生時刻の差が所定時間差以内となる2つの前記区間データを、前記ペアとして決定する、付記2~6のいずれか一項に記載の情報処理装置。
[Appendix 7]
The information processing apparatus according to any one of Supplementary note 2 to 6, wherein the pair determining means determines as the pair the two section data in which the difference in reproduction time is within a predetermined time difference.
[付記8]
 前記ペア決定手段は、所定の時間間隔毎に前記素材データから抽出された前記区間データから前記ペアを決定する、付記2~6のいずれか一項に記載の情報処理装置。
[Appendix 8]
The information processing apparatus according to any one of Supplementary note 2 to 6, wherein the pair determining means determines the pair from the section data extracted from the material data at predetermined time intervals.
[付記9]
 前記ペア決定手段は、前記区間データに対してクラスタリングを行い、所定のクラスに属する前記区間データから前記ペアを決定する、付記2~6のいずれか一項に記載の情報処理装置。
[Appendix 9]
The information processing apparatus according to any one of Supplementary note 2 to 6, wherein the pair determination means clusters the interval data and determines the pair from the interval data belonging to a predetermined class.
[付記10]
 前記関連度算出手段は、学習用素材データから作成された学習用ダイジェストの区間データ同士のペアを正例とし、当該正例となる前記ペア以外の前記学習用素材データの区間データ同士のペアを負例として学習した関連度推論器に基づき、前記関連度を算出する、付記1~9のいずれか一項に記載の情報処理装置。
[Appendix 10]
The relevance calculation means uses a pair of section data of the learning digest created from the learning material data as a positive example, and a pair of section data of the learning material data other than the positive example. The information processing apparatus according to any one of Supplementary note 1 to 9, which calculates the degree of relevance based on the relevance inferior learned as a negative example.
[付記11]
 前記関連度算出手段は、映像データ又は音データの少なくとも一方を含むデータのペアが入力された場合に、当該データのペアが同一の素材データから抽出されたと仮定した場合の当該データのペアの再生時刻の差分に相当する情報を出力するように学習された関連度推論器に基づき、前記関連度を算出する、付記1~9のいずれか一項に記載の情報処理装置。
[Appendix 11]
The relevance calculation means reproduces a pair of data when it is assumed that the pair of data is extracted from the same material data when a pair of data including at least one of video data or sound data is input. The information processing apparatus according to any one of Supplementary note 1 to 9, which calculates the degree of relevance based on a relevance inferior learned to output information corresponding to a time difference.
[付記12]
前記関連度に関する情報、又は、前記関連度に基づき選定された第2ダイジェスト候補に関する情報を出力する出力制御手段をさらに有する、付記1~11のいずれか一項に記載の情報処理装置。
[Appendix 12]
The information processing apparatus according to any one of Supplementary note 1 to 11, further comprising an output control means for outputting information on the degree of relevance or information on a second digest candidate selected based on the degree of relevance.
[付記13]
 コンピュータにより、
 映像データ又は音データの少なくとも一方を含むデータのペアであって、少なくとも一方のデータがダイジェストの候補である第1ダイジェスト候補となる前記ペアを決定し、
 前記ペアが同時に前記ダイジェストに含まれる蓋然性の度合いを示す関連度を算出する、
制御方法。
[Appendix 13]
By computer
A pair of data including at least one of video data or sound data, and the pair in which at least one data is a first digest candidate, which is a digest candidate, is determined.
Calculate the degree of association that indicates the degree of probability that the pair will be included in the digest at the same time.
Control method.
[付記14]
 映像データ又は音データの少なくとも一方を含むデータのペアであって、少なくとも一方のデータがダイジェストの候補である第1ダイジェスト候補となる前記ペアを決定するペア決定手段と、
 前記ペアが同時に前記ダイジェストに含まれる蓋然性の度合いを示す関連度を算出する関連度算出手段
としてコンピュータを機能させるプログラムが格納された記憶媒体。
[Appendix 14]
A pair determining means for determining a pair of data including at least one of video data or sound data, wherein at least one of the data is a first digest candidate which is a digest candidate.
A storage medium in which a program for operating a computer as a relevance calculation means for calculating a relevance indicating the degree of probability that the pair is simultaneously included in the digest is stored.
 以上、実施形態を参照して本願発明を説明したが、本願発明は上記実施形態に限定されるものではない。本願発明の構成や詳細には、本願発明のスコープ内で当業者が理解し得る様々な変更をすることができる。すなわち、本願発明は、請求の範囲を含む全開示、技術的思想にしたがって当業者であればなし得るであろう各種変形、修正を含むことは勿論である。また、引用した上記の特許文献等の各開示は、本書に引用をもって繰り込むものとする。 Although the invention of the present application has been described above with reference to the embodiment, the invention of the present application is not limited to the above embodiment. Various changes that can be understood by those skilled in the art can be made within the scope of the present invention in terms of the configuration and details of the present invention. That is, it goes without saying that the invention of the present application includes all disclosure including claims, various modifications and modifications that can be made by those skilled in the art in accordance with the technical idea. In addition, each disclosure of the above-mentioned patent documents cited shall be incorporated into this document by citation.
 1、1A、1B、1X 情報処理装置
 2 入力装置
 3 出力装置
 4 記憶装置
 6 学習装置
 8 端末装置
 100、100B ダイジェスト候補選定システム
1, 1A, 1B, 1X Information processing device 2 Input device 3 Output device 4 Storage device 6 Learning device 8 Terminal device 100, 100B Digest candidate selection system

Claims (14)

  1.  映像データ又は音データの少なくとも一方を含むデータのペアであって、少なくとも一方のデータがダイジェストの候補である第1ダイジェスト候補となる前記ペアを決定するペア決定手段と、
     前記ペアが同時に前記ダイジェストに含まれる蓋然性の度合いを示す関連度を算出する関連度算出手段と、
    を有する情報処理装置。
    A pair determining means for determining a pair of data including at least one of video data or sound data, wherein at least one of the data is a first digest candidate which is a digest candidate.
    A relevance calculation means for calculating the relevance indicating the degree of probability that the pair is simultaneously included in the digest, and
    Information processing device with.
  2.  前記ペア決定手段は、前記ダイジェストの生成において素材となる素材データを複数の区間に分割した場合の前記素材データの区間を構成する区間データに基づき、前記ペアを決定する、請求項1に記載の情報処理装置。 The pair-determining means according to claim 1, wherein the pair-determining means determines the pair based on the section data constituting the section of the material data when the material data as the material is divided into a plurality of sections in the generation of the digest. Information processing device.
  3.  前記関連度に基づき、第2ダイジェスト候補となる前記区間データを選定する第2ダイジェスト候補選定手段をさらに有する、請求項2に記載の情報処理装置。 The information processing apparatus according to claim 2, further comprising a second digest candidate selection means for selecting the section data to be the second digest candidate based on the degree of relevance.
  4.  前記ペア決定手段は、前記第1ダイジェスト候補となる2つの前記区間データを前記ペアとして決定し、
     前記第2ダイジェスト候補選定手段は、前記関連度に基づき、前記第2ダイジェスト候補となる前記第1ダイジェスト候補を選定する、請求項3に記載の情報処理装置。
    The pair determination means determines the two section data as the first digest candidate as the pair.
    The information processing apparatus according to claim 3, wherein the second digest candidate selection means selects the first digest candidate to be the second digest candidate based on the degree of relevance.
  5.  前記ペア決定手段は、前記第1ダイジェスト候補と、前記第1ダイジェスト候補ではない前記区間データである非第1ダイジェスト候補とを前記ペアとして決定し、
     前記第2ダイジェスト候補選定手段は、前記関連度に基づき、前記第2ダイジェスト候補となる前記非第1ダイジェスト候補を選定する、請求項3に記載の情報処理装置。
    The pair determination means determines the first digest candidate and the non-first digest candidate, which is the section data that is not the first digest candidate, as the pair.
    The information processing apparatus according to claim 3, wherein the second digest candidate selection means selects the non-first digest candidate to be the second digest candidate based on the degree of relevance.
  6.  前記区間データの各々に対して算出した重要度に基づき、前記区間データから前記第1ダイジェスト候補を選定する第1ダイジェスト候補選定手段をさらに有する、請求項2~5のいずれか一項に記載の情報処理装置。 The item according to any one of claims 2 to 5, further comprising a first digest candidate selection means for selecting the first digest candidate from the section data based on the importance calculated for each of the section data. Information processing device.
  7.  前記ペア決定手段は、再生時刻の差が所定時間差以内となる2つの前記区間データを、前記ペアとして決定する、請求項2~6のいずれか一項に記載の情報処理装置。 The information processing device according to any one of claims 2 to 6, wherein the pair determining means determines as the pair the two section data whose reproduction time difference is within a predetermined time difference.
  8.  前記ペア決定手段は、所定の時間間隔毎に前記素材データから抽出された前記区間データから前記ペアを決定する、請求項2~6のいずれか一項に記載の情報処理装置。 The information processing apparatus according to any one of claims 2 to 6, wherein the pair determining means determines the pair from the section data extracted from the material data at predetermined time intervals.
  9.  前記ペア決定手段は、前記区間データに対してクラスタリングを行い、所定のクラスに属する前記区間データから前記ペアを決定する、請求項2~6のいずれか一項に記載の情報処理装置。 The information processing apparatus according to any one of claims 2 to 6, wherein the pair determination means clusters the interval data and determines the pair from the interval data belonging to a predetermined class.
  10.  前記関連度算出手段は、学習用素材データから作成された学習用ダイジェストの区間データ同士のペアを正例とし、当該正例となる前記ペア以外の前記学習用素材データの区間データ同士のペアを負例として学習した関連度推論器に基づき、前記関連度を算出する、請求項1~9のいずれか一項に記載の情報処理装置。 The relevance calculation means uses a pair of section data of the learning digest created from the learning material data as a positive example, and a pair of section data of the learning material data other than the positive example. The information processing apparatus according to any one of claims 1 to 9, which calculates the degree of relevance based on the relevance inferior learned as a negative example.
  11.  前記関連度算出手段は、映像データ又は音データの少なくとも一方を含むデータのペアが入力された場合に、当該データのペアが同一の素材データから抽出されたと仮定した場合の当該データのペアの再生時刻の差分に相当する情報を出力するように学習された関連度推論器に基づき、前記関連度を算出する、請求項1~9のいずれか一項に記載の情報処理装置。 The relevance calculation means reproduces a pair of data when it is assumed that the pair of data is extracted from the same material data when a pair of data including at least one of video data or sound data is input. The information processing apparatus according to any one of claims 1 to 9, which calculates the degree of relevance based on a relevance inferior learned to output information corresponding to a time difference.
  12.  前記関連度に関する情報、又は、前記関連度に基づき選定された第2ダイジェスト候補に関する情報を出力する出力制御手段をさらに有する、請求項1~11のいずれか一項に記載の情報処理装置。 The information processing apparatus according to any one of claims 1 to 11, further comprising an output control means for outputting information on the degree of relevance or information on a second digest candidate selected based on the degree of relevance.
  13.  コンピュータにより、
     映像データ又は音データの少なくとも一方を含むデータのペアであって、少なくとも一方のデータがダイジェストの候補である第1ダイジェスト候補となる前記ペアを決定し、
     前記ペアが同時に前記ダイジェストに含まれる蓋然性の度合いを示す関連度を算出する、
    制御方法。
    By computer
    A pair of data including at least one of video data or sound data, and the pair in which at least one data is a first digest candidate, which is a digest candidate, is determined.
    Calculate the degree of association that indicates the degree of probability that the pair will be included in the digest at the same time.
    Control method.
  14.  映像データ又は音データの少なくとも一方を含むデータのペアであって、少なくとも一方のデータがダイジェストの候補である第1ダイジェスト候補となる前記ペアを決定するペア決定手段と、
     前記ペアが同時に前記ダイジェストに含まれる蓋然性の度合いを示す関連度を算出する関連度算出手段
    としてコンピュータを機能させるプログラムが格納された記憶媒体。
    A pair determining means for determining a pair of data including at least one of video data or sound data, wherein at least one of the data is a first digest candidate which is a digest candidate.
    A storage medium in which a program for operating a computer as a relevance calculation means for calculating a relevance indicating the degree of probability that the pair is simultaneously included in the digest is stored.
PCT/JP2020/020772 2020-05-26 2020-05-26 Information processing device, control method, and storage medium WO2021240652A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
US17/926,731 US20230205815A1 (en) 2020-05-26 2020-05-26 Information processing device, control method and storage medium
PCT/JP2020/020772 WO2021240652A1 (en) 2020-05-26 2020-05-26 Information processing device, control method, and storage medium
JP2022527325A JP7420243B2 (en) 2020-05-26 2020-05-26 Information processing device, control method and program

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2020/020772 WO2021240652A1 (en) 2020-05-26 2020-05-26 Information processing device, control method, and storage medium

Publications (1)

Publication Number Publication Date
WO2021240652A1 true WO2021240652A1 (en) 2021-12-02

Family

ID=78723031

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2020/020772 WO2021240652A1 (en) 2020-05-26 2020-05-26 Information processing device, control method, and storage medium

Country Status (3)

Country Link
US (1) US20230205815A1 (en)
JP (1) JP7420243B2 (en)
WO (1) WO2021240652A1 (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006287319A (en) * 2005-03-31 2006-10-19 Nippon Hoso Kyokai <Nhk> Program digest generation apparatus and program digest generation program
US20070245242A1 (en) * 2006-04-12 2007-10-18 Yagnik Jay N Method and apparatus for automatically summarizing video
JP2008283486A (en) * 2007-05-10 2008-11-20 Sony Corp Information processor, information processing method, and program
JP2011223287A (en) * 2010-04-09 2011-11-04 Sony Corp Information processor, information processing method, and program
JP2012060238A (en) * 2010-09-06 2012-03-22 Sony Corp Moving picture processing device, moving picture processing method and program
JP2013031008A (en) * 2011-07-28 2013-02-07 Fujitsu Ltd Information processor, digest generating method, and digest generating program
JP2015099958A (en) * 2013-11-18 2015-05-28 日本電信電話株式会社 Digest video generating apparatus, digest video generating method, and digest video generating program
JP2019003585A (en) * 2017-06-20 2019-01-10 日本放送協会 Summary video creation device and program of the same

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11158344B1 (en) * 2015-09-30 2021-10-26 Amazon Technologies, Inc. Video ingestion and clip creation
US9996769B2 (en) * 2016-06-08 2018-06-12 International Business Machines Corporation Detecting usage of copyrighted video content using object recognition
US20200184043A1 (en) * 2018-12-11 2020-06-11 Avago Technologies International Sales Pte. Limited Multimedia content recognition with local and cloud-assisted machine learning

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006287319A (en) * 2005-03-31 2006-10-19 Nippon Hoso Kyokai <Nhk> Program digest generation apparatus and program digest generation program
US20070245242A1 (en) * 2006-04-12 2007-10-18 Yagnik Jay N Method and apparatus for automatically summarizing video
JP2008283486A (en) * 2007-05-10 2008-11-20 Sony Corp Information processor, information processing method, and program
JP2011223287A (en) * 2010-04-09 2011-11-04 Sony Corp Information processor, information processing method, and program
JP2012060238A (en) * 2010-09-06 2012-03-22 Sony Corp Moving picture processing device, moving picture processing method and program
JP2013031008A (en) * 2011-07-28 2013-02-07 Fujitsu Ltd Information processor, digest generating method, and digest generating program
JP2015099958A (en) * 2013-11-18 2015-05-28 日本電信電話株式会社 Digest video generating apparatus, digest video generating method, and digest video generating program
JP2019003585A (en) * 2017-06-20 2019-01-10 日本放送協会 Summary video creation device and program of the same

Also Published As

Publication number Publication date
JP7420243B2 (en) 2024-01-23
US20230205815A1 (en) 2023-06-29
JPWO2021240652A1 (en) 2021-12-02

Similar Documents

Publication Publication Date Title
US11025879B2 (en) Image processing apparatus, image processing method, and storage medium storing program
CN109472296A (en) A kind of model training method and device promoting decision tree based on gradient
US20210117651A1 (en) Facial image identification system, identifier generation device, identification device, image identification system, and identification system
US10560632B2 (en) Moving image reproducing apparatus, control method therefor, and storage medium storing control program therefor
US11676382B2 (en) Systems and methods for generating composite media using distributed networks
CN111539903B (en) Method and device for training face image synthesis model
CN110472558B (en) Image processing method and device
KR20230063772A (en) Metaverse personalized content creation and authentication method and apparutus and system therefor
WO2021240652A1 (en) Information processing device, control method, and storage medium
CN111933137B (en) Voice wake-up test method and device, computer readable medium and electronic equipment
KR102459775B1 (en) Automatic editing method, apparatus and system for artificial intelligence-based vidieo content production
US20220262099A1 (en) Learning data generation device, learning device, identification device, generation method and storage medium
WO2021240653A1 (en) Information processing device, control method, and storage medium
KR102452100B1 (en) Method, device and system for providing learning service base on brain wave and blinking eyes
WO2021240651A1 (en) Information processing device, control method, and storage medium
US11381887B2 (en) Systems and methods for managing interruption of content presentation
CN105765567A (en) Generation of a communication request based on visual selection
JP7097261B2 (en) Learning data analysis method and computer system
WO2021240732A1 (en) Information processing device, control method, and storage medium
KR20220089367A (en) Conference recoring system
US20230063393A1 (en) Remote-meeting system, remote-meeting method, and remote-meeting program
CN112597912A (en) Conference content recording method, device, equipment and storage medium
EP3764288A1 (en) Picture set description generation method and apparatus, and computer device and storage medium
WO2021240654A1 (en) Information processing device, control method, and storage medium
CN114424218A (en) Machine learning device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20938478

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2022527325

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20938478

Country of ref document: EP

Kind code of ref document: A1