WO2019008635A1 - Video monitoring device adjusting method and video monitoring device - Google Patents

Video monitoring device adjusting method and video monitoring device Download PDF

Info

Publication number
WO2019008635A1
WO2019008635A1 PCT/JP2017/024350 JP2017024350W WO2019008635A1 WO 2019008635 A1 WO2019008635 A1 WO 2019008635A1 JP 2017024350 W JP2017024350 W JP 2017024350W WO 2019008635 A1 WO2019008635 A1 WO 2019008635A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
threshold
error
alarm
monitoring
Prior art date
Application number
PCT/JP2017/024350
Other languages
French (fr)
Japanese (ja)
Inventor
浜田高宏
Original Assignee
株式会社K-Will
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 株式会社K-Will filed Critical 株式会社K-Will
Priority to CN201780089397.8A priority Critical patent/CN110870305B/en
Priority to JP2019528202A priority patent/JP7033797B2/en
Priority to PCT/JP2017/024350 priority patent/WO2019008635A1/en
Publication of WO2019008635A1 publication Critical patent/WO2019008635A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N17/00Diagnosis, testing or measuring for television systems or their details

Definitions

  • the present invention relates to an adjustment method and an image monitoring apparatus of an image monitoring apparatus capable of mechanically detecting an error of an image and an audio included in a digital image signal.
  • video distribution services such as Internet content distribution services and Internet Protocol TeleVision (IPTV) business are expected to expand in the future, and will be distributed accordingly. Since the number of such content also increases dramatically, it is considered to be practically difficult for the observer to monitor all the content. Therefore, it can be said that improving the accuracy of error detection in machine monitoring to human level is a pressing issue in all video distribution services.
  • IPTV Internet Protocol TeleVision
  • Patent Document 1 discloses an audio inspection method for detecting an audio error caused by a video error caused by noise caused by various causes in a digital video signal and a noise generated by a different cause in a digital audio signal. It is done.
  • the error detection is performed by comparing the characteristic value (parameter) of the audio signal with the threshold, but there is a problem as to how to set the threshold. That is, if the threshold is set to be strict, error detection is frequently performed, but if many of the detected errors are negligible for the viewer, detection may be wasted. On the other hand, if the threshold value is set loosely, although the frequency of error detection decreases, there is a risk that the viewer may miss an error that can not be ignored. This is one of the reasons why machine surveillance is inferior to human surveillance. Since changing one parameter affects other parameters, especially when there are multiple parameters that are related to each other, two or more parameters must be changed at the same time, and appropriate threshold settings should be made. Is very difficult.
  • One of the objects of the present invention is to provide a method of adjusting an image monitoring apparatus for appropriately detecting an image and / or an error generated depending on content in a digital image signal.
  • Another object of the present invention is to provide an image monitoring apparatus for detecting an image and / or an error properly through learning in a digital image signal.
  • a method of adjusting an image monitoring apparatus wherein an image signal corresponding to an image to be monitored and / or audio is input to detect that an error occurs in a monitoring item.
  • the video monitoring apparatus is configured to determine that the error has occurred when all of a plurality of parameters that change according to the video signal exceed a threshold determined according to the monitoring item.
  • At least two of the plurality of parameters are mutually related
  • the threshold values of the plurality of parameters are respectively determined to create a plurality of threshold value groups, which are stored in a memory, It is characterized in that any set of the threshold group stored in the memory is selected according to contents of video and / or audio to be monitored.
  • the inventor of the present invention has determined that the appropriate value of the threshold group is the content of the video and / or audio content to be monitored ("movie”, “drama”, “variety”, “sports”, “documentary” , “Entertainment”, “animation”, etc.) have been found to be unevenly distributed.
  • the present invention has been derived based on such findings. That is, if the threshold values of the plurality of parameters are respectively determined and a plurality of sets of threshold values are created and stored in the memory, they are stored in the memory according to the content of video and / or audio to be monitored. By selecting any set of the threshold group, a threshold group more appropriate for monitoring can be set as compared to the case where a single threshold group is used, so erroneous detection of an error in the image monitoring apparatus is effectively performed. It can be prevented.
  • the video monitoring device performs learning in parallel with the video and / or audio monitoring operation, updates at least one threshold of the threshold group based on the learning result, and stores the updated threshold group in a memory It is preferable to store in
  • the video surveillance device issues an alarm when an error is detected.
  • a video signal corresponding to the video and / or audio to be monitored is input, and the alarm is issued at predetermined time intervals, the first event of the alarm for the error to be issued Count the number of events and the number of second events for which the alarm has been issued for errors that should not be issued, Furthermore, a video signal for inspection whose content and place of occurrence of the error are known is input to the video monitoring apparatus, and the number of third events for which the alarm has not been issued for an error that should be issued.
  • Count An evaluation value is determined by weighting and calculating the number of obtained first events, the number of second events, and the number of third events, and the threshold group is determined based on the determined evaluation values. It is preferable to update the
  • the learning it is preferable to count the number of the first events, the number of the second events, and the number of the third events while individually changing the thresholds of the threshold group.
  • a video monitoring apparatus is a video monitoring apparatus that receives a video signal corresponding to video and / or audio to be monitored and detects that an error has occurred in a monitoring item.
  • a threshold storage unit that stores a plurality of parameters that change according to the video signal, wherein at least two of the plurality of parameters are associated with each other;
  • a determination unit that determines that the error has occurred and issues an alarm when all of the plurality of parameters exceed the threshold determined according to the monitoring item;
  • a learning unit that performs learning in parallel with the video and / or audio monitoring operation;
  • An updating unit that updates the threshold based on a learning result in the learning unit;
  • a video storage unit for storing video signals before and after the occurrence of the error when the determination unit detects the occurrence of the error; The learning unit may learn by analyzing the notification result of the error.
  • the result of the error notification is analyzed and classified into, for example, "correct alarm”, “unwanted alarm”, “pass through”, etc., whereby the learning unit performs learning to make the threshold more appropriate.
  • the error detection accuracy is improved because the value can be updated to
  • the learning unit inputs a video signal corresponding to the video and / or audio to be monitored, the alarm is issued for an error that should be issued at a constant time interval. Counts the number of first events and the number of second events for which an alarm should have been issued for errors that should not have triggered an alarm When the signal is input, the number of third events for which the alarm has not been issued for the error that should be issued is counted, and the number of the first event obtained and the number of the second events It is preferable to calculate an evaluation value by calculating by weighting the number of cases and the number of cases of the third event, and ranking the threshold values stored in the threshold storage unit based on the obtained evaluation values.
  • the learning unit count the number of first events, the number of second events, and the number of third events while individually changing the threshold value of the monitoring item.
  • the learning unit may be configured to cause the alarm to be issued for an error that should be issued for the m-th video signal of the M video signals during the time t1 to t2.
  • T (m) be the number of events
  • F (m) be the number of second events for which an alarm should have been issued for errors that should not have triggered an alarm.
  • H (m) the number of third events for which the alarm has not been issued.
  • Wt (m) is the weight of the first event
  • Wf (m) is the weight of the second event
  • Wh (m) is the weight of the third event
  • the video storage unit cuts out and stores the input video signal with a predetermined length at a predetermined timing.
  • a display unit that displays the learning result of the learning unit and reproduces video and / or audio based on the video signal before and after the occurrence of the error.
  • the updating unit prioritizes and lists new threshold candidates that replace the current threshold.
  • voice includes sounds in addition to human and animal voices.
  • Video signal includes the case of either video signal or audio signal.
  • Content is the content of the video. Specifically, there are “movie”, “drama”, “variety”, “sports”, “documentary”, “entertainment”, “animation” etc. as the major classification of content. On the other hand, sub-classification of contents, if taking “sports” as an example, includes “baseball”, “soccer”, “volleyball”, “judo”, “sumo”, “land athletics” and the like.
  • “Learning” associates the monitoring result of watching the video and / or audio flowing in real time with the monitoring result of the same video and / or audio by the video monitoring device, and uses it as a threshold Including feedback.
  • “Interrelated” means that adjusting one parameter affects the other parameter.
  • the “above the threshold” includes both cases where the threshold is exceeded and below the threshold.
  • the present invention it is possible to provide a method of adjusting an image monitoring apparatus for appropriately detecting an image and / or an error generated according to content in a digital image signal. Also, according to the present invention, it is possible to provide an image monitoring apparatus which detects an image and / or an error properly through learning in a digital image signal.
  • FIG. 2 is a block diagram showing an image monitoring apparatus 100.
  • FIG. 6 is a diagram showing an example of inspection items in the video / audio monitoring unit 103 displayed on the display unit 107.
  • FIG. 6 is a diagram showing an example of alarm information displayed on a display unit 107. It is a figure which shows the example of the parameter for every monitoring item displayed on the display part 107.
  • FIG. It is a figure which shows the example which put together the correct item alarm which generate
  • FIG. 1 is a block diagram showing an image monitoring apparatus 100.
  • a video monitoring apparatus 100 inputs a main input unit 101 for inputting a video signal (hereinafter referred to as a video signal to be inspected) corresponding to video and / or audio to be monitored, and a video signal for inspection.
  • a video signal to be inspected a video signal corresponding to video and / or audio to be monitored
  • Sub-input unit 102 video / audio monitoring unit (determination unit) 103 incorporating memory for storing threshold, video / audio clip storage unit (video storage unit) 104, alarm output unit 105, internal memory (threshold A parameter optimization learning unit (learning unit and updating unit) 106 including a storage unit 106a, a display unit 107 such as a display on which the information displayed by the supervisor MN can be viewed, and the supervisor MN can input information And a supervisor input unit 108 such as a keyboard.
  • FIG. 2 is a diagram showing an example of inspection items in the video / audio monitoring unit 103 displayed on the display unit 107.
  • the video signals to be monitored are video signals of all formats such as an SDI signal, a file, an IP format, and HDMI (registered trademark).
  • the supervisor MN changes the inside of the box corresponding to each monitoring item via the supervisor input unit 108 while looking at the monitoring item displayed on the display unit 107, and performs “inspection” or “off” for each monitoring item. You can choose one of these.
  • the inspection item for which the supervisor has selected “inspection” is to be monitored by the video monitoring apparatus 100, but the inspection item for which “off” is selected is not monitored.
  • monitoring items related to the image include “freeze”, “black out”, “block freeze”, “block black out”, “block noise”, “red blink”, “brightness blink”, “scene change”, “image reverse” There are “line noise”, “cut point abnormality”, and “time code discontinuous”.
  • monitoring items relating to audio include “mute”, “cut-off mute”, “audio pop noise”, “sound skipping”, “voice noise”, “loudness” and “true peak”. The inspection items are not limited to these.
  • a plurality of thresholds (a threshold group, the details will be described later) are set for each monitoring item.
  • a threshold group the details will be described later.
  • the parameter is transmitted to the voice monitoring unit 103, where parameters corresponding to various sensitivity adjustments and durations that can be applied to the monitoring algorithm are calculated.
  • the video / voice monitoring unit 103 compares a plurality of obtained parameters with a threshold value corresponding to the monitoring item, and when all the parameters exceed the threshold value, an error corresponding to the monitoring item is detected. It judges and outputs the alarm signal matched with the said monitoring item to the alarm output part 105.
  • the alarm output unit 105 inputs alarm information including the error occurrence time, the content of the detected error, and the severity of the error to the display unit 107 according to the input alarm signal, the display unit 107 receives the alarm information. It can be displayed and viewed by the observer MN. Also, the video / audio monitoring unit 103 cuts out video signals before and after the parameter exceeds the threshold and stores the video signals in the video / audio clip storage unit 104.
  • FIG. 3 is a diagram showing an example of alarm information displayed on the display unit 107.
  • the lower part of the screen shows the detected error and content in chronological order
  • the upper part of the screen shows "category” indicating that the error is video or audio, and the level of error is normal or severe.
  • the "class” indicating, the "inspection item” indicating the type of error, and the "number of occurrences of error” are displayed together.
  • the supervisor MN selects any error via the supervisor input unit 108
  • the video / audio clip storage unit 104 reads out the corresponding video signal before and after the error, and the display unit 107 It is input. Since the video and / or the sound before and after the error are output from the display unit 107 by this, the supervisor MN can actually visually recognize or listen to the content of the error.
  • the correct answer (correct answer If an alarm is issued for an error that should not issue an alarm (an unnecessary alarm) or if an alarm is not issued for an error that should issue an alarm Can be recognized respectively.
  • it is ideal that all alarms issued when the image monitoring apparatus 100 detects an error are all correct alarms, but in reality, unnecessary alarms and passing through occur. This results from the fact that the error detection standard by machine monitoring of the video / voice monitoring unit 103 does not exactly match the error detection standard by monitoring of the supervisor MN. Therefore, unless the error detection by the machine monitoring approaches the error detection by the human monitoring, the monitoring by the image monitoring apparatus 100 alone becomes difficult.
  • the threshold value of the parameter set for each monitoring item is changed. This is called threshold tuning.
  • the number of parameters is I (m)
  • the number of threshold values to be tuned is 1 ⁇ I (1) + 2 ⁇ I (2) +. It can be seen that I (m) increases as the number of monitoring items increases.
  • FIG. 4 is a diagram showing an example of parameters for each monitoring item displayed on the display unit 107.
  • the supervisor MN can now input an arbitrary numerical value for each parameter in the box corresponding to the parameter via the supervisor input unit 108 while looking at the parameter displayed on the display unit 107. There is.
  • sensitivity threshold (activity) For example, in response to the inspection item "freeze”, 4 parameters "sensitivity threshold (activity)”, “sensitivity threshold (noise)”, “time threshold (start)”, and “time threshold (end)” There is one.
  • Graph scale represents the scale of the graph for display, and is not a parameter here. That is, in order to detect a video freeze as an error, thresholds (threshold groups) of four parameters must be set appropriately.
  • sensitivity threshold (activity)” and “sensitivity threshold (noise)” are upper and lower limit values of the variance for each small block included in one video frame as a parameter, and they are mutually related. It can be said that they fit. Such parameters are disclosed in detail in WO 2015-059782.
  • time threshold (start) and “time threshold (end)” indicate the length of a period during which it is determined that the video is phrased, and they are mutually related. Therefore, when one of the thresholds is adjusted, it is impossible to appropriately detect the freeze unless the other threshold is also changed.
  • an appropriate threshold value is not input for all inspection items, unnecessary alarms or omissions at the time of error detection are caused.
  • a default threshold initial value
  • Such a default threshold can be stored and used, for example, in the built-in memory of the video / audio monitoring unit 103.
  • the unnecessary alarm and the passing through are reduced for one content, but the unnecessary alarm and the passing occurs for another content. I understand.
  • FIG. 5 is a diagram showing an example in which the correct answer alarm, the unnecessary alarm, and the slip-through that occur when the error detection is performed using the same threshold are summarized for each monitoring item and content. From the example of FIG. 5, it can be seen that the occurrence frequency of the correct alarm, the unnecessary alarm, and the slip-in differs for each content. Based on the examination result, the inventor has found that it is sufficient to determine the default threshold according to the content of the content. Such default thresholds can be determined from simulations and accumulated experience.
  • the default threshold is determined according to the content of the content, there is a possibility of reducing unnecessary alarms and passing through to some extent, but it is not always optimal. Even if the contents of the content are the same, the frequency of unnecessary alarms and passing through may change depending on the reception state and the like. Therefore, in parallel with the monitoring operation, by causing the video monitoring apparatus 100 to learn about the currently input video signal, it is determined whether the default threshold is appropriate or not, and it is preferable to further update it if it is not appropriate. It can be said. This makes it possible to increase the number of correct alarms and reduce the frequency of unnecessary alarms and passing through.
  • the updated threshold may be newly set as the default threshold of the content.
  • (Learning mode) The learning function of the video surveillance device 100 will be described below. In the following learning example, it is possible to determine which is better for the case where there are threshold candidates to be changed with respect to the default threshold.
  • the supervisor MN sets a predetermined learning period from the supervisor input unit 108. Then, the video surveillance device 100 can perform learning during this learning period.
  • the monitoring operation is performed using the default threshold value for one monitoring item in M video signals of the same type of content.
  • an error detected by the image monitoring apparatus 100 is displayed on the display unit 107 as shown in FIG. 3 during a learning period (time t1 to t2, for example, a time unit such as hours, days, weeks, or months). . Since the errors displayed as shown in FIG. 3 may include inappropriate ones, it is necessary to check whether the errors are appropriate for learning.
  • the supervisor MN reads the video signals before and after the error stored in the video / audio clip storage unit 104 by designating one of the errors displayed in FIG. Video and / or audio can be viewed on the display unit 107. As a result, the supervisor MN can count the number of correct alarms generated during the time t1 to t2 and the number of unnecessary alarms. Since the video signal before and after each error stored in the video / audio clip storage unit 104 has a length of about 3 seconds, it does not take a long time to view, and this check is completed in a short time. There is little burden.
  • the video signal for inspection whose content and occurrence point (time) of the error prepared beforehand are known is input to the video / audio monitoring unit 103 via the sub input unit 102, and the same threshold as described above
  • the video / audio monitoring unit 103 detects an error using
  • the supervisor MN finds the number of correct alarms and the number of slips by collating the contents of the known error with the occurrence part. be able to.
  • the number of passing is extracted and used. Since the video signal for inspection has a length corresponding to a viewing time of several tens of minutes for each content, it does not take time for inspection.
  • the supervisor MN learns from the supervisor input unit 108 the parameter optimization learning from the number of correct alarms, the number of unnecessary alarms, and the number of slips calculated for each of the M video signals. Input to the part 106.
  • T (m) be the number of correct alarms (first event) in the m-th video signal
  • F (m) be the number of unnecessary alarms (second event)
  • the number of slips (third event) be Assuming that H (m), the parameter optimization learning unit 106 obtains an evaluation value A (m) according to the following evaluation function.
  • Wt (m) is the correct alarm weighting
  • Wf (m) is the unnecessary alarm weighting
  • Wh (m) is the passing weight
  • Wt (m)> 0> Wf (m)> Wh (m) 0.
  • Wh (m) -5.
  • the weighting may be fixed for the same type of content. According to this evaluation function, the higher the obtained evaluation value A (m), the more accurate the error detection, and the lower the value, the more inaccurate the error detection.
  • the supervisor MN replaces the default threshold value with the threshold value desired to be changed via the supervisor input unit 108, and executes the same monitoring operation as above using the video signal of the same content.
  • the evaluation value is determined. If the evaluation value of the monitoring operation using the default threshold is equal to or more than the evaluation value of the monitoring operation using the threshold desired to be changed, the parameter optimization learning unit 106 continues using the default threshold in the content. As a thing, do not change the stored default threshold. On the other hand, if the evaluation value of the monitoring operation using the default threshold is lower than the evaluation value of the monitoring operation using the threshold for which change is desired, the parameter optimization learning unit 106 determines the default threshold for the content. It is determined that the change is necessary, and the change is replaced with a desired threshold, that is, learning is performed. Thereby, the accuracy of error detection can be further enhanced.
  • FIG. 6 is a diagram showing a list of examples of a plurality of threshold values when such an update is performed a plurality of times in an inspection item regarding a certain content: freeze, ((a) before update, (b) after update) .
  • a plurality of sets of threshold value groups shown in FIG. 6 are listed up and stored in the built-in memory 106a, displayed on the display unit 107 as needed, and can be confirmed by the supervisor MN.
  • the default threshold group is currently listed at the top with priority 1 but its evaluation value A (m) is “85”.
  • the evaluation value A (m) of the threshold group newly evaluated as the first candidate is "92”
  • the evaluation value A (m) of the threshold group newly evaluated as the second candidate is "81" Suppose that there was.
  • the parameter optimization learning unit 106 since the evaluation value A (m)-"92" of the first candidate threshold group is higher than the evaluation value A (m) "85” of the default threshold group so far, the parameter optimization learning unit 106 However, as shown in FIG. 6B, by setting the priority to 1 by updating, it is replaced as a new default threshold group and transmitted to the video / audio monitoring unit 103 for monitoring. By this, the default threshold group so far will be carried back to priority 2.
  • the parameter optimization learning unit 106 since the evaluation value A (m) “81” of the second candidate threshold group is lower than the evaluation value A (m) “85” of the replaced default threshold group, the parameter optimization learning unit 106 However, by setting the priority to 3 by updating, it will be listed in lower order. The list is stored and used in the memory 106 a of the parameter optimization learning unit 106. Thereby, the threshold value group of high evaluation value A (m) can always be made into the default threshold value group regarding the said content. However, if the content is different, even if the threshold value group uses the same value, the priority may be different. Note that such updating of the threshold value group may be performed automatically by the parameter optimization learning unit 106, or may be performed after waiting for the supervisor MN's permission. Alternatively, regardless of the value of the evaluation value A (m), the threshold designated by the supervisor MN via the supervisor input unit 108 can be set as the default threshold.
  • the first candidate, the second candidate,... are determined, but each time the supervisor MN inputs an arbitrary value from the supervisor input unit 108 to the parameter optimization learning unit 106, the correct answer is obtained each time
  • the evaluation value may be calculated after obtaining the number of alarms, the number of unnecessary alarms, and the number of by-passes.
  • the parameter optimization learning unit 106 individually changes the threshold by + 5% or -5% with respect to the default threshold group, and in each case, the number of correct alarms, the number of unnecessary alarms, and the bypass
  • the evaluation value may be calculated after finding the number of. Evaluation values can be similarly obtained for other monitoring items.
  • the video signal for inspection may be input between the input of the monitoring target video signal, or may be performed in parallel with the input of the monitoring target video signal to perform inspection in the background.
  • the same threshold can then be used to perform error detection.
  • the video / audio storage clip unit 104 cuts out the monitoring target video signal input from the main input unit 101 over a predetermined length at a predetermined timing, and associates it with the monitoring result performed by the supervisor MN.
  • it may be stored as a video signal for inspection in a memory (not shown).
  • the stored test video signal is input from the sub input unit 102 at a necessary timing, and is used for threshold updating as described above.
  • the present invention it is possible to provide a method of adjusting a video monitoring apparatus for appropriately detecting a video and / or an error generated according to content in a digital video signal, and in the digital video signal, the video and / or the error are properly learned It is possible to provide an image monitoring device that detects
  • video monitoring apparatus 101 main input unit 102 secondary input unit 103 video / audio monitoring unit 104 video / audio clip storage unit 105 alarm output unit 106 learning unit for parameter optimization built-in memory 107 a display unit 108 monitor unit 108 monitor input unit MN monitor

Abstract

Provided is a method for adjusting a video monitoring device for appropriately detecting video and/or errors occurring in accordance with the content of digital video signals. Also provided is a video monitoring device for appropriately detecting video and/or errors through learning from digital video signals. The video monitoring device comprises a video/audio monitoring unit that incorporates a memory in which threshold values are stored, and a parameter optimization learning unit. The video monitoring device is configured to determine that an error has occurred when all of a plurality of parameters varying in accordance with video signals exceed respective threshold values specified in accordance with monitoring items. At least two of the plurality of parameters are associated with each other, and the video monitoring device selects a set of threshold value groups stored in the memory in accordance with the content of the video and/or audio to be monitored. Through an analysis of error warning results, the parameter optimization learning unit performs learning and updates the threshold values on the basis of the learning result.

Description

映像監視装置の調整方法及び映像監視装置Method of adjusting image monitoring apparatus and image monitoring apparatus
 本発明は、デジタル映像信号に含まれた映像や音声のエラーを機械的に検出できる映像監視装置の調整方法及び映像監視装置に関する。 The present invention relates to an adjustment method and an image monitoring apparatus of an image monitoring apparatus capable of mechanically detecting an error of an image and an audio included in a digital image signal.
 旧来のアナログテレビ放送時代には、その映像や音声の品質は、監視者が実際に映像や音声を視聴(監視)してエラーの検出を実行していた。これに対し、デジタルテレビ放送に移行した現代においては、映像信号の処理が容易になり機械的に映像や音声を監視する環境が整いつつあり、一部では既に機械監視が行われている。その一方で、機械監視はエラー検出の正確性において未だ人間の監視に劣ることが多いという評価もあり、旧来の監視者による監視も並行して行われているという実情がある。 In the age of analog television broadcasting in the past, the quality of the video and audio was that an observer actually viewed (monitored) the video and audio and performed error detection. On the other hand, with the transition to digital television broadcasting, on the other hand, processing of video signals has become easy, and environments for mechanically monitoring video and audio have been established, and some of them have already been monitored. On the other hand, there is also an evaluation that machine monitoring is still often inferior to human monitoring in the accuracy of error detection, and there is a fact that monitoring by a conventional monitor is also performed in parallel.
 しかるに、今後はデジタル映像を送信する地上波の事業に加えて、コンテンツのネット配信事業や、IPTV(Internet Protocol Tele Vision)事業などの映像配信サービスが拡大すると予測されるところ、それに伴って配信されるコンテンツの数も劇的に増大することから、全てのコンテンツに対して監視者による監視を行うことは事実上困難になると考えられている。よって、機械監視におけるエラー検出の正確性を人間並みに向上させることは、全ての映像配信サービスにおいて喫緊の課題ともいえる。 However, in addition to the terrestrial business that transmits digital video, video distribution services such as Internet content distribution services and Internet Protocol TeleVision (IPTV) business are expected to expand in the future, and will be distributed accordingly. Since the number of such content also increases dramatically, it is considered to be practically difficult for the observer to monitor all the content. Therefore, it can be said that improving the accuracy of error detection in machine monitoring to human level is a pressing issue in all video distribution services.
 特許文献1には、デジタル映像信号において種々の原因により発生するノイズに起因した映像のエラーや、デジタル音声信号において種々の原因により発生するノイズに起因した音声のエラーを検出する音声検査方法が開示されている。 Patent Document 1 discloses an audio inspection method for detecting an audio error caused by a video error caused by noise caused by various causes in a digital video signal and a noise generated by a different cause in a digital audio signal. It is done.
国際公開2015/059782号公報International Publication 2015/059782
 特許文献1によれば、連続するデジタル音声信号を5msec以下で区切ってサンプリングし、サンプリングした信号から高周波成分を抽出して、抽出された高周波成分に基づく値を閾値と比較することで、音声に生じたエラーを機械的に検出することができる。 According to Patent Document 1, continuous digital audio signals are divided and sampled at 5 msec or less, high frequency components are extracted from the sampled signals, and a value based on the extracted high frequency components is compared with a threshold value to obtain voice. Errors that occur can be detected mechanically.
 しかるに、特許文献1のような技術では、音声信号の特性値(パラメータ)を閾値と比較することによってエラー検出が行われているが、その閾値をどのように設定すべきかという課題がある。すなわち、閾値を厳しめに設定すると、頻繁にエラー検出がなされる一方で、検出されたエラーの多くが視聴者にとって無視できる程度のものであったときは検出が無駄になる恐れがある。一方、閾値を緩く設定すると、エラー検出の頻度は下がるが、視聴者にとって無視できないエラーが見逃されてしまう恐れがある。これが、機械監視が人間の監視に劣る理由の1つとされる。特にパラメータが複数あって相互に関連している場合など、1つのパラメータを変更すると他のパラメータに影響が及ぶため、2つ以上のパラメータを同時に変更しなくてはならず、適切な閾値の設定は非常に困難である。 However, in the technology as disclosed in Patent Document 1, the error detection is performed by comparing the characteristic value (parameter) of the audio signal with the threshold, but there is a problem as to how to set the threshold. That is, if the threshold is set to be strict, error detection is frequently performed, but if many of the detected errors are negligible for the viewer, detection may be wasted. On the other hand, if the threshold value is set loosely, although the frequency of error detection decreases, there is a risk that the viewer may miss an error that can not be ignored. This is one of the reasons why machine surveillance is inferior to human surveillance. Since changing one parameter affects other parameters, especially when there are multiple parameters that are related to each other, two or more parameters must be changed at the same time, and appropriate threshold settings should be made. Is very difficult.
 本発明の目的の1つは、デジタル映像信号においてコンテンツに応じて発生する映像及び/又はエラーを適切に検出する映像監視装置の調整方法を提供することにある。 One of the objects of the present invention is to provide a method of adjusting an image monitoring apparatus for appropriately detecting an image and / or an error generated depending on content in a digital image signal.
 又、本発明の別の目的は、デジタル映像信号において学習を通じて適切に映像及び/又はエラーを検出する映像監視装置を提供することにある。 Another object of the present invention is to provide an image monitoring apparatus for detecting an image and / or an error properly through learning in a digital image signal.
 第1の本発明の映像監視装置の調整方法は、監視対象である映像及び/又は音声に対応する映像信号を入力して、監視項目に対してエラーが発生したことを検出する映像監視装置の調整方法において、
 前記映像監視装置は、前記映像信号に応じて変化する複数のパラメータの全てが、前記監視項目に応じて決定された閾値を超えたときに、前記エラーが発生したと判定するようになっており、前記複数のパラメータのうち少なくとも2つが、相互に関連し合っていて、
 前記複数のパラメータの閾値をそれぞれ決定して閾値群を複数組作成し、メモリに記憶し、
 監視対象である映像及び/又は音声のコンテンツに応じて、前記メモリに記憶された前記閾値群のいずれかの組を選定することを特徴とする。
According to a first aspect of the present invention, there is provided a method of adjusting an image monitoring apparatus according to the first aspect of the present invention, wherein an image signal corresponding to an image to be monitored and / or audio is input to detect that an error occurs in a monitoring item. In the adjustment method,
The video monitoring apparatus is configured to determine that the error has occurred when all of a plurality of parameters that change according to the video signal exceed a threshold determined according to the monitoring item. , At least two of the plurality of parameters are mutually related,
The threshold values of the plurality of parameters are respectively determined to create a plurality of threshold value groups, which are stored in a memory,
It is characterized in that any set of the threshold group stored in the memory is selected according to contents of video and / or audio to be monitored.
 本発明者は鋭意研究の結果、前記閾値群の適正値が、監視対象である映像及び/又は音声のコンテンツの内容(「映画」、「ドラマ」、「バラエティ」、「スポーツ」、「ドキュメンタリー」、「エンターテインメント」、「アニメ」等)に応じて偏在することを発見した。かかる発見に基づいて、本発明を導出したのである。すなわち、前記複数のパラメータの閾値をそれぞれ決定して前記閾値群を複数組作成し、メモリに記憶しておけば、監視対象である映像及び/又は音声のコンテンツに応じて前記メモリに記憶された前記閾値群のいずれかの組を選定することで、単一の閾値群を用いる場合に比べ、より監視に適正な閾値群を設定できるから、前記映像監視装置におけるエラーの誤検出を効果的に防止することができるのである。 As a result of earnest research, the inventor of the present invention has determined that the appropriate value of the threshold group is the content of the video and / or audio content to be monitored ("movie", "drama", "variety", "sports", "documentary" , "Entertainment", "animation", etc.) have been found to be unevenly distributed. The present invention has been derived based on such findings. That is, if the threshold values of the plurality of parameters are respectively determined and a plurality of sets of threshold values are created and stored in the memory, they are stored in the memory according to the content of video and / or audio to be monitored. By selecting any set of the threshold group, a threshold group more appropriate for monitoring can be set as compared to the case where a single threshold group is used, so erroneous detection of an error in the image monitoring apparatus is effectively performed. It can be prevented.
 更に、前記映像監視装置が、映像及び/又は音声の監視動作と並行して学習を行い、その学習結果に基づいて前記閾値群の少なくとも1つの閾値を更新し、更新された前記閾値群をメモリに記憶すると好ましい。 Furthermore, the video monitoring device performs learning in parallel with the video and / or audio monitoring operation, updates at least one threshold of the threshold group based on the learning result, and stores the updated threshold group in a memory It is preferable to store in
 更に、前記映像監視装置は、エラーを検出したときにアラームを発報するようになっており、
 前記学習において、監視対象である映像及び/又は音声に対応する映像信号を入力させて、一定の時間間隔で、アラームを発報すべきエラーに対して前記アラームが発報された第1事象の件数と、アラームを発報すべきでないエラーに対して前記アラームが発報された第2事象の件数とをそれぞれカウントし、
 更に、エラーの内容と発生箇所が既知である検査用の映像信号を前記映像監視装置に入力させて、アラームを発報すべきエラーに対して前記アラームが発報されなかった第3事象の件数をカウントし、
 得られた前記第1事象の件数と、前記第2事象の件数と、前記第3事象の件数とを重み付けして演算することにより評価値を求め、求めた前記評価値に基づいて前記閾値群を更新すると好ましい。
Furthermore, the video surveillance device issues an alarm when an error is detected.
In the learning, a video signal corresponding to the video and / or audio to be monitored is input, and the alarm is issued at predetermined time intervals, the first event of the alarm for the error to be issued Count the number of events and the number of second events for which the alarm has been issued for errors that should not be issued,
Furthermore, a video signal for inspection whose content and place of occurrence of the error are known is input to the video monitoring apparatus, and the number of third events for which the alarm has not been issued for an error that should be issued. Count,
An evaluation value is determined by weighting and calculating the number of obtained first events, the number of second events, and the number of third events, and the threshold group is determined based on the determined evaluation values. It is preferable to update the
 前記学習において、前記閾値群の閾値を個別に変化させながら、前記第1事象の件数と、前記第2事象の件数と、前記第3事象の件数をカウントすると好ましい。 In the learning, it is preferable to count the number of the first events, the number of the second events, and the number of the third events while individually changing the thresholds of the threshold group.
 第2の本発明の映像監視装置は、監視対象である映像及び/又は音声に対応する映像信号を入力して、監視項目に対してエラーが発生したことを検出する映像監視装置において、
 前記映像信号に応じて変化する複数のパラメータであって、前記複数のパラメータのうち少なくとも2つが相互に関連し合ってなるパラメータの閾値を記憶する閾値記憶部と、
 前記複数のパラメータの全てが、前記監視項目に応じて決定された前記閾値を超えたときに、前記エラーが発生したと判定し、アラームを発報する判定部と、
 映像及び/又は音声の監視動作と並行して学習を行う学習部と、
 前記学習部における学習結果に基づいて前記閾値を更新する更新部と、
 前記判定部が前記エラーの発生を検出したときは、前記エラーの発生前後の映像信号を記憶する映像記憶部と、を有し、
 前記エラーの発報結果を解析することにより、前記学習部が学習することを特徴とする。
A video monitoring apparatus according to a second aspect of the present invention is a video monitoring apparatus that receives a video signal corresponding to video and / or audio to be monitored and detects that an error has occurred in a monitoring item.
A threshold storage unit that stores a plurality of parameters that change according to the video signal, wherein at least two of the plurality of parameters are associated with each other;
A determination unit that determines that the error has occurred and issues an alarm when all of the plurality of parameters exceed the threshold determined according to the monitoring item;
A learning unit that performs learning in parallel with the video and / or audio monitoring operation;
An updating unit that updates the threshold based on a learning result in the learning unit;
A video storage unit for storing video signals before and after the occurrence of the error when the determination unit detects the occurrence of the error;
The learning unit may learn by analyzing the notification result of the error.
 本発明によれば、前記エラーの発報結果を解析し、例えば「正解アラーム」、「不要アラーム、「すり抜け」などに分類することにより、前記学習部が学習を行って、前記閾値をより適正な値に更新できるので、エラー検出の精度が高まる。 According to the present invention, the result of the error notification is analyzed and classified into, for example, "correct alarm", "unwanted alarm", "pass through", etc., whereby the learning unit performs learning to make the threshold more appropriate. The error detection accuracy is improved because the value can be updated to
 更に、前記学習部は、監視対象である映像及び/又は音声に対応する映像信号を入力したときに、一定の時間間隔で、アラームを発報すべきエラーに対して前記アラームが発報された第1事象の件数と、アラームを発報すべきでないエラーに対して前記アラームが発報された第2事象の件数をカウントし、更に、エラーの内容と発生箇所が既知である検査用の映像信号を入力したときに、アラームを発報すべきエラーに対して前記アラームが発報されなかった第3事象の件数をカウントし、得られた前記第1事象の件数と、前記第2事象の件数と、前記第3事象の件数とを重み付けして演算することにより評価値を求め、求めた前記評価値に基づいて前記閾値記憶部に記憶された前記閾値を順位付けすると好ましい。 Furthermore, when the learning unit inputs a video signal corresponding to the video and / or audio to be monitored, the alarm is issued for an error that should be issued at a constant time interval. Counts the number of first events and the number of second events for which an alarm should have been issued for errors that should not have triggered an alarm When the signal is input, the number of third events for which the alarm has not been issued for the error that should be issued is counted, and the number of the first event obtained and the number of the second events It is preferable to calculate an evaluation value by calculating by weighting the number of cases and the number of cases of the third event, and ranking the threshold values stored in the threshold storage unit based on the obtained evaluation values.
 更に、前記学習部は、前記監視項目の閾値を個別に変化させながら、前記第1事象の件数と、前記第2事象の件数と、前記第3事象の件数をカウントすると好ましい。 Furthermore, it is preferable that the learning unit count the number of first events, the number of second events, and the number of third events while individually changing the threshold value of the monitoring item.
 更に、前記学習部は、M個の映像信号のうちm番目の映像信号に対して、時刻t1~t2の間に、アラームを発報すべきエラーに対して前記アラームが発報された第1事象の件数をT(m)とし、アラームを発報すべきでないエラーに対して前記アラームが発報された第2事象の件数をF(m)とし、アラームを発報すべきエラーに対して前記アラームが発報されなかった第3事象の件数をH(m)としたときに、以下の評価関数に従って評価値A(m)を求めると好ましい。
Figure JPOXMLDOC01-appb-I000002
但し、
Wt(m)は前記第1事象の重み付け
Wf(m)は前記第2事象の重み付け
Wh(m)は前記第3事象の重み付け
Furthermore, the learning unit may be configured to cause the alarm to be issued for an error that should be issued for the m-th video signal of the M video signals during the time t1 to t2. Let T (m) be the number of events, and F (m) be the number of second events for which an alarm should have been issued for errors that should not have triggered an alarm. Assuming that the number of third events for which the alarm has not been issued is H (m), it is preferable to obtain an evaluation value A (m) according to the following evaluation function.
Figure JPOXMLDOC01-appb-I000002
However,
Wt (m) is the weight of the first event Wf (m) is the weight of the second event Wh (m) is the weight of the third event
 更に、前記映像記憶部は、入力された映像信号を、所定のタイミングで所定の長さで切り出して記憶すると好ましい。 Furthermore, it is preferable that the video storage unit cuts out and stores the input video signal with a predetermined length at a predetermined timing.
 更に、前記学習部の学習結果を表示すると共に、前記エラーの発生前後の映像信号に基づいて映像及び/又は音声を再生する表示部を有すると好ましい。 Furthermore, it is preferable to have a display unit that displays the learning result of the learning unit and reproduces video and / or audio based on the video signal before and after the occurrence of the error.
 更に、前記更新部は、現在の閾値に代わる新たな閾値の候補を、優先順位を付けてリストアップすると好ましい。 Furthermore, it is preferable that the updating unit prioritizes and lists new threshold candidates that replace the current threshold.
 本明細書中、「音声」とは、人間や動物の声の他、音も含む。「映像信号」とは、映像信号及び音声信号のいずれか一方の場合を含む。「コンテンツ」とは、映像の内容のことである。具体的には、コンテンツの大分類として、「映画」、「ドラマ」、「バラエティ」、「スポーツ」、「ドキュメンタリー」、「エンターテインメント」、「アニメ」などがある。一方、コンテンツの小分類としては、「スポーツ」を例にとれば、「野球」、「サッカー」、「バレーボール」、「柔道」、「相撲」、「陸上競技」などがある。「学習」は、リアルタイムで流れる映像及び/又は音声を監視者が視聴した監視結果を、同じ映像及び/又は音声に対応する映像信号を映像監視装置で監視した結果と対応付け、それを閾値にフィードバックすることを含む。「相互に関連する」とは、一方のパラメータを調整すると、他方のパラメータに影響が及ぶことをいう。「閾値を超える」とは、閾値を上回ること、及び閾値を下回ることの双方の場合を含む。 In the present specification, "voice" includes sounds in addition to human and animal voices. "Video signal" includes the case of either video signal or audio signal. "Content" is the content of the video. Specifically, there are "movie", "drama", "variety", "sports", "documentary", "entertainment", "animation" etc. as the major classification of content. On the other hand, sub-classification of contents, if taking "sports" as an example, includes "baseball", "soccer", "volleyball", "judo", "sumo", "land athletics" and the like. “Learning” associates the monitoring result of watching the video and / or audio flowing in real time with the monitoring result of the same video and / or audio by the video monitoring device, and uses it as a threshold Including feedback. "Interrelated" means that adjusting one parameter affects the other parameter. The "above the threshold" includes both cases where the threshold is exceeded and below the threshold.
 本発明によれば、デジタル映像信号においてコンテンツに応じて発生する映像及び/又はエラーを適切に検出する映像監視装置の調整方法を提供することができる。又、本発明によれば、デジタル映像信号において学習を通じて適切に映像及び/又はエラーを検出する映像監視装置を提供することができる。 According to the present invention, it is possible to provide a method of adjusting an image monitoring apparatus for appropriately detecting an image and / or an error generated according to content in a digital image signal. Also, according to the present invention, it is possible to provide an image monitoring apparatus which detects an image and / or an error properly through learning in a digital image signal.
映像監視装置100を示すブロック図である。FIG. 2 is a block diagram showing an image monitoring apparatus 100. 表示部107に表示される映像・音声監視部103における検査項目の例を示す図である。FIG. 6 is a diagram showing an example of inspection items in the video / audio monitoring unit 103 displayed on the display unit 107. 表示部107に表示されるアラーム情報の例を示す図である。FIG. 6 is a diagram showing an example of alarm information displayed on a display unit 107. 表示部107に表示される監視項目毎のパラメータの例を示す図である。It is a figure which shows the example of the parameter for every monitoring item displayed on the display part 107. FIG. 同じ閾値を用いてエラー検出を行った際に発生した正解アラーム、不要アラーム、すり抜けを、監視項目とコンテンツ毎にまとめた例を示す図である。It is a figure which shows the example which put together the correct item alarm which generate | occur | produced when error detection was performed using the same threshold value, an unnecessary alarm, and a slip through for every monitoring item and content. あるコンテンツに関する検査項目フリーズにおいて、このような更新を複数回行った際の閾値の複数群の例((a)更新前、(b)更新後)のリストを示す図である。It is a figure which shows the list of an example ((a) before an update, (b) after an update) of several groups of the threshold-value at the time of performing such update several times in the inspection item freeze regarding a certain content.
 本実施の形態にかかる映像監視装置及びその調整方法を、図面を参照して説明する。図1は、映像監視装置100を示すブロック図である。図1において、映像監視装置100は、監視対象となる映像及び/又は音声に対応する映像信号(以下、検査対象映像信号という)を入力する主入力部101と、検査用の映像信号を入力する副入力部102と、閾値を記憶するメモリを内蔵する映像・音声監視部(判定部)103と、映像・音声クリップ蓄積部(映像記憶部)104と、アラーム出力部105と、内蔵メモリ(閾値記憶部)106aを含むパラメータ最適化用学習部(学習部兼更新部)106と、監視者MNが表示された情報を視認可能なディスプレイ等の表示部107と、監視者MNが情報を入力可能なキーボード等の監視者入力部108とを有している。 An image monitoring apparatus according to the present embodiment and an adjustment method thereof will be described with reference to the drawings. FIG. 1 is a block diagram showing an image monitoring apparatus 100. As shown in FIG. In FIG. 1, a video monitoring apparatus 100 inputs a main input unit 101 for inputting a video signal (hereinafter referred to as a video signal to be inspected) corresponding to video and / or audio to be monitored, and a video signal for inspection. Sub-input unit 102, video / audio monitoring unit (determination unit) 103 incorporating memory for storing threshold, video / audio clip storage unit (video storage unit) 104, alarm output unit 105, internal memory (threshold A parameter optimization learning unit (learning unit and updating unit) 106 including a storage unit 106a, a display unit 107 such as a display on which the information displayed by the supervisor MN can be viewed, and the supervisor MN can input information And a supervisor input unit 108 such as a keyboard.
 映像監視装置100の監視動作について説明する。図2は、表示部107に表示される映像・音声監視部103における検査項目の例を示す図である。監視対象となる映像信号は、SDI信号、ファイル、IP形式、HDMI(登録商標)等全てのフォーマットの映像信号を対象とする。監視者MNは、表示部107に表示された監視項目を見ながら、監視者入力部108を介して、監視項目それぞれに対応するボックス内を変化させ、監視項目毎に「検査」又は「オフ」のいずれかを選択することができる。監視者が「検査」を選択した検査項目は、映像監視装置100が監視することになるが、「オフ」を選択した検査項目は監視されない。 The monitoring operation of the video monitoring device 100 will be described. FIG. 2 is a diagram showing an example of inspection items in the video / audio monitoring unit 103 displayed on the display unit 107. As shown in FIG. The video signals to be monitored are video signals of all formats such as an SDI signal, a file, an IP format, and HDMI (registered trademark). The supervisor MN changes the inside of the box corresponding to each monitoring item via the supervisor input unit 108 while looking at the monitoring item displayed on the display unit 107, and performs “inspection” or “off” for each monitoring item. You can choose one of these. The inspection item for which the supervisor has selected “inspection” is to be monitored by the video monitoring apparatus 100, but the inspection item for which “off” is selected is not monitored.
 映像に関する監視項目としては、「フリーズ」、「ブラックアウト」、「遮断フリーズ」、「遮断ブラックアウト」、「ブロックノイズ」、「赤色点滅」、「輝度点滅」、「場面転換」、「映像反転」、「ラインノイズ」、「カット点異常」、「タイムコード不連続」がある。一方、音声に関する監視項目としては、「ミュート」、「遮断ミュート」、「プツ音(Audio Pop Noise)」、「音飛び」、「音声ノイズ」、「ラウドネス」、「トゥルーピーク」がある。尚、検査項目がこれらに限られることはない。 The monitoring items related to the image include "freeze", "black out", "block freeze", "block black out", "block noise", "red blink", "brightness blink", "scene change", "image reverse" There are “line noise”, “cut point abnormality”, and “time code discontinuous”. On the other hand, monitoring items relating to audio include "mute", "cut-off mute", "audio pop noise", "sound skipping", "voice noise", "loudness" and "true peak". The inspection items are not limited to these.
 監視動作の前提として、映像・音声監視部103には、監視項目毎に複数の閾値(閾値群という、詳細は後述)が設定されているものとする。図1を参照して、映像監視装置100は、主入力部101から検査対象映像信号を入力すると、入力された検査対象映像信号は、画像フレーム、画素データ、音声サンプリングデータが抽出されて、映像・音声監視部103に送信され、ここで監視アルゴリズムに適用できる各種感度調整や継続時間等に対応したパラメータを算出する。 As a premise of the monitoring operation, it is assumed that in the video / voice monitoring unit 103, a plurality of thresholds (a threshold group, the details will be described later) are set for each monitoring item. Referring to FIG. 1, when the video monitoring apparatus 100 receives an inspection target video signal from the main input unit 101, an image frame, pixel data, and audio sampling data are extracted from the input inspection target video signal. The parameter is transmitted to the voice monitoring unit 103, where parameters corresponding to various sensitivity adjustments and durations that can be applied to the monitoring algorithm are calculated.
 映像・音声監視部103は、監視アルゴリズムに従って、求めた複数のパラメータを監視項目に対応する閾値と比較し、全てのパラメータが閾値を超えた場合には監視項目に対応するエラーを検出したものと判定して、当該監視項目と対応づけたアラーム信号をアラーム出力部105に出力する。アラーム出力部105は、入力したアラーム信号に応じて、エラー発生時刻と、検出したエラーの内容と、その深刻度とを含むアラーム情報を表示部107に入力するので、表示部107はアラーム情報を表示し、それを監視者MNが視認することができる。又、映像・音声監視部103は、パラメータが閾値を超えた前後の映像信号を切り出して、映像・音声クリップ蓄積部104に記憶するようになっている。 According to the monitoring algorithm, the video / voice monitoring unit 103 compares a plurality of obtained parameters with a threshold value corresponding to the monitoring item, and when all the parameters exceed the threshold value, an error corresponding to the monitoring item is detected. It judges and outputs the alarm signal matched with the said monitoring item to the alarm output part 105. FIG. Since the alarm output unit 105 inputs alarm information including the error occurrence time, the content of the detected error, and the severity of the error to the display unit 107 according to the input alarm signal, the display unit 107 receives the alarm information. It can be displayed and viewed by the observer MN. Also, the video / audio monitoring unit 103 cuts out video signals before and after the parameter exceeds the threshold and stores the video signals in the video / audio clip storage unit 104.
 図3は、表示部107に表示されるアラーム情報の例を示す図である。図3において、画面下段には検出されたエラー及び内容が時系列で表示されると共に、画面上段にはエラーが映像又は音声であることを示す「カテゴリー」、エラーのレベルが普通又は重篤を示す「クラス」、エラーの種別を示す「検査項目」、及びエラーの「発生回数」がまとめて表示される。監視者MNが、監視者入力部108を介して、いずれかのエラーを選択すると、それに応じて映像・音声クリップ蓄積部104から、対応するエラー前後の映像信号が読み出され、表示部107に入力される。これによりエラー前後における映像及び/又は音声が表示部107から出力されるので、監視者MNはエラーの内容を実際に視認でき、或いは聴取することができる。 FIG. 3 is a diagram showing an example of alarm information displayed on the display unit 107. As shown in FIG. In FIG. 3, the lower part of the screen shows the detected error and content in chronological order, and the upper part of the screen shows "category" indicating that the error is video or audio, and the level of error is normal or severe. The "class" indicating, the "inspection item" indicating the type of error, and the "number of occurrences of error" are displayed together. When the supervisor MN selects any error via the supervisor input unit 108, the video / audio clip storage unit 104 reads out the corresponding video signal before and after the error, and the display unit 107 It is input. Since the video and / or the sound before and after the error are output from the display unit 107 by this, the supervisor MN can actually visually recognize or listen to the content of the error.
 ところで、表示部107に表示された検出エラーの内容と、実際の映像及び/又は音声を監視者MNが見比べることで、アラームを発報すべきエラーに対してアラームが発報された場合(正解アラームという)と、アラームを発報すべきでないエラーに対してアラームが発報された場合(不要アラームという)と、アラームを発報すべきエラーに対してアラームが発報されなかった場合(すり抜けという)とをそれぞれ認識できる。本来的には、映像監視装置100がエラー検出時に発報するアラームが全て正解アラームであることが理想であるが、実際には不要アラームやすり抜けが生じてしまう。これは、映像・音声監視部103の機械監視によるエラー検出基準が、監視者MNの監視によるエラー検出基準と厳密に一致しないことから生じるものである。よって、機械監視によるエラー検出を人間の監視によるエラー検出に近づけない限り、映像監視装置100単独での監視が困難となる。 By the way, when the alarm is issued for the error that should be issued by the observer MN comparing the content of the detection error displayed on the display unit 107 with the actual video and / or audio, the correct answer (correct answer If an alarm is issued for an error that should not issue an alarm (an unnecessary alarm) or if an alarm is not issued for an error that should issue an alarm Can be recognized respectively. Essentially, it is ideal that all alarms issued when the image monitoring apparatus 100 detects an error are all correct alarms, but in reality, unnecessary alarms and passing through occur. This results from the fact that the error detection standard by machine monitoring of the video / voice monitoring unit 103 does not exactly match the error detection standard by monitoring of the supervisor MN. Therefore, unless the error detection by the machine monitoring approaches the error detection by the human monitoring, the monitoring by the image monitoring apparatus 100 alone becomes difficult.
 そこで、映像・音声監視部103の機械監視によるエラー検出基準を、監視者MNの監視によるエラー検出基準に近づけるために、監視項目毎に設定されたパラメータの閾値を変えることとする。これを閾値のチューニングという。ここで、監視項目がm個あったとき、そのパラメータの個数をI(m)とすると、チューニングすべき閾値の数は、1・I(1)+2・I(2)+・・・+m・I(m)となって、監視項目が増えるにつれて増大することがわかる。 Therefore, in order to bring the error detection standard by machine monitoring of the video / voice monitoring unit 103 closer to the error detection standard by monitoring of the supervisor MN, the threshold value of the parameter set for each monitoring item is changed. This is called threshold tuning. Here, when there are m monitoring items, assuming that the number of parameters is I (m), the number of threshold values to be tuned is 1 · I (1) + 2 · I (2) +. It can be seen that I (m) increases as the number of monitoring items increases.
 図4は、表示部107に表示される監視項目毎のパラメータの例を示す図である。監視者MNは、表示部107に表示されたパラメータを見ながら、監視者入力部108を介して、パラメータに対応するボックス内に、パラメータ毎に任意の数値を入力することができるようになっている。 FIG. 4 is a diagram showing an example of parameters for each monitoring item displayed on the display unit 107. As shown in FIG. The supervisor MN can now input an arbitrary numerical value for each parameter in the box corresponding to the parameter via the supervisor input unit 108 while looking at the parameter displayed on the display unit 107. There is.
 一例を挙げると、検査項目「フリーズ」に対応して、パラメータとして「感度閾値(アクティビティ)」、「感度閾値(ノイズ)」、「時間閾値(開始)」、「時間閾値(終了)」の4つがある。「グラフスケール」は表示用のグラフの縮尺を表すもので、ここではパラメータではない。つまり、エラーとしての映像フリーズを検出する為には、4つのパラメータの閾値(閾値群)を各々適切に設定しなければならない。しかるに、「感度閾値(アクティビティ)」と「感度閾値(ノイズ)」とは、パラメータとして映像1フレームに含まれる小ブロックごとに分散をとった値の上限値及び下限値であり、相互に関連し合っているといえる。このようなパラメータは、国際公開第2015-059782号に詳細が開示されている。又、「時間閾値(開始)」と「時間閾値(終了)」は、映像がフレーズしたと判断する期間の長さを示したものであり、相互に関係し合っている。よって、いずれか一方の閾値を調整した場合、他方の閾値も併せて変更しなければ、適切にフリーズを検出できないこととなる。 For example, in response to the inspection item "freeze", 4 parameters "sensitivity threshold (activity)", "sensitivity threshold (noise)", "time threshold (start)", and "time threshold (end)" There is one. "Graph scale" represents the scale of the graph for display, and is not a parameter here. That is, in order to detect a video freeze as an error, thresholds (threshold groups) of four parameters must be set appropriately. However, "sensitivity threshold (activity)" and "sensitivity threshold (noise)" are upper and lower limit values of the variance for each small block included in one video frame as a parameter, and they are mutually related. It can be said that they fit. Such parameters are disclosed in detail in WO 2015-059782. Also, “time threshold (start)” and “time threshold (end)” indicate the length of a period during which it is determined that the video is phrased, and they are mutually related. Therefore, when one of the thresholds is adjusted, it is impossible to appropriately detect the freeze unless the other threshold is also changed.
 ここで、全ての検査項目に対して適切な閾値を入力しないと、エラー検出時の不要アラームやすり抜けを招いてしまうが、図4に示すように検査項目の数が多い場合、全ての検査項目に対して適正な閾値を決定することは困難である。そこで、デフォルトの閾値(初期値)を予め決定しておくことが望ましい。このようなデフォルトの閾値は、例えば映像・音声監視部103の内蔵メモリに記憶して用いることができる。しかるに、本発明者の検討結果によれば、全てのコンテンツに対して同じ閾値を用いた場合において、あるコンテンツでは不要アラームやすり抜けが低減されるが、別なコンテンツでは不要アラームやすり抜けが生じることが分かった。 Here, if an appropriate threshold value is not input for all inspection items, unnecessary alarms or omissions at the time of error detection are caused. However, as shown in FIG. 4, when the number of inspection items is large, all inspection items It is difficult to determine an appropriate threshold for Therefore, it is desirable to determine a default threshold (initial value) in advance. Such a default threshold can be stored and used, for example, in the built-in memory of the video / audio monitoring unit 103. However, according to the study results of the inventor, when the same threshold value is used for all the content, the unnecessary alarm and the passing through are reduced for one content, but the unnecessary alarm and the passing occurs for another content. I understand.
 図5は、同じ閾値を用いてエラー検出を行った際に発生した正解アラーム、不要アラーム、すり抜けを、監視項目とコンテンツ毎にまとめた例を示す図である。図5の例から、コンテンツ毎に正解アラーム、不要アラーム、すり抜けの発生頻度が異なることがわかる。この検討結果に基づいて、本発明者は、コンテンツの内容に応じてデフォルトの閾値を決定すればよいことを見出したのである。このようなデフォルトの閾値は、シミュレーションや積み重ねられた経験から求めることができる。 FIG. 5 is a diagram showing an example in which the correct answer alarm, the unnecessary alarm, and the slip-through that occur when the error detection is performed using the same threshold are summarized for each monitoring item and content. From the example of FIG. 5, it can be seen that the occurrence frequency of the correct alarm, the unnecessary alarm, and the slip-in differs for each content. Based on the examination result, the inventor has found that it is sufficient to determine the default threshold according to the content of the content. Such default thresholds can be determined from simulations and accumulated experience.
 一方、コンテンツの内容に応じてデフォルトの閾値を決定すれば、ある程度不要アラームやすり抜けを減らせる可能性はあるが、常に最適とは限らない。コンテンツの内容が同じだとしても、受信状態等によっては、不要アラームやすり抜けの頻度が変化することがある。そこで、監視動作と並行して、現在入力している映像信号に関して映像監視装置100に学習させることで、デフォルトの閾値が適切か否かを判断し、適切でなければ更に更新することが好ましいといえる。これにより、正解アラームを増大させて、不要アラームやすり抜けの頻度を低下させることができる。更新された閾値は、新たに当該コンテンツのデフォルトの閾値とすることができる。 On the other hand, if the default threshold is determined according to the content of the content, there is a possibility of reducing unnecessary alarms and passing through to some extent, but it is not always optimal. Even if the contents of the content are the same, the frequency of unnecessary alarms and passing through may change depending on the reception state and the like. Therefore, in parallel with the monitoring operation, by causing the video monitoring apparatus 100 to learn about the currently input video signal, it is determined whether the default threshold is appropriate or not, and it is preferable to further update it if it is not appropriate. It can be said. This makes it possible to increase the number of correct alarms and reduce the frequency of unnecessary alarms and passing through. The updated threshold may be newly set as the default threshold of the content.
(学習態様)
 以下、映像監視装置100の学習機能について説明する。以下の学習例では、デフォルトの閾値に対して、変更すべき閾値候補がある場合について、いずれが優れているか判定できる。図1の映像監視装置100において、映像及び/又は音声の監視動作を実際に行っている間に、監視者MNが監視者入力部108から所定の学習期間を設定する。すると、映像監視装置100は、この学習期間の間、学習を行うことができる。
(Learning mode)
The learning function of the video surveillance device 100 will be described below. In the following learning example, it is possible to determine which is better for the case where there are threshold candidates to be changed with respect to the default threshold. In the video monitoring apparatus 100 of FIG. 1, while the video and / or audio monitoring operation is actually performed, the supervisor MN sets a predetermined learning period from the supervisor input unit 108. Then, the video surveillance device 100 can perform learning during this learning period.
 具体的には、同一種のコンテンツでM個の映像信号において、1つの監視項目についてデフォルトの閾値を用いて監視動作を行う。まず学習期間(時刻t1~t2、例えば時間、日、週間、月などの時間単位で良い)の間、映像監視装置100が検出したエラーが、図3に示すように表示部107に表示される。図3のように表示されたエラーには、不適切なものも含まれている可能性があるため、学習のためにはエラーの適否のチェックを行う必要がある。 Specifically, the monitoring operation is performed using the default threshold value for one monitoring item in M video signals of the same type of content. First, an error detected by the image monitoring apparatus 100 is displayed on the display unit 107 as shown in FIG. 3 during a learning period (time t1 to t2, for example, a time unit such as hours, days, weeks, or months). . Since the errors displayed as shown in FIG. 3 may include inappropriate ones, it is necessary to check whether the errors are appropriate for learning.
 ここで、監視者MNは、図3に表示されたエラーのいずれかをワンクリックで指定することにより、映像・音声クリップ蓄積部104に記憶された当該エラー前後の映像信号を読み出して、対応する映像及び/又は音声を表示部107で視聴することができる。その結果、監視者MNは、時刻t1~t2の間に生じた正解アラームの件数と不要アラームの件数とをカウントすることが出来る。映像・音声クリップ蓄積部104に記憶された各エラー前後の映像信号は、長さが3秒程度であるから視聴に時間はかからず、このチェックは短時間で終了するため、監視者MNの負担が少ない。 Here, the supervisor MN reads the video signals before and after the error stored in the video / audio clip storage unit 104 by designating one of the errors displayed in FIG. Video and / or audio can be viewed on the display unit 107. As a result, the supervisor MN can count the number of correct alarms generated during the time t1 to t2 and the number of unnecessary alarms. Since the video signal before and after each error stored in the video / audio clip storage unit 104 has a length of about 3 seconds, it does not take a long time to view, and this check is completed in a short time. There is little burden.
 但し、上述のチェックでは、すり抜けの件数をカウントすることができない。そこで、予め準備しておいた、エラーの内容と発生箇所(時間)が既知である検査用の映像信号を、副入力部102を介して映像・音声監視部103に入力し、上述と同じ閾値を用いて映像・音声監視部103でエラーの検出を行う。これにより図3に示すものと同様なアラーム情報が得られたとき、監視者MNは、既知であるエラーの内容と発生箇所を照合することで、正解アラームの件数と、すり抜けの件数とを求めることができる。ここではすり抜けの件数を抽出して用いる。検査用の映像信号は、コンテンツ毎に数十分程度の視聴時間に相当する長さであるため、検査の手間がかからない。以上のようにしてM個の映像信号に対してそれぞれ求めた正解アラームの件数と、不要アラームの件数と、すり抜けの件数とを、監視者MNが監視者入力部108から、パラメータ最適化用学習部106に入力する。 However, in the above-mentioned check, the number of passing can not be counted. Therefore, the video signal for inspection whose content and occurrence point (time) of the error prepared beforehand are known is input to the video / audio monitoring unit 103 via the sub input unit 102, and the same threshold as described above The video / audio monitoring unit 103 detects an error using As a result, when alarm information similar to that shown in FIG. 3 is obtained, the supervisor MN finds the number of correct alarms and the number of slips by collating the contents of the known error with the occurrence part. be able to. Here, the number of passing is extracted and used. Since the video signal for inspection has a length corresponding to a viewing time of several tens of minutes for each content, it does not take time for inspection. As described above, the supervisor MN learns from the supervisor input unit 108 the parameter optimization learning from the number of correct alarms, the number of unnecessary alarms, and the number of slips calculated for each of the M video signals. Input to the part 106.
 ここで、m番目の映像信号における正解アラーム(第1事象)の数をT(m)とし、不要アラーム(第2事象)の数をF(m)とし、すり抜け(第3事象)の数をH(m)とすると、パラメータ最適化用学習部106は、以下の評価関数に従って,評価値A(m)を求める。
Figure JPOXMLDOC01-appb-I000003
但し、
Wt(m)は正解アラームの重み付け
Wf(m)は不要アラームの重み付け
Wh(m)はすり抜けの重み付け
Here, let T (m) be the number of correct alarms (first event) in the m-th video signal, F (m) be the number of unnecessary alarms (second event), and let the number of slips (third event) be Assuming that H (m), the parameter optimization learning unit 106 obtains an evaluation value A (m) according to the following evaluation function.
Figure JPOXMLDOC01-appb-I000003
However,
Wt (m) is the correct alarm weighting Wf (m) is the unnecessary alarm weighting Wh (m) is the passing weight
 エラー検出において、一般的に正解アラーム数は多い方が好ましく、不要アラーム数は少ない方が好ましく、更にすり抜けは極力なくすることが好ましい。よって、メモリ106aに記憶された重み付けとして、Wt(m)>0>Wf(m)>Wh(m)と設定することが好ましく、例えばWt(m)=+3とし、Wf(m)=-1とし、Wh(m)=-5とする。但し、重み付けは同一種のコンテンツでは固定するとよい。この評価関数によれば、得られた評価値A(m)が高いほどエラー検出が正確であり、その値が低いほどエラー検出が不正確となる。 Generally, in error detection, it is preferable that the number of correct alarms be large, and it be preferable that the number of unnecessary alarms be small. Furthermore, it is preferable to minimize passing through. Therefore, it is preferable to set Wt (m)> 0> Wf (m)> Wh (m) as the weighting stored in the memory 106a. For example, Wt (m) = + 3 and Wf (m) = − 1. Let Wh (m) = -5. However, the weighting may be fixed for the same type of content. According to this evaluation function, the higher the obtained evaluation value A (m), the more accurate the error detection, and the lower the value, the more inaccurate the error detection.
 次いで、監視者MNが監視者入力部108を介して、デフォルトの閾値から、変更を希望する閾値へと置換を行って、同じコンテンツの映像信号を用いて、以上と同様な監視動作を実行して評価値を求める。デフォルトの閾値を用いた監視動作の評価値が、変更を希望する閾値を用いた監視動作の評価値以上であれば、パラメータ最適化用学習部106は、当該コンテンツにおいてはデフォルトの閾値を使い続けるものとして、記憶したデフォルトの閾値を変更しない。これに対し、デフォルトの閾値を用いた監視動作の評価値が、変更を希望する閾値を用いた監視動作の評価値より低ければ、パラメータ最適化用学習部106は、当該コンテンツにおいてはデフォルトの閾値の変更が必要と判断し、変更を希望する閾値と置換し、すなわち学習を行う。これにより、エラー検出の精度を一層高めることができる。 Next, the supervisor MN replaces the default threshold value with the threshold value desired to be changed via the supervisor input unit 108, and executes the same monitoring operation as above using the video signal of the same content. The evaluation value is determined. If the evaluation value of the monitoring operation using the default threshold is equal to or more than the evaluation value of the monitoring operation using the threshold desired to be changed, the parameter optimization learning unit 106 continues using the default threshold in the content. As a thing, do not change the stored default threshold. On the other hand, if the evaluation value of the monitoring operation using the default threshold is lower than the evaluation value of the monitoring operation using the threshold for which change is desired, the parameter optimization learning unit 106 determines the default threshold for the content. It is determined that the change is necessary, and the change is replaced with a desired threshold, that is, learning is performed. Thereby, the accuracy of error detection can be further enhanced.
 図6は、あるコンテンツに関する検査項目:フリーズにおいて、このような更新を複数回行った際の閾値の複数群の例((a)更新前、(b)更新後)のリストを示す図である。図6に示す複数組の閾値群がリストアップされて、内蔵メモリ106aに記憶されており、必要に応じて表示部107に表示され、監視者MNが確認できるようになっている。 FIG. 6 is a diagram showing a list of examples of a plurality of threshold values when such an update is performed a plurality of times in an inspection item regarding a certain content: freeze, ((a) before update, (b) after update) . A plurality of sets of threshold value groups shown in FIG. 6 are listed up and stored in the built-in memory 106a, displayed on the display unit 107 as needed, and can be confirmed by the supervisor MN.
 図6(a)の更新前リストによれば、現在、デフォルトの閾値群が優先順位1として最上位にリストアップされているが、その評価値A(m)は「85」である。一方、新たに第1候補として評価された閾値群の評価値A(m)は「92」であり、新たに第2候補として評価された閾値群の評価値A(m)は「81」であったとする。 According to the pre-update list of FIG. 6A, the default threshold group is currently listed at the top with priority 1 but its evaluation value A (m) is “85”. On the other hand, the evaluation value A (m) of the threshold group newly evaluated as the first candidate is "92", and the evaluation value A (m) of the threshold group newly evaluated as the second candidate is "81" Suppose that there was.
 かかる場合、第1候補の閾値群の評価値A(m)-「92」は、今までのデフォルトの閾値群の評価値A(m)「85」よりも高いので、パラメータ最適化学習部106が、図6(b)に示すように、更新により優先順位1とすることで新たなデフォルトの閾値群として置き換え、監視のために映像・音声監視部103に送信する。これにより、今までのデフォルトの閾値群は、優先順位2に繰り下がることとなる。 In this case, since the evaluation value A (m)-"92" of the first candidate threshold group is higher than the evaluation value A (m) "85" of the default threshold group so far, the parameter optimization learning unit 106 However, as shown in FIG. 6B, by setting the priority to 1 by updating, it is replaced as a new default threshold group and transmitted to the video / audio monitoring unit 103 for monitoring. By this, the default threshold group so far will be carried back to priority 2.
 これに対し、第2候補の閾値群の評価値A(m)「81」は、置換されたデフォルトの閾値群の評価値A(m)「85」よりも低いため、パラメータ最適化学習部106が、更新により優先順位3とすることで、より下位にリストアップすることとなる。かかるリストは、パラメータ最適化用学習部106のメモリ106aに記憶されて用いられる。これにより、当該コンテンツに関して、常に高い評価値A(m)の閾値群をデフォルトの閾値群とできる。但し、コンテンツが異なれば同じ値を用いた閾値群であっても、優先順位が異なることはあり得る。尚、このような閾値群の更新は、パラメータ最適化学習部106が自動的に行っても良いし、監視者MNの許可を待ってから行うこともできる。或いは、評価値A(m)の値にかかわらず、監視者入力部108を介して監視者MNが指定した閾値を、デフォルトの閾値とすることもできる。 On the other hand, since the evaluation value A (m) “81” of the second candidate threshold group is lower than the evaluation value A (m) “85” of the replaced default threshold group, the parameter optimization learning unit 106 However, by setting the priority to 3 by updating, it will be listed in lower order. The list is stored and used in the memory 106 a of the parameter optimization learning unit 106. Thereby, the threshold value group of high evaluation value A (m) can always be made into the default threshold value group regarding the said content. However, if the content is different, even if the threshold value group uses the same value, the priority may be different. Note that such updating of the threshold value group may be performed automatically by the parameter optimization learning unit 106, or may be performed after waiting for the supervisor MN's permission. Alternatively, regardless of the value of the evaluation value A (m), the threshold designated by the supervisor MN via the supervisor input unit 108 can be set as the default threshold.
 尚、第1候補、第2候補、・・・の決め方であるが、監視者MNが任意の値を監視者入力部108からパラメータ最適化用学習部106に入力することで、その都度、正解アラームの数と、不要アラームの数と、すり抜けの数を求めた上で、評価値を演算しても良い。或いは、パラメータ最適化用学習部106が、デフォルトの閾値群に対して、個別に閾値を+5%又は-5%ずつ変化させて、その都度、正解アラームの数と、不要アラームの数と、すり抜けの数を求めた上で、評価値を演算するようにしても良い。その他の監視項目においても、同様に評価値を求めることができる。 It should be noted that the first candidate, the second candidate,... Are determined, but each time the supervisor MN inputs an arbitrary value from the supervisor input unit 108 to the parameter optimization learning unit 106, the correct answer is obtained each time The evaluation value may be calculated after obtaining the number of alarms, the number of unnecessary alarms, and the number of by-passes. Alternatively, the parameter optimization learning unit 106 individually changes the threshold by + 5% or -5% with respect to the default threshold group, and in each case, the number of correct alarms, the number of unnecessary alarms, and the bypass The evaluation value may be calculated after finding the number of. Evaluation values can be similarly obtained for other monitoring items.
 尚、検査用の映像信号の入力は、監視対象映像信号の入力の合間に行うこともできるし、監視対象映像信号の入力と並行して行って、バックグラウンドで検査を行うこともできる。それにより同じ閾値を用いてエラー検出を実行できる。更に、映像・音声蓄積クリップ部104は、主入力部101から入力された監視対象映像信号を、所定の長さに渡って所定のタイミングで切り取り、監視者MNが行った監視結果に対応づけて、不図示のメモリに検査用映像信号として記憶しても良い。記憶された検査用映像信号は、必要なタイミングで副入力部102から入力されて、上述のように閾値更新に用いられることとなる。 The video signal for inspection may be input between the input of the monitoring target video signal, or may be performed in parallel with the input of the monitoring target video signal to perform inspection in the background. The same threshold can then be used to perform error detection. Furthermore, the video / audio storage clip unit 104 cuts out the monitoring target video signal input from the main input unit 101 over a predetermined length at a predetermined timing, and associates it with the monitoring result performed by the supervisor MN. Alternatively, it may be stored as a video signal for inspection in a memory (not shown). The stored test video signal is input from the sub input unit 102 at a necessary timing, and is used for threshold updating as described above.
 本発明により、デジタル映像信号においてコンテンツに応じて発生する映像及び/又はエラーを適切に検出する映像監視装置の調整方法を提供することができ、デジタル映像信号において学習を通じて適切に映像及び/又はエラーを検出する映像監視装置を提供することができる。 According to the present invention, it is possible to provide a method of adjusting a video monitoring apparatus for appropriately detecting a video and / or an error generated according to content in a digital video signal, and in the digital video signal, the video and / or the error are properly learned It is possible to provide an image monitoring device that detects
100      映像監視装置
101      主入力部
102      副入力部
103      映像・音声監視部
104      映像・音声クリップ蓄積部
105      アラーム出力部
106      パラメータ最適化用学習部
106a     内蔵メモリ
107      表示部
108      監視者入力部
MN      監視者
100 video monitoring apparatus 101 main input unit 102 secondary input unit 103 video / audio monitoring unit 104 video / audio clip storage unit 105 alarm output unit 106 learning unit for parameter optimization built-in memory 107 a display unit 108 monitor unit 108 monitor input unit MN monitor

Claims (11)

  1.  監視対象である映像及び/又は音声に対応する映像信号を入力して、監視項目に対してエラーが発生したことを検出する映像監視装置の調整方法において、
     前記映像監視装置は、前記映像信号に応じて変化する複数のパラメータの全てが、前記監視項目に応じて決定された閾値を超えたときに、前記エラーが発生したと判定するようになっており、前記複数のパラメータのうち少なくとも2つが、相互に関連し合っていて、
     前記複数のパラメータの閾値をそれぞれ決定して閾値群を複数組作成し、メモリに記憶し、
     監視対象である映像及び/又は音声のコンテンツに応じて、前記メモリに記憶された前記閾値群のいずれかの組を選定することを特徴とする映像監視装置の調整方法。
    A method of adjusting an image monitoring apparatus, which receives an image signal corresponding to an image and / or an audio to be monitored and detects that an error has occurred in a monitoring item,
    The video monitoring apparatus is configured to determine that the error has occurred when all of a plurality of parameters that change according to the video signal exceed a threshold determined according to the monitoring item. , At least two of the plurality of parameters are mutually related,
    The threshold values of the plurality of parameters are respectively determined to create a plurality of threshold value groups, which are stored in a memory,
    A method of adjusting a video surveillance apparatus, comprising selecting any set of the threshold group stored in the memory according to contents of video and / or audio to be monitored.
  2.  前記映像監視装置が、映像及び/又は音声の監視動作と並行して学習を行い、その学習結果に基づいて前記閾値群の少なくとも1つの閾値を更新し、更新された前記閾値群をメモリに記憶することを特徴とする請求項1に記載の映像監視装置の調整方法。 The video monitoring device performs learning in parallel with the video and / or audio monitoring operation, updates at least one threshold of the threshold group based on the learning result, and stores the updated threshold group in a memory The adjustment method of the image | video monitoring apparatus of Claim 1 characterized by the above-mentioned.
  3.  前記映像監視装置は、エラーを検出したときにアラームを発報するようになっており、
     前記学習において、監視対象である映像及び/又は音声に対応する映像信号を入力させて、一定の時間間隔で、アラームを発報すべきエラーに対して前記アラームが発報された第1事象の件数と、アラームを発報すべきでないエラーに対して前記アラームが発報された第2事象の件数とをそれぞれカウントし、
     更に、エラーの内容と発生箇所が既知である検査用の映像信号を前記映像監視装置に入力させて、アラームを発報すべきエラーに対して前記アラームが発報されなかった第3事象の件数をカウントし、
     得られた前記第1事象の件数と、前記第2事象の件数と、前記第3事象の件数とを重み付けして演算することにより評価値を求め、求めた前記評価値に基づいて前記閾値群を更新することを特徴とする請求項2に記載の映像監視装置の調整方法。
    The video surveillance device issues an alarm when it detects an error.
    In the learning, a video signal corresponding to the video and / or audio to be monitored is input, and the alarm is issued at predetermined time intervals, the first event of the alarm for the error to be issued Count the number of events and the number of second events for which the alarm has been issued for errors that should not be issued,
    Furthermore, a video signal for inspection whose content and place of occurrence of the error are known is input to the video monitoring apparatus, and the number of third events for which the alarm has not been issued for an error that should be issued. Count,
    An evaluation value is determined by weighting and calculating the number of obtained first events, the number of second events, and the number of third events, and the threshold group is determined based on the determined evaluation values. The method of adjusting an image monitoring apparatus according to claim 2, wherein
  4.  前記学習において、前記閾値群の閾値を個別に変化させながら、前記第1事象の件数と、前記第2事象の件数と、前記第3事象の件数をカウントすることを特徴とする請求項3に記載の映像監視装置の調整方法。 4. The method according to claim 3, wherein in the learning, the number of the first event, the number of the second event, and the number of the third event are counted while changing the thresholds of the threshold group individually. The adjustment method of the image | video monitoring apparatus of description.
  5.  監視対象である映像及び/又は音声に対応する映像信号を入力して、監視項目に対してエラーが発生したことを検出する映像監視装置において、
     前記映像信号に応じて変化する複数のパラメータであって、前記複数のパラメータのうち少なくとも2つが相互に関連し合ってなるパラメータの閾値を記憶する閾値記憶部と、
     前記複数のパラメータの全てが前記監視項目に応じて決定された前記閾値を超えたときに、前記エラーが発生したと判定し、アラームを発報する判定部と、
     映像及び/又は音声の監視動作と並行して学習を行う学習部と、
     前記学習部における学習結果に基づいて前記閾値を更新する更新部と、
     前記判定部が前記エラーの発生を検出したときは、前記エラーの発生前後の映像信号を記憶する映像記憶部と、を有し、
     前記エラーの発報結果を解析することにより、前記学習部が学習することを特徴とする映像監視装置。
    In an image monitoring apparatus that receives an image signal corresponding to an image and / or audio to be monitored and detects that an error has occurred in a monitoring item,
    A threshold storage unit that stores a plurality of parameters that change according to the video signal, wherein at least two of the plurality of parameters are associated with each other;
    A determination unit that determines that the error has occurred and issues an alarm when all of the plurality of parameters exceed the threshold determined according to the monitoring item;
    A learning unit that performs learning in parallel with the video and / or audio monitoring operation;
    An updating unit that updates the threshold based on a learning result in the learning unit;
    A video storage unit for storing video signals before and after the occurrence of the error when the determination unit detects the occurrence of the error;
    The image monitoring apparatus characterized in that the learning unit learns by analyzing the notification result of the error.
  6.  前記学習部は、監視対象である映像及び/又は音声に対応する映像信号を入力したときに、一定の時間間隔で、アラームを発報すべきエラーに対して前記アラームが発報された第1事象の件数と、アラームを発報すべきでないエラーに対して前記アラームが発報された第2事象の件数をカウントし、更に、エラーの内容と発生箇所が既知である検査用の映像信号を入力したときに、アラームを発報すべきエラーに対して前記アラームが発報されなかった第3事象の件数をカウントし、得られた前記第1事象の件数と、前記第2事象の件数と、前記第3事象の件数とを重み付けして演算することにより評価値を求め、求めた前記評価値に基づいて前記閾値記憶部に記憶された前記閾値を順位付けすることを特徴とする請求項5に記載の映像監視装置。 When the learning unit inputs a video signal corresponding to video and / or audio to be monitored, the alarm is issued for an error that should issue an alarm at predetermined time intervals. Count the number of events and the number of second events for which an alarm should have been issued for errors that should not have triggered an alarm. Furthermore, the video signal for inspection whose content and location of the error are known The number of third events for which an alarm should not be issued when the input is made is counted, and the number of the first event obtained and the number of the second event Calculating an evaluation value by weighting and calculating the number of cases of the third event, and ranking the threshold values stored in the threshold storage unit based on the obtained evaluation values. Video director described in 5 Apparatus.
  7.  前記学習部は、前記監視項目の閾値を個別に変化させながら、前記第1事象の件数と、前記第2事象の件数と、前記第3事象の件数をカウントすることを特徴とする請求項6に記載の映像監視装置。 The learning unit is characterized by counting the number of first events, the number of second events, and the number of third events while individually changing the threshold value of the monitoring item. The video surveillance device described in.
  8.  前記学習部は、M個の映像信号のうちm番目の映像信号に対して、時刻t1~t2の間に、アラームを発報すべきエラーに対して前記アラームが発報された第1事象の件数をT(m)とし、アラームを発報すべきでないエラーに対して前記アラームが発報された第2事象の件数をF(m)とし、アラームを発報すべきエラーに対して前記アラームが発報されなかった第3事象の件数をH(m)としたときに、以下の評価関数に従って評価値A(m)を求めることを特徴とする請求項6又は7に記載の映像監視装置。
    Figure JPOXMLDOC01-appb-I000001
    但し、
    Wt(m)は前記第1事象の重み付け
    Wf(m)は前記第2事象の重み付け
    Wh(m)は前記第3事象の重み付け
    The learning unit is configured to generate an alarm for the m-th video signal of the M video signals during a period from time t1 to time t2. Let T (m) be the number, and let F (m) be the number of second events for which an alarm should have been issued for errors that should not have triggered the alarm. The image monitoring apparatus according to claim 6 or 7, wherein an evaluation value A (m) is determined according to the following evaluation function, where H (m) is the number of third events for which no event has been reported. .
    Figure JPOXMLDOC01-appb-I000001
    However,
    Wt (m) is the weight of the first event Wf (m) is the weight of the second event Wh (m) is the weight of the third event
  9.  前記更新部は、前記優先順位に従って前記監視項目毎に前記閾値をリストアップすることを特徴とする請求項6~8のいずれかに記載の映像監視装置。 The video monitoring apparatus according to any one of claims 6 to 8, wherein the updating unit lists up the threshold for each of the monitoring items in accordance with the priority.
  10.  前記映像記憶部は、入力された映像信号を、所定のタイミングで所定の長さで切り出して記憶することを特徴とする請求項5~9のいずれかに記載の映像監視装置。 The video monitoring apparatus according to any one of claims 5 to 9, wherein the video storage unit cuts out and stores the input video signal at a predetermined timing at a predetermined length.
  11.  前記学習部の学習結果を表示すると共に、前記エラーの発生前後の映像信号に基づいて映像及び/又は音声を再生する表示部を有することを特徴とする請求項5~10のいずれかに記載の映像監視装置。 The display device according to any one of claims 5 to 10, further comprising a display unit for displaying the learning result of the learning unit and reproducing video and / or audio based on the video signal before and after the occurrence of the error. Video surveillance device.
PCT/JP2017/024350 2017-07-03 2017-07-03 Video monitoring device adjusting method and video monitoring device WO2019008635A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
CN201780089397.8A CN110870305B (en) 2017-07-03 2017-07-03 Method for adjusting video monitoring device and video monitoring device
JP2019528202A JP7033797B2 (en) 2017-07-03 2017-07-03 How to adjust the video monitoring device and the video monitoring device
PCT/JP2017/024350 WO2019008635A1 (en) 2017-07-03 2017-07-03 Video monitoring device adjusting method and video monitoring device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2017/024350 WO2019008635A1 (en) 2017-07-03 2017-07-03 Video monitoring device adjusting method and video monitoring device

Publications (1)

Publication Number Publication Date
WO2019008635A1 true WO2019008635A1 (en) 2019-01-10

Family

ID=64950684

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2017/024350 WO2019008635A1 (en) 2017-07-03 2017-07-03 Video monitoring device adjusting method and video monitoring device

Country Status (3)

Country Link
JP (1) JP7033797B2 (en)
CN (1) CN110870305B (en)
WO (1) WO2019008635A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2021034841A (en) * 2019-08-22 2021-03-01 日本電気株式会社 Monitoring condition setting device, system, method, and program

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0946733A (en) * 1995-07-25 1997-02-14 Nippon Hoso Kyokai <Nhk> Automatic video monitoring device
JP2001269770A (en) * 2000-03-24 2001-10-02 Kawasaki Steel Corp Method for automatically detecting abnormality in molten metal treating facility
JP2006060335A (en) * 2004-08-18 2006-03-02 Sharp Corp Image processing apparatus
JP2006101296A (en) * 2004-09-30 2006-04-13 Nec Personal Products Co Ltd Video data correcting device and method, video output device and method, reproducing device and method, program, and recording medium
JP2006186787A (en) * 2004-12-28 2006-07-13 Fujitsu Ten Ltd Digital data receiver
JP2008046346A (en) * 2006-08-15 2008-02-28 Sony Corp Electric power consumption reduction apparatus, display device, image processor, electric power consumption reduction method, and computer program
JP2011109183A (en) * 2009-11-12 2011-06-02 Toshiba Corp Reception-state presenting device
JP2011228975A (en) * 2010-04-21 2011-11-10 Nikon Systems Inc Video noise detection device

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101765025A (en) * 2008-12-23 2010-06-30 北京中星微电子有限公司 System for abnormal detection of surveillance camera and method thereof
CN102131101A (en) * 2011-04-21 2011-07-20 江苏东怡软件技术有限公司 Intelligent video image quality automatic analysis system and method
CN102395043B (en) * 2011-11-11 2013-09-11 北京声迅电子股份有限公司 Video quality diagnosing method
CN102421008A (en) * 2011-12-07 2012-04-18 浙江捷尚视觉科技有限公司 Intelligent video quality detecting system
CN103873852A (en) * 2012-12-11 2014-06-18 上海文广互动电视有限公司 Multi-mode parallel video quality fault detection method and device

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0946733A (en) * 1995-07-25 1997-02-14 Nippon Hoso Kyokai <Nhk> Automatic video monitoring device
JP2001269770A (en) * 2000-03-24 2001-10-02 Kawasaki Steel Corp Method for automatically detecting abnormality in molten metal treating facility
JP2006060335A (en) * 2004-08-18 2006-03-02 Sharp Corp Image processing apparatus
JP2006101296A (en) * 2004-09-30 2006-04-13 Nec Personal Products Co Ltd Video data correcting device and method, video output device and method, reproducing device and method, program, and recording medium
JP2006186787A (en) * 2004-12-28 2006-07-13 Fujitsu Ten Ltd Digital data receiver
JP2008046346A (en) * 2006-08-15 2008-02-28 Sony Corp Electric power consumption reduction apparatus, display device, image processor, electric power consumption reduction method, and computer program
JP2011109183A (en) * 2009-11-12 2011-06-02 Toshiba Corp Reception-state presenting device
JP2011228975A (en) * 2010-04-21 2011-11-10 Nikon Systems Inc Video noise detection device

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2021034841A (en) * 2019-08-22 2021-03-01 日本電気株式会社 Monitoring condition setting device, system, method, and program

Also Published As

Publication number Publication date
JP7033797B2 (en) 2022-03-11
CN110870305A (en) 2020-03-06
JPWO2019008635A1 (en) 2020-04-30
CN110870305B (en) 2021-06-15

Similar Documents

Publication Publication Date Title
EP3511862B1 (en) System and method for dynamically ordering video channels according to rank of abnormal detection
Series Methodology for the subjective assessment of the quality of television pictures
US7509662B2 (en) Method and apparatus for generation of a preferred broadcasted programs list
US10114868B2 (en) Electronic data generation methods
US20210142363A1 (en) Managing content delivery via audio cues
US20120170847A1 (en) Object mapping device, method of mapping object, program and recording medium
US8737745B2 (en) Scene-based people metering for audience measurement
JP7048825B2 (en) Monitoring loudness levels during media replacement events with shorter time constants
US20170132525A1 (en) Method and system using machine learning techniques for checking data integrity in a data warehouse feed
CN110225367A (en) It has been shown that, recognition methods and the device of object information in a kind of video
US20210225409A1 (en) Methods and systems for detection of anomalous motion in a video stream and for creating a video summary
CN106028147A (en) Video signal monitoring method and video signal monitoring system
Series Methodology for the subjective assessment of the quality of television pictures
CN110049373A (en) Set-top box Caton detection method, system and storage medium
US20110167445A1 (en) Audiovisual content channelization system
US20230336834A1 (en) Systems and methods for aggregating related media content based on tagged content
US20210014562A1 (en) Methods and apparatus to detect boring media
US20110033115A1 (en) Method of detecting feature images
JP6388894B2 (en) System and method for compressed display of long video sequences
WO2019008635A1 (en) Video monitoring device adjusting method and video monitoring device
US20100054702A1 (en) Information processing device, information processing method, and program
US20080316360A1 (en) Information signal processing apparatus, method of creating database, method of processing information signal, and program for implementing method of processing information signal
US11527091B2 (en) Analyzing apparatus, control method, and program
JP2019220825A (en) View recording analyzer, view recording analytical method, and view recording analytic program
US20210279231A1 (en) Systems and methods for improving accuracy of device maps using media viewing data

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17916469

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2019528202

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17916469

Country of ref document: EP

Kind code of ref document: A1