CN111147874A - Method and system for program list accurate to second based on image recognition - Google Patents


Info

Publication number
CN111147874A
Authority
CN
China
Prior art keywords
program
image recognition
video
accuracy
tail
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911361129.4A
Other languages
Chinese (zh)
Inventor
范洪涛
马巨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Mianyang Huan Technology Co ltd
Original Assignee
Mianyang Huan Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mianyang Huan Technology Co ltd
Priority to CN201911361129.4A
Publication of CN111147874A
Legal status: Pending (current)


Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N 21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N 21/21 Server components or server architectures
    • H04N 21/218 Source of audio or video content, e.g. local disk arrays
    • H04N 21/2187 Live feed
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N 21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N 21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N 21/44 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N 21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N 21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N 21/44 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H04N 21/44008 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Image Analysis (AREA)

Abstract

The invention relates to the technical field of digital television and discloses a method, based on image recognition, for making a program list accurate to the second. The method comprises: recording the program video from 5 minutes before to 5 minutes after the program starts playing and splitting the video into frames; manually marking the start time and name to form program head and tail samples; training the marked samples with a deep learning method to form a program head/tail feature library; and then capturing 1-2 pictures of the live video being played and comparing them with the feature library. If recognition succeeds, the start time in the initial program list is corrected; if recognition fails, the pictures are output for manual review, the program head/tail samples are supplemented, and the start time and name are marked again to form new head/tail samples. The invention can correct the program list times to the second, which improves both the accuracy of the program list and the user experience.

Description

Method and system for program list accurate to second based on image recognition
Technical Field
The invention relates to the technical field of digital television, and in particular to a method and system, based on image recognition, for making a program list accurate to the second.
Background
The television station provides a program list, and the operator produces the EPG (electronic program guide) according to the program list provided by the television station. However, the times in that program list are often inaccurate: the time shown in the program list does not match the actual broadcast time of the television program.
Therefore, to solve the above problem, a method and system for making the program list accurate to the second based on image recognition is urgently needed.
Disclosure of Invention
To address the above problems, the present invention provides a method and system, based on image recognition, for making a program list accurate to the second. The invention can correct the program list times to the second, which improves both the accuracy of the program list and the user experience.
To achieve this purpose, the invention adopts the following technical scheme:
A method, based on image recognition, for making a program list accurate to the second, comprising the following steps:
Step one: preparation stage: record the program video from 5 minutes before to 5 minutes after the program starts playing, and split the video into frames (see the sketch after step four).
Step two: manually mark the start time and name to form program head and tail samples.
Step three: train the marked samples with a deep learning method to form a program head/tail feature library.
Step four: recognition: capture 1-2 pictures of the live video being played and compare them with the feature library; if recognition succeeds, modify the start time in the initial program list; if recognition fails, output the pictures, review them manually to supplement the program head/tail samples, and return to step two and step three.
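By way of illustration only and not as part of the original disclosure, the frame splitting in step one could be implemented roughly as in the following Python sketch; OpenCV, the file path and the twelve-frames-per-second sampling rate (the default mentioned later in the description) are assumptions.

```python
import cv2  # OpenCV, assumed to be available


def split_recording_into_frames(video_path, frames_per_second=12):
    """Split a recorded program video into frames sampled at roughly
    `frames_per_second` (12 fps is the default stated in the description),
    returning (offset_in_seconds, frame) pairs for manual annotation."""
    capture = cv2.VideoCapture(video_path)
    native_fps = capture.get(cv2.CAP_PROP_FPS) or float(frames_per_second)
    step = max(int(round(native_fps / frames_per_second)), 1)

    samples = []
    index = 0
    while True:
        ok, frame = capture.read()
        if not ok:
            break
        if index % step == 0:
            samples.append((index / native_fps, frame))  # offset from recording start
        index += 1
    capture.release()
    return samples


# Hypothetical usage: split a recording made around a scheduled start time.
# samples = split_recording_into_frames("recordings/channel1_20191225_1955.ts")
```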
A system for making the program list accurate to the second based on image recognition comprises a program recording system, a video splitting system, a training system and a program recognition system.
Further, the program recording system automatically records the live program from 5 minutes before to 5 minutes after the scheduled start, according to the times in the initial program list provided by the television station.
Further, the video splitting system splits the recorded minute-level program video frame by frame, and the program start time and program name are marked manually.
Further, the training system obtains a large amount of program picture information, trains and optimizes the accuracy of the feature library, and stores the feature library after training is completed.
Further, the program recognition system captures the live picture to be recognized and compares it with the feature library to identify the exact time of the program.
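For illustration only, the four subsystems described above might be organised as in the following Python skeleton; all class and method names are assumptions and not part of the original disclosure.

```python
class ProgramRecordingSystem:
    """Records the live channel from 5 minutes before to 5 minutes after
    the start time given in the initial program list."""
    def record(self, channel, scheduled_start):
        ...  # return the path of the recording made around scheduled_start


class VideoSplittingSystem:
    """Splits a recording into frames so an operator can mark the exact
    start time and program name (head/tail samples)."""
    def split_and_annotate(self, video_path):
        ...  # return annotated program head/tail samples


class TrainingSystem:
    """Trains a deep learning model on head/tail samples and stores the
    resulting feature library."""
    def train(self, samples):
        ...  # return and persist the feature library


class ProgramRecognitionSystem:
    """Captures pictures from the live stream and compares them with the
    feature library to identify the second-accurate program start time."""
    def recognize(self, live_frames, feature_library):
        ...  # return the matched program and corrected start time, or None
```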
The working principle is as follows: the recorded program videos are split into frames, and after manual annotation they form program head and tail samples; a deep learning method trains these samples into a program head/tail feature library, whose accuracy is optimized during training before the library is stored. When a live video is to be recognized, a video picture is captured and compared with the feature library; during the comparison, either the start time of the initial program list is corrected or a new program head/tail sample is added. Because the program video is split into frames, the program list time can be corrected to the second during the comparison, which improves the accuracy of the program list. After newly added head/tail samples are trained with the deep learning method, the features in the head/tail feature library grow, which makes comparison easier and allows the program list time to be corrected quickly.
The invention has the following beneficial effects:
(1) In the method, recorded videos are split into frames, and the start time and name are marked manually to form program head and tail samples; the marked samples are trained with deep learning to form a program head/tail feature library, and pictures captured from the live video are compared with the feature library to identify the exact time of the program, accurate to the second. This improves both the accuracy of the program list and the user experience.
(2) In the invention, the recorded video is split into frames, with a default of twelve frames per second, so one frame corresponds to one twelfth of a second. When a picture captured from the live video is compared with the feature library, the program list can therefore reach second-level accuracy, so a user watching a replay can go directly to the desired television program, which improves the user experience.
(3) By continuously supplementing and training program head and tail samples, the invention achieves self-learning and self-improvement of the feature library, and the recognition rate of program heads and tails can be continuously improved.
(4) In the invention, after the program recognition system successfully recognizes the live picture, the initial program list is corrected automatically, and the adjusted program list is accurate to the second, which improves both the accuracy of the program list and the user experience.
(5) In the invention, when the program recognition system fails to recognize the live picture, the picture is output and, after manual review, is added to the feature library, so that the feature library is continuously improved.
Drawings
FIG. 1 is a system block diagram of the present invention;
FIG. 2 is a flow chart for identifying program listings;
FIG. 3 is a schematic diagram of the deep learning method.
Detailed Description
For a better understanding of the present invention by those skilled in the art, the present invention will be described in further detail below with reference to the accompanying drawings and the following examples.
Example 1:
Referring to FIG. 1 to FIG. 3, a method for making a program list accurate to the second based on image recognition comprises the following steps:
Step one: preparation stage: record the program video from 5 minutes before to 5 minutes after the program starts playing, and split the video into frames.
Step two: manually mark the start time and name to form program head and tail samples.
Step three: train the marked samples with a deep learning method to form a program head/tail feature library.
Step four: recognition: capture 1-2 pictures of the live video being played and compare them with the feature library; if recognition succeeds, modify the start time in the initial program list; if recognition fails, output the pictures, review them manually to supplement the program head/tail samples, and return to step two and step three (a sketch of this recognition flow follows these steps).
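A minimal Python sketch of the recognition flow in step four is given below for illustration; it is not part of the original disclosure, and `live_stream.capture`, `feature_library.match`, `epg.set_start_time` and `review_queue.put` are assumed interfaces.

```python
def correct_program_list(live_stream, feature_library, epg, review_queue,
                         now, frames_to_capture=2):
    """Capture 1-2 pictures from the live stream, compare them with the
    head/tail feature library, and either correct the start time in the
    program list or queue the pictures for manual review (so that steps
    two and three can be rerun with new samples)."""
    frames = live_stream.capture(frames_to_capture)   # assumed interface
    for frame in frames:
        match = feature_library.match(frame)          # assumed interface
        if match is not None:
            # Recognition succeeded: the captured picture is a known
            # program head, so the program starts "now".
            epg.set_start_time(match.program_name, now)
            return True
    # Recognition failed: output the pictures for manual audit; after an
    # operator marks them they become new head/tail samples for retraining.
    review_queue.put(frames)
    return False
```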
A system for making the program list accurate to the second based on image recognition comprises a program recording system, a video splitting system, a training system and a program recognition system.
Further, the program recording system automatically records the live program from 5 minutes before to 5 minutes after the scheduled start, according to the times in the initial program list provided by the television station.
Further, the video splitting system splits the recorded minute-level program video frame by frame, and the program start time and program name are marked manually.
Further, the training system obtains a large amount of program picture information, trains and optimizes the accuracy of the feature library, and stores the feature library after training is completed.
Further, the program recognition system captures the live picture to be recognized and compares it with the feature library to identify the exact time of the program.
In this embodiment, the program video is recorded from 5 minutes before to 5 minutes after the program starts to play and is split into frames; the start time and name are marked manually to form program head and tail samples, and a deep learning method trains the marked samples into a program head/tail feature library. Then 1-2 pictures of the live video being played are captured and compared with the feature library. If recognition succeeds, the start time in the initial program list is corrected; if recognition fails, the pictures are output, reviewed manually and added as supplementary head/tail samples, the start time and name are marked again, and the deep learning method is retrained to update the feature library. Because the program video is split into frames, the program list time can be corrected to the second during the comparison, which improves the accuracy of the program list. After the newly added head/tail samples are trained with the deep learning method, the features of the head/tail feature library are continuously supplemented, realizing self-learning and self-improvement of the feature library; the recognition rate of program heads and tails can be continuously improved, so the program list time is corrected quickly.
Specifically, the recorded video is split into frames, with a default of twelve frames per second, so one frame corresponds to one twelfth of a second. When a picture captured from the live video is compared with the feature library, the program list can therefore reach second-level accuracy, which improves the accuracy of the program list and allows a user watching a replay to go directly to the desired television program, improving the user experience.
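As a small illustrative calculation (assuming only the twelve-frames-per-second default stated above), the mapping from a sampled frame index to a second-accurate timestamp is simply:

```python
FRAMES_PER_SECOND = 12  # default stated in the description: one frame = 1/12 s


def frame_index_to_seconds(frame_index, frames_per_second=FRAMES_PER_SECOND):
    """Offset in seconds, from the start of the recording, of a sampled frame."""
    return frame_index / frames_per_second


def matched_start_time(recording_start_epoch, frame_index):
    """If the frame at `frame_index` is recognized as the program head,
    return the corrected, second-accurate start time of the program."""
    return round(recording_start_epoch + frame_index_to_seconds(frame_index))


# Example: frame 66 of a recording that began at epoch 1577271300
# lies 66 / 12 = 5.5 s in, so the corrected start time rounds to 1577271306.
```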
Specifically, when a live video is recognized, a video picture is captured and compared with the feature library; during the comparison, either the start time of the initial program list is corrected or a new program head/tail sample is added. After newly added head/tail samples are trained by the deep learning method, the features in the program head/tail feature library grow, which makes comparison easier and allows the program list time to be corrected quickly.
Specifically, the deep learning method is as follows:
Referring to FIG. 3, an input image is convolved with three trainable filters and an additive bias; after convolution, three feature maps are produced at layer C1. In each feature map, every group of four pixels is then summed, weighted and biased, and three feature maps at layer S2 are obtained through a Sigmoid function. These maps are filtered again to obtain layer C3, and, as with S2, this hierarchy then produces S4. Finally, the pixel values are rasterized and concatenated into a vector, which is fed into a conventional neural network to produce the output.
Specifically, the C layers are feature extraction layers: the input of each neuron is connected to a local receptive field of the previous layer to extract a local feature, and once a local feature is extracted, its positional relationship to the other features is also determined. The S layers are feature mapping layers: each computational layer of the network consists of several feature maps, each feature map is a plane, and all neurons in a plane share equal weights. The feature mapping structure uses the sigmoid function, whose influence-function kernel is small, as the activation function of the convolutional network, which gives the feature maps shift invariance.
Specifically, because the neurons in one mapping plane share weights, the number of free parameters of the network is reduced, which reduces the complexity of selecting the network parameters. Each feature extraction layer in the convolutional neural network is followed by a feature mapping layer used for local averaging and secondary extraction; this two-stage feature extraction structure gives the network high tolerance to distortion of the input samples during recognition.
Specifically, Z represents a layer of convolution operation, and NN represents the result of this convolution operation.
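By way of illustration only, the C1-S2-C3-S4 structure of FIG. 3 could be sketched in PyTorch as follows; the input size (32x32 grayscale), the kernel sizes, the number of C3 filters and the class count are assumptions, while the three trainable filters with bias at C1 and the sum-weight-bias-Sigmoid subsampling at the S layers follow the description above.

```python
import torch
import torch.nn as nn


class SubsamplingLayer(nn.Module):
    """S layer of FIG. 3: each 2x2 group of pixels is summed, multiplied by a
    trainable weight, offset by a trainable bias, and passed through Sigmoid."""
    def __init__(self, channels):
        super().__init__()
        self.weight = nn.Parameter(torch.ones(1, channels, 1, 1))
        self.bias = nn.Parameter(torch.zeros(1, channels, 1, 1))

    def forward(self, x):
        summed = 4.0 * nn.functional.avg_pool2d(x, kernel_size=2)  # sum of each 2x2 group
        return torch.sigmoid(self.weight * summed + self.bias)


class HeadTailFeatureNet(nn.Module):
    """Illustrative C1-S2-C3-S4 network followed by a conventional fully
    connected layer (assumed 32x32 grayscale input and num_classes outputs)."""
    def __init__(self, num_classes):
        super().__init__()
        self.c1 = nn.Conv2d(1, 3, kernel_size=5)     # three trainable filters plus bias
        self.s2 = SubsamplingLayer(3)
        self.c3 = nn.Conv2d(3, 6, kernel_size=5)     # channel count is an assumption
        self.s4 = SubsamplingLayer(6)
        self.fc = nn.Linear(6 * 5 * 5, num_classes)  # rasterized vector to output

    def forward(self, x):                  # x: (batch, 1, 32, 32)
        x = self.s2(self.c1(x))            # C1 then S2
        x = self.s4(self.c3(x))            # C3 then S4
        x = torch.flatten(x, start_dim=1)  # rasterize the feature maps into a vector
        return self.fc(x)
```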
The above is an embodiment of the present invention. The embodiments and the specific parameters therein are intended only to clearly illustrate the verification process of the invention, not to limit its scope of protection, which is defined by the claims; all equivalent structural changes made using the contents of the specification and drawings of the present invention likewise fall within the scope of protection of the present invention.

Claims (6)

1. A method for making a program list accurate to the second based on image recognition, comprising the following steps:
step one: preparation stage: recording the program video from 5 minutes before to 5 minutes after the program starts playing, and splitting the video into frames;
step two: manually marking the start time and name to form program head and tail samples;
step three: training the marked samples with a deep learning method to form a program head/tail feature library;
step four: recognition: capturing 1-2 pictures of the live video being played and comparing them with the feature library; if recognition succeeds, modifying the start time in the initial program list; if recognition fails, outputting the pictures, reviewing them manually to supplement the program head/tail samples, and returning to step two and step three.
2. A system for making the program list accurate to the second based on image recognition according to claim 1, wherein the system comprises a program recording system, a video splitting system, a training system and a program recognition system.
3. The method and system of claim 2 for making a program list accurate to the second based on image recognition, wherein the program recording system automatically records the live program from 5 minutes before to 5 minutes after the scheduled start, according to the times in the initial program list provided by the television station.
4. The method and system of claim 2 for making a program list accurate to the second based on image recognition, wherein the video splitting system splits the recorded minute-level program video frame by frame, and the program start time and program name are marked manually.
5. The method and system of claim 2 for making a program list accurate to the second based on image recognition, wherein the training system obtains a large amount of program picture information, trains and optimizes the accuracy of the feature library, and stores the feature library after training is completed.
6. The method and system of claim 2 for making a program list accurate to the second based on image recognition, wherein the program recognition system captures the live picture to be recognized and compares it with the feature library to identify the exact time of the program.
CN201911361129.4A 2019-12-25 2019-12-25 Method and system for program list accurate to second based on image recognition Pending CN111147874A (en)

Priority Applications (1)

Application Number | Priority Date | Filing Date | Title
CN201911361129.4A (published as CN111147874A) | 2019-12-25 | 2019-12-25 | Method and system for program list accurate to second based on image recognition

Applications Claiming Priority (1)

Application Number | Priority Date | Filing Date | Title
CN201911361129.4A (published as CN111147874A) | 2019-12-25 | 2019-12-25 | Method and system for program list accurate to second based on image recognition

Publications (1)

Publication Number | Publication Date
CN111147874A | 2020-05-12

Family

ID=70520251

Family Applications (1)

Application Number | Title | Priority Date | Filing Date
CN201911361129.4A | Method and system for program list accurate to second based on image recognition (CN111147874A, pending) | 2019-12-25 | 2019-12-25

Country Status (1)

Country Link
CN (1) CN111147874A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113301401A (en) * 2021-05-31 2021-08-24 深圳市茁壮网络股份有限公司 Method and device for generating electronic program list
CN116939197A (en) * 2023-09-15 2023-10-24 海看网络科技(山东)股份有限公司 Live program head broadcasting and replay content consistency monitoring method based on audio and video

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103152632A (en) * 2013-03-05 2013-06-12 天脉聚源(北京)传媒科技有限公司 Method and device for locating multimedia program
CN105451068A (en) * 2015-11-24 2016-03-30 华数传媒网络有限公司 Electronic program list generation method and device
CN108737691A (en) * 2018-08-21 2018-11-02 贵州广播电视台 A kind of the video broadcast system and broadcasting method of automatic emergency hand-off process


Similar Documents

Publication Publication Date Title
US8553941B2 (en) Methods, circuits, devices, apparatuses and systems for providing image composition rules, analysis and improvement
CN101621628B (en) Photographic apparatus, setting method of photography conditions, and recording medium
US20170256036A1 (en) Automatic microlens array artifact correction for light-field images
CN111178120B (en) Pest image detection method based on crop identification cascading technology
US8345119B2 (en) Image pickup device, image processing method, and storage medium storing program
CN111147874A (en) Method and system for program list accurate to second based on image recognition
US11508038B2 (en) Image processing method, storage medium, image processing apparatus, learned model manufacturing method, and image processing system
CN101246541A (en) Image-processing device and image-processing method, image-pickup device, and computer program
Bianco et al. Personalized image enhancement using neural spline color transforms
CN113052868A (en) Cutout model training and image cutout method and device
WO2019120025A1 (en) Photograph adjustment method and apparatus, storage medium and electronic device
CN114862698A (en) Method and device for correcting real overexposure image based on channel guidance
CN110554991A (en) Method for correcting and managing text picture
CN111222502B (en) Infrared small target image labeling method and system
KR102192016B1 (en) Method and Apparatus for Image Adjustment Based on Semantics-Aware
CN110060208B (en) Method for improving reconstruction performance of super-resolution algorithm
CN110365897A (en) Image correcting method and device, electronic equipment, computer readable storage medium
Cao et al. Physics-guided iso-dependent sensor noise modeling for extreme low-light photography
KR102066412B1 (en) Apparatus and method for acquiring foreground image
CN116614714A (en) Real exposure correction method and system guided by perception characteristics of camera
CN101170648A (en) Method and apparatus for taking a moving picture
CN116051681A (en) Processing method and system for generating image data based on intelligent watch
CN110648404B (en) Interactive learning method and system thereof
CN108848366B (en) Information acquisition device and method based on 3D camera
CN112637573A (en) Multi-lens switching display method and system, intelligent terminal and storage medium

Legal Events

PB01: Publication
SE01: Entry into force of request for substantive examination
RJ01: Rejection of invention patent application after publication (application publication date: 20200512)