CN115019235B - Scene division and content detection method and system - Google Patents
- Publication number
- CN115019235B (application CN202210685018.4A)
- Authority
- CN
- China
- Prior art keywords
- features
- semantic
- multimedia data
- vector matrix
- content detection
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/41—Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/762—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using clustering, e.g. of similar faces in social networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/46—Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Abstract
The invention provides a scene division and content detection method and system. Multiple kinds of features are extracted from multimedia data to generate a first vector matrix; the first vector matrix is input into a state chain model to obtain an explicit feature distribution area, from which the semantic feature set of the required implicit feature distribution area is further determined; the first vector matrix and the semantic feature set are then input into a calculation function, the probability density parameters of the state chain model are introduced, and the dividing lines between different scene divisions are calculated, so that accurate per-segment content detection is achieved.
Description
Technical Field
The present application relates to the field of network multimedia, and in particular, to a method and system for scene division and content detection.
Background
Today's networks carry a large amount of scene information and very rich video data, and a single video often splices together several entirely different scenes. Detecting whether the video content is legal requires invoking a different detection algorithm for each scene, which places a heavy burden on the processing pipeline and increases the amount of computation. Meanwhile, accurately locating the boundary lines between different scenes is also key to improving detection accuracy.
There is therefore an urgent need for a targeted scene division and content detection method and system.
Disclosure of Invention
The invention aims to provide a method and a system for scene division and content detection. Multiple kinds of features are extracted from multimedia data to generate a first vector matrix; the first vector matrix is input into a state chain model to obtain an explicit feature distribution area, from which the semantic feature set of the required implicit feature distribution area is further determined; the first vector matrix and the semantic feature set are then input into a calculation function, the probability density parameters of the state chain model are introduced, and the dividing lines between different scene divisions are calculated, so that accurate per-segment content detection is achieved.
In a first aspect, the present application provides a method of scene division and content detection, the method comprising:
receiving multimedia data sent by an acquisition terminal, extracting visual features, sound features and text features from the multimedia data, and generating a first vector matrix according to preset rules by the visual features, the sound features and the text features;
inputting the first vector matrix into a state chain model, determining an explicit feature distribution area corresponding to the multimedia data according to a preset probability density function, obtaining a possible implicit feature distribution area, extracting a plurality of second vector matrices in the possible implicit feature distribution area, and decomposing the second vector matrices to obtain implicit features;
semantically analyzing the implicit features to obtain a plurality of candidate semantic features, calculating the correlation among the candidate semantic features, removing candidate semantic features whose correlation is lower than a threshold value, and determining a semantic feature set corresponding to the multimedia data;
inputting the first vector matrix and the semantic feature set into a calculation function, introducing probability density parameters of a state chain model to obtain a conditional probability formula from the second vector matrix to the first vector matrix, calculating the conditional probability formula through a neural network model, and calculating to obtain an optimal second vector matrix;
determining dividing lines of different scene divisions according to the distribution condition among the optimal second vector matrixes, dividing the multimedia data into different scene sections according to the dividing lines, and sequentially carrying out semantic analysis to obtain semantic tags corresponding to the different scene sections;
and invoking different content detection algorithms according to the semantic tags, and performing content detection on the scene segments corresponding to the semantic tags.
With reference to the first aspect, in a first possible implementation manner of the first aspect, the semantic analysis further includes a clustering operation, and scene segments of the same class are analyzed together.
With reference to the first aspect, in a second possible implementation manner of the first aspect, receiving the multimedia data stream sent by the acquisition terminal includes encoding and decoding the multimedia data stream.
With reference to the first aspect, in a third possible implementation manner of the first aspect, the semantic analysis uses a neural network model.
In a second aspect, the present application provides a system for scene division and content detection, the system comprising a processor and a memory:
the memory is used for storing program codes and transmitting the program codes to the processor;
the processor is configured to perform, according to instructions in the program code, the method according to the first aspect or any one of its possible implementation manners.
In a third aspect, the present application provides a computer readable storage medium for storing program code for performing the method of the first aspect or any one of its possible implementation manners.
Advantageous effects
The invention provides a scene division and content detection method and system. The required semantic feature set is determined through a state chain model and input, together with the first vector matrix, into a calculation function; the probability density parameters of the state chain model are introduced, and the dividing lines between different scene divisions are calculated. Accurate per-segment content detection can thus be achieved, with each scene segment invoking its own content detection algorithm, which improves detection precision and reduces the amount of computation.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings that are needed in the embodiments will be briefly described below, and it will be obvious to those skilled in the art that other drawings can be obtained from these drawings without inventive effort.
FIG. 1 is a flow chart of the method of the present invention.
Detailed Description
The preferred embodiments of the present invention are described in detail below with reference to the accompanying drawings, so that the advantages and features of the present invention can be more easily understood by those skilled in the art and the scope of the present invention is thereby clearly defined.
Fig. 1 is a flowchart of a method for scene division and content detection provided in the present application, including:
receiving multimedia data sent by an acquisition terminal, extracting visual features, sound features and text features from the multimedia data, and generating a first vector matrix according to preset rules by the visual features, the sound features and the text features;
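The patent does not disclose its "preset rules" for combining the modalities; as an illustrative sketch only, the visual, sound, and text features could be padded to a common width and concatenated per time step, with all names and dimensions below being hypothetical:

```python
import numpy as np

def build_first_vector_matrix(visual, sound, text):
    """Stack per-time-step visual, sound, and text feature vectors into one
    matrix (one row per time step). A hypothetical 'preset rule': zero-pad
    each modality to a common width, then concatenate along the feature axis."""
    dim = max(visual.shape[1], sound.shape[1], text.shape[1])
    padded = [np.pad(m, ((0, 0), (0, dim - m.shape[1])))
              for m in (visual, sound, text)]
    return np.concatenate(padded, axis=1)

# Toy example: 4 time steps, modality dims 5 / 3 / 2.
visual = np.random.rand(4, 5)
sound = np.random.rand(4, 3)
text = np.random.rand(4, 2)
first_matrix = build_first_vector_matrix(visual, sound, text)
print(first_matrix.shape)  # (4, 15)
```

Any alignment rule that yields one row per time step would serve here; zero-padding is simply the least committal choice.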
inputting the first vector matrix into a state chain model, determining an explicit feature distribution area corresponding to the multimedia data according to a preset probability density function, obtaining a possible implicit feature distribution area, extracting a plurality of second vector matrices in the possible implicit feature distribution area, and decomposing the second vector matrices to obtain implicit features;
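The state chain model and its probability density function are not specified in the patent. As a hedged stand-in, a single diagonal Gaussian fitted to the matrix rows can split time steps into a high-density "explicit" region and a low-density candidate "implicit" region:

```python
import numpy as np

def split_regions(first_matrix, threshold=None):
    """Score each row under a diagonal Gaussian fitted to all rows (a
    stand-in for the patent's probability density function), then label
    high-density rows as the explicit feature distribution area and the
    remaining rows as the possible implicit feature distribution area."""
    mu = first_matrix.mean(axis=0)
    var = first_matrix.var(axis=0) + 1e-8  # avoid division by zero
    log_density = -0.5 * (((first_matrix - mu) ** 2) / var
                          + np.log(2 * np.pi * var)).sum(axis=1)
    if threshold is None:
        threshold = np.median(log_density)
    explicit = np.where(log_density >= threshold)[0]
    implicit = np.where(log_density < threshold)[0]
    return explicit, implicit

# 6 "typical" rows and 2 outlier rows: the outliers fall in the implicit region.
rows = np.vstack([np.zeros((6, 3)), np.ones((2, 3)) * 5.0])
explicit_idx, implicit_idx = split_regions(rows)
print(len(explicit_idx), len(implicit_idx))  # 6 2
```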
semantically analyzing the implicit features to obtain a plurality of candidate semantic features, calculating the correlation among the candidate semantic features, removing candidate semantic features whose correlation is lower than a threshold value, and determining a semantic feature set corresponding to the multimedia data;
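One plausible reading of the correlation-pruning step, sketched with mean pairwise cosine similarity (the patent's actual correlation measure and threshold are not disclosed):

```python
import numpy as np

def filter_by_correlation(candidates, threshold=0.4):
    """Keep a candidate semantic feature only if its mean cosine similarity
    to the other candidates reaches the threshold; low-correlation outliers
    are removed from the semantic feature set."""
    normed = candidates / np.linalg.norm(candidates, axis=1, keepdims=True)
    sim = normed @ normed.T
    n = len(candidates)
    # Mean similarity to the *other* candidates (exclude self-similarity of 1).
    mean_sim = (sim.sum(axis=1) - 1.0) / (n - 1)
    return [i for i in range(n) if mean_sim[i] >= threshold]

# Two mutually similar features and one orthogonal outlier.
features = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]])
kept = filter_by_correlation(features)
print(kept)  # [0, 1]
```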
inputting the first vector matrix and the semantic feature set into a calculation function, introducing probability density parameters of a state chain model to obtain a conditional probability formula from the second vector matrix to the first vector matrix, calculating the conditional probability formula through a neural network model, and calculating to obtain an optimal second vector matrix;
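The conditional probability from the second vector matrix to the first is not given in closed form in the patent; the sketch below substitutes a simple Gaussian likelihood for the neural-network scoring and picks the highest-scoring candidate as the "optimal" second matrix:

```python
import numpy as np

def best_candidate(first_matrix, candidates, sigma=1.0):
    """Score each candidate second matrix by a Gaussian conditional
    likelihood p(first | candidate) ∝ exp(-||first - candidate||² / 2σ²)
    and return the index of the highest-scoring candidate. σ plays the
    role of the state chain model's probability density parameter."""
    scores = [np.exp(-np.sum((first_matrix - c) ** 2) / (2 * sigma ** 2))
              for c in candidates]
    return int(np.argmax(scores))

first = np.ones((2, 3))
cands = [np.zeros((2, 3)), np.ones((2, 3)) * 0.9, np.ones((2, 3)) * 2.0]
print(best_candidate(first, cands))  # 1 — the candidate closest to `first`
```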
determining dividing lines of different scene divisions according to the distribution condition among the optimal second vector matrixes, dividing the multimedia data into different scene sections according to the dividing lines, and sequentially carrying out semantic analysis to obtain semantic tags corresponding to the different scene sections;
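Reading dividing lines off the distribution of the optimal second vector matrices could, in the simplest case, reduce to thresholding the jump between consecutive per-frame vectors; this is an assumption for illustration, not the patented procedure:

```python
import numpy as np

def dividing_lines(frame_vectors, jump=1.0):
    """Place a scene boundary wherever the Euclidean distance between
    consecutive frame vectors exceeds `jump`. Returns the indices of the
    first frame of each new scene segment."""
    dists = np.linalg.norm(np.diff(frame_vectors, axis=0), axis=1)
    return [i + 1 for i, d in enumerate(dists) if d > jump]

# Three flat runs separated by two large jumps.
vecs = np.array([[0.0], [0.1], [5.0], [5.1], [9.0]])
print(dividing_lines(vecs))  # [2, 4]
```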
and invoking different content detection algorithms according to the semantic tags, and performing content detection on the scene segments corresponding to the semantic tags.
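The tag-driven selection of detection algorithms is naturally a lookup table: each semantic tag maps to one detector, so only one algorithm runs per scene segment instead of all of them. The detector names and tags below are placeholders, not algorithms named by the patent:

```python
def detect_violence(segment):
    """Placeholder detector for 'action'-tagged segments."""
    return "violence-checked"

def detect_text_spam(segment):
    """Placeholder detector for 'dialogue'-tagged segments."""
    return "spam-checked"

# Tag-to-detector dispatch table (hypothetical tags).
DETECTORS = {"action": detect_violence, "dialogue": detect_text_spam}

def detect(segment, tag, default=lambda s: "skipped"):
    """Invoke only the detector registered for this segment's semantic tag."""
    return DETECTORS.get(tag, default)(segment)

print(detect("clip-A", "action"))   # violence-checked
print(detect("clip-B", "unknown"))  # skipped
```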
In some preferred embodiments, the semantic analysis further includes a clustering operation that analyzes scene segments of the same class together.
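Analyzing same-class scene segments together can be sketched as grouping segments by cluster label and invoking the analyzer once per group; the labels and analyzer here are hypothetical:

```python
from collections import defaultdict

def analyze_by_class(segments, labels, analyzer):
    """Group scene segments by cluster label, then run the analyzer once
    per group, so same-class segments are processed in one batch rather
    than one by one."""
    groups = defaultdict(list)
    for seg, lab in zip(segments, labels):
        groups[lab].append(seg)
    return {lab: analyzer(group) for lab, group in groups.items()}

# Toy analyzer: just count segments per class.
result = analyze_by_class(["s1", "s2", "s3"], ["news", "sport", "news"], len)
print(result)  # {'news': 2, 'sport': 1}
```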
In some preferred embodiments, the receiving the multimedia data stream sent by the acquisition terminal includes encoding and decoding the multimedia data stream.
In some preferred embodiments, the semantic analysis employs a neural network model.
The application provides a system for scene division and content detection, the system comprising a processor and a memory:
the memory is used for storing program codes and transmitting the program codes to the processor;
the processor is configured to perform the method according to any of the embodiments of the first aspect according to instructions in the program code.
The present application provides a computer readable storage medium for storing program code for performing the method of any one of the embodiments of the first aspect.
In a specific implementation, the present invention also provides a computer storage medium, where the computer storage medium may store a program, and the program, when executed, may perform some or all of the steps of the various embodiments of the present invention. The storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (RAM).
It will be apparent to those skilled in the art that the techniques of the embodiments of the present invention may be implemented by software plus a necessary general-purpose hardware platform. Based on this understanding, the technical solutions of the embodiments of the present invention, in essence or in the part contributing to the prior art, may be embodied in the form of a software product. The software product may be stored in a storage medium such as a ROM/RAM, a magnetic disk, or an optical disk, and includes several instructions for causing a computer device (which may be a personal computer, a server, a network device, or the like) to execute the methods described in all or some of the embodiments of the present invention.
Identical or similar parts of the various embodiments in this description may be cross-referenced. In particular, since the system embodiments are substantially similar to the method embodiments, their description is relatively brief; for relevant details, refer to the description of the method embodiments.
The embodiments of the present invention described above do not limit the scope of the present invention.
Claims (6)
1. A method of scene division and content detection, the method comprising:
receiving multimedia data sent by an acquisition terminal, extracting visual features, sound features and text features from the multimedia data, and generating a first vector matrix according to preset rules by the visual features, the sound features and the text features;
inputting the first vector matrix into a state chain model, determining an explicit feature distribution area corresponding to the multimedia data according to a preset probability density function, obtaining a possible implicit feature distribution area, extracting a plurality of second vector matrices in the possible implicit feature distribution area, and decomposing the second vector matrices to obtain implicit features;
semantically analyzing the implicit features to obtain a plurality of candidate semantic features, calculating the correlation among the candidate semantic features, removing candidate semantic features whose correlation is lower than a threshold value, and determining a semantic feature set corresponding to the multimedia data;
inputting the first vector matrix and the semantic feature set into a calculation function, introducing probability density parameters of a state chain model to obtain a conditional probability formula from the second vector matrix to the first vector matrix, calculating the conditional probability formula through a neural network model, and calculating to obtain an optimal second vector matrix;
determining dividing lines of different scene divisions according to the distribution condition among the optimal second vector matrixes, dividing the multimedia data into different scene sections according to the dividing lines, and sequentially carrying out semantic analysis to obtain semantic tags corresponding to the different scene sections;
and invoking different content detection algorithms according to the semantic tags, and performing content detection on the scene segments corresponding to the semantic tags.
2. The method according to claim 1, characterized in that: the semantic analysis further comprises a clustering operation, and scene segments of the same class are analyzed together.
3. The method according to claim 2, characterized in that: the receiving the multimedia data stream sent by the acquisition terminal comprises encoding and decoding the multimedia data stream.
4. A method according to claim 3, characterized in that: the semantic analysis adopts a neural network model.
5. A system for scene division and content detection, the system comprising a processor and a memory:
the memory is used for storing program codes and transmitting the program codes to the processor;
the processor is configured to perform the method according to any of the claims 1-4 according to instructions in the program code.
6. A computer readable storage medium, characterized in that the computer readable storage medium is used for storing program code for performing the method of any one of claims 1-4.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210685018.4A CN115019235B (en) | 2022-06-15 | 2022-06-15 | Scene division and content detection method and system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN115019235A (en) | 2022-09-06 |
CN115019235B (en) | 2023-06-27 |
Family
ID=83075176
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210685018.4A Active CN115019235B (en) | 2022-06-15 | 2022-06-15 | Scene division and content detection method and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115019235B (en) |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111598710A (en) * | 2020-05-11 | 2020-08-28 | 北京邮电大学 | Method and device for detecting social network events |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6185534B1 (en) * | 1998-03-23 | 2001-02-06 | Microsoft Corporation | Modeling emotion and personality in a computer user interface |
US7382933B2 (en) * | 2005-08-24 | 2008-06-03 | International Business Machines Corporation | System and method for semantic video segmentation based on joint audiovisual and text analysis |
CN109859741A (en) * | 2019-01-31 | 2019-06-07 | 成都终身成长科技有限公司 | Voice assessment method, device, electronic equipment and storage medium |
GB2581808B (en) * | 2019-02-26 | 2022-08-10 | Imperial College Innovations Ltd | Scene representation using image processing |
CN111241849A (en) * | 2020-01-21 | 2020-06-05 | 重庆理工大学 | Text semantic analysis method and system |
CN112488116B (en) * | 2020-11-27 | 2024-02-02 | 杭州电子科技大学 | Scene understanding semantic generation method based on multi-mode embedding |
CN114490926A (en) * | 2021-12-30 | 2022-05-13 | 特斯联科技集团有限公司 | Method and device for determining similar problems, storage medium and terminal |
Also Published As
Publication number | Publication date |
---|---|
CN115019235A (en) | 2022-09-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110781960B (en) | Training method, classification method, device and equipment of video classification model | |
CN110210038B (en) | Core entity determining method, system, server and computer readable medium thereof | |
CN112528637A (en) | Text processing model training method and device, computer equipment and storage medium | |
CN117409419A (en) | Image detection method, device and storage medium | |
CN112711944B (en) | Word segmentation method and system, and word segmentation device generation method and system | |
CN111723182B (en) | Key information extraction method and device for vulnerability text | |
CN115019235B (en) | Scene division and content detection method and system | |
CN115314268B (en) | Malicious encryption traffic detection method and system based on traffic fingerprint and behavior | |
CN115203206A (en) | Data content searching method and device, computer equipment and readable storage medium | |
CN115186647A (en) | Text similarity detection method and device, electronic equipment and storage medium | |
CN112287663B (en) | Text parsing method, equipment, terminal and storage medium | |
CN114302227A (en) | Method and system for collecting and analyzing network video based on container collection | |
CN114172705A (en) | Network big data analysis method and system based on pattern recognition | |
CN113420127A (en) | Threat information processing method, device, computing equipment and storage medium | |
CN115550684B (en) | Improved video content filtering method and system | |
CN115019234A (en) | Improved scene content detection method and system | |
CN113139187B (en) | Method and device for generating and detecting pre-training language model | |
CN114519357B (en) | Natural language processing method and system based on machine learning | |
CN116866211B (en) | Improved depth synthesis detection method and system | |
CN115526179A (en) | Semantic analysis and identification method and system based on weak supervision network | |
CN116431773A (en) | Dialogue flow extraction method and device, computer readable storage medium and terminal | |
CN114519828A (en) | Video detection method and system based on semantic analysis | |
CN112632229A (en) | Text clustering method and device | |
CN114155461A (en) | Method and system for filtering and purifying tiny video content | |
CN114691824A (en) | Theme extraction method, device and equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||