CN114760460A - Video quality detection method, device, storage medium and apparatus - Google Patents

Video quality detection method, device, storage medium and apparatus

Info

Publication number
CN114760460A
Authority
CN
China
Prior art keywords
detected
video
audio
image
data stream
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011610103.1A
Other languages
Chinese (zh)
Inventor
宋泽坤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Hongxiang Technical Service Co Ltd
Original Assignee
Beijing Hongxiang Technical Service Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Hongxiang Technical Service Co Ltd
Priority to CN202011610103.1A
Publication of CN114760460A
Legal status: Pending

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N17/00 Diagnosis, testing or measuring for television systems or their details
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/57 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for processing of video signals
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/233 Processing of audio elementary streams
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234 Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/23418 Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Television Signal Processing For Recording (AREA)

Abstract

The invention discloses a video quality detection method, device, storage medium and apparatus. In contrast to existing manual video quality detection, the invention acquires a video to be detected and a log file corresponding to the video to be detected; extracts the video to be detected to obtain an image to be detected and an audio to be detected; performs data extraction on the log file to obtain an image data stream and an audio data stream; generates a sound-picture synchronous comparison result according to the image to be detected, the audio to be detected, the image data stream and the audio data stream; sends the image data of the image to be detected, the log file, the sound-picture synchronous comparison result and the audio data of the audio to be detected to a preset server as video detection information; and receives the video quality detection result fed back by the preset server according to the video detection information. Quality detection can thereby be performed on a video whose source data cannot be obtained, and detection efficiency is improved.

Description

Video quality detection method, device, storage medium and apparatus
Technical Field
The present invention relates to the field of video processing technologies, and in particular, to a method, a device, a storage medium, and an apparatus for detecting video quality.
Background
At present, the quality of a video to be detected for which the original video stream data and audio stream data cannot be acquired is often detected manually. However, manual detection of video quality has high cost, low efficiency and poor reliability.
The above is only for the purpose of assisting understanding of the technical aspects of the present invention, and does not represent an admission that the above is prior art.
Disclosure of Invention
The main object of the invention is to provide a video quality detection method, device, storage medium and apparatus, so as to solve the technical problem of how to detect the quality of a video whose source data cannot be acquired.
In order to achieve the above object, the present invention provides a video quality detection method, which includes the following steps:
acquiring a video to be detected and a log file corresponding to the video to be detected, and extracting the video to be detected to obtain an image to be detected and an audio to be detected;
performing data extraction on the log file to obtain an image data stream and an audio data stream;
generating a sound-picture synchronous comparison result according to the image to be detected, the audio to be detected, the image data stream and the audio data stream;
sending the image data of the image to be detected, the log file, the sound-picture synchronous comparison result and the audio data of the audio to be detected as video detection information to a preset server;
and receiving a video quality detection result fed back by the preset server according to the video detection information.
Optionally, the step of obtaining a video to be detected and a log file corresponding to the video to be detected, and extracting the video to be detected to obtain an image to be detected and an audio to be detected specifically includes:
acquiring a video to be detected and a log file corresponding to the video to be detected;
when the video to be detected starts to play, image interception is carried out on the video to be detected, and an image to be detected is obtained;
acquiring current equipment information, and determining a current audio recording mode according to the current equipment information;
and recording the audio of the video to be detected according to the current audio recording mode to obtain the audio to be detected.
Optionally, the step of capturing an image of the video to be detected when the video to be detected starts to be played to obtain an image to be detected specifically includes:
when the video to be detected starts to play, image interception is carried out on the video to be detected through a preset screenshot script, and an intercepted image set is obtained;
acquiring the interception time of each intercepted image in the intercepted image set, and sequencing the intercepted images according to the interception time to obtain a sequencing result;
traversing the intercepted image according to the sequencing result, and taking the traversed intercepted image as an image to be detected.
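The ordering step above can be sketched as follows. This is a minimal illustration only: the `CapturedImage` record, its field names and the use of seconds as the time unit are assumptions made here, since the source does not specify a data layout or the contents of the preset screenshot script.

```python
from dataclasses import dataclass
from typing import Iterator, List


@dataclass(frozen=True)
class CapturedImage:
    """Hypothetical record for one screenshot (not defined in the source)."""
    path: str
    capture_time: float  # seconds since playback started (assumed unit)


def images_to_detect(captured: List[CapturedImage]) -> Iterator[CapturedImage]:
    """Yield captured images in ascending order of capture time."""
    # sorted() is stable, so images with equal timestamps keep their input order.
    for image in sorted(captured, key=lambda img: img.capture_time):
        yield image
```

Each yielded item plays the role of the "image to be detected" for one pass of the later comparison steps.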
Optionally, the step of performing audio recording on the video to be detected according to the current audio recording mode to obtain the audio to be detected specifically includes:
determining a target recording device and an audio playing device according to the current audio recording mode, wherein the target recording device is arranged at a position opposite to the audio playing device;
controlling the audio playing device to operate, and controlling the target recording device to start audio recording;
and receiving the recorded audio uploaded by the target recording equipment, and selecting the audio to be detected from the recorded audio.
Optionally, the step of receiving the recorded audio uploaded by the target recording device and selecting the audio to be detected from the recorded audio specifically includes:
receiving the recorded audio uploaded by the target recording equipment, and cutting the recorded audio according to the intercepting time to obtain a candidate audio;
and traversing the candidate audio according to the sequencing result, and taking the traversed candidate audio as the audio to be detected.
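One possible reading of the cutting step is sketched below. It assumes the recording is available as a sequence of samples at a known sample rate, and that one fixed-length clip is taken starting at each interception time; the window length `window_s` and the function name are hypothetical, because the source does not say how long each candidate audio is.

```python
from typing import List, Sequence


def cut_audio(samples: Sequence[float], sample_rate: int,
              capture_times: Sequence[float], window_s: float = 1.0) -> List[List[float]]:
    """Return one candidate clip per capture time, in ascending time order."""
    clips = []
    for t in sorted(capture_times):
        # Convert the capture time to a sample index and clamp to the recording.
        start = max(0, int(round(t * sample_rate)))
        end = min(len(samples), start + int(round(window_s * sample_rate)))
        clips.append(list(samples[start:end]))
    return clips
```

Because the clips are produced in the same time order as the sorted screenshots, the n-th clip can be paired with the n-th image to be detected.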
Optionally, the step of generating a sound-picture synchronization comparison result according to the image to be detected, the audio to be detected, the image data stream, and the audio data stream specifically includes:
performing character recognition on the image to be detected to obtain image recognition characters;
performing voice recognition on the audio to be detected to obtain voice recognition characters;
generating a character matching result according to the image recognition characters and the voice recognition characters;
determining a data stream matching result according to the image data stream and the audio data stream;
and generating a sound-picture synchronous comparison result according to the character matching result and the data stream matching result.
Optionally, the step of generating a text matching result according to the image recognition text and the voice recognition text specifically includes:
determining character similarity according to the image recognition characters and the voice recognition characters;
and judging whether the character similarity is greater than a preset similarity threshold value or not, and generating a character matching result according to the judgment result.
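A minimal sketch of the similarity check, using Python's standard `difflib.SequenceMatcher` as a stand-in similarity measure. The source names neither the similarity metric nor the threshold value, so both the metric and the default of 0.8 are assumptions.

```python
from difflib import SequenceMatcher


def text_match(image_text: str, speech_text: str, threshold: float = 0.8) -> bool:
    """Return True when the two recognized texts are similar enough."""
    # ratio() returns a value in [0, 1]; 1.0 means the strings are identical.
    similarity = SequenceMatcher(None, image_text.strip(), speech_text.strip()).ratio()
    # The match succeeds only when similarity exceeds the preset threshold.
    return similarity > threshold
```

Here `image_text` would come from character recognition on the image to be detected (for example, on-screen subtitles) and `speech_text` from voice recognition on the audio to be detected.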
Optionally, the step of determining a data stream matching result according to the image data stream and the audio data stream specifically includes:
acquiring the arrival time of image data corresponding to the image data stream, and acquiring the arrival time of audio data corresponding to the audio data stream;
and generating a data stream matching result according to the image data arrival time and the audio data arrival time.
Optionally, the step of generating a data stream matching result according to the image data arrival time and the audio data arrival time specifically includes:
determining a time difference value according to the image data arrival time and the audio data arrival time;
and judging whether the time difference is larger than a preset time threshold value or not, and generating a data stream matching result according to a judgment result.
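The time-difference check can be sketched as below. The parameter names, the use of seconds, and the default threshold of 0.1 s are assumptions; the source only states that the difference is compared against a preset time threshold.

```python
def data_stream_match(image_arrival_s: float, audio_arrival_s: float,
                      max_skew_s: float = 0.1) -> bool:
    """Return True when the arrival-time difference is within the threshold."""
    time_difference = abs(image_arrival_s - audio_arrival_s)
    # A difference greater than the threshold is treated as a mismatch (assumed polarity).
    return time_difference <= max_skew_s
```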
Optionally, the step of extracting data from the log file to obtain an image data stream and an audio data stream specifically includes:
performing data extraction on the log file through a preset log capturing script to obtain extracted data;
performing data cleaning on the extracted data to obtain data to be classified;
and carrying out data classification on the data to be classified to obtain an image data stream and an audio data stream.
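The extract, clean and classify sequence might look like the sketch below. The log line format (`<seconds> <VIDEO|AUDIO> <payload>`) is entirely hypothetical, as the source does not describe the player's log format or the preset log capturing script; cleaning is illustrated here as trimming whitespace and dropping unparseable or duplicate lines.

```python
import re
from typing import Dict, Iterable, List, Tuple

# Hypothetical log line format: "<seconds> <VIDEO|AUDIO> <payload>"
LINE_RE = re.compile(r"^(?P<ts>\d+(?:\.\d+)?)\s+(?P<kind>VIDEO|AUDIO)\s+(?P<payload>.+)$")


def split_log(lines: Iterable[str]) -> Dict[str, List[Tuple[float, str]]]:
    """Extract, clean and classify log lines into image and audio streams."""
    streams: Dict[str, List[Tuple[float, str]]] = {"image": [], "audio": []}
    seen = set()
    for raw in lines:
        line = raw.strip()                  # cleaning: trim whitespace
        match = LINE_RE.match(line)         # extraction: keep only media records
        if match is None or line in seen:   # cleaning: drop noise and exact duplicates
            continue
        seen.add(line)
        key = "image" if match["kind"] == "VIDEO" else "audio"  # classification
        streams[key].append((float(match["ts"]), match["payload"]))
    return streams
```

The timestamps retained in each stream are what a later step would use as the image data arrival time and audio data arrival time.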
Optionally, after the step of receiving the video quality detection result fed back by the preset server according to the video detection information, the video quality detection method further includes:
acquiring current playing interface information, and determining an information display template according to the current playing interface information;
and writing the video quality display result into the current playing interface according to the information display template to obtain quality detection reminding information, and displaying the quality detection reminding information.
Optionally, after the step of writing the video quality display result into the current playing interface according to the information display template, obtaining quality detection reminding information, and displaying the quality detection reminding information, the video quality detection method further includes:
receiving manual marking operation fed back by a user according to the quality detection reminding information, and generating feedback information according to the manual marking operation;
and acquiring video information of the video to be detected, and sending the video information and the feedback information to the preset server.
Furthermore, to achieve the above object, the present invention also proposes a video quality detection apparatus comprising a memory, a processor and a video quality detection program stored on the memory and executable on the processor, the video quality detection program being configured to implement the steps of the video quality detection method as described above.
Furthermore, to achieve the above object, the present invention further provides a storage medium having a video quality detection program stored thereon, which when executed by a processor implements the steps of the video quality detection method as described above.
In addition, to achieve the above object, the present invention further provides a video quality detection apparatus, including: a video extraction module, a log extraction module, a synchronous comparison module, a data sending module and a result receiving module;
the video extraction module is used for acquiring a video to be detected and a log file corresponding to the video to be detected, extracting the video to be detected and acquiring an image to be detected and an audio to be detected;
the log extraction module is used for extracting data of the log file to obtain an image data stream and an audio data stream;
the synchronous comparison module is used for generating a sound-picture synchronous comparison result according to the image to be detected, the audio to be detected, the image data stream and the audio data stream;
the data sending module is used for sending the image data of the image to be detected, the log file, the sound-picture synchronous comparison result and the audio data of the audio to be detected to a preset server as video detection information;
and the result receiving module is used for receiving a video quality detection result fed back by the preset server according to the video detection information.
Optionally, the video extraction module is further configured to obtain a video to be detected and a log file corresponding to the video to be detected;
the video extraction module is further used for intercepting the video to be detected when the video to be detected starts to be played, so as to obtain an image to be detected;
the video extraction module is also used for acquiring current equipment information and determining a current audio recording mode according to the current equipment information;
the video extraction module is further configured to record an audio of the video to be detected according to the current audio recording mode, so as to obtain the audio to be detected.
Optionally, the video extraction module is further configured to perform image capture on the video to be detected through a preset screenshot script when the video to be detected starts to be played, so as to obtain a captured image set;
the video extraction module is further used for acquiring the interception time of each intercepted image in the intercepted image set, and sequencing the intercepted images according to the interception time to obtain a sequencing result;
the video extraction module is further configured to traverse the captured image according to the sorting result, and use the traversed captured image as an image to be detected.
Optionally, the video extraction module is further configured to determine a target recording device and an audio playing device according to the current audio recording mode, where the target recording device is disposed at a position opposite to the audio playing device;
the video extraction module is further used for controlling the audio playing device to operate and controlling the target recording device to start audio recording;
the video extraction module is further used for receiving the recorded audio uploaded by the target recording device and selecting the audio to be detected from the recorded audio.
Optionally, the video extraction module is further configured to receive the recorded audio uploaded by the target recording device, and cut the recorded audio according to the interception time to obtain a candidate audio;
the video extraction module is further configured to traverse the candidate audio according to the sorting result, and use the traversed candidate audio as the audio to be detected.
Optionally, the synchronous comparison module is further configured to perform character recognition on the image to be detected to obtain image recognition characters;
the synchronous comparison module is also used for carrying out voice recognition on the audio to be detected to obtain voice recognition characters;
the synchronous comparison module is also used for generating a character matching result according to the image recognition characters and the voice recognition characters;
the synchronous comparison module is further used for determining a data stream matching result according to the image data stream and the audio data stream;
and the synchronous comparison module is also used for generating a sound-picture synchronous comparison result according to the character matching result and the data stream matching result.
Compared with the existing mode of manually detecting video quality, the invention acquires a video to be detected and a log file corresponding to the video to be detected; extracts the video to be detected to obtain an image to be detected and an audio to be detected; performs data extraction on the log file to obtain an image data stream and an audio data stream; generates a sound-picture synchronous comparison result according to the image to be detected, the audio to be detected, the image data stream and the audio data stream; sends the image data of the image to be detected, the log file, the sound-picture synchronous comparison result and the audio data of the audio to be detected to the preset server as video detection information; and receives the video quality detection result fed back by the preset server according to the video detection information. Quality detection can therefore be performed on a video whose source data cannot be obtained, and the detection efficiency is improved.
Drawings
FIG. 1 is a schematic structural diagram of a video quality detection device in a hardware operating environment according to an embodiment of the present invention;
FIG. 2 is a flowchart illustrating a video quality detection method according to a first embodiment of the present invention;
FIG. 3 is a flowchart illustrating a video quality detection method according to a second embodiment of the present invention;
FIG. 4 is a flowchart illustrating a video quality detection method according to a third embodiment of the present invention;
FIG. 5 is a block diagram of a video quality detection apparatus according to a first embodiment of the present invention.
The implementation, functional features and advantages of the objects of the present invention will be further explained with reference to the accompanying drawings.
Detailed Description
It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
Referring to fig. 1, fig. 1 is a schematic structural diagram of a video quality detection device in a hardware operating environment according to an embodiment of the present invention.
As shown in fig. 1, the video quality detection apparatus may include: a processor 1001, such as a Central Processing Unit (CPU), a communication bus 1002, a user interface 1003, a network interface 1004, and a memory 1005. The communication bus 1002 is used to enable connection and communication between these components. The user interface 1003 may include a display screen (Display); optionally, the user interface 1003 may further include a standard wired interface and a wireless interface, and in the present invention the wired interface of the user interface 1003 may be a USB interface. The network interface 1004 may optionally include a standard wired interface and a wireless interface (e.g., a Wireless-Fidelity (Wi-Fi) interface). The memory 1005 may be a Random Access Memory (RAM) or a Non-volatile Memory (NVM), such as a disk memory. The memory 1005 may alternatively be a storage device separate from the processor 1001.
Those skilled in the art will appreciate that the configuration shown in fig. 1 does not constitute a limitation of the video quality detection apparatus and may include more or fewer components than those shown, or some components may be combined, or a different arrangement of components.
As shown in FIG. 1, the memory 1005, which is a type of computer storage medium, may include an operating system, a network communication module, a user interface module, and a video quality detection program.
In the video quality detection apparatus shown in fig. 1, the network interface 1004 is mainly used for connecting to a background server and performing data communication with the background server; the user interface 1003 is mainly used for connecting user equipment; the video quality detection apparatus calls a video quality detection program stored in the memory 1005 through the processor 1001 and performs the video quality detection method provided by the embodiment of the present invention.
Based on the hardware structure, the embodiment of the video quality detection method is provided.
Referring to fig. 2, fig. 2 is a flowchart illustrating a video quality detection method according to a first embodiment of the present invention.
In a first embodiment, the video quality detection method includes the steps of:
Step S10: acquiring a video to be detected and a log file corresponding to the video to be detected, and extracting the video to be detected to obtain an image to be detected and an audio to be detected.
It should be noted that the execution subject of this embodiment is the video quality detection device, where the video quality detection device may be an electronic device such as a computer or a mobile phone, or may also be another device that can achieve the same or similar functions.
It should be understood that acquiring the video to be detected may be: after the video playing application program is started, randomly acquiring a play address from a play list and playing the video at that address, thereby obtaining the video to be detected.
It can be understood that the obtaining of the log file corresponding to the video to be detected may be obtaining the log file corresponding to the video to be detected in a preset storage area. Wherein the preset storage area can be preset by a developer of the video quality detection device.
It should be understood that acquiring a video to be detected and a log file corresponding to the video to be detected, and extracting the video to be detected to obtain an image to be detected and an audio to be detected, may be: acquiring the video to be detected and the log file corresponding to the video to be detected; when the video to be detected starts to play, performing image interception on the video to be detected to obtain the image to be detected; acquiring current equipment information and determining a current audio recording mode according to the current equipment information; and recording the audio of the video to be detected according to the current audio recording mode to obtain the audio to be detected.
Step S20: performing data extraction on the log file to obtain an image data stream and an audio data stream.
It should be understood that performing data extraction on the log file to obtain the image data stream and the audio data stream may be: performing data extraction on the log file through a preset log capturing script to obtain extracted data, performing data cleaning on the extracted data to obtain data to be classified, and performing data classification on the data to be classified to obtain the image data stream and the audio data stream.
Step S30: generating a sound-picture synchronous comparison result according to the image to be detected, the audio to be detected, the image data stream and the audio data stream.
It can be understood that the generation of the sound-picture synchronization comparison result according to the image to be detected, the audio to be detected, the image data stream and the audio data stream may be to perform character recognition on the image to be detected to obtain image recognition characters, perform voice recognition on the audio to be detected to obtain voice recognition characters, generate a character matching result according to the image recognition characters and the voice recognition characters, determine a data stream matching result according to the image data stream and the audio data stream, and generate a sound-picture synchronization comparison result according to the character matching result and the data stream matching result.
Further, in order to improve reliability of a text matching result, the generating a text matching result according to the image recognition text and the voice recognition text includes:
determining character similarity according to the image recognition characters and the voice recognition characters, judging whether the character similarity is larger than a preset similarity threshold value, and generating a character matching result according to a judgment result.
Further, in order to generate a data stream matching result quickly, the determining a data stream matching result according to the image data stream and the audio data stream includes:
and acquiring the arrival time of image data corresponding to the image data stream, acquiring the arrival time of audio data corresponding to the audio data stream, and generating a data stream matching result according to the arrival time of the image data and the arrival time of the audio data.
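How the character matching result and the data stream matching result of step S30 are combined is not stated in the source. The sketch below assumes, purely for illustration, that sound and picture are considered synchronized only when both checks pass; the `SyncComparison` class and its field names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SyncComparison:
    """Hypothetical container for the sound-picture synchronous comparison result."""
    text_matched: bool
    stream_matched: bool

    @property
    def in_sync(self) -> bool:
        # Assumed rule: both checks must pass; the source does not state how they combine.
        return self.text_matched and self.stream_matched
```

Keeping the two partial results alongside the combined flag lets the server see which check failed, which a single boolean would hide.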
Step S40: sending the image data of the image to be detected, the log file, the sound-picture synchronous comparison result and the audio data of the audio to be detected as video detection information to a preset server.
It should be understood that the image data of the image to be detected, the log file, the sound-picture synchronous comparison result, and the audio data of the audio to be detected may be sent to the preset server as the video detection information in a preset wireless transmission manner.
It should be noted that the preset wireless transmission mode may be selected in real time by a user of the video quality detection device, for example: 5G, 4G, etc., which are not intended to be limiting in this embodiment.
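The video detection information of step S40 bundles four items. A minimal sketch of packaging them for transmission is shown below; the JSON layout, the key names and the base64 encoding of binary data are assumptions, since the source does not define a wire format or the server's interface.

```python
import base64
import json


def build_detection_info(image_bytes: bytes, audio_bytes: bytes,
                         log_text: str, in_sync: bool) -> str:
    """Serialize the four items of video detection information as one JSON document."""
    payload = {
        # Binary data is base64-encoded so that it can travel inside JSON text.
        "image_data": base64.b64encode(image_bytes).decode("ascii"),
        "audio_data": base64.b64encode(audio_bytes).decode("ascii"),
        "log_file": log_text,
        "sync_comparison": in_sync,
    }
    return json.dumps(payload, sort_keys=True)
```

The resulting string could then be sent over whichever transport the device uses; the transport itself is outside this sketch.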
Step S50: receiving a video quality detection result fed back by the preset server according to the video detection information.
It should be understood that the video quality detection result fed back by the preset server according to the video detection information may be received in a preset wireless transmission manner, which is not limited in this embodiment.
Further, in order to feed back the detection result to the user in time, after receiving the video quality detection result fed back by the preset server according to the video detection information, the method further includes:
acquiring current playing interface information, and determining an information display template according to the current playing interface information;
and writing the video quality display result into the current playing interface according to the information display template, obtaining quality detection reminding information, and displaying the quality detection reminding information.
It should be noted that the current playing interface information may be layout information of the current playing interface, and the present embodiment does not limit this.
It should be understood that the determining of the information presentation template according to the current playing interface information may be to search the preset template library for the information presentation template corresponding to the current playing interface information. The preset template library includes a corresponding relationship between the current playing interface information and the information display template, and the corresponding relationship between the current playing interface information and the information display template may be preset by a developer of the video quality detection device, which is not limited in this embodiment.
It can be understood that the video quality display result is written into the current playing interface according to the information display template, the quality detection reminding information can be obtained by determining an information area to be written into the current playing interface according to the information display template, and the video quality display result is written into the information area to be written into the current playing interface, so that the quality detection reminding information is obtained.
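The template lookup and write-in described above can be sketched as a dictionary lookup followed by string formatting. The layout keys, region names and message formats below are invented for illustration; the source only says that a preset template library maps playing interface information to an information display template.

```python
from typing import Dict, NamedTuple


class DisplayTemplate(NamedTuple):
    """Hypothetical template: where the result text goes and how it is worded."""
    region: str
    text_format: str


# Hypothetical preset template library keyed by playing-interface layout.
TEMPLATE_LIBRARY: Dict[str, DisplayTemplate] = {
    "fullscreen": DisplayTemplate("top-right", "Quality: {result}"),
    "windowed": DisplayTemplate("bottom-bar", "Video quality check: {result}"),
}


def build_reminder(layout: str, result: str) -> Dict[str, str]:
    """Look up the template for a layout and fill in the detection result."""
    template = TEMPLATE_LIBRARY[layout]  # raises KeyError for unknown layouts
    return {"region": template.region, "text": template.text_format.format(result=result)}
```

For example, `build_reminder("fullscreen", "normal")` returns the region to write into together with the text `"Quality: normal"`.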
Further, in order to upload manual mark information fed back by a user in time, after the writing of the video quality display result into the current playing interface according to the information display template to obtain quality detection reminding information and the displaying of the quality detection reminding information, the method further includes:
receiving manual marking operation fed back by a user according to the quality detection reminding information, and generating feedback information according to the manual marking operation;
and acquiring video information of the video to be detected, and sending the video information and the feedback information to the preset server.
It should be noted that the manual marking operation may be an operation performed by a user in real time on a user interaction interface of the video quality detection device; the feedback information may be garbled-screen information, black-screen information, or normal information; the video information may be video playing address information, and the like, which is not limited in this embodiment.
Compared with the existing method for manually detecting the video quality, in this embodiment, the video to be detected and the log file corresponding to the video to be detected are obtained, the image to be detected and the audio to be detected are obtained, the data of the log file is extracted to obtain the image data stream and the audio data stream, the audio-video synchronous comparison result is generated according to the image to be detected, the audio to be detected, the image data stream and the audio data stream, the image data, the log file, the audio-video synchronous comparison result and the audio data of the audio to be detected are sent to the preset server as video detection information, and the video quality detection result fed back by the preset server according to the video detection information is received, so that the video which cannot obtain the source data can be subjected to quality detection, and the detection efficiency is improved.
Referring to fig. 3, fig. 3 is a flowchart illustrating a second embodiment of the video quality detection method according to the present invention, and the second embodiment of the video quality detection method is proposed based on the first embodiment shown in fig. 2.
In the second embodiment, the step S10 includes:
step S101: the method comprises the steps of obtaining a video to be detected and a log file corresponding to the video to be detected.
It should be understood that acquiring the video to be detected may be: after the video playing application program is started, randomly acquiring a play address from a play list and playing the video at that address, so as to acquire the video to be detected.
It can be understood that the obtaining of the log file corresponding to the video to be detected may be obtaining the log file corresponding to the video to be detected in a preset storage area. Wherein the preset storage area can be preset by a developer of the video quality detection device.
Step S102: and when the video to be detected starts to play, carrying out image interception on the video to be detected to obtain an image to be detected.
It should be understood that, the image capturing of the video to be detected may be performed every preset time period to obtain the image to be detected. The preset time period may be preset by a developer of the video quality detection device, and in this embodiment, 1 second is taken as an example for description.
Step S103: and acquiring current equipment information, and determining a current audio recording mode according to the current equipment information.
It should be noted that the current device information may be system information of the current device, for example, an Android system or an iOS system, which is not limited in this embodiment.
It should be understood that determining the current audio recording mode according to the current device information may be looking up a current device recording mode corresponding to the current device information in a preset recording mode table. The preset recording mode table includes a corresponding relationship between current device information and a current device recording mode, and the corresponding relationship between the current device information and the current device recording mode may be preset by a developer of the video quality detection device, which is not limited in this embodiment.
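The table lookup described above can be sketched as follows. This is a minimal illustration only: the table contents, the mode names, and the function name `determine_recording_mode` are hypothetical and are not specified by this embodiment.

```python
# Hypothetical preset recording mode table; keys and mode names are
# illustrative assumptions, not values defined by the embodiment.
RECORDING_MODE_TABLE = {
    "android": "android_system_audio_recording",
    "ios": "ios_system_audio_recording",
}


def determine_recording_mode(device_info: str) -> str:
    """Look up the recording mode that corresponds to the device information."""
    key = device_info.strip().lower()
    if key not in RECORDING_MODE_TABLE:
        raise ValueError(f"no recording mode configured for {device_info!r}")
    return RECORDING_MODE_TABLE[key]
```

Raising an error for unknown device information is one possible design choice; a default mode could be returned instead.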
Step S104: and recording the audio of the video to be detected according to the current audio recording mode to obtain the audio to be detected.
It should be understood that the audio recording is performed on the video to be detected according to the current audio recording mode, and the obtaining of the audio to be detected may be determining a target recording device and an audio playing device according to the current audio recording mode, where the target recording device is disposed at a position opposite to the audio playing device, controlling the audio playing device to operate, controlling the target recording device to start audio recording, receiving the recorded audio uploaded by the target recording device, and selecting the audio to be detected from the recorded audio.
In a second embodiment, by acquiring a video to be detected and a log file corresponding to the video to be detected, when the video to be detected starts to be played, image capture is performed on the video to be detected to acquire an image to be detected, current equipment information is acquired, a current audio recording mode is determined according to the current equipment information, and audio recording is performed on the video to be detected according to the current audio recording mode to acquire an audio to be detected, so that a reliable image to be detected and the audio to be detected can be extracted.
In the second embodiment, the step S20 includes:
step S201: and performing data extraction on the log file through a preset log capturing script to obtain extracted data.
It should be noted that the preset log capture script may be preset by a developer of the video quality detection apparatus, which is not limited in this embodiment.
Step S202: and carrying out data cleaning on the extracted data to obtain data to be classified.
It should be appreciated that the data cleansing of the extracted data is to clean up duplicate data in the extracted data.
Step S203: and carrying out data classification on the data to be classified to obtain an image data stream and an audio data stream.
It can be understood that, the data classification of the data to be classified to obtain the image data stream and the audio data stream may be to obtain data characteristics of the data to be classified, and perform data classification on the data to be classified according to the data characteristics to obtain the image data stream and the audio data stream.
In a second embodiment, a preset log grabbing script is used for extracting data from the log file to obtain extracted data, the extracted data is subjected to data cleaning to obtain data to be classified, the data to be classified is subjected to data classification to obtain an image data stream and an audio data stream, and therefore the image data stream and the audio data stream can be determined rapidly.
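Steps S201 to S203 can be sketched as follows. The log line format (`[video] arrival_ms=...`), the `LogRecord` type, and the function names are assumptions made for illustration; the embodiment does not define the log format or the data characteristics used for classification.

```python
import re
from typing import List, NamedTuple, Tuple


class LogRecord(NamedTuple):
    kind: str        # "video" or "audio" (assumed data characteristic)
    arrival_ms: int  # arrival time in milliseconds


# Hypothetical log line format, e.g. "[video] arrival_ms=1200".
LINE_PATTERN = re.compile(r"\[(video|audio)\]\s+arrival_ms=(\d+)")


def extract(log_text: str) -> List[LogRecord]:
    """S201: pull every matching record out of the raw log text."""
    return [LogRecord(m.group(1), int(m.group(2)))
            for m in LINE_PATTERN.finditer(log_text)]


def clean(records: List[LogRecord]) -> List[LogRecord]:
    """S202: drop duplicate records while keeping the original order."""
    return list(dict.fromkeys(records))


def classify(records: List[LogRecord]) -> Tuple[List[LogRecord], List[LogRecord]]:
    """S203: split records into an image data stream and an audio data stream."""
    image_stream = [r for r in records if r.kind == "video"]
    audio_stream = [r for r in records if r.kind == "audio"]
    return image_stream, audio_stream
```

A call such as `classify(clean(extract(log_text)))` chains the three steps in the order given above.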
In the second embodiment, the step S30 includes:
step S301: and carrying out character recognition on the image to be detected to obtain image recognition characters.
It should be understood that, performing character recognition on the image to be detected to obtain the image recognition characters may be performing character recognition on the image to be detected by using a preset character recognition script to obtain the image recognition characters. The preset character recognition script may be an OCR script preset by a developer of the video quality detection device, which is not limited in this embodiment.
Step S302: and carrying out voice recognition on the audio to be detected to obtain voice recognition characters.
It can be understood that performing voice recognition on the audio to be detected to obtain the voice recognition characters may be performing voice recognition on the audio to be detected through a preset voice recognition script to obtain the voice recognition characters. The preset voice recognition script may be preset by a developer of the video quality detection device.
Step S303: and generating a character matching result according to the image recognition characters and the voice recognition characters.
It should be understood that generating the character matching result according to the image recognition characters and the voice recognition characters may be: when the image recognition characters and the voice recognition characters match completely, determining that the character matching result is a successful match.
Further, in order to improve the reliability of the text matching result, the step S303 includes:
determining character similarity according to the image recognition characters and the voice recognition characters;
and judging whether the character similarity is greater than a preset similarity threshold value or not, and generating a character matching result according to the judgment result.
It should be understood that determining the character similarity based on the image recognition characters and the voice recognition characters may be comparing the image recognition characters with the voice recognition characters one by one, and determining the character similarity based on the comparison result.
It should be noted that the preset similarity threshold may be preset by a developer of the video quality detection device, and in this embodiment, 0.65 is taken as an example for description.
In a specific implementation, for example, when the similarity of the characters is greater than 0.65, the character matching result is determined to be successful; and when the character similarity is less than or equal to 0.65, judging that the character matching result is a matching failure.
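The threshold check can be sketched as below. The similarity measure shown here (the share of positions at which the two strings carry the same character) is only one possible reading of "comparing one by one" and is an assumption; the function names are hypothetical.

```python
def character_similarity(image_text: str, speech_text: str) -> float:
    """Share of positions where both strings hold the same character.

    Assumption: the count is normalised by the longer string, and two
    empty strings are treated as identical.
    """
    if not image_text and not speech_text:
        return 1.0
    longest = max(len(image_text), len(speech_text))
    matches = sum(a == b for a, b in zip(image_text, speech_text))
    return matches / longest


def character_match(image_text: str, speech_text: str,
                    threshold: float = 0.65) -> bool:
    """True (match succeeds) only when similarity is strictly above the threshold."""
    return character_similarity(image_text, speech_text) > threshold
```

A position-wise comparison is sensitive to insertions and deletions, so an edit-distance-based measure may be preferable in practice.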
Step S304: and determining a data stream matching result according to the image data stream and the audio data stream.
It should be appreciated that determining a data stream match result from the image data stream and the audio data stream may be matching the image data stream with the audio data stream to obtain a data stream match result.
Further, in order to generate a data stream matching result quickly, the step S304 includes:
acquiring the arrival time of image data corresponding to the image data stream, and acquiring the arrival time of audio data corresponding to the audio data stream;
and generating a data stream matching result according to the image data arrival time and the audio data arrival time.
It should be understood that, the generation of the data stream matching result according to the image data arrival time and the audio data arrival time may be that when the image data arrival time and the audio data arrival time are equal, the data stream matching result is determined to be a successful matching; and when the image data arrival time is not equal to the audio data arrival time, judging that the data stream matching result is matching failure.
Further, in order to improve reliability of a data stream matching result, the step of generating the data stream matching result according to the image data arrival time and the audio data arrival time specifically includes:
determining a time difference value according to the image data arrival time and the audio data arrival time;
and judging whether the time difference is larger than a preset time threshold value or not, and generating a data stream matching result according to a judgment result.
It can be understood that: determining the time difference value according to the image data arrival time and the audio data arrival time may be subtracting the audio data arrival time from the image data arrival time, and taking an absolute value to obtain the time difference value.
It should be noted that the preset time threshold may be preset by a developer of the video quality detection apparatus, and in this embodiment, 100 milliseconds is taken as an example for description.
In a specific implementation, for example, when the time difference between the arrival time of the image data and the arrival time of the audio data is not more than 100 milliseconds within 10 seconds, the data stream matching result is determined to be successful; and when the time difference value between the arrival time of the image data and the arrival time of the audio data in 10 seconds is more than 100 milliseconds, judging that the data stream matching result is a matching failure.
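A minimal sketch of this check follows. It assumes that the arrival times are given in milliseconds, that the two lists are already paired sample by sample, and that they have been limited to the observation window (10 seconds in the example above); the function name and this pairing are assumptions, not details given by the embodiment.

```python
from typing import Sequence


def data_stream_match(image_arrivals_ms: Sequence[int],
                      audio_arrivals_ms: Sequence[int],
                      threshold_ms: int = 100) -> bool:
    """Return True when no paired time difference exceeds the threshold."""
    if not image_arrivals_ms or len(image_arrivals_ms) != len(audio_arrivals_ms):
        raise ValueError("arrival time lists must be non-empty and of equal length")
    # Time difference = |image arrival time - audio arrival time|.
    return all(abs(image - audio) <= threshold_ms
               for image, audio in zip(image_arrivals_ms, audio_arrivals_ms))
```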
Step S305: and generating a sound-picture synchronous comparison result according to the character matching result and the data stream matching result.
It should be understood that generating the sound-picture synchronization comparison result according to the character matching result and the data stream matching result may be: when the character matching result is a successful match and the data stream matching result is a successful match, determining that the sound-picture synchronization comparison result is sound-picture synchronization; and when the character matching result is a matching failure or the data stream matching result is a matching failure, determining that the sound-picture synchronization comparison result is sound-picture out of synchronization.
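The combination rule of step S305 amounts to a logical AND of the two matching results, with any failure yielding an out-of-synchronization result. A short sketch, with hypothetical names:

```python
from enum import Enum


class SyncResult(Enum):
    IN_SYNC = "sound-picture synchronization"
    OUT_OF_SYNC = "sound-picture out of synchronization"


def sync_comparison(character_match_ok: bool, stream_match_ok: bool) -> SyncResult:
    """In sync only when both the character match and the stream match succeed."""
    if character_match_ok and stream_match_ok:
        return SyncResult.IN_SYNC
    return SyncResult.OUT_OF_SYNC
```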
In a second embodiment, the image to be detected is subjected to character recognition to obtain image recognition characters, the audio to be detected is subjected to voice recognition to obtain voice recognition characters, character matching results are generated according to the image recognition characters and the voice recognition characters, data stream matching results are determined according to the image data stream and the audio data stream, and sound and picture synchronization comparison results are generated according to the character matching results and the data stream matching results, so that the accuracy of the sound and picture synchronization comparison results can be improved.
Referring to fig. 4, fig. 4 is a flowchart illustrating a video quality detection method according to a third embodiment of the present invention, and the third embodiment of the video quality detection method according to the present invention is proposed based on the second embodiment shown in fig. 3.
In a third embodiment, the step S102 includes:
step S1021: and when the video to be detected starts to be played, carrying out image interception on the video to be detected through a preset screenshot script to obtain an intercepted image set.
It should be noted that the preset screenshot script may be a screenshot script preset by a developer of the video quality detection device, which is not limited in this embodiment.
It should be understood that, image capture is performed on the video to be detected through the preset capture script, and the captured image set is obtained by image capture on the video to be detected through the preset capture script every preset time period, so as to obtain a captured image, and a captured image set is generated according to the captured image. The preset time period may be preset by a developer of the video quality detection device, and in this embodiment, 1 second is taken as an example for description.
Step S1022: and acquiring the interception time of each intercepted image in the intercepted image set, and sequencing the intercepted images according to the interception time to obtain a sequencing result.
It should be understood that sorting the intercepted images according to the interception time may be sorting the intercepted images from earliest to latest according to the interception time, and this embodiment is not limited thereto.
Step S1023: traversing the intercepted image according to the sequencing result, and taking the traversed intercepted image as an image to be detected.
In a specific implementation, for example, when the video to be detected starts to play, images of the video to be detected are intercepted through a preset screenshot script. Because a video file is played frame by frame in sequence, when the source data of the video to be detected cannot be obtained, the intercepted image set can be obtained by taking screenshots. When image interception is carried out on the video to be detected every second or at a shorter interval, the intercepted images obtained are equivalent to the key frame images of the video to be detected.
In a third embodiment, when the video to be detected starts to be played, image interception is performed on the video to be detected through a preset screenshot script to obtain an intercepted image set, the interception time of each intercepted image in the intercepted image set is obtained, the intercepted images are sequenced according to the interception time to obtain a sequencing result, the intercepted images are traversed according to the sequencing result, and the traversed intercepted images are used as the images to be detected, so that the images to be detected can be generated through screenshot when the source data of the video to be detected cannot be obtained.
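Steps S1022 and S1023 can be sketched as follows, assuming each intercepted image is stored together with its interception time. The `CapturedImage` type, its fields, and the function name are hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class CapturedImage:
    path: str           # where the screenshot was saved (illustrative)
    captured_at: float  # interception time, seconds since playback started


def images_to_detect(captured: Iterable[CapturedImage]) -> List[CapturedImage]:
    """Sort intercepted images from earliest to latest interception time.

    Traversing the returned list yields the images to be detected in order.
    """
    return sorted(captured, key=lambda image: image.captured_at)
```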
In a third embodiment, the step S104 includes:
step S1041: and determining a target recording device and an audio playing device according to the current audio recording mode, wherein the target recording device is arranged at a position relative to the audio playing device.
In a specific implementation, for example, because a mobile phone running the Android system cannot record the sound of software inside the phone, when the current audio recording mode is the Android system audio recording mode, the earpiece of a mobile phone headset is used as the audio playing device and the microphone of the mobile phone headset is used as the target recording device, with the headset earpiece bound to the headset microphone.
Step S1042: and controlling the audio playing equipment to operate, and controlling the target recording equipment to start audio recording.
It should be understood that controlling the operation of the audio playing device may be connecting the audio playing device to the video quality detection device and controlling the operation of the audio playing device.
Step S1043: and receiving the recorded audio uploaded by the target recording equipment, and selecting the audio to be detected from the recorded audio.
In a specific implementation, for example, because a mobile phone running the Android system cannot record the sound of software inside the phone, a headset needs to be connected to the mobile phone and the headset earpiece bound to the headset microphone; the audio played by the headset earpiece is recorded through the headset microphone, the audio uploaded by the headset microphone is used as the recorded audio, and the audio to be detected is then selected from the recorded audio.
Further, in order to ensure consistency of the image to be detected and the audio to be detected in terms of time, the step S1043 includes:
receiving the recorded audio uploaded by the target recording equipment, and cutting the recorded audio according to the intercepting time to obtain candidate audio;
and traversing the candidate audio according to the sequencing result, and taking the traversed candidate audio as the audio to be detected.
In a specific implementation, for example, cutting the recorded audio according to the interception time to obtain the candidate audio may be cutting the recorded audio every second into n audio segments and taking those audio segments as the candidate audio.
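Cutting the recorded audio by interception time can be sketched as below. The sketch assumes the recorded audio is available as a flat sequence of samples with a known sample rate and that each segment lasts one second, starting at an interception time; these representations and the function name are assumptions.

```python
from typing import List, Sequence


def cut_audio(samples: Sequence[int], sample_rate: int,
              capture_times: Sequence[float],
              segment_seconds: float = 1.0) -> List[Sequence[int]]:
    """Cut one candidate audio segment per interception time.

    Segments are returned in ascending time order so that they line up
    with the sorted intercepted images.
    """
    segments = []
    for start_time in sorted(capture_times):
        start = int(start_time * sample_rate)
        end = int((start_time + segment_seconds) * sample_rate)
        segments.append(samples[start:end])
    return segments
```

Because the segments follow the same time ordering as the intercepted images, the i-th candidate audio corresponds to the i-th image to be detected.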
In a third embodiment, a target recording device and an audio playing device are determined according to the current audio recording mode, the target recording device is arranged at a position opposite to the audio playing device, the audio playing device is controlled to operate, the target recording device is controlled to start audio recording, recorded audio uploaded by the target recording device is received, and audio to be detected is selected from the recorded audio, so that noise emitted from the outside can be reduced, and the reliability of the audio to be detected is improved.
Furthermore, an embodiment of the present invention further provides a storage medium, where a video quality detection program is stored, and the video quality detection program, when executed by a processor, implements the steps of the video quality detection method as described above.
In addition, referring to fig. 5, an embodiment of the present invention further provides a video quality detection apparatus, where the video quality detection apparatus includes: the system comprises a video extraction module 10, a log extraction module 20, a synchronous comparison module 30, a data sending module 40 and a result receiving module 50;
the video extraction module 10 is configured to acquire a video to be detected and a log file corresponding to the video to be detected, and extract the video to be detected to acquire an image to be detected and an audio to be detected.
It should be understood that acquiring the video to be detected may be: after the video playing application program is started, randomly acquiring a play address from a play list and playing the video at that address, so as to acquire the video to be detected.
It can be understood that the obtaining of the log file corresponding to the video to be detected may be obtaining the log file corresponding to the video to be detected in a preset storage area. Wherein the preset storage area can be preset by a developer of the video quality detection device.
It should be understood that acquiring a video to be detected and a log file corresponding to the video to be detected, and extracting the video to be detected to obtain an image to be detected and an audio to be detected, may be: acquiring the video to be detected and the log file corresponding to the video to be detected; when the video to be detected starts to play, performing image interception on the video to be detected to obtain the image to be detected; acquiring current equipment information and determining a current audio recording mode according to the current equipment information; and recording the audio of the video to be detected according to the current audio recording mode to obtain the audio to be detected.
The log extraction module 20 is configured to perform data extraction on the log file to obtain an image data stream and an audio data stream.
It should be understood that performing data extraction on the log file to obtain the image data stream and the audio data stream may be: performing data extraction on the log file through a preset log capture script to obtain extracted data, performing data cleaning on the extracted data to obtain data to be classified, and performing data classification on the data to be classified to obtain the image data stream and the audio data stream.
the synchronous comparison module 30 is configured to generate a sound-picture synchronous comparison result according to the image to be detected, the audio to be detected, the image data stream, and the audio data stream.
It can be understood that the generation of the sound-picture synchronization comparison result according to the image to be detected, the audio to be detected, the image data stream and the audio data stream may be to perform character recognition on the image to be detected to obtain image recognition characters, perform voice recognition on the audio to be detected to obtain voice recognition characters, generate a character matching result according to the image recognition characters and the voice recognition characters, determine a data stream matching result according to the image data stream and the audio data stream, and generate a sound-picture synchronization comparison result according to the character matching result and the data stream matching result.
Further, in order to improve the reliability of the text matching result, the synchronous comparison module 30 is further configured to determine a text similarity according to the image recognition text and the voice recognition text, determine whether the text similarity is greater than a preset similarity threshold, and generate a text matching result according to the determination result.
Further, in order to generate a data stream matching result quickly, the synchronization comparison module 30 is further configured to obtain an arrival time of image data corresponding to the image data stream, obtain an arrival time of audio data corresponding to the audio data stream, and generate a data stream matching result according to the arrival time of the image data and the arrival time of the audio data.
The data sending module 40 is configured to send the image data of the image to be detected, the log file, the sound-picture synchronization comparison result, and the audio data of the audio to be detected to a preset server as video detection information.
It should be understood that sending the image data of the image to be detected, the log file, the sound-picture synchronous comparison result, and the audio data of the audio to be detected as the video detection information to the preset server may be sending the image data of the image to be detected, the log file, the sound-picture synchronous comparison result, and the audio data of the audio to be detected as the video detection information to the preset server in a preset wireless transmission manner.
It should be noted that the preset wireless transmission mode may be selected in real time by a user of the video quality detection device, for example: 5G, 4G, etc., which are not intended to be limiting in this embodiment.
The result receiving module 50 is configured to receive a video quality detection result fed back by the preset server according to the video detection information.
It should be understood that, the receiving of the video quality detection result fed back by the preset server according to the video detection information may be receiving the video quality detection result fed back by the preset server according to the video detection information through a preset wireless transmission manner, which is not limited in this embodiment.
Compared with the existing method for manually detecting the video quality, in this embodiment, the video to be detected and the log file corresponding to the video to be detected are obtained, the image to be detected and the audio to be detected are obtained, the data of the log file is extracted to obtain the image data stream and the audio data stream, the audio-video synchronous comparison result is generated according to the image to be detected, the audio to be detected, the image data stream and the audio data stream, the image data, the log file, the audio-video synchronous comparison result and the audio data of the audio to be detected are sent to the preset server as video detection information, and the video quality detection result fed back by the preset server according to the video detection information is received, so that the video which cannot obtain the source data can be subjected to quality detection, and the detection efficiency is improved.
Other embodiments or specific implementation manners of the video quality detection apparatus according to the present invention may refer to the above method embodiments, and are not described herein again.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or system that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or system. Without further limitation, an element defined by the phrase "comprising a ..." does not exclude the presence of other identical elements in the process, method, article, or system in which the element is included.
The above-mentioned serial numbers of the embodiments of the present invention are merely for description and do not represent the merits of the embodiments. In the unit claims enumerating several means, several of these means can be embodied by one and the same item of hardware. The use of the words first, second, third, etc. does not denote any order; rather, these words are to be interpreted as names.
Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better implementation manner. Based on such understanding, the technical solutions of the present invention or portions thereof that contribute to the prior art may be embodied in the form of a software product, where the computer software product is stored in a storage medium (e.g., a Read Only Memory (ROM)/Random Access Memory (RAM), a magnetic disk, an optical disk), and includes several instructions for enabling a terminal device (e.g., a mobile phone, a computer, a server, an air conditioner, or a network device) to execute the method according to the embodiments of the present invention.
The above description is only a preferred embodiment of the present invention, and not intended to limit the scope of the present invention, and all modifications of equivalent structures and equivalent processes, which are made by using the contents of the present specification and the accompanying drawings, or directly or indirectly applied to other related technical fields, are included in the scope of the present invention.
The invention discloses A1, a video quality detection method, which comprises the following steps:
acquiring a video to be detected and a log file corresponding to the video to be detected, and extracting the video to be detected to obtain an image to be detected and an audio to be detected;
performing data extraction on the log file to obtain an image data stream and an audio data stream;
generating a sound-picture synchronous comparison result according to the image to be detected, the audio to be detected, the image data stream and the audio data stream;
sending the image data of the image to be detected, the log file, the sound-picture synchronous comparison result and the audio data of the audio to be detected to a preset server as video detection information;
and receiving a video quality detection result fed back by the preset server according to the video detection information.
A2, the video quality detection method according to A1, wherein the step of acquiring the video to be detected and the log file corresponding to the video to be detected, and extracting the video to be detected to obtain the image to be detected and the audio to be detected, specifically includes:
acquiring a video to be detected and a log file corresponding to the video to be detected;
when the video to be detected starts to play, image interception is carried out on the video to be detected, and an image to be detected is obtained;
acquiring current equipment information, and determining a current audio recording mode according to the current equipment information;
and recording the audio of the video to be detected according to the current audio recording mode to obtain the audio to be detected.
A3, the video quality detection method according to A2, wherein the step of performing image interception on the video to be detected to obtain the image to be detected when the video to be detected starts to play specifically includes:
when the video to be detected starts to play, image interception is carried out on the video to be detected through a preset screenshot script, and an intercepted image set is obtained;
acquiring the interception time of each intercepted image in the intercepted image set, and sequencing the intercepted images according to the interception time to obtain a sequencing result;
traversing the intercepted image according to the sequencing result, and taking the traversed intercepted image as an image to be detected.
A4, the video quality detection method according to A3, wherein the step of recording the audio of the video to be detected according to the current audio recording mode to obtain the audio to be detected specifically comprises:
determining a target recording device and an audio playing device according to the current audio recording mode, wherein the target recording device is arranged at a position opposite to the audio playing device;
controlling the audio playing device to operate, and controlling the target recording device to start audio recording;
and receiving the recorded audio uploaded by the target recording equipment, and selecting the audio to be detected from the recorded audio.
A5, the video quality detection method according to A4, wherein the step of receiving the recorded audio uploaded by the target recording device and selecting the audio to be detected from the recorded audio includes:
receiving the recorded audio uploaded by the target recording device, and cutting the recorded audio according to the interception time to obtain candidate audios;
traversing the candidate audios according to the sequencing result, and taking the traversed candidate audio as the audio to be detected.
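As a rough illustration of cutting the recorded audio at the interception times, the sketch below slices a sample buffer into one candidate per interception time. The source does not state how long each candidate should be, so the fixed `window_s` length, the sample-list representation and the function name `cut_audio` are all assumptions.

```python
from typing import List, Sequence


def cut_audio(samples: Sequence[int], sample_rate: int,
              capture_times: List[float], window_s: float = 1.0) -> List[List[int]]:
    """Return one candidate clip per interception time, in ascending time order.

    window_s is an assumed clip length; the source does not specify one.
    """
    candidates = []
    for t in sorted(capture_times):
        start = int(round(t * sample_rate))
        end = start + int(round(window_s * sample_rate))
        # Slicing past the end of the buffer simply yields a shorter clip.
        candidates.append(list(samples[start:end]))
    return candidates


clips = cut_audio(list(range(10)), sample_rate=2, capture_times=[2.0, 0.0])
```

Because the clips come back in the same order as the sorted intercepted images, the i-th candidate audio lines up with the i-th image to be detected.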
A6, the video quality detection method according to any one of A1-A5, wherein the step of generating the sound-picture synchronous comparison result according to the image to be detected, the audio to be detected, the image data stream and the audio data stream specifically comprises:
performing character recognition on the image to be detected to obtain image recognition characters;
performing voice recognition on the audio to be detected to obtain voice recognition characters;
generating a character matching result according to the image recognition characters and the voice recognition characters;
determining a data stream matching result according to the image data stream and the audio data stream;
and generating a sound-picture synchronous comparison result according to the character matching result and the data stream matching result.
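The source does not say how the two matching results are combined. One plausible reading, shown here purely as an assumption, is that sound and picture are treated as synchronized only when both the character matching result and the data stream matching result are positive. The `SyncResult` type and its field names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SyncResult:
    """Sound-picture synchronous comparison result (hypothetical structure)."""
    text_matched: bool    # character matching result
    stream_matched: bool  # data stream matching result

    @property
    def in_sync(self) -> bool:
        # Assumption: both checks must pass; the source leaves the rule open.
        return self.text_matched and self.stream_matched


result = SyncResult(text_matched=True, stream_matched=False)
```

Keeping both flags in the result, rather than only the combined value, lets the server see which of the two checks failed.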
A7, the video quality detection method according to A6, wherein the step of generating the character matching result according to the image recognition characters and the voice recognition characters specifically includes:
determining character similarity according to the image recognition characters and the voice recognition characters;
and judging whether the character similarity is greater than a preset similarity threshold value or not, and generating a character matching result according to the judgment result.
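The similarity test in A7 can be sketched as follows. The source does not name a similarity measure, so this example substitutes the ratio from Python's standard `difflib.SequenceMatcher`; the threshold value `0.8` and the function names are likewise assumptions.

```python
from difflib import SequenceMatcher


def text_similarity(image_text: str, speech_text: str) -> float:
    """Similarity in [0, 1] between the image recognition and voice recognition text."""
    return SequenceMatcher(None, image_text, speech_text).ratio()


def text_match(image_text: str, speech_text: str, threshold: float = 0.8) -> bool:
    """Character matching result: True only if similarity is strictly above the threshold."""
    return text_similarity(image_text, speech_text) > threshold


same = text_match("hello world", "hello world")
different = text_match("hello world", "xyz")
```

In practice the two strings would come from a character recognition step on the image and a voice recognition step on the audio; normalizing case and punctuation beforehand would likely make the comparison more robust.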
A8, the video quality detection method according to A6, wherein the step of determining the data stream matching result according to the image data stream and the audio data stream specifically includes:
acquiring the arrival time of image data corresponding to the image data stream, and acquiring the arrival time of audio data corresponding to the audio data stream;
and generating a data stream matching result according to the image data arrival time and the audio data arrival time.
A9, the video quality detection method according to A8, wherein the step of generating a data stream matching result according to the image data arrival time and the audio data arrival time specifically includes:
determining a time difference value according to the image data arrival time and the audio data arrival time;
and judging whether the time difference value is greater than a preset time threshold value or not, and generating a data stream matching result according to a judgment result.
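A minimal sketch of the arrival-time comparison in A9 follows. It assumes arrival times are expressed in milliseconds, that a difference above the threshold means the streams do not match, and that `100` ms is a reasonable default; none of these details are given in the source.

```python
def stream_match(image_arrival_ms: int, audio_arrival_ms: int,
                 threshold_ms: int = 100) -> bool:
    """Data stream matching result based on the arrival-time difference.

    Assumption: a difference strictly greater than threshold_ms is a mismatch.
    """
    time_difference = abs(image_arrival_ms - audio_arrival_ms)
    return not (time_difference > threshold_ms)


close_enough = stream_match(1000, 1040)
too_far = stream_match(1000, 1200)
```

Using the absolute difference treats audio leading and audio lagging the picture the same way; a deployment that tolerates one direction more than the other would need two thresholds.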
A10, the video quality detection method according to any one of A1-A5, wherein the step of performing data extraction on the log file to obtain an image data stream and an audio data stream specifically includes:
performing data extraction on the log file through a preset log capturing script to obtain extracted data;
carrying out data cleaning on the extracted data to obtain data to be classified;
and carrying out data classification on the data to be classified to obtain an image data stream and an audio data stream.
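The extract, clean and classify sequence in A10 might look like the sketch below. The log line layout (`<timestamp_ms> <VIDEO|AUDIO> <payload>`), the regular expression and the `split_log` name are invented for illustration; the actual log format and the preset log capturing script are not described in the source.

```python
import re
from typing import Dict, List, Tuple

# Hypothetical log line layout: "<timestamp_ms> <VIDEO|AUDIO> <payload>"
LINE_RE = re.compile(r"^(\d+)\s+(VIDEO|AUDIO)\s+(.*)$")


def split_log(lines: List[str]) -> Dict[str, List[Tuple[int, str]]]:
    """Extract records, drop malformed or duplicate lines, and classify by stream."""
    streams: Dict[str, List[Tuple[int, str]]] = {"VIDEO": [], "AUDIO": []}
    seen = set()
    for raw in lines:
        line = raw.strip()
        match = LINE_RE.match(line)
        # Data cleaning: skip lines that do not parse or were already seen.
        if match is None or line in seen:
            continue
        seen.add(line)
        timestamp, kind, payload = match.groups()
        # Data classification: route each record to its stream.
        streams[kind].append((int(timestamp), payload))
    return streams


log = ["1000 VIDEO frame=1", "1005 AUDIO chunk=1", "garbage",
       "1000 VIDEO frame=1", "  1040 VIDEO frame=2  "]
streams = split_log(log)
```

The timestamps kept with each record are what a later step could use as the image data arrival time and audio data arrival time.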
A11, the video quality detection method according to any one of A1-A5, wherein after the step of receiving the video quality detection result fed back by the preset server according to the video detection information, the video quality detection method further comprises:
acquiring current playing interface information, and determining an information display template according to the current playing interface information;
and writing the video quality display result into the current playing interface according to the information display template, acquiring quality detection reminding information, and displaying the quality detection reminding information.
A12, the video quality detection method according to A11, wherein after the step of writing the video quality display result into the current playing interface according to the information display template, acquiring quality detection reminding information, and displaying the quality detection reminding information, the video quality detection method further comprises:
receiving manual marking operation fed back by a user according to the quality detection reminding information, and generating feedback information according to the manual marking operation;
and acquiring video information of the video to be detected, and sending the video information and the feedback information to the preset server.
B13, a video quality detection device, comprising: a memory, a processor and a video quality detection program stored on the memory and executable on the processor, wherein the video quality detection program, when executed by the processor, implements the steps of the video quality detection method as described above.
C14, a storage medium having stored thereon a video quality detection program which, when executed by a processor, implements the steps of the video quality detection method as described above.
D15, a video quality detection apparatus, comprising: a video extraction module, a log extraction module, a synchronous comparison module, a data sending module and a result receiving module;
the video extraction module is used for acquiring a video to be detected and a log file corresponding to the video to be detected, extracting the video to be detected and acquiring an image to be detected and an audio to be detected;
the log extraction module is used for extracting data of the log file to obtain an image data stream and an audio data stream;
the synchronous comparison module is used for generating a sound-picture synchronous comparison result according to the image to be detected, the audio to be detected, the image data stream and the audio data stream;
the data sending module is used for sending the image data of the image to be detected, the log file, the sound-picture synchronous comparison result and the audio data of the audio to be detected to a preset server as video detection information;
and the result receiving module is used for receiving a video quality detection result fed back by the preset server according to the video detection information.
D16, the video quality detection apparatus according to D15, wherein the video extraction module is further configured to acquire a video to be detected and a log file corresponding to the video to be detected;
the video extraction module is further used for intercepting the video to be detected when the video to be detected starts to be played, so as to obtain an image to be detected;
the video extraction module is also used for acquiring current equipment information and determining a current audio recording mode according to the current equipment information;
the video extraction module is further configured to record an audio of the video to be detected according to the current audio recording mode, so as to obtain the audio to be detected.
D17, the video quality detection apparatus according to D16, wherein the video extraction module is further configured to perform image interception on the video to be detected through a preset screenshot script when the video to be detected starts to play, so as to obtain an intercepted image set;
the video extraction module is also used for acquiring the interception time of each intercepted image in the intercepted image set and sequencing the intercepted images according to the interception time to obtain a sequencing result;
the video extraction module is also used for traversing the intercepted image according to the sequencing result and taking the traversed intercepted image as an image to be detected.
D18, the video quality detection apparatus according to D17, wherein the video extraction module is further configured to determine a target recording device and an audio playing device according to the current audio recording mode, wherein the target recording device is arranged at a position opposite to the audio playing device;
the video extraction module is further used for controlling the audio playing device to operate and controlling the target recording device to start audio recording;
the video extraction module is further used for receiving the recorded audio uploaded by the target recording equipment and selecting the audio to be detected from the recorded audio.
D19, the video quality detection apparatus according to D18, wherein the video extraction module is further configured to receive the recorded audio uploaded by the target recording device, and cut the recorded audio according to the interception time to obtain candidate audios;
the video extraction module is further configured to traverse the candidate audios according to the sequencing result, and take the traversed candidate audio as the audio to be detected.
D20, the video quality detection apparatus according to any one of D15-D19, wherein the synchronous comparison module is further configured to perform character recognition on the image to be detected to obtain image recognition characters;
the synchronous comparison module is also used for carrying out voice recognition on the audio to be detected to obtain voice recognition characters;
the synchronous comparison module is also used for generating a character matching result according to the image recognition characters and the voice recognition characters;
the synchronous comparison module is further used for determining a data stream matching result according to the image data stream and the audio data stream;
and the synchronous comparison module is also used for generating a sound-picture synchronous comparison result according to the character matching result and the data stream matching result.

Claims (10)

1. A video quality detection method is characterized by comprising the following steps:
acquiring a video to be detected and a log file corresponding to the video to be detected, and extracting the video to be detected to obtain an image to be detected and an audio to be detected;
performing data extraction on the log file to obtain an image data stream and an audio data stream;
generating a sound-picture synchronous comparison result according to the image to be detected, the audio to be detected, the image data stream and the audio data stream;
sending the image data of the image to be detected, the log file, the sound-picture synchronous comparison result and the audio data of the audio to be detected as video detection information to a preset server;
and receiving a video quality detection result fed back by the preset server according to the video detection information.
2. The video quality detection method according to claim 1, wherein the step of obtaining the video to be detected and the log file corresponding to the video to be detected, and extracting the video to be detected to obtain the image to be detected and the audio to be detected specifically comprises:
acquiring a video to be detected and a log file corresponding to the video to be detected;
when the video to be detected starts to play, image interception is carried out on the video to be detected, and an image to be detected is obtained;
acquiring current equipment information, and determining a current audio recording mode according to the current equipment information;
and recording the audio of the video to be detected according to the current audio recording mode to obtain the audio to be detected.
3. The video quality detection method according to claim 2, wherein the step of capturing the image of the video to be detected to obtain the image to be detected when the video to be detected starts to play comprises:
when the video to be detected starts to play, image interception is carried out on the video to be detected through a preset screenshot script, and an intercepted image set is obtained;
acquiring the interception time of each intercepted image in the intercepted image set, and sequencing the intercepted images according to the interception time to obtain a sequencing result;
traversing the intercepted image according to the sequencing result, and taking the traversed intercepted image as an image to be detected.
4. The video quality detection method according to claim 3, wherein the step of performing audio recording on the video to be detected according to the current audio recording mode to obtain the audio to be detected specifically comprises:
determining a target recording device and an audio playing device according to the current audio recording mode, wherein the target recording device is arranged at a position opposite to the audio playing device;
controlling the audio playing device to operate, and controlling the target recording device to start audio recording;
and receiving recorded audio uploaded by the target recording equipment, and selecting the audio to be detected from the recorded audio.
5. The method for detecting video quality according to claim 4, wherein the step of receiving the recorded audio uploaded by the target recording device and selecting the audio to be detected from the recorded audio includes:
receiving the recorded audio uploaded by the target recording device, and cutting the recorded audio according to the interception time to obtain candidate audios;
traversing the candidate audios according to the sequencing result, and taking the traversed candidate audio as the audio to be detected.
6. The video quality detection method according to any one of claims 1 to 5, wherein the step of generating a sound-picture synchronous comparison result according to the image to be detected, the audio to be detected, the image data stream and the audio data stream specifically comprises:
performing character recognition on the image to be detected to obtain image recognition characters;
performing voice recognition on the audio to be detected to obtain voice recognition characters;
generating a character matching result according to the image recognition characters and the voice recognition characters;
determining a data stream matching result according to the image data stream and the audio data stream;
and generating a sound-picture synchronous comparison result according to the character matching result and the data stream matching result.
7. The video quality detection method according to claim 6, wherein the step of generating the character matching result according to the image recognition characters and the voice recognition characters specifically comprises:
determining character similarity according to the image recognition characters and the voice recognition characters;
and judging whether the character similarity is greater than a preset similarity threshold value or not, and generating a character matching result according to the judgment result.
8. A video quality detection apparatus, characterized in that the video quality detection apparatus comprises: memory, a processor and a video quality detection program stored on the memory and executable on the processor, the video quality detection program when executed by the processor implementing the steps of the video quality detection method according to any one of claims 1 to 7.
9. A storage medium having stored thereon a video quality detection program which, when executed by a processor, implements the steps of the video quality detection method according to any one of claims 1 to 7.
10. A video quality detection apparatus, characterized in that the video quality detection apparatus comprises: a video extraction module, a log extraction module, a synchronous comparison module, a data sending module and a result receiving module;
the video extraction module is used for acquiring a video to be detected and a log file corresponding to the video to be detected, extracting the video to be detected and acquiring an image to be detected and an audio to be detected;
the log extraction module is used for extracting data of the log file to obtain an image data stream and an audio data stream;
the synchronous comparison module is used for generating a sound-picture synchronous comparison result according to the image to be detected, the audio to be detected, the image data stream and the audio data stream;
the data sending module is used for sending the image data of the image to be detected, the log file, the sound-picture synchronous comparison result and the audio data of the audio to be detected to a preset server as video detection information;
and the result receiving module is used for receiving a video quality detection result fed back by the preset server according to the video detection information.
CN202011610103.1A 2020-12-29 2020-12-29 Video quality detection method, device, storage medium and apparatus Pending CN114760460A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011610103.1A CN114760460A (en) 2020-12-29 2020-12-29 Video quality detection method, device, storage medium and apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011610103.1A CN114760460A (en) 2020-12-29 2020-12-29 Video quality detection method, device, storage medium and apparatus

Publications (1)

Publication Number Publication Date
CN114760460A true CN114760460A (en) 2022-07-15

Family

ID=82324646

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011610103.1A Pending CN114760460A (en) 2020-12-29 2020-12-29 Video quality detection method, device, storage medium and apparatus

Country Status (1)

Country Link
CN (1) CN114760460A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116013365A (en) * 2023-03-21 2023-04-25 深圳联友科技有限公司 Voice full-automatic test method
CN116013365B (en) * 2023-03-21 2023-06-02 深圳联友科技有限公司 Voice full-automatic test method

Similar Documents

Publication Publication Date Title
CN108419141B (en) Subtitle position adjusting method and device, storage medium and electronic equipment
CN110139062B (en) Video conference record creating method and device and terminal equipment
KR102087882B1 (en) Device and method for media stream recognition based on visual image matching
CN109309844B (en) Video speech processing method, video client and server
US11281707B2 (en) System, summarization apparatus, summarization system, and method of controlling summarization apparatus, for acquiring summary information
CN110505497B (en) Cloud mobile phone operation monitoring method, system, device and storage medium
US10769247B2 (en) System and method for interacting with information posted in the media
CN113099156B (en) Video conference live broadcasting method, system, equipment and storage medium
CN110677718B (en) Video identification method and device
US20160125889A1 (en) Methods and systems for decreasing latency of content recognition
CN114902687A (en) Game screen recording method and device and computer readable storage medium
CN114760460A (en) Video quality detection method, device, storage medium and apparatus
WO2024025714A1 (en) Document portion identification in a recorded video
CN109271982B (en) Method for identifying multiple identification areas, identification terminal and readable storage medium
CN108881766B (en) Video processing method, device, terminal and storage medium
JP2017021672A (en) Search device
TW201626364A (en) System and method for recovering missed voice automatically
CN106775701B (en) Client automatic evidence obtaining method and system
CN113296660A (en) Image processing method and device and electronic equipment
CN111814714B (en) Image recognition method, device, equipment and storage medium based on audio and video recording
CN113515670A (en) Method, device and storage medium for identifying state of movie and television resource
CN112699720A (en) Monitoring method, device, storage medium and device based on character information set
CN110647500A (en) File storage method, device, terminal and computer readable storage medium
CN112446850A (en) Adaptation test method and device and electronic equipment
CN111816183B (en) Voice recognition method, device, equipment and storage medium based on audio and video recording

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination