WO2019061285A1 - Video recording method and video recording system of intelligent terminal - Google Patents

Video recording method and video recording system of intelligent terminal Download PDF

Info

Publication number
WO2019061285A1
WO2019061285A1 PCT/CN2017/104354 CN2017104354W WO2019061285A1 WO 2019061285 A1 WO2019061285 A1 WO 2019061285A1 CN 2017104354 W CN2017104354 W CN 2017104354W WO 2019061285 A1 WO2019061285 A1 WO 2019061285A1
Authority
WO
WIPO (PCT)
Prior art keywords
target
information
image
audio
face
Prior art date
Application number
PCT/CN2017/104354
Other languages
French (fr)
Chinese (zh)
Inventor
金晓兰
Original Assignee
深圳传音通讯有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳传音通讯有限公司 filed Critical 深圳传音通讯有限公司
Priority to PCT/CN2017/104354 priority Critical patent/WO2019061285A1/en
Publication of WO2019061285A1 publication Critical patent/WO2019061285A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition

Definitions

  • the present invention relates to the field of video recording technology for intelligent terminals, and in particular, to a video recording method and a video recording system for an intelligent terminal.
  • intelligent terminals With the development and advancement of technology, the development of intelligent terminals is also changing with each passing day. Intelligent terminals Currently, intelligent terminals have gradually penetrated into all aspects of people's lives because of their convenience and functionality, and have become an indispensable auxiliary equipment in people's lives. At the same time, with the diversification of the functionality of smart terminals, the convenience of the functions of smart terminals has become one of the user's choice factors.
  • the present invention provides a video recording method and a video recording system for an intelligent terminal.
  • the invention utilizes a face recognition technology and a face tracking technology to preset a face image and audio information of a target person in the smart terminal, so that the intelligent terminal autonomously tracks the image or audio of the target person during the recording process, and Discharges the image or audio of a non-target person.
  • the present invention provides a method for recording a smart terminal, comprising the steps of: starting a recording system of the smart terminal, capturing a target image including a target person, and identifying face information of a person in the target image, Matching the face information with a preset standard face information, and when the face information matches the standard person information, locking the target face in the target image according to the face information, according to The target face information anchors an overall outline of the target person in the target image, and blurs a person image other than the target person in the target image.
  • the data table is matched, the action state of the target person is calculated, and the target person in the target image is anchored.
  • the recording method further includes: after the target image is recorded, capturing an operation instruction, performing frame processing on the target image, performing super-resolution processing on a frame-by-frame basis, and performing each frame image in the target image.
  • the blurred image in the picture is clear.
  • the recording method further includes: reading audio information in the target image, matching with a preset standard audio information, extracting target audio information that matches the standard audio information, and deleting the target image. Audio information other than the target audio information.
  • the step of extracting target audio information matching the standard audio information comprises: extracting an audio file in the target image, framing the audio file, and extracting an audio frame matching the standard audio information .
  • Another aspect of the present invention provides a recording system of an intelligent terminal, which includes the following module, a system control module, a recording system of the smart terminal, a camera module, and a target image including a target person, a face recognition module Identifying face information of a person in the target image, and a face verification module matching the face information with a preset standard face information, the face positioning module, when the face information and the face information When the standard person information is matched, the target face in the target image is locked according to the face information, and the human body positioning module anchors the entire target person in the target image according to the target face information.
  • the contour, the first image processing module blurs the image of the person other than the target person in the target image.
  • the human body positioning module includes a human body recognition unit that acquires human body information of the target person in the target image based on human body skin information in the target human face, and a human body positioning unit that will The information is matched with a preset character motion data table, the action state of the target person is calculated, and the target person in the target image is anchored.
  • the recording system further includes: an instruction module, capturing an operation instruction, an image framing module, performing framing processing on the target image, and a second image processing module performing super-resolution processing on a frame-by-frame basis to The blurred image in each frame of the image is sharpened.
  • an instruction module capturing an operation instruction
  • an image framing module performing framing processing on the target image
  • a second image processing module performing super-resolution processing on a frame-by-frame basis to The blurred image in each frame of the image is sharpened.
  • the video recording system further includes: a first information matching module, reading audio information in the target image, matching with a preset standard audio information, and extracting, by the target audio extraction module, the standard audio information The matched target audio information, the audio filtering module, deletes audio information other than the target audio information in the target image.
  • a first information matching module reading audio information in the target image, matching with a preset standard audio information, and extracting, by the target audio extraction module, the standard audio information The matched target audio information, the audio filtering module, deletes audio information other than the target audio information in the target image.
  • the target audio extraction module includes an audio framing unit that extracts audio in the target image a file, the audio file is framed, and the target audio extraction unit extracts an audio frame that matches the standard audio information.
  • the technical advantage of the present invention is that the present invention provides a video recording method and a video recording system for an intelligent terminal, which combines face recognition technology and face tracking technology, and further utilizes the particularity of human skin.
  • a preset character motion data table is used to calculate the overall contour of the target person, thereby realizing the technical effect of tracking the target person during the image recording process; at the same time, combined with the fuzzy calculation method in the image processing technology, the discharged recorded image is realized.
  • the non-personal influence interference; and the invention is based on the recorded image file, and also provides a method for image file sharpness recovery and audio file filtering, and the user can implement the recorded image file based on the above technical means of the present invention. Further optimization processing.
  • FIG. 1 is a schematic flow chart of a video recording method of a smart terminal according to a preferred embodiment of the present invention
  • FIG. 2 is a schematic flow chart of a method for clearing an image file according to a preferred embodiment of the present invention
  • FIG. 3 is a schematic flow chart of a method for filtering an audio file according to a preferred embodiment of the present invention
  • FIG. 4 is a structural diagram of a smart terminal recording system in accordance with a preferred embodiment of the present invention.
  • FIG. 5 is a structural diagram of a video recording system of a smart terminal according to another preferred embodiment of the present invention.
  • FIG. 6 is a structural diagram of a video recording method of a smart terminal according to another preferred embodiment of the present invention.
  • first, second, third, etc. may be employed in the present disclosure to describe various information, such information should not be limited to these terms. These terms are only used to distinguish the same type of information from each other.
  • the first intelligent terminal may also be referred to as a second smart terminal without departing from the scope of the present disclosure.
  • the second smart terminal may also be referred to as a first smart terminal.
  • an aspect of the present invention provides a video recording method of an intelligent terminal.
  • FIG. 1 is a schematic flowchart of a video recording method of a smart terminal according to a preferred embodiment of the present invention. It can be seen from the figure that the recording method of the intelligent terminal provided in this embodiment mainly includes the following steps:
  • the smart terminal includes a system control module.
  • a startup command may be sent to the smart terminal by touching a shortcut icon of the smart terminal display interface or a text name in the shortcut menu.
  • the system control module of the intelligent terminal reads an operation instruction issued by the user, and after the analysis, starts the recording system of the intelligent terminal according to the startup instruction.
  • the smart terminal is provided with a camera module, and the camera module includes a front and/or a rear camera disposed on the smart terminal, and an image sensor and digital signal processing disposed inside the smart terminal. And so on.
  • the camera module can capture an image containing the target person through the camera of the smart terminal, and generate an image of the smart terminal readable by the image sensor and the digital information number processor, etc., and cache the smart signal. In the terminal.
  • the video recording system of the smart terminal includes a face recognition module, and the face recognition module can read the target in time sequence after the video recording process is completed or the video file is obtained to obtain an image file.
  • the face recognition module can read the target in time sequence after the video recording process is completed or the video file is obtained to obtain an image file.
  • Each frame of the image is imaged and the face information in each frame of the image is identified.
  • the face recognition module can accurately calibrate the position and size of the face in the image.
  • the pattern features contained in the face image are very rich, such as histogram features, color features, template features, structural features, and Haar features.
  • the face recognition module may identify the face information from the frame-by-frame picture.
  • the face recognition module uses the Adaboost algorithm to select some rectangular features (weak classifiers) that best represent the face, and constructs the weak classifier into a strong classifier according to the weighted voting method. Then, several strong classifiers obtained by training are connected in series to form a cascaded classifier of cascade structure, which effectively improves the detection speed of the classifier.
  • the face recognition module may further perform image preprocessing to reduce various conditions and random interference in the face recognition process.
  • the preprocessing process mainly includes ray compensation, gradation transformation, histogram equalization, normalization, geometric correction, filtering and sharpening of the face image.
  • the smart terminal further includes a face verification module, and the face verification module may extract the face information and perform search matching with a preset standard face information stored in the database.
  • a threshold is set in advance, and when the similarity between the face information and the preset face information exceeds the threshold, the result of the matching is output; otherwise, no deal with.
  • the following algorithm can be used to achieve matching of facial information: a method based on geometric features; a method of local feature analysis (Local Face Analysis); a method of feature face (Eigenface or PCA); Model method; Neural Networks method; Hidden Markov Model; Gabor wavelet transform + pattern matching.
  • a method based on geometric features a method of local feature analysis (Local Face Analysis); a method of feature face (Eigenface or PCA); Model method; Neural Networks method; Hidden Markov Model; Gabor wavelet transform + pattern matching.
  • the target face in the target image is locked according to the face information
  • the face positioning module in the smart terminal receives the verification result sent by the face verification module, and then records according to the video. Time sequence, locking the target face in each frame of the target image; or sequentially reading the target face in each frame of the picture recorded by the camera module and locking.
  • the recording system of the smart terminal further includes a person positioning module, and the person positioning module can read the target face locked by the face positioning module, and according to the target face information An overall contour of the target person is anchored in the target image.
  • the specific step of anchoring the overall contour of the target person from the frame image of the target image includes:
  • Human skin is one of the distinguishing features of the human body. Like the information such as fingerprints and irises, human skin is unique. Different human skins have different textures, colors, and brightness. The same person's skin has a certain similarity.
  • the human body recognizing unit in the human body positioning module of the smart terminal can use the human skin information as a reference to identify the human body with the highest similarity to the skin feature of the target human face in the frame image of the target image. Other body parts such as hands, feet, arms, legs, etc., and obtain information such as the position, shape, and the like of other parts of the human body relative to the target face.
  • the human body recognizing unit further has a human body positioning unit, and the human body positioning unit can read information about the position, shape, and the like of the other parts of the human body relative to the target human face, and the The human body information is matched with a preset character motion data table, and the action state of the target person is calculated, thereby anchoring the target person in the frame image of the target image.
  • the preset character motion data table is established by counting the positional relationship of the human body's face, hands, feet, arms, legs, etc. in various actions such as running, jumping, walking, and the like.
  • a first image processing module is further disposed in the smart terminal, and the image processing module can read an overall outline of the target person anchored by the human body positioning module, and Reversely selecting a frame image area that is not anchored, and using a blurring algorithm to blur an area other than the target person in the frame image in the target image, thereby blurring the image of the non-target person in the target image Chemical.
  • the present invention utilizes a combination of face recognition technology and face tracking technology, and further utilizes the particularity of the human skin to calculate the overall contour of the target person through a preset character motion data table, thereby realizing the image.
  • the present invention also provides a method for clearing the blurred image file.
  • 2 is a schematic flow chart of a method for clearing an image file according to a preferred embodiment of the present invention. It can be seen from the figure that, according to the recording method of the smart terminal in the above embodiment, the video recording method provided in this embodiment further includes:
  • the smart terminal of the embodiment includes an instruction module, where the instruction module can capture a trigger operation of the display interface of the smart terminal, and parse the trigger pressure, location, and the like of the trigger operation, thereby generating a running instruction. Send out.
  • the smart terminal further includes an image framing module, and the image framing module can receive the running instruction, and according to the running instruction, read a cache and a target of a storage space of the smart terminal. And performing image processing on the target image to decompose the target image into at least one single frame image.
  • a second image processing module is further disposed in the smart terminal, and the second image processing module may perform super-resolution processing on a frame-by-frame basis, and each frame image in the target image is The blurred image is clear. That is, the time resolution (acquiring a multi-frame image sequence of the same scene) is used in exchange for spatial resolution, and the conversion from temporal resolution to spatial resolution is realized.
  • the super-resolution method used in the embodiment includes: a super-resolution reconstruction method: a regularization reconstruction method; a non-uniform spatial sample interpolation method; an iterative back projection method (IBP); and a set theory reconstruction Method (convex set projection POCS); statistical reconstruction method (maximum a posteriori probability MAP and maximum likelihood estimation ML); hybrid ML/MAP/POCS method; adaptive filtering/Wiener filtering/Kalman filtering method; deterministic reconstruction method ; methods based on learning and pattern recognition, etc.
  • a super-resolution reconstruction method a regularization reconstruction method; a non-uniform spatial sample interpolation method; an iterative back projection method (IBP); and a set theory reconstruction Method (convex set projection POCS); statistical reconstruction method (maximum a posteriori probability MAP and maximum likelihood estimation ML); hybrid ML/MAP/POCS method; adaptive filtering/Wiener filtering/Kalman filtering method; deterministic reconstruction method ; methods based on
  • FIG. 3 it is a schematic flowchart of a method for filtering an audio file according to a preferred embodiment of the present invention. It can be seen from the figure that the filtering method of the audio file provided by this embodiment mainly includes the following steps:
  • the smart terminal further includes a first information matching module, where the first information matching module can read the audio information in the target image and associate it with a preset standard. The audio information is matched such that the target audio of the target person is marked in the audio information.
  • the smart terminal further has a target audio extraction module, and the target audio extraction module can read the marking information of the first information matching module. And extracting the labeled sub audio file in the audio file from the audio file of the target image.
  • the specific step of extracting the marked audio file includes:
  • the audio stream needs to be framed, that is, the audio stream is cut into a short segment, and each segment is called a frame unit audio stream. Therefore, in the embodiment, the target audio extraction module is provided with an audio framing unit, and the audio framing unit can use the moving window function to implement the framing operation on the audio file.
  • the target audio extraction module is provided with an audio framing unit, and the audio framing unit can use the moving window function to implement the framing operation on the audio file.
  • the unit audio stream there is generally overlap between the unit audio stream and the unit audio stream.
  • the stack is called a frame length of 25 ms and a frame shift of 10 ms.
  • the target audio extraction module further includes a target audio extraction unit, and the target audio extraction unit may read the tag information of the first information matching module, and extract the audio file from the target image. A sub-audio file that is marked in the audio file.
  • the smart terminal further has an audio filtering module, and the audio filtering module can also read the marking information of the first information matching module, and delete the audio file of the target image. A sub-audio file that is not marked in the audio file.
  • the marking information of the first information matching module is read by an audio adjustment module, and the marked audio file is enhanced, and the unmarked audio file portion is correspondingly weakened. , thereby achieving the technical effect of highlighting the audio of the target person in the image file.
  • the present invention is based on a recorded image file, and further provides a method for image file resolution recovery and audio file filtering.
  • the user can further optimize the recorded image file based on the above technical means of the present invention. deal with.
  • FIG. 4 it is a structural diagram of a smart terminal recording system in accordance with a preferred embodiment of the present invention.
  • the recording system of the intelligent terminal provided by this embodiment includes the following modules:
  • a recording system for starting the smart terminal Specifically, when the user wants to record a video file, a startup command may be sent to the smart terminal by touching a shortcut icon of the smart terminal display interface or a text name in the shortcut menu.
  • the system control module of the smart terminal reads an operation instruction issued by the user, and starts the recording system of the intelligent terminal according to the startup instruction after parsing.
  • the camera module Used to capture a target image that includes the target person.
  • the camera module includes a front and/or rear camera disposed on the smart terminal, and an image sensor, a digital signal processor, and the like disposed inside the smart terminal.
  • the camera module can capture an image containing the target person through the camera of the smart terminal, and generate an image of the smart terminal readable by the image sensor and the digital information number processor, etc., and cache the smart signal. In the terminal.
  • a face information for identifying a person in the target image can be in chronological order Reading each frame of the target image and identifying face information in each frame of the image.
  • the face verification module may extract the face information and perform search matching with a preset standard face information stored in a database.
  • a threshold is set in advance, and when the similarity between the face information and the preset face information exceeds the threshold, the result of the matching is output; otherwise, no deal with.
  • the face positioning module locks the target face in the target image according to the face information. Specifically, after receiving the verification result sent by the face verification module, the face positioning module locks the target face in each frame of the target image according to the time sequence of the video recording. Or sequentially read the target face in each frame of the picture recorded by the camera module and lock it.
  • An anchoring for anchoring an overall outline of the target person in the target image according to the target face information The person positioning module can read the target face locked by the face positioning module, and anchor the overall contour of the target person in the target image according to the target face information.
  • the human body positioning module has the following units:
  • the human body recognizing unit in the human body positioning module of the smart terminal can use the human skin information as a reference to identify a human hand, a foot, and an arm having the highest similarity to the skin feature of the target human face in the frame image of the target image. And other body parts such as legs, and obtain information such as the position and shape of the other parts of the human body relative to the target face.
  • the human body positioning unit can read information about the position, shape, and the like of the other parts of the human body relative to the target human face, and match the human body information with a preset character motion data table to calculate the target.
  • the action state of the character thereby anchoring the target person in the frame picture of the target image.
  • the image processing module can read the overall contour of the target person anchored by the human body positioning module, and select a frame image region that is not anchored in the reverse direction, and adopt a blur algorithm to frame the frame image in the target image.
  • the area other than the target person is blurred, thereby blurring the image of the non-target person in the target image.
  • the present invention also provides a system capable of clearing the blurred image file.
  • FIG. 5 it is a structural diagram of a video recording system of a smart terminal according to another preferred embodiment of the present invention. As can be seen from the figure, the video recording system provided in this embodiment further includes:
  • the instruction module may capture a trigger operation of the display interface of the smart terminal, and parse the trigger pressure, location, and the like of the trigger operation, thereby generating a running instruction to send out.
  • the image framing module can receive the running instruction, and according to the running instruction, read a target image buffered with a storage space of the smart terminal, and perform framing processing on the target image to The image is decomposed frame by frame into at least one single frame picture.
  • the second image processing module may perform super-resolution processing on a frame-by-frame basis to sharpen a blurred image in each frame of the target image. That is, the time resolution (acquiring a multi-frame image sequence of the same scene) is used in exchange for spatial resolution, and the conversion from temporal resolution to spatial resolution is realized.
  • FIG. 6 is a structural diagram of a video recording method of a smart terminal according to another preferred embodiment of the present invention.
  • the video recording system provided in this embodiment further includes the following modules:
  • the first information matching module may read the audio information in the target image and match it with a preset standard audio information, thereby marking the target audio of the target person in the audio information.
  • the target audio extraction module is readable Taking the tag information of the first information matching module, and extracting the labeled sub audio file in the audio file from the audio file of the target image.
  • the audio extraction module includes the following units:
  • an audio framing unit is disposed in the target audio extraction module, and the audio framing unit may use a moving window function to implement a framing operation on the audio file.
  • the target audio extraction unit may read the tag information of the first information matching module, and extract the labeled sub audio file in the audio file from the audio file of the target image.
  • the audio filtering module may also read the tag information of the first information matching module, and delete the unlabeled sub audio file in the audio file from the audio file of the target image.
  • an audio adjustment module may be disposed in the smart terminal, and the marking information of the first information matching module is read by the audio adjustment module, and is marked.
  • the audio file is enhanced and the unmarked portion of the audio file is weakened accordingly, thereby achieving the technical effect of highlighting the audio of the target person in the image file.
  • the present invention provides a video recording system for a smart terminal, which combines face recognition technology and face tracking technology, and further utilizes the particularity of human skin to calculate a target person through a preset character motion data table.
  • the overall contour thus, the technical effect of tracking the target person during the image recording process; at the same time, combined with the fuzzy calculation method in the image processing technology, the non-personal influence interference in the recorded image is discharged.

Abstract

Provided in the present invention are a video recording method and a video recording system of an intelligent terminal, comprising the following steps: starting up a video recording system of an intelligent terminal; capturing a target image comprising a target person; identifying face information of a person in the target image; matching the face information with preset standard face information; when the face information matches the standard person information, locking the target face in the target image according to the face information; anchoring the overall contour of the target person in the target image according to the target face information; and blurring images of people other than the target person in the target image. According to the present invention, the intelligent terminal may autonomously track the image or audio of a target person during the recording process, and thus eliminate the interference of the image or audio of a non-target person.

Description

一种智能终端的录像方法及录像系统Video recording method and video recording system of intelligent terminal 技术领域Technical field
本发明涉及智能终端的录像技术领域,尤其涉及一种智能终端的录像方法及录像系统。The present invention relates to the field of video recording technology for intelligent terminals, and in particular, to a video recording method and a video recording system for an intelligent terminal.
背景技术Background technique
随着科技的发展和进步,智能终端的发展也是日新月异,智能终端当前,智能终端因其便利性和功能性,已经逐渐渗入到人们生活的方方面面,成为人们生活中不可或缺的辅助设备。同时,随着智能终端功能性的多样化,智能终端各功能的便捷性也成为用户的选择因素之一。With the development and advancement of technology, the development of intelligent terminals is also changing with each passing day. Intelligent terminals Currently, intelligent terminals have gradually penetrated into all aspects of people's lives because of their convenience and functionality, and have become an indispensable auxiliary equipment in people's lives. At the same time, with the diversification of the functionality of smart terminals, the convenience of the functions of smart terminals has become one of the user's choice factors.
特别,由于智能终端摄影、录像技术的发展,智能终端已逐渐替代其他摄影、录像设备,成为人们生活中常用的摄影或者录像装置。但是,对于非专业的智能终端用户来说,如何能在录像过程中及时捕获目标人物的影像,而排出非目标人物的干扰,利用现有智能终端的摄影或者录像技术,仍然难以实现。In particular, due to the development of smart terminal photography and video technology, smart terminals have gradually replaced other photography and video equipment, and have become a common photography or video recording device in people's lives. However, for non-professional intelligent terminal users, how to capture the image of the target person in time during the recording process and discharge the interference of the non-target person, using the photography or video technology of the existing smart terminal, is still difficult to achieve.
发明内容Summary of the invention
为解决上述问题,本发明提供一种智能终端的录像方法及录像系统。本发明利用人脸识别技术及人脸追踪技术等,通过在智能终端中预设一目标人物的脸部图像及音频信息,实现在录像过程中智能终端自主地追踪目标人物的影像或者音频,而排出非目标人物的影像或者音频的干扰。To solve the above problems, the present invention provides a video recording method and a video recording system for an intelligent terminal. The invention utilizes a face recognition technology and a face tracking technology to preset a face image and audio information of a target person in the smart terminal, so that the intelligent terminal autonomously tracks the image or audio of the target person during the recording process, and Discharges the image or audio of a non-target person.
具体地,本发明提供一种智能终端的录像方法,其包括以下步骤:启动所述智能终端的录像系统,捕获一包括目标人物的目标影像,识别所述目标影像中的人物的人脸信息,将所述人脸信息与一预设的标准人脸信息进行匹配,当所述人脸信息与标准人信息相匹配时,根据所述人脸信息锁定所述目标影像中的目标人脸,根据所述目标人脸信息在所述目标影像中的锚定所述目标人物的整体轮廓,将所述目标影像中所述目标人物以外的人物图像进行模糊处理。Specifically, the present invention provides a method for recording a smart terminal, comprising the steps of: starting a recording system of the smart terminal, capturing a target image including a target person, and identifying face information of a person in the target image, Matching the face information with a preset standard face information, and when the face information matches the standard person information, locking the target face in the target image according to the face information, according to The target face information anchors an overall outline of the target person in the target image, and blurs a person image other than the target person in the target image.
优选地,根据所述目标人脸信息在所述目标影像中的锁定所述目标人物的整体轮廓 的步骤包括,锁定所述目标影像中的目标人脸后,以所述目标人脸中的人体皮肤信息为基准,获取所述目标影像中所述目标人物的身体信息,与一预设人物动作数据表进行匹配,计算所述目标人物的动作状态,锚定所述目标影像中的目标人物。Preferably, locking the overall outline of the target person in the target image according to the target face information The step of: capturing the target face in the target image, taking the body skin information in the target face as a reference, acquiring body information of the target person in the target image, and performing a preset character action The data table is matched, the action state of the target person is calculated, and the target person in the target image is anchored.
优选地,上述录像方法还包括,当所述目标影像录制完成后,捕获一操作指令,将所述目标影像进行分帧处理,逐帧进行超分辨率处理,将所述目标影像中每帧图片中的模糊图像清晰化。Preferably, the recording method further includes: after the target image is recorded, capturing an operation instruction, performing frame processing on the target image, performing super-resolution processing on a frame-by-frame basis, and performing each frame image in the target image. The blurred image in the picture is clear.
优选地,上述录像方法还包括,读取所述目标影像中的音频信息,与一预设的标准音频信息进行匹配,提取与所述标准音频信息相匹配的目标音频信息,删除所述目标影像中所述目标音频信息以外的音频信息。Preferably, the recording method further includes: reading audio information in the target image, matching with a preset standard audio information, extracting target audio information that matches the standard audio information, and deleting the target image. Audio information other than the target audio information.
优选地,提取与所述标准音频信息相匹配的目标音频信息的步骤包括,提取所述目标影像中的音频文件,将所述音频文件分帧,提取与所述标准音频信息相匹配的音频帧。Preferably, the step of extracting target audio information matching the standard audio information comprises: extracting an audio file in the target image, framing the audio file, and extracting an audio frame matching the standard audio information .
本发明另一方面,在于提供一种智能终端的录像系统,其包括以下模块,系统控制模块,启动所述智能终端的录像系统,摄像模块,捕获一包括目标人物的目标影像,人脸识别模块,识别所述目标影像中的人物的人脸信息,人脸校验模块,将所述人脸信息与一预设的标准人脸信息进行匹配,人脸定位模块,当所述人脸信息与标准人信息相匹配时,根据所述人脸信息锁定所述目标影像中的目标人脸,人体定位模块,根据所述目标人脸信息在所述目标影像中的锚定所述目标人物的整体轮廓,第一图像处理模块,将所述目标影像中所述目标人物以外的人物图像进行模糊处理。Another aspect of the present invention provides a recording system of an intelligent terminal, which includes the following module, a system control module, a recording system of the smart terminal, a camera module, and a target image including a target person, a face recognition module Identifying face information of a person in the target image, and a face verification module matching the face information with a preset standard face information, the face positioning module, when the face information and the face information When the standard person information is matched, the target face in the target image is locked according to the face information, and the human body positioning module anchors the entire target person in the target image according to the target face information. The contour, the first image processing module blurs the image of the person other than the target person in the target image.
优选地,所述人体定位模块包括,人体识别单元,以所述目标人脸中的人体皮肤信息为基准,获取所述目标影像中所述目标人物的人体信息,人体定位单元,将所述人体信息与一预设人物动作数据表进行匹配,计算所述目标人物的动作状态,锚定所述目标影像中的目标人物。Preferably, the human body positioning module includes a human body recognition unit that acquires human body information of the target person in the target image based on human body skin information in the target human face, and a human body positioning unit that will The information is matched with a preset character motion data table, the action state of the target person is calculated, and the target person in the target image is anchored.
优选地,上述录像系统还包括,指令模块,捕获一操作指令,影像分帧模块,将所述目标影像进行分帧处理,第二图像处理模块,逐帧进行超分辨率处理,将所述目标影像中每帧图片中的模糊图像清晰化。Preferably, the recording system further includes: an instruction module, capturing an operation instruction, an image framing module, performing framing processing on the target image, and a second image processing module performing super-resolution processing on a frame-by-frame basis to The blurred image in each frame of the image is sharpened.
优选地,上述录像系统还包括,第一信息匹配模块,读取所述目标影像中的音频信息,与一预设的标准音频信息进行匹配,目标音频提取模块,提取与所述标准音频信息相匹配的目标音频信息,音频过滤模块,删除所述目标影像中所述目标音频信息以外的音频信息。Preferably, the video recording system further includes: a first information matching module, reading audio information in the target image, matching with a preset standard audio information, and extracting, by the target audio extraction module, the standard audio information The matched target audio information, the audio filtering module, deletes audio information other than the target audio information in the target image.
优选地,所述目标音频提取模块包括,音频分帧单元,提取所述目标影像中的音频 文件,将所述音频文件分帧,目标音频提取单元,提取与所述标准音频信息相匹配的音频帧。Preferably, the target audio extraction module includes an audio framing unit that extracts audio in the target image a file, the audio file is framed, and the target audio extraction unit extracts an audio frame that matches the standard audio information.
与现有技术相比较,本发明的技术优势在于:本发明提出一种智能终端的录像方法及录像系统,利用人脸识别技术及人脸追踪技术结合,并进一步利用人体皮肤的特殊性,通过一预设人物动作数据表,推算出目标人物的整体轮廓,从而,实现在影像录制过程中追踪目标人物的技术效果;同时,再结合图像处理技术中的模糊化计算方法,实现排出录制影像中的非人物的影响干扰;而且本发明基于录制获取的影像文件,还提出一种影像文件清晰度恢复以及音频文件过滤的方法,用户可基于本发明的上述技术手段,实现对所录制的影像文件的进一步优化处理。Compared with the prior art, the technical advantage of the present invention is that the present invention provides a video recording method and a video recording system for an intelligent terminal, which combines face recognition technology and face tracking technology, and further utilizes the particularity of human skin. A preset character motion data table is used to calculate the overall contour of the target person, thereby realizing the technical effect of tracking the target person during the image recording process; at the same time, combined with the fuzzy calculation method in the image processing technology, the discharged recorded image is realized. The non-personal influence interference; and the invention is based on the recorded image file, and also provides a method for image file sharpness recovery and audio file filtering, and the user can implement the recorded image file based on the above technical means of the present invention. Further optimization processing.
附图说明DRAWINGS
图1为符合本发明一优选实施例的智能终端的录像方法的流程示意图;1 is a schematic flow chart of a video recording method of a smart terminal according to a preferred embodiment of the present invention;
图2为一符合本发明一优选实施例的影像文件清晰化的方法的流程示意图;2 is a schematic flow chart of a method for clearing an image file according to a preferred embodiment of the present invention;
图3为一符合本发明的一优选实施例的音频文件的过滤方法的流程示意图;3 is a schematic flow chart of a method for filtering an audio file according to a preferred embodiment of the present invention;
图4为一符合本发明一优选实施例的智能终端录像系统的结构图;4 is a structural diagram of a smart terminal recording system in accordance with a preferred embodiment of the present invention;
图5为一符合本发明另一优选实施例的智能终端的录像系统的结构图;FIG. 5 is a structural diagram of a video recording system of a smart terminal according to another preferred embodiment of the present invention; FIG.
图6为一符合本发明的另一优选实施例的智能终端的录像方法的结构图。FIG. 6 is a structural diagram of a video recording method of a smart terminal according to another preferred embodiment of the present invention.
具体实施方式Detailed ways
下面结合附图及具体实施例,详细阐述本发明的优势。The advantages of the present invention are explained in detail below with reference to the accompanying drawings and specific embodiments.
下面的描述涉及附图时,除非另有表示,不同附图中的相同数字表示相同或相似的要素。以下示例性实施例中所描述的实施方式并不代表与本公开相一致的所有实施方式。相反,它们仅是与如所附权利要求书中所详述的、本公开的一些方面相一致的装置和方法的例子。The following description refers to the same or similar elements in the different figures unless otherwise indicated. The embodiments described in the following exemplary embodiments do not represent all embodiments consistent with the present disclosure. Instead, they are merely examples of devices and methods consistent with aspects of the present disclosure as detailed in the appended claims.
首先,应当理解,尽管在本公开可能采用术语第一、第二、第三等来描述各种信息,但这些信息不应限于这些术语。这些术语仅用来将同一类型的信息彼此区分开。例如,在不脱离本公开范围的情况下,第一智能终端也可以被称为第二智能终端,类似地,第二智能终端也可以被称为第一智能终端。First, it should be understood that although the terms first, second, third, etc. may be employed in the present disclosure to describe various information, such information should not be limited to these terms. These terms are only used to distinguish the same type of information from each other. For example, the first intelligent terminal may also be referred to as a second smart terminal without departing from the scope of the present disclosure. Similarly, the second smart terminal may also be referred to as a first smart terminal.
在智能终端录像技术的实际应用中,用户常因无法及时捕捉到目标人物或者其他对象的影像或者音频,或者周围吵杂的环境干扰影像录制的效果而烦恼,特别对于好动的 儿童的录像过程,家长常常难以捕获一段干净完整的儿童视频。为解决上述问题,本发明一方面提出一种智能终端的录像方法。In the practical application of the video recording technology of the smart terminal, the user often suffers from the inability to capture the image or audio of the target person or other object in time, or the surrounding noise environment disturbs the effect of the image recording, especially for the active In children's video recording process, it is often difficult for parents to capture a clean and complete video of children. In order to solve the above problems, an aspect of the present invention provides a video recording method of an intelligent terminal.
具体地,参阅图1,其为符合本发明一优选实施例的智能终端的录像方法的流程示意图。从图中可以看出,本实施例所述提供的智能终端的录像方法主要包括以下步骤:Specifically, referring to FIG. 1 , which is a schematic flowchart of a video recording method of a smart terminal according to a preferred embodiment of the present invention. It can be seen from the figure that the recording method of the intelligent terminal provided in this embodiment mainly includes the following steps:
-启动所述智能终端的录像系统- starting the recording system of the intelligent terminal
本实施例中,所述智能终端包括一系统控制模块。当用户希望录制一视频文件时,可通过触动所述智能终端显示界面的快捷图标或者快捷菜单中的文字名称等方式,向所述智能终端发送一启动指令。所述智能终端的系统控制模块将读取用户发出的操作指令,并在解析后根据所述启动指令,启动所述智能终端的录像系统。In this embodiment, the smart terminal includes a system control module. When the user wants to record a video file, a startup command may be sent to the smart terminal by touching a shortcut icon of the smart terminal display interface or a text name in the shortcut menu. The system control module of the intelligent terminal reads an operation instruction issued by the user, and after the analysis, starts the recording system of the intelligent terminal according to the startup instruction.
-捕获一包括目标人物的目标影像- Capture a target image including the target person
本实施例中,所述智能终端设置有摄像模块,所述摄像模块包括设置于所述智能终端上的前和/或后置摄像头,以及设置于所述智能终端内部的图像传感器及数字信号处理器等。所述摄像模块可通过智能终端的摄像头捕获一包含目标人物的影像,并利用图像传感器及数字信息号处理器等,将所述影像生成一智能终端可读的数字信号,并缓存与所述智能终端中。In this embodiment, the smart terminal is provided with a camera module, and the camera module includes a front and/or a rear camera disposed on the smart terminal, and an image sensor and digital signal processing disposed inside the smart terminal. And so on. The camera module can capture an image containing the target person through the camera of the smart terminal, and generate an image of the smart terminal readable by the image sensor and the digital information number processor, etc., and cache the smart signal. In the terminal.
-识别所述目标影像中的人物的人脸信息Identifying face information of a person in the target image
本实施例中,所述智能终端的录像系统中包括一人脸识别模块,在录像过程中,或者获得录像完成获得一影像文件后,所述人脸识别模块都可按照时间顺序读取所述目标影像的每一帧图片,并识别每一帧图片中的人脸信息。In this embodiment, the video recording system of the smart terminal includes a face recognition module, and the face recognition module can read the target in time sequence after the video recording process is completed or the video file is obtained to obtain an image file. Each frame of the image is imaged and the face information in each frame of the image is identified.
优选地,所述人脸识别模块可在图像中准确标定出人脸的位置和大小。人脸图像中包含的模式特征十分丰富,如直方图特征、颜色特征、模板特征、结构特征及Haar特征等。本实施例中,所述人脸识别模块可从所述逐帧图片中将这些人脸信息标识出来。Preferably, the face recognition module can accurately calibrate the position and size of the face in the image. The pattern features contained in the face image are very rich, such as histogram features, color features, template features, structural features, and Haar features. In this embodiment, the face recognition module may identify the face information from the frame-by-frame picture.
优选地,本实施例中,所述人脸识别模块利用Adaboost算法挑选出一些最能代表人脸的矩形特征(弱分类器),按照加权投票的方式将弱分类器构造为一个强分类器,再将训练得到的若干强分类器串联组成一个级联结构的层叠分类器,有效地提高分类器的检测速度。Preferably, in the embodiment, the face recognition module uses the Adaboost algorithm to select some rectangular features (weak classifiers) that best represent the face, and constructs the weak classifier into a strong classifier according to the weighted voting method. Then, several strong classifiers obtained by training are connected in series to form a cascaded classifier of cascade structure, which effectively improves the detection speed of the classifier.
其中,优选地,本实施例中,所述人脸识别模块还可进行图像预处理,以减少人脸识别过程中受到各种条件的限制和随机干扰。其中,预处理过程主要包括人脸图像的光线补偿、灰度变换、直方图均衡化、归一化、几何校正、滤波以及锐化等。Preferably, in this embodiment, the face recognition module may further perform image preprocessing to reduce various conditions and random interference in the face recognition process. Among them, the preprocessing process mainly includes ray compensation, gradation transformation, histogram equalization, normalization, geometric correction, filtering and sharpening of the face image.
-将所述人脸信息与一预设的标准人脸信息进行匹配, - matching the face information with a preset standard face information,
本实施例中,所述智能终端中还设置有一人脸校验模块,所述人脸校验模块可提取所述人脸信息并与数据库中存储的一预设标准人脸信息进行搜索匹配。优选地,本实施例中,预先设定一个阈值,当所述人脸信息与所述预设人脸信息的相似度超过这一阈值,则把匹配得到的结果输出;否则,则不进行任何处理。In this embodiment, the smart terminal further includes a face verification module, and the face verification module may extract the face information and perform search matching with a preset standard face information stored in the database. Preferably, in this embodiment, a threshold is set in advance, and when the similarity between the face information and the preset face information exceeds the threshold, the result of the matching is output; otherwise, no deal with.
其中,优选地,本实施例中可通过以下几种算法,实现人脸信息的匹配:基于几何特征的方法;局部特征分析方法(Local Face Analysis);特征脸方法(Eigenface或PCA);基于弹性模型的方法;神经网络方法(Neural Networks);隐马尔可夫模型方法(Hidden Markov Model);Gabor小波变换+图形匹配等。Preferably, in this embodiment, the following algorithm can be used to achieve matching of facial information: a method based on geometric features; a method of local feature analysis (Local Face Analysis); a method of feature face (Eigenface or PCA); Model method; Neural Networks method; Hidden Markov Model; Gabor wavelet transform + pattern matching.
-当所述人脸信息与标准人信息相匹配时,根据所述人脸信息锁定所述目标影像中的目标人脸- when the face information matches the standard person information, the target face in the target image is locked according to the face information
本实施例中,当所述人脸信息与标准人脸信息相匹配时,所述智能终端中的人脸定位模块接收到所述人脸校验模块发出的校验结果后,将按照录像录制的时间顺序,在所述目标影像的每一帧图片中锁定所述目标人脸;或者依次读取所述摄像模块录制的每一帧图片中的目标人脸并锁定。In this embodiment, when the face information matches the standard face information, the face positioning module in the smart terminal receives the verification result sent by the face verification module, and then records according to the video. Time sequence, locking the target face in each frame of the target image; or sequentially reading the target face in each frame of the picture recorded by the camera module and locking.
-根据所述目标人脸信息在所述目标影像中的锚定所述目标人物的整体轮廓- anchoring the overall outline of the target person in the target image according to the target face information
本实施例中,所述智能终端的录像系统中还包括一人物定位模块,所述人物定位模块可读取所述人脸定位模块所锁定的目标人脸,并根据所述目标人脸信息在所述目标影像中锚定所述目标人物的整体轮廓。In this embodiment, the recording system of the smart terminal further includes a person positioning module, and the person positioning module can read the target face locked by the face positioning module, and according to the target face information An overall contour of the target person is anchored in the target image.
其中,优选地,本实施例中,从所述目标影像的帧图像中锚定所述目标人物的整体轮廓的具体步骤包括:Preferably, in this embodiment, the specific step of anchoring the overall contour of the target person from the frame image of the target image includes:
-锁定所述目标影像中的目标人脸后,以所述目标人脸中的人体皮肤信息为基准,获取所述目标影像中所述目标人物的身体信息- after locking the target face in the target image, acquiring body information of the target person in the target image based on the human skin information in the target face
人体皮肤作为人体区别特征之一,与指纹、虹膜等信息一样,也具有唯一性,不同人体的皮肤具有不同的纹理、颜色、亮度等特征。相同的人的皮肤具有一定的相似性。本实施例中,所述智能终端的人体定位模块中的人体识别单元可以利用人体皮肤信息作为基准,在所述目标影像的帧图片中识别与所述目标人脸的皮肤特征相似度最高的人体手、脚、手臂、腿等其他人体部位,并获取所述人体其他部位相对于所述目标人脸的位置、形状等信息。Human skin is one of the distinguishing features of the human body. Like the information such as fingerprints and irises, human skin is unique. Different human skins have different textures, colors, and brightness. The same person's skin has a certain similarity. In this embodiment, the human body recognizing unit in the human body positioning module of the smart terminal can use the human skin information as a reference to identify the human body with the highest similarity to the skin feature of the target human face in the frame image of the target image. Other body parts such as hands, feet, arms, legs, etc., and obtain information such as the position, shape, and the like of other parts of the human body relative to the target face.
-与一预设人物动作数据表进行匹配,计算所述目标人物的动作状态,锚定所述目标影像中的目标人物的整体轮廓 - matching with a preset character motion data table, calculating an action state of the target person, and anchoring an overall outline of the target person in the target image
本实施例中,所述人体识别单元还具有一人体定位单元,所述人体定位单元可读取获得的所述人体其他部位相对于所述目标人脸的位置、形状等信息,并将所述人体信息与一预设人物动作数据表进行匹配,计算所述目标人物的动作状态,从而,锚定所述目标影像的帧图片中的目标人物。In this embodiment, the human body recognizing unit further has a human body positioning unit, and the human body positioning unit can read information about the position, shape, and the like of the other parts of the human body relative to the target human face, and the The human body information is matched with a preset character motion data table, and the action state of the target person is calculated, thereby anchoring the target person in the frame image of the target image.
其中,优选地,所述预设人物动作数据表为通过统计人体在奔跑、跳跃、行走等多种动作中,人体的脸、手、脚、手臂、腿等的位置关系而建立的。Preferably, the preset character motion data table is established by counting the positional relationship of the human body's face, hands, feet, arms, legs, etc. in various actions such as running, jumping, walking, and the like.
-将所述目标影像中所述目标人物以外的人物图像进行模糊处理。- Blurring a person image other than the target person in the target image.
继续参阅图1,本实施例中,所述智能终端中还设置有一第一图像处理模块,所述图像处理模块可读取所述人体定位模块所锚定的所述目标人物的整体轮廓,并反向选择未被锚定的帧图片区域,采用一模糊算法,将所述目标影像中的帧图片中目标人物以外的区域模糊化,从而,实现将所述目标影像中非目标人物的影像模糊化。With reference to FIG. 1 , in the embodiment, a first image processing module is further disposed in the smart terminal, and the image processing module can read an overall outline of the target person anchored by the human body positioning module, and Reversely selecting a frame image area that is not anchored, and using a blurring algorithm to blur an area other than the target person in the frame image in the target image, thereby blurring the image of the non-target person in the target image Chemical.
综上所述,本发明利用人脸识别技术及人脸追踪技术结合,并进一步利用人体皮肤的特殊性,通过一预设人物动作数据表,推算出目标人物的整体轮廓,从而,实现在影像录制过程中追踪目标人物的技术效果;同时,再结合图像处理技术中的模糊化计算方法,实现排出录制影像中的非人物的影响干扰。In summary, the present invention utilizes a combination of face recognition technology and face tracking technology, and further utilizes the particularity of the human skin to calculate the overall contour of the target person through a preset character motion data table, thereby realizing the image. The technical effect of tracking the target person during the recording process; at the same time, combined with the fuzzification calculation method in the image processing technology, the non-personal influence interference in the recorded image is discharged.
另外,为防止智能终端在目标识别、追踪过程中由于误差等,导致用户对于模糊化的影像不满意,本发明还提供一种将模糊化的影像文件清晰化的方法。具体参见图2,其为一符合本发明一优选实施例的影像文件清晰化的方法的流程示意图。从图中可以看出,基于上述实施例中智能终端的录像方法,本实施例所述提供的录像方法还包括:In addition, in order to prevent the user from being dissatisfied with the blurred image due to errors or the like in the target recognition and tracking process, the present invention also provides a method for clearing the blurred image file. 2 is a schematic flow chart of a method for clearing an image file according to a preferred embodiment of the present invention. It can be seen from the figure that, according to the recording method of the smart terminal in the above embodiment, the video recording method provided in this embodiment further includes:
-当所述目标影像录制完成后,捕获一操作指令- capturing an operation command when the target image is recorded
当用户对于通过上述方法所录制的影像文件不满意时,可以恢复所录制的影像中的清晰度,从而,可重新从清晰的影像文件中选择需要模糊化的对象等。具体地,本实施例的智能终端中包括一指令模块,所述指令模块可以捕获用户对于智能终端的显示界面的触发操作,并解析上述触发操作的触发压力、位置等信息,从而产生一运行指令向外发送。When the user is dissatisfied with the image file recorded by the above method, the sharpness in the recorded image can be restored, so that the object to be blurred and the like can be newly selected from the clear image file. Specifically, the smart terminal of the embodiment includes an instruction module, where the instruction module can capture a trigger operation of the display interface of the smart terminal, and parse the trigger pressure, location, and the like of the trigger operation, thereby generating a running instruction. Send out.
-将所述目标影像进行分帧处理- framing the target image
本实施例中,所述智能终端还包括一影像分帧模块,所述影像分帧模块可接收所述运行指令,并根据所述运行指令,读取缓存与所述智能终端的存储空间的目标影像,并将所述目标影像进行分帧处理,将所述目标影像逐帧分解为至少一个单帧图片。In this embodiment, the smart terminal further includes an image framing module, and the image framing module can receive the running instruction, and according to the running instruction, read a cache and a target of a storage space of the smart terminal. And performing image processing on the target image to decompose the target image into at least one single frame image.
-逐帧进行超分辨率处理,将所述目标影像中每帧图片中的模糊图像清晰化 - performing super-resolution processing on a frame-by-frame basis to sharpen blurred images in each frame of the target image
进一步参见图2,本实施例中,所述智能终端中还设置有一第二图像处理模块,所述第二图像处理模块可逐帧进行超分辨率处理,将所述目标影像中每帧图片中的模糊图像清晰化。也即利用时间带宽(获取同一场景的多帧图像序列)换取空间分辨率,实现时间分辨率向空间分辨率的转换。优选地,本实施例中所采用的超分辨率的方法包括:超分辨率重构的方法有:规整化重建方法;非均匀空间样本内插方法;迭代反投影方法(IBP);集合理论重建方法(凸集投影POCS);统计重建方法(最大后验概率MAP和最大似然估计ML);混合ML/MAP/POCS方法;自适应滤波/维纳滤波/卡尔曼滤波方法;确定性重建方法;基于学习和模式识别的方法等。Referring to FIG. 2, in the embodiment, a second image processing module is further disposed in the smart terminal, and the second image processing module may perform super-resolution processing on a frame-by-frame basis, and each frame image in the target image is The blurred image is clear. That is, the time resolution (acquiring a multi-frame image sequence of the same scene) is used in exchange for spatial resolution, and the conversion from temporal resolution to spatial resolution is realized. Preferably, the super-resolution method used in the embodiment includes: a super-resolution reconstruction method: a regularization reconstruction method; a non-uniform spatial sample interpolation method; an iterative back projection method (IBP); and a set theory reconstruction Method (convex set projection POCS); statistical reconstruction method (maximum a posteriori probability MAP and maximum likelihood estimation ML); hybrid ML/MAP/POCS method; adaptive filtering/Wiener filtering/Kalman filtering method; deterministic reconstruction method ; methods based on learning and pattern recognition, etc.
另外,基于本发明上述实施例中所提供的智能终端的录像方法,为进一步过滤掉所录制的影像文件中的杂音,提出目标人物的清晰音频,本发明还提供一种音频文件的过滤方法。参阅图3,其为一符合本发明的一优选实施例的音频文件的过滤方法的流程示意图。从图中可以看出,本实施例所提供的音频文件的过滤方法主要包括以下步骤:In addition, based on the recording method of the smart terminal provided in the above embodiment of the present invention, in order to further filter out the noise in the recorded image file, clear audio of the target person is proposed, and the present invention also provides a method for filtering the audio file. Referring to FIG. 3, it is a schematic flowchart of a method for filtering an audio file according to a preferred embodiment of the present invention. It can be seen from the figure that the filtering method of the audio file provided by this embodiment mainly includes the following steps:
-读取所述目标影像中的音频信息,与一预设的标准音频信息进行匹配- reading audio information in the target image to match a preset standard audio information
当用户采用本发明上述实施例中所提供的录像方法获得一影像文件后,可进一步的剔除所述影像文件中除目标人物以外的音频信息。具体地,本实施例中,所述智能终端中还包括一第一信息匹配模块,所述第一信息匹配模块可以读取所述目标影像中的音频信息,并将其与一预设的标准音频信息进行匹配,从而,在所述音频信息中标记所述目标人物的目标音频。After the user obtains an image file by using the video recording method provided in the above embodiment of the present invention, audio information other than the target person in the image file may be further removed. Specifically, in the embodiment, the smart terminal further includes a first information matching module, where the first information matching module can read the audio information in the target image and associate it with a preset standard. The audio information is matched such that the target audio of the target person is marked in the audio information.
-提取与所述标准音频信息相匹配的目标音频信息Extracting target audio information that matches the standard audio information
继续参阅图3,从图中可以看出,本实施例中,所述智能终端中还具有一目标音频提取模块,所述目标音频提取模块可读取所述第一信息匹配模块的标记信息,并从所述目标影像的音频文件中提取所述音频文件中被标记的子音频文件。With reference to FIG. 3, it can be seen from the figure that in the embodiment, the smart terminal further has a target audio extraction module, and the target audio extraction module can read the marking information of the first information matching module. And extracting the labeled sub audio file in the audio file from the audio file of the target image.
其中,优选地,本实施例中,所述提取被标记的音频文件的具体步骤包括:Preferably, in this embodiment, the specific step of extracting the marked audio file includes:
-提取所述目标影像中的音频文件,将所述音频文件分帧Extracting an audio file in the target image, framing the audio file
本实施例中,要对音频流进行分析,需要对音频流分帧,也就是把音频流切开成一小段一小段,每小段称为一帧单位音频流。从而,本实施例中,所述目标音频提取模块中设置有一音频分帧单元,所述音频分帧单元可使用移动窗函数来实现对音频文件的分帧操作。分帧后,单位音频流与单位音频流之间一般是有交叠的,例如,每帧单位音频流的长度为25毫秒,每两帧单位音频流之间有25-10=15毫秒的交叠,称为以帧长25ms、帧移10ms分帧。 In this embodiment, to analyze the audio stream, the audio stream needs to be framed, that is, the audio stream is cut into a short segment, and each segment is called a frame unit audio stream. Therefore, in the embodiment, the target audio extraction module is provided with an audio framing unit, and the audio framing unit can use the moving window function to implement the framing operation on the audio file. After framing, there is generally overlap between the unit audio stream and the unit audio stream. For example, the length of the unit audio stream per frame is 25 milliseconds, and there is 25-10=15 milliseconds between each two frame unit audio streams. The stack is called a frame length of 25 ms and a frame shift of 10 ms.
-提取与所述标准音频信息相匹配的音频帧。- extracting an audio frame that matches the standard audio information.
本实施例中,所述目标音频提取模块还包括一目标音频提取单元,所述目标音频提取单元可读取所述第一信息匹配模块的标记信息,并从所述目标影像的音频文件中提取所述音频文件中被标记的子音频文件。In this embodiment, the target audio extraction module further includes a target audio extraction unit, and the target audio extraction unit may read the tag information of the first information matching module, and extract the audio file from the target image. A sub-audio file that is marked in the audio file.
-删除所述目标影像中所述目标音频信息以外的音频信息- deleting audio information other than the target audio information in the target image
本实施例中,所述智能终端相应地,还具有一音频过滤模块,所述音频过滤模块也可读取所述第一信息匹配模块的标记信息,并从所述目标影像的音频文件中删除所述音频文件中未被标记的子音频文件。In this embodiment, the smart terminal further has an audio filtering module, and the audio filtering module can also read the marking information of the first information matching module, and delete the audio file of the target image. A sub-audio file that is not marked in the audio file.
优选地,本实施例中,也可通过一音频调节模块,实现读取所述第一信息匹配模块的标记信息,并将被标记的音频文件增强,并相应地弱化未被标记的音频文件部分,从而实现将目标人物的音频在所述影像文件中凸显的技术效果。Preferably, in this embodiment, the marking information of the first information matching module is read by an audio adjustment module, and the marked audio file is enhanced, and the unmarked audio file portion is correspondingly weakened. , thereby achieving the technical effect of highlighting the audio of the target person in the image file.
综上所述,本发明基于录制获取的影像文件,还提出一种影像文件清晰度恢复以及音频文件过滤的方法,用户可基于本发明的上述技术手段,实现对所录制的影像文件的进一步优化处理。In summary, the present invention is based on a recorded image file, and further provides a method for image file resolution recovery and audio file filtering. The user can further optimize the recorded image file based on the above technical means of the present invention. deal with.
另外,为实现上述智能终端的录像方法,本发明另一方面,还提供一种智能终端的录像系统。参阅图4,其为一符合本发明一优选实施例的智能终端录像系统的结构图。从图中可以看出,本实施例所提供的智能终端的录像系统包括以下模块:In addition, in order to implement the recording method of the smart terminal, another aspect of the present invention provides a recording system of the smart terminal. Referring to FIG. 4, it is a structural diagram of a smart terminal recording system in accordance with a preferred embodiment of the present invention. As can be seen from the figure, the recording system of the intelligent terminal provided by this embodiment includes the following modules:
-系统控制模块-System Control Module
用于启动所述智能终端的录像系统。具体地,当用户希望录制一视频文件时,可通过触动所述智能终端显示界面的快捷图标或者快捷菜单中的文字名称等方式,向所述智能终端发送一启动指令。而所述智能终端的系统控制模块将读取用户发出的操作指令,并在解析后根据所述启动指令,启动所述智能终端的录像系统。A recording system for starting the smart terminal. Specifically, when the user wants to record a video file, a startup command may be sent to the smart terminal by touching a shortcut icon of the smart terminal display interface or a text name in the shortcut menu. The system control module of the smart terminal reads an operation instruction issued by the user, and starts the recording system of the intelligent terminal according to the startup instruction after parsing.
-摄像模块- Camera module
用于捕获一包括目标人物的目标影像。所述摄像模块包括设置于所述智能终端上的前和/或后置摄像头,以及设置于所述智能终端内部的图像传感器及数字信号处理器等。所述摄像模块可通过智能终端的摄像头捕获一包含目标人物的影像,并利用图像传感器及数字信息号处理器等,将所述影像生成一智能终端可读的数字信号,并缓存与所述智能终端中。Used to capture a target image that includes the target person. The camera module includes a front and/or rear camera disposed on the smart terminal, and an image sensor, a digital signal processor, and the like disposed inside the smart terminal. The camera module can capture an image containing the target person through the camera of the smart terminal, and generate an image of the smart terminal readable by the image sensor and the digital information number processor, etc., and cache the smart signal. In the terminal.
-人脸识别模块-Face recognition module
用于识别所述目标影像中的人物的人脸信息。所述人脸识别模块都可按照时间顺序 读取所述目标影像的每一帧图片,并识别每一帧图片中的人脸信息。A face information for identifying a person in the target image. The face recognition module can be in chronological order Reading each frame of the target image and identifying face information in each frame of the image.
-人脸校验模块- Face verification module
用于将所述人脸信息与一预设的标准人脸信息进行匹配。所述人脸校验模块可提取所述人脸信息并与数据库中存储的一预设标准人脸信息进行搜索匹配。优选地,本实施例中,预先设定一个阈值,当所述人脸信息与所述预设人脸信息的相似度超过这一阈值,则把匹配得到的结果输出;否则,则不进行任何处理。For matching the face information with a preset standard face information. The face verification module may extract the face information and perform search matching with a preset standard face information stored in a database. Preferably, in this embodiment, a threshold is set in advance, and when the similarity between the face information and the preset face information exceeds the threshold, the result of the matching is output; otherwise, no deal with.
-人脸定位模块-Face positioning module
当所述人脸信息与标准人信息相匹配时,所述人脸定位模块根据所述人脸信息锁定所述目标影像中的目标人脸。具体地,所述人脸定位模块接收到所述人脸校验模块发出的校验结果后,将按照录像录制的时间顺序,在所述目标影像的每一帧图片中锁定所述目标人脸;或者依次读取所述摄像模块录制的每一帧图片中的目标人脸并锁定。When the face information matches the standard person information, the face positioning module locks the target face in the target image according to the face information. Specifically, after receiving the verification result sent by the face verification module, the face positioning module locks the target face in each frame of the target image according to the time sequence of the video recording. Or sequentially read the target face in each frame of the picture recorded by the camera module and lock it.
-人体定位模块- Human positioning module
用于根据所述目标人脸信息在所述目标影像中的锚定所述目标人物的整体轮廓。所述人物定位模块可读取所述人脸定位模块所锁定的目标人脸,并根据所述目标人脸信息在所述目标影像中锚定所述目标人物的整体轮廓。An anchoring for anchoring an overall outline of the target person in the target image according to the target face information. The person positioning module can read the target face locked by the face positioning module, and anchor the overall contour of the target person in the target image according to the target face information.
其中,优选地,所述人体定位模块以下单元:Wherein, preferably, the human body positioning module has the following units:
-人体识别单元- human body recognition unit
用于以所述目标人脸中的人体皮肤信息为基准,获取所述目标影像中所述目标人物的人体信息。所述智能终端的人体定位模块中的人体识别单元可以利用人体皮肤信息作为基准,在所述目标影像的帧图片中识别与所述目标人脸的皮肤特征相似度最高的人体手、脚、手臂、腿等其他人体部位,并获取所述人体其他部位相对于所述目标人脸的位置、形状等信息。And configured to acquire body information of the target person in the target image based on human skin information in the target face. The human body recognizing unit in the human body positioning module of the smart terminal can use the human skin information as a reference to identify a human hand, a foot, and an arm having the highest similarity to the skin feature of the target human face in the frame image of the target image. And other body parts such as legs, and obtain information such as the position and shape of the other parts of the human body relative to the target face.
-人体定位单元- Human positioning unit
用于将所述人体信息与一预设人物动作数据表进行匹配,计算所述目标人物的动作状态,锚定所述目标影像中的目标人物。所述人体定位单元可读取获得的所述人体其他部位相对于所述目标人脸的位置、形状等信息,并将所述人体信息与一预设人物动作数据表进行匹配,计算所述目标人物的动作状态,从而,锚定所述目标影像的帧图片中的目标人物。And configured to match the human body information with a preset character motion data table, calculate an action state of the target person, and anchor a target person in the target image. The human body positioning unit can read information about the position, shape, and the like of the other parts of the human body relative to the target human face, and match the human body information with a preset character motion data table to calculate the target. The action state of the character, thereby anchoring the target person in the frame picture of the target image.
-第一图像处理模块- first image processing module
用于将所述目标影像中所述目标人物以外的人物图像进行模糊处理。具体地,所述 图像处理模块可读取所述人体定位模块所锚定的所述目标人物的整体轮廓,并反向选择未被锚定的帧图片区域,采用一模糊算法,将所述目标影像中的帧图片中目标人物以外的区域模糊化,从而,实现将所述目标影像中非目标人物的影像模糊化。And configured to perform blur processing on a character image other than the target person in the target image. Specifically, the The image processing module can read the overall contour of the target person anchored by the human body positioning module, and select a frame image region that is not anchored in the reverse direction, and adopt a blur algorithm to frame the frame image in the target image. The area other than the target person is blurred, thereby blurring the image of the non-target person in the target image.
另外,为防止智能终端在目标识别、追踪过程中由于误差等,导致用户对于模糊化的影像不满意,本发明还提供一种可将模糊化的影像文件清晰化的系统。具体参见图5,其为一符合本发明另一优选实施例的智能终端的录像系统的结构图。从图中可以看出,本实施例所述提供的录像系统还包括:In addition, in order to prevent the user from being dissatisfied with the blurred image due to errors or the like in the target recognition and tracking process, the present invention also provides a system capable of clearing the blurred image file. Referring to FIG. 5, it is a structural diagram of a video recording system of a smart terminal according to another preferred embodiment of the present invention. As can be seen from the figure, the video recording system provided in this embodiment further includes:
-指令模块- instruction module
用于捕获用户发出的一操作指令,所述指令模块可以捕获用户对于智能终端的显示界面的触发操作,并解析上述触发操作的触发压力、位置等信息,从而产生一运行指令向外发送。For capturing an operation instruction issued by the user, the instruction module may capture a trigger operation of the display interface of the smart terminal, and parse the trigger pressure, location, and the like of the trigger operation, thereby generating a running instruction to send out.
-影像分帧模块-Image framing module
用于将所述目标影像进行分帧处理。所述影像分帧模块可接收所述运行指令,并根据所述运行指令,读取缓存与所述智能终端的存储空间的目标影像,并将所述目标影像进行分帧处理,将所述目标影像逐帧分解为至少一个单帧图片。And used to perform framing processing on the target image. The image framing module can receive the running instruction, and according to the running instruction, read a target image buffered with a storage space of the smart terminal, and perform framing processing on the target image to The image is decomposed frame by frame into at least one single frame picture.
-第二图像处理模块- second image processing module
用于逐帧进行超分辨率处理,将所述目标影像中每帧图片中的模糊图像清晰化。所述第二图像处理模块可逐帧进行超分辨率处理,将所述目标影像中每帧图片中的模糊图像清晰化。也即利用时间带宽(获取同一场景的多帧图像序列)换取空间分辨率,实现时间分辨率向空间分辨率的转换。For super-resolution processing on a frame-by-frame basis, the blurred image in each frame of the target image is sharpened. The second image processing module may perform super-resolution processing on a frame-by-frame basis to sharpen a blurred image in each frame of the target image. That is, the time resolution (acquiring a multi-frame image sequence of the same scene) is used in exchange for spatial resolution, and the conversion from temporal resolution to spatial resolution is realized.
另外,基于本发明上述实施例中所提供的智能终端的录像系统,为进一步过滤掉所录制的影像文件中的杂音,提出目标人物的清晰音频,本发明还提供一种音频文件的过滤系统。参阅图6,其为一符合本发明的另一优选实施例的智能终端的录像方法的结构图。从图中可以看出,本实施例所提供的录像系统还包括以下模块:In addition, based on the recording system of the smart terminal provided in the above embodiment of the present invention, in order to further filter out the noise in the recorded image file, clear audio of the target person is proposed, and the present invention also provides a filtering system for the audio file. Referring to FIG. 6, which is a structural diagram of a video recording method of a smart terminal according to another preferred embodiment of the present invention. As can be seen from the figure, the video recording system provided in this embodiment further includes the following modules:
-第一信息匹配模块- first information matching module
用于读取所述目标影像中的音频信息,与一预设的标准音频信息进行匹配。所述第一信息匹配模块可以读取所述目标影像中的音频信息,并将其与一预设的标准音频信息进行匹配,从而,在所述音频信息中标记所述目标人物的目标音频。For reading audio information in the target image, and matching with a preset standard audio information. The first information matching module may read the audio information in the target image and match it with a preset standard audio information, thereby marking the target audio of the target person in the audio information.
-目标音频提取模块- Target audio extraction module
用于提取与所述标准音频信息相匹配的目标音频信息。所述目标音频提取模块可读 取所述第一信息匹配模块的标记信息,并从所述目标影像的音频文件中提取所述音频文件中被标记的子音频文件。For extracting target audio information that matches the standard audio information. The target audio extraction module is readable Taking the tag information of the first information matching module, and extracting the labeled sub audio file in the audio file from the audio file of the target image.
具体地,所述音频提取模块包括以下单元:Specifically, the audio extraction module includes the following units:
-音频分帧单元-Audio framing unit
用于提取所述目标影像中的音频文件,将所述音频文件分帧。具体地,所述目标音频提取模块中设置有一音频分帧单元,所述音频分帧单元可使用移动窗函数来实现对音频文件的分帧操作。And used to extract an audio file in the target image, and framing the audio file. Specifically, an audio framing unit is disposed in the target audio extraction module, and the audio framing unit may use a moving window function to implement a framing operation on the audio file.
-目标音频提取单元- target audio extraction unit
用于提取与所述标准音频信息相匹配的音频帧。具体地,所述目标音频提取单元可读取所述第一信息匹配模块的标记信息,并从所述目标影像的音频文件中提取所述音频文件中被标记的子音频文件。For extracting an audio frame that matches the standard audio information. Specifically, the target audio extraction unit may read the tag information of the first information matching module, and extract the labeled sub audio file in the audio file from the audio file of the target image.
-音频过滤模块-audio filter module
用于删除所述目标影像中所述目标音频信息以外的音频信息。所述音频过滤模块也可读取所述第一信息匹配模块的标记信息,并从所述目标影像的音频文件中删除所述音频文件中未被标记的子音频文件。And for deleting audio information other than the target audio information in the target image. The audio filtering module may also read the tag information of the first information matching module, and delete the unlabeled sub audio file in the audio file from the audio file of the target image.
另外,优选地,本实施例中,也可在所述智能终端中设置一音频调节模块,通过所述音频调节模块,实现读取所述第一信息匹配模块的标记信息,并将被标记的音频文件增强,并相应地弱化未被标记的音频文件部分,从而实现将目标人物的音频在所述影像文件中凸显的技术效果。In addition, in this embodiment, an audio adjustment module may be disposed in the smart terminal, and the marking information of the first information matching module is read by the audio adjustment module, and is marked. The audio file is enhanced and the unmarked portion of the audio file is weakened accordingly, thereby achieving the technical effect of highlighting the audio of the target person in the image file.
综上所述,本发明提出一种智能终端的录像系统,利用人脸识别技术及人脸追踪技术结合,并进一步利用人体皮肤的特殊性,通过一预设人物动作数据表,推算出目标人物的整体轮廓,从而,实现在影像录制过程中追踪目标人物的技术效果;同时,再结合图像处理技术中的模糊化计算方法,实现排出录制影像中的非人物的影响干扰。In summary, the present invention provides a video recording system for a smart terminal, which combines face recognition technology and face tracking technology, and further utilizes the particularity of human skin to calculate a target person through a preset character motion data table. The overall contour, thus, the technical effect of tracking the target person during the image recording process; at the same time, combined with the fuzzy calculation method in the image processing technology, the non-personal influence interference in the recorded image is discharged.
应当注意的是,本发明的实施例有较佳的实施性,且并非对本发明作任何形式的限制,任何熟悉该领域的技术人员可能利用上述揭示的技术内容变更或修饰为等同的有效实施例,但凡未脱离本发明技术方案的内容,依据本发明的技术实质对以上实施例所作的任何修改或等同变化及修饰,均仍属于本发明技术方案的范围内。 It should be noted that the embodiments of the present invention are preferred embodiments, and are not intended to limit the scope of the present invention. Any one skilled in the art may use the above-disclosed technical contents to change or modify the equivalent embodiments. Any modification or equivalent changes and modifications of the above embodiments in accordance with the technical spirit of the present invention are still within the scope of the technical solutions of the present invention.

Claims (10)

  1. 一种智能终端的录像方法,其特征在于,包括以下步骤:A video recording method for an intelligent terminal, comprising the steps of:
    启动所述智能终端的录像系统,Starting the recording system of the smart terminal,
    捕获一包括目标人物的目标影像,Capture a target image that includes the target person,
    识别所述目标影像中的人物的人脸信息,Identifying face information of a person in the target image,
    将所述人脸信息与一预设的标准人脸信息进行匹配,Matching the face information with a preset standard face information,
    当所述人脸信息与标准人信息相匹配时,根据所述人脸信息锁定所述目标影像中的目标人脸,When the face information matches the standard person information, the target face in the target image is locked according to the face information,
    根据所述目标人脸信息在所述目标影像中的锚定所述目标人物的整体轮廓,Anchoring an overall outline of the target person in the target image according to the target face information,
    将所述目标影像中所述目标人物以外的人物图像进行模糊处理。Blur processing of a person image other than the target person in the target image.
  2. 如权利要求1所述的录像方法,其特征在于,A video recording method according to claim 1, wherein
    根据所述目标人脸信息在所述目标影像中的锁定所述目标人物的整体轮廓的步骤包括,The step of locking the overall outline of the target person in the target image according to the target face information includes,
    锁定所述目标影像中的目标人脸后,After locking the target face in the target image,
    以所述目标人脸中的人体皮肤信息为基准,获取所述目标影像中所述目标人物的身体信息,Obtaining body information of the target person in the target image based on human skin information in the target face,
    与一预设人物动作数据表进行匹配,计算所述目标人物的动作状态,锚定所述目标影像中的目标人物。Matching with a preset character motion data table, calculating an action state of the target person, and anchoring a target person in the target image.
  3. 如权利要求1所述的录像方法,其特征在于,A video recording method according to claim 1, wherein
    还包括,Also includes,
    当所述目标影像录制完成后,After the target image is recorded,
    捕获一操作指令,Capture an operation command,
    将所述目标影像进行分帧处理,Performing frame processing on the target image,
    逐帧进行超分辨率处理,将所述目标影像中每帧图片中的模糊图像清晰化。The super-resolution processing is performed frame by frame, and the blurred image in each frame of the target image is sharpened.
  4. 如权利要求1所述的录像方法,其特征在于,A video recording method according to claim 1, wherein
    还包括,Also includes,
    读取所述目标影像中的音频信息,与一预设的标准音频信息进行匹配,Reading audio information in the target image to match a preset standard audio information,
    提取与所述标准音频信息相匹配的目标音频信息,Extracting target audio information that matches the standard audio information,
    删除所述目标影像中所述目标音频信息以外的音频信息。 Deleting audio information other than the target audio information in the target image.
  5. 如权利要求4所述的录像方法,其特征在于,A video recording method according to claim 4, wherein
    提取与所述标准音频信息相匹配的目标音频信息的步骤包括,The step of extracting target audio information that matches the standard audio information includes,
    提取所述目标影像中的音频文件,将所述音频文件分帧,Extracting an audio file in the target image, and framing the audio file,
    提取与所述标准音频信息相匹配的音频帧。An audio frame that matches the standard audio information is extracted.
  6. 一种智能终端的录像系统,其特征在于,包括以下模块,A video recording system for an intelligent terminal, comprising the following modules,
    系统控制模块,启动所述智能终端的录像系统,a system control module that starts a recording system of the smart terminal,
    摄像模块,捕获一包括目标人物的目标影像,a camera module that captures a target image including a target person,
    人脸识别模块,识别所述目标影像中的人物的人脸信息,a face recognition module that recognizes face information of a person in the target image,
    人脸校验模块,将所述人脸信息与一预设的标准人脸信息进行匹配,a face verification module that matches the face information with a preset standard face information,
    人脸定位模块,当所述人脸信息与标准人信息相匹配时,根据所述人脸信息锁定所述目标影像中的目标人脸,a face positioning module, when the face information matches the standard person information, locking the target face in the target image according to the face information,
    人体定位模块,根据所述目标人脸信息在所述目标影像中的锚定所述目标人物的整体轮廓,a human body positioning module that anchors an overall contour of the target person in the target image according to the target face information,
    第一图像处理模块,将所述目标影像中所述目标人物以外的人物图像进行模糊处理。The first image processing module blurs the image of the person other than the target person in the target image.
  7. 如权利要求6所述的录像系统,其特征在于,A video recording system according to claim 6 wherein:
    所述人体定位模块包括,The human body positioning module includes
    人体识别单元,以所述目标人脸中的人体皮肤信息为基准,获取所述目标影像中所述目标人物的人体信息,a human body recognition unit that acquires human body information of the target person in the target image based on human skin information in the target human face,
    人体定位单元,将所述人体信息与一预设人物动作数据表进行匹配,计算所述目标人物的动作状态,锚定所述目标影像中的目标人物。The human body positioning unit matches the human body information with a preset character motion data table, calculates an action state of the target person, and anchors a target person in the target image.
  8. 如权利要求6所述的录像系统,其特征在于,A video recording system according to claim 6 wherein:
    还包括,Also includes,
    指令模块,捕获一操作指令,An instruction module that captures an operation instruction,
    影像分帧模块,将所述目标影像进行分帧处理,The image framing module performs framing processing on the target image,
    第二图像处理模块,逐帧进行超分辨率处理,将所述目标影像中每帧图片中的模糊图像清晰化。The second image processing module performs super-resolution processing on a frame-by-frame basis to sharpen the blurred image in each frame of the target image.
  9. 如权利要求6所述的录像系统,其特征在于,A video recording system according to claim 6 wherein:
    还包括,Also includes,
    第一信息匹配模块,读取所述目标影像中的音频信息,与一预设的标准音频信息进行匹配, The first information matching module reads the audio information in the target image and matches a preset standard audio information.
    目标音频提取模块,提取与所述标准音频信息相匹配的目标音频信息,a target audio extraction module that extracts target audio information that matches the standard audio information,
    音频过滤模块,删除所述目标影像中所述目标音频信息以外的音频信息。The audio filtering module deletes audio information other than the target audio information in the target image.
  10. 如权利要求9所述的录像系统,其特征在于,The video recording system of claim 9 wherein:
    所述目标音频提取模块包括,The target audio extraction module includes
    音频分帧单元,提取所述目标影像中的音频文件,将所述音频文件分帧,An audio framing unit, extracting an audio file in the target image, and framing the audio file,
    目标音频提取单元,提取与所述标准音频信息相匹配的音频帧。 The target audio extracting unit extracts an audio frame that matches the standard audio information.
PCT/CN2017/104354 2017-09-29 2017-09-29 Video recording method and video recording system of intelligent terminal WO2019061285A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/104354 WO2019061285A1 (en) 2017-09-29 2017-09-29 Video recording method and video recording system of intelligent terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/104354 WO2019061285A1 (en) 2017-09-29 2017-09-29 Video recording method and video recording system of intelligent terminal

Publications (1)

Publication Number Publication Date
WO2019061285A1 true WO2019061285A1 (en) 2019-04-04

Family

ID=65903612

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/104354 WO2019061285A1 (en) 2017-09-29 2017-09-29 Video recording method and video recording system of intelligent terminal

Country Status (1)

Country Link
WO (1) WO2019061285A1 (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104049721A (en) * 2013-03-11 2014-09-17 联想(北京)有限公司 Information processing method and electronic equipment
US20140348398A1 (en) * 2013-05-24 2014-11-27 Kabushiki Kaisha Toshiba Electronic apparatus and display control method
CN104794462A (en) * 2015-05-11 2015-07-22 北京锤子数码科技有限公司 Figure image processing method and device
CN105335714A (en) * 2015-10-28 2016-02-17 小米科技有限责任公司 Photograph processing method, device and apparatus
CN105426904A (en) * 2015-10-28 2016-03-23 小米科技有限责任公司 Photo processing method, apparatus and device
CN105679357A (en) * 2015-12-29 2016-06-15 惠州Tcl移动通信有限公司 Mobile terminal and voiceprint identification-based recording method thereof

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104049721A (en) * 2013-03-11 2014-09-17 联想(北京)有限公司 Information processing method and electronic equipment
US20140348398A1 (en) * 2013-05-24 2014-11-27 Kabushiki Kaisha Toshiba Electronic apparatus and display control method
CN104794462A (en) * 2015-05-11 2015-07-22 北京锤子数码科技有限公司 Figure image processing method and device
CN105335714A (en) * 2015-10-28 2016-02-17 小米科技有限责任公司 Photograph processing method, device and apparatus
CN105426904A (en) * 2015-10-28 2016-03-23 小米科技有限责任公司 Photo processing method, apparatus and device
CN105679357A (en) * 2015-12-29 2016-06-15 惠州Tcl移动通信有限公司 Mobile terminal and voiceprint identification-based recording method thereof

Similar Documents

Publication Publication Date Title
Yin et al. Text detection, tracking and recognition in video: a comprehensive survey
Black et al. Recognizing facial expressions in image sequences using local parameterized models of image motion
Al-Allaf Review of face detection systems based artificial neural networks algorithms
US9928406B2 (en) Unified face representation for individual recognition in surveillance videos and vehicle logo super-resolution system
Feris et al. Detection and tracking of facial features in video sequences
CN111339806B (en) Training method of lip language recognition model, living body recognition method and device
CN112380512B (en) Convolutional neural network dynamic gesture authentication method and device, storage medium and equipment
US20200258236A1 (en) Person segmentations for background replacements
CN116129129B (en) Character interaction detection model and detection method
CN113689585B (en) Non-inductive attendance card punching method, system and related equipment
Sathyadevan et al. Identifying moving bodies from CCTV videos using machine learning techniques
WO2019061285A1 (en) Video recording method and video recording system of intelligent terminal
Amjed et al. Noncircular iris segmentation based on weighted adaptive hough transform using smartphone database
Szlávik et al. Face analysis using CNN-UM
CN116152908A (en) Method and device for identifying actions, detecting living bodies and training models, and electronic equipment
Bai et al. Exploration of computer vision and image processing technology based on OpenCV
CN112965602A (en) Gesture-based human-computer interaction method and device
Roth et al. Conservative visual learning for object detection with minimal hand labeling effort
Feris et al. Capturing people in surveillance video
HN et al. Detection and Localization of Mask Occluded Faces by transfer learning using Faster RCNN
Hong et al. Real time face detection and recognition system using Haar-like feature/HMM in ubiquitous network environments
Schmidt Feris et al. Detection and tracking of facial features in video sequences
Singh et al. Generic action recognition from egocentric videos
Patil et al. A review of human facial expression recognition systems applied for wild dataset in last decade
CN102194107A (en) Smiling face recognition method for reducing dimension by using improved linear discriminant analysis

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17927063

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17927063

Country of ref document: EP

Kind code of ref document: A1