CN111901672A

CN111901672A - Artificial intelligence image processing method

Info

Publication number: CN111901672A
Application number: CN202010540161.5A
Authority: CN
Inventors: 邹建财
Original assignee: Shenzhen Jingwah Information Technology Co ltd
Current assignee: Shenzhen Jingwah Information Technology Co ltd
Priority date: 2020-06-12
Filing date: 2020-06-12
Publication date: 2020-11-06

Abstract

The invention discloses an artificial intelligence image processing method, which comprises the following steps: receiving a video signal; carrying out image processing on the video signal by adopting an artificial intelligence image processing algorithm to obtain a processed image; and sending the processed image to a preset terminal device. The invention realizes intelligent voice communication through artificial intelligence and is applied to video conferences, online courses, online live broadcast delivery, online training and the like, thereby saving the cost and time for arranging backgrounds, improving the use experience of users and meeting the requirements of the users.

Description

Artificial intelligence image processing method

Technical Field

The invention relates to the technical field of artificial intelligence application, in particular to an artificial intelligence image processing method.

Background

At present, the demand for various online activities, such as video conferencing, web lessons, online live tape delivery, etc., is suddenly increasing. However, video images in video conferences, web lessons and online live tape have the problems of disordered backgrounds, single image processing, no beautification and the like. Therefore, there is a need for a solution to process video images in video conferences, web courses, and online live tape to meet the development needs.

Disclosure of Invention

The invention provides an artificial intelligence image processing method which can solve the problems that video images have disordered backgrounds, the image processing is single, the images are not beautified and the like in the prior art.

In order to solve the above problem, in a first aspect, the present invention provides an artificial intelligence image processing method, including:

receiving a video signal;

carrying out image processing on the video signal by adopting an artificial intelligence image processing algorithm to obtain a processed image;

and sending the processed image to a preset terminal device.

In the artificial intelligence image processing method, the image processing the video signal by using an artificial intelligence image processing algorithm to obtain a processed image includes:

the method comprises the steps of advancing a person dynamic video in a video signal through a person identification technology, and synthesizing the person dynamic video with a preset background to replace the background;

extracting character characteristic information in a character dynamic video, and identifying the character characteristic information to perform face changing and dress changing functions;

and extracting a character image according to the character characteristic information, and carrying out image optimization on the character image through an image processing algorithm.

In the artificial intelligence image processing method, the image processing on the video signal by using an artificial intelligence image processing algorithm to obtain a processed image further includes:

and adding background music and/or annotation information to the processed image.

identifying a voice signal in the video signal;

converting the voice signal into a preset multinational language through a preset intelligent voice recognition and translation algorithm;

the multi-national language is output through audio and/or subtitles.

In the artificial intelligence image processing method, the method further comprises:

determining a user through a receiving source of the video signal;

and confirming the identity information of the user according to face recognition and/or voiceprint recognition.

and processing the image according to the preference of the user.

the process of image processing is controlled.

In the artificial intelligence image processing method, the controlling the image processing process includes:

receiving audio information of the user;

identifying the audio information to convert the audio information into instruction manipulation information;

and controlling the image processing process through the instruction control information.

and acquiring the operation habits and requirements of the user, and sending the operation habits and requirements to a background server for learning.

In a second aspect, there is provided a computer readable storage medium having stored therein a plurality of instructions adapted to be loaded by a processor to perform the artificial intelligence image processing method as described above.

The invention has the beneficial effects that:

realize intelligent pronunciation through artificial intelligence and exchange and be applied to video conference, net class, online live tape goods and online training etc. practice thrift the cost and the time of arranging the background, promote user's use impression, satisfy user's demand.

Drawings

In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed to be used in the description of the embodiments will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without creative efforts.

FIG. 1 is a flow chart of an artificial intelligence image processing method provided by the invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

In the description of the present invention, it is to be understood that the terms "center", "longitudinal", "lateral", "length", "width", "thickness", "upper", "lower", "front", "rear", "left", "right", "vertical", "horizontal", "top", "bottom", "inner", "outer", etc. indicate orientations or positional relationships based on those shown in the drawings, and are only for convenience of description and simplicity of description, but do not indicate or imply that the device or element being referred to must have a particular orientation, be constructed and operated in a particular orientation, and thus, should not be considered as limiting the present invention. Furthermore, the terms "first", "second" and "first" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include one or more features. In the description of the present invention, "a plurality" means two or more unless specifically defined otherwise.

In the present disclosure, the word "exemplary" is used to mean "serving as an example, instance, or illustration. Any embodiment described herein as "exemplary" is not necessarily to be construed as preferred or advantageous over other embodiments. The following description is presented to enable any person skilled in the art to make and use the invention. In the following description, details are set forth for the purpose of explanation. It will be apparent to one of ordinary skill in the art that the present invention may be practiced without these specific details. In other instances, well-known structures and processes are not shown in detail to avoid obscuring the description of the invention with unnecessary detail. Thus, the present invention is not intended to be limited to the embodiments shown, but is to be accorded the widest scope consistent with the principles and features disclosed herein.

The invention aims to provide an artificial intelligent image processing method, which is based on an advanced embedded operating system, hardware with excellent performance and a powerful intelligent AI image processing algorithm, and can enable a common user to easily realize various real-time audio and video processing requirements in various online activities, such as video conferences, online live broadcast, online education, online training and other application scenes, thereby greatly reducing the cost, saving the time and simultaneously helping the user to obtain the best activity effect.

Referring to fig. 1, fig. 1 is a flowchart illustrating an artificial intelligence image processing method according to the present invention, the method including steps S1-S3:

and S1, receiving the video signal.

In this embodiment, the high definition camera inputs image information to the intelligent terminal through USB, wired network, WIFI, HDMI and other modes to perform image processing on the video signal by using an artificial intelligence image processing algorithm. Namely, the high-definition camera can transmit video signals in formats such as H.263\ H.264\ H.265\ MJPEG \ YUY2 to the intelligent device in any modes such as USB, wired network, WIFI, HDMI and the like. That is, the video signal can receive external equipment, such as independent camera equipment, and can also be obtained from a camera module built in the equipment.

S2, carrying out image processing on the video signal by adopting an artificial intelligence image processing algorithm to obtain a processed image; step S2 includes steps S21-S25:

and S21, advancing the character dynamic video in the video signal through a character recognition technology, and synthesizing the character dynamic video with a preset background to replace the background.

In this embodiment, the intelligent terminal synthesizes the extracted character dynamic video and the selected background through the character recognition technology, so as to realize the function of replacing the background. For example: specific natural gestures, limb movements, etc. may be recognized from the video signal and may be translated into execution instructions. In the on-line activities such as live broadcasting, a great amount of financial cost and time cost are needed to be consumed by arranging exquisite backgrounds, and only one background picture needs to be designed through the equipment.

And S22, extracting the character characteristic information in the dynamic character video, and identifying the character characteristic information to perform face changing and dress changing functions.

In this embodiment, the intelligent terminal synthesizes the extracted dynamic video of the person with the selected background through the person identification technology, and also supports functions of face changing, face changing and the like.

And S23, extracting a character image according to the character characteristic information, and carrying out image optimization on the character image through an image processing algorithm.

In this embodiment, image optimization, such as processing of beauty, black and white, etc., may be performed through an image processing algorithm according to the setting of the user.

And S24, adding background music and/or annotation information to the processed image.

In this embodiment, background music and a label may be added, for example, "the doctor li is doing intelligent product demonstration.

S25, recognizing a voice signal in the video signal; converting the voice signal into a preset multinational language through a preset intelligent voice recognition and translation algorithm; the multi-national language is output through audio and/or subtitles.

In the embodiment, the voice of the user can be converted into multi-language in real time through the intelligent voice recognition and translation algorithm, and the simultaneous output of audio and subtitles is supported.

In summary, the operating system is built in the smart device, and the smart AI image processing algorithm is run according to the requirements and specific settings of the user. The equipment can meet various application scene requirements of online activities, including video conferences, online lessons, online live broadcast delivery, online training and the like, and various use requirements of users are met. The real-time video background replacement and editing are realized through a fusion algorithm of various advanced AI image processing; changing faces; changing the clothes, and background music; various identifiers; recognizing and transferring voice (semanteme) into various languages, and outputting voice and subtitles; carrying out beautification effect processing on the image, wherein the beautification effect processing comprises various effect processing such as figure beautification, image antiquing, black and white and the like; the transverse and vertical screens are changed skillfully, and the requirements of audiences on watching in various devices are met.

In summary, this step performs real-time dynamic image processing on the video signal by means of hardware with neural Network (NPU) computation power, using an artificial intelligence image processing algorithm, to obtain a processed image.

And S3, sending the processed image to a preset terminal device.

In the embodiment, the processed audio and video signals are output to a computer, a smart phone or other equipment in various modes such as a USB (universal serial bus), a wired network, a WIFI (wireless fidelity), an HDMI (high-definition multimedia interface) and the like, so that the audio and video editing requirements of various application scenes can be met, intelligence is provided for users, and the real-time video processing equipment is simple to operate. For example: can realize according to user's demand that the screen switches anyhow. Namely, after video signals are processed in real time, the signals are transmitted to subsequent equipment such as a computer and a mobile phone in any mode such as USB, a wired network, WIFI and HDMI, and specific application is realized.

Preferably, the artificial intelligence image processing method further includes steps S4-S6:

s4, determining a user through a receiving source of the video signal; and confirming the identity information of the user according to face recognition and/or voiceprint recognition. And processing the image according to the preference of the user.

In the embodiment, intelligent voice instruction control is supported; the intelligent terminal confirms user information according to face recognition and voiceprint recognition, automatically adopts the preference setting of the user to process the video, and avoids the problem that the user needs to be reset every time when multiple users use the video.

And S5, controlling the image processing process. Step S5 includes steps S51-S53:

s51, receiving the audio information of the user;

s52, identifying the audio information to convert the audio information into command control information;

and S53, controlling the image processing process through the instruction control information.

Through steps S4 and S5, the identity of the user is confirmed through face recognition and voiceprint recognition, and image processing is performed according to the preference of the user; the intelligent voice command control is supported, the equipment control becomes very simple, and common consumers can easily get the hands.

And S6, acquiring the operation habits and requirements of the user, and sending the operation habits and requirements to a background server for learning.

In this embodiment, equipment passes through internet access cloud end server, can learn according to user's custom and demand to constantly optimize current function and increase new function.

It will be understood by those skilled in the art that all or part of the steps of the methods of the above embodiments may be performed by instructions or by associated hardware controlled by the instructions, which may be stored in a computer readable storage medium and loaded and executed by a processor. To this end, the present invention provides a storage medium, in which a plurality of instructions are stored, and the instructions can be loaded by a processor to execute the steps in any one of the integration methods provided by the present invention.

Wherein the storage medium may include: read Only Memory (ROM), Random Access Memory (RAM), magnetic or optical disks, and the like.

Since the instructions stored in the storage medium can execute the steps in any integration method provided in the embodiments of the present invention, the beneficial effects that can be achieved by any integration method provided in the embodiments of the present invention can be achieved, for details, see the foregoing embodiments, and are not described herein again.

The above operations can be implemented in the foregoing embodiments, and are not described in detail herein.

The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents and improvements made within the spirit and principle of the present invention are intended to be included within the scope of the present invention.

Claims

1. An artificial intelligence image processing method, comprising:

receiving a video signal;

and sending the processed image to a preset terminal device.

2. The artificial intelligence image processing method of claim 1, wherein said image processing the video signal using an artificial intelligence image processing algorithm to obtain a processed image comprises:

3. The artificial intelligence image processing method of claim 2, wherein said image processing the video signal using an artificial intelligence image processing algorithm to obtain a processed image further comprises:

4. The artificial intelligence image processing method of claim 2, wherein said image processing the video signal using an artificial intelligence image processing algorithm to obtain a processed image further comprises:

identifying a voice signal in the video signal;

the multi-national language is output through audio and/or subtitles.

5. The artificial intelligence image processing method of claim 1, further comprising:

determining a user through a receiving source of the video signal;

6. The artificial intelligence image processing method of claim 5, further comprising:

and processing the image according to the preference of the user.

7. The artificial intelligence image processing method of claim 5 or 6, further comprising:

the process of image processing is controlled.

8. The artificial intelligence image processing method of claim 7, wherein the controlling the image processing comprises:

receiving audio information of the user;

9. The artificial intelligence image processing method of claim 5, further comprising:

10. A computer readable storage medium having stored thereon a plurality of instructions adapted to be loaded by a processor to perform the artificial intelligence image processing method of any of claims 1 to 9.