CN111901672A - Artificial intelligence image processing method - Google Patents


Publication number
CN111901672A
Authority
CN
China
Prior art keywords
image processing
artificial intelligence
processing method
video signal
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010540161.5A
Other languages
Chinese (zh)
Inventor
邹建财
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Jingwah Information Technology Co ltd
Original Assignee
Shenzhen Jingwah Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Jingwah Information Technology Co ltd
Priority to CN202010540161.5A
Publication of CN111901672A
Legal status: Pending

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H04N21/44008Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/55Rule-based translation
    • G06F40/56Natural language generation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • H04N21/4394Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams

Abstract

The invention discloses an artificial intelligence image processing method comprising the following steps: receiving a video signal; performing image processing on the video signal using an artificial intelligence image processing algorithm to obtain a processed image; and sending the processed image to a preset terminal device. The invention realizes intelligent voice interaction through artificial intelligence and is applied to video conferences, online courses, live-stream selling, online training and the like, thereby saving the cost and time of arranging backgrounds, improving the user experience, and meeting users' needs.

Description

Artificial intelligence image processing method
Technical Field
The invention relates to the technical field of artificial intelligence application, in particular to an artificial intelligence image processing method.
Background
At present, demand for online activities such as video conferences, online courses, and live-stream selling is growing rapidly. However, the video images in these activities suffer from cluttered backgrounds, limited image processing, a lack of beautification, and similar problems. A solution for processing the video images in video conferences, online courses, and live-stream selling is therefore needed to keep pace with this development.
Disclosure of Invention
The invention provides an artificial intelligence image processing method that addresses the problems of the prior art in which video images have cluttered backgrounds, image processing is limited, and images are not beautified.
In order to solve the above problem, in a first aspect, the present invention provides an artificial intelligence image processing method, including:
receiving a video signal;
carrying out image processing on the video signal by adopting an artificial intelligence image processing algorithm to obtain a processed image;
and sending the processed image to a preset terminal device.
In the artificial intelligence image processing method, the image processing the video signal by using an artificial intelligence image processing algorithm to obtain a processed image includes:
extracting a dynamic video of a person from the video signal through a person recognition technology, and compositing the dynamic video of the person with a preset background to replace the background;
extracting feature information of the person from the dynamic video of the person, and recognizing the feature information to perform face-changing and outfit-changing functions;
and extracting an image of the person according to the feature information, and optimizing the image of the person through an image processing algorithm.
In the artificial intelligence image processing method, the image processing on the video signal by using an artificial intelligence image processing algorithm to obtain a processed image further includes:
and adding background music and/or annotation information to the processed image.
In the artificial intelligence image processing method, the image processing on the video signal by using an artificial intelligence image processing algorithm to obtain a processed image further includes:
identifying a voice signal in the video signal;
converting the voice signal into one or more preset languages through a preset intelligent speech recognition and translation algorithm;
and outputting the translated speech through audio and/or subtitles.
In the artificial intelligence image processing method, the method further comprises:
determining a user through a receiving source of the video signal;
and confirming the identity information of the user according to face recognition and/or voiceprint recognition.
In the artificial intelligence image processing method, the method further comprises:
and processing the image according to the preference of the user.
In the artificial intelligence image processing method, the method further comprises:
controlling the image processing process.
In the artificial intelligence image processing method, the controlling the image processing process includes:
receiving audio information of the user;
recognizing the audio information to convert it into instruction control information;
and controlling the image processing process through the instruction control information.
In the artificial intelligence image processing method, the method further comprises:
and acquiring the operation habits and requirements of the user, and sending the operation habits and requirements to a background server for learning.
In a second aspect, there is provided a computer readable storage medium having stored therein a plurality of instructions adapted to be loaded by a processor to perform the artificial intelligence image processing method as described above.
The invention has the beneficial effects that:
Intelligent voice interaction is realized through artificial intelligence and applied to video conferences, online courses, live-stream selling, online training and the like, which saves the cost and time of arranging backgrounds, improves the user experience, and meets users' needs.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed to be used in the description of the embodiments will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without creative efforts.
FIG. 1 is a flow chart of an artificial intelligence image processing method provided by the invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In the description of the present invention, it is to be understood that the terms "center", "longitudinal", "lateral", "length", "width", "thickness", "upper", "lower", "front", "rear", "left", "right", "vertical", "horizontal", "top", "bottom", "inner", "outer", etc. indicate orientations or positional relationships based on those shown in the drawings. They are used only for convenience and simplicity of description and do not indicate or imply that the device or element referred to must have a particular orientation or be constructed and operated in a particular orientation, and thus should not be considered as limiting the present invention. Furthermore, the terms "first" and "second" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include one or more features. In the description of the present invention, "a plurality" means two or more unless specifically defined otherwise.
In the present disclosure, the word "exemplary" is used to mean "serving as an example, instance, or illustration." Any embodiment described herein as "exemplary" is not necessarily to be construed as preferred or advantageous over other embodiments. The following description is presented to enable any person skilled in the art to make and use the invention. In the following description, details are set forth for the purpose of explanation. It will be apparent to one of ordinary skill in the art that the present invention may be practiced without these specific details. In other instances, well-known structures and processes are not shown in detail to avoid obscuring the description of the invention with unnecessary detail. Thus, the present invention is not intended to be limited to the embodiments shown, but is to be accorded the widest scope consistent with the principles and features disclosed herein.
The invention aims to provide an artificial intelligence image processing method. Built on an advanced embedded operating system, high-performance hardware, and a powerful AI image processing algorithm, it lets ordinary users easily meet a range of real-time audio and video processing needs in online activities such as video conferences, live streaming, online education, and online training, thereby greatly reducing cost, saving time, and helping users achieve the best results for the activity.
Referring to FIG. 1, FIG. 1 is a flowchart of the artificial intelligence image processing method provided by the invention. The method includes steps S1-S3:
S1: Receive the video signal.
In this embodiment, a high-definition camera inputs image information to the intelligent terminal via USB, wired network, Wi-Fi, HDMI, or a similar interface, so that the video signal can be processed by the artificial intelligence image processing algorithm. That is, the high-definition camera can transmit video signals in formats such as H.263, H.264, H.265, MJPEG, and YUY2 to the intelligent device over any of these interfaces. The video signal can be received from external equipment, such as a standalone camera, or obtained from a camera module built into the device.
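Of the formats listed, YUY2 is an uncompressed packed 4:2:2 layout in which every four bytes (Y0, U, Y1, V) describe two neighbouring pixels that share one pair of chroma samples. The following is a minimal sketch of how a receiver might unpack one row before further processing; the function name `unpack_yuy2` is hypothetical and is not taken from the patent.

```python
def unpack_yuy2(row: bytes):
    """Expand one packed YUY2 row into per-pixel (Y, U, V) triples.

    Each 4-byte group is Y0, U, Y1, V: two luma samples that share
    a single U and a single V chroma sample.
    """
    if len(row) % 4 != 0:
        raise ValueError("a YUY2 row must be a multiple of 4 bytes")
    pixels = []
    for i in range(0, len(row), 4):
        y0, u, y1, v = row[i:i + 4]
        pixels.append((y0, u, v))  # left pixel of the pair
        pixels.append((y1, u, v))  # right pixel reuses the same chroma
    return pixels
```

The compressed formats (H.263, H.264, H.265, MJPEG) would need a decoder instead and are outside the scope of this sketch.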
S2: Perform image processing on the video signal using an artificial intelligence image processing algorithm to obtain a processed image. Step S2 includes steps S21-S25:
S21: Extract the dynamic video of the person from the video signal through person recognition technology, and composite it with a preset background to replace the background.
In this embodiment, the intelligent terminal uses person recognition to extract the dynamic video of the person and composites it with the selected background, thereby replacing the background. For example, specific natural gestures, limb movements, and the like may be recognized from the video signal and translated into execution instructions. In online activities such as live streaming, arranging an elaborate physical background consumes considerable money and time; with this device, only a background picture needs to be designed.
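The compositing part of step S21 can be illustrated with per-pixel alpha blending. This is a sketch under stated assumptions, not the patent's algorithm: it assumes a person mask with values in [0, 1] has already been produced by some segmentation model (the patent does not specify one), and the name `replace_background` is hypothetical.

```python
def replace_background(frame, mask, background):
    """Blend each pixel as out = m * frame + (1 - m) * background.

    frame, background: rows of (R, G, B) tuples with the same shape.
    mask: rows of floats in [0, 1]; 1.0 means "person", 0.0 means "background".
    """
    out = []
    for frame_row, mask_row, bg_row in zip(frame, mask, background):
        out_row = []
        for fg, m, bg in zip(frame_row, mask_row, bg_row):
            blended = tuple(round(m * f + (1.0 - m) * b) for f, b in zip(fg, bg))
            out_row.append(blended)
        out.append(out_row)
    return out
```

Fractional mask values along the person's outline give a soft edge rather than a hard cut-out, which is the usual reason for blending instead of selecting one source per pixel.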
S22: Extract the feature information of the person from the dynamic video of the person, and recognize the feature information to perform face-changing and outfit-changing functions.
In this embodiment, in addition to compositing the extracted dynamic video of the person with the selected background through person recognition, the intelligent terminal also supports functions such as face changing and outfit changing.
S23: Extract an image of the person according to the feature information, and optimize the image of the person through an image processing algorithm.
In this embodiment, image optimization, such as beautification or black-and-white conversion, may be performed by an image processing algorithm according to the user's settings.
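As one concrete instance of the optimizations mentioned, a black-and-white conversion can be sketched with a weighted sum of the colour channels. The weights below are the ITU-R BT.601 luma coefficients, chosen here as an assumption; the patent does not say which conversion is used, and `to_grayscale` is a hypothetical name.

```python
def to_grayscale(frame):
    """Convert rows of (R, G, B) tuples to rows of single luma values.

    Uses the BT.601 weights 0.299, 0.587, 0.114 (an assumption for this sketch).
    """
    return [
        [round(0.299 * r + 0.587 * g + 0.114 * b) for (r, g, b) in row]
        for row in frame
    ]
```

Green is weighted most heavily because the eye is most sensitive to it, so this tends to look more natural than a plain average of the three channels.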
S24: Add background music and/or annotation information to the processed image.
In this embodiment, background music and an annotation may be added, for example, "Dr. Li is giving an intelligent product demonstration."
S25: Recognize the voice signal in the video signal; convert the voice signal into one or more preset languages through a preset intelligent speech recognition and translation algorithm; and output the translated speech through audio and/or subtitles.
In this embodiment, the user's speech can be converted into multiple languages in real time by the intelligent speech recognition and translation algorithm, and simultaneous output of audio and subtitles is supported.
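The subtitle output of step S25 can be sketched as follows. The sketch assumes that speech recognition and translation have already produced timed text segments (the patent names no specific engine), and it writes them in the widely used SubRip (SRT) layout, which is an illustrative choice rather than something the patent specifies. The names `format_timestamp` and `to_srt` are hypothetical.

```python
def format_timestamp(seconds: float) -> str:
    """Render a time offset in seconds as HH:MM:SS,mmm (SRT style)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments) -> str:
    """Build SRT text from (start_seconds, end_seconds, translated_text) tuples."""
    cues = []
    for index, (start, end, text) in enumerate(segments, start=1):
        cues.append(
            f"{index}\n{format_timestamp(start)} --> {format_timestamp(end)}\n{text}\n"
        )
    # Cues are separated by a blank line.
    return "\n".join(cues)
```

For several target languages, one such subtitle track per language could be generated from the same timings.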
In summary, an operating system is built into the smart device, and the AI image processing algorithms run according to the user's requirements and specific settings. The device serves the application scenarios of online activities, including video conferences, online courses, live-stream selling, and online training, and meets users' various needs. By fusing several AI image processing algorithms, it provides: real-time video background replacement and editing; face changing; outfit changing; background music; various labels; recognition of speech (semantics) and translation into multiple languages, output as audio and subtitles; beautification effects, including portrait beautification, vintage-style images, and black and white; and switching between landscape and portrait orientation, so that audiences can watch on a variety of devices.
Overall, this step uses hardware with neural network processing unit (NPU) compute capability to run the artificial intelligence image processing algorithm on the video signal in real time, obtaining a processed image.
S3: Send the processed image to a preset terminal device.
In this embodiment, the processed audio and video signals are output to a computer, a smartphone, or other equipment via USB, wired network, Wi-Fi, HDMI, or a similar interface. This meets the audio and video editing requirements of various application scenarios and gives users an intelligent, easy-to-operate real-time video processing device. For example, the picture can be switched between landscape and portrait orientation according to the user's needs. That is, after the video signal is processed in real time, it is transmitted to downstream equipment such as a computer or mobile phone over any of these interfaces for the specific application.
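Taken together, steps S1 to S3 form a receive, process, send loop. The sketch below shows that control flow only; `run_pipeline`, the stage functions, and the `send` callback are all hypothetical stand-ins for the camera input, the AI processing of step S2, and the output interface.

```python
def run_pipeline(frames, stages, send):
    """Run every received frame through the processing stages, then send it.

    frames: iterable of received frames (S1).
    stages: ordered list of functions, each mapping a frame to a frame (S2).
    send:   callable that delivers a processed frame to the terminal device (S3).
    Returns the number of frames sent.
    """
    sent = 0
    for frame in frames:
        for stage in stages:
            frame = stage(frame)
        send(frame)
        sent += 1
    return sent
```

Keeping the stages as an ordered list makes it easy to enable or disable individual effects, such as background replacement or beautification, according to the user's settings.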
Preferably, the artificial intelligence image processing method further includes steps S4-S6:
S4: Determine the user from the receiving source of the video signal; confirm the user's identity information by face recognition and/or voiceprint recognition; and process the image according to the user's preferences.
In this embodiment, intelligent voice command control is supported. The intelligent terminal confirms the user's identity by face recognition and voiceprint recognition and automatically applies that user's preference settings when processing the video, so the settings do not have to be reconfigured each time when several users share the device.
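One common way to realize this kind of identity check is to compare a feature vector (embedding) computed from the current face or voice against enrolled reference vectors. The sketch below assumes such embeddings are already available from some model, which the patent does not specify; the threshold value and all names (`identify_user`, `preferences_for`, `DEFAULT_PREFERENCES`) are hypothetical.

```python
import math

DEFAULT_PREFERENCES = {"background": "none", "beautify": False}


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def identify_user(embedding, enrolled, threshold=0.8):
    """Return the best-matching enrolled user, or None if no match reaches the threshold."""
    best_user, best_score = None, threshold
    for user, reference in enrolled.items():
        score = cosine_similarity(embedding, reference)
        if score >= best_score:
            best_user, best_score = user, score
    return best_user


def preferences_for(embedding, enrolled, saved_preferences):
    """Look up the identified user's saved settings, falling back to defaults."""
    user = identify_user(embedding, enrolled)
    return saved_preferences.get(user, DEFAULT_PREFERENCES)
```

Falling back to default settings for an unrecognized person avoids applying one user's preferences to another.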
S5: Control the image processing process. Step S5 includes steps S51-S53:
S51: Receive audio information from the user.
S52: Recognize the audio information and convert it into instruction control information.
S53: Control the image processing process through the instruction control information.
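Steps S52 and S53 can be sketched as a lookup from recognized text to a settings change. The sketch assumes the speech has already been transcribed by some recognizer (not specified in the patent); the phrases, setting names, and function names below are hypothetical examples.

```python
# Hypothetical phrase table: spoken phrase -> (setting name, new value).
COMMAND_TABLE = {
    "replace background": ("background_replacement", True),
    "keep background": ("background_replacement", False),
    "black and white": ("filter", "grayscale"),
    "beautify": ("filter", "beautify"),
}


def to_instruction(transcript: str):
    """S52: map a transcript to an instruction, or None if nothing matches."""
    text = " ".join(transcript.lower().split())  # normalize case and spacing
    for phrase, instruction in COMMAND_TABLE.items():
        if phrase in text:
            return instruction
    return None


def apply_instruction(settings: dict, instruction):
    """S53: return updated processing settings; unknown commands change nothing."""
    if instruction is None:
        return settings
    key, value = instruction
    return {**settings, key: value}
```

A production system would likely use intent recognition rather than substring matching, but the table form keeps the mapping from speech to control information easy to inspect.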
Through steps S4 and S5, the user's identity is confirmed by face recognition and voiceprint recognition, and image processing follows the user's preferences. Intelligent voice command control is supported, so operating the device becomes very simple and ordinary consumers can get started easily.
S6: Acquire the user's operating habits and requirements, and send them to a background server for learning.
In this embodiment, the device connects to a cloud server over the Internet, and the server can learn from the user's habits and requirements in order to continually optimize existing functions and add new ones.
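The data collected in step S6 could be as simple as a count of which functions a user invokes. The sketch below only builds such a report as JSON and leaves the upload to the cloud server out; the payload fields and the name `build_usage_report` are hypothetical, since the patent does not define a report format.

```python
import json
from collections import Counter


def build_usage_report(user_id: str, events) -> str:
    """Summarize a user's feature usage as a JSON string ready for upload.

    events: iterable of feature names, one entry per use.
    """
    counts = Counter(events)
    report = {"user": user_id, "feature_counts": dict(counts)}
    return json.dumps(report, sort_keys=True)
```

Sending aggregated counts rather than raw audio or video keeps the payload small, though what is actually transmitted in the patented system is not stated.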
It will be understood by those skilled in the art that all or part of the steps of the methods of the above embodiments may be performed by instructions, or by associated hardware controlled by the instructions, which may be stored in a computer readable storage medium and loaded and executed by a processor. To this end, the present invention provides a storage medium in which a plurality of instructions are stored, and the instructions can be loaded by a processor to execute the steps of any artificial intelligence image processing method provided by the present invention.
The storage medium may include: Read-Only Memory (ROM), Random Access Memory (RAM), magnetic or optical disks, and the like.
Since the instructions stored in the storage medium can execute the steps of any artificial intelligence image processing method provided in the embodiments of the present invention, they can achieve the beneficial effects of any such method; for details, see the foregoing embodiments, which are not repeated here.
The above operations can be implemented in the foregoing embodiments, and are not described in detail herein.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents and improvements made within the spirit and principle of the present invention are intended to be included within the scope of the present invention.

Claims (10)

1. An artificial intelligence image processing method, comprising:
receiving a video signal;
carrying out image processing on the video signal by adopting an artificial intelligence image processing algorithm to obtain a processed image;
and sending the processed image to a preset terminal device.
2. The artificial intelligence image processing method of claim 1, wherein said image processing the video signal using an artificial intelligence image processing algorithm to obtain a processed image comprises:
extracting a dynamic video of a person from the video signal through a person recognition technology, and compositing the dynamic video of the person with a preset background to replace the background;
extracting feature information of the person from the dynamic video of the person, and recognizing the feature information to perform face-changing and outfit-changing functions;
and extracting an image of the person according to the feature information, and optimizing the image of the person through an image processing algorithm.
3. The artificial intelligence image processing method of claim 2, wherein said image processing the video signal using an artificial intelligence image processing algorithm to obtain a processed image further comprises:
and adding background music and/or annotation information to the processed image.
4. The artificial intelligence image processing method of claim 2, wherein said image processing the video signal using an artificial intelligence image processing algorithm to obtain a processed image further comprises:
identifying a voice signal in the video signal;
converting the voice signal into one or more preset languages through a preset intelligent speech recognition and translation algorithm;
and outputting the translated speech through audio and/or subtitles.
5. The artificial intelligence image processing method of claim 1, further comprising:
determining a user through a receiving source of the video signal;
and confirming the identity information of the user according to face recognition and/or voiceprint recognition.
6. The artificial intelligence image processing method of claim 5, further comprising:
and processing the image according to the preference of the user.
7. The artificial intelligence image processing method of claim 5 or 6, further comprising:
controlling the image processing process.
8. The artificial intelligence image processing method of claim 7, wherein the controlling the image processing comprises:
receiving audio information of the user;
recognizing the audio information to convert it into instruction control information;
and controlling the image processing process through the instruction control information.
9. The artificial intelligence image processing method of claim 5, further comprising:
and acquiring the operation habits and requirements of the user, and sending the operation habits and requirements to a background server for learning.
10. A computer readable storage medium having stored thereon a plurality of instructions adapted to be loaded by a processor to perform the artificial intelligence image processing method of any of claims 1 to 9.
CN202010540161.5A 2020-06-12 2020-06-12 Artificial intelligence image processing method Pending CN111901672A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010540161.5A CN111901672A (en) 2020-06-12 2020-06-12 Artificial intelligence image processing method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010540161.5A CN111901672A (en) 2020-06-12 2020-06-12 Artificial intelligence image processing method

Publications (1)

Publication Number Publication Date
CN111901672A true CN111901672A (en) 2020-11-06

Family

ID=73206314

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010540161.5A Pending CN111901672A (en) 2020-06-12 2020-06-12 Artificial intelligence image processing method

Country Status (1)

Country Link
CN (1) CN111901672A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115550704A (en) * 2022-12-01 2022-12-30 成都掌声如雷网络科技有限公司 Remote family interaction activity system and method based on multifunctional household appliance

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106534757A (en) * 2016-11-22 2017-03-22 北京金山安全软件有限公司 Face exchange method and device, anchor terminal and audience terminal
CN107316020A (en) * 2017-06-26 2017-11-03 司马大大(北京)智能系统有限公司 Face replacement method, device and electronic equipment
CN108027929A (en) * 2015-07-30 2018-05-11 奥兹格·布邓 Determined according to Consumer Preferences and play the media management system and method for the commercial film of all marketable products
US20180316942A1 (en) * 2012-04-24 2018-11-01 Skreens Entertainment Technologies, Inc. Systems and methods and interfaces for video processing, combination and display of heterogeneous sources
CN109147017A (en) * 2018-08-28 2019-01-04 百度在线网络技术(北京)有限公司 Dynamic image generation method, device, equipment and storage medium
CN109859100A (en) * 2019-01-30 2019-06-07 深圳安泰创新科技股份有限公司 Display methods, electronic equipment and the computer readable storage medium of virtual background
CN110113646A (en) * 2019-03-27 2019-08-09 深圳康佳电子科技有限公司 Intelligent interaction processing method, system and storage medium based on AI voice
CN110213613A (en) * 2018-08-09 2019-09-06 腾讯科技(深圳)有限公司 Image processing method, device and storage medium
CN110969572A (en) * 2019-11-29 2020-04-07 广州华多网络科技有限公司 Face changing model training method, face exchanging device and electronic equipment

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
郑树泉: "人工智能的概念", 《读秀》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115550704A (en) * 2022-12-01 2022-12-30 成都掌声如雷网络科技有限公司 Remote family interaction activity system and method based on multifunctional household appliance
CN115550704B (en) * 2022-12-01 2023-03-14 成都掌声如雷网络科技有限公司 Remote family interaction activity method based on multifunctional household appliance

CN112788381B (en) Display apparatus and display method

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20201106