CN112672062B - Display device and portrait positioning method - Google Patents

Display device and portrait positioning method Download PDF

Info

Publication number
CN112672062B
CN112672062B CN202011620179.2A CN202011620179A CN112672062B CN 112672062 B CN112672062 B CN 112672062B CN 202011620179 A CN202011620179 A CN 202011620179A CN 112672062 B CN112672062 B CN 112672062B
Authority
CN
China
Prior art keywords
image
camera
user
portrait
display
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011620179.2A
Other languages
Chinese (zh)
Other versions
CN112672062A (en
Inventor
杨鲁明
鲍姗娟
王之奎
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hisense Visual Technology Co Ltd
Original Assignee
Hisense Visual Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hisense Visual Technology Co Ltd filed Critical Hisense Visual Technology Co Ltd
Publication of CN112672062A publication Critical patent/CN112672062A/en
Priority to PCT/CN2021/101020 priority Critical patent/WO2022037229A1/en
Priority to CN202180051350.9A priority patent/CN116034580A/en
Application granted granted Critical
Publication of CN112672062B publication Critical patent/CN112672062B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Controls And Circuits For Display Device (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The application provides a display device and a portrait positioning method, wherein the portrait position is detected through a camera; and determining whether to start adjusting the orientation of the camera by comparing the portrait position with a preset area in the proofreading image. And if the current portrait position is not in the preset area, controlling the camera to rotate so that the user imaging position is located in the middle area of the picture. The method comprises the steps of carrying out person detection on a shot image through image detection to determine a specific position, and driving a camera to carry out fine adjustment, so that accurate positioning is achieved, and people shot by the camera can be focused and displayed in a television screen.

Description

Display device and portrait positioning method
The present application claims priority from the chinese patent application entitled "a method for locating a portrait on a display device" filed by the chinese patent office on 21/8/2020, application number 202010849806.3, the entire contents of which are incorporated herein by reference.
Technical Field
The application relates to the technical field of intelligent televisions, in particular to a display device and a portrait positioning method.
Background
The smart television is a television product which is based on an Internet application technology, has an open operating system and a chip, has an open application platform, can realize a bidirectional man-machine interaction function, integrates various functions such as audio and video, entertainment, data and the like, and is used for meeting diversified and personalized requirements of users. The intelligent television can be provided with external or internal functional equipment, and the functions of the intelligent television can be expanded by combining with application programs.
For example, the smart television may be provided with an image acquisition device such as a camera, and the smart television may acquire the user image through the camera and cooperate with the "limb movement" application program to enable the smart television to display the body image of the user in real time. When the limb action of the user is changed, the application program can also display the changed image, and the detection program is used for detecting and correcting the motion form of the limb, so that the effect of motion guidance is achieved.
However, the position and shooting angle of a built-in or external camera device of the smart television are fixed, so that the visual angle of the camera for image acquisition is limited, and the preview display area of the camera can only be an area image towards which the camera faces. According to the position change of the user and the limb action change in the motion process, the user can easily move out of the preview display area, so that the camera cannot acquire a complete image of the user, and the display effect is influenced. Moreover, in order to obtain a complete image again, the user needs to stop the motion and return to the proper position, thereby reducing the user experience.
Disclosure of Invention
The application provides a display device and a portrait positioning method, and aims to solve the problem that a traditional camera cannot acquire complete images of a user easily.
In a first aspect, the present application provides a display apparatus, which includes a display, an external device interface, and a controller, wherein the display apparatus is externally connected with a camera through the external device interface. The camera can rotate to shoot angles and is used for shooting images at different viewing angles; the display is configured to present a user interface; the controller is configured to perform the following program steps:
acquiring a control instruction which is input by a user and used for starting the camera application;
responding to the control instruction, and detecting a human image position from the image shot by the camera;
and sending a rotation instruction to the camera head according to the position of the portrait so as to enable the position of the portrait to be located in a preset judgment area.
In a second aspect, the present application further provides a display device, including a display, a camera, and a controller. The camera can rotate to shoot angles and is used for shooting images at different viewing angles; the display is configured to present a user interface; the controller is configured to perform the following program steps:
acquiring a control instruction which is input by a user and used for starting the camera application;
responding to the control instruction, and detecting a human image position from the image shot by the camera;
and sending a rotation instruction to the camera head according to the position of the portrait so as to enable the position of the portrait to be located in a preset judgment area.
In a third aspect, the present application further provides a portrait positioning method applied to the display device, including:
acquiring a control instruction which is input by a user and used for starting the camera application;
responding to the control instruction, and detecting a human image position from the image shot by the camera;
and sending a rotation instruction to the camera head according to the position of the portrait so as to enable the position of the portrait to be located in a preset judgment area.
According to the technical scheme, the display equipment and the portrait positioning method are provided, wherein the portrait position is detected through the camera; and determining whether to start adjusting the orientation of the camera by comparing the portrait position with a preset area in the proofreading image. And if the current portrait position is not in the preset area, controlling the camera to rotate so that the user imaging position is located in the middle area of the picture. The method comprises the steps of carrying out person detection on a shot image through image detection to determine a specific position, and driving a camera to carry out fine adjustment, so that accurate positioning is achieved, and people shot by the camera can be focused and displayed in a television screen.
Drawings
In order to more clearly explain the technical solution of the present application, the drawings needed to be used in the embodiments will be briefly described below, and it is obvious to those skilled in the art that other drawings can be obtained according to the drawings without creative efforts.
FIG. 1 is a schematic diagram of an operation scenario between a display device and a control apparatus in an embodiment of the present application;
fig. 2 is a block diagram of a hardware configuration of a display device in an embodiment of the present application;
fig. 3 is a block diagram of a hardware configuration of a control device in an embodiment of the present application;
FIG. 4 is a schematic diagram of a software configuration of a display device in an embodiment of the present application;
FIG. 5 is a schematic diagram of an icon control interface display of an application program of a display device in an embodiment of the present application;
fig. 6 is a schematic view of an arrangement structure of a display device and a camera in the embodiment of the present application;
FIG. 7 is a schematic view of a camera structure according to an embodiment of the present disclosure;
FIG. 8 is a schematic diagram of pixel locations in an embodiment of the present application;
FIG. 9a is a schematic top view of a scene of a display device before adjustment according to an embodiment of the present disclosure;
FIG. 9b is a schematic top view of an adjusted scene of a display device according to an embodiment of the present disclosure;
FIG. 9c is a schematic diagram of a side view of a scene of a display device before adjustment according to an embodiment of the present disclosure;
FIG. 9d is a schematic top view of an adjusted scene of a display device according to an embodiment of the present disclosure;
FIG. 10 is a schematic diagram of a portrait center and an image center in an embodiment of the present application;
FIG. 11 is a schematic geometric relationship diagram of a process of calculating a rotation angle according to an embodiment of the present application;
FIG. 12 is a schematic view illustrating a process of adjusting the rotation angle according to an embodiment of the present application;
FIG. 13 is a schematic view of the squatting position in the embodiment of the present application;
fig. 14 is a schematic view of a standing posture state in the embodiment of the present application.
Detailed Description
Reference will now be made in detail to embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The embodiments described in the following examples do not represent all embodiments consistent with the present application. But merely as exemplifications of systems and methods consistent with certain aspects of the application, as recited in the claims.
All other embodiments, which can be derived by a person skilled in the art from the exemplary embodiments described herein without inventive step, are intended to be within the scope of the claims appended hereto. In addition, while the disclosure herein has been presented in terms of one or more exemplary examples, it should be appreciated that aspects of the disclosure may be implemented solely as a complete embodiment.
It should be noted that the brief descriptions of the terms in the present application are only for the convenience of understanding the embodiments described below, and are not intended to limit the embodiments of the present application. These terms should be understood in their ordinary and customary meaning unless otherwise indicated.
The terms "first," "second," "third," and the like in the description and claims of this application and in the above-described drawings are used for distinguishing between similar or analogous objects or entities and are not necessarily intended to limit the order or sequence of any particular one, Unless otherwise indicated. It is to be understood that the terms so used are interchangeable under appropriate circumstances such that the embodiments described herein are, for example, capable of operation in sequences other than those illustrated or otherwise described herein.
Furthermore, the terms "comprises" and "comprising," as well as any variations thereof, are intended to cover a non-exclusive inclusion, such that a product or device that comprises a list of elements is not necessarily limited to those elements explicitly listed, but may include other elements not expressly listed or inherent to such product or device.
The term "module," as used herein, refers to any known or later developed hardware, software, firmware, artificial intelligence, fuzzy logic, or combination of hardware and/or software code that is capable of performing the functionality associated with that element.
The term "remote control" as used in this application refers to a component of an electronic device, such as the display device disclosed in this application, that is typically wirelessly controllable over a short distance. Typically using infrared and/or Radio Frequency (RF) signals and/or bluetooth to connect with the electronic device, and may also include WiFi, wireless USB, bluetooth, motion sensor, etc. For example: the hand-held touch remote controller replaces most of the physical built-in hard keys in the common remote control device with the user interface in the touch screen.
The term "gesture" as used in this application refers to a user's behavior through a change in hand shape or an action such as hand motion to convey a desired idea, action, purpose, or result.
Fig. 1 is a schematic diagram illustrating an operation scenario between a display device and a control apparatus according to an embodiment. As shown in fig. 1, a user may operate the display device 200 through the mobile terminal 300 and the control apparatus 100.
In some embodiments, the control apparatus 100 may be a remote controller, and the communication between the remote controller and the display device includes an infrared protocol communication or a bluetooth protocol communication, and other short-distance communication methods, etc., and the display device 200 is controlled by wireless or other wired methods. The user may input a user command through a key on a remote controller, voice input, control panel input, etc. to control the display apparatus 200. Such as: the user can input a corresponding control command through a volume up/down key, a channel control key, up/down/left/right moving keys, a voice input key, a menu key, a power on/off key, etc. on the remote controller, to implement the function of controlling the display device 200.
In some embodiments, mobile terminals, tablets, computers, laptops, and other smart devices may also be used to control the display device 200. For example, the display device 200 is controlled using an application program running on the smart device. The application, through configuration, may provide the user with various controls in an intuitive User Interface (UI) on a screen associated with the smart device.
In some embodiments, the mobile terminal 300 may install a software application with the display device 200 to implement connection communication through a network communication protocol for the purpose of one-to-one control operation and data communication. Such as: the mobile terminal 300 and the display device 200 can establish a control instruction protocol, synchronize a remote control keyboard to the mobile terminal 300, and control the display device 200 by controlling a user interface on the mobile terminal 300. The audio and video content displayed on the mobile terminal 300 can also be transmitted to the display device 200, so as to realize the synchronous display function.
As also shown in fig. 1, the display apparatus 200 also performs data communication with the server 400 through various communication means. The display device 200 may be allowed to be communicatively connected through a Local Area Network (LAN), a Wireless Local Area Network (WLAN), and other networks. The server 400 may provide various contents and interactions to the display apparatus 200. Illustratively, the display device 200 receives software program updates, or accesses a remotely stored digital media library, by sending and receiving information, as well as Electronic Program Guide (EPG) interactions. The server 400 may be a cluster or a plurality of clusters, and may include one or more types of servers. Other web service contents such as video on demand and advertisement services are provided through the server 400.
The display device 200 may be a liquid crystal display, an OLED display, a projection display device. The particular display device type, size, resolution, etc. are not limiting, and those skilled in the art will appreciate that the display device 200 may be modified in performance and configuration as desired.
The display apparatus 200 may additionally provide an intelligent network tv function of a computer support function including, but not limited to, a network tv, an intelligent tv, an Internet Protocol Tv (IPTV), and the like, in addition to the broadcast receiving tv function.
A hardware configuration block diagram of a display device 200 according to an exemplary embodiment is exemplarily shown in fig. 2.
In some embodiments, at least one of the controller 250, the tuner demodulator 210, the communicator 220, the detector 230, the input/output interface 255, the display 275, the audio output interface 285, the memory 260, the power supply 290, the user interface 265, and the external device interface 240 is included in the display apparatus 200.
In some embodiments, a display 275 receives image signals originating from the first processor output and displays video content and images and components of the menu manipulation interface.
In some embodiments, the display 275, includes a display screen assembly for presenting a picture, and a driving assembly that drives the display of an image.
In some embodiments, the video content is displayed from broadcast television content, or alternatively, from various broadcast signals that may be received via wired or wireless communication protocols. Alternatively, various image contents received from the network communication protocol and sent from the network server side can be displayed.
In some embodiments, the display 275 is used to present a user-manipulated UI interface generated in the display apparatus 200 and used to control the display apparatus 200.
In some embodiments, a driver assembly for driving the display is also included, depending on the type of display 275.
In some embodiments, display 275 is a projection display and may also include a projection device and a projection screen.
In some embodiments, communicator 220 is a component for communicating with external devices or external servers according to various communication protocol types. For example: the communicator may include at least one of a Wifi chip, a bluetooth communication protocol chip, a wired ethernet communication protocol chip, and other network communication protocol chips or near field communication protocol chips, and an infrared receiver.
In some embodiments, the display apparatus 200 may establish control signal and data signal transmission and reception with the external control apparatus 100 or the content providing apparatus through the communicator 220.
In some embodiments, the user interface 265 may be configured to receive infrared control signals from a control device 100 (e.g., an infrared remote control, etc.).
In some embodiments, the detector 230 is a signal used by the display device 200 to collect an external environment or interact with the outside.
In some embodiments, the detector 230 includes a light receiver, a sensor for collecting the intensity of ambient light, and parameters changes can be adaptively displayed by collecting the ambient light, and the like.
In some embodiments, the detector 230 may further include an image collector, such as a camera, etc., which may be configured to collect external environment scenes, collect attributes of the user or gestures interacted with the user, adaptively change display parameters, and recognize user gestures, so as to implement a function of interaction with the user.
In some embodiments, the detector 230 may also include a temperature sensor or the like, such as by sensing ambient temperature.
In some embodiments, the display apparatus 200 may adaptively adjust a display color temperature of an image. For example, the display apparatus 200 may be adjusted to display a cool tone when the temperature is in a high environment, or the display apparatus 200 may be adjusted to display a warm tone when the temperature is in a low environment.
In some embodiments, the detector 230 may also be a sound collector or the like, such as a microphone, which may be used to receive the user's voice. Illustratively, a voice signal including a control instruction of the user to control the display device 200, or to collect an ambient sound for recognizing an ambient scene type, so that the display device 200 can adaptively adapt to an ambient noise.
In some embodiments, as shown in fig. 2, the input/output interface 255 is configured to allow data transfer between the controller 250 and external other devices or other controllers 250. Such as receiving video signal data and audio signal data of an external device, or command instruction data, etc.
In some embodiments, the external device interface 240 may include, but is not limited to, the following: the interface can be any one or more of a high-definition multimedia interface (HDMI), an analog or data high-definition component input interface, a composite video input interface, a USB input interface, an RGB port and the like. The plurality of interfaces may form a composite input/output interface.
In some embodiments, as shown in fig. 2, the tuning demodulator 210 is configured to receive a broadcast television signal through a wired or wireless receiving manner, perform modulation and demodulation processing such as amplification, mixing, resonance, and the like, and demodulate an audio and video signal from a plurality of wireless or wired broadcast television signals, where the audio and video signal may include a television audio and video signal carried in a television channel frequency selected by a user and an EPG data signal.
In some embodiments, the frequency points demodulated by the tuner demodulator 210 are controlled by the controller 250, and the controller 250 can send out control signals according to user selection, so that the modem responds to the television signal frequency selected by the user and modulates and demodulates the television signal carried by the frequency.
In some embodiments, the broadcast television signal may be classified into a terrestrial broadcast signal, a cable broadcast signal, a satellite broadcast signal, an internet broadcast signal, or the like according to the broadcasting system of the television signal. Or may be classified into a digital modulation signal, an analog modulation signal, and the like according to a modulation type. Or the signals are classified into digital signals, analog signals and the like according to the types of the signals.
In some embodiments, the controller 250 and the modem 210 may be located in different separate devices, that is, the modem 210 may also be located in an external device of the main device where the controller 250 is located, such as an external set-top box. Therefore, the set top box outputs the television audio and video signals modulated and demodulated by the received broadcast television signals to the main body equipment, and the main body equipment receives the audio and video signals through the first input/output interface.
In some embodiments, the controller 250 controls the operation of the display device and responds to user operations through various software control programs stored in memory. The controller 250 may control the overall operation of the display apparatus 200. For example: in response to receiving a user command for selecting a UI object to be displayed on the display 275, the controller 250 may perform an operation related to the object selected by the user command.
In some embodiments, the object may be any one of selectable objects, such as a hyperlink or an icon. Operations related to the selected object, such as: displaying an operation connected to a hyperlink page, document, image, or the like, or performing an operation of a program corresponding to the icon. The user command for selecting the UI object may be a command input through various input means (e.g., a mouse, a keyboard, a touch pad, etc.) connected to the display apparatus 200 or a voice command corresponding to a voice spoken by the user.
As shown in fig. 2, the controller 250 includes at least one of a Random Access Memory 251 (RAM), a Read-Only Memory 252 (ROM), a video processor 270, an audio processor 280, other processors 253 (e.g., a Graphics Processing Unit (GPU), a Central Processing Unit 254 (CPU), a Communication Interface (Communication Interface), and a Communication Bus 256(Bus), which connects the respective components.
In some embodiments, RAM 251 is used to store temporary data for the operating system or other programs that are running.
In some embodiments, ROM 252 is used to store instructions for various system boots.
In some embodiments, the ROM 252 is used to store a Basic Input Output System (BIOS). The system is used for completing power-on self-test of the system, initialization of each functional module in the system, a driver of basic input/output of the system and booting an operating system.
In some embodiments, when the power-on signal is received, the display device 200 starts to power up, the CPU executes the system boot instruction in the ROM 252, and copies the temporary data of the operating system stored in the memory to the RAM 251 so as to start or run the operating system. After the start of the operating system is completed, the CPU copies the temporary data of the various application programs in the memory to the RAM 251, and then, the various application programs are started or run.
In some embodiments, processor 254 is used to execute operating system and application program instructions stored in memory. And executing various application programs, data and contents according to various interactive instructions received from the outside so as to finally display and play various audio and video contents.
In some demonstrative embodiments, processor 254 may include a plurality of processors. The plurality of processors may include a main processor and one or more sub-processors. A main processor for performing some operations of the display apparatus 200 in a pre-power-up mode and/or operations of displaying a screen in a normal mode. One or more sub-processors for one operation in a standby mode or the like.
In some embodiments, the graphics processor 253 is used to generate various graphics objects, such as: icons, operation menus, user input instruction display graphics, and the like. The display device comprises an arithmetic unit which carries out operation by receiving various interactive instructions input by a user and displays various objects according to display attributes. And the system comprises a renderer for rendering various objects obtained based on the arithmetic unit, wherein the rendered objects are used for being displayed on a display.
In some embodiments, the video processor 270 is configured to receive an external video signal, and perform video processing such as decompression, decoding, scaling, noise reduction, frame rate conversion, resolution conversion, image synthesis, and the like according to a standard codec protocol of the input signal, so as to obtain a signal that can be displayed or played on the direct display device 200.
In some embodiments, video processor 270 includes a demultiplexing module, a video decoding module, an image synthesis module, a frame rate conversion module, a display formatting module, and the like.
The demultiplexing module is used for demultiplexing the input audio and video data stream, and if the input MPEG-2 is input, the demultiplexing module demultiplexes the input audio and video data stream into a video signal and an audio signal.
And the video decoding module is used for processing the video signal after demultiplexing, including decoding, scaling and the like.
And the image synthesis module is used for carrying out superposition mixing processing on the GUI signal input by the user or generated by the user and the video image after the zooming processing by the graphic generator so as to generate an image signal for display.
The frame rate conversion module is configured to convert an input video frame rate, such as a 60Hz frame rate into a 120Hz frame rate or a 240Hz frame rate, and the normal format is implemented in, for example, an interpolation frame mode.
The display format module is used for converting the received video output signal after the frame rate conversion, and changing the signal to conform to the signal of the display format, such as outputting an RGB data signal.
In some embodiments, the graphics processor 253 and the video processor may be integrated or separately configured, and when the graphics processor and the video processor are integrated, the graphics processor and the video processor may perform processing of graphics signals output to the display, and when the graphics processor and the video processor are separately configured, the graphics processor and the video processor may perform different functions, respectively, for example, a GPU + frc (frame Rate conversion) architecture.
In some embodiments, the audio processor 280 is configured to receive an external audio signal, decompress and decode the received audio signal according to a standard codec protocol of the input signal, and perform noise reduction, digital-to-analog conversion, and amplification processes to obtain an audio signal that can be played in a speaker.
In some embodiments, video processor 270 may comprise one or more chips. The audio processor may also comprise one or more chips.
In some embodiments, the video processor 270 and the audio processor 280 may be separate chips or may be integrated together with the controller in one or more chips.
In some embodiments, the audio output, under the control of controller 250, receives sound signals output by audio processor 280, such as: the speaker 286, and an external sound output terminal of a generating device that can output to an external device, in addition to the speaker carried by the display device 200 itself, such as: external sound interface or earphone interface, etc., and may also include a near field communication module in the communication interface, for example: and the Bluetooth module is used for outputting sound of the Bluetooth loudspeaker.
The power supply 290 supplies power to the display device 200 from the power input from the external power source under the control of the controller 250. The power supply 290 may include a built-in power supply circuit installed inside the display apparatus 200, or may be a power supply interface installed outside the display apparatus 200 to provide an external power supply in the display apparatus 200.
A user interface 265 for receiving an input signal of a user and then transmitting the received user input signal to the controller 250. The user input signal may be a remote controller signal received through an infrared receiver, and various user control signals may be received through the network communication module.
In some embodiments, the user inputs a user command through the control apparatus 100 or the mobile terminal 300, the user input interface responds to the user input through the controller 250 according to the user input, and the display device 200 responds to the user input through the controller 250.
In some embodiments, a user may enter user commands on a Graphical User Interface (GUI) displayed on the display 275, and the user input interface receives the user input commands through the Graphical User Interface (GUI). Alternatively, the user may input the user command by inputting a specific sound or gesture, and the user input interface receives the user input command by recognizing the sound or gesture through the sensor.
In some embodiments, a "user interface" is a media interface for interaction and information exchange between an application or operating system and a user that enables conversion between an internal form of information and a form that is acceptable to the user. A commonly used presentation form of the User Interface is a Graphical User Interface (GUI), which refers to a User Interface related to computer operations and displayed in a graphical manner. It may be an interface element such as an icon, a window, a control, etc. displayed in the display screen of the electronic device, where the control may include a visual interface element such as an icon, a button, a menu, a tab, a text box, a dialog box, a status bar, a navigation bar, a Widget, etc.
The memory 260 includes a memory storing various software modules for driving the display device 200. Such as: various software modules stored in the first memory, including: at least one of a basic module, a detection module, a communication module, a display control module, a browser module, and various service modules.
The base module is a bottom layer software module for signal communication between various hardware in the display device 200 and for sending processing and control signals to the upper layer module. The detection module is used for collecting various information from various sensors or user input interfaces, and the management module is used for performing digital-to-analog conversion and analysis management.
For example, the voice recognition module comprises a voice analysis module and a voice instruction database module. The display control module is used for controlling the display to display the image content, and can be used for playing the multimedia image content, UI interface and other information. And the communication module is used for carrying out control and data communication with external equipment. And the browser module is used for executing a module for data communication between browsing servers. And the service module is used for providing various services and modules including various application programs. Meanwhile, the memory 260 may store a visual effect map for receiving external data and user data, images of various items in various user interfaces, and a focus object, etc.
Fig. 3 exemplarily shows a block diagram of a configuration of the control apparatus 100 according to an exemplary embodiment. As shown in fig. 3, the control apparatus 100 includes a controller 110, a communication interface 130, a user input/output interface, a memory, and a power supply source.
The control device 100 is configured to control the display device 200 and may receive an input operation instruction of a user and convert the operation instruction into an instruction recognizable and responsive by the display device 200, serving as an interaction intermediary between the user and the display device 200. Such as: the user responds to the channel up and down operation by operating the channel up and down keys on the control device 100.
In some embodiments, the control device 100 may be a smart device. Such as: the control apparatus 100 may install various applications that control the display apparatus 200 according to user demands.
In some embodiments, as shown in fig. 1, a mobile terminal 300 or other intelligent electronic device may function similar to the control device 100 after installing an application that manipulates the display device 200. Such as: a user may implement the functions of controlling the physical keys of device 100 by installing applications, various function keys or virtual buttons of a graphical user interface that may be provided on mobile terminal 300 or other intelligent electronic device.
The controller 110 includes a processor 112 and RAM 113 and ROM 114, a communication interface 130, and a communication bus. The controller is used to control the operation of the control device 100, as well as the communication cooperation between the internal components and the external and internal data processing functions.
The communication interface 130 enables communication of control signals and data signals with the display apparatus 200 under the control of the controller 110. Such as: the received user input signal is transmitted to the display apparatus 200. The communication interface 130 may include at least one of a WiFi chip 131, a bluetooth module 132, an NFC module 133, and other near field communication modules.
A user input/output interface 140, wherein the input interface includes at least one of a microphone 141, a touch pad 142, a sensor 143, keys 144, and other input interfaces. Such as: the user can realize a user instruction input function through actions such as voice, touch, gesture, pressing, and the like, and the input interface converts the received analog signal into a digital signal and converts the digital signal into a corresponding instruction signal, and sends the instruction signal to the display device 200.
The output interface includes an interface that transmits the received user instruction to the display apparatus 200. In some embodiments, the interface may be an infrared interface or a radio frequency interface. Such as: when the infrared signal interface is used, the user input instruction needs to be converted into an infrared control signal according to an infrared control protocol, and the infrared control signal is sent to the display device 200 through the infrared sending module. The following steps are repeated: when the rf signal interface is used, a user input command needs to be converted into a digital signal, and then the digital signal is modulated according to the rf control signal modulation protocol and then transmitted to the display device 200 through the rf transmitting terminal.
In some embodiments, the control device 100 includes at least one of a communication interface 130 and an input-output interface 140. The control device 100 is provided with a communication interface 130, such as: the WiFi, bluetooth, NFC, etc. modules may transmit the user input command to the display device 200 through the WiFi protocol, or the bluetooth protocol, or the NFC protocol code.
A memory 190 for storing various operation programs, data and applications for driving and controlling the control apparatus 200 under the control of the controller. The memory 190 may store various control signal commands input by a user.
And a power supply 180 for providing operational power support to the various elements of the control device 100 under the control of the controller. A battery and associated control circuitry.
In some embodiments, the system may include a Kernel (Kernel), a command parser (shell), a file system, and an application program. The kernel, shell, and file system together make up the basic operating system structure that allows users to manage files, run programs, and use the system. After power-on, the kernel is started, kernel space is activated, hardware is abstracted, hardware parameters are initialized, and virtual memory, a scheduler, signals and interprocess communication (IPC) are operated and maintained. And after the kernel is started, loading the Shell and the user application program. The application program is compiled into machine code after being started, and a process is formed.
Referring to fig. 4, in some embodiments, the system is divided into four layers, which are an Application (Applications) layer (abbreviated as "Application layer"), an Application Framework (Application Framework) layer (abbreviated as "Framework layer"), an Android runtime (Android runtime) and system library layer (abbreviated as "system runtime library layer"), and a kernel layer from top to bottom.
In some embodiments, at least one application program runs in the application program layer, and the application programs can be Window (Window) programs carried by an operating system, system setting programs, clock programs, camera applications and the like; or may be an application developed by a third party developer such as a hi program, a karaoke program, a magic mirror program, or the like. In specific implementation, the application packages in the application layer are not limited to the above examples, and may actually include other application packages, which is not limited in this embodiment of the present application.
The framework layer provides an Application Programming Interface (API) and a programming framework for the application program of the application layer. The application framework layer includes a number of predefined functions. The application framework layer acts as a processing center that decides to let the applications in the application layer act. The application program can access the resources in the system and obtain the services of the system in execution through the API interface.
As shown in fig. 4, in the embodiment of the present application, the application framework layer includes a manager (Managers), a Content Provider (Content Provider), and the like, where the manager includes at least one of the following modules: an Activity Manager (Activity Manager) is used for interacting with all activities running in the system; the Location Manager (Location Manager) is used for providing the system service or application with the access of the system Location service; a Package Manager (Package Manager) for retrieving various information related to an application Package currently installed on the device; a Notification Manager (Notification Manager) for controlling display and clearing of Notification messages; a Window Manager (Window Manager) is used to manage the icons, windows, toolbars, wallpapers, and desktop components on a user interface.
In some embodiments, the activity manager is to: managing the life cycle of each application program and the general navigation backspacing function, such as controlling the exit of the application program (including switching the user interface currently displayed in the display window to the system desktop), opening, backing (including switching the user interface currently displayed in the display window to the previous user interface of the user interface currently displayed), and the like.
In some embodiments, the window manager is configured to manage all window processes, such as obtaining a display size, determining whether a status bar is available, locking a screen, intercepting a screen, controlling a display change (e.g., zooming out, dithering, distorting, etc.) and the like.
In some embodiments, the system runtime layer provides support for the upper layer, i.e., the framework layer, and when the framework layer is used, the android operating system runs the C/C + + library included in the system runtime layer to implement the functions to be implemented by the framework layer.
In some embodiments, the kernel layer is a layer between hardware and software. As shown in fig. 4, the core layer includes at least one of the following drivers: audio drive, display drive, bluetooth drive, camera drive, WIFI drive, USB drive, HDMI drive, sensor drive (such as fingerprint sensor, temperature sensor, touch sensor, pressure sensor, etc.), and so on.
In some embodiments, the kernel layer further comprises a power driver module for power management.
In some embodiments, software programs and/or modules corresponding to the software architecture of fig. 4 are stored in the first memory or the second memory shown in fig. 2 or 3.
In some embodiments, taking the magic mirror application (photographing application) as an example, when the remote control receiving device receives a remote control input operation, a corresponding hardware interrupt is sent to the kernel layer. The kernel layer processes the input operation into an original input event (including information such as a value of the input operation, a timestamp of the input operation, etc.). The raw input events are stored at the kernel layer. The application program framework layer obtains an original input event from the kernel layer, identifies a control corresponding to the input event according to the current position of the focus and uses the input operation as a confirmation operation, the control corresponding to the confirmation operation is a control of a magic mirror application icon, the magic mirror application calls an interface of the application framework layer to start the magic mirror application, and then the kernel layer is called to start a camera driver, so that a static image or a video is captured through the camera.
In some embodiments, for a display device with a touch function, taking a split screen operation as an example, the display device receives an input operation (such as a split screen operation) that a user acts on a display screen, and the kernel layer may generate a corresponding input event according to the input operation and report the event to the application framework layer. The window mode (such as multi-window mode) corresponding to the input operation, the position and size of the window and the like are set by an activity manager of the application framework layer. And the window management of the application program framework layer draws a window according to the setting of the activity manager, then sends the drawn window data to the display driver of the kernel layer, and the display driver displays the corresponding application interface in different display areas of the display screen.
In some embodiments, as shown in fig. 5, the application layer containing at least one application may display a corresponding icon control in the display, such as: the system comprises a live television application icon control, a video on demand application icon control, a media center application icon control, an application center icon control, a game application icon control and the like.
In some embodiments, the live television application may provide live television via different signal sources. For example, a live television application may provide television signals using input from cable television, radio broadcasts, satellite services, or other types of live television services. And, the live television application may display video of the live television signal on the display device 200.
In some embodiments, a video-on-demand application may provide video from different storage sources. Unlike live television applications, video on demand provides a video display from some storage source. For example, the video on demand may come from a server side of the cloud storage, from a local hard disk storage containing stored video programs.
In some embodiments, the media center application may provide various applications for multimedia content playback. For example, a media center, which may be other than live television or video on demand, may provide services that a user may access to various images or audio through a media center application.
In some embodiments, an application center may provide storage for various applications. The application may be a game, an application, or some other application associated with a computer system or other device that may be run on the smart television. The application center may obtain these applications from different sources, store them in local storage, and then be operable on the display device 200.
In the embodiment of the present application, as shown in fig. 6, the camera 231 as a detector 230 may be embedded in or externally connected to the display device 200, and after the start operation, the camera 231 may detect image data. The camera 231 may be connected to the controller 250 through an interface part so as to transmit the detected image data to the controller 250 for processing. To detect images, the camera 231 may include a lens assembly and a pan-tilt assembly. The lens assembly may be based on a detection principle of a CCD (Charge Coupled Device) or a CMOS (Complementary Metal Oxide Semiconductor) to generate image data of an electrical signal according to a user image.
The lens subassembly sets up on the cloud platform subassembly, and the cloud platform subassembly can drive the lens subassembly and rotate to change the orientation of lens subassembly. The holder assembly can comprise at least two rotating parts to respectively drive the lens assembly to rotate left and right along the numerical direction and rotate up and down along the horizontal direction. Each rotating part can be connected with a motor so as to be driven by the motor to rotate automatically.
For example, as shown in fig. 7, the pan/tilt head assembly may include a first rotation shaft in a vertical state and a second rotation shaft in a horizontal state, the first rotation shaft being disposed on the top of the display 275 and rotatably connected to the top of the display 275; still be equipped with the mounting in the first pivot, the rotatable being connected with in top of mounting the camera lens subassembly is connected in second pivot, second pivot to drive the camera lens subassembly and rotate. The first rotating shaft and the second rotating shaft are respectively connected with a motor and a transmission component. The motor may be a servo motor, a stepping motor, etc. capable of supporting automatic control of the rotation angle. After acquiring the control command, the two motors can respectively rotate to drive the first rotating shaft and the second rotating shaft to rotate, so as to adjust the orientation of the lens assembly.
With different orientations of the lens assembly, the lens assembly can perform video shooting on users located at different positions, so as to acquire user image data. Obviously, different orientations correspond to image capture of different areas, and when the user is at a position right in front of the display 275, the fixed part and the lens assembly can be driven by the first rotating shaft on the holder assembly to rotate to the left, so that in the captured image, the portrait position of the user is located in the central area of the picture; when the body imaging position of the user is inclined, the lens component can be driven to rotate upwards through the second rotating shaft in the holder component so as to raise the shooting angle, and the portrait position of the user is located in the central area of the picture.
Based on the camera 231, some embodiments of the present application provide a method for locating a portrait, which may set an automatic control program in the display device 200, so as to adjust the orientation of the lens assembly in the camera 231 by detecting the position of the user, and repeat the detection process of the position of the portrait according to a certain frequency, thereby tracking the position of the portrait. The portrait positioning method may specifically include the steps of:
the portrait position is detected. After the camera 231 starts to operate, a plurality of frames of images are photographed in real time, and the photographed images are transmitted to the controller 250 of the display apparatus 200. The start of the camera 231 may be manually or automatically started. The manual activation is to complete the activation after the user selects an icon corresponding to the camera 231 in the operation interface through the control device 100 such as a remote controller. The automatic initiation may be user initiated automatically after performing some interaction that requires the camera 231 to be invoked. For example, the user selects the "look at the mirror" application in the "my applications" interface, and since the application needs to call the camera 231, the camera 231 is started at the same time as the application is started.
After the camera 231 is started, the controller 250 may perform image processing according to the started application program, for example, control the display 275 to display the image; on the other hand, the checking image can be analyzed by calling a detection program, so that the position of the user is determined. Wherein, the detection of the portrait position can be completed by an image processing program. That is, the limb information is detected by capturing an image photographed by the camera 231 in real time. The limb information can comprise key points and an outer frame wrapping the limb, and the position of the portrait is determined according to the detected key points and the position information of the limb frame in the image. Wherein, the key points may refer to a series of points in the human body image that can represent human body features. Such as eyes, ears, nose, neck, shoulders, elbows, wrists, waists, knees, and ankles, among others.
The determination of the key points can be obtained by image recognition, that is, the image corresponding to the key points can be determined by analyzing the characteristic shapes in the picture and matching the characteristic shapes with a preset template, and the positions corresponding to the image are obtained, so that the positions corresponding to the key points are obtained. The position can be represented by the number of pixels in the image away from the boundary. Can be according to camera 231's resolution ratio and visual angle to the upper left corner of image is the origin, and with right and downward for the positive direction and construct plane rectangular coordinate system, then each pixel in the image all can express through this rectangular coordinate system.
For example, as shown in fig. 8, the horizontal direction and the vertical direction camera viewing angles are HFOV and VFOV, respectively, the viewing angles can be obtained according to camera CameraInfo, a camera preview image supports 1080P, the width is 1920, and the height is 1080 pixels, then the position of each pixel in the image can be (x, y), where the value range of x is (0, 1920); y has a value in the range of (0,1080).
Generally, in order to accurately express the position of the user, the number of the key points may be set to be multiple, and in one detection process, position extraction needs to be performed on all or part of the multiple key points, so as to determine the outer frame area wrapping the limb. For example, the key points may include 18, i.e., 2 eye points, 2 ear points, 1 nose point, 1 neck point, 2 shoulder points, 2 elbow points, 2 wrist points, 2 waist points (or hip points), 2 knee joint points, and 2 ankle joint points. Obviously, the key points can be identified in different ways according to different user orientations. For example, the location corresponding to the waist is identified as the waist point when the user faces the display 275, and the hip point when the user faces away from the display 275.
Obviously, when the position of the user is changed or the posture of the user is changed, the positions of some key points are changed. With such a change, the relative position of the human body in the image acquired by the camera 231 will also change. For example, when the human body moves to the left, the position of the human body in the image collected by the camera 231 is shifted to the left, which is inconvenient for image analysis and real-time display.
Therefore, after the portrait position is detected, the portrait position needs to be compared with the preset area in the calibration image, so as to determine whether the current portrait position is in the preset area.
In some embodiments, the portrait position may be represented by a position of a center of a limb box, and the position of the center of the limb box may be calculated by detecting coordinates of each key point position.For example, the central position of the limb frame, namely the x-axis coordinate x of the central position, is calculated by acquiring the x-axis position coordinates of key points on the left side and the right side of the horizontal position of the limb frame 0 =(x 1 +x 2 )/2。
Since the camera 231 in the embodiment of the present application may include two left-right direction rotations and two up-down direction rotations, after the x-axis coordinate of the center position is obtained through calculation, the x-axis coordinate may be determined first, and whether the x-axis coordinate of the center position is located in the center position of the whole image is determined. For example, when the collation image is a 1080P image (1920,1080), the center point horizontal coordinate of the collation image is 960.
After the central position of the portrait and the central point of the image are determined, whether the position of the portrait is in the preset judgment area or not can be determined through comparison. To avoid an increase in processing load due to frequent adjustment, and to allow partial detection of errors. According to the requirements of practical application conditions and the horizontal visual angle of the camera 231, an allowable coordinate interval can be preset, and when the central position of the portrait is located in the allowable coordinate interval, the current position of the portrait is determined to be in the preset area. For example, if the maximum allowable coordinate error is 300 pixels, the allowable coordinate interval is [660, 1260], and when the detected user central position coordinate is in the interval, it is determined that the user is in the preset judgment area, that is, the difference between the calculated user central position coordinate and the 960 position is not large; when the detected central position coordinate of the user is not in the interval, the current portrait position is determined not to be in the preset area, namely, the difference between the central position coordinate of the portrait obtained by calculation and the position 960 is larger.
As shown in fig. 9a, 9b, 9c, and 9d, after comparing the portrait position with the preset area in the calibration image, it may be determined whether the portrait tracking is required according to the comparison result, and if the current portrait position is not within the preset area, the camera 231 is controlled to rotate in the horizontal direction (fig. 9a to 9b) and the vertical direction (fig. 9c to 9d), respectively, so that the user imaging position is located in the middle area of the screen. If the current portrait position is within the preset area, the camera 231 does not need to be controlled to rotate, and the orientation of the camera is maintained.
In order to control the camera 231 to rotate when the current portrait position is not within the preset area, the controller 250 may calculate a rotation angle amount according to the portrait position and generate a control command according to the rotation angle amount to control the camera 231 to rotate.
Specifically, after determining that the current portrait position is not within the preset area, the controller 250 may first calculate a distance between a center position of the portrait area and a center point of the image area; calculating according to the calculated distance and combining the maximum visual angle of the lens assembly of the camera 231 and the image size to obtain a rotation angle; and finally, sending the calculated rotation angle to the camera 231 in a control instruction mode, so that the motor in the camera 231 drives each rotating shaft to rotate, and the orientation of the lens assembly is adjusted.
For example, as shown in fig. 10 and 11, the preview resolution of the camera 231 is 1920 × 1080, and the horizontal width of the image: 1920, imgWidth ═ 1920; the image horizontal center position coordinate x is 960; the central position coordinate of the portrait area is (x) 0 ,y 0 ) Horizontal center position coordinate is x 0 (ii) a Horizontal viewing angle hfov; the center distance between the portrait area and the image area is: hd ═ x-x 0 Then, the rotation angle of the camera 231 in the horizontal direction can be calculated according to the following formula:
Figure BDA0002875941250000121
through the above formula, the angle of the camera 231 that needs to be adjusted can be calculated, and the controller 250 compares the coordinate values of the central position of the portrait area and the central point of the portrait area to determine the orientation of the central position of the portrait area relative to the central point of the portrait area, thereby determining the rotation direction of the camera 231. That is, if the horizontal position of the center of the portrait area is greater than the center of the image, the camera 231 is rotated to the right; and conversely, the camera 231 is rotated to the left. In the embodiment of the present application, the camera 231 may adopt a rear camera mode, so that the screen display image and the image shot by the camera are in a left-right mirror relationship, that is, the horizontal angle rotation is left-right opposite.
After determining the rotation angle and the direction, the controller 250 may encapsulate the rotation angle and the direction data, generate a control command, and transmit the control command to the camera 231. The motor in the camera 231 can rotate after receiving the control instruction, so as to drive the lens assembly to rotate through the rotating shaft, and adjust the orientation of the lens assembly.
It should be noted that, in the above embodiment, the determination and adjustment are performed by taking a horizontal coordinate as an example, in practical application, the lens assembly may also be adjusted similarly by comparing a vertical difference between a central position of the portrait area and a central position of the image area, and a specific adjustment method is the same as the horizontal adjustment method, that is, after determining that the current portrait position is not within the preset area, the controller 250 may first calculate a vertical distance between the central position of the portrait area and the central point of the image area; then, according to the calculated vertical distance, a rotation angle is calculated and obtained by combining the maximum vertical viewing angle of the lens assembly of the camera 231 and the image size; and finally, the calculated rotation angle is sent to the camera 231 in a control instruction mode, so that the motor in the camera 231 drives the second rotating shaft to rotate, and the orientation of the lens assembly is adjusted.
However, in practical application, due to the influence of the user posture and different requirements in different application programs, a better display, detection and tracking effect cannot be obtained by using the central position as the portrait position judgment mode in part of application scenes. Therefore, in some embodiments, controlling the camera 231 to rotate so that the user imaging position is located in the middle area of the screen may further be performed according to the following steps.
A first identification point is detected in the collated image.
Wherein the first identification point is one or more of the identified key points and is used for representing the position of part of the limb of the user. For example, the first recognition point may be 2 eye points (or 2 ear points) to represent the head position of the user. By matching the region corresponding to the eye pattern (or ear pattern) in the collation image, it is detected whether or not the current image contains the first recognition point, that is, whether or not the eye point (or ear point) is contained.
If the collated image does not contain the first identification point, a second identification point is detected in the collated image.
The second recognition point is a key point that is spaced apart from the first recognition point by a certain distance and can have a relative positional relationship. For example, the second identification point may be a chest point, and since the chest point is located below the eye point and the chest point is spaced apart from the eye point by a distance of 20-30cm in a normal use state, a direction that needs to be adjusted may be determined by detection of the chest point.
And generating a rotation direction according to the orientation relation between the second identification point and the first identification point.
For example, when the first recognition point, i.e., the eye point, is not detected in the collation image; when the second recognition point, i.e., the chest point, is detected, it is determined that the head image of the user cannot be displayed completely in the current corrected image, and the camera 231 needs to be lifted up to allow the head of the portrait to enter the preset area of the image.
Obviously, in practical applications, the first recognition point is not detected in the calibration image, and the determined rotation direction is different when the second recognition point is detected, according to the difference of the relative orientation of the second recognition point and the first recognition point. For example, when the first recognition point is a waist point and the second recognition point is a chest point, and the chest point is detected without detecting the waist point, it indicates that the captured image is too close to the upper half of the portrait, so the lower half of the portrait can be brought into the preset area of the image by lowering the capturing angle.
And controlling the camera 231 to rotate according to the rotating direction and the preset adjusting step length so as to enable the portrait to be located in the preset image area.
The preset adjustment step length can be set to a fixed value according to the shooting precision of the camera 231. For example, when the eye/ear key point (first identification point) is not detected and the shoulder key point (second identification point) is detected, the camera 231 may be lifted up, so that the first identification point position is adjusted by 100 pixels each time until the first identification point is at the position 1/7-1/5.
And if the first identification point is contained in the corrected image, acquiring the position of the first identification point relative to the image area.
By recognition of the picture in the collated image, if the first recognition point is recognized, the position where the first recognition point is located may be further extracted, thereby determining the position where the first recognition point is located with respect to the entire image area. For example, as shown in fig. 12, after obtaining the calibration image, if the eye point is recognized, that is, it is determined that the first recognition point is detected, the current coordinates P (x) of the eye point may be acquired 1 ,y 1 ). And comparing the coordinate value of the x axis and/or the coordinate value of the y axis in the current coordinate with the whole width imgWidth and/or height imgHeight of the image, thereby determining the position of the first recognition point relative to the image area. Wherein the position of the first recognition point in both directions relative to the image area can be determined in both directions. I.e. in the horizontal direction, the first identification point is located at a position x relative to the image area 1 /imgWidth; in the vertical direction, the position of the first identification point relative to the image area is y 1 /imgHeight。
After the position of the first identification point relative to the image area is obtained, the section where the position of the first identification point is located can be judged, and different adjustment modes can be determined according to different sections where the first identification point is located.
For example, as shown in fig. 12, by detecting that the first recognition point is located at a position relative to the image area in the vertical direction, it is detected that the eye (or ear) is below 1/5 of the height of the image screen, at which time the eye position is too low, and it is necessary to press down the camera 231 to raise the eye position to a suitable area, and in the process of pressing down the camera 231, if it is detected that the eye point is at 1/6 of the image screen, the pressing down is stopped, and the adjustment of the camera 231 is completed. When the eye (or ear) position is detected to be below 1/7 and above 1/5 of the image screen height, the current first identification point position is determined to be appropriate, so that the height of the camera 231 does not need to be adjusted, and the camera is prevented from being frequently changed due to shaking.
Since the distances between the user and the display device 200 are different in practical applications, the ratio of the portrait is different in the image captured by the camera 231. That is, when the user is at a distant position from the display apparatus 200, the photographed portrait is also small; when the display apparatus 200 is located at a close position, the photographed portrait is also large. For this reason, when the camera 231 is controlled to rotate, the same rotation step may vary the amount of movement of the image position when the user is at different distances from the display apparatus 200. For example, when the user is closer to the display device 200, the camera 231 adjusts 100 pixels and may move the eye region of the human image from the edge position to the 1/7-1/5 position; when the user is far away from the display device 200, the camera 231 adjusts 100 pixels to move the eye region of the portrait from the edge position to the position 1/5-1/2, which causes over-adjustment and affects the positioning effect.
To this end, in some embodiments, the adjustment step size may also be determined based on the characteristics of the detection of the first identification point or the second identification point. I.e. the method further comprises the steps of:
detecting a distance between two of the first recognition points or two of the second recognition points in the collation image;
matching a preset adjusting step length according to the distance between the two first identification points or the two second identification points;
and packaging the rotation direction and the preset adjustment step length to generate a rotation instruction.
Due to the human body characteristics, there are generally at least two key points of the same type, so that at least two first recognition points or at least two second recognition points can be detected in the corrected image. For example, when the first recognition points are eye key points, the left and right eyes of the user respectively correspond to two key points, and the distance between the user and the display device 200 can be estimated by calculating the distance between the two first recognition points. Obviously, the farther the distance between the user and the display device 200 is, the closer the distance between the two first recognition points is; the closer the distance between the user and the display device 200 is, the farther the distance between the two first recognition points is calculated.
After the distance between the two first identification points or the two second identification points is calculated, matching may be performed in a preset relationship table according to the type of the first identification point or the second identification point and the calculated distance, determining a preset adjustment step corresponding to the current distance, and encapsulating the determined rotation direction and the preset adjustment step in a rotation instruction, so as to send the rotation instruction to the camera 231. The camera 231 may adjust the photographing angle in the rotation direction according to a preset adjustment step length according to the received rotation instruction.
In the above embodiment, the camera 231 may re-extract the calibration image once per adjustment of the shooting angle according to the preset adjustment step length, and determine whether the portrait position is located in the preset area according to the determination method in the above embodiment, and if the portrait position is still not located in the preset area, generate a rotation instruction once again to control the camera 231 to continue rotating by the angle, and stop sending the rotation instruction to the camera 231 in this cycle manner until the portrait position is located in the preset area.
It should be noted that, in order to obtain the tracking effect of the camera 231, in practical applications, in the use process after determining that the position of the portrait is located in the preset area, the corrected image may be extracted from the image captured by the camera 231 according to the set frequency, and the position of the portrait may be determined according to the above-described embodiment. If the position of the portrait is not in the preset area due to the movement of the user in the using process, a rotation instruction is sent to the camera 231 to control the camera 231 to turn to the user area, so that the tracking effect is achieved.
According to the technical scheme, the image positioning method provided by the embodiment can solve the problem that the image area shot by the current television fixed camera is fixed and the horizontally or vertically moving human body cannot be tracked, positioned and shot. The method for locating the portrait is effective to quickly and accurately locating the focused portrait when the standing position of the portrait is not in the default camera shooting central area for scenes such as video call, body building and the like.
Because the number and the types of the identified key points are different in different detection processes, for example, when the user is in a squatting state, the waist point, the ankle joint point and other key points are shielded and cannot be identified, and therefore the key points identified in different detection processes can be different.
In order to obtain more accurate determination of the portrait position, in some embodiments, as shown in fig. 13 and 14, after a plurality of key points are identified, a skeleton line schematic graph may be further established according to the identified key points, so as to further determine the position of the portrait according to the skeleton line graph. Wherein the bone line can be determined by connecting a plurality of key points. In different poses of the user, the shape assumed by the bone line is also different.
It should be noted that the shooting position of the camera can be dynamically adjusted according to the motion change rule of the skeleton line through the drawn skeleton line. For example, when it is determined that the change process of the motion state of the bone line changes from the squatting state to the standing state, the viewing angle of the camera 231 may be raised, so that the portrait in the standing state may also be in a suitable area in the image, i.e., the transition from fig. 13 to fig. 14 has the effect. When the change process of the motion state of the skeleton line is judged to be changed from the standing state to the squatting state, the visual angle of the camera 231 can be reduced, so that the portrait in the squatting state can also be in a proper area in the image, namely, the effect of transition from fig. 14 to fig. 13 is achieved.
Based on the above-mentioned portrait positioning method, in some embodiments of the present application, there is also provided a display device 200, where the display device 200 includes a display 275 and a controller 250. The display apparatus 200 may have a camera 231 built therein or accessed through an external device interface 240. The camera 231 can rotate to shoot angles, and is used for shooting images at different viewing angles; the display 275 is configured to present a user interface; the controller 250 is configured to perform the following program steps:
acquiring a control instruction which is input by a user and used for starting the camera application;
responding to the control instruction, and detecting a human image position from the image shot by the camera;
and sending a rotation instruction to the camera head according to the position of the portrait so as to enable the position of the portrait to be located in a preset judgment area.
As can be seen from the above technical solutions, the display device 200 provided in this embodiment can configure the application program corresponding to the portrait positioning method in the controller 250, so as to perform the portrait positioning method in real time. I.e. by comparing the portrait position with a preset area in the calibration image to determine whether to start adjusting the orientation of the camera 231. If the current portrait position is not in the preset area, the camera 231 is controlled to rotate so that the user imaging position is located in the middle area of the picture. The method determines the specific position of the shot image through person detection of the image detection, drives the camera 231 to perform fine adjustment so as to achieve accurate positioning, and enables the person shot by the camera 231 to be focused and displayed in the display 275 of the display device 200
The embodiments provided in the present application are only a few examples of the general concept of the present application, and do not limit the scope of the present application. Any other embodiments extended according to the scheme of the present application without inventive efforts will be within the scope of protection of the present application for a person skilled in the art.

Claims (10)

1. A display device, comprising:
a display;
an external device interface configured to connect a camera, which can rotate a photographing angle, configured to photograph an image;
a controller configured to:
acquiring a control instruction which is input by a user and used for starting the camera application;
responding to the control instruction, detecting a human image position from an image shot by the camera, wherein the human image position is a limb center position obtained by calculating a plurality of limb key point position coordinates in a target recognition detection image and according to a skeleton line determined by a plurality of key points and a corresponding motion state;
calculating the distance between a user and display equipment according to the distance between at least two key points detected in the image, and matching a preset adjusting step length according to the distance between the user and the display equipment;
and sending a rotation instruction to the camera according to the position of the portrait and the distance between the user and the display equipment so as to enable the position of the portrait to be located in a preset judgment area, wherein the rotation instruction comprises a rotation direction and a rotation angle, and the rotation angle is determined according to a preset adjustment step length obtained by matching.
2. The display device according to claim 1, wherein in the step of detecting the position of the person image from the image taken by the camera, the controller is further configured to:
extracting a proofreading image;
identifying a characteristic shape in the collated image;
matching the characteristic shape with a preset template, and positioning a plurality of key points;
establishing a skeleton line graph according to the key points, and determining a motion state according to the skeleton line graph;
and calculating the central position according to the positions of the key points and the motion state to obtain the portrait position.
3. The display device of claim 1, wherein in the step of sending a rotation instruction to the camera according to the portrait position and the distance between the user and the display device, the controller is further configured to:
comparing the portrait position with a preset judgment area;
if the portrait position is located in the preset judgment area, controlling a display to display the image shot by the camera in real time;
if the portrait position is outside the preset judgment area, calculating the coordinate difference between the portrait position and the center of the preset judgment area;
and generating a rotation instruction according to the coordinate difference, and sending the rotation instruction to the camera head.
4. The display device according to claim 3, wherein in the step of generating the rotation instruction according to the coordinate difference, the controller is further configured to:
comparing the position coordinates of the portrait with the central coordinates of the preset judgment area to obtain a rotation direction;
calculating the distance between the position coordinates of the portrait and the central coordinates of the preset judgment area;
calculating a rotation angle according to the calculated distance and by combining camera parameters, wherein the camera parameters comprise a preview resolution and a maximum visual angle;
and encapsulating the rotation direction and the rotation angle to generate the rotation instruction.
5. The display device according to claim 1, wherein in the step of detecting the position of the person image from the image taken by the camera, the controller is further configured to:
detecting first identification points in the proofreading image, wherein the first identification points are one or more of key points;
if the proofreading image does not contain the first identification point, detecting a second identification point in the proofreading image, wherein the second identification point is a key point which has a preset orientation relation with the first identification point;
and generating a rotation direction according to the orientation relation between the second identification point and the first identification point.
6. The display device according to claim 5, wherein the first recognition point is a user eye point and the second recognition point is a user shoulder point; in the step of detecting a position of a person image from an image taken by the camera, the controller is further configured to:
detecting a shoulder point in the collation image if the collation image does not contain a user eye point;
and sending a rotation instruction to the camera head to control the camera head to raise the shooting angle, so that the shot image contains the eye point of the user, and the eye point is positioned in the position of 1/7-1/5 height interval in the image.
7. The display device of claim 5, wherein in the step of calculating a distance between the user and the display device based on a distance between at least two key points detected in the image, and matching a preset adjustment step size based on the distance between the user and the display device, the controller is further configured to:
detecting a distance between two of the first recognition points or two of the second recognition points in the collation image;
matching a preset adjusting step length according to the distance between the two first identification points or the two second identification points;
and packaging the rotation direction and the preset adjustment step length to generate a rotation instruction.
8. The display device according to claim 5, wherein in the step of detecting the position of the human image from the image captured by the camera, if the corrected image contains the eye point of the user, the controller is further configured to:
detecting a height interval of the eye point of the user in the proofreading image;
if the user eye point is below 1/5 the corrected image height, sending a rotation instruction to the camera to control the camera to lower the shooting height until the user eye point is at 1/6 stop at the corrected image height;
and if the eye point of the user is in the range of 1/7-1/5 of the corrected image height, the rotation instruction is cancelled to the camera head, so that the influence of camera shake on the detection result is prevented.
9. A display device, comprising:
a display;
a camera rotatable by a shooting angle configured to shoot an image;
a controller configured to:
acquiring a control instruction which is input by a user and used for starting the camera application;
responding to the control instruction, detecting a human image position from an image shot by the camera, wherein the human image position is a limb center position obtained by calculating a plurality of limb key point position coordinates in a target recognition detection image and according to a skeleton line determined by a plurality of key points and a corresponding motion state;
calculating the distance between a user and display equipment according to the distance between at least two key points detected in the image, and matching a preset adjusting step length according to the distance between the user and the display equipment;
and sending a rotation instruction to the camera according to the position of the portrait and the distance between the user and the display equipment so as to enable the position of the portrait to be located in a preset judgment area, wherein the rotation instruction comprises a rotation direction and a rotation angle, and the rotation angle is determined according to a preset adjustment step length obtained by matching.
10. A portrait positioning method is applied to display equipment, the display equipment comprises a display and a controller, the display equipment is internally provided with or externally connected with a camera through an external device interface, the camera can rotate a shooting angle, and the portrait positioning method comprises the following steps:
acquiring a control instruction which is input by a user and used for starting the camera application;
responding to the control instruction, detecting a human image position from an image shot by the camera, wherein the human image position is a limb center position obtained by calculating a plurality of limb key point position coordinates in a target recognition detection image and according to a skeleton line determined by a plurality of key points and a corresponding motion state;
calculating the distance between a user and display equipment according to the distance between at least two key points detected in the image, and matching a preset adjusting step length according to the distance between the user and the display equipment;
and sending a rotation instruction to the camera according to the position of the portrait and the distance between the user and the display equipment so as to enable the position of the portrait to be located in a preset judgment area, wherein the rotation instruction comprises a rotation direction and a rotation angle, and the rotation angle is determined according to a preset adjustment step length obtained by matching.
CN202011620179.2A 2020-08-21 2020-12-31 Display device and portrait positioning method Active CN112672062B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2021/101020 WO2022037229A1 (en) 2020-08-21 2021-06-18 Human image positioning methods and display devices
CN202180051350.9A CN116034580A (en) 2020-08-21 2021-06-18 Portrait positioning method and display equipment

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN2020108498063 2020-08-21
CN202010849806 2020-08-21

Publications (2)

Publication Number Publication Date
CN112672062A CN112672062A (en) 2021-04-16
CN112672062B true CN112672062B (en) 2022-08-09

Family

ID=75412096

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011620179.2A Active CN112672062B (en) 2020-08-21 2020-12-31 Display device and portrait positioning method

Country Status (1)

Country Link
CN (1) CN112672062B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116034580A (en) * 2020-08-21 2023-04-28 海信视像科技股份有限公司 Portrait positioning method and display equipment
CN113542832B (en) * 2021-07-01 2023-07-04 深圳创维-Rgb电子有限公司 Display control method, display device and computer readable storage medium
CN114098632B (en) * 2022-01-27 2022-11-29 北京鹰瞳科技发展股份有限公司 Method for controlling a motor in a fundus camera and related product
CN114399949B (en) * 2022-01-28 2023-11-28 苏州华星光电技术有限公司 Adjustable display device and adjusting method thereof

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1742169B1 (en) * 2005-07-05 2012-02-22 Omron Corporation Tracking apparatus
CN103475849B (en) * 2013-09-22 2016-06-08 广东欧珀移动通信有限公司 Method photographic head shooting angle being adjusted when video calling and device
CN105357442A (en) * 2015-11-27 2016-02-24 小米科技有限责任公司 Shooting angle adjustment method and device for camera
KR20170064242A (en) * 2015-12-01 2017-06-09 삼성전자주식회사 Method and Electronic Apparatus for Providing Video Call
CN105554391B (en) * 2015-12-31 2019-05-14 广州广电运通金融电子股份有限公司 Control method and device, the finance device terminal of camera
CN108733280A (en) * 2018-03-21 2018-11-02 北京猎户星空科技有限公司 Focus follower method, device, smart machine and the storage medium of smart machine
CN109977770B (en) * 2019-02-21 2023-06-27 安克创新科技股份有限公司 Automatic tracking shooting method, device, system and storage medium
CN110460772B (en) * 2019-08-14 2021-03-09 广州织点智能科技有限公司 Camera automatic adjustment method, device, equipment and storage medium
CN110855891A (en) * 2019-11-26 2020-02-28 爱菲力斯(深圳)科技有限公司 Method and device for adjusting camera shooting angle based on human body posture and robot
CN111147749A (en) * 2019-12-31 2020-05-12 宇龙计算机通信科技(深圳)有限公司 Photographing method, photographing device, terminal and storage medium

Also Published As

Publication number Publication date
CN112672062A (en) 2021-04-16

Similar Documents

Publication Publication Date Title
CN113330736B (en) Display and image processing method
CN112672062B (en) Display device and portrait positioning method
CN112866772B (en) Display device and sound image character positioning and tracking method
CN111669508A (en) Camera control method and display device
CN112866773B (en) Display equipment and camera tracking method in multi-person scene
CN112291599B (en) Display device and method for adjusting angle of camera
CN111970548B (en) Display device and method for adjusting angle of camera
CN112073788A (en) Video data processing method and device and display equipment
CN112565862A (en) Display equipment and equipment parameter memorizing method and restoring method thereof
CN111835969A (en) Interactive method for controlling angle of camera and display equipment
CN114630053A (en) HDR image display method and display equipment
CN113473024A (en) Display device, holder camera and camera control method
CN114430492B (en) Display device, mobile terminal and picture synchronous scaling method
CN111939561B (en) Display device and interaction method
CN113825002A (en) Display device and focus control method
CN112473121A (en) Display device and method for displaying dodging ball based on limb recognition
CN112218145A (en) Smart television, VR display device and related methods
CN111078926A (en) Method for determining portrait thumbnail image and display equipment
CN113824870A (en) Display device and camera angle adjusting method
CN112399235B (en) Camera shooting effect enhancement method and display device of intelligent television
WO2021218473A1 (en) Display method and display device
CN112261289B (en) Display device and AI algorithm result acquisition method
CN112218156B (en) Method for adjusting video dynamic contrast and display equipment
CN112437284A (en) Projection picture correction method, terminal equipment and display equipment
CN111931692A (en) Display device and image recognition method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant