WO2020015468A1 - Image transmission method and apparatus, terminal device, and storage medium - Google Patents

Image transmission method and apparatus, terminal device, and storage medium Download PDF

Info

Publication number
WO2020015468A1
WO2020015468A1 PCT/CN2019/089353 CN2019089353W WO2020015468A1 WO 2020015468 A1 WO2020015468 A1 WO 2020015468A1 CN 2019089353 W CN2019089353 W CN 2019089353W WO 2020015468 A1 WO2020015468 A1 WO 2020015468A1
Authority
WO
WIPO (PCT)
Prior art keywords
user
target image
image
data
transmitted
Prior art date
Application number
PCT/CN2019/089353
Other languages
French (fr)
Chinese (zh)
Inventor
付阳
王云飞
黄通兵
Original Assignee
北京七鑫易维信息技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京七鑫易维信息技术有限公司 filed Critical 北京七鑫易维信息技术有限公司
Publication of WO2020015468A1 publication Critical patent/WO2020015468A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/011Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
    • G06F3/013Eye tracking input arrangements
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/14Digital output to display device ; Cooperation and interconnection of the display device with other functional units
    • G06F3/1454Digital output to display device ; Cooperation and interconnection of the display device with other functional units involving copying of the display data of a local workstation or window to a remote workstation or window so that an actual copy of the data is displayed simultaneously on two or more displays, e.g. teledisplay

Definitions

  • Embodiments of the present invention relate to the field of transmission technologies, and in particular, to an image transmission method, device, terminal device, and storage medium.
  • the display device is a device capable of outputting an image or a touch signal (for example, a braille display designed for the blind). After receiving the image data of an external signal source (such as a computer), the display device displays the corresponding image content in real time.
  • an external signal source such as a computer
  • the display device displays the corresponding image content in real time.
  • the original image is usually directly transmitted. Therefore, the amount of image data to be transmitted in the process of transmitting images is large, and the speed of transmitting images is slow.
  • the image transmission method proposed to solve the problem of a large number of image transmission processes generally compresses the entire image data, which results in poor definition of the image content displayed by the display device and affects the user experience of viewing the image.
  • An image transmission method, device, terminal device and storage medium provided by the present invention can effectively improve image compression efficiency.
  • an embodiment of the present invention provides an image transmission method, including:
  • an embodiment of the present invention further provides an image transmission device, including:
  • An eye image acquisition module configured to acquire an eye image when a user looks at a target image
  • a fixation point determining module configured to determine a fixation point of a user based on a line of sight corresponding to the eye image
  • a pupil radius determination module configured to obtain a pupil radius at which the user looks at the fixation point
  • the data to be transmitted determination module is configured to divide the target image into regions based on the user's fixation point, the degree of attention corresponding to the pupil radius, and the pixel value of each pixel in the target image, The priority of the assigned area to get the data to be transmitted;
  • a transmission module configured to transmit the data to be transmitted.
  • an embodiment of the present invention further provides a terminal device, including:
  • One or more processors are One or more processors;
  • a storage device configured to store one or more programs
  • the one or more programs are executed by the one or more processors, so that the one or more processors implement the image transmission method provided by the embodiment of the present invention.
  • an embodiment of the present invention further provides a computer-readable storage medium on which a computer program is stored.
  • the program is executed by a processor, the image transmission method provided by the embodiment of the present invention is implemented.
  • Embodiments of the present invention provide an image transmission method, device, terminal device, and storage medium.
  • the fixation point determined by the line of sight corresponding to the eye image when the user fixes the target image, the degree of attention corresponding to the pupil radius when the user fixes the fixation point, and the target image
  • the pixel value of each pixel points divides the target image, and then transmits it with different priorities (different priorities may correspond to different code rates or different rendering accuracy).
  • it effectively reduces the amount of data to be transmitted, thereby reducing the time required to transmit the target image, and improving the efficiency of image transmission; compared with compressing the target image as a whole, it effectively improves It improves the sharpness of the image of the user's area of interest and improves the compression effect of the target image.
  • FIG. 1a is a schematic flowchart of an image transmission method according to Embodiment 1 of the present invention.
  • FIG. 1b is a schematic diagram of an eye image provided by Embodiment 1 of the present invention.
  • FIG. 2a is a schematic flowchart of an image transmission method according to a second embodiment of the present invention.
  • FIG. 2b is a schematic diagram of an application scenario of an image transmission method provided in Embodiment 2 of the present invention.
  • FIG. 2c is a schematic diagram after assigning priorities to the divided target images according to the second embodiment of the present invention.
  • FIG. 3 is a schematic structural diagram of an image transmission device according to a third embodiment of the present invention.
  • FIG. 4 is a schematic structural diagram of a terminal device according to a fourth embodiment of the present invention.
  • FIG. 1a is a schematic flowchart of an image transmission method according to Embodiment 1 of the present invention.
  • the method is applicable to a case where a target image is transmitted between different image transmission devices (such as a terminal device and a display device or between different terminal devices).
  • This method may be performed by an image transmission apparatus provided in an embodiment of the present invention, where the apparatus may be implemented by software and / or hardware, and is generally integrated on a terminal device.
  • the terminal device includes, but is not limited to, a computer, a mobile phone, or a handheld computer.
  • an image transmission method provided in Embodiment 1 of the present invention includes the following steps:
  • the image transmission method may be applied to a terminal device.
  • the target image can be understood as an image to be processed in the terminal device.
  • the user can directly stare at the target image in the terminal device, and then process the target image based on the eye image when the user looks at the target image, and send the processed data to the display device or other terminal device.
  • an eye image when a user looks at a target image can be obtained through an image acquisition device provided on a terminal device.
  • the user when acquiring the eye image when the user looks at the target image, the user can also wear an AR device, and an image acquisition device is provided on the AR device to collect the eye image when the user looks at the target image.
  • This step can be achieved by The communication connection established by the AR device acquires the eye image when the user looks at the target image; if the user wears a VR device, the terminal device can be built into the VR device, the target image is displayed by the terminal device, and the user's gaze target image is obtained by the image acquisition device Eye image of the time, wherein the image acquisition device can be set on the VR device.
  • the terminal device may acquire an eye image when the user looks at the target image through a communication connection established with the VR device.
  • an image of the eye of the user when the user looks at the target image may be obtained through an image acquisition device on the terminal device.
  • the image acquisition device may be a common camera or an infrared camera.
  • the image acquisition device in this embodiment is an infrared camera
  • the infrared camera needs to be configured with an infrared lamp to be set to manufacture a light spot in an eye diagram, so that the user's sight can be determined in combination with the characteristics of the pupil of the user.
  • the fixation point can be understood as the position where the user fixates in the target image.
  • the eye image can be identified through the eye recognition algorithm to determine the line of sight corresponding to the eye image, and then the user's line of sight can be determined based on the determined line of sight.
  • the coordinates in the image to get the user's fixation point; or the fixation point of the user can be determined through a pre-built fixation point model.
  • this step may determine the direction of the user's line of sight based on the pupil characteristics in the eye image.
  • pupil characteristics in the eye image and spot information presented by the infrared lamp in the eye image may be acquired, and then according to the acquired The pupil feature and spot information are determined by the corneal reflection method to determine the direction of the user's line of sight in the eye image.
  • the main hardware requirements include, but are not limited to: a light source: generally an infrared light source, because infrared light will not affect the vision of the eye; and multiple infrared light sources can be arranged in a predetermined manner , Such as a character shape, a shape, etc .; image acquisition device: such as infrared camera equipment, infrared image sensor, camera or video camera.
  • the process of determining the user's line of sight includes:
  • an eye image the light source shines on the user's eye, and the user's eye is photographed by the image acquisition device, and the corresponding reflection point on the cornea, that is, the light spot (also known as Purkin's spot) is captured, thereby obtaining the Eye image.
  • the light spot also known as Purkin's spot
  • perform gaze / fixation point estimation as the eyeball rotates, the relative positional relationship between the pupil center and the light spot changes, and several eye images with light spots collected correspondingly reflect this position change relationship; according to the description
  • the position change relationship is estimated by the line of sight / fixation point.
  • eye tracking can also be called gaze tracking, which is a technique for estimating the eye's sight and / or gaze point by measuring the movement of the user's eyes.
  • Optical recording method is currently widely used: using an image acquisition device (such as a camera or video camera) to record the subject's eye movements, that is, to obtain an eye image that reflects the eye movement, and to extract an eye from the acquired eye image
  • the eye features may include: pupil position, pupil shape, iris position, iris shape, eyelid position, eye corner position, and / or light spot (also referred to as Purchin spot) position.
  • a contact / non-contact sensor such as an electrode or a capacitance sensor
  • a contact / non-contact sensor can also be used to estimate the eye movement.
  • FIG. 1b shows a schematic diagram of an eye image provided in Embodiment 1 of the present invention.
  • FIG. 1b shows one eye, which is not a limitation on the number of eyes.
  • the eye image obtained in this embodiment may also include two eyes. Eyes only to improve the accuracy of the target image processing.
  • an eye feature (feature information) when the user looks at the target image may be determined based on the obtained eye image, so as to determine a gaze point corresponding to the user's line of sight direction.
  • the direction of the user's line of sight can be obtained by analyzing the pupil 11 or the iris 12 in the eye image.
  • the spot information contained in the eye image can also be acquired to assist in obtaining the direction of the user's line of sight.
  • the pupil radius at which the user fixes the fixation point can be obtained.
  • both the positive and negative movements of the eye can be obtained by analyzing the pupil radius, so that the pupil radius can be analyzed to determine the positive and negative movements of the eye, so that the user's degree of attention to the current object of interest can be obtained.
  • the corresponding degree of attention can be determined by the pupil radius when the user pays attention to the target image, so as to obtain different processing methods for the target image.
  • an eye image obtained for judging a user's fixation point may be analyzed to determine a pupil radius at which the user fixes the fixation point; or when the user fixes the fixation point, the pupil radius of the fixation point is obtained.
  • edge information of a pupil in an eye image may be extracted, and then a corresponding pupil radius may be determined based on the edge information of the pupil.
  • S104 Divide the target image into regions based on the user's gaze point, the degree of attention corresponding to the pupil radius, and the pixel value of each pixel in the target image, and assign priorities to the divided regions. Get the data to be transmitted.
  • the degree of attention can be understood as the degree of interest of the user in the viewing object determined based on the characteristics of the user's eyes (such as the pupil radius).
  • the image to be transmitted can be understood as data to be transmitted based on the divided target image.
  • this step may determine the corresponding degree of attention based on the pupil radius and a preset pupil data table. Then combine the user's gaze point with the pixel value of each pixel of the target image to divide the target image into regions.
  • the degree of attention may also be determined in combination with the remaining eye features in the eye image. For example, to obtain the pupil features (such as pupil radius and pupil position) and spot features of the gaze point of the user, determine the fixation information based on the pupil position and spot features (fixation information includes the duration of fixation, the number of fixations, and / or the first fixation time), and The information and pupil radius determine the degree of attention.
  • the fixation information includes the duration of fixation, the number of fixations, and / or the first fixation time
  • the preset pupil data table may be a general-purpose data table obtained through training; it may also be a dedicated data table obtained by training for different users (the data tables corresponding to different users are different). If it is a special data table, after obtaining the eye image of the user when gazing at the target image, the iris features in the eye image can also be extracted to identify the user, thereby determining the user's gaze point The corresponding degree of attention can be determined after the pupil radius of.
  • this step may first be based on each pixel of the user's target image
  • the pixel values of the points perform edge segmentation on the target image, and then select each closed area and the area containing the gaze point.
  • the divided areas are assigned priorities based on the user's attention to the gaze point. This can effectively re-segment the target image based on the user's gaze point and attention and assign priorities.
  • the attention degree may have a one-to-one correspondence relationship with the priority of the divided regions. Based on the determined attention degree, different priorities can be assigned to the divided regions. Different attention levels correspond to different priorities.
  • priorities may correspond to different processing methods based on different application scenarios. For example, higher priority areas are assigned higher code rates, lower priority areas are assigned lower code rates; higher priority areas are assigned higher resolutions, and lower priority areas are assigned lower resolutions. Areas with higher rates or priorities use larger compression ratios, and areas with lower priorities use smaller compression ratios. It should be noted that the processing methods of regions with different priorities are not limited here. Different priorities can be assigned to different areas for different degrees of attention, and different processing methods can be used for different priorities.
  • this embodiment may also continue to obtain the fixation point and the corresponding pupil radius of the user gazing at the target image until the acquired degree of attention is greater than a certain threshold. Then divide the target image and assign priorities to the divided regions. This can effectively ensure that the user's attention is more interesting to the user
  • this step can perform different processing on each region to which the priority is assigned to obtain data to be transmitted, and the to-be-transmitted data can store data in each region.
  • the location information of the pixel point and the corresponding pixel value are not limited in the storage form, and may be stored in accordance with the location information or in areas.
  • this step may transmit the data to be transmitted to the display device, so that the display device displays an image corresponding to the data to be transmitted.
  • the display device can receive the data to be transmitted one by one, and then display the corresponding content on the display device.
  • An image transmission method provided in Embodiment 1 of the present invention is to first obtain an eye image when a user fixes a target image; second, to determine the user's fixation point based on the line of sight corresponding to the eye image; and then obtain the user fixation location
  • the pupil radius of the fixation point is described; the target image is then divided into regions based on the user's fixation point, the degree of attention corresponding to the pupil radius, and the pixel value of each pixel in the target image, and after each division, Assign priority to the area to obtain data to be transmitted; and finally transmit the data to be transmitted.
  • the fixation point determined by the line of sight corresponding to the eye image when the user fixes the target image, the degree of attention corresponding to the pupil radius when the user fixes the fixation point, and each of the target image
  • the pixel value of the pixel points divides the target image, and then processes and transmits it with different priorities. Compared with directly transmitting the target image, it effectively reduces the amount of data to be transmitted, thereby reducing the time required to transmit the target image, and increasing the efficiency of image transmission; compared to compressing the target image as a whole, it is effective
  • the sharpness of the image of the user's area of interest is improved, and the compression effect of the target image is improved.
  • FIG. 2a is a schematic flowchart of an image transmission method according to the second embodiment of the present invention.
  • This second embodiment is optimized based on the foregoing embodiments.
  • the fixation point of the user is determined based on the line of sight corresponding to the eye image, which is further embodied as: extracting feature information in the eye image, where the feature information includes pupil features; The feature information determines a line of sight corresponding to the eye image; and determines a gaze point of the user in the target image according to the determined line of sight.
  • a pupil radius at which the user fixes the fixation point is further obtained, and is further optimized as: identifying an eye image of the user fixation at the fixation point, and determining the user's fixation on the fixation point. Pupil radius.
  • the target image will be divided into regions based on the user's fixation point, the degree of attention corresponding to the pupil radius, and the pixel value of each pixel in the target image.
  • Priority is assigned to the area to obtain data to be transmitted, and the optimization is specifically: comparing the pupil radius with a preset pupil data table to determine the corresponding degree of attention; using an edge operator to perform edge detection on the target image to determine Extract each closed area in the target image; select a first area containing the user's gaze point from the determined closed areas, and use the area in the target image other than the first area as the second Area; according to the attention degree and a preset attention degree comparison table, assigning corresponding priorities to the first area and the second area to obtain data to be transmitted.
  • this embodiment further transmits the data to be transmitted, which is specifically optimized to transmit position information and corresponding pixel values of each pixel in the data to be transmitted in a preset order.
  • this embodiment further includes: when the startup instruction is detected, displaying the target image.
  • the startup instruction is detected, displaying the target image.
  • an image transmission method provided in Embodiment 2 of the present invention includes the following steps:
  • the startup instruction may be understood as an instruction to start the terminal device to perform image processing.
  • this step may first monitor whether there is a startup instruction. If the startup instruction is monitored, the target image can be displayed on the terminal device, so that the target image can be processed according to the user's eye image. Thereby, the corresponding data to be transmitted is obtained.
  • the startup instruction in this step may only be used to start displaying the first frame image in the video, and subsequent image frames in the video may be No need to listen to the start instruction to process the data to be transmitted.
  • the startup instruction may be set according to actual conditions, may be generated by setting key control on the terminal device, or may be generated by collecting specific actions of the user's eye image, which is not limited here.
  • this step can analyze the eye image when the user looks at the target image to obtain the corresponding data to be transmitted.
  • this step can analyze the eye image when the user looks at the target image to obtain the corresponding data to be transmitted.
  • the eye image After obtaining the eye image of the user looking at the target image, in this step, the eye image can be identified, and feature information in the eye image is extracted to determine the line of sight of the user looking at the target image based on the determined feature information.
  • the feature information may include pupil features, such as pupil edge features, pupil radius, and / or pupil center position.
  • the feature information may further include light spot information.
  • this step may determine a corresponding line of sight by using a corneal reflection method based on the pupil feature and the spot information in the feature information.
  • the user's line of sight can be determined through feature information and a pre-built comparison table of features and line of sight.
  • a contact / non-contact sensor such as an electrode or a capacitance sensor
  • this step can determine the gaze point coordinates of the line of sight in the target image based on the line of sight, and then determine the user's gaze point in the target image based on the gaze point coordinates.
  • the line of sight can be understood as a three-dimensional vector
  • the fixation point can be understood as the two-dimensional coordinates of the above-mentioned three-dimensional vector projected on a certain plane.
  • this step may further identify the eye image of the fixation point when the user fixes the fixation point, and determine the pupil radius of the fixation point when the user fixes the fixation point. Decide how to process the target image.
  • the eye image may be processed to obtain a grayscale gradient value of the eye image in a specified direction, and then the position where the grayscale gradient value reaches the maximum value is determined as the pupil edge position. After determining the pupil edge position, it is fitted, and then the radius of the fitted figure is determined to obtain the corresponding pupil radius.
  • S207 Compare the pupil radius with a preset pupil data table to determine a corresponding degree of attention.
  • the preset pupil data table can be understood as a comparison table of pupil radius and attention degree obtained in advance. After determining the pupil radius of the user's gaze point, in this step, a preset pupil data table can be searched to obtain the degree of attention corresponding to the pupil radius.
  • S208 Use an edge operator to perform edge detection on the target image to determine each closed area in the target image.
  • the edge operator can be understood as an operator that performs edge detection on the target image based on the pixel value of each pixel point of the target image.
  • a Laplacian-of-Gaussian (LoG) operator For example, a Roberts operator, or a Prewitt operator.
  • each edge information in the target image can be determined.
  • a region with continuous edges can be selected to form a closed region, and the area except the closed regions in the target image can be used to form a closed region.
  • a region containing the fixation point can be selected as the first region from each closed region based on the coordinates of the fixation point, and then the area other than the first region in the target image is used as the first region.
  • the second region thereby dividing the target image into a region containing a fixation point and a region not containing a fixation point.
  • the attention degree comparison table may be understood as a preset correspondence relationship between the attention degree and the priority, and the set relationship may be obtained through training.
  • this step may look up a preset attention degree comparison table to determine the corresponding priorities, and then assign corresponding priorities to the first area and the second area to obtain transfer data.
  • the preset order may be understood as a preset transmission order of data to be transmitted.
  • the position information can be understood as coordinate information.
  • this step may transmit the position information of each pixel in the data to be transmitted and the corresponding pixel value in a preset order, so that the display device performs an image based on the received position information and the corresponding pixel value. display.
  • the setting order can be set according to the actual application, can be determined based on the position of each pixel, or can be determined after the divided area.
  • FIG. 2b is a schematic diagram of an application scenario of an image transmission method provided in Embodiment 2 of the present invention.
  • the user 211 wears the AR device 212 to watch videos or images in the terminal device 213.
  • the terminal device 213 may be a computer, which is not limited herein.
  • the terminal device 213 may also be a device such as a mobile phone or a palmtop computer.
  • the process of image transmission based on this application scenario may be: after the terminal device 213 obtains the startup instruction, the target image may be displayed on the display screen of the terminal device 213, and then the terminal device 213 may obtain the communication connection established with the AR device 212
  • the gaze point of the image Then determine the pupil radius at which the user 211 looks at the fixation point, look up the degree of attention corresponding to the pupil radius in the preset pupil data table, and use the degree of attention to assign priorities to the first region and the second region after the target image is divided.
  • an edge operator may be used to perform edge detection on the target image to determine each closed area, and then select a first area and a second area including the gaze point from each closed area.
  • the terminal device 213 can acquire the eye image when the user 211 fixes the target image through the image acquisition device provided on the terminal device 213. If the user 211 is wearing a VR device, the terminal device 213 may be built into the VR device, and obtain the eye image of the user 211 gazing at the target image collected by the image acquisition device in the VR device through a communication connection established with the VR device.
  • FIG. 2c is a schematic diagram after assigning priorities to the divided target images according to the second embodiment of the present invention.
  • a target phone 2 displays a mobile phone 221 and a host 222
  • the user 211 gaze point is located on the mobile phone 221
  • the closed area formed by the edge of the mobile phone 221 in the target image 2 (For example, the area formed by the outer contour of the mobile phone 221) may be a first area
  • the area other than the first area in the target image 2 may be a second area.
  • the priority of the first region and the second region can be determined by analyzing the degree of attention corresponding to the pupil radius of the user 211's gaze point.
  • a higher priority can be set for the first region, and accordingly, the rendering accuracy or bit rate of the first region can be improved, and the rendering accuracy or bit rate of the second region can be reduced, thereby
  • the mobile phone 221 in the first area can be displayed clearly, and the host 222 in the second area can be displayed blurry.
  • An image transmission method provided in Embodiment 2 of the present invention embodies a gaze point determination operation, a pupil radius determination operation, a data to be transmitted determination operation, and a data to be transmitted transmission operation.
  • the display target image operation has been optimized. With this method, the target image can be displayed after the startup instruction is monitored, so as to determine the corresponding line of sight based on the feature information in the eye image when the user looks at the target image, and then determine the user's gaze point and look at the The degree of attention corresponding to the pupil radius when gazing at the point.
  • Each closed area in the target image is determined by the edge operator, and then the first area containing the fixation point and the second area except the first area in the target image are selected, and the first area and The second area is assigned a priority to obtain the data to be transmitted. Finally, the position information of each pixel in the data to be transmitted and the corresponding pixel value are transmitted in a preset order. On the basis of reducing the number of transmissions, the target image is compressed. The efficiency enables the places with high user attention to be displayed clearly and the places with low attention to be displayed ambiguously, and can effectively assign appropriate priorities to the first and second regions according to the user's degree of attention at the gaze point.
  • FIG. 3 is a schematic structural diagram of an image transmission apparatus according to Embodiment 3 of the present invention.
  • the apparatus is applicable to a case where a target image is transmitted between different image transmission devices (such as a terminal device and a display device).
  • the device may be implemented by software and / or hardware, and is generally integrated on a terminal device.
  • the image transmission device includes an eye image acquisition module 31, a fixation point determination module 32, a pupil radius determination module 33, a data to be transmitted determination module 34, and a transmission module 35.
  • the eye image acquisition module 31 is configured to acquire an eye image when a user looks at a target image
  • a fixation point determining module 32 configured to determine a fixation point of a user based on a line of sight corresponding to the eye image
  • the pupil radius determination module 33 is configured to obtain a pupil radius at which the user looks at the gaze point;
  • the to-be-transmitted data determining module 34 is configured to divide the target image into regions based on the user's fixation point, the degree of attention corresponding to the pupil radius, and the pixel value of each pixel in the target image, and divide the target image into segments. Subsequent areas are assigned priorities to get data to be transmitted;
  • the transmission module 35 is configured to transmit the data to be transmitted.
  • the image transmission device first obtains the eye image when the user fixes the target image through the eye image acquisition module 31; secondly, the gaze point determination module 32 determines the user's gaze point based on the line of sight corresponding to the eye image ; Then obtain the pupil radius of the gaze point of the user through the pupil radius determination module 33; and then, through the pending data determination module 34, based on the user's fixation point, the degree of attention corresponding to the pupil radius, and the target image
  • the pixel value of each pixel in the target area is used to divide the target image, and priority is assigned to each divided area to obtain data to be transmitted; finally, the data to be transmitted is transmitted through the transmission module 35.
  • An image transmission device can, before a terminal device transmits a target image, a fixation point determined by a line of sight corresponding to an eye image when a user fixes on the target image, and a pupil radius corresponding to the fixation point when the user fixes the fixation point.
  • the degree of attention and the pixel value of each pixel in the target image divide the target image, and then process and transmit it with different priorities. Compared with directly transmitting the target image, it effectively reduces the amount of data to be transmitted, thereby reducing the time required to transmit the target image, and increasing the efficiency of image transmission; compared to compressing the target image as a whole, it is effective
  • the sharpness of the image of the user's area of interest is improved, and the compression effect of the target image is improved.
  • the fixation point determining module 32 is specifically configured to: extract feature information in the eye image, the feature information including pupil characteristics; determine a line of sight corresponding to the eye image according to the feature information; and The out of sight determines the gaze point of the user in the target image.
  • the pupil radius determination module 33 is specifically configured to identify an eye image of the user looking at the fixation point, and determine a pupil radius of the user looking at the fixation point.
  • the data to be transmitted determination module 34 is specifically configured to: compare the pupil radius with a preset pupil data table to determine a corresponding degree of attention; use an edge operator to perform edge detection on the target image, Determine each closed area in the target image; select a first area containing the user's gaze point from the determined closed areas, and use the area other than the first area in the target image as a first Two regions; assigning corresponding priorities to the first region and the second region according to the attention degree and a preset attention degree comparison table to obtain data to be transmitted.
  • the transmission module 35 is specifically configured to transmit position information and corresponding pixel values of each pixel in the data to be transmitted in a preset order.
  • the image transmission device further includes a target image display module 36 configured to display a target image when a startup instruction is detected.
  • the above-mentioned image transmission device can execute the image transmission method provided by any embodiment of the present invention, and has corresponding function modules and beneficial effects of executing the method.
  • FIG. 4 is a schematic structural diagram of a terminal device according to a fourth embodiment of the present invention.
  • the terminal device provided in the fourth embodiment of the present invention includes: one or more processors 41 and a storage device 42; the processor 41 in the terminal device may be one or more, and FIG.
  • the storage device 42 is taken as an example; the storage device 42 is configured to store one or more programs; the one or more programs are executed by the one or more processors 41, so that the one or more processors 41 are implemented as the present invention
  • the image transmission method according to any one of the embodiments.
  • the terminal device may further include an input device 43 and an output device 44.
  • the processor 41, the storage device 42, the input device 43, and the output device 44 in the terminal device may be connected through a bus or other manners.
  • the connection through the bus is taken as an example.
  • the storage device 42 in the terminal device may be configured to store one or more programs.
  • the programs may be software programs, computer-executable programs, and modules, as in the first or second embodiment of the present invention.
  • Program instructions / modules corresponding to the provided image transmission method include: eye image acquisition module 31, fixation point determination module 32, pupil radius determination module 33, to be transmitted
  • the data determination module 34 and the transmission module 35 further include a target image display module 36).
  • the processor 41 runs software programs, instructions, and modules stored in the storage device 42 to execute various functional applications and data processing of the terminal device, that is, to implement the image transmission method in the foregoing method embodiment.
  • the storage device 42 may include a storage program area and a storage data area, wherein the storage program area may store an operating system and application programs required for at least one function; the storage data area may store data created according to the use of the device, and the like.
  • the storage device 42 may include a high-speed random access memory, and may further include a non-volatile memory, such as at least one magnetic disk storage device, a flash memory device, or other non-volatile solid-state storage device.
  • the storage device 42 may further include memories remotely provided with respect to the processor 41, and these remote memories may be connected to the device through a network. Examples of the above network include, but are not limited to, the Internet, an intranet, a local area network, a mobile communication network, and combinations thereof.
  • the input device 43 may be configured to receive inputted numeric or character information, and generate key signal input related to user settings and function control of the terminal device or acquire an eye image when the user looks at the target image.
  • the input device 43 may include, but is not limited to, an image acquisition device (such as an infrared camera configured with an infrared lamp), input devices such as keys and / or a microphone.
  • the output device 44 may include, but is not limited to, a display screen.
  • the program when the one or more programs included in the terminal device are executed by the one or more processors 41, the program performs the following operations: acquiring an eye image when the user looks at the target image; and based on the corresponding eye image, The line of sight determines the user's gaze point; obtains the pupil radius of the user's gaze point; based on the user's gaze point, the degree of attention corresponding to the pupil radius, and the pixel value of each pixel in the target image Divide the target image into regions, and assign priorities to the divided regions to obtain data to be transmitted; and transmit the data to be transmitted.
  • an embodiment of the present invention further provides a computer storage medium on which a computer program is stored.
  • the computer storage medium may include a readable storage medium and / or a writable storage medium.
  • the method includes: acquiring an eye image when a user fixes a target image; determining a user's gaze point based on a line of sight corresponding to the eye image; The pupil radius of the user gazing at the fixation point; dividing the target image into regions based on the user's fixation point, the degree of attention corresponding to the pupil radius, and the pixel value of each pixel in the target image, and Priority is assigned to each divided area to obtain data to be transmitted; and the data to be transmitted is transmitted.
  • the program when executed by the processor, the program may also be used to execute a technical solution of an image transmission method provided by any embodiment of the present invention.
  • the present invention can be implemented by software and necessary general hardware, and of course, can also be implemented by hardware, but in many cases the former is a better implementation .
  • the technical solution of the present invention that is essentially or contributes to the existing technology can be embodied in the form of a software product, which can be stored in a computer-readable storage medium, such as a computer floppy disk , Read-only memory (ROM), random access memory (RAM), flash memory (FLASH), hard disk or optical disk, etc., including several instructions to make a computer device (can be a personal computer , Server, or network device, etc.) perform the methods described in the embodiments of the present invention.
  • a computer-readable storage medium such as a computer floppy disk , Read-only memory (ROM), random access memory (RAM), flash memory (FLASH), hard disk or optical disk, etc.
  • ROM Read-only memory
  • RAM random access memory
  • FLASH flash memory
  • hard disk or optical disk etc.
  • the solution provided by the embodiment of the present invention may be applied to an image transmission process.
  • the solution first obtains an eye image when a user fixes a target image; secondly, determines a user's fixation point based on a line of sight corresponding to the eye image; Obtaining a pupil radius at which the user fixates on the fixation point; and thereafter dividing the target image into regions based on the fixation point of the user, a degree of attention corresponding to the pupil radius, and a pixel value of each pixel in the target image And assign priority to each divided area to obtain data to be transmitted; and finally transmit the data to be transmitted.
  • the fixation point determined by the line of sight corresponding to the eye image when the user fixes the target image, the degree of attention corresponding to the pupil radius when the user fixes the fixation point, and the target image
  • the pixel value of each pixel points divides the target image, and then transmits it with different priorities (different priorities may correspond to different code rates or different rendering accuracy).
  • it effectively reduces the amount of data to be transmitted, thereby reducing the time required to transmit the target image, and improving the efficiency of image transmission; compared with compressing the target image as a whole, it effectively improves It improves the sharpness of the image of the user's area of interest and improves the compression effect of the target image.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

Disclosed in the present invention are an image transmission method and apparatus, a terminal device, and a storage medium. Said method comprises: acquiring an eye image when a user gazes at a target image; determining a gaze point of the user on the basis of the line of sight corresponding to the eye image; acquiring a pupil radius when the user gazes at the gaze point; on the basis of the user's gaze point, the degree of attention corresponding to the pupil radius, and the pixel values of pixel points in the target image, performing area division on the target image, and assigning priorities for the divided areas, so as to obtain data to be transmitted; and transmitting said data.

Description

一种图像传输方法、装置、终端设备及存储介质Image transmission method, device, terminal equipment and storage medium 技术领域Technical field
本发明实施例涉及传输技术领域,尤其涉及一种图像传输方法、装置、终端设备及存储介质。Embodiments of the present invention relate to the field of transmission technologies, and in particular, to an image transmission method, device, terminal device, and storage medium.
背景技术Background technique
显示设备是一种能够输出图像或感触信号(例如为盲人设计的盲文显示器)的设备,显示设备接收外部信号源(如计算机)的图像数据后,将对应的图像内容实时进行显示。在计算机向显示设备传输图像的过程中,通常直接将原始图像进行传输。因此,传输图像的过程中需要传输的图像数据量较大,传输图像的速度较慢。The display device is a device capable of outputting an image or a touch signal (for example, a braille display designed for the blind). After receiving the image data of an external signal source (such as a computer), the display device displays the corresponding image content in real time. In the process of a computer transmitting an image to a display device, the original image is usually directly transmitted. Therefore, the amount of image data to be transmitted in the process of transmitting images is large, and the speed of transmitting images is slow.
目前,为解决图像传输过程中数量大的问题所提出的图像传输方法通常是将图像数据整体进行压缩,这就导致了显示设备显示的图像内容清晰度差,影响了用户观看图像的体验。At present, the image transmission method proposed to solve the problem of a large number of image transmission processes generally compresses the entire image data, which results in poor definition of the image content displayed by the display device and affects the user experience of viewing the image.
发明内容Summary of the invention
本发明提供的一种图像传输方法、装置、终端设备及存储介质,能够有效提高图像压缩效率。An image transmission method, device, terminal device and storage medium provided by the present invention can effectively improve image compression efficiency.
第一方面,本发明实施例提供了一种图像传输方法,包括:In a first aspect, an embodiment of the present invention provides an image transmission method, including:
获取用户注视目标图像时的眼部图像;Obtaining an eye image when a user looks at a target image;
基于所述眼部图像对应的视线确定所述用户的注视点;Determining a gaze point of the user based on a line of sight corresponding to the eye image;
获取所述用户注视所述注视点的瞳孔半径;Obtaining a pupil radius at which the user looks at the fixation point;
基于所述用户的注视点、所述瞳孔半径对应的关注度和所述目标图像中各像素点的像素值将所述目标图像进行区域划分,并为各划分后的区域分配优先级,得到待传输数据;Divide the target image into regions based on the user's fixation point, the degree of attention corresponding to the pupil radius, and the pixel value of each pixel in the target image, and assign priorities to the divided regions to obtain transfer data;
传输所述待传输数据。Transmitting the data to be transmitted.
第二方面,本发明实施例还提供了一种图像传输装置,包括:In a second aspect, an embodiment of the present invention further provides an image transmission device, including:
眼部图像获取模块,设置为获取用户注视目标图像时的眼部图像;An eye image acquisition module configured to acquire an eye image when a user looks at a target image;
注视点确定模块,设置为基于所述眼部图像对应的视线确定用户的注视点;A fixation point determining module, configured to determine a fixation point of a user based on a line of sight corresponding to the eye image;
瞳孔半径确定模块,设置为获取所述用户注视所述注视点的瞳孔半径;A pupil radius determination module, configured to obtain a pupil radius at which the user looks at the fixation point;
待传数据确定模块,设置为基于所述用户的注视点、所述瞳孔半径对应的关注度和所述目标图像中各像素点的像素值将所述目标图像进行区域划分,并为各划分后的区域分配优先级,得到待传输数据;The data to be transmitted determination module is configured to divide the target image into regions based on the user's fixation point, the degree of attention corresponding to the pupil radius, and the pixel value of each pixel in the target image, The priority of the assigned area to get the data to be transmitted;
传输模块,设置为传输所述待传输数据。A transmission module configured to transmit the data to be transmitted.
第三方面,本发明实施例还提供了一种终端设备,包括:According to a third aspect, an embodiment of the present invention further provides a terminal device, including:
一个或多个处理器;One or more processors;
存储装置,设置为存储一个或多个程序;A storage device configured to store one or more programs;
所述一个或多个程序被所述一个或多个处理器执行,使得所述一个或多个处理器实现本发明实施例提供的图像传输方法。The one or more programs are executed by the one or more processors, so that the one or more processors implement the image transmission method provided by the embodiment of the present invention.
第四方面,本发明实施例还提供了一种计算机可读存储介质,其上存储有计算机程序,该程序被处理器执行时实现本发明实施例提供的图像传输方法。According to a fourth aspect, an embodiment of the present invention further provides a computer-readable storage medium on which a computer program is stored. When the program is executed by a processor, the image transmission method provided by the embodiment of the present invention is implemented.
本发明实施例提供了一种图像传输方法、装置、终端设备及存储介质,首先获取用户注视目标图像时的眼部图像;其次基于所述眼部图像对应的视线确 定所述用户的注视点;然后获取所述用户注视所述注视点的瞳孔半径;之后基于所述用户的注视点、所述瞳孔半径对应的关注度和所述目标图像中各像素点的像素值将所述目标图像进行区域划分,并为各划分后的区域分配优先级,得到待传输数据;最后传输所述待传输数据。利用上述技术方案,能够在终端设备传输目标图像之前,通过用户注视目标图像时的眼部图像对应的视线所确定的注视点、用户注视该注视点时的瞳孔半径对应的关注度及目标图像中各像素点的像素值对目标图像进行划分,然后以不同的优先级进行传输(不同优先级可以对应不同的码率或不同的渲染精度)。相比于直接传输目标图像而言,有效的减少了传输的数据量,从而降低传输目标图像所需时间,提高了图像传输的效率;相比于将目标图像整体进行压缩而言,有效的提高了用户关注区域图像的清晰度,提高了目标图像压缩效果。Embodiments of the present invention provide an image transmission method, device, terminal device, and storage medium. First, an eye image of a user when gazing at a target image is acquired; secondly, a gaze point of the user is determined based on a line of sight corresponding to the eye image; Then, the pupil radius at which the user fixes the fixation point is obtained; and then the target image is area-based based on the user's fixation point, the degree of attention corresponding to the pupil radius, and the pixel value of each pixel in the target image. Dividing and assigning priorities to each divided area to obtain data to be transmitted; and finally transmitting the data to be transmitted. With the above technical solution, before the terminal device transmits the target image, the fixation point determined by the line of sight corresponding to the eye image when the user fixes the target image, the degree of attention corresponding to the pupil radius when the user fixes the fixation point, and the target image The pixel value of each pixel points divides the target image, and then transmits it with different priorities (different priorities may correspond to different code rates or different rendering accuracy). Compared with directly transmitting the target image, it effectively reduces the amount of data to be transmitted, thereby reducing the time required to transmit the target image, and improving the efficiency of image transmission; compared with compressing the target image as a whole, it effectively improves It improves the sharpness of the image of the user's area of interest and improves the compression effect of the target image.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1a为本发明实施例一提供的一种图像传输方法的流程示意图;FIG. 1a is a schematic flowchart of an image transmission method according to Embodiment 1 of the present invention; FIG.
图1b给出了本发明实施例一提供的眼部图像示意图;FIG. 1b is a schematic diagram of an eye image provided by Embodiment 1 of the present invention; FIG.
图2a为本发明实施例二提供的一种图像传输方法的流程示意图;2a is a schematic flowchart of an image transmission method according to a second embodiment of the present invention;
图2b给出了本发明实施例二提供的图像传输方法的应用场景示意图;FIG. 2b is a schematic diagram of an application scenario of an image transmission method provided in Embodiment 2 of the present invention; FIG.
图2c给出了本发明实施例二提供的对划分完的目标图像分配优先级后的示意图;FIG. 2c is a schematic diagram after assigning priorities to the divided target images according to the second embodiment of the present invention; FIG.
图3为本发明实施例三提供的一种图像传输装置的结构示意图;FIG. 3 is a schematic structural diagram of an image transmission device according to a third embodiment of the present invention; FIG.
图4为本发明实施例四提供的一种终端设备的结构示意图。FIG. 4 is a schematic structural diagram of a terminal device according to a fourth embodiment of the present invention.
具体实施方式detailed description
下面结合附图和实施例对本发明作进一步的详细说明。可以理解的是,此处所描述的具体实施例仅仅用于解释本发明,而非对本发明的限定。另外还需要说明的是,为了便于描述,附图中仅示出了与本发明相关的部分而非全部结构。The present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It can be understood that the specific embodiments described herein are only used to explain the present invention, rather than limiting the present invention. It should also be noted that, for the convenience of description, only some parts related to the present invention are shown in the drawings instead of all the structures.
在更加详细地讨论示例性实施例之前应当提到的是,一些示例性实施例被描述成作为流程图描绘的处理或方法。虽然流程图将各项操作(或步骤)描述成顺序的处理,但是其中的许多操作可以被并行地、并发地或者同时实施。此外,各项操作的顺序可以被重新安排。当其操作完成时所述处理可以被终止,但是还可以具有未包括在附图中的附加步骤。所述处理可以对应于方法、函数、规程、子例程、子程序等等。Before discussing the exemplary embodiments in more detail, it should be mentioned that some exemplary embodiments are described as processes or methods depicted as flowcharts. Although flowcharts describe operations (or steps) as sequential processing, many of these operations can be performed in parallel, concurrently, or simultaneously. In addition, the order of operations can be rearranged. The process may be terminated when its operation is completed, but may also have additional steps not included in the drawings. The processing may correspond to methods, functions, procedures, subroutines, subroutines, and so on.
实施例一Example one
图1a为本发明实施例一提供的一种图像传输方法的流程示意图,该方法可适用于不同图像传输设备间(如终端设备和显示设备间或不同终端设备间)对目标图像进行传输的情况。该方法可以由本发明实施例提供的图像传输装置来执行,其中该装置可由软件和/或硬件实现,并一般集成在终端设备上。在本实施例中终端设备包括但不限于:计算机、手机或掌上电脑等设备。FIG. 1a is a schematic flowchart of an image transmission method according to Embodiment 1 of the present invention. The method is applicable to a case where a target image is transmitted between different image transmission devices (such as a terminal device and a display device or between different terminal devices). This method may be performed by an image transmission apparatus provided in an embodiment of the present invention, where the apparatus may be implemented by software and / or hardware, and is generally integrated on a terminal device. In this embodiment, the terminal device includes, but is not limited to, a computer, a mobile phone, or a handheld computer.
如图1a所示,本发明实施例一提供的一种图像传输方法,包括如下步骤:As shown in FIG. 1a, an image transmission method provided in Embodiment 1 of the present invention includes the following steps:
S101、获取用户注视目标图像时的眼部图像。S101. Obtain an eye image when a user looks at a target image.
在本实施例中,图像传输方法可以应用在终端设备上。目标图像可以理解为终端设备中待处理的图像。In this embodiment, the image transmission method may be applied to a terminal device. The target image can be understood as an image to be processed in the terminal device.
可以理解的是,本步骤中用户可以直接注视终端设备中的目标图像,然后 基于用户注视目标图像时的眼部图像对目标图像进行处理,将处理后的数据发送至显示设备或其他终端设备。一般的,本步骤可以通过终端设备上设置的图像采集装置获取用户注视目标图像时的眼部图像。It can be understood that in this step, the user can directly stare at the target image in the terminal device, and then process the target image based on the eye image when the user looks at the target image, and send the processed data to the display device or other terminal device. Generally, in this step, an eye image when a user looks at a target image can be obtained through an image acquisition device provided on a terminal device.
此外,在获取用户注视目标图像时的眼部图像时,也可以使用户佩戴AR设备,在AR设备上设置有图像采集装置以采集用户注视目标图像时的眼部图像,本步骤则可以通过与AR设备建立的通信连接获取用户注视目标图像时的眼部图像;如果用户佩戴的是VR设备,终端设备可以内置于VR设备中,通过终端设备显示目标图像,通过图像采集装置获取用户注视目标图像时的眼部图像,其中图像采集装置可以设置在VR设备上。终端设备可以通过与VR设备建立的通信连接获取用户注视目标图像时的眼部图像。In addition, when acquiring the eye image when the user looks at the target image, the user can also wear an AR device, and an image acquisition device is provided on the AR device to collect the eye image when the user looks at the target image. This step can be achieved by The communication connection established by the AR device acquires the eye image when the user looks at the target image; if the user wears a VR device, the terminal device can be built into the VR device, the target image is displayed by the terminal device, and the user's gaze target image is obtained by the image acquisition device Eye image of the time, wherein the image acquisition device can be set on the VR device. The terminal device may acquire an eye image when the user looks at the target image through a communication connection established with the VR device.
具体的,本步骤可以通过终端设备上的图像采集装置获取用户注视目标图像时的眼部图像。其中,图像采集装置可以为普通摄像头,也可以为红外摄像头。当本实施例中的图像采集装置为红外摄像头时,该红外摄像头需要配置有红外灯,以设置为制造眼图中的光斑,从而可以结合用户瞳孔特征确定用户的视线。Specifically, in this step, an image of the eye of the user when the user looks at the target image may be obtained through an image acquisition device on the terminal device. The image acquisition device may be a common camera or an infrared camera. When the image acquisition device in this embodiment is an infrared camera, the infrared camera needs to be configured with an infrared lamp to be set to manufacture a light spot in an eye diagram, so that the user's sight can be determined in combination with the characteristics of the pupil of the user.
S102、基于所述眼部图像对应的视线确定所述用户的注视点。S102. Determine a gaze point of the user based on a line of sight corresponding to the eye image.
在本实施例中,注视点可以理解为目标图像中用户注视的位置。在获取用户注视目标图像时的眼部图像后,本步骤可以通过眼部识别算法对眼部图像进行识别,以确定出眼部图像对应的视线,然后可以基于确定出的视线确定用户视线在目标图像中的坐标,以得到用户的注视点;或可以通过预先构建的注视点模型确定用户的注视点。In this embodiment, the fixation point can be understood as the position where the user fixates in the target image. After obtaining the eye image when the user looks at the target image, in this step, the eye image can be identified through the eye recognition algorithm to determine the line of sight corresponding to the eye image, and then the user's line of sight can be determined based on the determined line of sight. The coordinates in the image to get the user's fixation point; or the fixation point of the user can be determined through a pre-built fixation point model.
具体的,当本实施例采用普通摄像头获取用户注视目标图像时的眼部图像 时,本步骤可以基于该眼部图像中的瞳孔特征确定用户的视线方向。当本实施例采用红外摄像头获取用户注视目标图像时的眼部图像时,本步骤可以获取该眼部图像中的瞳孔特征和红外灯在该眼部图像中所呈现的光斑信息,然后根据获取的瞳孔特征和光斑信息采用角膜反射法确定眼部图像中用户的视线方向。Specifically, when an ordinary camera is used in this embodiment to obtain an eye image when the user looks at the target image, this step may determine the direction of the user's line of sight based on the pupil characteristics in the eye image. When an infrared camera is used in this embodiment to obtain an eye image when a user looks at a target image, in this step, pupil characteristics in the eye image and spot information presented by the infrared lamp in the eye image may be acquired, and then according to the acquired The pupil feature and spot information are determined by the corneal reflection method to determine the direction of the user's line of sight in the eye image.
在采用红外摄像头确定用户的视线方向时,主要的硬件要求包括但不限于:光源:一般为红外光源,因为红外光线不会影响眼睛的视觉;并且可以为多个红外光源,以预定的方式排列,例如品字形、一字形等;图像采集装置:例如红外摄像设备、红外图像传感器、照相机或摄像机等。When using an infrared camera to determine the direction of the user's line of sight, the main hardware requirements include, but are not limited to: a light source: generally an infrared light source, because infrared light will not affect the vision of the eye; and multiple infrared light sources can be arranged in a predetermined manner , Such as a character shape, a shape, etc .; image acquisition device: such as infrared camera equipment, infrared image sensor, camera or video camera.
确定用户的视线方向的过程包括:The process of determining the user's line of sight includes:
首先获取眼部图像:光源照向用户眼睛,由图像采集装置对用户眼部进行拍摄,相应拍摄光源在角膜上的反射点即光斑(也称为普尔钦斑),由此获取带有光斑的眼部图像。然后进行视线/注视点估计:随着眼球转动时,瞳孔中心与光斑的相对位置关系随之发生变化,相应采集到的带有光斑的若干眼部图像反映出这样的位置变化关系;根据所述位置变化关系进行视线/注视点估计。First obtain an eye image: the light source shines on the user's eye, and the user's eye is photographed by the image acquisition device, and the corresponding reflection point on the cornea, that is, the light spot (also known as Purkin's spot) is captured, thereby obtaining the Eye image. Then perform gaze / fixation point estimation: as the eyeball rotates, the relative positional relationship between the pupil center and the light spot changes, and several eye images with light spots collected correspondingly reflect this position change relationship; according to the description The position change relationship is estimated by the line of sight / fixation point.
一般的,眼球追踪也可称为视线追踪,是通过测量用户眼睛的运动情况来估计眼睛的视线和/或注视点的技术。目前广泛应用的是光学记录法:用图像采集装置(如照相机或摄像机)记录被试者的眼睛运动情况,即获取反映眼睛运动的眼部图像,以及从获取到的眼部图像中提取眼部特征用于建立视线/注视点估计的模型。其中,眼部特征可以包括:瞳孔位置、瞳孔形状、虹膜位置、虹膜形状、眼皮位置、眼角位置和/或光斑(也称为普尔钦斑)位置等。In general, eye tracking can also be called gaze tracking, which is a technique for estimating the eye's sight and / or gaze point by measuring the movement of the user's eyes. Optical recording method is currently widely used: using an image acquisition device (such as a camera or video camera) to record the subject's eye movements, that is, to obtain an eye image that reflects the eye movement, and to extract an eye from the acquired eye image Features are used to build a gaze / gaze point estimation model. The eye features may include: pupil position, pupil shape, iris position, iris shape, eyelid position, eye corner position, and / or light spot (also referred to as Purchin spot) position.
可以理解的是,在获取用户的视线方向时,也可以结合接触/非接触式的传感器(例如电极、电容传感器)推算眼睛的运动。It can be understood that when obtaining the direction of the user's line of sight, a contact / non-contact sensor (such as an electrode or a capacitance sensor) can also be used to estimate the eye movement.
图1b给出了本发明实施例一提供的眼部图像示意图,图1b示出了一只眼睛,其并非对眼睛只数的限定,本实施例中所获取的眼部图像中也可以包括两只眼睛以提高对目标图像处理的准确度。具体的,本实施例可以基于获取的该眼部图像确定用户注视目标图像时的眼部特征(特征信息),从而确定用户的视线方向所对应的注视点。如图1b所示,本实施例可以通过分析眼部图像中的瞳孔11或虹膜12得到用户的视线方向。此外,当采用红外摄像头采集眼部图像时,也可以获取眼部图像中包含的光斑信息,以辅助得到用户的视线方向。FIG. 1b shows a schematic diagram of an eye image provided in Embodiment 1 of the present invention. FIG. 1b shows one eye, which is not a limitation on the number of eyes. The eye image obtained in this embodiment may also include two eyes. Eyes only to improve the accuracy of the target image processing. Specifically, in this embodiment, an eye feature (feature information) when the user looks at the target image may be determined based on the obtained eye image, so as to determine a gaze point corresponding to the user's line of sight direction. As shown in FIG. 1b, in this embodiment, the direction of the user's line of sight can be obtained by analyzing the pupil 11 or the iris 12 in the eye image. In addition, when an infrared camera is used to collect the eye image, the spot information contained in the eye image can also be acquired to assist in obtaining the direction of the user's line of sight.
S103、获取所述用户注视所述注视点的瞳孔半径。S103. Obtain a pupil radius at which the user looks at the gaze point.
在确定出用户的注视点后,本步骤可以获取此时用户注视注视点的瞳孔半径。After the user's fixation point is determined, in this step, the pupil radius at which the user fixes the fixation point can be obtained.
一般的,眼部的积极动作和消极动作都可以通过分析瞳孔半径得到,从而可以分析瞳孔半径以确定眼部的积极动作和消极动作,从而能够得到用户对当前关注物的关注度。当用户对观看的内容感兴趣时,则会产生积极的感受,相应的瞳孔就会扩张,瞳孔半径变大;当用户对观看的内容不感兴趣时,则会产生消极的感受,相应的瞳孔就会收缩,瞳孔半径变小。因此,本实施例可以通过对用户关注目标图像时的瞳孔半径确定对应的关注度,以得到对目标图像的不同处理方式。Generally, both the positive and negative movements of the eye can be obtained by analyzing the pupil radius, so that the pupil radius can be analyzed to determine the positive and negative movements of the eye, so that the user's degree of attention to the current object of interest can be obtained. When the user is interested in the content being watched, it will have a positive feeling, the corresponding pupil will be dilated, and the pupil radius will become larger; when the user is not interested in the content being watched, it will have a negative feeling, and the corresponding pupil will be Shrinks and pupil radius decreases. Therefore, in this embodiment, the corresponding degree of attention can be determined by the pupil radius when the user pays attention to the target image, so as to obtain different processing methods for the target image.
可以理解的是,本步骤可以分析已经获取的用于判断用户注视点的眼部图像确定用户注视该注视点的瞳孔半径;或在用户注视注视点时获取用户注视该注视点的瞳孔半径。具体的,本步骤可以提取眼部图像中瞳孔的边缘信息,然后基于瞳孔的边缘信息确定出对应的瞳孔半径。It can be understood that, in this step, an eye image obtained for judging a user's fixation point may be analyzed to determine a pupil radius at which the user fixes the fixation point; or when the user fixes the fixation point, the pupil radius of the fixation point is obtained. Specifically, in this step, edge information of a pupil in an eye image may be extracted, and then a corresponding pupil radius may be determined based on the edge information of the pupil.
S104、基于所述用户的注视点、所述瞳孔半径对应的关注度和所述目标图 像中各像素点的像素值将所述目标图像进行区域划分,并为各划分后的区域分配优先级,得到待传输数据。S104. Divide the target image into regions based on the user's gaze point, the degree of attention corresponding to the pupil radius, and the pixel value of each pixel in the target image, and assign priorities to the divided regions. Get the data to be transmitted.
在本实施例中,关注度可以理解为基于用户眼部特征(如瞳孔半径)确定的用户对观看物的兴趣度。待传输图像可以理解为基于划分后的目标图像得到的将要被传输的数据。In this embodiment, the degree of attention can be understood as the degree of interest of the user in the viewing object determined based on the characteristics of the user's eyes (such as the pupil radius). The image to be transmitted can be understood as data to be transmitted based on the divided target image.
在确定出用户注视目标图像的注视点的瞳孔半径后,本步骤可以基于瞳孔半径及预设瞳孔数据表确定出对应的关注度。然后结合用户的注视点和目标图像各像素点的像素值将目标图像进行区域划分。After determining the pupil radius of the gaze point at which the user looks at the target image, this step may determine the corresponding degree of attention based on the pupil radius and a preset pupil data table. Then combine the user's gaze point with the pixel value of each pixel of the target image to divide the target image into regions.
需要说明的是,本实施例中也可以结合眼部图像中的其余眼部特征进行关注度的确定。如获取用户注视注视点的瞳孔特征(如瞳孔半径和瞳孔位置)和光斑特征,根据瞳孔位置和光斑特征确定注视信息(注视信息包括注视时长、注视次数和/或首次注视时间),然后根据注视信息和瞳孔半径进行关注度的确定。It should be noted that in this embodiment, the degree of attention may also be determined in combination with the remaining eye features in the eye image. For example, to obtain the pupil features (such as pupil radius and pupil position) and spot features of the gaze point of the user, determine the fixation information based on the pupil position and spot features (fixation information includes the duration of fixation, the number of fixations, and / or the first fixation time), and The information and pupil radius determine the degree of attention.
需要说明的是,该预设瞳孔数据表可以是经过训练得到的通用的数据表;也可以是针对不同用户训练得到的专用数据表(不同用户对应的数据表不同)。如果是专用数据表,则在获取用户注视目标图像时的眼部图像后,还可以对该眼部图像中的虹膜特征进行提取,以对该用户进行身份识别,从而在确定出用户注视注视点的瞳孔半径后能够确定出对应的关注度。It should be noted that the preset pupil data table may be a general-purpose data table obtained through training; it may also be a dedicated data table obtained by training for different users (the data tables corresponding to different users are different). If it is a special data table, after obtaining the eye image of the user when gazing at the target image, the iris features in the eye image can also be extracted to identify the user, thereby determining the user's gaze point The corresponding degree of attention can be determined after the pupil radius of.
此外,在基于注视点、瞳孔半径对应的关注度和目标图像各像素点的像素值对目标图像进行区域划分,并为划分后的区域分配优先级时,本步骤可以首先基于用户目标图像各像素点的像素值对目标图像进行边缘分割,然后选取出各封闭的区域及包含注视点的区域。最后通过用户对注视点的关注度为划分后 的区域分配优先级。这样能够有效的基于用户的注视点及关注度对目标图像进行重新分割并分配优先级。In addition, when the target image is divided into regions based on the attention point, the degree of attention corresponding to the pupil radius, and the pixel value of each pixel of the target image, and priority is assigned to the divided regions, this step may first be based on each pixel of the user's target image The pixel values of the points perform edge segmentation on the target image, and then select each closed area and the area containing the gaze point. Finally, the divided areas are assigned priorities based on the user's attention to the gaze point. This can effectively re-segment the target image based on the user's gaze point and attention and assign priorities.
当用户注视的注视点不是用户想要关注的内容的时候(此时用户关注注视点的关注度较低),则可以降低包含关注点区域处的优先级,提高其余区域处的优先级。其中,关注度可以与划分后区域的优先级存在一一对应关系,基于确定出的关注度,能够为划分后的区域分配不同的优先级。不同的关注度对应不同的优先级。When the gaze point of the user's gaze is not the content the user wants to pay attention to (at this time, the user's attention to the gaze point is low), the priority of the area containing the point of interest can be lowered, and the priority of the remaining areas can be increased. Among them, the attention degree may have a one-to-one correspondence relationship with the priority of the divided regions. Based on the determined attention degree, different priorities can be assigned to the divided regions. Different attention levels correspond to different priorities.
需要说明的是,本步骤为划分后的区域分配优先级时,基于应用场景的不同,不同的优先级可以对应不同的处理方式。如,优先级较高的区域分配较高的码率,优先级较低的区域分配较低的码率;优先级较高的区域使用较高分辨率,优先级较低的区域使用较低分辨率或优先级较高的区域采用较大的压缩比,优先级较低的地方采用较小的压缩比。需要注意的是,此处并不对不同优先级的区域的处理方式进行限定。针对不同的关注度可以为不同的区域分配不同的优先级,针对不同的优先级可以采用不同的处理手段。It should be noted that when assigning priorities to the divided regions in this step, different priorities may correspond to different processing methods based on different application scenarios. For example, higher priority areas are assigned higher code rates, lower priority areas are assigned lower code rates; higher priority areas are assigned higher resolutions, and lower priority areas are assigned lower resolutions. Areas with higher rates or priorities use larger compression ratios, and areas with lower priorities use smaller compression ratios. It should be noted that the processing methods of regions with different priorities are not limited here. Different priorities can be assigned to different areas for different degrees of attention, and different processing methods can be used for different priorities.
此外,如果确定出的关注度低于一定阈值,本实施例也可以继续获取用户注视目标图像的注视点及对应的瞳孔半径,直到获取的关注度大于一定阈值。然后对目标图像进行划分,并为划分后的区域分配优先级。从而能够有效的保证用户关注的关注点是用户比较感兴趣的In addition, if the determined degree of attention is lower than a certain threshold, this embodiment may also continue to obtain the fixation point and the corresponding pupil radius of the user gazing at the target image until the acquired degree of attention is greater than a certain threshold. Then divide the target image and assign priorities to the divided regions. This can effectively ensure that the user's attention is more interesting to the user
在将目标图像进行划分,并为划分后的区域分配优先级后,本步骤可以对分配完优先级的各区域进行不同的处理得到待传输数据,该待传输数据中可以存储有各区域中各像素点的位置信息和对应的像素值,其存储形式并不作限定,可以按照位置信息进行存储,也可以按照区域进行存储。After dividing the target image and assigning priorities to the divided regions, this step can perform different processing on each region to which the priority is assigned to obtain data to be transmitted, and the to-be-transmitted data can store data in each region. The location information of the pixel point and the corresponding pixel value are not limited in the storage form, and may be stored in accordance with the location information or in areas.
S105、传输所述待传输数据。S105. Transmit the data to be transmitted.
在本实施例中,当得到待传输数据后,本步骤可以将该待传输数据传输至显示设备,以使显示设备将待传输数据对应的图像进行显示。显示设备则可以通过逐个接收待传输数据,然后在显示设备上显示相应的内容。In this embodiment, after the data to be transmitted is obtained, this step may transmit the data to be transmitted to the display device, so that the display device displays an image corresponding to the data to be transmitted. The display device can receive the data to be transmitted one by one, and then display the corresponding content on the display device.
本发明实施例一提供的一种图像传输方法,首先获取用户注视目标图像时的眼部图像;其次基于所述眼部图像对应的视线确定所述用户的注视点;然后获取所述用户注视所述注视点的瞳孔半径;之后基于所述用户的注视点、所述瞳孔半径对应的关注度和所述目标图像中各像素点的像素值将所述目标图像进行区域划分,并为各划分后的区域分配优先级,得到待传输数据;最终传输所述待传输数据。利用上述方法,能够在终端设备传输目标图像之前,通过用户注视目标图像时的眼部图像对应的视线所确定的注视点、用户注视该注视点时的瞳孔半径对应的关注度及目标图像中各像素点的像素值对目标图像进行划分,然后以不同的优先级进行处理并传输。相比于直接传输目标图像而言,有效的减少了传输的数据量,从而减少传输目标图像所需时间,增大了图像传输的效率;相比于将目标图像整体进行压缩而言,有效的提高了用户关注区域图像的清晰度,提高了目标图像压缩效果。An image transmission method provided in Embodiment 1 of the present invention is to first obtain an eye image when a user fixes a target image; second, to determine the user's fixation point based on the line of sight corresponding to the eye image; and then obtain the user fixation location The pupil radius of the fixation point is described; the target image is then divided into regions based on the user's fixation point, the degree of attention corresponding to the pupil radius, and the pixel value of each pixel in the target image, and after each division, Assign priority to the area to obtain data to be transmitted; and finally transmit the data to be transmitted. By using the above method, before the terminal device transmits the target image, the fixation point determined by the line of sight corresponding to the eye image when the user fixes the target image, the degree of attention corresponding to the pupil radius when the user fixes the fixation point, and each of the target image The pixel value of the pixel points divides the target image, and then processes and transmits it with different priorities. Compared with directly transmitting the target image, it effectively reduces the amount of data to be transmitted, thereby reducing the time required to transmit the target image, and increasing the efficiency of image transmission; compared to compressing the target image as a whole, it is effective The sharpness of the image of the user's area of interest is improved, and the compression effect of the target image is improved.
实施例二Example two
图2a为本发明实施例二提供的一种图像传输方法的流程示意图,本实施例二在上述各实施例的基础上进行优化。在本实施例中,将基于所述眼部图像对应的视线确定所述用户的注视点,进一步具体化为:提取所述眼部图像中的特征信息,所述特征信息包括瞳孔特征;根据所述特征信息确定所述眼部图像对 应的视线;根据确定出的视线确定所述用户在所述目标图像中的注视点。FIG. 2a is a schematic flowchart of an image transmission method according to the second embodiment of the present invention. This second embodiment is optimized based on the foregoing embodiments. In this embodiment, the fixation point of the user is determined based on the line of sight corresponding to the eye image, which is further embodied as: extracting feature information in the eye image, where the feature information includes pupil features; The feature information determines a line of sight corresponding to the eye image; and determines a gaze point of the user in the target image according to the determined line of sight.
可选地,本实施例还将获取所述用户注视所述注视点的瞳孔半径,进一步优化为:识别所述用户注视所述注视点的眼部图像,确定所述用户注视所述注视点的瞳孔半径。Optionally, in this embodiment, a pupil radius at which the user fixes the fixation point is further obtained, and is further optimized as: identifying an eye image of the user fixation at the fixation point, and determining the user's fixation on the fixation point. Pupil radius.
在上述优化的基础上,将基于所述用户的注视点、所述瞳孔半径对应的关注度和所述目标图像中各像素点的像素值将所述目标图像进行区域划分,并为各划分后的区域分配优先级,得到待传输数据,具体优化为:将所述瞳孔半径与预设瞳孔数据表进行比对,确定对应的关注度;利用边缘算子对所述目标图像进行边缘检测,确定出所述目标图像中各封闭区域;从确定出的各封闭区域中选取包含所述用户的注视点的第一区域,并将所述目标图像中除所述第一区域外的区域作为第二区域;根据所述关注度和预设的关注度对照表,为所述第一区域和所述第二区域分配对应的优先级,得到待传输数据。On the basis of the above optimization, the target image will be divided into regions based on the user's fixation point, the degree of attention corresponding to the pupil radius, and the pixel value of each pixel in the target image. Priority is assigned to the area to obtain data to be transmitted, and the optimization is specifically: comparing the pupil radius with a preset pupil data table to determine the corresponding degree of attention; using an edge operator to perform edge detection on the target image to determine Extract each closed area in the target image; select a first area containing the user's gaze point from the determined closed areas, and use the area in the target image other than the first area as the second Area; according to the attention degree and a preset attention degree comparison table, assigning corresponding priorities to the first area and the second area to obtain data to be transmitted.
可选地,本实施例还将传输所述待传输数据,具体优化为:按照预设顺序传输所述待传输数据中各像素点的位置信息和对应的像素值。Optionally, this embodiment further transmits the data to be transmitted, which is specifically optimized to transmit position information and corresponding pixel values of each pixel in the data to be transmitted in a preset order.
可选地,本实施例在所述获取用户注视目标图像时的眼部图像之前,还优化包括了:当监测到启动指令时,显示目标图像。本实施例尚未详尽的内容请参考实施例一。Optionally, before acquiring the eye image when the user looks at the target image, this embodiment further includes: when the startup instruction is detected, displaying the target image. For details that are not yet detailed in this embodiment, please refer to the first embodiment.
如图2a所示,本发明实施例二提供的一种图像传输方法,包括如下步骤:As shown in FIG. 2a, an image transmission method provided in Embodiment 2 of the present invention includes the following steps:
S201、当监测到启动指令时,显示目标图像。S201. When a start instruction is detected, a target image is displayed.
在本实施例中,启动指令可以理解为启动终端设备进行图像处理的指令。In this embodiment, the startup instruction may be understood as an instruction to start the terminal device to perform image processing.
一般的,在通过本实施例所提高的图像传输方法对目标图像进行处理时,本步骤可以首先监听是否存在启动指令。如果监听到启动指令,则可以在终端 设备上显示目标图像,以能够根据用户眼部图像对该目标图像进行处理。从而得到对应的待传输数据。Generally, when the target image is processed by the image transmission method improved in this embodiment, this step may first monitor whether there is a startup instruction. If the startup instruction is monitored, the target image can be displayed on the terminal device, so that the target image can be processed according to the user's eye image. Thereby, the corresponding data to be transmitted is obtained.
可以理解的是,当终端设备与待传输数据的接收方(如显示设备)传输视频时,本步骤中的启动指令可以仅用于启动显示视频中的首帧图像,视频中的后续图像帧可以无需监听启动指令即可处理得到待传输数据。其中,在实际应用中,启动指令可以根据实际情况进行设定,可以是终端设备上设定按键控制产生的,也可以是通过采集用户眼部图像特定动作产生,此处对此不作限定。It can be understood that when the terminal device and the receiver of the data to be transmitted (such as a display device) transmit video, the startup instruction in this step may only be used to start displaying the first frame image in the video, and subsequent image frames in the video may be No need to listen to the start instruction to process the data to be transmitted. Among them, in actual applications, the startup instruction may be set according to actual conditions, may be generated by setting key control on the terminal device, or may be generated by collecting specific actions of the user's eye image, which is not limited here.
S202、获取用户注视目标图像时的眼部图像。S202. Obtain an eye image when the user looks at the target image.
在接收到启动指令显示目标图像后,本步骤可以对用户注视目标图像时的眼部图像进行分析,得到对应的待传输数据。在对下一帧图像进行分析时,则可以无需监测启动指令。After receiving the start instruction to display the target image, this step can analyze the eye image when the user looks at the target image to obtain the corresponding data to be transmitted. When analyzing the next frame of image, there is no need to monitor the start instruction.
S203、提取所述眼部图像中的特征信息,所述特征信息包括瞳孔特征。S203. Extract feature information in the eye image, where the feature information includes pupil characteristics.
在获取到用户注视目标图像的眼部图像后,本步骤可以对该眼部图像进行识别,提取该眼部图像中的特征信息,以基于确定出的特征信息确定用户注视目标图像的视线。其中,特征信息可以包括瞳孔特征,如瞳孔边缘特征、瞳孔半径和/或瞳孔中心位置。After obtaining the eye image of the user looking at the target image, in this step, the eye image can be identified, and feature information in the eye image is extracted to determine the line of sight of the user looking at the target image based on the determined feature information. The feature information may include pupil features, such as pupil edge features, pupil radius, and / or pupil center position.
当本实施例采用红外摄像头获取用户注视目标图像时的眼部图像时,则特征信息还可以包括光斑信息。When an infrared camera is used in this embodiment to obtain an eye image when a user looks at a target image, the feature information may further include light spot information.
S204、根据所述特征信息确定所述眼部图像对应的视线。S204. Determine a line of sight corresponding to the eye image according to the feature information.
具体的,当本实施例采用红外摄像头获取用户注视目标图像时的眼部图像时,本步骤可以基于特征信息中的瞳孔特征和光斑信息利用角膜反射法确定对应的视线。当本实施例采用普通摄像头获取用户注视目标图像时的眼部图像时, 本步骤可以通过特征信息及预先构建的特征与视线的对照表确定出用户的视线。此外,本步骤在确定视线时,还可以辅助接触/非接触式的传感器(例如电极、电容传感器)。Specifically, when an infrared camera is used in this embodiment to obtain an eye image when a user fixes a target image, this step may determine a corresponding line of sight by using a corneal reflection method based on the pupil feature and the spot information in the feature information. When an ordinary camera is used in this embodiment to obtain an eye image of a user when gazing at a target image, in this step, the user's line of sight can be determined through feature information and a pre-built comparison table of features and line of sight. In addition, in this step, when determining the line of sight, a contact / non-contact sensor (such as an electrode or a capacitance sensor) can be assisted.
S205、根据确定出的视线确定所述用户在所述目标图像中的注视点。S205. Determine a gaze point of the user in the target image according to the determined line of sight.
在确定出用户注视目标图像时眼部图像对应的视线后,本步骤可以基于该视线确定出该视线方向在目标图像中的注视点坐标,然后根据注视点坐标确定用户在目标图像中的注视点。其中,视线可以理解为是一个三维矢量,注视点可以理解为上述三维矢量投影在某个平面上的二维坐标。After determining the line of sight corresponding to the eye image when the user looks at the target image, this step can determine the gaze point coordinates of the line of sight in the target image based on the line of sight, and then determine the user's gaze point in the target image based on the gaze point coordinates. . The line of sight can be understood as a three-dimensional vector, and the fixation point can be understood as the two-dimensional coordinates of the above-mentioned three-dimensional vector projected on a certain plane.
S206、识别所述用户注视所述注视点的眼部图像,确定所述用户注视所述注视点的瞳孔半径。S206. Identify an eye image of the user gazing at the gaze point, and determine a pupil radius of the user gazing at the gaze point.
在本实施例中,确定出用户注视目标图像时的眼部图像和注视点后,本步骤可以进一步识别用户注视该注视点的眼部图像,确定用户在注视该注视点时的瞳孔半径,以决定对目标图像的处理方式。In this embodiment, after the eye image and fixation point when the user fixes the target image are determined, this step may further identify the eye image of the fixation point when the user fixes the fixation point, and determine the pupil radius of the fixation point when the user fixes the fixation point. Decide how to process the target image.
具体的,本步骤可以对该眼部图像进行处理,得到眼部图像在指定方向上灰度的梯度值,然后将该灰度的梯度值达到最大值时所处位置确定为瞳孔边缘位置。在确定出瞳孔边缘位置后对其进行拟合,然后确定出拟合图形的半径,以得到对应的瞳孔半径。Specifically, in this step, the eye image may be processed to obtain a grayscale gradient value of the eye image in a specified direction, and then the position where the grayscale gradient value reaches the maximum value is determined as the pupil edge position. After determining the pupil edge position, it is fitted, and then the radius of the fitted figure is determined to obtain the corresponding pupil radius.
S207、将所述瞳孔半径与预设瞳孔数据表进行比对,确定对应的关注度。S207: Compare the pupil radius with a preset pupil data table to determine a corresponding degree of attention.
在本实施例中,预设瞳孔数据表可以理解为预先训练得到的瞳孔半径及关注度的对照表。在确定出用户注视注视点的瞳孔半径后,本步骤可以查找预设瞳孔数据表得到该瞳孔半径所对应的关注度。In this embodiment, the preset pupil data table can be understood as a comparison table of pupil radius and attention degree obtained in advance. After determining the pupil radius of the user's gaze point, in this step, a preset pupil data table can be searched to obtain the degree of attention corresponding to the pupil radius.
S208、利用边缘算子对所述目标图像进行边缘检测,确定出所述目标图像 中各封闭区域。S208. Use an edge operator to perform edge detection on the target image to determine each closed area in the target image.
在本实施例中,边缘算子可以理解为基于目标图像各像素点的像素值对目标图像进行边缘检测的算子。如,高斯拉普拉斯(Laplacian-of-Gaussian,LoG)算子、Roberts算子或Prewitt算子等。In this embodiment, the edge operator can be understood as an operator that performs edge detection on the target image based on the pixel value of each pixel point of the target image. For example, a Laplacian-of-Gaussian (LoG) operator, a Roberts operator, or a Prewitt operator.
本步骤在采用边缘算子对目标图像进行边缘检测时,可以确定出目标图像中各边缘信息。在确定出各边缘信息后,本步骤可以选取各边缘连续的区域组成封闭区域,将目标图像中除各封闭区域外的区域形成一个封闭区域。In this step, when edge detection is performed on the target image by using an edge operator, each edge information in the target image can be determined. After determining the edge information, in this step, a region with continuous edges can be selected to form a closed region, and the area except the closed regions in the target image can be used to form a closed region.
S209、从确定出的各封闭区域中选取包含所述用户的注视点的第一区域,并将所述目标图像中除所述第一区域外的区域作为第二区域。S209. Select a first region including the user's gaze point from the determined closed regions, and use a region other than the first region in the target image as the second region.
在确定出目标图像中包含的各封闭区域后,本步骤可以基于注视点的坐标从各封闭区域中选取包含注视点的区域作为第一区域,然后将目标图像中除第一区域外的区域作为第二区域,从而将目标图像划分为了包含注视点的区域和不包含注视点的区域。After determining each closed region included in the target image, in this step, a region containing the fixation point can be selected as the first region from each closed region based on the coordinates of the fixation point, and then the area other than the first region in the target image is used as the first region. The second region, thereby dividing the target image into a region containing a fixation point and a region not containing a fixation point.
S210、根据所述关注度和预设的关注度对照表,为所述第一区域和所述第二区域分配对应的优先级,得到待传输数据。S210. Assign corresponding priorities to the first area and the second area according to the attention degree and a preset attention degree comparison table to obtain data to be transmitted.
在实施例中,关注度对照表可以理解为预先设定的关注度和优先级的对应关系,该设定关系可以通过训练得到。In the embodiment, the attention degree comparison table may be understood as a preset correspondence relationship between the attention degree and the priority, and the set relationship may be obtained through training.
在确定出第一区域、第二区域和关注度后,本步骤可以查找预设的关注度对照表确定对应的优先级,然后为第一区域和第二区域分配对应的优先级,以得到待传输数据。After determining the first area, the second area, and the attention degree, this step may look up a preset attention degree comparison table to determine the corresponding priorities, and then assign corresponding priorities to the first area and the second area to obtain transfer data.
S211、按照预设顺序传输所述待传输数据中各像素点的位置信息和对应的像素值。S211. Transmit the position information of each pixel in the data to be transmitted and the corresponding pixel value in the preset order.
在本实施例中,预设顺序可以理解为预先设定的待传输数据的传输顺序。位置信息可以理解为坐标信息。In this embodiment, the preset order may be understood as a preset transmission order of data to be transmitted. The position information can be understood as coordinate information.
在得到待传输数据后,本步骤可以按照预设顺序将待传输数据中各像素点的位置信息和对应的像素值进行传输,以使显示设备基于接收到的位置信息和对应的像素值进行图像显示。可以理解的是,设定顺序的设定可以根据实际应用进行设定,可以基于各像素点的位置确定,也可以划分后的区域确定。After obtaining the data to be transmitted, this step may transmit the position information of each pixel in the data to be transmitted and the corresponding pixel value in a preset order, so that the display device performs an image based on the received position information and the corresponding pixel value. display. It can be understood that the setting order can be set according to the actual application, can be determined based on the position of each pixel, or can be determined after the divided area.
图2b给出了本发明实施例二提供的图像传输方法的应用场景示意图。如图2b所示,用户211佩戴着AR设备212观看终端设备213中的视频或图像。其中,终端设备213可以为计算机,此处并不对此进行限定,终端设备213也可以为手机或掌上电脑等设备。FIG. 2b is a schematic diagram of an application scenario of an image transmission method provided in Embodiment 2 of the present invention. As shown in FIG. 2b, the user 211 wears the AR device 212 to watch videos or images in the terminal device 213. The terminal device 213 may be a computer, which is not limited herein. The terminal device 213 may also be a device such as a mobile phone or a palmtop computer.
基于该应用场景图像传输的过程可以为:当终端设备213获取到启动指令后,可以将目标图像显示在终端设备213的显示屏上,然后终端设备213可以通过与AR设备212建立的通信连接获取AR设备212中图像采集装置采集的用户21注视目标图像时的眼部图像,然后提取该眼部图像中的特征信息,以确定用户211的视线,基于确定出的视线确定用户211在所述目标图像的注视点。之后确定用户211注视该注视点的瞳孔半径,查找预设瞳孔数据表中对应于该瞳孔半径的关注度,利用该关注度为目标图像划分后的第一区域和第二区域分配优先级,以得到待传输数据,最后以设定顺序传输待传输数据中各像素点的位置信息和对应的像素值。其中,在对目标图像进行划分时,可以利用边缘算子对目标图像进行边缘检测,确定出各封闭区域,然后从各封闭区域中选取包括注视点的第一区域和第二区域。The process of image transmission based on this application scenario may be: after the terminal device 213 obtains the startup instruction, the target image may be displayed on the display screen of the terminal device 213, and then the terminal device 213 may obtain the communication connection established with the AR device 212 The eye image collected by the image acquisition device of the AR device 212 when the user 21 fixes the target image, and then extracts feature information in the eye image to determine the sight of the user 211, and determines the user 211 at the target based on the determined sight. The gaze point of the image. Then determine the pupil radius at which the user 211 looks at the fixation point, look up the degree of attention corresponding to the pupil radius in the preset pupil data table, and use the degree of attention to assign priorities to the first region and the second region after the target image is divided. Get the data to be transmitted, and finally transmit the position information and corresponding pixel values of each pixel in the data to be transmitted in a set order. When the target image is divided, an edge operator may be used to perform edge detection on the target image to determine each closed area, and then select a first area and a second area including the gaze point from each closed area.
可以理解的是,当用户211不佩戴AR设备212时,终端设备213可以通 过设置在终端设备213上的图像采集装置获取用户211注视目标图像时的眼部图像。如果用户211佩戴的是VR设备,终端设备213可以内置于VR设备中,通过与VR设备建立的通信连接获取VR设备中图像采集装置采集的用户211注视目标图像的眼部图像。It can be understood that when the user 211 does not wear the AR device 212, the terminal device 213 can acquire the eye image when the user 211 fixes the target image through the image acquisition device provided on the terminal device 213. If the user 211 is wearing a VR device, the terminal device 213 may be built into the VR device, and obtain the eye image of the user 211 gazing at the target image collected by the image acquisition device in the VR device through a communication connection established with the VR device.
图2c给出了本发明实施例二提供的对划分完的目标图像分配优先级后的示意图。如图2c所示,假设此时的目标图像2中显示了一部手机221和一台主机222,并且用户211注视点位于手机221上,则目标图像2中手机221的边缘所形成的封闭区域(如手机221外轮廓组成的区域)可以为第一区域,目标图像2中除了第一区域外的区域可以为第二区域。其中第一区域和第二区域的优先级可以通过分析用户211注视注视点的瞳孔半径对应的关注度确定。如果用户211关注手机221时的关注度较高,则可以为第一区域设置较高优先级,相应的可以提高第一区域渲染精度或码率,降低第二区域的渲染精度或码率,从而第一区域中手机221则可以清晰显示,第二区域中主机222则可以模糊显示。FIG. 2c is a schematic diagram after assigning priorities to the divided target images according to the second embodiment of the present invention. As shown in FIG. 2c, assuming that a target phone 2 displays a mobile phone 221 and a host 222, and the user 211 gaze point is located on the mobile phone 221, the closed area formed by the edge of the mobile phone 221 in the target image 2 (For example, the area formed by the outer contour of the mobile phone 221) may be a first area, and the area other than the first area in the target image 2 may be a second area. The priority of the first region and the second region can be determined by analyzing the degree of attention corresponding to the pupil radius of the user 211's gaze point. If the user 211 pays more attention to the mobile phone 221, a higher priority can be set for the first region, and accordingly, the rendering accuracy or bit rate of the first region can be improved, and the rendering accuracy or bit rate of the second region can be reduced, thereby The mobile phone 221 in the first area can be displayed clearly, and the host 222 in the second area can be displayed blurry.
本发明实施例二提供的一种图像传输方法,具体化了注视点确定操作、瞳孔半径确定操作、待传输数据确定操作和传输待传输数据操作。此外,还优化增加了显示目标图像操作。利用该方法,能够在监测到启动指令后,将目标图像进行显示,以根据用户注视目标图像时的眼部图像中的特征信息确定对应的视线,然后根据该视线确定用户的注视点及注视该注视点时瞳孔半径对应的关注度。通过边缘算子确定目标图像中各封闭区域,然后选取出包含注视点的第一区域和目标图像中除第一区域外的第二区域,再基于关注度和关注度对照表为第一区域和第二区域分配优先级得到待传输数据,最后按照预设顺序将待传输数据中各像素点的位置信息和对应的像素值进行传输,在降低传输数量的基 础上,提高了对目标图像的压缩效率,使得用户关注度高的地方清晰显示,关注度低的地方模糊显示,并且能够有效的根据用户注视注视点处的关注度为第一区域和第二区域分配适合的优先级。An image transmission method provided in Embodiment 2 of the present invention embodies a gaze point determination operation, a pupil radius determination operation, a data to be transmitted determination operation, and a data to be transmitted transmission operation. In addition, the display target image operation has been optimized. With this method, the target image can be displayed after the startup instruction is monitored, so as to determine the corresponding line of sight based on the feature information in the eye image when the user looks at the target image, and then determine the user's gaze point and look at the The degree of attention corresponding to the pupil radius when gazing at the point. Each closed area in the target image is determined by the edge operator, and then the first area containing the fixation point and the second area except the first area in the target image are selected, and the first area and The second area is assigned a priority to obtain the data to be transmitted. Finally, the position information of each pixel in the data to be transmitted and the corresponding pixel value are transmitted in a preset order. On the basis of reducing the number of transmissions, the target image is compressed. The efficiency enables the places with high user attention to be displayed clearly and the places with low attention to be displayed ambiguously, and can effectively assign appropriate priorities to the first and second regions according to the user's degree of attention at the gaze point.
实施例三Example three
图3为本发明实施例三提供的一种图像传输装置的结构示意图,该装置可适用于不同图像传输设备间(如终端设备和显示设备间)对目标图像进行传输的情况。其中该装置可由软件和/或硬件实现,并一般集成在终端设备上。FIG. 3 is a schematic structural diagram of an image transmission apparatus according to Embodiment 3 of the present invention. The apparatus is applicable to a case where a target image is transmitted between different image transmission devices (such as a terminal device and a display device). The device may be implemented by software and / or hardware, and is generally integrated on a terminal device.
如图3所示,该图像传输装置包括:眼部图像获取模块31、注视点确定模块32、瞳孔半径确定模块33、待传数据确定模块34和传输模块35。As shown in FIG. 3, the image transmission device includes an eye image acquisition module 31, a fixation point determination module 32, a pupil radius determination module 33, a data to be transmitted determination module 34, and a transmission module 35.
其中,眼部图像获取模块31,设置为获取用户注视目标图像时的眼部图像;The eye image acquisition module 31 is configured to acquire an eye image when a user looks at a target image;
注视点确定模块32,设置为基于所述眼部图像对应的视线确定用户的注视点;A fixation point determining module 32 configured to determine a fixation point of a user based on a line of sight corresponding to the eye image;
瞳孔半径确定模块33,设置为获取所述用户注视所述注视点的瞳孔半径;The pupil radius determination module 33 is configured to obtain a pupil radius at which the user looks at the gaze point;
待传数据确定模块34,设置为基于所述用户的注视点、所述瞳孔半径对应的关注度和所述目标图像中各像素点的像素值将所述目标图像进行区域划分,并为各划分后的区域分配优先级,得到待传输数据;The to-be-transmitted data determining module 34 is configured to divide the target image into regions based on the user's fixation point, the degree of attention corresponding to the pupil radius, and the pixel value of each pixel in the target image, and divide the target image into segments. Subsequent areas are assigned priorities to get data to be transmitted;
传输模块35,设置为传输所述待传输数据。The transmission module 35 is configured to transmit the data to be transmitted.
在本实施例中,该图像传输装置首先通过眼部图像获取模块31获取用户注视目标图像时的眼部图像;其次通过注视点确定模块32基于所述眼部图像对应的视线确定用户的注视点;然后通过瞳孔半径确定模块33获取所述用户注视所述注视点的瞳孔半径;之后通过待传数据确定模块34基于所述用户的注视点、 所述瞳孔半径对应的关注度和所述目标图像中各像素点的像素值将所述目标图像进行区域划分,并为各划分后的区域分配优先级,得到待传输数据;最后通过传输模块35传输所述待传输数据。In this embodiment, the image transmission device first obtains the eye image when the user fixes the target image through the eye image acquisition module 31; secondly, the gaze point determination module 32 determines the user's gaze point based on the line of sight corresponding to the eye image ; Then obtain the pupil radius of the gaze point of the user through the pupil radius determination module 33; and then, through the pending data determination module 34, based on the user's fixation point, the degree of attention corresponding to the pupil radius, and the target image The pixel value of each pixel in the target area is used to divide the target image, and priority is assigned to each divided area to obtain data to be transmitted; finally, the data to be transmitted is transmitted through the transmission module 35.
本实施例提供的一种图像传输装置,能够在终端设备传输目标图像之前,通过用户注视目标图像时的眼部图像对应的视线所确定的注视点、用户注视该注视点时的瞳孔半径对应的关注度及目标图像中各像素点的像素值对目标图像进行划分,然后以不同的优先级进行处理传输。相比于直接传输目标图像而言,有效的减少了传输的数据量,从而减少传输目标图像所需时间,增大了图像传输的效率;相比于将目标图像整体进行压缩而言,有效的提高了用户关注区域图像的清晰度,提高了目标图像压缩效果。An image transmission device provided in this embodiment can, before a terminal device transmits a target image, a fixation point determined by a line of sight corresponding to an eye image when a user fixes on the target image, and a pupil radius corresponding to the fixation point when the user fixes the fixation point. The degree of attention and the pixel value of each pixel in the target image divide the target image, and then process and transmit it with different priorities. Compared with directly transmitting the target image, it effectively reduces the amount of data to be transmitted, thereby reducing the time required to transmit the target image, and increasing the efficiency of image transmission; compared to compressing the target image as a whole, it is effective The sharpness of the image of the user's area of interest is improved, and the compression effect of the target image is improved.
可选地,注视点确定模块32,具体设置为:提取所述眼部图像中的特征信息,所述特征信息包括瞳孔特征;根据所述特征信息确定所述眼部图像对应的视线;根据确定出的视线确定所述用户在所述目标图像中的注视点。Optionally, the fixation point determining module 32 is specifically configured to: extract feature information in the eye image, the feature information including pupil characteristics; determine a line of sight corresponding to the eye image according to the feature information; and The out of sight determines the gaze point of the user in the target image.
在上述优化的基础上,瞳孔半径确定模块33,具体设置为:识别所述用户注视所述注视点的眼部图像,确定所述用户注视所述注视点的瞳孔半径。Based on the above optimization, the pupil radius determination module 33 is specifically configured to identify an eye image of the user looking at the fixation point, and determine a pupil radius of the user looking at the fixation point.
基于上述技术方案,待传数据确定模块34,具体设置为:将所述瞳孔半径与预设瞳孔数据表进行比对,确定对应的关注度;利用边缘算子对所述目标图像进行边缘检测,确定出所述目标图像中各封闭区域;从确定出的各封闭区域中选取包含所述用户的注视点的第一区域,并将所述目标图像中除所述第一区域外的区域作为第二区域;根据所述关注度和预设的关注度对照表,为所述第一区域和所述第二区域分配对应的优先级,得到待传输数据。Based on the above technical solution, the data to be transmitted determination module 34 is specifically configured to: compare the pupil radius with a preset pupil data table to determine a corresponding degree of attention; use an edge operator to perform edge detection on the target image, Determine each closed area in the target image; select a first area containing the user's gaze point from the determined closed areas, and use the area other than the first area in the target image as a first Two regions; assigning corresponding priorities to the first region and the second region according to the attention degree and a preset attention degree comparison table to obtain data to be transmitted.
可选地,传输模块35,具体设置为:按照预设顺序传输所述待传输数据中 各像素点的位置信息和对应的像素值。Optionally, the transmission module 35 is specifically configured to transmit position information and corresponding pixel values of each pixel in the data to be transmitted in a preset order.
可选地,该图像传输装置,还优化包括了:目标图像显示模块36,设置为当监测到启动指令时,显示目标图像。Optionally, the image transmission device further includes a target image display module 36 configured to display a target image when a startup instruction is detected.
上述图像传输装置可执行本发明任意实施例所提供的图像传输方法,具备执行方法相应的功能模块和有益效果。The above-mentioned image transmission device can execute the image transmission method provided by any embodiment of the present invention, and has corresponding function modules and beneficial effects of executing the method.
实施例四Embodiment 4
图4为本发明实施例四提供的一种终端设备的结构示意图。如图4所示,本发明实施例四提供的终端设备包括:一个或多个处理器41和存储装置42;该终端设备中的处理器41可以是一个或多个,图4中以一个处理器41为例;存储装置42设置为存储一个或多个程序;所述一个或多个程序被所述一个或多个处理器41执行,使得所述一个或多个处理器41实现如本发明实施例中任一项所述的图像传输方法。FIG. 4 is a schematic structural diagram of a terminal device according to a fourth embodiment of the present invention. As shown in FIG. 4, the terminal device provided in the fourth embodiment of the present invention includes: one or more processors 41 and a storage device 42; the processor 41 in the terminal device may be one or more, and FIG. The storage device 42 is taken as an example; the storage device 42 is configured to store one or more programs; the one or more programs are executed by the one or more processors 41, so that the one or more processors 41 are implemented as the present invention The image transmission method according to any one of the embodiments.
所述终端设备还可以包括:输入装置43和输出装置44。The terminal device may further include an input device 43 and an output device 44.
终端设备中的处理器41、存储装置42、输入装置43和输出装置44可以通过总线或其他方式连接,图4中以通过总线连接为例。The processor 41, the storage device 42, the input device 43, and the output device 44 in the terminal device may be connected through a bus or other manners. In FIG. 4, the connection through the bus is taken as an example.
该终端设备中的存储装置42作为一种计算机可读存储介质,可设置为存储一个或多个程序,所述程序可以是软件程序、计算机可执行程序以及模块,如本发明实施例一或二所提供图像传输方法对应的程序指令/模块(例如,附图3所示的图像传输装置中的模块,包括:眼部图像获取模块31、注视点确定模块32、瞳孔半径确定模块33、待传数据确定模块34和传输模块35,还包括目标图像显示模块36)。处理器41通过运行存储在存储装置42中的软件程序、指 令以及模块,从而执行终端设备的各种功能应用以及数据处理,即实现上述方法实施例中图像传输方法。As a computer-readable storage medium, the storage device 42 in the terminal device may be configured to store one or more programs. The programs may be software programs, computer-executable programs, and modules, as in the first or second embodiment of the present invention. Program instructions / modules corresponding to the provided image transmission method (for example, the modules in the image transmission device shown in FIG. 3 include: eye image acquisition module 31, fixation point determination module 32, pupil radius determination module 33, to be transmitted The data determination module 34 and the transmission module 35 further include a target image display module 36). The processor 41 runs software programs, instructions, and modules stored in the storage device 42 to execute various functional applications and data processing of the terminal device, that is, to implement the image transmission method in the foregoing method embodiment.
存储装置42可包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需的应用程序;存储数据区可存储根据设备的使用所创建的数据等。此外,存储装置42可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件、闪存器件、或其他非易失性固态存储器件。在一些实例中,存储装置42可进一步包括相对于处理器41远程设置的存储器,这些远程存储器可以通过网络连接至设备。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The storage device 42 may include a storage program area and a storage data area, wherein the storage program area may store an operating system and application programs required for at least one function; the storage data area may store data created according to the use of the device, and the like. In addition, the storage device 42 may include a high-speed random access memory, and may further include a non-volatile memory, such as at least one magnetic disk storage device, a flash memory device, or other non-volatile solid-state storage device. In some examples, the storage device 42 may further include memories remotely provided with respect to the processor 41, and these remote memories may be connected to the device through a network. Examples of the above network include, but are not limited to, the Internet, an intranet, a local area network, a mobile communication network, and combinations thereof.
输入装置43可设置为接收输入的数字或字符信息,以及产生与终端设备的用户设置以及功能控制有关的键信号输入或获取用户注视目标图像时的眼部图像。输入装置43可包括但不限于:图像采集装置(如红外摄像头,该红外摄像头配置有红外灯)、按键和/或麦克风等输入设备。输出装置44可包括但不限于:显示屏。The input device 43 may be configured to receive inputted numeric or character information, and generate key signal input related to user settings and function control of the terminal device or acquire an eye image when the user looks at the target image. The input device 43 may include, but is not limited to, an image acquisition device (such as an infrared camera configured with an infrared lamp), input devices such as keys and / or a microphone. The output device 44 may include, but is not limited to, a display screen.
并且,当上述终端设备所包括一个或者多个程序被所述一个或者多个处理器41执行时,程序进行如下操作:获取用户注视目标图像时的眼部图像;基于所述眼部图像对应的视线确定所述用户的注视点;获取所述用户注视所述注视点的瞳孔半径;基于所述用户的注视点、所述瞳孔半径对应的关注度和所述目标图像中各像素点的像素值将所述目标图像进行区域划分,并为各划分后的区域分配优先级,得到待传输数据;传输所述待传输数据。In addition, when the one or more programs included in the terminal device are executed by the one or more processors 41, the program performs the following operations: acquiring an eye image when the user looks at the target image; and based on the corresponding eye image, The line of sight determines the user's gaze point; obtains the pupil radius of the user's gaze point; based on the user's gaze point, the degree of attention corresponding to the pupil radius, and the pixel value of each pixel in the target image Divide the target image into regions, and assign priorities to the divided regions to obtain data to be transmitted; and transmit the data to be transmitted.
此外,本发明实施例还提供一种计算机存储介质,其上存储有计算机程序,该计算机存储介质可以包括可读存储介质和/或可写存储介质。该程序被处理器 执行时用于执行一种图像传输方法,该方法包括:获取用户注视目标图像时的眼部图像;基于所述眼部图像对应的视线确定所述用户的注视点;获取所述用户注视所述注视点的瞳孔半径;基于所述用户的注视点、所述瞳孔半径对应的关注度和所述目标图像中各像素点的像素值将所述目标图像进行区域划分,并为各划分后的区域分配优先级,得到待传输数据;传输所述待传输数据。In addition, an embodiment of the present invention further provides a computer storage medium on which a computer program is stored. The computer storage medium may include a readable storage medium and / or a writable storage medium. When the program is executed by a processor, the method is used to execute an image transmission method. The method includes: acquiring an eye image when a user fixes a target image; determining a user's gaze point based on a line of sight corresponding to the eye image; The pupil radius of the user gazing at the fixation point; dividing the target image into regions based on the user's fixation point, the degree of attention corresponding to the pupil radius, and the pixel value of each pixel in the target image, and Priority is assigned to each divided area to obtain data to be transmitted; and the data to be transmitted is transmitted.
可选的,该程序被处理器执行时还可以用于执行本发明任意实施例所提供的一种图像传输方法的技术方案。通过以上关于实施方式的描述,所属领域的技术人员可以清楚地了解到,本发明可借助软件及必需的通用硬件来实现,当然也可以通过硬件实现,但很多情况下前者是更佳的实施方式。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品可以存储在计算机可读存储介质中,如计算机的软盘、只读存储器(Read-Only Memory,ROM)、随机存取存储器(Random Access Memory,RAM)、闪存(FLASH)、硬盘或光盘等,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本发明各个实施例所述的方法。Optionally, when the program is executed by the processor, the program may also be used to execute a technical solution of an image transmission method provided by any embodiment of the present invention. Through the above description of the embodiments, those skilled in the art can clearly understand that the present invention can be implemented by software and necessary general hardware, and of course, can also be implemented by hardware, but in many cases the former is a better implementation . Based on such an understanding, the technical solution of the present invention that is essentially or contributes to the existing technology can be embodied in the form of a software product, which can be stored in a computer-readable storage medium, such as a computer floppy disk , Read-only memory (ROM), random access memory (RAM), flash memory (FLASH), hard disk or optical disk, etc., including several instructions to make a computer device (can be a personal computer , Server, or network device, etc.) perform the methods described in the embodiments of the present invention.
注意,上述仅为本发明的较佳实施例及所运用技术原理。本领域技术人员会理解,本发明不限于这里所述的特定实施例,对本领域技术人员来说能够进行各种明显的变化、重新调整和替代而不会脱离本发明的保护范围。因此,虽然通过以上实施例对本发明进行了较为详细的说明,但是本发明不仅仅限于以上实施例,在不脱离本发明构思的情况下,还可以包括更多其他等效实施例,而本发明的范围由所附的权利要求范围决定。Note that the above are only the preferred embodiments of the present invention and the applied technical principles. Those skilled in the art will understand that the present invention is not limited to the specific embodiments described herein, and those skilled in the art can make various obvious changes, readjustments and substitutions without departing from the scope of protection of the present invention. Therefore, although the present invention has been described in more detail through the above embodiments, the present invention is not limited to the above embodiments. Without departing from the concept of the present invention, more equivalent embodiments may be included, and the present invention The scope is determined by the scope of the appended claims.
工业实用性Industrial applicability
本发明实施例提供的方案,可应用于图像传输过程中,该方案通过首先获取用户注视目标图像时的眼部图像;其次基于所述眼部图像对应的视线确定所述用户的注视点;然后获取所述用户注视所述注视点的瞳孔半径;之后基于所述用户的注视点、所述瞳孔半径对应的关注度和所述目标图像中各像素点的像素值将所述目标图像进行区域划分,并为各划分后的区域分配优先级,得到待传输数据;最后传输所述待传输数据。利用上述技术方案,能够在终端设备传输目标图像之前,通过用户注视目标图像时的眼部图像对应的视线所确定的注视点、用户注视该注视点时的瞳孔半径对应的关注度及目标图像中各像素点的像素值对目标图像进行划分,然后以不同的优先级进行传输(不同优先级可以对应不同的码率或不同的渲染精度)。相比于直接传输目标图像而言,有效的减少了传输的数据量,从而降低传输目标图像所需时间,提高了图像传输的效率;相比于将目标图像整体进行压缩而言,有效的提高了用户关注区域图像的清晰度,提高了目标图像压缩效果。The solution provided by the embodiment of the present invention may be applied to an image transmission process. The solution first obtains an eye image when a user fixes a target image; secondly, determines a user's fixation point based on a line of sight corresponding to the eye image; Obtaining a pupil radius at which the user fixates on the fixation point; and thereafter dividing the target image into regions based on the fixation point of the user, a degree of attention corresponding to the pupil radius, and a pixel value of each pixel in the target image And assign priority to each divided area to obtain data to be transmitted; and finally transmit the data to be transmitted. With the above technical solution, before the terminal device transmits the target image, the fixation point determined by the line of sight corresponding to the eye image when the user fixes the target image, the degree of attention corresponding to the pupil radius when the user fixes the fixation point, and the target image The pixel value of each pixel points divides the target image, and then transmits it with different priorities (different priorities may correspond to different code rates or different rendering accuracy). Compared with directly transmitting the target image, it effectively reduces the amount of data to be transmitted, thereby reducing the time required to transmit the target image, and improving the efficiency of image transmission; compared with compressing the target image as a whole, it effectively improves It improves the sharpness of the image of the user's area of interest and improves the compression effect of the target image.

Claims (10)

  1. 一种图像传输方法,包括:An image transmission method includes:
    获取用户注视目标图像时的眼部图像;Obtaining an eye image when a user looks at a target image;
    基于所述眼部图像对应的视线确定所述用户的注视点;Determining a gaze point of the user based on a line of sight corresponding to the eye image;
    获取所述用户注视所述注视点的瞳孔半径;Obtaining a pupil radius at which the user looks at the fixation point;
    基于所述用户的注视点、所述瞳孔半径对应的关注度和所述目标图像中各像素点的像素值将所述目标图像进行区域划分,并为各划分后的区域分配优先级,得到待传输数据;Divide the target image into regions based on the user's fixation point, the degree of attention corresponding to the pupil radius, and the pixel value of each pixel in the target image, and assign priorities to the divided regions to obtain transfer data;
    传输所述待传输数据。Transmitting the data to be transmitted.
  2. 根据权利要求1所述的方法,其中,所述基于所述眼部图像对应的视线确定所述用户的注视点,包括:The method according to claim 1, wherein the determining a gaze point of the user based on a line of sight corresponding to the eye image comprises:
    提取所述眼部图像中的特征信息,所述特征信息包括瞳孔特征;Extract feature information in the eye image, where the feature information includes pupil features;
    根据所述特征信息确定所述眼部图像对应的视线;Determining a line of sight corresponding to the eye image according to the feature information;
    根据确定出的视线确定所述用户在所述目标图像中的注视点。Determining a gaze point of the user in the target image according to the determined line of sight.
  3. 根据权利要求1所述的方法,其中,所述获取所述用户注视所述注视点的瞳孔半径,包括:The method according to claim 1, wherein the obtaining a pupil radius of the user gazing at the gaze point comprises:
    识别所述用户注视所述注视点的眼部图像,确定所述用户注视所述注视点的瞳孔半径。An eye image of the user gazing at the gaze point is identified, and a pupil radius of the user gazing at the gaze point is determined.
  4. 根据权利要求1所述的方法,其中,所述基于所述用户的注视点、所述瞳孔半径对应的关注度和所述目标图像中各像素点的像素值将所述目标图像进行区域划分,并为各划分后的区域分配优先级,得到待传输数据,包括:The method according to claim 1, wherein the target image is divided into regions based on the user's fixation point, the degree of attention corresponding to the pupil radius, and the pixel value of each pixel in the target image, Priority is assigned to each divided area to obtain data to be transmitted, including:
    将所述瞳孔半径与预设瞳孔数据表进行比对,确定对应的关注度;Comparing the pupil radius with a preset pupil data table to determine a corresponding degree of attention;
    利用边缘算子对所述目标图像进行边缘检测,确定出所述目标图像中各封 闭区域;Performing edge detection on the target image using an edge operator to determine each closed area in the target image;
    从确定出的各封闭区域中选取包含所述用户的注视点的第一区域,并将所述目标图像中除所述第一区域外的区域作为第二区域;Selecting a first region containing the user's gaze point from the determined closed regions, and using a region other than the first region in the target image as a second region;
    根据所述关注度和预设的关注度对照表,为所述第一区域和所述第二区域分配对应的优先级,得到待传输数据。According to the attention degree and a preset attention degree comparison table, corresponding priorities are assigned to the first area and the second area to obtain data to be transmitted.
  5. 根据权利要求4所述的方法,其中,所述传输所述待传输数据,包括:The method according to claim 4, wherein said transmitting said data to be transmitted comprises:
    按照预设顺序传输所述待传输数据中各像素点的位置信息和对应的像素值。The position information of each pixel in the data to be transmitted and the corresponding pixel value are transmitted in a preset order.
  6. 根据权利要求1所述的方法,其中,在所述获取用户注视目标图像时的眼部图像之前,还包括:The method according to claim 1, before the acquiring an eye image when a user fixes a target image, further comprising:
    当监测到启动指令时,显示目标图像。When a start instruction is detected, a target image is displayed.
  7. 一种图像传输装置,包括:An image transmission device includes:
    眼部图像获取模块,设置为获取用户注视目标图像时的眼部图像;An eye image acquisition module configured to acquire an eye image when a user looks at a target image;
    注视点确定模块,设置为基于所述眼部图像对应的视线确定用户的注视点;A fixation point determining module, configured to determine a fixation point of a user based on a line of sight corresponding to the eye image;
    瞳孔半径确定模块,设置为获取所述用户注视所述注视点的瞳孔半径;A pupil radius determination module, configured to obtain a pupil radius at which the user looks at the fixation point;
    待传数据确定模块,设置为基于所述用户的注视点、所述瞳孔半径对应的关注度和所述目标图像中各像素点的像素值将所述目标图像进行区域划分,并为各划分后的区域分配优先级,得到待传输数据;The data to be transmitted determination module is configured to divide the target image into regions based on the user's fixation point, the degree of attention corresponding to the pupil radius, and the pixel value of each pixel in the target image, The priority of the assigned area to get the data to be transmitted;
    传输模块,设置为传输所述待传输数据。A transmission module configured to transmit the data to be transmitted.
  8. 根据权利要求7所述的装置,其中,还包括:The apparatus according to claim 7, further comprising:
    目标图像显示模块,设置为当监测到启动指令时,显示目标图像。The target image display module is configured to display a target image when a start instruction is detected.
  9. 一种终端设备,包括:A terminal device includes:
    一个或多个处理器;One or more processors;
    存储装置,设置为存储一个或多个程序;A storage device configured to store one or more programs;
    所述一个或多个程序被所述一个或多个处理器执行,使得所述一个或多个处理器实现如权利要求1至6中任一项所述的图像传输方法。The one or more programs are executed by the one or more processors, so that the one or more processors implement the image transmission method according to any one of claims 1 to 6.
  10. 一种计算机存储介质,其上存储有计算机程序,该程序被处理器执行时实现如权利要求1至6中任一项所述的图像传输方法。A computer storage medium having stored thereon a computer program that, when executed by a processor, implements the image transmission method according to any one of claims 1 to 6.
PCT/CN2019/089353 2018-07-16 2019-05-30 Image transmission method and apparatus, terminal device, and storage medium WO2020015468A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810777991.2 2018-07-16
CN201810777991.2A CN108919958B (en) 2018-07-16 2018-07-16 Image transmission method and device, terminal equipment and storage medium

Publications (1)

Publication Number Publication Date
WO2020015468A1 true WO2020015468A1 (en) 2020-01-23

Family

ID=64411035

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/089353 WO2020015468A1 (en) 2018-07-16 2019-05-30 Image transmission method and apparatus, terminal device, and storage medium

Country Status (2)

Country Link
CN (1) CN108919958B (en)
WO (1) WO2020015468A1 (en)

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108919958B (en) * 2018-07-16 2020-10-09 北京七鑫易维信息技术有限公司 Image transmission method and device, terminal equipment and storage medium
CN110378914A (en) * 2019-07-22 2019-10-25 北京七鑫易维信息技术有限公司 Rendering method and device, system, display equipment based on blinkpunkt information
CN111147549B (en) * 2019-12-06 2023-05-12 珠海格力电器股份有限公司 Terminal desktop content sharing method, device, equipment and storage medium
CN111553846B (en) * 2020-05-12 2023-05-26 Oppo广东移动通信有限公司 Super-resolution processing method and device
CN114092323A (en) * 2020-06-29 2022-02-25 Oppo广东移动通信有限公司 Image processing method, image processing device, storage medium and electronic equipment
CN111679772B (en) * 2020-08-17 2020-11-24 深圳诚一信科技有限公司 Screen recording method and system, multi-screen device and readable storage medium
CN111988525A (en) * 2020-08-25 2020-11-24 Oppo广东移动通信有限公司 Image processing method and related device
CN112528107A (en) * 2020-12-07 2021-03-19 支付宝(杭州)信息技术有限公司 Content data display method and device and server
CN112579029A (en) * 2020-12-11 2021-03-30 上海影创信息科技有限公司 Display control method and system of VR glasses
CN114935971B (en) * 2021-02-05 2024-08-20 京东方科技集团股份有限公司 Display device and display driving method
CN112988950B (en) * 2021-03-12 2023-10-13 成都数联铭品科技有限公司 Front-end rendering method and system of knowledge graph, electronic equipment and storage medium
CN113012501B (en) * 2021-03-18 2023-05-16 深圳市天天学农网络科技有限公司 Remote teaching method
CN113269044A (en) * 2021-04-27 2021-08-17 青岛小鸟看看科技有限公司 Display control method and device of head-mounted display equipment and head-mounted display equipment
CN113362450B (en) * 2021-06-02 2023-01-03 聚好看科技股份有限公司 Three-dimensional reconstruction method, device and system
CN113256661A (en) * 2021-06-23 2021-08-13 北京蜂巢世纪科技有限公司 Image processing method, apparatus, device, medium, and program product
CN113645500B (en) * 2021-10-15 2022-01-07 北京蔚领时代科技有限公司 Virtual reality video stream data processing system
CN114040184B (en) * 2021-11-26 2024-07-16 京东方科技集团股份有限公司 Image display method, system, storage medium and computer program product
CN116149471A (en) * 2022-12-30 2023-05-23 维沃移动通信有限公司 Display control method, device, augmented reality equipment and medium
CN116382549B (en) * 2023-05-22 2023-09-01 昆山嘉提信息科技有限公司 Image processing method and device based on visual feedback

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106682946A (en) * 2016-12-30 2017-05-17 北京七鑫易维信息技术有限公司 Advertisement content analysis method and device
CN107168668A (en) * 2017-04-28 2017-09-15 北京七鑫易维信息技术有限公司 Image data transfer method, device and storage medium, processor
CN107390863A (en) * 2017-06-16 2017-11-24 北京七鑫易维信息技术有限公司 Control method and device, electronic equipment, the storage medium of equipment
CN108919958A (en) * 2018-07-16 2018-11-30 北京七鑫易维信息技术有限公司 A kind of image transfer method, device, terminal device and storage medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6917715B2 (en) * 2002-04-19 2005-07-12 International Business Machines Corporation Foveal priority in stereoscopic remote viewing system
JP2010191026A (en) * 2009-02-17 2010-09-02 Kddi Corp Terminal outputting image data in accordance with external display device, program, and method
CN106445129A (en) * 2016-09-14 2017-02-22 乐视控股(北京)有限公司 Method, device and system for displaying panoramic picture information
CN107153519A (en) * 2017-04-28 2017-09-12 北京七鑫易维信息技术有限公司 Image transfer method, method for displaying image and image processing apparatus

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106682946A (en) * 2016-12-30 2017-05-17 北京七鑫易维信息技术有限公司 Advertisement content analysis method and device
CN107168668A (en) * 2017-04-28 2017-09-15 北京七鑫易维信息技术有限公司 Image data transfer method, device and storage medium, processor
CN107390863A (en) * 2017-06-16 2017-11-24 北京七鑫易维信息技术有限公司 Control method and device, electronic equipment, the storage medium of equipment
CN108919958A (en) * 2018-07-16 2018-11-30 北京七鑫易维信息技术有限公司 A kind of image transfer method, device, terminal device and storage medium

Also Published As

Publication number Publication date
CN108919958A (en) 2018-11-30
CN108919958B (en) 2020-10-09

Similar Documents

Publication Publication Date Title
WO2020015468A1 (en) Image transmission method and apparatus, terminal device, and storage medium
WO2020216054A1 (en) Sight line tracking model training method, and sight line tracking method and device
CN108681399B (en) Equipment control method, device, control equipment and storage medium
JP2024045273A (en) System and method for detecting human gaze and gesture in unconstrained environments
RU2672502C1 (en) Device and method for forming cornea image
CN105787884A (en) Image processing method and electronic device
CN103885589A (en) Eye movement tracking method and device
CN108076290B (en) Image processing method and mobile terminal
CN109032351B (en) Fixation point function determination method, fixation point determination device and terminal equipment
KR20170031733A (en) Technologies for adjusting a perspective of a captured image for display
CN107277375B (en) Self-photographing method and mobile terminal
US11487354B2 (en) Information processing apparatus, information processing method, and program
US11080888B2 (en) Information processing device and information processing method
KR20170022078A (en) Display apparatus and controlling method thereof
Sun et al. Real-time gaze estimation with online calibration
CN108140124B (en) Prompt message determination method and device and electronic equipment
WO2023011103A1 (en) Parameter control method and apparatus, head-mounted display device, and storage medium
CN105306819A (en) Gesture-based photographing control method and device
JP2023515205A (en) Display method, device, terminal device and computer program
CN114092985A (en) Terminal control method, device, terminal and storage medium
WO2019085519A1 (en) Method and device for facial tracking
EP2888716B1 (en) Target object angle determination using multiple cameras
KR20200144196A (en) Electronic device and method for providing function using corneal image thereof
CN111610886A (en) Method and device for adjusting brightness of touch screen and computer readable storage medium
WO2020044916A1 (en) Information processing device, information processing method, and program

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19837591

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 03/05/2021)

122 Ep: pct application non-entry in european phase

Ref document number: 19837591

Country of ref document: EP

Kind code of ref document: A1