US20140359486A1 - Apparatus and method for configuring screen for video call using facial expression - Google Patents

Apparatus and method for configuring screen for video call using facial expression Download PDF

Info

Publication number
US20140359486A1
US20140359486A1 US14/463,109 US201414463109A US2014359486A1 US 20140359486 A1 US20140359486 A1 US 20140359486A1 US 201414463109 A US201414463109 A US 201414463109A US 2014359486 A1 US2014359486 A1 US 2014359486A1
Authority
US
United States
Prior art keywords
screen
face
video call
image
expression
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US14/463,109
Inventor
Ji-Young Yi
Sung-Dae Cho
Kee-Hyon Park
Jong-man Kim
Jin-Ho Kim
Chul-Hwan Lee
Dong-Hoon Jang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Samsung Electronics Co Ltd
Original Assignee
Samsung Electronics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Samsung Electronics Co Ltd filed Critical Samsung Electronics Co Ltd
Priority to US14/463,109 priority Critical patent/US20140359486A1/en
Publication of US20140359486A1 publication Critical patent/US20140359486A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • G06K9/00302
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0481Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • G06F3/04842Selection of displayed objects or displayed text elements
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/161Detection; Localisation; Normalisation
    • G06V40/165Detection; Localisation; Normalisation using facial parts and geometric relationships
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/174Facial expression recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/4223Cameras
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/478Supplemental services, e.g. displaying phone caller identification, shopping application
    • H04N21/4788Supplemental services, e.g. displaying phone caller identification, shopping application communicating with other users, e.g. chatting
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/147Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals

Definitions

  • the present invention relates generally to a screen configuring apparatus and method, and more particularly, to an apparatus and method for configuring a screen for a video call by selecting an image of an interested user among multiple users.
  • a picture of a caller is taken using a camera, displayed on a screen, and images of the persons with whom the caller wants to have a telephone conversation are displayed in a specific location of the screen, for a video call.
  • a multipoint video call (or video conference call) technique which allows a user to have a video call with multiple persons on a mobile terminal, automatically identifies a speaking party by lip movement recognition, and displays an image of the speaker at the center of the screen, making it possible to talk with multiple persons.
  • a display includes a main screen having the largest area on a video call screen, and at least one sub screen.
  • the conventional multipoint video call technique may malfunction when several users move their lips at the same time.
  • the present invention has been made to solve the above-mentioned problems occurring in the prior art, and the present invention provides a video call apparatus and method for estimating a facial expression during a video call with multiple users, selecting an image of an interested person, and allowing a user to have a video call with the selected interested person.
  • an apparatus for configuring a screen for a video call using a facial expression which includes a facial expression information calculator for recognizing a face from an image, and calculating facial expression information for an expression of the recognized face; a facial expression determiner for determining whether there is a change in expression of the recognized face by comparing the calculated facial expression information with reference expression information preset to determine a change in expression of the face; a screen configurer for configuring a video call screen including multiple video images received for the video call; and an image selector for selecting a video image corresponding to the changed expression in the video call screen if there is a change in expression.
  • the screen configurer may reconfigure the video call screen using the selected video image.
  • a method for configuring a screen for a video call using a facial expression by configuring a video call screen including multiple video images received for the video call; recognizing a face from an image, and calculating facial expression information for an expression of the recognized face; determining whether there is a change in expression of the recognized face by comparing the calculated facial expression information with reference expression information preset to determine a change in expression of the face; selecting a video image corresponding to the changed expression in the video call screen if there is a change in expression; and reconfiguring the video call screen using the selected video image.
  • FIG. 1 is a diagram illustrating a structure of a screen configuring apparatus according to an embodiment of the present invention
  • FIG. 2 is a flowchart illustrating a process of extracting reference expression information used to estimate changes in facial expression in a screen configuring apparatus according to an embodiment of the present invention
  • FIG. 3 is a diagram illustrating images obtained in a process of extracting reference expression information according to an embodiment of the present invention
  • FIG. 4 is a flowchart illustrating a process of reconfiguring a video call screen corresponding to changes in facial expression during a video call in a screen configuring apparatus according to an embodiment of the present invention.
  • FIGS. 5 to 7 are diagrams illustrating images obtained in a process of configuring a video call screen according to an embodiment of the present invention.
  • FIG. 1 is a diagram illustrating a structure of a screen configuring apparatus according to an embodiment of the present invention.
  • the screen configuring apparatus includes a facial expression information calculator 100 , a facial expression determiner 110 , an image selector 120 , and a screen configurer 130 .
  • the facial expression information calculator 100 calculates facial expression information within a frame of an input image received from a camera during a video call, or of input images received outside the video call.
  • the facial expression information calculator 100 presets reference expression information that is used to determine changes in facial expression during a video call from an input image received from the camera before the video call.
  • the facial expression information calculator 100 includes a face recognizer 101 , a facial feature extractor 102 , and a face angle calculator 103 .
  • the face recognizer 101 uses a general face recognition technique in recognizing a face area in an input image, for example, recognizing an area corresponding to a preset facial skin color in an input image, as a face area.
  • the facial feature extractor 102 extracts facial features in the recognized face area.
  • a general facial feature extraction technique is used.
  • the facial features as used herein may refer to facial feature components such as eyes, nose, mouth and chin.
  • the face angle calculator 103 calculates a reference face angle based on the extracted facial features. Specifically, the face angle calculator 103 draws polygonal sides by connecting the calculated facial features, and calculates an angle of the recognized face based on the drawn polygonal sides. For the calculation of the face angle, a general face angle calculation technique is used.
  • the screen configurer 130 configures a video call screen for the video call, using at least one input image received during the video call and a user image received from a camera.
  • the at least one input image is defined as at least one sub image
  • the user image received from a camera is defined as a main image.
  • the face configurer 130 displays a main image in an area with a preset size on the video call screen, and displays at least one sub image in the remaining area except for the area where the main image is displayed.
  • the screen configurer 130 sets a size of the area where the main image is displayed on the video call screen, to be greater than a size of the area where the at least one sub image is displayed.
  • the facial expression determiner 110 determines whether there is a change in facial expression by comparing facial expression information in the main image, calculated by the facial expression information calculator 100 during a video call, with preset reference expression information.
  • the facial expression determiner 110 determines whether there is a change in face angle by comparing the face angle in the main image calculated by the face angle calculator 103 with a preset reference face angle.
  • the image selector 120 selects a sub image corresponding to the changed facial expression from among multiple sub images located on the video call screen.
  • the image selector 120 estimates a face direction corresponding to the face angle in the main image, and selects a sub image corresponding to the estimated face direction in a face area of the main image on the video call screen.
  • the screen configurer 130 reconfigures a video call screen corresponding to the changed facial expression using the sub image selected by the image selector 120 , and displays the reconfigured video call screen.
  • the screen configurer 130 switches between a screen of the main image and a screen of the selected sub image on the video call screen.
  • the screen configuring apparatus estimates a facial expression of a user and selects an image of an interested person on the video call screen, making it possible for the user to conveniently select an image of the interested person without taking extensive action.
  • FIG. 2 is a diagram illustrating a process of setting reference expression information in a screen configuring apparatus according to an embodiment of the present invention.
  • the facial expression information calculator 100 upon receiving an image from a camera in step 200 , the facial expression information calculator 100 recognizes a face in the received image in step 210 .
  • a general face recognition technique is used, and a technique of learning a skin color and recognizing an area corresponding to the learned skin color as a face area may also be used.
  • the facial expression information calculator 100 recognizes a face area 301 in an input image 300 .
  • the facial expression information calculator 100 extracts facial features in the recognized face. As represented by reference numeral 310 , the facial expression information calculator 100 extracts facial features at the locations corresponding to eyes, nose, mouth and chin in the face area.
  • the facial expression information calculator 100 calculates a face angle of the recognized face based on the extracted facial features. For example, the facial expression information calculator 100 calculates a face angle 321 in an image 320 using an area of a polygon by connecting the facial features, and then ends setting the reference expression information.
  • the screen configuring apparatus may recognize changes in facial expression in an image received during a video call and reconfigure a video call screen corresponding to the changed facial expression.
  • FIG. 4 is a diagram illustrating a process of reconfiguring a video call screen corresponding to changes in facial expression during a video call in a screen configuring apparatus according to an embodiment of the present invention.
  • a user image received from a camera is defined as a main image, and at least one input image received from outside is defined as at least one sub image.
  • An embodiment of the present invention will be described with reference to FIGS. 5 to 7 .
  • FIGS. 5 to 7 are diagrams illustrating images obtained in a process of configuring a video call screen according to an embodiment of the present invention.
  • the screen con figurer 130 configures and displays a video call screen including a screen of a main image and a screen of at least one sub image in step 401 .
  • the screen configurer 130 displays a main image in an area with a preset size on the video call screen, and displays at least one sub image in the remaining area except for the area where the main image is displayed.
  • the screen configurer 130 sets a size of the area where the main image is displayed on the video call screen, to be greater than a size of the area where the at least one sub image is displayed.
  • the displayed video call screen may be as illustrated in FIG. 5 .
  • the facial expression information calculator 100 recognizes a face in the main image, calculates facial features of the recognized face, and calculates a face angle based on the calculated facial features.
  • the face recognizer 101 recognizes a face area in the main image using the general face recognition technique, for example, by recognizing an area corresponding to a preset facial skin color in an input image, as a face area.
  • the facial feature extractor 102 extracts facial features in the recognized face area, and the face angle calculator 103 calculates a reference face angle based on the extracted facial features.
  • step 403 the facial expression determiner 110 compares the face angle in the main image calculated by the facial expression information calculator 100 , with a preset reference face angle.
  • step 404 the facial expression determiner 110 determines whether there is a change in face angle. If there is a change in face angle, the image selector 120 proceeds to step 405 . Otherwise, the screen configurer 130 continuously displays the video call screen in step 401 .
  • step 405 the image selector 120 selects a sub image that is located on the video call screen to correspond to the face angle of the main image.
  • the image selector 120 estimates a face direction corresponding to the face angle in the main image, and selects a sub image corresponding to the estimated face direction in a face area of the main image on the video call screen.
  • the image selector 120 estimates a face direction 502 corresponding to a face angle of a main image 500 , and selects a sub image 501 corresponding to the estimated face direction 502 in the face area.
  • the screen configurer 130 may further display, on the video call screen, face direction arrow icons for allowing the user to recognize face directions corresponding to face angles. These face direction arrow icons may be displayed to overlap the screen of the main image.
  • the screen configurer 130 may display the edge of the selected sub image to be bold, or may display the selected sub image to be greater in size than other sub images.
  • step 406 the screen configurer 130 determines whether a change in face angle is continuously recognized for a preset time. If the change in face angle is continuously recognized, the screen configurer 130 proceeds to step 407 . Otherwise, the screen configurer 130 continuously displays the video call screen in step 401 .
  • the reason why the screen configurer 130 determines whether a change in face angle continues for a preset time is to prevent a wrong sub image from being selected due to the unintended user facial movement.
  • the screen configurer 130 reconfigures a video call screen corresponding to the changed facial expression using the sub image selected by the image selector 120 , and displays the reconfigured video call screen. For example, as illustrated in FIG. 6 , the screen configurer 130 reconfigures the video call screen such that the screen of the main image and the screen of the selected sub image are switched in terms of the location, thereby displaying the selected sub image in the area of the main image.
  • the screen configurer 130 may to place the screen of the sub image in the area of the main image by switching between a screen 700 of the main image and a screen 701 of the sub image on the video call screen.
  • step 408 the screen configurer 130 determines if the video call has been completed. If the video call has been completed, the screen con figurer 130 ends the video call operation. Otherwise, the screen configurer 130 returns to step 401 and repeats its succeeding steps.
  • the user's facial expression is estimated and an image of the user's interested person on the video call screen is selected, thereby allowing the user to conveniently select an image of the interested person without taking extensive action.
  • the embodiments of the present invention may set a time margin for selecting an image of an interested person, to prevent a wrong image from being selected due to unintended user facial movement.
  • the embodiments of the present invention provide accurate, intuitive and convenient functions according to the display screen of the video call apparatus, thereby increasing user convenience.

Abstract

Apparatus and method for configuring a screen for a video call using a facial expression by recognizing a face from an image, calculating facial expression information for an expression of the recognized face, and determining whether there is a change in expression of the recognized face by comparing the calculated facial expression information with reference expression information preset to determine a change in expression of the face. If there is a change in expression of the recognized face, the apparatus and method selects a video image corresponding to the changed expression in the video call screen, and reconfigures the video call screen using the selected video image, making it possible for a user to conveniently select an image of the interested person without taking extensive action, and preventing a wrong image from being selected due to the unintended user facial movement.

Description

    PRIORITY
  • This application is a Continuation Application of U.S. application Ser. No. 13/293,720, filed at the U.S. Patent and Trademark Office on Nov. 10, 2011, now U.S. Pat. No. 8,810,624, issued on Aug. 19, 2014, which claims priority under 35 10 U.S.C. §119(a) to a Korean Patent Application filed in the Korean Intellectual Property
  • Office on Nov. 10, 2010 and assigned Serial No. 10-2010-0111791, the entire disclosure of which is incorporated herein by reference.
  • BACKGROUND OF THE INVENTION
  • 1. Field of the Invention
  • The present invention relates generally to a screen configuring apparatus and method, and more particularly, to an apparatus and method for configuring a screen for a video call by selecting an image of an interested user among multiple users.
  • 2. Description of the Related Art
  • In a video call, a picture of a caller is taken using a camera, displayed on a screen, and images of the persons with whom the caller wants to have a telephone conversation are displayed in a specific location of the screen, for a video call.
  • A multipoint video call (or video conference call) technique, which allows a user to have a video call with multiple persons on a mobile terminal, automatically identifies a speaking party by lip movement recognition, and displays an image of the speaker at the center of the screen, making it possible to talk with multiple persons.
  • In a multipoint video call apparatus, a display includes a main screen having the largest area on a video call screen, and at least one sub screen.
  • However, the conventional multipoint video call technique may malfunction when several users move their lips at the same time.
  • Additionally, it is difficult for a user to select an image of another user other than the speaker, when a user has an interest in having a conversation with the other user.
  • SUMMARY OF THE INVENTION
  • Accordingly, the present invention has been made to solve the above-mentioned problems occurring in the prior art, and the present invention provides a video call apparatus and method for estimating a facial expression during a video call with multiple users, selecting an image of an interested person, and allowing a user to have a video call with the selected interested person.
  • According to one aspect of the present invention, there is provided an apparatus for configuring a screen for a video call using a facial expression which includes a facial expression information calculator for recognizing a face from an image, and calculating facial expression information for an expression of the recognized face; a facial expression determiner for determining whether there is a change in expression of the recognized face by comparing the calculated facial expression information with reference expression information preset to determine a change in expression of the face; a screen configurer for configuring a video call screen including multiple video images received for the video call; and an image selector for selecting a video image corresponding to the changed expression in the video call screen if there is a change in expression. The screen configurer may reconfigure the video call screen using the selected video image.
  • According to another aspect of the present invention, there is provided a method for configuring a screen for a video call using a facial expression by configuring a video call screen including multiple video images received for the video call; recognizing a face from an image, and calculating facial expression information for an expression of the recognized face; determining whether there is a change in expression of the recognized face by comparing the calculated facial expression information with reference expression information preset to determine a change in expression of the face; selecting a video image corresponding to the changed expression in the video call screen if there is a change in expression; and reconfiguring the video call screen using the selected video image.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • The above and other aspects, features and advantages of various embodiments of the present invention will be more apparent from the following description taken in conjunction with the accompanying drawings, in which:
  • FIG. 1 is a diagram illustrating a structure of a screen configuring apparatus according to an embodiment of the present invention;
  • FIG. 2 is a flowchart illustrating a process of extracting reference expression information used to estimate changes in facial expression in a screen configuring apparatus according to an embodiment of the present invention;
  • FIG. 3 is a diagram illustrating images obtained in a process of extracting reference expression information according to an embodiment of the present invention;
  • FIG. 4 is a flowchart illustrating a process of reconfiguring a video call screen corresponding to changes in facial expression during a video call in a screen configuring apparatus according to an embodiment of the present invention; and
  • FIGS. 5 to 7 are diagrams illustrating images obtained in a process of configuring a video call screen according to an embodiment of the present invention.
  • Throughout the drawings, the same drawing reference numerals will be used to refer to the same elements, features and structures.
  • DETAILED DESCRIPTION OF EMBODIMENTS OF THE PRESENT INVENTION
  • Various embodiments of the present invention will be described in detail with reference to the accompanying drawings. In the following description, specific details such as detailed configuration and components are merely provided to assist the overall understanding of various embodiments of the present invention. Therefore, it will be apparent to a person having ordinary skill in the art of the present invention that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. In addition, descriptions of well-known functions and constructions are omitted for clarity and conciseness.
  • FIG. 1 is a diagram illustrating a structure of a screen configuring apparatus according to an embodiment of the present invention.
  • Referring to FIG. 1, the screen configuring apparatus includes a facial expression information calculator 100, a facial expression determiner 110, an image selector 120, and a screen configurer 130.
  • The facial expression information calculator 100 calculates facial expression information within a frame of an input image received from a camera during a video call, or of input images received outside the video call.
  • The facial expression information calculator 100 presets reference expression information that is used to determine changes in facial expression during a video call from an input image received from the camera before the video call.
  • The facial expression information calculator 100 includes a face recognizer 101, a facial feature extractor 102, and a face angle calculator 103.
  • The face recognizer 101 uses a general face recognition technique in recognizing a face area in an input image, for example, recognizing an area corresponding to a preset facial skin color in an input image, as a face area.
  • The facial feature extractor 102 extracts facial features in the recognized face area. For the extraction of the facial features, a general facial feature extraction technique is used. The facial features as used herein may refer to facial feature components such as eyes, nose, mouth and chin.
  • The face angle calculator 103 calculates a reference face angle based on the extracted facial features. Specifically, the face angle calculator 103 draws polygonal sides by connecting the calculated facial features, and calculates an angle of the recognized face based on the drawn polygonal sides. For the calculation of the face angle, a general face angle calculation technique is used.
  • When a video call begins, the screen configurer 130 configures a video call screen for the video call, using at least one input image received during the video call and a user image received from a camera. The at least one input image is defined as at least one sub image, and the user image received from a camera is defined as a main image.
  • That is, the face configurer 130 displays a main image in an area with a preset size on the video call screen, and displays at least one sub image in the remaining area except for the area where the main image is displayed. The screen configurer 130 sets a size of the area where the main image is displayed on the video call screen, to be greater than a size of the area where the at least one sub image is displayed.
  • The facial expression determiner 110 determines whether there is a change in facial expression by comparing facial expression information in the main image, calculated by the facial expression information calculator 100 during a video call, with preset reference expression information.
  • Specifically, the facial expression determiner 110 determines whether there is a change in face angle by comparing the face angle in the main image calculated by the face angle calculator 103 with a preset reference face angle.
  • If there is a change in facial expression information, the image selector 120 selects a sub image corresponding to the changed facial expression from among multiple sub images located on the video call screen.
  • That is, if a difference between the calculated face angle in the main image and the preset reference face angle is greater than or equal to a preset value, the image selector 120 estimates a face direction corresponding to the face angle in the main image, and selects a sub image corresponding to the estimated face direction in a face area of the main image on the video call screen.
  • If a change in facial expression is continuously recognized for a preset time, the screen configurer 130 reconfigures a video call screen corresponding to the changed facial expression using the sub image selected by the image selector 120, and displays the reconfigured video call screen.
  • Specifically, if the face direction estimated by the image selector 120 is continuously recognized for a preset time, the screen configurer 130 switches between a screen of the main image and a screen of the selected sub image on the video call screen.
  • As such, the screen configuring apparatus estimates a facial expression of a user and selects an image of an interested person on the video call screen, making it possible for the user to conveniently select an image of the interested person without taking extensive action.
  • FIG. 2 is a diagram illustrating a process of setting reference expression information in a screen configuring apparatus according to an embodiment of the present invention.
  • Referring to FIG. 2, upon receiving an image from a camera in step 200, the facial expression information calculator 100 recognizes a face in the received image in step 210. As described above, for the recognition of a face in an image, a general face recognition technique is used, and a technique of learning a skin color and recognizing an area corresponding to the learned skin color as a face area may also be used. For example, with reference to FIG. 3, the facial expression information calculator 100 recognizes a face area 301 in an input image 300.
  • In step 220, the facial expression information calculator 100 extracts facial features in the recognized face. As represented by reference numeral 310, the facial expression information calculator 100 extracts facial features at the locations corresponding to eyes, nose, mouth and chin in the face area.
  • In step 230, the facial expression information calculator 100 calculates a face angle of the recognized face based on the extracted facial features. For example, the facial expression information calculator 100 calculates a face angle 321 in an image 320 using an area of a polygon by connecting the facial features, and then ends setting the reference expression information.
  • As such, the screen configuring apparatus may recognize changes in facial expression in an image received during a video call and reconfigure a video call screen corresponding to the changed facial expression.
  • FIG. 4 is a diagram illustrating a process of reconfiguring a video call screen corresponding to changes in facial expression during a video call in a screen configuring apparatus according to an embodiment of the present invention.
  • According to an embodiment of the present invention, a user image received from a camera is defined as a main image, and at least one input image received from outside is defined as at least one sub image. An embodiment of the present invention will be described with reference to FIGS. 5 to 7.
  • FIGS. 5 to 7 are diagrams illustrating images obtained in a process of configuring a video call screen according to an embodiment of the present invention.
  • Referring to FIG. 4, if a video call begins in step 400, the screen con figurer 130 configures and displays a video call screen including a screen of a main image and a screen of at least one sub image in step 401.
  • The screen configurer 130 displays a main image in an area with a preset size on the video call screen, and displays at least one sub image in the remaining area except for the area where the main image is displayed. The screen configurer 130 sets a size of the area where the main image is displayed on the video call screen, to be greater than a size of the area where the at least one sub image is displayed.
  • The displayed video call screen may be as illustrated in FIG. 5.
  • In step 402, the facial expression information calculator 100 recognizes a face in the main image, calculates facial features of the recognized face, and calculates a face angle based on the calculated facial features.
  • Specifically, the face recognizer 101 recognizes a face area in the main image using the general face recognition technique, for example, by recognizing an area corresponding to a preset facial skin color in an input image, as a face area.
  • Thereafter, the facial feature extractor 102 extracts facial features in the recognized face area, and the face angle calculator 103 calculates a reference face angle based on the extracted facial features.
  • In step 403, the facial expression determiner 110 compares the face angle in the main image calculated by the facial expression information calculator 100, with a preset reference face angle.
  • In step 404, the facial expression determiner 110 determines whether there is a change in face angle. If there is a change in face angle, the image selector 120 proceeds to step 405. Otherwise, the screen configurer 130 continuously displays the video call screen in step 401.
  • In step 405, the image selector 120 selects a sub image that is located on the video call screen to correspond to the face angle of the main image.
  • That is, if a difference between the calculated face angle in the main image and the preset reference face angle is greater than or equal to a preset value, the image selector 120 estimates a face direction corresponding to the face angle in the main image, and selects a sub image corresponding to the estimated face direction in a face area of the main image on the video call screen.
  • Referring to FIGS. 5 and 6, the image selector 120 estimates a face direction 502 corresponding to a face angle of a main image 500, and selects a sub image 501 corresponding to the estimated face direction 502 in the face area.
  • As illustrated in FIG. 5, the screen configurer 130 may further display, on the video call screen, face direction arrow icons for allowing the user to recognize face directions corresponding to face angles. These face direction arrow icons may be displayed to overlap the screen of the main image.
  • To emphasize that the selected sub image is a selected image, the screen configurer 130 may display the edge of the selected sub image to be bold, or may display the selected sub image to be greater in size than other sub images.
  • In step 406, the screen configurer 130 determines whether a change in face angle is continuously recognized for a preset time. If the change in face angle is continuously recognized, the screen configurer 130 proceeds to step 407. Otherwise, the screen configurer 130 continuously displays the video call screen in step 401.
  • The reason why the screen configurer 130 determines whether a change in face angle continues for a preset time is to prevent a wrong sub image from being selected due to the unintended user facial movement.
  • In step 407, the screen configurer 130 reconfigures a video call screen corresponding to the changed facial expression using the sub image selected by the image selector 120, and displays the reconfigured video call screen. For example, as illustrated in FIG. 6, the screen configurer 130 reconfigures the video call screen such that the screen of the main image and the screen of the selected sub image are switched in terms of the location, thereby displaying the selected sub image in the area of the main image.
  • As illustrated in FIG. 7, the screen configurer 130 may to place the screen of the sub image in the area of the main image by switching between a screen 700 of the main image and a screen 701 of the sub image on the video call screen.
  • In step 408, the screen configurer 130 determines if the video call has been completed. If the video call has been completed, the screen con figurer 130 ends the video call operation. Otherwise, the screen configurer 130 returns to step 401 and repeats its succeeding steps.
  • According to embodiments of the present invention, the user's facial expression is estimated and an image of the user's interested person on the video call screen is selected, thereby allowing the user to conveniently select an image of the interested person without taking extensive action. The embodiments of the present invention may set a time margin for selecting an image of an interested person, to prevent a wrong image from being selected due to unintended user facial movement.
  • In addition, the embodiments of the present invention provide accurate, intuitive and convenient functions according to the display screen of the video call apparatus, thereby increasing user convenience.
  • While the invention has been shown and described with reference to various embodiments thereof, it will be understood by those skilled in the art that various changes in form and detail may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims (14)

What is claimed is:
1. An apparatus for configuring a screen for a video call using a facial expression, comprising:
a facial expression information calculator for recognizing a face from an image, and calculating facial expression information for an expression of the recognized face;
a facial expression determiner for determining whether there is a change in expression of the recognized face by comparing the calculated facial expression information with reference expression information preset to determine a change in expression of the face;
a screen configurer for configuring a video call screen including multiple video images received for the video call; and
an image selector for selecting a video image corresponding to the changed expression in the video call screen if there is a change in expression,
wherein the screen configurer reconfigures the video call screen using the selected video image.
2. The apparatus of claim 1, wherein the video call screen includes a screen of a main image received from a camera and a screen of at least one sub image received outside of the video call.
3. The apparatus of claim 2, wherein the facial expression information calculator recognizes a face in the main image, extracts facial features of the recognized face, and calculates a face angle of the recognized face based on the extracted facial features.
4. The apparatus of claim 3, wherein the facial expression information calculator calculates a face angle from an image received before the video call, and sets the calculated face angle as a reference face angle.
5. The apparatus of claim 4, wherein the facial expression determiner determines whether a difference between the calculated face angle and the reference face angle is greater than or equal to a preset threshold by comparing the calculated face angle with the reference face angle.
6. The apparatus of claim 5, wherein the image selector estimates a face direction corresponding to the calculated face angle if the difference between the calculated face angle and the reference face angle is greater than or equal to the threshold and selects a sub image corresponding to the estimated face direction in a face area of the main image on the video call screen.
7. The apparatus of claim 6, wherein the screen configurer reconfigures the video call screen by switching between the screen of the main image and the screen of the selected sub image on the video call screen.
8. A method for configuring a screen for a video call using a facial expression, comprising:
configuring a video call screen including multiple video images received for the video call;
recognizing a face from an image, and calculating facial expression information for an expression of the recognized face;
determining whether there is a change in expression of the recognized face by comparing the calculated facial expression information with reference expression information preset to determine a change in expression of the face;
selecting a video image corresponding to the changed expression in the video call screen if there is a change in expression; and
reconfiguring the video call screen using the selected video image.
9. The method of claim 8, wherein the video call screen includes a screen of a main image received from a camera and a screen of at least one sub image received outside of the video call.
10. The method of claim 9, wherein calculating facial expression information further comprises:
recognizing a face in the main image;
extracting facial features of the recognized face; and
calculating a face angle of the recognized face based on the extracted facial features.
11. The method of claim 10, further comprising calculating a face angle from an image received before the video call, and setting the calculated face angle as a reference face angle.
12. The method of claim 11, wherein determining whether there is a change in expression of the recognized face further comprises determining whether a difference between the calculated face angle and the reference face angle is greater than or equal to a preset threshold by comparing the calculated face angle with the reference face angle.
13. The method of claim 12, wherein selecting a video image corresponding to the changed expression comprises:
estimating a face direction corresponding to the calculated face angle if the difference between the calculated face angle and the reference face angle is the threshold value or above; and
selecting a sub image corresponding to the estimated face direction in a face area of the main image on the video call screen.
14. The method of claim 13, wherein reconfiguring the video call screen comprises switching between the screen of the main image and the screen of the selected sub image on the video call screen.
US14/463,109 2010-11-10 2014-08-19 Apparatus and method for configuring screen for video call using facial expression Abandoned US20140359486A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US14/463,109 US20140359486A1 (en) 2010-11-10 2014-08-19 Apparatus and method for configuring screen for video call using facial expression

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
KR1020100111791A KR101733246B1 (en) 2010-11-10 2010-11-10 Apparatus and method for composition of picture for video call using face pose
KR10-2010-0111791 2010-11-10
US13/293,720 US8810624B2 (en) 2010-11-10 2011-11-10 Apparatus and method for configuring screen for video call using facial expression
US14/463,109 US20140359486A1 (en) 2010-11-10 2014-08-19 Apparatus and method for configuring screen for video call using facial expression

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US13/293,720 Continuation US8810624B2 (en) 2010-11-10 2011-11-10 Apparatus and method for configuring screen for video call using facial expression

Publications (1)

Publication Number Publication Date
US20140359486A1 true US20140359486A1 (en) 2014-12-04

Family

ID=46019250

Family Applications (2)

Application Number Title Priority Date Filing Date
US13/293,720 Expired - Fee Related US8810624B2 (en) 2010-11-10 2011-11-10 Apparatus and method for configuring screen for video call using facial expression
US14/463,109 Abandoned US20140359486A1 (en) 2010-11-10 2014-08-19 Apparatus and method for configuring screen for video call using facial expression

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US13/293,720 Expired - Fee Related US8810624B2 (en) 2010-11-10 2011-11-10 Apparatus and method for configuring screen for video call using facial expression

Country Status (2)

Country Link
US (2) US8810624B2 (en)
KR (1) KR101733246B1 (en)

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10890965B2 (en) * 2012-08-15 2021-01-12 Ebay Inc. Display orientation adjustment using facial landmark information
US9756282B2 (en) * 2012-11-20 2017-09-05 Sony Corporation Method and apparatus for processing a video signal for display
USD772253S1 (en) * 2013-02-19 2016-11-22 Sony Computer Entertainment Inc. Display panel or screen with an animated graphical user interface
KR102169523B1 (en) 2013-05-31 2020-10-23 삼성전자 주식회사 Display apparatus and control method thereof
US9104907B2 (en) * 2013-07-17 2015-08-11 Emotient, Inc. Head-pose invariant recognition of facial expressions
US9547808B2 (en) * 2013-07-17 2017-01-17 Emotient, Inc. Head-pose invariant recognition of facial attributes
CN103401981B (en) * 2013-07-25 2016-03-30 深圳市金立通信设备有限公司 A kind of method of initiating communication request and mobile terminal
TWD166921S (en) * 2013-08-14 2015-04-01 新力電腦娛樂股份有限公司 Graphical user interface for a display panel
USD752079S1 (en) * 2013-10-15 2016-03-22 Deere & Company Display screen with graphical user interface
KR102205498B1 (en) * 2014-09-18 2021-01-20 삼성전자주식회사 Feature extraction method and apparatus from input image
JP6592940B2 (en) * 2015-04-07 2019-10-23 ソニー株式会社 Information processing apparatus, information processing method, and program
CN107635110A (en) * 2017-09-30 2018-01-26 维沃移动通信有限公司 A kind of video interception method and terminal
CN108366221A (en) * 2018-05-16 2018-08-03 维沃移动通信有限公司 A kind of video call method and terminal
CN113342239A (en) * 2021-05-31 2021-09-03 锐迪科微电子科技(上海)有限公司 Region-of-interest determination method and apparatus

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070216773A1 (en) * 2006-02-01 2007-09-20 Sony Corporation System, apparatus, method, program and recording medium for processing image
US20100208078A1 (en) * 2009-02-17 2010-08-19 Cisco Technology, Inc. Horizontal gaze estimation for video conferencing
US20100220172A1 (en) * 2009-02-27 2010-09-02 Avaya Inc. Automatic Video Switching for Multimedia Conferencing
US8289363B2 (en) * 2006-12-28 2012-10-16 Mark Buckler Video conferencing
US8451312B2 (en) * 2010-01-06 2013-05-28 Apple Inc. Automatic video stream selection

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4206053B2 (en) 2004-03-31 2009-01-07 株式会社国際電気通信基礎技術研究所 User interface device and user interface program
CN1735163A (en) 2004-08-14 2006-02-15 鸿富锦精密工业(深圳)有限公司 The system and method for a kind of preview and switching favorite channels
JP2006285715A (en) 2005-04-01 2006-10-19 Konica Minolta Holdings Inc Sight line detection system
KR100735415B1 (en) 2005-09-01 2007-07-04 삼성전자주식회사 Method for performing video telephone call of multilateral in?wireless terminal
US8098273B2 (en) * 2006-12-20 2012-01-17 Cisco Technology, Inc. Video contact center facial expression analyzer module
KR101944416B1 (en) * 2012-07-02 2019-01-31 삼성전자주식회사 Method for providing voice recognition service and an electronic device thereof

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070216773A1 (en) * 2006-02-01 2007-09-20 Sony Corporation System, apparatus, method, program and recording medium for processing image
US8289363B2 (en) * 2006-12-28 2012-10-16 Mark Buckler Video conferencing
US20100208078A1 (en) * 2009-02-17 2010-08-19 Cisco Technology, Inc. Horizontal gaze estimation for video conferencing
US20100220172A1 (en) * 2009-02-27 2010-09-02 Avaya Inc. Automatic Video Switching for Multimedia Conferencing
US8451312B2 (en) * 2010-01-06 2013-05-28 Apple Inc. Automatic video stream selection

Also Published As

Publication number Publication date
US8810624B2 (en) 2014-08-19
US20120113211A1 (en) 2012-05-10
KR20120050346A (en) 2012-05-18
KR101733246B1 (en) 2017-05-08

Similar Documents

Publication Publication Date Title
US8810624B2 (en) Apparatus and method for configuring screen for video call using facial expression
US20200059628A1 (en) Establishing a video conference during a phone call
US20100171807A1 (en) System and associated methodology for multi-layered site video conferencing
KR101786944B1 (en) Speaker displaying method and videophone terminal therefor
US11182936B2 (en) Drawing content processing method and device for terminal apparatus, and terminal apparatus
CN114422738A (en) Compositing and scaling angularly separated sub-scenes
CN106101743B (en) Panoramic video recognition methods and device
US20160239098A1 (en) Gesture controlled communication
US20110022389A1 (en) Apparatus and method for improving performance of voice recognition in a portable terminal
CN105631804B (en) Image processing method and device
WO2021190428A1 (en) Image capturing method and electronic device
EP3118847B1 (en) Image displaying method, image displaying device, computer program and recording medium
CN113194254A (en) Image shooting method and device, electronic equipment and storage medium
EP3771203A1 (en) Electronic nameplate display method and apparatus in video conference
JP2006215116A (en) Display apparatus with imaging function, display control method for the same, display control program and recording medium with the program recorded thereon
US20130094593A1 (en) Method for adjusting video image compression using gesture
US9582179B2 (en) Apparatus and method for editing image in portable terminal
WO2018133305A1 (en) Method and device for image processing
KR20100041061A (en) Video telephony method magnifying the speaker's face and terminal using thereof
He et al. Real-time whiteboard capture and processing using a video camera for teleconferencing
CN109427036B (en) Skin color treatment method and device
AU2015201127A1 (en) Establishing a video conference during a phone call
CN116033110A (en) Video conference speaker display method, device, equipment and storage medium
WO2023078103A1 (en) Multi-mode face driving method and apparatus, electronic device, and storage medium
KR100210398B1 (en) Method of recognizing talkers in a video conferencing system

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION