US20140359486A1 - Apparatus and method for configuring screen for video call using facial expression - Google Patents
Apparatus and method for configuring screen for video call using facial expression Download PDFInfo
- Publication number
- US20140359486A1 US20140359486A1 US14/463,109 US201414463109A US2014359486A1 US 20140359486 A1 US20140359486 A1 US 20140359486A1 US 201414463109 A US201414463109 A US 201414463109A US 2014359486 A1 US2014359486 A1 US 2014359486A1
- Authority
- US
- United States
- Prior art keywords
- screen
- face
- video call
- image
- expression
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
Images
Classifications
-
- G06K9/00302—
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0481—Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0484—Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
- G06F3/04842—Selection of displayed objects or displayed text elements
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/161—Detection; Localisation; Normalisation
- G06V40/165—Detection; Localisation; Normalisation using facial parts and geometric relationships
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/174—Facial expression recognition
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/41—Structure of client; Structure of client peripherals
- H04N21/422—Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
- H04N21/4223—Cameras
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/478—Supplemental services, e.g. displaying phone caller identification, shopping application
- H04N21/4788—Supplemental services, e.g. displaying phone caller identification, shopping application communicating with other users, e.g. chatting
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/141—Systems for two-way working between two video terminals, e.g. videophone
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/141—Systems for two-way working between two video terminals, e.g. videophone
- H04N7/147—Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
Definitions
- the present invention relates generally to a screen configuring apparatus and method, and more particularly, to an apparatus and method for configuring a screen for a video call by selecting an image of an interested user among multiple users.
- a picture of a caller is taken using a camera, displayed on a screen, and images of the persons with whom the caller wants to have a telephone conversation are displayed in a specific location of the screen, for a video call.
- a multipoint video call (or video conference call) technique which allows a user to have a video call with multiple persons on a mobile terminal, automatically identifies a speaking party by lip movement recognition, and displays an image of the speaker at the center of the screen, making it possible to talk with multiple persons.
- a display includes a main screen having the largest area on a video call screen, and at least one sub screen.
- the conventional multipoint video call technique may malfunction when several users move their lips at the same time.
- the present invention has been made to solve the above-mentioned problems occurring in the prior art, and the present invention provides a video call apparatus and method for estimating a facial expression during a video call with multiple users, selecting an image of an interested person, and allowing a user to have a video call with the selected interested person.
- an apparatus for configuring a screen for a video call using a facial expression which includes a facial expression information calculator for recognizing a face from an image, and calculating facial expression information for an expression of the recognized face; a facial expression determiner for determining whether there is a change in expression of the recognized face by comparing the calculated facial expression information with reference expression information preset to determine a change in expression of the face; a screen configurer for configuring a video call screen including multiple video images received for the video call; and an image selector for selecting a video image corresponding to the changed expression in the video call screen if there is a change in expression.
- the screen configurer may reconfigure the video call screen using the selected video image.
- a method for configuring a screen for a video call using a facial expression by configuring a video call screen including multiple video images received for the video call; recognizing a face from an image, and calculating facial expression information for an expression of the recognized face; determining whether there is a change in expression of the recognized face by comparing the calculated facial expression information with reference expression information preset to determine a change in expression of the face; selecting a video image corresponding to the changed expression in the video call screen if there is a change in expression; and reconfiguring the video call screen using the selected video image.
- FIG. 1 is a diagram illustrating a structure of a screen configuring apparatus according to an embodiment of the present invention
- FIG. 2 is a flowchart illustrating a process of extracting reference expression information used to estimate changes in facial expression in a screen configuring apparatus according to an embodiment of the present invention
- FIG. 3 is a diagram illustrating images obtained in a process of extracting reference expression information according to an embodiment of the present invention
- FIG. 4 is a flowchart illustrating a process of reconfiguring a video call screen corresponding to changes in facial expression during a video call in a screen configuring apparatus according to an embodiment of the present invention.
- FIGS. 5 to 7 are diagrams illustrating images obtained in a process of configuring a video call screen according to an embodiment of the present invention.
- FIG. 1 is a diagram illustrating a structure of a screen configuring apparatus according to an embodiment of the present invention.
- the screen configuring apparatus includes a facial expression information calculator 100 , a facial expression determiner 110 , an image selector 120 , and a screen configurer 130 .
- the facial expression information calculator 100 calculates facial expression information within a frame of an input image received from a camera during a video call, or of input images received outside the video call.
- the facial expression information calculator 100 presets reference expression information that is used to determine changes in facial expression during a video call from an input image received from the camera before the video call.
- the facial expression information calculator 100 includes a face recognizer 101 , a facial feature extractor 102 , and a face angle calculator 103 .
- the face recognizer 101 uses a general face recognition technique in recognizing a face area in an input image, for example, recognizing an area corresponding to a preset facial skin color in an input image, as a face area.
- the facial feature extractor 102 extracts facial features in the recognized face area.
- a general facial feature extraction technique is used.
- the facial features as used herein may refer to facial feature components such as eyes, nose, mouth and chin.
- the face angle calculator 103 calculates a reference face angle based on the extracted facial features. Specifically, the face angle calculator 103 draws polygonal sides by connecting the calculated facial features, and calculates an angle of the recognized face based on the drawn polygonal sides. For the calculation of the face angle, a general face angle calculation technique is used.
- the screen configurer 130 configures a video call screen for the video call, using at least one input image received during the video call and a user image received from a camera.
- the at least one input image is defined as at least one sub image
- the user image received from a camera is defined as a main image.
- the face configurer 130 displays a main image in an area with a preset size on the video call screen, and displays at least one sub image in the remaining area except for the area where the main image is displayed.
- the screen configurer 130 sets a size of the area where the main image is displayed on the video call screen, to be greater than a size of the area where the at least one sub image is displayed.
- the facial expression determiner 110 determines whether there is a change in facial expression by comparing facial expression information in the main image, calculated by the facial expression information calculator 100 during a video call, with preset reference expression information.
- the facial expression determiner 110 determines whether there is a change in face angle by comparing the face angle in the main image calculated by the face angle calculator 103 with a preset reference face angle.
- the image selector 120 selects a sub image corresponding to the changed facial expression from among multiple sub images located on the video call screen.
- the image selector 120 estimates a face direction corresponding to the face angle in the main image, and selects a sub image corresponding to the estimated face direction in a face area of the main image on the video call screen.
- the screen configurer 130 reconfigures a video call screen corresponding to the changed facial expression using the sub image selected by the image selector 120 , and displays the reconfigured video call screen.
- the screen configurer 130 switches between a screen of the main image and a screen of the selected sub image on the video call screen.
- the screen configuring apparatus estimates a facial expression of a user and selects an image of an interested person on the video call screen, making it possible for the user to conveniently select an image of the interested person without taking extensive action.
- FIG. 2 is a diagram illustrating a process of setting reference expression information in a screen configuring apparatus according to an embodiment of the present invention.
- the facial expression information calculator 100 upon receiving an image from a camera in step 200 , the facial expression information calculator 100 recognizes a face in the received image in step 210 .
- a general face recognition technique is used, and a technique of learning a skin color and recognizing an area corresponding to the learned skin color as a face area may also be used.
- the facial expression information calculator 100 recognizes a face area 301 in an input image 300 .
- the facial expression information calculator 100 extracts facial features in the recognized face. As represented by reference numeral 310 , the facial expression information calculator 100 extracts facial features at the locations corresponding to eyes, nose, mouth and chin in the face area.
- the facial expression information calculator 100 calculates a face angle of the recognized face based on the extracted facial features. For example, the facial expression information calculator 100 calculates a face angle 321 in an image 320 using an area of a polygon by connecting the facial features, and then ends setting the reference expression information.
- the screen configuring apparatus may recognize changes in facial expression in an image received during a video call and reconfigure a video call screen corresponding to the changed facial expression.
- FIG. 4 is a diagram illustrating a process of reconfiguring a video call screen corresponding to changes in facial expression during a video call in a screen configuring apparatus according to an embodiment of the present invention.
- a user image received from a camera is defined as a main image, and at least one input image received from outside is defined as at least one sub image.
- An embodiment of the present invention will be described with reference to FIGS. 5 to 7 .
- FIGS. 5 to 7 are diagrams illustrating images obtained in a process of configuring a video call screen according to an embodiment of the present invention.
- the screen con figurer 130 configures and displays a video call screen including a screen of a main image and a screen of at least one sub image in step 401 .
- the screen configurer 130 displays a main image in an area with a preset size on the video call screen, and displays at least one sub image in the remaining area except for the area where the main image is displayed.
- the screen configurer 130 sets a size of the area where the main image is displayed on the video call screen, to be greater than a size of the area where the at least one sub image is displayed.
- the displayed video call screen may be as illustrated in FIG. 5 .
- the facial expression information calculator 100 recognizes a face in the main image, calculates facial features of the recognized face, and calculates a face angle based on the calculated facial features.
- the face recognizer 101 recognizes a face area in the main image using the general face recognition technique, for example, by recognizing an area corresponding to a preset facial skin color in an input image, as a face area.
- the facial feature extractor 102 extracts facial features in the recognized face area, and the face angle calculator 103 calculates a reference face angle based on the extracted facial features.
- step 403 the facial expression determiner 110 compares the face angle in the main image calculated by the facial expression information calculator 100 , with a preset reference face angle.
- step 404 the facial expression determiner 110 determines whether there is a change in face angle. If there is a change in face angle, the image selector 120 proceeds to step 405 . Otherwise, the screen configurer 130 continuously displays the video call screen in step 401 .
- step 405 the image selector 120 selects a sub image that is located on the video call screen to correspond to the face angle of the main image.
- the image selector 120 estimates a face direction corresponding to the face angle in the main image, and selects a sub image corresponding to the estimated face direction in a face area of the main image on the video call screen.
- the image selector 120 estimates a face direction 502 corresponding to a face angle of a main image 500 , and selects a sub image 501 corresponding to the estimated face direction 502 in the face area.
- the screen configurer 130 may further display, on the video call screen, face direction arrow icons for allowing the user to recognize face directions corresponding to face angles. These face direction arrow icons may be displayed to overlap the screen of the main image.
- the screen configurer 130 may display the edge of the selected sub image to be bold, or may display the selected sub image to be greater in size than other sub images.
- step 406 the screen configurer 130 determines whether a change in face angle is continuously recognized for a preset time. If the change in face angle is continuously recognized, the screen configurer 130 proceeds to step 407 . Otherwise, the screen configurer 130 continuously displays the video call screen in step 401 .
- the reason why the screen configurer 130 determines whether a change in face angle continues for a preset time is to prevent a wrong sub image from being selected due to the unintended user facial movement.
- the screen configurer 130 reconfigures a video call screen corresponding to the changed facial expression using the sub image selected by the image selector 120 , and displays the reconfigured video call screen. For example, as illustrated in FIG. 6 , the screen configurer 130 reconfigures the video call screen such that the screen of the main image and the screen of the selected sub image are switched in terms of the location, thereby displaying the selected sub image in the area of the main image.
- the screen configurer 130 may to place the screen of the sub image in the area of the main image by switching between a screen 700 of the main image and a screen 701 of the sub image on the video call screen.
- step 408 the screen configurer 130 determines if the video call has been completed. If the video call has been completed, the screen con figurer 130 ends the video call operation. Otherwise, the screen configurer 130 returns to step 401 and repeats its succeeding steps.
- the user's facial expression is estimated and an image of the user's interested person on the video call screen is selected, thereby allowing the user to conveniently select an image of the interested person without taking extensive action.
- the embodiments of the present invention may set a time margin for selecting an image of an interested person, to prevent a wrong image from being selected due to unintended user facial movement.
- the embodiments of the present invention provide accurate, intuitive and convenient functions according to the display screen of the video call apparatus, thereby increasing user convenience.
Abstract
Apparatus and method for configuring a screen for a video call using a facial expression by recognizing a face from an image, calculating facial expression information for an expression of the recognized face, and determining whether there is a change in expression of the recognized face by comparing the calculated facial expression information with reference expression information preset to determine a change in expression of the face. If there is a change in expression of the recognized face, the apparatus and method selects a video image corresponding to the changed expression in the video call screen, and reconfigures the video call screen using the selected video image, making it possible for a user to conveniently select an image of the interested person without taking extensive action, and preventing a wrong image from being selected due to the unintended user facial movement.
Description
- This application is a Continuation Application of U.S. application Ser. No. 13/293,720, filed at the U.S. Patent and Trademark Office on Nov. 10, 2011, now U.S. Pat. No. 8,810,624, issued on Aug. 19, 2014, which claims priority under 35 10 U.S.C. §119(a) to a Korean Patent Application filed in the Korean Intellectual Property
- Office on Nov. 10, 2010 and assigned Serial No. 10-2010-0111791, the entire disclosure of which is incorporated herein by reference.
- 1. Field of the Invention
- The present invention relates generally to a screen configuring apparatus and method, and more particularly, to an apparatus and method for configuring a screen for a video call by selecting an image of an interested user among multiple users.
- 2. Description of the Related Art
- In a video call, a picture of a caller is taken using a camera, displayed on a screen, and images of the persons with whom the caller wants to have a telephone conversation are displayed in a specific location of the screen, for a video call.
- A multipoint video call (or video conference call) technique, which allows a user to have a video call with multiple persons on a mobile terminal, automatically identifies a speaking party by lip movement recognition, and displays an image of the speaker at the center of the screen, making it possible to talk with multiple persons.
- In a multipoint video call apparatus, a display includes a main screen having the largest area on a video call screen, and at least one sub screen.
- However, the conventional multipoint video call technique may malfunction when several users move their lips at the same time.
- Additionally, it is difficult for a user to select an image of another user other than the speaker, when a user has an interest in having a conversation with the other user.
- Accordingly, the present invention has been made to solve the above-mentioned problems occurring in the prior art, and the present invention provides a video call apparatus and method for estimating a facial expression during a video call with multiple users, selecting an image of an interested person, and allowing a user to have a video call with the selected interested person.
- According to one aspect of the present invention, there is provided an apparatus for configuring a screen for a video call using a facial expression which includes a facial expression information calculator for recognizing a face from an image, and calculating facial expression information for an expression of the recognized face; a facial expression determiner for determining whether there is a change in expression of the recognized face by comparing the calculated facial expression information with reference expression information preset to determine a change in expression of the face; a screen configurer for configuring a video call screen including multiple video images received for the video call; and an image selector for selecting a video image corresponding to the changed expression in the video call screen if there is a change in expression. The screen configurer may reconfigure the video call screen using the selected video image.
- According to another aspect of the present invention, there is provided a method for configuring a screen for a video call using a facial expression by configuring a video call screen including multiple video images received for the video call; recognizing a face from an image, and calculating facial expression information for an expression of the recognized face; determining whether there is a change in expression of the recognized face by comparing the calculated facial expression information with reference expression information preset to determine a change in expression of the face; selecting a video image corresponding to the changed expression in the video call screen if there is a change in expression; and reconfiguring the video call screen using the selected video image.
- The above and other aspects, features and advantages of various embodiments of the present invention will be more apparent from the following description taken in conjunction with the accompanying drawings, in which:
-
FIG. 1 is a diagram illustrating a structure of a screen configuring apparatus according to an embodiment of the present invention; -
FIG. 2 is a flowchart illustrating a process of extracting reference expression information used to estimate changes in facial expression in a screen configuring apparatus according to an embodiment of the present invention; -
FIG. 3 is a diagram illustrating images obtained in a process of extracting reference expression information according to an embodiment of the present invention; -
FIG. 4 is a flowchart illustrating a process of reconfiguring a video call screen corresponding to changes in facial expression during a video call in a screen configuring apparatus according to an embodiment of the present invention; and -
FIGS. 5 to 7 are diagrams illustrating images obtained in a process of configuring a video call screen according to an embodiment of the present invention. - Throughout the drawings, the same drawing reference numerals will be used to refer to the same elements, features and structures.
- Various embodiments of the present invention will be described in detail with reference to the accompanying drawings. In the following description, specific details such as detailed configuration and components are merely provided to assist the overall understanding of various embodiments of the present invention. Therefore, it will be apparent to a person having ordinary skill in the art of the present invention that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. In addition, descriptions of well-known functions and constructions are omitted for clarity and conciseness.
-
FIG. 1 is a diagram illustrating a structure of a screen configuring apparatus according to an embodiment of the present invention. - Referring to
FIG. 1 , the screen configuring apparatus includes a facialexpression information calculator 100, a facial expression determiner 110, animage selector 120, and a screen configurer 130. - The facial
expression information calculator 100 calculates facial expression information within a frame of an input image received from a camera during a video call, or of input images received outside the video call. - The facial
expression information calculator 100 presets reference expression information that is used to determine changes in facial expression during a video call from an input image received from the camera before the video call. - The facial
expression information calculator 100 includes aface recognizer 101, afacial feature extractor 102, and aface angle calculator 103. - The
face recognizer 101 uses a general face recognition technique in recognizing a face area in an input image, for example, recognizing an area corresponding to a preset facial skin color in an input image, as a face area. - The
facial feature extractor 102 extracts facial features in the recognized face area. For the extraction of the facial features, a general facial feature extraction technique is used. The facial features as used herein may refer to facial feature components such as eyes, nose, mouth and chin. - The
face angle calculator 103 calculates a reference face angle based on the extracted facial features. Specifically, theface angle calculator 103 draws polygonal sides by connecting the calculated facial features, and calculates an angle of the recognized face based on the drawn polygonal sides. For the calculation of the face angle, a general face angle calculation technique is used. - When a video call begins, the screen configurer 130 configures a video call screen for the video call, using at least one input image received during the video call and a user image received from a camera. The at least one input image is defined as at least one sub image, and the user image received from a camera is defined as a main image.
- That is, the face configurer 130 displays a main image in an area with a preset size on the video call screen, and displays at least one sub image in the remaining area except for the area where the main image is displayed. The screen configurer 130 sets a size of the area where the main image is displayed on the video call screen, to be greater than a size of the area where the at least one sub image is displayed.
- The facial expression determiner 110 determines whether there is a change in facial expression by comparing facial expression information in the main image, calculated by the facial
expression information calculator 100 during a video call, with preset reference expression information. - Specifically, the facial expression determiner 110 determines whether there is a change in face angle by comparing the face angle in the main image calculated by the
face angle calculator 103 with a preset reference face angle. - If there is a change in facial expression information, the
image selector 120 selects a sub image corresponding to the changed facial expression from among multiple sub images located on the video call screen. - That is, if a difference between the calculated face angle in the main image and the preset reference face angle is greater than or equal to a preset value, the
image selector 120 estimates a face direction corresponding to the face angle in the main image, and selects a sub image corresponding to the estimated face direction in a face area of the main image on the video call screen. - If a change in facial expression is continuously recognized for a preset time, the screen configurer 130 reconfigures a video call screen corresponding to the changed facial expression using the sub image selected by the
image selector 120, and displays the reconfigured video call screen. - Specifically, if the face direction estimated by the
image selector 120 is continuously recognized for a preset time, the screen configurer 130 switches between a screen of the main image and a screen of the selected sub image on the video call screen. - As such, the screen configuring apparatus estimates a facial expression of a user and selects an image of an interested person on the video call screen, making it possible for the user to conveniently select an image of the interested person without taking extensive action.
-
FIG. 2 is a diagram illustrating a process of setting reference expression information in a screen configuring apparatus according to an embodiment of the present invention. - Referring to
FIG. 2 , upon receiving an image from a camera instep 200, the facialexpression information calculator 100 recognizes a face in the received image instep 210. As described above, for the recognition of a face in an image, a general face recognition technique is used, and a technique of learning a skin color and recognizing an area corresponding to the learned skin color as a face area may also be used. For example, with reference toFIG. 3 , the facialexpression information calculator 100 recognizes aface area 301 in aninput image 300. - In
step 220, the facialexpression information calculator 100 extracts facial features in the recognized face. As represented byreference numeral 310, the facialexpression information calculator 100 extracts facial features at the locations corresponding to eyes, nose, mouth and chin in the face area. - In
step 230, the facialexpression information calculator 100 calculates a face angle of the recognized face based on the extracted facial features. For example, the facialexpression information calculator 100 calculates aface angle 321 in animage 320 using an area of a polygon by connecting the facial features, and then ends setting the reference expression information. - As such, the screen configuring apparatus may recognize changes in facial expression in an image received during a video call and reconfigure a video call screen corresponding to the changed facial expression.
-
FIG. 4 is a diagram illustrating a process of reconfiguring a video call screen corresponding to changes in facial expression during a video call in a screen configuring apparatus according to an embodiment of the present invention. - According to an embodiment of the present invention, a user image received from a camera is defined as a main image, and at least one input image received from outside is defined as at least one sub image. An embodiment of the present invention will be described with reference to
FIGS. 5 to 7 . -
FIGS. 5 to 7 are diagrams illustrating images obtained in a process of configuring a video call screen according to an embodiment of the present invention. - Referring to
FIG. 4 , if a video call begins instep 400, thescreen con figurer 130 configures and displays a video call screen including a screen of a main image and a screen of at least one sub image instep 401. - The screen configurer 130 displays a main image in an area with a preset size on the video call screen, and displays at least one sub image in the remaining area except for the area where the main image is displayed. The screen configurer 130 sets a size of the area where the main image is displayed on the video call screen, to be greater than a size of the area where the at least one sub image is displayed.
- The displayed video call screen may be as illustrated in
FIG. 5 . - In
step 402, the facialexpression information calculator 100 recognizes a face in the main image, calculates facial features of the recognized face, and calculates a face angle based on the calculated facial features. - Specifically, the
face recognizer 101 recognizes a face area in the main image using the general face recognition technique, for example, by recognizing an area corresponding to a preset facial skin color in an input image, as a face area. - Thereafter, the
facial feature extractor 102 extracts facial features in the recognized face area, and theface angle calculator 103 calculates a reference face angle based on the extracted facial features. - In
step 403, thefacial expression determiner 110 compares the face angle in the main image calculated by the facialexpression information calculator 100, with a preset reference face angle. - In
step 404, thefacial expression determiner 110 determines whether there is a change in face angle. If there is a change in face angle, theimage selector 120 proceeds to step 405. Otherwise, thescreen configurer 130 continuously displays the video call screen instep 401. - In
step 405, theimage selector 120 selects a sub image that is located on the video call screen to correspond to the face angle of the main image. - That is, if a difference between the calculated face angle in the main image and the preset reference face angle is greater than or equal to a preset value, the
image selector 120 estimates a face direction corresponding to the face angle in the main image, and selects a sub image corresponding to the estimated face direction in a face area of the main image on the video call screen. - Referring to
FIGS. 5 and 6 , theimage selector 120 estimates aface direction 502 corresponding to a face angle of amain image 500, and selects asub image 501 corresponding to the estimatedface direction 502 in the face area. - As illustrated in
FIG. 5 , thescreen configurer 130 may further display, on the video call screen, face direction arrow icons for allowing the user to recognize face directions corresponding to face angles. These face direction arrow icons may be displayed to overlap the screen of the main image. - To emphasize that the selected sub image is a selected image, the
screen configurer 130 may display the edge of the selected sub image to be bold, or may display the selected sub image to be greater in size than other sub images. - In
step 406, thescreen configurer 130 determines whether a change in face angle is continuously recognized for a preset time. If the change in face angle is continuously recognized, thescreen configurer 130 proceeds to step 407. Otherwise, thescreen configurer 130 continuously displays the video call screen instep 401. - The reason why the
screen configurer 130 determines whether a change in face angle continues for a preset time is to prevent a wrong sub image from being selected due to the unintended user facial movement. - In
step 407, thescreen configurer 130 reconfigures a video call screen corresponding to the changed facial expression using the sub image selected by theimage selector 120, and displays the reconfigured video call screen. For example, as illustrated inFIG. 6 , thescreen configurer 130 reconfigures the video call screen such that the screen of the main image and the screen of the selected sub image are switched in terms of the location, thereby displaying the selected sub image in the area of the main image. - As illustrated in
FIG. 7 , thescreen configurer 130 may to place the screen of the sub image in the area of the main image by switching between ascreen 700 of the main image and ascreen 701 of the sub image on the video call screen. - In
step 408, thescreen configurer 130 determines if the video call has been completed. If the video call has been completed, thescreen con figurer 130 ends the video call operation. Otherwise, thescreen configurer 130 returns to step 401 and repeats its succeeding steps. - According to embodiments of the present invention, the user's facial expression is estimated and an image of the user's interested person on the video call screen is selected, thereby allowing the user to conveniently select an image of the interested person without taking extensive action. The embodiments of the present invention may set a time margin for selecting an image of an interested person, to prevent a wrong image from being selected due to unintended user facial movement.
- In addition, the embodiments of the present invention provide accurate, intuitive and convenient functions according to the display screen of the video call apparatus, thereby increasing user convenience.
- While the invention has been shown and described with reference to various embodiments thereof, it will be understood by those skilled in the art that various changes in form and detail may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.
Claims (14)
1. An apparatus for configuring a screen for a video call using a facial expression, comprising:
a facial expression information calculator for recognizing a face from an image, and calculating facial expression information for an expression of the recognized face;
a facial expression determiner for determining whether there is a change in expression of the recognized face by comparing the calculated facial expression information with reference expression information preset to determine a change in expression of the face;
a screen configurer for configuring a video call screen including multiple video images received for the video call; and
an image selector for selecting a video image corresponding to the changed expression in the video call screen if there is a change in expression,
wherein the screen configurer reconfigures the video call screen using the selected video image.
2. The apparatus of claim 1 , wherein the video call screen includes a screen of a main image received from a camera and a screen of at least one sub image received outside of the video call.
3. The apparatus of claim 2 , wherein the facial expression information calculator recognizes a face in the main image, extracts facial features of the recognized face, and calculates a face angle of the recognized face based on the extracted facial features.
4. The apparatus of claim 3 , wherein the facial expression information calculator calculates a face angle from an image received before the video call, and sets the calculated face angle as a reference face angle.
5. The apparatus of claim 4 , wherein the facial expression determiner determines whether a difference between the calculated face angle and the reference face angle is greater than or equal to a preset threshold by comparing the calculated face angle with the reference face angle.
6. The apparatus of claim 5 , wherein the image selector estimates a face direction corresponding to the calculated face angle if the difference between the calculated face angle and the reference face angle is greater than or equal to the threshold and selects a sub image corresponding to the estimated face direction in a face area of the main image on the video call screen.
7. The apparatus of claim 6 , wherein the screen configurer reconfigures the video call screen by switching between the screen of the main image and the screen of the selected sub image on the video call screen.
8. A method for configuring a screen for a video call using a facial expression, comprising:
configuring a video call screen including multiple video images received for the video call;
recognizing a face from an image, and calculating facial expression information for an expression of the recognized face;
determining whether there is a change in expression of the recognized face by comparing the calculated facial expression information with reference expression information preset to determine a change in expression of the face;
selecting a video image corresponding to the changed expression in the video call screen if there is a change in expression; and
reconfiguring the video call screen using the selected video image.
9. The method of claim 8 , wherein the video call screen includes a screen of a main image received from a camera and a screen of at least one sub image received outside of the video call.
10. The method of claim 9 , wherein calculating facial expression information further comprises:
recognizing a face in the main image;
extracting facial features of the recognized face; and
calculating a face angle of the recognized face based on the extracted facial features.
11. The method of claim 10 , further comprising calculating a face angle from an image received before the video call, and setting the calculated face angle as a reference face angle.
12. The method of claim 11 , wherein determining whether there is a change in expression of the recognized face further comprises determining whether a difference between the calculated face angle and the reference face angle is greater than or equal to a preset threshold by comparing the calculated face angle with the reference face angle.
13. The method of claim 12 , wherein selecting a video image corresponding to the changed expression comprises:
estimating a face direction corresponding to the calculated face angle if the difference between the calculated face angle and the reference face angle is the threshold value or above; and
selecting a sub image corresponding to the estimated face direction in a face area of the main image on the video call screen.
14. The method of claim 13 , wherein reconfiguring the video call screen comprises switching between the screen of the main image and the screen of the selected sub image on the video call screen.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/463,109 US20140359486A1 (en) | 2010-11-10 | 2014-08-19 | Apparatus and method for configuring screen for video call using facial expression |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020100111791A KR101733246B1 (en) | 2010-11-10 | 2010-11-10 | Apparatus and method for composition of picture for video call using face pose |
KR10-2010-0111791 | 2010-11-10 | ||
US13/293,720 US8810624B2 (en) | 2010-11-10 | 2011-11-10 | Apparatus and method for configuring screen for video call using facial expression |
US14/463,109 US20140359486A1 (en) | 2010-11-10 | 2014-08-19 | Apparatus and method for configuring screen for video call using facial expression |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/293,720 Continuation US8810624B2 (en) | 2010-11-10 | 2011-11-10 | Apparatus and method for configuring screen for video call using facial expression |
Publications (1)
Publication Number | Publication Date |
---|---|
US20140359486A1 true US20140359486A1 (en) | 2014-12-04 |
Family
ID=46019250
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/293,720 Expired - Fee Related US8810624B2 (en) | 2010-11-10 | 2011-11-10 | Apparatus and method for configuring screen for video call using facial expression |
US14/463,109 Abandoned US20140359486A1 (en) | 2010-11-10 | 2014-08-19 | Apparatus and method for configuring screen for video call using facial expression |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/293,720 Expired - Fee Related US8810624B2 (en) | 2010-11-10 | 2011-11-10 | Apparatus and method for configuring screen for video call using facial expression |
Country Status (2)
Country | Link |
---|---|
US (2) | US8810624B2 (en) |
KR (1) | KR101733246B1 (en) |
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10890965B2 (en) * | 2012-08-15 | 2021-01-12 | Ebay Inc. | Display orientation adjustment using facial landmark information |
US9756282B2 (en) * | 2012-11-20 | 2017-09-05 | Sony Corporation | Method and apparatus for processing a video signal for display |
USD772253S1 (en) * | 2013-02-19 | 2016-11-22 | Sony Computer Entertainment Inc. | Display panel or screen with an animated graphical user interface |
KR102169523B1 (en) | 2013-05-31 | 2020-10-23 | 삼성전자 주식회사 | Display apparatus and control method thereof |
US9104907B2 (en) * | 2013-07-17 | 2015-08-11 | Emotient, Inc. | Head-pose invariant recognition of facial expressions |
US9547808B2 (en) * | 2013-07-17 | 2017-01-17 | Emotient, Inc. | Head-pose invariant recognition of facial attributes |
CN103401981B (en) * | 2013-07-25 | 2016-03-30 | 深圳市金立通信设备有限公司 | A kind of method of initiating communication request and mobile terminal |
TWD166921S (en) * | 2013-08-14 | 2015-04-01 | 新力電腦娛樂股份有限公司 | Graphical user interface for a display panel |
USD752079S1 (en) * | 2013-10-15 | 2016-03-22 | Deere & Company | Display screen with graphical user interface |
KR102205498B1 (en) * | 2014-09-18 | 2021-01-20 | 삼성전자주식회사 | Feature extraction method and apparatus from input image |
JP6592940B2 (en) * | 2015-04-07 | 2019-10-23 | ソニー株式会社 | Information processing apparatus, information processing method, and program |
CN107635110A (en) * | 2017-09-30 | 2018-01-26 | 维沃移动通信有限公司 | A kind of video interception method and terminal |
CN108366221A (en) * | 2018-05-16 | 2018-08-03 | 维沃移动通信有限公司 | A kind of video call method and terminal |
CN113342239A (en) * | 2021-05-31 | 2021-09-03 | 锐迪科微电子科技(上海)有限公司 | Region-of-interest determination method and apparatus |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070216773A1 (en) * | 2006-02-01 | 2007-09-20 | Sony Corporation | System, apparatus, method, program and recording medium for processing image |
US20100208078A1 (en) * | 2009-02-17 | 2010-08-19 | Cisco Technology, Inc. | Horizontal gaze estimation for video conferencing |
US20100220172A1 (en) * | 2009-02-27 | 2010-09-02 | Avaya Inc. | Automatic Video Switching for Multimedia Conferencing |
US8289363B2 (en) * | 2006-12-28 | 2012-10-16 | Mark Buckler | Video conferencing |
US8451312B2 (en) * | 2010-01-06 | 2013-05-28 | Apple Inc. | Automatic video stream selection |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4206053B2 (en) | 2004-03-31 | 2009-01-07 | 株式会社国際電気通信基礎技術研究所 | User interface device and user interface program |
CN1735163A (en) | 2004-08-14 | 2006-02-15 | 鸿富锦精密工业(深圳)有限公司 | The system and method for a kind of preview and switching favorite channels |
JP2006285715A (en) | 2005-04-01 | 2006-10-19 | Konica Minolta Holdings Inc | Sight line detection system |
KR100735415B1 (en) | 2005-09-01 | 2007-07-04 | 삼성전자주식회사 | Method for performing video telephone call of multilateral in?wireless terminal |
US8098273B2 (en) * | 2006-12-20 | 2012-01-17 | Cisco Technology, Inc. | Video contact center facial expression analyzer module |
KR101944416B1 (en) * | 2012-07-02 | 2019-01-31 | 삼성전자주식회사 | Method for providing voice recognition service and an electronic device thereof |
-
2010
- 2010-11-10 KR KR1020100111791A patent/KR101733246B1/en active IP Right Grant
-
2011
- 2011-11-10 US US13/293,720 patent/US8810624B2/en not_active Expired - Fee Related
-
2014
- 2014-08-19 US US14/463,109 patent/US20140359486A1/en not_active Abandoned
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070216773A1 (en) * | 2006-02-01 | 2007-09-20 | Sony Corporation | System, apparatus, method, program and recording medium for processing image |
US8289363B2 (en) * | 2006-12-28 | 2012-10-16 | Mark Buckler | Video conferencing |
US20100208078A1 (en) * | 2009-02-17 | 2010-08-19 | Cisco Technology, Inc. | Horizontal gaze estimation for video conferencing |
US20100220172A1 (en) * | 2009-02-27 | 2010-09-02 | Avaya Inc. | Automatic Video Switching for Multimedia Conferencing |
US8451312B2 (en) * | 2010-01-06 | 2013-05-28 | Apple Inc. | Automatic video stream selection |
Also Published As
Publication number | Publication date |
---|---|
US8810624B2 (en) | 2014-08-19 |
US20120113211A1 (en) | 2012-05-10 |
KR20120050346A (en) | 2012-05-18 |
KR101733246B1 (en) | 2017-05-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8810624B2 (en) | Apparatus and method for configuring screen for video call using facial expression | |
US20200059628A1 (en) | Establishing a video conference during a phone call | |
US20100171807A1 (en) | System and associated methodology for multi-layered site video conferencing | |
KR101786944B1 (en) | Speaker displaying method and videophone terminal therefor | |
US11182936B2 (en) | Drawing content processing method and device for terminal apparatus, and terminal apparatus | |
CN114422738A (en) | Compositing and scaling angularly separated sub-scenes | |
CN106101743B (en) | Panoramic video recognition methods and device | |
US20160239098A1 (en) | Gesture controlled communication | |
US20110022389A1 (en) | Apparatus and method for improving performance of voice recognition in a portable terminal | |
CN105631804B (en) | Image processing method and device | |
WO2021190428A1 (en) | Image capturing method and electronic device | |
EP3118847B1 (en) | Image displaying method, image displaying device, computer program and recording medium | |
CN113194254A (en) | Image shooting method and device, electronic equipment and storage medium | |
EP3771203A1 (en) | Electronic nameplate display method and apparatus in video conference | |
JP2006215116A (en) | Display apparatus with imaging function, display control method for the same, display control program and recording medium with the program recorded thereon | |
US20130094593A1 (en) | Method for adjusting video image compression using gesture | |
US9582179B2 (en) | Apparatus and method for editing image in portable terminal | |
WO2018133305A1 (en) | Method and device for image processing | |
KR20100041061A (en) | Video telephony method magnifying the speaker's face and terminal using thereof | |
He et al. | Real-time whiteboard capture and processing using a video camera for teleconferencing | |
CN109427036B (en) | Skin color treatment method and device | |
AU2015201127A1 (en) | Establishing a video conference during a phone call | |
CN116033110A (en) | Video conference speaker display method, device, equipment and storage medium | |
WO2023078103A1 (en) | Multi-mode face driving method and apparatus, electronic device, and storage medium | |
KR100210398B1 (en) | Method of recognizing talkers in a video conferencing system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |