WO2016182504A1 - A virtual reality headset - Google Patents

A virtual reality headset

Info

Publication number
WO2016182504A1
Authority
WO
WIPO (PCT)
Prior art keywords
user
virtual reality
reality headset
camera
image
Prior art date
Application number
PCT/SG2015/050101
Other languages
French (fr)
Inventor
Bryan Shwo-kang CHOW
Original Assignee
Chow Bryan Shwo-Kang
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chow Bryan Shwo-Kang
Priority to PCT/SG2015/050101
Publication of WO2016182504A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F 3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F 3/011 Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
    • G PHYSICS
    • G02 OPTICS
    • G02B OPTICAL ELEMENTS, SYSTEMS OR APPARATUS
    • G02B 27/00 Optical systems or apparatus not provided for by any of the groups G02B1/00 - G02B26/00, G02B30/00
    • G02B 27/01 Head-up displays
    • G02B 27/017 Head mounted
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 1/00 Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F 1/16 Constructional details or arrangements
    • G06F 1/1613 Constructional details or arrangements for portable computers
    • G06F 1/163 Wearable computers, e.g. on a belt
    • G PHYSICS
    • G02 OPTICS
    • G02B OPTICAL ELEMENTS, SYSTEMS OR APPARATUS
    • G02B 27/00 Optical systems or apparatus not provided for by any of the groups G02B1/00 - G02B26/00, G02B30/00
    • G02B 27/01 Head-up displays
    • G02B 27/0101 Head-up displays characterised by optical features
    • G02B 2027/0138 Head-up displays characterised by optical features comprising image capture systems, e.g. camera
    • G PHYSICS
    • G02 OPTICS
    • G02B OPTICAL ELEMENTS, SYSTEMS OR APPARATUS
    • G02B 27/00 Optical systems or apparatus not provided for by any of the groups G02B1/00 - G02B26/00, G02B30/00
    • G02B 27/01 Head-up displays
    • G02B 27/0101 Head-up displays characterised by optical features
    • G02B 2027/014 Head-up displays characterised by optical features comprising information/image processing systems
    • G PHYSICS
    • G02 OPTICS
    • G02B OPTICAL ELEMENTS, SYSTEMS OR APPARATUS
    • G02B 27/00 Optical systems or apparatus not provided for by any of the groups G02B1/00 - G02B26/00, G02B30/00
    • G02B 27/01 Head-up displays
    • G02B 27/0179 Display position adjusting means not related to the information to be displayed
    • G02B 2027/0187 Display position adjusting means not related to the information to be displayed slaved to motion of at least a part of the body of the user, e.g. head, eye


Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Computer Hardware Design (AREA)
  • Optics & Photonics (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

A Virtual Reality headset, the Virtual Reality headset comprising: a frame wearable by a user; one or more displays disposed in the frame and facing eyes of the user; a camera for capturing at least one image of a facial expression including at least part of a mouth of the user, the camera being disposed at a distance away from the user for capturing the at least one image of the facial expression of the user; a communication interface for transmitting the captured at least one image to a processor configured for generating an image displayable on the one or more displays or another one or more displays based on the at least one image captured by the camera.

Description

A Virtual Reality Headset
Field
The present invention relates to a Virtual Reality headset, in particular, one capable of capturing facial expressions.
Background
An existing Virtual Reality headset may include a camera, but the camera only captures images of the surroundings of a user or images of the eyes of a user. Furthermore, such images are insufficient for a good representation of the physical state or emotion of the user in a virtual environment accessible to one or more Virtual Reality headsets.
Brief Description of the Drawings
Various embodiments of the present disclosure will now be described, by way of example only, and with reference to the accompanying drawings, in which:
Figure 1 illustrates a Virtual Reality headset according to an example of the present disclosure.
Figure 2 illustrates another Virtual Reality headset according to an example of the present disclosure.
Figure 3 illustrates architecture of a processor according to an example of the present disclosure.
Summary
According to one aspect of an example of the present invention, there is provided a Virtual Reality headset, the Virtual Reality headset comprising: a frame wearable by a user; one or more displays disposed in the frame and facing eyes of the user; a camera for capturing at least one image of a facial expression including at least part of a mouth of the user, the camera being disposed at a distance away from the user for capturing the at least one image of the facial expression of the user; and a communication interface for transmitting the captured at least one image to a processor configured for generating an image displayable on the one or more displays or another one or more displays based on the at least one image captured by the camera.
Detailed Description
Figure 1 shows an example Virtual Reality (VR) headset 100 of the present disclosure. The Virtual Reality headset 100 has a frame 102 wearable by a user (the user is not shown in Figure 1), one or more displays 128 (not visible and covered by the frame 102 in Figure 1) disposed in the frame 102 and facing the eyes of the user, and a camera 104 for capturing at least one image of a facial expression including at least part of a mouth of the user. The camera 104 is disposed at a distance 112 away from the user for capturing the at least one image of the facial expression of the user. The Virtual Reality headset 100 has a communication interface 116, in this case in the form of a data transfer wire, for transmitting the captured at least one image to a processor 130 configured for generating an image displayable on the one or more displays 128 or another one or more displays (not shown in Figure 1; also known herein as "the other one or more displays") based on the at least one image captured by the camera 104. As the head of the user will contact the frame 102, in this example, the distance 112 is measured from the camera 104 to an axis 120 orthogonal to an edge 122 of the frame 102.
For example, the camera 104 may be an optical camera of the type typically used in a smartphone, mobile device or digital camera, capable of capturing at least one image of the facial expression including at least part of the mouth of the user at a relatively close distance from the user. The at least one image may be one photograph, a series of photographs or a real-time video feed. In the present disclosure, a video or video feed is understood to be a plurality of images. The processor 130 may be a central processing unit of a computer or a gaming device. As an example, the camera 104 may include an image sensor based on semiconductor charge-coupled device (CCD), active pixel sensor in complementary metal-oxide-semiconductor (CMOS), N-type metal-oxide-semiconductor (NMOS, Live MOS) technologies and the like.
In the present example, the frame 102 includes a head strap 114 for strapping the Virtual Reality headset 100 to the head of the user and an eye goggle portion 118 for covering both eyes of the user. Furthermore, in the present example, the one or more displays 128 in the frame 102 refer specifically to two displays 128 for enabling a stereoscopic view for 3-dimensional images. The two displays 128 face the eyes of the user looking into the eye goggle portion 118 of the frame 102. More specifically, in this case, each of the two displays 128 faces one eye of the user. The other one or more displays (not shown in Figure 1) mentioned earlier may be one or more displays located in another Virtual Reality headset, or one or more displays of or connected to a network-connected device such as a laptop, a desktop computer, a smartphone, a tablet, and the like.
In the present example, the processor 130 is configured for processing, in real time, data of the at least one captured image to map the at least one captured image to a model of the user in a virtual environment displayable in the image shown on the one or more displays 128 or the other one or more displays. For instance, one way to realise this is to carry out texture mapping of the at least one captured image onto the model of the user in the virtual environment, as in the sketch below. It is appreciated that other methods to map and display the at least one captured image in the virtual environment can also be employed. The model of the user in the virtual environment can be a two-dimensional (2D) model or a three-dimensional (3D) model. For instance, the video or image of the user can be overlaid on a 3D model of the head and face of the user, or a representative avatar, in real time. More specifically, the at least one image of the captured facial expression including at least part of a mouth of the user can be mapped to a face portion of an avatar representative of the user in the virtual environment, wherein the location of the face portion corresponds to the location of the captured facial expression.
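By way of illustration only, the following is a minimal sketch of such a texture-mapping step, assuming OpenCV and NumPy; the file names and the destination quad coordinates are hypothetical placeholders, not values taken from the disclosure.

```python
import cv2
import numpy as np

def overlay_mouth(avatar_texture, mouth_frame, dst_quad):
    """Warp the captured mouth frame onto the quad of the avatar's face
    texture that corresponds to the mouth region (hypothetical
    coordinates supplied by the caller)."""
    h, w = avatar_texture.shape[:2]
    fh, fw = mouth_frame.shape[:2]
    # Source quad: the full captured frame; corner order TL, TR, BR, BL.
    src_quad = np.float32([[0, 0], [fw, 0], [fw, fh], [0, fh]])
    m = cv2.getPerspectiveTransform(src_quad, np.float32(dst_quad))
    warped = cv2.warpPerspective(mouth_frame, m, (w, h))
    # Composite only the destination region onto the texture.
    mask = np.zeros((h, w), dtype=np.uint8)
    cv2.fillConvexPoly(mask, np.int32(dst_quad), 255)
    out = avatar_texture.copy()
    out[mask > 0] = warped[mask > 0]
    return out

# Hypothetical usage with placeholder assets and mouth-region corners.
avatar = cv2.imread("avatar_face.png")
frame = cv2.imread("captured_mouth.png")
result = overlay_mouth(avatar, frame,
                       [[200, 300], [320, 300], [320, 380], [200, 380]])
```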
The image generated by the processor 130 based on the at least one image captured by the camera 104 will include the avatar representative of the user, or a 2D or 3D model of the user in the virtual environment. In some instances, the image generated by the processor 130 will be displayed on the other one or more displays instead of the one or more displays 128 to show the avatar representative of the user, or a 2D or 3D model of the user in the virtual environment to another user viewing the other one or more displays. However, in other instances, the avatar representative of the user, or a 2D or 3D model of the user in the virtual environment may be displayed in the one or more displays 128 to allow the user to view his/her avatar representation, or 2D or 3D model representation. It is appreciated that both the one or more displays 128 and the other one or more displays can concurrently show the avatar representative of the user, or a 2D or 3D model of the user in the virtual environment.
It is appreciated that animation may be applied to the generated image including the avatar, or the 2D or 3D model of the user in some instances. In other instances, the at least one image captured can be mapped without processing or with minimal processing (e.g. only correction or compensation of optical camera distortion or defects) to the face portion of the avatar, or the 2D or 3D model of the user so as to show actual image or footage of the facial expression of the user.
Including at least part of the mouth of the user provides a clearer indication of a physical state or an emotional state of the user, as opposed to capturing the facial expression without the mouth of the user. Capturing at least part of the mouth also provides mouth movement information when the user is speaking, and this can also be mapped into the virtual environment, onto an avatar representative of the user, or onto a 2D or 3D model of the user in the virtual environment. For example, the at least part of the mouth of the user can include 20%-100% of the mouth and may or may not include at least an edge of the mouth. For a better indication of a physical state or an emotional state of the user, more of the mouth and/or at least an edge of the mouth should be captured. In the example of Figure 1, the camera 104 is at a location directed to capture the mouth entirely.
The virtual environment, including the avatar, or 2D or 3D model of the user therein, may be generated by the processor 130 for user interaction in a social media application. The social media application may be a computer-mediated tool that allows people to create, share or exchange information, ideas, and pictures/videos in virtual communities and networks. The social media application may include a group of Internet- or network-based applications that allow creation and exchange of user-generated content. Furthermore, the social media application may depend on mobile and web-based technologies to create highly interactive platforms through which individuals and communities share, co-create, discuss, and modify user-generated content.
The virtual environment, including the avatar, or 2D or 3D model of the user therein, may also be generated by the processor 130 for user interaction in a game application. The game application may support multi-player modes during game play for displaying an avatar, or a 2D or 3D model of other users logged in through a server to play the game provided by the game application. Depending on the game type or in the case of a single player mode, it is possible that the virtual environment, including the avatar, or a 2D or 3D model of the user is displayed on the one or more displays 128 viewable by the user.
Furthermore, the processor 130 may be configured for compensating distortion of the at least one captured image when generating the image displayable on the one or more displays 128 or the other one or more displays. For instance, the use of a wide-angle lens may result in perspective distortion, in the form of a warping or transformation of an object (in this case, the captured facial expression of the user) and its surrounding area that differs significantly from what the object would look like with a normal focal length. Such distortion correction can be accomplished via available image processing techniques.
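As a hedged illustration of such distortion compensation, the sketch below uses OpenCV's undistort routine; the camera matrix and distortion coefficients are illustrative stand-ins that would, in practice, come from a one-off camera calibration.

```python
import cv2
import numpy as np

# Hypothetical intrinsics for a small wide-angle camera module; real
# values come from calibration (e.g. cv2.calibrateCamera on a checkerboard).
K = np.array([[600.0, 0.0, 320.0],
              [0.0, 600.0, 240.0],
              [0.0, 0.0, 1.0]])
# Coefficients k1, k2, p1, p2, k3; a strong negative k1 models the
# barrel distortion typical of wide-angle lenses.
dist = np.array([-0.35, 0.12, 0.0, 0.0, 0.0])

def compensate(frame):
    # Remap each pixel to where it would fall with an ideal lens.
    return cv2.undistort(frame, K, dist)
```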
In the present example, the camera 104 is mounted to the frame 102. The frame 102 includes an elongate member 110 extending away from a face of the user, and the camera 104 is mounted on the elongate member 110. The elongate member 110 includes a connection 108 at one end mounted to the frame 102, and another end holding the camera 104 and directing the camera 104 in a direction for capturing the facial expression of the user. The connection 108 may be a fixed connection on the frame 102 or made detachable from the frame 102. The elongate member 110 is moveable to adjust the position of the camera 104 with respect to the face of the user. The elongate member 110 can be made of a bendable material that maintains its shape after bending to enable the position adjustment of the camera 104. This enables the distance 112 of the camera 104 from the face of the user to be adjusted.
The Virtual Reality headset 100 comprises a microphone 124 (an optional component). In the present example, the microphone 124 is located adjacent to the camera 104 for capturing sound made by the user. The processor 130 can be configured to synchronize the sound with the mouth movements captured by the camera 104 and displayed in the virtual environment.
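One plausible way to realise this synchronization, sketched below under the assumption that each frame and audio chunk is timestamped at capture, is to pair each video frame with the audio chunk captured closest in time; the names and the 40 ms tolerance are illustrative, not prescribed by the disclosure.

```python
import time
from collections import deque

audio_buffer = deque()  # (capture_time, pcm_chunk) pairs, oldest first

def on_audio_chunk(pcm_chunk):
    # Called by the audio capture thread; stamp the chunk on arrival.
    audio_buffer.append((time.monotonic(), pcm_chunk))

def audio_for_frame(frame_time, tolerance=0.040):
    """Return the audio chunk captured closest to the given frame time,
    within +/- 40 ms (roughly where lip-sync error becomes noticeable)."""
    # Drop chunks already too old to match this or any later frame.
    while audio_buffer and audio_buffer[0][0] < frame_time - tolerance:
        audio_buffer.popleft()
    best, best_dt = None, tolerance
    for t, chunk in audio_buffer:
        dt = abs(t - frame_time)
        if dt <= best_dt:
            best, best_dt = chunk, dt
    return best
```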
The distance 112 may be set at a distance of 1 cm to 10 cm (e.g. 5 cm) away from the face of the user. This range has been observed to be desirable for capturing the facial expression of the user without having the camera 104 located too close to or too far from the user, which may compromise camera image capture or the aesthetics of the Virtual Reality headset 100, or make the Virtual Reality headset 100 inconvenient to handle as the camera 104 can be an obstruction. In the present example, the user can advantageously adjust the elongate member 110 to set the distance 112 of the camera 104 from the face of the user between 0 cm (in contact with the user) and as far as the elongate member 110 can bring the camera 104 from the face of the user. However, it is noted that the minimum distance 112 that still allows the camera 104 to properly capture the facial expression of the user is a function of the specification of the camera, including the wideness of the wide-angle capture of the camera used. It is appreciated that this function can be derived through experimentation. With regard to the distance range of 1 cm to 10 cm, 1 cm is a probable minimum distance that will likely not affect camera image capture (sufficient detail of the facial expression of the user should be captured) performed using optical lenses typically used in smartphones, mobile devices and digital cameras. It is noted that 0 cm (i.e. the camera contacting the user's face) will impede mouth movements and cover an image sensor in the camera that is directed to capture images of the mouth of the user. 10 cm is a comfortable distance that avoids the camera protruding too far from the user's face, which may affect the aesthetics of the Virtual Reality headset and cause obstruction. It is appreciated that distances beyond 10 cm are possible as well, as long as the facial expression including at least a part of the mouth is captured with sufficient detail for processing and displaying in the virtual environment.
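The dependence of the minimum distance on the wideness of the camera's capture can be illustrated with simple geometry: a region of width w just fills the horizontal view at distance d when w = 2 d tan(fov/2). The sketch below works this through for an assumed ~6 cm mouth region and two illustrative lens fields of view; the figures are back-of-envelope estimates, not measured values.

```python
import math

def min_distance_cm(subject_width_cm, fov_deg):
    """Distance at which a subject of the given width just fills the
    horizontal view: width = 2 * d * tan(fov / 2), solved for d."""
    return subject_width_cm / (2.0 * math.tan(math.radians(fov_deg) / 2.0))

# For an assumed ~6 cm mouth region: a typical ~70 degree smartphone-class
# lens needs roughly 4.3 cm, while a ~120 degree wide-angle lens brings
# the workable minimum down to about 1.7 cm, i.e. cm-scale distances
# become feasible as the capture angle widens.
print(min_distance_cm(6.0, 70))    # ~4.28
print(min_distance_cm(6.0, 120))   # ~1.73
```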
The communication interface 116 may include a power supply line to draw electrical power from the processor 130 to operate the Virtual Reality headset 100. The processor 130 can in turn obtain electrical power from mains electricity or from a portable power supply. The Virtual Reality headset 100 requires electrical power to run the one or more displays 128, the camera 104 and/or the microphone 124. It is appreciated that in another example, the Virtual Reality headset 100 can be configured to draw electrical power from a portable power supply or from both the portable power supply and through the power supply line. The portable power supply may be incorporated in, for instance, the frame 102 of the Virtual Reality headset 100. In this case, the processor 130 may draw power from the portable power supply.
Figure 2 shows another example of a Virtual Reality headset 200 similar (i.e. containing the same features) to the Virtual Reality headset 100 in Figure 1 except for one key difference. The key difference is that instead of having the camera 104 and the elongate member 110 of the Virtual Reality headset 100, there is in the Virtual Reality headset 200 a camera 204 mounted to a frame 202 at a portion along a line of symmetry 206 of a face of a user (not shown in Figure 2) and the camera 204 is directed in a direction for capturing the facial expression of the user (including at least part of the mouth of the user). The camera 204 can be permanently mounted to the frame 202 or made to be detachable from the frame 202.
In this case, a distance 212, which is the same as the distance 112 in Figure 1, may be set at a fixed distance of about 1 cm to 10 cm (e.g. 5 cm) away from the face of the user. The minimum of the distance 212 is similarly governed by the function described with reference to the Virtual Reality headset 100 in Figure 1.
Furthermore, similarly, the Virtual Reality headset 200 has a communication interface (not visible in Figure 2; it operates like the communication interface 116 in Figure 1) for transmitting the captured at least one image of the facial expression of the user to a processor 208 (which operates like the processor 130 in Figure 1) configured for processing an image displayable on one or more displays 228 (not visible in Figure 2 and covered by the frame 202 of the Virtual Reality headset 200) or another one or more displays (not shown in Figure 2; similar to the other one or more displays discussed with reference to Figure 1) based on the at least one image captured by the camera 204. The one or more displays 228 are similar to the one or more displays 128 in Figure 1 and are located in an eye goggle portion 218 of the frame 202. In the example of Figure 2, the camera 204 is directed to capture the mouth entirely. It is appreciated that in another example, the camera 204 may be directed to capture only part of the mouth.
The Virtual Reality headset 200 may also comprise a microphone 224 (similar to microphone 124 in Figure 1). In the present example, the microphone 224 is located adjacent to the camera 204 for capturing sound made by the user. Similarly, the processor 208 can be configured to synchronize the sound with mouth movements captured by the camera 204 and displayed in a virtual environment.
Another difference between the Virtual Reality headset 100 in Figure 1 and the Virtual Reality headset 200 is that the Virtual Reality headset 200 has a processor 208 (not visible in Figure 2; indicated by broken lines for illustration) mounted in or integrated with the electronics of the Virtual Reality headset 200. In this case, more specifically, the processor 208 resides in the eye goggle portion 218 of the frame 202. This is a different location from the processor 130 in Figure 1, which resides outside the Virtual Reality headset 100. The communication interface (not shown in Figure 2) of the Virtual Reality headset 200 can similarly be a data transfer wire connecting the processor 208 to the camera 204. It is appreciated that the Virtual Reality headset 100 in Figure 1 can also relocate the processor 130 to the same location as the processor 208, and the Virtual Reality headset 200 can have the processor 208 located in the same location as the processor 130 in Figure 1. The processing performed by the processor 208 is the same as the processing performed by the processor 130. A further difference between the Virtual Reality headset 100 in Figure 1 and the Virtual Reality headset 200 is that the Virtual Reality headset 200 has a portable power supply (not shown in Figure 2; it can be located anywhere in, for instance, the frame 202 of the Virtual Reality headset 200 and may be connected to the processor 208) incorporated into the Virtual Reality headset 200 for operating the Virtual Reality headset 200. The Virtual Reality headset 200 requires electrical power to run the one or more displays 228, the camera 204, the processor 208, and/or the microphone 224. It is appreciated that in another example, where the processor 208 is at the same location as the processor 130 in Figure 1, the Virtual Reality headset 200 can be configured to draw electrical power from the processor 208 through a power supply line like the Virtual Reality headset 100 in Figure 1. In yet another example, the Virtual Reality headset 200 can be configured to draw electrical power from both the portable power supply and through the power supply line. In a further example, the processor 208 can in turn obtain electrical power from mains electricity or from a portable power supply.
In another example, there can be a Virtual Reality headset having both the camera 104 in Figure 1 and the camera 204 in Figure 2 as described, to ensure better image capture. The Virtual Reality headset may also have both the processor 130 in Figure 1 and the processor 208 in Figure 2 as described. In this case, the processor 130 in Figure 1 and the processor 208 in Figure 2 may run different software or share the processing load.
It is appreciated that the respective communication interfaces of the Virtual Reality headsets 100 in Figure 1 and 200 in Figure 2 may be a wireless data transmitter instead of a data transfer wire for transferring data wirelessly between the camera 104 and the processor 130 or the camera 204 and the processor 208 respectively. The wireless data communication may be based on proprietary wireless data communication standards, Bluetooth, WiFi, Near Field Communication (NFC), Digital Living Network Alliance (DLNA) wireless technology, and the like.
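Whether the link is a data transfer wire or one of the wireless transports above, the camera-to-processor interface reduces to moving framed image payloads. The following is a minimal sketch of one such scheme over a plain TCP socket, assuming OpenCV for JPEG compression; the length-prefixed framing is an illustrative choice, not part of the disclosure.

```python
import socket
import struct

import cv2
import numpy as np

def recv_exact(sock, n):
    # Read exactly n bytes, since TCP may deliver data in pieces.
    buf = b""
    while len(buf) < n:
        chunk = sock.recv(n - len(buf))
        if not chunk:
            raise ConnectionError("socket closed mid-frame")
        buf += chunk
    return buf

def send_frame(sock, frame):
    # JPEG-compress to keep per-frame payloads small on the link.
    ok, jpeg = cv2.imencode(".jpg", frame, [cv2.IMWRITE_JPEG_QUALITY, 80])
    if not ok:
        return
    payload = jpeg.tobytes()
    # Length-prefixed framing so the receiver knows where a frame ends.
    sock.sendall(struct.pack(">I", len(payload)) + payload)

def recv_frame(sock):
    (length,) = struct.unpack(">I", recv_exact(sock, 4))
    data = recv_exact(sock, length)
    return cv2.imdecode(np.frombuffer(data, dtype=np.uint8), cv2.IMREAD_COLOR)
```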
It is appreciated that in another example incorporating all or some of the features of the Virtual Reality headsets 100 and 200 described with reference to Figures 1 and 2 respectively, there can be a modification to have one camera for capturing the at least one image from a first angle and one or more cameras (not shown in the Figures) for capturing at least one other image of the facial expression of the user from a second angle different from the first angle. This is for capturing stereoscopic views of the facial expressions of the user. The stereoscopic views can provide input to the respective processors 130 and 208 of the Virtual Reality headsets 100 and 200 for generating an image or video containing 3D content displayable on one or more displays (this can be the one or more displays 128 or 228 of the respective Virtual Reality headsets 100 or 200, or the respective other one or more displays); a sketch of deriving depth from such a pair of views is given after this passage. A user may view the 3D content displayed on the one or more displays 128 or 228 of the respective Virtual Reality headsets 100 or 200, or on the respective other one or more displays, through 3D lenses. Such 3D lenses may be provided in the eye goggle portion 118 in Figure 1 or 218 in Figure 2 of the respective Virtual Reality headset 100 or 200 to enable the user to view 3D content displayed on the one or more displays 128 or 228 of the respective Virtual Reality headset 100 or 200.
In other examples of the present disclosure, there may be included one or more cameras that include any type of image sensor for capturing a facial expression of a user. The one or more cameras are mounted on or integrated into, and form part of, a Virtual Reality (VR) headset for capturing facial expressions including mouth movements of the user. A video, including audio captured through a microphone mounted to the VR headset, is transmitted, preferably in real time, to a processor for processing, for instance, to render the facial expressions including mouth movements of the user onto an avatar, or a 2D or 3D model, of the user in a virtual environment. The virtual environment including the avatar, or 2D or 3D model, of the user is generated by the processor as a video that is transmitted across a network (including the Internet or an intranet) for displaying on one or more displays of the remote VR headset of another user.
Examples of the present disclosure may have one or more of the following advantages. Existing Virtual Reality (VR) headsets are targeted to provide "singular" experiences for a single participant. For a virtual environment (or VR environment) with multiple participants, an example VR headset of the present disclosure can advantageously convey facial expressions of each of the multiple participants by having a camera capture the facial expressions of each of the multiple participants when needed.
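For the stereoscopic-capture variant described above, the sketch referenced in that passage follows: a minimal example of deriving a disparity (depth) map from a rectified left/right pair with OpenCV's block matcher. The matcher parameters are illustrative defaults, not tuned values.

```python
import cv2

def disparity_map(left_bgr, right_bgr):
    """Compute a disparity map from a rectified left/right pair; larger
    disparity means the surface (e.g. the mouth) is closer to the cameras."""
    left = cv2.cvtColor(left_bgr, cv2.COLOR_BGR2GRAY)
    right = cv2.cvtColor(right_bgr, cv2.COLOR_BGR2GRAY)
    # numDisparities must be a multiple of 16; blockSize must be odd.
    matcher = cv2.StereoBM_create(numDisparities=64, blockSize=15)
    return matcher.compute(left, right)  # fixed-point (disparity * 16)
```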
Although mouth movements can be simulated based on audio data, more complicated expressions such as a smile or frown, and actions such as a kiss, cannot be detected and reconstructed based on existing head tracking and audio data provided by an existing VR headset. However, an example VR headset of the present disclosure can capture images of the facial expression of the user and then process the images for reconstruction and/or overlaying in a virtual environment. This can provide a better representation of the facial expressions.
Furthermore, a fixed camera (such as one mounted on a desk, or on a monitor or display of a device) cannot reliably capture facial expressions and can also be obstructed by the headset itself. An example VR headset of the present disclosure can advantageously provide a camera mounted to an adjustable elongate member for moving the position of the camera, or a camera positioned at a fixed location along a line of symmetry of the face of the user, suitable for capturing at least an image of a facial expression of the user.
It has been discovered that most facial expressions are conveyed in the lower half of the face of a user, below the eyes. In an example VR headset of the present disclosure, advantageously, a camera is mounted on the headset directed toward the mouth area of the user to capture at least one image, which may be a video feed, of the facial expression of the user. The at least one image or video feed can then be processed by a processor and overlaid on a 3D model of the user in a virtual environment that is displayed to the user via one or more displays of the VR headset and/or remotely via one or more displays accessible to one or more participants in the virtual environment.
Advantageously, the Virtual Reality headset can capture at least one image of a facial expression, including at least a part of the mouth of a user, from a distance away from the user's face, and direct data of the at least one image to a processor for processing to enable the at least one image to be overlaid in a virtual environment for purposes of virtual reality telepresence.
Figure 3 shows in more detail an example of the processor 130 in Figure 1 and the processor 208 in Figure 2. The processor 130 and the processor 208 may each comprise a processing unit 302 for processing software, including one or more computer programs for running one or more computer applications, to enable processing, in real-time, of data of at least one image captured so as to map the at least one image captured to a model of the user in a virtual environment displayable in the image displayable on the one or more displays (128 in Figure 1 or 228 in Figure 2) or the other one or more displays as described earlier. Other processing enabled includes: compensating distortion of the at least one image captured for generating the image displayable on the one or more displays (128 in Figure 1 or 228 in Figure 2) or the other one or more displays as described earlier; generating the virtual environment including the avatar, or the 2D or 3D model, of the user as described earlier; generating the virtual environment for user interaction in a game application; and/or generating the virtual environment for user interaction in a social media application. The processor 130 in Figure 1 and the processor 208 in Figure 2 may reside in a gaming machine, a mobile device, a tablet device, a desktop or portable computer, and the like.
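As one hedged sketch of the distortion-compensation step mentioned above: a wide-angle lens placed centimetres from the face introduces lens distortion that can be undone with a standard camera model. The intrinsic matrix and distortion coefficients below are placeholder values, not figures from the disclosure; in practice they would come from a one-time calibration of the headset-mounted camera (e.g. cv2.calibrateCamera with a checkerboard target).

```python
import cv2
import numpy as np

# Placeholder intrinsics for a hypothetical headset-mounted camera;
# real values would be obtained by calibrating the actual lens.
K = np.array([[600.0,   0.0, 320.0],
              [  0.0, 600.0, 240.0],
              [  0.0,   0.0,   1.0]])
dist_coeffs = np.array([-0.25, 0.08, 0.0, 0.0, 0.0])  # k1, k2, p1, p2, k3

def compensate_distortion(frame):
    """Undo lens distortion so the frame maps cleanly onto the user model."""
    return cv2.undistort(frame, K, dist_coeffs)
```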
Furthermore, the processing unit 302 may include user input modules such as a computer mouse 336 and/or a keyboard/keypad 304, and output devices such as a display 308. The display 308 may be a touch screen capable of receiving user input as well. It is appreciated that in another example, another display (not shown) like the display 308 may be included. The processing unit 302 may be connected to a computer network 312 via a suitable transceiver device 314 (i.e. a network interface) to enable access to, e.g., the Internet or other network systems such as a wired Local Area Network (LAN) or a Wide Area Network (WAN). Optionally, the processing unit 302 may be connected to one or more external wireless-communication-enabled devices 334 via a suitable wireless transceiver device 332, e.g. a WiFi transceiver, a Bluetooth module, or a mobile telecommunication transceiver suitable for Global System for Mobile Communications (GSM), 3G, 3.5G or 4G telecommunication systems, and the like. Through the computer network 312, the processing unit 302 can gain access to one or more storages, i.e. data storages, databases, data servers and the like, connectable to the computer network 312 to retrieve and/or store data in the one or more storages. The processing unit 302 may include a processor integrated circuit 318, a Random Access Memory (RAM) 320 and a Read Only Memory (ROM) 322. The processing unit 302 may also include a number of Input/Output (I/O) interfaces, for example an I/O interface 338 to the computer mouse 336, a memory card slot 316, an I/O interface 324 to the display 308, and an I/O interface 326 to the keyboard/keypad 304.
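For the network path described above (from the processing unit to remote displays via the transceiver 314 or 332), one simple illustrative framing is to JPEG-encode each captured frame and send it length-prefixed over TCP. The endpoint, port and framing below are assumptions of this sketch; the disclosure requires only a communication interface and prescribes no particular protocol.

```python
import socket
import struct
import cv2

def send_frame(sock, frame, jpeg_quality=80):
    """JPEG-encode a frame and send it length-prefixed over a TCP socket."""
    ok, buf = cv2.imencode(".jpg", frame,
                           [cv2.IMWRITE_JPEG_QUALITY, jpeg_quality])
    if not ok:
        return
    payload = buf.tobytes()
    # A 4-byte big-endian length prefix lets the receiver reassemble frames.
    sock.sendall(struct.pack("!I", len(payload)) + payload)

# Hypothetical endpoint for the remote processor/display host.
sock = socket.create_connection(("render-host.example", 9000))
```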
The components of the processing unit 302 typically communicate via an interconnected bus 328 and in a manner known to the person skilled in the relevant art.
The computer programs may further include one or more software applications for, e.g., image processing, an instant messaging platform, audio/video playback, Internet accessibility, operating the processor 130 and the processor 208 (i.e. an operating system), network security, file accessibility and database management, which are applications typically found on a desktop computer, electronic apparatus, mobile device or portable computer.
The computer programs may be supplied to the user of the processor 130 and the processor 208 encoded on a data storage medium such as a CD-ROM, a flash memory carrier or a Hard Disk Drive, and are to be read using a corresponding data storage medium drive of a data storage device 330. Such computer or application programs may also be downloaded from the computer network 312. The application programs are read and controlled in their execution by the processor integrated circuit 318. Intermediate storage of program data may be accomplished using the RAM 320.
In more detail, one or more of the computer or application programs may be stored on any non-transitory machine- or computer-readable medium. The machine- or computer-readable medium may include storage devices such as magnetic or optical disks, memory chips, or other storage devices suitable for interfacing with a general-purpose computer. The machine- or computer-readable medium may also include a hard-wired medium such as that exemplified in the Internet system, or a wireless medium such as that exemplified in the Wireless LAN (WLAN) system, and the like. The computer program, when loaded and executed on such a general-purpose computer, effectively results in an apparatus that implements the steps of the computing methods in the examples described herein.
Many modifications and other examples can be made to the Virtual Reality headset by those skilled in the art having an understanding of the above-described disclosure together with the drawings. Therefore, it is to be understood that the Virtual Reality headset is not to be limited to the above description contained herein, and that possible modifications are to be included in the claims of the disclosure.

Claims
1. A Virtual Reality headset, the Virtual Reality headset comprising:
a frame wearable by a user;
one or more displays disposed in the frame and facing eyes of the user;
a camera for capturing at least one image of a facial expression including at least part of a mouth of the user, the camera being disposed at a distance away from the user for capturing the at least one image of the facial expression of the user; and
a communication interface for transmitting the captured at least one image to a processor configured for generating an image displayable on the one or more displays or another one or more displays based on the at least one image captured by the camera.
2. The Virtual Reality headset as claimed in claim 1, wherein the processor is configured for processing, in real-time, data of the at least one image captured to map the at least one image captured to a model of the user in a virtual environment displayable in the image displayable on the one or more displays or the other one or more displays.
3. The Virtual Reality headset as claimed in claim 2, wherein the model of the user in the virtual environment is a three dimensional model.
4. The Virtual Reality headset as claimed in any one of the preceding claims, wherein the camera is mounted to the frame.
5. The Virtual Reality headset as claimed in any one of the preceding claims, wherein the frame comprises an elongate member extending away from a face of the user and the camera is mounted on the elongate member.
6. The Virtual Reality headset as claimed in claim 5, wherein the elongate member is moveable to adjust position of the camera with respect to the face of the user.
7. The Virtual Reality headset as claimed in any one of the preceding claims, wherein the camera is mounted to the frame at a portion along a line of symmetry of a face of the user and the camera is directed in a direction for capturing the facial expression of the user.
8. The Virtual Reality headset as claimed in any one of the preceding claims, wherein the frame comprises the processor.
9. The Virtual Reality headset as claimed in any one of the preceding claims, wherein the communication interface is a wireless data transmitter.
10. The Virtual Reality headset as claimed in any one of the preceding claims, wherein the Virtual Reality headset comprises a microphone.
11. The Virtual Reality headset as claimed in any one of the preceding claims, wherein the distance of the camera away from the face of the user is 1 cm to 10 cm.
12. The Virtual Reality headset as claimed in any one of the preceding claims, wherein the camera captures the at least one image from a first angle and the Virtual Reality headset includes one or more cameras for capturing at least one other image of the facial expression of the user from a second angle different from the first angle.
13. The Virtual Reality headset as claimed in any one of the preceding claims, wherein the processor is configured for compensating distortion of the at least one image captured for generating the image displayable on the one or more displays or the other one or more displays.
14. The Virtual Reality headset as claimed in any one of the preceding claims, wherein the virtual environment is generated for user interaction in a social media application.
15. The Virtual Reality headset as claimed in any one of the preceding claims, wherein the virtual environment is generated for user interaction in a game application.
Priority Applications (1)

Application Number: PCT/SG2015/050101 · Priority Date: 2015-05-08 · Filing Date: 2015-05-08 · Title: A virtual reality headset

Publications (1)

Publication Number: WO2016182504A1 · Publication Date: 2016-11-17

Family ID: 57249293

Country Status (1)

Country: WO · Document: WO2016182504A1 (en)



Legal Events

121 Ep: the EPO has been informed by WIPO that EP was designated in this application (Ref document number: 15891995; Country of ref document: EP; Kind code of ref document: A1)

NENP: Non-entry into the national phase (Ref country code: DE)

122 Ep: PCT application non-entry in European phase (Ref document number: 15891995; Country of ref document: EP; Kind code of ref document: A1)