WO2021196337A1

WO2021196337A1 - Method for processing interactive voice data in multi-person vr scene

Info

Publication number: WO2021196337A1
Application number: PCT/CN2020/088827
Authority: WO
Inventors: 邢维振; 尹桑
Original assignee: 上海唯二网络科技有限公司
Priority date: 2020-04-03
Filing date: 2020-05-06
Publication date: 2021-10-07
Also published as: CN111475022A

Abstract

The present invention relates to the technical field of virtual reality (VR). Disclosed is a method for processing interactive voice data in a multi-person VR scene, comprising: receiving voice data from voice sending terminals; according to positions of current players corresponding to all on-line terminals in the multi-person VR scene, searching for voice receiving terminals located around the voice sending terminals, all the on-line terminals including the voice sending terminals; and forwarding the voice data to the found voice receiving terminals. In this way, by means of the method for determining voice receiving players according to a spatial position relationship between players in the multi-person VR scene, only the on-line terminals located around the voice sending terminals can receive the voice data sent by the voice sending terminals, thus, while ensuring the degree of realism of VR, the voice data received by other on-line terminals not around the voice sending terminals is effectively reduced, avoiding the problem of voice content not being able to be distinguished due to the voice from multiple parties being received simultaneously and mixed, and greatly improving the experience of the players in the multi-person VR scene.

Description

A method for processing interactive voice data in a multi-person VR scene

Technical field

The invention belongs to the field of virtual reality technology, and specifically relates to a method for processing interactive voice data in a multi-person VR scene.

Background technique

With the rapid development of communication technology and computer technology, VR (Virtual Reality) technology has gradually become popular. Virtual reality VR technology is a computer simulation system that can create and experience a virtual world. It uses a computer to generate a simulation environment. It is also a system simulation of multi-source information fusion, interactive three-dimensional dynamic vision and entity behavior. Immerse the user in the environment. At present, the VR technology is widely used in scenes such as film and television, virtual reality games, painting, etc. The most convenient way to realize it is to use a smart phone with VR glasses and a headset to achieve virtual audio-visual effects. This way of experience plays a cost advantage, improves personal immersion, and is not limited to a single-person VR experience, but can also be experienced in scenarios such as multi-person VR games or multi-person VR meetings.

In the existing multiplayer VR scene, due to the limitation of the equipment, the communication method between players cannot be carried out by inputting text through the keyboard as conveniently as on the computer. Therefore, real-time voice communication is communicated in the multiplayer VR scene. Mainstream means. However, when real-time voice is performed in a multi-person VR scene, there will be problems that the voice is unclear and illegible due to the simultaneous reception of multiple voices and mixed audio display.

Summary of the invention

In order to solve the problem of unclear and illegible speech caused by simultaneous reception and mixing of multiple people’s voices when performing real-time voice in a multi-person VR scene, the purpose of the present invention is to provide a multi-person VR scene. The method, device, computer equipment, terminal equipment and computer storage medium for processing interactive voice data in the scene.

In the first aspect, the present invention provides a method for processing interactive voice data in a multi-person VR scene, which is suitable for execution on the server side, and includes:

Receive voice data from the voice sending terminal;

Searching for voice receiving terminals located around the voice sending terminal according to the corresponding current player positions of all online terminals in the multiplayer VR scene, where all online terminals include the voice sending terminal;

The voice data is forwarded to the found voice receiving terminal.

Based on the above-mentioned inventive content, the way of receiving voice players can be determined by the spatial position relationship between players in a multiplayer VR scene, so that only online terminals located around the voice sending terminal can receive the voice data sent by the voice sending terminal Therefore, while ensuring the authenticity of VR, other non-surrounding online terminals can effectively reduce the received voice data, avoiding the problem of indistinguishable voice content due to simultaneous mixing of multiple parties, greatly improving the multiplayer VR scene for players Experience. In addition, it can greatly reduce the amount of voice data transmission, relieve network pressure, and reduce the probability of voice data loss during data transmission, thereby avoiding the occurrence of incoherent voice, delay and quality due to excessive data loss. The problem of deterioration.

In a possible design, for an online terminal that is not a voice sending terminal, it is determined whether it is located around the voice sending terminal according to the following method: According to the current player position of the online terminal, it is determined whether the online terminal is in a If it is in a first sector area centered on the current player position of the voice sending terminal and the radius is the first distance, it is determined that the online terminal is a first-type voice receiving terminal located around the voice sending terminal , Wherein the sector angle of the first sector area is the corresponding current player perspective of the voice sending terminal in the multi-person VR scene. Through the foregoing design, if the online terminal is in the first sector area, it indicates that the corresponding online player is located in front of the voice emitting player in the multiplayer VR scene. If the online terminal is used as the voice receiver of the voice data The terminal will greatly match the real scene and enhance the reality of VR.

In a possible design, if the online terminal is not located in the first sector area, the method further includes: judging whether the online terminal is in a voice-transmitting terminal according to the current player position of the online terminal In the second fan-shaped area with the current player position as the center and the second distance as the radius, if so, it is determined that the online terminal is a second-type voice receiving terminal located around the voice sending terminal, wherein The second distance is smaller than the first distance, and the fan angle of the second fan-shaped area is the corresponding current non-player perspective of the voice sending terminal in the multiplayer VR scene. Through the foregoing design, if the online terminal is in the second sector area, it indicates that the corresponding online player is located behind the voice emitting player in the multiplayer VR scene. If the online terminal is used as the voice receiver of the voice data The terminal will also greatly match the real scene and further enhance the reality of VR.

In a possible design, for an online terminal that is not a voice sending terminal, it is determined whether it is located around the voice sending terminal according to the following method: according to the current player position of the voice sending terminal, it is judged whether the voice sending terminal is It is located in a third sector area centered on the current player position of the online terminal and a radius of the third distance. If so, it is determined that the online terminal is a third-type voice receiver located around the voice sending terminal A terminal, wherein the fan-shaped angle of the third fan-shaped area is the corresponding current player perspective of the online terminal in the multiplayer VR scene. Through the aforementioned design, the online terminal can select the target to be listened to or highlight the voice data of the target to be listened to according to the perspective of the receiver, thereby enhancing the freedom of listening of the online player.

In a possible design, if the voice emitting terminal is not located in the third sector area, the method further includes: judging whether the voice emitting terminal is in a position of the voice emitting terminal according to the current player position of the voice emitting terminal. If the current player position of the online terminal is the center and the fourth fan-shaped area with the fourth distance as the radius, it is determined that the online terminal is a fourth-type voice receiving terminal located around the voice sending terminal, where The fourth distance is smaller than the third distance, and the fan angle of the fourth fan-shaped area is the corresponding current non-player perspective of the online terminal in the multiplayer VR scene.

In a possible design, for the found second-type voice receiving terminal or fourth-type voice receiving terminal, before forwarding the voice data, the voice data is attenuated. Through the aforementioned design, the reality of virtual reality VR can be further improved.

In the second aspect, the present invention provides a device for processing interactive voice data in a multi-person VR scene, including a voice data receiving unit, a receiving terminal searching unit, and a voice data forwarding unit that are sequentially connected in communication;

The voice data receiving unit is used to receive voice data from a voice sending terminal;

The receiving terminal searching unit is configured to search for voice receiving terminals located around the voice sending terminal according to the corresponding current player positions of all online terminals in the multiplayer VR scene, where all online terminals include the voice sending terminal;

The voice data forwarding unit is used to forward the voice data to the found voice receiving terminal.

In a third aspect, the present invention provides a computer device, including a memory, a processor, and a transceiver that are sequentially connected in communication, wherein the memory is used to store a computer program, the transceiver is used to send and receive data, and the processor is used to The computer program is read, and the method described in the first aspect or any one of the possible designs in the first aspect is executed.

In a fourth aspect, the present invention provides a computer-readable storage medium having instructions stored on the computer-readable storage medium. When the instructions are run on a computer, any one of the first aspect or the first aspect is executed. A possible design of the method.

In the fifth aspect, the present invention provides a computer program product containing instructions, which when the instructions run on a computer, cause the computer to execute the method described in the first aspect or any one of the possible designs in the first aspect. .

In the sixth aspect, the present invention provides a method for processing interactive voice data in a multi-person VR scene, which is suitable for execution on the terminal side, and includes:

Receive the voice data forwarded by the server and from the voice sending terminal;

According to the corresponding current player positions of the local online terminal and the voice sending terminal in the multiplayer VR scene, determine whether the local online terminal is a voice receiving terminal located around the voice sending terminal, and if so, output and display the voice data, Otherwise, stop displaying the voice data.

Based on the above-mentioned invention content, the way of receiving voices from players can be determined by the spatial position relationship between players in a multiplayer VR scene, so that only online terminals located around the voice sending terminal can display the voice data sent by the voice sending terminal Therefore, while ensuring the reality of VR, local online terminals can effectively reduce the display of voice data, avoid the problem of indistinguishable voice content due to simultaneous mixing of multiple parties, and greatly enhance the player's multiplayer VR scene experience.

In a possible design, it is determined whether the local online terminal is located around the voice sending terminal according to the following method: According to the current player position of the local online terminal, it is determined whether the local online terminal is located in a voice sending terminal. In the first fan-shaped area with the current player position as the center and the first distance as the radius, if it is, it is determined that the local online terminal is the first type of voice receiving terminal located around the voice sending terminal, wherein the first The fan-shaped angle of a fan-shaped area is the corresponding current player perspective of the voice sending terminal in the multi-person VR scene. Through the foregoing design, if the local online terminal is in the first sector area, it indicates that the local online player is in front of the voice-producing player in the multiplayer VR scene. If the local online terminal is used as the voice of the voice data The receiving terminal will greatly match the real scene and enhance the reality of VR.

In a possible design, if the local online terminal is not located in the first fan-shaped area, the method further includes: judging whether the local online terminal is located in the first sector according to the current player position of the local online terminal. If the current player position of the voice sending terminal is in a second fan-shaped area centered and the second distance is the radius, it is determined that the local online terminal is a second-type voice receiving terminal located around the voice sending terminal, where The second distance is smaller than the first distance, and the fan angle of the second fan-shaped area is the corresponding current non-player perspective of the voice sending terminal in the multiplayer VR scene. Through the foregoing design, if the local online terminal is in the second sector area, it indicates that the local online player is behind the voice-producing player in the multiplayer VR scene. If the local online terminal is used as the voice of the voice data The receiving terminal will also greatly match the real scene and further enhance the reality of VR.

In a possible design, it is determined whether the local online terminal is located around the voice sending terminal according to the following method: According to the current player position of the voice emitting terminal, it is determined whether the voice emitting terminal is located at a location where the local online terminal is located. If it is in a third fan-shaped area with the current player position as the center and the third distance as the radius, it is determined that the local online terminal is a third-type voice receiving terminal located around the voice sending terminal, wherein the first The fan angle of the three-sector area is the corresponding current player perspective of the local online terminal in the multiplayer VR scene. Through the foregoing design, the local online terminal can select the target to be listened to or highlight the voice data of the target of interest according to the perspective of the receiver, thereby enhancing the listening freedom of the local online player.

In a possible design, if the voice emitting terminal is not located in the third sector area, the method further includes: judging whether the voice emitting terminal is in a position of the voice emitting terminal according to the current player position of the voice emitting terminal. If the current player position of the local online terminal is in the fourth fan-shaped area centered on the fourth distance and the radius is the fourth distance, it is determined that the local online terminal is a fourth-type voice receiving terminal located around the voice sending terminal, where The fourth distance is smaller than the third distance, and the fan angle of the fourth fan-shaped area is the corresponding current non-player perspective of the local online terminal in the multiplayer VR scene.

In a seventh aspect, the present invention provides a device for processing interactive voice data in a multi-person VR scene, including a receiving unit and a display unit connected in communication;

The receiving unit is configured to receive voice data forwarded by the server and from the voice sending terminal;

The display unit is configured to determine whether the local online terminal is a voice receiving terminal located around the voice sending terminal according to the corresponding current player positions of the local online terminal and the voice sending terminal in the multiplayer VR scene, and if so, Output and display the voice data, otherwise stop displaying the voice data.

In an eighth aspect, the present invention provides a terminal device, including a memory, a processor, and a transceiver that are sequentially connected in communication, wherein the memory is used to store a computer program, the transceiver is used to send and receive data, and the processor is used to The computer program is read, and the method described in any one of the above sixth aspect or the sixth aspect may be designed.

In a ninth aspect, the present invention provides a computer-readable storage medium having instructions stored on the computer-readable storage medium. When the instructions are run on a computer, any one of the sixth aspect or the sixth aspect is executed. A possible design of the method.

In a tenth aspect, the present invention provides a computer program product containing instructions, which when the instructions run on a computer, cause the computer to execute the method described in the sixth aspect or any one of the possible designs in the sixth aspect. .

Description of the drawings

In order to explain the embodiments of the present invention or the technical solutions in the prior art more clearly, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the drawings in the following description are only These are some embodiments of the present invention. For those of ordinary skill in the art, other drawings can be obtained based on these drawings without creative work.

Fig. 1 is a schematic flowchart of the first method for processing interactive voice data in a multi-person VR scene provided by the present invention.

Fig. 2 is an example diagram of a voice-producing player and other online players in a multiplayer VR scene provided by the present invention.

Fig. 3 is a schematic structural diagram of the first device for processing interactive voice data in a multi-person VR scene provided by the present invention.

Fig. 4 is a schematic diagram of the structure of the computer equipment provided by the present invention.

Fig. 5 is a schematic flowchart of a second method for processing interactive voice data in a multi-person VR scene provided by the present invention.

Fig. 6 is a schematic structural diagram of a second device for processing interactive voice data in a multi-person VR scene provided by the present invention.

Fig. 7 is a schematic structural diagram of a terminal device provided by the present invention.

Detailed ways

The present invention will be further described below in conjunction with the drawings and specific embodiments. It should be noted here that although the description of these embodiments is used to help understand the present invention, it does not constitute a limitation to the present invention. The specific structural and functional details disclosed herein are only used to describe example embodiments of the present invention. However, the present invention may be embodied in many alternative forms, and it should not be understood that the present invention is limited to the embodiments set forth herein.

It should be understood that although the terms first, second, etc. may be used herein to describe various units, these units should not be limited by these terms. These terms are only used to distinguish one unit from another. For example, the first unit may be referred to as the second unit, and the second unit may be referred to as the first unit similarly, without departing from the scope of the exemplary embodiment of the present invention.

It should be understood that the term "and/or" that may appear in this text is only an association relationship describing associated objects, which means that there can be three relationships. For example, A and/or B can mean that there is A alone, There are B alone, and there are three situations of A and B at the same time; for the term "/ and" that may appear in this article, it describes another related object relationship, indicating that there can be two relationships, for example, A/ and B, you can Means: A is alone, A and B are alone; In addition, for the character "/" that may appear in this article, it generally means that the associated objects before and after are in an "or" relationship.

It should be understood that when a unit is referred to herein as being "connected", "connected" or "coupled" to another unit, it can be directly connected or coupled to the other unit, or intermediate units may exist. In contrast, when a unit is referred to herein as being "directly connected" or "directly coupled" with another unit, it means that there is no intermediate unit. In addition, other words used to describe the relationship between units should be interpreted in a similar way (for example, "between" and "directly between", "adjacent" and "directly adjacent", etc. Wait).

It should be understood that the terms used herein are only used to describe specific embodiments and are not intended to limit the exemplary embodiments of the present invention. If used herein, the singular forms "a", "an" and "the" are intended to include plural forms, unless the context clearly indicates the opposite. It should also be understood that when the terms "include", "includes", "includes" and/or "includes" are used in this document, they specify the characteristics, integers, steps, operations, units and/or components of the statement. Existence, and does not exclude the existence or addition of one or more other features, quantities, steps, operations, units, components and/or their combinations.

It should be understood that it should also be noted that in some alternative embodiments, the functions/acts that appear may be different from the order in which they appear in the drawings. For example, depending on the functions/actions involved, it may actually be executed concurrently in nature, or sometimes two diagrams shown in succession may be executed in reverse order.

It should be understood that specific details are provided in the following description to facilitate a complete understanding of the exemplary embodiments. However, those of ordinary skill in the art should understand that the exemplary embodiments can be implemented without these specific details. For example, the system can be shown in a block diagram to avoid unnecessary details to make the example unclear. In other instances, well-known procedures, structures, and techniques may not be shown in unnecessary details to avoid making the example embodiments unclear.

Example one

As shown in FIG. 1, the method for processing interactive voice data in a multi-person VR scene provided by this embodiment is suitable for execution on the server side, and may but is not limited to include the following steps S101 to S103.

S101. Receive voice data from a voice sending terminal.

In the step S101, the voice sending terminal is an electronic device held by the voice-producing player and used to participate in the multi-person VR scene experience, which may include, but is not limited to, a smart phone or a VR experience machine. The voice data is obtained by the voice sending terminal collecting the real-time voice of the player from the voice (for example, collected by a configured microphone), and then transmitted to the server for maintaining the multi-person VR scene experience through the Internet.

S102. According to the corresponding current player positions of all online terminals in the multiplayer VR scene, search for voice receiving terminals located around the voice sending terminal, where all online terminals include the voice sending terminal.

In the step S102, the online terminal is an electronic device held by different players and used to participate in a multi-person VR scene experience. Like the voice sending terminal, it can include, but is not limited to, a smart phone or a VR experience. Machine waiting. Since the server is an existing VR server used to maintain the multiplayer VR scene experience, it has the corresponding current player position of each online terminal in the multiplayer VR scene, and then can determine the voice of the player and other online players based on these current player positions The relative distance of the player in the multiplayer VR scene. At the same time, considering that in the real scene, other players located around the voice-producing player can hear the voice made by the voice-producing player. Therefore, the relative distance can be used to determine which online players are online. The terminal can receive the voice data, that is, find the voice receiving terminal located around the voice sending terminal, and the voice receiving terminal found can receive the voice data.

S103. Forward the voice data to the found voice receiving terminal.

In the step S103, the forwarding method is still transmission through the Internet. In addition, if at least one voice receiving terminal located around the voice sending terminal is not found, it indicates that in the multiplayer VR scene, there are no other online players around the voice emitting player. For the online terminals of these online players, There is no need to acquire and display the voice data, so the forwarding of the voice data will be terminated.

Therefore, based on the server-side processing method described in the foregoing steps S101 to S103, the way of voice receiving players can be determined through the spatial position relationship between players in a multiplayer VR scene, so that only online terminals located around the voice sending terminal can Receive the voice data sent by the voice sending terminal, thereby ensuring the authenticity of VR while enabling other non-surrounding online terminals to effectively reduce the received voice data, avoiding the inability to distinguish the voice content due to simultaneous mixing of multiple parties The problem has greatly improved the player’s multiplayer VR scene experience. In addition, it can greatly reduce the amount of voice data transmission, relieve network pressure, and reduce the probability of voice data loss during data transmission, thereby avoiding the occurrence of incoherent voice, delay and quality due to excessive data loss. The problem of deterioration.

Example two

On the basis of the first embodiment, this embodiment specifically proposes a technical solution of how to find the voice receiving terminal located around the voice sending terminal. The terminal judges whether it is located around the voice sending terminal according to the following method: According to the current player position of the online terminal, it is judged whether the online terminal is located at a center with the current player position of the voice sending terminal as the center and first If the distance is within the first sector area of the radius, it is determined that the online terminal is a first-type voice receiving terminal located around the voice sending terminal, wherein the sector angle of the first sector area is the voice Send the terminal's corresponding current player perspective in the multiplayer VR scene.

In the above manner, the corresponding current player perspective of the voice sending terminal is the current perspective of the voice emitting player in the multiplayer VR scene, such as the perspective α in FIG. When experiencing a multiplayer VR scene, it is sent to the server in real time. Therefore, if the online terminal is in the first sector area, it indicates that the corresponding online player is in front of the voice emitting player in the multiplayer VR scene. If the online terminal is used as the voice receiving terminal for the voice data, It will greatly match the real scene and enhance the reality of VR.

Considering that in a real scene, other online players located behind the voice-producing player can also hear the voice of the voice-producing player. Therefore, in order to further enhance the reality of VR, if the online terminal is not located in the first sector area, then It includes: judging whether the online terminal is in a second fan-shaped area with the current player position of the voice sending terminal as the center and a second distance as the radius according to the current player position of the online terminal, if it is, then It is determined that the online terminal is a second-type voice receiving terminal located around the voice sending terminal, wherein the second distance is smaller than the first distance, and the sector angle of the second sector area is the voice sending terminal Corresponding to the current non-player perspective in a multiplayer VR scene. The corresponding current non-player perspective of the voice sending terminal is the perspective opposite to the current perspective of the voice-producing player in the multiplayer VR scene, as shown in the perspective β in Figure 2. The parameters of this perspective can be based on the voice sending terminal’s It corresponds to the current player's perspective. Since the main direction of propagation after a person emits a sound is straight forward, the propagation distance toward the rear will be significantly lower than that of the front. Therefore, the second distance needs to be set to be smaller than the first distance, so as to ensure the authenticity of the virtual reality VR. Therefore, if the online terminal is in the second sector area, it indicates that the corresponding online player is behind the voice emitting player in the multiplayer VR scene. If the online terminal is used as the voice receiving terminal for the voice data, It will also greatly match the real scene and further enhance the reality of VR.

Considering that in a real scene, the sound propagation path behind the player will include diffraction, reflection, or diffusion, which will cause obvious signal fading and noise enhancement. Therefore, in order to further enhance the reality of virtual reality VR , Further including: for the found second-type voice receiving terminal, before forwarding the voice data, performing attenuation processing on the voice data. The specific method of the attenuation processing is an existing method, which may include, but is not limited to, methods such as volume reduction and/or noise insertion.

For example, as shown in Figure 2, in a multiplayer VR scene, there are voice-producing player A corresponding to the voice sending terminal and online players B, C, D, E, F, and G corresponding to other online terminals, where the The current player viewing angle α of the voice-producing player A is 36 degrees, and the current non-player viewing angle β is 324 degrees (that is, 360 degrees-36 degrees). Therefore, the online players in the first sector area have online player B, and the online terminal corresponding to online player B is the first type of voice receiving terminal, which can receive and display the complete voice data; The online players in the second sector area include online player C, and the online terminal corresponding to online player C is the second type of voice receiving terminal, which can receive and display the attenuated voice data. As for the other players D, E, F, and G, they are not in the first sector area or the second sector area due to the distance in the multiplayer VR scene. The online terminals corresponding to them are different. In other words, the voice data will not be received in any form, which not only reduces the transmission volume of voice data, relieves network pressure, but also reduces the problem of indistinguishable voice content caused by simultaneous multi-party mixing in these online terminals.

Example three

On the basis of the first embodiment, this embodiment also specifically proposes another technical solution that is different from the second embodiment and how to find the voice receiving terminal located around the voice sending terminal, that is, in the step S102, For a certain online terminal that is not a voice sending terminal, determine whether it is located around the voice sending terminal according to the following method: Determine whether the voice emitting terminal is located in the vicinity of the online terminal according to the current player position of the voice emitting terminal If it is in a third fan-shaped area with the current player position as the center and the third distance as the radius, it is determined that the online terminal is a third-type voice receiving terminal located around the voice sending terminal, wherein the first The fan angle of the three-sector area is the corresponding current player perspective of the online terminal in the multiplayer VR scene.

In the above manner, the corresponding current player perspective of the online terminal is the current perspective of the online player in the multiplayer VR scene, and the parameters of this perspective will also be transmitted to the online terminal in real time when experiencing the multiplayer VR scene. server. The above proposes a technical solution for determining whether it is located around the voice sending terminal based on the corresponding current player's perspective of the online terminal. Although it may cause the reality of VR to be inferior to the second embodiment because it does not completely match the sound propagation principle, The online terminal can select the target to be listened to or highlight the voice data of the target according to the perspective of the receiver, thereby enhancing the freedom of listening of the online player.

Similar to the second embodiment, if the voice emitting terminal is not located in the third fan-shaped area, the method further includes: judging whether the voice emitting terminal is in a position of the voice emitting terminal according to the current player position of the voice emitting terminal. If the current player position of the online terminal is in the fourth fan-shaped area centered on the fourth distance and the radius is the fourth distance, it is determined that the online terminal is a fourth-type voice receiving terminal located around the voice sending terminal, where The fourth distance is smaller than the third distance, and the fan angle of the fourth fan-shaped area is the corresponding current non-player perspective of the online terminal in the multiplayer VR scene. The corresponding current non-player perspective of the online terminal is the perspective opposite to the current perspective of the online player in the multiplayer VR scene, and the parameters of the perspective may also be obtained based on the corresponding current player perspective of the online terminal. In addition, for the found fourth-type voice receiving terminal, the voice data may also be attenuated before forwarding the voice data.

Example four

As shown in FIG. 3, this embodiment provides a hardware device that implements the method for processing interactive voice data in a multi-person VR scene described in any one of Embodiments 1 to 3, including voices connected in sequence. The data receiving unit, the receiving terminal searching unit, and the voice data forwarding unit; the voice data receiving unit is used to receive voice data from the voice sending terminal; the receiving terminal searching unit is used to determine whether all online terminals are in a multiplayer VR scene Corresponding to the current player position in, search for voice receiving terminals located around the voice sending terminal, where all online terminals include the voice sending terminal; the voice data forwarding unit is used to forward the voice data to the found voice receiving terminal Describe the voice data.

For the working process, working details, and technical effects of the aforementioned device provided in this embodiment, reference may be made to the method described in any one of Embodiment 1 to Embodiment 3, and will not be repeated here.

Example five

As shown in FIG. 4, this embodiment provides a computer device that executes the method for processing interactive voice data in a multi-person VR scene described in any one of the first to the third embodiment, and includes a memory connected in sequence. , A processor, and a transceiver, wherein the memory is used to store a computer program, the transceiver is used to send and receive data, and the processor is used to read the computer program, and execute as described in any one of the first to the third embodiment. Describes the method of processing interactive voice data in a multi-person VR scene. For specific examples, the memory may include, but is not limited to, random access memory (RAM), read only memory (ROM), flash memory (Flash Memory), first-in first-out memory (FIFO), and/or first-in-last-out memory (FILO) Etc.; the transceiver may include, but is not limited to, a WiFi (wireless fidelity) wireless transceiver, a Bluetooth wireless transceiver, a GPRS (General Packet Radio Service, general packet radio service technology) wireless transceiver and/or ZigBee (Zigbee) Protocol, a low-power local area network protocol based on the IEEE802.15.4 standard) wireless transceiver, etc.; the processor may not be limited to adopting a microprocessor of the STM32F105 series. In addition, the computer device may also include, but is not limited to, a power supply module, a display screen, and other necessary components.

For the working process, working details, and technical effects of the aforementioned computer equipment provided in this embodiment, reference may be made to the method described in any one of Embodiment 1 to Embodiment 3, and details are not described herein again.

Example Six

This embodiment provides a computer-readable storage medium that stores instructions containing the method for processing interactive voice data in a multi-person VR scene described in any one of Embodiments 1 to 3, that is, the computer-readable storage medium Instructions are stored on the storage medium, and when the instructions are run on a computer, the method for processing interactive voice data in a multi-person VR scene as described in any one of Embodiment 1 to Embodiment 3 is executed. Wherein, the computer-readable storage medium refers to a carrier for storing data, which can include, but is not limited to, a floppy disk, an optical disk, a hard disk, a flash memory, a USB flash drive, and/or a memory stick (Memory Stick), etc. The computer may be a general-purpose computer, a dedicated Computers, computer networks, or other programmable devices.

For the working process, working details, and technical effects of the aforementioned computer-readable storage medium provided in this embodiment, refer to the method described in any one of Embodiments 1 to 3, and will not be repeated here.

Example Seven

This embodiment provides a computer program product containing instructions. When the instructions run on a computer, the computer executes the interactive operation in a multi-person VR scene as described in any one of the first to third embodiments. The method of processing voice data. Wherein, the computer may be a general-purpose computer, a special-purpose computer, a computer network, or other programmable devices.

Example eight

As shown in FIG. 5, the method for processing interactive voice data in a multi-person VR scene provided by this embodiment is suitable for execution on the terminal side, and may include but is not limited to the following steps S201 to S202.

S201. Receive voice data forwarded by the server and from the voice sending terminal.

In the step S201, the voice sending terminal is an electronic device held by the voice-producing player and used to participate in the multi-person VR scene experience, which may include, but is not limited to, a smart phone or a VR experience machine. The voice data is obtained by the voice sending terminal collecting the real-time voice of the player (for example, collected by the configured microphone), and then transmitted to the server used to maintain the multi-person VR scene experience through the Internet. The server is transmitted to the local online terminal through the Internet. The local online terminal is an electronic device owned by a local player and used to participate in a multi-person VR scene experience. Like the voice sending terminal, it can include, but is not limited to, a smart phone or a VR experience machine.

S202. According to the corresponding current player positions of the local online terminal and the voice sending terminal in the multiplayer VR scene, determine whether the local online terminal is a voice receiving terminal located around the voice sending terminal, and if so, output and display the voice Data, otherwise stop displaying the voice data.

In the step S202, since the server is an existing VR server used to maintain the multiplayer VR scene experience, it has the corresponding current player position of each online terminal in the multiplayer VR scene, so the local online terminal can easily download from The server side obtains the corresponding current player position of the local online terminal in the multiplayer VR scene and the corresponding current player position of the voice sending terminal in the multiplayer VR scene. Therefore, the local online terminal can determine the relative distance between the voice-producing player and the local online player in the multiplayer VR scene based on the aforementioned two current player positions, while considering that in the real scene, only other players located around the voice-producing player can hear it. The voice emits the voice of the player, so it can finally be determined according to the relative distance whether the local online terminal of the local online player can perform real-time display after receiving the voice data, that is, it is judged whether the local online terminal is located around the voice sending terminal If yes, output and display the voice data; otherwise, stop displaying the voice data. In addition, the specific display method can be, but is not limited to, making a sound through a voice horn.

Therefore, based on the terminal-side processing method described in the foregoing steps S201 to S202, the way of receiving voice players can be determined through the spatial position relationship between players in a multiplayer VR scene, so that only online terminals located around the voice sending terminal can Display the voice data sent by the voice sending terminal, so that while ensuring the authenticity of VR, the local online terminal can effectively reduce the display of voice data, and avoid the problem of indistinguishable voice content caused by simultaneous mixing of multiple parties , Greatly enhance the player's multiplayer VR scene experience.

Example 9

On the basis of the eighth embodiment, this embodiment specifically proposes a technical solution of how to determine whether the local online terminal is a voice receiving terminal located around the voice sending terminal, that is, in the step S202, it is determined as follows Whether the local online terminal is located around the voice transmitting terminal: According to the current player position of the local online terminal, it is determined whether the local online terminal is located at a center with the current player position of the voice transmitting terminal and a first distance as the center If it is in the first sector area of the radius, it is determined that the local online terminal is a first-type voice receiving terminal located around the voice sending terminal, wherein the sector angle of the first sector area is the voice sending terminal The terminal corresponds to the current player's perspective in the multiplayer VR scene.

Similar to the second embodiment, in the above manner, the corresponding current player perspective of the voice sending terminal is the current perspective of the voice emitting player in the multiplayer VR scene, and the parameters of this perspective will be experienced by the voice sending terminal. The human VR scene is transmitted to the server in real time, and then transmitted by the server to the local online terminal through the Internet. Therefore, if the local online terminal is in the first sector area, it indicates that the local online player is in front of the voice emitting player in the multiplayer VR scene. If the local online terminal is used as the voice receiving terminal for the voice data , It will greatly match the real scene and enhance the reality of VR.

Considering that in a real scene, other online players behind the voice-producing player can also hear the voice of the voice-producing player. Therefore, in order to further enhance the reality of VR, if the local online terminal is not located in the first sector area, It also includes: judging whether the local online terminal is in a second fan-shaped area centered on the current player position of the voice sending terminal and a radius of a second distance according to the current player position of the local online terminal, if If it is, it is determined that the local online terminal is a second-type voice receiving terminal located around the voice sending terminal, wherein the second distance is less than the first distance, and the fan angle of the second sector area is The voice sending terminal corresponds to the current non-player perspective in the multiplayer VR scene. The corresponding current non-player perspective of the voice sending terminal is a perspective opposite to the current perspective of the voice-producing player in the multiplayer VR scene, and the parameters of the perspective may be obtained based on the corresponding current player perspective of the voice sending terminal. Since the main direction of propagation after a person emits a sound is straight forward, the propagation distance toward the rear will be significantly lower than that of the front. Therefore, the second distance needs to be set to be smaller than the first distance, so as to ensure the authenticity of the virtual reality VR. Therefore, if the local online terminal is in the second sector area, it indicates that the local online player is behind the voice emitting player in the multiplayer VR scene. If the local online terminal is used as the voice receiving terminal for the voice data , It will also greatly match the real scene and further enhance the reality of VR.

Considering that in a real scene, the sound propagation path behind the player will include diffraction, reflection, or diffusion, which will cause obvious signal fading and noise enhancement. Therefore, in order to further enhance the reality of virtual reality VR , Further comprising: performing attenuation processing on the voice data before displaying the voice data. The specific method of the attenuation processing is an existing method, which may include, but is not limited to, methods such as volume reduction and/or noise insertion.

Example ten

On the basis of the first embodiment, this embodiment also specifically proposes another technical solution that is different from the ninth embodiment and how to judge whether the local online terminal is a voice receiving terminal located around the voice sending terminal, that is, in the In step S202, it is determined whether the local online terminal is located around the voice sending terminal according to the following method: according to the current player position of the voice emitting terminal, it is determined whether the voice emitting terminal is a current player using the local online terminal If it is in a third sector area with a position as the center and a third distance as a radius, it is determined that the local online terminal is a third-type voice receiving terminal located around the voice sending terminal, wherein the third sector The fan-shaped angle of the area is the corresponding current player perspective of the local online terminal in the multiplayer VR scene.

Similar to the third embodiment, in the above manner, the corresponding current player perspective of the local online terminal is the current perspective of the online player in the multiplayer VR scene, and the parameters of the perspective are generated locally. The above proposes a technical solution for determining whether it is located around the voice sending terminal based on the corresponding current player perspective of the local online terminal, although it may cause the reality of VR to be inferior to the ninth embodiment because it does not completely match the sound propagation principle. However, the local online terminal can select the target to be listened to or highlight the voice data of the target according to the perspective of the receiver, which improves the listening freedom of the local online player.

Similar to the ninth embodiment, if the voice emitting terminal is not located in the third sector area, the method further includes: judging whether the voice emitting terminal is in a position of the voice emitting terminal according to the current player position of the voice emitting terminal. If the current player position of the local online terminal is in the fourth fan-shaped area centered on the fourth distance and the radius is the fourth distance, it is determined that the local online terminal is a fourth-type voice receiving terminal located around the voice sending terminal, where The fourth distance is smaller than the third distance, and the fan angle of the fourth fan-shaped area is the corresponding current non-player perspective of the local online terminal in the multiplayer VR scene. The corresponding current non-player perspective of the local online terminal is the perspective opposite to the current perspective of the local online player in the multiplayer VR scene, and the parameters of the perspective may also be obtained based on the corresponding current player perspective of the local online terminal. In addition, before displaying the voice data, the voice data may be attenuated.

Example 11

As shown in FIG. 6, this embodiment provides a hardware device that implements the method for processing interactive voice data in a multi-person VR scene described in any one of the eighth to the tenth embodiment, including the receiving of a communication connection Unit and display unit; the receiving unit is used to receive the voice data forwarded by the server and from the voice sending terminal; the display unit is used to respond to the local online terminal and the voice sending terminal in a multi-person VR scene Corresponding to the current player position, determine whether the local online terminal is a voice receiving terminal located around the voice sending terminal, and if so, output and display the voice data; otherwise, stop displaying the voice data.

For the working process, working details, and technical effects of the aforementioned device provided in this embodiment, reference may be made to the method described in any one of Embodiment 8 to Embodiment 10, and will not be repeated here.

Example 12

As shown in FIG. 7, this embodiment provides a terminal device that executes the method for processing interactive voice data in a multi-person VR scene described in any one of the eighth to the tenth embodiment. A memory, a processor, and a transceiver, wherein the memory is used to store a computer program, the transceiver is used to send and receive data, and the processor is used to read the computer program and execute any of the eight to tenth embodiments. A method for processing interactive voice data in a multi-person VR scene. For specific examples, the memory may include, but is not limited to, random access memory (RAM), read only memory (ROM), flash memory (Flash Memory), first-in first-out memory (FIFO), and/or first-in-last-out memory (FILO) Etc.; the transceiver may include, but is not limited to, a WiFi (Wireless Fidelity) wireless transceiver, a Bluetooth wireless transceiver, a GPRS (General Packet Radio Service, general packet radio service technology) wireless transceiver and/or ZigBee (Zigbee) Protocol, a low-power local area network protocol based on the IEEE802.15.4 standard) wireless transceiver, etc.; the processor may not be limited to adopting a microprocessor of the STM32F105 series. In addition, the terminal device may also include, but is not limited to, a power supply module, a display screen, and other necessary components.

For the working process, working details, and technical effects of the foregoing terminal device provided in this embodiment, reference may be made to the method described in any one of Embodiment 8 to Embodiment 10, and details are not described herein again.

Embodiment 13

This embodiment provides a computer-readable storage medium that stores instructions containing the method for processing interactive voice data in a multi-person VR scene described in any one of the eighth to tenth embodiments, that is, the computer can Instructions are stored on the read storage medium, and when the instructions are run on a computer, the method for processing interactive voice data in a multi-person VR scene as described in any one of Embodiment 8 to Embodiment 10 is executed. Wherein, the computer-readable storage medium refers to a carrier for storing data, which can include, but is not limited to, a floppy disk, an optical disk, a hard disk, a flash memory, a USB flash drive, and/or a memory stick (Memory Stick), etc. The computer may be a general-purpose computer, a dedicated Computers, computer networks, or other programmable devices.

For the working process, working details, and technical effects of the aforementioned computer-readable storage medium provided in this embodiment, reference may be made to the method described in any one of Embodiment 8 to Embodiment 10, and will not be repeated here.

Example Fourteen

This embodiment provides a computer program product containing instructions. When the instructions run on a computer, the computer executes the interaction in a multi-person VR scene as described in any one of the eighth to tenth embodiments. The method of processing voice data. Wherein, the computer may be a general-purpose computer, a special-purpose computer, a computer network, or other programmable devices.

The multiple embodiments described above are merely illustrative. If it relates to a unit described as a separate component, it may or may not be physically separate; if it relates to a component displayed as a unit, it may or may not be physically separate. It may not be a physical unit, that is, it may be located in one place, or it may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments. Those of ordinary skill in the art can understand and implement it without creative work.

The above embodiments are only used to illustrate the technical solutions of the present invention, not to limit them; although the present invention has been described in detail with reference to the foregoing embodiments, a person of ordinary skill in the art should understand that: The recorded technical solutions shall be modified or some of the technical features shall be equivalently replaced. However, these modifications or replacements do not cause the essence of the corresponding technical solutions to deviate from the spirit and scope of the technical solutions of the embodiments of the present invention.

Finally, it should be noted that the present invention is not limited to the above-mentioned optional embodiments, and anyone can derive other products in various forms under the enlightenment of the present invention. The above-mentioned specific embodiments should not be construed as limiting the scope of protection of the present invention, and the scope of protection of the present invention should be defined in the claims, and the description can be used to interpret the claims.

Claims

A method for processing interactive voice data in a multi-person VR scene, characterized in that it is suitable for execution on the server side, and includes:

Receive voice data from the voice sending terminal;

Searching for voice receiving terminals located around the voice sending terminal according to the corresponding current player positions of all online terminals in the multiplayer VR scene, where all online terminals include the voice sending terminal;

The voice data is forwarded to the found voice receiving terminal.
The method for processing interactive voice data in a multi-person VR scene according to claim 1, wherein for an online terminal of a non-voice sending terminal, it is determined whether it is located at the voice sending terminal in the following manner. around:

According to the current player position of the online terminal, it is determined whether the online terminal is in a first fan-shaped area centered on the current player position of the voice sending terminal and the radius is a first distance. The online terminal is a first-type voice receiving terminal located around the voice sending terminal, wherein the fan angle of the first sector area is the corresponding current player perspective of the voice sending terminal in a multiplayer VR scene;

Or, according to the current player position of the voice emitting terminal, it is determined whether the voice emitting terminal is in a third fan-shaped area centered on the current player position of the online terminal and the third distance is the radius, if it is, Then it is determined that the online terminal is a third-type voice receiving terminal located around the voice sending terminal, wherein the fan angle of the third sector area is the corresponding current player perspective of the online terminal in the multiplayer VR scene.
The method for processing interactive voice data in a multi-person VR scene as claimed in claim 2, characterized in that:

If the online terminal is not located in the first sector area, the method further includes: judging whether the online terminal is located at a center and centered on the current player position of the voice sending terminal according to the current player position of the online terminal In a second sector area with a second distance as a radius, if it is, it is determined that the online terminal is a second-type voice receiving terminal located around the voice sending terminal, wherein the second distance is smaller than the first voice receiving terminal. Distance, the sector angle of the second sector area is the corresponding current non-player perspective of the voice sending terminal in the multiplayer VR scene;

Alternatively, if the voice emitting terminal is not located in the third sector area, the method further includes: judging whether the voice emitting terminal is a current player using the online terminal according to the current player position of the voice emitting terminal If it is in a fourth fan-shaped area with a location as the center and a fourth distance as a radius, it is determined that the online terminal is a fourth-type voice receiving terminal located around the voice sending terminal, wherein the fourth distance is less than The third distance and the fan angle of the fourth fan-shaped area are the corresponding current non-player viewing angles of the online terminal in the multiplayer VR scene.
The method for processing interactive voice data in a multi-person VR scene as claimed in claim 3, characterized in that: for the second type of voice receiving terminal or the fourth type of voice receiving terminal that is found, the voice is forwarded Before data, attenuation processing is performed on the voice data.
A device for processing interactive voice data in a multi-person VR scene, including a voice data receiving unit, a receiving terminal search unit, and a voice data forwarding unit that are sequentially connected in communication;

The voice data receiving unit is used to receive voice data from a voice sending terminal;

The receiving terminal searching unit is configured to search for voice receiving terminals located around the voice sending terminal according to the corresponding current player positions of all online terminals in the multiplayer VR scene, where all online terminals include the voice sending terminal;

The voice data forwarding unit is used to forward the voice data to the found voice receiving terminal.
A computer device, characterized in that it comprises a memory, a processor, and a transceiver connected in sequence in communication, wherein the memory is used to store a computer program, the transceiver is used to send and receive data, and the processor is used to read the A computer program that executes the method according to any one of claims 1 to 4.
A computer-readable storage medium, characterized in that instructions are stored on the computer-readable storage medium, and when the instructions are run on a computer, the method according to any one of claims 1 to 4 is executed.
A method for processing interactive voice data in a multi-person VR scene, characterized in that it is suitable for execution on the terminal side, and includes:

Receive the voice data forwarded by the server and from the voice sending terminal;

According to the corresponding current player positions of the local online terminal and the voice sending terminal in the multiplayer VR scene, determine whether the local online terminal is a voice receiving terminal located around the voice sending terminal, and if so, output and display the voice data, Otherwise, stop displaying the voice data.
8. The method for processing interactive voice data in a multi-person VR scene according to claim 8, characterized in that it is determined whether a local online terminal is located around the voice sending terminal in the following manner:

According to the current player position of the local online terminal, it is determined whether the local online terminal is in a first fan-shaped area centered on the current player position of the voice sending terminal and the radius is a first distance; if it is, then It is determined that the local online terminal is a first-type voice receiving terminal located around the voice sending terminal, wherein the fan angle of the first sector area is the corresponding current player perspective of the voice sending terminal in a multiplayer VR scene ；

Or, according to the current player position of the voice emitting terminal, it is determined whether the voice emitting terminal is in a third sector area centered on the current player position of the local online terminal and the third distance is the radius. , It is determined that the local online terminal is a third-type voice receiving terminal located around the voice sending terminal, wherein the fan angle of the third sector area is the corresponding current of the local online terminal in the multi-person VR scene Player's perspective.
The method for processing interactive voice data in a multi-person VR scene as claimed in claim 9, characterized in that:

If the local online terminal is not located in the first sector area, the method further includes: judging whether the local online terminal is in a current player position of the voice sending terminal according to the current player position of the local online terminal If it is in the second sector area centered and the second distance is the radius, it is determined that the local online terminal is a second-type voice receiving terminal located around the voice sending terminal, wherein the second distance is less than The first distance and the fan angle of the second fan-shaped area are the corresponding current non-player perspective of the voice sending terminal in the multiplayer VR scene;

Alternatively, if the voice emitting terminal is not located in the third sector area, the method further includes: judging whether the voice emitting terminal is in a current player location based on the local online terminal according to the current player position of the voice emitting terminal. The player’s position is the center and the fourth distance is the radius of the fourth fan-shaped area. The distance is less than the third distance, and the fan angle of the fourth sector area is the corresponding current non-player perspective of the local online terminal in the multiplayer VR scene.