CN107886938B

CN107886938B - Virtual reality guidance hypnosis voice processing method and device

Info

Publication number: CN107886938B
Application number: CN201610868130.6A
Authority: CN
Inventors: 李荣茂; 臧珊珊; 刘燕君; 陈昳丽; 朱艳春; 陈鸣闽; 谢耀钦
Original assignee: Shenzhen Institute of Advanced Technology of CAS
Current assignee: Shenzhen Institute of Advanced Technology of CAS
Priority date: 2016-09-29
Filing date: 2016-09-29
Publication date: 2020-11-17
Anticipated expiration: 2036-09-29
Also published as: CN107886938A

Abstract

The invention discloses a virtual reality guidance hypnosis voice processing method and a device, wherein the method comprises the following steps: performing text analysis on the hypnotic guide words to obtain text level information of the hypnotic guide words; carrying out voice analysis on the hypnosis guide words to obtain voice rhythm information of the hypnosis guide words; searching corresponding voice units from a hypnosis voice library according to text level information and voice rhythm information of the hypnosis guide language, wherein the hypnosis voice library stores voice units generated according to linguistic data input by a user susceptible person; synthesizing the searched voice unit into hypnotic voice data; and synthesizing the hypnosis voice data with the virtual reality hypnosis scene, and outputting the virtual reality hypnosis guiding voice. The invention can improve the hypnosis susceptibility of users and optimize the hypnosis effect.

Description

Virtual reality guidance hypnosis voice processing method and device

Technical Field

The invention relates to the technical field of virtual reality guided hypnosis, in particular to a virtual reality guided hypnosis voice processing method and device.

Background

The existing virtual reality guided hypnosis technology generally uses fixed standardized hypnosis voice (recorded by professional announcers) to be synthesized with a virtual reality hypnosis scene to guide a user to enter a hypnosis state. Although the fixed standardized hypnotic voice guides the user to have hypnosis in various aspects such as tone, timbre and the like close to the voice requirement of the hypnosis process, the requirement of the user on faster and better hypnosis cannot be met, and the hypnosis effect is poor.

Disclosure of Invention

The embodiment of the invention provides a virtual reality guidance hypnosis voice processing method, which is used for improving the hypnosis susceptibility of a user and optimizing the hypnosis effect, and comprises the following steps:

performing text analysis on the hypnotic guide words to obtain text level information of the hypnotic guide words;

carrying out voice analysis on the hypnosis guide words to obtain voice rhythm information of the hypnosis guide words;

searching corresponding voice units from a hypnosis voice library according to text level information and voice rhythm information of the hypnosis guide language, wherein the hypnosis voice library stores voice units generated according to linguistic data input by a user susceptible person;

synthesizing the searched voice unit into hypnotic voice data;

and synthesizing the hypnosis voice data with the virtual reality hypnosis scene, and outputting the virtual reality hypnosis guiding voice.

In one embodiment, the text analysis of the hypnotic guide to obtain the text hierarchy information of the hypnotic guide includes: and performing text language, grammar and semantic analysis on the hypnotic guide words to obtain word information, phrase information and sentence information in the hypnotic guide words and relationship information among the words, the phrases and the sentences.

In one embodiment, the corpus is input by a user-sensible person according to hypnotic voice characteristics; and/or, the corpus is entered by a user susceptible person at a specified sampling rate and speech resolution.

In one embodiment, the corpus is entered by a user susceptible using a dialect; and/or, the corpus is entered by a user susceptible person using a personalized language.

In one embodiment, the method further comprises: establishing and updating the hypnosis voice library in real time according to the linguistic data input by the user susceptible person; splitting the corpus in the hypnosis voice library by using a hidden Markov model to construct a voice unit;

the synthesizing of the searched voice unit into the hypnotic voice data comprises the following steps: and selecting, splicing and synthesizing the searched voice units by using a hidden Markov model.

The embodiment of the invention also provides a virtual reality guidance hypnosis voice processing device, which is used for improving the hypnosis susceptibility of users and optimizing the hypnosis effect, and comprises the following components:

the text analysis module is used for carrying out text analysis on the hypnosis guide words to obtain text level information of the hypnosis guide words;

the voice analysis module is used for carrying out voice analysis on the hypnosis guide words to obtain voice rhythm information of the hypnosis guide words;

the voice query module is used for searching corresponding voice units from a hypnosis voice library according to the text level information and the voice rhythm information of the hypnosis guide language, and the hypnosis voice library stores voice units generated according to the linguistic data input by the susceptible person of the user;

the voice synthesis module is used for synthesizing the searched voice unit into hypnotic voice data;

and the voice output module is used for synthesizing the hypnosis voice data and the virtual reality hypnosis scene and outputting the virtual reality hypnosis guiding voice.

In one embodiment, the text analysis module is further configured to: and performing text language, grammar and semantic analysis on the hypnotic guide words to obtain word information, phrase information and sentence information in the hypnotic guide words and relationship information among the words, the phrases and the sentences.

In one embodiment, the apparatus further comprises: the corpus processing module is used for establishing and updating the hypnosis voice library in real time according to the corpus input by the user susceptible person; splitting the corpus in the hypnosis voice library by using a hidden Markov model to construct a voice unit;

the speech synthesis module is further to: and selecting, splicing and synthesizing the searched voice units by using a hidden Markov model.

In the embodiment of the invention, the matching of the hypnosis process voice and the hypnosis susceptibility of the user is considered, the voice synthesis technology is used, the original standardized hypnosis voice is changed, the voice of the user susceptible person is synthesized, and the hypnosis voice sensitive to the user is finally output by combining the characteristics of hypnosis guidance, so that the hypnosis susceptibility of the user is improved, and the hypnosis effect is optimized. Meanwhile, the embodiment of the invention provides a scheme for automatically synthesizing the voice, so that the complexity of field manual recording is avoided, the requirements of users are met, the automatic voice output device can help people without any hypnosis knowledge background to automatically output hypnosis voice, complete the hypnosis process and help the users to better enter the hypnosis state.

Drawings

In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed to be used in the description of the embodiments will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without creative efforts. In the drawings:

fig. 1 is a schematic diagram of a virtual reality guided hypnotic speech processing method according to an embodiment of the present invention;

fig. 2 is a schematic diagram illustrating an embodiment of a virtual reality guided hypnotic speech processing method according to an embodiment of the present invention;

fig. 3 is a schematic diagram of a virtual reality guidance hypnosis voice processing device in an embodiment of the present invention;

fig. 4 is a schematic diagram of an embodiment of a virtual reality guided hypnosis voice processing apparatus according to an embodiment of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the embodiments of the present invention more apparent, the embodiments of the present invention are further described in detail below with reference to the accompanying drawings. The exemplary embodiments and descriptions of the present invention are provided to explain the present invention, but not to limit the present invention.

The inventor considers that the current virtual reality hypnosis guiding technology cannot meet the requirements of users by using fixed and standardized hypnosis voice, and has influence on the hypnosis effect. And if the hypnosis voice is considered to be matched with the hypnosis susceptibility of the user, the expected effect can be realized more quickly and better. On the other hand, the hypnotic susceptible sound object of the user often has no professional hypnotic technical background, and cannot provide a professional and effective hypnotic guiding process for the user. Based on this, the embodiment of the invention provides a virtual reality guided hypnosis voice processing method, which optimizes the hypnosis effect by improving the hypnosis susceptibility of users.

Fig. 1 is a schematic diagram of a virtual reality guided hypnotic speech processing method according to an embodiment of the present invention, as shown in fig. 1, the method may include:

step 101, performing text analysis on the hypnotic guide words to obtain text level information of the hypnotic guide words;

102, carrying out voice analysis on the hypnosis guide language to obtain voice rhythm information of the hypnosis guide language;

103, searching a corresponding voice unit from a hypnosis voice library according to the text level information and the voice rhythm information of the hypnosis guide language, wherein the hypnosis voice library stores the voice unit generated according to the linguistic data input by the susceptible person of the user;

step 104, synthesizing the searched voice unit into hypnotic voice data;

and 105, synthesizing the hypnosis voice data with the virtual reality hypnosis scene, and outputting the virtual reality hypnosis guiding voice.

It can be known from the flow shown in fig. 1 that the embodiment of the present invention fully considers that different hypnotic sound characteristics applied to users during the process of guiding hypnosis will have different effects on the hypnotic effect, wherein the users are more easily brought into a specific hypnotic state by the voices of the users susceptible, thereby achieving a better hypnotic effect. Meanwhile, the embodiment of the invention provides a scheme for automatically synthesizing the voice, so that the complexity of field manual recording is avoided, the staged achievement of hypnosis completely generated by a machine is realized, the hypnosis voice with the voice characteristics of the user susceptible can be output, and the user requirements are met. On the other hand, the embodiment of the invention can help people without any hypnosis knowledge background to automatically output hypnosis voice by means of the voice synthesis technology, complete the hypnosis process and help users to better enter the hypnosis state.

In specific implementation, the linguistic data of the user susceptible can be collected in the early stage to establish a hypnosis voice library. The hypnosis voice library stores voice units generated according to the linguistic data input by the user susceptible person. In the embodiment, the corpus can be designed according to the hypnosis voice characteristics to be output, then the user is allowed to record the corpus under the specific requirement, and then the recorded corpus is analyzed and set to establish the required hypnosis voice library. In specific implementation, the corpus recorded by the user susceptible person may be recorded by the user susceptible person according to the hypnotic voice feature. According to the hypnotic voice characteristics, the user is likely to input the corpus under a specific requirement, for example, when recording the corpus, the recorder is required to have consistent volume, slow speed, clear pronunciation, and moderate emotion. The linguistic data input by the user susceptible person can also be input by the user susceptible person under the specified sampling rate and the speech resolution. For example, a recorder is required to record high snr speech at a specific sampling rate and speech resolution, making the corpus more standard.

In addition, in view of the embodiment of the invention, aiming at solving the problem that the hypnotic effect is influenced because the user is not familiar and sensitive to the voice of a hypnotic teacher in the virtual reality hypnotic guiding process by means of the voice synthesis technology, the voice synthesis technology is used for automatically outputting the sound easy to induce the hypnosis of the user in the virtual reality hypnotic guiding process, so that the emotional connection is established with the user in the aspect of language characteristics, and the hypnotic effect is optimized; the dialect of the user or the voice of the trusted person of the user can more easily enable the user to enter a specific hypnotic state, so that a better hypnotic effect is achieved, and therefore the linguistic data input by the susceptible person of the user can be input by the dialect of the susceptible person of the user and/or can be input by the susceptible person of the user by using a personalized language. Therefore, the requirement of the user is met by outputting the hypnosis voice with local characteristics and individuation. Since the key point of the embodiment of the invention for introducing the speech synthesis technology is to output the speech of the user with a hypnotic sensibility, the recorder of the previous corpus is suggested to be selected specifically.

After the user susceptible person inputs the linguistic data, the hypnosis voice library is established and updated in real time according to the linguistic data input by the user susceptible person. In the process of establishing and updating the hypnosis voice library, the linguistic data can be split in the hypnosis voice library by using a hidden Markov model to construct a voice unit.

When virtual reality guidance hypnosis voice needs to be output, text analysis needs to be carried out on the hypnosis guidance words to obtain text level information of the hypnosis guidance words; and carrying out voice analysis on the hypnosis guide words to obtain voice rhythm information of the hypnosis guide words. The text analysis of the hypnotic guide is performed to obtain the text hierarchy information of the hypnotic guide, and the text hierarchy information may include: and performing text language, grammar and semantic analysis on the hypnotic guide words to obtain word information, phrase information and sentence information in the hypnotic guide words and relationship information among the words, the phrases and the sentences.

In a specific embodiment, when text analysis and voice layer prosody analysis of the hypnotic guide are performed, text analysis may be performed on the hypnotic guide of the text version, and the analysis may be performed on the language layer, the grammar layer, and the semantic layer, respectively, to obtain the hierarchical information of the hypnotic guide, that is, the hierarchical relationship of phrases, sentences, and the like; for example, in combination with the virtual reality hypnosis feature, the hypnosis guide words negotiated with professional hypnotics mainly include progressive relaxation guide, hypnosis scene guide and the like, and the information such as words, phrases, sentences and the like in the hypnosis guide words is obtained by analyzing the grammatical and semantic meanings of the text information. Then, prosody analysis is performed on the basis of the voice layer of the hypnotic guide, for example, information such as tone, intonation, loudness, and the like of the voice corresponding to the hypnotic guide is analyzed to obtain prosody information of the voice layer.

After the text level information and the voice rhythm information of the hypnosis guide language are obtained, corresponding voice units are searched from the hypnosis voice library according to the text level information and the voice rhythm information of the hypnosis guide language, and then the searched voice units are synthesized into hypnosis voice data. In specific implementation, the hidden markov model can be used for selecting, splicing and synthesizing the searched voice units. And correspondingly synthesizing the voice units extracted from the hypnosis voice library to obtain the required voice data, namely the hypnosis voice data which is easy to be sensed by the user. In a specific example, the hypnotic guidance is output in a gentle and emotional manner, so it is necessary to control the speed and impart emotion to the hypnotic voice during the voice synthesis process.

And finally, synthesizing the hypnosis voice data with the virtual reality hypnosis scene, and outputting the virtual reality hypnosis guiding voice. For example, the synthesized hypnosis voice data can be adjusted, optimized and finally shaped under the guidance of a professional hypnotizer, and then the synthesized hypnosis voice data is guided into a virtual reality hypnosis scene to output virtual reality hypnosis guiding voice.

Fig. 2 is a schematic diagram of a specific example of a virtual reality guided hypnotic speech processing method according to an embodiment of the present invention, as shown in fig. 2, in this example, for a virtual reality hypnotic scene, a corpus is first designed, a user susceptible person inputs the recorded corpus, so as to establish a hypnotic speech library, and a Hidden Markov Model (HMM) is used to construct a speech unit to be stored in the hypnotic speech library; after the hypnosis guide words are determined, performing text analysis on the hypnosis guide words to obtain text level information of the hypnosis guide words, and performing voice analysis on the hypnosis guide words to obtain voice rhythm information of the hypnosis guide words; then, according to the text level information and the voice rhythm information of the hypnosis guide language, searching a corresponding voice unit from the hypnosis voice library, and synthesizing the searched voice unit into hypnosis voice data; and optimizing the hypnosis voice data, synthesizing the hypnosis voice data with the virtual reality hypnosis scene, and finally outputting the virtual reality hypnosis guiding voice.

It can be known from the above embodiments that the embodiment of the present invention uses a speech synthesis technology, and performs corpus collection on a user susceptible person in an early stage to collect specific sentences of the user hypnotic susceptible sound, thereby establishing a hypnotic speech library, and then performs speech analysis, speech unit extraction, synthesis, and other operations only by providing text information, and finally realizes outputting the hypnotic speech of a person more sensitive to the user, and enhancing the hypnotic effect.

Based on the same inventive concept, the embodiment of the present invention further provides a virtual reality guided hypnosis voice processing apparatus, as described in the following embodiments. Because the principle of the device for solving the problems is similar to that of the virtual reality hypnosis guiding voice processing method, the implementation of the device can refer to the implementation of the virtual reality hypnosis guiding voice processing method, and repeated parts are not repeated.

Fig. 3 is a schematic diagram of a virtual reality guided hypnosis voice processing apparatus according to an embodiment of the present invention, as shown in fig. 3, the apparatus may include:

the text analysis module 301 is configured to perform text analysis on the hypnotic guide to obtain text level information of the hypnotic guide;

the voice analysis module 302 is configured to perform voice analysis on the hypnotic guide language to obtain voice prosody information of the hypnotic guide language;

the voice query module 303 is configured to search a corresponding voice unit from a hypnosis voice library according to text level information and voice prosody information of the hypnosis guide language, where the hypnosis voice library stores voice units generated according to a corpus input by a user susceptible person;

the voice synthesis module 304 is used for synthesizing the searched voice unit into hypnotic voice data;

and the voice output module 305 is configured to synthesize the hypnotic voice data and the virtual reality hypnotic scene, and output the virtual reality hypnotic guiding voice.

In a specific implementation, the text analysis module 301 may be further configured to: and performing text language, grammar and semantic analysis on the hypnotic guide words to obtain word information, phrase information and sentence information in the hypnotic guide words and relationship information among the words, the phrases and the sentences.

In specific implementation, the corpus can be input by a user susceptible person according to the hypnotic voice characteristics; and/or the corpus may be entered by a user susceptible at a specified sampling rate and speech resolution.

In specific implementation, the corpus can be input by a user susceptible person using a dialect; and/or the corpus may be entered by a user susceptible person using a personalized language.

Fig. 4 is a schematic diagram of an embodiment of a virtual reality guided hypnosis voice processing apparatus according to an embodiment of the present invention, and as shown in fig. 4, the apparatus shown in fig. 3 may further include: a corpus processing module 401, configured to establish and update the hypnosis voice library in real time according to the corpus entered by the user susceptible person; splitting the corpus in the hypnosis voice library by using a hidden Markov model to construct a voice unit;

the speech synthesis module 304 may be further operable to: and selecting, splicing and synthesizing the searched voice units by using a hidden Markov model.

In summary, the embodiment of the present invention combines the speech synthesis technology and the virtual reality hypnosis scenario based on the existing virtual reality hypnosis technology, considering that the speech characteristics with different sensitivities have different effects on the hypnosis effect of the user, improves the original standardized (recorded by a professional broadcaster) hypnosis speech, and finally outputs the guidance speech of the user's hypnosis susceptibility, thereby achieving a more effective hypnosis state. The embodiment of the invention provides a scheme for automatically synthesizing voice, and can output various hypnotic voices with local characteristics and meet the requirements of users. On the other hand, by means of the voice synthesis technology, the standardized hypnosis easy-induction specific hypnosis voice can be synthesized and output, the hypnosis process is completed, and the user is helped to better enter the hypnosis state.

The embodiment of the invention can be applied to the virtual reality guiding hypnosis process of radiotherapy clinical respiration control of the patient with the tumor of the chest and the abdomen.

As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.

The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.

These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.

These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.

The above-mentioned embodiments are intended to illustrate the objects, technical solutions and advantages of the present invention in further detail, and it should be understood that the above-mentioned embodiments are only exemplary embodiments of the present invention, and are not intended to limit the scope of the present invention, and any modifications, equivalent substitutions, improvements and the like made within the spirit and principle of the present invention should be included in the scope of the present invention.

Claims

1. A virtual reality guidance hypnosis voice processing method is characterized by comprising the following steps:

synthesizing the searched voice unit into hypnotic voice data;

synthesizing the hypnosis voice data with the virtual reality hypnosis scene, and outputting virtual reality hypnosis guiding voice;

wherein, the corpus is input by the user sensible person according to the hypnosis voice characteristics and according to the requirement by using the dialect; and/or, the corpus is entered by a user susceptible person using a personalized language at a specified sampling rate and speech resolution.

2. The method as set forth in claim 1, wherein the text analysis of the hypnotic guide to obtain the text hierarchy information of the hypnotic guide comprises: and performing text language, grammar and semantic analysis on the hypnotic guide words to obtain word information, phrase information and sentence information in the hypnotic guide words and relationship information among the words, the phrases and the sentences.

3. The method of any of claims 1 to 2, further comprising: establishing and updating the hypnosis voice library in real time according to the linguistic data input by the user susceptible person; splitting the corpus in the hypnosis voice library by using a hidden Markov model to construct a voice unit;

4. A virtual reality guidance hypnosis voice processing device is characterized by comprising:

the voice output module is used for synthesizing the hypnosis voice data and the virtual reality hypnosis scene and outputting virtual reality hypnosis guiding voice;

5. The apparatus of claim 4, wherein the text analysis module is further to: and performing text language, grammar and semantic analysis on the hypnotic guide words to obtain word information, phrase information and sentence information in the hypnotic guide words and relationship information among the words, the phrases and the sentences.

6. The apparatus of any of claims 4 to 5, further comprising: the corpus processing module is used for establishing and updating the hypnosis voice library in real time according to the corpus input by the user susceptible person; splitting the corpus in the hypnosis voice library by using a hidden Markov model to construct a voice unit;