CN107886938B - Virtual reality guidance hypnosis voice processing method and device - Google Patents

Virtual reality guidance hypnosis voice processing method and device

Info

Publication number
CN107886938B
Authority
CN
China
Prior art keywords
voice
hypnosis
hypnotic
guide
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201610868130.6A
Other languages
Chinese (zh)
Other versions
CN107886938A (en)
Inventor
李荣茂
臧珊珊
刘燕君
陈昳丽
朱艳春
陈鸣闽
谢耀钦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Institute of Advanced Technology of CAS
Original Assignee
Shenzhen Institute of Advanced Technology of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Institute of Advanced Technology of CAS
Priority to CN201610868130.6A
Publication of CN107886938A
Application granted
Publication of CN107886938B

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00: Speech synthesis; Text to speech systems
    • G10L13/02: Methods for producing synthetic speech; Speech synthesisers
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00: Handling natural language data
    • G06F40/20: Natural language analysis
    • G06F40/279: Recognition of textual entities
    • G06F40/289: Phrasal analysis, e.g. finite state techniques or chunking
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • General Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Machine Translation (AREA)

Abstract

The invention discloses a virtual reality guided hypnosis voice processing method and device. The method comprises: performing text analysis on the hypnotic guide words to obtain their text level information; performing voice analysis on the hypnotic guide words to obtain their voice prosody (rhythm) information; searching a hypnosis voice library for the voice units corresponding to the text level information and voice prosody information, wherein the hypnosis voice library stores voice units generated from a corpus recorded by a user susceptible person, that is, a person to whose voice the user is hypnotically susceptible; synthesizing the retrieved voice units into hypnotic voice data; and combining the hypnotic voice data with a virtual reality hypnosis scene and outputting the virtual reality hypnosis guiding voice. The invention can improve the user's hypnotic susceptibility and optimize the hypnosis effect.

Description

Virtual reality guidance hypnosis voice processing method and device
Technical Field
The invention relates to the technical field of virtual reality guided hypnosis, in particular to a virtual reality guided hypnosis voice processing method and device.
Background
Existing virtual reality guided hypnosis technology generally combines a fixed, standardized hypnosis voice (recorded by a professional announcer) with a virtual reality hypnosis scene to guide the user into a hypnotic state. Although such a voice comes close to what the hypnosis process requires in aspects such as tone and timbre, it cannot meet the user's need to enter hypnosis faster and more effectively, so the hypnosis effect is poor.
Disclosure of Invention
The embodiment of the invention provides a virtual reality guidance hypnosis voice processing method, which is used for improving the hypnosis susceptibility of a user and optimizing the hypnosis effect, and comprises the following steps:
performing text analysis on the hypnotic guide words to obtain text level information of the hypnotic guide words;
carrying out voice analysis on the hypnosis guide words to obtain voice rhythm information of the hypnosis guide words;
searching corresponding voice units from a hypnosis voice library according to text level information and voice rhythm information of the hypnosis guide language, wherein the hypnosis voice library stores voice units generated according to linguistic data input by a user susceptible person;
synthesizing the searched voice unit into hypnotic voice data;
and synthesizing the hypnosis voice data with the virtual reality hypnosis scene, and outputting the virtual reality hypnosis guiding voice.
In one embodiment, performing text analysis on the hypnotic guide words to obtain their text level information includes: performing language, grammar and semantic analysis on the text of the hypnotic guide words to obtain word information, phrase information and sentence information, together with information on the relationships among the words, phrases and sentences.
In one embodiment, the corpus is recorded by the user susceptible person according to hypnotic voice characteristics; and/or the corpus is recorded by the user susceptible person at a specified sampling rate and speech resolution.
In one embodiment, the corpus is recorded by the user susceptible person in a dialect; and/or the corpus is recorded by the user susceptible person in a personalized language.
In one embodiment, the method further comprises: establishing the hypnosis voice library from the corpus recorded by the user susceptible person and updating it in real time; and splitting the corpus in the hypnosis voice library using a hidden Markov model to construct voice units;
synthesizing the retrieved voice units into hypnotic voice data comprises: selecting, splicing and synthesizing the retrieved voice units using a hidden Markov model.
The embodiment of the invention also provides a virtual reality guidance hypnosis voice processing device, which is used for improving the hypnosis susceptibility of users and optimizing the hypnosis effect, and comprises the following components:
the text analysis module is used for carrying out text analysis on the hypnosis guide words to obtain text level information of the hypnosis guide words;
the voice analysis module is used for carrying out voice analysis on the hypnosis guide words to obtain voice rhythm information of the hypnosis guide words;
the voice query module is used for searching corresponding voice units from a hypnosis voice library according to the text level information and the voice rhythm information of the hypnosis guide language, and the hypnosis voice library stores voice units generated according to the linguistic data input by the susceptible person of the user;
the voice synthesis module is used for synthesizing the searched voice unit into hypnotic voice data;
and the voice output module is used for synthesizing the hypnosis voice data and the virtual reality hypnosis scene and outputting the virtual reality hypnosis guiding voice.
In one embodiment, the text analysis module is further configured to: perform language, grammar and semantic analysis on the text of the hypnotic guide words to obtain word information, phrase information and sentence information, together with information on the relationships among the words, phrases and sentences.
In one embodiment, the corpus is recorded by the user susceptible person according to hypnotic voice characteristics; and/or the corpus is recorded by the user susceptible person at a specified sampling rate and speech resolution.
In one embodiment, the corpus is recorded by the user susceptible person in a dialect; and/or the corpus is recorded by the user susceptible person in a personalized language.
In one embodiment, the apparatus further comprises: a corpus processing module, configured to establish the hypnosis voice library from the corpus recorded by the user susceptible person and update it in real time, and to split the corpus in the hypnosis voice library using a hidden Markov model to construct voice units;
the speech synthesis module is further configured to: select, splice and synthesize the retrieved voice units using a hidden Markov model.
The embodiment of the invention takes into account the match between the voice used in the hypnosis process and the user's hypnotic susceptibility. Using speech synthesis, it replaces the original standardized hypnosis voice with a synthesized voice of the user susceptible person and, in combination with the characteristics of hypnotic guidance, outputs a hypnosis voice to which the user is sensitive, thereby improving the user's hypnotic susceptibility and optimizing the hypnosis effect. The embodiment also provides a scheme for synthesizing the voice automatically, which avoids the complexity of manual on-site recording, meets users' needs, and allows a person with no background in hypnosis to output hypnosis voice automatically, complete the hypnosis process, and help the user enter the hypnotic state more easily.
Drawings
To illustrate the technical solutions in the embodiments of the present invention more clearly, the drawings used in the description of the embodiments are briefly introduced below. The drawings described below show only some embodiments of the present invention, and those skilled in the art can obtain other drawings from them without creative effort. In the drawings:
fig. 1 is a schematic diagram of a virtual reality guided hypnotic speech processing method according to an embodiment of the present invention;
fig. 2 is a schematic diagram illustrating an embodiment of a virtual reality guided hypnotic speech processing method according to an embodiment of the present invention;
fig. 3 is a schematic diagram of a virtual reality guidance hypnosis voice processing device in an embodiment of the present invention;
fig. 4 is a schematic diagram of an embodiment of a virtual reality guided hypnosis voice processing apparatus according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention more apparent, the embodiments of the present invention are further described in detail below with reference to the accompanying drawings. The exemplary embodiments and descriptions of the present invention are provided to explain the present invention, but not to limit the present invention.
The inventors observed that current virtual reality guided hypnosis technology, which uses a fixed, standardized hypnosis voice, cannot meet users' needs, and this impairs the hypnosis effect. If the hypnosis voice is instead matched to the user's hypnotic susceptibility, the expected effect can be achieved faster and better. On the other hand, the person to whose voice the user is hypnotically susceptible (the user susceptible person) often has no professional hypnosis background and cannot provide the user with a professional and effective hypnotic guidance process. On this basis, the embodiment of the invention provides a virtual reality guided hypnosis voice processing method that optimizes the hypnosis effect by improving the user's hypnotic susceptibility.
Fig. 1 is a schematic diagram of a virtual reality guided hypnotic speech processing method according to an embodiment of the present invention, as shown in fig. 1, the method may include:
step 101, performing text analysis on the hypnotic guide words to obtain text level information of the hypnotic guide words;
step 102, performing voice analysis on the hypnotic guide words to obtain voice prosody (rhythm) information of the hypnotic guide words;
step 103, searching a hypnosis voice library for the corresponding voice units according to the text level information and the voice prosody information of the hypnotic guide words, wherein the hypnosis voice library stores voice units generated from the corpus recorded by the user susceptible person;
step 104, synthesizing the retrieved voice units into hypnotic voice data;
step 105, combining the hypnotic voice data with the virtual reality hypnosis scene, and outputting the virtual reality hypnosis guiding voice.
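As a purely illustrative sketch, the five steps above can be wired together as follows. Every name here (`GuidancePipeline`, `analyze_text`, `analyze_prosody`, `mix_with_scene`, and the library keyed by text unit and a coarse prosody label) is hypothetical and not taken from the patent, which does not define a programming interface; step 104 is reduced to naive concatenation for brevity.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

Samples = List[float]
UnitKey = Tuple[str, str]  # (text unit, coarse prosody label): an assumed key format


@dataclass
class GuidancePipeline:
    """Hypothetical wiring of steps 101-105; not the patent's implementation."""
    analyze_text: Callable[[str], List[str]]      # step 101: text units
    analyze_prosody: Callable[[str], List[str]]   # step 102: one label per unit
    library: Dict[UnitKey, Samples]               # hypnosis voice library
    mix_with_scene: Callable[[Samples], Samples]  # step 105

    def run(self, guide_words: str) -> Samples:
        units = self.analyze_text(guide_words)        # step 101
        prosody = self.analyze_prosody(guide_words)   # step 102
        # Step 103: look up one stored voice unit per (text unit, prosody) pair.
        found = [self.library[(u, p)] for u, p in zip(units, prosody)]
        # Step 104: naive concatenation stands in for real synthesis.
        voice = [sample for unit in found for sample in unit]
        return self.mix_with_scene(voice)             # step 105
```

A caller would supply real analysis functions and a library built from the recorded corpus; the lambdas used to exercise the sketch are placeholders only.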
As the flow in fig. 1 shows, the embodiment of the invention takes full account of the fact that different voice characteristics applied to the user during guided hypnosis affect the hypnosis effect differently; in particular, the voice of a user susceptible person brings the user into a specific hypnotic state more easily and so achieves a better hypnosis effect. The embodiment also provides a scheme for synthesizing the voice automatically, which avoids the complexity of manual on-site recording, achieves the milestone of hypnosis voice generated entirely by machine, and outputs hypnosis voice with the voice characteristics of the user susceptible person, meeting the user's needs. In addition, by means of speech synthesis the embodiment allows a person with no background in hypnosis to output hypnosis voice automatically, complete the hypnosis process, and help the user enter the hypnotic state more easily.
In a specific implementation, a corpus of the user susceptible person can be collected in advance to establish the hypnosis voice library, which stores voice units generated from the corpus recorded by the user susceptible person. In an embodiment, the corpus is first designed according to the characteristics of the hypnosis voice to be output; the user susceptible person then records the corpus under specific requirements; and the recorded corpus is analyzed and organized to establish the required hypnosis voice library. The corpus may be recorded by the user susceptible person according to hypnotic voice characteristics, for example by requiring the recorder to keep a consistent volume, a slow speaking rate, clear pronunciation and moderate emotion. The corpus may also be recorded at a specified sampling rate and speech resolution, for example by requiring the recorder to produce speech with a high signal-to-noise ratio at a specific sampling rate and speech resolution, so that the corpus is more standardized.
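A minimal sketch of how a recording could be checked against a specified sampling rate and speech resolution is given below, using Python's standard `wave` module. The target values of 16000 Hz and 16 bits are assumptions chosen for illustration; the patent does not state specific values. The helper `make_silence` is hypothetical and exists only to produce test input.

```python
import io
import wave
from typing import List


def check_recording(wav_bytes: bytes,
                    expected_rate: int = 16000,   # assumed value, not from the patent
                    expected_bits: int = 16) -> List[str]:
    """Return a list of format problems; an empty list means the file conforms."""
    problems = []
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav:
        rate = wav.getframerate()
        bits = wav.getsampwidth() * 8  # sample width is reported in bytes
        if rate != expected_rate:
            problems.append(f"sampling rate {rate} Hz, expected {expected_rate} Hz")
        if bits != expected_bits:
            problems.append(f"resolution {bits} bit, expected {expected_bits} bit")
    return problems


def make_silence(rate: int, bits: int, seconds: float = 0.1) -> bytes:
    """Hypothetical helper: build a short mono WAV of silence in memory."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(bits // 8)
        wav.setframerate(rate)
        wav.writeframes(b"\x00" * (int(rate * seconds) * (bits // 8)))
    return buf.getvalue()
```

A check of this kind only verifies the container format; the signal-to-noise requirement mentioned above would need a separate measurement.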
In addition, the embodiment of the invention uses speech synthesis to address the problem that, during virtual reality guided hypnosis, the user is unfamiliar with and insensitive to the hypnotist's voice, which impairs the hypnosis effect. Speech synthesis is used to automatically output a voice to which the user is hypnotically susceptible, establishing an emotional connection with the user through language characteristics and thereby optimizing the hypnosis effect. Because the user's own dialect, or the voice of a person the user trusts, brings the user into a specific hypnotic state more easily and achieves a better hypnosis effect, the corpus may be recorded by the user susceptible person in a dialect and/or in a personalized language. Outputting a personalized hypnosis voice with local characteristics in this way meets the user's needs. Since the key purpose of introducing speech synthesis is to output a voice to which the user is hypnotically susceptible, it is recommended that the recorder of the corpus be specifically selected.
After the user susceptible person records the corpus, the hypnosis voice library is established from it and updated in real time. While the library is being established and updated, the corpus in the hypnosis voice library can be split using a hidden Markov model to construct voice units.
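The patent does not say how the hidden Markov model splits the corpus. One common approach, shown here only as an assumed illustration, is forced alignment: the units of a recorded sentence are treated as the states of a left-to-right HMM, and the Viterbi algorithm assigns each audio frame to a state, so that state changes mark unit boundaries. The function names, the per-frame log-likelihood input, and the transition probabilities 0.9 and 0.1 are all hypothetical.

```python
import math
from typing import List, Tuple


def align_units(frame_loglik: List[List[float]],
                stay_logp: float = math.log(0.9),
                move_logp: float = math.log(0.1)) -> List[int]:
    """Viterbi alignment for a left-to-right HMM.

    frame_loglik[t][s] is the log-likelihood of frame t under state s.
    The path must start in state 0, end in the last state, and at each
    frame either stay in its state or advance by exactly one state.
    """
    n_frames, n_states = len(frame_loglik), len(frame_loglik[0])
    if n_frames < n_states:
        raise ValueError("need at least one frame per state")
    neg_inf = float("-inf")
    score = [[neg_inf] * n_states for _ in range(n_frames)]
    back = [[0] * n_states for _ in range(n_frames)]
    score[0][0] = frame_loglik[0][0]
    for t in range(1, n_frames):
        for s in range(n_states):
            stay = score[t - 1][s] + stay_logp
            move = score[t - 1][s - 1] + move_logp if s > 0 else neg_inf
            if move > stay:
                score[t][s], back[t][s] = move + frame_loglik[t][s], s - 1
            else:
                score[t][s], back[t][s] = stay + frame_loglik[t][s], s
    # Trace back from the final state at the last frame.
    path = [n_states - 1]
    for t in range(n_frames - 1, 0, -1):
        path.append(back[t][path[-1]])
    path.reverse()
    return path


def segments(path: List[int]) -> List[Tuple[int, int, int]]:
    """Collapse a per-frame state path into (state, start, end_exclusive) spans."""
    out, start = [], 0
    for t in range(1, len(path) + 1):
        if t == len(path) or path[t] != path[start]:
            out.append((path[start], start, t))
            start = t
    return out
```

In a real system the log-likelihoods would come from acoustic models trained on speech features; here they are simply taken as given.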
When virtual reality guided hypnosis voice needs to be output, text analysis is performed on the hypnotic guide words to obtain their text level information, and voice analysis is performed on the hypnotic guide words to obtain their voice prosody (rhythm) information. The text analysis may include: performing language, grammar and semantic analysis on the text of the hypnotic guide words to obtain word information, phrase information and sentence information, together with information on the relationships among the words, phrases and sentences.
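The word, phrase and sentence hierarchy described above can be pictured with the following minimal sketch. It assumes English-like text in which sentences end with `.`, `!` or `?`, phrases are separated by `,`, `;` or `:`, and words are separated by whitespace; Chinese guide words would instead need a word segmenter, and the grammatical and semantic analysis named in the patent is not attempted here. The function name `text_hierarchy` is hypothetical.

```python
import re
from typing import Dict, List


def text_hierarchy(guide_words: str) -> List[Dict]:
    """Split guide words into a sentence > phrase > word tree (punctuation-based)."""
    sentences = [s.strip() for s in re.split(r"[.!?]+", guide_words) if s.strip()]
    tree = []
    for sentence in sentences:
        phrases = [p.strip() for p in re.split(r"[,;:]+", sentence) if p.strip()]
        tree.append({
            "sentence": sentence,
            # Each phrase keeps its words, so parent-child relationships are explicit.
            "phrases": [{"phrase": p, "words": p.split()} for p in phrases],
        })
    return tree
```

The nesting itself records the relationship information: each word belongs to exactly one phrase and each phrase to exactly one sentence.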
In a specific embodiment, text analysis is first performed on the text version of the hypnotic guide words at the language, grammar and semantic levels to obtain their hierarchical information, that is, the hierarchical relationships among phrases, sentences and so on. For example, in view of the characteristics of virtual reality hypnosis, the hypnotic guide words agreed with professional hypnotists mainly include progressive relaxation guidance and hypnosis scene guidance, and analyzing the grammar and semantics of this text yields information on the words, phrases and sentences in the guide words. Prosody analysis is then performed at the voice level of the hypnotic guide words, for example by analyzing the tone, intonation and loudness of the corresponding speech, to obtain prosody information at the voice level.
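As an assumed illustration of voice-level prosody analysis, the sketch below computes two simple per-frame features: loudness as root-mean-square amplitude, and pitch as the lag that maximizes the frame's autocorrelation. The patent names tone, intonation and loudness but does not specify how they are measured, so these estimators, the function names, and the 80 to 400 Hz search range are all hypothetical choices.

```python
import math
from typing import List, Tuple


def rms(frame: List[float]) -> float:
    """Root-mean-square amplitude, a simple loudness measure."""
    return math.sqrt(sum(x * x for x in frame) / len(frame))


def estimate_f0(frame: List[float], rate: int,
                min_f0: float = 80.0, max_f0: float = 400.0) -> float:
    """Estimate pitch in Hz as rate / (lag with the largest autocorrelation)."""
    min_lag = max(1, int(rate / max_f0))
    max_lag = min(int(rate / min_f0), len(frame) - 1)
    best_lag, best_score = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        score = sum(frame[n] * frame[n + lag] for n in range(len(frame) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return rate / best_lag


def prosody_contour(samples: List[float], rate: int,
                    frame_len: int) -> List[Tuple[float, float]]:
    """Return (loudness, pitch) for each non-overlapping frame."""
    frames = [samples[i:i + frame_len]
              for i in range(0, len(samples) - frame_len + 1, frame_len)]
    return [(rms(f), estimate_f0(f, rate)) for f in frames]
```

The sequence of pitch values across frames gives a rough intonation contour. The pitch estimate is not meaningful for silent or unvoiced frames, which a fuller analysis would detect and skip.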
After the text level information and voice prosody information of the hypnotic guide words are obtained, the corresponding voice units are looked up in the hypnosis voice library and then synthesized into hypnotic voice data. In a specific implementation, a hidden Markov model can be used to select, splice and synthesize the retrieved voice units. Synthesizing the voice units extracted from the hypnosis voice library yields the required voice data, namely hypnotic voice data to which the user is susceptible. In a specific example, because hypnotic guidance is delivered gently and with emotion, the speaking rate must be controlled and emotion imparted to the hypnosis voice during synthesis.
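The lookup and splicing described above can be sketched as follows. Note that this sketch deliberately substitutes a simpler technique for the hidden Markov model named in the patent: units are selected by nearest pitch to a target value and joined with a linear crossfade. The library layout (text unit mapped to candidate recordings with a stored pitch) and both function names are hypothetical.

```python
from typing import Dict, List, Tuple

Candidate = Tuple[float, List[float]]  # (pitch in Hz, samples): an assumed layout


def select_unit(library: Dict[str, List[Candidate]],
                text_unit: str, target_f0: float) -> List[float]:
    """Pick the stored recording of text_unit whose pitch is closest to the target."""
    candidates = library[text_unit]
    return min(candidates, key=lambda c: abs(c[0] - target_f0))[1]


def crossfade_concat(units: List[List[float]], overlap: int) -> List[float]:
    """Join units, blending the last `overlap` samples of one into the next."""
    out = list(units[0])
    for unit in units[1:]:
        n = min(overlap, len(out), len(unit))
        base = len(out) - n
        for i in range(n):
            w = (i + 1) / (n + 1)  # weight of the incoming unit rises across the overlap
            out[base + i] = (1 - w) * out[base + i] + w * unit[i]
        out.extend(unit[n:])
    return out
```

The crossfade softens the joins between units; controlling speaking rate and emotion, as the paragraph above requires, is outside the scope of this sketch.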
Finally, the hypnotic voice data is combined with the virtual reality hypnosis scene, and the virtual reality hypnosis guiding voice is output. For example, the synthesized hypnotic voice data can be adjusted, optimized and finalized under the guidance of a professional hypnotist, and then imported into the virtual reality hypnosis scene to output the virtual reality hypnosis guiding voice.
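The patent does not describe how the voice data is combined with the scene. One plausible reading, assumed here for illustration only, is that the voice is mixed over the scene's ambient audio track. The sketch below overlays the voice at a given sample offset, lowers the scene level while the voice plays, and clips the result to the range [-1, 1]. The function name and gain parameters are hypothetical.

```python
from typing import List


def mix_voice_into_scene(scene: List[float], voice: List[float], start: int,
                         scene_gain: float = 0.3,
                         voice_gain: float = 1.0) -> List[float]:
    """Overlay voice on scene audio from sample `start`, ducking the scene under it."""
    length = max(len(scene), start + len(voice))
    out = []
    for i in range(length):
        ambient = scene[i] if i < len(scene) else 0.0
        j = i - start
        if 0 <= j < len(voice):
            mixed = scene_gain * ambient + voice_gain * voice[j]
        else:
            mixed = ambient  # scene plays at full level outside the voice span
        out.append(max(-1.0, min(1.0, mixed)))  # hard clip to the valid range
    return out
```

For example, with `scene_gain=0.5` a constant scene level of 0.5 drops to 0.25 while the voice is present and returns to 0.5 afterwards.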
Fig. 2 is a schematic diagram of a specific example of a virtual reality guided hypnotic speech processing method according to an embodiment of the present invention, as shown in fig. 2, in this example, for a virtual reality hypnotic scene, a corpus is first designed, a user susceptible person inputs the recorded corpus, so as to establish a hypnotic speech library, and a Hidden Markov Model (HMM) is used to construct a speech unit to be stored in the hypnotic speech library; after the hypnosis guide words are determined, performing text analysis on the hypnosis guide words to obtain text level information of the hypnosis guide words, and performing voice analysis on the hypnosis guide words to obtain voice rhythm information of the hypnosis guide words; then, according to the text level information and the voice rhythm information of the hypnosis guide language, searching a corresponding voice unit from the hypnosis voice library, and synthesizing the searched voice unit into hypnosis voice data; and optimizing the hypnosis voice data, synthesizing the hypnosis voice data with the virtual reality hypnosis scene, and finally outputting the virtual reality hypnosis guiding voice.
As the above embodiments show, the embodiment of the invention uses speech synthesis: a corpus is collected from the user susceptible person in advance, capturing specific sentences in the voice to which the user is hypnotically susceptible, and a hypnosis voice library is established from it. Thereafter, only text information needs to be provided; voice analysis, voice unit extraction, synthesis and related operations then output hypnosis voice in the voice of a person to whom the user is more sensitive, enhancing the hypnosis effect.
Based on the same inventive concept, the embodiment of the present invention further provides a virtual reality guided hypnosis voice processing apparatus, as described in the following embodiments. Because the apparatus solves the problem on a principle similar to that of the virtual reality guided hypnosis voice processing method, its implementation can refer to the implementation of the method, and the common details are not described again.
Fig. 3 is a schematic diagram of a virtual reality guided hypnosis voice processing apparatus according to an embodiment of the present invention, as shown in fig. 3, the apparatus may include:
the text analysis module 301 is configured to perform text analysis on the hypnotic guide to obtain text level information of the hypnotic guide;
the voice analysis module 302 is configured to perform voice analysis on the hypnotic guide language to obtain voice prosody information of the hypnotic guide language;
the voice query module 303 is configured to search a corresponding voice unit from a hypnosis voice library according to text level information and voice prosody information of the hypnosis guide language, where the hypnosis voice library stores voice units generated according to a corpus input by a user susceptible person;
the voice synthesis module 304 is used for synthesizing the searched voice unit into hypnotic voice data;
and the voice output module 305 is configured to synthesize the hypnotic voice data and the virtual reality hypnotic scene, and output the virtual reality hypnotic guiding voice.
In a specific implementation, the text analysis module 301 may be further configured to: perform language, grammar and semantic analysis on the text of the hypnotic guide words to obtain word information, phrase information and sentence information, together with information on the relationships among the words, phrases and sentences.
In a specific implementation, the corpus can be recorded by the user susceptible person according to hypnotic voice characteristics; and/or the corpus can be recorded by the user susceptible person at a specified sampling rate and speech resolution.
In a specific implementation, the corpus can be recorded by the user susceptible person in a dialect; and/or the corpus can be recorded by the user susceptible person in a personalized language.
Fig. 4 is a schematic diagram of an embodiment of a virtual reality guided hypnosis voice processing apparatus according to an embodiment of the present invention, and as shown in fig. 4, the apparatus shown in fig. 3 may further include: a corpus processing module 401, configured to establish and update the hypnosis voice library in real time according to the corpus entered by the user susceptible person; splitting the corpus in the hypnosis voice library by using a hidden Markov model to construct a voice unit;
the speech synthesis module 304 may be further configured to: select, splice and synthesize the retrieved voice units using a hidden Markov model.
In summary, building on existing virtual reality hypnosis technology and considering that voices to which the user is more or less sensitive affect the hypnosis effect differently, the embodiment of the invention combines speech synthesis with the virtual reality hypnosis scene, improves on the original standardized hypnosis voice (recorded by a professional announcer), and outputs guiding voice to which the user is hypnotically susceptible, thereby achieving a more effective hypnotic state. The embodiment provides a scheme for synthesizing voice automatically and can output a variety of hypnosis voices with local characteristics to meet users' needs. In addition, by means of speech synthesis, standardized hypnosis voice in the specific voice to which the user is susceptible can be synthesized and output to complete the hypnosis process and help the user enter the hypnotic state more easily.
The embodiment of the invention can be applied to virtual reality guided hypnosis for clinical respiratory control during radiotherapy of patients with thoracic and abdominal tumors.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
The above-mentioned embodiments are intended to illustrate the objects, technical solutions and advantages of the present invention in further detail, and it should be understood that the above-mentioned embodiments are only exemplary embodiments of the present invention, and are not intended to limit the scope of the present invention, and any modifications, equivalent substitutions, improvements and the like made within the spirit and principle of the present invention should be included in the scope of the present invention.

Claims (6)

1. A virtual reality guidance hypnosis voice processing method is characterized by comprising the following steps:
performing text analysis on the hypnotic guide words to obtain text level information of the hypnotic guide words;
carrying out voice analysis on the hypnosis guide words to obtain voice rhythm information of the hypnosis guide words;
searching corresponding voice units from a hypnosis voice library according to text level information and voice rhythm information of the hypnosis guide language, wherein the hypnosis voice library stores voice units generated according to linguistic data input by a user susceptible person;
synthesizing the searched voice unit into hypnotic voice data;
synthesizing the hypnosis voice data with the virtual reality hypnosis scene, and outputting virtual reality hypnosis guiding voice;
wherein the corpus is recorded by the user susceptible person in a dialect, as required and according to the hypnotic voice characteristics; and/or the corpus is recorded by the user susceptible person in a personalized language at a specified sampling rate and speech resolution.
2. The method as set forth in claim 1, wherein performing text analysis on the hypnotic guide words to obtain the text level information of the hypnotic guide words comprises: performing language, grammar and semantic analysis on the text of the hypnotic guide words to obtain word information, phrase information and sentence information, together with information on the relationships among the words, phrases and sentences.
3. The method of any of claims 1 to 2, further comprising: establishing and updating the hypnosis voice library in real time according to the linguistic data input by the user susceptible person; splitting the corpus in the hypnosis voice library by using a hidden Markov model to construct a voice unit;
the synthesizing of the searched voice unit into the hypnotic voice data comprises the following steps: and selecting, splicing and synthesizing the searched voice units by using a hidden Markov model.
4. A virtual reality guidance hypnosis voice processing device, characterized by comprising:
a text analysis module, configured to perform text analysis on hypnotic guide words to obtain text level information of the hypnotic guide words;
a voice analysis module, configured to perform voice analysis on the hypnotic guide words to obtain voice rhythm information of the hypnotic guide words;
a voice query module, configured to search a hypnosis voice library for corresponding voice units according to the text level information and the voice rhythm information of the hypnotic guide words, wherein the hypnosis voice library stores voice units generated from a corpus entered by a user susceptible person;
a voice synthesis module, configured to synthesize the retrieved voice units into hypnotic voice data;
a voice output module, configured to combine the hypnotic voice data with a virtual reality hypnosis scene and to output virtual reality hypnosis guiding voice;
wherein the corpus is entered by the user susceptible person in a dialect, as required, according to hypnosis voice characteristics; and/or the corpus is entered by the user susceptible person in a personalized language at a specified sampling rate and speech resolution.
5. The apparatus of claim 4, wherein the text analysis module is further configured to: perform linguistic, grammatical and semantic analysis on the text of the hypnotic guide words to obtain word information, phrase information and sentence information in the hypnotic guide words, and relationship information among the words, phrases and sentences.
6. The apparatus of any of claims 4 to 5, further comprising: a corpus processing module, configured to establish and update the hypnosis voice library in real time according to the corpus entered by the user susceptible person, and to split the corpus in the hypnosis voice library by using a hidden Markov model to construct the voice units;
wherein the voice synthesis module is further configured to: select, splice and synthesize the retrieved voice units by using a hidden Markov model.
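
The data flow recited in the claims (text analysis, voice rhythm analysis, voice unit lookup, synthesis, and combination with the scene audio) can be illustrated with a minimal Python sketch. It is not part of the claims and is not the patented implementation. Every name in it (`UnitKey`, `analyze_text`, `analyze_prosody`, `lookup_units`, `synthesize`, `mix_with_scene`, `guide_to_audio`) is hypothetical, the "normal"/"slow" rhythm labels are invented for illustration, and plain concatenation is swapped in for the hidden Markov model selection and splicing recited in claims 3 and 6.

```python
from dataclasses import dataclass
from typing import Dict, List

Samples = List[float]  # mono audio samples as floats; a stand-in for real audio


@dataclass(frozen=True)
class UnitKey:
    word: str     # text level information, reduced here to a single word
    prosody: str  # coarse rhythm label: "normal" or "slow" (hypothetical)


def analyze_text(guide: str) -> List[str]:
    """Toy text analysis: split the guide words into lowercase word tokens."""
    tokens = (w.strip(".,;!?").lower() for w in guide.split())
    return [t for t in tokens if t]


def analyze_prosody(words: List[str]) -> List[str]:
    """Toy rhythm analysis: label the final word "slow", the rest "normal"."""
    last = len(words) - 1
    return ["slow" if i == last else "normal" for i in range(len(words))]


def lookup_units(library: Dict[UnitKey, Samples],
                 words: List[str], prosody: List[str]) -> List[Samples]:
    """Find one stored voice unit per word, falling back to "normal" rhythm."""
    units = []
    for word, label in zip(words, prosody):
        unit = library.get(UnitKey(word, label))
        if unit is None:
            unit = library.get(UnitKey(word, "normal"))
        if unit is None:
            raise KeyError(f"no voice unit recorded for {word!r}")
        units.append(unit)
    return units


def synthesize(units: List[Samples]) -> Samples:
    """Concatenate the selected units (assumption: no HMM smoothing here)."""
    return [sample for unit in units for sample in unit]


def mix_with_scene(voice: Samples, scene: Samples,
                   voice_gain: float = 0.8, scene_gain: float = 0.2) -> Samples:
    """Overlay the voice track on the scene audio, zero-padding the shorter one."""
    n = max(len(voice), len(scene))
    v = voice + [0.0] * (n - len(voice))
    s = scene + [0.0] * (n - len(scene))
    return [voice_gain * a + scene_gain * b for a, b in zip(v, s)]


def guide_to_audio(guide: str, library: Dict[UnitKey, Samples],
                   scene: Samples) -> Samples:
    """Run the whole chain: analysis, lookup, synthesis, mixing."""
    words = analyze_text(guide)
    units = lookup_units(library, words, analyze_prosody(words))
    return mix_with_scene(synthesize(units), scene)
```

In this sketch the voice library is a dictionary keyed by word and rhythm label, so a call such as `guide_to_audio("Relax, now.", library, scene)` returns a single list of mixed samples. A real system would key units on richer text level and rhythm features and would smooth the joins between units rather than concatenating them directly.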

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610868130.6A CN107886938B (en) 2016-09-29 2016-09-29 Virtual reality guidance hypnosis voice processing method and device

Publications (2)

Publication Number Publication Date
CN107886938A (en) 2018-04-06
CN107886938B (en) 2020-11-17

Family

ID=61769106

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610868130.6A Active CN107886938B (en) 2016-09-29 2016-09-29 Virtual reality guidance hypnosis voice processing method and device

Country Status (1)

Country Link
CN (1) CN107886938B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109473090A (en) * 2018-09-30 2019-03-15 北京光年无限科技有限公司 A kind of narration data processing method and processing device towards intelligent robot
CN113545781B (en) * 2021-07-20 2024-06-07 浙江工商职业技术学院 Virtual reality sleep-promoting method and device

Citations (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020002460A1 (en) * 1999-08-31 2002-01-03 Valery Pertrushin System method and article of manufacture for a voice messaging expert system that organizes voice messages based on detected emotions
CN1717720A (en) * 2003-09-05 2006-01-04 松下电器产业株式会社 Acoustic processing system, acoustic processing device, acoustic processing method, acoustic processing program, and storage medium
CN1815551A (en) * 2006-02-28 2006-08-09 安徽中科大讯飞信息科技有限公司 Method for conducting text dialect treatment for dialect voice synthesizing system
CN1929655A (en) * 2006-09-28 2007-03-14 中山大学 Mobile phone capable of realizing text and voice conversion
CN101051458A (en) * 2006-04-04 2007-10-10 中国科学院自动化研究所 Rhythm phrase predicting method based on module analysis
CN101064104A (en) * 2006-04-24 2007-10-31 中国科学院自动化研究所 Emotion voice creating method based on voice conversion
CN101187990A (en) * 2007-12-14 2008-05-28 华南理工大学 A session robotic system
CN101198277A (en) * 2005-02-22 2008-06-11 海尔思-斯玛特有限公司 Methods and systems for physiological and psycho-physiological monitoring and uses thereof
CN101488342A (en) * 2008-12-31 2009-07-22 广东协联科贸发展有限公司 Human-machine language interaction deduction system and intelligent implementing method for human-machine language interaction demand response
US20090306979A1 (en) * 2008-06-10 2009-12-10 Peeyush Jaiswal Data processing system for autonomously building speech identification and tagging data
CN101685634A (en) * 2008-09-27 2010-03-31 上海盛淘智能科技有限公司 Children speech emotion recognition method
CN201453284U (en) * 2009-06-17 2010-05-12 上海风点信息科技有限公司 Psychological therapy system
CN101772015A (en) * 2008-12-29 2010-07-07 卢中江 Method for starting up mobile terminal through voice password
CN102063903A (en) * 2010-09-25 2011-05-18 中国科学院深圳先进技术研究院 Speech interactive training system and speech interactive training method
CN102201234A (en) * 2011-06-24 2011-09-28 北京宇音天下科技有限公司 Speech synthesizing method based on tone automatic tagging and prediction
CN102294068A (en) * 2010-06-22 2011-12-28 上海迈辉信息技术有限公司 Intelligent feedback music relaxation hypnosis treatment system and realizing method thereof
CN102430182A (en) * 2011-09-01 2012-05-02 汪卫东 Feedback hypnosis therapeutic instrument
CN102496363A (en) * 2011-11-11 2012-06-13 北京宇音天下科技有限公司 Correction method for Chinese speech synthesis tone
CN102527045A (en) * 2012-01-14 2012-07-04 李慈 Intelligent learning doll and realizing method and circuit system thereof
CN102779508A (en) * 2012-03-31 2012-11-14 安徽科大讯飞信息科技股份有限公司 Speech corpus generating device and method, speech synthesizing system and method
CN103035241A (en) * 2012-12-07 2013-04-10 中国科学院自动化研究所 Model complementary Chinese rhythm interruption recognition system and method
CN103117057A (en) * 2012-12-27 2013-05-22 安徽科大讯飞信息科技股份有限公司 Application method of special human voice synthesis technique in mobile phone cartoon dubbing
CN103236259A (en) * 2013-03-22 2013-08-07 乐金电子研发中心(上海)有限公司 Voice recognition processing and feedback system, voice response method
CN103500244A (en) * 2013-09-06 2014-01-08 雷路德 Virtual friend conversational system and method thereof
CN103544140A (en) * 2012-07-12 2014-01-29 国际商业机器公司 Data processing method, display method and corresponding devices
CN104050962A (en) * 2013-03-16 2014-09-17 李军 Multifunctional reader based on voice synthesis technology
CN104217713A (en) * 2014-07-15 2014-12-17 西北师范大学 Tibetan-Chinese speech synthesis method and device
US20150174362A1 (en) * 2013-12-17 2015-06-25 Juliana Stoianova Panova Adjuvant Method for the Interface of Psychosomatic Approaches and Technology for Improving Medical Outcomes
CN105596007A (en) * 2014-11-24 2016-05-25 张湘民 Snore position minitype recorder
CN105749402A (en) * 2016-02-24 2016-07-13 陕西科技大学 Deep hypnosis system and method based on feedback mechanism
CN105892616A (en) * 2016-03-29 2016-08-24 宇龙计算机通信科技(深圳)有限公司 Terminal control method, terminal control device and terminal
CN105912104A (en) * 2016-04-01 2016-08-31 宇龙计算机通信科技(深圳)有限公司 Power management method and device for virtual-reality device

Also Published As

Publication number Publication date
CN107886938A (en) 2018-04-06

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
EE01 Entry into force of recordation of patent licensing contract

Application publication date: 20180406

Assignee: Shenzhen achievement transformation Center Co.,Ltd.

Assignor: SHENZHEN INSTITUTES OF ADVANCED TECHNOLOGY CHINESE ACADEMY OF SCIENCES

Contract record no.: X2023980044076

Denomination of invention: Virtual reality guided hypnotic speech processing method and device

Granted publication date: 20201117

License type: Common License

Record date: 20231023
