CN106098057B - Playing speech rate management method and device - Google Patents

Playing speech rate management method and device Download PDF

Info

Publication number
CN106098057B
CN106098057B CN201610412991.3A CN201610412991A CN106098057B CN 106098057 B CN106098057 B CN 106098057B CN 201610412991 A CN201610412991 A CN 201610412991A CN 106098057 B CN106098057 B CN 106098057B
Authority
CN
China
Prior art keywords
playing
tts
target
text
field
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201610412991.3A
Other languages
Chinese (zh)
Other versions
CN106098057A (en
Inventor
周海
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Unisound Intelligent Technology Co Ltd
Original Assignee
Beijing Yunzhisheng Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Yunzhisheng Information Technology Co Ltd filed Critical Beijing Yunzhisheng Information Technology Co Ltd
Priority to CN201610412991.3A priority Critical patent/CN106098057B/en
Publication of CN106098057A publication Critical patent/CN106098057A/en
Application granted granted Critical
Publication of CN106098057B publication Critical patent/CN106098057B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/033Voice editing, e.g. manipulating the voice of the synthesiser

Abstract

The invention relates to a play speech rate management method and a play speech rate management device, wherein the method comprises the following steps: acquiring a TTS playing text to be played; judging a target field to which the TTS playing text belongs; determining a target playing speed corresponding to the target field; and playing the TTS playing text according to the target playing speed. According to the technical scheme, the TTS playing text to be played is obtained, the corresponding target playing speed is determined for the TTS playing text according to the target field to which the TTS playing text belongs, and therefore the TTS playing text is played according to the target playing speed, different fields are played at different speeds, playing can be conducted in a targeted mode, not all texts are played at one speed, therefore, a user can be guaranteed to clearly listen to the playing content of the playing text in any scene, and the use experience of the user is improved.

Description

Playing speech rate management method and device
Technical Field
The invention relates to the technical field of voice processing, in particular to a playing speed management method and device.
Background
TTS is an abbreviation of Text To Speech, i.e., "from Text To Speech," which is part of a human-machine conversation To enable a machine To speak.
It applies the outstanding actions of linguistics and psychology at the same time, and under the support of built-in chip, it can intelligently convert the characters into natural speech flow by means of the design of neural network. The TTS technology carries out real-time conversion on the text file, and the conversion time can be calculated in seconds. Under the action of a special intelligent voice controller, the voice rhythm of the text output is smooth, so that a listener feels natural when listening to information and does not have the feeling of indifference and acerbity of machine voice output.
TTS is one type of speech synthesis application that converts files stored in a computer, such as help files or web pages, into natural speech output. TTS can not only help visually impaired people read information on a computer, but also increase the readability of text documents. Today's TTS applications include voice-driven mail and voice sensitive systems and are often used with voice recognition programs.
Disclosure of Invention
The embodiment of the invention provides a method and a device for managing playing speech speed, which are used for intelligently managing the speech speed fed back by TTS voice.
According to a first aspect of the embodiments of the present invention, there is provided a play speech rate management method, including:
acquiring a TTS playing text to be played;
judging a target field to which the TTS playing text belongs;
determining a target playing speed corresponding to the target field;
and playing the TTS playing text according to the target playing speed.
In the embodiment, the TTS playing text to be played is acquired, and the corresponding target playing speed is determined for the TTS playing text according to the target field to which the TTS playing text belongs, so that the TTS playing text is played according to the target playing speed, and thus, different fields are played at different speeds, and the TTS playing text can be played in a targeted manner, instead of all texts being played at one speed, so that a user can clearly listen to the playing content of the playing text in any scene, and the use experience of the user is improved.
For example, when a user encounters a [ safety ] related voice prompt while driving a car, the TTS speech rate in the [ safety ] field will be used, slower.
When a user listens to music and meets the voice prompt of reminding, the TTS speed in the field of reminding is used and is faster.
When the user receives the voice prompt of 'advertisement' in the voice broadcast, the TTS speed in the field of 'advertisement' is used, and the speed is particularly fast.
When the user receives the ordinary operation voice prompt, the default TTS speed and the standard speed in the ordinary field are used.
In one embodiment, the determining a target domain to which the TTS playing text belongs includes:
acquiring a keyword label contained in the TTS playing text;
and determining the target field of the TTS playing text according to the keyword tag.
In this embodiment, a keyword tag may be preset in each TTS playing text to indicate a field to which the TTS playing text belongs, so that the field to which the TTS playing text belongs is conveniently determined according to the keyword tag.
For example, if a TTS playing text belongs to the security field, a keyword tag [ security ] can be preset for the TTS playing text, so that the field of the TTS playing text can be determined conveniently and quickly.
In one embodiment, the determining a target playing speed corresponding to the target domain includes:
and determining a target playing speed corresponding to the target field according to the corresponding relation between the preset field and the preset playing speed.
In this embodiment, a user or a manufacturer may preset a plurality of fields and a corresponding play speech rate for each field. The setting fields comprise fields such as safety, reminding, advertisement and common, wherein the speed of speech corresponding to the field of safety is 180 words per minute for 120 plus materials, the speed of speech corresponding to the field of common is 240 words per minute for 180 plus materials, the speed of speech corresponding to the field of reminding is 300 words per minute for 240 plus materials, and the speed of speech corresponding to the field of advertisement is 360 words per minute for 300 plus materials. Thus, the target playing speed corresponding to the target field can be determined according to the corresponding relation between the preset field and the preset playing speed.
In one embodiment, the method further comprises:
receiving an input setting command;
and setting preset fields and playing speed corresponding to each field according to the setting command.
In this embodiment, a user or a manufacturer may preset a plurality of fields and a corresponding play speech rate for each field. The setting fields comprise fields such as safety, reminding, advertisement and common, wherein the speed of speech corresponding to the field of safety is 180 words per minute for 120 plus materials, the speed of speech corresponding to the field of common is 240 words per minute for 180 plus materials, the speed of speech corresponding to the field of reminding is 300 words per minute for 240 plus materials, and the speed of speech corresponding to the field of advertisement is 360 words per minute for 300 plus materials.
In one embodiment, the method further comprises:
when a voice command input by a user is received, determining a TTS playing text to be played corresponding to the voice command.
In this embodiment, it may be determined that the TTS that should be fed back plays the text according to a voice command input by the user. Thereby meeting the playing requirements of users.
According to a second aspect of the embodiments of the present invention, there is provided a playback speech rate management apparatus, including:
the obtaining module is used for obtaining a TTS playing text to be played;
the judging module is used for judging the target field of the TTS playing text;
the first determining module is used for determining a target playing speed corresponding to the target field;
and the playing module is used for playing the TTS playing text according to the target playing speed.
In one embodiment, the determining module comprises:
the obtaining submodule is used for obtaining a keyword tag contained in the TTS playing text;
and the domain determining submodule is used for determining the target domain of the TTS playing text according to the keyword tag.
In one embodiment, the first determining module comprises:
and the speech rate determining submodule is used for determining a target playing speech rate corresponding to the target field according to the corresponding relation between the preset field and the preset playing speech rate.
In one embodiment, the apparatus further comprises:
the receiving module is used for receiving an input setting command;
and the setting module is used for setting the preset fields and the playing speed corresponding to each field according to the setting command.
In one embodiment, the apparatus further comprises:
and the second determining module is used for determining the TTS playing text to be played corresponding to the voice command when the voice command input by the user is received.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the invention, as claimed.
Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
The technical solution of the present invention is further described in detail by the accompanying drawings and embodiments.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the invention and together with the description, serve to explain the principles of the invention.
Fig. 1 is a flow diagram illustrating a play speech rate management method according to an example embodiment.
Fig. 2 is a flowchart illustrating a step S102 in a play speech rate management method according to an exemplary embodiment.
Fig. 3 is a flowchart illustrating step S103 of a play speech rate management method according to an exemplary embodiment.
FIG. 4 is a flow diagram illustrating another method of play speech rate management according to an example embodiment.
FIG. 5 is a flow diagram illustrating another method of play speech rate management according to an example embodiment.
Fig. 6 is a block diagram illustrating a playback speech rate management apparatus according to an example embodiment.
Fig. 7 is a block diagram illustrating a determination module in a playback speech rate management apparatus according to an example embodiment.
Fig. 8 is a block diagram illustrating a first determination module in a playback speech rate management apparatus according to an example embodiment.
Fig. 9 is a block diagram illustrating yet another playback speech rate management apparatus according to an example embodiment.
Fig. 10 is a block diagram illustrating yet another playback speech rate management apparatus according to an example embodiment.
Detailed Description
Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The embodiments described in the following exemplary embodiments do not represent all embodiments consistent with the present invention. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the invention, as detailed in the appended claims.
Fig. 1 is a flow diagram illustrating a play speech rate management method according to an example embodiment. The playing speech rate management can be applied to playing equipment, and the playing equipment can be any equipment with a speech playing function, such as a mobile phone, a computer, a digital broadcast terminal, a message receiving and sending equipment, a game console, a tablet equipment, a medical equipment, a body-building equipment, a personal digital assistant, a vehicle-mounted player and the like. As shown in fig. 1, the method comprises steps S101-S104:
in step S101, a TTS play text to be played is acquired;
in step S102, a target field to which the TTS played text belongs is determined;
in step S103, determining a target playing speed corresponding to the target field;
in step S104, the TTS playing text is played according to the target playing speed.
In the embodiment, the TTS playing text to be played is acquired, and the corresponding target playing speed is determined for the TTS playing text according to the target field to which the TTS playing text belongs, so that the TTS playing text is played according to the target playing speed, and thus, different fields are played at different speeds, and the TTS playing text can be played in a targeted manner, instead of all texts being played at one speed, so that a user can clearly listen to the playing content of the playing text in any scene, and the use experience of the user is improved.
For example, when a user encounters a [ safety ] related voice prompt while driving a car, the TTS speech rate in the [ safety ] field will be used, slower.
When a user listens to music and meets the voice prompt of reminding, the TTS speed in the field of reminding is used and is faster.
When the user receives the voice prompt of 'advertisement' in the voice broadcast, the TTS speed in the field of 'advertisement' is used, and the speed is particularly fast.
When the user receives the ordinary operation voice prompt, the default TTS speed and the standard speed in the ordinary field are used.
Fig. 2 is a flowchart illustrating a step S102 in a play speech rate management method according to an exemplary embodiment.
As shown in fig. 2, in one embodiment, the step S102 may include steps S201 to S202:
in step S201, a keyword tag included in the TTS playing text is acquired;
in step S202, a target field to which the TTS playing text belongs is determined according to the keyword tag.
In this embodiment, a keyword tag may be preset in each TTS playing text to indicate a field to which the TTS playing text belongs, so that the field to which the TTS playing text belongs is conveniently determined according to the keyword tag.
For example, if a TTS playing text belongs to the security field, a keyword tag [ security ] can be preset for the TTS playing text, so that the field of the TTS playing text can be determined conveniently and quickly.
Fig. 3 is a flowchart illustrating step S103 of a play speech rate management method according to an exemplary embodiment.
As shown in fig. 3, in an embodiment, the step S103 may include the step S301:
in step S301, a target play speech rate corresponding to the target field is determined according to a correspondence between a preset field and a preset play speech rate.
In this embodiment, a user or a manufacturer may preset a plurality of fields and a corresponding play speech rate for each field. The setting fields comprise fields such as safety, reminding, advertisement and common, wherein the speed of speech corresponding to the field of safety is 180 words per minute for 120 plus materials, the speed of speech corresponding to the field of common is 240 words per minute for 180 plus materials, the speed of speech corresponding to the field of reminding is 300 words per minute for 240 plus materials, and the speed of speech corresponding to the field of advertisement is 360 words per minute for 300 plus materials. Thus, the target playing speed corresponding to the target field can be determined according to the corresponding relation between the preset field and the preset playing speed.
FIG. 4 is a flow diagram illustrating another method of play speech rate management according to an example embodiment.
As shown in fig. 4, in an embodiment, the method further includes steps S401-S402:
in step S401, an input setting command is received;
in step S402, according to the setting command, a preset domain and a playing speed corresponding to each domain are set.
In this embodiment, a user or a manufacturer may preset a plurality of fields and a corresponding play speech rate for each field. For example, as shown in table 1, the setting fields include fields such as "safety", "reminder", "advertisement" and "general", where the speed of speech corresponding to the field "safety" is 120-.
TABLE 1
FIELD Playing speed of speech
Security 120-
General purpose 180-
Reminding system 240-300 words/minute
Advertising 300 + 360 words/minute
FIG. 5 is a flow diagram illustrating another method of play speech rate management according to an example embodiment.
As shown in fig. 5, in an embodiment, before step S101, the method further includes step S501:
in step S501, when a voice command input by a user is received, a TTS play text to be played corresponding to the voice command is determined.
In this embodiment, it may be determined that the TTS that should be fed back plays the text according to a voice command input by the user. Thereby meeting the playing requirements of users.
The following are embodiments of the apparatus of the present invention that may be used to perform embodiments of the method of the present invention.
Fig. 6 is a block diagram illustrating a playback speech rate management apparatus, which may be implemented as part or all of a playback device through software, hardware, or a combination of both, according to an example embodiment. As shown in fig. 6, the playback speech rate management apparatus includes:
an obtaining module 61, configured to obtain a TTS playing text to be played;
the judging module 62 is configured to judge a target field to which the TTS played text belongs;
a first determining module 63, configured to determine a target playing speed corresponding to the target domain;
and the playing module 64 is configured to play the TTS playing text according to the target playing speed.
In the embodiment, the TTS playing text to be played is acquired, and the corresponding target playing speed is determined for the TTS playing text according to the target field to which the TTS playing text belongs, so that the TTS playing text is played according to the target playing speed, and thus, different fields are played at different speeds, and the TTS playing text can be played in a targeted manner, instead of all texts being played at one speed, so that a user can clearly listen to the playing content of the playing text in any scene, and the use experience of the user is improved.
For example, when a user encounters a [ safety ] related voice prompt while driving a car, the TTS speech rate in the [ safety ] field will be used, slower.
When a user listens to music and meets the voice prompt of reminding, the TTS speed in the field of reminding is used and is faster.
When the user receives the voice prompt of 'advertisement' in the voice broadcast, the TTS speed in the field of 'advertisement' is used, and the speed is particularly fast.
When the user receives the ordinary operation voice prompt, the default TTS speed and the standard speed in the ordinary field are used.
Fig. 7 is a block diagram illustrating a determination module in a playback speech rate management apparatus according to an example embodiment.
As shown in fig. 7, in one embodiment, the determining module 62 includes:
the obtaining submodule 71 is configured to obtain a keyword tag included in the TTS playing text;
and the domain determining submodule 72 is configured to determine, according to the keyword tag, a target domain to which the TTS playing text belongs.
In this embodiment, a keyword tag may be preset in each TTS playing text to indicate a field to which the TTS playing text belongs, so that the field to which the TTS playing text belongs is conveniently determined according to the keyword tag.
For example, if a TTS playing text belongs to the security field, a keyword tag [ security ] can be preset for the TTS playing text, so that the field of the TTS playing text can be determined conveniently and quickly.
Fig. 8 is a block diagram illustrating a first determination module in a playback speech rate management apparatus according to an example embodiment.
As shown in fig. 8, in one embodiment, the first determining module 63 includes:
and a speech rate determining submodule 81, configured to determine a target playing speech rate corresponding to the target field according to a correspondence between a preset field and a preset playing speech rate.
In this embodiment, a user or a manufacturer may preset a plurality of fields and a corresponding play speech rate for each field. The setting fields comprise fields such as safety, reminding, advertisement and common, wherein the speed of speech corresponding to the field of safety is 180 words per minute for 120 plus materials, the speed of speech corresponding to the field of common is 240 words per minute for 180 plus materials, the speed of speech corresponding to the field of reminding is 300 words per minute for 240 plus materials, and the speed of speech corresponding to the field of advertisement is 360 words per minute for 300 plus materials. Thus, the target playing speed corresponding to the target field can be determined according to the corresponding relation between the preset field and the preset playing speed.
Fig. 9 is a block diagram illustrating yet another playback speech rate management apparatus according to an example embodiment.
As shown in fig. 9, in one embodiment, the apparatus further comprises:
a receiving module 91, configured to receive an input setting command;
and the setting module 92 is configured to set a preset field and a playing speed corresponding to each field according to the setting command.
In this embodiment, a user or a manufacturer may preset a plurality of fields and a corresponding play speech rate for each field. The setting fields comprise fields such as safety, reminding, advertisement and common, wherein the speed of speech corresponding to the field of safety is 180 words per minute for 120 plus materials, the speed of speech corresponding to the field of common is 240 words per minute for 180 plus materials, the speed of speech corresponding to the field of reminding is 300 words per minute for 240 plus materials, and the speed of speech corresponding to the field of advertisement is 360 words per minute for 300 plus materials.
Fig. 10 is a block diagram illustrating yet another playback speech rate management apparatus according to an example embodiment.
As shown in fig. 10, in one embodiment, the apparatus further comprises:
a second determining module 1001, configured to determine, when a voice command input by a user is received, a to-be-played TTS playing text corresponding to the voice command.
In this embodiment, it may be determined that the TTS that should be fed back plays the text according to a voice command input by the user. Thereby meeting the playing requirements of users.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
It will be apparent to those skilled in the art that various changes and modifications may be made in the present invention without departing from the spirit and scope of the invention. Thus, if such modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalents, the present invention is also intended to include such modifications and variations.

Claims (8)

1. A play speech rate management method is characterized by comprising the following steps:
acquiring a TTS playing text to be played;
judging a target field to which the TTS playing text belongs;
determining a target playing speed corresponding to the target field;
playing the TTS playing text according to the target playing speed;
the judging of the target field of the TTS playing text comprises the following steps:
acquiring a keyword label contained in the TTS playing text;
and determining the target field of the TTS playing text according to the keyword tag.
2. The method according to claim 1, wherein the determining a target playback speech rate corresponding to the target domain comprises:
and determining a target playing speed corresponding to the target field according to the corresponding relation between the preset field and the preset playing speed.
3. The method of claim 2, further comprising:
receiving an input setting command;
and setting preset fields and playing speed corresponding to each field according to the setting command.
4. The method of claim 1, further comprising:
when a voice command input by a user is received, determining a TTS playing text to be played corresponding to the voice command.
5. A playback speech rate management apparatus, comprising:
the obtaining module is used for obtaining a TTS playing text to be played;
the judging module is used for judging the target field of the TTS playing text;
the first determining module is used for determining a target playing speed corresponding to the target field;
the playing module is used for playing the TTS playing text according to the target playing speed;
the judging module comprises:
the obtaining submodule is used for obtaining a keyword tag contained in the TTS playing text;
and the domain determining submodule is used for determining the target domain of the TTS playing text according to the keyword tag.
6. The apparatus of claim 5, wherein the first determining module comprises:
and the speech rate determining submodule is used for determining a target playing speech rate corresponding to the target field according to the corresponding relation between the preset field and the preset playing speech rate.
7. The apparatus of claim 6, further comprising:
the receiving module is used for receiving an input setting command;
and the setting module is used for setting the preset fields and the playing speed corresponding to each field according to the setting command.
8. The apparatus of claim 5, further comprising:
and the second determining module is used for determining the TTS playing text to be played corresponding to the voice command when the voice command input by the user is received.
CN201610412991.3A 2016-06-13 2016-06-13 Playing speech rate management method and device Active CN106098057B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610412991.3A CN106098057B (en) 2016-06-13 2016-06-13 Playing speech rate management method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610412991.3A CN106098057B (en) 2016-06-13 2016-06-13 Playing speech rate management method and device

Publications (2)

Publication Number Publication Date
CN106098057A CN106098057A (en) 2016-11-09
CN106098057B true CN106098057B (en) 2020-02-07

Family

ID=57846433

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610412991.3A Active CN106098057B (en) 2016-06-13 2016-06-13 Playing speech rate management method and device

Country Status (1)

Country Link
CN (1) CN106098057B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106534964B (en) * 2016-11-23 2020-02-14 广东小天才科技有限公司 Method and device for adjusting speech rate
CN108231058A (en) * 2016-12-17 2018-06-29 鸿富锦精密电子(天津)有限公司 Voice auxiliary testing system and voice auxiliary test methods
CN107170456A (en) * 2017-06-28 2017-09-15 北京云知声信息技术有限公司 Method of speech processing and device
CN111312280B (en) * 2020-01-16 2023-11-07 北京百度网讯科技有限公司 Method and apparatus for controlling speech

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07110807A (en) * 1993-10-13 1995-04-25 Hitachi Ltd Document reader
CN104732975A (en) * 2013-12-20 2015-06-24 华为技术有限公司 Method and device for voice instant messaging
CN104681023A (en) * 2015-02-15 2015-06-03 联想(北京)有限公司 Information processing method and electronic equipment
CN105405439B (en) * 2015-11-04 2019-07-05 科大讯飞股份有限公司 Speech playing method and device

Also Published As

Publication number Publication date
CN106098057A (en) 2016-11-09

Similar Documents

Publication Publication Date Title
CN106098057B (en) Playing speech rate management method and device
RU2682023C1 (en) Digital personal assistant interaction with impersonations and rich multimedia in responses
US20090055186A1 (en) Method to voice id tag content to ease reading for visually impaired
CN105895105B (en) Voice processing method and device
WO2008001500A1 (en) Audio content generation system, information exchange system, program, audio content generation method, and information exchange method
TW202006532A (en) Broadcast voice determination method, device and apparatus
WO2005093713A1 (en) Speech synthesis device
US9047858B2 (en) Electronic apparatus
US20080104512A1 (en) Method and apparatus for providing realtime feedback in a voice dialog system
CN109995939B (en) Information processing method and electronic device
CN107170456A (en) Method of speech processing and device
Guerreiro et al. Scanning for digital content: How blind and sighted people perceive concurrent speech
CN109346057A (en) A kind of speech processing system of intelligence toy for children
JPWO2014069220A1 (en) REPRODUCTION DEVICE, SETTING DEVICE, REPRODUCTION METHOD, AND PROGRAM
Truschin et al. Designing interfaces for multiple-goal environments: Experimental insights from in-vehicle speech interfaces
TW201621883A (en) Personalized audio and/or video shows
CN106847273B (en) Awakening word selection method and device for voice recognition
Maharjan et al. Experiences of a speech-enabled conversational agent for the self-report of well-being among people living with affective disorders: an in-the-wild study
CN109460548B (en) Intelligent robot-oriented story data processing method and system
CN106202087A (en) A kind of information recommendation method and device
CN105957528A (en) Audio processing method and apparatus
Yoshino et al. Japanese dialogue corpus of information navigation and attentive listening annotated with extended iso-24617-2 dialogue act tags
CN112185341A (en) Dubbing method, apparatus, device and storage medium based on speech synthesis
Zhong et al. Effects of text supplementation on speech intelligibility for listeners with normal and impaired hearing: A systematic review with implications for telecommunication
Schmitz et al. Modelling personality in voices of talking products through prosodic parameters

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP03 Change of name, title or address
CP03 Change of name, title or address

Address after: Room 101, 1st floor, building 1, Xisanqi building materials City, Haidian District, Beijing 100096

Patentee after: Yunzhisheng Intelligent Technology Co.,Ltd.

Address before: 100191 Beijing, Huayuan Road, Haidian District No. 2 peony technology building, 5 floor, A503

Patentee before: BEIJING UNISOUND INFORMATION TECHNOLOGY Co.,Ltd.