WO2021180067A1 - 屏幕分区显示的方法、终端、计算机存储介质 - Google Patents
屏幕分区显示的方法、终端、计算机存储介质 Download PDFInfo
- Publication number
- WO2021180067A1 WO2021180067A1 PCT/CN2021/079736 CN2021079736W WO2021180067A1 WO 2021180067 A1 WO2021180067 A1 WO 2021180067A1 CN 2021079736 W CN2021079736 W CN 2021079736W WO 2021180067 A1 WO2021180067 A1 WO 2021180067A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- information
- broadcast
- user
- broadcast information
- voice
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 51
- 238000005192 partition Methods 0.000 title 1
- 230000000007 visual effect Effects 0.000 claims abstract description 27
- 230000014509 gene expression Effects 0.000 abstract description 4
- 238000004590 computer program Methods 0.000 description 5
- 238000010586 diagram Methods 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 238000003058 natural language processing Methods 0.000 description 3
- 238000004891 communication Methods 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 208000001613 Gambling Diseases 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 230000005236 sound signal Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/016—Input arrangements with force or tactile feedback as computer generated output to the user
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0484—Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
Definitions
- This application relates to the field of communication technology, and in particular to a voice broadcast method, device, storage medium and electronic equipment.
- voice is supported and used on more and more devices.
- the voice modal has its own characteristics of use. Long-distance/visual channels occupied/parallel tasks/elderly people with limited abilities are all key use scenarios for voice. In these scenarios, the prominence of the voice key broadcast information becomes very important. Due to the natural nature of the language, these slot information is buried in a large amount of broadcast content, which increases the difficulty of identification. If only important information is presented without affecting the naturalness of language expression, it is difficult to understand.
- the embodiments of the present application provide a voice broadcast method, device, storage medium, and electronic equipment, which can achieve prominent presentation of voice key broadcast information during the voice broadcast process without affecting the naturalness of language expression.
- an embodiment of the present application provides a voice broadcast method, which receives input information from a user, and recognizes the first intention corresponding to the input information;
- the first broadcast information is displayed to the user, wherein when the first broadcast information is displayed, the at least one slot information is prominently displayed through one or more of auditory, tactile, and visual methods.
- the first broadcast information is displayed to the user, wherein, when the first broadcast information is displayed, one of auditory, tactile, and visual Or multiple ways to highlight the at least one slot information, including:
- the first broadcast information is displayed to the user in a visual form, wherein, when the first broadcast information is displayed, first special processing is performed on at least one slot information in the first broadcast information, and the first broadcast information
- a special processing includes one or more of font enlargement processing, font bolding processing, adding underline processing, adjusting higher-level font processing, and font color changing processing.
- the first broadcast information is displayed to the user, wherein, when the first broadcast information is displayed, one of auditory, tactile, and visual Or multiple ways to highlight the at least one slot information, including:
- the first broadcast information is broadcast to the user in a auditory form, wherein, when the at least one slot information is broadcast, a second special processing is performed on the at least one slot information, and the second special processing includes Tone enhancement processing and/or volume enhancement processing.
- the first broadcast information is displayed to the user, wherein, when the first broadcast information is displayed, one of auditory, tactile, and visual Or multiple ways to highlight the at least one slot information, including:
- the first broadcast information is broadcast to the user in a auditory form, wherein when the at least one slot information is broadcast, a vibration is performed.
- the receiving user input information includes:
- the displaying the first broadcast information to the user, wherein when the first broadcast information is displayed, the at least one slot information is prominently displayed through one or more of auditory, tactile and visual methods include:
- the first broadcast information is displayed to the user according to the distance, wherein, when the first broadcast information is displayed, one or more of hearing, touch, and vision is used to highlight the display according to the distance. At least one slot information is described.
- the presenting the first broadcast information to the user includes:
- the first broadcast information is sent to a first device, and the first broadcast information is projected and displayed by the first device.
- an embodiment of the present application provides a voice broadcast device, including:
- the first receiving module is configured to receive user input information, and identify the first intention corresponding to the input information
- the first determining module is configured to determine corresponding first broadcast information according to the first intention, and determine at least one slot in the broadcast information according to a preset correspondence between the first broadcast information and the slot information Information;
- the first display module is used to display the first broadcast information to the user, wherein when the first broadcast information is displayed, the At least one slot information.
- the first display module includes:
- the first display unit is configured to display the first broadcast information to the user in a visual form, wherein, when displaying the first broadcast information, perform a first display on at least one slot information in the first broadcast information.
- a special processing includes one or more of font enlargement processing, font bolding processing, adding underline processing, adjusting higher-level font processing, and font color changing processing.
- the first display module includes:
- the second display unit is used to audibly broadcast the first broadcast information to the user, wherein when the at least one slot information is broadcast, the at least one slot information is subjected to a second special processing,
- the second special processing includes pitch enhancement processing and/or volume enhancement processing.
- the first display module includes:
- the third display unit is configured to audibly broadcast the first broadcast information to the user, wherein when the at least one slot information is broadcast, vibrate.
- the first receiving module includes:
- the first receiving unit is configured to receive voice information of the user
- the first determining unit is configured to analyze the voice information to determine the volume of the voice information
- the second determining unit is configured to determine the distance to the user according to the volume of the voice information
- the first display module includes:
- the fourth display unit is configured to display the first broadcast information to the user according to the distance, wherein when the first broadcast information is displayed, one or more of auditory, tactile, and visual methods are used to display the first broadcast information, The at least one slot information is highlighted according to the distance.
- the first display module includes:
- the screen projection unit is configured to send the first broadcast information to a first device, and use the first device to project and display the first broadcast information.
- an embodiment of the present application provides an electronic device that includes a memory, a processor, a touch sensor, and a display screen.
- the memory stores a computer program
- the processor is connected to the memory, and the The processor executes the computer program to implement the first aspect or the method in any possible implementation manner of the first aspect.
- an embodiment of the present application provides a computer-readable storage medium, including computer instructions, which, when the computer instructions run on an electronic device, cause the electronic device to execute the first aspect or any one of the first aspect.
- the instruction of the method in the implementation mode is not limited to:
- the embodiment of the present invention quickly extracts key slot information during the voice interaction process, reduces the safe sharing caused by voice distraction, and prevents users from missing important or interesting content during audio listening, which improves User experience.
- FIG. 1 is a flowchart of a voice broadcast method provided by an embodiment of the application
- FIG. 2 is a display interface diagram of the first broadcast information provided by an embodiment of the application
- FIG. 3 is another display interface diagram of the first broadcast information provided by an embodiment of the application.
- FIG. 4 is a schematic structural diagram of a voice broadcast device provided by an embodiment of the application.
- FIG. 5 is a schematic structural diagram of an electronic device provided by an embodiment of the application.
- At least one refers to one or more, and “multiple” refers to two or more.
- “And/or” describes the association relationship of the associated objects, indicating that there can be three relationships, for example, A and/or B, which can mean: A alone exists, A and B exist at the same time, and B exists alone, where A, B can be singular or plural.
- the character “/” generally indicates that the associated objects before and after are in an “or” relationship.
- the following at least one item (a)” or similar expressions refers to any combination of these items, including any combination of a single item (a) or a plurality of items (a).
- at least one of a, b, or c can mean: a, b, c, ab, ac, bc, or abc, where a, b, and c can be single or multiple .
- Fig. 1 shows a flowchart of a voice broadcast method provided by an embodiment of the present invention.
- the embodiment of the present invention provides a voice broadcast method, which can be applied to a voice broadcast device.
- the voice broadcast device may be, for example, a terminal device, where the terminal device may be a mobile phone, a tablet computer, a smart watch, a vehicle-mounted terminal, or a smart TV. , Smart speakers, in-vehicle control screens, auxiliary robots or MP3 players, etc., which have the function of displaying audio or other content related to human vision, hearing, smell, touch or taste.
- Step S01 receiving input information from the user, and identifying the first intention corresponding to the input information
- Step S02 Determine the corresponding first broadcast information according to the first intention, and determine at least one slot information in the broadcast information according to the preset correspondence between the first broadcast information and the slot information;
- Step S03 Display the first broadcast information to the user, wherein when the first broadcast information is displayed, at least one slot information is prominently displayed through one or more of auditory, tactile and visual methods.
- the embodiment of the present invention quickly extracts key slot information during the voice interaction process, and highlights the slot information to reduce voice distraction and prevent users from missing important or interesting things in the process of listening to audio.
- the content enhances the user experience.
- step S01 receiving input information from the user, and identifying the first intention corresponding to the input information
- the voice broadcasting device may include an audio receiving unit and/or a user input unit, where the audio receiving unit may receive sound (audio data) and can process such sound into audio data.
- the user input unit may be used to receive inputted numeric or character information, and the user input unit may include a touch panel and other input devices.
- the touch panel also known as the touch screen, can collect the user's touch operations on or near it (for example, the user uses fingers, stylus and other suitable objects or accessories to operate on the touch panel or near the touch panel), And drive the corresponding connection device according to the preset program.
- Other input devices may include, but are not limited to, one or more of a physical keyboard, function keys (such as volume control buttons, switch buttons, etc.), trackball, mouse, joystick, etc., which are not specifically limited here.
- the input information may be voice information received through the audio receiving unit or text information received through the user input unit.
- the voice broadcast device receives the voice information, it recognizes the voice information and converts the voice information into corresponding text information.
- Text message such as: "Where is the capital of China?".
- the voice broadcast device uses intent recognition technology (for example, NLP (Natural Language Processing, natural language processing) technology) to determine the user's intention, that is, the first intention.
- intent recognition technology for example, NLP (Natural Language Processing, natural language processing) technology
- step S02 determine the corresponding first broadcast information according to the first intention, and determine at least one slot information in the broadcast information according to the preset correspondence between the first broadcast information and the slot information.
- the voice broadcast device presets a matching or association relationship between the first intention and the first broadcast information
- the slot information corresponding to each piece of first broadcast information is preset.
- the slot is defined as: extracting the clearly defined attributes of a given entity from a large amount of corpus, such as the slot in the taxi, the departure location, and the purpose
- the attributes in the geoslot and departure time slot are "departure location", "destination” and "departure time” respectively.
- Slots are generally key information in voice communication. When the user enters a voice command, it is also a process of filling the slot, in order to transform the user's intention into a clear command that can be processed by the system. Slots are generally changeable and important information in technical processing, while other information is fixed and processed based on the session template.
- the slot information may include one or more of the following: keywords of people or events that the user pays attention to; default keywords in the audio; keywords set by the user.
- the slot information may include keywords of people or events that the user pays attention to; or, the slot information may include default keywords in audio or text.
- the default keywords in the audio or text include the name of celebrity C, the name of event D, etc.; or, the slot information may include keywords set by the user.
- the keywords set by the target user include "focus” and "soccer”; or, the slot information may include keywords of people or events that the user pays attention to, as well as the default keywords in the audio or text; or, the slot information It can include keywords of people or events that the user pays attention to, as well as keywords set by the user; or, the slot information can include the default keywords in the audio or text and the keywords set by the user; or, the slot information can include the target user The keywords of the people or events you are concerned about, the default keywords in the audio, and the keywords set by the target user.
- step S03 displaying the first broadcast information to the user, wherein when the first broadcast information is displayed, at least one slot information is prominently displayed through one or more of auditory, tactile and visual methods.
- step S03 may include the following steps:
- Step S031 Display the first broadcast information to the user in a visual form, wherein when displaying the first broadcast information, perform first special processing on at least one slot information in the first broadcast information, and the first special processing includes increasing the font size Processing, font bolding processing, adding underline processing, adjusting higher-level font processing, and font color changing processing (that is, color change processing, for example, fields other than slot information in the first broadcast information are displayed in black, and slot information is displayed in red Highlight one or more of).
- the first special processing includes increasing the font size Processing, font bolding processing, adding underline processing, adjusting higher-level font processing, and font color changing processing (that is, color change processing, for example, fields other than slot information in the first broadcast information are displayed in black, and slot information is displayed in red Highlight one or more of).
- the voice broadcast device may also include a display unit for displaying information input by the user or information provided to the user (that is, the first broadcast information).
- the display unit may include a display panel, and the display panel may be configured with a liquid crystal display (LCD), an organic light-emitting diode (OLED), etc., to configure the display panel.
- LCD liquid crystal display
- OLED organic light-emitting diode
- the user queries the front road condition information during driving, and performs text enlargement and bolding processing on the slot information of the broadcast information (first broadcast information).
- “Help me check the previous gambling or not” shown in Figure 2 is the user's input information, "The section between Beiming Road and Nanxiang Road ahead is congested, and it is expected to be congested for 15 minutes to travel on rainy days. Drive carefully” is the first Once the information is broadcast, "Beiming Road to Nanxiang Road section is congested” and "15 minutes” are slot information.
- the voice broadcasting device can perform audio broadcasting on the first broadcasting information through the audio output unit, and the audio output unit can convert the first broadcasting information into an audio signal and output it as sound .
- the audio output unit may also provide audio output related to a specific function performed by the voice broadcasting device (for example, call signal reception sound, message reception sound, etc.).
- the audio output unit may include a speaker, a buzzer, and so on.
- the audio output unit When the audio output unit performs audio broadcast on the first broadcast information, it may highlight the slot information in the first broadcast information, or may not highlight the slot information in the first broadcast information, which is not done in the present invention. limited.
- the audio output unit when the audio output unit performs audio broadcast on the first broadcast information, it highlights the slot information in the first broadcast information, which may specifically be:
- Step S032 The voice broadcast device audibly broadcasts the first broadcast information to the user, wherein, when at least one slot information is broadcast, a second special process is performed on the at least one slot information, and the second special process includes pitch enhancement processing and / Or volume enhancement processing.
- the voice broadcast device when broadcasting the first broadcast information, it displays the first broadcast information through the display unit, and performs the first special processing on the slot information in the first broadcast information, such as font enlargement processing and font bolding processing.
- the voice broadcast device displays the first broadcast information through the display unit, and performs the first special processing on the slot information in the first broadcast information, such as font enlargement processing and font bolding processing.
- Adding underline processing, adjusting higher-level font processing, and font color changing processing can reduce the security risk caused by voice distraction and prevent users from missing important or interesting content during audio listening. , Improve the user experience.
- the voice broadcast device may only perform step S032; the voice broadcast device audibly broadcasts the first broadcast information to the user, wherein, when at least one slot information is broadcast, the at least one slot information is performed.
- the second special processing, the second special processing includes pitch enhancement processing and/or volume enhancement processing.
- Step S031 may not be executed, that is, step S031 may be unnecessary. The foregoing implementation manner is mainly applied to a voice broadcast device that does not have a display screen.
- step S03 display the first broadcast information to the user, wherein when the first broadcast information is displayed, at least one slot is prominently displayed through one or more of auditory, tactile and visual methods Information can also include the following steps:
- Step S033 Broadcast the first broadcast information to the user in a auditory form, wherein when at least one slot information is broadcast, vibration is performed.
- step S033 can be implemented in combination with step S031 and/or step S032, or can be implemented separately.
- step S01: receiving user input information may include:
- Step S011 Receive the user's voice information
- Step S012 Analyze the voice information to determine the volume of the voice information
- Step S013 Determine the distance to the user according to the volume of the voice information
- step S03 displaying the first broadcast information to the user, where, when displaying the first broadcast information, highlighting at least one slot information through one or more of auditory, tactile and visual methods may include:
- Step S034 Display the first broadcast information to the user according to the distance, wherein when the first broadcast information is displayed, at least one slot information is prominently displayed according to the distance through one or more of auditory, tactile and visual methods.
- the output volume level of the first broadcast information increases as the distance between the user and the voice broadcast device increases, and the output volume level of the slot information Also, as the distance between the user and the voice broadcast device increases, the output volume level of the slot information is generally greater than the volume level of other information except the slot information in the first broadcast information.
- the font display size of the first broadcast information increases as the distance between the user and the voice broadcast device increases, and the font display of the slot information The size increases as the distance between the user and the voice broadcast device increases.
- the font display size of the slot information is larger than the font display size of other information except the slot information in the first broadcast information.
- the enhanced display of slot information can be determined dynamically according to the distance, which is highly practical and further improves the user experience.
- step S031 Display the first broadcast information to the user in a visual form, where, when the first broadcast information is displayed, the first broadcast information At least one slot information of the first special processing, the first special processing includes one or more of font enlargement processing, font bolding processing, adding underline processing, adjusting higher-level font processing, and font color changing processing, and Can include:
- At least one slot information in the first broadcast information is intercepted, a first card text is generated, and the first card text is highlighted.
- the first card text is composed of the slot information in the first broadcast information.
- the highlighting of the first card text can specifically be to perform font enlargement processing, font bolding processing, adding underline processing, and processing to the characters in the first card text. Adjust one or more of higher-level font processing and font color change processing.
- step S03 displaying the first broadcast information to the user, it may also include: sending the first broadcast information to the first device, and the first broadcast information is displayed on the screen through the first device, and
- a device may be a display or an electronic device with a display.
- the first device may be a general-purpose HUD head-up display for automobiles. Wherein, when the first device casts and displays the first broadcast information, it may highlight at least one slot information in the first broadcast information.
- Fig. 4 is a structural block diagram of a voice broadcasting device provided by an embodiment of the present invention.
- a voice broadcast device 100 including:
- the first receiving module 110 is configured to receive input information of the user and identify the first intention corresponding to the input information
- the first determining module 120 is configured to determine the corresponding first broadcast information according to the first intention, and determine at least one slot information in the broadcast information according to the preset correspondence between the first broadcast information and the slot information;
- the first display module 130 is configured to display the first broadcast information to the user, wherein when the first broadcast information is displayed, at least one slot information is prominently displayed through one or more of auditory, tactile and visual methods.
- the first display module 130 includes:
- the first display unit is used to display the first broadcast information to the user in a visual form, wherein, when the first broadcast information is displayed, the first special processing is performed on at least one slot information in the first broadcast information, and the first special processing is performed Including one or more of font enlargement processing, font bolding processing, adding underline processing, adjusting higher-level font processing, and font color changing processing.
- the first display module 130 includes:
- the second display unit is used to audibly broadcast the first broadcast information to the user, wherein, when at least one slot information is broadcast, a second special processing is performed on the at least one slot information, and the second special processing includes pitch enhancement processing And/or volume enhancement processing.
- the first display module 130 includes:
- the third display unit is used to audibly broadcast the first broadcast information to the user, wherein when at least one slot information is broadcast, vibration is performed.
- the first receiving module 110 includes:
- the first receiving unit is used to receive the user's voice information
- the first determining unit is used to analyze the voice information to determine the volume of the voice information
- the second determining unit is configured to determine the distance to the user according to the volume of the voice information
- the first display module 130 includes:
- the fourth display unit is used to display the first broadcast information to the user according to the distance, wherein when the first broadcast information is displayed, at least one slot is prominently displayed according to the distance through one or more of hearing, touch and vision information.
- the first display module 130 includes:
- the screen projection unit is configured to send the first broadcast information to the first device, and the first broadcast information is projected and displayed on the first device through the first device.
- an embodiment of the present invention provides an electronic device 50.
- the electronic device 50 of this embodiment includes a processor 51, a memory 52, and a program 53 stored in the memory 52 and running on the processor 51.
- the program 53 is executed by the processor 51, the voice broadcast method in the embodiment is implemented. In order to avoid repetition, it will not be repeated here.
- the embodiment of the present invention also provides a computer storage medium, including computer instructions, which when the computer instructions run on an electronic device, cause the electronic device to execute each step in the above-mentioned voice broadcast method.
- the embodiment of the present invention also provides a computer program product.
- the computer program product runs on a computer
- the computer program product runs on the computer
- the computer executes the steps in the above-mentioned voice broadcast method.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- General Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Multimedia (AREA)
- General Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
本发明实施例提供一种语音播报方法、装置、存储介质及电子设备,其中,语音播报方法包括:接收用户的输入信息,识别输入信息对应的第一意图;根据第一意图确定对应的第一播报信息,根据预设的第一播报信息与槽位信息的对应关系,确定播报信息中的至少一个槽位信息;向用户展示第一播报信息,其中,在展示第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,突出展示至少一个槽位信息。上述方法能够在语音播报过程中实现语音关键播报信息的突出呈现,且不影响语言表达的自然性。
Description
本申请要求于2020年03月10日提交中国专利局、申请号为202010162199.3、申请名称为“语音播报方法、装置、存储介质及电子设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
本申请涉及通信技术领域,尤其涉及一种语音播报方法、装置、存储介质及电子设备。
语音作为一种非接触的新交互形式,在越来越多的设备上被支持和使用。语音模态本身具有使用特点,远距离/视觉通道被占用/并行任务/老人等能力受限人群,都是语音的关键使用场景。在这些场景中,语音关键播报信息的突出性变得非常重要。由于语言的自然属性,这些槽位信息被掩埋在大量的播报内容中,增加了辨识的难度。如果仅呈现重要信息,又不影响语言表达的自然性,难以理解。
发明内容
本申请实施例提供一种语音播报方法、装置、存储介质及电子设备,能够在语音播报过程中实现语音关键播报信息的突出呈现,且不影响语言表达的自然性。
第一方面,本申请实施例提供一种语音播报方法,接收用户的输入信息,识别所述输入信息对应的第一意图;
根据所述第一意图确定对应的第一播报信息,根据预设的所述第一播报信息与槽位信息的对应关系,确定所述播报信息中的至少一个槽位信息;
向所述用户展示所述第一播报信息,其中,在展示所述第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,突出展示所述至少一个槽位信息。
结合第一方面,在一种可能的实现方式中,所述向所述用户展示所述第一播报信息,其中,在展示所述第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,突出展示所述至少一个槽位信息,包括:
向所述用户以视觉形式展示所述第一播报信息,其中,在展示所述第一播报信息时,对所述第一播报信息中的至少一个槽位信息进行第一特殊处理,所述第一特殊处理包括字体变大处理、字体加粗处理、增加下划线处理、调整更高级别字体处理以及字体变色处理中的一种或多种。
结合第一方面,在一种可行的实现方式中,所述向所述用户展示所述第一播报信息,其中,在展示所述第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,突出展示所述至少一个槽位信息,包括:
向所述用户以听觉形式播报所述第一播报信息,其中,在播报到所述至少一个槽位信息时,对所述至少一个槽位信息进行第二特殊处理,所述第二特殊处理包括音调增强处理和/或音量增强处理。
结合第一方面,在一种可行的实现方式中,所述向所述用户展示所述第一播报信 息,其中,在展示所述第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,突出展示所述至少一个槽位信息,包括:
向所述用户以听觉形式播报所述第一播报信息,其中,在播报到所述至少一个槽位信息时,进行振动。
结合第一方面,在一种可行的实现方式中,所述接收用户的输入信息,包括:
接收所述用户的语音信息;
对所述语音信息进行分析确定所述语音信息的音量;
根据所述语音信息的音量确定与用户的距离;
所述向所述用户展示所述第一播报信息,其中,在展示所述第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,突出展示所述至少一个槽位信息,包括:
根据所述距离向所述用户展示所述第一播报信息,其中,在展示所述第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,根据所述距离突出展示所述至少一个槽位信息。
结合第一方面,在一种可行的实现方式中,所述向所述用户展示所述第一播报信息,包括:
将所述第一播报信息发送给第一设备,通过所述第一设备对所述第一播报信息进行投屏显示。
第二方面,本申请实施例提供一种语音播报装置,包括:
第一接收模块,用于接收用户的输入信息,识别所述输入信息对应的第一意图;
第一确定模块,用于根据所述第一意图确定对应的第一播报信息,根据预设的所述第一播报信息与槽位信息的对应关系,确定所述播报信息中的至少一个槽位信息;以及
第一展示模块,用于向所述用户展示所述第一播报信息,其中,在展示所述第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,突出展示所述至少一个槽位信息。
结合第二方面,在一种可行的实现方式中,所述第一展示模块包括:
第一展示单元,用于向所述用户以视觉形式展示所述第一播报信息,其中,在展示所述第一播报信息时,对所述第一播报信息中的至少一个槽位信息进行第一特殊处理,所述第一特殊处理包括字体变大处理、字体加粗处理、增加下划线处理、调整更高级别字体处理以及字体变色处理中的一种或多种。
结合第二方面,在一种可行的实现方式中,所述第一展示模块包括:
第二展示单元,用于向所述用户以听觉形式播报所述第一播报信息,其中,在播报到所述至少一个槽位信息时,对所述至少一个槽位信息进行第二特殊处理,所述第二特殊处理包括音调增强处理和/或音量增强处理。
结合第二方面,在一种可行的实现方式中,所述第一展示模块包括:
第三展示单元,用于向所述用户以听觉形式播报所述第一播报信息,其中,在播报到所述至少一个槽位信息时,进行振动。
结合第二方面,在一种可行的实现方式中,所述第一接收模块包括:
第一接收单元,用于接收所述用户的语音信息;
第一确定单元,用于对所述语音信息进行分析确定所述语音信息的音量;以及
第二确定单元,用于根据所述语音信息的音量确定与用户的距离;
所述第一展示模块包括:
第四展示单元,用于根据所述距离向所述用户展示所述第一播报信息,其中,在展示所述第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,根据所述距离突出展示所述至少一个槽位信息。
结合第二方面,在一种可行的实现方式中,所述第一展示模块包括:
投屏单元,用于将所述第一播报信息发送给第一设备,通过所述第一设备对所述第一播报信息进行投屏显示。
第三方面,本申请实施例提供一种电子设备,所述电子设备包括存储器、处理器、触摸传感器及显示屏,所述存储器中存储有计算机程序,所述处理器与所述存储器连接,所述处理器执行计算机程序以实现执行第一方面或者第一方面的任一可能的实现方式中的方法。
第四方面,本申请实施例提供一种计算机可读存储介质,包括计算机指令,当所述计算机指令在电子设备上运行时,使得所述电子设备执行第一方面或者第一方面的任一可能的实现方式中的方法的指令。
可以理解,本发明实施例通过在进行语音交互过程中,快速提取关键槽位信息,降低语音分心带来的安全分享,避免用户在听音频的过程中错过重要或者感兴趣的内容,提升了用户的使用体验。
图1为本申请实施例提供的语音播报方法的流程图;
图2为本申请实施例提供的关于第一播报信息的显示界面图;
图3为本申请实施例提供的关于第一播报信息的又一显示界面图;
图4为本申请实施例提供的语音播报装置的结构示意图;
图5为本申请实施例提供的电子设备的结构示意图。
为了更好的理解本发明的技术方案,下面结合附图对本申请实施例进行详细描述。
应当明确,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其它实施例,都属于本发明保护的范围。
本申请中,“至少一个”是指一个或者多个,“多个”是指两个或两个以上。“和/或”,描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B的情况,其中A,B可以是单数或者复数。字符“/”一般表示前后关联对象是一种“或”的关系。“以下至少一项(个)”或其类似表达,是指的这些项中的任意组合,包括单项(个)或复数项(个)的任意组合。例如,a,b,或c中的至少一项(个),可以表示:a,b,c,a-b,a-c,b-c,或a-b-c,其中a,b,c可以是单个,也可以是多个。
图1示出了本发明实施例提供的一种语音播报方法的流程图。
本发明实施例提供了一种语音播报方法,该方法可以应用于语音播报装置中,语 音播报装置例如可以为终端设备,其中,终端设备可以为手机、平板电脑、智能手表、车载终端、智能电视、智能音箱、车载中控屏、辅助机器人或者MP3播放器等设备,该设备具有显示音频或其他与人类的视觉、听觉、嗅觉、触觉或味觉相关的内容的功能。
本发明实施例提供的语音播报方法包括:
步骤S01:接收用户的输入信息,识别输入信息对应的第一意图;
步骤S02:根据第一意图确定对应的第一播报信息,根据预设的第一播报信息与槽位信息的对应关系,确定播报信息中的至少一个槽位信息;
步骤S03:向用户展示第一播报信息,其中,在展示第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,突出展示至少一个槽位信息。
可以理解,本发明实施例通过在进行语音交互过程中,快速提取关键的槽位信息,通过突出显示槽位信息,可以降低语音分心,避免用户在听音频的过程中错过重要或者感兴趣的内容,提升了用户的使用体验。
下面继续结合图1对本发明实施例提供的语音播报方法进行更为具体的说明。
针对步骤S01:接收用户的输入信息,识别输入信息对应的第一意图;
其中,语音播报装置可以包括音频接收单元和/或用户输入单元,其中,音频接收单元可以接收声音(音频数据),并且能够将这样的声音处理为音频数据。用户输入单元可用于接收输入的数字或字符信息,用户输入单元可包括触控面板以及其他输入设备。触控面板,也称为触摸屏,可收集用户在其上或附近的触摸操作(比如用户使用手指、触笔等任何适合的物体或附件在触控面板上或在触控面板附近的操作),并根据预先设定的程式驱动相应的连接装置。其他输入设备可以包括但不限于物理键盘、功能键(比如音量控制按键、开关按键等)、轨迹球、鼠标、操作杆等中的一种或多种,具体此处不做限定。
基于上述,输入信息可以为通过音频接收单元接收到的语音信息或者通过用户输入单元接收到的文本信息,当语音播报装置接收到语音信息时,识别语音信息,将语音信息转化为对应的文本信息,文本信息例如:“中国的首都是哪里?”。
其中,语音播报装置获取文本信息后,采用意图识别技术(例如NLP(Natural Language Processing,自然语言处理)技术)确定用户的意图,即第一意图。
针对步骤S02:根据第一意图确定对应的第一播报信息,根据预设的第一播报信息与槽位信息的对应关系,确定播报信息中的至少一个槽位信息。
其中,语音播报装置预设有第一意图与第一播报信息的匹配或者关联关系;
其中,每条第一播报信息对应的槽位信息通过预先设定,其中,槽位的定义为:从大量语料中抽取给定实体被明确定义的属性,比如打车中的,出发地点槽,目的地槽,出发时间槽中的属性分别是“出发地点”、“目的地”和“出发时间”。槽位一般都是语音交流中的关键信息。当用户录入语音指令时,也是一个填充槽位的过程,是为了让用户意图转化为系统可处理的明确指令。槽位在技术处理中一般为可变化的重要信息,而其他信息则会基于会话模板固定处理。
在一些可能的实现方式中,槽位信息可以包含以下一项或多项:用户关注的人物 或事件的关键词;音频中默认的关键词;用户设置的关键词。
作为该实现方式的一个示例,槽位信息可以包含用户关注的人物或事件的关键词;或者,槽位信息可以包含音频或文本中默认的关键词。例如,音频或文本中默认的关键词包括明星C的名字、事件D的名称等;又或者,槽位信息可以包含用户设置的关键词。例如,目标用户设置的关键词包括“重点”和“足球”;又或者,槽位信息可以包含用户关注的人物或事件的关键词,以及音频或文本中默认的关键词;或者,槽位信息可以包含用户关注的人物或事件的关键词,以及用户设置的关键词;或者,槽位信息可以包含音频或文本中默认的关键词和用户设置的关键词;或者,槽位信息可以包含目标用户关注的人物或事件的关键词,音频中默认的关键词,以及目标用户设置的关键词。
针对步骤S03:向用户展示第一播报信息,其中,在展示第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,突出展示至少一个槽位信息。
在一些可能的实现方式中,步骤S03可以包括如下步骤:
步骤S031:向用户以视觉形式展示第一播报信息,其中,在展示第一播报信息时,对第一播报信息中的至少一个槽位信息进行第一特殊处理,第一特殊处理包括字体变大处理、字体加粗处理、增加下划线处理、调整更高级别字体处理以及字体变色处理(即颜色变化处理,例如第一播报信息中除槽位信息以外的字段采用黑色显示,槽位信息采用红色进行突出显示)中的一种或多种。
其中,语音播报装置还可以包括显示单元,用于显示由用户输入的信息或提供给用户的信息(即第一播报信息)。显示单元可包括显示面板,显示面板可以采用液晶显示器(Liquid Crystal Display,LCD)、有机发光二极管(Organic Light-Emitting Diode,OLED)等形式来配置显示面板。
结合应用场景,如图2所示,用户在驾驶过程中,查询前方路况信息,将播报信息(第一播报信息)的槽位信息进行文字增大及加粗处理。其中,图2中所示的“帮我查一下前面赌不赌”即为用户的输入信息,“前方北明路到南翔路路段拥堵,预计拥堵15分钟雨天出行,小心驾驶”即为第一播报信息,“北明路到南翔路路段拥堵”和“15分钟”即为槽位信息。
需要知道的是,在执行步骤S031的同时、之前或之后,语音播报装置可以通过音频输出单元对第一播报信息进行音频播报,音频输出单元能够将第一播报信息转换成音频信号并且输出为声音。而且,音频输出单元还可以提供与语音播报装置执行的特定功能相关的音频输出(例如,呼叫信号接收声音、消息接收声音等等)。音频输出单元可以包括扬声器、蜂鸣器等等。
音频输出单元对第一播报信息进行音频播报时,可以对第一播报信息中的槽位信息进行突出展示,也可以不对第一播报信息中的槽位信息进行突出展示,本发明对此不做限定。
优选地,音频输出单元对第一播报信息进行音频播报时,会对第一播报信息中的槽位信息进行突出展示,具体可以为:
步骤S032:语音播报装置向用户以听觉形式播报第一播报信息,其中,在播报到至少一个槽位信息时,对至少一个槽位信息进行第二特殊处理,第二特殊处理包括音 调增强处理和/或音量增强处理。
可以理解,语音播报装置在播报第一播报信息时,通过显示单元显示第一播报信息,并对第一播报信息中的槽位信息进行第一特殊处理,例如字体变大处理、字体加粗处理、增加下划线处理、调整更高级别字体处理以及字体变色处理中的一种或多种,可以降低语音分心带来的安全风险,同时避免用户在听音频的过程中错过重要或者感兴趣的内容,提升了用户的使用体验。
在一些可能的实现方式中,语音播报装置可以仅执行步骤S032;语音播报装置向用户以听觉形式播报第一播报信息,其中,在播报到至少一个槽位信息时,对至少一个槽位信息进行第二特殊处理,第二特殊处理包括音调增强处理和/或音量增强处理。可不执行步骤S031,即步骤S031可以是非必要的。上述实现方式主要应用于不具有显示屏的语音播报装置中。
在一些可能的实现方式中,步骤S03:向用户展示第一播报信息,其中,在展示第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,突出展示至少一个槽位信息,还可以包括如下步骤:
步骤S033:向用户以听觉形式播报第一播报信息,其中,在播报到至少一个槽位信息时,进行振动。
其中,步骤S033可以与步骤S031和/或步骤S032结合实现,也可以单独实现。
在一些可能的实现方式中,步骤S01:接收用户的输入信息,可以包括:
步骤S011:接收用户的语音信息;
步骤S012:对语音信息进行分析确定语音信息的音量;
步骤S013:根据语音信息的音量确定与用户的距离;
其中,步骤S03:向用户展示第一播报信息,其中,在展示第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,突出展示至少一个槽位信息,可以包括:
步骤S034:根据距离向用户展示第一播报信息,其中,在展示第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,根据距离突出展示至少一个槽位信息。
示例性地,如果用户采用了听觉的方式突出展示至少一个槽位信息,则第一播报信息的输出音量级随着用户和语音播报装置之间的距离增加而增加,槽位信息的输出音量级同样随着用户和语音播报装置之间的距离增加而增加,通常情况下,槽位信息的输出音量级要大于第一播报信息中除槽位信息以外的其他信息的音量级。
示例性地,如果用户采用了视觉的方式突出展示至少一个槽位信息,则第一播报信息的字体显示大小随着用户和语音播报装置之间的距离增加而增大,槽位信息的字体显示大小随着用户和语音播报装置之间的距离增加而增大,通常情况下,槽位信息的字体显示大小要大于第一播报信息中除槽位信息以外的其他信息的字体显示大小。
可以理解,通过识别用户与语音播报装置之间的距离,可以动态地根据距离确定槽位信息的加强展示,实用性强,进一步提高用户体验。
结合又一应用场景,如图3所示,在一些可能的实现方式中,步骤S031:向用户以视觉形式展示第一播报信息,其中,在展示第一播报信息时,对第一播报信息中的至少一个槽位信息进行第一特殊处理,第一特殊处理包括字体变大处理、字体加粗处理、增加下划线处理、调整更高级别字体处理以及字体变色处理中的一种或多种,还 可以包括:
截取第一播报信息中的至少一个槽位信息,生成第一卡片文本,并突出显示所述第一卡片文本。
其中,第一卡片文本由第一播报信息中的槽位信息组成,突出显示第一卡片文本具体可以为对第一卡片文本中的字符进行字体变大处理、字体加粗处理、增加下划线处理、调整更高级别字体处理以及字体变色处理中的一种或多种。
示例性地,图3所示的“明天的天气怎么样?”为用户的输入信息,“明天天气晴,温度15-25度,天气降温,记得增加衣服,注意保暖,和我说新闻,还可以播报更多新闻信息”为第一播报信息,其中,“晴”和“15-25”为槽位信息,“晴15-25”为第一卡片文本。图3所示的第一卡片文本中的字符进行字体变大处理,以实现对第一卡片文本的突出显示。
在一些可能的实现方式中,步骤S03中:向用户展示第一播报信息,还可以包括:将第一播报信息发送给第一设备,通过第一设备对第一播报信息进行投屏显示,第一设备可以为显示器或者具有显示器的电子设备,例如第一设备可以为汽车通用HUD抬头显示器。其中,第一设备对第一播报信息进行投屏显示时,可以突出显示第一播报信息中的至少一个槽位信息。
图4本发明实施例提供的一种语音播报装置的结构框图;
请参阅图4,一种语音播报装置100,包括:
第一接收模块110,用于接收用户的输入信息,识别输入信息对应的第一意图;
第一确定模块120,用于根据第一意图确定对应的第一播报信息,根据预设的第一播报信息与槽位信息的对应关系,确定播报信息中的至少一个槽位信息;以及
第一展示模块130,用于向用户展示第一播报信息,其中,在展示第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,突出展示至少一个槽位信息。
在一种可选的实施方式中,第一展示模块130包括:
第一展示单元,用于向用户以视觉形式展示第一播报信息,其中,在展示第一播报信息时,对第一播报信息中的至少一个槽位信息进行第一特殊处理,第一特殊处理包括字体变大处理、字体加粗处理、增加下划线处理、调整更高级别字体处理以及字体变色处理中的一种或多种。
在一种可选的实施方式中,第一展示模块130包括:
第二展示单元,用于向用户以听觉形式播报第一播报信息,其中,在播报到至少一个槽位信息时,对至少一个槽位信息进行第二特殊处理,第二特殊处理包括音调增强处理和/或音量增强处理。
在一种可选的实施方式中,第一展示模块130包括:
第三展示单元,用于向用户以听觉形式播报第一播报信息,其中,在播报到至少一个槽位信息时,进行振动。
在一种可选的实施方式中,第一接收模块110包括:
第一接收单元,用于接收用户的语音信息;
第一确定单元,用于对语音信息进行分析确定语音信息的音量;以及
第二确定单元,用于根据语音信息的音量确定与用户的距离;
第一展示模块130包括:
第四展示单元,用于根据距离向用户展示第一播报信息,其中,在展示第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,根据距离突出展示至少一个槽位信息。
在一种可选的实施方式中,第一展示模块130包括:
投屏单元,用于将第一播报信息发送给第一设备,通过第一设备对第一播报信息进行投屏显示。
需要知道的是,装置实施例的进一步实施方式以及其它细节可以参考方法实施例中的对应内容,为避免重复,在此不一一赘述。
请参阅附图5,本发明实施例提供了一种电子设备50,该实施例的电子设备50包括:处理器51、存储器52以及存储在存储器52中并可在处理器51上运行的程序53,该程序53被处理器51执行时实现实施例中的语音播报方法,为避免重复,此处不一一赘述。
本发明实施例还提供了一种计算机存储介质,包括计算机指令,当计算机指令在电子设备上运行时,使得电子设备执行如上述的语音播报方法中的各个步骤。
本发明实施例还提供了一种计算机程序产品,当计算机程序产品在计算机上运行时,该计算机程序产品在计算机上运行时,使得计算机执行上述语音播报方法中的各个步骤。
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统、装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。
以上,仅为本申请的具体实施方式,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本申请的保护范围之内。本申请的保护范围应以权利要求的保护范围为准。
Claims (14)
- 一种语音播报方法,其特征在于,包括:接收用户的输入信息,识别所述输入信息对应的第一意图;根据所述第一意图确定对应的第一播报信息,根据预设的所述第一播报信息与槽位信息的对应关系,确定所述播报信息中的至少一个槽位信息;向所述用户展示所述第一播报信息,其中,在展示所述第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,突出展示所述至少一个槽位信息。
- 根据权利要求1所述的方法,其特征在于,所述向所述用户展示所述第一播报信息,其中,在展示所述第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,突出展示所述至少一个槽位信息,包括:向所述用户以视觉形式展示所述第一播报信息,其中,在展示所述第一播报信息时,对所述第一播报信息中的至少一个槽位信息进行第一特殊处理,所述第一特殊处理包括:字体变大处理、字体加粗处理、增加下划线处理、调整更高级别字体处理以及字体变色处理中的一种或多种。
- 根据权利要求1或2所述的方法,其特征在于,所述向所述用户展示所述第一播报信息,其中,在展示所述第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,突出展示所述至少一个槽位信息,包括:向所述用户以听觉形式播报所述第一播报信息,其中,在播报到所述至少一个槽位信息时,对所述至少一个槽位信息进行第二特殊处理,所述第二特殊处理包括音调增强处理和/或音量增强处理。
- 根据权利要求3所述的方法,其特征在于,所述向所述用户展示所述第一播报信息,其中,在展示所述第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,突出展示所述至少一个槽位信息,包括:向所述用户以听觉形式播报所述第一播报信息,其中,在播报到所述至少一个槽位信息时,进行振动。
- 根据权利要求1所述的方法,其特征在于,所述输入信息为语音信息,所述接收用户的输入信息,包括:接收所述用户的语音信息;对所述语音信息进行分析确定所述语音信息的音量;根据所述语音信息的音量确定与用户的距离;所述向所述用户展示所述第一播报信息,其中,在展示所述第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,突出展示所述至少一个槽位信息,包括:根据所述距离向所述用户展示所述第一播报信息,其中,在展示所述第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,根据所述距离突出展示所述至少一个槽位信息。
- 根据权利要求1所述的方法,其特征在于,所述向所述用户展示所述第一播报信息,包括:将所述第一播报信息发送给第一设备,通过所述第一设备对所述第一播报信息进行投屏显示。
- 一种语音播报装置,其特征在于,包括:第一接收模块,用于接收用户的输入信息,识别所述输入信息对应的第一意图;第一确定模块,用于根据所述第一意图确定对应的第一播报信息,根据预设的所述第一播报信息与槽位信息的对应关系,确定所述播报信息中的至少一个槽位信息;以及第一展示模块,用于向所述用户展示所述第一播报信息,其中,在展示所述第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,突出展示所述至少一个槽位信息。
- 根据权利要求7所述的语音播报装置,其特征在于,所述第一展示模块包括:第一展示单元,用于向所述用户以视觉形式展示所述第一播报信息,其中,在展示所述第一播报信息时,对所述第一播报信息中的至少一个槽位信息进行第一特殊处理,所述第一特殊处理包括字体变大处理、字体加粗处理、增加下划线处理、调整更高级别字体处理以及字体变色处理中的一种或多种。
- 根据权利要求7或8所述的语音播报装置,其特征在于,所述第一展示模块包括:第二展示单元,用于向所述用户以听觉形式播报所述第一播报信息,其中,在播报到所述至少一个槽位信息时,对所述至少一个槽位信息进行第二特殊处理,所述第二特殊处理包括音调增强处理和/或音量增强处理。
- 根据权利要求9所述的语音播报装置,其特征在于,所述第一展示模块包括:第三展示单元,用于向所述用户以听觉形式播报所述第一播报信息,其中,在播报到所述至少一个槽位信息时,进行振动。
- 根据权利要求7所述的语音播报装置,其特征在于,所述第一接收模块包括:第一接收单元,用于接收所述用户的语音信息;第一确定单元,用于对所述语音信息进行分析确定所述语音信息的音量;以及第二确定单元,用于根据所述语音信息的音量确定与用户的距离;所述第一展示模块包括:第四展示单元,用于根据所述距离向所述用户展示所述第一播报信息,其中,在展示所述第一播报信息时,通过听觉、触觉和视觉中的一种或多种方式,根据所述距离突出展示所述至少一个槽位信息。
- 根据权利要求7所述的语音播报装置,其特征在于,所述第一展示模块包括:投屏单元,用于将所述第一播报信息发送给第一设备,通过所述第一设备对所述第一播报信息进行投屏显示。
- 一种存储介质,所述存储介质包括存储的程序,其中,在所述程序运行时控制所述存储介质所在设备执行权利要求1至6任意一项所述的方法。
- 一种电子设备,包括存储器和处理器,所述存储器用于存储包括程序指令的信息,所述处理器用于控制程序指令的执行,其特征在于:所述程序指令被处理器加载并执行时实现权利要求1至6任意一项所述的方法。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010162199.3A CN113448426A (zh) | 2020-03-10 | 2020-03-10 | 语音播报方法、装置、存储介质及电子设备 |
CN202010162199.3 | 2020-03-10 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021180067A1 true WO2021180067A1 (zh) | 2021-09-16 |
Family
ID=77671225
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2021/079736 WO2021180067A1 (zh) | 2020-03-10 | 2021-03-09 | 屏幕分区显示的方法、终端、计算机存储介质 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN113448426A (zh) |
WO (1) | WO2021180067A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114566060A (zh) * | 2022-02-23 | 2022-05-31 | 成都智元汇信息技术股份有限公司 | 公共交通消息通知处理方法、装置、系统、电子设备及介质 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6033224A (en) * | 1997-06-27 | 2000-03-07 | Kurzweil Educational Systems | Reading machine system for the blind having a dictionary |
CN105185167A (zh) * | 2015-08-12 | 2015-12-23 | 广东小天才科技有限公司 | 一种助听方法、助听装置、第一助听系统和第二助听系统 |
CN106878070A (zh) * | 2017-01-24 | 2017-06-20 | 广西大学 | Soa构架下基于云计算的电网实时监控报警系统及实现方法 |
CN107800856A (zh) * | 2016-08-29 | 2018-03-13 | 中兴通讯股份有限公司 | 一种语音播报方法、装置及移动终端 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109981448B (zh) * | 2019-03-28 | 2022-03-25 | 联想(北京)有限公司 | 信息处理方法和电子设备 |
CN110334352B (zh) * | 2019-07-08 | 2023-07-07 | 腾讯科技(深圳)有限公司 | 引导信息显示方法、装置、终端及存储介质 |
CN110827827A (zh) * | 2019-11-27 | 2020-02-21 | 维沃移动通信有限公司 | 一种语音播报方法及电子设备 |
-
2020
- 2020-03-10 CN CN202010162199.3A patent/CN113448426A/zh active Pending
-
2021
- 2021-03-09 WO PCT/CN2021/079736 patent/WO2021180067A1/zh active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6033224A (en) * | 1997-06-27 | 2000-03-07 | Kurzweil Educational Systems | Reading machine system for the blind having a dictionary |
CN105185167A (zh) * | 2015-08-12 | 2015-12-23 | 广东小天才科技有限公司 | 一种助听方法、助听装置、第一助听系统和第二助听系统 |
CN107800856A (zh) * | 2016-08-29 | 2018-03-13 | 中兴通讯股份有限公司 | 一种语音播报方法、装置及移动终端 |
CN106878070A (zh) * | 2017-01-24 | 2017-06-20 | 广西大学 | Soa构架下基于云计算的电网实时监控报警系统及实现方法 |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114566060A (zh) * | 2022-02-23 | 2022-05-31 | 成都智元汇信息技术股份有限公司 | 公共交通消息通知处理方法、装置、系统、电子设备及介质 |
CN114566060B (zh) * | 2022-02-23 | 2023-03-24 | 成都智元汇信息技术股份有限公司 | 公共交通消息通知处理方法、装置、系统、电子设备及介质 |
Also Published As
Publication number | Publication date |
---|---|
CN113448426A (zh) | 2021-09-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10204618B2 (en) | Terminal and method for voice control on terminal | |
CN108491123B (zh) | 一种调节应用程序图标的方法及移动终端 | |
CN107728400B (zh) | 一种信息的显示方法及移动终端 | |
WO2020011077A1 (zh) | 通知消息显示方法及终端设备 | |
JP2014010456A (ja) | 移動端末機及びその音声認識方法 | |
CN109213407B (zh) | 一种截图方法及终端设备 | |
CN111177180A (zh) | 一种数据查询方法、装置以及电子设备 | |
CN110866038A (zh) | 信息推荐方法及终端设备 | |
US20220392130A1 (en) | Image special effect processing method and apparatus | |
CN112560540B (zh) | 一种美妆穿搭推荐方法及装置 | |
WO2019129264A1 (zh) | 界面的显示方法和移动终端 | |
WO2021180067A1 (zh) | 屏幕分区显示的方法、终端、计算机存储介质 | |
CN108052356A (zh) | 一种启动计算器的方法,及终端设备 | |
WO2024183434A1 (zh) | 基于文本生成图片的方法及模型训练方法、装置、设备及存储介质 | |
CN107223224A (zh) | 一种弱视辅助方法和装置 | |
CN113676395A (zh) | 信息处理方法、相关设备及可读存储介质 | |
CN113593614A (zh) | 图像处理方法及装置 | |
CN113031838B (zh) | 屏幕录制方法、装置及电子设备 | |
CN112583695B (zh) | 一种消息展示方法 | |
CN109325212B (zh) | 信息交互方法、装置、电子设备及浏览器 | |
CN116860913A (zh) | 语音交互方法、装置、设备及存储介质 | |
CN111554314A (zh) | 噪声检测方法、装置、终端及存储介质 | |
WO2019076375A1 (zh) | 短信界面的显示方法、移动终端及可读存储介质 | |
CN116977884A (zh) | 视频切分模型的训练方法、视频切分方法及装置 | |
CN113542206B (zh) | 一种图像处理方法、装置及计算机可读存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21767838 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 21767838 Country of ref document: EP Kind code of ref document: A1 |