CN110060686A - Voice interactive method and device, terminal device, computer readable storage medium - Google Patents

Voice interactive method and device, terminal device, computer readable storage medium Download PDF

Info

Publication number
CN110060686A
CN110060686A CN201910300604.0A CN201910300604A CN110060686A CN 110060686 A CN110060686 A CN 110060686A CN 201910300604 A CN201910300604 A CN 201910300604A CN 110060686 A CN110060686 A CN 110060686A
Authority
CN
China
Prior art keywords
information
topic type
topic
user
text information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910300604.0A
Other languages
Chinese (zh)
Other versions
CN110060686B (en
Inventor
吴迪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Genius Technology Co Ltd
Original Assignee
Guangdong Genius Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Genius Technology Co Ltd filed Critical Guangdong Genius Technology Co Ltd
Priority to CN201910300604.0A priority Critical patent/CN110060686B/en
Publication of CN110060686A publication Critical patent/CN110060686A/en
Application granted granted Critical
Publication of CN110060686B publication Critical patent/CN110060686B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/22Interactive procedures; Man-machine interfaces

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Health & Medical Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Telephonic Communication Services (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The invention discloses a kind of voice interactive method and device, terminal device, computer readable storage mediums, are related to interactive voice field, voice interactive method, comprising the following steps: receive the topic type voice messaging of user's input;The topic type voice messaging is subjected to speech recognition, is converted to corresponding text information;The text information is matched in default topic type way to put questions database, obtains the corresponding topic type way to put questions information of the text information.The present invention can effectively solve the problem of user when doing one's assignment in face of problem and do not know how opening inquiry, directly topic type title is read out, the feedback of effective topic type way to put questions information can be obtained, human-computer interaction is allowed to become more smooth.

Description

Voice interactive method and device, terminal device, computer readable storage medium
Technical field
The present invention relates to interactive voice field more particularly to a kind of voice interactive methods and device, terminal device, computer Readable storage medium storing program for executing.
Background technique
With the development of science and technology, the voice technology of terminal device is slowly at mainstream.Present user is set using voice and terminal When standby interaction, general process are as follows: user opens the interface of a voice system, such as: Siri, user pass through conventional way to put questions It inquires, for example, " making a phone call to XXX ", " alarm clock for fixed X point " etc..
Existing this voice way to put questions mode may be only available for general scene, is not applied for study class voice response and produces Product.In learning process, different subjects have different topic types, and when student likes to ask topic, general describing mode is had no way standard It really describes oneself to want the problem of asking, limits the case where student asks operation topic by voice system.
Summary of the invention
The object of the present invention is to provide a kind of voice interactive method and device, terminal device, computer readable storage medium, When user, which does not know, how to ask operation topic, it is for reference to provide related topic type way to put questions information, helps its accurate description Oneself wants the problem of asking, improves the usage experience of user.
Technical solution provided by the invention is as follows:
A kind of voice interactive method, comprising the following steps: receive the topic type voice messaging of user's input;By the topic type language Message breath carries out speech recognition, is converted to corresponding text information;Default topic type way to put questions database to the text information into Row matching, obtains the corresponding topic type way to put questions information of the text information.
In the above-mentioned technical solutions, user can inquire the corresponding topic type of this topic type by reading the topic type for the topic for wanting to ask Way to put questions information is described more with making user so that user be allow accurately to be inquired using related way to put questions when voice inquiry topic Accurately, terminal device can also provide more accurate feedback.
Further, described that the text information is matched in default topic type way to put questions database, obtain the text The corresponding topic type way to put questions information of information includes: to match in default topic type way to put questions database to the text information, when not When being fitted on the topic type way to put questions information completely the same with the text information, using the highest topic type way to put questions information of matching degree as described in The corresponding topic type way to put questions information of text information.
In the above-mentioned technical solutions, matching degree highest explanation is immediate with text information, and is most likely to be use The desired topic type way to put questions information in family, provides good usage experience in many aspects for user.
Further, described that the topic type voice messaging is subjected to speech recognition, corresponding text information is converted into one Step includes: that the topic type voice is carried out speech recognition, semantic parsing, is converted to corresponding text information, the text information It include: discipline information and topic type information;Described matches the text information in default topic type way to put questions database, obtains The corresponding topic type way to put questions information of the text information includes: the corresponding column of the discipline information in default topic type way to put questions database The topic type information is matched in table, obtains the corresponding topic type way to put questions information of the text information.
It in the above-mentioned technical solutions, may include discipline information in topic type voice messaging, to improve matching topic type way to put questions information When speed and precision.
Further, further includes: when detecting that user encounters the topic that will not be done, issue prompt information, the prompt letter Breath reads the topic type of the topic for prompt user;The topic type voice messaging for receiving user's input specifically: receive user The topic type voice messaging inputted according to the prompt information.
In the above-mentioned technical solutions, prompt information is issued on suitable opportunity, user is allowed not know how description topic When, relevant topic type way to put questions information is provided in time, allows user that can accurately describe the topic for oneself wanting to ask, so that subsequent obtain Accurately feedback, improves the usage experience of user.
Further, further includes: when receiving the voice question information of user, then it is assumed that detecting that user encounters will not do Topic;Or, when taking user when same topic residence time is more than preset time, then it is assumed that detect that user encounters The topic that will not be done.
In the above-mentioned technical solutions, the sending of various ways triggering prompt information, keeps human-computer interaction more intelligent.
The present invention also provides a kind of voice interaction devices, comprising: receiving module, for receiving the topic type voice of user's input Information;Identification module is converted to corresponding text information for the topic type voice messaging to be carried out speech recognition;Match mould Block obtains the corresponding topic type of the text information for matching in default topic type way to put questions database to the text information Way to put questions information.
Further, the identification module is converted to corresponding text for the topic type voice messaging to be carried out speech recognition Word information includes: the identification module, and the topic type voice is carried out speech recognition, semantic parsing, is converted to corresponding text Information, the text information include: discipline information and topic type information;The matching module, in default topic type way to put questions data Library matches the text information, and obtaining the corresponding topic type way to put questions information of the text information includes: the matching module, The topic type information is matched in the corresponding list of the discipline information of default topic type way to put questions database, is obtained described The corresponding topic type way to put questions information of text information.
Further, further includes: cue module, for when detecting that user encounters the topic that will not be done, issuing prompt letter Breath, the prompt information are the topic type for prompting user to read the topic;The receiving module, for receiving the topic of user's input Type voice messaging specifically: the receiving module receives the topic type voice messaging that user inputs according to the prompt information.
The present invention also provides a kind of terminal device, including memory, processor and storage are in the memory and can The computer program run on the processor, the processor realize any of the above-described a institute when running the computer program The step of stating voice interactive method.
The present invention also provides a kind of computer readable storage medium, the computer-readable recording medium storage has computer Program, which is characterized in that the computer program realizes any of the above-described voice interactive method when being executed by processor Step.
Compared with prior art, voice interactive method of the invention and device, terminal device, computer readable storage medium Beneficial effect is:
The present invention can effectively solve the problem of user when doing one's assignment in face of problem and do not know how opening inquiry, Directly topic type title is read out, the feedback of effective topic type way to put questions information can be obtained, human-computer interaction is allowed to become more smooth.
Detailed description of the invention
Below by clearly understandable mode, preferred embodiment is described with reference to the drawings, to a kind of voice interactive method and Device, terminal device, computer readable storage medium above-mentioned characteristic, technical characteristic, advantage and its implementation give into one Walk explanation.
Fig. 1 is the flow chart of voice interactive method one embodiment of the present invention;
Fig. 2 is a kind of schematic diagram of topic type of the present invention;
Fig. 3 is the schematic diagram that a kind of topic type way to put questions information of the present invention is shown;
Fig. 4 is the flow chart of another embodiment of voice interactive method of the present invention;
Fig. 5 is the structural schematic diagram of terminal device one embodiment of the present invention;
Fig. 6 is the flow chart of another embodiment of voice interactive method of the present invention;
Fig. 7 is the flow chart of voice interactive method further embodiment of the present invention;
Fig. 8 is the structural schematic diagram of voice interaction device one embodiment of the present invention;
Fig. 9 is the structural schematic diagram of another embodiment of voice interaction device of the present invention;
Drawing reference numeral explanation:
8. voice interaction device, 81. receiving modules, 82. identification modules, 83. matching modules, 84. cue modules, 85. clap Take the photograph module, 86. display modules, 5. terminal devices, 51. memories, 52. computer programs, 53. processors.
Specific embodiment
In being described below, for illustration and not for limitation, the tool of such as particular system structure, technology etc is proposed Body details, so as to provide a thorough understanding of the present application embodiment.However, it will be clear to one skilled in the art that there is no these specific The application also may be implemented in the other embodiments of details.In other cases, it omits to well-known system, device, electricity The detailed description of road and method, so as not to obscure the description of the present application with unnecessary details.
It should be appreciated that ought use in this specification and in the appended claims, term " includes " indicates the description Feature, entirety, step, operation, the presence of element and/or component, but one or more other features, entirety, step are not precluded Suddenly, the presence or addition of operation, element, component and/or set.
To make simplified form, part related to the present invention is only schematically shown in each figure, they are not represented Its practical structures as product.In addition, there is identical structure or function in some figures so that simplified form is easy to understand Component only symbolically depicts one of those, or has only marked one of those.Herein, "one" is not only indicated " only this ", can also indicate the situation of " more than one ".
It will be further appreciated that the term "and/or" used in present specification and the appended claims is Refer to any combination and all possible combinations of one or more of associated item listed, and including these combinations.
In the specific implementation, terminal device described in the embodiment of the present application is including but not limited to such as with the sensitive table of touch Mobile phone, laptop computer or the tablet computer in face (for example, touch-screen display and/or touch tablet) etc other Portable device.It is to be further understood that in certain embodiments, the terminal device is not portable communication device, but Desktop computer with touch sensitive surface (such as: touch-screen display and/or touch tablet).
In following discussion, the terminal device including display and touch sensitive surface is described.However, should manage Solution, terminal device may include that other one or more physical Users of such as physical keyboard, mouse and/or control-rod connect Jaws equipment.
Terminal device supports various application programs, such as one of the following or multiple: drawing application program, demonstration application Program, network creation application program, word-processing application, disk imprinting application program, spreadsheet applications, game are answered With program, telephony application, videoconference application, email application, instant messaging applications, forging Refining supports application program, photo management application program, digital camera application program, digital camera applications program, web browsing to answer With program, digital music player application and/or video frequency player application program.
At least one of such as touch sensitive surface can be used in the various application programs that can be executed on the terminal device Public physical user-interface device.It can be adjusted among applications and/or in corresponding application programs and/or change touch is quick Feel the corresponding information shown in the one or more functions and terminal on surface.In this way, terminal public physical structure (for example, Touch sensitive surface) it can support the various application programs with user interface intuitive and transparent for a user.
In addition, term " first ", " second " etc. are only used for distinguishing description, and should not be understood as in the description of the present application Indication or suggestion relative importance.
In order to more clearly explain the embodiment of the invention or the technical proposal in the existing technology, Detailed description of the invention will be compareed below A specific embodiment of the invention.It should be evident that drawings in the following description are only some embodiments of the invention, for For those of ordinary skill in the art, without creative efforts, it can also be obtained according to these attached drawings other Attached drawing, and obtain other embodiments.
Fig. 1 shows a kind of implementation flow chart of voice interactive method of the present invention, which can be applied to Terminal device (such as: tablet computer, private tutor's machine etc. understand in the present embodiment, all using private tutor's machine as subject solution It releases, but those skilled in the art understands that the voice interactive method can also be applied to other terminal devices, as long as being able to achieve phase Answer function), the voice interactive method the following steps are included:
S101 receives the topic type voice messaging of user's input.
Specifically, topic type voice messaging refers to the topic type for the topic that user reads.For example, in Fig. 2, if user wants to ask " b à i This problem of ni á n ", but do not know and how to describe this problem, this can be read and inscribe corresponding topic type, i.e., " see phonetic, write word The topic type voice messaging that language " is inputted as user, to inquire corresponding topic type way to put questions information.
Topic type voice messaging is carried out speech recognition by S102, is converted to corresponding text information.
Specifically, this step is the general step during interactive voice, i.e., topic type voice messaging is passed through into speech recognition, Text information is converted to, for using when subsequent match.
S103 matches text information in default topic type way to put questions database, obtains the corresponding topic type way to put questions of text information Information.
Specifically, default topic type way to put questions database can be tabular form (it is of course also possible to be other forms, as long as energy Realize matching), it comprises multiple topic types and its problems of correspondence, as shown in following table one.
Table one
Topic type Way to put questions
It sees phonetic, writes word xxxxxx
Sub- Lian Yilian in the same old way keeps sentence expression accurate yyyyyy
Ancient poetry supplement is complete aaaaaaa
Single choice bbbbbb
Assuming that table one is default topic type way to put questions database, it is according to the text information that the topic type voice messaging of user is converted " seeing phonetic, write word " matches and carrys out corresponding topic type way to put questions information for " xxxxxx ".
Preferably, voice interactive method is further comprising the steps of: displaying topic type way to put questions information.
There are many exhibition methods of topic type way to put questions information:
The first, text is shown, in the display screen display topic type way to put questions information of private tutor's machine, as shown in Figure 3.
Second, voice is shown, i.e., says a specific example with the form of voice broadcast, for reference.
The third, text and voice combination displaying topic type way to put questions information.Such as: when finding topic type way to put questions information (i.e. topic type Way to put questions guide) after, " this is encountered in the display screen display topic type way to put questions information of private tutor's machine, while for the description of example voice You can ask in this way when class topic type: xxxxxx ", describe an example equal to voice, further user be helped to understand how to ask Ask the topic for oneself wanting to ask.
Certainly, other exhibition methods can also be not make herein as long as user's understanding topic type way to put questions information can be allowed Limitation.
In the present embodiment, user can inquire the corresponding topic type way to put questions of this topic type by reading the topic type for the topic for wanting to ask and believe Breath, so that user be allow accurately to inquire using related way to put questions, description is more accurate, and terminal device can also provide more accurately Feedback.
In addition, topic type voice messaging is because belonging to specific some topic types in the present embodiment, it is only necessary to carry out speech recognition, turn It is changed to text information, is not related to semantic parsing, while simplifying identification difficulty, also can guarantee topic type way to put questions information High matching degree.
In another embodiment of the present invention, it is improved for Fig. 1, as shown in figure 4, a kind of voice interactive method packet It includes:
S401 issues prompt information when detecting that user encounters the topic that will not be done, and prompt information is that prompt user reads Purpose of setting a question topic type.
Specifically, only meeting certain condition (such as: when detecting that user encounters the topic that will not be done), the just meeting of private tutor's machine Prompt information prompt user's reading topic type is issued to inquire topic type way to put questions information.
Optionally, when receiving the voice question information of user, then it is assumed that detect that user encounters the topic that will not be done; Or, when taking user when same topic residence time is more than preset time, then it is assumed that detecting that user encounters will not do Topic.
Specifically, voice question information refers to user asks the information for how describing topic.
When user encounters the mesh but do not know of will not thinking over a problem and how to ask sometimes, phonetic order can be directly issued, Such as: " how I will describe topic to inquire how it does " passes through the operations such as speech recognition, semantic parsing, private tutor's mechanism solution For the voice question information for having received user, then it is assumed that detect that user encounters the topic that will not be done, so that it may issue prompt letter Breath prompt user's reading topic type.
If if private tutor's machine supports video capture, whether can also judge user by monitoring the case where user does the homework Encounter the topic that will not be done.
Such as: user is taken when same topic residence time is more than preset time, then it is assumed that detects that user meets To the topic that will not be done.
Topic is more difficult sometimes, and user may spend the long time to go thinking that can just work it out, sometimes It may be exactly unlimitedly to be entangled with how to do, will not ask again on earth, therefore the setting of preset time can allow private tutor's machine reasonable Opportunity issue prompt information and if user wants to ask can read type of setting a question, i.e. input topic type voice messaging, if user is not desired to ask, Also it can be ignored, oneself continues to study.
The length of preset time can realize individualized fit according to different user's self-settinies, such as: the family of user A Religion machine preset time is set as 10 minutes, and private tutor's machine preset time of user B is set as 15 minutes etc..
S402 receives the topic type voice messaging that user inputs according to prompt information.
Topic type voice messaging is carried out speech recognition by S403, is converted to corresponding text information.
S404 matches text information in default topic type way to put questions database, obtains the corresponding topic type way to put questions of text information Information.
Optionally, displaying topic type way to put questions information.
In the present embodiment, private tutor's chance issues prompt information on suitable opportunity, and user is allowed not know how description topic When mesh, relevant topic type way to put questions information is provided in time, allows user that can accurately describe the topic for oneself wanting to ask, thus subsequent To accurate feedback, the usage experience of user is improved.
Fig. 6 shows the implementation flow chart of another voice interactive method of the invention, which can apply In terminal device (such as: tablet computer, private tutor's machine etc., in the present embodiment for convenience of understand, all using private tutor's machine as subject solution It releases, but those skilled in the art understands that the voice interactive method can also be applied to other terminal devices, as long as being able to achieve phase Answer function), the voice interactive method the following steps are included:
S601 receives the topic type voice messaging of user's input;
Topic type voice messaging is carried out speech recognition by S602, is converted to corresponding text information;
Text information is matched in default topic type way to put questions database, obtains the corresponding topic type way to put questions information of text information Specifically: S603 matches text information in default topic type way to put questions database, when not being matched to and text information complete one When the topic type way to put questions information of cause, using the highest topic type way to put questions information of matching degree as the corresponding topic type way to put questions information of text information.
Specifically, same topic type may be taken since the speech habits for the people that sets a question are different on different papers, exercise-book At different names.Such as: see that phonetic, writing of Chinese characters can also be named as: see phonetic, write word.
Including default topic type way to put questions database can all collect the different call of same topic type when establishing, convenient for subsequent Accurate matching.
It, at this time can be by the highest topic of matching degree but there is always having no idea to be matched to completely the same situation sometimes Type way to put questions information shows user, user is allowed voluntarily to judge as the corresponding topic type way to put questions information of text information.
Optionally, topic type voice messaging is carried out speech recognition by S602, and being converted to corresponding text information further comprises: Topic type voice messaging is subjected to speech recognition, semantic parsing, is converted to corresponding text information.
Specifically, after passing through speech recognition, then pass through the extraction of semantic parsing progress key message, composition text letter Breath.
Such as: the topic type voice messaging of user is " phonetic is seen by first part, writes word ", and it is exactly " that speech recognition, which comes out, A part sees phonetic, writes word ", it is parsed using semanteme, the text information (i.e. key message) of conversion is " to see phonetic, write word " first part " this information filtering unrelated with topic type is fallen, improves the precision of subsequent match by language ".
Optionally, the voice interactive method of the present embodiment further include: when detecting that user encounters the topic that will not be done, hair Prompt information out, prompt information are the topic type for prompting user to read topic;Receive the topic type voice messaging of user's input specifically: Receive the topic type voice messaging that user inputs according to prompt information.
Preferably, when receiving the voice question information of user, then it is assumed that detect that user encounters the topic that will not be done; Or, when taking user when same topic residence time is more than preset time, then it is assumed that detecting that user encounters will not do Topic.
In the present embodiment, when not being matched to the topic type way to put questions information completely the same with text information, then most by matching degree High topic type way to put questions information output, for reference, matching degree highest explanation is immediate with text information, and most having can It can be the topic type way to put questions information that user wants, provide good usage experience in many aspects for user.
Fig. 7 shows the implementation flow chart of another voice interactive method of the invention, which can apply In terminal device (such as: tablet computer, private tutor's machine etc., in the present embodiment for convenience of understand, all using private tutor's machine as subject solution It releases, but those skilled in the art understands that the voice interactive method can also be applied to other terminal devices, as long as being able to achieve phase Answer function), voice interactive method the following steps are included:
When detecting that user encounters the topic that will not be done, prompt information is issued, prompt information is that prompt user's reading should The subject and topic type of topic;Receive the topic type voice messaging of user's input specifically: receive what user inputted according to prompt information Topic type voice messaging (this step is optional step).
S701 receives the topic type voice messaging of user's input;
Topic type voice messaging is subjected to speech recognition, being converted to corresponding text information further comprises: S702 will inscribe type Voice carries out speech recognition, semantic parsing, is converted to corresponding text information, and text information includes: discipline information and topic type letter Breath.
Specifically, the topic topic type of different subjects is the same sometimes, but the particularity based on different subjects, it Way to put questions can exist difference, therefore, user input topic type voice messaging in addition to containing topic type information, sometimes also It can include discipline information, improve the accuracy of subsequent topic type way to put questions information matches.
After user's input topic type voice messaging, by speech recognition and it is semantic parse, thus accurately extract subject and The corresponding information of topic type.
Such as: the topic type voice messaging of user is " single choice test items of mathematics ", after speech recognition and semantic parsing, Discipline information is mathematics in obtained text information, and topic type information is single choice test items.
Text information is matched in default topic type way to put questions database, obtains the corresponding topic type way to put questions information of text information Include: that S703 matches topic type information in the corresponding list of discipline information of default topic type way to put questions database, obtains text The corresponding topic type way to put questions information of word information.
Specifically, the corresponding topic type of different subjects can be associated in default topic type way to put questions database, subsequent match When, it is directly reduced the scope lookup according to the discipline information in text information, matching speed and precision can be improved.
Such as: default topic type way to put questions database has multiple lists, comprising: the column of mathematically related topic type way to put questions information composition Table, the list of the relevant topic type way to put questions information composition of history, the list etc. of the relevant topic type way to put questions information composition of Chinese language, matching When, list related is directly targeted to according to discipline information and is matched.
Optionally, topic type information is matched in the corresponding list of discipline information of default topic type way to put questions database, When not being matched to the topic type way to put questions information completely the same with topic type information, using the highest topic type way to put questions information of matching degree as this The corresponding topic type way to put questions information of text information.
Optionally, further includes: displaying topic type way to put questions information.
In the present embodiment, user can be effectively solved when doing one's assignment, and in face of problem and do not know how opening inquiry Problem directly reads out topic type title, and the feedback of effective topic type way to put questions information can be obtained, human-computer interaction is allowed to become more suitable Freely;And in topic type voice messaging may include discipline information, to improve speed and precision when matching topic type way to put questions information.
It should be understood that in the above-described embodiments, the size of each step number is not meant that the order of the execution order, each step Execution sequence should determine that the implementation process of the embodiments of the invention shall not be constituted with any limitation with function and internal logic.
Fig. 8 is the schematic diagram of voice interaction device 8 provided by the present application, for ease of description, is illustrated only and the application The relevant part of embodiment.
The voice interaction device can be the software unit being built in terminal device, hardware cell or soft or hard combination Unit can also be used as independent pendant and be integrated into terminal device.
The voice interaction device includes:
Receiving module 81, for receiving the topic type voice messaging of user's input.
Specifically, topic type voice messaging refers to the topic type for the topic that user reads.For example, in Fig. 2, if user wants to ask " b à i This problem of ni á n ", but do not know and how to describe this problem, this can be read and inscribe corresponding topic type, i.e., " see phonetic, write word The topic type voice messaging that language " is inputted as user, to inquire corresponding topic type way to put questions information.
Identification module 82 is converted to corresponding text information for topic type voice messaging to be carried out speech recognition.
Matching module 83 obtains text information pair for matching in default topic type way to put questions database to text information The topic type way to put questions information answered.
Specifically, default topic type way to put questions database can be tabular form (it is of course also possible to be other forms, as long as energy Realize matching), it comprises multiple topic types and its problems of correspondence, shown in table one as above.
Assuming that table one is default topic type way to put questions database, it is according to the text information that the topic type voice messaging of user is converted " seeing phonetic, write word " matches and carrys out corresponding topic type way to put questions information for " xxxxxx ".
Preferably, voice interaction device further include: display module is used for displaying topic type way to put questions information.
There are many exhibition methods of topic type way to put questions information:
The first, text is shown, in the display screen display topic type way to put questions information of private tutor's machine, as shown in Figure 3.
Second, voice is shown, i.e., says a specific example with the form of voice broadcast, for reference.
The third, text and voice combination displaying topic type way to put questions information.Such as: when finding topic type way to put questions information (i.e. topic type Way to put questions guide) after, " this is encountered in the display screen display topic type way to put questions information of private tutor's machine, while for the description of example voice You can ask in this way when class topic type: xxxxxx ", describe an example equal to voice, further user be helped to understand how to ask Ask the topic for oneself wanting to ask.
Certainly, other exhibition methods can also be not make herein as long as user's understanding topic type way to put questions information can be allowed Limitation.
In the present embodiment, user can inquire the corresponding topic type way to put questions of this topic type by reading the topic type for the topic for wanting to ask and believe Breath, so that user be allow accurately to inquire using related way to put questions, description is more accurate, and terminal device can also provide more accurately Feedback.
In addition, topic type voice messaging is because belonging to specific some topic types in the present embodiment, it is only necessary to carry out speech recognition, turn It is changed to text information, is not related to semantic parsing, while simplifying identification difficulty, also can guarantee topic type way to put questions information High matching degree.
It in another Installation practice of the invention, is improved for above-mentioned apparatus embodiment, as shown in figure 9, one Planting voice interaction device includes:
Cue module 84, for issuing prompt information when detecting that user encounters the topic that will not be done, prompt information is User is prompted to read the topic type of topic.
Specifically, only meeting certain condition (such as: when detecting that user encounters the topic that will not be done), the just meeting of private tutor's machine Prompt information prompt user's reading topic type is issued to inquire topic type way to put questions information.
Optionally, receiving module 81, for receiving the voice question information of user, when the voice for receiving user is putd question to When information, then it is assumed that detect that user encounters the topic that will not be done;Or, voice interaction device further include: shooting module 85 is used for Shooting user inscribe, when take user same topic residence time be more than preset time when, then it is assumed that detect user Encounter the topic that will not be done.
Specifically, voice question information refers to user asks the information for how describing topic.
When user encounters the mesh but do not know of will not thinking over a problem and how to ask sometimes, phonetic order can be directly issued, Such as: " how I will describe topic to inquire how it does " passes through the operations such as speech recognition, semantic parsing, private tutor's mechanism solution For the voice question information for having received user, then it is assumed that detect that user encounters the topic that will not be done, so that it may issue prompt letter Breath prompt user's reading topic type.
If if private tutor's machine supports video capture, whether can also judge user by monitoring the case where user does the homework Encounter the topic that will not be done.
Topic is more difficult sometimes, and user may spend the long time to go thinking that can just work it out, sometimes It may be exactly unlimitedly to be entangled with how to do, will not ask again on earth, therefore the setting of preset time can allow private tutor's machine reasonable Opportunity issue prompt information and if user wants to ask can read type of setting a question, i.e. input topic type voice messaging, if user is not desired to ask, Also it can be ignored, oneself continues to study.
The length of preset time can realize individualized fit according to different user's self-settinies, such as: the family of user A Religion machine preset time is set as 10 minutes, and private tutor's machine preset time of user B is set as 15 minutes etc..
Receiving module 81, the topic type voice messaging inputted for receiving user according to prompt information;
Identification module 82 is converted to corresponding text information for topic type voice messaging to be carried out speech recognition;
Matching module 83 obtains text information pair for matching in default topic type way to put questions database to text information The topic type way to put questions information answered.
Optionally, display module 86 are used for displaying topic type way to put questions information.
In the present embodiment, private tutor's chance issues prompt information on suitable opportunity, and user is allowed not know how description topic When mesh, relevant topic type way to put questions information is provided in time, allows user that can accurately describe the topic for oneself wanting to ask, thus subsequent To accurate feedback, the usage experience of user is improved.
In the embodiment of another voice interaction device of the invention, comprising:
Receiving module 81, for receiving the topic type voice messaging of user's input;
Identification module 82 is converted to corresponding text information for topic type voice messaging to be carried out speech recognition;
Matching module 83 obtains text information pair for matching in default topic type way to put questions database to text information The topic type way to put questions information answered includes: matching module 83, is matched in default topic type way to put questions database to text information, when not When being fitted on the topic type way to put questions information completely the same with text information, using the highest topic type way to put questions information of matching degree as text information Corresponding topic type way to put questions information.
Specifically, same topic type may be taken since the speech habits for the people that sets a question are different on different papers, exercise-book At different names.Such as: see that phonetic, writing of Chinese characters can also be named as: see phonetic, write word.
Including default topic type way to put questions database can all collect the different call of same topic type when establishing, convenient for subsequent Accurate matching.
It, at this time can be by the highest topic of matching degree but there is always having no idea to be matched to completely the same situation sometimes Type way to put questions information shows user, user is allowed voluntarily to judge as the corresponding topic type way to put questions information of text information.
Optionally, identification module 82 are converted to corresponding text information for topic type voice messaging to be carried out speech recognition Further comprise: topic type voice messaging is carried out speech recognition, semantic parsing, is converted to corresponding text letter by identification module 82 Breath.
Specifically, after passing through speech recognition, then pass through the extraction of semantic parsing progress key message, composition text letter Breath.
Such as: the topic type voice messaging of user is " phonetic is seen by first part, writes word ", and it is exactly " that speech recognition, which comes out, A part sees phonetic, writes word ", it is parsed using semanteme, the text information (i.e. key message) of conversion is " to see phonetic, write word " first part " this information filtering unrelated with topic type is fallen, improves the precision of subsequent match by language ".
Optionally, the voice interaction device of the present embodiment further include: cue module 84 detects that user encounters not for working as When the topic that can be done, prompt information is issued, prompt information is the topic type for prompting user to read topic.
Optionally, receiving module 81, for receiving the voice question information of user, when the voice for receiving user is putd question to When information, then it is assumed that detect that user encounters the topic that will not be done;Or, voice interaction device further include: shooting module 85 is used for Shooting user inscribe, when take user same topic residence time be more than preset time when, then it is assumed that detect user Encounter the topic that will not be done.
In the present embodiment, when not being matched to the topic type way to put questions information completely the same with text information, then most by matching degree High topic type way to put questions information output, for reference, matching degree highest explanation is immediate with text information, and most having can It can be the topic type way to put questions information that user wants, provide good usage experience in many aspects for user.
In the embodiment of another voice interaction device of the invention, comprising:
Receiving module 81, for receiving the topic type voice messaging of user's input;
It is further to be converted to corresponding text information for topic type voice messaging to be carried out speech recognition for identification module 82 Include: identification module 82, topic type voice is subjected to speech recognition, semantic parsing, is converted to corresponding text information, text information It include: discipline information and topic type information.
Specifically, the topic topic type of different subjects is the same sometimes, but the particularity based on different subjects, it Way to put questions can exist difference, therefore, user input topic type voice messaging in addition to containing topic type information, sometimes also It can include discipline information, improve the accuracy of subsequent topic type way to put questions information matches.
After user's input topic type voice messaging, by speech recognition and it is semantic parse, thus accurately extract subject and The corresponding information of topic type.
Such as: the topic type voice messaging of user is " single choice test items of mathematics ", after speech recognition and semantic parsing, Discipline information is mathematics in obtained text information, and topic type information is single choice test items.
Matching module 83 matches text information in default topic type way to put questions database, it is corresponding to obtain text information Topic type way to put questions information includes: matching module 83, to topic type in the corresponding list of discipline information of default topic type way to put questions database Information is matched, and the corresponding topic type way to put questions information of text information is obtained.
Specifically, the corresponding topic type of different subjects can be associated in default topic type way to put questions database, subsequent match When, it is directly reduced the scope lookup according to the discipline information in text information, matching speed and precision can be improved.
Optionally, topic type is believed in the corresponding list of discipline information of default topic type way to put questions database in matching module 83 Breath is matched, and when not being matched to the topic type way to put questions information completely the same with topic type information, the highest topic type of matching degree is asked Method information is as the corresponding topic type way to put questions information of the text information.
Optionally, display module 86 are used for displaying topic type way to put questions information.
In the present embodiment, user can be effectively solved when doing one's assignment, and in face of problem and do not know how opening inquiry Problem directly reads out topic type title, and the feedback of effective topic type way to put questions information can be obtained, human-computer interaction is allowed to become more suitable Freely;And in topic type voice messaging may include discipline information, to improve speed and precision when matching topic type way to put questions information.
It is apparent to those skilled in the art that for convenience of description and succinctly, only with above-mentioned each journey The division progress of sequence module can according to need and for example, in practical application by above-mentioned function distribution by different programs Module is completed, i.e., the internal structure of described device is divided into different program unit or module, described above complete to complete Portion or partial function.Each program module in embodiment can integrate in one processing unit, can also be each unit list It is solely physically present, can also be integrated in a processing unit with two or more units, above-mentioned integrated unit both can be with Using formal implementation of hardware, can also be realized in the form of software program unit.In addition, the specific name of each program module Also it is only for convenience of distinguishing each other, the protection scope being not intended to limit this application.
Fig. 5 is the structural schematic diagram of the terminal device 5 provided in one embodiment of the invention.As shown in figure 5, the present embodiment Terminal device 5 include: processor 53, memory 51 and be stored in the memory 51 and can be on the processor 53 The computer program 52 of operation, such as: interactive voice program.The processor 53 is realized when executing the computer program 52 The step in each voice interactive method embodiment is stated, alternatively, realization when the processor 53 executes the computer program 52 The function of each module in above-mentioned each voice interaction device embodiment.
The terminal device 5 can set for desktop PC, notebook, palm PC, Tablet PC, mobile phone etc. It is standby.The terminal device 5 may include, but be not limited only to, processor 53, memory 51.It will be understood by those skilled in the art that figure 5 be only the example of terminal device, does not constitute the restriction to terminal device 5, may include than illustrating more or fewer portions Part perhaps combines certain components or different components, such as: terminal device can also be set including input-output equipment, display Standby, network access equipment, bus etc..
The processor 53 can be central processing unit (Central Processing Unit, CPU), can also be Other general processors, digital signal processor (Digital Signal Processor, DSP), specific integrated circuit (Application Specific Integrated Circuit, ASIC), field programmable gate array (Field- Programmable Gate Array, FPGA) either other programmable logic device, discrete gate or transistor logic, Discrete hardware components etc..General processor can be microprocessor or the processor is also possible to any conventional processor Deng.
The memory 51 can be the internal storage unit of the terminal device 5, such as: the hard disk of terminal device is interior It deposits.The memory is also possible to the External memory equipment of the terminal device, such as: the grafting being equipped on the terminal device Formula hard disk, intelligent memory card (Smart Media Card, SMC), secure digital (Secure Digital, SD) card, flash card (Flash Card) etc..Further, the memory 51 can also both including the terminal device 5 internal storage unit or Including External memory equipment.The memory 51 is for storing required for the computer program 52 and the terminal device 5 Other programs and data.The memory can be also used for temporarily storing the data that has exported or will export.
In the above-described embodiments, it all emphasizes particularly on different fields to the description of each embodiment, is not described in some embodiment Or the part recorded, reference can be made to the related descriptions of other embodiments.
Those of ordinary skill in the art may be aware that list described in conjunction with the examples disclosed in the embodiments of the present disclosure Member and algorithm steps can be realized with the combination of electronic hardware or computer software and electronic hardware.These functions are actually It is executed with hardware or software, specific application and design constraint depending on technical solution.Professional technician can be with Each specific application is used different methods to achieve the described function, but this realization is it is not considered that exceed this Shen Range please.
In embodiment provided herein, it should be understood that disclosed device/terminal device and method, it can be with It realizes in other way.For example, device described above/terminal device embodiment is only schematical, for example, institute The division of module or unit is stated, only a kind of logical function partition, there may be another division manner in actual implementation, example Such as, multiple units or components can be combined or can be integrated into another system, or some features can be ignored, or not hold Row.Another point, shown or discussed mutual coupling or direct-coupling or communication connection can be through some interfaces, The INDIRECT COUPLING or communication connection of device or unit can be electrical, mechanical or other forms.
The unit as illustrated by the separation member may or may not be physically separated, aobvious as unit The component shown may or may not be physical unit, it can and it is in one place, or may be distributed over multiple In network unit.It can select some or all of unit therein according to the actual needs to realize the mesh of this embodiment scheme 's.
It, can also be in addition, each functional unit in each embodiment of the application may be integrated in a processing unit It is that each unit physically exists alone, can also be integrated in one unit with two or more units.Above-mentioned integrated list Member both can take the form of hardware realization, can also realize in the form of software functional units.
If the integrated module/unit be realized in the form of SFU software functional unit and as independent product sale or In use, can store in a computer readable storage medium.Based on this understanding, the present invention realizes above-described embodiment All or part of the process in method can also send instructions to relevant hardware by computer program and complete, the meter Calculation machine program can be stored in a computer readable storage medium, the computer program when being executed by processor, it can be achieved that on The step of stating each embodiment of the method.Wherein, the computer program includes: computer program code, the computer program Code can be source code form, object identification code form, executable file or certain intermediate forms etc..It is described computer-readable to deposit Storage media may include: any entity or device, recording medium, USB flash disk, mobile hard that can carry the computer program code Disk, magnetic disk, CD, computer storage, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), electric carrier signal, telecommunication signal and software distribution medium etc..It needs to illustrate Be, the content that the computer readable storage medium includes can according in jurisdiction make laws and patent practice requirement into Row increase and decrease appropriate, such as: it does not include electricity according to legislation and patent practice, computer-readable medium in certain jurisdictions Carrier signal and telecommunication signal.
It should be noted that above-described embodiment can be freely combined as needed.The above is only of the invention preferred Embodiment, it is noted that for those skilled in the art, in the premise for not departing from the principle of the invention Under, several improvements and modifications can also be made, these modifications and embellishments should also be considered as the scope of protection of the present invention.

Claims (10)

1. a kind of voice interactive method, which comprises the following steps:
Receive the topic type voice messaging of user's input;
The topic type voice messaging is subjected to speech recognition, is converted to corresponding text information;
The text information is matched in default topic type way to put questions database, obtains the corresponding topic type way to put questions of the text information Information.
2. voice interactive method as described in claim 1, which is characterized in that it is described in default topic type way to put questions database to institute It states text information to be matched, obtaining the corresponding topic type way to put questions information of the text information includes:
The text information is matched in default topic type way to put questions database, when not being matched to and the text information complete one When the topic type way to put questions information of cause, believe the highest topic type way to put questions information of matching degree as the corresponding topic type way to put questions of the text information Breath.
3. voice interactive method as described in claim 1, which is characterized in that described that the topic type voice messaging is carried out language Sound identification, being converted to corresponding text information further comprises:
The topic type voice is subjected to speech recognition, semantic parsing, is converted to corresponding text information, the text information packet It includes: discipline information and topic type information;
Described matches the text information in default topic type way to put questions database, obtains the corresponding topic of the text information Type way to put questions information includes:
The topic type information is matched in the corresponding list of the discipline information of default topic type way to put questions database, is obtained The corresponding topic type way to put questions information of the text information.
4. voice interactive method as described in claim 1, which is characterized in that further comprising the steps of:
When detecting that user encounters the topic that will not be done, prompt information is issued, the prompt information reads institute for prompt user State the topic type of topic;
The topic type voice messaging for receiving user's input specifically:
Receive the topic type voice messaging that user inputs according to the prompt information.
5. voice interactive method as claimed in claim 4, which is characterized in that further include:
When receiving the voice question information of user, then it is assumed that detect that user encounters the topic that will not be done;
Or, when taking user when same topic residence time is more than preset time, then it is assumed that detect that user encounters not The topic that can be done.
6. a kind of voice interaction device characterized by comprising
Receiving module, for receiving the topic type voice messaging of user's input;
Identification module is converted to corresponding text information for the topic type voice messaging to be carried out speech recognition;
Matching module obtains the text information for matching in default topic type way to put questions database to the text information Corresponding topic type way to put questions information.
7. voice interaction device as claimed in claim 6, which is characterized in that the identification module is used for the topic type language Message breath carries out speech recognition, and being converted to corresponding text information includes:
The topic type voice is carried out speech recognition, semantic parsing, is converted to corresponding text information, institute by the identification module Stating text information includes: discipline information and topic type information;
The matching module obtains the text for matching in default topic type way to put questions database to the text information The corresponding topic type way to put questions information of information includes:
The matching module, to the topic type information in the corresponding list of the discipline information of default topic type way to put questions database It is matched, obtains the corresponding topic type way to put questions information of the text information.
8. voice interaction device as claimed in claim 6, which is characterized in that further include:
Cue module, for when detecting that user encounters the topic that will not be done, issuing prompt information, the prompt information is to mention Show that user reads the topic type of the topic;
The receiving module, for receiving the topic type voice messaging of user's input specifically:
The receiving module receives the topic type voice messaging that user inputs according to the prompt information.
9. a kind of terminal device, including memory, processor and storage are in the memory and can be on the processor The computer program of operation, which is characterized in that the processor is realized when running the computer program as in claim 1-5 The step of any one voice interactive method.
10. a kind of computer readable storage medium, the computer-readable recording medium storage has computer program, and feature exists In the step of realization voice interactive method as described in any one of claim 1-5 when the computer program is executed by processor Suddenly.
CN201910300604.0A 2019-04-15 2019-04-15 Voice interaction method and device, terminal equipment and computer readable storage medium Active CN110060686B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910300604.0A CN110060686B (en) 2019-04-15 2019-04-15 Voice interaction method and device, terminal equipment and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910300604.0A CN110060686B (en) 2019-04-15 2019-04-15 Voice interaction method and device, terminal equipment and computer readable storage medium

Publications (2)

Publication Number Publication Date
CN110060686A true CN110060686A (en) 2019-07-26
CN110060686B CN110060686B (en) 2021-06-22

Family

ID=67319042

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910300604.0A Active CN110060686B (en) 2019-04-15 2019-04-15 Voice interaction method and device, terminal equipment and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN110060686B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111324206A (en) * 2020-02-28 2020-06-23 重庆百事得大牛机器人有限公司 Gesture interaction-based confirmation information identification system and method

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104050256A (en) * 2014-06-13 2014-09-17 西安蒜泥电子科技有限责任公司 Initiative study-based questioning and answering method and questioning and answering system adopting initiative study-based questioning and answering method
US20170178526A1 (en) * 2013-03-13 2017-06-22 Edulock, Inc. System and Method for Multi-Layered Education Based Locking of Electronic Computing Devices
CN108509439A (en) * 2017-02-24 2018-09-07 上海莘越软件科技有限公司 A kind of Algebra Teaching system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170178526A1 (en) * 2013-03-13 2017-06-22 Edulock, Inc. System and Method for Multi-Layered Education Based Locking of Electronic Computing Devices
CN104050256A (en) * 2014-06-13 2014-09-17 西安蒜泥电子科技有限责任公司 Initiative study-based questioning and answering method and questioning and answering system adopting initiative study-based questioning and answering method
CN108509439A (en) * 2017-02-24 2018-09-07 上海莘越软件科技有限公司 A kind of Algebra Teaching system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111324206A (en) * 2020-02-28 2020-06-23 重庆百事得大牛机器人有限公司 Gesture interaction-based confirmation information identification system and method
CN111324206B (en) * 2020-02-28 2023-07-18 重庆百事得大牛机器人有限公司 System and method for identifying confirmation information based on gesture interaction

Also Published As

Publication number Publication date
CN110060686B (en) 2021-06-22

Similar Documents

Publication Publication Date Title
US9009025B1 (en) Context-based utterance recognition
CN107093339A (en) Display methods of imparting knowledge to students and system
CN108470034A (en) A kind of smart machine service providing method and system
CN108701127A (en) Electronic equipment and its operating method
CN107733984A (en) A kind of method, terminal and computer-readable recording medium for pushing screen locking information
CN110073349A (en) Consider the word order suggestion of frequency and formatted message
Lopatovska et al. User recommendations for intelligent personal assistants
CN108614851A (en) Notes content display methods in tutoring system and device
CN105206123B (en) A kind of deaf and dumb patient's ac equipment
CN104035995A (en) Method and device for generating group tags
CN109902187A (en) A kind of construction method and device, terminal device of feature knowledge map
US11189283B2 (en) Freeform conversation writing assistant
US20230289514A1 (en) Speech recognition text processing method and apparatus, device, storage medium, and program product
CN109801527A (en) Method and apparatus for output information
US20090150341A1 (en) Generation of alternative phrasings for short descriptions
CN109524008A (en) A kind of audio recognition method, device and equipment
CN109241302A (en) A kind of comment authorization method, device and the terminal device of online course
US20190347068A1 (en) Personal history recall
CN106202087A (en) A kind of information recommendation method and device
CN110060686A (en) Voice interactive method and device, terminal device, computer readable storage medium
CN110275948A (en) Free jump method, device and the medium of Self-Service
Oproescu et al. Software and hardware solutions for Using the keyboards by blind people
CN109948155B (en) Multi-intention selection method and device and terminal equipment
CN107819937A (en) A kind of memo information based reminding method and device, terminal and readable storage medium storing program for executing
CN106847287A (en) Word read recognition methods, user terminal and word read identifying system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant