CN110223695B

CN110223695B - Task creation method and mobile terminal

Info

Publication number: CN110223695B
Application number: CN201910570495.4A
Authority: CN
Inventors: 彭思文
Original assignee: Vivo Mobile Communication Co Ltd
Current assignee: Vivo Mobile Communication Co Ltd
Priority date: 2019-06-27
Filing date: 2019-06-27
Publication date: 2021-08-27
Anticipated expiration: 2039-06-27
Also published as: CN110223695A

Abstract

The invention provides a task creating method and a mobile terminal, comprising the following steps: acquiring audio information; converting the audio information into text information; matching the text information with a preset first keyword template, and extracting at least one first keyword in the text information; determining the category of the first keyword, and filling the first keyword into a corresponding slot position in the task template to be handled to obtain a target task to be handled; the task template to be handled comprises fixed action corpora and at least one slot, and the slot is connected with the fixed action corpora. In the invention, the audio information spoken by the user can be subjected to real-time text conversion and template matching in scenes such as conversation or conference, so that the keywords related to the target task to be handled in the user voice are quickly filled into the corresponding slots in the task template to be handled to obtain the target task to be handled, the task to be handled is quickly and efficiently created, the labor and the time are saved, and the efficiency is improved.

Description

Task creation method and mobile terminal

Technical Field

The embodiment of the invention relates to the technical field of communication, in particular to a task creating method and a mobile terminal.

Background

When people exchange information or negotiate business, calls and conferences are common means of communication. Calls and conferences often give rise to to-do tasks, but in fast-paced communication it is often difficult to remember this information.

In the prior art, a to-do task is usually created in one of two ways. In the first, meeting minutes are taken manually in real time during the call or conference, and the to-do tasks arising in the call or conference are recorded manually, so that after the call or conference ends, each to-do task is dispatched to the corresponding responsible person. In the second, the call or conference is audio-recorded, the to-do tasks are created by listening to the recording after the call or conference ends, and the to-do tasks are then dispatched to the corresponding responsible persons.

However, in the current schemes, the first way requires manually taking meeting minutes, manually creating the to-do tasks, and distributing them to each responsible person, which is inefficient and labor-intensive; the second way requires a lot of time for analyzing the recording, so creating and distributing the to-do tasks takes a long time.

Disclosure of Invention

The embodiment of the invention provides a task creating method and a mobile terminal, and aims to solve the problems that time consumption for creating and distributing tasks to be handled is long, efficiency is low, and manpower is consumed.

In a first aspect, an embodiment of the present invention provides a task creating method, applied to a mobile terminal, where the method includes:

acquiring audio information;

converting the audio information into text information;

matching the text information with a preset first keyword template, and extracting at least one first keyword in the text information;

determining the category of the first keyword, and filling the first keyword into a corresponding slot position in a task template to be handled to obtain a target task to be handled;

the task template to be handled comprises fixed action corpora and at least one slot position, the slot position is connected with the fixed action corpora, and the slot position corresponds to the category of the first keyword.

In a second aspect, an embodiment of the present invention provides a mobile terminal, where the mobile terminal includes:

the first acquisition module is used for acquiring audio information;

the conversion module is used for converting the audio information into text information;

the first extraction module is used for matching the text information with a preset first keyword template and extracting at least one first keyword in the text information;

the filling module is used for determining the category of the first keyword, and filling the first keyword into the corresponding slot position in the task template to be handled to obtain the target task to be handled;

In a third aspect, an embodiment of the present invention further provides a mobile terminal, including a processor, a memory, and a computer program stored on the memory and executable on the processor, where the computer program, when executed by the processor, implements the steps of the task creation method provided in the present invention.

In a fourth aspect, the embodiments of the present invention also provide a readable storage medium, where instructions in the storage medium, when executed by a processor of an electronic device, enable the electronic device to perform the steps of the task creation method provided by the present invention.

In the embodiment of the invention, the mobile terminal can acquire audio information; converting the audio information into text information; matching the text information with a preset first keyword template, and extracting at least one first keyword in the text information; determining the category of the first keyword, and filling the first keyword into a corresponding slot position in the task template to be handled to obtain a target task to be handled; the task template to be handled comprises fixed action corpora and at least one slot position, the slot position is connected with the fixed action corpora, and the slot position corresponds to the category of the first keyword.

The foregoing description is only an overview of the technical solutions of the present invention, and the embodiments of the present invention are described below in order to make the technical means of the present invention more clearly understood and to make the above and other objects, features, and advantages of the present invention more clearly understandable.

Drawings

FIG. 1 is a flowchart illustrating steps of a task creation method according to an embodiment of the present invention;

FIG. 2 is an interface diagram of a task creation method according to an embodiment of the present invention;

FIG. 3 is an interface diagram of another task creation method provided by an embodiment of the invention;

FIG. 4 is an interface diagram of another task creation method provided by an embodiment of the invention;

FIG. 5 is a flowchart illustrating specific steps of a task creation method according to an embodiment of the present invention;

FIG. 6 is a block diagram of a mobile terminal according to an embodiment of the present invention;

FIG. 7 is a block diagram of a mobile terminal according to another embodiment of the present invention;

FIG. 8 is a schematic structural diagram of a mobile terminal according to another embodiment of the present invention.

Detailed Description

Exemplary embodiments of the present invention will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the invention are shown in the drawings, it should be understood that the invention can be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art.

Fig. 1 is a flowchart of steps of a task creating method according to an embodiment of the present invention, and as shown in fig. 1, the method may include:

Step 101, obtaining audio information.

In the embodiment of the invention, the mobile terminal can acquire the audio information in real time through the microphone when a call or a conference is carried out.

In addition, when the number of the participants in the call or the conference is large, the mobile terminal can also extract the voiceprint information of each piece of audio information so as to determine the source corresponding to each piece of audio information through the identification of the voiceprint information.

Step 102, converting the audio information into text information.

Specifically, the audio information is converted into the text information using Automatic Speech Recognition (ASR), a technology that converts human speech into text; its aim is to convert the vocabulary content of human speech into computer-readable input. The acquired audio information is converted into corresponding text information by ASR, and once the text information is generated, the mobile terminal can display it.

Step 103, matching the text information with a preset first keyword template, and extracting at least one first keyword in the text information.

In the embodiment of the present invention, a to-do task generated in a call or conference may be understood as an execution instruction. At its most basic, a to-do task has three parameters: an execution action, a task initiator, and a task executor. For example, in the to-do task "Wang Er, help me call a car.", "me" is the task initiator, "Wang Er" is the task executor, and "call a car" is the execution action.

In addition, a more complex to-do task may have further parameters. For example, the to-do task "Wang Er, help me call a car at Xinjiekou at ten o'clock in the morning." has a time parameter, "ten o'clock in the morning", and a location parameter, "Xinjiekou".

Therefore, the first keyword template can be established according to various parameters which can be possessed in the task to be handled.

In one implementation of the embodiment of the present invention, the first keyword template may include a plurality of parameters, such as an execution action parameter, a task initiator parameter, a task executor parameter, a time parameter, and a location parameter. Matching the text information with the preset first keyword template and extracting at least one first keyword may then specifically include: segmenting the text information into words, identifying the semantics of each segmented word through Natural Language Understanding (NLU), and determining a segmented word as a first keyword when its semantics match a parameter in the first keyword template. The goal of NLU is to convert text into a semantic representation, a process also known as semantic decoding: what matters is not the exact wording of the text but the semantic information it conveys. From the text information, the NLU system derives the corresponding semantics.

For example, for the text information "Wang Er, help me call a car at Xinjiekou at ten o'clock in the morning.", NLU can determine that the segmented word "ten o'clock in the morning" has the semantics of time. Since these semantics match the time parameter in the first keyword template, the segmented word "ten o'clock in the morning" can be determined as a first keyword.

In another implementation of the embodiment of the present invention, the first keyword template may include a plurality of parameters, such as an execution action parameter, a task initiator parameter, a task executor parameter, a time parameter, and a location parameter, and each parameter includes a plurality of template keywords corresponding to it. Matching the text information with the preset first keyword template and extracting at least one first keyword may then specifically include: segmenting the text information into words, calculating the similarity between each segmented word and each template keyword, and determining as a first keyword each segmented word whose similarity to a template keyword is greater than or equal to a preset similarity threshold.

For example, suppose the time parameter in the first keyword template includes template keywords from "zero o'clock" to "twenty-four o'clock". For the text information "Wang Er, help me call a car at Xinjiekou at ten o'clock in the morning.", the segmented word "ten o'clock in the morning" matches the template keyword "ten o'clock" in the first keyword template, and can therefore be determined as a first keyword.
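The similarity-threshold variant of step 103 can be sketched as follows. This is a minimal illustration, not the patented implementation: the patent does not name a similarity measure, so the character-level ratio from Python's `difflib` is swapped in here, and the template contents, the function names, and the threshold of 0.5 are all assumptions made for the example.

```python
from difflib import SequenceMatcher

# Hypothetical first keyword template: parameter category -> template keywords.
FIRST_KEYWORD_TEMPLATE = {
    "time": ["ten o'clock", "nine thirty"],
    "place": ["Xinjiekou"],
    "person name": ["Wang Er"],
}


def similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1]; a stand-in for the unspecified measure."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def extract_first_keywords(segments, template, threshold=0.5):
    """Return (segment, category) pairs whose best similarity meets the threshold."""
    keywords = []
    for segment in segments:
        best_category, best_score = None, 0.0
        for category, template_keywords in template.items():
            for keyword in template_keywords:
                score = similarity(segment, keyword)
                if score > best_score:
                    best_category, best_score = category, score
        if best_category is not None and best_score >= threshold:
            keywords.append((segment, best_category))
    return keywords


# Segments as a word segmenter might produce them (assumed input).
segments = ["Wang Er", "ten o'clock in the morning", "help me", "Xinjiekou"]
keywords = extract_first_keywords(segments, FIRST_KEYWORD_TEMPLATE)
```

Because each segment keeps the category of its best-matching template keyword, the same pass also yields the category needed later for slot filling. A production system would more likely use a semantic or phonetic similarity than raw character overlap.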

Step 104, determining the category of the first keyword, and filling the first keyword into a corresponding slot position in the task template to be handled to obtain the target task to be handled.

The task creating method provided by the embodiment of the invention can have various application scenarios, such as a voice call scenario and a voice conference scenario, and to-do task templates corresponding to the intention of the audio information can be created separately for different application scenarios. In a specific application scenario, NLU can be applied to the text information of all the audio information collected in that scenario to obtain the intention corresponding to the audio information and to select the matching to-do task template; the first keyword is then filled into the slot corresponding to its category in that template to obtain the target to-do task.

Specifically, the to-do task template includes a fixed action corpus, which reflects an execution action. The to-do task template may further include at least one slot corresponding to a category of the first keyword; the slot is connected with the fixed action corpus, and each slot has a corresponding category, which can be set according to the actual situation. In the to-do task template, the parameters of the to-do task are expressed through the slots. A slot may be a text placeholder: once the corresponding words and sentences are filled into the slots, the content of the to-do task template is complete and the corresponding target to-do task is obtained.

For example, "{person name} helps me at {time} {place} to carry out {task}" may be a to-do task template, in which "helps me at" and "to carry out" are fixed action corpora and {person name}, {time}, {place}, and {task} are four slots, whose categories are the person name category, the time category, the place category, and the task category, respectively.

Referring to FIG. 2, an interface diagram of a task creation method according to an embodiment of the present invention is shown, in which a boss assigns a task to an employee by voice and the dialog audio of the two people has been converted into corresponding pieces of text information: "Wang Er, help me book a car", "OK", "Where to?", "Nanjing Xinjiekou", "What time?", "Ten o'clock in the morning", and "No problem". Extracting first keywords from the text information with the first keyword template yields the first keywords "Wang Er", "car booking", "Nanjing Xinjiekou", and "ten o'clock in the morning". Their semantics are understood through NLU, and their categories are determined to be the person name category, the task category, the place category, and the time category, respectively. Then, according to the to-do task template "{person name} helps me at {time} {place} to carry out {task}", the first keyword "Wang Er" is filled into the {person name} slot, "ten o'clock in the morning" into the {time} slot, "Nanjing Xinjiekou" into the {place} slot, and "car booking" into the {task} slot, yielding the target to-do task "Wang Er helps me at ten o'clock in the morning Nanjing Xinjiekou to carry out car booking".
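The slot filling of step 104 can be sketched as a placeholder substitution. The English rendering of the template, the `fill_slots` helper, and the dictionary keyed by category are illustrative assumptions; the patent only requires that each slot be filled with the first keyword of the matching category.

```python
import re

# English rendering of the to-do task template from the example (assumed wording).
TEMPLATE = "{person name} helps me at {time} {place} to carry out {task}"


def fill_slots(template: str, keywords_by_category: dict) -> str:
    """Replace each {category} slot with the first keyword of that category.

    A slot with no matching keyword is left in place unchanged, so a
    partially filled (preliminary) task can be completed later.
    """
    def replace(match):
        category = match.group(1)
        return keywords_by_category.get(category, match.group(0))

    return re.sub(r"\{([^{}]+)\}", replace, template)


keywords = {
    "person name": "Wang Er",
    "task": "car booking",
    "time": "ten o'clock in the morning",
    "place": "Nanjing Xinjiekou",
}
target_task = fill_slots(TEMPLATE, keywords)
```

Leaving unmatched slots untouched, rather than raising an error, keeps the fixed action corpora intact and makes it easy to detect which parameters are still missing.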

After the slots are filled, referring to FIG. 3, an interface diagram of another task creation method provided by the embodiment of the present invention is shown. The created target to-do task can be displayed in interface A of FIG. 3 so that the user can confirm whether it is correct. If the user considers that the place in the target to-do task needs to be modified, the user can click the candidate selection button corresponding to the place to enter interface B, which presents candidate words on a candidate word panel. Recognition errors may arise from duplicate place names, polysemous entries, similar pronunciations, dialects, and the like, so the user can select one of the candidate words to replace the place in the target to-do task.

In another case, referring to FIG. 4, an interface diagram of another task creation method provided by the embodiment of the present invention is shown, in which a boss assigns a task to an employee by voice and the dialog audio of the two people has been converted into corresponding pieces of text information: "Wang Er, help me book a car", "OK", "Where to?", "Nanjing Xinjiekou", and "No problem". According to the to-do task template "{person name} helps me at {time} {place} to carry out {task}", the first keyword "Wang Er" is filled into the {person name} slot, "Nanjing Xinjiekou" into the {place} slot, and "car booking" into the {task} slot, yielding the preliminary to-do task "Wang Er helps me at {time} Nanjing Xinjiekou to carry out car booking".

In the preliminary to-do task "Wang Er helps me at {time} Nanjing Xinjiekou to carry out car booking", the {time} slot has not been filled. Therefore, the time parameter of the current scene, "nine thirty in the morning", can be obtained according to the keyword "now" and filled into the {time} slot, yielding the target to-do task "Wang Er helps me at nine thirty in the morning Nanjing Xinjiekou to carry out car booking".
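Completing a preliminary to-do task whose {time} slot is still empty can be sketched as below. The helper names and the "HH:MM" time format are assumptions for illustration; the patent only states that the current time of the scene is filled into the unfilled slot.

```python
import re
from datetime import datetime


def unfilled_slots(task_text: str) -> list:
    """Return the categories of all slots that are still placeholders."""
    return re.findall(r"\{([^{}]+)\}", task_text)


def fill_time_from_context(task_text: str, now: datetime) -> str:
    """Fill a leftover {time} slot with the current time of the scene."""
    return task_text.replace("{time}", now.strftime("%H:%M"))


preliminary = "Wang Er helps me at {time} Nanjing Xinjiekou to carry out car booking"
# The current time is passed in explicitly so the example is reproducible.
completed = fill_time_from_context(preliminary, datetime(2019, 6, 27, 9, 30))
```

Passing the clock value as an argument, instead of calling `datetime.now()` inside the helper, keeps the fallback deterministic and testable.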

To sum up, a task creating method provided by the embodiment of the present invention includes: acquiring audio information; converting the audio information into text information; matching the text information with a preset first keyword template, and extracting at least one first keyword in the text information; determining the category of the first keyword, and filling the first keyword into a corresponding slot position in the task template to be handled to obtain a target task to be handled; the task template to be handled comprises fixed action corpora and at least one slot position, the slot position is connected with the fixed action corpora, and the slot position corresponds to the category of the first keyword.

Fig. 5 is a flowchart of steps of another task creation method provided by an embodiment of the present invention, and as shown in fig. 5, the method may include:

Step 201, audio information is acquired.

The implementation manner of this step is similar to the implementation process of step 101 described above, and the embodiment of the present invention is not described in detail here.

Step 202, extracting the voiceprint information of the audio information through a voiceprint extraction model.

In the embodiment of the invention, because a plurality of users often communicate in a conversation or conference scene, the mobile terminal can determine the source corresponding to each received audio information, so that the target to-be-handled task is sent to the target source, and the task distribution is realized.

Specifically, the voiceprint information of the audio information is extracted through a voiceprint extraction model. A voiceprint is the sound wave spectrum, displayed by an electro-acoustic instrument, that carries speech information. Modern scientific research shows that a voiceprint is both specific to the speaker and relatively stable, so voiceprint information is an important parameter for determining the source of speech.

Step 203, matching the voiceprint information with a preset voiceprint template, and determining source information corresponding to the voiceprint information, where the voiceprint template includes a corresponding relationship between the voiceprint information and the source information.

In this step, the preset voiceprint template may include voiceprint information recorded in advance by different sources, together with an established correspondence between the voiceprint information and the source information. The voiceprint information may be stored in the mobile terminal in the form of a voiceprint spectrum. By matching the extracted voiceprint information with the preset voiceprint template, the source information corresponding to the voiceprint information can be determined, so that in a subsequent step the target source information that needs to receive the target to-do task can be determined from the plurality of source information.

Step 204, converting the audio information into text information.
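Matching a voiceprint against the preset voiceprint template can be sketched as a nearest-neighbour lookup. The patent stores voiceprints as spectra and does not specify the matching rule, so the fixed-length feature vectors, the cosine similarity, the threshold of 0.8, and all names below are assumptions made purely for illustration.

```python
import math

# Hypothetical voiceprint template: source information -> enrolled feature vector.
VOICEPRINT_TEMPLATE = {
    "boss": [0.9, 0.1, 0.3],
    "employee": [0.2, 0.8, 0.5],
}


def cosine_similarity(a, b) -> float:
    """Cosine of the angle between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def match_source(voiceprint, template, threshold=0.8):
    """Return the best-matching source, or None if no source is similar enough."""
    best_source, best_score = None, -1.0
    for source, enrolled in template.items():
        score = cosine_similarity(voiceprint, enrolled)
        if score > best_score:
            best_source, best_score = source, score
    return best_source if best_score >= threshold else None


speaker = match_source([0.88, 0.12, 0.28], VOICEPRINT_TEMPLATE)
```

Returning `None` below the threshold lets the caller treat an unenrolled speaker as an unknown source instead of forcing a wrong attribution.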

The implementation manner of this step is similar to the implementation process of step 102 described above, and the embodiment of the present invention is not described in detail here.

Step 205, matching the text information with a preset first keyword template, and extracting at least one first keyword in the text information.

The implementation manner of this step is similar to the implementation process of step 103 described above, and the embodiment of the present invention is not described in detail here.

Step 206, determining the category of the first keyword, and filling the first keyword into a corresponding slot position in the task template to be handled to obtain the target task to be handled.

The implementation manner of this step is similar to the implementation process of step 104 described above, and the embodiment of the present invention is not described in detail here.

Step 207, obtaining at least one source information of the audio information through the voiceprint information in the audio information, and determining target source information corresponding to the target to-be-handled task according to the at least one source information and the first keyword.

In this step, the matching between the source information and the first keyword may specifically be matching between the source information and the category of the first keyword, and the target source information corresponding to the target to-be-handled task may be determined from the source information through the matching between the source information and the first keyword.

For example, referring to FIG. 2, an interface diagram of a task creation method provided by an embodiment of the present invention is shown, in which a boss assigns a task to an employee by voice and the dialog audio of the two people has been converted into corresponding pieces of text information: "Wang Er, help me book a car", "OK", "Where to?", "Nanjing Xinjiekou", "What time?", "Ten o'clock in the morning", and "No problem". Through voiceprint recognition, it can be determined that the source information corresponding to the audio "Wang Er, help me book a car", "Nanjing Xinjiekou", and "Ten o'clock in the morning" is the boss, and that the source information corresponding to the audio "OK", "Where to?", "What time?", and "No problem" is the employee. According to the meaning of the first keywords in the pieces of text information, the boss is determined to be the task initiator and the employee the task executor. The employee can therefore be determined as the target source information from the two pieces of source information, and after the target to-do task is created it is sent to the employee for execution.

Step 208, sending the target to-be-handled task to a target terminal corresponding to the target source information.
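One way to sketch the choice of target source information in step 207 is shown below. The rule used here (the speaker who names the executor is the initiator, and the executor is the source enrolled under that name, or the only other speaker) is an assumption for illustration; the patent only says that the source information is matched against the first keyword. The enrolled names and the function name are hypothetical.

```python
def determine_target_source(utterances, person_name, source_names):
    """Return (initiator, target_source) for a to-do task.

    utterances:   list of (source, text) pairs, labelled by voiceprint matching.
    person_name:  the first keyword of the person name category.
    source_names: mapping of source -> enrolled name (hypothetical data).
    """
    # The source whose utterance addresses the executor is taken as initiator.
    initiator = next((src for src, text in utterances if person_name in text), None)
    # Prefer a source enrolled under the executor's name.
    for source, name in source_names.items():
        if name == person_name and source != initiator:
            return initiator, source
    # Fall back to the only other speaker in a two-party conversation.
    others = {src for src, _ in utterances if src != initiator}
    if len(others) == 1:
        return initiator, others.pop()
    return initiator, None


utterances = [
    ("boss", "Wang Er, help me book a car"),
    ("employee", "OK"),
    ("employee", "Where to?"),
    ("boss", "Nanjing Xinjiekou"),
    ("employee", "What time?"),
    ("boss", "Ten o'clock in the morning"),
    ("employee", "No problem"),
]
initiator, target = determine_target_source(
    utterances, "Wang Er", {"employee": "Wang Er"}
)
```

With more than two speakers and no enrolled name match, the sketch returns `None` for the target rather than guessing.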

In this step, the mobile terminal may distribute the target to-do task to the target terminal corresponding to the target source information in the address book, the social software, the project management software, and the like, and perform task tracking.

Optionally, after step 206, the method may further include:

Step 209, adding a preset display effect to the first keyword in the slot of the target task to be handled.

And the display effects corresponding to the first keywords in different slot positions are different.

In this step, if the target to-do task is successfully created, a special display effect may be added to and displayed on the corresponding keyword to highlight the key content in the target to-do task, and the display effects corresponding to the first keywords in different slots are different. The display effects include, but are not limited to, a display effect on font size, a display effect on font color, and a display effect on font style.

For example, referring to FIG. 2, an interface diagram of a task creation method provided by an embodiment of the present invention is shown, in which a boss assigns a task to an employee by voice and the dialog audio of the two people has been converted into corresponding pieces of text information: "Wang Er, help me book a car", "OK", "Where to?", "Nanjing Xinjiekou", "What time?", "Ten o'clock in the morning", and "No problem". Once the target to-do task "Wang Er helps me at ten o'clock in the morning Nanjing Xinjiekou to carry out car booking" has been created, a first preset display effect may be applied to the keyword "Wang Er" in the {person name} slot, a second preset display effect to the keyword "ten o'clock in the morning" in the {time} slot, a third preset display effect to the keyword "Nanjing Xinjiekou" in the {place} slot, and a fourth preset display effect to the keyword "car booking" in the {task} slot.
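Attaching a different display effect to the keyword in each slot can be sketched by rendering the task as a list of styled text runs. The concrete effects, the run representation, and the names below are illustrative assumptions; the patent only requires that keywords in different slots receive different effects (font size, color, style, and so on).

```python
import re

# Hypothetical per-slot display effects.
DISPLAY_EFFECTS = {
    "person name": {"color": "red"},
    "time": {"color": "blue"},
    "place": {"color": "green"},
    "task": {"style": "bold"},
}

TEMPLATE = "{person name} helps me at {time} {place} to carry out {task}"


def render_runs(template: str, keywords: dict) -> list:
    """Split the filled task into (text, effect) runs; fixed corpora get no effect."""
    runs = []
    # re.split with a capturing group alternates plain text and slot names.
    for index, part in enumerate(re.split(r"\{([^{}]+)\}", template)):
        if index % 2 == 1:  # odd positions hold the captured slot categories
            runs.append((keywords[part], DISPLAY_EFFECTS.get(part)))
        elif part:
            runs.append((part, None))
    return runs


runs = render_runs(TEMPLATE, {
    "person name": "Wang Er",
    "time": "ten o'clock in the morning",
    "place": "Nanjing Xinjiekou",
    "task": "car booking",
})
```

A user interface layer could then map each run's effect onto its own text styling primitives.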

Optionally, the embodiment of the present invention may further include:

A1, obtaining a sample corpus text.

In the embodiment of the invention, the sample corpus text can be mined according to the user expression habits and the large data corpus, and the task template to be handled is constructed according to the sample corpus text, so that the matching accuracy of the task template to be handled is improved, and the accurate extraction of the keyword sentences is ensured.

The sample corpus text can be collected according to the expression habit of the user, and can also be collected through a large data corpus.

Optionally, step A1 may include:

A11, generating a query prompt aiming at the task template to be handled.

In this step, the mobile terminal may generate a query prompt for the to-do task template to guide the user to provide a corresponding sample corpus text according to the query prompt.

A12, receiving sample corpus text in response to the query prompt.

In this step, for example, the mobile terminal may generate a query prompt for the to-do task template of a car-booking service: "If you wanted someone to help you call a car, how would you say it?" It then receives the sample corpus texts that the user provides in response to the query prompt, such as "Someone, help me book a car to go somewhere" and "Please help me call a car".

A2, establishing the task template to be handled according to the sample corpus text.

According to the sample corpus text, certain keywords in the sample corpus text can be replaced by corresponding slot positions, and the fixed action corpus is connected with the slot positions to obtain the task template to be handled.

Optionally, step A2 may include:

A21, matching the sample corpus text with a corpus text in a preset corpus, and determining a weight value of the sample corpus text, wherein the preset corpus comprises a corresponding relation between the corpus text and the weight value.

In this step, the corpus text of the call or conference scene acquired through big data can be acquired, the user use times of each corpus text is counted, the weight is added to the corresponding corpus text according to the user use times, the corresponding relationship between the corpus text and the weight value is established, and the preset corpus is obtained.

By matching the sample corpus text with the corpus text in the preset corpus, the weighted value of the sample corpus text can be obtained.

A22, determining the target sample corpus text with the weight value greater than or equal to a preset threshold value.

In the embodiment of the invention, by determining the target sample corpus text with the weight value greater than or equal to the preset threshold value, the target sample corpus text with higher value can be screened out from the massive sample corpus text.
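Steps A21 and A22 can be sketched as a weight lookup followed by a threshold filter. The patent derives weights from how often users use each corpus text but does not give a formula, so the normalised usage share, the threshold of 0.25, and the sample data below are assumptions.

```python
from collections import Counter


def build_preset_corpus(usage_log):
    """Weight each corpus text by its share of all recorded uses (assumed rule)."""
    counts = Counter(usage_log)
    total = sum(counts.values())
    return {text: count / total for text, count in counts.items()}


def select_target_samples(samples, preset_corpus, threshold):
    """Keep the sample corpus texts whose weight meets the preset threshold."""
    return [s for s in samples if preset_corpus.get(s, 0.0) >= threshold]


# Hypothetical usage log collected from call or conference scenes.
usage_log = (
    ["someone, help me book a car to go somewhere"] * 6
    + ["please help me call a car"] * 3
    + ["car, now"] * 1
)
preset_corpus = build_preset_corpus(usage_log)
samples = [
    "someone, help me book a car to go somewhere",
    "please help me call a car",
    "car, now",
    "an expression nobody has used",
]
targets = select_target_samples(samples, preset_corpus, threshold=0.25)
```

A sample text that does not appear in the preset corpus gets a weight of zero here and is filtered out.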

Further, after the target sample corpus text with the weight value greater than or equal to the preset threshold value is determined, a dictionary of the professional field may be introduced to improve the expression recognition accuracy of the professional field, for example, in the movie field, corresponding keywords in the target sample corpus text may be corrected by the dictionary including the name of the movie and the name of an actor to improve the expression recognition accuracy of the movie field.

A23, matching the target sample corpus text with a preset second keyword template, and determining at least one second keyword in the target sample corpus text.

In this step, the target sample corpus text may be segmented into words, and at least one second keyword in the target sample corpus text may be determined by matching the segmented words with a preset second keyword template. The second keyword template may be the same template as the first keyword template, or a different one.

For example, from the target sample corpus text "someone, help me book a car to go somewhere", the keyword "someone" of the task executor category, the keyword "book a car" of the task execution action category, and the keyword "somewhere" of the place category may be extracted.

A24, replacing the second keyword with a slot corresponding to the category of the first keyword in the target sample corpus text, and determining other texts except the second keyword as the fixed action corpus to obtain the to-do task template.

In this step, referring to the example provided in step A23, the target sample corpus text "someone, help me book a car to go somewhere" contains the keyword "someone" of the task executor category, the keyword "book a car" of the task execution action category, and the keyword "somewhere" of the place category. In this text, the keyword "someone" is replaced by the slot {person name}, the keyword "book a car" by the slot {task}, and the keyword "somewhere" by the slot {place}. The remaining text, "help me" and "to go", is determined as the fixed action corpus, and the to-do task template "{person name}, help me {task} to go {place}" is obtained.
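Step A24 can be sketched as replacing each second keyword with a slot and treating whatever text remains as fixed action corpora. The English sample sentence, the category names, and the helper names are assumptions for illustration.

```python
import re


def build_template(sample: str, second_keywords) -> str:
    """Replace each (keyword, category) pair with a {category} slot, first hit only."""
    template = sample
    for keyword, category in second_keywords:
        template = template.replace(keyword, "{" + category + "}", 1)
    return template


def fixed_action_corpora(template: str) -> list:
    """Return the text between slots, trimmed of spaces and commas."""
    parts = re.split(r"\{[^{}]+\}", template)
    return [part.strip(" ,") for part in parts if part.strip(" ,")]


sample = "someone, help me book a car to go somewhere"
second_keywords = [
    ("someone", "person name"),
    ("book a car", "task"),
    ("somewhere", "place"),
]
template = build_template(sample, second_keywords)
corpora = fixed_action_corpora(template)
```

Plain string replacement is enough for this example; a more careful version would substitute by token position so that a keyword can never match text inside an already inserted slot name.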

In summary, another task creating method provided in the embodiment of the present invention includes: acquiring audio information; converting the audio information into text information; matching the text information with a preset first keyword template, and extracting at least one first keyword in the text information; determining the category of the first keyword, and filling the first keyword into a slot position corresponding to the category of the first keyword in the task template to be handled to obtain a target task to be handled; the task template to be handled comprises the fixed action corpus and at least one slot position corresponding to the category of the first keyword, and the slot position is connected with the fixed action corpus.
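The final filling step summarized above can be sketched as follows, assuming the first keywords have already been extracted and categorized. The template string, the sample keywords, and the names `create_todo` and `_KeepMissing` are illustrative assumptions; slots with no matching keyword are left in place rather than raising an error, which is a design choice of this sketch and not something stated in the source.

```python
class _KeepMissing(dict):
    """Mapping that leaves a slot unchanged when no keyword of its category exists."""

    def __missing__(self, key):
        return "{" + key + "}"


def create_todo(template, first_keywords):
    """Fill each {category} slot with the first keyword of that category."""
    return template.format_map(_KeepMissing(first_keywords))


# Hypothetical to-do task template: fixed action corpus plus three slots.
TODO_TEMPLATE = "{name}, help me {task} to go to {place}"

print(create_todo(TODO_TEMPLATE, {"name": "Alice", "task": "call a taxi", "place": "the airport"}))
# Alice, help me call a taxi to go to the airport
print(create_todo(TODO_TEMPLATE, {"name": "Alice"}))
# Alice, help me {task} to go to {place}
```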

Fig. 6 is a block diagram of a mobile terminal according to an embodiment of the present invention, and as shown in fig. 6, the mobile terminal 30 includes:

a first obtaining module 301, configured to obtain audio information;

a conversion module 302, configured to convert the audio information into text information;

the first extraction module 303 is configured to match the text information with a preset first keyword template, and extract at least one first keyword in the text information;

the filling module 304 is configured to determine the category of the first keyword, and fill the first keyword into a corresponding slot in the task template to be handled to obtain a target task to be handled;

Optionally, the mobile terminal 30 further includes:

the source person information determining module is used for obtaining at least one piece of source person information of the audio information through voiceprint information in the audio information, and determining target source person information corresponding to the target task to be handled according to the at least one piece of source person information and the first keyword;

and the sending module is used for sending the target to-be-handled task to the target terminal corresponding to the target source person information.

The generating module is used for generating inquiry prompts aiming at the task templates to be handled;

and the receiving response module is used for receiving the sample corpus text responding to the inquiry prompt.

The text matching module is used for matching the sample corpus text with the corpus text in a preset corpus and determining a weight value of the sample corpus text, wherein the preset corpus comprises a corresponding relation between the corpus text and the weight value;

the weight determining module is used for determining the target sample corpus text with the weight value larger than or equal to a preset threshold value;

the template matching module is used for matching the target sample corpus text with a preset second keyword template and determining at least one second keyword in the target sample corpus text;

and the replacing module is used for replacing the second keywords with slot positions corresponding to the category of the first keywords in the target sample corpus text, and determining other texts except the second keywords as the fixed action corpus to obtain the task template to be handled.

The display module is used for adding a preset display effect to the first key word in the slot position of the target task to be handled;
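The work of the text matching module and the weight determining module listed above can be sketched as follows. The source only states that the preset corpus holds a correspondence between corpus texts and weight values, so this sketch assumes an exact-match lookup, a weight of 0.0 for unknown texts, and made-up weights; `PRESET_CORPUS` and `select_target_samples` are hypothetical names.

```python
# Hypothetical preset corpus: corpus text -> weight value (illustrative numbers).
PRESET_CORPUS = {
    "someone, help me make a car to go somewhere": 0.9,
    "nice weather today": 0.1,
}


def select_target_samples(samples, corpus, threshold):
    """Keep the sample corpus texts whose weight is >= threshold."""
    targets = []
    for text in samples:
        weight = corpus.get(text, 0.0)  # assumption: unmatched texts get weight 0
        if weight >= threshold:
            targets.append(text)
    return targets


samples = [
    "someone, help me make a car to go somewhere",
    "nice weather today",
    "an unseen sentence",
]
print(select_target_samples(samples, PRESET_CORPUS, threshold=0.5))
# ['someone, help me make a car to go somewhere']
```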

In summary, the mobile terminal provided in the embodiment of the present invention obtains audio information; converts the audio information into text information; matches the text information with a preset first keyword template, and extracts at least one first keyword in the text information; and determines the category of the first keyword and fills the first keyword into a corresponding slot position in the task template to be handled to obtain a target task to be handled. The task template to be handled comprises fixed action corpora and at least one slot position, the slot position is connected with the fixed action corpora, and the slot position corresponds to the category of the first keyword. In the invention, the audio information spoken by the user can be subjected to real-time text conversion and template matching in scenes such as conversation or conference, and the keywords related to the target task to be handled in the user voice are quickly filled into the corresponding slot positions in the task template to be handled to obtain the target task to be handled, so that the task to be handled is quickly and efficiently created, the labor and the time are saved, and the efficiency is improved.

The embodiment of the present invention further provides a mobile terminal, which includes a processor, a memory, and a computer program stored in the memory and capable of running on the processor, where the computer program, when executed by the processor, implements each process of the task creation method embodiment, and can achieve the same technical effect, and is not described herein again to avoid repetition.

The embodiment of the present invention further provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the computer program implements each process of the task creation method embodiment, and can achieve the same technical effect, and in order to avoid repetition, details are not repeated here. The computer-readable storage medium may be a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk.

Fig. 7 is a block diagram of a mobile terminal according to another embodiment of the present invention. The mobile terminal 500 shown in fig. 7 includes: at least one processor 501, memory 502, at least one network interface 504, a user interface 503, and a camera 506. The various components in the mobile terminal 500 are coupled together by a bus system 505. It is understood that the bus system 505 is used to enable connection communications between these components. The bus system 505 includes a power bus, a control bus, and a status signal bus in addition to a data bus. For clarity of illustration, however, the various buses are labeled as bus system 505 in FIG. 7.

The user interface 503 may include a display, a keyboard, or a pointing device (e.g., a mouse, trackball, touch pad, or flexible screen).

It is to be understood that the memory 502 in embodiments of the present invention may be either volatile memory or non-volatile memory, or may include both volatile and non-volatile memory. The non-volatile memory may be a Read-Only Memory (ROM), a Programmable ROM (PROM), an Erasable PROM (EPROM), an Electrically Erasable PROM (EEPROM), or a flash memory. The volatile memory may be a Random Access Memory (RAM), which functions as an external cache. By way of example, but not limitation, many forms of RAM are available, such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Data Rate SDRAM (DDR SDRAM), Enhanced SDRAM (ESDRAM), Synchronous Link DRAM (SLDRAM), and Direct Rambus RAM (DRRAM). The memory 502 of the systems and methods described in connection with the embodiments of the invention is intended to comprise, without being limited to, these and any other suitable types of memory.

In some embodiments, the memory 502 stores the following elements, namely executable modules or data structures, or a subset thereof, or an extended set thereof: an operating system 5021 and application programs 5022.

The operating system 5021 includes various system programs, such as a framework layer, a core library layer, and a driver layer, and is used for implementing various basic services and processing hardware-based tasks. The application programs 5022 include various applications, such as a media player (MediaPlayer) and a browser (Browser), for implementing various application services. The program for implementing the method according to the embodiment of the present invention may be included in the application programs 5022.

In the embodiment of the present invention, the processor 501 is configured to obtain audio information by calling a program or an instruction stored in the memory 502, specifically, a program or an instruction stored in the application 5022; converting the audio information into text information; matching the text information with a preset first keyword template, and extracting at least one first keyword in the text information; determining the category of the first keyword, and filling the first keyword into a slot position corresponding to the category of the first keyword in the task template to be handled to obtain a target task to be handled; the task template to be handled comprises a fixed action corpus and at least one slot position corresponding to the category of the first keyword, and the slot position is connected with the fixed action corpus.

The method disclosed in the above embodiments of the present invention may be applied to the processor 501 or implemented by the processor 501. The processor 501 may be an integrated circuit chip having signal processing capabilities. In implementation, the steps of the above method may be performed by integrated logic circuits of hardware in the processor 501 or by instructions in the form of software. The processor 501 may be a general-purpose processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, or a discrete hardware component, and may implement or perform the methods, steps and logic blocks disclosed in the embodiments of the present invention. A general-purpose processor may be a microprocessor, or the processor may be any conventional processor. The steps of the method disclosed in connection with the embodiments of the present invention may be directly performed by a hardware decoding processor, or performed by a combination of hardware and software modules in the decoding processor. The software module may be located in a storage medium well known in the art, such as RAM, flash memory, ROM, PROM, EPROM, or registers. The storage medium is located in the memory 502, and the processor 501 reads the information in the memory 502 and completes the steps of the method in combination with its hardware.

It is to be understood that the embodiments described herein may be implemented in hardware, software, firmware, middleware, microcode, or any combination thereof. For a hardware implementation, the processing units may be implemented within one or more Application Specific Integrated Circuits (ASICs), Digital Signal Processors (DSPs), Digital Signal Processing Devices (DSPDs), Programmable Logic Devices (PLDs), Field Programmable Gate Arrays (FPGAs), general purpose processors, controllers, micro-controllers, microprocessors, other electronic units configured to perform the functions described herein, or a combination thereof.

For a software implementation, the techniques described in this disclosure may be implemented with modules (e.g., procedures, functions, and so on) that perform the functions described in this disclosure. The software codes may be stored in a memory and executed by a processor. The memory may be implemented within the processor or external to the processor.

The mobile terminal 500 can implement the processes implemented by the mobile terminal in the foregoing embodiments, and in order to avoid repetition, the detailed description is omitted here.

In the embodiment of the present invention, the mobile terminal 500 may obtain audio information; converting the audio information into text information; matching the text information with a preset first keyword template, and extracting at least one first keyword in the text information; determining the category of the first keyword, and filling the first keyword into a corresponding slot position in the task template to be handled to obtain a target task to be handled; the task template to be handled comprises fixed action corpora and at least one slot position, the slot position is connected with the fixed action corpora, and the slot position corresponds to the category of the first keyword. In the invention, the audio information spoken by the user can be subjected to real-time text conversion and template matching in scenes such as conversation or conference, and the keywords related to the target task to be handled in the user voice are quickly filled into the corresponding slot positions in the task template to be handled to obtain the target task to be handled, so that the task to be handled is quickly and efficiently created, the labor and the time are saved, and the efficiency is improved.

The mobile terminal includes: a processor, a memory and a computer program stored on the memory and executable on the processor, the computer program, when executed by the processor, implementing the steps of the task creation method described above.

The mobile terminal further includes a readable storage medium on which a computer program is stored, and the computer program, when executed by a processor, implements the steps of the task creation method described above.

Specifically, the mobile terminal 600 in fig. 8 may be a mobile phone, a tablet computer, a Personal Digital Assistant (PDA), or a vehicle-mounted computer.

The mobile terminal 600 in Fig. 8 includes a Radio Frequency (RF) circuit 610, a memory 620, an input unit 630, a display unit 640, a processor 660, an audio circuit 670, a wireless fidelity (WiFi) module 680, a power supply 690, and a camera 6110.

The input unit 630 may be used to receive numeric or character information input by a user and to generate signal inputs related to user settings and function control of the mobile terminal 600. Specifically, in the embodiment of the present invention, the input unit 630 may include a touch panel 631. The touch panel 631 may collect touch operations performed by a user on or near it (e.g., operations performed by the user on the touch panel 631 with any suitable object or accessory such as a finger or a stylus), and drive the corresponding connection device according to a preset program. Optionally, the touch panel 631 may include two parts: a touch detection device and a touch controller. The touch detection device detects the touch orientation of the user, detects the signal produced by the touch operation, and transmits the signal to the touch controller. The touch controller receives touch information from the touch detection device, converts it into touch point coordinates, sends the coordinates to the processor 660, and can receive and execute commands sent by the processor 660. In addition, the touch panel 631 may be implemented in various types, such as resistive, capacitive, infrared, and surface acoustic wave. In addition to the touch panel 631, the input unit 630 may also include other input devices 632, which may include, but are not limited to, one or more of a physical keyboard, function keys (such as volume control keys and switch keys), a trackball, a mouse, and a joystick.

The display unit 640 may be used to display information input by the user or information provided to the user, as well as the various menu interfaces of the mobile terminal 600. The display unit 640 may include a display panel 641, which may optionally be configured as a liquid crystal display (LCD) or an organic light-emitting diode (OLED) display.

It should be noted that the touch panel 631 may cover the display panel 641 to form a touch display screen. When the touch display screen detects a touch operation on or near it, the operation is transmitted to the processor 660 to determine the type of the touch event, and the processor 660 then provides a corresponding visual output on the touch display screen according to the type of the touch event.

The touch display screen comprises an application program interface display area and a common control display area. The arrangement modes of the application program interface display area and the common control display area are not limited, and can be an arrangement mode which can distinguish two display areas, such as vertical arrangement, left-right arrangement and the like. The application interface display area may be used to display an interface of an application. Each interface may contain at least one interface element such as an icon and/or widget desktop control for an application. The application interface display area may also be an empty interface that does not contain any content. The common control display area is used for displaying controls with high utilization rate, such as application icons like setting buttons, interface numbers, scroll bars, phone book icons and the like.

The processor 660 is a control center of the mobile terminal 600, connects various parts of the entire mobile phone by using various interfaces and lines, and performs various functions of the mobile terminal 600 and processes data by operating or executing software programs and/or modules stored in the first memory 621 and calling data stored in the second memory 622, thereby integrally monitoring the mobile terminal 600. Optionally, processor 660 may include one or more processing units.

In the embodiment of the present invention, the processor 660 is configured to obtain the audio information by calling the software program and/or module stored in the first memory 621 and/or the data stored in the second memory 622; converting the audio information into text information; matching the text information with a preset first keyword template, and extracting at least one first keyword in the text information; determining the category of the first keyword, and filling the first keyword into a corresponding slot position in the task template to be handled to obtain a target task to be handled; the task template to be handled comprises fixed action corpora and at least one slot position, the slot position is connected with the fixed action corpora, and the slot position corresponds to the category of the first keyword.

It can be seen that, in the embodiment of the present invention, the mobile terminal may perform the following: acquiring audio information; converting the audio information into text information; matching the text information with a preset first keyword template, and extracting at least one first keyword in the text information; determining the category of the first keyword, and filling the first keyword into a slot position corresponding to the category of the first keyword in the task template to be handled to obtain a target task to be handled. The task template to be handled comprises fixed action corpora and at least one slot position corresponding to the category of the first keyword, and the slot position is connected with the fixed action corpora.

For the above device embodiment, since it is basically similar to the method embodiment, the description is relatively simple, and for the relevant points, refer to the partial description of the method embodiment.

The embodiments in the present specification are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other.

As is readily imaginable to the person skilled in the art: any combination of the above embodiments is possible, and thus any combination between the above embodiments is an embodiment of the present invention, but the present disclosure is not necessarily detailed herein for reasons of space.

The task creation methods provided herein are not inherently related to any particular computer, virtual system, or other apparatus. Various general purpose systems may also be used with the teachings herein. The structure required to construct a system incorporating aspects of the present invention will be apparent from the description above. Moreover, the present invention is not directed to any particular programming language. It is appreciated that a variety of programming languages may be used to implement the teachings of the present invention as described herein, and any descriptions of specific languages are provided above to disclose the best mode of the invention.

In the description provided herein, numerous specific details are set forth. It is understood, however, that embodiments of the invention may be practiced without these specific details. In some instances, well-known methods, structures and techniques have not been shown in detail in order not to obscure an understanding of this description.

Similarly, it should be appreciated that in the foregoing description of exemplary embodiments of the invention, various features of the invention are sometimes grouped together in a single embodiment, figure, or description thereof for the purpose of streamlining the disclosure and aiding in the understanding of one or more of the various inventive aspects. However, the disclosed method should not be interpreted as reflecting an intention that the invention as claimed requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in less than all features of a single foregoing disclosed embodiment. Thus, the claims following the detailed description are hereby expressly incorporated into this detailed description, with each claim standing on its own as a separate embodiment of this invention.

Those skilled in the art will appreciate that the modules in the device in an embodiment may be adaptively changed and disposed in one or more devices different from the embodiment. The modules or units or components of the embodiments may be combined into one module or unit or component, and furthermore they may be divided into a plurality of sub-modules or sub-units or sub-components. All of the features disclosed in this specification (including any accompanying claims, abstract and drawings), and all of the processes or elements of any method or apparatus so disclosed, may be combined in any combination, except combinations where at least some of such features and/or processes or elements are mutually exclusive. Each feature disclosed in this specification (including any accompanying claims, abstract and drawings) may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise.

Furthermore, those skilled in the art will appreciate that while some embodiments described herein include some features included in other embodiments, rather than other features, combinations of features of different embodiments are meant to be within the scope of the invention and form different embodiments. For example, in the claims, any of the claimed embodiments may be used in any combination.

The various component embodiments of the invention may be implemented in hardware, or in software modules running on one or more processors, or in a combination thereof. Those skilled in the art will appreciate that a microprocessor or Digital Signal Processor (DSP) may be used in practice to implement some or all of the functionality of some or all of the components of the method according to embodiments of the invention. The present invention may also be embodied as apparatus or device programs (e.g., computer programs and computer program products) for performing a portion or all of the methods described herein. Such programs implementing the present invention may be stored on computer-readable media or may be in the form of one or more signals. Such a signal may be downloaded from an internet website or provided on a carrier signal or in any other form.

It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The invention may be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In the unit claims enumerating several means, several of these means may be embodied by one and the same item of hardware. The use of the words first, second, third, etc. does not indicate any ordering. These words may be interpreted as names.

Claims

1. A task creation method is applied to a mobile terminal, and is characterized by comprising the following steps:

acquiring audio information;

converting the audio information into text information;

the task template to be handled comprises a fixed action corpus and at least one slot position, the slot position is connected with the fixed action corpus, and the slot position corresponds to the category of the first keyword;

wherein the method further comprises:

generating a query prompt aiming at the task template to be handled;

receiving a sample corpus text in response to the query prompt;

matching the sample corpus text with corpus texts in a preset corpus to determine a weight value of the sample corpus text, wherein the preset corpus comprises a corresponding relation between the corpus text and the weight value;

determining a target sample corpus text with the weight value larger than or equal to a preset threshold value;

matching the target sample corpus text with a preset second keyword template, and determining at least one second keyword in the target sample corpus text;

and replacing the second keywords with slot positions corresponding to the category of the first keywords in the target sample corpus text, and determining other texts except the second keywords as the fixed action corpus to obtain the to-be-handled task template.

2. The method of claim 1, wherein after determining the category of the first keyword and filling the first keyword into a corresponding slot in a task to be handled template to obtain a target task to be handled, the method further comprises:

obtaining at least one piece of source person information of the audio information through voiceprint information in the audio information;

determining target source person information corresponding to the target to-be-handled task according to the at least one piece of source person information and the first keyword;

and sending the target to-be-handled task to a target terminal corresponding to the target source person information.

3. The method of claim 1, wherein after determining the category of the first keyword and filling the first keyword into a corresponding slot in a task to be handled template to obtain a target task to be handled, the method further comprises:

adding a preset display effect to the first keyword in the slot position of the target task to be handled;

4. A mobile terminal, characterized in that the mobile terminal comprises:

the first acquisition module is used for acquiring audio information;

wherein the mobile terminal further comprises:

a receiving response module for receiving the sample corpus text responding to the inquiry prompt;

5. The mobile terminal of claim 4, wherein the mobile terminal further comprises:

6. The mobile terminal of claim 4, wherein the mobile terminal further comprises:

7. A mobile terminal, characterized in that it comprises a processor, a memory and a computer program stored on said memory and executable on said processor, said computer program, when executed by said processor, implementing the steps of the task creation method according to any one of claims 1 to 3.

8. A computer-readable storage medium, characterized in that a computer program is stored on the computer-readable storage medium, which computer program, when being executed by a processor, carries out the steps of the task creation method according to any one of claims 1 to 3.