WO2020232617A1 - 语音信息处理方法、装置、电子设备以及存储介质 - Google Patents
语音信息处理方法、装置、电子设备以及存储介质 Download PDFInfo
- Publication number
- WO2020232617A1 WO2020232617A1 PCT/CN2019/087667 CN2019087667W WO2020232617A1 WO 2020232617 A1 WO2020232617 A1 WO 2020232617A1 CN 2019087667 W CN2019087667 W CN 2019087667W WO 2020232617 A1 WO2020232617 A1 WO 2020232617A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- information
- takeaway
- merchant
- voice
- order
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G06T13/80—2D [Two Dimensional] animation, e.g. using sprites
Definitions
- This application relates to the field of Internet of Things, and more specifically, to a voice information processing method, device, electronic equipment, and storage medium.
- voice assistants To interact with users of electronic devices. In the process of interacting with voice assistants, users can use voice assistants to complete some operations. However, related Voice assistants have not yet been involved in the food delivery field.
- this application proposes a voice information processing method, device, electronic equipment, and storage medium to improve the foregoing problems.
- the present application provides a voice information processing method, which is applied to a voice assistant, and the method includes: the voice assistant starts to receive voice information after being started; when the voice information is received, the received voice information The voice information is recognized; if it is recognized that the received voice information includes takeaway-related information, the takeaway merchant information is obtained according to the specified rules; the card is displayed, and the takeaway merchant information is displayed in the card; The target merchant information determined in the takeaway merchant information generates an order.
- the present application provides a voice information processing device.
- the device includes: a voice information receiving unit for receiving voice information after the voice information processing device is started; a voice recognition unit for receiving voice After the information, the received voice information is recognized; the takeaway information acquisition unit is configured to obtain takeaway merchant information according to specified rules if it is recognized that the received voice information includes takeaway related information; information display unit , Used to display a card, and display the takeaway merchant information in the card; an order generation unit, used to generate an order based on the target merchant information determined from the takeaway merchant information.
- this application provides an electronic device including one or more processors and a memory; one or more programs, wherein the one or more programs are stored in the memory and configured to be The one or more processors execute the above-mentioned methods.
- the present application provides a computer-readable storage medium having program code stored in the computer-readable storage medium, wherein the above-mentioned method is executed when the program code is running, and control confusion can also be avoided.
- the voice information processing method, device, electronic equipment, and storage medium provided by the present application start to receive voice information after the voice assistant is started, and when the voice assistant receives the voice information, it recognizes the received voice information , If it is recognized that the received voice information includes takeaway-related information, the takeaway merchant information is obtained according to the specified rules, and then the card is displayed, and the takeaway merchant information is displayed in the card, and based on the information from the takeaway The target merchant information determined in the merchant information generates an order.
- the user can trigger the display of takeaway related information through voice, and can directly complete the ordering operation in the voice assistant, and there is no need to start the target client for ordering takeaway separately, and then pass Swipe the page multiple times to find the takeaway you need, which greatly improves the user experience.
- Fig. 1 shows a flowchart of a voice information processing method proposed in an embodiment of the present application
- Fig. 2 shows a schematic diagram of a voice information collection interface proposed by an embodiment of the present application
- FIG. 3 shows a schematic diagram of a takeaway voice setting interface proposed in an embodiment of the present application
- FIG. 4 shows a schematic diagram of a result of adding a takeaway voice setting interface according to an embodiment of the present application
- FIG. 5 shows a schematic diagram of a voice assistant communicating with a server of a target takeaway client according to an embodiment of the present application
- Fig. 6 shows a schematic diagram of a card proposed in an embodiment of the present application
- FIG. 7 shows a schematic diagram of a multi-card display proposed in an embodiment of the present application.
- FIG. 8 shows a schematic diagram of a card switching proposed in an embodiment of the present application.
- FIG. 9 shows a schematic diagram of a card displaying detailed information of a merchant according to an embodiment of the present application.
- FIG. 10 shows a flowchart of a voice information processing method proposed by another embodiment of the present application.
- FIG. 11 shows a flowchart of a voice information processing method proposed by still another embodiment of this application.
- FIG. 12 shows a structural block diagram of a voice information processing device proposed by an embodiment of the present application.
- FIG. 13 shows a structural block diagram of a voice information processing device proposed by another embodiment of the present application.
- FIG. 14 shows a structural block diagram of a voice information processing device proposed by still another embodiment of the present application.
- FIG. 15 shows a structural block diagram of a voice information processing device proposed by another embodiment of the present application.
- FIG. 16 shows a structural block diagram of an electronic device of the present application for executing the voice information processing method according to an embodiment of the present application
- Fig. 17 is a storage unit for storing or carrying program codes for implementing the voice information processing method according to the embodiment of the present application.
- voice assistants For example, Apple's Siri, Samsung's Bixby, Google Assistant, Amazon Alex, etc.
- the voice assistant can be regarded as an intelligent application. The user can help the user solve some practical problems or replace the user in operating the electronic device through the intelligent interaction of the intelligent dialogue with the voice assistant and the instant question and answer.
- the voice assistant of an electronic device when the voice assistant of an electronic device detects that the user input "Help me open Baidu map”, the electronic device can recognize that the user's intention is to use the Baidu map application, so that it can start Baidu The map starts.
- the voice assistant of the electronic device when the voice assistant of the electronic device detects that the user input "Where is there a parking lot nearby”, the electronic device can recognize that the user is looking for a parking lot within a certain range, then the electronic device can be based on the location Search for parking lots and display the search results.
- the relevant voice assistant recognizes the user’s input to order takeaway or takeaway, it either uses "order takeaway" or "takeaway” as keywords to search for resources in a similar manner to a search engine, and then displays the searched The textual information of, or it may be directly searched for nearby restaurants for display.
- the response of the related voice assistant to the food-related information is not the real intention of the user, and thus cannot improve the user's experience and cannot meet the actual needs of the user.
- the inventor also found that in the process of directly using relevant take-out software to place a take-out order, users need to repeatedly slide the page to find the goods they need, which in turn will also cause a poor user experience.
- the inventor proposes a voice information processing method, device, electronic device, and storage medium that can improve the above-mentioned problems in this application.
- the user can trigger the display of takeaway related information through voice, and can directly complete the order placement operation in the voice assistant, and there is no need to separately start the target client for ordering takeaway , And then through multiple page swipes to find the takeaway you need, which greatly improves the user experience.
- a voice information processing method provided by an embodiment of the present application is applied to a voice assistant, and the method includes:
- Step S110 The voice assistant starts to receive voice information after being started.
- the voice assistant may be an application program that runs independently in an electronic device. It can also be a component configured in a certain application. During the user's use, the user can trigger the activation of the voice assistant by touching the physical buttons of the electronic device, or trigger the activation of the voice assistant by touching the virtual buttons displayed on the electronic device.
- the electronic device when the electronic device is equipped with the HOME button, the electronic device can be configured in advance to trigger the associated target application to start by long pressing the HOME button or double-clicking the HOME button, or long pressing the HOME button or double-clicking the HOME button
- the electronic device In the case of triggering a component in the associated target application to start, configure the associated target application or a component in the target application as a voice assistant, so that the voice can be triggered by long pressing the HOME button or double-clicking the HOME button The assistant starts.
- the electronic device can be configured with an entrance that triggers the activation of the voice assistant on the desktop of the system or in an application.
- the portal can be a desktop application icon named as the voice assistant, and when an portal that triggers the voice assistant to start is configured in a certain application, the portal can be a certain A control named Voice Assistant, for example, a text control or a button control.
- the electronic device can display the interface shown in Figure 2. After the interface is displayed, the electronic device will trigger the configured microphone or other physical voice collection components to start sound collection, so that the activated voice The assistant can obtain the collected voice information.
- Step S120 After receiving the voice information, recognize the received voice information.
- the voice information received by the voice assistant is still a voice signal.
- the voice assistant also needs to convert the voice information in the form of voice signal into the voice information in text form, and this conversion process is to recognize the received voice information the process of.
- the voice assistant has multiple ways to realize the recognition of the received voice information.
- the API (Application Programming Interface) of the third-party speech recognition system can be pre-configured.
- the voice assistant can wait for the third-party speech recognition system based on the pre-configured API of the third-party speech recognition system.
- the recognized voice signal is transmitted to a third party for recognition, and then the voice information in text form returned by the third party is received.
- the API of a third-party speech recognition system provided by Microsoft or Google can be used.
- the neural network model can be trained in advance to obtain a model capable of converting voice information in the form of sound signals into voice information in text form. Then the trained model is deployed in a designated server or service cluster, and the voice assistant can transmit the received voice information in the form of voice signals to the process of recognizing the received voice information.
- the server or service cluster performs recognition, and then receives the recognized text voice information returned by the server or service cluster.
- the voice assistant can call the API of a third-party voice recognition system for recognition, and transmit it to a designated server or service cluster for recognition for real-time selection, so as to enhance the flexibility of recognition.
- the voice assistant can determine which form of recognition to perform according to the current network status. It is understandable that the communication process will be more stable when directly connected to the network through the WIFI hotspot than through the mobile communication base station. Then the voice assistant can recognize that the current access to the network is through WIFI, Call the API of the third-party speech recognition system for recognition, and when it is recognized that the mobile communication base station is currently connected to the network, it is transmitted to the designated server or service cluster for recognition.
- the voice assistant can also switch the recognition mode based on the recognition success rate. It is understandable that the pronunciation habits or speaking styles of different users will be different, so even for the same recognition method, there will be different recognition results due to the user's own pronunciation habits or speaking styles. Then in this case, the voice assistant can switch the recognition mode according to the user's own habits and combined with the recognition success rate. Among them, the voice assistant can determine that the recognition fails when it detects that the user has repeatedly recognized similar sound signals. Then the voice assistant can switch to another recognition method after recognizing a specified number of failures using a certain recognition method.
- the voice assistant starts to use the API of a third-party voice recognition system for recognition, and detects that it has recognized three similar sound signals in a row (the specified number of times is 3), then it is determined to switch to transmission to the server or service Identify in the cluster. It should be noted that similar sound signals can be judged by judging the error energy of the two sound signals.
- Step S130 If it is recognized that the received voice information does not include takeaway related information, the current process is ended.
- Step S131 If it is recognized that the received voice information includes takeaway-related information, the takeaway merchant information is obtained according to a specified rule.
- the voice assistant will only trigger to obtain the information of the takeaway merchant after it recognizes some specific information.
- the specific information includes takeaway-related information, which can include "I want to order takeaway", "order takeaway” or "takeaway”.
- takeaway-related information can include "I want to order takeaway", "order takeaway” or "takeaway”.
- the voice assistant recognizes that the received voice information includes takeaway-related information, it will determine that the user wants to order takeaway, and then go to obtain takeaway merchant information.
- the take-out language setting interface 98 shown in FIG. 3 includes a take-out related information addition control 97 and a display control 96 to which take-out related information has been added. Furthermore, after the user clicks the takeaway related information adding control 97, new takeaway related information can be added. And in the adding process, in addition to adding in the form of text, it can also be added in the form of sound signal at the same time.
- the takeaway-related information addition control 97 to trigger the user to enter the takeaway-related information in the form of voice
- the language assistant can directly compare the received voice signal with the pre-stored voice signal in order to improve the final feedback effect. It is understandable that the voice signal is compared In the signal process, the aforementioned method of comparing the signal similarity can be used to determine whether the received sound signal has a similar match with the pre-stored sound signal. If so, you can directly obtain the takeaway merchant information without further follow-up Language to text conversion recognition.
- the language assistant has multiple ways to obtain the information of the takeaway merchant.
- the language assistant can obtain takeaway merchant information based on the data interface with the target takeaway client.
- the language assistant will send an information acquisition request to the target takeaway client, and the information acquisition request is used to trigger the target takeaway client to obtain takeaway merchant information according to specified rules; and receive the return from the target client Of takeaway merchant information.
- the voice assistant will first use the communication channel 95 to send the information acquisition request to the target takeaway client.
- the target takeaway client can first query whether there is cached takeaway merchant information locally. Return directly to the voice assistant.
- the target takeaway client is a client that can be used to independently generate takeaway orders and place orders. Similar to Meituan Takeaway Client or Ele.me Client.
- the voice assistant directly sends an information acquisition request to the server through the communication channel 93, and then returns the takeaway merchant information to the voice assistant through the communication channel 93.
- the language assistant can forward the information acquisition request to the server through the target takeaway client, and the server directly returns the takeaway merchant information to the voice assistant without converting it through the target takeaway client. Then increase the information transmission rate.
- the application program is distinguished by the port number occupied by the application program. For example, the browser client occupies port 80 and the voice assistant occupies port 8080. If the returned information points to port 8080, the electronic device will know that this information is returned to the voice assistant.
- the voice assistant adds the port number occupied by the voice assistant to the generated information acquisition request, and then when the server generates the returned information, the voice assistant’s port number is added to the returned information so that the information can be Is sent directly to the voice assistant.
- the specified rule described in this embodiment may include obtaining the takeaway merchant information of the takeaway merchant closest to the user’s current location based on multiple dimensions, or it may be that the delivery range matches the user and has a high degree of praise.
- the takeaway merchant information of the several takeaway merchants in may also be the takeaway merchant information of the takeaway merchants that directly adapt to the user's history of ordering goods, of course, it may also be the takeaway merchant information after the above-mentioned rules are mixed.
- the specific specified rule is not specifically limited, and one or more of the foregoing multiple rules may be used in combination.
- the specified rules used can also be changed periodically.
- Step S140 Display a card, and display the takeaway merchant information in the card.
- the takeaway merchant information can be displayed in the card 92 as shown in FIG. 6.
- the takeaway merchant information may include the name of the takeaway merchant, scoring information, the starting price of sales, and the type of goods.
- the voice assistant can detect the quantity of the takeaway merchant information
- each card corresponds to displaying one takeaway merchant information.
- three cards 92a, 92b, and 92c can be displayed respectively to display the three takeaway merchant information.
- only part of the takeaway merchant information can be displayed first, and then after detecting that the user slides toward the upper side of the screen, load more merchant information for display. And at the same time hide part of the takeaway merchant information that has been displayed before. For example, in the interface shown in FIG. 8, after detecting that the user slides the screen in the direction indicated by the arrow, the card 92a ranked closest to the direction indicated by the arrow will be hidden, and a new card 92e will be loaded for display.
- Step S150 Generate an order based on the target merchant information determined from the takeaway merchant information.
- the user can further manipulate, so that the voice assistant can display more information so that the user can select the desired product to place an order.
- the voice assistant may send the target merchant information to the target client after acquiring the target merchant information selected by the user, and the target merchant information is used to trigger the target client to generate order information , And display the generated order information. It is understandable that in this manner, the voice assistant will trigger the target client to switch to the foreground display, and then trigger the target client to generate order information and display the generated order information.
- the method may further include: after detecting the payment of the order, sending the order to the server of the target client, and receiving the order execution status returned by the server.
- the voice assistant supports invoking a third-party payment service provider for payment, for example, WeChat payment or Alipay payment, and can also support invoking a payment server configured corresponding to the voice assistant itself for payment.
- the step of generating an order based on the target merchant information determined from the takeaway merchant information includes: generating an order based on the target product information determined from the product information of the target merchant.
- the voice information processing method provided by the present application starts to receive voice information after the voice assistant is started, and when the voice assistant receives the voice information, it recognizes the received voice information, if the received voice information is recognized
- the voice information includes takeaway-related information
- obtain takeaway merchant information according to the specified rules then display the card, and display the takeaway merchant information in the card, and based on the target merchant information determined from the takeaway merchant information Generate orders.
- the user can trigger the display of takeaway related information through voice, and can directly complete the ordering operation in the voice assistant, and there is no need to start the target client for ordering takeaway separately, and then pass Swipe the page multiple times to find the takeaway you need, which greatly improves the user experience.
- a voice information processing method provided by an embodiment of the present application is applied to a voice assistant, and the method includes:
- Step S210 The voice assistant starts to receive voice information after being started.
- Step S220 After receiving the voice information, recognize the received voice information.
- Step S230 If it is recognized that the received voice information does not include takeaway-related information, the current process is ended.
- Step S231 If it is recognized that the received voice information includes takeaway-related information, recognize whether the takeaway-related information includes mode information about a pre-configured ordering mode of takeaway.
- the voice assistant can obtain information about takeaway merchants in advance, and then configure the user's favorite products.
- Step S240 If the identification includes pre-configured mode information, obtain product information of the merchant corresponding to the mode information.
- the mode information is recognized as the favorite mode, obtain the favorite product information in the favorite merchant configured by the user; if the mode information is recognized as the most recent mode, obtain the takeaway that the user recently ordered Merchant and takeaway product information; and if it is recognized that the mode information is the most mode, obtain the takeaway merchant and takeaway product information with the most orders placed by the user.
- Step S250 Generate an order based on the commodity information of the merchant corresponding to the mode information.
- the voice assistant can directly generate an order based on the acquired product information.
- the product information of the merchant corresponding to the mode information may be displayed first. If the voice information for confirming the order is recognized, based on the information
- the mode information corresponds to the merchandise information of the merchant to generate an order. For example, if the voice assistant recognizes that the user says “I want to order my favorite takeaway", the voice assistant may directly obtain pre-configured favorite products to generate an order after recognizing that there is "favorite” mode information.
- the voice assistant recognizes the "favorite” mode information, it can first display the product corresponding to the "favorite” mode information in the form of the aforementioned card, and then recognize that there is "favorite” mode information. After similar semantics such as "Place an order” or “just this” are confirmed, an order is generated based on the product information of the merchant corresponding to the mode information.
- the voice assistant when the voice assistant recognizes that the user said "the same takeaway as last time", the voice assistant can recognize that the mode information is the most recent, and then it can directly generate an order based on the last order.
- Step S260 If it is recognized that the takeaway-related information does not include pre-configured mode information, obtain takeaway merchant information according to a specified rule.
- Step S270 Display a card, and display the takeaway merchant information in the card.
- Step S280 Generate an order based on the target merchant information determined from the takeaway merchant information.
- This application provides a voice information processing method, through which the user can trigger the display of takeaway related information through voice, and can directly complete the operation of placing an order in the voice assistant, and there is no need to start a separate operation for Click on the target client of the takeaway, and then use multiple page scrolling to find the takeaway you need, which greatly improves the user experience.
- the voice information input by the user carries the mode information of the pre-configured take-out mode
- the order can be generated directly according to the mode information, so that the user can be more quickly adapted Needs to further enhance the user experience.
- a voice information processing method provided by an embodiment of the present application is applied to an electronic device, and the method includes:
- Step S310 The voice assistant starts to receive voice information after being started.
- Step S320 After receiving the voice information, recognize the received voice information.
- Step S330 If it is recognized that the received voice information does not include takeaway related information, the current process is ended.
- Step S331 If it is recognized that the received voice information includes takeaway-related information, obtain product information matching the user bound to the voice assistant.
- the step of obtaining product information matching the user bound to the voice assistant includes:
- Obtain the preference information of the user bound to the voice assistant obtain preference information pre-selected and set by the voice assistant binding user.
- the language assistant can recognize that Gong Gong Pao Chicken Set Rice is a preference of the user.
- the voice assistant can also periodically obtain the location information of the electronic device, so as to re-determine the designated time for preference statistics based on the location. segment.
- the voice assistant can periodically obtain the location information of the electronic device where the voice assistant is located; detect the location represented by the location information; if it is detected that the current location is away from the location of the last takeaway order If the distance exceeds the designated distance, the timing of the designated time period is restarted; based on the obtained preference information of the user bound to the voice assistant, product information matching the user bound to the voice assistant is obtained.
- Step S340 Generate an order based on product information matched with the user bound to the voice assistant.
- Step S350 Display the order.
- Step S360 Detect whether the order is selected within a specified time.
- Step S370 If selected, after detecting that the order is paid, send the order to the server of the target client, and receive the order execution status returned by the server.
- Step S380 If not selected, obtain the takeaway merchant information according to the specified rules.
- Step S390 Display a card, and display the takeaway merchant information in the card.
- Step S391 Generate an order based on the target merchant information determined from the takeaway merchant information.
- This application provides a voice information processing method, through which the user can trigger the display of takeaway related information through voice, and can directly complete the operation of placing an order in the voice assistant, and there is no need to start a separate operation for Click the target client of the takeaway, and then use multiple page scrolling to find the takeaway you need, which greatly improves the user experience.
- the order generated based on the obtained product information that matches the user bound to the voice assistant may be displayed first, and then when the user selects the displayed order, the payment is directly completed, and then The efficiency of the entire takeaway order is improved, and when the user does not select the order, the takeaway merchant information can still be obtained and displayed according to the specified rules.
- a voice information processing device 400 provided by an embodiment of the present application, the device 400 includes:
- the voice information receiving unit 410 is configured to receive voice information after the voice information processing device is started.
- the voice recognition unit 420 is configured to recognize the received voice information after receiving the voice information.
- the takeaway information obtaining unit 430 is configured to obtain takeaway merchant information according to specified rules if it is recognized that the received voice information includes takeaway related information.
- the takeaway information obtaining unit 430 is specifically configured to generate an information obtaining request based on the data interface with the target takeaway client, and send the information obtaining request to the target takeaway client, and the information obtaining request is used to trigger
- the target takeaway client obtains takeaway merchant information according to specified rules; and receives the takeaway merchant information returned by the target client.
- the takeaway information obtaining unit 430 is specifically configured to send an information obtaining request to the server corresponding to the target takeaway client, and the information obtaining request is used to trigger the server to obtain the takeaway merchant information according to a specified rule; Receive the takeaway merchant information returned by the server and obtained according to the specified rules.
- the information display unit 440 is configured to display a card and display the takeaway merchant information in the card.
- the information display unit 440 is specifically configured to detect the quantity of the takeaway merchant information; if it detects that the quantity of the takeaway merchant information is more than one, it displays multiple cards, each of which displays correspondingly A takeaway merchant information.
- the apparatus 400 further includes an order generating unit 450, configured to generate an order based on the target merchant information determined from the takeaway merchant information.
- the order generating unit 450 is configured to send the target merchant information to the target client, and the target merchant information is used to trigger the target client to generate order information and display the generated order information.
- the order generating unit 450 is further configured to send the order to the server of the target client after detecting the payment of the order.
- the information display unit 440 is also configured to receive the order execution status returned by the server.
- the takeaway merchant information includes the merchant name of the takeaway merchant.
- the information display unit 440 is further configured to display detailed information of the takeaway merchant if the designated touch operation on the card is detected, the detailed information including the product information of the takeaway merchant.
- the information display unit 440 is specifically configured to display detailed information of the takeaway merchant on the card after detecting a sliding operation in a specified direction acting on the card.
- the order generating unit 450 is specifically configured to generate an order based on the target product information determined from the product information of the target merchant.
- the device 400 further includes a designated mode adapting unit 460, configured to identify whether the takeaway-related information includes mode information about a pre-configured takeaway mode. In this manner, if the takeaway information obtaining unit 430 recognizes that it includes pre-configured mode information, it obtains the product information of the merchant corresponding to the mode information.
- the takeaway information obtaining unit 430 is specifically configured to obtain the favorite product information in the favorite merchant configured by the user when the mode information is recognized as the favorite mode; if the mode information is recognized as the most recent mode Obtain the takeaway merchant and takeaway product information where the user recently placed an order; and if it is recognized that the mode information is the most mode, obtain the takeaway merchant and takeaway product information with the most orders by the user.
- the order generating unit 450 is specifically configured to generate an order based on the commodity information of the merchant corresponding to the mode information. As a way, the order generating unit 450 is specifically configured to generate an order based on the commodity information of the merchant corresponding to the mode information after recognizing the voice information for confirming the order.
- the takeaway information acquisition unit 430 if it is recognized that the takeaway-related information does not include the pre-configured mode information, executes the specified rules to obtain the takeaway merchant information.
- the takeaway information obtaining unit 430 is specifically configured to obtain product information matching the user bound to the voice assistant if it is recognized that the received voice information includes takeaway related information. Wherein, the takeaway information obtaining unit 430 is specifically configured to obtain preference information of the user bound to the voice assistant; based on the obtained preference information of the voice assistant bound user, obtain product information that matches the voice assistant bound user . Wherein, optionally, the takeaway information obtaining unit 430 is specifically configured to obtain preference information pre-selected and set by the voice assistant bound user.
- the takeaway information acquisition unit 430 is specifically configured to calculate the commodities that the voice assistant bound user has placed orders within a specified time period; the number of orders placed exceeds the specified number of orders among the commodities placed in the specified historical time period
- the category of the product is used as the voice assistant to bind the user’s preference information.
- the apparatus further includes a location information obtaining unit 470, configured to periodically obtain location information of the electronic device where the voice assistant is located; and perform operations on the location represented by the location information. Detection; if it is detected that the distance between the current location and the location of the last takeaway order exceeds the specified distance, restart the timing of the specified time period.
- a location information obtaining unit 470 configured to periodically obtain location information of the electronic device where the voice assistant is located; and perform operations on the location represented by the location information. Detection; if it is detected that the distance between the current location and the location of the last takeaway order exceeds the specified distance, restart the timing of the specified time period.
- the order generating unit 450 is specifically configured to generate an order based on product information matched with the user bound to the voice assistant.
- the information display unit 440 is specifically configured to display the order.
- the device 400 further includes an order detection unit 480 for detecting whether the order is selected within a specified time. If not, the takeaway information acquiring unit 430 is specifically used for Perform the described procedures to obtain takeaway merchant information according to specified rules. If selected, after detecting that the order is paid, the order detection unit 480 sends the order to the server of the target client, and the information display unit 440 is specifically configured to receive the order execution status returned by the server.
- the electronic device 200 includes one or more (only one is shown in the figure) a processor 102, a memory 104, and a network module 106 coupled to each other.
- the memory 104 stores a program that can execute the content in the foregoing embodiment
- the processor 102 can execute the program stored in the memory 104.
- the processor 102 may include one or more processing cores.
- the processor 102 uses various interfaces and lines to connect various parts of the entire electronic device 200, and executes by running or executing instructions, programs, code sets, or instruction sets stored in the memory 104, and calling data stored in the memory 104.
- the processor 102 may use at least one of digital signal processing (Digital Signal Processing, DSP), Field-Programmable Gate Array (Field-Programmable Gate Array, FPGA), and Programmable Logic Array (Programmable Logic Array, PLA).
- DSP Digital Signal Processing
- FPGA Field-Programmable Gate Array
- PLA Programmable Logic Array
- the processor 102 may integrate one or a combination of a central processing unit (CPU), a graphics processing unit (GPU), a modem, and the like.
- the CPU mainly processes the operating system, user interface, and application programs; the GPU is used for rendering and drawing of display content; the modem is used for processing wireless communication. It can be understood that the above-mentioned modem may not be integrated into the processor 102, but may be implemented by a communication chip alone.
- the memory 104 may include random access memory (RAM) or read-only memory (Read-Only Memory).
- the memory 104 may be used to store instructions, programs, codes, code sets or instruction sets.
- the memory 104 may include a storage program area and a storage data area, where the storage program area may store instructions for implementing the operating system and instructions for implementing at least one function (such as touch function, sound playback function, image playback function, etc.) , Instructions for implementing the following method embodiments, etc.
- the data storage area can also store data (such as phone book, audio and video data, chat record data) created by the terminal 100 during use.
- the network module 106 is used to receive and send electromagnetic waves, realize the mutual conversion between electromagnetic waves and electrical signals, so as to communicate with a communication network or other devices, such as with audio playback devices.
- the network module 106 may include various existing circuit elements for performing these functions, for example, an antenna, a radio frequency transceiver, a digital signal processor, an encryption/decryption chip, a subscriber identity module (SIM) card, a memory, etc. .
- SIM subscriber identity module
- the network module 106 can communicate with various networks such as the Internet, an intranet, and a wireless network, or communicate with other devices through a wireless network.
- the aforementioned wireless network may include a cellular telephone network, a wireless local area network, or a metropolitan area network.
- the network module 106 can exchange information with the base station.
- FIG. 18 shows a structural block diagram of a computer-readable storage medium provided by an embodiment of the present application.
- the computer-readable medium 1100 stores program code, and the program code can be invoked by a processor to execute the method described in the foregoing method embodiment.
- the computer-readable storage medium 1100 may be an electronic memory such as flash memory, EEPROM (Electrically Erasable Programmable Read Only Memory), EPROM, hard disk, or ROM.
- the computer-readable storage medium 1100 includes a non-transitory computer-readable storage medium.
- the computer-readable storage medium 1100 has a storage space for executing the program code 810 of any method step in the foregoing method. These program codes can be read out from or written into one or more computer program products.
- the program code 1110 may be compressed in a suitable form, for example.
- the voice information processing method, device, electronic equipment, and storage medium provided by the present application start to receive voice information after the voice assistant is started, and when the voice assistant receives the voice information, it recognizes the received voice information , If it is recognized that the received voice information includes takeaway-related information, the takeaway merchant information is obtained according to the specified rules, and then the card is displayed, and the takeaway merchant information is displayed in the card, and based on the information from the takeaway The target merchant information determined in the merchant information generates an order.
- the user can trigger the display of takeaway related information through voice, and can directly complete the ordering operation in the voice assistant, and there is no need to start the target client for ordering takeaway separately, and then pass Swipe the page multiple times to find the takeaway you need, which greatly improves the user experience.
Landscapes
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- User Interface Of Digital Computer (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Description
Claims (20)
- 一种语音信息处理方法,其特征在于,应用于语音助手,所述方法包括:所述语音助手在启动后开始接收语音信息;当接收到语音信息后,对所述接收到的语音信息进行识别;若识别到所述接收到的语音信息中包括有外卖相关信息时,按照指定规则获取外卖商户信息;显示卡片,并在所述卡片中显示所述外卖商户信息;基于从所述外卖商户信息中确定的目标商户信息生成订单。
- 根据权利要求1所述的方法,其特征在于,所述外卖商户信息包括所述外卖商户的商户名称,所述显示卡片,并在所述卡片中显示所述外卖商户信息的步骤之后还包括:若检测到作用于所述卡片的指定触控操作后,显示所述外卖商户的详细信息,所述详细信息包括所述外卖商户的商品信息。
- 根据权利要求2所述的方法,其特征在于,所述基于从所述外卖商户信息中确定的目标商户信息生成订单的步骤包括:基于从所述目标商户的商品信息中确定的目标商品信息生成订单。
- 根据权利要求2所述的方法,其特征在于,所述若检测到作用于所述卡片的指定触控操作后,显示所述外卖商户的详细信息的步骤包括:若检测到作用于所述卡片的延指定方向的滑动操作后,在所述卡片中显示所述外卖商户的详细信息。
- 根据权利要求1所述的方法,其特征在于,所述显示卡片,并在所述卡片中显示所述外卖商户信息的步骤包括:检测所述所述外卖商户信息的数量;若检测到所述外卖商户信息的数量为多个时,显示多个卡片,其中每个卡片对应显示一个外卖商户信息。
- 根据权利要求1-5任一所述的方法,其特征在于,所述按照指定规则获取外卖商户信息的步骤之前还包括:识别所述外卖相关信息是否包括关于预先配置订外卖模式的模式信息,若识别包括预先配置的模式信息,获取与所述模式信息对应商户的商品信息;基于所述与所述模式信息对应商户的商品信息生成订单;若识别所述外卖相关信息不包括预先配置的模式信息,执行所述按照指定规则获取外卖商户信息。
- 根据权利要求6所述的方法,其特征在于,所述若识别包括预先配置的模式信息,获取与所述模式信息对应商户的商品信息的步骤包括:若识别所述模式信息为最喜欢模式时,获取用户配置的最喜欢商户中最喜欢的商品信息;若识别所述模式信息为最近模式时,获取用户最近一次下单的外卖商户以及外卖商品信息;以及若识别所述模式信息为最多模式时,获取用户下单最多的外卖商户以及外卖商品信息。
- 根据权利要求6所述的方法,其特征在于,基于所述与所述模式信息对应商户的商品信息生成订单的步骤包括:显示与所述模式信息对应商户的商品信息;若识别到确认下单的语音信息后,基于所述与所述模式信息对应商户的商品信息生成订单。
- 根据权利要求1-5任一所述的方法,其特征在于,所述按照指定规则获取外卖商户信息的步骤之前还包括:若识别到所述接收到的语音信息中包括有外卖相关信息时,获取与所述语音助手绑定用户匹配的商品信息;基于与所述语音助手绑定用户匹配的商品信息生成订单;显示所述订单;检测所述订单在指定时间内是否被选择,若未选择,执行所述按照指定规则获取外卖商户信息。
- 根据权利要求9所述的方法,其特征在于,所述获取与所述语音助手绑定用户匹配的商品信息的步骤包括:获取所述语音助手绑定用户的喜好信息;基于获取的所述语音助手绑定用户的喜好信息,获取与所述语音助手绑定用户匹配的商品信息。
- 根据权利要求10所述的方法,其特征在于,所述获取所述语音助手绑定用户的喜好信息的步骤包括:获取语音助手绑定用户预选设定的喜好信息。
- 根据权利要求11所述的方法,其特征在于,所述获取所述语音助手绑定用户的喜好信息的步骤包括:计算所述语音助手绑定用户在指定时间段内下单的商品;将所述指定历史时间段内下单的商品中下单次数超过指定次数的商品的类别作为所述语音助手绑定用户的喜好信息。
- 根据权利要求12所述的方法,其特征在于,所述方法还包括:周期性获取所述语音助手所在电子设备的位置信息;对所述位置信息所表征的位置进行检测;若检测到当前的位置距离最近一次订购外卖时的位置之间的距离超过指定距离,重新开始计时所述指定时间段。
- 根据权利要求1-13任一所述的方法,其特征在于,所述按照指定规则获取外卖商户信息的步骤包括:基于与目标外卖客户端之间的数据接口生成信息获取请求,向所述目标外卖客户端发送信息获取请求,所述信息获取请求用于触发所述目标外卖客户端按照指定规则获取外卖商户信息;接收所述目标客户端返回的外卖商户信息。
- 根据权利要求1-13任一所述的方法,其特征在于,所述按照指定规则获取外卖商户信息的步骤包括:向目标外卖客户端所对应的服务端发送信息获取请求,所述信息获取请求用于触发所述服务端按照指定规则获取外卖商户信息;接收所述服务端返回的按照指定规则获取的外卖商户信息。
- 根据权利要求1-15任一所述的方法,其特征在于,所述基于从所述外卖商户信息中确定的目标商户信息生成订单的步骤包括:将所述目标商户信息发送给目标客户端,所述目标商户信息用于触发所述目标客户端生成订单信息,并显示所生成的订单信息。
- 根据权利要求1-16任一所述的方法,其特征在于,所述基于从所述外卖商户信息中确定的目标商户信息生成订单的步骤之后还包括:检测到所述订单支付后,将所述订单发送给目标客户端的服务端,并接收所述服务端返回的订单执行情况。
- 一种语音信息处理装置,其特征在于,所述装置包括:语音信息接收单元,用于在所述语音信息处理装置启动后接收语音信息;语音识别单元,用于当接收到语音信息后,对所述接收到的语音信息进行识别;外卖信息获取单元,用于若识别到所述接收到的语音信息中包括有外卖相关信息时,按照指定规则获取外卖商户信息;信息显示单元,用于显示卡片,并在所述卡片中显示所述外卖商户信息;订单生成单元,用于基于从所述外卖商户信息中确定的目标商户信息生成订单。
- 一种电子设备,其特征在于,包括一个或多个处理器以及存储器;一个或多个程序被存储在所述存储器中并被配置为由所述一个或多个处理器执行以实现权利要求1-17任一所述的方法。
- 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质中存储有程序代码,其中,在所述程序代码被处理器运行时执行权利要求1-17任一所述的方法。
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2019/087667 WO2020232617A1 (zh) | 2019-05-20 | 2019-05-20 | 语音信息处理方法、装置、电子设备以及存储介质 |
CN201980089737.6A CN113330489A (zh) | 2019-05-20 | 2019-05-20 | 语音信息处理方法、装置、电子设备以及存储介质 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2019/087667 WO2020232617A1 (zh) | 2019-05-20 | 2019-05-20 | 语音信息处理方法、装置、电子设备以及存储介质 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2020232617A1 true WO2020232617A1 (zh) | 2020-11-26 |
Family
ID=73459363
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2019/087667 WO2020232617A1 (zh) | 2019-05-20 | 2019-05-20 | 语音信息处理方法、装置、电子设备以及存储介质 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN113330489A (zh) |
WO (1) | WO2020232617A1 (zh) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116805023B (zh) * | 2023-08-25 | 2023-11-03 | 量子数科科技有限公司 | 一种基于大语言模型的外卖推荐方法 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104078043A (zh) * | 2013-04-26 | 2014-10-01 | 腾讯科技(深圳)有限公司 | 网络交易系统的语音操作指令识别处理方法和系统 |
CN106383872A (zh) * | 2016-09-06 | 2017-02-08 | 北京百度网讯科技有限公司 | 基于人工智能的信息处理方法及装置 |
CN107230142A (zh) * | 2017-07-12 | 2017-10-03 | 陈维龙 | 基于语音生成订单的方法及装置、交易方法及系统 |
-
2019
- 2019-05-20 WO PCT/CN2019/087667 patent/WO2020232617A1/zh active Application Filing
- 2019-05-20 CN CN201980089737.6A patent/CN113330489A/zh active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104078043A (zh) * | 2013-04-26 | 2014-10-01 | 腾讯科技(深圳)有限公司 | 网络交易系统的语音操作指令识别处理方法和系统 |
CN106383872A (zh) * | 2016-09-06 | 2017-02-08 | 北京百度网讯科技有限公司 | 基于人工智能的信息处理方法及装置 |
CN107230142A (zh) * | 2017-07-12 | 2017-10-03 | 陈维龙 | 基于语音生成订单的方法及装置、交易方法及系统 |
Also Published As
Publication number | Publication date |
---|---|
CN113330489A (zh) | 2021-08-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113366524B (zh) | 信息推荐方法、装置、电子设备以及存储介质 | |
US9930167B2 (en) | Messaging application with in-application search functionality | |
CN105389099B (zh) | 用于语音记录和回放的方法和设备 | |
US8000454B1 (en) | Systems and methods for visual presentation and selection of IVR menu | |
US20180275840A1 (en) | Method for executing program and electronic device thereof | |
JP2020173834A (ja) | 通信セッションの状態の保存 | |
US10911565B2 (en) | Method, device and system for associating a service account | |
JP2022505659A (ja) | インタラクティブメッセージ処理方法、装置、コンピュータ機器及びコンピュータプログラム | |
CN110099380B (zh) | 应用程序推荐方法、装置、电子设备及介质 | |
US10832666B2 (en) | Advanced user interface for voice search and results display | |
US9665904B2 (en) | Order entry system and order entry method | |
WO2020228030A1 (zh) | 设备推荐方法、装置、电子设备以及存储介质 | |
CN111667328B (zh) | 页面内容展示方法、装置及电子设备 | |
CN108304115B (zh) | 终端条目选择方法、装置及存储介质和终端 | |
WO2020232616A1 (zh) | 信息推荐方法、装置、电子设备以及存储介质 | |
CN109683760B (zh) | 最近内容的显示方法、装置、终端及存储介质 | |
CN109684443B (zh) | 智能交互方法和装置 | |
WO2019223484A1 (zh) | 信息显示方法、装置、移动终端以及存储介质 | |
WO2020232617A1 (zh) | 语音信息处理方法、装置、电子设备以及存储介质 | |
US8867708B1 (en) | Systems and methods for visual presentation and selection of IVR menu | |
US11121888B2 (en) | Intelligent service platform and method | |
CN113330475B (zh) | 信息推荐方法、装置、电子设备以及存储介质 | |
WO2018113751A1 (zh) | 一种设置通信快捷方式的方法及电子设备 | |
JP2015012463A (ja) | 情報処理装置、特定用語通知方法、プログラム、特定用語通知システム、および端末装置 | |
WO2020258082A1 (zh) | 信息推荐方法、装置、电子设备以及存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19929763 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19929763 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 26.04.2022) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19929763 Country of ref document: EP Kind code of ref document: A1 |