CN112489659B - Data processing method, system, server and terminal equipment based on input method - Google Patents

Data processing method, system, server and terminal equipment based on input method Download PDF

Info

Publication number
CN112489659B
CN112489659B CN202011287122.5A CN202011287122A CN112489659B CN 112489659 B CN112489659 B CN 112489659B CN 202011287122 A CN202011287122 A CN 202011287122A CN 112489659 B CN112489659 B CN 112489659B
Authority
CN
China
Prior art keywords
server
input method
method application
binding
text input
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011287122.5A
Other languages
Chinese (zh)
Other versions
CN112489659A (en
Inventor
刘晶
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Apollo Zhilian Beijing Technology Co Ltd
Original Assignee
Apollo Zhilian Beijing Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Apollo Zhilian Beijing Technology Co Ltd filed Critical Apollo Zhilian Beijing Technology Co Ltd
Priority to CN202011287122.5A priority Critical patent/CN112489659B/en
Publication of CN112489659A publication Critical patent/CN112489659A/en
Application granted granted Critical
Publication of CN112489659B publication Critical patent/CN112489659B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/02Input arrangements using manually operated switches, e.g. using keyboards or dials
    • G06F3/023Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
    • G06F3/0233Character input methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • General Engineering & Computer Science (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The application discloses a data processing method, a system, a server, terminal equipment, electronic equipment and a storage medium based on an input method, and relates to artificial intelligence, automatic driving, voice technology and information flow technology in computer technology. The specific implementation scheme is as follows: the first server receives the voice recognition request sent by the second server, acquires the audio information corresponding to the voice recognition request from the terminal equipment according to the voice recognition request, generates text information corresponding to the audio information, and sends the text information to the second server. The defect of error of the input text information caused by error clicking is realized, the accuracy and the reliability of information input are improved, and the technical effects of improving the flexibility and the diversity of data processing are realized.

Description

Data processing method, system, server and terminal equipment based on input method
Technical Field
The present disclosure relates to artificial intelligence, autopilot, voice technology, and information flow technology in computer technology, and more particularly, to a data processing method, system, server, terminal device, electronic device, and storage medium based on an input method.
Background
Input method applications are widely used in various terminal devices such as hand-held terminals, notebook computers, and vehicle-mounted terminals.
In some embodiments, the input method application may support text input and speech input, while in other embodiments, the input method application supports only text input or speech input, and when the input method application supports only text input, we may refer to the input method application as a text input method application.
When the input method is applied as a text input method, a user needs to input related information to the terminal device in a manual mode, so that how to improve the flexibility and diversity of inputting the related information by the user becomes a problem to be solved urgently.
Disclosure of Invention
The application provides a data processing method, a system, a server, a terminal device, an electronic device and a storage medium based on an input method, which are used for improving flexibility and diversity.
According to a first aspect of the present application, there is provided an input method-based data processing method applied to a first server, where the first server is configured to provide a voice input method application service, the method including:
receiving a voice recognition request sent by a second server, wherein the second server is used for providing text input method application service, and the voice recognition request is generated when a terminal device initiates audio information;
according to the voice recognition request, acquiring audio information corresponding to the voice recognition request from the terminal equipment;
generating text information corresponding to the audio information, and sending the text information to the second server, wherein the text information is used for being output by the terminal equipment.
In this embodiment, through interaction among the first server, the second server and the terminal device, output of text information corresponding to audio information is achieved, and the problem that in the related art, text information needs to be input manually is avoided, which is caused by the fact that: the defect of error of the input text information caused by the click error is solved, and the problems of low flexibility and convenience are solved, so that the accuracy and reliability of information input are improved, and the technical effects of improving the flexibility and diversity of data processing are realized.
According to a second aspect of the present application, there is provided an input method-based data processing method applied to a second server, where the second server is configured to provide a text input method application service, and the method includes:
if the audio information sent by the terminal equipment is received, a voice recognition request is sent to a first server, wherein the voice recognition request is used for requesting to acquire text information corresponding to the audio information;
receiving text information corresponding to the audio information and fed back by the first server;
and sending the text information to the terminal equipment, wherein the text information is used for being output by the terminal equipment.
According to a third aspect of the present application, there is provided a data processing method based on an input method, applied to a terminal device, including:
receiving audio information input by a user, and respectively transmitting the audio information to a first server and a second server, wherein the first server is used for providing a voice input method application service, the second server is used for providing a text input method application service, the audio information is used for generating a voice recognition request and text information, and the audio information is transmitted to the first server after the first server receives the voice recognition request;
And receiving the text information sent by the second server and outputting the text information.
According to a fourth aspect of the present application, there is provided a server for providing a voice input method application service, the server comprising:
the first receiving module is used for receiving a voice recognition request sent by a second server, wherein the second server is used for providing text input method application service, and the voice recognition request is generated when the terminal equipment initiates audio information;
the acquisition module is used for acquiring audio information corresponding to the voice recognition request from the terminal equipment according to the voice recognition request;
the first generation module is used for generating text information corresponding to the audio information;
and the first sending module is used for sending the text information to the second server, wherein the text information is used for being output by the terminal equipment.
According to a fifth aspect of the present application, there is provided a server for providing a text input method application service, comprising:
a fourth sending module, configured to send a speech recognition request to a first server if audio information sent by a terminal device is received, where the speech recognition request is used to request to obtain text information corresponding to the audio information;
The third receiving module is used for receiving text information corresponding to the audio information and fed back by the first server;
and a fifth sending module, configured to send the text information to the terminal device, where the text information is used for being output by the terminal device.
According to a sixth aspect of the present application, there is provided a terminal device, comprising:
the sixth receiving module is used for receiving the audio information input by the user;
a seventh sending module, configured to send the audio information to a first server and a second server, where the first server is configured to provide a voice input method application service, the second server is configured to provide a text input method application service, the audio information is used to generate a voice recognition request and text information, and the audio information is sent to the first server after the first server receives the voice recognition request;
a seventh receiving module, configured to receive the text information sent by the second server;
and the output module is used for outputting the text information.
According to a seventh aspect of the present application, there is provided an input method based data processing system, the system comprising:
The server according to the fourth embodiment;
the server according to the fifth embodiment;
the terminal device according to the sixth embodiment.
According to an eighth aspect of the present application, there is provided an electronic device comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of the first aspect; or,
to enable the at least one processor to perform the method as described in the second aspect; or,
to enable the at least one processor to perform the method as described in the third aspect.
According to a ninth aspect of the present application, there is provided a non-transitory computer readable storage medium storing computer instructions for causing the computer to perform the method according to the first aspect; or,
the computer instructions for causing the computer to perform the method according to the second aspect; or,
the computer instructions are for causing the computer to perform the method according to the third aspect.
According to a ninth aspect of the present application, there is provided a computer program product comprising: a computer program stored in a readable storage medium, from which at least one processor of an electronic device can read, the at least one processor executing the computer program causing the electronic device to perform the method of the first aspect; alternatively, execution of the computer program by the at least one processor causes the electronic device to perform the method of the second aspect; alternatively, execution of the computer program by the at least one processor causes the electronic device to perform the method of the third aspect.
According to the present application: when the second server only supports the application of the text input method, the terminal equipment can provide the audio information by combining interaction among the first server, the second server and the terminal equipment, the second server initiates a voice recognition request, the first server converts the audio information into a specific interaction process of the text information, the technical scheme of outputting the text information corresponding to the audio information is realized, and the problem that in the related art, when the second server only supports the application of the text input method, the text information needs to be input manually is avoided: the defect of error of the input text information caused by the click error is solved, and the problems of low flexibility and convenience are solved, so that the accuracy and reliability of information input are improved, and the technical effect technology for improving the flexibility and diversity of data processing is realized.
It should be understood that the description of this section is not intended to identify key or critical features of the embodiments of the application or to delineate the scope of the application. Other features of the present application will become apparent from the description that follows.
Drawings
The drawings are for better understanding of the present solution and do not constitute a limitation of the present application. Wherein:
FIG. 1 is a schematic diagram according to a first embodiment of the present application;
FIG. 2 is a schematic illustration of an application scenario according to the present application;
FIG. 3 is a schematic diagram according to a second embodiment of the present application;
FIG. 4 is a schematic diagram according to a third embodiment of the present application;
FIG. 5 is a schematic diagram according to a fourth embodiment of the present application;
FIG. 6 is a schematic diagram according to a fifth embodiment of the present application;
FIG. 7 is a schematic diagram according to a sixth embodiment of the present application;
FIG. 8 is a schematic diagram according to a seventh embodiment of the present application;
FIG. 9 is a schematic diagram according to an eighth embodiment of the present application;
FIG. 10 is a schematic diagram according to a ninth embodiment of the present application;
fig. 11 is a schematic diagram according to a tenth embodiment of the present application.
Detailed Description
Exemplary embodiments of the present application are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present application to facilitate understanding, and should be considered as merely exemplary. Accordingly, one of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the present application. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.
The inventors of the present application have creatively made the inventive concept of the present application: if the first server supports the voice input method application service, the second server only supports the text input method application, and when the user inputs the audio information through the terminal equipment, the second server can output the text information based on the audio information through interaction among the second server, the first server and the terminal equipment.
Based on the above inventive concept, the application provides a data processing method, a system, a server, a terminal device, an electronic device and a storage medium based on an input method, which are applied to artificial intelligence, automatic driving, voice technology and information flow technology in computer technology to achieve the technical effect of improving the accuracy and reliability of information input.
The following describes the technical solutions of the present application and how the technical solutions of the present application solve the above technical problems in detail with specific embodiments. The following embodiments may be combined with each other, and the same or similar concepts or processes may not be described in detail in some embodiments. Embodiments of the present application will be described below with reference to the accompanying drawings.
Fig. 1 is a schematic diagram according to a first embodiment of the present application, as shown in fig. 1, a data processing method based on an input method provided in this embodiment includes:
s101: the first server receives the voice recognition request sent by the second server.
The first server is used for providing a voice input method application service, the second server is used for providing a text input method application service, and the second server generates voice recognition requests when the voice recognition requests initiate audio information for the terminal equipment.
For example, the execution body of the embodiment may be a first server, and the first server is a server for improving a voice input method application service. In one example, the first server may be a stand-alone server, and in another example, the first server may be a server cluster including a plurality of servers.
The terminal device may include various handheld devices, vehicle mounted devices, wearable devices, computing devices, or other processing devices connected to a wireless modem with wireless communication capabilities.
In particular, the terminal device may be a mobile terminal, such as a mobile phone (or "cellular" phone) and a computer with a mobile terminal, for example, a portable, pocket, hand-held, computer-built-in or vehicle-mounted mobile device (such as a vehicle-mounted terminal) that exchanges speech and/or data with the radio access network; the terminal device may also be a personal communication services (Personal Communication Service, PCS) phone, cordless phone, session initiation protocol (Session Initiation Protocol, SIP) phone, wireless local loop (Wireless Local Loop, WLL) station, personal digital assistant (Personal Digital Assistant, PDA), tablet, wireless modem (modem), handheld device (handset), laptop (laptop computer), machine type communication (Machine Type Communication, MTC) terminal or the like; the Terminal device may also be referred to as a system, subscriber Unit (Subscriber Unit), subscriber Station (Subscriber Station), mobile Station (Mobile Station), remote Station (Remote Station), remote Terminal (Remote Terminal), access Terminal (Access Terminal), user Terminal (User Terminal), user Agent (User Agent), user device (User Device or User Equipment), and the like, without limitation herein.
For example, if the terminal device is a vehicle-mounted mobile device (e.g., a vehicle-mounted terminal), the data processing method based on the input method provided in this embodiment may be applied to an application scenario as shown in fig. 2, where, as shown in fig. 2, the application scenario may include: driver 100, in-vehicle terminal 210 provided in vehicle 200, text input method server 300, and voice input method server 400.
The vehicle-mounted terminal is one of terminal devices in S101, the text input method server is the second server in S101, and the voice input method server is the first server in S101.
In fig. 2, in order to distinguish between the text input method server 300 and the voice input method server 400, the text input method server 300 is presented in the form of an independent server, and the voice input method server 400 is presented in the form of a service cluster.
It should be understood that the driver 100 may input text information at interfaces corresponding to respective vehicle-mounted services of the vehicle-mounted terminal 210 based on the text input method application, and enjoy the vehicle-mounted service provided by the vehicle-mounted terminal 210 based on the text information. Among them, the in-vehicle service includes a navigation service and an entertainment service (e.g., a music service) installed in the in-vehicle terminal 210, etc.
In the related art, if the input method application of the vehicle-mounted terminal is a text input method application, and the text input method application cannot support voice analysis, when a driver inputs audio information at an interface corresponding to any vehicle service of the vehicle-mounted terminal, the any vehicle service cannot normally run, and the driver needs to input text information at the interface corresponding to the any vehicle service manually, so that the vehicle-mounted terminal can run the any vehicle service.
In this embodiment, the driver may directly input the audio information without manually inputting the text information corresponding to the any vehicle service, and when the text input method server receives the audio information sent by the terminal device, the text input method server may send a voice recognition request to the voice input method server.
S102: the first server collects audio information corresponding to the voice recognition request from the terminal equipment according to the voice recognition request.
In connection with the application scenario as shown in fig. 2, this step can be understood as: if the voice input method server receives a voice recognition request sent by the text input method server, audio information corresponding to the voice recognition request is collected from the vehicle-mounted terminal.
S103: the first server generates text information corresponding to the audio information and sends the text information to the second server, wherein the text information is used for being output by the terminal equipment.
The first server may parse the audio information after obtaining the audio information from the terminal device, generate text information corresponding to the audio information, and send the text information to the second server, where the second server may send the text information to the terminal device, and the terminal device outputs the text information.
The second server initiates the voice recognition request, and the first server converts the voice information into a specific interaction process of the text information, so that the text information corresponding to the voice information is output, and the problem that in the related art, when the second server only supports the text input method application, the text information needs to be input manually is avoided, wherein the interaction is caused by combining the first server, the second server and the terminal equipment: the defect of error of the input text information caused by the click error is solved, and the problems of low flexibility and convenience are solved, so that the accuracy and reliability of information input are improved, and the technical effects of improving the flexibility and diversity of data processing are realized.
In connection with the application scenario as shown in fig. 2, S103 may be understood as: after the voice input method server obtains the audio information from the vehicle-mounted terminal, the audio information can be analyzed, text information corresponding to the audio information is generated, the text information is sent to the text input method server, and accordingly, the text input method server can send the text information to the vehicle-mounted terminal, and the vehicle-mounted terminal outputs the text information.
Based on the above analysis, by adopting the data processing method based on the input method provided by the embodiment, the accuracy and reliability of information input can be improved, and the technical effects of flexibility and diversity of data processing can also be improved.
Fig. 3 is a schematic diagram according to a second embodiment of the present application, and as shown in fig. 3, a data processing method based on an input method according to the present embodiment includes:
S201: the terminal equipment collects the audio information input by the user.
For example, one or more microphones may be provided on the terminal device, and audio information input by the user may be collected according to the one or more microphones.
S202: the terminal device sends the audio information to the second server.
Accordingly, the second server receives the audio information transmitted by the terminal device.
S203: the second server generates a voice recognition request according to the audio information.
In one example, the second server may monitor information sent by the terminal device, and if it is monitored that the information sent by the terminal device is audio information, a voice recognition request may be generated; if the information sent by the terminal device is monitored to be text information, the search result corresponding to the text information can be displayed through the terminal device, or the text information can be sent to the opposite terminal device, for example, in combination with an application scenario as shown in fig. 2, if the driver inputs the text information corresponding to the destination on the terminal device in a manual mode, the text input method server can query the destination based on the text information and display the text information through the vehicle-mounted terminal.
S204: the second server sends a voice recognition request to the first server.
Accordingly, the first server receives the voice recognition request sent by the second server.
In some embodiments, a binding relationship between the voice input method application and the text input method application may be established through interaction between the first server and the second server, and the second server may send a voice recognition request to the second server based on the binding relationship, where the binding relationship may be used for characterization, and service support provided by the voice input method application is accessed by the text input method application.
The binding relationship is established, which can be understood as approval and guarantee of interaction of the two parties, so as to realize the safety and reliability of information transmission between the first server and the second server.
In some embodiments, establishing a binding relationship between a voice input method application and a text input method application may include the steps of:
step 1: the second server sends a binding request to the first server, wherein the binding request is used for requesting to access a service provided by the voice input application, and the binding request carries an identification of the text input method application.
Accordingly, the first server receives the binding request sent by the second server.
Step 2: the first server executes the binding operation according to the binding request, if the binding is successful, the step 3 is executed, and if the binding is failed, the steps 7 to 8 are executed.
Step 3: the first server generates a binding relation corresponding to the identification of the text input method application, and sends a first prompt message to the second server, wherein the first prompt message is used for prompting that the binding is successful and prompting the second server to stop initiating the binding request.
Accordingly, the second server receives the first hint message sent by the second server.
Step 4: the second server stops sending binding requests to the first server.
Illustratively, the first server and the second server may complete binding based on an interface definition language (Android Interface Definition Language, AIDL) and communicate data, thereby enabling data communication between two different counties.
In this embodiment, if the binding is successful, the first server sends the first prompt message to the second server, and the second server stops sending the binding request to the first server based on the first prompt message, so that the technical effect of saving the interactive resources can be achieved.
Step 5: the first server monitors binding relations corresponding to the identifications of the text input method applications.
Step 6: if the binding relationship corresponding to the identification of the text input method application is in the disconnection state, the first server rebuilds the binding relationship corresponding to the identification of the text input method application.
In this embodiment, after the binding is successful, the first server may monitor the binding relationship corresponding to the identifier of the text input method application, and rebind the binding relationship corresponding to the identifier of the text input method application when the binding relationship corresponding to the identifier of the text input method application is disconnected, so as to achieve the technical effects of reliability and stability of the binding relationship corresponding to the identifier of the text input method application, and improving reliability and timeliness of interaction.
In other embodiments, steps 5 and 6 may be replaced with: the second server monitors binding relations corresponding to the identifications of the text input method application; and if the binding relation corresponding to the identification of the text input method application is in a disconnected state, the second server initiates a binding request to the first server again.
That is, in one example, the first server may monitor the binding relationship corresponding to the identifier of the text input method application, and actively rebind when the binding relationship corresponding to the identifier of the text input method application is disconnected; in another example, the second server may monitor the binding relationship corresponding to the identifier of the text input method application, and when the binding relationship corresponding to the identifier of the text input method application is disconnected, initiate a binding request to the first server again, and the first server performs the step of the binding operation (i.e. returns to step 2), thereby implementing the reliability of the binding relationship corresponding to the identifier of the text input method application, and further implementing the technical effect of reliable output of the text information.
Step 7: the first server generates and sends a second hint message to the second server, wherein the second hint message is used for hinting that the binding fails.
Accordingly, the second server receives the second hint message sent by the first server.
Step 8: and after the preset time interval, the second server sends a binding request to the first server again until the binding is successful.
For example, the preset time interval may be set by the second server (or the second server and the first server) based on demand, history, trial, and the like.
That is, in this embodiment, the binding may be completed by a loop binding manner, that is, if the binding fails, the second server initiates the binding request again after a period of time until the binding succeeds, so as to implement reliability of data processing, avoid the defect that the audio information cannot be identified and the text information needs to be input by the user due to the binding failure, and improve the intelligent technical effect of data processing.
S205: the first server acquires audio information corresponding to the audio identification request from the terminal device.
In combination with the above example, the first server may receive the voice recognition request sent by the second server based on the binding relationship, and in this embodiment, if the voice recognition request may include the identifier of the text input method application, the first server may determine whether the pre-stored binding relationship includes the binding relationship corresponding to the identifier of the text input method application, and if so, the first server collects the audio information corresponding to the voice recognition request from the terminal device.
In this embodiment, when the prestored binding relationship includes a binding relationship corresponding to the identifier of the text input method application, the audio information is collected, which is equivalent to that the first server authenticates the identity of the second server, and when the authentication passes, the voice recognition request of the second server is processed and fed back (that is, the audio information is collected from the terminal device and the subsequent operation is performed), so that the technical effects of reliability and safety of data processing are achieved.
In some embodiments, the speech recognition request includes an identification of the text input method application, and S205 may include the following steps:
step 1: the first server determines the terminal equipment corresponding to the identification of the text input method application according to the mapping relation between the identification of the pre-stored text input method application and the identification of the terminal equipment.
That is, the mapping relationship may be stored in the first server, and the mapping relationship may be sent to the first server by the second server, or may be sent to the first server by the terminal device, which is not limited in this embodiment.
In some embodiments, a method of establishing a mapping relationship may include: the terminal equipment sends a registration request to a second server, wherein the registration request carries an identification of the text input method application and an identification of the terminal equipment, and the registration request is used for requesting to establish a mapping relation between the text input method application and the terminal equipment; and the second server establishes a mapping relation according to the registration request.
Step 2: if the voice input method application in the terminal equipment is in an idle state, the first server collects audio information from the terminal equipment.
The idle state may be used to characterize that the voice input method application may currently provide services.
In this embodiment, the first server determines the state information of the voice input method application, and collects the audio information when the state information is in the idle state, so as to improve the technical effect of flexibility of data processing.
Step 3: if the voice input method application in the terminal equipment is in a busy state, when the voice input method application in the terminal equipment is switched from the busy state to an idle state, the first server collects audio information from the terminal equipment.
It should be understood that the state information of the voice input method application may be an idle state as described in step 2, or may be a busy state in the step, and when the voice input method application is in the busy state, in order to ensure stability and reliability of calling the voice input method application by other threads, the first server may monitor the state information, and collect audio information when it is monitored that the state information is switched from the busy state to the idle state, i.e. the voice input method application is released, thereby realizing flexibility and diversity of data processing, and ensuring technical effects of calling the reliability and stability of the voice input method application by each thread.
In some embodiments, step 3 may be replaced with: if the voice input method application in the terminal equipment is in a busy state, generating and sending a third prompt message to the second server, wherein the third prompt message is used for prompting voice failure, and the third prompt message is used for being output by the terminal equipment.
For example, in connection with the application scenario as shown in fig. 2, the present embodiment can be understood as: if the voice input method application in the vehicle-mounted terminal is in a busy state, the voice input method server generates a third prompt message and sends the third prompt message to the text input method server, the text input method server sends the third prompt message to the vehicle-mounted terminal, and the vehicle-mounted terminal displays the third prompt message, wherein the third prompt message is used for prompting voice failure.
In some embodiments, the third hint message may carry the reason for the voice failure.
In this embodiment, on the one hand, when the voice input method application is in a busy state, the first server may generate and send the third prompting message to the second server, so as to achieve the technical effect of ensuring stability and reliability of calling the voice input method application by other threads; on the other hand, the terminal equipment outputs the third prompt message, so that the voice interaction experience of the user can be improved.
S206: the first server analyzes the audio information to obtain text information.
S207: the first server sends the text information to the second server.
Accordingly, the second server receives the text information transmitted by the first server.
S208: the second server transmits the text information to the terminal device.
Accordingly, the terminal device receives the text information transmitted by the second server.
In some embodiments, S208 may specifically include: and the second server determines the terminal equipment according to the mapping relation and sends text information to the terminal equipment.
In this embodiment, the terminal device is determined by a mapping relationship manner, and text information is sent to the terminal device, so that a corresponding relationship between the text information and the terminal device can be realized, and the accuracy and reliability of text information output are improved
S209: the terminal device outputs the text information.
Illustratively, the terminal device includes a display, which may be used to characterize a device for displaying other information such as text information, and the display includes a liquid crystal display (Liquid Crystal Display, LCD), a light emitting diode (Light Emitting Diode, LED) display, an organic light emitting (Organic Light Emitting Display, OLED) display, and the like, and the embodiments of the present application are not limited.
Fig. 4 is a schematic diagram according to a third embodiment of the present application, and as shown in fig. 4, a data processing method based on an input method according to the present embodiment includes:
s301: if the audio information sent by the terminal equipment is received, the second server sends a voice recognition request to the first server, wherein the voice recognition request is used for requesting to acquire text information corresponding to the audio information.
S302: the second server receives text information corresponding to the audio information fed back by the first server.
S303: the second server sends text information to the terminal device, the text information being for output by the terminal device.
In some embodiments, the speech recognition request includes an identification of the text input method application; and when the voice input method application in the terminal equipment is in an idle state, determining based on the audio information acquired from the terminal equipment, wherein the terminal equipment is determined according to the mapping relation between the identification of the voice input method application and the identification of the terminal equipment.
In some embodiments, the speech recognition request includes an identification of the text input method application; the text information is that the voice input method application in the terminal equipment is in an idle state, when the voice input method application in the terminal equipment is switched from a busy state to the idle state, the text information is determined based on the audio information collected from the terminal equipment, and the terminal equipment is determined according to the mapping relation between the identification of the voice input method application and the identification of the terminal equipment.
In some embodiments, the speech recognition request includes an identification of the text input method application; and if the prestored binding relation comprises a binding relation corresponding to the identification of the text input method application, acquiring the audio information from the terminal equipment, wherein the binding relation corresponding to the identification of the text input method application is used for representing that the service support provided by the voice input method application is accessed by the text input method application.
In some embodiments, before the second server sends the voice recognition request to the first server, the method further comprises:
a second server sends a binding request to the first server, wherein the binding request is used for requesting to access a service provided by the voice input application;
the second server receives a first prompt message sent by the first server, wherein the first prompt message is used for prompting that the binding is successful and prompting that the binding request is stopped;
and stopping sending the binding request to the first server according to the first prompt message.
In some embodiments, after the second server sends the binding request to the first server, the method further comprises:
The second server receives a second prompt message sent by the second server, wherein the second prompt message is used for prompting binding failure;
the second server sends a binding request to the first server again based on the second hint message.
In some embodiments, after the second server receives the first hint message sent by the first server, the method further includes:
if the binding relation corresponding to the identification of the text input method application is disconnected, the second server sends a rebinding request to the first server, wherein the rebinding request is used for requesting to rebinding the binding relation corresponding to the identification of the text input method application.
In some embodiments, the binding corresponding to the identification of the text input method application is: and monitoring the binding relation corresponding to the identification of the text input method application, and reconstructing if the binding relation corresponding to the identification of the text input method application is in a disconnected state.
In some embodiments, the speech recognition request includes an identification of the text input method application; after the second server sends the voice recognition request to the first server, the method further comprises:
The second server receives a third prompt message sent by the first server, wherein the third prompt message is used for prompting voice failure, and the third prompt message is: according to a mapping relation between a pre-stored text input method application identifier and a terminal device identifier, determining a terminal device corresponding to the text input method application identifier, and generating a voice input method application in the terminal device in a busy state;
and sending the third prompt message to the terminal equipment, wherein the third prompt message is used for being output by the terminal equipment.
In some embodiments, the second server sending the text information to the terminal device includes:
and the second server determines the terminal equipment according to a preset mapping relation and sends the text information to the terminal equipment for outputting, wherein the mapping relation is used for representing the corresponding relation between the identification of the text input method application and the identification of the terminal equipment.
In some embodiments, before the second server sends the text information to the terminal device, the method further comprises:
a second server receives a registration request sent by the terminal equipment, wherein the registration request carries an identifier of the text input method application and an identifier of the terminal equipment, and the registration request is used for requesting to establish a mapping relationship between the text input method application and the terminal equipment;
And the second server establishes the mapping relation according to the registration request.
Fig. 5 is a schematic diagram according to a fourth embodiment of the present application, and as shown in fig. 5, a data processing method based on an input method according to the present embodiment includes:
s401: the terminal equipment receives audio information input by a user and respectively sends the audio information to a first server and a second server, wherein the first server is used for providing a voice input method application service, the second server is used for providing a text input method application service, the audio information is used for generating a voice recognition request and text information, and the terminal equipment sends the audio information to the first server after the first server receives the voice recognition request.
S402: the terminal device receives the text information transmitted by the second server and outputs the text information.
In some embodiments, the terminal device receives the text information sent by the second server, including: and receiving the text information sent by the second server based on a preset mapping relation, wherein the mapping relation is used for representing the correspondence relation between the identification of the text input method application and the identification of the terminal equipment.
In some embodiments, the method further comprises:
the terminal equipment sends a registration request to the second server, wherein the registration request carries the identification of the text input method application and the identification of the terminal equipment, and the registration request is used for requesting to establish the mapping relationship between the text input method application and the terminal equipment.
In some embodiments, after the terminal device sends the audio information to the first server, the method further comprises:
the terminal equipment receives and outputs a third prompt message sent by the second server, wherein the third prompt message is used for prompting voice recognition failure, and the third prompt message is: and determining the terminal equipment corresponding to the identification of the text input method application according to the mapping relation between the identification of the pre-stored text input method application and the identification of the terminal equipment, wherein the voice input method application in the terminal equipment is generated when the voice input method application is in a busy state.
Fig. 6 is a schematic diagram of a fifth embodiment of the present application, and as shown in fig. 6, a server for providing a voice input method application service according to the present embodiment includes:
a first receiving module 11, configured to receive a speech recognition request sent by a second server, where the second server is configured to provide a text input method application service, and the speech recognition request is generated when a terminal device initiates audio information;
An acquisition module 12, configured to acquire, according to the speech recognition request, audio information corresponding to the speech recognition request from the terminal device;
a first generation module 13, configured to generate text information corresponding to the audio information;
a first sending module 14, configured to send the text information to the second server, where the text information is used for being output by the terminal device.
The server in this embodiment corresponds to the first server in any of the above embodiments, and corresponds to the voice input method server in the application scenario shown in fig. 2.
In some embodiments, the speech recognition request includes an identification of the text input method application; the collection module 12 is configured to determine, according to a mapping relationship between a pre-stored identifier of a text input method application and an identifier of a terminal device, the terminal device corresponding to the identifier of the text input method application, and collect the audio information from the terminal device if the voice input method application in the terminal device is in an idle state.
In some embodiments, the speech recognition request includes an identification of the text input method application; the collection module 12 is configured to determine, according to a mapping relationship between a pre-stored identifier of a text input method application and an identifier of a terminal device, the terminal device corresponding to the identifier of the text input method application, and if the voice input method application in the terminal device is in a busy state, collect the audio information from the terminal device when the voice input method application in the terminal device is switched from the busy state to an idle state.
In some embodiments, the speech recognition request includes an identification of the text input method application; the collection module 12 is configured to collect, from the terminal device, audio information corresponding to the speech recognition request if the pre-stored binding relationship includes a binding relationship corresponding to the identifier of the text input method application, where the binding relationship corresponding to the identifier of the text input method application is used to characterize that service support provided by the speech input method application is accessed by the text input method application.
Fig. 7 is a schematic diagram according to a sixth embodiment of the present application, and as shown in fig. 7, on the basis of the fifth embodiment, the server further includes:
a second receiving module 15, configured to receive a binding request initiated by the second server;
a binding module 16, configured to perform a binding operation corresponding to the binding request, where the binding request is used to request access to a service provided by the voice input application;
the second generation module 17 is configured to generate a binding relationship corresponding to the identifier of the text input method application if the binding is successful;
the second sending module 18 is configured to send a first prompting message to the second server, where the first prompting message is configured to prompt that the binding is successful, and is configured to prompt the second server to stop initiating the binding request.
In some embodiments, the second generating module 17 is configured to generate a second hint message if the binding fails, where the second hint message is used to hint that the binding fails;
the second sending module 18 is configured to send the second prompting message to the second server;
the second receiving module 15 is configured to receive a binding request sent again by the second server based on the second hint message, and return to the step of executing, by the binding module 16, a binding operation corresponding to the binding request.
In some embodiments, the second receiving module 15 is configured to receive a binding request sent by the second server after the binding relationship corresponding to the identifier of the text input method application is broken, and return to the step of executing, by the binding module 16, a binding operation corresponding to the binding request.
As can be seen in conjunction with fig. 7, in some embodiments, the server further comprises:
the monitoring module 19 is used for monitoring the binding relation corresponding to the identification of the text input method application;
the binding module 16 is configured to reconstruct a binding relationship corresponding to the identifier of the text input method application if the binding relationship corresponding to the identifier of the text input method application is in a disconnected state.
As can be seen in conjunction with fig. 7, in some embodiments, the speech recognition request includes an identification of the text input method application; the server further includes:
a determining module 20, configured to determine a terminal device corresponding to an identifier of a text input method application according to a mapping relationship between a pre-stored identifier of the text input method application and an identifier of the terminal device;
a third generating module 21, configured to generate a third prompting message if the voice input method application in the terminal device is in a busy state, where the third prompting message is used to prompt a voice failure, and the third prompting message is used to be output by the terminal device;
and a third sending module 22, configured to send the third prompting message to the second server.
In some embodiments, the text information is output by the terminal device based on a preset mapping relationship, where the mapping relationship is used to characterize a correspondence between the identifier of the text input method application and the identifier of the terminal device.
Fig. 8 is a schematic diagram of a seventh embodiment of the present application, and as shown in fig. 8, a server for providing a text input method application service includes:
A fourth sending module 31, configured to send a speech recognition request to the first server if audio information sent by the terminal device is received, where the speech recognition request is used to request to obtain text information corresponding to the audio information;
a third receiving module 32, configured to receive text information corresponding to the audio information, which is fed back by the first server;
and a fifth sending module 33, configured to send the text information to the terminal device, where the text information is used for being output by the terminal device.
The server in this embodiment corresponds to the second server in any of the above embodiments, and corresponds to the text input method server of the application scenario shown in fig. 2.
In some embodiments, the speech recognition request includes an identification of the text input method application; and when the voice input method application in the terminal equipment is in an idle state, determining based on the audio information acquired from the terminal equipment, wherein the terminal equipment is determined according to the mapping relation between the identification of the voice input method application and the identification of the terminal equipment.
In some embodiments, the speech recognition request includes an identification of the text input method application; the text information is that the voice input method application in the terminal equipment is in an idle state, when the voice input method application in the terminal equipment is switched from a busy state to the idle state, the text information is determined based on the audio information collected from the terminal equipment, and the terminal equipment is determined according to the mapping relation between the identification of the voice input method application and the identification of the terminal equipment.
In some embodiments, the speech recognition request includes an identification of the text input method application; and if the prestored binding relation comprises a binding relation corresponding to the identification of the text input method application, acquiring the audio information from the terminal equipment, wherein the binding relation corresponding to the identification of the text input method application is used for representing that the service support provided by the voice input method application is accessed by the text input method application.
Fig. 9 is a schematic diagram according to an eighth embodiment of the present application, as shown in fig. 9, on the basis of the seventh embodiment described above, the server further includes:
a sixth sending module 34, configured to send a binding request to the first server, where the binding request is used to request access to a service provided by the voice input application;
a fourth receiving module 35, configured to receive a first prompting message sent by the first server, where the first prompting message is used to prompt that the binding is successful, and is used to prompt that the binding request is stopped;
and a stopping module 36, configured to stop sending a binding request to the first server according to the first prompting message.
In some embodiments, the third receiving module 32 is configured to receive a second hint message sent by a second server, where the second hint message is used to hint that the binding fails;
The fifth sending module 33 is configured to send a binding request to the first server again based on the second hint message.
In some embodiments, the sixth sending module 34 is configured to send a rebinding request to the first server if the binding relationship corresponding to the identifier of the text input method application is disconnected, where the rebinding request is used to request rebinding the binding relationship corresponding to the identifier of the text input method application.
In some embodiments, the binding corresponding to the identification of the text input method application is: and monitoring the binding relation corresponding to the identification of the text input method application, and reconstructing if the binding relation corresponding to the identification of the text input method application is in a disconnected state.
In some embodiments, the third receiving module 32 is configured to receive a third alert message sent by the first server, where the third alert message is used to alert that the voice fails, and the third alert message is: according to a mapping relation between a pre-stored text input method application identifier and a terminal device identifier, determining a terminal device corresponding to the text input method application identifier, and generating a voice input method application in the terminal device in a busy state;
The fifth sending module 33 is configured to send the third prompting message to the terminal device, where the third prompting message is used for being output by the terminal device.
In some embodiments, the fifth sending module 33 is configured to determine the terminal device according to a preset mapping relationship, and send the text information output to the terminal device, where the mapping relationship is used to characterize a correspondence between an identifier of the text input method application and an identifier of the terminal device.
As can be seen in conjunction with fig. 9, in some embodiments, the server further comprises:
a fifth receiving module 37, configured to receive a registration request sent by the terminal device, where the registration request carries an identifier of the text input method application and an identifier of the terminal device, and the registration request is used to request to establish a mapping relationship between the text input method application and the terminal device;
the establishing module 38 is configured to establish the mapping relationship according to the registration request.
Fig. 10 is a schematic diagram according to a ninth embodiment of the present application, as shown in fig. 10, a terminal device includes:
a sixth receiving module 41, configured to receive audio information input by a user;
A seventh transmitting module 42, configured to transmit the audio information to a first server and a second server, where the first server is configured to provide a voice input method application service, the second server is configured to provide a text input method application service, the audio information is used to generate a voice recognition request and text information, and the audio information is transmitted to the first server after the first server receives the voice recognition request;
a seventh receiving module 43, configured to receive the text information sent by the second server;
and an output module 44 for outputting the text information.
In some embodiments, the seventh receiving module 43 is configured to receive the text information sent by the second server based on a preset mapping relationship, where the mapping relationship is used to characterize a correspondence between an identifier of the text input method application and an identifier of the terminal device.
In some embodiments, the seventh sending module 42 is configured to send a registration request to the second server, where the registration request carries an identifier of the text input method application and an identifier of the terminal device, and the registration request is used to request to establish a mapping relationship between the text input method application and the terminal device.
In some embodiments, the seventh receiving module 43 is configured to receive a third alert message sent by the second server, where the third alert message is used to alert that the speech recognition fails, and the third alert message is: according to a mapping relation between a pre-stored text input method application identifier and a terminal device identifier, determining a terminal device corresponding to the text input method application identifier, and generating a voice input method application in the terminal device in a busy state;
the output module 44 is configured to output the third prompting message.
According to another aspect of the embodiments of the present application, there is further provided a data processing system based on an input method, the system including:
the server for providing a voice input method application service according to any one of the above embodiments, as described in the fifth embodiment or the sixth embodiment;
the server for providing a voice input method application service according to any one of the above embodiments, as described in the seventh embodiment or the eighth embodiment;
the terminal device according to the ninth embodiment.
According to embodiments of the present application, an electronic device and a readable storage medium are also provided.
According to an embodiment of the present application, there is also provided a computer program product comprising: a computer program stored in a readable storage medium, from which at least one processor of an electronic device can read, the at least one processor executing the computer program causing the electronic device to perform the solution provided by any one of the embodiments described above.
Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. The electronic device may also represent various forms of mobile devices, such as personal digital processing, cellular telephones, smartphones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are meant to be exemplary only, and are not meant to limit implementations of the application described and/or claimed herein.
Fig. 11 is a schematic view according to a tenth embodiment of the present application, and as shown in fig. 11, an electronic device includes:
as shown in fig. 11, the electronic device includes: one or more processors 101, memory 102, and interfaces for connecting the components, including high-speed interfaces and low-speed interfaces. The various components are interconnected using different buses and may be mounted on a common motherboard or in other manners as desired. The processor may process instructions executing within the electronic device, including instructions stored in or on memory to display graphical information of the GUI on an external input/output device, such as a display device coupled to the interface. In other embodiments, multiple processors and/or multiple buses may be used, if desired, along with multiple memories. Also, multiple electronic devices may be connected, each providing a portion of the necessary operations (e.g., as a server array, a set of blade servers, or a multiprocessor system). In fig. 11, a processor 101 is taken as an example.
Memory 102 is a non-transitory computer-readable storage medium provided herein. The memory stores instructions executable by the at least one processor to cause the at least one processor to perform the input method-based data processing method provided herein. The non-transitory computer readable storage medium of the present application stores computer instructions for causing a computer to execute the input method-based data processing method provided by the present application.
The memory 102 is used as a non-transitory computer readable storage medium for storing non-transitory software programs, non-transitory computer executable programs, and modules, such as program instructions/modules corresponding to the input method-based data processing method in the embodiments of the present application. The processor 101 executes various functional applications of the server and data processing, i.e., implements the input method-based data processing method in the above-described method embodiments, by running non-transitory software programs, instructions, and modules stored in the memory 102.
The memory 102 may include a storage program area that may store an operating system, at least one application program required for functions, and a storage data area; the storage data area may store data created according to the use of the electronic device of the input method-based data processing method, and the like. In addition, the memory 102 may include high-speed random access memory, and may also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid-state storage device. In some embodiments, memory 102 may optionally include memory located remotely from processor 101, which may be connected to the electronic device of the input method based data processing method via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The electronic device of the data processing method based on the input method may further include: an input device 103 and an output device 104. The processor 101, memory 102, input device 103, and output device 104 may be connected by a bus or otherwise, for example in fig. 11.
The input device 103 may receive input numeric or character information and generate key signal inputs related to user settings and function control of the electronic device based on the data processing method of the input method, such as a touch screen, a keypad, a mouse, a track pad, a touch pad, a pointer stick, one or more mouse buttons, a track ball, a joystick, etc. The output device 104 may include a display apparatus, auxiliary lighting devices (e.g., LEDs), haptic feedback devices (e.g., vibration motors), and the like. The display device may include, but is not limited to, a Liquid Crystal Display (LCD), a Light Emitting Diode (LED) display, and a plasma display. In some implementations, the display device may be a touch screen.
Various implementations of the systems and techniques described here can be realized in digital electronic circuitry, integrated circuitry, application specific ASIC (application specific integrated circuit), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include: implemented in one or more computer programs, the one or more computer programs may be executed and/or interpreted on a programmable system including at least one programmable processor, which may be a special purpose or general-purpose programmable processor, that may receive data and instructions from, and transmit data and instructions to, a storage system, at least one input device, and at least one output device.
These computer programs (also known as programs, software applications, or code) include machine instructions for a programmable processor, and can be implemented in a high-level procedural and/or object-oriented programming language, and/or in assembly/machine language. As used herein, the terms "machine-readable medium" and "computer-readable medium" refer to any computer program product, apparatus, and/or device (e.g., magnetic discs, optical disks, memory, programmable Logic Devices (PLDs)) used to provide machine instructions and/or data to a programmable processor, including a machine-readable medium that receives machine instructions as a machine-readable signal. The term "machine-readable signal" refers to any signal used to provide machine instructions and/or data to a programmable processor.
To provide for interaction with a user, the systems and techniques described here can be implemented on a computer having: a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to a user; and a keyboard and pointing device (e.g., a mouse or trackball) by which a user can provide input to the computer. Other kinds of devices may also be used to provide for interaction with a user; for example, feedback provided to the user may be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user may be received in any form, including acoustic input, speech input, or tactile input.
The systems and techniques described here can be implemented in a computing system that includes a background component (e.g., as a data server), or that includes a middleware component (e.g., an application server), or that includes a front-end component (e.g., a user computer having a graphical user interface or a web browser through which a user can interact with an implementation of the systems and techniques described here), or any combination of such background, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include: local Area Networks (LANs), blockchain-service networks (BSNs), wide Area Networks (WANs), and the internet.
The computer system may include a client and a server. The client and server are typically remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other. The server can be a cloud server, also called a cloud computing server or a cloud host, and is a host product in a cloud computing service system, so that the defects of high management difficulty and weak service expansibility in the traditional physical host and virtual special server (VPS, virtual Private Server) service are overcome.
It should be appreciated that various forms of the flows shown above may be used to reorder, add, or delete steps. For example, the steps described in the present application may be performed in parallel, sequentially, or in a different order, provided that the desired results of the technical solutions disclosed in the present application can be achieved, and are not limited herein.
The above embodiments do not limit the scope of the application. It will be apparent to those skilled in the art that various modifications, combinations, sub-combinations and alternatives are possible, depending on design requirements and other factors. Any modifications, equivalent substitutions and improvements made within the spirit and principles of the present application are intended to be included within the scope of the present application.

Claims (54)

1. An input method-based data processing method is applied to a first server, wherein the first server is used for providing a voice input method application service, and the method comprises the following steps:
receiving a voice recognition request sent by a second server, wherein the second server is used for providing a text input method application service, the text input method application cannot support voice analysis, and the voice recognition request is generated when a terminal device initiates audio information;
According to the voice recognition request, acquiring audio information corresponding to the voice recognition request from the terminal equipment;
generating text information corresponding to the audio information, and sending the text information to the second server, wherein the text information is used for being output by the terminal equipment.
2. The method of claim 1, wherein the speech recognition request includes an identification of the text input method application; according to the voice recognition request, collecting audio information corresponding to the voice recognition request from the terminal equipment comprises the following steps:
determining terminal equipment corresponding to the identification of the text input method application according to a mapping relation between the identification of the pre-stored text input method application and the identification of the terminal equipment;
and if the voice input method application in the terminal equipment is in an idle state, acquiring the audio information from the terminal equipment.
3. The method of claim 1, wherein the speech recognition request includes an identification of the text input method application; according to the voice recognition request, collecting audio information corresponding to the voice recognition request from the terminal equipment comprises the following steps:
Determining terminal equipment corresponding to the identification of the text input method application according to a mapping relation between the identification of the pre-stored text input method application and the identification of the terminal equipment;
if the voice input method application in the terminal equipment is in a busy state, when the voice input method application in the terminal equipment is switched from the busy state to an idle state, the audio information is collected from the terminal equipment.
4. A method according to any one of claims 1 to 3, wherein the speech recognition request includes an identification of the text input method application; according to the voice recognition request, collecting audio information corresponding to the voice recognition request from the terminal equipment comprises the following steps:
and if the prestored binding relation comprises a binding relation corresponding to the identification of the text input method application, acquiring audio information corresponding to the voice recognition request from the terminal equipment, wherein the binding relation corresponding to the identification of the text input method application is used for representing that the service support provided by the voice input method application is accessed by the text input method application.
5. The method of claim 4, prior to receiving the voice recognition request sent by the second server, the method further comprising:
Receiving a binding request initiated by the second server, and executing a binding operation corresponding to the binding request, wherein the binding request is used for requesting to access a service provided by the voice input method application;
if the binding is successful, a binding relation corresponding to the identification of the text input method application is generated, and a first prompt message is sent to the second server, wherein the first prompt message is used for prompting that the binding is successful and prompting that the second server stops initiating the binding request.
6. The method of claim 5, after receiving the second server initiated binding request, the method further comprising:
if the binding fails, generating and sending a second prompt message to the second server, wherein the second prompt message is used for prompting the binding failure;
and receiving a binding request which is sent again by the second server based on the second prompt message, and returning to the step of executing the binding operation corresponding to the binding request.
7. The method of claim 5, after sending the first hint message to the second server, the method further comprising:
and receiving a binding request sent by the second server after the binding relation corresponding to the identifier of the text input method application is disconnected, and returning to the step of executing the binding operation corresponding to the binding request.
8. The method of claim 5, after generating a binding relationship corresponding to an identity of the text input method application, the method further comprising:
monitoring binding relations corresponding to the identifications of the text input method application;
if the binding relation corresponding to the identification of the text input method application is in a disconnected state, reconstructing the binding relation corresponding to the identification of the text input method application.
9. A method according to any one of claims 1 to 3, the speech recognition request including an identification of the text input method application; after receiving the voice recognition request sent by the second server, the method further comprises:
determining terminal equipment corresponding to the identification of the text input method application according to a mapping relation between the identification of the pre-stored text input method application and the identification of the terminal equipment;
if the voice input method application in the terminal equipment is in a busy state, generating and sending a third prompt message to the second server, wherein the third prompt message is used for prompting voice failure, and the third prompt message is used for being output by the terminal equipment.
10. A method according to any one of claims 1 to 3, wherein the text information is output by the terminal device based on a preset mapping relationship, the mapping relationship being used to characterize a correspondence between an identity of the text input method application and an identity of the terminal device.
11. The data processing method based on the input method is applied to a second server, wherein the second server is used for providing text input method application service, and the text input method application cannot support the analysis of voice, and the data processing method comprises the following steps:
if the audio information sent by the terminal equipment is received, a voice recognition request is sent to a first server, wherein the voice recognition request is used for requesting to acquire text information corresponding to the audio information, and the first server is used for providing voice input method application service;
receiving text information corresponding to the audio information and fed back by the first server;
and sending the text information to the terminal equipment, wherein the text information is used for being output by the terminal equipment.
12. The method of claim 11, wherein the speech recognition request includes an identification of the text input method application; and when the voice input method application in the terminal equipment is in an idle state, determining based on the audio information acquired from the terminal equipment, wherein the terminal equipment is determined according to the mapping relation between the identification of the voice input method application and the identification of the terminal equipment.
13. The method of claim 11, wherein the speech recognition request includes an identification of the text input method application; the text information is that the voice input method application in the terminal equipment is in an idle state, when the voice input method application in the terminal equipment is switched from a busy state to the idle state, the text information is determined based on the audio information collected from the terminal equipment, and the terminal equipment is determined according to the mapping relation between the identification of the voice input method application and the identification of the terminal equipment.
14. The method of any of claims 11 to 13, wherein the speech recognition request includes an identification of the text input method application; and if the prestored binding relation comprises a binding relation corresponding to the identification of the text input method application, acquiring the audio information from the terminal equipment, wherein the binding relation corresponding to the identification of the text input method application is used for representing that the service support provided by the voice input method application is accessed by the text input method application.
15. The method of claim 14, prior to sending the speech recognition request to the first server, the method further comprising:
Sending a binding request to the first server, wherein the binding request is used for requesting to access a service provided by the voice input method application;
receiving a first prompt message sent by the first server, wherein the first prompt message is used for prompting that the binding is successful and prompting that the binding request is stopped;
and stopping sending the binding request to the first server according to the first prompt message.
16. The method of claim 15, after sending a binding request to the first server, the method further comprising:
receiving a second prompt message sent by a second server, wherein the second prompt message is used for prompting binding failure;
and sending a binding request to the first server again based on the second prompt message.
17. The method of claim 15, after receiving the first hint message sent by the first server, the method further comprising:
and if the binding relation corresponding to the identification of the text input method application is disconnected, sending a rebinding request to the first server, wherein the rebinding request is used for requesting rebinding of the binding relation corresponding to the identification of the text input method application.
18. The method of claim 15, wherein the binding corresponding to the identification of the text input method application is: and monitoring the binding relation corresponding to the identification of the text input method application, and reconstructing if the binding relation corresponding to the identification of the text input method application is in a disconnected state.
19. The method of any of claims 11 to 13, the speech recognition request including an identification of the text input method application; after sending the voice recognition request to the first server, the method further comprises:
receiving a third prompting message sent by the first server, wherein the third prompting message is used for prompting voice failure, and the third prompting message is: according to a mapping relation between a pre-stored text input method application identifier and a terminal device identifier, determining a terminal device corresponding to the text input method application identifier, and generating a voice input method application in the terminal device in a busy state;
and sending the third prompt message to the terminal equipment, wherein the third prompt message is used for being output by the terminal equipment.
20. The method according to any of claims 11 to 13, wherein sending the text information to the terminal device comprises:
And determining the terminal equipment according to a preset mapping relation, and sending the text information to the terminal equipment for output, wherein the mapping relation is used for representing the corresponding relation between the identification of the text input method application and the identification of the terminal equipment.
21. The method of claim 20, prior to sending the text information to the terminal device, the method further comprising:
receiving a registration request sent by the terminal equipment, wherein the registration request carries an identifier of the text input method application and an identifier of the terminal equipment, and the registration request is used for requesting to establish a mapping relationship between the text input method application and the terminal equipment;
and establishing the mapping relation according to the registration request.
22. The data processing method based on the input method is applied to terminal equipment and comprises the following steps:
receiving audio information input by a user, and respectively sending the audio information to a first server and a second server, wherein the first server is used for providing a voice input method application service, the second server is used for providing a text input method application service, the text input method application cannot support voice analysis, the audio information is used for generating a voice recognition request and text information, and the audio information is sent to the first server after the first server receives the voice recognition request; the voice recognition request is generated by the second server when the terminal equipment initiates audio information, and the text information is obtained by analyzing the audio information by the first server;
And receiving the text information sent by the second server and outputting the text information.
23. The method of claim 22, wherein receiving the text information sent by the second server comprises: and receiving the text information sent by the second server based on a preset mapping relation, wherein the mapping relation is used for representing the correspondence relation between the identification of the text input method application and the identification of the terminal equipment.
24. The method of claim 23, prior to receiving the text information sent by the second server, the method further comprising:
and sending a registration request to the second server, wherein the registration request carries the identification of the text input method application and the identification of the terminal equipment, and the registration request is used for requesting to establish the mapping relationship between the text input method application and the terminal equipment.
25. The method of any of claims 22 to 24, after sending the audio information to a first server, the method further comprising:
receiving and outputting a third prompt message sent by the second server, wherein the third prompt message is used for prompting that the voice recognition fails, and the third prompt message is: and determining the terminal equipment corresponding to the identification of the text input method application according to the mapping relation between the identification of the pre-stored text input method application and the identification of the terminal equipment, wherein the voice input method application in the terminal equipment is generated when the voice input method application is in a busy state.
26. A server for providing a voice input method application service, the server comprising:
the first receiving module is used for receiving a voice recognition request sent by a second server, wherein the second server is used for providing a text input method application service, the text input method application cannot support voice analysis, and the voice recognition request is generated when a terminal device initiates audio information;
the acquisition module is used for acquiring audio information corresponding to the voice recognition request from the terminal equipment according to the voice recognition request;
the first generation module is used for generating text information corresponding to the audio information;
and the first sending module is used for sending the text information to the second server, wherein the text information is used for being output by the terminal equipment.
27. The server of claim 26, wherein the speech recognition request includes an identification of the text input method application; the acquisition module is used for determining the terminal equipment corresponding to the identification of the text input method application according to the mapping relation between the identification of the pre-stored text input method application and the identification of the terminal equipment, and acquiring the audio information from the terminal equipment if the voice input method application in the terminal equipment is in an idle state.
28. The server of claim 26, wherein the speech recognition request includes an identification of the text input method application; the acquisition module is used for determining terminal equipment corresponding to the identification of the text input method application according to a mapping relation between the identification of the pre-stored text input method application and the identification of the terminal equipment, and acquiring the audio information from the terminal equipment when the voice input method application in the terminal equipment is switched from a busy state to an idle state if the voice input method application in the terminal equipment is in the busy state.
29. The server according to any of claims 26-28, wherein the speech recognition request comprises an identification of the text input method application; the acquisition module is used for acquiring the audio information corresponding to the voice recognition request from the terminal equipment if the prestored binding relation comprises the binding relation corresponding to the identification of the text input method application, wherein the binding relation corresponding to the identification of the text input method application is used for representing that the service support provided by the voice input method application is accessed by the text input method application.
30. The server of claim 29, the server further comprising:
the second receiving module is used for receiving the binding request initiated by the second server;
a binding module, configured to execute a binding operation corresponding to the binding request, where the binding request is used to request access to a service provided by the voice input method application;
the second generation module is used for generating a binding relation corresponding to the identification of the text input method application if the binding is successful;
the second sending module is used for sending a first prompt message to the second server, wherein the first prompt message is used for prompting that the binding is successful and prompting that the second server stops initiating the binding request.
31. The server of claim 30, wherein the second generating module is configured to generate a second hint message if the binding fails, wherein the second hint message is configured to hint that the binding fails;
the second sending module is used for sending the second prompt message to the second server;
the second receiving module is configured to receive a binding request sent again by the second server based on the second hint message, and return to the step of executing, by the binding module, a binding operation corresponding to the binding request.
32. The server according to claim 30, wherein the second receiving module is configured to receive a binding request sent by the second server after the binding relationship corresponding to the identifier of the text input method application is broken, and return to the step of executing, by the binding module, a binding operation corresponding to the binding request.
33. The server of claim 30, the server further comprising:
the monitoring module is used for monitoring the binding relation corresponding to the identification of the text input method application;
and the binding module is used for reconstructing the binding relationship corresponding to the identifier of the text input method application if the binding relationship corresponding to the identifier of the text input method application is in a disconnected state.
34. A server according to any one of claims 26 to 29, the speech recognition request including an identification of the text input method application; the server further includes:
the determining module is used for determining the terminal equipment corresponding to the identification of the text input method application according to the mapping relation between the identification of the pre-stored text input method application and the identification of the terminal equipment;
a third generating module, configured to generate a third prompting message if the voice input method application in the terminal device is in a busy state, where the third prompting message is used to prompt a voice failure, and the third prompting message is used to be output by the terminal device;
And the third sending module is used for sending the third prompt message to the second server.
35. The server according to any one of claims 26 to 29, wherein the text information is output by the terminal device based on a preset mapping relationship, the mapping relationship being used to characterize a correspondence between an identity of the text input method application and an identity of the terminal device.
36. A server for providing a text input method application service, and the text input method application cannot support parsing of speech, comprising:
a fourth sending module, configured to send a voice recognition request to a first server if audio information sent by a terminal device is received, where the voice recognition request is used to request to obtain text information corresponding to the audio information, and the first server is used to provide a voice input method application service;
the third receiving module is used for receiving text information corresponding to the audio information and fed back by the first server;
and a fifth sending module, configured to send the text information to the terminal device, where the text information is used for being output by the terminal device.
37. The server of claim 36, wherein the speech recognition request includes an identification of the text input method application; and when the voice input method application in the terminal equipment is in an idle state, determining based on the audio information acquired from the terminal equipment, wherein the terminal equipment is determined according to the mapping relation between the identification of the voice input method application and the identification of the terminal equipment.
38. The server of claim 36, wherein the speech recognition request includes an identification of the text input method application; the text information is that the voice input method application in the terminal equipment is in an idle state, when the voice input method application in the terminal equipment is switched from a busy state to the idle state, the text information is determined based on the audio information collected from the terminal equipment, and the terminal equipment is determined according to the mapping relation between the identification of the voice input method application and the identification of the terminal equipment.
39. The server according to any of claims 36-38, wherein the speech recognition request comprises an identification of the text input method application; and if the prestored binding relation comprises a binding relation corresponding to the identification of the text input method application, acquiring the audio information from the terminal equipment, wherein the binding relation corresponding to the identification of the text input method application is used for representing that the service support provided by the voice input method application is accessed by the text input method application.
40. The server of claim 39, the server further comprising:
A sixth sending module, configured to send a binding request to the first server, where the binding request is used to request access to a service provided by the voice input method application;
the fourth receiving module is used for receiving a first prompt message sent by the first server, wherein the first prompt message is used for prompting that the binding is successful and prompting that the binding request is stopped;
and the stopping module is used for stopping sending the binding request to the first server according to the first prompt message.
41. The server of claim 40, wherein the third receiving module is configured to receive a second hint message sent by a second server, where the second hint message is configured to hint that binding failed;
and the fifth sending module is used for sending a binding request to the first server again based on the second prompt message.
42. The server according to claim 40, wherein the sixth sending module is configured to send a rebinding request to the first server if the binding relationship corresponding to the identifier of the text input method application is disconnected, where the rebinding request is used to request rebinding of the binding relationship corresponding to the identifier of the text input method application.
43. The server of claim 40, wherein the binding relationship corresponding to the identity of the text input method application is: and monitoring the binding relation corresponding to the identification of the text input method application, and reconstructing if the binding relation corresponding to the identification of the text input method application is in a disconnected state.
44. The server according to any one of claims 36-38, wherein the third receiving module is configured to receive a third alert message sent by the first server, where the third alert message is configured to alert of a voice failure, and the third alert message is: according to a mapping relation between a pre-stored text input method application identifier and a terminal device identifier, determining a terminal device corresponding to the text input method application identifier, and generating a voice input method application in the terminal device in a busy state;
the fifth sending module is configured to send the third prompting message to the terminal device, where the third prompting message is used for being output by the terminal device.
45. The server according to any one of claims 36 to 38, wherein the fifth sending module is configured to determine the terminal device according to a preset mapping relationship, and send the text information output to the terminal device, where the mapping relationship is used to characterize a correspondence between an identifier of the text input method application and an identifier of the terminal device.
46. The server according to any one of claims 36 to 38, the server further comprising:
a fifth receiving module, configured to receive a registration request sent by the terminal device, where the registration request carries an identifier of the text input method application and an identifier of the terminal device, and the registration request is used to request to establish a mapping relationship between the text input method application and the terminal device;
and the establishing module is used for establishing the mapping relation according to the registration request.
47. A terminal device, comprising:
the sixth receiving module is used for receiving the audio information input by the user;
a seventh sending module, configured to send the audio information to a first server and a second server, where the first server is configured to provide a voice input method application service, the second server is configured to provide a text input method application service, the text input method application cannot support voice parsing, the audio information is used to generate a voice recognition request and text information, and the audio information is sent to the first server after the first server receives the voice recognition request; the voice recognition request is generated by the second server when the terminal equipment initiates audio information, and the text information is obtained by analyzing the audio information by the first server;
A seventh receiving module, configured to receive the text information sent by the second server;
and the output module is used for outputting the text information.
48. A terminal device according to claim 47, wherein the seventh receiving module is configured to receive the text information sent by the second server based on a preset mapping relationship, where the mapping relationship is used to characterize a correspondence between an identifier of the text input method application and an identifier of the terminal device.
49. A terminal device as defined in claim 48, wherein the seventh sending module is configured to send a registration request to the second server, where the registration request carries an identifier of the text input method application and an identifier of the terminal device, and the registration request is used to request establishment of a mapping relationship between the text input method application and the terminal device.
50. The terminal device of any of claims 47-49, wherein the seventh receiving module is configured to receive a third alert message sent by the second server, wherein the third alert message is configured to alert that speech recognition failed, and the third alert message is: according to a mapping relation between a pre-stored text input method application identifier and a terminal device identifier, determining a terminal device corresponding to the text input method application identifier, and generating a voice input method application in the terminal device in a busy state;
The output module is used for outputting the third prompt message.
51. A data processing system based on an input method, the system comprising:
a server as claimed in any one of claims 26 to 35;
a server as claimed in any one of claims 36 to 46;
a terminal device as claimed in any of claims 47 to 50.
52. An electronic device, comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of any one of claims 1-10; or,
to enable the at least one processor to perform the method of any one of claims 11-21; or,
to enable the at least one processor to perform the method of any one of claims 22-25.
53. A non-transitory computer readable storage medium storing computer instructions for causing the computer to perform the method of any one of claims 1-10; or,
The computer instructions for causing the computer to perform the method of any one of claims 11-21; or,
the computer instructions for causing the computer to perform the method of any one of claims 22-25.
54. A computer program product comprising a computer program which, when executed by a processor, implements the method according to any one of claims 1-10; alternatively, the computer program, when executed by a processor, implements the method according to any of claims 11-21; alternatively, the computer program, when executed by a processor, implements the method according to any of claims 22-25.
CN202011287122.5A 2020-11-17 2020-11-17 Data processing method, system, server and terminal equipment based on input method Active CN112489659B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011287122.5A CN112489659B (en) 2020-11-17 2020-11-17 Data processing method, system, server and terminal equipment based on input method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011287122.5A CN112489659B (en) 2020-11-17 2020-11-17 Data processing method, system, server and terminal equipment based on input method

Publications (2)

Publication Number Publication Date
CN112489659A CN112489659A (en) 2021-03-12
CN112489659B true CN112489659B (en) 2024-02-13

Family

ID=74930991

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011287122.5A Active CN112489659B (en) 2020-11-17 2020-11-17 Data processing method, system, server and terminal equipment based on input method

Country Status (1)

Country Link
CN (1) CN112489659B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116416994A (en) * 2021-12-30 2023-07-11 华为技术有限公司 Voice cooperative input method, electronic equipment and computer readable storage medium

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001142487A (en) * 1999-11-11 2001-05-25 Sony Corp Voice data input system
KR20060083338A (en) * 2005-01-14 2006-07-20 에스케이 텔레콤주식회사 Method and system for providing messenger service
CN104348828A (en) * 2013-08-05 2015-02-11 三星电子株式会社 Interfacing device and method for supporting speech dialogue service
CN105162836A (en) * 2015-07-29 2015-12-16 百度在线网络技术(北京)有限公司 Method for executing speech communication, server and intelligent terminal equipment
CN105988581A (en) * 2015-06-16 2016-10-05 乐卡汽车智能科技(北京)有限公司 Voice input method and apparatus
KR101959292B1 (en) * 2017-12-08 2019-03-18 주식회사 머니브레인 Method and computer device for providing improved speech recognition based on context, and computer readable recording medium
JP2020022104A (en) * 2018-08-02 2020-02-06 アズビル株式会社 Voice server, voice recognition control system and voice recognition control method
CN111326158A (en) * 2020-01-23 2020-06-23 深圳市安顺康医疗电子有限公司 Voice control method based on intelligent terminal
CN111404998A (en) * 2020-02-27 2020-07-10 北京三快在线科技有限公司 Voice interaction method, first electronic device and readable storage medium

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102412523B1 (en) * 2017-07-18 2022-06-24 삼성전자주식회사 Method for operating speech recognition service, electronic device and server supporting the same

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001142487A (en) * 1999-11-11 2001-05-25 Sony Corp Voice data input system
KR20060083338A (en) * 2005-01-14 2006-07-20 에스케이 텔레콤주식회사 Method and system for providing messenger service
CN104348828A (en) * 2013-08-05 2015-02-11 三星电子株式会社 Interfacing device and method for supporting speech dialogue service
CN105988581A (en) * 2015-06-16 2016-10-05 乐卡汽车智能科技(北京)有限公司 Voice input method and apparatus
CN105162836A (en) * 2015-07-29 2015-12-16 百度在线网络技术(北京)有限公司 Method for executing speech communication, server and intelligent terminal equipment
KR101959292B1 (en) * 2017-12-08 2019-03-18 주식회사 머니브레인 Method and computer device for providing improved speech recognition based on context, and computer readable recording medium
JP2020022104A (en) * 2018-08-02 2020-02-06 アズビル株式会社 Voice server, voice recognition control system and voice recognition control method
CN111326158A (en) * 2020-01-23 2020-06-23 深圳市安顺康医疗电子有限公司 Voice control method based on intelligent terminal
CN111404998A (en) * 2020-02-27 2020-07-10 北京三快在线科技有限公司 Voice interaction method, first electronic device and readable storage medium

Also Published As

Publication number Publication date
CN112489659A (en) 2021-03-12

Similar Documents

Publication Publication Date Title
CN108737242B (en) System for providing dialog content
CN110647305B (en) Voice interaction method, device, equipment and medium of application program
CN110505245B (en) Application login method, device and equipment
CN111666546B (en) Application login method and device
CN111934840A (en) Communication method of client and server, gateway, electronic equipment and storage medium
CN111586128B (en) Method, device, equipment and storage medium for acquiring applet data
CN111414208B (en) Application program starting method, device and equipment
CN112671778B (en) Account management method, device, equipment and storage medium for electronic equipment
CN112269607A (en) Applet control method, system, server and terminal device
EP3816802A2 (en) Method and device for processing mini program data
US11990130B2 (en) Method, apparatus, device and computer storage medium for processing voices
CN111274596B (en) Device interaction method, authority management method, interaction device and user side
CN112489659B (en) Data processing method, system, server and terminal equipment based on input method
CN110933227A (en) Assistance method, device, equipment and medium for intelligent terminal
CN112565225B (en) Method and device for data transmission, electronic equipment and readable storage medium
CN111881005B (en) Information display method, device, equipment and storage medium
CN111615171B (en) Access method and device of wireless local area network
CN111782445B (en) Configuration method and device of equipment debugging environment
CN112153754B (en) Point-to-point connection method and device, electronic equipment and storage medium
CN111177558B (en) Channel service construction method and device
CN111783643B (en) Face recognition method and device, electronic equipment and storage medium
CN111522599B (en) Method and device for transmitting information
CN111988829B (en) Access method and device of wireless local area network
CN110609671B (en) Sound signal enhancement method, device, electronic equipment and storage medium
CN113778546A (en) Method and device for gray scale optimization

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20211019

Address after: 100176 101, floor 1, building 1, yard 7, Ruihe West 2nd Road, Beijing Economic and Technological Development Zone, Daxing District, Beijing

Applicant after: Apollo Zhilian (Beijing) Technology Co.,Ltd.

Address before: 2 / F, baidu building, 10 Shangdi 10th Street, Haidian District, Beijing 100085

Applicant before: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY Co.,Ltd.

TA01 Transfer of patent application right
GR01 Patent grant
GR01 Patent grant