CN108173802B

CN108173802B - Communication processing method, device and terminal

Info

Publication number: CN108173802B
Application number: CN201611116941.7A
Authority: CN
Inventors: 涂畅; 张扬; 王砚峰
Original assignee: Beijing Sogou Technology Development Co Ltd
Current assignee: Beijing Sogou Technology Development Co Ltd
Priority date: 2016-12-07
Filing date: 2016-12-07
Publication date: 2022-06-07
Anticipated expiration: 2036-12-07
Also published as: CN108173802A

Abstract

The embodiment of the invention provides a communication processing method, a device and a terminal, wherein the method comprises the following steps: monitoring a target communication application, and determining a target communication mode corresponding to the target communication application; when input information is detected, converting the input information according to the target communication mode to generate corresponding target communication information; and sending the target communication information to opposite-end equipment so that the opposite-end equipment outputs according to the target communication information. Based on the embodiment of the invention, two parties using the terminal can communicate by adopting different media, thereby solving the problem of inconvenient communication when the two parties need to adopt the same communication mode.

Description

Communication processing method, device and terminal

Technical Field

The present invention relates to the field of communications technologies, and in particular, to a communication processing method, a communication processing apparatus, and a terminal.

Background

With the development of communication technology, terminals such as mobile phones are more and more popular, and great convenience is brought to life, study and work of people.

As a specific application of the terminal, a user can usually use the terminal to contact other users. For example, the user can call the other party and communicate by voice, or send a short message to the other party and communicate by text.

At present, the two parties to a communication can only use the same medium, such as voice or text. For example, when user A wants to communicate with user B and user B cannot conveniently answer a call, user A, as the calling party, can only fall back to text communication, for example by sending user B a short message through an instant messaging application. Specifically, if calling party A calls party B and party B rejects the call because answering is inconvenient, the terminal of party A cannot establish a communication connection with the terminal of party B, so that party A and party B cannot communicate with each other.

Disclosure of Invention

The technical problem to be solved by the embodiments of the present invention is to provide a communication processing method, so that users of two communication parties can communicate according to media needed by the users, and the problem of inconvenience caused by the fact that the two communication parties need to adopt the same communication mode at present is solved.

Correspondingly, the embodiment of the invention also provides a communication processing device and a terminal, which are used for ensuring the realization and the application of the method.

In order to solve the above problem, the present invention discloses a communication processing method, which includes:

monitoring a target communication application, and determining a target communication mode corresponding to the target communication application;

when input information is detected, converting the input information according to the target communication mode to generate corresponding target communication information;

and sending the target communication information to opposite-end equipment so that the opposite-end equipment outputs according to the target communication information.

Optionally, the converting the input information according to the target communication manner to generate corresponding target communication information includes: judging whether the input information conforms to an output format corresponding to the target communication mode; when the input information does not conform to the output format, converting the input information based on the output format to generate corresponding target communication information; and when the input information conforms to the output format, taking the input information as the target communication information.

Optionally, the method further comprises: receiving communication information sent by the opposite terminal equipment; and outputting the communication information according to the output format corresponding to the local communication mode.

Optionally, the communication mode includes a text communication mode, a video communication mode or a voice communication mode; wherein, the output format corresponding to the text communication mode is a text display format; the output format corresponding to the voice communication mode is a voice playing format; and the output format corresponding to the video communication mode is a video playing format.

Optionally, the converting the input information according to the target communication manner to generate corresponding target communication information includes: when the target communication mode is a voice communication mode, determining whether the current input information conforms to a voice playing format; if not, converting the input information to generate voice information according with the user voiceprint characteristics, and taking the voice information as the target communication information.

Optionally, the method further comprises: collecting voice information input by a user; carrying out voice processing on the collected voice information and determining the voiceprint characteristics of the user;

wherein, the converting the input information to generate the voice information conforming to the voiceprint characteristics of the user comprises: and converting the input text information based on the voiceprint characteristics to generate the voice information which accords with the voiceprint characteristics of the user.

Optionally, the method further comprises: collecting voice information input by a user; carrying out voice recognition on the collected voice information, and determining a text corresponding to each audio in the voice information; storing the mapping relation between each audio and the corresponding characters to generate a voice database corresponding to the user;

wherein, the converting the input information to generate the voice information conforming to the voiceprint characteristics of the user comprises: and converting the input text information according to the mapping relation stored in the voice database to generate voice information according with the voiceprint characteristics of the user.

Optionally, the method further comprises: in the input process, searching each candidate item matched with the currently input text information from the voice database; displaying characters in each candidate item; selecting a target candidate item according to the operation instruction; and adding the audio in the target candidate item to the target communication information.
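The candidate lookup described above can be sketched as follows. This is only an illustration under stated assumptions: the `VoiceEntry` record, the prefix-matching rule, and the function names are hypothetical and are not specified by the text, which says only that candidates "matched with" the current input are searched.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VoiceEntry:
    text: str     # characters recognized for the clip
    audio: bytes  # the user's recorded audio for those characters


def find_candidates(voice_db, typed_text):
    """Return entries whose text starts with what has been typed so far.

    Prefix matching is an assumption made for this sketch.
    """
    if not typed_text:
        return []
    return [entry for entry in voice_db if entry.text.startswith(typed_text)]


def add_selected_audio(message_audio, candidates, selected_index):
    """Append the audio of the selected candidate to the outgoing message."""
    return message_audio + [candidates[selected_index].audio]


# Hypothetical voice database built from the user's earlier recordings.
voice_db = [
    VoiceEntry("hello", b"<hello-clip>"),
    VoiceEntry("help me", b"<help-me-clip>"),
    VoiceEntry("goodbye", b"<goodbye-clip>"),
]

candidates = find_candidates(voice_db, "hel")
shown = [c.text for c in candidates]              # characters the terminal would display
outgoing = add_selected_audio([], candidates, 1)  # the user picks the second candidate
```

Here the operation instruction is modeled as a simple list index; a real terminal would receive it from a touch or key event.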

Optionally, the converting the input information according to the target communication manner to generate corresponding target communication information includes: when the target communication mode is a text communication mode, determining whether the current input information conforms to a text display format; if not, converting the input information into text information conforming to the text display format, and taking the converted text information as the target communication information.

Optionally, when the target communication mode is a text communication mode, the method further includes: in the input process, searching each candidate item matched with the currently input voice information from a preset voice database; displaying characters in each candidate item, or playing audio in each candidate item; selecting a target candidate item according to the operation instruction; and adding the characters in the target candidate item to the target communication information.

Optionally, the converting the input information according to the target communication manner to generate corresponding target communication information includes: when the target communication mode is a video communication mode, determining whether the current input information conforms to a video playing format; if not, converting the input information to generate audio information conforming to the voiceprint characteristics of the user; and synthesizing video information conforming to a video playing format by adopting the audio information and the image information of the user, and taking the video information as the target communication information.

The invention also discloses a communication processing device, comprising:

the communication mode determining module is used for monitoring the target communication application and determining a target communication mode corresponding to the target communication application;

the communication information generation module is used for converting the input information according to the target communication mode when the input information is detected to generate corresponding target communication information;

and the communication information sending module is used for sending the target communication information to opposite-end equipment so as to enable the opposite-end equipment to output according to the target communication information.

Optionally, the communication information generating module may be specifically configured to determine whether the input information conforms to an output format corresponding to the target communication mode; when the input information does not conform to the output format, converting the input information based on the output format to generate corresponding target communication information; and when the input information conforms to the output format, taking the input information as the target communication information.

Optionally, the communication processing apparatus further includes: the communication information receiving module and the communication information output module. The communication information receiving module is used for receiving communication information sent by the opposite terminal equipment; and the communication information output module is used for outputting the communication information according to the output format corresponding to the local communication mode.

Optionally, the communication method includes: any one of a text communication mode, a video communication mode or a voice communication mode; wherein, the output format corresponding to the text communication mode is a text display format; the output format corresponding to the voice communication mode is a voice playing format; and the output format corresponding to the video communication mode is a video playing format.

Optionally, the communication information generating module includes:

and the voice determining submodule is used for determining whether the current input information conforms to a voice playing format or not when the target communication mode is a voice communication mode.

And the voice conversion submodule is used for converting the input information to generate voice information which accords with the voiceprint characteristics of the user and using the voice information as the target communication information if the input information does not accord with the voice playing format.

Optionally, the apparatus further comprises a voice information collection module and a voiceprint characteristic determination module. The voice information collection module is used for collecting voice information input by a user. The voiceprint characteristic determination module is used for carrying out voice processing on the collected voice information and determining the voiceprint characteristics of the user. The voice conversion sub-module is specifically configured to convert the input text information based on the voiceprint characteristics and generate voice information that conforms to the voiceprint characteristics of the user.

Optionally, the apparatus further comprises a voice recognition module and a voice database generation module. The voice recognition module is used for carrying out voice recognition on the collected voice information and determining a text corresponding to each audio in the voice information. The voice database generation module is used for storing the mapping relation between each audio and the corresponding characters to generate a voice database corresponding to the user. The voice conversion sub-module is also used for converting the input text information according to the mapping relation stored in the voice database to generate voice information conforming to the voiceprint characteristics of the user.

Optionally, the apparatus further comprises the following modules:

and the text searching module is used for searching candidate items matched with the currently input text information from the voice database in the input process.

And the character display module is used for displaying characters in the candidate items.

And the target candidate item selecting module is used for selecting the target candidate item according to the operation instruction.

And the audio adding module is used for adding the audio in the target candidate item to the target communication information.

Optionally, the communication information generating module includes:

and the text determining submodule is used for determining whether the current input information conforms to a text display format or not when the target communication mode is a text communication mode.

And the text conversion sub-module is used for converting the input information into text information conforming to the text display format if the input information does not conform to the text display format, and taking the converted text information as the target communication information.

Optionally, when the target communication mode is a text communication mode, the apparatus may further include the following modules:

and the voice searching module is used for searching each candidate item matched with the currently input voice information from a preset voice database in the input process.

And the audio playing module is used for playing the audio in each candidate item.

And the text adding module is used for adding the characters in the target candidate items to the target communication information.

Optionally, the communication information generating module includes the following sub-modules:

the video determining submodule is used for determining whether the current input information conforms to a video playing format or not when the target communication mode is a video communication mode;

and the video conversion sub-module is used for converting the input information to generate audio information conforming to the voiceprint characteristics of the user if the input information does not conform to the video playing format, synthesizing the video information conforming to the video playing format by adopting the audio information and the image information of the user, and taking the video information as the target communication information.

Compared with the prior art, the embodiment of the invention has the following advantages:

in the embodiment of the invention, the terminal converts the input information of the user according to the target communication mode corresponding to the target communication application, so that the users of both communication parties can communicate according to the media required by the users, namely, the two parties using the terminal can communicate by adopting different media, and the problem of inconvenient communication caused by the fact that the two communication parties need to adopt the same communication mode at present is solved.

Drawings

FIG. 1 is a flowchart illustrating steps of a communication processing method according to an embodiment of the present invention;

FIG. 2 is a flow chart illustrating steps in another embodiment of a communication processing method of the present invention;

FIG. 3 is a block diagram of a communication processing device according to an embodiment of the present invention;

FIG. 4 is a block diagram illustrating a structure of a terminal for communication processing according to an exemplary embodiment.

Detailed Description

In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in further detail below.

When a user wants to use the terminal to communicate with other users, the communication application installed on the terminal is started to send communication information to the terminal used by other users on the communication application, so that the opposite side can receive the communication information through the terminal, and the communication with the opposite side is realized.

It should be noted that, when a user communicates through a terminal, the embodiments of the present invention refer to the terminal currently used by the user as the local terminal, to the communication application currently used by the user to contact other users as the target communication application, and to the communication mode corresponding to the medium currently used by the user as the local communication mode. The terminal used by the other user being contacted is referred to as the opposite-end device, and the communication mode used by the opposite-end device is referred to as the target communication mode.

One of the core concepts of the embodiments of the present invention is that two parties using terminals can communicate with each other through different media. Specifically, the terminal can convert information input by the user into target communication information according to a target communication mode corresponding to the target communication application, and send the target communication information to the opposite-end device, so that the opposite-end device can output the target communication information. Therefore, the input information of the user is converted, so that the users of both communication parties can communicate according to media required by the users, and the communication of the users is facilitated.

Referring to fig. 1, a flowchart illustrating steps of an embodiment of a communication processing method according to the present invention is shown, which may specifically include the following steps:

Step 102, monitoring a target communication application, and determining a target communication mode corresponding to the target communication application.

The terminal can monitor the target communication application by acquiring the authority provided by the target communication application, so that when a user uses the target communication application for communication, a local communication mode selected by the user can be determined, a communication mode used by opposite-end equipment can be determined based on connection with the opposite-end equipment, and the communication mode used by the opposite-end equipment can be used as the target communication mode. For example, when the user selects to use voice for communication, the voice communication mode may be used as a local communication mode, and if other users using the peer device select to use text for communication, the text communication mode may be used as a target communication mode; or, if other users using the peer device choose to use the video for communication, the video communication mode may be used as the target communication mode. Of course, when the user selects to use the text for communication, the terminal may use the text communication method as the local communication method, and if other users using the peer device also select to use the text for communication, the terminal may also use the text communication method as the target communication method.
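As a minimal sketch of this step, the following assumes that the local selection is read from the monitored application and that the peer's selection is reported over the connection. The `Mode` enum, the `SessionModes` record, and `determine_modes` are hypothetical names introduced for illustration only.

```python
from dataclasses import dataclass
from enum import Enum


class Mode(Enum):
    TEXT = "text"
    VOICE = "voice"
    VIDEO = "video"


@dataclass(frozen=True)
class SessionModes:
    local_mode: Mode   # mode selected by the local user
    target_mode: Mode  # mode used by the opposite-end device


def determine_modes(local_selection, peer_selection):
    """Map the two selections onto the local and target communication modes.

    local_selection: what the monitored application reports for the local user.
    peer_selection: what the opposite-end device reports over the connection.
    Both are plain strings in this sketch.
    """
    return SessionModes(Mode(local_selection), Mode(peer_selection))


# The local user speaks, the other party prefers text.
modes = determine_modes("voice", "text")
cross_media = modes.local_mode != modes.target_mode

# Both parties chose text, so no cross-media conversion is implied.
same = determine_modes("text", "text")
```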

Step 104, when the input information is detected, converting the input information according to the target communication mode to generate corresponding target communication information.

In an optional embodiment of the present invention, the target communication manner may be used to control a format of the communication information output by the peer device, for example, to specifically control an output format of the target communication information. Specifically, when the user inputs information, the terminal may detect the input information and may determine whether the input information conforms to the output format corresponding to the target communication method. If the input information does not conform to the output format corresponding to the target communication mode, the input information can be converted based on the output format corresponding to the target communication mode to generate corresponding target communication information. If the input information conforms to the output format corresponding to the target communication mode, the input information can be used as the target communication information.

In another optional embodiment of the present invention, the target communication mode may be used to control a format of locally transmitting the target communication information, for example, may be used to control a transmission format of the target communication information. Converting the input information according to the target communication mode to generate corresponding target communication information, which may specifically include: when the user inputs information, the terminal can detect the input information and judge whether the input information conforms to the sending format corresponding to the target communication mode. If the input information does not conform to the sending format corresponding to the target communication mode, the input information can be converted based on the sending format corresponding to the target communication mode to generate corresponding target communication information. If the input information conforms to the sending format corresponding to the target communication mode, the input information can be used as the target communication information.
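The check-then-convert logic of step 104 can be sketched as below. Everything here is a stand-in chosen for illustration: formats are detected from the Python type of the input, and the two converters are trivial stubs where a real terminal would call speech recognition and speech synthesis.

```python
def detect_format(info):
    """Hypothetical format detection: str counts as text, bytes as voice."""
    if isinstance(info, str):
        return "text"
    if isinstance(info, (bytes, bytearray)):
        return "voice"
    raise TypeError(f"unsupported input type: {type(info).__name__}")


def speech_to_text_stub(audio):
    # Placeholder for speech recognition; here the bytes are simply decoded.
    return bytes(audio).decode("utf-8")


def text_to_speech_stub(text):
    # Placeholder for speech synthesis; here the text is simply encoded.
    return text.encode("utf-8")


# Converters keyed by (current format, required format).
CONVERTERS = {
    ("voice", "text"): speech_to_text_stub,
    ("text", "voice"): text_to_speech_stub,
}


def generate_target_info(info, required_format):
    """Pass the input through if it already conforms, otherwise convert it."""
    current = detect_format(info)
    if current == required_format:
        return info
    return CONVERTERS[(current, required_format)](info)
```

The same function serves both optional embodiments: `required_format` can be the output format or the sending format associated with the target communication mode.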

Step 106, sending the target communication information to opposite-end equipment so that the opposite-end equipment outputs the target communication information.

The terminal can send the generated target communication information to the opposite terminal equipment through the network. The opposite-end device may specifically include a terminal used by the opposite side, that is, a terminal used by a user that the user needs to contact, such as a mobile phone, a tablet computer, a personal computer, and the like. Therefore, the opposite terminal equipment can receive the target communication information and output the target communication information, so that the opposite terminal can acquire the information content required to be communicated by the user, and the communication purpose is achieved.

As a specific application of the present invention, when user A wants to call user B but user B cannot conveniently answer, the two users need not select the same medium; for example, they need not both fall back to short messages. Instead, a cross-media communication mode can be adopted: user B communicates with user A by text, while user A still communicates with user B by voice.

Specifically, when user A communicates with user B, that is, when terminal 1 used by user A communicates with terminal 2 used by user B, terminal 1 can monitor the target communication application currently used by user A and take the voice communication mode selected by user A as the local communication mode. Terminal 2 used by user B serves as the opposite-end device, and the text communication mode selected by user B serves as the target communication mode. Therefore, when user A inputs voice information, terminal 1 can convert the voice information into corresponding text information according to the text communication mode and send the text information to the opposite-end device as the target communication information. The opposite-end device can then display the converted text information to user B; that is, the opposite-end device outputs according to the target communication information.

Likewise, terminal 2 used by user B can monitor the target communication application currently used by user B and take the text communication mode selected by user B as the local communication mode. Terminal 1 used by user A then serves as the opposite-end device, and the voice communication mode selected by user A serves as the target communication mode. When user B inputs text information, terminal 2 can convert the text information into corresponding voice information according to the voice communication mode and send the voice information to the opposite-end device as the target communication information. The opposite-end device can then play the converted voice information to user A; that is, the opposite-end device outputs according to the target communication information.

Therefore, during communication between users A and B, the voice information input by user A can be converted into text information for user B to read, and the text information input by user B can be converted into voice information for user A to hear. This is convenient for both users, each of whom communicates in the mode he or she prefers.
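The exchange between user A and user B can be simulated with a small sketch. The `Terminal` class and `convert_stub` are hypothetical; conversion is modeled as a no-op on the message content, standing in for real speech recognition and synthesis, so only the routing of formats is illustrated.

```python
def convert_stub(content, source_format, target_format):
    # Stand-in for speech recognition or synthesis: the words are kept as-is.
    return content


class Terminal:
    def __init__(self, user, local_mode):
        self.user = user
        self.local_mode = local_mode  # "voice" or "text" in this sketch
        self.peer = None
        self.outputs = []             # what this terminal has shown or played

    def send(self, input_format, content):
        """Convert the input to the peer's mode if needed, then send it."""
        target_mode = self.peer.local_mode
        if input_format != target_mode:
            content = convert_stub(content, input_format, target_mode)
        self.peer.receive(target_mode, content)

    def receive(self, fmt, content):
        """Output according to the received target communication information."""
        action = "display" if fmt == "text" else "play"
        self.outputs.append((action, content))


terminal_1 = Terminal("A", "voice")  # user A speaks and listens
terminal_2 = Terminal("B", "text")   # user B reads and types
terminal_1.peer, terminal_2.peer = terminal_2, terminal_1

terminal_1.send("voice", "Are you free to talk?")  # A speaks
terminal_2.send("text", "In a meeting, text me.")  # B types
```

After these two calls, terminal 2 has one entry to display and terminal 1 has one entry to play, matching the roles described for users B and A.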

Referring to fig. 2, a flowchart illustrating steps of another embodiment of a communication processing method of the present invention is shown, which may specifically include the following steps:

Step 202, monitoring the target communication application and determining a target communication mode corresponding to the target communication application.

In a specific implementation, the target communication mode may be used to control a sending format of the target communication information and control an output format of the target communication information. Therefore, the target communication mode may have a corresponding transmission format and an output format, the transmission format may be specifically used for limiting the transmission format of the target communication information, and the output format may be specifically used for limiting the output format of the target communication information.

In an alternative embodiment of the present invention, the communication mode may include, but is not limited to, any one of a text communication mode, a video communication mode or a voice communication mode. The output format corresponding to the text communication mode is a text display format, and the sending format corresponding to the text communication mode may be a text format, a voice format, or a video format; preferably, the sending format corresponding to the text communication mode is a text format. The output format corresponding to the voice communication mode is a voice playing format, and the sending format corresponding to the voice communication mode may be a text format, a voice format, or a video format; preferably, the sending format corresponding to the voice communication mode is a voice format. The output format corresponding to the video communication mode is a video playing format, and the sending format corresponding to the video communication mode may be a text format, a voice format, or a video format; preferably, the sending format corresponding to the video communication mode is a video format.
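The correspondence between communication modes and formats can be written as a small lookup table. The dictionary and function names below are illustrative only; the format labels follow the wording of the paragraph above.

```python
# Output format for each communication mode.
OUTPUT_FORMAT = {
    "text": "text display format",
    "voice": "voice playing format",
    "video": "video playing format",
}

# Every mode may be sent in any of these formats.
ALLOWED_SEND_FORMATS = ("text format", "voice format", "video format")

# Preferred sending format for each communication mode.
PREFERRED_SEND_FORMAT = {
    "text": "text format",
    "voice": "voice format",
    "video": "video format",
}


def formats_for(mode):
    """Return (output format, preferred sending format) for a mode."""
    return OUTPUT_FORMAT[mode], PREFERRED_SEND_FORMAT[mode]


def is_valid_send_format(send_format):
    """Check a sending format against the allowed set."""
    return send_format in ALLOWED_SEND_FORMATS
```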

Step 204, when the input information is detected, converting the input information according to the target communication mode to generate corresponding target communication information.

In an optional embodiment of the present invention, the converting the input information according to the target communication manner to generate corresponding target communication information may specifically include: judging whether the input information conforms to an output format corresponding to the target communication mode; when the input information does not conform to the output format, converting the input information based on the output format to generate corresponding target communication information; and when the input information conforms to the output format corresponding to the target communication mode, taking the input information as the target communication information.

Specifically, when the user inputs information, the terminal may determine whether the input information needs to be converted by determining whether the input information conforms to the output format corresponding to the target communication application. If the input information conforms to the output format corresponding to the target communication application, the input information can be used directly as the target communication information without conversion. If the input information does not conform to that output format, the input information needs to be converted based on the output format, so that target communication information conforming to the output format corresponding to the target communication application can be generated and sent.

Step 206, sending the target communication information to an opposite terminal device, so that the opposite terminal device outputs according to the target communication information.

In the embodiment of the present invention, if the target communication mode is a voice communication mode, when a user inputs text information, for example, when inputting a text in a dialog box of a communication application, the terminal may determine that the text information input by the user does not conform to a voice playing format, convert the text information, generate voice information conforming to a voiceprint feature of the user, and may send the voice information to an opposite terminal device as the target communication information.

In an optional embodiment of the present invention, the converting the input information according to the target communication manner to generate corresponding target communication information may specifically include: when the target communication mode is a voice communication mode, determining whether the current input information conforms to a voice playing format; if not, converting the input information to generate voice information according with the voiceprint characteristics of the user, and taking the voice information as the target communication information; and if so, taking the input information as the target communication information. For example, when a user inputs text information, the terminal may obtain the text information currently input by the user; converting the text information to generate voice information which accords with the voiceprint characteristics of the user; and taking the voice information as the target communication information. Specifically, the terminal can acquire the text information currently input by the user, and can refer to the voiceprint feature of the user when converting the text information into the voice information, and convert the acquired text information into corresponding voice information, namely, generate the voice information conforming to the voiceprint feature of the user, so that the voice information can be used as target communication information to be sent to the opposite terminal device, that is, the voice of the user can be simulated to send the voice information to the opposite terminal device, and the user experience is improved.

Optionally, the communication processing method may further include: collecting voice information input by a user; and carrying out voice processing on the collected voice information, and determining the voiceprint characteristics of the user. Specifically, the terminal can collect voice information historically input by the user, and then perform voiceprint analysis on the collected voice information to determine the voiceprint characteristics of the user, so that in the subsequent voice conversion process, the user input information can be converted based on the voiceprint characteristics of the user, for example, text information input by the user is converted based on the voiceprint characteristics of the user to generate voice information conforming to the voiceprint characteristics of the user, and then the user using the opposite-end device can listen to the voice information conforming to the voiceprint characteristics of the user.

In an optional embodiment of the present invention, the communication processing method may further include: carrying out voice recognition on the collected voice information, and determining a text corresponding to each audio in the voice information; and storing the mapping relation between each audio and the corresponding characters to generate a voice database corresponding to the user. Wherein, the converting the input information to generate the voice information conforming to the voiceprint characteristics of the user comprises: and converting the input text information according to the mapping relation stored in the voice database to generate voice information according with the voiceprint characteristics of the user. For example, the terminal may perform word segmentation on text information input by the user to obtain words after word segmentation; and the audio corresponding to each word in the text information can be extracted from the voice database corresponding to the user based on the mapping relation stored in the voice database, and then the audio information conforming to the voiceprint feature of the user can be generated based on the extracted audio, namely, the text information input by the user is converted into the voice information according to the voiceprint feature of the user.
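As a rough sketch of the mapping-based conversion described above, the voice database can be modeled as a dictionary from words to audio clips. The `VoiceDatabase` class, the whitespace-based `segment` function, and the use of raw `bytes` for audio are assumptions made for illustration; a real system would use a proper word segmenter (in particular for Chinese text) and real audio stitching.

```python
from typing import Dict, List


class VoiceDatabase:
    """Per-user mapping between recognized words and the user's own audio."""

    def __init__(self) -> None:
        self._audio_by_word: Dict[str, bytes] = {}

    def store(self, word: str, audio: bytes) -> None:
        # Save the mapping relation between a word and its audio.
        self._audio_by_word[word] = audio

    def lookup(self, word: str) -> bytes:
        # Raises KeyError for words the user has never spoken.
        return self._audio_by_word[word]


def segment(text: str) -> List[str]:
    # Assumption: whitespace splitting stands in for real word segmentation.
    return text.split()


def text_to_voice(text: str, db: VoiceDatabase) -> bytes:
    # Extract the audio for each segmented word and join the clips in order.
    return b"".join(db.lookup(word) for word in segment(text))
```

Because every clip comes from the user's own recordings, the joined audio carries the user's voiceprint without a separate synthesis step. How words missing from the database are handled is left open here.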

Preferably, the terminal may also store the voiceprint features of the user in a voice database corresponding to the user, so that in the voice conversion process, the voiceprint features and/or the mapping relationship stored in the voice database of the user may be used to convert the text information into corresponding voice information.

In an optional embodiment of the present application, when the target communication mode is a voice communication mode, if the user inputs video information, the terminal may extract the audio data in the video information as the target communication information; for example, audio data collected by an audio collecting device such as a microphone may be used as the target communication information.

Of course, if the target communication mode is a text communication mode, when the user inputs voice information, for example, when the user speaks, the terminal may obtain the voice information through an audio acquisition device such as a microphone, determine that the voice information does not conform to the text display format, convert the voice information to generate corresponding text information, and send the text information to the opposite-end device as the target communication information.

In another optional embodiment of the present invention, the converting the input information according to the target communication manner to generate corresponding target communication information may specifically include: when the target communication mode is a text communication mode, determining whether the current input information conforms to a text display format; if not, converting the input information into text information conforming to a text display format by identifying the input information, and taking the converted text information as the target communication information; and if so, taking the input information as target communication information. For example, when a user inputs voice information, the terminal may acquire the currently input voice information; converting the voice information into text information conforming to a text format by identifying the voice information; and taking the converted text information as the target communication information. For example, in the process of inputting voice information by a user, a terminal may acquire the currently input voice information by the user, and may convert the voice information into corresponding text information by performing voice recognition on the acquired voice information, so as to send the text information to an opposite-end device as target communication information, so that the opposite-end device may directly display the text information corresponding to the voice information input by the user.

Optionally, when the target communication mode is a text communication mode and the local communication mode of the terminal is a video communication mode, the terminal may receive video information input by a user, where the video information includes image data and audio data; the video information can be determined not to conform to the text display format, and the audio data in the video information can be converted to generate corresponding text information, so that the text information can be used as target communication information and sent to opposite-end equipment.
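The text-mode conversion, including the case where the input is video information, might be sketched as below. The `Media` type and the caller-supplied `recognize` function (standing in for a speech recognizer) are hypothetical names used only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass(frozen=True)
class Media:
    kind: str                       # "text", "voice", or "video" (assumed labels)
    text: Optional[str] = None      # set for text input
    audio: Optional[bytes] = None   # set for voice input and for video input
    frames: Optional[bytes] = None  # image data, set for video input only


def to_text_mode(item: Media, recognize: Callable[[bytes], str]) -> Media:
    """Return target communication information for a text-mode peer."""
    if item.kind == "text":
        # Already conforms to the text display format.
        return item
    if item.kind in ("voice", "video"):
        # For video, only the audio data is recognized; image data is dropped.
        if item.audio is None:
            raise ValueError("no audio data to recognize")
        return Media("text", text=recognize(item.audio))
    raise ValueError(f"unsupported input kind: {item.kind}")
```

Passing the recognizer in as a parameter keeps the sketch independent of any particular speech recognition library.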

In an optional embodiment of the present invention, the converting the input information according to the target communication manner to generate corresponding target communication information includes: when the target communication mode is a video communication mode, determining whether the current input information conforms to a video playing format; if not, converting the input information to generate audio information conforming to the voiceprint characteristics of the user, synthesizing video information conforming to the video playing format by using the audio information and image information of the user, and taking the video information as the target communication information; and if so, taking the input information as the target communication information. For example, when the information input by the user is text information, the terminal may convert the text information to generate audio information conforming to the voiceprint characteristics of the user, acquire image information of the user, such as image data corresponding to a preset default image, synthesize corresponding video information from the converted audio information and the image information, and send the synthesized video information to the opposite-end device as the target communication information. Optionally, when the information input by the user is voice information, the terminal may determine that the voice information does not conform to the video playing format, obtain image information of the user, synthesize corresponding video information from the image information and the input voice information, and send the synthesized video information to the opposite-end device as the target communication information.
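The three branches of the video-mode conversion can be illustrated with the following sketch. The `Video` type, the `tts` callable, and the idea of pairing audio with a single user image are simplifying assumptions; real video synthesis would involve encoding and muxing that are outside the scope of this illustration.

```python
from dataclasses import dataclass
from typing import Callable, Union


@dataclass(frozen=True)
class Video:
    audio: bytes  # audio track
    image: bytes  # image information shown while the audio plays


def to_video_mode(
    kind: str,
    payload: Union[str, bytes, Video],
    user_image: bytes,
    tts: Callable[[str], bytes],
) -> Video:
    """Return target communication information for a video-mode peer."""
    if kind == "video" and isinstance(payload, Video):
        # Already conforms to the video playing format.
        return payload
    if kind == "voice" and isinstance(payload, bytes):
        # Combine the input voice with the user's image information.
        return Video(audio=payload, image=user_image)
    if kind == "text" and isinstance(payload, str):
        # First generate audio from the text, then combine it with the image.
        return Video(audio=tts(payload), image=user_image)
    raise ValueError(f"unsupported input: kind={kind!r}")
```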

In another optional embodiment of the invention, the method may further comprise: the method comprises the steps of collecting video information input by a user in advance, performing word segmentation on text data in the video information, and determining audio corresponding to each word after word segmentation and/or determining a video clip corresponding to each word after word segmentation; constructing an incidence relation between each word and the corresponding audio, and storing the incidence relation in an audio database; and/or constructing an association relation between each word and the corresponding video segment, and storing the association relation into a video database, so that in the subsequent video conversion process, the input text information and/or voice information can be converted into corresponding video information based on the video database. For example, when the user inputs text information, the terminal may extract a video clip corresponding to each word in the text information from the video database, synthesize a target video using the extracted video clips, and send the target video as target communication information to the peer device, that is, generate video information corresponding to the input text information based on the extracted video clip. Optionally, when the user inputs the voice information, the terminal may convert the voice information into corresponding text information, extract a video segment corresponding to each word in the text information from the video database, and then synthesize a target video corresponding to the voice information by using each extracted video segment.

In another optional embodiment of the present invention, when the terminal collects the video information of the user, the terminal may further divide the video according to the video time point corresponding to each word, so as to determine the video segment corresponding to each word; and voice recognition can be carried out on the audio data in the video information, and the audio corresponding to each word is determined, so that the direct association relation between the video clip and the audio can be constructed for each word and stored in the video database of the user. Therefore, when the input voice information is converted into the video information, the terminal can convert each audio in the voice information into the corresponding video clip based on the direct association relationship between the video clip and the audio stored in the video database of the user, so that the target video can be synthesized by adopting each video clip obtained by conversion and is sent to the opposite terminal device as the target communication information.
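A minimal sketch of building and using such a video database is given below. It assumes that the collected video is available as a sequence of frames and that each word comes with start and end frame indices (the "video time points"); the names `build_video_database` and `text_to_video` are hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

Frame = str                       # stand-in for real frame data
TimedWord = Tuple[str, int, int]  # (word, start frame inclusive, end frame exclusive)


def build_video_database(
    frames: Sequence[Frame], timed_words: Sequence[TimedWord]
) -> Dict[str, List[Frame]]:
    """Divide the video at each word's time points and index clips by word."""
    database: Dict[str, List[Frame]] = {}
    for word, start, end in timed_words:
        # A later occurrence of the same word overwrites the earlier clip.
        database[word] = list(frames[start:end])
    return database


def text_to_video(words: Sequence[str], database: Dict[str, List[Frame]]) -> List[Frame]:
    """Synthesize a target video by joining the clip stored for each word."""
    target: List[Frame] = []
    for word in words:
        target.extend(database[word])  # KeyError if no clip exists for the word
    return target
```

Voice input could reuse the same lookup after speech recognition has turned the audio into words, as the preceding paragraphs describe.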

Of course, the terminal may also receive the communication information sent by the opposite-end device, and may output the received communication information according to a local communication manner, so that the user may obtain the information content input by the opposite side. Therefore, in an optional embodiment of the present invention, the communication processing method may further include the steps of:

Step 208, receiving the communication information sent by the opposite-end device.

Step 210, outputting the communication information according to an output format corresponding to the local communication mode.
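Steps 208 and 210 amount to dispatching received information to whichever output path matches the local communication mode. A small sketch, assuming a hypothetical `renderers` table that maps each mode to a display or playback function:

```python
from typing import Callable, Dict

Renderer = Callable[[object], None]


def output_received(
    local_mode: str, communication_info: object, renderers: Dict[str, Renderer]
) -> None:
    """Output received information in the format of the local communication mode."""
    renderer = renderers.get(local_mode)
    if renderer is None:
        raise ValueError(f"no output path for local mode: {local_mode}")
    # For example: show text on screen, or play audio through the speaker.
    renderer(communication_info)
```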

As a specific application of the embodiment of the invention, when one of two communicating users finds it inconvenient to speak, for example, when that party cannot conveniently answer a call, the embodiment of the invention can provide different communication modes for the two parties, that is, the two parties can communicate with each other using different communication modes. Specifically, one party can communicate using text, and the other party can communicate using voice. In the communication process, the information of the party sending text is converted into voice, and the information of the party sending voice is converted into text, so that the two parties can communicate in different modes. When converting text into voice, the terminal can refer to the user's own historical voice database, generate voice conforming to the voiceprint characteristics of the user, and send the voice to the other party. While the user is inputting text, the terminal can convert the input text into voice in real time and send the voice to the opposite-end device, so that the converted voice is played by the opposite-end device.

In combination with the above example, the terminal may store the voice of the user B during the daily conversations of the user B, recognize the pronunciation of each character or each word of the user B, and then store the voice in the database as the personalized data of the user B, that is, generate the personalized voice database of the user B. In the communication process, when the user A sends information to the user B, the user A can communicate directly through voice. The terminal 1 can convert the voice information input by the user A into text information in real time and send the text information to the user B for viewing; for example, the text information is displayed to the user B in a form similar to a short message. After the user B sees the text information sent by the user A through the terminal 2, the user B can reply to the user A with text information. The terminal 2 can convert the text information replied by the user B into voice information in real time and send the voice information to the terminal 1, so that the converted voice information is played directly to the user A through the terminal 1. When the text information is converted into voice information, voice information matching the voiceprint characteristics of the user B can be generated by referring to the personalized voice database of the user B, so that the voice information sent to the user A simulates the voice of the user B. After hearing the voice information of the user B, the user A can continue to communicate with the user B by voice. These steps are repeated until the whole communication process is finished.
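One round of the exchange between the user A (voice) and the user B (text) can be modeled with the toy sketch below. Speech recognition and voice synthesis are replaced by trivial placeholders, and the `Voice` type and function names are hypothetical; the point is only to show which terminal performs which conversion.

```python
from typing import NamedTuple, Tuple


class Voice(NamedTuple):
    speaker: str  # whose voiceprint the audio carries
    words: str    # stand-in for the spoken content


def terminal_1_send(spoken: Voice) -> str:
    # Terminal 1 converts the voice of user A into text for user B.
    # Placeholder for real-time speech recognition.
    return spoken.words


def terminal_2_send(typed: str) -> Voice:
    # Terminal 2 converts the text of user B into voice for user A.
    # Placeholder for synthesis from B's personalized voice database.
    return Voice(speaker="B", words=typed)


def exchange(a_says: str, b_types: str) -> Tuple[str, Voice]:
    shown_to_b = terminal_1_send(Voice("A", a_says))  # displayed on terminal 2
    played_to_a = terminal_2_send(b_types)            # played on terminal 1
    return shown_to_b, played_to_a
```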

In a specific implementation, when the user inputs information slowly, the terminal in the embodiment of the present invention may, according to the historical habits of the user, predict the information to be input based on the video database and/or the voice database corresponding to the user, and recommend the predicted information to the user as candidates, so as to improve the input efficiency of the user. For example, when the local communication mode is a voice communication mode, the terminal may recommend a predicted candidate to the user by playing the audio in the candidate. When the local communication mode is a text communication mode, the terminal may recommend a predicted candidate to the user by displaying the characters in the candidate.

In an optional embodiment of the present invention, when the target communication mode is a voice communication mode, the communication processing method may further include: in the input process, searching each candidate item matched with the currently input text information from the voice database and/or the video database; displaying characters in each candidate item; selecting a target candidate item according to the operation instruction; and adding the audio in the target candidate item to the target communication information. For example, in the process of inputting a text by a user, when the text input by the user is slow, the terminal may search in the personalized speech database of the user according to a currently input character of the user, so as to obtain a candidate item matched with the currently input character of the user from the personalized speech database, where the candidate item includes the character and an audio corresponding to the character, and further recommend the candidate item to the user by playing the audio in the candidate item to the user and/or displaying the character in the candidate item to the user, so that the user may input the character of the selected candidate item into a dialog box of a communication application by a selection operation to add the audio in the selected candidate item to the converted speech information, that is, the audio of the selected target candidate item is added to the target communication information and sent to an opposite terminal device, so as to play the voice information input by the user to the opposite side through the opposite terminal equipment.
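The candidate search for the voice communication mode could look roughly like the sketch below. It assumes the personalized voice database is a dictionary from phrases to audio and that matching means prefix matching; both are illustrative assumptions, and a real system might rank candidates by the user's historical usage rather than alphabetically.

```python
from typing import Dict, List, Tuple

Candidate = Tuple[str, bytes]  # (characters, corresponding audio)


def find_candidates(prefix: str, voice_db: Dict[str, bytes], limit: int = 3) -> List[Candidate]:
    """Find database entries whose characters start with the current input."""
    if not prefix:
        return []
    matches = [(text, audio) for text, audio in voice_db.items() if text.startswith(prefix)]
    matches.sort(key=lambda candidate: candidate[0])  # deterministic order for display
    return matches[:limit]


def append_selected(target_audio: bytes, candidate: Candidate) -> bytes:
    """Add the audio of the selected target candidate to the outgoing voice."""
    return target_audio + candidate[1]
```

The characters of each candidate are what the terminal would display, while only the audio part is appended to the target communication information.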

In another optional embodiment of the present invention, when the target communication mode is a text communication mode, the communication processing method may further include: in the input process, searching each candidate item matched with the currently input voice information from a preset voice database; displaying characters in each candidate item, or playing audio in each candidate item; selecting a target candidate item according to the operation instruction; and adding the characters in the target candidate item to the target communication information. For example, in the process of inputting voice by a user, when the voice input by the user is slow, if the voice input by the user cannot be detected within a certain waiting time, the terminal may search in the personalized voice database of the user according to the voice currently input by the user, so that a candidate item matched with the voice currently input by the user may be obtained from the personalized voice database, and further, the candidate item may be recommended to the user by playing audio in the candidate item and/or displaying characters in the candidate item to the user, so that the user may add the selected characters of the candidate item to converted text information through selection operation, that is, add the selected characters of the target candidate item to the target communication information, thereby improving the input efficiency of the user.
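For the text communication mode, the trigger ("no voice input detected within a certain waiting time") and the completion step can be sketched separately. The timestamp parameters, the `wait_seconds` threshold, and the prefix-based matching on already recognized text are assumptions made for this illustration.

```python
from typing import List, Sequence


def should_suggest(last_voice_time: float, now: float, wait_seconds: float) -> bool:
    """True once no voice input has been detected for the waiting time."""
    return now - last_voice_time >= wait_seconds


def complete(recognized: str, phrases: Sequence[str]) -> List[str]:
    """Return stored phrases that extend the text recognized so far."""
    return sorted(p for p in phrases if p.startswith(recognized) and p != recognized)


def add_selection(target_text: str, recognized: str, selected: str) -> str:
    """Add only the not-yet-spoken characters of the selected candidate."""
    return target_text + selected[len(recognized):]
```

Timestamps are passed in explicitly so that the sketch does not depend on a real clock or audio pipeline.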

It should be noted that, for simplicity of description, the method embodiments are described as a series of acts or combinations of acts, but those skilled in the art will recognize that the present invention is not limited by the described order of acts, as some steps may be performed in other orders or concurrently in accordance with the embodiments of the present invention. Further, those skilled in the art will appreciate that the embodiments described in the specification are preferred embodiments, and that the acts involved are not necessarily required by the embodiments of the invention.

Referring to fig. 3, a block diagram of a communication processing apparatus according to an embodiment of the present invention is shown, which may specifically include the following modules:

The communication mode determining module 302 is configured to monitor a target communication application and determine a target communication mode corresponding to the target communication application.

The communication information generating module 304 is configured to, when input information is detected, convert the input information according to the target communication manner to generate corresponding target communication information.

A communication information sending module 306, configured to send the target communication information to an opposite-end device, so that the opposite-end device outputs the target communication information.
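The cooperation of modules 302, 304 and 306 can be pictured as a simple pipeline. In the sketch below, the `CommunicationProcessor` class, the application-to-mode table, and the injected `convert` and `send` callables are all hypothetical; how the target communication mode is actually derived from the monitored application is not specified by this sketch.

```python
from typing import Callable, Dict


class CommunicationProcessor:
    def __init__(
        self,
        mode_by_app: Dict[str, str],
        convert: Callable[[str, str], str],
        send: Callable[[str], None],
    ) -> None:
        self._mode_by_app = mode_by_app  # assumed lookup table for module 302
        self._convert = convert          # stands in for module 304
        self._send = send                # stands in for module 306

    def determine_mode(self, app: str) -> str:
        # Module 302: target communication mode of the monitored application.
        return self._mode_by_app[app]

    def process(self, app: str, input_info: str) -> str:
        mode = self.determine_mode(app)
        target_info = self._convert(input_info, mode)  # module 304
        self._send(target_info)                        # module 306
        return target_info
```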

In an optional embodiment of the present invention, the communication information generating module 304 may be specifically configured to: judging whether the input information conforms to an output format corresponding to the target communication mode; when the input information does not conform to the output format, converting the input information based on the output format to generate corresponding target communication information; and when the input information conforms to the output format, taking the input information as the target communication information.

In an optional embodiment of the present invention, the communication processing apparatus may further include: the communication information receiving module and the communication information output module. The communication information receiving module can be used for receiving the communication information sent by the opposite terminal equipment; the communication information output module can be used for outputting the communication information according to the output format corresponding to the local communication mode.

Optionally, the communication manner in the embodiment of the present invention may include, but is not limited to, any one of a text communication manner, a video communication manner, or a voice communication manner; wherein, the output format corresponding to the text communication mode is a text display format; the output format corresponding to the voice communication mode is a voice playing format; and the output format corresponding to the video communication mode is a video playing format.
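The generic behavior of the communication information generating module 304, together with the mode-to-format correspondence listed above, can be sketched as a table-driven dispatcher. The `converters` registry keyed by (input kind, target mode) is an assumption introduced for illustration.

```python
from typing import Callable, Dict, Tuple

# Output format corresponding to each communication mode.
OUTPUT_FORMAT: Dict[str, str] = {
    "text": "text display format",
    "voice": "voice playing format",
    "video": "video playing format",
}

Converter = Callable[[object], object]


def generate_target_info(
    input_kind: str,
    payload: object,
    target_mode: str,
    converters: Dict[Tuple[str, str], Converter],
) -> object:
    if target_mode not in OUTPUT_FORMAT:
        raise ValueError(f"unknown communication mode: {target_mode}")
    if input_kind == target_mode:
        # Input already conforms to the output format of the target mode.
        return payload
    convert = converters.get((input_kind, target_mode))
    if convert is None:
        raise ValueError(f"no conversion from {input_kind} to {target_mode}")
    return convert(payload)
```

Keeping the conversions in a table means that supporting a further mode only requires new entries rather than changes to the dispatch logic.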

In an optional embodiment of the present invention, the communication information generating module 304 may include the following sub-modules:

In an optional embodiment of the invention, the apparatus may further comprise a voice information collection module and a voiceprint feature determination module. The voice information collection module may be configured to collect voice information input by a user. The voiceprint feature determination module may be configured to perform voice processing on the collected voice information and determine the voiceprint feature of the user. The voice conversion sub-module may be specifically configured to convert the input text information based on the voiceprint feature and generate voice information that conforms to the voiceprint feature of the user.

Optionally, the apparatus in the embodiment of the present invention may further include a voice recognition module and a voice database generation module. The voice recognition module may be configured to perform voice recognition on the collected voice information and determine the text corresponding to each audio in the voice information. The voice database generation module may be configured to store the mapping relation between each audio and the corresponding text and generate the voice database corresponding to the user. The voice conversion sub-module may also be configured to convert the input text information according to the mapping relation stored in the voice database to generate voice information conforming to the voiceprint feature of the user.

In an optional embodiment of the present invention, the apparatus may further include the following modules:

In an optional embodiment of the present invention, the communication information generating module 304 includes:

In an optional embodiment of the present invention, when the target communication method is a text communication method, the apparatus may further include the following module:

a voice searching module, configured to search, during the input process, a preset voice database for candidate items matching the currently input voice information.

Optionally, the video conversion sub-module may be specifically configured to, when the input information is text information, convert the text information to generate voice information according with the voiceprint feature of the user.

Since the device embodiment is basically similar to the method embodiment, it is described relatively briefly; for relevant details, refer to the corresponding description of the method embodiment.

Fig. 4 is a block diagram illustrating a structure of a terminal 400 for communication processing according to an exemplary embodiment. For example, the terminal 400 may be a mobile phone, a computer, a digital broadcast terminal, a messaging device, a game console, a tablet device, a medical device, a fitness device, a personal digital assistant, and the like.

Referring to fig. 4, the terminal 400 may include one or more of the following components: processing components 402, memory 404, power components 406, multimedia components 408, audio components 410, input/output (I/O) interfaces 412, sensor components 414, and communication components 416.

The processing component 402 generally controls overall operation of the terminal 400, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations. The processing component 402 may include one or more processors 420 to execute instructions to perform all or part of the steps of the methods described above. Further, the processing component 402 can include one or more modules that facilitate interaction between the processing component 402 and other components. For example, the processing component 402 can include a multimedia module to facilitate interaction between the multimedia component 408 and the processing component 402.

The memory 404 is configured to store various types of data to support operations at the terminal 400. Examples of such data include instructions for any application or method operating on the terminal 400, contact data, phonebook data, messages, pictures, videos, and so forth. The memory 404 may be implemented by any type of volatile or non-volatile memory device, or a combination thereof, such as Static Random Access Memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, or magnetic or optical disks.

The power component 406 provides power to the various components of the terminal 400. The power component 406 may include a power management system, one or more power sources, and other components associated with generating, managing, and distributing power for the terminal 400.

The multimedia component 408 comprises a screen providing an output interface between the terminal 400 and the user. In some embodiments, the screen may include a Liquid Crystal Display (LCD) and a Touch Panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive an input signal from a user. The touch panel includes one or more touch sensors to sense touch, slide, and gestures on the touch panel. The touch sensor may not only sense the boundary of a touch or slide action, but also detect the duration and pressure associated with the touch or slide operation. In some embodiments, the multimedia component 408 includes a front facing camera and/or a rear facing camera. The front camera and/or the rear camera may receive external multimedia data when the terminal 400 is in an operation mode, such as a photographing mode or a video mode. Each front camera and rear camera may be a fixed optical lens system or have a focal length and optical zoom capability.

The audio component 410 is configured to output and/or input audio signals. For example, the audio component 410 includes a Microphone (MIC) configured to receive an external audio signal when the terminal 400 is in an operating mode, such as a call mode, a recording mode, and a voice recognition mode. The received audio signals may further be stored in the memory 404 or transmitted via the communication component 416. In some embodiments, audio component 410 also includes a speaker for outputting audio signals.

The I/O interface 412 provides an interface between the processing component 402 and peripheral interface modules, which may be keyboards, click wheels, buttons, etc. These buttons may include, but are not limited to: a home button, a volume button, a start button, and a lock button.

The sensor component 414 includes one or more sensors for providing status assessments of various aspects of the terminal 400. For example, the sensor component 414 can detect an open/closed state of the terminal 400 and the relative positioning of components, such as the display and keypad of the terminal 400. The sensor component 414 can also detect a change in position of the terminal 400 or of a component of the terminal 400, the presence or absence of user contact with the terminal 400, the orientation or acceleration/deceleration of the terminal 400, and a change in temperature of the terminal 400. The sensor component 414 may include a proximity sensor configured to detect the presence of a nearby object without any physical contact. The sensor component 414 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications. In some embodiments, the sensor component 414 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.

The communication component 416 is configured to facilitate communications between the terminal 400 and other devices in a wired or wireless manner. The terminal 400 may access a wireless network based on a communication standard, such as WiFi, 2G or 3G, or a combination thereof. In an exemplary embodiment, the communication component 416 receives a broadcast signal or broadcast associated information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the communication component 416 further includes a Near Field Communication (NFC) module to facilitate short-range communications. For example, the NFC module may be implemented based on Radio Frequency Identification (RFID) technology, Infrared Data Association (IrDA) technology, Ultra Wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.

In an exemplary embodiment, the terminal 400 may be implemented by one or more Application Specific Integrated Circuits (ASICs), Digital Signal Processors (DSPs), Digital Signal Processing Devices (DSPDs), Programmable Logic Devices (PLDs), Field Programmable Gate Arrays (FPGAs), controllers, micro-controllers, microprocessors or other electronic components for performing the above-described methods.

In an exemplary embodiment, there is also provided a non-transitory computer-readable storage medium comprising instructions, such as the memory 404 comprising instructions, where the instructions are executable by the processor 420 of the terminal 400 to perform the above-described method. For example, the non-transitory computer-readable storage medium may be a ROM, a Random Access Memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, and the like.

A non-transitory computer readable storage medium having instructions therein, which when executed by a processor of a terminal, enable the terminal to perform a communication processing method, the method comprising: monitoring a target communication application, and determining a target communication mode corresponding to the target communication application; when input information is detected, converting the input information according to the target communication mode to generate corresponding target communication information; and sending the target communication information to opposite-end equipment so that the opposite-end equipment outputs according to the target communication information.

The embodiments in the present specification are described in a progressive manner; each embodiment focuses on its differences from the other embodiments, and for the same or similar parts the embodiments may be referred to one another.

As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, apparatus, or computer program product. Accordingly, embodiments of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, embodiments of the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.

Embodiments of the present invention are described with reference to flowchart illustrations and/or block diagrams of methods, terminal devices (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing terminal to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing terminal, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.

These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.

These computer program instructions may also be loaded onto a computer or other programmable data processing terminal to cause a series of operational steps to be performed on the computer or other programmable terminal to produce a computer implemented process such that the instructions which execute on the computer or other programmable terminal provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.

While preferred embodiments of the present invention have been described, additional variations and modifications of these embodiments may occur to those skilled in the art once they learn of the basic inventive concepts. Therefore, it is intended that the appended claims be interpreted as including preferred embodiments and all such alterations and modifications as fall within the scope of the embodiments of the invention.

Finally, it should also be noted that, herein, relational terms such as first and second are used solely to distinguish one entity or action from another, without necessarily requiring or implying any actual relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof are intended to cover a non-exclusive inclusion, such that a process, method, article, or terminal that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or terminal. Without further limitation, an element defined by the phrase "comprising a …" does not exclude the presence of other like elements in the process, method, article, or terminal that comprises the element.

The communication processing method, the communication processing apparatus and the terminal provided by the present invention have been described in detail above. Specific examples are used herein to explain the principle and implementation of the present invention, and the description of the above embodiments is intended only to help in understanding the method and core idea of the present invention. Meanwhile, a person skilled in the art may, following the idea of the present invention, vary the specific embodiments and the scope of application. In summary, the content of this specification should not be construed as limiting the present invention.

Claims

1. A communication processing method, comprising:

monitoring a target communication application, and determining a target communication mode corresponding to the target communication application; the target communication mode is a communication mode used by opposite-end equipment;

when input information is detected, converting the input information according to the target communication mode to generate corresponding target communication information; wherein the converting comprises:

when the target communication mode is a voice communication mode, determining that the current input information does not accord with a voice playing format; converting the input information to generate voice information which accords with the voiceprint characteristics of the user, and taking the voice information as the target communication information; the target communication information conforms to the sending format of the target communication mode; wherein the generating of the voice information conforming to the voiceprint characteristics of the user comprises: segmenting input text information to obtain segmented characters; extracting audio corresponding to each word in the text information from the corresponding voice database based on the mapping relation stored in the voice database, and further generating voice information conforming to the voiceprint characteristics of the user based on the extracted audio;

the converting the input information according to the target communication mode to generate corresponding target communication information includes: when the target communication mode is a video communication mode, determining whether the current input information conforms to a video playing format; if not, converting the input information to generate audio information conforming to the voiceprint characteristics of the user; synthesizing video information conforming to a video playing format by adopting the audio information and the image information of the user, and taking the video information as the target communication information;

sending the target communication information to opposite-end equipment so that the opposite-end equipment outputs according to the target communication information;

receiving communication information sent by the opposite terminal equipment;

outputting the communication information according to an output format corresponding to a local communication mode;

the converting the input information according to the target communication mode to generate corresponding target communication information includes:

when the target communication mode is a text communication mode, determining whether the current input information conforms to a text display format;

if not, converting the input information into text information conforming to the text display format, and taking the converted text information as the target communication information.
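The voice branch of claim 1 (segment the input text, look up the audio for each word in the user's voice database, and build the voice information from the extracted audio) can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the database is modeled as a plain dictionary from word to recorded audio bytes, segmentation is a greedy longest-match against the database keys, and joining raw bytes stands in for real audio splicing. The names `VoiceDatabase`, `segment` and `synthesize_voice` are hypothetical.

```python
from typing import Dict, List

# Hypothetical per-user voice database: word -> audio recorded in the user's voice.
VoiceDatabase = Dict[str, bytes]


def segment(text: str, vocabulary: VoiceDatabase) -> List[str]:
    """Greedy longest-match segmentation against the database keys.

    Characters that match no key are emitted as single-character pieces.
    """
    max_len = max([len(word) for word in vocabulary] + [1])
    words: List[str] = []
    i = 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if piece in vocabulary or size == 1:
                words.append(piece)
                i += size
                break
    return words


def synthesize_voice(text: str, db: VoiceDatabase) -> bytes:
    """Concatenate the stored clip of every segmented word, in order."""
    clips = []
    for word in segment(text, db):
        if word not in db:
            raise KeyError(f"no recorded audio for {word!r}")
        clips.append(db[word])
    # Assumption: byte concatenation is a placeholder for proper audio joining.
    return b"".join(clips)
```

Because every clip comes from recordings of the user, the output carries the user's voiceprint without a separate voice-cloning step. A production system would also need a fallback for words that have no recording.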

2. The method of claim 1, wherein the converting the input information according to the target communication mode to generate corresponding target communication information comprises:

judging whether the input information conforms to an output format corresponding to the target communication mode;

when the input information does not conform to the output format, converting the input information based on the output format to generate corresponding target communication information;

and when the input information conforms to the output format, taking the input information as the target communication information.

3. The method according to any one of claims 1 to 2,

the communication mode comprises a text communication mode, a video communication mode or a voice communication mode;

wherein, the output format corresponding to the text communication mode is a text display format;

the output format corresponding to the voice communication mode is a voice playing format;

and the output format corresponding to the video communication mode is a video playing format.
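Claims 2 and 3 together describe a lookup followed by a conditional conversion: each communication mode maps to one output format, and the input is converted only when it does not already conform. The sketch below illustrates that control flow. All names (`Mode`, `Format`, `Message`, `OUTPUT_FORMAT`, `to_target`) are hypothetical, and the converters are supplied by the caller because the claims do not fix how each conversion is performed.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable, Dict


class Mode(Enum):
    TEXT = "text"
    VOICE = "voice"
    VIDEO = "video"


class Format(Enum):
    TEXT_DISPLAY = "text display"
    VOICE_PLAYING = "voice playing"
    VIDEO_PLAYING = "video playing"


# Claim 3: the output format that corresponds to each communication mode.
OUTPUT_FORMAT: Dict[Mode, Format] = {
    Mode.TEXT: Format.TEXT_DISPLAY,
    Mode.VOICE: Format.VOICE_PLAYING,
    Mode.VIDEO: Format.VIDEO_PLAYING,
}


@dataclass(frozen=True)
class Message:
    fmt: Format
    payload: bytes


Converter = Callable[[Message], Message]


def to_target(message: Message, mode: Mode,
              converters: Dict[Format, Converter]) -> Message:
    """Claim 2: pass conforming input through, convert everything else."""
    wanted = OUTPUT_FORMAT[mode]
    if message.fmt is wanted:
        return message
    return converters[wanted](message)
```

Keying the converters by the wanted format keeps the dispatch independent of how many input types the terminal supports; each converter decides internally how to handle the input it receives.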

4. The method of claim 1, further comprising:

collecting voice information input by a user;

carrying out voice processing on the collected voice information and determining the voiceprint characteristics of the user;

5. The method of claim 1, further comprising:

collecting voice information input by a user;

carrying out voice recognition on the collected voice information, and determining a text corresponding to each audio in the voice information;

storing the mapping relation between each audio and the corresponding characters to generate a voice database corresponding to the user;
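The database construction in claim 5 can be pictured as follows. This is a sketch under the assumption that a speech recognizer (not shown, and not specified by the claim) has already split the collected voice into clips and returned each clip with its recognized text; `build_voice_database` is a hypothetical name.

```python
from typing import Dict, Iterable, Tuple


def build_voice_database(
    recognized: Iterable[Tuple[bytes, str]],
) -> Dict[str, bytes]:
    """Store the mapping between each recognized text and its audio clip.

    `recognized` yields (audio_clip, text) pairs. When the same text is
    recognized more than once, the first recording is kept (an arbitrary
    choice made for this sketch).
    """
    db: Dict[str, bytes] = {}
    for clip, text in recognized:
        db.setdefault(text, clip)
    return db
```

The dictionary is keyed by text because the later conversion step looks audio up by word; a real store might keep several clips per word and pick the clearest one.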

6. The method of claim 5, further comprising:

in the input process, searching each candidate item matched with the currently input text information from the voice database;

displaying characters in each candidate item;

selecting a target candidate item according to the operation instruction;

and adding the audio in the target candidate item to the target communication information.
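The candidate flow of claim 6 resembles input-method completion over the voice database. The sketch below assumes prefix matching as the matching rule, which the claim does not specify, and models the operation instruction as an index into the displayed list; `find_candidates` and `append_selected` are hypothetical names.

```python
from typing import Dict, List, Tuple

Candidate = Tuple[str, bytes]  # (characters to display, audio clip)


def find_candidates(typed: str, db: Dict[str, bytes],
                    limit: int = 5) -> List[Candidate]:
    """Return up to `limit` database entries whose text starts with `typed`."""
    if not typed:
        return []
    matches = sorted(text for text in db if text.startswith(typed))
    return [(text, db[text]) for text in matches[:limit]]


def append_selected(target: bytearray, candidates: List[Candidate],
                    index: int) -> None:
    """Add the audio of the chosen candidate to the outgoing message."""
    target.extend(candidates[index][1])
```

The text of each candidate is what the user sees, while the audio is what gets attached, so the user can compose a voice message entirely from the keyboard.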

7. The method of claim 3, wherein when the target communication mode is a text communication mode, the method further comprises:

in the input process, searching each candidate item matched with the currently input voice information from a preset voice database;

displaying characters in each candidate item, or playing audio in each candidate item;

selecting a target candidate item according to the operation instruction;

and adding the characters in the target candidate item to the target communication information.

8. A communication processing apparatus, comprising:

the communication mode determining module is used for monitoring the target communication application and determining a target communication mode corresponding to the target communication application; the target communication mode is a communication mode used by opposite-end equipment;

the communication information generation module is used for converting the input information according to the target communication mode when the input information is detected to generate corresponding target communication information; the target communication information conforms to the sending format of the target communication mode;

a communication information sending module, configured to send the target communication information to an opposite-end device, so that the opposite-end device outputs the target communication information;

wherein, the communication information generation module includes:

the voice determining submodule is used for determining whether the current input information accords with a voice playing format or not when the target communication mode is a voice communication mode;

the voice conversion submodule is used for converting the current input information when the current input information does not accord with a voice playing format, generating voice information according with the voiceprint characteristics of a user and taking the voice information as the target communication information; wherein the generating of the voice information conforming to the voiceprint characteristics of the user comprises: segmenting input text information to obtain segmented characters; extracting audio corresponding to each word in the text information from the corresponding voice database based on the mapping relation stored in the voice database, and further generating voice information conforming to the voiceprint characteristics of the user based on the extracted audio;

the communication information generation module comprises:

the video conversion sub-module is used for converting the input information to generate audio information conforming to the voiceprint characteristics of the user if the input information does not conform to the video playing format, synthesizing the video information conforming to the video playing format by adopting the audio information and the image information of the user, and taking the video information as the target communication information;

the communication information receiving module is used for receiving the communication information sent by the opposite terminal equipment;

the communication information output module is used for outputting the communication information according to an output format corresponding to a local communication mode;

the communication information generation module comprises:

the text determining submodule is used for determining whether the current input information conforms to a text display format or not when the target communication mode is a text communication mode;

9. The apparatus of claim 8, wherein the communication information generating module is specifically configured to determine whether the input information conforms to an output format corresponding to the target communication mode; when the input information does not conform to the output format, converting the input information based on the output format to generate corresponding target communication information; and when the input information conforms to the output format, taking the input information as the target communication information.

10. The apparatus of any one of claims 8-9, wherein the communication mode comprises any one of a text communication mode, a video communication mode or a voice communication mode; the output format corresponding to the text communication mode is a text display format; the output format corresponding to the voice communication mode is a voice playing format; and the output format corresponding to the video communication mode is a video playing format.

11. The apparatus of claim 8, further comprising:

the voice information collection module is used for collecting voice information input by a user;

the voiceprint feature determination module is used for carrying out voice processing on the collected voice information and determining the voiceprint features of the user;

the voiceprint feature determination module is specifically configured to convert input text information based on the voiceprint features, and generate voice information that conforms to the voiceprint features of the user.

12. The apparatus of claim 8, further comprising:

the voice recognition module is used for carrying out voice recognition on the collected voice information and determining a text corresponding to each audio in the voice information;

the voice database generation module is used for storing the mapping relation between each audio and the corresponding characters and generating a voice database corresponding to the user;

the voice conversion sub-module is also used for converting the input text information according to the mapping relation stored in the voice database to generate the voice information according with the voiceprint characteristics of the user.

13. The apparatus of claim 12, further comprising:

the text searching module is used for searching candidate items matched with the currently input text information from the voice database in the input process;

the character display module is used for displaying characters in each candidate item;

the target candidate item selecting module is used for selecting a target candidate item according to the operation instruction;

14. The apparatus of claim 8, wherein when the target communication mode is a text communication mode, the apparatus further comprises:

the voice searching module is used for searching candidate items matched with the currently input voice information from a preset voice database in the input process;

the audio playing module is used for playing the audio in each candidate item;

15. A terminal comprising a memory and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by one or more processors, the one or more programs including instructions for:

when the target communication mode is a voice communication mode, determining that the current input information does not conform to a voice playing format; converting the input information to generate voice information which accords with the voiceprint characteristics of the user, and taking the voice information as the target communication information; the target communication information conforms to the sending format of the target communication mode; wherein the generating of the voice information conforming to the voiceprint characteristics of the user comprises: segmenting input text information to obtain segmented characters; extracting audio corresponding to each word in the text information from the corresponding voice database based on the mapping relation stored in the voice database, and further generating voice information according with the voiceprint characteristics of the user based on the extracted audio;

when the target communication mode is a video communication mode, determining whether the current input information conforms to a video playing format; if not, converting the input information to generate audio information conforming to the voiceprint characteristics of the user; synthesizing video information conforming to a video playing format by adopting the audio information and the image information of the user, and taking the video information as the target communication information;

sending the target communication information to opposite terminal equipment so that the opposite terminal equipment outputs according to the target communication information;

receiving communication information sent by the opposite terminal equipment;

16. The terminal of claim 15, wherein the converting the input information according to the target communication mode to generate corresponding target communication information comprises:

17. The terminal according to any of the claims 15 to 16,

18. The terminal of claim 15, further comprising instructions for:

collecting voice information input by a user;

carrying out voice processing on the collected voice information, and determining the voiceprint characteristics of the user;

19. The terminal of claim 15, further comprising instructions for:

collecting voice information input by a user;

performing voice recognition on the collected voice information, and determining a text corresponding to each audio in the voice information;

20. The terminal of claim 19, further comprising instructions for:

displaying characters in each candidate item;

selecting a target candidate item according to the operation instruction;

21. The terminal of claim 17, wherein when the target communication mode is a text communication mode, the terminal further comprises instructions for:

selecting a target candidate item according to the operation instruction;

22. A readable storage medium, wherein instructions in the storage medium, when executed by a processor of a terminal, enable the terminal to perform the communication processing method according to any one of method claims 1 to 7.