WO2020056948A1 - Method and device for data processing and device for use in data processing - Google Patents

Method and device for data processing and device for use in data processing

Info

Publication number
WO2020056948A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
reply
candidate
data
clipboard
Prior art date
Application number
PCT/CN2018/121235
Other languages
French (fr)
Chinese (zh)
Inventor
刘文文
李琳
潘牧野
Original Assignee
北京搜狗科技发展有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京搜狗科技发展有限公司
Publication of WO2020056948A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/951 Indexing; Web crawling techniques
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L51/00 User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
    • H04L51/02 User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail using automatic reactions or user delegation, e.g. automatic replies or chatbot-generated messages

Definitions

  • The present application relates to the field of communications technologies, and in particular to a data processing method, a data processing apparatus, and a device for data processing.
  • Communication applications such as SMS applications and instant messaging applications provide information interaction functions that let different users exchange information. For example, users can send text messages to each other through an SMS application, or send messages to each other through an instant messaging application.
  • The embodiments of the present application provide a data processing method, a data processing apparatus, and a device for data processing, which can improve reply efficiency and enable intelligent replies in reply scenarios that span communication windows.
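  • In one aspect, an embodiment of the present application discloses a data processing method, including:
  • determining to-be-replied information from clipboard information, where the to-be-replied information corresponds to all or part of the clipboard information;
  • determining a reply candidate corresponding to the to-be-replied information; and
  • displaying the reply candidate.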
  • In another aspect, an embodiment of the present application discloses a data processing apparatus, including:
  • a to-be-replied information determining module, configured to determine to-be-replied information from clipboard information, where the to-be-replied information corresponds to all or part of the clipboard information;
  • a reply candidate determining module, configured to determine a reply candidate corresponding to the to-be-replied information; and
  • a reply candidate display module, configured to display the reply candidate.
  • In yet another aspect, an embodiment of the present application discloses a device for data processing, including a memory and one or more programs, where the one or more programs are stored in the memory and are configured to be executed by one or more processors, the one or more programs including instructions for performing the steps of the foregoing data processing method.
  • In still another aspect, an embodiment of the present application discloses a machine-readable medium having instructions stored thereon that, when executed by one or more processors, cause a device to perform the data processing method according to one or more of the foregoing aspects.
  • The embodiments of the present application allow a user to carry to-be-replied information through the clipboard, so that the to-be-replied information can be determined from the clipboard information and a corresponding reply candidate can be provided, which improves reply efficiency.
  • The embodiments can be applied to reply scenarios that span communication windows. Specifically, a user can copy information in a first communication window and jump to a second communication window. The embodiments can then automatically determine the to-be-replied information from the clipboard information and provide a corresponding reply candidate, so that an intelligent reply is achieved across communication windows and reply efficiency is improved.
  • The embodiments may also be applied to a reply scenario within a single communication window, in which the information received by the user and the information sent by the user are located in the same communication window. In this scenario as well, the to-be-replied information can be determined from the clipboard information.
  • FIG. 1 is a schematic diagram of an application environment of a data processing method according to an embodiment of the present application
  • FIG. 2 is a flowchart of steps in a first embodiment of a data processing method according to the present application
  • FIG. 3 is a flowchart of steps in a second embodiment of a data processing method of the present application.
  • FIG. 4 is a schematic diagram of an interface according to an embodiment of the present application.
  • FIG. 5 is a structural block diagram of an embodiment of a data processing apparatus of the present application.
  • FIG. 6 is a block diagram of an apparatus 800 for data processing of the present application.
  • FIG. 7 is a schematic structural diagram of a server in some embodiments of the present application.
  • An embodiment of the present application provides a data processing scheme, which may include: determining to-be-replied information from clipboard information, where the to-be-replied information may correspond to all or part of the clipboard information; determining a reply candidate corresponding to the to-be-replied information; and displaying the reply candidate.
  • The clipboard is an area in memory and is a plug-in in a preset program.
  • Through simple operations such as cut, copy, and paste, the user can pass and share selected information between applications.
  • The clipboard uses the terminal's internal memory or virtual memory to temporarily store cut and copied information.
  • the preset programs may specifically include: browser programs, instant messaging programs, social network (for example: Weibo, forum, news, etc.) programs, and other applications with clipboard functions.
  • the information in the clipboard is selected by the user.
  • the type of clipboard information may include at least one of the following types: text, picture, audio, video, and page address.
  • the clipboard may include only one piece of information; or, the clipboard may include multiple pieces of information.
  • the clipboard may include three pieces of information, where the first piece of information may be a text type, the second piece of information may be a voice type, and the third piece of information may be a picture type. It can be understood that the embodiment of the present application does not limit the amount of the clipboard information and the specific type of the clipboard information.
  • the embodiments of the present application may be applicable to a reply scenario across communication windows.
  • A reply scenario across communication windows may involve different communication windows of different applications, or different communication windows of the same application.
  • Different windows of different applications may include: a communication window of a short message application and a communication window of an instant messaging application, or a communication window of a first instant messaging application and a communication window of a second instant messaging application.
  • In one example, user A receives message A sent by user B through a short message application. Because sending a short message incurs a charge, the embodiments allow user A to send the reply to message A to user B through an instant messaging application.
  • Accordingly, user A can copy message A and jump to the communication window between user A and user B in the instant messaging application.
  • The clipboard information (that is, the content of message A) can then be used as the to-be-replied information, and reply candidates are automatically provided for user A to choose from, so that user A can reply quickly across scenarios, which improves reply efficiency.
  • In another example, user C receives message B sent by user D through communication window A of an instant messaging application. Because communication window A is a group window, the reply to message B may involve privacy. The embodiments therefore allow user C to send the reply to message B to user D through communication window B of the instant messaging application. Accordingly, user C can copy message B and jump to communication window B between user C and user D.
  • The clipboard information (that is, the content of message B) can then be used as the to-be-replied information, and reply candidates are automatically provided for user C to choose from, so that user C can reply quickly across scenarios, which improves reply efficiency.
  • For instance, user D is user C's supervisor, communication window A is the communication window of a work group, and communication window B is the communication window between user C and user D. A reply posted in the work group window could be seen by users other than user C and user D, so it is inconvenient to reply to message B there.
  • The embodiments can be applied to reply scenarios across communication windows: a user copies information in a first communication window and jumps to a second communication window, after which the to-be-replied information is automatically determined from the clipboard information and a corresponding reply candidate is provided.
  • The jump to the second communication window is only an optional embodiment. The solution does not depend on a jump between communication windows: even if the user does not jump to another communication window, the to-be-replied information can be automatically determined from the clipboard information, and a corresponding reply candidate can be provided.
  • The data processing method provided in the embodiments may be applied in an application environment such as a website and/or an APP (application) to improve reply efficiency.
  • The APP may be a communication application, and the website may be a webpage that provides a communication service, and so on.
  • The data processing method provided in the embodiments can be applied to the application environment shown in FIG. 1.
  • The client 100 and the server 200 are located in a wired or wireless network, through which the client 100 exchanges data with the server 200.
  • The client 100 may run on a terminal.
  • The terminal includes, but is not limited to, a smartphone, a tablet, an e-book reader, an MP3 (Moving Picture Experts Group Audio Layer III) player, an MP4 (Moving Picture Experts Group Audio Layer IV) player, a laptop computer, an in-vehicle computer, a desktop computer, a set-top box, a smart TV, a wearable device, and so on.
  • The client 100 may be an APP running on the terminal, such as an instant messaging APP, a short message APP, an input method APP, or an APP built into the operating system.
  • The embodiments do not limit the specific APP corresponding to the client.
  • Referring to FIG. 2, a flowchart of the steps of a first embodiment of a data processing method according to the present application is shown, which may specifically include the following steps:
  • Step 201: Determine to-be-replied information from the clipboard information; the to-be-replied information may correspond to all or part of the clipboard information.
  • Step 202: Determine a reply candidate corresponding to the to-be-replied information.
  • Step 203: Display the reply candidate.
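  • The three steps above can be sketched as follows. This is a minimal illustration under stated assumptions, not the applicant's implementation: the `ClipboardItem` structure, the text-only filter in Step 201, and the `CANNED_REPLIES` table standing in for Step 202 are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class ClipboardItem:
    """One piece of clipboard information (hypothetical structure)."""
    kind: str      # e.g. "text", "picture", "audio", "video", "page_address"
    content: str


# Hypothetical lookup table standing in for the real candidate generator.
CANNED_REPLIES: Dict[str, List[str]] = {
    "Come to the meeting room": ["Received", "OK"],
}


def determine_to_be_replied(items: List[ClipboardItem]) -> List[ClipboardItem]:
    # Step 201: keep all or part of the clipboard information.
    # Assumption: this sketch keeps only non-empty text items.
    return [it for it in items if it.kind == "text" and it.content.strip()]


def determine_reply_candidates(item: ClipboardItem) -> List[str]:
    # Step 202: placeholder for the topic-, mapping-, or rule-based methods.
    return CANNED_REPLIES.get(item.content, [])


def display(candidates: List[str]) -> str:
    # Step 203: a real client would render these in the UI.
    return " | ".join(candidates)


clipboard = [
    ClipboardItem("text", "Come to the meeting room"),
    ClipboardItem("picture", "img_001"),
]
pending = determine_to_be_replied(clipboard)
shown = [display(determine_reply_candidates(it)) for it in pending]
```

  • Keeping the three steps as separate functions mirrors the statement that each step may be executed by a server and/or a client.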
  • At least one step of the method embodiment shown in FIG. 2 may be executed by a server and / or a client.
  • the embodiment of this application does not limit the specific execution subject of each step.
  • the method embodiment in FIG. 2 may correspond to a trigger condition.
  • The trigger condition may include: the input method keyboard is invoked.
  • Invoking the input method keyboard indicates that the user wants to reply by typing, so the method of the embodiments can be triggered.
  • the trigger condition may include: the clipboard information is updated.
  • the updated clipboard information may indicate that the user has generated new clipboard information, indicating that the user has a response requirement, and therefore, the method of the embodiment of the present application may be triggered.
  • the trigger condition may include: the clipboard information is updated and the input method keyboard is invoked. In this case, it indicates that the user has a requirement to reply through input, and therefore, the method in the embodiment of the present application can be triggered.
  • the trigger condition may include: after the clipboard information is updated, jumping to a communication window.
  • Jumping to a communication window refers to jumping from the interface shown before the clipboard operation to the communication window. This indicates that the user wants to reply through the communication window reached by the jump, so the method of the embodiments can be triggered.
  • the interface before the clipboard operation can be a communication window or a non-communication window.
  • the trigger condition may include: after the clipboard information is updated, jumping to a communication window, and the input method keyboard is invoked.
  • the above trigger condition is only an optional embodiment. In fact, those skilled in the art can determine the above trigger condition according to actual application requirements.
  • the above trigger condition may also be a preset gesture of a user.
  • the specific trigger conditions are not limited.
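  • The optional trigger conditions listed above can be sketched as a simple predicate. The `UiState` fields and the policy names are assumptions introduced for this example; the sketch also ignores event ordering (for instance, that the jump happens after the clipboard update), which a real client would need to track.

```python
from dataclasses import dataclass


@dataclass
class UiState:
    """Hypothetical snapshot of the UI events relevant to triggering."""
    clipboard_updated: bool = False
    keyboard_invoked: bool = False
    jumped_to_communication_window: bool = False


def should_trigger(state: UiState, policy: str) -> bool:
    # Each policy name (an assumption) maps to one trigger condition in the text.
    checks = {
        "keyboard": state.keyboard_invoked,
        "clipboard": state.clipboard_updated,
        "clipboard+keyboard": state.clipboard_updated and state.keyboard_invoked,
        "clipboard+jump": (state.clipboard_updated
                           and state.jumped_to_communication_window),
        "clipboard+jump+keyboard": (state.clipboard_updated
                                    and state.jumped_to_communication_window
                                    and state.keyboard_invoked),
    }
    return checks[policy]


state = UiState(clipboard_updated=True, jumped_to_communication_window=True)
```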
  • the clipboard information can be obtained by accessing the clipboard.
  • Clipboard information can include: one piece of content, or multiple pieces of content.
  • the selection interface corresponding to each piece of information can be displayed in the communication window, so that the user can select at least one piece of information to copy
  • Step 201 may determine to-be-reply information from the clipboard information, and the to-be-reply information may correspond to all or part of the clipboard information.
  • The clipboard information may include both a message and the sender identification of that message.
  • In this case, the sender identification may be filtered out of the clipboard information, and the message itself may be retained as the to-be-replied information.
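  • As a hedged illustration of filtering out the sender identification, the sketch below assumes the copied text has the form "<sender>: <message>". The source does not specify any format, so the pattern and the `strip_sender` helper are purely hypothetical.

```python
import re

# Assumed format: a short sender name, a colon, then whitespace and the message.
# Requiring whitespace after the colon avoids cutting text such as "9:30".
SENDER_PREFIX = re.compile(r"^\s*[^:\n]{1,32}:\s+")


def strip_sender(clipboard_text: str) -> str:
    """Drop a leading sender identification and keep the message body."""
    return SENDER_PREFIX.sub("", clipboard_text, count=1).strip()


message = strip_sender("User B: What time do you get off work?")
```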
  • the type of the clipboard information may include at least one of the following types: text, picture, audio, video, and page address.
  • the embodiment of the present application may determine at least one type of information to be responded to, and determine a corresponding reply candidate for the at least one type of information to be responded to.
  • Step 202, determining the reply candidate corresponding to the to-be-replied information, may specifically include: determining a topic corresponding to the to-be-replied information; and determining, according to the topic, the reply candidate corresponding to the to-be-replied information.
  • A topic may be characterized by a topic keyword, that is, a keyword that reflects the topic of the to-be-replied information.
  • Determining the topic corresponding to the to-be-replied information may specifically include:
  • Determination method A1: determining the topic corresponding to a page address according to the content of the page at that address; and/or
  • Determination method A2: performing first recognition on the video stream and/or audio stream corresponding to a video, and determining the topic corresponding to the video according to the first recognition result; and/or
  • Determination method A3: performing second recognition on a picture, and determining the topic corresponding to the picture according to the second recognition result; and/or
  • Determination method A4: performing speech recognition on audio, and determining the topic corresponding to the to-be-replied information according to the speech recognition result.
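  • Determination methods A1 to A4 amount to choosing a recognizer by clipboard type. The dispatcher below is a sketch only: the recognizers are placeholder lambdas (assumptions), where a real system would call page analysis, video and image recognition, or speech recognition components.

```python
from typing import Callable, Dict

# A recognizer takes the clipboard content and returns a topic (hypothetical type).
Recognizer = Callable[[str], str]


def make_topic_resolver(recognizers: Dict[str, Recognizer]) -> Callable[[str, str], str]:
    def resolve(kind: str, content: str) -> str:
        if kind not in recognizers:
            raise ValueError(f"unsupported clipboard type: {kind}")
        return recognizers[kind](content)
    return resolve


resolve_topic = make_topic_resolver({
    "page_address": lambda url: "topic from page content",     # method A1
    "video": lambda ref: "topic from video/audio streams",     # method A2
    "picture": lambda ref: "topic from image recognition",     # method A3
    "audio": lambda ref: "topic from speech recognition",      # method A4
})
```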
  • Video usually consists of still pictures, which are called video frames.
  • the video stream corresponding to the video can be used to represent consecutive video frames.
  • the audio stream corresponding to the video can be used to represent a continuous audio signal, and the audio stream is synchronized with the continuous video frame to achieve the synchronous playback effect of the video picture and audio.
  • The audio stream corresponding to the video may correspond to video content such as the dialogue and the soundtrack. The soundtrack may include theme songs, interlude songs, ending songs, and background music accompanying the dialogue. It can be understood that the embodiments do not limit the specific video content corresponding to the audio stream.
  • the video stream and audio stream corresponding to the video can be located in the same file.
  • audio can be extracted from the video file.
  • The video file can be converted into an audio file. For example, a video file in MP4 format can be converted into an audio file in MP3 (Moving Picture Experts Group Audio Layer III) format.
  • the video stream and audio stream corresponding to the video may be located in separate files, that is, the video file and the audio file may be independent. In this case, the audio file may be directly obtained.
  • the audio file may include an audio stream corresponding to the video, so the audio stream corresponding to the video may be read from the audio file.
  • The preset time interval may be the playback duration corresponding to N video frames, where N is a positive integer. It can be understood that the embodiments do not limit the specific value of N or the preset time interval.
  • The embodiments can recognize the video stream and/or audio stream corresponding to the video by using the following recognition methods:
  • Recognition method 1: perform image recognition on the video stream corresponding to the video to obtain corresponding image target information; and/or
  • Recognition method 2: perform text recognition on the video stream corresponding to the video to obtain corresponding text information; and/or
  • Recognition method 3: perform speech recognition on the audio stream corresponding to the video to obtain corresponding text information.
  • Image recognition refers to technology that uses a machine to process, analyze, and understand images in order to identify image targets of various patterns.
  • In the embodiments, a machine can process, analyze, and understand video frames to identify the image targets in them.
  • An image target in a video frame may correspond to a certain image region in the video frame, and may include objects, people, spaces, and so on.
  • For example, a person may be a character in the video frame, an object may be an item worn by a character in the video frame, and a space may be the environment in which the character is located, such as an outdoor or indoor environment. An indoor environment may include information such as walls and floors. It can be understood that the embodiments do not limit the specific image targets in the video frame.
  • The process of performing image recognition on the video frames of a video stream may include: detecting image targets in a video frame, and analyzing the detected image targets with a deep learning method to obtain corresponding image target information. The recognition result may therefore include the image target information corresponding to the video frame.
  • The image target information may include: the image of the image target (that is, its image in the video frame, which usually corresponds to a closed region of the frame) and the recognition result of the image target (such as its recognized name and category).
  • The text information in a video frame may include text contained in the image and/or text in the subtitles.
  • Text recognition technology may be used to perform text recognition on the video frames of a video stream.
  • The text recognition technology may include OCR (Optical Character Recognition) technology. OCR may pre-process an image (for example, by noise reduction), segment the characters in it to obtain single-character images, and then recognize the character corresponding to each single-character image. It can be understood that the embodiments do not limit the specific text recognition technology.
  • Alternatively, a subtitle file corresponding to the subtitles of the video frames can be obtained, and the text information in the subtitles can be read from the subtitle file. It can be understood that the embodiments do not limit the specific manner of acquiring the text information in the subtitles.
  • The speech feature sequence is denoted $O = \{O_1, O_2, \ldots, O_i, \ldots, O_T\}$, where $O_i$ is the i-th speech feature and $T$ is the total number of speech features.
  • the process of speech recognition is to find the most likely word string W according to the known speech feature sequence O.
  • speech recognition is a model matching process.
  • the process of recognizing the voice input by the user is the process of comparing the features of the voice input by the user with the template, and finally determining the best template that matches the voice input by the user to obtain the result of voice recognition .
  • Specific speech recognition algorithms may include training and recognition algorithms based on statistical hidden Markov models, training and recognition algorithms based on neural networks, recognition algorithms based on dynamic time warping (DTW) matching, and other algorithms.
  • The embodiments do not limit the specific speech recognition process.
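  • Finding the most likely word string W for a known feature sequence O is commonly written as the maximization below. This is the standard statistical formulation, given here as background rather than as something stated in the application; the split into an acoustic term and a language term follows from Bayes' rule, since P(O) does not depend on W.

```latex
\hat{W} = \arg\max_{W} P(W \mid O) = \arg\max_{W} P(O \mid W)\, P(W)
```

  • Here $P(O \mid W)$ is usually called the acoustic model and $P(W)$ the language model.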
  • For example, user G receives website A sent by user H and copies it. The embodiments can automatically analyze the page content of website A to obtain topic A corresponding to website A. Topic A may relate to entertainment gossip, the national economy and people's livelihood, and so on. A reply candidate corresponding to topic A can then be given automatically, such as "I'm reading this page too" or "I also like content about topic A".
  • As another example, the embodiments can automatically recognize video A and obtain topic B corresponding to video A. Topic B may relate to a child's life in kindergarten, and a reply candidate corresponding to topic B can be given automatically, such as "It seems that the child is happy in the kindergarten".
  • As a further example, the embodiments can automatically recognize picture A and obtain topic C corresponding to picture A. Topic C may relate to a piece of clothing, such as a coat, a down jacket, or a skirt, and reply candidates for topic C can be given automatically, such as "Topic C is beautiful and worth buying", "Topic C is a bit old-fashioned", or "Topic C is fat".
  • Determination method B1: the TF-IDF (term frequency-inverse document frequency) method is used to determine the topic keywords corresponding to the to-be-replied information.
  • The main idea of TF-IDF is: if a word or phrase appears frequently in one document or text (that is, it has a high TF) and rarely appears in other documents or texts, the word or phrase is considered to have good category-distinguishing ability and to be suitable for classification.
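  • The TF-IDF idea can be illustrated with a few lines of pure Python. The function name `tfidf_keywords`, the tokenized toy corpus, and the unsmoothed `log(N / df)` weighting are choices made for this sketch; production systems often use smoothed variants.

```python
import math
from collections import Counter
from typing import List


def tfidf_keywords(doc: List[str], corpus: List[List[str]], top_k: int = 2) -> List[str]:
    """Rank the words of `doc` by TF-IDF; `corpus` must contain `doc` itself."""
    tf = Counter(doc)
    n_docs = len(corpus)
    scores = {}
    for word, count in tf.items():
        df = sum(1 for d in corpus if word in d)   # document frequency (>= 1)
        idf = math.log(n_docs / df)                # 0 for words in every document
        scores[word] = (count / len(doc)) * idf
    # Highest score first; ties are broken alphabetically for a stable order.
    return sorted(scores, key=lambda w: (-scores[w], w))[:top_k]


corpus = [
    ["come", "to", "the", "meeting", "room"],
    ["the", "plan", "needs", "a", "change"],
    ["the", "budget", "plan", "and", "the", "budget", "review"],
]
keywords = tfidf_keywords(corpus[2], corpus, top_k=1)
```

  • In this toy corpus, "the" appears in every document and therefore scores zero, while "budget" is frequent in the third document and absent elsewhere, so it ranks first.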
  • Determination method B2: the LDA (Latent Dirichlet Allocation) model is used to determine the topic keywords corresponding to the to-be-replied information.
  • The LDA model is a document generation model and an unsupervised machine learning technique. It assumes that a document or text contains multiple topics, and that each topic corresponds to different topic keywords. To construct a document, a topic is first selected with a certain probability, and a topic keyword is then selected with a certain probability under that topic, which generates the first topic keyword of the document. Repeating this process generates the whole document or text.
  • Using LDA is the inverse of this generation process: given a document or text, find its topics and the topic keywords corresponding to those topics.
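  • The generation process described above can be sketched directly. This is a simplified illustration: the topic and keyword probabilities are fixed toy values (assumptions), whereas full LDA draws them from Dirichlet priors, and the inference direction actually used for keyword extraction is not shown.

```python
import random
from typing import Dict, List


def generate_document(topic_probs: Dict[str, float],
                      keyword_probs: Dict[str, Dict[str, float]],
                      length: int,
                      seed: int = 0) -> List[str]:
    """Repeat: pick a topic, then pick a keyword under that topic."""
    rng = random.Random(seed)
    words = []
    for _ in range(length):
        topic = rng.choices(list(topic_probs),
                            weights=list(topic_probs.values()))[0]
        dist = keyword_probs[topic]
        words.append(rng.choices(list(dist), weights=list(dist.values()))[0])
    return words


TOPICS = {"work": 0.5, "food": 0.5}
KEYWORDS = {
    "work": {"meeting": 0.6, "room": 0.4},
    "food": {"dinner": 0.5, "lunch": 0.5},
}
doc = generate_document(TOPICS, KEYWORDS, length=5)
```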
  • Determination method B3: a classification model is used to determine a category corresponding to the to-be-replied information, and a topic keyword is obtained based on the information of the category.
  • The classification model may include a fastText model.
  • For an input word sequence (a text or a sentence), the fastText model can output the probability that the word sequence belongs to each category.
  • The fastText model can combine the words and phrases in a word sequence into a feature vector.
  • The feature vector is mapped to a middle layer through a linear transformation, and the middle layer is then mapped to the corresponding preset categories.
  • fastText may use a non-linear activation function in the process of mapping to the corresponding preset categories.
  • fastText is fast to train and run while maintaining good classification accuracy.
  • the embodiment of the present application does not limit the specific classification model.
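  • The forward pass described above (combine word vectors, apply a linear map, then a non-linear output) can be sketched in pure Python. This is not the fastText library's API: the `classify` function, the 2-dimensional toy embeddings, the weights, and the category labels are all assumptions, and a real model would learn these values from data.

```python
import math
from typing import Dict, List


def softmax(xs: List[float]) -> List[float]:
    m = max(xs)                                   # subtract max for stability
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def classify(words: List[str],
             embeddings: Dict[str, List[float]],
             weights: List[List[float]],
             labels: List[str]) -> Dict[str, float]:
    dim = len(next(iter(embeddings.values())))
    known = [embeddings[w] for w in words if w in embeddings]
    # Feature vector: average of the known word vectors (zeros if none known).
    if known:
        hidden = [sum(v[i] for v in known) / len(known) for i in range(dim)]
    else:
        hidden = [0.0] * dim
    # Linear map to one score per category, then softmax as the non-linearity.
    logits = [sum(w * h for w, h in zip(row, hidden)) for row in weights]
    return dict(zip(labels, softmax(logits)))


EMB = {"meeting": [1.0, 0.0], "room": [1.0, 0.0],
       "eat": [0.0, 1.0], "dinner": [0.0, 1.0]}
W = [[2.0, 0.0], [0.0, 2.0]]
LABELS = ["notification", "inquiry"]
probs = classify(["meeting", "room"], EMB, W, LABELS)
```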
  • An embodiment of the present application may determine the reply candidate corresponding to the to-be-replied information according to the topic.
  • For example, the to-be-replied information may be notification-type information.
  • In this case, the reply candidate may be "Received", "Good", and the like.
  • For instance, the topic of the to-be-replied message "Come to the meeting room" can be "location notification", and the topic of the to-be-replied message "There is a place in your plan that needs to be modified" can be "work modification notice".
  • As another example, the to-be-replied information may be inquiry-type information.
  • In this case, the reply candidate may be an affirmative candidate, a negative candidate, one of several options, or an answer to a question.
  • the corresponding reply candidates may include: location A or location B.
  • the corresponding reply candidate may include: Off or Off.
  • the corresponding reply candidates may include: eating, watching videos, and so on.
  • In another embodiment, step 202, determining the reply candidate corresponding to the to-be-replied information, may specifically include: searching, according to the to-be-replied information, a mapping relationship between to-be-replied data and reply data, to obtain the reply candidate corresponding to the to-be-replied information.
  • The mapping relationship may be obtained according to historical communication data corresponding to at least one user, and the historical communication data may include historical to-be-replied data and the corresponding historical reply data.
  • The above mapping relationship may be determined as follows: obtain historical communication data of at least one user, where the historical communication data may include historical to-be-replied data and its corresponding historical reply data; for each piece of historical to-be-replied data, extract the corresponding historical reply content; and use the historical reply content that meets a preset condition as the reply data corresponding to that historical to-be-replied data, so that the mapping relationship can be determined from the historical to-be-replied data and its corresponding reply data.
  • The user may be the user of the current terminal or at least one sampled user across the entire network, and the historical communication data obtained differs accordingly, for example:
  • At least one set of historical communication data generated by the user of the current terminal: for example, the user's response data to communication content can be obtained, and at least one question-answer pair can be extracted from the response data. Each question-answer pair may include communication content and the response content to that communication content; the at least one question-answer pair is the at least one set of historical communication data generated by the user.
  • For example, the terminal may often receive the text message "When do you get off work?", to which the user sometimes replies "9 o'clock" and sometimes "8 o'clock". Thus "When do you get off work?" and "9 o'clock" form one question-answer pair, and "When do you get off work?" and "8 o'clock" form another. Personalized cache data can be formed from these pairs: "When do you get off work?": (1) 9 o'clock, (2) 8 o'clock. When the terminal receives the same text message again, it offers two reply candidates, (1) 9 o'clock and (2) 8 o'clock, and the user can reply to the message with a single tap, without typing.
  • The historical to-be-replied data in each set of historical communication data may be obtained first, and identical pieces of historical to-be-replied data are then merged to obtain all the historical to-be-replied data contained in the historical communication data.
  • The historical to-be-replied data in a set of historical communication data may be the preceding message of a conversation, in which case the historical reply data is the response generated for that preceding message; or the historical to-be-replied data may be network question data, in which case the historical reply data is the answer to that question; and so on.
  • For each piece of historical to-be-replied data, the corresponding historical reply data can be obtained, together with the number of occurrences of each piece of historical reply data.
  • For each piece of historical to-be-replied data, the historical reply data whose number of occurrences is greater than a preset number (for example, 20 or 30) can be taken as its corresponding reply data; alternatively, the historical reply data can be sorted by number of occurrences, and the top few entries (for example, the top 4 or 5) are used as its reply data. In this way, the mapping relationship between the to-be-replied data and the reply data is obtained.
  • For example, the to-be-replied data "When do you get off work?" has two pieces of reply data: (1) 8 o'clock, (2) 9 o'clock; the to-be-replied data "Did you eat?" has six pieces of reply data: (1) have eaten, (2) not, (3) not yet, (4) eaten, (5) eaten, (6) have not yet; and the to-be-replied data "Rest early, good night" has one piece of reply data: (1) good night. It can be understood that the embodiments of the present application do not limit the specific mapping relationship.
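The counting and filtering described above can be sketched in Python as follows. This is only an illustration under stated assumptions: the function name `build_reply_mapping`, the parameters `threshold` and `top_k`, and the sample history are hypothetical and are not taken from the application.

```python
from collections import Counter, defaultdict


def build_reply_mapping(history, threshold=0, top_k=None):
    """Map each to-be-replied message to its most frequent historical replies.

    history   -- iterable of (message, reply) pairs (hypothetical input format)
    threshold -- keep only replies seen more than this many times
    top_k     -- optionally keep only the k most frequent replies
    """
    counts = defaultdict(Counter)
    for message, reply in history:
        # Identical messages are merged by using the message text as the key.
        counts[message][reply] += 1

    mapping = {}
    for message, reply_counts in counts.items():
        # most_common() orders replies from most to least frequent.
        ranked = [r for r, n in reply_counts.most_common() if n > threshold]
        if top_k is not None:
            ranked = ranked[:top_k]
        if ranked:
            mapping[message] = ranked
    return mapping


history = [
    ("When do you get off work?", "9 o'clock"),
    ("When do you get off work?", "8 o'clock"),
    ("When do you get off work?", "9 o'clock"),
    ("Rest early, good night", "Good night"),
]
mapping = build_reply_mapping(history)
```

With a real history, `threshold` would play the role of the preset number (for example, 20 or 30) and `top_k` the role of the first few positions (for example, 4 or 5).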
  • Step 202 of determining a reply candidate corresponding to the to-be-replied information may specifically include: extracting first feature information of the to-be-replied information; determining, based on a pre-established correspondence between feature information and reply rules, a first reply rule corresponding to the first feature information; and determining at least one reply candidate through the first reply rule.
  • The feature information is, for example, a preset sentence or a preset sentence pattern. Preset sentences are, for example, "Have you eaten?", "When did you sleep?", and "How are you?"; preset sentence patterns are, for example, "Did you go to XX for dinner or go to XX for dinner?" and "Eat XX or XX today?" (where XX is a default word), and so on.
  • Corresponding reply rules can be constructed for each type of feature information.
  • For example, the reply rule established for "Have you eaten?" is: (1) already eaten, (2) not yet eaten, (3) ready to eat; and the reply rule established for "Go to Wanzhou Pea Mi Noodles to eat, or to eat fragrant meal?" is: (1) Wanzhou Pea Mi Noodles, (2) fragrant meal, etc.
  • Other reply rules can also be constructed, which is not limited in the embodiments of this application.
  • Determining the at least one reply candidate through the first reply rule includes: extracting at least one specific keyword from the to-be-replied information; and combining the at least one specific keyword with the first reply rule to obtain the at least one reply candidate.
  • For example, the reply rule constructed for such a sentence pattern is "I go to XX for dinner today", where XX represents the default word taken from the to-be-replied information. For example:
  • The to-be-replied information is "Do you plan to go home for dinner today, or go to Richang for dinner?"
  • The default words include "go home" and "Richang", so the at least one reply candidate constructed includes: (1) I am going home for dinner today; (2) I am going to Richang for dinner today.
  • Other reply candidates can also be added, such as: (3) "Either is fine", etc.
  • Because at least one reply candidate is determined based on a specific keyword contained in the to-be-replied information, the determined reply candidates are more relevant and accurate.
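One way to picture the combination of keywords with a reply rule is the Python sketch below. It assumes the rule is stored as a regular expression plus a reply template; the dictionary layout, the pattern, the template wording, and the extra candidate are all hypothetical choices made for illustration.

```python
import re

# Hypothetical rule: a sentence pattern with two slots, a reply template,
# and optional extra candidates that do not depend on the keywords.
CHOICE_RULE = {
    "pattern": re.compile(
        r"go to (?P<a>.+?) for dinner,? or go to (?P<b>.+?) for dinner",
        re.IGNORECASE,
    ),
    "template": "I am going to {} for dinner today",
    "extras": ["Either is fine"],
}


def candidates_from_rule(message, rule):
    """Extract the slot keywords from the message and fill the reply template."""
    match = rule["pattern"].search(message)
    if match is None:
        return []  # the message does not fit this rule's sentence pattern
    keywords = [match.group("a"), match.group("b")]
    return [rule["template"].format(k) for k in keywords] + rule["extras"]


replies = candidates_from_rule(
    "Do you plan to go to Wanzhou Noodles for dinner, or go to Richang for dinner?",
    CHOICE_RULE,
)
```

Here `replies` holds one candidate per extracted keyword followed by the extra candidate; a message that does not match the pattern yields an empty list, so another rule can be tried.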
  • Step 202 of determining the reply candidate corresponding to the to-be-replied information may specifically include: determining user status information through an application running on the terminal; and determining, according to the user status information, the reply candidate corresponding to the to-be-replied information.
  • Technical solution 4 uses user status information in generating the reply candidate. The user status information obtained through the running application can reflect how the user is using the application.
  • The user's use of the application may specifically include a series of user actions, such as setting, browsing, purchasing, and viewing, generated by the user through the running APP.
  • As a result, the reply candidate generated according to the user status information can carry deeper information beyond a yes-or-no reply, so that the reply candidate can match the user's precise reply intention, thereby improving the accuracy and richness of the quick reply candidates.
  • When the reply candidate matches the user's precise reply intention, the user can use it directly to reply. This further reduces the input cost of replying and therefore improves reply efficiency.
  • For example, based on the user status information, a reply candidate such as "watching a video" or "watching a TV series" may be generated; or, when the content of the short message is, for example, "Did you eat?", reply candidates such as "Not yet, watching a video", "Not yet, watching a TV series", or "Watching a TV series while eating" may be generated.
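A minimal Python sketch of status-aware candidates is given below. The application-to-status table `STATUS_BY_APP`, the application identifiers, and the function name are assumptions introduced for illustration; how the running application is actually detected is outside the scope of this sketch.

```python
# Hypothetical mapping from the running application to a status phrase.
STATUS_BY_APP = {
    "video_player": "watching a video",
    "tv_series_app": "watching a TV series",
}


def status_candidates(message, running_app):
    """Build reply candidates that carry the user's current status."""
    status = STATUS_BY_APP.get(running_app)
    if status is None:
        return []  # no usable status information
    # Capitalize only the first letter so that "TV" stays intact.
    sentence = status[0].upper() + status[1:]
    if message == "Did you eat?":
        return [f"Not yet, {status}", f"{sentence} while eating"]
    return [sentence]


eat_replies = status_candidates("Did you eat?", "tv_series_app")
other_replies = status_candidates("What are you doing?", "video_player")
```

The message "What are you doing?" is a made-up example of a message that is answered with the status alone.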
  • Step 203 may display the reply candidates for selection by the user.
  • The target reply candidate corresponding to the user's trigger operation may be committed to the screen, that is, output to the input box of a communication window.
  • Displaying the reply candidate may specifically include: displaying the reply candidate above an input method keyboard. After the input method keyboard is called up, at least one reply candidate may be displayed above the input method keyboard, whether or not the user has produced any input string. According to another embodiment, the reply candidates may be displayed through a pop-up window or a mask. It can be understood that the embodiments of the present application do not limit the specific display manner of the above reply candidates.
  • Step 203 of displaying the reply candidates may specifically include: sorting multiple reply candidates according to the social relationship between the local user and the peer user; and displaying the sorted reply candidates.
  • The social relationship between the local user and the peer user can be determined as follows: obtaining the communication content between the peer user and the local user; extracting a first predetermined keyword contained in the communication content; and determining, based on the first predetermined keyword, the social relationship between the local user and the peer user.
  • A database may be set in advance, and the database includes at least one social relationship and the keywords corresponding to each social relationship.
  • For example, the keywords for a "couple relationship" may include "Smith", "honey", and "dear"; the keywords for a "married couple relationship" may include "wife" and "husband"; and the keywords for a "colleague relationship" may include "project", "Leader", "Madam", etc.
  • It may be determined whether the communication content contains any of the above keywords; if so, that keyword is extracted as the first predetermined keyword, and the social relationship is then determined from the first predetermined keyword.
  • For each reply candidate, the probability that it belongs to the social relationship may be determined, and the reply candidates are then sorted by that probability and output. For example, if the social relationship is a couple relationship and the reply candidates are (1) Good night, (2) An An, and (3) Good night, dear, with probabilities of belonging to a couple relationship of 0.1, 0.6, and 0.9 respectively, the three reply candidates can be output in the following order: (1) Good night, dear; (2) An An; (3) Good night.
  • multiple reply candidates can be sorted according to social relationships, so the rationality of the ranking results can be improved.
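The keyword lookup and probability-based ordering can be illustrated with the following Python sketch. The keyword table is a small hypothetical stand-in for the preset database, and the probabilities are simply passed in; how such probabilities would be estimated is not specified here.

```python
# Hypothetical stand-in for the preset database of relationships and keywords.
RELATIONSHIP_KEYWORDS = {
    "couple": ["honey", "dear"],
    "colleague": ["project", "leader"],
}


def detect_relationship(communication_content):
    """Return the first relationship whose keyword occurs in the content."""
    text = communication_content.lower()
    for relationship, keywords in RELATIONSHIP_KEYWORDS.items():
        if any(keyword in text for keyword in keywords):
            return relationship
    return None


def rank_candidates(candidates, probabilities):
    """Sort candidates by the probability that they fit the relationship."""
    return sorted(candidates, key=lambda c: probabilities.get(c, 0.0), reverse=True)


relationship = detect_relationship("Good morning, dear")
ranked = rank_candidates(
    ["Good night", "An An", "Good night, dear"],
    {"Good night": 0.1, "An An": 0.6, "Good night, dear": 0.9},
)
```

Simple substring matching is used for brevity; a practical system would likely need tokenization to avoid accidental matches inside longer words.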
  • The data processing method in the embodiments of the present application allows a user to carry the to-be-replied information via the clipboard, so that the to-be-replied information can be determined from the clipboard information and a reply candidate corresponding to the to-be-replied information can be provided, which improves reply efficiency.
  • The embodiments of the present application can be applied to a reply scenario across communication windows. Specifically, a user can copy information in a first communication window and jump to a second communication window; the embodiments of the present application can then automatically determine the to-be-replied information from the clipboard information and provide a reply candidate corresponding to it. An intelligent reply can therefore be implemented in a reply scenario across communication windows, improving reply efficiency.
  • Referring to FIG. 3, a flowchart of the steps of a second embodiment of a data processing method according to the present application is shown, which may specifically include the following steps:
  • Step 301: Determine to-be-replied information from the clipboard information; the to-be-replied information may correspond to all or part of the clipboard information.
  • Step 302: Determine a reply candidate corresponding to the to-be-replied information.
  • Step 303: Display the reply candidate and the clipboard information.
  • The embodiments of the present application can display the reply candidate and the clipboard information at the same time, so that the two are shown side by side for comparison and the user knows that the reply candidate is a response to the clipboard information.
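Steps 301 to 303 can be sketched end to end in Python as shown below. The rule for choosing which part of the clipboard to answer (the last non-empty line), the dictionary lookup, and the text layout of the panel are all assumptions made for this illustration; only the prompt labels mirror the prompts mentioned for the interface.

```python
def determine_to_be_replied(clipboard_text):
    """Step 301: pick all or part of the clipboard text as the message to answer.

    Assumption: the last non-empty line is treated as the message.
    """
    lines = [line.strip() for line in clipboard_text.splitlines() if line.strip()]
    return lines[-1] if lines else ""


def determine_candidates(message, mapping):
    """Step 302: look the message up in a message-to-replies mapping."""
    return list(mapping.get(message, []))


def render(clipboard_text, candidates):
    """Step 303: show the clipboard text and the candidates together."""
    return (
        f"From the clipboard: {clipboard_text.strip()}\n"
        f"Smart reply: {' | '.join(candidates)}"
    )


mapping = {"Did you eat?": ["Have eaten", "Not yet"]}
clipboard = "Did you eat?"
message = determine_to_be_replied(clipboard)
panel = render(clipboard, determine_candidates(message, mapping))
```

Keeping the three steps as separate functions mirrors the three modules of the apparatus embodiment, so each step can be replaced independently.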
  • Referring to FIG. 4, a schematic diagram of an interface according to an embodiment of the present application is shown, which may specifically include: a communication window 401 and an input method interface 402.
  • the communication window 401 may include communication content and an input box. Taking the communication window between user A and user B as an example, the communication content may include: communication content 1 sent by user B and communication content 2 sent by user A, and the like.
  • the input method interface 402 may include an input method keyboard 421, a clipboard information area 422, and a reply candidate area 423.
  • The clipboard information area 422 may be located above the input method keyboard 421, and may cover part or all of the input method toolbar; the reply candidate area 423 may be located in the clipboard information area 422.
  • the clipboard information area 422 may include: clipboard information and prompt information corresponding to the clipboard information, such as "from the clipboard” and the like.
  • the reply candidate area 423 may include: n (n is a natural number) reply candidates, and corresponding prompt information, such as “smart reply” and the like.
  • the target reply candidate corresponding to the click operation can be output to the input box.
  • Referring to FIG. 5, a structural block diagram of an embodiment of a data processing apparatus according to the present application is shown, which may specifically include:
  • a to-be-replied information determination module 501, configured to determine the to-be-replied information from the clipboard information, where the to-be-replied information may correspond to all or part of the clipboard information;
  • a reply candidate determination module 502, configured to determine a reply candidate corresponding to the to-be-replied information; and
  • a reply candidate display module 503, configured to display the reply candidate.
  • the apparatus may further include:
  • the clipboard information display module is configured to display the clipboard information.
  • the reply candidate display module 503 is specifically configured to display the reply candidate above the input method keyboard.
  • the type of the clipboard information may include at least one of the following types: text, picture, audio, video, and page address.
  • the reply candidate determination module 502 may include:
  • a topic determination module, configured to determine a topic corresponding to the to-be-replied information; and
  • a candidate determination module, configured to determine, according to the topic, a reply candidate corresponding to the to-be-replied information.
  • The topic determination module may include:
  • a first topic determination module, configured to determine a topic corresponding to the page address according to the content of the page corresponding to the page address; and/or
  • a second topic determination module, configured to perform first recognition on a video stream and/or audio stream corresponding to a video, and determine a topic corresponding to the video according to the obtained first recognition result; and/or
  • a third topic determination module, configured to perform second recognition on the picture, and determine a topic corresponding to the picture according to the obtained second recognition result; and/or
  • a fourth topic determination module, configured to perform voice recognition on the audio, and determine a topic corresponding to the to-be-replied information according to the obtained voice recognition result.
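The per-type topic determination above can be pictured as a dispatch over the clipboard information type, as in the Python sketch below. Everything here is a placeholder: the keyword table, the function names, and especially the recognizers, which stand in for page analysis, image recognition, and speech recognition that this sketch does not implement.

```python
# Hypothetical keyword table used to turn recognized text into a topic.
TOPIC_KEYWORDS = {
    "food": ["dinner", "noodles"],
    "travel": ["flight", "hotel"],
}


def topic_from_text(text):
    """Return the first topic whose keyword occurs in the text."""
    lowered = text.lower()
    for topic, keywords in TOPIC_KEYWORDS.items():
        if any(keyword in lowered for keyword in keywords):
            return topic
    return "unknown"


def determine_topic(info_type, payload, recognizers):
    """Convert the payload to text with the recognizer for its type, then pick a topic."""
    if info_type not in recognizers:
        raise ValueError(f"unsupported clipboard type: {info_type}")
    return topic_from_text(recognizers[info_type](payload))


# Stand-in recognizers that return fixed text; a real system would fetch the
# page, recognize the picture, or transcribe the audio here.
recognizers = {
    "text": lambda s: s,
    "audio": lambda _audio: "shall we book a hotel",
    "page_address": lambda _url: "best noodles in town",
}

text_topic = determine_topic("text", "Dinner tonight?", recognizers)
audio_topic = determine_topic("audio", b"", recognizers)
```

Passing the recognizers in as a dictionary keeps the "and/or" nature of the sub-modules: a device can register only the types it supports.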
  • the reply candidate determination module 502 may include:
  • a search module configured to perform a search in a mapping relationship between the data to be replied and the reply data according to the information to be replied to obtain a reply candidate corresponding to the information to be replied;
  • the mapping relationship is obtained based on historical communication data corresponding to at least one user.
  • the historical communication data may include historical to-be-reply data and corresponding historical reply data.
  • the reply candidate display module 503 may include:
  • a sorting module, configured to sort multiple reply candidates based on the social relationship between the local user and the peer user; and
  • the sorting display module is configured to display the sorted multiple reply candidates.
  • Because the apparatus embodiment is basically similar to the method embodiment, its description is relatively simple; for related parts, refer to the description of the method embodiment.
  • An embodiment of the present application provides a device for data processing, including a memory, and one or more programs.
  • One or more programs are stored in the memory and configured to be executed by one or more processors.
  • The one or more programs include instructions for: determining to-be-replied information from the clipboard information, where the to-be-replied information corresponds to all or part of the clipboard information; determining a reply candidate corresponding to the to-be-replied information; and displaying the reply candidate.
  • Fig. 6 is a block diagram of a device 800 for data processing according to an exemplary embodiment.
  • the device 800 may be a mobile phone, a computer, a digital broadcasting terminal, a messaging device, a game console, a tablet device, a medical device, a fitness equipment, a personal digital assistant, and the like.
  • the device 800 may include one or more of the following components: a processing component 802, a memory 804, a power component 806, a multimedia component 808, an audio component 810, an input / output (I / O) interface 812, a sensor component 814, And communication component 816.
  • the processing component 802 generally controls the overall operations of the device 800, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations.
  • The processing component 802 may include one or more processors 820 to execute instructions to complete all or part of the steps of the method described above.
  • the processing component 802 may include one or more modules to facilitate the interaction between the processing component 802 and other components.
  • the processing component 802 may include a multimedia module to facilitate the interaction between the multimedia component 808 and the processing component 802.
  • the memory 804 is configured to store various types of data to support operation at the device 800. Examples of these data include instructions for any application or method operating on the device 800, contact data, phone book data, messages, pictures, videos, and the like.
  • The memory 804 may be implemented by any type of volatile or non-volatile storage device, or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic disk, or optical disk.
  • the power component 806 provides power to various components of the device 800.
  • the power component 806 may include a power management system, one or more power sources, and other components associated with generating, managing, and distributing power for the device 800.
  • the multimedia component 808 includes a screen that provides an output interface between the device 800 and a user.
  • the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive an input signal from a user.
  • the touch panel includes one or more touch sensors to sense touch, swipe, and gestures on the touch panel. The touch sensor may not only sense a boundary of a touch or slide action, but also detect duration and pressure related to the touch or slide operation.
  • the multimedia component 808 includes a front camera and / or a rear camera. When the device 800 is in an operation mode, such as a shooting mode or a video mode, the front camera and / or the rear camera can receive external multimedia data. Each front camera and rear camera can be a fixed optical lens system or have focal length and optical zoom capabilities.
  • the audio component 810 is configured to output and / or input audio signals.
  • the audio component 810 includes a microphone (MIC) that is configured to receive an external audio signal when the device 800 is in an operation mode, such as a call mode, a recording mode, and a voice data processing mode.
  • the received audio signal may be further stored in the memory 804 or transmitted via the communication component 816.
  • the audio component 810 further includes a speaker for outputting audio signals.
  • the I / O interface 812 provides an interface between the processing component 802 and a peripheral interface module.
  • the peripheral interface module may be a keyboard, a click wheel, a button, or the like. These buttons may include, but are not limited to: a home button, a volume button, a start button, and a lock button.
  • the sensor component 814 includes one or more sensors for providing status assessment of various aspects of the device 800.
  • the sensor component 814 can detect the on / off state of the device 800 and the relative positioning of the components, such as the display and keypad of the device 800.
  • The sensor component 814 can also detect a change in the position of the device 800 or of a component of the device 800, the presence or absence of user contact with the device 800, the orientation or acceleration/deceleration of the device 800, and a temperature change of the device 800.
  • the sensor component 814 may include a proximity sensor configured to detect the presence of nearby objects without any physical contact.
  • the sensor component 814 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications.
  • the sensor component 814 may further include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
  • the communication component 816 is configured to facilitate wired or wireless communication between the device 800 and other devices.
  • the device 800 can access a wireless network based on a communication standard, such as WiFi, 2G, or 3G, or a combination thereof.
  • the communication component 816 receives a broadcast signal or broadcast-related information from an external broadcast management system via a broadcast channel.
  • the communication component 816 further includes a near field communication (NFC) module to facilitate short-range communication.
  • The NFC module can be implemented based on radio frequency identification (RFID) technology, Infrared Data Association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.
  • The device 800 may be implemented by one or more application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable gate arrays (FPGAs), controllers, microcontrollers, microprocessors, or other electronic components to perform the above method.
  • A non-transitory computer-readable storage medium including instructions is also provided; the instructions may be executed by the processor 820 of the device 800 to complete the foregoing method.
  • the non-transitory computer-readable storage medium may be a ROM, a random access memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, and the like.
  • FIG. 7 is a schematic structural diagram of a server in some embodiments of the present application.
  • The server 1900 may vary considerably depending on configuration or performance, and may include one or more central processing units (CPUs) 1922 (for example, one or more processors), memory 1932, and one or more storage media 1930 (for example, one or more mass storage devices).
  • the memory 1932 and the storage medium 1930 may be temporary storage or persistent storage.
  • the program stored in the storage medium 1930 may include one or more modules (not shown in the figure), and each module may include a series of instruction operations on the server.
  • the central processing unit 1922 may be configured to communicate with the storage medium 1930, and execute a series of instruction operations in the storage medium 1930 on the server 1900.
  • The server 1900 may also include one or more power sources 1926, one or more wired or wireless network interfaces 1950, one or more input/output interfaces 1958, one or more keyboards 1956, and/or one or more operating systems 1941, such as Windows Server™, Mac OS X™, Unix™, Linux™, FreeBSD™, and so on.
  • A non-transitory computer-readable storage medium is provided; when instructions in the storage medium are executed by a processor of a device (server or terminal), the device is enabled to execute the data processing method shown in FIG. 2 or FIG. 3.
  • A non-transitory computer-readable storage medium is provided; when instructions in the storage medium are executed by a processor of a device (server or terminal), the device is enabled to execute a data processing method, the method including: determining to-be-replied information from clipboard information, where the to-be-replied information corresponds to all or part of the clipboard information; determining a reply candidate corresponding to the to-be-replied information; and displaying the reply candidate.

Abstract

Provided in the embodiments of the present application are a method and device for data processing and a device for use in data processing. The method specifically comprises: determining information to be replied from clipboard information; the information to be replied corresponding to the entirety or a portion of the clipboard information; determining a reply candidate corresponding to the information to be replied; and displaying the reply candidate. The embodiments of the present application increase reply efficiency and implement smart replying in a scenario of replying across communication windows.

Description

Data processing method, device, and device for data processing
This application claims priority to the Chinese patent application filed with the Chinese Patent Office on September 20, 2018, with application number 201811101152.5 and invention title "A data processing method, device, and device for data processing", the entire contents of which are incorporated herein by reference.
Technical Field
The present application relates to the field of communications technologies, and in particular, to a data processing method, a data processing device, and a device for data processing.
Background
With the development of communication technology, communication applications such as SMS applications and instant messaging applications can provide users with information interaction functions so that different users can exchange information. For example, different users can send text messages to each other through an SMS application. As another example, different users can also send messages to each other through an instant messaging application.
In practical applications, when a user receives information sent by a communication peer, the user usually needs to think of a suitable sentence to reply with, so reply efficiency is low.
Summary of the Invention
The embodiments of the present application provide a data processing method, a data processing device, and a device for data processing, which can improve reply efficiency and can implement intelligent replies in a reply scenario across communication windows.
To solve the above problem, an embodiment of the present application discloses a data processing method, including:
determining to-be-replied information from clipboard information, where the to-be-replied information corresponds to all or part of the clipboard information;
determining a reply candidate corresponding to the to-be-replied information; and
displaying the reply candidate.
In another aspect, an embodiment of the present application discloses a data processing device, including:
a to-be-replied information determination module, configured to determine to-be-replied information from clipboard information, where the to-be-replied information corresponds to all or part of the clipboard information;
a reply candidate determination module, configured to determine a reply candidate corresponding to the to-be-replied information; and
a reply candidate display module, configured to display the reply candidate.
In yet another aspect, an embodiment of the present application discloses a device for data processing, including a memory and one or more programs, where the one or more programs are stored in the memory and are configured to be executed by one or more processors, the one or more programs including instructions for performing the operations of the foregoing data processing method.
In still another aspect, an embodiment of the present application discloses a machine-readable medium having instructions stored thereon that, when executed by one or more processors, cause a device to execute the data processing method according to one or more of the foregoing.
The embodiments of the present application include the following advantages:
The embodiments of the present application allow a user to carry the to-be-replied information via the clipboard, so that the to-be-replied information can be determined from the clipboard information and a reply candidate corresponding to it can be provided, which improves reply efficiency. The embodiments of the present application can be applied to a reply scenario across communication windows. Specifically, a user can copy information in a first communication window and jump to a second communication window; the embodiments of the present application can then automatically determine the to-be-replied information from the clipboard information and provide a reply candidate corresponding to it. An intelligent reply can therefore be implemented in a reply scenario across communication windows, improving reply efficiency.
The embodiments of the present application may also be applied to a reply scenario within a communication window, in which the information received by the user and the information sent are located in the same communication window. Operating systems with strict security requirements, such as iOS, do not support reading the screen content directly; therefore, the embodiments of the present application can determine the to-be-replied information through the clipboard information.
Brief Description of the Drawings
To explain the technical solutions of the embodiments of the present application more clearly, the drawings used in the description of the embodiments are briefly introduced below. Obviously, the drawings described below are only some embodiments of the present application; a person of ordinary skill in the art can obtain other drawings from these drawings without creative effort.
FIG. 1 is a schematic diagram of an application environment of a data processing method according to an embodiment of the present application;
FIG. 2 is a flowchart of the steps of a first embodiment of a data processing method according to the present application;
FIG. 3 is a flowchart of the steps of a second embodiment of a data processing method according to the present application;
FIG. 4 is a schematic diagram of an interface according to an embodiment of the present application;
FIG. 5 is a structural block diagram of an embodiment of a data processing device according to the present application;
FIG. 6 is a block diagram of a device 800 for data processing according to the present application; and
FIG. 7 is a schematic structural diagram of a server in some embodiments of the present application.
DETAILED DESCRIPTION
The technical solutions in the embodiments of the present application are described clearly and completely below with reference to the drawings in the embodiments. Obviously, the described embodiments are some, but not all, of the embodiments of the present application. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments in the present application without creative effort fall within the protection scope of the present application.
The embodiments of the present application provide a data processing solution, which may include: determining information to be replied to from clipboard information, where the information to be replied to may correspond to all or part of the clipboard information; determining a reply candidate corresponding to the information to be replied to; and displaying the reply candidate.
In the embodiments of the present application, the clipboard is an area in memory and is a plug-in in a preset program. Through the clipboard, the user can transfer and share selected information among various applications by simple actions such as cut, copy, and paste. The clipboard temporarily stores cut and copied information by using the terminal's internal memory or virtual memory.
The preset program may specifically include applications with a clipboard function, such as browser programs, instant messaging programs, and social network programs (for example, Weibo, forums, and news).
The information in the clipboard is selected by the user. The type of the clipboard information may include at least one of the following: text, picture, audio, video, and page address.
In practical applications, the clipboard may contain only one piece of information, or it may contain multiple pieces of information. For example, the clipboard may contain three pieces of information, where the first is of the text type, the second is of the voice type, and the third is of the picture type. It can be understood that the embodiments of the present application do not limit the number of pieces of clipboard information or their specific types.
The embodiments of the present application are applicable to reply scenarios across communication windows. "Across communication windows" may include different communication windows of different applications, or different communication windows of the same application.
Different windows of different applications may include: a communication window of a short message application and a communication window of an instant messaging application, or a communication window of a first instant messaging application and a communication window of a second instant messaging application. For example, user A receives information A sent by user B through a short message application. Because sending a short message incurs the corresponding short message charge, the embodiments of the present application allow user A to send the reply to information A to user B through an instant messaging application. Accordingly, user A may copy information A and jump to the communication window between user A and user B in the instant messaging application; the embodiments can then take the clipboard information (that is, the content of information A) as the information to be replied to and automatically provide reply candidates for user A to choose from, so that user A can reply quickly across scenarios, which can improve reply efficiency.
Different windows of the same application are described here. For example, user C receives information B sent by user D through communication window A of an instant messaging application. Because communication window A is a group window, a reply to information B may involve privacy; therefore, the embodiments of the present application allow user C to send the reply to information B to user D through communication window B of the instant messaging application. Accordingly, user C may copy information B and jump to communication window B between user C and user D in the instant messaging application; the embodiments can then take the clipboard information (that is, the content of information B) as the information to be replied to and automatically provide reply candidates for user C to choose from, so that user C can reply quickly across scenarios, which can improve reply efficiency. For example, user D is user C's supervisor, communication window A is the communication window of a work group, and communication window B is the communication window between user C and user D. Because a reply posted in the work group's communication window may be seen by users other than user C and user D, it is inconvenient to reply to information B there.
The embodiments of the present application can also be applied to reply scenarios within a communication window, that is, scenarios in which the information the user receives and the information the user sends are located in the same communication window. Operating systems with stricter security requirements, such as iOS, do not support directly reading the content of the screen; therefore, the embodiments can determine the information to be replied to from the clipboard information.
The embodiments of the present application allow a user to carry the information to be replied to via the clipboard, so that the information to be replied to can be determined from the clipboard information and the corresponding reply candidates can be provided, which can improve reply efficiency. The embodiments can be applied to reply scenarios across communication windows. Specifically, the user may copy information in a first communication window and jump to a second communication window; the embodiments can then automatically determine the information to be replied to from the clipboard information and provide the corresponding reply candidates, so that intelligent replies can be implemented in cross-window reply scenarios and reply efficiency can be improved.
It should be noted that the user's jump to the second communication window described above is only an optional embodiment. In fact, the solution of the embodiments of the present application need not depend on a jump between communication windows: even if the user does not jump to another communication window, the embodiments can still automatically determine the information to be replied to from the clipboard information and provide the corresponding reply candidates.
The data processing method provided in the embodiments of the present application can be applied in application environments such as websites and/or APPs (applications) to improve reply efficiency. For example, the APP may be a communication application, and the website may be a web page that provides a communication service.
The data processing method provided in the embodiments of the present application can be applied in the application environment shown in FIG. 1. As shown in FIG. 1, the client 100 and the server 200 are located in a wired or wireless network, through which the client 100 and the server 200 exchange data.
Optionally, the client 100 may run on a terminal. The terminal specifically includes, but is not limited to: smartphones, tablet computers, e-book readers, MP3 (Moving Picture Experts Group Audio Layer III) players, MP4 (MPEG-4 Part 14) players, laptop computers, in-vehicle computers, desktop computers, set-top boxes, smart TVs, wearable devices, and so on.
The client 100 may be an APP running on the device, such as an instant messaging APP, a short message APP, an input method APP, or an APP built into the operating system. The embodiments of the present application do not limit the specific APP corresponding to the client.
Method Embodiment 1
Referring to FIG. 2, a flowchart of the steps of a first embodiment of a data processing method of the present application is shown. The method may specifically include the following steps:
Step 201: determine information to be replied to from the clipboard information, where the information to be replied to may correspond to all or part of the clipboard information;
Step 202: determine a reply candidate corresponding to the information to be replied to;
Step 203: display the reply candidate.
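The three steps above can be sketched as a small pipeline. This is only an illustrative sketch: the `ClipboardItem` type, the function names, and the `toy_suggest` rule are hypothetical and are not defined by the application, which leaves the concrete way of producing candidates open.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class ClipboardItem:
    kind: str     # hypothetical tag: "text", "picture", "audio", "video", "url"
    content: str  # raw content, or a reference to it


def determine_pending(items: List[ClipboardItem]) -> List[ClipboardItem]:
    """Step 201: keep all or part of the clipboard information.

    Here "part" simply means dropping items whose content is blank.
    """
    return [item for item in items if item.content.strip()]


def determine_candidates(
    pending: List[ClipboardItem],
    suggest: Callable[[ClipboardItem], List[str]],
) -> List[str]:
    """Step 202: collect reply candidates, removing duplicates in order."""
    candidates: List[str] = []
    for item in pending:
        for candidate in suggest(item):
            if candidate not in candidates:
                candidates.append(candidate)
    return candidates


def display_candidates(candidates: List[str]) -> str:
    """Step 203: render the candidates as a numbered list."""
    return "\n".join(f"{i}. {c}" for i, c in enumerate(candidates, start=1))


def toy_suggest(item: ClipboardItem) -> List[str]:
    """Hypothetical stand-in for the real candidate model."""
    if item.kind == "text" and "?" in item.content:
        return ["Yes", "No", "Let me check"]
    return ["Got it"]
```

Passing the suggestion function as a parameter keeps the sketch neutral about whether candidates are produced on the client or on the server.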
At least one step of the method embodiment shown in FIG. 2 may be performed by the server and/or the client. The embodiments of the present application do not limit the specific entity that performs each step.
The method embodiment of FIG. 2 may have a corresponding trigger condition.
According to one embodiment, the trigger condition may include: the input method keyboard is invoked. The invocation of the input method keyboard can indicate that the user intends to reply by typing; therefore, the method of the embodiments of the present application can be triggered.
According to another embodiment, the trigger condition may include: the clipboard information is updated. An update of the clipboard information may mean that the user has produced new clipboard information, which indicates that the user has a need to reply; therefore, the method of the embodiments of the present application can be triggered.
According to yet another embodiment, the trigger condition may include: the clipboard information is updated and the input method keyboard is invoked. This indicates that the user needs to reply by typing; therefore, the method of the embodiments of the present application can be triggered.
According to still another embodiment, the trigger condition may include: after the clipboard information is updated, a jump to a communication window occurs. A jump to a communication window may mean a jump from the interface shown before the clipboard operation to a communication window. This can indicate that the user intends to reply through the communication window reached by the jump; therefore, the method of the embodiments of the present application can be triggered. The interface shown before the clipboard operation may be a communication window or a non-communication window.
According to still another embodiment, the trigger condition may include: after the clipboard information is updated, a jump to a communication window occurs and the input method keyboard is invoked.
It can be understood that the above trigger conditions are only optional embodiments. In fact, a person skilled in the art may determine the trigger condition according to actual application requirements; for example, the trigger condition may also be a preset gesture of the user. The embodiments of the present application do not limit the specific trigger condition.
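The five trigger conditions listed above can be expressed as predicates over a small UI state. The following is a minimal sketch; the `Trigger` enum, the `UiState` fields, and `should_trigger` are hypothetical names chosen for illustration, and how a real client observes these events is platform-specific and not covered here.

```python
from dataclasses import dataclass
from enum import Enum


class Trigger(Enum):
    KEYBOARD_INVOKED = 1
    CLIPBOARD_UPDATED = 2
    CLIPBOARD_UPDATED_AND_KEYBOARD = 3
    CLIPBOARD_UPDATED_THEN_JUMP = 4
    CLIPBOARD_UPDATED_THEN_JUMP_AND_KEYBOARD = 5


@dataclass
class UiState:
    keyboard_invoked: bool = False
    clipboard_updated: bool = False
    # True only if the jump happened after the clipboard update.
    jumped_to_comm_window: bool = False


def should_trigger(policy: Trigger, state: UiState) -> bool:
    """Return True if the configured trigger condition holds for this state."""
    updated_then_jump = state.clipboard_updated and state.jumped_to_comm_window
    if policy is Trigger.KEYBOARD_INVOKED:
        return state.keyboard_invoked
    if policy is Trigger.CLIPBOARD_UPDATED:
        return state.clipboard_updated
    if policy is Trigger.CLIPBOARD_UPDATED_AND_KEYBOARD:
        return state.clipboard_updated and state.keyboard_invoked
    if policy is Trigger.CLIPBOARD_UPDATED_THEN_JUMP:
        return updated_then_jump
    return updated_then_jump and state.keyboard_invoked
```

A preset user gesture, mentioned in the text as a further option, would simply be one more enum member and one more field.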
In step 201, the clipboard information can be obtained by accessing the clipboard. The clipboard information may include one piece of content or multiple pieces of content. For example, in response to a long-press operation by the user, a selection control corresponding to each piece of information may be displayed in the communication window, so that the user can select at least one piece of information to copy.
Step 201 may determine the information to be replied to from the clipboard information, and the information to be replied to may correspond to all or part of the clipboard information. For example, the clipboard information may include a message together with the identifier of the message's sender; in that case, the sender identifier can be filtered out of the clipboard information and the message retained as the information to be replied to.
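As an illustration of filtering out the sender identifier, the sketch below assumes that each copied line has the form `<sender>: <message>` and that the set of sender names is known. Both assumptions are hypothetical; the application does not specify how the identifier appears in the clipboard.

```python
from typing import Iterable


def strip_sender(clipboard_text: str, known_senders: Iterable[str]) -> str:
    """Remove a leading '<sender>:' prefix from each line, keep the message.

    Only names in known_senders are stripped, so a colon inside the message
    itself (for example a time such as 10:30) is left untouched.
    """
    # Accept both the ASCII colon and the full-width colon.
    prefixes = [name + sep for name in known_senders for sep in (":", "：")]
    kept = []
    for line in clipboard_text.splitlines():
        body = line.strip()
        for prefix in prefixes:
            if body.startswith(prefix):
                body = body[len(prefix):].strip()
                break
        if body:
            kept.append(body)
    return "\n".join(kept)
```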
In the embodiments of the present application, optionally, the type of the clipboard information may include at least one of the following: text, picture, audio, video, and page address. The embodiments can determine information to be replied to of at least one type and determine the corresponding reply candidates for it.
The embodiments of the present application may provide the following technical solutions for determining the reply candidate corresponding to the information to be replied to:
In technical solution 1, determining the reply candidate corresponding to the information to be replied to in step 202 may specifically include: determining a topic corresponding to the information to be replied to; and determining, according to the topic, the reply candidate corresponding to the information to be replied to.
The topic may refer to the central idea expressed by the information to be replied to. The embodiments of the present application may represent the topic by topic keywords, which are keywords that reflect the topic of the information to be replied to.
Optionally, determining the topic corresponding to the information to be replied to may specifically include:
Determination method A1: determine the topic corresponding to a page address according to the content of the page at that address; and/or
Determination method A2: perform first recognition on the video stream and/or audio stream corresponding to a video, and determine the topic corresponding to the video according to the first recognition result; and/or
Determination method A3: perform second recognition on a picture, and determine the topic corresponding to the picture according to the second recognition result; and/or
Determination method A4: perform speech recognition on audio, and determine the topic corresponding to the information to be replied to according to the speech recognition result.
A video usually consists of still pictures, which are called video frames. The video stream corresponding to a video represents the consecutive video frames. The audio stream corresponding to a video represents a continuous audio signal that is synchronized with the consecutive video frames, so that the picture and the sound are played back in sync.
In practical applications, the audio stream corresponding to a video may correspond to video content such as the dialogue and the soundtrack. The soundtrack may include the theme song, interlude songs, the ending song, and the background music accompanying the dialogue. It can be understood that the embodiments of the present application do not limit the specific video content to which the audio stream corresponds.
In practical applications, the video stream and the audio stream corresponding to a video may be located in the same file. In this case, the audio can be extracted from the video file; specifically, the video file can be converted into an audio file, for example by converting a video file in MP4 (MPEG-4 Part 14) format into an audio file in MP3 (Moving Picture Experts Group Audio Layer III) format. Alternatively, the video stream and the audio stream may be located in separate files, that is, the video file and the audio file may be independent; in this case, the audio file can be obtained directly. The audio file contains the audio stream corresponding to the video, so that audio stream can be read from the audio file.
In practical applications, a number of video frames can be extracted from the video at a preset time interval, and the extracted video frames can serve as the objects of image recognition. A person skilled in the art may determine the preset time interval according to actual application requirements. For example, the preset time interval may be the playback duration corresponding to N video frames, where N is a positive integer. The embodiments of the present application do not limit the specific value of N or the preset time interval.
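The sampling rule above (one frame every N frames) can be sketched as follows. The function names are hypothetical, and decoding the selected frames with a real video library is deliberately left out.

```python
from typing import List


def sample_frame_indices(total_frames: int, n: int) -> List[int]:
    """Indices of the frames to extract when sampling every n-th frame."""
    if n <= 0:
        raise ValueError("n must be a positive integer")
    return list(range(0, total_frames, n))


def interval_seconds(n: int, fps: float) -> float:
    """Preset time interval: the playback duration of n frames at a frame rate."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    return n / fps
```

For a 30 frames-per-second video, choosing N = 30 corresponds to sampling one frame per second of playback.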
The embodiments of the present application may recognize the video stream and/or audio stream corresponding to a video in the following ways:
Recognition method 1: perform image recognition on the video stream corresponding to the video to obtain the corresponding image target information; and/or
Recognition method 2: perform text recognition on the video stream corresponding to the video to obtain the corresponding text information; and/or
Recognition method 3: perform speech recognition on the audio stream corresponding to the video to obtain the corresponding text information.
In recognition method 1, image recognition refers to a technology that uses a machine to process, analyze, and understand images in order to identify image targets of various patterns. In the embodiments of the present application, a machine can process, analyze, and understand video frames in order to identify image targets of various patterns. An image target usually corresponds to a certain image region in the video frame, and the image targets in a video frame may include objects, persons, spaces, and so on. For example, a person may be a person appearing in the video frame, an object may be an item worn by that person, and a space may be the environment in which the person is located, such as an outdoor or indoor environment; an indoor environment may include information such as the walls and floor of the room. It can be understood that the embodiments of the present application do not limit the specific image targets in a video frame.
In an optional embodiment of the present application, the process of performing image recognition on the video frames corresponding to the video stream and/or audio stream may include: detecting the image targets in a video frame, and analyzing the detected image targets with a deep learning method to obtain the corresponding image target information. The recognition result in the embodiments may therefore include the image target information corresponding to the video frame. The image target information may include: the image of the image target (that is, the image of the target within the video frame, where the target usually corresponds to a certain closed region), and the recognition result of the image target (such as the recognized name and category of the target). For example, face detection technology can be used to detect a face in the video frame, and a deep learning method can be used to analyze the face to obtain information such as the person's gender and age, and possibly the person's source, such as the film or television drama in which the person appears, or even which celebrity the person is. Further, items worn by the person, such as clothing, shoes, watches, and jewelry, can be detected. Alternatively, information about the space in which the person is located can be detected.
The text information in a video frame may include text information contained in the image and/or text information in the subtitles.
For recognition method 2, text recognition technology may be used to perform text recognition on the video frames corresponding to the video stream and/or audio stream. The text recognition technology may include OCR (Optical Character Recognition) technology. After pre-processing the image, for example by noise reduction, OCR segments the characters in the image to obtain individual character images and recognizes the character corresponding to each character image. It can be understood that the embodiments of the present application do not limit the specific text recognition technology.
For recognition method 2, the subtitle file corresponding to the subtitles of the video frames may be obtained and the text information in the subtitles read from that file; alternatively, a screenshot of the picture corresponding to a video frame may be taken and text recognition performed on the screenshot to obtain the text information in the subtitles. It can be understood that the embodiments of the present application do not limit the specific way of obtaining the text information in the subtitles.
For recognition method 3, speech recognition technology may be used to convert the audio stream corresponding to the video into text information. Denote the audio stream corresponding to the video as S. After a series of processing steps, S yields the corresponding speech feature sequence O = {O_1, O_2, ..., O_i, ..., O_T}, where O_i is the i-th speech feature and T is the total number of speech features. The sentence corresponding to the audio stream S can be regarded as a word string composed of many words, denoted W = {w_1, w_2, ..., w_n}, where n is a natural number. The process of speech recognition is to find the most probable word string W given the known speech feature sequence O.
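Finding the most probable word string W given O is commonly written as a maximum a posteriori decision. The formulation below is the standard textbook decomposition via Bayes' rule and is offered as an assumption about how the statement above would typically be formalized; the application itself does not give this formula. The symbol W^* for the recognized word string is introduced here for illustration.

```latex
W^{*} = \arg\max_{W} P(W \mid O)
      = \arg\max_{W} \frac{P(O \mid W)\, P(W)}{P(O)}
      = \arg\max_{W} P(O \mid W)\, P(W)
```

Here P(O | W) is usually called the acoustic model and P(W) the language model; P(O) does not depend on W and can be dropped from the maximization.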
Specifically, speech recognition is a model matching process. In this process, a speech model is first built according to the characteristics of human speech, and the templates required for speech recognition are built by analyzing input speech signals and extracting the required features. Recognizing the speech input by the user is then a process of comparing the features of that speech with the templates and finally determining the template that best matches it, thereby obtaining the speech recognition result. The specific speech recognition algorithm may be a training and recognition algorithm based on statistical hidden Markov models, a training and recognition algorithm based on neural networks, a recognition algorithm based on dynamic time warping, or another algorithm. The embodiments of the present application do not limit the specific speech recognition process.
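Of the algorithms named above, template matching with dynamic time warping (DTW) is the simplest to illustrate. The sketch below assumes one-dimensional feature sequences and an absolute-difference local cost; real systems use multi-dimensional acoustic features, and the template labels here are invented for the example.

```python
from typing import Dict, Sequence


def dtw_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Classic dynamic-programming DTW distance between two sequences."""
    inf = float("inf")
    n, m = len(a), len(b)
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b repeats
                cost[i][j - 1],      # b advances, a repeats
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


def best_template(features: Sequence[float],
                  templates: Dict[str, Sequence[float]]) -> str:
    """Return the label of the template closest to the input features."""
    return min(templates, key=lambda label: dtw_distance(features, templates[label]))
```

DTW tolerates differences in speaking rate because one element of a sequence may be aligned with several elements of the other.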
In one application example of the present application, user G receives web address A sent by user H and copies web address A. The embodiments of the present application can then automatically analyze the page content at web address A to obtain the corresponding topic A, which may relate to entertainment gossip, the national economy and people's livelihood, and so on, and automatically give reply candidates corresponding to topic A, such as "I'll take a look at this page too" or "I also like content about topic A".
In another application example of the present application, user I receives video A sent by user J and copies video A. The embodiments of the present application can then automatically recognize video A to obtain the corresponding topic B, which may relate to a child's life in kindergarten, and automatically give reply candidates corresponding to topic B, such as "It looks like the child is having a great time at kindergarten".
In another application example of the present application, user K receives picture A sent by user L and copies picture A. The embodiments of the present application can then automatically recognize picture A to obtain the corresponding topic C, which may relate to a piece of clothing such as a coat, a down jacket, or a skirt, and automatically give reply candidates corresponding to topic C, such as "Topic C looks great and is worth buying", "Topic C looks a bit old-fashioned", or "Topic C makes you look bigger".
The embodiments of the present application may provide the following ways of determining the topic corresponding to the information to be replied to:
Determination method B1: use the TF-IDF (term frequency-inverse document frequency) method to determine the topic keywords corresponding to the information to be replied to.
The main idea of TF-IDF is as follows: if a word or phrase appears with a high term frequency (TF) in one document or piece of text and rarely appears in other documents or texts, the word or phrase is considered to discriminate well between categories and is suitable for classification.
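The idea above can be sketched with a few lines of standard-library code. The smoothed IDF variant used here, log((1 + N) / (1 + df)) + 1, is one common choice and is an assumption of this sketch, as are the function name and the pre-tokenized input; the application does not fix a particular weighting formula or tokenizer.

```python
import math
from collections import Counter
from typing import List, Sequence


def tfidf_keywords(doc_tokens: Sequence[str],
                   corpus: Sequence[Sequence[str]],
                   top_k: int = 3) -> List[str]:
    """Rank the terms of doc_tokens by TF-IDF against a reference corpus."""
    counts = Counter(doc_tokens)
    total = len(doc_tokens)
    n_docs = len(corpus)
    scores = {}
    for term, count in counts.items():
        tf = count / total
        # Document frequency: number of corpus documents containing the term.
        df = sum(1 for other in corpus if term in other)
        idf = math.log((1 + n_docs) / (1 + df)) + 1.0  # smoothed IDF
        scores[term] = tf * idf
    # Highest score first; ties are broken alphabetically for determinism.
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    return ranked[:top_k]
```

A term such as "the", which occurs in every corpus document, receives the minimum IDF and therefore ranks below a term that is frequent in the message but rare elsewhere.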
Determination method B2: use an LDA (Latent Dirichlet Allocation) model to determine the topic keywords corresponding to the information to be replied to.
LDA模型是一种文档生成模型,是一种非监督机器学习技术。它认为一篇文档或者一段文本是有多个主题的,而每个主题又对应着不同的主题关键词。一篇文档或者一段文本的构造过程,首先是以一定的概率选择某个主题,然后再在这个主题下以一定的概率选出某一个主题关键词,这样就生成了这篇文档的第一个主题关键词。不断重复这个过程,就生成了一篇文档或者一段文本。LDA的使用是上述文档生成过程的逆过程,即根据一篇文档或者一段文本,去寻找出这篇文档或者这段文本的主题,以及这些主题所对应的主题关键词。The LDA model is a document generation model and an unsupervised machine learning technology. It thinks that a document or a text has multiple topics, and each topic corresponds to a different topic keyword. The process of constructing a document or a text, first select a certain topic with a certain probability, and then select a certain topic keyword with a certain probability under this topic, so the first one of this document is generated. Theme keywords. Repeating this process continuously produces a document or a text. The use of LDA is the inverse process of the above document generation process, that is, to find the theme of this document or this text, and the topic keywords corresponding to these topics according to a document or a text.
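The generation process described above can be illustrated with the following minimal sketch. It is a simplification: in full LDA the topic mixture and the per-topic keyword distributions are drawn from Dirichlet priors and inferred from data, whereas here they are fixed by hand. The names `TOPIC_WORDS` and `generate_document` and all probabilities are hypothetical.

```python
import random

# Hypothetical topic -> keyword distributions, fixed by hand for illustration.
TOPIC_WORDS = {
    "kindergarten": {"child": 0.5, "teacher": 0.3, "play": 0.2},
    "clothing": {"coat": 0.4, "skirt": 0.3, "price": 0.3},
}

def generate_document(topic_mix, length, rng):
    """Repeatedly pick a topic, then pick a keyword under that topic."""
    topics = list(topic_mix)
    words = []
    for _ in range(length):
        # Step 1: select a topic with the probability given by the mixture.
        topic = rng.choices(topics, weights=[topic_mix[t] for t in topics])[0]
        # Step 2: select a keyword with the probability given by that topic.
        dist = TOPIC_WORDS[topic]
        word = rng.choices(list(dist), weights=list(dist.values()))[0]
        words.append((topic, word))
    return words

rng = random.Random(0)
doc = generate_document({"kindergarten": 0.8, "clothing": 0.2}, length=6, rng=rng)
```

Using LDA for topic determination corresponds to running this process in reverse, that is, estimating the topic mixture and keyword distributions from observed text; that inference step is not shown here.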
Determination manner B3: use a classification model to determine the category corresponding to the information to be replied to, and obtain the topic keywords from the information of that category. The classification model may include a fastText model. For an input word sequence (a piece of text or a sentence), the fastText model can output the probability that the word sequence belongs to each category. The fastText model can form a feature vector from the words and phrases in the word sequence; the feature vector is mapped to a middle layer through a linear transformation, and the middle layer is then mapped to the corresponding preset categories. Optionally, fastText may use a non-linear activation function when mapping to the preset categories. fastText has the advantages of high speed and high accuracy. Of course, the embodiment of the present application does not limit the specific classification model.
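The structure described for determination manner B3 (features, linear middle layer, mapping to preset categories) might be sketched as below. This is not the fastText library and involves no training: the parameters are random stand-ins, and the category names, the dimension `DIM`, and the helper names are assumptions made only for illustration.

```python
import math
import random

CATEGORIES = ["notification", "inquiry"]  # hypothetical preset categories
DIM = 4                                   # hypothetical embedding size

def ngram_features(tokens):
    """Words plus adjacent-word bigrams, forming the bag-of-features input."""
    return tokens + [a + "_" + b for a, b in zip(tokens, tokens[1:])]

def softmax(scores):
    """Non-linear mapping from raw scores to probabilities that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def classify(tokens, embeddings, output_weights):
    feats = [f for f in ngram_features(tokens) if f in embeddings]
    if not feats:
        # No known feature: fall back to a uniform distribution.
        return [1.0 / len(output_weights)] * len(output_weights)
    # Middle layer: average of the feature embeddings (a linear map).
    hidden = [sum(embeddings[f][i] for f in feats) / len(feats) for i in range(DIM)]
    # Output layer: one linear score per category, then softmax.
    scores = [sum(w * h for w, h in zip(row, hidden)) for row in output_weights]
    return softmax(scores)

rng = random.Random(0)
tokens = ["come", "to", "the", "meeting", "room"]
# Untrained random parameters: stand-ins for weights a real model would learn.
embeddings = {f: [rng.uniform(-1, 1) for _ in range(DIM)] for f in ngram_features(tokens)}
output_weights = [[rng.uniform(-1, 1) for _ in range(DIM)] for _ in CATEGORIES]
probabilities = classify(tokens, embeddings, output_weights)
```

Because the weights are untrained, the resulting probabilities carry no meaning; the sketch only shows how a word sequence flows through the layers to yield one probability per category.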
It can be understood that a person skilled in the art may, according to actual application requirements, adopt any one or a combination of determination manners B1 to B3 above; the embodiment of the present application does not limit the specific process of determining the topic corresponding to the information to be replied to.
The embodiment of the present application may determine, according to the topic, the reply candidates corresponding to the information to be replied to. Optionally, the information to be replied to is notification-type information; in this case, the reply candidates may be "Received", "OK", and the like. The topic corresponding to the information to be replied to "Come to the meeting room for a moment" may be "location notification", and the topic corresponding to the information to be replied to "One part of your proposal needs to be revised" may be "work revision notification".
Optionally, the information to be replied to may be inquiry-type information; in this case, a reply candidate may be an affirmative candidate, a negative candidate, one of several options, or the answer to a question. For example, if the information to be replied to is "Shall we eat at place A or place B tomorrow?", the corresponding reply candidates may include: place A, or place B. As another example, if the information to be replied to is "Have you gotten off work?", the corresponding reply candidates may include: "Yes", or "Not yet". As a further example, if the information to be replied to is "What are you doing?", the corresponding reply candidates may include: "Eating", "Watching a video", and the like.
Technical solution 2
In technical solution 2, determining in step 202 the reply candidates corresponding to the information to be replied to may specifically include: searching, according to the information to be replied to, a mapping relationship between to-be-replied data and reply data, so as to obtain the reply candidates corresponding to the information to be replied to; the mapping relationship may be obtained from historical communication data corresponding to at least one user, and the historical communication data may include historical to-be-replied data and the corresponding historical reply data.
In a specific implementation, the above mapping relationship may be determined as follows: obtain the historical communication data of at least one user, where the historical communication data may include historical to-be-replied data and the corresponding historical reply data; for each piece of historical to-be-replied data, extract the corresponding historical reply content; and take the historical reply content that satisfies a preset condition as the reply data for that piece of historical to-be-replied data. The mapping relationship can thus be determined from the historical to-be-replied data and the corresponding reply data.
The user may be the user of the current terminal, or at least one sampled user across the whole network, and the historical communication data obtained differs accordingly. For example:
① Obtain at least one group of historical communication data generated by the user of the current terminal. For example, the reply data of the user with respect to communication content may be obtained, and at least one question-answer pair may be extracted from the reply data. Each question-answer pair may include communication content and the reply content for that communication content; the at least one question-answer pair is the at least one group of historical communication data generated by the user.
For example, the terminal may frequently receive the text message "What time do you get off work?", to which the terminal sometimes replies "9 o'clock" and sometimes replies "8 o'clock". "What time do you get off work?" and "9 o'clock" then form one question-answer pair, and "What time do you get off work?" and "8 o'clock" form another. Personalized cache data can be built from these pairs: "What time do you get off work? ① 9 o'clock ② 8 o'clock". When the terminal receives the same text message again, two reply candidates are given directly: ① 9 o'clock ② 8 o'clock. The user does not need to type and can reply to the text message with a single tap.
② Determine at least one sampled user, and obtain the at least one group of historical communication data generated by the at least one sampled user. The at least one sampled user may be, for example, all users in the system or some of the users in the system. With this scheme, the historical communication data of each user need not be analyzed separately, which improves the efficiency of obtaining historical communication data.
After at least one group of historical communication data is obtained, the historical to-be-replied data in each group may first be obtained, and identical pieces of historical to-be-replied data may then be merged, yielding all the historical to-be-replied data contained in the historical communication data. In a group of historical communication data, the historical to-be-replied data may be the preceding message in a conversation, with the historical reply data being the reply content generated for that message; alternatively, the historical to-be-replied data may be question data posted on the network, with the historical reply data being the answers generated for that question data, and so on.
After all the historical to-be-replied data contained in the historical communication data is obtained, the historical reply data corresponding to each piece of historical to-be-replied data can be obtained, together with the number of occurrences of each piece of historical reply data.
Further, after each piece of historical to-be-replied data and its corresponding historical reply data are obtained, the historical reply data whose number of occurrences exceeds a preset number (for example, 20 or 30) may be taken as the reply data for that piece of historical to-be-replied data. Alternatively, the historical reply data may be sorted by number of occurrences in descending order, and the top-ranked items (for example, the top 4 or 5) may be taken as the reply data. The mapping relationship between to-be-replied data and reply data can thereby be obtained.
Taking a number of occurrences greater than a preset number (for example, 20) as an example, the to-be-replied data "What time do you get off work?" has two pieces of reply data: ① 8 o'clock ② 9 o'clock. The to-be-replied data "Have you eaten?" has six pieces of reply data: ① I've eaten ② No ③ Not yet ④ Finished eating ⑤ Yes ⑥ Not yet, no. The to-be-replied data "Get some rest early, good night" has one piece of reply data: ① Good night. It can be understood that the embodiment of the present application does not limit the specific mapping relationship.
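As a non-limiting illustration of technical solution 2, the mapping relationship might be built and queried as sketched below. The function `build_reply_mapping`, its parameters `min_count` and `top_k`, and the sample history are hypothetical; the thresholds are kept small so the example stays short.

```python
from collections import Counter, defaultdict

def build_reply_mapping(history, min_count=None, top_k=None):
    """Map each to-be-replied message to its most frequent historical replies.

    history: iterable of (to_be_replied, reply) pairs.
    min_count: keep replies that occurred more than this many times.
    top_k: keep only the k most frequent replies.
    """
    counts = defaultdict(Counter)
    for message, reply in history:
        # Identical messages are merged; each reply's occurrences are counted.
        counts[message][reply] += 1
    mapping = {}
    for message, replies in counts.items():
        ranked = replies.most_common()  # most frequent first
        if min_count is not None:
            ranked = [(r, c) for r, c in ranked if c > min_count]
        if top_k is not None:
            ranked = ranked[:top_k]
        mapping[message] = [r for r, _ in ranked]
    return mapping

question = "What time do you get off work?"
history = (
    [(question, "9 o'clock")] * 3
    + [(question, "8 o'clock")] * 2
    + [(question, "no idea")]
)
mapping = build_reply_mapping(history, min_count=1)
candidates = mapping.get(question, [])
```

Here a reply seen only once is dropped by the threshold, while the two recurring replies are kept in descending order of frequency and can be offered directly when the same message arrives again.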
Technical solution 3
In technical solution 3, determining in step 202 the reply candidates corresponding to the information to be replied to may specifically include: extracting first feature information of the information to be replied to; determining, based on a pre-established correspondence between feature information and reply rules, the first reply rule corresponding to the first feature information; and determining at least one reply candidate through the first reply rule.
For example, the feature information may be a preset sentence, a preset sentence format, and so on. Preset sentences include, for example, "Have you eaten?", "What time do you go to bed?", and "OK?". Preset sentence formats include, for example, "Are you going to XX or to XX for dinner today?" and "Shall we eat XX or XX today?" (where XX is a placeholder word). In a specific implementation, a corresponding reply rule may be constructed for each kind of feature information. For example, the reply rule constructed for "Have you eaten?" is: ① Already ate ② Not yet ③ About to eat; the reply rule constructed for "Are you going to 万州豌杂面 or to 冒香 for dinner today?" is: ① 万州豌杂面 ② 冒香, and so on. Of course, other reply rules may also be constructed, and the embodiment of the present application imposes no limitation in this respect.
As an optional embodiment, determining the at least one reply candidate through the first reply rule includes: extracting at least one specific keyword from the information to be replied to; and combining the at least one specific keyword with the first reply rule to obtain the at least one reply candidate.
For example, suppose the preset sentence format is "Are you planning to go to XX or to XX for dinner today?", and the reply rule constructed for it is "I'm going to XX for dinner today", where XX denotes a placeholder item in the information to be replied to. If the first to-be-replied data is "Are you planning to go home for dinner today, or go to 日昌?", the placeholder words (that is, the specific keywords) include "home" and "日昌", and the at least one reply candidate constructed takes the form: ① I'm going to restaurant A for dinner today; ② I'm going to restaurant B for dinner today, with A and B filled in by the extracted keywords. In addition, since the user may not necessarily eat at either of these two places, other reply candidates may be added, for example: ③ Anything is fine. As another example, for "Shall we eat dumplings or noodles today?", the reply rule constructed is "Let's eat XX today"; the placeholder words "dumplings" and "noodles" are combined with the reply rule, so that the at least one reply candidate constructed includes: ① Let's eat dumplings today; ② Let's eat noodles today, and so on. With this scheme, at least one reply candidate can be determined from the specific keywords contained in the information to be replied to, so the determined reply candidates are more closely related to the information to be replied to and are more accurate.
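The combination of extracted keywords with a reply rule might be sketched as follows. The regular expression, the template string, the `RULES` table, and the function `rule_candidates` are hypothetical and cover only the "eat XX or XX today" format; a practical system would presumably hold many such rules.

```python
import re

# Hypothetical rule table: (sentence-format pattern, reply template).
RULES = [
    (
        re.compile(r"eat (?P<a>\w+) or (?P<b>\w+) today", re.IGNORECASE),
        "Let's eat {slot} today",
    ),
]

def rule_candidates(message, extra=("Anything is fine",)):
    """Fill the reply template with each keyword extracted from the message."""
    for pattern, template in RULES:
        match = pattern.search(message)
        if match:
            # The specific keywords are the words that fill the XX placeholders.
            slots = [match.group("a"), match.group("b")]
            # Append generic candidates in case neither option suits the user.
            return [template.format(slot=s) for s in slots] + list(extra)
    return []

candidates = rule_candidates("Shall we eat dumplings or noodles today?")
```

The `extra` argument mirrors the additional candidate (such as "Anything is fine") mentioned above for the case where the user chooses neither option.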
Technical solution 4
In technical solution 4, determining in step 202 the reply candidates corresponding to the information to be replied to may specifically include: determining user status information through an application program of the terminal that is in a running state; and determining, according to the user status information, the reply candidates corresponding to the information to be replied to.
Technical solution 4 uses user status information when generating reply candidates. The user status information obtained through a running application program reflects how the user is using that application program. For example, when the user receives information from the communication peer through the terminal, or replies through the terminal to information from the communication peer, the user's use of the application program may specifically include a series of user actions performed through the running app, such as setting, browsing, purchasing, and viewing. Reply candidates generated from such user status information can therefore carry deeper information beyond a simple yes-or-no reply, so that they match the user's precise reply intention and improve the accuracy and richness of quick reply candidates. When a reply candidate matches the user's precise reply intention, the user can reply with that candidate directly; because the input cost of replying is further reduced, reply efficiency is improved.
For example, suppose user M is watching a video through a video app when a text message arrives. The corresponding user status information can then be obtained through the video app. If the content of the text message is a question such as "What are you doing?", reply candidates such as "Watching a video" or "Watching a TV series" may be generated according to the user status information. Alternatively, if the content of the text message is a question such as "Have you eaten?", reply candidates such as "Not yet, watching a video", "Not yet, watching a TV series", or "Eating while watching a TV series" may be generated according to the user status information.
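A minimal sketch of technical solution 4 is given below. How the running application program is detected is platform-specific and is not shown; the app category string, the `STATUS_PHRASES` table, and the function `status_candidates` are assumptions made for illustration only.

```python
# Hypothetical mapping from the category of the running app to a status phrase.
STATUS_PHRASES = {
    "video": "watching a video",
    "music": "listening to music",
}

def status_candidates(message, running_app_category):
    """Build reply candidates that embed the user's current activity."""
    status = STATUS_PHRASES.get(running_app_category)
    if status is None:
        return []  # no usable status information
    text = message.lower()
    if "what are you doing" in text:
        return ["I'm " + status]
    if "have you eaten" in text:
        # Combine a yes/no style answer with the deeper status information.
        return ["Not yet, I'm " + status]
    return []

doing = status_candidates("What are you doing?", "video")
eaten = status_candidates("Have you eaten?", "video")
```

The second case shows the point made above: the candidate carries more than a bare negative answer because it also states what the user is currently doing.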
The process of determining the reply candidates corresponding to the information to be replied to has been described in detail above through technical solutions 1 to 4. It can be understood that a person skilled in the art may, according to actual application requirements, adopt any one or a combination of technical solutions 1 to 4, or may adopt other technical solutions; the embodiment of the present application does not limit the specific process of determining the reply candidates corresponding to the information to be replied to.
Step 203 may display the reply candidates for the user to select. Optionally, in response to a trigger operation by the user on any reply candidate, the target reply candidate corresponding to the trigger operation may be committed to the screen, that is, the target reply candidate may be output to the input box of the communication window.
According to one embodiment, displaying the reply candidates may specifically include: displaying the reply candidates above the input method keyboard. After the input method keyboard is invoked, at least one reply candidate may be displayed above the input method keyboard, whether or not the user has produced any input string. According to another embodiment, the reply candidates may be displayed by means of a pop-up window, an overlay, or the like. It can be understood that the embodiment of the present application does not limit the specific manner of displaying the reply candidates.
In an optional embodiment of the present application, displaying the reply candidates in step 203 may specifically include: sorting multiple reply candidates according to the social relationship between the local user and the peer user; and displaying the sorted reply candidates.
In practical applications, the social relationship between the local user and the peer user may be determined as follows: obtain the communication content between the peer user and the local user; extract a first predetermined keyword contained in the communication content; and determine the social relationship between the local user and the peer user based on the first predetermined keyword.
For example, a database may be set up in advance that includes at least one social relationship and the keywords corresponding to each social relationship. The keywords for a "couple" relationship may include "思密达", "honey", and "dear"; the keywords for a "married couple" relationship may include "wife" and "husband"; and the keywords for a "colleague" relationship may include "project", "*工" (a surname followed by the title "工"), "Leader", "Madam", and the like.
After the communication content between the peer user and the local user is obtained, it can be determined whether the communication content contains any of the above keywords. If it does, the keyword is extracted as the first predetermined keyword, and the social relationship is then determined from the first predetermined keyword.
Optionally, for each reply candidate, the probability that it belongs to the social relationship may be determined, and the reply candidates may then be sorted by this probability before being output. For example, if the social relationship is a couple relationship and the reply candidates include ① "Good night", ② "Night-night", and ③ "Good night, dear", with probabilities of belonging to the couple relationship of 0.1, 0.6, and 0.9 respectively, the three reply candidates may be output in the following order: ① "Good night, dear" ② "Night-night" ③ "Good night". Because multiple reply candidates can be sorted according to the social relationship in this scheme, the sorting result is more reasonable.
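The keyword-based determination of the social relationship and the subsequent sorting might be sketched as follows. The keyword sets, the function names, and the probability table are hypothetical; how the per-candidate probabilities are obtained is not specified in this passage and is simply supplied as input here.

```python
import re

# Hypothetical database: social relationship -> keywords that indicate it.
RELATION_KEYWORDS = {
    "couple": {"honey", "dear"},
    "colleague": {"project", "leader"},
}

def infer_relationship(chat_history):
    """Return the first relationship whose keywords occur in the chat history."""
    tokens = set(re.findall(r"\w+", " ".join(chat_history).lower()))
    for relation, keywords in RELATION_KEYWORDS.items():
        if tokens & keywords:
            return relation
    return None

def rank_candidates(candidates, relation_probability):
    """Sort candidates by their probability of fitting the relationship."""
    return sorted(
        candidates,
        key=lambda c: relation_probability.get(c, 0.0),
        reverse=True,
    )

relation = infer_relationship(["Good night, honey", "See you tomorrow"])
# Assumed probabilities that each candidate belongs to the couple relationship.
couple_probability = {"Good night": 0.1, "Night-night": 0.6, "Good night, dear": 0.9}
ranked = rank_candidates(["Good night", "Night-night", "Good night, dear"], couple_probability)
```

With the probabilities from the example above, the candidate most characteristic of the couple relationship is displayed first.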
In summary, the data processing method of the embodiment of the present application allows the user to carry the information to be replied to via the clipboard; the information to be replied to can then be determined from the clipboard information, and the corresponding reply candidates can be provided, which improves reply efficiency. The embodiment of the present application can be applied to reply scenarios that span communication windows. Specifically, the user may copy information in a first communication window and switch to a second communication window; the embodiment of the present application can then automatically determine the information to be replied to from the clipboard information and provide the corresponding reply candidates. Intelligent replies can thus be achieved in cross-window reply scenarios, improving reply efficiency.
Method Embodiment 2
Referring to FIG. 3, a flowchart of the steps of Embodiment 2 of a data processing method of the present application is shown, which may specifically include the following steps:
Step 301: determine the information to be replied to from the clipboard information; the information to be replied to may correspond to all or part of the clipboard information;
Step 302: determine the reply candidates corresponding to the information to be replied to;
Step 303: display the reply candidates and the clipboard information.
The embodiment of the present application can display the reply candidates and the clipboard information at the same time. The clipboard information and the reply candidates are thereby shown side by side for comparison, so that the user knows that the reply candidates are replies to the clipboard information.
Referring to FIG. 4, a schematic diagram of an interface according to an embodiment of the present application is shown, which may specifically include a communication window 401 and an input method interface 402.
The communication window 401 may include communication content and an input box. Taking the communication window between user A and user B as an example, the communication content may include communication content 1 sent by user B, communication content 2 sent by user A, and so on.
The input method interface 402 may include an input method keyboard 421, a clipboard information area 422, and a reply candidate area 423. The clipboard information area 422 may be located above the input method keyboard 421 and may cover some or all of the input method tools; the reply candidate area 423 may be located in the clipboard information area 422.
The clipboard information area 422 may include the clipboard information and prompt information corresponding to the clipboard information, such as "From clipboard".
The reply candidate area 423 may include n reply candidates (n being a natural number) and corresponding prompt information, such as "Smart reply".
If a click operation by the user on a reply candidate is received, the target reply candidate corresponding to the click operation may be output to the input box.
In an optional embodiment of the present application, it may be determined whether the communication content of the communication window includes the clipboard information; if so, the clipboard information is not displayed, and otherwise the clipboard information is displayed. This determination avoids showing the clipboard information twice.
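The determination described in the preceding paragraph might be sketched as below. The function name and the use of a plain substring test are assumptions; an actual implementation could compare content in other ways (for example, for pictures or videos).

```python
def should_show_clipboard(clipboard_text, window_messages):
    """Show the clipboard area only if the window does not already contain it."""
    return not any(clipboard_text in message for message in window_messages)

# Hypothetical contents of the current communication window.
window = ["Come to the meeting room for a moment", "OK"]
show_copied_elsewhere = should_show_clipboard("What time do you get off work?", window)
show_copied_here = should_show_clipboard("Come to the meeting room", window)
```

In the first call the copied text came from another window, so it is displayed alongside the reply candidates; in the second call it is already visible in the current window and is not repeated.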
It should be noted that, for simplicity of description, the method embodiments are each expressed as a series of combined actions. However, those skilled in the art should understand that the embodiments of the present application are not limited by the described order of actions, because according to the embodiments of the present application some steps may be performed in another order or simultaneously. In addition, those skilled in the art should also understand that the embodiments described in the specification are all preferred embodiments, and the actions involved are not necessarily required by the embodiments of the present application.
Apparatus Embodiment
Referring to FIG. 5, a structural block diagram of an embodiment of a data processing apparatus of the present application is shown, which may specifically include:
a to-be-replied information determination module 501, configured to determine the information to be replied to from the clipboard information, where the information to be replied to may correspond to all or part of the clipboard information;
a reply candidate determination module 502, configured to determine the reply candidates corresponding to the information to be replied to; and
a reply candidate display module 503, configured to display the reply candidates.
Optionally, the apparatus may further include:
a clipboard information display module, configured to display the clipboard information.
Optionally, the reply candidate display module 503 is specifically configured to display the reply candidates above the input method keyboard.
Optionally, the type of the clipboard information may include at least one of the following types: text, picture, audio, video, and page address.
Optionally, the reply candidate determination module 502 may include:
a topic determination module, configured to determine the topic corresponding to the information to be replied to; and
a candidate determination module, configured to determine, according to the topic, the reply candidates corresponding to the information to be replied to.
Optionally, the topic determination module may include:
a first topic determination module, configured to determine the topic corresponding to a page address according to the content of the page corresponding to the page address; and/or
a second topic determination module, configured to perform first recognition on the video stream and/or audio stream corresponding to a video, and determine the topic corresponding to the video according to the first recognition result obtained; and/or
a third topic determination module, configured to perform second recognition on a picture, and determine the topic corresponding to the picture according to the second recognition result obtained; and/or
a fourth topic determination module, configured to perform speech recognition on audio, and determine the topic corresponding to the information to be replied to according to the speech recognition result obtained.
Optionally, the reply candidate determination module 502 may include:
a search module, configured to search, according to the information to be replied to, a mapping relationship between to-be-replied data and reply data, so as to obtain the reply candidates corresponding to the information to be replied to;
where the mapping relationship is obtained from historical communication data corresponding to at least one user, and the historical communication data may include historical to-be-replied data and the corresponding historical reply data.
Optionally, the reply candidate display module 503 may include:
a sorting module, configured to sort multiple reply candidates according to the social relationship between the local user and the peer user; and
a sorted display module, configured to display the sorted reply candidates.
Since the apparatus embodiment is basically similar to the method embodiments, it is described relatively briefly; for related details, refer to the corresponding description of the method embodiments.
The embodiments in this specification are described in a progressive manner. Each embodiment focuses on its differences from the other embodiments, and for the parts that are the same or similar, the embodiments may be referred to one another.
Regarding the apparatus in the above embodiment, the specific manner in which each module performs its operations has been described in detail in the embodiments of the method and will not be elaborated here.
An embodiment of the present application provides a device for data processing, including a memory and one or more programs, wherein the one or more programs are stored in the memory and are configured to be executed by one or more processors, and the one or more programs include instructions for: determining to-be-replied information from clipboard information, the to-be-replied information corresponding to all or part of the clipboard information; determining a reply candidate corresponding to the to-be-replied information; and displaying the reply candidate.
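The three operations above can be sketched end to end as follows. The rule for choosing "part" of the clipboard (the last non-empty line) is an assumption made only for this example, and `find_candidates` and `show` are hypothetical callbacks standing in for the candidate determination and display steps.

```python
from typing import Callable, List


def extract_info_to_reply(clipboard_text: str) -> str:
    """Pick the to-be-replied information from the clipboard text.

    Assumption for this sketch: use the last non-empty line, so the result is
    the whole clipboard for single-line text and a part of it otherwise.
    """
    lines = [line.strip() for line in clipboard_text.splitlines() if line.strip()]
    return lines[-1] if lines else ""


def suggest_replies(
    clipboard_text: str,
    find_candidates: Callable[[str], List[str]],
    show: Callable[[List[str]], None],
) -> List[str]:
    """Determine the information to reply to, find candidates, and display them."""
    info = extract_info_to_reply(clipboard_text)
    if not info:
        return []
    candidates = find_candidates(info)
    if candidates:
        show(candidates)  # for example, render above the input method keyboard
    return candidates
```

Passing the two steps in as callbacks keeps the sketch independent of how candidates are found (topic-based or mapping-based) and of where they are displayed.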
FIG. 6 is a block diagram of a device 800 for data processing according to an exemplary embodiment. For example, the device 800 may be a mobile phone, a computer, a digital broadcasting terminal, a messaging device, a game console, a tablet device, a medical device, fitness equipment, a personal digital assistant, or the like.
Referring to FIG. 6, the device 800 may include one or more of the following components: a processing component 802, a memory 804, a power component 806, a multimedia component 808, an audio component 810, an input/output (I/O) interface 812, a sensor component 814, and a communication component 816.
The processing component 802 generally controls the overall operation of the device 800, such as operations associated with display, telephone calls, data communication, camera operation, and recording. The processing component 802 may include one or more processors 820 to execute instructions so as to complete all or part of the steps of the method described above. In addition, the processing component 802 may include one or more modules that facilitate interaction between the processing component 802 and other components. For example, the processing component 802 may include a multimedia module to facilitate interaction between the multimedia component 808 and the processing component 802.
The memory 804 is configured to store various types of data to support the operation of the device 800. Examples of such data include instructions for any application or method operating on the device 800, contact data, phone book data, messages, pictures, videos, and the like. The memory 804 may be implemented by any type of volatile or non-volatile storage device, or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, a magnetic disk, or an optical disk.
The power component 806 supplies power to the various components of the device 800. The power component 806 may include a power management system, one or more power sources, and other components associated with generating, managing, and distributing power for the device 800.
The multimedia component 808 includes a screen that provides an output interface between the device 800 and the user. In some embodiments, the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from the user. The touch panel includes one or more touch sensors to sense touches, swipes, and gestures on the touch panel. The touch sensors may sense not only the boundary of a touch or swipe action but also the duration and pressure associated with the touch or swipe operation. In some embodiments, the multimedia component 808 includes a front camera and/or a rear camera. When the device 800 is in an operating mode, such as a shooting mode or a video mode, the front camera and/or the rear camera can receive external multimedia data. Each front camera and rear camera may be a fixed optical lens system or may have focal length and optical zoom capabilities.
The audio component 810 is configured to output and/or input audio signals. For example, the audio component 810 includes a microphone (MIC), which is configured to receive external audio signals when the device 800 is in an operating mode, such as a call mode, a recording mode, or a voice data processing mode. The received audio signals may be further stored in the memory 804 or transmitted via the communication component 816. In some embodiments, the audio component 810 further includes a speaker for outputting audio signals.
The I/O interface 812 provides an interface between the processing component 802 and peripheral interface modules, which may be a keyboard, a click wheel, buttons, or the like. The buttons may include, but are not limited to, a home button, a volume button, a start button, and a lock button.
The sensor component 814 includes one or more sensors for providing the device 800 with status assessments of various aspects. For example, the sensor component 814 can detect the on/off state of the device 800 and the relative positioning of components, for example the display and keypad of the device 800. The sensor component 814 can also detect a change in position of the device 800 or of one of its components, the presence or absence of user contact with the device 800, the orientation or acceleration/deceleration of the device 800, and a change in temperature of the device 800. The sensor component 814 may include a proximity sensor configured to detect the presence of nearby objects without any physical contact. The sensor component 814 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications. In some embodiments, the sensor component 814 may further include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
The communication component 816 is configured to facilitate wired or wireless communication between the device 800 and other devices. The device 800 can access a wireless network based on a communication standard, such as WiFi, 2G, or 3G, or a combination thereof. In one exemplary embodiment, the communication component 816 receives a broadcast signal or broadcast-related information from an external broadcast management system via a broadcast channel. In one exemplary embodiment, the communication component 816 further includes a near field communication (NFC) module to facilitate short-range communication. For example, the NFC module may be implemented based on radio frequency identification (RFID) technology, Infrared Data Association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.
In an exemplary embodiment, the device 800 may be implemented by one or more application-specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field-programmable gate arrays (FPGAs), controllers, microcontrollers, microprocessors, or other electronic components, for performing the above method.
In an exemplary embodiment, a non-transitory computer-readable storage medium including instructions is also provided, such as the memory 804 including instructions, where the instructions can be executed by the processor 820 of the device 800 to complete the above method. For example, the non-transitory computer-readable storage medium may be a ROM, a random access memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, or the like.
FIG. 7 is a schematic structural diagram of a server in some embodiments of the present application. The server 1900 may vary considerably depending on configuration or performance, and may include one or more central processing units (CPUs) 1922 (for example, one or more processors), a memory 1932, and one or more storage media 1930 (for example, one or more mass storage devices) storing application programs 1942 or data 1944. The memory 1932 and the storage medium 1930 may provide transient or persistent storage. The program stored in the storage medium 1930 may include one or more modules (not shown in the figure), and each module may include a series of instruction operations on the server. Furthermore, the central processing unit 1922 may be configured to communicate with the storage medium 1930 and to execute, on the server 1900, the series of instruction operations in the storage medium 1930.
The server 1900 may also include one or more power supplies 1926, one or more wired or wireless network interfaces 1950, one or more input/output interfaces 1958, one or more keyboards 1956, and/or one or more operating systems 1941, such as Windows Server™, Mac OS X™, Unix™, Linux™, FreeBSD™, and so on.
A non-transitory computer-readable storage medium is provided; when the instructions in the storage medium are executed by a processor of a device (a server or a terminal), the device is enabled to execute the data processing method shown in FIG. 2 or FIG. 3.
A non-transitory computer-readable storage medium is provided; when the instructions in the storage medium are executed by a processor of a device (a server or a terminal), the device is enabled to execute a data processing method, the method including: determining to-be-replied information from clipboard information, the to-be-replied information corresponding to all or part of the clipboard information; determining a reply candidate corresponding to the to-be-replied information; and displaying the reply candidate.
Other embodiments of the present application will readily occur to those skilled in the art after considering the specification and practicing the invention disclosed herein. This application is intended to cover any variations, uses, or adaptations of the present application that follow its general principles and include common general knowledge or customary technical means in the art that are not disclosed herein. The specification and the embodiments are to be regarded as exemplary only, with the true scope and spirit of the application being indicated by the following claims.
It should be understood that the present application is not limited to the precise structures described above and shown in the drawings, and that various modifications and changes can be made without departing from its scope. The scope of the application is limited only by the appended claims.
The above are only preferred embodiments of the present application and are not intended to limit it. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present application shall fall within its scope of protection.
The data processing method, the data processing device, and the device for data processing provided by the present application have been described in detail above. Specific examples are used herein to explain the principles and implementations of the present application, and the description of the above embodiments is intended only to help in understanding the method of the present application and its core idea. Meanwhile, a person of ordinary skill in the art may, following the idea of the present application, make changes to the specific implementations and the scope of application. In summary, the content of this specification should not be construed as limiting the present application.

Claims (25)

  1. A data processing method, characterized in that the method comprises:
    determining to-be-replied information from clipboard information, the to-be-replied information corresponding to all or part of the clipboard information;
    determining a reply candidate corresponding to the to-be-replied information; and
    displaying the reply candidate.
  2. The method according to claim 1, characterized in that the method further comprises:
    displaying the clipboard information.
  3. The method according to claim 1 or 2, characterized in that displaying the reply candidate comprises:
    displaying the reply candidate above an input method keyboard.
  4. The method according to claim 1 or 2, characterized in that the type of the clipboard information includes at least one of the following types: text, picture, audio, video, and page address.
  5. The method according to claim 1 or 2, characterized in that determining the reply candidate corresponding to the to-be-replied information comprises:
    determining a topic corresponding to the to-be-replied information; and
    determining, according to the topic, the reply candidate corresponding to the to-be-replied information.
  6. The method according to claim 5, characterized in that determining the topic corresponding to the to-be-replied information comprises:
    determining the topic corresponding to a page address according to the content of the page corresponding to the page address; and/or
    performing first recognition on a video stream and/or an audio stream corresponding to a video, and determining the topic corresponding to the video according to the obtained first recognition result; and/or
    performing second recognition on a picture, and determining the topic corresponding to the picture according to the obtained second recognition result; and/or
    performing speech recognition on audio, and determining the topic corresponding to the to-be-replied information according to the obtained speech recognition result.
  7. The method according to claim 1 or 2, characterized in that determining the reply candidate corresponding to the to-be-replied information comprises:
    searching, according to the to-be-replied information, a mapping relationship between to-be-replied data and reply data, so as to obtain the reply candidate corresponding to the to-be-replied information;
    wherein the mapping relationship is obtained from historical communication data corresponding to at least one user, and the historical communication data includes historical to-be-replied data and its corresponding historical reply data.
  8. The method according to claim 1 or 2, characterized in that displaying the reply candidate comprises:
    sorting multiple reply candidates according to the social relationship between the local user and the peer user; and
    displaying the sorted reply candidates.
  9. A data processing device, characterized by comprising:
    a to-be-replied information determination module, configured to determine to-be-replied information from clipboard information, the to-be-replied information corresponding to all or part of the clipboard information;
    a reply candidate determination module, configured to determine a reply candidate corresponding to the to-be-replied information; and
    a reply candidate display module, configured to display the reply candidate.
  10. The device according to claim 9, characterized in that the device further comprises:
    a clipboard information display module, configured to display the clipboard information.
  11. The device according to claim 9 or 10, characterized in that the reply candidate display module is specifically configured to display the reply candidate above an input method keyboard.
  12. The device according to claim 9 or 10, characterized in that the type of the clipboard information includes at least one of the following types: text, picture, audio, video, and page address.
  13. The device according to claim 9 or 10, characterized in that the reply candidate determination module comprises:
    a topic determination module, configured to determine a topic corresponding to the to-be-replied information; and
    a candidate determination module, configured to determine, according to the topic, the reply candidate corresponding to the to-be-replied information.
  14. The device according to claim 13, characterized in that the topic determination module comprises:
    a first topic determination module, configured to determine the topic corresponding to a page address according to the content of the page corresponding to the page address; and/or
    a second topic determination module, configured to perform first recognition on a video stream and/or an audio stream corresponding to a video, and determine the topic corresponding to the video according to the obtained first recognition result; and/or
    a third topic determination module, configured to perform second recognition on a picture, and determine the topic corresponding to the picture according to the obtained second recognition result; and/or
    a fourth topic determination module, configured to perform speech recognition on audio, and determine the topic corresponding to the to-be-replied information according to the obtained speech recognition result.
  15. The device according to claim 9 or 10, characterized in that the reply candidate determination module comprises:
    a lookup module, configured to search, according to the to-be-replied information, a mapping relationship between to-be-replied data and reply data, so as to obtain the reply candidate corresponding to the to-be-replied information;
    wherein the mapping relationship is obtained from historical communication data corresponding to at least one user, and the historical communication data includes historical to-be-replied data and its corresponding historical reply data.
  16. The device according to claim 9 or 10, characterized in that the reply candidate display module comprises:
    a sorting module, configured to sort multiple reply candidates according to the social relationship between the local user and the peer user; and
    a sorted display module, configured to display the sorted reply candidates.
  17. A device for data processing, characterized by comprising a memory and one or more programs, wherein the one or more programs are stored in the memory and are configured to be executed by one or more processors, the one or more programs containing instructions for:
    determining to-be-replied information from clipboard information, the to-be-replied information corresponding to all or part of the clipboard information;
    determining a reply candidate corresponding to the to-be-replied information; and
    displaying the reply candidate.
  18. The device according to claim 17, characterized in that the one or more programs, configured to be executed by the one or more processors, further contain instructions for:
    displaying the clipboard information.
  19. The device according to claim 17 or 18, characterized in that displaying the reply candidate comprises:
    displaying the reply candidate above an input method keyboard.
  20. The device according to claim 17 or 18, characterized in that the type of the clipboard information includes at least one of the following types: text, picture, audio, video, and page address.
  21. The device according to claim 17 or 18, characterized in that determining the reply candidate corresponding to the to-be-replied information comprises:
    determining a topic corresponding to the to-be-replied information; and
    determining, according to the topic, the reply candidate corresponding to the to-be-replied information.
  22. The device according to claim 21, characterized in that determining the topic corresponding to the to-be-replied information comprises:
    determining the topic corresponding to a page address according to the content of the page corresponding to the page address; and/or
    performing first recognition on a video stream and/or an audio stream corresponding to a video, and determining the topic corresponding to the video according to the obtained first recognition result; and/or
    performing second recognition on a picture, and determining the topic corresponding to the picture according to the obtained second recognition result; and/or
    performing speech recognition on audio, and determining the topic corresponding to the to-be-replied information according to the obtained speech recognition result.
  23. The device according to claim 17 or 18, characterized in that determining the reply candidate corresponding to the to-be-replied information comprises:
    searching, according to the to-be-replied information, a mapping relationship between to-be-replied data and reply data, so as to obtain the reply candidate corresponding to the to-be-replied information;
    wherein the mapping relationship is obtained from historical communication data corresponding to at least one user, and the historical communication data includes historical to-be-replied data and its corresponding historical reply data.
  24. The device according to claim 17 or 18, characterized in that displaying the reply candidate comprises:
    sorting multiple reply candidates according to the social relationship between the local user and the peer user; and
    displaying the sorted reply candidates.
  25. A machine-readable medium having instructions stored thereon which, when executed by one or more processors, cause a device to perform the data processing method according to one or more of claims 1 to 8.
PCT/CN2018/121235 2018-09-20 2018-12-14 Method and device for data processing and device for use in data processing WO2020056948A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201811101152.5 2018-09-20
CN201811101152.5A CN110929122B (en) 2018-09-20 2018-09-20 Data processing method and device for data processing

Publications (1)

Publication Number Publication Date
WO2020056948A1 true WO2020056948A1 (en) 2020-03-26

Family

ID=69855484

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/121235 WO2020056948A1 (en) 2018-09-20 2018-12-14 Method and device for data processing and device for use in data processing

Country Status (2)

Country Link
CN (1) CN110929122B (en)
WO (1) WO2020056948A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114356173A (en) * 2021-12-06 2022-04-15 科大讯飞股份有限公司 Message reply method and related device, electronic equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015176518A1 (en) * 2014-05-22 2015-11-26 华为技术有限公司 Reply information recommending method and device
CN106446054A (en) * 2016-08-31 2017-02-22 北京搜狗科技发展有限公司 Information recommendation method and apparatus, and electronic device
CN108139952A (en) * 2017-06-14 2018-06-08 北京小米移动软件有限公司 Using exchange method, exchange method and device
CN108153755A (en) * 2016-12-05 2018-06-12 北京搜狗科技发展有限公司 Method, apparatus and electronic equipment are recommended in a kind of input

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101199474B1 (en) * 2008-04-29 2012-11-09 주식회사 케이티 Method for providing question and answer community service and system thereof, method for providing quiz game
JP5343650B2 (en) * 2009-03-23 2013-11-13 富士電機株式会社 Call center business support system
CN104077341B (en) * 2013-07-19 2016-04-20 腾讯科技(北京)有限公司 The method and apparatus that keyword automatically replies mapping relations is generated in instant messaging
CN104951219B (en) * 2014-03-25 2018-06-15 华为技术有限公司 A kind of method and mobile terminal of mobile terminal text input
US9213941B2 (en) * 2014-04-22 2015-12-15 Google Inc. Automatic actions based on contextual replies
CN104035986A (en) * 2014-05-30 2014-09-10 北京金山网络科技有限公司 Method and device for opening URL and method and device for searching for keywords
CN104158893B (en) * 2014-08-22 2017-08-11 北京奇虎科技有限公司 The method and system of Contents of clipboard are transmitted based on WiFi equipment
US20170277419A1 (en) * 2016-03-25 2017-09-28 Le Holdings (Beijing) Co., Ltd. Method and Electronic Device for Replying to a Message
CN107436709A (en) * 2016-05-25 2017-12-05 富泰华工业(深圳)有限公司 A kind of electronic installation with auxiliary recovery function and auxiliary answering method

Also Published As

Publication number Publication date
CN110929122A (en) 2020-03-27
CN110929122B (en) 2024-02-06

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18934147

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18934147

Country of ref document: EP

Kind code of ref document: A1