CN106656754B - Information extraction method and device based on instant messaging software - Google Patents

Information extraction method and device based on instant messaging software Download PDF

Info

Publication number
CN106656754B
CN106656754B CN201611124228.7A CN201611124228A CN106656754B CN 106656754 B CN106656754 B CN 106656754B CN 201611124228 A CN201611124228 A CN 201611124228A CN 106656754 B CN106656754 B CN 106656754B
Authority
CN
China
Prior art keywords
content
preset
user
session
category
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201611124228.7A
Other languages
Chinese (zh)
Other versions
CN106656754A (en
Inventor
朱翼鹏
郦伟强
蔡胜
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Anyun Century Technology Co Ltd
Original Assignee
Beijing Anyun Century Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Anyun Century Technology Co Ltd filed Critical Beijing Anyun Century Technology Co Ltd
Priority to CN201611124228.7A priority Critical patent/CN106656754B/en
Publication of CN106656754A publication Critical patent/CN106656754A/en
Application granted granted Critical
Publication of CN106656754B publication Critical patent/CN106656754B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L51/00User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
    • H04L51/21Monitoring or handling of messages
    • H04L51/216Handling conversation history, e.g. grouping of messages in sessions or threads

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention relates to the technical field of communication, and discloses an information extraction method and device based on instant messaging software, which aim to solve the technical problem of low efficiency of extracting information from session content in the prior art. The method comprises the following steps: obtaining session content generated by instant messaging software; determining a preset mark contained in the session content, wherein the preset mark is used for marking the session content according to a preset category; and acquiring the content under the preset category from the conversation content based on the preset mark. The technical effect of improving the efficiency of extracting the content under the preset category from the conversation content is achieved.

Description

Information extraction method and device based on instant messaging software
Technical Field
The invention relates to the technical field of communication, in particular to an information extraction method and device based on instant messaging software.
Background
With the continuous development of science and technology, electronic technology has also gained rapid development, and the variety of electronic products is also more and more, and people also enjoy various conveniences brought by the development of science and technology. People can enjoy comfortable life brought along with the development of science and technology through various types of mobile terminals. For example, mobile terminals such as smart phones and tablet computers have become an important part of people's lives, and users can listen to music, play games and the like by using the mobile terminals such as smart phones and tablet computers, so as to relieve pressure brought by modern fast-paced lives.
In the prior art, instant messaging software is increasingly used due to the fact that the communication mode is convenient and fast, people can share data such as instant messaging, images and videos through the instant messaging software installed on electronic equipment, conversation contents can be generated in the communication process through the instant messaging software, if some users miss an instant chat process or the conversation contents are extremely important, a lot of historical conversation contents need to be checked manually, and therefore the technical problem that the efficiency of extracting information from the conversation contents is low exists.
Disclosure of Invention
In view of the above, the present invention is proposed to provide an instant messenger-based information extraction method and apparatus that overcomes or at least partially solves the above-mentioned problems.
In a first aspect, an embodiment of the present invention provides an information extraction method based on instant messaging software, including:
obtaining session content generated by instant messaging software;
determining a preset mark contained in the session content, wherein the preset mark is used for marking the session content according to a preset category;
and acquiring the content under the preset category from the conversation content based on the preset mark.
Optionally, the obtaining, based on the preset mark, the preset content in the preset category from the session content includes: determining a preset mark selected by a user from at least one type of preset marks; extracting first content from the session content according to the preset mark selected by the user; sending the first content to a first designated user so that the first designated user can determine the accuracy of the first content; and obtaining confirmation feedback of the first appointed user, and obtaining the content under the preset category based on the confirmation feedback.
Optionally, the obtaining the content under the preset category based on the confirmation feedback includes: if the confirmation feedback represents that the first content is accurate, taking the first content as the content in the preset category; and if the confirmation feedback comprises second content obtained after the first content is modified, taking the second content as the content in the preset category.
Optionally, the first designated user is a producer of the first content; or, a manager of an event corresponding to the first content.
Optionally, the obtaining, based on the preset mark, content in the preset category from the session content includes: extracting the conversation contents with the preset marks in the conversation contents to obtain an extraction result; and determining the content under the preset category based on the extraction result and the user information generating the extraction result.
Optionally, the preset marks include at least two types of preset marks, and the obtaining, based on the preset marks, the content in the preset category from the session content includes: based on the at least two types of preset marks contained in the session content, extracting the content under at least two types of preset categories corresponding to the at least two types of preset marks from the session content.
Optionally, after the content in the preset category is acquired from the session content, the method further includes: and providing the content under the preset category for a second designated user so that the second designated user obtains the content under the preset category.
Optionally, before providing the content in the preset category to a second specified user, the method further includes: determining participants of the conversation content, and determining the participants as the second designated users; or, the second designated user to which the content under the preset category points is extracted from the session content; or, determining a manager of the content of the preset category, and determining the manager as the second designated user.
Optionally, the extracting, from the session content, the second designated user to which the content in the preset category points includes: determining a preset mark from the content under the preset category; determining the user associated with the preset mark as the second designated user; or if the content in the preset category is the work content, determining that the executor of the work content is the second designated user.
Optionally, the preset mark is marked by a producer of the corresponding session content; or, the preset mark is marked by a user with the session content management authority.
Optionally, the obtaining of the session content generated by the instant messaging software includes: obtaining the session content generated by a multi-person session group of the instant messaging software; after the acquiring, based on the preset mark, the content in the preset category from the session content, the method further includes: determining a new user joining the multi-person conversation group after obtaining the content under the preset category; and providing the content under the preset category to the new user.
In a second aspect, an embodiment of the present invention provides an information extraction apparatus based on instant messaging software, including:
the obtaining module is used for obtaining the session content generated by the instant messaging software;
the first determining module is used for determining a preset mark contained in the conversation content, wherein the preset mark is used for marking the conversation content according to a preset category;
and the acquisition module is used for acquiring the content under the preset category from the session content based on the preset mark.
Optionally, the obtaining module includes: the first determining unit is used for determining the preset mark selected by the user from at least one type of preset mark; the first extraction unit is used for extracting first content from the conversation content according to the preset mark selected by the user; the sending unit is used for sending the first content to a first designated user so that the first designated user can determine the accuracy of the first content; and the obtaining unit is used for obtaining confirmation feedback of the first appointed user and obtaining the content under the preset category based on the confirmation feedback.
Optionally, the obtaining unit includes: a first obtaining subunit, configured to, if the confirmation feedback indicates that the first content is an accurate content, take the first content as a content in the preset category; a second obtaining subunit, configured to, if the confirmation feedback includes a second content obtained after the first content is modified, take the second content as the content in the preset category.
Optionally, the first designated user is a producer of the first content; or, a manager of an event corresponding to the first content.
Optionally, the obtaining module includes: the second extraction unit is used for extracting the conversation content with the preset mark in the conversation content to obtain an extraction result; and the second determining unit is used for determining the content in the preset category based on the extraction result and the user information generating the extraction result.
Optionally, the preset marks include at least two types of preset marks, and the obtaining module is configured to: based on the at least two types of preset marks contained in the session content, extracting the content under at least two types of preset categories corresponding to the at least two types of preset marks from the session content.
Optionally, the apparatus further comprises: and the providing module is used for providing the content under the preset category for a second specified user so that the second specified user can obtain the content under the preset category.
Optionally, the apparatus further comprises: a second determining module, configured to determine a participant of the session content, and determine the participant as the second designated user; or, the second designated user to which the content under the preset category points is extracted from the session content; or, determining a manager of the content of the preset category, and determining the manager as the second designated user.
Optionally, the second determining module is configured to: determining a preset mark from the content under the preset category; determining the user associated with the preset mark as the second designated user; or if the content in the preset category is the work content, determining that the executor of the work content is the second designated user.
Optionally, the preset mark is marked by a producer of the corresponding session content; or, the preset mark is marked by a user with the session content management authority.
Optionally, the obtaining module is configured to: obtaining the session content generated by a multi-person session group of the instant messaging software; the device further comprises: a third determining module, configured to determine a new user joining the multi-user conversation group after obtaining the content in the preset category; and the providing module is used for providing the content under the preset category for the new user.
The technical scheme provided in the embodiment of the application at least has the following technical effects or advantages:
in the embodiment of the invention, after the session content generated by the instant messaging software is obtained, the preset mark contained in the session content can be determined, and the preset mark is used for marking the session content according to the preset category; therefore, the content under the preset category can be obtained from the conversation content based on the preset mark, and the user does not need to manually check all historical conversation contents, so that the technical effect of improving the efficiency of extracting the content under the preset category from the conversation content is achieved.
The foregoing description is only an overview of the technical solutions of the present invention, and the embodiments of the present invention are described below in order to make the technical means of the present invention more clearly understood and to make the above and other objects, features, and advantages of the present invention more clearly understandable.
Drawings
Various other advantages and benefits will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings are only for purposes of illustrating the preferred embodiments and are not to be construed as limiting the invention. Also, like reference numerals are used to refer to like parts throughout the drawings. In the drawings:
fig. 1 is a flowchart illustrating an instant messenger-based information extraction method according to an embodiment of the present invention;
fig. 2 is a block diagram illustrating an information extraction apparatus based on instant messenger according to another embodiment of the present invention.
Detailed Description
Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.
The embodiment of the invention provides an information extraction method and device based on instant messaging software, which are used for solving the technical problem of low efficiency of extracting information from session content in the prior art.
In a first aspect, an embodiment of the present invention provides an information extraction method based on instant messaging software, please refer to fig. 1, including:
step S101: obtaining session content generated by instant messaging software;
step S102: determining a preset mark contained in the session content, wherein the preset mark is used for marking the session content according to a preset category;
step S103: and acquiring the content under the preset category from the conversation content based on the preset mark.
For example, the solution may be applied to electronic devices, such as: mobile phones, tablet computers, notebook computers, PCs (Personal computers), and the like; the scheme can also be used for the server of the instant messaging software, and the embodiment of the invention is not limited.
In step S101, if the solution is applied to an electronic device located at a client, the session content may be obtained through data of instant chat software cached by the electronic device, where the electronic device may be any user participating in a session, or may be a user with a management right of the session content (e.g., a group owner or an administrator of a multi-person session group, or a group owner of a multi-person session group, a user specified by the administrator, etc.); if the scheme is applied to a server, the session content can be obtained by integrating the content received from a plurality of electronic devices participating in the instant session through the server.
The session content may be point-to-point session content between two users, or session content generated based on a multi-user session group, which may be a group of instant messaging software, and is often used for long-term sessions; the multi-person conversation group may also be a discussion group of instant messaging software, which is often used for temporary conversations.
In step S102, the preset mark may be generated by various users, such as: setting a preset mark for a certain conversation content by a producer of the conversation content; the individual session content can be marked by the user who has the management authority of the session content (such as a group owner, an administrator and the like).
In a specific implementation process, the preset mark may be used to mark category information of a certain session content, where the category information includes, for example: work items, parties, gourmet, movies, and the like.
In step S103, the content corresponding to the preset mark may be directly extracted as the extracted content in the preset category, and as an alternative embodiment, the content in the preset category may be extracted in the following manner: determining a preset mark selected by a user from at least one type of preset marks; extracting first content from the session content according to the preset mark selected by the user; sending the first content to a first designated user so that the first designated user can determine the accuracy of the first content; and obtaining confirmation feedback of the first appointed user, and obtaining the content under the preset category based on the confirmation feedback.
For example, the preset marks included in the session content may include one or more marks, such as: work item tags, party tags, food tags, movie tags, and the like, which indicate the categories to which the respective pieces of conversation content belong, including: in the categories of work items, parties, food, movies, and the like, the user may only be interested in a part of the content, so that at least one preset mark included in the conversation content can be selected, and the content in the preset category corresponding to the preset mark selected by the user is extracted.
Assuming that the session content is generated by a multi-person conversation group including user a, user B, user C, user D, and user E, the multi-person conversation group generates the following session content:
content of conversation 1
The user A: a meeting was started in a conference room at 10 am tomorrow.
Content of conversation 2
And a user B: to celebrate the birthday, I please eat at tomorrow night.
Content of conversation (c)
And a user B: yesterday goes to eat a bowl chicken, and the taste is very good, in XX way XX number.
Conversation content
And a user C: this weekend organized big house to peck isthmus play haar.
Content of conversation-
And a user D: user a, user C, remembering tomorrow at 8 am: 00 go to panda base Ha on time.
Content of conversation: |)
The user A: the leaders in the afterdays need to see and clean in the tomorrow.
Meanwhile, the user A marks the session content (working item) and the user B marks the session content (party), and in addition, the user A can set a food mark for the session content (food), a party mark for the session content (party), a working item mark for the session content (work item), and the like.
After obtaining the session content, the current electronic device searches for the session content by selecting a "work item" mark, the electronic device determines a position of a preset mark ("work item" mark) selected by the user in the session content, and after determining the position of the "work item" mark, extracts two pieces of first content based on the "work item" mark, which specifically includes: firstly, a user A: ha "Ha a meeting in a meeting room at 10 am tomorrow; ② user A: the leaders in the afterdays need to see and clean in the tomorrow. "
Taking the first content as: firstly, a user A: taking a meeting Ha "in a meeting room at 10 am tomorrow as an example (the processing method for other first content is similar), the current electronic device sends the session content to a first specified user (for example, a generator of the first content, that is, user A; or a manager of an event corresponding to the first content), and generates the following prompt information:
please confirm whether the following is accurate: ha "Ha a meeting in a meeting room at 10 am tomorrow;
after receiving the first content, the first designated user can determine whether the first content is accurate and return a determination feedback to the current device, wherein if the determination feedback indicates that the first content is accurate, the first content is taken as the extracted content in the preset category; and if the confirmation feedback comprises second content obtained after the first content is modified, taking the second content as the extracted content in the preset category.
For example: if the user A finds that the item is accurate, the user A can reply to 'no problem', and in this case, the extracted content under the preset category can be determined to be '10 am in conference room Ha tomorrow';
if the user a finds that the event is inaccurate, the user a may correct the event, for example, reply with the second content "take a meeting in the meeting room at 10 am the day", so that it may be determined that the content in the acquired preset category is "take a meeting in the meeting room at 10 am the day".
In the scheme, the content extracted based on the preset mark is corrected, so that the technical effect of obtaining more accurate content is achieved.
In a specific implementation process, in step S103, a part of the session content extracted from the session content may be directly used as the extracted content in the preset category, for example: the step of directly using the extracted part of content as content in a preset category, using the corrected part of session content as content in the preset category, and the like, wherein in order to obtain more detailed content in the preset category, the obtaining content in the preset category from the session content based on the preset mark includes:
extracting the conversation contents with the preset marks in the conversation contents to obtain an extraction result; and determining the preset content based on the extraction result and the user information generating the extraction result.
For example, regarding the conversation content generated by the previous user B, if only the conversation content is extracted, the information is not comprehensive enough, because other users do not know who please eat after obtaining the piece of conversation content, the extraction result of extracting the conversation content and the user information generating the extraction result together can form the content under the preset category, for example: the content under the preset category is "user B: based on this scheme, more detailed information can be obtained from the session contents, such as "i please eat everywhere in tomorrow evening" or "i please eat a grand home in tomorrow evening" to celebrate birthday, and so on.
In step S103, as can be known from the foregoing description, the preset mark set for the session content may only include a type of mark, so that the type of mark is a preset mark, and when the session content is extracted, all the session content corresponding to the type of mark may be extracted; as an optional embodiment, the preset marks include at least two types of preset marks, and the obtaining, based on the preset marks, the content in the preset category from the session content includes: based on the at least two types of preset marks contained in the session content, extracting the content under at least two types of preset categories corresponding to the at least two types of preset marks from the session content.
For example, assume that the session content includes three types of preset marks, which are: marking 'work items'; ② party mark; thirdly, the 'food' mark can respectively extract three types of contents corresponding to the three preset types; or, the user selects the "party" mark and the "food" mark from the three preset marks, and then the content in the party category and the content in the food category, etc. can be extracted from the marks.
In step S103, if the session content includes multiple preset marks, the preset mark for extracting the preset content may be determined in multiple ways, for example: the preset flag may be a flag set by a user of the electronic device, for example: user B is eating, then it may set the preset flag to include: the food, the corresponding preset category is the food category; if the user D prefers to watch various movies, the user D may set the preset flag to include: the corresponding preset type of the film and television is the film and television type; and everyone may need to pay attention to work matters and parties, and all users in the multi-person conversation group set the preset flag to include: the preset categories corresponding to the work items and the parties include: a work item category and a party category, and so on. So that the preset flags for user B include: the food, the work items and the party, the corresponding content of the preset category comprises: food category, work item category, and party category; the preset marks of the user D include: the corresponding preset types of contents of the film, the television, the work items and the party comprise: a movie category, a work item category, and a party category, among others. Of course, the preset flag may also be set by default, and the embodiment of the present invention is not limited. Further, in step S103, at least two types of preset contents that are interested by the user of the electronic device may be obtained, so as to extract the session contents more accurately.
After at least two types of preset contents are obtained, the two types of preset contents can be stored separately according to categories, so that the subsequent search is facilitated.
As an optional embodiment, after the obtaining the content in the preset category from the session content, the method further includes: and providing the content under the preset category for a second designated user so that the second designated user obtains the content under the preset category.
In a specific implementation process, the content in the preset category may include content that is also needed by other users, for example: if the conversation content is point-to-point conversation content of two users, if the content in the preset category is the content in the working item category, the colleagues of the two users may also need the content, and if the content in the preset category is the content in the party category, the friends of the two users may also need the content; if the session content is generated based on the multi-person conversation group, the preset content may include content that is also needed by other users in the multi-person conversation group, for example: work items, parties, etc., and may be provided to a second designated user, such as: send it to the electronic device where the second designated user is located, set it as announcement information for a multi-person conversation group, and so on.
By the scheme, the sharing of the content under the preset category is realized, and in addition, the second specified user does not need to re-extract the content under the preset category from the electronic equipment, so that the processing burden of the electronic equipment can be reduced.
In the implementation process, the second designated user may be a plurality of users, and three of them are listed below for description, and of course, in the implementation process, the second designated user is not limited to the following three cases.
First, a participant of the session content is determined, and the participant is determined to be the second designated user.
Taking the session content as point-to-point session content between two users as an example, the participants of the session content are the two users; taking the session content generated by the multi-person conversation group as an example, the participants of the session content are, for example: all members of the multi-person conversation group, a generator of conversation content contained in the conversation content, and so on.
Through the scheme, the technical effect that the content under the preset category can be shared among the participants of the conversation content is achieved.
Secondly, the second designated user to which the content under the preset category points is extracted from the session content.
In a specific implementation process, a preset mark can be determined from the content in the preset category; determining the user associated with the preset mark as the second designated user, where the preset mark is, for example, @,/… …/etc., and may also be other preset marks, and if the preset mark is detected, it indicates that the session content is provided for the user corresponding to the preset mark to view, so that the user associated with the preset mark (for example, a user behind the preset mark, a user between the preset marks, etc.) may be extracted as the second designated user; or, a user included in the session content corresponding to the content in the preset category may be directly extracted as the second specified user, for example, the session content listed above is the fifth specified user, where the included users are: the user A and the user C can determine that the second designated user is the user A and the user C; or, if the content in the preset category is the work content, determining that the executor of the work content is the second designated user, taking the session content sixthly listed as an example, because the executor of the work content is the work content, determining that the executor of the work content (for example, an administrative staff, an ordinary staff, and the like) is the second designated user, and the like.
In the scheme, the technical effect that the content under the preset category can be shared among the second designated users pointed by the content under the preset category is achieved, the sharing range is more accurate, and the irrelevant users are prevented from being disturbed.
And thirdly, determining a manager of the content under the preset category, and determining the manager as the second specified user.
Taking the session content as the session content obtained by the multi-person session group as an example, the administrator of each category of content can be set in the multi-person session group in advance, for example: setting a manager of the content of the party category as a user D, a manager of the work items as a user B and the like, and after obtaining the content corresponding to the conversation content (I), sending the content to the user B; after obtaining the content corresponding to the session content (c), it may be sent to user D, and so on. So that the content can be subsequently processed based on the manager of the content in the preset category, for example: user B may need to prepare a meeting room and user D may count the number of participants and other relevant information, etc.
As an alternative embodiment, the obtaining of the session content generated by the instant messaging software includes: obtaining the session content generated by a multi-person session group of the instant messaging software;
after the acquiring, based on the preset mark, the preset content in the preset category from the session content, the method further includes: determining a new user joining the multi-person conversation group after obtaining the content under the preset category; and providing the content under the preset category to the new user.
For example, users in a multi-person conversation group typically change, such as: and if the new user newly joining the multi-person conversation group does not know the previous conversation content, some information may be omitted, so that the multi-person conversation of the multi-person conversation group cannot be effectively participated in, and the multi-person conversation efficiency is influenced. In addition, in the scheme, because the content under the preset category is only provided for the new user, not the whole conversation content, the new user can conveniently and quickly acquire the key information of the previous chat, and the conversation efficiency is further improved.
Wherein, each time a new user is detected to join the multi-person conversation group, the content under the preset category of the previous preset time period (for example, 1h, 30min, etc.) can be provided to the new user, or the content under the preset category of the previous preset time period can be provided to all new members newly joined at preset time intervals (for example, 2min, 5min, etc.).
In a second aspect, based on the same inventive concept, an embodiment of the present invention provides an information extraction apparatus based on instant messaging software, please refer to fig. 2, including:
an obtaining module 20, configured to obtain session content generated by instant messaging software;
a first determining module 21, configured to determine a preset mark included in the session content, where the preset mark is used to mark the session content according to a preset category;
an obtaining module 22, configured to obtain, based on the preset flag, content in the preset category from the session content.
Optionally, the obtaining module 22 includes:
the first determining unit is used for determining the preset mark selected by the user from at least one type of preset mark;
the first extraction unit is used for extracting first content from the conversation content according to the preset mark selected by the user;
the sending unit is used for sending the first content to a first designated user so that the first designated user can determine the accuracy of the first content;
and the obtaining unit is used for obtaining confirmation feedback of the first appointed user and obtaining the content under the preset category based on the confirmation feedback.
Optionally, the obtaining unit includes:
a first obtaining subunit, configured to, if the confirmation feedback indicates that the first content is an accurate content, take the first content as a content in the preset category;
a second obtaining subunit, configured to, if the confirmation feedback includes a second content obtained after the first content is modified, take the second content as the content in the preset category.
Optionally, the first designated user is a producer of the first content; or, a manager of an event corresponding to the first content.
Optionally, the obtaining module 22 includes:
the second extraction unit is used for extracting the conversation content with the preset mark in the conversation content to obtain an extraction result;
and the second determining unit is used for determining the content in the preset category based on the extraction result and the user information generating the extraction result.
Optionally, the preset marks include at least two types of preset marks, and the obtaining module is configured to:
based on the at least two types of preset marks contained in the session content, extracting the content under at least two types of preset categories corresponding to the at least two types of preset marks from the session content.
Optionally, the apparatus further comprises:
and the providing module is used for providing the content under the preset category for a second specified user so that the second specified user can obtain the content under the preset category.
Optionally, the apparatus further comprises:
a second determining module, configured to determine a participant of the session content, and determine the participant as the second designated user; alternatively, the first and second electrodes may be,
extracting the second designated user pointed by the content under the preset category from the session content; alternatively, the first and second electrodes may be,
and determining a manager of the content of the preset category, and determining the manager as the second designated user.
Optionally, the second determining module is configured to:
determining a preset mark from the content under the preset category; determining the user associated with the preset mark as the second designated user; alternatively, the first and second electrodes may be,
and if the content under the preset category is the working content, determining that the executor of the working content is the second designated user.
Optionally, the preset mark is marked by a producer of the corresponding session content; or, the preset mark is marked by a user with the session content management authority.
Optionally, the obtaining module 20 is configured to: obtaining the session content generated by a multi-person session group of the instant messaging software;
the device further comprises:
a third determining module, configured to determine a new user joining the multi-user conversation group after obtaining the content in the preset category;
and the providing module is used for providing the content under the preset category for the new user.
Since the apparatus described in the second aspect of the present invention is an apparatus used for implementing the method for extracting information based on instant messaging software described in the first aspect of the present invention, and based on the method described in the first aspect of the present invention, a person skilled in the art can understand the specific structure and modification of the apparatus described in the second aspect of the present invention, and therefore no further description is given here, and all apparatuses used for implementing the method described in the first aspect of the present invention belong to the scope of the present invention to be protected.
The technical scheme provided in the embodiment of the application at least has the following technical effects or advantages:
in the embodiment of the invention, after the session content generated by the instant messaging software is obtained, the preset mark contained in the session content can be determined, and the preset mark is used for marking the session content according to the preset category; therefore, the content under the preset category can be obtained from the conversation content based on the preset mark, and the user does not need to manually check all historical conversation contents, so that the technical effect of improving the efficiency of extracting the content under the preset category from the conversation content is achieved.
The algorithms and displays presented herein are not inherently related to any particular computer, virtual machine, or other apparatus. Various general purpose systems may also be used with the teachings herein. The required structure for constructing such a system will be apparent from the description above. Moreover, the present invention is not directed to any particular programming language. It is appreciated that a variety of programming languages may be used to implement the teachings of the present invention as described herein, and any descriptions of specific languages are provided above to disclose the best mode of the invention.
In the description provided herein, numerous specific details are set forth. It is understood, however, that embodiments of the invention may be practiced without these specific details. In some instances, well-known methods, structures and techniques have not been shown in detail in order not to obscure an understanding of this description.
Similarly, it should be appreciated that in the foregoing description of exemplary embodiments of the invention, various features of the invention are sometimes grouped together in a single embodiment, figure, or description thereof for the purpose of streamlining the disclosure and aiding in the understanding of one or more of the various inventive aspects. However, the disclosed method should not be interpreted as reflecting an intention that: that the invention as claimed requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in less than all features of a single foregoing disclosed embodiment. Thus, the claims following the detailed description are hereby expressly incorporated into this detailed description, with each claim standing on its own as a separate embodiment of this invention.
Those skilled in the art will appreciate that the modules in the device in an embodiment may be adaptively changed and disposed in one or more devices different from the embodiment. The modules or units or components of the embodiments may be combined into one module or unit or component, and furthermore they may be divided into a plurality of sub-modules or sub-units or sub-components. All of the features disclosed in this specification (including any accompanying claims, abstract and drawings), and all of the processes or elements of any method or apparatus so disclosed, may be combined in any combination, except combinations where at least some of such features and/or processes or elements are mutually exclusive. Each feature disclosed in this specification (including any accompanying claims, abstract and drawings) may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise.
Furthermore, those skilled in the art will appreciate that while some embodiments herein include some features included in other embodiments, rather than other features, combinations of features of different embodiments are meant to be within the scope of the invention and form different embodiments. For example, in the following claims, any of the claimed embodiments may be used in any combination.
The various component embodiments of the invention may be implemented in hardware, or in software modules running on one or more processors, or in a combination thereof. Those skilled in the art will appreciate that a microprocessor or Digital Signal Processor (DSP) may be used in practice to implement some or all of the functionality of some or all of the components in an apparatus according to an embodiment of the invention. The present invention may also be embodied as apparatus or device programs (e.g., computer programs and computer program products) for performing a portion or all of the methods described herein. Such programs implementing the present invention may be stored on computer-readable media or may be in the form of one or more signals. Such a signal may be downloaded from an internet website or provided on a carrier signal or in any other form.
It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The invention may be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In the unit claims enumerating several means, several of these means may be embodied by one and the same item of hardware. The usage of the words first, second and third, etcetera do not indicate any ordering. These words may be interpreted as names.

Claims (16)

1. An information extraction method based on instant messaging software is characterized by comprising the following steps:
obtaining session content generated by instant messaging software;
determining a preset mark contained in the session content, wherein the preset mark is used for marking the session content according to a preset category; the preset mark is marked for the session content by a producer of the session content; or marking each piece of session content by a user with the session content management authority;
based on the preset mark, acquiring the content under the preset category from the session content, including:
determining a preset mark selected by a user from at least one type of preset marks;
extracting first content from the session content according to the preset mark selected by the user;
sending the first content to a first designated user so that the first designated user can determine the accuracy of the first content; the first designated user is a producer of the first content; or, a manager of an event corresponding to the first content;
obtaining confirmation feedback of the first designated user, and obtaining content under the preset category based on the confirmation feedback, wherein the content comprises: if the confirmation feedback represents that the first content is accurate, taking the first content as the content in the preset category; and if the confirmation feedback comprises second content obtained after the first content is modified, taking the second content as the content in the preset category.
2. The method of claim 1, wherein the obtaining the content in the preset category from the session content based on the preset mark comprises:
extracting the conversation contents with the preset marks in the conversation contents to obtain an extraction result;
and determining the content under the preset category based on the extraction result and the user information generating the extraction result.
3. The method according to claim 1 or 2, wherein the preset mark comprises at least two types of preset marks, and the obtaining of the content in the preset category from the session content based on the preset mark comprises:
based on the at least two types of preset marks contained in the session content, extracting the content under at least two types of preset categories corresponding to the at least two types of preset marks from the session content.
4. The method according to claim 1 or 2, wherein after said obtaining content under the preset category from the session content, the method further comprises:
and providing the content under the preset category for a second designated user so that the second designated user obtains the content under the preset category.
5. The method of claim 4, wherein prior to said providing content under the preset category to a second designated user, the method further comprises:
determining participants of the conversation content, and determining the participants as the second designated users; alternatively, the first and second electrodes may be,
extracting the second designated user pointed by the content under the preset category from the session content; alternatively, the first and second electrodes may be,
and determining a manager of the content of the preset category, and determining the manager as the second designated user.
6. The method of claim 5, wherein the extracting the second designated user to which the content under the preset category points from the session content comprises:
determining a preset mark from the content under the preset category; determining the user associated with the preset mark as the second designated user; alternatively, the first and second electrodes may be,
and if the content under the preset category is the working content, determining that the executor of the working content is the second designated user.
7. The method of claim 1 or 2, wherein the preset mark is marked by a producer of the corresponding session content; or, the preset mark is marked by a user with the session content management authority.
8. The method of claim 1 or 2, wherein the obtaining session content generated by instant messaging software comprises: obtaining the session content generated by a multi-person session group of the instant messaging software;
after the acquiring, based on the preset mark, the content in the preset category from the session content, the method further includes:
determining a new user joining the multi-person conversation group after obtaining the content under the preset category;
and providing the content under the preset category to the new user.
9. An information extraction device based on instant messaging software is characterized by comprising:
the obtaining module is used for obtaining the session content generated by the instant messaging software;
the first determining module is used for determining a preset mark contained in the conversation content, wherein the preset mark is used for marking the conversation content according to a preset category; the preset mark is marked for the session content by a producer of the session content; or marking each piece of session content by a user with the session content management authority;
an obtaining module, configured to obtain, based on the preset flag, content in the preset category from the session content, where the obtaining module includes:
the first determining unit is used for determining the preset mark selected by the user from at least one type of preset mark;
the first extraction unit is used for extracting first content from the conversation content according to the preset mark selected by the user;
the sending unit is used for sending the first content to a first designated user so that the first designated user can determine the accuracy of the first content; the first designated user is a producer of the first content; or, a manager of an event corresponding to the first content;
the obtaining unit is used for obtaining confirmation feedback of the first appointed user and obtaining content under the preset category based on the confirmation feedback;
the obtaining unit includes:
a first obtaining subunit, configured to, if the confirmation feedback indicates that the first content is an accurate content, take the first content as a content in the preset category;
a second obtaining subunit, configured to, if the confirmation feedback includes a second content obtained after the first content is modified, take the second content as the content in the preset category.
10. The apparatus of claim 9, wherein the acquisition module comprises:
the second extraction unit is used for extracting the conversation content with the preset mark in the conversation content to obtain an extraction result;
and the second determining unit is used for determining the content in the preset category based on the extraction result and the user information generating the extraction result.
11. The apparatus of claim 9 or 10, wherein the preset marks comprise at least two types of preset marks, and the obtaining module is configured to:
based on the at least two types of preset marks contained in the session content, extracting the content under at least two types of preset categories corresponding to the at least two types of preset marks from the session content.
12. The apparatus of claim 9 or 10, wherein the apparatus further comprises:
and the providing module is used for providing the content under the preset category for a second specified user so that the second specified user can obtain the content under the preset category.
13. The apparatus of claim 12, wherein the apparatus further comprises:
a second determining module, configured to determine a participant of the session content, and determine the participant as the second designated user; alternatively, the first and second electrodes may be,
extracting the second designated user pointed by the content under the preset category from the session content; alternatively, the first and second electrodes may be,
and determining a manager of the content of the preset category, and determining the manager as the second designated user.
14. The apparatus of claim 13, wherein the second determining module is to:
determining a preset mark from the content under the preset category; determining the user associated with the preset mark as the second designated user; alternatively, the first and second electrodes may be,
and if the content under the preset category is the working content, determining that the executor of the working content is the second designated user.
15. The apparatus of claim 9 or 10, wherein the preset mark is marked by a producer of the corresponding session content; or, the preset mark is marked by a user with the session content management authority.
16. The apparatus of claim 9 or 10, wherein the obtaining module is to: obtaining the session content generated by a multi-person session group of the instant messaging software;
the device further comprises:
a third determining module, configured to determine a new user joining the multi-user conversation group after obtaining the content in the preset category;
and the providing module is used for providing the content under the preset category for the new user.
CN201611124228.7A 2016-12-08 2016-12-08 Information extraction method and device based on instant messaging software Active CN106656754B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611124228.7A CN106656754B (en) 2016-12-08 2016-12-08 Information extraction method and device based on instant messaging software

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611124228.7A CN106656754B (en) 2016-12-08 2016-12-08 Information extraction method and device based on instant messaging software

Publications (2)

Publication Number Publication Date
CN106656754A CN106656754A (en) 2017-05-10
CN106656754B true CN106656754B (en) 2020-09-15

Family

ID=58819821

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611124228.7A Active CN106656754B (en) 2016-12-08 2016-12-08 Information extraction method and device based on instant messaging software

Country Status (1)

Country Link
CN (1) CN106656754B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109446204B (en) * 2018-11-27 2022-04-15 北京微播视界科技有限公司 Data storage method and device for instant messaging, electronic equipment and medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8600987B2 (en) * 2007-10-11 2013-12-03 Google Inc. Classifying search results to determine page elements
CN103442271A (en) * 2013-09-11 2013-12-11 东莞市远峰科技有限公司 Classified program searching method used for TV (Television) box

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102780652B (en) * 2012-07-23 2018-04-20 上海量明科技发展有限公司 The method and system for sorting out collection are carried out in instant messaging to information
CN104462518B (en) * 2014-12-22 2018-10-19 百度在线网络技术(北京)有限公司 Method and apparatus for being labeled to IM information
CN105337747B (en) * 2015-11-17 2019-03-08 小米科技有限责任公司 Group history message treatment method and device

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8600987B2 (en) * 2007-10-11 2013-12-03 Google Inc. Classifying search results to determine page elements
CN103442271A (en) * 2013-09-11 2013-12-11 东莞市远峰科技有限公司 Classified program searching method used for TV (Television) box

Also Published As

Publication number Publication date
CN106656754A (en) 2017-05-10

Similar Documents

Publication Publication Date Title
US10511642B1 (en) Tools for micro-communities
US9361626B2 (en) Social gathering-based group sharing
US9531649B2 (en) Identification of message recipients
US9531803B2 (en) Content sharing interface for sharing content in social networks
US9106710B1 (en) Interest-based system
EP3713159A1 (en) Gallery of messages with a shared interest
US10255360B2 (en) Communication terminal, communication method, program, and communication system
US20160182875A1 (en) Gallery of Videos Set to an Audio Time Line
US10171617B2 (en) Communication system that support review of usage details for a communication service
US20110196933A1 (en) Active e-mails
US20120158935A1 (en) Method and systems for managing social networks
AU2016201139A1 (en) Conversational question and answer
US20070282887A1 (en) Link swarming in an open overlay for social networks and online services
US20070282950A1 (en) Activity history management for open overlay for social networks and online services
CN113132344B (en) Broadcasting and managing call participation
US10158723B2 (en) Determining communication history of users
CN104702881B (en) Method and system for the automatic start of audio/video conference
US9646196B2 (en) Image processing device, image processing method, and program
US20150052199A1 (en) Updating time-related information in post to make it more relevant for the requester on subsequent retrieval of post
CN105959207A (en) Audio and video sharing method and device
CN106533923A (en) Information processing method and device based on instant messaging software
US9894114B2 (en) Adjusting the display of social media updates to varying degrees of richness based on environmental conditions and importance of the update
US20150079959A1 (en) Smart Microphone
US20140258358A1 (en) Method of combining network data and mobile device using the same
CN106656754B (en) Information extraction method and device based on instant messaging software

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20170725

Address after: 100102, 18 floor, building 2, Wangjing street, Beijing, Chaoyang District, 1801

Applicant after: BEIJING ANYUN SHIJI SCIENCE AND TECHNOLOGY CO., LTD.

Address before: 100088 Beijing city Xicheng District xinjiekouwai Street 28, block D room 112 (Desheng Park)

Applicant before: Beijing Qihu Technology Co., Ltd.

TA01 Transfer of patent application right
GR01 Patent grant
GR01 Patent grant