CN113204579A - Content association method, system, device, electronic equipment and storage medium - Google Patents

Content association method, system, device, electronic equipment and storage medium

Info

Publication number
CN113204579A
Authority
CN
China
Prior art keywords
content
document
text
target
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110472686.4A
Other languages
Chinese (zh)
Inventor
薛凌霄
李长亮
卢晓栋
郭馨泽
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Kingsoft Software Co Ltd
Beijing Kingsoft Digital Entertainment Co Ltd
Original Assignee
Beijing Kingsoft Software Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Kingsoft Software Co Ltd
Priority to CN202110472686.4A
Publication of CN113204579A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2457 Query processing with adaptation to user needs
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/248 Presentation of query results
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40 Document-oriented image-based pattern recognition
    • G06V30/41 Analysis of document content
    • G06V30/418 Document matching, e.g. of document images

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Multimedia (AREA)
  • Document Processing Apparatus (AREA)

Abstract

The embodiment of the invention provides a content association method, system, device, electronic equipment and storage medium. The method may comprise the following steps: detecting whether a first text meeting a first format condition exists in a target document of a user, where the first format condition is that the symbol content of a first annotation symbol serves as the start content and the end content, and text content exists between the start content and the end content; if so, determining the text content in the first text other than the first annotation symbol as the content to be associated; and, in the case that a document related to the content to be associated is detected among other documents of the user, setting the content to be associated as a first content link in the target document. The document related to the content to be associated is a document whose title contains the content to be associated, and the link content of the first content link is a presentation interface of the document related to the content to be associated. With this scheme, text content in a document can be effectively associated with other relevant documents.

Description

Content association method, system, device, electronic equipment and storage medium
Technical Field
The present invention relates to the field of document processing technologies, and in particular, to a content association method, system, apparatus, electronic device, and storage medium.
Background
Users typically edit documents through a document processing client, and the document content is edited according to the user's actual needs. For example, the document content may be notes taken while studying, in which case each document is one note.
Typically, the contents of some documents of the same user are related. For a document written by a user, knowing which of the user's other documents the text content of that document relates to makes it easier for the user to summarize and master the document content.
Based on this, how to effectively associate text content in a document with other relevant documents is a problem to be solved urgently.
Disclosure of Invention
The embodiment of the invention aims to provide a content association method, a content association system, a content association device, electronic equipment and a storage medium, so that text content in a document can be effectively associated with other documents with relevance. The specific technical scheme is as follows:
in a first aspect, an embodiment of the present invention provides a content association method, where the method includes:
detecting whether a first text meeting a first format condition exists in a target document of a user; the first format condition is that the symbol content of a first annotation symbol is used as a starting content and an ending content, and a text content exists between the starting content and the ending content;
if yes, determining text contents in the first text except the first annotation symbol as contents to be associated;
in the case that a document related to the content to be associated is detected from other documents of the user, setting the content to be associated as a first content link in the target document;
wherein the document related to the content to be associated is a document whose title contains the content to be associated, and the link content of the first content link is a presentation interface of the document related to the content to be associated.
Optionally, the method further comprises:
in the case that a document related to the content to be associated is not detected from other documents of the user, setting the content to be associated as a second content link in the target document;
wherein the link content of the second content link is designated content, and the designated content is used for prompting that no document related to the content to be associated exists.
Optionally, after the content to be associated is set as the second content link, the method further includes:
when the user creates a new document, if the content to be associated is a second content link, detecting whether the new document is a document related to the content to be associated;
and if so, replacing the link content of the second content link with the display interface of the new document.
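The placeholder behaviour of the second content link can be illustrated with a minimal Python sketch. The names `ContentLink`, `NO_DOCUMENT_NOTICE`, and `on_document_created`, and the `open:` string standing in for a presentation interface, are all hypothetical; the source does not prescribe any data model.

```python
from dataclasses import dataclass
from typing import List, Optional

# Hypothetical "designated content" shown when no related document exists.
NO_DOCUMENT_NOTICE = "No document related to this content exists yet."


@dataclass
class ContentLink:
    text: str                            # the content to be associated
    target_doc_id: Optional[str] = None  # None marks a second content link

    def link_content(self) -> str:
        """Return what the link shows when activated."""
        if self.target_doc_id is None:
            return NO_DOCUMENT_NOTICE
        # Stand-in for opening the presentation interface of the document.
        return f"open:{self.target_doc_id}"


def on_document_created(links: List[ContentLink], doc_id: str, title: str) -> int:
    """Point placeholder links at the new document if its title contains
    their text; return how many links were updated."""
    upgraded = 0
    for link in links:
        if link.target_doc_id is None and link.text in title:
            link.target_doc_id = doc_id
            upgraded += 1
    return upgraded
```

Links that already point at a document are left untouched, so only second content links are affected when a new document is created.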
Optionally, the method further comprises:
detecting entity words in the target document;
and adding the first annotation symbol to the entity words.
Optionally, after the content to be associated is set as the first content link in the target document, the method further includes:
detecting whether the content to be associated is contained in the document contents of other documents of the user and the content to be associated is not set as a first content link, and if so, outputting first prompt information;
the first prompt information is used for prompting that a document to be processed exists and/or prompting a document identifier of the document to be processed; the document to be processed is a document which contains the content to be associated and the content to be associated is not set as a first content link.
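One way to picture the check for documents to be processed is the following Python sketch. The `Document` structure, the `linked_phrases` field, and the wording of the prompt are assumptions made for illustration only.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set


@dataclass
class Document:
    doc_id: str
    body: str
    # Phrases already set as first content links in this document (assumed model).
    linked_phrases: Set[str] = field(default_factory=set)


def documents_to_process(content: str, other_docs: List[Document]) -> List[str]:
    """IDs of documents whose body contains the content to be associated
    but where that content has not been set as a first content link."""
    return [
        doc.doc_id
        for doc in other_docs
        if content in doc.body and content not in doc.linked_phrases
    ]


def first_prompt(content: str, other_docs: List[Document]) -> Optional[str]:
    """Build the first prompt information, or None when nothing is pending."""
    pending = documents_to_process(content, other_docs)
    if not pending:
        return None
    return "Documents to be processed: " + ", ".join(pending)
```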
Optionally, the method further comprises:
detecting whether a second text meeting a second format condition exists in the target document; the second format condition is that the symbol content of a second annotation symbol is used as the starting content and the ending content, and text content exists between the starting content and the ending content;
if yes, determining text contents in the second text except the second annotation symbol as contents to be replaced;
and under the condition that the other documents have content blocks containing the contents to be replaced, performing content replacement on the second text in the target document based on the content blocks.
Optionally, the content replacement of the second text in the target document based on the content block includes:
displaying a selection list for the content to be replaced in a display interface of the target document, wherein each content block containing the content to be replaced is displayed in the selection list;
determining a target content block selected by the user through the selection list;
and replacing the second text in the display interface of the target document with the target content block.
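The replacement flow above can be sketched in Python as follows. The source does not fix the second annotation symbol, so `((` and `))` are assumed here; the block dictionary and the `choose` callback (standing in for the user's pick from the selection list) are likewise hypothetical.

```python
import re
from typing import Callable, Dict, List

# Assumed second annotation symbol; the source leaves the concrete symbol open.
SECOND_START, SECOND_END = "((", "))"
SECOND_TEXT = re.compile(
    re.escape(SECOND_START) + r"(.+?)" + re.escape(SECOND_END)
)


def build_selection_list(to_replace: str, blocks: Dict[str, str]) -> List[str]:
    """IDs of content blocks (from other documents) that contain the
    content to be replaced."""
    return [block_id for block_id, text in blocks.items() if to_replace in text]


def replace_second_text(
    document: str,
    blocks: Dict[str, str],
    choose: Callable[[List[str]], str],
) -> str:
    """Replace each second text with the content block picked via `choose`."""

    def substitute(match: "re.Match[str]") -> str:
        candidates = build_selection_list(match.group(1), blocks)
        if not candidates:
            return match.group(0)  # no matching block: leave the text as is
        return blocks[choose(candidates)]

    return SECOND_TEXT.sub(substitute, document)
```

Passing the choice as a callback keeps the sketch independent of any particular user interface; a real client would show the list and wait for the user's selection.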
Optionally, the selection list further includes a content input box;
after the displaying of the selection list for the content to be replaced, the method further comprises:
when it is detected that the user inputs content in the content input box, obtaining the input content and replacing the second text in the display interface of the target document with the input content.
Optionally, after replacing the second text in the presentation interface of the target document with the target content block, the method further includes:
when a predetermined operation for the target content block existing in the target document is received, outputting second prompt information corresponding to the target content block at a specified associated position of the target content block;
wherein the second prompt information is: the content obtained by replacing the text content in the second text, other than the second annotation symbol, with the block ID of the target content block.
In a second aspect, an embodiment of the present invention provides a content association system, where the system includes: the system comprises a document processing client and a preset server;
the document processing client is used for detecting whether a first text meeting a first format condition exists in a target document of a user; if the first text exists, determining the text content in the first text except the first annotation symbol as the content to be associated; sending a query request of the document related to the content to be associated to the predetermined server; wherein the first format condition is that the symbol content of the first annotation symbol is used as a starting content and an ending content, and a text content is arranged between the starting content and the ending content;
the predetermined server is used for detecting whether a document related to the content to be associated exists in other documents of the user;
the document processing client is further configured to set the content to be associated as a first content link in the target document when a document related to the content to be associated is detected from other documents of the user; wherein the document related to the content to be associated is a document whose title contains the content to be associated, and the link content of the first content link is a presentation interface of the document related to the content to be associated.
In a third aspect, an embodiment of the present invention provides a content association apparatus, which is applied to an electronic device, and includes:
the detection module is used for detecting whether a first text meeting a first format condition exists in a target document of a user; the first format condition is that the symbol content of a first annotation symbol is used as a starting content and an ending content, and a text content exists between the starting content and the ending content;
the first determining module is used for determining text contents in the first text except the first annotation symbol as to-be-associated contents if detecting that the first text meeting a first format condition exists in the target document of the user;
the first processing module is used for setting the content to be associated as a first content link in the target document under the condition that a document related to the content to be associated is detected from other documents of the user;
wherein the document related to the content to be associated is a document whose title contains the content to be associated, and the link content of the first content link is a presentation interface of the document related to the content to be associated.
In a fourth aspect, an embodiment of the present invention provides an electronic device, including a processor, a communication interface, a memory, and a communication bus, where the processor, the communication interface, and the memory communicate with one another through the communication bus;
a memory for storing a computer program;
a processor configured to implement the steps of any of the content association methods described in the first aspect when executing a program stored in the memory.
In a fifth aspect, the present invention provides a computer-readable storage medium, in which a computer program is stored, and the computer program, when executed by a processor, implements the steps of any content association method described in the first aspect.
Embodiments of the present invention also provide a computer program product containing instructions, which when run on a computer, cause the computer to perform any of the above-mentioned content association methods.
The embodiment of the invention has the following beneficial effects:
in the scheme provided by the embodiment of the invention, the text content in the first text other than the first annotation symbol is taken as the content to be associated of the target document; when a document whose title contains the content to be associated is detected among other documents of the user, that is, when a document related to the content to be associated is detected, the content to be associated in the target document is associated with that document by setting a content link. Therefore, the text content in the document can be effectively associated with other relevant documents, and the user experience is improved.
Of course, not all of the advantages described above need to be achieved at the same time in the practice of any one product or method of the invention.
Drawings
In order to illustrate the embodiments of the present invention or the technical solutions in the prior art more clearly, the drawings used in the description of the embodiments or the prior art are briefly described below. The drawings in the following description show only some embodiments of the present invention, and those skilled in the art can obtain other embodiments by referring to these drawings.
Fig. 1 is a flowchart of a content association method according to an embodiment of the present invention;
Fig. 2 is another flowchart of a content association method according to an embodiment of the present invention;
Fig. 3 is another flowchart of a content association method according to an embodiment of the present invention;
Fig. 4(a) is a schematic diagram of an interface with dots as predetermined identification symbols;
Fig. 4(b) is a schematic diagram showing interaction among the terminal device, the predetermined server and the intelligent processing terminal;
Fig. 4(c) is a schematic diagram of the selection list formed for the content to be replaced "aa" in the presentation interface of the target document;
Fig. 4(d) is a schematic diagram of the interface after the text content conforming to the second format condition is replaced with content block 1 in the presentation interface of the target document;
Fig. 5 is a schematic structural diagram of a content association system according to an embodiment of the present invention;
Fig. 6 is a schematic structural diagram of a content association apparatus according to an embodiment of the present invention;
Fig. 7 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived from the embodiments given herein by one of ordinary skill in the art, are within the scope of the invention.
In order to effectively associate text content in a document with other documents with relevance, the embodiment of the invention provides a content association method, a system, a device, an electronic device and a storage medium.
The following first describes a content association method provided in an embodiment of the present invention.
The content association method provided by the embodiment of the invention is applied to the electronic equipment. In a specific application, the electronic device may be a terminal device, for example: devices such as smart phones, tablet computers, notebook computers, and the like; of course, the electronic device is not limited to the terminal device, and for example, the electronic device may be a server, which is also reasonable.
Moreover, the functional software for implementing the content association method provided by the embodiment of the present invention may be a content association apparatus running in the electronic device. If the electronic device is a terminal device, the content association apparatus may be a function module of a document processing client running in the terminal device; if the electronic device is a server, the content association apparatus may be a function module of a predetermined server running in the server, and the predetermined server is a server corresponding to the document processing client on the user side.
A user typically edits document content through a document processing client, according to the user's actual needs. Illustratively, the document processing client may be a client dedicated to recording notes/logs edited by a user. It should be emphasized that any document editing client that has a content association requirement may serve as the document processing client in the embodiments of the present invention. In addition, the document processing client may be a client in APP (Application) form or a client in web page form; the embodiments of the present invention do not limit the specific form of the client.
Typically, the contents of some documents of the same user are related. For a document written by a user, knowing which of the user's documents the text content of that document relates to makes it easier for the user to summarize and master the document content.
In order to effectively associate text content in a document with other documents having relevance, so as to improve the use experience of a user, a content association method provided by the embodiment of the present invention may include the following steps:
detecting whether a first text meeting a first format condition exists in a target document of a user; the first format condition is that the symbol content of a first annotation symbol is used as the start content and the end content, and text content exists between the start content and the end content;
if so, determining the text content in the first text other than the first annotation symbol as the content to be associated;
in the case that a document related to the content to be associated is detected from other documents of the user, setting the content to be associated as a first content link in the target document;
wherein the document related to the content to be associated is a document whose title contains the content to be associated, and the link content of the first content link is a presentation interface of the document related to the content to be associated.
In the scheme provided by the embodiment of the invention, the text content in the first text other than the first annotation symbol is taken as the content to be associated of the target document; when a document whose title contains the content to be associated is detected among other documents of the user, that is, when a document related to the content to be associated is detected, the content to be associated in the target document is associated with that document by setting a content link. Therefore, the text content in the document can be effectively associated with other relevant documents, and the user experience is improved. In addition, because the text content annotated by the first annotation symbol is set as the first content link, the user can be helped to build a knowledge network of related documents, which makes associated management and content summarization of the documents more convenient. The benefit is especially noticeable for highly specialized documents with a large amount of content.
A content association method provided by an embodiment of the present invention is described in detail below with reference to the accompanying drawings.
As shown in fig. 1, a content association method provided in an embodiment of the present invention may include the following steps:
s101, detecting whether a first text meeting a first format condition exists in a target document of a user;
the first format condition is that the symbol content of the first annotation symbol is used as the start content and the end content, and text content exists between the start content and the end content. The target document may be any document being presented. In this embodiment, the first annotation symbol is a paired symbol consisting of a start content and an end content, and it may be set according to actual conditions. By way of example, the first annotation symbol may be a double-bracket symbol, such as "[[ ]]" in "[[text content]]", "{{ }}" in "{{text content}}", or "(( ))" in "((text content))". It is emphasized that the first annotation symbol is not limited to double-bracket symbols; any pairing of a start content and an end content can be used as the first annotation symbol.
Whether the electronic equipment is terminal equipment or a server, whether the first text meeting the first format condition exists in the target document of the user can be detected. Illustratively, when the electronic device is a terminal device, the electronic device detects whether a first text meeting a first format condition exists in a target document of a user by analyzing each character type of document content of the target document; when the electronic device is a server, the terminal device running the document processing client at the user side can detect whether a first text meeting a first format condition exists in the target document of the user in a mode of analyzing each character type of the document content of the target document, and report the analysis result to the electronic device.
In addition, there may be a plurality of ways to detect whether the first text meeting the first format condition exists in the target document of the user. For example, the electronic device can detect whether a start content of a first annotation symbol is input in the target document, detect whether an end content of the first annotation symbol is input in the target document after detecting that the start content of the first annotation symbol is input, and detect that a first text meeting a first format condition exists in the target document if a text content exists between the start content and the end content of the first annotation symbol after detecting that the end content of the first annotation symbol is input. For another example, the electronic device may detect whether the end content of the first annotation symbol is input in the target document, and after detecting that the end content of the first annotation symbol is input, detect that the first text meeting the first format condition exists in the target document if the start content of the first annotation symbol exists within a predetermined text distance before the detected end content of the first annotation symbol and text content exists between the start content and the end content of the first annotation symbol. Wherein the electronic device can detect whether the start content and the end content of the first annotation symbol are input in the target document and whether text content exists between the start content and the end content of the first annotation symbol based on a predetermined character recognition mode.
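The second detection approach above, which reacts to the end content and looks back within a predetermined text distance, might look like the following Python sketch. The `[[ ]]` symbol is taken from the examples in this description; the function name, the `cursor` parameter, and the value of `MAX_DISTANCE` are assumptions.

```python
from typing import Optional

START, END = "[[", "]]"  # one of the example first annotation symbols
MAX_DISTANCE = 50         # hypothetical "predetermined text distance"


def detect_on_end_typed(text: str, cursor: int) -> Optional[str]:
    """Return the text between the start and end content if the characters
    just before `cursor` complete a first text, else None."""
    if not text[:cursor].endswith(END):
        return None
    end_at = cursor - len(END)
    # Only look back a bounded distance for the start content.
    window_start = max(0, end_at - MAX_DISTANCE)
    start_at = text.rfind(START, window_start, end_at)
    if start_at == -1:
        return None
    inner = text[start_at + len(START):end_at]
    # The first format condition requires text between start and end.
    return inner if inner.strip() else None
```

Bounding the look-back keeps the check cheap when it runs on every keystroke, at the cost of missing very long annotated spans.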
In addition, there are a plurality of ways of forming, in the target document, the first text that meets the first format condition. Illustratively, in one implementation, the first text meeting the first format condition is formed by the user editing the target document. Specifically, while editing the target document, the user may add the first annotation symbol to the text content intended as the basis for association, thereby forming a first text meeting the first format condition. The text content to which the user adds the first annotation symbol may be an entity word, but is not limited thereto.
It is understood that entity words are words representing entities, and include nouns and pronouns. Entity words may fall into a plurality of categories, for example: person names, place names, organization names, other proper nouns, and so forth. For example, suppose the phrase "intellectual property" appears in the document content of a target document, and the user wants to use "intellectual property" as the basis for association, that is, wants to associate "intellectual property" with related documents. If the first annotation symbol is "[[ ]]", the user adds the first annotation symbol on the left and right sides of "intellectual property" to obtain "[[intellectual property]]". A first text meeting the first format condition then exists in the target document, and the electronic device can detect it: "[[intellectual property]]".
Illustratively, in another implementation, the electronic device edits the target document by itself, so as to form a first text meeting a first format condition in the target document. Based on this implementation, the content association method may further include:
detecting entity words in the target document; and adding the first annotation symbol to the entity words, so as to obtain a first text meeting the first format condition.
Because entity words represent entities, that is, carry specific meanings, in this implementation the electronic device detects the entity words in the target document while the document is being displayed, and turns each detected entity word into a first text meeting the first format condition. It can be understood that the electronic device can detect all categories of entity words in the target document, thereby ensuring comprehensive content association; of course, the electronic device may instead detect only entity words of a designated category, so that content association better matches actual needs and an excessive number of associations does not reduce document processing efficiency. The designated category may be set in advance by the user, or by the document processing client or the predetermined server, either of which is reasonable.
In addition, in this implementation, after the first annotation symbol is added to an entity word, the first annotation symbol may be displayed in the target document, or it may be hidden in the target document, that is, invisible to the user; both are reasonable. When the first annotation symbol is hidden, the entity word annotated with it can be adjusted in a predetermined text adjustment manner to highlight it. The predetermined text adjustment manner may include one or more of: setting the text color to a predetermined color, making the text bold, making the text italic, and the like. Also for the hidden case, to further indicate to the user that the entity word is annotated with the first annotation symbol, an electronic device operated by a mouse may, for example, on detecting that the mouse hovers over the annotated entity word, display the entity word together with the first annotation symbol in a floating layer above it. It is emphasized that, in a specific application, the above ways of forming the first text meeting the first format condition may be used in combination, or only one of them may be used.
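As a rough illustration of automatic annotation, the Python sketch below wraps entity words in the `[[ ]]` symbol. Entity detection itself is replaced by a caller-supplied word list, which is purely an assumption; the description does not specify the detection technique at this point, and the function name is hypothetical.

```python
import re
from typing import Iterable

START, END = "[[", "]]"  # one of the example first annotation symbols


def annotate_entities(text: str, entity_words: Iterable[str]) -> str:
    """Wrap each listed entity word in the first annotation symbol,
    leaving spans that are already annotated unchanged."""
    # Longest words first so that longer phrases win over their substrings.
    words = sorted(set(entity_words), key=len, reverse=True)
    if not words:
        return text
    alternatives = "|".join(re.escape(word) for word in words)
    # First alternative: an existing annotated span (kept as is).
    # Second alternative: a bare entity word (captured in group 1).
    pattern = re.compile(
        re.escape(START) + r".+?" + re.escape(END) + "|(" + alternatives + ")"
    )

    def wrap(match: "re.Match[str]") -> str:
        if match.group(1) is None:
            return match.group(0)
        return f"{START}{match.group(1)}{END}"

    return pattern.sub(wrap, text)
```

Because existing annotated spans are skipped, running the function twice on the same text does not nest the symbols.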
The embodiments of the present invention do not limit the specific implementation of detecting entity words in the target document. For clarity of the scheme and of the layout, an exemplary process of detecting entity words in a target document is described later.
S102, when detecting that a first text meeting a first format condition exists in a target document of a user, determining the text content in the first text other than the first annotation symbol as the content to be associated;
when it is detected that a first text meeting the first format condition exists in the target document, the text content between the start content and the end content of the first annotation symbol can be extracted to obtain the content to be associated. The content to be associated is content for which it needs to be determined which of the user's documents are related to it.
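The extraction in S102 can be sketched with a regular expression in Python. The function name and the choice to strip surrounding whitespace are assumptions; the start and end contents default to the `[[ ]]` example but can be any of the paired symbols mentioned earlier.

```python
import re
from typing import List


def contents_to_associate(document: str, start: str = "[[", end: str = "]]") -> List[str]:
    """Return the text found between each start/end pair, in document order.
    Pairs with only whitespace between them are ignored."""
    pattern = re.compile(re.escape(start) + r"(.+?)" + re.escape(end))
    found = []
    for match in pattern.finditer(document):
        inner = match.group(1).strip()
        if inner:
            found.append(inner)
    return found
```

For instance, calling `contents_to_associate("{{tort law}}", "{{", "}}")` applies the same logic to the curly-brace variant of the symbol.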
S103, in the case that a document related to the content to be associated is detected from other documents of the user, setting the content to be associated as a first content link in the target document;
wherein the document related to the content to be associated is a document whose title contains the content to be associated, and the link content of the first content link is a presentation interface of the document related to the content to be associated.
For any document, because the title summarizes the document content and the title is unique, a document whose title contains the content to be associated can be regarded as a document related to the content to be associated. In the case that a document related to the content to be associated is detected from other documents of the user, in order to establish an association relationship between the content to be associated and the related document, the electronic device may set the content to be associated as a first content link in the target document. The content to be associated in the target document thus becomes a clickable link whose link content is the presentation interface of the document related to the content to be associated. In addition, after the content to be associated is set as the first content link, the content to be associated can be adjusted in a predetermined text adjustment manner so that it is highlighted, allowing the user to quickly distinguish which content has been set as a content link. For example, the predetermined text adjustment manner may include one or more of: setting the text color to a predetermined color, bolding the text content, italicizing the text content, and the like.
It can be understood that, if there is one document related to the content to be associated, the link content of the first content link is the presentation interface of that document. If there are multiple documents related to the content to be associated, optionally, before the content to be associated is set as the first content link in the target document, the method may further include: outputting predetermined prompt information for prompting the user to select a document from the multiple documents related to the content to be associated. In this case, the link content of the first content link is the presentation interface of the document that is selected by the user and is related to the content to be associated.
In addition, the manner of detecting whether a document related to the content to be associated exists from other documents of the user may include:
determining other documents of the user from the documents of the user based on the document identification of the target document;
for each other document, detecting whether the title of that document contains the content to be associated, to obtain a detection result;
and determining whether a document related to the content to be associated exists in other documents of the user based on the detection result.
If the electronic device is a terminal device, the electronic device may itself perform the step of detecting whether a document related to the content to be associated exists in the other documents of the user, or it may have that step performed by a predetermined server. In an optional implementation manner, detecting whether the title of another document contains the content to be associated may include: detecting, by character matching, whether the title of the other document contains the content to be associated.
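The three detection steps above can be sketched as follows. This is an illustration only: the `Document` record and the function name are hypothetical, and plain substring matching stands in for the character matching mentioned in the text.

```python
from dataclasses import dataclass


@dataclass
class Document:
    # Hypothetical minimal document record used only for this sketch.
    doc_id: str
    title: str
    body: str = ""


def find_related_documents(documents: list[Document], target_id: str,
                           content: str) -> list[Document]:
    """Return the user's other documents whose title contains `content`."""
    # Step 1: determine the other documents from the target document's identifier.
    others = [d for d in documents if d.doc_id != target_id]
    # Steps 2 and 3: character-match each title and collect the hits.
    return [d for d in others if content in d.title]


if __name__ == "__main__":
    docs = [Document("1", "Daily notes"), Document("2", "aa overview"),
            Document("3", "Unrelated")]
    print([d.doc_id for d in find_related_documents(docs, "1", "aa")])
```

An empty result corresponds to the case in which no related document is detected.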
If the electronic device is a terminal device, the target document is shown in the electronic device, so that the electronic device can set the content to be associated as the first content link in the target document. When the electronic device is a server, since the target document is shown in the terminal device running the document processing client, the electronic device controls the terminal device to set the content to be associated as a first content link in the target document, for example: the electronic equipment informs the terminal equipment of the corresponding relation between the access path of the relevant document of the content to be associated and the content to be associated, so that the content to be associated is set as a first content link in the target document of the terminal equipment.
In addition, in order to help the user find the content to be associated that is not noticed in other documents, optionally, after the content to be associated is set as the first content link in the target document, the content association method may further include:
detecting whether the content to be associated is contained in the document contents of other documents of the user and the content to be associated is not set as a first content link, and if so, outputting first prompt information;
the first prompt information is used for prompting that the document to be processed exists and/or prompting the document identification of the document to be processed; the document to be processed is a document which contains the content to be associated and the content to be associated is not set as the first content link.
Illustratively, suppose the user's documents include document 1, document 2 and document 3, and the target document is document 1. Assume that the first text "[[aa]]" satisfying the first format condition is detected in document 1, and that "aa" in the first text is set as a first content link. Then, if it is detected that document 2 contains the text content "aa" but the "aa" in document 2 is not set as a first content link, first prompt information may be output, such as "there are other documents whose aa is not associated" or "the aa in document 2 is not associated".
In this embodiment, the specific form of the first prompt message is not limited. In addition, after the first prompt message is output, the user may trigger content association of the content to be associated in the document mentioned in the first prompt message based on the first prompt message, or continuously ignore the content to be associated in the document mentioned in the first prompt message.
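The check for documents to be processed can be sketched as below. The description does not say how a document records that an occurrence is already a first content link; this sketch assumes, purely for illustration, that linked occurrences are stored in their annotated form "[[...]]". The function name and the dictionary layout are likewise hypothetical.

```python
def pending_documents(bodies: dict[str, str], target_id: str,
                      content: str) -> list[str]:
    """Return IDs of other documents that contain `content` outside any link."""
    marked = f"[[{content}]]"  # assumed stored form of an already linked occurrence
    pending = []
    for doc_id, body in bodies.items():
        if doc_id == target_id:
            continue
        # Drop linked occurrences, then look for any remaining plain occurrence.
        if content in body.replace(marked, ""):
            pending.append(doc_id)
    return pending


if __name__ == "__main__":
    bodies = {"1": "see [[aa]]", "2": "plain aa here", "3": "nothing relevant"}
    for doc_id in pending_documents(bodies, "1", "aa"):
        print(f"the aa in document {doc_id} is not associated")
```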
In the scheme provided by the embodiment of the invention, the text content in the first text other than the first annotation symbol is taken as the content to be associated of the target document. When a document whose title contains the content to be associated is detected from other documents of the user, that is, when a document related to the content to be associated is detected, the content to be associated in the target document is associated with that document by setting a content link. Therefore, the text content in a document can be effectively associated with other related documents, and the use experience of the user is improved. In addition, because the text content annotated with the first annotation symbol is set as the first content link, the user can be helped to build a knowledge network of related documents and can conveniently manage the associations between documents and summarize their content. The benefit is particularly noticeable for highly specialized documents and for documents with a large amount of content.
Optionally, based on the foregoing S101-S103, in another embodiment of the present invention, as shown in fig. 2, the content association method may further include the following steps:
S104, in the case that a document related to the content to be associated is not detected from other documents of the user, setting the content to be associated as a second content link in the target document;
if the document related to the content to be associated is not detected from other documents of the user, a prompt may be sent to the user for better user experience, that is, the content to be associated is set as the second content link in the target document. And the link content linked with the second content is designated content, and the designated content is used for prompting that no document related to the content to be associated exists. In this way, after the content to be associated is set as the second content link, the content to be associated may be a link that can be clicked, and the link content is the specified content.
In addition, after the content to be associated is set as the second content link, the content to be associated can be adjusted in a predetermined text adjustment manner so that it is highlighted, allowing the user to quickly distinguish which content has been set as a content link. For example, the predetermined text adjustment manner may include one or more of: setting the text color to a predetermined color, bolding the text content, italicizing the text content, and the like. In a specific application, content to be associated that is set as the second content link and content to be associated that is set as the first content link may be adjusted in different text adjustment manners, so as to distinguish text content set as different content links. For example, the color of content to be associated set as the first content link is set to red, and the color of content to be associated set as the second content link is set to yellow. The specified content may take various forms. For example, the specified content may be a blank interface, or it may be a prompt statement such as "no document related to the current content is found"; both are reasonable. It can be understood that, when the specified content is a blank interface, the user can edit log content related to the content to be associated in the blank interface, so as to form a new log.
By the scheme provided by the embodiment, the text content in the document can be effectively associated with other documents with relevance, so that the use experience of a user is improved; and when the document related to the content to be associated is not found, the user is effectively prompted by setting the content link, so that better use experience of the user is ensured.
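The choice between a first and a second content link can be sketched as follows. The `ContentLink` record, the `kind` values and the notice text are hypothetical stand-ins; the sketch also simply takes the first matching title, whereas the description allows prompting the user when several documents match.

```python
from dataclasses import dataclass
from typing import Optional

# Assumed wording of the specified content shown by a second content link.
NO_DOCUMENT_NOTICE = "No document related to the current content was found."


@dataclass
class ContentLink:
    text: str                      # the content to be associated
    kind: str                      # "first" or "second"
    target_doc_id: Optional[str]   # document whose interface the link opens
    notice: Optional[str] = None   # specified content for a second link


def build_link(content: str, other_titles: dict[str, str]) -> ContentLink:
    """Create a first link if some other title contains `content`, else a second link."""
    for doc_id, title in other_titles.items():
        if content in title:
            return ContentLink(content, "first", doc_id)
    return ContentLink(content, "second", None, NO_DOCUMENT_NOTICE)


if __name__ == "__main__":
    print(build_link("aa", {"2": "aa overview"}))
    print(build_link("aa", {"2": "unrelated"}))
```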
Optionally, in another embodiment of the present invention, based on the above S101-S104, after the content to be associated is set as the second content link, the method may further include the following step a:
whenever the user creates a new document, if the content to be associated is a second content link, detecting whether the new document is a document related to the content to be associated;
if so, replacing the link content of the second content link with the presentation interface of the new document.
A new document created by the user may be related to the content to be associated in the target document. To establish the association between the content to be associated and the related document in time, in this embodiment, each time the user creates a new document, if the content to be associated is a second content link, it is detected whether the new document is a document related to the content to be associated; if so, the link content of the second content link is replaced with the presentation interface of the new document.
There are various ways of detecting whether the user has created a new document. For example, in one implementation, detecting whether the user creates a new document may include: after receiving a document creation instruction, determining the new document corresponding to the instruction, and monitoring whether the user issues a save instruction for the new document; if the save instruction is detected, it is determined that the user has created a new document.
Therefore, by the scheme provided by the embodiment, the text content in the document can be effectively associated with other documents with relevance, so that the use experience of a user is improved; and when the document related to the content to be associated is not found, the user is effectively prompted by setting the content link, so that better use experience of the user is ensured. In addition, newly-built documents with relevance to the content to be associated can be found in time, and the association relation between the content to be associated and the newly-built documents is established.
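Step A can be sketched as a handler that runs when a newly created document is saved. The `ContentLink` record and `on_document_saved` are hypothetical names; relatedness is again assumed to mean that the new document's title contains the content to be associated.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ContentLink:
    text: str                             # the content to be associated
    kind: str                             # "first" or "second"
    target_doc_id: Optional[str] = None   # None while the link shows the specified content


def on_document_saved(links: list[ContentLink], new_doc_id: str,
                      new_title: str) -> int:
    """Point matching second content links at the new document; return how many changed."""
    updated = 0
    for link in links:
        if link.kind == "second" and link.text in new_title:
            # Replace the link content with the new document's presentation interface.
            link.target_doc_id = new_doc_id
            updated += 1
    return updated


if __name__ == "__main__":
    links = [ContentLink("aa", "second"), ContentLink("bb", "second")]
    print(on_document_saved(links, "7", "aa design notes"), links)
```

The handler is meant to be called only after the save instruction for the new document has been observed, matching the detection manner described above.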
Optionally, in another embodiment of the present invention, based on the above S101-S104, after the content to be associated is set as the second content link, the method may further include the following step B:
under the condition that the content to be associated is a second content link, detecting whether a document related to the content to be associated exists in other documents of the user except the target document every preset time;
if so, replacing the link content of the second content link with the presentation interface of the detected document related to the content to be associated.
Considering that a new document or a modified document created by a user may be related to a content to be associated in a target document, in order to establish the relevance between the content to be associated and the related document in time, in this embodiment, in a case where the content to be associated is a second content link, it is detected whether a document related to the content to be associated exists in other documents of the user except the target document at predetermined time intervals; and when the judgment result is yes, performing content replacement on the linked content linked with the second content.
Therefore, by the scheme provided by the embodiment, the text content in the document can be effectively associated with other documents with relevance, so that the use experience of a user is improved; and when the document related to the content to be associated is not found, the user is effectively prompted by setting the content link, so that better use experience of the user is ensured. In addition, newly built documents or changed documents with relevance to the content to be associated can be found in time, and the association relationship between the content to be associated and the newly built documents or the changed documents can be established.
Optionally, as shown in fig. 3, another embodiment of the present invention further provides a content association method, which may include the following steps:
S301, detecting whether a first text meeting a first format condition exists in a target document of a user, and detecting whether a second text meeting a second format condition exists in the target document;
the first format condition is that the symbol content of the first annotation symbol serves as the starting content and the ending content, with text content between the starting content and the ending content. The second format condition is that the symbol content of the second annotation symbol serves as the starting content and the ending content, with text content between the starting content and the ending content. The second annotation symbol is different from the first annotation symbol.
For a specific implementation manner and corresponding introduction content for detecting whether a first text meeting a first format condition exists in a target document of a user, refer to the corresponding content description in the foregoing embodiment, which is not described in detail in this embodiment.
The specific implementation manner and the corresponding introduction content for detecting whether the second text meeting the second format condition exists in the target document are as follows:
the second annotation symbol may be a paired symbol comprising starting content and ending content, and may be a predetermined symbol. By way of example, the second annotation symbol may be a double-bracket symbol, such as the "[[ ]]" in "[[text content]]", the "{{ }}" in "{{text content}}", or the "(( ))" in "((text content))". It is emphasized that the second annotation symbol is not limited to double-bracket symbols; any paired symbol comprising starting content and ending content may be used as the second annotation symbol. Whether the electronic device is a terminal device or a server, it can detect whether a second text meeting the second format condition exists in the target document of the user; for the specific detection manner, refer to the manner of detecting whether a first text meeting the first format condition exists in the target document in the above embodiment.
In addition, the second text meeting the second format condition may be formed in the target document in a variety of ways. For example, in one implementation manner, the second text is formed by the user operating on the document content. Specifically, while editing the target document, the user may add the second annotation symbol to text content for which the user wishes to query whether it can be replaced by the content of other logs, so as to form the second text meeting the second format condition. The text content to which the user adds the second annotation symbol may be an entity word, but is not limited thereto.
For example, if the wording "intellectual property" exists in the document content of the target document and the user wishes to query whether it can be replaced by the content of other logs, and the second annotation symbol is "(( ))", the user may add the second annotation symbol on the left and right sides of "intellectual property" to form "((intellectual property))". The electronic device can then detect that a second text meeting the second format condition exists in the target document: "((intellectual property))".
S302, if it is detected that a first text meeting a first format condition exists in a target document of a user, determining text contents in the first text except a first label symbol as to-be-associated contents;
S303, in the case that a document related to the content to be associated is detected from other documents of the user, setting the content to be associated as a first content link in the target document;
wherein the document related to the content to be associated is a document whose title contains the content to be associated, and the link content of the first content link is a presentation interface of the document related to the content to be associated.
For example, in an alternative implementation manner, for any document, whether the content to be associated is included in the title of the document may be identified through character matching.
Optionally, in a case that a document related to the content to be associated is not detected from other documents of the user, the method may further include: setting the content to be associated as a second content link in the target document; and the link content linked with the second content is designated content, and the designated content is used for prompting that no document related to the content to be associated exists.
Optionally, after the content to be associated is set as the second content link, the method may further include:
whenever the user creates a new document, if the content to be associated is a second content link, detecting whether the new document is a document related to the content to be associated;
if so, replacing the link content of the second content link with the presentation interface of the new document.
Optionally, the method may further include:
detecting entity words in the target document;
and adding the first label symbol for the detected entity word.
Optionally, after the content to be associated is set as the first content link in the target document, the method may further include:
detecting whether the content to be associated is contained in the document contents of other documents of the user and the content to be associated is not set as a first content link, and if so, outputting first prompt information;
the first prompt information is used for prompting that a document to be processed exists or prompting a document identifier of the document to be processed; the document to be processed is a document which contains content to be associated and the content to be associated is not set as a first content link.
S304, if the second text meeting the second format condition is detected to exist in the target document, determining text contents in the second text except the second label symbol as to-be-replaced contents;
when it is detected that a second text meeting the second format condition exists in the target document, the text content between the starting content and the ending content of the second annotation symbol can be extracted to obtain the content to be replaced. The content to be replaced is content that is to be replaced by related content in other documents.
S305, under the condition that a content block containing the content to be replaced exists in other documents, replacing the content of the second text in the target document based on the content block.
When the content block containing the content to be replaced exists in the other document, the content block which is relevant to the content to be replaced exists in the other document, so that the content of the second text in the target document can be replaced on the basis of the content block. That is, when the content block existing in the other document meets the user requirement, the second text in the presentation interface of the target document is replaced by a content block in the other document. After the second text in the presentation interface in the target document is replaced with a content block in another document, if the content block changes, the content block presented in the presentation interface of the target document also changes correspondingly, i.e., keeps consistent with the content block in another document.
It should be noted that each document contains at least one content block. Illustratively, each content block may be a paragraph; alternatively, each content block is a text block of the document content with a predetermined identifier as the starting content. For example: as shown in fig. 4(a), each content block has a dot as a start content with a dot as a predetermined identification symbol, and three content blocks are shown in fig. 4 (a).
In addition, the method for detecting whether the content block containing the content to be replaced exists in other documents may include:
for each other document, detecting whether the document contents of the other documents contain the content to be replaced, and if so, determining a content block containing the content to be replaced;
and when at least one content block is determined, judging that the content block containing the content to be replaced exists in other documents.
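The block search above can be sketched as follows, under the assumption that each non-empty line of a document body is one content block (the description also allows blocks that begin with a predetermined identifier). The function names and the `(doc_id, index, block)` result shape are hypothetical.

```python
def split_blocks(body: str) -> list[str]:
    """Assumed block model: every non-empty line of the body is one content block."""
    return [line.strip() for line in body.splitlines() if line.strip()]


def find_blocks(bodies: dict[str, str], target_id: str,
                content: str) -> list[tuple[str, int, str]]:
    """Return (doc_id, block_index, block_text) for blocks in other documents containing `content`."""
    hits = []
    for doc_id, body in bodies.items():
        if doc_id == target_id:
            continue
        for index, block in enumerate(split_blocks(body)):
            if content in block:
                hits.append((doc_id, index, block))
    return hits


if __name__ == "__main__":
    bodies = {"1": "See ((Aa)).", "2": "Intro\nAa is defined here\nOther", "3": "More on Aa"}
    print(find_blocks(bodies, "1", "Aa"))
```

A non-empty result corresponds to the judgment that a content block containing the content to be replaced exists in other documents.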
Moreover, if the electronic device is a terminal device, the electronic device may itself perform the step of detecting whether a content block containing the content to be replaced exists in other documents, or it may have that step performed by a predetermined server. If the electronic device is a terminal device, the target document is displayed on the electronic device, so the electronic device can replace the second text in the target document based on the content block once the content block is determined. When the electronic device is a server, because the target document is displayed on the terminal device running the document processing client, the electronic device controls the terminal device to replace the second text in the target document based on the content block.
Therefore, by the scheme provided by the embodiment, the text content in a document can be effectively associated with other related documents, so that the use experience of the user is improved. In addition, the scheme can replace text content specified by the user in the target document with related content, so that the user can quickly finish document editing based on content already created and can quickly reuse previously edited content. For example, in a note-taking tool, historical records and content can be reused quickly; compared with the traditional copy operation, no "search" operation is needed, so related content is not missed and knowledge points are reused more accurately.
Optionally, in another embodiment of the present invention, performing content replacement on the second text conforming to the second format condition in the target document based on the content block may include:
displaying a selection list aiming at the content to be replaced in a display interface of the target document; wherein, each content block containing the content to be replaced is displayed in the selection list;
determining a target content block selected by the user through the selection list;
and replacing the second text in the display interface of the target document with the target content block.
Considering that the number of the searched content blocks can be one or more, and the user has a requirement for selecting the required content block according to the requirement of the user, therefore, a selection list for the content to be replaced can be output in the display interface of the target document, so that the user can specify the content block through the selection list, and the replacement operation is completed.
Considering the content blocks in other documents, there may be no content that the user wishes to replace, and therefore, the user may be provided with a content input function in the selection list so that the user can input according to the actually required content. Based on the processing idea, the selection list may further include a content input box;
after outputting the selection list for the content to be replaced, the method may further include:
and when the user is detected to input the content in the content input box, acquiring the input content, and replacing the second text in the display interface of the target document with the input content.
For example, fig. 4(c) shows a selection list formed in the presentation interface of the target document for the content to be replaced, aa. The selection list includes a content input box and three content blocks, found in other documents, that contain the content to be replaced. If the user selects content block 1, the second text meeting the second format condition, i.e., "((Aa))", is replaced with content block 1 in the presentation interface of the target document, as shown in fig. 4(d).
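The replacement driven by the selection list can be sketched as below. The sketch assumes "(( ))" as the second annotation symbol, models the user's action as either an index into the candidate blocks or typed text from the content input box, and performs a one-off text substitution; it does not model keeping the displayed block in sync with its source document. All names are hypothetical.

```python
from typing import Optional


def replace_second_text(target_text: str, content: str, candidates: list[str],
                        choice: Optional[int] = None,
                        typed: Optional[str] = None) -> str:
    """Replace the first "((content))" with the chosen block or the typed input."""
    marked = f"(({content}))"  # assumed second annotation symbol
    if typed is not None:
        replacement = typed                # content entered in the input box
    elif choice is not None:
        replacement = candidates[choice]   # target content block picked from the list
    else:
        return target_text                 # nothing selected: leave the document as is
    return target_text.replace(marked, replacement, 1)


if __name__ == "__main__":
    blocks = ["content block 1", "content block 2", "content block 3"]
    print(replace_second_text("See ((Aa)) here.", "Aa", blocks, choice=0))
```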
In addition, after replacing the second text in the presentation interface of the target document with the target content block, the method further includes:
when a predetermined operation for the target content block existing in the target document is received, outputting second prompt information corresponding to the target content block at a specified associated position of the target content block;
wherein the second prompt information is the content obtained by replacing the text content of the second text, other than the second annotation symbol, with the block ID of the target content block.
Each content block in any document has a unique block ID; for example, the block ID may be formed by combining the ID of the document with an identifier of the position of the content block in the document. The predetermined operation may be, for example, a hover operation on the content block, but is not limited thereto; the specified associated position may be an upper left, upper right, lower left or lower right position, and so on. By outputting the second prompt information, a reader of the target document can later learn that the target content block is a content block existing in another document, and can delete the target content block from the target document by deleting the block ID of the target content block. Therefore, by the scheme provided by the embodiment, the text content in a document can be effectively associated with other related documents, so that the use experience of the user is improved; in addition, text content specified by the user in the target document can be replaced with related content according to the user's needs, so that the user can quickly finish document editing based on content already created.
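A block ID and the corresponding second prompt information can be sketched as follows. The "#" separator, the "(( ))" annotation symbol and both function names are assumptions made for illustration; the description only states that the ID combines the document ID with a position identifier.

```python
def make_block_id(doc_id: str, position: int) -> str:
    """Combine the document ID with the block's position (assumed "#" separator)."""
    return f"{doc_id}#{position}"


def second_prompt(block_id: str, opening: str = "((", closing: str = "))") -> str:
    """Second prompt: the second text with its inner content replaced by the block ID."""
    return f"{opening}{block_id}{closing}"


if __name__ == "__main__":
    print(second_prompt(make_block_id("doc2", 3)))
```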
For clarity of the scheme and clarity of layout, a specific implementation manner of detecting the entity words in the target document is exemplarily described below.
For example, in one implementation, the process of detecting the entity word in the target document may include:
inputting part or all of the document content of the target document into a pre-trained language model to perform character-level coding on the document content, and classifying the coded text information through a CRF (conditional random field) algorithm to obtain the entity category of each character in the document content; and splicing adjacent characters in the same category to obtain the entity category of each word, and obtaining the entity words in the document content of the required target document based on the entity category of each word.
The CRF is a sequence tagging algorithm and can be used for tasks such as part-of-speech tagging, word segmentation, named entity recognition and the like. It is to be understood that, if the entity words of the specified category in the target document are detected, the entity words of the specified category may be selected from the multiple words based on the entity category of each word, so as to obtain the entity words in the target document.
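The splicing step can be illustrated on its own, taking per-character categories as given. In the sketch below the label list stands in for the output of the pre-trained language model and the CRF layer, which are not reproduced; the label names ("ORG", "LOC", and "O" for non-entity characters) and the function name are assumptions.

```python
from itertools import groupby
from typing import Optional


def splice_entities(chars: str, labels: list[str],
                    wanted: Optional[set[str]] = None) -> list[tuple[str, str]]:
    """Merge adjacent characters sharing a category into (word, category) pairs."""
    if len(chars) != len(labels):
        raise ValueError("one label is required per character")
    words = []
    pos = 0
    for label, run in groupby(labels):
        length = len(list(run))
        word = chars[pos:pos + length]
        pos += length
        if label == "O":          # assumed marker for characters outside any entity
            continue
        if wanted is None or label in wanted:
            words.append((word, label))
    return words


if __name__ == "__main__":
    labels = ["ORG", "ORG", "O", "LOC", "LOC"]
    print(splice_entities("金山在北京", labels))
    print(splice_entities("金山在北京", labels, wanted={"LOC"}))
```

Merging by identical category alone cannot separate two directly adjacent entities of the same category; a tagging scheme with explicit begin/inside labels would be needed for that, which is a design choice the description leaves open.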
In addition, if the electronic device is a terminal device, in an implementation manner, the terminal device may detect the entity word in the target document by using a predetermined server.
As shown in fig. 4(b), the terminal device submits the content to be processed, which may be all or part of the content of the target document, to the predetermined server; the predetermined server transmits the content to be processed to an intelligent processing end through a request interface provided by the intelligent processing end, and the intelligent processing end identifies entity words in the content to be processed and feeds the entity words back to the predetermined server; the predetermined server then selects the entity words of the specified category from the received entity words and feeds the selected entity words back to the terminal device, so that the terminal device obtains each entity word in the content to be processed. For example, the predetermined server may send an entity word recognition request carrying the content to be processed to the intelligent processing end; after receiving the entity word recognition request, the intelligent processing end may extract the content to be processed from the request, recognize entity words in the content to be processed, and feed back a response result carrying each entity word to the predetermined server, so that the recognized entity words are fed back to the predetermined server in a batch.
Based on the foregoing method embodiment, an embodiment of the present invention further provides a content association system, as shown in fig. 5, where the system includes: a document processing client 510 and a predetermined server 520;
the document processing client 510 is configured to detect whether a first text meeting a first format condition exists in a target document of a user; if the first text exists, determining the text content in the first text except the first annotation symbol as the content to be associated; sending a query request of the document related to the content to be associated to the predetermined server; wherein the first format condition is that the symbol content of the first annotation symbol is used as a starting content and an ending content, and a text content is arranged between the starting content and the ending content;
the predetermined server 520 is configured to detect whether a document related to the content to be associated exists in other documents of the user;
the document processing client 510 is further configured to set the content to be associated as a first content link in the target document when a document related to the content to be associated is detected from other documents of the user; wherein the document related to the content to be associated is: a document whose title contains the content to be associated, and the link content of the first content link is: a display interface of the document related to the content to be associated.
For example, the document processing client 510 may send a log query request carrying the content to be associated and the document identifier of the target document to the predetermined server; correspondingly, the predetermined server is configured to determine other documents of the user based on the document identifier of the target document; detecting whether a document related to the content to be associated exists in other documents of the user, and feeding back an access path of the document related to the content to be associated to the document processing client when the document related to the content to be associated exists; further, the document processing client 510 sets the content to be associated as a first content link based on the received access path.
In the scheme provided by the embodiment of the invention, the text content in the first text except the first annotation symbol is taken as the content to be associated of the target document; when a document whose title contains the content to be associated is detected from other documents of the user, that is, when a document related to the content to be associated is detected, the content to be associated in the target document is associated with that document by setting a content link. Therefore, the text content in the document can be effectively associated with other relevant documents, and the use experience of the user is improved.
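A minimal sketch of the detection and lookup described above is given below. The annotation symbol "##", the function names, and the title-to-path mapping are assumptions chosen for illustration; the embodiment does not fix a concrete symbol or data layout.

```python
import re
from typing import Dict, List, Optional


def find_marked_contents(text: str, symbol: str) -> List[str]:
    """Return every text content that starts and ends with `symbol` and has
    at least one character in between (the first format condition)."""
    pattern = re.compile(re.escape(symbol) + r"(.+?)" + re.escape(symbol), re.DOTALL)
    return [match.group(1) for match in pattern.finditer(text)]


def find_related_document(content: str, other_docs: Dict[str, str]) -> Optional[str]:
    """Return the access path of the first document whose title contains
    `content`, or None if no such document exists.

    `other_docs` maps document titles to access paths (assumed layout).
    """
    for title, path in other_docs.items():
        if content in title:
            return path
    return None


target_text = "See ##weekly report## and ##budget## for details."
user_docs = {"2021 weekly report": "/docs/12", "Travel notes": "/docs/13"}

contents = find_marked_contents(target_text, "##")
paths = {content: find_related_document(content, user_docs) for content in contents}
```

Here "weekly report" resolves to an access path and could be set as a first content link, while "budget" has no related document.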
Optionally, in another embodiment of the present invention, the document processing client 510 is further configured to set the content to be associated as a second content link in the target document if a document related to the content to be associated is not detected from other documents of the user;
and the link content of the second content link is designated content, and the designated content is used for prompting that no document related to the content to be associated exists.
Optionally, in another embodiment of the present invention, after the content to be associated is set as the second content link, whenever the user creates a new document, if the content to be associated is the second content link, the document processing client 510 is further configured to detect whether the new document is a document related to the content to be associated; and if so, replacing the link content of the second content link with the display interface of the new document.
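The second content link and its later replacement can be sketched as follows. The class and function names, the notice wording, and the use of access paths to stand for display interfaces are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional

# Assumed wording of the designated content shown when no related document exists.
NO_DOCUMENT_NOTICE = "No related document exists yet."


@dataclass
class ContentLink:
    content: str
    target_path: Optional[str] = None  # None marks a second content link

    def link_content(self) -> str:
        """What the link points to: a document path or the designated content."""
        return self.target_path if self.target_path is not None else NO_DOCUMENT_NOTICE


def make_link(content: str, other_docs: Dict[str, str]) -> ContentLink:
    """Create a first content link if some title contains `content`,
    otherwise a second content link."""
    for title, path in other_docs.items():
        if content in title:
            return ContentLink(content, path)
    return ContentLink(content)


def on_document_created(links: List[ContentLink], title: str, path: str) -> None:
    """When a new document is created, repoint every second content link
    whose content appears in the new title."""
    for link in links:
        if link.target_path is None and link.content in title:
            link.target_path = path


link = make_link("budget", {"Travel notes": "/docs/13"})
before = link.link_content()
on_document_created([link], "2021 budget plan", "/docs/14")
after = link.link_content()
```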
Optionally, in another embodiment of the present invention, the document processing client 510 is further configured to:
detecting entity words in the target document;
and adding the first annotation symbol to the entity word.
Optionally, the document processing client is further configured to, after setting the content to be associated as a first content link in the target document, detect whether the content to be associated is included in the document contents of other documents of the user and the content to be associated is not set as the first content link, and if so, output first prompt information;
the first prompt information is used for prompting that a document to be processed exists or prompting a document identifier of the document to be processed; the document to be processed is a document which contains the content to be associated and the content to be associated is not set as a first content link.
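One way to picture this check is the sketch below. The `Document` structure, which records the contents already set as first content links, and the prompt wording are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Set


@dataclass
class Document:
    body: str
    linked_contents: Set[str] = field(default_factory=set)  # contents already set as first content links


def find_documents_to_process(content: str, other_docs: Dict[str, Document]) -> List[str]:
    """Identifiers of documents that contain `content` but have not yet
    set it as a first content link."""
    return [
        doc_id
        for doc_id, doc in other_docs.items()
        if content in doc.body and content not in doc.linked_contents
    ]


def first_prompt(content: str, other_docs: Dict[str, Document]) -> Optional[str]:
    """Build the first prompt information, or None if nothing is pending."""
    pending = find_documents_to_process(content, other_docs)
    if not pending:
        return None
    return "Documents to be processed: " + ", ".join(pending)


docs = {
    "doc-1": Document("The weekly report is due.", {"weekly report"}),
    "doc-2": Document("Attach the weekly report here."),
    "doc-3": Document("Nothing relevant."),
}
prompt = first_prompt("weekly report", docs)
```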
Optionally, in another embodiment of the present invention, the document processing client 510 is further configured to: detect whether a second text meeting a second format condition exists in the target document; if the second text exists, determine the text content in the second text except the second annotation symbol as the content to be replaced; and send a query request for a content block containing the content to be replaced to the predetermined server; wherein the second format condition is that the symbol content of a second annotation symbol is used as the starting content and the ending content, and text content exists between the starting content and the ending content;
the predetermined server 520 is further configured to query whether a content block containing the content to be replaced exists in the other documents;
the document processing client is further configured to, when a content block including the content to be replaced exists in the other document, perform content replacement on the second text in the target document based on the content block.
Optionally, in another embodiment of the present invention, the document processing client 510 performs content replacement on the second text in the target document based on the content block, including:
displaying a selection list aiming at the content to be replaced in a display interface of the target document; wherein, each content block containing the content to be replaced is displayed in the selection list;
determining a target content block selected by the user through the selection list;
and replacing the second text in the display interface of the target document with the target content block.
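The replacement flow above can be sketched as follows. The second annotation symbol "%%", the `ContentBlock` structure, and the function names are assumptions; the user's choice from the selection list is simulated by picking the first candidate.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ContentBlock:
    block_id: str
    text: str


def candidate_blocks(content: str, blocks: List[ContentBlock]) -> List[ContentBlock]:
    """Content blocks from other documents that contain the content to be
    replaced; these are the entries shown in the selection list."""
    return [block for block in blocks if content in block.text]


def replace_second_text(document_text: str, symbol: str, content: str, block: ContentBlock) -> str:
    """Replace the first occurrence of the second text (symbol + content +
    symbol) with the text of the chosen target content block."""
    return document_text.replace(symbol + content + symbol, block.text, 1)


blocks = [
    ContentBlock("b1", "Q1 revenue grew 12%"),
    ContentBlock("b2", "Team offsite notes"),
]
choices = candidate_blocks("revenue", blocks)
chosen = choices[0]  # stands in for the user's selection
updated = replace_second_text("Summary: %%revenue%%.", "%%", "revenue", chosen)
```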
Optionally, in another embodiment of the present invention, the selection list further includes a content input box;
the document processing client 510 is further configured to, after outputting the selection list for the content to be replaced, when it is detected that the user inputs content in the content input box, obtain the input content, and replace the second text in the presentation interface of the target document with the input content.
Optionally, in another embodiment of the present invention, the document processing client 510 is further configured to, after replacing the second text in the presentation interface of the target document with the target content block, when receiving a predetermined operation for the target content block existing in the target document, output second prompt information corresponding to the target content block at a specified associated position of the target content block;
wherein the second prompt information is: content obtained by replacing the text content in the second text, other than the second annotation symbol, with the block ID of the target content block.
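A small sketch of how such prompt content could be built is given below; the "%%" symbol and the block ID format are hypothetical.

```python
def second_prompt(second_text: str, symbol: str, block_id: str) -> str:
    """Keep the second annotation symbol at both ends of the second text and
    substitute the block ID for the text content in between."""
    wrapped = second_text.startswith(symbol) and second_text.endswith(symbol)
    if not wrapped or len(second_text) <= 2 * len(symbol):
        raise ValueError("text does not meet the second format condition")
    return symbol + block_id + symbol


hint = second_prompt("%%revenue%%", "%%", "b1")
```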
It should be noted that, for the detailed implementation process of each step executed by the document processing client and the predetermined server in the content association system provided by the embodiment of the present invention, reference may be made to corresponding contents in the foregoing method embodiment, which is not described herein again.
Based on the foregoing method embodiment, an embodiment of the present invention further provides a content association apparatus, which is applied to an electronic device, and as shown in fig. 6, the apparatus may include:
the detecting module 610 is configured to detect whether a first text meeting a first format condition exists in a target document of a user; the first format condition is that the symbol content of a first annotation symbol is used as a starting content and an ending content, and a text content exists between the starting content and the ending content;
a first determining module 620, configured to determine, if it is detected that a first text meeting a first format condition exists in the target document of the user, text content in the first text except for the first annotation symbol as content to be associated;
a first processing module 630, configured to, in a case that a document related to the to-be-associated content is detected from other documents of the user, set the to-be-associated content as a first content link in the target document;
wherein the document related to the content to be associated is: a document whose title contains the content to be associated, and the link content of the first content link is: a display interface of the document related to the content to be associated.
Optionally, the first processing module 630 is further configured to, in the target document, set the content to be associated as a second content link if a document related to the content to be associated is not detected from other documents of the user;
and the link content of the second content link is designated content, and the designated content is used for prompting that no document related to the content to be associated exists.
Optionally, in another embodiment of the present invention, after the content to be associated is set as the second content link, whenever the user creates a new document, if the content to be associated is the second content link, the first processing module 630 is further configured to detect whether the new document is a document related to the content to be associated;
and if so, replacing the link content of the second content link with the display interface of the new document.
Optionally, in another embodiment of the present invention, the detecting module 610 is further configured to:
detecting entity words in the target document;
and adding the first annotation symbol to the entity word.
Optionally, in another embodiment of the present invention, the first processing module is further configured to, after the content to be associated is set as the first content link in the target document, detect whether the content to be associated is included in the document contents of other documents of the user and the content to be associated is not set as the first content link, and if so, output first prompt information;
the first prompt information is used for prompting that a document to be processed exists or prompting a document identifier of the document to be processed; the document to be processed is a document which contains the content to be associated and the content to be associated is not set as a first content link.
Optionally, in another embodiment of the present invention, the detecting module 610 is further configured to detect whether a second text meeting a second format condition exists in the target document; the second format condition is that the symbol content of a second annotation symbol is used as the starting content and the ending content, and text content exists between the starting content and the ending content;
the device further comprises:
the second determining module is used for determining text contents in the second text except the second annotation symbol as contents to be replaced if the second text meeting the second format condition is detected to exist in the target document;
and the second processing module is used for replacing the content of the second text in the target document based on the content block when the content block containing the content to be replaced exists in the other document.
Optionally, in another embodiment of the present invention, the second processing module is specifically configured to:
displaying a selection list aiming at the content to be replaced in a display interface of the target document; wherein, each content block containing the content to be replaced is displayed in the selection list;
determining a target content block selected by the user through the selection list;
and replacing the second text in the display interface of the target document with the target content block.
Optionally, in another embodiment of the present invention, the selection list further includes a content input box;
the second processing module is further configured to, after the selection list for the content to be replaced is output, obtain the input content when it is detected that the user inputs content in the content input box, and replace the second text in the presentation interface of the target document with the input content.
Optionally, in another embodiment of the present invention, the second processing module is further configured to, when a predetermined operation is received for the target content block existing in the target document, output second prompt information corresponding to the target content block at a specified associated position of the target content block;
wherein the second prompt information is: content obtained by replacing the text content in the second text, other than the second annotation symbol, with the block ID of the target content block.
Based on the above method embodiments, an embodiment of the present invention further provides an electronic device, as shown in fig. 7, which includes a processor 701, a communication interface 702, a memory 703 and a communication bus 704, where the processor 701, the communication interface 702 and the memory 703 communicate with one another through the communication bus 704,
a memory 703 for storing a computer program;
the processor 701 is configured to implement the steps of any content association method provided in the embodiment of the present invention when executing the program stored in the memory 703.
The communication bus mentioned in the electronic device may be a Peripheral Component Interconnect (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like. The communication bus may be divided into an address bus, a data bus, a control bus, etc. For ease of illustration, only one thick line is shown, but this does not mean that there is only one bus or one type of bus.
The communication interface is used for communication between the electronic equipment and other equipment.
The Memory may include a Random Access Memory (RAM) or a Non-Volatile Memory (NVM), such as at least one disk Memory. Optionally, the memory may also be at least one memory device located remotely from the processor.
The Processor may be a general-purpose Processor, including a Central Processing Unit (CPU), a Network Processor (NP), and the like; it may also be a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other Programmable logic device, a discrete Gate or transistor logic device, or a discrete hardware component.
In a further embodiment of the present invention, a computer-readable storage medium is further provided, in which a computer program is stored, and the computer program, when executed by a processor, implements the steps of any of the above-mentioned content association methods.
In a further embodiment provided by the present invention, there is also provided a computer program product containing instructions which, when run on a computer, cause the computer to perform the steps of any of the above-described embodiments of the content association method.
The above embodiments may be implemented wholly or partially by software, hardware, firmware, or any combination thereof. When implemented in software, they may be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer instructions are loaded and executed on a computer, the processes or functions described in accordance with the embodiments of the invention are produced in whole or in part. The computer may be a general purpose computer, a special purpose computer, a network of computers, or other programmable device. The computer instructions may be stored in a computer-readable storage medium or transmitted from one computer-readable storage medium to another, for example, from one website, computer, server, or data center to another website, computer, server, or data center via wired (e.g., coaxial cable, optical fiber, Digital Subscriber Line (DSL)) or wireless (e.g., infrared, radio, microwave) means. The computer-readable storage medium can be any available medium that can be accessed by a computer, or a data storage device, such as a server or a data center, that integrates one or more available media. The usable medium may be a magnetic medium (e.g., floppy disk, hard disk, magnetic tape), an optical medium (e.g., DVD), or a semiconductor medium (e.g., Solid State Disk (SSD)), among others.
It is noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a ..." does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.
All the embodiments in the present specification are described in a related manner, and the same and similar parts among the embodiments may be referred to each other, and each embodiment focuses on the differences from the other embodiments. In particular, for the system embodiment, since it is substantially similar to the method embodiment, the description is simple, and for the relevant points, reference may be made to the partial description of the method embodiment.
The above description is only for the preferred embodiment of the present invention, and is not intended to limit the scope of the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention shall fall within the protection scope of the present invention.

Claims (17)

1. A method for associating content, the method comprising:
detecting whether a first text meeting a first format condition exists in a target document of a user; the first format condition is that the symbol content of a first annotation symbol is used as a starting content and an ending content, and a text content exists between the starting content and the ending content;
if yes, determining text contents in the first text except the first annotation symbol as contents to be associated;
in the case that a document related to the content to be associated is detected from other documents of the user, setting the content to be associated as a first content link in the target document;
wherein the document related to the content to be associated is: a document whose title contains the content to be associated, and the link content of the first content link is: a display interface of the document related to the content to be associated.
2. The method of claim 1, further comprising:
in the case that a document related to the content to be associated is not detected from other documents of the user, setting the content to be associated as a second content link in the target document;
and the link content of the second content link is designated content, and the designated content is used for prompting that no document related to the content to be associated exists.
3. The method according to claim 2, wherein after the content to be associated is set as the second content link, the method further comprises:
when the user creates a new document, if the content to be associated is a second content link, detecting whether the new document is a document related to the content to be associated;
and if so, replacing the link content of the second content link with the display interface of the new document.
4. The method according to any one of claims 1-3, further comprising:
detecting entity words in the target document;
and adding the first annotation symbol to the entity word.
5. The method according to any one of claims 1-3, wherein after the content to be associated is set as the first content link in the target document, the method further comprises:
detecting whether the content to be associated is contained in the document contents of other documents of the user and the content to be associated is not set as a first content link, and if so, outputting first prompt information;
the first prompt information is used for prompting that a document to be processed exists or prompting a document identifier of the document to be processed; the document to be processed is a document which contains the content to be associated and the content to be associated is not set as a first content link.
6. The method according to any one of claims 1-3, further comprising:
detecting whether a second text meeting a second format condition exists in the target document; the second format condition is that the symbol content of a second annotation symbol is used as the starting content and the ending content, and text content exists between the starting content and the ending content;
if yes, determining text contents in the second text except the second annotation symbol as contents to be replaced;
and under the condition that the other documents have content blocks containing the contents to be replaced, performing content replacement on the second text in the target document based on the content blocks.
7. The method of claim 6, wherein the content replacing the second text in the target document based on the content block comprises:
displaying a selection list aiming at the content to be replaced in a display interface of the target document; wherein, each content block containing the content to be replaced is displayed in the selection list;
determining a target content block selected by the user through the selection list;
and replacing the second text in the display interface of the target document with the target content block.
8. The method of claim 7, wherein the selection list further comprises a content input box;
after the outputting the selection list for the content to be replaced, the method further comprises:
when it is detected that the user inputs content in the content input box, obtaining the input content, and replacing the second text in the display interface of the target document with the input content.
9. The method of claim 7, wherein after replacing the second text in the presentation interface of the target document with the target content block, the method further comprises:
when a predetermined operation for the target content block existing in the target document is received, outputting second prompt information corresponding to the target content block at a specified associated position of the target content block;
wherein the second prompt information is: content obtained by replacing the text content in the second text, other than the second annotation symbol, with the block ID of the target content block.
10. A content association system, the system comprising: a document processing client and a predetermined server;
the document processing client is used for detecting whether a first text meeting a first format condition exists in a target document of a user; if the first text exists, determining the text content in the first text except the first annotation symbol as the content to be associated; sending a query request of the document related to the content to be associated to the predetermined server; wherein the first format condition is that the symbol content of the first annotation symbol is used as a starting content and an ending content, and a text content is arranged between the starting content and the ending content;
the predetermined server is used for detecting whether a document related to the content to be associated exists in other documents of the user;
the document processing client is further configured to set the content to be associated as a first content link in the target document when a document related to the content to be associated is detected from other documents of the user; wherein the document related to the content to be associated is: a document whose title contains the content to be associated, and the link content of the first content link is: a display interface of the document related to the content to be associated.
11. The system according to claim 10, wherein the document processing client is further configured to set the content to be associated as a second content link in the target document if a document related to the content to be associated is not detected from other documents of the user;
and the link content of the second content link is designated content, and the designated content is used for prompting that no document related to the content to be associated exists.
12. The system of claim 10 or 11, wherein the document processing client is further configured to: detecting whether a second text meeting a second format condition exists in the target document; if yes, determining text contents in the second text except the second annotation symbol as contents to be replaced; sending a query request of a content block containing the content to be replaced to the predetermined server; wherein the second format condition is that the symbol content of the second annotation symbol is used as a starting content and an ending content, and a text content is arranged between the starting content and the ending content;
the predetermined server is further configured to query whether a content block containing the content to be replaced exists in the other documents;
the document processing client is further configured to, when a content block including the content to be replaced exists in the other document, perform content replacement on the second text in the target document based on the content block.
13. An apparatus for associating content, the apparatus comprising:
the detection module is used for detecting whether a first text meeting a first format condition exists in a target document of a user; the first format condition is that the symbol content of a first annotation symbol is used as a starting content and an ending content, and a text content exists between the starting content and the ending content;
the first determining module is used for determining text contents in the first text except the first annotation symbol as to-be-associated contents if detecting that the first text meeting a first format condition exists in the target document of the user;
the first processing module is used for setting the content to be associated as a first content link in the target document under the condition that a document related to the content to be associated is detected from other documents of the user;
wherein the document related to the content to be associated is: a document whose title contains the content to be associated, and the link content of the first content link is: a display interface of the document related to the content to be associated.
14. The apparatus according to claim 13, wherein the first processing module is further configured to set the content to be associated as a second content link in the target document if a document related to the content to be associated is not detected from other documents of the user;
and the link content of the second content link is designated content, and the designated content is used for prompting that no document related to the content to be associated exists.
15. The apparatus according to claim 13 or 14, wherein the detecting module is further configured to detect whether a second text meeting a second format condition exists in the target document; the second format condition is that the symbol content of a second annotation symbol is used as the starting content and the ending content, and text content exists between the starting content and the ending content;
the device further comprises:
a second determining module, configured to determine, if it is detected that a second text meeting a second format condition exists in the target document, text content in the second text except for the second annotation symbol as content to be replaced;
and the second processing module is used for replacing the content of the second text in the target document based on the content block when the content block containing the content to be replaced exists in the other document.
16. An electronic device, characterized by comprising a processor, a communication interface, a memory and a communication bus, wherein the processor, the communication interface and the memory communicate with one another through the communication bus;
a memory for storing a computer program;
a processor for implementing the method steps of any of claims 1-9 when executing a program stored in the memory.
17. A computer-readable storage medium, characterized in that a computer program is stored in the computer-readable storage medium, which computer program, when being executed by a processor, carries out the method steps of any one of the claims 1-9.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110472686.4A CN113204579A (en) 2021-04-29 2021-04-29 Content association method, system, device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110472686.4A CN113204579A (en) 2021-04-29 2021-04-29 Content association method, system, device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN113204579A true CN113204579A (en) 2021-08-03

Family

ID=77027748

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110472686.4A Pending CN113204579A (en) 2021-04-29 2021-04-29 Content association method, system, device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN113204579A (en)

Patent Citations (46)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5761689A (en) * 1994-09-01 1998-06-02 Microsoft Corporation Autocorrecting text typed into a word processing document
JPH10124534A (en) * 1996-08-14 1998-05-15 N T T Data Tsushin Kk System and method for retrieving information
US20010018697A1 (en) * 2000-01-25 2001-08-30 Fuji Xerox Co., Ltd. Structured document processing system and structured document processing method
US20020055945A1 (en) * 2000-06-06 2002-05-09 Datatech Software, Inc. Method of document assembly
JP2002132755A (en) * 2000-10-20 2002-05-10 Fuji Xerox Co Ltd Document processing system
US7171619B1 (en) * 2001-07-05 2007-01-30 Sun Microsystems, Inc. Methods and apparatus for accessing document content
US20060200445A1 (en) * 2005-03-03 2006-09-07 Google, Inc. Providing history and transaction volume information of a content source to users
CA2605252A1 (en) * 2005-04-18 2006-10-26 Collage Analytics Llc System and method for efficiently tracking and dating content in very large dynamic document spaces
AU2006236418A1 (en) * 2005-04-18 2006-10-26 Collage Analytics Llc System and method for efficiently tracking and dating content in very large dynamic document spaces
GB0509900D0 (en) * 2005-05-14 2005-06-22 Hewlett Packard Development Co Document transfer between document editing software applications
CN101971172A (en) * 2005-08-29 2011-02-09 谷歌公司 Mobile sitemaps
CN101432733A (en) * 2006-03-13 2009-05-13 奥多比公司 Augmenting the contents of an electronic document with data retrieved from a search
CN101004737A (en) * 2007-01-24 2007-07-25 贵阳易特软件有限公司 Individualized document processing system based on keywords
US20090217159A1 (en) * 2008-02-22 2009-08-27 Jeffrey Matthew Dexter Systems and Methods of Performing a Text Replacement Within Multiple Documents
CN102124462A (en) * 2008-06-23 2011-07-13 谷歌公司 Query identification and association
JP2010170324A (en) * 2009-01-22 2010-08-05 Toshiba Corp Apparatus for supporting knowledge sharing, and method and program thereof
EP2354976A1 (en) * 2010-02-09 2011-08-10 ExB Asset Management GmbH Online analysis and display of correlated information
CN102822820A (en) * 2010-03-19 2012-12-12 微软公司 Indexing and searching employing virtual documents
CN102541901A (en) * 2010-12-26 2012-07-04 上海量明科技发展有限公司 Method and system for identifying and outputting information during document reading
CN102682040A (en) * 2011-03-16 2012-09-19 日电(中国)有限公司 Device and method for calculating importance of documents
US20130006979A1 (en) * 2011-06-29 2013-01-03 International Business Machines Corporation Enhancing cluster analysis using document metadata
CN102306186A (en) * 2011-08-29 2012-01-04 上海量明科技发展有限公司 Document traversing method and system
CN102999524A (en) * 2011-09-16 2013-03-27 中广核工程有限公司 Method and system for searching document association
US20130097481A1 (en) * 2011-10-13 2013-04-18 Microsoft Corporation Application of Comments in Multiple Application Functionality Content
CN103123566A (en) * 2011-11-21 2013-05-29 联想(北京)有限公司 Electronic device and text input method thereof
CN103415850A (en) * 2012-03-14 2013-11-27 株式会社东芝 Structured document management device, structured document search method
CN103914488A (en) * 2013-01-08 2014-07-09 邓寅生 Document collection, identification, association, search and display system
CN104077011A (en) * 2013-03-26 2014-10-01 北京三星通信技术研究有限公司 Method for associating documents in same type and terminal equipment
US9690785B1 (en) * 2014-01-30 2017-06-27 Google Inc. Change notification routing based on original authorship of modified region
CN107533563A (en) * 2015-05-29 2018-01-02 英特尔公司 Techniques for dynamic automated content discovery
CN105117397A (en) * 2015-06-18 2015-12-02 浙江大学 Method for searching semantic association of medical documents based on ontology
US20180067932A1 (en) * 2016-09-02 2018-03-08 FutureVault Inc. Real-time document filtering systems and methods
CA3035277A1 (en) * 2016-09-02 2018-03-08 FutureVault Inc. Real-time document filtering systems and methods
CN106682219A (en) * 2017-01-03 2017-05-17 腾讯科技(深圳)有限公司 Association document acquisition method and device
CN106909276A (en) * 2017-01-10 2017-06-30 网易(杭州)网络有限公司 Method and apparatus for realizing electronic reading content interaction
CN110298027A (en) * 2018-03-22 2019-10-01 卡西欧计算机株式会社 Display device, display system, display method and recording medium
US20190303448A1 (en) * 2018-03-30 2019-10-03 Vidy, Inc. Embedding media content items in text of electronic documents
CN110321469A (en) * 2018-03-30 2019-10-11 斯皮斯亚洲私人有限公司 Embedding media content items in the text of electronic documents
CN108519966A (en) * 2018-04-11 2018-09-11 掌阅科技股份有限公司 Method and computing device for replacing specific text elements of an e-book
CN110188178A (en) * 2019-05-30 2019-08-30 深圳龙图腾创新设计有限公司 Cross-document information lookup method and device, computer equipment and storage medium
CN110377558A (en) * 2019-06-14 2019-10-25 平安科技(深圳)有限公司 Document searching method, device, computer equipment and storage medium
CN112347324A (en) * 2019-08-08 2021-02-09 珠海金山办公软件有限公司 Document query method and device, electronic equipment and storage medium
CN112541330A (en) * 2019-09-20 2021-03-23 富士施乐株式会社 Information processing apparatus and recording medium
CN111078885A (en) * 2019-12-18 2020-04-28 腾讯科技(深圳)有限公司 Label classification method, related device, equipment and storage medium
CN112148889A (en) * 2020-09-23 2020-12-29 平安直通咨询有限公司上海分公司 Recommendation list generation method and device
CN112597274A (en) * 2020-12-18 2021-04-02 深圳市彬讯科技有限公司 Document determination method, device, equipment and storage medium based on BM25 algorithm

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
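姚原岗; 林兰芬; 董金祥: "Semantic retrieval method based on multi-dimensional association of heterogeneous engineering documents", 浙江大学学报(工学版), no. 02, 15 February 2011 (2011-02-15) *
李绪成: "Java Web Programming: A Basic Tutorial", 西安电子科技大学出版社, pages: 79 *
陈颖; 张学福; 姜世华; 陈静: "Comparative analysis of document-type information retrieval visualization systems", 情报杂志, no. 01, 18 January 2010 (2010-01-18), pages 79 *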

Similar Documents

Publication Publication Date Title
US10795939B2 (en) Query method and apparatus
US11769072B2 (en) Document structure extraction using machine learning
US8560567B2 (en) Automatic question and answer detection
CN102317936B (en) Identifying comments to show in connection with a document
US9424354B2 (en) Providing crowdsourced answers to information needs presented by search engine and social networking application users
US7617202B2 (en) Systems and methods that employ a distributional analysis on a query log to improve search results
KR101071789B1 (en) Method and system for linking sources to copied text
CN100380321C (en) Method and system used in making action relate to semantic marker in electronic file
US9582503B2 (en) Interactive addition of semantic concepts to a document
EP3584728B1 (en) Method and device for analyzing open-source license
CN110888990B (en) Text recommendation method, device, equipment and medium
US20150200893A1 (en) Document review system
US10585978B2 (en) Method and system for providing a summary of textual content
CN101571859B (en) Method and apparatus for labelling document
AU2018226399A1 (en) Detecting style breaches in multi-author content or collaborative writing
KR101962407B1 (en) System for Supporting Generation Electrical Approval Document using Artificial Intelligence and Method thereof
CN112132710B (en) Legal element processing method and device, electronic equipment and storage medium
EP2854047A1 (en) Automatic keyword tracking and association
US8370344B2 (en) Information processing apparatus, information processing method, information processing program and recording medium for determining an order of displaying search items
CN102662953B (en) Semantic tagging system and method integrated with an input method
US20190258666A1 (en) Resource accessibility services
KR102532216B1 (en) Method for establishing ESG database with structured ESG data using ESG auxiliary tool and ESG service providing system performing the same
CN113204579A (en) Content association method, system, device, electronic equipment and storage medium
CN112579937A (en) Character highlight display method and device
CN115640790A (en) Information processing method and device and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination