CN116822468A - Structured text information generation method, device and storage medium - Google Patents

Structured text information generation method, device and storage medium Download PDF

Info

Publication number
CN116822468A
CN116822468A CN202310106501.7A CN202310106501A CN116822468A CN 116822468 A CN116822468 A CN 116822468A CN 202310106501 A CN202310106501 A CN 202310106501A CN 116822468 A CN116822468 A CN 116822468A
Authority
CN
China
Prior art keywords
target
vocabulary
description information
description
document
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310106501.7A
Other languages
Chinese (zh)
Inventor
沈秋阳
陈凯伦
束方意
方斗寒
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Yaocheng Health Technology Co ltd
Qisheng Yaokang Information Technology Shanghai Co ltd
Original Assignee
Shanghai Yaocheng Health Technology Co ltd
Qisheng Yaokang Information Technology Shanghai Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Yaocheng Health Technology Co ltd, Qisheng Yaokang Information Technology Shanghai Co ltd filed Critical Shanghai Yaocheng Health Technology Co ltd
Priority to CN202310106501.7A priority Critical patent/CN116822468A/en
Publication of CN116822468A publication Critical patent/CN116822468A/en
Pending legal-status Critical Current

Links

Abstract

The application provides a method, equipment and a storage medium for generating text information based on structuring, which relate to the technical field of text processing and are used for improving the editing efficiency of creating explanatory texts for key words in documents. The method comprises the following steps: and responding to word description operation triggered by aiming at target words in the document to be processed, acquiring and displaying at least one piece of push description information associated with the target words, determining the push description words selected by a user from the displayed at least one piece of push description information as target description information corresponding to the target words, and updating word description text corresponding to the document to be processed according to the target description information corresponding to the target words.

Description

Structured text information generation method, device and storage medium
Technical Field
The present application relates to the field of text processing technologies, and in particular, to a method, an apparatus, and a storage medium for generating structured text information.
Background
In the technical field of text processing, in some specific fields/scenes, specialized words with high specificity and high specification requirements in specific fields are required to be used in documents, such as documents, abbreviations and clinical words in visit schedules of clinical test projects, legal words in legal fields, such as prosecution, original notice, notice and jurisdiction, are inconvenient for readers to understand, so that explanation of the specialized words in the documents is required, but in general, a large number of specialized words can appear in a document with strong relevance in one field, a document editor needs to edit detailed explanation information one by one aiming at the specialized words appearing in the document, the editing efficiency is low, and the accuracy of the explanation information is limited by the text description capability of the document editor.
Disclosure of Invention
The embodiment of the application provides a method, equipment and a storage medium for generating structured text information, which are used for at least improving the editing efficiency of creating description on key words in a document.
The first aspect of the present application provides a method for generating structured text information, including:
responding to word description operation triggered by aiming at a target word in a document to be processed, and acquiring at least one piece of push description information associated with the target word, wherein the word description operation is triggered by a user when the document to be processed is in an editing state, and the push description information is obtained by matching description information in a basic description information base;
displaying the at least one push description information in the document to be processed;
determining a push description word selected by the user from the at least one push description information as target description information corresponding to a target word, updating a word description text corresponding to the document to be processed according to the target description information corresponding to the target word, wherein the updated word description text comprises the target description information.
In the embodiment of the application, a basic description information base is pre-established, and the basic description information base comprises description information corresponding to a plurality of basic words, so that after a user triggers word description operation aiming at a target word in a document to be processed, the user directly pushes the pushed description information with high similarity to the target word to the user for the user to quickly select the description information corresponding to the target word, and further, the target description information can be directly updated into a word description text after the user selects the target description information corresponding to the target word; on one hand, in the process, a user only needs to execute the selection operation of pushing description information from pushing, and does not need to create complex description texts corresponding to target words, so that the time for creating the complex description texts for each target word by the user is saved, the time for creating word description texts for one or more target words in the whole document to be processed is obviously saved, and the creation efficiency is improved; on the other hand, in the process, the user only needs to select from the push description information, and does not need to carry out complex description text on the target vocabulary, so that the requirement on the text description capability of the user is reduced, and more users can quickly create vocabulary description texts with high descriptive performance.
In one possible implementation manner, before updating the vocabulary description text corresponding to the document to be processed according to the target description information selected by the user from the at least one push description information, the method further includes:
inquiring whether the vocabulary description text contains the target description information or not;
in response to the target description information not being contained in the vocabulary description text, adding the target description information into the vocabulary description text; or (b)
And in response to the target description information contained in the vocabulary description text, triggering a prompt message to inform the user that the target description information is contained in the vocabulary description text.
In one possible implementation manner, the displaying the at least one push description information in the document to be processed further includes:
displaying and creating a description control;
responding to the creation description operation triggered by the user through the creation description control, and displaying a description information editing page of the target vocabulary;
according to the description information indicated by the user through the description information editing page, determining target description information corresponding to the target vocabulary;
And updating the vocabulary description text corresponding to the document to be processed according to the target description information corresponding to the target vocabulary.
In one possible implementation manner, after determining the target description information corresponding to the target vocabulary according to the description information indicated by the user through the description information editing page, the method further includes:
and updating the target description information into the basic description information base.
In one possible implementation, the method further includes:
responding to a deleting operation triggered by a user aiming at the target vocabulary, and determining the total number of the target vocabulary contained in the document to be processed;
and if the total number is smaller than the preset number and the vocabulary description text contains the target description information corresponding to the target vocabulary, deleting or hiding the target description information in the vocabulary description text.
In one possible implementation manner, the displaying the at least one push description information in the document to be processed includes:
loading the at least one push description information display into a newly built image layer for display, wherein the newly built image is different from the image layer for displaying the document to be processed; and/or
And displaying the at least one push description information in a target area on a layer displaying the document to be processed, wherein the target area is different from the area displaying the document to be processed.
In one possible implementation manner, the push description information includes at least two, and the displaying the at least one push description information in the document to be processed includes:
displaying the push description information according to the sequence of the information similarity of the push description information from big to small, wherein:
and the information similarity is determined based on the similarity between the basic data corresponding to the push description information and the target vocabulary.
In one possible implementation manner, the obtaining at least one push description information associated with the target vocabulary includes:
based on the similarity between each basic word and the target word in the basic description information base, determining candidate words from each basic word;
and determining basic description information corresponding to each candidate word as the push description information.
In one possible implementation manner, the updating the vocabulary description text corresponding to the document to be processed according to the target description information corresponding to the target vocabulary includes:
After determining the target description information corresponding to the target vocabulary each time, updating the vocabulary description text in real time according to the target description information corresponding to the target vocabulary; or (b)
After the target description information corresponding to the target vocabulary is determined, and the description text updating operation is triggered, the vocabulary description text is updated based on the target description information corresponding to the target vocabulary.
In one possible implementation, the method further includes:
determining a structured tag of the target vocabulary according to target description information corresponding to the target vocabulary, wherein the structured tag points to the target description information corresponding to the target vocabulary;
and replacing the target vocabulary in the document to be processed with the structured tag to obtain document storage data corresponding to the document to be processed.
In one possible implementation manner, the presentation form of the structured tag includes at least one of < vocabulary identification information >, < first preset character= "vocabulary identification information"/>, < second preset character identification= "second preset character vocabulary identification information"/>, where the vocabulary identification information is used to point to the target description information.
In one possible implementation manner, after updating the vocabulary description text corresponding to the document to be processed according to the target description information corresponding to the target vocabulary, the method further includes:
And sequencing and displaying the description information based on the sequencing information of the vocabulary corresponding to the description information contained in the vocabulary description text.
In one possible implementation, the base description information repository includes at least one of:
a default description information base;
the user-defined information base related to the user;
and the project information base related to the document to be processed.
In one possible implementation, the target vocabulary includes any one or any combination of a thumbnail vocabulary, an otherwise known vocabulary, and a specific symbol.
In one possible implementation, the target vocabulary includes a thumbnail vocabulary, and the target description information includes one or any combination of the following information:
the abbreviated vocabulary is fully called under at least one language category;
the attribute description text of the thumbnail vocabulary;
the usage scenario description of the thumbnail vocabulary;
and the word category of the thumbnail word.
In one possible implementation, the target vocabulary includes a specialized vocabulary in a clinical trial domain, legal domain, medical domain, architectural domain, chemical domain, biological domain, physical domain.
In one possible implementation, the target vocabulary includes a professional vocabulary in the clinical trial field, the document to be processed is any sub-document of the target clinical trial project,
And the vocabulary description text associated with the document to be processed comprises description information items of target vocabularies in a plurality of sub-documents of the target clinical test item.
The second aspect of the present application provides a method and apparatus for generating structured text information, including:
the information pushing unit is used for responding to word description operation triggered by aiming at a target word in a document to be processed, acquiring at least one piece of pushing description information associated with the target word, wherein the word description operation is triggered by a user when the document to be processed is in an editing state, and the pushing description information is obtained by matching description information in a basic description information base;
the display unit is used for displaying the at least one push description information in the document to be processed;
and the description text updating unit is used for determining the push description vocabulary selected by the user from the at least one push description information as target description information corresponding to a target vocabulary, updating the vocabulary description text corresponding to the document to be processed according to the target description information corresponding to the target vocabulary, and the updated vocabulary description text comprises the target description information.
In a third aspect of the present application, there is provided a computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the structured-text-information generation method of the first aspect when executing the program.
In a fourth aspect of the present application, there is provided a computer readable storage medium storing computer instructions which, when run on a computer, cause the computer to perform the structured-text-information generation method of the first aspect.
Drawings
FIG. 1 is a diagram of a word explanatory text provided in an embodiment of the present application;
FIG. 2 is a flowchart of a method for generating structured text information according to an embodiment of the present application;
FIG. 3 is a diagram of an example page of a trigger word description operation according to an embodiment of the present application;
fig. 4 is an exemplary diagram of a presentation form of push description information according to an embodiment of the present application;
FIG. 5 is a diagram showing an example of a document to be processed and vocabulary description text provided by an embodiment of the present application;
FIG. 6 is a diagram of an example display page of a base presentation creation description control provided by an embodiment of the present application;
fig. 7 is a schematic diagram of a process for acquiring target description information according to an embodiment of the present application;
FIG. 8 is an exemplary diagram for enhancing and displaying a target vocabulary according to an embodiment of the present application;
FIG. 9 is a diagram illustrating a specific example of structured document information generation according to an embodiment of the present application;
Fig. 10 is a diagram illustrating a structural example of a device for generating structured text information according to an embodiment of the present application;
fig. 11 is a block diagram of a computer device according to an embodiment of the present application.
Detailed Description
In order to better understand the technical solution provided by the embodiments of the present application, the following detailed description will be given with reference to the accompanying drawings and the specific embodiments of the present application;
the terms "first", "second" in embodiments of the present application are used for descriptive purposes only and are not to be construed as implying or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defining "a first" or "a second" may explicitly or implicitly include one or more such feature, and in the description of embodiments of the application, unless otherwise indicated, the meaning of "at least one" is two or more.
In order to facilitate the technical solution of the present application to be better understood by those skilled in the art, the following description of technical terms related to the present application is provided.
Document to be processed: the document/text that the user is currently editing.
The user: the user in the embodiment of the application refers to a document editor generally, and can be a user or an account.
In order to more clearly understand the design concept of the present application, the method, the device and the storage medium for generating the structured text information provided by the embodiments of the present application are described in detail below.
The method for generating the structured text information provided by the embodiment of the application can be applied to a document editing tool or a document editing platform or a document editing system, and is hereinafter described by taking the application of the method in the document editing tool as an example; after a user opens a document to be processed through a document editing tool and enables the document to be processed to enter an editing state, the user can select any vocabulary in the document to be processed as target vocabulary and trigger word description operation aiming at the target vocabulary; and the document editing tool is used for matching one or more pieces of description information with high similarity from a pre-constructed basic description information base to be push description information and displaying the push description information to a user, and after the user selects one piece of target description information corresponding to a target word from the displayed push description information, the document editing tool can automatically update the target description information into word description texts corresponding to the document to be processed so as to realize the operation of efficiently explaining the target word in the document, wherein the word description operation characterizes that the specific explanation/description text of the target word needs to be added into the document to be processed so as to explain one or more pieces of information of definition, type, substantial content, use environment, method and the like of the target word.
As an embodiment, the target vocabulary in the embodiment of the present application characterizes a vocabulary which is not understood by a person having ordinary skill in the art, and the target vocabulary includes professional vocabulary in clinical test fields, legal fields, medical fields, construction fields, chemical fields, biological fields, and physical fields; for example, the target vocabulary may be a person to be reported, a jurisdiction, a complaint, etc. in the legal field, or may be a case report form, breast cancer, hemoglobin, etc. in the clinical test field.
As an embodiment, the target vocabulary in the embodiment of the present application may be any one or any combination of a thumbnail vocabulary, an otherwise known vocabulary, and a symbol, where the thumbnail vocabulary may be a thumbnail vocabulary under a specific language category, such as a chinese thumbnail vocabulary, an english thumbnail vocabulary, and the like, and the target vocabulary may be an abbreviated vocabulary in a clinical test field, such as AE, ICF, ECG, EOT, but not limited to, where AE is an adverse event (advertisement), ICF is an informed consent form (infomedconsentform), ECG is an electrocardiogram (electric computer), and EOT is an end of treatment (endof treatment). For example, the target vocabulary may also be a generic vocabulary of the sub-group vocabulary.
As an embodiment, when the target vocabulary is a thumbnail vocabulary in the embodiment of the present application, the target description information corresponding to the target vocabulary may, but is not limited to, include any combination of one or more of the following information items: 1) The word full scale of the abbreviated word under at least one language class; 2) Attribute description text of the thumbnail vocabulary; 3) A use scenario description of the thumbnail vocabulary; 4) The vocabulary category of the thumbnail vocabulary.
Specifically, when the target vocabulary is an english abbreviation, the target description information may include, but is not limited to, the english abbreviation, a chinese full name corresponding to the english abbreviation, an english full name, etc., for ease of understanding, please refer to fig. 1, a specific example 100 of a word description text is given, where each line is target description information/description information corresponding to one target vocabulary (i.e., the english abbreviation in the example), and 110 is target description information of the target vocabulary ICF in the figure.
Further, in the embodiment of the present application, when the target vocabulary is a professional vocabulary in the clinical test field and the document to be processed is any sub-document of the target clinical test item, the description information items of the target vocabulary in the plurality of sub-documents of the target clinical test item are included in the vocabulary description text associated with the document to be processed, that is, the target description information of the target vocabulary in the plurality of sub-documents of the target clinical test item may be collected into the same word description text, e.g., AE and ICF shown in fig. 1 may be the vocabulary in the first sub-document in the target clinical test, and ECG and EOT may be the vocabulary in the second sub-document in the target clinical test.
Furthermore, the multiple sub-documents in the same target clinical test item can be edited by the same user, or can be edited by multiple different users, and can be set by those skilled in the art according to actual requirements.
As one embodiment, the basic description information base comprises one or any combination of a default description information base, a user-defined information base associated with a user and a project information base associated with a document to be processed; the default description information base can be a database formed by description information corresponding to a plurality of words defaulted by the document editing tool; the user-defined information base can be a database composed of descriptive information corresponding to one or more vocabularies created by the user, and can be called by the user; if the document to be processed is associated under a target item, the basic description information base can be a database formed by description information corresponding to one or more vocabularies in the target item, and the like.
The following describes the method for generating structured text information according to the embodiment of the present application in detail, please refer to fig. 2, which specifically includes the following steps:
step S210, when the document to be processed is in an editing state, the user triggers word description operation aiming at a target word in the document to be processed.
As an embodiment, the specific triggering mode of the word description operation in the embodiment of the present application is not limited, and a person skilled in the art may set the specific triggering mode according to actual needs, for example, may click the mouse to trigger the word description operation multiple times after selecting/scanning the target vocabulary with a mouse/cursor, or may right click the mouse to trigger the word description operation after selecting/scanning the target vocabulary with a mouse, or may stop the first time length to trigger the word description operation after selecting the target vocabulary with a mouse, where the first time length is not limited in the embodiment of the present application, and may set the first time length according to actual needs, for example, but not limited to, 2 seconds, 3 seconds, or 5 seconds.
As an embodiment, please refer to fig. 3, a page diagram 300 is provided for triggering word description operation, that is, a user may select a target word 310 with a mouse/cursor, etc., so that the document editing tool displays a function option bar 320 after the target word is selected, and displays a word description control 321 in the function option bar 320; the user may trigger a word description operation for the target word by selecting word description control 321.
With continued reference to fig. 3, in an embodiment of the present application, other operation options for the target vocabulary may also be presented in the function options bar 320, such as, but not limited to, one or any combination of enhancement display options 322 (e.g., a thickening option for thickening the target vocabulary, a font color option for changing the color of the target vocabulary, etc.), endorsement options 323, add attachment options 324, etc.
In step S220, the document editing tool responds to the word description operation triggered for the target word in the document to be processed, and obtains at least one push description information associated with the target word, where the push description information is obtained by matching the description information in the basic description information base.
As an embodiment, in step S220, the document editing tool may determine candidate words from the base words based on the similarity between the base words and the target words in the base description information base; and determining the basic description information corresponding to each candidate word as the push description information.
Further, the character similarity or the content similarity of each basic vocabulary and each target vocabulary may be determined as the similarity of each basic vocabulary and each target vocabulary, and K1 basic vocabularies are selected as candidate vocabularies according to the order of the similarity from large to small, where K1 is a positive integer, and the specific numerical value of K1 is not limited, for example, K1 may be set to 2, 3, 5, 6, 8 or 10, etc.
Further, after determining the similarity between each basic vocabulary and the target vocabulary, K2 basic vocabularies can be selected from the basic vocabularies corresponding to the similarity greater than the similarity threshold as candidate vocabularies; the similarity threshold is not limited, for example, when the similarity is greater than or equal to 0 and less than or equal to 1, the similarity threshold may be set to 80%, 90% or 1, and the setting of K2 may be set to reference to the setting of K1, which is not repeated here.
In the foregoing case, if there is no basic vocabulary with a similarity greater than the similarity threshold in the basic description information base, no push description information is presented to the user.
As an embodiment, in step S220, the foregoing push description information may be further displayed in order of from greater to lesser information similarity of the push description information, where: and the information similarity is the similarity of the basic data and the target vocabulary corresponding to the push description information.
Step S230, the document editing tool displays the at least one push description information in the document to be processed.
As an embodiment, in step S230, the at least one push description information presentation may be loaded into a new layer for presentation, where the new layer is different from the layer for presenting the document to be processed; referring to fig. 4 (a), the document to be processed is shown in a layer 410, and push description information 1, push description information 2, etc. associated with the target vocabulary 310 are shown in a newly created layer 420.
As an embodiment, in step S230, the at least one push description information may be displayed in a target area on the layer for displaying the document to be processed, where the target area is different from the area for displaying the document to be processed; referring to fig. 4 (b), the document to be processed is shown in a region 431 in the layer 430, and the push description information 3, the push description information 4, and the like associated with the target vocabulary 310 are shown in a target region 432 in the layer 430.
Step S240, the user selects one push description information from the displayed at least one push description information.
In step S250, the document editing tool determines the push description vocabulary selected by the user as the target description information corresponding to the target vocabulary, updates the vocabulary description text corresponding to the document to be processed according to the target description information corresponding to the target vocabulary, and the updated vocabulary description text includes the target description information.
As an embodiment, in step S250, the document editing tool may directly add the target description information corresponding to the target vocabulary to the vocabulary description text, and in the embodiment of the present application, the display mode and the display area of the vocabulary description text are not excessively limited, and those skilled in the art may set according to actual requirements; for ease of understanding, referring to FIG. 5, an exemplary diagram of a display of a document to be processed and lexical descriptive text is presented, which may be presented in region 510 and lexical descriptive text in either region 520 or region 530.
As an embodiment, before step S250, it may further be queried whether the aforementioned target description information is already included in the aforementioned vocabulary description text; if the target description information is not contained in the vocabulary description text currently, the document editing tool can add the target description information into the vocabulary description text in response to the fact that the target description information is not contained in the vocabulary description text; if the target description information is currently included in the vocabulary description text, the document editing tool may trigger a prompt message in response to the target description information being included in the vocabulary description text, and notify the user that the target description information is included in the vocabulary description text through the prompt message, where the specific form and content of the prompt message are not limited in the embodiment of the present application, and may be set by those skilled in the art based on actual requirements.
As an embodiment, at least one push description information is displayed in step 230, and a creation description control may be displayed, so that a user may trigger a creation description operation through the creation description control, and the document editing tool responds to the creation description operation; the user can input the description information of the target vocabulary in the description information editing page, so that the document editing tool can determine the target description information corresponding to the target vocabulary according to the description information indicated by the user through the description information editing page, and update the target description information corresponding to the target vocabulary into the vocabulary description text.
For ease of understanding, please refer to fig. 6, a display page showing a creation description control 610 is provided, which may be shown in fig. 6 (a) in combination with at least one presentation of push description information, in which the push description information 1 associated with the target vocabulary 310 and the creation description control 610 described above are shown; referring to fig. 6 (b), in this case, it is indicated that the document editing tool does not obtain push description information matching the target vocabulary 310 from the base description information library, and thus only the creation description control 610 is presented to the user.
Referring to fig. 7, a schematic process diagram of obtaining target description information is provided, specifically, a user selects a create description control 610 through a cursor 710 to trigger a create description operation for a target vocabulary, a document editing tool displays a description information editing page 700 of the target vocabulary, so that the user can input description text information of each vocabulary description item in an editing box (e.g. 720 in a reference diagram) corresponding to each vocabulary description item in the description information editing page 700, and can perform functional operation on the description text information input in the corresponding editing box through each function control in a text function column (e.g. 730 in the reference diagram) corresponding to each vocabulary description item.
As an embodiment, the specific content, type and number of the vocabulary description items of the target vocabulary are not limited in the embodiment of the present application, and may be set by those skilled in the art according to actual needs, for example, the vocabulary description items may include, but not limited to, information such as names of the target vocabulary, full names under a preset language type, and the like, which may be set corresponding to information types included in the description information in this document.
As an embodiment, after the target description information corresponding to the target vocabulary is determined, the target description information may be updated into the basic description information base, so that after the user triggers the description operation for the target vocabulary, the target description information may be directly displayed to the user as push description information.
As an embodiment, the user can trigger the deleting operation aiming at the target vocabulary, so that the document editing tool updates the word description text based on the specific condition of whether the target vocabulary exists in the document to be processed after the user deletes the target vocabulary, and redundant information in the word description text is reduced; specifically, after triggering a deletion operation for a target vocabulary by a user, the document editing tool can respond to the deletion operation to determine the total number of the target vocabulary contained in the document to be processed after deleting the target vocabulary; if the total number is smaller than the preset number and the vocabulary description text contains the target description information corresponding to the target vocabulary, deleting or hiding the target description information in the vocabulary description text; the preset number may be, but is not limited to, 1, that is, after the user deletes the currently selected target vocabulary through the deleting operation, the text editing tool automatically checks whether other positions in the whole document to be processed still have the target vocabulary, if the other positions in the whole document still have the target vocabulary, the word description text is not processed, and if the other positions in the whole document do not have the target vocabulary, the target description information of the target vocabulary in the word description text is deleted.
As an embodiment, the word description text corresponding to the document to be processed in the embodiment of the application can be updated in real time, that is, when a user selects to push description information as target description information or newly creates target description information corresponding to target words for one target word, the document editing tool updates the target description information into the word description text in real time, and when the user deletes one target word in the document to be processed to cause that the target word does not exist in the whole document to be processed and the target description information corresponding to the target word is recorded in the word description text, the document editing tool also deletes the target description information from the word description text in real time; that is, in step S250, the document editing tool may update the vocabulary description text according to the target description information corresponding to the target vocabulary in real time after determining the target description information corresponding to the target vocabulary each time.
In the foregoing step S250, as an embodiment, the word description text corresponding to the document to be processed in the embodiment of the present application may be updated once after the user triggers the description text update operation, that is, after the user selects the recommended description information for one or more target words as the target description information or newly creates the target description information corresponding to one or more target words, when the user triggers the description text update operation, the document editing tool updates the target description information corresponding to one or more target words into the word description text once, or the user deletes one or more target words in the document to be processed, so that the one or more target words do not exist in the whole document to be processed, and after the user triggers the description text update operation, the document dialectical tool queries whether the target description information corresponding to one or more target words is recorded in the word description text, and if the target description information corresponding to one or more target words is recorded, the target description information of one or more target words is/are deleted once; in step S250, after determining the target description information corresponding to the target vocabulary, the document editing tool triggers a description text updating operation, and updates the vocabulary description text based on the target description information corresponding to the target vocabulary; the specific manner of triggering the text update operation is not limited, and those skilled in the art can set the operation according to actual requirements.
As an embodiment, in the embodiment of the present application, while or after updating the target description information of the target vocabulary to the vocabulary description text corresponding to the document to be processed, the target vocabulary may be further enhanced displayed in the document to be processed to notify the user that the description information of the target vocabulary has been updated to the vocabulary description text, where the specific manner of the foregoing enhancement display is not limited, and those skilled in the art may set up according to actual needs, for example, one or more of thickening, changing fonts, changing colors, using specific character marks, using specific identification graphics to perform identification, and the like may be combined to perform enhancement display, and please refer to fig. 8, which gives an illustration of enhancement display of the target vocabulary "AAA", "ABA" and "BBB" in the form of a combination of specific character marks (Abbr) and specific identification images (white squares).
As an embodiment, after the user selects the recommended description information as the target description information corresponding to the target vocabulary or the target description information corresponding to the newly built target vocabulary, the document editing tool can store the document to be processed after adding the description information (i.e. the target description information or the target description information) of the target vocabulary through the structured data; specifically, after step S230 and step S240, the document editing tool determines a structured tag of the target vocabulary according to the target description information corresponding to the target vocabulary, and replaces the target vocabulary in the document to be processed with the structured tag to obtain document storage data corresponding to the document to be processed; the structured tag points to the target description information corresponding to the target vocabulary, and the target description information can be stored in structured data.
As an embodiment, the presentation form of the foregoing structured tag may be, but is not limited to, at least one of < vocabulary identification information >, < first preset character= "vocabulary identification information"/>, < second preset character identification= "second preset character vocabulary identification information"/>, where the vocabulary identification information is used to point to the target description information; the vocabulary identification information can be a unique identification number (id) or a unique identification character string of the vocabulary; the first preset character may be a character characterizing the type of the vocabulary identification information or a character characterizing the word description operation, for example, the first preset character may be, but is not limited to, ID or Abbr, etc.; the second preset character may be a character characterizing the type of the vocabulary identification information or a character characterizing the word description operation, and the identification in < second preset character identification= "second preset character vocabulary identification information"/> "may be, but is not limited to, the type of the vocabulary identification information, etc., as the second preset character may be, but is not limited to," Abbr "described above and is identified as id, etc.
To facilitate the understanding OF the foregoing structural labels by those skilled in the art, a specific representation OF < second preset character identification= "second preset character vocabulary identification information"/>, where a section represents the foregoing second preset character, a type representation identification, and a table_of_abbriatia representation OF target description information pointing to a target vocabulary, is given here as a visual example OF < section type= "table_of_abbriatia"/>;
Referring to fig. 9, a specific example of structured text information generation is provided, in which a target vocabulary is an abbreviation in the clinical trial field, word description operation is abbreviated as word description, in which (a) in the figure is a specific display form of a document to be processed after word description operation marking is performed on the target vocabulary (abbreviation), in the figure (b) is word description text corresponding to the document to be processed, and (c) in the figure is a description schematic of document storage data corresponding to the document to be processed; the specific interaction procedure in this example is as follows:
stage one: determining target words to be marked, clicking a function button of an abbreviation word
User operation: selecting ECG (target vocabulary) in the document to be processed, clicking an abbreviation function button (i.e. the vocabulary description control 221 described above);
the technology is realized: the document editing tool performs intelligent matching according to the description information corresponding to each vocabulary of the ECG in the basic description information base, and returns candidate recommendation description information (namely abbreviation items) for the user to select.
Stage two: selecting recommendation description information as target description information corresponding to target vocabulary
User operation: selecting one piece of recommended descriptive information as target descriptive information (namely an abbreviated entry) of the ECG;
The technology is realized:
1) Copying target description information selected by a user into an abbreviation component data part (namely component data corresponding to the word description text) of a document to be processed by a document editing tool;
2) The document editing tool replaces the text "ECG" of the text data portion of the document to be processed with an abbreviation component reference tag (i.e., structured tag) < abgrid= "v03"/>; the < abbrid= "v03"/> indicates a reference to a target vocabulary/abbreviation with an id of "v03", when the document editing tool displays the specific content of the document to be processed, the stored text part is processed, and the target description information corresponding to the abbreviation pointed by the tag reference is displayed to the user.
3) The document editing tool renders an abbreviation component style at a display layer of the document to be processed, and displays an abbreviation "ECG" corresponding to < abbr id= "v03"/>, in the abbreviation component.
Stage three: automatic ordering of all abbreviations in an abbreviation vocabulary (i.e., word specification text)
No user operation is needed in the step; after the document editing tool detects the newly added abbreviations (target description information corresponding to target vocabulary), the full-text content of the document to be processed is automatically scanned, the structured tags of the < abbr/>, the abbreviation entries (target description information) corresponding to the found structured tags are read, dictionary sequence ordering is carried out according to the abbreviations, and the abbreviations are rendered into abbreviation tables and displayed to users.
Referring to fig. 10, based on the same inventive concept, an embodiment of the present application provides a structured document generating apparatus 1000, including:
an information pushing unit 1100, configured to obtain, in response to a word description operation triggered for a target word in a document to be processed, at least one push description information associated with the target word, where the word description operation is triggered when the document to be processed is in an editing state by a user, and the push description information is obtained by matching description information in a basic description information base;
a display unit 1200, configured to display the at least one push description information in the document to be processed;
and a description text updating unit 1300, configured to determine a push description vocabulary selected by the user from the at least one push description information as target description information corresponding to a target vocabulary, update a vocabulary description text corresponding to the document to be processed according to the target description information corresponding to the target vocabulary, where the updated vocabulary description text includes the target description information.
The information pushing unit 1100, the display unit 1200, and the description text updating unit 1300 may refer to the foregoing detailed description of the structured text information generation method, and the description thereof will not be repeated here.
The structured document generation apparatus 1000 is as an example of a hardware entity a computer device as shown in fig. 11 comprising a processor 1101, a storage medium 1102, and at least one external communication interface 1103; the processor 1101, the storage medium 1102, and the external communication interface 1103 are all connected by a bus 1104.
The storage medium 1102 has stored therein a computer program;
the processor 1101, when executing the computer program, implements the position fingerprint positioning method discussed previously.
One processor 1101 is exemplified in fig. 11, but the number of processors 1101 is not limited in practice.
Wherein the storage medium 1102 may be a volatile storage medium (RAM), such as a random-access storage medium (RAM); the storage medium 1102 may also be a non-volatile storage medium (non-volatile memory), such as a read-only storage medium, a flash memory medium (flash memory), a hard disk (HDD) or a Solid State Drive (SSD), or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a computer, but is not limited thereto. The storage medium 1102 may be a combination of the above storage media.
Based on the same technical idea, an embodiment of the present application also provides a computer-readable storage medium storing computer instructions that, when executed on a computer, cause the computer to perform the method as previously discussed.
It will be appreciated by those skilled in the art that embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
It will be apparent to those skilled in the art that various modifications and variations can be made to the present application without departing from the spirit or scope of the application. Thus, it is intended that the present application also include such modifications and alterations insofar as they come within the scope of the appended claims or the equivalents thereof.

Claims (19)

1. A structured document information generation method, comprising:
Responding to word description operation triggered by aiming at a target word in a document to be processed, and acquiring at least one piece of push description information associated with the target word, wherein the word description operation is triggered by a user when the document to be processed is in an editing state, and the push description information is obtained by matching description information in a basic description information base;
displaying the at least one push description information in the document to be processed;
determining a push description word selected by the user from the at least one push description information as target description information corresponding to a target word, updating a word description text corresponding to the document to be processed according to the target description information corresponding to the target word, wherein the updated word description text comprises the target description information.
2. The method for generating structured text information according to claim 1, wherein before updating the vocabulary description text corresponding to the document to be processed according to the target description information selected by the user from the at least one push description information, the method further comprises:
inquiring whether the vocabulary description text contains the target description information or not;
In response to the target description information not being contained in the vocabulary description text, adding the target description information into the vocabulary description text; or (b)
And in response to the target description information contained in the vocabulary description text, triggering a prompt message to inform the user that the target description information is contained in the vocabulary description text.
3. The method for generating structured text information according to claim 1, wherein the displaying the at least one push description information in the document to be processed further comprises:
displaying and creating a description control;
responding to the creation description operation triggered by the user through the creation description control, and displaying a description information editing page of the target vocabulary;
according to the description information indicated by the user through the description information editing page, determining target description information corresponding to the target vocabulary;
and updating the vocabulary description text corresponding to the document to be processed according to the target description information corresponding to the target vocabulary.
4. The method for generating structured text information according to claim 3, wherein after determining the target description information corresponding to the target vocabulary according to the description information indicated by the user through the description information editing page, the method further comprises:
And updating the target description information into the basic description information base.
5. The structured document information generation method according to claim 1, further comprising:
responding to a deleting operation triggered by a user aiming at the target vocabulary, and determining the total number of the target vocabulary contained in the document to be processed;
and if the total number is smaller than the preset number and the vocabulary description text contains the target description information corresponding to the target vocabulary, deleting or hiding the target description information in the vocabulary description text.
6. The method for generating structured text information according to claim 1, wherein the presenting the at least one push description information in the document to be processed comprises:
loading the at least one push description information display into a newly built image layer for display, wherein the newly built image is different from the image layer for displaying the document to be processed; and/or
And displaying the at least one push description information in a target area on a layer displaying the document to be processed, wherein the target area is different from the area displaying the document to be processed.
7. The method for generating structured text information according to claim 1, wherein the push description information includes at least two push description information, and the displaying the at least one push description information in the document to be processed includes:
Displaying the push description information according to the sequence of the information similarity of the push description information from big to small, wherein:
and the information similarity is determined based on the similarity between the basic data corresponding to the push description information and the target vocabulary.
8. The method for generating structured text information according to claim 1, wherein the obtaining at least one push description information associated with the target vocabulary includes:
based on the similarity between each basic word and the target word in the basic description information base, determining candidate words from each basic word;
and determining basic description information corresponding to each candidate word as the push description information.
9. The method for generating structured text information according to claim 1, wherein the updating the vocabulary description text corresponding to the document to be processed according to the target description information corresponding to the target vocabulary comprises:
after determining the target description information corresponding to the target vocabulary each time, updating the vocabulary description text in real time according to the target description information corresponding to the target vocabulary; or (b)
After the target description information corresponding to the target vocabulary is determined, and the description text updating operation is triggered, the vocabulary description text is updated based on the target description information corresponding to the target vocabulary.
10. A structured document editing method according to claim 1, further comprising:
determining a structured tag of the target vocabulary according to target description information corresponding to the target vocabulary, wherein the structured tag points to the target description information corresponding to the target vocabulary;
and replacing the target vocabulary in the document to be processed with the structured tag to obtain document storage data corresponding to the document to be processed.
11. The structured document editing method according to claim 10, wherein the presentation form of the structured tag includes at least one of < vocabulary identification information >, < first preset character= "vocabulary identification information"/>, < second preset character identification= "second preset character vocabulary identification information"/>, the vocabulary identification information being used to point to the target description information.
12. The method for editing structured text according to claim 1, wherein after updating the vocabulary description text corresponding to the document to be processed according to the target description information corresponding to the target vocabulary, the method further comprises:
and sequencing and displaying the description information based on the sequencing information of the vocabulary corresponding to the description information contained in the vocabulary description text.
13. A structured document information generation method according to any one of claims 1 to 12 wherein said base description information store comprises at least one of:
a default description information base;
the user-defined information base related to the user;
and the project information base related to the document to be processed.
14. A structured document information generation method according to any one of claims 1 to 12 wherein the target vocabulary comprises any one or any combination of a thumbnail vocabulary, a term vocabulary, a specific symbol.
15. A method of generating structured document information according to any one of claims 1 to 12 wherein the target vocabulary comprises a thumbnail vocabulary and the target description information comprises one or any combination of:
the abbreviated vocabulary is fully called under at least one language category;
the attribute description text of the thumbnail vocabulary;
the usage scenario description of the thumbnail vocabulary;
and the word category of the thumbnail word.
16. The structured document information generation method of any one of claims 1 to 12 wherein the target vocabulary comprises a specialized vocabulary in a clinical trial domain, legal domain, medical domain, architectural domain, chemical domain, biological domain, physical domain.
17. The method for generating structured document information of claim 16, wherein said target vocabulary comprises a specialized vocabulary in the field of clinical trials, said document to be processed is any sub-document of the target clinical trial project,
and the vocabulary description text associated with the document to be processed comprises description information items of target vocabularies in a plurality of sub-documents of the target clinical test item.
18. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the steps of the method of any of claims 1-17 when the program is executed.
19. A computer storage medium having stored thereon a computer program, which when executed by a processor performs the steps of the method according to any of claims 1-17.
CN202310106501.7A 2023-02-13 2023-02-13 Structured text information generation method, device and storage medium Pending CN116822468A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310106501.7A CN116822468A (en) 2023-02-13 2023-02-13 Structured text information generation method, device and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310106501.7A CN116822468A (en) 2023-02-13 2023-02-13 Structured text information generation method, device and storage medium

Publications (1)

Publication Number Publication Date
CN116822468A true CN116822468A (en) 2023-09-29

Family

ID=88113440

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310106501.7A Pending CN116822468A (en) 2023-02-13 2023-02-13 Structured text information generation method, device and storage medium

Country Status (1)

Country Link
CN (1) CN116822468A (en)

Similar Documents

Publication Publication Date Title
US7802305B1 (en) Methods and apparatus for automated redaction of content in a document
US9411788B2 (en) Methods and apparatus for improved navigation among controlled terms in one or more user documents
JP4401292B2 (en) Glyphlet
JP5117685B2 (en) System and method for semantically zooming information
US11361035B2 (en) Batch generation of links to documents based on document name and page content matching
JPH077408B2 (en) Method and system for changing emphasis characteristics
US9507773B2 (en) Translation assistance device, translation assistance system, and control method for the same
US20040243403A1 (en) Document relationship inspection apparatus, translation process apparatus, document relationship inspection method, translation process method, and document relationship inspection program
JP2010257392A (en) Device and method for inputting character, computer readable program, and recording medium
JP2005182460A (en) Information processor, annotation processing method, information processing program, and recording medium having information processing program stored therein
JP2005107931A (en) Image search apparatus
JP2008234078A (en) Information processor, information processing method, information processing program, and recording medium in which information processing program is recorded
CN116822468A (en) Structured text information generation method, device and storage medium
JP2009093581A (en) Control system for synonym search
JP7340952B2 (en) Template search system and template search method
JP3933407B2 (en) Document processing apparatus, document processing method, and storage medium storing document processing program
JP2004157965A (en) Search support device and method, program and recording medium
JP4149940B2 (en) Document processing apparatus, document processing method, and document processing program
JP4521413B2 (en) Database management system and program
CN115906779A (en) Structured text editing method, device and storage medium
JP2007025831A (en) Content retrieval apparatus and its method
JP2006301809A (en) Data processing system
JP2008233952A (en) Document preparation support device and document preparation support program
JP5742454B2 (en) Input support program, input support apparatus, and input support method
EP2503472A1 (en) Method and device for providing a dataset representative of a glossary

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination