CN115438632A - Text data processing method and device, electronic equipment and readable storage medium - Google Patents

Info

Publication number
CN115438632A
Authority
CN
China
Prior art keywords
text
text data
tag
label
candidate
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211133993.0A
Other languages
Chinese (zh)
Inventor
陈攀
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Construction Bank Corp
CCB Finetech Co Ltd
Original Assignee
China Construction Bank Corp
CCB Finetech Co Ltd
Application filed by China Construction Bank Corp and CCB Finetech Co Ltd
Priority to CN202211133993.0A
Publication of CN115438632A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/166 Editing, e.g. inserting or deleting
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/80 Information retrieval; Database structures therefor; File system structures therefor of semi-structured data, e.g. markup language structured data such as SGML, XML or HTML
    • G06F16/81 Indexing, e.g. XML tags; Data structures therefor; Storage structures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/80 Information retrieval; Database structures therefor; File system structures therefor of semi-structured data, e.g. markup language structured data such as SGML, XML or HTML
    • G06F16/83 Querying
    • G06F16/835 Query processing
    • G06F16/8373 Query execution

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The disclosure provides a text data processing method and device, electronic equipment and a readable storage medium, which can be applied to the technical field of computers and the technical field of finance. The text data processing method comprises the following steps: in response to receiving a text data processing request, determining a multi-text markup language tag according to text data to be processed in the text data processing request, wherein the multi-text markup language tag comprises a text block group tag, the text block group tag comprises at least one text block tag, and each text block tag in the at least one text block tag comprises a text data tag to be quoted and a rule tag; determining candidate text data respectively corresponding to each text block tag according to the at least one text block tag; processing the candidate text data according to the preset rule corresponding to each text block tag to obtain text data to be quoted; and determining the processed text data according to the text data to be processed and the text data to be quoted corresponding to each text block tag.

Description

Text data processing method and device, electronic equipment and readable storage medium
Technical Field
The present disclosure relates to the field of computer technologies and the field of financial technologies, and more particularly, to a text data processing method and apparatus, an electronic device, a computer-readable storage medium, and a computer program product.
Background
With the development of computer technology, computer files are generated during the use of a computer by a user, and the computer files can comprise text files and binary files. A text file may refer to a computer file made up of several lines of characters, a text file typically existing in a computer file system.
In implementing the disclosed concept, the inventors found that there are at least the following problems in the related art: because synchronous updating of the reference content of the text file cannot be realized, the efficiency and the accuracy of processing the text file cannot be guaranteed.
Disclosure of Invention
In view of the above, the present disclosure provides a text data processing method and apparatus, an electronic device, a computer-readable storage medium, and a computer program product.
According to an aspect of the present disclosure, there is provided a text data processing method including:
in response to receiving a text data processing request, determining a multi-text markup language tag according to to-be-processed text data in the text data processing request, wherein the multi-text markup language tag comprises a text block group tag, the text block group tag comprises at least one text block tag, each text block tag in the at least one text block tag comprises a to-be-quoted text data tag and a rule tag, and the rule tag corresponds to a preset rule;
determining candidate text data corresponding to each text block tag according to the at least one text block tag;
processing the candidate text data according to the preset rule corresponding to each text block tag to obtain text data to be quoted; and
determining processed text data according to the text data to be processed and the text data to be quoted corresponding to each text block tag.
According to an embodiment of the present disclosure, each text block tag further includes a permission change tag and a location tag, the permission change tag corresponds to permission change content, and the location tag corresponds to location information of the permission change content.
According to an embodiment of the present disclosure, the method further includes:
in response to detecting that the permission change content is changed, determining changed target change content and target position information corresponding to the target change content; and
replacing the text data to be quoted according to the target position information and the target change content to obtain updated text data.
According to an embodiment of the present disclosure, replacing the text data to be quoted according to the target position information and the target change content to obtain updated text data includes:
determining, according to the target change content, a target text block tag corresponding to the target change content;
determining, according to the target text block tag, a target preset rule corresponding to the target text block tag;
filtering the target change content according to the target preset rule to obtain processed target change content; and
replacing the text data to be quoted according to the processed target change content and the target position information to obtain the updated text data.
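The filter-then-replace step described above can be sketched as follows. This is a minimal illustration, not the patented implementation: it assumes the target position information is a pair of character offsets and that the target preset rule is a regular expression whose matches are the only parts of the change that may be kept. The function name `apply_change` and both assumptions are hypothetical.

```python
import re


def apply_change(quoted_text, position, change, rule):
    """Filter `change` with `rule`, then splice it into `quoted_text`.

    Assumptions (not from the source): `position` is a (start, end) pair of
    character offsets, and `rule` is a regex describing the allowed fragments.
    """
    start, end = position
    # Keep only the fragments of the change that the rule allows.
    filtered = "".join(m.group(0) for m in re.finditer(rule, change))
    # Replace the slice at the target position with the filtered content.
    return quoted_text[:start] + filtered + quoted_text[end:]
```

For instance, with a rule that allows only digits, a change of `"90 90"` applied to offsets 5 to 9 of `"port=8080;"` would yield `"port=9090;"`.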
According to an embodiment of the present disclosure, the text data tag to be referred to includes at least one of a pre-text data tag and a post-text data tag, and the candidate text data includes at least one of candidate pre-text data and candidate post-text data;
the pre-text data tag corresponds to the candidate pre-text data, and the post-text data tag corresponds to the candidate post-text data.
According to an embodiment of the present disclosure, each text block tag further includes address information and encoding information.
According to an embodiment of the present disclosure, the determining, according to the at least one text block tag, the candidate text data respectively corresponding to each text block tag includes:
determining a text file corresponding to each text block tag according to the address information;
parsing the text file according to the encoding information to obtain a parsed text file corresponding to each text block tag;
determining the candidate pre-text data according to the pre-text data tag and the parsed text file; and
determining the candidate post-text data according to the post-text data tag and the parsed text file.
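A minimal sketch of these steps is given below. It assumes the address information is a local file path and the encoding information is a codec name such as `utf-8` or `gbk`. How a pre-text or post-text data tag selects content from the parsed file is not specified here, so the sketch assumes, purely for illustration, that each tag carries a 1-based inclusive line range written as `start-end`. All function names are hypothetical.

```python
from pathlib import Path


def load_parsed_text(address, encoding):
    """Locate the file by its address and decode it with the given encoding."""
    return Path(address).read_bytes().decode(encoding)


def select_lines(parsed_text, line_range):
    """Assumed selector: 'start-end' with 1-based inclusive line numbers."""
    start, end = (int(n) for n in line_range.split("-"))
    return "\n".join(parsed_text.splitlines()[start - 1:end])


def candidate_text(address, encoding, pre_range=None, post_range=None):
    """Return candidate pre-text and post-text data for one text block tag."""
    parsed = load_parsed_text(address, encoding)
    return {
        "pre": select_lines(parsed, pre_range) if pre_range else None,
        "post": select_lines(parsed, post_range) if post_range else None,
    }
```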
According to an embodiment of the present disclosure, the preset rule includes at least one of a pre-rule and a post-rule, the pre-rule is used for processing the candidate pre-text data, and the post-rule is used for processing the candidate post-text data.
According to an embodiment of the present disclosure, the processing the candidate text data according to the preset rule corresponding to each text block tag respectively to obtain text data to be referred includes:
filtering the candidate pre-text data according to the pre-rule to obtain pre-text data to be quoted; and
filtering the candidate post-text data according to the post-rule to obtain post-text data to be quoted.
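The two filtering steps can be illustrated with the sketch below. The form of a pre-rule or post-rule is left open by the disclosure, so the sketch assumes, hypothetically, that each rule is a regular expression and that filtering keeps only the lines it matches. The names `apply_rule` and `build_quoted` are illustrative.

```python
import re


def apply_rule(candidate, rule):
    """Keep only the lines matched by the assumed regex rule; None keeps all."""
    if rule is None:
        return candidate
    pattern = re.compile(rule)
    return "\n".join(ln for ln in candidate.splitlines() if pattern.search(ln))


def build_quoted(pre_candidate, post_candidate, pre_rule=None, post_rule=None):
    """Filter candidate pre-text and post-text data with their own rules."""
    return (apply_rule(pre_candidate, pre_rule),
            apply_rule(post_candidate, post_rule))
```

Keeping the pre-rule and the post-rule as separate arguments mirrors the text, where each rule applies only to its own candidate data.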
According to an embodiment of the present disclosure, the multi-text markup language tag further includes a file tag, where the file tag includes a name sub-tag, a suffix sub-tag, a permission sub-tag, and a path sub-tag, and the method further includes:
generating a text file according to the processed text data, the name sub-tag, the suffix sub-tag, the permission sub-tag, and the path sub-tag, where the name sub-tag corresponds to the name of the text file, the suffix sub-tag corresponds to the suffix of the text file, the permission sub-tag corresponds to the access permission of the text file, and the path sub-tag corresponds to the generation location of the text file.
According to another aspect of the present disclosure, there is provided a text data processing apparatus including:
a first determining module, configured to determine, in response to receiving a text data processing request, a multi-text markup language tag according to to-be-processed text data in the text data processing request, where the to-be-processed text data is in an extensible markup language format, the multi-text markup language tag includes a text block group tag, the text block group tag includes at least one text block tag, each text block tag in the at least one text block tag includes a to-be-referenced text data tag and a rule tag, and the rule tag corresponds to a preset rule;
a second determining module, configured to determine, according to the at least one text block tag, candidate text data corresponding to each text block tag;
a processing module, configured to process the candidate text data according to the preset rule corresponding to each text block tag to obtain text data to be quoted; and
a third determining module, configured to determine the processed text data according to the text data to be processed and the text data to be quoted corresponding to each text block tag.
According to another aspect of the present disclosure, there is provided an electronic device including:
one or more processors;
a memory for storing one or more instructions,
wherein the one or more instructions, when executed by the one or more processors, cause the one or more processors to implement the method as described above.
According to another aspect of the present disclosure, there is provided a computer-readable storage medium having stored thereon executable instructions that, when executed by a processor, cause the processor to implement the method as described above.
According to another aspect of the present disclosure, there is provided a computer program product comprising computer executable instructions for implementing the method as described above when executed.
According to the embodiment of the disclosure, after a text data processing request is received, the multiple text markup language tags are determined according to the text data to be processed, and then the candidate text data corresponding to each text block tag is determined according to at least one text block tag in the multiple text markup language tags. In addition, the text data to be quoted is obtained by processing the candidate text data according to the preset rules respectively corresponding to each text block label, and the processed text data is determined according to the text data to be processed and the text data to be quoted respectively corresponding to each text block label, so that the text data can be automatically processed, the technical problem that the efficiency and the accuracy of processing the text file cannot be guaranteed in the related technology is at least partially solved, and the organization efficiency and the accuracy of the text file are improved.
Drawings
The above and other objects, features and advantages of the present disclosure will become more apparent from the following description of the embodiments of the present disclosure with reference to the accompanying drawings, in which:
fig. 1 schematically shows a system architecture to which a text data processing method according to an embodiment of the present disclosure may be applied;
FIG. 2 schematically shows a flow chart of a text data processing method according to an embodiment of the present disclosure;
FIG. 3 schematically illustrates an example schematic of a multi-text markup language tag, according to an embodiment of the disclosure;
FIG. 4 schematically illustrates an example schematic diagram of a text data processing method according to an embodiment of the disclosure;
FIG. 5 schematically illustrates an example schematic diagram of a text data processing method according to another embodiment of the present disclosure;
FIG. 6 schematically shows a block diagram of a text data processing apparatus according to an embodiment of the present disclosure; and
fig. 7 schematically shows a block diagram of an electronic device adapted to implement a text data processing method according to an embodiment of the present disclosure.
Detailed Description
Hereinafter, embodiments of the present disclosure will be described with reference to the accompanying drawings. It should be understood that the description is illustrative only and is not intended to limit the scope of the present disclosure. In the following detailed description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the embodiments of the disclosure. It may be evident, however, that one or more embodiments may be practiced without these specific details. Moreover, in the following description, descriptions of well-known structures and techniques are omitted so as to not unnecessarily obscure the concepts of the present disclosure.
The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the disclosure. The terms "comprises," "comprising," and the like, as used herein, specify the presence of stated features, steps, operations, and/or components, but do not preclude the presence or addition of one or more other features, steps, operations, or components.
All terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art unless otherwise defined. It is noted that the terms used herein should be interpreted as having a meaning that is consistent with the context of this specification and should not be interpreted in an idealized or overly formal sense.
Where a convention analogous to "at least one of A, B, and C, etc." is used, in general such a construction is intended in the sense one having skill in the art would understand the convention (e.g., "a system having at least one of A, B, and C" would include, but not be limited to, systems that have A alone, B alone, C alone, A and B together, A and C together, B and C together, and/or A, B, and C together, etc.). Where a convention analogous to "at least one of A, B, or C, etc." is used, in general such a construction is intended in the sense one having skill in the art would understand the convention (e.g., "a system having at least one of A, B, or C" would include, but not be limited to, systems that have A alone, B alone, C alone, A and B together, A and C together, B and C together, and/or A, B, and C together, etc.).
In the technical solution of the present disclosure, the acquisition, storage, and application of users' personal information all comply with the relevant laws and regulations, necessary security measures are taken, and public order and good customs are not violated.
In the technical scheme of the disclosure, before the personal information of the user is acquired or collected, the authorization or the consent of the user is acquired.
The text file may include ASCII (American Standard Code for Information Interchange), MIME (Multipurpose Internet Mail Extensions), TXT (text document), and the like.
Editing of a text file can be completed by entering characters or by copying and pasting the contents of other text files. For example, when editing text file A, if part of the contents of text file B needs to be referenced, that content may be pasted into text file A. However, if the referenced part of text file B is later updated, text file A must be manually edited or re-pasted to stay synchronized.
Further, because text file C cannot itself indicate which content may be changed, a user cannot know which parts of text file C are changeable and which are not. For non-program files, ordinary text content cannot be distinguished from declared content; for program files, the changeable parts must be described in comments, and only at program run time can it be determined whether a change meets the requirements.
In summary, in the prior art, since it is necessary to complete the reference of a certain text file to the contents of other text files by copy and paste, synchronous update of the contents of the referenced text file cannot be realized. In addition, because the reference mode of the existing text file cannot represent the range of the text which can be changed, the labor cost and the time cost are consumed.
In order to at least partially solve the technical problems in the related art, the present disclosure provides a text data processing method and apparatus, an electronic device, and a readable storage medium, which can be applied to the fields of computer technology and financial technology. The text data processing method comprises the following steps: in response to receiving a text data processing request, determining a multi-text markup language tag according to text data to be processed in the text data processing request, wherein the multi-text markup language tag comprises a text block group tag, the text block group tag comprises at least one text block tag, each text block tag in the at least one text block tag comprises a text data tag to be quoted and a rule tag, and the rule tag corresponds to a preset rule; determining candidate text data respectively corresponding to each text block label according to at least one text block label; processing the candidate text data according to preset rules respectively corresponding to each text block label to obtain text data to be quoted; and determining the processed text data according to the text data to be processed and the text data to be quoted corresponding to each text block label.
It should be noted that the text data processing method and apparatus provided in the embodiments of the present disclosure may be used in the fields of computer technology and financial technology, for example, for processing text data. The text data processing method and the text data processing device provided by the embodiment of the disclosure can also be used in any fields except the fields of computer technology and financial science and technology, such as the field of operation and maintenance. The application fields of the text data processing method and the text data processing device provided by the embodiment of the disclosure are not limited.
Fig. 1 schematically shows a system architecture to which a text data processing method according to an embodiment of the present disclosure can be applied. It should be noted that fig. 1 is only an example of a system architecture to which the embodiments of the present disclosure may be applied to help those skilled in the art understand the technical content of the present disclosure, and does not mean that the embodiments of the present disclosure may not be applied to other devices, systems, environments or scenarios.
As shown in fig. 1, the system architecture 100 according to this embodiment may include terminal devices 101, 102, 103, a network 104 and a server 105. The network 104 serves as a medium for providing communication links between the terminal devices 101, 102, 103 and the server 105. Network 104 may include various connection types, such as wired and/or wireless communication links, and so forth.
The user may use the terminal devices 101, 102, 103 to interact with the server 105 via the network 104 to receive or send messages or the like. The terminal devices 101, 102, 103 may have installed thereon various communication client applications, such as a shopping-like application, a web browser application, a search-like application, an instant messaging tool, a mailbox client, and/or social platform software, etc. (by way of example only).
The terminal devices 101, 102, 103 may be various electronic devices having a display screen and supporting web browsing, including but not limited to smart phones, tablet computers, laptop portable computers, desktop computers, and the like.
The server 105 may be a server providing various services, such as a background management server (for example only) providing support for websites browsed by users using the terminal devices 101, 102, 103. The background management server may analyze and perform other processing on the received data such as the user request, and feed back a processing result (e.g., a webpage, information, or data obtained or generated according to the user request) to the terminal device.
It should be noted that the text data processing method provided by the embodiment of the present disclosure may be generally executed by the server 105. Accordingly, the text data processing apparatus provided by the embodiment of the present disclosure may be generally disposed in the server 105. The text data processing method provided by the embodiment of the present disclosure may also be executed by a server or a server cluster that is different from the server 105 and is capable of communicating with the terminal devices 101, 102, 103 and/or the server 105. Accordingly, the text data processing apparatus provided by the embodiment of the present disclosure may also be disposed in a server or a server cluster different from the server 105 and capable of communicating with the terminal devices 101, 102, 103 and/or the server 105.
Alternatively, the text data processing method provided by the embodiment of the present disclosure may also be generally executed by the terminal device 101, 102, or 103. Accordingly, the text data processing device provided by the embodiment of the present disclosure may also be disposed in the terminal device 101, 102, or 103.
For example, the text file may be originally stored in any one of the terminal apparatuses 101, 102, or 103 (for example, but not limited to, the terminal apparatus 101), or stored on an external storage apparatus and may be imported into the terminal apparatus 101. Then, the terminal device 101 may locally execute the text data processing method provided by the embodiment of the present disclosure, or transmit the text file to another terminal device, a server, or a server cluster, and execute the text data processing method provided by the embodiment of the present disclosure by another terminal device, a server, or a server cluster that receives the text file.
It should be understood that the number of terminal devices, networks, and servers in fig. 1 is merely illustrative. There may be any number of terminal devices, networks, and servers, as desired for implementation.
It should be noted that the sequence numbers of the respective operations in the following methods are merely used as representations of the operations for description, and should not be construed as representing the execution order of the respective operations. The method need not be performed in the exact order shown, unless explicitly stated.
Fig. 2 schematically shows a flow chart of a text data processing method according to an embodiment of the present disclosure.
As shown in fig. 2, the text data processing method 200 may include operations S210 to S240.
In operation S210, in response to receiving a text data processing request, a multi-text markup language tag is determined according to to-be-processed text data in the text data processing request, where the multi-text markup language tag includes a text block group tag, the text block group tag includes at least one text block tag, each text block tag in the at least one text block tag includes a to-be-referenced text data tag and a rule tag, and the rule tag corresponds to a preset rule.
In operation S220, candidate text data respectively corresponding to each text block tag is determined according to at least one text block tag.
In operation S230, the candidate text data is processed according to the preset rule corresponding to each text block tag, so as to obtain the text data to be referred.
In operation S240, the processed text data is determined according to the text data to be processed and the text data to be referred to respectively corresponding to each text block tag.
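Operations S210 to S240 can be sketched end to end as follows. This is a minimal sketch under stated assumptions rather than the disclosed implementation: the element names `<tb>`, `<source>`, and `<rule>` are hypothetical, referenced files are stood in for by an in-memory dictionary, each preset rule is assumed to be a regular expression that selects the lines to keep, and the text data to be processed contributes only the order of the text blocks.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical to-be-processed text data; <tb>, <source>, <rule> are assumed names.
MTML = """
<mtml>
  <tbs>
    <tb><source>b.txt</source><rule>^host=</rule></tb>
    <tb><source>c.txt</source><rule>^port=</rule></tb>
  </tbs>
</mtml>
"""

# Stand-in for the referenced text files, keyed by address.
SOURCES = {
    "b.txt": "host=10.0.0.1\nuser=admin\n",
    "c.txt": "port=8080\ndebug=true\n",
}


def process(mtml_text, sources):
    # S210: determine the tags from the text data to be processed.
    root = ET.fromstring(mtml_text)
    blocks = root.findall("./tbs/tb")
    pieces = []
    for tb in blocks:
        # S220: determine the candidate text data for this text block tag.
        candidate = sources[tb.findtext("source")]
        # S230: apply the block's preset rule (assumed: keep matching lines).
        rule = re.compile(tb.findtext("rule"))
        pieces.extend(ln for ln in candidate.splitlines() if rule.search(ln))
    # S240: assemble the processed text data in text-block order.
    return "\n".join(pieces)
```

Because the referenced content is resolved each time `process` runs, re-running it after a source changes picks up the update without manual copy and paste.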
According to the embodiment of the disclosure, a code for generating a text data processing request can be written into a script in advance, and when text data processing is required, a user can run the script through terminal equipment to generate a text data processing request message and send the text data processing request message to a server, so that the server can process the text data processing request according to the text data processing request message to obtain processed text data.
According to an embodiment of the present disclosure, the text data to be processed may belong to an Extensible Markup Language (XML) format. A multi-text markup language tag, a text block group tag, a text block tag, a text data tag to be referred to, a rule tag, and the like may be defined based on the extensible markup language.
According to an embodiment of the present disclosure, after the text data processing request is received, the multi-text markup language tag may be determined according to the text data to be processed in the text data processing request. The Multiple Text Markup Language (MTML) tag may be represented as an <mtml> tag. The <mtml> tag may include at least one of a text block group tag, a file tag, an attribute tag, and a version tag.
According to an embodiment of the present disclosure, the text block group tag may include at least one text block tag. Each of the at least one text block tag may include a text data tag to be referenced, a rule tag, a permission change tag, and a location tag.
According to an embodiment of the present disclosure, an XML Schema may describe the structure of an extensible markup language document and define elements, attributes, sub-elements, sub-element order, number of sub-elements, data type, default values, fixed values, and whether text may be included in the document.
According to an embodiment of the present disclosure, after determining the multiple text markup language tags, candidate text data respectively corresponding to each text block tag may be determined according to at least one text block tag in the multiple text markup language tags. The candidate text data may be used to characterize the text data in the text file corresponding to each text block tag respectively. The candidate text data may include at least one of candidate pre-text data and candidate post-text data.
According to the embodiment of the disclosure, after the candidate text data is determined, the candidate text data can be processed according to the preset rule corresponding to each text block label, so as to obtain the text data to be referred. The text data to be quoted can be used for representing candidate text data after being filtered according to a preset rule. The preset rules include at least one of pre-rules and post-rules. The specific content of the preset rule can be set by those skilled in the art according to actual requirements.
According to the embodiment of the disclosure, after the text data to be referred is obtained, the processed text data can be determined according to the text data to be processed and the text data to be referred corresponding to each text block label. The processed text data may be used to characterize the text data of the text file to be generated.
According to the embodiment of the disclosure, after a text data processing request is received, the multiple text markup language tags are determined according to the text data to be processed, and then the candidate text data corresponding to each text block tag is determined according to at least one text block tag in the multiple text markup language tags. In addition, the text data to be quoted is obtained by processing the candidate text data according to the preset rules respectively corresponding to each text block label, and the processed text data is determined according to the text data to be processed and the text data to be quoted respectively corresponding to each text block label, so that the text data can be automatically processed, the technical problem that the efficiency and the accuracy of processing the text file cannot be guaranteed in the related technology is at least partially solved, and the organization efficiency and the accuracy of the text file are improved.
The text data processing method 200 according to an embodiment of the present disclosure is further explained with reference to fig. 3 to 5.
According to an embodiment of the present disclosure, the multiple text markup language tags further include a file tag, the file tag including a name sub-tag, a suffix sub-tag, a permission sub-tag, and a path sub-tag.
The text data processing method 200 may further include the following operations according to an embodiment of the present disclosure.
A text file is generated according to the processed text data, the name sub-tag, the suffix sub-tag, the permission sub-tag, and the path sub-tag, where the name sub-tag corresponds to the name of the text file, the suffix sub-tag corresponds to the suffix of the text file, the permission sub-tag corresponds to the access permission of the text file, and the path sub-tag corresponds to the generation location of the text file.
According to an embodiment of the present disclosure, the text block group tag may be characterized as a <tbs> tag, which defines a text block group that includes at least one text block tag. The file tag may be characterized as a <file> tag, which defines attribute information of the text file to be generated. The attribute tag may be characterized as a <properties> tag, which defines user-defined attribute values that can be referenced globally in subsequent writing. The version tag may be characterized as a <version> tag, which defines a version of the text file.
According to an embodiment of the present disclosure, the <mtml> tag may include the attributes xmlns, xmlns:xsi, and xsi:schemaLocation, which may be used to define how the extensible markup language document is parsed and checked.
According to embodiments of the present disclosure, the xmlns (XML namespace) attribute may be used to distinguish between tags. The xmlns:xsi attribute may be understood as a custom syntax or format checker of the extensible markup language document. The xsi:schemaLocation attribute may be used to define key-value pairs, for example, xsi:schemaLocation = "key value", where the key represents the value assigned when the namespace is defined, and the value represents address information of the text file to be referred to.
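As an illustration of the key-value reading of xsi:schemaLocation, the following sketch extracts the pairs with Python's standard `xml.etree.ElementTree`. The `example.com` URIs are placeholders and are not taken from the disclosure; only the <mtml> tag name and the three attribute names come from the text above.

```python
import xml.etree.ElementTree as ET

# Standard XML Schema instance namespace, conventionally bound to "xsi".
XSI = "http://www.w3.org/2001/XMLSchema-instance"

# Placeholder URIs (assumptions for illustration only).
DOC = (
    '<mtml xmlns="http://example.com/mtml" '
    'xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" '
    'xsi:schemaLocation="http://example.com/mtml http://example.com/mtml.xsd"/>'
)

root = ET.fromstring(DOC)
# ElementTree exposes namespaced attributes as "{namespace-uri}local-name".
raw = root.get(f"{{{XSI}}}schemaLocation", "")
tokens = raw.split()
# Whitespace-separated tokens alternate between key and value.
schema_pairs = dict(zip(tokens[0::2], tokens[1::2]))
print(schema_pairs)
```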
According to an embodiment of the present disclosure, the file tag may include a name sub-tag, a suffix sub-tag, a permission sub-tag, and a path sub-tag. The name sub-tag may be characterized as a <name> tag that defines the name of the text file to be generated. The suffix sub-tag may be characterized as an <extension> tag for defining a suffix name of the text file to be generated, e.g., .txt, .java, .c, .py, etc. The permission sub-tag may be characterized as a <permission> tag for defining the permission of the text file to be generated. The path sub-tag may be characterized as a <path> tag for defining a generation location of the text file to be generated.
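A minimal sketch of reading the four sub-tags from a <file> tag is given below. The tag names come from the description above; the nesting of <file> directly under <mtml>, the sample values, and the helper name `read_file_tag` are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical document layout; namespaces are omitted for brevity.
SAMPLE = """\
<mtml>
  <version>1.0</version>
  <file>
    <name>Demo</name>
    <extension>.java</extension>
    <permission>644</permission>
    <path>/tmp/generated</path>
  </file>
  <tbs/>
</mtml>
"""


def read_file_tag(xml_text: str) -> dict:
    """Return the name, extension, permission, and path sub-tag values."""
    root = ET.fromstring(xml_text)
    file_tag = root.find("file")
    if file_tag is None:
        raise ValueError("no <file> tag found")
    keys = ("name", "extension", "permission", "path")
    return {key: (file_tag.findtext(key) or "").strip() for key in keys}


file_info = read_file_tag(SAMPLE)
print(file_info)
```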
FIG. 3 schematically illustrates an example schematic of a multi-text markup language tag according to an embodiment of this disclosure.
As shown in fig. 3, the multiple text markup language tag 300 may include an attribute tag 301, a file tag 302, and a text block group tag 303. The file tag 302 may include a name sub-tag 302_1, a suffix sub-tag 302_2, a permission sub-tag 302_3, and a path sub-tag 302_4.
The text block group tag 303 may include at least one text block tag, for example, text block tag 303_1, text block tag 303_2, …, text block tag 303_n, …, text block tag 303_N, where N may be an integer greater than 1 and n ∈ {1, 2, …, (N-1), N}. Each of the at least one text block tag may include a pre-text data tag, a post-text data tag, a location tag, and a permission change tag.
For example, text block tag 303_1 may include a pre-text data tag 303_11, a post-text data tag 303_12, a location tag 303_13, and a permission change tag 303_14. The pre-text data tag 303_11 may include a rule tag 303_111 and a content tag 303_112, the rule tag 303_111 may correspond to a pre-rule, and the content tag 303_112 may correspond to candidate pre-text data. The post-text data tag 303_12 may include a rule tag 303_121 and a content tag 303_122, the rule tag 303_121 may correspond to a post-rule, and the content tag 303_122 may correspond to candidate post-text data.
Candidate pre-text data corresponding to the content tag 303_112 can be determined according to the pre-text data tag 303_11, and the candidate pre-text data can be processed according to the pre-rule corresponding to the rule tag 303_111 to obtain the pre-text data to be referred to. Candidate post-text data corresponding to the content tag 303_122 can be determined according to the post-text data tag 303_12, and the candidate post-text data can be processed according to the post-rule corresponding to the rule tag 303_121 to obtain the post-text data to be referred to.
After the pre-text data to be referred to and the post-text data to be referred to are obtained, the processed text data can be determined according to the text data to be processed and the pre-text data to be referred to and the post-text data to be referred to which correspond to the text block tag 303_1.
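The tag hierarchy of fig. 3 can be modeled, for illustration, with plain data classes. All class and field names below are hypothetical; the sketch only mirrors the containment relationships described above (a text block group holds text blocks, and each text block holds pre-text and post-text data tags, a location tag, and a permission change tag).

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class TextDataTag:
    """A pre-text or post-text data tag: one rule tag plus one content tag."""
    rule: str     # preset rule (a regular expression)
    content: str  # candidate text data


@dataclass
class TextBlockTag:
    head: Optional[TextDataTag] = None   # pre-text data tag (303_11)
    tail: Optional[TextDataTag] = None   # post-text data tag (303_12)
    location: Optional[str] = None       # location tag (303_13)
    permission_change: dict = field(default_factory=dict)  # key-value pairs (303_14)


@dataclass
class TextBlockGroupTag:
    blocks: list = field(default_factory=list)  # at least one TextBlockTag


block_303_1 = TextBlockTag(
    head=TextDataTag(rule=r"^//", content="// header"),
    tail=TextDataTag(rule=r"^//", content="// footer"),
)
group_303 = TextBlockGroupTag(blocks=[block_303_1])
print(group_303.blocks[0].head.content)
```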
According to an embodiment of the present disclosure, the text data tag to be referred to includes at least one of a pre-text data tag and a post-text data tag, and the candidate text data includes at least one of candidate pre-text data and candidate post-text data. The pre-text data tags correspond to candidate pre-text data, and the post-text data tags correspond to candidate post-text data.
According to an embodiment of the present disclosure, each text block tag further includes address information and encoding information.
According to an embodiment of the present disclosure, operation S220 may include the following operations.
The text file corresponding to each text block tag is determined according to the address information. The text file is parsed according to the encoding information to obtain a parsed text file corresponding to each text block tag. Candidate pre-text data is determined according to the pre-text data tag and the parsed text file. Candidate post-text data is determined according to the post-text data tag and the parsed text file.
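The operations above can be sketched as follows. The disclosure does not state how a data tag selects content from the parsed file, so this sketch assumes, purely for illustration, that each tag carries a hypothetical `(start, end)` line span; the function names are likewise hypothetical.

```python
import tempfile
from pathlib import Path


def parse_text_file(address: str, encoding: str) -> list[str]:
    """Locate the file by its address and decode it with the given encoding."""
    return Path(address).read_bytes().decode(encoding).splitlines()


def select_candidate(lines: list[str], span: tuple[int, int]) -> str:
    """Pick candidate text by a (start, end) line span (an assumed convention)."""
    start, end = span
    return "\n".join(lines[start:end])


with tempfile.TemporaryDirectory() as tmp:
    src = Path(tmp) / "source.txt"
    # Store the sample in UTF-16 so that the encoding information matters.
    src.write_bytes("// header\nbody line\n// footer\n".encode("utf-16"))

    parsed = parse_text_file(str(src), "utf-16")
    candidate_pre = select_candidate(parsed, (0, 1))
    candidate_post = select_candidate(parsed, (2, 3))

print(candidate_pre)
print(candidate_post)
```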
According to an embodiment of the present disclosure, the preset rule includes at least one of a pre-rule and a post-rule, the pre-rule is used for processing the candidate pre-text data, and the post-rule is used for processing the candidate post-text data.
According to an embodiment of the present disclosure, operation S230 may include the following operations.
The candidate pre-text data is filtered according to the pre-rule to obtain the pre-text data to be referred to. The candidate post-text data is filtered according to the post-rule to obtain the post-text data to be referred to.
According to an embodiment of the present disclosure, each text block tag may further include address information and encoding information. The address information may be used to characterize the storage address of the text file to be parsed. The encoding information may be used to characterize the encoding scheme of the text file to be parsed.
According to an embodiment of the present disclosure, the text data tag to be referred to includes at least one of a pre-text data tag and a post-text data tag. Each text block tag may include a pre-text data tag, a post-text data tag, a permission change tag, and a location tag.
According to an embodiment of the present disclosure, the pre-text data tag may include a rule tag and a content tag. The rule tag may be characterized as a <rep> tag, in which a pre-rule may be defined. The content tag may include candidate pre-text data. The pre-text data tag may be characterized as a <head> tag, and candidate pre-text data corresponding to the <head> tag may be inserted before the referenced text.
According to an embodiment of the present disclosure, the post-text data tag may include a rule tag and a content tag. The rule tag may be characterized as a <rep> tag, in which a post-rule may be defined, and the content tag may include candidate post-text data. The post-text data tag may be characterized as a <tail> tag, and candidate post-text data corresponding to the <tail> tag may be inserted after the referenced text.
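The placement of <head> and <tail> content around the referenced text can be illustrated with a short sketch. The function name `assemble` and the use of a newline as the separator are assumptions; the disclosure only states that the pre-text is inserted before and the post-text after the referenced text.

```python
def assemble(referenced_text: str, pre_text: str = "", post_text: str = "") -> str:
    """Place pre-text before and post-text after the referenced text.

    Empty parts are skipped, since a text block may carry only one of the
    two data tags.
    """
    parts = [part for part in (pre_text, referenced_text, post_text) if part]
    return "\n".join(parts)


assembled = assemble("int main() { return 0; }",
                     pre_text="// generated header",
                     post_text="// end of file")
print(assembled)
```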
According to an embodiment of the present disclosure, the rule tag may correspond to a preset rule. The preset rule includes at least one of a pre-rule and a post-rule. The preset rule may be a regular expression, and regular-expression matching may be performed on the candidate text data based on the preset rule to obtain text data to be referred to that conforms to the regular expression.
According to the embodiment of the disclosure, the text file corresponding to each text block tag can be determined according to the address information, and the text file is then parsed according to the encoding information to obtain the parsed text file corresponding to each text block tag. The candidate text data is then determined according to the text data tag to be referred to and is processed according to the preset rule; content that does not conform to the rule is filtered out, which completes the filtering of the candidate text data and yields the text data to be referred to.
According to the embodiment of the disclosure, the candidate text data is determined according to the text data tag to be referred to and the parsed text file, and the candidate text data is processed according to the preset rule, so that the text data to be referred to is obtained. Because the pre-text data tag, the candidate pre-text data, and the pre-rule correspond to one another, and the post-text data tag, the candidate post-text data, and the post-rule correspond to one another, the processing accuracy of the text data can be improved.
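Rule-based filtering of this kind can be sketched with Python's `re` module. The line-by-line granularity and the function name `filter_by_rule` are assumptions; the disclosure only states that content conforming to the regular expression is kept.

```python
import re


def filter_by_rule(candidate: str, rule: str) -> str:
    """Keep only the candidate lines that match the preset rule."""
    pattern = re.compile(rule)
    kept = [line for line in candidate.splitlines() if pattern.search(line)]
    return "\n".join(kept)


candidate_pre_text = "// Copyright 2022\nTODO remove\n// Generated file"
pre_text_to_refer = filter_by_rule(candidate_pre_text, r"^//")
print(pre_text_to_refer)
```

With the rule `^//`, the line "TODO remove" is dropped and the two comment lines are kept.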
Fig. 4 schematically shows an example schematic diagram of a text data processing method according to an embodiment of the present disclosure.
As shown in fig. 4, in response to receiving the text data processing request 401, a multi-text markup language tag can be determined from the to-be-processed text data 401_1 in the text data processing request 401, the multi-text markup language tag can include a text block group tag, and the text block group tag can include at least one text block tag 402. Text block tag 402 may include address information 402_1, encoding information 402_2, pre-text data tag 402_3, post-text data tag 402_4, pre-rule 402_5, and post-rule 402_6.
The text file 403 corresponding to the text block tag 402 can be determined from the address information 402_1. The text file 403 may be parsed according to the encoding information 402_2, resulting in a parsed text file 404 corresponding to the text block tag. Candidate pre-text data 405 may be determined from pre-text data tag 402_3 and parsed text file 404. Candidate post-text data 406 may be determined from post-text data tag 402_4 and parsed text file 404.
The candidate pre-text data 405 may be filtered according to the pre-rule 402_5, so as to obtain pre-text data 407 to be referred to. The candidate post-text data 406 may be filtered according to the post rule 402_6, resulting in post-text data 408 to be referred to.
After the pre-text data 407 to be referred to and the post-text data 408 to be referred to are obtained, the processed text data 409 may be determined according to the text data 401_1 to be processed, and the pre-text data 407 to be referred to and the post-text data 408 to be referred to which correspond to the text block tag 402.
According to an embodiment of the present disclosure, each text block tag further includes a permission change tag corresponding to the permission change content and a location tag corresponding to location information of the permission change content.
The text data processing method 200 may further include the following operations according to an embodiment of the present disclosure.
In response to detecting that the permission change content has been changed, the changed target change content and the target location information corresponding to the target change content are determined. The text data to be referred to is replaced according to the target location information and the target change content to obtain updated text data.
According to the embodiment of the disclosure, replacing the text data to be referred according to the target position information and the target change content, and obtaining the updated text data may include the following operations.
A target text block tag corresponding to the target change content is determined according to the target change content. A target preset rule corresponding to the target text block tag is determined according to the target text block tag. The target change content is filtered according to the target preset rule to obtain the processed target change content. The text data to be referred to is replaced according to the processed target change content and the target location information to obtain the updated text data.
According to an embodiment of the present disclosure, the permission change tag may be defined based on a key-value pair. The value of the key may be used to define a keyword of the permission change content, and in the case where a text block tag includes the keyword in its value, the text data to be referred to corresponding to the text block tag may be replaced. The permission change tag may be provided with an md5 attribute that may be used to verify whether the permission change content has been changed. The location tag may be characterized as a <free> tag that may be used to define the location of the permission change content.
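Change detection through the md5 attribute can be sketched with the standard `hashlib` module. The sketch assumes that the attribute stores the hexadecimal MD5 digest of the UTF-8 encoded permission change content; the disclosure does not state the exact input to the digest, and both function names are hypothetical.

```python
import hashlib


def md5_of(text: str) -> str:
    """Hex MD5 digest of the UTF-8 encoded text (assumed digest input)."""
    return hashlib.md5(text.encode("utf-8")).hexdigest()


def has_changed(current_content: str, recorded_md5: str) -> bool:
    """True when the content no longer matches the recorded md5 attribute."""
    return md5_of(current_content) != recorded_md5


recorded = md5_of("timeout = 30")
print(has_changed("timeout = 30", recorded))
print(has_changed("timeout = 45", recorded))
```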
According to the embodiment of the disclosure, the target change content can be used for representing the changed permission change content, and the target text block tag corresponding to the target change content can be determined according to the target change content. The target location information may be used to represent location information corresponding to the changed permission change content, and a target preset rule corresponding to the target text block tag may be determined according to the target text block tag.
According to an embodiment of the present disclosure, the target preset rule includes at least one of a target pre-rule and a target post-rule. The target preset rule may be a regular expression, and regular-expression matching may be performed on the target change content based on the target preset rule to obtain the processed target change content conforming to the regular expression. The processed target change content can be used for replacing the text data corresponding to the target location information in the text data to be referred to, so as to obtain the updated text data.
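The filter-then-replace step can be sketched as follows. The disclosure does not specify how target location information is represented, so the sketch assumes a hypothetical `(start, end)` pair of character offsets into the text data to be referred to; the function name `replace_at` is also hypothetical.

```python
import re


def replace_at(text: str, span: tuple[int, int], change: str, rule: str) -> str:
    """Filter `change` with the target preset rule, then splice it into `text`.

    Only the parts of `change` that match `rule` are kept; they replace the
    characters of `text` in the half-open range [start, end).
    """
    start, end = span
    processed = "".join(m.group(0) for m in re.finditer(rule, change))
    return text[:start] + processed + text[end:]


text_to_refer = "timeout = 30 seconds"
# Characters 10..12 hold "30"; the rule keeps digits only, so "45!!" becomes "45".
updated_text = replace_at(text_to_refer, (10, 12), "45!!", r"\d+")
print(updated_text)
```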
According to the embodiment of the disclosure, after the permission change content is detected to be changed, the text data to be referred to is replaced according to the target position information and the target change content, so that updated text data is obtained. Because the range capable of being changed can be represented by the defined permission change label and the position label, the technical problems that the change range cannot be represented and the labor and time costs are consumed in the related technology are solved, and the processing efficiency of the text data is improved.
Fig. 5 schematically shows an example schematic diagram of a text data processing method according to another embodiment of the present disclosure.
As shown in fig. 5, in response to receiving the text data processing request 501, a multi-text markup language tag can be determined from the pending text data 501_1 in the text data processing request 501, the multi-text markup language tag can include a text block group tag, and the text block group tag can include a text block tag 502. Candidate text data 503 corresponding to the text block tag may be determined from the text block tag 502. The candidate text data 503 may be processed according to a preset rule corresponding to the text block tag, so as to obtain the text data 504 to be referred.
Text block tag 502 may include a permission change tag 502_1 and a location tag 502_2. In response to detecting that the permission change content has been changed, the changed target change content 505 is determined. The target location information 506 corresponding to the target change content can be determined from the target change content 505 and the location tag 502_2.
After the target change content 505 and the target location information 506 are determined, a target text block tag 507 corresponding to the target change content 505 may be determined according to the target change content 505. A target preset rule 508 corresponding to the target text block tag 507 may be determined according to the target text block tag 507. The target change content 505 may be filtered according to the target preset rule 508, so as to obtain a processed target change content 509. The text data corresponding to the target location information 506 in the text data 504 to be referred to may be replaced with the processed target change content 509 to obtain updated text data 510.
The above is only an exemplary embodiment, but is not limited thereto, and other text data processing methods known in the art may be included as long as they can process text data.
Fig. 6 schematically shows a block diagram of a text data processing apparatus according to an embodiment of the present disclosure.
As shown in fig. 6, the text data processing apparatus 600 may include a first determining module 601, a second determining module 602, a processing module 603, and a third determining module 604.
The first determining module 601 is configured to determine, in response to receiving a text data processing request, multiple text markup language tags according to the to-be-processed text data in the text data processing request, where the to-be-processed text data is in an extensible markup language format, the multiple text markup language tags include a text block group tag, the text block group tag includes at least one text block tag, each text block tag in the at least one text block tag includes a to-be-referenced text data tag and a rule tag, and the rule tag corresponds to a preset rule.
A second determining module 602, configured to determine candidate text data corresponding to each text block tag according to at least one text block tag.
The processing module 603 is configured to process the candidate text data according to a preset rule corresponding to each text block tag, so as to obtain text data to be referred.
A third determining module 604, configured to determine the processed text data according to the text data to be processed and the text data to be referred that respectively correspond to each text block tag.
According to an embodiment of the present disclosure, each text block tag further includes a permission change tag corresponding to the permission change content and a location tag corresponding to location information of the permission change content.
According to an embodiment of the present disclosure, the text data processing apparatus 600 may further include a fourth determination module and a replacement module.
The fourth determining module is configured to determine, in response to detecting that the permission change content has been changed, the changed target change content and target location information corresponding to the target change content.
The replacing module is configured to replace the text data to be referred to according to the target location information and the target change content to obtain the updated text data.
According to an embodiment of the present disclosure, a replacement module may include a first determination unit, a second determination unit, a first processing unit, and a replacement unit.
The first determining unit is configured to determine a target text block tag corresponding to the target change content according to the target change content.
The second determining unit is configured to determine a target preset rule corresponding to the target text block tag according to the target text block tag.
The first processing unit is configured to filter the target change content according to the target preset rule to obtain the processed target change content.
The replacing unit is configured to replace the text data to be referred to according to the processed target change content and the target location information to obtain updated text data.
According to an embodiment of the present disclosure, the text data tag to be referred to includes at least one of a pre-text data tag and a post-text data tag, and the candidate text data includes at least one of candidate pre-text data and candidate post-text data.
According to an embodiment of the present disclosure, the pre-text data tags correspond to candidate pre-text data, and the post-text data tags correspond to candidate post-text data.
According to an embodiment of the present disclosure, each text block tag further includes address information and encoding information.
According to an embodiment of the present disclosure, the second determining module 602 may include a third determining unit, a parsing unit, a fourth determining unit, and a fifth determining unit.
The third determining unit is configured to determine the text file corresponding to each text block tag according to the address information.
The parsing unit is configured to parse the text file according to the encoding information to obtain a parsed text file corresponding to each text block tag.
The fourth determining unit is configured to determine candidate pre-text data according to the pre-text data tag and the parsed text file.
The fifth determining unit is configured to determine candidate post-text data according to the post-text data tag and the parsed text file.
According to an embodiment of the present disclosure, the preset rule includes at least one of a pre-rule and a post-rule, the pre-rule is used for processing the candidate pre-text data, and the post-rule is used for processing the candidate post-text data.
According to an embodiment of the present disclosure, the processing module 603 may include a second processing unit and a third processing unit.
The second processing unit is configured to filter the candidate pre-text data according to the pre-rule to obtain the pre-text data to be referred to.
The third processing unit is configured to filter the candidate post-text data according to the post-rule to obtain the post-text data to be referred to.
According to an embodiment of the present disclosure, the multiple text markup language tags further include a file tag, the file tag including a name sub-tag, a suffix sub-tag, a permission sub-tag, and a path sub-tag.
According to an embodiment of the present disclosure, the text data processing apparatus 600 may further include a generation module.
The generating module is configured to generate a text file according to the processed text data, the name sub-tag, the suffix sub-tag, the permission sub-tag, and the path sub-tag, wherein the name sub-tag corresponds to the name of the text file, the suffix sub-tag corresponds to the suffix name of the text file, the permission sub-tag corresponds to the access permission of the text file, and the path sub-tag corresponds to the generation location of the text file.
Any number of modules, sub-modules, units, sub-units, or at least part of the functionality of any number thereof according to embodiments of the present disclosure may be implemented in one module. Any one or more of the modules, sub-modules, units, and sub-units according to the embodiments of the present disclosure may be implemented by being split into a plurality of modules. Any one or more of the modules, sub-modules, units, sub-units according to embodiments of the present disclosure may be implemented at least in part as a hardware circuit, such as a Field Programmable Gate Array (FPGA), a Programmable Logic Array (PLA), a system on a chip, a system on a substrate, a system on a package, an Application Specific Integrated Circuit (ASIC), or may be implemented in any other reasonable manner of hardware or firmware by integrating or packaging a circuit, or in any one of or a suitable combination of software, hardware, and firmware implementations. Alternatively, one or more of the modules, sub-modules, units, sub-units according to embodiments of the disclosure may be at least partially implemented as a computer program module, which when executed may perform the corresponding functions.
For example, any number of the first determining module 601, the second determining module 602, the processing module 603, and the third determining module 604 may be combined and implemented in one module/unit/sub-unit, or any one of the modules/units/sub-units may be split into a plurality of modules/units/sub-units. Alternatively, at least part of the functionality of one or more of these modules/units/sub-units may be combined with at least part of the functionality of other modules/units/sub-units and implemented in one module/unit/sub-unit. According to an embodiment of the present disclosure, at least one of the first determining module 601, the second determining module 602, the processing module 603, and the third determining module 604 may be implemented at least partially as a hardware circuit, such as a Field Programmable Gate Array (FPGA), a Programmable Logic Array (PLA), a system on a chip, a system on a substrate, a system on a package, an Application Specific Integrated Circuit (ASIC), or may be implemented in hardware or firmware by any other reasonable manner of integrating or packaging a circuit, or in any one of three implementations of software, hardware, and firmware, or in any suitable combination of any of them. Alternatively, at least one of the first determining module 601, the second determining module 602, the processing module 603 and the third determining module 604 may be at least partly implemented as a computer program module, which when executed may perform a corresponding function.
It should be noted that the text data processing apparatus portion in the embodiment of the present disclosure corresponds to the text data processing method portion in the embodiment of the present disclosure, and the description of the text data processing apparatus portion specifically refers to the text data processing method portion, which is not described herein again.
Fig. 7 schematically shows a block diagram of an electronic device adapted to implement a text data processing method according to an embodiment of the present disclosure. The electronic device shown in fig. 7 is only an example, and should not bring any limitation to the functions and the scope of use of the embodiments of the present disclosure.
As shown in fig. 7, an electronic device 700 according to an embodiment of the present disclosure includes a processor 701, which can perform various appropriate actions and processes according to a program stored in a Read Only Memory (ROM) 702 or a program loaded from a storage section 708 into a Random Access Memory (RAM) 703. The processor 701 may include, for example, a general purpose microprocessor (e.g., a CPU), an instruction set processor and/or associated chipset, and/or a special purpose microprocessor (e.g., an Application Specific Integrated Circuit (ASIC)), among others. The processor 701 may also include on-board memory for caching purposes. The processor 701 may comprise a single processing unit or a plurality of processing units for performing the different actions of the method flows according to embodiments of the present disclosure.
In the RAM 703, various programs and data necessary for the operation of the electronic device 700 are stored. The processor 701, the ROM 702, and the RAM 703 are connected to each other by a bus 704. The processor 701 performs various operations of the method flows according to the embodiments of the present disclosure by executing programs in the ROM 702 and/or the RAM 703. It is noted that the programs may also be stored in one or more memories other than the ROM 702 and the RAM 703. The processor 701 may also perform various operations of method flows according to embodiments of the present disclosure by executing programs stored in the one or more memories.
Electronic device 700 may also include input/output (I/O) interface 705, which input/output (I/O) interface 705 is also connected to bus 704, according to an embodiment of the present disclosure. The electronic device 700 may also include one or more of the following components connected to the I/O interface 705: an input portion 706 including a keyboard, a mouse, and the like; an output section 707 including a display such as a Cathode Ray Tube (CRT), a Liquid Crystal Display (LCD), and the like, and a speaker; a storage section 708 including a hard disk and the like; and a communication section 709 including a network interface card such as a LAN card, a modem, or the like. The communication section 709 performs communication processing via a network such as the internet. A drive 710 is also connected to the I/O interface 705 as needed. A removable medium 711 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like is mounted on the drive 710 as necessary, so that a computer program read out therefrom is mounted into the storage section 708 as necessary.
According to embodiments of the present disclosure, method flows according to embodiments of the present disclosure may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product comprising a computer program embodied on a computer readable storage medium, the computer program containing program code for performing the method illustrated by the flow chart. In such an embodiment, the computer program can be downloaded and installed from a network through the communication section 709, and/or installed from the removable medium 711. The computer program, when executed by the processor 701, performs the above-described functions defined in the system of the embodiment of the present disclosure. The systems, devices, apparatuses, modules, units, etc. described above may be implemented by computer program modules according to embodiments of the present disclosure.
The present disclosure also provides a computer-readable storage medium, which may be embodied in the device/apparatus/system described in the above embodiments; or may exist separately and not be assembled into the device/apparatus/system. The computer-readable storage medium carries one or more programs which, when executed, implement the method according to an embodiment of the disclosure.
According to an embodiment of the present disclosure, the computer-readable storage medium may be a non-volatile computer-readable storage medium. Examples may include, but are not limited to: a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the present disclosure, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.
For example, according to an embodiment of the present disclosure, a computer-readable storage medium may include the above-described ROM 702 and/or RAM 703 and/or one or more memories other than the ROM 702 and the RAM 703.
Embodiments of the present disclosure also include a computer program product comprising a computer program containing program code for performing the method provided by the embodiments of the present disclosure, when the computer program product is run on an electronic device, the program code being adapted to cause the electronic device to carry out the method of processing text data provided by the embodiments of the present disclosure.
The computer program, when executed by the processor 701, performs the above-described functions defined in the system/apparatus of the embodiments of the present disclosure. The above described systems, devices, modules, units, etc. may be implemented by computer program modules according to embodiments of the present disclosure.
In one embodiment, the computer program may be hosted on a tangible storage medium such as an optical storage device, a magnetic storage device, or the like. In another embodiment, the computer program may also be transmitted in the form of a signal over a network medium, distributed, and downloaded and installed via the communication section 709, and/or installed from the removable medium 711. The computer program containing program code may be transmitted using any suitable network medium, including but not limited to: wireless, wired, etc., or any suitable combination of the foregoing.
In accordance with embodiments of the present disclosure, program code for executing computer programs provided by embodiments of the present disclosure may be written in any combination of one or more programming languages. In particular, these computer programs may be implemented using high-level procedural and/or object-oriented programming languages, and/or assembly/machine languages. The programming languages include, but are not limited to, Java, C++, Python, C, or the like. The program code may execute entirely on the user computing device, partly on the user device, partly on a remote computing device, or entirely on the remote computing device or server. In the case of a remote computing device, the remote computing device may be connected to the user computing device through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computing device (e.g., through the internet using an internet service provider).
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams or flowchart illustration, and combinations of blocks in the block diagrams or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
Those skilled in the art will appreciate that various combinations and/or associations of the features recited in the various embodiments and/or claims of the present disclosure can be made, even if such combinations or associations are not expressly recited in the present disclosure. In particular, various combinations and/or associations of the features recited in the various embodiments of the present disclosure and/or the claims may be made without departing from the spirit and teachings of the present disclosure. All such combinations and/or associations are within the scope of the present disclosure.
The embodiments of the present disclosure have been described above. However, these examples are for illustrative purposes only and are not intended to limit the scope of the present disclosure. Although the embodiments are described separately above, this does not mean that the measures in the embodiments cannot be used in advantageous combination. The scope of the disclosure is defined by the appended claims and equivalents thereof. Various alternatives and modifications can be devised by those skilled in the art without departing from the scope of the present disclosure, and such alternatives and modifications are intended to be within the scope of the present disclosure.

Claims (12)

1. A text data processing method, comprising:
in response to receiving a text data processing request, determining multi-text markup language tags according to to-be-processed text data in the text data processing request, wherein the multi-text markup language tags comprise a text block group tag, the text block group tag comprises at least one text block tag, each text block tag in the at least one text block tag comprises a to-be-quoted text data tag and a rule tag, and the rule tag corresponds to a preset rule;
determining, according to the at least one text block tag, candidate text data respectively corresponding to each text block tag;
processing the candidate text data according to the preset rule corresponding to each text block tag to obtain text data to be quoted; and
determining processed text data according to the to-be-processed text data and the text data to be quoted corresponding to each text block tag.
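The four steps of claim 1 can be sketched as follows. This is a minimal illustration only, not the claimed implementation: the tag names `blockGroup`, `block`, `quote` and `rule`, the rule names in `PRESET_RULES`, and the in-memory `sources` lookup are all hypothetical, since the claims do not fix a concrete schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical preset rules, keyed by the text of the (hypothetical) rule tag.
PRESET_RULES = {
    # Drop every line whose first non-blank character is '#'.
    "drop_comments": lambda text: "\n".join(
        line for line in text.splitlines() if not line.lstrip().startswith("#")
    ),
    # Pass the candidate text through unchanged.
    "keep_all": lambda text: text,
}


def process_request(xml_text, sources):
    """Resolve each text block to quoted text and join the results.

    xml_text: to-be-processed text data in XML form (assumed schema).
    sources:  dict mapping a quote key to candidate text (assumed lookup).
    """
    root = ET.fromstring(xml_text)
    group = root.find("blockGroup")          # text block group tag
    parts = []
    for block in group.findall("block"):     # each text block tag
        quote_key = block.findtext("quote")  # to-be-quoted text data tag
        rule_name = block.findtext("rule")   # rule tag -> preset rule
        candidate = sources[quote_key]       # candidate text data
        parts.append(PRESET_RULES[rule_name](candidate))
    return "\n".join(parts)                  # processed text data
```

Keeping the rules in a lookup table means a new preset rule can be added without touching the traversal logic; a real system would presumably also validate unknown rule names and missing sources.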
2. The method of claim 1, wherein each text block tag further comprises a permission change tag corresponding to permission change content and a location tag corresponding to location information of the permission change content.
3. The method of claim 1, further comprising:
in response to detecting a change in the permission change content, determining the changed target change content and target position information corresponding to the target change content; and
replacing the text data to be quoted according to the target position information and the target change content to obtain updated text data.
4. The method according to claim 3, wherein the replacing the text data to be quoted according to the target position information and the target change content to obtain updated text data comprises:
determining a target text block tag corresponding to the target change content according to the target change content;
determining a target preset rule corresponding to the target text block tag according to the target text block tag;
filtering the target change content according to the target preset rule to obtain processed target change content; and
replacing the text data to be quoted according to the processed target change content and the target position information to obtain the updated text data.
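The update flow of claims 3 and 4 can be sketched as below, under stated assumptions: the `Change` record, its `block_id` field, and the use of character offsets as "position information" are hypothetical choices made for illustration; the claims do not say how positions are encoded.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class Change:
    block_id: str  # hypothetical key linking the change to a text block tag
    content: str   # the target change content
    start: int     # target position: start offset in the quoted text
    end: int       # target position: end offset (exclusive)


def apply_change(quoted: str, change: Change,
                 rules: Dict[str, Callable[[str], str]]) -> str:
    """Filter the changed content with its block's rule, then splice it in."""
    rule = rules[change.block_id]    # target preset rule for the target block
    filtered = rule(change.content)  # processed target change content
    return quoted[:change.start] + filtered + quoted[change.end:]
```

For instance, with `rules = {"b1": str.strip}`, the call `apply_change("version=1.0;owner=ops", Change("b1", "  2.0  ", 8, 11), rules)` replaces only the version number and leaves the rest of the quoted text untouched.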
5. The method of claim 1, wherein the to-be-quoted text data tag comprises at least one of a pre-text data tag and a post-text data tag, and the candidate text data comprises at least one of candidate pre-text data and candidate post-text data;
the pre-text data tag corresponds to the candidate pre-text data, and the post-text data tag corresponds to the candidate post-text data.
6. The method of claim 5, wherein each text block tag further comprises address information and encoding information;
the determining, according to the at least one text block tag, candidate text data respectively corresponding to each text block tag comprises:
determining text files respectively corresponding to the text block tags according to the address information;
parsing the text files according to the encoding information to obtain a parsed text file corresponding to each text block tag;
determining the candidate pre-text data according to the pre-text data tag and the parsed text file; and
determining the candidate post-text data according to the post-text data tag and the parsed text file.
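A minimal sketch of the lookup in claim 6 follows. It assumes, purely for illustration, that the address information is a local file path, that the encoding information is a codec name Python understands, and that the pre-text and post-text data tags each carry a marker string: the text before the first marker is treated as candidate pre-text data and the text after the second marker as candidate post-text data. None of these conventions is stated in the claims.

```python
from pathlib import Path


def load_candidates(address, encoding, pre_marker, post_marker):
    """Read a file by address, decode it, and split out pre/post candidates.

    Returns (candidate_pre_text, candidate_post_text). If a marker is
    missing, str.partition leaves the corresponding remainder empty.
    """
    # Decode explicitly so the declared encoding, not the platform
    # default, determines how the bytes are interpreted.
    text = Path(address).read_bytes().decode(encoding)
    pre_part, _, rest = text.partition(pre_marker)
    _, _, post_part = rest.partition(post_marker)
    return pre_part, post_part
```

Decoding from raw bytes also avoids newline translation, so offsets computed on the parsed text match the file contents.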
7. The method according to claim 5 or 6, wherein the preset rule comprises at least one of a pre-rule and a post-rule, the pre-rule being used for processing the candidate pre-text data, the post-rule being used for processing the candidate post-text data;
the processing the candidate text data according to the preset rule corresponding to each text block tag to obtain text data to be quoted comprises:
filtering the candidate pre-text data according to the pre-rule to obtain pre-text data to be quoted; and
filtering the candidate post-text data according to the post-rule to obtain post-text data to be quoted.
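The filtering in claim 7 could look like the sketch below. It assumes, hypothetically, that a pre-rule or post-rule is a regular expression and that "filtering" means dropping every line the expression matches; the claims leave the form of the rules open.

```python
import re


def filter_lines(text, drop_pattern):
    """Remove every line of text that matches drop_pattern."""
    pattern = re.compile(drop_pattern)
    return "\n".join(
        line for line in text.splitlines() if not pattern.search(line)
    )


def apply_rules(candidate_pre, candidate_post, pre_rule=None, post_rule=None):
    """Apply the pre-rule and post-rule independently; None means no filtering."""
    pre = filter_lines(candidate_pre, pre_rule) if pre_rule else candidate_pre
    post = filter_lines(candidate_post, post_rule) if post_rule else candidate_post
    return pre, post
```

Because the two rules are applied independently, a text block that carries only one of them still passes through unchanged on the other side.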
8. The method of claim 1, wherein the multi-text markup language tags further comprise a file tag comprising a name sub-tag, a suffix sub-tag, a permission sub-tag, and a path sub-tag, the method further comprising:
generating a text file according to the processed text data, the name sub-tag, the suffix sub-tag, the permission sub-tag and the path sub-tag, wherein the name sub-tag corresponds to the name of the text file, the suffix sub-tag corresponds to the suffix of the text file, the permission sub-tag corresponds to the access permission of the text file, and the path sub-tag corresponds to the location at which the text file is generated.
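File generation as described in claim 8 can be sketched as follows, assuming a POSIX file system and assuming, hypothetically, that the permission sub-tag holds an octal mode string such as "640". The function name and parameters are illustrative only.

```python
import os
from pathlib import Path


def generate_file(processed_text, name, suffix, permission, path,
                  encoding="utf-8"):
    """Write processed_text to <path>/<name>.<suffix> with the given mode."""
    target_dir = Path(path)                       # path sub-tag
    target_dir.mkdir(parents=True, exist_ok=True)
    # name sub-tag + suffix sub-tag; tolerate a suffix given as ".txt".
    target = target_dir / f"{name}.{suffix.lstrip('.')}"
    target.write_text(processed_text, encoding=encoding)
    # permission sub-tag, interpreted as an octal string (assumption).
    os.chmod(target, int(permission, 8))
    return target
```

Setting the mode with `os.chmod` after writing, rather than relying on the creation mode, keeps the result independent of the process umask.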
9. A text data processing apparatus comprising:
a first determining module, configured to, in response to receiving a text data processing request, determine multi-text markup language tags according to to-be-processed text data in the text data processing request, wherein the to-be-processed text data is in an extensible markup language format, the multi-text markup language tags comprise a text block group tag, the text block group tag comprises at least one text block tag, each text block tag in the at least one text block tag comprises a to-be-quoted text data tag and a rule tag, and the rule tag corresponds to a preset rule;
a second determining module, configured to determine, according to the at least one text block tag, candidate text data respectively corresponding to each text block tag;
a processing module, configured to process the candidate text data according to the preset rule corresponding to each text block tag to obtain text data to be quoted; and
a third determining module, configured to determine processed text data according to the to-be-processed text data and the text data to be quoted corresponding to each text block tag.
10. An electronic device, comprising:
one or more processors;
a memory for storing one or more instructions,
wherein the one or more instructions, when executed by the one or more processors, cause the one or more processors to implement the method of any one of claims 1-8.
11. A computer readable storage medium having stored thereon executable instructions which, when executed by a processor, cause the processor to carry out the method of any one of claims 1 to 8.
12. A computer program product comprising computer executable instructions for implementing the method of any one of claims 1 to 8 when executed.
CN202211133993.0A 2022-09-16 2022-09-16 Text data processing method and device, electronic equipment and readable storage medium Pending CN115438632A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211133993.0A CN115438632A (en) 2022-09-16 2022-09-16 Text data processing method and device, electronic equipment and readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211133993.0A CN115438632A (en) 2022-09-16 2022-09-16 Text data processing method and device, electronic equipment and readable storage medium

Publications (1)

Publication Number Publication Date
CN115438632A true CN115438632A (en) 2022-12-06

Family

ID=84249051

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211133993.0A Pending CN115438632A (en) 2022-09-16 2022-09-16 Text data processing method and device, electronic equipment and readable storage medium

Country Status (1)

Country Link
CN (1) CN115438632A (en)

Similar Documents

Publication Publication Date Title
US11563674B2 (en) Content based routing method and apparatus
CN109359194B (en) Method and apparatus for predicting information categories
US7865481B2 (en) Changing documents to include changes made to schemas
CN112463729B (en) Data file warehousing method and device, electronic equipment and medium
CN111897842A (en) Data checking method, device, computer system and medium
CN111125107A (en) Data processing method, device, electronic equipment and medium
CN113760729A (en) Code detection method and device
CN111770128B (en) Message management method and device
CN115599386A (en) Code generation method, device, equipment and storage medium
CN113515271A (en) Service code generation method and device, electronic equipment and readable storage medium
CN112559024A (en) Method and device for generating transaction code change list
CN113778725A (en) Data verification method and device
CN115640279A (en) Method and device for constructing data blood relationship
CN115438632A (en) Text data processing method and device, electronic equipment and readable storage medium
CN113918525A (en) Data exchange scheduling method, system, electronic device, medium, and program product
CN112783903B (en) Method and device for generating update log
CN113111650A (en) Text processing method, device, system and storage medium
CN113377376A (en) Data packet generation method, data packet generation device, electronic device, and storage medium
CN113535565A (en) Interface use case generation method, device, equipment, medium and program product
CN109067726B (en) Identification method and device for station building system, electronic equipment and storage medium
CN116382703B (en) Software package generation method, code development method and device, electronic equipment and medium
CN114817007A (en) Information processing method and device, electronic equipment and computer readable storage medium
CN115687513A (en) Data processing method and device, electronic equipment and computer readable storage medium
CN114691527A (en) Unit test case generation method, apparatus, device, medium, and program product
CN113946517A (en) Abnormal data determination method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination