CN117407536A - Screening method and device for audit project applicability - Google Patents

Screening method and device for audit project applicability Download PDF

Info

Publication number
CN117407536A
CN117407536A CN202311382276.6A CN202311382276A CN117407536A CN 117407536 A CN117407536 A CN 117407536A CN 202311382276 A CN202311382276 A CN 202311382276A CN 117407536 A CN117407536 A CN 117407536A
Authority
CN
China
Prior art keywords
sequence
texts
audit
text
version
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311382276.6A
Other languages
Chinese (zh)
Inventor
周婷婷
那崇宁
胡汉一
吴迪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Lab
Original Assignee
Zhejiang Lab
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang Lab filed Critical Zhejiang Lab
Priority to CN202311382276.6A priority Critical patent/CN117407536A/en
Publication of CN117407536A publication Critical patent/CN117407536A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • G06F16/316Indexing structures
    • G06F16/325Hash tables
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • G06Q10/103Workflow collaboration or project management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Resources & Organizations (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Software Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computational Linguistics (AREA)
  • Animal Behavior & Ethology (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The application relates to a screening method and device of an audit project applicability system, wherein the method comprises the following steps: generating a hash table based on the acquired system configuration information of the system texts, wherein the system configuration information comprises system names, effective dates and storage paths of the system texts; clustering the system names of the system texts to obtain at least one system sequence, wherein each system sequence comprises at least one system text; generating a system key point map based on each system sequence and corresponding system version information, wherein the system version information is acquired based on the effective date of each system text corresponding to the system sequence in the hash table; and screening the system version information corresponding to each system sequence in the system key point map based on the audit time range of the audit item to obtain the system version applicable to the audit item, so that map screening and display transformed along with the audit time are realized, and the audit efficiency is improved.

Description

Screening method and device for audit project applicability
Technical Field
The application relates to the technical field of knowledge maps, in particular to a screening method and a screening device for an audit project applicability system.
Background
Auditing is an important task that enterprises and institutions carry out before and after major projects and financial collection. During the auditing work, the auditing personnel need to supervise and audit the legitimacy, compliance, authenticity and correctness of the management activities, financial balance and other aspects of the audited units according to the national laws and regulations, auditing rules, provincial and local regulations, company regulations and other files. In the process of developing audit projects, the audit records, audit manuscripts and audit reports are compiled by searching for positioning audit system basis, and especially, when the historical audit records are checked, the corresponding historical audit system basis is also required to be searched. However, because the data volume of the audit system file is large and the audit system file is updated frequently, it is difficult for auditors to quickly and accurately locate the historical audit system file matched with the audit time of the audit project in a large-scale historical audit system library, so that the working efficiency of the auditors is low.
Disclosure of Invention
The embodiment provides a screening method and a screening device for an audit project applicability system, which are used for solving the problem that in the related technology, historical audit system files matched with audit time of an audit project are difficult to quickly and accurately locate, so that the working efficiency of auditors is low.
In a first aspect, in this embodiment, there is provided a screening method of an audit project applicability system, where the method includes:
generating a hash table based on the acquired system configuration information of the system texts, wherein the system configuration information comprises system names, effective dates and storage paths of the system texts;
clustering the system names of the system texts to obtain at least one system sequence, wherein each system sequence comprises at least one system text;
generating a system key point map based on each system sequence and corresponding system version information, wherein the system version information is acquired based on the effective date of each system text corresponding to the system sequence in the hash table;
and screening the system version information corresponding to each system sequence in the system key point map based on the auditing time range of the auditing project to obtain the system version applicable to the auditing project.
In some embodiments, the screening the system version information corresponding to each system sequence in the system key point map based on the audit time range of the audit item, to obtain a system version applicable to the audit item includes:
The effective dates of the plurality of system texts are arranged in an ascending order, a ranking table corresponding to the effective dates and the sequence numbers of the plurality of system texts is obtained, and the sequence numbers are used for identifying the numbers of the system sequences corresponding to the system texts effective on the effective dates;
determining a corresponding effective date interval in the sorting table based on the audit time range of the audit item;
acquiring a corresponding sequence number interval in the sorting table based on the effective date interval;
and determining a system version applicable to the audit item based on each sequence number in the sequence number interval and a sequence version update table corresponding to the sequence number acquired in advance.
In some embodiments, the step of arranging the effective dates of the plurality of system texts in an ascending order, and the step of obtaining the ordered list of the one-to-one correspondence between the effective dates and the sequence numbers of the plurality of system texts includes:
the effective dates of the plurality of system texts are arranged in ascending order, and an effective date ranking table of the plurality of system texts is obtained;
and acquiring the sequence numbers of the system texts corresponding to the effective dates based on the hash table, and generating a corresponding sequence number sorting table.
In some embodiments, the clustering the system names of the plurality of system texts, and obtaining at least one system sequence includes:
based on the system names of the system texts, sentence vectors corresponding to the system names are obtained;
clustering operation is carried out on the sentence vectors, and at least one system sequence is obtained;
based on the hash table, acquiring a system name and a system number of each system text corresponding to the system sequence;
and adding a sequence number corresponding to the system number into the hash table based on the system number of each system text corresponding to the system sequence.
In some embodiments, the generating the system gist map based on each system sequence and the corresponding system version information includes:
acquiring keywords corresponding to the system sequence based on the system names corresponding to the system sequence;
based on the hash table, acquiring the effective date and the system number of each system text corresponding to the system sequence;
based on the ascending arrangement of the effective dates, the system numbers of the system texts corresponding to the effective dates are correspondingly arranged, and a sequence version update table corresponding to the system sequences is generated;
And generating a system key point map based on the key words and the sequence version updating table corresponding to each system sequence.
In some embodiments, the generating the system gist map based on each system sequence and the corresponding system version information further includes:
sequentially acquiring the plurality of system texts based on the storage paths of the plurality of system texts;
extracting key sentences in the system text based on a semantic analysis algorithm;
generating item key point information of the system text based on the key sentence;
and generating a system key point map based on the key words and the sequence version updating table corresponding to each system sequence and the item key point information corresponding to each system text in the system sequence.
In some embodiments, generating the hash table based on the obtained system configuration information of the plurality of system texts includes:
based on the system names, effective dates and storage paths of the system texts, respectively generating a system name linked list, an effective date linked list and a storage path linked list, wherein the system configuration information of the system texts is formed based on values with the same serial numbers in the system name linked list, the effective date linked list and the storage path linked list;
Generating a system number linked list based on the length of the system name linked list;
and generating a hash table by taking the system number in the system number linked list as a key and the corresponding system configuration information as a value.
In some embodiments, after clustering the system names of the plurality of system texts to obtain at least one system sequence, the method further includes:
displaying the system text corresponding to the system sequence, and acquiring correction information corresponding to the system text;
correcting key words and item key point information of the system text based on correction information corresponding to the system text;
and generating a system key point map based on the corrected key words and the item key point information.
In some of these embodiments, after the deriving the institutional version for which the audit item is applicable, the method further comprises:
based on a plurality of audit time ranges of the audit item, respectively displaying the system version nodes corresponding to the audit time ranges in the system key point map through a plurality of colors.
In a second aspect, in this embodiment, there is provided a screening apparatus for an audit project applicability system, the apparatus including:
The first generation module is used for generating a hash table based on the acquired system configuration information of the system texts, wherein the system configuration information comprises system names, effective dates and storage paths of the system texts;
the clustering module is used for clustering the system names of the system texts to obtain at least one system sequence, and each system sequence comprises at least one system text;
the second generation module is used for generating a system key point map based on each system sequence and corresponding system version information, and the system version information is acquired based on the effective date of the system text;
and the screening module is used for screening the system version information corresponding to each system sequence in the system key point map based on the auditing time range of the auditing project to obtain the system version applicable to the auditing project.
Compared with the related art, the screening method of the audit project applicable system provided in the embodiment generates a hash table based on the acquired system configuration information of the system texts to acquire the system names, effective dates and efficient storage and query modes of the storage paths of the system texts; the method comprises the steps of clustering the system names of a plurality of system texts to obtain at least one system sequence, wherein each system sequence comprises at least one system text, namely, the system classification of the system texts is realized by carrying out semantic recognition on the system names; generating a system key point map based on each system sequence and corresponding system version information, and constructing a time iteration relation of a system text under the same system; and screening the system version information corresponding to each system sequence in the system key point map based on the audit time range of the audit item to obtain a system version suitable for the audit item, so that map screening and display transformed along with the audit time are realized, the historical audit system version matched with the audit time is rapidly and accurately positioned, and the audit efficiency is improved.
The details of one or more embodiments of the application are set forth in the accompanying drawings and the description below to provide a more thorough understanding of the other features, objects, and advantages of the application.
Drawings
The accompanying drawings, which are included to provide a further understanding of the application and are incorporated in and constitute a part of this application, illustrate embodiments of the application and together with the description serve to explain the application and do not constitute an undue limitation to the application. In the drawings:
FIG. 1 is a block diagram of the computer hardware architecture of a screening method of audit project applicable regimes of some embodiments of the present application;
FIG. 2 is a flow chart of a screening method of audit project applicability according to some embodiments of the present application;
FIG. 3 is a flow chart of determining applicable regime versions based on audit timeframes according to some embodiments of the present application;
FIG. 4 is a flow chart of an ordered list of effective dates and sequence numbers corresponding to acquisition regime text according to some embodiments of the present application;
FIG. 5 is a flow chart of a system sequence for cluster acquisition based on system names in some embodiments of the present application;
FIG. 6 is a flow chart of generating a system gist map according to some embodiments of the present application;
FIG. 7 is a flow chart of generating a system gist map according to further embodiments of the present application;
FIG. 8 is a flow chart of generating a hash table based on institutional configuration information in accordance with some embodiments of the present application;
FIG. 9 is a flow chart of generating a system point map based on correction information according to some embodiments of the present application;
FIG. 10 is a flow chart of a screening method for audit item applicable regimes in accordance with some preferred embodiments of the present application;
fig. 11 is a block diagram of a screening apparatus for audit project applicable regimes according to some embodiments of the present application.
Detailed Description
For a clearer understanding of the objects, technical solutions and advantages of the present application, the present application is described and illustrated below with reference to the accompanying drawings and examples. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the present application.
Unless defined otherwise, technical or scientific terms used herein shall have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs. The terms "a," "an," "the," "these," and the like in this application are not intended to be limiting in number, but rather are singular or plural. The terms "comprising," "including," "having," and any variations thereof, as used in the present application, are intended to cover a non-exclusive inclusion; for example, a process, method, and system, article, or apparatus that comprises a list of steps or modules (units) is not limited to the list of steps or modules (units), but may include other steps or modules (units) not listed or inherent to such process, method, article, or apparatus. The terms "connected," "coupled," and the like in this application are not limited to physical or mechanical connections, but may include electrical connections, whether direct or indirect. Reference to "a plurality" in this application means two or more. "and/or" describes an association relationship of an association object, meaning that there may be three relationships, e.g., "a and/or B" may mean: a exists alone, A and B exist together, and B exists alone. Typically, the character "/" indicates that the associated object is an "or" relationship. The terms "first," "second," "third," and the like, as referred to in this application, merely distinguish similar objects and do not represent a particular ordering of objects.
The screening method of the audit project applicability system provided by the embodiment of the application can be executed in a server, a computer or a similar computing device. When the method is applied to a computer, fig. 1 is a block diagram of a hardware structure of a computer of a screening method of an audit item applicability system according to some embodiments of the present application. As shown in fig. 1, the computer may include one or more (only one is shown in fig. 1) processors 102 and a memory 104 for storing data, wherein the processors 102 may include, but are not limited to, a central processing unit CPU, a microprocessor MCU, a programmable logic device FPGA, or the like. The computer may also include a transmission device 106 for communication functions and an input-output device 108. It will be appreciated by those of ordinary skill in the art that the configuration shown in FIG. 1 is merely illustrative and is not intended to limit the configuration of the computer described above. For example, the computer may also include more or fewer components than shown in FIG. 1, or have a different configuration than shown in FIG. 1.
The memory 104 may be used to store a computer program, for example, a software program of application software and a module, such as a computer program corresponding to a screening method of an audit item applicability in the present embodiment, and the processor 102 executes the computer program stored in the memory 104 to perform various functional applications and data processing, that is, to implement the above-described method. Memory 104 may include high-speed random access memory, and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some embodiments, the memory 104 may further include memory remotely located relative to the processor 102, which may be connected to the terminal via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
In this embodiment, a screening method of an audit item applicable system is provided, and fig. 2 is a flowchart of a screening method of an audit item applicable system according to some embodiments of the present application, as shown in fig. 2, where the flowchart includes the following steps:
step S201, a hash table is generated based on the acquired system configuration information of the system texts, wherein the system configuration information comprises system names, effective dates and storage paths of the system texts.
The system text refers to the entire content of the regulations established by the organization for the canonical business operation. The system text includes a system name and system content. The system text has an effective date attribute, and when the system text content is updated, the effective date is updated synchronously. Thus, the validity time of the system text is from the validation date of the system text to the validation date of the next version of the system text.
In this embodiment, the system configuration information of the system text may be obtained by reading the system information configuration file. The system information configuration file may include a first field, a second field, and a third field, which respectively correspond to a system name, an effective date, and a storage path of the system text. Each system text corresponds to a system name, an effective date and a storage path.
And (3) sequentially storing the system configuration information of all the system texts according to the types by reading the system configuration information file, wherein the data storage sequences of different types corresponding to the same system text are consistent.
In a further embodiment, the system configuration information may be stored in the form of a linked list. For example, three linked lists are used to store the system name, the effective date and the storage path respectively, and the system name, the effective date and the serial number of the storage path corresponding to the same system text are the same in the linked list.
In addition to linked lists, other data structures such as data structures may be used to implement sequential storage.
According to the ordering of the system configuration information of the system texts in the storage sequence, a corresponding unique identifier can be generated for each system text. The identification may be a unique number or ID of the institutional text. The number may be stored using a storage means of the schedule configuration information. For example, a linked list store may be used, known as a system number linked list.
And generating a hash table according to the system name, the effective date, the storage path and the number of the system text, wherein the hash table can be used for inquiring system configuration information corresponding to each system text.
Step S202, clustering the system names of the system texts to obtain at least one system sequence, wherein each system sequence comprises at least one system text.
For the system texts with huge scale and large quantity, the system texts can be classified through clustering of semantic analysis, so that different system sequences can be obtained. The system sequence means that a plurality of system texts have approximately the same or similar regulation contents, and in this case, a plurality of similar system texts may be classified into the same system sequence. The clustering method can be to cluster the system names of the system texts by adopting the existing K-means, DBSCAN, OPTICS algorithm and the like. Multiple institutional texts in the same institutional sequence may have different effective dates. Each of the system sequences has a corresponding sequence number that can be stored in a hash table as an attribute of all system text in the system sequence.
And step S203, generating a system key point map based on each system sequence and corresponding system version information, wherein the system version information is acquired based on the effective date of each system text corresponding to the system sequence in the hash table.
According to the effective dates of a plurality of system texts in the same system sequence, the time arrangement sequence of each system text in the system sequence according to the effective dates can be obtained, and version information corresponding to the system sequence is generated according to the arrangement sequence.
And generating a system key point map according to each system sequence generated by clustering and the corresponding system version information. The system key map may include names of system sequences and corresponding system version information.
Further, the main content or abstract of each version of the system text can be included.
And step S204, screening the system version information corresponding to each system sequence in the system key point map based on the auditing time range of the auditing project to obtain the system version applicable to the auditing project.
And determining a time range corresponding to the audit item according to the audit requirement of the audit item, and acquiring the system version information corresponding to the time range in each system sequence from the system key point map according to the time range.
Furthermore, a system content abstract corresponding to the system version information can be obtained and is subjected to key display, so that auditors can conveniently and quickly obtain system text content applicable to the audit item.
Through the steps S201 to S204, a hash table is generated based on the acquired system configuration information of the system texts, and the efficient storage and query modes of the system names, the effective dates and the storage paths of the system texts are obtained; the method comprises the steps of clustering the system names of a plurality of system texts to obtain at least one system sequence, wherein each system sequence comprises at least one system text, namely, the system classification of the system texts is realized by carrying out semantic recognition on the system names; generating a system key point map based on each system sequence and corresponding system version information, and constructing a time iteration relation of a system text under the same system; and screening the system version information corresponding to each system sequence in the system key point map based on the audit time range of the audit item to obtain a system version suitable for the audit item, so that map screening and display transformed along with the audit time are realized, the historical audit system version matched with the audit time is rapidly and accurately positioned, and the audit efficiency is improved.
In some embodiments, fig. 3 is a flow chart of determining applicable regime versions based on audit timeframes according to some embodiments of the present application, as shown in fig. 3, the flow comprising the steps of:
step S301, the effective dates of the plurality of system texts are arranged in an ascending order, a ranking table of one-to-one correspondence between the effective dates and sequence numbers of the plurality of system texts is obtained, and the sequence numbers are used for identifying the numbers of the system sequences corresponding to the system texts effective on the effective dates.
The sequence number is a number for the system sequence generated in step S202, and may be stored in a hash table as an attribute of each system text in the system sequence.
And (3) arranging the effective dates of the plurality of system texts in an ascending order, and acquiring a ranking table of which the effective dates and the sequence numbers are in one-to-one correspondence according to the corresponding relation among the effective dates, the system numbers and the sequence numbers of the same system text in the hash table. In the ordering table, the effective dates and sequence numbers of the same system text are correspondingly stored, and are arranged according to the ascending order of the effective dates of the system text.
Specifically, the ordered effective date chain list and the ordered system number chain list can be obtained by arranging the effective date chain list and the system number chain list of the system text according to the ascending order of effective dates. And then acquiring a sequence number linked list corresponding to the system number linked list according to the corresponding relation between the system number and the sequence number of the same system text in the hash table.
Step S302, based on the audit time range of the audit item, determining a corresponding effective date interval in the ordered list.
Specifically, according to the audit time range corresponding to the audit object in the audit item, comparing with the ordered effective date linked list in step S301, and obtaining the effective date section overlapped with the audit time in the effective date linked list.
Step S303, based on the effective date interval, a corresponding sequence number interval in the sorting table is obtained.
And correspondingly finding a sequence number interval in the sequencing table according to the effective date interval. And the sequence numbers in the sequence number interval indicate that the system sequence is updated in version in the auditing time range.
Step S304, determining the applicable system version of the audit item based on each sequence number in the sequence number interval and a sequence version update table corresponding to the pre-acquired sequence number.
The sequence version updating table is a sorting table obtained by sorting the system numbers of the system texts of the same system sequence according to the ascending order of the effective date, and can be stored in a linked list mode.
Specifically, the number m of occurrences of the system sequence from the beginning to the sequence number interval in the whole sequence number linked list can be obtained through forward summation, and according to the number m of occurrences, a corresponding system number (the number m of the system number in the sequence version update table) is found in the sequence version update table corresponding to the sequence number, and the system text corresponding to the system number is the system version applicable to the audit item.
In other embodiments, the same sequence number may occur n times within the sequence number interval, n >1, indicating that the institutional sequence is subject to multiple version upgrades over the audit time frame. Likewise, the number m of occurrences of the system sequence in the entire sequence number list from the start to the first occurrence in the sequence number interval may be obtained by forward summing. And according to the occurrence times m, finding out the corresponding system number (the system number is m in the sequence version updating table) in the sequence version updating table corresponding to the sequence number, and sequentially obtaining n system numbers from the sequence version updating table from the system number, wherein the system text corresponding to the n system numbers is the system version applicable to the audit item.
Through the steps S301 to S304, by arranging the effective dates of the plurality of system texts in an ascending order, a ranking table corresponding to the effective dates and the sequence numbers of the plurality of system texts is obtained, and the sequence numbers of the system texts arranged in the ascending order according to the effective dates are obtained; determining a corresponding effective date interval in the ordered list by determining an audit time range based on the audit item, and determining a time interval in which version change occurs in the audit time range; acquiring a corresponding sequence number interval in the sequencing table based on the effective date interval, and determining a system sequence with version change in the auditing time range; the system version suitable for the auditing project is determined based on each sequence number in the sequence number interval and a sequence version update table corresponding to the sequence number acquired in advance, and the iterative relation of the system text is identified by using a state machine technology, so that the history auditing system version matched with the auditing time is rapidly and accurately positioned according to the transformation of the auditing time, and the auditing efficiency is improved.
In some embodiments, fig. 4 is a flowchart of a corresponding ordered list of effective dates and sequence numbers of the acquisition regimen text of some embodiments of the present application, as shown in fig. 4, the flowchart comprising the steps of:
and S401, arranging the effective dates of the plurality of system texts in an ascending order, and acquiring an effective date ranking table of the plurality of system texts.
Specifically, the ordered effective date linked list can be obtained by arranging the effective date linked list of the system text according to the ascending order of the effective dates. Other manners of ordering the validation dates may also be employed.
Step S402, based on the hash table, the sequence numbers of the system texts corresponding to the effective dates are obtained, and a corresponding sequence number sorting table is generated.
The hash table stores the corresponding relation among the system name, the effective date, the system number and the sequence number of the system text. According to the ascending order of the effective dates, an ordering table corresponding to the effective dates and the sequence numbers of the system texts one by one can be obtained through the hash table.
Specifically, a sequence number linked list corresponding to the effective date linked list may be generated according to the corresponding relationship between the effective date, the system number and the sequence number of the same system text in the hash table.
Through the steps S401 to S402, the effective dates of the plurality of system texts are arranged in ascending order, so that an effective date sorting table of the plurality of system texts is obtained, and the follow-up comparison with the audit time range is facilitated; and acquiring the sequence numbers of the system texts corresponding to the effective dates based on the hash table, generating a corresponding sequence number sorting table, acquiring the time sequence of version change of the system sequences, and providing necessary conditions for acquiring the version information of the system sequences.
In some embodiments, fig. 5 is a flowchart of a system sequence for cluster acquisition based on system names according to some embodiments of the present application, as shown in fig. 5, the flowchart including the steps of:
step S501, sentence vectors corresponding to the system names are obtained based on the system names of the plurality of system texts.
In some embodiments, the system names of the system text may be segmented using existing mature chinese segmentation tools. Specifically, a plurality of system name character strings can be obtained from the system name linked list, and each system name character string is segmented.
Then, based on the Chinese pre-training word vector trained by the large-scale Chinese corpus data, corresponding word vectors can be obtained for the word segmentation corresponding to each system name character string; and then, an unsupervised learning algorithm (such as an addition method, an averaging method, a TF-IDF weighted averaging method, an SIF embedding method and the like) is utilized to obtain sentence vectors corresponding to each system name character string.
Step S502, clustering operation is carried out on sentence vectors, and at least one system sequence is obtained.
And clustering the sentence vectors corresponding to each system name character string by using an unsupervised clustering algorithm (such as K-means, DBSCAN, OPTICS algorithm) to obtain at least one system sequence. The number of the system sequences can be expressed by a cluster_count.
Specifically, each system sequence can correspond to a linked list Cluster formed by system name character strings i RegName List, wherein i is the sequence number of the system sequence; a link list Cluster can also be generated i Chain table Cluster corresponding to regName_List and used for storing positions of system name character strings in system name chain table i _Index_List。
Step S503, based on the hash table, the system name and the system number of each system text corresponding to the system sequence are obtained.
Sequentially acquiring Cluster i Each element value in the Index List table, the corresponding position system number is obtained from the system number List table according to the element value (because the position in the system name List table corresponding to the same system text is consistent with the position in the system number List table), and the system numbers obtained in sequence are assembled into the system number List Cluster corresponding to the system sequence i _RegId_List。
Step S504, based on the system number of each system text corresponding to the system sequence, adding the sequence number corresponding to the system number in the hash table.
Cluster with system number list i The system number in RegId List is the Key (Key) of hash table, the corresponding Value (Value) is obtained, the Value is updated, and the sequence number i of the system sequence corresponding to the system number is added in the hash table.
Through the steps S501 to S504, sentence vectors corresponding to the system names are obtained based on the system names of the plurality of system texts, and characters of the system names are converted into the corresponding sentence vectors to perform clustering operation; the method comprises the steps of performing clustering operation on sentence vectors to obtain at least one system sequence, classifying system texts based on semantic analysis of system names, and constructing a system management system from nothing to nothing; acquiring a system name and a system number of each system text corresponding to the system sequence based on a hash table, and preparing data for generating a key point map based on the system sequence subsequently; and adding a sequence number corresponding to the system number into the hash table based on the system number of each system text corresponding to the system sequence, establishing a corresponding relation between the system number and the sequence number, and improving the searching efficiency of the follow-up system version information.
In some embodiments, fig. 6 is a flowchart of generating a system gist map according to some embodiments of the present application, as shown in fig. 6, the flowchart includes the following steps:
step S601, obtaining keywords corresponding to the system sequences based on the system names corresponding to the system sequences.
In some embodiments, the linked list Cluster composed of the system name character string generated in the step S502 can be i Each system name character string of RegName List is processed by word segmentation to generate word segmentation sets, and the maximum intersection of the word segmentation sets of each system name character string is calculated to be used as a keyword Cluster of the system sequence i i The key words can be used for identifying corresponding system sequences in the system key point map.
Step S602, based on the hash table, obtaining the effective date and the system number of each system text corresponding to the system sequence.
The system number linked list Cluster corresponding to the system sequence i generated in the step S503 i And (3) acquiring a system number corresponding to each system sequence, and inquiring the hash table according to the system number to obtain a corresponding effective date.
Step S603, based on the ascending arrangement of the effective dates, the system numbers of the system texts corresponding to the effective dates are correspondingly arranged, and a sequence version update table corresponding to the system sequences is generated.
Arranging the effective dates corresponding to the system numbers in the system sequence i according to an ascending order, arranging the system numbers and the effective dates correspondingly, and generating a sequenced system number linked list Cluster i The List is updated by the sequence version corresponding to the system sequence i.
And step S604, updating a table based on the keywords and the sequence versions corresponding to each system sequence, and generating a system key point map.
The system key map may include a system sequence node and a version information node, which are connected to each other to reflect a correspondence between the two. The same institutional sequence node may correspond to one or more version information nodes. And for each system sequence, using the corresponding keywords as the names of system sequence nodes, and using the sequence of each system number in the system sequence in the corresponding sequence version updating table as the names of version information nodes to generate a system key point map.
Further, the system key point map can further comprise item nodes, and the item nodes are used for displaying key items of the system text corresponding to a certain version information node, so that auditors can quickly acquire the content of the system text. The same version information node may correspond to one or more entry nodes. The entry nodes are interconnected with the corresponding version information nodes.
Through the steps S601-S604, key words corresponding to the system sequence are obtained based on the system names corresponding to the system sequence, so that main functions and application ranges of the system sequence are displayed; acquiring effective dates and system numbers of the system texts corresponding to the system sequences based on the hash table so as to sort the versions; the system numbers of the system texts corresponding to the effective dates are correspondingly arranged through ascending arrangement based on the effective dates, a sequence version update table corresponding to the system sequences is generated, and the version update sequence of the system sequences is obtained; and generating a system key point map based on key words and sequence version update tables corresponding to each system sequence, realizing static display of version information based on the system sequence, realizing dynamic screening and display of maps transformed along with audit time by combining a state machine technology, and improving audit efficiency.
In some embodiments, fig. 7 is a flowchart of generating a system gist map according to other embodiments of the present application, as shown in fig. 7, the flowchart includes the following steps:
step S701, sequentially acquiring a plurality of system texts based on the storage paths of the plurality of system texts.
The institutional text may be unstructured data, and each institutional text may correspond to a different storage path. The storage path may be stored through a storage path linked list. The corresponding storage path can be obtained by sequentially accessing the storage path linked list, and the storage path obtaining system text is accessed.
Step S702, extracting key sentences in the system text based on a semantic analysis algorithm.
In some embodiments, key sentences in the system text may be extracted by:
step S7021, analyzing the system text, wherein the analysis method comprises sentence segmentation of the system text by using separator;
step S7022, word segmentation is carried out on each sentence cut out by the system text, and word vectors corresponding to the word segmentation are obtained by utilizing the pre-training word vectors;
step S7023, obtaining a sentence vector of the sentence based on an average value of word vectors of each word in the sentence cut out by the system text;
step S7023, calculating the similarity between sentences based on sentence vectors of all sentences in the system text, and storing the similarity as an adjacent matrix;
step S7024, obtaining the score of each sentence based on an iterative textRank algorithm;
in step S7025, the sentences scored by Top-k are taken as the key sentences of the system text, where k is the number of sentences in the system text.
The steps are used for extracting the main content of the system text and filtering sentences with similar content. The key sentence may include summary information for the system text.
In step S703, entry point information of the system text is generated based on the key sentence.
The item gist information for each system text may be one or more pieces.
Step S704, a system key point map is generated based on the key words and the sequence version updating table corresponding to each system sequence and the item key point information corresponding to each system text in the system sequence.
And generating a corresponding item node number in the degree gist map according to the item gist information number, wherein the same version of system text can correspond to one or more item nodes. The entry nodes are interconnected with the corresponding version information nodes.
Through the steps S701 to S704, a plurality of system texts are sequentially acquired through storage paths based on the plurality of system texts, and all contents of the system texts are acquired, so that the content integrity of the system gist map is improved; extracting key sentences in the system texts based on a semantic analysis algorithm to obtain abstract information of each system text; generating item key point information of a system text for display based on the key sentences; the key points of the system text are conveniently and efficiently acquired by a user by generating a system key point map based on key words and sequence version updating tables corresponding to each system sequence and item key point information corresponding to each system text in the system sequence and further generating personalized data items while displaying the system sequence and the version information, so that auditing efficiency is improved.
In some embodiments, fig. 8 is a flowchart of generating a hash table based on institutional configuration information according to some embodiments of the present application, as shown in fig. 8, the flowchart including the steps of:
in step S801, a system name list, an effective date list, and a storage path list are generated based on the system names, effective dates, and storage paths of the plurality of system texts, respectively, and system configuration information of the system texts is formed based on values with the same serial numbers in the system name list, the effective date list, and the storage path list.
In one embodiment, three field names reg_name, valid_date, reg_path may be respectively configured, corresponding to the system Name, validation Date, and storage Path of the system configuration information. And reading the numerical values corresponding to the three fields in a linked list mode, wherein the system configuration information corresponding to each system text consists of a system name linked list, an effective date linked list and values with the same serial numbers in a storage path linked list.
Further, in the process of generating the system name linked list, the effective date linked list and the stored path linked list, the value of the system configuration information can be checked. The verification method may include:
checking whether the value of the system configuration information meets the non-null value requirement;
And checking whether the storage path in the system configuration information is a valid file path.
If the result of any check is no, the system configuration information is abnormal, and a null value is returned.
Step S802, a system number linked list is generated based on the length of the system name linked list.
And according to the length of the system name linked list, generating a system number for each system name in sequence, and forming a system number linked list, wherein the length of the system number linked list is consistent with the length of the system name linked list, and the system number is used as a fourth field of the system configuration information.
In step S803, a hash table is generated with the system number in the system number list as a key and the corresponding system configuration information as a value.
And splicing the system number in the system number linked list into a linked list as a Value (Value) by using the system number in the system number linked list as a Key (Key), and generating a hash table.
Through the steps S801 to S803, by respectively generating a system name linked list, an effective date linked list and a storage path linked list based on the system names, the effective dates and the storage paths of the plurality of system texts, the association sequence storage is realized for different types of system configuration information; generating a system number linked list based on the length of the system name linked list to serve as a unique identifier of the system text; by taking the system number in the system number linked list as a key and the corresponding system configuration information as a value, a hash table is generated, a data structure for efficiently storing and inquiring the system configuration information is provided, and the efficiency of screening the system text version information is improved.
In some embodiments, fig. 9 is a flowchart of generating an institutional main map based on correction information according to some embodiments of the present application, as shown in fig. 9, the flowchart includes the steps of:
step S901, showing the system text corresponding to the system sequence, and acquiring correction information corresponding to the system text.
Specifically, a linked list Cluster composed of system name character strings corresponding to each system sequence i i RegName List and keyword Cluster i And displaying the_KeyWords_List through a front-end interface, and acquiring correction information of the user on the system name character string and the KeyWords.
Step S902, based on the correction information corresponding to the system text, the key words and the item key point information of the system text are corrected.
Receiving the correction result returned by the client, and for the linked list Cluster i RegName List and keyword Cluster i And (3) correcting the_KeyWords_List. Meanwhile, the system name in the hash table can be corrected accordingly.
Further, the item gist information of the system text can be corrected according to the correction result.
Step S903, a system gist map is generated based on the corrected keyword and the item gist information.
Through the steps S901 to S903, relevant information for displaying the system text and a manual correction mode are provided by displaying the system text corresponding to the system sequence and acquiring correction information corresponding to the system text; the key words and the item key points of the system text are corrected based on the correction information corresponding to the system text, so that the correctness of the key words and the item key points of the system text is improved; and generating a system key point map based on the corrected key words and the item key point information, so that the phenomenon that the operation of audit work is influenced due to error information in the system key point map is avoided.
In some embodiments, the method further includes the step of displaying corresponding nodes in the system point map according to the audit time range in a targeted manner, and the method includes the following steps:
based on a plurality of audit time ranges of the audit item, respectively displaying the system version nodes corresponding to the audit time ranges in the system key point map through a plurality of colors.
According to different audit time ranges of the audit item, after the corresponding system sequence, the corresponding system version node and the item node are obtained, different color identifications can be adopted to distinguish the effective system version node corresponding to the audit time range and the item node associated with the system version node. No display or a weakened display may be made for institutional version nodes and item nodes that are not active during the audit period.
According to the screening method of the audit project applicable system, the corresponding display effects of the audit time ranges, the system version nodes and the item nodes are improved by respectively displaying the system version nodes corresponding to the audit time ranges in the system key point map through multiple colors based on the audit time ranges of the audit project.
The present embodiment is described and illustrated below by way of preferred embodiments. Fig. 10 is a flow chart of a screening method for audit item applicable regimes in accordance with some preferred embodiments of the present application. As shown in fig. 10, the flow includes the steps of:
Step S1001, respectively generating a system name linked list, an effective date linked list and a stored path linked list based on system names, effective dates and stored paths of a plurality of system texts, wherein system configuration information of the system texts is formed based on values with the same serial numbers in the system name linked list, the effective date linked list and the stored path linked list;
step S1002, a system number linked list is generated based on the length of the system name linked list;
step S1003, using the system number in the system number linked list as a key and the corresponding system configuration information as a value to generate a hash table;
step S1004, based on the system names of the system texts, sentence vectors corresponding to the system names are obtained;
specifically, the sentence vector can be obtained by:
(a) The Chinese word segmentation tool jieba is utilized to segment each system name character string of the system name linked list, the Ha Gong stop word list is utilized, and the stop word removing operation is carried out on the word segmentation result;
(b) Obtaining a corresponding word vector value for the word segmentation obtained in the step (a) by using a Chinese pre-training word vector of 'text-ailab-enabling-zh-d 100-s';
(c) And obtaining sentence vectors of each system name character string of the system name linked list by using an average method.
Step S1005, performing clustering operation on sentence vectors to obtain at least one system sequence;
the clustering algorithm may be a DBSCAN algorithm.
Step S1006, based on a hash table, acquiring a system name and a system number of each system text corresponding to the system sequence;
step S1007, adding a sequence number corresponding to the system number into the hash table based on the system number of each system text corresponding to the system sequence;
step S1008, obtaining keywords corresponding to the system sequences based on the system names corresponding to the system sequences;
step S1009, showing the system text corresponding to the system sequence and obtaining the correction information corresponding to the system text;
step S1010, correcting information such as keywords, contents and names of the system text based on the correction information corresponding to the system text;
step S1011, sequentially acquiring a plurality of system texts based on the storage paths of the plurality of system texts;
step S1012, extracting key sentences in the system text based on a semantic analysis algorithm;
step S1013, generating item key point information of the system text based on the key sentence;
step S1014, based on the hash table, obtaining the effective date and the system number of each system text corresponding to the system sequence;
Step S1015, based on the ascending arrangement of the effective dates, the system numbers of the system texts corresponding to the effective dates are correspondingly arranged, and a sequence version update table corresponding to the system sequences is generated;
step S1016, generating a system key point map based on key words and sequence version updating tables corresponding to each system sequence and item key point information corresponding to each system text in the system sequence;
the key words and the item key point information can be imported into a graph database neo4j to construct a global system key point map.
Step S1017, arranging the effective dates of the plurality of system texts in an ascending order, and obtaining a ranking table of which the effective dates and the sequence numbers are in one-to-one correspondence;
step S1018, determining a corresponding effective date interval in the sorting table based on the audit time range of the audit item;
step S1019, based on the effective date interval, acquiring a corresponding sequence number interval in the sequencing table;
step S1020, determining a system version applicable to the audit item based on each sequence number in the sequence number interval and a sequence version update table corresponding to the sequence number;
and S1021, based on a plurality of audit time ranges of the audit item, respectively displaying the system version nodes corresponding to the audit time ranges in the system key point map through a plurality of colors.
Through the steps S1001 to S1021, a high-efficiency storage and query mode of the system configuration information of the system text is established through a hash table; the system classification of the system text is realized by carrying out semantic recognition on the system name; the content integrity of the system key point map is improved through the extraction of the system items; the system management system is built from scratch through the system key point map, classification of system texts under the same system is realized, and a system text time iteration relation of the same system sequence is built; the system version information corresponding to each system sequence in the system key point map is dynamically screened and displayed based on the auditing time range, so that the historical auditing system version matched with the auditing time is rapidly and accurately positioned, and the auditing efficiency is improved.
In some embodiments, the application further provides a screening device for an audit project applicable system, where the screening device for an audit project applicable system is used to implement the foregoing embodiments and preferred embodiments, and the description is omitted. The terms "module," "unit," "sub-unit," and the like as used below may refer to a combination of software and/or hardware that performs a predetermined function. In some embodiments, fig. 11 is a block diagram of a screening apparatus for audit item applicability according to the present embodiment, as shown in fig. 11, the apparatus includes:
A first generation module 1101, configured to generate a hash table based on the acquired system configuration information of the plurality of system texts, where the system configuration information includes a system name, an effective date, and a storage path of the system texts;
the clustering module 1102 is configured to cluster the system names of the plurality of system texts, and obtain at least one system sequence, where each system sequence includes at least one system text;
a second generation module 1103, configured to generate a system gist map based on each system sequence and corresponding system version information, where the system version information is acquired based on an effective date of a system text;
and the screening module 1104 is used for screening the system version information corresponding to each system sequence in the system key point map based on the auditing time range of the auditing project to obtain the system version applicable to the auditing project.
According to the screening device for the audit project applicable system of the embodiment, a hash table is generated through the first generation module 1101 based on the acquired system configuration information of the system texts, and the efficient storage and query modes of the system names, the effective dates and the storage paths of the system texts are obtained; clustering the system names of the plurality of system texts through a clustering module 1102 to obtain at least one system sequence, wherein each system sequence comprises at least one system text, namely, the system classification of the system text is realized by carrying out semantic recognition on the system names; generating a system key point map based on each system sequence and corresponding system version information through a second generation module 1103, and constructing a time iteration relation of a system text under the same system; and the screening module 1104 screens the system version information corresponding to each system sequence in the system key point map based on the audit time range of the audit project to obtain a system version applicable to the audit project, so that map screening and display transformed along with the audit time are realized, the historical audit system version matched with the audit time is rapidly and accurately positioned, and the audit efficiency is improved.
In some embodiments, the screening module includes a first acquisition sub-module, a first determination sub-module, a second acquisition sub-module, a second determination sub-module; the first obtaining submodule is used for carrying out ascending arrangement on the effective dates of the plurality of system texts, obtaining a ranking table of one-to-one correspondence between the effective dates and sequence numbers of the plurality of system texts, wherein the sequence numbers are used for identifying the numbers of the system sequences corresponding to the system texts effective on the effective dates; the first determination submodule is used for determining a corresponding effective date interval in the sorting table based on the auditing time range of the auditing project; the second acquisition sub-module is used for acquiring a corresponding sequence number interval in the sequencing table based on the effective date interval; the second determining submodule is used for determining a system version applicable to the audit item based on each sequence number in the sequence number interval and a sequence version update table corresponding to the sequence number acquired in advance.
According to the screening device for the audit project applicable system, the first acquisition submodule is used for carrying out ascending arrangement on the effective dates of the plurality of system texts, so that an ordered list corresponding to the effective dates and the sequence numbers of the plurality of system texts one by one is obtained, and the sequence numbers of the system texts which are arranged according to the ascending arrangement of the effective dates are obtained; determining a corresponding effective date interval in the ordered list based on the auditing time range of the auditing project through a first determining submodule, and determining a time interval in which version change occurs in the auditing time range; acquiring a corresponding sequence number interval in the sequencing table based on the effective date interval through a second acquisition sub-module, and determining a system sequence with version change in the auditing time range; and determining a system version applicable to the audit item based on each sequence number in the sequence number interval and a sequence version update table corresponding to the sequence number acquired in advance by a second determination submodule, and identifying an iteration relation of a system text by using a state machine technology, so that a history audit system version matched with the audit time is rapidly and accurately positioned according to the transformation of the audit time, and the audit efficiency is improved.
In some embodiments, the first obtaining submodule includes a first obtaining unit and a generating unit, where the first obtaining unit is configured to perform ascending arrangement on effective dates of the plurality of system texts, and obtain an ordered list of effective dates of the plurality of system texts; the generating unit is used for acquiring the sequence numbers of the system texts corresponding to the effective dates based on the hash table and generating a corresponding sequence number sorting table.
According to the screening device for the audit project applicable system, the effective dates of the plurality of system texts are arranged in ascending order through the first acquisition unit, so that the effective date ranking table of the plurality of system texts is acquired, and the follow-up comparison with the audit time range is facilitated; the generating unit is used for acquiring the sequence numbers of the system texts corresponding to the effective dates based on the hash table, generating a corresponding sequence number sorting table, acquiring the time sequence of the version change of the system sequences and providing necessary conditions for acquiring the version information of the system sequences.
In some embodiments, the clustering module includes a third obtaining sub-module, a fourth obtaining sub-module, a fifth obtaining sub-module, and an adding sub-module, where the third obtaining sub-module is configured to obtain sentence vectors corresponding to the system names based on the system names of the plurality of system texts; the fourth acquisition sub-module is used for performing clustering operation on the sentence vectors to acquire at least one system sequence; the fifth obtaining submodule is used for obtaining the system names and the system numbers of the system texts corresponding to the system sequences based on the hash table; the adding submodule is used for adding the sequence number corresponding to the system number in the hash table based on the system number of each system text corresponding to the system sequence.
According to the screening device for the audit project applicable system, sentence vectors corresponding to the system names are obtained through the third obtaining submodule based on the system names of the system texts, and characters of the system names are converted into the corresponding sentence vectors to perform clustering operation; clustering is carried out on sentence vectors through a fourth acquisition sub-module, at least one system sequence is acquired, the system texts are classified based on semantic analysis of system names, and a system management system is built from nothing to nothing; a fifth obtaining submodule is used for obtaining the system names and the system numbers of the system texts corresponding to the system sequences based on the hash table, and carrying out data preparation for the subsequent generation of the key point map based on the system sequences; and adding a sequence number corresponding to the system number into the hash table by adding the system number of each system text corresponding to the system sequence to the submodule, so as to establish a corresponding relation between the system number and the sequence number and improve the searching efficiency of the follow-up system version information.
In some embodiments, the second generating module includes a sixth obtaining sub-module, a seventh obtaining sub-module, a first generating sub-module, and a second generating sub-module, where the sixth obtaining sub-module is configured to obtain a keyword corresponding to the system sequence based on a system name corresponding to the system sequence; the seventh obtaining submodule is used for obtaining the effective date and the system number of each system text corresponding to the system sequence based on the hash table; the first generation sub-module is used for correspondingly arranging the system numbers of the system texts corresponding to the effective dates based on the ascending arrangement of the effective dates, and generating a sequence version update table corresponding to the system sequences; the second generation sub-module is used for generating a system key point map based on the key words and the sequence version updating table corresponding to each system sequence.
According to the screening device for the audit project applicable system, keywords corresponding to the system sequence are acquired through the sixth acquisition submodule based on the system names corresponding to the system sequence, so that the main functions and the applicable range of the system sequence are displayed; acquiring effective dates and system numbers of all system texts corresponding to the system sequences based on the hash table through a seventh acquisition submodule so as to carry out version ordering; the method comprises the steps that a first generation submodule correspondingly arranges system numbers of a system text corresponding to an effective date based on ascending arrangement of the effective date, a sequence version update table corresponding to the system sequence is generated, and a version update sequence of the system sequence is obtained; and the second generation submodule generates a system key point map based on the key words and the sequence version update table corresponding to each system sequence, realizes static display of version information based on the system sequence, realizes dynamic screening and display of maps transformed along with audit time by combining a state machine technology, and improves audit efficiency.
In addition, in combination with the screening method of the audit project applicability provided in the above embodiment, a readable storage medium may be provided in this embodiment. The readable storage medium has a program stored thereon; the program, when executed by the processor, implements a screening method for any one of the audit item applicable regimes in the above embodiments.
It should be noted that, specific examples in this embodiment may refer to examples described in the foregoing embodiments and alternative implementations, and are not described in detail in this embodiment.
It should be understood that the specific embodiments described herein are merely illustrative of this application and are not intended to be limiting. All other embodiments, which can be made by one of ordinary skill in the art without undue burden from the present application, are within the scope of the present application in light of the embodiments provided herein.
It is evident that the drawings are only examples or embodiments of the present application, from which the present application can also be adapted to other similar situations by a person skilled in the art without the inventive effort. In addition, it should be appreciated that while the development effort might be complex and lengthy, it would nevertheless be a routine undertaking of design, fabrication, or manufacture for those of ordinary skill having the benefit of this disclosure, and thus should not be construed as an admission of insufficient detail.
The term "embodiment" in this application means that a particular feature, structure, or characteristic described in connection with the embodiment may be included in at least one embodiment of the application. The appearances of such phrases in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive. It will be clear or implicitly understood by those of ordinary skill in the art that the embodiments described in this application can be combined with other embodiments without conflict.
The above examples only represent a few embodiments of the present application, which are described in more detail and are not to be construed as limiting the scope of the patent. It should be noted that it would be apparent to those skilled in the art that various modifications and improvements could be made without departing from the spirit of the present application, which would be within the scope of the present application. Accordingly, the scope of protection of the present application shall be subject to the appended claims.

Claims (10)

1. A method for screening an audit project applicable system, the method comprising:
generating a hash table based on the acquired system configuration information of the system texts, wherein the system configuration information comprises system names, effective dates and storage paths of the system texts;
clustering the system names of the system texts to obtain at least one system sequence, wherein each system sequence comprises at least one system text;
generating a system key point map based on each system sequence and corresponding system version information, wherein the system version information is acquired based on the effective date of each system text corresponding to the system sequence in the hash table;
and screening the system version information corresponding to each system sequence in the system key point map based on the auditing time range of the auditing project to obtain the system version applicable to the auditing project.
2. The method of claim 1, wherein the screening the system version information corresponding to each system sequence in the system point map based on the audit time range of the audit item to obtain the system version applicable to the audit item comprises:
the effective dates of the plurality of system texts are arranged in an ascending order, a ranking table corresponding to the effective dates and the sequence numbers of the plurality of system texts is obtained, and the sequence numbers are used for identifying the numbers of the system sequences corresponding to the system texts effective on the effective dates;
determining a corresponding effective date interval in the sorting table based on the audit time range of the audit item;
acquiring a corresponding sequence number interval in the sorting table based on the effective date interval;
and determining a system version applicable to the audit item based on each sequence number in the sequence number interval and a sequence version update table corresponding to the sequence number acquired in advance.
3. The method of claim 2, wherein the step of sorting the effective dates of the plurality of system texts in ascending order, and the step of obtaining a sorted list of one-to-one correspondence between effective dates and sequence numbers of the plurality of system texts comprises:
The effective dates of the plurality of system texts are arranged in ascending order, and an effective date ranking table of the plurality of system texts is obtained;
and acquiring the sequence numbers of the system texts corresponding to the effective dates based on the hash table, and generating a corresponding sequence number sorting table.
4. The method of claim 1, wherein clustering the system names of the plurality of system texts to obtain at least one system sequence comprises:
based on the system names of the system texts, sentence vectors corresponding to the system names are obtained;
clustering operation is carried out on the sentence vectors, and at least one system sequence is obtained;
based on the hash table, acquiring a system name and a system number of each system text corresponding to the system sequence;
and adding a sequence number corresponding to the system number into the hash table based on the system number of each system text corresponding to the system sequence.
5. The method of claim 4, wherein generating a institutional key map based on each of the institutional sequences and corresponding institutional version information comprises:
acquiring keywords corresponding to the system sequence based on the system names corresponding to the system sequence;
Based on the hash table, acquiring the effective date and the system number of each system text corresponding to the system sequence;
based on the ascending arrangement of the effective dates, the system numbers of the system texts corresponding to the effective dates are correspondingly arranged, and a sequence version update table corresponding to the system sequences is generated;
and generating a system key point map based on the key words and the sequence version updating table corresponding to each system sequence.
6. The method of claim 5, wherein generating a institutional key map based on each of the institutional sequences and corresponding institutional version information further comprises:
sequentially acquiring the plurality of system texts based on the storage paths of the plurality of system texts;
extracting key sentences in the system text based on a semantic analysis algorithm;
generating item key point information of the system text based on the key sentence;
and generating a system key point map based on the key words and the sequence version updating table corresponding to each system sequence and the item key point information corresponding to each system text in the system sequence.
7. The method of claim 1, wherein generating the hash table based on the obtained institutional configuration information of the plurality of institutional texts comprises:
Based on the system names, effective dates and storage paths of the system texts, respectively generating a system name linked list, an effective date linked list and a storage path linked list, wherein the system configuration information of the system texts is formed based on values with the same serial numbers in the system name linked list, the effective date linked list and the storage path linked list;
generating a system number linked list based on the length of the system name linked list;
and generating a hash table by taking the system number in the system number linked list as a key and the corresponding system configuration information as a value.
8. The method of claim 1, wherein after clustering the system names of the plurality of system texts to obtain at least one system sequence, the method further comprises:
displaying the system text corresponding to the system sequence, and acquiring correction information corresponding to the system text;
correcting key words and item key point information of the system text based on correction information corresponding to the system text;
and generating a system key point map based on the corrected key words and the item key point information.
9. The method of claim 1, wherein after said deriving an institutional version for which said audit item is applicable, said method further comprises:
Based on a plurality of audit time ranges of the audit item, respectively displaying the system version nodes corresponding to the audit time ranges in the system key point map through a plurality of colors.
10. Screening device of audit project applicable system, characterized in that it includes:
the first generation module is used for generating a hash table based on the acquired system configuration information of the system texts, wherein the system configuration information comprises system names, effective dates and storage paths of the system texts;
the clustering module is used for clustering the system names of the system texts to obtain at least one system sequence, and each system sequence comprises at least one system text;
the second generation module is used for generating a system key point map based on each system sequence and corresponding system version information, and the system version information is acquired based on the effective date of the system text;
and the screening module is used for screening the system version information corresponding to each system sequence in the system key point map based on the auditing time range of the auditing project to obtain the system version applicable to the auditing project.
CN202311382276.6A 2023-10-24 2023-10-24 Screening method and device for audit project applicability Pending CN117407536A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311382276.6A CN117407536A (en) 2023-10-24 2023-10-24 Screening method and device for audit project applicability

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311382276.6A CN117407536A (en) 2023-10-24 2023-10-24 Screening method and device for audit project applicability

Publications (1)

Publication Number Publication Date
CN117407536A true CN117407536A (en) 2024-01-16

Family

ID=89493906

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311382276.6A Pending CN117407536A (en) 2023-10-24 2023-10-24 Screening method and device for audit project applicability

Country Status (1)

Country Link
CN (1) CN117407536A (en)

Similar Documents

Publication Publication Date Title
US11704494B2 (en) Discovering a semantic meaning of data fields from profile data of the data fields
CN108804641B (en) Text similarity calculation method, device, equipment and storage medium
CN108292310B (en) Techniques for digital entity correlation
Whang et al. Incremental entity resolution on rules and data
US8533203B2 (en) Identifying synonyms of entities using a document collection
US11514701B2 (en) System and method for global identification in a collection of documents
US20190236460A1 (en) Machine learnt match rules
Anadiotis et al. Graph integration of structured, semistructured and unstructured data for data journalism
CN103425687A (en) Retrieval method and system based on queries
US9141665B1 (en) Optimizing search system resource usage and performance using multiple query processing systems
CN112463774B (en) Text data duplication eliminating method, equipment and storage medium
CN111078835A (en) Resume evaluation method and device, computer equipment and storage medium
Bogatu et al. Towards automatic data format transformations: data wrangling at scale
CN113626558B (en) Intelligent recommendation-based field standardization method and system
CN114610955A (en) Intelligent retrieval method and device, electronic equipment and storage medium
CN109726292A (en) Text analyzing method and apparatus towards extensive multilingual data
US9619458B2 (en) System and method for phrase matching with arbitrary text
CN117251777A (en) Data processing method, device, computer equipment and storage medium
KR102559806B1 (en) Method and Apparatus for Smart Law Precedent Search Technology and an Integrated Law Service Technology Based on Machine Learning
CN117407536A (en) Screening method and device for audit project applicability
CN115062049A (en) Data blood margin analysis method and device
CN116127053B (en) Entity word disambiguation, knowledge graph generation and knowledge recommendation methods and devices
CN117725555B (en) Multi-source knowledge tree association fusion method and device, electronic equipment and storage medium
CN117591624B (en) Test case recommendation method based on semantic index relation
RU2669143C1 (en) Method of data transformation of geoinformation systems (gis), system for its implementation and method of search for the data based on this method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination