CN116431686B - Training data query method and system based on heterogeneous archives - Google Patents

Training data query method and system based on heterogeneous archives Download PDF

Info

Publication number
CN116431686B
CN116431686B CN202310673393.1A CN202310673393A CN116431686B CN 116431686 B CN116431686 B CN 116431686B CN 202310673393 A CN202310673393 A CN 202310673393A CN 116431686 B CN116431686 B CN 116431686B
Authority
CN
China
Prior art keywords
data
file
borrowing
archive
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202310673393.1A
Other languages
Chinese (zh)
Other versions
CN116431686A (en
Inventor
冯文英
杨斌
刘铁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu Aeronautic Polytechnic
Original Assignee
Chengdu Aeronautic Polytechnic
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chengdu Aeronautic Polytechnic filed Critical Chengdu Aeronautic Polytechnic
Priority to CN202310673393.1A priority Critical patent/CN116431686B/en
Publication of CN116431686A publication Critical patent/CN116431686A/en
Application granted granted Critical
Publication of CN116431686B publication Critical patent/CN116431686B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • G06F16/24553Query execution of query operations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2457Query processing with adaptation to user needs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06KGRAPHICAL DATA READING; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K7/00Methods or arrangements for sensing record carriers, e.g. for reading patterns
    • G06K7/10Methods or arrangements for sensing record carriers, e.g. for reading patterns by electromagnetic radiation, e.g. optical sensing; by corpuscular radiation
    • G06K7/10009Methods or arrangements for sensing record carriers, e.g. for reading patterns by electromagnetic radiation, e.g. optical sensing; by corpuscular radiation sensing by radiation using wavelengths larger than 0.1 mm, e.g. radio-waves or microwaves
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Abstract

The invention discloses a training data query method and a training data query system based on heterogeneous archives, which are used for acquiring archives metadata and integrating archives streaming tracks according to the fact that users and users are closely browsed before and after the realization of the query of homologous archives browsing users: the file circulation track integrating device fuses file envelope RFID sensing data, file storage library RFID sensing data, file security authority information, paper borrowing circulation data, electronic borrowing circulation data, file purchase data and file training efficiency data, wherein the file circulation track integrating device comprises data of use information, and performs tracking and recording on the use information based on GPS positioning data to obtain discrete file track data; browse user information analysis, training data queries, and associated user queries. The invention fully considers the file content of heterogeneous types, provides high-efficiency self-adaptive query service for files under different training conditions, accurately provides query results for related users, and is convenient for developing subsequent file use work.

Description

Training data query method and system based on heterogeneous archives
Technical Field
The invention relates to the technical field of data management, in particular to a training data query method and system based on heterogeneous archives.
Background
The enterprise training refers to the enterprise or the planned and systematic training and training activities aiming at the improvement of personnel quality, capability, work performance and contribution to the organization, and aims to improve and improve the knowledge, skills, working method, working attitude and value of staff, thereby playing the maximum potential to improve the performance of individuals and organizations, promoting the continuous progress of the organization and the individuals, realizing the double development of the organization and the individuals, and being one of important means for promoting the continuous development of the enterprise.
At present, in the process of developing training of each unit, the collection and archiving of relevant paper files in each link are required to be manually completed, so that a great deal of manpower and time are required for unified archiving management, paper waste is caused, and inquiry by management staff is not facilitated. Most of the prior enterprises are inconvenient to carry out informatization management on training files of new staff in the training process of the new staff, the training files of the new staff are stored in a paper file mode, if the stored paper files are damaged, the training files of the new staff disappear thoroughly, and when the new staff need to inquire own file information, the leaders of all levels are required to be called, so that the file inquiry process is very complicated, inconvenience is brought to file inquiry, and therefore, the invention provides the training data inquiry method and system based on heterogeneous files.
Disclosure of Invention
According to a first aspect of the present invention, the present invention claims a training data query method based on heterogeneous archives, which is characterized in that: according to the inquiry of the user browsing user and the homologous file browsing user, the method comprises the following steps:
s1, acquiring file metadata: the file metadata acquisition device obtains file envelope RFID sensing data, file repository RFID sensing data, file security authority information, file browsing radiation range information, file directory information and personnel basic data in the file management department borrowing registration data, so as to obtain file metadata data;
s2, file streaming track integration: the file circulation track integrating device fuses file envelope RFID sensing data, file storage library RFID sensing data, file security authority information, paper borrowing circulation data, electronic borrowing circulation data, file purchase data and file training efficiency data, wherein the file circulation track integrating device comprises data of use information, and performs tracking and recording on the use information based on GPS positioning data to obtain discrete file track data;
s3, browsing user information analysis: the browsing user information analysis device executes browsing user information analysis based on the data acquired in the step S1 and the step S2;
S4, inquiring training data: the training data query device executes training data query based on the data acquired in the step S1 and the step S2;
s5, inquiring the associated user: the associated user inquiry device executes the associated user inquiry based on the data acquired in the steps S1, S2 and S3.
Further, the specific implementation method of the step S2 includes the steps of:
s2.1, comparing basic personnel data: the file envelope RFID sensing data, the file repository RFID sensing data and the file security authority information are provided with information conforming to the output data of the file metadata acquisition device, and the personnel ID is obtained according to the mode that the file metadata acquisition device generates the personnel ID;
electronic borrowing circulation data, paper borrowing circulation data, file purchase data, file training efficiency data, GPS positioning data and file metadata acquisition device output data are associated to obtain a personnel ID;
s2.2, filtering and clustering the archive information to meet the output data requirements, wherein the filtering and clustering of the archive information comprises processing of damaged archives, blank archives and special archives:
the specific implementation method of the step S2.2 comprises the following steps:
s2.2.1, RFID induction data for archive envelope: the method comprises the steps of heterogeneous archive public superior ID and name, sensing point ID and name, heterogeneous archive public superior type, heterogeneous archive public superior address and coordinate information: forming an archive ID according to the public superior ID of the connected heterogeneous archive and the induction point ID, forming an archive name according to the public superior name of the connected heterogeneous archive and the induction point name, and using original archive information along with an archive address and coordinates;
If the archive name contains a keyword capable of identifying multimedia, the archive type is multimedia, and the archive address and the coordinates are empty; if the archive names contain keywords capable of identifying video keywords and picture keywords, the archive types are visual archive types, and the other archive types follow the original archive types;
s2.2.2, RFID sensing data for archive: comprises a file storage device ID, a file communication destination name, a file communication destination type, a file communication destination address, and file communication destination coordinate information, wherein the file storage device ID, the file storage name, the file storage address, the file storage coordinate are used as file storage device ID, the file communication destination name, the file communication destination address, and the file communication destination coordinate,
if the archive name contains a keyword capable of identifying multimedia, the archive type is multimedia, archive addresses and coordinates are empty, and the other archive types are the archive communication destination types;
s2.2.3, for archive security rights information: the file system comprises a character file ID, a character file name, a character file address and character file coordinate information, wherein the type of a file library is a character file type, and the rest information uses file security authority information;
S2.2.4, for paper borrowing circulation data: the file library type is a video keyword type, and the rest information is the paper borrowing circulation data;
s2.2.5, for electronic borrowing circulation data: the electronic borrowing method comprises the steps of including an electronic borrowing ID, an electronic borrowing name, an electronic borrowing equipment ID and electronic borrowing personnel information, wherein the types of archives are multimedia types, archives addresses and coordinates are empty, the electronic borrowing ID and the electronic borrowing equipment ID are combined to form the archives ID, and the electronic borrowing personnel information and the electronic borrowing name are combined to form the archives name;
s2.2.6, for archive purchase data, archive training performance data: the method comprises the steps that the method comprises the steps of including an electronic borrowing device ID and electronic borrowing personnel information, wherein the types of archives are respectively an online contract electronic borrowing device type and a purchase channel type, archives addresses and coordinates are empty, and the electronic borrowing device ID and the electronic borrowing personnel are respectively used by the archives ID and the archives names;
s2.2.7, for GPS positioning data: the method comprises the steps of including a archive ID and archive coordinate information, wherein the archive type is GPS positioning archive type, and combining the character archive and the archive ID to form an archive name;
S2.3, electronic borrowing and video frequency comparison: performing electronic borrowing frequency comparison by associating the electronic borrowing circulation data with the multimedia arrival data; performing video frequency comparison on the paper borrowing and transferring data and the arrival data of the video electronic borrowing equipment in an associated mode;
s2.4, file streaming track integration: for static archive loan history information, fusing use information according to output data requirements after archive information clustering; for the lending history information, splitting the data into a static archive-electronic borrowing equipment-static archive form fusion use information;
s2.5, calculating an electronic borrowing and lending transfer keyword: setting the electronic borrowing circulation data as r1, when the next source of the lending history information r2 is the electronic borrowing circulation data and the revolving keywords are direct keywords of the multimedia in r1, taking the r2 revolving keywords as r1 transfer keywords, and if the direct keywords are not direct keywords, calculating the multimedia direct keywords in r1 closest to the r2 revolving keywords as transfer keywords; when the source type of the next lending history information r2 is non-electronic borrowing circulation data and non-GPS positioning data, calculating a latest electronic borrowing keyword of r1 multimedia direct around the r2 archive as a transfer keyword; when the source of the next lending history information is non-electronic lending circulation data lending history information r2 and the lending history information r3 with the type of the archive being a GPS positioning archive exists between the source of the next lending history information and the source of the non-electronic lending circulation data lending history information r1, calculating a latest electronic lending keyword which is directly connected with multimedia in r1 around the latest r3 archive and has a distance greater than a set threshold value s' tau from the r2 archive as a transfer keyword;
S2.6, resolving and splitting lending history information: analyzing and splitting the electronic borrowing and lending, the video lending, the text file lending and the purchasing channel lending into a keyword-electronic borrowing equipment-keyword form, and deleting the doped GPS positioning data in the split lending history information;
s2.7, lending behavior deduplication: performing deduplication on the usage information based on correspondence in the archive map, employing the following logic:
when the file cover RFID induction data is repeated with the file repository RFID induction data, reserving file cover RFID induction data information;
when the file cover data and the file purchase data are repeated, the file purchase data information is reserved;
when the file envelope data and the file lending induction information are repeated, the file envelope RFID induction time is reserved as the initial time of the lending action,
the other information adopts archive lending induction information;
when the file cover RFID sensing data and the paper borrowing circulation data are repeated, the file cover RFID sensing time is reserved as the initial time of the lending action,
the other information adopts NFC scanning data information;
when the file storage library data and the electronic borrowing circulation data are repeated, adopting the electronic borrowing circulation data;
S2.8, tracking and recording the use information: performing signaling data tracking on the data subjected to the lending action of S2.7, performing recording on the use information based on lending history information with lending history information sources being GPS positioning data to obtain discrete file track data, specifically calculating the distance and initial time difference between each static archive lending history information r1 archive and the lending history information r2 archive of the next non-GPS positioning data source, and deleting the GPS positioning data between r1 and r2 if the distance value is smaller than a set threshold value S tau and the time difference is smaller than a set threshold value t tau, and adopting the initial time of r2 when r1 is completed; otherwise, reserving a piece of GPS positioning data at intervals of delta t, wherein the reserved GPS positioning data finishing time is the data generating time plus delta t, and the lending history information of the rest finishing time is the finishing time by adopting the initial time recorded by the next lending action.
The sensing point is an RFID sensing point of the file cover, the common superior type of the heterogeneous files refers to a superior concept set of the heterogeneous files in type, the common superior type of the heterogeneous files comprises a multimedia type and a text type, the file storage library equipment comprises equipment for storing the heterogeneous files and at least comprises a file database and a file metadata database, the file communication destination type indicates the type of the file, at least comprises a training type and a renting and selling type, and the borrowing equipment represents electronic equipment for borrowing the files and at least comprises: the system comprises a mobile terminal and a PC, wherein the station data represent a storage site of the archive, the keyword-electronic borrowing equipment-keyword represents an association relation between the borrowing equipment and the archive keyword, the static archive-electronic borrowing equipment-static archive represents an association relation between the borrowing equipment and the archive, the distance checking calculation represents a distance between browsing user information analysis executed based on personal basic information data association personnel and discrete scattered archive track data and is used for representing a browsing path relation relativity of the archive borrowed by a user, the main key of the borrowing equipment represents an identification ID of the borrowing equipment, and the equipment information data indicate other information of the borrowing equipment and at least comprise the following steps: borrowing time and borrowing history.
Further, the specific implementation method of the step S1 includes the steps of:
s1.1, browsing personnel data are obtained: acquiring file envelope RFID sensing data, file repository RFID sensing data, file security authority information, file browsing radiation range information, file directory information, personnel certificate type, certificate number, name, sex and date of birth information in the file management department borrowing registration data, forming personnel ID according to the combined certificate type and certificate number, and endowing personnel ID with unique identification personnel based on personnel basic data;
s1.2, removing duplication of browser data: when the information of the lending user is repeated, deleting repeated browsing personnel data based on the latest RFID sensing data of the lending user in the file cover RFID sensing data and the file storage library RFID sensing data;
s1.3, verifying correctness: performing correctness checking on personnel certificate numbers in the file envelope RFID sensing data and input data except the file repository RFID sensing data;
s1.4, the data according to the verification is file metadata data.
Further, the specific implementation method for browsing user information analysis in the step S3 includes the steps of:
s3.1, associating personnel ID based on personal basic information data, wherein the file browsing radiation range information comprises a borrowing browsing device main key, and acquiring borrowing browsing device data after associating with device information data; the file catalog information comprises a lending user file management fee paying unit and a paying initial year and month, and the personal file management fee paying data comprises a lending user file management fee paying unit, a paying year and month and a paying state, and the lending user file management fee paying unit, the paying year and month and the paying state are associated to acquire lending user borrowing group and tening period information; the borrowing registration data of the archive management department comprises borrowing group information, and can be directly integrated into personal borrowing group data; acquiring borrowing equipment coordinates, borrowing group addresses and coordinates based on a map service;
S3.2, performing browsing user information analysis based on the discrete archive track data;
and S3.3, performing distance checking calculation on the information obtained in the step S3.1 and the information obtained in the step S3.2, and if the calculation result is smaller than a set distance checking threshold value, successfully comparing, otherwise failing to compare.
Further, step S4 performs a training data query based on the discrete archive track data, and performs the query on the static archive according to rules:
max (end_time+tτ1, end_time) -min (start_time-tτ1, start_time) Σtτ2 for the lending data, the query can be executed regularly:
end_time*+tτ3≥end_time**≥end_time*
wherein: * Representing a trusted lender; * Representing a lending user to be found; t tau 1 represents a reserved time threshold for expanding the time of the lending action of the trusted lender; tτ2 represents a contact time threshold; tτ3 represents a lending data determination time threshold; end_time is the time of the completion of the lending act, start_time is the time of the initiation of the lending act, max is the maximum function, and min is the minimum function.
Further, the specific implementation method of the step S5 associated user inquiry comprises the following steps:
s5.1, carrying out related user inquiry based on output data of the browsing user information analysis device, inquiring personnel identical to the trusted lending user equipment ID/the borrowing group ID under the condition that the comparison is successful, inquiring personnel identical to the trusted lending user equipment ID/the borrowing group ID under the condition that the comparison is failed, and inquiring personnel identical to the borrowing archive library/the borrowing archive library ID identified by the trusted lending user according to the discrete archive track data and returning a result;
S5.2, inquiring the coincident track borrower: if the borrowing equipment and the borrowing ground are successfully compared, further acquiring the personnel inquiry of the same-floor height/same-floor number/same-partition/same-factory area of the trusted borrowing user on the basis of the equipment information data;
s5.3, inquiring the associated track borrower: if the borrowing group and the person are successfully compared with the borrowing ground and the person is frequently lived, layer height/partition/factory area information is further acquired according to the borrowing group and the person address, and accordingly the same-layer height/same-partition/same-factory area personnel inquiry is executed.
According to a second aspect of the present invention, the present invention claims a training data query system based on heterogeneous archives, comprising: the system comprises a file metadata acquisition device, a file circulation track integration device, a browse user information analysis device, a training data query device and an associated user query device;
the file metadata acquisition device obtains file envelope RFID sensing data, file repository RFID sensing data, file security authority information, file browsing radiation range information, file directory information and personnel basic data in the file management department borrowing registration data, so as to obtain file metadata data;
The file circulation track integrating device fuses file envelope RFID sensing data, file storage library RFID sensing data, file security authority information, paper borrowing circulation data, electronic borrowing circulation data, file purchase data and file training efficiency data, wherein the file circulation track integrating device comprises data of use information, and performs tracking and recording on the use information based on GPS positioning data to obtain discrete file track data;
the browsing user information analysis means performs browsing user information analysis based on the acquired data;
the training data query device executes training data query based on the acquired data;
the associated user query device executes associated user query based on the acquired data;
the system is used for executing the training data query method based on the heterogeneous archives.
The invention discloses a training data query method and a training data query system based on heterogeneous archives, which are used for acquiring archives metadata and integrating archives streaming tracks according to the fact that users and users are closely browsed before and after the realization of the query of homologous archives browsing users: the file circulation track integrating device fuses file envelope RFID sensing data, file storage library RFID sensing data, file security authority information, paper borrowing circulation data, electronic borrowing circulation data, file purchase data and file training efficiency data, wherein the file circulation track integrating device comprises data of use information, and performs tracking and recording on the use information based on GPS positioning data to obtain discrete file track data; browse user information analysis, training data queries, and associated user queries. The invention fully considers the file content of heterogeneous types, provides high-efficiency self-adaptive query service for files under different training conditions, accurately provides query results for related users, and is convenient for developing subsequent file use work.
Drawings
FIG. 1 is a workflow diagram of a heterogeneous archive based training data query method in accordance with the claimed invention;
FIG. 2 is a second workflow diagram of a heterogeneous archive based training data query method in accordance with the claimed invention;
FIG. 3 is a third workflow diagram of a heterogeneous archive based training data query method in accordance with the claimed invention;
FIG. 4 is a fourth workflow diagram of a heterogeneous archive based training data query method in accordance with the claimed invention;
FIG. 5 is a block diagram of a training data query system based on heterogeneous archives in accordance with the claimed invention.
Detailed Description
The invention is illustrated below with reference to specific examples.
Referring to fig. 1, according to a first embodiment of the present invention, the present invention claims a training data query method based on heterogeneous archives, which is characterized in that: according to the inquiry of the user browsing user and the homologous file browsing user, the method comprises the following steps:
s1, acquiring file metadata: the file metadata acquisition device obtains file envelope RFID sensing data, file repository RFID sensing data, file security authority information, file browsing radiation range information, file directory information and personnel basic data in the file management department borrowing registration data, so as to obtain file metadata data;
S2, file streaming track integration: the file circulation track integrating device fuses file envelope RFID sensing data, file storage library RFID sensing data, file security authority information, paper borrowing circulation data, electronic borrowing circulation data, file purchase data and file training efficiency data, wherein the file circulation track integrating device comprises data of use information, and performs tracking and recording on the use information based on GPS positioning data to obtain discrete file track data;
s3, browsing user information analysis: the browsing user information analysis device executes browsing user information analysis based on the data acquired in the step S1 and the step S2;
s4, inquiring training data: the training data query device executes training data query based on the data acquired in the step S1 and the step S2;
s5, inquiring the associated user: the associated user inquiry device executes the associated user inquiry based on the data acquired in the steps S1, S2 and S3.
Further, referring to fig. 2, the specific implementation method of step S2 includes the steps of:
s2.1, comparing basic personnel data: the file envelope RFID sensing data, the file repository RFID sensing data and the file security authority information are provided with information conforming to the output data of the file metadata acquisition device, and the personnel ID is obtained according to the mode that the file metadata acquisition device generates the personnel ID;
Electronic borrowing circulation data, paper borrowing circulation data, file purchase data, file training efficiency data, GPS positioning data and file metadata acquisition device output data are associated to obtain a personnel ID;
s2.2, filtering and clustering the archive information to meet the output data requirements, wherein the filtering and clustering of the archive information comprises processing of damaged archives, blank archives and special archives:
the specific implementation method of the step S2.2 comprises the following steps:
s2.2.1, RFID induction data for archive envelope: the method comprises the steps of heterogeneous archive public superior ID and name, sensing point ID and name, heterogeneous archive public superior type, heterogeneous archive public superior address and coordinate information: forming an archive ID according to the public superior ID of the connected heterogeneous archive and the induction point ID, forming an archive name according to the public superior name of the connected heterogeneous archive and the induction point name, and using original archive information along with an archive address and coordinates;
if the archive name contains a keyword capable of identifying multimedia, the archive type is multimedia, and the archive address and the coordinates are empty; if the archive names contain keywords capable of identifying video keywords and picture keywords, the archive types are visual archive types, and the other archive types follow the original archive types;
S2.2.2, RFID sensing data for archive: comprises a file storage device ID, a file communication destination name, a file communication destination type, a file communication destination address, and file communication destination coordinate information, wherein the file storage device ID, the file storage name, the file storage address, the file storage coordinate are used as file storage device ID, the file communication destination name, the file communication destination address, and the file communication destination coordinate,
if the archive name contains a keyword capable of identifying multimedia, the archive type is multimedia, archive addresses and coordinates are empty, and the other archive types are the archive communication destination types;
s2.2.3, for archive security rights information: the file system comprises a character file ID, a character file name, a character file address and character file coordinate information, wherein the type of a file library is a character file type, and the rest information uses file security authority information;
s2.2.4, for paper borrowing circulation data: the file library type is a video keyword type, and the rest information is the paper borrowing circulation data;
s2.2.5, for electronic borrowing circulation data: the electronic borrowing method comprises the steps of including an electronic borrowing ID, an electronic borrowing name, an electronic borrowing equipment ID and electronic borrowing personnel information, wherein the types of archives are multimedia types, archives addresses and coordinates are empty, the electronic borrowing ID and the electronic borrowing equipment ID are combined to form the archives ID, and the electronic borrowing personnel information and the electronic borrowing name are combined to form the archives name;
S2.2.6, for archive purchase data, archive training performance data: the method comprises the steps that the method comprises the steps of including an electronic borrowing device ID and electronic borrowing personnel information, wherein the types of archives are respectively an online contract electronic borrowing device type and a purchase channel type, archives addresses and coordinates are empty, and the electronic borrowing device ID and the electronic borrowing personnel are respectively used by the archives ID and the archives names;
s2.2.7, for GPS positioning data: the method comprises the steps of including a archive ID and archive coordinate information, wherein the archive type is GPS positioning archive type, and combining the character archive and the archive ID to form an archive name;
s2.3, electronic borrowing and video frequency comparison: performing electronic borrowing frequency comparison by associating the electronic borrowing circulation data with the multimedia arrival data; performing video frequency comparison on the paper borrowing and transferring data and the arrival data of the video electronic borrowing equipment in an associated mode;
s2.4, file streaming track integration: for static archive loan history information, fusing use information according to output data requirements after archive information clustering; for the lending history information, splitting the data into a static archive-electronic borrowing equipment-static archive form fusion use information;
s2.5, calculating the electronic borrowing and lending transfer keywords.
Specifically, the electronic borrowing circulation data is set as r1, when the next lending history information r2 source is the electronic borrowing circulation data and the revolving keywords are direct keywords of the multimedia in r1, the r2 revolving keywords are used as r1 transfer keywords, and if the direct keywords are not direct keywords, the multimedia direct keywords in r1 closest to the r2 revolving keywords are calculated to be used as transfer keywords; when the source type of the next lending history information r2 is non-electronic borrowing circulation data and non-GPS positioning data, calculating a latest electronic borrowing keyword of r1 multimedia direct around the r2 archive as a transfer keyword; when the source of the next lending history information is non-electronic lending circulation data lending history information r2 and the lending history information r3 with the type of the archive being a GPS positioning archive exists between the source of the next lending history information and the source of the non-electronic lending circulation data lending history information r1, calculating a latest electronic lending keyword which is directly connected with multimedia in r1 around the latest r3 archive and has a distance greater than a set threshold value s' tau from the r2 archive as a transfer keyword;
s2.6, resolving and splitting lending history information: analyzing and splitting the electronic borrowing and lending, the video lending, the text file lending and the purchasing channel lending into a keyword-electronic borrowing equipment-keyword form, and deleting the doped GPS positioning data in the split lending history information;
S2.7, lending behavior deduplication: performing deduplication on the usage information based on correspondence in the archive map;
the following logic is adopted:
when the file cover RFID induction data is repeated with the file repository RFID induction data, reserving file cover RFID induction data information;
when the file cover data and the file purchase data are repeated, the file purchase data information is reserved;
when the file envelope data and the file lending induction information are repeated, the file envelope RFID induction time is reserved as the initial time of the lending action,
the other information adopts archive lending induction information;
when the file cover RFID sensing data and the paper borrowing circulation data are repeated, the file cover RFID sensing time is reserved as the initial time of the lending action,
the other information adopts NFC scanning data information;
when the file storage library data and the electronic borrowing circulation data are repeated, adopting the electronic borrowing circulation data;
s2.8, tracking and recording the use information;
specifically, performing signaling data tracking on the data subjected to the lending action of S2.7, performing recording on the use information based on lending history information with a lending history information source being GPS positioning data to obtain discrete file track data, specifically calculating the distance and initial time difference between each static archive lending history information r1 archive and the lending history information r2 archive with the next non-GPS positioning data source, and deleting the GPS positioning data between r1 and r2 if the distance value is smaller than a set threshold value S 'tau and the time difference is smaller than a set threshold value t' tau, and adopting the initial time of r2 when r1 is completed; otherwise, reserving a piece of GPS positioning data at intervals of delta t, wherein the reserved GPS positioning data finishing time is the data generating time plus delta t, and the lending history information of the rest finishing time is the finishing time by adopting the initial time recorded by the next lending action.
The sensing point is an RFID sensing point of the file cover, the common superior type of the heterogeneous files refers to a superior concept set of the heterogeneous files in type, the common superior type of the heterogeneous files comprises a multimedia type and a text type, the file storage library equipment comprises equipment for storing the heterogeneous files and at least comprises a file database and a file metadata database, the file communication destination type indicates the type of the file, at least comprises a training type and a renting and selling type, and the borrowing equipment represents electronic equipment for borrowing the files and at least comprises: the system comprises a mobile terminal and a PC, wherein the station data represent a storage site of the archive, the keyword-electronic borrowing equipment-keyword represents an association relation between the borrowing equipment and the archive keyword, the static archive-electronic borrowing equipment-static archive represents an association relation between the borrowing equipment and the archive, the distance checking calculation represents a distance between browsing user information analysis executed based on personal basic information data association personnel and discrete scattered archive track data and is used for representing a browsing path relation relativity of the archive borrowed by a user, the main key of the borrowing equipment represents an identification ID of the borrowing equipment, and the equipment information data indicate other information of the borrowing equipment and at least comprise the following steps: borrowing time and borrowing history.
Further, referring to fig. 3, the specific implementation method of step S1 includes the steps of:
s1.1, browsing personnel data are obtained: acquiring file envelope RFID sensing data, file repository RFID sensing data, file security authority information, file browsing radiation range information, file directory information, personnel certificate type, certificate number, name, sex and date of birth information in the file management department borrowing registration data, forming personnel ID according to the combined certificate type and certificate number, and endowing personnel ID with unique identification personnel based on personnel basic data;
s1.2, removing duplication of browser data: when the information of the lending user is repeated, deleting repeated browsing personnel data based on the latest RFID sensing data of the lending user in the file cover RFID sensing data and the file storage library RFID sensing data;
s1.3, verifying correctness: performing correctness checking on personnel certificate numbers in the file envelope RFID sensing data and input data except the file repository RFID sensing data;
s1.4, the data according to the verification is file metadata data.
Further, referring to fig. 4, the specific implementation method of browsing user information analysis in step S3 includes the steps of:
S3.1, associating personnel ID based on personal basic information data, wherein the file browsing radiation range information comprises a borrowing browsing device main key, and acquiring borrowing browsing device data after associating with device information data; the file catalog information comprises a lending user file management fee paying unit and a paying initial year and month, and the personal file management fee paying data comprises a lending user file management fee paying unit, a paying year and month and a paying state, and the lending user file management fee paying unit, the paying year and month and the paying state are associated to acquire lending user borrowing group and tening period information; the borrowing registration data of the archive management department comprises borrowing group information, and can be directly integrated into personal borrowing group data; acquiring borrowing equipment coordinates, borrowing group addresses and coordinates based on a map service;
s3.2, performing browsing user information analysis based on the discrete archive track data;
and S3.3, performing distance checking calculation on the information obtained in the step S3.1 and the information obtained in the step S3.2, and if the calculation result is smaller than a set distance checking threshold value, successfully comparing, otherwise failing to compare.
Further, step S4 performs a training data query based on the discrete archive track data, and performs the query on the static archive according to rules:
max (end_time+tτ1, end_time) -min (start_time-tτ1, start_time) Σtτ2 for the lending data, the query can be executed regularly:
end_time*+tτ3≥end_time**≥end_time*
Wherein: * Representing a trusted lender; * Representing a lending user to be found; t tau 1 represents a reserved time threshold for expanding the time of the lending action of the trusted lender; tτ2 represents a contact time threshold; tτ3 represents a lending data determination time threshold; end_time is the time of the completion of the lending act, start_time is the time of the initiation of the lending act, max is the maximum function, and min is the minimum function.
Further, the specific implementation method of the step S5 associated user inquiry comprises the following steps:
s5.1, carrying out related user inquiry based on output data of the browsing user information analysis device, inquiring personnel identical to the trusted lending user equipment ID/the borrowing group ID under the condition that the comparison is successful, inquiring personnel identical to the trusted lending user equipment ID/the borrowing group ID under the condition that the comparison is failed, and inquiring personnel identical to the borrowing archive library/the borrowing archive library ID identified by the trusted lending user according to the discrete archive track data and returning a result;
s5.2, inquiring the coincident track borrower: if the borrowing equipment and the borrowing ground are successfully compared, further acquiring the personnel inquiry of the same-floor height/same-floor number/same-partition/same-factory area of the trusted borrowing user on the basis of the equipment information data;
S5.3, inquiring the associated track borrower: if the borrowing group and the person are successfully compared with the borrowing ground and the person is frequently lived, layer height/partition/factory area information is further acquired according to the borrowing group and the person address, and accordingly the same-layer height/same-partition/same-factory area personnel inquiry is executed.
According to a second embodiment of the present invention, referring to fig. 5, the present invention claims a training data query system based on heterogeneous archives, comprising: the system comprises a file metadata acquisition device, a file circulation track integration device, a browse user information analysis device, a training data query device and an associated user query device;
the file metadata acquisition device obtains file envelope RFID sensing data, file repository RFID sensing data, file security authority information, file browsing radiation range information, file directory information and personnel basic data in the file management department borrowing registration data, so as to obtain file metadata data;
the file circulation track integrating device fuses file envelope RFID sensing data, file storage library RFID sensing data, file security authority information, paper borrowing circulation data, electronic borrowing circulation data, file purchase data and file training efficiency data, wherein the file circulation track integrating device comprises data of use information, and performs tracking and recording on the use information based on GPS positioning data to obtain discrete file track data;
The browsing user information analysis means performs browsing user information analysis based on the acquired data;
the training data query device executes training data query based on the acquired data;
the associated user query device executes associated user query based on the acquired data;
the system is used for executing the training data query method based on the heterogeneous archives.
In this embodiment, the specific implementation method of step S3.2 includes the steps of:
s3.2.1, obtaining RFID induction data of the file cover and the file repository in the near n days;
s3.2.2 the removed archive type is determined to be GPS positioning archive, video keywords, video electronic borrowing equipment, electronic borrowing keywords, multimedia, text archive keywords, network contract electronic borrowing equipment, purchasing channel keywords and purchasing channel data;
s3.2.3, when borrowing identification is performed based on the discrete file track data between 7 points and 10 points in the morning, borrowing identification is performed based on the discrete file track data between 18 points and 3 points in the next day;
s3.2.4, calculating daily latest RFID sensing data based on borrowing identification and discrete archive track data based on borrowing identification;
s3.2.5, obtaining borrowing date data in discrete file track data based on borrowing identification based on a date dimension table;
S3.2.6 grouping and counting according to personnel, archives and date, obtaining archives with statistics larger than 1/2 of the number of borrowing days in n days for borrowing identification, and calculating archives corresponding to the maximum statistics for borrowing identification;
s3.2.7, judging whether a plurality of places exist in the result, and if so, calculating the archive with the latest RFID sensing time.
Those skilled in the art will appreciate that various modifications and improvements can be made to the disclosure. For example, the various devices or components described above may be implemented in hardware, software, firmware, or a combination of some or all of the three.
A flowchart is used in this disclosure to describe the steps of a method according to an embodiment of the present disclosure. It should be understood that the steps that precede or follow are not necessarily performed in exact order. Rather, the various steps may be processed in reverse order or simultaneously. Also, other operations may be added to these processes.
It will be appreciated by those of ordinary skill in the art that all or part of the steps of the methods described above may be performed by associated hardware, and that the program may be stored on a computer readable storage medium, such as a read only memory, a magnetic or optical disk, or the like. Alternatively, all or part of the steps of the above embodiments may be implemented using one or more integrated circuits. Accordingly, each device/building number in the above embodiment may be implemented in a form of hardware or may be implemented in a form of a software functional device. The present disclosure is not limited to any specific form of combination of hardware and software.
Unless defined otherwise, all terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure pertains. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the relevant art and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.
The foregoing is illustrative of the present disclosure and is not to be construed as limiting thereof. Although a few exemplary embodiments of this disclosure have been described, those skilled in the art will readily appreciate that many modifications are possible in the exemplary embodiments without materially departing from the novel teachings and advantages of this disclosure. Accordingly, all such modifications are intended to be included within the scope of this disclosure as defined in the claims. It is to be understood that the foregoing is illustrative of the present disclosure and is not to be construed as limited to the specific embodiments disclosed, and that modifications to the disclosed embodiments, as well as other embodiments, are intended to be included within the scope of the appended claims. The disclosure is defined by the claims and their equivalents.
In the description of the present specification, reference to the terms "one embodiment," "some embodiments," "illustrative embodiments," "examples," "specific examples," or "some examples," etc., means that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, schematic representations of the above terms do not necessarily refer to the same embodiments or examples. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
While embodiments of the present invention have been shown and described, it will be understood by those of ordinary skill in the art that: numerous variations, changes, substitutions and alterations may be made to those embodiments without departing from the principles and spirit of the invention, the scope of which is defined by the claims and their equivalents.

Claims (6)

1. A training data query method based on heterogeneous archives is characterized in that: the method comprises the following steps of:
s1, acquiring file metadata: the file metadata acquisition device obtains file envelope RFID sensing data, file repository RFID sensing data, file security authority information, file browsing radiation range information, file directory information and personnel basic data in the file management department borrowing registration data, so as to obtain file metadata data;
s2, file streaming track integration: the file circulation track integrating device fuses file envelope RFID sensing data, file storage library RFID sensing data, file security authority information, paper borrowing circulation data, electronic borrowing circulation data, file purchase data and file training efficiency data, wherein the file circulation track integrating device comprises data of use information, and performs tracking and recording on the use information based on GPS positioning data to obtain discrete file track data;
S3, browsing user information analysis: the browsing user information analysis device executes browsing user information analysis based on the data acquired in the step S1 and the step S2;
s4, inquiring training data: the training data query device executes training data query based on the data acquired in the step S1 and the step S2;
s5, inquiring the associated user: the associated user inquiry device executes associated user inquiry based on the data acquired in the step S1, the step S2 and the step S3;
step S4, training data query is executed based on the discrete file track data, and query is executed on the static archive according to rules:
max(end_time * +t τ1 ,end_time ** )-min(start_time * -tτ 1 ,start_time ** )≥t τ2 for lending data, the query may be performed on a regular basis:
end_time * +t τ3 ≥end_time ** ≥end_time *
wherein: * Representing a trusted lender; * Representing a lending user to be found; t is t τ1 Representing a threshold of time of reservation for enlarging the time of a trusted lender's lending actionEtching; t is t τ2 Representing a contact time threshold; t is t τ3 A threshold value indicating a lending data determination time; end_time is the time of the completion of the lending act, start_time is the time of the initiation of the lending act, max is the maximum function, and min is the minimum function.
2. The heterogeneous archive based training data query method of claim 1, wherein:
the specific implementation method of the step S2 comprises the following steps:
S2.1, comparing basic personnel data: the file envelope RFID sensing data, the file repository RFID sensing data and the file security authority information are provided with information conforming to the output data of the file metadata acquisition device, and the personnel ID is obtained according to the mode that the file metadata acquisition device generates the personnel ID;
s2.2, filtering and clustering the archive information to meet the output data requirements, wherein the filtering and clustering of the archive information comprises processing of damaged archives, blank archives and special archives:
the specific implementation method of the step S2.2 comprises the following steps:
s2.2.1, RFID induction data for archive envelope: the method comprises the steps of heterogeneous archive public superior ID and name, sensing point ID and name, heterogeneous archive public superior type, heterogeneous archive public superior address and coordinate information: forming an archive ID according to the public superior ID of the connected heterogeneous archive and the induction point ID, forming an archive name according to the public superior name of the connected heterogeneous archive and the induction point name, and using original archive information along with an archive address and coordinates;
s2.2.2, RFID sensing data for archive: comprises a file storage device ID, a file communication destination name, a file communication destination type, a file communication destination address, and file communication destination coordinate information, wherein the file storage device ID, the file storage name, the file storage address, the file storage coordinate are used as file storage device ID, the file communication destination name, the file communication destination address, and the file communication destination coordinate,
If the archive name contains a keyword capable of identifying multimedia, the archive type is multimedia, archive addresses and coordinates are empty, and the other archive types are the archive communication destination types;
s2.2.3, for archive security rights information: the file system comprises a character file ID, a character file name, a character file address and character file coordinate information, wherein the type of a file library is a character file type, and the rest information uses file security authority information;
s2.2.4, for paper borrowing circulation data: the file library type is a video keyword type, and the rest information is the paper borrowing circulation data;
s2.2.5, for electronic borrowing circulation data: the electronic borrowing method comprises the steps of including an electronic borrowing ID, an electronic borrowing name, an electronic borrowing equipment ID and electronic borrowing personnel information, wherein the types of archives are multimedia types, archives addresses and coordinates are empty, the electronic borrowing ID and the electronic borrowing equipment ID are combined to form the archives ID, and the electronic borrowing personnel information and the electronic borrowing name are combined to form the archives name;
s2.2.6, for archive purchase data, archive training performance data: the method comprises the steps that the method comprises the steps of including an electronic borrowing device ID and electronic borrowing personnel information, wherein the types of archives are respectively an online contract electronic borrowing device type and a purchase channel type, archives addresses and coordinates are empty, and the electronic borrowing device ID and the electronic borrowing personnel are respectively used by the archives ID and the archives names;
S2.2.7, for GPS positioning data: the method comprises the steps of including a archive ID and archive coordinate information, wherein the archive type is GPS positioning archive type, and combining the character archive and the archive ID to form an archive name;
s2.3, electronic borrowing and video frequency comparison: performing electronic borrowing frequency comparison by associating the electronic borrowing circulation data with the multimedia arrival data; performing video frequency comparison on the paper borrowing and transferring data and the arrival data of the video electronic borrowing equipment in an associated mode;
s2.4, file streaming track integration: for static archive loan history information, fusing use information according to output data requirements after archive information clustering; for the lending history information, splitting the data into a static archive-electronic borrowing equipment-static archive form fusion use information;
s2.5, calculating an electronic borrowing and lending transfer keyword: let the electronic borrowing circulation data be r 1 The next piece of lending history information r 2 The source is electronic borrowing circulation data and the revolving key word is r 1 When the direct keyword of the medium multimedia, r is as follows 2 Rotation key as r 1 Transferring keywords, and calculating distance r if the keywords are indirect 2 The nearest r of the rotation key word 1 The medium multimedia direct keywords are used as transfer keywords; next lending history information r 2 When the source type is not electronic borrowing circulation data and is not GPS positioning data, r is calculated 2 Around the archive r 1 The nearest electronic borrowing keywords of the multimedia direct are used as transfer keywords;
s2.6, resolving and splitting lending history information: analyzing and splitting the electronic borrowing and lending, the video lending, the text file lending and the purchasing channel lending into a keyword-electronic borrowing equipment-keyword form, and deleting the doped GPS positioning data in the split lending history information;
s2.7, lending behavior deduplication: performing deduplication on the usage information based on correspondence in the archive map, employing the following logic:
when the file cover RFID induction data is repeated with the file repository RFID induction data, reserving file cover RFID induction data information;
when the file cover data and the file purchase data are repeated, the file purchase data information is reserved;
when the file envelope data and the file lending induction information are repeated, the file envelope RFID induction time is reserved as the initial time of the lending action,
the other information adopts archive lending induction information;
when the file cover RFID sensing data and the paper borrowing circulation data are repeated, the file cover RFID sensing time is reserved as the initial time of the lending action,
The other information adopts NFC scanning data information;
when the file storage library data and the electronic borrowing circulation data are repeated, adopting the electronic borrowing circulation data;
s2.8, tracking and recording the use information: performing signaling data tracking on the data subjected to S2.7 lending action deduplication, performing recording on the use information based on lending history information with lending history information source being GPS positioning data to obtain discrete file track data, and particularly calculating lending history information r of each static file library 1 Lending history information r of archive and next non-GPS positioning data source 2 The distance between archives and the initial time difference, if the distance value is less than the set threshold value s' τ And the time difference is smaller than a set threshold t' τ Delete r 1 And r 2 GPS positioning data between r 1 The finishing moment adopts r 2 Is a starting time of (1); otherwise, reserving a piece of GPS positioning data at intervals of delta t, wherein the reserved GPS positioning data finishing time is the data generating time plus delta t, and the lending history information of the vacancies of the rest finishing time adopts the initial time recorded by the next lending action as the finishing time;
the sensing point is an RFID sensing point of the file cover, the common superior type of the heterogeneous files refers to a superior concept set of the heterogeneous files in type, the common superior type of the heterogeneous files comprises a multimedia type and a text type, the file storage library equipment comprises equipment for storing the heterogeneous files and at least comprises a file database and a file metadata database, the file communication destination type indicates the type of the file, at least comprises a training type and a renting and selling type, and the borrowing equipment represents electronic equipment for borrowing the files and at least comprises: the system comprises a mobile terminal and a PC, wherein the station data represents a storage site of the file, the keyword-electronic borrowing equipment-keyword represents an association relation between the borrowing equipment and the file keyword, the static archive-electronic borrowing equipment-static archive represents an association relation between the borrowing equipment and the archive, the distance checking calculation represents a distance between browsing user information analysis executed based on personal basic information data association personnel and discrete scattered file track data, the distance is used for representing a browsing path relation relativity of the file borrowed by a user, a main key of the borrowing equipment represents an identification ID of the borrowing equipment, and the equipment information data represents other information of the borrowing equipment, and at least comprises: time of borrowing, history of borrowing;
If the archive name contains a keyword capable of identifying multimedia, the archive type is multimedia, and the archive address and the coordinates are empty; if the archive names contain keywords capable of identifying video keywords and picture keywords, the archive types are visual archive types, and the other archive types follow the original archive types.
3. The heterogeneous archive based training data query method of claim 2, wherein: the specific implementation method of the step S1 comprises the following steps:
s1.1, browsing personnel data are obtained: acquiring file envelope RFID sensing data, file repository RFID sensing data, file security authority information, file browsing radiation range information, file directory information, personnel certificate type, certificate number, name, sex and date of birth information in the file management department borrowing registration data, forming personnel ID according to the combined certificate type and certificate number, and endowing personnel ID with unique identification personnel based on personnel basic data;
s1.2, removing duplication of browser data: when the information of the lending user is repeated, deleting repeated browsing personnel data based on the latest RFID sensing data of the lending user in the file cover RFID sensing data and the file storage library RFID sensing data;
S1.3, verifying correctness: performing correctness checking on personnel certificate numbers in the file envelope RFID sensing data and input data except the file repository RFID sensing data;
s1.4, the data according to the verification is file metadata data.
4. A heterogeneous archive based training data query method as claimed in claim 3, wherein: the specific implementation method for browsing user information analysis in the step S3 comprises the following steps:
s3.1, associating personnel ID based on personal basic information data, wherein the file browsing radiation range information comprises a borrowing browsing device main key, and acquiring borrowing browsing device data after associating with device information data; the file catalog information comprises a lending user file management fee paying unit and a paying initial year and month, and the personal file management fee paying data comprises a lending user file management fee paying unit, a paying year and month and a paying state, and the lending user file management fee paying unit, the paying year and month and the paying state are associated to acquire lending user borrowing group and tening period information; the borrowing registration data of the archive management department comprises borrowing group information, and can be directly integrated into personal borrowing group data; acquiring borrowing equipment coordinates, borrowing group addresses and coordinates based on a map service;
s3.2, performing browsing user information analysis based on the discrete archive track data;
S3.3, performing distance checking calculation on the information obtained in the step S3.1 and the information obtained in the step S3.2, if the calculation result is smaller than a set distance checking threshold value, the comparison is successful, otherwise, the comparison fails;
the specific implementation method of the step S3.2 comprises the following steps:
s3.2.1, obtaining RFID induction data of the file cover and the file repository in the near n days;
s3.2.2 the removed archive type is determined to be GPS positioning archive, video keywords, video electronic borrowing equipment, electronic borrowing keywords, multimedia, text archive keywords, network contract electronic borrowing equipment, purchasing channel keywords and purchasing channel data;
s3.2.3, when borrowing identification is performed based on the discrete file track data between 7 points and 10 points in the morning, borrowing identification is performed based on the discrete file track data between 18 points and 3 points in the next day;
s3.2.4, calculating daily latest RFID sensing data based on borrowing identification and discrete archive track data based on borrowing identification;
s3.2.5, obtaining borrowing date data in discrete file track data based on borrowing identification based on a date dimension table;
s3.2.6 grouping and counting according to personnel, archives and date, obtaining archives with statistics larger than 1/2 of the number of borrowing days in n days for borrowing identification, and calculating archives corresponding to the maximum statistics for borrowing identification;
S3.2.7, judging whether a plurality of places exist in the result, and if so, calculating the archive with the latest RFID sensing time.
5. The heterogeneous archive based training data query method of claim 4, wherein: the specific implementation method of the step S5 associated user inquiry comprises the following steps:
s5.1, carrying out related user inquiry based on output data of the browsing user information analysis device, inquiring personnel identical to the trusted lending user equipment ID/the borrowing group ID under the condition that the comparison is successful, inquiring personnel identical to the trusted lending user equipment ID/the borrowing group ID under the condition that the comparison is failed, and inquiring personnel identical to the borrowing archive ID identified by the trusted lending user according to the discrete archive track data and returning a result;
s5.2, inquiring the coincident track borrower: if the borrowing equipment and the borrowing ground are successfully compared, further acquiring the personnel inquiry of the same-floor height/same-floor number/same-partition/same-factory area of the trusted borrowing user on the basis of the equipment information data;
s5.3, inquiring the associated track borrower: if the borrowing group and the person are successfully compared with the borrowing ground and the person is frequently lived, layer height/partition/factory area information is further acquired according to the borrowing group and the person address, and accordingly the same-layer height/same-partition/same-factory area personnel inquiry is executed.
6. A heterogeneous archive based training data query system, comprising: the system comprises a file metadata acquisition device, a file circulation track integration device, a browse user information analysis device, a training data query device and an associated user query device;
the file metadata acquisition device obtains file envelope RFID sensing data, file repository RFID sensing data, file security authority information, file browsing radiation range information, file directory information and personnel basic data in the file management department borrowing registration data, so as to obtain file metadata data;
the file circulation track integrating device fuses file envelope RFID sensing data, file storage library RFID sensing data, file security authority information, paper borrowing circulation data, electronic borrowing circulation data, file purchase data and file training efficiency data, wherein the file circulation track integrating device comprises data of use information, and performs tracking and recording on the use information based on GPS positioning data to obtain discrete file track data;
the browsing user information analysis means performs browsing user information analysis based on the acquired data;
the training data query device executes training data query based on the acquired data;
the associated user query device executes associated user query based on the acquired data;
Step S4, training data query is executed based on the discrete file track data, and query is executed on the static archive according to rules:
max(end_time * +t τ1 ,end_time ** )-min(start_time * -tτ 1 ,start_time ** )≥t τ2 for lending data, the query may be performed on a regular basis:
end_time * +t τ3 ≥end_time ** ≥end_time *
wherein: * Representing a trusted lender; * Representing a lending user to be found; t is t τ1 A reservation time threshold is represented and used for expanding the time of the loan action of the trusted lender; t is t τ2 Representing a contact time threshold; t is t τ3 A threshold value indicating a lending data determination time; end_time is the lending behavior completion time, start_time is the lending behavior initial time, max is the maximum function, and min is the minimum function;
the system is used for executing a heterogeneous archive-based training data query method as claimed in any one of claims 2 to 5.
CN202310673393.1A 2023-06-08 2023-06-08 Training data query method and system based on heterogeneous archives Active CN116431686B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310673393.1A CN116431686B (en) 2023-06-08 2023-06-08 Training data query method and system based on heterogeneous archives

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310673393.1A CN116431686B (en) 2023-06-08 2023-06-08 Training data query method and system based on heterogeneous archives

Publications (2)

Publication Number Publication Date
CN116431686A CN116431686A (en) 2023-07-14
CN116431686B true CN116431686B (en) 2023-09-01

Family

ID=87085758

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310673393.1A Active CN116431686B (en) 2023-06-08 2023-06-08 Training data query method and system based on heterogeneous archives

Country Status (1)

Country Link
CN (1) CN116431686B (en)

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101283353A (en) * 2005-08-03 2008-10-08 温克科技公司 Systems for and methods of finding relevant documents by analyzing tags
CN103914047A (en) * 2014-03-28 2014-07-09 北京市第一中级人民法院 Intelligent archive management control system and method
CN106529812A (en) * 2016-11-20 2017-03-22 广西大学 Intelligent archives management system and application
CN107103529A (en) * 2016-02-23 2017-08-29 陈馨媛 Bank Profile management system based on SOA frameworks
CN108388619A (en) * 2018-02-10 2018-08-10 河南圣佳电子科技有限公司 Smart profile room real time data report display systems
CN108629929A (en) * 2018-04-26 2018-10-09 王艳华 A kind of archive management device and archives lend automated management system
CN109241376A (en) * 2018-08-24 2019-01-18 山东浪潮通软信息科技有限公司 A kind of electronic records management device and method
CN109241352A (en) * 2018-06-28 2019-01-18 平安科技(深圳)有限公司 The acquisition methods and server of Profile information
CN109857827A (en) * 2019-01-31 2019-06-07 山东省国土测绘院 A kind of geography information archives integrated management approach and system
CN110516020A (en) * 2019-08-15 2019-11-29 韩伶俐 A kind of territorial resource archives management system based on digital city geo-spatial framework

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101283353A (en) * 2005-08-03 2008-10-08 温克科技公司 Systems for and methods of finding relevant documents by analyzing tags
CN103914047A (en) * 2014-03-28 2014-07-09 北京市第一中级人民法院 Intelligent archive management control system and method
CN107103529A (en) * 2016-02-23 2017-08-29 陈馨媛 Bank Profile management system based on SOA frameworks
CN106529812A (en) * 2016-11-20 2017-03-22 广西大学 Intelligent archives management system and application
CN108388619A (en) * 2018-02-10 2018-08-10 河南圣佳电子科技有限公司 Smart profile room real time data report display systems
CN108629929A (en) * 2018-04-26 2018-10-09 王艳华 A kind of archive management device and archives lend automated management system
CN109241352A (en) * 2018-06-28 2019-01-18 平安科技(深圳)有限公司 The acquisition methods and server of Profile information
CN109241376A (en) * 2018-08-24 2019-01-18 山东浪潮通软信息科技有限公司 A kind of electronic records management device and method
CN109857827A (en) * 2019-01-31 2019-06-07 山东省国土测绘院 A kind of geography information archives integrated management approach and system
CN110516020A (en) * 2019-08-15 2019-11-29 韩伶俐 A kind of territorial resource archives management system based on digital city geo-spatial framework

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
马威.基于.NET的检察院档案管理系统设计与实现.中国优秀硕士学位论文全文数据库 (信息科技辑).2012,(第7期),I138-662. *

Also Published As

Publication number Publication date
CN116431686A (en) 2023-07-14

Similar Documents

Publication Publication Date Title
US11240273B2 (en) Data processing and scanning systems for generating and populating a data inventory
US11347889B2 (en) Data processing systems for generating and populating a data inventory
US10437860B2 (en) Data processing systems for generating and populating a data inventory
US11144670B2 (en) Data processing systems for identifying and modifying processes that are subject to data subject access requests
US10438016B2 (en) Data processing systems for generating and populating a data inventory
US10803097B2 (en) Data processing systems for generating and populating a data inventory
US10438020B2 (en) Data processing systems for generating and populating a data inventory for processing data access requests
US20210256161A1 (en) Data processing systems for generating and populating a data inventory for processing data access requests
US7747521B2 (en) System and method for monitoring events associated with a person or property
US20090187657A1 (en) Content asset management system, method and control program
US20140279591A1 (en) Network-based real estate marketplace database and location-based matching
WO2006002179A2 (en) Evaluating the relevance of documents and systems and methods therefor
US10970675B2 (en) Data processing systems for generating and populating a data inventory
US11222309B2 (en) Data processing systems for generating and populating a data inventory
CN116431686B (en) Training data query method and system based on heterogeneous archives
US20210303603A1 (en) Data processing systems for generating and populating a data inventory
WO2019023510A1 (en) Data processing systems for generating and populating a data inventory

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant