WO2022236973A1 - Data backfilling method and apparatus, electronic device, and storage medium

Info

Publication number: WO2022236973A1
Authority: WO, WIPO (PCT)
Prior art keywords: information, log data, data, current, log
Application number: PCT/CN2021/110210
Other languages: French (fr), Chinese (zh)
Inventors: 宋同珍, 万月亮, 火一莽
Original Assignee: 北京锐安科技有限公司
Application filed by: 北京锐安科技有限公司

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10 File systems; File servers
    • G06F16/18 File system types
    • G06F16/1805 Append-only file systems, e.g. using logs or journals to store data
    • G06F16/1815 Journaling file systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28 Databases characterised by their database models, e.g. relational or object models
    • G06F16/284 Relational databases

Definitions

  • The embodiments of the present application relate to the technical field of data processing, for example, to a data backfilling method and apparatus, an electronic device, and a storage medium.
  • This application provides a data backfilling method and apparatus, an electronic device, and a storage medium that can backfill key attention information into data that lacks it, so that the key attention information is available in different scenarios and the value of the data is maximized.
  • An embodiment of the present application provides a data backfilling method, the method comprising:
  • An embodiment of the present application also provides a data backfilling apparatus, which includes a parsing module, a detection module, a search module, and a backfill module, wherein:
  • the parsing module is configured to extract current log data from a data stream to be processed, and to parse field information out of the current log data according to a log data format;
  • the detection module is configured to detect, based on the field information in the current log data, whether association information and key attention information exist in the current log data;
  • the search module is configured to, in response to a detection result that the association information exists in the current log data and the key attention information does not, search an association relationship database for the key attention information based on the association information;
  • the backfill module is configured to, in response to a search result that the key attention information is found in the association relationship database, backfill the key attention information into the current log data and store the backfilled current log data in a log database.
  • An embodiment of the present application provides an electronic device, including:
  • one or more processors; and
  • a memory configured to store one or more programs,
  • wherein the one or more processors are configured to execute the one or more programs to implement the data backfilling method described in any embodiment of the present application.
  • An embodiment of the present application provides a storage medium on which a computer program is stored; when the computer program is executed by a processor, the data backfilling method described in any embodiment of the present application is implemented.
  • FIG. 1 is a schematic flowchart of a data backfilling method provided in an embodiment of the present application.
  • FIG. 2 is a schematic flowchart of another data backfilling method provided in an embodiment of the present application.
  • FIG. 3 is a schematic flowchart of another data backfilling method provided in an embodiment of the present application.
  • FIG. 4 is a schematic structural diagram of a data backfilling apparatus provided in an embodiment of the present application.
  • FIG. 5 is a schematic structural diagram of an electronic device provided in an embodiment of the present application.
  • FIG. 1 is a schematic flowchart of a data backfilling method provided by an embodiment of the present application.
  • The method can be performed by a data backfilling apparatus or an electronic device, and the apparatus or electronic device can be implemented in software and/or hardware.
  • The apparatus or electronic device can be integrated into any smart device with a network communication function.
  • The data backfilling method may include the following steps:
  • The electronic device can extract the current log data from the data stream to be processed and parse multiple pieces of field information out of the current log data according to the log data format.
  • The electronic device can extract the current session from the data stream to be processed; extract log information from the current session; perform structured processing on the log information to obtain structured log information; output the structured log information to a structured file; and use the structured file as the current log data.
  • The electronic device may detect, based on the multiple pieces of field information in the current log data, whether association information and key attention information exist in the current log data.
  • Each piece of log data may include multiple fields, which include at least a field corresponding to the association information and a field corresponding to the key attention information. The electronic device can therefore check whether a value is present under the field corresponding to the association information and, likewise, whether a value is present under the field corresponding to the key attention information.
  • If the value of the current log data in the field corresponding to the association information is empty, the current log data has no association information; if that value is not empty, the current log data has association information.
  • The key attention information is information of interest that is preset by the user or set by default; it may be log information corresponding to at least one field and/or at least one format.
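  • As a minimal sketch of the detection step described above, a record can be classified by whether its two fields are empty. The field names `assoc_id` and `focus_info`, the `Detection` enum, and the helper functions are hypothetical; the application does not name its fields or prescribe a representation.

```python
from enum import Enum
from typing import Mapping, Optional

# Hypothetical field names; the application does not specify them.
ASSOC_FIELD = "assoc_id"
FOCUS_FIELD = "focus_info"


class Detection(Enum):
    BOTH = "both"              # association and key attention information present
    ASSOC_ONLY = "assoc_only"  # candidate for backfilling
    NO_ASSOC = "no_assoc"      # no association information to search with


def has_value(record: Mapping[str, Optional[str]], field: str) -> bool:
    """A field counts as present only if its value is non-empty."""
    value = record.get(field)
    return value is not None and value.strip() != ""


def detect(record: Mapping[str, Optional[str]]) -> Detection:
    """Classify one parsed log record by which of the two fields are filled."""
    if not has_value(record, ASSOC_FIELD):
        return Detection.NO_ASSOC
    if has_value(record, FOCUS_FIELD):
        return Detection.BOTH
    return Detection.ASSOC_ONLY
```

Only records classified as `ASSOC_ONLY` would proceed to a search of the association relationship database.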
  • The electronic device may search the association relationship database for the key attention information based on the association information.
  • The association information in this embodiment may be an identifier (ID) of each piece of log data, or any other identifier that uniquely identifies each piece of log data. The electronic device can therefore use the association information of each piece of log data to search the association relationship database for the key attention information corresponding to that log data.
  • The electronic device may backfill the key attention information into the current log data and store the backfilled current log data in the log database. For example, the electronic device may backfill the found key attention information into the corresponding position based on the association information of the current log data.
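  • The search and backfill steps above might look like the following sketch, which uses an in-memory SQLite table as a stand-in for the association relationship database. The table name `association`, the column names, and the function names are assumptions made for illustration only.

```python
import sqlite3
from typing import Dict, Optional


def open_association_db() -> sqlite3.Connection:
    """Create an in-memory stand-in for the association relationship database."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE association ("
        "assoc_id TEXT PRIMARY KEY, focus_info TEXT NOT NULL)"
    )
    return conn


def lookup_focus(conn: sqlite3.Connection, assoc_id: str) -> Optional[str]:
    """Return the key attention information for assoc_id, or None if absent."""
    row = conn.execute(
        "SELECT focus_info FROM association WHERE assoc_id = ?", (assoc_id,)
    ).fetchone()
    return row[0] if row else None


def backfill(conn: sqlite3.Connection, record: Dict[str, str]) -> Dict[str, str]:
    """Return a copy of record with focus_info filled in when a match exists."""
    result = dict(record)
    assoc_id = result.get("assoc_id", "")
    if assoc_id and not result.get("focus_info"):
        focus = lookup_focus(conn, assoc_id)
        if focus is not None:
            result["focus_info"] = focus
    return result
```

A record whose association information has no match is returned unchanged, so it can still be stored and retried later.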
  • The data backfilling method proposed in this embodiment extracts the current log data from the data stream to be processed; parses field information out of the current log data according to the log data format; detects, based on that field information, whether association information and key attention information exist in the current log data; in response to a detection result that the association information exists and the key attention information does not, searches the association relationship database for the key attention information based on the association information; and, in response to finding the key attention information in the association relationship database, backfills it into the current log data and stores the backfilled current log data in the log database.
  • The association information and key attention information found in different log data can be stored in the association relationship database, so that the corresponding key attention information can later be looked up by association information and backfilled into log data that lacks it.
  • The processing methods in the related art cannot extract the full value of the key attention information.
  • The data backfilling method proposed in this embodiment can backfill key attention information into data that lacks it, so that the information is available in different scenarios and the value of the data is maximized.
  • The technical solution of the embodiments is simple to implement, easy to popularize, and widely applicable.
  • FIG. 2 is a schematic flowchart of another data backfilling method provided by an embodiment of the present application.
  • This embodiment extends the foregoing embodiment, and Embodiment 2 may also be combined with the other foregoing embodiments.
  • The data backfilling method may include the following steps:
  • The electronic device may, at a predetermined period, extract from the log database at least one piece of log data that contains association information but no key attention information. For example, the electronic device can store such log data in the log database in advance and then, at the predetermined period, extract it from the log database.
  • The electronic device can extract only a predetermined amount of log data from the log database, or extract all of the log data.
  • The association relationship database grows and shrinks dynamically, so the same query issued at different times may return different results: the key attention information corresponding to a given piece of association information may be found at one time and not at another.
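  • One pass of the periodic re-backfilling described above could be sketched as follows. Plain Python containers stand in for the log database and the association relationship database, the field names are hypothetical, and the scheduler that would call the function at the predetermined period is omitted.

```python
from typing import Dict, List, Optional

Record = Dict[str, str]


def rebackfill_pass(
    log_db: List[Record],
    associations: Dict[str, str],
    limit: Optional[int] = None,
) -> int:
    """Run one periodic pass and return how many records were backfilled.

    Only stored records that have association information but no key
    attention information are considered; `limit` caps how many of them
    are extracted in this pass (None means all of them).
    """
    pending = [r for r in log_db if r.get("assoc_id") and not r.get("focus_info")]
    if limit is not None:
        pending = pending[:limit]

    filled = 0
    for record in pending:
        focus = associations.get(record["assoc_id"])
        if focus is not None:
            record["focus_info"] = focus  # update the stored record in place
            filled += 1
    return filled
```

Because the association data changes over time, a record left unfilled in one pass may be filled in a later one.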
  • FIG. 3 is a schematic flowchart of another data backfilling method provided by an embodiment of the present application.
  • This embodiment extends the foregoing embodiments, and Embodiment 3 may be combined with the other foregoing embodiments.
  • The data backfilling method may include the following steps:
  • The electronic device can extract the current session from the data stream to be processed and extract log information from the current session.
  • The electronic device may extract multiple sessions from the data to be processed according to the data format corresponding to each session, each of which serves as the current session, and then extract log information from the current session according to the data format corresponding to the log information.
  • The electronic device can perform structured processing on the log information to obtain structured log information; output the structured log information to a structured file; use the structured file as the current log data; and parse multiple pieces of field information out of the current log data according to the log data format.
  • The data in the structured file is structured data.
  • Structured data, also called row data, is data that is logically expressed and implemented as a two-dimensional table. It strictly follows data format and length specifications and is mainly stored and managed in relational databases.
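  • To illustrate the structuring step, the sketch below turns raw log lines into a two-dimensional table written as CSV and reads it back as field dictionaries. The `key=value;key=value` input format, the field list, and the function names are assumptions; the application does not define a concrete log or file format.

```python
import csv
import io
from typing import Dict, Iterable, List

# Hypothetical, fixed column layout of the structured file.
FIELDS = ["assoc_id", "action", "focus_info"]


def parse_log_line(line: str) -> Dict[str, str]:
    """Parse 'k=v;k=v' pairs; unknown keys are dropped, missing fields are empty."""
    found: Dict[str, str] = {}
    for item in line.strip().split(";"):
        if "=" in item:
            key, value = item.split("=", 1)
            found[key.strip()] = value.strip()
    return {name: found.get(name, "") for name in FIELDS}


def write_structured(lines: Iterable[str], out: io.TextIOBase) -> None:
    """Write one table row per non-blank log line to the structured file."""
    writer = csv.DictWriter(out, fieldnames=FIELDS)
    writer.writeheader()
    for line in lines:
        if line.strip():
            writer.writerow(parse_log_line(line))


def read_structured(src: io.TextIOBase) -> List[Dict[str, str]]:
    """Read the structured file back as a list of field dictionaries."""
    return list(csv.DictReader(src))
```

Every row has the same columns, so a missing field shows up as an empty value that the detection step can test for.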
  • The data backfilling method proposed in this embodiment extracts the current log data from the data stream to be processed; parses multiple pieces of field information out of the current log data according to the log data format; detects, based on that field information, whether association information and key attention information exist in the current log data; if the association information exists and the key attention information does not, searches the association relationship database for the key attention information based on the association information; and, if the key attention information is found there, backfills it into the current log data and stores the backfilled current log data in the log database.
  • FIG. 4 is a schematic structural diagram of a data backfilling apparatus provided by an embodiment of the present application.
  • The data backfilling apparatus 400 includes a parsing module 401, a detection module 402, a search module 403, and a backfill module 404, wherein:
  • the parsing module 401 is configured to extract current log data from a data stream to be processed, and to parse field information out of the current log data according to a log data format;
  • the detection module 402 is configured to detect, based on the field information in the current log data, whether association information and key attention information exist in the current log data;
  • the search module 403 is configured to, in response to a detection result that the association information exists in the current log data and the key attention information does not, search an association relationship database for the key attention information based on the association information;
  • the backfill module 404 is configured to, in response to a search result that the key attention information is found in the association relationship database, backfill the key attention information into the current log data and store the backfilled current log data in a log database.
  • The backfill module 404 is further configured to, in response to a detection result that both the association information and the key attention information exist in the current log data, store the association information and the key attention information in the association relationship database and store the current log data in the log database.
  • The backfill module 404 is further configured to store the current log data in the log database in response to a search result that the key attention information is not found in the association relationship database.
  • The apparatus further includes an extraction module.
  • The extraction module is configured to extract from the log database, at a predetermined period, at least one piece of log data that contains the association information but not the key attention information.
  • The search module 403 is further configured to search the association relationship database for the key attention information of the at least one piece of log data, based on the association information in that log data.
  • The backfill module 404 is further configured to, in response to finding the key attention information of the at least one piece of log data in the association relationship database, backfill that key attention information into the at least one piece of log data and store the backfilled log data in the log database.
  • The parsing module 401 is configured to extract the current log data from the data stream to be processed as follows: extract the current session from the data stream to be processed; extract log information from the current session; perform structured processing on the log information to obtain structured log information; output the structured log information to a structured file; and use the structured file as the current log data.
  • The above data backfilling apparatus can execute the method provided by any embodiment of the present application and has the functional modules and beneficial effects corresponding to that method.
  • For technical details not described in this embodiment, refer to the data backfilling method provided in any embodiment of this application.
  • FIG. 5 is a schematic structural diagram of an electronic device provided by an embodiment of the present application.
  • FIG. 5 shows a block diagram of an exemplary electronic device suitable for implementing embodiments of the present application.
  • The electronic device 12 shown in FIG. 5 is only an example.
  • The electronic device 12 takes the form of a general-purpose computing device.
  • Components of the electronic device 12 may include one or more processors or processing units 16, a system memory 28, and a bus 18 connecting the various system components, including the system memory 28 and the processing unit 16.
  • The bus 18 represents one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, and a processor or local bus using any of a variety of bus architectures.
  • These architectures include, for example, the Industry Standard Architecture (ISA) bus, the Micro Channel Architecture (MCA) bus, the Enhanced ISA bus, the Video Electronics Standards Association (VESA) local bus, and the Peripheral Component Interconnect (PCI) bus.
  • The electronic device 12 typically includes a variety of computer-system-readable media. These media can be any available media that can be accessed by the electronic device 12, including volatile and non-volatile media and removable and non-removable media.
  • The system memory 28 may include computer-system-readable media in the form of volatile memory, such as random access memory (RAM) 30 and/or cache memory 32.
  • The electronic device 12 may include other removable/non-removable, volatile/non-volatile computer system storage media.
  • A storage system 34 may be used to read from and write to a non-removable, non-volatile magnetic medium (not shown in FIG. 5; commonly referred to as a hard disk drive).
  • A magnetic disk drive for reading from and writing to a removable non-volatile magnetic disk (such as a floppy disk) and an optical disk drive for reading from and writing to a removable non-volatile optical disc may also be provided.
  • The removable non-volatile optical disc is, for example, a compact disc read-only memory (CD-ROM), a digital video disc read-only memory (DVD-ROM), or other optical media.
  • Each drive may be connected to the bus 18 via one or more data media interfaces.
  • The system memory 28 may include at least one program product having a set (e.g., at least one) of program modules configured to perform the functions of the various embodiments of the present application.
  • A program/utility 40 having a set (e.g., at least one) of program modules 46 may be stored in the system memory 28. Such program modules 46 include an operating system, one or more application programs, other program modules, and program data; each of these examples, or some combination of them, may include an implementation of a network environment.
  • The program modules 46 generally perform the functions and/or methods of the embodiments described herein.
  • The electronic device 12 may also communicate with one or more external devices 14 (e.g., a keyboard, a pointing device, a display 24), with one or more devices that enable a user to interact with the electronic device 12, and/or with any device (e.g., a network card, a modem) that enables the electronic device 12 to communicate with one or more other computing devices. Such communication may occur through an input/output (I/O) interface 22.
  • The electronic device 12 can also communicate through the network adapter 20 with one or more networks, such as a local area network (LAN), a wide area network (WAN), and/or a public network such as the Internet. As shown in FIG. 5, the network adapter 20 communicates with the other modules of the electronic device 12 via the bus 18.
  • Other hardware and/or software modules can be used in conjunction with the electronic device 12, including microcode, device drivers, redundant processing units, external disk drive arrays, Redundant Arrays of Independent Disks (RAID) systems, tape drives, and data backup storage systems.
  • The processing unit 16 executes various functional applications and data processing by running the programs stored in the system memory 28, for example implementing the data backfilling method provided by the embodiments of the present application.
  • Embodiment 6 of the present application provides a computer storage medium.
  • The computer-readable storage medium in the embodiments of the present application may use any combination of one or more computer-readable media.
  • A computer-readable medium may be a computer-readable signal medium or a computer-readable storage medium.
  • A computer-readable storage medium may be, for example, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination thereof. Examples of computer-readable storage media include an electrical connection with one or more conductors, a portable computer diskette, a hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM), flash memory, optical fiber, portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above.
  • A computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in combination with an instruction execution system, apparatus, or device.
  • A computer-readable signal medium may include a data signal that is propagated in baseband or as part of a carrier wave and that carries computer-readable program code. Such a propagated data signal may take many forms, including electromagnetic signals, optical signals, or any suitable combination of the foregoing.
  • A computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transport a program for use by or in conjunction with an instruction execution system, apparatus, or device.
  • Program code contained on a computer-readable medium can be transmitted over any appropriate medium, including wireless, wire, optical cable, radio frequency (RF), or any appropriate combination of the above.
  • Computer program code for performing the operations of the present application may be written in one or more programming languages or combinations thereof, including object-oriented programming languages such as Java, Smalltalk, and C++, and conventional procedural programming languages such as the C language or similar programming languages.
  • The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on a remote computer or server.
  • Where a remote computer is involved, the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (for example, through the Internet using an Internet service provider).

Abstract

A data backfilling method and apparatus, an electronic device, and a storage medium. The method comprises: extracting current log data from a data stream to be processed; parsing field information out of the current log data according to a log data format; detecting, on the basis of the field information in the current log data, whether association information and key attention information exist in the current log data; in response to a detection result that the association information exists in the current log data and the key attention information does not, searching an association relationship database for the key attention information on the basis of the association information; in response to a search result that the key attention information is found in the association relationship database, backfilling the key attention information into the current log data; and storing the backfilled current log data in a log database.

Description

Data backfilling method and apparatus, electronic device, and storage medium
This disclosure claims priority to Chinese patent application No. 202110500895.5, filed with the China Patent Office on May 8, 2021, the entire content of which is incorporated into this disclosure by reference.
Technical Field
The embodiments of the present application relate to the technical field of data processing, for example, to a data backfilling method and apparatus, an electronic device, and a storage medium.
Background
Many Internet applications generate data that carries different information under different operations: some operations carry content of specific interest, while others do not. In addition, because handsets differ, data generated by different phones under the same operation of the same application may carry different information. For example, a phone running iOS and a phone running Android may produce data carrying different information under the same operation.
This inconsistency in the information carried by data generated under the same operation creates difficulties for data processing methods in the related art. For example, some information of specific interest exists only in the data of a certain operation or of a certain terminal, yet that information matters just as much for data from other operations or other terminals. The processing methods in the related art therefore cannot extract the full value of such key attention information.
Summary
This application provides a data backfilling method and apparatus, an electronic device, and a storage medium that can backfill key attention information where it is missing, so that the key attention information is available in different scenarios and the value of the data is realized and increased to the greatest extent.
In a first aspect, an embodiment of the present application provides a data backfilling method, the method comprising:
extracting current log data from a data stream to be processed, and parsing field information out of the current log data according to a log data format;
detecting, based on the field information in the current log data, whether association information and key attention information exist in the current log data;
in response to a detection result that the association information exists in the current log data and the key attention information does not, searching an association relationship database for the key attention information based on the association information; and
in response to a search result that the key attention information is found in the association relationship database, backfilling the key attention information into the current log data, and storing the backfilled current log data in a log database.
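The steps of the first aspect can be put together in a short end-to-end sketch. It also covers the two complementary branches described for the backfill module: recording the pair when both pieces of information are present, and storing the record unchanged when nothing is found. The class name `BackfillPipeline`, the field names `assoc_id` and `focus_info`, and the use of in-memory containers in place of the two databases are assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List

Record = Dict[str, str]


@dataclass
class BackfillPipeline:
    # Stand-in for the association relationship database (assoc_id -> focus_info).
    associations: Dict[str, str] = field(default_factory=dict)
    # Stand-in for the log database.
    log_db: List[Record] = field(default_factory=list)

    def process(self, record: Record) -> Record:
        """Detect, search, backfill, and store one parsed log record."""
        record = dict(record)  # do not mutate the caller's record
        assoc_id = record.get("assoc_id", "")
        focus = record.get("focus_info", "")

        if assoc_id and focus:
            # Both present: remember the pair for later backfilling.
            self.associations[assoc_id] = focus
        elif assoc_id:
            # Association only: search, and backfill if a match exists.
            found = self.associations.get(assoc_id)
            if found is not None:
                record["focus_info"] = found

        # The record is stored whether or not it was backfilled.
        self.log_db.append(record)
        return record
```

In this sketch a record that arrives before its association is known is stored without key attention information; a periodic pass over the log database would be needed to fill it in later.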
In a second aspect, an embodiment of this application further provides a data backfilling apparatus, the apparatus comprising: a parsing module, a detection module, a search module, and a backfill module; wherein
the parsing module is configured to extract current log data from a data stream to be processed, and to parse field information from the current log data according to a log data format;
the detection module is configured to detect, based on the field information in the current log data, whether association information and focus information exist in the current log data;
the search module is configured to, in response to a detection result that the association information exists in the current log data and the focus information does not exist in the current log data, search an association relationship database for the focus information based on the association information;
the backfill module is configured to, in response to a search result that the focus information is found in the association relationship database, backfill the focus information into the current log data, and to store the backfilled current log data in a log database.
In a third aspect, an embodiment of this application provides an electronic device, comprising:
one or more processors;
a memory configured to store one or more programs,
wherein the one or more processors are configured to execute the one or more programs to implement the data backfilling method described in any embodiment of this application.
In a fourth aspect, an embodiment of this application provides a storage medium storing a computer program which, when executed by a processor, implements the data backfilling method described in any embodiment of this application.
Brief Description of the Drawings
FIG. 1 is a schematic flowchart of a data backfilling method provided by an embodiment of this application;
FIG. 2 is a schematic flowchart of another data backfilling method provided by an embodiment of this application;
FIG. 3 is a schematic flowchart of another data backfilling method provided by an embodiment of this application;
FIG. 4 is a schematic structural diagram of a data backfilling apparatus provided by an embodiment of this application;
FIG. 5 is a schematic structural diagram of an electronic device provided by an embodiment of this application.
Detailed Description
This application is described below with reference to the accompanying drawings and embodiments.
Embodiment 1
FIG. 1 is a schematic flowchart of a data backfilling method provided by an embodiment of this application. The method may be performed by a data backfilling apparatus or an electronic device, which may be implemented in software and/or hardware and may be integrated into any smart device with a network communication function. As shown in FIG. 1, the data backfilling method may include the following steps:
S101. Extract current log data from a data stream to be processed; parse field information from the current log data according to a log data format.
In this step, the electronic device may extract the current log data from the data stream to be processed and parse multiple pieces of field information from the current log data according to the log data format. For example, the electronic device may extract a current session from the data stream to be processed; extract log information from the current session; structure the log information to obtain structured log information; output the structured log information to a structured file; and use the structured file as the current log data.
S102. Based on the field information in the current log data, detect whether association information and focus information exist in the current log data.
In this step, the electronic device may detect, based on the multiple pieces of field information in the current log data, whether association information and focus information exist in the current log data. For example, each piece of log data may include multiple fields, including at least a field corresponding to the association information and a field corresponding to the focus information. The electronic device can therefore check whether association information exists under the field corresponding to the association information and, likewise, whether focus information exists under the field corresponding to the focus information. For example, if the value of the current log data in the field corresponding to the association information is empty, the current log data contains no association information; if that value is not empty, the current log data contains association information.
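The empty-value check described in S102 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the source does not name concrete fields, so `session_id` (association information) and `account` (focus information) are hypothetical, as are the helper names.

```python
from typing import Any, Mapping, Tuple

# Hypothetical field names; the source does not specify concrete fields.
ASSOCIATION_FIELD = "session_id"
FOCUS_FIELD = "account"


def has_value(record: Mapping[str, Any], field: str) -> bool:
    """Return True when the field is present and its value is not empty."""
    value = record.get(field)
    return value is not None and str(value).strip() != ""


def detect(record: Mapping[str, Any]) -> Tuple[bool, bool]:
    """Return (association information exists, focus information exists)."""
    return has_value(record, ASSOCIATION_FIELD), has_value(record, FOCUS_FIELD)
```

A record for which `detect` returns `(True, False)` is the case that triggers the lookup in S103.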
Focus information refers to information of interest that is preset by the user or set by default; it may be log information corresponding to at least one field and/or at least one format.
S103. In response to a detection result that the association information exists in the current log data and the focus information does not exist in the current log data, search the association relationship database for the focus information based on the association information.
In this step, if the association information exists in the current log data and the focus information does not, the electronic device may search the association relationship database for the focus information based on the association information. For example, the association information in this embodiment may be an identifier (ID) of each piece of log data, or any other identifier that uniquely identifies each piece of log data. The electronic device can therefore use the association information of each piece of log data to look up the focus information of the corresponding log data in the association relationship database.
S104. In response to a search result that the focus information is found in the association relationship database, backfill the focus information into the current log data; store the backfilled current log data in the log database.
In this step, if the focus information is found in the association relationship database, the electronic device may backfill the focus information into the current log data and store the backfilled current log data in the log database. For example, the electronic device may, based on the association information of the current log data, backfill the found focus information into the corresponding position.
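Steps S103 and S104 amount to a keyed lookup followed by a field update. The sketch below is a minimal illustration, not the applicant's implementation: the association relationship database is modelled as an in-memory `dict`, the log database as a `list`, and the field names `session_id` and `account` are hypothetical.

```python
from typing import Dict, List, Optional

# Hypothetical field names; the source does not specify concrete fields.
ASSOCIATION_FIELD = "session_id"
FOCUS_FIELD = "account"


def backfill(record: Dict[str, str],
             association_db: Dict[str, str]) -> Optional[Dict[str, str]]:
    """Return a backfilled copy of the record, or None if nothing was backfilled."""
    key = record.get(ASSOCIATION_FIELD)
    if not key or record.get(FOCUS_FIELD):
        return None  # S103 precondition not met
    focus = association_db.get(key)  # S103: look up by association information
    if focus is None:
        return None  # lookup miss; this case is not covered by S104
    return {**record, FOCUS_FIELD: focus}  # S104: backfill the focus field


# Stand-ins for the two databases (assumptions for illustration).
association_db = {"s-1": "alice"}
log_db: List[Dict[str, str]] = []

filled = backfill({"session_id": "s-1", "account": "", "action": "download"},
                  association_db)
if filled is not None:
    log_db.append(filled)  # S104: store the backfilled log data
```

Returning a copy rather than mutating the input keeps the original record available to the caller, which is a design choice of this sketch rather than something the source requires.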
The data backfilling method proposed in this embodiment extracts current log data from a data stream to be processed; parses field information from the current log data according to a log data format; detects, based on the field information, whether association information and focus information exist in the current log data; in response to a detection result that the association information exists and the focus information does not, searches the association relationship database for the focus information based on the association information; and, in response to a search result that the focus information is found, backfills the focus information into the current log data and stores the backfilled current log data in the log database. In other words, in this embodiment the association information and focus information from different log data can be stored in the association relationship database, so that the corresponding focus information can be found there from the association information and backfilled into log data that lacks it.
The processing methods in the related art cannot bring out the full value of focus information. Compared with the related art, the data backfilling method proposed in this embodiment can backfill missing focus information, so that the focus information is reflected in different scenarios and the value of the data is realized and improved to the greatest extent. In addition, this embodiment is simple, easy to adopt, and widely applicable.
Embodiment 2
FIG. 2 is a schematic flowchart of another data backfilling method provided by an embodiment of this application. This embodiment may extend the foregoing embodiment, and Embodiment 2 may also be combined with the other embodiments described above. As shown in FIG. 2, the data backfilling method may include the following steps:
S201. Extract current log data from a data stream to be processed; parse field information from the current log data according to a log data format.
S202. Based on the field information in the current log data, detect whether association information and focus information exist in the current log data.
S203. In response to a detection result that the association information exists in the current log data and the focus information does not exist in the current log data, search the association relationship database for the focus information based on the association information.
S204. In response to a search result that the focus information is not found in the association relationship database, store the current log data in the log database.
S205. According to a predetermined period, extract from the log database at least one piece of log data in which the association information exists and the focus information does not exist.
In this step, the electronic device may, according to the predetermined period, extract from the log database at least one piece of log data in which the association information exists and the focus information does not. For example, log data that has association information but no focus information is first stored in the log database by the electronic device, which then extracts such log data from the log database according to the predetermined period.
In addition, the electronic device may extract only a predetermined number of such pieces of log data from the log database, or it may extract all of them.
S206. Based on the association information in the at least one piece of log data, search the association relationship database for the focus information of the at least one piece of log data.
The association relationship database is continually gaining and losing data, so the same query run at different times may return different results: the focus information corresponding to a piece of association information may be found at one time and not at another.
S207. In response to a search result that the focus information of the at least one piece of log data is found in the association relationship database, backfill that focus information into the respective pieces of log data; store the backfilled log data in the log database.
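One pass of the periodic procedure in S205 to S207 can be sketched as follows. This is an illustration under assumptions: the two databases are modelled as in-memory Python containers, the field names `session_id` and `account` are hypothetical, and scheduling the pass at the predetermined period is left to the caller.

```python
from typing import Dict, List, Optional

# Hypothetical field names; the source does not specify concrete fields.
ASSOCIATION_FIELD = "session_id"
FOCUS_FIELD = "account"


def periodic_backfill(log_db: List[Dict[str, str]],
                      association_db: Dict[str, str],
                      limit: Optional[int] = None) -> int:
    """Run one pass over the log database and return the number of records backfilled."""
    # S205: extract records that have association information but no focus information.
    pending = [r for r in log_db
               if r.get(ASSOCIATION_FIELD) and not r.get(FOCUS_FIELD)]
    if limit is not None:
        pending = pending[:limit]  # extract only a predetermined number of records
    updated = 0
    for record in pending:
        focus = association_db.get(record[ASSOCIATION_FIELD])  # S206
        if focus is not None:
            record[FOCUS_FIELD] = focus  # S207: backfill in place, record stays stored
            updated += 1
    return updated
```

Because the association relationship database changes over time, a record that misses in one pass may be backfilled in a later pass, which is why the pass is repeated rather than run once.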
The data backfilling method proposed in this embodiment extracts current log data from a data stream to be processed; parses field information from the current log data according to a log data format; detects, based on the field information, whether association information and focus information exist in the current log data; in response to a detection result that the association information exists and the focus information does not, searches the association relationship database for the focus information based on the association information; and, in response to a search result that the focus information is found, backfills the focus information into the current log data and stores the backfilled current log data in the log database. In other words, in this embodiment the association information and focus information from different log data can be stored in the association relationship database, so that the corresponding focus information can be found there from the association information and backfilled into log data that lacks it.
The processing methods in the related art cannot bring out the full value of focus information. Compared with the related art, the data backfilling method proposed in this embodiment can backfill missing focus information, so that the focus information is reflected in different scenarios and the value of the data is realized and improved to the greatest extent. In addition, this embodiment is simple, easy to adopt, and widely applicable.
Embodiment 3
FIG. 3 is a schematic flowchart of another data backfilling method provided by an embodiment of this application. This embodiment may extend the foregoing embodiments, and Embodiment 3 may be combined with the other embodiments described above. As shown in FIG. 3, the data backfilling method may include the following steps:
S301. Extract a current session from the data stream to be processed; extract log information from the current session.
In this step, the electronic device may extract the current session from the data stream to be processed and extract log information from the current session. For example, the electronic device may extract multiple sessions from the data to be processed according to the data format corresponding to each session, taking each in turn as the current session, and extract the log information from the current session according to the data format corresponding to the log information.
S302. Structure the log information to obtain structured log information; output the structured log information to a structured file; use the structured file as the current log data; parse field information from the current log data according to the log data format.
In this step, the electronic device may structure the log information to obtain structured log information; output the structured log information to a structured file; use the structured file as the current log data; and parse multiple pieces of field information from the current log data according to the log data format. The data in the structured file is structured data. Structured data, also called row data, is data that is logically expressed and implemented as a two-dimensional table; it strictly follows data format and length specifications and is mainly stored and managed in relational databases.
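To make the structuring step in S302 concrete, the sketch below turns raw log text into fixed-column rows and writes them out as a table. Everything specific here is an assumption for illustration: the source does not define the raw log format (whitespace-separated `key=value` pairs are assumed), the column names are hypothetical, and an in-memory CSV stands in for the structured file.

```python
import csv
import io
from typing import Dict, List

# Hypothetical fixed schema for the two-dimensional table.
COLUMNS = ["session_id", "account", "action"]


def structure(raw: str) -> Dict[str, str]:
    """Parse 'key=value' pairs (assumed format) into one row with fixed columns."""
    pairs = dict(item.split("=", 1) for item in raw.split() if "=" in item)
    # Columns missing from the raw text become empty values.
    return {column: pairs.get(column, "") for column in COLUMNS}


def to_structured_file(rows: List[Dict[str, str]]) -> str:
    """Serialize rows as CSV text, standing in for the structured file."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=COLUMNS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

Fixing the column set up front means a field that is absent from the raw log appears as an empty value, which is exactly what the detection step later checks for.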
S303. Based on the field information in the current log data, detect whether association information and focus information exist in the current log data.
S304. In response to a detection result that the association information exists in the current log data and the focus information does not exist in the current log data, search the association relationship database for the focus information based on the association information.
S305. In response to a search result that the focus information is found in the association relationship database, backfill the focus information into the current log data; store the backfilled current log data in the log database.
The data backfilling method proposed in this embodiment extracts current log data from a data stream to be processed; parses multiple pieces of field information from the current log data according to a log data format; detects, based on the multiple pieces of field information, whether association information and focus information exist in the current log data; if the association information exists and the focus information does not, searches the association relationship database for the focus information based on the association information; and, if the focus information is found, backfills it into the current log data and stores the backfilled current log data in the log database. In other words, in this embodiment the association information and focus information from different log data can be stored in the association relationship database, so that the corresponding focus information can be found there from the association information and backfilled into log data that lacks it.
The processing methods in the related art cannot bring out the full value of focus information. Compared with the related art, the data backfilling method proposed in this embodiment can backfill missing focus information, so that the focus information is reflected in different scenarios and the value of the data is realized and improved to the greatest extent. In addition, this embodiment is simple, easy to adopt, and widely applicable.
Embodiment 4
FIG. 4 is a schematic structural diagram of a data backfilling apparatus provided by an embodiment of this application. As shown in FIG. 4, the data backfilling apparatus 400 includes: a parsing module 401, a detection module 402, a search module 403, and a backfill module 404; wherein
the parsing module 401 is configured to extract current log data from a data stream to be processed, and to parse field information from the current log data according to a log data format;
the detection module 402 is configured to detect, based on the field information in the current log data, whether association information and focus information exist in the current log data;
the search module 403 is configured to, in response to a detection result that the association information exists in the current log data and the focus information does not exist in the current log data, search the association relationship database for the focus information based on the association information;
the backfill module 404 is configured to, in response to a search result that the focus information is found in the association relationship database, backfill the focus information into the current log data, and to store the backfilled current log data in the log database.
In an embodiment, the backfill module 404 is further configured to, in response to a detection result that both the association information and the focus information exist in the current log data, store the association information and the focus information in the association relationship database, and store the current log data in the log database.
In an embodiment, the backfill module 404 is further configured to, in response to a search result that the focus information is not found in the association relationship database, store the current log data in the log database.
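The three behaviours of the backfill module described above (register, backfill, store unchanged) can be sketched as one dispatch function. This is a simplified illustration, not the apparatus itself: the databases are in-memory Python containers, and the field names `session_id` and `account` are hypothetical. The source does not say how log data without association information is handled; storing it unchanged is an assumption of this sketch.

```python
from typing import Dict, List

# Hypothetical field names; the source does not specify concrete fields.
ASSOCIATION_FIELD = "session_id"
FOCUS_FIELD = "account"


def handle(record: Dict[str, str],
           association_db: Dict[str, str],
           log_db: List[Dict[str, str]]) -> str:
    """Process one piece of log data and return which branch was taken."""
    key = record.get(ASSOCIATION_FIELD)
    focus = record.get(FOCUS_FIELD)
    if key and focus:
        # Both present: record the association so later log data can be backfilled.
        association_db[key] = focus
        outcome = "registered"
    elif key and key in association_db:
        # Association only, lookup hit: backfill the focus information.
        record = {**record, FOCUS_FIELD: association_db[key]}
        outcome = "backfilled"
    else:
        # Lookup miss (or, by assumption, no association information): store as-is.
        outcome = "stored as-is"
    log_db.append(record)  # every branch ends by storing the log data
    return outcome
```

Records stored through the last branch are the ones a later periodic pass over the log database can pick up once the association relationship database has the matching entry.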
In an embodiment, the apparatus further includes an extraction module;
the extraction module is configured to extract from the log database, according to a predetermined period, at least one piece of log data in which the association information exists and the focus information does not exist;
the search module 403 is further configured to search the association relationship database for the focus information of the at least one piece of log data based on the association information in the at least one piece of log data;
the backfill module 404 is further configured to, in response to a search result that the focus information of the at least one piece of log data is found in the association relationship database, backfill that focus information into the respective pieces of log data, and to store the backfilled log data in the log database.
In an embodiment, the parsing module 401 is configured to extract the current log data from the data stream to be processed in the following manner: extracting a current session from the data stream to be processed; extracting log information from the current session; structuring the log information to obtain structured log information; outputting the structured log information to a structured file; and using the structured file as the current log data.
The data backfilling apparatus described above can perform the method provided by any embodiment of this application, and has the functional modules and beneficial effects corresponding to the method. For technical details not described in this embodiment, refer to the data backfilling method provided by any embodiment of this application.
Embodiment 5
FIG. 5 is a schematic structural diagram of an electronic device provided by an embodiment of this application. FIG. 5 shows a block diagram of an exemplary electronic device suitable for implementing embodiments of this application. The electronic device 12 shown in FIG. 5 is merely one example.
As shown in FIG. 5, the electronic device 12 takes the form of a general-purpose computing device. The components of the electronic device 12 may include: one or more processors or processing units 16, a system memory 28, and a bus 18 connecting different system components (including the system memory 28 and the processing unit 16).
The bus 18 represents one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, a processor, or a local bus using any of a variety of bus structures. By way of example, these architectures include the Industry Standard Architecture (ISA) bus, the Micro Channel Architecture (MCA) bus, the Enhanced ISA bus, the Video Electronics Standards Association (VESA) local bus, and the Peripheral Component Interconnect (PCI) bus.
The electronic device 12 typically includes a variety of computer-system-readable media. These media may be any available media that can be accessed by the electronic device 12, including volatile and non-volatile media and removable and non-removable media.
The system memory 28 may include computer-system-readable media in the form of volatile memory, such as a random access memory (RAM) 30 and/or a cache 32. The electronic device 12 may include other removable/non-removable, volatile/non-volatile computer system storage media. By way of example only, a storage system 34 may be used to read from and write to a non-removable, non-volatile magnetic medium (not shown in FIG. 5, commonly called a hard disk drive). Although not shown in FIG. 5, a magnetic disk drive for reading from and writing to a removable non-volatile magnetic disk (such as a floppy disk) and an optical disc drive for reading from and writing to a removable non-volatile optical disc may also be provided. The removable non-volatile optical disc may be, for example, a Compact Disc Read-Only Memory (CD-ROM), a Digital Video Disc Read-Only Memory (DVD-ROM), or another optical medium.
In these cases, each drive may be connected to the bus 18 through one or more data media interfaces. The system memory 28 may include at least one program product having a set of (for example, at least one) program modules configured to perform the functions of the embodiments of this application.
A program/utility 40 having a set of (for example, at least one) program modules 46 may be stored in the system memory 28. Such program modules 46 include an operating system, one or more application programs, other program modules, and program data; each of these examples, or some combination of them, may include an implementation of a network environment. The program modules 46 generally perform the functions and/or methods of the embodiments described in this application.
The electronic device 12 may also communicate with one or more external devices 14 (such as a keyboard, a pointing device, or a display 24), with one or more devices that enable a user to interact with the electronic device 12, and/or with any device (such as a network card or a modem) that enables the electronic device 12 to communicate with one or more other computing devices. Such communication may take place through an input/output (I/O) interface 22. The electronic device 12 may also communicate with one or more networks, such as a local area network (LAN), a wide area network (WAN), and/or a public network such as the Internet, through a network adapter 20. As shown in FIG. 5, the network adapter 20 communicates with the other modules of the electronic device 12 through the bus 18. Other hardware and/or software modules may be used in conjunction with the electronic device 12, including microcode, device drivers, redundant processing units, external disk drive arrays, Redundant Arrays of Independent Disks (RAID) systems, tape drives, and data backup storage systems.
The processing unit 16 runs the programs stored in the system memory 28 to execute various functional applications and data processing, for example to implement the data backfilling method provided by the embodiments of this application.
Embodiment 6
Embodiment 6 of the present application provides a computer storage medium.
The computer-readable storage medium of the embodiments of the present application may be any combination of one or more computer-readable media. A computer-readable medium may be a computer-readable signal medium or a computer-readable storage medium. A computer-readable storage medium may be, for example, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the above. Examples of computer-readable storage media include: an electrical connection having one or more wires, a portable computer diskette, a hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM), flash memory, an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above. In this document, a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in connection with an instruction execution system, apparatus, or device.
A computer-readable signal medium may include a data signal that is propagated in baseband or as part of a carrier wave and that carries computer-readable program code. Such a propagated data signal may take many forms, including an electromagnetic signal, an optical signal, or any suitable combination of the above. A computer-readable signal medium may also be any computer-readable medium, other than a computer-readable storage medium, that can send, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.
Program code contained on a computer-readable medium may be transmitted over any suitable medium, including wireless, wire, optical cable, radio frequency (RF), and the like, or any suitable combination of the above.
Computer program code for carrying out the operations of the present application may be written in one or more programming languages or a combination thereof, including object-oriented programming languages such as Java, Smalltalk, and C++, as well as conventional procedural programming languages such as C or similar languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on a remote computer or server. Where a remote computer is involved, the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it may be connected to an external computer, for example through the Internet using an Internet service provider.

Claims (12)

  1. A data backfilling method, comprising:
    extracting current log data from a data stream to be processed; parsing field information from the current log data according to a log data format;
    detecting, based on the field information in the current log data, whether association information and key attention information exist in the current log data;
    in response to a detection result that the association information exists in the current log data and the key attention information does not exist in the current log data, searching an association relationship database for the key attention information based on the association information;
    in response to a search result that the key attention information is found in the association relationship database, backfilling the key attention information into the current log data; and storing the backfilled current log data in the log database.
  2. The method according to claim 1, further comprising:
    in response to a detection result that both the association information and the key attention information exist in the current log data, storing the association information and the key attention information in the association relationship database; and storing the current log data in the log database.
  3. The method according to claim 1, further comprising:
    in response to a search result that the key attention information is not found in the association relationship database, storing the current log data in the log database.
  4. The method according to claim 3, further comprising:
    extracting from the log database, at a predetermined period, at least one piece of log data in which the association information exists and the key attention information does not exist;
    searching the association relationship database for the key attention information of the at least one piece of log data based on the association information in the at least one piece of log data;
    in response to a search result that the key attention information of the at least one piece of log data is found in the association relationship database, backfilling the key attention information into the respective piece of log data; and storing the backfilled at least one piece of log data in the log database.
  5. The method according to claim 1, wherein extracting the current log data from the data stream to be processed comprises:
    extracting a current session from the data stream to be processed; extracting log information from the current session;
    performing structured processing on the log information to obtain structured log information; outputting the structured log information to a structured file; and using the structured file as the current log data.
  6. A data backfilling apparatus, comprising: a parsing module, a detection module, a search module, and a backfilling module; wherein
    the parsing module is configured to extract current log data from a data stream to be processed, and to parse field information from the current log data according to a log data format;
    the detection module is configured to detect, based on the field information in the current log data, whether association information and key attention information exist in the current log data;
    the search module is configured to, in response to a detection result that the association information exists in the current log data and the key attention information does not exist in the current log data, search an association relationship database for the key attention information based on the association information;
    the backfilling module is configured to, in response to a search result that the key attention information is found in the association relationship database, backfill the key attention information into the current log data, and to store the backfilled current log data in the log database.
  7. The apparatus according to claim 6, wherein the backfilling module is further configured to, in response to a detection result that both the association information and the key attention information exist in the current log data, store the association information and the key attention information in the association relationship database, and store the current log data in the log database.
  8. The apparatus according to claim 6, wherein the backfilling module is further configured to, in response to a search result that the key attention information is not found in the association relationship database, store the current log data in the log database.
  9. The apparatus according to claim 8, further comprising: an extraction module;
    the extraction module is configured to extract from the log database, at a predetermined period, at least one piece of log data in which the association information exists and the key attention information does not exist;
    the search module is further configured to search the association relationship database for the key attention information of the at least one piece of log data based on the association information in the at least one piece of log data;
    the backfilling module is further configured to, in response to a search result that the key attention information of the at least one piece of log data is found in the association relationship database, backfill the key attention information into the respective piece of log data, and to store the backfilled at least one piece of log data in the log database.
  10. The apparatus according to claim 6, wherein the parsing module is configured to extract the current log data from the data stream to be processed by: extracting a current session from the data stream to be processed; extracting log information from the current session; performing structured processing on the log information to obtain structured log information; outputting the structured log information to a structured file; and using the structured file as the current log data.
  11. An electronic device, comprising:
    one or more processors; and
    a memory configured to store one or more programs,
    wherein the one or more processors are configured to execute the one or more programs to implement the data backfilling method according to any one of claims 1 to 5.
  12. A storage medium storing a computer program which, when executed by a processor, implements the data backfilling method according to any one of claims 1 to 5.
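
The flow recited in method claims 1 to 4 can be illustrated with a short sketch. It is a minimal illustration under stated assumptions, not the applicant's implementation: the field names `session_id` and `account`, the `key=value;key=value` log format, and the use of an in-memory `dict` and `list` in place of the association relationship database and the log database are all hypothetical. Storing a record that carries no association information at all is likewise an assumption, since the claims do not address that case.

```python
# Hypothetical field names; the claims do not name concrete fields.
ASSOC_FIELD = "session_id"  # stands in for the association information
FOCUS_FIELD = "account"     # stands in for the key attention information


def parse_fields(raw: str) -> dict:
    """Parse one log line of the assumed form 'k1=v1;k2=v2' into fields."""
    fields = {}
    for part in raw.split(";"):
        key, sep, value = part.partition("=")
        if sep and value:
            fields[key.strip()] = value.strip()
    return fields


def process_log(raw: str, assoc_db: dict, log_db: list) -> dict:
    """Handle one piece of current log data (claims 1 to 3)."""
    record = parse_fields(raw)
    assoc = record.get(ASSOC_FIELD)
    focus = record.get(FOCUS_FIELD)
    if assoc and focus:
        # Claim 2: both present, so remember the association.
        assoc_db[assoc] = focus
    elif assoc:
        # Claim 1: association present, key attention information missing.
        found = assoc_db.get(assoc)
        if found is not None:
            record[FOCUS_FIELD] = found  # backfill
        # Claim 3: if nothing is found, the record is stored unchanged.
    # Assumption: records without association information are stored as-is.
    log_db.append(record)
    return record


def periodic_backfill(assoc_db: dict, log_db: list) -> int:
    """Claim 4: revisit stored records that still lack the key attention
    information and backfill them; returns how many were filled.
    A scheduler would call this at the predetermined period."""
    filled = 0
    for record in log_db:
        assoc = record.get(ASSOC_FIELD)
        if assoc and FOCUS_FIELD not in record:
            found = assoc_db.get(assoc)
            if found is not None:
                record[FOCUS_FIELD] = found
                filled += 1
    return filled
```

In this sketch, a record that arrives before its association is known is stored incomplete and is completed later by `periodic_backfill`, which mirrors the dependency of claim 4 on claim 3.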
PCT/CN2021/110210 2021-05-08 2021-08-03 Data backfilling method and apparatus, electronic device, and storage medium WO2022236973A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110500895.5A CN113204531A (en) 2021-05-08 2021-05-08 Data backfill method and device, electronic equipment and storage medium
CN202110500895.5 2021-05-08

Publications (1)

Publication Number Publication Date
WO2022236973A1 true WO2022236973A1 (en) 2022-11-17

Family

ID=77030582

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/110210 WO2022236973A1 (en) 2021-05-08 2021-08-03 Data backfilling method and apparatus, electronic device, and storage medium

Country Status (2)

Country Link
CN (1) CN113204531A (en)
WO (1) WO2022236973A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150220605A1 (en) * 2014-01-31 2015-08-06 Awez Syed Intelligent data mining and processing of machine generated logs
US20150242431A1 (en) * 2014-02-25 2015-08-27 Ca, Inc. Computer system log file analysis based on field type identification
CN106599104A (en) * 2016-11-29 2017-04-26 北京锐安科技有限公司 Mass data association method based on redis cluster
CN107947978A (en) * 2017-11-21 2018-04-20 北京锐安科技有限公司 The method and device of a kind of associated data
CN110457256A (en) * 2019-08-01 2019-11-15 大众问问(北京)信息科技有限公司 Date storage method, device, computer equipment and storage medium

Also Published As

Publication number Publication date
CN113204531A (en) 2021-08-03

Similar Documents

Publication Publication Date Title
US11120018B2 (en) Spark query method and system supporting trusted computing
CN107273280B (en) Log processing method and device, electronic equipment and storage medium
CN110069608B (en) Voice interaction method, device, equipment and computer storage medium
US9456229B2 (en) Parsing single source content for multi-channel publishing
WO2017206949A1 (en) Bullet screen content processing method, and application server and user terminal
US10755052B2 (en) Semantic disambiguation method, device, server and storage medium
CN109471851B (en) Data processing method, device, server and storage medium
CN112162965B (en) Log data processing method, device, computer equipment and storage medium
WO2022247442A1 (en) Symbol parsing method and apparatus, and device and storage medium
CN110618999A (en) Data query method and device, computer storage medium and electronic equipment
CN112463800A (en) Data reading method and device, server and storage medium
WO2020168763A1 (en) Data classification and storage method and apparatus of application program, device, and storage medium
CN109033456B (en) Condition query method and device, electronic equipment and storage medium
WO2019071907A1 (en) Method for identifying help information based on operation page, and application server
CN110737678B (en) Data searching method, device, equipment and storage medium
WO2022236973A1 (en) Data backfilling method and apparatus, electronic device, and storage medium
CN110287338B (en) Industry hotspot determination method, device, equipment and medium
CN112487025A (en) Data query method and device, electronic equipment and storage medium
CN111261157A (en) Control method, device and equipment for short video and storage medium
CN107992457B (en) Information conversion method, device, terminal equipment and storage medium
CN112866979B (en) User information association method, device, equipment and medium based on 5G service interface
CN114238391A (en) Data paging query method and device, electronic equipment and storage medium
CN114090514A (en) Log retrieval method and device for distributed system
CN109740027B (en) Data exchange method, device, server and storage medium
CN113760988A (en) Method, device, equipment and storage medium for associating and processing unbounded stream data

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application

Ref document number: 21941542

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE