CN109408640A - Log classification method, device and storage medium - Google Patents

Info

Publication number
CN109408640A
Authority
CN
China
Prior art keywords
log
tree structure
obtains
sequence
sort tree
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811300533.6A
Other languages
Chinese (zh)
Other versions
CN109408640B (en)
Inventor
孙木鑫
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Neusoft Corp
Original Assignee
Neusoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Neusoft Corp
Priority to CN201811300533.6A
Publication of CN109408640A
Application granted
Publication of CN109408640B
Current legal status: Active
Anticipated expiration

Landscapes

  • Debugging And Monitoring (AREA)

Abstract

The present invention provides a log classification method, device, and storage medium. An original log sequence of a log to be classified is obtained and preprocessed to yield a processed log sequence, and the processed log sequence is compared with a preset log classification tree structure to obtain the classification result of the log to be classified. The method enables the system to classify running logs automatically and improves the efficiency of log classification.

Description

Log classification method, device and storage medium
Technical field
Embodiments of the present invention relate to the technical field of log classification, and in particular to a log classification method, device, and storage medium.
Background
With the rapid development of Internet services, Internet enterprises pay increasing attention to the operation and maintenance of their service systems. The operation and maintenance of application servers directly affects the user experience an enterprise delivers and bears on its vital interests. Troubleshooting application service problems during operation and maintenance relies mainly on system logs, which are the most effective reference for judging the running state of a system. As server performance improves and application service projects grow ever larger, enterprise application services continuously generate massive log files. Classifying the running logs of application servers is therefore particularly important.
At present, when a problem occurs, most application servers rely on manual inspection to classify the system's running logs. However, as the volume of enterprise application logs grows, manually inspecting and classifying logs becomes inefficient. Automatic inspection and classification of application server logs is therefore urgently needed.
Summary of the invention
The log classification method, device, and storage medium provided by the present invention enable the system to classify running logs automatically and improve the efficiency of log classification.
A first aspect of the present invention provides a log classification method, comprising:
obtaining an original log sequence of a log to be classified;
preprocessing the original log sequence to obtain a processed log sequence;
comparing the log sequence with a preset log classification tree structure to obtain the classification result of the log to be classified.
In one possible implementation, the process of creating the log classification tree structure comprises:
obtaining a first log generated by the system within a preset period;
preprocessing the first log to obtain a second log;
reordering the log sequences of the second log according to a preset ordering rule to obtain a third log;
constructing the log classification tree structure according to the third log.
In one possible implementation, the second log includes a content field, and reordering the log sequences of the second log according to the preset ordering rule to obtain the third log comprises:
counting the frequency with which each word in the content field occurs within the preset period;
reordering the second log according to the frequency with which the words occur to obtain the third log.
In one possible implementation, constructing the log classification tree structure according to the third log comprises:
constructing an initial log classification tree structure according to the third log;
pruning the initial log classification tree structure according to a preset branch number to obtain the log classification tree structure.
A second aspect of the present invention provides a log classification device, comprising:
an acquisition module, configured to obtain an original log sequence of a log to be classified;
a preprocessing module, configured to preprocess the original log sequence to obtain a processed log sequence;
a classification module, configured to compare the log sequence with a preset log classification tree structure to obtain the classification result of the log to be classified.
In one possible implementation, the acquisition module is further configured to obtain a first log generated by the system within a preset period;
the preprocessing module is further configured to preprocess the first log to obtain a second log;
the device further comprises: a sorting module, configured to reorder the log sequences of the second log according to a preset ordering rule to obtain a third log;
a creation module, configured to construct the log classification tree structure according to the third log.
In one possible implementation, the second log includes a content field;
the sorting module is specifically configured to:
count the frequency with which each word in the content field occurs within the preset period;
reorder the second log according to the frequency with which the words occur to obtain the third log.
In one possible implementation, the creation module is specifically configured to:
construct an initial log classification tree structure according to the third log;
prune the initial log classification tree structure according to a preset branch number to obtain the log classification tree structure.
A third aspect of the present invention provides a log classification device, comprising:
a memory;
a processor; and
a computer program;
wherein the computer program is stored in the memory and is configured to be executed by the processor to implement the method according to any implementation of the first aspect of the present invention.
A fourth aspect of the present invention provides a computer-readable storage medium on which a computer program is stored, the computer program being executed by a processor to implement the method according to any implementation of the first aspect of the present invention.
In the log classification method, device, and storage medium provided by the present invention, an original log sequence of a log to be classified is obtained and preprocessed to yield a processed log sequence, and the log sequence is compared with a preset log classification tree structure to obtain the classification result of the log to be classified. The method enables the system to classify running logs automatically and improves the efficiency of log classification.
Brief description of the drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present invention and, together with the specification, serve to explain the principles of the invention.
Fig. 1 is a flowchart of the log classification method provided by an embodiment of the present invention;
Fig. 2 is a flowchart of the process of creating the log classification tree structure provided by an embodiment of the present invention;
Fig. 3 is a schematic diagram of constructing the log classification tree structure from the third log, provided by an embodiment of the present invention;
Fig. 4 is a structural diagram of the log classification device provided by an embodiment of the present invention;
Fig. 5 is a structural diagram of the log classification device provided by another embodiment of the present invention;
Fig. 6 is a hardware structural diagram of the log classification device provided by an embodiment of the present invention.
The above drawings show specific embodiments of the present invention, which are described in more detail below. The drawings and the written description are not intended to limit the scope of the inventive concept in any way, but to explain the concept of the invention to those skilled in the art by reference to specific embodiments.
Detailed description of the embodiments
Exemplary embodiments are described in detail here, and examples of them are illustrated in the accompanying drawings. Where the following description refers to the drawings, the same numerals in different drawings denote the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with the present invention. Rather, they are merely examples of devices and methods consistent with some aspects of the invention as detailed in the appended claims.
The terms "include" and "have" and any variations of them in the description and claims of the present invention are intended to cover non-exclusive inclusion. For example, a process, method, system, product, or device that contains a series of steps or units is not limited to the listed steps or units, but optionally also includes steps or units that are not listed, or optionally also includes other steps or units inherent to the process, method, product, or device.
"And/or" in the present invention describes an association between associated objects and indicates that three relationships are possible. For example, A and/or B can indicate three cases: A alone, both A and B, and B alone. The character "/" generally indicates an "or" relationship between the associated objects before and after it.
In the log classification method provided by the embodiments of the present invention, a log classification tree structure is created and the logs generated by the system are compared with the preset log classification tree structure to obtain the classification results of the logs. Operation and maintenance personnel do not need to inspect the logs manually to determine the log types, which improves the efficiency of log classification.
The technical solution of the present invention is described in detail below with specific embodiments.
Fig. 1 is a flowchart of the log classification method provided by an embodiment of the present invention. The method can be executed by any device that performs the log classification method, and the device can be implemented in software and/or hardware.
As shown in Fig. 1, the log classification method provided by this embodiment includes the following steps:
S101: obtain the original log sequence of the log to be classified.
In this embodiment, the original log sequence of the log to be classified includes a time field, a content field, and other character fields.
Because the system provides a variety of application services, a large number of system log sequences are generated continuously. The log sequences of different applications differ, mainly in the content field. As an example, the following is an original log sequence generated by a system application:
2018-01-21 20:54:45 101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker - 443 - 101.*.*.* - - 200 0 0 29
Here, "2018-01-21 20:54:45" is the time field;
"101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker - 443 - 101.*.*.* - - 200 0 0 29" is the content field, which contains other character fields such as "-" or spaces.
It should be noted that the original log sequence of the log to be classified in this embodiment may be a single log sequence or multiple log sequences; this embodiment places no specific limitation on this. If there are multiple original log sequences, the subsequent log processing and automatic log classification are performed on each original log sequence one by one.
S102: preprocess the original log sequence to obtain the processed log sequence.
In this embodiment, the log classification device first preprocesses the obtained original log sequence of the log to be classified and filters out unnecessary fields, such as the time field ("2018-01-21 20:54:45" in the example above) and other character fields (that is, useless character fields such as the "-" or the space field " " in the example above), so that only the content field of the original log sequence is retained.
As an example, the following is the log sequence after preprocessing:
101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker 443 101.*.*.* 200 0 0 29
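The preprocessing of S102 can be sketched as follows. This is a minimal illustration, not the patented implementation: it assumes the fields of a raw log line are separated by whitespace, that the time field has the form `YYYY-MM-DD HH:MM:SS`, and that "-" is the only placeholder token to discard. The name `preprocess` is hypothetical.

```python
import re
from typing import List

# Assumed layout: "<date> <time> <content ...>" with "-" as a placeholder token.
TIMESTAMP = re.compile(r"^\d{4}-\d{2}-\d{2}\s+\d{2}:\d{2}:\d{2}\s+")

def preprocess(raw_line: str) -> List[str]:
    """Drop the time field and placeholder tokens, keeping only content words."""
    content = TIMESTAMP.sub("", raw_line.strip())
    # split() also discards the space fields between words
    return [tok for tok in content.split() if tok != "-"]

raw = ("2018-01-21 20:54:45 101.201.*.* POST "
       "/mediasite/JobFarm/Controller.svc/Worker - 443 - 101.*.*.* - - 200 0 0 29")
print(" ".join(preprocess(raw)))
```

Returning a list of words rather than a joined string keeps the result ready for word-frequency counting and tree lookup.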
S103: compare the log sequence with the preset log classification tree structure to obtain the classification result of the log to be classified.
Specifically, the log classification tree structure of this embodiment is a tree used to sort and store large numbers of strings (though not only strings), and is commonly used for text word-frequency statistics in search engines. By building a tree structure of log sequences, this embodiment achieves automatic classification of system logs and improves the efficiency of classified queries over system logs.
Based on the preset log classification tree structure, the log classification device compares the log sequence processed in S102 with the log classification tree structure and can quickly obtain the classification result of the log, so that operation and maintenance personnel can analyze and troubleshoot the log according to the classification result.
Specifically, the fields of the log sequence processed in S102 are compared, in field order, with the nodes of each layer of the log classification tree structure, and the node of the log classification tree structure corresponding to each field of the log sequence is determined, up to the end field of the log sequence. The node of the log classification tree structure corresponding to the end field of the log sequence is taken as the classification result of the log.
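The layer-by-layer comparison can be illustrated with a short sketch. It assumes the tree is stored as nested dictionaries (each key a word, each value the subtree below it), that the words of the incoming log are already arranged in the same order used when the tree was built, and that the walk stops at the deepest matching node. The function name `classify` and the sample tree are hypothetical.

```python
def classify(tokens, tree):
    """Walk the tree one word per layer and return the last node matched."""
    node, label = tree, None
    for tok in tokens:
        child = node.get(tok)
        if child is None:      # no node for this word on the current layer
            break
        node, label = child, tok
    return label

# Hypothetical two-class tree whose terminal nodes are "200" and "500".
sample_tree = {"0": {"0": {"POST": {"200": {}, "500": {}}}}}
result = classify(["0", "0", "POST", "500", "39"], sample_tree)
print(result)
```

Each lookup is a single dictionary access, so classifying one log costs time proportional to the number of words in that log, independent of how many historical logs built the tree.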
In the log classification method provided by the embodiment of the present invention, the original log sequence of the log to be classified is obtained and preprocessed to yield the processed log sequence, and the log sequence is compared with the preset log classification tree structure to obtain the classification result of the log to be classified. The method enables the system to classify running logs automatically and improves the efficiency of log classification.
The log classification method of the above embodiment classifies the log to be classified according to the preset log classification tree structure and gives good query and classification results. The process of creating the log classification tree structure used in the above embodiment is described in detail below with reference to the drawings.
Fig. 2 is a flowchart of the process of creating the log classification tree structure provided by an embodiment of the present invention. As shown in Fig. 2, the process of creating the log classification tree structure provided by this embodiment includes the following steps:
S201: obtain the first log generated by the system within a preset period.
In this embodiment, the first log consists of all historical log sequences generated by the system within the preset period. As an example, the first log generated by the system within the preset period is as follows:
2018-01-21 20:54:45 101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker - 443 - 101.*.*.* - - 200 0 0 29
2018-01-21 20:54:45 101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker - 443 - 101.*.*.* - - 500 0 0 39
2018-01-21 20:54:45 101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker - 443 - 101.*.*.* - - 200 0 0 129
2018-01-21 20:54:45 101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker - 443 - 101.*.*.* - - 500 0 0 339
2018-01-21 20:54:45 101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker - 443 - 101.*.*.* - - 200 0 0 52
2018-01-21 20:54:45 101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker - 443 - 101.*.*.* - - 200 0 0 54
2018-01-21 20:54:45 101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker - 443 - 101.*.*.* - - 500 0 0 35
S202: preprocess the first log to obtain the second log.
The preprocessing in this embodiment is the same as in the above embodiment: unnecessary fields in the first log, such as the time field and other character fields (for example, space fields), are filtered out, and only the content field of the first log is retained.
As an example, the following is the second log obtained after preprocessing:
101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker 443 101.*.*.* 200 0 0 29
101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker 443 101.*.*.* 500 0 0 39
101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker 443 101.*.*.* 200 0 0 129
101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker 443 101.*.*.* 500 0 0 339
101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker 443 101.*.*.* 200 0 0 52
101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker 443 101.*.*.* 200 0 0 54
101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker 443 101.*.*.* 500 0 0 35
S203: reorder the log sequences of the second log according to a preset ordering rule to obtain the third log.
Specifically, the log classification device counts the frequency with which each word in the content field of the second log occurs within the preset period;
the second log is then reordered according to the frequency with which the words occur to obtain the third log.
As an example, Table 1 is the log word-frequency table of the second log. It can be understood that high-frequency words are fixed vocabulary and are highly correlated with log classification, whereas low-frequency words are non-fixed vocabulary and have little correlation with log classification.
Table 1
Word                                        Word frequency
101.201.*.*                                 7
POST                                        7
/mediasite/JobFarm/Controller.svc/Worker    7
443                                         7
101.*.*.*                                   7
200                                         4
500                                         3
29                                          1
39                                          1
129                                         1
339                                         1
52                                          1
54                                          1
0                                           14
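A word-frequency table of this kind can be produced with a simple counter. The sketch below is an illustration only; it assumes each preprocessed log is already a list of words, and the function name `word_frequencies` and the shortened sample logs are hypothetical.

```python
from collections import Counter

def word_frequencies(logs):
    """Count how often each word occurs across all preprocessed logs."""
    counts = Counter()
    for tokens in logs:
        counts.update(tokens)   # every occurrence counts, including repeats in one log
    return counts

# Shortened stand-ins for preprocessed logs.
sample_logs = [
    ["POST", "443", "200", "0", "0", "29"],
    ["POST", "443", "500", "0", "0", "39"],
    ["POST", "443", "200", "0", "0", "129"],
]
freq = word_frequencies(sample_logs)
for word, n in freq.most_common():
    print(word, n)
```

Because repeated words within one log are counted each time, a word such as "0" that appears twice per log ends up with twice the log count.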
According to Table 1, the words in the content field of each second log can be sorted by word frequency to form the new third log, as follows:
0 0 101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker 443 101.*.*.* 200 29
0 0 101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker 443 101.*.*.* 500 39
0 0 101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker 443 101.*.*.* 200 129
0 0 101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker 443 101.*.*.* 500 339
0 0 101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker 443 101.*.*.* 200 52
0 0 101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker 443 101.*.*.* 200 54
0 0 101.201.*.* POST /mediasite/JobFarm/Controller.svc/Worker 443 101.*.*.* 500 35
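The reordering can be sketched as a stable sort of each log's words by descending frequency. This is one plausible reading of the preset ordering rule; the tie-breaking behaviour (words with equal frequency keep their original relative order) is an assumption, and `reorder_by_frequency` is a hypothetical name.

```python
from collections import Counter

def reorder_by_frequency(logs):
    """Sort the words of every log by descending corpus-wide frequency."""
    counts = Counter(tok for tokens in logs for tok in tokens)
    # sorted() is stable, so equally frequent words keep their original order
    return [sorted(tokens, key=lambda t: -counts[t]) for tokens in logs]

second_log = [
    ["101.201.*.*", "POST", "200", "0", "0", "29"],
    ["101.201.*.*", "POST", "500", "0", "0", "39"],
]
third_log = reorder_by_frequency(second_log)
for tokens in third_log:
    print(" ".join(tokens))
```

Placing the most frequent words first means logs share long common prefixes, which is what lets a prefix tree merge them into a few branches.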
S204: construct the log classification tree structure according to the third log.
Specifically, an initial log classification tree structure is constructed according to the third log;
the initial log classification tree structure is then pruned according to a preset branch number to obtain the final log classification tree structure.
Fig. 3 is a schematic diagram of constructing the log classification tree structure from the third log, provided by an embodiment of the present invention. As shown in Fig. 3, the process of building the log classification tree structure from the third log in the example above is as follows:
(1) insert the third log into the tree structure starting from the root node (root);
(2) taking the next word of the third log as a child node of the current node, construct the corresponding child node;
(3) repeat (2) until all words of the third log have been inserted into the tree structure;
(4) prune the tree structure according to the preset branch number to obtain the final log classification tree structure.
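Steps (1) to (3) correspond to inserting word sequences into a prefix tree. A minimal sketch follows, assuming the tree is represented as nested dictionaries and that an existing child node is reused when the next word is already present; `build_tree` is a hypothetical name and pruning is not included here.

```python
def build_tree(logs):
    """Insert every reordered log into a nested-dict prefix tree."""
    root = {}
    for tokens in logs:
        node = root                            # (1) start from the root node
        for tok in tokens:
            node = node.setdefault(tok, {})    # (2) reuse or create the child node
        # (3) the loop ends once all words of this log are inserted
    return root

initial_tree = build_tree([
    ["0", "0", "POST", "200", "29"],
    ["0", "0", "POST", "500", "39"],
    ["0", "0", "POST", "200", "129"],
])
print(initial_tree)
```

The shared prefix "0 0 POST" is stored once, and the tree only branches where the logs start to differ.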
As an example, if the preset branch number is 2, a node in the tree structure that has two child nodes is retained and the child nodes of that node's two child nodes are deleted, such as the child nodes "29", "129", "39", and "339" in Fig. 3.
Deleting the words that are unrelated to log classification yields the final log classification tree structure.
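The text does not fully specify the pruning rule, so the sketch below adopts one possible reading as an explicit assumption: when a node has more child nodes than the preset branch number, those child nodes are treated as variable words and removed, making the node a terminal node. The function name `prune` and the sample tree are hypothetical.

```python
def prune(node, max_branches):
    """Assumed rule: drop all children of any node whose fan-out exceeds max_branches."""
    if len(node) > max_branches:
        node.clear()               # the node becomes a terminal node
        return node
    for child in node.values():
        prune(child, max_branches)
    return node

tree = {"0": {"POST": {
    "200": {"29": {}, "129": {}, "52": {}, "54": {}},
    "500": {"39": {}, "339": {}, "35": {}},
}}}
pruned = prune(tree, 2)
print(pruned)
```

Under this reading, "POST" keeps its two children while the many one-off values below "200" and "500" disappear, leaving "200" and "500" as terminal nodes.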
It follows from the above creation process that, when a preprocessed log sequence is compared with the preset log classification tree structure, the classification result of the log sequence can be obtained. The classification result includes the terminal node information in the log classification tree structure, such as "200" or "500" in Fig. 3.
It should be noted that one possible case is that the length of the preprocessed log sequence is less than the preset length of a log sequence to be classified. In this case, the log classification device sends a prompt message to the operation and maintenance personnel so that they can manually inspect and analyze the log sequence.
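A sketch of this fallback follows. It assumes a nested-dict tree and interprets "too short" as the walk ending on a node that still has child nodes, so no terminal node is reached; that interpretation, the `notify` callback, and the name `classify_or_flag` are all assumptions made for illustration.

```python
def classify_or_flag(tokens, tree, notify):
    """Return the terminal node reached, or call notify and return None."""
    node, label = tree, None
    for tok in tokens:
        if tok not in node:
            break
        node, label = node[tok], tok
    if node:                # children remain: the walk stopped before a terminal node
        notify(tokens)      # hand the log sequence over for manual inspection
        return None
    return label

alerts = []
sample_tree = {"0": {"POST": {"200": {}, "500": {}}}}
short_result = classify_or_flag(["0", "POST"], sample_tree, alerts.append)
full_result = classify_or_flag(["0", "POST", "200", "29"], sample_tree, alerts.append)
print(short_result, full_result, alerts)
```

Passing the notification channel as a callback keeps the classification logic independent of how operation and maintenance personnel are actually alerted.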
Optionally, the system can periodically update the log classification tree structure to ensure the accuracy of its classification results.
In the process of creating the log classification tree structure provided by this embodiment, the historical logs of the system are preprocessed, all log sequences are reordered according to the preset ordering rule, and the log classification tree structure is created from the reordered logs. The resulting log classification tree structure can determine the terminal node of a log sequence to be classified, and that terminal node is the classification result of the log. The system periodically updates the log classification tree structure to improve the accuracy of its classification results.
An embodiment of the present invention also provides a log classification device, as shown in Fig. 4. The embodiment is described using Fig. 4 only as an example, which does not mean that the present invention is limited to it.
Fig. 4 is a structural diagram of the log classification device provided by an embodiment of the present invention. As shown in Fig. 4, the log classification device 40 provided by this embodiment includes:
an acquisition module 41, configured to obtain the original log sequence of the log to be classified;
a preprocessing module 42, configured to preprocess the original log sequence to obtain the processed log sequence;
a classification module 43, configured to compare the log sequence with the preset log classification tree structure to obtain the classification result of the log to be classified.
The log classification device provided by the embodiment of the present invention includes an acquisition module, a preprocessing module, and a classification module. The acquisition module obtains the original log sequence of the log to be classified, the preprocessing module preprocesses the original log sequence to obtain the processed log sequence, and the classification module compares the log sequence with the preset log classification tree structure to obtain the classification result of the log to be classified. The device enables the system to classify running logs automatically and improves the efficiency of log classification.
On the basis of the above embodiment, in one possible implementation, the acquisition module 41 is further configured to obtain the first log generated by the system within a preset period;
the preprocessing module 42 is further configured to preprocess the first log to obtain the second log.
Fig. 5 is a structural diagram of the log classification device provided by another embodiment of the present invention. On the basis of the device shown in Fig. 4, as shown in Fig. 5, the log classification device 40 provided by this embodiment further includes:
a sorting module 44, configured to reorder the log sequences of the second log according to a preset ordering rule to obtain the third log;
a creation module 45, configured to construct the log classification tree structure according to the third log.
In one possible implementation, the second log includes a content field;
the sorting module 44 is specifically configured to:
count the frequency with which each word in the content field occurs within the preset period;
reorder the second log according to the frequency with which the words occur to obtain the third log.
In one possible implementation, the creation module 45 is specifically configured to:
construct an initial log classification tree structure according to the third log;
prune the initial log classification tree structure according to a preset branch number to obtain the log classification tree structure.
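How the modules might cooperate in software can be sketched with a single class in which each method stands in for one module. Everything here is an assumption made for illustration: the class and method names, whitespace-separated log lines whose first two words are the date and time, the descending-frequency ordering, and the pruning rule that removes the children of any node with more children than the preset branch number.

```python
from collections import Counter

class LogClassifierSketch:
    """Hypothetical software counterpart of modules 41 to 45."""

    def __init__(self, max_branches=2):
        self.max_branches = max_branches
        self.counts = Counter()
        self.tree = {}

    def preprocess(self, line):                      # preprocessing module 42
        # assumes the first two words are date and time; "-" is a placeholder
        return [t for t in line.split()[2:] if t != "-"]

    def sort(self, tokens):                          # sorting module 44
        return sorted(tokens, key=lambda t: -self.counts[t])

    def create(self, history):                       # creation module 45
        logs = [self.preprocess(line) for line in history]   # lines come from module 41
        self.counts = Counter(t for log in logs for t in log)
        self.tree = {}
        for log in logs:
            node = self.tree
            for tok in self.sort(log):
                node = node.setdefault(tok, {})
        self._prune(self.tree)

    def _prune(self, node):
        if len(node) > self.max_branches:            # assumed pruning rule
            node.clear()
        for child in node.values():
            self._prune(child)

    def classify(self, line):                        # classification module 43
        node, label = self.tree, None
        for tok in self.sort(self.preprocess(line)):
            if tok not in node:
                break
            node, label = node[tok], tok
        return label

history = ["2018-01-21 20:54:45 10.0.0.1 POST /api - 443 - %s 0 0 %s" % pair
           for pair in [("200", "29"), ("500", "39"), ("200", "129"),
                        ("500", "339"), ("200", "52"), ("500", "35")]]
device = LogClassifierSketch(max_branches=2)
device.create(history)
label = device.classify("2018-01-22 08:00:00 10.0.0.1 POST /api - 443 - 500 0 0 77")
print(label)
```

In this sketch the incoming log is reordered with the same frequency table that built the tree, so its words line up with the tree layers before the comparison starts.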
The log classification device provided by this embodiment can execute the technical solutions of the above method embodiments. Its implementation principle and technical effect are similar and are not repeated here.
An embodiment of the present invention also provides a log classification device, as shown in Fig. 6. The embodiment is described using Fig. 6 only as an example, which does not mean that the present invention is limited to it.
Fig. 6 is a hardware structural diagram of the log classification device provided by an embodiment of the present invention. As shown in Fig. 6, the log classification device 60 provided by this embodiment includes:
a memory 61;
a processor 62; and
a computer program;
wherein the computer program is stored in the memory 61 and is configured to be executed by the processor 62 to implement the technical solution of any of the foregoing method embodiments. The implementation principle and technical effect are similar and are not repeated here.
Optionally, the memory 61 may be independent of the processor 62 or integrated with it.
When the memory 61 is a device independent of the processor 62, the log classification device 60 further includes:
a bus 63, configured to connect the memory 61 and the processor 62.
An embodiment of the present invention also provides a computer-readable storage medium on which a computer program is stored, the computer program being executed by the processor 62 to implement the steps performed by the log classification device 60 in the method embodiments above.
It should be understood that the above processor may be a central processing unit (CPU), another general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), or the like. A general-purpose processor may be a microprocessor or any conventional processor. The steps of the method disclosed in connection with the invention may be executed directly by a hardware processor, or by a combination of hardware and software modules in the processor.
The memory may include high-speed RAM and may also include non-volatile memory (NVM), such as at least one magnetic disk storage; it may also be a USB flash drive, a removable hard disk, a read-only memory, a magnetic disk, an optical disc, or the like.
The bus may be an Industry Standard Architecture (ISA) bus, a Peripheral Component Interconnect (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like. Buses can be divided into address buses, data buses, control buses, and so on. For ease of representation, the bus in the drawings of the present application is not limited to a single bus or a single type of bus.
The above storage medium may be implemented by any type of volatile or non-volatile storage device or a combination of them, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, a magnetic disk, or an optical disc. The storage medium may be any available medium that can be accessed by a general-purpose or special-purpose computer.
An exemplary storage medium is coupled to the processor so that the processor can read information from, and write information to, the storage medium. The storage medium may also be an integral part of the processor. The processor and the storage medium may be located in an application-specific integrated circuit (ASIC). The processor and the storage medium may also exist as discrete components in an electronic device or a main control device.
Finally, it should be noted that the above embodiments are intended only to illustrate the technical solutions of the present invention, not to limit them. Although the invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that the technical solutions described in the foregoing embodiments may still be modified, or some or all of their technical features may be equivalently replaced, and that such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the scope of the technical solutions of the embodiments of the present invention.

Claims (10)

1. a kind of log classification method characterized by comprising
Obtain the original log sequence of log to be sorted;
The original log sequence is pre-processed, the logged sequence that obtains that treated;
The logged sequence is compared with preset log sort tree structure, obtains the classification knot of the log to be sorted Fruit.
2. the method according to claim 1, wherein the creation process of the log sort tree structure, comprising:
Obtain the first log that system generates in preset period of time;
First log is pre-processed, the second log is obtained;
It is resequenced according to logged sequence of the predetermined order rule to second log, obtains third log;
The log sort tree structure is constructed according to the third log.
3. according to the method described in claim 2, it is characterized in that, second log includes content field;The basis is pre- If ordering rule resequences to the logged sequence of second log, third log is obtained, comprising:
Count the frequency that various words occur in the preset period of time in the content field;
Second log is resequenced according to the frequency that the word occurs, obtains third log.
4. according to the method described in claim 2, it is characterized in that, described construct the log classification according to the third log Tree construction, comprising:
Initial log sort tree structure is constructed according to the third log;
Beta pruning is carried out to the initial log sort tree structure according to default branch's number, obtains the log sort tree structure.
5. a kind of log sorter characterized by comprising
Module is obtained, for obtaining the original log sequence of log to be sorted;
Preprocessing module, for being pre-processed to the original log sequence, the logged sequence that obtains that treated;
Categorization module obtains described to be sorted for the logged sequence to be compared with preset log sort tree structure The classification results of log.
6. The device according to claim 5, wherein:
The obtaining module is further configured to obtain a first log generated by the system within a preset time period;
The preprocessing module is further configured to preprocess the first log to obtain a second log;
The device further comprises: a sorting module, configured to reorder the log sequence of the second log according to a preset ordering rule to obtain a third log;
A creation module, configured to construct the log classification tree structure from the third log.
7. The device according to claim 6, wherein the second log includes a content field, and the sorting module is specifically configured to:
Count the frequency with which each word in the content field occurs within the preset time period;
Reorder the second log according to the frequency with which the words occur to obtain the third log.
8. The device according to claim 6, wherein the creation module is specifically configured to:
Construct an initial log classification tree structure from the third log;
Prune the initial log classification tree structure according to a preset number of branches to obtain the log classification tree structure.
9. A log classification device, comprising:
A memory;
A processor; and
A computer program;
Wherein the computer program is stored in the memory and is configured to be executed by the processor to implement the method according to any one of claims 1-4.
10. A computer-readable storage medium having a computer program stored thereon, wherein the computer program is executed by a processor to implement the method according to any one of claims 1-4.
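
The comparison step performed by the classification module of claim 5 can be pictured as a walk down the preset tree. The Python sketch below is only one possible reading: the `classify` function, the nested-dict tree, and the rule "follow every word that matches a child of the current node and report the matched path as the class" are assumptions, not details stated in the claims.

```python
def classify(sequence, tree):
    """Walk the tree with a preprocessed word sequence and return the matched path."""
    node, path = tree, []
    for word in sequence:
        # Descend only when the word is a child of the current node;
        # unmatched words (for example variable tokens) are skipped.
        if word in node:
            path.append(word)
            node = node[word]
    return tuple(path)


# Hypothetical preset classification tree, stored as nested dicts.
tree = {
    "closed": {"connection": {"from": {}}},
    "disk": {"high": {"usage": {}}},
}

# Hypothetical processed log sequences, already in the tree's word order.
print(classify(["closed", "connection", "from", "10.0.0.9"], tree))
print(classify(["disk", "high", "usage", "sdc"], tree))
print(classify(["kernel", "panic"], tree))
```

An empty path here stands for a log that matches no known class; how such logs are handled is not specified in the claims above.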
CN201811300533.6A 2018-11-02 2018-11-02 Log classification method and device and storage medium Active CN109408640B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811300533.6A CN109408640B (en) 2018-11-02 2018-11-02 Log classification method and device and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811300533.6A CN109408640B (en) 2018-11-02 2018-11-02 Log classification method and device and storage medium

Publications (2)

Publication Number Publication Date
CN109408640A (en) 2019-03-01
CN109408640B (en) 2021-04-20

Family

ID=65471027

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811300533.6A Active CN109408640B (en) 2018-11-02 2018-11-02 Log classification method and device and storage medium

Country Status (1)

Country Link
CN (1) CN109408640B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110185233A1 (en) * 2010-01-25 2011-07-28 International Business Machines Corporation Automated system problem diagnosing
US20160179906A1 (en) * 2014-12-18 2016-06-23 Salesforce.Com, Inc. Identifying relevant material for cases
CN105827432A (en) * 2015-12-29 2016-08-03 广东亿迅科技有限公司 SHELL script-based traffic log statistical method and statistical system
CN106227790A (en) * 2016-07-19 2016-12-14 北京北信源软件股份有限公司 A kind of method using Apache Spark classification and parsing massive logs

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
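金效行: "Application of Decision Tree Algorithms in Website Server Log Analysis" (决策树算法在网站服务器日志分析中的应用), China Master's Theses Full-text Database, Information Science and Technology *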

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112115030A (en) * 2020-09-28 2020-12-22 曙光信息产业(北京)有限公司 Node determination method and device, electronic equipment and storage medium
CN112115030B (en) * 2020-09-28 2023-12-19 曙光信息产业(北京)有限公司 Node determination method and device, electronic equipment and storage medium
CN112445912A (en) * 2020-11-06 2021-03-05 苏州浪潮智能科技有限公司 Fault log classification method, system, device and medium
CN112445912B (en) * 2020-11-06 2022-06-07 苏州浪潮智能科技有限公司 Fault log classification method, system, device and medium
CN113934701A (en) * 2021-10-12 2022-01-14 网易(杭州)网络有限公司 Log processing method, device, server and storage medium

Also Published As

Publication number Publication date
CN109408640B (en) 2021-04-20

Similar Documents

Publication Publication Date Title
CN110737592B (en) Link abnormality identification method, server and computer readable storage medium
US9342627B2 (en) Determining word information entropies
CN109408640A (en) Log classification method, device and storage medium
CN107808306B (en) Business object segmentation method based on tag library, electronic device and storage medium
CN112183782B (en) Fault work order processing method and equipment
US8756071B2 (en) Methods and apparatus for queue-based cluster analysis
CN111708938B (en) Method, apparatus, electronic device, and storage medium for information processing
CN105630931A (en) Document classification method and device
US10783453B2 (en) Systems and methods for automated incident response
CN112416778A (en) Test case recommendation method and device and electronic equipment
CN109933648B (en) Real user comment distinguishing method and device
CN112468523A (en) Abnormal flow detection method, device, equipment and storage medium
CN106649376A (en) Navigation tag sorting method and device
CN107357885A (en) Method for writing data and device, electronic equipment, computer-readable storage medium
CN103593406A (en) Static resource identifier processing method and device
CN110689211A (en) Method and device for evaluating website service capability
CN115952162A (en) Data quality checking method, device and equipment
CN114564624A (en) Feature matching rule construction method, feature matching device, feature matching equipment and feature matching medium
US11501058B2 (en) Event detection based on text streams
CN111882113B (en) Enterprise mobile banking user prediction method and device
CN111104628A (en) User identification method and device, electronic equipment and storage medium
CN111831817A (en) Questionnaire generation and analysis method and device, computer equipment and readable storage medium
CN104391981A (en) Text classification method and device
CN113836430A (en) Book recommendation method, terminal and storage medium
CN113760864A (en) Data model generation method and device

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant