CN107766416A - Data analysing method, apparatus and system - Google Patents

Data analysing method, apparatus and system Download PDF

Info

Publication number
CN107766416A
CN107766416A CN201710806187.8A CN201710806187A CN107766416A CN 107766416 A CN107766416 A CN 107766416A CN 201710806187 A CN201710806187 A CN 201710806187A CN 107766416 A CN107766416 A CN 107766416A
Authority
CN
China
Prior art keywords
data
target
analysis result
target data
analysis
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710806187.8A
Other languages
Chinese (zh)
Inventor
王磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Advanced New Technologies Co Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201710806187.8A priority Critical patent/CN107766416A/en
Publication of CN107766416A publication Critical patent/CN107766416A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/353Clustering; Classification into predefined classes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

This specification one or more embodiment discloses a kind of data analysing method, apparatus and system, to realize automatically analyzing for data, so as to improve the efficiency of data analysis.Methods described includes:Obtain target data to be analyzed;Concentrated in the data results of generation and the target data is matched, to obtain target analysis result corresponding to the target data, wherein, the data results, which are concentrated, includes multiple data and its corresponding analysis result.

Description

Data analysing method, apparatus and system
Technical field
This specification is related to technical field of data processing, more particularly to a kind of data analysing method, apparatus and system.
Background technology
Problem is very more on daily line, all kinds of problems that such as customer service, artificial discovery, public sentiment feedback platform report.For All kinds of problems reported, research staff need to fish for various data in operation system, and matching content, analysis reason, finally Draw a conclusion.But the whole process of manual analysis data is very time-consuming, wherein only data transfer mistake of the client to service end Journey just needs general 1 hour, or even needs 2-3 hour, wastes many times and human resources.Also, in many situations Under, research staff needs to carry out same or similar problem repeatedly analysis investigation, causes the efficiency of data analysis low.
The content of the invention
The purpose of this specification one or more embodiment is to provide a kind of data analysing method, apparatus and system, to Automatically analyzing for data is realized, so as to improve the efficiency of data analysis.
In order to solve the above technical problems, what this specification one or more embodiment was realized in:
On the one hand, this specification one or more embodiment provides a kind of data analysing method, applied to network side, bag Include:
Obtain target data to be analyzed;
Concentrated in the data results of generation and the target data is matched, it is corresponding to obtain the target data Target analysis result, wherein, the data results, which are concentrated, includes multiple data and its corresponding analysis result.
Alternatively, in addition to:
The keyword included in the target data is extracted, the data class of the target data is determined according to the keyword Type;Wherein, the keyword is related at least one in product type, data characteristics, data generation time, user profile;
Accordingly, the target data is matched in the data results concentration of generation, including:
The data for determining to match with the data type are concentrated from the data results;
Analysis result according to corresponding to the data to match with the data type matches to the target data.
Alternatively, in addition to:
The target data is configured using the data rule being pre-configured with, so that described with the target data postponed It can be identified by the network side.
Alternatively, it is described that the target data is configured using the data rule being pre-configured with, including:
The target data is split using designated symbols, so that the target data meets the data rule institute Defined clause.
Alternatively, the target data is matched in the data results concentration of generation, including:
Judge that whether the data results are concentrated comprising the target analysis result to match with the target data;
If so, then export the target analysis result;
If it is not, then the target data is sent to the corresponding device of manual analysis.
Alternatively, in addition to:
Obtain the target analysis result obtained by manual analysis;
The target analysis result obtained according to the manual analysis updates the data results collection.
Alternatively, in addition to:
Obtain sample analysis result corresponding to multiple sample datas;
The multiple sample data and the sample analysis result are subjected to corresponding storage, to generate the data analysis Result set;Or, the sample data is classified according to the keyword included in the sample data, by all kinds of samples Data and the sample analysis result are correspondingly stored, to generate the data results collection.
Alternatively, the User action log data that the target data includes client and/or server reports.
On the other hand, this specification one or more embodiment provides a kind of data analysis set-up, applied to network side, bag Include:
First acquisition module, obtain target data to be analyzed;
Matching module, concentrated in the data results of generation and the target data is matched, to obtain the mesh Target analysis result corresponding to data is marked, wherein, the data results, which are concentrated, includes multiple data and its corresponding analysis As a result.
Alternatively, in addition to:
Extraction module, the keyword included in the target data is extracted, the number of targets is determined according to the keyword According to data type;Wherein, in the keyword and product type, data characteristics, data generation time, user profile at least One correlation;
Accordingly, the matching module includes:
Determining unit, the data for determining to match with the data type are concentrated from the data results;
Matching unit, the analysis result according to corresponding to the data to match with the data type is to the target data Matched.
Alternatively, in addition to:
Configuration module, the target data is configured using the data rule being pre-configured with, so that described with postponing Target data can be identified by the network side.
Alternatively, the configuration module includes:
Cutting unit, the target data is split using designated symbols so that the target data meet it is described The clause of data rule defined.
Alternatively, the matching module includes:
Judging unit, judge that the data results concentrate the target point for whether including and matching with the target data Analyse result;
First output unit, if the data results are concentrated comprising the target analysis to match with the target data As a result, then the target analysis result is exported;
Second output unit, if the data results concentrate the target point for not including and matching with the target data Result is analysed, then is sent the target data to the corresponding device of manual analysis.
Alternatively, described device also includes:
Second acquisition module, obtain the target analysis result obtained by manual analysis;
Update module, the target analysis result obtained according to the manual analysis update the data results collection.
Alternatively, described device also includes:
3rd acquisition module, obtain sample analysis result corresponding to multiple sample datas;
Generation module, the multiple sample data and the sample analysis result are subjected to corresponding storage, to generate State data results collection;Or, the sample data is classified according to the keyword included in the sample data, will be each Sample data described in class and the sample analysis result are correspondingly stored, to generate the data results collection.
Alternatively, the User action log data that the target data includes client and/or server reports.
Another further aspect, this specification one or more embodiment provide a kind of data analysis system, are arranged at network side, wrap Include:
Space in a newspaper in data, target data to be analyzed is reported to components of data analysis;
The components of data analysis, obtain the target data that space in a newspaper reports in the data;In the data analysis knot of generation Fruit is concentrated and the target data is matched, to obtain target analysis result corresponding to the target data, wherein, the number Being concentrated according to analysis result includes multiple data and its corresponding analysis result.
Alternatively, in addition to:Manual analysis component;
The Data Analysis Platform, judge that the data results are concentrated whether to include and match with the target data Target analysis result;If so, then export the target analysis result;If it is not, then the target data is sent to the people Work analytic unit;
The manual analysis component, exports the target data, to carry out manual analysis to the target data;It will pass through The target analysis result that the manual analysis obtains is transmitted to the components of data analysis;
The components of data analysis, the target analysis result obtained according to the manual analysis update the data analysis knot Fruit collects.
Another further aspect, this specification one or more embodiment provide a kind of DAF, including:
Processor;And
It is arranged to store the memory of computer executable instructions, the executable instruction makes the place when executed Manage device:
Obtain target data to be analyzed;
Concentrated in the data results of generation and the target data is matched, it is corresponding to obtain the target data Target analysis result, wherein, the data results, which are concentrated, includes multiple data and its corresponding analysis result.
Another further aspect, this specification one or more embodiment provide a kind of storage medium, can held for storing computer Row instruction, the executable instruction realize below scheme when executed:
Obtain target data to be analyzed;
Concentrated in the data results of generation and the target data is matched, it is corresponding to obtain the target data Target analysis result, wherein, the data results, which are concentrated, includes multiple data and its corresponding analysis result.
Using the technical scheme of this specification one or more embodiment, target data to be analyzed can be obtained, and The data results of generation are concentrated and target data are matched, to obtain target analysis result corresponding to target data.Can See, the technical scheme causes data analysis to be no longer dependent on manual analysis, so as to save substantial amounts of time and human resources.
Further, the technical scheme can be concentrated in data results and include dividing with the target that target data matches When analysing result, the target analysis result is directly exported, so as to avoid to identical data be repeated several times the situation of analysis, is improved The efficiency of data analysis.
Brief description of the drawings
In order to illustrate more clearly of this specification one or more embodiment or technical scheme of the prior art, below will The required accompanying drawing used in embodiment or description of the prior art is briefly described, it should be apparent that, in describing below Accompanying drawing is only some embodiments described in this specification one or more embodiment, is come for those of ordinary skill in the art Say, without having to pay creative labor, other accompanying drawings can also be obtained according to these accompanying drawings.
Fig. 1 is the indicative flowchart according to the data analysing method of the embodiment of this specification one;
Fig. 2 is the indicative flowchart according to the data analysing method of another embodiment of this specification;
Fig. 3 is the schematic block diagram according to the data analysis set-up of the embodiment of this specification one;
Fig. 4 is the schematic block diagram according to the data analysis system of the embodiment of this specification one;
Fig. 5 is the schematic block diagram according to the DAF of the embodiment of this specification one.
Embodiment
This specification one or more embodiment provides a kind of data analysing method, apparatus and system, to realize data Automatically analyze, so as to improve the efficiency of data analysis.
In order that those skilled in the art more fully understand the technical scheme in this specification one or more embodiment, Below in conjunction with the accompanying drawing in this specification one or more embodiment, to the technology in this specification one or more embodiment Scheme is clearly and completely described, it is clear that and described embodiment is only this specification part of the embodiment, rather than Whole embodiments.Based on this specification one or more embodiment, those of ordinary skill in the art are not making creativeness The every other embodiment obtained under the premise of work, it should all belong to the model of this specification one or more embodiment protection Enclose.
Fig. 1 is according to the indicative flowchart of the data analysing method of the embodiment of this specification one, as shown in figure 1, the party Method is applied to network side, including:
Step S102, obtain target data to be analyzed.
Step S104, concentrated in the data results of generation and target data is matched, to obtain target data pair The target analysis result answered.
Wherein, data results, which are concentrated, includes multiple data and its corresponding analysis result.In data results collection In, directly multiple data and its corresponding analysis result can be stored, also multiple data can be pressed different data types Stored respectively.
Using the technical scheme of this specification one or more embodiment, target data to be analyzed can be obtained, and The data results of generation are concentrated and target data are matched, to obtain target analysis result corresponding to target data.Can See, the technical scheme causes data analysis to be no longer dependent on manual analysis, so as to save substantial amounts of time and human resources.
The step S102-S104 data analysing methods provided are described in detail below.
First, target data to be analyzed is obtained.Wherein, the use that target data includes client and/or server reports Family user behaviors log data, such as all kinds of problems that the platform such as front end customer service, artificial discovery, public sentiment feedback reports.
After target data is got, to enable target data to be identified by network side, using what is be pre-configured with Data rule configures to target data, so as to can be identified with the target data postponed by the network side.Wherein, in advance The data rule first configured can be the rule configured to the forms of data, clause etc..
Based on above-mentioned data rule, in one embodiment, when being configured to target data, using designated symbols pair Target data is split, so that target data meets the clause of data rule defined.For example, designated symbols accord with for punctuate Number, it is assumed that the target data got is long sentence, then the long sentence is split using punctuation mark, to be divided into network The short sentence that side can identify.
In one embodiment, when being configured to target data, the form of target data can also be changed, so that Target data meets the form of data rule defined.For example, it is assumed that the form of the target data got is Chinese, and data Data format specified in rule is phonetic, then the form of target data can be converted into phonetic by Chinese, so that network Side can recognize that target data.
Secondly, concentrated in the data results previously generated and target data is matched, to obtain target data pair The target analysis result answered.
In one embodiment, judge that whether data results are concentrated comprising the target analysis to match with target data As a result;If so, then export target analysis result;If it is not, then target data is sent to the corresponding device of manual analysis.
The scheme of the present embodiment, the target analysis knot including matching with target data can be concentrated in data results During fruit, the target analysis result is directly exported, so as to avoid to identical data be repeated several times the situation of analysis, improves number According to the efficiency of analysis.Further, it is possible to the target analysis result for not including matching with target data is concentrated in data results When, manual analysis is carried out to target data so that just carried out when target data does not only match that with data results collection artificial Analysis, and manual analysis not is completely dependent on, so as to save substantial amounts of time and human resources.
In one embodiment, data results collection can be generated by following any mode:
Mode one, obtain sample analysis result corresponding to multiple sample datas;By multiple sample datas and sample analysis As a result corresponding storage is carried out, to generate data results collection.
If pass-through mode one generate data results collection, then data results collection can directly store multiple data and Its corresponding analysis result.When data results are concentrated and target data is matched, data results only need to be judged Concentrate whether comprising the data to match with target data, if so, the analysis corresponding to the data then to match with target data As a result it is target analysis result corresponding to target data.
Wherein, the data to match with target data can be data identical with target data or with Target data implication identical data.Therefore, judging whether data results concentration includes what is matched with target data Before data, semantic analysis can be carried out to target data first, to determine the implication of target data.For example, before target data is End customer service report on product A the problem of " product A can not be bought ", although and data results are concentrated and not included and target The identical data of data, but comprising with target data implication identical data " product A can not be bought ", now, data point The data to match with target data are included in analysis result set.
Mode two, according to the keyword included in sample data sample data is classified, by Different categories of samples data with And sample analysis result is correspondingly stored, to generate data results collection.
If pass-through mode two generates data results collection, then data results collection can be according to different data types Store multiple data and its corresponding analysis result.The keyword included in sample data can be special with product type, data The related entity word such as sign, data generation time, user profile.Wherein, data characteristics can be that the side of reporting of sample data is pre- The mark first done to sample data, such as the mark " spy of product X ", " incidence is high " etc. or sample data in itself Sign, such as feature " the product X " etc. included in sample data.Data characteristics makes a distinction available for network side to sample data.
According to the difference of keyword, sample data can be classified from different perspectives.For example, if keyword is product class Type, then sample data can be classified according to the difference of product type;If keyword is data generation time, can be according to number Sample data is classified according to the difference of generation time;If keyword is user profile (such as address name, account), can Sample data is classified according to the difference of user profile;Etc..
When being matched according to data results set pair target data, the data class of target data need to be determined first Type.It is determined that target data data type when, the keyword that includes in extractable target data, and according to the key extracted Word determines the data type of target data.Wherein, keyword can be and product type, data characteristics, data generation time, use The related entity word such as family information.Wherein, data characteristics can be that the side of reporting of target data is done to target data in advance Mark, such as mark " feature of product X ", " incidence is high " etc. or target data in itself, such as wrapped in target data Feature " the product X " etc. contained.Data characteristics is sorted out available for network side to target data.
According to the difference of keyword, the data type of target data can be determined from different perspectives.For example, if keyword is production Category type, then can be according to the different data types for determining target data of product type;If keyword is data generation time, Can be according to the different data types for determining target data of data generation time;If keyword be user profile (such as address name, Account etc.), then can be according to the different data types for determining target data of user profile;Etc..
Under normal circumstances, target data is automatically analyzed for ease of network side, determines the data type of target data When institute's foundation keyword type should with classifying to sample data when institute's foundation keyword type it is identical.For example, If the keyword of institute's foundation is product type when classifying to sample data, then when determining the data type of target data Keyword related to product type in target data should be extracted.
After the data type for determining target data, you can according to the data type of target data to target data progress Match somebody with somebody.Specifically, the data for determining to match with data type are concentrated from data results first, so according to data type The analysis result corresponding to data to match matches to target data.
Concentrated in the data results that in the manner described above two are generated, analysis result corresponding to same class data may Including one or more.When the analysis result corresponding to the data to match with data type includes multiple, other can be combined Target analysis result corresponding to factor matching target data, other kinds of keyword in such as combining target data.
For example, data results concentrate A classes data (the i.e. number related to product A for including classifying according to product type According to) and its corresponding analysis result, B classes data (i.e. the data related to product B) and its corresponding analysis result, C class data (data i.e. related to products C) and its corresponding analysis result ....If the data type of target data is A class data, number The analysis result corresponding to A class data is concentrated to include according to analysis result multiple, then to be produced in extractable target data with data The related keyword such as time, user profile, and mesh is determined according to the keyword related to data generation time, user profile etc. Mark target analysis result corresponding to data.
In addition, the target of target data point can also be determined according to the probability of occurrence of each analysis result in multiple analysis results Result is analysed, for example, will appear from the target analysis result that probability highest analysis result is defined as target data.
In above-described embodiment, if data results concentrate the target analysis result for not including and matching with target data When, then target data is sent to the corresponding device of manual analysis.Based on this, the target point obtained by manual analysis can be obtained Result is analysed, and the target analysis result obtained according to manual analysis updates the data analysis result collection.
When updating the data analysis result collection, if data results, which are concentrated, includes multiple data and its corresponding analysis knot Fruit, then directly target analysis result and target data that manual analysis obtains correspondingly can be stored to data results collection; If data results concentrate include polytype data and all types of data corresponding to analysis result, need to be according to target The data type of data, target analysis result that target data and manual analysis obtain correspondingly is stored according to data type to Data results collection.
The scheme of the present embodiment, analysis result collection can be updated the data according to the target analysis result that manual analysis obtains, So that the content that data results are concentrated more enrich it is perfect, and then again to identical (identical or implication is identical) When target data is analyzed, without relying on manual analysis again, and data results in the updated are only needed to concentrate to target Data are matched, so as to save substantial amounts of time and human resources.
Fig. 2 is according to the indicative flowchart of the data analysing method of the embodiment of this specification one, as shown in Fig. 2 the party Method is applied to network side, including:
Step S201, obtain target data to be analyzed.
Wherein, the User action log data that target data includes client and/or server reports, such as front end visitor All kinds of problems that the platforms such as clothes, artificial discovery, public sentiment feedback report.
Step S202, target data is configured using the data rule being pre-configured with, so that with the number of targets postponed According to can be identified by the network side.
For example, target data is split using designated symbols (such as punctuation mark), so that target data meets data The clause of specified by rules;Or the form of target data is changed, so that target data meets data rule defined Form.
Step S203, data results collection is obtained, judge that data results are concentrated and whether include and target data phase The target analysis result of matching.If so, then perform step S204;If it is not, then perform step S205.
Wherein, data results collection includes multiple data and its corresponding analysis result.Match with target data Target analysis result refers to the analysis result corresponding to the data of (i.e. identical or implication is identical) identical with target data.
Step S204, export target analysis result.
Step S205, target data is sent to the corresponding device of manual analysis to carry out manual analysis, obtains number of targets According to corresponding target analysis result.
Step S206, the target analysis result obtained according to manual analysis update the data analysis result collection.
In the present embodiment, data results collection includes multiple data and its corresponding analysis result, can be directly by target The target analysis result that data and manual analysis obtain directly is stored to data results collection.
For data analysing method illustrated in fig. 2, now illustrated with a concrete scene.Assuming that target data is front end Customer service report it is related to product A the problem of " product A can not be bought ", found according to data results collection and target data The data " product A can not be bought " of " product A can not be bought " identical (i.e. identical or implication is identical).In data results Concentrate, analysis result corresponding to data " product A can not be bought " is " product A is out of print today ", then the analysis result is The target analysis result to match with target data " product A can not be bought ", network side directly forward end customer service can export target Analysis result " product A is out of print today ".Do not found and target data " product A can not be bought " if data results are concentrated The data of identical (i.e. identical or implication is identical), then network side output target data " product A can not be bought ", so that right Target data " product A can not be bought " carries out manual analysis.Assuming that the obtained target analysis result of manual analysis for " product A in September is not sold between 1 to 10 within 2017 ", then can " product A be in 2017 9 by target analysis result that manual analysis obtains Do not sold between months 1 to 10 " and corresponding store to data results of target data " product A can not be bought " concentrate, To update the data analysis result collection.
To sum up, the specific embodiment of this theme is described.Other embodiments are in appended claims In the range of.In some cases, the action recorded in detail in the claims can perform and still in a different order Desired result can be realized.In addition, the process described in the accompanying drawings not necessarily requires the particular order or continuous suitable shown Sequence, to realize desired result.In some embodiments, multitasking and parallel processing can be favourable.
The data analysing method provided above for this specification one or more embodiment, based on same thinking, this theory Bright book one or more embodiment also provides a kind of data analysis set-up.
Fig. 3 is the schematic block diagram according to the data analysis set-up of the embodiment of this specification one.As shown in figure 3, the device Applied to network side, including:
First acquisition module 310, obtains target data to be analyzed;
Matching module 320, concentrated in the data results of generation and target data is matched, to obtain target data Corresponding target analysis result, wherein, data results, which are concentrated, includes multiple data and its corresponding analysis result.
Alternatively, said apparatus also includes:
Extraction module, the keyword included in target data is extracted, the data type of target data is determined according to keyword; Wherein, keyword is related at least one in product type, data characteristics, data generation time, user profile;
Accordingly, matching module 320 includes:
Determining unit, the data for determining to match with data type are concentrated from data results;
Matching unit, the analysis result according to corresponding to the data to match with data type is to target data progress Match somebody with somebody.
Alternatively, said apparatus also includes:
Configuration module, target data is configured using the data rule being pre-configured with, so that with the number of targets postponed According to can be identified by network side.
Alternatively, configuration module includes:
Cutting unit, target data is split using designated symbols, so that target data meets data rule and advised Fixed clause.
Alternatively, matching module 320 includes:
Judging unit, judge that whether data results are concentrated comprising the target analysis result to match with target data;
First output unit, if data results are concentrated comprising the target analysis result to match with target data, Export target analysis result;
Second output unit, if data results concentrate the target analysis result for not including and matching with target data, Then target data is sent to the corresponding device of manual analysis.
Alternatively, said apparatus also includes:
Second acquisition module, obtain the target analysis result obtained by manual analysis;
Update module, the target analysis result obtained according to manual analysis update the data analysis result collection.
Alternatively, said apparatus also includes:
3rd acquisition module, obtain sample analysis result corresponding to multiple sample datas;
Generation module, multiple sample datas and sample analysis result are subjected to corresponding storage, to generate data analysis knot Fruit collects;Or, sample data is classified according to the keyword included in sample data, by Different categories of samples data and sample point Analysis result is correspondingly stored, to generate data results collection.
Alternatively, the User action log data that target data includes client and/or server reports.
Using the device of this specification one or more embodiment, target data to be analyzed can be obtained, and generating Data results concentrate target data is matched, to obtain target analysis result corresponding to target data.It can be seen that should Technical scheme causes data analysis to be no longer dependent on manual analysis, so as to save substantial amounts of time and human resources.
Further, the device can concentrate the target analysis knot including matching with target data in data results During fruit, the target analysis result is directly exported, so as to avoid to identical data be repeated several times the situation of analysis, improves number According to the efficiency of analysis.
The data analysing method provided above for this specification one or more embodiment, based on same thinking, this theory Bright book one or more embodiment also provides a kind of data analysis system.
Fig. 4 is according to the schematic block diagram of the data analysis system of the embodiment of this specification one, as shown in figure 4, the system It is arranged at space in a newspaper 410 and components of data analysis 420 on network side, including data;Wherein:
Space in a newspaper 410 in data, target data to be analyzed is reported to components of data analysis 420;
Components of data analysis 420, obtain the target data that space in a newspaper 410 reports in data;In the data results of generation Concentration matches to target data, to obtain target analysis result corresponding to target data, wherein, data results are concentrated Including multiple data and its corresponding analysis result.
In one embodiment, said system also includes manual analysis component;
Components of data analysis 420, judge that data results concentrate the target point for whether including and matching with target data Analyse result;If so, then export target analysis result;If it is not, then target data is sent to manual analysis component;
Manual analysis component, target data is exported, to carry out manual analysis to target data;It will be obtained by manual analysis Target analysis result transmit to components of data analysis 420;
Components of data analysis 420, the target analysis result obtained according to manual analysis update the data analysis result collection.
It should be understood that the data analysis system energy in data analysis set-up and Fig. 4 in Fig. 3 It is enough to realize previously described data analysing method, detailed description therein should be described with method part above it is similar, to keep away Exempt from cumbersome, do not repeat separately herein.
Based on same thinking, this specification one or more embodiment also provides a kind of DAF, such as Fig. 5 institutes Show.DAF can produce bigger difference because configuration or performance are different, can include one or more Processor 501 and memory 502, one or more storage application programs or data can be stored with memory 502.Its In, memory 502 can be of short duration storage or persistently storage.Be stored in memory 502 application program can include one or More than one module (diagram is not shown), each module can include can perform the series of computation machine in DAF Instruction.Further, processor 501 be could be arranged to communicate with memory 502, and memory is performed on DAF Series of computation machine executable instruction in 502.DAF can also include one or more power supplys 503, and one Individual or more than one wired or wireless network interface 504, one or more input/output interfaces 505, one or one with Upper keyboard 506.
Specifically in the present embodiment, DAF includes memory, and one or more program, its In one or more than one program storage in memory, and one or more than one program can include one or one With upper module, and each module can include to the series of computation machine executable instruction in DAF, and be configured So that by one, either more than one computing device this or more than one program bag contain that be used to carrying out following computer can Execute instruction:
Obtain target data to be analyzed;
Concentrated in the data results of generation and the target data is matched, it is corresponding to obtain the target data Target analysis result, wherein, the data results, which are concentrated, includes multiple data and its corresponding analysis result.
Alternatively, computer executable instructions when executed, can also make the processor:
The keyword included in the target data is extracted, the data class of the target data is determined according to the keyword Type;Wherein, the keyword is related at least one in product type, data characteristics, data generation time, user profile;
Accordingly, computer executable instructions when executed, can also make the processor:
The data for determining to match with the data type are concentrated from the data results;
Analysis result according to corresponding to the data to match with the data type matches to the target data.
Alternatively, computer executable instructions when executed, can also make the processor:
The target data is configured using the data rule being pre-configured with, so that described with the target data postponed It can be identified by the network side.
Alternatively, computer executable instructions when executed, can also make the processor:
The target data is split using designated symbols, so that the target data meets the data rule institute Defined clause.
Alternatively, computer executable instructions when executed, can also make the processor:
Judge that whether the data results are concentrated comprising the target analysis result to match with the target data;
If so, then export the target analysis result;
If it is not, then the target data is sent to the corresponding device of manual analysis.
Alternatively, computer executable instructions when executed, can also make the processor:
Obtain the target analysis result obtained by manual analysis;
The target analysis result obtained according to the manual analysis updates the data results collection.
Alternatively, computer executable instructions when executed, can also make the processor:
Obtain sample analysis result corresponding to multiple sample datas;
The multiple sample data and the sample analysis result are subjected to corresponding storage, to generate the data analysis Result set;Or, the sample data is classified according to the keyword included in the sample data, by all kinds of samples Data and the sample analysis result are correspondingly stored, to generate the data results collection.
Alternatively, the User action log data that the target data includes client and/or server reports.
This specification one or more embodiment also proposed a kind of computer-readable recording medium, and this is computer-readable to deposit Storage media stores one or more programs, and one or more programs include instruction, and the instruction, which is worked as, is included multiple application programs Electronic equipment when performing, the electronic equipment can be made to perform above-mentioned data analysing method, and specifically for performing:
Obtain target data to be analyzed;
Concentrated in the data results of generation and the target data is matched, it is corresponding to obtain the target data Target analysis result, wherein, the data results, which are concentrated, includes multiple data and its corresponding analysis result.
System, device, module or the unit that above-described embodiment illustrates, it can specifically be realized by computer chip or entity, Or realized by the product with certain function.One kind typically realizes that equipment is computer.Specifically, computer for example may be used Think personal computer, laptop computer, cell phone, camera phone, smart phone, personal digital assistant, media play It is any in device, navigation equipment, electronic mail equipment, game console, tablet PC, wearable device or these equipment The combination of equipment.
For convenience of description, it is divided into various units during description apparatus above with function to describe respectively.Certainly, this is being implemented The function of each unit can be realized in same or multiple softwares and/or hardware during specification one or more embodiment.
It should be understood by those skilled in the art that, this specification one or more embodiment can be provided as method, system or Computer program product.Therefore, this specification one or more embodiment can use complete hardware embodiment, complete software to implement The form of embodiment in terms of example or combination software and hardware.Moreover, this specification one or more embodiment can be used one Individual or multiple computer-usable storage mediums for wherein including computer usable program code (include but is not limited to disk storage Device, CD-ROM, optical memory etc.) on the form of computer program product implemented.
This specification one or more embodiment is with reference to according to the method for the embodiment of the present application, equipment (system) and meter The flow chart and/or block diagram of calculation machine program product describes.It should be understood that can by computer program instructions implementation process figure and/ Or each flow in block diagram and/or square frame and the flow in flow chart and/or block diagram and/or the combination of square frame.Can These computer program instructions are provided at all-purpose computer, special-purpose computer, Embedded Processor or other programmable datas The processor of equipment is managed to produce a machine so that hold by the processor of computer or other programmable data processing devices Capable instruction is produced for realizing in one flow of flow chart or multiple flows and/or one square frame of block diagram or multiple square frames The device for the function of specifying.
These computer program instructions, which may be alternatively stored in, can guide computer or other programmable data processing devices with spy Determine in the computer-readable memory that mode works so that the instruction being stored in the computer-readable memory, which produces, to be included referring to Make the manufacture of device, the command device realize in one flow of flow chart or multiple flows and/or one square frame of block diagram or The function of being specified in multiple square frames.
These computer program instructions can be also loaded into computer or other programmable data processing devices so that counted Series of operation steps is performed on calculation machine or other programmable devices to produce computer implemented processing, so as in computer or The instruction performed on other programmable devices is provided for realizing in one flow of flow chart or multiple flows and/or block diagram one The step of function of being specified in individual square frame or multiple square frames.
In a typical configuration, computing device includes one or more processors (CPU), input/output interface, net Network interface and internal memory.
Internal memory may include computer-readable medium in volatile memory, random access memory (RAM) and/or The forms such as Nonvolatile memory, such as read-only storage (ROM) or flash memory (flash RAM).Internal memory is computer-readable medium Example.
Computer-readable medium includes permanent and non-permanent, removable and non-removable media can be by any method Or technology come realize information store.Information can be computer-readable instruction, data structure, the module of program or other data. The example of the storage medium of computer includes, but are not limited to phase transition internal memory (PRAM), static RAM (SRAM), moved State random access memory (DRAM), other kinds of random access memory (RAM), read-only storage (ROM), electric erasable Programmable read only memory (EEPROM), fast flash memory bank or other memory techniques, read-only optical disc read-only storage (CD-ROM), Digital versatile disc (DVD) or other optical storages, magnetic cassette tape, the storage of tape magnetic rigid disk or other magnetic storage apparatus Or any other non-transmission medium, the information that can be accessed by a computing device available for storage.Define, calculate according to herein Machine computer-readable recording medium does not include temporary computer readable media (transitory media), such as data-signal and carrier wave of modulation.
It should also be noted that, term " comprising ", "comprising" or its any other variant are intended to nonexcludability Comprising so that process, method, commodity or equipment including a series of elements not only include those key elements, but also wrapping Include the other element being not expressly set out, or also include for this process, method, commodity or equipment intrinsic want Element.In the absence of more restrictions, the key element limited by sentence "including a ...", it is not excluded that wanted including described Other identical element also be present in the process of element, method, commodity or equipment.
This specification one or more embodiment can computer executable instructions it is general on Described in hereafter, such as program module.Usually, program module includes performing particular task or realizes particular abstract data type Routine, program, object, component, data structure etc..The application can also be put into practice in a distributed computing environment, at these In DCE, by performing task by communication network and connected remote processing devices.In Distributed Calculation In environment, program module can be located in the local and remote computer-readable storage medium including storage device.
Each embodiment in this specification is described by the way of progressive, identical similar portion between each embodiment Divide mutually referring to what each embodiment stressed is the difference with other embodiment.It is real especially for system For applying example, because it is substantially similar to embodiment of the method, so description is fairly simple, related part is referring to embodiment of the method Part explanation.
This specification one or more embodiment is the foregoing is only, is not limited to this specification.For this For art personnel, this specification one or more embodiment can have various modifications and variations.It is all in this specification one Any modification, equivalent substitution and improvements made within the spirit and principle of individual or multiple embodiments etc., should be included in this explanation Within the right of book one or more embodiment.

Claims (20)

1. a kind of data analysing method, applied to network side, methods described includes:
Obtain target data to be analyzed;
Concentrated in the data results of generation and the target data is matched, to obtain mesh corresponding to the target data Analysis result is marked, wherein, the data results, which are concentrated, includes multiple data and its corresponding analysis result.
2. the method according to claim 11, in addition to:
The keyword included in the target data is extracted, the data type of the target data is determined according to the keyword; Wherein, the keyword is related at least one in product type, data characteristics, data generation time, user profile;
Accordingly, the target data is matched in the data results concentration of generation, including:
The data for determining to match with the data type are concentrated from the data results;
Analysis result according to corresponding to the data to match with the data type matches to the target data.
3. method according to claim 1 or 2, in addition to:
The target data is configured using the data rule being pre-configured with, so that described can with the target data postponed Identified by the network side.
It is 4. according to the method for claim 3, described that the target data is matched somebody with somebody using the data rule being pre-configured with Put, including:
The target data is split using designated symbols, so that the target data meets the data rule defined Clause.
5. according to the method for claim 1, the target data is matched in the data results concentration of generation, Including:
Judge that whether the data results are concentrated comprising the target analysis result to match with the target data;
If so, then export the target analysis result;
If it is not, then the target data is sent to the corresponding device of manual analysis.
6. the method according to claim 11, in addition to:
Obtain the target analysis result obtained by manual analysis;
The target analysis result obtained according to the manual analysis updates the data results collection.
7. the method according to claim 11, in addition to:
Obtain sample analysis result corresponding to multiple sample datas;
The multiple sample data and the sample analysis result are subjected to corresponding storage, to generate the data results Collection;Or, the sample data is classified according to the keyword included in the sample data, by all kinds of sample datas And the sample analysis result is correspondingly stored, to generate the data results collection.
The user behavior that 8. according to the method for claim 1, the target data includes client and/or server reports Daily record data.
9. a kind of data analysis set-up, applied to network side, described device includes:
First acquisition module, obtain target data to be analyzed;
Matching module, concentrated in the data results of generation and the target data is matched, to obtain the number of targets According to corresponding target analysis result, wherein, the data results, which are concentrated, includes multiple data and its corresponding analysis result.
10. device according to claim 9, in addition to:
Extraction module, the keyword included in the target data is extracted, the target data is determined according to the keyword Data type;Wherein, the keyword and at least one in product type, data characteristics, data generation time, user profile It is related;
Accordingly, the matching module includes:
Determining unit, the data for determining to match with the data type are concentrated from the data results;
Matching unit, the analysis result according to corresponding to the data to match with the data type are carried out to the target data Matching.
11. the device according to claim 9 or 10, in addition to:
Configuration module, the target data is configured using the data rule being pre-configured with, so that described with the mesh postponed Mark data can be identified by the network side.
12. device according to claim 11, the configuration module includes:
Cutting unit, the target data is split using designated symbols, so that the target data meets the data The clause of specified by rules.
13. device according to claim 9, the matching module includes:
Judging unit, judge that whether the data results are concentrated comprising the target analysis knot to match with the target data Fruit;
First output unit, if the data results are concentrated comprising the target analysis knot to match with the target data Fruit, then export the target analysis result;
Second output unit, if the data results concentrate the target analysis knot for not including and matching with the target data Fruit, then the target data is sent to the corresponding device of manual analysis.
14. device according to claim 13, in addition to:
Second acquisition module, obtain the target analysis result obtained by manual analysis;
Update module, the target analysis result obtained according to the manual analysis update the data results collection.
15. device according to claim 9, in addition to:
3rd acquisition module, obtain sample analysis result corresponding to multiple sample datas;
Generation module, the multiple sample data and the sample analysis result are subjected to corresponding storage, to generate the number According to analysis result collection;Or, the sample data is classified according to the keyword included in the sample data, by all kinds of institutes State sample data and the sample analysis result is correspondingly stored, to generate the data results collection.
16. device according to claim 9, user's row that the target data includes client and/or server reports For daily record data.
17. a kind of data analysis system, is arranged at network side, the system includes:
Space in a newspaper in data, target data to be analyzed is reported to components of data analysis;
The components of data analysis, obtain the target data that space in a newspaper reports in the data;In the data results collection of generation In the target data is matched, to obtain target analysis result corresponding to the target data, wherein, data point Analysis result set includes multiple data and its corresponding analysis result.
18. system according to claim 17, in addition to:Manual analysis component;
The components of data analysis, judge that whether the data results are concentrated comprising the mesh to match with the target data Mark analysis result;If so, then export the target analysis result;If it is not, then the target data is sent to people's work point Analyse component;
The manual analysis component, exports the target data, to carry out manual analysis to the target data;Will be by described The target analysis result that manual analysis obtains is transmitted to the components of data analysis;
The components of data analysis, the target analysis result obtained according to the manual analysis update the data results Collection.
19. a kind of DAF, including:
Processor;And
It is arranged to store the memory of computer executable instructions, the executable instruction makes the processing when executed Device:
Obtain target data to be analyzed;
Concentrated in the data results of generation and the target data is matched, to obtain mesh corresponding to the target data Analysis result is marked, wherein, the data results, which are concentrated, includes multiple data and its corresponding analysis result.
20. a kind of storage medium, for storing computer executable instructions, the executable instruction is realized following when executed Flow:
Obtain target data to be analyzed;
Concentrated in the data results of generation and the target data is matched, to obtain mesh corresponding to the target data Analysis result is marked, wherein, the data results, which are concentrated, includes multiple data and its corresponding analysis result.
CN201710806187.8A 2017-09-08 2017-09-08 Data analysing method, apparatus and system Pending CN107766416A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710806187.8A CN107766416A (en) 2017-09-08 2017-09-08 Data analysing method, apparatus and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710806187.8A CN107766416A (en) 2017-09-08 2017-09-08 Data analysing method, apparatus and system

Publications (1)

Publication Number Publication Date
CN107766416A true CN107766416A (en) 2018-03-06

Family

ID=61265971

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710806187.8A Pending CN107766416A (en) 2017-09-08 2017-09-08 Data analysing method, apparatus and system

Country Status (1)

Country Link
CN (1) CN107766416A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110020665A (en) * 2019-02-12 2019-07-16 北京鑫汇普瑞科技发展有限公司 A kind of microbial biomass modal data analysis method being compatible with different flight mass spectrometers

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20010084391A (en) * 2000-02-25 2001-09-06 고설남 Question and Answer System
CN101076061A (en) * 2007-03-30 2007-11-21 腾讯科技(深圳)有限公司 Robot server and automatic chatting method
CN101193071A (en) * 2007-03-28 2008-06-04 腾讯科技(深圳)有限公司 A client service method, system and device based on instant communication
CN103279528A (en) * 2013-05-31 2013-09-04 俞志晨 Question-answering system and question-answering method based on man-machine integration
CN104391934A (en) * 2014-11-21 2015-03-04 深圳市银雁金融配套服务有限公司 Data calibration method and device
CN104516921A (en) * 2013-09-30 2015-04-15 华为技术有限公司 Automatic response method and device
CN104598445A (en) * 2013-11-01 2015-05-06 腾讯科技(深圳)有限公司 Automatic question-answering system and method
CN105591882A (en) * 2015-12-10 2016-05-18 北京中科汇联科技股份有限公司 Method and system for mixed customer services of intelligent robots and human beings

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20010084391A (en) * 2000-02-25 2001-09-06 고설남 Question and Answer System
CN101193071A (en) * 2007-03-28 2008-06-04 腾讯科技(深圳)有限公司 A client service method, system and device based on instant communication
CN101076061A (en) * 2007-03-30 2007-11-21 腾讯科技(深圳)有限公司 Robot server and automatic chatting method
CN103279528A (en) * 2013-05-31 2013-09-04 俞志晨 Question-answering system and question-answering method based on man-machine integration
CN104516921A (en) * 2013-09-30 2015-04-15 华为技术有限公司 Automatic response method and device
CN104598445A (en) * 2013-11-01 2015-05-06 腾讯科技(深圳)有限公司 Automatic question-answering system and method
CN104391934A (en) * 2014-11-21 2015-03-04 深圳市银雁金融配套服务有限公司 Data calibration method and device
CN105591882A (en) * 2015-12-10 2016-05-18 北京中科汇联科技股份有限公司 Method and system for mixed customer services of intelligent robots and human beings

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110020665A (en) * 2019-02-12 2019-07-16 北京鑫汇普瑞科技发展有限公司 A kind of microbial biomass modal data analysis method being compatible with different flight mass spectrometers

Similar Documents

Publication Publication Date Title
CN108595583A (en) Dynamic chart class page data crawling method, device, terminal and storage medium
CN107066449A (en) Information-pushing method and device
CN107608874A (en) Method of testing and device
CN105306495B (en) user identification method and device
CN107590291A (en) A kind of searching method of picture, terminal device and storage medium
CN107526718A (en) Method and apparatus for generating text
CN109697231A (en) A kind of display methods, system, storage medium and the processor of case document
CN106980667B (en) A kind of method and apparatus to article mark label
CN114357117A (en) Transaction information query method and device, computer equipment and storage medium
CN110362663A (en) Adaptive multi-sensing similarity detection and resolution
CN107741972A (en) A kind of searching method of picture, terminal device and storage medium
CN110119401A (en) Processing method, device, server and the storage medium of user's portrait
CN108694183A (en) A kind of search method and device
CN109828759A (en) Code compiling method, device, computer installation and storage medium
CN110209780A (en) A kind of question template generation method, device, server and storage medium
CN109447412A (en) Construct method, apparatus, computer equipment and the storage medium of business connection map
CN104580109B (en) Generation clicks the method and device of identifying code
CN107563394A (en) A kind of method and system of predicted pictures popularity
CN113450796B (en) Voice report generation method, device, equipment and storage medium
CN114676231A (en) Target information detection method, device and medium
CN107766416A (en) Data analysing method, apparatus and system
CN111666408A (en) Method and device for screening and displaying important clauses
CN111949655A (en) Form display method and device, electronic equipment and medium
CN114840634B (en) Information storage method and device, electronic equipment and computer readable medium
CN116226354A (en) Question and answer information determining method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20200923

Address after: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman, British Islands

Applicant after: Advanced innovation technology Co.,Ltd.

Address before: A four-storey 847 mailbox in Grand Cayman Capital Building, British Cayman Islands

Applicant before: Alibaba Group Holding Ltd.

Effective date of registration: 20200923

Address after: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman, British Islands

Applicant after: Innovative advanced technology Co.,Ltd.

Address before: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman, British Islands

Applicant before: Advanced innovation technology Co.,Ltd.

TA01 Transfer of patent application right
RJ01 Rejection of invention patent application after publication

Application publication date: 20180306

RJ01 Rejection of invention patent application after publication