CN117540107B - Data processing method and device, electronic equipment and storage medium - Google Patents

Data processing method and device, electronic equipment and storage medium Download PDF

Info

Publication number
CN117540107B
CN117540107B CN202410030664.6A CN202410030664A CN117540107B CN 117540107 B CN117540107 B CN 117540107B CN 202410030664 A CN202410030664 A CN 202410030664A CN 117540107 B CN117540107 B CN 117540107B
Authority
CN
China
Prior art keywords
attribute
data
intention
attribute data
input data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202410030664.6A
Other languages
Chinese (zh)
Other versions
CN117540107A (en
Inventor
郭云三
侍伟伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Tonghuashun Intelligent Technology Co Ltd
Original Assignee
Zhejiang Tonghuashun Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang Tonghuashun Intelligent Technology Co Ltd filed Critical Zhejiang Tonghuashun Intelligent Technology Co Ltd
Priority to CN202410030664.6A priority Critical patent/CN117540107B/en
Publication of CN117540107A publication Critical patent/CN117540107A/en
Application granted granted Critical
Publication of CN117540107B publication Critical patent/CN117540107B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/9032Query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/90335Query processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/904Browsing; Visualisation therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9538Presentation of query results
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/10Machine learning using kernel methods, e.g. support vector machines [SVM]

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Medical Informatics (AREA)
  • Computing Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The application provides a data processing method, a data processing device, electronic equipment and a storage medium, and relates to the technical field of computers; the method comprises the following steps: receiving input data input by a target object; extracting attribute data from the input data to obtain at least one attribute data; determining an intention category corresponding to the attribute data, wherein the intention category comprises at least one of the following: comparison, trend, composition, ranking, distribution, duty cycle, relationship, space; and carrying out data expansion on the input data according to the attribute data and the intention category corresponding to the attribute data to obtain at least one target intention. Therefore, by extracting the attribute data from the input data and combining the intention type of the attribute data, the attribute data can be effectively utilized, and a plurality of target intentions corresponding to the input data are provided for the target object by considering the intention type corresponding to the attribute data, so that the problem of inaccurate intention generation caused by the fact that the generation of the unique intention is only carried out based on the actual input data is avoided.

Description

Data processing method and device, electronic equipment and storage medium
1.1.1 Field of technology
The present application relates to the field of computer technologies, and in particular, to a data processing method, a data processing device, an electronic device, and a storage medium.
1.1.2 Background art
With the development of the internet, various application systems built on the internet are also endless. In order to improve the user experience of an application system, a question and answer function is generally provided for a user in the process of using the application system, and when a user inputs a question, the intention of the user is analyzed through the question input by the user so as to provide an answer matched with the intention of the user. The current intention is usually generated by converting a sentence into a SOL sentence after the application system receives the sentence input by the user, and directly selecting related field data from a data set pre-stored in the application system based on the SQL sentence, wherein the related field data is used as the intention. The current intention generation mode is used for generating the intention only based on the sentence actually input by the user, but sometimes the intention is difficult to describe by the user, and the directions of different users focusing on the same sentence can be different, if the intention generation is also performed based on the actually input sentence, the generated intention of the user is inaccurate, and the user experience is influenced.
1.1.3 Summary of the invention
The embodiment of the application provides a data processing method, a data processing device, electronic equipment and a storage medium.
According to a first aspect of the present application there is provided a data processing method comprising: receiving input data input by a target object; extracting attribute data from the input data to obtain at least one attribute data; determining an intention category corresponding to the attribute data, wherein the intention category comprises at least one of the following: comparison, trend, composition, ranking, distribution, duty cycle, relationship, space; and carrying out data expansion on the input data according to the attribute data and the intention category corresponding to the attribute data to obtain at least one target intention.
According to an embodiment of the application, the method further comprises: and visually displaying the at least one target intention.
According to an embodiment of the present application, extracting attribute data from the input data includes: extracting attribute data from the input data, including: carrying out natural language analysis on the input data to obtain an analysis result; converting the analysis result into a set statement form to obtain original data; and extracting the attributes of the original data to obtain at least one attribute data, wherein the attribute data comprises attribute words and attribute types.
According to an embodiment of the present application, the attribute types include a nominal type attribute, an ordered type attribute, and a measured type attribute.
According to an embodiment of the present application, the determining the intention category corresponding to the attribute data includes: combining the attribute data with a plurality of preset intention categories to obtain a combined result; determining a plurality of visual combinations meeting visual standards from the combination result; and determining the intention category corresponding to the attribute data according to the plurality of visual combinations.
According to an embodiment of the present application, according to the attribute data and the intention category corresponding to the attribute data, the data expansion of the input data includes: determining analysis logic corresponding to the attribute data according to a preset map library, the attribute type corresponding to the attribute data and the intention type; and carrying out data expansion on the input data according to the analysis logic of the attribute data.
According to an embodiment of the present application, the preset map library includes attribute maps of a plurality of attribute types, and the attribute maps include a plurality of intention categories and analysis logic corresponding to each intention category.
According to a second aspect of the present application, there is provided a data processing apparatus comprising: the receiving module is used for receiving input data input by a target object; the extraction module is used for extracting attribute data from the input data to obtain at least one attribute data; the determining module is used for determining an intention category corresponding to the attribute data, wherein the intention category comprises at least one of the following: comparison, trend, composition, ranking, distribution, duty cycle, relationship, space; and the expansion module is used for carrying out data expansion on the input data according to the attribute data and the intention category corresponding to the attribute data to obtain at least one target intention.
According to a third aspect of the present application, there is provided an electronic device comprising:
at least one processor; and
A memory communicatively coupled to the at least one processor; wherein,
The memory stores instructions executable by the at least one processor to enable the at least one processor to perform the methods of the present application.
According to a fourth aspect of the present application there is provided a non-transitory computer readable storage medium storing computer instructions for causing the computer to perform the method of the present application.
The method of the embodiment of the application receives the input data input by the target object; extracting attribute data from the input data to obtain at least one attribute data; determining the intention category corresponding to the attribute data; and carrying out data expansion on the input data according to the attribute data and the intention category corresponding to the attribute data to obtain at least one target intention. Therefore, by extracting the attribute data from the input data and combining the intention type of the attribute data, the attribute data corresponding to the input data can be effectively utilized, and a plurality of target intentions corresponding to the input data are provided for the target object in consideration of the intention type corresponding to the attribute data, so that the problem that the intention generation is inaccurate due to the fact that only the unique intention generation is carried out on the basis of the actual input data is avoided, the plurality of target intentions corresponding to the input data are fully considered, and the accuracy of the intention generation is improved.
It should be understood that the teachings of the present application need not achieve all of the benefits set forth above, but rather that certain technical solutions may achieve certain technical effects, and that other embodiments of the present application may also achieve benefits not set forth above.
1.1.4 Illustrations of the drawings
The above, as well as additional purposes, features, and advantages of exemplary embodiments of the present application will become readily apparent from the following detailed description when read in conjunction with the accompanying drawings. Several embodiments of the present application are illustrated by way of example, and not by way of limitation, in the figures of the accompanying drawings and in which:
in the drawings, the same or corresponding reference numerals indicate the same or corresponding parts.
Fig. 1 is a schematic diagram of an implementation flow of a data processing method according to an embodiment of the present application;
fig. 2 is a schematic diagram of an implementation flow of a data extraction method of a data processing method according to an embodiment of the present application;
Fig. 3 is a schematic implementation flow diagram of an intention category determining method of a data processing method according to an embodiment of the present application;
Fig. 4 is a schematic implementation flow diagram of a data expansion method of the data processing method according to an embodiment of the present application;
Fig. 5 is a schematic implementation flow diagram of a specific application example of a data processing method according to an embodiment of the present application;
Fig. 6 is a schematic diagram showing a composition structure of a data processing apparatus according to an embodiment of the present application.
1.1.5 Detailed description of the invention
In order to make the objects, features and advantages of the present application more comprehensible, the technical solutions according to the embodiments of the present application will be clearly described in the following with reference to the accompanying drawings, and it is obvious that the described embodiments are only some embodiments of the present application, not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to be within the scope of the application.
Fig. 1 shows a schematic implementation flow chart of a data processing method according to an embodiment of the present application.
Referring to fig. 1, an embodiment of the present application provides an information processing method, including: operation 101, receiving input data input by a target object; operation 102, extracting attribute data from input data to obtain at least one attribute data; operation 103, determining an intention category corresponding to the attribute data, wherein the intention category comprises at least one of the following: comparison, trend, composition, ranking, distribution, duty cycle, relationship, space; and 104, performing data expansion on the input data according to the attribute data and the intention category corresponding to the attribute data to obtain at least one target intention.
In operation 101, input data input by a target object is received.
Specifically, the embodiment of the application can be applied to various application systems with user questioning functions, wherein the application systems comprise, but are not limited to, an e-commerce system, a financial system, various large platforms and various application software. The target object may include a user of the application system, such as an investor, buyer, etc.
Furthermore, in the process that the target object uses the application system, under the condition that self-service problem consultation is required, input data input by a user can be received first. The input data received from the user may be input data received from a question-answer interaction page provided by the user in the application system.
For example, some investors usually need to know about the invested company through some relevant platforms or application systems before they invest, at this time, the investors can ask questions through relevant platforms, for example, input "how much the company a is red", and at this time, input data input by the target object, that is, "how much the company a is red", can be detected.
In operation 102, attribute data extraction is performed on the input data to obtain at least one attribute data.
In particular, data may be classified into a plurality of attribute types, for example, a nominal (Categorical) data attribute (abbreviated as N), an ordered (Ordinal) data attribute (abbreviated as O), and a measured (Interval) data attribute (abbreviated as I). Where N is the most basic attribute type used to represent classification or tag data, there is no order or size relationship between the various classes, e.g., color (red, green, blue), gender (male, female), and country (united states, uk, china) all belong to N. O represents data having a sequential or hierarchical relationship, but cannot accurately measure the magnitude of a difference, e.g., education level (primary, middle, high, college) and product ratings (low, middle, high) all belong to O, where some values are known to be greater or lesser than others, but cannot determine the magnitude of a particular difference. I represents continuous data with equidistant relationships, the magnitude of the difference can be measured, but without absolute zero, e.g., temperature (degrees celsius, degrees fahrenheit) is I, where the differences between the values can be compared and the interval between them calculated.
Further, extracting attribute data from the input data to obtain a plurality of attribute data corresponding to the input data, wherein the attribute data is used for representing a section of data corresponding to each attribute type in the input data. For example, "how company a is red, company a represents attribute data corresponding to Nominal data attributes, and red represents attribute data corresponding to Ordinal data attributes.
In an embodiment of the present application, attribute data extraction may be performed by performing data classification on input data, specifically, performing attribute type classification on the input data according to feature information of each attribute type, and classifying attribute data corresponding to each attribute type in the input data.
Specifically, each data attribute type has respective feature information, some feature information that each attribute type has certain exists can be obtained by performing feature information analysis on each attribute type, and then according to a plurality of feature information corresponding to each attribute type, the fields of a plurality of feature information with corresponding attribute types in the input data are determined to be attribute data corresponding to the attribute types.
In an embodiment of the present application, an attribute data extraction model may be trained in advance, and attribute data acquisition may be performed on input data by using the attribute data extraction model. The attribute data extraction model can be obtained by training a common neural network model by taking a large amount of training data as input and taking target attribute data of each training data under each attribute type as output, and the specific training process can refer to a conventional neural network model training process, for example, a convolutional neural network model training process, which is not described herein.
It should be noted that, the method of extracting attribute data is not particularly limited in the present application, and any extracting method capable of extracting attribute data corresponding to input data belongs to the protection scope of the present application, for example, attribute data may be extracted by using a named entity recognition model.
Further, although there are a plurality of attribute types, the input data is not associated with the attribute data corresponding to each attribute type, if there is no corresponding field in the input data when extracting a certain attribute type, the default input data is no attribute data of the attribute type, and the existing attribute data is extracted, so that the attribute data extracted by the attribute data is at least one.
In operation 103, an intention category corresponding to the attribute data is determined, the intention category including at least one of: contrast, trend, composition, ranking, distribution, duty cycle, relationship, space.
In an embodiment of the present application, the intention category may be a common visual intention category, specifically including contrast, trend, composition, ranking, distribution, duty ratio, relationship, space, and the like.
The visual intention category corresponding to each attribute data is different, and at least one intention category corresponding to each attribute type can be preconfigured;
After the attribute data is determined, the corresponding intention category is determined according to the attribute type of the attribute data. Wherein the intent category may be considered at least one intent category.
In operation 104, according to the attribute data and the intention category corresponding to the attribute data, data expansion is performed on the input data to obtain at least one target intention.
And respectively carrying out data expansion on each attribute data from the angle of the corresponding intention category to obtain at least one target intention obtained by carrying out data expansion on the input data according to each attribute data.
In one embodiment of the application, after the at least one target intention is determined, the at least one target intention is visually presented.
Therefore, according to the embodiment of the application, the corresponding attribute data is extracted from the input data, then the intention type corresponding to each attribute data is determined, the input data is expanded from the angles of the attribute data and the intention type, the attribute data of the input data are fully utilized, various intention types possibly involved are considered, the generated intention data is more comprehensive, and the accuracy is higher.
Fig. 2 is a schematic flow chart of an implementation of a data extraction method of the data processing method according to an embodiment of the present application.
Referring to fig. 2, in an embodiment of the present application, the operation 102 performs attribute data extraction on the input data, including:
operation 201, performing natural language analysis on input data to obtain an analysis result;
Operation 202, converting the analysis result into a set statement form to obtain original data;
And 203, extracting the attributes of the original data to obtain at least one attribute data, wherein the attribute data comprises an attribute word and an attribute type.
In operations 201 to 202, input data is first converted into a set statement form that can be processed by a computer, so as to obtain original data, where the set statement form may be an SQL statement. The specific conversion method can refer to a conventional SQL statement conversion method, and is not described herein.
In operation 203, the raw data is identified by a pre-trained named entity model (NAMED ENTITY Recognizion, NER) to obtain attribute data including attribute words and attribute types. The NER model may be trained to extract attribute words corresponding to a plurality of attribute types in the original data, where the training process of the NER model may refer to a conventional training process of the NER model, which is not described herein.
For example, the input data is "company a reddening", the input data is converted into the SOL sentence form and then input into the NER model, the NER model can output attribute data 1 with attribute type N and attribute word company a; and attribute type 1, attribute word is attribute data 2 of the segmentation.
Fig. 3 is a schematic implementation flow diagram of an intention type determining method of a data processing method according to an embodiment of the present application.
Referring to fig. 3, in an embodiment of the present application, the determining an intention category corresponding to attribute data in operation 103 includes:
In operation 301, the attribute data is combined with a plurality of preset intention categories to obtain a combined result.
And combining each attribute data with each preset intention category to obtain a plurality of combinations, namely a combination result.
For example, in the case where the attribute data is 3, 24 combinations, that is, the combination result, are obtained by combining the attribute data (3) and the preset intention category (8).
At operation 302, a plurality of visualization combinations that meet the visualization criteria are determined from the combined results.
After all the combinations are obtained, it is required to determine that the combinations are visualizable, i.e. visualization criteria, so as to determine a plurality of visualizable combinations in the combination result. The visualization standard may be configured according to practical situations, and the present application is not limited in particular.
In an embodiment of the present application, when determining the visualization combination, in addition to determining whether the visualization criteria are met, additional conditions may be added to determine the best intention category corresponding to each attribute data. For example, additional conditions other than whether visualization is possible may be added to the visualization standard, for example, taking the combination with the highest similarity as the best combination corresponding to each attribute data, and the best combination is the visualization combination corresponding to the current attribute data.
And operation 303, determining the intention category corresponding to the attribute data according to the plurality of visual combinations.
Each visual combination shows one attribute data and one intention category, the visual combination corresponding to each attribute data is determined, and the intention category is determined from the corresponding visual combination, namely the intention category of each attribute data is obtained. Wherein the intention category of each attribute data is at least one.
The operations 301-303 may be implemented by a support vector machine trained in advance, and the training process of the support vector machine may include: acquiring a training set, wherein the training set comprises a plurality of training combination data and training results corresponding to each training combination data, the training combination data comprises training attribute words and training intention categories, and the training results comprise whether the corresponding training combination data can be visualized or not; and training the support vector machine through a training set, evaluating the support vector machine, and determining that the support vector machine training is completed under the condition that the error of the support vector machine meets the requirement.
Fig. 4 is a schematic implementation flow diagram of a data expansion method of the data processing method according to an embodiment of the present application.
Referring to fig. 4, in an embodiment of the present application, the operation 104 of performing data expansion on the input data according to the attribute data and the intention category corresponding to the attribute data includes:
in operation 401, according to a preset map library, an attribute type corresponding to the attribute data, and an intention type, determining analysis logic corresponding to the attribute data.
In an embodiment of the present application, the preset map library includes attribute maps of a plurality of attribute types, and the attribute maps include a plurality of intention categories and analysis logic corresponding to each intention category.
For different attribute types, different attribute maps can be constructed in advance, and the process of constructing the attribute maps can comprise the following steps: acquiring training data, which may be a large amount of unstructured or semi-structured data, such as text documents, web page content, log files, etc.; preprocessing training data, such as text cleaning, word segmentation and the like; identifying attribute words in the preprocessed training data by using the NER model, for example, attribute words corresponding to a certain attribute type; determining association relations among a plurality of attribute words, wherein the association relations can be obtained through a common relation extraction model and are not described in detail herein; according to the association relation between the attribute words, the attribute words are taken as nodes, the association relation is taken as an edge, and an attribute map of the attribute type corresponding to the attribute words, which is also called an attribute relation map, such as an I corresponding index relation map, is constructed. The association relationship may refer to a relationship between two attribute words, for example, an industry, a competitor, association, comparison, and the like, which is not limited by the present application.
And then training the association relationship in the attribute map corresponding to each attribute type and the relationship of the intention category through the large language model in advance, and enabling the large language model to have the function of inputting the intention category through training to obtain the association relationship.
And inputting the intention category corresponding to each attribute data into the large model through the large language model trained in advance, so that the association relationship under the corresponding attribute type can be obtained. Wherein the association relationship is analysis logic.
Operation 402, performing data expansion on the input data according to the analysis logic of the attribute data.
After determining the analysis logic of each attribute data, expanding the corresponding attribute words in the input data according to the analysis logic. For example, assuming that the analysis logic is the industry, the input data is "net profit of company a", and the attribute word of the currently expanded attribute data is company a, the input data may be expanded to "net profit of company a and company B and company C of the industry to which the input data belongs".
In order to further understand the technical solution of the embodiments of the present application, a specific application example is described below.
Fig. 5 is a schematic implementation flow diagram of a specific application example of a data processing method according to an embodiment of the present application.
Referring to fig. 5, this specific application example of the present application includes:
S1, receiving a user question.
Specifically, a user question input by a user on a question-answer interaction page provided by an application system is received, and the user question is input data. Illustratively, the user question may be "company A net profit".
S2, determining at least one attribute type corresponding to the user question and an attribute word corresponding to each attribute type according to the set attribute types.
Specifically, the attribute type includes I, N, O, and the attribute word corresponding to I, N, O in the user question is extracted. Illustratively, taking the user question as "net profit of company a" as an example, the corresponding attribute word of N is "company a", and the corresponding attribute word of I is "net profit". The manner in which the attribute terms are obtained may refer to the description of operation 102 in fig. 1, which is not repeated here.
S3, predicting the intention category of each attribute type according to a pre-trained support vector machine to obtain the intention category corresponding to each attribute type.
Specifically, the support vector machine is trained in advance, and the training process of the support vector machine may include: acquiring a training set, wherein each training sample in the training set comprises an attribute type and a visual result of the attribute type and each set intention category, and the visual result comprises a proper presentation and a non-proper presentation; and training the support vector machine through the training set so that the support vector machine outputs the capability of the intention category which corresponds to the attribute type and is suitable for presentation.
The set intention category may be comparison, trend, composition, ranking, distribution, duty ratio, relationship, space.
And inputting the acquired attribute types into a support training machine to obtain the intention category which corresponds to each attribute type and is suitable for presentation (namely, suitable for visualization).
S4, obtaining analysis logic of the intention category corresponding to each attribute type.
Specifically, a knowledge graph corresponding to each attribute type, namely the attribute graph, is constructed in advance, wherein the attribute graph takes an attribute word as a node and takes an association relationship as an edge. After the intention category of each attribute type is obtained, the association relationship corresponding to the intention category of each attribute type is obtained, and the association relationship is analysis logic. An exemplary analysis logic may be another company of the industry, and assuming that the ranking corresponds to another company of the industry in the attribute map corresponding to N, the "other company of the industry to which the analysis logic belongs" may be queried through the "ranking" of the intention category.
And S5, expanding data according to the analysis logic to obtain at least one target intention.
Specifically, the data expansion corresponding to the analysis logic is performed on the attribute words corresponding to each analysis logic, for example, "net profit of company a" is expanded to "net profit of company a and other companies in the industry to which the company belongs", so that at least one target intention is obtained after the data expansion is performed on the attribute words corresponding to each attribute type in the user question.
Fig. 6 is a schematic diagram showing a composition structure of a data processing apparatus according to an embodiment of the present application.
Referring to fig. 6, based on the above data processing method, an embodiment of the present application further provides a data processing apparatus, where the apparatus includes: a receiving module 501, configured to receive input data input by a target object; the extraction module 502 is configured to perform attribute data extraction on the input data to obtain at least one attribute data; a determining module 503, configured to determine an intention category corresponding to the attribute data, where the intention category includes at least one of: comparison, trend, composition, ranking, distribution, duty cycle, relationship, space; the expansion module 504 is configured to perform data expansion on the input data according to the attribute data and the intention category corresponding to the attribute data, so as to obtain at least one target intention.
In one embodiment of the present application, the apparatus further comprises: and the display module is used for visually displaying at least one target intention.
In one embodiment of the present application, the extraction module 502 includes: the analysis sub-module is used for carrying out natural language analysis on the input data to obtain an analysis result; the conversion sub-module is used for converting the analysis result into a set statement form to obtain original data; and the extraction sub-module is used for extracting the attributes of the original data to obtain at least one attribute data, wherein the attribute data comprises an attribute word and an attribute type.
In one embodiment of the present application, the determining module 503 includes: the combination sub-module is used for combining the attribute data with a plurality of preset intention categories to obtain a combination result; a first determining sub-module for determining a plurality of visual combinations meeting the visual standard from the combination result; and the second determining submodule is used for determining the intention category corresponding to the attribute data according to the plurality of visual combinations.
In one embodiment of the present application, the expansion module 504 includes: the third determining module is used for determining analysis logic corresponding to the attribute data according to a preset map library, the attribute type corresponding to the attribute data and the intention type; and the expansion sub-module is used for carrying out data expansion on the input data according to the analysis logic of the attribute data.
It should be noted that, the description of the apparatus according to the embodiment of the present application is similar to the description of the embodiment of the method described above, and has similar beneficial effects as the embodiment of the method, so that a detailed description is omitted. The technical details of the data processing apparatus according to the embodiment of the present application that are not described in detail may be understood from the description of any one of fig. 1 to 5.
According to an embodiment of the present application, the present application also provides an electronic device and a non-transitory computer-readable storage medium.
It should be appreciated that various forms of the flows shown above may be used to reorder, add, or delete steps. For example, the steps described in the present application may be performed in parallel, sequentially, or in a different order, so long as the desired results of the technical solution disclosed in the present application can be achieved, and are not limited herein.
The foregoing is merely illustrative of the present application, and the present application is not limited thereto, and any person skilled in the art will readily recognize that variations or substitutions are within the scope of the present application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims (7)

1. A method of data processing, the method comprising:
receiving input data input by a target object;
Extracting attribute data from the input data to obtain at least one attribute data;
determining an intention category corresponding to the attribute data, wherein the intention category comprises at least one of the following: comparison, trend, composition, ranking, distribution, duty cycle, relationship, space;
According to the attribute data and the intention category corresponding to the attribute data, carrying out data expansion on the input data to obtain at least one target intention;
Wherein, according to the attribute data and the intention category corresponding to the attribute data, the data expansion is performed on the input data, including:
determining analysis logic corresponding to the attribute data according to a preset map library, the attribute type corresponding to the attribute data and the intention type corresponding to the attribute data;
According to the analysis logic of the attribute data, carrying out data expansion on the input data;
The preset map library comprises attribute maps of a plurality of attribute types, wherein the attribute maps comprise a plurality of intention categories and analysis logic corresponding to each intention category;
the attribute data comprises attribute words and attribute types;
the analyzing logic for performing data expansion on the input data according to the attribute data includes: expanding corresponding attribute words in the input data according to the analysis logic;
The extracting attribute data from the input data to obtain at least one attribute data includes: classifying the attribute types of the input data according to the characteristic information of each attribute type, and classifying attribute data corresponding to each attribute type in the input data to obtain at least one attribute data;
the analysis logic is an association relationship, and the association relationship is a relationship between two attribute words;
the determining the intention category corresponding to the attribute data comprises the following steps:
Combining the attribute data with a plurality of preset intention categories to obtain a combined result;
Determining a plurality of visual combinations meeting visual standards from the combination result;
And determining the intention category corresponding to the attribute data according to the plurality of visual combinations.
2. The method according to claim 1, wherein the method further comprises:
And visually displaying the at least one target intention.
3. The method of claim 1, wherein extracting attribute data from the input data comprises:
carrying out natural language analysis on the input data to obtain an analysis result;
Converting the analysis result into a set statement form to obtain original data;
and extracting the attribute of the original data to obtain at least one attribute data.
4. A method according to claim 3, wherein the attribute types include nominal data attributes, ordered data attributes, and measured data attributes.
5. A data processing apparatus, the apparatus comprising:
The receiving module is used for receiving input data input by a target object;
The extraction module is used for extracting attribute data from the input data to obtain at least one attribute data;
The determining module is used for determining an intention category corresponding to the attribute data, wherein the intention category comprises at least one of the following: comparison, trend, composition, ranking, distribution, duty cycle, relationship, space;
The expansion module is used for carrying out data expansion on the input data according to the attribute data and the intention category corresponding to the attribute data to obtain at least one target intention;
Wherein, expand the module, include: the third determining module is used for determining analysis logic corresponding to the attribute data according to a preset map library, the attribute type corresponding to the attribute data and the intention type corresponding to the attribute data; the expansion sub-module is used for carrying out data expansion on the input data according to the analysis logic of the attribute data;
The preset map library comprises attribute maps of a plurality of attribute types, wherein the attribute maps comprise a plurality of intention categories and analysis logic corresponding to each intention category;
the attribute data comprises attribute words and attribute types;
the analyzing logic for performing data expansion on the input data according to the attribute data includes: expanding corresponding attribute words in the input data according to the analysis logic;
The extracting attribute data from the input data to obtain at least one attribute data includes: classifying the attribute types of the input data according to the characteristic information of each attribute type, and classifying attribute data corresponding to each attribute type in the input data to obtain at least one attribute data;
the analysis logic is an association relationship, and the association relationship is a relationship between two attribute words;
the determining the intention category corresponding to the attribute data comprises the following steps:
Combining the attribute data with a plurality of preset intention categories to obtain a combined result;
Determining a plurality of visual combinations meeting visual standards from the combination result;
And determining the intention category corresponding to the attribute data according to the plurality of visual combinations.
6. An electronic device, comprising:
at least one processor; and
A memory communicatively coupled to the at least one processor; wherein,
The memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of any one of claims 1-4.
7. A non-transitory computer readable storage medium storing computer instructions for causing a computer to perform the method of any one of claims 1-4.
CN202410030664.6A 2024-01-09 2024-01-09 Data processing method and device, electronic equipment and storage medium Active CN117540107B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202410030664.6A CN117540107B (en) 2024-01-09 2024-01-09 Data processing method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202410030664.6A CN117540107B (en) 2024-01-09 2024-01-09 Data processing method and device, electronic equipment and storage medium

Publications (2)

Publication Number Publication Date
CN117540107A CN117540107A (en) 2024-02-09
CN117540107B true CN117540107B (en) 2024-05-07

Family

ID=89790394

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202410030664.6A Active CN117540107B (en) 2024-01-09 2024-01-09 Data processing method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN117540107B (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107861981A (en) * 2017-09-28 2018-03-30 北京奇艺世纪科技有限公司 A kind of data processing method and device
CN113868493A (en) * 2021-09-13 2021-12-31 浪潮通信信息系统有限公司 Visual chart recommendation method
CN113946657A (en) * 2021-10-22 2022-01-18 唐亮 Knowledge reasoning-based automatic identification method for power service intention
CN114328799A (en) * 2021-11-10 2022-04-12 腾讯科技(深圳)有限公司 Data processing method, device and computer readable storage medium
WO2022212563A1 (en) * 2021-03-30 2022-10-06 Tableau Software, LLC Applying a visual analytics intent language to generate data visualizations
CN115827612A (en) * 2022-11-25 2023-03-21 中电金信软件有限公司 Data attribute generation method and device and electronic equipment
CN116167355A (en) * 2021-11-25 2023-05-26 中移(杭州)信息技术有限公司 Intention recognition method, device, equipment and storage medium
CN116244412A (en) * 2022-12-20 2023-06-09 北京航天云路有限公司 Multi-intention recognition method and device

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040064450A1 (en) * 2002-09-30 2004-04-01 Kabushiki Kaisha Toshiba Method for preparing data to be analyzed, data analysis method, data analysis device, data preparation program, data analysis program, data prediction device, data prediction method, data prediction program and computer
US11836172B2 (en) * 2021-06-22 2023-12-05 Adobe Inc. Facilitating generation of data visualizations via natural language processing

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107861981A (en) * 2017-09-28 2018-03-30 北京奇艺世纪科技有限公司 A kind of data processing method and device
WO2022212563A1 (en) * 2021-03-30 2022-10-06 Tableau Software, LLC Applying a visual analytics intent language to generate data visualizations
CN113868493A (en) * 2021-09-13 2021-12-31 浪潮通信信息系统有限公司 Visual chart recommendation method
CN113946657A (en) * 2021-10-22 2022-01-18 唐亮 Knowledge reasoning-based automatic identification method for power service intention
CN114328799A (en) * 2021-11-10 2022-04-12 腾讯科技(深圳)有限公司 Data processing method, device and computer readable storage medium
CN116167355A (en) * 2021-11-25 2023-05-26 中移(杭州)信息技术有限公司 Intention recognition method, device, equipment and storage medium
CN115827612A (en) * 2022-11-25 2023-03-21 中电金信软件有限公司 Data attribute generation method and device and electronic equipment
CN116244412A (en) * 2022-12-20 2023-06-09 北京航天云路有限公司 Multi-intention recognition method and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
基于网格LSTM混合算法的地质领域用户意图识别;贺金龙;付立军;姚郑;吕鹏飞;黄徐胜;;计算机系统应用;20201013(第10期);第44-52页 *

Also Published As

Publication number Publication date
CN117540107A (en) 2024-02-09

Similar Documents

Publication Publication Date Title
CN112667794A (en) Intelligent question-answer matching method and system based on twin network BERT model
Kiefer Assessing the Quality of Unstructured Data: An Initial Overview.
US8874581B2 (en) Employing topic models for semantic class mining
US10824816B2 (en) Semantic parsing method and apparatus
CN106778878B (en) Character relation classification method and device
CN111428503B (en) Identification processing method and processing device for homonymous characters
Addepalli et al. A proposed framework for measuring customer satisfaction and product recommendation for ecommerce
CN118193806A (en) Target retrieval method, target retrieval device, electronic equipment and storage medium
CN109815391A (en) News data analysis method and device, electric terminal based on big data
CN109684473A (en) A kind of automatic bulletin generation method and system
Groher et al. Leveraging AI-based decision support for opportunity analysis
CN107291686B (en) Method and system for identifying emotion identification
Pajankar et al. Preparing data for machine learning
KR102663632B1 (en) Device and method for artwork trend data prediction using artificial intelligence
CN112084376A (en) Map knowledge based recommendation method and system and electronic device
CN117540107B (en) Data processing method and device, electronic equipment and storage medium
CN111966836A (en) Knowledge graph vector representation method and device, computer equipment and storage medium
CN111723302A (en) Recommendation method based on collaborative dual-model deep representation learning
Han et al. Developing smart service concepts: morphological analysis using a Novelty-Quality map
US20230126022A1 (en) Automatically determining table locations and table cell types
Chiarello et al. Design and implementation of a text mining-based tool to support scoping reviews
Scholz et al. Extraction of statements in news for a media response analysis
Ji et al. Design and Application of Mapping Model for Font Recommendation System Based on Contents Emotion Analysis
JP2017188025A (en) Data analysis system, control method thereof, program, and recording medium
Erfina et al. Indonesian Analysis Sentiment on Non Fungible Token (NFT)

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant