CN111159386A - Data generation method and device, electronic equipment and storage medium - Google Patents

Data generation method and device, electronic equipment and storage medium Download PDF

Info

Publication number
CN111159386A
CN111159386A CN201911174761.8A CN201911174761A CN111159386A CN 111159386 A CN111159386 A CN 111159386A CN 201911174761 A CN201911174761 A CN 201911174761A CN 111159386 A CN111159386 A CN 111159386A
Authority
CN
China
Prior art keywords
target
data
generated
adopting
hbase
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911174761.8A
Other languages
Chinese (zh)
Inventor
吴名先
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
OneConnect Smart Technology Co Ltd
OneConnect Financial Technology Co Ltd Shanghai
Original Assignee
OneConnect Financial Technology Co Ltd Shanghai
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by OneConnect Financial Technology Co Ltd Shanghai filed Critical OneConnect Financial Technology Co Ltd Shanghai
Priority to CN201911174761.8A priority Critical patent/CN111159386A/en
Priority to PCT/CN2020/086799 priority patent/WO2021103409A1/en
Publication of CN111159386A publication Critical patent/CN111159386A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/335Filtering based on additional data, e.g. user or group profiles
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/355Class or cluster creation or modification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/40Transformation of program code
    • G06F8/41Compilation
    • G06F8/42Syntactic analysis
    • G06F8/427Parsing

Abstract

The invention provides a data generation method, a data generation device, electronic equipment and a storage medium. The data generation method can acquire the trigger interface of the data generation instruction when receiving the data generation instruction, identify all characters on the trigger interface, extracting attribute data of data to be generated, encapsulating the attribute data to obtain a target character string, analyzing the target character string to obtain a target parameter, further analyzing the target parameter to obtain a target type, calling a row key generation strategy corresponding to the target type from an HBASE library, further generating a target row key of the target parameter in the HBASE library, acquiring a target column name from the target parameter, generating an intermediate data file according to the target column name and the target row key, writing the intermediate data file into the HBASE library, the data to be generated can be obtained, data processing can be carried out based on the HBASE library, a large amount of data can be generated, and the data generation efficiency can be improved.

Description

Data generation method and device, electronic equipment and storage medium
Technical Field
The present invention relates to the field of data processing technologies, and in particular, to a data generation method and apparatus, an electronic device, and a storage medium.
Background
In a large data test project, a large amount of test data is often needed, however, in the existing technical solution, the amount of data generated based on the traditional database is not enough to support the large data test project, and meanwhile, the efficiency of data generation is low because a large amount of algorithms are run on the bottom layer of the traditional database. Therefore, how to efficiently generate mass data becomes an urgent problem to be solved.
Disclosure of Invention
In view of the above, it is desirable to provide a data generation method, apparatus, electronic device, and storage medium, which can generate a large amount of data based on the HBASE library and can also improve the efficiency of data generation.
A method of data generation, the method comprising:
when a data generation instruction is received, acquiring a trigger interface of the data generation instruction;
recognizing all characters on the trigger interface by adopting a character recognition technology;
extracting at least one keyword from all the characters to obtain attribute data of data to be generated;
packaging the attribute data by adopting a JSON technology to obtain a target character string;
analyzing the target character string by adopting a JSON technology to obtain a target parameter;
analyzing the target parameter by adopting a shallow semantic analysis method to obtain a target type of the target parameter;
calling a row key generation strategy corresponding to the target type from an HBASE library;
generating a target row key of the target parameter in the HBASE library based on the row key generation strategy;
acquiring a target column name from the target parameter;
generating an intermediate data file according to the target column name and the target row key;
and writing the intermediate data file into the HBASE library by adopting a calculation engine to obtain the data to be generated.
According to the preferred embodiment of the present invention, the encapsulating the attribute data by using the JSON technology to obtain the target character string includes:
calling a target label corresponding to each attribute data in the attribute data from the HBASE library;
and packaging each attribute data and the target tag corresponding to each attribute data by adopting a JSON technology to obtain the target character string.
According to the preferred embodiment of the present invention, the analyzing the target character string by using JSON technology to obtain the target parameter includes:
acquiring all target labels in the target character string;
and extracting data corresponding to each target label in all the target labels as the target parameters.
According to a preferred embodiment of the present invention, the generating an intermediate data file according to the target column name and the target row key includes:
calling out target data corresponding to the target column name from the HBASE library;
determining a target position according to the target column name and the target row key;
and writing the target data into the target position to obtain the intermediate data file.
According to the preferred embodiment of the present invention, the writing the intermediate data file into the HBASE library by using the calculation engine to obtain the data to be generated includes:
dividing the intermediate data file into at least one data block according to the file size of the intermediate data file;
calculating the number of the at least one data block to obtain the target number;
creating an HBASE table in the HBASE library;
partitioning the HBASE table by taking the target number as the partition number;
and writing the at least one data block into the partitioned HBASE table by adopting a calculation engine to obtain the data to be generated.
According to a preferred embodiment of the present invention, after obtaining the data to be generated, the method further includes:
acquiring the table name and the generation time of the data to be generated;
generating prompt information according to the table name and the generation time;
and sending the prompt message to the terminal equipment of the appointed contact person.
According to a preferred embodiment of the present invention, after obtaining the data to be generated, the method further includes:
encrypting the data to be generated by adopting a symmetric encryption technology to obtain a ciphertext;
and when the ciphertext decryption is detected to be successful, displaying the data to be generated.
A data generation apparatus, the apparatus comprising:
the acquisition unit is used for acquiring a trigger interface of the data generation instruction when the data generation instruction is received;
the identification unit is used for identifying all characters on the trigger interface by adopting a character identification technology;
the extraction unit is used for extracting at least one keyword from all the characters to obtain attribute data of data to be generated;
the packaging unit is used for packaging the attribute data by adopting JSON technology to obtain a target character string;
the analysis unit is used for analyzing the target character string by adopting a JSON technology to obtain a target parameter;
the analysis unit is also used for analyzing the target parameters by adopting a shallow semantic analysis method to obtain the target type of the target parameters;
the calling unit is used for calling a row key generation strategy corresponding to the target type from the HBASE library;
a generating unit, configured to generate a target row key of the target parameter in the HBASE library based on the row key generation policy;
the acquisition unit is further used for acquiring a target column name from the target parameter;
the generating unit is further used for generating an intermediate data file according to the target column name and the target row key;
and the writing unit is used for writing the intermediate data file into the HBASE library by adopting a calculation engine to obtain the data to be generated.
According to a preferred embodiment of the present invention, the encapsulation unit is specifically configured to:
calling a target label corresponding to each attribute data in the attribute data from the HBASE library;
and packaging each attribute data and the target tag corresponding to each attribute data by adopting a JSON technology to obtain the target character string.
According to the preferred embodiment of the present invention, the parsing unit parses the target character string by using JSON technology, and obtaining the target parameter includes:
acquiring all target labels in the target character string;
and extracting data corresponding to each target label in all the target labels as the target parameters.
According to a preferred embodiment of the present invention, the generating, by the generating unit, an intermediate data file according to the target column name and the target row key includes:
calling out target data corresponding to the target column name from the HBASE library;
determining a target position according to the target column name and the target row key;
and writing the target data into the target position to obtain the intermediate data file.
According to a preferred embodiment of the present invention, the generating unit writes the intermediate data file into the HBASE library by using a calculation engine, and obtaining the data to be generated includes:
dividing the intermediate data file into at least one data block according to the file size of the intermediate data file;
calculating the number of the at least one data block to obtain the target number;
creating an HBASE table in the HBASE library;
partitioning the HBASE table by taking the target number as the partition number;
and writing the at least one data block into the partitioned HBASE table by adopting a calculation engine to obtain the data to be generated.
According to the preferred embodiment of the present invention, the obtaining unit is further configured to obtain a table name and a generation time of the data to be generated after the data to be generated is obtained;
the generating unit is further used for generating prompt information according to the table name and the generating time;
the device further comprises:
and the sending unit is used for sending the prompt information to the terminal equipment of the appointed contact person.
According to a preferred embodiment of the invention, the apparatus further comprises:
the encryption unit is used for encrypting the data to be generated by adopting a symmetric encryption technology after the data to be generated is obtained to obtain a ciphertext;
and the display unit is used for displaying the data to be generated when the ciphertext decryption is detected to be successful.
An electronic device, the electronic device comprising:
a memory storing at least one instruction; and
a processor executing instructions stored in the memory to implement the data generation method.
A computer-readable storage medium having stored therein at least one instruction, the at least one instruction being executable by a processor in an electronic device to implement the data generation method.
According to the technical scheme, when a data generation instruction is received, the method can acquire a trigger interface of the data generation instruction, recognize all characters on the trigger interface by adopting a character recognition technology, extract at least one keyword from all the characters to obtain attribute data of data to be generated, encapsulate the attribute data by adopting a JSON technology to obtain a target character string, analyze the target character string by adopting the JSON technology to obtain a target parameter, analyze the target parameter by adopting a shallow semantic analysis method to obtain a target type of the target parameter, call a row key generation strategy corresponding to the target type from an HBASE library, generate a target row key of the target parameter in the HBASE library based on the row key generation strategy, acquire a target column name from the target parameter, and according to the target column name and the target row key, and generating an intermediate data file, and writing the intermediate data file into the HBASE library by adopting a calculation engine to obtain the data to be generated, so that not only can a large amount of data be generated based on the HBASE library, but also the generation efficiency of the data can be improved.
Drawings
FIG. 1 is a flow chart of a preferred embodiment of the data generation method of the present invention.
FIG. 2 is a functional block diagram of a preferred embodiment of the data generating apparatus of the present invention.
Fig. 3 is a schematic structural diagram of an electronic device implementing a data generation method according to a preferred embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be described in detail with reference to the accompanying drawings and specific embodiments.
FIG. 1 is a flow chart of a preferred embodiment of the data generation method of the present invention. The order of the steps in the flow chart may be changed and some steps may be omitted according to different needs.
The data generating method is applied to one or more electronic devices, which are devices capable of automatically performing numerical calculation and/or information processing according to preset or stored instructions, and the hardware thereof includes, but is not limited to, a microprocessor, an Application Specific Integrated Circuit (ASIC), a Programmable Gate Array (FPGA), a Digital Signal Processor (DSP), an embedded device, and the like.
The electronic device may be any electronic product capable of performing human-computer interaction with a user, for example, a Personal computer, a tablet computer, a smart phone, a Personal Digital Assistant (PDA), a game machine, an interactive Internet Protocol Television (IPTV), an intelligent wearable device, and the like.
The electronic device may also include a network device and/or a user device. The network device includes, but is not limited to, a single network server, a server group consisting of a plurality of network servers, or a cloud computing (cloud computing) based cloud consisting of a large number of hosts or network servers.
The Network where the electronic device is located includes, but is not limited to, the internet, a wide area Network, a metropolitan area Network, a local area Network, a Virtual Private Network (VPN), and the like.
And S10, when a data generation instruction is received, acquiring a trigger interface of the data generation instruction.
In at least one embodiment of the present invention, the data generation instruction may be triggered by a user, or may be triggered automatically when a certain condition is met, which is not limited by the present invention.
Wherein the satisfying of certain conditions includes, but is not limited to: meeting the preset time, etc.
The preset time may include a certain time point, or include a time period, etc., such as: the preset time may be seven am each day.
In at least one embodiment of the invention, the trigger interface is composed of an input text box and a submit button, and is mainly used for collecting information. The input text box can have various forms according to the user requirement, wherein the content in the input text box can include, but is not limited to: text fields, characters, etc.
And S11, recognizing all characters on the trigger interface by adopting a character recognition technology.
In at least one embodiment of the present invention, all of the words include, but are not limited to: chinese, characters, numbers, etc.
In at least one embodiment of the present invention, the electronic device identifies the trigger interface by using a text recognition technology, so as to obtain all texts on the trigger interface.
The triggering interface comprises an input text box and a submission button.
And S12, extracting at least one keyword from all the characters to obtain attribute data of the data to be generated.
In at least one embodiment of the present invention, the attribute data may include, but is not limited to: row number, table name, column name, file name, etc.
In at least one embodiment of the present invention, the extracting, by the electronic device, at least one keyword from all the texts, and obtaining attribute data of data to be generated includes:
and the electronic equipment adopts a Chinese keyword extraction algorithm based on a high-dimensional clustering technology to extract at least one keyword from all the characters to obtain attribute data of the data to be generated.
Specifically, the electronic equipment performs fast word segmentation on all the characters according to a preset target dictionary to obtain a first word segmentation, calculates a target word frequency of the first word segmentation, determines the first word segmentation with the target word frequency higher than a preset word frequency as an initial keyword, prunes the initial keyword according to a preset small dictionary to obtain a final keyword, and determines the final keyword as attribute data of data to be generated.
Wherein the target dictionary may include common keywords.
Further, the small dictionary may include, but is not limited to, a particle word, stop word, and the like.
And by pruning the initial keywords, the attribute data of the data to be generated is accurately and quickly determined.
S13, adopting JSON (JavaScript Object Notation) technology to encapsulate the attribute data to obtain the target character string.
In at least one embodiment of the present invention, the target string is encapsulated by the attribute data, which is expressed as { tag: attribute data }.
In at least one embodiment of the present invention, the encapsulating, by the electronic device, the attribute data by using a JSON technique, and obtaining the target character string includes:
and the electronic equipment calls a target tag corresponding to each attribute data in the attribute data from the HBASE library, and further packages each attribute data and the target tag corresponding to each attribute data by adopting JSON technology to obtain the target character string.
Wherein the JSON technology is a lightweight data exchange format. The JSON technology adopts a text format completely independent of a programming language to store and represent data, is easy to read and write, is easy to analyze and generate by a machine, and effectively improves the network transmission efficiency.
Further, the HBASE library is a distributed, column-oriented open source database.
For example: the attribute data 1 is a name, the attribute data 2 is a gender, the electronic device retrieves a target tag corresponding to the name as A from an HBASE library, retrieves a target tag corresponding to the gender as B, packages the name and the A, and obtains a target character string of the attribute data 1 as { A: name, and packaging the gender and the B to obtain a target character string of the attribute data 2, wherein the target character string is { B: gender }.
Through the implementation mode, the attribute data is packaged into the target character string, so that the attribute data can be transmitted to a back-end server along with the target character string to perform data processing.
And S14, analyzing the target character string by adopting a JSON technology to obtain a target parameter.
In at least one embodiment of the present invention, the target parameters include, but are not limited to: table name, row number, column name, file name, etc.
In at least one embodiment of the present invention, before the target character string is parsed by using JSON technology to obtain the target parameter, the method further includes:
the electronic equipment transmits the target character string to a back-end server of the electronic equipment by adopting an ajax (asynchronous Javascript And XML) technology.
Among other things, the ajax technique is a technique for creating better, faster, and more interactive Web applications.
Specifically, the electronic device adds the target character string to a post request, and transmits the post request to a back-end server by adopting an ajax technology.
By the embodiment, the target character string from the front-end page of the electronic equipment is received by adopting the ajax technology, data can be exchanged with the back-end server under the condition that the front-end page is not reloaded, and further efficiency is improved.
In at least one embodiment of the present invention, the analyzing the target character string by using JSON technology to obtain the target parameter includes:
the electronic device obtains all target tags in the target character string, and further extracts data corresponding to each target tag in all the target tags as the target parameters.
For example: the target character string of the attribute data 1 is { a: name, and the target character string of the attribute data 2 is { B: gender, the electronic device obtains a target tag a of the attribute data 1 and a target tag B of the attribute data 2, and extracts the target character string { a: name in the name is used as a target parameter, and the target character string { B: gender in gender } as a target parameter.
Through the implementation mode, the target character string can be deserialized into the target parameters needed by the back end rapidly and accurately by adopting the JSON technology.
And S15, analyzing the target parameters by adopting a shallow semantic analysis method to obtain the target type of the target parameters.
In at least one embodiment of the present invention, the electronic device analyzes a semantic relationship between the target parameters by using a shallow semantic analysis method to obtain an application scenario of the target parameters, and further, the electronic device determines a target type of the target parameters according to the application scenario.
Wherein the target types may include, but are not limited to: transaction data type, statistical data type, generic data type, etc.
And S16, calling a row key generation strategy corresponding to the target type from the HBASE library.
In at least one embodiment of the present invention, the row key generation policy includes, but is not limited to, a combination of one or more of the following:
(1) and when the target type is a transaction data type, the row key generation strategy is added in a mode of pressing a timestamp.
(2) And when the target type is a general data type, the row key generation strategy is self-increment according to a sequence.
S17, generating a target row key of the target parameter in the HBASE library based on the row key generation strategy.
In at least one embodiment of the present invention, the electronic device obtains a number of rows from the target parameter, further, the electronic device imports the row key generation policy into the HBASE library, and generates the target row key according to the number of rows and the row key generation policy.
And the number of the target row keys is equal to the number of the rows.
Through the embodiment, the target row key can be generated quickly without manual operation.
And S18, acquiring the target column name from the target parameters.
In at least one embodiment of the present invention, the target column name refers to a name of each column of data, and the target column name is stored in the target parameter parsed by JSON technology, and further, the target column name may include, but is not limited to: name, gender, identification number, date of birth, etc.
In at least one embodiment of the present invention, the obtaining the target column name from the target parameter includes:
and the electronic equipment matches the target parameters with the column names stored in the HBASE library, and further confirms the target parameters matched with the column names as the target column names.
By the embodiment, the target column name can be acquired, and the intermediate data file can be conveniently generated according to the target column name.
And S19, generating an intermediate data file according to the target column name and the target row key.
In at least one embodiment of the invention, the intermediate data file is a set of statements generated in the HBASE library by which at least one datum can be generated having a fixed format.
In at least one embodiment of the present invention, the generating an intermediate data file according to the target column name and the target row key includes:
and the electronic equipment calls out target data corresponding to the target column name from the HBASE library, determines a target position according to the target column name and the target row key, and writes the target data into the target position to obtain an intermediate data file.
Wherein writing the target data into the target location to obtain an intermediate data file specifically includes:
the electronic equipment writes the target data into the target position through statements put < table >, < rowkey >, < family: column >, < value >, repeats the steps until the target position is written into the target data, and collects all the executed statements to generate the intermediate data file.
Although the amount of data stored in the HBASE library is large, the data is messy, and according to the present embodiment, the target data corresponding to the target column name can be retrieved from the HBASE library according to the user's demand, and the intermediate data file is generated.
In at least one embodiment of the invention, after generating the intermediate data file, the method further comprises:
and the electronic equipment sends the intermediate data file to an intermediate data layer, and stores the intermediate data file by using the intermediate data layer.
Wherein the intermediate data layer is a directory which is specially used by the linux system for storing intermediate files.
Through the embodiment, the intermediate data file is stored, so that the intermediate data file can be prevented from being lost due to errors in the process of generating the data to be generated, and further the data generation efficiency is influenced.
And S20, writing the intermediate data file into the HBASE library by adopting a calculation engine to obtain the data to be generated.
In at least one embodiment of the invention, the data to be generated is a set of data having a fixed format and generated based on the HBASE library.
In at least one embodiment of the present invention, the writing, by the computing engine, the intermediate data file into the HBASE library to obtain the data to be generated includes:
according to the file size of the intermediate data file, the electronic device divides the intermediate data file into at least one data block, calculates the number of the at least one data block to obtain a target number, further creates an HBASE table in the HBASE library, partitions the HBASE table by taking the target number as the partition number, and writes the at least one data block into the partitioned HBASE table by adopting a calculation engine to obtain the data to be generated.
Through the embodiment, the calculation engine is adopted, so that the intermediate data file can be quickly written into the HBASE library, and the data generation rate is improved.
In at least one embodiment of the present invention, after obtaining the data to be generated, the method further includes:
the electronic equipment acquires the table name and the generation time of the data to be generated, generates prompt information according to the table name and the generation time, and further sends the prompt information to terminal equipment of a designated contact person.
Wherein, the prompt message may include, but is not limited to: table names of data to be generated, line numbers of the data to be generated, generation time of the data to be generated and the like.
Further, the designated contact may be a trigger for the data generation instruction.
Through the embodiment, the designated contact person can be reminded to check and receive the data after the data to be generated is generated.
In at least one embodiment of the present invention, after obtaining the data to be generated, the method further includes:
and the electronic equipment encrypts the data to be generated by adopting a symmetric encryption technology to obtain a ciphertext, and when the ciphertext decryption success is detected, the data to be generated is displayed.
Through the implementation mode, the data to be generated can be prevented from being tampered, and the safety of the data is improved.
According to the technical scheme, when a data generation instruction is received, the method can acquire a trigger interface of the data generation instruction, recognize all characters on the trigger interface by adopting a character recognition technology, extract at least one keyword from all the characters to obtain attribute data of data to be generated, encapsulate the attribute data by adopting a JSON technology to obtain a target character string, analyze the target character string by adopting the JSON technology to obtain a target parameter, analyze the target parameter by adopting a shallow semantic analysis method to obtain a target type of the target parameter, call a row key generation strategy corresponding to the target type from an HBASE library, generate a target row key of the target parameter in the HBASE library based on the row key generation strategy, acquire a target column name from the target parameter, and according to the target column name and the target row key, and generating an intermediate data file, and writing the intermediate data file into the HBASE library by adopting a calculation engine to obtain the data to be generated, so that not only can a large amount of data be generated based on the HBASE library, but also the generation efficiency of the data can be improved.
Fig. 2 is a functional block diagram of a preferred embodiment of the data generating apparatus according to the present invention. The data generation apparatus 11 includes an acquisition unit 110, a recognition unit 111, an extraction unit 112, a packaging unit 113, an analysis unit 114, a retrieval unit 115, a generation unit 116, a writing unit 117, a transmission unit 118, an encryption unit 119, a display unit 120, a transmission unit 121, and a saving unit 122. The module/unit referred to in the present invention refers to a series of computer program segments that can be executed by the processor 13 and that can perform a fixed function, and that are stored in the memory 12. In the present embodiment, the functions of the modules/units will be described in detail in the following embodiments.
When a data generation instruction is received, the obtaining unit 110 obtains a trigger interface of the data generation instruction.
In at least one embodiment of the present invention, the data generation instruction may be triggered by a user, or may be triggered automatically when a certain condition is met, which is not limited by the present invention.
Wherein the satisfying of certain conditions includes, but is not limited to: meeting the preset time, etc.
The preset time may include a certain time point, or include a time period, etc., such as: the preset time may be seven am each day.
In at least one embodiment of the invention, the trigger interface is composed of an input text box and a submit button, and is mainly used for collecting information. The input text box can have various forms according to the user requirement, wherein the content in the input text box can include, but is not limited to: text fields, characters, etc.
The recognition unit 111 recognizes all the characters on the trigger interface by using a character recognition technology.
In at least one embodiment of the present invention, all of the words include, but are not limited to: chinese, characters, numbers, etc.
In at least one embodiment of the present invention, the identifying unit 111 identifies the trigger interface by using a text recognition technology, so as to obtain all texts on the trigger interface.
The triggering interface comprises an input text box and a submission button.
The extracting unit 112 extracts at least one keyword from all the characters to obtain attribute data of data to be generated.
In at least one embodiment of the present invention, the attribute data may include, but is not limited to: row number, table name, column name, file name, etc.
In at least one embodiment of the present invention, the extracting unit 112 extracts at least one keyword from all the texts, and obtaining the attribute data of the data to be generated includes:
the extracting unit 112 extracts at least one keyword from all the characters by using a chinese keyword extraction algorithm based on a high-dimensional clustering technique, to obtain attribute data of data to be generated.
Specifically, the extracting unit 112 performs fast word segmentation on all the characters according to a preset target dictionary to obtain a first word segmentation, calculates a target word frequency of the first word segmentation, determines the first word segmentation with the target word frequency being greater than a preset word frequency as an initial keyword, and the extracting unit 112 prunes the initial keyword according to a preset small dictionary to obtain a final keyword, and determines the final keyword as attribute data of data to be generated.
Wherein the target dictionary may include common keywords.
Further, the small dictionary may include, but is not limited to, a particle word, stop word, and the like.
And by pruning the initial keywords, the attribute data of the data to be generated is accurately and quickly determined.
The encapsulation unit 113 encapsulates the attribute data by using a JSON (JavaScript Object Notation) technique to obtain a target character string.
In at least one embodiment of the present invention, the target string is encapsulated by the attribute data, which is expressed as { tag: attribute data }.
In at least one embodiment of the present invention, the encapsulating unit 113 encapsulates the attribute data by using JSON technology, and obtaining the target character string includes:
the encapsulation unit 113 calls a target tag corresponding to each attribute data in the attribute data from the HBASE library, and further, the encapsulation unit 113 encapsulates each attribute data and the target tag corresponding to each attribute data by using JSON technology to obtain the target character string.
Wherein the JSON technology is a lightweight data exchange format. The JSON technology adopts a text format completely independent of a programming language to store and represent data, is easy to read and write, is easy to analyze and generate by a machine, and effectively improves the network transmission efficiency.
Further, the HBASE library is a distributed, column-oriented open source database.
For example: the attribute data 1 is a name, the attribute data 2 is a gender, the encapsulation unit 113 retrieves a target tag corresponding to the name as a from the HBASE library, retrieves a target tag corresponding to the gender as B, encapsulates the name and the a, and obtains a target character string of the attribute data 1 as { a: name, and packaging the gender and the B to obtain a target character string of the attribute data 2, wherein the target character string is { B: sex }
Through the implementation mode, the attribute data is packaged into the target character string, so that the attribute data can be transmitted to a back-end server along with the target character string to perform data processing.
The parsing unit 114 parses the target character string by using JSON technology to obtain a target parameter.
In at least one embodiment of the present invention, the target parameters include, but are not limited to: table name, row number, column name, file name, etc.
In at least one embodiment of the present invention, before the target character string is parsed by using JSON technology to obtain the target parameter, the method further includes:
the transmission unit 121 transmits the target character string to a back-end server of the electronic device by using ajax (asynchronous Javascript And xml) technology.
Among other things, the ajax technique is a technique for creating better, faster, and more interactive Web applications.
Specifically, the transmission unit 121 adds the target character string to a post request, and transmits the post request to a backend server by using ajax technology.
Through the implementation mode, the target character string from the front-end page of the electronic equipment is received by adopting the ajax technology, data can be exchanged with the back-end server under the condition that the front-end page is not reloaded, and further efficiency is improved.
In at least one embodiment of the present invention, the parsing unit 114 parses the target character string by using JSON technology, and obtaining the target parameter includes:
the parsing unit 114 obtains all target tags in the target character string, and further, the parsing unit 114 extracts data corresponding to each target tag in all target tags as the target parameter.
For example: the target character string of the attribute data 1 is { a: name, and the target character string of the attribute data 2 is { B: gender, the analysis unit 114 obtains a target tag a of the attribute data 1 and a target tag B of the attribute data 2, and extracts the target character string { a: name in the name is used as a target parameter, and the target character string { B: gender in gender } as a target parameter.
Through the implementation mode, the target character string can be deserialized into the target parameters needed by the back end rapidly and accurately by adopting the JSON technology.
The parsing unit 114 parses the target parameter by using a shallow semantic analysis method to obtain a target type of the target parameter.
In at least one embodiment of the present invention, the parsing unit 114 parses the semantic relationship between the target parameters by using a shallow semantic analysis method to obtain an application scenario of the target parameters, and further, the parsing unit 114 determines a target type of the target parameters according to the application scenario.
Wherein the target types may include, but are not limited to: transaction data type, statistical data type, generic data type, etc.
The retrieval unit 115 retrieves a row key generation policy corresponding to the target type from the HBASE library.
In at least one embodiment of the present invention, the row key generation policy includes, but is not limited to, a combination of one or more of the following:
(1) and when the target type is a transaction data type, the row key generation strategy is added in a mode of pressing a timestamp.
(2) And when the target type is a general data type, the row key generation strategy is self-increment according to a sequence.
The generation unit 116 generates a target row key of the target parameter in the HBASE library based on the row key generation strategy.
In at least one embodiment of the present invention, the generating unit 116 obtains a number of rows from the target parameter, further, the generating unit 116 imports the row key generation policy into the hbsase library, and generates the target row key according to the number of rows and the row key generation policy.
And the number of the target row keys is equal to the number of the rows.
Through the embodiment, the target row key can be generated quickly without manual operation.
The acquisition unit 110 acquires a target column name from the target parameter.
In at least one embodiment of the present invention, the target column name refers to a name of each column of data, and the target column name is stored in the target parameter parsed by JSON technology, and further, the target column name may include, but is not limited to: name, gender, identification number, date of birth, etc.
In at least one embodiment of the present invention, the obtaining unit 110 obtains the target column name from the target parameter, where the obtaining unit includes:
the obtaining unit 110 matches the target parameter with a column name stored in the HBASE library, and further, the obtaining unit 110 confirms the target parameter matching the column name as the target column name.
By the embodiment, the target column name can be acquired, and the intermediate data file can be conveniently generated according to the target column name.
The generating unit 116 generates an intermediate data file according to the target column name and the target row key.
In at least one embodiment of the invention, the intermediate data file is a set of statements generated in the HBASE library by which at least one datum can be generated having a fixed format.
In at least one embodiment of the present invention, the generating unit 116 generates an intermediate data file according to the target column name and the target row key, including:
the generation unit 116 retrieves target data corresponding to the target column name from the HBASE library, determines a target location according to the target column name and the target row key, and writes the target data into the target location to obtain an intermediate data file.
The writing, by the generating unit 116, the target data into the target position to obtain the intermediate data file specifically includes:
the generating unit 116 writes the target data into the target location through statements put < table >, < rowkey >, < family: column >, < value >, repeats the above steps until the target location is written into the target data, and generates the intermediate data file by aggregating all the executed statements.
Although the amount of data stored in the HBASE library is large, the data is messy, and according to the present embodiment, the target data corresponding to the target column name can be retrieved from the HBASE library according to the user's demand, and the intermediate data file is generated.
In at least one embodiment of the invention, after generating the intermediate data file, the method further comprises:
the sending unit 118 sends the intermediate data file to an intermediate data layer, and further, the saving unit 122 saves the intermediate data file by using the intermediate data layer.
Wherein the intermediate data layer is a directory which is specially used by the linux system for storing intermediate files.
Through the embodiment, the intermediate data file is stored, so that the intermediate data file can be prevented from being lost due to errors in the process of generating the data to be generated, and further the data generation efficiency is influenced.
The writing unit 117 writes the intermediate data file into the HBASE library by using a calculation engine to obtain the data to be generated.
In at least one embodiment of the invention, the data to be generated is a set of data having a fixed format and generated based on the HBASE library.
In at least one embodiment of the present invention, the writing unit 117 writes the intermediate data file into the HBASE library by using a computing engine, and obtaining the data to be generated includes:
according to the file size of the intermediate data file, the write-in unit 117 divides the intermediate data file into at least one data block, calculates the number of the at least one data block, obtains a target number, further, the write-in unit 117 creates an HBASE table in the HBASE library, partitions the HBASE table by taking the target number as the partition number, and writes the at least one data block into the partitioned HBASE table by adopting a calculation engine, so as to obtain the data to be generated.
Through the embodiment, the calculation engine is adopted, so that the intermediate data file can be quickly written into the HBASE library, and the data generation rate is improved.
In at least one embodiment of the present invention, after obtaining the data to be generated, the method further includes:
the obtaining unit 110 obtains a table name and a generation time of the data to be generated, the generating unit 116 generates a prompt message according to the table name and the generation time, and further, the sending unit 118 sends the prompt message to a terminal device of a designated contact.
Wherein, the prompt message may include, but is not limited to: table names of data to be generated, line numbers of the data to be generated, generation time of the data to be generated and the like.
Further, the designated contact may be a trigger for the data generation instruction.
Through the embodiment, the designated contact person can be reminded to check and receive the data after the data to be generated is generated.
In at least one embodiment of the present invention, after obtaining the data to be generated, the method further includes:
the encryption unit 119 encrypts the data to be generated by using a symmetric encryption technique to obtain a ciphertext, and further, when it is detected that the ciphertext is decrypted successfully, the display unit 120 displays the data to be generated.
Through the implementation mode, the data to be generated can be prevented from being tampered, and the safety of the data is improved.
According to the technical scheme, when a data generation instruction is received, the method can acquire a trigger interface of the data generation instruction, recognize all characters on the trigger interface by adopting a character recognition technology, extract at least one keyword from all the characters to obtain attribute data of data to be generated, encapsulate the attribute data by adopting a JSON technology to obtain a target character string, analyze the target character string by adopting the JSON technology to obtain a target parameter, analyze the target parameter by adopting a shallow semantic analysis method to obtain a target type of the target parameter, call a row key generation strategy corresponding to the target type from an HBASE library, generate a target row key of the target parameter in the HBASE library based on the row key generation strategy, acquire a target column name from the target parameter, and according to the target column name and the target row key, and generating an intermediate data file, and writing the intermediate data file into the HBASE library by adopting a calculation engine to obtain the data to be generated, so that not only can a large amount of data be generated based on the HBASE library, but also the generation efficiency of the data can be improved.
Fig. 3 is a schematic structural diagram of an electronic device according to a preferred embodiment of the present invention.
In one embodiment of the invention, the electronic device 1 comprises, but is not limited to, a memory 12, a processor 13, and a computer program, such as a HBASE library-based data generation program, stored in the memory 12 and executable on the processor 13.
It will be appreciated by a person skilled in the art that the schematic diagram is only an example of the electronic device 1 and does not constitute a limitation of the electronic device 1, and that it may comprise more or less components than shown, or some components may be combined, or different components, e.g. the electronic device 1 may further comprise an input output device, a network access device, a bus, etc.
The Processor 13 may be a Central Processing Unit (CPU), other general purpose Processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other Programmable logic device, discrete Gate or transistor logic device, discrete hardware component, etc. The processor 13 is an operation core and a control center of the electronic device 1, and is connected to each part of the whole electronic device 1 by various interfaces and lines, and executes an operating system of the electronic device 1 and various installed application programs, program codes, and the like.
The processor 13 executes an operating system of the electronic device 1 and various installed application programs. The processor 13 executes the application program to implement the steps in the above-described respective data generation method embodiments, such as steps S10, S11, S12, S13, S14, S15, S16, S17, S18, S19, and S20 shown in fig. 1.
Alternatively, the processor 13, when executing the computer program, implements the functions of the modules/units in the above device embodiments, for example: when a data generation instruction is received, acquiring a trigger interface of the data generation instruction; recognizing all characters on the trigger interface by adopting a character recognition technology; extracting at least one keyword from all the characters to obtain attribute data of data to be generated; packaging the attribute data by adopting a JSON technology to obtain a target character string; analyzing the target character string by adopting a JSON technology to obtain a target parameter; analyzing the target parameter by adopting a shallow semantic analysis method to obtain a target type of the target parameter; calling a row key generation strategy corresponding to the target type from an HBASE library; generating a target row key of the target parameter in the HBASE library based on the row key generation strategy; acquiring a target column name from the target parameter; generating an intermediate data file according to the target column name and the target row key; and writing the intermediate data file into the HBASE library by adopting a calculation engine to obtain the data to be generated.
Illustratively, the computer program may be divided into one or more modules/units, which are stored in the memory 12 and executed by the processor 13 to accomplish the present invention. The one or more modules/units may be a series of computer program instruction segments capable of performing specific functions, which are used to describe the execution process of the computer program in the electronic device 1. For example, the computer program may be divided into an acquisition unit 110, a recognition unit 111, an extraction unit 112, a packaging unit 113, a parsing unit 114, a calling unit 115, a generation unit 116, a writing unit 117, a transmission unit 118, an encryption unit 119, a display unit 120, a transmission unit 121, and a saving unit 122.
The memory 12 can be used for storing the computer programs and/or modules, and the processor 13 implements various functions of the electronic device 1 by running or executing the computer programs and/or modules stored in the memory 12 and calling data stored in the memory 12. The memory 12 may mainly include a program storage area and a data storage area, wherein the program storage area may store an operating system, an application program required by at least one function (such as a sound playing function, an image playing function, etc.), and the like; the storage data area may store data (such as audio data, etc.) created according to the use of the electronic device, etc. Further, the memory 12 may include a non-volatile memory, such as a hard disk, a memory, a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), at least one magnetic disk storage device, a Flash memory device, or other non-volatile solid state storage device.
The memory 12 may be an external memory and/or an internal memory of the electronic device 1. Further, the memory 12 may be a circuit with a memory function, such as a FIFO (First InFirst Out), which is not in physical form in an integrated circuit. Alternatively, the memory 12 may be a memory in a physical form, such as a memory stick, a TF Card (Trans-flash Card), or the like.
The integrated modules/units of the electronic device 1 may be stored in a computer-readable storage medium if they are implemented in the form of software functional units and sold or used as separate products. Based on such understanding, all or part of the flow of the method according to the embodiments of the present invention may also be implemented by a computer program, which may be stored in a computer-readable storage medium, and when the computer program is executed by a processor, the steps of the method embodiments may be implemented.
Wherein the computer program comprises computer program code, which may be in the form of source code, object code, an executable file or some intermediate form, etc. The computer-readable medium may include: any entity or device capable of carrying the computer program code, recording medium, U disk, removable hard disk, magnetic disk, optical disk, computer Memory, Read-Only Memory (ROM), or the like.
In conjunction with fig. 1, the memory 12 in the electronic device 1 stores a plurality of instructions to implement a data generation method, and the processor 13 can execute the plurality of instructions to implement: when a data generation instruction is received, acquiring a trigger interface of the data generation instruction; recognizing all characters on the trigger interface by adopting a character recognition technology; extracting at least one keyword from all the characters to obtain attribute data of data to be generated; packaging the attribute data by adopting a JSON technology to obtain a target character string; analyzing the target character string by adopting a JSON technology to obtain a target parameter; analyzing the target parameter by adopting a shallow semantic analysis method to obtain a target type of the target parameter; calling a row key generation strategy corresponding to the target type from an HBASE library; generating a target row key of the target parameter in the HBASE library based on the row key generation strategy; acquiring a target column name from the target parameter; generating an intermediate data file according to the target column name and the target row key; and writing the intermediate data file into the HBASE library by adopting a calculation engine to obtain the data to be generated.
Specifically, the processor 13 may refer to the description of the relevant steps in the embodiment corresponding to fig. 1 for a specific implementation method of the instruction, which is not described herein again.
In the embodiments provided in the present invention, it should be understood that the disclosed system, apparatus and method may be implemented in other ways. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the modules is only one logical functional division, and other divisions may be realized in practice.
The modules described as separate parts may or may not be physically separate, and parts displayed as modules may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment.
In addition, functional modules in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, or in a form of hardware plus a software functional module.
It will be evident to those skilled in the art that the invention is not limited to the details of the foregoing illustrative embodiments, and that the present invention may be embodied in other specific forms without departing from the spirit or essential attributes thereof.
The present embodiments are therefore to be considered in all respects as illustrative and not restrictive, the scope of the invention being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein. Any reference signs in the claims shall not be construed as limiting the claim concerned.
Furthermore, it is obvious that the word "comprising" does not exclude other elements or steps, and the singular does not exclude the plural. A plurality of units or means recited in the system claims may also be implemented by one unit or means in software or hardware. The terms second, etc. are used to denote names, but not any particular order.
Finally, it should be noted that the above embodiments are only for illustrating the technical solutions of the present invention and not for limiting, and although the present invention is described in detail with reference to the preferred embodiments, it should be understood by those skilled in the art that modifications or equivalent substitutions may be made on the technical solutions of the present invention without departing from the spirit and scope of the technical solutions of the present invention.

Claims (10)

1. A method of data generation, the method comprising:
when a data generation instruction is received, acquiring a trigger interface of the data generation instruction;
recognizing all characters on the trigger interface by adopting a character recognition technology;
extracting at least one keyword from all the characters to obtain attribute data of data to be generated;
packaging the attribute data by adopting a JSON technology to obtain a target character string;
analyzing the target character string by adopting a JSON technology to obtain a target parameter;
analyzing the target parameter by adopting a shallow semantic analysis method to obtain a target type of the target parameter;
calling a row key generation strategy corresponding to the target type from an HBASE library;
generating a target row key of the target parameter in the HBASE library based on the row key generation strategy;
acquiring a target column name from the target parameter;
generating an intermediate data file according to the target column name and the target row key;
and writing the intermediate data file into the HBASE library by adopting a calculation engine to obtain the data to be generated.
2. The data generation method of claim 1, wherein the encapsulating the attribute data using JSON technology to obtain a target string comprises:
calling a target label corresponding to each attribute data in the attribute data from the HBASE library;
and packaging each attribute data and the target tag corresponding to each attribute data by adopting a JSON technology to obtain the target character string.
3. The data generation method of claim 1, wherein the parsing the target string using JSON technology to obtain target parameters comprises:
acquiring all target labels in the target character string;
and extracting data corresponding to each target label in all the target labels as the target parameters.
4. The data generating method of claim 1, wherein generating an intermediate data file according to the target column name and the target row key comprises:
calling out target data corresponding to the target column name from the HBASE library;
determining a target position according to the target column name and the target row key;
and writing the target data into the target position to obtain the intermediate data file.
5. The data generating method of claim 1, wherein the writing the intermediate data file into the HBASE library using a compute engine to obtain the data to be generated comprises:
dividing the intermediate data file into at least one data block according to the file size of the intermediate data file;
calculating the number of the at least one data block to obtain the target number;
creating an HBASE table in the HBASE library;
partitioning the HBASE table by taking the target number as the partition number;
and writing the at least one data block into the partitioned HBASE table by adopting a calculation engine to obtain the data to be generated.
6. The data generation method of claim 1, wherein after obtaining the data to be generated, the method further comprises:
acquiring the table name and the generation time of the data to be generated;
generating prompt information according to the table name and the generation time;
and sending the prompt message to the terminal equipment of the appointed contact person.
7. The data generation method of claim 1, wherein after obtaining the data to be generated, the method further comprises:
encrypting the data to be generated by adopting a symmetric encryption technology to obtain a ciphertext;
and when the ciphertext decryption is detected to be successful, displaying the data to be generated.
8. An apparatus for generating data, the apparatus comprising:
the acquisition unit is used for acquiring a trigger interface of the data generation instruction when the data generation instruction is received;
the identification unit is used for identifying all characters on the trigger interface by adopting a character identification technology;
the extraction unit is used for extracting at least one keyword from all the characters to obtain attribute data of data to be generated;
the packaging unit is used for packaging the attribute data by adopting JSON technology to obtain a target character string;
the analysis unit is used for analyzing the target character string by adopting a JSON technology to obtain a target parameter;
the analysis unit is also used for analyzing the target parameters by adopting a shallow semantic analysis method to obtain the target type of the target parameters;
the calling unit is used for calling a row key generation strategy corresponding to the target type from the HBASE library;
a generating unit, configured to generate a target row key of the target parameter in the HBASE library based on the row key generation policy;
the acquisition unit is further used for acquiring a target column name from the target parameter;
the generating unit is further used for generating an intermediate data file according to the target column name and the target row key;
and the writing unit is used for writing the intermediate data file into the HBASE library by adopting a calculation engine to obtain the data to be generated.
9. An electronic device, characterized in that the electronic device comprises:
a memory storing at least one instruction; and
a processor executing instructions stored in the memory to implement a data generation method as claimed in any one of claims 1 to 7.
10. A computer-readable storage medium characterized by: the computer-readable storage medium has stored therein at least one instruction that is executed by a processor in an electronic device to implement the data generation method of any one of claims 1 to 7.
CN201911174761.8A 2019-11-26 2019-11-26 Data generation method and device, electronic equipment and storage medium Pending CN111159386A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201911174761.8A CN111159386A (en) 2019-11-26 2019-11-26 Data generation method and device, electronic equipment and storage medium
PCT/CN2020/086799 WO2021103409A1 (en) 2019-11-26 2020-04-24 Data generation method and apparatus, electronic device and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911174761.8A CN111159386A (en) 2019-11-26 2019-11-26 Data generation method and device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN111159386A true CN111159386A (en) 2020-05-15

Family

ID=70556136

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911174761.8A Pending CN111159386A (en) 2019-11-26 2019-11-26 Data generation method and device, electronic equipment and storage medium

Country Status (2)

Country Link
CN (1) CN111159386A (en)
WO (1) WO2021103409A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113268761A (en) * 2021-07-20 2021-08-17 北京国电通网络技术有限公司 Information encryption method and device, electronic equipment and computer readable medium

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103001824B (en) * 2012-11-14 2015-10-07 北京奇虎科技有限公司 A kind of supervisory control system and method for supervising monitoring multiple servers
CN104239313B (en) * 2013-06-09 2019-07-30 百度在线网络技术(北京)有限公司 The searching method and mobile terminal of text are shown in screen based on mobile terminal
CN107977448A (en) * 2017-12-12 2018-05-01 北京小米移动软件有限公司 The method and apparatus for loading multi-data source data

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113268761A (en) * 2021-07-20 2021-08-17 北京国电通网络技术有限公司 Information encryption method and device, electronic equipment and computer readable medium
CN113268761B (en) * 2021-07-20 2021-09-24 北京国电通网络技术有限公司 Information encryption method and device, electronic equipment and computer readable medium

Also Published As

Publication number Publication date
WO2021103409A1 (en) 2021-06-03

Similar Documents

Publication Publication Date Title
CN111694840B (en) Data synchronization method, device, server and storage medium
CN108959257B (en) Natural language parsing method, device, server and storage medium
CN111797351A (en) Page data management method and device, electronic equipment and medium
CN111459967A (en) Structured query statement generation method and device, electronic equipment and medium
CN111639284A (en) Webpage labeling method and device, electronic equipment and medium
CN111538816B (en) Question-answering method, device, electronic equipment and medium based on AI identification
CN111796809A (en) Interface document generation method and device, electronic equipment and medium
CN111158795A (en) Report generation method, device, medium and electronic equipment
CN110472154A (en) A kind of resource supplying method, apparatus, electronic equipment and readable storage medium storing program for executing
CN112015900A (en) Medical attribute knowledge graph construction method, device, equipment and medium
CN111985241A (en) Medical information query method, device, electronic equipment and medium
CN112711398A (en) Method, device and equipment for generating buried point file and storage medium
US20190147104A1 (en) Method and apparatus for constructing artificial intelligence application
CN112948418A (en) Dynamic query method, device, equipment and storage medium
CN112416583A (en) High-frequency request processing method and device, electronic equipment and storage medium
CN111625567A (en) Data model matching method, device, computer system and readable storage medium
CN114116108A (en) Dynamic rendering method, device, equipment and storage medium
CN111159386A (en) Data generation method and device, electronic equipment and storage medium
CN113268597A (en) Text classification method, device, equipment and storage medium
CN113886204A (en) User behavior data collection method and device, electronic equipment and readable storage medium
CN111986771A (en) Medical prescription query method and device, electronic equipment and storage medium
CN112434062A (en) Quasi-real-time data processing method, device, server and storage medium
CN111061975B (en) Method and device for processing irrelevant content in page
US11676410B1 (en) Latent space encoding of text for named entity recognition
CN116360769A (en) Code generation method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination