WO2021082463A1 - Data processing method and apparatus, electronic device and storage medium - Google Patents
Data processing method and apparatus, electronic device and storage medium Download PDFInfo
- Publication number
- WO2021082463A1 WO2021082463A1 PCT/CN2020/095955 CN2020095955W WO2021082463A1 WO 2021082463 A1 WO2021082463 A1 WO 2021082463A1 CN 2020095955 W CN2020095955 W CN 2020095955W WO 2021082463 A1 WO2021082463 A1 WO 2021082463A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sample
- list
- description information
- reading
- samples
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/31—Indexing; Data structures therefor; Storage structures
- G06F16/316—Indexing structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
Definitions
- the difficulty of reading data will increase.
- the storage of data sets is usually to adapt to the structure of the data set itself.
- the data storage method of the Da taFrame type in the pandas library of the python language is powerful, but the supported data types are limited, and it is mainly for professional users.
- the read operation mainly focuses on the data itself.
- the embodiments of the present disclosure propose a data processing method and device, electronic equipment, and storage medium.
- a data processing method including: determining a sample list including a plurality of samples according to a field of a sample and a parameter value corresponding to the field; The data type is used to determine the description information corresponding to the sample list; and the sample set is determined according to the sample list and the description information.
- the sample set used to store each sample and the corresponding description information can be determined, multiple types of samples can be stored in a unified manner, and the sample set includes description information describing the data type, It is convenient to store and read sample data, can support multiple types of data storage and read, and is easy to understand.
- each sample has a sample index
- the determining a sample list including multiple samples according to the field of the sample and the parameter value corresponding to the field includes: determining the field as all the samples.
- the column coordinates of the sample list, the sample index is determined as the row coordinates of the sample list;
- the parameter value corresponding to the field is objectified to obtain the object corresponding to the sample;
- the data in the object Add to the storage area corresponding to the column coordinates and the row coordinates to obtain the sample list.
- the parameter values and/or annotations of the samples can be objectified, and samples of multiple data types can be stored through the sample list, which is conducive to the rapid storage and reading of samples, and reduces the resource occupation during storage and reading.
- And can support multiple types of data storage, adapt to the storage of more complex types of data sets.
- the description information includes description information of how to read the sample, and the description information corresponding to the sample list is determined according to the data type of the sample in the sample list, It includes: determining a reading method for reading the sample according to the data type of the sample in the sample list; obtaining the description information according to the data type of the sample and the corresponding reading method.
- a variety of sample reading methods can be determined, which is conducive to supporting multiple types of data storage and access, and adapts to the storage and access of more complex types of data sets.
- the description information further includes description information of a storage location of the sample, and the storage location includes a storage path of the sample set and an index position of the sample in the sample set. In this way, the precise location of the sample can be determined, making it easy to read the sample.
- the method further includes: determining index range information used to classify the types of samples in the sample set; generating information related to the sample set based on the description information of the samples in the sample set and the index range information. The description information corresponding to the set. In this way, the index range of the sample can be determined, which is convenient for searching the sample.
- the method further includes: reading parameter values of samples in the sample list according to the description information and the sample list.
- the reading the parameter values of the samples in the sample list according to the description information and the sample list includes: obtaining objects in the sample list; according to the sample index, from Among the acquired objects, the first object of the first sample corresponding to the sample index is obtained; the data type of the first sample and the corresponding reading method are determined according to the description information; according to the reading In this way, the parameter value of the first sample is read from the first object.
- the resource consumption when reading samples can be reduced, and the sample data can be stored in a unified way, and the sample data type can be stored using the sample description information, so that the data reading method can be easily obtained according to the description information, which is easy to read
- the samples of multiple data types are conducive to quick reading of the samples.
- the reading the parameter values of the samples in the sample list according to the description information and the sample list includes: obtaining a sample index; according to the sample index, from the sample The parameter value of the sample corresponding to the sample index is obtained from the list.
- the reading the parameter values of the samples in the sample list according to the description information and the sample list includes: obtaining the object corresponding to the target field in the sample list;
- the description information determines the data type and reading method of the second sample corresponding to the acquired object; according to the reading method, the parameter value of the second sample is read from the acquired object.
- the parameter value of one or more samples in the field is read by accessing the field, which improves the flexibility of sample reading and facilitates the rapid reading of samples.
- the reading the parameter values of the samples in the sample list according to the description information and the sample list includes: obtaining the field name of the target field; according to the field name, from Obtain the parameter value of the target field corresponding to the field name from the sample list.
- the method further includes: performing at least one operation of query, modification, and deletion processing on the parameter value of the read sample.
- a data processing device including: a first determining part configured to determine a sample list including a plurality of samples according to a field of the sample and a parameter value corresponding to the field; The second determining part is configured to determine the description information corresponding to the sample list according to the data type of the samples in the sample list; the third determining part is configured to determine according to the sample list and the description information Sample set.
- each sample has a sample index
- the first determining part is further configured to: determine the field as the column coordinates of the sample list, and determine the sample index as the The row coordinates of the sample list; object the parameter values corresponding to the fields to obtain the object corresponding to the sample; add the data in the object to the storage area corresponding to the column coordinates and the row coordinates , To obtain the sample list.
- the description information includes description information of how to read the sample
- the second determining part is further configured to: determine a sample for reading according to the data type of the sample in the sample list. The method of reading; the description information is obtained according to the data type of the sample and the corresponding reading method.
- the description information further includes description information of a storage location of the sample, and the storage location includes a storage path of the sample set and an index position of the sample in the sample set.
- the device further includes: a fourth determining part configured to determine index range information used to classify samples in the sample set; and the generating part configured to determine the index range information of the samples in the sample set according to The description information and the index range information generate description information corresponding to the sample set.
- the device further includes: a reading part configured to read parameter values of samples in the sample list according to the description information and the sample list.
- the reading part is further configured to: obtain objects in the sample list; according to the sample index, obtain the first object corresponding to the sample index from the obtained objects.
- the reading part is further configured to: obtain a sample index; according to the sample index, obtain the parameter value of the sample corresponding to the sample index from the sample list.
- the reading part is further configured to: obtain the object corresponding to the target field in the sample list; determine the data type and the data type of the second sample corresponding to the obtained object according to the description information. Reading mode; according to the reading mode, the parameter value of the second sample is read from the obtained object.
- the reading part is further configured to: obtain the field name of the target field; according to the field name, obtain the parameter value of the target field corresponding to the field name from the sample list.
- the device further includes: an operation part configured to perform at least one of query, modification, and deletion of the parameter value of the read sample.
- an electronic device including: a processor; a memory configured to store executable instructions of the processor; wherein the processor is configured to execute the above-mentioned data processing method.
- a computer-readable storage medium having computer program instructions stored thereon, and when the computer program instructions are executed by a processor, the foregoing data processing method is implemented.
- a computer program including computer-readable code, when the computer-readable code runs in an electronic device, the processor in the electronic device executes the above-mentioned data processing method.
- Fig. 1 shows a flowchart of data processing according to an embodiment of the present disclosure
- Fig. 2 shows a schematic diagram of an exemplary sample list according to an embodiment of the present disclosure
- Fig. 3 shows an application schematic diagram of an exemplary storage method according to an embodiment of the present disclosure
- Fig. 4 shows a block diagram of a data processing device according to an embodiment of the present disclosure
- Figure 5 shows a block diagram of an electronic device according to an embodiment of the present disclosure
- Fig. 6 shows a block diagram of an electronic device according to an embodiment of the present disclosure.
- Fig. 1 shows a flowchart of a data processing method according to an embodiment of the present disclosure. As shown in Fig. 1, the method includes:
- S11 Determine a sample list including multiple samples according to the fields of the samples and the parameter values corresponding to the fields;
- S12 Determine description information corresponding to the sample list according to the data type of the sample in the sample list;
- the sample set used to store each sample and the corresponding description information can be determined, multiple types of samples can be stored in a unified manner, and the sample set includes description information describing the data type, It is convenient to store and read sample data, can support multiple types of data storage and read, and is easy to understand.
- the execution subject of the data processing method may be a terminal device or a server or other processing equipment, where the terminal device may be a user equipment (User Equipment, UE), a mobile device, a user terminal, or a terminal. , Cellular phones, cordless phones, personal digital assistants (PDAs), handheld devices, computing devices, in-vehicle devices, wearable devices, etc.
- the method may be implemented by a processor invoking computer-readable instructions stored in a memory or executed by a server.
- data in the server may be stored by the storage method.
- the data type of the sample may include multiple types, such as image, video, and/or ordinary numerical value or text data, etc.
- the embodiment of the present disclosure does not limit the data type of the sample.
- the sample may include one or more fields, and each field may correspond to a certain feature or a certain type of characteristic of the sample.
- a certain sample is the profile data of a certain person, and the fields of the sample may include the height and weight of the person , Education, nationality, etc., and the sample can have data corresponding to each field.
- the embodiments of the present disclosure do not impose restrictions on the fields.
- the multiple samples may be stored in a datatable, and the storage in a list manner is convenient for operations such as data addition, deletion, and modification.
- each sample has a sample index
- the sample index may include the number or name corresponding to the sample
- S11 may include: determining the field as the column coordinates of the sample list, The sample index is determined as the row coordinates of the sample list; the parameter values of the sample are objectified to obtain the object corresponding to the sample; the object is added to the column coordinates and the row coordinates.
- the storage area corresponding to the coordinates is used to obtain the sample list.
- FIG. 2 shows a schematic diagram of an exemplary sample list according to an embodiment of the present disclosure.
- the list may use sample indexes as row coordinates and fields as column coordinates.
- the fields of each sample can be summarized as the column coordinates of the sample list.
- the sample can be a tuple, such as a two-dimensional tuple, and a data group composed of data in multiple fields.
- a sample for example, sample 1 is the profile data of a certain person, and the fields of the sample may include the person's height (for example, field 1), weight (for example, field 2), education (for example, field 3), etc.
- the other sample is a face image of a person.
- the fields of the sample can include the person’s identity information, etc., and the fields of each sample can be summarized.
- the column coordinates of the sample list can include height, weight, education, nationality, and identity. Information and other fields.
- Each field corresponds to a column of data in the sample list. In some rows of the column of data, the data can be empty.
- the list of samples includes the "nationality" field. In a sample of a person’s profile data, this field can contain data, such as "China", "United States", etc., while in a face image sample, this field There may be no data, that is, in the sample list, the data is empty in the row where the face image sample is located and the area corresponding to the column where the "nationality" field is located.
- the sample index of the sample can be determined as the row coordinates of the sample list.
- each sample can have a sample index, for example, sample 1, sample 2, sample 3, etc., and the sample index can be Determined as the row coordinates of the sample list.
- the samples may be training samples used when training a neural network, each sample may have a label, and the label may be data describing a field of the sample.
- the sample may include a face image sample.
- the neural network compares each face image sample with a reference image (for example, a face image of a certain target person), and determines the person The similarity between the face image sample and the reference image.
- the similarity is greater than or equal to the similarity threshold (for example, 0.5), it can be determined that the face image sample has a high similarity with the reference image and the identity verification is passed; otherwise, the identity verification fails.
- each face image sample When training the neural network, it is necessary to label each face image sample to provide parameters for the loss function of the neural network. For example, if a face image sample is the face image of the target person, it can be marked as 1, which means that the face in the face image sample is the face of the target person. If the face image sample is not the target person's face The face image can be marked as 0, which means that the face in the face image sample is not the face of the target person.
- the sample may have parameter values and annotations of each field. For example, the field of a certain face image sample is the identity information (for example, name or number, etc.) of the target object in the image, which is marked as 0 or 1.
- the label and parameter value of each field in the sample can be objectified.
- a storage area corresponding to the sample can be set up.
- a storage space can be set up in the memory, and the label of each field in the sample can be set up.
- the sum parameter value is stored in the storage area, and the object corresponding to the sample is obtained for easy storage and reading.
- the storage area storing the data (label and parameter value) of the field is the object.
- the data in the object corresponding to the sample can be added to the sample list, or the object can be associated with the sample list to initialize the sample list.
- the storage area in which the data of the field is stored may correspond to the storage area in the sample list (for example, a certain cell in the sample list).
- the data in the object corresponding to sample 1 can be added to the first row of the sample list, that is, the data of each field in the object corresponding to sample 1 can be added to the corresponding field in the first row
- the storage area for example, add height data in the storage area where the row coordinates are the first row and the column coordinates are the "height" field, and the weight data is added in the row coordinates of the first row and the column coordinates are the "weight" field.
- Each sample can be objectified in the above-mentioned manner, and the data stored in the object can be filled into the storage area of the sample list to obtain a sample list storing multiple samples.
- the sample list can be initialized according to the sample index. For example, the data of each field of one sample can be added to the corresponding storage area, and then the data of each field of the next sample can be added, that is, Add the data of each sample according to the row direction in the list to initialize the sample list.
- the sample list can be initialized according to the fields, that is, all the data of a field can be added to the corresponding storage area, and then the data of the next field can be added, that is, according to the column direction in the list Add data to initialize the sample list.
- the parameter values and/or annotations of the samples can be objectified, and samples of multiple data types can be stored through the sample list, which is conducive to the rapid storage and reading of samples and reduces the resource occupation during storage and reading. , And can support multiple types of data storage, adapt to the storage of more complex types of data sets.
- the sample in S12, can include multiple data types, and each type of sample can have a specific storage and reading method.
- the description information can be used for each sample. Description for easy access (such as storage or reading) of multiple types of samples.
- the sample may include a video
- the description information may include a description of how the video is stored or read
- the sample may also include an image
- the description information may also include the storage of the image. Or read the description of the way.
- the description information may be description information in a metadata (Meta Data) format, for example, dictionary type metadata, which is convenient for improving the data query speed.
- the embodiment of the present disclosure does not limit the type of the description information.
- the description information includes the description information of the reading method of the sample
- S12 may include: determining the reading method for reading each sample according to the data type of the sample in the sample list ; According to the data type of the sample and the corresponding reading mode, the description information is obtained.
- the description information may be information describing how the sample is read.
- the reading method of each sample can be judged according to the data type of the sample. For example, if the data type of a sample is an image, the sample can be read by the image reading function, and the data type of a sample is video, then The sample can be read with the video reading function, and the reading method of each sample can be determined separately.
- the data type of each sample can also be summarized.
- the sample set includes multiple samples. There are three types of data for the multiple samples, namely image, video, and general data.
- the image type can be determined by the image reading function.
- the sample of the video type is read through the video reading function, and the sample of the common data type is read through the data reading function.
- the embodiment of the present disclosure does not limit the data type and reading method of the sample.
- the description information can be obtained according to the data type of the sample and the corresponding reading method.
- the description information can be configured as dictionary data, for example, a dictionary
- the key of is the data type
- the value of the dictionary is the reading method corresponding to the data type.
- the key of the dictionary is the image
- the value of the dictionary is the image reading function.
- the embodiment of the present disclosure does not limit the data type of the description information.
- the description information may include description information of the storage location of the sample, and the storage location includes the storage path of the sample set and the index position of the sample in the sample set.
- the storage path of the sample set is that the sample set is stored in the "sample set" folder on the C drive of the computer.
- the index position of the sample in the sample set is the specific position of the sample.
- the data of each sample may be stored in the sample set in the form of an object, that is, occupy a certain storage area in the sample set.
- the sample set may store each sample in the form of a list, the data of the sample may occupy a certain row in the sample set, and the index position may indicate the row number.
- the first sample is located in the first row of the list of the sample set, The two samples are located in the second row of the sample set list,...; the field information of the sample can include the storage path of the sample set and the index position of each sample.
- the specific location of each sample can be determined according to the field information, for example, the first The 2 samples are located in the second row of the sample set list in the "sample set" folder in the C drive of the computer. For another example, some samples may only have certain fields, or multiple samples belong to the same type or the same sample file.
- the index position may be the row and column range of the sample in the sample set, or the row and column range of multiple samples. The embodiment of the present disclosure does not limit the storage location.
- the sample set may include samples of multiple data types, such as images, videos, ordinary numbers and/or text data, etc.
- the storage format of the samples may include a character-separated value file (Comma-Separated Value File). Values, CSV) format, hierarchical data format 5 (Hierarchical Data Format 5, HDF5), etc.
- the index position of the sample in the sample set may be the position range of the same type of sample generated after the same type of sample is summarized. For example, a sample file stored in the CSV format will store the index position of the CSV file in the description information.
- the method further includes: determining index range information used to classify the types of samples in the sample set; generating and according to the description information of the samples in the sample set and the index range information of the sample categories. Descriptive information corresponding to the sample set.
- the description information corresponding to the sample set further includes index range information for dividing the type of each sample in the sample set.
- the process of determining the description information corresponding to the sample set in the method further includes: determining index range information used to classify the categories of each sample in the sample set; according to the description information of the samples in the sample set and the index range information of the samples, Generate description information corresponding to the sample set. For example, it can be classified according to the label of the sample.
- the sample set can be divided into a training set, a validation set, and a test set.
- the training set includes 5000 samples, of which 3000 samples are labeled as 1, and 2000 samples are labeled as 0.
- the labels may not be distinguished, and the classification may be directly based on the number and index of the samples in the sample set.
- the samples in the sample set may be training samples used when training a neural network, and the training samples may be divided into multiple categories. For example, Divide 40% of the samples into the training set, divide 30% of the samples into the test set, and divide 30% of the samples into the validation set. Or it can be divided according to the index of each sample, and the index range of each category can be determined.
- the sample set stores each sample in the form of a list, and 10,000 samples are stored in the list.
- the 1st-4000th sample in the list can be divided into the training set, and the 4001th-4th sample in the list can be divided into training set.
- the 7000 samples are divided into the test set, and the 7001th-10000th samples are divided into the verification set.
- the description information may include index range information (for example, 1st-4000th, 4001th-7000th, 7001th-10000th, etc.) for dividing sample categories.
- index range information for example, 1st-4000th, 4001th-7000th, 7001th-10000th, etc.
- the embodiment of the present disclosure does not limit the division method and the index range.
- the sample set may also include other forms of data subsets, such as image subsets and video subsets.
- the description information of the sample set includes not only the sample subsets of the above-mentioned training set, test set, and validation set and the corresponding index range description information, but also the description information of other forms of data subsets in the sample set.
- sample reading methods can be determined, which is conducive to supporting multiple types of data storage and access, and adapts to the storage and access of more complex types of data sets.
- the sample set (dataset) can be obtained according to the sample list and the description information corresponding to the sample list.
- the sample list and the sample list can be combined
- the corresponding description information is written into the sample set to obtain the file of the sample set, or the description information corresponding to the sample list and the sample list are packaged to obtain the sample set.
- the embodiment of the present disclosure does not limit the way of obtaining the sample set.
- the sample set includes a sample list storing multiple samples, and also includes description information that records the reading method of each sample, so as to facilitate the storage and reading of samples of multiple data types. Samples that store multiple data types provide the basis.
- the description information can record the data type of each sample, and the reading method and other information. Based on the description information, the sample or the data of the sample in the sample set can be read.
- the embodiments of the present disclosure do not limit the content of the description information.
- the method further includes: reading parameter values of samples in the sample list according to the description information and the sample list.
- the sample in the sample list can be read in a suitable manner according to the description of the sample reading method in the description information, and the reading process may include: obtaining objects in the sample list According to the sample index, from the acquired objects, obtain the first object of the first sample corresponding to the sample index; determine the data type of the first sample and the corresponding reading method according to the description information ; According to the reading method, the parameter value of the first sample is read from the first object.
- the sample list includes a plurality of samples, and the samples are stored in the sample list in an object format.
- each object in the sample list can be obtained (for example, by accessing the'samples' attribute, that is, accessing the sample object, all objects in the list containing all samples can be obtained).
- the sample index in the sample list can be accessed, and the first sample to be read is selected in the sample index, that is, the first object corresponding to the first sample is selected, for example, the sample is selected through the sample index In the list, the row where the first sample is located.
- the description information may include the reading method of each sample, and the corresponding reading method may be selected in the description information according to the data type of the first sample, for example, the first sample
- the data type of is image
- the data type of the first sample is video
- the parameter value of the first sample can be read in the first object through a reading method corresponding to the data type of the first sample.
- the first sample is information data of a certain person, the first sample may include the person's height data, weight data, educational background data, nationality data, etc.
- the data can be read through the data reading function.
- the data of the first sample is extracted, and the parameter values of the first sample are read through the data reading function.
- the parameter value of one or more fields of the first sample data can also be read.
- the height data and weight data in the data can be selected and read through the data read function. Take the height data and weight data. There is no need to read all the data of the first sample, which improves the flexibility of sample reading.
- reading the parameter values of the samples in the sample list includes: obtaining a sample index; according to the sample index, from the sample list Obtain the parameter value of the sample corresponding to the sample index.
- the sample index can be used to access the sample list, read the sample corresponding to the sample index, and obtain the parameter value of the sample.
- a read instruction input by the user or the sample acquisition device can be received.
- the read instruction carries the sample index of the sample to be read.
- the sample index value can be set to 100, , You can get the sample in the 100th row of the list, that is, get the sample in the 100th row in the list, and read the parameter value of the sample according to the reading method of the sample.
- the parameter value of the sample can be read according to the reading method in the description information, or the data type of the sample can be determined according to the suffix of the sample, and then the corresponding reading method can be selected to read the sample to obtain the sample The parameter value.
- the resource consumption when reading samples can be reduced, and the sample data can be stored in a unified way, and the sample data type can be stored using the sample description information, so that the data reading method can be easily obtained according to the description information, which is easy to read
- the samples of multiple data types are conducive to quick reading of the samples.
- reading the parameter values of the samples in the sample list includes: obtaining the object corresponding to the target field in the sample list; according to the description The information determines the data type and reading method of the second sample corresponding to the acquired object; according to the reading method, the parameter value of the second sample is read from the acquired object.
- the target field in the sample list can be accessed to obtain the object corresponding to the target field.
- the target field can be accessed to obtain the object corresponding to the target field.
- the'fields' attribute that is, accessing the field
- the field list can include "age”, "height" and "nationality", etc., and can also be accessed by keywords in the field To access the corresponding field, for example, you can access the parameter value in the "Nationality” field through the keyword "nationality", or you can directly access the field name or keyword to access the parameter value in the field without accessing the field list.
- each field corresponds to a column of data in the sample list, and in some areas of the column of data, the data may be empty.
- the "nationality” field is included. In a sample of a person's profile data, this field can contain data, such as "China” or "United States", while in a face image sample, this field There is no data, that is, in the list, in the row where the face image sample is located, and the area corresponding to the column where the "nationality" field is located, the data is empty.
- the empty area may be ignored or not. The embodiment of the present disclosure does not limit whether to ignore.
- all the objects in the column of the target field can be obtained, that is, all the storage space for storing the data of the column.
- the reading method of the second sample to be read can be determined according to the description information, and the target field of the second sample can also be determined in the object corresponding to the target field according to the index of the second sample.
- the storage space of the parameter value that is, the object of the parameter value of the target field of the second sample.
- the parameter value of the target field of the second sample may be read through the reading method.
- the second sample is a person's profile data, and the second sample may include the person's height data, weight data, education data, nationality data, and so on.
- the object of the second sample in this field can be determined, and the parameter values in the object of the second sample can be read through the reading method corresponding to the data data.
- the height data of the second sample in the object of the field is read through the data reading function.
- reading the parameter values of the samples in the sample list includes: obtaining the field name of the target field; according to the field name, from the sample list To obtain the parameter value of the target field corresponding to the field name.
- the field name of the target field can be used to access the target field, and then obtain the parameter value of the target field.
- the storage device or processing device can receive a read instruction.
- the read instruction includes the field name of the target field of the sample to be read.
- the target field can be set to "height"
- the parameter value of the "height" field can be accessed through the field name "height”.
- the reading method corresponding to the data type of the field in the description information can be used to read the parameter value of the field, or the data type of the parameter of the field can be determined according to the suffix of the parameter of the field, and then the corresponding reading can be determined.
- any parameter value under the field can also be read, for example, the parameter value stored in the storage location with the index value of 100 under the "height" field can be obtained.
- the parameter value of one or more samples in the field is read by accessing the field, which improves the flexibility of sample reading and facilitates the rapid reading of samples.
- the storage device or data processing device will receive a read instruction, and the read instruction will include the sample data information that needs to be read, such as sample Index and field name, etc., according to the read instruction, the data type, storage location and corresponding reading method of the sample will be obtained from the description information of the sample set, so that the obtained reading method can be used to read the required data in the storage location Sample data. Therefore, a sample set containing description information can conveniently store and read various types of sample data, and the description information can facilitate users to understand and view the data stored in the sample set, which is suitable for scenarios for beginners.
- the method further includes: performing at least one operation of query, modification, and deletion processing on the parameter value of the read sample.
- at least one of processing such as query, modification, and deletion may be performed on the parameter value.
- the parameter value in the object can be queried, and the parameter value can be copied.
- the parameter value can be modified, and the modified parameter value can be saved in the object.
- the parameter value in the object can be deleted, so that the data in the object is empty.
- the embodiment of the present disclosure does not limit the processing manner of the parameter value.
- Fig. 3 shows an application schematic diagram of an exemplary storage method according to an embodiment of the present disclosure.
- multiple samples can be stored in a sample list, and the data types of the multiple samples can include image, audio, Video and/or general data (for example, numerical data, categorical data, and character data), etc.
- Each sample can occupy one row of storage area in the sample list.
- Each sample can include one or more fields.
- a sample is the profile data of a certain person.
- the fields of the sample can include the person’s height, weight, education and nationality, etc.
- the sample can have data corresponding to each field .
- the fields of each sample can be summarized as the column coordinates of the sample list, and the sample index of the sample is determined as the row coordinate of the sample list to obtain an empty sample list.
- the label and parameter value of the sample can be objectified, that is, a storage area corresponding to each parameter value of the sample is set up, and the label and parameter value of the sample are stored in the storage area to obtain The object corresponding to the sample is added, and the label and parameter values in the object are added to the sample list to obtain a sample list storing multiple samples.
- the sample can include multiple data types, and each type of sample can have a specific storage and reading method.
- the description information in the metadata format can be used for each sample. Description, in order to store or read multiple types of samples.
- the data type of each sample in the sample list can be summarized, and the reading method of each data type can be judged.
- the description information is configured as dictionary data.
- the key of the dictionary is the data type
- the value of the dictionary is the reading mode corresponding to the data type.
- the sample list and the description information corresponding to the sample list may be written into the sample set to obtain the file of the sample set. And by accessing the sample index or field to read the sample or a parameter value in the sample.
- each object in the sample list can be obtained, and the sample index in the sample list can be accessed to select the first sample to be read, that is, the first sample corresponding to the first sample is selected
- the first object Further, the corresponding reading method can be selected in the description information.
- the data type of the first sample is an image, and the image reading function in the description information can be selected to read the first sample.
- the first sample can be read through the reading method, or only the parameter value of one or more fields in the first sample can be read.
- the target field in the sample list can be accessed, and all objects corresponding to the target field can be obtained (the empty area in the target field can be ignored).
- the parameter value of the second sample in the target field can be read among all objects corresponding to the target field.
- the data type of the second sample can be determined, and the reading method corresponding to the data type of the second sample can be determined in the description information.
- the object of the second sample in the field can be determined, and the second sample can be read in the object according to the reading method corresponding to the data type of the second sample.
- the parameter value of the field can be read among the objects of all parameter values of the target field, and the second sample can be read in the object according to the reading method corresponding to the data type of the second sample.
- the storage method can be used to store multiple types of samples.
- using this storage method can store multiple types of samples according to a unified storage method.
- the program interface for storage and reading is convenient and concise.
- the storage and reading methods of various samples are unified and easy to understand, which is convenient for students to learn and operate.
- the embodiments of the present disclosure also provide data processing apparatuses, electronic equipment, computer-readable storage media, and programs. All of the above can be used to implement any image processing method provided by the embodiments of the present disclosure. For the corresponding technical solutions and descriptions, refer to the method Part of the corresponding records will not be repeated here.
- the writing order of the steps does not mean a strict execution order but constitutes any limitation on the implementation process.
- the specific execution order of each step should be based on its function and possibility.
- the inner logic is determined.
- Fig. 4 shows a block diagram of a data processing device according to an embodiment of the present disclosure.
- the device includes: a first determining part 11 configured to according to a field of a sample and a parameter value corresponding to the field, Determine a sample list including a plurality of samples; the second determining part 12 is configured to determine the description information corresponding to the sample list according to the data type of the samples in the sample list; the third determining part 13 is configured to determine the description information corresponding to the sample list according to The sample list and the description information determine the sample set.
- each sample has a sample index
- the first determining part is further configured to: determine the field as the column coordinates of the sample list, and determine the sample index of the sample as The row coordinates of the sample list; objectize the parameter value corresponding to the field to obtain the object corresponding to the sample; add the data in the object to the corresponding column coordinates and the row coordinates Storage area to obtain the sample list.
- the description information includes description information of how to read the sample
- the second determining part is further configured to: determine a sample for reading according to the data type of the sample in the sample list. The method of reading; the description information is obtained according to the data type of the sample and the corresponding reading method.
- the description information further includes description information of a storage location of the sample, and the storage location includes a storage path of the sample set and an index position of the sample in the sample set.
- the device further includes: a fourth determining part configured to determine index range information used to classify samples in the sample set; and the generating part configured to determine the index range information of the samples in the sample set according to The description information and the index range information of the sample category generate description information corresponding to the sample set.
- the device further includes: a reading part configured to read parameter values of samples in the sample list according to the description information and the sample list.
- the reading part is further configured to: obtain objects in the sample list; according to the sample index, obtain the first object corresponding to the sample index from the obtained objects.
- the reading part is further configured to: obtain a sample index; according to the sample index, obtain the parameter value of the sample corresponding to the sample index from the sample list.
- the reading part is further configured to: obtain the object corresponding to the target field in the sample list; determine the data type and the data type of the second sample corresponding to the obtained object according to the description information. Reading mode; according to the reading mode, the parameter value of the second sample is read from the obtained object.
- the reading part is further configured to: obtain the field name of the target field; according to the field name, obtain the parameter value of the target field corresponding to the field name from the sample list.
- the device further includes: an operation part configured to perform at least one of query, modification, and deletion of the parameter value of the read sample.
- the functions or parts included in the device provided in the embodiments of the present disclosure can be used to execute the methods described in the above method embodiments.
- the functions or parts included in the device provided in the embodiments of the present disclosure can be used to execute the methods described in the above method embodiments.
- parts may be parts of circuits, parts of processors, parts of programs or software, etc., of course, may also be units, modules, or non-modular.
- the embodiments of the present disclosure also provide a computer-readable storage medium on which computer program instructions are stored, and the computer program instructions implement the above-mentioned method when executed by a processor.
- the computer-readable storage medium may be a volatile computer-readable storage medium or a non-volatile computer-readable storage medium.
- An embodiment of the present disclosure also proposes an electronic device, including: a processor; a memory configured to store executable instructions of the processor; wherein the processor is configured as the aforementioned method.
- the electronic device can be provided as a terminal, server or other form of device.
- Fig. 5 is a block diagram showing an electronic device 800 according to an exemplary embodiment.
- the electronic device 800 may be a mobile phone, a computer, a digital broadcasting terminal, a messaging device, a game console, a tablet device, a medical device, a fitness device, a personal digital assistant, and other terminals.
- the electronic device 800 may include one or more of the following components: a processing component 802, a memory 804, a power supply component 806, a multimedia component 808, an audio component 810, an input/output (I/O) interface 812, and a sensor component 814 , And communication component 816.
- the processing component 802 generally controls the overall operations of the electronic device 800, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations.
- the processing component 802 may include one or more processors 820 to execute instructions to complete all or part of the steps of the foregoing method.
- the processing component 802 may include one or more modules to facilitate the interaction between the processing component 802 and other components.
- the processing component 802 may include a multimedia module to facilitate the interaction between the multimedia component 808 and the processing component 802.
- the memory 804 is configured to store various types of data to support operations in the electronic device 800. Examples of these data include instructions for any application or method operating on the electronic device 800, contact data, phone book data, messages, pictures, videos, etc.
- the memory 804 can be implemented by any type of volatile or non-volatile storage device or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable and Programmable read only memory (EPROM), programmable read only memory (PROM), read only memory (ROM), magnetic memory, flash memory, magnetic disk or optical disk.
- SRAM static random access memory
- EEPROM electrically erasable programmable read-only memory
- EPROM erasable and Programmable read only memory
- PROM programmable read only memory
- ROM read only memory
- magnetic memory flash memory
- flash memory magnetic disk or optical disk.
- the power supply component 806 provides power for various components of the electronic device 800.
- the power supply component 806 may include a power management system, one or more power supplies, and other components associated with generating, managing, and distributing power for the electronic device 800.
- the multimedia component 808 includes a screen that provides an output interface between the electronic device 800 and the user.
- the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from the user.
- the touch panel includes one or more touch sensors to sense touch, sliding, and gestures on the touch panel. The touch sensor may not only sense the boundary of a touch or slide action, but also detect the duration and pressure related to the touch or slide operation.
- the multimedia component 808 includes a front camera and/or a rear camera. When the electronic device 800 is in an operation mode, such as a shooting mode or a video mode, the front camera and/or the rear camera can receive external multimedia data. Each front camera and rear camera can be a fixed optical lens system or have focal length and optical zoom capabilities.
- the audio component 810 is configured to output and/or input audio signals.
- the audio component 810 includes a microphone (MIC), and when the electronic device 800 is in an operation mode, such as a call mode, a recording mode, and a voice recognition mode, the microphone is configured to receive an external audio signal.
- the received audio signal may be further stored in the memory 804 or transmitted via the communication component 816.
- the audio component 810 further includes a speaker for outputting audio signals.
- the I/O interface 812 provides an interface between the processing component 802 and a peripheral interface module.
- the above-mentioned peripheral interface module may be a keyboard, a click wheel, a button, and the like. These buttons may include, but are not limited to: home button, volume button, start button, and lock button.
- the sensor component 814 includes one or more sensors for providing the electronic device 800 with various aspects of state evaluation.
- the sensor component 814 can detect the on/off status of the electronic device 800 and the relative positioning of the components.
- the component is the display and the keypad of the electronic device 800.
- the sensor component 814 can also detect the electronic device 800 or the electronic device 800.
- the position of the component changes, the presence or absence of contact between the user and the electronic device 800, the orientation or acceleration/deceleration of the electronic device 800, and the temperature change of the electronic device 800.
- the sensor component 814 may include a proximity sensor configured to detect the presence of nearby objects when there is no physical contact.
- the sensor component 814 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications.
- the sensor component 814 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
- the communication component 816 is configured to facilitate wired or wireless communication between the electronic device 800 and other devices.
- the electronic device 800 can access a wireless network based on a communication standard, such as WiFi, 2G, or 3G, or a combination thereof.
- the communication component 816 receives a broadcast signal or broadcast related information from an external broadcast management system via a broadcast channel.
- the communication component 816 further includes a near field communication (NFC) module to facilitate short-range communication.
- the NFC module can be implemented based on radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology and other technologies.
- RFID radio frequency identification
- IrDA infrared data association
- UWB ultra-wideband
- Bluetooth Bluetooth
- the electronic device 800 may be implemented by one or more application-specific integrated circuits (ASIC), digital signal processors (DSP), digital signal processing devices (DSPD), programmable logic devices (PLD), field-available A programmable gate array (FPGA), controller, microcontroller, microprocessor, or other electronic components are implemented to implement the above methods.
- ASIC application-specific integrated circuits
- DSP digital signal processors
- DSPD digital signal processing devices
- PLD programmable logic devices
- FPGA field-available A programmable gate array
- controller microcontroller, microprocessor, or other electronic components are implemented to implement the above methods.
- a computer-readable storage medium is also provided, such as the memory 804 including computer program instructions, which can be executed by the processor 820 of the electronic device 800 to complete the foregoing method.
- the embodiments of the present disclosure also provide a computer program product, including computer-readable code, and when the computer-readable code runs on a device, a processor in the device executes instructions for implementing the method provided in any of the above embodiments.
- the computer program product can be specifically implemented by hardware, software, or a combination thereof.
- the computer program product is specifically embodied as a computer storage medium.
- the computer program product is specifically embodied as a software product, such as a software development kit (SDK), etc. Wait.
- SDK software development kit
- Fig. 6 is a block diagram showing an electronic device 1900 according to an exemplary embodiment.
- the electronic device 1900 may be provided as a server. 6
- the electronic device 1900 includes a processing component 1922, which further includes one or more processors, and a memory resource represented by the memory 1932, for storing instructions executable by the processing component 1922, such as application programs.
- the application program stored in the memory 1932 may include one or more modules each corresponding to a set of instructions.
- the processing component 1922 is configured to execute instructions to perform the above-described methods.
- the electronic device 1900 may also include a power supply component 1926 configured to perform power management of the electronic device 1900, a wired or wireless network interface 1950 configured to connect the electronic device 1900 to the network, and an input output (I/O) interface 1958 .
- the electronic device 1900 can operate based on an operating system stored in the memory 1932, such as Windows ServerTM, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM or the like.
- a non-volatile computer-readable storage medium is also provided, such as the memory 1932 including computer program instructions, which can be executed by the processing component 1922 of the electronic device 1900 to complete the foregoing method.
- the present disclosure may be a system, method and/or computer program product.
- the computer program product may include a computer-readable storage medium loaded with computer-readable program instructions for enabling a processor to implement various aspects of the embodiments of the present disclosure.
- the computer-readable storage medium may be a tangible device that can hold and store instructions used by the instruction execution device.
- the computer-readable storage medium may be, for example, but not limited to, an electrical storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing.
- Non-exhaustive list of computer-readable storage media include: portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM) Or flash memory), static random access memory (SRAM), portable compact disk read-only memory (CD-ROM), digital versatile disk (DVD), memory stick, floppy disk, mechanical encoding device, such as a printer with instructions stored thereon
- RAM random access memory
- ROM read-only memory
- EPROM erasable programmable read-only memory
- flash memory flash memory
- SRAM static random access memory
- CD-ROM compact disk read-only memory
- DVD digital versatile disk
- memory stick floppy disk
- mechanical encoding device such as a printer with instructions stored thereon
- the computer-readable storage medium used here is not interpreted as the instantaneous signal itself, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through waveguides or other transmission media (for example, light pulses through fiber optic cables), or through wires Transmission of electrical signals.
- the computer-readable program instructions described herein can be downloaded from a computer-readable storage medium to various computing/processing devices, or downloaded to an external computer or external storage device via a network, such as the Internet, a local area network, a wide area network, and/or a wireless network.
- the network may include copper transmission cables, optical fiber transmission, wireless transmission, routers, firewalls, switches, gateway computers, and/or edge servers.
- the network adapter card or network interface in each computing/processing device receives computer-readable program instructions from the network, and forwards the computer-readable program instructions for storage in the computer-readable storage medium in each computing/processing device .
- the computer program instructions used to perform the operations of the embodiments of the present disclosure may be assembly instructions, instruction set architecture (ISA) instructions, machine instructions, machine-related instructions, microcode, firmware instructions, state setting data, or one or more programming instructions.
- Source code or object code written in any combination of languages, the programming language includes object-oriented programming languages such as Smalltalk, C++, etc., and conventional procedural programming languages such as "C" language or similar programming languages.
- Computer-readable program instructions can be executed entirely on the user's computer, partly on the user's computer, executed as a stand-alone software package, partly on the user's computer and partly executed on a remote computer, or entirely on the remote computer or server carried out.
- the remote computer can be connected to the user's computer through any kind of network-including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (for example, using an Internet service provider to connect to the user's computer) connection).
- LAN local area network
- WAN wide area network
- an electronic circuit such as a programmable logic circuit, a field programmable gate array (FPGA), or a programmable logic array (PLA), can be customized by using the status information of the computer-readable program instructions.
- FPGA field programmable gate array
- PDA programmable logic array
- the computer-readable program instructions are executed to realize various aspects of the present disclosure.
- These computer-readable program instructions can be provided to the processor of a general-purpose computer, a special-purpose computer, or other programmable data processing device, thereby producing a machine that makes these instructions when executed by the processor of the computer or other programmable data processing device , A device that implements the functions/actions specified in one or more blocks in the flowcharts and/or block diagrams is produced. It is also possible to store these computer-readable program instructions in a computer-readable storage medium. These instructions make computers, programmable data processing apparatuses, and/or other devices work in a specific manner. Thus, the computer-readable medium storing the instructions includes An article of manufacture, which includes instructions for implementing various aspects of the functions/actions specified in one or more blocks in the flowcharts and/or block diagrams.
- each block in the flowchart or block diagram may represent a module, program segment, or part of an instruction, and the module, program segment, or part of an instruction contains one or more components for realizing the specified logical function.
- Executable instructions may also occur in a different order than the order marked in the drawings. For example, two consecutive blocks can actually be executed substantially in parallel, or they can sometimes be executed in the reverse order, depending on the functions involved.
- each block in the block diagram and/or flowchart, and the combination of the blocks in the block diagram and/or flowchart can be implemented by a dedicated hardware-based system that performs the specified functions or actions Or it can be realized by a combination of dedicated hardware and computer instructions.
- the embodiments of the present disclosure relate to a data processing method and device, electronic equipment, and storage medium.
- the method includes: determining a sample list including a plurality of samples according to a field of a sample and a parameter value corresponding to the field; The data type of the samples in the sample list is determined, and the description information corresponding to the sample list is determined; and the sample set is determined according to the sample list and the description information.
- the sample set used to store each sample and the corresponding description information can be determined, multiple types of samples can be stored in a unified manner, and the sample set includes description information describing the data type, It is convenient to store and read sample data, can support multiple types of data storage and read, and is easy to understand.
Abstract
Description
Claims (25)
- 一种数据处理方法,包括:A data processing method, including:根据样本的字段及与所述字段对应的参数值,确定包括多个样本的样本列表;Determine a sample list including multiple samples according to the fields of the samples and the parameter values corresponding to the fields;根据所述样本列表中样本的数据类型,确定与所述样本列表对应的描述信息;Determine the description information corresponding to the sample list according to the data type of the sample in the sample list;根据所述样本列表和所述描述信息,确定样本集。According to the sample list and the description information, a sample set is determined.
- 根据权利要求1所述的方法,其中,每个样本具有样本索引,根据样本的字段及与所述字段对应的参数值,确定包括多个样本的样本列表,包括:The method according to claim 1, wherein each sample has a sample index, and determining a sample list including a plurality of samples according to a field of the sample and a parameter value corresponding to the field includes:将所述字段确定为所述样本列表的列坐标,将所述样本索引确定为所述样本列表的行坐标;Determining the field as the column coordinates of the sample list, and determining the sample index as the row coordinates of the sample list;将所述字段对应的参数值进行对象化,获得与所述样本对应的对象;Objectify the parameter value corresponding to the field to obtain the object corresponding to the sample;将所述对象中的数据添加至与所述列坐标和所述行坐标对应的存储区域,获得所述样本列表。The data in the object is added to the storage area corresponding to the column coordinates and the row coordinates to obtain the sample list.
- 根据权利要求1或2所述的方法,其中,所述描述信息包括对样本的读取方式的描述信息,The method according to claim 1 or 2, wherein the description information includes description information of how the sample is read,所述根据所述样本列表中所述样本的数据类型,确定与所述样本列表对应的描述信息,包括:The determining the description information corresponding to the sample list according to the data type of the sample in the sample list includes:根据样本列表中样本的数据类型,确定用于读取样本的读取方式;According to the data type of the sample in the sample list, determine the reading method used to read the sample;根据所述样本的数据类型和对应的读取方式,获得所述描述信息。The description information is obtained according to the data type of the sample and the corresponding reading mode.
- 根据权利要求1-3任一项所述的方法,其中,所述描述信息还包括对样本存储位置的描述信息,所述存储位置包括所述样本集的存储路径及样本在所述样本集中的索引位置。The method according to any one of claims 1 to 3, wherein the description information further includes description information of the storage location of the sample, and the storage location includes the storage path of the sample set and the location of the sample in the sample set. Index position.
- 根据权利要求1-4任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 1-4, wherein the method further comprises:确定用于划分样本集中样本的类别的索引范围信息;Determine the index range information used to classify the types of samples in the sample set;根据所述样本集中样本的描述信息和所述样本类别的索引范围信息,生成与所述样本集对应的描述信息。According to the description information of the samples in the sample set and the index range information of the sample category, the description information corresponding to the sample set is generated.
- 根据权利要求1-5任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 1-5, wherein the method further comprises:根据所述描述信息及所述样本列表,读取所述样本列表中样本的参数值。According to the description information and the sample list, read the parameter values of the samples in the sample list.
- 根据权利要求6所述的方法,其中,所述根据所述描述信息及所述样本列表,读取所述样本列表中样本的参数值,包括:The method according to claim 6, wherein said reading parameter values of samples in said sample list according to said description information and said sample list comprises:获取所述样本列表中的对象;Obtaining objects in the sample list;根据样本索引,从所获取的对象中,获得与所述样本索引对应的第一样本的第一对象;According to the sample index, obtain the first object of the first sample corresponding to the sample index from the acquired objects;根据所述描述信息确定所述第一样本的数据类型及对应的读取方式;Determine the data type of the first sample and the corresponding reading method according to the description information;根据所述读取方式,从第一对象中读取所述第一样本的参数值。According to the reading method, the parameter value of the first sample is read from the first object.
- 根据权利要求6所述的方法,其中,所述根据所述描述信息及所述样本列表,读取所述样本列表中样本的参数值,包括:The method according to claim 6, wherein said reading parameter values of samples in said sample list according to said description information and said sample list comprises:获取样本索引;Get sample index;根据所述样本索引,从所述样本列表中获取所述样本索引对应的样本的参数值。According to the sample index, the parameter value of the sample corresponding to the sample index is obtained from the sample list.
- 根据权利要求6所述的方法,其中,所述根据所述描述信息及所述样本列表,读取所述样本列表中样本的参数值,包括:The method according to claim 6, wherein said reading parameter values of samples in said sample list according to said description information and said sample list comprises:获取所述样本列表中目标字段对应的对象;Obtaining the object corresponding to the target field in the sample list;根据所述描述信息确定所获取的对象对应的第二样本的数据类型及读取方式;Determine the data type and reading mode of the second sample corresponding to the acquired object according to the description information;根据所述读取方式,从所获取的对象中读取所述第二样本的参数值。According to the reading method, the parameter value of the second sample is read from the acquired object.
- 根据权利要求6所述的方法,其中,所述根据所述描述信息及所述样本列表,读取所述样本列表中样本的参数值,包括:The method according to claim 6, wherein said reading parameter values of samples in said sample list according to said description information and said sample list comprises:获取目标字段的字段名;Get the field name of the target field;根据所述字段名,从样本列表中获取所述字段名对应的目标字段的参数值。According to the field name, the parameter value of the target field corresponding to the field name is obtained from the sample list.
- 根据权利要求6-10任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 6-10, wherein the method further comprises:对读取的所述样本的参数值进行查询、修改和删除处理中至少一种操作。At least one operation of query, modification and deletion processing is performed on the parameter value of the read sample.
- 一种数据处理装置,包括:A data processing device includes:第一确定部分,被配置为根据样本的字段及与所述字段对应的参数值,确定包括多个样本的样本列表;The first determining part is configured to determine a sample list including multiple samples according to the fields of the samples and the parameter values corresponding to the fields;第二确定部分,被配置为根据所述样本列表中样本的数据类型,确定与所述样本列表对应的描述信息;The second determining part is configured to determine the description information corresponding to the sample list according to the data type of the sample in the sample list;第三确定部分,被配置为根据所述样本列表和所述描述信息,确定样本集。The third determining part is configured to determine a sample set according to the sample list and the description information.
- 根据权利要求12所述的装置,其中,每个样本具有样本索引所述第一确定部分还被配置为:The apparatus according to claim 12, wherein each sample has a sample index and the first determining part is further configured to:将所述字段确定为所述样本列表的列坐标,将所述样本索引确定为所述样本列表的行坐标;Determining the field as the column coordinates of the sample list, and determining the sample index as the row coordinates of the sample list;将所述字段对应的参数值进行对象化,获得与所述样本对应的对象;Objectify the parameter value corresponding to the field to obtain the object corresponding to the sample;将所述对象中的数据添加至与所述列坐标和所述行坐标对应的存储区域,获得所述样本列表。The data in the object is added to the storage area corresponding to the column coordinates and the row coordinates to obtain the sample list.
- 根据权利要求12或13所述的装置,其中,所述描述信息包括对样本的读取方式的描述信息,所述第二确定部分还被配置为:The device according to claim 12 or 13, wherein the description information includes description information of how to read the sample, and the second determining part is further configured to:根据样本列表中样本的数据类型,确定用于读取样本的读取方式;According to the data type of the sample in the sample list, determine the reading method used to read the sample;根据所述样本的数据类型和对应的读取方式,获得所述描述信息。The description information is obtained according to the data type of the sample and the corresponding reading mode.
- 根据权利要求12-14任一项所述的装置,其中,所述描述信息还包括对样本存储位置的描述信息,所述存储位置包括所述样本集的存储路径及样本在所述样本集中的索引位置。The device according to any one of claims 12-14, wherein the description information further includes description information of the storage location of the sample, and the storage location includes the storage path of the sample set and the location of the sample in the sample set. Index position.
- 根据权利要求12-15任一项所述的装置,其中,所述方法还包括:The device according to any one of claims 12-15, wherein the method further comprises:第四确定部分,被配置为确定用于划分样本集中样本的类别的索引范围信息;The fourth determining part is configured to determine index range information used to classify the types of samples in the sample set;生成部分,被配置为根据所述样本集中样本的描述信息和所述样本类别的索引范围信息,生成与所述样本集对应的描述信息。The generating part is configured to generate description information corresponding to the sample set according to the description information of the samples in the sample set and the index range information of the sample category.
- 根据权利要求12-16任一项所述的装置,其中,所述方法还包括:The device according to any one of claims 12-16, wherein the method further comprises:读取部分,被配置为根据所述描述信息及所述样本列表,读取所述样本列表中样本的参数值。The reading part is configured to read the parameter values of the samples in the sample list according to the description information and the sample list.
- 根据权利要求17所述的装置,其中,所述读取部分还被配置为:The device according to claim 17, wherein the reading part is further configured to:获取所述样本列表中的对象;Obtaining objects in the sample list;根据样本索引,从所获取的对象中,获得与所述样本索引对应的第一样本的第一对象;According to the sample index, obtain the first object of the first sample corresponding to the sample index from the acquired objects;根据所述描述信息确定所述第一样本的数据类型及对应的读取方式;Determine the data type of the first sample and the corresponding reading method according to the description information;根据所述读取方式,从所述第一对象中读取所述第一样本的参数值。According to the reading method, the parameter value of the first sample is read from the first object.
- 根据权利要求17所述的装置,其中,所述读取部分还被配置为:The device according to claim 17, wherein the reading part is further configured to:获取样本索引;Get sample index;根据所述样本索引,从所述样本列表中获取所述样本索引对应的样本的参数值。According to the sample index, the parameter value of the sample corresponding to the sample index is obtained from the sample list.
- 根据权利要求17所述的装置,其中,所述读取部分还被配置为:The device according to claim 17, wherein the reading part is further configured to:获取所述样本列表中目标字段对应的对象;Obtaining the object corresponding to the target field in the sample list;根据所述描述信息确定所获取的对象对应的第二样本的数据类型及读取方式;Determine the data type and reading mode of the second sample corresponding to the acquired object according to the description information;根据所述读取方式,从所获取的对象中读取所述第二样本的参数值。According to the reading method, the parameter value of the second sample is read from the acquired object.
- 根据权利要求17所述的装置,其中,所述读取部分还被配置为:The device according to claim 17, wherein the reading part is further configured to:获取目标字段的字段名;Get the field name of the target field;根据所述字段名,从样本列表中获取所述字段名对应的目标字段的参数值。According to the field name, the parameter value of the target field corresponding to the field name is obtained from the sample list.
- 根据权利要求17-21任一项所述的装置,其中,所述装置还包括:The device according to any one of claims 17-21, wherein the device further comprises:操作部分,被配置为对读取的所述样本的参数值进行查询、修改和删除处理中至少一种操作。The operation part is configured to perform at least one operation of query, modification and deletion of the parameter value of the read sample.
- 一种电子设备,包括:An electronic device including:处理器;processor;被配置为存储处理器可执行指令的存储器;A memory configured to store executable instructions of the processor;其中,所述处理器被配置为调用所述存储器存储的指令,以执行权利要求1至11中任意一项所述的方法。Wherein, the processor is configured to call instructions stored in the memory to execute the method according to any one of claims 1 to 11.
- 一种计算机可读存储介质,其上存储有计算机程序指令,所述计算机程序指令被处理器执行时实现权利要求1至11中任意一项所述的方法。A computer-readable storage medium having computer program instructions stored thereon, and when the computer program instructions are executed by a processor, the method according to any one of claims 1 to 11 is implemented.
- 一种计算机程序,包括计算机可读代码,当所述计算机可读代码在电子设备中运行时,所述电子设备中的处理器执行时实现权利要求1至11中任意一项所述的方法。A computer program comprising computer readable code, when the computer readable code runs in an electronic device, a processor in the electronic device executes the method according to any one of claims 1 to 11.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2021530871A JP2022510660A (en) | 2019-10-28 | 2020-06-12 | Data processing methods and their devices, electronic devices, and storage media |
KR1020217020477A KR20210096230A (en) | 2019-10-28 | 2020-06-12 | Data processing methods and devices, electronic devices and storage media |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911031910.5A CN110569329B (en) | 2019-10-28 | 2019-10-28 | Data processing method and device, electronic equipment and storage medium |
CN201911031910.5 | 2019-10-28 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021082463A1 true WO2021082463A1 (en) | 2021-05-06 |
Family
ID=68786157
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/095955 WO2021082463A1 (en) | 2019-10-28 | 2020-06-12 | Data processing method and apparatus, electronic device and storage medium |
Country Status (5)
Country | Link |
---|---|
JP (1) | JP2022510660A (en) |
KR (1) | KR20210096230A (en) |
CN (1) | CN110569329B (en) |
TW (1) | TWI755890B (en) |
WO (1) | WO2021082463A1 (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110569329B (en) * | 2019-10-28 | 2022-08-02 | 深圳市商汤科技有限公司 | Data processing method and device, electronic equipment and storage medium |
CN113642408A (en) * | 2021-07-15 | 2021-11-12 | 杭州玖欣物联科技有限公司 | Method for processing and analyzing picture data in real time through industrial internet |
TWI772233B (en) * | 2021-11-29 | 2022-07-21 | 大陸商常州欣盛半導體技術股份有限公司 | Automatic integration method of cof test data |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101192148A (en) * | 2006-12-01 | 2008-06-04 | 深圳迈瑞生物医疗电子股份有限公司 | New and old application program compatible data processing method and its data storage method |
CN106126692A (en) * | 2016-06-29 | 2016-11-16 | 北京奇虎科技有限公司 | The searching method of a kind of sample data and device |
CN108509485A (en) * | 2018-02-07 | 2018-09-07 | 深圳壹账通智能科技有限公司 | Preprocess method, device, computer equipment and the storage medium of data |
CN110569329A (en) * | 2019-10-28 | 2019-12-13 | 深圳市商汤科技有限公司 | Data processing method and device, electronic equipment and storage medium |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWI222057B (en) * | 1998-10-22 | 2004-10-11 | Ibm | Method to automatically retrieve information and data, computer readable medium, and method and system of automatically selecting encoding parameters |
CN101470734A (en) * | 2007-12-29 | 2009-07-01 | 北大方正集团有限公司 | Data identification and analyzing method and apparatus |
CN102110202B (en) * | 2009-12-29 | 2014-01-01 | 比亚迪股份有限公司 | Method for reading DRM file in mobile terminal file system |
CN102253810B (en) * | 2010-05-17 | 2014-02-05 | 深圳市世纪光速信息技术有限公司 | Method, apparatus and system used for reading data |
US9076152B2 (en) * | 2010-10-20 | 2015-07-07 | Microsoft Technology Licensing, Llc | Semantic analysis of information |
US9378065B2 (en) * | 2013-03-15 | 2016-06-28 | Advanced Elemental Technologies, Inc. | Purposeful computing |
CN104503707B (en) * | 2014-12-24 | 2018-03-06 | 华为技术有限公司 | Read the method and device of data |
CN105094707B (en) * | 2015-08-18 | 2018-03-13 | 华为技术有限公司 | A kind of data storage, read method and device |
CN108351898B (en) * | 2015-10-30 | 2021-10-08 | 安客诚公司 | Automated interpretation for structured multi-field file layout |
CN109189822B (en) * | 2018-08-08 | 2022-01-14 | 北京大数据研究院 | Data processing method and device |
CN109522424B (en) * | 2018-10-16 | 2020-04-24 | 北京达佳互联信息技术有限公司 | Data processing method and device, electronic equipment and storage medium |
-
2019
- 2019-10-28 CN CN201911031910.5A patent/CN110569329B/en active Active
-
2020
- 2020-06-12 JP JP2021530871A patent/JP2022510660A/en active Pending
- 2020-06-12 WO PCT/CN2020/095955 patent/WO2021082463A1/en active Application Filing
- 2020-06-12 KR KR1020217020477A patent/KR20210096230A/en active Search and Examination
- 2020-10-14 TW TW109135575A patent/TWI755890B/en active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101192148A (en) * | 2006-12-01 | 2008-06-04 | 深圳迈瑞生物医疗电子股份有限公司 | New and old application program compatible data processing method and its data storage method |
CN106126692A (en) * | 2016-06-29 | 2016-11-16 | 北京奇虎科技有限公司 | The searching method of a kind of sample data and device |
CN108509485A (en) * | 2018-02-07 | 2018-09-07 | 深圳壹账通智能科技有限公司 | Preprocess method, device, computer equipment and the storage medium of data |
CN110569329A (en) * | 2019-10-28 | 2019-12-13 | 深圳市商汤科技有限公司 | Data processing method and device, electronic equipment and storage medium |
Also Published As
Publication number | Publication date |
---|---|
KR20210096230A (en) | 2021-08-04 |
TW202117707A (en) | 2021-05-01 |
JP2022510660A (en) | 2022-01-27 |
TWI755890B (en) | 2022-02-21 |
CN110569329A (en) | 2019-12-13 |
CN110569329B (en) | 2022-08-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2020029966A1 (en) | Method and device for video processing, electronic device, and storage medium | |
WO2021051650A1 (en) | Method and apparatus for association detection for human face and human hand, electronic device and storage medium | |
WO2021082463A1 (en) | Data processing method and apparatus, electronic device and storage medium | |
RU2615632C2 (en) | Method and device for recognizing communication messages | |
US8775969B2 (en) | Contact searching method and apparatus, and applied mobile terminal | |
US11334723B2 (en) | Method and device for processing untagged data, and storage medium | |
WO2017092121A1 (en) | Information processing method and device | |
WO2018090688A1 (en) | Input method, device and electronic device | |
WO2023078414A1 (en) | Related article search method and apparatus, electronic device, and storage medium | |
KR20130117021A (en) | Method for searching content of message and an electronic device thereof | |
TW201837752A (en) | Feedback response method and device | |
US11373038B2 (en) | Method and terminal for performing word segmentation on text information, and storage medium | |
TWI739633B (en) | Storage and reading method, electronic equipment and computer-readable storage medium | |
CN112328809A (en) | Entity classification method, device and computer readable storage medium | |
WO2019144724A1 (en) | Expression input method and apparatus | |
WO2023092975A1 (en) | Image processing method and apparatus, electronic device, storage medium, and computer program product | |
CN106959970B (en) | Word bank, processing method and device of word bank and device for processing word bank | |
CN110019657B (en) | Processing method, apparatus and machine-readable medium | |
US20170060822A1 (en) | Method and device for storing string | |
US10423706B2 (en) | Method and device for selecting information | |
CN110929122A (en) | Data processing method and device and data processing device | |
CN113312475B (en) | Text similarity determination method and device | |
CN107544969B (en) | Method for optimizing size of static lexicon and electronic equipment | |
CN113093920A (en) | Entry information display method, device and system | |
CN113918029A (en) | Input method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
ENP | Entry into the national phase |
Ref document number: 2021530871 Country of ref document: JP Kind code of ref document: A |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20882363 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 20217020477 Country of ref document: KR Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20882363 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 210922) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20882363 Country of ref document: EP Kind code of ref document: A1 |