CN107766486B - Method, device, readable medium and storage controller for randomly extracting sample data - Google Patents

Method, device, readable medium and storage controller for randomly extracting sample data Download PDF

Info

Publication number
CN107766486B
CN107766486B CN201710959595.7A CN201710959595A CN107766486B CN 107766486 B CN107766486 B CN 107766486B CN 201710959595 A CN201710959595 A CN 201710959595A CN 107766486 B CN107766486 B CN 107766486B
Authority
CN
China
Prior art keywords
sample data
current
queue
weight
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710959595.7A
Other languages
Chinese (zh)
Other versions
CN107766486A (en
Inventor
邵辉
曹雪韬
王宏达
崔冲冲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur General Software Co Ltd
Original Assignee
Inspur General Software Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur General Software Co Ltd filed Critical Inspur General Software Co Ltd
Priority to CN201710959595.7A priority Critical patent/CN107766486B/en
Publication of CN107766486A publication Critical patent/CN107766486A/en
Application granted granted Critical
Publication of CN107766486B publication Critical patent/CN107766486B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention provides a method, a device, a readable medium and a storage controller for randomly extracting sample data, wherein the method comprises the following steps: a0: arranging all sample data of the sample data set into a sequence queue, and determining the extraction quantity; a1: generating a random number corresponding to current sample data at the head of the sequence queue; a2: detecting whether the random number is less than the extraction number, if so, executing A3; otherwise, a4 is executed; a3, taking out the current sample data at the head of the queue as reference sample data, and executing A5; a4: placing the current sample data at the head of the queue at the tail of the sequence queue, and executing A1; a5: and detecting the current number of each extracted reference sample data, and executing A1 when the current number is less than the extraction number. By the technical scheme of the invention, the sample data with the corresponding quantity can be more accurately and randomly extracted from the sample data set.

Description

Method, device, readable medium and storage controller for randomly extracting sample data
Technical Field
The present invention relates to the field of computer technologies, and in particular, to a method and an apparatus for randomly extracting sample data, a readable medium, and a storage controller.
Background
The application scene of randomly extracting the sample data is very wide. Specifically, when the sample data set is large, a small amount of sample data can be randomly extracted from a large amount of sample data in the sample data set for analysis so as to realize a corresponding service.
At present, when m sample data are randomly extracted from n sample data of a sample data set, n sample data of the sample data set may be arranged into a sequence queue, then m positive integers smaller than n are generated according to actual requirements, and then m sample data with the same sequence position as each positive integer in the sequence queue are taken out, that is, m sample data are randomly extracted from n sample data of the sample data set.
In the above technical solution, the sample data in the sample data set may not be accurately randomly extracted due to the too large or too small amount of sample data in the sample data set.
Disclosure of Invention
The embodiment of the invention provides a method, a device, a readable medium and a storage controller for randomly extracting sample data, which can more accurately randomly extract a corresponding amount of sample data from a sample data set.
In a first aspect, the present invention provides a method for randomly extracting sample data, including:
a0: arranging all sample data of the sample data set into a sequence queue, and determining the extraction quantity;
a1: generating a random number corresponding to current sample data at the head of the sequence queue;
a2: detecting whether the random number is less than the extraction number, if so, executing A3; otherwise, a4 is executed;
a3, taking out the current sample data at the head of the queue as reference sample data, and executing A5;
a4: placing the current sample data at the head of the queue at the tail of the sequence queue, and executing A1;
a5: and detecting the current number of each extracted reference sample data, and executing A1 when the current number is less than the extraction number.
Preferably, the first and second electrodes are formed of a metal,
further comprising: presetting at least two weight tables, wherein each weight table corresponds to a weight coefficient and at least one piece of characteristic information;
after the a3, further comprising:
analyzing the reference sample data to determine current characteristic information carried in the reference sample data;
and storing the reference sample data to a target weight table in the at least two weight tables, wherein at least one piece of target characteristic information corresponding to the target weight table comprises the current characteristic information.
Preferably, the first and second electrodes are formed of a metal,
further comprising:
when the current number is not less than the extraction number, determining the screening number respectively corresponding to each weight table according to the storage number of the reference sample data respectively stored in each weight table and the weight coefficient respectively corresponding to each weight table;
and extracting target sample data with the target screening quantity corresponding to the weight table from each reference sample data stored in the weight table aiming at each weight table.
In a second aspect, an embodiment of the present invention provides an apparatus for randomly extracting sample data, including:
the device comprises a preprocessing module, a random number management module, an extraction management module, a queue management module and a detection module; wherein,
the preprocessing module is used for arranging all sample data of the sample data set into a sequence queue and determining the number of the samples to be extracted;
the random number management module is used for generating a random number corresponding to the current sample data at the head of the queue in the sequence queue, detecting whether the random number is smaller than the extraction quantity, and if so, triggering the extraction management module; otherwise, triggering the queue management module;
the extraction management module is used for taking out the current sample data at the head of the queue as reference sample data under the triggering of the random number management module and triggering the detection module;
the queue management module is used for placing the current sample data at the head of the queue at the tail of the sequence queue under the triggering of the random number management module and triggering the random number management module;
the detection module is configured to detect the current number of each of the reference sample data taken out under the trigger of the extraction management module, and trigger the random number management module when the current number is smaller than the extraction number.
Preferably, the first and second electrodes are formed of a metal,
further comprising: the device comprises a setting module, an analysis module and a storage processing module; wherein,
the setting module is used for presetting at least two weight tables, and each weight table corresponds to a weight coefficient and at least one piece of characteristic information respectively;
the analysis module is used for analyzing the reference sample data to determine current characteristic information carried in the reference sample data;
the storage processing module is configured to store the reference sample data to a target weight table of the at least two weight tables, where at least one piece of target feature information corresponding to the target weight table includes the current feature information.
Preferably, the first and second electrodes are formed of a metal,
further comprising: the device comprises a quantity determining module and a screening and extracting module; wherein,
the number determining module is configured to determine, when the current number is not less than the extracted number, a filtering number corresponding to each weight table according to a storage number of reference sample data stored in each weight table and a weight coefficient corresponding to each weight table;
and the screening and extracting module is used for extracting target screening quantity target sample data corresponding to the weight tables from each reference sample data stored in the weight tables aiming at each weight table.
In a third aspect, an embodiment of the present invention provides a readable medium, which is characterized by including an execution instruction, and when a processor of a storage controller executes the execution instruction, the storage controller executes the method according to any one of the first aspect.
In a fourth aspect, an embodiment of the present invention provides a storage controller, including: a processor, a memory, and a bus;
the processor and the memory are connected through the bus;
the memory, when the storage controller is running, the processor executes the execution instructions stored by the memory to cause the storage controller to perform the method of any one of the first aspect.
The embodiment of the invention provides a method, a device, a readable medium and a storage controller for randomly extracting sample data, wherein in the method, each sample data of a sample data set is arranged into a sequence queue, the extraction quantity of the sample data needing to be extracted is determined, then a random number corresponding to the current sample data at the head of the queue in the sequence queue can be generated aiming at the formed sequence queue, whether the random number is less than the extraction quantity or not is detected, when the random number is not less than the extraction quantity, the current sample data at the head of the queue in the sequence queue does not accord with the extraction condition in the extraction process, the current sample data can be placed at the tail of the sequence queue, otherwise, when the random number is less than the extraction quantity, the current sample data at the head of the queue can be taken out as target sample data, the extraction process is circularly executed aiming at the current sample data at the head of the queue in the sequence queue, until the number of the extracted reference sample data reaches the determined extraction number, the sample data with the corresponding number is extracted from each sample data of the sample data set. In summary, according to the technical scheme provided by the embodiment of the invention, when the corresponding amount of sample data is randomly extracted from the sample data set, whether the condition that the sample data can be extracted as reference sample data is not directly related to the amount of the sample data in the sample data set is determined, and the corresponding amount of sample data can be more accurately randomly extracted from the sample data set.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.
Fig. 1 is a flowchart of a method for randomly extracting sample data according to an embodiment of the present invention;
FIG. 2 is a flow chart of another method for randomly sampling sample data according to an embodiment of the present invention;
fig. 3 is a schematic structural diagram of an apparatus for randomly extracting sample data according to an embodiment of the present invention;
fig. 4 is a schematic structural diagram of another apparatus for randomly extracting sample data according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer and more complete, the technical solutions in the embodiments of the present invention will be described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention, and based on the embodiments of the present invention, all other embodiments obtained by a person of ordinary skill in the art without creative efforts belong to the scope of the present invention.
As shown in fig. 1, an embodiment of the present invention provides a method for randomly extracting sample data, including:
a0: arranging all sample data of the sample data set into a sequence queue, and determining the extraction quantity;
a1: generating a random number corresponding to current sample data at the head of the sequence queue;
a2: detecting whether the random number is less than the extraction number, if so, executing A3; otherwise, a4 is executed;
a3, taking out the current sample data at the head of the queue as reference sample data, and executing A5;
a4: placing the current sample data at the head of the queue at the tail of the sequence queue, and executing A1;
a5: and detecting the current number of each extracted reference sample data, and executing A1 when the current number is less than the extraction number.
In the above embodiment of the present invention, by arranging each sample data of the sample data set into a sequential queue, and determining the number of samples to be extracted, then generating a random number corresponding to the current sample data at the head of the queue in the sequential queue for the formed sequential queue, and detecting whether the random number is smaller than the number of extractions, if the random number is not smaller than the number of extractions, it indicates that the current sample data at the head of the queue in the sequential queue does not meet the extraction condition in the current extraction process, the current sample data can be placed at the tail of the sequential queue, otherwise, if the random number is smaller than the number of extractions, the current sample data at the head of the queue can be taken out as the target sample data, the aforementioned extraction process is cyclically performed for the current sample data at the head of the queue in the sequential queue until the number of the taken-out reference sample data reaches the determined number of extractions, therefore, the sample data of corresponding quantity can be extracted from each sample data of the sample data set. In summary, according to the technical scheme provided by the embodiment of the invention, when the corresponding amount of sample data is randomly extracted from the sample data set, whether the condition that the sample data can be extracted as reference sample data is not directly related to the amount of the sample data in the sample data set is determined, and the corresponding amount of sample data can be more accurately randomly extracted from the sample data set.
In one embodiment of the present invention, the method further comprises: presetting at least two weight tables, wherein each weight table corresponds to a weight coefficient and at least one piece of characteristic information;
after the a3, further comprising:
analyzing the reference sample data to determine current characteristic information carried in the reference sample data;
and storing the reference sample data to a target weight table in the at least two weight tables, wherein at least one piece of target characteristic information corresponding to the target weight table comprises the current characteristic information.
In the above embodiment of the present invention, at least two weight tables are preset, each weight table corresponds to one weight coefficient and at least one feature information, after the target sample data is taken out, the target sample data can be analyzed to determine the current feature information carried in the target sample data, and the target sample data is stored in the target weight table of the at least two weight tables, where at least one target feature information corresponding to the target weight table includes the current feature information; therefore, the user can conveniently extract the sample data with corresponding quantity from different weight tables according to different feature information and different weight coefficients in combination with actual service requirements in the subsequent process.
Specifically, in an embodiment of the present invention, the method further includes: when the current number is not less than the extraction number, determining the screening number respectively corresponding to each weight table according to the storage number of the reference sample data respectively stored in each weight table and the weight coefficient respectively corresponding to each weight table; and extracting target sample data with the target screening quantity corresponding to the weight table from each reference sample data stored in the weight table aiming at each weight table.
In order to more clearly illustrate the technical solution and advantages of the present invention, for example, a set number of reference sample data are randomly extracted from a sample data set, each reference sample data is respectively stored in a corresponding weight table according to feature information carried in each reference sample data, and then a corresponding number of target sample data are respectively extracted from each weight table, as shown in fig. 2, the following steps may be specifically included:
step 201, at least two weight tables are preset, and each weight table corresponds to a weight coefficient and at least one feature information.
Step 202, arranging each sample data of the sample data set into a sequence queue, and determining the extraction quantity of the reference sample data.
In steps 201 to 202, the user may set the number of weight tables, the weight coefficient corresponding to each weight table, and at least one piece of feature information according to the actual service requirement.
For example, taking the example that each sample data in the sample data set is the business information of each enterprise registered in the east city and the west city of beijing city, and the business information of each enterprise in the east city needs to be extracted with emphasis, two weight tables A, B may be set, where at least one feature information corresponding to the weight table a includes the east city, the weight coefficient corresponding to the weight table a is 0.6, at least one feature information corresponding to the weight table B includes the west city, and the weight coefficient corresponding to the weight table B is 0.4.
It should be understood that, a corresponding black list may be further configured to store sample data that does not comply with the corresponding rule, for example, when the extracted reference sample data does not include feature information "eastern city region" or "western city region", the reference sample data may be stored in the black list.
Step 203, generating a random number corresponding to the current sample data at the head of the queue in the sequence queue.
Step 204, detecting whether the generated random number is smaller than the extraction number, if so, executing 206; otherwise, step 205 is performed.
And step 205, placing the current sample data at the head of the queue at the tail of the sequence queue.
Here, after step 205 is performed, step 203 may be performed again.
And step 206, taking out the current sample data at the head of the queue as reference sample data.
Step 207, the extracted reference sample data is analyzed to determine the current characteristic information carried in the reference sample data.
Step 208, storing the reference sample data to a target weight table of the at least two weight tables.
And at least one piece of target characteristic information corresponding to the target weight table comprises current characteristic information.
For example, when the extracted reference sample data is analyzed in step 207 to determine that the current feature information carried by the reference sample data is "eastern city area", the reference sample data may be stored in the weight table a, when the extracted reference sample data is analyzed in step 207 to determine that the current feature information carried by the reference sample data is "western city area", the reference sample data may be stored in the weight table B, and when the extracted reference sample data is analyzed in step 207 to determine that the current feature information carried by the reference sample data is neither "eastern city area" nor "western city area", the reference sample data may be stored in the blacklist.
Step 209, detecting whether the current number of each extracted reference sample data is less than the extraction number, if so, executing step 203; otherwise, step 210 is performed.
Step 210, determining the screening number corresponding to each weight table according to the storage number of the reference sample data stored in each weight table and the weight coefficient corresponding to each weight table.
For example, when the number of extractions is 400, the number of reference sample data stored in the weight table a is 200, and the number of reference sample data stored in the weight table B is 200, it may be determined that the number of filters corresponding to the weight table a is 120, which is a product of the number of stores 200 and the corresponding weight coefficient 0.6, and similarly, it may be determined that the number of filters corresponding to the weight table B is 80, which is a product of the number of stores 200 and the corresponding weight coefficient 0.4.
It is understood that, when the product of the storage quantity of the reference sample data stored in a certain weight table and the corresponding weight coefficient is not an integer, the product can be rounded by a rounding method, and the rounded numerical value is used as the screening quantity corresponding to the weight table.
Step 211, for each weight table, extracting a target screening quantity of target sample data corresponding to the weight table from each reference sample data stored in the weight table.
Obviously, 120 target sample data can be randomly extracted from 200 reference sample data stored in the weight value table a by a method similar to the steps 202 to 206; 80 target sample data are randomly extracted from 200 reference sample data stored in the weight value table B by a method similar to the steps 202 to 206.
As shown in fig. 3, an embodiment of the present invention provides an apparatus for randomly extracting sample data, including:
a preprocessing module 301, a random number management module 302, an extraction management module 303, a queue management module 304 and a detection module 305; wherein,
the preprocessing module 301 is configured to arrange sample data of the sample data set into a sequence queue and determine an extraction number;
the random number management module 302 is configured to generate a random number corresponding to current sample data at the head of the queue in the sequential queue, detect whether the random number is smaller than the extraction number, and trigger the extraction management module 303 if the random number is smaller than the extraction number; otherwise, triggering the queue management module 304;
the extraction management module 303 is configured to take out the current sample data at the head of the queue as reference sample data under the triggering of the random number management module 302, and trigger the detection module 305;
the queue management module 304 is configured to place the current sample data at the head of the queue at the tail of the sequential queue under the trigger of the random number management module, and trigger the random number management module 302;
the detecting module 305 is configured to detect the current number of each extracted reference sample data under the trigger of the extraction management module, and trigger the random number management module 302 when the current number is smaller than the extraction number.
As shown in fig. 4, in an embodiment of the present invention, the method further includes: a setting module 401, an analysis module and a 402 storage processing module 403; wherein,
the setting module 401 is configured to preset at least two weight tables, where each weight table corresponds to a weight coefficient and at least one piece of feature information;
the analyzing module 402 is configured to analyze the reference sample data to determine current feature information carried in the reference sample data;
the storage processing module 403 is configured to store the reference sample data in a target weight table of the at least two weight tables, where at least one piece of target feature information corresponding to the target weight table includes the current feature information.
Based on the embodiment shown in fig. 4, in an embodiment of the present invention, the method further includes: a quantity determination module (not shown in the drawings) and a screening extraction module (not shown in the drawings); wherein,
the number determining module is configured to determine, when the current number is not less than the extracted number, a filtering number corresponding to each weight table according to a storage number of reference sample data stored in each weight table and a weight coefficient corresponding to each weight table;
and the screening and extracting module is used for extracting target screening quantity target sample data corresponding to the weight tables from each reference sample data stored in the weight tables aiming at each weight table.
Because the information interaction, execution process, and other contents between the units in the device are based on the same concept as the method embodiment of the present invention, specific contents may refer to the description in the method embodiment of the present invention, and are not described herein again.
An embodiment of the present invention provides a readable medium, which includes an execution instruction, and when a processor of a storage controller executes the execution instruction, the storage controller executes the method for randomly extracting sample data provided in any embodiment of the present invention.
An embodiment of the present invention provides a storage controller, including: a processor, a memory, and a bus;
the processor and the memory are connected through the bus;
the memory, when the storage controller is running, the processor executes the execution instructions stored in the memory to make the storage controller execute the method for randomly drawing sample data provided in any one embodiment of the present invention.
In summary, the embodiments of the present invention have at least the following advantages:
1. in an embodiment of the present invention, by arranging each sample data of the sample data set into a sequential queue, and determining the number of samples to be extracted, then generating a random number corresponding to the current sample data at the head of the queue in the sequential queue for the formed sequential queue, and detecting whether the random number is less than the number of extractions, if the random number is not less than the number of extractions, it indicates that the current sample data at the head of the queue in the sequential queue does not meet the extraction condition in the current extraction process, the current sample data can be placed at the tail of the sequential queue, otherwise, if the random number is less than the number of extractions, the current sample data at the head of the queue can be taken out as the target sample data, the aforementioned extraction process is cyclically performed for the current sample data at the head of the queue in the sequential queue until the number of the taken-out reference sample data reaches the determined number of extractions, therefore, the sample data of corresponding quantity can be extracted from each sample data of the sample data set. In summary, according to the technical scheme provided by the embodiment of the invention, when the corresponding amount of sample data is randomly extracted from the sample data set, whether the condition that the sample data can be extracted as reference sample data is not directly related to the amount of the sample data in the sample data set is determined, and the corresponding amount of sample data can be more accurately randomly extracted from the sample data set.
2. In one embodiment of the invention, at least two weight tables are preset, each weight table corresponds to a weight coefficient and at least one piece of feature information, after target sample data is taken out, the target sample data can be analyzed to determine current feature information carried in the target sample data, the target sample data is stored in a target weight table of the at least two weight tables, and the at least one piece of target feature information corresponding to the target weight table comprises the current feature information; therefore, the user can conveniently extract the sample data with corresponding quantity from different weight tables according to different feature information and different weight coefficients in combination with actual service requirements in the subsequent process.
It is noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other similar elements in a process, method, article, or apparatus that comprises the element.
Finally, it is to be noted that: the above description is only a preferred embodiment of the present invention, and is only used to illustrate the technical solutions of the present invention, and not to limit the protection scope of the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention shall fall within the protection scope of the present invention.

Claims (4)

1. A method for randomly extracting sample data, comprising:
a0: arranging all sample data of the sample data set into a sequence queue, and determining the extraction quantity;
a1: generating a random number corresponding to current sample data at the head of the sequence queue;
a2: detecting whether the random number is less than the extraction number, if so, executing A3; otherwise, a4 is executed;
a3, taking out the current sample data at the head of the queue as reference sample data, and executing A5;
a4: placing the current sample data at the head of the queue at the tail of the sequence queue, and executing A1;
a5: detecting the current number of each extracted reference sample data, and executing A1 when the current number is smaller than the extraction number;
further comprising: presetting at least two weight tables, wherein each weight table corresponds to a weight coefficient and at least one piece of characteristic information;
after the a3, further comprising:
analyzing the reference sample data to determine current characteristic information carried in the reference sample data;
storing the reference sample data to a target weight table of the at least two weight tables, wherein at least one feature information corresponding to the target weight table comprises the current feature information;
further comprising:
when the current number is not less than the extraction number, determining the screening number respectively corresponding to each weight table according to the storage number of the reference sample data respectively stored in each weight table and the weight coefficient respectively corresponding to each weight table;
and extracting target sample data with the screening quantity corresponding to the weight tables from each reference sample data stored in the weight tables for each weight table.
2. An apparatus for randomly extracting sample data, comprising:
the device comprises a preprocessing module, a random number management module, an extraction management module, a queue management module and a detection module; wherein,
the preprocessing module is used for arranging all sample data of the sample data set into a sequence queue and determining the number of the samples to be extracted;
the random number management module is used for generating a random number corresponding to the current sample data at the head of the queue in the sequence queue, detecting whether the random number is smaller than the extraction quantity, and if so, triggering the extraction management module; otherwise, triggering the queue management module;
the extraction management module is used for taking out the current sample data at the head of the queue as reference sample data under the triggering of the random number management module and triggering the detection module;
the queue management module is used for placing the current sample data at the head of the queue at the tail of the sequence queue under the triggering of the random number management module and triggering the random number management module;
the detection module is used for detecting the current number of each taken reference sample data under the triggering of the extraction management module, and triggering the random number management module when the current number is smaller than the extraction number;
further comprising: the device comprises a setting module, an analysis module and a storage processing module; wherein,
the setting module is used for presetting at least two weight tables, and each weight table corresponds to a weight coefficient and at least one piece of characteristic information respectively;
the analysis module is used for analyzing the reference sample data to determine current characteristic information carried in the reference sample data;
the storage processing module is configured to store the reference sample data to a target weight table of the at least two weight tables, where at least one piece of feature information corresponding to the target weight table includes the current feature information;
further comprising: the device comprises a quantity determining module and a screening and extracting module; wherein,
the number determining module is configured to determine, when the current number is not less than the extracted number, a filtering number corresponding to each weight table according to a storage number of reference sample data stored in each weight table and a weight coefficient corresponding to each weight table;
and the screening extraction module is used for extracting target sample data corresponding to the screening quantity of the weight tables from each reference sample data stored in the weight tables aiming at each weight table.
3. A readable medium comprising executable instructions that, when executed by a processor of a storage controller, cause the storage controller to perform the method of claim 1.
4. A storage controller, comprising: a processor, a memory, and a bus;
the processor and the memory are connected through the bus;
the memory, the processor executing execution instructions stored by the memory to cause the storage controller to perform the method of claim 1 when the storage controller is running.
CN201710959595.7A 2017-10-16 2017-10-16 Method, device, readable medium and storage controller for randomly extracting sample data Active CN107766486B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710959595.7A CN107766486B (en) 2017-10-16 2017-10-16 Method, device, readable medium and storage controller for randomly extracting sample data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710959595.7A CN107766486B (en) 2017-10-16 2017-10-16 Method, device, readable medium and storage controller for randomly extracting sample data

Publications (2)

Publication Number Publication Date
CN107766486A CN107766486A (en) 2018-03-06
CN107766486B true CN107766486B (en) 2021-04-20

Family

ID=61269610

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710959595.7A Active CN107766486B (en) 2017-10-16 2017-10-16 Method, device, readable medium and storage controller for randomly extracting sample data

Country Status (1)

Country Link
CN (1) CN107766486B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108984700B (en) * 2018-07-05 2021-07-27 腾讯科技(深圳)有限公司 Data processing method and device, computer equipment and storage medium
CN111176611B (en) * 2019-12-31 2023-10-31 深圳远征技术有限公司 Method and device for generating random data set

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004185170A (en) * 2002-12-02 2004-07-02 Sony Corp Random number data forming device
CN101159673A (en) * 2007-11-01 2008-04-09 杭州华三通信技术有限公司 Arbitrary sampling method and apparatus
JP2009110400A (en) * 2007-10-31 2009-05-21 Toppan Printing Co Ltd Random number generating device, random number generating method, and program thereof
CN104424331A (en) * 2013-09-10 2015-03-18 深圳市腾讯计算机系统有限公司 Data sampling method and device
CN104518880A (en) * 2014-12-17 2015-04-15 中国船舶重工集团公司第七0九研究所 Big data reliability validation method and system based on random sampling detection
CN104881475A (en) * 2015-06-02 2015-09-02 北京京东尚科信息技术有限公司 Method and system for randomly sampling big data
CN105589683A (en) * 2014-10-22 2016-05-18 腾讯科技(深圳)有限公司 Sample extraction method and apparatus
CN106548196A (en) * 2016-10-20 2017-03-29 中国科学院深圳先进技术研究院 A kind of random forest sampling approach and device for non-equilibrium data
CN106570060A (en) * 2016-09-30 2017-04-19 微梦创科网络科技(中国)有限公司 Data random extraction method and apparatus in information flow

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004185170A (en) * 2002-12-02 2004-07-02 Sony Corp Random number data forming device
JP2009110400A (en) * 2007-10-31 2009-05-21 Toppan Printing Co Ltd Random number generating device, random number generating method, and program thereof
CN101159673A (en) * 2007-11-01 2008-04-09 杭州华三通信技术有限公司 Arbitrary sampling method and apparatus
CN104424331A (en) * 2013-09-10 2015-03-18 深圳市腾讯计算机系统有限公司 Data sampling method and device
CN105589683A (en) * 2014-10-22 2016-05-18 腾讯科技(深圳)有限公司 Sample extraction method and apparatus
CN104518880A (en) * 2014-12-17 2015-04-15 中国船舶重工集团公司第七0九研究所 Big data reliability validation method and system based on random sampling detection
CN104881475A (en) * 2015-06-02 2015-09-02 北京京东尚科信息技术有限公司 Method and system for randomly sampling big data
CN106570060A (en) * 2016-09-30 2017-04-19 微梦创科网络科技(中国)有限公司 Data random extraction method and apparatus in information flow
CN106548196A (en) * 2016-10-20 2017-03-29 中国科学院深圳先进技术研究院 A kind of random forest sampling approach and device for non-equilibrium data

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
《基于分布式系统的大数据随机抽样算法的实现》;王磐等;《电脑知识与技术》;20161231;全文 *

Also Published As

Publication number Publication date
CN107766486A (en) 2018-03-06

Similar Documents

Publication Publication Date Title
CN106980573B (en) Method, device and system for constructing test case request object
US10216848B2 (en) Method and system for recommending cloud websites based on terminal access statistics
US11221904B2 (en) Log analysis system, log analysis method, and log analysis program
US11108787B1 (en) Securing a network device by forecasting an attack event using a recurrent neural network
CN110647896B (en) Phishing page identification method based on logo image and related equipment
CN109409964B (en) Method and device for identifying high-quality brand
CN109063482B (en) Macro virus identification method, macro virus identification device, storage medium and processor
CN107766486B (en) Method, device, readable medium and storage controller for randomly extracting sample data
CN110991171A (en) Sensitive word detection method and device
CN104636319A (en) Text duplicate removal method and device
CN108846117A (en) The duplicate removal screening technique and device of business news flash
CN107632972B (en) Form processing method and device
CN110647895B (en) Phishing page identification method based on login box image and related equipment
CN111914257A (en) Document detection method, device, equipment and computer storage medium
CN106997350B (en) Data processing method and device
CN108182363B (en) Detection method, system and storage medium of embedded office document
CN113688240A (en) Threat element extraction method, device, equipment and storage medium
CN107743087A (en) The detection method and system of a kind of e-mail attack
CN109670153A (en) A kind of determination method, apparatus, storage medium and the terminal of similar model
CN108287831B (en) URL classification method and system and data processing method and system
WO2015024457A1 (en) Method and device for obtaining virus signatures cross-reference to related applications
CN106407218B (en) Navigation webpage detection method and device
CN105589683B (en) Sample extraction method and device
CN104794397B (en) Virus detection method and device
CN109391626B (en) Method and related device for judging whether network attack result is unsuccessful

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20210329

Address after: No. 1036, Shandong high tech Zone wave road, Ji'nan, Shandong

Applicant after: INSPUR GENERAL SOFTWARE Co.,Ltd.

Address before: 250100 No. 2877 Kehang Road, Sun Village Town, Jinan High-tech District, Shandong Province

Applicant before: SHANDONG INSPUR GENESOFT INFORMATION TECHNOLOGY Co.,Ltd.

GR01 Patent grant
GR01 Patent grant