CN107766486B - Method, device, readable medium and storage controller for randomly extracting sample data - Google Patents
Method, device, readable medium and storage controller for randomly extracting sample data Download PDFInfo
- Publication number
- CN107766486B CN107766486B CN201710959595.7A CN201710959595A CN107766486B CN 107766486 B CN107766486 B CN 107766486B CN 201710959595 A CN201710959595 A CN 201710959595A CN 107766486 B CN107766486 B CN 107766486B
- Authority
- CN
- China
- Prior art keywords
- sample data
- current
- queue
- weight
- module
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 37
- 239000000523 sample Substances 0.000 claims abstract description 158
- 239000013074 reference sample Substances 0.000 claims abstract description 77
- 238000000605 extraction Methods 0.000 claims abstract description 63
- 238000012216 screening Methods 0.000 claims description 18
- 238000001514 detection method Methods 0.000 claims description 8
- 238000007781 pre-processing Methods 0.000 claims description 6
- 238000001914 filtration Methods 0.000 claims description 3
- 230000008569 process Effects 0.000 description 6
- 239000002184 metal Substances 0.000 description 4
- 230000009471 action Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
Images
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention provides a method, a device, a readable medium and a storage controller for randomly extracting sample data, wherein the method comprises the following steps: a0: arranging all sample data of the sample data set into a sequence queue, and determining the extraction quantity; a1: generating a random number corresponding to current sample data at the head of the sequence queue; a2: detecting whether the random number is less than the extraction number, if so, executing A3; otherwise, a4 is executed; a3, taking out the current sample data at the head of the queue as reference sample data, and executing A5; a4: placing the current sample data at the head of the queue at the tail of the sequence queue, and executing A1; a5: and detecting the current number of each extracted reference sample data, and executing A1 when the current number is less than the extraction number. By the technical scheme of the invention, the sample data with the corresponding quantity can be more accurately and randomly extracted from the sample data set.
Description
Technical Field
The present invention relates to the field of computer technologies, and in particular, to a method and an apparatus for randomly extracting sample data, a readable medium, and a storage controller.
Background
The application scene of randomly extracting the sample data is very wide. Specifically, when the sample data set is large, a small amount of sample data can be randomly extracted from a large amount of sample data in the sample data set for analysis so as to realize a corresponding service.
At present, when m sample data are randomly extracted from n sample data of a sample data set, n sample data of the sample data set may be arranged into a sequence queue, then m positive integers smaller than n are generated according to actual requirements, and then m sample data with the same sequence position as each positive integer in the sequence queue are taken out, that is, m sample data are randomly extracted from n sample data of the sample data set.
In the above technical solution, the sample data in the sample data set may not be accurately randomly extracted due to the too large or too small amount of sample data in the sample data set.
Disclosure of Invention
The embodiment of the invention provides a method, a device, a readable medium and a storage controller for randomly extracting sample data, which can more accurately randomly extract a corresponding amount of sample data from a sample data set.
In a first aspect, the present invention provides a method for randomly extracting sample data, including:
a0: arranging all sample data of the sample data set into a sequence queue, and determining the extraction quantity;
a1: generating a random number corresponding to current sample data at the head of the sequence queue;
a2: detecting whether the random number is less than the extraction number, if so, executing A3; otherwise, a4 is executed;
a3, taking out the current sample data at the head of the queue as reference sample data, and executing A5;
a4: placing the current sample data at the head of the queue at the tail of the sequence queue, and executing A1;
a5: and detecting the current number of each extracted reference sample data, and executing A1 when the current number is less than the extraction number.
Preferably, the first and second electrodes are formed of a metal,
further comprising: presetting at least two weight tables, wherein each weight table corresponds to a weight coefficient and at least one piece of characteristic information;
after the a3, further comprising:
analyzing the reference sample data to determine current characteristic information carried in the reference sample data;
and storing the reference sample data to a target weight table in the at least two weight tables, wherein at least one piece of target characteristic information corresponding to the target weight table comprises the current characteristic information.
Preferably, the first and second electrodes are formed of a metal,
further comprising:
when the current number is not less than the extraction number, determining the screening number respectively corresponding to each weight table according to the storage number of the reference sample data respectively stored in each weight table and the weight coefficient respectively corresponding to each weight table;
and extracting target sample data with the target screening quantity corresponding to the weight table from each reference sample data stored in the weight table aiming at each weight table.
In a second aspect, an embodiment of the present invention provides an apparatus for randomly extracting sample data, including:
the device comprises a preprocessing module, a random number management module, an extraction management module, a queue management module and a detection module; wherein,
the preprocessing module is used for arranging all sample data of the sample data set into a sequence queue and determining the number of the samples to be extracted;
the random number management module is used for generating a random number corresponding to the current sample data at the head of the queue in the sequence queue, detecting whether the random number is smaller than the extraction quantity, and if so, triggering the extraction management module; otherwise, triggering the queue management module;
the extraction management module is used for taking out the current sample data at the head of the queue as reference sample data under the triggering of the random number management module and triggering the detection module;
the queue management module is used for placing the current sample data at the head of the queue at the tail of the sequence queue under the triggering of the random number management module and triggering the random number management module;
the detection module is configured to detect the current number of each of the reference sample data taken out under the trigger of the extraction management module, and trigger the random number management module when the current number is smaller than the extraction number.
Preferably, the first and second electrodes are formed of a metal,
further comprising: the device comprises a setting module, an analysis module and a storage processing module; wherein,
the setting module is used for presetting at least two weight tables, and each weight table corresponds to a weight coefficient and at least one piece of characteristic information respectively;
the analysis module is used for analyzing the reference sample data to determine current characteristic information carried in the reference sample data;
the storage processing module is configured to store the reference sample data to a target weight table of the at least two weight tables, where at least one piece of target feature information corresponding to the target weight table includes the current feature information.
Preferably, the first and second electrodes are formed of a metal,
further comprising: the device comprises a quantity determining module and a screening and extracting module; wherein,
the number determining module is configured to determine, when the current number is not less than the extracted number, a filtering number corresponding to each weight table according to a storage number of reference sample data stored in each weight table and a weight coefficient corresponding to each weight table;
and the screening and extracting module is used for extracting target screening quantity target sample data corresponding to the weight tables from each reference sample data stored in the weight tables aiming at each weight table.
In a third aspect, an embodiment of the present invention provides a readable medium, which is characterized by including an execution instruction, and when a processor of a storage controller executes the execution instruction, the storage controller executes the method according to any one of the first aspect.
In a fourth aspect, an embodiment of the present invention provides a storage controller, including: a processor, a memory, and a bus;
the processor and the memory are connected through the bus;
the memory, when the storage controller is running, the processor executes the execution instructions stored by the memory to cause the storage controller to perform the method of any one of the first aspect.
The embodiment of the invention provides a method, a device, a readable medium and a storage controller for randomly extracting sample data, wherein in the method, each sample data of a sample data set is arranged into a sequence queue, the extraction quantity of the sample data needing to be extracted is determined, then a random number corresponding to the current sample data at the head of the queue in the sequence queue can be generated aiming at the formed sequence queue, whether the random number is less than the extraction quantity or not is detected, when the random number is not less than the extraction quantity, the current sample data at the head of the queue in the sequence queue does not accord with the extraction condition in the extraction process, the current sample data can be placed at the tail of the sequence queue, otherwise, when the random number is less than the extraction quantity, the current sample data at the head of the queue can be taken out as target sample data, the extraction process is circularly executed aiming at the current sample data at the head of the queue in the sequence queue, until the number of the extracted reference sample data reaches the determined extraction number, the sample data with the corresponding number is extracted from each sample data of the sample data set. In summary, according to the technical scheme provided by the embodiment of the invention, when the corresponding amount of sample data is randomly extracted from the sample data set, whether the condition that the sample data can be extracted as reference sample data is not directly related to the amount of the sample data in the sample data set is determined, and the corresponding amount of sample data can be more accurately randomly extracted from the sample data set.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.
Fig. 1 is a flowchart of a method for randomly extracting sample data according to an embodiment of the present invention;
FIG. 2 is a flow chart of another method for randomly sampling sample data according to an embodiment of the present invention;
fig. 3 is a schematic structural diagram of an apparatus for randomly extracting sample data according to an embodiment of the present invention;
fig. 4 is a schematic structural diagram of another apparatus for randomly extracting sample data according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer and more complete, the technical solutions in the embodiments of the present invention will be described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention, and based on the embodiments of the present invention, all other embodiments obtained by a person of ordinary skill in the art without creative efforts belong to the scope of the present invention.
As shown in fig. 1, an embodiment of the present invention provides a method for randomly extracting sample data, including:
a0: arranging all sample data of the sample data set into a sequence queue, and determining the extraction quantity;
a1: generating a random number corresponding to current sample data at the head of the sequence queue;
a2: detecting whether the random number is less than the extraction number, if so, executing A3; otherwise, a4 is executed;
a3, taking out the current sample data at the head of the queue as reference sample data, and executing A5;
a4: placing the current sample data at the head of the queue at the tail of the sequence queue, and executing A1;
a5: and detecting the current number of each extracted reference sample data, and executing A1 when the current number is less than the extraction number.
In the above embodiment of the present invention, by arranging each sample data of the sample data set into a sequential queue, and determining the number of samples to be extracted, then generating a random number corresponding to the current sample data at the head of the queue in the sequential queue for the formed sequential queue, and detecting whether the random number is smaller than the number of extractions, if the random number is not smaller than the number of extractions, it indicates that the current sample data at the head of the queue in the sequential queue does not meet the extraction condition in the current extraction process, the current sample data can be placed at the tail of the sequential queue, otherwise, if the random number is smaller than the number of extractions, the current sample data at the head of the queue can be taken out as the target sample data, the aforementioned extraction process is cyclically performed for the current sample data at the head of the queue in the sequential queue until the number of the taken-out reference sample data reaches the determined number of extractions, therefore, the sample data of corresponding quantity can be extracted from each sample data of the sample data set. In summary, according to the technical scheme provided by the embodiment of the invention, when the corresponding amount of sample data is randomly extracted from the sample data set, whether the condition that the sample data can be extracted as reference sample data is not directly related to the amount of the sample data in the sample data set is determined, and the corresponding amount of sample data can be more accurately randomly extracted from the sample data set.
In one embodiment of the present invention, the method further comprises: presetting at least two weight tables, wherein each weight table corresponds to a weight coefficient and at least one piece of characteristic information;
after the a3, further comprising:
analyzing the reference sample data to determine current characteristic information carried in the reference sample data;
and storing the reference sample data to a target weight table in the at least two weight tables, wherein at least one piece of target characteristic information corresponding to the target weight table comprises the current characteristic information.
In the above embodiment of the present invention, at least two weight tables are preset, each weight table corresponds to one weight coefficient and at least one feature information, after the target sample data is taken out, the target sample data can be analyzed to determine the current feature information carried in the target sample data, and the target sample data is stored in the target weight table of the at least two weight tables, where at least one target feature information corresponding to the target weight table includes the current feature information; therefore, the user can conveniently extract the sample data with corresponding quantity from different weight tables according to different feature information and different weight coefficients in combination with actual service requirements in the subsequent process.
Specifically, in an embodiment of the present invention, the method further includes: when the current number is not less than the extraction number, determining the screening number respectively corresponding to each weight table according to the storage number of the reference sample data respectively stored in each weight table and the weight coefficient respectively corresponding to each weight table; and extracting target sample data with the target screening quantity corresponding to the weight table from each reference sample data stored in the weight table aiming at each weight table.
In order to more clearly illustrate the technical solution and advantages of the present invention, for example, a set number of reference sample data are randomly extracted from a sample data set, each reference sample data is respectively stored in a corresponding weight table according to feature information carried in each reference sample data, and then a corresponding number of target sample data are respectively extracted from each weight table, as shown in fig. 2, the following steps may be specifically included:
In steps 201 to 202, the user may set the number of weight tables, the weight coefficient corresponding to each weight table, and at least one piece of feature information according to the actual service requirement.
For example, taking the example that each sample data in the sample data set is the business information of each enterprise registered in the east city and the west city of beijing city, and the business information of each enterprise in the east city needs to be extracted with emphasis, two weight tables A, B may be set, where at least one feature information corresponding to the weight table a includes the east city, the weight coefficient corresponding to the weight table a is 0.6, at least one feature information corresponding to the weight table B includes the west city, and the weight coefficient corresponding to the weight table B is 0.4.
It should be understood that, a corresponding black list may be further configured to store sample data that does not comply with the corresponding rule, for example, when the extracted reference sample data does not include feature information "eastern city region" or "western city region", the reference sample data may be stored in the black list.
And step 205, placing the current sample data at the head of the queue at the tail of the sequence queue.
Here, after step 205 is performed, step 203 may be performed again.
And step 206, taking out the current sample data at the head of the queue as reference sample data.
And at least one piece of target characteristic information corresponding to the target weight table comprises current characteristic information.
For example, when the extracted reference sample data is analyzed in step 207 to determine that the current feature information carried by the reference sample data is "eastern city area", the reference sample data may be stored in the weight table a, when the extracted reference sample data is analyzed in step 207 to determine that the current feature information carried by the reference sample data is "western city area", the reference sample data may be stored in the weight table B, and when the extracted reference sample data is analyzed in step 207 to determine that the current feature information carried by the reference sample data is neither "eastern city area" nor "western city area", the reference sample data may be stored in the blacklist.
For example, when the number of extractions is 400, the number of reference sample data stored in the weight table a is 200, and the number of reference sample data stored in the weight table B is 200, it may be determined that the number of filters corresponding to the weight table a is 120, which is a product of the number of stores 200 and the corresponding weight coefficient 0.6, and similarly, it may be determined that the number of filters corresponding to the weight table B is 80, which is a product of the number of stores 200 and the corresponding weight coefficient 0.4.
It is understood that, when the product of the storage quantity of the reference sample data stored in a certain weight table and the corresponding weight coefficient is not an integer, the product can be rounded by a rounding method, and the rounded numerical value is used as the screening quantity corresponding to the weight table.
Obviously, 120 target sample data can be randomly extracted from 200 reference sample data stored in the weight value table a by a method similar to the steps 202 to 206; 80 target sample data are randomly extracted from 200 reference sample data stored in the weight value table B by a method similar to the steps 202 to 206.
As shown in fig. 3, an embodiment of the present invention provides an apparatus for randomly extracting sample data, including:
a preprocessing module 301, a random number management module 302, an extraction management module 303, a queue management module 304 and a detection module 305; wherein,
the preprocessing module 301 is configured to arrange sample data of the sample data set into a sequence queue and determine an extraction number;
the random number management module 302 is configured to generate a random number corresponding to current sample data at the head of the queue in the sequential queue, detect whether the random number is smaller than the extraction number, and trigger the extraction management module 303 if the random number is smaller than the extraction number; otherwise, triggering the queue management module 304;
the extraction management module 303 is configured to take out the current sample data at the head of the queue as reference sample data under the triggering of the random number management module 302, and trigger the detection module 305;
the queue management module 304 is configured to place the current sample data at the head of the queue at the tail of the sequential queue under the trigger of the random number management module, and trigger the random number management module 302;
the detecting module 305 is configured to detect the current number of each extracted reference sample data under the trigger of the extraction management module, and trigger the random number management module 302 when the current number is smaller than the extraction number.
As shown in fig. 4, in an embodiment of the present invention, the method further includes: a setting module 401, an analysis module and a 402 storage processing module 403; wherein,
the setting module 401 is configured to preset at least two weight tables, where each weight table corresponds to a weight coefficient and at least one piece of feature information;
the analyzing module 402 is configured to analyze the reference sample data to determine current feature information carried in the reference sample data;
the storage processing module 403 is configured to store the reference sample data in a target weight table of the at least two weight tables, where at least one piece of target feature information corresponding to the target weight table includes the current feature information.
Based on the embodiment shown in fig. 4, in an embodiment of the present invention, the method further includes: a quantity determination module (not shown in the drawings) and a screening extraction module (not shown in the drawings); wherein,
the number determining module is configured to determine, when the current number is not less than the extracted number, a filtering number corresponding to each weight table according to a storage number of reference sample data stored in each weight table and a weight coefficient corresponding to each weight table;
and the screening and extracting module is used for extracting target screening quantity target sample data corresponding to the weight tables from each reference sample data stored in the weight tables aiming at each weight table.
Because the information interaction, execution process, and other contents between the units in the device are based on the same concept as the method embodiment of the present invention, specific contents may refer to the description in the method embodiment of the present invention, and are not described herein again.
An embodiment of the present invention provides a readable medium, which includes an execution instruction, and when a processor of a storage controller executes the execution instruction, the storage controller executes the method for randomly extracting sample data provided in any embodiment of the present invention.
An embodiment of the present invention provides a storage controller, including: a processor, a memory, and a bus;
the processor and the memory are connected through the bus;
the memory, when the storage controller is running, the processor executes the execution instructions stored in the memory to make the storage controller execute the method for randomly drawing sample data provided in any one embodiment of the present invention.
In summary, the embodiments of the present invention have at least the following advantages:
1. in an embodiment of the present invention, by arranging each sample data of the sample data set into a sequential queue, and determining the number of samples to be extracted, then generating a random number corresponding to the current sample data at the head of the queue in the sequential queue for the formed sequential queue, and detecting whether the random number is less than the number of extractions, if the random number is not less than the number of extractions, it indicates that the current sample data at the head of the queue in the sequential queue does not meet the extraction condition in the current extraction process, the current sample data can be placed at the tail of the sequential queue, otherwise, if the random number is less than the number of extractions, the current sample data at the head of the queue can be taken out as the target sample data, the aforementioned extraction process is cyclically performed for the current sample data at the head of the queue in the sequential queue until the number of the taken-out reference sample data reaches the determined number of extractions, therefore, the sample data of corresponding quantity can be extracted from each sample data of the sample data set. In summary, according to the technical scheme provided by the embodiment of the invention, when the corresponding amount of sample data is randomly extracted from the sample data set, whether the condition that the sample data can be extracted as reference sample data is not directly related to the amount of the sample data in the sample data set is determined, and the corresponding amount of sample data can be more accurately randomly extracted from the sample data set.
2. In one embodiment of the invention, at least two weight tables are preset, each weight table corresponds to a weight coefficient and at least one piece of feature information, after target sample data is taken out, the target sample data can be analyzed to determine current feature information carried in the target sample data, the target sample data is stored in a target weight table of the at least two weight tables, and the at least one piece of target feature information corresponding to the target weight table comprises the current feature information; therefore, the user can conveniently extract the sample data with corresponding quantity from different weight tables according to different feature information and different weight coefficients in combination with actual service requirements in the subsequent process.
It is noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other similar elements in a process, method, article, or apparatus that comprises the element.
Finally, it is to be noted that: the above description is only a preferred embodiment of the present invention, and is only used to illustrate the technical solutions of the present invention, and not to limit the protection scope of the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention shall fall within the protection scope of the present invention.
Claims (4)
1. A method for randomly extracting sample data, comprising:
a0: arranging all sample data of the sample data set into a sequence queue, and determining the extraction quantity;
a1: generating a random number corresponding to current sample data at the head of the sequence queue;
a2: detecting whether the random number is less than the extraction number, if so, executing A3; otherwise, a4 is executed;
a3, taking out the current sample data at the head of the queue as reference sample data, and executing A5;
a4: placing the current sample data at the head of the queue at the tail of the sequence queue, and executing A1;
a5: detecting the current number of each extracted reference sample data, and executing A1 when the current number is smaller than the extraction number;
further comprising: presetting at least two weight tables, wherein each weight table corresponds to a weight coefficient and at least one piece of characteristic information;
after the a3, further comprising:
analyzing the reference sample data to determine current characteristic information carried in the reference sample data;
storing the reference sample data to a target weight table of the at least two weight tables, wherein at least one feature information corresponding to the target weight table comprises the current feature information;
further comprising:
when the current number is not less than the extraction number, determining the screening number respectively corresponding to each weight table according to the storage number of the reference sample data respectively stored in each weight table and the weight coefficient respectively corresponding to each weight table;
and extracting target sample data with the screening quantity corresponding to the weight tables from each reference sample data stored in the weight tables for each weight table.
2. An apparatus for randomly extracting sample data, comprising:
the device comprises a preprocessing module, a random number management module, an extraction management module, a queue management module and a detection module; wherein,
the preprocessing module is used for arranging all sample data of the sample data set into a sequence queue and determining the number of the samples to be extracted;
the random number management module is used for generating a random number corresponding to the current sample data at the head of the queue in the sequence queue, detecting whether the random number is smaller than the extraction quantity, and if so, triggering the extraction management module; otherwise, triggering the queue management module;
the extraction management module is used for taking out the current sample data at the head of the queue as reference sample data under the triggering of the random number management module and triggering the detection module;
the queue management module is used for placing the current sample data at the head of the queue at the tail of the sequence queue under the triggering of the random number management module and triggering the random number management module;
the detection module is used for detecting the current number of each taken reference sample data under the triggering of the extraction management module, and triggering the random number management module when the current number is smaller than the extraction number;
further comprising: the device comprises a setting module, an analysis module and a storage processing module; wherein,
the setting module is used for presetting at least two weight tables, and each weight table corresponds to a weight coefficient and at least one piece of characteristic information respectively;
the analysis module is used for analyzing the reference sample data to determine current characteristic information carried in the reference sample data;
the storage processing module is configured to store the reference sample data to a target weight table of the at least two weight tables, where at least one piece of feature information corresponding to the target weight table includes the current feature information;
further comprising: the device comprises a quantity determining module and a screening and extracting module; wherein,
the number determining module is configured to determine, when the current number is not less than the extracted number, a filtering number corresponding to each weight table according to a storage number of reference sample data stored in each weight table and a weight coefficient corresponding to each weight table;
and the screening extraction module is used for extracting target sample data corresponding to the screening quantity of the weight tables from each reference sample data stored in the weight tables aiming at each weight table.
3. A readable medium comprising executable instructions that, when executed by a processor of a storage controller, cause the storage controller to perform the method of claim 1.
4. A storage controller, comprising: a processor, a memory, and a bus;
the processor and the memory are connected through the bus;
the memory, the processor executing execution instructions stored by the memory to cause the storage controller to perform the method of claim 1 when the storage controller is running.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710959595.7A CN107766486B (en) | 2017-10-16 | 2017-10-16 | Method, device, readable medium and storage controller for randomly extracting sample data |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710959595.7A CN107766486B (en) | 2017-10-16 | 2017-10-16 | Method, device, readable medium and storage controller for randomly extracting sample data |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107766486A CN107766486A (en) | 2018-03-06 |
CN107766486B true CN107766486B (en) | 2021-04-20 |
Family
ID=61269610
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710959595.7A Active CN107766486B (en) | 2017-10-16 | 2017-10-16 | Method, device, readable medium and storage controller for randomly extracting sample data |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107766486B (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108984700B (en) * | 2018-07-05 | 2021-07-27 | 腾讯科技(深圳)有限公司 | Data processing method and device, computer equipment and storage medium |
CN111176611B (en) * | 2019-12-31 | 2023-10-31 | 深圳远征技术有限公司 | Method and device for generating random data set |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2004185170A (en) * | 2002-12-02 | 2004-07-02 | Sony Corp | Random number data forming device |
CN101159673A (en) * | 2007-11-01 | 2008-04-09 | 杭州华三通信技术有限公司 | Arbitrary sampling method and apparatus |
JP2009110400A (en) * | 2007-10-31 | 2009-05-21 | Toppan Printing Co Ltd | Random number generating device, random number generating method, and program thereof |
CN104424331A (en) * | 2013-09-10 | 2015-03-18 | 深圳市腾讯计算机系统有限公司 | Data sampling method and device |
CN104518880A (en) * | 2014-12-17 | 2015-04-15 | 中国船舶重工集团公司第七0九研究所 | Big data reliability validation method and system based on random sampling detection |
CN104881475A (en) * | 2015-06-02 | 2015-09-02 | 北京京东尚科信息技术有限公司 | Method and system for randomly sampling big data |
CN105589683A (en) * | 2014-10-22 | 2016-05-18 | 腾讯科技(深圳)有限公司 | Sample extraction method and apparatus |
CN106548196A (en) * | 2016-10-20 | 2017-03-29 | 中国科学院深圳先进技术研究院 | A kind of random forest sampling approach and device for non-equilibrium data |
CN106570060A (en) * | 2016-09-30 | 2017-04-19 | 微梦创科网络科技(中国)有限公司 | Data random extraction method and apparatus in information flow |
-
2017
- 2017-10-16 CN CN201710959595.7A patent/CN107766486B/en active Active
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2004185170A (en) * | 2002-12-02 | 2004-07-02 | Sony Corp | Random number data forming device |
JP2009110400A (en) * | 2007-10-31 | 2009-05-21 | Toppan Printing Co Ltd | Random number generating device, random number generating method, and program thereof |
CN101159673A (en) * | 2007-11-01 | 2008-04-09 | 杭州华三通信技术有限公司 | Arbitrary sampling method and apparatus |
CN104424331A (en) * | 2013-09-10 | 2015-03-18 | 深圳市腾讯计算机系统有限公司 | Data sampling method and device |
CN105589683A (en) * | 2014-10-22 | 2016-05-18 | 腾讯科技(深圳)有限公司 | Sample extraction method and apparatus |
CN104518880A (en) * | 2014-12-17 | 2015-04-15 | 中国船舶重工集团公司第七0九研究所 | Big data reliability validation method and system based on random sampling detection |
CN104881475A (en) * | 2015-06-02 | 2015-09-02 | 北京京东尚科信息技术有限公司 | Method and system for randomly sampling big data |
CN106570060A (en) * | 2016-09-30 | 2017-04-19 | 微梦创科网络科技(中国)有限公司 | Data random extraction method and apparatus in information flow |
CN106548196A (en) * | 2016-10-20 | 2017-03-29 | 中国科学院深圳先进技术研究院 | A kind of random forest sampling approach and device for non-equilibrium data |
Non-Patent Citations (1)
Title |
---|
《基于分布式系统的大数据随机抽样算法的实现》;王磐等;《电脑知识与技术》;20161231;全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN107766486A (en) | 2018-03-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106980573B (en) | Method, device and system for constructing test case request object | |
US10216848B2 (en) | Method and system for recommending cloud websites based on terminal access statistics | |
US11221904B2 (en) | Log analysis system, log analysis method, and log analysis program | |
US11108787B1 (en) | Securing a network device by forecasting an attack event using a recurrent neural network | |
CN110647896B (en) | Phishing page identification method based on logo image and related equipment | |
CN109409964B (en) | Method and device for identifying high-quality brand | |
CN109063482B (en) | Macro virus identification method, macro virus identification device, storage medium and processor | |
CN107766486B (en) | Method, device, readable medium and storage controller for randomly extracting sample data | |
CN110991171A (en) | Sensitive word detection method and device | |
CN104636319A (en) | Text duplicate removal method and device | |
CN108846117A (en) | The duplicate removal screening technique and device of business news flash | |
CN107632972B (en) | Form processing method and device | |
CN110647895B (en) | Phishing page identification method based on login box image and related equipment | |
CN111914257A (en) | Document detection method, device, equipment and computer storage medium | |
CN106997350B (en) | Data processing method and device | |
CN108182363B (en) | Detection method, system and storage medium of embedded office document | |
CN113688240A (en) | Threat element extraction method, device, equipment and storage medium | |
CN107743087A (en) | The detection method and system of a kind of e-mail attack | |
CN109670153A (en) | A kind of determination method, apparatus, storage medium and the terminal of similar model | |
CN108287831B (en) | URL classification method and system and data processing method and system | |
WO2015024457A1 (en) | Method and device for obtaining virus signatures cross-reference to related applications | |
CN106407218B (en) | Navigation webpage detection method and device | |
CN105589683B (en) | Sample extraction method and device | |
CN104794397B (en) | Virus detection method and device | |
CN109391626B (en) | Method and related device for judging whether network attack result is unsuccessful |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20210329 Address after: No. 1036, Shandong high tech Zone wave road, Ji'nan, Shandong Applicant after: INSPUR GENERAL SOFTWARE Co.,Ltd. Address before: 250100 No. 2877 Kehang Road, Sun Village Town, Jinan High-tech District, Shandong Province Applicant before: SHANDONG INSPUR GENESOFT INFORMATION TECHNOLOGY Co.,Ltd. |
|
GR01 | Patent grant | ||
GR01 | Patent grant |