WO2008015489A1 - Classification using support vector machines and variables selection - Google Patents
Classification using support vector machines and variables selection Download PDFInfo
- Publication number
- WO2008015489A1 WO2008015489A1 PCT/IB2006/002398 IB2006002398W WO2008015489A1 WO 2008015489 A1 WO2008015489 A1 WO 2008015489A1 IB 2006002398 W IB2006002398 W IB 2006002398W WO 2008015489 A1 WO2008015489 A1 WO 2008015489A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- variables
- subset
- classifier
- data
- scaling
- Prior art date
Links
- 238000012706 support-vector machine Methods 0.000 title description 19
- 238000000034 method Methods 0.000 claims abstract description 44
- 239000013598 vector Substances 0.000 claims description 20
- 230000003247 decreasing effect Effects 0.000 claims 2
- 101100493820 Caenorhabditis elegans best-1 gene Proteins 0.000 claims 1
- 238000004590 computer program Methods 0.000 claims 1
- 230000001419 dependent effect Effects 0.000 claims 1
- 238000004519 manufacturing process Methods 0.000 claims 1
- 238000010998 test method Methods 0.000 claims 1
- 238000005259 measurement Methods 0.000 description 9
- 238000012360 testing method Methods 0.000 description 6
- 230000000875 corresponding effect Effects 0.000 description 4
- 238000010606 normalization Methods 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 238000003491 array Methods 0.000 description 2
- 230000002596 correlated effect Effects 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 230000032683 aging Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 239000006185 dispersion Substances 0.000 description 1
- 239000000428 dust Substances 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 238000003909 pattern recognition Methods 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
- G06N20/10—Machine learning using kernel methods, e.g. support vector machines [SVM]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/211—Selection of the most significant subset of features
- G06F18/2115—Selection of the most significant subset of features by evaluating different subsets according to an optimisation criterion, e.g. class separability, forward selection or backward elimination
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/213—Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
- G06F18/2132—Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods based on discrimination criteria, e.g. discriminant analysis
Definitions
- the invention relates to improvements in classification.
- the invention is especially applicable to classification of items of currency or value, such as banknotes or coins.
- the invention is related to our co-pending applications, EP 1 217 589 A and EP 1 516 293 A, the contents of which are incorporated herein by reference.
- SVM Support vector machine
- FVS Feature Vector Selection (see EP 1 516 293 A mentioned above)
- billway stands for the feeding of a bill in a validator in one orientation.
- a bill acceptor comprising transport means, sensor means, memory and processing means, scanning the document with many sensors and many wavelengths results in a large volume of multivariate data stored in a memory that can be used to discriminate and authenticate the documents, and to denominate a bill in the case of a banknote acceptor. It is desirable for this process to be fast and it is known that a subset of the data can be sufficient to achieve a better result. This application relates to finding an optimal subset of variables representing the data.
- This new denomination algorithm deals with the selection of the input data that are given to a SVM (Support Vector Machine) algorithm.
- variable selection is to eliminate irrelevant or less relevant variables for classification purposes and at the same time to keep a high separation performance.
- the idea is to find a set of variables that are highly correlated with the projected data on the separation vector obtained with the SVM when trained with all variables.
- the process can be used to generate data inputs for the bill validator to use in order to denominate bills from a pre-defined bill set comprising classes.
- the classes can be the 4 bill ways of a single denomination and/or other denominations.
- the selected variables and discriminant axis generated by the process may be loaded in the memory of the validator. They are used by the validator to later validate a new sample presented to the unit as being a member of one of the classes.
- Fig. 1 is a schematic diagram of a banknote sensing system
- Fig. 2 is a plan view from above of the sensor array of the sensing system of Fig. 1;
- Fig. 3 is a plan view from below of the light source array of the sensing system of Fig. 1;
- FIG. 4 is graph illustrating classification.
- a banknote sensing system according to an embodiment of the invention is shown schematically in Fig. 1.
- the system includes a light source array 2 arranged on one side of a banknote transport path, and a light sensor array 4 arranged on the other side of the banknote transport path, opposite the light source array 2.
- the system includes banknote transport means in the form of four sets of rollers 6 for transporting a banknote 8 along the transport path between the light source array 2 and the light sensor array 4.
- the light source array 4 is connected to a processor 10 and the system is controlled by a controller 12.
- a diffuser 14 for diffusing and mixing light emitted from the light source array 2 is arranged between the light source array 2 and the banknote transport path.
- Fig. 2 is a plan view from below of the light source array 2.
- the light source array is a linear array of a plurality of light sources 9.
- the array is arranged in groups 11 of six sources, and each source in a group emits light of a different wavelength, which are chosen as suitable for the application, usually varieties of blue and red.
- a plurality of such groups 11 are arranged linearly across the transport path, so that light sources for each wavelength are arranged across the transport path.
- Fig. 3 is a plan view from above of the light sensor array 4.
- the light sensor array includes eight circular light sensors arranged in a line across the transport path. The sensors are 7 mm in diameter and the centres are spaced 7 mm apart in a line, so that the sensors are side by side.
- Figs. 2 and 3 are not to scale, and the light source and light sensor arrays are approximately the same size.
- a banknote is transported by the rollers 6, under control of the controller 12, along the transport path between the source and sensor arrays 2,
- the banknote is transported by a predetermined distance then stopped. AU the light sources of one wavelength are operated and, after mixing of the light in the diffuser 14 to spread it uniformly over the width of the banknote, the light impinges on the banknote. Light transmitted through the banknote is sensed by the sensor array 4, and signals are derived from the sensors for each measurement spot on the banknote corresponding to each sensor. Similarly, the light sources of all the other wavelengths are similarly operated in succession, with measurements being derived for the sensors for each wavelength, for the corresponding line.
- rollers 6 are activated to move the banknote again by the predetermined distance and the sequence of illuminating the banknote and taking measurements for each wavelength for each sensor is repeated.
- the document is scanned by a linear sensor array that measures spots on the document with a pre-defined scanning resolution in width and length and for multiple wavelengths.
- a discrimination algorithm works by comparing objects using measurement vectors that have to be of a common size. As in practice the documents can have different size, if only for tolerance reasons, a scanning maximum area that is common to all sizes to denominate is defined. This common size will vary with the bill sets targeted.
- an SVM classifier works with 2 classes.
- the discrimination and the variable selection is performed using 2 classes, the first class is the reference class and the second class contains all the bills other than class 1.
- the documents used in class 2 are close to those of class 1 according to their dimension. It means that they pass the length and width test of class 1 (it is otherwise trivial to separate documents of different dimensions). Further details of SVM classifiers can be found in prior art literature.
- Test samples for the reference class and second class are measured using a sensor array as described above, and the measured values are processed as described below, to generate a SVM with a reduced set of variables.
- a variable is a given spot location and a given wavelength.
- xy,j I...N .
- the variable j is submitted to a normalisation by substracting the mean of four tracks in the associated wavelength.
- the spots used for normalisation are defined by four tracks.
- the mean of the four tracks is computed for each wavelength and stored in a vector ⁇ n ⁇ ⁇ ,m 2 ,m 3 ,m A ,m 5 ,m 6 ⁇ , for example, for six wavelengths. Then each measurement x ⁇ of the associated wavelength k is normalized by subtracting m k .
- the whole set of data is normalized before applying the variable selection algorithm.
- the normalisation above removes from each variable the global effect of the bill such as the paper or dust or aging.
- the scaling process deals with the dispersion of the whole set of bills.
- the two sets of data are scaled using the mean and the standard deviation of class 1.
- the scaling is given by:
- the problem of finding the best S variables is solved by a forward sequential selection.
- the algorithm starts with an empty set of variables and adds variables until S variables have been selected.
- a fitness criterion is evaluated for the set of variables that combines the already selected and the current variable. The one giving the maximum fitness is added to the set of selected variables.
- the forward sequential selection is just one of many selection algorithm that could be used such as backward or stepwise selection. See, for example, Fukunaga K.,"Introduction to Statistical Pattern Recognition", Academic Press, INC, 2 nd ed. 1990 for further details regarding selection algorithms.
- variable giving the best fitness is added to set of selected variables. We stop selecting variables when the fitness reaches a given value (0.998) or if we have selected a predefined number of variables .
- the banknote is sensed by the sensor array, and the measurements for the selected variables are processed by the SVM.
- the measurements are used to build a test vector, which is projected onto the discriminant vector W2.
- This results in a scalar value which can be compared with one or more thresholds. For example, if the scalar value is less than or greater than a give threshold it can be treated either as belonging to a reference class or not. Similarly, for two thresholds, if the scalar value is between two thresholds it can be treated as belong to a reference class or otherwise not.
- the processing for deriving the subset of variables and discriminant axis can be performed either in the banknote tester itself, or in a separate device, with the parameters representing the selected variables and the discriminant axis being subsequently stored in the memory of a banknote tester for later use.
- this approach gives a similar result in terms of performance to using the original number N variables, but actually using fewer variables, so that there is less processing and hence faster results. It is also possible not to norm P and use another range for the fitness criteria. The higher the value of the fitness, the better is the reconstruction of the projected data. It is convenient to fix this value to a given level smaller than 1 to avoid numerical problems, for instance 0.998 and stop the selection when this value is reached.
- Fig. 4 shows the SVM results for the denomination of a 2 specific denominations with all variables, in this case 357 (top) and 16 variables (bottom).
- all the objects of the 2" class are on one side of the reference class and the classes can be separated by a single threshold.
- the reference class can be enclosed between 2 values defining an acceptance window to cover the possibility that a foreign object could be classified on the other side of the reference class.
- the subset of variables is selected using a SVM classifier, and then the selected subset of variables is used for classification using an other SVM classifier.
- any suitable type of classification can be carried out using the relevant variables, such as LDA (linear discriminant analysis) or Mahalanobis distance, or similar, as known to the person skilled in the art.
- the representative subset is chosen to reduce the number of variables whilst maintaining adequate performance in terms of classification, so that the selected variables are representative of the original variables and the corresponding data. This can be evaluated in various ways, such as using a fitness function as described above.
- references to banknotes include other similar types of value sheets such as coupons, cheques, and includes genuine and fake examples of such documents.
- a system may involve the use of means, such as edge-detectors, for detecting the orientation, such as skew and offset of a banknote relative to, eg, the transport direction and/or the sensor array or a fixed point(s).
- a system may include means for positioning a banknote in a desired orientation, such as with the length of the bill along the transport path with edges parallel to the transport direction, or at a desired angle relative to the transport direction and/or sensor array.
- the described embodiments are banknote testers. However, the invention may also be applied to other types of currency testers, such as coin testers. For example, signals from a coin tester taking measurements of coin characteristics, such as material, at a succession of points across a coin may be interpolated to produce a signal representative of the characteristic across the coin.
- coin is employed to mean any coin (whether valid or counterfeit), token, slug, washer, or other metallic object or item, and especially any metallic object or item which could be utilised by an individual in an attempt to operate a coin-operated device or system.
- a "valid coin” is considered to be an authentic coin, token, or the like, and especially an authentic coin of a monetary system or systems in which or with which a coin-operated device or system is intended to operate and of a denomination which such coin-operated device or system is intended selectively to receive and to treat as an item of value.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Software Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Biology (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Medical Informatics (AREA)
- Inspection Of Paper Currency And Valuable Securities (AREA)
- Image Analysis (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Testing And Monitoring For Control Systems (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Description
Claims
Priority Applications (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AT06795394T ATE546792T1 (en) | 2006-07-28 | 2006-07-28 | CLASSIFICATION USING SUPPORT VECTOR MACHINES AND VARIABLE SELECTION |
US12/375,158 US8706669B2 (en) | 2006-07-28 | 2006-07-28 | Classification using support vector machines and variables selection |
JP2009521356A JP2009545049A (en) | 2006-07-28 | 2006-07-28 | Classification using support vector machines and variable selection |
PCT/IB2006/002398 WO2008015489A1 (en) | 2006-07-28 | 2006-07-28 | Classification using support vector machines and variables selection |
ES06795394T ES2381860T3 (en) | 2006-07-28 | 2006-07-28 | Classification using support vector machines and variable selection |
AU2006346894A AU2006346894B2 (en) | 2006-07-28 | 2006-07-28 | Classification using support vector machines and variables selection |
EP06795394A EP2054844B1 (en) | 2006-07-28 | 2006-07-28 | Classification using support vector machines and variables selection |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/IB2006/002398 WO2008015489A1 (en) | 2006-07-28 | 2006-07-28 | Classification using support vector machines and variables selection |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2008015489A1 true WO2008015489A1 (en) | 2008-02-07 |
Family
ID=37779272
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/IB2006/002398 WO2008015489A1 (en) | 2006-07-28 | 2006-07-28 | Classification using support vector machines and variables selection |
Country Status (7)
Country | Link |
---|---|
US (1) | US8706669B2 (en) |
EP (1) | EP2054844B1 (en) |
JP (1) | JP2009545049A (en) |
AT (1) | ATE546792T1 (en) |
AU (1) | AU2006346894B2 (en) |
ES (1) | ES2381860T3 (en) |
WO (1) | WO2008015489A1 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2010014700A1 (en) * | 2008-07-29 | 2010-02-04 | Mei, Inc. | Currency discrimination |
AT509022A3 (en) * | 2009-08-14 | 2012-06-15 | Ait Austria Inst Of Technology Gmbh | METHOD FOR QUALITY TESTING OF A DIGITAL IMAGE |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110246262A1 (en) * | 2010-04-02 | 2011-10-06 | QMedtrix Systems, Inc. | Method of classifying a bill |
EP2810257B1 (en) | 2012-02-03 | 2017-01-18 | Crane Payment Innovations, Inc. | Apparatus and method for characterizing items of currency |
CN103679263B (en) * | 2012-08-30 | 2017-03-29 | 重庆邮电大学 | Forecasting Methodology is closed on based on the thunder and lightning of particle swarm support vector machine |
US20160005045A1 (en) * | 2013-02-25 | 2016-01-07 | Mei, Inc. | System to accept an item of value |
CN105372202B (en) * | 2015-10-27 | 2019-01-29 | 九江学院 | Transgene cotton variety ecotype method |
CN106204893A (en) * | 2016-08-10 | 2016-12-07 | 恒银金融科技股份有限公司 | Paper currency detection method based on support vector machine |
CN107909011B (en) * | 2017-10-30 | 2021-08-24 | Oppo广东移动通信有限公司 | Face recognition method and related product |
CN108320374B (en) * | 2018-02-08 | 2019-12-03 | 中南大学 | A kind of multinational paper money number character identifying method based on finger image |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1217589A1 (en) * | 2000-12-15 | 2002-06-26 | Mars, Incorporated | Currency validator |
WO2004001683A1 (en) * | 2002-06-19 | 2003-12-31 | Mars Incorporated | Currency validator |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CH684222A5 (en) | 1992-03-10 | 1994-07-29 | Mars Inc | Means for classifying a pattern, particularly a banknote or a coin. |
US7232024B2 (en) * | 1996-05-29 | 2007-06-19 | Cunnins-Allison Corp. | Currency processing device |
JP2002505770A (en) | 1996-06-28 | 2002-02-19 | コインスター,インコーポレイテッド | Coin discriminating apparatus and method |
GB2344446A (en) | 1998-12-02 | 2000-06-07 | Mars Inc | Classifying currency items |
US6662170B1 (en) * | 2000-08-22 | 2003-12-09 | International Business Machines Corporation | System and method for boosting support vector machines |
EP1376484A1 (en) | 2002-06-25 | 2004-01-02 | Mars Incorporated | Method and apparatus for processing signals in testing currency items |
US8019699B2 (en) * | 2004-09-29 | 2011-09-13 | Panscient, Inc. | Machine learning system |
US20070140551A1 (en) * | 2005-12-16 | 2007-06-21 | Chao He | Banknote validation |
EP1868166A3 (en) | 2006-05-31 | 2007-12-26 | MEI, Inc. | Method and apparatus for validating banknotes |
-
2006
- 2006-07-28 JP JP2009521356A patent/JP2009545049A/en active Pending
- 2006-07-28 AU AU2006346894A patent/AU2006346894B2/en not_active Ceased
- 2006-07-28 ES ES06795394T patent/ES2381860T3/en active Active
- 2006-07-28 AT AT06795394T patent/ATE546792T1/en active
- 2006-07-28 WO PCT/IB2006/002398 patent/WO2008015489A1/en active Application Filing
- 2006-07-28 EP EP06795394A patent/EP2054844B1/en not_active Not-in-force
- 2006-07-28 US US12/375,158 patent/US8706669B2/en not_active Expired - Fee Related
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1217589A1 (en) * | 2000-12-15 | 2002-06-26 | Mars, Incorporated | Currency validator |
WO2004001683A1 (en) * | 2002-06-19 | 2003-12-31 | Mars Incorporated | Currency validator |
Non-Patent Citations (4)
Title |
---|
FUNG ET AL.: "Data Selection for Support Vector Machine Classifiers", PROCEEDINGS OF THE SIXTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2000, pages 64 - 70 |
G. BAUDAT AND F. ANOUAR: "Feature selection and projection using kernels", NEUROCOMPUTING, vol. 55, 27 June 2003 (2003-06-27), pages 21 - 38, XP002423405 * |
S. HOCHREITER AND K. OBERMAYER: "Classification, regression and feature selection on matrix data", TECHNISCHER BERICHT 2004/02, 2004, pages 1 - 59, XP002423407 * |
T. JEBARA: "Machine learning: discriminative and generative", KLUWER ACADEMIC PUBLISHERS, 2004, pages 99 - 130, XP002423406 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2010014700A1 (en) * | 2008-07-29 | 2010-02-04 | Mei, Inc. | Currency discrimination |
US8474592B2 (en) | 2008-07-29 | 2013-07-02 | Mei, Inc. | Currency discrimination |
AT509022A3 (en) * | 2009-08-14 | 2012-06-15 | Ait Austria Inst Of Technology Gmbh | METHOD FOR QUALITY TESTING OF A DIGITAL IMAGE |
AT509022B1 (en) * | 2009-08-14 | 2012-09-15 | Ait Austrian Inst Technology | METHOD FOR QUALITY TESTING OF A DIGITAL IMAGE |
DE102010032806B4 (en) * | 2009-08-14 | 2015-02-12 | Ait Austrian Institute Of Technology Gmbh | Method for quality testing a digital image |
Also Published As
Publication number | Publication date |
---|---|
US20090307167A1 (en) | 2009-12-10 |
JP2009545049A (en) | 2009-12-17 |
EP2054844A1 (en) | 2009-05-06 |
ATE546792T1 (en) | 2012-03-15 |
ES2381860T3 (en) | 2012-06-01 |
EP2054844B1 (en) | 2012-02-22 |
US8706669B2 (en) | 2014-04-22 |
AU2006346894A1 (en) | 2008-02-07 |
AU2006346894B2 (en) | 2012-01-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2006346894B2 (en) | Classification using support vector machines and variables selection | |
EP1484719B1 (en) | Currency validation | |
US9633500B1 (en) | Systems, methods and devices for managing rejected coins during coin processing | |
JP5175210B2 (en) | Media confirmation device and confirmation method | |
EP1217589B1 (en) | Currency validator | |
CN1685373B (en) | Paper sheet identifying method and paper sheet identifying device | |
KR100893613B1 (en) | Method and apparatus for recognizing and counting currency notes and securities having barcodes | |
US7000754B2 (en) | Currency validator | |
AU2003247029B2 (en) | Currency validator | |
US20210264711A1 (en) | Cash handling machine | |
JP5460658B2 (en) | How to use support vector machine and variable selection | |
US8474592B2 (en) | Currency discrimination | |
Beigh et al. | Counterfeit Currency Detection using Machine Learning | |
EP3703018B1 (en) | Cash-handling machine |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 06795394 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2009521356 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2006346894 Country of ref document: AU |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
ENP | Entry into the national phase |
Ref document number: 2006346894 Country of ref document: AU Date of ref document: 20060728 Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2006795394 Country of ref document: EP |
|
NENP | Non-entry into the national phase |
Ref country code: RU |
|
WWE | Wipo information: entry into national phase |
Ref document number: 12375158 Country of ref document: US |