CN112365340A - Multi-mode personal loan risk prediction method - Google Patents
Multi-mode personal loan risk prediction method Download PDFInfo
- Publication number
- CN112365340A CN112365340A CN202011312355.6A CN202011312355A CN112365340A CN 112365340 A CN112365340 A CN 112365340A CN 202011312355 A CN202011312355 A CN 202011312355A CN 112365340 A CN112365340 A CN 112365340A
- Authority
- CN
- China
- Prior art keywords
- face
- video
- audio
- key points
- features
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 31
- 238000001514 detection method Methods 0.000 claims abstract description 13
- 238000012545 processing Methods 0.000 claims abstract description 10
- 238000013058 risk prediction model Methods 0.000 claims abstract description 8
- 238000004364 calculation method Methods 0.000 claims description 16
- 230000002123 temporal effect Effects 0.000 claims description 11
- 238000005070 sampling Methods 0.000 claims description 8
- 210000001525 retina Anatomy 0.000 claims description 7
- 238000001914 filtration Methods 0.000 claims description 4
- 238000002790 cross-validation Methods 0.000 claims description 3
- 238000011156 evaluation Methods 0.000 claims description 3
- 238000005457 optimization Methods 0.000 claims description 3
- 238000007781 pre-processing Methods 0.000 abstract 1
- 230000003595 spectral effect Effects 0.000 description 22
- 238000012706 support-vector machine Methods 0.000 description 9
- 230000008859 change Effects 0.000 description 5
- 238000011161 development Methods 0.000 description 5
- 230000004907 flux Effects 0.000 description 4
- 230000000694 effects Effects 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000033001 locomotion Effects 0.000 description 3
- 238000013473 artificial intelligence Methods 0.000 description 2
- 230000006854 communication Effects 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 238000010606 normalization Methods 0.000 description 2
- 238000012502 risk assessment Methods 0.000 description 2
- 230000004075 alteration Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 239000006185 dispersion Substances 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000012549 training Methods 0.000 description 1
- 238000002759 z-score normalization Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q40/00—Finance; Insurance; Tax strategies; Processing of corporate or income taxes
- G06Q40/03—Credit; Loans; Processing thereof
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
- G06N20/10—Machine learning using kernel methods, e.g. support vector machines [SVM]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/161—Detection; Localisation; Normalisation
- G06V40/166—Detection; Localisation; Normalisation using acquisition arrangements
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/168—Feature extraction; Face representation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/172—Classification, e.g. identification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/174—Facial expression recognition
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Oral & Maxillofacial Surgery (AREA)
- Health & Medical Sciences (AREA)
- Multimedia (AREA)
- Human Computer Interaction (AREA)
- General Health & Medical Sciences (AREA)
- Business, Economics & Management (AREA)
- Finance (AREA)
- Software Systems (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Accounting & Taxation (AREA)
- Strategic Management (AREA)
- General Business, Economics & Management (AREA)
- Technology Law (AREA)
- Marketing (AREA)
- Economics (AREA)
- Artificial Intelligence (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Medical Informatics (AREA)
- Development Economics (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)
Abstract
The invention relates to the technical field of online loans, and particularly discloses a multi-mode personal loan risk prediction method, which comprises the following steps: obtaining audio and video information of the borrower through the answer of the specified questions of the online video; preprocessing audio and video information and extracting audio features; carrying out face detection on a borrower in a video to obtain a face frame, carrying out face detection in the face frame to obtain 68 face key points, carrying out min-max coordinate standardization according to the face frame, and selecting 32 key points from the 68 face key points to extract video time domain features; performing z-score processing on the audio characteristic and the video time domain characteristic, and constructing a multi-modal combined characteristic according to weight proportion; and establishing an SVM risk prediction model of the personal loan based on the multi-mode combined characteristics. The multi-mode personal loan risk prediction method provided by the invention can predict the loan risk of a loan applicant in real time by extracting the micro-expression time domain characteristics of the audio frequency and the human face of the loan.
Description
Technical Field
The invention relates to the technical field of online loans, in particular to a multi-mode personal loan risk prediction method.
Background
With the rapid development of internet technology, big data and artificial intelligence technology, internet finance (online loan) gradually becomes an important channel for small and micro enterprises, family workshop-type enterprises and low-income groups to acquire financing. Meanwhile, the nation advocates the promotion of "general finance" to stabilize growth, expand employment, promote innovation, thrive the market and meet the various needs of the people.
The current online loan auditing is mainly implemented by auditing the authenticity of data provided by a borrower through a telephone access mode by an auditing person, and specifically comprises auditing the authenticity of basic information of the borrower, personal credit investigation, multi-head loan, e-commerce and internet behavior data, black and grey lists and the like.
However, most loan applicants are small and miniature enterprises, family workshop-type enterprises and low-income groups, and the groups have the problems of difficult income information acquisition, difficult confirmation, lack of qualified mortgage, missing or insufficient credit records and the like.
In the face of the group, the approvers are difficult to accurately evaluate the risk level of the client only through telephone communication, and meanwhile, the manual auditing mode is low in efficiency, changes of micro-expression and context of the borrowers in the communication process are ignored, so that the auditing result of the manual auditing is not accurate enough, namely the credit degree is not high.
Disclosure of Invention
The invention aims to overcome the defects in the prior art, provides a multi-mode personal loan risk prediction method, can make reference to the preliminary loan risk level of small and micro enterprises and low-income groups, which have difficulty in acquiring, confirming and lacking qualified mortgage information, and have missing or insufficient credit records, and effectively improve the loan auditing efficiency.
As a first aspect of the present invention, there is provided a multi-modal personal loan risk prediction method, comprising:
step S1: obtaining audio and video information of a borrower through the answer of the specified questions of the online video, and carrying out comprehensive weighted manual evaluation scores according to the answer results of the questions and the speed and the micro-expression of the borrower;
step S2: carrying out mean value smooth denoising pretreatment on the audio and video information of the borrower, and extracting various audio features;
step S3: performing face detection on a borrower in a video by utilizing Retina face to obtain a face frame, performing face key point detection in the face frame by utilizing a PRNet to obtain 68 face key points, performing min-max coordinate standardization according to the face frame, and selecting 32 key points of eyes and mouths from the 68 face key points to extract multiple types of video time domain features;
step S4: carrying out z-score standardization processing on the multi-class audio features and the video time domain features, and constructing multi-modal combined features according to weight proportion;
step S5: and establishing an SVM risk prediction model of the personal loan based on the multi-mode combined characteristics.
Further, the step S2 further includes:
adopting 3 multiplied by 3 mean value smooth filtering to the audio and video information of the borrower to extract 34 types of audio featuresWhere N is the number of samples, the sampling frequency fs is 16000Hz, the calculation window win is 0.05, and the calculation step is 0.05.
Further, the step S3 further includes:
obtaining the largest face frame bbox in the video by utilizing Retina face, and performing 1.25 times of expansion on the face frame bbox to obtain the maximum face frame bboxAn area;
to this by PRNetDetecting key points of the human face in the face frame of the area to obtain 68 key points of the human face;
carrying out min-max standardization processing on the point of landmark in the face frame in the x and y directions by a bbox scale respectively;
selecting 32 key points of the eyes and the mouth part in the face frame as target key points;
constructing micro-expression data set by target key points in key frames in video streamWherein fpsnumThe number of key frames;
extracting 34 types of video time domain features F in the micro expression data set DB∈{RN×68|]Rx N×34,Ry N×34]And f, wherein the sampling frequency fs is 16000Hz, the calculation window win is 0.05, and the calculation step is 0.05.
Further, the step S4 further includes:
separately for audio features F1And video temporal features F2Carrying out the standardization processing of z-score to correspondingly obtain the audio frequency characteristic F conforming to the standard normal distribution1' and video temporal feature F2';
For audio frequency characteristic F conforming to standard normal distribution1' and video temporal feature F2' Association to form a multimodal Association signature F ═ w1*F'A,w2*F'B]∈{RN×102In which w1+w2=1。
Further, the step S5 further includes:
loading the multi-modal combined features into an SVM regression prediction model to train a loan risk prediction model; wherein, the SVM regression prediction model adopts RBF kernel function image, and punishment coefficients C, gamma and w are subjected to cross validation grid optimization method according to 10 folds1,w2And optimizing.
The multi-mode personal loan risk prediction method provided by the invention has the following advantages: the method has the advantages that through analyzing and displaying the answer audio and video of the specified questions (such as personal income, loan experience and the like) submitted by an applicant during loan application, in the answering process, fluent speech speed, firm micro-expression change and the like of the loan applicant have positive correlation effects on the risk assessment of the loan, so that the answer audio and video information of the specified questions can be combined with an artificial intelligence technology to carry out preliminary assessment on the risk level of the loan applicant to serve as a new data dimension source for big data wind control; the method does not need manual intervention, can make reference to the preliminary loan risk level of small and micro enterprises and low-income groups, which have difficulty in collecting, identifying and lacking qualified mortgage information and lack or insufficient credit records, and effectively improves the loan auditing efficiency.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention and not to limit the invention.
Fig. 1 is a flow chart of a multi-modal personal loan risk prediction method provided by the invention.
Fig. 2 is a flowchart of an embodiment of the method for predicting the risk of a multi-modal personal loan.
Detailed Description
To further illustrate the technical means and effects of the present invention for achieving the predetermined objects, the following detailed description of the embodiments, structures, features and effects of the method for predicting a multi-modal personal loan risk according to the present invention will be made with reference to the accompanying drawings and preferred embodiments. It is to be understood that the embodiments described are only a few embodiments of the present invention, and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments of the present invention without any inventive step, are within the scope of the present invention.
In this embodiment, a multi-modal personal loan risk prediction method is provided, as shown in fig. 1, the multi-modal personal loan risk prediction method includes:
step S1: obtaining audio and video information of a borrower through the answer of the specified questions of the online video, and carrying out comprehensive weighted manual evaluation scores according to the answer results of the questions and the speed and the micro-expression of the borrower;
step S2: carrying out mean value smooth denoising pretreatment on the audio and video information of the borrower, and extracting various audio features;
step S3: performing face detection on a borrower in a video by utilizing Retina face to obtain a face frame, performing face key point detection in the face frame by utilizing a PRNet to obtain 68 face key points, performing min-max coordinate standardization according to the face frame, and selecting 32 key points of eyes and mouths from the 68 face key points to extract multiple types of video time domain features;
step S4: carrying out z-score standardization processing on the multi-class audio features and the video time domain features, and constructing multi-modal combined features according to weight proportion;
step S5: and establishing an SVM risk prediction model of the personal loan based on the multi-mode combined characteristics.
Specifically, the multi-class audio features include 34 classes of audio features such as Spectral Spread and Energy, and the multi-class video time domain features include 34 classes of video time domain features such as Spectral Spread and Energy.
The multi-mode personal loan risk prediction method provided by the invention extracts multi-mode characteristics through the answering audio and video of the questions specified by the loan applicant, and performs preliminary loan risk assessment and loan amount estimation on a borrower by combining a risk prediction model of an SVM (support vector machine).
Preferably, in step S2, the method further includes:
in order to reduce the influence of factors such as background noise and the like, 3 multiplied by 3 mean value smoothing filtering is adopted for the audio and video information of the borrower, and 34-class audio features F are extractedA∈{RN×34N is the number of samples, the sampling frequency fs is 16000Hz, the calculation window win is 0.05, and the calculation step is 0.05; wherein,respectively represent: zero Crossing Rate, Energy, entry of Energy, Spectral Centroid, Spectral Spread, Spectral entry, Spectral Flux, Spectral roll off,are MFCCs (mel-frequency cepstral coefficients),is Chroma Vector (12 steps scale),chroma development (standard Deviation of 12-step scale), and N is the number of samples.
Preferably, in step S3, the method further includes:
obtaining the largest Face frame bbox in the video by using a Retina Face (Single-stage Face localization in the Wild) Face detection algorithm, and performing 1.25 times expansion on the Face frame bbox to obtain the maximum Face frame bboxAn area;
the information is processed by PRNet (Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network)Detecting key points of the human face in the face frame of the area to obtain 68 key points of the human face;
in order to eliminate the influence of face scale change caused by face or screen relative motion, the min-max (dispersion standardization) of the bbox (face coordinate frame) scale is respectively carried out on the points of the landmark (face key points) in the face frame in the directions of x and y;
in the key points of the face, the eyes and the mouth can reflect the change of the micro-expression of the face, so that 32 key points index [ 36.,. 68] of the eyes and the mouth in the face frame are selected as target key points;
constructing micro-expression data set by target key points in key frames in video streamWherein fpsnumAs to the number of key frames,representing the x-axis data information,representing y-axis data information;
extracting 34 types of video time domain features F in the micro expression data set DB∈{RN×68|[Rx N×34,Ry N×34]And in order to keep consistency with the audio information, the sampling frequency fs is 16000Hz, the calculation window win is 0.05, and the calculation step is 0.05.
Specifically, a human Face detection algorithm is used to perform human Face detection on a borrower in a video to obtain a maximum human Face frame (x min, y min, x max, y max), in order to enable a human Face key point to be inside the human Face frame, the center of the human Face frame is expanded by 1.25 times, a PRNet (Joint 3D Face Reconstruction and place Alignment with Position Map Regression Network) is used to perform human Face key point detection to obtain 68 human Face key points, and the coordinate standardization of min-max is performed according to the human Face frame, as shown in formula (1):
preferably, in step S4, the method further includes:
in order to ensure that the audio features and the video temporal features can meet the order of magnitude consistency, the audio features F are respectively matched1And video temporal features F2Carrying out normalization processing on z-score (standard deviation normalization) to obtain an audio feature F 'conforming to standard normal distribution'1And video temporal feature F'2;
For audio feature F 'conforming to standard normal distribution'1And video temporal feature F'2Combining to form a multi-mode combined feature F ═ w1*F'A,w2*F'B]∈{RN×102In which w1+w2=1。
In particular, separately for the audio features FAAnd video temporal features FBThe z-score normalization is performed as shown in formula (2), where u is the mean of the corresponding features and σ is the pairF 'conforming to standard normal distribution is obtained according to characteristic standard deviation'AAnd F'B;
Preferably, in step S5, the method further includes:
loading the multi-modal combined features into an SVM regression prediction model to train a loan risk prediction model; wherein, the SVM regression prediction model adopts RBF kernel function image, and punishment coefficients C, gamma and w are subjected to cross validation grid optimization method according to 10 folds1,w2And optimizing.
Specifically, the loan applicant video feature extraction process comprises the following steps:
acquiring a video key frame, and acquiring a face frame bbox of a borrower in a video by using a RetinaFace face detection algorithm;
performing 1.25 times of central expansion on bbox, and acquiring landmark of the borrower in the video stream by using PRNet;
carrying out min-max standardization processing on the key points in the x and y directions to eliminate scale influence caused by relative movement of the human face, wherein the standardized scale is the bbox size;
acquiring a target key point index [ 36.., 68] of eyes and mouth, which is easier to reflect the change of micro expression, in the human face;
merging the normalized target key points in the x and y directions, extracting 34 types of video time domain features F by adopting a sampling frequency fs of 16000Hz, a calculation window win of 0.05 and a calculation step of 0.05B∈{RN×68|[Rx N×34,Ry N ×34]}; wherein,respectively represent: zero Crossing Rate, Energy of Energy, Spectral Central, Spectral Spread, Spectral Entrophy, Spectral Flux, and Spectral Rolloff,in the form of the MFCCs, is,is a Chroma Vector, and is a Chroma Vector,is Chroma development.
As shown in fig. 2, a flow chart of an embodiment of a method for predicting a multi-modal personal loan risk includes the following specific steps:
acquiring audio and video information of a loan applicant;
eliminating the influence of noise by adopting 3 multiplied by 3 mean filtering on audio information, setting the sampling frequency fs to 16000Hz, the calculation window win to 0.05, the calculation step size step to 0.05, and extracting 34 types of audio features FA∈{RN×34And (c) the step of (c) in which,respectively represent: zero Crossing Rate, Energy of Energy, Spectral Central, Spectral Spread, Spectral Entrophy, Spectral Flux, Spectral Rolloff,in the form of the MFCCs, is,is a Chroma Vector, and is a Chroma Vector,chroma development;
utilizing Retina face to detect the face of a borrower in a video to obtain a maximum face frame (x min, y min, x max, y max), expanding the center of the face frame by 1.25 times in order to enable a face key point to be in the face frame, obtaining 68 personal face key points in an expansion area through PRNet, and carrying out min-max coordinate standardization according to the face frame to eliminate scale change caused by relative movement of a camera and the face, and selecting 32 key points index of eyes and mouth in the key points [36, 68] of the eyes and mouth]Combining the target key points in all the key frames to form a micro-expression data setWherein fpsnumThe number of key frames is such that,representing the x-axis data information,representing y-axis data information, and extracting 34 types of video time domain features FB∈{RN×68|[Rx N×34,Ry N×34]And (c) the step of (c) in which,respectively represent: zero Crossing Rate, Energy of Energy, Spectral Central, Spectral Spread, Spectral Entrophy, Spectral Flux, and Spectral Rolloff,in the form of the MFCCs, is,is a Chroma Vector, and is a Chroma Vector,chroma development;
to ensure that the audio and video features meet the order of magnitude consistency, the audio feature F is separately alignedAAnd FBStandardized to z-score to obtain F 'according with standard normal distribution'AAnd F'BThe normalized features are combined to form a combined feature F ═ w1*F'A,w2*F'B]∈{RN×102In which w1+w2=1;
And training a loan risk SVM regression prediction model by using the joint features.
The multi-mode personal loan risk prediction method provided by the invention predicts the loan risk of the user by the audio and video information of the loan applicant for the specified questions, increases the analysis of micro-expression and language dimension information, does not need manual intervention, and can make reference for the preliminary loan risk level of small and micro enterprises and low-income groups with difficulty in income determination, lack of qualified quality guarantee, and lack or deficiency of credit records.
Although the present invention has been described with reference to a preferred embodiment, it should be understood that various changes, substitutions and alterations can be made herein without departing from the spirit and scope of the invention as defined by the appended claims.
Claims (5)
1. A method for multi-modal personal loan risk prediction, comprising:
step S1: obtaining audio and video information of a borrower through the answer of the specified questions of the online video, and carrying out comprehensive weighted manual evaluation scores according to the answer results of the questions and the speed and the micro-expression of the borrower;
step S2: carrying out mean value smooth denoising pretreatment on the audio and video information of the borrower, and extracting various audio features;
step S3: performing face detection on a borrower in a video by utilizing Retina face to obtain a face frame, performing face key point detection in the face frame by utilizing a PRNet to obtain 68 face key points, performing min-max coordinate standardization according to the face frame, and selecting 32 key points of eyes and mouths from the 68 face key points to extract multiple types of video time domain features;
step S4: carrying out z-score standardization processing on the multi-class audio features and the video time domain features, and constructing multi-modal combined features according to weight proportion;
step S5: and establishing an SVM risk prediction model of the personal loan based on the multi-mode combined characteristics.
2. The method for predicting the risk of multi-modal personal loan according to claim 1, wherein the step S2 further comprises:
adopting 3 multiplied by 3 mean value smooth filtering to the audio and video information of the borrower to extract 34-class audio features FA∈{RN×34N is the number of samples, the sampling frequency fs is 16000Hz, the calculation window win is 0.05, and the calculation step is 0.05.
3. The method for predicting the risk of multi-modal personal loan according to claim 1, wherein the step S3 further comprises:
obtaining the largest face frame bbox in the video by utilizing Retina face, and performing 1.25 times of expansion on the face frame bbox to obtain the maximum face frame bboxAn area;
to this by PRNetDetecting key points of the human face in the face frame of the area to obtain 68 key points of the human face;
carrying out min-max standardization processing on the point of landmark in the face frame in the x and y directions by a bbox scale respectively;
selecting 32 key points of the eyes and the mouth part in the face frame as target key points;
constructing micro-expression data set by target key points in key frames in video streamWherein fpsnumThe number of key frames;
extracting 34 types of video time domain features F in the micro expression data set DB∈{RN×68|[Rx N×34,Ry N×34]And f, wherein the sampling frequency fs is 16000Hz, the calculation window win is 0.05, and the calculation step is 0.05.
4. The method for predicting the risk of multi-modal personal loan according to claim 1, wherein the step S4 further comprises:
separately for audio features F1And video temporal features F2Carrying out the standardization processing of z-score to correspondingly obtain the audio frequency characteristic F conforming to the standard normal distribution1' and video temporal feature F2';
For audio frequency characteristic F conforming to standard normal distribution1' and video temporal feature F2' Association to form a multimodal Association signature F ═ w1*F'A,w2*F'B]∈{RN×102In which w1+w2=1。
5. The method for predicting the risk of multi-modal personal loan according to claim 1, wherein the step S5 further comprises:
loading the multi-modal combined features into an SVM regression prediction model to train a loan risk prediction model; wherein, the SVM regression prediction model adopts RBF kernel function image, and punishment coefficients C, gamma and w are subjected to cross validation grid optimization method according to 10 folds1,w2And optimizing.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011312355.6A CN112365340A (en) | 2020-11-20 | 2020-11-20 | Multi-mode personal loan risk prediction method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011312355.6A CN112365340A (en) | 2020-11-20 | 2020-11-20 | Multi-mode personal loan risk prediction method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112365340A true CN112365340A (en) | 2021-02-12 |
Family
ID=74533074
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011312355.6A Pending CN112365340A (en) | 2020-11-20 | 2020-11-20 | Multi-mode personal loan risk prediction method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112365340A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113158978A (en) * | 2021-05-14 | 2021-07-23 | 无锡锡商银行股份有限公司 | Risk early warning method for micro-expression recognition in video auditing |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20170124432A1 (en) * | 2015-11-03 | 2017-05-04 | Baidu Usa Llc | Systems and methods for attention-based configurable convolutional neural networks (abc-cnn) for visual question answering |
CN109583569A (en) * | 2018-11-30 | 2019-04-05 | 中控智慧科技股份有限公司 | A kind of multi-modal Feature fusion and device based on convolutional neural networks |
CN109697665A (en) * | 2018-12-15 | 2019-04-30 | 深圳壹账通智能科技有限公司 | Loan checking method, device, equipment and medium based on artificial intelligence |
CN109767321A (en) * | 2018-12-18 | 2019-05-17 | 深圳壹账通智能科技有限公司 | Question answering process optimization method, device, computer equipment and storage medium |
CN109829358A (en) * | 2018-12-14 | 2019-05-31 | 深圳壹账通智能科技有限公司 | Micro- expression loan control method, device, computer equipment and storage medium |
CN109840485A (en) * | 2019-01-23 | 2019-06-04 | 科大讯飞股份有限公司 | A kind of micro- human facial feature extraction method, apparatus, equipment and readable storage medium storing program for executing |
CN110009481A (en) * | 2019-03-12 | 2019-07-12 | 平安科技(深圳)有限公司 | A kind of loan measures and procedures for the examination and approval and system based on recognition of face |
-
2020
- 2020-11-20 CN CN202011312355.6A patent/CN112365340A/en active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20170124432A1 (en) * | 2015-11-03 | 2017-05-04 | Baidu Usa Llc | Systems and methods for attention-based configurable convolutional neural networks (abc-cnn) for visual question answering |
CN109583569A (en) * | 2018-11-30 | 2019-04-05 | 中控智慧科技股份有限公司 | A kind of multi-modal Feature fusion and device based on convolutional neural networks |
CN109829358A (en) * | 2018-12-14 | 2019-05-31 | 深圳壹账通智能科技有限公司 | Micro- expression loan control method, device, computer equipment and storage medium |
CN109697665A (en) * | 2018-12-15 | 2019-04-30 | 深圳壹账通智能科技有限公司 | Loan checking method, device, equipment and medium based on artificial intelligence |
CN109767321A (en) * | 2018-12-18 | 2019-05-17 | 深圳壹账通智能科技有限公司 | Question answering process optimization method, device, computer equipment and storage medium |
CN109840485A (en) * | 2019-01-23 | 2019-06-04 | 科大讯飞股份有限公司 | A kind of micro- human facial feature extraction method, apparatus, equipment and readable storage medium storing program for executing |
CN110009481A (en) * | 2019-03-12 | 2019-07-12 | 平安科技(深圳)有限公司 | A kind of loan measures and procedures for the examination and approval and system based on recognition of face |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113158978A (en) * | 2021-05-14 | 2021-07-23 | 无锡锡商银行股份有限公司 | Risk early warning method for micro-expression recognition in video auditing |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110399929B (en) | Fundus image classification method, fundus image classification apparatus, and computer-readable storage medium | |
US12039454B2 (en) | Microexpression-based image recognition method and apparatus, and related device | |
CN111563422B (en) | Service evaluation acquisition method and device based on bimodal emotion recognition network | |
CN105160318B (en) | Lie detecting method based on facial expression and system | |
CN108717663B (en) | Facial tag fraud judging method, device, equipment and medium based on micro expression | |
US11849998B2 (en) | Method for pupil detection for cognitive monitoring, analysis, and biofeedback-based treatment and training | |
CN109522815A (en) | A kind of focus appraisal procedure, device and electronic equipment | |
CN109670430A (en) | A kind of face vivo identification method of the multiple Classifiers Combination based on deep learning | |
CN110084149B (en) | Face verification method based on hard sample quadruple dynamic boundary loss function | |
CN111666845B (en) | Small sample deep learning multi-mode sign language recognition method based on key frame sampling | |
CN104123543A (en) | Eyeball movement identification method based on face identification | |
CN112633221B (en) | Face direction detection method and related device | |
CN109325472B (en) | Face living body detection method based on depth information | |
CN112084927A (en) | Lip language identification method fusing multiple visual information | |
CN111860253A (en) | Multitask attribute identification method, multitask attribute identification device, multitask attribute identification medium and multitask attribute identification equipment for driving scene | |
CN115546861A (en) | Online classroom concentration degree identification method, system, equipment and medium | |
CN112348809A (en) | No-reference screen content image quality evaluation method based on multitask deep learning | |
CN112365340A (en) | Multi-mode personal loan risk prediction method | |
CN113283372A (en) | Method and apparatus for processing image of person | |
CN112488165A (en) | Infrared pedestrian identification method and system based on deep learning model | |
Ellahi et al. | Hmm-based framework to measure the visual fidelity of tone mapping operators | |
CN116844202A (en) | Construction method of distributed online teaching auxiliary system | |
JP6927495B2 (en) | Person evaluation equipment, programs, and methods | |
CN115116117A (en) | Learning input data acquisition method based on multi-mode fusion network | |
CN109409325B (en) | Identification method and electronic equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20210212 |
|
RJ01 | Rejection of invention patent application after publication |