CN110278128B - WeChat payment data cleaning method - Google Patents
- Publication number
- CN110278128B (application CN201910612699.XA; earlier publication CN110278128A)
- Authority
- CN
- China
- Prior art keywords
- data
- signaling
- payment
- wechat payment
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/02—Capturing of monitoring data
- H04L43/026—Capturing of monitoring data using flow identification
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W24/00—Supervisory, monitoring or testing arrangements
- H04W24/08—Testing, supervising or monitoring using real traffic
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Mobile Radio Communication Systems (AREA)
Abstract
The invention belongs to the technical field of wireless communication and discloses a data cleaning method for WeChat payment. First, an IP address library of the WeChat payment servers is established, which narrows the capture range and reduces the volume of data to be analyzed. Then, a packet capture tool at the S1-U interface of the LTE core network captures the IP data flows of the addresses in the library. The captured data is cleaned by the number of signaling messages in each TCP data stream; by port, keeping the TCP data streams on port 443, because WeChat payment uses the HTTPS encryption protocol and HTTPS uses port 443; by the data length of the HTTP POST signaling; and by the data length of the HTTP 200 OK signaling corresponding to that HTTP POST. By establishing a WeChat payment IP address library and studying WeChat payment signaling, the invention summarizes the relevant characteristics of WeChat payment and screens the captured data against them, so that effective data is retained, the volume of data to analyze is reduced, analysis efficiency is improved, and hardware cost is reduced.
Description
Technical Field
The invention belongs to the technical field of wireless communication, and relates to a data cleaning method applicable to 4G and 5G cashless payment, in particular to a data cleaning method for WeChat payment.
Background
At present, China has entered the era of cashless payment, and it is estimated that by 2021 the amount of cashless payment in China will exceed that of the United States, ranking first worldwide. WeChat payment is one of the main ways of paying, so the major operators pay more and more attention to optimizing the perceived quality of payment. However, the daily data volume in a mobile communication network is very large. Taking Jiangsu Telecom as an example, the daily data volume at the S1-U interface reaches the scale of 2000T. Without data cleaning, the data volume is too large to analyze effectively, or the analysis requires a very high hardware configuration at excessive investment cost. Moreover, video data accounts for a large proportion of this volume, so the share of useful data is small. Finally, data of this volume is difficult even to store without cleaning.
To aid understanding of the invention, a brief description of the 4G mobile communication network follows. Its structure is shown in Fig. 1. The Evolved Node B, abbreviated eNB, is the name of the base station in LTE (Long Term Evolution). Compared with the Node B in 3G, it integrates part of the functions of the RNC and reduces the number of protocol layers involved in communication. The functions of the eNB include: RRM; IP header compression and user data stream encryption; MME selection when a UE attaches; scheduling and transmission of paging information; scheduling and transmission of broadcast information; and configuring and providing eNB measurements. SAE is short for System Architecture Evolution. In 4G, LTE mainly concerns the long-term evolution of the 3GPP radio access network, and the upgraded LTE Advanced is intended to meet the International Telecommunication Union's requirements for 4G systems. SAE is the long-term evolution of the core network and defines an all-IP packet core network, the EPC (Evolved Packet Core). The system has only a packet domain and no circuit domain, is based on an all-IP structure, separates control from bearer, and flattens the network structure. It mainly comprises the MME, SGW, PGW and PCRF network elements. The SGW is the SAE Gateway, sometimes written S-GW, and takes over the user plane function of the SGSN network element in the original 3G network.
The number of users paying with WeChat keeps increasing, and statistics show that each person uses it 1-4 times per day. At the same time, complaints about poor WeChat payment experience are growing, and the three operators in China (China Mobile, China Telecom and China Unicom) are investing more and more in optimizing it. When operators summarized and analyzed the complaints related to WeChat payment, they found that poor payment experience is not always caused by wireless network problems; abnormalities of the WeChat payment server are also a main cause.
When the WeChat payment experience is poor, ordinary users usually do not complain to the WeChat technology provider. Their first reaction is often to blame the operator's network, so they complain only to the operator. At present, operators have few means of analyzing poor WeChat payment experience, and the data volume of an operator's core network is extremely large. For Jiangsu Telecom, for example, the data volume at the S1-U interface reaches 2000T every day. Without data cleaning, the data volume is too large to analyze effectively, or the analysis requires a very high hardware configuration at excessive investment cost. Meanwhile, unless the operator cooperates with the WeChat technology provider, the signaling related to WeChat payment cannot be accurately identified on the LTE core network side, and the experience of WeChat payment behavior, such as payment failure or payment stalling, cannot be analyzed.
Disclosure of Invention
The invention aims to provide a WeChat payment data cleaning method aiming at the defects of the data cleaning technology, which can screen the captured data according to characteristics, screen out effective data, reduce the analysis data volume, improve the analysis efficiency and reduce the hardware cost.
In order to achieve the purpose, the technical scheme adopted by the invention is a WeChat payment data cleaning method, which specifically comprises the following steps:
S1: establish an IP address library of the WeChat payment servers, narrowing the capture range and reducing the volume of data to be analyzed;
S2: capture the IP data flows of the addresses in the IP address library using a packet capture tool at the S1-U interface of the LTE core network;
S3: clean according to the number of signaling messages in each TCP data stream;
S4: select the TCP data streams transmitted on port 443, because WeChat payment uses the HTTPS encryption protocol and HTTPS uses port 443;
S5: determine the data length of the HTTP POST signaling;
S6: determine the data length of the HTTP 200 OK signaling corresponding to the HTTP POST.
Preferably, the packet capture tool is tcpdump, but other packet capture tools may be selected.
In step 3, the number of signaling messages in a single TCP data stream must be greater than 1; otherwise, effective analysis cannot be performed.
In step 5, the data length of the HTTP POST signaling is greater than 300 bytes; this signaling mainly serves to verify the WeChat payment password, confirm the payee's information, or transmit the payment amount.
In step 6, the data length of the HTTP 200 OK signaling corresponding to the HTTP POST is greater than 100 bytes.
Compared with the prior art, the invention has the beneficial effects that:
1. A WeChat payment IP address library is established by accurately identifying WeChat payment signaling on the mobile phone side;
2. Only data for the payment IP addresses is collected during capture, which reduces the amount of information captured;
3. Data that is obviously not payment data is cleaned out according to the signaling behavior characteristics of WeChat payment;
4. The WeChat payment IP address library does not need to be exhaustive, because the payment data is huge and the payment is carried out
Drawings
Fig. 1 is a structure diagram of a 4G mobile communication network;
fig. 2 is a flowchart of WeChat payment data cleansing.
Detailed Description
The present invention will be described in further detail with reference to the accompanying drawings and examples.
By establishing a WeChat payment IP address library and studying WeChat payment signaling, the invention summarizes the relevant characteristics of WeChat payment and screens the captured data against them, so that effective data is retained, the volume of data to analyze is reduced, analysis efficiency is improved, and hardware cost is reduced.
Because WeChat payment information is encrypted and transmitted over http + mmtls (https), the key information cannot be read from the HTTP content. The main signaling for a code-scan payment is nevertheless as follows:
One code-scan payment requires 3 groups of HTTP signaling, not counting the scanning itself (scanning requires 2 additional groups of HTTP).
The first group of HTTP (authen) performs authentication and confirms that the password is correct.
The second group of HTTP (f2fsucpage) performs WeChat account verification and pushes the face-to-face success page; it presumably confirms the merchant's account information.
The third group of HTTP (f2 fpaycount) confirms and transmits the transfer amount; the name literally means face-to-face payment confirmation.
Each HTTP transmission in turn consists of 3 parts: the TCP three-way handshake, the HTTP transmission, and the TCP four-way teardown. In other words, WeChat payment uses short connections: a connection is established each time information is transmitted and released once the transmission is finished, which reduces the load on the server.
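The short-connection pattern described above can be illustrated with a minimal sketch. It assumes each TCP segment has already been reduced to its flag set and payload length; the `Segment` type and the function name are hypothetical and are not part of the patent.

```python
from dataclasses import dataclass
from typing import FrozenSet, List


@dataclass(frozen=True)
class Segment:
    """One TCP segment, reduced to what the check needs (hypothetical type)."""
    flags: FrozenSet[str]   # e.g. frozenset({"SYN", "ACK"})
    payload_len: int        # TCP payload length in bytes


def is_short_connection(segments: List[Segment]) -> bool:
    """Return True if the segments show handshake + data + FIN teardown."""
    if len(segments) < 3:
        return False
    # Part 1: three-way handshake (SYN, SYN+ACK, ACK) opens the connection.
    handshake = [s.flags for s in segments[:3]]
    handshake_ok = handshake == [
        frozenset({"SYN"}),
        frozenset({"SYN", "ACK"}),
        frozenset({"ACK"}),
    ]
    rest = segments[3:]
    # Part 2: at least one segment after the handshake carries payload.
    carries_data = any(s.payload_len > 0 for s in rest)
    # Part 3: four-way teardown, i.e. a FIN from each side (two FINs in total).
    fin_count = sum(1 for s in rest if "FIN" in s.flags)
    return handshake_ok and carries_data and fin_count >= 2
```

Under this sketch, one code-scan payment would appear as three such connections in succession, one per HTTP group.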
The WeChat payment data cleaning flow chart is shown in FIG. 2:
The first step: Establishing a WeChat payment IP address library
At present, WeChat payment cannot be identified directly and accurately from the signaling alone. Instead, the payment signaling is identified accurately on the mobile phone side, and the IP addresses involved are collected and merged into a payment IP database.
On the mobile phone used to collect WeChat payment traffic, all irrelevant apps are uninstalled first, because some apps access the network in the background, which would interfere with identifying the payment IP addresses.
Install share software on the mobile phone to capture TCP/IP signaling. After the phone has scanned the code successfully, enter the amount and the password, leaving the last digit of the password unentered. Start the sharp software to begin packet capture, enter the last digit of the password after an interval of 20 s, and stop the capture once the payment succeeds.
After this operation, open the captured signaling on a computer for analysis. The point about 20 s into the capture is the moment of payment, and the IP address seen at that point is the WeChat payment IP address.
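The timing-based identification described above can be sketched as follows. This is only an illustration: the record format, the 3 s tolerance around the 20 s mark, and the `min_hits` threshold are assumptions rather than values given in the patent, and the addresses in the usage example are documentation placeholders.

```python
from collections import Counter
from typing import Iterable, List, Set, Tuple

# (seconds since capture start, remote server IP); hypothetical record format
Record = Tuple[float, str]


def payment_ip_candidates(records: Iterable[Record],
                          pay_time: float = 20.0,
                          tolerance: float = 3.0) -> Set[str]:
    """Remote IPs contacted within `tolerance` seconds of the payment moment."""
    return {ip for t, ip in records if abs(t - pay_time) <= tolerance}


def build_ip_library(captures: Iterable[Iterable[Record]],
                     min_hits: int = 1) -> List[str]:
    """Merge candidates from repeated test payments into one sorted library."""
    hits: Counter = Counter()
    for records in captures:
        hits.update(payment_ip_candidates(records))
    # Keep only IPs seen at the payment moment in at least `min_hits` captures.
    return sorted(ip for ip, n in hits.items() if n >= min_hits)
```

For example, `build_ip_library([capture_1, capture_2], min_hits=2)` would keep only the addresses that appeared at the payment moment in both test payments, which is one possible way to suppress background traffic.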
Repeat the above operations to collect the IP addresses related to payment; 15 payment IP addresses have been compiled so far.
The second step: WeChat payment signaling capture
tcpdump is installed on the signaling capture server, and only traffic for the 15 payment IP addresses is captured at the S1-U interface. This reduces the amount of information captured, the difficulty of subsequent data analysis, and the hardware cost.
The S1-U interface is the interface between the eNB and the SAE Gateway, and all data of users accessing the network passes through it.
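As an illustration of restricting the capture to the payment IP library, the sketch below builds a tcpdump filter expression and command line. The function names, interface name and output path are hypothetical. It also assumes the capture point sees the user's IP packets directly. On a raw S1-U link the user traffic is carried inside GTP-U tunnels, so a plain `host` filter would match only the outer tunnel addresses and the expression would need adapting.

```python
from typing import Iterable, List


def build_capture_filter(payment_ips: Iterable[str]) -> str:
    """Build a pcap filter expression matching any address in the library."""
    ips = sorted(set(payment_ips))
    if not ips:
        raise ValueError("the payment IP library is empty")
    return " or ".join(f"host {ip}" for ip in ips)


def tcpdump_command(interface: str, out_file: str,
                    payment_ips: Iterable[str]) -> List[str]:
    """Assemble an argument list; -i selects the interface, -w writes a pcap file."""
    return ["tcpdump", "-i", interface, "-w", out_file,
            build_capture_filter(payment_ips)]
```

The resulting list can be passed to `subprocess.run` on the capture server; keeping the filter in the capture step means irrelevant traffic is never written to disk.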
The third step: payment information cleansing
Although the first step has been to locate the payment IP address, the server to which the IP address corresponds may be configured with other functions. That is, the information interaction between the mobile phone and the IP address is not necessarily all information of the wechat payment.
The number of signaling messages in one TCP data stream is greater than 1;
the data stream uses the HTTP protocol;
the port number of one side is 443;
the length of the HTTP POST data packet is greater than 300 bytes;
the length of the 200 OK corresponding to the HTTP POST data packet is greater than 100 bytes.
A TCP data stream satisfying all of the above conditions is a valid WeChat payment data stream.
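The five conditions above can be combined into a single predicate, sketched below. The `FlowSummary` fields are hypothetical: the patent does not specify how the per-stream values are extracted, so they are taken here as precomputed inputs.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class FlowSummary:
    """Per-stream values assumed to be extracted beforehand (hypothetical type)."""
    signaling_count: int           # number of signaling messages in the TCP stream
    is_http: bool                  # stream identified as HTTP-based
    port_a: int                    # port number of one side
    port_b: int                    # port number of the other side
    post_length: Optional[int]     # bytes; None if no HTTP POST was seen
    ok_length: Optional[int]       # bytes of the matching 200 OK; None if absent


def is_payment_flow(f: FlowSummary) -> bool:
    """Apply the cleaning conditions; every one must hold."""
    return (
        f.signaling_count > 1                       # more than one signaling message
        and f.is_http                               # HTTP protocol data stream
        and 443 in (f.port_a, f.port_b)             # one side uses port 443
        and f.post_length is not None
        and f.post_length > 300                     # POST longer than 300 bytes
        and f.ok_length is not None
        and f.ok_length > 100                       # 200 OK longer than 100 bytes
    )
```

Cleaning a capture then amounts to keeping the streams for which the predicate holds, for example `[f for f in flows if is_payment_flow(f)]`.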
Note: the IP address that initiates the TCP three-way handshake is the handset IP address.
Note: a TCP data stream refers to the packets that share the same source IP, destination IP, and port numbers.
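The two notes above can be sketched in code. The `Packet` type and function names are hypothetical, and treating both directions of a connection as one stream is an interpretation of the note, not something the patent states explicitly.

```python
from collections import defaultdict
from typing import Dict, List, NamedTuple, Optional, Tuple


class Packet(NamedTuple):
    """Minimal packet view used for grouping (hypothetical type)."""
    src_ip: str
    src_port: int
    dst_ip: str
    dst_port: int
    syn: bool
    ack: bool


Endpoint = Tuple[str, int]
FlowKey = Tuple[Endpoint, Endpoint]


def flow_key(p: Packet) -> FlowKey:
    """Direction-independent key: the same IP pair and the same port pair."""
    a, b = (p.src_ip, p.src_port), (p.dst_ip, p.dst_port)
    return (a, b) if a <= b else (b, a)


def group_flows(packets: List[Packet]) -> Dict[FlowKey, List[Packet]]:
    """Group packets into TCP data streams, preserving capture order."""
    flows: Dict[FlowKey, List[Packet]] = defaultdict(list)
    for p in packets:
        flows[flow_key(p)].append(p)
    return dict(flows)


def handset_ip(flow: List[Packet]) -> Optional[str]:
    """The sender of the initial SYN (SYN set, ACK clear) is the handset."""
    for p in flow:
        if p.syn and not p.ack:
            return p.src_ip
    return None
```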
Experiments show that after the cleaning method is applied, the WeChat payment data volume for Jiangsu Telecom across the province is about 4T, so the amount of computation is greatly reduced and the processing time is greatly shortened.
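As a rough worked figure, assuming the 4T retained after cleaning covers the same daily period and uses the same unit as the 2000T daily S1-U volume cited in the background (the text does not state this explicitly), the retained share would be:

```latex
\[
\frac{4\,\mathrm{T}}{2000\,\mathrm{T}} = 0.002 = 0.2\%
\]
```

Under that assumption, roughly 99.8% of the captured volume is excluded before analysis.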
The above description of the specific embodiments is not intended to limit the present invention, and any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the scope of the present invention.
Claims (3)
1. A WeChat payment data cleaning method is characterized by comprising the following steps:
s1: an IP address library of the WeChat payment server is established, the information capturing range is narrowed, and the data volume needing to be analyzed is reduced;
s2: capturing the IP data flows of the addresses in the IP address library by using a packet capture tool at an S1-U interface of the LTE core network;
s3: cleaning according to the signaling quantity of the TCP data stream;
s4: selecting port 443 to transmit the TCP data stream;
s5: determining the data length of the http post signaling, wherein the data length is more than 300 bytes;
s6: and determining the data length of the http 200ok signaling corresponding to the http post, wherein the data length is greater than 100 bytes.
2. The WeChat payment data cleansing method of claim 1, wherein the packet capture tool in step 2 is tcpdump.
3. The WeChat payment data cleansing method of claim 1, wherein in step 3, the signaling number of a single TCP data stream is greater than 1.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910612699.XA CN110278128B (en) | 2019-07-05 | 2019-07-05 | WeChat payment data cleaning method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910612699.XA CN110278128B (en) | 2019-07-05 | 2019-07-05 | WeChat payment data cleaning method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110278128A (en) | 2019-09-24 |
CN110278128B (en) | 2021-09-03 |
Family
ID=67964126
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910612699.XA Active CN110278128B (en) | 2019-07-05 | 2019-07-05 | WeChat payment data cleaning method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110278128B (en) |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8489469B1 (en) * | 2012-08-30 | 2013-07-16 | Elbex Video Ltd. | Method and structure for simplified coding of display pages for operating a closed circuit E-commerce |
CN104978644A (en) * | 2015-06-14 | 2015-10-14 | 兰兴欣 | Pickup method using intelligent express cabinet |
CN105354210A (en) * | 2015-09-23 | 2016-02-24 | 深圳市爱贝信息技术有限公司 | Mobile game payment account behavior data processing method and apparatus |
CN108053257A (en) * | 2017-12-27 | 2018-05-18 | 互动派科技股份有限公司 | A kind of big data user runs the method for building up and application system of Pyramid |
CN108898426A (en) * | 2018-06-14 | 2018-11-27 | 上海米飞网络科技有限公司 | The visualization system and method for payment data processing classification |
Also Published As
Publication number | Publication date |
---|---|
CN110278128A (en) | 2019-09-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP3214861B1 (en) | Method, device and system for detecting fraudulent user | |
US9853867B2 (en) | Method and apparatus to determine network quality | |
EP1794927B1 (en) | Apparatus and method for integrated billing management by real-time session management in wire/wireless integrated service network | |
CN108337652B (en) | Method and device for detecting flow fraud | |
WO2005076644A1 (en) | Method for determining mobile terminal performance in a running wireless network | |
CN103517245B (en) | A kind of charging method and system of D2D communications | |
US20140171021A1 (en) | Method and apparatus for optimizing delivery of network usage and billing data | |
CN102868988B (en) | Based on the method for processing business of policy and charging control and system in wireless network | |
CN110505069B (en) | Method and device for generating customized ticket | |
KR100389801B1 (en) | Billing agency apparatus and method for wireless internet service | |
CN103096356A (en) | Wireless network performance analysis method | |
US20180183939A1 (en) | Method and system for detecting anomalies in consumption of data and charging of data services | |
CN103167502B (en) | Based on the method for the illegal calling of OTA technology regulation | |
CN110278128B (en) | WeChat payment data cleaning method | |
EP3050334B1 (en) | Managing roaming information in communications | |
KR100812676B1 (en) | Method for Generation of Charging Data per Contents in Mobile Communication System | |
CN109257711B (en) | System and method for backfilling number based on communication charging ticket | |
US20150071129A1 (en) | Subscriber-specific tracing in communications | |
KR102353814B1 (en) | Method and appratus for providing roaming services | |
CN102572840B (en) | A kind of method utilizing monitoring signaling technology to differentiate novel malicious callback service | |
KR100848501B1 (en) | Method for Receiving Wireless Internet Quality Measure Information of Network, System and Server Therefor | |
KR100568471B1 (en) | Apparatus and method for subdividing charge in data exclusive network comprising data call connection apparatus of different machine | |
CN103888922B (en) | A kind of method and device of dismantling call and management of end-user account safety | |
CN115065995B (en) | Associated information management method, device, electronic equipment and storage medium | |
EP3061042B1 (en) | Method, user equipment and system for revenue maximization in a communication network |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||