CN113076316A - Information relation mapping analysis method, device, equipment and storage medium - Google Patents

Information relation mapping analysis method, device, equipment and storage medium Download PDF

Info

Publication number
CN113076316A
CN113076316A CN202110374166.XA CN202110374166A CN113076316A CN 113076316 A CN113076316 A CN 113076316A CN 202110374166 A CN202110374166 A CN 202110374166A CN 113076316 A CN113076316 A CN 113076316A
Authority
CN
China
Prior art keywords
terminal
keyword
data
tac
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110374166.XA
Other languages
Chinese (zh)
Other versions
CN113076316B (en
Inventor
刘春龙
林炜锋
李少青
孟宝权
王杰
杨满智
蔡琳
梁彧
田野
傅强
金红
陈晓光
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Eversec Beijing Technology Co Ltd
Original Assignee
Eversec Beijing Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Eversec Beijing Technology Co Ltd filed Critical Eversec Beijing Technology Co Ltd
Priority to CN202110374166.XA priority Critical patent/CN113076316B/en
Publication of CN113076316A publication Critical patent/CN113076316A/en
Application granted granted Critical
Publication of CN113076316B publication Critical patent/CN113076316B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2282Tablespace storage structures; Management thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/23Updating
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/02Capturing of monitoring data
    • H04L43/028Capturing of monitoring data by filtering

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Software Systems (AREA)
  • Fuzzy Systems (AREA)
  • Mathematical Physics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computational Linguistics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiment of the invention discloses an analysis method, a device, equipment and a storage medium for information relation mapping, wherein the method comprises the following steps: extracting UA keywords in a user agent UA from DPI data to generate a terminal UA keyword temporary table; screening the acquired terminal data according to a preset support network to obtain a terminal data table; using the terminal UA keyword temporary table and the terminal data table to perform incremental updating on the current terminal labeling table; screening out the screened UA keywords corresponding to the TAC through the occurrence times of the UA keywords corresponding to each TAC in the terminal UA keyword temporary table, and generating a terminal UA keyword filtering table; and carrying out preset association aggregation operation on the terminal UA keyword filter table and the terminal marking table to obtain a terminal TAC matching table. The embodiment of the invention can realize the mapping relation between the TAC and the terminal information by extracting the keyword information in the UA in the DPI and carrying out correlation analysis on the keyword information and the terminal information.

Description

Information relation mapping analysis method, device, equipment and storage medium
Technical Field
The present invention relates to information association technologies, and in particular, to an analysis method, an analysis device, an analysis apparatus, and a storage medium for information relationship mapping.
Background
With the continuous development of the information technology level, more and more intelligent terminal devices emerge in the market, and with the continuous maturity of the 5G technology, the 5G terminal devices are also continuously popularized in hands of more users, and at this time, how to correlate the relationship mapping between the users and the 5G terminal devices becomes a problem to be broken through, such as a lot of data analysis indexes for scene marketing, such as time length analysis of the users when changing the machine, time length analysis of the 5G terminal devices when using the 5G terminal devices, number analysis of the 5G terminals when users in a 5G package use the 5G terminals, and geographic distribution information. In order to expand the analysis possibility of more index dimensions, it is necessary to perform a process of mapping the relationship between the user information and the 5G terminal information, and to expand the point and area.
Currently, a current network generally analyzes Deep Packet Inspection (DPI) data received by a large data platform, but information in the DPI data is complicated and lacks formatting, and a data source is single, information to which a user belongs does not include terminal related information, and data index analysis in a terminal direction cannot be performed, so that a mapping relationship between a Type Allocation Code (TAC) and a terminal needs to be analyzed by associating terminal information.
Disclosure of Invention
The embodiment of the invention provides an analysis method, device and equipment for information relationship mapping and a storage medium, which are used for obtaining a mapping relationship between a TAC and terminal information.
In a first aspect, an embodiment of the present invention provides an analysis method for information relationship mapping, including:
extracting UA keywords in a User Agent (UA) from DPI data of deep packet inspection to generate a terminal UA keyword temporary table; the terminal UA keyword temporary table comprises the equipment name, the type allocation code TAC and UA keywords of each DPI data;
screening the acquired terminal data according to a preset support network to obtain a terminal data table; the terminal data table comprises the equipment name and the support network category of the terminal;
using the terminal UA keyword temporary table and the terminal data table to perform incremental updating on the current terminal labeling table;
screening the screened UA keywords corresponding to the TAC according to the occurrence frequency of the UA keywords corresponding to each TAC in the terminal UA keyword temporary table to generate a terminal UA keyword filtering table;
and carrying out preset association aggregation operation on the terminal UA keyword filtering table and the terminal marking table to obtain a terminal TAC matching table.
In a second aspect, an embodiment of the present invention further provides an apparatus for analyzing information relationship mapping, where the apparatus includes:
the key word temporary table generating module is used for extracting UA key words in the UA of the user agent from the DPI data to generate a terminal UA key word temporary table; the terminal UA keyword temporary table comprises the equipment name, the type allocation code TAC and UA keywords of each DPI data;
the terminal data table generating module is used for screening the acquired terminal data according to a preset support network type to obtain a terminal data table; the terminal data table comprises the equipment name and the support network category of the terminal;
the terminal registry updating module is used for performing incremental updating on the current terminal labeling table by using the terminal UA keyword temporary table and the terminal data table;
a key word filter table generating module, configured to screen out a screened UA key word corresponding to the TAC and generate a terminal UA key word filter table according to the occurrence number of the UA key word corresponding to each TAC in the terminal UA key word temporary table;
and the terminal TAC matching table generating module is used for carrying out preset association aggregation operation on the terminal UA keyword filtering table and the terminal marking table to obtain a terminal TAC matching table.
In a third aspect, an embodiment of the present invention further provides an information relationship mapping analysis device, where the information relationship mapping analysis device includes:
one or more processors;
a memory for storing one or more programs;
when the one or more programs are executed by the one or more processors, the one or more processors implement the method for analyzing information relationship mapping provided by any embodiment of the present invention.
In a fourth aspect, embodiments of the present invention further provide a storage medium containing computer-executable instructions, which when executed by a computer processor, are used to perform the method for analyzing information relationship mapping provided in any of the embodiments of the present invention.
According to the embodiment of the invention, the mapping relation between the TAC and the terminal information is obtained by extracting the keyword information in the UA in the DPI and performing correlation analysis on the keyword information and the terminal information, the problem that data index analysis in the terminal direction cannot be performed only by analyzing DPI data is solved, and the effect of obtaining the mapping relation between the TAC and the terminal information by analyzing is realized.
Drawings
Fig. 1 is a flowchart of an analysis method of information relationship mapping according to a first embodiment of the present invention;
fig. 2 is a flowchart of an analysis method of information relationship mapping in the second embodiment of the present invention;
fig. 3 is a flowchart of an analysis method of information relationship mapping in the third embodiment of the present invention;
fig. 4 is a schematic structural diagram of an analysis apparatus for information relationship mapping according to a fourth embodiment of the present invention;
fig. 5 is a schematic structural diagram of an analysis device for information relationship mapping in the fifth embodiment of the present invention.
Detailed Description
The present invention will be described in further detail with reference to the accompanying drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the invention and are not limiting of the invention. It should be further noted that, for the convenience of description, only some of the structures related to the present invention are shown in the drawings, not all of the structures.
Example one
Fig. 1 is a flowchart of an analysis method for information relationship mapping according to an embodiment of the present invention, where the present embodiment is applicable to a case of analyzing relationship mapping between terminal information and DPI data, and the method may be executed by an analysis apparatus for information relationship mapping, where the apparatus may be implemented by hardware and/or software, and the method specifically includes the following steps:
110, extracting UA keywords in a user agent UA from Deep Packet Inspection (DPI) data to generate a terminal UA keyword temporary table;
the terminal UA keyword temporary table comprises the device name, the type assignment code TAC and the UA keyword of each DPI data. The ticket data of the DPI data can be processed, UA keywords of UA in the ticket are extracted through a custom ETL (Extract-Transform-Load) program, and are incorporated into a library established in advance, for example, a database established through Clickhouse. The terminal UA keyword temporary table may include an end time of each ticket data, an International Mobile Equipment Identity (IMEI), TAC and UA, and may further include an Equipment alias, a brand name, an Equipment formatted model name, a date, and the like.
Step 120, screening the acquired terminal data according to a preset support network to obtain a terminal data table;
the terminal data table comprises the device name and the support network category of the terminal. The terminal information can be crawled through a crawler program and sorted and stored in a database. The method includes the steps that fields of original crawler data such as a medium brand, a model, a terminal support network, a formatted model name and a terminal alias are required to be screened, data screening is required to be carried out according to the field of the terminal support network, for example, a user using a 5G terminal carries out model name formatting operation, the formatting mode is that contents in brackets in terminal information, including information such as versions, internal memories and storage sizes, are removed according to the model names, and only preset information such as Iphone12 and Huazhimate 40 is reserved. Optionally, the screening, according to the preset support network, the obtained terminal data to obtain a terminal data table, includes: and comparing the support network category in each piece of terminal data with the preset support network category, reserving the terminal data with the same support network category as the preset support network category, and recording the terminal data in a terminal data table. The preset support network can be 5G, 4G and the like.
Step 130, using the terminal UA keyword temporary table and the terminal data table to perform incremental updating on the current terminal labeling table;
wherein existing prepared tag data may be imported into the terminal registry and the terminal registry stored in the database. When the terminal UA keyword temporary table is obtained, the data of the terminal UA keyword temporary table can be imported into the terminal registry, so that incremental updating of the data is realized, and no repeated UA keywords are stored in a database.
Step 140, screening out screening UA keywords corresponding to the TAC through the occurrence number of the UA keywords corresponding to each TAC in the terminal UA keyword temporary table, and generating a terminal UA keyword filter table;
the TAC and UA keywords in the terminal UA keyword temporary table are subjected to aggregation analysis, the number of times of occurrence of each UA keyword corresponding to each TAC is calculated, the UA keyword with the highest number of times is used as a screening UA keyword, and a final association operation result is written into the terminal UA keyword filter table. Optionally, the screening UA keywords corresponding to the TAC are screened out through the occurrence number of the UA keywords corresponding to each TAC in the terminal UA keyword temporary table, and a terminal UA keyword filter table is generated, including: calculating the occurrence frequency of UA keywords corresponding to each TAC; using the UA key words with the most occurrence times as corresponding screening key words of the TAC; and recording the screened keywords in a terminal UA keyword filtering table.
And 150, carrying out preset association aggregation operation on the terminal UA keyword filter table and the terminal label table to obtain a terminal TAC matching table.
And performing grouping, aggregation operation, TopN algorithm and other associated operations on the terminal UA keyword filter table and the terminal label table, and finally aggregating a data terminal table and a terminal TAC matching table. The terminal TAC matching table may include field information such as TAC, brand name, device name, and date.
According to the technical scheme of the embodiment, the mapping relation between the TAC and the terminal information is obtained by extracting the keyword information in the UA in the DPI and performing correlation analysis on the keyword information and the terminal information, the problem that data index analysis in the terminal direction cannot be performed only by analyzing the DPI data is solved, and the effect of obtaining the mapping relation between the TAC and the terminal information through analysis is achieved.
Example two
Fig. 2 is a flowchart of an analysis method for information relationship mapping according to a second embodiment of the present invention, which is further detailed based on the foregoing technical solution, and the method includes:
step 210, screening DPI data, and filtering null values and invalid data;
the invalid data is DPI data outside a preset terminal system. And performing primary data screening on the UA, and filtering out { null value and invalid data }, wherein the invalid data is subjected to regular matching through terminal operating system information in the UA, and finally reserving DPI data which is a preset terminal system in the UA, for example, the preset terminal system can be an IOS and an Android system.
Step 220, extracting UA keywords in the UA of the user agent from the DPI data to generate a terminal UA keyword temporary table;
step 230, screening the acquired terminal data according to a preset support network to obtain a terminal data table;
step 240, using the terminal UA keyword temporary table and the terminal data table to perform incremental update on the current terminal labeling table;
step 250, screening out screening UA keywords corresponding to the TAC according to the occurrence frequency of the UA keywords corresponding to each TAC in the terminal UA keyword temporary table, and generating a terminal UA keyword filtering table;
and step 260, carrying out preset association aggregation operation on the terminal UA keyword filter table and the terminal label table to obtain a terminal TAC matching table.
Optionally, a preset association aggregation operation is performed on the terminal UA keyword filter table and the terminal label table to obtain a terminal TAC matching table, which includes:
aggregating the terminal UA keyword filter table and the terminal label table to obtain an incremental terminal label table;
calculating a UA keyword uniquely corresponding to the TAC in the incremental terminal labeling table based on a TopN algorithm;
and performing series updating through a preset terminal information field to obtain the relation mapping between the TAC and the terminal information, and recording the relation mapping into a terminal TAC matching table.
And finally, realizing the mapping relation between the TAC and the terminal information by integrating the terminal UA keyword filter table and the terminal label table together through aggregation, TopN operation and series updating. And the terminal UA keyword filtering table and the terminal labeling table are aggregated. And filtering TAC data, and calculating the unique value of the UA keyword corresponding to the TAC according to the TopN algorithm. And finally, performing serial updating through the preset equipment information field to obtain the relation mapping between the TAC and the terminal, so as to realize the relation mapping between the TAC and the terminal information. The preset terminal information field may be a terminal name field, etc.
EXAMPLE III
Fig. 3 is a flowchart of an analysis method for information relationship mapping according to a third embodiment of the present invention, where the present embodiment is a specific implementation manner based on the foregoing technical solution, but is not limited to the following implementation manner, and the method specifically includes:
step 301, extracting DPI data, wherein an example of data required by the process is as follows:
Figure BDA0003010500680000081
the data volume of the existing network for one day is selected as a demonstration sample, i.e. IMEI (International Mobile Equipment Identity).
And 302, screening data of the DPI data aiming at the Useragent, and filtering out { null value and invalid data }, wherein the invalid data is completed by performing regular matching on terminal operating system information in the Useragent, and finally an IOS and Android system is reserved.
Step 303, generating a terminal UA keyword temporary table (device _ UA _ keyword _ all) by the DPI data obtained in the above two steps, which is exemplified as follows:
Figure BDA0003010500680000091
wherein, device _ alias is terminal alias, vendor _ name is brand, and device _ format is formatted model name.
Step 304, performing TAC and UA keyword aggregation analysis on the terminal UA keyword temporary table generated in step 303 to generate a terminal UA keyword filter table, that is, calculating the UA keyword that is the most accurate for each TAC, and keeping the optimal solution through continuous data update by calculating the number of times that each keyword of each TAC appears, with the highest data as the standard, where an example of the process result is as follows:
device_alias tac pv day
X20A 01353400 11 20201103
IPHONE6,1 01398700 11 20201103
_VSIMLL 15708549 11 20201103
step 311, arranging and storing the crawled terminal information in a warehouse, wherein an example of the implementation process is as follows: **
date;ls crawler_original.csv|xargs-i-t cat{}|clickhouse-client--database=hdfs--format_csv_delimiter'|'--query="INSERT INTO hdfs.device_crawler_original FORMAT CSV";date
Step 312, inputting the terminal crawler data into a crawler data original table (crawler _ original);
313, screening original crawler data, and screening fields such as brands, models, terminal support networks, formatted model names, terminal aliases, and the like, wherein data screening needs to be performed according to the field of the terminal support networks, a user using the 5G terminal performs model name formatting operation, the formatting mode is to remove all the contents in brackets according to the model names, including information such as versions, memories, storage sizes, and the like, and only essence is reserved, such as Iphone12, and Hua is mate 40.
Step 314, inputting the screened data into a terminal crawler data format table (device _ crawler _ format), and storing in a database, which is exemplified as follows:
Figure BDA0003010500680000101
step 315, importing the existing prepared tag data (terminal UA keyword temporary table) into a terminal label table device _ tagging; the method mainly aims to realize incremental updating of data and ensure that no repeated UA keywords are put into a database; and performing frequency calculation on each UA keyword (namely calculating the occurrence frequency of each keyword);
step 316, as a result of step 315, the process of step 316 is to perform aggregation update on the device _ crawler _ format and the device _ tagging of the terminal crawler format table, and update the two data sources to the latest state, that is, to synchronize data, which is as follows:
Figure BDA0003010500680000111
305, integrating multi-table aggregation, TopN operation and series updating together to finally realize the mapping relation between the TAC and the terminal information;
the method comprises the steps that a keyword filtering table device _ ua _ keyword _ filter and a terminal labeling table device _ tagging are aggregated; filtering TAC data, and calculating a unique value of the TAC according to the TopN algorithm; and finally, performing serial updating through the Alias field to obtain the relation mapping between the TAC and the terminal, and ending the process.
The results set sample is as follows:
Tac Vendor_name Device_name day
00000008 Huawei huachen nova6SE 20201103
00100115 OPPO OPPOR9 20201103
Example four
Fig. 4 is a schematic structural diagram of an information relationship mapping analysis apparatus according to a fourth embodiment of the present invention, where the apparatus may be disposed in an information relationship mapping analysis device, and the information relationship mapping analysis device may be a computer device such as a server, and the apparatus specifically includes:
a keyword temporary table generating module 410, configured to extract a UA keyword in a UA of a user agent from Deep Packet Inspection (DPI) data, and generate a terminal UA keyword temporary table; the terminal UA keyword temporary table comprises the equipment name, the type allocation code TAC and UA keywords of each DPI data;
the terminal data table generating module 420 is configured to screen the acquired terminal data according to a preset support network type to obtain a terminal data table; the terminal data table comprises the equipment name and the support network category of the terminal;
a terminal registry updating module 430, configured to use the terminal UA keyword temporary table and the terminal data table to perform incremental updating on the current terminal labeling table;
a keyword filter table generating module 440, configured to screen out the screened UA keywords corresponding to the TAC and generate a terminal UA keyword filter table according to the occurrence number of the UA keywords corresponding to each TAC in the terminal UA keyword temporary table;
and the terminal TAC matching table generating module 450 is configured to perform preset association aggregation operation on the terminal UA keyword filtering table and the terminal labeling table to obtain a terminal TAC matching table.
According to the technical scheme of the embodiment, the mapping relation between the TAC and the terminal information is obtained by extracting the keyword information in the UA in the DPI and performing correlation analysis on the keyword information and the terminal information, the problem that data index analysis in the terminal direction cannot be performed only by analyzing the DPI data is solved, and the effect of obtaining the mapping relation between the TAC and the terminal information through analysis is achieved.
Optionally, the analysis apparatus for information relationship mapping further includes:
the DPI data screening module is used for screening DPI data before UA keywords in a user agent UA are extracted from deep packet inspection DPI data and a terminal UA keyword temporary table is generated, and filtering null values and invalid data; the invalid data is DPI data outside a preset terminal system.
Optionally, the keyword filtering table generating module is specifically configured to:
calculating the occurrence frequency of UA keywords corresponding to each TAC;
using the UA key words with the most occurrence times as corresponding screening key words of the TAC;
and recording the screened keywords in a terminal UA keyword filtering table.
Optionally, the terminal data table generating module 420 is specifically configured to:
and comparing the support network category in each piece of terminal data with the preset support network category, reserving the terminal data with the same support network category as the preset support network category, and recording the terminal data in a terminal data table.
Optionally, the terminal TAC matching table generating module 450 is specifically configured to:
aggregating the terminal UA keyword filter table and the terminal label table to obtain an incremental terminal label table;
calculating a UA keyword uniquely corresponding to the TAC in the incremental terminal labeling table based on a TopN algorithm;
and performing series updating through a preset terminal information field to obtain the relation mapping between the TAC and the terminal information, and recording the relation mapping into a terminal TAC matching table.
The analysis device for information relationship mapping provided by the embodiment of the invention can execute the analysis method for information relationship mapping provided by any embodiment of the invention, and has corresponding functional modules and beneficial effects of the execution method.
EXAMPLE five
Fig. 5 is a schematic structural diagram of an information relationship mapping analysis apparatus according to a fifth embodiment of the present invention, as shown in fig. 5, the information relationship mapping analysis apparatus includes a processor 510, a memory 520, an input device 530, and an output device 540; the number of the processors 510 in the analysis device of the information relationship mapping may be one or more, and one processor 510 is taken as an example in fig. 5; the processor 510, the memory 520, the input device 530 and the output device 540 in the analysis apparatus of the information relationship mapping may be connected by a bus or other means, and fig. 5 illustrates an example of connection by a bus.
The memory 520 is a computer-readable storage medium, and can be used for storing software programs, computer-executable programs, and modules, such as program instructions/modules corresponding to the analysis method of the information relationship mapping in the embodiment of the present invention (for example, the keyword temporary table generation module 410, the terminal data table generation module 420, the terminal registry update module 430, the keyword filter table generation module 440, and the terminal TAC matching table generation module 450 in the analysis apparatus of the information relationship mapping). The processor 510 executes various functional applications and data processing of the analysis apparatus for information relationship mapping by executing software programs, instructions and modules stored in the memory 520, that is, implements the analysis method for information relationship mapping described above.
The memory 520 may mainly include a program storage area and a data storage area, wherein the program storage area may store an operating system, an application program required for at least one function; the storage data area may store data created according to the use of the terminal, and the like. Further, the memory 520 may include high speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid state storage device. In some examples, memory 520 may further include memory located remotely from processor 510, which may be connected to an information relationship mapped analysis device via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The input means 530 may be used to receive input numeric or character information and generate key signal inputs related to user settings and function control of the analysis device of the information relationship mapping. The output device 540 may include a display device such as a display screen.
EXAMPLE six
An embodiment of the present invention further provides a storage medium containing computer-executable instructions, which when executed by a computer processor, are configured to perform a method for analyzing an information relationship map, including:
extracting UA keywords in a user agent UA from Deep Packet Inspection (DPI) data to generate a terminal UA keyword temporary table; the terminal UA keyword temporary table comprises the equipment name, the type allocation code TAC and UA keywords of each DPI data;
screening the acquired terminal data according to a preset support network to obtain a terminal data table; the terminal data table comprises the equipment name and the support network category of the terminal;
using the terminal UA keyword temporary table and the terminal data table to perform incremental updating on the current terminal labeling table;
screening the screened UA keywords corresponding to the TAC according to the occurrence frequency of the UA keywords corresponding to each TAC in the terminal UA keyword temporary table to generate a terminal UA keyword filtering table;
and carrying out preset association aggregation operation on the terminal UA keyword filtering table and the terminal marking table to obtain a terminal TAC matching table.
Of course, the storage medium provided by the embodiment of the present invention contains computer-executable instructions, and the computer-executable instructions are not limited to the method operations described above, and may also perform related operations in the analysis method of information relationship mapping provided by any embodiment of the present invention.
From the above description of the embodiments, it is obvious for those skilled in the art that the present invention can be implemented by software and necessary general hardware, and certainly, can also be implemented by hardware, but the former is a better embodiment in many cases. Based on such understanding, the technical solutions of the present invention may be embodied in the form of a software product, which can be stored in a computer-readable storage medium, such as a floppy disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a FLASH Memory (FLASH), a hard disk or an optical disk of a computer, and includes several instructions for enabling a computer device (which may be a personal computer, a server, or a network device) to execute the methods according to the embodiments of the present invention.
It should be noted that, in the embodiment of the analysis apparatus for information relationship mapping, each unit and each module included in the analysis apparatus are only divided according to functional logic, but are not limited to the above division, as long as the corresponding function can be implemented; in addition, specific names of the functional units are only for convenience of distinguishing from each other, and are not used for limiting the protection scope of the present invention.
It is to be noted that the foregoing is only illustrative of the preferred embodiments of the present invention and the technical principles employed. It will be understood by those skilled in the art that the present invention is not limited to the particular embodiments described herein, but is capable of various obvious changes, rearrangements and substitutions as will now become apparent to those skilled in the art without departing from the scope of the invention. Therefore, although the present invention has been described in greater detail by the above embodiments, the present invention is not limited to the above embodiments, and may include other equivalent embodiments without departing from the spirit of the present invention, and the scope of the present invention is determined by the scope of the appended claims.

Claims (10)

1. An analysis method for information relationship mapping, comprising:
extracting UA keywords in a user agent UA from Deep Packet Inspection (DPI) data to generate a terminal UA keyword temporary table; the terminal UA keyword temporary table comprises the equipment name, the type allocation code TAC and UA keywords of each DPI data;
screening the acquired terminal data according to a preset support network to obtain a terminal data table; the terminal data table comprises the equipment name and the support network category of the terminal;
using the terminal UA keyword temporary table and the terminal data table to perform incremental updating on the current terminal labeling table;
screening the screened UA keywords corresponding to the TAC according to the occurrence frequency of the UA keywords corresponding to each TAC in the terminal UA keyword temporary table to generate a terminal UA keyword filtering table;
and carrying out preset association aggregation operation on the terminal UA keyword filtering table and the terminal marking table to obtain a terminal TAC matching table.
2. The method according to claim 1, further comprising, before said extracting the UA keyword in the UA of the user agent from the Deep Packet Inspection (DPI) data to generate the terminal UA keyword temporary table:
screening the DPI data, and filtering out null values and invalid data; and the invalid data is DPI data outside a preset terminal system.
3. The method of claim 1, wherein the generating a terminal UA keyword filter table by filtering out the filtered UA keywords corresponding to the TAC through the number of occurrences of the UA keywords corresponding to each TAC in the terminal UA keyword temporary table comprises:
calculating the occurrence frequency of the UA keyword corresponding to each TAC;
taking the UA keyword with the largest occurrence number as the corresponding screening keyword of the TAC;
and recording the screened keywords in the terminal UA keyword filtering table.
4. The method according to claim 1, wherein the screening the acquired terminal data according to the preset support network to obtain a terminal data table comprises:
and comparing the support network category in each piece of terminal data with the preset support network category, reserving the terminal data with the same support network category as the preset support network category, and recording the terminal data in the terminal data table.
5. The method according to claim 1, wherein the performing a preset association aggregation operation on the terminal UA keyword filter table and the terminal label table to obtain a terminal TAC matching table comprises:
aggregating the terminal UA keyword filter table and the terminal label table to obtain the incremental terminal label table;
calculating a unique UA keyword corresponding to the TAC in the incremental terminal labeling table based on a TopN algorithm;
and performing series updating through a preset terminal information field to obtain the relation mapping between the TAC and the terminal information, and recording the relation mapping into the TAC matching table of the terminal.
6. An apparatus for analyzing information relationship mapping, comprising:
the key word temporary table generating module is used for extracting UA key words in the UA of the user agent from the DPI data to generate a terminal UA key word temporary table; the terminal UA keyword temporary table comprises the equipment name, the type allocation code TAC and UA keywords of each DPI data;
the terminal data table generating module is used for screening the acquired terminal data according to a preset support network type to obtain a terminal data table; the terminal data table comprises the equipment name and the support network category of the terminal;
the terminal registry updating module is used for performing incremental updating on the current terminal labeling table by using the terminal UA keyword temporary table and the terminal data table;
a key word filter table generating module, configured to screen out a screened UA key word corresponding to the TAC and generate a terminal UA key word filter table according to the occurrence number of the UA key word corresponding to each TAC in the terminal UA key word temporary table;
and the terminal TAC matching table generating module is used for carrying out preset association aggregation operation on the terminal UA keyword filtering table and the terminal marking table to obtain a terminal TAC matching table.
7. The apparatus of claim 6, further comprising:
a DPI data screening module, configured to screen DPI data to filter null values and invalid data before a UA keyword in a user agent UA is extracted from Deep Packet Inspection (DPI) data and a terminal UA keyword temporary table is generated; and the invalid data is DPI data outside a preset terminal system.
8. The apparatus of claim 6, wherein the keyword filter table generating module is specifically configured to:
calculating the occurrence frequency of the UA keyword corresponding to each TAC;
taking the UA keyword with the largest occurrence number as the corresponding screening keyword of the TAC;
and recording the screened keywords in the terminal UA keyword filtering table.
9. An information relationship mapping analysis apparatus, characterized by comprising:
one or more processors;
a memory for storing one or more programs;
when executed by the one or more processors, cause the one or more processors to implement the method of analyzing an information relationship map as recited in any of claims 1-5.
10. A storage medium containing computer-executable instructions for performing the method of analyzing an information relationship map according to any one of claims 1-5 when executed by a computer processor.
CN202110374166.XA 2021-04-07 2021-04-07 Information relation mapping analysis method, device, equipment and storage medium Active CN113076316B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110374166.XA CN113076316B (en) 2021-04-07 2021-04-07 Information relation mapping analysis method, device, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110374166.XA CN113076316B (en) 2021-04-07 2021-04-07 Information relation mapping analysis method, device, equipment and storage medium

Publications (2)

Publication Number Publication Date
CN113076316A true CN113076316A (en) 2021-07-06
CN113076316B CN113076316B (en) 2023-12-19

Family

ID=76615425

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110374166.XA Active CN113076316B (en) 2021-04-07 2021-04-07 Information relation mapping analysis method, device, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN113076316B (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE102013004795A1 (en) * 2012-03-21 2013-09-26 Gabriele Trinkel Method for generating noise for noise generator for generating random numbers, passwords in computer technology, cloud computing, involves generating true random number for processing or transporting electric binary data
CN103747002A (en) * 2014-01-16 2014-04-23 电信科学技术第一研究所 Communication terminal capacity management system and terminal capacity management method
DE102012022894A1 (en) * 2012-11-23 2014-05-28 Gabriele Lisa Trinkel System for identification, verification and/or authentication of projectile e.g. railgun projectile, has sensor, communication unit, processing unit and power supply or power generation unit which are arranged in housing of projectile
CN105260365A (en) * 2014-06-04 2016-01-20 中国移动通信集团宁夏有限公司 Terminal information processing method and device
CN107105428A (en) * 2016-02-23 2017-08-29 中国移动通信集团内蒙古有限公司 The method and device in quick completion end message storehouse
CN107301192A (en) * 2016-04-14 2017-10-27 中国移动通信集团河北有限公司 A kind of terminal identification method and identification server
US20200351762A1 (en) * 2019-05-02 2020-11-05 Nokia Technologies Oy Method and apparatus for support of migration and co-existence of public land mobile network and user equipment capability identifications

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE102013004795A1 (en) * 2012-03-21 2013-09-26 Gabriele Trinkel Method for generating noise for noise generator for generating random numbers, passwords in computer technology, cloud computing, involves generating true random number for processing or transporting electric binary data
DE102012022894A1 (en) * 2012-11-23 2014-05-28 Gabriele Lisa Trinkel System for identification, verification and/or authentication of projectile e.g. railgun projectile, has sensor, communication unit, processing unit and power supply or power generation unit which are arranged in housing of projectile
CN103747002A (en) * 2014-01-16 2014-04-23 电信科学技术第一研究所 Communication terminal capacity management system and terminal capacity management method
CN105260365A (en) * 2014-06-04 2016-01-20 中国移动通信集团宁夏有限公司 Terminal information processing method and device
CN107105428A (en) * 2016-02-23 2017-08-29 中国移动通信集团内蒙古有限公司 The method and device in quick completion end message storehouse
CN107301192A (en) * 2016-04-14 2017-10-27 中国移动通信集团河北有限公司 A kind of terminal identification method and identification server
US20200351762A1 (en) * 2019-05-02 2020-11-05 Nokia Technologies Oy Method and apparatus for support of migration and co-existence of public land mobile network and user equipment capability identifications

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
JENNY LU 等: "Maintaining a Robust Device Identity System: Introducing the GSMA TAC and IMEI Integrity Framework", pages 1, Retrieved from the Internet <URL:https://www.gsma.com/services/blog/tac-and-imei-integrity-framework/> *
全俊斌 等: "基于信令数据的移动物联网终端识别特征研究", 物联网技术, pages 101 - 103 *
夏日微风SUMMERBREEZE: "手机TAC码介绍", pages 1, Retrieved from the Internet <URL:https://blog.csdn.net/jbhand/article/details/78207036> *

Also Published As

Publication number Publication date
CN113076316B (en) 2023-12-19

Similar Documents

Publication Publication Date Title
CN111708860A (en) Information extraction method, device, equipment and storage medium
CN110472068A (en) Big data processing method, equipment and medium based on heterogeneous distributed knowledge mapping
CN103425687A (en) Retrieval method and system based on queries
CN107590291A (en) A kind of searching method of picture, terminal device and storage medium
CN104462396B (en) Character string processing method and device
CN107679208A (en) A kind of searching method of picture, terminal device and storage medium
CN103714086A (en) Method and device used for generating non-relational data base module
CN109388659B (en) Data storage method, device and computer readable storage medium
CN110765195A (en) Data analysis method and device, storage medium and electronic equipment
CN110909168A (en) Knowledge graph updating method and device, storage medium and electronic device
CN104317850A (en) Data processing method and device
CN112463986A (en) Information storage method and device
CN115438740A (en) Multi-source data convergence and fusion method and system
CN112395425A (en) Data processing method and device, computer equipment and readable storage medium
CN109117467A (en) Generation method, system, equipment and the medium of configurable dynamic data report
CN111625567A (en) Data model matching method, device, computer system and readable storage medium
CN107368500B (en) Data extraction method and system
CN112069305B (en) Data screening method and device and electronic equipment
CN113704343A (en) Data blood margin visualization implementation method and system in data processing
CN110704635B (en) Method and device for converting triplet data in knowledge graph
CN112634004A (en) Blood margin map analysis method and system for credit investigation data
CN113076316B (en) Information relation mapping analysis method, device, equipment and storage medium
CN104991920A (en) Label generation method and apparatus
CN111475505B (en) Data acquisition method and device
CN105740260A (en) Method and device for extracting template file data structure

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant