CN105657003B - Information processing method and server - Google Patents


Publication number
CN105657003B
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201511004500.3A
Other languages
Chinese (zh)
Other versions
CN105657003A (en)
Inventor
章国瑾
陶帅
李彪
王宇
唐少刚
丁道峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Application filed by Tencent Technology Shenzhen Co Ltd
Priority to CN201511004500.3A
Publication of CN105657003A
Application granted
Publication of CN105657003B

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/50 Network services
    • H04L67/55 Push-based network services
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00 Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60 Network streaming of media packets
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00 Commerce
    • G06Q30/02 Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241 Advertisements
    • G06Q30/0251 Targeted advertisements
    • G06Q30/0257 User requested

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

An embodiment of the invention discloses an information processing method and a server. The method comprises: acquiring and storing reporting information of a terminal, wherein the reporting information comprises a terminal identifier and a media information bit identifier; when a user portrait function request for an application is received, acquiring, based on the application identifier, a media information bit identifier corresponding to the application identifier; querying the stored reporting information to obtain a terminal identifier corresponding to the media information bit identifier, and determining user information of the terminal based on the terminal identifier; querying a stored user portrait data set to obtain a first user portrait data set matching the user information; and analyzing data in the first user portrait data set according to preset dimension parameters to obtain the data volume corresponding to each dimension parameter, and generating crowd portrait data of the application based on the data volume corresponding to each dimension parameter.

Description

Information processing method and server
Technical Field
The invention relates to an information processing technology, in particular to an information processing method and a server.
Background
In the process of implementing the technical solution of the embodiments of the present application, the inventors found at least the following technical problem in the related art:
Existing media information pushing systems generally lack the ability to build portraits of client users and to analyze their data. As a result, after such a system pushes media information (for example, actively pushed advertising content) to a client, the presented media information may not interest the client user: the user may not open it, or may open it without performing any further operation (such as a download or purchase) on the content it presents. The directional pushing of media information is therefore inaccurate. No effective solution to this problem exists in the related art.
Disclosure of Invention
In order to solve the existing technical problem, embodiments of the present invention provide an information processing method and a server, which can implement accurate directional push of media information.
In order to achieve the above purpose, the technical solution of the embodiment of the present invention is realized as follows:
the embodiment of the invention provides an information processing method, which comprises the following steps:
acquiring and storing reporting information of a terminal, wherein the reporting information comprises a terminal identifier and a media information bit identifier;
when a user portrait function request aiming at an application is received, acquiring a media information bit identifier corresponding to the application identifier based on the application identifier;
inquiring the stored reported information, obtaining a terminal identifier corresponding to the media information bit identifier, and determining the user information of the terminal based on the terminal identifier;
querying a stored user portrait data set to obtain a first user portrait data set matching the user information;
analyzing data in the first user portrait data set according to preset dimension parameters to obtain data corresponding to each dimension parameter, and generating crowd portrait data of the application based on the data corresponding to each dimension parameter.
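The lookup chain in the steps above (application, then media information bits, then terminals, then users, then portrait records) can be sketched as follows. This is only a minimal illustration: it assumes in-memory dictionaries stand in for the stored reporting information and the user portrait data set, and every name (`APP_SLOTS`, `REPORTS`, `TERMINAL_USERS`, `PORTRAITS`, `first_portrait_set`) and sample value is hypothetical rather than taken from the patent.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical stand-ins for the server's stores (assumptions, not from the patent).
APP_SLOTS: Dict[str, Set[str]] = {"app-1": {"slot-a", "slot-b"}}
# Stored reporting information: (terminal identifier, media information bit identifier).
REPORTS: List[Tuple[str, str]] = [
    ("imei-001", "slot-a"),
    ("imei-002", "slot-b"),
    ("imei-003", "slot-z"),  # reported from a bit that belongs to another application
]
TERMINAL_USERS: Dict[str, str] = {
    "imei-001": "user-1",
    "imei-002": "user-2",
    "imei-003": "user-3",
}
PORTRAITS: Dict[str, Dict[str, object]] = {
    "user-1": {"age": 25, "gender": "female"},
    "user-2": {"age": 34, "gender": "male"},
    "user-3": {"age": 19, "gender": "male"},
}


def first_portrait_set(app_id: str) -> Dict[str, Dict[str, object]]:
    """Resolve application -> bits -> terminals -> users -> portrait records."""
    slots = APP_SLOTS.get(app_id, set())
    terminals = {terminal for terminal, slot in REPORTS if slot in slots}
    users = {TERMINAL_USERS[t] for t in terminals if t in TERMINAL_USERS}
    return {u: PORTRAITS[u] for u in sorted(users) if u in PORTRAITS}


print(first_portrait_set("app-1"))
```

Only the users whose terminals reported one of the application's own media information bits end up in the result; the statistics over the dimension parameters would then run on that subset.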
In the foregoing solution, before the pre-stored user portrait data set is queried, the method includes:
establishing an association between index identifiers and data in the user portrait data set to generate first data, wherein each index identifier is characterized by a preset dimension parameter.
In the foregoing solution, the querying a pre-stored user portrait data set to obtain a first user portrait data set matching the user information includes:
and querying the first data based on the user information, obtaining at least one dimension parameter and a dimension parameter value corresponding to the user information, and generating the first user portrait data set based on the at least one dimension parameter and the dimension parameter value.
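One way to picture the "first data" is as an index keyed by dimension parameter, which is then queried with the user information. The sketch below is an assumption about its shape, not the patent's storage format; `build_index`, `query_user`, and the sample records are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# (user information, dimension parameter, dimension parameter value)
Record = Tuple[str, str, object]


def build_index(records: List[Record]) -> Dict[str, Dict[str, object]]:
    """Associate each index identifier (a dimension parameter) with portrait data."""
    index: Dict[str, Dict[str, object]] = defaultdict(dict)
    for user, dimension, value in records:
        index[dimension][user] = value
    return dict(index)


def query_user(index: Dict[str, Dict[str, object]], user: str) -> Dict[str, object]:
    """Return the dimension parameters and values recorded for one user."""
    return {dim: users[user] for dim, users in index.items() if user in users}


records = [
    ("user-1", "age", 25),
    ("user-1", "gender", "female"),
    ("user-2", "age", 34),
]
index = build_index(records)
print(query_user(index, "user-1"))
```

Keying the index by dimension parameter keeps the later per-dimension counting cheap, because all values of one dimension already sit together.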
In the foregoing solution, the generating crowd portrait data of the application based on the data corresponding to each dimension parameter includes:
counting the data volume corresponding to each dimension parameter value range within each dimension parameter, and calculating the proportion of the data volume of each value range in the total data volume of the corresponding dimension parameter;
and generating the crowd portrait data of the application based on the proportions obtained for each dimension parameter.
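The proportion described above can be written compactly. The symbols are introduced here only for illustration and are not the patent's notation: for a dimension parameter d with value ranges r_1, ..., r_k, let n_{d,i} be the data volume that falls in range r_i, and let p_{d,i} be the share of that range.

```latex
p_{d,i} = \frac{n_{d,i}}{\sum_{j=1}^{k} n_{d,j}}, \qquad \sum_{i=1}^{k} p_{d,i} = 1
```

The set of shares p_{d,1}, ..., p_{d,k}, computed for each dimension parameter d, is what the text refers to as the proportional relation for that dimension.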
In the above scheme, the obtaining and storing the report information of the terminal includes:
obtaining source reporting information transmitted by a terminal in a Software Development Kit (SDK) mode or obtaining source reporting information transmitted by the terminal in a JS code mode;
screening the source reporting information to obtain reporting information;
and storing the report information in association with a preset identifier.
In the above scheme, the method further comprises: adjusting the push proportion of the media information based on the crowd portrait data of the application.
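The patent does not say how the push proportion is adjusted. Purely as an illustration, the sketch below assumes each piece of media information targets one value range of a dimension parameter and receives a push share proportional to that range's share in the crowd portrait. The function `push_shares`, the even-split fallback, and the sample items are all hypothetical.

```python
from typing import Dict


def push_shares(audience_share: Dict[str, float], targets: Dict[str, str]) -> Dict[str, float]:
    """Give each media item a push share proportional to its target group's share."""
    weights = {item: audience_share.get(group, 0.0) for item, group in targets.items()}
    total = sum(weights.values())
    if total == 0:
        # No overlap with the portrait: fall back to an even split (an assumption).
        return {item: 1.0 / len(targets) for item in targets} if targets else {}
    return {item: weight / total for item, weight in weights.items()}


shares = push_shares(
    {"24-30": 0.5, "31-40": 0.3, "41-50": 0.2},   # crowd portrait for the age dimension
    {"ad-games": "24-30", "ad-cars": "31-40"},    # hypothetical media items and targets
)
print(shares)
```

Other policies (thresholds, caps per item, several dimensions at once) would fit the same sentence of the text equally well.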
An embodiment of the present invention further provides a server, which includes a data access module and a multidimensional perspective service module, wherein:
the data access module is used for acquiring and storing the reporting information of the terminal, wherein the reporting information comprises a terminal identifier and a media information bit identifier;
the multidimensional perspective service module is used for: obtaining, when a user portrait function request for an application is received, a media information bit identifier corresponding to the application identifier based on the application identifier; querying the reporting information stored by the data access module to obtain a terminal identifier corresponding to the media information bit identifier, and determining the user information of the terminal based on the terminal identifier; querying a stored user portrait data set to obtain a first user portrait data set matching the user information; and analyzing data in the first user portrait data set according to preset dimension parameters to obtain data corresponding to each dimension parameter, and generating crowd portrait data of the application based on the data corresponding to each dimension parameter.
In the above scheme, the multidimensional perspective service module is further configured to establish, before querying the pre-stored user portrait data set, an association between index identifiers and data in the user portrait data set to generate first data, wherein each index identifier is characterized by a preset dimension parameter.
In the foregoing solution, the multidimensional perspective service module is configured to: query the first data based on the user information to obtain at least one dimension parameter and dimension parameter value corresponding to the user information, and generate the first user portrait data set based on the at least one dimension parameter and dimension parameter value; analyze data in the first user portrait data set according to the at least one dimension parameter to obtain data corresponding to each dimension parameter; count the data volume corresponding to each dimension parameter value range within each dimension parameter, and calculate the proportion of the data volume of each value range in the total data volume of the corresponding dimension parameter; and generate the crowd portrait data of the application based on the proportions obtained for each dimension parameter.
In the above scheme, the multidimensional perspective service module includes an index construction module, a storage unit and a multidimensional data processing module, wherein:
the index construction module is used for establishing an association between index identifiers and data in the user portrait data set to generate first data, and sending the first data to the storage unit, wherein each index identifier is characterized by a preset dimension parameter;
the storage unit is used for storing the first data;
the multidimensional data processing module is used for: querying the first data generated by the index construction module based on the user information to obtain at least one dimension parameter and dimension parameter value corresponding to the user information, and generating the first user portrait data set based on the at least one dimension parameter and dimension parameter value; analyzing data in the first user portrait data set according to the at least one dimension parameter to obtain data corresponding to each dimension parameter; counting the data volume corresponding to each dimension parameter value range within each dimension parameter, and calculating the proportion of the data volume of each value range in the total data volume of the corresponding dimension parameter; and generating the crowd portrait data of the application based on the proportions obtained for each dimension parameter.
In the above scheme, the data access module is configured to obtain source reporting information that is transmitted by the terminal in an SDK mode, or obtain source reporting information that is transmitted by the terminal in a JS code mode; screening the source reporting information to obtain reporting information; and storing the report information in association with a preset identifier.
The information processing method and server of the embodiments of the invention operate as follows: acquiring and storing reporting information of a terminal, wherein the reporting information comprises a terminal identifier and a media information bit identifier; when a user portrait function request for an application is received, acquiring, based on the application identifier, a media information bit identifier corresponding to the application identifier; querying the stored reporting information to obtain a terminal identifier corresponding to the media information bit identifier, and determining the user information of the terminal based on the terminal identifier; querying a pre-stored user portrait data set to obtain a first user portrait data set matching the user information; and analyzing data in the first user portrait data set according to preset dimension parameters to obtain data corresponding to each dimension parameter, and generating crowd portrait data of the application based on the data corresponding to each dimension parameter. With this technical solution, mass source data is collected and, in response to user portrait function requests for different applications, the corresponding data is retrieved and then analyzed and counted according to the preset dimension parameters to obtain the corresponding crowd portrait data. The crowd portrait data of the different types of media information bits in an application can thus be calculated accurately, the push strategy of the media information can be adjusted based on the crowd portrait data, and the accuracy of the directional pushing of media information is greatly improved.
Drawings
Fig. 1 is a schematic structural diagram of a server according to a first embodiment of the present invention;
Fig. 2 is a schematic application diagram of a data access module in a server according to an embodiment of the present invention;
Fig. 3 is a schematic structural diagram of a perspective service module in a server according to an embodiment of the present invention;
Fig. 4 is a schematic application diagram of a perspective service module in a server according to an embodiment of the present invention;
Fig. 5 is a schematic diagram of an application architecture of a data access module and a perspective service module in a server according to an embodiment of the present invention;
Fig. 6 is a flowchart of an information processing method according to an embodiment of the present invention;
Fig. 7 is a schematic flowchart of an information processing method according to an embodiment of the present invention;
Fig. 8a to Fig. 8d are schematic diagrams of user portrait data obtained by an information processing method according to an embodiment of the present invention.
Detailed Description
The present invention will be described in further detail with reference to the accompanying drawings and specific embodiments.
Example one
An embodiment of the invention provides a server. Fig. 1 is a schematic structural diagram of a server according to a first embodiment of the present invention. As shown in Fig. 1, the server includes a data access module 11 and a multidimensional perspective service module 12, wherein:
the data access module 11 is configured to obtain and store reporting information of a terminal, where the reporting information includes a terminal identifier and a media information bit identifier;
the multidimensional perspective service module 12 is configured to: obtain, when a user portrait function request for an application is received, a media information bit identifier corresponding to the application identifier based on the application identifier; query the reporting information stored in the data access module 11 to obtain a terminal identifier corresponding to the media information bit identifier, and determine the user information of the terminal based on the terminal identifier; query a stored user portrait data set to obtain a first user portrait data set matching the user information; and analyze data in the first user portrait data set according to preset dimension parameters to obtain data corresponding to each dimension parameter, and generate crowd portrait data of the application based on the data corresponding to each dimension parameter.
The server described in the embodiment of the present invention may specifically be a single server or a server cluster.
In this embodiment, the data access module 11 is configured to obtain source reporting information that is transmitted by the terminal in a Software Development Kit (SDK) manner, or obtain source reporting information that is transmitted by the terminal in a JS code manner; screening the source reporting information to obtain reporting information; and storing the report information in association with a preset identifier.
That is, the data access module 11 may obtain the reporting information of the terminal either through code (e.g., JS code) or through a Software Development Kit (SDK). Specifically, this embodiment addresses the pushing of media information in a mobile application, where the mobile application is any application running on a mobile terminal. The media information may specifically be advertisement content; that is, this embodiment is an application scenario of advertisement pushing in a mobile application. In practice, any application has at least one media information presentation bit. When media information is displayed, the application reports information to the server through its built-in service logic. When the application runs on the iOS or Android operating system, the source reporting information is transmitted in the SDK mode; when the application is built with HTML5 (H5), the source reporting information is transmitted in the JS code mode. The source reporting information obtained by the data access module 11 includes a terminal identifier and a media information bit identifier; when the application has multiple media information presentation bits, the source reporting information includes a terminal identifier and multiple media information bit identifiers. The terminal identifier may specifically be an International Mobile Equipment Identity (IMEI); when the media information is advertisement content, the media information bit identifier may specifically be an advertisement bit identifier.
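As a rough picture of what a piece of source reporting information might carry and how the reporting channel follows from the platform, consider the sketch below. The field names, the `SourceReport` class, and the `channel_for` helper are assumptions made for illustration; the patent only states that the report contains a terminal identifier and one or more media information bit identifiers.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class SourceReport:
    terminal_id: str                                    # e.g. an IMEI
    slot_ids: List[str] = field(default_factory=list)   # media information bit identifiers
    channel: str = "sdk"                                # "sdk" or "js"


def channel_for(platform: str) -> str:
    """Pick the reporting channel: SDK for native iOS/Android apps, JS for H5 apps."""
    platform = platform.lower()
    if platform in ("ios", "android"):
        return "sdk"
    if platform == "h5":
        return "js"
    raise ValueError(f"unsupported platform: {platform}")


report = SourceReport("imei-001", ["slot-a", "slot-b"], channel_for("Android"))
print(report)
```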
Further, in this embodiment, the data access module 11 may screen the source reporting information according to a preset identification method: invalid data such as error data and illegal data (i.e., cheating data) is removed and valid data is retained, thereby obtaining the reporting information. The source reporting information and the reporting information are the same type of information; the difference is that the source reporting information is the data before identification, whereas the reporting information is the data remaining after invalid data is removed. Further, the data access module 11 stores the reporting information in association with a preset identifier to facilitate subsequent data query and analysis. The preset identifiers are associated with users, i.e., each preset identifier represents one user. The preset identifier is different from the user's application account, so that the application account (for example, a QQ number) is not used directly as the identifier of the user's reporting information and the user's privacy is not revealed through it.
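The screening and storage step can be sketched as below. The patent does not define the "preset identification method", so the rules here are assumptions: a report with a missing field counts as error data, and cheating data is represented by a hypothetical blocklist of terminals. The preset identifier is modeled as an opaque server-side key so that the application account never appears in the stored records. All names are illustrative.

```python
import itertools
from typing import Dict, List, Set

_next_id = itertools.count(1)
_preset_ids: Dict[str, str] = {}  # application account -> preset identifier (server side only)


def preset_identifier(account: str) -> str:
    """Return an opaque identifier for the account, creating one on first use."""
    if account not in _preset_ids:
        _preset_ids[account] = f"pid-{next(_next_id):06d}"
    return _preset_ids[account]


def screen(source_reports: List[dict], blocked_terminals: Set[str]) -> List[dict]:
    """Drop error data (missing fields) and illegal data (blocked terminals)."""
    valid = []
    for report in source_reports:
        if not report.get("terminal_id") or not report.get("slot_ids"):
            continue  # error data
        if report["terminal_id"] in blocked_terminals:
            continue  # illegal (cheating) data, per the assumed blocklist
        valid.append(report)
    return valid


def store(reports: List[dict], accounts: Dict[str, str]) -> Dict[str, List[dict]]:
    """Store each report under the preset identifier instead of the account."""
    stored: Dict[str, List[dict]] = {}
    for report in reports:
        account = accounts.get(report["terminal_id"])
        if account is None:
            continue  # no known user for this terminal
        stored.setdefault(preset_identifier(account), []).append(report)
    return stored


source = [
    {"terminal_id": "imei-001", "slot_ids": ["slot-a"]},
    {"terminal_id": "", "slot_ids": ["slot-a"]},          # error data
    {"terminal_id": "imei-666", "slot_ids": ["slot-b"]},  # blocked terminal
]
stored = store(screen(source, {"imei-666"}), {"imei-001": "account-12345"})
print(stored)
```

Only the first report survives screening, and it is filed under a generated key rather than under `account-12345`.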
Fig. 2 is a schematic application diagram of the data access module 11 in the server according to the embodiment of the present invention. As shown in Fig. 2, the data obtained by the data access module 11 includes at least first party data and second party data. The first party data may be the reporting data transmitted in the JS mode, the reporting data transmitted in the SDK mode, and the exposure and click data of the media information obtained by the data access module 11 (the exposure and click data may be stored as an exposure log). The data access module 11 is preconfigured with a first data collection tool for data transmission (e.g., the octopus tool shown in Fig. 2). The second party data is data that is obtained from other devices and can be used for user portrait analysis. The other devices and the server belong to heterogeneous systems, which may be understood as systems whose functions or responsibilities differ from those of the server, or systems supported by services from a different service provider. The second party data may be, for example, first data characterizing customer relationships and second data characterizing the marketing of media information. A second data collection tool for communicating with the other devices is also preconfigured (e.g., the Datahub tool shown in Fig. 2).
Referring to Fig. 2, after obtaining the source data (including the first party data and the second party data), the data access module 11 cleans and integrates it; that is, it identifies the source data according to a preset identification method, removes invalid data such as error data and illegal data (i.e., cheating data), and retains valid data. It also performs effect measurement on the source data, that is, it uses the related business logic to determine whether the source data (including the first party data and the second party data) can support user portrait analysis. In addition, the first data is stored in association with the preset identifier, which on one hand facilitates data statistics and analysis and on the other hand facilitates statistical indexing of the data, so that a business report showing the distribution of various data about client users can be provided.
In this embodiment, when the multidimensional perspective service module 12 receives a user portrait request for an application, it obtains the media information bit identifier corresponding to the application identifier based on the application identifier. The user portrait request may be triggered by the user portrait function unit of the mobile alliance platform at the web layer, as shown in Fig. 3, in order to enable the user portrait function for any application. Specifically, as shown in Fig. 2, the multidimensional perspective service module 12 uses the application identifier to query the media information exposure log stored in the data access module 11, which contains the exposure and click data of the media information, and obtains the media information bit identifiers associated with the application identifier, that is, the media information bit identifiers of the media information displayed in the application.
In this embodiment, after obtaining the media information bit identifier, the multidimensional perspective service module 12 obtains the terminal identifiers corresponding to the media information bit identifier from the previously stored reporting information (which includes the terminal identifier and the media information bit identifier); in other words, it determines which terminals have browsed the media information corresponding to that media information bit identifier. Further, the server determines the user information of each terminal based on its terminal identifier. The user information is specifically user identification information, for example a QQ number.
In this embodiment, a user portrait data set is preconfigured in the server; it may be implemented as a user portrait database (shown as the user portrait library (tub) in Fig. 3). The user portrait data set stores portrait data associated with user information (specifically, user identification information), such as age, gender, education level, keywords, and business interests. The multidimensional perspective service module 12 queries the user portrait data set based on the user information to obtain a first user portrait data set that matches the user information.
In this embodiment, the multidimensional perspective service module 12 analyzes the first user portrait data set according to preset dimension parameters. For example, when the portrait data in the first user portrait data set falls into five categories, namely age, gender, education level, keywords, and business interests, these five categories are used as the dimension parameters: the first user portrait data set is statistically analyzed along the five dimension parameters to obtain the data corresponding to each dimension parameter, and the crowd portrait data of the application is generated based on that data.
Specifically, the multidimensional perspective service module 12 is configured to count the data volume corresponding to each dimension parameter value range within each dimension parameter, calculate the proportion of the data volume of each value range in the total data volume of the corresponding dimension parameter, and generate the crowd portrait data of the application based on the proportions obtained for each dimension parameter.
Take age as an example of a dimension parameter: a dimension parameter value is, for example, 25 years old, and the dimension parameter value ranges are, for example, under 18, 19-23, 24-30, 31-40, 41-50, and so on. When the age value of a record in the first user portrait data set is 25, the record is assigned to the value range 24-30. In the same way, the data volume in each value range of the age dimension and the total data volume of the age dimension are counted, and the proportion of each value range in the total is calculated to obtain the crowd portrait data for the age dimension. When the dimension parameter is gender, education level, keywords, or business interests, the processing is similar and is not repeated here.
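The age example can be worked through in a few lines. The range boundaries follow the ranges listed in the text, with two assumptions that are not in the patent: an age of exactly 18 is placed in the first range, and an open-ended "over 50" range stands in for "and so on". The names `AGE_RANGES`, `age_range`, and `proportions` are hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable, List, Tuple

# Upper bounds follow the ranges listed in the text; treating 18 as part of the
# first range and adding an open-ended last range are assumptions.
AGE_RANGES: List[Tuple[int, str]] = [
    (18, "18 and under"),
    (23, "19-23"),
    (30, "24-30"),
    (40, "31-40"),
    (50, "41-50"),
]


def age_range(age: int) -> str:
    """Map an age value to its dimension parameter value range."""
    for upper, label in AGE_RANGES:
        if age <= upper:
            return label
    return "over 50"


def proportions(ages: Iterable[int]) -> Dict[str, float]:
    """Share of each value range in the total amount of age data."""
    counts = Counter(age_range(a) for a in ages)
    total = sum(counts.values())
    return {label: n / total for label, n in counts.items()} if total else {}


print(age_range(25))
print(proportions([25, 27, 35, 17]))
```

With the four sample ages, two fall in 24-30, one in 31-40 and one in the first range, so the shares are 0.5, 0.25 and 0.25. The same counting applies to gender, education level, keywords, or business interests once a mapping from value to range (or category) is chosen.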
With the technical solution of this embodiment, mass source data is collected and, in response to user portrait function requests for different applications, the corresponding data is retrieved and then analyzed and counted according to the preset dimension parameters to obtain the corresponding crowd portrait data. The crowd portrait data of the different types of media information bits in an application can thus be calculated accurately, the push strategy of the media information can be adjusted based on the crowd portrait data, and the accuracy of the directional pushing of media information is greatly improved. For the client user, the media information browsed is more likely to be content of interest, which greatly improves the user experience.
Example two
An embodiment of the present invention further provides a server whose structure is shown in Fig. 1. The server includes a data access module 11 and a multidimensional perspective service module 12, wherein:
the data access module 11 is configured to: obtain source reporting information transmitted by the terminal in a Software Development Kit (SDK) mode, or obtain source reporting information transmitted by the terminal in a JS code mode, where the source reporting information includes a terminal identifier and a media information bit identifier; identify and screen the source reporting information to obtain reporting information; and store the reporting information in association with a preset identifier;
the multidimensional perspective service module 12 is configured to: obtain, when a user portrait function request for an application is received, a media information bit identifier corresponding to the application identifier based on the application identifier; query the stored reporting information to obtain a terminal identifier corresponding to the media information bit identifier, and determine the user information of the terminal based on the terminal identifier; establish an association between index identifiers and data in the user portrait data set to generate first data, wherein each index identifier is characterized by a preset dimension parameter; query the first data based on the user information to obtain at least one dimension parameter and dimension parameter value corresponding to the user information, and generate the first user portrait data set based on the at least one dimension parameter and dimension parameter value; analyze data in the first user portrait data set according to the preset dimension parameters to obtain data corresponding to each dimension parameter; and count the data volume corresponding to each dimension parameter value range within each dimension parameter, calculate the proportion of the data volume of each value range in the total data volume of the corresponding dimension parameter, and generate the crowd portrait data of the application based on the proportions obtained for each dimension parameter.
The server described in the embodiment of the present invention may specifically be a single server or a server cluster.
In this embodiment, the data access module 11 is configured to obtain source reporting information that is transmitted by the terminal in a Software Development Kit (SDK) manner, or obtain source reporting information that is transmitted by the terminal in a JS code manner; identifying the source reporting information, and removing invalid data in the source reporting information to obtain reporting information; wherein the invalid data comprises: error data and illegal data; and storing the report information in association with a preset identifier.
That is to say, the data access module 11 of the embodiment of the present invention may obtain the report information of the terminal in either of the following manners: by means of code (e.g. JS code), or by means of a Software Development Kit (SDK). Specifically, the present embodiment is directed to a scenario of pushing media information in a mobile application, where the mobile application is any application running on a mobile terminal; the content of the media information may specifically be advertisement content, that is, the present embodiment is an application scenario for advertisement push in a mobile application. In practice, any application has at least one media information presentation bit. When the media information is displayed, the application reports information to the server through its built-in service logic. Specifically, when the application runs under the iOS operating system or the Android operating system, the source report information is transmitted in the SDK manner; when the application is built with the 5th generation hypertext markup language (H5), the source report information is transmitted in the JS code manner. The source reporting information obtained by the data access module 11 includes a terminal identifier and a media information bit identifier; when the application has multiple media information presentation bits, the source reporting information includes a terminal identifier and multiple media information bit identifiers. The terminal identifier may specifically be an International Mobile Equipment Identity (IMEI); when the media information content is advertisement content, the media information bit identifier may specifically be an advertisement bit identifier.
Further, in this embodiment, the data access module 11 may identify the source report information according to a preset identification method, remove invalid data such as error data and illegal data (i.e., cheating data) from the source report information, and retain the valid data, thereby obtaining the report information. In this embodiment, the source reporting information and the reporting information are the same type of information; the difference is that the source reporting information is the data before identification, and the reporting information is the data after invalid data is removed. Further, the data access module 11 stores the report information in association with a preset identifier, so as to facilitate subsequent data query and analysis; the preset identifiers are associated with users, i.e. each preset identifier represents a user. The preset identifier is different from the user's application account, so that the application account (such as a QQ number) is not used directly as the identifier of the user's reported information, which avoids revealing the user's private data through the account.
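The screening and associated storage described above can be illustrated with a minimal sketch. The patent does not specify the identification method or how the preset identifier is produced, so everything named here is an assumption made for illustration: the `Report` record, the blocklist-based `is_valid` rule, the `account_of_terminal` lookup table, and the salted-hash `preset_identifier` function are all hypothetical.

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class Report:
    terminal_id: str   # terminal identifier, e.g. an IMEI
    slot_ids: tuple    # one or more media information bit identifiers


def is_valid(report, blocked_terminals):
    """Hypothetical screening rule; the real identification method is not specified."""
    if not report.terminal_id or not report.slot_ids:
        return False   # treated as error data
    if report.terminal_id in blocked_terminals:
        return False   # treated as illegal (cheating) data
    return True


def preset_identifier(account, salt):
    """Assumed scheme: derive a stand-in identifier so the raw account is never the key."""
    digest = hashlib.sha256((salt + account).encode("utf-8")).hexdigest()
    return digest[:16]


def store_reports(source_reports, account_of_terminal, blocked_terminals, salt):
    """Screen source reports and store the survivors under each user's preset identifier."""
    store = {}
    for report in source_reports:
        if not is_valid(report, blocked_terminals):
            continue
        account = account_of_terminal.get(report.terminal_id)
        if account is None:
            continue   # no known user for this terminal
        store.setdefault(preset_identifier(account, salt), []).append(report)
    return store
```

In this sketch the stored key is a hash of the account rather than the account itself, which mirrors the stated goal that the application account is not used directly as the identifier of the reported information.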
As can be seen in fig. 2, the data obtained by the data access module 11 at least includes first party data and second party data; the first party data may be the report data transmitted in the JS manner, the report data transmitted in the SDK manner, and the exposure and click data of the media information obtained by the data access module 11 (the exposure and click data of the media information may be stored as an exposure log). Further, the data access module 11 is pre-configured with a first data collection tool for data transmission (e.g., the octopus tool shown in fig. 2). The second party data is specifically data which is obtained from other devices and can be used for user portrait analysis; the other devices and the server belong to heterogeneous systems; a heterogeneous system may be understood as one whose function or responsibility differs from that of the server, or one that is supported by services from a different service provider, etc. The second party data may be, for example, first data characterizing customer relationships and second data characterizing the marketing situation of media information. Further, a second data collection tool is also preconfigured, which is used to communicate with the other devices (e.g., the Datahub tool shown in fig. 2).
Referring to fig. 2, after obtaining source data (including the first party data and the second party data), the data access module 11 cleans and integrates the source data, that is, identifies the source data according to a preset identification method, removes invalid data such as error data and illegal data (i.e., cheating data) from the source data, and retains valid data; it also performs effect measurement on the source data, namely measures through related business logic whether the source data (including the first party data and the second party data) supports user portrait analysis. In addition, the first data and the preset identifier are stored in association, which on the one hand facilitates data statistics and analysis, and on the other hand facilitates building data statistics indexes, so that a business report representing the distribution of various data of client users can be provided.
In this embodiment, fig. 3 is a schematic structural diagram of a perspective service module in a server according to an embodiment of the present invention; as shown in fig. 3, the multidimensional perspective service module 12 includes an index construction unit 121, a storage unit 122, and a multidimensional data processing unit 123; wherein,
the index building unit 121 is configured to build an association relationship between an index identifier and data in a user portrait data set to generate first data, and send the first data to the storage unit 122; the index identification is characterized by the preset dimension parameter;
the storage unit 122 is configured to store the first data;
the multidimensional data processing unit 123 is configured to query the first data generated by the index building unit 121 based on the user information, obtain at least one dimension parameter and a dimension parameter value corresponding to the user information, and generate the first user portrait data set based on the at least one dimension parameter and the dimension parameter value; analyze data in the first user portrait data set according to the at least one dimension parameter to obtain data corresponding to each dimension parameter; count the data volume corresponding to each dimension parameter value range in each dimension parameter, and calculate the proportion of the data volume of each dimension parameter value range in the total data volume of the corresponding dimension parameter; and generate the crowd portrait data of the application based on the obtained proportions corresponding to each dimension parameter.
In this embodiment, fig. 4 is an application schematic diagram of a perspective service module in a server according to an embodiment of the present invention; as shown in fig. 5, the index building unit 121 builds associated index identifiers for the user portrait data set (i.e., for the user portrait library in fig. 3), where each index identifier may be characterized by a preset dimension parameter; the data corresponding to an index identifier may be the dimension parameter values corresponding to that index identifier and the corresponding user information (e.g., user identifier information) in the user portrait data set. Referring to fig. 4, in this embodiment, the user portrait data set stores portrait data associated with user information (specifically, user identification information), such as age, gender, education level, keywords, business interests, and the like. The dimension parameters corresponding to the index identifiers are the same as the dimension parameters characterized in the user portrait data set; taking portrait data that includes the five dimension parameters of age, gender, education level, keywords and business interest as an example, the index building unit 121 establishes the first data using each of the five dimension parameters as an index identifier, so as to provide index support for subsequent data queries. Further, the first data is stored in the storage unit 122 in a preset format; for example, the first data may specifically be bitmap data stored in an sstable format.
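As a rough illustration of indexing portrait data by dimension parameter with bitmaps, the following sketch keeps one integer bitmap per (dimension, value) pair. It is only an in-memory approximation under stated assumptions: the function names `build_index` and `lookup`, the row layout, and the restriction to one value per dimension per user are all hypothetical, and the sstable storage format mentioned in the text is not modeled.

```python
from collections import defaultdict


def build_index(portrait_rows):
    """Build a bitmap index over portrait data.

    portrait_rows: list of (user_id, {dimension: value}) pairs.
    Returns (user_ids, index), where index[dimension][value] is an int used
    as a bitmap: bit i is set when user_ids[i] has that value.
    """
    user_ids = [user_id for user_id, _ in portrait_rows]
    index = defaultdict(lambda: defaultdict(int))
    for position, (_, dimensions) in enumerate(portrait_rows):
        for dimension, value in dimensions.items():
            index[dimension][value] |= 1 << position
    return user_ids, index


def lookup(user_ids, index, user_id):
    """Return {dimension: value} for one user by probing each bitmap."""
    position = user_ids.index(user_id)
    return {
        dimension: value
        for dimension, values in index.items()
        for value, bits in values.items()
        if (bits >> position) & 1
    }
```

A bitmap per value makes counting cheap as well: `bin(index["gender"]["male"]).count("1")` gives the number of users with that value without scanning the rows.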
The illustration shown in FIG. 4 also includes a pre-stored user tag set (i.e., a user tag library); the user tag set stores operation behavior data of the user, such as webpage browsing operations, commodity purchasing operations and the like. The user tag set (i.e., the user tag library) and the business logic associated with it are preset in the system architecture; however, the information processing method according to the embodiment of the present invention does not use the user tag set, and therefore the embodiment does not describe it in detail.
In this embodiment, when the multidimensional data processing unit 123 receives a user portrait request for an application, it obtains a media information bit identifier corresponding to the application identifier based on the application identifier. The user portrait request may be triggered by the user portrait function unit of the mobile alliance platform at the WEB layer, as shown in fig. 3, to enable the user portrait function for any application. Further, as shown in fig. 2, the server queries, based on the application identifier, the pre-stored media information exposure log, which contains the exposure and click data of media information stored in the data access module 11, and obtains the media information bit identifier associated with the application identifier, that is, the media information bit identifier corresponding to the media information displayed in the application. Further, after the media information bit identifier is obtained, the terminal identifier corresponding to the media information bit identifier is obtained based on the previously stored reporting information (which includes the terminal identifier and the media information bit identifier); this can be understood as determining which terminals have browsed the media information corresponding to the media information bit identifier. Further, the server determines the user information of the terminal based on the terminal identifier, where the user information is specifically user identifier information, for example a QQ number or the like.
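The lookup chain in this paragraph (application identifier, then media information bit identifiers, then terminal identifiers, then user information) can be sketched as three set-building passes. The record layouts and the `user_of_terminal` mapping below are assumptions for illustration only; the patent does not describe how the exposure log or the reporting information is laid out in storage.

```python
def users_for_application(app_id, exposure_log, reports, user_of_terminal):
    """Resolve the users who were shown media information in one application.

    exposure_log: list of {"app_id": ..., "slot_id": ...} entries (assumed layout).
    reports: list of {"terminal_id": ..., "slot_ids": [...]} entries (assumed layout).
    user_of_terminal: hypothetical mapping from terminal identifier to user identifier.
    """
    # Application identifier -> media information bit identifiers.
    slot_ids = {entry["slot_id"] for entry in exposure_log if entry["app_id"] == app_id}
    # Media information bit identifiers -> terminal identifiers.
    terminal_ids = {
        report["terminal_id"]
        for report in reports
        if slot_ids & set(report["slot_ids"])
    }
    # Terminal identifiers -> user information.
    return {user_of_terminal[t] for t in terminal_ids if t in user_of_terminal}
```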
Further, the multidimensional data processing unit 123 queries the first data based on the user information (specifically, user identification information), and obtains at least one dimension parameter and the corresponding dimension parameter value for the user information, for example: age 25, gender male, etc.; it generates a first user portrait data set based on the obtained at least one dimension parameter and dimension parameter value; the first user portrait data set may be characterized by the at least one dimension parameter and the corresponding dimension parameter value. Further, the data of each dimension parameter in the first user portrait data set is analyzed in order to count the data volume of each dimension parameter.
Further, take the dimension parameter age as an example, with a dimension parameter value of 25 years old and dimension parameter value ranges of less than 18 years old, 19-23 years old, 24-30 years old, 31-40 years old, 41-50 years old, and so on. When the value of the age dimension parameter of one piece of data in the obtained first user portrait data is 25 years old, the dimension parameter value range of that data is recorded as 24-30 years old; by analogy, the multidimensional data processing unit 123 counts the data volume of each dimension parameter value range in the age dimension parameter, and the total data volume in the age dimension parameter; it then calculates the proportion of the data volume of each dimension parameter value range in the total data volume of the age dimension parameter, to obtain the crowd portrait data for the age dimension parameter. When the dimension parameter is gender, education level, keywords or business interest, the processing is similar to the above description and is not repeated here.
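The age example above can be worked through in a short sketch. The range labels follow the ranges listed in the text; since that list ends with "and so on" and does not assign age 18 to any range, this sketch places any uncovered age in a hypothetical "other" bucket, which is an assumption rather than something the source specifies. The function names are likewise illustrative.

```python
from collections import Counter

# Value ranges as listed in the text: (label, lowest age, highest age).
AGE_RANGES = [
    ("<18", 0, 17),
    ("19-23", 19, 23),
    ("24-30", 24, 30),
    ("31-40", 31, 40),
    ("41-50", 41, 50),
]


def age_range(age):
    """Map one age value to its value range label."""
    for label, low, high in AGE_RANGES:
        if low <= age <= high:
            return label
    return "other"   # assumption: ages the listed ranges do not cover


def age_proportions(ages):
    """Count the data volume per range and divide by the total data volume."""
    counts = Counter(age_range(age) for age in ages)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {label: count / total for label, count in counts.items()}
```

For instance, `age_range(25)` returns `"24-30"`, and for the ages 25, 27, 35 and 16 the proportions are 0.5 for 24-30, 0.25 for 31-40 and 0.25 for under 18.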
By adopting the technical scheme of the embodiment of the invention, the corresponding data are acquired corresponding to the user portrait function requests of different applications through the acquisition of mass source data, so that the corresponding crowd portrait data are acquired through the analysis and statistics of the acquired data according to the preset dimension parameters, the accurate calculation of the crowd portrait data of different types of media information bits in the applications is realized, the push strategy of the media information can be adjusted based on the crowd portrait data, and the accuracy of the directional push of the media information is greatly improved. For the client user, the browsed media information is also the content which the client user is interested in, and accordingly, the user experience is greatly improved.
In embodiments of the present invention, the media information is, for example, advertisement content. Taking the media information as an example of advertisement content, fig. 4 is an application architecture diagram of a data access module and a perspective service module in a server according to an embodiment of the present invention; wherein, the WEB page (WEB) layer is used for obtaining the source data in real time. The application management function in the WEB layer is used for managing a user portrait function when an application is newly built; the user representation function is to request a user representation function of any application from a perspective service module; wherein, the application can be understood as APP application; at least one media information display position (namely an advertisement position) is configured in the APP application in advance. The data access module in the server of the embodiment of the invention can obtain data from a WEB layer, and the data is integrated, processed and stored; and the multidimensional perspective service module performs multidimensional statistical analysis and calculation on the crowd portraits correspondingly applied under the request of the user portraits function. The specific functions of the data access module and the multidimensional perspective service module may be described with reference to the first embodiment or the second embodiment, and are not described herein again.
Example three
The embodiment of the invention provides an information processing method. FIG. 6 is a flowchart illustrating an information processing method according to an embodiment of the present invention; as shown in fig. 6, the information processing method includes:
Step 601: acquiring and storing the reported information of the terminal, wherein the reported information comprises a terminal identifier and a media information bit identifier.
Step 602: when a user portrait function request for an application is received, obtaining a media information bit identifier corresponding to the application identifier based on the application identifier.
Step 603: querying the stored reported information, acquiring a terminal identifier corresponding to the media information bit identifier, and determining the user information of the terminal based on the terminal identifier.
Step 604: querying the stored user portrait data set to obtain a first user portrait data set matching the user information.
Step 605: analyzing data in the first user portrait data set according to preset dimension parameters, obtaining data corresponding to each dimension parameter, and generating the crowd portrait data of the application based on the data corresponding to each dimension parameter.
The information processing method described in this embodiment is applied to a server or a server cluster.
In this embodiment, the obtaining and storing the report information of the terminal includes:
obtaining source reporting information transmitted by a terminal in a Software Development Kit (SDK) mode or obtaining source reporting information transmitted by the terminal in a JS code mode; screening the source reporting information to obtain reporting information; and storing the report information in association with a preset identifier.
That is to say, the information processing method according to the embodiment of the present invention may obtain the report information of the terminal in either of the following manners: by means of code (e.g. JS code), or by means of a Software Development Kit (SDK). Specifically, step 601 is directed to a scenario of pushing media information in a mobile application, where the mobile application is any application running on a mobile terminal; the content of the media information may specifically be advertisement content, that is, the present embodiment is an application scenario for advertisement push in a mobile application. In practice, any application has at least one media information presentation bit. When the media information is displayed, the application reports information to the server through its built-in service logic. Specifically, when the application runs under the iOS operating system or the Android operating system, the source report information is transmitted in the SDK manner; when the application is built with the 5th generation hypertext markup language (H5), the source report information is transmitted in the JS code manner. The obtained source reporting information comprises a terminal identifier and a media information bit identifier; when the application has multiple media information presentation bits, the source reporting information includes a terminal identifier and multiple media information bit identifiers. The terminal identifier may specifically be an International Mobile Equipment Identity (IMEI); when the media information content is advertisement content, the media information bit identifier may specifically be an advertisement bit identifier.
Further, in this embodiment, the server may identify the source report information according to a preset identification method, remove invalid data such as error data and illegal data (i.e., cheating data) from the source report information, and retain the valid data, so as to obtain the report information in step 601. In this embodiment, the source reporting information and the reporting information are the same type of information; the difference is that the source reporting information is the data before identification, and the reporting information is the data after invalid data is removed. Further, the reported information is stored in association with a preset identifier, so that data query and analysis can be performed subsequently; the preset identifiers are associated with users, i.e. each preset identifier represents a user. The preset identifier is different from the user's application account, so that the application account (such as a QQ number) is not used directly as the identifier of the user's reported information, which avoids revealing the user's privacy through the account.
Based on the server described in the first embodiment or the second embodiment, step 601 can be implemented by the data access module in the server. As can be seen in fig. 2, the data obtained by the data access module at least includes first party data and second party data; the first party data may be the report data transmitted in the JS manner, the report data transmitted in the SDK manner, and the exposure and click data of the media information obtained by the data access module (the exposure and click data of the media information may be stored as an exposure log). Further, the data access module is pre-configured with a first data collection tool for data transmission (e.g., the octopus tool shown in fig. 2). The second party data is specifically data which is obtained from other devices and can be used for user portrait analysis; the other devices and the server belong to heterogeneous systems; a heterogeneous system may be understood as one whose function or responsibility differs from that of the server, or one that is supported by services from a different service provider, etc. The second party data may be, for example, first data characterizing customer relationships and second data characterizing the marketing situation of media information. Further, a second data collection tool is also preconfigured, which is used to communicate with the other devices (e.g., the Datahub tool shown in fig. 2).
Referring to fig. 2, after source data (including the first party data and the second party data) is obtained, the source data is cleaned and integrated, that is, the source data is identified according to a preset identification method, invalid data such as error data and illegal data (i.e., cheating data) is removed from the source data, and valid data is retained; effect measurement is also performed on the source data, namely related business logic is used to measure whether the source data (including the first party data and the second party data) supports user portrait analysis. In addition, the first data and the preset identifier are stored in association, which on the one hand facilitates data statistics and analysis, and on the other hand facilitates building data statistics indexes, so that a business report representing the distribution of various data of client users can be provided.
In step 602, when a user portrait request for an application is received, a media information bit identifier corresponding to the application identifier is obtained based on the application identifier. The user portrait request may be triggered by the user portrait function unit of the mobile alliance platform at the WEB layer, as shown in fig. 3, to enable the user portrait function for any application. Further, as shown in fig. 2, the server queries, based on the application identifier, the pre-stored media information exposure log, which contains the exposure and click data of media information stored in the data access module, and obtains the media information bit identifier associated with the application identifier, that is, the media information bit identifier corresponding to the media information displayed in the application.
In step 603 of this embodiment, after obtaining the media information bit identifier, the server obtains the terminal identifier corresponding to the media information bit identifier based on the previously stored reporting information (which includes the terminal identifier and the media information bit identifier); this can be understood as determining which terminals have browsed the media information corresponding to the media information bit identifier. Further, the server determines the user information of the terminal based on the terminal identifier, where the user information is specifically user identifier information, for example a QQ number or the like.
In step 604, a user portrait data set is pre-configured, and the user portrait data set may be implemented by a user portrait database (specifically, refer to the user portrait library (TUBI) in FIG. 3). The user portrait data set stores portrait data associated with user information (specifically, user identification information), such as age, gender, education level, keywords, business interests, and the like. The server queries the user portrait data set based on the user information (specifically, user identification information) to obtain a first user portrait data set that matches the user information.
In step 605 of this embodiment, the server analyzes the first user portrait data according to the preset dimension parameters; for example, when the portrait data included in the first user portrait data set falls into five types, namely age, gender, education level, keywords and business interest, the five types are used as dimension parameters, statistical analysis is performed on the first user portrait data according to the five dimension parameters, data corresponding to each dimension parameter is obtained, and the crowd portrait data of the application is generated based on the data corresponding to each dimension parameter.
Specifically, generating the crowd portrait data of the application based on the data corresponding to each dimension parameter includes: counting the data volume corresponding to each dimension parameter value range in each dimension parameter, and calculating the proportion of the data volume of each dimension parameter value range in the total data volume of the corresponding dimension parameter; and generating the crowd portrait data of the application based on the obtained proportions corresponding to each dimension parameter.
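The proportion calculation described above can be written compactly. The symbols are introduced here for illustration only and do not appear in the source: for a dimension parameter $d$ with value ranges $R_d$, let $n_{d,r}$ denote the data volume counted in value range $r$.

```latex
p_{d,r} \;=\; \frac{n_{d,r}}{\sum_{r' \in R_d} n_{d,r'}},
\qquad \sum_{r \in R_d} p_{d,r} = 1 .
```

As a worked case under these assumed symbols, if the age dimension holds four values of which two fall in the 24-30 range, then $p_{\text{age},\,24\text{-}30} = 2/4 = 0.5$.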
Take the dimension parameter age as an example, with a dimension parameter value of 25 years old and dimension parameter value ranges of less than 18 years old, 19-23 years old, 24-30 years old, 31-40 years old, 41-50 years old, and so on. When the value of the age dimension parameter of one piece of data in the obtained first user portrait data is 25 years old, the dimension parameter value range of that data is recorded as 24-30 years old; by analogy, the data volume of each dimension parameter value range in the age dimension parameter and the total data volume in the age dimension parameter are counted; the proportion of the data volume of each dimension parameter value range in the total data volume of the age dimension parameter is then calculated, to obtain the crowd portrait data for the age dimension parameter. When the dimension parameter is gender, education level, keywords or business interest, the processing is similar to the above description and is not repeated here.
By adopting the technical scheme of the embodiment of the invention, the corresponding data are acquired corresponding to the user portrait function requests of different applications through the acquisition of mass source data, so that the corresponding crowd portrait data are acquired through the analysis and statistics of the acquired data according to the preset dimension parameters, the accurate calculation of the crowd portrait data of different types of media information bits in the applications is realized, the push strategy of the media information can be adjusted based on the crowd portrait data, and the accuracy of the directional push of the media information is greatly improved. For the client user, the browsed media information is also the content which the client user is interested in, and accordingly, the user experience is greatly improved.
Example four
The embodiment of the invention also provides an information processing method. FIG. 7 is a schematic flow chart of an information processing method according to an embodiment of the present invention; as shown in fig. 7, the information processing method includes:
Step 701: obtaining source reporting information transmitted by a terminal in a Software Development Kit (SDK) manner, or obtaining source reporting information transmitted by the terminal in a JS code manner; the source reporting information comprises a terminal identifier and a media information bit identifier.
Step 702: identifying the source reporting information, and screening the source reporting information to obtain reporting information; and storing the reporting information in association with a preset identifier.
Step 703: when a user portrait function request for an application is received, obtaining a media information bit identifier corresponding to the application identifier based on the application identifier.
Step 704: querying the stored reporting information, acquiring a terminal identifier corresponding to the media information bit identifier, and determining the user information of the terminal based on the terminal identifier.
Step 705: establishing an association relation between index identifiers and data in the user portrait data set to generate first data; the index identifiers are characterized by the preset dimension parameters.
Step 706: querying the first data based on the user information, obtaining at least one dimension parameter and a dimension parameter value corresponding to the user information, and generating the first user portrait data set based on the at least one dimension parameter and the dimension parameter value.
Step 707: analyzing data in the first user portrait data set according to the preset dimension parameters to obtain data corresponding to each dimension parameter.
Step 708: counting the data volume corresponding to each dimension parameter value range in each dimension parameter, calculating the proportion of the data volume of each dimension parameter value range in the total data volume of the corresponding dimension parameter, and generating the crowd portrait data of the application based on the obtained proportions corresponding to each dimension parameter.
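Steps 707 and 708 generalize the per-dimension counting to every dimension parameter at once. The sketch below is one possible reading, not the patented implementation: the function name `crowd_portrait`, the `range_of` mapping of per-dimension range functions, and the fallback of using the raw value as its own range label are all assumptions made for illustration.

```python
from collections import Counter


def crowd_portrait(user_portraits, range_of):
    """Compute, per dimension parameter, the proportion of each value range.

    user_portraits: list of {dimension: value} dicts, one per user
                    (standing in for the first user portrait data set).
    range_of: hypothetical mapping {dimension: function(value) -> range label};
              dimensions without an entry use str(value) as the label.
    """
    counts = {}
    for portrait in user_portraits:
        for dimension, value in portrait.items():
            label = range_of.get(dimension, str)(value)
            counts.setdefault(dimension, Counter())[label] += 1
    return {
        dimension: {
            label: count / sum(counter.values())
            for label, count in counter.items()
        }
        for dimension, counter in counts.items()
    }
```

Each dimension is normalized by its own total, so a user who lacks a value for one dimension still contributes to the others, which matches the wording "the total data volume of the corresponding dimension parameter".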
The information processing method described in this embodiment is applied to a server or a server cluster.
In step 701 of this embodiment, the server may obtain the report information of the terminal in either of the following manners: by means of code (e.g. JS code), or by means of a Software Development Kit (SDK). Specifically, step 701 is directed to a scenario of pushing media information in a mobile application, where the mobile application is any application running on a mobile terminal; the content of the media information may specifically be advertisement content, that is, the present embodiment is an application scenario for advertisement push in a mobile application. In practice, any application has at least one media information presentation bit. When the media information is displayed, the application reports information to the server through its built-in service logic. Specifically, when the application runs under the iOS operating system or the Android operating system, the source report information is transmitted in the SDK manner; when the application is built with the 5th generation hypertext markup language (H5), the source report information is transmitted in the JS code manner. The obtained source reporting information comprises a terminal identifier and a media information bit identifier; when the application has multiple media information presentation bits, the source reporting information includes a terminal identifier and multiple media information bit identifiers. The terminal identifier may specifically be an International Mobile Equipment Identity (IMEI); when the media information content is advertisement content, the media information bit identifier may specifically be an advertisement bit identifier.
In step 702 of this embodiment, the server may screen the source report information according to a preset identification method, removing, for example, error data and illegal data (i.e., cheating data) and retaining valid data, thereby obtaining the report information. In this embodiment, the source report information and the report information are the same type of information; the difference is that the source report information is the data before screening, and the report information is the data after invalid data has been removed. Further, the report information is stored in association with a preset identifier so that data query and analysis can be performed subsequently; the preset identifiers are associated with users, i.e., each preset identifier represents a user. The preset identifier is different from the user's application account. This avoids using the application account (e.g., a QQ number) directly as the identifier of the user's report information, and thus avoids revealing the user's private data.
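The screening and keyed storage described above can be sketched as follows. The embodiment does not specify the screening rules or how preset identifiers are generated, so `is_valid`, the blocked-terminal list, and the use of a random UUID keyed by terminal are all assumptions for illustration.

```python
import uuid
from typing import Dict, Iterable, List, Set, Tuple

Report = Tuple[str, str]  # (terminal_id, slot_id)


def is_valid(report: Report, blocked_terminals: Set[str]) -> bool:
    """Hypothetical screening rule: drop malformed records and known cheating terminals."""
    terminal_id, slot_id = report
    if not terminal_id or not slot_id:
        return False                              # error data
    return terminal_id not in blocked_terminals   # illegal (cheating) data


class ReportStore:
    def __init__(self) -> None:
        self._preset_ids: Dict[str, str] = {}           # terminal_id -> preset identifier
        self.by_preset_id: Dict[str, List[Report]] = {}

    def preset_id_for(self, terminal_id: str) -> str:
        # A random identifier, so the storage key is never the user's application account.
        return self._preset_ids.setdefault(terminal_id, uuid.uuid4().hex)

    def ingest(self, source_reports: Iterable[Report], blocked: Set[str]) -> int:
        """Store the valid reports under their preset identifier; return how many were kept."""
        kept = 0
        for report in source_reports:
            if is_valid(report, blocked):
                key = self.preset_id_for(report[0])
                self.by_preset_id.setdefault(key, []).append(report)
                kept += 1
        return kept
```

Keying storage by a generated identifier rather than by the account itself is one simple way to keep the account out of the stored records.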
Based on the server described in the first embodiment or the second embodiment, steps 701 to 702 may be implemented by the data access module in the server. As can be seen in fig. 2, the data obtained by the data access module includes at least first party data and second party data. The first party data may be the report data transmitted in the JS mode, the report data transmitted in the SDK mode, and the exposure and click data of the media information obtained by the data access module (the exposure and click data of the media information may be stored as an exposure log). Further, the data access module is preconfigured with a first data collection tool for data transfer (e.g., the octopus tool shown in fig. 2). The second party data is data that is obtained from other devices and can be used for user portrait analysis. The other devices and the server belong to heterogeneous systems; that is, the other devices may have functions or responsibilities different from those of the server, or may be supported by services from a different service provider, and so on. The second party data may be, for example, first data characterizing customer relationships and second data characterizing the marketing situation of media information. Further, a second data collection tool for communicating with the other devices is also preconfigured (e.g., the Datahub tool shown in fig. 2).
Referring to fig. 2, after the source data (including the first party data and the second party data) is obtained, the source data is cleaned and integrated; that is, the source data is screened according to a preset identification method, error data and illegal data (i.e., cheating data) are removed, and valid data is retained. An effect measurement is also performed on the source data, that is, related business logic measures whether the source data (including the first party data and the second party data) can support user portrait analysis. In addition, the first data is stored in association with the preset identifier, which facilitates data statistics and analysis as well as the computation of statistical indexes, so that business reports representing the various data distributions of client users can be provided.
In step 703, when a user portrait function request for an application is received, the media information bit identifier corresponding to the application identifier is obtained based on the application identifier. The user portrait function request may be triggered by the user portrait function unit of the mobile alliance platform at the web level, as shown in fig. 3, to enable the user portrait function for any application. Further, the server queries a pre-stored exposure log based on the application identifier. As shown in fig. 2, the media information exposure log stored in the data access module, which contains the exposure and click data of the media information, is queried based on the application identifier to obtain the media information bit identifiers associated with the application identifier, that is, the media information bit identifiers corresponding to the media information displayed in the application.
In step 704 of this embodiment, after obtaining the media information bit identifier, the server obtains the terminal identifiers corresponding to the media information bit identifier based on the previously stored report information (which includes the terminal identifier and the media information bit identifier); in other words, it determines which terminals have browsed the media information corresponding to the media information bit identifier. Further, the server determines the user information of the terminal based on the terminal identifier, where the user information is specifically user identification information, for example a QQ number.
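The lookup chain of steps 703 and 704 (application identifier, then media information bit identifiers, then terminal identifiers, then user information) can be sketched as follows. The in-memory lists and the `terminal_to_user` mapping are hypothetical stand-ins for the stored exposure log, report information, and terminal-to-user association; the embodiment does not describe how they are stored.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical in-memory stand-ins for the stored data.
exposure_log: List[Tuple[str, str]] = [      # (application_id, slot_id)
    ("app-1", "slot-a"), ("app-1", "slot-b"), ("app-2", "slot-c"),
]
report_info: List[Tuple[str, str]] = [       # (terminal_id, slot_id)
    ("imei-1", "slot-a"), ("imei-2", "slot-b"), ("imei-3", "slot-c"),
]
terminal_to_user: Dict[str, str] = {
    "imei-1": "user-1", "imei-2": "user-2", "imei-3": "user-3",
}


def users_for_application(app_id: str) -> Set[str]:
    # Step 703: media information bit identifiers associated with the application.
    slots = {slot for app, slot in exposure_log if app == app_id}
    # Step 704: terminals that reported any of those bits, then their users.
    terminals = {terminal for terminal, slot in report_info if slot in slots}
    return {terminal_to_user[t] for t in terminals if t in terminal_to_user}
```

Here `users_for_application("app-1")` yields the two users whose terminals reported `slot-a` or `slot-b`, and an unknown application identifier yields an empty set.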
In step 705, an associated index identifier is constructed for the user portrait data set (i.e., the user portrait library in fig. 3), where the index identifier may be characterized by a preset dimension parameter; the data corresponding to an index identifier may be the dimension parameter values corresponding to that index identifier and the corresponding user information (e.g., user identification information) in the user portrait data set. Referring to fig. 4, in this embodiment the user portrait data set stores portrait data associated with user information (specifically, user identification information), such as age, gender, education level, keywords, and business interests. The dimension parameters corresponding to the index identifiers are the same as the dimension parameters characterized in the user portrait data set. For example, if the portrait data stored in the user portrait data set includes the five dimension parameters of age, gender, education level, keywords, and business interest, first data is established that uses each of these five dimension parameters as an index identifier, so as to provide index support for subsequent data queries. Further, the first data is stored in a preset format; for example, the first data may be bitmap data stored in an sstable format.
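One way to picture bitmap data indexed by dimension parameter is sketched below. It is an assumption-laden illustration: Python integers stand in for bitmaps, the sample rows and the function name `build_index` are invented, and the sstable storage format mentioned above is not reproduced.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# Hypothetical portrait rows: one dict of dimension parameter -> value per user.
portraits: List[Dict[str, str]] = [
    {"gender": "male", "education": "bachelor"},
    {"gender": "female", "education": "bachelor"},
    {"gender": "male", "education": "master"},
]


def build_index(rows: List[Dict[str, str]]) -> Dict[Tuple[str, str], int]:
    """Map (dimension, value) to a bitmap in which bit i is set if row i has that value."""
    index: Dict[Tuple[str, str], int] = defaultdict(int)
    for i, row in enumerate(rows):
        for dimension, value in row.items():
            index[(dimension, value)] |= 1 << i
    return index


index = build_index(portraits)
# Users who are both male and hold a bachelor's degree: intersect the two bitmaps.
both = index[("gender", "male")] & index[("education", "bachelor")]
```

With one bitmap per value of each dimension parameter, a query over several dimensions reduces to bitwise operations, which is the kind of index support the paragraph above refers to.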
The illustration in fig. 4 also includes a pre-stored user tag set (i.e., a user tag library), which stores operation behavior data of users, such as web page browsing operations and commodity purchasing operations. The user tag set and the business logic associated with it are preset in the system architecture; however, the information processing method according to the embodiment of the present invention does not use the user tag set, so it is not described in detail in this embodiment.
In steps 706 to 707 of this embodiment, the server queries the first data based on the user information (specifically, user identification information) to obtain at least one dimension parameter and the corresponding dimension parameter value for the user information, for example, age 25 and gender male. A first user portrait data set is generated based on the obtained at least one dimension parameter and dimension parameter value; the first user portrait data set may be characterized by the at least one dimension parameter and the corresponding dimension parameter value. Further, the data of each dimension parameter in the first user portrait data set is analyzed to count the data quantity of each dimension parameter.
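Steps 706 to 707 can be sketched as a query of the first data restricted to the users found for the application. This is only one possible reading: the bitmap layout, the sample values, and the names `first_data` and `first_portrait_set` are assumptions, not details given by the embodiment.

```python
from typing import Dict, Tuple

# Hypothetical first data: (dimension, value) -> bitmap over user positions 0..3.
first_data: Dict[Tuple[str, str], int] = {
    ("gender", "male"): 0b0101,
    ("gender", "female"): 0b1010,
    ("age", "25"): 0b0011,
    ("age", "40"): 0b1100,
}


def first_portrait_set(user_mask: int) -> Dict[str, Dict[str, int]]:
    """Count, per dimension parameter and value, how many of the selected users match."""
    result: Dict[str, Dict[str, int]] = {}
    for (dimension, value), bitmap in first_data.items():
        hits = bin(bitmap & user_mask).count("1")
        if hits:
            result.setdefault(dimension, {})[value] = hits
    return result


# Users at positions 0, 1 and 2 were reached by the application's media information bits.
selected = first_portrait_set(0b0111)
```

The result groups counts by dimension parameter, which is the input needed for the per-range statistics of step 708.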
In step 708 of this embodiment, taking age as the dimension parameter, a dimension parameter value is, for example, 25 years old, and the dimension parameter value ranges are, for example, less than 18 years old, 19-23 years old, 24-30 years old, 31-40 years old, 41-50 years old, and so on. When the age dimension parameter value of a piece of data in the obtained first user portrait data set is 25 years old, the dimension parameter value range of that data is recorded as 24-30 years old. By analogy, the data quantity of each dimension parameter value range within the age dimension parameter and the total data quantity of the age dimension parameter are counted, and the proportion of the data quantity of each value range in the total data quantity is calculated to obtain the crowd portrait data for the age dimension parameter. When the dimension parameter is gender, education level, keywords, or business interest, the processing is similar and is not repeated here.
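The range assignment and proportion calculation of step 708 can be sketched for the age dimension as follows. The bucket boundaries follow the ranges listed above, except that the treatment of age 18 itself and the "over 50" label are assumptions, since the listed ranges leave them unspecified.

```python
from collections import Counter
from typing import Dict, Iterable

# Inclusive upper bound and label of each value range (age 18 placement is an assumption).
BUCKETS = [(18, "18 and under"), (23, "19-23"), (30, "24-30"), (40, "31-40"), (50, "41-50")]


def age_range(age: int) -> str:
    """Return the dimension parameter value range that an age falls into."""
    for upper, label in BUCKETS:
        if age <= upper:
            return label
    return "over 50"


def proportions(ages: Iterable[int]) -> Dict[str, float]:
    """Share of each value range in the total data quantity of the age dimension."""
    counts = Counter(age_range(a) for a in ages)
    total = sum(counts.values())
    return {label: n / total for label, n in counts.items()} if total else {}


portrait = proportions([25, 27, 17, 35])
```

For the four sample ages, half of the data falls in the 24-30 range and a quarter each in the "18 and under" and 31-40 ranges.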
With the technical scheme of the embodiment of the invention, mass source data is collected, and in response to the user portrait function requests of different applications the corresponding data is retrieved, then analyzed and counted according to the preset dimension parameters to obtain the corresponding crowd portrait data. This enables the crowd portrait data of the different media information bits in an application to be calculated accurately, so that the push strategy of the media information can be adjusted based on the crowd portrait data and the accuracy of directional media information push is greatly improved. For the client user, the media information browsed is content that the user is interested in, so the user experience is also greatly improved.
Specifically, fig. 8a to 8d are schematic diagrams of the crowd portrait data obtained by the information processing method according to the embodiment of the present invention; the crowd portrait data for at least one media information bit in an application obtained by this technical solution may be as shown in fig. 8a to 8d. Based on the crowd portrait data shown in fig. 8a to 8d, the push proportion of the media information can be adjusted during media information push. For example, in the crowd portrait data shown in fig. 8a, the 24-30 age range has the largest proportion and the over-50 age range has the smallest; in the media information push strategy, the amount of media information pushed to end users aged 24 to 30 can accordingly be made relatively large, and the amount pushed to end users over the age of 50 relatively small.
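The embodiment does not state how the push proportion is derived from the crowd portrait data. One simple possibility, shown purely as an assumption, is to split a push budget across the value ranges in proportion to their share; the function name `allocate_push` and the sample shares are hypothetical.

```python
from typing import Dict


def allocate_push(portrait: Dict[str, float], budget: int) -> Dict[str, int]:
    """Split a push budget across value ranges in proportion to their share."""
    total = sum(portrait.values())
    if total <= 0:
        return {label: 0 for label in portrait}
    # Rounding down may leave a small remainder of the budget unallocated.
    return {label: int(budget * share / total) for label, share in portrait.items()}


# Hypothetical shares for three age ranges.
plan = allocate_push({"24-30": 0.5, "31-40": 0.25, "over 50": 0.25}, budget=1000)
```

Under this rule the range with the largest share receives the largest number of pushes, consistent with the adjustment described for fig. 8a.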
Taking advertisement content as an example of the media information, the technical scheme of the embodiment of the invention has the following beneficial effects:
1. For the advertisement pushing party, accurate targeting of advertisement push is realized, and the accuracy of directional advertisement push is greatly improved.
2. For the user, the advertisement content browsed is content that the user is more interested in, so the probability that the user clicks the advertisement is greatly increased, as is the probability that the user acts on the content promoted in the advertisement (such as downloading, activating, or purchasing). On the one hand, the user experience is greatly improved; on the other hand, for the advertisement pushing party, the click volume and the conversion volume of the advertisement are also greatly increased.
3. For the advertiser, the marketing and presentation effects of advertisement push are greatly improved, and profits are improved accordingly.
In the several embodiments provided in the present application, it should be understood that the disclosed apparatus and method may be implemented in other ways. The above-described device embodiments are merely illustrative, for example, the division of the unit is only a logical functional division, and there may be other division ways in actual implementation, such as: multiple units or components may be combined, or may be integrated into another system, or some features may be omitted, or not implemented. In addition, the coupling, direct coupling or communication connection between the components shown or discussed may be through some interfaces, and the indirect coupling or communication connection between the devices or units may be electrical, mechanical or other forms.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed on a plurality of network units; some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, all the functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may be separately regarded as one unit, or two or more units may be integrated into one unit; the integrated unit can be realized in a form of hardware, or in a form of hardware plus a software functional unit.
Those of ordinary skill in the art will understand that: all or part of the steps for implementing the method embodiments may be implemented by hardware related to program instructions, and the program may be stored in a computer readable storage medium, and when executed, the program performs the steps including the method embodiments; and the aforementioned storage medium includes: a mobile storage device, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
Alternatively, the integrated unit of the present invention may be stored in a computer-readable storage medium if it is implemented in the form of a software functional module and sold or used as a separate product. Based on such understanding, the technical solutions of the embodiments of the present invention may be essentially implemented or a part contributing to the prior art may be embodied in the form of a software product, which is stored in a storage medium and includes several instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the methods described in the embodiments of the present invention. And the aforementioned storage medium includes: a removable storage device, a ROM, a RAM, a magnetic or optical disk, or various other media that can store program code.
The above description is only for the specific embodiments of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily conceive of the changes or substitutions within the technical scope of the present invention, and all the changes or substitutions should be covered within the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the appended claims.

Claims (12)

1. An information processing method, characterized in that the method comprises:
acquiring and storing reporting information of a terminal, wherein the reporting information comprises a terminal identifier and a media information bit identifier;
when a user portrait function request aiming at an application is received, acquiring a media information bit identifier corresponding to the application identifier based on the application identifier;
inquiring the stored reported information, obtaining a terminal identifier corresponding to the media information bit identifier, and determining the user information of the terminal based on the terminal identifier;
querying a stored user portrait data set to obtain a first user portrait data set matched with the user information;
analyzing data in the first user portrait data set according to preset dimension parameters, obtaining data corresponding to each dimension parameter, generating crowd portrait data of the application based on the data corresponding to each dimension parameter, and adjusting a media information push strategy based on the crowd portrait data of the application.
2. The method of claim 1, wherein prior to said querying the stored user portrait data set, the method comprises:
establishing an association relationship between an index identifier and data in the user portrait data set to generate first data; the index identifier is characterized by the preset dimension parameter.
3. The method of claim 2, wherein said querying a stored user portrait data set to obtain a first user portrait data set matched with said user information comprises:
querying the first data based on the user information, obtaining at least one dimension parameter and a dimension parameter value corresponding to the user information, and generating the first user portrait data set based on the at least one dimension parameter and the dimension parameter value.
4. The method of claim 3, wherein generating the crowd portrait data of the application based on the data corresponding to each dimension parameter comprises:
counting the data volume corresponding to each dimension parameter value range in each dimension parameter, and calculating the proportion relation of the data volume of each dimension parameter value range in the total data volume of the corresponding dimension parameter;
and generating the crowd portrait data of the application based on the obtained proportional relation corresponding to each dimension parameter.
5. The method of claim 1, wherein the obtaining and storing the report information of the terminal comprises:
obtaining source reporting information transmitted by a terminal in an SDK mode through a Software Development Kit (SDK) or obtaining source reporting information transmitted by the terminal in a JS code mode;
screening the source reporting information to obtain reporting information;
and storing the report information in association with a preset identifier.
6. The method of claim 1, wherein adjusting the media information push strategy based on the crowd portrait data of the application comprises: adjusting the push proportion of the media information based on the crowd portrait data of the application.
7. A server, characterized in that the server comprises a data access module and a multidimensional perspective service module, wherein:
the data access module is used for acquiring and storing the report information of the terminal, wherein the report information comprises a terminal identifier and a media information bit identifier;
the multidimensional perspective service module is used for obtaining a media information bit identifier corresponding to an application identifier based on the application identifier when a user portrait function request aiming at the application is received; querying the report information stored by the data access module, obtaining a terminal identifier corresponding to the media information bit identifier, and determining the user information of the terminal based on the terminal identifier; querying a stored user portrait data set to obtain a first user portrait data set matched with the user information; analyzing data in the first user portrait data set according to preset dimension parameters, obtaining data corresponding to each dimension parameter, generating crowd portrait data of the application based on the data corresponding to each dimension parameter, and adjusting a media information push strategy based on the crowd portrait data of the application.
8. The server of claim 7, wherein the multidimensional perspective service module is further configured to establish an association relationship between an index identifier and data in the user portrait data set to generate first data before querying the stored user portrait data set; the index identifier is characterized by the preset dimension parameter.
9. The server of claim 8, wherein the multidimensional perspective service module is configured to query the first data based on the user information, obtain at least one dimension parameter and a dimension parameter value corresponding to the user information, and generate the first user portrait data set based on the at least one dimension parameter and the dimension parameter value; analyzing data in the first user portrait data set according to the at least one dimension parameter to obtain data corresponding to each dimension parameter; counting the data volume corresponding to each dimension parameter value range in each dimension parameter, and calculating the proportion relation of the data volume of each dimension parameter value range in the total data volume of the corresponding dimension parameter; and generating the crowd portrait data of the application based on the obtained proportional relation corresponding to each dimension parameter.
10. The server according to claim 9, wherein the multidimensional perspective service module comprises an index construction module, a storage unit and a multidimensional data processing module, wherein:
the index construction module is used for establishing an association relationship between an index identifier and data in a user portrait data set to generate first data and sending the first data to the storage unit; the index identifier is characterized by the preset dimension parameter;
the storage unit is used for storing the first data;
the multidimensional data processing module is used for querying first data generated by the index construction module based on the user information, obtaining at least one dimension parameter and a dimension parameter value corresponding to the user information, and generating the first user portrait data set based on the at least one dimension parameter and the dimension parameter value; analyzing data in the first user portrait data set according to the at least one dimension parameter to obtain data corresponding to each dimension parameter; counting the data volume corresponding to each dimension parameter value range in each dimension parameter, and calculating the proportion relation of the data volume of each dimension parameter value range in the total data volume of the corresponding dimension parameter; and generating the crowd portrait data of the application based on the obtained proportional relation corresponding to each dimension parameter.
11. The server according to claim 7, wherein the data access module is configured to obtain source report information that is transmitted by the terminal in an SDK manner through a software development kit, or obtain source report information that is transmitted by the terminal in a JS code manner; screening the source reporting information to obtain reporting information; and storing the report information in association with a preset identifier.
12. A storage medium having stored thereon computer instructions, characterized in that the computer instructions, when executed, implement the information processing method according to any one of claims 1 to 6.
CN201511004500.3A 2015-12-28 2015-12-28 Information processing method and server Active CN105657003B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201511004500.3A CN105657003B (en) 2015-12-28 2015-12-28 Information processing method and server

Publications (2)

Publication Number Publication Date
CN105657003A CN105657003A (en) 2016-06-08
CN105657003B true CN105657003B (en) 2020-01-10

Family ID: 56477081

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201511004500.3A Active CN105657003B (en) 2015-12-28 2015-12-28 Information processing method and server

Country Status (1)

Country Link
CN (1) CN105657003B (en)

Legal Events

Code Title
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant