WO2017201905A1 - Data distribution method and device, and storage medium - Google Patents

Data distribution method and device, and storage medium Download PDF

Info

Publication number
WO2017201905A1
WO2017201905A1 PCT/CN2016/096850 CN2016096850W WO2017201905A1 WO 2017201905 A1 WO2017201905 A1 WO 2017201905A1 CN 2016096850 W CN2016096850 W CN 2016096850W WO 2017201905 A1 WO2017201905 A1 WO 2017201905A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
distributed
feature
personnel
matrix
Prior art date
Application number
PCT/CN2016/096850
Other languages
French (fr)
Chinese (zh)
Inventor
杨磊
胡建华
章利
戴俊
郭继春
杜星
张磊
李培帅
Original Assignee
百度在线网络技术(北京)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 百度在线网络技术(北京)有限公司 filed Critical 百度在线网络技术(北京)有限公司
Publication of WO2017201905A1 publication Critical patent/WO2017201905A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/01Customer relationship services
    • G06Q30/015Providing customer assistance, e.g. assisting a customer within a business location or via helpdesk
    • G06Q30/016After-sales

Definitions

  • the embodiments of the present invention relate to the field of information processing technologies, and in particular, to a data distribution method, apparatus, and storage medium.
  • customer data is the core data and assets, which are characterized by a large number, varying degrees of quality, timeliness and multidimensional features.
  • sales CRM the salesperson is the core user, and the salesperson itself has the difference in sales ability, industry accumulation and geographical preference. It is a critical issue for sales personnel to efficiently and accurately find customer profiles that match their business capabilities.
  • the salesperson will generally check the customer information he wants to follow according to the conditions, and then go to the phone to follow up the specific customer information and reach an order.
  • the way in which such information is obtained requires the salesperson to obtain it manually, and the obtained customer data depends on the subjective understanding of the customer's subjective knowledge of the customer, and the efficiency and matching degree of obtaining the customer's data are relatively low.
  • the customer information is input into the CRM system by a dedicated staff member, and then distributed to the salesperson by the system randomly or according to the principle of even distribution. This distribution method is less targeted, and the salesperson can only passively accept the customer data, ignoring The difference in the sales staff themselves.
  • the invention provides a data distribution method and device for distributing information in a targeted manner.
  • an embodiment of the present invention provides a data distribution method, including:
  • the to-be-distributed materials are distributed to corresponding data recipients according to the degree of matching.
  • an embodiment of the present invention further provides a data distribution apparatus, including:
  • a data feature acquiring module configured to acquire data feature information of one or more materials to be distributed
  • a personnel feature obtaining module configured to acquire personnel characteristic information of one or more data receiving personnel
  • a feature matching module configured to match the data feature information with the personnel feature information
  • a data distribution module configured to distribute the to-be-distributed data to a corresponding data receiving person according to a matching degree.
  • an embodiment of the present invention further provides a storage medium including computer executable instructions for executing a data distribution method when executed by a computer processor, the method comprising:
  • the to-be-distributed materials are distributed to corresponding data recipients according to the degree of matching.
  • the embodiment of the present invention further provides a data distribution device, including:
  • One or more processors are One or more processors;
  • a memory that stores one or more programs
  • the to-be-distributed materials are distributed to corresponding data recipients according to the degree of matching.
  • the data to be distributed is distributed to the corresponding data receiving personnel according to the matching degree, and the data can be distributed in a targeted manner.
  • Embodiment 1 is a flowchart of a data distribution method in Embodiment 1 of the present invention.
  • FIG. 2A is a flowchart of a data distribution method in Embodiment 2 of the present invention.
  • FIG. 2B is a schematic diagram showing a result of analyzing customer data in a data distribution method according to Embodiment 2 of the present invention
  • FIG. 2C is a schematic diagram showing the distribution result of using the existing data distribution method in Embodiment 2 of the present invention.
  • FIG. 2D is a schematic diagram of distribution results provided in a data distribution method according to Embodiment 2 of the present invention.
  • FIG. 2E is a schematic diagram of a data distribution display interface in a data distribution method according to Embodiment 2 of the present invention.
  • Embodiment 3 is a structural diagram of a data distribution apparatus in Embodiment 3 of the present invention.
  • Embodiment 4 is a structural diagram of a data distribution device in Embodiment 3 of the present invention.
  • FIG. 1 is a flowchart of a data distribution method according to a first embodiment of the present invention.
  • the present embodiment is applicable to a data distribution apparatus provided by an embodiment of the present invention.
  • Executing, the device may be implemented by using a software or a hardware, and the device may be integrated into a mobile terminal, a tablet computer, a fixed terminal, or a server, as shown in FIG.
  • the information to be distributed may be one piece of information or multiple pieces of information.
  • the type of the to-be-distributed material may specifically be, but is not limited to, any one of a file, a document, and a customer profile.
  • the material feature information is composed of at least one keyword, and the keyword represents the content or attribute included in the to-be-distributed material, including but not limited to industry information to which the data belongs, enterprise business information and data of the enterprise to which the data belongs. At least one of a geographical location of the affiliated company, a telephone communication record of the material, and a visit record of the information.
  • the telephone communication record is the number or time of historical telephone communication performed by the relevant person (such as a customer or a salesperson) for the data.
  • the visit record is the number, location or time of historical visits made by the relevant person (eg, customer or salesperson) for the material.
  • the content to be distributed may be obtained by parsing the to-be-distributed data, and the keyword is extracted from the included content as the feature information, or is summarized according to the included content.
  • a keyword capable of characterizing the meaning or attribute of the material to be distributed, and the summarized keyword is used as the material feature information.
  • the data receiving personnel is a receiving object of the data to be distributed.
  • the personnel characteristic information represents related information of the data receiving personnel, including but not limited to at least one of industry information engaged by the data receiving personnel, business operation information of the enterprise, geographical location, telephone communication record, and visit record.
  • the telephone communication record is the historical telephone communication time or time for a data receiving person (such as a customer or a sales person) for a certain data.
  • a visit record is the number, location, or time of historical visits to a profile by a person (such as a customer or salesperson) who receives the data.
  • the data feature information includes industry information to which the data belongs, enterprise business information of the enterprise to which the data belongs, geographic location of the enterprise to which the data belongs, telephone communication record of the data, and visit of the data Recording
  • the personnel characteristic information includes industry information engaged by the data receiving personnel, business operation information of the enterprise, geographic location, telephone communication record, and visit record.
  • S104 Distribute the to-be-distributed data to a corresponding data receiving personnel according to the degree of matching.
  • the piece of data is Distribute to the recipient of the data.
  • the industry information to which the data belongs is matched with the industry information of the data receiving personnel, and the information is preferentially distributed to the personnel of the same industry; the business operation information of the enterprise to which the data belongs and the business operation information of the enterprise where the data receiving personnel are located are performed.
  • Match, Priority is given to distributing the information to persons with similar business conditions; matching the geographical location of the data to the geographic location of the recipient of the data, prioritizing the distribution of the information to the same geographical location, and so on.
  • the data feature information of the data to be distributed is matched with the personnel feature information of the data receiving personnel, and the data to be distributed is distributed to the corresponding data receiving personnel according to the matching degree, so that the data can be distributed in a targeted manner.
  • the data feature information and the personnel feature information are matrixized, and specifically includes:
  • the data feature information and the person feature information are respectively a data feature matrix and a personnel feature matrix, wherein each of the to-be-distributed materials and each of the data receiving personnel have the same feature dimension.
  • matching the data feature information with the personnel feature information includes:
  • a similarity matrix of the data feature matrix and the person feature matrix is calculated, and the similarity matrix is used as a matching degree.
  • the material feature matrix is obtained by performing feature modeling on the data to be distributed.
  • the data feature matrix includes n feature values, and the features represented by the feature values include, but are not limited to, at least one of industry information, business operation information, geographic location, telephone communication record, and visit record.
  • the similarity may specifically be, but not limited to, any one of a cosine value, an Euclidean distance, a Pearson correlation coefficient, a Spearman rank correlation coefficient, a Tanimoto coefficient, a log likelihood similarity, and a Manhattan distance.
  • the industry information contained in the data feature matrix is IT, chemical, and legal
  • the eigenvalue corresponding to the chemical in the data feature matrix is assigned a value of 1
  • the remaining features are The eigenvalues corresponding to IT and law are assigned a value of zero.
  • similar methods are used for assignment.
  • the preferences of the features in the personnel feature matrix are respectively assigned.
  • each feature item in the person feature matrix is the same as each feature item in the data feature matrix.
  • each feature is assigned, and the assignment range is 0 ⁇ 1. For example, if the person has a degree of interest in Beijing of 1, the feature value corresponding to the feature of Beijing is assigned a value of 1. If the degree of interest of the person to IT is 0.5, the feature value corresponding to the feature of IT is assigned a value of 0.5, etc. .
  • the distributing the data to be distributed to the corresponding data receiver according to the degree of matching further includes:
  • the plurality of to-be-distributed materials are separately distributed to the corresponding data receiving personnel by using an optimization algorithm, so that one piece of information is distributed to one person and distributed.
  • the sum of the similarities is the largest.
  • the optimization algorithm is preferably but not limited to a weighted bipartite graph optimal matching algorithm.
  • using an optimization algorithm to separately distribute the plurality of data to be distributed to the corresponding data receiving personnel mainly includes the following two distribution methods:
  • the first distribution method based on the calculated similarity matrix, an optimization algorithm is used to distribute a plurality of data to be distributed to a corresponding data receiving person in a fixed distribution form, that is, when each data is distributed,
  • the same number of items to be distributed are fixedly allocated by personnel.
  • each data receiver is fixedly allocated k pieces of data to be distributed, where m and t are positive integers, and m is greater than t;
  • the similarity matrix C mt is transformed into a similarity matrix C m(t ⁇ k) of t ⁇ k data receivers in the column direction; the m data to be distributed is distributed to t ⁇ k virtual data receiving by an optimization algorithm. personnel.
  • the second distribution method is: based on the calculated similarity matrix, the plurality of to-be-distributed materials are respectively distributed to the corresponding data receiving personnel in a variable distribution amount by using an optimization algorithm, that is, when distributing the data, according to each The data receiver's personal ability to assign a different number of data to be distributed to each data receiver.
  • the to-be-distributed data to be distributed is distributed to t data receiving personnel
  • a set number of to-be-distributed materials are separately allocated according to the personal capabilities of each data receiving person, wherein the i-th data receiving personnel
  • the number of allocated data to be distributed is denoted as k i , where m and t are positive integers, and m is greater than t; the similarity matrix C mt is transformed in the column direction to Similarity matrix of virtual data receivers Distribution of m data to the optimization algorithm A virtual data receiver.
  • the data of the data to be distributed and the personnel characteristic information of the data receiver are matrixed, and the similarity of the two matrices is calculated, and the data to be distributed is distributed to the corresponding data based on the principle of maximum similarity. Receiving personnel, able to distribute information in a targeted manner.
  • FIG. 2A is a flowchart of a data distribution method according to Embodiment 2 of the present invention.
  • the data to be distributed is used as customer data, and the data receiving personnel considers the sales as a scenario, and the present invention is described in detail, such as As shown in FIG. 2A, the specific includes:
  • S201 Analyze customer data, and extract data feature information that is used to represent the customer data, where the data feature information is a data feature matrix that includes n feature values.
  • the customer data is classified and described in a tree manner as shown in FIG. 2B, and all the leaf nodes in the tree are all feature values.
  • n feature values Can be expressed as an n ⁇ 1 dimensional matrix.
  • Customer profile information can describe a row matrix, ie data profile matrix A ⁇ security security, office culture, medical health, ..., Beijing, Shanghai, Guangzhou, ... ⁇ to indicate that its value will be similar ⁇ 0,1,0,...,0,1,0,... ⁇ (1 indicates that the feature is met, 0 means no), the matrix indicates that it is an office culture and education industry, customer information in the Shanghai area.
  • all the feature values of the customer data can be represented as an m ⁇ n matrix A mn .
  • the salesperson's preference can also be expressed as a matrix or sales.
  • the matrix indicates that the salesperson's interest in the safety and security industry is 0.3, the degree of interest in office culture and education is 0.5, and the degree of interest in the medical and health industry is 0.1. The greater the value, the more interested. If the total number of salespersons is t, then all salespersons' preference values are represented as an nxt matrix Bnt .
  • the similarity matrix of the two is calculated. Due to the sparsity of the two matrices, the cosine similarity matching algorithm is used to represent the customer data characteristics and sales. The degree of matching of people's interests.
  • a ⁇ A 1 , A 2 ,..., A n ⁇ represents the data feature matrix of a customer profile
  • B ⁇ B 1 , B 2 ,..., B n ⁇ represents the personnel identity matrix of a salesperson.
  • the value of i is 1 to n, and the value of the cosine similarity is larger, indicating that the characteristics of the customer data and the preference of the salesperson match.
  • the plurality of to-be-distributed materials are separately distributed to the corresponding data receiving personnel by using an optimization algorithm based on the calculated similarity matrix.
  • t ⁇ k virtual sales personnel can be abstracted (k times for each sales), and the similarity matrix C can be used.
  • Mt is transformed into a matrix of t ⁇ k dimensions C m(t ⁇ k) in the direction of the column. , indicating m customer data, t ⁇ k virtual sales staff similarity matrix.
  • the optimal distribution of m customer data to t ⁇ k virtual sales personnel is converted into a weighted bipartite graph (t ⁇ k is generally less than m), and the set of sales data and the set of virtual sales personnel are composed of two points.
  • a customer data matched by t ⁇ k virtual sales personnel can be obtained, so that the overall similarity is maximized, thereby achieving the best matching effect.
  • FIG. 2E a schematic diagram of a distribution interface is provided in this embodiment.
  • the user can select the customer data to be distributed, and the salesperson who receives the data, and then click the distribution button to start the embodiment of the present invention.
  • the data distribution process and the distribution results as shown below in Figure 2E.
  • FIG. 3 is a schematic structural diagram of a data distribution apparatus according to Embodiment 3 of the present invention.
  • the apparatus may be implemented by using software or hardware, and the apparatus may be integrated into a mobile terminal, a tablet computer, a fixed terminal, or a server, such as As shown in FIG. 3, the specific structure of the device is as follows: a material feature acquisition module 31, a personnel feature acquisition module 32, a feature matching module 33, and a data distribution module 34.
  • the data feature obtaining module 31 is configured to acquire data feature information of one or more materials to be distributed;
  • the person feature obtaining module 32 is configured to acquire personnel feature information of one or more data receiving personnel;
  • the feature matching module 33 is configured to match the material feature information with the personnel feature information
  • the data distribution module 34 is configured to distribute the to-be-distributed materials to corresponding data receiving personnel according to the degree of matching.
  • the data distribution device in the embodiment is used to perform the data distribution method described in the foregoing embodiments, and the technical principle and the generated technical effects are similar, and details are not described herein again.
  • the data feature information and the personnel feature information are respectively a data feature matrix and a personnel feature matrix, wherein each of the to-be-distributed materials and each data receiving person has a special feature The same dimension;
  • the feature matching module 33 is specifically configured to calculate a similarity matrix of the data feature matrix and the person feature matrix, and use the similarity matrix as a matching degree.
  • the data feature acquiring module is configured to obtain a data feature matrix by performing feature modeling on the data to be distributed.
  • the data feature obtaining module 31 is specifically configured to parse the data to be distributed; if the corresponding feature in the data feature matrix is included, assign the corresponding feature value in the data feature matrix to 1 Otherwise, the value is 0.
  • the personnel feature obtaining module 32 is specifically configured to respectively assign a preference to each feature in the personnel feature matrix according to the data receiving personnel.
  • the data distribution module 34 is specifically configured to: when the plurality of materials to be distributed are multiple, use an optimization algorithm to distribute multiple to-be-distributed data to each other based on the calculated similarity matrix. Corresponding data receivers, so that a piece of information is distributed to a person and the sum of similarities is maximized after distribution.
  • the data distribution module 34 is specifically configured to allocate, to each data receiving person, k pieces of to-be-distributed data for the case where the m pieces of data to be distributed are distributed to t data receiving personnel, where m , t is a positive integer, m is greater than t;
  • the similarity matrix C mt is transformed in the column direction into a similarity matrix C m(t ⁇ k) of t ⁇ k data receivers;
  • the m pieces of data to be distributed are distributed to t ⁇ k virtual data receivers by using an optimization algorithm.
  • the data distribution module 34 is specifically configured to allocate, according to the personal capabilities of each data receiving personnel, the distribution of the m to-be-distributed materials to be distributed to the t-receiving personnel.
  • Distribution of m data to the optimization algorithm A virtual data receiver.
  • the optimization algorithm is a weighted bipartite graph optimal matching algorithm.
  • the similarity is any one of a cosine value, an Euclidean distance, a Pearson correlation coefficient, a Spearman rank correlation coefficient, a Tanimoto coefficient, a log likelihood similarity, and a Manhattan distance.
  • the data distribution device described in the above embodiments is used to perform the data distribution method described in the above embodiments, and the technical principle and the generated technical effects are similar, and are not described herein again.
  • a fourth embodiment of the present invention provides a data distribution device, including the data distribution device provided by any embodiment of the present invention, which can be integrated into a mobile terminal, a tablet computer, a fixed terminal, or a server.
  • an embodiment of the present invention provides a data distribution device, which includes a processor 40, a memory 41, an input device 42, and an output device 43.
  • the number of processors 40 in the data distribution device may be One or more, one processor 40 is taken as an example in FIG. 4; the processor 40, the memory 41, the input device 42, and the output device 43 in the data distribution device may be connected by a bus or other means, in FIG. Connection is an example.
  • the memory 41 is used as a computer readable storage medium, and can be used for storing software programs, computer executable programs, and modules, such as program instructions/modules corresponding to the data distribution method in the embodiment of the present invention (for example, data characteristics in the data distribution device)
  • the processor 30 executes various functional applications and data processing of the material distribution device by executing software programs, instructions, and modules stored in the memory 31, that is, implementing the above-described data distribution method.
  • the memory 41 may mainly include a storage program area and an storage data area, wherein the storage program area may store an operating system, an application required for at least one function; the storage data area may store data created according to usage of the material distribution device, and the like. Further, the memory 41 may include a high speed random access memory, and may also include a nonvolatile memory such as at least one magnetic disk storage device, flash memory device, or other nonvolatile solid state storage device. In some examples, memory 41 may further include memory remotely located relative to processor 40, which may be connected to the data distribution device over a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
  • Input device 42 can be used to receive input numeric or character information, as well as to generate key signal inputs related to user settings and function control of the data distribution device.
  • the output device 43 may include a display device such as a display screen.
  • Embodiments of the present invention also provide a storage medium including computer executable instructions for executing a data distribution method when executed by a computer processor, the method comprising:
  • the to-be-distributed materials are distributed to corresponding data recipients according to the degree of matching.
  • the present invention can be implemented by software and necessary general hardware, and can also be implemented by hardware, but in many cases, the former is a better implementation. .
  • the technical solution of the present invention which is essential or contributes to the prior art, may be embodied in the form of a software product, which may be stored in a computer readable storage medium, such as a floppy disk of a computer. , Read-Only Memory (ROM), Random Access Memory (RAM), Flash (FLASH), hard disk or optical disk, etc., including a number of instructions to make a computer device (can be a personal computer)
  • the server, or network device, etc. performs the methods described in various embodiments of the present invention.
  • each unit and module included in the data distribution device is divided according to functional logic, but is not limited to the above division, as long as the corresponding function can be implemented;
  • the specific names of the functional units are also for convenience of distinguishing from each other and are not intended to limit the scope of the present invention.

Abstract

Disclosed are a data distribution method and device and a storage medium. The method comprises: obtaining data feature information of one or more pieces of data to be distributed; obtaining recipient feature information of one or more data recipients; matching the data feature information with the recipient feature information; and distributing the data to be distributed to a corresponding data recipient according to a matching degree. Embodiments of the present invention can achieve targeted data distribution by matching data feature information of data to be distributed with recipient feature information of a data recipient and distributing the data to be distributed to the corresponding data recipient according to a matching degree.

Description

资料分发方法、装置和存储介质Data distribution method, device and storage medium
本专利申请要求于2016年05月25日提交的、申请号为201610354138.0、申请人为百度在线网络技术(北京)有限公司、发明名称为“资料分发方法及装置”的中国专利申请的优先权,该申请的全文以引用的方式并入本申请中。This patent application claims priority from the Chinese patent application filed on May 25, 2016, the application number is 201610354138.0, the applicant is Baidu Online Network Technology (Beijing) Co., Ltd., and the invention name is "data distribution method and device". The full text of the application is hereby incorporated by reference.
技术领域Technical field
本发明实施例涉及信息处理技术领域,尤其涉及一种资料分发方法、装置和存储介质。The embodiments of the present invention relate to the field of information processing technologies, and in particular, to a data distribution method, apparatus, and storage medium.
背景技术Background technique
在客户关系管理(Customer Relationship Management,CRM)系统中,客户资料是核心数据和资产,其存在着数量庞大、质量程度不一、时效性强和特征多维等特点。在销售型CRM系统中,销售人员是核心用户,而销售人员本身也有销售能力、行业积累和地域偏好的差异。销售人员高效精准找到与自己业务能力相匹配的客户资料是一个至关重要的问题。In the Customer Relationship Management (CRM) system, customer data is the core data and assets, which are characterized by a large number, varying degrees of quality, timeliness and multidimensional features. In the sales CRM system, the salesperson is the core user, and the salesperson itself has the difference in sales ability, industry accumulation and geographical preference. It is a critical issue for sales personnel to efficiently and accurately find customer profiles that match their business capabilities.
以百度的销售CRM系统为例,销售人员一般会按照条件查询获取他想跟进的客户资料,然后去电话跟进这些特定客户资料并达成订单。这种资料的获取方式需要销售人员手工操作来获取,并且获取的客户资料依赖于销售人员主观对客户资料的主观理解,获取客户资料的效率以及匹配度都比较低。Taking Baidu's sales CRM system as an example, the salesperson will generally check the customer information he wants to follow according to the conditions, and then go to the phone to follow up the specific customer information and reach an order. The way in which such information is obtained requires the salesperson to obtain it manually, and the obtained customer data depends on the subjective understanding of the customer's subjective knowledge of the customer, and the efficiency and matching degree of obtaining the customer's data are relatively low.
或者,由专门的工作人员将客户资料输入CRM系统中,然后由系统随机或者按照平均分配的原则分发给销售人员,这种分配方式针对性较差,销售人员只能被动的接受客户资料,忽略了销售人员本身的差异。 Or, the customer information is input into the CRM system by a dedicated staff member, and then distributed to the salesperson by the system randomly or according to the principle of even distribution. This distribution method is less targeted, and the salesperson can only passively accept the customer data, ignoring The difference in the sales staff themselves.
发明内容Summary of the invention
本发明提供一种资料分发方法及装置,以有针对性的分发资料。The invention provides a data distribution method and device for distributing information in a targeted manner.
第一方面,本发明实施例提供了一种资料分发方法,包括:In a first aspect, an embodiment of the present invention provides a data distribution method, including:
获取一个或多个待分发资料的资料特征信息;Obtaining data feature information of one or more materials to be distributed;
获取一个或多个资料接收人员的人员特征信息;Obtaining personnel characteristic information of one or more data receiving personnel;
将所述资料特征信息与所述人员特征信息进行匹配;Matching the material feature information with the person feature information;
按照匹配程度将所述待分发资料分发给相应的资料接收人员。The to-be-distributed materials are distributed to corresponding data recipients according to the degree of matching.
第二方面,本发明实施例还提供了一种资料分发装置,包括:In a second aspect, an embodiment of the present invention further provides a data distribution apparatus, including:
资料特征获取模块,用于获取一个或多个待分发资料的资料特征信息;a data feature acquiring module, configured to acquire data feature information of one or more materials to be distributed;
人员特征获取模块,用于获取一个或多个资料接收人员的人员特征信息;a personnel feature obtaining module, configured to acquire personnel characteristic information of one or more data receiving personnel;
特征匹配模块,用于将所述资料特征信息与所述人员特征信息进行匹配;a feature matching module, configured to match the data feature information with the personnel feature information;
资料分发模块,用于按照匹配程度将所述待分发资料分发给相应的资料接收人员。And a data distribution module, configured to distribute the to-be-distributed data to a corresponding data receiving person according to a matching degree.
第三方面,本发明实施例还提供了一种包含计算机可执行指令的存储介质,所述计算机可执行指令在由计算机处理器执行时用于执行一种资料分发方法,该方法包括:In a third aspect, an embodiment of the present invention further provides a storage medium including computer executable instructions for executing a data distribution method when executed by a computer processor, the method comprising:
获取一个或多个待分发资料的资料特征信息;Obtaining data feature information of one or more materials to be distributed;
获取一个或多个资料接收人员的人员特征信息;Obtaining personnel characteristic information of one or more data receiving personnel;
将所述资料特征信息与所述人员特征信息进行匹配;Matching the material feature information with the person feature information;
按照匹配程度将所述待分发资料分发给相应的资料接收人员。The to-be-distributed materials are distributed to corresponding data recipients according to the degree of matching.
第四方面,本发明实施例还提供了一种资料分发设备,包括:In a fourth aspect, the embodiment of the present invention further provides a data distribution device, including:
一个或者多个处理器;One or more processors;
存储器,存储有一个或多个程序; a memory that stores one or more programs;
当所述一个或多个程序被所述一个或者多个处理器执行时,使得所述一个或多个处理器执行如下操作:When the one or more programs are executed by the one or more processors, causing the one or more processors to perform the following operations:
获取一个或多个待分发资料的资料特征信息;Obtaining data feature information of one or more materials to be distributed;
获取一个或多个资料接收人员的人员特征信息;Obtaining personnel characteristic information of one or more data receiving personnel;
将所述资料特征信息与所述人员特征信息进行匹配;Matching the material feature information with the person feature information;
按照匹配程度将所述待分发资料分发给相应的资料接收人员。The to-be-distributed materials are distributed to corresponding data recipients according to the degree of matching.
本发明实施例通过将待分发资料的资料特征信息与资料接收人员的人员特征信息进行匹配,按照匹配程度将所述待分发资料分发给相应的资料接收人员,能够有针对性的分发资料。In the embodiment of the present invention, by matching the data feature information of the data to be distributed with the personnel feature information of the data receiving personnel, the data to be distributed is distributed to the corresponding data receiving personnel according to the matching degree, and the data can be distributed in a targeted manner.
附图说明DRAWINGS
图1是本发明实施例一中的一种资料分发方法的流程图;1 is a flowchart of a data distribution method in Embodiment 1 of the present invention;
图2A是本发明实施例二中的一种资料分发方法的流程图;2A is a flowchart of a data distribution method in Embodiment 2 of the present invention;
图2B是本发明实施例二中的一种资料分发方法中的客户资料解析结果示意图;2B is a schematic diagram showing a result of analyzing customer data in a data distribution method according to Embodiment 2 of the present invention;
图2C是本发明实施例二中的提供采用现有的资料分发方法的分发结果示意图;2C is a schematic diagram showing the distribution result of using the existing data distribution method in Embodiment 2 of the present invention;
图2D是本发明实施例二中的一种资料分发方法中提供的分发结果示意图;2D is a schematic diagram of distribution results provided in a data distribution method according to Embodiment 2 of the present invention;
图2E是本发明实施例二中的一种资料分发方法中资料分发显示界面示意图;2E is a schematic diagram of a data distribution display interface in a data distribution method according to Embodiment 2 of the present invention;
图3是本发明实施例三中的一种资料分发装置的结构图;3 is a structural diagram of a data distribution apparatus in Embodiment 3 of the present invention;
图4是本发明实施例三中的一种资料分发设备的结构图。 4 is a structural diagram of a data distribution device in Embodiment 3 of the present invention.
具体实施方式detailed description
下面结合附图和实施例对本发明作进一步的详细说明。可以理解的是,此处所描述的具体实施例仅仅用于解释本发明,而非对本发明的限定。另外还需要说明的是,为了便于描述,附图中仅示出了与本发明相关的部分而非全部结构。The present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It is understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention. It should also be noted that, for ease of description, only some, but not all, of the structures related to the present invention are shown in the drawings.
实施例一Embodiment 1
图1为本发明实施例一提供的一种资料分发方法的流程图,本实施例可适用于各个场景下将资料分发给相关人员的情况,该方法可以由本发明实施例提供的资料分发装置来执行,该装置可采用软件或硬件的方式实现,该装置可集成于移动终端、平板电脑、固定终端或服务器中,如图1所示,具体包括:FIG. 1 is a flowchart of a data distribution method according to a first embodiment of the present invention. The present embodiment is applicable to a data distribution apparatus provided by an embodiment of the present invention. Executing, the device may be implemented by using a software or a hardware, and the device may be integrated into a mobile terminal, a tablet computer, a fixed terminal, or a server, as shown in FIG.
S101、获取一个或多个待分发资料的资料特征信息。S101. Acquire data feature information of one or more materials to be distributed.
其中,所述待分发资料可以为一份资料,也可以为多份资料。所述待分发资料的类型具体可以为但不限于文件、文档和客户资料中的任一种。The information to be distributed may be one piece of information or multiple pieces of information. The type of the to-be-distributed material may specifically be, but is not limited to, any one of a file, a document, and a customer profile.
其中,所述资料特征信息由至少一个关键词组成,该关键词表征了所述待分发资料中包含的内容或属性,包含但不限于资料所属的行业信息、资料所属企业的企业经营信息、资料所属企业的地理位置、该资料的电话沟通记录和该资料的拜访记录中的至少一种。其中,所述电话沟通记录为相关人员(例如客户或销售人员)针对该资料进行的历史电话沟通次数或时间。所述拜访记录为相关人员(例如客户或销售人员)针对该资料进行的历史拜访次数、地点或时间。The material feature information is composed of at least one keyword, and the keyword represents the content or attribute included in the to-be-distributed material, including but not limited to industry information to which the data belongs, enterprise business information and data of the enterprise to which the data belongs. At least one of a geographical location of the affiliated company, a telephone communication record of the material, and a visit record of the information. The telephone communication record is the number or time of historical telephone communication performed by the relevant person (such as a customer or a salesperson) for the data. The visit record is the number, location or time of historical visits made by the relevant person (eg, customer or salesperson) for the material.
具体的,可通过解析所述待分发资料,获取所述待分发资料中包含的内容,从包含的内容中提取关键词作为资料特征信息,或者根据包含的内容概括得出 能够表征所述待分发资料含义或属性的关键词,将概括的关键词作为资料特征信息。Specifically, the content to be distributed may be obtained by parsing the to-be-distributed data, and the keyword is extracted from the included content as the feature information, or is summarized according to the included content. A keyword capable of characterizing the meaning or attribute of the material to be distributed, and the summarized keyword is used as the material feature information.
S102、获取一个或多个资料接收人员的人员特征信息。S102. Acquire person characteristic information of one or more data receiving personnel.
其中,所述资料接收人员为所述待分发资料的接收对象。所述人员特征信息表征资料接收人员的相关信息,包含但不限于资料接收人员所从事的行业信息、所在企业的企业经营信息、所在地理位置、电话沟通记录和拜访记录中的至少一种。其中,所述电话沟通记录为资料接收人员(例如客户或销售人员)针对某份资料的历史电话沟通次数或时间。拜访记录为资料接收相关人员(例如客户或销售人员)针对某份资料的历史拜访次数、地点或时间。The data receiving personnel is a receiving object of the data to be distributed. The personnel characteristic information represents related information of the data receiving personnel, including but not limited to at least one of industry information engaged by the data receiving personnel, business operation information of the enterprise, geographical location, telephone communication record, and visit record. The telephone communication record is the historical telephone communication time or time for a data receiving person (such as a customer or a sales person) for a certain data. A visit record is the number, location, or time of historical visits to a profile by a person (such as a customer or salesperson) who receives the data.
S103、将所述资料特征信息与所述人员特征信息进行匹配。S103. Match the data feature information with the personnel feature information.
具体的,以上述S102和S103为例,若所述资料特征信息包含资料所属的行业信息、资料所属企业的企业经营信息、资料所属企业的地理位置、该资料的电话沟通记录和该资料的拜访记录,所述人员特征信息包含资料接收人员所从事的行业信息、所在企业的企业经营信息、所在地理位置、电话沟通记录和拜访记录。在进行匹配时,将所述资料特征信息中包含的各个特征项和所述人员特征信息中包含的各个特征项进行一一匹配。Specifically, taking the above S102 and S103 as an example, if the data feature information includes industry information to which the data belongs, enterprise business information of the enterprise to which the data belongs, geographic location of the enterprise to which the data belongs, telephone communication record of the data, and visit of the data Recording, the personnel characteristic information includes industry information engaged by the data receiving personnel, business operation information of the enterprise, geographic location, telephone communication record, and visit record. When matching is performed, each feature item included in the material feature information and each feature item included in the person feature information are matched one by one.
S104、按照匹配程度将所述待分发资料分发给相应的资料接收人员。S104. Distribute the to-be-distributed data to a corresponding data receiving personnel according to the degree of matching.
具体的,如果某一份待分发资料与某一资料接收人员的匹配度超过预设阈值,或者该份资料与该资料接收人员的匹配度为所有匹配度中的最大值,则将该份资料分发给该资料接收人员。例如,将资料所属的行业信息与资料接收人员所从事的行业信息进行匹配,优先将该资料分发给相同行业的人员;将资料所属企业的企业经营信息与资料接收人员所在企业的企业经营信息进行匹配, 优先将该资料分发给企业经营状况相似的人员;将资料归属的地理位置与资料接收人员所在地理位置进行匹配,优先将该资料分发给相同地理位置的人员,等等。Specifically, if the matching degree of a piece of information to be distributed to a certain data receiving person exceeds a preset threshold, or the matching degree of the piece of data with the data receiving person is the maximum value of all matching degrees, the piece of data is Distribute to the recipient of the data. For example, the industry information to which the data belongs is matched with the industry information of the data receiving personnel, and the information is preferentially distributed to the personnel of the same industry; the business operation information of the enterprise to which the data belongs and the business operation information of the enterprise where the data receiving personnel are located are performed. Match, Priority is given to distributing the information to persons with similar business conditions; matching the geographical location of the data to the geographic location of the recipient of the data, prioritizing the distribution of the information to the same geographical location, and so on.
本实施例通过将待分发资料的资料特征信息与资料接收人员的人员特征信息进行匹配,按照匹配程度将所述待分发资料分发给相应的资料接收人员,能够有针对性的分发资料。In this embodiment, the data feature information of the data to be distributed is matched with the personnel feature information of the data receiving personnel, and the data to be distributed is distributed to the corresponding data receiving personnel according to the matching degree, so that the data can be distributed in a targeted manner.
示例性的,在上述实施例的基础上,为便于计算所述资料特征信息与所述人员特征信息的匹配程度,本发明实施例将所述资料特征信息和人员特征信息矩阵化,具体包括:Exemplarily, on the basis of the foregoing embodiment, in order to facilitate the calculation of the matching degree between the data feature information and the personnel feature information, the embodiment of the present invention, the data feature information and the personnel feature information are matrixized, and specifically includes:
所述资料特征信息和人员特征信息分别为资料特征矩阵和人员特征矩阵,其中每个待分发资料和每个资料接收人员所具有的特征维度相同。The data feature information and the person feature information are respectively a data feature matrix and a personnel feature matrix, wherein each of the to-be-distributed materials and each of the data receiving personnel have the same feature dimension.
相应的,将所述资料特征信息与所述人员特征信息进行匹配,包括:Correspondingly, matching the data feature information with the personnel feature information includes:
计算所述资料特征矩阵与人员特征矩阵的相似度矩阵,将所述相似度矩阵作为匹配程度。A similarity matrix of the data feature matrix and the person feature matrix is calculated, and the similarity matrix is used as a matching degree.
其中,所述资料特征矩阵通过对待分发资料进行特征建模获得。所述资料特征矩阵包含n个特征值,所述特征值所表征的特征包含但不限于行业信息、企业经营信息、地理位置、电话沟通记录和拜访记录中的至少一种。所述相似度具体可以为但不限于余弦值、欧式距离、皮尔逊相关系数、Spearman秩相关系数、Tanimoto系数、对数似然相似度和曼哈顿距离中的任意一种。The material feature matrix is obtained by performing feature modeling on the data to be distributed. The data feature matrix includes n feature values, and the features represented by the feature values include, but are not limited to, at least one of industry information, business operation information, geographic location, telephone communication record, and visit record. The similarity may specifically be, but not limited to, any one of a cosine value, an Euclidean distance, a Pearson correlation coefficient, a Spearman rank correlation coefficient, a Tanimoto coefficient, a log likelihood similarity, and a Manhattan distance.
在对所述资料特征矩阵进行赋值时,可采用如下方法:When assigning the data feature matrix, the following methods can be used:
解析待分发资料,如果包含所述资料特征矩阵中对应的特征,则将所述资 料特征矩阵中对应的特征值赋为1,否则赋值为0。Parsing the data to be distributed, and if the corresponding feature in the data feature matrix is included, the capital is The corresponding feature value in the material feature matrix is assigned to 1, otherwise the value is assigned to 0.
如果所述资料特征矩阵包含的行业信息为IT、化工和法律,如果通过解析所述待分发资料确定行业为化工,则将所述资料特征矩阵中化工对应的特征值赋值为1,其余特征即IT和法律对应的特征值赋值为0。类似的,对于其它的特征例如企业经营信息、地理位置、电话沟通记录和拜访记录,采用相似的方法进行赋值。If the industry information contained in the data feature matrix is IT, chemical, and legal, if the industry is determined to be chemical by analyzing the data to be distributed, the eigenvalue corresponding to the chemical in the data feature matrix is assigned a value of 1, and the remaining features are The eigenvalues corresponding to IT and law are assigned a value of zero. Similarly, for other features such as business operations information, geographic location, telephone communication records, and visit records, similar methods are used for assignment.
在对所述人员特征矩阵进行赋值时,可采用如下方法:When assigning the personnel feature matrix, the following method can be used:
根据所述资料接收人员对所述人员特征矩阵中各特征的偏好分别进行赋值。According to the data receiving personnel, the preferences of the features in the personnel feature matrix are respectively assigned.
在本发明实施例中,所述人员特征矩阵中的各特征项与所述资料特征矩阵中的各特征项相同。根据人员对特征的感兴趣程度分别为各特征进行赋值,赋值范围为0~1。例如,如果人员对北京感兴趣程度为1,则将北京这个特征对应的特征值赋值为1,如果人员对IT感兴趣程度为0.5,则将IT这个特征对应的特征值赋值为0.5,等等。In the embodiment of the present invention, each feature item in the person feature matrix is the same as each feature item in the data feature matrix. According to the degree of interest of the person to the feature, each feature is assigned, and the assignment range is 0~1. For example, if the person has a degree of interest in Beijing of 1, the feature value corresponding to the feature of Beijing is assigned a value of 1. If the degree of interest of the person to IT is 0.5, the feature value corresponding to the feature of IT is assigned a value of 0.5, etc. .
在对进行所述资料特征矩阵与人员特征矩阵的特征赋值完成之后,在进行资料分发时,所述按照匹配程度将所述待分发资料分发给相应的资料接收人员进一步包括:After the data assignment is performed, the distributing the data to be distributed to the corresponding data receiver according to the degree of matching further includes:
当所述待分发资料为多个时,基于所计算出的相似度矩阵,采用最优化算法将多个待分发资料分别分发给相应的资料接收人员,以使得一份资料分发给一个人员且分发后相似度总和最大。When the plurality of materials to be distributed are multiple, based on the calculated similarity matrix, the plurality of to-be-distributed materials are separately distributed to the corresponding data receiving personnel by using an optimization algorithm, so that one piece of information is distributed to one person and distributed. The sum of the similarities is the largest.
其中,所述最优化算法优选为但不限于加权二分图最优匹配算法。The optimization algorithm is preferably but not limited to a weighted bipartite graph optimal matching algorithm.
具体的,基于所计算出的相似度矩阵,采用最优化算法将多个待分发资料分别分发给相应的资料接收人员主要包括以下两种分发方式: Specifically, based on the calculated similarity matrix, using an optimization algorithm to separately distribute the plurality of data to be distributed to the corresponding data receiving personnel mainly includes the following two distribution methods:
第一种分发方式:基于所计算出的相似度矩阵,采用最优化算法将多个待分发资料分别以固定分发量形式分发给相应的资料接收人员,即在分发资料时,为每个资料接收人员均固定分配相同个数的待分发资料。具体的,对于将m个待分发资料分发给t个资料接收人员的情况,为每个资料接收人员固定分配k个待分发资料,其中m、t均为正整数,m大于t;将所述相似度矩阵Cmt在列方向上变换为t×k个资料接收人员的相似度矩阵Cm(t×k);采用最优化算法将m个待分发资料分发给t×k个虚拟的资料接收人员。The first distribution method: based on the calculated similarity matrix, an optimization algorithm is used to distribute a plurality of data to be distributed to a corresponding data receiving person in a fixed distribution form, that is, when each data is distributed, The same number of items to be distributed are fixedly allocated by personnel. Specifically, in the case of distributing m pieces of data to be distributed to t data receivers, each data receiver is fixedly allocated k pieces of data to be distributed, where m and t are positive integers, and m is greater than t; The similarity matrix C mt is transformed into a similarity matrix C m(t×k) of t×k data receivers in the column direction; the m data to be distributed is distributed to t×k virtual data receiving by an optimization algorithm. personnel.
第二种分发方式:基于所计算出的相似度矩阵,采用最优化算法将多个待分发资料分别以以可变分发量形式分发给相应的资料接收人员,即在分发资料时,根据每个资料接收人员的个人能力,为每个资料接收人员分配不同个数的待分发资料。具体的,对于将待分发m个待分发资料分发给t个资料接收人员的情况,根据每个资料接收人员的个人能力分别分配设定个数的待分发资料,其中,第i个资料接收人员分配的待分发资料的个数记为ki,其中m、t均为正整数,m大于t;将所述相似度矩阵Cmt在列方向上变换为
Figure PCTCN2016096850-appb-000001
个虚拟资料接收人员的相似度矩阵
Figure PCTCN2016096850-appb-000002
采用最优化算法将m个资料分发给
Figure PCTCN2016096850-appb-000003
个虚拟的资料接收人员。
The second distribution method is: based on the calculated similarity matrix, the plurality of to-be-distributed materials are respectively distributed to the corresponding data receiving personnel in a variable distribution amount by using an optimization algorithm, that is, when distributing the data, according to each The data receiver's personal ability to assign a different number of data to be distributed to each data receiver. Specifically, for the case where the to-be-distributed data to be distributed is distributed to t data receiving personnel, a set number of to-be-distributed materials are separately allocated according to the personal capabilities of each data receiving person, wherein the i-th data receiving personnel The number of allocated data to be distributed is denoted as k i , where m and t are positive integers, and m is greater than t; the similarity matrix C mt is transformed in the column direction to
Figure PCTCN2016096850-appb-000001
Similarity matrix of virtual data receivers
Figure PCTCN2016096850-appb-000002
Distribution of m data to the optimization algorithm
Figure PCTCN2016096850-appb-000003
A virtual data receiver.
上述各实施例通过将待分发资料的资料特征信息与资料接收人员的人员特征信息矩阵化,通过计算两个矩阵的相似度,基于最大相似度原则,将所述待分发资料分发给相应的资料接收人员,能够有针对性的分发资料。In the above embodiments, the data of the data to be distributed and the personnel characteristic information of the data receiver are matrixed, and the similarity of the two matrices is calculated, and the data to be distributed is distributed to the corresponding data based on the principle of maximum similarity. Receiving personnel, able to distribute information in a targeted manner.
实施例二 Embodiment 2
图2A为本发明实施例二提供的一种资料分发方法的流程图,本实施例以所述待分发资料为客户资料,所述资料接收人员为销售认为为场景,对本发明进行详细说明,如图2A所示,具体包括:2A is a flowchart of a data distribution method according to Embodiment 2 of the present invention. In this embodiment, the data to be distributed is used as customer data, and the data receiving personnel considers the sales as a scenario, and the present invention is described in detail, such as As shown in FIG. 2A, the specific includes:
S201、解析客户资料,提取表征所述客户资料的资料特征信息,所述资料特征信息为包含n个特征值的资料特征矩阵。S201: Analyze customer data, and extract data feature information that is used to represent the customer data, where the data feature information is a data feature matrix that includes n feature values.
通过解析客户资料,采用如图2B所示树状方式来归类和描述所述客户资料,该树中所有的叶子节点也就是所有的特征值,如果客户资料可以抽象为n个特征值,就可以表示为一个n×1维的矩阵。客户资料特征信息都可以描述一个行矩阵即资料特征矩阵A{安全安保,办公文教,医疗健康,...,北京,上海,广州,...}来表示,它的取值会是类似于{0,1,0,...,0,1,0,...}(1表明符合特征,0表示否),该矩阵表明是一条办公文教行业,上海地域的客户资料。假定库内有m条客户资料,那么客户资料的所有特征取值可以表示为一个m×n的矩阵AmnBy parsing the customer data, the customer data is classified and described in a tree manner as shown in FIG. 2B, and all the leaf nodes in the tree are all feature values. If the customer data can be abstracted into n feature values, Can be expressed as an n × 1 dimensional matrix. Customer profile information can describe a row matrix, ie data profile matrix A{security security, office culture, medical health, ..., Beijing, Shanghai, Guangzhou, ...} to indicate that its value will be similar {0,1,0,...,0,1,0,...} (1 indicates that the feature is met, 0 means no), the matrix indicates that it is an office culture and education industry, customer information in the Shanghai area. Assuming that there are m customer data in the library, all the feature values of the customer data can be represented as an m×n matrix A mn .
S202、获取销售人员的人员特征信息,所述人员特征信息分别为包含n个特征值的资料特征矩阵。S202. Obtain a person characteristic information of a salesperson, where the person feature information is a data feature matrix including n feature values.
具体的,可以基于以往的销售人员的成单记录和跟进记录,分析出销售所跟进的客户资料的统计特征,这里称之为销售人员的偏好,这个偏好也可以表示为一个矩阵即销售人员对应的资料特征矩阵B{安全安保,办公文教,医疗健康,...,北京,上海,广州,...},它的取值类似于{0.3,0.5,0.1...,0},该矩阵表示销售人员对安全安保行业的感兴趣程度为0.3,对办公文教的感兴趣程度为0.5,对医疗健康行业的感兴趣程度为0.1,取值越大表示越感兴趣。如果销售人员的总数为t,则所有销售人员的偏好取值表示为一个n×t的矩阵BntSpecifically, based on the previous salesperson's single record and follow-up records, the statistical characteristics of the customer data that the sales follow-up can be analyzed. This is called the salesperson's preference. This preference can also be expressed as a matrix or sales. Personnel corresponding data feature matrix B{security security, office culture, medical health, ..., Beijing, Shanghai, Guangzhou, ...}, its value is similar to {0.3,0.5,0.1...,0} The matrix indicates that the salesperson's interest in the safety and security industry is 0.3, the degree of interest in office culture and education is 0.5, and the degree of interest in the medical and health industry is 0.1. The greater the value, the more interested. If the total number of salespersons is t, then all salespersons' preference values are represented as an nxt matrix Bnt .
S203、计算所述资料特征矩阵与人员特征矩阵的相似度矩阵。S203. Calculate a similarity matrix of the data feature matrix and the person feature matrix.
基于前面得到的客户资料特征矩阵Amn和销售人员的偏好矩阵Bnt,计算两者的相似度矩阵,由于两个矩阵的稀疏性,此处采用余弦相似度匹配算法来表示客户资料特征与销售人员兴趣的匹配程度。Based on the previously obtained customer data feature matrix A mn and the salesperson's preference matrix B nt , the similarity matrix of the two is calculated. Due to the sparsity of the two matrices, the cosine similarity matching algorithm is used to represent the customer data characteristics and sales. The degree of matching of people's interests.
若A{A1,A2,...,An}表示某条客户资料的资料特征矩阵,B{B1,B2,...,Bn}表示某个销售人员的人员特征矩阵,这两者之间的余弦相似度计算公式如下:If A{A 1 , A 2 ,..., A n } represents the data feature matrix of a customer profile, B{B 1 , B 2 ,..., B n } represents the personnel identity matrix of a salesperson. The cosine similarity between the two is calculated as follows:
Figure PCTCN2016096850-appb-000004
Figure PCTCN2016096850-appb-000004
其中,i的取值为1到n,该余弦相似度取值越大,说明客户资料的特征和销售人员的偏好越匹配。Wherein, the value of i is 1 to n, and the value of the cosine similarity is larger, indicating that the characteristics of the customer data and the preference of the salesperson match.
两个特征矩阵Amn和Bnt进行矩阵余弦相似度计算,最终得到一个相似度矩阵Cmt,其中Cij(i<=m,j<=t)表示客户资料i和销售人员j的相似度系数。The two characteristic matrices A mn and B nt are used to calculate the cosine similarity of the matrix, and finally a similarity matrix C mt is obtained , where C ij(i<=m, j<=t) represents the similarity between the customer data i and the salesperson j. coefficient.
Figure PCTCN2016096850-appb-000005
Figure PCTCN2016096850-appb-000005
基于这个相似度矩阵,可以得到两部分的信息:(1)客户资料维度上,对该客户资料最感兴趣的前k名销售人员列表;(2)销售人员维度上,该销售人员最感兴趣的前k个客户资料列表。Based on this similarity matrix, two pieces of information can be obtained: (1) the top k salesperson list that is most interested in the customer profile in the customer data dimension; (2) the salesperson dimension is most interested in the salesperson dimension A list of the top k customer profiles.
S204、基于所计算出的相似度矩阵,采用最优化算法将多个待分发资料分别分发给相应的资料接收人员。 S204. The plurality of to-be-distributed materials are separately distributed to the corresponding data receiving personnel by using an optimization algorithm based on the calculated similarity matrix.
在分发时需要满足以下两个条件:(1)独占性的需求:一条客户资料只能分发个一个销售人员;(2)最优化的需求:分配后的客户资料的集合的相似度总和要最大。In the distribution, the following two conditions must be met: (1) exclusive demand: one customer data can only be distributed to one salesperson; (2) optimized demand: the sum of similarity of the collected customer data is the largest .
假设有t个销售人员,平均每个销售人员的客户资料推荐个数为k,为了方便计算,可以抽象出t×k个虚拟销售人员(每个销售出现k次),可以将相似度矩阵Cmt在列的方向上变换为t×k维的矩阵Cm(t×k)。,表示了m条客户资料,t×k个虚拟销售人员的相似度矩阵。Suppose there are t sales personnel, and the average number of customer data recommended by each salesperson is k. For the convenience of calculation, t×k virtual sales personnel can be abstracted (k times for each sales), and the similarity matrix C can be used. Mt is transformed into a matrix of t × k dimensions C m(t × k) in the direction of the column. , indicating m customer data, t × k virtual sales staff similarity matrix.
至此,m条客户资料最优分配给t×k个虚拟销售人员就转化为一个加权二分图(t×k一般小于m)的最优匹配问题,销售资料的集合与虚拟销售人员的集合组成二分图的两个顶点集,cij(i<=m,j<=t×k)表示客户资料i和虚拟销售人员j之间的权重系数。基于Kuhn-Munkres算法,可得到一个t×k个虚拟销售人员分别匹配的客户资料,使得总体的相似度达到最大,从而达到最佳的匹配效果。At this point, the optimal distribution of m customer data to t×k virtual sales personnel is converted into a weighted bipartite graph (t×k is generally less than m), and the set of sales data and the set of virtual sales personnel are composed of two points. The two sets of vertices of the graph, c ij (i<=m, j<=t×k), represent the weight coefficients between the customer data i and the virtual salesperson j. Based on the Kuhn-Munkres algorithm, a customer data matched by t×k virtual sales personnel can be obtained, so that the overall similarity is maximized, thereby achieving the best matching effect.
如图2C所示,为初始的销售人员和客户资料之间的相似关系,采用本实施例的方案进行分发后,得到如图2D所示的最终分配关系。As shown in FIG. 2C, after the similar relationship between the initial salesperson and the customer profile is distributed by using the solution of the embodiment, a final distribution relationship as shown in FIG. 2D is obtained.
如图2E所示,为本实施例提供的一种分发界面示意图,在该界面上,用户可以选择待分发的客户资料,以及接收资料的销售人员,然后点击分发按钮,启动本发明实施例提供的资料分发过程,并产生如图2E下方所示的分发结果。As shown in FIG. 2E, a schematic diagram of a distribution interface is provided in this embodiment. On the interface, the user can select the customer data to be distributed, and the salesperson who receives the data, and then click the distribution button to start the embodiment of the present invention. The data distribution process and the distribution results as shown below in Figure 2E.
另外,本发明实施例的方案在百度的销售CRM系统中使用后,客户资料下发的各项指标都有较大幅度的提升,具体如下表所示:In addition, after the solution of the embodiment of the present invention is used in the sales CRM system of Baidu, the indicators issued by the customer data are greatly improved, as shown in the following table:
评估指标Evaluation index 人工分发方式Manual distribution 智能获取方式Intelligent acquisition method
分发速度(1万客户资料)Distribution speed (10,000 customer data) 2小时2 hours 10分钟10 minutes
分配客户资料准确率Assign customer data accuracy ** 90%90%
销售人员成单量(月)Sales staff into a single amount (months) 4.54.5 5.55.5
可见采用了本发明实施例所提出的资料分发方法,节省了大量的分配人力,资料分发也更具针对性,最为关键的是,销售人员的成单量的提升明显,取得了较好的经济效益。It can be seen that the data distribution method proposed by the embodiment of the present invention is adopted, which saves a large amount of distribution manpower, and the data distribution is more targeted. The most important thing is that the salesperson's single-sheet quantity is obviously improved, and a better economy is obtained. benefit.
实施例三Embodiment 3
图3所示为本发明实施例三提供的一种资料分发装置的结构示意图,该装置可采用软件或硬件的方式实现,该装置可集成于移动终端、平板电脑、固定终端或服务器中,如图3所示,该装置的具体结构如下:资料特征获取模块31、人员特征获取模块32、特征匹配模块33和资料分发模块34。FIG. 3 is a schematic structural diagram of a data distribution apparatus according to Embodiment 3 of the present invention. The apparatus may be implemented by using software or hardware, and the apparatus may be integrated into a mobile terminal, a tablet computer, a fixed terminal, or a server, such as As shown in FIG. 3, the specific structure of the device is as follows: a material feature acquisition module 31, a personnel feature acquisition module 32, a feature matching module 33, and a data distribution module 34.
所述资料特征获取模块31用于获取一个或多个待分发资料的资料特征信息;The data feature obtaining module 31 is configured to acquire data feature information of one or more materials to be distributed;
所述人员特征获取模块32用于获取一个或多个资料接收人员的人员特征信息;The person feature obtaining module 32 is configured to acquire personnel feature information of one or more data receiving personnel;
所述特征匹配模块33用于将所述资料特征信息与所述人员特征信息进行匹配;The feature matching module 33 is configured to match the material feature information with the personnel feature information;
所述资料分发模块34用于按照匹配程度将所述待分发资料分发给相应的资料接收人员。The data distribution module 34 is configured to distribute the to-be-distributed materials to corresponding data receiving personnel according to the degree of matching.
本实施例所述资料分发装置用于执行上述各实施例所述的资料分发方法,其技术原理和产生的技术效果类似,这里不再赘述。The data distribution device in the embodiment is used to perform the data distribution method described in the foregoing embodiments, and the technical principle and the generated technical effects are similar, and details are not described herein again.
在上述实施例的基础上,所述资料特征信息和人员特征信息分别为资料特征矩阵和人员特征矩阵,其中每个待分发资料和每个资料接收人员所具有的特 征维度相同;Based on the foregoing embodiment, the data feature information and the personnel feature information are respectively a data feature matrix and a personnel feature matrix, wherein each of the to-be-distributed materials and each data receiving person has a special feature The same dimension;
相应的,所述特征匹配模块33具体用于计算所述资料特征矩阵与人员特征矩阵的相似度矩阵,将所述相似度矩阵作为匹配程度。Correspondingly, the feature matching module 33 is specifically configured to calculate a similarity matrix of the data feature matrix and the person feature matrix, and use the similarity matrix as a matching degree.
在上述实施例的基础上,所述资料特征获取模块用于通过对待分发资料进行特征建模来获得资料特征矩阵。Based on the foregoing embodiment, the data feature acquiring module is configured to obtain a data feature matrix by performing feature modeling on the data to be distributed.
在上述实施例的基础上,所述资料特征获取模块31具体用于解析待分发资料;如果包含所述资料特征矩阵中对应的特征,则将所述资料特征矩阵中对应的特征值赋为1,否则赋值为0。On the basis of the foregoing embodiment, the data feature obtaining module 31 is specifically configured to parse the data to be distributed; if the corresponding feature in the data feature matrix is included, assign the corresponding feature value in the data feature matrix to 1 Otherwise, the value is 0.
在上述实施例的基础上,所述人员特征获取模块32具体用于根据所述资料接收人员对所述人员特征矩阵中各特征的偏好分别进行赋值。Based on the foregoing embodiment, the personnel feature obtaining module 32 is specifically configured to respectively assign a preference to each feature in the personnel feature matrix according to the data receiving personnel.
在上述实施例的基础上,所述资料分发模块34具体用于当所述待分发资料为多个时,基于所计算出的相似度矩阵,采用最优化算法将多个待分发资料分别分发给相应的资料接收人员,以使得一份资料分发给一个人员且分发后相似度总和最大。On the basis of the foregoing embodiment, the data distribution module 34 is specifically configured to: when the plurality of materials to be distributed are multiple, use an optimization algorithm to distribute multiple to-be-distributed data to each other based on the calculated similarity matrix. Corresponding data receivers, so that a piece of information is distributed to a person and the sum of similarities is maximized after distribution.
在上述实施例的基础上,所述资料分发模块34具体用于对于将m个待分发资料分发给t个资料接收人员的情况,为每个资料接收人员固定分配k个待分发资料,其中m、t均为正整数,m大于t;Based on the foregoing embodiment, the data distribution module 34 is specifically configured to allocate, to each data receiving person, k pieces of to-be-distributed data for the case where the m pieces of data to be distributed are distributed to t data receiving personnel, where m , t is a positive integer, m is greater than t;
将相似度矩阵Cmt在列方向上变换为t×k个资料接收人员的相似度矩阵Cm(t×k)The similarity matrix C mt is transformed in the column direction into a similarity matrix C m(t×k) of t×k data receivers;
采用最优化算法将m个待分发资料分发给t×k个虚拟的资料接收人员。The m pieces of data to be distributed are distributed to t×k virtual data receivers by using an optimization algorithm.
在上述实施例的基础上,所述资料分发模块34具体用于对于将待分发m个待分发资料分发给t个资料接收人员的情况,根据每个资料接收人员的个人能力 分别分配设定个数的待分发资料,其中,第i个资料接收人员分配的待分发资料的个数记为ki,其中m、t均为正整数,m大于t;On the basis of the foregoing embodiment, the data distribution module 34 is specifically configured to allocate, according to the personal capabilities of each data receiving personnel, the distribution of the m to-be-distributed materials to be distributed to the t-receiving personnel. The number of data to be distributed, wherein the number of data to be distributed allocated by the i-th data receiver is recorded as k i , where m and t are positive integers, and m is greater than t;
将相似度矩阵Cmt在列方向上变换为
Figure PCTCN2016096850-appb-000006
个虚拟资料接收人员的相似度矩阵
Figure PCTCN2016096850-appb-000007
Transform the similarity matrix C mt in the column direction to
Figure PCTCN2016096850-appb-000006
Similarity matrix of virtual data receivers
Figure PCTCN2016096850-appb-000007
采用最优化算法将m个资料分发给
Figure PCTCN2016096850-appb-000008
个虚拟的资料接收人员。
Distribution of m data to the optimization algorithm
Figure PCTCN2016096850-appb-000008
A virtual data receiver.
在上述实施例的基础上,所述最优化算法为加权二分图最优匹配算法。Based on the foregoing embodiment, the optimization algorithm is a weighted bipartite graph optimal matching algorithm.
在上述实施例的基础上,所述相似度为余弦值、欧式距离、皮尔逊相关系数、Spearman秩相关系数、Tanimoto系数、对数似然相似度和曼哈顿距离中的任意一种。Based on the above embodiment, the similarity is any one of a cosine value, an Euclidean distance, a Pearson correlation coefficient, a Spearman rank correlation coefficient, a Tanimoto coefficient, a log likelihood similarity, and a Manhattan distance.
上述各实施例所述资料分发装置用于执行上述各实施例所述的资料分发方法,其技术原理和产生的技术效果类似,这里不再赘述。The data distribution device described in the above embodiments is used to perform the data distribution method described in the above embodiments, and the technical principle and the generated technical effects are similar, and are not described herein again.
实施例四Embodiment 4
本发明第四实施例提供了一种资料分发设备,包括本发明任意实施例所提供的资料分发装置,该资料分发设备可集成于移动终端、平板电脑、固定终端或服务器中。A fourth embodiment of the present invention provides a data distribution device, including the data distribution device provided by any embodiment of the present invention, which can be integrated into a mobile terminal, a tablet computer, a fixed terminal, or a server.
具体的,如图4所示,本发明实施例提供一种资料分发设备,该资料分发设备包括处理器40、存储器41、输入装置42和输出装置43;资料分发设备中处理器40的数量可以是一个或多个,图4中以一个处理器40为例;资料分发设备中的处理器40、存储器41、输入装置42和输出装置43可以通过总线或其他方式连接,图4中以通过总线连接为例。 Specifically, as shown in FIG. 4, an embodiment of the present invention provides a data distribution device, which includes a processor 40, a memory 41, an input device 42, and an output device 43. The number of processors 40 in the data distribution device may be One or more, one processor 40 is taken as an example in FIG. 4; the processor 40, the memory 41, the input device 42, and the output device 43 in the data distribution device may be connected by a bus or other means, in FIG. Connection is an example.
存储器41作为一种计算机可读存储介质,可用于存储软件程序、计算机可执行程序以及模块,如本发明实施例中的资料分发方法对应的程序指令/模块(例如,资料分发装置中的资料特征获取模块31、人员特征获取模块32、特征匹配模块33和资料分发模块34)。处理器30通过运行存储在存储器31中的软件程序、指令以及模块,从而执行资料分发设备的各种功能应用以及数据处理,即实现上述的资料分发方法。The memory 41 is used as a computer readable storage medium, and can be used for storing software programs, computer executable programs, and modules, such as program instructions/modules corresponding to the data distribution method in the embodiment of the present invention (for example, data characteristics in the data distribution device) The acquisition module 31, the personnel feature acquisition module 32, the feature matching module 33, and the material distribution module 34). The processor 30 executes various functional applications and data processing of the material distribution device by executing software programs, instructions, and modules stored in the memory 31, that is, implementing the above-described data distribution method.
存储器41可主要包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需的应用程序;存储数据区可存储根据资料分发设备的使用所创建的数据等。此外,存储器41可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件、闪存器件、或其他非易失性固态存储器件。在一些实例中,存储器41可进一步包括相对于处理器40远程设置的存储器,这些远程存储器可以通过网络连接至资料分发设备。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory 41 may mainly include a storage program area and an storage data area, wherein the storage program area may store an operating system, an application required for at least one function; the storage data area may store data created according to usage of the material distribution device, and the like. Further, the memory 41 may include a high speed random access memory, and may also include a nonvolatile memory such as at least one magnetic disk storage device, flash memory device, or other nonvolatile solid state storage device. In some examples, memory 41 may further include memory remotely located relative to processor 40, which may be connected to the data distribution device over a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
输入装置42可用于接收输入的数字或字符信息,以及产生与资料分发设备的用户设置以及功能控制有关的键信号输入。输出装置43可包括显示屏等显示设备。Input device 42 can be used to receive input numeric or character information, as well as to generate key signal inputs related to user settings and function control of the data distribution device. The output device 43 may include a display device such as a display screen.
本发明实施例还提供一种包含计算机可执行指令的存储介质,所述计算机可执行指令在由计算机处理器执行时用于执行一种资料分发方法,该方法包括:Embodiments of the present invention also provide a storage medium including computer executable instructions for executing a data distribution method when executed by a computer processor, the method comprising:
获取一个或多个待分发资料的资料特征信息;Obtaining data feature information of one or more materials to be distributed;
获取一个或多个资料接收人员的人员特征信息; Obtaining personnel characteristic information of one or more data receiving personnel;
将所述资料特征信息与所述人员特征信息进行匹配;Matching the material feature information with the person feature information;
按照匹配程度将所述待分发资料分发给相应的资料接收人员。The to-be-distributed materials are distributed to corresponding data recipients according to the degree of matching.
通过以上关于实施方式的描述,所属领域的技术人员可以清楚地了解到,本发明可借助软件及必需的通用硬件来实现,当然也可以通过硬件实现,但很多情况下前者是更佳的实施方式。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品可以存储在计算机可读存储介质中,如计算机的软盘、只读存储器(Read-Only Memory,ROM)、随机存取存储器(Random Access Memory,RAM)、闪存(FLASH)、硬盘或光盘等,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本发明各个实施例所述的方法。Through the above description of the embodiments, those skilled in the art can clearly understand that the present invention can be implemented by software and necessary general hardware, and can also be implemented by hardware, but in many cases, the former is a better implementation. . Based on such understanding, the technical solution of the present invention, which is essential or contributes to the prior art, may be embodied in the form of a software product, which may be stored in a computer readable storage medium, such as a floppy disk of a computer. , Read-Only Memory (ROM), Random Access Memory (RAM), Flash (FLASH), hard disk or optical disk, etc., including a number of instructions to make a computer device (can be a personal computer) The server, or network device, etc.) performs the methods described in various embodiments of the present invention.
值得注意的是,上述资料分发装置的实施例中,所包括的各个单元和模块只是按照功能逻辑进行划分的,但并不局限于上述的划分,只要能够实现相应的功能即可;另外,各功能单元的具体名称也只是为了便于相互区分,并不用于限制本发明的保护范围。It should be noted that, in the embodiment of the foregoing data distribution device, each unit and module included in the data distribution device is divided according to functional logic, but is not limited to the above division, as long as the corresponding function can be implemented; The specific names of the functional units are also for convenience of distinguishing from each other and are not intended to limit the scope of the present invention.
以上所述,仅为本发明的具体实施方式,但本发明的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本发明披露的技术范围内,可轻易想到的变化或替换,都应涵盖在本发明的保护范围之内。因此,本发明的保护范围应以所述权利要求的保护范围为准。 The above is only a specific embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily think of changes or substitutions within the technical scope of the present disclosure. All should be covered by the scope of the present invention. Therefore, the scope of the invention should be determined by the scope of the appended claims.

Claims (22)

  1. 一种资料分发方法,其特征在于,包括:A data distribution method, comprising:
    获取一个或多个待分发资料的资料特征信息;Obtaining data feature information of one or more materials to be distributed;
    获取一个或多个资料接收人员的人员特征信息;Obtaining personnel characteristic information of one or more data receiving personnel;
    将所述资料特征信息与所述人员特征信息进行匹配;Matching the material feature information with the person feature information;
    按照匹配程度将所述待分发资料分发给相应的资料接收人员。The to-be-distributed materials are distributed to corresponding data recipients according to the degree of matching.
  2. 根据权利要求1所述的方法,其特征在于:The method of claim 1 wherein:
    所述资料特征信息和人员特征信息分别为资料特征矩阵和人员特征矩阵,其中每个待分发资料和每个资料接收人员所具有的特征维度相同;The data feature information and the person feature information are respectively a data feature matrix and a personnel feature matrix, wherein each of the to-be-distributed materials and each data receiving person has the same feature dimension;
    相应的,将所述资料特征信息与所述人员特征信息进行匹配,包括:Correspondingly, matching the data feature information with the personnel feature information includes:
    计算所述资料特征矩阵与人员特征矩阵的相似度矩阵,将所述相似度矩阵作为匹配程度。A similarity matrix of the data feature matrix and the person feature matrix is calculated, and the similarity matrix is used as a matching degree.
  3. 根据权利要求2所述的方法,其特征在于:The method of claim 2 wherein:
    所述资料特征矩阵通过对待分发资料进行特征建模获得。The data feature matrix is obtained by performing feature modeling on the data to be distributed.
  4. 根据权利要求3所述的方法,其特征在于,获取待分发资料的资料特征信息,包括:The method according to claim 3, wherein the obtaining the data feature information of the data to be distributed comprises:
    解析待分发资料;Parsing the information to be distributed;
    如果包含所述资料特征矩阵中对应的特征,则将所述资料特征矩阵中对应的特征值赋为1,否则赋值为0。If the corresponding feature in the data feature matrix is included, the corresponding feature value in the data feature matrix is assigned to 1, otherwise the value is 0.
  5. 根据权利要求2所述的方法,其特征在于,获取资料接收人员的人员特征信息包括:The method according to claim 2, wherein the obtaining the personnel characteristic information of the data receiving person comprises:
    根据所述资料接收人员对所述人员特征矩阵中各特征的偏好分别进行赋值。According to the data receiving personnel, the preferences of the features in the personnel feature matrix are respectively assigned.
  6. 根据权利要求2所述的方法,其特征在于,按照匹配程度将所述待分发 资料分发给相应的资料接收人员,包括:The method according to claim 2, wherein said to be distributed is matched according to the degree of matching The information is distributed to the appropriate recipients of the data, including:
    当所述待分发资料为多个时,基于所计算出的相似度矩阵,采用最优化算法将多个待分发资料分别分发给相应的资料接收人员,以使得一份资料分发给一个人员且分发后相似度总和最大。When the plurality of materials to be distributed are multiple, based on the calculated similarity matrix, the plurality of to-be-distributed materials are separately distributed to the corresponding data receiving personnel by using an optimization algorithm, so that one piece of information is distributed to one person and distributed. The sum of the similarities is the largest.
  7. 根据权利要求6所述的方法,其特征在于,基于所计算出的相似度矩阵,采用最优化算法将多个待分发资料分别分发给相应的资料接收人员,包括:The method according to claim 6, wherein, based on the calculated similarity matrix, the plurality of to-be-distributed materials are separately distributed to the corresponding data receiving personnel by using an optimization algorithm, including:
    对于将m个待分发资料分发给t个资料接收人员的情况,为每个资料接收人员固定分配k个待分发资料,其中m、t均为正整数,m大于t;For the case of distributing m pieces of data to be distributed to t data receivers, each data receiver is fixedly allocated k pieces of data to be distributed, where m and t are positive integers, and m is greater than t;
    将相似度矩阵Cmt在列方向上变换为t×k个资料接收人员的相似度矩阵Cm(t×k)The similarity matrix C mt is transformed in the column direction into a similarity matrix C m(t×k) of t×k data receivers;
    采用最优化算法将m个待分发资料分发给t×k个虚拟的资料接收人员。The m pieces of data to be distributed are distributed to t×k virtual data receivers by using an optimization algorithm.
  8. 根据权利要求6所述的方法,其特征在于,基于所计算出的相似度矩阵,采用最优化算法将多个待分发资料分别分发给相应的资料接收人员,包括:The method according to claim 6, wherein, based on the calculated similarity matrix, the plurality of to-be-distributed materials are separately distributed to the corresponding data receiving personnel by using an optimization algorithm, including:
    对于将待分发m个待分发资料分发给t个资料接收人员的情况,根据每个资料接收人员的个人能力分别分配设定个数的待分发资料,其中,第i个资料接收人员分配的待分发资料的个数记为ki,其中m、t均为正整数,m大于t;For the case where the to-be-distributed materials to be distributed are distributed to t data receiving personnel, a set number of to-be-distributed materials are respectively allocated according to the individual capabilities of each data receiving person, wherein the i- th data receiving personnel are allocated The number of distribution materials is recorded as k i , where m and t are positive integers, and m is greater than t;
    将所述相似度矩阵Cmt在列方向上变换为
    Figure PCTCN2016096850-appb-100001
    个虚拟资料接收人员的相似度矩阵
    Figure PCTCN2016096850-appb-100002
    Converting the similarity matrix C mt in the column direction to
    Figure PCTCN2016096850-appb-100001
    Similarity matrix of virtual data receivers
    Figure PCTCN2016096850-appb-100002
    采用最优化算法将m个资料分发给
    Figure PCTCN2016096850-appb-100003
    个虚拟的资料接收人员。
    Distribution of m data to the optimization algorithm
    Figure PCTCN2016096850-appb-100003
    A virtual data receiver.
  9. 根据权利要求7或8所述的方法,其特征在于,所述最优化算法为加权二分图最优匹配算法。 The method according to claim 7 or 8, wherein the optimization algorithm is a weighted bipartite graph optimal matching algorithm.
  10. 根据权利要求2~8任一项所述的方法,其特征在于,所述相似度为余弦值、欧式距离、皮尔逊相关系数、Spearman秩相关系数、Tanimoto系数、对数似然相似度和曼哈顿距离中的任意一种。The method according to any one of claims 2 to 8, wherein the similarity is a cosine value, an Euclidean distance, a Pearson correlation coefficient, a Spearman rank correlation coefficient, a Tanimoto coefficient, a log likelihood similarity, and Manhattan Any of the distances.
  11. 一种资料分发装置,其特征在于,包括:A data distribution device, comprising:
    资料特征获取模块,用于获取一个或多个待分发资料的资料特征信息;a data feature acquiring module, configured to acquire data feature information of one or more materials to be distributed;
    人员特征获取模块,用于获取一个或多个资料接收人员的人员特征信息;a personnel feature obtaining module, configured to acquire personnel characteristic information of one or more data receiving personnel;
    特征匹配模块,用于将所述资料特征信息与所述人员特征信息进行匹配;a feature matching module, configured to match the data feature information with the personnel feature information;
    资料分发模块,用于按照匹配程度将所述待分发资料分发给相应的资料接收人员。And a data distribution module, configured to distribute the to-be-distributed data to a corresponding data receiving person according to a matching degree.
  12. 根据权利要求11所述的装置,其特征在于:The device of claim 11 wherein:
    所述资料特征信息和人员特征信息分别为资料特征矩阵和人员特征矩阵,其中每个待分发资料和每个资料接收人员所具有的特征维度相同;The data feature information and the person feature information are respectively a data feature matrix and a personnel feature matrix, wherein each of the to-be-distributed materials and each data receiving person has the same feature dimension;
    相应的,所述特征匹配模块具体用于:Correspondingly, the feature matching module is specifically configured to:
    计算所述资料特征矩阵与人员特征矩阵的相似度矩阵,将所述相似度矩阵作为匹配程度。A similarity matrix of the data feature matrix and the person feature matrix is calculated, and the similarity matrix is used as a matching degree.
  13. 根据权利要求12所述的装置,其特征在于:The device of claim 12 wherein:
    所述资料特征获取模块用于通过对待分发资料进行特征建模来获得资料特征矩阵。The data feature acquisition module is configured to obtain a data feature matrix by performing feature modeling on the data to be distributed.
  14. 根据权利要求12所述的装置,其特征在于,所述资料特征获取模块具体用于:The device according to claim 12, wherein the data feature acquisition module is specifically configured to:
    解析待分发资料;Parsing the information to be distributed;
    如果包含所述资料特征矩阵中对应的特征,则将所述资料特征矩阵中对应 的特征值赋为1,否则赋值为0。If the corresponding feature in the data feature matrix is included, the data feature matrix is correspondingly The eigenvalue is assigned to 1, otherwise it is assigned a value of 0.
  15. 根据权利要求12所述的装置,其特征在于,所述人员特征获取模块具体用于:The device according to claim 12, wherein the personnel feature acquisition module is specifically configured to:
    根据所述资料接收人员对所述人员特征矩阵中各特征的偏好分别进行赋值。According to the data receiving personnel, the preferences of the features in the personnel feature matrix are respectively assigned.
  16. 根据权利要求12所述的装置,其特征在于,所述资料分发模块具体用于:The device according to claim 12, wherein the data distribution module is specifically configured to:
    当所述待分发资料为多个时,基于所计算出的相似度矩阵,采用最优化算法将多个待分发资料分别分发给相应的资料接收人员,以使得一份资料分发给一个人员且分发后相似度总和最大。When the plurality of materials to be distributed are multiple, based on the calculated similarity matrix, the plurality of to-be-distributed materials are separately distributed to the corresponding data receiving personnel by using an optimization algorithm, so that one piece of information is distributed to one person and distributed. The sum of the similarities is the largest.
  17. 根据权利要求16所述的装置,其特征在于,所述资料分发模块具体用于:The device according to claim 16, wherein the data distribution module is specifically configured to:
    对于将m个待分发资料分发给t个资料接收人员的情况,为每个资料接收人员固定分配k个待分发资料,其中m、t均为正整数,m大于t;For the case of distributing m pieces of data to be distributed to t data receivers, each data receiver is fixedly allocated k pieces of data to be distributed, where m and t are positive integers, and m is greater than t;
    将相似度矩阵Cmt在列方向上变换为t×k个资料接收人员的相似度矩阵Cm(t×k)The similarity matrix C mt is transformed in the column direction into a similarity matrix C m(t×k) of t×k data receivers;
    采用最优化算法将m个待分发资料分发给t×k个虚拟的资料接收人员。The m pieces of data to be distributed are distributed to t×k virtual data receivers by using an optimization algorithm.
  18. 根据权利要求16所述的装置,其特征在于,所述资料分发模块具体用于:The device according to claim 16, wherein the data distribution module is specifically configured to:
    对于将待分发m个待分发资料分发给t个资料接收人员的情况,根据每个资料接收人员的个人能力分别分配设定个数的待分发资料,其中,第i个资料接收人员分配的待分发资料的个数记为ki,其中m、t均为正整数,m大于t;For the case where the to-be-distributed materials to be distributed are distributed to t data receiving personnel, a set number of to-be-distributed materials are respectively allocated according to the individual capabilities of each data receiving person, wherein the i-th data receiving personnel are allocated The number of distribution materials is recorded as k i , where m and t are positive integers, and m is greater than t;
    将相似度矩阵Cmt在列方向上变换为
    Figure PCTCN2016096850-appb-100004
    个虚拟资料接收人员的相似度矩 阵
    Figure PCTCN2016096850-appb-100005
    Transform the similarity matrix C mt in the column direction to
    Figure PCTCN2016096850-appb-100004
    Similarity matrix of virtual data receivers
    Figure PCTCN2016096850-appb-100005
    采用最优化算法将m个资料分发给
    Figure PCTCN2016096850-appb-100006
    个虚拟的资料接收人员。
    Distribution of m data to the optimization algorithm
    Figure PCTCN2016096850-appb-100006
    A virtual data receiver.
  19. 根据权利要求17或18所述的装置,其特征在于,所述最优化算法为加权二分图最优匹配算法。The apparatus according to claim 17 or 18, wherein the optimization algorithm is a weighted bipartite graph optimal matching algorithm.
  20. 根据权利要求12~18任一项所述的装置,其特征在于,所述相似度为余弦值、欧式距离、皮尔逊相关系数、Spearman秩相关系数、Tanimoto系数、对数似然相似度和曼哈顿距离中的任意一种。The apparatus according to any one of claims 12 to 18, wherein the similarity is a cosine value, an Euclidean distance, a Pearson correlation coefficient, a Spearman rank correlation coefficient, a Tanimoto coefficient, a log likelihood similarity, and Manhattan Any of the distances.
  21. 一种包含计算机可执行指令的存储介质,所述计算机可执行指令在由计算机处理器执行时用于执行一种资料分发方法,其特征在于,该方法包括:A storage medium comprising computer executable instructions for performing a data distribution method when executed by a computer processor, the method comprising:
    获取一个或多个待分发资料的资料特征信息;Obtaining data feature information of one or more materials to be distributed;
    获取一个或多个资料接收人员的人员特征信息;Obtaining personnel characteristic information of one or more data receiving personnel;
    将所述资料特征信息与所述人员特征信息进行匹配;Matching the material feature information with the person feature information;
    按照匹配程度将所述待分发资料分发给相应的资料接收人员。The to-be-distributed materials are distributed to corresponding data recipients according to the degree of matching.
  22. 一种资料分发设备,其特征在于,包括:A data distribution device, comprising:
    一个或者多个处理器;One or more processors;
    存储器,存储有一个或多个程序;a memory that stores one or more programs;
    当所述一个或多个程序被所述一个或者多个处理器执行时,使得所述一个或多个处理器执行如下操作:When the one or more programs are executed by the one or more processors, causing the one or more processors to perform the following operations:
    获取一个或多个待分发资料的资料特征信息;Obtaining data feature information of one or more materials to be distributed;
    获取一个或多个资料接收人员的人员特征信息;Obtaining personnel characteristic information of one or more data receiving personnel;
    将所述资料特征信息与所述人员特征信息进行匹配; Matching the material feature information with the person feature information;
    按照匹配程度将所述待分发资料分发给相应的资料接收人员。 The to-be-distributed materials are distributed to corresponding data recipients according to the degree of matching.
PCT/CN2016/096850 2016-05-25 2016-08-26 Data distribution method and device, and storage medium WO2017201905A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610354138.0A CN107437182B (en) 2016-05-25 2016-05-25 Data distribution method and device
CN201610354138.0 2016-05-25

Publications (1)

Publication Number Publication Date
WO2017201905A1 true WO2017201905A1 (en) 2017-11-30

Family

ID=60410965

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/096850 WO2017201905A1 (en) 2016-05-25 2016-08-26 Data distribution method and device, and storage medium

Country Status (2)

Country Link
CN (1) CN107437182B (en)
WO (1) WO2017201905A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112163868A (en) * 2020-09-30 2021-01-01 深圳前海微众银行股份有限公司 Data processing method, device, equipment and storage medium
CN116011906A (en) * 2022-10-19 2023-04-25 中建八局第二建设有限公司 Wagon balance weighing method based on optimal matching algorithm of service flow

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112232612A (en) * 2019-06-28 2021-01-15 百度在线网络技术(北京)有限公司 Intelligent client data distribution method and device
CN112633901B (en) * 2020-12-18 2023-08-04 深圳市思为软件技术有限公司 Passenger judging method and related equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102646229A (en) * 2012-02-26 2012-08-22 云南电网公司 Fuzzy comprehensive evaluation method of adaptation between employee and position of enterprise
CN103605723A (en) * 2013-11-15 2014-02-26 南京云川信息技术有限公司 Video recommending method based on particle swarm algorithm
CN103620592A (en) * 2011-04-19 2014-03-05 诺基亚公司 Method and apparatus for flexible diversification of recommendation results

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103778145A (en) * 2012-10-22 2014-05-07 中国移动通信集团广东有限公司 Information recommendation method and system
CN103116639B (en) * 2013-02-20 2016-05-11 新浪网技术(中国)有限公司 Based on article recommend method and the system of user-article bipartite graph model
CN105005798B (en) * 2014-04-24 2018-09-04 南京理工大学 One kind is based on the similar matched target identification method of structures statistics in part
CN104182732B (en) * 2014-08-12 2017-09-26 南京师范大学 Feature based matrix carries out the hand-written Chinese character strokes confirmation method of similarity mode
CN104410913A (en) * 2014-12-01 2015-03-11 北京国双科技有限公司 Television program recommending method and device for intelligent video equipment

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103620592A (en) * 2011-04-19 2014-03-05 诺基亚公司 Method and apparatus for flexible diversification of recommendation results
CN102646229A (en) * 2012-02-26 2012-08-22 云南电网公司 Fuzzy comprehensive evaluation method of adaptation between employee and position of enterprise
CN103605723A (en) * 2013-11-15 2014-02-26 南京云川信息技术有限公司 Video recommending method based on particle swarm algorithm

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
HUANG, TAN ET AL.: "Bipartite Graph Recommendation Algorithm Based on Hybrid User Model", COMPUTER TECHNOLOGY AND DEVELOPMENT, vol. 24, no. 6, 30 June 2014 (2014-06-30) *
LI, XIA ET AL.: "Research on Collaborative Filtering Algorithm of Bipartite Network Oriented to Personal Recommendation System", APPLICATION RESEARCH OF COMPUTERS, vol. 30, no. 7, 31 July 2013 (2013-07-31) *
LIANG , QINGZHONG: "Study on Multi-Objective Scheduling Algorithms of Hybrid Cloud", ELECTRONIC TECHNOLOGY & INFORMATION SCIENCE , CHINA DOCTORAL DISSERTATIONS FULL-TEXT DATABASE, 15 January 2016 (2016-01-15), pages 88 - 90, ISSN: 1674-022X *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112163868A (en) * 2020-09-30 2021-01-01 深圳前海微众银行股份有限公司 Data processing method, device, equipment and storage medium
CN116011906A (en) * 2022-10-19 2023-04-25 中建八局第二建设有限公司 Wagon balance weighing method based on optimal matching algorithm of service flow

Also Published As

Publication number Publication date
CN107437182B (en) 2021-01-08
CN107437182A (en) 2017-12-05

Similar Documents

Publication Publication Date Title
US11710054B2 (en) Information recommendation method, apparatus, and server based on user data in an online forum
JP7343568B2 (en) Identifying and applying hyperparameters for machine learning
US11734233B2 (en) Method for classifying an unmanaged dataset
CN107451832B (en) Method and device for pushing information
CN108170692B (en) Hotspot event information processing method and device
US20160034560A1 (en) Method and system for securely storing private data in a semantic analysis system
US20160125432A1 (en) Identifying influential users of a social networking service
WO2015192667A1 (en) Advertisement recommending method and advertisement recommending server
CN111125453B (en) Opinion leader role identification method in social network based on subgraph isomorphism and storage medium
CN105893406A (en) Group user profiling method and system
WO2017201905A1 (en) Data distribution method and device, and storage medium
US9177249B2 (en) Scientometric methods for identifying emerging technologies
CN104077723B (en) A kind of social networks commending system and method
Baker Big data: A survey research perspective
US9846746B2 (en) Querying groups of users based on user attributes for social analytics
US20230037222A1 (en) Method, apparatus and computer program product for generating tiered search index fields in a group-based communication platform
US20140289268A1 (en) Systems and methods of rationing data assembly resources
CN111429161B (en) Feature extraction method, feature extraction device, storage medium and electronic equipment
Alkhamees et al. Event detection from social network streams using frequent pattern mining with dynamic support values
US9552415B2 (en) Category classification processing device and method
Kumar et al. Cluster analysis in data mining using k-means method
US20210357553A1 (en) Apparatus and method for option data object performance prediction and modeling
Wu et al. Bootstrap variability studies in ROC analysis on large datasets
JP5903376B2 (en) Information recommendation device, information recommendation method, and information recommendation program
Al-Dhelaan et al. Graph summarization for hashtag recommendation

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16902891

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 16902891

Country of ref document: EP

Kind code of ref document: A1