WO2023185125A1 - Product resource data processing method and apparatus, electronic device and storage medium - Google Patents

Product resource data processing method and apparatus, electronic device and storage medium Download PDF

Info

Publication number
WO2023185125A1
WO2023185125A1 PCT/CN2022/140756 CN2022140756W WO2023185125A1 WO 2023185125 A1 WO2023185125 A1 WO 2023185125A1 CN 2022140756 W CN2022140756 W CN 2022140756W WO 2023185125 A1 WO2023185125 A1 WO 2023185125A1
Authority
WO
WIPO (PCT)
Prior art keywords
target
resources
features
product
product resources
Prior art date
Application number
PCT/CN2022/140756
Other languages
French (fr)
Chinese (zh)
Inventor
刘瀚文
Original Assignee
富途网络科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 富途网络科技(深圳)有限公司 filed Critical 富途网络科技(深圳)有限公司
Publication of WO2023185125A1 publication Critical patent/WO2023185125A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/04Trading; Exchange, e.g. stocks, commodities, derivatives or currency exchange
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/243Classification techniques relating to the number of classes
    • G06F18/24323Tree-organised classifiers
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Abstract

Disclosed in the present application are a product resource data processing method and apparatus, an electronic device and a storage medium. The method comprises: extracting original features from original data of target product resources, and performing feature construction processing on the basis of the original features to obtain target features; determining a predicted income probability value of the target product resources on the basis of the target features; determining a trend of the target product resources on the basis of the predicted income probability value of the target product resources and the predicted income probability value of other product resources; and obtaining target interpretation information of the target product resources on the basis of the trend and original interpretation information of the target product resources.

Description

产品资源的数据处理方法及装置、电子设备、存储介质Data processing methods and devices, electronic equipment, and storage media for product resources
相关申请的交叉引用Cross-references to related applications
本申请要求于2022年4月2日提交的,申请名称为“产品资源的数据处理方法及装置、电子设备、存储介质”的、中国专利申请号为“202210353762.4”的优先权,该中国专利申请的全部内容通过引用结合在本申请中。This application requests the priority of the Chinese patent application number "202210353762.4", which was submitted on April 2, 2022 and is titled "Data processing method and device, electronic equipment, storage medium for product resources". This Chinese patent application The entire contents of are incorporated herein by reference.
技术领域Technical field
本申请涉及人工智能技术领域,具体而言,涉及一种产品资源的数据处理方法及装置、电子设备、计算机可读存储介质。This application relates to the field of artificial intelligence technology, specifically, to a data processing method and device for product resources, electronic equipment, and computer-readable storage media.
背景技术Background technique
现在人们对于产品资源的关注越来越多,产品资源通常在相应的应用程序中进行展示,在对应的应用程序中,设置有相应的页面用于展示产品资源相关的一些数据,用户可以通过这些数据获取到相关的信息,较为专业的用户可以从展示的这些数据中提炼出个人对该产品资源的理解,从而对该产品资源做出相应的决策,同时,对于专业用户的个人能力要求较高;但是对于大部分非专业用户来说,他们不能有效利用这些数据,进而在有利的时间点做出相对正确的决策,导致产品资源的信息量和信息传递效率较低的问题。Nowadays, people are paying more and more attention to product resources. Product resources are usually displayed in corresponding applications. In the corresponding applications, corresponding pages are set up to display some data related to product resources. Users can use these Relevant information is obtained from the data. More professional users can extract their personal understanding of the product resources from the displayed data, thereby making corresponding decisions about the product resources. At the same time, professional users have higher requirements for their personal abilities. ; However, for most non-professional users, they cannot effectively use this data to make relatively correct decisions at a favorable time point, resulting in low information volume and information transmission efficiency of product resources.
技术解决方案Technical solutions
为解决上述技术问题,本申请的实施例提供了一种产品资源的数据处理方法及装置、电子设备、计算机可读存储介质,旨在解决在对产品资源做出决策时,产品资源的信息量和信息传递效率较低的问题。In order to solve the above technical problems, embodiments of the present application provide a data processing method and device for product resources, electronic equipment, and computer-readable storage media, aiming to solve the problem of the amount of information on product resources when making decisions about product resources. and low efficiency of information transmission.
本申请的其他特性和优点将通过下面的详细描述变得显然,或部分地通过本申请的实践而习得。Additional features and advantages of the invention will be apparent from the detailed description which follows, or, in part, may be learned by practice of the invention.
根据本申请实施例的一个方面,提供了一种产品资源的数据处理方法,包括:According to one aspect of the embodiment of the present application, a data processing method for product resources is provided, including:
从目标产品资源的原始数据中提取原始特征,并基于原始特征进行特征构建处理,得到目标特征;Extract original features from the original data of the target product resources, and perform feature construction processing based on the original features to obtain the target features;
基于目标特征确定目标产品资源的预测收益概率值;Determine the predicted revenue probability value of the target product resources based on the target characteristics;
基于目标产品资源的预测收益概率值以及其他产品资源的预测收益概率值,确定目标产品资源的走向趋势;Based on the predicted profit probability value of the target product resource and the predicted profit probability value of other product resources, determine the trend of the target product resource;
基于走向趋势以及目标产品资源的原始解读信息,得到目标产品资源的目标解读信息。Based on the trend and the original interpretation information of the target product resources, the target interpretation information of the target product resources is obtained.
根据本申请实施例的一个方面,提供了一种产品资源的数据处理装置,包括:According to one aspect of the embodiment of the present application, a data processing device for product resources is provided, including:
提取模块,配置为从目标产品资源的原始数据中提取原始特征,并基于原始特征进行特征构建处理,得到目标特征;The extraction module is configured to extract original features from the original data of the target product resources, and perform feature construction processing based on the original features to obtain the target features;
第一确定模块,配置为基于目标特征确定目标产品资源的预测收益概率值;The first determination module is configured to determine the predicted revenue probability value of the target product resource based on the target characteristics;
第二确定模块,配置为基于目标产品资源的预测收益概率值以及其他产品资源的预测收益概率值,确定目标产品资源的走向趋势;The second determination module is configured to determine the trend of the target product resource based on the predicted profit probability value of the target product resource and the predicted profit probability value of other product resources;
目标解读信息模块,配置为基于走向趋势以及目标产品资源的原始解读信息,得到目标产品资源的目标解读信息。The target interpretation information module is configured to obtain the target interpretation information of the target product resources based on the trend and the original interpretation information of the target product resources.
根据本申请实施例的一个方面,提供了一种电子设备,包括:一个或多个处理器;存储装置,用于存储一个或多个程序,当一个或多个程序被一个或多个处理器执行时,使得电子设备实现如前的产品资源的数据处理方法。According to an aspect of an embodiment of the present application, an electronic device is provided, including: one or more processors; a storage device for storing one or more programs. When one or more programs are processed by one or more processors, When executed, the electronic device is caused to implement the previous data processing method of product resources.
根据本申请实施例的一个方面,提供了一种计算机可读存储介质,其上存储有计算机可读指令,当计算机可读指令被计算机的处理器执行时,使计算机执行如上的产品资源的数据处理方法。According to one aspect of the embodiment of the present application, a computer-readable storage medium is provided, on which computer-readable instructions are stored. When the computer-readable instructions are executed by a processor of a computer, the computer is caused to execute the above product resource data. Approach.
根据本申请实施例的一个方面,提供了一种计算机程序产品或计算机程序,该计算机程序产品或计算机程序包括计算机指令,该计算机指令存储在计算机可读存储介质中。计算机设备的处理器从计算机可读存储介质读取该计算机指令,处理器执行该计算机指令,使得该计算机设备执行上述各种可选实施例中提供的产品资源的数据处理方法。According to an aspect of an embodiment of the present application, a computer program product or a computer program is provided, the computer program product or the computer program including computer instructions, and the computer instructions are stored in a computer-readable storage medium. The processor of the computer device reads the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions, so that the computer device performs the data processing method of product resources provided in the various optional embodiments described above.
在本申请的实施例所提供的技术方案中,对目标产品资源的原始特征进行特征构建,得到目标特征,目标特征相对于原始特征而言,信息量更多,根据目标特征去确定目标产品资源的预测收益概率值能够更加准确;同时,还根据预设收益概率值得到走向趋势,基于走向趋势与原始解读信息得到目标解读信息,用户通过查看目标解读信息了解到目标产品资源在未来的走向,在对目标产品资源做相应的决策时,即可结合目标解读信息进行,而不用再依靠人为经验去查看目标产品资源的数据信息后再归纳总结做出决策,在提高信息的转换率和接收率的同时,提升了有效信息的信息量,为用户带来诸多便利,使得用户使用体验满意度高,并且确定出的目标解读信息也更为准确,为用户基于目标解读信息对目标产品资源做相应的决策提供了有力准确的支持。In the technical solution provided by the embodiment of the present application, the original features of the target product resources are constructed to obtain the target features. The target features have more information than the original features. The target product resources are determined based on the target features. The predicted revenue probability value can be more accurate; at the same time, the trend is also obtained based on the preset revenue probability value, and the target interpretation information is obtained based on the trend trend and the original interpretation information. The user understands the future direction of the target product resources by viewing the target interpretation information. When making corresponding decisions about target product resources, you can combine the target interpretation information, instead of relying on human experience to view the data information of target product resources and then summarize and make decisions, which will improve the conversion rate and reception rate of information. At the same time, it increases the amount of effective information, brings a lot of convenience to users, makes users more satisfied with the user experience, and determines the target interpretation information more accurately, allowing users to respond to target product resources based on the target interpretation information. Provide strong and accurate support for decision-making.
应当理解的是,以上的一般描述和后文的细节描述仅是示例性和解释性的,并不能限制本申请。It should be understood that the above general description and the following detailed description are only exemplary and explanatory, and do not limit the present application.
附图说明Description of drawings
此处的附图被并入说明书中并构成本说明书的一部分,示出了符合本申请的实施例, 并与说明书一起用于解释本申请的原理。显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术者来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。在附图中:The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the application and together with the description, serve to explain the principles of the application. Obviously, the drawings in the following description are only some embodiments of the present application. For those of ordinary skill in the art, other drawings can be obtained based on these drawings without exerting creative efforts. In the attached picture:
图1是本申请涉及的一种实施环境的示意图;Figure 1 is a schematic diagram of an implementation environment involved in this application;
图2是本申请涉及的一个实施例中的产品资源的数据处理方法的流程图;Figure 2 is a flow chart of a data processing method for product resources in one embodiment of this application;
图3是本申请涉及的一个实施例中步骤S210的流程图;Figure 3 is a flow chart of step S210 in one embodiment of the present application;
图4是本申请涉及的另一个实施例中的产品资源的数据处理方法的流程图;Figure 4 is a flow chart of a data processing method for product resources in another embodiment of the present application;
图5是本申请涉及的一个实施例中步骤S440之后还包括的流程图;Figure 5 is a flow chart also included after step S440 in an embodiment of the present application;
图6是本申请涉及的一个实施例中步骤S420之前还包括的流程图;Figure 6 is a flow chart also included before step S420 in one embodiment of the present application;
图7是本申请涉及的一个实施例中步骤S620的流程图;Figure 7 is a flow chart of step S620 in one embodiment of the present application;
图8是本申请涉及的一个实施例中步骤S240的流程图;Figure 8 is a flow chart of step S240 in one embodiment of the present application;
图9是本申请涉及的另一个实施例中的产品资源的数据处理方法的流程图;Figure 9 is a flow chart of a data processing method for product resources in another embodiment of the present application;
图10是本申请涉及的一种产品资源的数据处理装置的框图;Figure 10 is a block diagram of a data processing device for product resources involved in this application;
图11是适于用来实现本申请实施例的电子设备的计算机系统的结构示意图。FIG. 11 is a schematic structural diagram of a computer system suitable for implementing an electronic device according to an embodiment of the present application.
具体实施方式Detailed ways
这里将详细地对示例性实施例执行说明,其示例表示在附图中。下面的描述涉及附图时,除非另有表示,不同附图中的相同数字表示相同或相似的要素。以下示例性实施例中所描述的实施方式并不代表与本申请相一致的所有实施方式。相反,它们仅是与如所附权利要求书中所详述的、本申请的一些方面相一致的装置和方法的例子。Reference will now be made in detail to exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the drawings, the same numbers in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with this application. Rather, they are merely examples of apparatus and methods consistent with aspects of the application as detailed in the appended claims.
附图中所示的方框图仅仅是功能实体,不一定必须与物理上独立的实体相对应。即,可以采用软件形式来实现这些功能实体,或在一个或多个硬件模块或集成电路中实现这些功能实体,或在不同网络和/或处理器装置和/或微控制器装置中实现这些功能实体。The block diagrams shown in the figures are functional entities only and do not necessarily correspond to physically separate entities. That is, these functional entities may be implemented in software form, or implemented in one or more hardware modules or integrated circuits, or implemented in different networks and/or processor devices and/or microcontroller devices. entity.
附图中所示的流程图仅是示例性说明,不是必须包括所有的内容和操作/步骤,也不是必须按所描述的顺序执行。例如,有的操作/步骤还可以分解,而有的操作/步骤可以合并或部分合并,因此实际执行的顺序有可能根据实际情况改变。The flowcharts shown in the drawings are only illustrative, and do not necessarily include all contents and operations/steps, nor must they be performed in the order described. For example, some operations/steps can be decomposed, and some operations/steps can be merged or partially merged, so the actual order of execution may change according to the actual situation.
还需要说明的是:在本申请中提及的“多个”是指两个或者两个以上。“和/或”描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。字符“/”一般表示前后关联对象是一种“或”的关系。It should also be noted that the “multiple” mentioned in this application refers to two or more. "And/or" describes the relationship between related objects, indicating that there can be three relationships. For example, A and/or B can mean: A exists alone, A and B exist simultaneously, and B exists alone. The character "/" generally indicates that the related objects are in an "or" relationship.
请参阅图1,图1是本申请涉及的一种实施环境的示意图。该实施环境包括终端110和服务器120,终端110和服务器120之间通过有线或者无线网络进行通信。Please refer to Figure 1, which is a schematic diagram of an implementation environment involved in this application. The implementation environment includes a terminal 110 and a server 120. The terminal 110 and the server 120 communicate through a wired or wireless network.
终端110中运行有产品资源相关的应用程序,用户可以在应用程序上查看相关的产品资源或对产品资源做出相应的操作。具体的,应用程序上设置有相应的页面用于展示产品 资源的各种数据,同时设置有相应的解读页面用于展示产品资源经过数据处理方法处理后得到的目标解读信息。其中,终端110可以是智能手机、平板、笔记本电脑、计算机等任意能够运行产品资源相关的应用程序的电子设备。The terminal 110 runs an application program related to product resources, and the user can view related product resources or perform corresponding operations on the product resources on the application program. Specifically, the application is provided with corresponding pages for displaying various data of product resources, and at the same time, corresponding interpretation pages are provided for displaying the target interpretation information obtained after the product resources are processed by the data processing method. The terminal 110 may be a smartphone, a tablet, a laptop, a computer, or any other electronic device that can run applications related to product resources.
服务器120中存储有大量的产品资源的相关数据,终端在需要相关数据时,从服务器中进行获取。服务器120可以是独立的物理服务器,也可以是多个物理服务器构成的服务器集群或者分布式系统,还可以是提供云服务、云数据库、云计算、云函数、云存储、网络服务、云通信、中间件服务、域名服务、安全服务、CDN(Content Delivery Network,内容分发网络)以及大数据和人工智能平台等基础云计算服务的云服务器,本处不对此进行限制。The server 120 stores a large amount of product resource related data, and the terminal obtains the related data from the server when it needs it. Server 120 may be an independent physical server, or a server cluster or distributed system composed of multiple physical servers, or may provide cloud services, cloud databases, cloud computing, cloud functions, cloud storage, network services, cloud communications, Cloud servers for basic cloud computing services such as middleware services, domain name services, security services, CDN (Content Delivery Network, content distribution network), and big data and artificial intelligence platforms are not restricted here.
需要说明的是,本申请实施例中,产品资源包括但不限于虚拟产品资源,例如股票、期货、期权、证券、虚拟货币、基金或者外汇等。It should be noted that in the embodiment of this application, product resources include but are not limited to virtual product resources, such as stocks, futures, options, securities, virtual currency, funds, or foreign exchange, etc.
本申请实施例提供的产品资源的数据处理方法由图1所示实施例环境中的终端110具体执行,本申请提供的产品资源的数据处理方法对目标产品资源的原始特征进行特征构建,得到目标特征,目标特征相对于原始特征而言,信息量更多,根据目标特征去确定目标产品资源的预测收益概率值能够更加准确;同时,还根据预设收益概率值得到走向趋势,基于走向趋势与原始解读信息得到目标解读信息,用户通过查看目标解读信息了解到目标产品资源在未来的走向,在对目标产品资源做相应的决策时,即可结合目标解读信息进行,而不用再依靠人为经验去查看目标产品资源的数据信息后,再归纳总结做出决策,为用户带来诸多便利,使得用户使用体验满意度高,并且确定出的目标解读信息也更为准确,为用户基于目标解读信息对目标产品资源做相应的决策提供了有力准确的支持。The data processing method for product resources provided by the embodiment of this application is specifically executed by the terminal 110 in the embodiment environment shown in Figure 1. The data processing method of product resources provided by this application constructs features of the original characteristics of the target product resource to obtain the target Features, target features have more information than original features, and it can be more accurate to determine the predicted profit probability value of target product resources based on target features; at the same time, the trend is also obtained based on the preset profit probability value, based on the trend and The target interpretation information is obtained from the original interpretation information. The user understands the future direction of the target product resources by viewing the target interpretation information. When making corresponding decisions about the target product resources, the target interpretation information can be combined with the target interpretation information without relying on human experience. After viewing the data information of the target product resources, we can summarize and make decisions, which brings a lot of convenience to users, making the user experience more satisfying, and the determined target interpretation information is also more accurate, providing users with more accurate information based on the target interpretation information. It provides strong and accurate support for target product resources to make corresponding decisions.
图2是根据一示例性实施例示出的一种产品资源的数据处理方法的流程图,该方法可以应用于图1所示的实施环境。如图2所示,在一示例性实施例中,该产品资源的数据处理方法可以包括步骤S210至步骤S240,详细介绍如下:FIG. 2 is a flow chart of a data processing method for product resources according to an exemplary embodiment. The method can be applied to the implementation environment shown in FIG. 1 . As shown in Figure 2, in an exemplary embodiment, the data processing method of the product resource may include steps S210 to S240, which are described in detail as follows:
步骤S210,从目标产品资源的原始数据中提取原始特征,并基于原始特征进行特征构建处理,得到目标特征。Step S210: Extract original features from the original data of the target product resources, and perform feature construction processing based on the original features to obtain the target features.
本申请实施例中,目标产品资源在应用程序中进行展示时,设置有相应的详情页面,详情页面上展示有该目标产品资源不同维度的数据,这些不同维度的数据则为目标产品资源的原始数据,原始数据中包括有大量的数据,但有些数据是无意义的,因此,需要从原始数据中提取能够用于预测未来是否获利的原始特征。In the embodiment of this application, when the target product resources are displayed in the application, a corresponding details page is set. The details page displays data of the target product resources in different dimensions. These data in different dimensions are the original data of the target product resources. Data, the original data includes a large amount of data, but some data is meaningless. Therefore, it is necessary to extract original features from the original data that can be used to predict future profits.
但为了更好的保留原始数据的信息量,更好的便于后续根据原始数据去对目标产品资源进行解读,需要对原始特征进行特征构造处理,得到相应的目标特征。特征构造指的是从原始特征中构造新特征的处理过程,生成能更好体现业务特性的新特征,这些新特征要 与想要得到的预测收益概率值关系紧密。However, in order to better retain the information content of the original data and facilitate the subsequent interpretation of target product resources based on the original data, it is necessary to perform feature construction processing on the original features to obtain the corresponding target features. Feature construction refers to the process of constructing new features from original features to generate new features that can better reflect business characteristics. These new features must be closely related to the desired predicted revenue probability value.
在本申请的一个实施例中,请参阅图3,原始特征的数量为多个;在步骤S210中基于原始特征进行特征构建处理,得到目标特征,包括步骤S310至S330,详细介绍如下:In one embodiment of the present application, please refer to Figure 3. The number of original features is multiple; in step S210, feature construction processing is performed based on the original features to obtain the target features, including steps S310 to S330. The details are as follows:
步骤S310,对多个原始特征进行特征交叉处理,得到交叉特征。Step S310: Perform feature intersection processing on multiple original features to obtain intersection features.
本申请实施例中,对各个原始特征进行特征交叉处理,即可在多维特征数据集上,进行很好的非线性特征拟合。例如,特征A有三个属性(A1,A2,A3),特征B有两个属性(B1,B2),采用特征A的属性对特征进行交叉,可以得到6个新的特征(A1,B2)、(A2,B2)、(A3,B2)、(B1,A1)、(B1,A2)和(B1,A3)。In the embodiment of the present application, feature cross processing is performed on each original feature, so that good nonlinear feature fitting can be performed on the multi-dimensional feature data set. For example, feature A has three attributes (A1, A2, A3), feature B has two attributes (B1, B2), and the attributes of feature A are used to cross the features, and 6 new features (A1, B2), (A2, B2), (A3, B2), (B1, A1), (B1, A2) and (B1, A3).
在一些实施例中,在进行特征交叉处理时,可以把特征交叉处理看成数据的逻辑与操作,在进行特征交叉处理的过程中,先把原始特征进行分档处理,再把分档的结果进行特征交叉,此时可以获得更好的交叉据特征,从而极大地简化计算量。In some embodiments, when performing feature cross processing, the feature cross processing can be regarded as a logical AND operation of data. In the process of performing feature cross processing, the original features are first processed into bins, and then the binning results are processed. By performing feature intersection, better intersection data features can be obtained, thus greatly simplifying the calculation amount.
在一些实施例中,在进行特征交叉处理时,可通过计算笛卡尔积(Cartesian product)的方式进行特征交叉处理,笛卡尔积就是任意两个集合A和B,若序偶的第一个成员是A的元素,第二个成员是B的元素,所有这样的序偶集合,称为集合A和B的笛卡尔乘积或直积,记做A×B。本实施例中如果通过上述方式生成的特征对预测目标有贡献,即有比原始特征更高的线性或非线性相关性时,采纳作为新特征,即交叉特征。In some embodiments, when performing feature crossover processing, feature crossover processing can be performed by calculating a Cartesian product. The Cartesian product is any two sets A and B. If the first member of the sequence pair is an element of A, and the second member is an element of B. All such even sets are called the Cartesian product or direct product of sets A and B, denoted as A×B. In this embodiment, if the features generated through the above method contribute to the prediction target, that is, if they have a higher linear or nonlinear correlation than the original features, they are adopted as new features, that is, cross features.
步骤S320,对原始特征和交叉特征进行特征衍生处理,得到衍生特征。Step S320: Perform feature derivation processing on the original features and cross features to obtain derived features.
本申请实施例中,所谓特征衍生处理指的是通过既有数据进行新特征的创建,特征衍生有时也被称为特征创建、特征提取等。特征衍生处理有两类方法,其一是依据数据集特征进行新特征的创建,此时的特征衍生其实是一类无监督的特征衍生,例如把月度费用(MonthlyCharges)和总费用(TotalCharges)两列相加,创建新的一列;而另外一种情况是将数据集标签情况也纳入进行考虑来创建新的特征,此时特征衍生其实是有监督的特征衍生。在大多数时候特征衍生特指无监督特征衍生,而有监督的特征衍生我们会称其为目标编码。In the embodiment of this application, the so-called feature derivation process refers to the creation of new features through existing data. Feature derivation is sometimes also called feature creation, feature extraction, etc. There are two methods for feature derivation processing. One is to create new features based on the characteristics of the data set. The feature derivation at this time is actually a type of unsupervised feature derivation, such as combining monthly charges (MonthlyCharges) and total charges (TotalCharges). The columns are added to create a new column; in another case, the data set label is also taken into consideration to create a new feature. At this time, feature derivation is actually supervised feature derivation. Most of the time feature derivation specifically refers to unsupervised feature derivation, while supervised feature derivation we will call it target encoding.
特征衍生包括但不限于特征组合、特征交叉、图像特征生成、文本特征生成等。其中,特征组合具体可以通过特征两两之间的四则运算组合,逻辑与、或组合,多项式构造,特征自身与其均值作差等来实现。Feature derivation includes but is not limited to feature combination, feature intersection, image feature generation, text feature generation, etc. Among them, feature combination can be realized through the combination of four arithmetic operations between pairs of features, logical AND, or combination, polynomial construction, the difference between the feature itself and its mean, etc.
步骤S330,将原始特征、交叉特征以及衍生特征进行组合,得到目标特征。Step S330: Combine original features, cross features and derived features to obtain target features.
本申请实施例中,将原始特征、交叉特征和衍生特征组合在一起,形成目标特征,构建目标特征的原则是将人为觉得可能有意义的特征都制作加入进来,能尽可能多地保留原始数据的信息量,使得后续的预测模型能有更好的表现。根据决策树模型的特性,在对预测模型进行建模阶段,决策树模型会根据各个维度的特征带来的信息增益的不同自适应地 筛选掉对最终结果没有意义的特征。In the embodiment of this application, the original features, cross features and derived features are combined to form the target features. The principle of constructing the target features is to add all the features that people think may be meaningful, so as to retain as much of the original data as possible. The amount of information enables the subsequent prediction model to perform better. According to the characteristics of the decision tree model, during the modeling stage of the prediction model, the decision tree model will adaptively filter out features that are not meaningful to the final result based on the different information gains brought by the features of each dimension.
本实施例中,对目标产品资源的原始特征进行特征构建,得到目标特征,目标特征相对于原始特征而言,信息量中的噪声较少,即有效特征的比例增加,根据目标特征去确定目标产品资源的预测收益概率值能够更加准确。In this embodiment, feature construction is performed on the original features of the target product resources to obtain the target features. Compared with the original features, the target features have less noise in the amount of information, that is, the proportion of effective features increases. The target is determined based on the target features. The predicted revenue probability value of product resources can be more accurate.
步骤S220,基于目标特征确定目标产品资源的预测收益概率值。Step S220: Determine the predicted profit probability value of the target product resource based on the target characteristics.
本申请实施例中,根据构建得到的目标特征能够确定出目标产品资源的预测收益概率值,这个预测收益概率值可表示在未来N天内,该目标产品资源是否有可能获利的概率。In the embodiment of this application, the predicted profit probability value of the target product resource can be determined based on the constructed target characteristics. This predicted profit probability value can represent the probability of whether the target product resource is likely to make a profit in the next N days.
在本申请一个实施例中,请参阅图4,图4是根据一示例性实施例示出的一种产品资源的数据处理方法的流程图,该产品资源的数据处理方法包括步骤S410至步骤S460,详细介绍如下:In one embodiment of the present application, please refer to Figure 4. Figure 4 is a flow chart illustrating a data processing method for product resources according to an exemplary embodiment. The data processing method for product resources includes steps S410 to S460. The details are as follows:
步骤S410,从目标产品资源的原始数据中提取原始特征,并基于原始特征进行特征构建处理,得到目标特征。Step S410: Extract original features from the original data of the target product resources, and perform feature construction processing based on the original features to obtain the target features.
本申请实施例中,上述步骤S410与前述步骤S210的介绍一致,在此不进行赘述。In the embodiment of the present application, the above step S410 is consistent with the introduction of the above step S210, and will not be described again.
步骤S420,将目标特征输入至训练好的预测模型中,得到目标产品资源在多个维度上的预测收益概率值。Step S420: Input the target features into the trained prediction model to obtain the predicted profit probability values of the target product resources in multiple dimensions.
本申请实施例中,在目标特征构建好后,将其输入至训练好的预测模型中,得到多个维度上的预测收益概率值,在一些实施例中,多个维度上的预测收益概率可包括目标产品资源在未来N天内任意一天卖出的收益期望、目标产品资源在规定时间段后卖出的收益期望和目标产品资源在最理想状况下能获得的收益期望。预测模型会对这三种收益期望取或的关系,求得目标产品资源在上述三种情况是否有可能获利的预测收益概率值。In the embodiments of this application, after the target features are constructed, they are input into the trained prediction model to obtain predicted profit probability values in multiple dimensions. In some embodiments, the predicted profit probabilities in multiple dimensions can be It includes the profit expectation of selling the target product resource on any day in the next N days, the profit expectation of selling the target product resource after a specified period of time, and the profit expectation of the target product resource under the most ideal conditions. The prediction model will take the OR relationship between these three types of revenue expectations and obtain the predicted revenue probability value of whether the target product resources are likely to be profitable in the above three situations.
进一步地,上述预测模型可基于决策树模型或XGBoost(eXtreme Gradient Boosting,极端梯度提升)等模型训练而成。决策树模型是一种基于实例的归纳学习方法,它能从给定的无序的训练样本中,提炼出树型的分类模型。树中的每个非叶子节点记录了使用哪个特征来进行类别的判断,每个叶子节点则代表了最后判断的类别。根节点到每个叶子节点均形成一条分类的路径规则。而对新的样本进行测试时,只需要从根节点开始,在每个分支节点进行测试,沿着相应的分支递归地进入子树再测试,一直到达叶子节点,该叶子节点所代表的类别即是当前测试样本的预测类别。Furthermore, the above prediction model can be trained based on a decision tree model or a model such as XGBoost (eXtreme Gradient Boosting). The decision tree model is an instance-based inductive learning method that can extract a tree-type classification model from given unordered training samples. Each non-leaf node in the tree records which feature is used to judge the category, and each leaf node represents the final category judged. A classified path rule is formed from the root node to each leaf node. When testing a new sample, you only need to start from the root node, test at each branch node, recursively enter the subtree along the corresponding branch and test again, until you reach the leaf node. The category represented by the leaf node is is the predicted category of the current test sample.
进一步的,本实施例中基于决策树模型确定预测收益概率值时,先获取叶子节点中正样本的数量、以及总样本的数量,其中正样本包括预测得到正向结果的样本,例如预测结果为上升、收益等对应的样本。之后基于正样本的数量Num_psi、以及总样本的数量Num_tal来确定预测收益概率值Por_val为:Furthermore, in this embodiment, when determining the predicted profit probability value based on the decision tree model, the number of positive samples in the leaf nodes and the number of total samples are first obtained, where the positive samples include samples that are predicted to have positive results, for example, the predicted result is rising , income, etc. corresponding samples. Then, based on the number of positive samples Num_psi and the number of total samples Num_tal, the predicted profit probability value Por_val is determined as:
Figure PCTCN2022140756-appb-000001
Figure PCTCN2022140756-appb-000001
其中,α表示预设的概率因子。上述方式通过基于节点正样本数量和总样本数量之间的比例,同时将除正样本之外的其余样本情况考虑预测收益概率值的计算中,提高了预测精度和全面性。Among them, α represents the preset probability factor. The above method improves the accuracy and comprehensiveness of prediction by considering the ratio of the number of node positive samples to the total number of samples and considering the remaining sample conditions except the positive samples in the calculation of the predicted profit probability value.
XGBoost模型中会构建T颗回归树,当构建到第t颗回归树的时候,需要对前t-1颗回归树对训练样本分类回归产生的残差进行拟合。每次拟合产生新的回归树的时候,遍历所有可能的回归树,并选择使得目标函数值最小的回归树。但是这样在实践中难以实现,因此需要将步骤进行分解,在构造新的回归树的时候,每次只产生一个分支,并选择最好的那个分支。如果产生分支的目标函数值(cost)比不产生的时候大或者改进效果不明显,那么就放弃产生分支。T regression trees will be constructed in the XGBoost model. When the t-th regression tree is constructed, the residuals generated by the classification regression of the training samples from the first t-1 regression trees need to be fitted. Each time the fitting generates a new regression tree, all possible regression trees are traversed and the regression tree that minimizes the objective function value is selected. However, this is difficult to achieve in practice, so the steps need to be decomposed. When constructing a new regression tree, only one branch is generated at a time and the best branch is selected. If the objective function value (cost) of generating a branch is greater than when it is not generated or the improvement effect is not obvious, then the branch will be abandoned.
步骤S430,基于目标产品资源的预测收益概率值以及其他产品资源的预测收益概率值,确定目标产品资源的走向趋势。Step S430: Determine the trend of the target product resource based on the predicted profit probability value of the target product resource and the predicted profit probability value of other product resources.
本申请实施例中,上述步骤S430与后述步骤S230的介绍一致,在此先不进行描述。In the embodiment of the present application, the above-mentioned step S430 is consistent with the description of the later-described step S230, and will not be described here.
步骤S440,从预测模型含有的目标产品资源的多条可选决策路径中,获取目标决策路径。Step S440: Obtain the target decision path from multiple optional decision paths of the target product resource contained in the prediction model.
本申请实施例中,预测模型基于决策树模型训练而成,在训练时,决策树模型是从根节点开始,对产品资源的某一特征进行测试,根据测试结果将产品资源分配到其子节点,此时每个子节点对应着该特征的一个取值,如此递归的对产品资源进行测试并分配,直到到达叶节点,最后将产品资源分到叶节点的类中。因此,预测模型中会形成多条决策路径,将目标产品资源输入到预测模型中后,经过一条决策路径,到达对应的叶节点中,该条决策路径即为目标决策路径。In the embodiment of this application, the prediction model is trained based on the decision tree model. During training, the decision tree model starts from the root node, tests a certain feature of the product resource, and allocates the product resource to its child nodes according to the test results. , at this time, each child node corresponds to a value of the feature, and the product resources are tested and allocated recursively until reaching the leaf node, and finally the product resources are divided into the classes of the leaf node. Therefore, multiple decision paths will be formed in the prediction model. After the target product resources are input into the prediction model, they will go through a decision path and reach the corresponding leaf node. This decision path is the target decision path.
步骤S450,基于目标决策路径上的节点属性确定目标产品资源的原始解读信息。Step S450: Determine the original interpretation information of the target product resource based on the node attributes on the target decision path.
本申请实施例中,决策路径上包括有多个节点,这些节点具有对应的节点属性,这些节点属性决定了后续节点如何分裂,根据目标决策路径的节点属性能够确定出得到预测收益概率值的判断条件,对其进行归纳处理,得到目标产品资源的原始解读信息,该原始解读信息是对目标产品资源的综合解读内容。In the embodiment of this application, the decision path includes multiple nodes, and these nodes have corresponding node attributes. These node attributes determine how subsequent nodes are split. According to the node attributes of the target decision path, the judgment of obtaining the predicted profit probability value can be determined. Conditions are summarized and processed to obtain the original interpretation information of the target product resources. The original interpretation information is a comprehensive interpretation of the target product resources.
步骤S460,基于走向趋势以及目标产品资源的原始解读信息,得到目标产品资源的目标解读信息。Step S460: Obtain the target interpretation information of the target product resource based on the trend and the original interpretation information of the target product resource.
本申请实施例中,上述步骤S460与后述步骤S240的介绍一致,在此先不进行描述。In the embodiment of the present application, the above-mentioned step S460 is consistent with the description of the later-described step S240, and will not be described here.
本实施例中所提供的技术方案,通过训练好的预测模型去得到预测收益概率值,预测模型能够将目标产品资源的各项数据进行紧密联系,进而给出相应的预测收益概率值。同 时,预测模型训练好后,能够快速的计算出预测收益概率值,不需要依靠人工经验。The technical solution provided in this embodiment obtains the predicted revenue probability value through the trained prediction model. The prediction model can closely connect various data of the target product resources, and then provide the corresponding predicted revenue probability value. At the same time, after the prediction model is trained, the predicted profit probability value can be quickly calculated without relying on manual experience.
在本申请的一个实施例中,参见图5,步骤S440中在从预测模型含有的目标产品资源的多条可选决策路径中,获取目标决策路径之后,产品资源的数据处理方法还包括步骤S510-S520,详细介绍如下:In one embodiment of the present application, referring to Figure 5, in step S440, after obtaining the target decision path from the multiple optional decision paths of the target product resource contained in the prediction model, the data processing method of the product resource also includes step S510 -S520, detailed introduction is as follows:
步骤S510,将目标产品资源的目标决策路径与各个历史产品资源的历史决策路径进行匹配。Step S510: Match the target decision path of the target product resource with the historical decision path of each historical product resource.
本申请实施例中,历史产品资源都经过预测模型的处理,从而能够获取到与步骤S440一致的目标决策路径,历史产品资源的目标决策路径即为对应的历史决策路径,将目标产品资源的目标决策路径和历史产品资源的历史决策路径进行匹配,确定目标决策路径和历史决策路径是否相同。In the embodiment of this application, the historical product resources are processed by the prediction model, so that the target decision path consistent with step S440 can be obtained. The target decision path of the historical product resource is the corresponding historical decision path. The target decision path of the target product resource is The decision path is matched with the historical decision path of historical product resources to determine whether the target decision path and the historical decision path are the same.
步骤S520,获取与目标决策路径相匹配的历史决策路径对应的历史产品资源作为目标产品资源的参考产品资源,以基于参考产品资源的信息确定对目标产品资源的相关决策。Step S520: Obtain the historical product resources corresponding to the historical decision paths that match the target decision path as reference product resources for the target product resources, so as to determine relevant decisions on the target product resources based on the information of the reference product resources.
本申请实施例中,当历史决策路径与目标决策路径相同时,则将与目标决策路径相同的历史决策路径所对应的历史产品资源作为目标产品资源的参考产品资源,用户在需要对目标产品资源做出相关决策时,可以结合参考产品资源的一些情况综合分析,进而做出决策。如用户在获取到参考产品资源后,可以通过查看参考产品资源的走势,确定目标产品资源是否要买入或卖出,或在哪天买入或卖出的收益更好。In the embodiment of this application, when the historical decision path is the same as the target decision path, the historical product resources corresponding to the historical decision path that are the same as the target decision path are used as the reference product resources of the target product resources. When the user needs to make reference to the target product resources When making relevant decisions, you can make a comprehensive analysis based on some situations of reference product resources, and then make a decision. For example, after the user obtains the reference product resources, he can check the trend of the reference product resources to determine whether the target product resources should be bought or sold, or on which day the profit from buying or selling is better.
在本申请的一个实施例中,参见图6,步骤S420在将目标特征输入至训练好的预测模型中,得到产品资源在多个维度上的预测收益概率值之前,方法还包括步骤S610-S620,详细介绍如下:In one embodiment of the present application, referring to Figure 6, before step S420 inputs the target features into the trained prediction model to obtain the predicted profit probability value of the product resource in multiple dimensions, the method also includes steps S610-S620. , the details are as follows:
步骤S610,获取样本产品资源的训练样本数据。Step S610: Obtain training sample data of sample product resources.
本申请实施例中,样本产品资源包括多个,获取样本产品资源的训练样本数据,样本产品资源的训练样本数据相当于目标产品资源的原始数据。在训练时,可准备有足量的样本产品资源进行训练,当所有的样本产品资源输入到决策树模型中进行训练即可;还可将样本产品资源分为训练组和验证组,使用训练组对决策树模型进行训练后,在通过验证组去验证训练后的决策树模型的性能是否满足要求,当不满足时,重新进行训练。In the embodiment of this application, the sample product resources include multiple, and the training sample data of the sample product resources is obtained. The training sample data of the sample product resources is equivalent to the original data of the target product resources. During training, a sufficient amount of sample product resources can be prepared for training, and all sample product resources can be input into the decision tree model for training; the sample product resources can also be divided into training groups and verification groups, using the training group After training the decision tree model, use the verification group to verify whether the performance of the trained decision tree model meets the requirements. If it does not meet the requirements, retrain.
步骤S620,基于训练样本数据得到样本产品资源的流向分布特征以及涨幅特征。Step S620: Obtain the flow distribution characteristics and increase characteristics of the sample product resources based on the training sample data.
本申请实施例中,对训练样本数据进行如前述的特征构造处理,得到对应的流向分布特征,在训练时对训练样本数据采用何种特征构造处理,就对目标产品资源的原始特征采用何种特征构造处理。同样的,训练样本数据为实际的数据,因此可基于训练样本数据得到涨幅特征。In the embodiment of the present application, the training sample data is subjected to the feature construction processing as mentioned above to obtain the corresponding flow direction distribution characteristics. Which feature construction processing is used for the training sample data during training will be used for the original features of the target product resources. Feature construction processing. Similarly, the training sample data is actual data, so the growth characteristics can be obtained based on the training sample data.
在本申请的一个实施例中,训练样本数据包括表征样本产品资源流向的第一数据、表 征样本产品资源分布的第二数据以及表征样本产品资源涨跌的第三数据;参见图7,步骤S620基于训练样本数据得到样本产品资源的流向分布特征以及涨幅特征,包括步骤S710-S730,详细介绍如下:In one embodiment of the present application, the training sample data includes first data characterizing the flow direction of the sample product resources, second data characterizing the distribution of the sample product resources, and third data characterizing the rise and fall of the sample product resources; see Figure 7, step S620 Based on the training sample data, the flow distribution characteristics and growth characteristics of the sample product resources are obtained, including steps S710-S730, which are described in detail as follows:
步骤S710,基于第一数据、第二数据和第三数据进行特征构建处理,得到样本产品资源的流向分布特征。Step S710: Perform feature construction processing based on the first data, the second data, and the third data to obtain the flow distribution characteristics of the sample product resources.
本申请实施例中,将第一数据、第二数据和第三数据经过如前述一致的特征构造处理,即对第一数据、第二数据和第三数据相应的第一特征进行特征交叉处理,得到第二特征,再将第二特征与第一特征进行特征衍生处理,得到第三特征,将第一特征、第二特征和第三特征进行组合,得到流向分布特征。In the embodiment of the present application, the first data, the second data and the third data are processed through the same feature structure as mentioned above, that is, the corresponding first features of the first data, the second data and the third data are subjected to feature intersection processing. Obtain the second feature, then perform feature derivation processing on the second feature and the first feature to obtain the third feature, and combine the first feature, the second feature and the third feature to obtain the flow direction distribution feature.
步骤S720,基于第三数据计算样本产品资源在平均收益维度上的第一收益、样本产品资源在随机收益维度上的第二收益以及样本产品资源在最大收益维度上卖出的第三收益,并基于第一收益、第二收益和第三收益得到样本产品资源的涨幅特征。Step S720: Calculate the first income of the sample product resource in the average income dimension, the second income of the sample product resource in the random income dimension, and the third income of the sample product resource in the maximum income dimension based on the third data, and The growth characteristics of sample product resources are obtained based on the first income, second income and third income.
本申请实施例中,第三数据表征样本产品资源的涨跌,即第三数据记录了样本产品资源在预设时间段内的价格变动情况。根据第三数据能够计算出第一收益、第二收益和第三收益,其中第一收益为样本产品资源未来N天内任意一天卖出的收益期望,第二收益为样本产品资源在规定时间段后卖出的收益期望,第三收益为样本产品资源在最理想状况下能获得的收益期望。In the embodiment of the present application, the third data represents the rise and fall of the sample product resources, that is, the third data records the price changes of the sample product resources within a preset time period. The first income, the second income and the third income can be calculated based on the third data. The first income is the income expectation of the sample product resources sold on any day in the next N days, and the second income is the sample product resources after the specified time period. The profit expectation of selling, and the third profit is the profit expectation that the sample product resources can obtain under the most ideal conditions.
在一些实施例中,第一收益是在假定样本产品资源在T 0日买入,在未来的N天内,分别在T 1、T 2、...、T N日卖出样本产品资源,得到的不同的收益率P 1、P 2、...、P N,再根据每天的收益率计算得到的。第一收益可通过公式
Figure PCTCN2022140756-appb-000002
表示,其中,FAP表示第一收益,N表示未来N天,N为大于等于1的整数,i表示第i天,i为小于等于N的整数,Pi为样本产品资源在第i天卖出的收益。
In some embodiments, the first profit is obtained by assuming that the sample product resources are purchased on T 0 day , and the sample product resources are sold on T 1 , T 2 , ..., T N days respectively in the next N days. The different rates of return P 1 , P 2 ,..., P N are calculated based on the daily rate of return. The first income can be obtained by the formula
Figure PCTCN2022140756-appb-000002
represents, among which, FAP represents the first profit, N represents the next N days, N is an integer greater than or equal to 1, i represents the i-th day, i is an integer less than or equal to N, and Pi is the sample product resource sold on the i-th day. income.
第二收益是假定样本产品资源在T 0日买入,那么在第T N日卖出样本产品资源,得到的收益率P NThe second return is the return rate P N obtained by assuming that the sample product resources are purchased on day T 0 and the sample product resources are sold on day T N .
第三收益是假定样本产品资源在T 0日买入,那么在T 1、T 2、...、T N日卖出样本产品资源,得到的每天的收益率P 1、P 2、...、P N中值最大的那边的收益率,第三收益可通过公式FMP=max({P 1,P 2,...,P N})表示。 The third income is assuming that the sample product resources are purchased on T 0 day , then the sample product resources are sold on T 1 , T 2 ,..., T N days, and the daily returns P 1 , P 2 ,... ., the yield on the side with the largest value in P N , and the third yield can be expressed by the formula FMP = max ({P 1 , P 2 ,..., P N }).
步骤S730,将流向分布特征和涨幅特征输入至决策树模型中进行迭代训练,得到预测模型。Step S730: Input the flow distribution characteristics and increase characteristics into the decision tree model for iterative training to obtain a prediction model.
本申请实施例中,将样本产品资源的流向分布特征和涨幅特征输入至决策树模型中进行迭代训练,使得训练好的预测模型能够根据目标产品资源的目标特征预测到未来N天内 是否有收益。In the embodiment of this application, the flow distribution characteristics and growth characteristics of the sample product resources are input into the decision tree model for iterative training, so that the trained prediction model can predict whether there will be profits in the next N days based on the target characteristics of the target product resources.
在一实施例中,在对决策树模型进行训练得到预测模型后,可按照每周、月、年等时间周期的滑动窗口对预测模型进行更新,使得每次更新后的预测模型能够更加适应各种产品资源的预测。In one embodiment, after training the decision tree model to obtain the prediction model, the prediction model can be updated according to the sliding window of weekly, monthly, yearly and other time periods, so that the prediction model after each update can be more adaptable to each time period. Forecast of product resources.
步骤S230,基于目标产品资源的预测收益概率值以及其他产品资源的预测收益概率值,确定目标产品资源的走向趋势。Step S230: Determine the trend of the target product resource based on the predicted profit probability value of the target product resource and the predicted profit probability value of other product resources.
本申请实施例中,预测模型根据目标特征输出的预测收益概率值为一个数值,对这个预测收益概率值进行标准化,标准化时考虑其他产品资源所对应的预测收益概率值的分布,将目标产品资源的预测收益概率值分级为“上涨”、“无明显趋势”以及“下跌”三种走向趋势。In the embodiment of this application, the predicted profit probability value output by the prediction model according to the target characteristics is a numerical value, and this predicted profit probability value is standardized. When standardizing, the distribution of predicted profit probability values corresponding to other product resources is considered, and the target product resources are The predicted profit probability value is classified into three trends: "up", "no obvious trend" and "down".
步骤S240,基于走向趋势以及目标产品资源的原始解读信息,得到目标产品资源的目标解读信息。Step S240: Obtain the target interpretation information of the target product resource based on the trend and the original interpretation information of the target product resource.
本申请实施例中,目标产品资源具有相应的原始解读信息,该原始解读信息是根据原始数据得到的,将走向趋势插入到原始解读信息的相应位置上,使得形成完整的目标解读信息展示在目标产品资源的解读页面上,如目标解读信息为:近期,筹码最大获利价差为X,主力成交占比持续下降,目标产品资源的股价在未来N天内可能下跌,其中的下跌即为经过计算得到的走向趋势。In the embodiment of this application, the target product resources have corresponding original interpretation information. The original interpretation information is obtained based on the original data. The trend is inserted into the corresponding position of the original interpretation information, so that the complete target interpretation information is displayed on the target On the interpretation page of product resources, if the target interpretation information is: Recently, the maximum profit spread of chips is trend.
本实施例中,对目标产品资源的原始特征进行特征构建,得到目标特征,目标特征相对于原始特征而言,信息量更多,根据目标特征去确定目标产品资源的预测收益概率值能够更加准确,同时,还根据预设收益概率值得到走向趋势,基于走向趋势与原始解读信息得到目标解读信息,用户通过查看目标解读信息了解到目标产品资源在未来的走向,在对目标产品资源做相应的决策时,可以结合目标解读信息进行,而不用依靠人为经验去查看目标产品资源的数据信息后,再归纳总结做出决策。In this embodiment, feature construction is performed on the original features of the target product resources to obtain the target features. Compared with the original features, the target features have more information. Determining the predicted revenue probability value of the target product resources based on the target features can be more accurate. , at the same time, the trend is also obtained based on the preset profit probability value, and the target interpretation information is obtained based on the trend trend and the original interpretation information. The user understands the future direction of the target product resources by viewing the target interpretation information, and makes corresponding adjustments to the target product resources. When making decisions, you can combine the target interpretation information instead of relying on human experience to review the data information of the target product resources and then summarize and make decisions.
在本申请的一个实施例中,参见图8,在步骤S240中基于走向趋势以及目标产品资源的原始解读信息,得到目标产品资源的目标解读信息,包括步骤S810和步骤S820,详细介绍如下:In one embodiment of the present application, referring to Figure 8, in step S240, based on the trend and the original interpretation information of the target product resource, the target interpretation information of the target product resource is obtained, including step S810 and step S820. The details are as follows:
步骤S810,通过预设的分析描述算法对原始解读信息进行处理,得到处理后的原始解读信息。Step S810: Process the original interpretation information through a preset analysis description algorithm to obtain processed original interpretation information.
本实施例中,预先设置有分析描述算法,该分析描述算法基于支持向量回归(Support Vector Regression,SVR)得到的,支持向量回归模型是由支持向量机演变而来的用于回归的机器学习模型,由于该模型具有较好的预测性能,且该模型的推导过程是在凸集上求解拉格朗日对偶方程的解,是一个二次优化问题,利用该模型得到的解为全局最优解。经过 分析描述算法处理后的原始解读信息,能够更加通顺、准确的对目标产品资源进行描述。In this embodiment, an analysis description algorithm is preset. The analysis description algorithm is based on Support Vector Regression (SVR). The support vector regression model is a machine learning model for regression evolved from the support vector machine. , because this model has good prediction performance, and the derivation process of this model is to solve the solution of the Lagrangian dual equation on a convex set, which is a quadratic optimization problem, and the solution obtained by using this model is the global optimal solution. . The original interpretation information processed by the analysis and description algorithm can describe the target product resources more smoothly and accurately.
步骤S820,将走向趋势和处理后的原始解读信息进行合并,得到目标产品资源的目标解读信息。Step S820: Combine the trend and the processed original interpretation information to obtain the target interpretation information of the target product resource.
本申请实施例中,将走向趋势插入到处理后的原始解读信息的固定位置,形成一句完成的话,即可得到目标产品资源的目标解读信息。In the embodiment of the present application, the trend trend is inserted into a fixed position of the processed original interpretation information to form a complete sentence, and the target interpretation information of the target product resource can be obtained.
以下对本申请实施例的一个具体应用场景进行详细说明:A specific application scenario of the embodiment of this application is described in detail below:
可选实施例中,请参阅图9,图9是根据一示例性实施例示出的一种产品资源的数据处理方法的流程图,该产品资源的数据处理方法包括步骤S910至步骤S9130,详细介绍如下;In an optional embodiment, please refer to Figure 9. Figure 9 is a flow chart illustrating a data processing method for product resources according to an exemplary embodiment. The data processing method for product resources includes steps S910 to step S9130. Detailed introduction as follows;
步骤S910,基于样本数据的第三数据计算样本产品资源在平均收益维度上的第一收益、样本产品资源在随机收益维度上的第二收益以及样本产品资源在最大收益维度上卖出的第三收益,并基于第一收益、第二收益和第三收益得到样本产品资源的涨幅特征。Step S910, based on the third data of the sample data, calculate the first income of the sample product resources in the average income dimension, the second income of the sample product resources in the random income dimension, and the third income of the sample product resources sold in the maximum income dimension. income, and obtain the growth characteristics of the sample product resources based on the first income, second income and third income.
本申请实施例中,如步骤S910描述,根据样本数据的第三数据计算出第一收益、第二收益和第三收益,第一收益、第二收益和第三收益在前述均已描述,在此不重复描述。In the embodiment of the present application, as described in step S910, the first income, the second income and the third income are calculated according to the third data of the sample data. The first income, the second income and the third income have all been described above. This description is not repeated.
步骤S920,基于样本产品资源的第一数据、第二数据和第三数据进行特征构建处理,得到样本产品资源的流向分布特征。Step S920: Perform feature construction processing based on the first data, second data and third data of the sample product resources to obtain the flow distribution characteristics of the sample product resources.
本申请实施例中,对第一数据、第二数据和第三数据相应的第一特征进行特征交叉处理,得到第二特征,再将第二特征与第一特征进行特征衍生处理,得到第三特征,将第一特征、第二特征和第三特征进行组合,得到流向分布特征。In the embodiment of the present application, feature intersection processing is performed on the first features corresponding to the first data, second data and third data to obtain the second feature, and then feature derivation processing is performed on the second feature and the first feature to obtain the third feature. Features, combine the first feature, the second feature and the third feature to obtain the flow distribution feature.
步骤S930,将流向分布特征和涨幅特征输入至决策树模型中进行迭代训练,得到预测模型。Step S930: Input the flow distribution characteristics and increase characteristics into the decision tree model for iterative training to obtain a prediction model.
本申请实施例中,将样本产品资源的流向分布特征和涨幅特征输入到决策树模型中进行迭代训练,使得训练好的预测模型能够根据目标产品资源的目标特征得到目标产品资源在未来N天内是否有可能获利的预测收益概率值。In the embodiment of this application, the flow distribution characteristics and increase characteristics of the sample product resources are input into the decision tree model for iterative training, so that the trained prediction model can obtain whether the target product resource will be in the next N days based on the target characteristics of the target product resource. The predicted profit probability value that is likely to make a profit.
步骤S940,从目标产品资源的原始数据中提取原始特征,并基于原始特征进行特征构建处理,得到目标特征。Step S940: Extract original features from the original data of the target product resources, and perform feature construction processing based on the original features to obtain the target features.
本申请实施例中,将目标产品资源的原始特征提取出来后,再根据原始特征进行如前述一致的特征构建处理,得到对应的目标特征。In the embodiment of this application, after the original features of the target product resources are extracted, the same feature construction process as described above is performed based on the original features to obtain the corresponding target features.
步骤S950,将目标特征输入到训练好的预测模型中得到目标产品资源在多个维度上的预测收益概率值。Step S950: Input the target features into the trained prediction model to obtain the predicted profit probability values of the target product resources in multiple dimensions.
本申请实施例中,将目标特征输入到基于决策树模型训练好的预测模型中进行计算,得到对应的预测收益概率值,In the embodiment of this application, the target features are input into the prediction model trained based on the decision tree model for calculation, and the corresponding predicted profit probability value is obtained.
步骤S960,基于目标产品资源的预测收益概率值以及其他产品资源的预测收益概率值,确定目标产品资源的走向趋势。Step S960: Determine the trend of the target product resource based on the predicted profit probability value of the target product resource and the predicted profit probability value of other product resources.
本申请实施例中,预测模型根据目标特征输出的预测收益概率值为一个数值,对这个预测收益概率值依据其他产品资源所对应的预测收益概率值的分布,将目标产品资源的预测收益概率值分级为“上涨”、“无明显趋势”以及“下跌”三种走向趋势。In the embodiment of this application, the predicted profit probability value output by the prediction model according to the target characteristics is a numerical value. For this predicted profit probability value, the predicted profit probability value of the target product resource is calculated based on the distribution of predicted profit probability values corresponding to other product resources. It is classified into three trends: "up", "no obvious trend" and "down".
步骤S970,从预测模型含有的目标产品资源的多条可选决策路径中,获取目标决策路径。Step S970: Obtain the target decision path from multiple optional decision paths of the target product resource contained in the prediction model.
本申请实施例中,决策树模型中具有多条决策路径,每条决策路径中均有一个叶节点,目标产品资源会最终被分类到的正确的叶节点,正确的叶节点所对应的决策路径则为目标产品资源的目标决策路径。In the embodiment of this application, the decision tree model has multiple decision paths, and each decision path has a leaf node. The target product resources will eventually be classified into the correct leaf node, and the decision path corresponding to the correct leaf node is the target decision path of the target product resources.
步骤S980,基于目标决策路径上的节点属性确定目标产品资源的原始解读信息。Step S980: Determine the original interpretation information of the target product resource based on the node attributes on the target decision path.
本申请实施例中,决策树模型中的各个节点具有对应的节点属性,这些节点属性决定了后续节点如何分裂,根据目标决策路径的节点属性能够确定出得到预测收益概率值的判断条件,对其进行归纳处理,得到目标产品资源的原始解读信息,该原始解读信息是对目标产品资源的综合解读内容。In the embodiment of this application, each node in the decision tree model has corresponding node attributes. These node attributes determine how subsequent nodes are split. According to the node attributes of the target decision path, the judgment conditions for obtaining the predicted profit probability value can be determined. For this Perform inductive processing to obtain the original interpretation information of the target product resources, which is a comprehensive interpretation of the target product resources.
步骤S990,通过预设的分析描述算法对原始解读信息进行处理,得到处理后的原始解读信息。Step S990: Process the original interpretation information through a preset analysis description algorithm to obtain processed original interpretation information.
本申请实施例中,经过分析描述算法处理后的原始解读信息,能够更加通顺、准确的对目标产品资源进行描述。In the embodiment of this application, the original interpretation information processed by the analysis and description algorithm can describe the target product resources more smoothly and accurately.
步骤S9100,将走向趋势和处理后的原始解读信息进行合并,得到目标产品资源的目标解读信息。Step S9100: Combine the trend and the processed original interpretation information to obtain the target interpretation information of the target product resource.
本申请实施例中,处理后的原始解读信息中有专门放置走向趋势的空缺,直接将走向趋势插入到原始解读信息中即可形成一句完整的文字,即为目标解读信息。用户可以通过直接查看目标解读信息了解到未来N天是否有可能获利,进而对目标产品资源做出相关的决策。In the embodiment of this application, the processed original interpretation information has a vacancy specifically for placing the trend. Directly inserting the trend into the original interpretation information can form a complete sentence, which is the target interpretation information. Users can directly view the target interpretation information to learn whether profits are possible in the next N days, and then make relevant decisions about target product resources.
步骤S9110,将目标产品资源的目标决策路径与各个历史产品资源的历史决策路径进行匹配。Step S9110: Match the target decision path of the target product resource with the historical decision path of each historical product resource.
本申请实施例中,为了更好的便于用户对目标产品资源做出相关决策,将目标产品资源的目标决策路径和历史产品资源的历史决策路径进行匹配,历史产品资源的历史决策路径与目标产品资源吃目标决策路径采用同样的方式得到的,直接将两者进行匹配,可以很好的确定出与目标产品资源较为相似的一个历史产品资源。In the embodiment of this application, in order to better facilitate users to make relevant decisions about target product resources, the target decision path of the target product resource is matched with the historical decision path of historical product resources. The historical decision path of the historical product resource is matched with the target product The resource-eating target decision path is obtained in the same way. By directly matching the two, a historical product resource that is similar to the target product resource can be well determined.
步骤S9120,获取与目标决策路径相匹配的历史决策路径对应的历史产品资源作为目标产品资源的参考产品资源,以基于参考产品资源的信息确定对目标产品资源的相关决策。Step S9120: Obtain the historical product resources corresponding to the historical decision paths that match the target decision path as reference product resources for the target product resources, so as to determine relevant decisions on the target product resources based on the information of the reference product resources.
本申请实施例中,目标决策路径与历史决策路径匹配,代表两者在决策树模型中经过相同的路径,直接将与目标决策路径匹配的历史决策路径所对应的历史产品资源作为目标产品资源的参考产品资源,参考产品资源的历史决策路径同样用于确定未来N天是否有可能获利,但参考产品资源的得到历史决策路径的基础上,具有真实的未来N天的相应数据,因此,用户可以通过查看历史产品资源真实的未来N天的相应数据做出决策。In the embodiment of this application, the target decision path matches the historical decision path, which means that the two go through the same path in the decision tree model, and the historical product resources corresponding to the historical decision path that matches the target decision path are directly used as the target product resources. Reference product resources, the historical decision-making path of reference product resources are also used to determine whether it is possible to make a profit in the next N days, but based on the historical decision-making path of reference product resources, there are real corresponding data for the next N days. Therefore, the user Decisions can be made by viewing the real corresponding data of historical product resources in the next N days.
步骤S9130,将目标解读信息和参考产品资源作为输出结果展示在目标产品资源的解读页面。Step S9130: Display the target interpretation information and reference product resources as output results on the interpretation page of the target product resource.
本申请实施例中,将目标解读信息和参考产品资源一起展示给用户,用户可以直观的查看到最终的结果,同时解读页面上还可设置参考产品资源的跳转按钮,使得用户可以由解读页面跳转到参考产品资源的相应页面上,更加全面的观察参考产品资源。In the embodiment of this application, the target interpretation information and the reference product resources are displayed to the user together, and the user can intuitively view the final results. At the same time, a jump button for the reference product resources can also be set on the interpretation page, so that the user can view the result from the interpretation page. Jump to the corresponding page of the reference product resources to observe the reference product resources more comprehensively.
本申请实施例提供的技术方案,通过训练好的预测模型对目标产品资源的目标特征进行处理,得到预测收益概率值,并对预测收益概率值根据其他产品资源的预测收益概率值进行处理,得到走向趋势,将走向趋势插入到由目标决策路径得到的原始解读信息中,形成目标解读信息,能够很好的依据目标产品资源的信息,依靠算法来得到目标解读信息,在提高信息的转换率和接收率的同时,提升了有效信息的信息量,而不是依靠人为经验进行归纳,用户依靠目标解读信息能够更加准确的做出相关决策,同时,本实施例还为目标产品资源提供了一个参考产品资源,用户可以结合目标解读信息和参考产品资源来做出相关决策。The technical solution provided by the embodiment of this application processes the target characteristics of the target product resources through the trained prediction model to obtain the predicted revenue probability value, and processes the predicted revenue probability value based on the predicted revenue probability values of other product resources to obtain To trend, the trend is inserted into the original interpretation information obtained from the target decision path to form the target interpretation information. It can rely on the information of the target product resources and rely on the algorithm to obtain the target interpretation information, which improves the conversion rate of information and At the same time as the reception rate, the amount of effective information is increased. Instead of relying on human experience to summarize, users can make relevant decisions more accurately by relying on target interpretation information. At the same time, this embodiment also provides a reference product for target product resources. Resources, users can interpret information based on goals and refer to product resources to make relevant decisions.
请参阅图10,本申请一示例性实施例提供了一种产品资源的数据处理装置,包括:Referring to Figure 10, an exemplary embodiment of the present application provides a data processing device for product resources, including:
提取模块1010,配置为从目标产品资源的原始数据中提取原始特征,并基于原始特征进行特征构建处理,得到目标特征;The extraction module 1010 is configured to extract original features from the original data of the target product resources, and perform feature construction processing based on the original features to obtain the target features;
第一确定模块1020,配置为基于目标特征确定目标产品资源的预测收益概率值;The first determination module 1020 is configured to determine the predicted revenue probability value of the target product resource based on the target characteristics;
第二确定模块1030,配置为基于目标产品资源的预测收益概率值以及其他产品资源的预测收益概率值,确定目标产品资源的走向趋势;The second determination module 1030 is configured to determine the trend of the target product resource based on the predicted profit probability value of the target product resource and the predicted profit probability value of other product resources;
目标解读信息模块1040,配置为基于走向趋势以及目标产品资源的原始解读信息,得到目标产品资源的目标解读信息。The target interpretation information module 1040 is configured to obtain the target interpretation information of the target product resources based on the trend and the original interpretation information of the target product resources.
在一示例性实施例中,原始特征的数量为多个;提取模块1010,包括:第一处理子模块,配置为对多个原始特征进行特征交叉处理,得到交叉特征;第二处理子模块,配置为对原始特征和交叉特征进行特征衍生处理,得到衍生特征;组合子模块,配置为将原始特 征、交叉特征以及衍生特征进行组合,得到目标特征。In an exemplary embodiment, the number of original features is multiple; the extraction module 1010 includes: a first processing sub-module configured to perform feature intersection processing on multiple original features to obtain intersection features; a second processing sub-module, It is configured to perform feature derivation processing on original features and cross features to obtain derived features; the combination submodule is configured to combine original features, cross features and derived features to obtain target features.
在一示例性实施例中,第一确定模块1020,包括:输入子模块,配置为将目标特征输入至训练好的预测模型中,得到目标产品资源在多个维度上的预测收益概率值;装置还包括:第一获取模块,配置为从预测模型含有的目标产品资源的多条可选决策路径中,获取目标决策路径;第三确定模块,配置为基于目标决策路径上的节点属性确定目标产品资源的原始解读信息。In an exemplary embodiment, the first determination module 1020 includes: an input submodule configured to input target features into the trained prediction model to obtain predicted revenue probability values of the target product resources in multiple dimensions; device It also includes: a first acquisition module configured to acquire a target decision path from multiple optional decision paths of target product resources contained in the prediction model; a third determination module configured to determine the target product based on node attributes on the target decision path Original interpretation information of the resource.
在一示例性实施例中,装置还包括:匹配模块,配置为将目标产品资源的目标决策路径与各个历史产品资源的历史决策路径进行匹配;第二获取模块,配置为获取与目标决策路径相匹配的历史决策路径对应的历史产品资源作为目标产品资源的参考产品资源,以基于参考产品资源的信息确定对目标产品资源的相关决策。In an exemplary embodiment, the device further includes: a matching module configured to match the target decision path of the target product resource with the historical decision path of each historical product resource; a second acquisition module configured to obtain the target decision path corresponding to the target decision path. The historical product resources corresponding to the matched historical decision paths are used as reference product resources for the target product resources to determine relevant decisions on the target product resources based on the information of the reference product resources.
在一示例性实施例中,装置,还包括:第三获取模块,配置为获取样本产品资源的训练样本数据;特征模块,配置为基于训练样本数据得到样本产品资源的流向分布特征以及涨幅特征;训练模块,配置为将流向分布特征和涨幅特征输入至决策树模型中进行迭代训练,得到预测模型。In an exemplary embodiment, the device further includes: a third acquisition module configured to acquire training sample data of the sample product resources; a feature module configured to obtain the flow distribution characteristics and increase characteristics of the sample product resources based on the training sample data; The training module is configured to input flow distribution characteristics and increase characteristics into the decision tree model for iterative training to obtain a prediction model.
在一示例性实施例中,训练样本数据包括表征样本产品资源流向的第一数据、表征样本产品资源分布的第二数据以及表征样本产品资源涨跌的第三数据;特征模块,包括:特征构建子模块,配置为基于第一数据、第二数据和第三数据进行特征构建处理,得到样本产品资源的流向分布特征;以及,计算子模块,配置为基于第三数据计算样本产品资源在平均收益维度上的第一收益、样本产品资源在随机收益维度上的第二收益以及样本产品资源在最大收益维度上卖出的第三收益,并基于第一收益、第二收益和第三收益得到样本产品资源的涨幅特征。In an exemplary embodiment, the training sample data includes first data characterizing the flow of sample product resources, second data characterizing the distribution of sample product resources, and third data characterizing the rise and fall of sample product resources; the feature module includes: feature construction The sub-module is configured to perform feature construction processing based on the first data, the second data and the third data to obtain the flow distribution characteristics of the sample product resources; and the calculation sub-module is configured to calculate the average income of the sample product resources based on the third data. The first income in the dimension, the second income in the random income dimension of the sample product resources, and the third income from the sale of the sample product resources in the maximum income dimension, and the sample is obtained based on the first income, the second income and the third income. Growth characteristics of product resources.
在一示例性实施例中,目标解读信息模块1040,包括:第三处理子模块,配置为通过预设的分析描述算法对原始解读信息进行处理,得到处理后的原始解读信息;合并子模块,配置为将走向趋势和处理后的原始解读信息进行合并,得到目标产品资源的目标解读信息。In an exemplary embodiment, the target interpretation information module 1040 includes: a third processing sub-module configured to process the original interpretation information through a preset analysis description algorithm to obtain processed original interpretation information; a merging sub-module, It is configured to merge the trend and the processed original interpretation information to obtain the target interpretation information of the target product resources.
需要说明的是,上述实施例所提供的装置与上述实施例所提供的方法属于同一构思,其中各个模块和子模块执行操作的具体方式已经在方法实施例中进行了详细描述,此处不再赘述。It should be noted that the device provided by the above embodiments and the method provided by the above embodiments belong to the same concept. The specific manner in which each module and sub-module performs operations has been described in detail in the method embodiments and will not be described again here. .
本申请的实施例还提供了一种电子设备,包括:一个或多个处理器;存储装置,用于存储一个或多个程序,当所述一个或多个程序被所述一个或多个处理器执行时,使得所述电子设备实现上述各个实施例中提供的产品资源的数据处理方法。Embodiments of the present application also provide an electronic device, including: one or more processors; a storage device for storing one or more programs. When the one or more programs are processed by the one or more When the processor is executed, the electronic device is caused to implement the data processing method of product resources provided in the above embodiments.
图11示出了适于用来实现本申请实施例的电子设备的计算机系统的结构示意图。FIG. 11 shows a schematic structural diagram of a computer system suitable for implementing an electronic device according to an embodiment of the present application.
需要说明的是,图11示出的电子设备的计算机系统1100仅是一个示例,不应对本申请实施例的功能和使用范围带来任何限制。It should be noted that the computer system 1100 of the electronic device shown in FIG. 11 is only an example, and should not impose any restrictions on the functions and scope of use of the embodiments of the present application.
如图11所示,计算机系统1100包括中央处理单元(Central Processing Unit,CPU)1101,其可以根据存储在只读存储器(Read-Only Memory,ROM)1102中的程序或者从储存部分1108加载到随机访问存储器(Random Access Memory,RAM)1103中的程序而执行各种适当的动作和处理,例如执行上述实施例中所述的方法。在RAM 1103中,还存储有系统操作所需的各种程序和数据。CPU 1101、ROM 1102以及RAM 1103通过总线1104彼此相连。输入/输出(Input/Output,I/O)接口1105也连接至总线1104。As shown in Figure 11, the computer system 1100 includes a central processing unit (Central Processing Unit, CPU) 1101, which can be loaded into a random accessory according to a program stored in a read-only memory (Read-Only Memory, ROM) 1102 or from a storage part 1108. Access the program in the memory (Random Access Memory, RAM) 1103 to perform various appropriate actions and processing, such as performing the method described in the above embodiment. In RAM 1103, various programs and data required for system operation are also stored. CPU 1101, ROM 1102 and RAM 1103 are connected to each other through bus 1104. An input/output (I/O) interface 1105 is also connected to bus 1104.
以下部件连接至I/O接口1105:包括键盘、鼠标等的输入部分1106;包括诸如阴极射线管(Cathode Ray Tube,CRT)、液晶显示器(Liquid Crystal Display,LCD)等以及扬声器等的输出部分1107;包括硬盘等的储存部分1108;以及包括诸如LAN(Local Area Network,局域网)卡、调制解调器等的网络接口卡的通信部分1109。通信部分1109经由诸如因特网的网络执行通信处理。驱动器1110也根据需要连接至I/O接口1105。可拆卸介质1111,诸如磁盘、光盘、磁光盘、半导体存储器等等,根据需要安装在驱动器1110上,以便于从其上读出的计算机程序根据需要被安装入储存部分1108。The following components are connected to the I/O interface 1105: an input part 1106 including a keyboard, a mouse, etc.; an output part 1107 including a cathode ray tube (Cathode Ray Tube, CRT), a liquid crystal display (Liquid Crystal Display, LCD), etc., and a speaker, etc. ; a storage part 1108 including a hard disk, etc.; and a communication part 1109 including a network interface card such as a LAN (Local Area Network) card, a modem, etc. The communication section 1109 performs communication processing via a network such as the Internet. Driver 1110 is also connected to I/O interface 1105 as needed. Removable media 1111, such as magnetic disks, optical disks, magneto-optical disks, semiconductor memories, etc., are installed on the drive 1110 as needed, so that a computer program read therefrom is installed into the storage portion 1108 as needed.
特别地,根据本申请的实施例,上文参考流程图描述的过程可以被实现为计算机软件程序。例如,本申请的实施例包括一种计算机程序产品,其包括承载在计算机可读介质上的计算机程序,该计算机程序包含用于执行流程图所示的方法的计算机程序。在这样的实施例中,该计算机程序可以通过通信部分1109从网络上被下载和安装,和/或从可拆卸介质1111被安装。在该计算机程序被中央处理单元(CPU)1101执行时,执行本申请的系统中限定的各种功能。In particular, according to embodiments of the present application, the process described above with reference to the flowchart may be implemented as a computer software program. For example, embodiments of the present application include a computer program product including a computer program carried on a computer-readable medium, the computer program including a computer program for performing the method shown in the flowchart. In such embodiments, the computer program may be downloaded and installed from the network via communication portion 1109 and/or installed from removable media 1111 . When the computer program is executed by the central processing unit (CPU) 1101, various functions defined in the system of the present application are executed.
需要说明的是,本申请实施例所示的计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质或者是上述两者的任意组合。计算机可读存储介质例如可以是电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(Erasable Programmable Read Only Memory,EPROM)、闪存、光纤、便携式紧凑磁盘只读存储器(Compact Disc Read-Only Memory,CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本申请中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而在本申请中,计算机可读的信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的计算机程序。这种传播的数据信号可以采用多种形式,包括 但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读的信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。计算机可读介质上包含的计算机程序可以用任何适当的介质传输,包括但不限于:无线、有线等等,或者上述的任意合适的组合。It should be noted that the computer-readable medium shown in the embodiments of the present application may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two. The computer-readable storage medium may be, for example, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, device or device, or any combination thereof. More specific examples of computer readable storage media may include, but are not limited to: an electrical connection having one or more wires, a portable computer disk, a hard drive, random access memory (RAM), read only memory (ROM), removable Programmable Read-Only Memory (Erasable Programmable Read Only Memory, EPROM), flash memory, optical fiber, portable compact disk read-only memory (Compact Disc Read-Only Memory, CD-ROM), optical storage device, magnetic storage device, or any of the above suitable The combination. As used herein, a computer-readable storage medium may be any tangible medium that contains or stores a program for use by or in connection with an instruction execution system, apparatus, or device. In this application, a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, in which a computer-readable computer program is carried. Such propagated data signals may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the above. A computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device . Computer programs embodied on computer-readable media may be transmitted using any suitable medium, including but not limited to: wireless, wired, etc., or any suitable combination of the above.
附图中的流程图和框图,图示了按照本申请各种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。其中,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,上述模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图或流程图中的每个方框、以及框图或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。The flowcharts and block diagrams in the accompanying drawings illustrate the architecture, functionality, and operations of possible implementations of systems, methods, and computer program products according to various embodiments of the present application. Each block in the flow chart or block diagram may represent a module, program segment, or part of the code. The above-mentioned module, program segment, or part of the code includes one or more executable components for implementing the specified logical function. instruction. It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown one after another may actually execute substantially in parallel, or they may sometimes execute in the reverse order, depending on the functionality involved. It will also be noted that each block in the block diagram or flowchart illustration, and combinations of blocks in the block diagram or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or operations, or may be implemented by special purpose hardware-based systems that perform the specified functions or operations. Achieved by a combination of specialized hardware and computer instructions.
描述于本申请实施例中所涉及到的单元可以通过软件的方式实现,也可以通过硬件的方式来实现,所描述的单元也可以设置在处理器中。其中,这些单元的名称在某种情况下并不构成对该单元本身的限定。The units involved in the embodiments of this application can be implemented in software or hardware, and the described units can also be provided in a processor. Among them, the names of these units do not constitute a limitation on the unit itself under certain circumstances.
本申请的另一方面还提供了一种计算机可读存储介质,其上存储有计算机程序,该计算机程序被处理器执行时实现如前所述的方法。该计算机可读存储介质可以是上述实施例中描述的电子设备中所包含的,也可以是单独存在,而未装配入该电子设备中。Another aspect of the present application also provides a computer-readable storage medium on which a computer program is stored. When the computer program is executed by a processor, the method as described above is implemented. The computer-readable storage medium may be included in the electronic device described in the above embodiments, or may exist separately without being assembled into the electronic device.
本申请的另一方面还提供了一种计算机程序产品或计算机程序,该计算机程序产品或计算机程序包括计算机指令,该计算机指令存储在计算机可读存储介质中。计算机设备的处理器从计算机可读存储介质读取该计算机指令,处理器执行该计算机指令,使得该计算机设备执行上述各个实施例中提供的方法。Another aspect of the present application also provides a computer program product or computer program, which includes computer instructions stored in a computer-readable storage medium. The processor of the computer device reads the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions, so that the computer device performs the methods provided in the above embodiments.
上述内容,仅为本申请的较佳示例性实施例,并非用于限制本申请的实施方案,本领域普通技术人员根据本申请的主要构思和精神,可以十分方便地进行相应的变通或修改,故本申请的保护范围应以权利要求书所要求的保护范围为准。The above content is only a preferred exemplary embodiment of the present application and is not intended to limit the implementation of the present application. Those of ordinary skill in the art can easily make corresponding modifications or modifications based on the main concept and spirit of the present application. Therefore, the protection scope of this application should be subject to the protection scope required by the claims.

Claims (11)

  1. 一种产品资源的数据处理方法,包括:A data processing method for product resources, including:
    从目标产品资源的原始数据中提取原始特征,并基于所述原始特征进行特征构建处理,得到目标特征;Extract original features from the original data of the target product resources, and perform feature construction processing based on the original features to obtain the target features;
    基于所述目标特征确定所述目标产品资源的预测收益概率值;Determine the predicted profit probability value of the target product resource based on the target characteristics;
    基于所述目标产品资源的预测收益概率值以及其他产品资源的预测收益概率值,确定所述目标产品资源的走向趋势;Determine the trend of the target product resource based on the predicted profit probability value of the target product resource and the predicted profit probability value of other product resources;
    基于所述走向趋势以及所述目标产品资源的原始解读信息,得到所述目标产品资源的目标解读信息。Based on the trend and the original interpretation information of the target product resource, the target interpretation information of the target product resource is obtained.
  2. 如权利要求1所述的方法,其中,所述原始特征的数量为多个;所述基于所述原始特征进行特征构建处理,得到目标特征,包括:The method of claim 1, wherein the number of the original features is multiple; the feature construction process based on the original features to obtain the target features includes:
    对所述多个原始特征进行特征交叉处理,得到交叉特征;Perform feature intersection processing on the multiple original features to obtain intersection features;
    对所述原始特征和所述交叉特征进行特征衍生处理,得到衍生特征;Perform feature derivation processing on the original features and the cross features to obtain derived features;
    将所述原始特征、所述交叉特征以及所述衍生特征进行组合,得到所述目标特征。The original features, the intersection features and the derived features are combined to obtain the target features.
  3. 如权利要求1所述的方法,其中,所述基于所述目标特征确定所述目标产品资源的预测收益概率值,包括:The method of claim 1, wherein determining the predicted revenue probability value of the target product resource based on the target characteristics includes:
    将所述目标特征输入至训练好的预测模型中,得到所述目标产品资源在多个维度上的预测收益概率值;Input the target features into the trained prediction model to obtain the predicted profit probability values of the target product resources in multiple dimensions;
    在所述基于所述走向趋势以及所述目标产品资源的原始解读信息,得到所述目标产品资源的目标解读信息之前,所述方法还包括:Before obtaining the target interpretation information of the target product resource based on the trend and the original interpretation information of the target product resource, the method further includes:
    从所述预测模型含有的所述目标产品资源的多条可选决策路径中,获取目标决策路径;Obtain a target decision path from multiple optional decision paths of the target product resource contained in the prediction model;
    基于所述目标决策路径上的节点属性确定所述目标产品资源的原始解读信息。The original interpretation information of the target product resource is determined based on the node attributes on the target decision path.
  4. 如权利要求3所述的方法,其中,在从所述预测模型含有的所述目标产品资源的多条可选决策路径中,获取目标决策路径之后,所述方法还包括:The method of claim 3, wherein after obtaining a target decision path from a plurality of optional decision paths of the target product resource contained in the prediction model, the method further includes:
    将所述目标产品资源的目标决策路径与各个历史产品资源的历史决策路径进行匹配;Match the target decision path of the target product resource with the historical decision path of each historical product resource;
    获取与所述目标决策路径相匹配的历史决策路径对应的历史产品资源作为所述目标产品资源的参考产品资源,以基于所述参考产品资源的信息确定对所述目标产品资源的相关决策。Obtain historical product resources corresponding to historical decision paths that match the target decision path as reference product resources for the target product resources, so as to determine relevant decisions for the target product resources based on information about the reference product resources.
  5. 如权利要求3所述的方法,其中,在所述将所述目标特征输入至训练好的预测模型中,得到所述产品资源在多个维度上的预测收益概率值之前,所述方法还包括:The method of claim 3, wherein before inputting the target features into the trained prediction model to obtain the predicted profit probability values of the product resources in multiple dimensions, the method further includes :
    获取样本产品资源的训练样本数据;Obtain training sample data of sample product resources;
    基于所述训练样本数据得到所述样本产品资源的流向分布特征以及涨幅特征;Obtain the flow distribution characteristics and increase characteristics of the sample product resources based on the training sample data;
    将所述流向分布特征和所述涨幅特征输入至决策树模型中进行迭代训练,得到所述预测模型。The flow distribution characteristics and the increase characteristics are input into the decision tree model for iterative training to obtain the prediction model.
  6. 如权利要求5所述的方法,其中,所述训练样本数据包括表征所述样本产品资源流向的第一数据、表征所述样本产品资源分布的第二数据以及表征所述样本产品资源涨跌的第三数据;所述基于所述训练样本数据得到所述样本产品资源的流向分布特征以及涨幅特征,包括:The method of claim 5, wherein the training sample data includes first data characterizing the flow direction of the sample product resources, second data characterizing the distribution of the sample product resources, and characterizing the rise and fall of the sample product resources. The third data; the flow distribution characteristics and increase characteristics of the sample product resources obtained based on the training sample data include:
    基于所述第一数据、所述第二数据和所述第三数据进行特征构建处理,得到所述样本产品资源的流向分布特征;以及,Perform feature construction processing based on the first data, the second data and the third data to obtain the flow distribution characteristics of the sample product resources; and,
    基于所述第三数据计算所述样本产品资源在平均收益维度上的第一收益、所述样本产品资源在随机收益维度上的第二收益以及所述样本产品资源在最大收益维度上卖出的第三收益,并基于所述第一收益、第二收益和第三收益得到所述样本产品资源的涨幅特征。Based on the third data, calculate the first income of the sample product resources in the average income dimension, the second income of the sample product resources in the random income dimension, and the sales of the sample product resources in the maximum income dimension. Third income, and obtain the growth characteristics of the sample product resources based on the first income, second income and third income.
  7. 如权利要求1至6中任一项所述的方法,其中,所述基于所述走向趋势以及所述目标产品资源的原始解读信息,得到所述目标产品资源的目标解读信息,包括:The method according to any one of claims 1 to 6, wherein obtaining the target interpretation information of the target product resource based on the trend and the original interpretation information of the target product resource includes:
    通过预设的分析描述算法对所述原始解读信息进行处理,得到处理后的原始解读信息;Process the original interpretation information through a preset analysis and description algorithm to obtain the processed original interpretation information;
    将所述走向趋势和所述处理后的原始解读信息进行合并,得到所述目标产品资源的目标解读信息。The trend trend and the processed original interpretation information are combined to obtain the target interpretation information of the target product resource.
  8. 一种产品资源的数据处理装置,包括:A data processing device for product resources, including:
    提取模块,配置为从目标产品资源的原始数据中提取原始特征,并基于所述原始特征进行特征构建处理,得到目标特征;The extraction module is configured to extract original features from the original data of the target product resources, and perform feature construction processing based on the original features to obtain the target features;
    第一确定模块,配置为基于所述目标特征确定所述目标产品资源的预测收益概率值;A first determination module configured to determine the predicted revenue probability value of the target product resource based on the target characteristics;
    第二确定模块,配置为基于所述目标产品资源的预测收益概率值以及其他产品资源的预测收益概率值,确定所述目标产品资源的走向趋势;The second determination module is configured to determine the trend of the target product resource based on the predicted profit probability value of the target product resource and the predicted profit probability value of other product resources;
    目标解读信息模块,配置为基于所述走向趋势以及所述目标产品资源的原始解读信息,得到所述目标产品资源的目标解读信息。The target interpretation information module is configured to obtain the target interpretation information of the target product resource based on the trend and the original interpretation information of the target product resource.
  9. 一种电子设备,包括:An electronic device including:
    一个或多个处理器;one or more processors;
    存储装置,用于存储一个或多个程序,当所述一个或多个程序被所述一个或多个处理器执行时,使得所述电子设备实现如权利要求1至7中任一项所述的产品资源的数据处理方法。Storage device, used to store one or more programs, when the one or more programs are executed by the one or more processors, so that the electronic device implements any one of claims 1 to 7 Data processing methods for product resources.
  10. 一种计算机可读存储介质,其上存储有计算机可读指令,当所述计算机可读指令被计算机的处理器执行时,使计算机执行权利要求1至7中任一项所述的产品资源的数据处理方法。A computer-readable storage medium on which computer-readable instructions are stored. When the computer-readable instructions are executed by a processor of a computer, the computer is caused to execute the product resources described in any one of claims 1 to 7. Data processing methods.
  11. 一种计算机程序产品,其包括承载在计算机可读介质上的计算机程序,所述计算机程序被计算机的处理器执行时,使计算机执行权利要求1至7中任一项所述的产品资源的数据处理方法。A computer program product, which includes a computer program carried on a computer-readable medium. When the computer program is executed by a processor of a computer, it causes the computer to execute the data of the product resource described in any one of claims 1 to 7. Approach.
PCT/CN2022/140756 2022-04-02 2022-12-21 Product resource data processing method and apparatus, electronic device and storage medium WO2023185125A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210353762.4A CN114897607A (en) 2022-04-02 2022-04-02 Data processing method and device for product resources, electronic equipment and storage medium
CN202210353762.4 2022-04-02

Publications (1)

Publication Number Publication Date
WO2023185125A1 true WO2023185125A1 (en) 2023-10-05

Family

ID=82715612

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/140756 WO2023185125A1 (en) 2022-04-02 2022-12-21 Product resource data processing method and apparatus, electronic device and storage medium

Country Status (2)

Country Link
CN (1) CN114897607A (en)
WO (1) WO2023185125A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114897607A (en) * 2022-04-02 2022-08-12 富途网络科技(深圳)有限公司 Data processing method and device for product resources, electronic equipment and storage medium

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111178498A (en) * 2019-12-09 2020-05-19 北京邮电大学 Stock fluctuation prediction method and device
CN111476418A (en) * 2020-04-08 2020-07-31 智谭(浙江)信息科技有限公司 Stock data fluctuation prediction method based on deep neural network and multitask learning
CN111489259A (en) * 2020-04-16 2020-08-04 江苏知诺智能科技有限公司 Stock market risk prediction intelligent implementation method based on deep learning
CN111563821A (en) * 2020-05-12 2020-08-21 电子科技大学 Financial stock fluctuation prediction method based on quantitative investment of support vector machine
CN112365075A (en) * 2020-11-19 2021-02-12 中国科学院深圳先进技术研究院 Stock price trend prediction method, system, terminal and storage medium
CN113095484A (en) * 2021-04-07 2021-07-09 华南师范大学 Stock price prediction method based on LSTM neural network
CN113919944A (en) * 2021-09-07 2022-01-11 暨南大学 Stock trading method and system based on reinforcement learning algorithm and time series model
CN114091789A (en) * 2021-12-15 2022-02-25 中国平安人寿保险股份有限公司 Project data fluctuation prediction method and device, computer equipment and storage medium
CN114897607A (en) * 2022-04-02 2022-08-12 富途网络科技(深圳)有限公司 Data processing method and device for product resources, electronic equipment and storage medium

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111178498A (en) * 2019-12-09 2020-05-19 北京邮电大学 Stock fluctuation prediction method and device
CN111476418A (en) * 2020-04-08 2020-07-31 智谭(浙江)信息科技有限公司 Stock data fluctuation prediction method based on deep neural network and multitask learning
CN111489259A (en) * 2020-04-16 2020-08-04 江苏知诺智能科技有限公司 Stock market risk prediction intelligent implementation method based on deep learning
CN111563821A (en) * 2020-05-12 2020-08-21 电子科技大学 Financial stock fluctuation prediction method based on quantitative investment of support vector machine
CN112365075A (en) * 2020-11-19 2021-02-12 中国科学院深圳先进技术研究院 Stock price trend prediction method, system, terminal and storage medium
CN113095484A (en) * 2021-04-07 2021-07-09 华南师范大学 Stock price prediction method based on LSTM neural network
CN113919944A (en) * 2021-09-07 2022-01-11 暨南大学 Stock trading method and system based on reinforcement learning algorithm and time series model
CN114091789A (en) * 2021-12-15 2022-02-25 中国平安人寿保险股份有限公司 Project data fluctuation prediction method and device, computer equipment and storage medium
CN114897607A (en) * 2022-04-02 2022-08-12 富途网络科技(深圳)有限公司 Data processing method and device for product resources, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN114897607A (en) 2022-08-12

Similar Documents

Publication Publication Date Title
WO2020224405A1 (en) Image processing method and apparatus, computer-readable medium and electronic device
US11270375B1 (en) Method and system for aggregating personal financial data to predict consumer financial health
WO2020088058A1 (en) Information generating method and device
CN116127020A (en) Method for training generated large language model and searching method based on model
US11775504B2 (en) Computer estimations based on statistical tree structures
CN110909222A (en) User portrait establishing method, device, medium and electronic equipment based on clustering
CN111427971A (en) Business modeling method, device, system and medium for computer system
CN109034199B (en) Data processing method and device, storage medium and electronic equipment
CN113408668A (en) Decision tree construction method and device based on federated learning system and electronic equipment
WO2023185125A1 (en) Product resource data processing method and apparatus, electronic device and storage medium
WO2024051707A1 (en) Recommendation model training method and apparatus, and resource recommendation method and apparatus
US11645563B2 (en) Data filtering with fuzzy attribute association
WO2021093320A1 (en) Method and apparatus for outputting information
US11593740B1 (en) Computing system for automated evaluation of process workflows
WO2022271369A1 (en) Training of an object linking model
US20220067623A1 (en) Evaluate demand and project go-to-market resources
CN112446777B (en) Credit evaluation method, device, equipment and storage medium
CN114139059A (en) Resource recommendation model training method, resource recommendation method and device
CN112307334A (en) Information recommendation method, information recommendation device, storage medium and electronic equipment
CN113342998B (en) Multimedia resource recommendation method and device, electronic equipment and storage medium
CN114969543B (en) Popularization method, popularization system, electronic equipment and storage medium
KR102449831B1 (en) Electronic device for providing information regarding new text, server for identifying new text and operation method thereof
CN114547448B (en) Data processing method, model training method, device, equipment, storage medium and program
CN117788166A (en) Investment type analysis method, apparatus, device, medium, and program product
CN117033757A (en) Information selection method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22934940

Country of ref document: EP

Kind code of ref document: A1