WO2019001120A1 - 商品动态定价数据处理方法和系统 - Google Patents
商品动态定价数据处理方法和系统 Download PDFInfo
- Publication number
- WO2019001120A1 WO2019001120A1 PCT/CN2018/084382 CN2018084382W WO2019001120A1 WO 2019001120 A1 WO2019001120 A1 WO 2019001120A1 CN 2018084382 W CN2018084382 W CN 2018084382W WO 2019001120 A1 WO2019001120 A1 WO 2019001120A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- price
- expected
- commodity
- sales
- inventory
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/02—Marketing; Price estimation or determination; Fundraising
- G06Q30/0283—Price estimation or determination
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
Definitions
- the present disclosure relates to the field of data processing, and in particular, to a method and system for processing commodity dynamic pricing data.
- an algorithm can be used to fit the price elasticity parameter of each commodity, and accordingly, the sales volume of a certain commodity under a given price condition can be predicted, and then, the existing inventory level and the expectation are combined.
- the length of the clearance automatically recommends a most reasonable pricing, but the relevant algorithm cannot dynamically adjust the suggested price.
- the price elasticity parameter After the price elasticity parameter is determined, it will not change during the entire slow-moving clearance period, and thus lacks the ability to self-update according to the latest data. Since the volume-price relationship of goods during the normal sales period and the slow-moving clearance period is likely to be different, and the relevant algorithm uses the price elasticity parameter during the normal sales period, this will have a certain impact on the pricing accuracy.
- the existing static model can not get feedback and make corresponding adjustments, so even if the volume-price relationship of the commodity does not change much during the slow-moving clearance period.
- the use of static volume-price relationships also affects the accuracy of pricing.
- the algorithm does not support human intervention in the proposed price. After the algorithm gives the suggested price, for various reasons, the user may adjust the price, which causes the relevant algorithm to deviate from the evaluation of the next clearance effect. At the same time, the related algorithm cannot adjust the future pricing strategy according to the current price.
- One technical problem to be solved by the present disclosure is to provide a method and system for processing dynamic pricing data of commodities, and to maximize the profit of commodities under the premise of clearing inventory.
- a commodity dynamic pricing data processing method including: acquiring historical sales data of an item; and determining, according to historical sales data, an item from a current inventory status to an expected inventory in a case of executing a predetermined price policy The state transition probability of the quantity state; using the expected target as the reward function, constructing the Markov decision model based on the state transition probability, the expected cumulative return and the reward function, and determining the expected cumulative return expression based on the Markov decision model, with the predetermined price as the variable
- the iterative calculation is performed by using an enhanced learning algorithm to determine an optimal price of the commodity; wherein the expected target is an expected return of the commodity from the current inventory state to the expected inventory state in the case of executing the predetermined price strategy.
- determining an average sales value of the commodity determining an average sales value of the commodity; determining a state transition probability of the commodity from the current inventory state to the expected inventory state in the case of executing the predetermined price policy based on the current inventory amount, the expected inventory amount, and the sales average value.
- determining a first price interval according to a predetermined price determining whether the quantity of historical sales data in the first price interval is greater than or equal to a threshold; and if the quantity of historical sales data in the first price interval is greater than or equal to a threshold, according to the first
- the historical sales data of the price range determines the average sales value of the commodity; if the quantity of the historical sales data in the first price interval is less than the threshold, the first price interval is gradually expanded to the second price interval, so that the second price interval
- the quantity of historical sales data is greater than or equal to the threshold value, and the price elasticity model is used to determine the price elasticity parameter of the commodity in the second price range, and the average value of the sales volume of the commodity is determined according to the price elasticity parameter.
- the sales probability density function is determined based on the sales average and the sales standard deviation, and the difference between the current inventory quantity and the expected inventory quantity minus the predetermined value is a lower limit, The difference between the current inventory quantity and the expected inventory quantity is an upper limit, and the sales probability density function is integrated to calculate a state transition probability of the commodity from the current inventory quantity state to the expected inventory quantity state in the case of executing the predetermined price strategy;
- the sales standard deviation shown is determined by the standard deviation formula based on the sales average.
- S is the current inventory quantity
- S' is the expected inventory quantity
- L is the remaining clearance time
- ⁇ is the possible price of the commodity
- P i is the optimal price of the commodity i day
- P min is the minimum price of the commodity
- P max For the maximum price of the commodity
- T(S, ⁇ , S') is the state transition probability from the current inventory state S to the expected inventory state S' in the case where the price ⁇ is executed
- T(S, P i , S') The state transition probability from the current stock quantity state S to the expected stock quantity state S' in the case of performing the optimal price P i
- F(S, i) is the cumulative profit when the remaining clearance time is i
- the current stock quantity is S
- F(S', i-1) is the expected cumulative return when the remaining clearance period is i-1
- the expected maximum benefit of the quantity state S to the expected inventory quantity state S', R(S, P i , S') is the expected from the current inventory quantity state S to the expected inventory quantity state S' in the case of performing the optimal price P i
- the maximum return, F(S, 0) and F(0, i) are initial values.
- a commodity dynamic pricing data processing system comprising: a data obtaining unit, configured to acquire historical sales data of the commodity; and a probability determining unit, configured to determine, according to the historical sales data, that the commodity is executed The state transition probability from the current inventory status to the expected inventory status in the case of a predetermined price strategy; an optimal price determination unit for constructing a Marco based on the state transition probability, the expected cumulative return, and the reward function with the expected target as a reward function
- the decision-making model is based on the Markov decision model to determine the expected cumulative return expression, with the predetermined price as the variable, and the iterative calculation using the enhanced learning algorithm to determine the optimal price of the commodity; wherein the expected target is the commodity in the execution of the predetermined price strategy.
- the probability determining unit is configured to determine a sales average value of the commodity; and determine a state transition probability from the current inventory state to the expected inventory state in the case of executing the predetermined price policy based on the current inventory amount, the expected inventory amount, and the sales average value .
- the probability determining unit is further configured to determine a first price interval according to the predetermined price; determine whether the quantity of historical sales data in the first price interval is greater than or equal to the threshold; if the quantity of historical sales data in the first price interval is greater than or equal to The threshold value is used to determine the average sales value of the commodity according to the historical sales data of the first price range; if the quantity of the historical sales data in the first price interval is less than the threshold, the first price interval is gradually expanded to the second price interval to The quantity of historical sales data of the second price interval is greater than or equal to the threshold value, and the price elasticity model is used to determine the price elasticity parameter of the commodity in the second price interval, and the average value of the sales volume of the commodity is determined according to the price elasticity parameter.
- the probability determining unit is further configured to determine, by using a maximum likelihood estimation algorithm, if the sales volume of the commodity obeys the positive distribution, determine a sales probability density function based on the sales average and the sales standard deviation, and subtract the difference between the current inventory quantity and the expected inventory quantity.
- the predetermined value is the lower limit
- the difference between the current inventory quantity and the expected inventory quantity is the upper limit
- the sales probability density function is integrated to determine the state transition of the commodity from the current inventory state to the expected inventory state in the case of executing the predetermined price strategy. Probability; wherein the indicated sales standard deviation is determined based on the sales mean using the standard deviation formula.
- the probability determining unit is further configured to use the maximum likelihood estimation algorithm to determine the sales volume of the commodity to obey the Poisson distribution, and then use the formula A state transition probability from a current inventory status to an expected inventory status in the case of executing a predetermined price policy; wherein S is the current inventory quantity, S' is the expected inventory quantity, and ⁇ is the sales volume average.
- the expected maximum return of the inventory state S to the expected inventory state S', R(S, P i , S') is the current inventory quantity state S to the expected inventory quantity state S' in the case where the optimal price P i is executed
- Expected maximum benefit, F(S, 0) and F(0, i) are initial values.
- a commodity dynamic pricing data processing system comprising: a memory; and a processor coupled to the memory, the processor being configured to perform the method as described above based on the instructions stored in the memory.
- a computer readable storage medium having stored thereon computer program instructions that, when executed by a processor, implement the steps of the method described above.
- the embodiments of the present disclosure use the historical sales data to perform data processing based on the Markov decision model and the enhanced learning algorithm, and obtain an optimal pricing strategy for the commodity, which can maximize the profit of the commodity.
- FIG. 1 is a schematic flow chart of some embodiments of a method for processing dynamic pricing data of a commodity according to the present disclosure.
- FIG. 2 is a schematic flow chart of still another embodiment of a method for processing dynamic pricing data of a commodity according to the present disclosure.
- FIG. 3 is a schematic structural diagram of some embodiments of a commodity dynamic pricing data processing system according to the present disclosure.
- FIG. 4 is an application architecture diagram of some embodiments of a commodity dynamic pricing data processing system of the present disclosure.
- FIG. 5 is a schematic structural diagram of another embodiment of a commodity dynamic pricing data processing system according to the present disclosure.
- FIG. 6 is a schematic structural diagram of still another embodiment of a commodity dynamic pricing data processing system according to the present disclosure.
- FIG. 1 is a schematic flow chart of some embodiments of a method for processing dynamic pricing data of a commodity according to the present disclosure. The method includes the following steps 110-140.
- historical sales data for the item is obtained. For example, a sales record in which the commodity transaction price is between ⁇ -0.5 and ⁇ +0.5 in a predetermined period of time may be extracted, wherein ⁇ is a price at which the merchandise is sold, and the sales data includes sales data, a selling price, and the like.
- a state transition probability from the current inventory status to the expected inventory status is determined for the item in the event that the predetermined price strategy is executed.
- the Markov decision model is constructed based on the state transition probability, the expected cumulative return, and the reward function with the expected goal as a reward function.
- the expected target is the expected return of the commodity from the current inventory status to the expected inventory status in the event that the predetermined price strategy is executed, wherein the expected return may be the expected sales or the expected maximum gross profit.
- the Markov Decision Process is a mathematical model used to deal with the optimal scheduling problem of partially random, partially decision maker control systems.
- the Markov decision process can be expressed as a tuple as follows:
- Sta is a set of states
- Act is a set containing all decisions
- T assigns a suitable probability to all state transition rules of the form (S, ⁇ , S'), that is, given a state S, a decision ⁇ And a subsequent state S', T(S, ⁇ , S') represents the probability of moving from S to S' in the case of executing the decision ⁇
- R is a reward function, that is, when a certain rule is executed, the system can obtain Reward.
- the status is all possible stocks for a commodity, and the decision is all possible pricing for the commodity.
- the setting of the reward function R depends on the given goal.
- the reward corresponding to the rule (S, ⁇ , S') is (S-S') ⁇ ⁇ , where (S-S' ) indicates the change in stock quantity, that is, the quantity of the item sold on the day, and ⁇ is the price of the item on the day. If the goal is set to maximize gross profit, the corresponding reward should be (S-S') x ( ⁇ -P cost ), where P cost is the cost price of the item.
- the main task of the Markov decision model is to determine the state transition probability T(S, ⁇ , S'), that is, the probability corresponding to each rule (S, ⁇ , S'), that is, in the case where the price is ⁇ .
- an expected cumulative return expression is determined based on the Markov decision model, with the predetermined price as a variable, and an iterative calculation using the enhanced learning algorithm to dynamically determine the optimal price of the commodity.
- Enhanced learning also known as reinforcement learning, is a machine learning field that emphasizes how to dynamically adjust strategies based on changes in the external environment to maximize expected benefits. For example, taking the price ⁇ as a variable, thinking that the remaining clearance time is i, and the cumulative return F(S, i) when the inventory is S is an iterative calculation for the objective function, so that F(S, i) takes the maximum value, when F( S, i) The value of ⁇ when the maximum value is obtained is the optimal price of the commodity.
- the optimal price of the commodity may be calculated using the formula, which may be referred to as the Bellman (shortest path algorithm) equation.
- S is the current inventory quantity
- S' is the expected inventory quantity
- L is the remaining clearance time
- ⁇ is the possible price of the commodity
- P i is the optimal price of the commodity i day
- P min is the minimum price of the commodity
- P max For the maximum price of the commodity
- T(S, ⁇ , S') is the state transition probability from the current inventory state S to the expected inventory state S' in the case where the price ⁇ is executed
- T(S, P i , S') The state transition probability from the current stock quantity state S to the expected stock quantity state S' in the case where the optimal price P i is executed
- F(S, i) is the cumulative profit when the remaining clearance time is i and the stock quantity is S
- F(S', i-1) is the expected cumulative return when the remaining clearance period is i-1 and the stock quantity is S'
- R(S, ⁇ , S') is the current stock status when the execution price is ⁇ .
- R(S, P i , S') is the expected maximum return from the current stock quantity state S to the expected stock quantity state S' in the case of performing the optimal price P i , F(S, 0) and F(0, i) are initial values.
- historical sales data is used for data processing to obtain an optimal pricing strategy for the commodity, and when the product is slow-moving, it can be maximized under the premise of clearing the inventory. The proceeds of the goods.
- FIG. 2 is a schematic flow chart of still another embodiment of a method for processing dynamic pricing data of a commodity according to the present disclosure.
- the present disclosure can achieve good results in the sales of slow-moving products. The following description will be made by taking a slow-moving product as an example, and the method includes the following steps 210-290.
- step 210 historical sales data for the slow-moving item is obtained.
- a first price interval is determined based on the predetermined price. For example, if the predetermined price is ⁇ , it may be determined that the first price interval is ⁇ -0.5 to ⁇ +0.5. Those skilled in the art will appreciate that different price ranges can be set depending on the value of different slow-moving items.
- step 230 it is determined whether the number of historical sales data in the first price interval is greater than or equal to the threshold. If the threshold is greater than the threshold, step 240 is performed; otherwise, step 250 is performed. For example, if the number of sales records exceeds 30 days, the data is considered available, otherwise, the data is considered too small.
- the average sales volume of the unsalable item is determined based on the historical sales data for each day of the first price range.
- the maximum likelihood estimation algorithm can be used to determine that the sales volume of the slow-moving products obeys the positive distribution. If the historical sales volume is expressed as (Q N , Q N-1 , ..., Q 1 ), where N is the number of days of sales.
- the mean ⁇ of the fitted positive distribution is as follows:
- the first price interval is gradually expanded to a second price interval such that the amount of historical sales data for the second price interval is greater than or equal to the threshold. For example, when the number of sales records in a given price range is less than 30 and a reliable distribution cannot be fitted, the price range is gradually increased until there is a sufficient number of historical records.
- the price elasticity model is used to determine the price elasticity parameter of the slow-moving item in the second price range.
- the volume model is an economic model used to measure the sensitivity of slow-moving products to price changes.
- the volume and price model is as follows:
- the average sales volume of the unsalable item is determined based on the price elasticity parameter.
- the sales standard deviation ⁇ is calculated using the standard deviation formula based on the sales average.
- a state transition probability from the current inventory status to the expected inventory status in the case where the predetermined price strategy is executed is determined.
- the sales probability density function is determined based on the sales average and the sales standard deviation, and the difference between the current inventory quantity and the expected inventory quantity minus the predetermined value is the lower limit, and the difference between the current inventory quantity and the expected inventory quantity is the upper limit, and the sales probability density function is performed.
- the integral calculation determines a state transition probability of the slow-moving product from the current inventory state to the expected inventory state in the case of executing the predetermined price policy; wherein, if the predetermined value is, for example, 1, the specific formula may be Among them, S is the current inventory, S' is the expected inventory, ⁇ is the sales average, and ⁇ is the sales standard deviation.
- the historical sales volume of other similar commodities under the same category may be referred to, and the sales data of the current unsalable products may be filled to obtain sufficient sales data. Thereafter, the rule probabilities are fitted using steps 240-260.
- the Markov decision model is constructed based on the state transition probability, the expected cumulative return, and the reward function with the expected goal as a reward function.
- the expected cumulative return expression is determined based on the Markov decision model, and the predetermined price is used as a variable, and the enhanced learning algorithm is used for iterative calculation to dynamically determine the optimal price of the unsalable product.
- the fitting MDP mainly reflects the dynamic conversion relationship between the inventory, price and sales volume of a given slow-moving product.
- an optimal pricing strategy can be automatically calculated to maximize a given goal.
- the optimal pricing strategy and expected goals of a certain slow-moving product under any given inventory level, price and clearance period can be determined, and the subjectivity and inaccuracy brought about by human decision-making can be avoided. Save a lot of inventory costs and avoid unnecessary losses.
- the embodiment uses the classical price model to backfill the missing information, which improves the accuracy of the model fitting and solves the problem of unsalable pricing under the condition of lack of information.
- the dynamic adjustment optimal strategy when the price is set artificially, when the system monitors the deviation, the dynamic adjustment optimal strategy can be supported, and the expected sales volume can be more accurately predicted, and one of the most recommended A good adjustment strategy to achieve the goal of supporting both manual intervention and dynamic adjustment.
- step 260 using the maximum likelihood estimation algorithm to determine the sales volume of the unsalable item obeying the Naposon distribution; wherein S is the current inventory quantity, S' is the expected inventory quantity, and ⁇ is the sales average value.
- the sales volume of the unsalable goods can be fitted to the distribution of the positive distribution, or the distribution of the Poisson can be made, or it can be fitted to other distributions, and the corresponding formula can be used to calculate the unsalable goods at the predetermined price.
- the state transition probability from the current inventory status to the expected inventory status may be sufficient.
- FIG. 3 is a schematic structural diagram of some embodiments of a commodity dynamic pricing data processing system according to the present disclosure.
- the system includes a data acquisition unit 310, a probability determination unit 320, and an optimal price determination unit 330, wherein:
- the data acquisition unit 310 is configured to acquire historical sales data of the commodity. For example, a sales record in which the commodity transaction price is between ⁇ -0.5 and ⁇ +0.5 within a predetermined time period may be extracted, where ⁇ is the price of the sale.
- the probability determination unit 320 is configured to determine, based on the historical sales data, a state transition probability from the current inventory state to the expected inventory state in the case of executing the predetermined price policy, ie, determining an input of the Markov decision model.
- the optimal price determining unit 330 is configured to construct a Markov decision model according to the state transition probability, the expected cumulative return, and the reward function with the expected target as a reward function, and determine an expected cumulative return expression based on the Markov decision model, at a predetermined price Variables, using the enhanced learning algorithm for iterative calculations, dynamically determine the optimal price of the commodity.
- the expected target is the expected return from the current inventory status to the expected inventory status in the case of executing the predetermined price strategy, wherein the expected return may be the expected sales or the expected maximum gross profit.
- Enhanced learning also known as reinforcement learning, is a machine learning field that emphasizes how to dynamically adjust strategies based on changes in the external environment to maximize expected benefits.
- the optimal price of the commodity may be calculated using the formula, which may be referred to as the Bellman equation.
- S is the current inventory quantity
- S' is the expected inventory quantity
- L is the remaining clearance time
- ⁇ is the possible price of the commodity
- P i is the optimal price of the commodity i day
- P min is the minimum price of the commodity
- P max For the maximum price of the commodity
- T(S, ⁇ , S') is the state transition probability from the current inventory state S to the expected inventory state S' in the case where the price ⁇ is executed
- T(S, P i , S') The state transition probability from the current stock quantity state S to the expected stock quantity state S' in the case where the optimal price P i is executed
- F(S, i) is the cumulative profit when the remaining clearance time is i and the stock quantity is S
- F(S', i-1) is the expected cumulative return when the remaining clearance period is i-1 and the stock quantity is S'
- R(S, ⁇ , S') is the current stock status when the execution price is ⁇ .
- R(S, P i , S') is the expected maximum return from the current stock quantity state S to the expected stock quantity state S' in the case of performing the optimal price P i , F(S, 0) and F(0, i) are initial values.
- the historical sales data is used for data processing, and the optimal pricing strategy of the commodity is obtained, and the profit of the commodity is maximized under the premise of clearing the inventory.
- the probability determining unit 320 is further configured to determine, according to the predetermined price, the first price interval, determine whether the quantity of historical sales data in the first price interval is greater than or equal to the threshold; if in the first price interval If the number of historical sales data is greater than or equal to the threshold, the average sales value of the commodity is determined based on the historical sales data of the daily price of the first price interval. For example, if the predetermined price is ⁇ , it may be determined that the first price interval is ⁇ -0.5 to ⁇ +0.5. Those skilled in the art will appreciate that different price ranges can be set depending on the value of the different commodities.
- the maximum likelihood estimation algorithm can be used to determine that the sales volume of the commodity obeys the positive distribution. If the historical sales volume is expressed as (Q N , Q N-1 , ..., Q 1 ), where N is the number of days of sales.
- the mean ⁇ of the fitted positive distribution is as follows:
- the probability determination unit 320 calculates the sales standard deviation ⁇ using the standard deviation formula based on the sales average value. E.g And based on the current inventory amount, the expected inventory amount, the sales average value, and the sales standard deviation, the state transition probability from the current inventory status to the expected inventory status in the case of executing the predetermined price policy is determined.
- the optimal pricing strategy and the expected goal of a certain commodity under any given inventory level, price and clearance period can be determined, and the subjectivity and inaccuracy brought about by human decision-making can be avoided.
- the company saves a lot of inventory costs and avoids unnecessary losses.
- the technology supports manual adjustment of the results, and can quickly calculate the new optimal strategy based on the adjustment, greatly enhancing the flexibility.
- the probability determining unit 320 is further configured to utilize the formula if the sales volume of the commodity is determined to obey the Poisson distribution using the maximum likelihood estimation algorithm.
- the sales volume of the goods can be fitted to obey the distribution of positive and too, or the distribution of the Poisson can be made, or it can be fitted to other distributions, and the corresponding formula can be used to calculate the execution of the predetermined price strategy.
- the state transition probability from the current inventory status to the expected inventory status is sufficient.
- FIG. 4 is an application architecture diagram of some embodiments of a commodity dynamic pricing data processing system of the present disclosure.
- An application platform 410, a machine learning platform 420, and a big data platform 430 may be included in the architectural diagram.
- the present disclosure can achieve good results in the field of sales to slow-moving products.
- the application platform 410 can implement functions such as information entry, model calling, result output, real-time monitoring, and manual adjustment.
- the main input of information input includes the inventory quantity of the unsalable goods, the lowest price allowed, the current selling price of the goods, the length of the unsalable clearance, and the target (maximizing sales or maximizing gross profit). If the goal is to maximize the gross profit, the input also includes the cost price.
- the model call refers to calling the model constructed by the machine learning platform 420 and the corresponding algorithm to derive the suggested price and predicted sales volume of a certain commodity.
- the result of the output means that the output result is manually confirmed (optional), the commodity information is synchronized to the downstream slow-moving system, and the commodity price is modified to the suggested price.
- Real-time monitoring and manual adjustment means that during the slow-moving clearance period, the system will capture the sales volume and inventory data of the goods in real time, and adjust the proposed optimal strategy and pricing to achieve the goal of slow-moving clearance, to a greater extent, in the presence of manual intervention.
- the strategy of the model will be adjusted accordingly to calculate the optimal strategy for a given input.
- the machine learning platform 420 is capable of data synchronization, calling volume modules, performing Markov decision model construction, and enhancing learning.
- data synchronization means that after inputting the required information, the system will automatically synchronize the historical information of the commodity from the big data platform 430, mainly including sales volume and transaction price.
- the volume price model, the Markov decision model and the machine learning specific implementation have been described in detail in the above embodiments, and will not be further elaborated here.
- the big data platform 430 includes a product database, sales data, and an inventory database.
- the slow-moving effect can be monitored in real time, and the price can be adjusted in time according to actual sales volume and real-time inventory, thereby improving the effect of slow sales clearance and increasing sales or gross profit.
- the system also supports human intervention on the proposed price, which is to make a more accurate forecast of the expected sales volume when artificially setting a price, and propose an optimal adjustment strategy accordingly.
- FIG. 5 is a schematic structural diagram of another embodiment of a commodity dynamic pricing data processing system according to the present disclosure.
- the system includes a memory 510 and a processor 520, wherein:
- Memory 510 can be a magnetic disk, flash memory, or any other non-volatile storage medium.
- the memory is used to store the instructions in the embodiment corresponding to Figures 1-2.
- the processor 520 is coupled to the memory 510 and can be implemented as one or more integrated circuits, such as a microprocessor or a microcontroller.
- the processor 520 is configured to execute instructions stored in the memory, and can perform historical data processing using the historical sales data based on the Markov decision model and the enhanced learning algorithm to obtain an optimal pricing strategy for the commodity, and under the premise of clearing the inventory, the maximum is Increase the profitability of goods.
- the system 600 can include a memory 610 and a processor 620.
- Processor 620 is coupled to memory 610 via BUS bus 630.
- the system 600 can also be coupled to the external storage device 650 via the storage interface 640 for invoking external data, and can also be connected to the network or another computer system (not shown) via the network interface 660, and will not be described in detail herein.
- the data processing can be performed by using the historical sales data based on the Markov decision model and the enhanced learning algorithm to obtain the optimal pricing strategy of the commodity, and the inventory is cleared. Under the premise, maximize the profit of the goods.
- a computer readable storage medium having stored thereon computer program instructions that, when executed by a processor, implement the steps of the method of the embodiment of FIGS. 1-2.
- a processor implements the steps of the method of the embodiment of FIGS. 1-2.
- embodiments of the present disclosure may be provided as a method, apparatus, or computer program product. Accordingly, the present disclosure may take the form of an entirely hardware embodiment, an entirely software embodiment, or a combination of software and hardware aspects. Moreover, the present disclosure may take the form of a computer program product embodied on one or more computer-usable non-transitory storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) containing computer usable program code. .
- the computer program instructions can also be stored in a computer readable memory that can direct a computer or other programmable data processing device to operate in a particular manner, such that the instructions stored in the computer readable memory produce an article of manufacture comprising the instruction device.
- the apparatus implements the functions specified in one or more blocks of a flow or a flow and/or block diagram of the flowchart.
- These computer program instructions can also be loaded onto a computer or other programmable data processing device such that a series of operational steps are performed on a computer or other programmable device to produce computer-implemented processing for execution on a computer or other programmable device.
- the instructions provide steps for implementing the functions specified in one or more of the flow or in a block or blocks of a flow diagram.
- the methods and apparatus of the present disclosure may be implemented in a number of ways.
- the methods and apparatus of the present disclosure may be implemented in software, hardware, firmware or any combination of software, hardware, firmware.
- the above-described sequence of steps for the method is for illustrative purposes only, and the steps of the method of the present disclosure are not limited to the order specifically described above unless otherwise specifically stated.
- the present disclosure may also be embodied as programs recorded in a recording medium, the programs including machine readable instructions for implementing a method in accordance with the present disclosure.
- the present disclosure also covers a recording medium storing a program for executing the method according to the present disclosure.
Abstract
Description
Claims (14)
- 一种商品动态定价数据处理方法,包括:获取商品的历史销售数据;根据所述历史销售数据,确定所述商品在执行预定价格策略的情况下从当前库存量状态到预期库存量状态的状态转移概率;以预期目标为奖励函数,根据状态转移概率、预期累计收益和奖励函数构建马尔科夫决策模型;基于马尔科夫决策模型确定预期累计收益表达式,以预定价格为变量,利用增强学习算法进行迭代计算,确定所述商品的最优价格;其中,所述预期目标为所述商品在执行预定价格策略的情况下从当前库存量状态到预期库存量状态的预期收益。
- 根据权利要求1所述的方法,其中,确定所述商品的销量均值;基于当前库存量、预期库存量和销量均值,确定所述商品在执行预定价格策略的情况下从当前库存量状态到预期库存量状态的状态转移概率。
- 根据权利要求2所述的方法,其中,根据预定价格确定第一价格区间;判断在所述第一价格区间的历史销售数据的数量是否大于等于阈值;若在所述第一价格区间的历史销售数据的数量大于等于阈值,则根据所述第一价格区间的每天的历史销量数据确定所述商品的销量均值;若在所述第一价格区间的历史销售数据的数量小于阈值,则将所述第一价格区间逐渐扩大到第二价格区间,以使所述第二价格区间的历史销售数据的数量大于等于阈值,利用量价模型确定所述商品在所述第二价格区间的价格弹性参数,根据价格弹性参数确定所述商品的销量均值。
- 根据权利要求2或3所述的方法,其中,利用最大似然估计算法若确定所述商品的销量服从正太分布,则基于销量均值和 销量标准差确定销量概率密度函数,以当前库存量与预期库存量之差减预定值为下限,以当前库存量与预期库存量之差为上限,对所述销量概率密度函数进行积分计算,确定所述商品在执行预定价格策略的情况下从当前库存量状态到预期库存量状态的状态转移概率;其中,所示销量标准差根据销量均值利用标准差公式确定。
- 根据权利要求1所述的方法,其中,根据以下公式确定商品的最优价格:F(S,0)=0;F(0,i)=0,1≤i≤L;其中,S为当前库存量,S'为预期库存量,L为剩余清仓时长,α为商品的可能价格,P i为商品第i天的最优价格,P min为商品的最小价格,P max为商品的最大价格,T(S,α,S')为执行价格α的情况下从当前库存量状态S到预期库存量状态S'的状态转移概率,T(S,P i,S')为执行最优价格P i的情况下从当前库存量状态S到预期库存量状态S'的状态转移概率,F(S,i)为剩余清仓时长为i,当前库存量为S时的累计收益,F(S',i-1)为剩余清仓时长为i-1,预期库存量为S'时的预期累计收益,R(S,α,S')为执行价格α的情况下从当前库存量状态S到预期库存量状态S'的预期最大收益,R(S,P i,S')为执行最优价格P i的情况下从当前库存量状态S到预期库存量状态S'的预期最大收益,F(S,0)和F(0,i)为初始值。
- 一种商品动态定价数据处理系统,包括:数据获取单元,用于获取商品的历史销售数据;概率确定单元,用于根据所述历史销售数据,确定所述商品在执行预定价格策略的情况下从当前库存量状态到预期库存量状态的状态转移概率;最优价格确定单元,用于以预期目标为奖励函数,根据状态转移概率、预期累计收益和奖励函数构建马尔科夫决策模型,基于马尔科夫决策模型确定预期累计收益表达式,以预定价格为变量,利用增强学习算法进行迭代计算,确定所述商品的最优价格;其中,所述预期目标为所述商品在执行预定价格策略的情况下从当前库存量状态到预期库存量状态的预期收益。
- 根据权利要求7所述的系统,其中,所述概率确定单元用于确定所述商品的销量均值;基于当前库存量、预期库存量和销量均值,确定所述商品在执行预定价格策略的情况下从当前库存量状态到预期库存量状态的状态转移概率。
- 根据权利要求8所述的系统,其中,所述概率确定单元还用于根据预定价格确定第一价格区间;判断在所述第一价格区间的历史销售数据的数量是否大于等于阈值;若在所述第一价格区间的历史销售数据的数量大于等于阈值,则根据所述第一价格区间的每天的历史销量数据确定所述商品的销量均值;若在所述第一价格区间的历史销售数据的数量小于阈值,则将所述第一价格区间逐渐扩大到第二价格区间,以使所述第二价格区间的历史销售数据的数量大于等于阈值,利用量价模型确定所述商品在所述第二价格区间的价格弹性参数,根据价格弹性参数确定所述商品的销量均值。
- 根据权利要求8或9所述的系统,其中,所述概率确定单元还用于利用最大似然估计算法若确定所述商品的销量服从正太分布,则基于销量均值和销量标准差确定销量概率密度函数,以当前库存量与预期库存量之差减预定值为下限,以当前库存量与预期库存量之差为上限,对所述销量概率密度函数进行积分计算,确定所述商品在执行预定价格策略的情况下从当前库存量状态到预期库存量状态的状态转移概率;其中,所示销量标准差根据销量均值利用标准差公式确定。
- 根据权利要求7所述的系统,其中,所述最优价格确定单元用于根据以下公式确定商品的最优价格;F(S,0)=0;F(0,i)=0,1≤i≤L;其中,S为当前库存量,S'为预期库存量,L为剩余清仓时长,α为商品的可能价格,P i为商品第i天的最优价格,P min为商品的最小价格,P max为商品的最大价格,T(S,α,S')为执行价格α的情况下从当前库存量状态S到预期库存量状态S'的状态转移概率,T(S,P i,S')为执行最优价格P i的情况下从当前库存量状态S到预期库存量状态S'的状态转移概率,F(S,i)为剩余清仓时长为i,当前库存量为S时的累计收益,F(S',i-1)为剩余清仓时长为i-1,预期库存量为S'时的预期累计收益,R(S,α,S')为执行价格α的情况下从当前库存量状态S到预期库存量状态S'的预期最大收益,R(S,P i,S')为执行最优价格P i的情况下从当前库存量状态S到预期库存量状态S'的预期最大收益,F(S,0)和F(0,i)为初始值。
- 一种商品动态定价数据处理系统,包括:存储器;以及耦接至所述存储器的处理器,所述处理器被配置为基于存储在所述存储器的指令执行如权利要求1至6任一项所述的方法。
- 一种计算机可读存储介质,其上存储有计算机程序指令,该指令被处理器执行时实现权利要求1至6任一项所述的方法的步骤。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710510912.7A CN107123004A (zh) | 2017-06-29 | 2017-06-29 | 商品动态定价数据处理方法和系统 |
CN201710510912.7 | 2017-06-29 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2019001120A1 true WO2019001120A1 (zh) | 2019-01-03 |
Family
ID=59718884
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2018/084382 WO2019001120A1 (zh) | 2017-06-29 | 2018-04-25 | 商品动态定价数据处理方法和系统 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN107123004A (zh) |
WO (1) | WO2019001120A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113343577A (zh) * | 2021-06-23 | 2021-09-03 | 平安国际融资租赁有限公司 | 一种参数优化方法、装置、计算机设备及可读存储介质 |
Families Citing this family (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107123004A (zh) * | 2017-06-29 | 2017-09-01 | 北京京东尚科信息技术有限公司 | 商品动态定价数据处理方法和系统 |
CN109978428A (zh) * | 2017-12-28 | 2019-07-05 | 北京京东尚科信息技术有限公司 | 数据估算方法和装置以及计算机可读存储介质 |
CN108428153A (zh) * | 2018-03-12 | 2018-08-21 | 上海哔哩哔哩科技有限公司 | 预售商品的策略调整方法、服务器及策略调整系统 |
CN110516895B (zh) * | 2018-06-20 | 2021-09-03 | 北京京东尚科信息技术有限公司 | 人机仓的选品决策方法和装置、计算机可读存储介质 |
CN110634007A (zh) * | 2018-06-22 | 2019-12-31 | 北京京东尚科信息技术有限公司 | 数据处理方法、装置、系统、计算机可读介质 |
CN108960923A (zh) * | 2018-07-09 | 2018-12-07 | 北京百悟科技有限公司 | 一种定价的方法、装置及计算机存储介质 |
CN110827047A (zh) * | 2018-08-07 | 2020-02-21 | 北京京东尚科信息技术有限公司 | 动态定价方法和装置 |
CN109272405B (zh) * | 2018-09-30 | 2020-09-29 | 大唐碳资产有限公司 | 碳资产交易方法及系统 |
CN109816459A (zh) * | 2019-03-21 | 2019-05-28 | 深圳市丰巢科技有限公司 | 一种动态调价方法、装置、终端及存储介质 |
CN110298689A (zh) * | 2019-05-28 | 2019-10-01 | 合肥工业大学 | 用于更新商品信息的方法、系统及存储介质 |
CN110570245A (zh) * | 2019-09-05 | 2019-12-13 | 上海摩库数据技术有限公司 | 可配置、多维度的实时计价引擎 |
CN111126917A (zh) * | 2019-12-24 | 2020-05-08 | 北京每日优鲜电子商务有限公司 | 单位转移数值调整方法、装置、计算机设备及存储介质 |
CN110930201A (zh) * | 2020-01-02 | 2020-03-27 | 山东浪潮通软信息科技有限公司 | 一种基于动态建模的计价方法及系统 |
CN111311362B (zh) * | 2020-02-12 | 2023-12-01 | 安超云软件有限公司 | 产品属性更新方法以及系统 |
CN111460363A (zh) * | 2020-04-01 | 2020-07-28 | 丰车(上海)信息技术有限公司 | 一种二手车供应链选址与库存水平管理算法 |
CN113762993A (zh) * | 2020-06-03 | 2021-12-07 | 阿里巴巴集团控股有限公司 | 数据处理方法、系统、装置、设备和存储介质 |
CN112036985A (zh) * | 2020-09-04 | 2020-12-04 | 烟台冰兔网络科技有限公司 | 一种电商经营的营销系统 |
CN112232953A (zh) * | 2020-10-14 | 2021-01-15 | 深圳壹账通智能科技有限公司 | 一种债券交易预付款方法、装置、设备及存储介质 |
CN112819379A (zh) * | 2021-02-28 | 2021-05-18 | 广东电力交易中心有限责任公司 | 风险偏好信息的获取方法、系统、电子装置及存储介质 |
CN114662933B (zh) * | 2022-03-24 | 2023-04-07 | 杭州良时智能科技有限公司 | 营销运营风险管控方法及系统 |
CN115358427A (zh) * | 2022-08-31 | 2022-11-18 | 成都九洲电子信息系统股份有限公司 | 一种零部件配置建议采集计算方法 |
CN117236649B (zh) * | 2023-11-10 | 2024-01-26 | 天津麦旺生物技术有限公司 | 一种用于宠物饲料加工原料需求量的调度方法 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140039979A1 (en) * | 2012-08-01 | 2014-02-06 | Opera Solutions, Llc | System and Method for Demand Forecasting |
CN103593770A (zh) * | 2013-10-24 | 2014-02-19 | 清华大学 | 基于马尔科夫模型的亚马逊弹性计算云竞价方法 |
CN105205701A (zh) * | 2015-09-22 | 2015-12-30 | 创点客(北京)科技有限公司 | 一种网络动态定价方法和系统 |
CN107123004A (zh) * | 2017-06-29 | 2017-09-01 | 北京京东尚科信息技术有限公司 | 商品动态定价数据处理方法和系统 |
-
2017
- 2017-06-29 CN CN201710510912.7A patent/CN107123004A/zh active Pending
-
2018
- 2018-04-25 WO PCT/CN2018/084382 patent/WO2019001120A1/zh active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140039979A1 (en) * | 2012-08-01 | 2014-02-06 | Opera Solutions, Llc | System and Method for Demand Forecasting |
CN103593770A (zh) * | 2013-10-24 | 2014-02-19 | 清华大学 | 基于马尔科夫模型的亚马逊弹性计算云竞价方法 |
CN105205701A (zh) * | 2015-09-22 | 2015-12-30 | 创点客(北京)科技有限公司 | 一种网络动态定价方法和系统 |
CN107123004A (zh) * | 2017-06-29 | 2017-09-01 | 北京京东尚科信息技术有限公司 | 商品动态定价数据处理方法和系统 |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113343577A (zh) * | 2021-06-23 | 2021-09-03 | 平安国际融资租赁有限公司 | 一种参数优化方法、装置、计算机设备及可读存储介质 |
CN113343577B (zh) * | 2021-06-23 | 2023-09-26 | 平安国际融资租赁有限公司 | 一种基于机器学习的参数优化方法、装置、设备及介质 |
Also Published As
Publication number | Publication date |
---|---|
CN107123004A (zh) | 2017-09-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2019001120A1 (zh) | 商品动态定价数据处理方法和系统 | |
US11281969B1 (en) | Artificial intelligence system combining state space models and neural networks for time series forecasting | |
Xia et al. | A seasonal discrete grey forecasting model for fashion retailing | |
TWI690865B (zh) | 交易量的預測方法及裝置 | |
JP6514342B2 (ja) | オンライン予測システムおよび方法 | |
US20140310059A1 (en) | System , method and computer program forecasting energy price | |
CN110059896A (zh) | 一种基于强化学习的股票预测方法及系统 | |
US11854022B2 (en) | Proactively predicting transaction dates based on sparse transaction data | |
EP3371764A1 (en) | Systems and methods for pricing optimization with competitive influence effects | |
CN114549046A (zh) | 基于融合模型的销售预测方法、系统、设备及存储介质 | |
CN109544233A (zh) | 一种面向电商业务的时尚品需求预测方法 | |
WO2019214455A1 (zh) | 一种数据序列预测方法及计算设备 | |
CN109767032A (zh) | 一种基于数据分析的企业财务运营数字化管理优化系统 | |
WO2020024718A1 (zh) | 外汇交易量预测方法和装置 | |
Li et al. | A belief-rule-based inventory control method under nonstationary and uncertain demand | |
CN112950262A (zh) | 销量预测方法、装置、计算机设备和存储介质 | |
Saha et al. | Demand forecasting of a multinational retail company using deep learning frameworks | |
CN116188061A (zh) | 商品销量预测方法、装置、电子设备及存储介质 | |
JP5251217B2 (ja) | 販売数予測システム、販売数予測システムの動作方法および販売数予測プログラム | |
CN112308293A (zh) | 违约概率预测方法及装置 | |
CN115759751A (zh) | 一种企业风险预测方法、装置、存储介质、电子设备及产品 | |
CN106537442A (zh) | 收集量调节辅助装置,收集量调节辅助方法和计算机可读记录介质 | |
CN106920000B (zh) | 基于es预测技术的煤炭供应链牛鞭效应抑制方法 | |
JP7438008B2 (ja) | 商品の販売数予測方法、装置及びプログラム、発注量決定方法、装置及びプログラム | |
CN117011063B (zh) | 客户交易风险预测处理方法及装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 18824006 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 18824006 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 21.04.2020) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 18824006 Country of ref document: EP Kind code of ref document: A1 |