Upload
others
View
1
Download
0
Embed Size (px)
Citation preview
Inventory, Periodic Discounts, and the Timing Effect
Hyun-soo Ahn • Mehmet Gumus • Philip Kaminsky
Department of Operations and Management Science, Ross School of Business, Universityof Michigan, Ann Arbor MI 48109
Desautels Faculty of Management, McGill University,Montreal, Quebec H3A 1G5
Department of Industrial Engineering and Operations Research, University of California,Berkeley CA 94720
[email protected] • [email protected] • [email protected]
We consider the joint pricing and inventory control problem for a single product whosedemand distribution in each period is determined both by whether or not a sale price isoffered in the current period, and the number of periods since the last time the sale pricewas offered. We show that optimal inventory ordering policy is a state dependent basestock policy; however, the optimal pricing policy can be quite complicated due to both thevalue and cost of holding inventory and delaying sales. We conduct a computational studyto explore the effect of various cost and demand parameters on optimal pricing, ordering,and markdown decisions, propose a simple threshold type policy suggested by relaxation ofthe model to a make-to-order system, and computationally evaluate the performance of thisheuristic.
1. Introduction and Literature Review
The success of dynamic pricing and revenue management in the air travel, hotel, and car
rental industries has naturally led to the desire to extend these concepts to other industries,
including those with non-perishable products. Indeed, dynamic pricing is not new to these
industries, but in the past it has primarily been utilized in the form of markdowns and pro-
motions to eliminate excess inventory. However, as sophisticated data processing technology
has spread in various retail industries, and as the internet and e-commerce have grown, dy-
namic pricing approaches have become more feasible as tools to help retailers match supply
with demand more effectively and increase operating profit. There is a growing literature
in operations management that explores dynamic pricing models in inventory ordering or
production planning settings, modifying the assumptions, models, and results of traditional
revenue management settings for these sometimes very different environments. In an envi-
ronment in which inventory can be replenished and customers have multiple opportunities
1
to buy the same product, for example, intertemporal demand interaction can significantly
impact effective ordering and pricing strategies.
In this context, intertemporal demand interaction refers to the sensitivity of current
demand not only to current pricing, but also to past pricing decisions. A quick review of the
circulars in the Sunday paper suggests that retailers use price reductions for more than just
eliminating excess inventory of out-of-season products – these retailers are instead attempting
to benefit from intertemporal demand interactions in order to increase profits. Indeed, in
a recent empirical study, Pesendorfer (2002) analyzed pricing data for selected products in
a supermarket chain, where prices were changed weekly. He used the scanner data from
three ketchup products, and observed several intertemporal price effects that drive demand
in each period. The level effect refers to the effect of the previous week’s price on the
current week’s demand. For a given current price, the current demand will be stochastically
higher when the previous week’s price was high than when the previous week’s price was low.
For a product that is periodically put on sale, the timing effect refers to the observation
that the demand at a given sale price is stochastically increasing in the time since the last
sale. Ahn et al. (2007) explore the level effect; in this paper, we explore the timing effect.
Other marketing researchers as well as economists have modeled similar issues; however,
most of this literature has focused on the effects of intertemporal demand interactions on
pricing decisions ignoring inventory considerations. Also, demand is primarily deterministic
in these models. Conlisk et al. (1984) and Sobel (1984, 1991) consider two kinds of customers
who differ in their valuation for a product. With this demand model, they show that firms
engage in cyclic pricing behavior as a traditional skimming strategy in order to distinguish
between high and low valuation customers. Assuncao and Meyer (1993) consider stockpiling
behavior of consumers who are uncertain about future prices, and investigate how frequency
of price reductions affects consumers’ purchase decision. Their analysis suggests that, as
in our model, sales volume during promotional periods is larger than sales volume during
regular periods. Slade (1998,1999) proposes a demand model in which consumers’ goodwill
increases (decreases) as the firm continues charging low (high) prices, and analyzes the
resulting optimal pricing strategy.
However, for a dynamic pricing strategy to be effective in practice, it typically has to be
aligned with inventory ordering decisions, and there are several growing streams of research
focusing on joint pricing and inventory ordering models in the operations management lit-
erature. The first stream focuses on the coordination of pricing and production decisions
2
for perishable or seasonal goods – this is typically referred to as revenue management. For
these products, it is reasonable to assume that no replenishment (production) is allowed
after the initial period, and thus most existing work focuses on analyzing the trajectory of
the optimal pricing policy as a function of remaining inventory and time. Gallego and van
Ryzin (1994) and Bitran and Mondschein (1997) consider a finite horizon dynamic pricing
problem with uncertain demand and show that the optimal price increases in the number of
periods remaining and decreases in inventory level. Subrahmanyan and Shoemaker (1996)
consider a similar problem, but allow the retailer to determine the initial stocking quan-
tity. Review papers by McGill and van Ryzin (1999), Petruzzi and Dada (1999) and Bitran
and Caldentey (2003) provide comprehensive surveys of the area. Although there are a few
recent papers that deal with more sophisticated consumer behavior (e.g., Aviv and Pazgal
(2007), Elmaghraby et al. (2007), Su (2007), and Zhou et al. (2007)), all of them assume
no replenishment during the selling season, and thus do not consider inventory/production
factors.
In the second stream, researchers consider the coordination of pricing and inventory
control with independent demand. In this stream of research, demand is a random variable
that depends only on current price. Under the assumption that unsatisfied demand in
each period is fully backlogged, Federgruen and Heching (1999) and Chen and Simchi-Levi
(2004a, 2004b) consider periodic review models with both finite and infinite horizons and
characterize the form of optimal inventory ordering policies. Polatoglu and Sahin (2000),
Chen et al. (2006) and Song et al. (2007) analyze various lost sales models. For continuous
review models, Feng and Chen (2003) model demand using a Poisson process with price-
sensitive intensity, while Chen et al. (2007) model the demand process as a Brownian
motion with price sensitive drift rate. Researchers have also explored the coordination of
pricing and inventory control with Markovian demand, where demand is a random variable
that depends on both the pricing decision and the state of the world. Gayon et al. (2007)
consider a demand model that is generated by a Markov modulated Poisson process and
obtain structural results for the optimal pricing and inventory ordering policies. Yin and
Rajaram (2007) extend Chen and Simchi-Levi’s model to the Markovian demand case and
characterize the form of the optimal policy. Several review papers provide comprehensive
surveys of research milestones and future opportunities for joint production-pricing problems;
see Eliashberg and Steinberg (1993) for classical models and Elmaghraby and Keskinocak
(2003), Yano and Gilbert (2003), Chan et al. (2004), and the references therein for recent
3
developments in this area. In most of these models, however, the current demand (or demand
intensity) is assumed to be independent of price history.
In contrast to this research, we explicitly model the relationship between current demand
and price history. In this respect, our model is most closely related to Cheng and Sethi
(1999), who consider an (endogenous) Markovian demand model where the current demand
is influenced by a state variable, which in turn is controlled by the pricing decision. However,
in our model, the demand distribution is a function of the state variable and the current
price, whereas in Cheng and Sethi (1999), the demand distribution is solely determined by
the state variable. This distinction allows us to capture the timing effect discussed above.
This paper is also related to Ahn et al. (2007), in which the authors analyze the level effect
of intertemporal demand interaction in a deterministic setting. In this paper, in contrast, we
focus on capturing the timing effect in a stochastic setting. See Gumus (2007) for a detailed
comparison of the models in Ahn et al. (2007) and those in this paper.
In this paper, we present a stylized model that captures the impact of pricing decisions
in one period on demands in others, and characterize the optimal inventory policy under
this demand model. In particular, we explicitly capture the impact of the time since the
last time a sale price was offered on the demand at a given sale price. We show that that
the presence of inventory has a non-trivial effects on dynamic pricing decisions. To develop
insight into these effects, we analyze a make-to-order version of our model, and building on
this analysis, we propose a simple threshold-type policy for the pricing strategy. Finally, we
conduct an extensive computational study to illustrate the impact of various cost and demand
parameters on pricing and inventory ordering decisions and to evaluate the performance of
our proposed threshold policy.
In the next section, we introduce our model, in which the distribution of demand in sale
periods depends on both the current price and the price history.
2. Model Formulation
We consider a discrete time, T -period finite horizon, single item stochastic demand model
where at each period the retailer offers one of two possible prices, a regular price or a sale
price, denoted pr and ps (pr > ps), respectively. At the start of each period (t = 1, . . . , T ),
the retailer makes inventory ordering (i.e., how much to order) and pricing decisions (sale or
regular) based on the current inventory level and the number of periods since the last sale.
4
Once decisions are made, demand is realized and holding or penalty cost is incurred. Let xt
represent the inventory level at the beginning of period t before ordering, kt be the number
of periods since the last sale before period t (where kt = 1 if there was a sale in period t−1),
yt be the inventory level in period t after ordering, and pt be the retail price in period t.
Our objective is to determine the optimal pricing and inventory plan that maximizes the
expected (discounted) profit. There is no fixed ordering cost, no maximum ordering capacity
and no delivery lead time (orders are received instantaneously). For the bulk of the paper,
we assume that all unsatisfied demand is lost. We briefly consider the backorder case towards
the end of the paper.
To model the reaction of customers to a sale, we divide potential customers into two
groups: consumers who will purchase independent of whether or not the product is on sale
or not and consumers who will purchase only if the product is on sale. We call the first
cohort regular customers and the second cohort sale customers. When the retailer charges
the regular price, only regular customers will purchase the product. Some sale customers
will remain and wait for a sale while others will leave. On the other hand, when the retailer
offers a sale in period t, regular and sale customers who enter the market in period t as well
as sale customers who have entered the system and remained since the last sale will buy
the product (at the sale price). Hence, demand at the sale price is likely to be higher than
demand at the regular price, and demand at the sale price is likely to be higher when more
periods have passed since the last sale than when fewer periods have passed. Demand in
period t, t = 1, . . . , T is therefore a function of the price in that period, pt and the number of
periods since the last sale, kt. To capture this, let ξt(pt, kt) be a random variable representing
the demand in period t when the retail price is pt and the number of periods since the last
sales is kt. We assume that ξt(pt, kt) is a non-negative continuous random variable and
denote its distribution and density Φt(ξ|pt, kt) and φt(ξ|pt, kt), respectively. Given pt and kt,
ξt(pt, kt) is defined as follows:
ξt(pt, kt) ∼
ξr if pt = pr;
ξsk, if pt = ps and kt = k.
where ξr and ξsk are both non-negative continuous random variables for regular and sale
demand when k periods have elapsed since the last sale. Also, let Φr(ξ) and φr(ξ) be the
c.d.f. and p.d.f. of demand when the regular price is offered. Similarly, let Φsk(ξ) and φs
k(ξ)
be the c.d.f. and p.d.f. of demand when the sale price is offered and k periods have passed
5
since the last sale. Let µr = E[ξr] and µsk = E[ξs
k].
To capture the consumer behavior discussed above, we make the following assumption.
Assumption 1. ξr ≤ST ξs1 and ξs
k is stochastically increasing in k.
Although one could argue that regular demand should be also be effected by the frequency
of sales, we do not model this possibility in order to keep the model tractable while still
capturing relevant intertemporal demand effects.
Consider the state transitions in this lost sales model. For each period t = 1, . . . , T ,
suppose (xt, kt) represent the inventory level at the start of period t and the number of
periods since the last sale. If the retailer orders yt − xt units and sets its retail price at pt,
then the inventory level and the number of periods since the last sale at the start of period
t + 1 are described as follows:
xt+1 = max[yt − ξt(pt, kt), 0] = [yt − ξt(pt, kt)]+, t = 1, . . . , T and
kt+1 =
{kt + 1, if pt = pr;1, if pt = ps.
Throughout the paper, we define [u]+ = max[u, 0] and [u]− = max[−u, 0] for any real
number u. For a given state (xt, kt) and retailer action (yt, pt), the retailer realizes the
following revenue and costs.
• pt min[yt, ξt(pt, kt)] is the revenue in period t.
• c(yt− xt) = c · (yt− xt) is the cost of raising inventory from xt to yt in period t, where
c is the variable cost.
• h(yt − ξt(pt, kt)) = h+ · [yt − ξt(pt, kt)]+ + h− · [yt − ξt(pt, kt)]
− is the inventory and
penalty cost at the end of period t, where h+ is the per-unit holding cost and h− is
the per-unit stock-out penalty cost. Note that since we are already capturing the loss
of revenue in our revenue term, h− represents the loss of goodwill cost or any other
lost-sales related cost that is not directly related to the current revenue. (When we
consider the backorder case, the same cost function applies except that h− represents
the per-unit backorder cost.)
• fT+1(xT+1) = c · xT+1 is the salvage value function for the terminal inventory. This
is a standard assumption in many inventory finite horizon inventory models (see, e.g.,
Porteus (2002)) that will facilitate subsequent analysis.
6
Let Vt(xt, kt) be the expected discounted profit-to-go function under the optimal policy
starting from state (xt, kt) and Jt(yt, pt; xt, kt) be the expected profit-to-go function if the
retailer chooses to offer price pt and raise the inventory to level yt, and then continues
optimally afterwards. Then, the retailer’s problem can be expressed as a stochastic dynamic
program satisfying the following recursive relation:
Vt(xt, kt) = maxpt∈{pr,ps}
{maxyt≥xt
Jt(yt, pt; xt, kt)
}and
Jt(yt, pt; xt, kt) = −c(yt − xt) +
∫ ∞
0
[pt min[yt, ξ]− h+[yt − ξ]+ − h−[yt − ξ]−
+αVt+1([yt − ξ]+, kt+1)
]φt(ξ|pt, kt)dξ
where VT+1(xT+1, kT+1) = fT+1(xT+1), and α is the discount rate. For notational simplicity,
we drop time indices from decision and state variables when references are obvious.
In many inventory problems, it is useful to work with transformed versions of value
functions. We let Jt(y, p ; k) = Jt(y, p; x, k) − cx and Vt(x, k) = Vt(x, k) − cx. Then, the
optimality equations can be rewritten as follows:
Vt(x, k) = maxp∈{pr,ps}
{maxy≥x
Jt(y, p; k)
}(2.1)
where Jt(y, p; k) satisfies:
Jt(y, p ; k) =∫ ∞
0
[p min{y, ξ}− cy +αc[y− ξ]+−h+[y− ξ]+−h−[y− ξ]−+αVt+1([y− ξ]+, kt+1)]φt(ξ|p, k)dξ.
In this transformed formulation, Jt(y, p ; k) does not depend on x and the terminal
condition for the value function becomes VT+1(·) = 0. Note that we have two decision
variables and two state variables, and thus, in general the optimal decisions are functions
of the two state variables, y∗t (x, k) and p∗t (x, k). In the next section, we analyze the form
of these optimal decisions and characterize the structural relationships between the optimal
inventory ordering/pricing decisions and the state variables. In Section 4, we explore a
make-to-stock version of our model, in Section 5, we use the insight from this exploration
to develop an effective heuristic for our model and present the results of our computational
study, and in Section 6, we present several extensions of our model and conclude.
3. Optimal Policy
In order to characterize the structure of the optimal inventory ordering policy, we need to
make some additional assumptions.
7
Assumption 2. The regular and sale demand distributions (i.e., the distributions of ξr and
ξsk, k = 1, 2, . . .) are strongly unimodal.
Generally, the convolution of a unimodal distribution with a unimodal function is not
unimodal. In other words, the unimodality is not preserved by the class of unimodal dis-
tributions. Ibragimov (1965) introduced the class of strongly unimodal distributions and
defined it as follows:
Definition 1. A distribution of a random variable is said to be strongly unimodal if its
convolution with any unimodal function is unimodal.
Ibragimov (1965) proved that a strongly unimodal distribution is unimodal and showed
that the class of non-degenerate strongly unimodal distributions coincides with the class of
distributions with log-concave density. We utilize this property to show that a base stock
policy is optimal for our model. Typically, to prove the optimality of a base stock policy in
a periodic review inventory model, researchers will show that the value function is concave
(convex under minimization) in inventory order-up-to level. Since concavity of functions
is preserved under the expectation operator, an induction-based argument can be used to
demonstrate that a base stock policy is optimal for all time periods. Unfortunately, the single
period value function of this model is not concave in inventory order-up-to levels. However,
it can be shown to be unimodal, which is well known to also be a sufficient condition for the
optimality of the base stock policy, and thus we need Assumption 2 to preserve unimodality
under the expectation operator.
Although this assumption is somewhat technical, a number of important random variables
are indeed strongly unimodal. Examples include normal, truncated normal, exponential,
gamma and uniform distributions, among the others. See Dharmadhikari and Joag-Dev
(1988) for examples, results, and applications of strongly unimodal distributions.
Using this assumption, we now show that the optimal ordering policy is a state-dependent
base stock policy:
Theorem 1. Suppose that Assumption 2 holds. In each time period t, for each p and k,
there exists an optimal st(p, k) such that if the starting inventory level xt < st(p, k), it is
optimal to raise the inventory level to st(p, k), and otherwise it is optimal to do nothing.
Proof. For notational simplicity, we omit the subscripts from state and decision variables. Let kbe a positive integer and p ∈ {pr, ps}. It is sufficient to show that Jt(y, k, p) is unimodal in y for
8
given p and k. After some algebraic manipulation, we can rewrite Jt(y, p; k) as a convolution oftwo functions, the second of which is strongly unimodal by Assumption 2:
Jt(y, p; k) =∫ ∞
0Gt(y − ξ; p, k)φt(ξ|p, k)dξ
where Gt(w; p, k) is as follows:
Gt(w; p, k) = (p− c)E[ξt(p, k)]− p[w]− − (1− α)c[w]+ + c[w]− − h+[w]+ − h−[w]− + αVt+1([w]+, kt+1)
Let πt(p, k) = (p− c)E[ξt(p, k)]. Noting that E[ξt(p, k)] = µr if p = pr and that E[ξt(p, k)] = µsk
if p = ps, we have
πt(p, k) ={
(pr − c)µr, if p = pr,(ps − c)µs
k, if p = ps.
Replacing (p− c)E[ξt(p, k)] with πt(p, k) and rearranging the terms, we have
Gt(w; p, k) = πt(p, k)− (p + h− − c)[w]− − (h+ + (1− α)c)[w]+ + αVt+1([w]+, kt+1)
It is sufficient to show that for a given p and k, Gt(w; p, k) is unimodal in w since the convolutionof a unimodal function with a strongly unimodal density is also unimodal (c.f., Ibragimov (1956) orTheorem 1.10 in Dharmadhikari and Joag-Dev (1988)). We show the unimodality of Gt(w; p, k) inw by induction on t. Suppose t = T . The first term π(p, k) is constant in w. Since p + h− − c ≥ 0,the sum of the second and third terms is a unimodal function and has a unique maximizer at w = 0.Since VT+1(w, k) = cw for all w ≥ 0, we have VT+1(w, k) = 0. Hence, GT (w; p, k) is unimodal inw because the sum of functions unimodal in w that have the same unique maximizer must also beunimodal in w with the same maximizer.Now, we show that if Gt+1(w, p, k) is unimodal, Gt(w, k, p) is also unimodal. The first three termsare the same as in GT (w, p, k), and therefore the sum of these three terms is unimodal in w witha unique maximizer at w = 0. To complete the proof, we need to show that the final term (i.e.αVt+1([w]+, kt+1)) is non-increasing in w and that its maximum is achieved at w = 0. Note thatfrom induction hypothesis, Gt+1(w, p, k) is unimodal in w, so Jt+1(y, p; k) is unimodal in y. Hence,maxy≥x Jt+1(y, p; k) is non-increasing in x ≥ 0 as increasing x decreases the region over which thefunction is maximized. This implies that
Vt+1([w]+, k) = max{
maxy≥[w]+
Jt+1(y, pr; k), maxy≥[w]+
Jt(y, ps; k)}
is also non-increasing in w and achieves the maximum at w ≤ 0 since [w]+ = 0 for w ≤ 0 andVt+1([w]+, k) is the maximum of two non-increasing functions. Hence, Gt(w; p, k) is unimodal inw because the sum of functions unimodal in w that have same unique maximizer must also beunimodal in w.
Theorem 1 characterizes the optimal order quantity for a given retail price for a retailer
with starting inventory level x and k periods since the last sale. Recall that st(p, k) is the
unconstrained order-up-to level in period t when the offered price is p and the number of
periods since the last sale is k, so
st(p, k) = arg maxy≥0
Jt(y, p; k).
9
This optimal order-up-to level critically depends on two factors, the difference between the
regular and sale prices and the number of periods since the last sale. The sale demand
increases in the number of periods since the last sales, k, and this drives the desired order-
up-to level up. On the other hand, the lower retail price during the sale makes the product
less attractive and drives the order-up-to level down. Depending on which force is more
significant, the order-up-to level under the sale price can be larger (or smaller) than that
under the regular price. For many realistic scenarios, however, it is reasonable to think that
the retailer will try to sell more under the sale price than the regular price. In order to to
capture this behavior, we make the following assumption:
Assumption 3. The order-up-to level of a single period newsvendor problem with the regular
and sale price demand distributions of our model is decreasing in price. That is,
Φs(−1)1
(ps + h− − c
ps + h+ + h− − αc
)≥ Φr(−1)
(pr + h− − c
pr + h+ + h− − αc
).
Assumption 3 enables us to further characterize the structure of the optimal policy.
Theorem 2. Under Assumptions 1 - 3:
(i) When the regular price is offered, the base-stock level is independent of the number of
periods since the last sale and the number of remaining periods in the planning horizon.
That is, there exists a constant sr > 0 such that st(pr, k) = sr for all t and k.
(ii) The base-stock level under the sale price is always larger than the base-stock level under
the regular price: st(ps, k) ≥ sr for all k and t.
(iii) For a given k ≥ 1, Vt(x, k) is constant in x ∈ [0, sr] for all t.
Proof. We prove the results by induction. First, let t = T .
(i) Since VT+1(x, k) = 0 for all x ≥ 0,
JT (y, pr; k) = πT (pr, k) +∫ ∞
0
{−(p + h− − c)[y − ξ]− − (h+ + (1− α)c)[y − ξ]+}
φ(ξ|pr, k)dξ
= πT (pr, k)−∫ ∞
y(p + h− − c)(ξ − y)φ(ξ|pr, k)dξ −
∫ y
0(h+ + (1− α)c)(y − ξ)φ(ξ|pr, k)dξ
Define J′
T (y, pr; k) to be the derivative of JT (y, pr; k) with respect to y. Applying Leibniz’srule, J
′T (y, pr; k) can be written as follows:
J′
T (y, pr; k) =∫ ∞
y(pr + h− − c)φ(ξ|pr, k)dξ −
∫ y
0(h+ + (1− α)c)φ(ξ|pr, k)dξ
= (p + h− − c)− Φr(y|pr, k)(p + h+ + h− − αc)
10
where Φr(y) is the cumulative distribution of regular demand. Since JT (y, pr; k) is unimodalin y by Theorem 1, we find the maximizer (i.e. the base stock level under the regular price)employing the standard newsvendor solution:
sT (pr, k) = (Φr)−1
(pr + h− − c
pr + h+ + h− − αc
∣∣∣∣ pr, k
)
where (Φr)−1(·|pr, k) is the inverse function of the cumulative distribution of regular demandand pr+h−−c
pr+h++h−−αcis the critical fractile associated with regular price. Since Φr(y) does not
depend on k, sT (pr, k) is constant in k. We denote this base stock level sr.
(ii) sT (ps, k) ≥ sr immediately follows from Assumption 3 and ξr ≤st ξsk, k ≥ 1. Thus, VT+1(x, k) =
0 for all x ≥ 0.
(iii) For given p and k, JT (y, p; k) is a unimodal function in y. Furthermore, from part (ii)
sr = arg maxy≥0
JT (y, pr; k) ≤ arg maxy≥0
JT (y, ps; k) = sT (ps, k) for all k ≥ 1.
Hence, for any x ≤ sr,
VT (x, k) = maxp∈{pr,ps}
{maxy≥x
JT (y, p; k)}
= max[JT (sr, pr; k), JT (sT (ps, k), ps; k)
]
Therefore, VT (x, k) is a constant for x ≤ sr. On the other hand, for x > sr, we have
VT (x, k) = max[JT (x, pr; k), JT (max[x, sT (ps, k)], ps; k)
]
Since both JT (x, pr; k) and JT (max[x, sT (ps, k)], ps; k) are continuous and non-increasing inx, VT (x, k) is also continuous and non-increasing in x.
Now, assume the results hold for t + 1. We prove that they hold for t:
(i) Since Jt(y, pr; k) is unimodal in y, it is sufficient to show that sr satisfies the first ordercondition. From the induction hypothesis, Vt+1(x; k) is constant for x ≤ sr and non-increasingbeyond sr. Hence V
′t+1(x; k) = 0 for x ≤ sr. Taking the derivative of Jt(y, pr; k) with respect
to y yieldsJ
′t (y, pr; k) = (p + h− − c)− Φr(y)(p + h+ + h− − αc)
for y ≤ sr. Solving for y, we have
st(pr, k) = sr = (Φr)−1
(pr + h− − c
pr + h+ + h− − αc
∣∣∣∣ pr, k
)for all k.
From the induction hypothesis and the fact that Jt(y, pr; k) is unimodal in y, Jt(y, pr; k)achieves the maximum at y = sr.
(ii) Note that for any y ≤ sr, w = [y − ξ(p, k)]+ ≤ sr. Thus, Vt+1(w, k) remains constant for allw ≤ sr and
J′
t (y, ps; k) = (ps + h− − c)− Φsk(y)(ps + h+ + h− − αc) for y ≤ sr.
11
From Assumption 3 and the fact that ξr ≤st ξsk,
J′
t (y, ps; k)∣∣∣y=sr
= (ps + h− − c)− Φsk(s
r)(ps + h+ + h− − αc)
≥ (pr + h− − c)− Φr(sr)(pr + h+ + h− − αc) = J′
T (y, pr; k)∣∣∣y=sr
= 0.
Therefore, st(ps; k) must be greater than or equal to sr for all k.
(iii) Since sr ≤ st(ps, k), the result follows from the same argument employed when t = T .
By combining the results of previous two theorems, we can begin to characterize the
optimal pricing and inventory policy. For this purpose, in addition to the regular base stock
and sale base stock levels, we define a third critical level st(k) that represents the smallest
starting inventory level such that offering the sale price is optimal when k periods have passed
since the last sale. In other words, if the starting inventory level is below st(k), it is optimal to
charge the regular price. Note that st(k) can range from zero if offering the sale price is more
profitable for any starting inventory level (i.e., maxy≥0 JT (y, pr; k) ≤ maxy≥0 JT (y, ps; k)), to
a value greater than the regular or sale price base stock level. The structure of optimal
policy depends on the relative ordering of st(k) with respect to sr and st(ps, k):
Theorem 3. Suppose Assumptions 1 − 3 hold. If there have been k periods since the last
sale, the optimal pricing and inventory policy in period t takes one of the following three
forms depending on the starting inventory level at the beginning of period t, x, relative to the
threshold st(k):
(i) st(k) = 0 : If x ≤ st(ps, k), it is optimal to order up to st(p
s, k) and offer the sale
price ps. Otherwise, it is optimal to not order and charge a state dependent price,
p∗t (x, k) = arg maxp∈{pr,ps} Jt(x, p; k) (see Figure 1).
(ii) sr < st(k) ≤ st(ps, k): If x ∈ [0, sr), it is optimal to order up to sr and sell at the
regular price pr. If x ∈ [sr, st(k)), it is optimal not to order, and to sell the product at
the regular price pr. If x ∈ [st(k), st(ps, k)), it is optimal to order up to st(p
s, k) and
sell at the sale price ps. If x ≥ st(ps, k), it is optimal to not order, and to follow the
state dependent pricing policy, p∗t (x, k) = arg maxp∈{pr,ps} Jt(x, p; k) (see Figure 2).
(iii) st(ps, k) ≤ st(k): If x ≤ sr, it is optimal to order up to sr and sell at the regular
price pr. If x > sr, it is optimal to not order and to offer the regular price pr for
12
x ∈ (sr, st(k)), and the state-dependent price p∗t (x, k) = arg maxp∈{pr,ps} Jt(x, p; k) if
xt ≥ st(k) (see Figure 3).
Proof. Notice that from equation (2.1),
Vt(x, k) = max{
maxy≥x
Jt(y, pr; k),maxy≥x
Jt(y, pr; k)}
.
Define J∗t (x, pr; k) = maxy≥x Jt(y, pr; k) and J∗t (x, ps; k) = maxy≥x Jt(y, ps; k), respectively. Bothare non-increasing in x by Theorem 1. Specifically, from Theorems 1 and 2, J∗t (x, pr; k) =Jt(sr, pr; k) for x ≤ sr and J∗t (x, pr; k) = Jt(x, pr; k) otherwise, and J∗t (x, ps; k) = Jt(st(ps; k), ps; k)for x ≤ st(ps; k) and J∗t (x, pr; k) = Jt(x, pr; k) otherwise. Hence, the structure of the optimal policyis determined by how these two non-increasing functions behave with respect to starting inventorylevel x; we explore three different cases:
Case 1: Jt(sr, pr; k) ≤ Jt(st(ps; k), ps; k)The fact that st(ps; k) ≥ sr (Theorem 2.(ii)) and the fact that Jt(y, p; k) is unimodal in yimply that
J∗t (x, pr; k) = Jt(sr, pr; k) ≤ J∗t (x, ps; k) = Jt(st(ps; k), ps; k) for x ≤ sr and,
J∗t (x, pr; k) ≤ Jt(sr, pr; k) ≤ J∗t (x, ps; k) = Jt(st(ps; k), ps; k) for sr < x ≤ st(ps; k).
Hence, as long as the starting inventory x is below st(ps; k), it is optimal to raise theinventory level to st(ps; k) and offer the sale, hence st(k) = 0. For x > st(ps; k), no-tice that the unimodality of Jt(y, p; k) in y implies that J∗t (x, pr; k) = Jt(x, pr; k) andJ∗t (x, ps; k) = Jt(x, ps; k), thus it is optimal to charge a state-dependent price p∗t (x, k) =arg maxp∈{pr,ps} Jt(x, p; k) and not order.
Case 2: Jt(sr, pr; k) > Jt(st(ps; k), ps; k) and Jt(st(ps; k), pr; k) ≤ Jt(st(ps; k), ps; k)Applying the condition and using the fact that st(ps; k) ≥ sr, we have
J∗t (x, pr; k) = Jt(sr, pr; k) > J∗t (x, ps; k) = J∗t (st(ps; k), ps; k) for x ≤ sr.
For x ≤ sr, ordering up to sr and selling at the regular price pr is therefore optimal.
For x ∈ (sr, st(ps; k)], note that J∗t (x, pr; k) = Jt(x, pr; k), which is (weakly) decreasing whileJ∗t (x, ps; k) remains to be J t(st(ps; k), ps; k). Since Jt(y, pr; k) is continuous in y, there mustbe a st(k) ∈ (sr, st(ps; k)] such that
st(k) = min{x ∈ (sr, st(ps; k)] | Jt(x, pr; k) = J∗t (st(ps; k), ps; k)}.
It follows from the definition of st(k) and the unimodality of Jt(y, pr; k) that
J∗t (x, pr; k) = Jt(x, pr; k) > Jt(st(ps; k), ps; k) for sr ≤ x < st(k) and
J∗t (x, pr; k) = Jt(x, pr; k) ≤ J∗t (x, ps; k) = J∗t (st(ps; k), ps; k) for st(k) ≤ x ≤ st(ps; k).
Hence, when sr ≤ x < st(k), it is optimal to sell the existing inventory at the regular price,pr and not order. On the other hand, if st(k) ≤ x ≤ st(x, k), then it is optimal to raisethe inventory to st(x, k) and sell at the sale price, ps. For x > st(ps; k), J∗t (x, pr; k) =Jt(x, pr; k) and J∗t (x, ps; k) = Jt(x, ps; k). Hence, it is optimal to follow a state-dependentprice p∗t (x, k) = arg maxp∈{pr,ps} Jt(x, p; k) and not order.
13
Case 3: Jt(sr, pr; k) ≥ Jt(st(ps; k), pr; k) > Jt(st(ps; k), ps; k).Applying the inequality Jt(sr, pr; k) > Jt(st(ps; k), ps; k) and the fact that st(ps; k) ≥ sr againimply that
J∗t (x, pr; k) = Jt(sr, pr; k) > J∗t (x, ps; k) = J∗t (st(ps; k), ps; k) for x ≤ sr.
For x ≤ sr, ordering up to sr and selling at the regular price pr is therefore optimal. Noticethat the condition implies that
Jt(sr, pr; k) ≥ J∗t (x, pr; k) = Jt(x, pr; k) > Jt(st(ps; k), ps; k) for x ∈ (sr, st(ps, k),
so it is optimal to sell the existing inventory at the regular price, pr, and not order. Ifx > st(ps, k), then it is optimal to not order in either price. Hence, it is optimal to follow astate-dependent price p∗t (x, k) = arg maxp∈{pr,ps} Jt(x, p; k) and not order.
Ordering Policy
Pricing Policyps ps p∗t (xt, kt)
st(kt) = 0 sr st(ps, kt)
st(kt) = 0 sr st(ps, kt)
xt
xt
Figure 1: Optimal base stock and pricing policy when st(k) = 0
Ordering Policy
Pricing Policy
0
p∗t (xt, kt)
st(ps, kt)
st(ps, kt)
xt
xtpspr pr
st(kt)
st(kt)sr
sr
0
Figure 2: Optimal base stock and pricing policy when st(k) ∈ [sr, st(p, k))
Ordering Policy
Pricing Policy
0
p∗t (xt, kt)
xt
xtpr pr
sr
sr
st(kt)
st(kt)
pr
st(ps, kt)
st(ps, kt)0
Figure 3: Optimal base stock and pricing policy when st(k) ∈ [st(p, k),∞]
In general, when the starting inventory level is higher than the appropriate base stock
level (regular or sale base stock as described above), the optimal pricing policy becomes state
14
dependent. In fact, the optimal pricing policy can be quite complicated with respect to both
state variables, as we illustrate with several examples.
First, consider the change in p∗t (x, k) with respect to x for a fixed k. Intuitively, one
might expect that p∗t (x, k) would decrease in starting inventory x because higher starting
inventory level lead to higher inventory holding costs, suggesting that inventory should be
liquidated through a sale. However, this intuition is not in general correct. To see this,
consider the following simple 2-period example with the following demand function:
Example 1.
ξt(p, k) =
{[12 + εr]
+ , if p = pr;[∑ki=1 18(1− β)i−1 + εs
]+
, if p = ps.
where β = 0.1 and εr and εs are normally distributed random variables with E[εr] = E[εs] = 0,
σεr = 4 and σεs = 6. Finally, pr = 30, ps = 15, h+ = 4, h− = 0, c = 10, α = 1 and the
planning horizon is T = 2 periods long.
It can be shown that Example 1 satisfies Assumptions 1-3. Figure 4 plots maxy≥x Jt(y, k, pr)
and maxy≥x Jt(y, k, ps) with respect to x for k = 2 in period one, and for k = 1 and k = 3
in period two. Recall that the optimal pricing decision at period t is determined by the
point-wise maximization of maxy≥x Jt(y, k, pr) and maxy≥x Jt(y, k, ps):
p∗t (x, k) = arg maxp∈{pr,ps}
{maxy≥x
Jt(y, k, p)}
and that if k = 2 in period one, it will either equal 1 or 3 in period two. Observe that in
period one (the left graph) the plots of maxy≥x J1(y, 2, pr) and maxy≥x J1(y, 2, ps) cross each
other both at point x1 and point x2, implying that the optimal pricing decision at in period
one first switches from regular to sale price at x1 and then back to regular price at x2 as
starting inventory increases. Why is this the case?
From Figure 4, observe that of the four possible pricing strategies, two dominate. Offer
regular price in the first period and sale price (call this strategy-RS) and offer sale price
in the first period and regular price in the second (strategy-SR). When initial inventory
is low, RS dominates, but as initial inventory increases, the leftover inventory and thus
holding cost in period one begins to increase, until SR becomes preferable. However, as
starting inventory continues to increase, at some point the leftover inventory at the end of
period two becomes significant, and so RS again dominates, since it eliminates additional
inventory in period two.
15
10 20 30 40 50 60 70 80
200
250
300
350
400
450
500
550t = 1
Starting inventory, x
maxy≥x J1(y, k, pr)
maxy≥x J1(y, k, ps)
10 20 30 40 50 60 70 80
−100
0
100
200
300
400
t = 2
Starting inventory, x
maxy≥x J2(y, k, pr)
maxy≥x J2(y, k, ps)
x1x2
k=2
k=2
k=1
k=1,2,3,…
k=3
Offer RegularPrice
Offer SalePrice
Offer RegularPrice
Figure 4: maxy≥x J1(y, k, p) and maxy≥x J2(y, k, p) for p = pr, ps.
Next, consider the relationship between between the optimal pricing decision and the
number of periods since the last sale, that is, the sensitivity of pt∗(x, k) to k for a fixed
x. In general, the pricing policy need not be decreasing in k since the optimal pricing
decision depends on how the marginal accumulation of sale demand changes with k. For this
example, we assume that the expected marginal accumulation in sale demand decreases as
k increases, i.e. µk+1 − µk is positive and decreasing in k. With this restriction, one might
expect that if it were optimal to offer the sale price when the last sale was k periods ago, it
would still be optimal to offer sale if the last sale is offered more than k periods ago since
delaying the sale decision would seem not to increase the demand enough to justify waiting
one more period to offer a sale. However, even with this type of restriction on sale demand
accumulation, the optimal price is not necessarily monotone in k. To see this, consider
another example satisfying Assumptions 1–3, where regular and sale demands are drawn
from uniform distributions:
Example 2.
ξt(p, k) =
{U(µr, vr), if p = pr;U(µk, vk), if p = ps.
16
where U(µ, v) is a uniform random variable that has a support from µ − v to µ + v. Let
µr = 6, vr = 3, µk = 9∑k
i=1 βi−1 and vk = vr
√∑ki=1 β2(i−1) with β = 0.90. Finally, pr = 30,
ps = 15, h+ = 5, h− = 0, c = 10, α = 1 and the planning horizon is T = 2 periods.
10 15 20 25
160
180
200
220
240
260
280
300
320
340
t = 1
Starting inventory, x
maxy≥x J1(y, k, pr)
maxy≥x J1(y, k, ps)
0 10 20 30 40 50 60 70
−200
−100
0
100
200
300
t = 2
Starting inventory, x
maxy≥x J2(y, k, pr)
maxy≥x J2(y, k, ps)
x1
x2
k=3
k=3
k=4
k=4
k=4
k=5
k=1
k=1,2,3,…
Offer RegularPrice
Offer RegularPrice
Offer SalePrice
Offer SalePrice
Figure 5: maxy≥x J1(y, k, p) and maxy≥x J2(y, k, p) for p = pr, ps.
As for the previous example, we illustrate the possible strategies in figure 5, and as before,
of the four possible pricing strategies, two dominate. Offer regular price in the first period
and sale price (strategy-RS) and offer sale price in the first period and regular price in the
second (strategy-SR). Observe that in period two, for any starting inventory between x1
and x2, for k = 4 RS dominates, but for k = 3, SR dominates. Again, this counterintuitive
behavior is caused by two opposite effects. On one hand, as k increases from 3 to 4, the
total demand over two periods doesn’t change much regardless of whether RS or SR is
employed due to the nature of the demand function. On the other hand, RS generates
leftover inventory that lasts two periods rather than one, which impacts the overall cost
more as k increases from 3 to 4.
In both these examples, the future impact of current inventory (we call this the long-term
or propagation effect) has a direct and somewhat counterintuitive impact on optimal policy.
17
4. The Make-to-order System
As we demonstrated in the previous section, the optimal policy of our model is sometimes
quite complicated, as pricing and inventory decisions are influenced by intertemporal de-
mand, starting inventory level, and the remaining planning horizon. In general, the optimal
pricing policy is not necessarily monotone in starting inventory level or time since the last
sale. In order to isolate the intertemporal demand effect on the pricing component of our
model, we next consider a simplified version of the model, in which we assume that the
retailer places an order after it sees the realized demand- a “make-to-order” system that is,
from the perspective of the retailer, deterministic. In this case, it is optimal to order exactly
the realized demand, thus the state is simply k, the number of periods since the last sale.
We write the retailer’s problem as a deterministic dynamic programming problem:
Vt(kt) = maxpt∈{pr,ps}
[π(pt, kt) + αVt+1(kt+1)]
where π(pt, kt) = (pr− c)µr if pt = pr; (ps−c)µsk if pt = ps, and VT+1(k) = 0 for all k. Noting
that the demand at the regular price is independent of the the number of periods since last
sale , we omit k in π(pr, k) and write it as π(pr).
We assume that the expected demand under the sale price, E[ξ(ps, k)] = µsk is strictly
concave and increasing in k, that is, µsk+1 − µs
k > µsk+2 − µs
k+1 ≥ 0, so that the marginal
accumulation of sale demand decreases as k increases. Under this assumption, we show that
a threshold policy is indeed optimal: there is a k∗t such that it is optimal to charge the
regular price, pr, if k ≤ k∗t , and to offer the sale price ps, otherwise.
Theorem 4. Let p∗t (k) be the optimal price at period t when k periods have passed since the
last sale. Then,
(i) p∗t (k) is non-increasing in k.
(ii) Vt(k + 1)− Vt(k) ≤ π(ps, k + 1)− π(ps, k) for all k = 1, . . . , t and t = 1, . . . , T .
In Appendix A, we prove this result by induction.
For finite horizon problems, finding the optimal pricing policy is quite straight forward, as
one can recast the dynamic programming as a shortest path problem. Indeed, this motivates
us to explore the effectiveness of this type of threshold policy for our original model, the
make-to-order model. We investigate this in the computational section of the paper.
18
4.1 The Infinite Horizon Model
The threshold levels suggested by Theorem 4 are in general time dependent, so that there
exists k∗t for each period t such that if the number of periods since the last sale is less than
k∗t , it is optimal to offer regular price and otherwise it is optimal offer sale price. In order to
better understand the nature of these threshold levels, we next consider the infinite horizon
version of this problem in both the discounted and average profit cases.
For both cases, in Appendix B, we prove that time invariant optimal threshold levels, k∗
exist for each period t. In other words,
Lemma 1. There exists an optimal stationary policy for both discounted and average profit
cases.
This suggests that there exists a cyclic policy where offering a sale every k∗ periods is
optimal. In order to characterize the optimal cycle length (that is, the optimal stationary
threshold level), we write the expected profit associated with a policy in which the sale price
is offered every k periods and find the optimal cycle length. Let Πα(k) and ΠA(k) be the
discounted and average expected profit of a k-period cyclic policy with initial state k0 = 1,
respectively. After some algebraic manipulation, we get, for all k ≥ 1,
Πα(k) =π(pr)
1− α+ αk−1π(ps, k)− π(pr)
1− αk(Discounted profit), and
ΠA(k) = π(pr) +π(ps, k)− π(pr)
k(Average profit).
The derivations of both expressions above are delegated to the Appendix C. From this, it
is easy to see that a periodic sale is better than selling at regular price every period if and
only if there exists a k such that π(ps, k)− π(pr) ≥ 0. In such case, the optimal cycle length
is the one that maximizes the profit:
kα∗ = arg max[αk−1π(ps, k)− π(pr)
1− αk] (Discounted profit), and
kA∗ = arg maxπ(ps, k)− π(pr)
k(Average profit).
To find the optimal cycle length, it suffices to consider k ≥ k = min{k|π(ps, k)−π(pr) ≥0}.
Lemma 2. In the infinite horizon discounted profit problem with discount factor α, for
k ≥ k,
19
(i) the k-period cyclic policy is better than the k + 1-period cyclic policy if and only if
π(ps, k + 1)− π(ps, k)
π(ps, k)− π(pr)≤ 1/α∑k
i=1 αi−1=
1− α
α
1
1− αk(4.1)
(ii) Πα(k) is unimodal in k.
Lemma 3. In the infinite horizon average profit problem, for k ≥ k,
(i) The k-period cyclic policy is better than a k + 1-period cyclic policy if and only if
π(ps, k + 1)− π(ps, k)
π(ps, k)− π(pr)≤ 1
k(4.2)
(ii) ΠA(k) is unimodal in k .
The proofs of of these lemmas are in Appendices D and E. Conditions (4.1) and (4.2)
imply that if the marginal benefit of extending the cycle by one period becomes sufficiently
small, then it is optimal to not extend the length of a cycle. In fact, both conditions describe
precisely the minimum marginal gain required to optimally extend the cycle length by at
least one period. Solving for the optimal cycle length is not hard since, in both cases, the
profit function is unimodal in k.
We employ Lemmas 2 and 3 to characterize the optimal pricing policy for the infinite
horizon problem.
Theorem 5. In the discounted profit problem,
(i) It is optimal to sell at the regular price in every period if and only if π(pr) ≥ π(ps, k)
for all k ≥ 1.
(ii) Otherwise, it is optimal to use the k∗-period cyclic pricing policy where k∗ is the small-
est integer greater than k satisfying condition (4.1).
The same results hold for the average profit case with condition (4.2) replacing condition
(4.1).
The proof can be found in Appendix D (discounted profit case) and Appendix E (average
profit case).
This result is related to other intertemporal pricing results, such as those in Sobel (1984)
and Pesendorfer (2002). Both papers model the demand accumulation assuming customers
20
are strategic and characterize the equilibrium pricing policy that consists of periodic price
reductions. In contrast to their models, this paper considers more general demand accumu-
lations and fully characterizes the optimal pricing policies for both discounted and average
profit cases with myopic customers. This result is also qualitatively similar to the optimality
of 2-period cyclic pricing policy proved in Ahn et al. (2007) where customers stay in the
market for at most 2 periods.
5. Computational Studies
We completed two computational studies, one to assess the impact of system parameters on
the operation of the system, and one to assess the the performance of a simple threshold
policy for this system in light of the potentially complicated structure of the optimal policy
discussed previously.
5.1 The Operational Impact of System Parameters
In this set of experiments, we vary the system parameters described below, and assess the
impact of these parameters on the optimal number of sales over the time horizon, as well as
(for the case of markdown percentage) the impact on expected profit. The parameters we
vary can be divided into three categories:
Prices We fix the regular price, pr and vary the markdown percentage. Let δ be the
percentage markdown with respect to the regular price, i.e.
∆ =
(1− ps
pr
)100%
Production parameters We use stationary linear production costs and inventory holding
costs (where shortage cost is zero), i.e. ct(x) = cx and ht(x) = h+x+ for all t = 1, . . . T .
Furthermore, we vary the holding cost as a percentage of production cost, i.e. h =
h+
c100%, and the production cost as a percentage of regular price, i.e.
m =
(pr − c
pr
)100%
where m is the percentage markup for the regular price.
21
Demand scenarios For the demand distributions, we use truncated normal random vari-
ables. Let N+(µ, σ) be the positive part of a normal random variable with mean µ and
standard deviation σ, then:
ξt(pt, kt) =
{N+(µr, cvµ
r), if pt = pr;∑kt
i=1 N+(µsi , cvµ
si ), if pt = ps.
where cv is coefficient of variation (the ratio of the standard deviation to the mean),
µr = µ(pr) and
µsi =
{µ(ps), if i = 1;µ(ps)γ(1− β)i−1, if i > 1.
where 0 < β < 1 and 0 < γ < 1. Note that the expected marginal accumulation
of sale demand is controlled by two parameters, β and γ. β controls the decrease in
rate of expected marginal accumulation as the time since the last sale increases. For
example, if β = 0.50, then the marginal accumulation decreases by half each period
that the retailer delays the sale decision. γ controls the ratio of expected marginal
accumulation in sale demand to the sale demand in the first period after a sale.
5.1.1 One Varying Parameter
For the initial computational study, we consider a base scenario and change the various
parameters one at a time, fixing the other parameters at the base level.
Set of Parameters We consider three values for each of the parameters:
1. We fix pr = 20 and change the markdown percentage with respect to regular price
by varying ∆: {10%, 20%, 30%}.2. We assume that µ(p) = 30− p.
3. The rate of decrease in marginal accumulation in sale demand measured by β:
{0.05, 0.15, 0.25}.4. The proportion of expected marginal accumulation sale demand to expected sale
demand at k = 1 is varied by varying γ =µs
1−µr
µs1
.
5. The uncertainty in demand as measured by the coefficient of variation cv: {0.20, 0.30, 0.40}.6. Percentage markup for the regular price m: {50%, 70%, 90%}.7. Inventory holding cost as a percentage of unit cost h: {10%, 15%, 20%}.
22
Base instance We use following values for the base instance:
{∆ = 10%, β = 0.15, cv = 0.20,m = 50%, h = 10%}.
Finally, we fix the discount factor, α = 0.9.
20 40 60 80 100Initial Inventory Level, x
m = 0.5m = 0.7m = 0.9
0 20 40 60 80 1000
2
4
6
8
10
Ns(x
)
Initial Inventory Level, x
h = 0.1h = 0.15h = 0.2
20 40 60 80 100Initial Inventory Level, x
r = 0.1r = 0.2r = 0.3
0 20 40 60 80 1000
2
4
6
8
10
Ns(x
)
Initial Inventory Level, x
β = 0.15β = 0.25β = 0.05
20 40 60 80 100Initial Inventory Level, x
cv = 0.2cv = 0.3cv = 0.4
Figure 6: Plots of Ns(x) for various cases
We computationally solve the dynamic program programming problem for this study. For
each instance, we solve a 12-period problem (i.e. T = 12) and calculate the optimal decision
rules, both inventory order-up-to levels and pricing decisions, i.e. (y∗t (x, k), p∗t (x, k)). Then,
we compute the expected number of sales during the planning horizon:
Ns(x) = E[number of sales|x0 = x, k0 = 1]
where the initial inventory level is x (i.e. x0 = x) and the initial k is 1 (i.e. k0 = 1). The
results are plotted in Figure 6. Each plot shows a set of three plots of Ns(x) for different
values of a specific parameter as the rest of the parameters are held constant at the base
level. We observe the following:
23
• The expected number of sales increases as the initial inventory level increases. This
behavior can be observed in almost all plots in Figure 6. This is quite intuitive since
as inventory increases, the retailer will want to offer sale more frequently in order to
liquidate the excess inventory as quickly as possible. In a few cases (see the figure in
which the markdown percentage ∆ is varied), the expected number of sales decreases
as the inventory increases for the high inventory levels. Recall our discussion at the
end of Section 3, where we pointed out that in some instances, the firm delays its sale
decision to avoid the propagation of excess inventory and this causes expected total
number of sales to decrease when the inventory is high.
• As individual parameters vary:
– As the markup from cost for the regular price decreases, the expected number
of sales decreases for all initial inventory levels. This behavior is intuitive as
decreasing the markup also decreases the profit margins, making it necessary for
the firm to delay the sale decision in order to accumulate a profitable level of sale
demand. This in turn leads to less frequent sales.
– The expected number of sales increases for all inventory levels as the unit inventory
cost increases. This is also quite intuitive since increasing holding costs make it
costly to delay the sale decision, inducing more frequent sales.
– As the markdown percentage goes up (that is, the sale prices go down), it is
optimal to offer less frequent sales. As sale prices decrease, both sale demand and
marginal accumulation in sale demand increase. This leads to less frequent sales,
so that the firm can take advantage of larger accumulations in sale demand.
– As the speed of decrease in marginal accumulation in sale demand increases,
the expected number of sales increases. Marginal accumulation in sale demand
decreases more rapidly as β increases, and thus it becomes less and less profitable
to delay the sale decision, since waiting for an additional period to accumulate
additional sales demand does not generate enough revenue to justify the wait.
– The expected number of sales decreases as the coefficient of variation in demand
increases, but the impact is marginal.
24
1.5 2 2.5 3markdown percentage (i.e. r)
m = 0.5m = 0.7m = 0.9
1 1.5 2 2.5 31
2
3
4
5
Ns
speed of decrease in marginal accumulation (i.e. β)
m = 0.5m = 0.7m = 0.9
1.5 2 2.5 3holding cost
m = 0.5m = 0.7m = 0.9
1 1.5 2 2.5 31.5
2
2.5
3
3.5
4
Ns
speed of decrease in marginal accumulation (i.e. β)
r = 0.1r = 0.2r = 0.3
1.5 2 2.5 3holding cost
r = 0.1r = 0.2r = 0.3
1 1.5 2 2.5 31.5
2
2.5
3
3.5
4
Ns
holding cost
β = 0.05β = 0.15β = 0.25
Figure 7: Plots of expected number of sales for various cases
5.1.2 Two Varying Parameters
Next, instead of changing a single parameter at a time while keeping the rest constant, we
change all of the parameters except the coefficient of variation of demand (i.e. cv = 0.10)
and generate 81 (i.e. 3 levels of m × 3 levels of ∆ × 3 levels of β × 3 levels of h) instances.
For each specific instance, we compute the expected total sales averaged across all initial
inventory levels:
Ns =1
xf
∫ xf
0
Ns(x)dx
where xf is the maximum inventory level we consider in our computational study. The results
are plotted in Figure 7 for different pairs of parameters. Each point in the plot represents the
expected total sales Ns averaged across all instances while keeping two selected parameters
constant at specific levels. Observe that each of our observations from the single parameter
study extends to this case.
25
5.1.3 Varying Markdown Percentage
Recall that in our model, the markdown percentage is assumed to be constant. Next, we
analyze the impact on total profit of varying markdown percentage. For this purpose, we
compute the expected total profit for various levels of markdown percentage:
Π∗(∆) = E[Total Profit|x0 = 0, k0 = 1]
where the markdown percentage is ∆, initial inventory level is 0 (i.e. x0 = 0) and initial k
is 1 (i.e. k0 = 1). The results are plotted in Figure 8. Each plot in the figure shows a set of
three curves representing Π∗(∆) for different values of a specific parameter, while the rest of
parameters are held constant at the base level. Observe the following:
• In almost all cases, the expected total profit first increases and then decreases as
markdown percentage increases and achieves its maximum level at some markdown
percentage level.
• As a single parameter varies, we observe:
– The expected total profit increases for all markdown levels as the regular price
markup percentage increases whereas it decreases as holding cost, uncertainty
in demand and speed of decrease in marginal demand accumulation goes up. All
these results are self-explanatory except for the impact of decrease rate in marginal
demand accumulation (i.e. β). As β increases, the sale demand accumulates very
fast in the beginning, however its growth slows down later. This causes more
frequent sales and less accumulation in sale demand. This in turn decreases the
expected total profit that can be generated.
– The markdown level that maximizes the expected total profit within the range of
computational study increases as the markup percentage (that is, the markup of
from cost to regular price) goes up. This is because a relatively lower cost (that
is, higher regular price) allows the firm to decrease the sale price, generating more
profit due to both increased sale demand and increased marginal accumulation of
sale demand.
– The optimal markdown level increases as the speed of decrease in marginal de-
mand accumulation decreases. We’ve seen that the smaller β is, the less frequently
26
sales occur. As in the previous item, this induces the firm to decrease the sale
price in order to generate more revenue due to increased sale demand.
– The impact of holding cost and coefficient of variation of demand on the optimal
markdown percentage appears to be marginal.
0.15 0.2 0.25 0.3Markdown percentage
m = 0.5m = 0.6m = 0.7
0.1 0.15 0.2 0.25 0.31200
1220
1240
1260
1280
1300
Expecte
dP
rofit
Markdown percentage
h = 0.1h = 0.15h = 0.2
0.15 0.2 0.25 0.3Markdown percentage
β = 0.15β = 0.25β = 0.05
0.1 0.15 0.2 0.25 0.31180
1200
1220
1240
1260
1280
1300
Expecte
dP
rofit
Markdown percentage
cv = 0.2cv = 0.3cv = 0.4
Figure 8: Plots of expected profits with respect to markdown percentage (i.e. Delta) forvarious cases
5.2 A Threshold Heuristic
Recall that the optimal pricing policy for the make-to-stock model can be quite complicated.
Thus, we are motivated to test a simple threshold policy suggested by the make-to-order
model:
Threshold Policy
• If k < k∗t then offer regular price and use base stock with sr.
• If k ≥ k∗t then offer sale price and use base stock with s(ps, k).
27
To implement this policy, we utilize threshold levels found by solving the deterministic
dynamic programming problem for the make-to-order version of the particular instance we
are interested in. We then manage the inventory using base stock policy with the following
levels:
sr = (Φr)−1(f r)
and
s(ps, k) = (Φsk)−1(f s)
where f r and f s are the critical fractiles of the single period problem associated with regular
and sale prices, respectively. Note that the sr is equal to the optimal base stock associated
with regular price and s(ps, k) is greater than the optimal base stock level associated with
sale price st(ps, k).
Of course, the optimal cost for a particular make-to-order instance will be less than
the optimal cost of the equivalent make-to-stock instance since the former avoids inventory
costs, and thus these threshold levels come from solving a potentially very different problem.
Nevertheless, this heuristic approach seems to be a reasonable compromise between the
complexity of the actual policy, and a policy which is relatively simple to calculate and
implement.
As before, we evaluate the performance of this policy by changing a single parameter at a
time while fixing the rest of the parameters at base levels. We plot expected profits that are
generated by optimal versus threshold type policies policies as a function of initial inventory
level, x0 = x assuming k0 = 1. Now,let Π∗(x) and ΠH(x) be the expected profits generated
by optimal and heuristic policies, respectively.
The results are plotted in Figures 9. Each plot shows a set of three plots of Π∗(x) and
ΠH(x) for different values of a specific parameter, while the rest of parameters are held
constant at the base level. We observe that in general, the heuristic policy performs quite
well. Remember that a sale is scheduled at a particular time for two reasons: to clear excess
inventory or to generate more revenue by strategically delaying the sale until expected sale
demand has accumulated. The optimal policy reflects both of these sales criteria, whereas
the heuristic policy only reflects the strategic value of delaying sale.incentives to the pricing
policy whereas the heuristic policy reflects only strategic incentive of sales decision to pricing
policy. In spite of that, heuristic policy performing quite well, particularly when inventory is
low. However, as inventory gets higher, the quality of the heuristic degrades, since it doesn’t
28
20 40 60 80 100
Π∗(x) vs ΠH(x)
Initial Inventory Level, x
0 20 40 60 80 1000
500
1000
1500Π∗(x) vs ΠH(x)
Initial Inventory Level, x
20 40 60 80 100
Π∗(x) vs ΠH(x)
Initial Inventory Level, x
0 20 40 60 80 100600
800
1000
1200
1400Π∗(x) vs ΠH(x)
Initial Inventory Level, x
20 40 60 80 100
Π∗(x) vs ΠH(x)
Initial Inventory Level, x
m = 0.5m = 0.7m = 0.9
h = 0.1h = 0.15h = 0.2
r = 0.1r = 0.2r = 0.3
β = 0.15β = 0.25β = 0.05
cv = 0.2cv = 0.3cv = 0.4
Figure 9: Plots of Π∗(x) and ΠH(x) for various cases
account for the impact of short term holding cost effects.
6. Extensions, Conclusions and Future Research
6.1 Make-to-Stock Model with Backorder
In Section 3, we characterized the structure of optimal policy when unsatisfied demand is
lost. In this section, we extend the result to the backorder case where unsatisfied demand is
backlogged. We assume that backlogged buyers do not cancel and reorder even if the price
goes down – this assumption is consistent with the recent models of Federgruen and Heching
(1999) and Chen and Simchi-Levi (2004a, 2004b). Hence, the inventory level x may be
positive or negative. As in Section 2, h+ is the per-unit holding cost, but h− now represents
the per-unit backorder cost. If the retailer raises inventory level to yt from xt and sets its
retail price at pt when the state in period t is (xt, kt), then the state in the next period is
29
described as follows:
xt+1 = yt − ξt(pt, kt), t = 1, . . . , T, and
kt+1 =
{kt + 1, if pt = pr;1, if pt = ps.
We assume that demand is fulfilled on a first-come-first-serve basis, so that no demand in
the current period will be satisfied without first clearing backorders. Just as in the lost sale
case, we subtract the ordering cost cx in each period to transform the original model into
an equivalent, but more tractable, form. As before, let Vt(x, k) be the optimal expected
discounted revenue from period t and onwards, and Jt(y, p; k) be the expected discounted
revenue of a policy that raises the inventory to y and sets the retail price p in period t and
follows the optimal policy afterward. Then,
Vt(x, k) = maxp∈{pr,ps}
{maxy≥x
Jt(y, p; k)
}
where Jt(y, p; k) satisfies:
Jt(y, p; k) =
∫ ∞
0
[pξ−cy+αc(y−ξ)−h+[y−ξ]+−h−[y−ξ]−+αVt+1(y−ξ, kt+1)]φt(ξ|p, k)dξ.
To facilitate our analysis, we make the following assumption in addition to Assumptions 1
and 2:
Assumption 4. h− > (1− α)c.
This implies that satisfying the demand in the current period is better than delaying it
for one period and satisfying it in the next period, and it is a fairly standard assumption
in the inventory literature. Furthermore, the result is trivial when Assumption 4 does not
hold: The firm simply accrues all backorders until the end of horizon. In Appendix F, we
prove that a base-stock policy is optimal when the price and number of periods since the
last sale are fixed:
Theorem 6. Suppose that Assumptions 1, 2 and 4 hold. Jt(y, p; k) is unimodal in y for given
p and k. Therefore, a base-stock policy is optimal.
Similarly, the result of Theorem 2 can be extended to the backorder case. Unlike the
lost sale case, we do not need Assumption 3, as the critical fractiles, which determine the
order-up-to levels, do not depend on the retail price. In Appendix G, we prove:
30
Theorem 7. If Assumptions 1,2, and 4 hold, then
(i) When the regular price is offered, the base-stock level is independent of the number of
periods since the last sale and the number of remaining periods in the planning horizon.
That is, there exists a constant sr > 0 such that i.e. st(pr, k) = sr for all t and k.
(ii) The base-stock level under the sale price is always larger than the base-stock level under
the regular price: st(ps, k) ≥ sr for all k and t.
(iii) For given k ≥ 1, Vt(x, k) is constant in x ∈ [0, sr] for all t.
Using Theorems 6 and 7, we now characterize the optimal pricing and inventory policy
for the backorder case. As in the lost sale case, let st(k) be the smallest starting inventory
level such that offering a sale is optimal when k periods have passed since the last sale.
Theorem 8. Suppose that Assumptions 1, 2 and 4 hold. Then, if there have been k periods
since the last sale, the optimal pricing and inventory policy in period t takes one of the
following three forms depending on the starting inventory level at the beginning of period t,
x, relative to st(k):
(i) st(k) < sr: If x ≤ st(ps, k), it is optimal to order up to st(p
s, k) and sell at the sale
price ps. Otherwise, it is optimal not to order, and to follow the state dependent pricing
policy, p∗t (x, k) = arg maxp∈{pr,ps} Jt(x, p; k).
(ii) sr ≤ st(k) < st(ps, k): If x < sr, it is optimal to order up to sr and sell at the regular
price pr. If x ∈ [sr, st(k)), it is optimal not to order, and to sell at the regular price pr.
If x ∈ [st(k), st(ps, k)), it is optimal to order up to st(p
s, k) and sell at the sale price
ps. If x ≥ st(ps, k), it is optimal not to order and to follow the state dependent pricing
policy, p∗t (x, k) = arg maxp∈{pr,ps} Jt(x, p; k) .
(iii) st(p, k) ≤ st(k): If x ≤ sr, it is optimal to order up to sr and sell at the regular price pr.
If x > sr, it is optimal not to order, and to sell at the regular price pr for x ∈ (sr, st(k)),
and follow the state-dependent pricing policy, p∗t (x, k) = arg maxp={pr,ps} Jt(x, p; k) for
x ≥ st(k).
Proof. The proof is similar to that of Theorem 3, and is therefore omitted.
31
6.2 Conclusions and Future Research
In this paper, we introduced and analyzed a model that explicitly considers the timing effect
of intertemporal pricing – the concept that demand during a sale depends on the time since
the last sale as well as the sale price. We presented structural results that characterize
the interaction between the decision to have a sale and inventory ordering decisions. Our
computational analysis helped to highlight the key factors that impact the optimal sale
frequency, and the optimal markdown amount. We also developed a heuristic for this complex
model, and computationally assessed its performance.
Our model has a variety of limitations. We assume that the demand at regular price is
not affected by past pricing decisions, although in reality we expect that regular demand
may erode as sales are offered more and more frequently, inducing even regular price cus-
tomers to wait for promotional periods. Nevertheless, we believe that the model and results
presented in this paper contribute to the understanding of the impact of consumer behavior
on inventory and pricing.
Building on the results in this paper, we intend to extend our model by including non-zero
fixed inventory ordering costs, and a variety of different demand models. We will explore
the effect of capacity limitations on optimal policy. Also, our model focused on the problem
of a monopolist; we believe that a game theoretic model in which demand is influenced by
current and past pricing decisions of multiple retailers in the market is worth investigating.
Acknowledgment
This research was partially supported by NSF Grants DMI-0092854 and DMI-0200439.
References
Ahn, H, M. Gumus, P. Kaminsky. 2007. Pricing and Manufacturing Decisions when Demand
is a Function of Prices in Multiple Periods. Operations Research, Forthcoming.
Assuncao, J.L., R.J. Meyer. 1993. The Rational Effect of Price Promotions on Sales and
Consumption. Management Science 39(5) 517-535.
Aviv, Y., A. Pazgal. 2007. Optimal pricing of seasonal products in the presence of forward-
looking consumers. Manufacturing & Service Operations Management, Forthcoming.
Bitran, G.R., S.V. Mondschein. 1997. An Application of Yield Management to the Hotel
Industry Considering Multiple Day Stays. Operations Research 43(3) 427-443.
32
Bitran, G.R., R. Caldentey. 2003. An Overview of Pricing Models for Revenue Management.
Manufacturing & Service Operations Management 5 203-229.
Chan, L. M. A., Z. J. M. Shen, D. Simchi-Levi, J. Swann. 2004. Coordination of Pricing
and Inventory Decisions: A Survey and Classification. S. David, S. D. Wu and Z. Shen,
eds. Handbook of Quantitative Supply Chain Analysis: Modeling in the E-Business Era,
Kluwer Academic Publishers.
Chen, F., S. Ray, Y. Song. 2006. Optimal Pricing and Inventory Control Policy in Periodic-
Review Systems with Fixed Ordering Cost and Lost Sales. Naval Research Logistics 53
117-136.
Chen, H., O. Wu, D. D. Yao. 2007. Optimal Pricing and Replenishment in a Single Product
Inventory System, Working paper.
Chen, X., D. Simchi-Levi. 2004a. Coordinating Inventory Control and Pricing Strategies
with Random Demand and Fixed Ordering Cost: The Finite Horizon Case. Operations
Research 52 887-896.
, . 2004b. Coordinating Inventory Control and Pricing Strategies with Ran-
dom Demand and Fixed Ordering Cost: The Ininite Horizon Case. Mathematics of
Operations Research 29 698-723.
Cheng, F., S. P. Sethi. 1999. A Periodic Review Inventory Model with Demand Influenced
by Promotion Decisions. Management Science 45(11) 1510-1523.
Conlisk, J., E. Gerstner, J. Sobel. 1984. Cyclic Pricing by a Durable Goods Monopolist.
The Quarterly Journal of Economics 99 489-505.
Dharmadhikari, S. K. Joag-Dev. 1988. Unimodality, Convexity, and Applications. Academic
Press Series in Probability and Mathematical Statistics, Academic Press, Boston, MA.
Eliashberg, J., R. Steinberg. 1993. Marketing-Production Joint Decision-Making. J. Eliash-
berg, G.L. Lilien, eds. Handbooks in Operations Research and Management Science:
Marketing 5 827-880, North Holland, The Netherlands.
Elmaghraby, W., P. Keskinocak. 2003. Dynamic Pricing in the Presence of Inventory Con-
siderations: Research Overview, Current Practices and Future Directions. Management
Science 49 1287-1309.
Elmaghraby, W., A. Gulcu, P. Keskinocak. 2007. Optimal Pre-Announced Markdowns in
the Presence of Rational Customers with Multi-unit Demands. Manufacturing & Service
33
Operations Management, Forthcoming.
Federgruen, A., A. Heching. 1999. Combined Pricing and Inventory Control Under Uncer-
tainty. Operations Research 47 454-475.
Feng, Y., F. Y. Chen. 2003. Joint Pricing and Inventory Control with Setup Costs and
Demand Uncertainty, Working Paper.
Gallego, G., G. Van Ryzin. 1994. Optimal Dynamic Pricing of Inventories with Stochastic
Demand over Finite Horizons. Management Science 40(8) 999-1020.
Gayon J.-P., I. Talay, F. Karaesmen, L. Ormeci. 2007. Dynamic Pricing and Replenishment
in a Production-Inventory System with Markov-Modulated Demand. Working paper.
Gumus, M. 2007. The effect of intertemporal demand interactions on joint pricing and inven-
tory ordering models. PhD Thesis, Department of Industrial Engineering and Operations
Research, University of California, Berkeley, CA.
Ibragimov, I.A. 1956. On the composition of unimodal distributions. Theor. Probab. Appl.
1(2) 255-260.
Lippman, S. A. 1975. On dynamic programming with unbounded rewards. Management
Science 21(11) 1225-1233.
McGill, J. I., G. Van Ryzin. 1999. Revenue Management: Research Overview and Prospects.
Transportation Science 33(2) 233-256.
Pesendorfer, M. 2002. Retail sale: A Study of Pricing Behavior in Supermarkets. Journal
of Business 75 33-66.
Petruzzi, N. C., M. Dada. 1999. Pricing and the Newsvendor Problem: A Review with
Extensions. Operations Research 47 183-194.
Polatoglu, H., I. Sahin. 2000. Optimal Procurement Policies Under Price- Dependent De-
mand, International Journal of Production Economics 65 141-171.
Porteus, E.L. 2002. Foundations of Stochastic Inventory Theory. Stanford University Press,
Stanford, CA.
Slade, M. E. 1998. Optimal pricing with costly adjustment: Evidence from retail-grocery
prices. Review of Economic Studies 65 87-107.
Slade, M. E. 1999. Sticky prices in a dynamic oligopoly: An investigation of (s, S) thresholds.
International Journal of Industrial Organization 17(4) 477-511.
34
Sobel, J. 1984. The Timing of sale. Review of of Economic Studies 51(3) 353-368.
Sobel, J. 1991. Durable Goods Monopoly with Entry of New Customers. Econometrica 59
1455-1485.
Song, Y., T. Boyaci, S. Ray. 2007. Optimal Dynamic Joint Pricing and Inventory Control
for Multiplicative Demand with Fixed Order Costs and Lost Sales, Working Paper.
Su, X. 2007. Intertemporal Pricing with Strategic Customer Behavior. Management Science,
53(5) 726-741.
Subrahmanyan, S. R. Shoemaker. 1996. Developing Optimal Pricing and Inventory Policies
for Retailers Who Face Uncertain Demand. J. Retailing 72(1) 7-30.
Yano, C. A., S. M. Gilbert. 2003. Coordinated pricing and production/procurement de-
cisions: A review. A. Chakravarty, J. Eliashberg, eds. Managing Business Interfaces:
Marketing, Engineering and Manufacturing Perspectives, Kluwer Academic Publishers,
Boston, MA.
Yin, R., K. Rajaram. 2007. Joint Pricing and Inventory Control with a Markovian Demand
Model. European Journal of Operational Research, Forthcoming.
Zhou, Y.P., M. Fan, M. Cho. 2007. On the threshold purchasing behavior of customers
facing dynamically priced perishable products. Working paper.
A. Proof of Theorem 4
Proof. We prove the results by induction. First, let t = T . Suppose the first claim does not
hold for t = T . Then, there must exist a k such that p∗T (k) = ps and p∗T (k + 1) = pr. In
other words,
π(ps, k)+αVT+1(1) > π(pr)+αVT+1(k+1) and π(ps, k+1)+αVT+1(1) < π(pr)+αVT+1(k+2)
From the fact that VT (·) = 0, we get 0 > π(ps, k + 1)− π(ps, k), which is a contradiction to
the assumption that µsk is increasing. Thus, p∗T (k) must be non-increasing in K. Applying
this, the second claim follows easily since
VT (k + 1)− VT (k) ≤ max{π(pr)− π(pr), π(ps, k + 1)− π(ps, k), π(ps, k + 1)− π(pr, k + 1)}≤ max{π(ps, k + 1)− π(ps, k), π(ps, k + 1)− π(pr)}≤ π(ps, k + 1)− π(ps, k).
35
Now suppose the results hold for period t + 1 onwards and consider period t. Suppose
the first claim does not hold for some k, that is, there exists a k such that p∗t (k) = ps and
p∗t (k + 1) = pr. Then, it must be the case that
π(k, ps)+αVt+1(1) > π(pr)+αVt+1(k+1) and π(k+1, ps)+αVt+1(1) < π(pr)+αVt+1(k+2).
Combining two inequalities, we get
α(Vt+1(k + 1)− Vt+1(k)) > π(ps, k + 1)− π(ps, k).
For any α ∈ [0, 1], this inequality contradicts the induction hypothesis, Vt+1(k+1)−Vt+1(k) ≤π(ps, k + 1)− π(ps, k). Thus, p∗t (k) must be non-increasing in k.
For the second claim, we consider three cases:
• Case (i) p∗t (k) = pr and p∗t (k + 1) = pr:
Vt(k + 1)− Vt(k) = π(pr) + αVt+1(k + 2)− [π(pr) + αVt+1(k + 1)]
= Vt+1(k + 2)− Vt+1(k + 1)
≤ π(ps, k + 2)− π(ps, k + 1) ≤ π(ps, k + 1)− π(ps, k)
where the first inequality is from the induction hypothesis and the second inequality
is from the assumption that µsk is increasing and concave.
• Case (ii) p∗t (k) = ps and p∗t (k + 1) = ps:
Vt(kt + 1)− Vt(kt) = π(ps, kt + 1) + αVt+1(1)− [π(ps, kt) + αVt+1(1)] = π(ps, kt + 1)− π(ps, kt).
• Case (iii) p∗t (k) = pr and p∗t (k + 1) = ps:
Vt(k + 1)− Vt(k) = π(ps, k + 1) + αVt+1(1)− [π(pr) + αVt+1(k + 1)]
≤ π(ps, k + 1) + αVt+1(1)− [π(ps, k) + αVt+1(1)]
= π(ps, kt + 1)− π(ps, kt)
where the inequality comes from the fact that p∗t (k) = pr.
Combining three cases proves the second claim.
36
B. Proof of Lemma 1
Proof. For discounted profit case, we will use Theorem 1 in Lippman [1975]. For that
purpose, he defined the weighted supremum norm ‖ · ‖w,m for real-valued functions v on Z+
by
‖v(·)‖w,m = supk∈Z+
w(k)−m|v(k)|
He showed that there is an optimal stationary policy if
(1) there exists w(k) ≥ 1 and m ≥ 1 such that ‖ supp∈{pr,ps} π(p, k)‖w,m = M ≤ ∞,
(2) there is a b > 0 such that for kt ∈ Z+,
supp∈{pr,ps}
∑
kt+1∈Z+
w(kt+1)q(kt+1|kt, p) ≤ (w(kt) + b)n
where n = 1, . . . , m, and q(kt+1|kt, p) is the transition probability from kt to kt+1 given
pt = p, and
(3) action space is finite.
Let w(k) = π(ps, 1)k and m = 1. Now, we will show that condition (1) holds since π(ps, k)
is concave increasing in k and:
‖ supp∈{pr,ps}
π(p, k)‖w,m = supp∈{pr,ps}
supk∈Z+
π(p, k)
π(ps, 1)k≤ max{1, π(pr)
π(ps, 1)} ≤ ∞
Letting b = π(ps, 1) and noting that q(kt+1 = k + 1|kt = k, pr) = 1 and q(kt+1 = 1|kt =
k, ps) = 1, we can show that condition (2) also holds:
max{π(ps, 1)(k + 1), π(ps, 1)} = π(ps, 1)(k + 1) ≤ (π(ps, 1)k + π(ps, 1))
Since the action space is also finite in our case, conditions (1)-(3) hold.
For the average cost case, it is sufficient to consider the case that there exists a k such
that π(ps, k) > π(pr) for k ≥ k. (Otherwise, selling at the regular price every period
is optimal, and thus the result holds trivially.) We prove the result by constructing the
optimal stationary policy and showing that it generates more profit than any non-stationary
pricing policy. Assume that the initial state is k0 = 1. Consider a pricing policy denoted by
37
κ and let ki be the length of ith pricing cycle in that policy. Let ΠAκ be the average profit
generated by this policy:
ΠAκ = lim
N→∞1∑N
j=1 kj
N∑i=1
[(ki−1)π(pr)+π(ps, ki)] = limN→∞
1∑Nj=1 kj
N∑i=1
ki
ki
[(ki−1)π(pr)+π(ps, ki)]
Let ΠA(ki) = 1ki
[(ki−1)π(pr)+π(ps, ki)]. Note that ΠA(ki) is the average profit generated
by a stationary ki-period cyclic pricing policy. We can rewrite ΠAκ as a weighted average of
ΠA(ki) as follows:
ΠAκ = lim
N→∞
N∑i=1
ki∑Nj=1 kj
ΠA(ki)
From Lemma 3, ΠA(k) is unimodal in k and it is maximized at kA∗.1 Hence,
ΠAκ = lim
N→∞
N∑i=1
ki∑Nj=1 kj
ΠA(ki) ≤ ΠA(kA∗).
Since the kA∗-period cyclic pricing policy is an optimal stationary policy, the optimal sta-
tionary policy always generates more profit than any non-stationary pricing policy. Now,
suppose that the initial state is greater than 1, i.e. k0 > 1. After the first sale, the system
behaves exactly the same as the system described above, where the initial state is k0 = 1.
Hence, the optimal policy is to use a stationary k∗-period cyclic pricing policy after the first
sale. The timing of first sale however can be easily calculated by finding the maximizer ofπ(ps,k0−1+k)−π(pr)
k, which is also unimodal in k by Lemma 3 with π(ps, k0 − 1 + k) replacing
π(ps, k).
1The maximizer, kA∗ is well defined since the expected sale demand is strictly concave and increasing.
38
C. Derivation of expected discounted and average profit
functions
The derivation of expected discounted profit function of a k-cyclic pricing policy is as follows:
Πα(k) = π(pr) + απ(pr) + . . . + αk−2π(pr) + αk−1(π(pr) + π(ps, k)− π(pr))
+ αk[π(pr) + απ(pr) + . . . + αk−2π(pr) + αk−1(π(pr) + π(ps, k)− π(pr))]
+ ....
=∑
i
αiπ(pr) + αk−1∑
i
(αk)i(π(ps, k)− π(pr))
=π(pr)
1− α+ αk−1π(ps, k)− π(pr)
1− αk
Next, we will derive the expression for average profit function of a k-cyclic pricing policy as
follows:
ΠA(k) = limN→∞
1
NkN [(k − 1)π(pr)− π(ps, k)]
=(k − 1)π(pr)− π(ps, k)
k= π(pr) +
π(ps, k)− π(pr)
k
D. Proofs for the Make-to-Order Case: Discounted
Profit
Proof of Lemma 2(i): Note that a k-cyclic pricing policy is better than a k + 1-cyclic
pricing policy if and only if Πα(k)− Πα(k + 1) is positive:
Πα(k)− Πα(k + 1)
= αk−1π(ps, k)− π(pr)
1− αk− αk π(ps, k + 1)− π(pr)
1− αk+1
= αk−1
(1−αk)(1−αk+1)
[(1− αk+1)(π(ps, k)− π(pr))− α(1− αk)(π(ps, k + 1)− π(pr))
]
Adding and subtracting απ(ps, k) and organizing the expression inside the brackets:
= αk−1
(1−αk)(1−αk+1)
[α(1− αk)(π(ps, k)− π(ps, k + 1)) + (1− α)(π(ps, k)− π(pr))
]
The expression inside the brackets are positive if and only if
π(ps, k + 1)− π(ps, k)
π(ps, k)− π(pr)≤ (1− α)
α
1
1− αk.
39
Proof of Lemma 2(ii):
Using the previous lemma, we will show that if Πα(k) ≥ Πα(k + 1), i.e.,
π(ps, k + 1)− π(ps, k)
π(ps, k)− π(pr)≤ 1
α
(1− α)
1− αk=
1
α∑k
i=1 αi−1
then Πα(k + 1) ≥ Πα(k + 2), i.e.
π(ps, k + 2)− π(ps, k + 1)
π(ps, k + 1)− π(pr)≤ 1
α
(1− α)
1− αk+1=
1
α∑k+1
i=1 αi−1.
Note that
π(ps, k + 1)− π(ps, k)
π(ps, k)− π(pr)=
1π(ps,k)−π(pr)
π(ps,k+1)−π(ps,k)
=1
π(ps,k)−π(ps,k−1)+π(ps,k−1)−π(ps,k−2)+...+π(ps,1)−π(pr)π(ps,k+1)−π(ps,k)
=1
π(ps,k)−π(ps,k−1)π(ps,k+1)−π(ps,k)
+ π(ps,k−1)−π(ps,k−2)π(ps,k+1)−π(ps,k)
+ . . . + π(ps,1)−π(pr)π(ps,k+1)−π(ps,k)
≤ 1
α∑k
i=1 αi−1
which implies that
π(ps, k)− π(ps, k − 1)
π(ps, k + 1)− π(ps, k)+
π(ps, k − 1)− π(ps, k − 2)
π(ps, k + 1)− π(ps, k)+. . .+
π(ps, 1)− π(pr)
π(ps, k + 1)− π(ps, k)≥ α
k∑i=1
αi−1.
Multiplying each term in the left hand side with π(ps,k+1)−π(ps,k)π(ps,k+2)−π(ps,k+1)
does not change the in-
equality since π(ps,k+1)−π(ps,k)π(ps,k+2)−π(ps,k+1)
> 1;
π(ps, k)− π(ps, k − 1)
π(ps, k + 2)− π(ps, k + 1)+
π(ps, k − 1)− π(ps, k − 2)
π(ps, k + 2)− π(ps, k + 1)+. . .+
π(ps, 1)− π(pr)
π(ps, k + 2)− π(ps, k + 1)≥ α
k∑i=1
αi−1.
If we add π(ps,k+1)−π(ps,k)π(ps,k+2)−π(ps,k+1)
to the left hand side and αk+1 to the right side, the inequality
still holds since π(ps,k+1)−π(ps,k)π(ps,k+2)−π(ps,k+1)
> 1 > αk+1 due to the fact that that π(ps, k) is concave
increasing and α < 1. Hence, we obtain:
π(ps, k + 1)− π(ps, k)
π(ps, k + 2)− π(ps, k + 1)+
π(ps, k)− π(ps, k − 1)
π(ps, k + 2)− π(ps, k + 1)+. . .+
π(ps, 1)− π(pr)
π(ps, k + 2)− π(ps, k + 1)≥ α
k+1∑i=1
αi−1
which in turn implies that
π(ps, k + 2)− π(ps, k + 1)
π(ps, k + 1)− π(pr)≤ 1
α∑k+1
i=1 αi−1.
Proof of Theorem 5
40
(i) Recall that the expected discounted profit of k-cyclic pricing policy is
Πα(k) =π(pr)
1− α+ αk−1π(ps, k)− π(pr)
1− αk= Πα
r + αk−1π(ps, k)− π(pr)
1− αk
where Παr = π(pr)
1−αis the expected discounted profit of constant regular pricing policy.
Note that Παr ≥ Πα(k) if and only if π(pr) ≥ π(ps, k) for all k.
(ii) The uniqueness comes from the unimodality of Πα(k) in k. Using (4.1) and the uni-
modality of Πα(k) in k, cyclic pricing with cycle length k∗ generates more profit than
any cycle length k < k∗ and k > k∗ if and only if
π(ps, k∗)− π(ps, k∗ − 1)
π(ps, k∗ − 1)− π(pr)>
1− α
α
1
1− αk∗−1
and
π(ps, k∗ + 1)− π(ps, k∗)π(ps, k∗)− π(pr)
<1− α
α
1
1− αk∗
which implies that k∗ is the smallest integer satisfying condition (4.1).
E. Proofs for the Make-to-Order Model: Average Profit
Case
Proof of Lemma 3(i): Note that a k-cyclic pricing policy is better than a k + 1-cyclic
pricing policy if and only if ΠA(k)− ΠA(k + 1) is positive:
ΠA(k)− ΠA(k + 1) =π(ps, k)− π(pr)
k− π(ps, k + 1)− π(pr)
k + 1= 1
k(k+1)[(k + 1)(π(ps, k)− π(pr))− k(π(ps, k + 1)− π(pr))]
= 1k(k+1)
[k(π(ps, k)− π(ps, k + 1)) + (π(ps, k)− π(pr))] .
The expression inside the brackets are positive if and only if
π(ps, k + 1)− π(ps, k)
π(ps, k)− π(pr)≤ 1
k.
Proof of Lemma 3(ii): We will show that if ΠA(k) ≥ ΠA(k+1) then ΠA(k+1) ≥ ΠA(k+2).
We know that ΠA(k) ≥ ΠA(k + 1) if and only if
π(ps, k + 1)− π(ps, k)
π(ps, k)− π(pr)≤ 1
k
41
which implies that
k(π(ps, k + 1)− π(ps, k)) ≤ π(ps, k)− π(pr).
Adding and subtracting π(ps, k + 1) to the right hand side and organizing the expression:
(k + 1)(π(ps, k + 1)− π(ps, k)) ≤ π(ps, k + 1)− π(pr).
Since π(ps, k) is concave increasing in k, we obtain:
(k + 1)(π(ps, k + 2)− π(ps, k + 1)) ≤ (k + 1)(π(ps, k + 1)− π(ps, k)) ≤ π(ps, k + 1)− π(pr)
which implies thatπ(ps, k + 2)− π(ps, k + 1)
π(ps, k + 1)− π(pr)≤ 1
k + 1
hence ΠA(k + 1) ≥ ΠA(k + 2).
Proof of Theorem 5
(i) Recall that the expected average profit of a k-cyclic pricing policy is
ΠA(k) = π(pr) +π(ps, k)− π(pr)
k= ΠA
r +π(ps, k)− π(pr)
k
where ΠAr is the expected average profit of constant regular pricing policy. Note that
ΠAr ≥ ΠA(k) if and only if π(pr) ≥ π(ps, k) for all k.
(ii) The uniqueness comes from the unimodality of ΠA(k) in k. Using (4.2) and unimodality
of ΠA(k) in k, a cyclic pricing with cycle length k∗ generates more profit than any cycle
length k < k∗ and k > k∗ if and only if
π(ps, k∗)− π(ps, k∗ − 1)
π(ps, k∗ − 1)− π(pr)>
1
k∗
and
π(ps, k∗ + 1)− π(ps, k∗)π(ps, k∗)− π(pr)
<1
k∗
which implies that k∗ is the smallest integer satisfying condition (4.2).
42
F. Proof of Theorem 6
Proof. As in the lost sale case, we write Jt(y, p; k) as a convolution of two functions:
Jt(y, p; k) =
∫ ∞
0
Gt(y − ξ; p, k)φt(ξ|p, k)dξ
where Gt(w; p, k) is given as follows:
Gt(w; p, k) =π(p, k)− (1− α)c[w]+ + (1− α)c[w]− − h+[w]+ − h−[w]− + αVt+1(w, kt+1)
=π(p, k)− (h+ + (1− α)c)[w]+ − (h− − (1− α)c)[w]− + αVt+1(w, kt+1) (F.1)
where π(p, k) = (p − c)E[ξ(p, k)], thus π(p, k) = (pr − c)µr when p = pr and π(p, k) =
(ps − c)µsk when p = ps.
Since ξ(p, k) is strongly unimodal, it is sufficient to show that Gt(w; p, k) is unimodal in
w for a given p and k. We will do this by induction on t.
Suppose t = T . From our assumption, VT+1 = 0. The first term in equation (F.1),
π(p, k), is constant in x. From Assumption 4, the sum of the second and third terms is
unimodal function and is maximized when w = 0. Therefore, GT (w; p, k) is unimodal in w
and is maximized at w = 0, and JT (y, p; k) is unimodal in y.
Suppose the result holds for period t + 1 and consider Gt(w; k, p). Once again, the first
three terms of equation (F.1) are the same as in t = T , and therefore the sum of these three
terms is unimodal in w and maximized at w = 0. We complete the proof by showing that
the last term (i.e. αVt+1(w, kt+1)) is non-increasing in w and maximized at w = 0. From
the induction hypothesis that Gt+1(w; p, k) is unimodal in w, Jt+1(y, p; k) is unimodal in y.
From the fact that pr > ps > c and the fact that h− > (1 − α)c, it can be easily shown
through a sample path argument that it is suboptimal to not satisfy the existing backorder.
Thus, arg maxy Jt+1(y, p; k) ≥ 0, and maxy≥w Jt+1(y, p; k) is non-increasing in w. As a result,
maxy≥0 Jt+1(y, p; k) contains the maximum. This implies that
Vt+1(w, kt+1) = max
{maxy≥w
Jt+1(y, pr; k). maxy≥w
Jt+1(y, ps; k)
}
is also non-increasing in w and achieves the maximum at w = 0. Since the sum of unimodal
functions which have the same maximizer (mode) is also unimodal, Gt(w; k, p) is unimodal
in w.
43
G. Proof of Theorem 7
Proof. We prove this by induction. Suppose t = T .
(i) Since VT+1(x, k) = 0 for all x ≥ 0,
JT (y, pr; k) =
∫ ∞
0
[π(p, k)− (h+ + (1− α)c)[y − ξ]+ − (h− − (1− α)c)[y − ξ]−
]φ(ξ; pr, k)dξ
= π(p, k)−∫ y
0
(h+ + (1− α)c)(y − ξ)φ(ξ; pr, k)dξ −∫ ∞
y
(h− − (1− α)c)(ξ − y)φ(ξ; pr, k)dξ
The derivative of JT (y, pr; k) with respect to y, J′
T (y, pr; k) is written as follows:
J′
T (y, pr; k) =
∫ ∞
y
(h− − (1− α)c)φ(ξ; pr, k)dξ −∫ y
0
(h+ + (1− α)c)φ(ξ; pr, k)dξ
= (h− − (1− α)c)− Φr(y)(h+ + h−)
where Φr(y) is the cumulative distribution of regular demand. Since JT (y, pr; k) is
unimodal in y and arg max JT (y, pr; k) ≥ 0 by Theorem 6, we find the maximizer (i.e.
the base stock level under the regular price) by equating the J′
T (y, pr; k) to zero and
solving for y, which yields
sT (pr, k) = (Φr)−1
(h− − (1− α)c
h+ + h−
∣∣∣∣ pr, k
)
where (Φr)−1(·|pr, k) is the inverse function of cumulative distribution of regular de-
mand and h−−(1−αc)h++h− is the critical-fractile associated with a newsvendor facing regular
demand. Since Φr(y) does not depend on k, sT (pr, k) is constant in k. We will denote
regular base stock level sr.
(ii) Taking the derivative of JT (y, ps; k) with respect to y, we have
J′
T (y, ps; k) = (h− − (1− α)c)− Φsk(y)(h+ + h−)
where Φsk(·) is the cumulative distribution function of sale demand given that k periods
have passed since the last sale. Equating the J′
T (y, ps; k) to zero and solving for y yields
sT (ps, k) = (Φsk)−1
(h− − (1− α)c
h+ + h−
∣∣∣∣ pr, k
)
where (Φsk)−1(·|ps, k) is the inverse function of cumulative distribution of the sale de-
mand when k periods have passed since the last sale. Since ξr ≤ST ξsk for all k ≥ 1,
Φr(y) ≥ Φsk(y) for all y, sT (ps, k) ≥ sr.
44
(iii) The fact that JT (y, p; k) is unimodal in y and part (ii) above (that is, sr ≤ sT (ps, k)
for all k) together imply that, for x ≤ sr,
VT (x, k) = maxp∈{pr,ps}
{maxy≥x
JT (y, p; k)
}
= max[JT (sr, pr; k), JT (sT (ps, k), ps; k)]
Hence, VT (x, k) is a constant for x ≤ sr. For x ≤ sr, note that
VT (x, k) = maxp∈{pr,ps}
{maxy≥x
JT (y, p; k)
}
= max[JT (x, pr; k), JT (max[x, sT (ps, k)], ps; k)]
Since both JT (x, pr; k) and JT (max[x, sT (ps, k)], ps; k) are continuous and non-increasing
in x, VT (x, k) is also continuous and non-increasing in x.
Now, suppose the results hold for period t + 1, and consider period t:
(i) Since Jt(y, pr; k) is unimodal in y by Theorem 6, it is sufficient to show that sr satisfies
the first order condition. From the induction hypothesis, Vt+1(x; k) is constant (i.e.,
V′t+1(x; k) = 0) for x ≤ sr and non-increasing afterwards. For y ≤ sr, the derivative of
Jt(y, pr; k) with respect to y is
J′
t (y, pr; k) = (h− − (1− α)c)− Φr(y)(h+ + h−).
Solving for y, we have
sT (pr, k) = (Φr)−1
(h− − (1− α)c
h+ + h−|pr, k
)for all k.
From the induction hypothesis and the fact that Jt(y, pr; k) is unimodal in y, Jt(y, pr; k)
achieves the maximum at y = sr.
(ii) Note that, for any y ≤ sr, w = y − ξ(p, k) ≤ sr. Thus, Vt+1(w, k) remains constant for
all w ≤ sr and
J′
t (y, ps; k) = (h− − (1− α)c)− Φsk(y)(h+ + h−) for y ≤ sr.
Since ξr ≤st ξsk, J
′t (y, ps; k)
∣∣∣y=sr
≥ J′
T (y, pr; k)∣∣∣y=sr
= 0, st(ps; k) must be greater than
or equal to sr for all k.
(iii) The result follows immediately from the observation that st(ps, k) ≥ sr.
45