An Optimally-Competitive Algorithm for Maximum Online Perfect Bipartite Matching with i.i.d. Arrivals

Chang, Minjun; Hochbaum, Dorit S.; Spaen, Quico; Velednitsky, Mark

doi:10.1007/s00224-019-09947-7

Published: 07 September 2019

Volume 64, pages 645–661, (2020)

Theory of Computing Systems

Minjun Chang¹, Dorit S. Hochbaum², Quico Spaen² & Mark Velednitsky² (ORCID: orcid.org/0000-0003-1176-5159)

Abstract

We present an optimally-competitive algorithm for the problem of maximum online perfect bipartite matching with i.i.d. arrivals. In this problem, we are given a known set of workers, a distribution over job types, and non-negative utility weights for each pair of worker and job types. At each time step, a job is drawn i.i.d. from the distribution over job types. Upon arrival, the job must be irrevocably assigned to a worker and cannot be dropped. The goal is to maximize the expected sum of utilities after all jobs are assigned. We introduce Dispatch, a 0.5-competitive, randomized algorithm. We also prove that 0.5-competitive is the best possible. When a job arrives, Dispatch first selects a “preferred worker” and assigns the job to this worker if it is available. The preferred worker is determined based on an optimal solution to a fractional transportation problem. If the preferred worker is not available, Dispatch randomly selects a worker from the available workers.

1 Introduction

We consider the problem of maximum online perfect bipartite matching. Suppose that we have a set of jobs and a set of workers. At every time step, a single job arrives to be served by one of the workers. Upon a job’s arrival, we observe the utility of assigning the job to each of the workers. We must immediately decide which worker will serve the job. Once a worker is assigned a job, the worker is busy and cannot be assigned to another job. Jobs continue to arrive until all workers are busy.

It is natural to model this problem setup as a bipartite graph, where there is an edge between each worker and job. The weight of the edge equals the non-negative utility of assigning that worker to that job. The assignment of workers to jobs will form a perfect matching in this bipartite graph. Our goal is to design a dispatching algorithm that maximizes the expected sum of utilities of the perfect matching.

In this work, we consider the maximum online perfect bipartite matching problem with independent and identically distributed (i.i.d.) arrivals. This means that, at each time step, a job is drawn i.i.d. from a known distribution over job types.

We introduce the randomized algorithm Dispatch for the problem of online weighted perfect bipartite matching with i.i.d. arrivals. Dispatch is a 0.5-competitive algorithm: the total expected utility of the perfect matching produced by Dispatch is at least half of the total expected utility of an optimal algorithm that knows the job arrival sequence in advance. We also describe a family of problem instances for which 0.5 is the best-possible competitive ratio. The Dispatch algorithm thus achieves the best-possible competitive ratio. In contrast, for the same problem with adversarial job arrivals, the competitive ratio cannot be bounded, as observed by Feldman et al. [8].

To assign workers to jobs, Dispatch first selects a preferred worker. This preferred worker is determined based on an optimal solution to a fractional transportation problem. If the preferred worker is available, then the job is assigned to this worker. Otherwise, Dispatch randomly selects a worker from the available workers.

1.1 Related Work

Our work resides in the space of online matching problems. We review several variants of online matching, including the maximum (imperfect) bipartite matching problem and the minimum (perfect) bipartite matching problem. We also review the closely-related k-Server problem. For each of these problems, several arrival models are considered. Arrival models include adversarial, where the adversary chooses jobs and their arrival order; random order, where the adversary chooses jobs but not their arrival order; and i.i.d., where the adversary specifies a probability distribution over job types and each arrival is sampled independently from the distribution. We briefly describe each of these problems and present the best-known results, contrasting them with the setting considered here. A summary is in Table 1.

Table 1 Best-known competitive ratios and impossibility bounds for online bipartite matching problems

1.1.1 Maximum Online (Imperfect) Bipartite Matching

The maximum online (imperfect) bipartite matching problem is defined on a bipartite graph with n known workers and n jobs that arrive one at a time. Jobs either get assigned to a worker or are discarded. The goal is to maximize the cardinality (or sum of weights) of the resulting matching. In contrast to our problem, jobs may be discarded and the resulting matching may be imperfect.

For the unweighted problem with adversarial arrivals, Karp, Vazirani, and Vazirani [12] showed a best-possible algorithm that achieves a competitive ratio of $1-\frac {1}{e} \approx 0.632$. Variations of the problem have been proposed: addition of edge or vertex weights, the use of budgets, different arrival models, etc. Mehta [19] provides an excellent overview of this literature. When the arrivals are in a random order, it is possible to do better than $1-\frac {1}{e}$. Mahdian and Yan [16], in 2011, achieved a competitive ratio of 0.696. Manshadi et al. [18] showed that you cannot do better than 0.823. If the problem also has weights, then the best-possible competitive ratio is 0.368 by a reduction from the secretary problem as shown by Kesselheim et al. [13]. They also give an algorithm that attains this competitive ratio.

The problem has also been studied when the jobs are drawn i.i.d. from a known distribution. This problem is also referred to as Online Stochastic Matching. The first result to break the $1 - \frac {1}{e}$ barrier for the unweighted case was the 0.67-competitive algorithm of Feldman et al. [9] in 2009. To date, the best-known competitive ratio of 0.730 is due to Brubach et al. [4]. This is close to the best-known upper bound of 0.745 by Correa et al. [6].

1.1.2 Online Minimum (Perfect) Bipartite Matching

The online minimum (perfect) bipartite matching problem addresses the question of finding a minimum cost perfect matching on a bipartite graph with n workers and n jobs. Given an arbitrary sequence of jobs arriving one by one, each job needs to be irrevocably assigned to a worker on arrival. This problem is the minimization version of the problem considered in this work. However, the obtained competitive ratios do not transfer.

The problem was first considered by Khuller, Mitchell, and Vazirani [14] and independently by Kalyanasundaram and Pruhs [11]. If the weights are arbitrary, then the competitive ratio cannot be bounded. To address this, both papers considered the restriction where the edge weights are distances in some metric on the set of vertices. They give a (2n − 1)-competitive algorithm, which is the best possible for deterministic algorithms. When randomized algorithms are allowed, the best-known competitive ratio is $O(\log ^{2}(n))$ by Bansal et al. [3]. If the arrival order is also randomized, then Raghvendra [21] shows that $2\log {(n)}$ is attainable. He also shows that this is the best possible.

1.1.3 k-Server Problem

In the k-server problem, k workers are distributed at initial positions in a metric space. Jobs are elements of the same metric space and arrive one at a time. When a job arrives, it must be assigned to a worker which moves to the job’s location. The goal in the k-server problem is to minimize the total distance traveled by all workers to serve the sequence of jobs. After an assignment, the worker remains available for assignment to new jobs. This reassignment distinguishes the k-server problem from ours, where workers are fixed to a job once assigned.

The k-server problem was introduced by Manasse, McGeoch, and Sleator [17]. A review of the k-server problem literature was written by Koutsoupias [15]. For randomized algorithms in discrete metrics, the competitive ratio $O(\log ^{2}{(k)} \log {(n)})$ was attained by Bubeck et al. [5], where n is the number of points in the discrete metric space. On the other hand, ${\varOmega }(\log {(k)})$ is a known lower bound. In the i.i.d. setting, Dehghani et al. [7] consider a different kind of competitive ratio: they give an online algorithm with a cost no worse than $O(\log {(n)})$ times the cost of the optimal online algorithm.

1.2 Applications

We offer four examples of settings where the model of maximum online perfect bipartite matching with i.i.d. arrivals may be suitable. These examples are not intended to be exhaustive, only to illustrate how the model can be applied in diverse settings.

In the first setting, consider the problem of assigning medical practitioners to patients arriving in an emergency room [23]. Here, workers represent medical practitioners, the jobs are the patients that arrive, and the job type is determined by the patient’s symptoms. Most patients arrive unannounced, forming a stochastic arrival sequence. Since not every practitioner is equally suitable to treat a given patient, the goal is to assign practitioners to arriving patients such that practitioners and patients are well-matched in the resulting assignment.

In the second setting, consider customers calling a call center [2]. Customer calls arrive stochastically and each call needs to be assigned to an operator, the worker. The callers are typically asked to categorize their problem and are routed accordingly. The utility is a measure of the operator’s expertise in handling the particular call type.

In the third setting, consider the problem of matching ad impressions to customers in an online marketplace for e.g. travel, lodgings, or insurance [19]. In this case, the customers are the jobs, arriving one at a time. Customers are characterized into types based on their demographic information and any other factors that may affect the bids of advertisers, such as previous activity on the site. Unlike general search engine advertising, the business needs of an online marketplace dictate that an advertisement must be shown. The utility of displaying an advertisement is the increase in revenue from the customer.

In the fourth setting, consider the problem of matching drivers to riders in a ride-sharing system [1]. Trips are bucketed geographically according to the regions in which the ride starts and ends. More granular regions result in more job types. The utility of a pairing between driver and rider depends on many aspects, including but not limited to the distance and duration of the trip, the current location of the driver, and the driver’s desire to go to the trip’s destination.

The arrival sequence of jobs in many of these applications can be cast as a Poisson process. When jobs are split into sub-processes according to their type, the resulting process is a Poisson splitting process. It is well-known that the sub-processes are independent Poisson processes and that the distribution of waiting times in a Poisson process is memoryless [22]. As a result, the arrival type of the jobs is i.i.d. when the distribution over types is (approximately) independent of time. The i.i.d. arrival model, as used here, is thus well-suited when an arrival sequence is a Poisson process.

1.3 Structure of this Work

This paper is organized as follows. Section 2 formally introduces the problem of online perfect bipartite matching with i.i.d. arrivals and defines the concept of competitive ratio. Section 3 describes Dispatch, presents an example to demonstrate the algorithm, and provides the proof that Dispatch is 0.5-competitive. Section 4 introduces a family of instances of the online perfect bipartite matching problem for which no online algorithm performs better than $\frac {1}{2}$ in terms of competitive ratio. Finally, Section 5 summarizes the results and suggests directions for future research.

2 Preliminaries

The set of workers is denoted by W with size n = |W|. The set J denotes the set of job types with size k = |J|. For every worker w ∈ W and job type j ∈ J there is a utility of u_wj ≥ 0 for assigning a job of type j to worker w. Let $\mathcal {D}(J)$ be a known probability distribution over the job types.

At every time step $t=1, \dots , n$, a single job is drawn i.i.d. from J according to $\mathcal {D}$. The job must be irrevocably assigned to a worker before the next job arrives. Workers are no longer available after they have been assigned a job. Let r_j denote the expected number of jobs of type j that arrive. After n steps, each worker is assigned to one job and the resulting assignment forms a perfect matching. Our goal is to design a procedure such that the expected sum of the utilities of the resulting perfect matching is as high as possible.
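The arrival model can be sketched in a few lines of Python. This is a minimal illustration, not part of the paper: the names `expected_counts` and `sample_arrivals` and the distribution `probs` are hypothetical. It relies on the relation $r_{j} = n \cdot \mathbb{P}(j_{t} = j)$, which is how r_j is used later in the proof of Lemma 2.

```python
import random

def expected_counts(probs, n):
    """Return r_j = n * p_j, the expected number of arrivals of each job type."""
    return [n * p for p in probs]

def sample_arrivals(probs, n, rng):
    """Draw n job types i.i.d. from the distribution given by probs."""
    return rng.choices(range(len(probs)), weights=probs, k=n)

probs = [0.2, 0.4, 0.4]   # hypothetical distribution D(J) over k = 3 job types
n = 5                     # number of workers, which is also the number of time steps
r = expected_counts(probs, n)
arrivals = sample_arrivals(probs, n, random.Random(0))
print(r, arrivals)
```

The expected counts always sum to n, which is what later makes the transportation problem balanced: total demand equals total supply.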

Throughout this work, we will repeatedly use two bipartite graphs: the expectation graph G and the realization graph $\widehat {G}$. The expectation graph G = (W,J,E) is a complete bipartite graph defined over the set of workers W and the set of job types J. An edge [w,j] ∈ E has associated utility u_wj ≥ 0, for w ∈ W and j ∈ J. The realization graph $\widehat {G} = (W, \widehat {J}, \widehat {E})$ is the random bipartite graph obtained after all n jobs have arrived. $\widehat {J}$ denotes the set of n jobs that arrived. We use $\hat {j}_{t} \in \widehat {J}$ to denote the job that arrives at time t and j_t ∈ J to denote its job type. The edge set $\widehat {E}$ consists of all worker-job pairs, such that $\widehat {G}$ is a complete bipartite graph defined over W and $\widehat {J}$. Every edge $[w, \hat {j}] \in \widehat {E}$ has utility u_wj, where j is the job type of job $\hat {j}$. It is important to remember that the expectation graph G is deterministic and known in advance whereas the realization graph $\widehat {G}$ is a random graph representing a realization of the job arrival process and is revealed over time.

An instance of the online perfect bipartite matching problem with i.i.d. arrivals is defined by the set of workers W, the job types J, non-negative utilities u_wj, and a distribution over the job types $\mathcal {D}(J)$. Equivalently, the expectation graph G and the distribution $\mathcal {D}(J)$ define an instance of this problem. Here we analyze the family of potentially randomized algorithms that return a perfect matching $\hat {M}$ on $\widehat {G}$. The performance of an algorithm ALG for a single realization $\widehat {G}$ is given by:

$$ ALG(\widehat{G}) = \mathbb{E}\left[\sum\limits_{[w,j] \in E} u_{wj} I_{wj}\right], $$

where I_wj is a random indicator variable that equals 1 if ALG assigned a job of type j to worker w and equals 0 otherwise. For a given problem instance defined by expectation graph G and distribution $\mathcal {D}(J)$, $\mathbb {E}\left [ ALG(\widehat {G})\right ]$ measures the algorithm’s expected performance over samples of $\widehat {G}$ from G according to $\mathcal {D}(J)$.

The worst-case performance across instances is measured by the competitive ratio. Let $OPT(\widehat {G})$ be the maximum weight perfect matching in the realization graph $\widehat {G}$ and let $\mathbb {E}\left [ OPT(\widehat {G}) \right ]$ be its expectation across different realizations for a given expectation graph G and distribution $\mathcal {D}(J)$. $\mathbb {E}\left [ OPT(\widehat {G}) \right ]$ measures the performance of an optimal algorithm that has full information about the arrival sequence. This is known as an adaptive online adversary. The ratio $\frac {\mathbb {E}\left [ ALG(\widehat {G}) \right ]}{\mathbb {E}\left [ OPT(\widehat {G}) \right ]}$ measures the performance of ALG relative to the optimal algorithm for a given instance of the problem. The competitive ratio is the worst-case, i.e. lowest, ratio among all possible instances of the expectation graph G and distributions $\mathcal {D}(J)$:

Definition 1 (Competitive Ratio)

An algorithm ALG is said to have a competitive ratio of α when α is the largest value such that, for all instances of the expectation graph G and distribution $\mathcal {D}(J)$,

$$ \alpha \le \frac{\mathbb{E}\left[ ALG(\widehat{G}) \right]}{\mathbb{E}\left[ OPT(\widehat{G}) \right]}. $$
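For very small instances, the denominator $\mathbb{E}[OPT(\widehat{G})]$ in Definition 1 can be computed exactly by enumerating every arrival sequence and every perfect matching. The following is a brute-force sketch for illustration only; the function names and the two-worker instance are hypothetical, and the running time grows as $k^{n} \cdot n!$.

```python
from itertools import permutations, product

def opt_value(u, arrivals):
    """Maximum-weight perfect matching value on one realization (brute force)."""
    n = len(u)
    # perm[t] is the worker assigned to the job that arrived at time t.
    return max(sum(u[perm[t]][arrivals[t]] for t in range(n))
               for perm in permutations(range(n)))

def expected_opt(u, probs):
    """E[OPT] by enumerating all k^n arrival sequences; only for tiny instances."""
    n, k = len(u), len(probs)
    total = 0.0
    for arrivals in product(range(k), repeat=n):
        weight = 1.0
        for j in arrivals:
            weight *= probs[j]          # i.i.d. draws: probabilities multiply
        total += weight * opt_value(u, arrivals)
    return total

# Hypothetical instance: 2 workers, 2 job types, worker w only gains from type w.
u = [[1, 0], [0, 1]]
print(expected_opt(u, [0.5, 0.5]))  # 1.5
```

In this instance, two of the four equally likely sequences contain both job types (value 2) and two contain a single type (value 1), which gives 1.5.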

2.1 Bounding the Performance of OPT

It is difficult to compute $\mathbb {E}\left [ OPT(\widehat {G}) \right ]$ directly. We show that the randomness in $\widehat {G}$ reduces the expected value of the optimal perfect matching compared to the value of the optimal transportation problem where the number of jobs of each type is equal to its expectation. This offline transportation problem is then used to guide the online assignment.

A similar approach was used in the context of unweighted online imperfect bipartite matching by Feldman et al. [9] and Haeupler et al. [10]. Here, we use a transportation problem instead of a maximum weight matching. We also bound the performance of OPT differently.

Recall that, in expectation, r_j jobs of job type j ∈ J will arrive in $\widehat {G}$. An optimal fractional matching of these jobs is obtained by solving a fractional transportation problem on the expectation graph G, where each job type has a demand of r_j, each worker has a supply of 1, and the sum of utilities is maximized.

Formally, let f_wj ≥ 0 be the flow from worker w ∈ W to job type j ∈ J. This can be interpreted as a fractional assignment of worker w to jobs of job type j. We define the transportation problem TPP:

$$ \begin{array}{@{}rcl@{}} TPP(G) &=& \max_{f_{wj} \ge 0} \quad \sum\limits_{w \in W} \sum\limits_{j \in J} u_{wj} f_{wj},\\ \sum\limits_{w \in W} f_{wj} & =& r_{j} \quad \forall j \in J, \\ \sum\limits_{j \in J} f_{wj} & =& 1 \quad \forall w \in W. \end{array} $$

Let $f^{*}_{wj}$ be an optimal flow on edge [w,j] ∈ E.
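To make the constraints of TPP concrete, the sketch below checks whether a candidate flow is feasible and evaluates its objective. It does not solve the problem; an LP or min-cost-flow solver would be needed for that. The function names and the instance are hypothetical.

```python
def tpp_objective(u, f):
    """Objective of TPP: sum over workers w and job types j of u[w][j] * f[w][j]."""
    return sum(u[w][j] * f[w][j] for w in range(len(u)) for j in range(len(u[w])))

def is_feasible_flow(f, r, tol=1e-9):
    """Check f >= 0, column sums equal r_j (demand), and row sums equal 1 (supply)."""
    n, k = len(f), len(r)
    nonneg = all(f[w][j] >= -tol for w in range(n) for j in range(k))
    demand = all(abs(sum(f[w][j] for w in range(n)) - r[j]) <= tol for j in range(k))
    supply = all(abs(sum(f[w]) - 1.0) <= tol for w in range(n))
    return nonneg and demand and supply

# Hypothetical instance: 2 workers, 2 equally likely job types, so r = [1, 1].
u = [[1, 0], [0, 1]]
r = [1.0, 1.0]
f = [[1.0, 0.0], [0.0, 1.0]]   # candidate flow: worker w serves job type w
print(is_feasible_flow(f, r), tpp_objective(u, f))  # True 2.0
```

Here the candidate flow is also optimal: every worker ships exactly one unit and no edge has utility above 1, so no feasible flow can exceed an objective of 2.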

We claim that $\mathbb {E}\left [ OPT(\widehat {G}) \right ] \le TPP(G)$. The reason is that the weighted average of perfect matchings $OPT(\widehat {G})$ forms a feasible solution to the transportation problem above.

Lemma 1

Given any expectation graph G and distribution over job types $\mathcal {D}(J)$,

$$ \mathbb{E}\left[ OPT(\widehat{G}) \right] \le TPP(G). $$

Proof

Assign each edge in G an indicator variable I_wj, which takes on the value 1 if OPT assigns worker w to a job of type j in $\widehat {G}$ and 0 otherwise. We claim that $f_{wj} = \mathbb {E}\left [ I_{wj}\right ]$ forms a feasible solution to the transportation problem in G. Indeed,

$$ \sum\limits_{w \in W}\mathbb{E}\left[ I_{wj}\right] = \mathbb{E}\left[ \sum\limits_{w \in W} I_{wj} \right] = r_{j}, \qquad \sum\limits_{j \in J}\mathbb{E}\left[ I_{wj}\right] = \mathbb{E}\left[ \sum\limits_{j \in J} I_{wj} \right] = 1. $$

Since $\mathbb {E}\left [I_{wj}\right ]$ is feasible for the transportation problem, its objective value can be no larger than TPP(G):

$$ \mathbb{E}\left[OPT(\widehat{G})\right] = \mathbb{E}\left[ \sum\limits_{[w, j] \in E} u_{wj} I_{wj} \right] = \sum\limits_{[w,j] \in E} u_{wj} \mathbb{E}\left[I_{wj}\right] \leq TPP(G). $$

□

This implies that we can bound the performance of an algorithm with respect to TPP(G). We apply this technique in Section 3.3.

3 A 1/2-Competitive Algorithm

3.1 The Dispatch Algorithm

Before any jobs arrive, Dispatch solves the offline transportation problem TPP on the expectation graph G. We find an optimal flow $f^{*}_{wj}$ from workers to jobs. Throughout the online stage, the algorithm reconstructs this flow between job types and workers as much as possible. For each arriving job, a preferred worker w^P is randomly selected with a probability proportional to the optimal flow f^∗ between the corresponding job type and the worker in the transportation problem. If the preferred worker is available, then the job is assigned to this worker. If the preferred worker is no longer available, then the job is assigned to a worker selected uniformly at random from the set of available workers AW. In either case, we refer to the worker that receives the job as the assigned worker w^A. The resulting assignment forms a perfect matching on $\widehat {G}$ since each worker is assigned at most once and each job is assigned to a worker.
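The online stage described in this paragraph can be sketched as follows. This is an illustrative reading of the prose, not a reproduction of the paper's Algorithm 1 listing. It assumes the flow `f_star` has already been computed offline, and the names `dispatch`, `f_star`, `arrivals`, and `rng` are hypothetical.

```python
import random

def dispatch(f_star, arrivals, rng):
    """Online stage: return assigned[t], the worker that serves the job at time t.

    f_star[w][j] is a flow from the offline transportation problem and
    arrivals[t] is the job type at time t, with len(arrivals) == len(f_star).
    """
    n = len(f_star)
    available = set(range(n))
    assigned = []
    for j in arrivals:
        # Preferred worker: drawn with probability proportional to f*_{wj}.
        # A job type that can arrive has r_j > 0, so the weights are not all zero.
        weights = [f_star[w][j] for w in range(n)]
        preferred = rng.choices(range(n), weights=weights, k=1)[0]
        if preferred in available:
            worker = preferred
        else:
            # Fallback: uniform over the workers that are still available.
            worker = rng.choice(sorted(available))
        available.remove(worker)
        assigned.append(worker)
    return assigned

# Hypothetical instance: 2 workers, 2 job types, worker w only gains from type w.
u = [[1, 0], [0, 1]]
f_star = [[1.0, 0.0], [0.0, 1.0]]
arrivals = [0, 0]                      # job type 0 arrives twice
assigned = dispatch(f_star, arrivals, random.Random(1))
total = sum(u[w][j] for w, j in zip(assigned, arrivals))
print(assigned, total)  # [0, 1] 1
```

In this run the second job prefers worker 0 again, finds it busy, and falls back to the only remaining worker, so the matching is perfect but gains utility from the first job only.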

In the context of online bipartite matching, the idea of using an offline solution to guide the online algorithm was used in the “Suggested Matching” algorithm [9] and subsequent work, e.g. [10]. Our algorithm differs in two ways. First, the offline solution is a transportation problem instead of a maximum weight matching problem. Second, the job is randomly assigned instead of discarded when the preferred worker is no longer available. This random selection ensures that we obtain a perfect matching and is crucial for Lemma 3. The analysis of the competitive performance of Dispatch is also novel except for Lemma 2.

The algorithm is formally defined in Algorithm 1. We prove the following result:

Theorem 1

Dispatch achieves a competitive ratio of at least $\frac {1}{2}$ for the online perfect bipartite matching problem with i.i.d. arrivals.

3.2 Example

To illustrate Dispatch, we consider the example shown in Fig. 1. The example has five workers (n = 5) and three job types (k = 3). The expectation graph is shown in Fig. 1a. Note that the distribution over job types, $\mathcal {D}(J)$, is fully specified by r_j. An instance of the realization graph is shown in Fig. 1c.

Figure 1b shows f^∗, the solution to the transportation problem on G that is used by Dispatch. The corresponding objective value is TPP(G) = 8. Figure 1d to h show the arrival of the jobs and the corresponding assignment made by Dispatch. Figure 1h illustrates an instance where the preferred worker selected by Dispatch is not available, and a different worker is assigned. For this particular realization $\widehat {G}$, the perfect matching constructed by Dispatch has a total utility 6, while the optimal perfect matching on $\widehat {G}$ has a total utility 8. Note that these values are for this particular realization of $\widehat {G}$. The performance guarantee is with respect to the expectation over all realizations of $\widehat {G}$.

3.3 Proof of $\frac {1}{2}$-Competitiveness

To prove that the perfect matching produced by Dispatch has a competitive ratio of $\frac {1}{2}$, we rely on a key feature of Dispatch: it maintains the invariant, Lemma 4, that workers are equally likely to be available even though the distribution over job types may not be uniform. To prove this invariant, we first show that both the preferred and the assigned worker are selected uniformly across workers. Recall that the preferred worker may be different than the assigned worker. In fact, the preferred worker does not have to be available and could have been assigned to another job already. Lemma 2 states this formally for the selection of the preferred worker. The observation underlying this lemma is that each worker is selected with a probability proportional to the total flow f^∗ originating at the worker, which is equal to one for each worker.

Throughout this section we use additional notation. Let the random variable ${W_{t}^{P}}$ represent the preferred worker for the job arriving at time t, and let the random variable ${W_{t}^{A}}$ be the assigned worker. Furthermore, let the random set AW_t consist of the available workers when the job at time t arrives. We make no further assumptions on the expectation graph G and/or distribution $\mathcal {D}(J)$ other than those outlined in Section 2. Lemmas and theorems in this section are therefore applicable to all problem instances.

Lemma 2

At each time t, the preferred worker ${W^{P}_{t}}$ is drawn uniformly from all workers:

$$ \mathbb{P}\left( {W^{P}_{t}} = w\right) = \frac{1}{n} \quad \text{for all } w \in W \text{ and } t = 1,\dots, n. $$

Proof

By conditioning on the job type j_t at stage t and using the law of total probability, we can rewrite the probability of selecting worker w as:

$$ \mathbb{P}\left( {W^{P}_{t}} = w\right) = \sum\limits_{j \in J} \mathbb{P}\left( {W^{P}_{t}} = w | j_{t} = j\right) \mathbb{P}\left( j_{t} = j\right). $$

Since the jobs are drawn i.i.d., a job of type j is selected with probability $\mathbb {P}\left (j_{t} = j\right ) = \frac {r_{j}}{n}$, by definition of r_j. Given a job of type j, the algorithm selects a worker w as the preferred worker with probability $\mathbb {P}\left ({W^{P}_{t}} = w | j_{t} = j\right ) = \frac {f^{*}_{wj}}{r_{j}}$. Thus,

$$ \mathbb{P}\left( {W^{P}_{t}} = w\right) = \sum\limits_{j \in J} \frac{f^{*}_{wj}}{r_{j}} \frac{r_{j}}{n} = \sum\limits_{j \in J} \frac{f^{*}_{wj}}{n}. $$

Finally, recall that every worker supplies a unit of flow in the offline transportation problem, equivalent to the expected number of jobs it serves. The edges adjacent to worker w must thus transport a unit of flow, so ${\sum }_{j} f^{*}_{wj} = 1$. Thus, $\mathbb {P}\left ({W^{P}_{t}} = w\right ) = \frac {1}{n}$. □
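The computation in this proof can be replayed with exact rational arithmetic. The sketch below uses a hypothetical feasible flow for three workers and two equally likely job types; the proof only uses feasibility of the flow, so optimality is not needed for this check. The function name is an assumption made for illustration.

```python
from fractions import Fraction as F

def preferred_worker_distribution(f_star, r):
    """P(W^P_t = w) = sum_j (f*_wj / r_j) * (r_j / n), for each worker w."""
    n = len(f_star)
    return [sum((f_star[w][j] / r[j]) * (r[j] / n)
                for j in range(len(r)) if r[j] > 0)
            for w in range(n)]

# Hypothetical instance: n = 3 workers, k = 2 job types with probability 1/2 each,
# so r_j = 3/2. Rows of f_star sum to 1 and columns sum to 3/2.
r = [F(3, 2), F(3, 2)]
f_star = [[F(1), F(0)],
          [F(1, 2), F(1, 2)],
          [F(0), F(1)]]
print(preferred_worker_distribution(f_star, r))
```

Every entry of the result equals 1/3, even though worker 1 splits its flow across both job types while workers 0 and 2 do not.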

Next we show that the assigned worker is selected uniformly at random from the set of available workers. For this lemma to hold, it is crucial that the draw of the assigned worker is done uniformly at random when the preferred worker is not available. Recall that ${W_{t}^{A}}$ is the assigned worker for the job arriving at time t and that AW_t are the available workers before the job arrives.

Lemma 3

At each time step t, the assigned worker ${W^{A}_{t}}$ is drawn uniformly from the available workers:

$$ \mathbb{P}\left( {W^{A}_{t}} = w |w \in AW_{t}\right) = \frac{1}{n-(t-1)}. $$

Proof

Assume that w is fixed and that w ∈ AW_t. There are two ways for w to be the assigned worker. Either w is the preferred worker or the preferred worker is not available and w is randomly selected. We express this as:

$$ \begin{array}{@{}rcl@{}} \mathbb{P}\left( {W^{A}_{t}} = w | w \in AW_{t}\right) &=& \mathbb{P}\left( {W^{P}_{t}} = w | w \in AW_{t}\right)\\ &&+ \mathbb{P}\left( {W^{A}_{t}} = w | {W^{P}_{t}} \notin AW_{t}, w \in AW_{t}\right) \\ && \times \mathbb{P}\left( {W^{P}_{t}} \notin AW_{t} | w \in AW_{t}\right) \end{array} $$

The selection of ${W^{P}_{t}}$ is independent of whether w ∈ AW_t. Therefore,

$$ \begin{array}{@{}rcl@{}} \mathbb{P}\left( {W^{A}_{t}} = w | w \in AW_{t}\right) &=& \mathbb{P}\left( {W^{P}_{t}} = w \right) \\ &&+ \mathbb{P}\left( {W^{A}_{t}} = w | {W^{P}_{t}} \notin AW_{t}, w \in AW_{t}\right) \mathbb{P}\left( {W^{P}_{t}} \notin AW_{t}\right) \end{array} $$

Now we use three observations to complete the proof. First, Lemma 2 implies that $\mathbb {P}\left ({W^{P}_{t}} = w\right ) = \frac {1}{n}$. Second, since there are t − 1 busy workers, Lemma 2 implies that $\mathbb {P}\left ({W^{P}_{t}} \notin AW_{t}\right ) = \frac {(t-1)}{n}$. Third, the fact that the assigned worker is drawn uniformly at random when the preferred worker is not available implies that $\mathbb {P}\left ({W^{A}_{t}} = w | {W^{P}_{t}} \notin AW_{t}, w \in AW_{t}\right ) = \frac {1}{n-(t-1)}$. Thus,

$$ \mathbb{P}\left( {W^{A}_{t}} = w | w \in AW_{t}\right) = \frac{1}{n} + \frac{1}{n-(t-1)} \frac{(t-1)}{n} = \frac{1}{n-(t-1)}. $$

□
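The final simplification in the proof of Lemma 3 follows by placing both terms over the common denominator $n\,(n-(t-1))$:

$$ \frac{1}{n} + \frac{1}{n-(t-1)} \cdot \frac{t-1}{n} = \frac{\left(n-(t-1)\right) + (t-1)}{n\left(n-(t-1)\right)} = \frac{n}{n\left(n-(t-1)\right)} = \frac{1}{n-(t-1)}. $$

As a numerical illustration with n = 5 and t = 3, the left-hand side is $\frac{1}{5} + \frac{1}{3} \cdot \frac{2}{5} = \frac{3}{15} + \frac{2}{15} = \frac{1}{3}$, which matches $\frac{1}{n-(t-1)} = \frac{1}{3}$.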

Lemma 3 specifies each available worker is equally likely to be assigned to the next job. As a consequence, we can derive the probability that a worker is still available after t − 1 jobs have arrived:

Lemma 4

Dispatch maintains the following invariant throughout the online stage:

$$ \mathbb{P}\left( w \in AW_{t}\right) = \frac{n-(t-1)}{n} \quad \text{for all } w \in W \text{ and } t = 1,\dots, n. $$

Proof

At every time step, a worker is chosen randomly from the remaining available workers, as shown in Lemma 3. The probability that an available worker in time step t is still available in time step t + 1 is:

$$ \begin{array}{@{}rcl@{}} \mathbb{P}\left( w \in AW_{t+1} | w \in AW_{t}\right) &=& 1 - \mathbb{P}\left( {W^{A}_{t}} = w | w \in AW_{t}\right)\\ &=& 1 - \frac{1}{n-(t-1)} = \frac{n-t}{n-(t-1)}. \end{array} $$

Thus, the probability of being available for the t^th job is equal to:

$$ \begin{array}{@{}rcl@{}} \mathbb{P}\left( w \in AW_{t}\right) &=& {\prod}_{i=1}^{t-1} \mathbb{P}\left( w \in AW_{i+1} | w \in AW_{i}\right)\\ &=& \frac{n- (t-1)}{n - (t-2)} \frac{n- (t-2)}{n - (t-3)} {\dots} \frac{n-1}{n} = \frac{n-(t-1)}{n}. \end{array} $$

□
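The telescoping product in the proof of Lemma 4 can be checked exactly for small n. The sketch below multiplies the per-step survival probabilities from Lemma 3 using rational arithmetic; the function name is hypothetical.

```python
from fractions import Fraction

def availability_probability(n, t):
    """P(w in AW_t): product over steps i = 1..t-1 of the survival probability."""
    prob = Fraction(1)                  # every worker is available at t = 1
    for i in range(1, t):
        # At step i there are n - (i - 1) available workers; by Lemma 3 each
        # is assigned with probability 1 / (n - (i - 1)).
        prob *= 1 - Fraction(1, n - (i - 1))
    return prob

print(availability_probability(5, 3))  # 3/5
```

For n = 5 and t = 3 the product is $\frac{4}{5} \cdot \frac{3}{4} = \frac{3}{5}$, in agreement with $\frac{n-(t-1)}{n}$.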

From Lemma 4, we know the probability that a worker is available at each time step. We use this to bound the probability that a worker w is assigned to a job with job type j by Dispatch. We use the indicator random variable I_wj. I_wj = 1 when Dispatch assigns worker w to a job with job type j, and I_wj = 0 otherwise. We bound the probability with respect to $f^{*}_{wj}$ in TPP(G). By bounding the algorithm’s performance with respect to TPP(G) we can bound the competitive ratio of Dispatch. See Section 2.1 for more details.

Lemma 5

Given a perfect matching $\hat {M}$ constructed by Dispatch, the probability that worker w is assigned to a job of type j is bounded by:

$$ \mathbb{P}\left( I_{wj} = 1\right) \ge \frac{1}{2} f^{*}_{wj}. $$

Proof

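If I_wj = 1, then worker w must have been assigned to a job of type j in one of the time steps. Thus, $I_{wj} = {\sum }_{t=1}^{n} I^{t}_{wj}$ where $I^{t}_{wj}$ is the indicator for whether worker w is assigned to a job of type j at time step t. Since each worker is assigned at most once, at most one of these indicators equals 1, so the corresponding events are disjoint: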

$$ \mathbb{P}\left( I_{wj} = 1\right) = \sum\limits_{t=1}^{n} \mathbb{P}\left( I^{t}_{wj} = 1\right). $$

Let us bound the probability $\mathbb {P}\left (I^{t}_{wj} = 1\right )$ for all $t=1, \dots , n$. First, we condition on the job type arriving at time t. Note that j_t must equal j:

$$ \mathbb{P}\left( I^{t}_{wj} = 1\right) = \mathbb{P}\left( I^{t}_{wj} = 1 | j_{t} = j\right) \mathbb{P}\left( j_{t} = j\right). $$

Recall that there are two ways for worker w to be assigned after a job of type j arrives. Either w is the preferred worker and is assigned the job, or another worker $w^{\prime }$ is selected as the preferred worker but is not available, and w is then selected as the assigned worker. We lower bound the probability that worker w is assigned to the job of type j by considering only the case where w is the preferred worker.

$$ \begin{array}{@{}rcl@{}} \mathbb{P}\left( I^{t}_{wj} = 1\right) &\ge& \mathbb{P}\left( w \in AW_{t}, {W^{P}_{t}} = w | j_{t} = j\right) \mathbb{P}\left( j_{t} = j\right)\\ &=& \mathbb{P}\left( w \in AW_{t}\right) \mathbb{P}\left( {W^{P}_{t}} = w| j_{t} = j\right) \mathbb{P}\left( j_{t} = j\right)\\ &=& \frac{n-(t-1)}{n} \frac{f^{*}_{wj}}{r_{j}} \frac{r_{j}}{n}\\ &=& \frac{1}{n}\frac{n-(t-1)}{n} f^{*}_{wj}. \end{array} $$

For the first equality, we use that the job type at time t and the selection of the preferred worker are independent of whether w is available at time t. The second equality follows from Lemma 4, the weighted random selection of the preferred worker, and the job arrival process.

We use the bound $\mathbb {P}\left (I^{t}_{wj} = 1\right ) \ge \frac {1}{n}\frac {n-(t-1)}{n} f^{*}_{wj}$ to bound the total probability of assigning worker w to a job of type j:

$$ \mathbb{P}\left( I_{wj} = 1\right) = \sum\limits_{t=1}^{n} \mathbb{P}\left( I^{t}_{wj} = 1\right) \ge \sum\limits_{t=1}^{n} \frac{1}{n}\frac{n-(t-1)}{n} f^{*}_{wj} = \frac{1}{2} \frac{n+1}{n}f^{*}_{wj} \ge \frac{1}{2}f^{*}_{wj}. $$

□
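The sum in the last step of the proof of Lemma 5 is an arithmetic series. Substituting $s = n-(t-1)$, which runs from n down to 1 as t runs from 1 to n, gives:

$$ \sum\limits_{t=1}^{n} \frac{1}{n}\frac{n-(t-1)}{n} = \frac{1}{n^{2}} \sum\limits_{s=1}^{n} s = \frac{1}{n^{2}} \cdot \frac{n(n+1)}{2} = \frac{1}{2} \frac{n+1}{n}. $$

For example, with n = 5 the sum is $\frac{5+4+3+2+1}{25} = \frac{3}{5}$, which equals $\frac{1}{2} \cdot \frac{6}{5}$. The factor $\frac{n+1}{n}$ exceeds 1 for every n and tends to 1 as n grows, so the bound $\frac{1}{2} f^{*}_{wj}$ gives away very little for large n.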

Lemma 5 bounds the probability that worker w is matched to a job of type j. By linearity of expectation, Theorem 1 and the $\frac {1}{2}$ competitive ratio follow almost immediately from Lemma 5.

Proof of Theorem 1

The utility returned by the algorithm is a weighted sum of indicators $I_{wj}$, where $I_{wj} = 1$ when worker w is assigned to a job of type j and 0 otherwise. Note that each worker is assigned to at most one job (type). We can then apply Lemma 5 to bound the probability $\mathbb{P}\left(I_{wj} = 1\right)$ and thereby the expected utility of the algorithm:

$$ \begin{array}{@{}rcl@{}} \mathbb{E}\left[\textsc{Dispatch}{}(\widehat{G})\right] &=& \mathbb{E}\left[ \sum\limits_{w \in W, j \in J} u_{wj} I_{wj} \right] \\ &=& \sum\limits_{w \in W, j \in J} u_{wj} \mathbb{E}\left[I_{wj}\right] \\ &=& \sum\limits_{w \in W, j \in J} u_{wj} \mathbb{P}\left( I_{wj} = 1\right) \\ &\ge& \frac{1}{2} \sum\limits_{w \in W, j \in J} u_{wj} f^{*}_{wj}= \frac{1}{2} TPP(G). \end{array} $$

Note that the inequality requires that the utility weights are non-negative.

Finally, we apply Lemma 1 to obtain a bound on the competitive ratio attained by Dispatch for any expectation graph G and distribution $\mathcal {D}(J)$:

$$ \mathbb{E}\left[\textsc{Dispatch}{}(\widehat{G})\right] \ge \frac{1}{2} TPP(G) \ge \frac{1}{2} \mathbb{E}\left[OPT(\widehat{G})\right] $$

□
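To illustrate the chain of inequalities, the sketch below simulates Dispatch on a toy instance and compares it with TPP(G) and the expected offline optimum. The instance (utilities `U`, arrival probabilities `PROBS`) is hypothetical and not taken from the paper, and the transportation solution `F_STAR` is assumed optimal after solving this small instance by hand. The Dispatch value is a Monte Carlo estimate under these assumptions, not a proof.

```python
import random
from fractions import Fraction
from itertools import permutations, product

# Hypothetical toy instance (not from the paper): 3 workers, 2 job types.
U = [[3, 1], [1, 2], [0, 2]]              # U[w][j]: utility of worker w on job type j
PROBS = [Fraction(1, 3), Fraction(2, 3)]  # arrival probability of each job type
N, K = len(U), len(PROBS)                 # N workers and N arrivals, K job types
# Assumed optimal fractional transportation solution f*, solved by hand:
# worker 0 serves type 0, workers 1 and 2 serve type 1 (r_j = N * PROBS[j] = 1, 2).
F_STAR = [[1, 0], [0, 1], [0, 1]]
TPP = sum(U[w][j] * F_STAR[w][j] for w in range(N) for j in range(K))


def dispatch_once(rng: random.Random) -> int:
    """One run of Dispatch: preferred worker if available, else a random available one."""
    available = list(range(N))
    total = 0
    for _ in range(N):
        j = rng.choices(range(K), weights=[float(q) for q in PROBS])[0]
        # Preferred worker is drawn with probability f*_wj / r_j (weights are proportional).
        preferred = rng.choices(range(N), weights=[F_STAR[w][j] for w in range(N)])[0]
        w = preferred if preferred in available else rng.choice(available)
        available.remove(w)
        total += U[w][j]
    return total


def expected_opt() -> Fraction:
    """Exact E[OPT]: enumerate arrival sequences and brute-force the best matching."""
    exp = Fraction(0)
    for jobs in product(range(K), repeat=N):
        prob = Fraction(1)
        for j in jobs:
            prob *= PROBS[j]
        best = max(sum(U[w][j] for w, j in zip(perm, jobs))
                   for perm in permutations(range(N)))
        exp += prob * best
    return exp


rng = random.Random(0)
trials = 20000
dispatch_estimate = sum(dispatch_once(rng) for _ in range(trials)) / trials
print(TPP, float(expected_opt()), dispatch_estimate)
```

On this instance TPP(G) = 7 and the enumerated expected optimum is 164/27, so the Dispatch estimate should land above TPP(G)/2 = 3.5, consistent with Theorem 1.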

4 Best-Possible Competitive Ratio

We present here a family of instances for which any online algorithm attains a competitive ratio of at most $\frac {1}{2}$. The Dispatch algorithm guarantees a competitive ratio of $\frac {1}{2}$ and is thus optimal with respect to competitive ratio.

Theorem 2

For the online perfect bipartite matching problem with an i.i.d. arrival process, no online algorithm can achieve a competitive ratio better than $\frac {1}{2}$ .

Proof

Consider an instance G with k = n + 1 job types. Let the job types be indexed from 0 to n and the workers from 1 to n. Job types 1 to n each arrive with probability p/n, and job type 0 arrives with probability 1 − p. For this graph, we set u_wj = 1 if w = j and u_wj = 0 otherwise. In particular, u_w,0 = 0 for all w ∈ W. See Fig. 2 for an illustration.

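Note that OPT gains a utility of one per unique job type in {1,…,n} that arrives. To compute the expected number of unique job types, consider the first arrival time of each job type as a geometric random variable with a success probability of $\frac {p}{n}$: the job type fails to arrive in all n time steps with probability $\left (1 - \frac {p}{n}\right )^{n}$, so it arrives at least once with probability $1 - \left (1 - \frac {p}{n}\right )^{n}$. By linearity of expectation over the n job types, $\mathbb {E}\left [OPT(\widehat {G})\right ] = n \left (1 - \left (1 - \frac {p}{n}\right )^{n}\right )$.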
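The expression for the expected offline optimum can be checked by brute force on a small case. The following Python sketch enumerates every arrival sequence for this instance and compares the result with the closed form; the function names and the choice n = 3, p = 1/2 are illustrative assumptions, not part of the paper.

```python
from fractions import Fraction
from itertools import product


def expected_opt_enumerated(n: int, p: Fraction) -> Fraction:
    """Exact expected number of distinct job types from {1..n} among n i.i.d. arrivals."""
    probs = [1 - p] + [p / n] * n          # index 0 is the zero-utility job type
    total = Fraction(0)
    for jobs in product(range(n + 1), repeat=n):
        prob = Fraction(1)
        for j in jobs:
            prob *= probs[j]
        # OPT earns one unit per distinct job type other than type 0.
        total += prob * len(set(jobs) - {0})
    return total


def expected_opt_closed_form(n: int, p: Fraction) -> Fraction:
    """Closed form n * (1 - (1 - p/n)^n) from the text."""
    return n * (1 - (1 - p / n) ** n)


n, p = 3, Fraction(1, 2)
assert expected_opt_enumerated(n, p) == expected_opt_closed_form(n, p)
print(expected_opt_closed_form(n, p))
```

The enumeration visits $(n+1)^{n}$ sequences, so it is only practical for very small n.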

For any online algorithm ALG^∗, exactly t − 1 workers are no longer available at time step t, regardless of the strategy. Thus, with probability at least $(1-p) + p \frac {t-1}{n}$ the increase in utility is zero: either the arriving job has type 0, or the only worker with positive utility for it is already assigned. Hence the expected utility increases by at most $p \frac {n - (t-1)}{n}$ in time step t. The total expected utility obtained by ALG^∗ is then bounded by:

$$ \mathbb{E}\left[ALG^{*}(\widehat{G})\right] \le p \frac{n}{n} + p \frac{n-1}{n} + p \frac{n-2}{n} + {\dots} + p \frac{1}{n} = \frac{1}{2} p (n+1) $$

We compute the relevant ratio and then take the limit as n goes to infinity:

$$ \lim_{n \rightarrow \infty} \frac{\mathbb{E}\left[ALG^{*}(\widehat{G})\right]}{\mathbb{E}\left[OPT(\widehat{G})\right]} \le \lim_{n \rightarrow \infty} \frac{\frac{1}{2} p (n+1)}{n \left( 1 - \left( 1 - \frac{p}{n}\right)^{n}\right)} = \frac{1/2 \cdot p}{1 - e^{-p}} $$

Since p can take on any value in the interval (0,1), we consider the limit as p goes to zero:

$$ \lim_{p \rightarrow 0^{+}} \frac{1/2 \cdot p}{1 - e^{-p}} = \frac{1}{2}. $$

□
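The two limits in the proof can also be observed numerically. The sketch below evaluates the upper bound on the ratio for a large n and several values of p, alongside the limiting expression in p; the function names and the sample values are assumptions made for illustration only.

```python
import math


def ratio_upper_bound(n: int, p: float) -> float:
    """Upper bound on E[ALG*] / E[OPT] for the instance with n workers and parameter p."""
    alg_upper = 0.5 * p * (n + 1)
    opt = n * (1.0 - (1.0 - p / n) ** n)
    return alg_upper / opt


def limit_in_n(p: float) -> float:
    """Limit of the bound as n grows: (p / 2) / (1 - exp(-p))."""
    # -expm1(-p) equals 1 - exp(-p) and stays accurate for small p.
    return 0.5 * p / (-math.expm1(-p))


for p in (1.0, 0.5, 0.1, 0.01):
    print(p, ratio_upper_bound(10**6, p), limit_in_n(p))
```

As p decreases, both columns approach $\frac{1}{2}$ from above, matching the final limit in the proof.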

Corollary 1

Dispatch achieves the best-possible competitive ratio of $\frac {1}{2}$ for the Online Perfect Bipartite Matching problem.

5 Conclusion

In this paper, we examine the problem of online perfect bipartite matching with i.i.d. arrivals from a known distribution. We present the Dispatch algorithm, which attains a competitive ratio of $\frac {1}{2}$, and we show that this is the best possible. Thus, Dispatch is optimal in terms of competitive ratio.

There is an intriguing difference between online perfect bipartite matching algorithms for minimization and the Dispatch algorithm for maximization. Whereas the competitive ratio for minimization is bounded logarithmically, a constant bound was obtained for maximization with i.i.d. arrivals. This raises the question of whether a constant competitive ratio is possible for minimization with i.i.d. arrivals.

It may be possible to translate the analysis in this work to other contexts. Our analysis relied on two key ideas: the use of the expectation graph, and the proof that, regardless of how the jobs arrive, the Dispatch algorithm effectively translates the non-uniform sampling over jobs to a uniform sampling over workers.

References

Agatz, N., Erera, A., Savelsbergh, M., Wang, X.: Optimization for dynamic ride-sharing: A review. Eur. J. Oper. Res. 223(2), 295–303 (2012)
Article Google Scholar
Aksin, Z., Armony, M., Mehrotra, V.: The modern call center: A multi-disciplinary perspective on operations management research. Prod. Oper. Manag. 16(6), 665–688 (2007)
Article Google Scholar
Bansal, N., Buchbinder, N., Gupta, A., Naor, J. S.: An $O(\log ^{2} k)$-competitive algorithm for metric bipartite matching. In: European symposium on algorithms. pp. 522–533. Springer (2007), https://doi.org/10.1007/978-3-540-75520-3_47
Brubach, B., Sankararaman, K. A., Srinivasan, A., Xu, P.: New algorithms, better bounds, and a novel model for online stochastic matching. In: 24th annual European symposium on algorithms. vol. 57, pp. 24:1–24:16. Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik (2016), https://doi.org/10.4230/LIPIcs.ESA.2016.24
Bubeck, S., Cohen, M. B., Lee, Y. T., Lee, J. R., Madry, A.: K-server via multiscale entropic regularization. In: Proceedings of the 50th annual ACM SIGACT symposium on theory of computing. pp. 3–16. ACM (2018), https://doi.org/10.1145/3188745.3188798
Correa, J., Foncea, P., Hoeksma, R., Oosterwijk, T., Vredeveld, T.: Posted price mechanisms for a random stream of customers. In: Proceedings of the 2017 ACM conference on economics and computation. pp. 169–186. ACM (2017), https://doi.org/10.1145/3033274.3085137
Dehghani, S., Ehsani, S., Hajiaghayi, M., Liaghat, V., Seddighin, S.: Stochastic k-server: How should uber work?. In: 44th international colloquium on automata, languages, and programming. vol. 80, pp. 126:1–126:14. Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik, Dagstuhl, Germany (2017)
Feldman, J., Korula, N., Mirrokni, V., Muthukrishnan, S., Pál, M.: Online ad assignment with free disposal. In: International workshop on internet and network economics. pp. 374–385. Springer (2009), https://doi.org/10.1007/978-3-642-10841-9_34
Feldman, J., Mehta, A., Mirrokni, V., Muthukrishnan, S.: Online stochastic matching: Beating 1-1/e. In: 50th annual IEEE symposium on foundations of computer science. pp. 117–126. IEEE (2009), https://doi.org/10.1109/FOCS.2009.72
Haeupler, B., Mirrokni, V. S., Zadimoghaddam, M.: Online stochastic weighted matching: Improved approximation algorithms. In: International workshop on internet and network economics. pp. 170–181. Springer (2011), https://doi.org/10.1007/978-3-642-25510-6_15
Kalyanasundaram, B., Pruhs, K.: Online weighted matching. J. Algorithms 14(3), 478–488 (1993). https://doi.org/10.1006/jagm.1993.1026
Article MathSciNet MATH Google Scholar
Karp, R. M., Vazirani, U. V., Vazirani, V. V.: An optimal algorithm for on-line bipartite matching. In: Proceedings of the 22nd annual ACM symposium on theory of computing. pp. 352–358. ACM (1990), https://doi.org/10.1145/100216.100262
Kesselheim, T., Radke, K., Tönnis, A., Vöcking, B.: An optimal online algorithm for weighted bipartite matching and extensions to combinatorial auctions. In: European symposium on algorithms. pp. 589–600. Springer (2013), https://doi.org/10.1007/978-3-642-40450-4_50
Khuller, S., Mitchell, S. G., Vazirani, V. V.: On-line algorithms for weighted bipartite matching and stable marriages. Theor. Comput. Sci. 127(2), 255–267 (1994). https://doi.org/10.1016/0304-3975(94)90042-6
Article MathSciNet MATH Google Scholar
Koutsoupias, E.: The k-server problem. Comput. Sci. Rev. 3(2), 105–118 (2009). https://doi.org/10.1016/j.cosrev.2009.04.002
Article MATH Google Scholar
Mahdian, M., Yan, Q.: Online bipartite matching with random arrivals: an approach based on strongly factor-revealing lps. In: Proceedings of the 43rd annual ACM symposium on Theory of computing. pp. 597–606. ACM (2011), https://doi.org/10.1145/1993636.1993716
Manasse, M. S., McGeoch, L. A., Sleator, D. D.: Competitive algorithms for server problems. J. Algorithms 11(2), 208–230 (1990). https://doi.org/10.1016/0196-6774(90)90003-W
Article MathSciNet MATH Google Scholar
Manshadi, V. H., Gharan, S. O., Saberi, A.: Online stochastic matching: Online actions based on offline statistics. Math. Oper. Res. 37(4), 559–573 (2012). https://doi.org/10.1287/moor.1120.0551
Article MathSciNet MATH Google Scholar
Mehta, A., et al.: Online matching and ad allocation. Foundations and Trends in Theoretical Computer Science 8(4), 265–368 (2013). https://doi.org/10.1561/0400000057
Article MathSciNet MATH Google Scholar
Meyerson, A., Nanavati, A., Poplawski, L.: Randomized online algorithms for minimum metric bipartite matching. In: Proceedings of the 17th annual ACM-SIAM symposium on discrete algorithms. pp. 954–959. Society for Industrial and Applied Mathematics (2006)
Raghvendra, S.: A robust and optimal online algorithm for minimum metric bipartite matching. In: Approximation, randomization, and combinatorial optimization. Algorithms and techniques. vol. 60, pp. 18:1–18:16. Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik (2016), https://doi.org/10.4230/LIPIcs.APPROX-RANDOM.2016.18
Ross, S. M.: Introduction to probability models. Academic press (2014)
Su, X., Zenios, S. A.: Patient choice in kidney allocation: A sequential stochastic assignment model. Oper. Res. 53(3), 443–455 (2005)
Article MathSciNet Google Scholar

Download references

Acknowledgments

D.S. Hochbaum was supported in part by NSF award CMMI 1760102.

Author information

Authors and Affiliations

Fuqua School of Business, Duke University, Durham, NC, 27708, USA
Minjun Chang
Department of Industrial Engineering and Operations Research, University of California, Berkeley, CA, 94720, USA
Dorit S. Hochbaum, Quico Spaen & Mark Velednitsky

Corresponding author

Correspondence to Mark Velednitsky.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article is part of the Topical Collection on Special Issue on Approximation and Online Algorithms 2018

About this article

Cite this article

Chang, M., Hochbaum, D.S., Spaen, Q. et al. An Optimally-Competitive Algorithm for Maximum Online Perfect Bipartite Matching with i.i.d. Arrivals. Theory Comput Syst 64, 645–661 (2020). https://doi.org/10.1007/s00224-019-09947-7

Published: 07 September 2019
Issue Date: May 2020
DOI: https://doi.org/10.1007/s00224-019-09947-7
