3.1 Introduction

Within the SM framework, the Higgs boson is responsible for the dynamic generation of masses of all SM particles via the Brout–Englert–Higgs mechanism. In the fermion sector, Higgs interacts via yukawa couplings which are proportional to the fermion masses. Top quark being the heaviest SM fermion, may play a yet unknown role in the electroweak symmetry breaking. This makes the top-Higgs yukawa coupling an important probe for BSM physics. Although indirect constraints on its value (via gluon fusion and the \(H \rightarrow \gamma \gamma \) loop contribution) are available, its direct measurement is possible only via studying top quark associated Higgs production (\(t\bar{t}H\)and tH). This article overviews all major Run-2 \(t\bar{t}H\)searches: \(\boldsymbol{t\bar{t}H (H\rightarrow b\bar{b})}\) [3, 4], \(\boldsymbol{t\bar{t}H (H\rightarrow \gamma \gamma )}\) [5] and \(\boldsymbol{t\bar{t}H}\) Multi-lepton [6] search (which combines \(\boldsymbol{t\bar{t}H (H\rightarrow VV^{*})}\)Footnote 1 [7] and \(\boldsymbol{t\bar{t}H (H\rightarrow \tau \tau )}\) [8] searches) (Fig. 3.1).

Fig. 3.1
figure 1

LO Feynman diagrams for top quark associated Higgs boson production (\(t\bar{t}H\)) with Higgs decaying to pair of b-quarks (left), W-bosons (middle) and Z-bosons (right)

3.2 Analysis Strategy and Event Categorization

The general analysis strategy of all \(t\bar{t}H\)searches involve categorization to seperate events into categories of varying sensitivity. Signal over background discrimination in each category is then enhanced via. Dedicated shape analysis of a usually multi-variate (MVA) discriminator. Finally, a binned maximum likelihood fit on the distribution of this discriminator is performed for signal extraction and limit computation. The major categories employed by the \(t\bar{t}H\)searches are as follows:

  1. 1.

    \(\boldsymbol{t\bar{t}H (H\rightarrow b\bar{b})}\) \(:\) This search benefits from the large SM \(H \rightarrow b\bar{b}\) branching fraction (\(\sim \)57%) but is limited by systematic uncertainity on the (irreducible) \(t\bar{t}+b\bar{b}\) background. It is divided into two primary channels (based on the hadronic or leptonic decays of the top quarks):

(a):

Hadronic channel \(:\) Events are first selected using dedicated multi-Jet triggers. They are then divided into six categories depending on jet and b-tagged jet multiplicity. These are (7Jets, 3b-tagged), (7Jets, \(\ge \)4b-tagged), (8Jets, 3b-tagged), (8Jets, \(\ge \)4b-tagged), (\(\ge \)9Jets, 3b-tagged), and (\(\ge \)9Jets, \(\ge \)4b-tagged). MEMFootnote 2 was employed in all the categories to distinguish signal (\(t\bar{t}H\)) from background (\(t\bar{t}+b\bar{b}\)) and was also used for signal extraction.

(b):

Leptonic channel \(:\) Events are first selected using single lepton triggers and divided into two main categories (each of which is further split by lepton flavor).

\(*\):

Single Lepton category \(:\) Events are split into (4Jets, \(\ge \)3b-tagged), (5Jets, \(\ge \)3b-tagged), and (\(\ge \)6Jets, \(\ge \)3b-tagged) categories. DNNsFootnote 3 are trained to seperate \(t\bar{t}H\)signal from \(t\bar{t}+X\) background (where \(X =b\bar{b}/c\bar{c}\)/2b/b and light flavors).

\(*\):

Di-Lepton category \(:\) Events are divided into (\(\ge \)4Jets, 3b-tagged) and (\(\ge \)4Jets, \(\ge \)4b-tagged) categories with BDTsFootnote 4 trained to distinguish \(t\bar{t}H\)signal from \(t\bar{t}\) background in both. While the former uses this BDT itself for signal extraction the latter gives this as input to MEM discrimintaor.

  1. 2.

    \(\boldsymbol{t\bar{t}H (H\rightarrow \gamma \gamma )}\) \(:\) Events are first selected using asymmetricFootnote 5 di-photon triggers. This search is itself a channel under the general SM \(H\rightarrow \gamma \gamma \) search and utilizes special BDT for tagging \(t\bar{t}H\)multi-jet events. This BDT uses the following variables as inputs:

    • Number of jets (\(p_{T} > 25\) GeV).

    • Leading jet \(p_{T}\).

    • 2 jets with the highest b-tagging score.

Contributions of other SM (non \(t\bar{t}H\)) \(H\rightarrow \gamma \gamma \) processes e.g., Gluon fusion, VBFFootnote 6 are treated as background by this BDT. Additional BDTs are used for photon identification and di-photon vertex assignment. This search too is split into Leptonic and Hadronic channels targetting semi-leptonic and hadronic top decays.

(a):

Leptonic channel \(:\) Events enter this channel if they satisfy the following kinematic selections:

\(*\):

\(\ge \)1 Lepton (\(p_{T} > 20\) GeV) non-overlapping with any photon.

\(*\):

\(\ge \)2 Jets, (\(p_{T} > 25\) GeV, \(|\eta | < \) 2.4) non-overlapping with any photon or lepton.

\(*\):

\(\ge \)1 Medium b-tagged Jet.

\(*\):

Di-photon BDT score \(>0.11\).

(b):

Hadronic channel \(:\) This comprises events passing the following kinematic selections:

\(*\):

\(\ge \)3 Jets, (\(p_{T} > 25\) GeV, \(|\eta | < \) 2.4) non-overlapping with any photon.

\(*\):

\(\ge \)1 Loose b-tagged Jet.

\(*\):

No Lepton in the event (passing the lepton selections of the Leptonic channel described above).

\(*\):

High score (>0.75) on the \(t\bar{t}H\)multi-jet tagging BDT (described above).

\(*\):

Di-photon BDT score \(>0.4\).

In addition to this, di-photon invariant mass dependent \(p_{T}\) cuts are applied to selected photons in both channels to get a distortion free di-photon mass spectrum.

  1. 3.

    \(\boldsymbol{t\bar{t}H}\) Multi-lepton \(:\) In this search, events are first selected using lepton or lepton \(+ \tau _{h}\) triggers. They are then required to have \(\ge \)2 loose b-tagged jetsFootnote 7 out of which \(\ge \)1 is medium b-tagged. A special BDT trained on simulated \(t\bar{t}H\)(\(t\bar{t}\)) events as signal (background) is used to distinguish “prompt” leptons (produced by W/Z/leptonic \(\tau \) decays) from “non-prompt” leptons (produced in b-hadron decays, decays-in-flight, and photon conversions). Leptons passing (failing) it are called tight (loose) leptons in this analysis. An additional di-lepton invariant mass cut (\(m_{ll} > 12\) GeV) is applied to all channels to reject phase space dominated by low mass SM di-lepton resonances (e.g., \(J/\psi \), \(\varUpsilon \) etc.) which are not well modeled in simulation. The major channels of this search (which are also split by lepton flavor) are

(a):

2 Lepton Same-Sign channel (2lSS) \(:\) Events containing exactly 2 same charge tight leptons (\(p^{lep1/2}_{T}>\) 25/15 GeV) and \(\ge \) 4 Jets. Additional selections are applied to reduce backgrounds due to conversions and lepton charge mis-identification.

(b):

3 Lepton channel (3l) \(:\) Events containing exactly 3 leptons (\(p^{lep1/2/3}_{T}>\) 25/15/15 GeV) with additional selections for rejection of backgrounds due to \(Z\rightarrow ll\) events and conversions.

(c):

4 Lepton channel (4l) \(:\) Events passing exactly the same selections as 3l channel but now having an additional requirement of a fourth lepton (\(p_{T}>\) 10 GeV) in the event.

(d):

1 Lepton + 2\(\boldsymbol{\tau _{h}}\) channel (1l+2\(\boldsymbol{\tau _{h}}\)) \(:\) Events containing exactly 1 tight lepton (\(p^{e/\mu }_{T}>\) 25/20 GeV, \(|\eta |<\) 2.1) and 2 opposite charge tight \(\tau _{h}\) leptons (\(p_{T}>\) 30 GeV each). Events should also contain \(\ge \) 3 Jets.

(e):

2 Lepton Same-Sign + 1\(\boldsymbol{\tau _{h}}\) channel (2lSS+1\(\boldsymbol{\tau _{h}}\)) \(:\) Events containing exactly 2 same charge tight leptons (\(p^{lep1/2}_{T}>\) 25/15 GeV for electrons, \(p^{lep1/2}_{T}>\) 25/10 GeV for muons) and 1 tight \(\tau _{h}\) (\(p_{T}>\) 30 GeV) with sign opposite to that of the leptons. Events should also contain \(\ge \) 3 Jets. Additional selections were applied to reduce background due to charge mis-identification and \(Z\rightarrow ll\).

(f):

3 Lepton + 1\(\boldsymbol{\tau _{h}}\) channel (3l+1\(\boldsymbol{\tau _{h}}\)) \(:\) Events containing \(\ge \) 3 tight leptons (\(p^{lep1/2/3}_{T}>\) 20/10/10 GeV) and \(\ge \) 1 tight \(\tau _{h}\) (\(p_{T}>\) 30 GeV). Sum of the charges of the leptons and \(\tau _{h}\) must be zero. Additional selections were applied to reduce background due to charge mis-identification and \(Z\rightarrow ll\).

Events containing \(\tau _{h}\) are vetoed in the pure leptonic channels ((a), (b), and (c)) to keep them orthogonal to the lepton + \(\tau _{h}\) channels ((d), (e), and (f)) For the 2lSS and 3l channels, a pair of BDTs (one each for training against \(t\bar{t}V\) and \(t\bar{t}\) backgrounds vs the \(t\bar{t}H\)signal) was used for enhanced sensitivity. The BDTs for 2lSS included discriminators for “hadronic top tagging”. For signal extraction, the BDT pairs were mapped onto one dimension using “Likelihood based clustering”. The minimum invariant mass of the opposite sign di-lepton pair was used for signal extraction for the 4l channel. BDT (trained to distinguish \(t\bar{t}H\)from \(t\bar{t}\)) was used as final discriminator in 1l+2\(\tau _{h}\) channel. MEM (useful in separating \(t\bar{t}H\)from \(t\bar{t}V\) and \(t\bar{t}\) backgrounds) was used for signal extraction in 2lSS+1\(\tau _{h}\) channel. BDT pair (one each trained against \(t\bar{t}V\) and \(t\bar{t}\)) mapped into a one-dimensional discriminant was used for signal extraction in 3l+1\(\tau _{h}\) channel.

3.3 Backgrounds

Backgrounds in all the \(t\bar{t}H\)searches can be classified into three major categories depending on their source/origin.

  1. 1.

    Reducible/Fake backgrounds \(:\) These backgrounds arise predominantly due to Jets faking the final state particles (\(\gamma \)/leptons/\(\tau _h\) or b-jets) used in the searches. They are mostly either estimated from data or from simulation (with their yield and shape corrected from background rich sidebands in data). In the \(t\bar{t}H\)Multi-Lepton search, the probability of a Jet to pass tight selections is measured in a multi-jet enriched sideband in data (measurement region). This probability is then used to reweight another data sideband having the same selections as the signal region but with relaxed lepton identification requirements (application region). This reweighted sideband is then used to estimate fake background in all channels.

  2. 2.

    Charge Flip background \(:\) This background is caused due to mis-identification of lepton charge inside the detector (due to inelastic scattering or missing hits in the tracker). It is measured from data via.  “Tag and Probe” method in \(Z \rightarrow ll\) events in bins of lepton \(p_{T}\) and \(|\eta |\). It is measured to be \(\sim \) \(10^{-3}\) for electrons and was found to be negligible for muons.

  3. 3.

    Ir-reducible backgrounds \(:\) This background is caused by genuine physical processes having the same final state particles as in the signal process. They are usually estimated via.  Simulation and their modeling validated using control regions in data. Examples include \(t\bar{t}V\) and WW/WZ/ZZ backgrounds.

3.4 Results

Results of the above mentioned \(t\bar{t}H\)searches with the data collected by CMS in 2016 (35.9 \(fb^{-1}\) at \(\sqrt{s} =\) 13 TeV) were combined with the CMS Run-1 datasetFootnote 8 to obtain a “5 \(\sigma \) discovery” of \(t\bar{t}H\)process (Fig. 3.2) [9]. For the 2017 dataset,Footnote 9 public results were available only for the \(t\bar{t}H\)Multi-lepton search at the time of the symposium [10]. A new category (2l + 2\(\tau _{h}\)) was added by this search on top of the existing ones and the combined (2016 + 2017) signal strength from the \(t\bar{t}H\)Multi-lepton search alone equals \(0.96^{+0.34}_{-0.31}\) (\(1.00^{+0.30}_{-0.27}\)) observed (expected) in units of the SM expectation.

Fig. 3.2
figure 2

Results of the CMS Run-1+Run-2 \(t\bar{t}H\)combination. Likelihood scan of the signal strength (a) and Channel-wise signal strength split by era (b)