1 Introduction

In hadron-hadron collisions, the hard scattering of partons is accompanied by additional activity from multiple-parton interactions (MPI) that take place within the same collision, and by interactions between the remnants of the hadrons. To describe the underlying-event (UE) activity in a hard scattering process, and minimum-bias (MB) events, Monte Carlo (MC) event generators such as \({\textsc {herwig}} \,7\) [1,2,3] and \({\textsc {pythia}} \,8\) [4] include a model of these additional interactions. Because these processes are soft in nature, perturbative quantum chromodynamics (QCD) cannot be used to predict them, so they must be described by a phenomenological model. The parameters of the models must be optimized to provide a reasonable description of measured observables that are sensitive to the UE and MB events. An accurate description of the UE by MC event generators, along with an understanding of the uncertainties in the description, is of particular importance for precision measurements at hadron colliders, such as the extraction of the top quark mass. This paper presents new sets of parameters (“tunes”) for the UE model of the \({\textsc {herwig}} \,7\) event generator.

The \({\textsc {herwig}} \,7\) event generator is a multipurpose event generator, which can perform matrix-element (ME) calculations beyond leading order (LO) in QCD, via the matchbox module [5], matched with parton shower (PS) calculations. Both an angular-ordered and a dipole-based PS simulation are available in \({\textsc {herwig}} \,7\), and the former is used in this paper. The ME calculations can also be provided by an external ME generator, such as powheg [6,7,8] or MadGraph 5_amc@nlo [9]. The \({\textsc {herwig}} \,7\) generator is built upon the development of the preceding herwig [10] and herwig++ [1] event generators. In addition to the simulation of hard scattering of partons in hadron-hadron collisions, a simulation of MPI, which is modelled by a combination of soft and hard interactions and by colour reconnection (CR) [1, 11,12,13], is included in \({\textsc {herwig}} \,7\). As shown in Ref. [13], a model of CR is required in \({\textsc {herwig}} \,7\) to describe the structure of colour connections between different MPI, and to obtain a good description of the mean charged-particle transverse momentum (\(p_{\mathrm {T}}\)) as a function of the charged-particle multiplicity (\(N_{\text {ch}}\)).

The model describing the soft interactions, and also diffractive processes, was improved in version 7.1 of \({\textsc {herwig}} \,7\). This resulted in a new tune of the MPI parameters, called \(\text {SoftTune}\), which improved the description of MB data [3, 12]. In particular, the description of final-state hadronic systems separated by a large rapidity gap [14, 15] is notably improved because a significant contribution is expected from diffractive events. The tune \(\text {SoftTune}\) is based on the MMHT 2014 LO parton distribution function (PDF) set [16], and was derived by fitting MB data at \(\sqrt{s} =0.9,~7,\) and \(13\,\text {Te}\text {V} \) from the ATLAS experiment [17]. The MB data used in the tuning include the pseudorapidity (\(\eta \)) and \(p_{\mathrm {T}} \) distributions of charged particles for various lower bounds on \(N_{\text {ch}}\), namely \(N_{\text {ch}} \ge 1,~2,~6,\) and 20. The mean charged-particle \(p_{\mathrm {T}} \) as a function of \(N_{\text {ch}}\) was also included in the tuning procedure. Three models of CR are available in \({\textsc {herwig}} \,7\), and \(\text {SoftTune}\) was derived with the plain colour reconnection (PCR) model implemented. The same PCR model is considered in our studies.

In this paper, we present new UE tunes for the \({\textsc {herwig}} \,7\) (version 7.1.4) generator. In contrast to \(\text {SoftTune}\), the tunes presented here are based on the NNPDF 3.1 PDF sets [18], and use the next-to-next-to-leading-order (NNLO) PDF set for the simulation of the PS, and either an LO or NNLO PDF set for the simulation of MPI and the beam remnants. This choice of PDF sets is similar to that used to obtain tunes for the \({\textsc {pythia}} \,8\) event generator in Ref. [19], where it was shown that predictions from \({\textsc {pythia}} \,8\) using LO, next-to-leading-order (NLO), and NNLO PDFs with their associated tunes can all give a reliable description of the UE. Based on these findings and the wide use by the CMS Collaboration of the CP5 \({\textsc {pythia}} \,8\) tune, we concentrate on deriving tunes for the \({\textsc {herwig}} \,7\) generator that are also based on an NNLO PDF set for the simulation of the parton shower. It is verified that using an NNLO PDF in the simulation of the PS in \({\textsc {herwig}} \,7\) also provides a reliable description of MB data. A consistent choice of PDF in the \({\textsc {herwig}} \,7\) and \({\textsc {pythia}} \,8\) generators, as well as a similar method of the MPI model tuning, provides a better comparison of predictions from these two generators.

The tunes are derived by fitting measurements from proton-proton collision data collected by the CMS experiment [20] at \(\sqrt{s} = 0.9\), 7, and \(13\,\text {Te}\text {V} \). The measurements used in the fitting procedure are chosen because of their sensitivity to the modelling of the UE in \({\textsc {herwig}} \,7\). Uncertainties in the parameters of one of the new tunes are also derived. This quantifies the effect of the uncertainties in the fitted parameters for future analyses. To validate the performance of the new tunes, the corresponding \({\textsc {herwig}} \,7\) predictions are compared with a range of MB data from proton-proton and proton-antiproton collisions. Comparisons are also made using event shape observables from electron-positron collisions collected at the CERN LEP accelerator, which are particularly sensitive to the choice of the strong coupling \(\alpha _\mathrm {S}\) in the description of final-state radiation. To further validate the new tunes, predictions of differential \(\mathrm{t}{\bar{\mathrm{t}}}\) , Z boson, and W boson cross sections are also obtained from matching ME calculations from powheg and MadGraph 5_amc@nlo with the \({\textsc {herwig}} \,7\) PS description. The kinematics of the \(\mathrm{t}{\bar{\mathrm{t}}}\) system are studied, along with the multiplicity of additional jets, which are sensitive to the modelling by the PS simulation, in \(\mathrm{t}{\bar{\mathrm{t}}}\) , Z boson, and W boson events. The modelling of the UE in Z boson events, and the substructure of jets in \(\mathrm{t}{\bar{\mathrm{t}}}\) and in inclusive jet events are also investigated. Some of these comparisons are sensitive to the modelling by the ME calculations, and the purpose of those is to validate that the various predictions using the tunes do not differ from each other by a significant amount. Other comparisons are more sensitive to the modelling of the PS and MPI simulation, allowing us to test the new tunes in data other than MB data.

This paper is organized as follows. In Sect. 2, we summarize the UE model employed by \({\textsc {herwig}} \,7\), and describe the model parameters considered in the tuning. The choice of PDF and the value of the strong coupling in the tunes is discussed in Sect. 3 in addition to details of the fitting procedure. The new tunes are presented in Sect. 4, and the corresponding predictions from \({\textsc {herwig}} \,7\) are compared with MB data. Uncertainties in one of the derived tunes are presented in Sect. 5. Further validation of the new tunes is performed in the following sections: their predictions are compared with event shape observables from the CERN LEP in Sect. 6, and with top quark, inclusive jet, and Z and W boson production data in Sects. 7, 8, and 9, respectively. Finally, we present a summary in Sect. 10.

Table 1 Parameters considered in the tuning, and their allowed ranges in the fit

2 The UE model in HERWIG 7

The UE in \({\textsc {herwig}} \,7\) is modelled by a combination of soft and hard interactions [1, 11, 12]. The parameter \(p_{\bot }^\text {min}\) defines the transition between the soft and hard MPI. The interactions with a pair of outgoing partons with \(p_{\mathrm {T}}\) above \(p_{\bot }^\text {min}\) are treated as hard interactions, which are constructed from QCD two-to-two processes. The \(p_{\bot }^\text {min}\) transition threshold depends on the centre-of-mass energy of the hadron-hadron collision and is given by:

$$\begin{aligned} p_{\bot }^\text {min} = p_{\bot ,\mathrm {0}}^\mathrm {min} \left( \frac{\sqrt{s}}{E_{\mathrm {0}}}\right) ^b, \end{aligned}$$
(1)

where \(p_{\bot ,\mathrm {0}}^\mathrm {min}\) is the value of \(p_{\bot }^\text {min}\) at a reference energy scale \(E_{\mathrm {0}}\), which is set to 7\(\,\text {Te}\text {V}\), \(\sqrt{s}\) is the centre-of-mass energy of the hadron-hadron collision, and the parameter \(b\) controls the energy dependence of \(p_{\bot }^\text {min}\). Decreasing the value of \(p_{\bot }^\text {min}\) increases the number of hard interactions whilst reducing the number of soft interactions, which typically increases the amount of activity in the UE.

The average number \(\langle n\rangle \) of these additional hard interactions per hadron-hadron collision is given by:

$$\begin{aligned} \langle n\rangle =A(d) \sigma (s), \end{aligned}$$
(2)

where \(\sigma (s)\) is the production cross section of a pair of partons with \(p_{\mathrm {T}} >p_{\bot }^\text {min} \) and \(A(d)\) describes the overlap between the two protons at a given impact parameter \(d\). The form of the overlap function is given by:

$$\begin{aligned} A(d) = \frac{\mu ^{2}}{96\pi }(\mu d)^{3} K_{3}, \end{aligned}$$
(3)

where \(\mu ^{2} \) is the inverse proton radius squared, and \(K_{3}\equiv {K_{3}(\mu d)}\) is the modified Bessel function of the third kind. The overlap function is obtained by the convolution of the electromagnetic form factors of two protons. The number of additional hard interactions per hadron-hadron collision at a given \(d \) is described by a Poissonian probability distribution with a mean given by Eq. (2), which is then integrated over the impact parameter space. Increasing \(\mu ^{2}\) increases the density of the partons in the hadrons, and results in a higher probability for additional hard scatterings to take place.

Additional soft interactions, which produce pairs of partons below \(p_{\bot }^\text {min}\), are based on a model of multiperipheral particle production [12]. The number of additional soft interactions between the two hadron remnants is described in a similar way to the hard interactions above \(p_{\bot }^\text {min}\). In a soft interaction between the two hadron remnants, the mean number of particles produced is given by:

$$\begin{aligned} \langle N \rangle = N_\mathrm {0} \left( \frac{s}{1 \,\text {Te}\text {V} ^2} \right) ^P \ln \frac{\left( p_{\mathrm {r1}} + p_{\mathrm {r2}}\right) ^2}{m^{2}_{\mathrm {rem}}}, \end{aligned}$$
(4)

where \(p_{\mathrm {r1}}\) and \(p_{\mathrm {r2}}\) are the four-momenta of the two remnants, and \(m_{\mathrm {rem}}\) is the mass of a proton remnant, i.e. the remaining valence quarks of a proton treated as a diquark system, and is set to \(0.95\,\text {Ge}\text {V} \). The parameters \(N_\mathrm {0}\) and \(P\) control the energy dependence of the mean number of soft particles produced. They were tuned to MB data, which resulted in the values \(P =-0.08\) and \(N_\mathrm {0} =0.95\) [3]. In deriving the tune \(\text {SoftTune}\) the values of \(N_\mathrm {0}\) and \(P\) were kept fixed at these values.

The cluster model [21] is used to model the hadronization of quarks into hadrons. After the PS calculation, gluons are split into quark-antiquark pairs, and a cluster is formed from each colour connected pair of quarks. Before hadrons are produced from the clusters, CR can modify the configuration of the clusters. With the PCR model, the quarks from two clusters can be reconfigured to form two alternative clusters. The change of the cluster configuration takes place only if the sum of the masses of the new clusters is smaller than before. If this condition is satisfied, the CR is accepted with a probability \(p_{\text {reco}}\), which is the only parameter of the PCR model. The PCR model typically leads to clusters with smaller invariant mass compared with the clusters that would be obtained without CR, and will typically reduce the overall activity in the UE.

3 Tuning procedure

We derive three tunes based on the NNPDF 3.1 PDF sets [18]. A different PDF set is chosen for each aspect of the \({\textsc {herwig}} \,7\) simulation: hard scattering, parton showering, MPI, and beam remnant handling. The value of \(\alpha _\mathrm {S}\) at a scale equal to the Z boson mass \(m_{\mathrm {\mathrm{Z}\,}}\)in each tune is set to \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}})=0.118\) for all parts of the \({\textsc {herwig}} \,7\) simulation, with a two-loop running of \(\alpha _\mathrm {S}\).

Fig. 1
figure 1

Illustration of the different \(\phi \) regions, with respect to the leading object in an event, used to probe the properties of the UE in measurements

Table 2 Value of the parameters for the \(\text {SoftTune}\) [3, 12], CH1, CH2, and CH3 tunes

The first tune, CH1 (“CMS herwig ”), uses an NNLO PDF set in all aspects of simulation in \({\textsc {herwig}} \,7\), where the PDF was derived with a value of \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}}) =0.118\). This is equivalent to the choice of PDF and \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}})\) used in the CP5 \({\textsc {pythia}} \,8\) tune [19]. In the second tune, CH2, an LO PDF set that was also derived with \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}}) =0.118\), is used in the simulation of MPI and beam remnant handling, whereas an NNLO PDF set is used elsewhere. The final tune, CH3, is similar to CH2, but uses an LO PDF set that was derived with \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}}) =0.130\) for the simulation of MPI and remnant handling. The choice of an LO PDF set for the simulation of MPI and beam remnant handling, regardless of the choice of PDF used in the PS and ME calculation, is motivated by ensuring that the gluon PDF is positive at the low energy scales involved, which is not necessarily the case with higher-order PDF sets. However, as was shown in Ref. [19], the gluon PDF in the NNLO NNPDF 3.1 set remains positive at low energy scales, and predictions from \({\textsc {pythia}} \,8\) using LO and higher-order PDFs can both give a reliable description of MB data. The configurations of PDF sets in the CH1, CH2, and CH3 tunes allow us to study whether using an NNLO PDF set consistently for all aspects of the \({\textsc {herwig}} \,7\) simulation, or an LO PDF set for the simulation of MPI, can both give a reliable description of MB data. For both of these choices the gluon PDF is positive at low energy scales.

The names of the parameters being tuned in the \({\textsc {herwig}} \,7\) configuration, and their allowed ranges in the fit, are shown in Table 1. The values of \(N_\mathrm {0} =0.95\) and \(P =-0.08\) are fixed at the values that were used in the tune \(\text {SoftTune}\). As shown later, no further tuning of these parameters is necessary, because of the good description of measured observables obtained with these values.

The tunes are derived by fitting unfolded MB data that are available in the rivet [22] toolkit. The proton-proton collision data used in the fit were collected by the CMS experiment at \(\sqrt{s} =0.9\), 7, and 13\(\,\text {Te}\text {V}\). In measurements probing the UE, charged particles in a particular event are typically categorized into different \(\eta \)-\(\phi \) regions with respect to a leading object in that event, such as the highest \(p_{\mathrm {T}}\) track or jet, as illustrated in Fig. 1. The difference in azimuthal \(\phi \) between each charged particle and the leading object (\(\varDelta \phi \)) is used to assign each charged particle to a region, namely the toward (\(|\varDelta \phi |\le 60^{\circ }\)), away (\(|\varDelta \phi |>120^{\circ }\)), and transverse regions (\(60<|\varDelta \phi |\le 120^{\circ }\)). The properties of the charged particles in the transverse regions are the most sensitive to the modelling of the UE. The two transverse regions can be further divided into the transMin and transMax regions, which are the regions with the least and most charged-particle activity, respectively. Data that have been categorized in this way are referred to as UE data in this paper.

Fig. 2
figure 2

The normalized \(\mathrm {d} N_{\text {ch}}/\mathrm {d} \eta \) of charged hadrons as a function of \(\eta \) [27]. CMS MB data are compared with \(\text {SoftTune}\) and the CH tunes. The coloured band in the ratio plot represents the total experimental uncertainty in the data. The vertical bars on the points for the different predictions represent the statistical uncertainties

Fig. 3
figure 3

The normalized \(p_{\mathrm {T}} ^{\text {sum}}\) (upper) and \(N_{\text {ch}}\) (lower) density distributions in the transMin (left) and transMax (right) regions, as a function of the \(p_{\mathrm {T}}\) of the leading track, \(p_{\mathrm {T}} ^{\text {max}}\) [24]. CMS MB data are compared with the predictions from \({\textsc {herwig}} \,7\), with the \(\text {SoftTune}\) and CH tunes. The coloured band in the ratio plot represents the total experimental uncertainty in the data. The vertical bars on the points for the different predictions represent the statistical uncertainties

Fig. 4
figure 4

The \(p_{\mathrm {T}} ^{\text {sum}}\) (upper) and \(N_{\text {ch}}\) (lower) density distributions in the transMin (left) and transMax (right) regions, as a function of the \(p_{\mathrm {T}}\) of the leading track, \(p_{\mathrm {T}} ^{\text {max}}\) [23]. CMS MB data are compared with the predictions from \({\textsc {herwig}} \,7\), with the \(\text {SoftTune}\) and CH tunes. The coloured band in the ratio plot represents the total experimental uncertainty in the data. The vertical bars on the points for the different predictions represent the statistical uncertainties

Fig. 5
figure 5

The \(p_{\mathrm {T}} ^{\text {sum}}\) (left) and \(N_{\text {ch}}\) (right) density distributions in the transverse regions, as a function of the \(p_{\mathrm {T}}\) of the leading track jet, \(p_{\mathrm {T}} ^{\text {jet}}\) [25]. CMS MB data are compared with the predictions from \({\textsc {herwig}} \,7\), with the \(\text {SoftTune}\) and CH tunes. The coloured band in the ratio plot represents the total experimental uncertainty in the data. The vertical bars on the points for the different predictions represent the statistical uncertainties

Fig. 6
figure 6

The \(p_{\mathrm {T}} ^{\text {sum}}\) (upper) and \(N_{\text {ch}}\) (lower) density distributions in the transMin (left) and transMax (right) regions, as a function of the \(p_{\mathrm {T}}\) of the leading track, \(p_{\mathrm {T}} ^{\text {max}}\) [31]. CDF MB data are compared with the predictions from \({\textsc {herwig}} \,7\), with the \(\text {SoftTune}\) and CH tunes. The coloured band in the ratio plot represents the total experimental uncertainty in the data. The vertical bars on the points for the different predictions represent the statistical uncertainties

Fig. 7
figure 7

The \(p_{\mathrm {T}} ^{\text {sum}}\) (upper) and \(N_{\text {ch}}\) (lower) density distributions in the transMin (left) and transMax (right) regions, as a function of the \(p_{\mathrm {T}}\) of the leading track, \(p_{\mathrm {T}} ^{\text {max}}\) [24]. CMS MB data are compared with the predictions from \({\textsc {herwig}} \,7\), with the CH1 and CH3 tunes, and from \({\textsc {pythia}} \,8\), with the CP1 and CP5 tunes. The coloured band in the ratio plot represents the total experimental uncertainty in the data. The vertical bars on the points for the different predictions represent the statistical uncertainties

At \(\sqrt{s} =7\) and 13\(\,\text {Te}\text {V}\), the \(N_{\text {ch}}\) and transverse momentum sum (\(p_{\mathrm {T}} ^{\text {sum}}\)), with respect to the beam axis, as functions of the \(p_{\mathrm {T}}\) of the leading track (\(p_{\mathrm {T}} ^{\text {max}}\)) in the transMin and transMax regions are used in the fit [23, 24]. At \(\sqrt{s} =0.9\,\text {Te}\text {V} \), the observables used are the \(N_{\text {ch}}\) and \(p_{\mathrm {T}} ^{\text {sum}}\) in the transverse region, as a function of the \(p_{\mathrm {T}}\) of the leading jet (\(p_{\mathrm {T}} ^{\text {jet}}\)) [25]. The track jets are clustered using the SISCone algorithm [26] with a distance parameter of 0.5. The regions \(p_{\mathrm {T}} ^{\text {max}} <3\,\text {Ge}\text {V} \) and \(p_{\mathrm {T}} ^{\text {jet}} <3\,\text {Ge}\text {V} \) are not included in the fit because the parameters of diffractive processes, which dominate this region, are not considered. The charged-hadron multiplicity as a function of \(\eta \), \(\mathrm {d} N_{\text {ch}}/\mathrm {d} \eta \), as measured by CMS at \(\sqrt{s} =13\,\text {Te}\text {V} \) with zero magnetic field strength (\(\mathrm {B}=0\,\text {T} \)) [27] is also used in the fitting procedure. The charged-particle \(p_{\mathrm {T}}\) and \(\eta \) as measured by CMS in Ref. [28] are not considered here, since they are biased by predictions obtained with \({\textsc {pythia}} \,6\) [29], as discussed in Ref. [12].

The tuning is performed within the professor (v1.4.0) framework [30]. Around 60 random choices of the parameters are made, and predictions for each of these choices are obtained using rivet. Approximately 10 million MB events are generated for each choice of parameters, such that the uncertainty in the prediction in any bin is typically not larger than the uncertainty in the data in the same bin.

The fit is performed by minimising the \(\chi ^{2}\) function:

$$\begin{aligned} \chi ^{2} (p) = \sum _{{\mathcal {O}}}w_{{\mathcal {O}}} \sum _{i \in {\mathcal {O}}}\frac{(f^{i}(p) -{\mathcal {R}}_i)^2}{\varDelta ^2_i}, \end{aligned}$$
(5)

where \({\mathcal {R}}_i \) is the measured content of bin \(i\) of the distribution of observable \({\mathcal {O}}\), while \(f^{i}(p)\) is the predicted content in bin \(i\), which is obtained by professor from a parameterization of the dependence of the prediction on the tuning parameters p. The total uncertainty in the data and the simulated prediction in bin \(i\) of a given observable is denoted by \(\varDelta ^2_i \), and \(w_{{\mathcal {O}}}\) is a weight that increases or decreases the importance of an observable \({\mathcal {O}}\) in the fit. The weight is typically set to \(w_{{\mathcal {O}}} =1\). However, for the CH1 tune, where the PDF set used in the simulation of MPI and beam remnants is an NNLO set instead of an LO set, the weight is set to \(w_{{\mathcal {O}}} =3\) for the \(\mathrm {d} N_{\text {ch}}/\mathrm {d} \eta \) distribution. This is the smallest weight that ensures the distribution is well described after the tuning. Beyond this, the parameters for the three tunes and their predictions are stable with respect to a change in the weight assigned to the \(\mathrm {d} N_{\text {ch}}/\mathrm {d} \eta \) distribution in the fit. Correlations between the bins \(i\) are not taken into account when minimising Eq. (5), because these were not available for the used input distributions. A third-order polynomial is used to parameterize the dependence of the prediction on the tuning parameters. Using a fourth-order polynomial to perform this interpolation between the 60 choices of parameters has a negligible effect on the outcome of the fits.

The number of degrees of freedom (\(N_{\text {dof}}\)) in the fit is calculated as:

$$\begin{aligned} N_{\text {dof}} = \frac{(\sum _{{\mathcal {O}}}\sum _{i \in {\mathcal {O}}}w_{{\mathcal {O}}})^2}{\sum _{{\mathcal {O}}} \sum _{i \in {\mathcal {O}}}w_{{\mathcal {O}}} ^2} - N_{\text {param}}, \end{aligned}$$
(6)

where \(N_{\text {param}}\) is the number of parameters being optimized in the fit.

4 Results from the new HERWIG 7 tunes

The tuned values of the parameters and the \(\chi ^{2}\) values from the fit, i.e. the minimum values of Eq. (5), divided by the \(N_{\text {dof}}\) of the fit are shown in Table 2, along with the values of the parameters for the default tune \(\text {SoftTune}\). The \(N_{\text {dof}}\) in the fit is 118 for CH1, and 152 for CH2 and CH3. To provide a comparison between the compatibilities of the CH tunes and \(\text {SoftTune}\) with the data, the \(\chi ^{2}\)/\(N_{\text {dof}}\) corresponding to the prediction of SoftTune and the data is also shown with \(N_{\text {dof}}\) set to 152.

The values of the parameters of the MPI model are intertwined with each other since they are tuned simultaneously to reproduce the amount of UE activity observed in the data. Nonetheless, a general interpretation of the variations in the tuned parameters for each tune can be distinguished. For example, the value of \(p_{\bot ,\mathrm {0}}^\mathrm {min}\) is lower for all three CH tunes than for \(\text {SoftTune}\), and significantly lower for CH1, which increases the amount of MPI in an event compared to that with the tune \(\text {SoftTune}\).

The lower value of \(b\) for all CH tunes further increases the contribution of MPI in collisions at \(\sqrt{s} =13\,\text {Te}\text {V} \). Because of the lower values of \(p_{\text {reco}}\), the amount of CR in the CH tunes is lower than in \(\text {SoftTune}\). This also has the effect of increasing the overall amount of activity in the UE for the CH tunes. The value of \(\mu ^{2}\) for CH2 and CH3 is lower than the corresponding value for \(\text {SoftTune}\). Even though a lower value of \(\mu ^{2}\) would lead to a lower amount of MPI in a given event, the combined effect of the parameters of the CH tunes results in a larger amount of MPI compared with \(\text {SoftTune}\).

The tuned parameters of CH2 and CH3 are fairly similar, as are the values of \(\chi ^{2}\)/\(N_{\text {dof}}\) of these two tunes, indicating that the choice of \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}})\) used when deriving the LO PDF set in the simulation of MPI does not have a large effect. The parameters for the tune CH1 differ from those for the tunes CH2 and CH3, and the value of \(\chi ^{2}\)/\(N_{\text {dof}}\) is larger, implying that using an LO PDF set is somewhat preferred over an NNLO PDF set for the simulation of MPI. In the following, the predictions from the three CH tunes are compared with the data used in the tuning procedure. These predictions are obtained by generating events with the corresponding parameters shown in Table 2 rather than from the parameterization of the tune parameters used in the fit.

Figure 2 shows the normalized \(\mathrm {d} N_{\text {ch}}/\mathrm {d} \eta \) of charged hadrons as a function of \(\eta \) at 13\(\,\text {Te}\text {V}\) in MB events. Only the predictions for \(\text {SoftTune}\) deviate significantly from the data, and underestimate the \(\mathrm {d} N_{\text {ch}}/\mathrm {d} \eta \) in data by 10–18%. The CH tunes each provide a slightly different prediction, but all have a similar level of agreement with the data. The CH tunes compared with SoftTune predict an increase in the UE activity, which is observed.

Figure 3 shows the normalized \(p_{\mathrm {T}} ^{\text {sum}}\) and \(N_{\text {ch}}\) densities as a function of \(p_{\mathrm {T}} ^{\text {max}}\) with comparisons from \(\text {SoftTune}\) and the CH tunes for both transMin and transMax. The predictions of \(\text {SoftTune}\) and the CH2, CH3 tunes are broadly similar, and give a good description the data in the plateau region (\(p_{\mathrm {T}} ^{\text {max}} \gtrsim 4\,\text {Ge}\text {V} \)). In the rising part of the spectrum, the predictions from the tunes CH2, CH3, and \(\text {SoftTune}\) deviate from the data in some bins by up to 40%. The CH3 tune provides the best predictions in the rising region of the spectrum. However, only the region \(p_{\mathrm {T}} ^{\text {max}} >3\,\text {Ge}\text {V} \) was included in the tuning procedure, because the region \(p_{\mathrm {T}} ^{\text {max}} <3\,\text {Ge}\text {V} \) is dominated by diffractive processes whose model parameters are not used in the fit.

The effect of using an NNLO PDF, instead of an LO PDF, in the simulation of MPI is seen from the predictions with the tune CH1 in Fig. 3. This tune provides a good description of the \(N_{\text {ch}}\) distributions in both the transMin and transMax regions, and is typically within 10% of the data. However, the tune CH1 does not simultaneously provide a good description of the \(p_{\mathrm {T}} ^{\text {sum}}\) distributions in either the transMin or transMax region, with a 10% difference to the data in the plateau region of the corresponding transMax distribution.

Fig. 8
figure 8

The normalized \(\mathrm {d} N_{\text {ch}}/\mathrm {d} \eta \) of charged hadrons as a function of \(\eta \) [27]. CMS MB data are compared with the predictions from \({\textsc {herwig}} \,7\), with the CH1 and CH3 tunes, and from \({\textsc {pythia}} \,8\), with the CP1 and CP5 tunes. The coloured band in the ratio plot represents the total experimental uncertainty in the data. The vertical bars on the points for the different predictions represent the statistical uncertainties

Table 3 Parameters of the central, “up”, and “down” variations of the CH3 tune
Fig. 9
figure 9

The \(p_{\mathrm {T}} ^{\text {sum}}\) (upper) and \(N_{\text {ch}}\) (lower) density distributions in the transMin (left) and transMax (right) regions, as a function of the \(p_{\mathrm {T}}\) of the leading track, \(p_{\mathrm {T}} ^{\text {max}}\) [24]. CMS MB data are compared with the predictions from \({\textsc {herwig}} \,7\), with the CH tunes. The coloured band in the ratio plot represents the total experimental uncertainty in the data. The vertical bars on the points for the different predictions represent the statistical uncertainties. The grey-shaded band corresponds to the envelope of the “up” and “down” variations of the CH3 tune

Figure 4 shows the normalized \(N_{\text {ch}}\) and \(p_{\mathrm {T}} ^{\text {sum}}\) densities as a function of \(p_{\mathrm {T}} ^{\text {max}}\) using UE data at 7 TeV and compared with various tunes. In the transMax region, the predictions from the CH tunes describe the data well, with at most a 15% discrepancy at low \(p_{\mathrm {T}} ^{\text {max}}\). In the transMin region, the predictions from all tunes deviate from the data at intermediate values of \(p_{\mathrm {T}} ^{\text {max}} \approx 3\text {--}8\,\text {Ge}\text {V} \). The deviation is up to \(\approx \)10% for the CH2 and CH3 tunes, whereas the difference between data and the tunes \(\text {SoftTune}\) and CH1 is larger than this. The prediction of CH1 deviates further from the data at lower values of \(p_{\mathrm {T}} ^{\text {max}}\).

The predictions are compared with UE data at \(\sqrt{s} =0.9\,\text {Te}\text {V} \) to normalized \(p_{\mathrm {T}} ^{\text {sum}}\) densities in the transverse regions in Fig. 5. All tunes provide a similar prediction of the observables above \(p_{\mathrm {T}} ^{\text {jet}} >4\,\text {Ge}\text {V} \), and agree with the data. Some differences are apparent between the predictions at low \(p_{\mathrm {T}} ^{\text {jet}}\), with the tunes CH2 and CH3 providing a better description of the data compared to the tunes CH1 and \(\text {SoftTune}\).

Figure 6 shows comparisons of the normalized \(p_{\mathrm {T}} ^{\text {sum}}\) and \(N_{\text {ch}}\) densities using tune predictions with UE data collected by the CDF experiment at the Fermilab Tevatron at \(\sqrt{s} =1.96\,\text {Te}\text {V} \) [31]. The CH tunes describe the distributions in both transMin and transMax well, however the CH3 tune underestimates the \(p_{\mathrm {T}} ^{\text {sum}}\) data somewhat at \(p_{\mathrm {T}} ^{\text {max}} <10\,\text {Ge}\text {V} \), in both the transMin and transMax regions. Although these data were not used in deriving any of the tunes considered here, they validate that the energy dependence of the new tunes is correctly modelled. The tune \(\text {SoftTune}\) overestimates the data by \(\approx \)5–15% in all distributions. Additional comparisons of the predictions of \({\textsc {herwig}} \,7\) with the various tunes using MB data from the ATLAS experiment, which were used in deriving \(\text {SoftTune}\), are shown in Appendix A. One notable difference between the distribution of \(\mathrm {d} N_{\text {ch}}/\mathrm {d} \eta \) shown in Fig. 2 and the one shown in Fig. 24 is that the former includes all charged particles, whereas the latter includes only charged particles with \(p_{\mathrm {T}} > 500\,\text {Me}\text {V} \).

Based on the comparisons shown in this section, the tunes CH2 and CH3 both provide a similar description of the data, indicating that the choice between the two LO PDFs used for the simulation of MPI and remnant handling has little effect on the predictions. These two PDFs are both LO PDFs, but a value of \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}}) =0.118\) is used in deriving the PDF used with CH2, and a value of \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}}) =0.130\) is assumed for the PDF used with CH3. As stated in Sect. 3, \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}}) =0.118\) is used in all parts of the \({\textsc {herwig}} \,7\) simulation for the three CH tunes. From Table 2, the \(\chi ^{2}\)/\(N_{\text {dof}}\) for the tune CH2 is slightly lower than that for the tune CH3. However, the use of the LO PDF in the tune CH3, which was derived with \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}}) =0.130\), is consistent with the value of \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}})\) typically associated with LO PDFs and therefore is a preferred choice over the tune CH2. Both of the tunes CH2 and CH3 provide a better description of the data than the tune CH1, where the NNLO NNPDF3.1 PDF was used for the simulation of MPI and remnant handling. This suggests that the use of the LO NNPDF3.1 PDF is preferred in this aspect of the \({\textsc {herwig}} \,7\) simulation, even though the gluon PDF in both the LO and NNLO PDF sets are positive at low energy scales, as discussed earlier.

Fig. 10
figure 10

The normalized \(\mathrm {d} N_{\text {ch}}/\mathrm {d} \eta \) of charged hadrons as a function of \(\eta \) [27]. CMS MB data are compared with the predictions from \({\textsc {herwig}} \,7\), with the CH tunes. The coloured band in the ratio plot represents the total experimental uncertainty in the data. The vertical bars on the points for the different predictions represent the statistical uncertainties. The grey-shaded band corresponds to the envelope of the “up” and “down” variations of the CH3 tune

In Fig. 7 the normalized \(N_{\text {ch}}\) and \(p_{\mathrm {T}} ^{\text {sum}}\) density predictions of the UE data at \(\sqrt{s} =13\,\text {Te}\text {V} \) show a comparison of the CH1 and CH3 tunes with those obtained from the \({\textsc {pythia}} \,8\) (version 8.230) using the tunes CP1 and CP5 [19]. The tune CH2 is not displayed, because its prediction is similar to the one of the tune CH3. The CP1 tune uses an LO NNPDF3.1 PDF set in all aspects of the \({\textsc {pythia}} \,8\) simulation, an \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}})\) value of 0.130 in the simulation of MPI and hard scattering, and an \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}})\) value of 0.1365 for the simulation of initial- and final-state radiation. The CP5 tune uses an NNLO PDF set with an \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}})\) value of 0.118 in all aspects of simulation. The choice of the PDF set and \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}})\) value in the CP5 tune is the same as the CH1  \({\textsc {herwig}} \,7\) tune. Although all the predictions show a reasonable agreement with the data in the plateau region of the UE distributions, the use of an LO PDF for MPI and remnant handling in CH3 provides a slightly improved description of the \(p_{\mathrm {T}} ^{\text {sum}}\) data compared to using an NNLO PDF in CH1. This differs from the predictions of \({\textsc {pythia}} \,8\), where the use of an LO and NNLO PDF for simulating MPI give a similar description of the data in this region. Each prediction exhibits different behaviour at low \(p_{\mathrm {T}} ^{\text {max}}\). None of the \({\textsc {herwig}} \,7\) or \({\textsc {pythia}} \,8\) tunes provides a perfect description of the data at low \(p_{\mathrm {T}} ^{\text {max}}\), since they exhibit at least a 10% difference between any one of the tunes and the data. Figure 8 shows a similar comparison for the \(\eta \) distribution of charged hadrons at 13\(\,\text {Te}\text {V}\). The prediction from CP5 provides a better description of the data compared with the other tunes at larger values of \(|\eta |\). The predictions from the \({\textsc {herwig}} \,7\) tunes show a closer behaviour to the CP1 tune in this distribution.

Fig. 11
figure 11

Normalized differential cross sections for \(\mathrm{e}^-\) \(\mathrm{e}^+\) [32] as a function of the variables \(T\) (upper left), \(T_{\mathrm {major}}\) (upper right), \(O\) (lower left), and \(S\) (lower right) for ALEPH data at \(\sqrt{s} =91.2\,\text {Ge}\text {V} \). ALEPH data are compared with the predictions from \({\textsc {herwig}} \,7\) using the \(\text {SoftTune}\) and CH tunes. The coloured band in the ratios of the different predictions from simulation to the data represents the total experimental uncertainty in the data

Fig. 12
figure 12

The differential cross sections are shown as functions of: the \(p_{\mathrm {T}}\) (upper left) and rapidity (upper right) of the hadronically decaying top quark; the invariant mass of the \(\mathrm{t}{\bar{\mathrm{t}}}\) system (lower left); the additional jet multiplicity (lower right) [38]. CMS \(\mathrm{t}{\bar{\mathrm{t}}}\) data are compared with the predictions from powheg  + \({\textsc {herwig}} \,7\), with the \(\text {SoftTune}\), CH1, CH2, and CH3 tunes. The coloured band in the ratio plot represents the total experimental uncertainty in the data. The vertical bars on the points for the different predictions represent the statistical uncertainties

Fig. 13
figure 13

The differential cross sections are shown as functions of \(H_{\mathrm {T}}\) (left) and \(p_{\mathrm {T}} ^\text {miss}\) (right) [41]. CMS \(\mathrm{t}{\bar{\mathrm{t}}}\) data are compared with the predictions from powheg  + \({\textsc {herwig}} \,7\), with the \(\text {SoftTune}\), CH1, CH2, and CH3 tunes. The coloured band in the ratio plot represents the total experimental uncertainty in the data. The vertical bars on the points for the different predictions represent the statistical uncertainties

Fig. 14
figure 14

The normalized jet substructure observables in single-lepton events: the charged-particle multiplicity (upper left); the eccentricity (upper right); the groomed momentum fraction (lower left); and the angle between the groomed subjects (lower right) [42]. CMS \(\mathrm{t}{\bar{\mathrm{t}}}\) data are compared with the predictions from powheg  + \({\textsc {herwig}} \,7\), with the \(\text {SoftTune}\), CH1, CH2, and CH3 tunes. The coloured band in the ratio plot represents the total experimental uncertainty in the data. The vertical bars on the points for the different predictions represent the statistical uncertainties

Fig. 15
figure 15

The differential jet shape \(\rho (\mathrm {r})\) (upper left and right) and the second moment of the jet transverse width \(\langle \delta R^2\rangle \) in inclusive jet events [43]. CMS inclusive jet data are compared with the predictions from \({\textsc {herwig}} \,7\), with the \(\text {SoftTune}\) and CH tunes. The coloured band in the ratio plot represents the total experimental uncertainty in the data. The vertical bars on the points for the different predictions represent the statistical uncertainties

Fig. 16
figure 16

The \(p_{\mathrm {T}} ^{\text {sum}}\) (left) and \(N_{\text {ch}}\) (right) density distributions in the transverse region, as a function of the \(p_{\mathrm {T}}\) of the two muons, \(p_{\mathrm {T}} (\mu \mu )\) [45]. The transverse region is defined with respect to \(p_{\mathrm {T}} (\mu \mu )\), where the two muons are required to have an invariant mass close the the mass of the Z boson. CMS Z boson data are compared with the predictions from MadGraph 5_amc@nlo + \({\textsc {herwig}} \,7\), with the \(\text {SoftTune}\) and CH tunes. The coloured band in the ratio plot represents the total experimental uncertainty in the data. The vertical bars on the points for the different predictions represent the statistical uncertainties

Fig. 17
figure 17

The \(p_{\mathrm {T}} ^{\text {sum}}\) (left) and \(N_{\text {ch}}\) (right) density distributions in the toward (upper), and away (lower) regions, as a function of the \(p_{\mathrm {T}}\) of the two muons, \(p_{\mathrm {T}} (\mu \mu )\) [45]. The toward and away regions are defined with respect to \(p_{\mathrm {T}} (\mu \mu )\), where the two muons are required to have an invariant mass close the the mass of the Z boson. CMS Z boson data are compared with the predictions from MadGraph 5_amc@nlo + \({\textsc {herwig}} \,7\), with the \(\text {SoftTune}\) and CH tunes. The coloured band in the ratio plot represents the total experimental uncertainty in the data. The vertical bars on the points for the different predictions represent the statistical uncertainties

Fig. 18
figure 18

The exclusive jet multiplicity in Z (left) and W (right) boson events, measured by CMS at \(\sqrt{s} =13\,\text {Te}\text {V} \) [46, 47]. CMS Z boson and W boson data are compared with the predictions from MadGraph 5_amc@nlo + \({\textsc {herwig}} \,7\), with the \(\text {SoftTune}\) and CH tunes. The coloured band in the ratio plot represents the total experimental uncertainty in the data. The vertical bars on the points for the different predictions represent the statistical uncertainties

Fig. 19
figure 19

Differential cross sections as a function of \(p_{\mathrm {T}} (\mathrm{Z}\,)\) (upper left), \(p_{\mathrm {T}} ^{\text {bal}}\) (upper right), and \(\mathrm {JZB}\) (lower) [46]. CMS Z boson data are compared with the predictions from MadGraph 5_amc@nlo + \({\textsc {herwig}} \,7\), with the \(\text {SoftTune}\) and CH tunes. The coloured band in the ratio plot represents the total experimental uncertainty in the data. The vertical bars on the points for the different predictions represent the statistical uncertainties

5 Uncertainties in the HERWIG 7 tunes

Alternative tunes are derived in this section that provide an approximation to the uncertainties in the parameters of the tune CH3. These are obtained from the eigentunes provided by professor. These eigentunes are variations of the tuned parameters along the maximally independent directions in the parameter space by an amount corresponding to a change in the \(\chi ^{2}\) (\(\varDelta \chi ^{2}\)) equal to the optimal \(\chi ^{2}\) of the fit. Because a change \(\varDelta \chi ^{2}\) in Eq. (5) does not result in a variation with a meaningful statistical interpretation, the value of \(\varDelta \chi ^{2}\) is chosen in an empirical way. The change \(\varDelta \chi ^{2} =\chi ^{2} \), which is suggested by the professor Collaboration, results in variations that are similar in magnitude to the uncertainties in the fitted data points and judged to provide a reasonable set of variations that reflect the combined statistical and systematic uncertainty in the model parameters. A consequence of this adopted procedure is that the uncertainty may not necessarily cover the data in every bin. If the uncertainties in the fitted data points were uncorrelated between themselves, then the magnitude of the uncertainties in the data points depends on their bin widths. For the data used in the fit, the uncertainties are typically dominated by uncertainties that are correlated between the bins. However, the uncertainties in the data points at high \(p_{\mathrm {T}} ^{\text {max}}\) and \(p_{\mathrm {T}} ^{\text {jet}}\), e.g. \(p_{\mathrm {T}} ^{\text {max}} \gtrsim 10\,\text {Ge}\text {V} \) for the UE observables at \(\sqrt{s} =13\,\text {Te}\text {V} \), are dominated by statistical uncertainties, which are uncorrelated between bins. This introduces some dependence of the eigentunes on the bin widths of the data used in the fit.

The variations of the tunes provided by the eight eigentunes are reduced to two variations, as explained below, one “up” and one “down” variation. The “up” variation is obtained by considering the positive differences in each bin between each eigentune and the central prediction of the CH3 tune for the distributions used in the tuning procedure. The difference for each eigentune is summed in quadrature. Similarly, the “down” variation is obtained by considering the negative differences between the eigentunes and the central predictions. The two variations are then fitted, using the same procedure described in Sect. 3 to obtain a set of tune parameters that describe these two variations. The parameters of the two variations are shown in Table 3. The values of each parameter of the variations do not necessarily encompass the corresponding values of the CH3 tune, as a result of the method of determining the variations from the differences between several eigentunes. The two variations accurately replicate the combination of all eigentunes, i.e. the sum in quadrature of all positive or negative differences with respect to the central prediction. By using these variations, the uncertainties in the tune CH3 are estimated by considering only two variations of the tune parameters, rather than eight variations. However, the correlations between bins of an observable for each of the eight individual variations are not known when considering only the “up” and “down” variations.

Figures 9 (normalized \(p_{\mathrm {T}} ^{\text {sum}}\) and \(N_{\text {ch}}\) densities) and 10 (normalized \(\mathrm {d} N_{\text {ch}}/\mathrm {d} \eta \)) show predictions from the CH tunes. The grey-shaded band corresponds to the envelope of the “up” and “down” variations, for the UE and MB observables used in the tuning procedure. The differences between the CH1 and CH2 predictions and those from CH3 are within the uncertainty of CH3, except for a small deviation at low \(p_{\mathrm {T}} ^{\text {max}}\).

6 Comparison with LEP data

\({\textsc {herwig}} \,7\) predictions are obtained in this section for event shape observables measured in LEP electron-positron collisions at \(\sqrt{s} =91.2\,\text {Ge}\text {V} \). The predictions are obtained using NLO MEs implemented within \({\textsc {herwig}} \,7\). Figure 11 shows the thrust (\(T\)), thrust major (\(T_{\mathrm {major}}\)), oblateness (\(O\)), and sphericity (\(S\)) observables as measured by the ALEPH Collaboration [32].

Because these observables are measured in collisions with a lepton-lepton initial state, the difference in choice of PDF and parameters of the MPI model in the three CH tunes has no effect on the predictions. Similarly, the only difference between the CH tunes and \(\text {SoftTune}\) is in the value of \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}})\). The value of \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}}) =0.118\) is used in the CH tunes, and is consistent with the value used by the PDF set for the hard process and the PS when simulating proton-proton collisions. A set of next-to-leading corrections to soft gluon emissions can be incorporated in the PS by using two-loop running of \(\alpha _\mathrm {S}\) and including the Catani–Marchesini–Webber rescaling [33] of \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}})\) from \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}}) =0.118\) to \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}}) =0.1262\), which corresponds to the value of \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}})\) used in \(\text {SoftTune}\) [34].

The CH tunes underestimate the number of events with \(0.80< T < 0.95\), whereas \(\text {SoftTune}\) predicts too many isotropic events with lower values of \(T <0.8\) and with higher values of \(S >0.4\). The CH tune provides a better overall description of the \(T_{\mathrm {major}}\) observable compared with \(\text {SoftTune}\). Both tunes predict too many planar events, as can be seen at larger values of \(O\); however, the CH tune provides a better description of the data at smaller values of \(O\).

7 Comparison with top quark pair production data

Predictions using the \({\textsc {herwig}} \,7\) tunes are compared in this section with observables measured in data containing top quark pairs.

The powheg v2 generator is used to perform ME calculations in the hvq mode [35] at NLO accuracy in QCD. In the powheg ME calculations, a value of \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}}) =0.118\) with a two-loop evolution of \(\alpha _\mathrm {S}\) is used, along with the NNPDF 3.1 NNLO PDF set, derived with a value of \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}}) =0.118\). The ME calculations are interfaced with \({\textsc {herwig}} \,7\) for the simulation of the UE and PS. The mass of the top quark is set to \(m_{\mathrm{t}\,} = 172.5\,\text {Ge}\text {V} \), and the value of the \(h_{\text {damp}} \) parameter, which controls the matching between the ME and PS, is set to \(1.379~m_{\mathrm{t}\,} \). The value of \(h_{\text {damp}} \) in powheg was derived from a fit to \(\mathrm{t}{\bar{\mathrm{t}}}\) data in the dilepton channel at \(\sqrt{s} =8\,\text {Te}\text {V} \), where powheg was interfaced with \({\textsc {pythia}} \,8\) using the CP5 tune [19, 36].

Samples are generated with the different \({\textsc {herwig}} \,7\) tunes that use the same parton-level events for each tune. For generating NLO matched samples such as these, an NLO (or NNLO) PDF set may be desirable for the simulation of the hard process. In Ref. [37], it is then advocated that the same PDF set and \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}})\) value should be used in the PS. However, one can still choose an LO PDF set for the simulation of the MPI and remnant handling in this case, such as the choices in the tunes CH2 and CH3. This configuration of PDF sets is not possible in pythia.

First, kinematic properties of the \(\mathrm{t}{\bar{\mathrm{t}}}\) system are compared with \(\sqrt{s} =13\,\text {Te}\text {V} \) CMS data in the single-lepton channel [38]. Figure 12 presents normalized differential cross sections as functions of the \(p_{\mathrm {T}}\) and rapidity y of the particle-level hadronically decaying top quark. The invariant mass of the reconstructed \(\mathrm{t}{\bar{\mathrm{t}}}\) system and the number of additional jets with \(p_{\mathrm {T}} > 30\,\text {Ge}\text {V} \) in the event are also shown, where the jets are reconstructed using the anti-\(k_{\mathrm {T}}\) algorithm [39, 40] with a distance parameter of 0.4. Normalized cross sections as a function of global event variables, namely \(H_{\mathrm {T}}\), the scalar \(p_{\mathrm {T}}\) sum of all jets, and \(p_{\mathrm {T}} ^\text {miss}\), the magnitude of the missing transverse momentum vector [41] are shown in Fig. 13.

The predictions from the different simulations are mostly compatible with each other, indicating a small effect of the tune on these observables. The only notable difference is seen in the additional jet multiplicity, originating from the smaller \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}})\) value used in the simulations with \({\textsc {herwig}} \,7\)  CH tunes. The simulated events with the CH tunes describe the CMS data well up to 4 additional jets, but slightly underestimate the multiplicity for a higher number of jets. The differences between the predictions with the CH tunes and the tune \(\text {SoftTune}\) are comparable with the typical size of the theoretical uncertainties in the ME calculation, as studied in Ref. [36].

Next, jet substructure observables are compared to \(\sqrt{s} =13\,\text {Te}\text {V} \) CMS data in the single-lepton channel [42]. Normalized number of jets as a function of four variables with relatively low correlations amongst themselves are shown in Fig. 14. The variables presented are the charged-particle multiplicity (\(\lambda _0^0\)), the eccentricity (\(\varepsilon \)) calculated from the charged jet constituents, the groomed momentum fraction (\(z_\mathrm {g}\)), and the angle between the groomed subjets (\(\varDelta R_\mathrm {g}\)).

The choice of tune has little effect on most of the jet substructure observables. All choices of \({\textsc {herwig}} \,7\) tune overestimate \(\lambda _0^0\), which was also observed in Ref. [42]. The predictions for \(\varepsilon \) and \(z_\mathrm {g}\) distributions agree closely with the data in all cases. The \(\varDelta R_\mathrm {g}\) spectrum at very low values is somewhat less well described by the simulation employing the CH tunes, whereas for high values the description is better for the CH tune samples than with \(\text {SoftTune}\). Since the \(\varDelta R_\mathrm {g}\) observable is strongly dependent on the amount of final-state radiation [42], the difference comes mostly from the choice of \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}})\), with the choice of \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}})\) in the CH tunes preferred to that in \(\text {SoftTune}\).

8 Comparisons with inclusive jet events

The predictions of \({\textsc {herwig}} \,7\) with the various tunes for inclusive jet production are investigated in this section. In particular, the substructure of the jets is considered. Events are generated with the LO QCD two-to-two MEs implemented in \({\textsc {herwig}} \,7\). Although a comparison of the substructure of jets in \(\mathrm{t}{\bar{\mathrm{t}}}\) events was already presented in Sect. 7, the comparison based on inclusive jet events is complementary because it probes a wider range of jet \(p_{\mathrm {T}}\).

Figure 15 shows the differential jet shape, \(\rho (\mathrm {r})\), as measured by the CMS experiment at \(\sqrt{s} =7\,\text {Te}\text {V} \) [43] for two bins of ranges of jet \(p_{\mathrm {T}}\) (\(p_{\mathrm {T}} ^{\text {jet}}\)): \(40< p_{\mathrm {T}} ^{\text {jet}} < 50 \,\text {Ge}\text {V} \) and \(600< p_{\mathrm {T}} ^{\text {jet}} < 1000 \,\text {Ge}\text {V} \). The observable \(\rho (\mathrm {r})\) is defined as the average fraction of the \(p_{\mathrm {T}}\) of the jet constituents contained inside an annulus with inner radius \(\mathrm {r}-0.1\) and outer radius \(\mathrm {r}+0.1\). The second moment of the jet transverse width, \(\langle \delta R^2\rangle \), is also shown. The jets are clustered with the anti-\(k_{\mathrm {T}}\) algorithm with a distance parameter of 0.7 for the jet shape observables, and 0.5 for the \(\langle \delta R^2\rangle \) observable. The predictions from the three CH tunes are very similar for all distributions, and agree with the data. On the other hand, the prediction from \(\text {SoftTune}\) differs from the CH tunes, and also does not agree well with the \(\langle \delta R^2\rangle \) distribution in data.

Additional comparisons of the predictions for various tunes of \({\textsc {herwig}} \,7\) tunes with the substructure of jets collected by the ATLAS experiment are shown in Appendix B.

9 Comparison with Z  and W  boson production data

In this section, the performance of the \({\textsc {herwig}} \,7\) tunes is compared with \(\sqrt{s} =13\,\text {Te}\text {V} \) data on Z and W boson production. Predictions for Z and W boson production are obtained with MadGraph 5_amc@nlo v2.6.7 [9] for ME calculations at NLO, which are interfaced with \({\textsc {herwig}} \,7\) using the the FxFx merging scheme [44], with the merging scale set to 30\(\,\text {Ge}\text {V}\). Up to two additional partons in the final state are included in the NLO ME calculations. The PDF in the ME calculations is NNPDF 3.1 NNLO, and the value of \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}})\) in the ME calculations is set to \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}}) =0.118\) in all the predictions considered here.

First, the \(p_{\mathrm {T}} ^{\text {sum}}\) and \(N_{\text {ch}}\) distributions characterizing the UE in Z boson production [45] are compared to simulation in Figs. 16 and 17. Events are required to have two muons with an invariant mass between 81 and 101\(\,\text {Ge}\text {V}\) to select events within the Z boson mass peak. The \(p_{\mathrm {T}} ^{\text {sum}}\) and \(N_{\text {ch}}\) distributions are measured in the transverse region as shown in Fig. 16, and in the toward and away regions as shown in Fig. 17, in analogy to the corresponding distributions measured in MB data introduced in Sect. 3. The regions are defined with respect to the \(p_{\mathrm {T}}\) of the Z boson, calculated from the \(p_{\mathrm {T}}\) of the two muons. The CH tunes describe the data well, and are typically similar to each other. However, the configuration with \(\text {SoftTune}\) fails to give a simultaneous description of the \(p_{\mathrm {T}} ^{\text {sum}}\) and \(N_{\text {ch}}\) distributions in any region at low \(p_{\mathrm {T}} (\mu \mu )\).

Next, the exclusive jet multiplicity distributions in Z and W boson events are shown in Fig. 18 [46, 47]. Events in the Z boson sample contain at least two electrons or muons with \(p_{\mathrm {T}} >20\,\text {Ge}\text {V} \) and \(|\eta |<2.4\), and the invariant mass of the two highest \(p_{\mathrm {T}}\) electrons or muons must have an invariant mass within 20\(\,\text {Ge}\text {V}\) of the Z boson mass. In the W boson measurement, only final states with a muon of \(p_{\mathrm {T}} >25\,\text {Ge}\text {V} \) and \(|\eta |<2.4\) are considered. The transverse mass of the W boson candidate, defined as \(m_{\mathrm {T}} = \sqrt{\smash [b]{2p_{\mathrm {T}} ^\mu p_{\mathrm {T}} ^\text {miss} [1-\cos (\varDelta \phi _{\mu ,{\vec p}_{\mathrm {T}}^{\text {miss}}})]}}\), where \(\cos (\varDelta \phi _{\mu ,{\vec p}_{\mathrm {T}}^{\text {miss}}})\) is the difference in azimuthal angle between the direction of the muon momentum and \(p_{\mathrm {T}} ^\text {miss}\), must satisfy \(m_{\mathrm {T}} >50\,\text {Ge}\text {V} \). In both Z and W events jets are reconstructed using the anti-\(k_{\mathrm {T}}\) algorithm with a distance parameter of 0.4, and are required to satisfy \(p_{\mathrm {T}} >30\,\text {Ge}\text {V} \) and \(|y |<2.4\). Jets must also be separated from any lepton by \(\sqrt{\smash [b]{(\varDelta \eta )^2+(\varDelta \phi )^2}} >0.4\), where \(\phi \) is in radians. The jet multiplicity is well described by all tunes in both Z and W boson events at both low multiplicities, where the ME calculations dominate, and high multiplicities, where the PS is important.

Finally, in Fig. 19, the \(p_{\mathrm {T}} (\mathrm{Z}\,)\) and \(p_{\mathrm {T}} ^{\text {bal}}\) distributions are shown, both for final states containing at least one additional jet. The \(p_{\mathrm {T}} ^{\text {bal}}\) variable is defined as \(p_{\mathrm {T}} ^{\text {bal}} = |{\vec p}_{\mathrm {T}} (\mathrm{Z}\,) +\sum _{\mathrm {jets}}{\vec p}_{\mathrm {T}} (\mathrm {j}) |\). The so-called jet-Z balance (\(\mathrm {JZB}\)) variable, defined as \(\mathrm {JZB} = |\sum _{\mathrm {jets}}{\vec p}_{\mathrm {T}} (\mathrm {j}) |-|{\vec p}_{\mathrm {T}} (\mathrm{Z}\,) |\), is also shown in Fig. 19. All distributions are measured for events with at least one additional jet. The \(p_{\mathrm {T}} (\mathrm{Z}\,)\) predictions for all tunes are similar for \(p_{\mathrm {T}} (\mathrm{Z}\,) >30\,\text {Ge}\text {V} \), where the predictions are driven by the ME calculations. At lower \(p_{\mathrm {T}} (\mathrm{Z}\,)\), where events contain additional hadronic activity that is not clustered into jets, the predictions with the CH tunes are similar to each other, and differ slightly from the prediction with \(\text {SoftTune}\), which provides a closer description of the data at very low \(p_{\mathrm {T}} (\mathrm{Z}\,) <10\,\text {Ge}\text {V} \). The \(p_{\mathrm {T}} ^{\text {bal}}\) and \(\mathrm {JZB}\) distributions are also sensitive to additional hadronic activity not clustered into jets. For \(p_{\mathrm {T}} ^{\text {bal}}\), all tunes are compatible with each other, except at \(p_{\mathrm {T}} ^{\text {bal}} <10\,\text {Ge}\text {V} \), where the prediction with \(\text {SoftTune}\) differs from the predictions with the CH tunes. The \(\mathrm {JZB}\) distributions are well described by all the predictions.

10 Summary

Three new tunes for the multiple-parton interaction (MPI) model of the \({\textsc {herwig}} \,7\) (version 7.1.4) generator have been derived from minimum-bias (MB) data collected by the CMS experiment. All of the CH (“CMS herwig ”) tunes, CH1, CH2, and CH3, are based on the next-to-next-to-leading-order (NNLO) NNPDF 3.1 PDF set for the simulation of the parton shower (PS) in \({\textsc {herwig}} \,7\); the value of the strong coupling at a scale equal to the Z boson mass is \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}}) =0.118\) with a two-loop evolution of \(\alpha _\mathrm {S}\). The configuration of the tunes differs in the PDF used for the simulation of MPI and beam remnants. The tune CH1 uses the same NNLO PDF set for these aspects of the \({\textsc {herwig}} \,7\) simulation, whereas CH2 and CH3 use leading-order (LO) versions of the PDF set. The tune CH2 is based on an LO PDF set that was derived assuming \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}}) =0.118\), and CH3 on an LO PDF set assuming \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}}) =0.130\).

The parameters of the MPI model were optimized for each tune with the professor framework to describe the underlying event (UE) in MB data collected by CMS. The predictions using the tune CH2 or CH3 provide a better description of the data than those using CH1 or \(\text {SoftTune}\). Furthermore, the differences in the predictions of CH2 and CH3 are observed to be small. The configuration of PDF sets in the tune CH3, where the LO PDF used for the simulation of MPI, was derived with a value of \(\alpha _\mathrm {S} (m_{\mathrm {\mathrm{Z}\,}}) \) typically associated with LO PDF sets, is the preferred choice over CH2. Two alternative tunes representing the uncertainties in the fitted parameters of CH3 are also derived, based on the tuning procedure provided by professor.

Predictions using the three CH tunes are compared with a range of data beyond MB events: event shape data from LEP; proton-proton data enriched in top quark pairs, Z bosons and W bosons; and inclusive jet data. This validated the performance of \({\textsc {herwig}} \,7\) using these tunes against a wide range of data sets sensitive to various aspects of the modelling by \({\textsc {herwig}} \,7\), and in particular the modelling of the UE. The event shape observables measured at LEP, which are sensitive to the modelling of final-state radiation, are well described by \({\textsc {herwig}} \,7\) with the new tunes. Predictions using the new tunes are also shown to describe the UE in events containing Z bosons, demonstrating the universality of the UE modelling in \({\textsc {herwig}} \,7\). The kinematics of top quark events, and the modelling of jets in \(\mathrm{t}{\bar{\mathrm{t}}}\) , Z boson, W boson, and inclusive jet data are also well described by predictions using the new tunes. In general, predictions with the new CH tunes derived in this paper provide a better description of measured observables than those using \(\text {SoftTune}\), the default tune available in \({\textsc {herwig}} \,7\).

Fig. 20
figure 20

Normalized plots [17] for the pseudorapidity of charged particles for \(N_{\text {ch}} \ge 1\) (upper left), and \(N_{\text {ch}} \ge 6\) (lower left), for charged particles with \(p_{\mathrm {T}} >500\,\text {Me}\text {V} \). The figure on the upper right shows a similar distribution for \(N_{\text {ch}} \ge 2\), and the lower right for \(N_{\text {ch}} \ge 20\), where the charged particles have \(p_{\mathrm {T}} >100\,\text {Me}\text {V} \). ATLAS MB data are compared with the predictions from \({\textsc {herwig}} \,7\), with the \(\text {SoftTune}\) and CH tunes. The coloured band in the ratio plot represents the total experimental uncertainty in the data. The vertical bars on the points for the different predictions represent the statistical uncertainties