1 Introduction

Minimum bias proton–proton collisions present an interesting, and theoretically challenging subject for detailed studies. Their understanding is important for the interpretation of measurements of heavy-ion collisions, and in the search for signatures of new physics at the Large Hadron Collider (LHC) and Fermilab.

However, the wealth of experimental information is currently poorly understood by theoretical models or Monte Carlo (MC) event generators, which are unable to explain with one set of parameters all the measured observables. Examples of measured observables which are not presently well described theoretically include the reported multiplicity distribution [13], the transverse momentum distribution [4] and the variation of the transverse momentum with multiplicity [57].

In this paper, we present measurements of the transverse sphericity for minimum bias pp events over a wide multiplicity range at several energies using the ALICE detector. Transverse sphericity is a momentum space variable, commonly classified as an event shape observable [8]. Event shape analyses, well known from lepton collisions [911], also offer interesting possibilities in hadronic collisions, such as the study of hadronization effects, underlying event characterization and comparison of pQCD computations with measurements in high E T jet events [1214].

The goal of this analysis is to understand the interplay between the event shape, the charged particle multiplicity, and their transverse momentum distribution. Hence, the present paper is focused on the following aspects:

  • The evolution of the mean transverse sphericity with multiplicity, 〈S T〉(N ch). This study was done for different subsets of events defined by the transverse momentum of the leading particle;

  • the behavior of the mean transverse momentum as a function of multiplicity, 〈p T〉(N ch);

  • the normalized transverse sphericity distributions for various multiplicity ranges.

The results of these analyses are compared with event generators and will serve for a better understanding of the underlying processes in proton–proton interactions at the LHC energies.

2 Event shape analysis

At hadron colliders, event shape analyses are restricted to the transverse plane in order to avoid the bias from the boost along the beam axis [12]. The transverse sphericity is defined in terms of the eigenvalues: λ 1>λ 2 of the transverse momentum matrix:

$$\mathbf{S}_{\mathbf{xy}}^{\mathbf{Q}} = \frac{1}{\sum_{i} {p_{\mathrm{T}}}_{i}}\sum _{i} \left (\begin{array}{@{}c@{\quad}c@{}} {p_{\mathrm{x}}}_{i}^{2} & {p_{\mathrm{x}}}_{i} \, {p_{\mathrm{y}}}_{i} \\ {p_{\mathrm{y}}}_{i} \, {p_{\mathrm{x}}}_{i} & {p_{\mathrm{y}}}_{i}^{2} \end{array} \right )\ [\mbox{GeV/$c$}] $$

where (p x i ,p y i ) are the projections of the transverse momentum of the particle i. The index i runs over the all particles in an event.

Since \(\mathbf{S}_{\mathbf{xy}}^{\mathbf{Q}}\) is quadratic in particle momenta, this sphericity is a non-collinear safe quantity in pQCD. For instance, if a parton with high momentum along the x direction splits into two equal collinear momenta, then the sum \(\sum_{i} {p_{\mathrm{x}}}_{i}^{2}\) will be half that of the original momentum. To avoid this dependence on possible collinear splittings, the transverse momentum matrix is linearized as follows:

$$\mathbf{S}_{\mathbf{xy}}^{\mathbf{L}} = \frac{1}{\sum_{i} {p_{\mathrm{T}}}_{i}}\sum _{i} \frac{1}{ {p_{\mathrm{T}}}_{i} }\left (\begin{array}{@{}c@{\quad}c@{}} {p_{\mathrm{x}}}_{i}^{2} & {p_{\mathrm{x}}}_{i} \, {p_{\mathrm{y}}}_{i} \\ {p_{\mathrm{y}}}_{i} \, {p_{\mathrm{x}}}_{i} & {p_{\mathrm{y}}}_{i}^{2} \end{array} \right ) $$

The transverse sphericity is defined as

$$ S_{\mathrm{T}} \equiv \frac{2\lambda_{2}}{\lambda_{2}+\lambda_{1}}. $$
(1)

By construction, the limits of the variable are related to specific configurations in the transverse plane

$$S_{\mathrm{T}} = \begin{cases} 0 & \text{``pencil-like'' limit} \\ 1 &\text{``isotropic'' limit.} \end{cases} $$

This definition is inherently multiplicity dependent, for instance, S T→0 for very low multiplicity events.

3 Experimental conditions

The relevant detectors used in the present analysis are the Time Projection Chamber (TPC) and the Inner Tracking System (ITS), which are located in the central barrel of ALICE inside a large solenoidal magnet providing a uniform 0.5 T field [15].

The ALICE TPC is a large cylindrical drift detector with a central membrane maintained at −100 kV and two readout planes at the end-caps composed of 72 multi-wire proportional chambers [16]. The active volume is limited to 85<r<247 cm and −250<z<250 cm in the radial and longitudinal directions, respectively. The amount of material between the interaction point and the active volume of the TPC corresponds to 11 % of a radiation length, averaged in |η|<0.8. The central membrane divides the nearly 90 m3 active volume into two halves. The homogeneous drift field of 400 V/cm in the Ne–CO2–N2 (85.7 %–9.5 %–4.8 %) gas mixture leads to a maximum drift time of 94 μs. The typical gas gain is 104 [7].

The ITS is composed of high resolution silicon tracking detectors, arranged in six cylindrical layers at radial distances to the beam line from 3.9 to 43 cm. The two innermost layers are Silicon Pixel Detectors (SPD), covering the pseudorapidity ranges |η|<2 and |η|<1.4, respectively. A total of 9.8 millions 50 μm×425 μm pixels enable the reconstruction of the primary event vertex and the track impact parameters with high precision, results from alignment with cosmic-ray tracks indicate a space point resolution of about 14 μm [17]. The SPD was also included in the trigger scheme for data collection. The outer third and fourth layers are formed by Silicon Drift Detectors (SDD) with a total of 133k readout channels. The two outermost Silicon Strip Detector (SSD) layers consist of double-sided silicon micro-strip sensors with 95 μm pitch, comprising a total of 2.6 million readout channels. The design spatial resolutions of the ITS sub-detectors (σ ×σ z ) are: 12 μm×100 μm for SPD, 35 μm×25 μm for SDD, and 20 μm×830 μm for SSD.

The VZERO detector consists of two forward scintillator hodoscopes. Each detector is segmented into 32 scintillator counters which are arranged in four rings around the beam pipe. They are located at distances z=3.3 m and z=−0.9 m from the nominal interaction point and cover the pseudorapidity ranges: 2.8<η<5.1 and −3.7<η<−1.7, respectively. The time resolution of this detector is better than 1 ns. Information from the VZERO response is recorded in a time window of ±25 ns around the nominal beam crossing time. The beam-related background was rejected offline using the VZERO time. Also a criterion based on the correlation between the number of clusters and track segments in the SPD was applied.

The minimum bias (MB) trigger used in this analysis required a hit in one of the VZERO counters or in the SPD detector. In addition, a coincidence was required between the signals from two beam pickup counters, one on each side of the interaction region, indicating the presence of passing bunches [1].

3.1 Data analysis

MB events at \(\sqrt{s}= 0.9\mbox{ and }7~\mathrm{TeV}\) (recorded in 2010) and at \(\sqrt{s}=2.76~\mathrm{TeV}\) (recorded in 2011) have been analyzed. The number of analyzed events was about 40, 15.5 and 3.2 millions at 7, 2.76 and 0.9 TeV, respectively. Since no energy dependence is found for the event shape observable, we present mostly results for 0.9 and 7 TeV.

The position of the interaction vertex is reconstructed by correlating hits in the two silicon-pixel layers. The vertex resolution depends on the track multiplicity, and is typically 0.1–0.3 mm in the longitudinal (z) and 0.2–0.5 mm in the transverse direction. The event is accepted if its longitudinal vertex position (z v ) satisfies |z v z 0|<10 cm, where z 0 is the nominal position.

To ensure a good resolution on the transverse sphericity, only events with more than two primary tracks in |η|<0.8 and p T>0.5 GeV/c are selected. The cuts on η and p T ensure high charged particle track reconstruction efficiency for primary tracks [7]. These cuts reduce the available statistics to about 13.8, 4.2 and 0.42 million of MB events for the 7 TeV, 2.76 TeV and 0.9 TeV data, respectively.

At 7 TeV collision energy, the fractions of non-diffractive events after the cuts are 99.5 % and 93.6 % according to PYTHIA6 version 6.421 [18] (tune PERUGIA-0 [19]) and PHOJET version 1.12 [20, 21], respectively. In the case of single-diffractive events the fractions are 0.3 % and 4.8 %, while the double-diffractive events represent 0.2 % and 1.6 % of the sample as predicted by PYTHIA6 and PHOJET, respectively.

3.2 Track selection

Charged particle tracks are selected in the pseudorapidity range |η|<0.8. In this range, tracks in the TPC can be reconstructed with minimal efficiency losses due to detector boundaries. Additional quality requirements are applied to ensure a good transverse momentum resolution and low contamination from secondary and fake tracks [7]. A track is accepted if it has at least 70 space pointsFootnote 1 in the TPC, and the χ 2 per space point used for the momentum fit is less than 4. Tracks coming from secondary interactions in general have larger transverse impact parameter (d 0) than the primary ones. Since the resolution of the transverse impact parameter as a function of p T can be approximated as \(a+\frac{b}{p_{\mathrm{T}}}\), we reject a track if its d 0 exceeds around seven standard deviations of the p T dependent transverse momentum resolution, \(0.245 +\frac{0.294}{p_{\mathrm{T}}^{0.9}} \) (p T in GeV/c, d 0 in cm). This cut was tuned to select primary charged particles with high efficiency and to minimize the contributions from weak decays, conversions and secondary hadronic interactions in the detector material.

3.3 Selection of soft and hard events

The analysis is presented for two categories of events defined by the maximum charged-particle transverse momentum for |η|<0.8 in each event. This method is often used in an attempt to characterize events by separating the different modes of production. It aims to divide the sample into two event classes: (a) events dominantly without any hard scattering (“soft” events) and (b) events dominantly with at least one hard scattering (“hard” events). Figure 1 shows the mean transverse sphericity versus maximum p T (\(p_{\mathrm{T}}^{\max}\)) of the event obtained from minimum bias simulations at \(\sqrt{s}=7~\mathrm{TeV}\) using the particle and event cuts described previously. Note that PYTHIA6 simulations (tunes: ATLAS-CSC [22], PERUGIA-0 and PERUGIA-2011 [23]) exhibit a maximum around 1.5–2.0 GeV/c, while PHOJET shows an intermediate transition slope in \(p_{\mathrm{T}}^{\max}=1\mbox{--}3\mbox{~GeV/c}\). This observation motivated the choice of the following separation cut: “soft” events are defined as events that do not have a track above 2 GeV/c, while “hard” events are all others. The aggregate of both classes is called “all”. The selection of 2 GeV/c has been motivated in the past as an accepted limit between soft and hard processes [24]. For parton-parton interactions the differential cross section is divergent for p T→0, so that a lower cut-off is generally introduced in order to regularize the divergence. For example in PYTHIA6, the default cut-off is 2 GeV/c for 2→2 processes.

Fig. 1
figure 1

Mean transverse sphericity versus \(p_{\mathrm{T}}^{\max}\) for MC simulations at \(\sqrt{s}=7~\mathrm{TeV}\). Results are shown for PHOJET and PYTHIA6 (tunes ATLAS-CSC, PERUGIA-0 and PERUGIA-2011) simulations. The events are required to have more than 2 primary charged particles in |η|<0.8 and transverse momentum above 0.5 GeV/c

Table 1 shows the ratio of “soft” to “hard” events for ALICE data and the generators: PHOJET, PYTHIA6 (tunes ATLAS-CSC, PERUGIA-0 and PERUGIA-2011) and PYTHIA8 version 8.145 [25]. The ALICE ratios are corrected for trigger and vertexing inefficiency, it results in <2 % systematic uncertainty. It illustrates the difficulties to reproduce the evolution of simple observables with collision energy.

Table 1 Ratio of the number of “soft” to “hard” events for data and MC generators according to the event selection criteria defined in the text

3.4 Corrections

The MC simulations used to compute the correction include transport through the detector and full reconstruction with the same algorithms as the data.

To correct the measured mean sphericity for efficiency, acceptance, and other detector effects, and to obtain it as the number of charged particles (N ch) in |η|<0.8 two steps were followed. First, the measured sphericity distributions in bins of measured mid-rapidity charged particle multiplicity (N m) are unfolded using the detector sphericity response matrices. The unfolding implements a χ 2 minimization with regularization [26]. Second, to account for the experimental resolution of the measured multiplicities, the mean values of the unfolded distributions (〈S Tunf) are weighted by the detector multiplicity response, R(N ch,N m). This procedure can be seen as

$$ \langle S_{\mathrm{T}} \rangle (N_{\mathrm{ch}}) = \sum _{m} \langle S_{\mathrm{T}} \rangle^{\mathrm{unf}} (N_{\mathrm{m}}) R(N_{\mathrm{ch}},N_{\mathrm{m}}). $$
(2)

Figures 2 and 3 show an example of the sphericity response matrix with a measured multiplicity of 25 charged particles at mid-rapidity and the multiplicity response matrix, respectively. The correlation between the true multiplicity and the measured multiplicity deviates from unity due to tracking inefficiency. The MC simulations are based on the PYTHIA6 tune ATLAS-CSC. Different simulations were tested, and all produce the same results to within 1 %.

Fig. 2
figure 2

Example of the sphericity response matrix for a measured multiplicity of 25 charged particles at mid-rapidity. The events are generated using the PYTHIA6 tune ATLAS-CSC (pp collisions at \(\sqrt{s}=7~\mathrm{TeV}\)) and then transported through the detector. Particles and tracks with |η|<0.8 and p T>0.5 GeV/c are used

Fig. 3
figure 3

Example of the multiplicity response matrix. The events are generated using PYTHIA6 tune ATLAS-CSC (pp collisions at \(\sqrt{s}=7~\mathrm{TeV}\)) and then transported through the detector. Particles and tracks with |η|<0.8 and p T>0.5 GeV/c are used

The sphericity distributions in four bins of multiplicity: (a) 3≤N ch<10, (b) 10≤N ch<20, (c) 20≤N ch<30 and (d) N ch>30 are also presented. The normalized spectra give the probability of finding an event with certain sphericity at given multiplicity. The normalized spectra were corrected bin-by-bin as follows

$$ P(S_{\mathrm{T}})|_{N_{\mathrm{ch}}} =P \bigl(S_{\mathrm{T}}^{\mathrm{m}} \bigr) \big{|}_{N_{\mathrm{m}}} \times C_{1} \times C_{2}, $$
(3)

where \(P(S_{\mathrm{T}}^{\mathrm{m}})\mid_{N_{\mathrm{m}}}\) is the measured probability of finding an event with sphericity S T in a bin of measured multiplicity (N m). This probability is corrected by C 1 and C 2, which are computed using MC. C 1 is the correction of the spectra at the measured multiplicity bin

$$ C_{1}=\frac{P(S_{\mathrm{T}}^{\mathrm{unf}})}{P(S_{\mathrm{T}}^{\mathrm{m}})}\bigg{|}_{N_{\mathrm{m}}}, $$
(4)

and C 2 corrects the probability by the migration from high to low multiplicity

$$ C_{2}=\frac{P(S_{\mathrm{T}}^{\mathrm{t}})|_{N_{\mathrm{ch}}}}{P(S_{\mathrm{T}}^{\mathrm{t}})|_{N_{\mathrm{m}}}}. $$
(5)

In the expressions, \(P(S_{\mathrm{T}}^{\mathrm{t}})\) is the probability of finding an event with true sphericity \(S_{\mathrm{T}}^{\mathrm{t}}\), where “true” refers to the value obtained at generator level. \(S_{\mathrm{T}}^{\mathrm{t}}\) and \(S_{\mathrm{T}}^{\mathrm{unf}}\) are the true and unfolded sphericity distributions, respectively. The latter are the results of the unfolding of the simulated measurements, i.e. PYTHIA6 (tune PERUGIA0) corrected by PHOJET and vice versa.

Finally, to determine 〈p T〉(N ch), we take the mean p T by counting all tracks that pass the cuts discussed above as a function of measured multiplicity (N m).

Once we get measured mean p T as a function of N m, 〈p Tm(N m), we follow the approximation:

$$ \langle p_{\mathrm{T}} \rangle (N_{\mathrm{ch}}) = \sum _{m} \langle p_{\mathrm{T}} \rangle_{\mathrm{m}} (N_{\mathrm{m}} ) R(N_{\mathrm{ch}},N_{\mathrm{m}}). $$
(6)

Note that in this case an unfolding of the mean p T is not implemented. Figure 4 illustrates the performance of the procedure using PHOJET simulations as input. The response matrices are computed as above using the PYTHIA6 event generator. The corrected points are compared with MC at generation level. The differences, at high multiplicity, reach about 1.5 %.

Fig. 4
figure 4

Performance of the procedure to correct the reconstructed mean p T as a function of multiplicity for “all” events. The method is tested using PHOJET as input and applying corrections derived from PYTHIA. The MC true (PHOJET result at generation level) is compared with the corrected result after simulation and reconstruction

3.5 Systematic uncertainties

The systematic uncertainties on 〈S T〉 are evaluated as follows. To minimize the adverse effects of pile-up on the multiplicity, only runs with a low probability of multiple collisions were used. The parameter used to measure the pile-up level is the mean of the Poisson distribution which is based on the recorded beam luminosity. It characterizes the probability to have n interactions reconstructed as a single event. Furthermore, all events having two reconstructed interaction vertices, separated by more than 8 mm, and having at least 3 associated track segments in the SPD, were tagged as pile-up and rejected. The systematic uncertainty was estimated from the differences between results using runs with the smallest and largest pile-up probability, and for the “all” sample was found to be less than 0.2 %. The uncertainty due to the rejection of secondaries was estimated by increasing their contribution up to ∼8 %. This is done by varying the cut on the distance of closest approach (\(d_{0}>0.0350+\frac{0.0420}{p_{\mathrm{T}}^{0.9}}\), p T in GeV/c, d 0 in cm) of the considered track to the primary vertex in the plane perpendicular to the beam. The event generator dependence was determined from a comparison of the results obtained when either PYTHIA6 or PHOJET were used to compute the correction matrices, and found to be of the order of few percent. The most significant contribution to the systematic uncertainties is due to the correction method. It was estimated from MC by the ratio true-S T to corrected-S T as a function of multiplicity. For example, the largest uncertainty is at low multiplicity (N ch≈3) for the “hard” sample, where it reaches ∼11 %. Different sets of cuts were implemented in order to estimate the systematic uncertainty due to track selection. Table 2 summarizes the systematic uncertainties on 〈S T〉. In addition, other checks were performed to ensure an accurate interpretation of the results. For instance, when applying the analysis to randomized events (where the track azimuthal angles are uniformly distributed between 0 and 2π), we obtain results that are about 10 % larger than in data. The conclusion is that measured sphericity in data is not the result of a random track combination. Also, the analysis was applied to events with sphericity axes in different regions of the TPC, to ensure that the results are not biased by any residual geometry effects.

Table 2 Contributions to the systematic uncertainties on the mean transverse sphericity 〈S T

In the case of the mean transverse momentum as a function of multiplicity the systematic uncertainties are taken from [7], the only difference being the method of correction. The uncertainty was estimated by applying the correction algorithm to reconstructed events generated with PYTHIA6, while the correction matrices were computed using events generated with PHOJET. The final distributions were compared with the results at generator level. For the “all” sample the uncertainty reaches 1.5 %, while for “soft” and “hard” it reaches 1.0 % and 5.1 %, respectively.

For the case of the sphericity distributions in intervals of multiplicity, the main uncertainties are listed in Table 3. They were estimated following similar procedures as described above.

Table 3 Systematic uncertainties on the sphericity distributions

4 Results

In this section the results of the analyses are presented along with predictions of different models: PHOJET, PYTHIA6 version (tunes: ATLAS-CSC, PERUGIA-0 and PERUGIA-2011) and PYTHIA8.

4.1 Mean sphericity

The mean transverse sphericity as a function of N ch at \(\sqrt{s}=0.9\mbox{ and }7~\mathrm{TeV}\) is shown in Fig. 5 for the different event classes. The mean sphericity (right panel) increases up to around 15 primary charged particles, however, for larger multiplicities the ALICE data exhibit an almost constant or slightly rising behavior. For “soft” events and \(\sqrt{s}=0.9~\mathrm{TeV}\), the models are in agreement with the ALICE measurements over the full range of multiplicity, except for PYTHIA8 prediction, which is 5–10 % lower. There is insufficient statistics to perform the unfolding for N ch>18. At 7 TeV, the differences between models and data are below 10 % for “soft” events. For the “hard” events, PHOJET, ATLAS-CSC, PERUGIA-0 and PYTHIA8 predict a lower 〈S T〉 than observed in data, actually the differences between models and data are larger than 10 % for multiplicities below 10 and larger than 40, that is true at 0.9 and 7 TeV. The differences observed are larger than the systematic and statistical uncertainties. It is interesting to note that PERUGIA-2011 describes the data quite well. The fraction of “soft” and “hard” events in data and MC simulations as a function of N ch (integral values given in Table 1) is found to be different between data and the event generators. At large N ch, the event generators generally produce more “hard” events than observed in data. This difference is reflected in the “all” event class, since more “hard” events contribute in the case of the generators, while more “soft” events in the case of data. The largest isotropy in the azimuth is found at high multiplicity, N ch>40 (|η|<0.8, p T>0.5 GeV/c), in a similar pseudo-rapidity density region where the CMS collaboration discovered the long-range near-side angular correlations [27]. Comparing the results at 0.9 and 7 TeV, it is seen that except for PYTHIA8 the predictions of models describe better the 0.9 TeV data than the 7 TeV ones. Lastly, the mean sphericity evolution with multiplicity at the three measured energies are shown in Fig. 6 for “soft”, “hard” and “all” events at \(\sqrt{s}= 0.9,~2.76\mbox{ and }7~\mathrm{TeV}\). The functional form of the mean sphericity as a function of N ch is the same at all three energies in the overlapping multiplicity region.

Fig. 5
figure 5figure 5

Mean transverse sphericity as a function of charged particle multiplicity. The ALICE data are compared with five models: PHOJET, PYTHIA6 (tunes: ATLAS-CSC, PERUGIA-0 and PERUGIA-2011) and PYTHIA8. Results at \(\sqrt{s}=0.9\mbox{ and }7~\mathrm{TeV}\) are shown in the top and bottom rows, respectively. Different event classes are presented: (left) “soft”, (middle) “hard” and (right) “all” (see text for definitions). The statistical errors are displayed as error bars and the systematic uncertainties as the shaded area. The horizontal error bars indicate the bin widths. Symbols for data points and model predictions are presented in the legend

Fig. 6
figure 6

Mean sphericity versus multiplicity for (left) “soft”, (middle) “hard” and (right) “all” events for \(\sqrt{s}= 0.9,~2.76~\mbox{and}~7~\mathrm{TeV}\). The statistical errors are displayed as error bars and the systematic uncertainties as the shaded area

4.2 Mean transverse momentum

The mean transverse momentum as a function of N ch at \(\sqrt{s}=0.9\mbox{ and }7~\mathrm{TeV}\) is shown in Fig. 7. As seen in left panel, PERUGIA-0, PERUGIA-2011 and PYTHIA8 are within the systematic uncertainty bands of the data for soft events, though PYTHIA8 has a different functional form than the data. For the “hard” events there is a significant difference between the data and the generators above a multiplicity of about 20, in particular for the 7 TeV data. For lower multiplicities, ATLAS-CSC has an overall different shape than the other generators. For “all” events, at 0.9 TeV PERUGIA-0 and PERUGIA-2011 best reproduces the data, while the rest of the models do not give a good description. At 7 TeV, the calculations exhibit a change in the slope around N ch=30, which is not observed in the data. At similar multiplicities, the MC mean sphericity reaches a maximum before it decreases with increasing multiplicity (Fig. 5). The similarity in the multiplicity dependence between 〈S T〉 and 〈p T〉 suggests that the models may generate more back-to-back correlated high p T particles (jets) than present in the data.

Fig. 7
figure 7

Mean transverse momentum versus multiplicity. The ALICE data are compared with five models: PHOJET, PYTHIA6 (tunes: ATLAS-CSC, PERUGIA-0 and PERUGIA-2011) and PYTHIA8. Results at \(\sqrt{s}=0.9\mbox{ and }7~\mathrm{TeV}\) are shown in the top and bottom rows, respectively. Different event classes are presented: (left) “soft”, (middle) “hard” and (right) “all”. The gray lines indicate the systematic uncertainty on data and the horizontal error bars indicate the bin widths

4.3 S T spectra in multiplicity intervals

The behavior of averaged quantities does not illustrate the whole complexity of the events. For instance, events with the same multiplicity may have different transverse sphericity and depending on S T they have a different mean transverse momentum. To disentangle these kind of ambiguities among p T, S T and multiplicity, the normalized transverse sphericity spectra (the probability of having events of different transverse sphericity in a given multiplicity interval) are computed at 7 TeV for four different intervals of multiplicity: N ch=3–9, 10–19, 20–29 and above 30. These are shown in Fig. 8 along with their ratios to each MC calculation. In the first multiplicity bin (N ch=3–9), the agreement between data and MC is generally good, but in the second bin (N ch=10–19) the ratio data to MC is systematically lower for S T<0.4 except for PERUGIA-2011. In the last bin of multiplicity the overproduction of high S T events, believed to be due to back-to-back jets (in azimuth), reaches a factor of 3, and there is an underestimation of isotropic events by a factor 2. As in previous cases, the best description is done by PERUGIA-2011.

Fig. 8
figure 8

Sphericity distributions in four bins of multiplicity: (upper-left) 3≤N ch≤9, (upper-right) 10<N ch≤19, (bottom-left) 20<N ch≤29 and (bottom-right) N ch≥30 at \(\sqrt{s}=7~\mathrm{TeV}\). The statistical errors are displayed as error bars and the systematic uncertainties as the shaded area. Lines are drawn to guide the eye

To obtain information about the interplay between multiplicity and 〈p T〉 through the event shapes, we also investigated the 〈p T〉 as a function of 〈S T〉 in intervals of multiplicity. The study is presented using MC generators at \(\sqrt{s}=7~\mathrm{TeV}\), but the conclusion also holds at the other two energies. Figure 9 shows 〈p T〉 as a function of S T for two multiplicity bins (top panels) along with the contribution of each sphericity bin (bottom panels) to the final 〈p T〉, i.e. the 〈p T〉 weighted by the value P(S T). There are two points to emphasize. First, a large dependence of 〈p T〉 on sphericity is observed for high multiplicities while at low ones the dependence is weaker. Second, the sphericity distribution determines the mean p T in a specific bin of multiplicity. For instance, for S T=0.3–0.4 PHOJET and ATLAS-CSC have nearly the same value of 〈p T〉, while the contribution to 〈p T〉 in the multiplicity bin is twice larger for PHOJET compared to ATLAS-CSC. Hence, the reproduction of the sphericity should be taking into account in the tuning of the MC generators.

Fig. 9
figure 9

Mean p T (top) as a function of sphericity for two multiplicity bins (left) 3≤N ch≤9 and (right) N ch≥30 for minimum bias pp collisions at \(\sqrt{s}=7~\mathrm{TeV}\) simulated with four different MC generators: PHOJET, PYTHIA6 (tunes ATLAS-CSC and PERUGIA-2011) and PYTHIA8. Also the contributions of the different event topologies to the averaged mean p T are presented (bottom)

5 Conclusion

A systematic characterization of the event shape in minimum bias proton–proton collisions at \(\sqrt{s}=0.9,~2.76\ \mbox{and} \, \mbox{7~TeV}\) is presented. Confronted with the persistent difficulties of event generators to reproduce simultaneously the charged particle transverse momentum and multiplicity, the transverse sphericity is used to provide insight into the particle production mechanisms. The observables are measured using primary charged tracks with p T>0.5 GeV/c in |η|<0.8 and reported as a function of the charged particle multiplicity at mid-rapidity (N ch) for events with different scales (“soft” and “hard”) defined by the transverse momentum of the leading particle. The data are compared with calculations of standard Monte Carlo event generators: PHOJET, PYTHIA6 (tunes: ATLAS-CSC, PERUGIA-0 and PERUGIA-2011) and PYTHIA8 (default MB parameters).

The MC generators exhibit a decrease of 〈S T〉 at high multiplicity with a simultaneous steep rise of 〈p T〉. On the contrary, in ALICE data 〈S T〉 stays approximately constant or slightly rising (Fig. 5) accompanied with a mild increase in 〈p T〉 (Fig. 7). The mean sphericity seems to primarily depend on the multiplicity and not on \(\sqrt{s}\) (Fig. 6). At high multiplicity (N ch≥30) the generators underestimate the production of isotropic events and overestimate the production of pencil-like events (Fig. 8). It seems that the generators tend to produce large multiplicity events by favoring the production of back-to-back high-p T jets (low S T) more so than in nature. The level of disagreement between data and generators is markedly different for “soft” and “hard” events, being much larger for the latter (Figs. 57). It is worthwhile to point out that PERUGIA-2011 describes the various aspects of the data generally quite well, except for the mean p T , which it overestimates at high multiplicities. Our studies suggest that the tuning of generators should include the sphericity as an additional reference.