Energy Correlation Functions for Jet Substructure
Abstract
We show how generalized energy correlation functions can be used as a powerful probe of jet substructure. These correlation functions are based on the energies and pairwise angles of particles within a jet, with point correlators sensitive to prong substructure. Unlike many previous jet substructure methods, these correlation functions do not require the explicit identification of subjet regions. In addition, the correlation functions are better probes of certain soft and collinear features that are masked by other methods. We present three Monte Carlo case studies to illustrate the utility of these observables: 2point correlators for quark/gluon discrimination, 3point correlators for boosted //Higgs boson identification, and 4point correlators for boosted top quark identification. For quark/gluon discrimination, the 2point correlator is particularly powerful, as can be understood via a nexttoleading logarithmic calculation. For boosted 2prong resonances the benefit depends on the mass of the resonance.
[1]Table LABEL:#1
\preprint
MIT–CTP 4446
CERN–PH–TH/2013–066
LPN13–026
1 Introduction
The field of jet substructure has evolved significantly over the last few years Abdesselam:2010pt ; Altheimer:2012mn . Many procedures have been developed not only for identifying and classifying jets Seymour:1993mx ; Butterworth:2002tt ; Butterworth:2008iy ; Brooijmans:1077731 ; Thaler:2008ju ; Kaplan:2008ie ; Almeida:2008yp but also for removing jet contamination due to underlying event or pileup Cacciari:2007fd ; Butterworth:2008iy ; Krohn:2009th ; Ellis:2009me ; Alon:2011xb ; Soyez:2012hv . On the theoretical side, there has been substantial progress in computing and understanding these observables and procedures in perturbative QCD Ellis:2010rwa ; Banfi:2010pa ; Walsh:2011fz ; Li:2012bw ; Dasgupta:2012hg ; Feige:2012vc ; Larkoski:2012eh ; Chien:2012ur ; Jouttenus:2013hs . On the experimental side, the ATLAS and CMS experiments at the Large Hadron Collider (LHC) have begun measuring and testing jet substructure ideas ATLASCONF2011073 ; Miller:2011qg ; ATLASCONF2011053 ; ATLAS:2012am ; ATLAS:2012xna ; Aad:2012meb ; Aad:2012raa ; ATLAS:2012dp ; ATLASCONF2012066 ; ATLASCONF2012065 ; CMSPASQCD10041 ; CMSPASJME10013 ; CMS:2011bqa ; Chatrchyan:2012mec ; Chatrchyan:2012tt ; Chatrchyan:2012sn , with pileup suppression becoming increasingly important at higher luminosities. With the recent discovery of a Higgslike particle :2012gk ; :2012gu , jet substructure methods for identifying the decay mode Butterworth:2008iy (and potentially the decay mode) could be vital for testing Higgs properties.
A common strategy for jet substructure studies is to first identity subjets, namely, localized subclusters of energy within a jet. Jet discrimination then involves studying the properties of and relationship between the subjets. For example, BDRS Butterworth:2008iy and related methods Kaplan:2008ie ; Plehn:2009rk ; Plehn:2010st involve first reclustering the jet with the Cambridge/Aachen Dokshitzer:1997in ; Wobisch:1998wt ; Wobisch:2000dk or Catani:1993hr ; Ellis:1993tq jet algorithm and then stepping through the clustering history to identify a hard splitting in the jet; pruning Ellis:2009me is similar. subjettiness Thaler:2010tr ; Thaler:2011gf relies on a (quasi)minimization procedure to identify subjet directions in the jet. Of course, there are jet shapes such as jet angularities Berger:2003iw ; Almeida:2008yp , planar flow Thaler:2008ju ; Almeida:2008yp , Zernike coefficients GurAri:2011vx , and FoxWolfram moments Bernaciak:2012nh that can be used for classifying jets without subjet finding. Considered individually, however, these jet shapes tend not to yield the same discrimination power as subjet methods, since they are sensitive mainly to exotic kinematic configurations and not directly to pronglike substructure.
In this paper, we introduce generalized energy correlation functions that can identify prong jet substructure without requiring a subjet finding procedure. These correlators only use information about the energies and pairwise angles of particles within a jet, but yield discrimination power comparable to methods based on subjets. As we will see, point correlation functions are sensitive to prong substructure, with an angular exponent that can be adjusted to optimize the discrimination power. To our knowledge, the 2point correlators—schematically where the sum runs over all particles and in a jet or event—first appeared in Ref. Banfi:2004yd and independently in Ref. Jankowiak:2011qa , with no previous studies of higherpoint correlators.^{2}^{2}2Our definition of the energy correlation function should not be confused with Refs. Basham:1977iq ; Basham:1978bw ; Basham:1978zq ; Hofman:2008ar , which refer to an ensemble average of products of energies measured at fixed angles. Here, energy correlation functions are measured on an eventbyevent basis.
Besides the novelty of not requiring subjet finding, a key feature of the generalized energy correlation functions is that the angular exponent can be set to any value consistent with infrared and collinear safety, namely . In contrast, observables like angularities Berger:2003iw ; Almeida:2008yp are required to have to avoid being dominated by recoil effects.^{3}^{3}3As will be discussed in Sec. 2.2 and a forthcoming publication broadening , subjettiness may or may not have recoil sensitivity depending on how the axes are chosen. By choosing values of , the correlators are able to more effectively probe smallscale collinear splittings, which will turn out to be useful for quark/gluon discrimination.
To put our work in perspective, it is worth remembering that the basic idea for using energy correlation functions to determine the number of jets in an event is actually quite old. As we will review, the parameter for collisions Parisi:1978eg ; Donoghue:1979vi is essentially a 3point energy correlation function that can be used to identify events that have two jets. However, the parameter is defined as a function of the eigenvalues of the sphericity tensor and therefore only gives sensible values for systems that have zero total momentum and for events that are nearly dijetlike. In contrast, our generalized energy correlation functions give sensible results in any Lorentz frame and can be used to identify any number of jets in an event (or subjets within a jet). In addition, they can be defined in any number of spacetime dimensions.
The remainder of the paper is organized as follows. In Sec. 2, we introduce arbitrarypoint energy correlation functions and define appropriate energy correlation double ratios (built from the point correlator), which can be used to identify a system with (sub)jets. We also contrast the behavior of with subjettiness ratios. We then present three case studies to show how these generalized energy correlation functions work for different types of jet discrimination.

Quark/gluon discrimination. Using (built from the 2point correlator) in Sec. 3, we perform both an analytic study and a Monte Carlo study of quark/gluon separation. Through a nexttoleading logarithmic study, we explain why quark/gluon discrimination greatly improves as the angular exponent approaches zero (at least down to ), highlighting the importance of working with recoilfree observables.

Boosted //Higgs identification. Using (built from the 3point correlator) in Sec. 4, we will see that the discrimination power between QCD jets and jets with two intrinsic subjets from a coloursinglet decay depends strongly on the ratio of the jet mass to its transverse momentum. This occurs because a QCD jet obtains mass in different ways depending on this ratio. In particular, we will see that the energy correlation function performs better than subjettiness in situations where the jet mass is dominated by soft wideangle emissions.

Boosted top quark identification. Using (built from the 4point correlator) in Sec. 5, we find comparable discrimination power to other toptagging methods. While one might worry that the 4point correlators would face a high computational cost, we find that a boosted top event can be analyzed for a single value of in a few milliseconds.
We conclude in Sec. 6 with an experimental and theoretical outlook. The energy correlation functions are available as an addon to FastJet 3 Cacciari:2011ma as part of the FastJet contrib project (http://fastjet.hepforge.org/contrib/).
2 Generalized Energy Correlation Functions
The basis for our analysis is the point energy correlation function (ECF)
(1) 
Here, the sum runs over all particles within the system (either a jet or the whole event). Each term consists of energies multiplied together with pairwise angles raised to the angular exponent . This function is welldefined in any number of spacetime dimensions as well as for systems that do not have zero total momentum. Note that it is infrared and collinear (IRC) safe for all . Moreover, goes to zero in all possible soft and collinear limits of partons.
As written, Eq. (1) is most appropriate for colliders where energies and angles are the usual experimental observables. For hadron colliders, it is more natural to define as a transverse momentum correlation function:^{4}^{4}4We will continue to use the notation ECF, though we will mainly use the transverse momentum version in this paper.
(2) 
where is the Euclidean distance between and in the rapidityazimuth angle plane, , with
(3)  
(4)  
(5)  
(6)  
(7) 
If a jet has fewer than constituents then . Note that the computational cost for with particles scales like .
From the , we would like to define a dimensionless observable that can be used to determine if a system has subjets. The key observation is that the point correlators go to zero if there are only particles. More generally, if a system has subjets, then should be significantly smaller than . One potentially interesting ratio is
(8) 
which behaves much like subjettiness in that for a system of partons plus soft radiation, the observable is linear in the energy of the soft radiation.^{5}^{5}5Unlike subjettiness, this ratio scales like under transverse Lorentz boosts , which is somewhat undesirable when considering systems with several subjets. Of course, this is but one choice for an interesting combination of the energy correlation functions, and one can imagine using the whole set of energy correlation functions in a multivariate analysis.
In this paper, we will work exclusively with the energy correlation double ratio
(9) 
which is dimensionless.^{6}^{6}6This double ratio scales as under transverse Lorentz boosts. One way to motivate this observable is that we already know that subjettiness ratios are good probes of prong substructure Thaler:2010tr ; Thaler:2011gf . As we will see, the notation “” is motivated by the fact that this variable generalizes the parameter Parisi:1978eg ; Donoghue:1979vi . One should keep in mind that involves point correlators, and when clear from context, we will drop the superscript.
The energy correlation double ratio effectively measures higherorder radiation from leading order (LO) substructure. For a system with subjets, the LO substructure consists of hard prongs, so if is small, then the higherorder radiation must be soft or collinear with respect to the LO structure. If is large, then the higherorder radiation is not stronglyordered with respect to the LO structure, so the system has more than subjets. Thus, if is small and is large, then we can say that a system has subjets. In this way, the energy correlation double ratio behaves like subjettiness ratios , with key advantages to be discussed in Sec. 2.2.
2.1 Relationship to Previous Observables
While the definition of the energy correlation double ratio is new, it is related to previous observables for and hadron colliders that have been studied in great detail.
An energyenergy correlation (EEC) function for events was introduced in Ref. Banfi:2004yd for its particularly nice factorization and resummation properties. It is defined as
(10) 
where the sum runs over all particles in the event and is the direction of the thrust axis. This variable is IRC safe for all . The function is only nonzero if the pair of particles is in the same hemisphere. This removes the large correlation of the two initial hard partons which would otherwise dominate the sum, and means that behaves much like the jet angularities Berger:2003iw ; Almeida:2008yp with the same angular exponent . The EEC was introduced because it is insensitive to recoil effects and has smooth behavior for all allowed values of . In particular, has a smooth transition through , whereas angularities exhibit nonsmooth behavior and also are increasingly sensitive to recoil effects as the angular power increases. If one considers only one hemisphere of a dijet event, then is approximately the same as in our notation with . Both observables are sensitive to 1prong (sub)structure, and we will discuss the issue of recoil further in Sec. 2.2.
A related twoparticle angular correlation function was introduced in Refs. Jankowiak:2011qa ; Jankowiak:2012na ; Larkoski:2012eh for discrimination of jets initiated by QCD from jets from boosted heavy particle decays. The angular correlation function is defined as
(11) 
where the function only allows pairs of particles separated by an angular scale of or less to contribute to the observable. The behavior of the observable can be studied as a function of , and jets that are approximately scale invariant should have an angular correlation function that scales as a power of . For a fixed value of , the properties of the angular correlation function are very similar to that of and .
As mentioned above, the notation was chosen because of its relation to the parameter from collisions Parisi:1978eg ; Donoghue:1979vi . The parameter is used to identify twojet configurations without recourse to a jet algorithm or explicit jet axes choice. It is defined as
(12) 
which can also be expressed in terms of the eigenvalues of the sphericity tensor. At first glance, this looks very much like in the sense that the numerator looks like a 2point correlation function with . There is a crucial difference between the behavior of and , however, such that the parameter vanishes for dijet configurations when the jets are backtoback (i.e. ). If we expand around the dijet limit, then the parameter really behaves like a 3point correlation function (i.e. like ). To see this, note that for , the parameter has the simple form
(13) 
where and is the total fourvector of the system.^{7}^{7}7The parameter only properly makes sense if the total momentum of the system is zero, and so is not immediately applicable for hadron collisions. The angle between finalstate particles and in the system is
(14) 
Thus, if we change in the definition of , then can be expressed as
(15) 
which, up to normalization, is the traditional parameter. Of course, at higher orders in perturbation theory the definitions of the parameter and diverge. Both observables are sensitive to 2prong (sub)structure, though gives sensible answers even for systems with nonzero total momentum and has an adjustable angular exponent .
Higherpoint energy correlation functions have been studied very little in the literature. Two early studies for collisions are in Refs. Donoghue:1979vi ; Fox:1979id . However, both define observables that only make sense for systems with total momentum equal to zero and explicitly use operations only defined in threedimensional space, such as crossproducts and the properties of momentum tensors with rank greater than 2. Thus, these observables cannot be easily generalized to determine if a (boosted) system has (sub)jets. Historically, observables like the parameter Parisi:1978eg ; Donoghue:1979vi ; Ellis:1980wv have been used to identify peculiar phase space configurations such as a planar configuration of particles. However, this is not directly related to the number of jets in the event. Recent substructure variables like planar flow Thaler:2008ju ; Almeida:2008yp , Zernike coefficients GurAri:2011vx , and FoxWolfram moments Bernaciak:2012nh are similarly sensitive to peculiar phase space configurations rather than pronglike substructure. Planar flow, for example, vanishes if the constituents of the jet lie along a line, which is a good probe for some (but not all) 3prong configurations. The energy correlation double ratio is designed to directly probe prong configurations, though the high computational cost of likely limits the practical range to (i.e. up to threeprongs).
2.2 Advantages Compared to subjettiness
The variable subjettiness Thaler:2010tr ; Thaler:2011gf (based on jettiness Stewart:2010tn ) is a jet observable that can be used to test whether a jet has subjets, and it has been used in a number of theoretical Englert:2011iz ; Bai:2011mr ; Feige:2012vc ; Curtin:2012rm ; Cohen:2012yc ; Soyez:2012hv ; Ellis:2012zp and experimental ATLAS:2012dp ; ATLAS:2012am substructure studies. Since both subjettiness and the energy correlation double ratio share the same motivation, it is worth highlighting some of the advantages of the energy correlation double ratio.
First, a quick review of subjettiness. It is defined in terms of subjet axes as^{8}^{8}8In Refs. Thaler:2010tr ; Thaler:2011gf , subjettiness was defined with an overall normalization factor to make it dimensionless. Here, we remove the normalization factor so it has the same dimensions as Eq. (8).
(16) 
where the sum runs over all particles in the jet and is the distance from axis to particle . There are a variety of methods to determine the subjet directions, with arguably the most elegant way being to minimize over all possible subjet directions Thaler:2011gf . If a jet has subjets, then should be much larger than , so the observable that is typically used for jet discrimination studies is the ratio
(17) 
As discussed above, this ratio is directly analogous to the energy correlation double ratio .
One immediate point of contrast between subjettiness and the energy correlation double ratio is that does not require a separate procedure (such as minimization) to determine the subjet directions. While novel, this by itself does not necessarily imply that will have better discrimination power than , though it does mean that is a simpler variable to study.^{9}^{9}9In particular, serves two different roles for subjettiness. As in , controls the weight given to collinear or wideangle emissions. In addition, when the minimization procedure is used, controls the location of the axes which minimize subjettiness. When trying to determine the optimal value for for subjet discrimination, it is difficult to disentangle these two effects. We now explain two test cases where can perform better than : insensitivity to recoil for and sensitivity to soft wideangle emissions for .
2.2.1 Insensitivity to Recoil
A recoilsensitive observable is one for which soft emissions have an indirect effect on the observable. In addition to the direct contribution to the observable, soft radiation in a recoilsensitive observable changes the collinear contribution by an amount. An example of a recoilsensitive observable is angularities for the angular exponent (), which was studied in Ref. Banfi:2004yd . Because is insensitive to recoils, it is better able to resolve the collinear singularity of QCD.
For 1prong jets, the effect of recoil on an observable is illustrated in Fig. (a)a. Because of conservation of momentum, soft wideangle radiation displaces the hard jet core from the jet axis. Angularities (i.e. 1subjettiness) are sensitive to this displacement since they are measured with respect to the jet center. For a jet with two constituents separated by an angle (using the notation in Eq. (1) for simplicity),
(18) 
Taking the limit one can view the first term as the contribution directly from the emission , while the second term comes about because particle recoils when it emits particle . The dependence of on the energies and emission angle is different according to the value of . For , the second term is negligible, and the angularities become
(19) 
such that is linear in the soft radiation . However, for smaller values of , the expression for angularities changes because recoil effects become important. For , both terms are identical in the limit and angularities become
(20) 
For , the first term is negligible in the limit, and the angularities are dominated by the effect of recoil of the hard radiation
(21) 
By contrast, the observable has the same behavior for all values of :
(22) 
which is dominated by the splitting angle and energy of the softer particle in the jet for all values of .
Because subjettiness is essentially a sum over subjet angularities, can suffer from the same recoilsensitivity as angularities for , depending on how the subjet axes are determined. For example, if subjettiness is defined using subjet axes, then is recoil sensitive. subjettiness is also recoil sensitive if the subjet axes are aligned with the subjet momenta. A related issue is that if the subjet axes are determined using the minimization procedure, then the minimization is only wellbehaved for .^{10}^{10}10That said, the minimization procedure does eliminate the recoil effect. In all of these cases, the useful range of is limited to . In contrast, the energy correlation double ratio is recoilfree and wellbehaved for the whole IRCsafe range . As we will see in Sec. 3 (and demonstrated recently in Ref. Gallicchio:2012ez ), this is relevant for quark/gluon discrimination, where for is the optimal choice.
It should be noted that one can construct a recoilfree version of subjettiness where the subjet axes are always chosen to minimize the measure (see forthcoming work in Ref. broadening ), regardless of which is used in Eq. (16). We refer to axes defined in this way as “broadening axes”, since is closely related to the jet broadening measure Catani:1992jc . We will make use of this fact later when comparing to 1subjettiness in Sec. 3.3.
2.2.2 Sensitivity to Soft WideAngle Emissions
Another point of contrast between and is in how the two variables behave in the presence of emissions at multiple angular scales. The way subjettiness is defined, every jet is partitioned into subjets, even if there are fewer than “real” subjets. For example, when a jet has a soft subjet separated at large angle (as one might expect from the radiation off a quark or gluon), subjettiness will still identify that soft subjet region, yielding a relatively low value of (and therefore making the jet look more pronglike than it really is). In contrast, because the energy correlation function is sensitive to all possible soft and collinear singularities, takes on a relatively high value in the presence of a soft wideangle subjet, making the jet look less prong like (as desired).
We can show this concretely for using the configuration in Fig. (b)b where there is the following hierarchy of the energies and angles:^{11}^{11}11 Roughly the same conclusions about versus hold for the limit as well, which is relevant for the boson discussion below.
(23) 
Again using the notation in Eq. (1), the energy correlation functions are
(24) 
yielding
(25) 
For subjettiness with three jet constituents, it is consistent to choose axes that lie along the hardest particle in a subjet. For 1subjettiness, the axis lies along particle 1. For subjettiness, one axis lies along particle 1 and the other axis lies along particle 2 or particle 3, depending on the relationship between and . This gives
(26) 
Regardless of the ordering of and we see that:
(27) 
so in the presence of a soft subjet at large angle , yields a larger value than (i.e. more backgroundlike as desired). As we will see in Sec. 4, this allows to perform better than for background rejection in regions of phase space where soft wideangle radiation plays an important role.
One way to understand the improved performance of with respect to is to consider the concrete example of at fixed jet mass .^{12}^{12}12We thank Gregory Soyez for helpful discussions on these points. Using the kinematic limit above, the jet masssquared is given approximately by
(28) 
and it is convenient to define as the energy fraction of the emission that dominates the mass (e.g. if ). For fixed jet mass, QCD backgrounds tend to peak at small values of , but we see from Eq. (26) that does not have any dependence for fixed jet mass. For , if particle 2 dominates the mass (i.e. if a soft wideangle emission dominates the mass), then
(29) 
so penalizes small values of . In this way, acts similarly to taggers that reject jets if the kinematics of the dominant splitting of the jet is consistent with background Seymour:1993mx ; Butterworth:2002tt ; Brooijmans:1077731 ; Butterworth:2008iy ; Ellis:2009me ; Krohn:2009th . In contrast, only exploits the degree to which radiation is collimated with respect to the two subjet directions, and does not take into account the dependence at fixed jet mass.
If particle 3 dominates the mass (i.e. if the mass is dominated by a hard core of energy), then is constant in the energy fraction , and so is no longer affected by the kinematics of the emission that generated the mass. However, there is still the potential for improved performance in identifying boosted color singlet resonances like bosons. For a boosted boson, emissions at wide angle with respect to the angle between decay products are suppressed by color coherence. As one goes to higher boosts where the ratio of jet mass to jet decreases for fixed jet radius, the volume of phase space for allowed emissions decreases, which can also be seen as a consequence of angular ordering. It is therefore less likely for a boson signal to generate final state radiation at large , while background QCD jets will emit at large angle independently of the . Because radiation at large angles has an enhanced effect on as compared to , cf. Eq. (27), we expect to be more effective at discriminating colorsinglet signals from background QCD jets.
3 Quark vs. Gluon Discrimination with
Our first case study is to use the energy correlation functions to discriminate between quark jets and gluon jets. The observable contains the 2point energy correlation function and so is sensitive to radiation in a jet about a single hard core.^{13}^{13}13The CMS experiment uses an observable they call for quark versus gluon discrimination Chatrchyan:2012sn ; Pandolfi:1480598 . It is related to the limit of as . This case study is simple enough that we can predict the quark/gluon discrimination power through an analytic calculation, which we will subsequently validate with Monte Carlo simulations. In our later case studies involving higherpoint correlators, we will rely on Monte Carlo alone.
In any discussion of quark–gluon discrimination, one should start with a reminder that defining what is meant by a quark or a gluon jet is a subtle task, since the one existing infraredsafe way of defining quark and gluon jets Banfi:2006hf works only at parton level. Existing work on practical aspects of quark–gluon discrimination in Refs. Gallicchio:2011xq ; Gallicchio:2012ez ; Krohn:2012fg ; Chatrchyan:2012sn ; Pandolfi:1480598 has not entered into these issues. Instead the discussion has relied on Monte Carlo simulations, defining a quark (gluon) jet to be whatever results from the showering of a quark (gluon) parton. We will adopt a variant of this methodology in our Monte Carlo studies. Our analytic approach will instead define a quark or gluon jet in terms of the sum of the flavors of the partons contained inside it. It is based on resummation and therefore contains similar physics to the Monte Carlo parton shower.
3.1 Leading Logarithmic Analysis
We begin our analysis by considering the leading logarithmic (LL) structure of the cross section for the observable . With equal to the logarithm of , we define LL order as including all terms in the cross section that scale like , for . At LL order, quark versus gluon jet discrimination can be understood as a consequence of quarks and gluons having different color charges. To LL order, the strong coupling constant can be taken fixed and only the most singular term in the splitting function need be retained. With only one softcollinear gluon emission, the normalized differential cross section for any infrared and collinear safe observable has the same form for both quark and gluon jets:
(30) 
where is the color factor, is the jet radius,^{14}^{14}14We use this somewhat nonstandard notation because will later be used with a different meaning. is the energy fraction of the emitted gluon, is its splitting angle, and is a function of and . Recall that for quarks and for gluons.
At this order, the observable is
(31) 
which takes a maximum value of . So integrating Eq. (30) yields, for small , the cross section
(32) 
We identify the logarithm as
(33) 
which we use in the following expressions for compactness. This distribution can be resummed to LL order by exponentiating the cumulative distribution. The resummed distribution that follows is then
(34) 
Because the quark color factor is smaller than the gluon color factor, the Sudakov suppression is less for quarks. Thus, the distribution for quark jets is peaked at smaller values than for gluon jets.
To figure out the quark/gluon discrimination power from this resummed distribution, we will make a sliding cut on and count the number of events that lie to the left of the cut. Adjusting this cut then defines a ROC curve relating the signal (quark) jet efficiency to the background (gluon) jet rejection. To LL accuracy, the (normalized) cumulative distributions for quarks and gluons are:
(35) 
Note that at LL order, there is a simple relationship between these cumulative distributions:
(36) 
Thus, if a sliding cut on retains a fraction of the quarks, it will retain a fraction of the gluons. The quark/gluon discrimination curve is then
(37) 
which (perhaps surprisingly) is independent of . This LL discrimination result holds for a wide class of IRC safe observables sensitive to the overall jet color factor, including the jet mass. Only beyond LL order does the discrimination curve depend on .
3.2 NexttoLeading Logarithmic Analysis
We continue our analysis to nexttoleading logarithmic (NLL) order, which we define as including all terms that scale as and in . In addition, we will also include the nonlogarithmically enhanced term arising at . At NLL order, there are several new effects that must be included, which together turn out to improve the quark/gluon discrimination power of compared to the LL estimate. The dominant effects are subleading terms in the splitting functions and phase space restrictions due to multiple emissions. In addition, one must account for the running of , fixedorder corrections, and nonglobal logarithms Dasgupta:2001sh arising from the phase space cut of the jet algorithm. We will consider how these affect the discrimination power of , ultimately showing that small values of improve quark/gluon discrimination. We will work in an approximation of small jet radius, , which will allow us to consider only the effects of radiation from the jet, while neglecting modifications associated with the full antenna structure of initial and finalstate partons.
The resummation to NLL for generic (global) observables was carried out in Ref. Banfi:2004yd . The central result of that analysis was an expression for the NLL cumulative distribution for an arbitrary observable (satisfying certain basic conditions, e.g. recursive infrared safety). From Ref. Banfi:2004yd , the probability that the value of an observable is less than takes a form such as
(38) 
where is a matching factor to fixed order, , and is the EulerMascheroni constant. In a fixedcoupling approximation, the “radiator” function for the observable is
(39) 
where is the color factor of the jet and encodes subleading terms in the splitting functions.^{15}^{15}15To obtain Eq. (39), we used the fact that, for a general jet observable that takes the form
(40) 
unless an explicit scale is used as the argument of . Because for a jet is nonglobal, it is necessary to include an extra factor in the resummation, discussed in detail in Sec. 3.2.3. We will also include information obtained by matching to the fixedorder cross section, where the matching procedure is described in App. A.
, where
Armed with the matched NLL cumulative distribution, including the nonglobal and corrections, we can now determine the quark versus gluon discrimination curve by numerically inverting and plugging it into the expression for . This is shown for various values of in Fig. (a)a. In Fig. (b)b, we fix 50% quark efficiency and show the gluon rejection rate (i.e. one minus the gluon efficiency) as a function of for . Also on this plot is an approximate analytic expression for the rejection rate as a function of that we derive below in Eq. (3.2.4). We see that the discrimination power improves as decreases. It is, however, not sensible to take too small: for our observable is collinear unsafe, and large nonperturbative effects can be expected as approaches zero. Furthermore for the convergence of our calculation breaks down (cf. App. B).
To understand the behavior of Fig. (b)b semianalytically, we will study the impact of different physical effects on the discrimination. To do so, we will again express in terms of so as to determine the discrimination power of a cut on . In fact, we are most interested in the exponent relating to (as in Eq. (36)), so we will actually relate the logarithms of the two cumulative distributions to one another. We are interested in the regime where , which, from Eq. (35), implies that . The logarithm of the cumulative distribution has the schematic expansion
(41) 
With the power counting of , we will consider all terms from Eq. (41) that scale as , , or . This corresponds to all terms at order from Eq. (41), as well as the terms at , , and . To illustrate this power counting, consider, for example, the term , which scales as as one varies while keeping fixed and of order .
In what follows we will pay special attention to the terms at order and , which turn out to be the most relevant ones when establishing deviations from our LL analysis and whose dominant contributions have clearly identifiable physical origins. The terms at order and are simply proportional to the LL color factor, multiplied by powers of the function, and so do not significantly modify the LL analysis.
3.2.1 Subleading Terms in Splitting Functions
We first consider the effect on the discrimination from the subleading terms in the splitting functions. In the observable , controls the weight given to collinear and wideangle emissions in the jet. At large values of , wideangle emissions are given greater weight, and at small values of , collinear emissions are given greater weight. Wideangle soft radiation is controlled by the term in the splitting function that diverges as the energy fraction goes to zero; i.e., the term . Both quarks and gluons have the same functional form for the soft limit of the splitting function, with the only difference being the overall color factor. By contrast, collinear emissions are controlled by the subleading terms in the splitting function, which differ for quarks and gluons (i.e. different values of the coefficient). Therefore, as goes to zero and the collinear emissions become more important in , the differences between the quark and gluon splitting functions are accentuated.
To see this behavior directly from Eq. (38), we can ignore the dependent prefactor and focus on the factor. We can write , where
(42) 
which is for . We then have
(43)  
This last form allows us to relate the cumulative distribution for gluons to that of quarks, in the same spirit as Eq. (36):
(44) 
This implies that the separation between the quark and gluon distributions increases as decreases and so smaller values of result in better discrimination. Because this effect first arises at , there will be corrections at due to the running coupling. Note also that the coefficient is quite small in QCD, and so the total effect from the subleading terms in the splitting functions on the discrimination power is minimal.
3.2.2 Multiple Emissions
Next, consider the effect of multiple emissions. The Sudakov logarithm corresponds to the integral of the area (in , space) over which emissions are forbidden. At LL, any number of emissions can lie arbitrarily close to the lower boundary of the phase space region without changing the value of the observable. At NLL, one must consider the cumulative effect of the emissions that lie near the phase space boundary. Multiple emissions tend to increase the value of the observable , and so, for a fixed value of , they must be suppressed. This introduces an extra degree of discrimination between quarks and gluons; there are likely to be more such emissions for gluons than quarks and so it costs more to “accept” a gluon jet. For a given LL Sudakov factor, the extent of the boundary region is effectively increased as is decreased, leading to better quark versus gluon discrimination at small .
In Eq. (38), the effect of muliple emissions is seen in the dependent prefactor. For small values of , the prefactor has the expansion
(45)  
We will drop terms at and higher, which constrains us to consider . The cumulative distribution can then be written approximately as
(46) 
which allows us to relate in terms of as
(47) 
This again suggests an increase in discrimination power for relatively small . While this effect appears at order rather than , it has a substantially larger coefficient.
3.2.3 NonGlobal Logarithms
Because jets are defined in a restricted phase space, nonglobal logarithms may contribute to the quark versus gluon discrimination power. The effect of nonglobal logarithms on the cumulative distribution can, for our purposes, be approximated in the large limit as Dasgupta:2001sh ; Dasgupta:2002bw ; Banfi:2002hw
(48) 
This neglects some contributions starting at order in the exponent, but these would not affect the quarkgluon discrimination at our accuracy. Recently a first numerical calculation has been performed including the full structure Hatta:2013iba and it suggests that finite corrections are small.
If we temporarily ignore the dependent prefactor in Eq. (38), the inclusion of nonglobal logarithms leads to
(49) 
All quark/gluon dependence resides in the color factor inside , so we still have the property from the LL calculation (again, ignoring the prefactor and setting )
(50) 
Hence nonglobal logarithms do not modify the above arguments in any significant way.
This analysis holds for the anti jet algorithm, whose boundary is unaffected by soft radiation at angles . For other algorithms of the generalized family, which have irregular, softemissiondependent boundaries, there are additional terms, clustering logarithms Appleby:2002ke ; Delenda:2006nf , which also appear starting from order . Some of the clustering logarithms involve color factor combinations such as and for quarks and gluons respectively, and so presumably would have an impact on quarkgluon discrimination at our accuracy. We leave the study of these terms for future work.
3.2.4 Summary of NLL Result
Using the results of Ref. Banfi:2004yd and App. A to include all effects up through in the logarithm of the cumulative distribution we find
(51) 
This expression includes two terms beyond those discussed in the subsections above. The one proportional to , where function coefficient, has two origins: it comes from the running coupling corrections to the contribution from the subleading terms in the splitting functions and from the runningcoupling corrections to the relation between the logarithm and . The last term in parentheses comes from matrix element corrections, discussed in detail in App. A. It depends on the choice of the jet definition, including the procedure by which one defines quark versus gluon jets at parton level. Specifically, we assume any algorithm is equivalent to the generalized family of jet algorithms at order , and at this order define a quark jet to be one that contains a quark and a gluon, while jets containing or are considered to be gluon jets.^{16}^{16}16In contrast to the situation with LO studies, at it does not makes sense to discuss jet flavor based on the flavor of the parton that “initiates” the jet, since interference effects between diagrams mean that the initiating parton cannot be uniquely identified. The question of quarkgluon jet definition at fixed order is discussed further in App. A. Beyond , the calculation assumes that the algorithm maintains a rigid circular boundary in the presence of multiple soft particles at angles of order , i.e. that it behaves like the anti algorithm. is the oneloop
Note that every subleading term in Eq. (3.2.4) is proportional to a difference of color or quark number factors and so the discrimination power depends sensitively on these differences. The overall quark versus gluon discrimination power increases as is decreased (even though the last term favors larger values of for ). Numerically, this behavior is dominated by the subleading terms in the splitting functions and the multiple emissions effect. The effect of the subleading terms in the splitting functions goes like and so is formally more important than the multiple emissions effect which is . However, the effect of the subleading terms is multiplied by the small number and so is numerically smaller than the contribution from multiple emissions. Runningcoupling and fixedorder effects are significantly smaller.
Robustly, then, smaller values of lead to better discrimination between quark and gluon jets. One explicitly sees that we have an expansion in powers of , and so it can only be trusted for substantially larger than ; in practice, perhaps (see App. B). It is interesting to comment also on traditional angularities: for most of Eq. (3.2.4) still holds, and only the last term in parentheses would be modified. However, for angularities are dominated by recoil effects, with a structure that is independent of , and so we expect that the discrimination should saturate. Because the energy correlation double ratio is recoilfree for all values of , it is better able to probe the collinear singularity and multiple emission effects that distinguish quarks from gluons.
3.3 Monte Carlo Study
We now use a showering Monte Carlo simulation to validate the above NLL analysis of . A similar study of the EEC function appears in Ref. Gallicchio:2012ez , where it was called the twopoint moment.^{17}^{17}17Ref. Gallicchio:2012ez examined the quarkgluon discrimination for a range of values and reached a conclusion that is consistent with ours. While their initial analysis naïvely suggests that , Fig. 18, performs worse than jet broadening (“girth”, or equivalently with ), Fig. 13, that comparison involves different Monte Carlo event samples. Table 1 of Ref. Gallicchio:2012ez compares the observables on equal footing, which shows that indeed has better discrimination power than jet broadening, consistent with our discussion here. Through this paper, jets are identified with the anti algorithm Cacciari:2008gp using FastJet 3.0.3 Cacciari:2011ma . No detector simulations are used other than to remove muons and neutrinos from the event samples before jet finding, as was done in analyses for the BOOST 2010 report Abdesselam:2010pt .
We generate pure quark and gluon dijet samples from the processes and in Pythia 8.165 Sjostrand:2006za ; Sjostrand:2007gs at the 8 TeV LHC using tune 4C Corke:2010yf . While Pythia is not fully accurate to NLL, it does include subleading terms in the splitting functions and multiple emissions, so not surprisingly we find improved discrimination at smaller values of , in agreement with Sec. 3.2. We scan over various jet radii and cuts to study the dependence of the quark/gluon discrimination on these parameters. For this study, we only use the hardest reconstructed hadronlevel jet in the event with a transverse momentum in the ranges of , , or .^{18}^{18}18The reason for focusing only on the leading jet is that we want to minimize ambiguities related to defining quark and gluon jets. The subleading jet is the one more likely to have undergone radiation, and with radiation, quark jets may change into gluon jets, and viceversa. Additionally, the local emission environment is changed (e.g. nonglobal logs may become more important). The probability that an event has radiation in the vicinity of the subleading jet is , while it is near the leading jet. As a crosscheck on the flavour composition of our events, we have clustered the partonlevel showered events with the flavor algorithm Banfi:2006hf . We find that the flavor of the leading jet is consistent with expectations except in a small fraction of events, between a few percent and ten percent depending on the generator. If the hardest jet in the event lies outside the range of interest, the event is ignored. In addition, we scan over jet radii values of , , and . Because our broad conclusions hold for all samples generated, we only show representative plots to illustrate the quark/gluon performance of .
In Fig. (a)a, we plot the distribution of for jets initiated by quarks and gluons with transverse momentum in the range and jet radius in Pythia. As expected, the gluon curve lies at larger values than the quark curve because of the greater Sudakov suppression in gluon jets. The quark/gluon discrimination curves for different values of are shown in Fig. (b)b, which are directly comparable to the NLL results in Fig. 2, up to jet contamination effects included in Pythia such as underlying event and initialstate radiation. Again, we see that is the optimal value. In Fig. (a)a, we show the gluon rejection rate for 50% quark efficiency as a function of , comparing different ranges and values, all of which favor small values of . Note that the gluon rejection power degrades as the jet radius is increased, exhibited in Fig. (a)a. This may be associated with the increase in the amount of underlying event and initialstate radiation captured in the jet as the jet radius increases. This radiation is uncorrelated with the dynamics of the quark or gluon which initiates the jet. The degradation is most prominent at large values of , where wide angles in the jet are emphasized (which is where most of the uncorrelated radiation resides).