1 Introduction
Fluid models are an extremely important tool in many areas of space physics, astrophysics and laboratory plasmas. Even though many physical systems studied in these fields are almost collisionless, where a proper kinetic description should be used, traditional fluid models with isotropic scalar pressures (temperatures), such as the usual magnetohydrodynamic description (MHD) and multi-fluid models based on this description, were extremely successful in modelling, interpreting or at least offering the first insight into many space physics phenomena. For example, it was indeed the simplified fluid approach that allowed Parker (Reference Parker1958b ) to predict the existence of the solar wind, which was surprisingly several decades after the breakthrough discoveries in quantum mechanics and relativity. In recent years, observational studies re-sparked interest in the temperature anisotropy effects, that cannot be studied with the usual MHD fluid descriptions, and we anticipate that the interest will grow even further, once data from the Parker Solar Probe and the future Solar Orbiter missions are analysed.
The correct modelling and understanding of collisionless plasmas in a fluid framework concerns not only systems with plasma temperatures far from an isotropic state. It is sometimes forgotten that while the application of fluid theory to strongly collisional systems is intuitively obvious, the approach to collisionless (or weakly collisional) systems is not. From a linear perspective, the usual MHD description does not converge to the collisionless kinetic description even in the low-frequency long-wavelength limit (even when the kinetic distribution function is considered to be an isotropic Maxwellian), since in the absence of collisions the isotropic equation of state is never correct. Additionally, Landau damping never completely vanishes (even when electrons are hot). The situation is much more complicated from a nonlinear perspective where, on average, the effect of Landau damping might be balanced by processes called stochastic plasma echoes (Meyrand et al. Reference Meyrand, Kanekar, Dorland and Schekochihin2019).
As a linear example, the ordering of the phase speeds in MHD is always slow, Alfvén, fast, i.e. $v_{s}\leqslant v_{A}\leqslant v_{f}$ . In contrast, in collisionless systems with a sufficiently high plasma beta, the real phase speed of the slow mode can become faster than the Alfvén mode, so that the ordering can become Alfvén, slow, fast, i.e. $v_{A}\leqslant v_{s}\leqslant v_{f}$ . Or in another words, the phase speeds of linear eigenmodes that are present in MHD do not hold in the collisionless (or weakly collisional) regime, and this effect exists even if the temperatures are isotropic. From a fluid perspective, the main reason for this discrepancy is that, in magnetized collisionless systems, the pressure fluctuations in the directions parallel and perpendicular to the magnetic field lines are not equivalent, and cannot be described with a single scalar pressure equation. The pressure fluctuations have to be described with two different evolution equations for $p_{\Vert }$ and $p_{\bot }$ , even if the mean pressure values are equal $(p_{\Vert }^{(0)}=p_{\bot }^{(0)})$ and no mean temperature anisotropy exists.
The simplest collisionless fluid description is the CGL fluid model – named after Chew, Goldberger & Low (Reference Chew, Goldberger and Low1956) – and sometimes also referred to as collisionless MHD. The CGL dispersion relation is not equivalent to the MHD dispersion relation (even in the case $p_{\Vert }^{(0)}=p_{\bot }^{(0)}$ ), since the pressure fluctuations still remain anisotropic and the evolution equations for $p_{\Vert }$ and $p_{\bot }$ remain different. In another words, in collisionless systems the distribution function is free to evolve from its initial state and to become anisotropic. By ‘forcibly’ prescribing only one scalar pressure, one effectively prescribes a high-collisionality regime, even if collisions are not prescribed explicitly. Formally, the MHD equations can indeed be derived from the collisionless Vlasov equation, i.e. with no explicit collisional operator. It is therefore often stated, that the MHD description is highly collisional implicitly. Moreover, as discussed for example by Kulsrud (Reference Kulsrud, Galeev and Sudan1983), while in the presence of a magnetic field, transverse motions can in some circumstances be described by fluid-type equations, the determination of pressures as well as longitudinal motions a priori requires a kinetic description.
Here we focus on collisionless fluid models. Nevertheless, weak collisions can be incorporated easily, and calculations just yield additional terms on the right-hand sides of the parallel and perpendicular pressure and heat flux equations. For the simple Bhatnagar–Gross–Krook (BGK) collisional operator (Bhatnagar, Gross & Krook Reference Bhatnagar, Gross and Krook1954), see for example Snyder, Hammett & Dorland (Reference Snyder, Hammett and Dorland1997). A thorough review of anisotropic fluid models, including the collisional dynamics, was presented by Barakat & Schunk (Reference Barakat and Schunk1982). A very good discussion about collisionality of various astrophysical plasmas, such as the solar wind, interstellar medium, accretion disks and galaxy clusters, can be found for example in Schekochihin et al. (Reference Schekochihin, Cowley, Dorland, Hammett, Howes, Quataert and Tatsung2009). Weakly collisional fluid models also seem to be applicable for modelling the upper solar photosphere and chromosphere, where a curious situation exists and the proton–proton collisional frequency is roughly equal to the proton cyclotron frequency – see for example figure 1 in Khomenko et al. (Reference Khomenko, Collados, Díaz and Vitas2014). We note that in this guide we use the definition of a ‘collisionless fluid model’ as being a fluid model that (i) is derived from the collisionless Vlasov equation with zero right-hand side; and (ii) that has two different pressure equations. Our definition therefore differs from an alternative view of for example Zank et al. (Reference Zank, Hunana, Mostafavi and Goldstein2014) and Zank (Reference Zank2014), where models with a scattering operator that reflects charged particle scattering by electromagnetic fluctuations on the right-hand side of the Vlasov equation are also viewed as collisionless.
Fluid modelling of collisionless plasmas is an extremely attractive subject, and an enormous amount of theoretical and numerical work was done in this field in the past. The manuscript presented here has no intention of being a proper review paper in this field. In our opinion, no satisfactory easy-to-read introductory paper exists about collisionless fluid models, and this manuscript attempts to fill such a spot. Instead of just stating the major results and discussing what was done in the past and by whom, we make a significant effort to present a (hopefully consistent) derivation of basic collisionless fluid models. On one hand, the presented calculations might be considered as too detailed in many places. On the other hand, it is exactly the relatively complicated algebra of collisionless fluid models, that makes the field difficult to enter for new researchers. The primary goal of this paper is to allow researchers new to the subject to follow the algebra easily. Instead of spending years, an interested reader should be able to become comfortable with the basics of the subject in a matter of days, or perhaps a couple of weeks. The text is separated to two parts. Part 1 is dedicated to fluid models that are obtained by closing the fluid hierarchy with simple (non-Landau fluid) closures. Part 2 is dedicated to Landau fluid closures.
Here, in Part 1, § 2, we start with the detailed derivation of the pressure tensor equation. The pressure tensor $\boldsymbol{p}$ is then decomposed to its gyrotropic part $\boldsymbol{p}^{g}=p_{\Vert }\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}+p_{\bot }(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})$ (also referred to as $\boldsymbol{p}^{\text{CGL}}$ ), and non-gyrotropic part $\unicode[STIX]{x1D72B}$ , the latter usually called the finite Larmor radius (FLR) corrections to the pressure tensor, or the gyroviscous stress tensor. In the highly collisional decomposition $\boldsymbol{p}=p\unicode[STIX]{x1D644}+\unicode[STIX]{x1D72B}$ , see e.g. Braginskii (Reference Braginskii1958, Reference Braginskii1965), the quantity $\unicode[STIX]{x1D72B}$ is called the stress tensor. The decomposition procedure yields rigorously exact (even though still not closed) evolution equations for $p_{\Vert }$ and $p_{\bot }$ , see e.g. Oraevskii, Chodura & Feneberg (Reference Oraevskii, Chodura and Feneberg1968), Passot & Sulem (Reference Passot and Sulem2004) and Goswami, Passot & Sulem (Reference Goswami, Passot and Sulem2005). Importantly, at the leading order (by neglecting the $\unicode[STIX]{x1D72B}$ and also the non-gyrotropic heat flux contributions $\boldsymbol{q}^{\text{ng}}$ ), the equations of Chew et al. (Reference Chew, Goldberger and Low1956), hereafter referred to as CGL, are recovered. We discuss the paper by Chew et al. (Reference Chew, Goldberger and Low1956) and point out that the paper derived the correct form of the pressure equations with the gyrotropic heat flux contributions included, however, the quantities $q_{n},q_{s}$ used in that paper are related to the usual $q_{\Vert },q_{\bot }$ by relations $q_{n}=q_{\Vert }-3q_{\bot }$ and $q_{s}=q_{\bot }$ . By further neglecting the heat flux contributions, the pressure equations can be written in conservative form. The resulting pressure equations became known as the CGL equations, and they can be interpreted as the conservation laws for the first and second adiabatic invariants (Kulsrud Reference Kulsrud, Galeev and Sudan1983; Gurnett & Bhattacharjee Reference Gurnett and Bhattacharjee2005).
Furthermore, we discuss the general equations of collisionless multi-fluid models. The pressure equations are rigorously exact, even though the system is not closed, since the FLR pressure tensor and the entire heat flux tensor are not specified; for a quick look see (2.89)–(2.92). Rewriting the system to a form where the usual CGL conservation laws are on the left-hand side, and all other contributions on the right-hand side, nicely represents the complicated plasma heating processes that can be encountered, and that are responsible for the breaking of the adiabatic invariants; see (2.100), (2.101). We also discuss the conservation of energy. For the case of periodic boundary conditions, the total conservation of energy has a very illuminating form, that can be found for example in Yang et al. (Reference Yang, Matthaeus, Parashar, Haggerty, Roytershteyn, Daughton, Wan, Shi and Chen2017a ,Reference Yang, Matthaeus, Parashar, Wu, Wan, Shi, Chen, Roytershteyn and Daughton b ). That formulation is obtained by considering the total ‘internal’ energy for each particle species $E_{r}^{\text{int}}=\frac{1}{2}\langle \text{Tr}\,\boldsymbol{p}_{r}\rangle$ (where the brackets represent integration over the entire spatial domain), which only reveals the total plasma heating. Here, we split the internal energy into its parallel and perpendicular components $E_{\Vert r}^{\text{int}}=\frac{1}{2}\langle p_{\Vert r}\rangle$ , $E_{\bot r}^{\text{int}}=\langle p_{\bot r}\rangle$ , and formulate the total conservation of energy with the possibility of anisotropic plasma heating, see (2.113)–(2.116).
The CGL description is analysed in great detail in § 3. We derive the CGL dispersion relation, and discuss properties of the slow, Alfvén and fast modes that are present. We verify many of the classical results of Abraham-Shrauner (Reference Abraham-Shrauner1967), here written in a slightly more convenient notation by using the parallel plasma beta $\unicode[STIX]{x1D6FD}_{\Vert }$ , and the temperature anisotropy ratio $a_{p}=T_{\bot }/T_{\Vert }$ . Collisionless plasmas cannot reach arbitrarily large values of temperature anisotropy, and the linear CGL eigenmodes can indeed become unstable, with the associated instabilities referred to as the (parallel and oblique) firehose instability and the mirror instability. Similarly to MHD, the simple CGL description does not contain any dispersive effects and is technically scale invariant, even though valid only on the largest scales. The instability thresholds present in the CGL model therefore can be referred to as ‘hard’ thresholds, i.e. thresholds that are obtained in the long-wavelength low-frequency limit. The firehose and mirror instabilities are believed to play a crucial role in solar wind dynamics (see e.g. Hellinger et al. Reference Hellinger, Trávníček, Kasper and Lazarus2006; Bale et al. Reference Bale, Kasper, Howes, Quataert, Salem and Sundkvist2009), and in figure 1, they are plotted in the usual $\unicode[STIX]{x1D6FD}_{\Vert }-T_{\bot }/T_{\Vert }$ plane with logarithmic scales. While the firehose threshold matches the one obtained from kinetic theory, the CGL mirror threshold contains the well-known factor of 6 error for large $\unicode[STIX]{x1D6FD}_{\Vert }$ values. The factor of 6 error can be interpreted as inadequacy of the adiabatic CGL closure in the very slow-dynamics context, such as the mirror instability. This is further addressed in § 9.4, where we discuss that the ‘static’ closure, which can be viewed as a generalization of isothermal closure in the presence of temperature anisotropy and variations of magnetic field strength, reproduces the correct mirror threshold (Constantinescu Reference Constantinescu2002; Chust & Belmont Reference Chust and Belmont2006; Passot, Ruban & Sulem Reference Passot, Ruban and Sulem2006).
We discuss the core differences between MHD and CGL, which can be nicely summarized with the concept of adiabatic indices $\unicode[STIX]{x1D6FE}$ , related to the number of degrees of freedom $i$ by $\unicode[STIX]{x1D6FE}=(i+2)/i$ . While MHD is fully three-dimensional with $\unicode[STIX]{x1D6FE}=5/3$ , the CGL can be viewed as composed of one- and two-dimensional dynamics with $\unicode[STIX]{x1D6FE}_{\Vert }=3$ and $\unicode[STIX]{x1D6FE}_{\bot }=2$ . We also address the velocity and magnetic field eigenvectors. Similarly to the velocity field eigenvector in MHD (see figure 2), which shows a ‘singular’ behaviour for strictly parallel propagation with $V_{A}=C_{s}$ , the CGL eigenvector (figure 3) shows similar singularity for $\unicode[STIX]{x1D6FD}_{\Vert }=2/(4-T_{\bot }/T_{\Vert })$ . We also briefly discuss fluid models with empirical ‘free’ polytropic indices $\unicode[STIX]{x1D6FE}_{\Vert },\unicode[STIX]{x1D6FE}_{\bot }$ , studied for example by Hau & Sonnerup (Reference Hau and Sonnerup1993), Hau et al. (Reference Hau, Phan, Sonnerup and Paschmann1993) and Abraham-Shrauner (Reference Abraham-Shrauner1973).
In § 4, we introduce the simplest dispersive effects by including the Hall term into the induction equation, and study dispersion relations of the Hall-CGL model. We focus on the parallel firehose instability, and show that the instability is indeed associated with the whistler mode, see figures 4 and 5. We show that negative real frequencies $\unicode[STIX]{x1D714}_{r}<0$ have to be handled carefully, and that non-causal analytic solutions (4.32)–(4.35) have to be modified to the causal form (4.43)–(4.46). We briefly discuss the simplest ion-cyclotron resonances, and compare solutions of the Hall-CGL model with solutions of linear kinetic theory, see figure 6.
In § 5, we evaluate the FLR tensor $\unicode[STIX]{x1D72B}$ at several levels of approximation. The evaluation of the FLR tensor is cumbersome, because the tensor is described by the pressure tensor equation implicitly. We first reproduce the fully nonlinear ‘inversion’ procedure on how to obtain $\unicode[STIX]{x1D72B}$ from expression $(\hat{\boldsymbol{b}}\times \unicode[STIX]{x1D72B})+(\hat{\boldsymbol{b}}\times \unicode[STIX]{x1D72B})^{\text{T}}$ that can be found for example in Hsu, Hazeltine & Morrison (Reference Hsu, Hazeltine and Morrison1986), Passot & Sulem (Reference Passot and Sulem2004) and Ramos (Reference Ramos2005) as a brief note. Applying this inversion procedure to the pressure tensor equation evaluates the FLR corrections correctly along the magnetic field lines, but leads to an equation for $\unicode[STIX]{x1D72B}$ that is still implicit. Nevertheless, evaluating the resulting equation at the leading order (technically first order in frequency and wavenumber), leads to an explicit expression for $\unicode[STIX]{x1D72B}$ . We first recover the nonlinear result of Schekochihin et al. (Reference Schekochihin, Cowley, Rincon and Rosin2010), derived in that paper slightly differently, without using the inversion procedure. We point out that the result can be slightly simplified, and rearranging the expression yields two different useful forms for writing the nonlinear $\unicode[STIX]{x1D72B}$ . Finally, by using the non-dispersive (MHD) induction equation, we obtain the nonlinear result of Ramos (Reference Ramos2005) (see also Macmahon (Reference Macmahon1965)). For further evaluation of nonlinear FLR corrections to higher orders, an advanced reader is referred to Ramos (Reference Ramos2005). We continue with the evaluation of the FLR tensor in the linear approximation, i.e. when the magnetic field lines are not too distorted. For the first-order tensor (here called FLR1), we recover the classical result of Yajima (Reference Yajima1966), which is notably different from the one provided by Oraevskii et al. (Reference Oraevskii, Chodura and Feneberg1968). In the isotropic case, the FLR1 tensor is consistent with the one extracted from the stress tensor of Braginskii (Reference Braginskii1965), if the collisional terms are ‘ignored’. It is noteworthy that a proper collisionless limit cannot be achieved from the stress tensor of Braginskii (Reference Braginskii1965), because expressions are proportional to $\unicode[STIX]{x1D70F}$ (and also $1/\unicode[STIX]{x1D70F}$ ), where $\unicode[STIX]{x1D70F}$ is time between two collisions ( $\unicode[STIX]{x1D70F}\sim 1/\unicode[STIX]{x1D708}$ where $\unicode[STIX]{x1D708}$ is the usual collisional frequency), so the collisionless limit $\unicode[STIX]{x1D70F}\rightarrow \infty$ does not work.
We consider the Hall-CGL-FLR1 fluid model, and we provide dispersion relation for generally oblique propagation, which can be also found in Hunana & Zank (Reference Hunana and Zank2017). For higher-order FLR corrections, we only provide analytic dispersion relations for the parallel propagating whistler and ion-cyclotron modes, as well as the perpendicular fast mode. Nevertheless, we provide linearized, normalized and Fourier transformed equations written in the $x$ – $z$ plane for all the fluid models. To obtain the dispersion relation for an oblique propagation, the reader is encouraged to use analytic software such as Maple or Mathematica, or to solve the system numerically. The second-order FLR corrections (FLR2) are here defined as containing the Hall term and the time derivative $\unicode[STIX]{x2202}\unicode[STIX]{x1D72B}/\unicode[STIX]{x2202}t$ . We also consider FLR corrections with the non-gyrotropic heat flux vectors $\boldsymbol{S}_{\bot }^{\Vert }$ , $\boldsymbol{S}_{\bot }^{\bot }$ , that are here defined as FLR3. The precision of various FLR corrections can be compared nicely by considering the perpendicular fast mode in the long-wavelength limit. The comparison is especially meaningful, when written in the notation of Del Sarto, Pegoraro & Tenerani (Reference Del Sarto, Pegoraro and Tenerani2017), see (5.119)–(5.121). We proceed by showing that the FLR3 corrections (with the second-order non-gyrotropic heat flux vectors) indeed recover the fully kinetic dispersion relation for the fast mode in the long-wavelength (low-frequency) limit, a result reported by Mikhailovskii & Smolyakov (Reference Mikhailovskii and Smolyakov1985). Instead of expanding the pressure tensor equation, one can derive very precise linear FLR corrections from linear kinetic theory, which is not addressed here, and the reader is referred to papers by Passot & Sulem (Reference Passot and Sulem2007) and Sulem & Passot (Reference Sulem and Passot2015) and references therein.
In § 6, we investigate the parallel and oblique firehose instability. The FLR and Hall dispersive effects are crucial for the stabilization of the firehose instability at small scales, and a comprehensive discussion can be found in Hunana & Zank (Reference Hunana and Zank2017). That paper was essentially extracted from this guide (with many figures that we do not republish here), and an interested reader who wants to focus on the firehose instability can find further information there. Nevertheless, here we briefly investigate improvements that can be made by considering the FLR2 and FLR3 corrections, see figures 7–11. Importantly, we show that the non-gyrotropic heat flux vectors in the FLR3 model, partially reproduce the large ‘bump’ in the imaginary phase speed (growth rate normalized to the wavenumber), when the plasma is close to the long-wavelength limit ‘hard’ firehose threshold, see figure 7. The firehose instability in a fluid formalism was also investigated by Wang & Hau (Reference Wang and Hau2003, Reference Wang and Hau2010), Schekochihin et al. (Reference Schekochihin, Cowley, Rincon and Rosin2010) and Rosin et al. (Reference Rosin, Schekochihin, Rincon and Cowley2011) and references therein.
In § 7, we derive the heat flux tensor equation. The heat flux tensor is then decomposed into its gyrotropic and non-gyrotropic parts, $\boldsymbol{q}=\boldsymbol{q}^{\text{g}}+\boldsymbol{q}^{\text{ng}}$ . The procedure yields evolution equations for the gyrotropic parallel and perpendicular heat fluxes, $q_{\Vert }$ and $q_{\bot }$ , that contain the tensor of the fourth-order moment $\boldsymbol{r}$ . It is emphasized that, if one wants to keep the non-gyrotropic $\unicode[STIX]{x1D72B}$ contributions in the scalar heat flux equations, one needs to keep the non-gyrotropic contributions of the fourth-order moment $\boldsymbol{r}^{\text{ng}}$ as well, since there are several possible cancellations even at the linear level. The non-gyrotropic heat flux $\boldsymbol{q}^{\text{ng}}$ can be further decomposed to the non-gyrotropic heat flux vectors $\boldsymbol{S}_{\bot }^{\Vert }$ , $\boldsymbol{S}_{\bot }^{\bot }$ and tensor $\unicode[STIX]{x1D748}$ . The detailed algebra of the non-gyrotropic heat flux vectors $\boldsymbol{S}_{\bot }^{\Vert }$ , $\boldsymbol{S}_{\bot }^{\bot }$ , i.e. how to express them through lower-order moments, is presented in appendix D. The first-order expressions are obtained at the nonlinear level and the second-order expressions at the linear level. We do not address how to decompose the tensor $\unicode[STIX]{x1D748}$ through lower-order moments. Such calculations require a complicated ‘inversion’ procedure for a third-rank tensor $(\hat{\boldsymbol{b}}\times \unicode[STIX]{x1D748})^{S}$ , and an advanced reader is referred to Ramos (Reference Ramos2005).
In § 8, we consider the fourth-order moment $\boldsymbol{r}$ which is a tensor of fourth rank, $r_{ijkl}$ . The moment is again decomposed to its gyrotropic and non-gyrotropic parts, $\boldsymbol{r}=\boldsymbol{r}^{\text{g}}+\boldsymbol{r}^{\text{ng}}$ and the gyrotropic part has three scalar components, $r_{\Vert \Vert }$ , $r_{\Vert \bot }$ and $r_{\bot \bot }$ . We show that for a bi-Maxwellian distribution function, the gyrotropic components can indeed be evaluated as $r_{\Vert \Vert }=3p_{\Vert }^{2}/\unicode[STIX]{x1D70C}$ , $r_{\Vert \bot }=p_{\Vert }p_{\bot }/\unicode[STIX]{x1D70C}$ and $r_{\bot \bot }=2p_{\bot }^{2}/\unicode[STIX]{x1D70C}$ . This constitutes a ‘normal’ closure, a name suggested by Chust & Belmont (Reference Chust and Belmont2006). By using a similar procedure to the one provided by Grad (Reference Grad1949) for dilute gases, it is possible to express the non-gyrotropic bi-Maxwellian $\boldsymbol{r}^{\text{ng}}$ through a combination of $\boldsymbol{p}^{g}$ and $\unicode[STIX]{x1D72B}$ , see e.g. Oraevskii et al. (Reference Oraevskii, Chodura and Feneberg1968).
In § 9, a dispersion relation of a fluid model closed with the bi-Maxwellian ‘normal’ closure is provided for generally oblique propagation, and we call this model second-order CGL (CGL2). We specifically focus on the mirror instability, since this simple fluid model (without any Landau damping) corrects the erroneous $1/6$ factor in the ‘hard’ mirror threshold found in the basic CGL description, a result also reported by Dzhalilov, Kuznetsov & Staude (Reference Dzhalilov, Kuznetsov and Staude2011). The mirror instability is not addressed to a higher level of sophistication in this guide. However, we note that capturing the mirror growth rate (when the threshold is crossed) sufficiently well requires Landau fluid models (Snyder et al. Reference Snyder, Hammett and Dorland1997) and the stabilization at small scales requires FLR corrections (Passot & Sulem Reference Passot and Sulem2007; Sulem & Passot Reference Sulem and Passot2015). We also provide the dispersion relation of the Hall-CGL2 fluid model. Finally, the CGL2 model can be simplified by considering slow-dynamics regime and constructing generalized isothermal closure that is called the ‘static’ closure, yielding the simplest fluid model that captures the correct mirror threshold (Constantinescu Reference Constantinescu2002; Chust & Belmont Reference Chust and Belmont2006; Passot et al. Reference Passot, Ruban and Sulem2006).
In § 10, we consider the bi-kappa distribution function. We show that the closure at the fourth-order moment is constructed by $r_{\Vert \Vert }=\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}3p_{\Vert }^{2}/\unicode[STIX]{x1D70C}$ , $r_{\Vert \bot }=\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}p_{\Vert }p_{\bot }/\unicode[STIX]{x1D70C}$ and $r_{\bot \bot }=\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}2p_{\bot }^{2}/\unicode[STIX]{x1D70C}$ , where the coefficient $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}=(\unicode[STIX]{x1D705}-3/2)/(\unicode[STIX]{x1D705}-5/2)$ , and the closure is valid for $\unicode[STIX]{x1D705}>5/2$ . We call this closure and the associated fluid model ‘BiKappa’, and we discuss its dispersion relations. Even though the linear modes are generally different in this fluid model than in the CGL2 fluid model, we show that the ‘hard’ thresholds for the parallel and oblique firehose instability, and for the highly oblique mirror instability, are not affected by and are independent of the $\unicode[STIX]{x1D705}$ value. The Hall-BiKappa fluid dispersion relation is also provided. We also provide the first-order non-gyrotropic heat flux vectors $\boldsymbol{S}_{\bot }^{\Vert }$ , $\boldsymbol{S}_{\bot }^{\bot }$ . We do not calculate the $\boldsymbol{r}^{\text{ng}}$ for the bi-kappa distribution, and therefore we do not provide the second-order non-gyrotropic heat flux vectors.
In § 11, we discuss the core differences between the usual fluid models and kinetic theory. Namely, we discuss why the usual fluid models do not contain collisionless damping mechanisms, such as Landau damping, regardless of the order to which the fluid hierarchy is developed. The effect of Landau damping is present in the collisionless Vlasov equation, and the crucial difference between the usual fluid hierarchy and kinetic calculations just lies in the technique of how the Vlasov equation is integrated over the velocity space. We introduce preliminary ideas as to how the Landau fluid closures will be constructed. For example, for closures performed at the fourth-order moment instead of the ‘normal’ closure, one needs to consider perturbations around this state, and prescribe $r_{\Vert \Vert }=3p_{\Vert }^{2}/\unicode[STIX]{x1D70C}+\widetilde{r}_{\Vert \Vert }$ , $r_{\Vert \bot }=p_{\Vert }p_{\bot }/\unicode[STIX]{x1D70C}+\widetilde{r}_{\Vert \bot }$ and $r_{\bot \bot }=2p_{\bot }^{2}/\unicode[STIX]{x1D70C}+\widetilde{r}_{\bot \bot }$ . The deviations $\widetilde{r}_{\Vert \Vert }$ , $\widetilde{r}_{\Vert \bot }$ , $\widetilde{r}_{\bot \bot }$ will be calculated from linear kinetic theory in Part 2, by performing Landau fluid closures.
In § 12, we derive the evolution equation for a general $n$ th-order fluid moment (a tensor with $3^{n}$ components). We then consider fluid models in the simplified one-dimensional (1-D) geometry that can be viewed as an electrostatic case (or as a propagation along the mean magnetic field), and that are closed at a general $n$ th-order level by a Maxwellian fluid closure. A dispersion relation is obtained, which for $n>4$ always yields some solutions that are unstable. It is therefore concluded that the last non-Landau fluid closure is the ‘normal’ closure and that for $n>4$ , Landau fluid closures are required. This surprising result, first reported in Hunana et al. (Reference Hunana, Zank, Laurenza, Tenerani, Webb, Goldstein, Velli and Adhikari2018), serves as motivation for Part 2, which is a detailed guide to Landau fluid closures.
2 Pressure tensor equation
Collisionless plasmas are described by the Vlasov equation, which in CGS units reads
and which describes how a distribution function $f_{r}(\boldsymbol{x},\boldsymbol{v},t)$ evolves in time. The $r$ is the index of species and $r=p$ for protons, $r=e$ for electrons, etc. The $q_{r}$ is the particle charge, $m_{r}$ the particle mass, $c$ the speed of light, $\boldsymbol{E}$ the electric field vector and $\boldsymbol{B}$ the magnetic field vector. The species index $r$ can sometimes be confusing in lengthy tensor calculations with multiple indices and for clarity we will often drop it and reintroduce it when required. To derive the fluid equations, we need to integrate (perform an averaging) at each spatial point over the ‘kinetic’ velocity $\boldsymbol{v}$ . It is important to realize that the distribution function just describes the probability of finding a particle with velocity $\boldsymbol{v}$ at the position $\boldsymbol{x},t$ and that the ‘kinetic’ velocity $\boldsymbol{v}$ entering the distribution function $f(\boldsymbol{x},\boldsymbol{v},t)$ is a completely independent variable from $\boldsymbol{x},t$ , i.e.
Also, the magnetic and electric fields $\boldsymbol{B}(\boldsymbol{x},t),\boldsymbol{E}(\boldsymbol{x},t)$ are macroscopic quantities that do not depend on $\boldsymbol{v}$ and can be moved outside of velocity integrals over $d^{3}v$ , or in another words $\unicode[STIX]{x2202}B_{i}/\unicode[STIX]{x2202}v_{j}=0$ and $\unicode[STIX]{x2202}E_{i}/\unicode[STIX]{x2202}v_{j}=0$ . The definitions of the fluid moments are
where we have omitted the tensor product notation that is sometimes written down explicitly as $\boldsymbol{u}\boldsymbol{u}=\boldsymbol{u}\otimes \boldsymbol{u}$ and in the index notation $(\boldsymbol{u}\boldsymbol{u})_{ij}=u_{i}u_{j}$ . If two vectors or tensors are next to each other without an operator between them, a tensor product is always assumed. The number density $n$ is a scalar, the fluid velocity $\boldsymbol{u}$ is a vector, the pressure tensor $\boldsymbol{p}$ is a tensor of second rank ( $3\times 3$ matrix), the heat flux tensor $\boldsymbol{q}$ is a tensor of third rank (with $3\times 3\times 3$ components), the tensor $\boldsymbol{r}$ is of fourth rank (with $3\times 3\times 3\times 3$ components), etc. Directly from the definitions, it is obvious that all fluid tensors must be symmetric in all of their indices, i.e. $p_{ij}=p_{ji}$ , $q_{ijk}=q_{ikj}=\cdots \,$ . The fluctuating velocity is defined as
and should not be confused with the speed of light $c$ .
The second important concept that is used in deriving the fluid hierarchy, is the use of the usual Gauss–Ostrogradsky (divergence) theorem. The divergence theorem is used in velocity space and, written in a form that is typically encountered when calculating the fluid hierarchy, it reads
The $\unicode[STIX]{x1D63C}$ is a general $n$ th-order tensor, $f$ is a distribution function, the left-hand side is a 3-D integral calculated over the entire velocity volume $V$ and the right-hand side is a surface integral calculated over a boundary of that volume $\text{d}\boldsymbol{S}=\hat{\boldsymbol{n}}\,\text{d}S$ , where $\hat{\boldsymbol{n}}$ is a unit normal vector to the local surface area pointing outwards. When such an integral is encountered in the fluid hierarchy, the result is always assumed to be zero. The volume integrals are from $v=-\infty$ to $v=\infty$ in each velocity component and the integration on the right-hand side of (2.9) is therefore performed over the velocity surface area at infinity. The necessary (but technically not sufficient) condition for the integral to be negligible is
Since the area $\text{d}S\sim v^{2}\,\text{d}v$ , the sufficient condition for the integral to vanish, can be estimated more precisely as
When calculating the fluid hierarchy, the encountered expressions are always $A(v)\sim v^{n}$ , where $n$ is a positive integer. Then, for example, for a Maxwellian distribution $f(v)\sim \text{e}^{-v^{2}}$ the limit (2.11) is always zero for all $n$ . Even for a slower converging distribution $f(v)\sim \text{e}^{-|v|}$ , the limit is always zero. For distribution functions proportional to a power law, for example $f(v)\sim (v^{2})^{-(\unicode[STIX]{x1D705}+1)}$ as for a kappa distribution, the situation is more restrictive, with some minimum required values of $\unicode[STIX]{x1D705}$ . When we will calculate the $n$ th-order fluid moment (§ 12), we will see that two surface integrals are encountered, one with $A(v)\sim v^{n}$ and one with $A(v)\sim v^{n+1}$ . Therefore, for a kappa distribution, the strict condition for $\lim _{v\rightarrow \infty }(v^{2})^{-(\unicode[STIX]{x1D705}+1)}v^{n+1}v^{2}=0$ yields the requirement $\unicode[STIX]{x1D705}>(n+1)/2$ . For example, many fluid models discussed in this guide are closed at the fourth-order moment $\boldsymbol{r}$ , so $n=4$ , which implies the requirement $\unicode[STIX]{x1D705}>5/2$ . At first sight, the required limit (2.11) can be considered only a technical mathematical detail since, physically, one can argue that particles with enormously large energies will not be measured/observed, and some physical mechanism that is responsible for a cutoff of the distribution function at finite energies can be usually assumed. Or in another words, even observational studies that fit the data with a kappa distribution do not assume that the fit is valid all the way up to $v\rightarrow \infty$ , and usually a cutoff is implicitly assumed. However, once we calculate the fourth-order moments for a kappa distribution, we will see that, for example $r_{\Vert \Vert }=\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}3p_{\Vert }^{2}/\unicode[STIX]{x1D70C}$ , where $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}=(\unicode[STIX]{x1D705}-3/2)/(\unicode[STIX]{x1D705}-5/2)$ , so the restriction $\unicode[STIX]{x1D705}>5/2$ returns, and has to be applied regardless. When the limit (2.11) is satisfied, the neglect of the surface integrals (2.9) is therefore based on solid theoretical principles, and it is not an approximate or an ad hoc choice, the surface integrals are really zero. Nevertheless, for complete clarity in the upcoming fluid hierarchy calculations, we will differentiate between expressions that are zero exactly, and between the surface integrals (2.9) that are zero asymptotically, by using $=0$ and $\rightarrow 0$ .
We start directly with the derivation of the pressure tensor equation, since a detailed derivation of the density and momentum equations can be found in many books. To derive the pressure tensor equation, it is possible to multiply the Vlasov equation by $mc_{i}c_{j}$ or $mv_{i}v_{j}$ (another possibility is $mc_{i}v_{j}$ ). Here, we will use the first choice, which is slightly easier to present in detail, because the second choice requires the use of density and momentum equations to cancel several terms. It is useful to derive the following identities
We write down the derivatives with respect to time $\unicode[STIX]{x2202}/\unicode[STIX]{x2202}t$ and velocity $\unicode[STIX]{x2202}/\unicode[STIX]{x2202}v_{i}$ explicitly, but we abbreviate the derivative with respect to spatial coordinates as $\unicode[STIX]{x2202}/\unicode[STIX]{x2202}x_{i}\equiv \unicode[STIX]{x2202}_{i}$ . We will need
where in the last identity we used the result $\unicode[STIX]{x1D716}_{klm}\unicode[STIX]{x1D6FF}_{lk}=0$ since the Levi-Civita tensor $\unicode[STIX]{x1D716}_{ijk}$ is antisymmetric and the Kronecker $\unicode[STIX]{x1D6FF}_{ij}$ is a symmetric tensor. Integrating the first term of the Vlasov equation yields
The second term of the Vlasov equation yields
To go back and forth between the index notation and the vector notation in a fully consistent matter, it is important to establish some conventions that were not required in simple fluid models. One important convention that is typically used is that the divergence of a tensor is meant to be through its first component, i.e. for a general tensor $\unicode[STIX]{x1D61F}_{ijk\ldots n}$ , the divergence $\unicode[STIX]{x1D735}\boldsymbol{\cdot }\unicode[STIX]{x1D653}$ means $\unicode[STIX]{x2202}_{i}\unicode[STIX]{x1D61F}_{ijk\ldots n}$ . This convention is the reason why in the above expression we used that $q_{ijk}=q_{kij}$ and also the ordering of $\boldsymbol{u}$ and $\boldsymbol{p}$ inside $\unicode[STIX]{x1D735}\boldsymbol{\cdot }(\boldsymbol{u}\boldsymbol{p})$ reflects that in the index notation we have $\unicode[STIX]{x2202}_{k}(u_{k}p_{ij})$ .
The third term of the Vlasov equation calculates
Finally, the fourth term of the Vlasov equation calculates
In the expression above, one encounters a vector product of a vector with a matrix, $\boldsymbol{B}\times \boldsymbol{p}$ . This operator might appear unusual at first, but it is just a generalization of a vector product of two vectors, it is defined as $(\boldsymbol{B}\times \boldsymbol{p})_{ij}=\unicode[STIX]{x1D716}_{ikl}B_{k}p_{lj}$ and the result is a matrix. Similarly, a vector product between a vector and a tensor of third rank is defined as $(\boldsymbol{B}\times \boldsymbol{q})_{ijk}=\unicode[STIX]{x1D716}_{ilm}B_{l}q_{mjk}$ and for a tensor of $n$ th rank $(\boldsymbol{B}\times \unicode[STIX]{x1D653})_{i\ldots jk}=\unicode[STIX]{x1D716}_{ilm}B_{l}\unicode[STIX]{x1D61F}_{m\ldots jk}$ (i.e. the second index in $\unicode[STIX]{x1D716}$ is the vector $\boldsymbol{B}$ index and the third index in $\unicode[STIX]{x1D716}$ is the first index of tensor $\unicode[STIX]{x1D653}$ ). However, a definition of a vector product where a matrix is applied on a vector reads $(\boldsymbol{p}\times \boldsymbol{B})_{ij}=\unicode[STIX]{x1D716}_{jkl}p_{ik}\boldsymbol{B}_{l}$ , and in general $\boldsymbol{p}\times \boldsymbol{B}=-(\boldsymbol{B}\times \boldsymbol{p}^{\text{T}})^{\text{T}}$ , but since the $\boldsymbol{p}$ is here symmetric, $\boldsymbol{p}\times \boldsymbol{B}=-(\boldsymbol{B}\times \boldsymbol{p})^{\text{T}}$ . The vector product (cross-product) is addressed further in appendix B. The term (2.24) is sometimes rewritten as
Combining all the results together $\unicode[STIX]{x2460}+\unicode[STIX]{x2461}+\unicode[STIX]{x2462}+\unicode[STIX]{x2463}=0$ , yields the entire pressure tensor equation as
One notices that, for example, the pressure tensor equation of Kulsrud (Reference Kulsrud, Galeev and Sudan1983), equation (57), contains a minus sign typo in the last term, which should be written as (2.25). That it is indeed a typo and not a definition of $\boldsymbol{p}\times \boldsymbol{B}$ is evident from his subsequent equation (59). It is useful to introduce a symmetric operator that acts on a matrix $\unicode[STIX]{x1D63C}$ according to
which yields a more compact form of the pressure tensor equation
2.1 Pressure tensor decomposition
By introducing a unit vector in the direction of the local magnetic field $\hat{\boldsymbol{b}}=\boldsymbol{B}/|\boldsymbol{B}|$ and the gyrofrequency $\unicode[STIX]{x1D6FA}=qB_{0}/(mc)$ , the last term in the pressure tensor equation (2.26) can be rewritten as
At very low frequencies $\unicode[STIX]{x1D714}\ll \unicode[STIX]{x1D6FA}$ (and at very long spatial scales) this term will dominate and must be equal to zero. The simplest possibility that makes this term to be equal to zero is $\boldsymbol{p}^{\text{iso}}=p\unicode[STIX]{x1D644}$ , where $p$ is the scalar pressure and $\unicode[STIX]{x1D644}$ is the unit matrix. In index notation $p_{ij}^{\text{iso}}=p\unicode[STIX]{x1D6FF}_{ij}$ , and we can verify that
There is however a much more general solution that makes the term (2.29) equal to zero, which is
where the ‘g’ stands for gyrotropic. For quick analytic calculations it is sometimes easier to use $\boldsymbol{p}^{\text{g}}=(p_{\Vert }-p_{\bot })\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}+p_{\bot }\unicode[STIX]{x1D644}$ since we have to deal with $\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}$ only once instead of twice. We need to verify that the term (2.29) indeed disappears,
It is easy to see that the parallel and perpendicular pressures can be extracted from the gyrotropic pressure tensor matrix by performing double contractions according to
The double contraction (the double dot product, represented by the colon) is a very useful operator that is frequently encountered in higher-order fluid models. For two matrices it is defined as $\unicode[STIX]{x1D63C}\boldsymbol{ : }\unicode[STIX]{x1D63D}=\unicode[STIX]{x1D608}_{ij}\unicode[STIX]{x1D609}_{ij}$ and yields a scalar. Sometimes, the double contraction is defined as $\unicode[STIX]{x1D63C}\boldsymbol{ : }\unicode[STIX]{x1D63D}=\unicode[STIX]{x1D608}_{ij}\unicode[STIX]{x1D609}_{ji}$ , which for symmetric matrices is equivalent to the previous definition. As a reminder, the usual matrix product of two matrices is $(\unicode[STIX]{x1D63C}\boldsymbol{\cdot }\unicode[STIX]{x1D63D})_{ij}=\unicode[STIX]{x1D608}_{ik}\unicode[STIX]{x1D609}_{kj}$ and yields a matrix. It is useful to write down the following identities that involve the double contraction,
Also interestingly, the trace of a matrix $\unicode[STIX]{x1D63C}$ can be expressed through the double contraction with $\unicode[STIX]{x1D644}$ , i.e. $\unicode[STIX]{x1D63C}:\unicode[STIX]{x1D644}=\unicode[STIX]{x1D608}_{ij}\unicode[STIX]{x1D6FF}_{ij}=\unicode[STIX]{x1D608}_{ii}=\text{Tr}(\unicode[STIX]{x1D63C})$ . The pressure decomposition (2.35) can now be verified easily. Even more revealing is to apply the decomposition (2.35) directly to the definition of the entire pressure tensor (2.5), which yields
where we have used the fact that the magnitude of the 3-D velocity vector can be decomposed as $|\boldsymbol{v}-\boldsymbol{u}|^{2}=|\boldsymbol{v}_{\bot }-\boldsymbol{u}_{\bot }|^{2}+(v_{\Vert }-u_{\Vert })^{2}$ , or equivalently $|\boldsymbol{c}|^{2}=|\boldsymbol{c}_{\bot }|^{2}+c_{\Vert }^{2}$ .
A distribution function is called isotropic when the direction of the velocity vector $\boldsymbol{v}$ does not matter, and the distribution function depends only on magnitude $|\boldsymbol{v}|$ , as for example the Maxwellian distribution $\text{e}^{-|\boldsymbol{v}|^{2}}$ . A distribution function is called gyrotropic when the direction of the perpendicular velocity vector $\boldsymbol{v}_{\bot }$ does not matter, and the function depends only on $|\boldsymbol{v}_{\bot }|$ , as for example the bi-Maxwellian distribution $\text{e}^{-\unicode[STIX]{x1D6FC}_{\Vert }v_{\Vert }^{2}}\text{e}^{-\unicode[STIX]{x1D6FC}_{\bot }|\boldsymbol{v}_{\bot }|^{2}}$ . In another words, the gyrotropic distribution function is isotropic only in its transverse velocity components. The same vocabulary is used for the fluid moments, and a fluid moment is called gyrotropic when it involves integrals over $|\boldsymbol{c}_{\bot }|^{2i}$ , $i=0,1,2,3\ldots \,$ . For the pressure tensor, there are only two possibilities, represented by the parallel and perpendicular pressures (2.41), (2.42).
The part of the pressure tensor extracted by the decomposition (2.35) is therefore called the gyrotropic pressure $\boldsymbol{p}^{\text{g}}$ , sometimes also called the CGL pressure $\boldsymbol{p}^{\text{CGL}}$ . The gyrotropic approximation is sufficient at very long spatial scales, where the Larmor radius ( $=$ the gyroradius) of particles gyrating around the magnetic field is small, and the non-gyrotropic contributions become negligible. However, at sufficiently small spatial scales comparable to the gyroradius, the non-gyrotropic pressure contributions become significant and we represent these through a tensor $\unicode[STIX]{x1D72B}$ , that is called the finite Larmor radius corrections to the (gyrotropic) pressure tensor, or sometimes non-gyrotropic or gyroviscous stress tensor. The entire pressure tensor is thus decomposed according to
and for clarity we write down the decomposition explicitly in matrix notation
In the momentum equation, the pressure tensor enters through its divergence, which is useful to break down into its components. Since $\unicode[STIX]{x1D735}\boldsymbol{\cdot }(\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})=\hat{\boldsymbol{b}}(\unicode[STIX]{x1D735}\boldsymbol{\cdot }\hat{\boldsymbol{b}})+(\hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D735})\hat{\boldsymbol{b}}$ , and $\unicode[STIX]{x1D735}\boldsymbol{\cdot }(p_{\bot }\unicode[STIX]{x1D644})=\unicode[STIX]{x1D735}p_{\bot }$ , the divergence of the pressure tensor calculates as
Importantly, we have seen in (2.41), (2.42) that the scalar parallel and perpendicular pressures are extracted by applying the double contractions to the entire pressure tensor (and not necessarily only at the gyrotropic part). The decomposition (2.43) can be rewritten as
Applying the double contraction $:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}$ to (2.46) yields the first requirement for the FLR tensor
Similarly, applying $:(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})/2$ to (2.46) yields the second requirement for the FLR tensor,
By further using the first requirement, the second requirement can be rewritten as
By using the pressure decomposition (2.43) in the pressure tensor equation (2.26), and using the fact that the gyrotropic part of pressure satisfies $\hat{\boldsymbol{b}}\times \boldsymbol{p}^{g}+(\hat{\boldsymbol{b}}\times \boldsymbol{p}^{g})^{\text{T}}=0$ , the pressure tensor equation can be rewritten as
We can now derive the time dependent equations for parallel and perpendicular pressures by applying the usual double contractions to this pressure tensor equation.
2.2 Parallel pressure equation
We calculate the double contraction with $\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}$ term by term. We will need the following identities
The first term calculates as
The second term calculates as
The third term calculates as
The fourth term calculates similarly as
and the final fifth term becomes
The entire equation for the parallel pressure therefore reads
Using the symmetric operator (2.27) and the definition for the convective derivative $\text{d}/\text{d}t\equiv \unicode[STIX]{x2202}/\unicode[STIX]{x2202}t+\boldsymbol{u}\boldsymbol{\cdot }\unicode[STIX]{x1D735}$ , the equation is written in a simple form as
which corresponds to (26) of Passot & Sulem (Reference Passot and Sulem2004), equation (9) of Goswami et al. (Reference Goswami, Passot and Sulem2005) (see also Oraevskii et al. (Reference Oraevskii, Chodura and Feneberg1968), Passot & Sulem (Reference Passot and Sulem2007) and Passot, Sulem & Hunana (Reference Passot, Sulem and Hunana2012)).
2.3 Perpendicular pressure equation
It is possible to either apply the double contraction $:(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})/2$ to the pressure tensor equation to directly obtain the equation for $\unicode[STIX]{x2202}p_{\bot }/\unicode[STIX]{x2202}t$ , or to apply the trace (the double contraction $:\unicode[STIX]{x1D644}$ ) to obtain the equation for $\unicode[STIX]{x2202}p_{\Vert }/\unicode[STIX]{x2202}t+2\unicode[STIX]{x2202}p_{\bot }/\unicode[STIX]{x2202}t$ , and then subtract the previously obtained equation for $\unicode[STIX]{x2202}p_{\Vert }/\unicode[STIX]{x2202}t$ . Both approaches are equivalent, since $p_{\bot }=(\text{Tr}(\boldsymbol{p})-p_{\Vert })/2$ . In another words, directly calculating the trace of the pressure tensor yields
where we have used that $\text{Tr}\,\unicode[STIX]{x1D72B}=\unicode[STIX]{x1D6F1}_{ii}=0$ . Applying the trace operator to the pressure tensor equation (2.50) term by term yields for the first term
for the second term
the third term
the fourth term is equivalent to the third term
even though we kept the last expression in transpose form so that we can use the symmetric operator after we combine all the terms. Finally the fifth term
Combining all the terms, using the parallel pressure equation (2.59) and dividing by 2 yields the time-dependent equation for the perpendicular pressure that reads
The result corresponds to, for example, equation (25) of Passot & Sulem (Reference Passot and Sulem2004), equation (8) of Goswami et al. (Reference Goswami, Passot and Sulem2005). It is important to note that the parallel and perpendicular pressure equations are exact equations. These equations are valid regardless of what kind of higher-order closure will be adopted later. It is useful to rewrite the term $\unicode[STIX]{x1D72B}:(\text{d}/\text{d}t)(\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})$ slightly so that we can directly use the induction equation and eliminate the time derivative. The term can be rewritten as
We will need the following identity for the time derivative of the unit vector (see later in the text),
which, when used in the above expression together with $\hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D72B}\boldsymbol{\cdot }\hat{\boldsymbol{b}}=0$ , allows us to write
which is an expression of general validity since no specific form of the induction equation was assumed yet. In the next section we will calculate the decomposition of the heat flux tensor $\boldsymbol{q}$ and we will show that, if only the gyrotropic heat flux components $q_{\Vert },q_{\bot }$ are considered, the heat flux terms entering the pressure equations read
Therefore, by splitting the heat flux into gyrotropic and non-gyrotropic parts, $\boldsymbol{q}=\boldsymbol{q}^{\text{g}}+\boldsymbol{q}^{\text{ng}}$ , the pressure equations read
Terms with the non-gyrotropic heat flux $\boldsymbol{q}^{\text{ng}}$ can be further split into non-gyrotropic heat flux vectors $\boldsymbol{S}_{\bot }^{\Vert }$ , $\boldsymbol{S}_{\bot }^{\bot }$ and the heat flux tensor $\unicode[STIX]{x1D748}$ , which is addressed in appendix D, see (D 124), (D 126). Terms containing the FLR pressure tensor $\unicode[STIX]{x1D72B}$ are usually called the FLR stress forces, and these forces can generate complicated plasma heating processes. The FLR stress forces can generate both parallel and perpendicular plasma heating that can be determined only from fully nonlinear numerical simulations, since all the terms disappear at the linear level. This process is also sometimes called ‘stochastic heating’. The importance of stochastic heating was shown by Laveder, Passot & Sulem (Reference Laveder, Passot and Sulem2013), who performed 1.5-dimensional numerical simulations with the sophisticated FLR Landau fluid model of Passot & Sulem (Reference Passot and Sulem2007), Passot et al. (Reference Passot, Sulem and Hunana2012).
2.4 On the paper by Chew et al. (Reference Chew, Goldberger and Low1956)
The very influential paper by Chew et al. (Reference Chew, Goldberger and Low1956) that is typically cited for the CGL equations (with zero heat fluxes) actually also derived the pressure equations when the gyrotropic heat fluxes are considered. It is important to emphasize that, even though the scalar pressures $p_{n}$ , $p_{s}$ in the notation of that paper are equal to the usual pressures $p_{\Vert }$ , $p_{\bot }$ , this is not the case for the heat fluxes. The gyrotropic heat flux tensor in that paper is decomposed into components $q_{n}$ , $q_{s}$ according to (Chew et al. Reference Chew, Goldberger and Low1956, equation (34))
In contrast, the usual gyrotropic heat flux decomposition to $q_{\Vert }$ , $q_{\bot }$ reads (see later in the text)
i.e. the last term is not present in their decomposition. However, this does not mean that their pressure equations (Chew et al. Reference Chew, Goldberger and Low1956, equations (31), (32)) are incorrect. It only implies that their equations are written in terms of
By neglecting the FLR corrections $\unicode[STIX]{x1D72B}$ and $\boldsymbol{q}^{\text{ng}}$ , the parallel and perpendicular pressure equations (2.74), (2.75) simplify to
By using (2.78), the heat flux contributions in the parallel pressure equation above can be rewritten as
which agrees with the parallel pressure equation obtained by Chew et al. (Reference Chew, Goldberger and Low1956), equation (31). The pressure equations of Chew et al. (Reference Chew, Goldberger and Low1956) are therefore correct, but it has to be remembered that $q_{n}$ is not the usual parallel heat flux $q_{\Vert }=m\!\int (v_{\Vert }-u_{\Vert })^{3}f\,\text{d}^{3}v$ .Footnote 1 Actually, the authors do not even call the quantities $q_{n},q_{s}$ heat flux components, but components of a ‘pressure-transport’ tensor.
To conclude this subsection, we will go slightly ahead and use (3.51), (3.52) (without the zero right-hand sides) that are the usual CGL pressure equations and valid only at long spatial scales when the Hall term in the induction equation is neglected. Under this assumption, the pressure equations (2.79), (2.80) are rewritten as
It is often forgotten that the paper by Chew et al. (Reference Chew, Goldberger and Low1956) derived the correct pressure equations with gyrotropic heat flux contributions. The authors, however, did not write the equations in the form (2.82), (2.83), and left them in the form (2.79), (2.80), using the ‘conservative’ form only after prescribing the closure $q_{\Vert }=q_{\bot }=0$ (their equations (35), (36)),
Over time, equations (2.84), (2.85) became known as the CGL description. Equations (2.84), (2.85) are often replaced by the equations of state $p_{\Vert }|\boldsymbol{B}|^{2}/\unicode[STIX]{x1D70C}^{3}=\text{const.}$ and $p_{\bot }/(\unicode[STIX]{x1D70C}|\boldsymbol{B}|)=\text{const.}$ , see e.g. Mjølhus (Reference Mjølhus2009). Additionally, similarly to the definition of (appropriately normalized) entropy in MHD, $s=\ln (p/\unicode[STIX]{x1D70C}^{\unicode[STIX]{x1D6FE}})$ , it is useful to define parallel and perpendicular entropy in the CGL model, according to
The total entropy, $s_{\Vert }+s_{\bot }=\ln (p_{\Vert }^{1/3}p_{\bot }^{2/3}\unicode[STIX]{x1D70C}^{-5/3})$ , is equivalent to the MHD entropy when $p_{\Vert }=p_{\bot }$ , see for example (9)–(11) in Abraham-Shrauner (Reference Abraham-Shrauner1967).
2.5 Physical meaning of CGL equations
The exact derivation of the CGL equation (2.84), (2.85) will be completed in the next section. Here, we briefly discuss the physical meaning of these equations. The CGL equations (2.85), (2.84) can be interpreted as the conservation of the first and second adiabatic invariants, as is nicely discussed for example by Kulsrud (Reference Kulsrud, Galeev and Sudan1983) and by Gurnett & Bhattacharjee (Reference Gurnett and Bhattacharjee2005). Considering many particles with velocity components $v_{\Vert }$ and $v_{\bot }$ with respect to the mean magnetic field, the parallel and perpendicular pressures can be estimated as
where the brackets represent an average over all particles. To complete the estimate, we need expressions for $\langle v_{\Vert }^{2}\rangle$ and $\langle v_{\bot }^{2}\rangle$ , which come from the conservation of the adiabatic invariants. The first adiabatic invariant is the conservation of magnetic moment $\unicode[STIX]{x1D707}$ of a particle that is gyrating periodically around a mean magnetic field. The second adiabatic invariant, sometimes also called the longitudinal invariant, is associated with a particle bouncing periodically between two magnetic mirror points. The conserved quantity is the integral over the parallel momentum of a particle $J\equiv \oint _{a}^{b}mv_{\Vert }\,\text{d}l$ , where the magnetic mirrors are located at ‘ $a$ ’ and ‘ $b$ ’ and the integral is performed along the magnetic field line. The distance between the two magnetic mirrors at ‘ $a$ ’ and ‘ $b$ ’ can be labelled as $L$ . The conservation of the first and second adiabatic invariants is therefore
Since we are considering only non-relativistic particles, the particle mass ‘ $m$ ’ is also a constant. The conservation of magnetic moment $\unicode[STIX]{x1D707}$ therefore implies $v_{\bot }^{2}\sim B$ , and using this in (2.87) yields $p_{\bot }\sim B\unicode[STIX]{x1D70C}$ , recovering (2.85). The CGL equation (2.85) therefore corresponds to conservation of magnetic moment, i.e. the first adiabatic invariant. The conservation of the second adiabatic invariant $J$ is more tricky, since we need to somehow estimate the non-intuitive length $L$ between the two magnetic mirrors, and no magnetic mirrors are explicitly assumed, since we are just dealing with somewhat random magnetic field lines. The length $L$ can be nevertheless estimated from two fundamental physical principles. Consider a magnetic flux tube (a deformed cylinder) with cross-sectional area $A$ and length $L$ . The conservation of the total magnetic flux through the area $A$ implies $AB=\text{const.}$ , which yields an estimate for the area $A\sim 1/B$ . The second principle is the conservation of the total mass of particles that are completely trapped in that flux tube and that cannot escape, $mnAL=\text{const.}$ , yielding $L\sim 1/(\unicode[STIX]{x1D70C}A)$ . The use of $A\sim 1/B$ yields the final estimate for the non-intuitive length $L\sim B/\unicode[STIX]{x1D70C}$ . The conservation of the second adiabatic invariant $J$ therefore implies $v_{\Vert }\sim 1/L\sim \unicode[STIX]{x1D70C}/B$ . Using this result in (2.87) implies $p_{\Vert }\sim \unicode[STIX]{x1D70C}^{3}/B^{2}$ , recovering (2.84). The CGL equation (2.84) corresponds to conservation of the second adiabatic invariant $J$ , and the equation is valid for particles that are completely trapped and therefore cannot carry a heat flux. As we discuss later, the consideration of the Hall term, the heat flux and the FLR stress forces (the stochastic heating), leads to the breaking of these two adiabatic invariants. Nevertheless, exact conservation laws can still be derived, and the very simple CGL equations (2.84), (2.85) have to be modified with non-zero right-hand sides.
In Part 2 of our guide (see § 4.1), we show that the conservation of magnetic moment is very useful for understanding the form of the perturbed distribution function $f^{(1)}$ in the gyrotropic limit. By performing calculations in the laboratory reference frame, we show that to obtain the correct $f^{(1)}$ in the gyrotropic limit actually requires the usual complicated kinetic integration around the unperturbed orbit (see the Appendix of Part 2) and only then the gyrotropic limit can be imposed. In contrast, performing calculations in the guiding-centre reference frame allows one to prescribe the conservation of magnetic moment from the beginning, and obtain the correct $f^{(1)}$ in the gyrotropic limit perhaps more intuitively. Conservation of adiabatic invariants is important in many areas of space physics and astrophysics. For example, conservation of adiabatic invariants is used to construct models that describe particles trapped inside of magnetic islands and that are being accelerated during magnetic island contraction and merging (Drake, Swisdak & Fermo Reference Drake, Swisdak and Fermo2013; Zank et al. Reference Zank, le Roux, Webb, Dosch and Khabarova2014).
2.6 Exact equations of anisotropic multi-fluid models
Before we discuss solutions of specific fluid models, it is beneficial to summarize the most general equations of multi-fluid models that were derived with no simplifications at all. By reintroducing the species index $r$ , the equations that were directly obtained by integrating the collisionless Vlasov equation read
where the convective derivative $\text{d}_{r}/\text{d}t=\unicode[STIX]{x2202}/\unicode[STIX]{x2202}t+\boldsymbol{u}_{r}\boldsymbol{\cdot }\unicode[STIX]{x1D735}$ . In the above equations, the pressure tensor for each species was just decomposed into gyrotropic contributions $p_{\Vert r}$ , $p_{\bot r}$ and the rest of the pressure tensor (the FLR pressure tensor $\unicode[STIX]{x1D72B}_{r}$ ), according to
which is a rigorous decomposition not introducing any simplifications. The heat flux $\boldsymbol{q}_{r}$ contributions are here left at the most general level without any simplifications either. The equations are accompanied by Maxwell’s equations, that for now we write down here with no simplifications to emphasize that no Maxwell equations were used in deriving the above system,
Equations (2.89)–(2.95) represent an exact (even though not closed) kinetic Vlasov–Maxwell system formulated in fluid variables, and the description is valid for any general distribution function and for all considered spatial and temporal scales. All advanced collisionless multi-fluid models have to be based in one way or another on these equations. One can concentrate on the proton dynamics or one can concentrate on the electron dynamics and simplify these equations accordingly for these spatial scales. Naturally, one can consider complicated multi-fluid descriptions composed of many particle species.
For a new reader who just jumped straight to this section, the symmetric operator ‘ $S$ ’ acts on a matrix $\unicode[STIX]{x1D608}_{ij}$ according to $\unicode[STIX]{x1D608}_{ij}^{S}=\unicode[STIX]{x1D608}_{ij}+\unicode[STIX]{x1D608}_{ji}$ . If one does not like the symmetric operator in the expressions above, the symmetric operator is actually not necessary, since
where we also got rid of the double contractions. Note that $\unicode[STIX]{x1D6F1}_{ik}=\unicode[STIX]{x1D6F1}_{ki}$ and also $p_{ik}=p_{ki}$ .
Equations (2.89)–(2.92) contain two very important catches – the entire heat flux tensor $\boldsymbol{q}_{r}$ and the non-gyrotropic pressure tensor $\unicode[STIX]{x1D72B}_{r}$ are not specified yet. In this moment, the entire class of collisionless fluid models separates to many possible classes and sub-classes, depending on how precisely one wants to evaluate the heat flux tensor $\boldsymbol{q}_{r}$ and the FLR pressure tensor $\unicode[STIX]{x1D72B}_{r}$ . There are several complications how to correctly model these two quantities, but before we discuss this, we want to point out that by using the simple general identities (3.46), (3.47) (that are nothing more than calculating derivatives and using the density equation for each species separately), the above pressure equations can be rewritten to the following form
Again, no simplifications were introduced and no Maxwell equations were used yet. The equations are exact. The equations represent the complicated plasma heating processes that are responsible for anisotropic plasma heating, and that are in general very difficult to classify. The left-hand sides of (2.100)–(2.101) are the familiar CGL equations that represent conservations of the first and second adiabatic invariants, and all the expressions on the right-hand sides break these adiabatic invariants. On the right-hand side, we have the $\unicode[STIX]{x2202}\boldsymbol{B}/\unicode[STIX]{x2202}t$ that couples various species together through Maxwell’s equations, and that also introduces the Hall term responsible for the simplest dispersive effects. The heat flux tensor $\boldsymbol{q}_{r}$ (a tensor of third rank), is decomposed into its gyrotropic and non-gyrotropic (FLR) parts $\boldsymbol{q}_{r}=\boldsymbol{q}_{r}^{\text{g}}+\boldsymbol{q}_{r}^{\text{ng}}$ . At large scales, the gyrotropic heat flux can be viewed as a gateway for the simplest form of collisionless damping mechanism, known as Landau damping, that can be further separated to a ‘pure’ electrostatic Landau damping, and its magnetic analogue, the transit-time damping – see Part 2 of the text, subsection ‘Coulomb force and mirror force (Landau damping and transit time damping)’. Nevertheless, fluid models that contain the gyrotropic heat flux fluctuations, but that do not contain any Landau damping, can be constructed as well (see CGL2 model later in the text). The FLR pressure tensor $\unicode[STIX]{x1D72B}_{r}$ introduces further dispersive effects and therefore also modifies the collisionless damping rates, as well as the heat flux $\boldsymbol{q}_{r}$ modifies the FLR pressure corrections $\unicode[STIX]{x1D72B}_{r}$ . The quantities $\boldsymbol{q}_{r}$ and $\unicode[STIX]{x1D72B}_{r}$ are therefore generally coupled. Importantly, the FLR pressure tensor $\unicode[STIX]{x1D72B}_{r}$ introduces complicated turbulent plasma heating processes, referred to as stochastic heating. Since these FLR stress forces completely disappear at the linear level in the above pressure equations, they can be explored only with fully nonlinear numerical simulations ( $\unicode[STIX]{x1D72B}_{r}$ still enters at the linear level in the momentum equation and modifies the dispersion relations).
From a fluid perspective, there are several major difficulties how to correctly model the quantities $\unicode[STIX]{x1D72B}_{r}$ and $\boldsymbol{q}_{r}$ . The heat flux tensor $\boldsymbol{q}_{r}$ is described by an infinite hierarchy of higher-order fluid moments, and one needs to find an appropriate way, how to close the system. The FLR pressure tensor $\unicode[STIX]{x1D72B}_{r}$ is described by equations that are implicit, and the FLR corrections have to be typically expanded on temporal and spatial scales in order to obtain tractable expressions that can be used for numerical simulations. This expansion will naturally restrict the area of validity of such fluid models to those scales considered. For example, for the proton–electron plasma where the spatial and temporal scales are largely separated because of the mass ratio $m_{p}/m_{e}=1836$ , simple first-order FLR corrections expanded around the proton scales $k\unicode[STIX]{x1D70C}_{i},\unicode[STIX]{x1D714}/\unicode[STIX]{x1D6FA}_{p}$ will long lose their validity at the electron scales. The most complicated Landau fluid models that use linear kinetic theory to evaluate $\unicode[STIX]{x1D72B}$ and that contain the Bessel functions (Passot & Sulem Reference Passot and Sulem2007; Passot et al. Reference Passot, Sulem and Hunana2012; Sulem & Passot Reference Sulem and Passot2015), have technically no restriction for wavenumbers $k_{\bot }\unicode[STIX]{x1D70C}_{i}$ , but the first-order frequency restriction is there nevertheless.
2.7 Conservation of energy
The equations (2.100)–(2.101), that correctly split the entire plasma heating into parallel and perpendicular heating, are indeed quite complicated. To gain further insight, let us briefly consider a case in which we are not interested in parallel and perpendicular heating (whose net effect can possibly be zero), and we are only interested in the heating of the entire system. Such formulations are sometimes used to interpret plasma heating in fully kinetic simulations (Yang et al. Reference Yang, Matthaeus, Parashar, Haggerty, Roytershteyn, Daughton, Wan, Shi and Chen2017a ,Reference Yang, Matthaeus, Parashar, Wu, Wan, Shi, Chen, Roytershteyn and Daughton b ). By summing together the pressure equations (2.91), (2.92), we are interested in the evolution equation $\unicode[STIX]{x2202}(p_{\Vert }+2p_{\bot })/\unicode[STIX]{x2202}t$ , or in another words, we are interested in $\unicode[STIX]{x2202}\text{Tr}\,\boldsymbol{p}/\unicode[STIX]{x2202}t$ . Summing the equations together (actually the trace of the entire pressure tensor equation was calculated a few pages back, that is how the $p_{\bot }$ equation was derived) yields
Note that $\text{Tr}\,\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{q}_{r}=\unicode[STIX]{x1D735}\boldsymbol{\cdot }(\text{Tr}\,\boldsymbol{q}_{r})$ , and also $\text{Tr}\,\unicode[STIX]{x1D72B}_{r}=0$ . By integrating over the entire spatial volume of the plasma, one can define the internal energy (or thermal energy) for each particle species as
By considering a special case of periodic boundary conditions, i.e. typical numerical simulations of turbulence in a periodic box, one can use the Gauss–Ostrogradsky theorem (2.9), now in a spatial domain, and for periodic boundary conditions all divergence operators in (2.102) vanish, yielding
Equations (2.102), (2.104) are equivalent to (4) and (7) of Yang et al. (Reference Yang, Matthaeus, Parashar, Wu, Wan, Shi, Chen, Roytershteyn and Daughton2017b ). Here the equations are just written in such a way that one can explicitly see the pressure components $p_{\Vert },p_{\bot },\unicode[STIX]{x1D72B}$ , but the equations are equivalent. Following Yang et al. (Reference Yang, Matthaeus, Parashar, Wu, Wan, Shi, Chen, Roytershteyn and Daughton2017b ), one can also easily derive the evolution equations for the ‘kinetic’ energy and the electromagnetic energy
For the kinetic energy, by first applying $\boldsymbol{u}_{r}\boldsymbol{\cdot }$ to the momentum equation (2.90)
which further implies
and for the electromagnetic energy
Therefore, by integrating over the entire volume and assuming periodic boundary conditions, the total conservation of energy can be expressed as (Yang et al. Reference Yang, Matthaeus, Parashar, Wu, Wan, Shi, Chen, Roytershteyn and Daughton2017b )
which beautifully clarifies how the energy can be transferred. Nevertheless, we point out that it is exactly the splitting into the parallel and perpendicular heating that is so complicated, since even with periodic boundary conditions, only a few terms in the pressure equations (2.91), (2.92) vanish. Considering anisotropic heating, and splitting the internal (thermal) energy into parallel and perpendicular parts
the conservation of total energy can be expressed as
where
The anisotropic plasma heating is obviously a very complicated process.
3 CGL description
The CGL model is the simplest possible fluid model that incorporates anisotropic temperatures. In contrast to MHD, CGL contains two pressure equations. For very good MHD reviews, from the perspective of turbulence in the solar wind, and the use of MHD in modelling heliospheric and astrophysical plasmas, we recommend the reviews by Goldstein, Roberts & Matthaeus (Reference Goldstein, Roberts and Matthaeus1995), Tu & Marsch (Reference Tu and Marsch1995), Zank (Reference Zank1999), Zhou, Matthaeus & Dmitruk (Reference Zhou, Matthaeus and Dmitruk2004) and Bruno & Carbone (Reference Bruno and Carbone2013). The CGL model is obtained by using the parallel and perpendicular pressure equations (2.74), (2.75) and prescribing zero heat flux $q_{\Vert }=q_{\bot }=0$ , $\boldsymbol{q}^{\text{ng}}=0$ , and by neglecting the FLR pressure tensor $\unicode[STIX]{x1D72B}=0$ . The pressure equations greatly simplify to the form
By using the notation for the convective derivative $\text{d}/\text{d}t=\unicode[STIX]{x2202}/\unicode[STIX]{x2202}t+\boldsymbol{u}\boldsymbol{\cdot }\unicode[STIX]{x1D735}$ , the pressure equations can be rewritten as
Note that the CGL pressure equations are fully nonlinear and that the nonlinearities are actually of fourth order, i.e. CGL pressure equations are more nonlinear than the usual MHD pressure equation $\text{d}p/\text{d}t+\unicode[STIX]{x1D6FE}p\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{u}=0$ , where the nonlinearity is only of second order. At first sight, expressions $\hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\boldsymbol{u}\boldsymbol{\cdot }\hat{\boldsymbol{b}}$ might appear to be a little unusual when written without any brackets. From MHD, one is familiar with nonlinear expressions of the type $\boldsymbol{u}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\boldsymbol{u}$ , which are also usually written without any brackets and can be interpreted in two equivalent forms, either as $(\boldsymbol{u}\boldsymbol{\cdot }\unicode[STIX]{x1D735})\boldsymbol{u}$ (a scalar $\boldsymbol{u}\boldsymbol{\cdot }\unicode[STIX]{x1D735}$ applied on a vector $\boldsymbol{u}$ ; or $\boldsymbol{u}\boldsymbol{\cdot }(\unicode[STIX]{x1D735}\boldsymbol{u})$ (a vector $\boldsymbol{u}$ multiplied by a matrix $\unicode[STIX]{x1D735}\boldsymbol{u}$ ). In a similar fashion, the expression $\hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\boldsymbol{u}\boldsymbol{\cdot }\hat{\boldsymbol{b}}$ is meant to be interpreted as $(\hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\boldsymbol{u})\boldsymbol{\cdot }\hat{\boldsymbol{b}}$ or $\hat{\boldsymbol{b}}\boldsymbol{\cdot }(\unicode[STIX]{x1D735}\boldsymbol{u})\boldsymbol{\cdot }\hat{\boldsymbol{b}}$ ; but importantly, it is not meant to be interpreted as $\hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D735}(\boldsymbol{u}\boldsymbol{\cdot }\hat{\boldsymbol{b}})$ , because the derivative is meant to act only on $\boldsymbol{u}$ in this case. For direct numerical simulations, the fourth-order nonlinearity explicitly reads
and is of course a scalar. So far, we have not made any assumptions about particle species, and have only integrated the Vlasov equation and derived the density, momentum and scalar pressure equations, which can be done for each species separately. By re-introducing the species index $r$ , where $r=p$ for protons, $r=e$ for electrons etc., the n-fluid CGL-type equations become
The above CGL pressure equations are valid regardless of the form of the induction equation $\unicode[STIX]{x2202}\boldsymbol{B}/\unicode[STIX]{x2202}t$ and are therefore very general and valid for a wide range of CGL-type n-fluid models. It is advisable to keep them in this form when considering direct numerical simulations or solving linear dispersion relations. The above equations are accompanied by Maxwell’s equations. Specifically, (i) Gauss’s law $\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{E}=4\unicode[STIX]{x03C0}\unicode[STIX]{x1D70C}_{c}$ where the total charge density $\unicode[STIX]{x1D70C}_{c}=\sum _{r}q_{r}n_{r}$ , (ii) $\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{B}=0$ , (iii) Faraday’s induction equation $\unicode[STIX]{x1D735}\times \boldsymbol{E}=-(1/c)(\unicode[STIX]{x2202}\boldsymbol{B}/\unicode[STIX]{x2202}t)$ and (iv) Ampère’s law $\unicode[STIX]{x1D735}\times \boldsymbol{B}=(4\unicode[STIX]{x03C0}/c)\boldsymbol{j}+(1/c)(\unicode[STIX]{x2202}\boldsymbol{E}/\unicode[STIX]{x2202}t)$ , where the total current $\boldsymbol{j}=\sum _{r}q_{r}n_{r}\boldsymbol{u}_{r}$ . In many areas of space physics and astrophysics it is often very useful to eliminate very high frequency effects occurring at frequencies higher than the plasma frequency. This is achieved by prescribing local charge neutrality (so called quasi-neutrality since the plasma is still ionized) by $\unicode[STIX]{x1D70C}_{c}=0$ and by neglecting the second term in Ampère’s law $(1/c)(\unicode[STIX]{x2202}\boldsymbol{E}/\unicode[STIX]{x2202}t)$ known as Maxwell’s displacement current (or Maxwell’s correction to Ampère’s law). This eliminates high-frequency effects such as for example plasma oscillations, known as Langmuir waves. This assumption is also equivalent to assuming that the Alfvén speed is much less than the speed of light $V_{A}/c\ll 1$ . It is important to emphasize that Gauss’s law $\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{E}=4\unicode[STIX]{x03C0}\unicode[STIX]{x1D70C}_{c}$ is replaced by the quasi-neutrality condition $\unicode[STIX]{x1D70C}_{c}=0$ and no requirement is imposed on $\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{E}$ (other than that it is small). For further discussion concerning the charge neutrality and $\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{E}$ , see for example Braginskii (Reference Braginskii1965) page 264, Webb et al. (Reference Webb, McKenzie, Mace, Ko and Zank2007) equation (7) and Passot & Sulem (Reference Passot and Sulem2007) equation (A 1). Maxwell’s equations in this approximation simplify to
and this form of Maxwell’s equations is considered in most of the fluid models that we discuss, regardless of future complexities arising from FLR corrections, heat flux, Landau damping and so on. The exception is the section in Part 2 ‘Electron Landau damping of the Langmuir mode’, where the displacement current has to be retained. Here, we further focus only on a plasma consisting of protons and electrons, with charges $q_{p}=e$ and $q_{e}=-e$ , but models with more species can of course be considered. Charge neutrality implies that the proton and electron number densities are equal, i.e. $n_{p}=n_{e}=n$ . By using the definition of current $\boldsymbol{j}=en\boldsymbol{u}_{p}-en\boldsymbol{u}_{e}$ , the electron velocities are related to the proton velocities by
which after using (3.12) yields
By using the momentum equation (3.7) for electrons, the electric field can be expressed as
The last term represents the effects of electron inertia and is for example responsible for the effect that at small spatial scales the frequency $\unicode[STIX]{x1D714}$ of the parallel whistler mode converges to the electron cyclotron frequency $\unicode[STIX]{x1D6FA}_{e}$ instead of increasing to infinity. When considering only relatively low frequencies, it is very useful to neglect the electron inertia term.Footnote 2 This yields somewhat simpler analytic expressions, but most importantly, it provides a great benefit for direct numerical simulations since the time step is not restricted by the requirement to fully resolve the electron motion. This of course restricts the validity of a model to frequencies that are sufficiently smaller than the electron cyclotron frequency. By neglecting the electron inertia and using (3.14), the electric field can be expressed as
Using this expression in the proton momentum equation and in the induction equation eliminates the electric field from the entire system. The system of equations therefore reads
and is accompanied by the parallel and perpendicular pressure equations (3.8), (3.9) for the proton and electron species. In the induction equation (3.20), the first term is the familiar term from the MHD induction equation and considering only this first term yields the classical CGL model (where in addition the electrons are prescribed to be cold to eliminate $\boldsymbol{p}_{e}$ from the momentum equation). The second term in the induction equation represents the Hall term, and fluid models containing this term are generally referred to as Hall-CGL fluid models. The third term represents the electron pressure contributions and in CGL plasmas the term is in general non-zero, since
In contrast, by prescribing the electrons to be isotropic and isothermal (a typical assumption used in hybrid simulations) with pressure $\boldsymbol{p}_{e}=p_{e}\unicode[STIX]{x1D644}$ , $p_{e}=nT_{e}^{(0)}$ , so that $\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{p}_{e}=\unicode[STIX]{x1D735}p_{e}=T_{e}^{(0)}\unicode[STIX]{x1D735}n$ , eliminates the pressure term since $\unicode[STIX]{x1D735}\times [(1/n)\unicode[STIX]{x1D735}n]=\unicode[STIX]{x1D735}\times [\unicode[STIX]{x1D735}\ln (n)]=0$ . By prescribing the electrons to be isotropic but not isothermal, with pressure $\boldsymbol{p}_{e}=p_{e}\unicode[STIX]{x1D644}$ , the term initially does not disappear and it is equal to
The term only disappears after the equation of state is prescribed, for example for $p_{e}\sim n^{\unicode[STIX]{x1D6FE}}$ . The term (3.22) is sometimes called the ‘battery’ term (Biermann Reference Biermann1950; Kulsrud et al. Reference Kulsrud, Cen, Ostriker and Ryu1997; Khomenko et al. Reference Khomenko, Vitas, Collados and de Vicente2017), since it is argued that for any deviations from an ideal equation of state (caused for example by shocks), the term can produce magnetic field fluctuations even if there is no magnetic field initially.
3.1 Normalized equations and definitions
As with MHD and Hall-MHD, it is often useful to work with normalized equations. The density, velocity, magnetic field and pressure are normalized with respect to $\unicode[STIX]{x1D70C}_{0},u_{0},B_{0},p_{\Vert }^{(0)}$ . The length is normalized with respect to (for now) arbitrary $x_{0}$ and the time with respect to $t_{0}=x_{0}/u_{0}$ . The normalized quantities are then
Obviously $\widetilde{n}=\widetilde{\unicode[STIX]{x1D70C}}$ . After dropping the tilde, the normalization yields the same density and pressure equations, whereas the normalized momentum and induction equation become
where the Alfvén speed $V_{A}=B_{0}/\sqrt{4\unicode[STIX]{x03C0}\unicode[STIX]{x1D70C}_{0}}$ . The momentum equation implies that it is beneficial to choose the (so far unspecified) normalizing velocity $u_{0}$ to be the Alfvén speed $u_{0}=V_{A}$ . If the second and third terms are neglected in the induction equation (i.e. when the usual MHD induction equation is used), the CGL system is independent of length scale $x_{0}$ , similarly to the MHD system. If the Hall term is considered, it is beneficial to choose the (so far unspecified) normalizing length scale $x_{0}$ to be the ion inertial length
Therefore by choosing $x_{0}=d_{i}$ , the normalizations that we use in all fluid models in this paper read
The normalization also has the advantage that, when transferred to Fourier space, one obtains the dispersion relations for normalized wavenumber $\widetilde{k}$ and frequency $\widetilde{\unicode[STIX]{x1D714}}$ as
It is useful to define the parallel and perpendicular thermal speeds
and the definition of parallel and perpendicular temperatures are $T_{\Vert }=p_{\Vert }/n$ and $T_{\bot }=p_{\bot }/n$ . We use the usual convention with the Boltzmann constant $k_{B}=1$ .Footnote 3 The parallel and perpendicular plasma beta are defined according to
and later we will often use the abbreviation $a_{p}$ for the proton temperature anisotropy ratio
The normalized momentum and induction equations therefore read (tilde are dropped)
Another useful quantity that we will use later in the text is the proton Larmor radius (the gyroradius) $\unicode[STIX]{x1D70C}_{i}$ , defined according to
We note that normalizations do not have to be done with respect to mean values, and normalizations with respect to fluctuating (turbulent) quantities are used for example in the nearly incompressible models of Zank et al. (Reference Zank, Adhikari, Hunana, Shiota, Bruno and Telloni2017) and Zank & Matthaeus (Reference Zank and Matthaeus1993).
3.2 Classical CGL model with cold electrons
Here we want to consider the simplest possible CGL model for the proton species. We assume the electrons to be massless and cold ( $\boldsymbol{p}_{e}=0$ ), and we also neglect the Hall term in the induction equation. Since only proton species are present, for simplicity we can drop the proton index ‘ $p$ ’. Let us work in physical units for a moment. The classical CGL model with cold electrons therefore reads (written in physical units)
Similarly to the usual MHD model, the CGL equations are scale invariant and do not contain any information about the physical length scale. A length scale is introduced by considering the Hall term in the induction equation, and we discuss the Hall-CGL model in the next section. Also, the absence of the Hall term allows the CGL pressure equations to be rewritten into ‘conservative’ form as
where $\text{d}/\text{d}t$ is the convective derivative. This formulation is the most often cited form of the CGL pressure equations (often the $|\boldsymbol{B}|$ is abbreviated only as $B$ , which can lead to confusion on how to correctly calculate the derivatives). We want to show the equivalence of (3.41) and (3.38), (3.39). We need the following identity,
and the result is of course a scalar. Similarly for the spatial and convective derivative
Later we will need
Now we can directly calculate
and, by using the density equation $\text{d}\unicode[STIX]{x1D70C}/\text{d}t=-\unicode[STIX]{x1D70C}\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{u}$ and the identity (3.43), we obtain
The above equation is completely general since we did not use the induction equation so far and we will use this equation in the next section when we consider the Hall-CGL model. In a similar way one derives a completely general identity
Now, we use the induction equation
that allows us to calculate
and which, when used in (3.47)–(3.46) yields the final result
Equations (3.51)–(3.52) verify that the CGL pressure equations (3.41) written in the ‘conservative’ form and the directly derived CGL equations (3.38)–(3.39) are equivalent. We will see that the right-hand side of the usual CGL equations (3.41) is non-zero if the Hall term is considered, and that it is further modified by the heat flux contributions and by the FLR stress forces.
3.3 Linearized CGL equations
To obtain the dispersion relations, the equations need to be linearized and transformed to Fourier space. Equations (3.36)–(3.40) are linearized with respect to mean values $\unicode[STIX]{x1D70C}_{0};\boldsymbol{u}^{(0)}\equiv \langle \boldsymbol{u}\rangle =0;p_{\Vert \bot }^{(0)};\boldsymbol{B}_{0}=(0,0,B_{0});\hat{\boldsymbol{b}}_{0}=(0,0,1)$ where the magnetic field $B_{0}$ is assumed to be in the $z$ -direction. We assume that the mean value of the velocity is zero (the zero mean velocity value $\boldsymbol{u}^{(0)}=0$ should not be confused with the normalizing velocity as in (3.23), which is later chosen to be $V_{A}$ ). The terms $\hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\boldsymbol{u}\boldsymbol{\cdot }\hat{\boldsymbol{b}}$ are linearized as $\hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\boldsymbol{u}\boldsymbol{\cdot }\hat{\boldsymbol{b}}\overset{\text{lin}}{=}\unicode[STIX]{x2202}_{z}u_{z}$ and the linearized system reads
The divergence of the pressure tensor is
and the first step of linearization yields, by components,
which can be conveniently written in matrix notation as
Notice that, since $\unicode[STIX]{x2202}_{i}|\boldsymbol{B}|=\hat{\boldsymbol{b}}\boldsymbol{\cdot }(\unicode[STIX]{x2202}_{i}\boldsymbol{B})$ , the derivatives of unit vectors are calculated according to
and in the linear approximation one has
which by components reads
where, importantly, $\unicode[STIX]{x2202}_{z}\hat{b}_{z}\overset{\text{lin}}{=}0$ . The divergence of the pressure tensor in the liner approximation is therefore expressed as
where the second term contributes if the temperature anisotropy is present, i.e. when $p_{\Vert }^{(0)}\neq p_{\bot }^{(0)}$ . The vectors $(\unicode[STIX]{x1D735}\times \boldsymbol{B})\times \boldsymbol{B}_{0}$ and $\unicode[STIX]{x1D735}\times (\boldsymbol{u}\times \boldsymbol{B}_{0})$ are straightforward to calculate and are
The entire set of linearized CGL equations reads
Additionally, the 3 components of the induction equation are not independent, but satisfy the $\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{B}=0$ constraint. In order to find the linear dispersion relations, without any loss of generality, we consider wave propagation in the $x$ – $z$ plane and assume that there is no variation with respect to the $y$ -coordinate (i.e. one assumes that all expressions containing $\unicode[STIX]{x2202}_{y}$ are zero; alternatively, one can consider propagation in the $y$ – $z$ plane, where all expressions containing $\unicode[STIX]{x2202}_{x}$ would be zero). By exploring the above system, it is noteworthy that the density equation does not play any role in determining the dispersion relation, since no other equation contains the variable $\unicode[STIX]{x1D70C}$ .
3.4 Alfvén mode
On considering (3.68)–(3.74) written in the $x$ – $z$ plane with $\unicode[STIX]{x2202}_{y}=0$ , it is apparent that the equations for $u_{y}$ and $B_{y}$ decouple from the entire system and applying $\unicode[STIX]{x2202}/\unicode[STIX]{x2202}t$ to the equation for $u_{y}$ yields
which is a wave equation describing the propagation of (generally oblique) Alfvén waves in the CGL description. The same wave equation can be obtained for the component $B_{y}$ . For isotropic mean pressures $p_{\Vert }^{(0)}=p_{\bot }^{(0)}$ the equation is equivalent to the MHD description with the usual Alfvén speed $V_{A}=B_{0}/(4\unicode[STIX]{x03C0}\unicode[STIX]{x1D70C}_{0})^{1/2}$ . Since
where $a_{p}$ is the temperature ratio (3.32), the Alfvén wave equation can be rewritten as
Considering a wave propagating obliquely in the $x$ – $z$ plane with the wavevector $\boldsymbol{k}=(k_{\bot },0,k_{\Vert })$ at an angle $\unicode[STIX]{x1D703}$ with respect to the $z$ -direction (the $B_{0}$ direction), the parallel and perpendicular wavenumbers are defined as
and the transformation to Fourier space is performed according to (see appendix A)
The wave equation (3.77) is then
which yields the dispersion relation for the Alfvén waves
It is important to emphasize that, similarly to MHD, the Alfvén mode propagates in all the oblique directions, with a frequency $\unicode[STIX]{x1D714}$ that is proportional to the projection of the wavenumber $\boldsymbol{k}$ in the direction of $B_{0}$ , i.e. $k_{\Vert }=k\cos \unicode[STIX]{x1D703}$ . If the expression under the square root in (3.81) becomes negative, i.e. if $1+\unicode[STIX]{x1D6FD}_{\Vert }(a_{p}-1)/2<0$ , the frequency $\unicode[STIX]{x1D714}$ is imaginary and the Alfvén mode becomes unstable. This instability, that affects the obliquely propagating Alfvén mode, is known as the oblique firehose instability. The necessary (but not sufficient) condition for the instability to develop is $a_{p}=T_{\bot }^{(0)}/T_{\Vert }^{(0)}<1$ . The firehose instability criterion can be rewritten in many forms, the most common being
The first expression directly implies that, if $\unicode[STIX]{x1D6FD}_{\Vert }\leqslant 2$ , no firehose instability can exist. The firehose instability therefore exists only for a relatively high plasma beta $\unicode[STIX]{x1D6FD}_{\Vert }>2$ and only if the parallel temperature is higher than the perpendicular temperature. Importantly, the firehose instability criterion in the CGL model is equivalent to the one found from linear kinetic theory in the long-wavelength limit (for example, Rosenbluth (Reference Rosenbluth1956), Chandrasekhar, Kaufman & Watson (Reference Chandrasekhar, Kaufman and Watson1958), Parker (Reference Parker1958a ) and Vedenov & Sagdeev (Reference Vedenov and Sagdeev1958)). Alfvén wave propagation only affects components $u_{y}$ and $B_{y}$ which are decoupled from the rest of the system, and yields the other (eigenvector) components $u_{x},u_{z},B_{x},B_{z}$ as zero. The $u_{y}$ component does not enter the density equation and Alfvén mode fluctuations therefore do not produce any density fluctuations or pressure fluctuations. The Alfvén mode is therefore incompressible. It is also useful to define the magnetic compressibility $\unicode[STIX]{x1D712}_{b}(k_{\Vert },k_{\bot })$ in Fourier space, that measures the relative ratio of magnetic energy in the parallel component versus the total magnetic energy
Since the Alfvén mode produces only $B_{y}$ fluctuations, with $B_{x}=B_{z}=0$ , the magnetic compressibility for this mode is
for all the propagation directions. Sometimes the magnetic compressibility is defined as a ratio of parallel and perpendicular energies $|B_{z}|^{2}/(|B_{x}|^{2}+|B_{y}|^{2})$ , this quantity however has a slight disadvantage in that the values can become very large (and actually equal to infinity), which makes plotting of this quantity problematic. In contrast, the magnetic compressibility as defined in (3.83), has a nicely bounded range of values between 0 and 1. A similar quantity is defined for the ratio of the energy in velocity fluctuations
and the Alfvén mode has $\unicode[STIX]{x1D712}_{u}=0$ for all propagation directions, since only the perpendicular velocity component $u_{y}$ is non-zero.
3.5 Slow and fast modes
By applying $\unicode[STIX]{x2202}/\unicode[STIX]{x2202}t$ to the equations for $\unicode[STIX]{x2202}u_{x}/\unicode[STIX]{x2202}t$ and $\unicode[STIX]{x2202}u_{z}/\unicode[STIX]{x2202}t$ and using the evolution equations for $B_{x},B_{z},p_{\Vert },p_{\bot }$ one obtains
The easiest way to solve this coupled system is to transform to Fourier space, which yields
The system has non-trivial solutions only if the determinant of the matrix is zero, yielding the CGL dispersion relation at fourth order in frequency $\unicode[STIX]{x1D714}$ that contains the slow and fast modes
One can proceed in several ways with the above dispersion relation.
In the specific cases of strictly parallel ( $k_{\bot }=0$ ) and strictly perpendicular ( $k_{\Vert }=0$ ) propagation directions it is actually easier to work directly with the system (3.88), because the off-diagonal components in the matrix are zero and the dynamics in the $u_{x}$ and $u_{z}$ components decouples. For strictly parallel propagation, one solution is in the $u_{x}$ component, with the dispersion relation identical to the Alfvén wave (3.81). Since the magnetic and velocity fluctuations are only in the $u_{x}$ , $b_{x}$ components, the compressibility $\unicode[STIX]{x1D712}_{b}=0$ and $\unicode[STIX]{x1D712}_{u}=0$ for this mode. The second solution for parallel propagation is in the $u_{z}$ component, and it is called the sound mode, or the ion-acoustic mode, with dispersion relation $\unicode[STIX]{x1D714}=\pm k_{\Vert }\sqrt{3p_{\Vert }^{(0)}/\unicode[STIX]{x1D70C}_{0}}=\pm k_{\Vert }V_{A}\sqrt{3\unicode[STIX]{x1D6FD}_{\Vert }/2}$ . The parallel ion-acoustic mode has $\unicode[STIX]{x1D712}_{u}=1$ and $\unicode[STIX]{x1D712}_{b}=0$ . Considering strictly perpendicular propagation, the waves in the $u_{z}$ component vanish with the solution $\unicode[STIX]{x1D714}=0$ . The waves in the $u_{x}$ component are fast magnetosonic waves with dispersion relation $\unicode[STIX]{x1D714}=\pm k_{\bot }\sqrt{V_{A}^{2}+2p_{\bot }^{(0)}/\unicode[STIX]{x1D70C}_{0}}=\pm k_{\bot }V_{A}\sqrt{1+a_{p}\unicode[STIX]{x1D6FD}_{\Vert }}$ , and $\unicode[STIX]{x1D712}_{u}=0$ and $\unicode[STIX]{x1D712}_{b}=1$ . It is also possible to define the parallel and perpendicular sound speeds $C_{\Vert }=\sqrt{3p_{\Vert }^{(0)}/\unicode[STIX]{x1D70C}_{0}}$ ; $C_{\bot }=\sqrt{2p_{\bot }^{(0)}/\unicode[STIX]{x1D70C}_{0}}$ , which yields the parallel acoustic mode dispersion $\unicode[STIX]{x1D714}=\pm k_{\Vert }C_{\Vert }$ and the perpendicular fast mode dispersion $\unicode[STIX]{x1D714}=\pm k_{\bot }\sqrt{V_{A}^{2}+C_{\bot }^{2}}$ .Footnote 4
In the general case of oblique propagation, we choose to use the normalized frequencies and wavenumbers (3.29), and rewrite the dispersion relation as
It is sometimes useful to introduce the parallel Alfvén phase speed $v_{A\Vert }$ , and its normalization $\widetilde{v}_{A\Vert }\equiv v_{A\Vert }/V_{A}$ , so that
since $\widetilde{v}_{A\Vert }^{2}$ can be used to write the CGL dispersion relation in a shorter form, as done later in (3.193). The solution of the polynomial (3.90) is simply $\widetilde{\unicode[STIX]{x1D714}}^{2}=(A_{2}\pm \sqrt{A_{2}^{2}-4A_{0}})/2$ , and it is obvious that the coefficient $A_{2}$ is always positive, since both the plasma beta and the temperature anisotropy ratio must always be positive. It is also possible to show that the discriminant under the square root $A_{2}^{2}-4A_{0}\geqslant 0$ (proof shown a few lines below). The solution with the minus sign is called the slow mode and the solution with the plus sign is called the fast mode. For the slow mode, $\widetilde{\unicode[STIX]{x1D714}}^{2}$ becomes negative (and $\widetilde{\unicode[STIX]{x1D714}}$ complex) when $A_{0}<0$ , implying that the slow mode becomes unstable when
where we suppress the tildes on the wavenumbers since the above condition is valid for both $\widetilde{k}$ and $k$ forms of wavenumbers. The above condition implies that there are two competing instabilities. The highly parallel propagation with $k_{\Vert }\gg k_{\bot }$ yields an instability threshold equivalent to (3.82) and the associated instability is called the parallel firehose instability. The highly oblique propagation with $k_{\bot }\gg k_{\Vert }$ yields the mirror instability and the threshold in the CGL description readsFootnote 5
It is well documented that the mirror instability threshold in the CGL description is not equivalent to the one found in linear kinetic theory. In particular, the CGL mirror threshold contains a factor of 6 error (Abraham-Shrauner Reference Abraham-Shrauner1967; Tajiri Reference Tajiri1967; Kulsrud Reference Kulsrud, Galeev and Sudan1983; Ferrière & André Reference Ferrière and André2002), since the correct threshold obtained from kinetic theory reads
The threshold (3.93) implies that the necessary condition for the mirror instability to exist in the CGL description is $a_{p}=T_{\bot }^{(0)}/T_{\Vert }^{(0)}>6$ , whereas in kinetic theory the necessary condition is $a_{p}>1$ . A very good discussion about the physical mechanism of the mirror instability and the role of resonant particles can be found in Southwood & Kivelson (Reference Southwood and Kivelson1993) and Kivelson & Southwood (Reference Kivelson and Southwood1996). The correct mirror instability threshold is obtained when using the normal closure discussed later on in the text or, more simply, its quasi-static limit (Constantinescu Reference Constantinescu2002; Chust & Belmont Reference Chust and Belmont2006; Passot et al. Reference Passot, Ruban and Sulem2006). In the opposite case where ions are cold, a similar electron temperature anisotropy instability is obtained which is called the field-swelling instability, see e.g. Basu & Coppi (Reference Basu and Coppi1984).
Returning to the oblique dispersion relation, we show that indeed $A_{2}^{2}-4A_{0}\geqslant 0$ . By using (3.78) the wavenumber $k$ can be pulled out of the $A_{0},A_{2}$ coefficients and the dispersion relation can be rewritten in terms of a phase speed as
and (after staring for a sufficiently long time) it can be shown that
which is obviously always non-negative. The dispersion relation for the slow and fast waves in the CGL description can be written in the compact form
which is equivalent to the dispersion relation of Abraham-Shrauner (Reference Abraham-Shrauner1967), equation (30), here written in the convenient units of $\unicode[STIX]{x1D6FD}_{\Vert }$ and $a_{p}=T_{\bot }^{(0)}/T_{\Vert }^{(0)}$ (for a direct comparison with the units used in that paper, the relations are $\unicode[STIX]{x1D6FD}_{\Vert }=2S_{\Vert }$ and $a_{p}\unicode[STIX]{x1D6FD}_{\Vert }=2S_{\bot }$ ). The magnetic compressibility of the slow and fast modes can be calculated easily directly from the induction equations (3.72). The slow and fast modes generate only components $B_{x}$ and $B_{z}$ with the component $B_{y}=0$ . The transformation of (3.72) to Fourier space yields $-\unicode[STIX]{x1D714}B_{x}=B_{0}k_{\Vert }u_{x}$ , and $-\unicode[STIX]{x1D714}B_{z}=-B_{0}k_{\bot }u_{x}$ , and by applying the magnitude operator yields the magnetic compressibility
For quasi-parallel propagation angles the magnetic compressibility $\unicode[STIX]{x1D712}_{b}$ is therefore not a good indicator for mode recognition, since all three modes have $\unicode[STIX]{x1D712}_{b}$ close to zero. The magnetic compressibility is, however, an excellent tool for oblique propagation angles, because the Alfvén mode has $\unicode[STIX]{x1D712}_{b}=0$ regardless of the propagation direction, whereas for the slow and fast modes $\unicode[STIX]{x1D712}_{b}$ increases with $\unicode[STIX]{x1D703}$ towards $\unicode[STIX]{x1D712}_{b}=1$ . We will leave discussion about the CGL velocity eigenvector (the ratio $\unicode[STIX]{x1D712}_{u}$ ) to a later subsection since the discussion is a bit technical, and we instead focus now on an interesting effect that can be directly obtained by considering the CGL dispersion relations.
3.6 Slow mode can become faster than Alfvén mode
There is a very interesting phenomenon present in the CGL description that is also observed in the kinetic description but that is absent in the MHD description. For a sufficiently high plasma beta, the slow mode phase speed can be larger than the Alfvén mode phase speed. To obtain the critical plasma beta when this happens, one can write the dispersion relation for the Alfvén wave as $(\widetilde{\unicode[STIX]{x1D714}}/\widetilde{k})_{A}^{2}=(1-\unicode[STIX]{x1D6FD}_{\Vert }/2+a_{p}\unicode[STIX]{x1D6FD}_{\Vert }/2)\cos ^{2}\unicode[STIX]{x1D703}\equiv A_{A}$ and using this expression with the dispersion relation (3.95), the critical condition when the slow mode speed matches the Alfvén speed is $(A_{2}-\sqrt{A_{2}^{2}-4A_{0}})/2=A_{A}$ , and since $A_{2}^{2}-4A_{0}\geqslant 0$ , this further yields $A_{A}^{2}-A_{2}A_{A}+A_{0}=0$ . It takes quite a few algebraic operations to simplify the result, the easiest way is to collect terms with different powers of $\unicode[STIX]{x1D6FD}_{\Vert }$ and then simplify. A very valuable approach available now is to use mathematical software such as Maple or Mathematica for guidance, and for example the Maple command simplify(expr,trig) immediately simplifies the result, which can then be of course verified with pencil and paper. The result is
The expression is naturally satisfied for cases of strictly parallel and strictly perpendicular propagation. For $\unicode[STIX]{x1D703}=\unicode[STIX]{x03C0}/2$ , both the slow mode and the Alfvén mode have zero frequency. For $\unicode[STIX]{x1D703}=0$ , the slow mode speed can be only smaller or equal to the speed of the Alfvén mode (never higher), depending on another condition that will be easier to clarify later when specifically considering the parallel propagation limit. Now, on which side of the threshold (3.101) does the speed of the slow mode become faster than the speed of the Alfvén mode? Working with inequalities is a bit trickier. By assuming that both modes have real frequencies (and are not firehose or mirror unstable), the condition is derived to be
which agrees with the result obtained by Abraham-Shrauner (Reference Abraham-Shrauner1967), equation (42). The expression (3.102) reveals that there is a condition that can be satisfied for all directions of propagation and that reads
This is a quadratic equation in $\unicode[STIX]{x1D6FD}_{\Vert }$ which can be solved easily for a given $a_{p}$ (or solved for $a_{p}$ for a given $\unicode[STIX]{x1D6FD}_{\Vert }$ ). To visualize this equation, the threshold is plotted in figure 1 as a black dashed line. A particularly useful result is obtained for isotropic temperatures ( $a_{p}=1$ ). In this case the inequality (3.103) becomes $1-\unicode[STIX]{x1D6FD}_{\Vert }/2-\unicode[STIX]{x1D6FD}_{\Vert }^{2}5/4\leqslant 0$ and the critical plasma beta at which the slow mode speed becomes larger than the Alfvén mode speed is
Thus, this effect exists in the CGL description even when the temperatures are isotropic.
3.7 CGL parallel propagation
When numerically solving the CGL dispersion relations for finite (non-zero) propagation angles, the slow and fast mode expressions always get the correct ordering with $\unicode[STIX]{x1D714}_{s}\leqslant \unicode[STIX]{x1D714}_{f}$ . It is sometimes confusing about how the dispersion relations hold in the limit of parallel propagation $\unicode[STIX]{x1D703}\rightarrow 0$ which we will address here. In this limit, the dispersion relations (3.99) for the slow and fast modes can be directly evaluated as
Now, because $\sqrt{a^{2}}=|a|$ , for further calculation one needs to determine the sign of the expression under the square root. For $1+\frac{1}{2}a_{p}\unicode[STIX]{x1D6FD}_{\Vert }-2\unicode[STIX]{x1D6FD}_{\Vert }>0$ , i.e. when $\unicode[STIX]{x1D6FD}_{\Vert }<2/(4-a_{p})$ , the above equations yield
so the slow mode is the sound/acoustic mode and the fast mode coincides with the Alfvén mode. However, for $1+\frac{1}{2}a_{p}\unicode[STIX]{x1D6FD}_{\Vert }-2\unicode[STIX]{x1D6FD}_{\Vert }<0$ , i.e. when $\unicode[STIX]{x1D6FD}_{\Vert }>2/(4-a_{p})$ , the dispersion relations yield
so the slow mode coincides with the Alfvén mode and the fast mode is the sound/acoustic mode. In this region, the slow mode phase speed will always stay equal to the Alfvén mode phase speed, regardless of further increases of $\unicode[STIX]{x1D6FD}_{\Vert }$ . In the special case when $1+\frac{1}{2}a_{p}\unicode[STIX]{x1D6FD}_{\Vert }-2\unicode[STIX]{x1D6FD}_{\Vert }=0$ , which is equivalent to
all three modes propagate with the same phase speed $(\widetilde{\unicode[STIX]{x1D714}}/\widetilde{k})_{Asf}^{2}=\frac{3}{2}\unicode[STIX]{x1D6FD}_{\Vert }$ . The criterion (3.108) can be satisfied only for $\unicode[STIX]{x1D6FD}_{\Vert }\geqslant 1/2$ and in the limit $\unicode[STIX]{x1D6FD}_{\Vert }\rightarrow \infty$ the asymptote is $a_{p}=4$ , so the possible range of $a_{p}$ is $[0,4]$ . This curve is plotted in figure 1 as a green dotted line. The curve separates two regions. In the first region $\unicode[STIX]{x1D6FD}_{\Vert }<2/(4-a_{p})$ and $v_{s\Vert }<v_{A\Vert }$ . In the second region $\unicode[STIX]{x1D6FD}_{\Vert }>2/(4-a_{p})$ and $v_{s\Vert }=v_{A\Vert }$ . The interesting result to remember is that, for isotropic temperatures ( $a_{p}=1$ ) and parallel propagation, the critical plasma beta at which the slow mode speed matches the Alfvén mode speed is
3.8 CGL with $T_{\bot }=T_{\Vert }$ is not equivalent to MHD
It is important to emphasize that, even for isotropic temperatures $a_{p}=1$ , the CGL dispersion (3.99) is not equivalent to the MHD dispersion for the slow and fast waves. The MHD dispersion is usually written in the form (see (C 20) in appendix C)
or alternatively
which nicely shows that the expression under the square root is always greater or equal to zero. The MHD sound speed is defined as $C_{s}^{2}=\unicode[STIX]{x1D6FE}p^{(0)}/\unicode[STIX]{x1D70C}_{0}$ , where $\unicode[STIX]{x1D6FE}=5/3$ . Rewritten with the parameters we use here $C_{s}^{2}/V_{A}^{2}=\unicode[STIX]{x1D6FE}\unicode[STIX]{x1D6FD}_{\Vert }/2=5\unicode[STIX]{x1D6FD}_{\Vert }/6$ , the MHD dispersion for the slow and fast waves reads
or alternatively
In the CGL description with isotropic temperatures $(a_{p}=1)$ the dispersion relation (3.99) for slow and fast waves reads
or alternatively
We wrote down all major possibilities on purpose, since we wanted to clearly show that, regardless of the choice of $\unicode[STIX]{x1D6FD}_{\Vert }$ and $\unicode[STIX]{x1D6FE}$ , the MHD dispersion relation cannot match the CGL dispersion relation simultaneously for all propagation directions $\unicode[STIX]{x1D703}$ . One could even play with an abstract idea to make the MHD $\unicode[STIX]{x1D6FE}$ dependent on the angle $\unicode[STIX]{x1D703}$ , and possibly $\unicode[STIX]{x1D6FD}_{\Vert }$ . Even then, there is no obvious way to modify the MHD dispersion relation so that it behaves like the CGL model. Additionally, the CGL description leads to the development of pressure anisotropy even when the initial pressure is isotropic.
3.9 MHD velocity eigenvector
Before we proceed with the discussion of the CGL velocity eigenvector (which can appear to be a bit tricky), for the sake of clarity it is useful to consider the simpler MHD case first. By following the steps outlined above, it is easy to show that again the Alfvén mode dynamics in the $u_{y}$ decouples from the dynamics in the other directions and the velocity eigenvector matrix for the slow and fast waves in MHD is
or alternatively
This yields the MHD dispersion relations for the slow and fast modes in the form of (3.110) or alternatively (3.111). It possible to either express the ratio $u_{x}/u_{z}$ or $u_{z}/u_{x}$ . We want to calculate the velocity ratio $\unicode[STIX]{x1D712}_{u}$ and we choose the first possibility that allows us to express that for the slow and fast modes (since $u_{y}=0$ )
The ratio $u_{x}/u_{z}$ can be found for example from the second row of the matrix as
or equivalently from the first row of the matrix as
The possible range of the ratio $u_{x}/u_{z}$ is from $-\infty$ to $+\infty$ and the possible range for $\unicode[STIX]{x1D712}_{u}$ is from $0$ to $1$ . Solutions can be easily divided into two categories – depending on the behaviour for parallel propagation – and are plotted in figure 2. For the sake of clarity, we only consider propagation with positive wavenumbers $k_{\Vert }$ and $k_{\bot }$ , so the possible range of angle $\unicode[STIX]{x1D703}$ is from $0$ to $\unicode[STIX]{x03C0}/2$ and $\sin \unicode[STIX]{x1D703}\geqslant 0$ , $\cos \unicode[STIX]{x1D703}\geqslant 0$ . Consider first the case $C_{s}<V_{A}$ . For parallel propagation, the dispersion relation (3.111) directly yields that for the fast mode $(\unicode[STIX]{x1D714}/k)_{f}^{2}|_{\unicode[STIX]{x1D703}=0}=V_{A}^{2}$ and for the slow mode $(\unicode[STIX]{x1D714}/k)_{s}^{2}|_{\unicode[STIX]{x1D703}=0}=C_{s}^{2}$ . Using the expression (3.119) then yields that for the fast mode $(u_{x}/u_{z})_{f}|_{\unicode[STIX]{x1D703}=0}=+\infty$ and for the slow mode $(u_{x}/u_{z})_{s}|_{\unicode[STIX]{x1D703}=0}=0$ . This implies that for $C_{s}<V_{A}$ , all the $\unicode[STIX]{x1D712}_{u}(\unicode[STIX]{x1D703})$ curves start at
regardless of the actual value of $C_{s}/V_{A}<1$ . The second case $C_{s}>V_{A}$ is straightforward, since for parallel propagation everything is just turned around; the fast mode dispersion relation is $(\unicode[STIX]{x1D714}/k)_{f}^{2}|_{\unicode[STIX]{x1D703}=0}=C_{s}^{2}$ and the slow mode dispersion relation is $(\unicode[STIX]{x1D714}/k)_{s}^{2}|_{\unicode[STIX]{x1D703}=0}=V_{A}^{2}$ . Implying that for $C_{s}>V_{A}$ , all the $\unicode[STIX]{x1D712}_{u}(\unicode[STIX]{x1D703})$ curves start at
regardless of the actual value of $C_{s}/V_{A}>1$ . The perpendicular propagation limit is even easier since the split into two categories is not required and for both cases the dispersion (3.110) yields $(\unicode[STIX]{x1D714}/k)_{f}^{2}|_{\unicode[STIX]{x1D703}=\unicode[STIX]{x03C0}/2}=V_{A}^{2}+C_{s}^{2}$ and $(\unicode[STIX]{x1D714}/k)_{s}^{2}|_{\unicode[STIX]{x1D703}=\unicode[STIX]{x03C0}/2}=0$ , implying that for the fast mode $(u_{x}/u_{z})_{f}|_{\unicode[STIX]{x1D703}=\unicode[STIX]{x03C0}/2}=+\infty$ and that for the slow mode $(u_{x}/u_{z})_{s}|_{\unicode[STIX]{x1D703}=\unicode[STIX]{x03C0}/2}=0$ . All the $\unicode[STIX]{x1D712}_{u}(\unicode[STIX]{x1D703})$ curves therefore ‘end’ at
whether the ratio $C_{s}/V_{A}<1$ or $C_{s}/V_{A}>1$ . The behaviour of the $\unicode[STIX]{x1D712}_{u}(\unicode[STIX]{x1D703})$ curves is plotted in figure 2.
In MHD, the ratio $C_{s}^{2}/V_{A}^{2}$ is sometimes used as the definition of plasma beta, which we will call here $\unicode[STIX]{x1D6FD}_{\text{MHD}}\equiv C_{s}^{2}/V_{A}^{2}$ . As can be seen in figure 2, for $\unicode[STIX]{x1D6FD}_{\text{MHD}}$ approaching 1, the curves display a ‘singular’ behaviour, and it is difficult to guess what will happen when $\unicode[STIX]{x1D6FD}_{\text{MHD}}$ reaches the value of 1 exactly. This special case is addressed in the next subsection. There are also two special cases, one for $C_{s}\ll V_{A}$ or $\unicode[STIX]{x1D6FD}_{\text{MHD}}\ll 1$ and one for $C_{s}\gg V_{A}$ or $\unicode[STIX]{x1D6FD}_{\text{MHD}}\gg 1$ . Both limits $C_{s}\ll V_{A}$ and $C_{s}\gg V_{A}$ are easily obtained by rewriting the dispersion relation (3.110) to a form
In both limits the second term under the square root is small and the expression can be expanded as $\sqrt{1-\unicode[STIX]{x1D716}}=1-\unicode[STIX]{x1D716}/2$ , yielding the slow and fast mode dispersion relations
which further yield dispersion relations (3.126), (3.128). Note that the limit $C_{s}\ll V_{A}$ can be easily obtained directly from the velocity matrix (3.117) by neglecting the off-diagonal terms that decouple the two modes. In contrast, the limit $C_{s}\gg V_{A}$ is not that obvious and one needs to calculate the determinant.
In the limit $\unicode[STIX]{x1D6FD}_{\text{MHD}}\ll 1$ , the MHD dispersion relation simplifies to
The expression (3.119) yields for the fast mode $(u_{x}/u_{z})_{f}(\unicode[STIX]{x1D703})=\infty$ and the expression (3.120) yields for the slow mode $(u_{x}/u_{z})_{s}(\unicode[STIX]{x1D703})=0$ , implying
In the opposite limit $\unicode[STIX]{x1D6FD}_{\text{MHD}}\gg 1$ , the MHD dispersions simplify to
In this limit, the expression (3.119) yields $(u_{x}/u_{z})_{f}(\unicode[STIX]{x1D703})=(1+V_{A}^{2}/C_{s}^{2})\tan \unicode[STIX]{x1D703}$ , that describes the behaviour of the fast mode for large beta values and the complete limit is $(u_{x}/u_{z})_{f}(\unicode[STIX]{x1D703})=\tan \unicode[STIX]{x1D703}$ . For the slow mode the expression (3.119) yields $(u_{x}/u_{z})_{s}(\unicode[STIX]{x1D703})=-(1-V_{A}^{2}/C_{s}^{2})\cot \unicode[STIX]{x1D703}$ and the complete limit is $(u_{x}/u_{z})_{s}(\unicode[STIX]{x1D703})=-\cot \unicode[STIX]{x1D703}$ . The $\unicode[STIX]{x1D712}_{u}(\unicode[STIX]{x1D703})$ solutions for large beta values therefore converge to
and these two curves are plotted in figure 2 as dashed lines.
3.9.1 MHD special case $V_{A}=C_{s}$
Consider the very peculiar case of parallel propagation $\unicode[STIX]{x1D703}=0$ with $V_{A}=C_{s}$ . The velocity eigenvector matrix (3.117), when evaluated exactly at these values reads
It is not possible to determine the ratio $u_{x}/u_{z}$ and the eigenvector must be found by taking the limit $\unicode[STIX]{x1D703}\rightarrow 0$ . For $V_{A}=C_{s}$ the dispersion relation for the slow and fast modes (3.111) simplifies to $(\unicode[STIX]{x1D714}/k)^{2}=V_{A}^{2}\pm V_{A}^{2}|\sin \unicode[STIX]{x1D703}|$ . As before, for the sake of simplicity we consider only positive wavenumbers, so the propagation angle $\unicode[STIX]{x1D703}$ is in the range $[0,\unicode[STIX]{x03C0}/2]$ and $\sin \unicode[STIX]{x1D703}\geqslant 0$ , $\cos \unicode[STIX]{x1D703}\geqslant 0$ . Evaluating the velocity matrix (3.117) at $V_{A}=C_{s}$ only, it is possible to pull the $\sin \unicode[STIX]{x1D703}$ out of the system so that
which nicely demonstrates that for $\unicode[STIX]{x1D703}\rightarrow 0$ the system approaches (3.130). Nevertheless, we can now calculate the velocity ratio (for example from the second row) as
which can be evaluated at $\unicode[STIX]{x1D703}=0$ . For the fast mode $(u_{x}/u_{z})^{f}(0)=1$ and for the slow mode $(u_{x}/u_{z})^{s}(0)=-1$ . This implies that for the peculiar case of $V_{A}=C_{s}$ , the velocity ratio $\unicode[STIX]{x1D712}_{u}^{sf}(0)=1/2$ for both the fast and slow modes. The expression (3.132) can of course be directly obtained by using the result $(\unicode[STIX]{x1D714}/k)^{2}=V_{A}^{2}\pm V_{A}^{2}\sin \unicode[STIX]{x1D703}$ in the expression (3.119) and by cancelling the $\sin \unicode[STIX]{x1D703}$ in the nominator and denominator. The expression (3.132) is valid for any angle $\unicode[STIX]{x1D703}$ . However, for the slow mode it becomes singular at $\unicode[STIX]{x1D703}=\unicode[STIX]{x03C0}/2$ , and has to be rearranged by multiplying and dividing with $1+\sin \unicode[STIX]{x1D703}$ (or equivalently, use the first row of the matrix). To have useful expression for the entire range of $\unicode[STIX]{x1D703}$ , we therefore split the expressions for the slow and fast modes as
which yields, in this peculiar case of $V_{A}=C_{s}$ , that the ratio $\unicode[STIX]{x1D712}_{u}$ for the slow and fast modes can be expressed as
These solutions are plotted in figure 2 as dotted lines.
3.10 CGL velocity eigenvector
To find the ratio for velocity components $\unicode[STIX]{x1D712}_{u}$ in the CGL description, it is useful for a moment to work with abbreviated symbols, and we write the matrix (3.88) for the $u_{x}$ and $u_{z}$ components as
where the coefficients are
The determinant must be zero, which implies $(\widetilde{\unicode[STIX]{x1D714}}/\widetilde{k})^{4}-(\widetilde{\unicode[STIX]{x1D714}}/\widetilde{k})^{2}(a+c)+ac-b^{2}=0$ , and has a solution
The last step was actually used in obtaining (3.98) which is otherwise quite difficult to see. Using this result in the above matrix, the eigenvector matrix becomes
where
The ratio of $u_{x}/u_{z}$ can be expressed (for example from the second row) as
or alternatively (from the first row) as
The behaviour of these solutions is plotted in figure 3, where we plot solutions for isotropic temperatures $a_{p}=1$ . Similarly to MHD, two classes of solutions can be characterized based upon the behaviour for parallel propagation $\unicode[STIX]{x1D703}=0$ , depending if the quantity $a-c$ (3.139) evaluated at $\unicode[STIX]{x1D703}=0$ is positive or negative. The quantity $(a-c)|_{\unicode[STIX]{x1D703}=0}=1+\frac{1}{2}a_{p}\unicode[STIX]{x1D6FD}_{\Vert }-2\unicode[STIX]{x1D6FD}_{\Vert }$ was discussed in the section for parallel propagation and this quantity represents the green dotted curve in figure 1 that separates the system into two regions. For $(a-c)|_{\unicode[STIX]{x1D703}=0}>0$ , i.e. in the region $\unicode[STIX]{x1D6FD}_{\Vert }<2/(4-a_{p})$ , for the slow mode (use (3.141)) the ratio $(u_{x}/u_{z})^{s}(0)=0$ and for the fast mode (use (3.140)) the ratio $(u_{x}/u_{z})^{f}(0)=+\infty$ . In this region, all the $\unicode[STIX]{x1D712}_{u}(\unicode[STIX]{x1D703})$ curves start at
For $(a-c)|_{\unicode[STIX]{x1D703}=0}<0$ , i.e. in the region $\unicode[STIX]{x1D6FD}_{\Vert }>2/(4-a_{p})$ , for the slow mode (use (3.140)) the ratio $(u_{x}/u_{z})^{s}(0)=+\infty$ and for the fast mode (use (3.141)) the ratio $(u_{x}/u_{z})^{f}(0)=0$ . In this region, all the $\unicode[STIX]{x1D712}_{u}(\unicode[STIX]{x1D703})$ curves start at
For perpendicular propagation $\unicode[STIX]{x1D703}=\unicode[STIX]{x03C0}/2$ the quantity $(a-c)|_{\unicode[STIX]{x1D703}=\unicode[STIX]{x03C0}/2}=1+a_{p}\unicode[STIX]{x1D6FD}_{\Vert }$ is always positive and the separation into two regions is not necessary. For the slow mode $(u_{x}/u_{z})^{s}(\unicode[STIX]{x03C0}/2)=0$ and for the fast mode $(u_{x}/u_{z})^{f}(\unicode[STIX]{x03C0}/2)=+\infty$ . All the $\unicode[STIX]{x1D712}_{u}(\unicode[STIX]{x1D703})$ curves, regardless of the value of $\unicode[STIX]{x1D6FD}_{\Vert }$ therefore end at
This summarizes the general behaviour of the solutions. However, as can be seen from figure 3, there is a special case when the parameters approach $\unicode[STIX]{x1D6FD}_{\Vert }=2/(4-a_{p})$ , i.e. when $(a-c)|_{\unicode[STIX]{x1D703}=0}=0$ , that will be discussed in the next subsection. One can again consider the two limiting cases when $\unicode[STIX]{x1D6FD}_{\Vert }$ is small or large. The $\unicode[STIX]{x1D6FD}_{\Vert }\ll 1$ case yields dispersion relationsFootnote 6
Using expression (3.141) for the slow mode yields $(u_{x}/u_{z})^{s}(\unicode[STIX]{x1D703})=0$ and expression (3.140) for the fast mode yields $(u_{x}/u_{z})^{f}(\unicode[STIX]{x1D703})=\infty$ , implying that, in low beta limit, the $\unicode[STIX]{x1D712}_{u}(\unicode[STIX]{x1D703})$ solutions are
and the constant solutions approaching this limit are visible in the plot of figure 3. The opposite $\unicode[STIX]{x1D6FD}_{\Vert }\gg 1$ limit in the CGL description is not particularly revealing since there is no obvious way to simplify the expansion, even for isotropic temperatures. For $a_{p}=1$ , the $\unicode[STIX]{x1D6FD}_{\Vert }\gg 1$ expansion yields
where we neglected a term proportional to $1/\unicode[STIX]{x1D6FD}_{\Vert }$ . The expression results in the correct parallel limit $(\widetilde{\unicode[STIX]{x1D714}}/\widetilde{k})_{f}^{2}(0)=\frac{3}{2}\unicode[STIX]{x1D6FD}_{\Vert }$ , $(\widetilde{\unicode[STIX]{x1D714}}/\widetilde{k})_{s}^{2}(0)=1$ and the correct perpendicular limit $(\widetilde{\unicode[STIX]{x1D714}}/\widetilde{k})_{f}^{2}(\unicode[STIX]{x03C0}/2)=1+\unicode[STIX]{x1D6FD}_{\Vert }$ , $(\widetilde{\unicode[STIX]{x1D714}}/\widetilde{k})_{s}^{2}(\unicode[STIX]{x03C0}/2)=0$ . If the perpendicular fast mode limit is sufficient to be $(\widetilde{\unicode[STIX]{x1D714}}/\widetilde{k})_{f}^{2}(\unicode[STIX]{x03C0}/2)=\unicode[STIX]{x1D6FD}_{\Vert }$ (which is acceptable since $\unicode[STIX]{x1D6FD}_{\Vert }\gg 1$ ), only the terms proportional to $\unicode[STIX]{x1D6FD}_{\Vert }$ can be retained and the fast mode dispersion relation can be simplified to
The expression (3.140) yields the velocity ratio of the fast mode as
that is independent of $\unicode[STIX]{x1D6FD}_{\Vert }$ . For the slow mode wave the dispersion relation cannot be further simplified because if we collect only terms proportional to $\unicode[STIX]{x1D6FD}_{\Vert }$ , the resulting expression will be non-zero everywhere except exactly at $\unicode[STIX]{x1D703}=0$ (and also $\unicode[STIX]{x1D703}=\unicode[STIX]{x03C0}/2$ ). For $\unicode[STIX]{x1D703}=0$ , the entire contribution of $(\widetilde{\unicode[STIX]{x1D714}}/\widetilde{k})_{s}^{2}(0)=1$ comes from terms in (3.149) that are not proportional to $\unicode[STIX]{x1D6FD}_{\Vert }$ . Nevertheless, the analytic velocity ratio (3.152) for the fast mode is sufficiently simple and to obtain analytic $\unicode[STIX]{x1D712}_{u}^{s}(\unicode[STIX]{x1D703})$ for the slow mode, it is actually possible to cheat a little by realizing that the slow and fast mode curves are always symmetric and that $\unicode[STIX]{x1D712}_{u}^{s}(\unicode[STIX]{x1D703})+\unicode[STIX]{x1D712}_{u}^{f}(\unicode[STIX]{x1D703})=1$ . Our final large beta results therefore are
Solutions are plotted in figure 3 as dashed lines and fit the solutions of the full dispersion relations with $\unicode[STIX]{x1D6FD}_{\Vert }=10^{3}$ very nicely.
3.10.1 CGL special case $\unicode[STIX]{x1D6FD}_{\Vert }=2/(4-T_{\bot }/T_{\Vert })$
In this peculiar case when all 3 modes propagate in the parallel direction with the same phase speed, one again faces complications when calculating the velocity eigenvector, as with the MHD peculiar case when $V_{A}=C_{s}$ . This CGL special case can be encountered only for $\unicode[STIX]{x1D6FD}_{\Vert }\geqslant 1/2$ because the temperature anisotropy must be $a_{p}\geqslant 0$ . We start with the velocity eigenvector matrix (3.138) and for the special case considered $1+\frac{1}{2}a_{p}\unicode[STIX]{x1D6FD}_{\Vert }-2\unicode[STIX]{x1D6FD}_{\Vert }=0$ , the expression (3.139) simplifies to $a-c=(4\unicode[STIX]{x1D6FD}_{\Vert }-1)\sin ^{2}\unicode[STIX]{x1D703}$ . One can already see that for strictly parallel propagation the entire matrix will be zero and it will not be possible to evaluate the eigenvector. Similarly to the MHD case, the $\sin \unicode[STIX]{x1D703}$ can be pulled out of the matrix or alternatively one can use (3.140), (3.141) and calculate the limit. For the special case considered here, the parameter $b=\frac{1}{2}(4\unicode[STIX]{x1D6FD}_{\Vert }-2)\sin \unicode[STIX]{x1D703}\cos \unicode[STIX]{x1D703}$ . Considering again for simplicity only positive wavenumbers, the expression (3.140) yields in this particular case
The expression can be evaluated at $\unicode[STIX]{x1D703}=0$ , which yields for the slow mode $(u_{x}/u_{z})^{s}(0)=-1$ and for the fast mode $(u_{x}/u_{z})^{f}(0)=+1$ , implying that both modes start at
For the perpendicular propagation the discussion is unchanged from the previous general case and yields $\unicode[STIX]{x1D712}_{u}^{s}(\unicode[STIX]{x03C0}/2)=1$ and $\unicode[STIX]{x1D712}_{u}^{f}(\unicode[STIX]{x03C0}/2)=0$ .
3.11 Empirical models with polytropic indices $\unicode[STIX]{x1D6FE}_{\Vert }$ , $\unicode[STIX]{x1D6FE}_{\bot }$
The differences between MHD and CGL can be further clarified with polytropic indices. The linearized MHD pressure equation reads
One can introduce the concept of polytropic indices $\unicode[STIX]{x1D6FE}_{\Vert }$ and $\unicode[STIX]{x1D6FE}_{\bot }$ to the CGL pressure equations (3.73), (3.74) and write
where for the CGL model $\unicode[STIX]{x1D6FE}_{\Vert }=3$ and $\unicode[STIX]{x1D6FE}_{\bot }=2$ .Footnote 7 The reasoning for such a construction is to enable a smooth transition between two extreme cases – the CGL model on one side, and the isothermal model $\unicode[STIX]{x1D6FE}_{\Vert }=1$ and $\unicode[STIX]{x1D6FE}_{\bot }=1$ on the other side. The important observation is that in the parallel pressure equation the $\unicode[STIX]{x1D6FE}_{\Vert }$ acts only on the parallel velocity component $u_{z}$ , and in the perpendicular pressure equation the $\unicode[STIX]{x1D6FE}_{\bot }$ acts only on the perpendicular velocity components $u_{x},u_{y}$ . In MHD, the $\unicode[STIX]{x1D6FE}$ acts on all three components $u_{x},u_{y},u_{z}$ . The values of polytropic indices are directly related to the number of degrees of freedom $i$ , through the well-known relation $\unicode[STIX]{x1D6FE}=(i+2)/i$ . In the MHD description the number of degrees of freedom is $i=3$ and so $\unicode[STIX]{x1D6FE}=5/3$ . In contrast, the CGL description can be interpreted as being composed of a (strongly coupled) mixture of one-dimensional and two-dimensional dynamics, where for the parallel direction $i=1$ and $\unicode[STIX]{x1D6FE}_{\Vert }=3$ and for the perpendicular direction $i=2$ and $\unicode[STIX]{x1D6FE}_{\bot }=2$ . The apparently subtle, but nevertheless fundamental difference between (3.157) and (3.158), (3.159) is the core of many differences between the MHD and CGL descriptions, such as for example the effect of the slow mode becoming faster than the Alfvén mode. The two systems remain different even if the mean CGL pressures (or temperatures) are prescribed to be isotropic $p_{\Vert }^{(0)}=p_{\bot }^{(0)}$ and equal to the mean MHD pressure $p^{(0)}$ . This means that the MHD does not match the CGL description even if the distribution function is isotropic, as for example Maxwellian.
We briefly consider models with polytropic indices $\unicode[STIX]{x1D6FE}_{\Vert },\unicode[STIX]{x1D6FE}_{\bot }$ that are based on the linearized pressure equations (3.158), (3.159). If these pressure equations are ‘un-linearized’, i.e. if one performs a ‘reverse engineering’ procedure to obtain nonlinear equations (a procedure that is possible to do because of the known guidance by the CGL model), the nonlinear equations read
which can be rewritten as
Empirical fluid models of this type were studied for example by Hau & Sonnerup (Reference Hau and Sonnerup1993) and Hau et al. (Reference Hau, Phan, Sonnerup and Paschmann1993) with the motivation that spacecraft observations in the Earth’s magnetosheath often show behaviour that cannot be explained by the CGL values of $\unicode[STIX]{x1D6FE}_{\Vert }=3,\unicode[STIX]{x1D6FE}_{\bot }=2$ or by the isothermal values of $\unicode[STIX]{x1D6FE}_{\Vert }=1,\unicode[STIX]{x1D6FE}_{\bot }=1$ , but can be fitted by the values that lie somewhere in between (and actually quite close to the isothermal case). For a direct comparison with the CGL model, the equations (3.162) can be rewritten to a form
or to an interesting alternative form
Obviously, for CGL values of $\unicode[STIX]{x1D6FE}_{\Vert }=3,\unicode[STIX]{x1D6FE}_{\bot }=2$ the right-hand sides disappear and one obtains the classical CGL expressions. The right-hand sides of the above expressions can be compared with equations of Chew et al. (Reference Chew, Goldberger and Low1956) equations (2.82), (2.83) which contain the gyrotropic heat flux contributions. Therefore, the deviations from the CGL values of $\unicode[STIX]{x1D6FE}_{\Vert }=3,\unicode[STIX]{x1D6FE}_{\bot }=2$ can be considered to offer a very simple empirical model (perhaps too simple) that shows how the heat flux contributions modify the CGL dynamics. The dispersion relations for this fluid model can be found in Hau et al. (Reference Hau, Phan, Sonnerup and Paschmann1993) and also in Abraham-Shrauner (Reference Abraham-Shrauner1973), who studied an even more general empirical model with four polytropic indices (essentially, the $\unicode[STIX]{x1D6FE}_{\Vert }$ , $\unicode[STIX]{x1D6FE}_{\bot }$ acting on $|\boldsymbol{B}|$ and $\unicode[STIX]{x1D70C}$ in (3.162) can be further split to two independent parameters). Interestingly, complex polytropic indices have been suggested to model Landau damping (Belmont & Mazelle Reference Belmont and Mazelle1992). Considering the linearized CGL equations (3.68)–(3.72) and linearized polytropic pressure equations (3.158)–(3.159) written in the $x$ – $z$ plane yields the dispersion relations for this model. The Alfvén wave is still decoupled from the rest of the system and the Alfvén wave dispersion relation is unaffected by the different pressure equations, therefore also yielding the same oblique firehose instability threshold. What is modified is the dispersion relation of the slow and fast waves. The velocity equations read
and the transformation to Fourier space yields the velocity matrix
or alternatively
Calculating the determinant yields a dispersion relation for the slow and fast modes in the form
Solutions for parallel propagation $(k_{\bot }=0)$ are the usual CGL Alfvén mode and the ion-acoustic mode $\widetilde{\unicode[STIX]{x1D714}}=\pm \sqrt{(\unicode[STIX]{x1D6FE}_{\Vert }/2)\unicode[STIX]{x1D6FD}_{\Vert }}\widetilde{k}_{\Vert }$ , and for perpendicular propagation $(k_{\Vert }=0)$ the fast mode $\widetilde{\unicode[STIX]{x1D714}}=\pm \sqrt{1+(\unicode[STIX]{x1D6FE}_{\bot }/2)a_{p}\unicode[STIX]{x1D6FD}_{\Vert }}\widetilde{k}_{\bot }$ . Similarly to the CGL description it can be shown that $A_{2}^{2}-4A_{0}\geqslant 0$ because of the trick (3.137), implying that the slow mode becomes unstable when $A_{0}<0$ , i.e. when
The two competing instabilities are again the parallel firehose instability and the highly oblique mirror instability. The parallel firehose instability has the same threshold as obtained previously. However, the mirror instability threshold reads
which is consistent with the result of Hau & Sonnerup (Reference Hau and Sonnerup1993), their equation (14), written in the formFootnote 8
For $\unicode[STIX]{x1D6FE}_{\Vert }=3,\unicode[STIX]{x1D6FE}_{\bot }=2$ , the condition (3.175) is naturally equivalent to the CGL mirror threshold. However, the condition offers a temptingly simple result to keep the perpendicular polytropic index unchanged $\unicode[STIX]{x1D6FE}_{\bot }=2$ and modifies the parallel polytropic index to $\unicode[STIX]{x1D6FE}_{\Vert }=1/2$ . In this way, the mirror threshold matches the correct kinetic threshold (3.94). Nevertheless, this is just an interesting empirical concept, it fixes the highly oblique mirror threshold but at the same time moves the slow and fast mode dispersion relations in other directions, for example the parallel sound speed in such a model is $C_{\Vert }^{2}=\frac{1}{2}p_{\Vert }^{(0)}/\unicode[STIX]{x1D70C}_{0}$ , which appears to be unrealistically low. Such a ‘tuning’ of models with free parameters – only in one particular direction and only at the level of the dispersion relations – can be sometimes useful for interpretation of observational data, however, such models will usually lead to unphysical situations. We will not consider models with polytropic indices further in this text. Instead, to obtain a better match with kinetic description, it is advisable to use the correct CGL values of $\unicode[STIX]{x1D6FE}_{\Vert }=3,\unicode[STIX]{x1D6FE}_{\bot }=2$ and focus on fluid models that derive the correct heat flux contributions $q_{\Vert }$ and $q_{\bot }$ .
3.12 CGL and radially expanding solar wind
Similarly to MHD, the CGL fluid model can be very useful for understanding the evolution of temperature in the radially expanding solar wind. A good discussion can be found for example in Matteini et al. (Reference Matteini, Landi, Hellinger, Pantellini, Maksimovic, Velli, Goldstein and Marsch2007), Matteini et al. (Reference Matteini, Hellinger, Goldstein, Landi, Velli and Neugebauer2013) and references therein. Let us first consider the MHD case. Combining the pressure equation $\text{d}p/\text{d}t+\unicode[STIX]{x1D6FE}\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{u}=0$ with the density equation $\text{d}\unicode[STIX]{x1D70C}/\text{d}t+\unicode[STIX]{x1D70C}\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{u}=0$ yields
which in a steady state (or co-moving with the convective derivative) implies $T\sim \unicode[STIX]{x1D70C}^{\unicode[STIX]{x1D6FE}-1}$ . By considering simple radially expanding solar wind, one prescribes that the density $\unicode[STIX]{x1D70C}$ evolves with the heliocentric distance $r$ as $\unicode[STIX]{x1D70C}\sim 1/r^{2}$ , and by using $\unicode[STIX]{x1D6FE}=5/3$ , the ideal MHD model yields (in the absence of any dissipation and associated turbulent heating)
In contrast, the CGL equations (3.41) are rewritten as
which in a steady state implies
For the magnetic field it is possible to consider profiles of roughly $|\boldsymbol{B}|\sim 1/r^{2}$ , or $|\boldsymbol{B}|\sim 1/r$ (one can be more precise and consider the Parker spiral profile). The first magnetic field profile yields
The second magnetic field profile yields
There is also a curious case when the magnetic field profile is prescribed to be $|\boldsymbol{B}|\sim r^{-4/3}$ , i.e. a profile between two cases of $1/r$ and $1/r^{2}$ , that yields
and the temperature anisotropy stays constant. The estimations are of course valid only until a firehose threshold or a mirror threshold is reached. For example, evolution according to (3.181) will lead the system to a firehose threshold, and evolution according to (3.182) will lead the system to a mirror threshold. In general, if we fix the density profile to $\unicode[STIX]{x1D70C}\sim 1/r^{2}$ , any magnetic field profile steeper than $r^{-4/3}$ will evolve the system towards a firehose threshold, and a magnetic field profile shallower than $r^{-4/3}$ will evolve the system towards a mirror threshold. The estimations do not reveal the presence of the firehose instability or the mirror instability in the CGL model, since the solutions are written in a steady state, and concern only evolution of mean temperatures (i.e. the solutions should be written with $T_{\Vert }^{(0)},T_{\bot }^{(0)}$ ). To find an instability, one of course needs to consider evolution equations for fluctuating variables and to analyse the associated dispersion relations, since it is the fluctuations that become unstable. Further discussion can be found in Hunana & Zank (Reference Hunana and Zank2017).
From a linear perspective, the CGL model has to be accompanied by the Hall term and FLR corrections, so that the instabilities are stabilized at small spatial scales. However, considering nonlinear evolution with finite amplitudes of fluctuations, even the non-dispersive CGL model is stabilized. For example, considering parallel propagation, Tenerani, Velli & Hellinger (Reference Tenerani, Velli and Hellinger2017) has shown that the firehose instability criterion reads (see (9) in that paper)
i.e. for infinitely small amplitudes of the magnetic field, the usual firehose criterion is obtained. However, when the CGL system reaches firehose-unstable regime and the amplitude of the fluctuations starts to grow, the firehose criterion is modified according to (3.184). Therefore, the amplitude of the fluctuations cannot grow without bounds, and beyond some critical amplitude, the system becomes again stabilized. The condition (3.184) nicely demonstrates how the CGL system is stabilized during nonlinear evolution, once the firehose instability is reached. For further discussion, see Tenerani et al. (Reference Tenerani, Velli and Hellinger2017), Tenerani & Velli (Reference Tenerani and Velli2018).
To finish this subsection, the heuristically generalized CGL model with polytropic indices $\unicode[STIX]{x1D6FE}_{\Vert }$ , $\unicode[STIX]{x1D6FE}_{\bot }$ yields that, in the steady state, the mean temperatures evolve according to
and such heuristic models can yield a wide array of possible temperature profiles.
3.13 CGL model – summary
The usual ‘combining’ of evolution equations (by performing $\unicode[STIX]{x2202}/\unicode[STIX]{x2202}t$ and substituting one equation into another) is very useful to gain insight into the linear eigenmodes of a considered system. However, for more advanced fluid models, such a procedure can become very analytically involved, and it is much easier to calculate the determinant of the entire system with analytic software, such as Maple or Mathematica. Therefore, for more advanced fluid models we often do not write down the final dispersion relation for arbitrary propagation angle (which might be uneconomically large to write down), and we only provide normalized, linearized in the $x$ – $z$ plane and Fourier transformed equations. For easy comparison with more advanced fluid models, it is beneficial to write the final CGL equations in the same form. It is useful to work in normalized units, and we drop writing the tilde. The normalized, linearized in the $x$ – $z$ plane, and Fourier transformed CGL equations read
where we have introduced the normalized parallel Alfvén speed $v_{A\Vert }^{2}\equiv 1+(\unicode[STIX]{x1D6FD}_{\Vert }/2)(a_{p}-1)$ (the tilde on $v_{A\Vert }$ is dropped too). The determinant can be easily calculated and factorized in Maple, yielding the CGL dispersion relation in normalized units (tilde are dropped)
which of course agrees with the previously obtained CGL solutions for the Alfvén mode (3.81), and the slow and fast modes (3.90).
4 Hall-CGL model
Here we consider a slightly generalized CGL model when the Hall term is included in the induction equation. Concerning the isotropic Hall-MHD model, a good comparison with solutions of kinetic theory can be found for example in Howes (Reference Howes2009). Before we proceed with the discussion of dispersion relations, we want to point out how the Hall term and the electron pressure contributions and the electron inertia contributions modify the usual ‘CGL conservation equations’ for protons. The general electric field was already written down in (3.15). For a moment it is beneficial to consider this most general form including the electron inertia and the electron pressure contributions and separate the electric field into two parts
The notation $\boldsymbol{E}_{H}$ is perhaps slightly misleading since only the first term in (4.2) is usually called the Hall electric field. Note that the first term in $\boldsymbol{E}_{H}$ still has to be used in the proton momentum equation even for the simplest CGL (or MHD) models. The induction equation then reads
which is then rewritten with the use of the convective derivative as
The last expression is then used in the completely general equations (3.46), (3.47), which yields
By cancelling pressure equations (3.3), (3.4), that are also the pressure equations of the Hall-CGL model, one obtains that for a general Hall-CGL model the electric field $\boldsymbol{E}_{H}$ modifies the conservation laws according to
implying the Hall term breaks the first and second adiabatic invariants.
4.1 Hall-CGL model with cold electrons
Here, we want to study the simplest Hall-CGL model for the proton species. We neglect the electron inertia and we make the electrons cold with $\boldsymbol{p}_{e}=0$ . The full model is described by the usual CGL equations (3.36)–(3.39), but the induction equation now reads
The induction equation is linearized as
By using (3.67), direct calculation of the second term yields
and by using $c/(4\unicode[STIX]{x03C0}en_{0})=V_{A}^{2}/(\unicode[STIX]{x1D6FA}_{p}B_{0})$ , the Hall-term contributions are
The entire linearized induction equation reads
which when written in the $x$ – $z$ plane (all $\unicode[STIX]{x2202}_{y}=0$ ) yields
These equations replace the $\unicode[STIX]{x2202}B/\unicode[STIX]{x2202}t$ equations in the linearized CGL system written in the $x$ – $z$ plane. And the trouble is immediately apparent. In contrast to the CGL system, we are no longer able to separate the Alfvén mode which was before exclusively in the $B_{y}$ , $u_{y}$ components. Now, all the components are coupled and all three modes, the Alfvén mode, the slow mode and the fast mode are coupled. This is not surprising, since the same situation is in the Hall-MHD. In this case, we have no other choice and we have to calculate the dispersion relation for all three modes. The dispersion relation can be solved analytically only for special cases and, in general, the dispersion relation has to be solved numerically. Let us quickly consider the two special cases of parallel and perpendicular propagation that can be solved analytically.
4.2 Parallel propagation
For propagation parallel to the mean magnetic field, all $\unicode[STIX]{x2202}_{x}=0$ and the familiar sound mode/ion-acoustic mode decouples in the $u_{z}$ , $p_{\Vert }$ components as
with the familiar dispersion relation $\unicode[STIX]{x1D714}^{2}=C_{\Vert }^{2}k_{\Vert }^{2}$ . The dispersion relation of this mode does not depend on the Hall term. However, the other two modes are now coupled through
If the last terms in the $B$ -field equations are neglected, i.e. if the Hall term is neglected, the two modes decouple and one naturally recovers the CGL parallel propagating Alfvén modes, both propagating with the phase speed
Applying $\unicode[STIX]{x2202}_{t}$ to the $B$ -field equations yields
which when transformed to Fourier space read
and the zero determinant requirement implies
The solutions of the dispersion relation (quadratic polynomial in $\unicode[STIX]{x1D714}^{2}$ ) is simply
where, importantly, $\sqrt{k_{\Vert }^{2}}=|k_{\Vert }|$ was used. The solution (4.28) is the whistler mode, and the solution (4.29) is the ion-cyclotron mode. For isotropic temperatures ( $a_{p}=1$ ), the result is equal to the Hall-MHD dispersion relation. At long wavelengths ( $k_{\Vert }\rightarrow 0$ ) the solutions become non-dispersive and connect to the usual CGL Alfvén modes. For parallel propagation, the Hall term is therefore responsible for splitting the two modes that otherwise would propagate at the same speed, while it does not influence the ion-acoustic mode. Or in another words, using the vocabulary of slow, Alfvén, fast, depending on plasma $\unicode[STIX]{x1D6FD}_{\Vert }$ and temperature anisotropy $a_{p}$ , the Hall term either splits the parallel propagating Alfvén and fast mode, or the slow and Alfvén mode. At very short spatial scales $(k_{\Vert }\gg 1)$ , the limits areFootnote 9
For isotropic temperatures $a_{p}=1$ the frequency of the ion-cyclotron mode (that should be actually called the proton-cyclotron mode) converges towards the proton-cyclotron frequency $\pm \unicode[STIX]{x1D6FA}_{p}$ . For anisotropic temperatures, the frequency of the mode starts to be $\unicode[STIX]{x1D6FD}_{\Vert }$ dependent and will converge to frequencies that can be higher or lower than $\unicode[STIX]{x1D6FA}_{p}$ . The frequency of the whistler mode is not bounded and technically goes to infinity, which is a consequence of neglecting the electron inertia. Importantly, both modes are stable for high wavenumbers. The solutions (4.28), (4.29) are analytically correct, but similarly to the Hall-MHD, the solutions can be rewritten to a more useful form, by realizing that the dispersion equation (4.27) can be decomposed asFootnote 10
The first bracket yields two solutions
and the second bracket yields two solutions
It is straightforward to check that $(\unicode[STIX]{x1D714}-\unicode[STIX]{x1D714}_{W,1})(\unicode[STIX]{x1D714}-\unicode[STIX]{x1D714}_{W,2})=0$ yields the whistler mode solution (4.28), and $(\unicode[STIX]{x1D714}-\unicode[STIX]{x1D714}_{IC,1})(\unicode[STIX]{x1D714}-\unicode[STIX]{x1D714}_{IC,2})=0$ yields the ion-cyclotron mode solution (4.29). Often, solutions (4.32)–(4.35) are written in an alternative form where the absolute value on $k_{\Vert }$ is removed (which obviously can be done), and the solutions are just slightly re-arranged. Additionally, for more complicated fluid models, we want to write solution in one line instead of four lines. In this example, we just write a shortcut that two solutions are
with another two solutions obtained by substituting $\unicode[STIX]{x1D714}$ with $-\unicode[STIX]{x1D714}$ .
4.2.1 Firehose instability
Now we can discuss the firehose instability. If the quantity $1+(\unicode[STIX]{x1D6FD}_{\Vert }/2)(a_{p}-1)>0$ , then all the solutions (4.32)–(4.35) have frequencies that are purely real for the entire range of wavenumbers. For isotropic temperatures $a_{p}=1$ , the dispersion relations do not depend on the value of $\unicode[STIX]{x1D6FD}_{\Vert }$ , which is a consequence of neglecting the FLR pressure corrections, as will be discussed later, and all solutions are equivalent to the Hall-MHD dispersion relations. Solutions (4.32)–(4.35) with $a_{p}=1$ are plotted in figure 4(a), where we plotted both positive and negative wavenumbers and positive and negative frequencies to clearly understand how the modes are connected to each other. The magenta and cyan lines are plotted as dashed lines, so that the growth rates in the firehose-unstable regime (plotted in figure 5) are clearly visible. For anisotropic temperatures $a_{p}\neq 1$ , the solutions are naturally $\unicode[STIX]{x1D6FD}_{\Vert }$ dependent. To show solutions in a firehose unstable regime, one has to choose $\unicode[STIX]{x1D6FD}_{\Vert }>2$ since the instability does not exist for lower values of $\unicode[STIX]{x1D6FD}_{\Vert }$ . In figure 4(b), we have chosen a value of $\unicode[STIX]{x1D6FD}_{\Vert }=4$ and varied the temperature anisotropy from $a_{p}=2$ down to a critical value of $a_{p}=0.5$ . In this regime, all the modes have purely real frequencies.
Importantly, if the temperature anisotropy parameter $a_{p}$ is further decreased, the frequencies of all the modes will become complex numbers (with a real and imaginary part) in the region where
which can be viewed as a modified Hall-CGL firehose instability threshold that is now length-scale dependent. At long spatial scales ( $k_{\Vert }\rightarrow 0$ ), the above criterion is naturally equivalent to the CGL firehose criterion. The term proportional to $k_{\Vert }^{2}$ in (4.37) is always positive and as $k_{\Vert }$ increases, after some critical wavenumber the frequency of modes will become purely real. The Hall term is therefore responsible for the stabilization of the firehose instability at sufficiently high wavenumbers (small spatial scales). The solutions in this firehose-unstable regime are plotted in figure 5, where again $\unicode[STIX]{x1D6FD}_{\Vert }=4$ and the temperature anisotropy is varied from $a_{p}=0.49$ to the lowest possible value of $a_{p}=0.0$ . Figure 5(a) shows real frequencies and the (b) shows imaginary frequencies.
It is of interest to determine which mode becomes unstable. According to our Fourier transform (see appendix A), a mode with positive imaginary frequency is unstable and growing, and a mode with negative imaginary frequency is stable and damped. If the firehose threshold (4.37) is satisfied, for some range of $k_{\Vert }$ the second terms in solutions (4.32)–(4.35) become purely imaginary and the first terms stay purely real. Consider one of the four possible quadrants in figure 5(a), for example the quadrant with $k_{\Vert }>0;\unicode[STIX]{x1D714}_{r}>0$ . Up to some critical $k_{\Vert }$ where the Hall-CGL firehose criterion is satisfied, both the blue mode and the cyan mode propagate with the same real frequency $\unicode[STIX]{x1D714}_{r}=+k_{\Vert }^{2}V_{A}^{2}/(2\unicode[STIX]{x1D6FA}_{p})$ . The differences are in the imaginary frequency, see figure 5(b), where in this quadrant the blue mode is unstable and the cyan mode is stable (and damped). After a critical $k_{\Vert }$ (for example for $a_{p}=0$ it is $k_{\Vert }V_{A}/\unicode[STIX]{x1D6FA}_{p}=2$ ), both modes become purely real and evolve with different real frequencies, here the blue mode is obviously the whistler mode and the cyan mode is the ion-cyclotron mode. The same situation is in the quadrant $k_{\Vert }<0;\unicode[STIX]{x1D714}_{r}>0$ . We can therefore conclude that for positive real frequencies, $\unicode[STIX]{x1D714}_{r}>0$ , the whistler mode is unstable and the ion-cyclotron mode is stable and damped.
Now we could say that considering both positive and negative wavenumbers and frequencies is redundant, and that we are finished. However, from a perspective of this guide, we consider useful to show that one might easily come into contradictions, if negative frequencies are not handled with sufficient care. Importantly, according to figure 5, for negative real frequencies $\unicode[STIX]{x1D714}_{r}<0$ , the whistler mode is stable and the ion-cyclotron mode is unstable. This is indeed contradictory. Keeping the $k_{\Vert }>0$ , a change from positive to negative $\unicode[STIX]{x1D714}_{r}$ represents a change of direction of propagation along $B_{0}$ . One can argue, that in a homogeneous system (e.g. excluding propagation in a stratified fluid), a change of direction of propagation should not yield a change of stability. Additionally, kinetic theory and simulations show, that it is the whistler mode that is firehose unstable for both $\unicode[STIX]{x1D714}_{r}>0$ and $\unicode[STIX]{x1D714}_{r}<0$ , and the ion-cyclotron mode is stable. It is important to emphasize that, from a perspective of solutions (4.32)–(4.35), at the range of wavenumbers where the firehose instability is present, the whistler and ion-cyclotron modes propagate (in a given quadrant) with the same real frequency. Moreover, both modes have the same polarization of electric and magnetic field (see § 4.2.3). Both modes are completely degenerate, and not distinguishable. Thus, how the modes are continued for higher wavenumbers once the firehose instability disappears, is not obvious (since $\sqrt{-1}$ can be possibly both $+\text{i}$ or $-\text{i}$ ). One should consider all the possible solutions and verify, which solutions are physically plausible. Therefore, after the firehose threshold is crossed, the analytic solutions (4.32)–(4.35) are not correctly separated. Obviously, more ‘physical information’ is needed to distinguish between the ion-cyclotron and whistler modes in the firehose-unstable regime, and to determine which solutions are causal and which are non-causal.
4.2.2 Generalization to causal (correct) solutions
The mind boggling puzzle as why the Hall-CGL solutions (4.32)–(4.35) are in contradiction to kinetic theory and to common sense (a discrepancy found by one of the referees of this text), was beautifully solved by Paul Cally. The easiest way to obtain correct solutions is to introduce a very small ‘causal’ dissipation into the system, and remove it later. Alternatively, it should be possible to obtain the same result by using Laplace–Fourier transforms on temporal-spatial scales and considering an initial value problem.
We introduce causality, by adding a tiny amount of dissipation on the right-hand sides of momentum equations (4.19)–(4.20), in the following form
where $\unicode[STIX]{x1D716}$ is a dimensionless real small positive number, for example $\unicode[STIX]{x1D716}=10^{-8}$ . The dissipation will be later completely removed by the limit $\lim _{\unicode[STIX]{x1D716}\rightarrow 0^{+}}$ . The $V_{A}^{2}/\unicode[STIX]{x1D6FA}_{p}$ is there so that the parameter $\unicode[STIX]{x1D716}$ is indeed dimensionless (since here we usually normalize with respect to $V_{A}$ and $\unicode[STIX]{x1D6FA}_{p}$ , but other normalizations can be of course considered). The factor of 2 is there for pure convenience, so that the final dispersion relations (4.43)–(4.46) are of the utmost beauty, and that we do not have to later redefine $\unicode[STIX]{x1D716}$ . The momentum equations are accompanied by the induction equations (4.21)–(4.22). The reader is encouraged to verify the calculations by switching to normalized variables $\widetilde{k_{\Vert }}=k_{\Vert }V_{A}/\unicode[STIX]{x1D6FA}_{p}$ , $\widetilde{\unicode[STIX]{x1D714}}=\unicode[STIX]{x1D714}/\unicode[STIX]{x1D6FA}_{p}$ , $\widetilde{v}_{A\Vert }^{2}=1+(\unicode[STIX]{x1D6FD}_{\Vert }/2)(a_{p}-1)$ , we continue in physical units.
By adding the dissipation, the dispersion relation (4.31) is modified to the following form
It is useful to define the Hall-CGL firehose threshold
and similar generalizations can be done for models with FLR corrections. Then, since $\unicode[STIX]{x1D716}$ is small, it is easy to show that the non-causal solutions (4.32)–(4.35) are ‘corrected’ by the tiny dissipation to the following causal form
The reader is encouraged to plot these solutions, for example with $\unicode[STIX]{x1D716}=10^{-8}$ , and verify that figure 5(b), is indeed corrected, i.e. that the whistler is unstable for both $\unicode[STIX]{x1D714}_{r}>0$ and $\unicode[STIX]{x1D714}_{r}<0$ . We want to further address the limits. For clarity, let us first consider the following simplest case
where one can further suppress the small real $\unicode[STIX]{x1D716}/2$ on the right-hand side. The limit (4.47) provides an answer, when $\sqrt{-1}$ is equal to $+\text{i}$ or $-\text{i}$ . The result (4.47) can be easily generalized, and limits in expressions (4.43)–(4.46) are calculated according to
Obviously, the whistler solutions (4.44), (4.45) are unstable, and the ion-cyclotron solutions (4.43), (4.46) are damped. Only now we can conclude with confidence, that the parallel firehose instability as an instability of the whistler mode.
We note that if instead of the momentum equations, we added the dissipation (diffusivity) to the induction equations, the solutions (4.43)–(4.46) are recovered with $\unicode[STIX]{x1D716}\rightarrow -\unicode[STIX]{x1D716}$ . In other words, by adding dissipation to the induction equations, the whistler mode would be always damped and the ion-cyclotron mode unstable. The result makes sense with respect to kinetic theory, since the ion-cyclotron resonances enter through the velocity moment (by integration over the perturbation of the distribution function $f^{(1)}$ ), or more precisely through the current $\boldsymbol{j}=\sum _{r}q_{r}n_{r}\boldsymbol{u}_{r}$ , see Part 2.
4.2.3 Polarization
The polarization of the magnetic field can be easily checked, actually without plugging any dispersion relations into the matrix (4.26), since the matrix can be represented simply as
where $a=\unicode[STIX]{x1D714}^{2}-v_{A\Vert }^{2}k_{\Vert }^{2}$ and $b=V_{A}^{2}k_{\Vert }^{2}\unicode[STIX]{x1D714}/\unicode[STIX]{x1D6FA}_{p}$ . It is useful to define the polarization as the angle between the complex $B_{x}$ and $B_{y}$ components, which is calculated as $\text{Arg}(B_{y}/B_{x})$ . If the result is smaller than zero, the wave is left polarized. If the result is larger than zero, the wave is right polarized. The determinant of the matrix (4.49) is $a^{2}-b^{2}=0$ , so $a=\pm b$ . The case $a=-b$ (which is directly equal to the first bracket in (4.31) with solutions (4.32), (4.33)) yields that $B_{y}=-\text{i}B_{x}$ , so that
The second case $a=+b$ (which is equal to the second bracket in (4.31) with solutions (4.34), (4.35)) yields that $B_{y}=+\text{i}B_{x}$ , so that
Similarly, one can calculate the polarization of electric field $\boldsymbol{E}$ , and (when not specified that the $\boldsymbol{B}$ field is considered), the words left/right polarized are usually meant for the $\boldsymbol{E}$ field. The ratio $E_{y}/E_{x}$ can be easily calculated, for example from equations (4.74), (4.75) (written there in the normalized form), which when multiplied by $\unicode[STIX]{x1D714}$ and use of momentum equations, yields for the special case of parallel propagation considered here
When plotting many solutions, it is useful to additionally divide the angle by $\unicode[STIX]{x03C0}$ , and define the $\boldsymbol{B}$ and $\boldsymbol{E}$ field polarizations as
Electric field polarization defined this way was used for example by Hunana et al. (Reference Hunana, Goldstein, Passot, Sulem, Laveder and Zank2013) to investigate properties of (highly oblique) kinetic Alfvén waves in various fluid models and the kinetic description; and also by Camporeale & Burgess (Reference Camporeale and Burgess2017), who compared various linear modes in hybrid, gyrokinetic and fully kinetic descriptions.Footnote 11 When ${\mathcal{P}}_{E}<0$ the wave is called left polarized and when ${\mathcal{P}}_{E}>0$ , the wave is called right polarized. For the first group (4.50) this yields $E_{y}/E_{x}=-\text{i}$ and ${\mathcal{P}}_{E}=-1/2<0$ (left polarized wave), and for the second group (4.51) $E_{y}/E_{x}=+\text{i}$ and ${\mathcal{P}}_{E}=+1/2>0$ (right polarized wave).
Finally, it is of interest to note another possible definition of polarization, that is defined with respect to positive real frequency, i.e. the electric field polarization (4.53) is redefined to
A similar definition of polarization with $\text{sign}(\unicode[STIX]{x1D714}_{r})$ is used for example in the book by Gary (Reference Gary1993), page 91. With such a definition, the whistler mode is always right polarized, both in the firehose-stable and firehose-unstable regimes, since for the whistler mode ((4.34), blue line) the real frequency remains $\unicode[STIX]{x1D714}_{r}>0$ , and for the whistler mode ((4.33), magenta line) $\unicode[STIX]{x1D714}_{r}<0$ . The situation is more confusing for the ion-cyclotron mode, where the firehose-stable and firehose-unstable regimes have to be considered separately. (i) In the firehose-stable regime, both ion-cyclotron modes are left polarized, since for the mode ((4.32), green line) $\unicode[STIX]{x1D714}_{r}>0$ , and for the mode ((4.35), cyan line) $\unicode[STIX]{x1D714}_{r}<0$ . (ii) In contrast, in the firehose-unstable regime, both ion-cyclotron modes are right polarized, since for the mode ((4.32), green line) $\unicode[STIX]{x1D714}_{r}<0$ , and for the mode ((4.35), cyan line) $\unicode[STIX]{x1D714}_{r}>0$ .
4.2.4 Simplest ion-cyclotron resonances
When talking about ‘resonances’, one often expects to see expressions with a denominator that becomes zero. Generally speaking, the simplest ‘singular’ expressions representing resonances just arise from a technique used in deriving the dispersion relations. If one eliminates the electric field $\boldsymbol{E}$ from the beginning and expresses the fluid model through a matrix multiplied by a vector ( $\unicode[STIX]{x1D70C}$ , $u_{x}$ , $u_{y}$ , $u_{z}$ , $b_{x}$ , $b_{y}$ , $b_{z}$ , $p_{\Vert }$ , $p_{\bot }$ ), leads to dispersion relations that are not singular explicitly. In contrast, expressing the model through a matrix multiplied by a vector $(E_{x},E_{y},E_{z})$ , yields equivalent dispersion relations where the resonances are shown explicitly. The dispersion equation (4.31) can be easily rewritten as
which, in the cold plasma limit ( $\unicode[STIX]{x1D6FD}_{\Vert }=0$ ) or in this case also for isotropic temperatures $(a_{p}=1)$ , simplifies to
making it more evident that we have the simplest ion-cyclotron resonances here too. Indeed, for $k\rightarrow \infty$ the frequency $\unicode[STIX]{x1D714}\rightarrow \unicode[STIX]{x1D6FA}_{p}$ (first case) and $\unicode[STIX]{x1D714}\rightarrow -\unicode[STIX]{x1D6FA}_{p}$ (second case). For a plasma with finite $\unicode[STIX]{x1D6FD}_{\Vert }$ and non-isotropic temperatures, the $k\rightarrow \infty$ limit yields more general ion-cyclotron resonances (4.30).
To better understand what effects are present and what effects are absent in the Hall-CGL model, we plot fully kinetic solutions obtained with the WHAMP code in figure 6. The proton temperature is prescribed to be isotropic $T_{\bot }/T_{\Vert }=1$ and the electrons are prescribed to be cold with $T_{e}/T_{p}=0$ (in the WHAMP code, the value is chosen to be $10^{-8}$ ). In the WHAMP code it is necessary to prescribe the ratio of the parallel proton thermal speed to the speed of light, and we choose $v_{\text{th}\Vert }/c=10^{-4}$ . The solid lines are kinetic solutions with different $\unicode[STIX]{x1D6FD}_{\Vert }=10^{-4};0.1;1;2;4$ . The blue curve is the whistler mode and the green curve is the ion-cyclotron mode. Solutions of the Hall-CGL model are shown in figure 6(a) and, for isotropic temperatures, the solutions are completely $\unicode[STIX]{x1D6FD}_{\Vert }$ independent and are represented with two dashed curves. It is shown that the kinetic solutions for the whistler mode are almost $\unicode[STIX]{x1D6FD}_{\Vert }$ independent and only the mode with $\unicode[STIX]{x1D6FD}_{\Vert }=4$ deviates from the other whistler solutions, and only between $kd_{i}=0.1-2.0$ . For comparison, in figure 6(b) we plot solutions of the Hall-CGL-FLR2 model (the model is discussed later in the text).
4.3 Hall-CGL dispersion relation for arbitrary propagation angle
To make everything as clear as possible, let us write down one more time the normalized equations we are solving, including the linearized and Fourier transformed equations in the $x$ – $z$ plane. The nonlinear Hall-CGL equations with cold electrons are (dropping the tilde)
Linearizing in normalized units is even easier, because (temporarily introducing back the tilde for clarity) $\widetilde{p}_{\Vert p}^{(0)}=1$ ; $\widetilde{p}_{\bot p}^{(0)}=a_{p}$ ; $\widetilde{\unicode[STIX]{x1D70C}}_{0}=1$ ; $\widetilde{B}_{0}=1$ . Linearizing these equations, writing them in the $x$ – $z$ plane and Fourier transforming yields (dropping also the proton index ‘ $p$ ’)
where the normalized $v_{A\Vert }^{2}=1+(\unicode[STIX]{x1D6FD}_{\Vert }/2)(a_{p}-1)$ . Alternatively, it is sometimes illuminating to work with the general induction equation where the electric field is used
and in this case with cold electrons the electric field components are
The induction equation (4.71)–(4.73) with the above electric field components is of course equivalent to the induction equation (4.66)–(4.68). All equations are normalized, the tilde are dropped, and the electric field $\widetilde{E}=E/E_{0}$ , where $E_{0}=V_{A}B_{0}/c$ . The Hall-CGL dispersion relation reads
where the wavenumber $k^{2}=k_{\Vert }^{2}+k_{\bot }^{2}$ . Perhaps the nicest form is to keep the CGL contributions on the left-hand side (so the terms can be factorized to form the CGL dispersion relation) and put the Hall term contributions on the right-hand side, which yields the Hall-CGL dispersion relation
Solutions for the parallel propagation ( $k_{\bot }=0$ ) are of course the whistler and the ion-cyclotron mode
with other two solutions obtained by exchanging $\unicode[STIX]{x1D714}$ with $-\unicode[STIX]{x1D714}$ . By bringing back the tilde to the normalized solution (4.84), the result (4.36) in physical units is easily recovered. The other parallel solution is the ion-acoustic mode $\unicode[STIX]{x1D714}=\pm \sqrt{3\unicode[STIX]{x1D6FD}_{\Vert }/2}k_{\Vert }$ . For perpendicular propagation ( $k_{\Vert }=0$ ), the fast mode solution is $\unicode[STIX]{x1D714}=\pm k_{\bot }\sqrt{1+a_{p}\unicode[STIX]{x1D6FD}_{\Vert }}$ , which is equivalent to CGL since the Hall term disappears.
We investigated solutions of the Hall-CGL model in much greater detail in our paper ‘On the Parallel and Oblique Firehose Instability in Fluid Models’ (Hunana & Zank Reference Hunana and Zank2017), a paper which was essentially pulled out of this guide. We do not want to repeat the discussion with many associated figures here, and a reader who is further interested in the Hall-CGL firehose instability can find much more information in that paper.
5 FLR corrections to the pressure tensor
The finite Larmor radius corrections to the pressure tensor can be derived at several levels of approximation. The pressure tensor equation (2.50) can be rewritten as
This equation is exact and since $\boldsymbol{p}=\boldsymbol{p}^{\text{g}}+\unicode[STIX]{x1D72B}$ , the FLR tensor $\unicode[STIX]{x1D72B}$ is described implicitly.
5.1 Fully nonlinear FLR corrections
If preservation of all nonlinearities is desired, this implicit equation can be further rewritten by the following procedure that can be found for example in Hsu et al. (Reference Hsu, Hazeltine and Morrison1986), Passot & Sulem (Reference Passot and Sulem2004) and Ramos (Reference Ramos2005) as a brief note. By applying $\hat{\boldsymbol{b}}\times$ to the left-hand side, the first term calculates as
where we have used that $\unicode[STIX]{x1D716}_{ikl}\unicode[STIX]{x1D716}_{lrs}=\unicode[STIX]{x1D6FF}_{ir}\unicode[STIX]{x1D6FF}_{ks}-\unicode[STIX]{x1D6FF}_{is}\unicode[STIX]{x1D6FF}_{kr}$ . The second term on the left-hand side of (5.1) calculates quite differently and we will need an identity
Let us also remind ourselves that in the index notation $\unicode[STIX]{x1D6F1}_{ss}=\text{Tr}\,\unicode[STIX]{x1D72B}=0$ and $\hat{b}_{k}\hat{b}_{l}\unicode[STIX]{x1D6F1}_{kl}=\unicode[STIX]{x1D72B}:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}=0$ . The second term calculates as
The entire left-hand side therefore reads
Performing a usual matrix (single) dot product of this result with matrix $\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}$ yields
This further yields that
and adding this result together with its transpose implies
where we have used that the $\unicode[STIX]{x1D6F1}_{ij}=\unicode[STIX]{x1D6F1}_{ji}$ . One therefore derives that the FLR tensor can be extracted from the left-hand side of (5.1) by performing an ‘inversion’ procedure
We can introduce matrix $\unicode[STIX]{x1D73F}$ that will represent the right-hand side of (5.1) as
By performing operations (5.9) in this equation yields the FLR tensor in the following form
The expression (5.12) is completely general since no simplifications were introduced, the expression is exact. However, the equation is still implicit, since on the right-hand side the tensor $\unicode[STIX]{x1D73F}$ contains the full pressure tensor $\boldsymbol{p}=\boldsymbol{p}^{\text{g}}+\unicode[STIX]{x1D72B}$ . Nevertheless, the equation is extremely useful, once the expansion of the right-hand side is performed, as we will do below. Note that the brackets are not unique, since
where the last choice leaves the brackets unspecified (since both options are allowed), similarly to $\hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\boldsymbol{u}$ . Alternatively, one can get rid of the operator $\boldsymbol{\cdot }\unicode[STIX]{x1D644}$ (which acting on a matrix yields the same matrix), and split the result into two parts
Another possible form can be obtained by subtracting the evolution equations for the gyrotropic pressure components $\unicode[STIX]{x2202}p_{\Vert }/\unicode[STIX]{x2202}t$ and $\unicode[STIX]{x2202}p_{\bot }/\unicode[STIX]{x2202}t$ . The scalar pressure equations were derived by applying $:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}$ and $:(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})/2$ to the pressure tensor equation. The subtraction can be formally represented by introducing ‘overbar’ projection operator $\bar{\boldsymbol{a}}$ that projects any $(3\times 3)$ matrix $\boldsymbol{a}$ to
The definition is of course motivated by the pressure decomposition (2.46). By applying this operator to (5.10), the left-hand side remains the same, yielding $(\hat{\boldsymbol{b}}\times \unicode[STIX]{x1D72B})^{S}=-\bar{\unicode[STIX]{x1D73F}}$ . The solution (5.12) for the FLR tensor therefore rewrites
The subtraction of scalar pressure equations is motivated by the observation that when working in the linear approximation directly with (5.10), i.e. without performing the inversion procedure, the scalar pressure equations have to be subtracted at the end. However, by performing the inversion procedure, the scalar pressure equations are subtracted ‘automatically’ during calculations, and it is actually easier to work directly with (5.12) instead of (5.16).
Therefore, let us work with (5.12). The leading-order nonlinear FLR corrections are obtained by neglecting the non-gyrotropic contributions $\unicode[STIX]{x1D72B}$ and $\boldsymbol{q}^{\text{ng}}$ on the right-hand side of (5.12), i.e. by prescribing
Therefore, the only non-gyrotropic quantities that were retained on the right-hand side of (5.12), are the perpendicular components of the velocity $\boldsymbol{u}$ . The slightly unappealing factors $B_{0}/|\boldsymbol{B}|$ in all the expressions can be removed, by redefining the cyclotron frequency with $|\boldsymbol{B}|$ instead of $B_{0}$ , so that $\unicode[STIX]{x1D6FA}_{|\boldsymbol{B}|}=e|\boldsymbol{B}|/(mc)$ . Let us calculate (5.17) step by step. For the brevity of the calculations, it is useful to introduce convective derivative $\text{d}/\text{d}t=\unicode[STIX]{x2202}/\unicode[STIX]{x2202}t+\boldsymbol{u}\boldsymbol{\cdot }\unicode[STIX]{x1D735}$ and work with
We want to present complete derivation with the least amount of algebra involved, and we first want to obtain the form of Schekochihin et al. (Reference Schekochihin, Cowley, Rincon and Rosin2010). Let us first write each term in $\unicode[STIX]{x1D73F}^{(1)}$ in the index notation. By using
in the following form
and grouping similar terms in $\unicode[STIX]{x1D73F}^{(1)}$ together yields
The first line in the above expression is left unchanged, since it is the final result, defined below as a matrix $\unicode[STIX]{x1D61E}_{ij}=p_{\bot }(\unicode[STIX]{x2202}_{i}u_{j}+\unicode[STIX]{x2202}_{j}u_{i})+\unicode[STIX]{x2202}_{j}(q_{\bot }\hat{b}_{i})+\unicode[STIX]{x2202}_{i}(q_{\bot }\hat{b}_{j})$ , that can be written as $\unicode[STIX]{x1D652}=p_{\bot }\unicode[STIX]{x1D735}\boldsymbol{u}+\unicode[STIX]{x1D735}(q_{\bot }\hat{\boldsymbol{b}})+(p_{\bot }\unicode[STIX]{x1D735}\boldsymbol{u}+\unicode[STIX]{x1D735}(q_{\bot }\hat{\boldsymbol{b}}))^{\text{T}}=(p_{\bot }\unicode[STIX]{x1D735}\boldsymbol{u}+\unicode[STIX]{x1D735}(q_{\bot }\hat{\boldsymbol{b}}))^{S}$ . However, calculating the rest of the expression according to (5.17) simplifies several terms, and for example all terms in the third line of (5.27) vanish. The terms in the third line containing $\hat{b}_{i}\hat{b}_{j}$ vanish after applying $\hat{\boldsymbol{b}}\times$ , since $(\hat{\boldsymbol{b}}\times (\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}))_{ij}=\unicode[STIX]{x1D716}_{ikl}\hat{b}_{k}\hat{b}_{l}\hat{b}_{j}=0$ . The terms in the third line containing $\unicode[STIX]{x1D6FF}_{ij}$ vanish after applying the symmetric operator since
We therefore need to concentrate only on the second line of (5.27), and slightly rewrite that one. For example, the first term in the second line of (5.27), the term $\text{d}/\text{d}t(\hat{b}_{i}\hat{b}_{j})$ , calculates as
The second term in the second line of (5.27), the term $\hat{b}_{i}\hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D735}u_{j}$ , vanishes after applying $\hat{\boldsymbol{b}}\times$ , since
The third term in the second line of (5.27), the term $\hat{b}_{j}\hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D735}u_{i}$ however does not vanish. Note that $b_{j}\hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D735}u_{i}=((\hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\boldsymbol{u})\hat{\boldsymbol{b}})_{ij}$ and the third term calculates as
The fourth term in the second line of (5.27), calculates in the same fashion than the second and third terms since $\hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D735}(\hat{b}_{i}\hat{b}_{j})=\hat{b}_{i}(\hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\hat{b}_{j})+(\hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\hat{b}_{i})\hat{b}_{j}$ , yielding
Collecting all the results together yields the FLR tensor
where $\unicode[STIX]{x1D652}$ is a matrix and $\boldsymbol{w}$ is a vector. The result is equivalent to (5)–(8) of Schekochihin et al. (Reference Schekochihin, Cowley, Rincon and Rosin2010), derived in the Appendix of that paper in a somewhat simpler way. In Schekochihin et al. (Reference Schekochihin, Cowley, Rincon and Rosin2010), instead of our notation for matrix $\unicode[STIX]{x1D652}$ and vector $\boldsymbol{w}$ , the symbols $\boldsymbol{S}$ and $\unicode[STIX]{x1D748}$ are used, where $\boldsymbol{S}=\unicode[STIX]{x1D652}$ and $\unicode[STIX]{x1D748}=-\boldsymbol{w}$ . We wanted to avoid the use of $\boldsymbol{S}$ and $\unicode[STIX]{x1D748}$ , since these symbols are typically used when decomposing the heat flux tensor. For a reader who just jumped to this result, the symmetric operator is defined as $\unicode[STIX]{x1D608}_{ij}^{S}=\unicode[STIX]{x1D608}_{ij}+\unicode[STIX]{x1D608}_{ji}$ . For clarity, for any vector $\boldsymbol{w}$ and any matrix $\unicode[STIX]{x1D652}$ , the following identities hold
The last identity follows from an identity for general matrix $\unicode[STIX]{x1D63C}$ , that reads $(\hat{\boldsymbol{b}}\times \unicode[STIX]{x1D63C})^{\text{T}}=-\unicode[STIX]{x1D63C}^{\text{T}}\times \hat{\boldsymbol{b}}$ , and for a matrix $\unicode[STIX]{x1D63C}=\unicode[STIX]{x1D63D}\boldsymbol{\cdot }\boldsymbol{C}$ the $\unicode[STIX]{x1D63C}^{\text{T}}=\boldsymbol{C}^{\text{T}}\boldsymbol{\cdot }\unicode[STIX]{x1D63D}^{\text{T}}$ , implying $(\hat{\boldsymbol{b}}\times \unicode[STIX]{x1D63D}\boldsymbol{\cdot }\boldsymbol{C})^{\text{T}}=-\boldsymbol{C}^{\text{T}}\boldsymbol{\cdot }\unicode[STIX]{x1D63D}^{\text{T}}\times \hat{\boldsymbol{b}}$ .
The result (5.33)–(5.35) can be slightly simplified. The matrix $\unicode[STIX]{x1D652}$ in the index notation reads
and by applying $\hat{\boldsymbol{b}}\times$ eliminates only the last term
implying that the matrix $\unicode[STIX]{x1D652}$ could possibly be redefined to
However, now the matrix is not symmetric. Note that the brackets in the expression (5.33) are not unique, and one can for example pull the $\hat{\boldsymbol{b}}\times$ out, so that the expression reads
Let us multiply the last term in (5.40) by $\boldsymbol{\cdot }(\unicode[STIX]{x1D644}+3\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})$ , it calculates as
Obviously, it is beneficial to move this term to the vector $\boldsymbol{w}$ . Therefore, redefining $\unicode[STIX]{x1D652}$ and $\boldsymbol{w}$ , the solution reads
Now the matrix $\unicode[STIX]{x1D652}$ is again symmetric, and $\boldsymbol{w}$ is just a vector. This solution with matrix (5.44) is slightly nicer than the solution with matrix (5.34). Alternatively, one can write the solution in its full form
The nonlinear solution for the FLR tensor $\unicode[STIX]{x1D72B}$ can be further re-arranged. For example, to obtain the form reported by Ramos (Reference Ramos2005), one needs to separate a part that is obtained by performing $\unicode[STIX]{x1D72B}\boldsymbol{\cdot }\hat{\boldsymbol{b}}$ . The solution (5.43) in the index notation reads
and direct calculation yields
where it was useful to define the new vector $\boldsymbol{h}\equiv \unicode[STIX]{x1D652}\boldsymbol{\cdot }\hat{\boldsymbol{b}}+\boldsymbol{w}$ , by adding the quantity
to the vector $\boldsymbol{w}$ . By separating the $\unicode[STIX]{x1D72B}\boldsymbol{\cdot }\hat{\boldsymbol{b}}$ part, the solution (5.43) for $\unicode[STIX]{x1D72B}$ is therefore re-arranged to the following form
The solution (5.51) has a nice advantage in that it is decomposed with respect to the magnetic field lines. In other words, projecting $\unicode[STIX]{x1D72B}$ along the field lines by performing $\unicode[STIX]{x1D72B}\boldsymbol{\cdot }\hat{\boldsymbol{b}}$ cancels the first term, and directly yields vector $\hat{\boldsymbol{b}}\times \boldsymbol{h}/\unicode[STIX]{x1D6FA}_{|\boldsymbol{B}|}$ , that could be defined as vector $\vec{\unicode[STIX]{x1D72B}}_{z}$ .
5.1.1 Employing a non-dispersive induction equation
Considering the long-wavelength low-frequency limit, the solution (5.45) for vector $\boldsymbol{w}$ and the solution (5.53) for vector $\boldsymbol{h}$ can be further simplified, by using the usual non-dispersive CGL/MHD induction equation, that can be written in the following form (see later in the text)
By further applying $\hat{\boldsymbol{b}}\times$ to the induction equation, the last term on the right-hand side disappears
and vectors (5.45), (5.53) simplify to
The result (5.57) can be further rewritten with vorticity $\unicode[STIX]{x1D74E}=\unicode[STIX]{x1D735}\times \boldsymbol{u}$ , by using identity $(\unicode[STIX]{x1D735}\boldsymbol{u})\boldsymbol{\cdot }\hat{\boldsymbol{b}}-\hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\boldsymbol{u}=\hat{\boldsymbol{b}}\times (\unicode[STIX]{x1D735}\times \boldsymbol{u})=\hat{\boldsymbol{b}}\times \unicode[STIX]{x1D74E}$ , finally yielding
Results (5.51), (5.52), (5.58) are equivalent to equations (49)–(51) of Ramos (Reference Ramos2005) (see also (11)–(13) of Macmahon (Reference Macmahon1965)). Further evaluation of nonlinear FLR corrections to higher precision is out of the scope of this simple guide, and the interested reader is referred to Ramos (Reference Ramos2005). We will continue in a much simpler linear approximation.
5.2 Simplest FLR corrections (FLR1)
We could continue by evaluating the obtained nonlinear results in the linear approximation. However, the nonlinear FLR calculations were quite complicated and it is very useful to clarify how we derive the simplest FLR corrections directly in the linear approximation. Let us start again, and work with the general equation (5.1). To derive the simplest finite Larmor radius corrections to the pressure tensor, one has to cancel the $\unicode[STIX]{x1D72B}$ contributions on the right-hand side of equation (5.1) and also neglect the heat flux contributions, which yields
Considering complete linearization (so that we can easily go to Fourier space), this is equivalent to performing expansion to the first order in frequency $\unicode[STIX]{x1D714}/\unicode[STIX]{x1D6FA}$ and to the first order in wavenumber $r_{L}k$ . We do not want to perform complete linearization just yet. However, to move further, we introduce simplification by not evaluating the FLR corrections along the magnetic field lines, but along the ambient magnetic field $B_{0}$ that is prescribed to be in the $z$ -direction, and therefore $\hat{\boldsymbol{b}}=(0,0,1)\equiv \hat{\boldsymbol{b}}_{0}$ . This simplification, typically used in numerical simulations, is appropriate if the magnetic field lines are not too distorted with respect to the mean magnetic field. Here, we will distinguish between complete linearization and evaluation along $\hat{\boldsymbol{b}}_{0}$ . However, it is important to note that the procedure of evaluating some terms with $\hat{\boldsymbol{b}}_{0}$ , while other nonlinearities are kept, is of course not a fully consistent procedure. With the approach presented here, the only fully consistent procedure that simplifies the general equation (5.1), is complete linearization. Additionally, as discussed by Passot et al. (Reference Passot, Henri, Laveder and Sulem2014), neglecting magnetic field line distortion may in some instances lead to spurious instabilities.
Let us evaluate (5.59) along $\hat{\boldsymbol{b}}_{0}$ . On the left-hand side we calculate each component in the matrix separately. Since $(\hat{\boldsymbol{b}}\times \unicode[STIX]{x1D72B})_{ij}=\unicode[STIX]{x1D716}_{ikl}\hat{b}_{k}\unicode[STIX]{x1D6F1}_{lj}$ , it is obvious that non-zero contributions are obtained only if $\hat{b}_{k}=\hat{b}_{z}=1$ , so the index $k=z$ , and $(\hat{\boldsymbol{b}}\times \unicode[STIX]{x1D72B})_{ij}=\unicode[STIX]{x1D716}_{izl}\unicode[STIX]{x1D6F1}_{lj}$ . Now, for example, the $^{\prime }xx^{\prime }$ component calculates as $(\hat{\boldsymbol{b}}\times \unicode[STIX]{x1D72B})_{xx}=\unicode[STIX]{x1D716}_{xzl}\unicode[STIX]{x1D6F1}_{lx}$ and the only non-zero contribution is obtained for index $l=y$ , which yields $(\hat{\boldsymbol{b}}\times \unicode[STIX]{x1D72B})_{xx}=\unicode[STIX]{x1D716}_{xzy}\unicode[STIX]{x1D6F1}_{yx}=-\unicode[STIX]{x1D716}_{xyz}\unicode[STIX]{x1D6F1}_{yx}=-\unicode[STIX]{x1D6F1}_{yx}=-\unicode[STIX]{x1D6F1}_{xy}$ , where in the last step we used that the FLR tensor is symmetric. The entire left-hand side $[\hat{\boldsymbol{b}}\times \unicode[STIX]{x1D72B}+(\hat{\boldsymbol{b}}\times \unicode[STIX]{x1D72B})^{\text{T}}]_{ij}=\unicode[STIX]{x1D716}_{izl}\unicode[STIX]{x1D6F1}_{lj}+\unicode[STIX]{x1D716}_{jzl}\unicode[STIX]{x1D6F1}_{li}$ and it is straightforward to calculate each component, which yields
The gyrotropic pressure tensor is of course
The first term on the right-hand side of (5.59) calculates as
and using the matrix notation
The second term in (5.63) contributes because of the induction equation $\unicode[STIX]{x2202}\boldsymbol{B}/\unicode[STIX]{x2202}t=-c\unicode[STIX]{x1D735}\times \boldsymbol{E}$ . For now, we will not use the induction equation and keep our FLR calculations in the above form. Let us continue with the second term on the right-hand side of (5.59). When completely linearized this term does not contribute, but for now we will keep it and combine it with the first term into a convective derivative $\unicode[STIX]{x2202}\boldsymbol{p}^{\text{g}}/\unicode[STIX]{x2202}t+\boldsymbol{u}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\boldsymbol{p}^{\text{g}}=\text{d}\boldsymbol{p}^{\text{g}}/\text{d}t$ , so that $\unicode[STIX]{x2202}/\unicode[STIX]{x2202}t$ in (5.63) is replaced by $\text{d}/\text{d}t$ . The remaining terms on the right-hand side of (5.59) are straightforward to evaluate since no derivatives of $\hat{\boldsymbol{b}}$ are encountered, and the third term $\boldsymbol{p}^{\text{g}}\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{u}|_{\hat{\boldsymbol{b}}=\hat{\boldsymbol{b}}_{0}}$ is equal to matrix (5.61) multiplied by $\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{u}$ . The last two terms of (5.59) are together evaluated as
Now we need to subtract the scalar CGL pressure equations obtained previously. Since $\hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\boldsymbol{u}\boldsymbol{\cdot }\hat{\boldsymbol{b}}|_{\hat{\boldsymbol{b}}=\hat{\boldsymbol{b}}_{0}}=\unicode[STIX]{x2202}_{z}u_{z}$ , the scalar pressure equations read
By cancelling terms coming from these two equations (affecting only the diagonal components), the entire equation (5.59) rewrites
The identity $\unicode[STIX]{x1D72B}:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}=0$ evaluated for $\hat{\boldsymbol{b}}=\hat{\boldsymbol{b}}_{0}$ implies that $\unicode[STIX]{x1D6F1}_{zz}=0$ and the identity $\text{Tr}\,\unicode[STIX]{x1D72B}=0$ further implies that $\unicode[STIX]{x1D6F1}_{xx}=-\unicode[STIX]{x1D6F1}_{yy}$ . The components of the pressure tensor therefore read
Finally, the time derivative of the unit vector
which evaluated for $\hat{\boldsymbol{b}}=\hat{\boldsymbol{b}}_{0}=(0,0,1)$ , $|\boldsymbol{B}|=B_{0}$ yields
The general induction equation reads
which evaluates in the $\hat{\boldsymbol{b}}=\hat{\boldsymbol{b}}_{0}$ approximation as
These expressions are used in the FLR tensor components $\unicode[STIX]{x1D6F1}_{xz}$ and $\unicode[STIX]{x1D6F1}_{yz}$ , and the FLR tensor reads
However, since the FLR tensor was derived with a precision to only first order in wavenumber $k$ , to make everything consistent, the Hall contributions in the induction equation are usually (but not always) neglected. We therefore have $\text{d}\hat{b}_{x}/\text{d}t=\unicode[STIX]{x2202}_{z}u_{x}$ and $\text{d}\hat{b}_{y}/\text{d}t=\unicode[STIX]{x2202}_{z}u_{y}$ which yields the leading-order FLR pressure tensor evaluated along the magnetic field $\hat{\boldsymbol{b}}=(0,0,1)$ in the form
which is equivalent to equation (5) of Yajima (Reference Yajima1966). Perhaps surprisingly, the result is also equivalent to the highly collisional equations (2.20)–(2.24) of Braginskii (Reference Braginskii1965), after one prescribes $p_{\Vert }=p_{\bot }$ and after terms containing the collisional time $\unicode[STIX]{x1D70F}$ are ‘ignored’ ( $\unicode[STIX]{x1D70F}\sim 1/\unicode[STIX]{x1D708}$ where $\unicode[STIX]{x1D708}$ is the usual collisional frequency). A proper collisionless limit $\unicode[STIX]{x1D70F}\rightarrow \infty$ cannot be obtained from the expressions of Braginskii (Reference Braginskii1965), since the results are proportional to $\unicode[STIX]{x1D70F}$ (and also $1/\unicode[STIX]{x1D70F}$ ).
Furthermore, evaluation along $\hat{\boldsymbol{b}}=(0,0,1)$ basically means that the magnetic field terms of the system were linearized and other parts of the system were not. The expansion procedure preserved some nonlinearities, but other nonlinearities that are of the same order were implicitly neglected. Therefore, the only fully consistent procedure is to evaluate the FLR tensor in the linear approximation, yielding
Notably, the tensor is very different from (3.10)–(3.13) of Oraevskii et al. (Reference Oraevskii, Chodura and Feneberg1968). The FLR corrections to the pressure tensor (these ones or more complicated ones) were used in a number of direct numerical simulations, see for example Borgogno et al. (Reference Borgogno, Hellinger, Passot, Sulem and Trávníček2009), Hunana et al. (Reference Hunana, Laveder, Passot, Sulem and Borgogno2011), Kobayashi et al. (Reference Kobayashi, Sahraoui, Passot, Laveder, Sulem, Huang, Henri and Smets2017) and Perrone et al. (Reference Perrone, Passot, Laveder, Valentini, Sulem, Zouganelis, Veltri and Servidio2018), or for the isotropic case see for example Ghosh & Parashar (Reference Ghosh and Parashar2015), and references therein.
5.3 Hall-CGL-FLR1 fluid model
The normalized, linearized, Fourier transformed system of equations written in the $x$ – $z$ plane reads (dropping the tilde everywhere)
where $v_{A\Vert }^{2}=1+(\unicode[STIX]{x1D6FD}_{\Vert }/2)(a_{p}-1)$ , and the first-order FLR tensor $\unicode[STIX]{x1D72B}=\unicode[STIX]{x1D72B}^{(1)}$ is
The dispersion relation for the Hall-CGL-FLR1 fluid model can be written as
where the polynomial on the right-hand side
The solution for strictly parallel propagation is
For strictly perpendicular propagation the solution is
and the result is consistent with equation (17) of Yajima (Reference Yajima1966), after neglecting the electron inertia in that paper (by $\unicode[STIX]{x1D714}_{0}\rightarrow \infty$ ). The strictly perpendicular propagation is an excellent example to point out the deficiencies of the Hall-CGL-FLR1 fluid model, or actually the CGL-FLR1 model, since the Hall contributions are zero for $k_{\Vert }=0$ . The result (5.93) is also equal to equation (34) of Del Sarto et al. (Reference Del Sarto, Pegoraro and Tenerani2017), who discuss in detail that, compared to solutions of a truncated kinetic Vlasov system, and also of a fluid model with more precise FLR corrections, the solution (5.93) actually has the opposite sign in front of the FLR term ${\sim}k_{\bot }^{2}$ . For further refinement with FLR2, see (5.117) and the subsequent discussion.
5.4 CGL-FLR1 fluid model (no Hall term)
If the Hall term is neglected, the general dispersion relation of the CGL-FLR1 fluid model reads
where the CGL parameters $A_{2},A_{0}$ are (5.87) and the FLR1 polynomial on the right-hand side has the form
where $v_{A\Vert }^{2}=1+(\unicode[STIX]{x1D6FD}_{\Vert }/2)(a_{p}-1)$ . For strictly parallel propagation, the solution for the whistler and ion-cyclotron waves reads
For strictly perpendicular propagation the solution is equivalent to (5.93) since the Hall contributions vanish.
5.5 Second-order FLR corrections (FLR2)
If more precise FLR corrections are desired, it is relatively easy to increase the precision to the second order in frequency $\unicode[STIX]{x1D714}/\unicode[STIX]{x1D6FA}$ while keeping the precision in wavenumber $kr_{L}$ at first order. The pressure tensor equation (5.1) is simplified to
For clarity, let us first consider the case with the heat flux $\boldsymbol{q}=0$ . Compared to the first-order FLR corrections in the previous section, we just have one more matrix present, $(\unicode[STIX]{x2202}/\unicode[STIX]{x2202}t)\unicode[STIX]{x1D72B}$ . As before, the FLR tensor must be symmetric and also satisfy conditions $\unicode[STIX]{x1D6F1}_{zz}=0$ and $\unicode[STIX]{x1D6F1}_{yy}=-\unicode[STIX]{x1D6F1}_{xx}$ . Following the previous derivation, the system (5.66) now includes also the time derivative of the FLR tensor on the right-hand side. Let us write down the system directly in the linear approximation. The system reads
There are only four FLR components that have to be considered and which can be rewritten separately as
The FLR components could be considered as independent fluid quantities described by the time-dependent equations as written above. However, from a linear perspective this introduces additional linear modes, and from a direct numerical simulation perspective it is preferred not to introduce an additional four time-dependent equations (nevertheless, numerical simulations with the full pressure tensor equation are sometimes performed, see e.g. Wang et al. (Reference Wang, Hakim, Bhattacharjee and Germaschewski2015), Del Sarto, Pegoraro & Califano (Reference Del Sarto, Pegoraro and Califano2016) and Ng et al. (Reference Ng, Hakim, Bhattacharjee, Stanier and Daughton2017)). The above equations are therefore typically simplified by expanding the FLR tensor to the first and second orders as $\unicode[STIX]{x1D72B}=\unicode[STIX]{x1D72B}^{(1)}+\unicode[STIX]{x1D72B}^{(2)}$ and the time derivative of the second-order tensor is neglected, i.e. $(\unicode[STIX]{x2202}/\unicode[STIX]{x2202}t)\unicode[STIX]{x1D72B}^{(2)}=0$ and the system reads
where we also used the linearized induction equation in the last two equations. Now, there are two possibilities to handle the Hall-term contributions coming from the induction equation. It is possible to either define $\unicode[STIX]{x1D72B}^{(1)}$ to be equal to set (5.74) and move the Hall term to $\unicode[STIX]{x1D72B}^{(2)}$ , or it is possible to keep the Hall term in $\unicode[STIX]{x1D72B}^{(1)}$ that is equal to linearized set (5.72). The first choice, i.e. if $\unicode[STIX]{x1D72B}^{(1)}$ is set, (5.74) yields the second-order components
The second choice, i.e. when $\unicode[STIX]{x1D72B}^{(1)}$ is equal to linearized set (5.72), yields
The difference between the two approaches therefore is that the first approach neglects the time derivative of the Hall term. It is noted that the second approach is not necessarily better or more accurate than the first approach. For example, for parallel propagation, the frequency of the whistler mode seems to be better described by the first approach and the solutions are closer to kinetic theory.
5.6 Hall-CGL-FLR2 fluid model
The second-order FLR tensor (5.109) is normalized and written in Fourier space according to
where for cold (and massless) electrons in the $x$ – $z$ plane
We purposely wrote the equations with $E_{H}$ so that the generalization to a more elaborate electric field will be straightforward. Note that the Hall contributions (5.112) in the FLR tensor completely vanish for isotropic temperatures $a_{p}=1$ , regardless of the form of $E_{H}$ , i.e. regardless if the electrons are cold or not.
To write down the solution for parallel propagation $(k_{\bot }=0)$ it is useful for brevity to introduce the following quantity
and the parallel propagating whistler and ion-cyclotron modes factorize as
and the two solutions are
with two further solutions corresponding to $-\unicode[STIX]{x1D714}$ on the left-hand side. It is perhaps useful to rewrite expression (5.114) to the ‘ion-cyclotron’ resonance form
that clearly shows that the ion-cyclotron resonances (in the parallel direction) become more and more complicated. For strictly perpendicular propagation $(k_{\Vert }=0)$ , where notably the Hall contributions (5.112) vanish, the solution of the model is
It is useful to compare this result with the solution obtained for FLR1 corrections, equation (5.93). For direct comparison, expanding the solution (5.117) for small $|k_{\bot }|\ll 1$ yields
Importantly, in comparison to the FLR1 solution (5.93), the FLR2 solution (5.118) has the opposite sign in front of the FLR term ${\sim}k_{\bot }^{2}$ . The equation (5.118) is equivalent to (32) of Del Sarto et al. (Reference Del Sarto, Pegoraro and Tenerani2017), obtained by solving the pressure tensor equation without the expansion to $\unicode[STIX]{x1D72B}^{(1)}$ and $\unicode[STIX]{x1D72B}^{(2)}$ , and by expanding for small wavenumber afterwards. The difference between the FLR1 solution (5.93) and the FLR2 solution (5.118) is perhaps even more clear, when written in physical units for $\unicode[STIX]{x1D714}$ and $k$ , and by using the gyroradius $\unicode[STIX]{x1D70C}_{i}=v_{\text{th}\bot }/\unicode[STIX]{x1D6FA}_{p}$ , in the following form
Equations (5.119), (5.120), (5.121) are equations (34), (32), (56) of Del Sarto et al. (Reference Del Sarto, Pegoraro and Tenerani2017). As discussed in that paper, the FLR1 model not only fails to capture the correction to the Alfvén velocity, it also introduces a correction to the thermal velocity with the incorrect sign. The kinetic result can be obtained for example from (2.10) of Mikhailovskii & Smolyakov (Reference Mikhailovskii and Smolyakov1985), when written for cold electrons $(\unicode[STIX]{x1D6FD}_{e}=0)$ , and rewritten to our notation (thermal speed in that paper does not have a factor of 2, so $(\unicode[STIX]{x1D70C}_{i}^{2})^{(MS)}=\unicode[STIX]{x1D70C}_{i}^{2}/2$ ).
Coming back to the generally oblique propagation, the second approach to writing the FLR2 tensor, i.e. when the time derivative of the Hall term is not neglected and kept in the first-order tensor $\unicode[STIX]{x1D72B}^{(1)}$ with components
yields the second-order tensor
The resulting dispersion relation for strictly parallel propagation is slightly different
with explicit solutions
For isotropic temperatures $(a_{p}=1)$ the result is equivalent to (5.115) since the Hall-term contributions for the FLR tensor disappear for all propagation directions. Also, for perpendicular propagation, the dispersion relation is naturally equal to (5.117).
5.7 FLR corrections with gyrotropic heat flux
The equation (5.99) contains the divergence of the gyrotropic heat flux, $\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{q}^{\text{g}}$ , which in the index notation calculates as
Evaluation along magnetic field $\hat{\boldsymbol{b}}=\hat{\boldsymbol{b}}_{0}=(0,0,1)$ yields $\hat{b}_{k}\unicode[STIX]{x2202}_{k}\rightarrow \unicode[STIX]{x2202}_{z}$ and the matrix is evaluated as
where in the last ‘ $zz$ ’ component we also used $\unicode[STIX]{x2202}_{z}\hat{b}_{z}=0$ , but for brevity we left the divergence $\unicode[STIX]{x1D735}\boldsymbol{\cdot }\hat{\boldsymbol{b}}=\unicode[STIX]{x2202}_{x}\hat{b}_{x}+\unicode[STIX]{x2202}_{y}\hat{b}_{y}$ intact in all expressions. Note that $(\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{q}^{\text{g}})_{yy}\neq -(\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{q}^{\text{g}})_{xx}$ , which is not a problem as will be described below. The pressure tensor equation is here approximated as
and evaluated with respect to $\hat{\boldsymbol{b}}=\hat{\boldsymbol{b}}_{0}=(0,0,1)$ . All components were already evaluated and the calculation is straightforward, however, we need to subtract from diagonal components the scalar pressure equations that contain the gyrotropic heat flux contributions and that evaluated along $\hat{\boldsymbol{b}}_{0}$ read
The requirement again is that $\unicode[STIX]{x1D6F1}_{zz}=0$ and $\unicode[STIX]{x1D6F1}_{yy}=-\unicode[STIX]{x1D6F1}_{xx}$ . Direct calculation yields
Note that after subtraction of the perpendicular pressure equation, the $^{\prime }xx^{\prime }$ and $^{\prime }yy^{\prime }$ components are again anti-symmetric, i.e. $(\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{q}^{\text{g}})_{yy}-\unicode[STIX]{x2202}_{z}q_{\bot }-2q_{\bot }\unicode[STIX]{x1D735}\boldsymbol{\cdot }\hat{\boldsymbol{b}}=-q_{\bot }(\unicode[STIX]{x2202}_{x}\hat{b}_{x}-\unicode[STIX]{x2202}_{y}\hat{b}_{y})$ , and the system indeed satisfies that $\unicode[STIX]{x1D6F1}_{yy}=-\unicode[STIX]{x1D6F1}_{xx}$ . Also, considering the $^{\prime }zz^{\prime }$ component, after subtraction of the parallel pressure equation all the terms cancel, i.e. $(\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{q}^{\text{g}})_{zz}-\unicode[STIX]{x2202}_{z}q_{\Vert }-(q_{\Vert }-2q_{\bot })\unicode[STIX]{x1D735}\boldsymbol{\cdot }\hat{\boldsymbol{b}}=0$ and the system indeed satisfies $\unicode[STIX]{x1D6F1}_{zz}=0$ .
Finally, expressing the above system in the linear approximation, with the assumption that mean heat flux values are zero, i.e. $q_{\Vert }^{(0)}=0$ , $q_{\bot }^{(0)}=0$ , the only gyrotropic heat flux contributions that remain are $\unicode[STIX]{x2202}_{x}q_{\bot }$ and $\unicode[STIX]{x2202}_{y}q_{\bot }$ , in equations (5.138), (5.139). The FLR tensor is again separated to $\unicode[STIX]{x1D72B}^{(1)}+\unicode[STIX]{x1D72B}^{(2)}$ and similarly to the Hall-term contributions, it is again a matter of choice if the heat flux contributions are pushed to $\unicode[STIX]{x1D72B}^{(2)}$ or kept in $\unicode[STIX]{x1D72B}^{(1)}$ . We prefer the first choice, i.e. when $\unicode[STIX]{x1D72B}^{(1)}$ is equivalent to (5.74), which yields the second-order tensor
Note that we could have derived the contributions $\unicode[STIX]{x2202}_{x}q_{\bot }$ and $\unicode[STIX]{x2202}_{y}q_{\bot }$ in a much quicker way, if the expression (5.126) was linearized from the beginning. However, we wanted to demonstrate that if nonlinear FLR corrections are considered, the heat flux nonlinearities can significantly complicate the dynamics, even if simplified evaluation along $\hat{\boldsymbol{b}}_{0}$ is performed. Right now, we cannot use these FLR corrections to obtain a dispersion relation, since we have no means to determine a closure for the gyrotropic heat flux $q_{\bot }$ . We would have to consider CGL2 fluid model or more complicated Landau fluid models, that contains evolution equations for $q_{\Vert }$ and $q_{\bot }$ . Additionally, the perpendicular propagating fast mode, as well as the parallel propagating ion-cyclotron and whistler modes, are not influenced by the gyrotropic heat flux. Instead, as the last step, we will consider contributions of the non-gyrotropic heat flux.
5.8 FLR corrections with non-gyrotropic heat flux vectors (FLR3)
It is possible to further increase precision of the FLR corrections by considering non-gyrotropic heat flux contributions. Detailed calculations with the non-gyrotropic heat flux vectors $\boldsymbol{S}_{\bot }^{\Vert }$ and $\boldsymbol{S}_{\bot }^{\bot }$ are presented in appendix D. At the linear level, these non-gyrotropic heat flux vectors contribute to the linearized pressure equations
and also to the linearized equations for the FLR tensor
These equations are equivalent to (18)–(21) of Goswami et al. (Reference Goswami, Passot and Sulem2005) (by noting that the vector $\boldsymbol{S}^{\bot }=q_{\bot }\hat{\boldsymbol{b}}+\boldsymbol{S}_{\bot }^{\bot }$ , so at the linear level $S_{z}^{\bot }=q_{\bot }$ ). The non-gyrotropic heat flux vectors are separated to the first and second order
By using the nonlinear results for the first-order vectors derived in appendix D, (D 37), (D 60), yields that, at the linear level,
where the gradients are meant to be further linearized. Nevertheless, the above form is useful to point out that the non-gyrotropic heat fluxes are proportional to the gradients of the temperature. For clarity, partially linearized expressions are also written down by components in appendix D, see (D 24), (D 25) and (D 61), (D 62). It is important to emphasize that the expressions (5.148), (5.149) were derived for a bi-Maxwellian distribution function. As discussed later in the text, this is achieved by prescribing closures for the gyrotropic fourth-order moment in the form $r_{\Vert \Vert }=3p_{\Vert }^{2}/\unicode[STIX]{x1D70C}+\widetilde{r}_{\Vert \Vert }$ , $r_{\Vert \bot }=p_{\Vert }p_{\bot }/\unicode[STIX]{x1D70C}+\widetilde{r}_{\Vert \bot }$ , and $r_{\bot \bot }=2p_{\bot }^{2}/\unicode[STIX]{x1D70C}+\widetilde{r}_{\bot \bot }$ . In (5.148), (5.149), we additionally neglected the perturbations $\widetilde{r}$ (since right now we do not want to consider models where these perturbations are evaluated from linear kinetic theory). Therefore, the FLR corrections with the non-gyrotropic heat flux, that we call here FLR3, are distribution function specific. Nevertheless, as discussed later in the text, very similar closures can be also obtained for the bi-kappa distribution function, just with additional coefficients $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}=(\unicode[STIX]{x1D705}-3/2)/(\unicode[STIX]{x1D705}-5/2)$ , and expressions similar to (5.148), (5.149) can be derived.
The second-order vectors are also derived in appendix D, this time directly in the linear approximation, equations (D 88), (D 89) and the vectors read
As a reminder, $\unicode[STIX]{x1D6F1}_{ij}=\unicode[STIX]{x1D6F1}_{ji}$ . For clarity, we wrote the expressions above explicitly in the index notation. The non-gyrotropic heat flux vectors are perpendicular to $\hat{\boldsymbol{b}}_{0}=(0,0,1)$ , and the non-zero components are for index $k=x,y$ . The expressions can be written in a more elegant vector form, by defining vector $\vec{\unicode[STIX]{x1D72B}}_{z}\equiv (\unicode[STIX]{x1D6F1}_{xz},\unicode[STIX]{x1D6F1}_{yz},\unicode[STIX]{x1D6F1}_{zz}=0)$ , vector $\unicode[STIX]{x1D735}_{\bot }=(\unicode[STIX]{x2202}_{x},\unicode[STIX]{x2202}_{y},0)$ and matrix
so that $\unicode[STIX]{x1D735}_{\bot }\boldsymbol{\cdot }\unicode[STIX]{x1D72B}_{\bot }=(\unicode[STIX]{x2202}_{x}\unicode[STIX]{x1D6F1}_{xx}+\unicode[STIX]{x2202}_{y}\unicode[STIX]{x1D6F1}_{yx},\unicode[STIX]{x2202}_{x}\unicode[STIX]{x1D6F1}_{xy}+\unicode[STIX]{x2202}_{y}\unicode[STIX]{x1D6F1}_{yy})$ , yielding
which are equations (53) and (54) of Passot et al. (Reference Passot, Sulem and Hunana2012).
Normalizing the equations (dropping the tilde), Fourier transforming, and writing them in the $x$ – $z$ plane (with $\unicode[STIX]{x2202}_{y}=0$ ) yields
where the terms with Hall electric field components are specified in (5.112). At this moment we do not consider higher-order fluid models with gyrotropic heat fluxes such as CGL2 or Landau fluids (see later in the text) and to have a closed model, here we prescribe $q_{\bot }=0$ , $q_{\Vert }=0$ . It is useful to briefly explore what fluid models are obtained, if we decide to keep only the first-order vectors $\boldsymbol{S}_{\bot }^{\Vert (1)}$ , $\boldsymbol{S}_{\bot }^{\bot (1)}$ , and ignore the second-order corrections $\boldsymbol{S}_{\bot }^{\Vert (2)}$ , $\boldsymbol{S}_{\bot }^{\bot (2)}$ . By considering only the $\boldsymbol{S}_{\bot }^{\Vert (1)}$ , $\boldsymbol{S}_{\bot }^{\bot (1)}$ contributions, we can either put them into to $\unicode[STIX]{x1D72B}^{(1)}$ or $\unicode[STIX]{x1D72B}^{(2)}$ . However, it can be shown that both options yield dispersion relation for the perpendicular fast mode (long-wavelength, in physical units)
Rather amusingly, in comparison to the FLR2 result (5.120), the wrong sign in the correction of the thermal speed is back!
5.9 Hall-CGL-FLR3 fluid model
Obviously, if matching with kinetic theory for the fast perpendicular mode is of upmost importance (for small wavenumbers), we have no other choice and we have to keep the second-order non-gyrotropic heat flux contributions $\boldsymbol{S}_{\bot }^{\Vert (2)}$ , $\boldsymbol{S}_{\bot }^{\bot (2)}$ . We call this FLR pressure tensor as FLR3, and as stated previously, the FLR3 pressure tensor is distribution function specific. This model is very important to us and especially for the following discussion of the firehose instability. In order to be completely clear on which equations we are solving and also so that our results can be easily reproduced, let us state the entire model in all of its beauty. The linearized, normalized, Fourier transformed equations written in the $x$ – $z$ plane read
where each component of the (FLR) pressure tensor $\unicode[STIX]{x1D72B}$ , and each component of the (FLR) heat flux vectors $\boldsymbol{S}_{\bot }^{\Vert }$ , $\boldsymbol{S}_{\bot }^{\bot }$ , is separated to
The components of the $\unicode[STIX]{x1D72B}$ tensor are given by
where the Hall electric field contributions for cold massless electrons read
and the components of the non-gyrotropic (FLR) heat fluxes are given by
Importantly, the $\unicode[STIX]{x1D72B}^{(2)}$ expressions contain both the first- and second-order contributions from the heat flux vectors $\boldsymbol{S}_{\bot }^{\Vert }$ , $\boldsymbol{S}_{\bot }^{\bot }$ . To explicitly see where the closure for the gyrotropic heat fluxes $q_{\Vert }=0,q_{\bot }=0$ was performed, we scratched the terms involving these quantities. Additionally, we also scratched contributions from $(S_{\bot }^{\bot })_{x}^{(1)}$ , which are here zero at the linear level.
Let us check the solution of the Hall-CGL-FLR3 fluid model for the perpendicular propagation ( $k_{\Vert }=0$ ), which in this case reads
As a ‘sanity check’, since the solution stays always positive for all the wavenumbers and we do not have any instability, the model appears to be good. For small wavenumbers, the expansion yields
and in physical units
Therefore, the Hall-CGL-FLR3 fluid model finally matches the analytic result from kinetic theory! We note that if the $\unicode[STIX]{x1D72B}$ contributions are neglected in the second-order heat flux expressions (5.164), instead of the correct $-5/16$ FLR correction to the thermal speed, one obtains $-7/16$ . The necessity to keep the second-order heat flux contributions to recover the kinetic result for the perpendicular fast mode was discovered by Mikhailovskii & Smolyakov (Reference Mikhailovskii and Smolyakov1985).
Checking the solution for the parallel propagating ( $k_{\bot }=0$ ) ion-cyclotron and whistler modes yields the following dispersion relation
where the quantities $v_{b}=\unicode[STIX]{x1D6FD}_{\Vert }(1-a_{p}/2)$ and $v_{A\Vert }^{2}=1+(\unicode[STIX]{x1D6FD}_{\Vert }/2)(a_{p}-1)$ . We will use this dispersion relation to study the firehose instability. For a general oblique propagation direction, the dispersion relation is obviously way too large to write down, and we recommend using analytic software such as Maple or Mathematica.
Moving the Hall term to $\unicode[STIX]{x1D72B}^{(1)}$
For completeness, just in case we want to investigate later the influence of the Hall term, moving it to $\unicode[STIX]{x1D72B}^{(1)}$ yields the following dispersion relation for the parallel propagation
6 Parallel and oblique firehose instability
6.1 Parallel propagation
For clarity, it is useful to summarize all 3 major models that describe the parallel propagating ion-cyclotron and whistler modes. By prescribing $k_{\bot }=0$ in the equations of the Hall-CGL-FLR3 fluid model, the model greatly simplifies. The parallel magnetic field $B_{z}=0$ , the ion-acoustic mode decouples, and both the first- and second-order contributions to $\unicode[STIX]{x1D6F1}_{xx}=0$ , $\unicode[STIX]{x1D6F1}_{xy}=0$ , $\boldsymbol{S}_{\bot }^{\bot }=0$ . The normalized, Fourier transformed equations written in the $x$ – $z$ plane read
where the components of the non-gyrotropic (FLR) pressure tensor $\unicode[STIX]{x1D72B}$ are given by
and the components of the non-gyrotropic (FLR) heat flux $\boldsymbol{S}_{\bot }^{\Vert }$ read
When the entire $\unicode[STIX]{x1D72B}$ is neglected, yields the Hall-CGL solution
Neglecting the $\unicode[STIX]{x1D72B}^{(2)}$ contributions yields the Hall-CGL-FLR1 solution
and neglecting the heat flux $\boldsymbol{S}_{\bot }^{\Vert }$ yields the Hall-CGL-FLR2 solution
Finally, the full model yields Hall-CGL-FLR3 solution
The quantities $v_{b}$ and $v_{A\Vert }^{2}$ used in the solutions are defined as
Importantly, the solutions are written here for $\unicode[STIX]{x1D714}$ (and not $\unicode[STIX]{x1D714}^{2}$ ), and all the models of course yield 4 solutions, the other two solutions are obtained by substituting $\unicode[STIX]{x1D714}$ with $-\unicode[STIX]{x1D714}$ . For a reader who just jumped to this section, and is confused on how the solutions were split, see § 4.2 where the Hall-CGL model is discussed with final equation (4.36). The solutions can be written in various forms, and one possibility is to keep $|k_{\Vert }|$ in front of the square roots.
The (parallel and oblique) firehose instability was studied in detail by Hunana & Zank (Reference Hunana and Zank2017), who focused on the Hall-CGL and Hall-CGL-FLR1 models. One of the conclusions reached in that paper was that the main reason for the relatively large discrepancy between the usual fluid models and the kinetic description is the appearance of a huge ‘bump’ in the imaginary phase speed, when close to the firehose threshold. The situation is demonstrated in figure 7, where kinetic solutions obtained by the WHAMP code (blue solid lines) are compared to solutions of the Hall-CGL-FLR2 model (a, blue dashed lines) and the Hall-CGL-FLR3 fluid model (b, black dashed lines). The plasma $\unicode[STIX]{x1D6FD}_{\Vert }=4$ , so the long-wavelength firehose threshold, that we call the ‘hard threshold’, is at $a_{p}=0.5$ . The temperature anisotropy is varied from $a_{p}=0$ to $a_{p}=0.501$ . For solutions with the Hall-CGL and Hall-CGL-FLR1 models see figures 3 and 4 of Hunana & Zank (Reference Hunana and Zank2017). In the WHAMP code the value of $a_{p}=0$ cannot be used, and $a_{p}=10^{-4}$ was chosen instead. Also, since here we concentrate on proton dynamics, the influence of electrons in the WHAMP code was eliminated by making the electrons cold (with $T_{e}/T_{p}=10^{-8}$ ).
Very surprisingly, figure 7 shows that the large ‘bump’ is reproduced by the FLR3 model, and the precision is quite good! The situation is further analysed in figure 8, where instead of the imaginary phase speed, the growth rate is plotted. The same conclusion is obtained. For the temperature anisotropy $a_{p}=0.501$ , all simpler fluid models (Hall-CGL, FLR1, FLR2) are fully stable for all ranges of wavenumbers. In contrast, the FLR3 model still develops a strong firehose instability. Therefore, it is the non-gyrotropic heat flux $\boldsymbol{S}_{\bot }^{\Vert }$ of the FLR3 model that is responsible for the ‘bump’ (for the case of strictly parallel propagation). The appearance of the ‘bump’ can be understood analytically, by evaluating the fluid dispersion relations exactly at the ‘hard’ firehose threshold $a_{p}=1-2/\unicode[STIX]{x1D6FD}_{\Vert }$ , where the quantities $v_{A\Vert }^{2}=0$ and $v_{b}=1+\unicode[STIX]{x1D6FD}_{\Vert }/2$ . At the hard firehose threshold, the expression under the square root of the FLR2 solution (6.6) can be factorized as
implying that the FLR2 model is always stable for all values of $k_{\Vert }$ and $\unicode[STIX]{x1D6FD}_{\Vert }$ . Such a factorization cannot be achieved for the FLR3 model, and the solution (6.7) still becomes unstable at some range of wavenumbers, where
which is the (strictly parallel) firehose instability criterion of the FLR3 model. We note that the model can become unstable also for the temperature anisotropy $a_{p}>1$ . This can be perhaps considered as some remnant of the ion-cyclotron anisotropy instability, but we did not study the situation further since the instability should not be reproduced correctly. The parallel firehose instability for high plasma beta value $\unicode[STIX]{x1D6FD}_{\Vert }=100$ is shown in figure 9. The left figure is from Hunana & Zank (Reference Hunana and Zank2017), and it shows solutions of the Hall-CGL model (blue dashed lines) and of the Hall-CGL-FLR1 model (green dashed lines). The kinetic solutions are blue solid lines. It is shown that the FLR corrections are crucial for the correct stabilization of the firehose instability. Figure 9(b) shows refinement with FLR2 (blue dashed lines) and FLR3 tensors (black dashed lines). Obviously, for high plasma beta values, the maximum growth rate of the (strictly) parallel firehose instability is captured very precisely by the Hall-CGL-FLR3 model. The maximum growth rate (for parallel propagation) can be easily found analytically only for the Hall-CGL and Hall-CGL-FLR1 models. For models with the FLR2 and FLR3 tensors it is easier to find the maximum growth rate numerically. For example, let us consider the Hall-CGL model. Assuming the firehose-unstable regime, the frequency of the Hall-CGL solution (6.4) can be split into $\unicode[STIX]{x1D714}=\unicode[STIX]{x1D714}_{r}+\text{i}\unicode[STIX]{x1D714}_{i}$ , where the imaginary part $\unicode[STIX]{x1D714}_{i}=k_{\Vert }\sqrt{-v_{A\Vert }^{2}-k_{\Vert }^{2}/4}$ . Then, calculating $\unicode[STIX]{x2202}\unicode[STIX]{x1D714}_{i}/\unicode[STIX]{x2202}k_{\Vert }=0$ , yields the wavenumber $k_{\Vert \text{max}}$ and the maximum growth rate $\unicode[STIX]{x1D6FE}_{\text{max}}$ in the following form
Similarly, the Hall-CGL-FLR1 model yields
6.2 Oblique propagation
Here, we briefly investigate the parallel and oblique firehose instability for oblique propagation directions. Contour plots of the growth rate in the $k-\unicode[STIX]{x1D703}$ plane are shown in figure 10. The plasma $\unicode[STIX]{x1D6FD}_{\Vert }=4$ and the temperature anisotropy $a_{p}=0.49$ , so all the fluid models are in the firehose-unstable regime. We compare four different fluid models, Hall-CGL (a), and enhancement with FLR1 (b), FLR2 (c) and FLR3 (d). The FLR2 solution shows an additional instability at higher wavenumbers that we did not study further. Note the large differences between the four solutions. Importantly, the FLR3 solution shows large enhancement of the growth rate. We use the WHAMP code for kinetic calculations, and we do not provide a contour plot for the kinetic theory. Instead, we plot the growth rate for several propagation angles, so that the comparison with the contour plots can be made easily. The figure (e) is the FLR3 model, and (f) is the kinetic result. It is noted that, similarly to other fluid models with higher-order FLR corrections, the FLR3 model can develop secondary instabilities at scales below the proton gyroscale.
6.3 Hellinger’s contours for the Hall-CGL-FLR3 model
Here, we prescribe a fixed value for the maximum growth rate, $\unicode[STIX]{x1D6FE}_{\text{max}}=10^{-3};10^{-2};10^{-1}$ , and plot solutions in the $\unicode[STIX]{x1D6FD}_{\Vert }-a_{p}$ plane, which is shown in figure 11. (a,c) Show the parallel firehose instability, and (b,d) show the oblique firehose instability. Panels (a,b) are plotted with the usual logarithmic scales, and (c,d) with linear scales. Additionally, only solutions with $\unicode[STIX]{x1D6FD}_{\Vert }<6$ are shown at the bottom panels. Solutions for the Hall-CGL-FLR3 fluid model are black dashed lines, and kinetic solutions are solid blue lines (a,c) and solid red lines (b,d). The kinetic solutions were provided to us by P. Hellinger (private communication) and are identical to solutions shown in figure 1 of Hellinger et al. (Reference Hellinger, Trávníček, Kasper and Lazarus2006). It is shown that the $\unicode[STIX]{x1D6FE}_{\text{max}}=10^{-3};10^{-2}$ contours clearly lie below the hard firehose threshold, i.e. the firehose instability in the FLR3 fluid model develops at some range of wavenumbers, even if the model is stable in the long-wavelength limit. Importantly, the kinetic contours of Hellinger et al. (Reference Hellinger, Trávníček, Kasper and Lazarus2006) were not calculated for cold electrons, but for isotropic electrons with $\unicode[STIX]{x1D6FD}_{e}=1$ . This does not matter for the parallel propagation, since the isotropic electron pressure does not influence the dispersion relations for the parallel propagating whistler and ion-cyclotron modes. Only the effect of electron inertia will enter, however, the effect should be negligible at the scales considered here. Unfortunately, the solution for the parallel firehose instability cannot be further improved by currently developed fluid models. The oblique case is different, since even isotropic electron pressure will influence the dynamics. The contours for the oblique firehose instability should be therefore recalculated by using a proper proton–electron two fluid model. The oblique case could be further improved, by considering FLR contributions from the non-gyrotropic heat flux tensor $\unicode[STIX]{x1D748}$ that were neglected here (contributions for parallel propagation are zero). For the oblique case, one can also use higher-order fluid models with the gyrotropic heat flux fluctuations $q_{\Vert },q_{\bot }$ (such as the CGL2 model discussed later), or even Landau fluid models. The main reason for the relatively large discrepancies found in the contours for the parallel firehose instability is demonstrated in figure 12.
7 Heat flux tensor equation
Multiply the Vlasov equation by $mc_{i}c_{j}c_{k}$ and integrate over the velocity space. Naturally, the other possibilities are to multiply by $mc_{i}c_{j}v_{k}$ , $mc_{i}v_{j}v_{k}$ or $mv_{i}v_{j}v_{k}$ , although none of them is more revealing and we will use the first choice. It is convenient to define the symmetric operator $^{S}$ that acts on a tensor of third rank according to
i.e. the operator represents all possible cyclic permutations. For the first term we will need the following identities
and the entire first term of the integrated Vlasov equation calculates as
For the second term we will need identities
and the second term calculates as
Note again that the divergence of a tensor operates through its first component $\unicode[STIX]{x2202}_{l}(r_{lijk}+u_{l}q_{ijk})=[\unicode[STIX]{x1D735}\boldsymbol{\cdot }(\boldsymbol{r}+\boldsymbol{u}\boldsymbol{q})]_{ijk}$ . Similarly, if the operator acts on a tensor from the right-hand side, the most natural way is to define that it operates through the last component, i.e. $[\boldsymbol{q}\boldsymbol{\cdot }\unicode[STIX]{x1D735}]_{ij}=q_{ijl}\unicode[STIX]{x2202}_{l}$ . In the expressions above $[\boldsymbol{q}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\boldsymbol{u}]_{ijk}=q_{ijl}\unicode[STIX]{x2202}_{l}u_{k}$ and $[\boldsymbol{p}\boldsymbol{u}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\boldsymbol{u}]_{ijk}=(\boldsymbol{p}\boldsymbol{u})_{ijl}\unicode[STIX]{x2202}_{l}u_{k}=p_{ij}u_{l}\unicode[STIX]{x2202}_{l}u_{k}$ . The last term can also be rewritten as $p_{ij}(\boldsymbol{u}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\boldsymbol{u})_{k}$ , where the expression $(\boldsymbol{u}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\boldsymbol{u})_{k}=u_{l}\unicode[STIX]{x2202}_{l}u_{k}$ is familiar from MHD. For the third term we will need identities
and the entire third term calculates as
For the fourth term we will need identities
and the entire fourth term calculates as
Combining all the results together $\unicode[STIX]{x2460}+\unicode[STIX]{x2461}+\unicode[STIX]{x2462}+\unicode[STIX]{x2463}=0$ , the entire heat flux tensor equation obtained by direct integration of the Vlasov equation reads
Now we will need to use 3 different momentum equations that will cancel various terms. We will also need identity
We need to multiply the momentum equation for $\unicode[STIX]{x2202}u_{i}/\unicode[STIX]{x2202}t$ by $p_{jk}$ , the equation for $\unicode[STIX]{x2202}u_{j}/\unicode[STIX]{x2202}t$ by $p_{ki}$ and the equation for $\unicode[STIX]{x2202}u_{k}/\unicode[STIX]{x2202}t$ by $p_{ij}$ . All the three momentum equations that we want to subtract from the heat flux equation can be written together as
and because of identity (7.19), this is equivalent to
Note that, because of the symmetric operator, it does not matter if the tensor $\boldsymbol{p}$ is applied to the momentum equation from the left or right, since all the expressions are symmetric in this regard, i.e. for example $[(\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{p})\boldsymbol{p}]^{S}=[\boldsymbol{p}(\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{p})]^{S}$ . By subtracting (7.21) from (7.18), the final heat flux tensor equation reads
By defining the cyclotron frequency vector $\unicode[STIX]{x1D734}=q\boldsymbol{B}/mc$ , this equation identifies with equation (A 5) in Chust & Belmont (Reference Chust and Belmont2006). By using the scalar cyclotron frequency defined with respect to $|\boldsymbol{B}|$ as $\unicode[STIX]{x1D6FA}=q|\boldsymbol{B}|/mc$ , the heat flux tensor equation is written explicitly in the index notation
which is equivalent to (4) in Goswami et al. (Reference Goswami, Passot and Sulem2005) with the term
7.1 Heat flux tensor decomposition
Considering only the gyrotropic part of the heat flux tensor $(3\times 3\times 3)$ cube, the heat flux is decomposed into the scalar parallel and perpendicular heat flux components $q_{\Vert },q_{\bot }$ according to
and which in the index notation reads
This part of the pressure tensor represents only the gyrotropic part, and the full heat flux decomposition can be written as $\boldsymbol{q}=\boldsymbol{q}^{\text{g}}+\boldsymbol{q}^{\text{ng}}$ . In the heat flux tensor equation, the term $\boldsymbol{B}\times \boldsymbol{q}$ is proportional to the cyclotron frequency $\unicode[STIX]{x1D6FA}=qB_{0}/mc$ and the equation rewrites as
The situation is now similar to the previously studied pressure tensor. At long spatial scales (low frequencies $\unicode[STIX]{x1D714}$ ), this term will dominate and the gyrotropic contribution must be equal to zero
At first look, it is not that obvious that the decomposition (7.25) satisfies this equation. It is however possible to verify that indeed
Putting all terms together
and we see that all three parts of the symmetric operator are required to make this term equal to zero. Therefore, the exact heat flux tensor equation reads
Since this should be an introductory text, we do not want to be bothered right now with the complicated algebra of the non-gyrotropic heat flux $\boldsymbol{q}^{\text{ng}}$ . For the clarity of the presented material, here we separate the non-gyrotropic heat flux into a separate term $\boldsymbol{Q}^{\text{ng}}$ , and write the heat flux tensor equation in the following form
where
We will address the non-gyrotropic heat flux contributions in appendix D.
In a similar fashion to the pressure decomposition (2.35), we are going to frequently apply double contractions with $\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}$ and $(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})/2$ . Applying these operators to the heat flux tensor $q_{ijk}$ , must yield quantities that are vectors. It is therefore logical to define parallel and perpendicular heat flux vectors
The scalar parallel and perpendicular heat flux components $q_{\Vert },q_{\bot }$ (which are the only parts that are gyrotropic) are obtained by further projecting these heat flux vectors along the magnetic field lines, i.e. by performing $\boldsymbol{\cdot }\hat{\boldsymbol{b}}$ , so that
Briefly considering only the gyrotropic heat flux (7.26) on the right-hand side, it is easy to verify that the parallel decomposition indeed works
For the perpendicular decomposition, it is useful to specifically calculate components
so that
which verifies that the perpendicular decomposition (7.37) is satisfied for $\boldsymbol{q}^{\text{g}}$ . Now we apply the decomposition (7.37) directly to the definition of the entire heat flux tensor (2.6), which yields
and the perpendicular decomposition
The decomposition (7.37) obviously works for the entire heat flux $\boldsymbol{q}$ , as well as for the gyrotropic part $\boldsymbol{q}^{\text{g}}$ . Similarly to the pressure decomposition, this further yields the required properties that the non-gyrotropic heat flux $\boldsymbol{q}^{\text{ng}}$ must satisfy. By using (7.37), the entire heat flux decomposition reads
Obviously, it would be useful to introduce a triple-contraction operator and instead of $(\boldsymbol{q}:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})\boldsymbol{\cdot }\hat{\boldsymbol{b}}$ to write something like $\boldsymbol{q}\,\vdots \,\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}$ , but we do not want to introduce new notations. Also, an alternative and perhaps prettier expression is to move the $\boldsymbol{\cdot }\hat{\boldsymbol{b}}$ to the left-hand side, as $(\boldsymbol{q}:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})\boldsymbol{\cdot }\hat{\boldsymbol{b}}=\hat{\boldsymbol{b}}\boldsymbol{\cdot }\boldsymbol{q}:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}$ , but we will keep the first choice. Applying $:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}$ and $\boldsymbol{\cdot }\hat{\boldsymbol{b}}$ to the above equation yields
implying the first requirement for the non-gyrotropic heat flux
Similarly, the second requirement is obtained by applying $:(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})$ and $\boldsymbol{\cdot }\hat{\boldsymbol{b}}$ , yielding
By using the first requirement, the second requirement simplifies to
where obviously the trace and $\boldsymbol{\cdot }\hat{\boldsymbol{b}}$ operators commute. The two requirements in the index notation read
Instead of decomposing $\boldsymbol{q}=\boldsymbol{q}^{\text{g}}+\boldsymbol{q}^{\text{ng}}$ , an alternative and very useful decomposition of the entire heat flux tensor reads
with the requirement $\unicode[STIX]{x1D748}:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}=0$ and $\unicode[STIX]{x1D748}:(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})=0$ (or equivalently $\unicode[STIX]{x1D748}:\unicode[STIX]{x1D644}=0$ ). The heat flux vectors (7.36) therefore satisfy $\boldsymbol{S}^{\Vert }=\boldsymbol{S}:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}$ and $\boldsymbol{S}^{\bot }=\boldsymbol{S}:(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})/2$ . The heat flux vectors contain both gyrotropic and non-gyrotropic contributions. Since the gyrotropic contributions are obtained by projecting these vectors along the magnetic field lines $q_{\Vert }=\boldsymbol{S}^{\Vert }\boldsymbol{\cdot }\hat{\boldsymbol{b}}$ , $q_{\bot }=\boldsymbol{S}^{\bot }\boldsymbol{\cdot }\hat{\boldsymbol{b}}$ , it is useful to introduce the following decomposition
The vectors $\boldsymbol{S}_{\bot }^{\Vert }$ , $\boldsymbol{S}_{\bot }^{\bot }$ are referred to as the non-gyrotropic heat flux vectors, and their algebra is addressed in appendix D. Here, we only state that the $\boldsymbol{q}^{\text{ng}}$ can be decomposed into vectors $\boldsymbol{S}_{\bot }^{\Vert }$ , $\boldsymbol{S}_{\bot }^{\bot }$ and tensor $\unicode[STIX]{x1D748}$ according to
The entire heat flux tensor $\unicode[STIX]{x1D748}$ is of course non-gyrotropic.
Now we need to verify the heat flux contributions (2.72), (2.73) that we used in the pressure equations (2.74), (2.75). To calculate the heat flux contributions to the pressure equations, we will need
and the contributions are
The left-hand side of the above equation can be also written naturally as $(\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{q}^{\text{g}}):(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})/2$ , which is consistent with an alternative derivation of the perpendicular pressure equation, where instead of doing trace and subtracting the parallel pressure equation, one can directly perform $:(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})/2$ .
7.2 Parallel heat flux equation
Now we apply the decomposition (7.37) to the heat flux tensor equation (7.34) step by step. We consider only gyrotropic heat flux components. Starting with the equation for the parallel heat flux, the first term calculates as
the second term calculates as
Note that in all of the expressions here, the operator $\boldsymbol{\cdot }\hat{\boldsymbol{b}}$ can be naturally moved to the left as $\hat{\boldsymbol{b}}\boldsymbol{\cdot }$ . The third term calculates as
For the final fourth term we will need
and the fourth term calculates as
Combining all the terms yields the equation for the scalar parallel heat flux
where $Q_{\Vert }^{\text{ng}}\equiv (\boldsymbol{Q}^{\text{ng}}:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})\boldsymbol{\cdot }\hat{\boldsymbol{b}}$ . The heat flux equations contain the fourth-order moment $r_{ijkl}$ , which we will consider in the next section and at this stage it is unspecified.
7.3 Perpendicular heat flux equation
We will apply the trace operator to (7.34) and also perform $\boldsymbol{\cdot }\hat{\boldsymbol{b}}$ step by step. In the index notation, we are basically multiplying the entire equation by $\unicode[STIX]{x1D6FF}_{ij}\hat{b}_{k}$ . The first term calculates
The second term $\unicode[STIX]{x1D735}\boldsymbol{\cdot }(\boldsymbol{r}+\boldsymbol{u}\boldsymbol{q}^{\text{g}})$ calculates as
The components $r_{liik}=r_{iikl}$ are nothing else but the trace of the moment $\boldsymbol{r}$
The third term in (7.34) calculates as
where we have used that $q_{iil}^{\text{g}}=(q_{\Vert }+2q_{\bot })\hat{b}_{l}$ and $q_{ikl}^{\text{g}}\hat{b}_{k}=q_{\Vert }\hat{b}_{i}\hat{b}_{l}+q_{\bot }(\unicode[STIX]{x1D6FF}_{il}-\hat{b}_{i}\hat{b}_{l})$ . The fourth term in (7.34) calculates as
and we have to calculate each term separately. Since $p_{ii}=p_{\Vert }+2p_{\bot }$ and by using the already calculated equation (7.70) for $\hat{b}_{k}\unicode[STIX]{x2202}_{l}p_{lk}$ and identities $p_{ik}\hat{b}_{k}=p_{\Vert }\hat{b}_{i}+\unicode[STIX]{x1D6F1}_{ik}\hat{b}_{k}$ and $\unicode[STIX]{x1D6F1}_{ik}\hat{b}_{i}\hat{b}_{k}=0$ one obtains
and the final result of (7.87) is
Collecting all the terms, one obtains
and subtracting the parallel heat flux equation (7.74) and dividing by two yields the perpendicular heat flux equation
where $Q_{\bot }^{\text{ng}}\equiv (\boldsymbol{Q}^{\text{ng}}:(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})/2)\boldsymbol{\cdot }\hat{\boldsymbol{b}}$ . The term containing $\boldsymbol{r}$ in the above equation can be rewritten to many possible forms, for example
The last expression is of course consistent with an alternative way to derive the perpendicular heat flux equation, where instead of doing trace of the heat flux tensor equation and subtracting the scalar parallel heat flux equation, one can directly apply operators $:(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})/2$ and $\boldsymbol{\cdot }\hat{\boldsymbol{b}}$ .
7.4 Scalar heat flux equations continued
The parallel and perpendicular heat flux equations (7.74), (7.92) contain expressions for the fourth-order moment $\boldsymbol{r}$ , and even though we will consider this moment in detail in the next section, here we want to finish the derivation of the scalar heat flux equations, and we write down the required expressions. Similarly to the pressure tensor and the heat flux tensor, the fourth-order moment can be decomposed into its gyrotropic and non-gyrotropic parts, $\boldsymbol{r}=\boldsymbol{r}^{\text{g}}+\boldsymbol{r}^{\text{ng}}$ . The gyrotropic part $\boldsymbol{r}^{\text{g}}$ is decomposed according to (8.10), and it can be shown (see later in the text), that direct calculation yields
which yields the parallel heat flux equation
and the perpendicular heat flux equation
Exact nonlinear expressions for $Q_{\Vert }^{\text{ng}}$ and $Q_{\bot }^{\text{ng}}$ that represent contributions from the non-gyrotropic heat flux $\boldsymbol{q}^{\text{ng}}$ are calculated in appendix D, see (D 110), (D 120). The scalar heat flux equations (7.96) and (7.97) are completely general at this stage, since no distribution function has been prescribed yet, and no simplification has been introduced. These equations are exact.
Nevertheless, equations (7.96), (7.97) are very complicated, and at this stage, it is beneficial to simplify them. One possibility is to cancel all the non-gyrotropic contributions $\unicode[STIX]{x1D72B}$ , $\boldsymbol{q}^{\text{ng}}$ and $\boldsymbol{r}^{\text{ng}}$ , and we will study such fluid models later. Another possibility is to keep only those non-gyrotropic terms that have some non-zero contribution at the linear level. This eliminates the last term in the second line of (7.97) that is proportional to $[\ldots ]\boldsymbol{\cdot }\unicode[STIX]{x1D72B}\boldsymbol{\cdot }\hat{\boldsymbol{b}}$ . Also, by following the derivations in appendix D, it is easy to show that the terms $Q_{\Vert }^{\text{ng}}$ , $Q_{\bot }^{\text{ng}}$ (that represent the non-gyrotropic heat flux $\boldsymbol{q}^{\text{ng}}$ ), do not contribute at the linear level. Therefore, the heat flux equations simplify
Still, no specific distribution function was assumed. However, to evaluate the non-gyrotropic $\boldsymbol{r}^{\text{ng}}$ , and correctly evaluate possible cancellations with terms containing $\unicode[STIX]{x1D72B}$ , we need to use $\boldsymbol{r}^{\text{ng}}$ decomposition (8.58). Importantly, the decomposition (8.58) is valid only for perturbations around a bi-Maxwellian distribution function. By keeping only terms that have non-zero contribution at the linear level, see equations (8.63), (8.65) later in the text, the $\boldsymbol{r}^{\text{ng}}$ terms are evaluated as
Importantly, both non-gyrotropic contributions in the parallel heat flux equation (7.98) completely cancel out! The cancellation demonstrates the importance of keeping the non-gyrotropic $\boldsymbol{r}^{\text{ng}}$ , if one wants to keep the non-gyrotropic $\unicode[STIX]{x1D72B}$ . Also, there is a partial cancellation in the perpendicular heat flux equation (7.99). The heat flux equations therefore read
The last term can be evaluated with respect to the mean magnetic field, and assuming that $\hat{\boldsymbol{b}}_{0}$ is in the $z$ -direction, the term is equal to $(p_{\bot }/\unicode[STIX]{x1D70C})(\unicode[STIX]{x2202}_{x}\unicode[STIX]{x1D6F1}_{xz}+\unicode[STIX]{x2202}_{y}\unicode[STIX]{x1D6F1}_{yz})$ , since $\unicode[STIX]{x1D6F1}_{zz}=0$ . Essentially, the term should be written in a fully linearized form $(p_{\bot }^{(0)}/\unicode[STIX]{x1D70C}_{0})(\unicode[STIX]{x1D735}\boldsymbol{\cdot }\unicode[STIX]{x1D72B})\boldsymbol{\cdot }\hat{\boldsymbol{b}}_{0}$ , since other non-gyrotropic nonlinear terms were neglected. It is important to emphasize that it is because of this one term, that the equations (7.102), (7.103) are valid only for perturbations around a bi-Maxwellian distribution function. If the term is neglected, i.e. if one neglects from the beginning the non-gyrotropic contributions in (7.98), (7.99), the heat flux equations are valid for perturbations around any distribution function. We will consider a bi-kappa distribution function later.
7.5 Heat flux equations with ‘normal’ closure
Detailed calculations with the fourth-order moment will be presented in the next section. Here we want to finish the derivation, and by using the bi-Maxwellian ‘normal’ fluid closure
terms entering the heat flux equations directly calculate as
The use of these expressions in (7.102), (7.103) cancels various terms, and the scalar heat flux equations read
8 Fourth-order fluid moment
The algebra of the fourth-order fluid moment $\boldsymbol{r}=m\int \boldsymbol{c}\boldsymbol{c}\boldsymbol{c}\boldsymbol{c}f\,\text{d}^{3}v$ can be quite intimidating at first, since the moment is a 4-D cube ( $3x3x3x3$ ). We are not going to derive the time evolution equation for this moment step by step, nevertheless, later in the text we derive the evolution equation of the $n$ th-order fluid moment $\unicode[STIX]{x1D653}^{(n)}$ , see (12.16), and therefore, for $n=4$ , the equation reads
The symmetric operator ‘ $S$ ’ is here defined as
Here, we are not that interested in the evolution equation for $\boldsymbol{r}$ , we just want to clarify its decomposition, that we need in the heat flux equations. The definition of gyrotropy means that the integral has to be evaluated only over combinations of $(v_{\Vert }-u_{\bot })\equiv c_{\Vert }$ and $|\boldsymbol{v}_{\bot }-\boldsymbol{u}_{\bot }|^{2}\equiv c_{\bot }^{2}$ . For the fourth moment $\boldsymbol{r}$ , there are obviously only 3 possibilities: $c_{\Vert }^{4}$ , $c_{\Vert }^{2}c_{\bot }^{2}$ and $(c_{\bot }^{2})^{2}$ , and these gyrotropic components will be called $r_{\Vert \Vert },r_{\Vert \bot }$ and $r_{\bot \bot }$ . We have already seen that the double contractions with $\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}$ and $(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})/2$ , were very useful operators to extract the gyrotropic components for the lower-order moments, $\boldsymbol{p}$ and $\boldsymbol{q}$ . To obtain any scalar quantity from the fourth moment, we obviously need to apply two double contractions. There are 3 possibilities: we can apply $\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}$ twice, we can apply $\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}$ and $(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})/2$ or we can apply $(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})/2$ twice. Not surprisingly, these double contractions with $\boldsymbol{r}$ indeed extract the 3 possible gyrotropic parts, as it is easy to verify that
We are now ready to guess how to write the decomposition of the gyrotropic fourth-order moment. Motivated by the previous decompositions, it obviously has to be something in the form of
where we still did not determine how the symmetric operator acts here. Importantly, the symmetric operator ‘Sym’ is not equivalent to the symmetric operator ‘ $S$ ’ that cycles all the indices around, equation (8.2). The fluid hierarchy obviously needs two symmetric operators, one unique ‘ $S$ ’ that is used to derive the evolution equation of a given fluid moment $\unicode[STIX]{x1D653}^{(n)}$ , and one non-unique ‘Sym’ that is used for the decomposition of that fluid moment. The determination of how ‘Sym’ acts here is not that obvious. Nevertheless, one can consider in how many ways one can extract the gyrotropic components from $r_{ijkl}$ . For $r_{\Vert \Vert }$ , one does two double contractions with $(\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})$ . The possible choices are $(\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})_{ij}(\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})_{kl}$ ; $(\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})_{ik}(\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})_{jl}$ and $(\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})_{il}(\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})_{jk}$ , however, all of these choices are equivalent. To obtain the $r_{\Vert \bot }$ , we perform double contractions with $(\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})$ and $(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})/2$ , where the last operator contains a function $\unicode[STIX]{x1D6FF}_{ij}$ . How many different delta functions we can obtain from 4 indices $i,j,k,l$ ? There are $\binom{4}{2}=4!/2!(4-2)!=6$ different possibilities:
and all of them are non-equivalent. The symmetric operator acting in the second term therefore has 6 components
Lastly, to obtain the $r_{\bot \bot }$ component, one applies two double contractions with $(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})/2$ that results in combinations of $\unicode[STIX]{x1D6FF}_{ij}\unicode[STIX]{x1D6FF}_{kl}$ . How many possibilities do we have? Obviously, there are 6 possibilities for the first delta function, which, by pairing with the other delta function in a way that indices are not repeated, yields together 6 possibilities $\unicode[STIX]{x1D6FF}_{ij}\unicode[STIX]{x1D6FF}_{kl};\unicode[STIX]{x1D6FF}_{ik}\unicode[STIX]{x1D6FF}_{jl};\unicode[STIX]{x1D6FF}_{il}\unicode[STIX]{x1D6FF}_{jk};\unicode[STIX]{x1D6FF}_{jk}\unicode[STIX]{x1D6FF}_{il};\unicode[STIX]{x1D6FF}_{jl}\unicode[STIX]{x1D6FF}_{ik};\unicode[STIX]{x1D6FF}_{kl}\unicode[STIX]{x1D6FF}_{ij}$ . However, 3 possibilities have an equivalent pair, $\unicode[STIX]{x1D6FF}_{ij}\unicode[STIX]{x1D6FF}_{kl}=\unicode[STIX]{x1D6FF}_{kl}\unicode[STIX]{x1D6FF}_{ij}$ ; $\unicode[STIX]{x1D6FF}_{ik}\unicode[STIX]{x1D6FF}_{jl}=\unicode[STIX]{x1D6FF}_{jl}\unicode[STIX]{x1D6FF}_{ik}$ and $\unicode[STIX]{x1D6FF}_{il}\unicode[STIX]{x1D6FF}_{jk}=\unicode[STIX]{x1D6FF}_{jk}\unicode[STIX]{x1D6FF}_{il}$ and there are only 3 non-equivalent combinations. The symmetric operator acting on the last term in (8.6) can be determined to be
The factor $1/2$ is actually not that obvious, and one needs to verify that the decomposition indeed satisfies (8.5). The entire fourth-order gyrotropic moment $\boldsymbol{r}^{\text{g}}$ is decomposed in the index notation according to
It might be tempting to rearrange this decomposition to a more compact form
nevertheless, in actual calculations we find the form (8.10) to be more useful. It is important to verity that the decomposition (8.10) really works. A straightforward calculation yields
which is consistent with (8.3). The $r_{\Vert \bot }$ component calculates as
which is consistent with (8.4). The $r_{\bot \bot }$ component calculates as
which is consistent with (8.5). The decomposition (8.10) indeed works.
Now we need to verify expressions (7.94), (7.95) that were used in the scalar heat flux equations. The first expression calculates as
and the second expression calculates similarly as
which verifies (7.94), (7.95).
Furthermore, by exploring the $\unicode[STIX]{x2202}\boldsymbol{r}/\unicode[STIX]{x2202}t$ equation (8.1), at frequencies that are much smaller than the gyrofrequency, the gyrotropic part of $\boldsymbol{r}$ should satisfy
in the same way that the gyrotropic parts of $\boldsymbol{p}$ and $\boldsymbol{q}$ satisfied this requirement. By using the gyrotropic (8.10), it is easy to calculate for example that
and by adding together all the 4 representations of the ‘ $S$ ’ operator, one can indeed verify that all the terms cancel, yielding (8.32). By decomposing the entire fourth-order moment into its gyrotropic and non-gyrotropic parts
the evolution equation (8.1) can therefore be rewritten as
8.1 Non-gyrotropic $\boldsymbol{r}^{\text{ng}}$
Basic properties of the non-gyrotropic tensor $\boldsymbol{r}^{\text{ng}}$ can be easily determined with a similar procedure as those we used for the non-gyrotropic pressure $\unicode[STIX]{x1D72B}$ and the non-gyrotropic heat flux $\boldsymbol{q}^{\text{ng}}$ . The decomposition of the entire fourth-order moment is
and the meaning of the ‘Sym’ operators was specified by (8.8), (8.9). By using definitions (8.3), (8.4), (8.5) for $r_{\Vert \Vert }$ , $r_{\Vert \bot }$ , $r_{\bot \bot }$ in the expression above, the full decomposition reads
Now, applying $:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}$ twice to both sides of the equation yields $(\boldsymbol{r}:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}):(\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})=(\boldsymbol{r}:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}):(\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})+(\boldsymbol{r}^{\text{ng}}:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}):(\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})$ , implying $(\boldsymbol{r}^{\text{ng}}:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}):(\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})=0$ , which can be also rewritten as
Similarly, the other two properties are derived by either applying $:\hat{\boldsymbol{b}}$ and $:(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})$ , or twice $:(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})$ , yielding
which, by using the property (8.38), are further reduced to
The last two properties can be also written as $\text{Tr}\,\boldsymbol{r}^{\text{ng}}:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}=0$ , $\text{Tr}\,\text{Tr}\,\boldsymbol{r}^{\text{ng}}=0$ . Finally, writing all 3 properties in the index notation
One would assume that the non-gyrotropic $\boldsymbol{r}^{\text{ng}}$ can be evaluated by rewriting (8.35) as
and then expanding the right-hand side similarly to what we did for the non-gyrotropic $\unicode[STIX]{x1D72B}$ (and also the non-gyrotropic heat flux vectors in appendix D). However, the equation does not seem to yield anything useful. Instead, one needs to consider a specific example of a bi-Maxwellian distribution function.
8.2 Bi-Maxwellian distribution
Here we consider a special case of a bi-Maxwellian distribution function
It is useful to use the fluctuating velocity $\boldsymbol{c}=\boldsymbol{v}-\boldsymbol{u}$ , where $c^{2}=c_{\Vert }^{2}+c_{\bot }^{2}$ and $c_{\bot }^{2}=c_{x}^{2}+c_{y}^{2}$ . For brevity of the calculations we introduce
where here the thermal speeds are meant to be spatially dependent, i.e. they are written with $T$ and not with $T^{(0)}$ . The bi-Maxwellian distribution therefore reads
It is useful to remind ourselves of the following one-dimensional integrals
and that the general integral with $x^{n}$ , where $n$ is an integer, reads
The double factorial $(n-1)!!=1\cdot 3\cdot 5\cdots (n-1)$ , with $(-1)!!=1$ . The identity (8.47) is easily obtained by performing differentiation $\unicode[STIX]{x2202}/\unicode[STIX]{x2202}\unicode[STIX]{x1D6FC}$ of the first result in (8.46), and this technique is useful in the calculation of the fluid moments.
To become more familiar with the bi-Maxwellian distribution, it is useful to verify whether integrals over $f_{0}$ indeed yield the expected fluid moments. Since the macroscopic velocity $\boldsymbol{u}$ is independent of $\boldsymbol{v}$ , changing from variable $\boldsymbol{v}$ to $\boldsymbol{c}$ just yields $\text{d}^{3}v=\text{d}^{3}c$ (similarly to the substitution $y=x+\text{const.}$ that yields $\text{d}y=\,\text{d}x$ ). We use the notations $\text{d}^{3}c=\text{d}c_{\Vert }\,\text{d}^{2}c_{\bot }$ and $\text{d}^{2}c_{\bot }=\text{d}c_{x}\,\text{d}c_{y}$ . Also, because the integrals are evaluated from $-\infty$ to $\infty$ , substitution from $\boldsymbol{v}$ to $\boldsymbol{c}$ does not change these bounds. Integration in velocity space therefore yields
For simplicity we have integrated in Cartesian coordinates (instead of perhaps the more elegant cylindrical coordinates), which has an additional benefit in that we can stop writing the integral bounds, since the integrals are always from $-\infty$ to $\infty$ . By using the bi-Maxwellian $f_{0}$ it is very easy to verify that
Continuing with the heat flux components yields
It is important to emphasize that the zero heat flux values were obtained by assuming some prescribed equilibrium $f_{0}$ , here bi-Maxwellian. A crucial principle used in constructing fluid models with higher-order moments is that the specific $f_{0}$ is assumed only for the last retained moment. Indeed, if one closes the hierarchy by prescribing $q_{\Vert }=0,q_{\bot }=0$ , the CGL fluid model is obtained. However, to go higher in the fluid hierarchy, evolution equations for $q_{\Vert }$ and $q_{\bot }$ cannot be eliminated, even if the hierarchy will be eventually closed, for example at the fourth-order moment level, by assuming a bi-Maxwellian $f_{0}$ . Or in another words, the zero heat flux values were obtained for a distribution function that is strictly $f_{0}$ , however, fluctuations/perturbations around $f_{0}$ are still allowed. Therefore, to obtain the next available model after the CGL, the heat flux equations must be retained, and a closure is performed on the fourth-order moment. Only this last moment is calculated by assuming a specific distribution function, that is strictly $f_{0}$ . For a bi-Maxwellian, the gyrotropic fourth-order moments are calculated according to
Therefore, for a bi-Maxwellian, the following closure can be constructed
and this closure is known as the ‘normal’ closure, a name suggested by Chust & Belmont (Reference Chust and Belmont2006).
8.3 Bi-Maxwellian non-gyrotropic $\boldsymbol{r}^{\text{ng}}$ contributions
Let us consider a specific case of a bi-Maxwellian distribution function. Then, an expansion procedure analogous to the one developed by Grad (Reference Grad1949) for rarefied gases can be used, where the distribution function is expanded in a series of Hermite polynomials. By keeping only the first few terms, one can express the fourth-order moment through the second-order (pressure) moments. The procedure is written down in Oraevskii et al. (Reference Oraevskii, Chodura and Feneberg1968), and the entire fourth-order moment can be decomposed in the following way: see (2.31) in Oraevskii et al. (Reference Oraevskii, Chodura and Feneberg1968) (where a small typo on the left-hand side is present, where instead of $q_{\unicode[STIX]{x1D6FC}\unicode[STIX]{x1D6FD}\unicode[STIX]{x1D6FE}\unicode[STIX]{x1D716}}$ , there should be $q_{\unicode[STIX]{x1D6FC}\unicode[STIX]{x1D6FD}\unicode[STIX]{x1D6FE}\unicode[STIX]{x1D6FF}}$ ), equation (24) in Goswami et al. (Reference Goswami, Passot and Sulem2005), equation (9) in Passot & Sulem (Reference Passot and Sulem2007)
where the non-gyrotropic contributions read
and where terms $\unicode[STIX]{x1D6F1}_{ij}\unicode[STIX]{x1D6F1}_{lk}+\unicode[STIX]{x1D6F1}_{ik}\unicode[STIX]{x1D6F1}_{jl}+\unicode[STIX]{x1D6F1}_{il}\unicode[STIX]{x1D6F1}_{jk}$ were neglected. By using $p_{ij}^{\text{g}}=p_{\Vert }\hat{b}_{i}\hat{b}_{j}+p_{\bot }(\unicode[STIX]{x1D6FF}_{ij}-\hat{b}_{i}\hat{b}_{j})$ , it can be indeed shown by direct straightforward calculation that the gyrotropic part
This agrees with the decomposition (8.10) valid for a general distribution function, after one specifies that for a bi-Maxwellian distribution $r_{\Vert \Vert }=3p_{\Vert }^{2}/\unicode[STIX]{x1D70C}$ , $r_{\Vert \bot }=p_{\Vert }p_{\bot }/\unicode[STIX]{x1D70C}$ and $r_{\bot \bot }=2p_{\bot }^{2}/\unicode[STIX]{x1D70C}$ (last term in (8.10) contains $r_{\bot \bot }/2$ ).
Now it is possible to calculate the non-gyrotropic contributions $\boldsymbol{r}^{\text{ng}}$ , by using (8.58). It is useful to pre-calculate the trace of (8.58) that reads
The two terms that enter the parallel and perpendicular heat flux equations (7.96), (7.97) calculate as
At this stage, we need to further simplify, and we keep only those terms that contribute at the linear level, yielding
so the term entering the $q_{\bot }$ equation (7.97) reads
Note that $(\hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D72B}):\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}=-\unicode[STIX]{x1D72B}:(\hat{\boldsymbol{b}}(\hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D735})\hat{\boldsymbol{b}})^{S}$ , and the term does not contribute at the linear level. Of course, many nonlinear terms were neglected, and technically, the only fully consistent procedure is to explicitly write those terms at the linear level, in the form
The results show that, if the $\unicode[STIX]{x1D72B}$ contributions are kept in the heat flux equations, the $\boldsymbol{r}^{\text{ng}}$ contributions cannot just be straightforwardly neglected, since the $\unicode[STIX]{x1D72B}$ and $\boldsymbol{r}^{\text{ng}}$ contributions partially cancel out.
9 Bi-Maxwellian fluid model, second-order CGL (CGL2)
In the previous section we have seen that for a bi-Maxwellian distribution function, one can close the fluid hierarchy by prescribing the ‘normal’ closure
It is useful to summarize the fully nonlinear model that we want to consider here. We consider only proton species, and we make the electrons massless and cold. We simplify the pressure equations, by neglecting the FLR stress forces (which enter only nonlinearly), and we keep only those non-gyrotropic contributions in the pressure and heat flux equations that have a non-zero linear contributions. The nonlinear model reads
To develop a vocabulary, let us first consider the above fluid model with all the non-gyrotropic fluctuations neglected, i.e. when all the terms with $\unicode[STIX]{x1D72B}$ , $\boldsymbol{S}_{\bot }^{\Vert }$ , $\boldsymbol{S}_{\bot }^{\bot }$ are neglected. Additionally, let us neglect the Hall term in the induction equation. Such a fluid model is non-dispersive, and represents a generalization of the non-dispersive CGL model. Even though such a fluid model was not explicitly considered by Chew, Goldberger and Low, the pressure equations that include the gyrotropic heat flux contributions $q_{\Vert },q_{\bot }$ , were indeed given by Chew et al. (Reference Chew, Goldberger and Low1956). As discussed previously, the authors just used a different notation with $q_{n}=q_{\Vert }-3q_{\bot }$ and $q_{s}=q_{\bot }$ , but their pressure equations contain the gyrotropic heat flux. The authors did not consider evolution equations for the gyrotropic heat fluxes, equations (9.7), (9.8). Nevertheless, the name CGL is so well recognized by the community, i.e. the name CGL is essentially recognized as the collisionless MHD, that we suggest calling the non-dispersive fluid model that used the above gyrotropic heat flux equations, ‘the second-order CGL’, abbreviated as ‘CGL2’. The name CGL2 has a nice advantage in that the abbreviations for many models considered previously can be easily and naturally generalized.
The CGL2 fluid model does not contain any dispersive effects and it is length-scale invariant, similarly to the CGL and MHD models. If the Hall term in the induction equation is considered, this yields the ‘Hall-CGL2’ model. Considering also the first-order FLR corrections to the pressure tensor (FLR1) or the second-order corrections (FLR2), yields the ‘Hall-CGL2-FLR1’ and the ‘Hall-CGL2-FLR2’ fluid models. Finally, considering the non-gyrotropic heat flux fluctuations $\boldsymbol{S}_{\bot }^{\Vert }$ , $\boldsymbol{S}_{\bot }^{\bot }$ , yields the Hall-CGL2-FLR3 model. Fluid models with the Hall term neglected can be easily abbreviated as ‘CGL2-FLR1’, ‘CGL2-FLR2’ etc. Even though, if the dispersive effects of the FLR corrections are considered, there is really no reason to neglect the simple Hall term, and the use of such fluid models is discouraged. Obviously, the name ‘CGL2’ is extremely beneficial and the very natural for the classification of fluid models, and we will use this name henceforth.
It is important to correctly normalize the heat flux, and the normalization is according to $\widetilde{q}_{\Vert ,\bot }=q_{\Vert ,\bot }/(p_{\Vert }^{(0)}V_{A})$ , so both the parallel and perpendicular heat fluxes are normalized in the same way (similarly, both $p_{\Vert }$ and $p_{\bot }$ are normalized with respect to $p_{\Vert }^{(0)}$ ). By dropping the tilde, the scalar pressure equations (9.5), (9.6) remain unchanged, and the normalized nonlinear heat flux equations read
Now we need to linearize the system, where one prescribes $q_{\Vert }^{(0)}=0$ and $q_{\bot }^{(0)}=0$ . Once normalized, linearized, transformed to Fourier space and written in the $x$ – $z$ plane, the model reads (dropping the tilde everywhere)
As previously, $v_{A\Vert }^{2}=1+(\unicode[STIX]{x1D6FD}_{\Vert }/2)(a_{p}-1)$ .
9.1 CGL2 dispersion relation
Neglecting all the dispersive effects, prescribing $\unicode[STIX]{x1D72B}=0$ , $\boldsymbol{S}_{\bot }^{\Vert }=0$ , $\boldsymbol{S}_{\bot }^{\bot }=0$ , and eliminating the Hall term in the induction equation (i.e. terms proportional to $\text{i}k^{2}$ in the above induction equation), yields the CGL2 model. By exploring the system, it is obvious that the Alfvén mode separates from the system in the $u_{y},B_{y}$ components. The dispersion relation of the Alfvén mode in the (non-dispersive) CGL2 model is therefore the same as in the CGL model, and reads (the normalization tildes are dropped)
The ‘hard’ threshold of the oblique firehose instability is therefore the same as in the CGL model, and correctly reproduced. The entire dispersion relation for the CGL2 fluid model can be written in the following form
The CGL2 fluid model therefore contains 5 forward and 5 backward propagating modes. The oblique Alfvén mode is separated as stated above, and is incompressible. The remaining 4 forward and 4 backward waves are generally compressible and coupled together through the eighth-order polynomial in $\unicode[STIX]{x1D714}$ (fourth-order polynomial in $\unicode[STIX]{x1D714}^{2}$ ). Let us ignore for a moment the distinction between the forward and backward modes, since these will always be symmetric in $\unicode[STIX]{x1D714}$ . The CGL2 model contains 5 waves/modes. The oblique Alfvén mode is separated, and the remaining 4 compressible modes are in general strongly coupled. In contrast to the simpler CGL (and MHD) model, that contains only 2 compressible modes (slow and fast), we therefore have 2 ‘new’ modes, that do not have an analogy in the usual CGL and MHD descriptions. There is no standardized vocabulary on how the modes should be named, perhaps the expression ‘higher-order modes’ is the most appropriate. For the highly oblique propagation, considering the mirror instability, sometimes the expression ‘mirror modes’ is used. Nevertheless, since none of the 4 compressible modes match the slow and fast CGL dispersion relations, the distinction between modes becomes blurry. The reader has to get used to the fact that, by considering higher-order fluid moments, we perhaps came closer to the kinetic theory, which admits an infinite number of modes that are difficult to classify, unless a specific situation is considered. The difference between the CGL2 model and the kinetic description is that, unless a firehose or mirror instability threshold is reached, the fluid modes are un-damped.
For strictly parallel propagation $(k_{\bot }=0)$ , in addition to the parallel Alfvén mode (9.22) $\unicode[STIX]{x1D714}=\pm k_{\Vert }v_{A\Vert }$ that was already separated, the solution of the eighth-order polynomial contains another Alfvén mode $\unicode[STIX]{x1D714}=\pm k_{\Vert }v_{A\Vert }$ , which was of course expected and is equivalent to the CGL model, since the components $u_{x},u_{y},B_{x},B_{y}$ de-couple for parallel propagation. The remaining solutions are
Solutions (9.28) and (9.29) can be obtained by considering a pure 1-D geometry (where $p_{\bot }$ and $q_{\bot }$ disappear since the kinetic velocity $v_{\bot }$ disappears) and considering only fluctuations in the quantities $\unicode[STIX]{x1D70C},u_{z},p_{\Vert },q_{\Vert }$ . The $q_{\Vert }$ heat flux fluctuations in the CGL2 fluid model are therefore responsible for ‘splitting’ of the CGL ion-acoustic mode $\unicode[STIX]{x1D714}=\pm k_{\Vert }\sqrt{3\unicode[STIX]{x1D6FD}_{\Vert }/2}$ into two modes (9.28) and (9.29).
Considering solutions for strictly perpendicular propagation $(k_{\Vert }=0)$ , the only non-zero mode is the fast mode
and the dispersion relation is unchanged from the CGL model (for perpendicular propagation the $q_{\Vert },q_{\bot }$ contributions to the equations for $p_{\Vert },p_{\bot }$ naturally vanish and the CGL2 model is equivalent to the CGL model).
9.2 Mirror instability
A very interesting direction of propagation for this fluid model is the highly oblique limit, $k_{\bot }\gg k_{\Vert }$ , since the CGL2 fluid model contains the correct mirror instability threshold. The correct mirror threshold can be already seen in the quantity $A_{0}$ , where, in contrast to the usual CGL model, the factor $1/6$ disappears in the last term ${\sim}a_{p}^{2}\unicode[STIX]{x1D6FD}_{\Vert }$ .
In the highly oblique limit, the fast mode (9.31) can be separated from the coupled eighth degree polynomial (actually fourth degree polynomial in $\unicode[STIX]{x1D714}^{2}$ ) in the following way. Importantly, one cannot prescribe a completely perpendicular propagation $k_{\Vert }=0$ , and a highly oblique limit $k_{\bot }\gg k_{\Vert }$ , or $k_{\Vert }\rightarrow 0$ , has to be considered. The polynomial coefficients can be approximated as
An alternative approximation is to use $k_{\bot }\rightarrow k$ in the above expressions. The coefficient $A_{6}$ does not contain any $k_{\Vert }$ and is very large compared to $A_{4},A_{2},A_{0}$ that all contain $k_{\Vert }$ . The solutions will inevitably be one fast mode, and 3 very slow modes. In this specific case, the fast mode can be quickly separated from the eighth-order polynomial in (9.23) with a neat trick
The trick can be verified by multiplying both brackets term by term, dividing by $-A_{6}$ and observing that $(A_{4},A_{2},A_{0})/A_{6}$ is negligible. The fast mode is always obtained correctly in this way (when the $A_{6}$ is much larger than all the other coefficients), however, the dispersion relation for the 3 slow modes is correct only in this specific case when the system is non-dispersive, and all 3 slow modes have constant phase speeds. A proper limit and expansion should be always checked.
Nevertheless, in the highly oblique limit, the fast mode $\unicode[STIX]{x1D714}=\pm k_{\bot }\sqrt{1+a_{p}\unicode[STIX]{x1D6FD}_{\Vert }}$ is separated, and the rest of the dispersion is of sixth order in $\unicode[STIX]{x1D714}$ and contains 3 ‘slow’ modes. By cancelling out the $k_{\bot }^{2}$ that is common in all of the remaining coefficients (or $k^{2}$ ), an obvious substitution is offered that reads $\bar{\unicode[STIX]{x1D714}}=\unicode[STIX]{x1D714}/(k_{\Vert }\sqrt{\unicode[STIX]{x1D6FD}_{\Vert }})$ , and that yields a dispersion equation of third order in $\bar{\unicode[STIX]{x1D714}}^{2}$ in the form
Now, at exactly the mirror threshold $a_{p}^{2}\unicode[STIX]{x1D6FD}_{\Vert }=1+a_{p}\unicode[STIX]{x1D6FD}_{\Vert }$ , the last term disappears and we have one solution $\bar{\unicode[STIX]{x1D714}}^{2}=0$ , and two solutions
that are both positive. Slightly below or beyond the mirror threshold, we can prescribe $a_{p}^{2}\unicode[STIX]{x1D6FD}_{\Vert }=(1+a_{p}\unicode[STIX]{x1D6FD}_{\Vert })(1+\unicode[STIX]{x1D716})$ , where $\unicode[STIX]{x1D716}$ is a small parameter, meaning that for $\unicode[STIX]{x1D716}<0$ , the system is slightly below the expected mirror threshold, i.e. stable. For $\unicode[STIX]{x1D716}>0$ , the system is slightly beyond the expected mirror threshold, i.e. unstable. The polynomial transforms to
It is possible to show that this polynomial has a discriminantFootnote 12 $\triangle >0$ , implying the existence of 3 distinct real roots. Now, when $\unicode[STIX]{x1D716}$ is small, the two solutions (9.38) will remain almost the same and change only slightly, importantly, the solutions will remain positive. Solutions of a cubic polynomial $(x-x_{1})(x-x_{2})(x-x_{3})=0$ form the last coefficient in that polynomial, $-x_{1}x_{2}x_{3}$ , here equal to $\frac{3}{8}\unicode[STIX]{x1D716}$ , and here $x_{1}$ and $x_{2}$ are positive. Therefore, $\unicode[STIX]{x1D716}<0$ (slightly below the threshold) implies $x_{3}>0$ ; and $\unicode[STIX]{x1D716}>0$ (slightly beyond the threshold) implies $x_{3}<0$ . The negative solution (in $(\bar{\unicode[STIX]{x1D714}}^{2})$ ) represents the mirror instability, which finishes the analytic proof that the (highly oblique) mirror threshold in CGL2 model is indeed $a_{p}^{2}\unicode[STIX]{x1D6FD}_{\Vert }=1+a_{p}\unicode[STIX]{x1D6FD}_{\Vert }$ . The CGL2 fluid model therefore contains the same mirror instability threshold, as is found in kinetic theory.
The conclusion can be double checked by numerically exploring solutions of (9.39) for several values of $\unicode[STIX]{x1D716}$ , and it is indeed possible to conclude that, for $\unicode[STIX]{x1D716}<0$ (slightly below the threshold), the signs of the solutions are $\bar{\unicode[STIX]{x1D714}}^{2}=+,+,+$ ; and that, for $\unicode[STIX]{x1D716}>0$ (slightly beyond the mirror threshold), the signs of the solutions are $\bar{\unicode[STIX]{x1D714}}^{2}=+,+,-$ . The solution with the minus sign represents the mirror instability. It is noted that the Landau fluid description allows for the construction of a fluid model with quasi-static heat fluxes, that also correctly reproduces the ‘hard’ mirror instability threshold (Sulem & Passot Reference Sulem and Passot2012). Considering Landau fluid models, the mirror instability was numerically investigated in detail for example by Passot & Sulem (Reference Passot and Sulem2007), Passot et al. (Reference Passot, Sulem and Hunana2012) and Sulem & Passot (Reference Sulem and Passot2015). As a suggestion for possible future work, it would be beneficial to numerically calculate ‘Hellinger’s’ contours (see § 6.3) with the prescribed maximum growth rate for the mirror instability. If one uses Landau fluid models with sufficiently precise FLR corrections, the results should match the kinetic contours very precisely.
9.3 Hall-CGL2 dispersion relation
Considering the Hall term, the dispersion relation of the Hall-CGL2 fluid model reads
where the left-hand side represents the CGL2 dispersion relation (9.23) and the right-hand side is the Hall-term contribution. For strictly parallel propagation, the solutions are the usual Hall-CGL whistler and ion-cyclotron waves $\unicode[STIX]{x1D714}=\pm k_{\Vert }^{2}/2+k_{\Vert }\sqrt{v_{A\Vert }^{2}+k_{\Vert }^{2}/4}$ , $\unicode[STIX]{x1D714}=\pm k_{\Vert }^{2}/2-k_{\Vert }\sqrt{v_{A\Vert }^{2}+k_{\Vert }^{2}/4}$ , that are accompanied by the CGL2 solutions (9.28), (9.30) and (9.29). For strictly perpendicular propagation, the solution is (9.31) since the Hall term vanishes. The dispersion relation of the Hall-CGL-FLR1 model was already quite large to write down, and we do not provide the final dispersion relation for the Hall-CGL2-FLR1 model, instead we recommend working with the full system of linearized equations and solving it numerically or using analytic software such as Maple.
9.4 ‘Static’ closure – generalized isothermal closure
There exists even simpler fluid closure that recovers the correct mirror threshold, called the ‘static’ or ‘quasi-static’ closure (Constantinescu Reference Constantinescu2002; Chust & Belmont Reference Chust and Belmont2006; Passot et al. Reference Passot, Ruban and Sulem2006). By using heat flux equations (9.7), (9.8) and considering the quasi-static regime with $\unicode[STIX]{x2202}q_{\Vert }/\unicode[STIX]{x2202}t=0$ , $\unicode[STIX]{x2202}q_{\bot }/\unicode[STIX]{x2202}t=0$ , $\boldsymbol{u}=0$ yields
where the FLR pressure tensor $\unicode[STIX]{x1D72B}$ was neglected in the second equation. It is useful to define $\unicode[STIX]{x2202}_{\Vert }\equiv \hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D735}$ , which represents the gradient along the magnetic field lines, and to use the identity
further yielding
Equation (9.43) describes the evolution of the parallel and perpendicular temperature fluctuations along the magnetic field lines, and it can be verified that the solution is
where $a_{p}=T_{\bot }^{(0)}/T_{\Vert }^{(0)}$ . Solution (9.44) is referred to as the ‘static’ closure and it is, for example, equivalent to (19), (20) of Passot et al. (Reference Passot, Ruban and Sulem2006), and (2) of Constantinescu (Reference Constantinescu2002). For isotropic mean temperatures $a_{p}=1$ , equation (9.44) yields $T_{\bot }=T_{\bot }^{(0)}=T_{\Vert }^{(0)}$ and both parallel and perpendicular temperatures are isothermal. A similar result is obtained for general $a_{p}$ with $|\boldsymbol{B}|=B_{0}$ , which yields $T_{\bot }=T_{\bot }^{(0)}$ . The closure (9.44) can be therefore viewed as a generalization of the isothermal closure in the presence of temperature anisotropy and variations of magnetic field strength. The closure is used directly in the momentum equation (9.3), and the time-dependent pressure equations (9.5), (9.6) are disregarded. Therefore, even though the closure was derived by considering fourth-order moments, the resulting fluid model is actually simpler than the CGL2 and CGL models. As discussed for example by Passot et al. (Reference Passot, Ruban and Sulem2006), the closure prescribes a limitation for the minimal value of $|\boldsymbol{B}|/B_{0}>1-1/a_{p}$ that is required to prevent temperature singularity. Or in other words, by separating $|\boldsymbol{B}|=B_{0}+B^{(1)}$ the requirement reads $B^{(1)}>-1/a_{p}$ , meaning that the fluctuating $B^{(1)}$ cannot be too negative. Mirror instability is often associated with nonlinear structures in the form of magnetic holes and humps, and the requirement implies that the magnetic holes cannot be too deep. To easily analyse the dispersion relations, it is useful to write the closure at the linear level, in the following form
The result can be obtained by linearizing (9.44) or (9.43), and at the linear level $\unicode[STIX]{x2202}_{i}|\boldsymbol{B}|\overset{\text{lin}}{=}\unicode[STIX]{x2202}_{i}B_{z}$ . The normalized closure reads (tildes are dropped) $p_{\Vert }=n$ and $p_{\bot }=a_{p}n+a_{p}(1-a_{p})B_{z}$ . Considering the simplest non-dispersive model (by neglecting the Hall term and $\unicode[STIX]{x1D72B}$ ), the dispersion relation of this fluid model reads
In comparison with the dispersion relation (3.193) of the adiabatic CGL model, the ‘static’ closure eliminates the erroneous $1/6$ factor in the $A_{0}$ coefficient and yields the correct mirror threshold. The erroneous $1/6$ factor in the CGL model can be therefore interpreted as a result of inadequacy of adiabatic closures in the very slow-dynamics context, such as the mirror instability. For completeness, solutions for parallel propagation ( $k_{\bot }=0$ ) are $\unicode[STIX]{x1D714}=\pm v_{A\Vert }k_{\Vert }$ and $\unicode[STIX]{x1D714}=\pm \sqrt{(\unicode[STIX]{x1D6FD}_{\Vert }/2)}k_{\Vert }$ . For perpendicular propagation ( $k_{\Vert }=0$ ) the solution is $\unicode[STIX]{x1D714}=\pm \sqrt{1+a_{p}\unicode[STIX]{x1D6FD}_{\Vert }-a_{p}^{2}(\unicode[STIX]{x1D6FD}_{\Vert }/2)}k_{\bot }$ . Obviously, somewhere beyond the mirror threshold, the perpendicular fast mode experiences unphysical instability, which can be interpreted as a result of inadequacy of ‘static’ (isothermal) closures in the fast-dynamics context.
To clearly demonstrate that variations of magnetic field in the ‘static’ closures (9.45) are indeed crucial in recovering the mirror instability, let us neglect them and quickly consider strictly isothermal closure $p_{\Vert }=nT_{\Vert }^{(0)}$ ; $p_{\bot }=nT_{\bot }^{(0)}$ , which yields the following dispersion relation
As can be seen from the $A_{0}$ coefficient, this model does not recover the mirror instability. For parallel propagation the solutions are the same as for the ‘static’ closure, nevertheless, the perpendicular fast mode is now always stable with $\unicode[STIX]{x1D714}=\pm \sqrt{1+a_{p}(\unicode[STIX]{x1D6FD}_{\Vert }/2)}k_{\bot }$ . As a double check, after prescribing $a_{p}=1$ , the dispersion relation (9.47) is equal to (9.46), since the $B_{z}$ contributions in the ‘static’ closure disappear. Also, we considered models with polytropic indices $\unicode[STIX]{x1D6FE}_{\Vert },\unicode[STIX]{x1D6FE}_{\bot }$ in § 3.11, and the dispersion relation (9.47) is consistent with (3.171), after prescribing $\unicode[STIX]{x1D6FE}_{\Vert }=1$ , $\unicode[STIX]{x1D6FE}_{\bot }=1$ .
10 Bi-kappa fluid model (BiKappa)
10.1 Bi-kappa distribution function
Often in the solar wind and in space plasma physics, distribution functions with elongated high-energy ‘suprathermal’ tails are observed, that cannot be modelled with the bi-Maxwellian distribution. A slightly more general distribution function is the bi-kappa distribution, that contains one free parameter $\unicode[STIX]{x1D705}$ , see for example Summers & Thorne (Reference Summers and Thorne1992), Pierrard & Lazar (Reference Pierrard and Lazar2010), Livadiotis & McComas (Reference Livadiotis and McComas2013) and references therein. The $\unicode[STIX]{x1D705}$ parameter can be different in the parallel and transverse directions (see e.g. Basu (Reference Basu2009), dos Santos, Ziebell & Gaelzer (Reference dos Santos, Ziebell and Gaelzer2015)). Here, we consider only one free $\unicode[STIX]{x1D705}$ parameter. The distribution has the following form
where the generalized thermal speeds
and $\unicode[STIX]{x1D6E4}$ is the usual gamma function. In the limit $\unicode[STIX]{x1D705}\rightarrow \infty$ , the generalized thermal speeds are the usual thermal speeds and the distribution function is bi-Maxwellian, since
Why the generalized thermal speeds are defined this way will become clear from later calculations of the moments, and the choice of the power law index $-(\unicode[STIX]{x1D705}+1)$ , instead of perhaps the more logical $-\unicode[STIX]{x1D705}$ , is attributed to purely historical reasons. Calculating second-order (pressure) integrals with this distribution requires for convergence $\unicode[STIX]{x1D705}>3/2$ , and this value is also the minimum value for the generalized thermal speeds (10.2) to have real values. Sometimes, in the definition of the isotropic kappa distribution, a value of $\unicode[STIX]{x1D6E4}(3/2)=\sqrt{\unicode[STIX]{x03C0}}/2$ is used and $\unicode[STIX]{x03C0}^{3/2}=2\unicode[STIX]{x03C0}\sqrt{\unicode[STIX]{x03C0}}/2=2\unicode[STIX]{x03C0}\unicode[STIX]{x1D6E4}(3/2)$ . Again, for brevity of the calculations, it is useful to introduce $\unicode[STIX]{x1D6FC}_{\Vert }=1/(\unicode[STIX]{x1D705}\unicode[STIX]{x1D703}_{\Vert }^{2})$ , $\unicode[STIX]{x1D6FC}_{\bot }=1/(\unicode[STIX]{x1D705}\unicode[STIX]{x1D703}_{\bot }^{2})$ , and rewrite the bi-kappa distribution (10.1) in the following form
The important integral is
For the critical value of $a=1/2$ , the integral $\int 1/\sqrt{1+x^{2}}\,\text{d}x=\text{arcsinh}(x)$ , and since $\lim _{x\rightarrow \pm \infty }\text{arcsinh}(x)=\pm \infty$ , the integral diverges. For an isotropic kappa distribution $f(c^{2})$ , the three-dimensional integrals can be conveniently evaluated by transformation to spherical coordinates
In this case, the important integral is
and by a direct substitution
For an isotropic kappa distribution therefore
which explains the normalization factors. For an anisotropic bi-kappa distributions $f(c_{\Vert }^{2},c_{\bot }^{2})$ , the integration is slightly more complicated because, in contrast to a bi-Maxwellian distribution, the integration cannot be fully separated into two independent integrations over parallel and perpendicular velocity components, and has to be performed successively. It is possible to use a cylindrical coordinate system and integrate
However, here we keep the old fashioned Cartesian coordinates, and at the expense of a slightly bit more algebra (i.e. one more integration), we will integrate over $\,\text{d}^{3}c=\,\text{d}c_{\Vert }\,\text{d}^{2}c_{\bot }=\,\text{d}c_{\Vert }\,\text{d}c_{x}\,\text{d}c_{y}$ . A small added benefit is that we do not have to follow whether the integrals are $\int _{-\infty }^{\infty }$ or $\int _{0}^{\infty }$ , and we can stop writing the boundaries.
The most important integral is a slight generalization of (10.5) in the form
where for simplicity $b,\unicode[STIX]{x1D6FC}$ are assumed to be positive constants. So one can calculate
where, importantly, the ‘power law’ changed from $-a$ to $-(a-1/2)$ . Performing a successive three-dimensional integral therefore yields
and integrating the bi-kappa distribution over the velocity space
which verifies that the normalization constants in the bi-kappa definition (10.4) are indeed correct, and that $\int f_{0}d^{3}c=n$ . For later calculations, it is useful to write down two partial integrals when the integration is done over $\,\text{d}c_{\Vert }$ and $\,\text{d}^{2}c_{\bot }$ that read
and
Continuing with the calculation of velocity moments, it is easy to show that $\int \boldsymbol{c}f_{0}\,\text{d}^{3}c=0$ , which verifies $n\boldsymbol{u}=\int \boldsymbol{v}f_{0}\,\text{d}^{3}v$ . Continuing with the higher-order moments, it is possible to calculate those from table integrals (a great help for double checking is an analytic software like Maple or Mathematica) or similarly for a bi-Maxwellian distribution to use a trick with the differentiation with respect to $\unicode[STIX]{x1D6FC}$ as
where we have also used a property of the gamma function $x\unicode[STIX]{x1D6E4}(x)=\unicode[STIX]{x1D6E4}(x+1)$ , and the last integral requires $\unicode[STIX]{x1D705}>\frac{1}{2}$ . A slightly more general integral is
which can be used to evaluate the following integrals
Since we like to double check everything, the above integral can be also calculated as
For the calculation of the parallel pressure, we first integrate over $\text{d}^{2}c_{\bot }$ so we can use result (10.20) and then over $\text{d}c_{\Vert }$
where we have used $(\unicode[STIX]{x1D705}-\frac{3}{2})\unicode[STIX]{x1D6E4}(\unicode[STIX]{x1D705}-\frac{3}{2})=\unicode[STIX]{x1D6E4}(\unicode[STIX]{x1D705}-\frac{1}{2})$ . Calculating the perpendicular pressure, we here first integrate over $\text{d}c_{\Vert }$ so we can use result (10.17) and then over $\text{d}^{2}c_{\bot }$
The last two calculations clarify the reasoning behind the definition of the bi-kappa generalized thermal speeds $\unicode[STIX]{x1D703}_{\Vert }$ , $\unicode[STIX]{x1D703}_{\bot }$ (here rewritten with the notation $\unicode[STIX]{x1D6FC}_{\Vert }$ and $\unicode[STIX]{x1D6FC}_{\bot }$ ), where one starts with a bi-kappa distribution with unspecified $\unicode[STIX]{x1D6FC}_{\Vert }$ and $\unicode[STIX]{x1D6FC}_{\bot }$ , and calculating the parallel and perpendicular pressure integrals, yields the required forms $\unicode[STIX]{x1D6FC}_{\Vert }^{-1}=(\unicode[STIX]{x1D705}-\frac{3}{2})(2T_{\Vert }/m)$ ; $\unicode[STIX]{x1D6FC}_{\bot }^{-1}=(\unicode[STIX]{x1D705}-\frac{3}{2})(2T_{\bot }/m)$ . Then, the split to $\unicode[STIX]{x1D6FC}_{\Vert }^{-1}=\unicode[STIX]{x1D705}\unicode[STIX]{x1D703}_{\Vert }^{2}$ ; $\unicode[STIX]{x1D6FC}_{\bot }^{-1}=\unicode[STIX]{x1D705}\unicode[STIX]{x1D703}_{\bot }^{2}$ is dictated by the requirement that for $\unicode[STIX]{x1D705}\rightarrow \infty$ , the generalized thermal speeds converge to the usual thermal speeds, and which also yields that in this limit the distribution is bi-Maxwellian.
The bi-kappa distribution was specified with perhaps a somewhat obscure power law $-(\unicode[STIX]{x1D705}+1)$ instead of the more logical $-\unicode[STIX]{x1D705}$ , which is attributed to purely historical reasons. If one wants to define an anisotropic power-law distribution, a first guess would be
which then yields normalization constants
Then, calculating the parallel and perpendicular pressures yields the requirement $\unicode[STIX]{x1D6FC}_{\Vert ,\bot }^{-1}=(\unicode[STIX]{x1D705}-\frac{5}{2})(2T_{\Vert ,\bot }/m)$ . Further writing $\unicode[STIX]{x1D6FC}_{\Vert ,\bot }^{-1}=\unicode[STIX]{x1D705}\unicode[STIX]{x1D703}_{\Vert ,\bot }^{2}$ yields a distribution function
with generalized thermal speeds
The convergence of the pressure integrals requires $\unicode[STIX]{x1D705}>\frac{5}{2}$ , which is also the limiting case for the thermal speeds to be real. In the limit $\unicode[STIX]{x1D705}\rightarrow \infty$ the generalized thermal speeds converge to the usual thermal speeds and the distribution converges to the bi-Maxwellian. To double check that the distribution (10.31) is really correct, one can of course substitute $\unicode[STIX]{x1D705}\rightarrow \unicode[STIX]{x1D705}+1$ , which fully recovers the original bi-kappa distribution (10.1), since $\unicode[STIX]{x1D705}\unicode[STIX]{x1D703}^{2}\rightarrow (\unicode[STIX]{x1D705}+1)(1-5/2(\unicode[STIX]{x1D705}+1))(2T/m)=\unicode[STIX]{x1D705}(1-3/2\unicode[STIX]{x1D705})(2T/m)$ . From now on, let us continue calculations with the original bi-kappa distribution (10.1), that uses the $-(\unicode[STIX]{x1D705}+1)$ power-law index.
10.2 Bi-kappa fluid closure
Now that we are familiar with the bi-kappa distribution, we are ready to calculate higher-order moments. The heat fluxes are zero and easy to calculate, since both $q_{\Vert },q_{\bot }$ are anti-symmetric in $c_{\Vert }$ , and by a direct calculation
implying
As discussed previously for the bi-Maxwellian distribution, the heat fluxes must be kept in the general form, since we want to consider closure performed at the fourth-order moment level. To calculate the fourth-order fluid moments, we will need the following integrals
The $r_{\Vert \Vert }$ moment then calculates as
where in the last step we used that $\unicode[STIX]{x1D6E4}(x)(x+1)^{2}/\unicode[STIX]{x1D6E4}(x+2)=(x+1)/x$ with $x=\unicode[STIX]{x1D705}-5/2$ . The $r_{\Vert \bot }$ moment calculates as
and finally the $r_{\bot \bot }$ moment calculates as
To summarize, the following closure can be constructed
where for the bi-kappa distribution (10.1), the $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}$ coefficient reads
It is noteworthy that all three fourth-order moments are just multiplied by the same constant $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}$ . Obviously, in the limit $\unicode[STIX]{x1D705}\rightarrow \infty$ , $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}\rightarrow 1$ , and the ‘normal’ bi-Maxwellian closure is obtained.
As suggested for example by Chust & Belmont (Reference Chust and Belmont2006), a very large class of distribution functions can be accounted for by considering closures
where the 3 proportionality constants $\unicode[STIX]{x1D6FC}_{\Vert \Vert },\unicode[STIX]{x1D6FC}_{\Vert \bot },\unicode[STIX]{x1D6FC}_{\bot \bot }$ have to be determined, once a specific distribution function is prescribed. Such an unspecified closure can account for a very wide class of distribution functions. Also, a much more complicated distribution functions can be modelled by considering multi-fluid models. For example, one could consider a plasma consisting of 3 different electron species, that would describe the core, halo and the tail/strahl parts of the electron distribution function. Further information about the core, halo and strahl components of the electron distribution function can be found for example in Vocks et al. (Reference Vocks, Salem, Lin and Mann2005), Vocks, Mann & Rausche (Reference Vocks, Mann and Rausche2008), Pierrard, Lazar & Schlickeiser (Reference Pierrard, Lazar and Schlickeiser2011) and references therein.
Here, we consider closure (10.42), (10.43). Nevertheless, in the following calculations one can we keep the value of $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}$ unspecified. Therefore, the results are actually valid for a much larger class of fluid models, and not just for a bi-kappa distribution. Let us completely neglect the non-gyrotropic $\unicode[STIX]{x1D72B}$ and $\boldsymbol{r}^{\text{ng}}$ in the heat flux equations (7.98) and (7.99). We need to calculate
and the nonlinear heat flux equations for the bi-kappa fluid model read
Obviously, for $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}=1$ (i.e. for $\unicode[STIX]{x1D705}\rightarrow \infty$ ), the bi-Maxwellian heat flux equations are obtained. The model is now closed, and it is accompanied by (9.2)–(9.6). The model is nonlinear and numerical simulations for a chosen (and fixed) value of $\unicode[STIX]{x1D705}$ can be considered. Even though the $\unicode[STIX]{x1D705}$ coefficient is fixed, similarly to bi-Maxwellian nonlinear simulations, the temperature anisotropy is free to develop, and complicated plasma heating processes can be studied (we recommend adding the FLR pressure stress forces to the scalar pressure equations). We abbreviate the non-dispersive fluid model that uses the above heat flux equations as ‘BiKappa’. If the Hall term is considered in the induction equation, the ‘Hall-BiKappa’ fluid model is obtained. Similarly, considering FLR1 and FLR2 corrections to the pressure tensor, yields the ‘Hall-BiKappa-FLR1’ and ‘Hall-BiKappa-FLR2’ fluid models. The heat flux equations are normalized, linearized, written in the $x$ – $z$ plane and Fourier transformed according to
where we have used the same plasma beta definition as always $\unicode[STIX]{x1D6FD}_{\Vert }/2=p_{\Vert }^{(0)}/(V_{A}^{2}\unicode[STIX]{x1D70C}_{0})$ . The heat flux equations are accompanied by the linearized system (9.11)–(9.19).
To consider FLR3 corrections, one needs to derive non-gyrotropic heat flux vectors $\boldsymbol{S}_{\bot }^{\Vert }$ , $\boldsymbol{S}_{\bot }^{\bot }$ for the bi-kappa distribution. For the first-order vectors the derivation is straightforward, and by following the calculations in appendix D, it is easy to show that the nonlinear
Here, we neglected perturbations $\widetilde{r}$ , since we will not consider Landau fluid models for bi-kappa distribution. For $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}=1$ , the results are of course equivalent to the bi-Maxwellian expressions (D 37), (D 60). The second-order heat flux vectors are not addressed here, since we would have to derive $\boldsymbol{r}^{\text{ng}}$ for the bi-kappa distribution.
10.3 BiKappa dispersion relation
The BiKappa dispersion relation can be written in the following form
and for $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}=1$ it is of course equivalent to the CGL2 dispersion relation (9.23). The oblique Alfvén mode is not affected, and is identical to (9.22), implying that the long-wavelength threshold of the oblique firehose instability is identical as well.
For strictly parallel propagation $(k_{\bot }=0)$ , the solutions of the remaining eighth-order polynomial are
Note that in the last solution (10.61), the expression under the square root remains always positive, regardless of how large $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}$ is (i.e. when the $\unicode[STIX]{x1D705}$ -index approaches $5/2$ ). For strictly perpendicular propagation $(k_{\Vert }=0)$ , the fast mode is not affected, and is equivalent to (9.31).
10.4 Mirror instability
By considering the highly oblique limit $k_{\bot }\gg k_{\Vert }$ , and by assuming that $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}$ is not enormously large (i.e. that $\unicode[STIX]{x1D705}$ is not very close to $5/2$ ) so that $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}k_{\Vert }$ can be neglected with respect to $k_{\bot }$ , the polynomial coefficients can be approximated as
The fast mode can be quickly separated with the trick (9.36), and by using the substitution $\bar{\unicode[STIX]{x1D714}}=\unicode[STIX]{x1D714}/(k_{\Vert }\sqrt{\unicode[STIX]{x1D6FD}_{\Vert }})$ , the resulting polynomial reads
At exactly the (assumed) mirror threshold $a_{p}^{2}\unicode[STIX]{x1D6FD}_{\Vert }=1+a_{p}\unicode[STIX]{x1D6FD}_{\Vert }$ , the coefficient $A_{0}$ disappears, so we have one solution $\bar{\unicode[STIX]{x1D714}}^{2}=0$ ; from the remaining polynomial the $1+a_{p}\unicode[STIX]{x1D6FD}_{\Vert }$ can be factored out and the solutions are
both always positive. Slightly below $(\unicode[STIX]{x1D716}<0)$ or beyond $(\unicode[STIX]{x1D716}>0)$ the mirror threshold, one can prescribe $a_{p}^{2}\unicode[STIX]{x1D6FD}_{\Vert }=(1+a_{p}\unicode[STIX]{x1D6FD}_{\Vert })(1+\unicode[STIX]{x1D716})$ , where $\unicode[STIX]{x1D716}$ is a small parameter and the polynomial transforms to
and which for $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}=1$ is equivalent to the bi-Maxwellian polynomial (9.39). As previously, for small $\unicode[STIX]{x1D716}$ the solutions (10.67) remain almost the same, and positive. Furthermore, since multiplying all 3 solutions together must yield $-\frac{3}{8}\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}^{2}\unicode[STIX]{x1D716}$ , this implies that below $(\unicode[STIX]{x1D716}<0)$ the threshold the third solution is positive, and beyond $(\unicode[STIX]{x1D716}>0)$ the threshold the third solution is negative, which analytically proves that the (highly oblique) mirror instability threshold is $a_{p}^{2}\unicode[STIX]{x1D6FD}_{\Vert }=1+a_{p}\unicode[STIX]{x1D6FD}_{\Vert }$ , regardless of $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}$ . The long-wavelength limit or ‘hard’ mirror threshold for the bi-kappa distribution is therefore the same as for the bi-Maxwellian distribution.
Conclusion
We investigated solutions of the (non-dispersive) BiKappa fluid model. The calculations were done with closure (10.42) for a general $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}$ coefficient, i.e. without using the specific value (10.43) for the bi-kappa distribution. The calculations are therefore more general and valid for any distribution function that can be closed as (10.42). We conclude that all 3 long-wavelength ‘hard’ thresholds – for the parallel and oblique firehose instability and the highly oblique mirror instability – are not affected by the value of $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}$ .
10.5 Hall-BiKappa dispersion relation
The Hall-BiKappa dispersion relation can be written in the following form
where we kept the BiKappa dispersion relation (10.53)–(10.57) on the left-hand side, and the Hall term contributions to the right-hand side. For the bi-Maxwellian value $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}=1$ , the result is equivalent to the Hall-CGL2 dispersion relation (9.40). For strictly parallel propagation, the solutions are the usual dispersive ion-cyclotron and whistler waves (that are not affected by the $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}$ coefficient) and the non-dispersive solutions (10.59)–(10.61) of the BiKappa model. For strictly perpendicular propagation, the Hall term naturally disappears and the fast mode solution is (9.31).
11 Collisionless damping in fluid models – Landau fluid models
In the previous sections, we have calculated the fluid hierarchy up to the fourth-order fluid moment $\boldsymbol{r}$ . We have seen that, unless we crossed some instability threshold, the dispersion relations yielded frequencies $\unicode[STIX]{x1D714}$ that were purely real, and there was no collisionless damping present. Remarkably, collisionless damping, i.e. Landau damping, will be absent even if we continue to develop the fluid hierarchy to higher-order moments. In fact, we will see in the next section that, by considering parallel 1-D propagation (where all the fluctuations are along $\boldsymbol{B}_{0}$ ), the fluid models closed by a bi-Maxwellian fluid closure at higher-order moments than $\boldsymbol{r}$ are always unstable. Such fluid models are physically ill posed. To go higher in the fluid hierarchy, one needs to incorporate the Landau damping phenomenon into the fluid framework, and consider Landau fluid closures.
Nevertheless, to capture the Landau damping, it is not necessary to go beyond the fourth-order moment and closures performed at the fourth moment level describe the Landau damping with very good accuracy. Even though the entire Part 2 of this text is dedicated to Landau fluid closures, it is beneficial to briefly show why the Landau damping was omitted in the previous calculations. Let us consider the simplest example of 1-D electrostatic propagation. The collisionless Vlasov equation reads
Direct integration of the Vlasov equation (over $v_{z}$ ) for a general distribution function $f$ , of course yields the usual density equation, $\unicode[STIX]{x2202}n/\unicode[STIX]{x2202}t+\unicode[STIX]{x2202}_{z}(nu_{z})=0$ . The same result is obtained if one considers a specific $f_{0}$ , such as for example 1-D Maxwellian $f_{0}=n\sqrt{(\unicode[STIX]{x1D6FC}/\unicode[STIX]{x03C0})}\text{e}^{-\unicode[STIX]{x1D6FC}(v_{z}-u_{z})^{2}}$ , 1-D kappa distribution, etc. The last term of the Vlasov equation ${\sim}E_{z}$ vanishes by the integration, since $\int _{-\infty }^{\infty }(\unicode[STIX]{x2202}f/\unicode[STIX]{x2202}v_{z})\,\text{d}v_{z}=0$ . Now let us do the calculation differently, by expanding the distribution function $f=f_{0}+f^{(1)}$ , where $f_{0}$ does not have any time or spatial dependence, so $\unicode[STIX]{x2202}f_{0}/\unicode[STIX]{x2202}t=0,\unicode[STIX]{x2202}_{z}f_{0}=0$ . We can consider the Maxwellian $f_{0}=n_{0}\sqrt{(\unicode[STIX]{x1D6FC}/\unicode[STIX]{x03C0})}\text{e}^{-\unicode[STIX]{x1D6FC}v_{z}^{2}}$ where $\unicode[STIX]{x1D6FC}=m/(2T^{(0)})=1/v_{\text{th}}^{2}$ . The electric field is also expanded $E_{z}=E_{z}^{(0)}+E_{z}^{(1)}$ , and one assumes that there is no large-scale electric field, so $E_{z}^{(0)}=0$ . By neglecting the term with two small quantities $E_{z}^{(1)}\unicode[STIX]{x2202}f^{(1)}/\unicode[STIX]{x2202}v_{z}$ , the Vlasov equation reads
and transformation to Fourier space yields
The equation (11.3) is actually not that much different from (11.1), the Vlasov equation is just fully linearized and written in Fourier space. However, in kinetic calculations, the (11.3) is not directly integrated in this form. The crucial technique that reveals the presence of Landau damping in the Vlasov equation is that the equation (11.3) is first divided by $(\unicode[STIX]{x1D714}-k_{z}v_{z})$ , and an expression for $f^{(1)}$ is obtained. Only then is the equation integrated over the velocity space, yielding
The integral contains a ‘singularity’ in the denominator, and it is not obvious how to calculate this integral. Such singularities are called wave–particle resonances, and can have a more general form $\unicode[STIX]{x1D714}-k_{z}v_{z}\pm l\unicode[STIX]{x1D6FA}$ , where $\unicode[STIX]{x1D6FA}$ is the cyclotron frequency of the considered species, and $l$ is an integer. The resonance for $l=0$ is called the Landau resonance, and the other resonances for $l\neq 0$ are called cyclotron resonances. It is the presence of these resonances in integrals such as (11.4), that yields collisionless damping mechanisms, the Landau damping and the cyclotron damping. In Part 2 of our guide, we will see that the integral actually cannot be ‘calculated’, i.e. the integral cannot be expressed through elementary functions, even if $f_{0}$ is a simple Maxwellian. The integral is just expressed through the famous plasma dispersion function $Z(\unicode[STIX]{x1D701})$ , where $\unicode[STIX]{x1D701}=\unicode[STIX]{x1D714}/(|k_{z}|v_{\text{th}})$ , since the plasma dispersion function was specifically developed to describe this integral. Skipping many technical (but very important) details, such as the definition of $Z(\unicode[STIX]{x1D701})$ , for a Maxwellian $f_{0}$ the density integral is simply expressed as
One can define function $R(\unicode[STIX]{x1D701})=1+\unicode[STIX]{x1D701}Z(\unicode[STIX]{x1D701})$ , that is called the plasma response function. Now, for example, considering a proton–electron plasma with $T_{e}^{(0)}=T_{p}^{(0)}$ at wavelengths that are much longer than the Debye length, yields the following dispersion relation and numerical solution
The negative imaginary part represents the Landau damping, and the effect comes from the Landau resonance in the integral (11.4). Obviously, integrating the Vlasov equation directly in the form (11.1) or (2.1) neglects the wave–particle resonances in velocity space, which clarifies why traditional fluid models do not contain collisionless damping mechanisms. It is $f^{(1)}$ – the fluctuations/perturbations around $f_{0}$ – where the wave–particle resonances are written down explicitly, that yields the collisionless damping mechanisms in kinetic theory. To introduce collisionless damping into fluid models, we have no other choice, and we have to calculate the hierarchy of moments again, this time by integrating over the ‘kinetic’ $f^{(1)}$ . We find it useful to introduce the following vocabulary:
(i) Fluid moments $=$ integrals over a general (unspecified) distribution function $f$ , or over a specific $f_{0}$ . Evolution equations for fluid moments are obtained by direct integration of the Vlasov equation written in a form that does not explicitly contain wave–particle resonances. The resulting expressions can be nonlinear or linear and contain only fluid variables, i.e. expressions do not contain $Z(\unicode[STIX]{x1D701})$ .
(ii) Kinetic moments $=$ integrals over $f^{(1)}$ (perturbations $f^{(1)}=f-f_{0}$ , where a specific equilibrium $f_{0}$ is assumed), where the wave–particle resonances in velocity space are written down explicitly. The resulting expressions are linear and contain $Z(\unicode[STIX]{x1D701})$ , which is not a fluid variable.
Importantly, we want to keep the entire nonlinear fluid hierarchy discussed so far, including the nonlinear heat flux equations. Therefore, considering bi-Maxwellian distribution $f_{0}$ , instead of the ‘normal’ closure, the fourth-order moment is decomposed in the following form
The first terms represent the ‘normal’ closure, i.e. when an exact bi-Maxwellian $f_{0}$ is prescribed. The tilde components represent fluctuations/perturbations around $f_{0}$ , and a good notation would also be $\unicode[STIX]{x1D6FF}r$ . The decomposition (11.7) yields the following form of the nonlinear heat flux equations
Similarly to decomposition (11.7), ‘kinetic’ corrections with wave–particle resonances can also be considered for the non-gyrotropic $\boldsymbol{r}^{\text{ng}}$ , which is not addressed in this guide. Neglecting those contributions, the heat flux equations are equivalent for example to equations (14), (15) of Passot & Sulem (Reference Passot and Sulem2007), and equations (9), (10) of Passot et al. (Reference Passot, Sulem and Hunana2012).
In Part 2 of this guide, we will calculate the ‘kinetic moments’ for $\widetilde{r}_{\Vert \Vert }$ , $\widetilde{r}_{\Vert \bot }$ , $\widetilde{r}_{\bot \bot }$ , by integrating over $f^{(1)}$ . In the 3-D electromagnetic geometry, we will use a more sophisticated $f^{(1)}$ in the gyrotropic limit, which describes both Landau damping and its magnetic analogue, the transit-time damping. The expressions for the entire ‘kinetic’ hierarchy will contain $Z(\unicode[STIX]{x1D701})$ , or actually $R(\unicode[STIX]{x1D701})$ , which is not a fluid variable. Nevertheless, we will see that by introducing the concept of the Padé approximation to $R(\unicode[STIX]{x1D701})$ , that sometimes a rare possibility arises, when the ‘kinetic’ moment $\widetilde{r}$ can be expressed through lower-order ‘kinetic’ moments. Such a closure, where the last retained kinetic moment (in this case $\widetilde{r}$ ) is expressed through lower-order moments, in a way such that the kinetic $Z(\unicode[STIX]{x1D701})$ function is completely eliminated and that the closure is valid for all $\unicode[STIX]{x1D701}$ values, will be called a Landau fluid closure. For the impatient reader, we will reveal that one of the possibilities is going to be (written in Fourier space)
The first closure was obtained in the slab geometry by Hammett & Perkins (Reference Hammett and Perkins1990) and the remaining closures were obtained in the 3-D geometry by Snyder et al. (Reference Snyder, Hammett and Dorland1997) (note that thermal speeds in those papers are defined without the factor of 2). In Part 2 of our guide, we are going to map all the Landau fluid closures that can be constructed at the fourth-moment level, and we will identify the most precise closures. A summary of these findings can be found in Hunana et al. (Reference Hunana, Zank, Laurenza, Tenerani, Webb, Goldstein, Velli and Adhikari2018).
12 Evolution equation for $n$ th-order fluid moment
Now we are proficient with calculating the fluid moments for general $f$ , and before we start with kinetic calculations, it is useful to derive the time-dependent evolution equation for a general $n$ th-order fluid moment. Before going through this part, we highly recommend going through the parts of the paper where the pressure tensor and heat flux tensor equations are obtained. Then the algebra and the notation used here will look hopefully logical and natural. We define the $n$ th-order fluid moment $\unicode[STIX]{x1D653}^{(n)}$ according to
The symmetric operator is defined such that it cycles around through all possibilities according to
and it is compatible with the previously used symmetric operators for a second-order tensor $[\unicode[STIX]{x1D61F}^{(2)}]_{ij}^{S}=\unicode[STIX]{x1D61F}_{ij}^{(2)}+\unicode[STIX]{x1D61F}_{ji}^{(2)}$ and for a third-order tensor $[\unicode[STIX]{x1D61F}^{(3)}]_{ijk}^{S}=\unicode[STIX]{x1D61F}_{ijk}^{(3)}+\unicode[STIX]{x1D61F}_{jki}^{(3)}+\unicode[STIX]{x1D61F}_{kij}^{(3)}$ . We multiply the Vlasov equation by the mass $m$ and by the velocities $c_{i}c_{j}\ldots c_{n}$ and integrate over $d^{3}v$ . To integrate the first term of the Vlasov equation we will need
and the first term of the Vlasov equation calculates as
For the second term we will need
The second term calculates as
The third term calculates as
For the fourth term we will need
The fourth term calculates as
Combining all the results together $\unicode[STIX]{x2460}+\unicode[STIX]{x2461}+\unicode[STIX]{x2462}+\unicode[STIX]{x2463}=0$ , the evolution equation for $n$ th-order fluid moment $\unicode[STIX]{x1D653}_{ij\ldots n}^{(n)}$ reads (written in the vector notation without indices)
The final result should have only the time derivative of $\unicode[STIX]{x1D653}^{(n)}$ and, similarly to the heat flux tensor, we need to use the momentum equation to eliminate $\unicode[STIX]{x2202}_{t}\boldsymbol{u}$ . We subtract a total of $n$ momentum equations combined with $\unicode[STIX]{x1D653}^{(n)}$ in the form
Again, because of the symmetric operator, it does not matter if $\unicode[STIX]{x1D653}^{(n)}$ is applied to the momentum equation from the left or right. The subtraction also conveniently eliminates the electric field, however it introduces $\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{p}$ . We also need an identity
where in the last step we emphasized that it does not matter if the $\boldsymbol{B}\times$ applies only to $\boldsymbol{u}$ or to the entire tensor $\boldsymbol{u}\unicode[STIX]{x1D653}^{(n-1)}$ , since the vector product operates only through the first component of the tensor anyway. The final evolution equation reads
and it is valid for $n\geqslant 2$ . Evaluation at $m=3$ recovers the heat flux tensor equation (7.22) and evaluation at $m=2$ recovers the pressure tensor equation (2.28). Note that $\unicode[STIX]{x1D653}^{(4)}=\boldsymbol{r}$ , $\unicode[STIX]{x1D653}^{(3)}=\boldsymbol{q}$ , $\unicode[STIX]{x1D653}^{(2)}=\boldsymbol{p}$ , however, and very importantly, according to our definition (12.1), $\unicode[STIX]{x1D653}^{(1)}=0$ and it is not equal to the velocity $\boldsymbol{u}$ , which is defined according to (2.4). Additionally, $\unicode[STIX]{x1D653}^{(0)}=\unicode[STIX]{x1D70C}$ . The momentum equation can still be recovered by evaluating (12.13) at $n=1$ , and by dividing by $\unicode[STIX]{x1D70C}$ (where naturally the symmetric operator does not have any influence on a vector, and $u_{i}^{S}=u_{i}$ ). Similar equation to our (12.13) was also obtained for example by Waelbroeck (Reference Waelbroeck2010), equation (22), even though we did not verify whether it is consistent with ours since $\unicode[STIX]{x1D653}^{(n)}$ in that work is defined with $v$ instead of our $c$ .
We want to always decompose the tensor $\unicode[STIX]{x1D653}^{(n)}$ into its gyrotropic and non-gyrotropic parts, and to have space to write the ‘ $g$ ’ and ‘ $ng$ ’, let us move the index $(n)$ down, so that
By generalizing the results we have seen for the gyrotropic $\boldsymbol{p}^{\text{g}},\boldsymbol{q}^{\text{g}},\boldsymbol{r}^{\text{g}}$ , the same result is obtained for the $n$ th moment, and at low frequencies the gyrotropic part must satisfy
which can be indeed viewed as a definition of gyrotropy. The general (12.16) therefore rewrites as
yielding an implicit equation for the non-gyrotropic part in the form
Ideally, an ‘inversion procedure’ should be found for the left-hand side, so that an equation for $\unicode[STIX]{x1D653}_{(n)}^{\text{ng}}$ can be found, and then by expanding the right-hand side (for example at the leading order by considering only gyrotropic $\unicode[STIX]{x1D653}^{\text{g}}$ ), a fully nonlinear expressions for $\unicode[STIX]{x1D653}_{(n)}^{\text{ng}}$ can be obtained. Nevertheless, we have seen that the inversion procedure was already quite complicated for the matrix $(\hat{\boldsymbol{b}}\times \unicode[STIX]{x1D72B})^{S}$ , see (5.9). For a tensor of third rank, such as $(\hat{\boldsymbol{b}}\times \boldsymbol{q}^{\text{ng}})^{S}$ , the full inversion procedure is actually not addressed in this guide and an advanced reader is referred to (43) of Ramos (Reference Ramos2005). The inversion procedure was obtained only for a part of the $\boldsymbol{q}^{\text{ng}}$ , that can be decomposed into the non-gyrotropic heat flux vectors $\boldsymbol{S}_{\bot }^{\Vert }$ , $\boldsymbol{S}_{\bot }^{\bot }$ , which is done in detail in appendix D. For the rest of $\boldsymbol{q}^{\text{ng}}$ that consists of heat flux tensor $\unicode[STIX]{x1D748}$ , the inversion procedure cannot be avoided. Additionally, we have seen that for fourth-order moment $\boldsymbol{r}^{\text{ng}}$ , the equation (12.20) does not seem to yield anything useful even at the linear level, and a different procedure was used, where the specific bi-Maxwellian distribution was expanded in Hermite polynomials. Therefore, for higher-order moments, the equation (12.20) is immensely complicated.
It is useful to briefly consider how many scalar gyrotropic moments an $n$ th-order tensor $\unicode[STIX]{x1D653}_{(n)}$ has. The scalar gyrotropic moments can be constructed only from combinations of $c_{\Vert }$ and $c_{\bot }^{2}$ . It is beneficial to write down the following self-explanatory table
Obviously, the number of gyrotropic moments for tensor $\unicode[STIX]{x1D653}_{(n)}$ is equal to $1+\text{int}[n/2]$ . The function ‘int’ means integer part. In mathematics, the integer function is often abbreviated with square brackets, as $1+[n/2]$ . However, this notation might be possibly confusing (we used square brackets in many other places), and we prefer to write explicitly ‘int’. Another possibility would be to use the ‘floor’ function.
12.1 Hierarchy of moments in a 1-D geometry (electrostatic)
Let us see if we can use the $n$ th-order moment equation (12.16) for something useful, since the equation is immensely complicated. Let us consider the special case of strictly parallel propagation along $B_{0}$ (in the $z$ -direction), where all the moments (including the velocity) are along $B_{0}$ , i.e. let us consider the electrostatic case. Let us change the index of a moment from $(n)$ to $(l)$ , so that the index is not confused with the number density. Let us stop writing the $\Vert$ subscript on all the variables starting from the pressure, so $p_{\Vert }$ , $q_{\Vert }$ , $r_{\Vert \Vert }$ are simply $p,q,r$ and the $l$ th-order parallel moment is just $X^{(l)}\equiv m\int c_{\Vert }^{l}f\,\text{d}v$ . We keep the $z$ -subscript for $u_{z}$ to remind us we are writing 1-D equations. In a 1-D geometry, the nonlinear evolution equation (12.16) simplifies to
If one wants to be more general, instead of propagation along $B_{0}$ , it is possible to consider propagation along magnetic field line $\hat{\boldsymbol{b}}$ , and write the same equation with $u_{\Vert }$ and $\unicode[STIX]{x2202}_{\Vert }$ . The equation is accompanied by the density and momentum equations
where for the charge $q_{r}$ it is useful to keep the species index $r$ , so that it is not confused with the heat flux $q$ . The last term disappears anyway since in the case considered here $E_{z}=0$ . Evaluating equation (12.21) for $l=2,3,4,5,6\ldots$ yields
and so on. The nonlinear equation (12.21) can be linearized, in the first step by assuming zero mean $u_{z0}$ value, yielding
and calculating it for $l=2-8$ and by prescribing $q_{0}=0$ , $X_{0}^{(5)}=0$ and in general for $l$ odd $X_{0}^{(l)}=0$ , yields
Alternatively, we can split the linearized equations for even and odd $l$ according to
When closing the system, one has to be careful how to deal with $l=\text{even}$ quantities, since we want to separate the ‘deviations’, similarly as we did for $r=3p^{2}/\unicode[STIX]{x1D70C}+\widetilde{r}$ . For a Maxwellian $f_{0}$ and $l\geqslant 4$ and even, we therefore write
or equivalently
so for example $r_{0}=3p_{0}^{2}/\unicode[STIX]{x1D70C}_{0}$ , $X_{0}^{(6)}=15p_{0}^{3}/\unicode[STIX]{x1D70C}_{0}^{2}$ , $X_{0}^{(8)}=105p_{0}^{4}/\unicode[STIX]{x1D70C}_{0}^{3}$ , and linearization of the $l$ th-order moment yields
Using (12.41) in the first term of $l=\text{even}$ equation (12.37), and using the pressure and density equations, yields linear evolution equation for deviations of even moments
and using (12.41) in the second term of $l=\text{odd}$ equation (12.38), and $T/T_{0}=p/p_{0}-\unicode[STIX]{x1D70C}/\unicode[STIX]{x1D70C}_{0}$ yields linear evolution equation for deviations of odd moments
Finishing the calculation with evaluations of $X_{0}^{(l)}$ values, the evolution equations of deviations are
or by using $p_{0}/\unicode[STIX]{x1D70C}_{0}=v_{\text{th}}^{2}/2$ , equivalently
Continuing to higher orders, the hierarchy of deviations of Maxwellian moments therefore evaluates as
The equations are written in physical units, and these results will be useful in Part 2, where we will calculate Landau fluid closures for deviations of various moments.
12.2 Impossibility of going beyond CGL2 without Landau fluid closures
One can analyse the dispersion relations easily in physical units, but since in this Part 1 we used normalized units with $\widetilde{k}=kV_{A}/\unicode[STIX]{x1D6FA}_{p}$ , $\widetilde{\unicode[STIX]{x1D714}}=\unicode[STIX]{x1D714}/\unicode[STIX]{x1D6FA}_{p}$ and $\unicode[STIX]{x1D6FD}=v_{\text{th}}^{2}/V_{A}^{2}$ in almost all dispersion relations, let us rewrite the fluid hierarchy to normalized units, so that everything feels more familiar. The $X^{(5)}$ is normalized with $p_{0}V_{A}^{3}$ , the $X^{(6)}$ with $p_{0}V_{A}^{4}$ and $X^{(n)}$ with $p_{0}V_{A}^{n-2}$ . By dropping the normalization tilde as usual (but obviously not for even moments such as $\widetilde{r}$ , that should be perhaps called $\unicode[STIX]{x1D6FF}r$ ), the equations in Fourier space read
Now, using just first 3 equations (12.55)–(12.57), and closing the system with a closure $q=0$ , yields the CGL model, with solution
the familiar CGL ion-acoustic mode. Going higher in the fluid hierarchy and using the first 4 equations (12.55)–(12.58), and closing the system with $\widetilde{r}=0$ (which is equivalent to prescribing $r=3p^{2}/\unicode[STIX]{x1D70C}$ ), yields the CGL2 model. The dispersion relation is $\unicode[STIX]{x1D714}^{4}-3\unicode[STIX]{x1D6FD}k^{2}\unicode[STIX]{x1D714}^{2}+\frac{3}{4}\unicode[STIX]{x1D6FD}^{2}k^{4}=0$ , with solutions
the already obtained (9.28), (9.29). The heat flux fluctuations in the CGL2 model therefore did ‘split’ the CGL ion-acoustic mode to two modes. One might assume that this will always be the case when going higher and higher in the fluid hierarchy. With the next example we demonstrate that this does not happen. Let us consider fluctuations in $\widetilde{r}$ , and use the first 5 equations (12.55)–(12.59) with the closure $X^{(5)}=0$ . We might call this model CGL3. The dispersion relation reads $\unicode[STIX]{x1D714}^{4}=\frac{15}{4}\unicode[STIX]{x1D6FD}^{2}k^{4}$ , and the solutions are
Importantly, the last two modes are imaginary, and one is unstable. Therefore, such a fluid model does not make any physical sense. Why did this happen? For example, expressing everything through pressure fluctuations, and putting the $\widetilde{r}$ fluctuations on the right-hand side yields
Therefore, if $\widetilde{r}=0$ , so when (12.66) is used, the CGL2 model is obtained. However, when fluctuations in $\widetilde{r}$ are considered, so when (12.67) is used, the term $-3\unicode[STIX]{x1D6FD}k^{2}\unicode[STIX]{x1D714}^{2}$ cancels on both sides, with the resulting dispersion relation
and by prescribing closure $X^{(5)}=0$ , yields the dispersion relation of the CGL3 model.
Now it is easy to go higher in the fluid hierarchy. By considering the first 6 equations (12.55)–(12.60), the dispersion relation reads
and prescribing closure $\widetilde{X}^{(6)}=0$ (which is equivalent to prescribing $X^{(6)}=15p^{3}/\unicode[STIX]{x1D70C}^{2}$ ), yields a model that we can call CGL4, with numerical solutions
again, two modes being imaginary, and one unstable.
The variable $\unicode[STIX]{x1D714}/(k\sqrt{\unicode[STIX]{x1D6FD}})$ , written here in normalized units, is easily rewritten to physical units (reintroducing tilde for clarity)
which is directly related to the ‘kinetic’ variable $\unicode[STIX]{x1D701}$ , that we will use in Part 2 of the text, defined as
where the absolute value does not make any difference now (since all fluid models here contain $\unicode[STIX]{x1D701}^{2}$ ), and will become important only when Landau fluid closures are considered. The hierarchy of CGL dispersion relations then can be written as
All the models beyond CGL2 contain modes that are unstable.
Double check
Let us double check our results, to verify that we calculated everything correctly. As stated previously, a closure should be performed only at the last retained fluid moment, and this is especially true when nonlinear numerical simulations are performed.Footnote 13 The CGL and CGL2 systems were already discussed at great length. Let us check the CGL3 dispersion relation, and let us work directly in physical units. The system of equations that we have in mind reads
and the closure is performed by setting $X^{(5)}=0$ . Importantly, the fourth-order moment $r$ is kept undetermined, and it is not separated to its ‘core’ and $\widetilde{r}$ , nor any specific form of a distribution function is prescribed yet (other than the mean values of odd moments $X_{0}^{(l)}=0$ ). Calculating dispersion relation of this system (a matrix multiplied by vector $(\unicode[STIX]{x1D70C},u_{z},p,q,r)$ ) yields result $\unicode[STIX]{x1D714}^{4}-5k^{4}r_{0}/\unicode[STIX]{x1D70C}_{0}=0$ , which for $r_{0}=3p_{0}^{2}/\unicode[STIX]{x1D70C}_{0}$ recovers the CGL3 dispersion relation. Our calculations were therefore done right. Similarly, double checking the CGL4 dispersion, the system (12.80) is supplemented with (12.33), and a closure is performed with $X^{(6)}=15p^{3}/\unicode[STIX]{x1D70C}$ . Again, the $r$ is left untouched, and dispersion relation of this system recovers the CGL4 result. Checking the CGL5 dispersion relation, the system is supplemented with (12.34), and a closure $X^{(7)}=0$ . Again, the $X^{(6)}$ is kept untouched and generally undetermined. Calculating the dispersion relation yields $\unicode[STIX]{x1D714}^{6}-7k^{6}X_{0}^{(6)}/\unicode[STIX]{x1D70C}_{0}=0$ , and prescribing Maxwellian $X_{0}^{(6)}=15p_{0}^{3}/\unicode[STIX]{x1D70C}_{0}^{2}$ recovers the CGL5 result. Going higher in the hierarchy, and using closure $X^{(8)}=105p^{4}/\unicode[STIX]{x1D70C}^{3}$ recovers the CGL6 results. And finally, keeping all the variables up to $X^{(8)}$ undetermined and using closure $X^{(9)}=0$ yields dispersion relation $\unicode[STIX]{x1D714}^{8}-9k^{8}X_{0}^{(8)}/\unicode[STIX]{x1D70C}_{0}=0$ , which for Maxwellian $X_{0}^{(8)}=105p_{0}^{4}/\unicode[STIX]{x1D70C}_{0}^{3}$ recovers the CGL7 result.
Or, in general, without yet performing a closure, and going higher and higher in the hierarchy step by step, a straightforward calculation yields
and for higher orders therefore
where the validity can be proven by induction and using (12.37), (12.38). CGLl model (or CGL of $l$ th order) is therefore defined as performing a closure on $X^{(l+2)}$ moment. In the equations (12.81)–(12.89) we used the word CGL, even though no specific distribution function (nor any closure) was prescribed yet.
The closure in each model is performed by specifying the right-hand side of (12.88), (12.89), which yields dispersion relations
The results are written in physical units. Alternatively, by using the $\unicode[STIX]{x1D701}$ variable, the results are
which for specific $l$ values recovers results (12.73)–(12.79).
The results (12.90)–(12.91) are written for a closure performed on moment $X_{0}^{(l+2)}$ , which is dictated by the CGLl vocabulary, and which might be perhaps a bit confusing. Alternatively, one can reformulate the results for closures performed on moment $X_{0}^{(l)}$ (corresponding name for a Maxwellian is $\text{CGL}(l-2)$ ) and that read
or rewritten with the $\unicode[STIX]{x1D701}$ variable
a result reported in Hunana et al. (Reference Hunana, Zank, Laurenza, Tenerani, Webb, Goldstein, Velli and Adhikari2018), equation (10). Importantly, solutions of the above dispersion relations for $l>4$ , will always yield results with some complex numbers, and some solutions will be unstable.
Therefore, the last physically meaningful fluid model (without considering the Landau fluid closures) is the CGL2 model, and the last closure is the ‘normal’ closure, $r=3p^{2}/\unicode[STIX]{x1D70C}$ . To go higher in the fluid hierarchy, we necessarily need to make connection to kinetic theory, and consider Landau fluid closures. These closures are described in Part 2 of our guide, and for direct comparison with the simplest Landau fluid dispersion relations, see Part 2, § 3.12 ‘Parallel ion-acoustic (sound) mode, cold electrons’.
13 Conclusions
We offer a brief summary of the major results discussed throughout the text.
∙ A collisionless plasma is completely described by the kinetic Vlasov equation. Directly integrating the Vlasov equation over velocity space, yields evolution equations for an infinite hierarchy of fluid moments. In addition to the usual density and momentum equations, it is possible to derive an evolution equation for a general $n$ th-order fluid moment ( $n\geqslant 2$ ), see (12.16). Then, evolution equations for the pressure tensor and heat flux tensor can be obtained by simply evaluating (12.16) at $n=2$ and $n=3$ .
∙ Even though the fluid hierarchy is infinite, the reformulation of the kinetic description to a fluid formalism is not necessarily complete, since the usual calculations neglect wave–particle resonances in velocity space. These wave–particle resonances, that are implicitly present in the Vlasov equation, are responsible for collisionless damping mechanisms, such as Landau damping, transit-time damping and cyclotron damping.
∙ The presence (or absence) of collisionless damping mechanisms in a fluid hierarchy is determined by the type of closure selected to truncate that fluid hierarchy. We differentiate between two classes of closures: a) ‘classical’ (non-Landau fluid) closures, that neglect wave–particle resonances, and b) Landau fluid closures, that account for Landau wave–particle resonances and associated Landau damping and transit-time damping, addressed in Part 2.
∙ There are currently no known closures that account for cyclotron wave–particle resonances and associated cyclotron damping. Nevertheless, there is a priori no reason why such fluid closures can not be found in the future, at least for the simplified case of electromagnetic propagation along the magnetic field (slab geometry).
∙ Considering ‘classical’ closures, the hierarchy of fluid moments therefore does not contain collisionless damping mechanisms, regardless of the order to which the hierarchy is developed. Or in another words, collisionless damping is beyond all orders in the classical hierarchy of fluid moments.
∙ In fact, it is impossible to go beyond the fourth-order moment with classical bi-Maxwellian fluid closures $\unicode[STIX]{x1D653}^{(n)}=\unicode[STIX]{x1D653}_{0}^{(n)}+\widetilde{\unicode[STIX]{x1D653}}^{(n)}$ , where $\unicode[STIX]{x1D653}_{0}^{(n)}$ represents bi-Maxwellian value of $\unicode[STIX]{x1D653}^{(n)}$ and its perturbation $\widetilde{\unicode[STIX]{x1D653}}^{(n)}=0$ , since all the fluid models contain higher-order modes that are unstable. This is perhaps the most surprising result discussed in Part 1. For a detailed proof, see § 12.2, where a 1-D geometry is considered and a bi-Maxwellian closure is used at the $n$ th-order fluid moment, which yields dispersion relation (12.96). The same dispersion relation will be valid in a 3-D geometry for the propagation along the magnetic field, where additional de-coupled modes will be present as well. For $n>4$ , all the fluid models contain unphysical instabilities. The same result is expected for other distribution functions.
∙ The classical bi-Maxwellian closure at the fourth-order moment reads $r_{\Vert \Vert }=3(p_{\Vert }^{2}/\unicode[STIX]{x1D70C});r_{\Vert \bot }=p_{\Vert }p_{\bot }/\unicode[STIX]{x1D70C};r_{\bot \bot }=2(p_{\bot }^{2}/\unicode[STIX]{x1D70C})$ (or equivalently $\widetilde{\boldsymbol{r}}=0$ ), and is called the ‘normal’ closure. Therefore, the ‘normal’ closure is the last classical fluid closure ( $\widetilde{\unicode[STIX]{x1D653}}^{(n)}=0$ ) and beyond the fourth-order moment, Landau fluid closures ( $\widetilde{\unicode[STIX]{x1D653}}^{(n)}\neq 0$ ) are required.
∙ From a collisionless (or weakly collisional) perspective, the long-wavelength low-frequency limit of a distribution function is not necessarily an isotropic Maxwellian, but a general distribution function that is gyrotropic, i.e. isotropic only in its perpendicular velocity components. Fluid moments are therefore typically decomposed into their gyrotropic and non-gyrotropic parts. For example the pressure tensor $\boldsymbol{p}=\boldsymbol{p}^{\text{g}}+\unicode[STIX]{x1D72B}$ , the heat flux tensor $\boldsymbol{q}=\boldsymbol{q}^{\text{g}}+\boldsymbol{q}^{\text{ng}}$ , the fourth-order moment $\boldsymbol{r}=\boldsymbol{r}^{\text{g}}+\boldsymbol{r}^{\text{ng}}$ . The non-gyrotropic parts are also referred to as the FLR corrections, since they represent deviations from gyrotropy at small spatial scales, when the Larmor radius (gyroradius) is not infinitely small.
∙ A general $n$ th-order fluid moment contains $1+\text{int}[n/2]$ scalar gyrotropic moments, where the function ‘int’ means integer part.
∙ The pressure tensor is second-order fluid moment and contains two gyrotropic moments, $p_{\Vert }$ and $p_{\bot }$ . Therefore, any fluid description of collisionless (or weakly collisional) plasmas, requires two separate evolution equations for $p_{\Vert }$ and $p_{\bot }$ . Importantly, from a linear perspective the evolution equations remain different even when the distribution function is isotropic, i.e. even when the mean pressure values are equal ( $p_{\Vert }^{(0)}=p_{\bot }^{(0)}$ ), since pressure fluctuations in the directions parallel and perpendicular to the local magnetic field remain anisotropic. We often use proton pressure (temperature) anisotropy coefficient $a_{p}=T_{\bot }^{(0)}/T_{\Vert }^{(0)}=p_{\bot }^{(0)}/p_{\Vert }^{(0)}$ .
∙ The simplest fluid model that describes a collisionless plasma in an adiabatic regime is the CGL description (model) – named after Chew–Goldberger–Law (Chew et al. Reference Chew, Goldberger and Low1956). The model is non-dispersive and obtained by a closure with zero heat flux. The CGL description should be viewed as collisionless magnetohydrodynamics (collisionless MHD), since the usual MHD description with isotropic scalar pressure is highly collisional implicitly.
∙ The CGL pressure tensor is decomposed with respect to the direction of the local magnetic field lines according to $\boldsymbol{p}^{\text{g}}=p_{\Vert }\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}+p_{\bot }(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})$ . It is possible to switch to the reference frame of the magnetic field lines, where the pressure tensor is a diagonal matrix. However, it should not be forgotten that in a such a reference frame, imposing an external (mean) magnetic field is not straightforward. For nonlinear numerical simulations of turbulence and plasma heating, the laboratory reference frame is strongly recommended.
∙ The two pressure equations of the CGL model can be interpreted as conservation laws for the first and second adiabatic invariants, see § 2.5. The first adiabatic invariant is a conservation of the magnetic moment of a particle that is periodically gyrating around a mean magnetic field. The second adiabatic invariant is a conservation of the average parallel momentum of a particle that is completely trapped and periodically bouncing inside of a magnetic bottle.
∙ Without performing (yet) any kind of closure and leaving the heat flux tensor $\boldsymbol{q}$ and FLR pressure tensor $\unicode[STIX]{x1D72B}$ unspecified, it is possible to derive rigorously exact evolution equations for $p_{\Vert }$ and $p_{\bot }$ , see (2.91)–(2.92).
∙ Rewriting the pressure equations to an alternative form, equations (2.100)–(2.101), shows that the adiabatic invariants in the CGL model are broken by the inclusion of the Hall term in the induction equation, gyrotropic heat flux, non-gyrotropic FLR corrections to the pressure and heat flux, as well as by coupling of various species together. All of these contributions therefore yield very complicated anisotropic plasma heating processes, that can be studied only by nonlinear numerical simulations.
∙ The anisotropic plasma heating does not simplify much even in the case of periodic boundary conditions (i.e. when the system can be viewed as completely isolated) and when averaging over the entire spatial volume is performed and expressed as a conservation of total energy, see (2.113)–(2.116). Only when the anisotropic heating is neglected, i.e. when only total plasma heating studied, the conservation of energy significantly simplify, see (2.109)–(2.111).
∙ By using the concept of polytropic indices, the pressure equations in the ideal CGL model can be interpreted as having $\unicode[STIX]{x1D6FE}_{\Vert }=3$ and $\unicode[STIX]{x1D6FE}_{\bot }=2$ . Consequently, the CGL dispersion relation is in general not equal to the MHD dispersion relation, even when $a_{p}=1$ . The exception is the Alfvén mode, which for $a_{p}=1$ propagates with the same phase speed $\unicode[STIX]{x1D714}/k=V_{A}\cos \unicode[STIX]{x1D703}$ in both CGL and MHD models, for all the propagation directions $\unicode[STIX]{x1D703}$ .
∙ Polytropic indices can be further related to the number of degrees of freedom $i$ through $\unicode[STIX]{x1D6FE}=(i+2)/i$ , which for the CGL model yields $i_{\Vert }=1$ and $i_{\bot }=2$ . The CGL pressure equations can be therefore viewed as being composed of strongly coupled 1-D and 2-D dynamics, whereas the dynamics in the MHD model with $\unicode[STIX]{x1D6FE}=5/3$ and $i=3$ can be viewed as isotropically three-dimensional.
∙ In the CGL model, the Alfvén mode propagates with the phase speed $\unicode[STIX]{x1D714}/k=V_{A}\cos \unicode[STIX]{x1D703}\sqrt{1+(\unicode[STIX]{x1D6FD}_{\Vert }/2)(a_{p}-1)}$ , which matches the (collisionless) kinetic theory in the long-wavelength limit. Consequently, for $a_{p}=1$ , the Alfvén mode in the MHD model matches the kinetic theory as well. The CGL result is very useful when numerically solving kinetic dispersion relations (for example with the WHAMP solver), since the Alfvén mode can be easily identified at long wavelengths.
∙ In contrast to MHD, where the ordering of phase speeds is always $v_{s}\leqslant v_{A}\leqslant v_{f}$ (slow, Alfvén, fast), in collisionless plasmas the oblique slow mode can become faster than the oblique Alfvén mode. In general, oblique slow and fast modes in the CGL model do not necessarily match the kinetic theory in the long-wavelength limit. Nevertheless, the effect when $v_{s}>v_{A}$ is present in the CGL model and exists even for $a_{p}=1$ , see § 3.6. The effect is very important for the parallel firehose instability.
∙ Considering strictly perpendicular propagation ( $\unicode[STIX]{x1D703}=90^{\circ }$ or $k_{\Vert }=0$ ), the fast mode in the CGL model propagates with the phase speed $\unicode[STIX]{x1D714}/k=V_{A}\sqrt{1+a_{p}\unicode[STIX]{x1D6FD}_{\Vert }}$ , which matches the kinetic theory in the long-wavelength limit. The expression can be rewritten as $(\unicode[STIX]{x1D714}/k)^{2}=V_{A}^{2}+v_{\text{th}\bot }^{2}=V_{A}^{2}+2(p_{\bot }^{(0)}/\unicode[STIX]{x1D70C}_{0})$ , and can be contrasted with the MHD result $(\unicode[STIX]{x1D714}/k)^{2}=V_{A}^{2}+\frac{5}{3}(p^{(0)}/\unicode[STIX]{x1D70C}_{0})$ . The factor of 2 in the CGL result comes from $\unicode[STIX]{x1D6FE}_{\bot }=2$ , which shows that the adiabatic assumption of the CGL model is appropriate for the perpendicular fast mode.
∙ Contributions of the Hall term completely disappear for strictly perpendicular propagation.
∙ It is more difficult to analytically show where the adiabatic CGL value $\unicode[STIX]{x1D6FE}_{\Vert }=3$ is appropriate, since for example the parallel propagating ion-acoustic (sound) mode is strongly Landau damped in kinetic theory (unless the electrons are hot), which also modifies its real frequency. The ion-acoustic mode in the CGL model therefore does not match kinetic theory. One needs to consider Landau fluid models with proton and electron species, which is addressed in Part 2.
∙ Nevertheless, when electrons are hot ( $T_{e}\gg T_{i}$ ) and the ion-acoustic mode is Landau damped only weakly, the real phase speed obtained from kinetic theory in the long-wavelength limit reads $\unicode[STIX]{x1D714}/k=\sqrt{(T_{e}+3T_{i})/m_{i}}$ . Thus, in this example the kinetic theory is matched by a fluid model where protons are described adiabatically (with the CGL value $\unicode[STIX]{x1D6FE}_{\Vert }=3$ ), while electrons are isothermal (with $\unicode[STIX]{x1D6FE}_{\Vert }=1$ ). Also, a good example for $\unicode[STIX]{x1D6FE}_{\Vert }=3$ is the real frequency of the Langmuir mode, $\unicode[STIX]{x1D714}^{2}=\unicode[STIX]{x1D714}_{pe}^{2}+3(T_{\Vert e}^{(0)}/m_{e})k^{2}$ , which is addressed in Part 2.
∙ For anisotropic temperatures ( $a_{p}\neq 1$ ), some modes can become unstable. A threshold of an instability that is obtained in the long-wavelength (non-dispersive) limit is called a ‘hard’ threshold. The CGL model contains 3 instabilities, the oblique firehose instability, the parallel firehose instability and the mirror instability. The mirror instability is not described correctly.
∙ The oblique firehose instability is an instability of the oblique Alfvén mode. Since the Alfvén mode is described correctly by the CGL model, its instability threshold $1+(\unicode[STIX]{x1D6FD}_{\Vert }/2)(a_{p}-1)<0$ matches the kinetic theory. The instability requires $a_{p}<1$ and (in the long-wavelength limit) $\unicode[STIX]{x1D6FD}_{\Vert }>2$ . Since the CGL model is non-dispersive, here the instability growth rate only has a simple $\cos \unicode[STIX]{x1D703}$ dependence. Once dispersive effects are considered, the instability reaches maximum growth rate at some wavenumber and angle $\unicode[STIX]{x1D703}$ that is oblique, see figure 10.
∙ The parallel firehose instability is an instability of the whistler mode, which can be shown by considering the Hall-CGL model. However, considering usual (non-causal) analytic solutions (4.32)–(4.35), one arrives at a contradiction that for $\unicode[STIX]{x1D714}_{r}>0$ the whistler mode is unstable, and that for $\unicode[STIX]{x1D714}_{r}<0$ the whistler mode is stable. The contradiction arises, because at the range of wavenumbers where the firehose instability exists, the ion-cyclotron and whistler modes are completely degenerate and distinguishing between them loses sense. The problem is resolved by introducing a small $({\sim}\unicode[STIX]{x1D716})$ causal dissipation into the momentum equations, which is later removed by the limit $\lim _{\unicode[STIX]{x1D716}\rightarrow 0^{+}}$ . The procedure yields causal dispersion relations (4.43)–(4.46), where the whistler mode is firehose unstable for both positive and negative $\unicode[STIX]{x1D714}_{r}$ , and the ion-cyclotron mode is stable.
∙ The parallel firehose instability occurs for quasi-parallel (small $\unicode[STIX]{x1D703}$ ) propagation directions, with the maximum growth rate at $\unicode[STIX]{x1D703}=0^{\circ }$ , see figure 10. In the non-dispersive CGL model it corresponds to the instability of the slow mode. Its threshold obtained for $\unicode[STIX]{x1D703}=0^{\circ }$ is equivalent to the oblique firehose instability (since $v_{s}=v_{A}$ ), and matches the kinetic theory. At first it might sound surprising that the slow mode connects to the whistler mode once dispersive effects are considered. However, considering $\unicode[STIX]{x1D703}=0^{\circ }$ , $\unicode[STIX]{x1D6FD}_{\Vert }>2$ , $a_{p}<1$ yields that the fast mode is the ion-acoustic (sound) mode and $v_{s}=v_{A}$ , see figure 1. After dispersive effects are introduced, which split the Alfvén and slow mode to the ion-cyclotron and whistler modes, it is clarified that the instability is associated with the whistler mode. For quasi-parallel propagation directions the situation is more obvious, since $v_{s}>v_{A}$ (see figure 1), and the slow mode clearly becomes the whistler mode.
∙ In the non-dispersive CGL model, all 3 instabilities are non-propagating, i.e. purely growing with zero real frequency. Once dispersive effects are introduced, the parallel firehose instability becomes propagating.
∙ The mirror instability requires $a_{p}>1$ and usually develops for highly oblique propagation angles. In the CGL model it corresponds to the instability of the slow mode. Its threshold obtained in the highly oblique (but not completely perpendicular) limit reads $1+a_{p}\unicode[STIX]{x1D6FD}_{\Vert }-\frac{1}{6}a_{p}^{2}\unicode[STIX]{x1D6FD}_{\Vert }<0$ , and with respect to kinetic theory the $1/6$ factor is erroneous. The erroneous $1/6$ factor shows that the adiabatic CGL closure is not appropriate for the very slow dynamics of the mirror instability.
∙ The simplest fluid closure that recovers the correct threshold of the mirror instability is the ‘static’ closure, see § 9.4. This closure is derived from the ‘normal’ closure in a simplified quasi-static approximation, and represents a generalization of the isothermal closure (in the presence of temperature anisotropy and variations of magnetic field strength). The ‘static’ closure is very useful since it clarifies that the highly oblique very slow dynamics of the slow mode (and of the associated mirror instability) is better described by a generalized isothermal closure than the adiabatic CGL closure.
∙ However, the ‘static’ closure modifies the dynamics of the perpendicular fast mode, that now does not match the kinetic theory. Additionally, the fast mode now experiences an unphysical instability, even though the instability is beyond the mirror threshold, and should perhaps not play a role in numerical simulations. The erroneous result shows that the generalized isothermal closure is not appropriate for the relatively fast dynamics of the perpendicular fast mode, where the adiabatic CGL closure is appropriate.
∙ Heuristically, the correct mirror threshold can be also obtained by keeping the CGL value $\unicode[STIX]{x1D6FE}_{\bot }=2$ and modifying the parallel polytropic index to $\unicode[STIX]{x1D6FE}_{\Vert }=1/2$ . Such a closure does not alter the dynamics of the perpendicular fast mode, since the phase speed depends only on $\unicode[STIX]{x1D6FE}_{\bot }$ (and not on $\unicode[STIX]{x1D6FE}_{\Vert }$ ). Thresholds of parallel and oblique firehose instability are not altered either. Nevertheless, the $\unicode[STIX]{x1D6FE}_{\Vert }=1/2$ is even below the isothermal value $\unicode[STIX]{x1D6FE}_{\Vert }=1$ , the closure is heuristic, and it is mentioned only as a curiosity.
∙ Obviously, to correctly capture both the slow dynamics of the highly oblique mirror instability, as well as the fast dynamics of the perpendicular fast mode, neither adiabatic nor (generalized) isothermal closures are sufficient. Considering classical (rigorously derived) closures, one has no other choice but to keep the correct adiabatic CGL values $\unicode[STIX]{x1D6FE}_{\Vert }=3$ , $\unicode[STIX]{x1D6FE}_{\bot }=2$ unaltered, and instead, break the (long-wavelength) adiabaticity by going higher in the fluid hierarchy. One needs to consider fluid models with evolution equations for the heat flux tensor $\boldsymbol{q}$ , that has two gyrotropic components, $q_{\Vert }$ and $q_{\bot }$ . The system is closed at the fourth-order moment $\boldsymbol{r}$ , that has 3 gyrotropic components, $r_{\Vert \Vert };r_{\Vert \bot };r_{\bot \bot }$ . To perform a closure, a specific distribution function (or a class of distribution functions) has to be assumed.
∙ Considering a bi-Maxwellian distribution function, the closure $\widetilde{\boldsymbol{r}}=0$ is known as the ‘normal’ closure (see above). We call this non-dispersive bi-Maxwellian fluid model the ‘second-order CGL’, abbreviated as CGL2. The abbreviation CGL2 seems beneficial, since the name CGL is associated with collisionless magnetohydrodynamics, now just taken to one higher order. Additionally, by showing that all classical bi-Maxwellian models beyond CGL2 do not make physical sense, fluid models can be easily classified as based on CGL or CGL2 descriptions. For direct comparison of the two systems, see dispersion relations of the CGL model (3.193) and the CGL2 model (9.23). The Hall term introduces the simplest dispersive effects; see the dispersion relations of the Hall-CGL model (4.81), and the Hall-CGL2 model (9.40).
∙ By comparing the dispersion relations, it is obvious that the gyrotropic heat flux fluctuations in the CGL2 model modify the general dynamics (the mean heat flux values are zero $q_{\Vert }^{(0)}=q_{\bot }^{(0)}=0$ ). One of the exceptions is the perpendicular fast mode, which is not altered by the gyrotropic heat fluxes, and therefore described correctly. Other exceptions are the oblique Alfvén mode (the oblique firehose instability), and once dispersive effects are introduced also the parallel propagating ion-cyclotron and whistler modes (the parallel firehose instability).
∙ Two additional evolution equations typically create two additional modes. For example, for the case of highly oblique CGL2 propagation, one obtains the Alfvén mode, the fast mode, and 3 ‘slow’ modes. One of these slow modes is responsible for the mirror instability, and is sometimes called the mirror mode. The CGL2 mirror threshold matches the kinetic theory. The CGL2 model is therefore the simplest classical model that correctly captures (in the long-wavelength limit) both the mirror instability threshold, as well as the perpendicular fast mode. Nevertheless, only the ‘hard’ mirror threshold is correctly recovered, and capturing the (long-wavelength) mirror instability growth rate requires Landau damping.
∙ One can construct closures for other distribution functions than bi-Maxwellians. For example, one can consider the bi-kappa distribution function (10.1) with a free parameter $\unicode[STIX]{x1D705}>3/2$ . The closure then reads $r_{\Vert \Vert }=3\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}(p_{\Vert }^{2}/\unicode[STIX]{x1D70C});r_{\Vert \bot }=\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}(p_{\Vert }p_{\bot }/\unicode[STIX]{x1D70C});r_{\bot \bot }=2\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}(p_{\bot }^{2}/\unicode[STIX]{x1D70C})$ , where the coefficient $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}=(\unicode[STIX]{x1D705}-\frac{3}{2})/(\unicode[STIX]{x1D705}-\frac{5}{2})$ . The closure is restricted to $\unicode[STIX]{x1D705}>\frac{5}{2}$ , which can be seen from the $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}$ coefficient, and the same requirement is necessary for the convergence of the velocity integrals. We call this closure and resulting fluid model simply ‘BiKappa’.
∙ If a closure for the bi-kappa distribution is performed at a higher $n$ th-order moment (which is presumably not possible with classical closures since it is expected that for $n>4$ all models contain unphysical instabilities), the closure will be restricted to $\unicode[STIX]{x1D705}>(n+1)/2$ , which increases with $n$ . The requirement for the minimum $\unicode[STIX]{x1D705}$ value is there, since the power law in the bi-kappa distribution is assumed all the way till infinite velocities, and the requirement is necessary for the convergence of the velocity integrals. Therefore, by going higher and higher in the bi-kappa fluid hierarchy, the minimum value of $\unicode[STIX]{x1D705}$ increases, and in the limit $\unicode[STIX]{x1D705}\rightarrow \infty$ the required distribution function converges towards bi-Maxwellian.
∙ The dispersion relation for the BiKappa model is given by (10.53). A general $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}$ coefficient is used, so the dispersion relation is valid for a much larger class of distribution functions that can be closed with an analogous closure with just one $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}$ coefficient. The dispersion relation shows that the general dynamics is of course altered by the value of the $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}$ coefficient. Nevertheless, few exceptions are: the oblique Alfvén mode (oblique firehose threshold), the perpendicular fast mode, the parallel firehose threshold and the mirror threshold. When dispersive effects are considered, and figures similar to figure 10 created, the growth rates for oblique propagation will of course be affected by the $\unicode[STIX]{x1D6FC}_{\unicode[STIX]{x1D705}}$ value. The Hall-BiKappa dispersion relation is given by (10.69).
∙ The non-gyrotropic (FLR) corrections $\unicode[STIX]{x1D72B}$ to the pressure tensor were studied in § 5. The tensor $\unicode[STIX]{x1D72B}$ is described by the pressure tensor equation implicitly. To prevent introducing several new independent fluid variables for components of the tensor $\unicode[STIX]{x1D72B}$ , each with its own evolution equation, expansion on temporal and spatial scales is required. Correct evaluation of the FLR tensor with respect to magnetic field lines is quite cumbersome, and at the leading order (in temporal and spatial scales) one obtains (5.43)–(5.45), or equivalently (5.51)–(5.53). The FLR tensor $\unicode[STIX]{x1D72B}$ is therefore often evaluated in the linear approximation, which is appropriate when the magnetic field lines are not too distorted.
∙ Even though the FLR pressure tensor $\unicode[STIX]{x1D72B}$ has a zero trace, its contributions should not be called ‘off diagonal’, since the diagonal components are non-zero. In the collisionless case considered here with the mean field in the $z$ -direction, only $\unicode[STIX]{x1D6F1}_{zz}=0$ , and $\unicode[STIX]{x1D6F1}_{xx}=-\unicode[STIX]{x1D6F1}_{yy}\neq 0$ . In the collisional case (which was not considered) also $\unicode[STIX]{x1D6F1}_{zz}\neq 0$ .
∙ For the purpose of comparing various contributions, we differentiate between FLR1, FLR2 and FLR3 corrections, even though the classification can be a bit blurry and there are various possibilities. Perhaps the best definition that we used later is as follows. The classical FLR1 tensor $\unicode[STIX]{x1D72B}$ contains only velocity gradients, see (5.74). The FLR2 tensor also contains $\unicode[STIX]{x2202}\unicode[STIX]{x1D72B}/\unicode[STIX]{x2202}t$ and the Hall term from the induction equation, see (5.109). It can also contain gyrotropic heat flux contributions, see (5.140). The FLR3 contains non-gyrotropic heat flux vectors $\boldsymbol{S}_{\bot }^{\Vert }$ , $\boldsymbol{S}_{\bot }^{\bot }$ , described in § 5.8, with the detailed algebra presented in appendix D. It can also contain the non-gyrotropic heat flux tensor $\unicode[STIX]{x1D748}$ , which we neglected. For complete clarity in analysing the dispersion relations, the entire Hall-CGL-FLR3 model (linearized in the $x$ – $z$ plane, normalized and Fourier transformed) is written down in § 5.9, see (5.160)–(5.164).
∙ Considering the strictly perpendicular fast mode, kinetic theory yields the leading-order FLR corrections (in the long-wavelength limit) to the phase speed in the following form $(\unicode[STIX]{x1D714}/k)^{2}=V_{A}^{2}(1-\frac{1}{8}k^{2}\unicode[STIX]{x1D70C}_{i}^{2})+v_{\text{th}\bot }^{2}(1-\frac{5}{16}k^{2}\unicode[STIX]{x1D70C}_{i}^{2})$ . The kinetic result is reproduced by the CGL-FLR3 model (the Hall contributions are zero). Importantly, both the first- and second-order non-gyrotropic heat flux vectors have to be retained.
∙ If the second-order non-gyrotropic heat flux contributions are neglected in the FLR3 model, yields solution for the perpendicular fast mode (5.159). In such a model, the correction to the Alfvén speed is captured correctly, however, the correction to the thermal speed is $+\frac{1}{16}k^{2}\unicode[STIX]{x1D70C}_{i}^{2}$ instead of $-\frac{5}{16}k^{2}\unicode[STIX]{x1D70C}_{i}^{2}$ , i.e. the correction has the wrong sign. This is surprising, since the solution of the FLR2 model, equation (5.120), has a correction to the thermal speed $-\frac{1}{16}k^{2}\unicode[STIX]{x1D70C}_{i}^{2}$ , i.e. at least the sign is correct. Finally, the FLR1 model yields solution (5.119), and does not capture any correction to the Alfvén speed, additionally, the correction to the thermal speed has the wrong sign as well.
∙ We used the Hall-CGL-FLR3 model to investigate the parallel and oblique firehose instability, and compare it with results of the FLR2 and FLR1 models, see figures 7–11. It is shown that the growth rates of the parallel and oblique firehose instability are strongly enhanced by the non-gyrotropic heat flux vectors of the FLR3 model, see figure 10 (see the scales of the colour bars).
∙ In general, when the maximum growth rates are sufficiently large (let us say $\unicode[STIX]{x1D714}_{i}/\unicode[STIX]{x1D6FA}\sim 0.1$ ) and not tiny (such as $0.001$ ), the FLR3 model reproduces the kinetic results with unexpectedly good accuracy. See for example bottom panels of figure 10. Not only is the value of the maximum growth rate for the oblique firehose instability (red lines) approximately the same in the FLR3 model and kinetic theory, the maximum growth rate is also reached approximately for the same angle of propagation and the same wavenumber.
∙ The oblique firehose instability can be (in general) reproduced with better accuracy than the parallel firehose instability. Nevertheless, for very high $\unicode[STIX]{x1D6FD}_{\Vert }$ values, the parallel firehose instability is reproduced very accurately, see figure 9.
∙ Importantly, we show that the non-gyrotropic heat flux vectors in the FLR3 model, partially reproduce the large ‘bump’ in the imaginary phase speed (growth rate normalized to the wavenumber), when the plasma is close to the long-wavelength limit ‘hard’ firehose threshold, see figure 7. The result clearly shows that, similarly to kinetic theory, fluid models can develop the firehose instability at small spatial scales, even when they are stable in the long-wavelength limit. Or in other words, fluid models can become unstable even before the ‘hard’ firehose threshold is reached. This unexpected result is perhaps the second most surprising result discussed in Part 1.
Acknowledgements
We acknowledge support of the NSF EPSCoR RII-Track-1 Cooperative Agreement no. OIA-1655280 ‘Connecting the Plasma Universe to Plasma Technology in Alabama’, led by G.P.Z. This work was supported by the European Research Council in the frame of the Consolidating grant ERC-2017-CoG771310-PI2FA ‘Partial Ionisation: Two-Fluid Approach’, led by Elena Khomenko. A.T. acknowledges support of the NASA Heliophysics Supporting Research grant no. 80NSSC18K1211. P.H. thanks T. Passot, M. Laurenza, N. Vitas, P. Hellinger and S. P. Gary for many useful discussions. We are also very thankful to two (out of three) anonymous referees whose comments and suggestions had a great impact on this text. Significant effort has been made to eliminate all the misprints from the equations. However, we will amend possible misprints, if found, in a corrigendum.
Appendix A. Fourier transformations
If we have an equation in real space, we decompose each quantity as a superposition of waves according to
Most of plasma physics books and kinetic papers use $\text{e}^{\text{i}\boldsymbol{k}\boldsymbol{\cdot }\boldsymbol{x}-\text{i}\unicode[STIX]{x1D714}t}$ in the definition (A 1). Decomposing the frequency to real and imaginary parts $\unicode[STIX]{x1D714}=\unicode[STIX]{x1D714}_{r}+\text{i}\unicode[STIX]{x1D714}_{i}$ , so $\text{e}^{-\text{i}\unicode[STIX]{x1D714}t}=\text{e}^{-\text{i}\unicode[STIX]{x1D714}_{r}t}\text{e}^{+\unicode[STIX]{x1D714}_{i}t}$ , implies that a wave with $\unicode[STIX]{x1D714}_{i}>0$ grows (i.e. has a positive growth rate) and a wave with $\unicode[STIX]{x1D714}_{i}<0$ is damped. Of course, an alternative definition of (A 1) with $\text{e}^{-\text{i}\boldsymbol{k}\boldsymbol{\cdot }\boldsymbol{x}+\text{i}\unicode[STIX]{x1D714}t}$ is allowed, one just has to remember that a wave with $\unicode[STIX]{x1D714}_{i}>0$ is damped and a wave with $\unicode[STIX]{x1D714}_{i}<0$ grows. However, in Part 2 of the text, we will calculate the Landau damping and we will see that following all the correct minus signs in the Landau integral can be very confusing. We therefore recommend using the first decomposition, where waves with $\unicode[STIX]{x1D714}_{i}<0$ are damped. This choice yields that Fourier transformations in the $x$ – $z$ plane (with $B_{0}$ in the $z$ -direction) are performed according to a shortcut
The transformation (A 1) is technically the inverse/backward Fourier transform ${\mathcal{F}}^{-1}\hat{f}(\boldsymbol{k},\unicode[STIX]{x1D714})$ . The forward Fourier transform reads
The location of the normalization constants $1/(2\unicode[STIX]{x03C0})$ is an ad hoc choice, one just needs to be consistent in using them, especially when calculating convolutions.
Appendix B. Generalized vector (cross) product
In the collisionless fluid hierarchy one encounters a vector product between vectors and tensors, as for example in the pressure tensor equation (2.26). It is useful to clarify the definition of this generalized vector product. Consider matrix $\unicode[STIX]{x1D63C}$ , which is written as a tensor product between vectors $\boldsymbol{a},\boldsymbol{b}$ , so $\unicode[STIX]{x1D63C}=\boldsymbol{a}\otimes \boldsymbol{b}=\boldsymbol{a}\boldsymbol{b}$ , where, as everywhere in this text, we omit writing the tensor product $\otimes$ . Or in the index notation $\unicode[STIX]{x1D608}_{ij}=a_{i}b_{j}$ . Now consider vector $\boldsymbol{c}$ , and how can one define a generalized vector product $\boldsymbol{c}\times \unicode[STIX]{x1D63C}$ , by using the usual vector product between two vectors. The natural definition is $\boldsymbol{c}\times \unicode[STIX]{x1D63C}=\boldsymbol{c}\times (\boldsymbol{a}\boldsymbol{b})\equiv (\boldsymbol{c}\times \boldsymbol{a})\boldsymbol{b}$ , or in the index notation $(\boldsymbol{c}\times \unicode[STIX]{x1D63C})_{ij}=\unicode[STIX]{x1D716}_{ikl}c_{k}a_{l}b_{j}=\unicode[STIX]{x1D716}_{ikl}c_{k}\unicode[STIX]{x1D63C}_{lj}.$ Now let us consider how to define the much more confusing $\unicode[STIX]{x1D63C}\times \boldsymbol{c}$ . The natural definition is $\unicode[STIX]{x1D63C}\times \boldsymbol{c}=(\boldsymbol{a}\boldsymbol{b})\times \boldsymbol{c}\equiv \boldsymbol{a}(\boldsymbol{b}\times \boldsymbol{c})$ , and in the index notation $(\unicode[STIX]{x1D63C}\times \boldsymbol{c})_{ij}=a_{i}\unicode[STIX]{x1D716}_{jkl}b_{k}c_{l}=\unicode[STIX]{x1D716}_{jkl}\unicode[STIX]{x1D63C}_{ik}c_{l}$ . Now it is straightforward to show that
Generalizing the vector product to higher-order tensors is easy and in the fluid hierarchy we always encounter a vector $\hat{\boldsymbol{b}}$ , and tensors $\boldsymbol{p},\boldsymbol{q},\boldsymbol{r}$ , let us therefore write
The vector product is very useful for decomposing quantities to the directions parallel and perpendicular with respect to $\hat{\boldsymbol{b}}$ . For example, the usual decomposition of the velocity $\boldsymbol{u}$ reads
Nevertheless, it is possible to directly obtain $\boldsymbol{u}_{\bot }$ , by applying $\hat{\boldsymbol{b}}\times$ twice on $\boldsymbol{u}$ , so that
or $\boldsymbol{u}_{\bot }=\hat{\boldsymbol{b}}\times \boldsymbol{u}\times \hat{\boldsymbol{b}}$ . Similar decompositions can be made for tensors. For example, a general matrix $\unicode[STIX]{x1D72B}$ can be decomposed according to
where the perpendicular component can be written as $\unicode[STIX]{x1D72B}\boldsymbol{\cdot }(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})=-(\unicode[STIX]{x1D72B}\times \hat{\boldsymbol{b}})\times \hat{\boldsymbol{b}}$ , i.e. where all the $\hat{\boldsymbol{b}}$ act on $\unicode[STIX]{x1D72B}$ from the right. Alternatively, one can write a decomposition when all the $\hat{\boldsymbol{b}}$ act on the $\unicode[STIX]{x1D72B}$ from the left, $\unicode[STIX]{x1D72B}=\hat{\boldsymbol{b}}(\hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D72B})+(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})\boldsymbol{\cdot }\unicode[STIX]{x1D72B}$ , where the perpendicular part $(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})\boldsymbol{\cdot }\unicode[STIX]{x1D72B}=-\hat{\boldsymbol{b}}\times (\hat{\boldsymbol{b}}\times \unicode[STIX]{x1D72B})$ .
Appendix C. MHD dispersion relation
We assume that many people reading this text will have some previous experience with MHD. It is therefore beneficial to obtain the MHD dispersion relation, so that it will be clear how more advanced fluid models are treated in this text. The MHD fluid model reads
where $\unicode[STIX]{x1D6FE}=5/3$ . As in advanced fluid models, it is beneficial to normalize the speed with respect to the Alfvén speed $V_{A}$ , and normalize the length with respect to the ion inertial length $d_{i}=V_{A}/\unicode[STIX]{x1D6FA}_{p}$ , i.e. by using normalizations (3.23)–(3.28), which in Fourier space yields $\widetilde{k}=kd_{i}$ , $\widetilde{\unicode[STIX]{x1D714}}=\unicode[STIX]{x1D714}/\unicode[STIX]{x1D6FA}_{p}$ . Normalizing the MHD equations and dropping the tilde yields unchanged density, induction and pressure equations, and the momentum equation reads
One can decide how to rewrite $p^{(0)}/(\unicode[STIX]{x1D70C}_{0}V_{A}^{2})$ , either by introducing the usual MHD sound speed $C_{s}^{2}=\unicode[STIX]{x1D6FE}p^{(0)}/\unicode[STIX]{x1D70C}_{0}$ , or by introducing the plasma beta
where we use the Boltzmann constant $k_{B}=1$ , see the footnote 3 after the definition of thermal speeds (3.30). By specifying the mean magnetic field to be in the $z$ -direction, the normalized equations are linearized according to
and without a loss of generality, we consider propagation in the $x$ – $z$ plane (with $\unicode[STIX]{x2202}_{y}=0$ ). By considering a wave propagating with wavenumber $\boldsymbol{k}=(k\sin \unicode[STIX]{x1D703},0,k\cos \unicode[STIX]{x1D703})=(k_{x},0,k_{z})=(k_{\bot },0,k_{\Vert })$ , i.e. a wave propagating in the direction that makes an angle $\unicode[STIX]{x1D703}$ with respect to $\boldsymbol{B}_{0}$ , the MHD system written in Fourier space reads
Calculating the determinant of this system yields the MHD dispersion relation in normalized units (bringing back tildes for clarity) that reads
or alternatively by using (C 6) with the sound speed
or in physical units
yielding the MHD dispersion relation for the Alfvén mode $\unicode[STIX]{x1D714}_{A}=\pm V_{A}k\cos \unicode[STIX]{x1D703}=\pm V_{A}k_{\Vert }$ (sometimes called the shear Alfvén mode), and the slow and fast modes (3.110).
In the limit $\unicode[STIX]{x1D6FD}\rightarrow 0$ that corresponds to a cold plasma ( $C_{s}\rightarrow 0$ ) or a plasma with a very strong magnetic field ( $V_{A}\rightarrow \infty$ ), the slow mode $\unicode[STIX]{x1D714}_{s}=\pm C_{s}k_{\Vert }$ , and the fast mode $\unicode[STIX]{x1D714}_{f}=\pm V_{A}k$ (sometimes called the compressional Alfvén mode). In the limit $\unicode[STIX]{x1D6FD}\rightarrow \infty$ that corresponds to the incompressible MHD ( $C_{s}\rightarrow \infty$ ) or a plasma with vanishing magnetic field ( $V_{A}\rightarrow 0$ ), the slow mode $\unicode[STIX]{x1D714}_{s}=\pm V_{A}k_{\Vert }$ , and the fast mode $\unicode[STIX]{x1D714}_{f}=\pm C_{s}k$ . Note the difference in $k_{\Vert }$ and $k$ in these two limits.
Appendix D. Non-gyrotropic heat flux tensor $\boldsymbol{q}^{\text{ng}}$
The heat flux tensor equation (7.33) is rewritten as
The ‘inversion’ procedure for the left-hand side of this is equation is very complicated, and will not be addressed here. Nevertheless, the inversion procedure exists, and an interested reader can check equation (43) of Ramos (Reference Ramos2005). The leading-order $\boldsymbol{q}^{\text{ng}}$ (first order in frequency and wavenumber) can be obtained by making the quantities $\boldsymbol{q},\boldsymbol{r},\boldsymbol{p}$ on the right-hand side gyrotropic, and we want to solve
The heat flux tensor decomposition is $\boldsymbol{q}=\boldsymbol{q}^{\text{g}}+\boldsymbol{q}^{\text{ng}}$ , or alternatively $\boldsymbol{q}=\boldsymbol{S}+\unicode[STIX]{x1D748}$ , where $\unicode[STIX]{x1D748}:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}=0$ , $\unicode[STIX]{x1D748}:\unicode[STIX]{x1D644}=0$ . Since
one can define perpendicular components of the heat flux vectors
Therefore, to obtain these vectors, we want to apply $:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}$ , and $:(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})/2$ to the entire equation (D 2). A very useful expression also is
and the trace of the entire heat flux tensor reads
D.1 Non-gyrotropic heat flux vector $\boldsymbol{S}_{\bot }^{\Vert }$
Applying $:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}$ to the left-hand side of (D 2) yields
Terms on the right-hand side of (D 2) calculate as
For the last term on the right-hand side of (D 2)
and so for the gyrotropic
Collecting all the results together yields
The equation appears very complicated, however, it is not overly so, since it is only a vector equation. The equation just describes perpendicular components of $\boldsymbol{S}_{\bot }^{\Vert }$ . By multiplying it with $\hat{b}_{k}$ , the left-hand side is zero, and on the right-hand side one recovers the gyrotropic parallel heat flux equation (7.102). To understand the equation better, let us simplify for a moment and evaluate it with respect to $\hat{\boldsymbol{b}}_{0}=(0,0,1)$ . The $x$ -component $(\hat{\boldsymbol{b}}_{0}\times \boldsymbol{S}_{\bot }^{\Vert })_{x}=-(\boldsymbol{S}_{\bot }^{\Vert })_{y}$ , so that
and similarly for the $y$ -component $(\hat{\boldsymbol{b}}_{0}\times \boldsymbol{S}_{\bot }^{\Vert })_{y}=+(\boldsymbol{S}_{\bot }^{\Vert })_{x}$ , yielding
Now, by using the leading-order MHD induction equation $\text{d}\hat{b}_{x}/\text{d}t=\unicode[STIX]{x2202}_{z}u_{x}$ and $\text{d}\hat{b}_{y}/\text{d}t=\unicode[STIX]{x2202}_{z}u_{y}$ (which was for example used to get the FLR1 pressure tensor), and further considering specific example of a bi-Maxwellian expressions for the fourth-order moments, i.e. $r_{\Vert \Vert }=3p_{\Vert }^{2}/\unicode[STIX]{x1D70C}+\widetilde{r}_{\Vert \Vert }$ etc., yields
The above equations are very useful if we ever need linearized expressions for $\boldsymbol{S}_{\bot }^{\Vert }$ , which is beneficial to do right now, and the partially linearized expressions read
Note that the mean values of perturbations $\widetilde{r}^{(0)}=0$ . It is also noteworthy to point out that the contributions from the induction equation $\text{d}\hat{\boldsymbol{b}}/\text{d}t$ are completely eliminated by the linearization process, and more elaborate forms of induction equation will not bring additional precision at the linear level. Coming back to the expressions (D 22), (D 23), it is also possible to introduce vorticity $\unicode[STIX]{x1D74E}=\unicode[STIX]{x1D735}\times \boldsymbol{u}$ , with components $\unicode[STIX]{x1D714}_{x}=(\unicode[STIX]{x2202}_{y}u_{z}-\unicode[STIX]{x2202}_{z}u_{y})$ and $\unicode[STIX]{x1D714}_{y}=-(\unicode[STIX]{x2202}_{x}u_{z}-\unicode[STIX]{x2202}_{z}u_{x})$ . By noting that $(\hat{\boldsymbol{b}}_{0}\times \unicode[STIX]{x1D74E})_{x}=-\unicode[STIX]{x1D714}_{y}=(\unicode[STIX]{x2202}_{x}u_{z}-\unicode[STIX]{x2202}_{z}u_{x})$ and $(\hat{\boldsymbol{b}}_{0}\times \unicode[STIX]{x1D74E})_{y}=\unicode[STIX]{x1D714}_{x}=(\unicode[STIX]{x2202}_{y}u_{z}-\unicode[STIX]{x2202}_{z}u_{y})$ , both equations can be written together as
Now that we understand that equation (D 19) just describes the perpendicular components of $\boldsymbol{S}_{\bot }^{\Vert }$ with respect to $\hat{\boldsymbol{b}}$ , since the parallel components are zero (directly from the decomposition of $\boldsymbol{S}^{\Vert }$ )
we actually do not have to evaluate equation (D 19) with respect to $\hat{\boldsymbol{b}}_{0}$ . For a general vector $\boldsymbol{a}$ , the following identity holds $\hat{\boldsymbol{b}}\times (\hat{\boldsymbol{b}}\times \boldsymbol{a})=\hat{\boldsymbol{b}}(\hat{\boldsymbol{b}}\boldsymbol{\cdot }\boldsymbol{a})-\boldsymbol{a}$ . So for a vector $\boldsymbol{a}$ which does not have any parallel components to $\hat{\boldsymbol{b}}$ , so that $\hat{\boldsymbol{b}}\boldsymbol{\cdot }\boldsymbol{a}=0$ , the identity is $\hat{\boldsymbol{b}}\times (\hat{\boldsymbol{b}}\times \boldsymbol{a})=-\boldsymbol{a}$ , and
We can apply $\hat{\boldsymbol{b}}\times$ to the entire equation (D 19), and get the fully nonlinear expression for $\boldsymbol{S}_{\bot }^{\Vert }$ . Let us apply $\hat{\boldsymbol{b}}\times$ to each term separately, since we will need these results later, the terms calculate as
To summarize, applying $:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}$ and $\hat{\boldsymbol{b}}\times$ to equation (D 2), yields the following nonlinear expression
Now the calculations proceed in a similar way as before. The usual (MHD) induction equation can be written as
and since $(\unicode[STIX]{x1D735}\boldsymbol{u})\boldsymbol{\cdot }\hat{\boldsymbol{b}}-\hat{\boldsymbol{b}}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\boldsymbol{u}=\hat{\boldsymbol{b}}\times (\unicode[STIX]{x1D735}\times \boldsymbol{u})=\hat{\boldsymbol{b}}\times \unicode[STIX]{x1D74E}$ ,
Finally, by prescribing bi-Maxwellian perturbations for the fourth-order moment yields
We can report a sign typo in (3.7) of Sulem & Passot (Reference Sulem and Passot2015), whose term $3\widetilde{r}_{\Vert \bot }$ has the opposite sign.
D.2 Conversion of other notations
In the paper of Ramos (Reference Ramos2005), the following notation is used
and his equations (54), (57) are indeed equivalent to (D 37).
In the paper by Macmahon (Reference Macmahon1965), the following notation is used
and his equation (14) is equivalent to (D 36), if in Macmahon (Reference Macmahon1965) we continue the calculation of
D.3 Non-gyrotropic heat flux vector $\boldsymbol{S}_{\bot }^{\bot }$
Now we want to apply $:(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})/2$ to equation (D 2). Additionally, we now know that $\boldsymbol{S}_{\bot }^{\bot }$ has only perpendicular components, and that afterward we will be also applying $\hat{\boldsymbol{b}}\times$ . The left-hand side calculates trivially as
and applying $\hat{\boldsymbol{b}}\times$ therefore yields
Let us continue with applying trace to equation (D 2) term by term
and by further applying $\hat{\boldsymbol{b}}\times$ to these terms yields
and direct subtraction with results (D 29)–(D 33) gives
To summarize, applying $:(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})/2$ and $\hat{\boldsymbol{b}}\times$ to equation (D 2) yields the following nonlinear expression
Using the simple MHD induction equation gives
and finally prescribing bi-Maxwellian values for the fourth-order moments yields
Additionally, evaluation from linear kinetic theory in the gyrotropic limit (at long wavelengths) yields $\widetilde{r}_{\bot \bot }=0$ , as we will see in Part 2. With this requirement, equation (D 60) is equivalent to (3.6) of Sulem & Passot (Reference Sulem and Passot2015). The result (D 60) is also equivalent to expressions (55), (58) of Ramos (Reference Ramos2005). However, equation (15) of Macmahon (Reference Macmahon1965) appears to have a typo, where in comparison to our (D 59), his definition of $\boldsymbol{T}$ appears to be missing a factor of 2 in front of the $\unicode[STIX]{x1D735}\boldsymbol{u}$ term. Ramos (Reference Ramos2005) however states that his results are equivalent to those of Macmahon (Reference Macmahon1965), so we might not correctly understand the notation in that paper.
Partial linearization of result (D 60) yields that by components
where we have kept the $\widetilde{r}_{\bot \bot }$ contributions, just in case we need them in the future. Note that, similarly to the linearization of the vector $\boldsymbol{S}_{\bot }^{\Vert }$ , the induction equation is completely eliminated at the linear level, and more elaborate forms of induction equation will not have additional contributions.
D.4 Second-order heat flux vectors
In a previous section we have seen that by applying $:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}$ , $:(\unicode[STIX]{x1D644}-\hat{\boldsymbol{b}}\hat{\boldsymbol{b}})/2$ and $\hat{\boldsymbol{b}}\times$ to equation (D 1), the fully nonlinear (but implicit) expressions for the non-gyrotropic heat flux vectors read
Here, we are interested only in the linear level contributions, which we want to use in the FLR pressure tensor. Therefore, since we know that here we will linearize everything at the end, we can immediately get rid of the two heat flux terms $\boldsymbol{q}\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{u}$ and $(\boldsymbol{q}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\boldsymbol{u})^{S}$ , and also replace the $\text{d}/\text{d}t$ by $\unicode[STIX]{x2202}/\unicode[STIX]{x2202}t$ , so partial linearization yields
Additionally, the $\unicode[STIX]{x2202}\boldsymbol{q}/\unicode[STIX]{x2202}t$ term can be easily precalculated as
and by applying $\hat{\boldsymbol{b}}\times$ , terms proportional to $\hat{\boldsymbol{b}}$ disappear, yielding
The first-order vectors were obtained by keeping only gyrotropic quantities on the right-hand side. Linearizing results (D 37), (D 60) indeed yields that at the linear level
where the gradients are meant to be further linearized, and where we also neglected perturbations $\widetilde{r}$ , since right now we do not want to calculate these quantities from linear kinetic theory. To obtain the second-order heat flux vectors at the linear level, we need to calculate
To calculate the $\boldsymbol{r}^{\text{ng}}$ contributions it is enough to work with the partially linearized (valid only for bi-Maxwellian)
which further yields
and further evaluation with $\hat{\boldsymbol{b}}_{0}$ (e.g. $\hat{b}_{l}\unicode[STIX]{x2202}_{l}\rightarrow \unicode[STIX]{x2202}_{z}$ , or $\unicode[STIX]{x1D6F1}_{ik}\hat{b}_{i}\rightarrow \unicode[STIX]{x1D6F1}_{zk}$ , but keeping the $\hat{b}_{k}$ intact for now) yields
We purposely kept the $\hat{b}_{k}$ for now, since multiplying the results by $\hat{b}_{k}$ and further linearizing yields
which verifies that the contributions (7.100), (7.101) to the gyrotropic heat flux equations were indeed calculated correctly. In contrast, applying $\hat{\boldsymbol{b}}\times$ eliminates terms proportional to $\hat{b}_{k}$ , implying
where we have used perhaps a somewhat strange mixture of vector and index notations, but which should be otherwise clear. For any vector $\boldsymbol{a}$ , the notation just means $\hat{\boldsymbol{b}}_{0}\times \boldsymbol{a}=(-a_{y},a_{x},0)$ . Similarly straightforward calculation yields
The final form of the second-order non-gyrotropic heat flux vectors reads
Alternatively, one can write $\unicode[STIX]{x2202}_{l}\unicode[STIX]{x1D6F1}_{lk}-\unicode[STIX]{x2202}_{z}\unicode[STIX]{x1D6F1}_{kz}=\unicode[STIX]{x2202}_{x}\unicode[STIX]{x1D6F1}_{xk}+\unicode[STIX]{x2202}_{y}\unicode[STIX]{x1D6F1}_{yk}$ . The second-order heat flux vectors contribute to the scalar (gyrotropic) pressure equations, and also to the FLR pressure tensor $\unicode[STIX]{x1D72B}$ . The non-gyrotropic pressure tensor $\unicode[STIX]{x1D72B}$ , and the non-gyrotropic heat flux vectors $\boldsymbol{S}_{\bot }^{\Vert }$ , $\boldsymbol{S}_{\bot }^{\bot }$ are therefore generally coupled. To make the system easily solvable, terms with $\unicode[STIX]{x1D72B}$ on the right-hand side of (D 88), (D 89) are approximated by $\unicode[STIX]{x1D72B}^{(1)}$ . The equations (D 88), (D 89) can be written in a more elegant vector form, see equations (5.153), (5.154), and the result is equivalent to equations (53), (54) of Passot et al. (Reference Passot, Sulem and Hunana2012).
Instead of splitting of $\unicode[STIX]{x1D72B}$ , $\boldsymbol{S}_{\bot }^{\Vert }$ and $\boldsymbol{S}_{\bot }^{\bot }$ into the first- and second-order components, one might be interested in the future to check the behaviour of the coupled system, i.e. one might be interested in solving the pressure tensor equation, that is coupled to the heat flux vectors, by making the associated variables independent quantities. For example, one might be interested to double check the solutions for the parallel firehose instability. The procedure of how to obtain time-dependent equations for $\boldsymbol{S}_{\bot }^{\Vert }$ and $\boldsymbol{S}_{\bot }^{\bot }$ can be perhaps a bit blurry from the steps outlined above, so let us write it one more time, only for $\boldsymbol{S}_{\bot }^{\Vert }$ . One can work directly with the heat flux tensor equation (7.33), and keeping only terms that contribute at the linear level, the partially linearized heat flux tensor equation reads
Applying $:\hat{\boldsymbol{b}}\hat{\boldsymbol{b}}$ and $\hat{\boldsymbol{b}}\times$ , yields that, at the linear level,
or in the index notation
Importantly, bi-Maxwellian distribution function was assumed. Normalizing the equations, writing them in the $x$ – $z$ plane and Fourier transforming yields
Considering parallel propagation ( $k_{\bot }=0$ ), these equations are coupled with equations for $\unicode[STIX]{x1D6F1}_{xz},\unicode[STIX]{x1D6F1}_{yz}$ , equations (5.157), (5.158), and also with equations for $u_{x},u_{y},B_{x},B_{y}$ , system (6.1). Now, it should be easy to derive the dispersion relation and, for example, check if the solutions for the parallel firehose instability are improved.
D.5 Contributions of $\boldsymbol{q}^{\text{ng}}$ to various equations
Here we calculate contributions of $\boldsymbol{q}^{\text{ng}}$ to the FLR pressure tensor, scalar heat flux equation and scalar pressure equations. Contributions to the FLR pressure tensor are calculated at the linear level by neglecting $\unicode[STIX]{x1D748}$ . Contributions to scalar heat flux equations and scalar pressure equations are calculated nonlinearly. In these calculations, no specific form of a distribution function is assumed.
D.5.1 Contributions to FLR pressure tensor
For contributions to the FLR tensor $\unicode[STIX]{x1D72B}$ , we need to calculate $\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{q}^{\text{ng}}$ . To calculate it, we will neglect the heat flux tensor $\unicode[STIX]{x1D748}$ , and use the decomposition of the heat flux tensor $\boldsymbol{S}$ , that can be figured out to be
or in the index notation
Here, we want to calculate only contributions at the linear level, and linearizing $\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{S}$ yields
and straightforward evaluation gives
D.5.2 Contributions to scalar heat flux equations
When we calculated the scalar heat flux equations, we separated the non-gyrotropic heat flux $\boldsymbol{q}^{\text{ng}}$ by defining a quantity
and we need to calculate
Here, we want to obtain exact nonlinear expressions. Direct calculations yield
and the final fully nonlinear result reads
Similar calculations yield a nonlinear expression for $Q_{\bot }^{\text{ng}}$ , which calculates as
To continue the calculation, we need to use (an exact expression)
which allows us to calculate
and therefore
The final fully nonlinear result reads
However, it is easy to see that, at the linear level, both $Q_{\Vert }^{\text{ng}}\overset{\text{lin}}{=}0$ and $Q_{\bot }^{\text{ng}}\overset{\text{lin}}{=}0$ .
D.5.3 Contributions to scalar pressure equations
The non-gyrotropic heat flux term entering the parallel pressure equation calculates as
and since
which yields the final nonlinear result
Similarly, the term entering the perpendicular pressure equation calculates as
The nonlinear results (D 124), (D 126) are exact, and here we do not address how to further decompose the tensor $\unicode[STIX]{x1D748}$ . Finally, considering contributions at the linear level (assuming that the mean values of the entire heat flux $\boldsymbol{q}^{(0)}=0$ ), it is easy to see that
The heat flux tensor $\unicode[STIX]{x1D748}$ therefore does not contribute to the gyrotropic pressure equations at the linear level.