1 Introduction

Microstructures resulting from conventional turbulence (CT) and double-diffusive convection (DDC) are among the many noteworthy physical processes occurring in the ocean. Although the rates of microstructure occurrence and their effects are gradually being revealed, more complete information about the occurrence of microstructures remains unknown. A better understanding of oceanic microstructures will provide value to multiple fields and may help in answering some outstanding questions in climatic modeling, water mass modification, and oceanic nutrient distribution processes.

CT and DDC are related to large-scale oceanic processes. For example, internal wave (IW) breaking can produce significant amounts of turbulence (e.g., Polzin et al. 1997). DDC occurring at ~ 400 db generates North Pacific Intermediate Water (Talley and Yun 2001) and leads to intrusions in the subsurface layer off the Sanriku Coast of Japan (e.g., Nagata 1970; Nagasaka et al. 1999). Taken together, studies on microscale mixing [~ O(10−2) m] are strongly correlated with large-scale processes (e.g., meridional circulation: ~ O(103) m, intrusion and IWs: ~ O(102) m; Munk 1966; Bryan 1987; Gargett and Holloway 1992; Karl 1999). Nonetheless, the effects of DDC have been historically ignored in scientific study.

One reason why DDC has been ignored is the shortage of empirical knowledge typically obtained through observation. The opportunity for observations is limited because DDC is known to occur in areas such as shallow regions with commercial usage or in polar regions (e.g., Hirano et al. 2010). Moreover, limited ship time for observation and high cost of the microstructure profiler interrupt microstructure observations. Difficulties in handling microstructure data also exist. In addition, the areas surveyed for the detection of microstructures have incomplete coverage because the spatiotemporal scales of microscale processes are smaller than those detected by routine observations.

In order to compensate for the difficulties mentioned above, parameterizations of eddy diffusivities and kinetic energy dissipation rates have been developed using the conductivity temperature depth (CTD) profiler, lowered ADCP (LADCP), and other common oceanic instruments for hydrographic data collection. However, at its current state, the parameterization is not completely developed, because the methods are based on certain limitations. Nearly all of the parameterization concerns deal with shear-driven turbulence (CT), which is due to IWs. When the velocity shear is superior (Kunze 1990), DDC coexists with CT; nevertheless, DDC has been ignored in the parameterization. Parameterizations of DDC are carried out using laboratory experiments and direct numerical simulations (DNS). This means that a comparison focusing on DDC with microstructure data is still required. Therefore, a more precise parameterization is required for future microstructure studies.

The rest of this overview is structured as follows. We summarize previous studies on eddy diffusivity and present the results of DDC parameterization in oceanic turbulent mixing. DDC in the turbulent kinetic energy (TKE) equation is discussed in Sect. 2. Parameterization of eddy diffusivity using a second-moment closure (SMC) model is described in Sect. 3. Other types of DDC parameterizations in numerical simulations are described in Sects. 4 and 5. Key points regarding the eddy diffusivity estimation with measurement data are described in Sect. 6. Finally, concluding remarks are presented in Sect. 7. Details regarding the turbulent kinetic energy (TKE) equation, laboratory flux laws, SMC model, and relevant terminologies are presented in Appendices A–D, respectively.

2 Eddy diffusivity with turbulent kinetic energy equation and flux laws

DDC has two forms of convection: salt finger convection (SF) and diffusive convection (DC). DDC is characterized by the density ratio \(R_{\rho } {{ = \alpha \frac{{\partial \bar{T}}}{\partial z}} \mathord{\left/ {\vphantom {{ = \alpha \frac{{\partial \bar{T}}}{\partial z}} {\beta \frac{{\partial \bar{S}}}{\partial z}}}} \right. \kern-0pt} {\beta \frac{{\partial \bar{S}}}{\partial z}}}\), which is the ratio of the background density gradient due to temperature to that due to salt, where α and β are the expansion and contraction coefficients for heat and salt, respectively (Eq. 77). \(\frac{{\partial \bar{T}}}{\partial z}\) and \(\frac{{\partial \bar{S}}}{\partial z}\) represent the background temperature and salt gradients, respectively. Generally, SF is considered active when 1 < Rρ< 2, and DC is considered active when 0.5 < Rρ< 1 (e.g., Inoue et al. 2007). When CT is weak and DDC is active, the density is transported downward because of the difference in molecular diffusivity for heat and salt; therefore, the eddy diffusivities for salt KS, heat KT, and density Kρ are not equal to one another (see Appendices A and B). This characteristic is unique to DDC.

Consider the steady-state TKE equation for SF without background velocity shear (refer to Eq. 67). The balance equation between the dissipation rate of the TKE ε (refer to kinetic energy dissipation rates) and the energy production via buoyancy flux Jb is as follows:

$$0 = \varepsilon + g\frac{{\overline{\rho 'w'} }}{{\bar{\rho }}} = \varepsilon + J_{b} .$$
(1)

Thus, Jb should be negative for DDC. From Eq. (82), and under the Boussinesq approximation (\(\bar{\rho } = \rho_{0}\)), Jb can be written as

$$J_{b} = g\frac{{\overline{{\rho^{\prime}w^{\prime}}} }}{{\rho_{0} }} = g\frac{{F_{\rho } }}{{\rho_{0} }} = g(\beta F_{S} - \alpha F_{T} ) = g\beta F_{S} \left( {1 - \frac{{\alpha F_{T} }}{{\beta F_{S} }}} \right),$$
(2)

Then, Eq. (1) can be rewritten as

$$\varepsilon = - g\beta F_{S} \left( {1 - \frac{{\alpha F_{T} }}{{\beta F_{S} }}} \right) = - g\beta F_{S} \left( {1 - \gamma^{\text{SF}} } \right),$$
(3)

where \(\gamma^{\text{SF}}\) is the density flux ratio due to SF (see Appendix B). Here, the square of buoyancy frequency \(N\) is described as

$$N^{2} = - \frac{g}{{\rho_{0} }}\frac{{\partial {\kern 1pt} \bar{\rho }}}{{\partial {\kern 1pt} z}} = g\alpha \frac{{\partial {\kern 1pt} \bar{T}}}{{\partial {\kern 1pt} z}} - g\beta \frac{{\partial {\kern 1pt} \bar{S}}}{{\partial {\kern 1pt} z}} = - g\beta \frac{{\partial {\kern 1pt} \bar{S}}}{{\partial {\kern 1pt} z}}\left( {1 - R_{\rho } } \right).$$
(4)

From the definition of \(K_{S}\) and \(K_{T}\) in DDC (Eq. 101) with Eq. 4, we obtain an expression for the vertical eddy diffusivity of salt for SF \(K_{S}^{\text{SF}}\):

$$K_{S}^{\text{SF}} = \frac{{R_{\rho } - 1}}{{1 - \gamma^{\text{SF}} }}\frac{\varepsilon }{{N^{2} }}.$$
(5)

From the definition of \(R_{\rho }\), the vertical eddy diffusivity of heat for SF \(K_{T}^{\text{SF}}\) is given by:

$$K_{T}^{\text{SF}} = \frac{{\gamma^{\text{SF}} }}{{R_{\rho } }}K_{S}^{\text{SF}} = \frac{{\gamma^{\text{SF}} \left( {R_{\rho } - 1} \right)}}{{R_{\rho } \left( {1 - \gamma^{\text{SF}} } \right)}}\frac{\varepsilon }{{N^{2} }}.$$
(6)

Rewriting Eq. (82) as Eq. (7), the vertical eddy diffusivity of the density of SF \(K_{\rho }^{\text{SF}}\) can be written as Eq. (8):

$$- K_{\rho } \frac{g}{{\rho_{0} }}\frac{{\partial {\kern 1pt} \bar{\rho }}}{{\partial {\kern 1pt} z}} = g\alpha K_{T} \frac{{\partial {\kern 1pt} \bar{T}}}{{\partial {\kern 1pt} z}} - g\beta K_{S} \frac{{\partial {\kern 1pt} \bar{S}}}{{\partial {\kern 1pt} z}}$$
(7)
$$K_{\rho }^{\text{SF}} = \frac{{K_{T}^{\text{SF}} R_{\rho } - K_{S}^{\text{SF}} }}{{R_{\rho } - 1}}$$
(8)

From Eqs. (5, 6, 7, and 8), we have

$$K_{\rho }^{\text{SF}} = - \frac{\varepsilon }{{N^{2} }} < 0.$$
(9)

Eddy diffusivities for DC (\(K_{S}^{\text{DC}}\), \(K_{T}^{\text{DC}}\), and \(K_{\rho }^{\text{DC}}\)) are obtained in the same way:

$$K_{S}^{\text{DC}} = \gamma^{\text{DC}} R_{\rho } K_{T} = \frac{{\gamma^{\text{DC}} \left( {1 - R_{\rho } } \right)}}{{\left( {1 - \gamma^{\text{DC}} } \right)}}\frac{\varepsilon }{{N^{2} }}{\kern 1pt} {\kern 1pt}$$
(10)
$$K_{T}^{\text{DC}} = \frac{{\left( {1 - R_{\rho } } \right)}}{{R_{\rho } \left( {1 - \gamma^{\text{DC}} } \right)}}\frac{\varepsilon }{{N^{2} }},$$
(11)
$$K_{\rho }^{\text{DC}} = \frac{{K_{T}^{DC} R_{\rho } - K_{S}^{DC} }}{{R_{\rho } - 1}} = - \frac{\varepsilon }{{N^{2} }} < 0.$$
(12)

Note that \(K_{\rho }\) is negative in the presence of DDC, indicating that DDC reduces the potential energy of the system and intensifies density stratification. Using the flux laws created by Huppert (1971, Eq. 102), Kunze (1987, Eq. 93), and Kelley (1986, Eq. 94, Kelley 1990, Eq. 103), variations of the eddy diffusivity in DDC with inactive CT (taking ε = 10−10 W kg−1 and N = 5.2 × 10−3) are shown in Fig. 1. \(K_{S}^{\text{SF}}\) and \(K_{T}^{SF}\) take large values with active SF (1 < \(R_{\rho }\) < 2).\(K_{S}^{\text{DC}}\) and \(K_{T}^{\text{DC}}\) take large values with active DC (0.5 < \(R_{\rho }\) < 1). The validity of this range will be confirmed in the next section.

Fig. 1
figure 1

Eddy diffusivities calculated from flux laws created by Huppert (1971); Kunze (1987) and Kelley (1986, 1990), taking ε = 10−10 W kg−1 and N = 5.2 × 10−3 s−1 (mode values for both quantities obtained in NATRE, Gregg 1989)

3 DDC in SMC

When estimating the eddy diffusivity in the presence of DDC, the effect of velocity shear has been traditionally ignored. Linden (1974) experimentally showed that three-dimensional SF in the steady shear flow aligned with the velocity shear to form two-dimensional sheets, and with the resultant vertical transports of salt and heat remaining unchanged. However, Kunze (1990) analyzed C-SALT data and confirmed that oceanic SF should take the form of two-dimensional sheets due to velocity shear, leading to a reduction in the vertical buoyancy flux of SF. Wells et al. (2001) numerically and experimentally investigated the structure of SF in the presence of periodic shear flow, with the results revealing a reduced vertical buoyancy flux of SF. Therefore, we cannot neglect the shear effects on DDC.

For investigating the effect of shear on both DDC and CT, SMC was employed by Canuto et al. (2008), Kantha and Carniel (2009), and Kantha (2012). In this review, we follow the approach used in Kantha et al. (2011) and Kantha (2012), including the variances of both temperature and salinity in the steady-state energy equation (Eq. 79). The turbulent timescale τ is introduced as

$$\tau = B_{1} \frac{{\ell {\kern 1pt} }}{q}{\kern 1pt} = \frac{{q^{2} }}{\varepsilon } = \frac{2K}{\varepsilon },$$
(13)

where B1 is the coefficient for the turbulent timescale, q is the turbulence velocity scale, \(\ell\) is the turbulence length scale, and K is the TKE (= q2/2). The second-moment terms of transport for heat \(\overline{{w^{\prime}T^{\prime}}}\), salt \(\overline{{w^{\prime}S^{\prime}}}\), and momentum \(\overline{{u^{\prime}w^{\prime}}}\) are parameterized in Eqs. (80, 81 and 88) (the first-order closure), and the structure functions for the salt \(S_{S}\), heat \(S_{T}\), density \(S_{\rho }\), and momentum \(S_{\upsilon }\) are introduced with the eddy diffusivity for salt KS, temperature KT, density Kρ, and momentum \(K_{\upsilon }\), defined as:

$$K_{S} = K\tau S_{S}$$
(14)
$$K_{T} = K\tau S_{T}$$
(15)
$$K_{\rho } = K\tau S_{\rho }$$
(16)
$$K_{\upsilon } = K\tau S_{\upsilon }$$
(17)

From Eq. (8) or Eq. (12), relations among \(S_{S}\), \(S_{T}\), and \(S_{\rho }\) can be obtained:

$$S_{\rho } = \frac{{R_{\rho } S_{T} - S_{S} }}{{R_{\rho } - 1}}.$$
(18)

This model is described in Appendix C. After a series of manipulations involving Eqs. (52, 71, and 72) using Eqs. (105, 106, 107, 108, 109, 110, 111, and 112), one can obtain the relations between the structure functions in the DDC as functions of the gradient Richardson number Ri, defined as Eq. (90), Rρ and N:

$$\tau^{2} N^{2} \frac{{R_{\rho } }}{{\left( {R_{\rho } - 1} \right)}}\left[ {S_{\upsilon } \frac{{\left( {R_{\rho } - 1} \right)}}{{R_{\rho } R_{i} }} - \left( {S_{T} - \frac{{S_{S} }}{{R_{\rho } }}} \right)} \right] = 2.$$
(19)

Introduce the non-dimensional numbers, \(G_{T}\) and \(G_{\upsilon }\) such that

$${\kern 1pt} {\kern 1pt} G_{T} = \tau^{2} N^{2} ,$$
(20)
$$G_{\upsilon } = \tau^{2} \left( {\frac{{\partial {\kern 1pt} \bar{u}{\kern 1pt} }}{{\partial {\kern 1pt} z}}} \right)^{2} .$$
(21)

Using Eqs. (20 and 21), we have the ratio between \(G_{T}\) and \(G_{\upsilon }\) as follows

$$\frac{{G_{T} }}{{G_{\upsilon } }} = \frac{{N^{2} }}{{\left( {\frac{{\partial {\kern 1pt} \bar{u}}}{\partial z}} \right)^{2} }} = R_{i}$$
(22)

Using Eq. (22), Eq. (19) can be written as

$$S_{\upsilon } G_{\upsilon } - \frac{{G_{T} R_{\rho } }}{{\left( {R_{\rho } - 1} \right)}}\left( {S_{T} - \frac{{S_{S} }}{{R_{\rho } }}} \right) = 2$$
(23)

From Eqs. (13, 14, 15, 16, 17, 18, and 20), Eq. (23) becomes:

$$\frac{{K_{\upsilon } }}{{R_{i} }} - \frac{{K_{T} R_{\rho } - K_{S} }}{{R_{\rho } - 1}} = \frac{\varepsilon }{{N^{2} }}.$$
(24)

When shear is ignored (Ri ≫ 1, DDC only), Eq. (23) is reduced to

$$S_{S} G_{T} - R_{\rho } S_{T} G_{T} = 2\left( {R_{\rho } - 1} \right).$$
(25)

In this case, Eq. (24) becomes equivalent to Eqs. (8 and 12). Thus, negative diffusion of density is obtained.

Kantha (2012) obtained the density flux ratio as a function of Rρ such that

$$\gamma = \frac{{R_{\rho } \left\{ {\lambda_{9} + \left[ {\lambda_{11} \left( {\frac{1}{{R_{\rho } }} + 1} \right) - \lambda_{10} \frac{1}{{R_{\rho } }}} \right]C_{\text{SMC}} } \right\}}}{{\left\{ {\lambda_{5} + \left[ {\lambda_{8} - \lambda_{11} \left( {\frac{1}{{R_{\rho } }} + 1} \right)} \right]C_{\text{SMC}} } \right\}}},$$
(26)

and obtained relations among the structure functions for DDC without shear for SF:

$$S_{T} = \frac{{2\gamma^{\text{SF}} }}{{C_{\text{SMC}} \left( {1 - \gamma^{\text{SF}} } \right)}},$$
(27)
$$S_{S} = \frac{{R_{\rho } }}{{\gamma^{\text{SF}} }}S_{T} ,$$
(28)
$$S_{\rho } = - \frac{{R_{\rho } \left( {1 - \gamma^{\text{SF}} } \right)}}{{\gamma^{\text{SF}} \left( {R_{\rho } - 1} \right)}}S_{T} ,$$
(29)

and for DC:

$$S_{T} = - \frac{2}{{C_{\text{SMC}} \left( {1 - \gamma^{\text{DC}} } \right)}},$$
(30)
$$S_{S} = R_{\rho } \gamma^{\text{DC}} S_{T} ,$$
(31)
$$S_{\rho } = - \frac{{R_{\rho } \left( {1 - \gamma^{\text{DC}} } \right)}}{{1 - R_{\rho } }}S_{T} .$$
(32)

CSMC is a parameter to be determined. Here, we have used \(\gamma\) obtained by Kelley (1986, Eq. 94 for SF and Eq. 103 for DC) on the left-hand side of Eq. 26 to calculate CSMC, and then to calculate the structure functions (Eqs. 27, 28, 29, 30, 31, 32, Fig. 2). SS and ST steeply increased as Rρ approached unity, which means that mixing due to DDC was intensified. Negative Sρ for both SF and DC implies negative diffusion of density. These functions indicate that the effect of DDC is certainly important but is restricted to a narrow range of Rρ (0.8 ~ 1.2). This point should be investigated in greater detail in future modeling studies.

Fig. 2
figure 2

Dependence of structure functions for (top) heat ST, (middle) salt SS, and (bottom) density Sρ on \(R_{\rho }\)

SMC theories continue to be developed; however, there is difficulty when it comes to observational usage. Therefore, other parameterizations, which are mentioned in Sects. 3 and 4, have been proposed.

4 K-profile parameterization with DDC

Large et al. (1994) simulated meridional ocean circulation (MOC) using K-profile parameterization (KPP) and considered three different mechanisms that contribute to eddy diffusivity, namely vertical shear instability, IW breaking, and DDC, providing a linear combination for eddy diffusivity: \(K_{\rho } = K_{\rho }^{\text{Shear}} + K_{\rho }^{\text{IW}} + K_{\rho }^{\text{DDC}}\). When active SF occurred (\(1 < R_{\rho } < 1.9\)), they used a constant value of 0.7 for \(\gamma^{\text{SF}}\), describing \(K_{S}^{\text{SF}}\) and \(K_{T}^{\text{SF}}\) as

$$K_{S}^{\text{SF}} = \left[ {1 - \left( {\frac{{R_{\rho } - 1}}{0.9}} \right)^{2} } \right]^{3} \times 10^{ - 3} ,$$
(33)
$$K_{T}^{\text{SF}} = \frac{{\gamma^{\text{SF}} }}{{R_{\rho } }}K_{S}^{\text{SF}} .$$
(34)

When \(R_{\rho }\) was greater than 1.9, \(K_{S}^{\text{SF}} = 0\). In the case of active DC (\(0.5 < R_{\rho } < 1\)), \(K_{T}^{\text{DC}}\) is calculated using \(\gamma^{\text{DC}}\) as proposed by Marmorino and Caldwell (1976) and \(K_{S}^{\text{DC}}\) as proposed by Huppert (1971, Eq. 102):

$$K_{T}^{\text{DC}} = 0.909 \times 1.5 \times 10^{ - 6} \exp \left[ {4.6\exp \left( { - 0.54\left( {R_{\rho }^{ - 1} - 1} \right)} \right)} \right],$$
(35)
$$K_{S}^{\text{DC}} = (1.85 - 0.85R_{\rho }^{ - 1} )R_{\rho } K_{T}^{\text{DC}} .$$
(36)

If \(R_{\rho }\) was less than 0.5,

$$K_{S}^{\text{DC}} = 0.15R_{\rho } K_{T}^{\text{DC}} .$$
(37)

Zhang et al. (1998) also simulated the MOC using a parameterization considering DDC effects. They defined the background diffusivity as Kb= 3 × 10−5 m2/s and parameterized SF and DC eddy diffusivity. When SF occurred, they used a constant value of 0.7 for \(\gamma^{\text{SF}}\) and described \(K_{S}^{\text{SF}}\) and \(K_{T}^{\text{SF}}\) as

$$K_{S}^{\text{SF}} = \frac{{1 \times 10^{ - 4} }}{{1 + \left( {\frac{{R_{\rho } }}{1.6}} \right)^{6} }} + K_{b} ,$$
(38)
$$K_{T}^{\text{SF}} = \frac{{\gamma^{\text{SF}} }}{{R_{\rho } }}\left( {K_{S}^{\text{SF}} - K_{b} } \right) + K_{b} .$$
(39)

When DC occurred, they used the \(\gamma^{\text{DC}}\) presented by Kelley (1984), wherein the molecular heat diffusivity kT = 1.5 × 10−7 m2 s−1, and they described \(K_{S}^{\text{DC}}\) and \(K_{T}^{\text{DC}}\) as

$$K_{S}^{\text{DC}} = R_{\rho } \gamma^{\text{DC}} (K_{T}^{\text{DC}} - K_{b} ) + K_{b} ,$$
(40)
$$K_{T}^{\text{DC}} = 0.0032\exp \left( {4.8R_{\rho }^{0.72} } \right) \cdot (0.25 \times 10^{9} R_{\rho }^{ - 1.1} )^{1/3} \cdot k_{T} + K_{b} .$$
(41)

For both treatments, \(K_{\rho }^{\text{DDC}}\) is taken as

$$K_{\rho }^{\text{DDC}} = \frac{{K_{T}^{\text{DDC}} R_{\rho } - K_{S}^{\text{DDC}} }}{{R_{\rho } - 1}}.$$
(42)

A calculation of the eddy diffusivities in the range of 0.5 < Rρ< 2 is shown in Fig. 3.

Fig. 3
figure 3

Eddy diffusivities employed for numerical simulations by Large et al. (1994) and Zhang et al. (1998)

The parameterization set by Zhang et al. (1998) has smaller values than that of Large et al. (1994). However, the absolute diffusivity values in both parameterizations increase as Rρ approaches unity. When Rρ becomes smaller than 1.7, \(K_{\rho }^{\text{SF}}\) becomes negative. The notable difference between Zhang et al. (1998) and Large et al. (1994) is the behavior around Rρ= 1. Both \(K_{\rho }^{\text{SF}}\) diverge negatively, but Large et al. (1994)’s \(K_{\rho }^{\text{SF}}\) rapidly diverges because of the relatively large differences between \(K_{S}^{\text{SF}}\) and \(K_{T}^{\text{SF}}\). As for \(K_{\rho }^{\text{DC}}\), when we take the limit of \(K_{\rho }^{\text{DC}}\) as Rρ approaches unity, \(K_{\rho }^{\text{DC}}\) diverges negatively for Zhang et al. (1998) while becoming nearly constant for Large et al. (1994).

Merryfield et al. (1999) used parameterization similar to that of Zhang et al. (1998), which changed the background diffusivity. Following studies by following Gargett (1984) and Gargett and Holloway (1984), they defined the background diffusivity as proportional to \(N^{ - 1}\), and found that relatively minor changes occurred in the global circulation (mass transport) even when DDC was present. Nevertheless, there were substantial changes in the local temperature and salt distributions: the lower layer became saltier because of the efficient salt transport resulting from SF. Inoue et al. (2007) analyzed turbulence data observed in a perturbed region off Sanriku Coast, Japan, and compared their observed diffusivity values with those of Zhang et al. (1998, Eqs. 38, 39, and 40). This comparison showed a fairly good agreement for SF, but not for DC.

5 Direct numerical simulation of DDC

Recent developments in computer power have enabled us to conduct DNS of DDC. Such studies have the advantage of directly estimating the vertical fluxes and diffusivities. Kimura et al. (2011) conducted DNS at low Rρ (< 2.0, active SF). The study showed that when SF develops, both \(K_{S}^{\text{SF}}\) and \(K_{T}^{\text{SF}}\) increase as Ri increases, which is an unexpected result. In typical cases, a shear instability (energy source) should be inactive as Ri increases, with both \(K_{S}^{\text{SF}}\) and \(K_{T}^{\text{SF}}\) increasing as Rρ decreases. This result agrees with previous theoretical, observational, and situational estimations. The result follows the functional dependency of diffusivity on Ri and Rρ:

$$K_{S}^{\text{SF}} = 4.38 \times 10^{ - 5} R_{\rho }^{ - 2.7} R_{i}^{0.17} ,$$
(43)
$$K_{T}^{\text{SF}} = 3.07 \times 10^{ - 5} R_{\rho }^{ - 4.0} R_{i}^{0.17} .$$
(44)

This parameterization was verified and improved by Nakano et al. (2014), who analyzed the microstructure and CTD/LADCP results in the perturbed region off the Sanriku Coast, Japan, and western North Pacific Ocean. They also employed the buoyancy Reynolds number Reb and Ri (both at 10 m scale) as the distinguishing parameters between CT and DDC. They obtained the following new relationship between Reb and Ri:

$$R_{eb} = 19.5R_{i}^{ - 1.03} .$$
(45)

From this relation, we can obtain critical values for Reb from Ri such that:

$$\left( {R_{eb} ,R_{i} } \right){ = }\left( { 80,0. 2 5} \right) ,\left( { 20, 1} \right).$$

The value of Ri = 1 is the stability criterion of the water column, and if Ri < 0.25, the water column can become unstable and turbulent. Therefore, values of Reb= 20 and 80 corresponding to Ri, which indicate that Reb< 80 and Ri> 0.25, are suitable as criteria for the onset of DDC. Taking into account this criteria, Nakano et al. (2014) applied a DNS parameterization of diffusivity as the functions of Ri and Rρ (Kimura et al. 2011, Eqs. 43 and 44), improving their functional dependency using the following equations:

$$\left. {\begin{array}{*{20}c} {K_{S}^{\text{SF}} = 9.35 \times 10^{ - 5} R_{\rho }^{ - 2.7} R_{i}^{0.17} } \\ {K_{T}^{\text{SF}} = 7.61 \times 10^{ - 5} R_{\rho }^{ - 2.7} R_{i}^{0.17} } \\ \end{array} } \right\},R_{i} > 0.25\left( {R_{eb} < 80} \right).$$
(46)

The estimated average diffusivities of salt and heat are 2.2 × 10−5 m2/s and 3.5 × 10−5 m2/s (Rρ= 1.25), and 3.5 × 10−5 m2/s and 1.1 × 10−4 m2/s (Rρ= 1.75), respectively. It was considered that the difference in coefficients between DNS (Eqs. 43, 44) and observation (Eq. 46) was caused by vertical scale difference.

Radko and Smith (2012) conducted fine-grid simulations and non-dimensional analyses of typical SF width and length scales at Rρ= 1.9. They produced vertically aligned fingers disturbed by a secondary instability. In their calculations, fluxes become almost constant after a secondary instability became comparable to the elevator mode. They obtained γ as a function of Rρ, which agrees fairly well with the laboratory prediction:

$$F_{S}^{\text{SF}} = \frac{135.7}{{\sqrt {R_{\rho } - 1} }} - 62.75,$$
(47)
$$\gamma^{\text{SF}} = 2.709\exp \left( { - 2.513R_{\rho } } \right) + 0.5128,$$
(48)
$$F_{T}^{\text{SF}} = \gamma^{\text{SF}} F_{S}^{\text{SF}} .$$
(49)

As mentioned above, although parameterizations will continue to be refined with increasing computer machine power, verification of parameterization with observational data is still required.

6 Key points of eddy diffusivity estimation with measurement data

6.1 Mixing coefficients and distinguishing DDC from CT

Most microstructure observations aimed at evaluating eddy diffusivity in the presence of DDC have been based on observations of the dissipation rate of temperature variance χT (and thus, KT estimation by Eq. 86) and mixing efficiency Γ.

To elucidate the effects of microstructures, eddy diffusivity of density for CT generated by shear \(K_{\rho }^{\text{CT}}\) is parameterized as follows:

$$K_{\rho }^{\text{CT}} = \varGamma^{\text{CT}} \frac{\varepsilon }{{N^{2} }},$$
(50)

where ΓCT is the mixing coefficient for CT, which can be regarded as the mixing efficiency. A detailed derivation of Eq. (50) is presented in Appendix A. ΓCT is the result of the observed values of \(\varepsilon\), χT, density stratification, and temperature stratification (see Eq. 86), but ΓCT has been considered to have a constant value of 0.2 or 0.25 (e.g., Osborn 1980; Oakey 1982). Thus, \(K_{\rho }^{\text{CT}}\) is calculated using ε and N. However, the results discussed in the previous studies cast doubt on the validity of using a constant value for ΓCT (= 0.2) when estimating the eddy diffusivity in the presence of DDC.

When estimating eddy diffusivity in the presence of DDC, non-dimensional parameters, such as Rρ, and Ri measured by the vertical velocity shear \(\frac{{\partial \bar{u}}}{\partial z}\), N and Reb (see Eq. 92) have been used to distinguish between CT and DDC. Also, the value of Γ for DDC (ΓDDC) is a key factor in distinguishing DDC from CT. The definition of ΓDDC is the same as ΓCT via observation (right-hand side of Eq. 86):

$$\varGamma^{\text{DDC}} = \frac{{\chi_{T} N^{2} }}{{2\varepsilon \left( {\frac{{\partial \,\overline{T} }}{\partial \,z}} \right)^{2} }}.$$
(51)

Historically, ΓDDC has been investigated separately from SF (ΓSF) or DC (ΓDC). St. Laurent and Schmitt (1999) surveyed the distributions of ΓSF and ΓDC with Rρ and Ri and found that ΓSF and ΓDC increased substantially because of DDC. This is one of the current key issues in microstructure studies (e.g., de Lavergne et al. 2016). This is readily understood because DDC can efficiently diffuse temperature fluctuations and create a large diffusion of temperature (Fig. 4). Inoue et al. (2007) proposed that DDC is effective in mixing when Reb< 20 in the perturbed region. Inoue et al. (2008) revisited North Atlantic Tracer Release Experiment (NATRE) data, adding the Ri criterion to restrict their attention to cases when CT was not active (Ri> 0.25 and Reb< 20). They found that ΓSF decreased when Rρ increased:

$$\begin{aligned} \left( {\varGamma^{\text{SF}} ,R_{\rho } } \right) & = \left( { 1.0, 1. 3} \right),\left( {0. 6, 1. 9} \right), \\ \left( {\varGamma^{\text{SF}} ,R_{i} } \right) & = \left( {0. 6,0. 4} \right),\left( {0. 9, 1.0} \right),\left( { 1. 3, 10} \right). \\ \end{aligned}$$
Fig. 4
figure 4

a Occurrence of diffusive convection. (a) Initially warm/salty layer is above, and the cold/fresh layer is below. The separating interface is initially at rest. (b) The interface becomes unstable because of differences in molecular diffusivity of heat and salt (\(k_{T} \approx 100k_{S}\)). The upward portion of the lower layer (w′ > 0, vertical blue arrow) has both negative temperature and salt anomalies due to a surrounding warm and salty layer (T′ < 0 and S′ < 0). The downward portion (w′ < 0, vertical red arrow) has positive temperature and salt anomalies due to a surrounding cold fresh layer (T′ > 0 and S′ > 0). Lateral molecular diffusion of heat is greater than that of salt. Therefore, the upward portion is warmed and the downward portion is cooled. (c) Consequently, the upward portion attains a positive buoyancy force and keeps ascending upward (vertical blue arrow), whereas the downward portion attains a negative buoyancy force, causing its descent downward (vertical red arrow). The motions are aligned horizontally to form a salt finger cell. b Occurrence of diffusive convection. (a) Initially, cold/fresh layer is above, and the warm/salty layer is below. The separating interface is initially at rest. (b) The interface becomes unstable to be wavy because of differences in molecular diffusivity of heat and salt (\(k_{T} \approx 100\;k_{S}\)). The upward portion from the lower layer (w′ > 0, vertical red arrow) has positive temperature and salt anomalies from a surrounding cold and fresh layer (T′ > 0 and S′ > 0). The negative portion from the lower layer (w′ < 0, vertical blue arrow) has negative temperature and salt anomalies from a surrounding warm and salty layer (T′ > 0 and S′ > 0). Lateral molecular diffusion of heat is greater than that of salt. Therefore, the upward portion is cooled and the downward portion is warmed. (c) Consequently, the upward portion receives negative buoyancy force and descends downward (vertical blue arrow), whereas the downward portion obtains positive buoyancy force and ascends upward (vertical red arrow). These upward and downward portions lose or gain heat repeatedly from the surrounding water. These upward and downward motions repeat to produce mixed layers separated by the interface to form a clear diffusive interface

Nakano (2016) analyzed the TurboMAP and CTD/LADCP data at 10 m scale and surveyed ΓDDC for wide ranges in Ri and Reb values, showing that ΓDDC became large as Ri increased and Reb decreased (Fig. 5, also see Eq. 45). Large values of ΓDDC apparently stem from the large values of χT (Eq. 75) in DDC. Previous investigations cited above also showed low ε and high χT values in DDC layers, resulting in large values of ΓDDC. The observed values of Γ are summarized in Table 1. Taken together, it is certain that ΓDDC takes a large value. Thus, in evaluating the eddy diffusivity in the presence of DDC, the use of ΓCT (~ 0.2) should be avoided.

Fig. 5
figure 5

Γ plots on Log(\(R_{eb}\)) − Log(Ri) plane. Data were obtained from the western North Pacific Ocean (Nakano et al. 2014)

Table 1 Examples of direct estimation of Γ in the presence of DDC

6.2 Practical eddy diffusivity estimation

St. Laurent and Schmitt (1999) calculated KT (as shown in Table 2 together with other estimations). They separated the observed layers as favorable to either DDC or CT in order to calculate the percentages of DDC and CT layers. In addition, they obtained weighted averages of diffusivities at almost 100 m depth intervals. Relatively high values (~ 10−4 m2/s) were obtained at 90 m depth, but values were generally lower below the thermocline. Inoue et al. (2007) presented four scenarios for estimating diffusivity and vertical buoyancy flux: (1) CT (2), DDC, (3) a simple average of CT and DDC, and (4) weighted average of CT and DDC. They concluded that scenario (4) provided the best estimation for diffusivity due to DDC and CT (1.56 × 10−5 m2/s for heat, 1.85 × 10−5 m2/s for salt). Nakano et al. (2014) also obtained a relatively small diffusivity value (10−5 m2 /s). Schmitt et al. (2005) estimated a relatively high diffusivity value for salt (> 10−4 m2/s) in the western Tropical Atlantic Ocean using Eq. (5). Ishizu et al. (2012) and Nagai et al. (2015) obtained a high diffusivity value (> 10−4 m2/s) under the Soya Current and the Kuroshio Extension, respectively.

Table 2 Examples of direct estimation of eddy diffusivity in the presence of DDC

7 Concluding remarks

In oceanic regions susceptible to DDC, parameterizations of \(K_{S}^{\text{DDC}}\) and \(K_{T}^{\text{DDC}}\) have been carried out under the assumption that velocity shear is negligible. However, CT is a common feature in the Global Ocean and can coexist with DDC. Therefore, in this note, parameterizations of DDC in oceanic mixing processes are reviewed and their applicability assessed.

The notion of representing DDC in TKE with an inactive CT variable was introduced. The applicability of DDC was investigated using an SMC model. In cases where DDC and CT coexist, the effect of DDC is certainly important but is restricted to a narrow range of Rρ (0.8–1.2). Some DDC parameterizations used in numerical simulations were reviewed in terms of physical empirical validity and applicability. An approximation can be made by combining Rρ and Ri to roughly estimate the eddy diffusivity for SF, but these parameterizations are currently being verified. A mixing coefficient is required to distinguish DDC from CT and is related to Rρ and Ri. The details of this relationship require further scientific study.

Therefore, measurements of Ri, Reb, and Rρ are essential for determining the intensity of mixing due to DDC. When measuring the eddy diffusivity in the ocean interior, it is thus necessary to deploy an ADCP/LADCP or electromagnetic current meter, along with a microstructure profiler. The accumulation of observations gained by these instruments will improve the ability to map eddy diffusivity in the Global Ocean, potentially leading to better parameterization of eddy diffusivity in numerical modeling.