1 Introduction

The Sacramento/San Joaquin Delta (henceforth referred to as the Delta), located just east of San Francisco, is the confluence of the Sacramento and San Joaquin rivers near the western edge of the Central Valley and east of the Coast Ranges. It provides half of California’s fresh water supply. About 1600 km of levees surrounding low-lying islands now protect agriculture and population in the Delta. If these levees were to fail, brackish water from the San Francisco Bay would inundate the islands and contaminate the fresh water supply. Shaking from local earthquakes may pose a significant threat to the failure of the levees. To develop a more comprehensive seismic risk assessment, we have constructed a shear-wave velocity model of the region around the Delta to permit the computer simulation of shaking in the Delta from nearby earthquakes. We have included the region around the Delta to allow some comparison between structures in the Delta to that just outside and to provide model space that includes local faults that may be locations for potential sources. The goal is to have a model that incorporates basins and basement features that can be correlated with geology and geophysics and that will likely affect the character of shaking in the Delta.

To this end, Fletcher et al. (2016) used ambient noise tomography to determine the spatial distribution of Rayleigh wave group velocities (Campillo and Paul 2003; Shapiro et al. 2005; Fletcher et al. 2016) in the Central Valley, CA centered near the Delta. In this paper, we invert the Rayleigh group velocities to build a 3-D shear-wave velocity model. This paper explores the use of two algorithms; one that employs a linear least squares approach and one that is a global optimization algorithm. Other methods include Barmin et al. (2001) and Shapiro and Ritzwoller (2002) who more generally focus on global datasets (see Romanowicz 2002, for a review). The dispersion data range from about 4 to 18.5 s with depth sensitivity from 1 to 2 km to about 20 km. For this model, the dispersion data did not meet signal to noise criteria below about 4 s, and consequently higher mode data and shallow profiles will be needed to develop a better model for the upper 1 km or so.

2 Regional Geologic Structure

The geologic structure of the Great Valley and the Coast Ranges to the west reflects its history as a subducting margin (Atwater 1970; Constenius et al. 2000; Godfrey and Klemperer 1998) during the Mesozoic and early Tertiary. The various formations exposed in the Coast Ranges, such as the Franciscan, Coast Range ophiolite and Great Valley sequence make up a subduction assemblage from that tectonic environment (Constenius et al. 2000) of Mesozoic age. Starting about 30 My ago subduction of the Farallon Plate ceased (Atwater 1970) and movement on the San Andreas commenced. The Central Valley is mostly made up of the Great Valley sequence, which is marine clastic rocks of late Mesozoic and early Tertiary age (Bartow and Nilsen 1990). The Great Valley sequence was deposited in a forearc basin situated between the eastern Sierran magmatic rocks and the western Franciscan rocks. Now the Great Valley is an oil and gas producing synclinal basin with the deepest point near the western edge. The Great Valley can be split into northern and southern segments that are separated by the Stockton arch, a basement high that crosses the valley just south of the Sacramento/San Joaquin Delta (Bartow and Nilsen 1990). The western margin of the Great Valley appears controlled by active local faults including the Pittsburgh and Kirby Hills faults just east of the Montezuma Slough, the Greenville fault just east of the Calaveras and the Midland and west Tracy faults which are located in the Delta.

3 Rayleigh Wave Group Velocity Model

Fletcher et al. (2016) processed ground motion recordings for ambient noise tomography using an array of 14 broadband seismometers in or near the Delta together with broadband stations from the Berkeley Digital Seismic Network and the Transportable Array of Earthscope; a total of 31 stations (see Fig. 1). The time period covered is between 2006 and 2013 with various configurations to the number of stations and arrays being used (Fletcher et al. 2016). Ambient ground motion data spanning from 8 months to over a year were processed to create dispersion curves between all pairs of stations and then inverted for a 3-D grid of Rayleigh wave velocities as shown in Fig. 2. In general, slower velocities are found in the Central Valley compared to either the Coast Ranges to the west or Sierra Nevada Mountains to the east. The correlation to geologic features, particularly faults at the western boundary of the Central Valley is strong. The array is designed to be centered on the Delta, but also to extend well beyond the Delta so that its structure fits within a geologic context. The model space is 101 × 101 cells that cover a total of 5.0° in each dimension centered on the Delta. Each cell is about 5 km in width (0.05°). Figure 3 (from Fletcher et al. 2016) shows the weights used in the inversion for Rayleigh wave velocities. The weights are determined from the time spent in each cell and reflect the uneven array geometry. In this paper, we invert the dispersed Rayleigh group velocities using two algorithms that complement each other, as one is linear and sensitive to the starting model, whereas the other uses a starting model that is randomly generated and is based on an optimization algorithm.

Fig. 1
figure 1

Landsat map of the Central Valley with seismic stations used in the calculation of Rayleigh wave group velocities

Fig. 2
figure 2

Examples of Rayleigh wave velocities that were inverted for shear-wave velocity. These data are from Fletcher et al. (2016) and are the result of an inversion of all dispersion data between all pairs of stations. See Fletcher et al. (2016) for correlation of changes in Rayleigh wave velocity and geologic features in the Central Valley, CA

Fig. 3
figure 3

Weights used in the inversion for Rayleigh wave velocity. The weights are the cumulative time spent in each cell

4 Linearized, Least Squares Approach

We used an inversion code called SURF96 to invert Rayleigh wave dispersion curves for a model that consists of horizontal homogeneous isotropic layers (Herrmann 2013). The inversion program is an iterative, linearized, least squares approach based on Russell (1987) and has been used by others to invert for structure (Warren et al. 2008). The inversion gives explicit control on whether the depth of the horizontal interfaces are solved for or kept at the starting model. It also allows control over whether a differential smoothing constraint is used. At each node, a dispersion curve is formed using the Rayleigh wave group velocities at the resolved periods. Figure 4 shows a fairly typical dispersion curve for a region (Coast Ranges) with a smooth increase in velocity from a low of 1.6 km/s at 3 s to more than 2.6 km/s at 17 s. This original data (at 1 s spacing) has been interpolated by a factor of four (spacing at 0.25 s) before entering the inversion. The starting model uses a 1-D (vertically variable) velocity structure with 13 homogeneous layers extending to 24.5 km depth and a 14th layer as the half space (see Table 1). The starting model includes some low shear-wave velocities near the surface based on Gibbs et al. (2001) as well as curves from Nafe and Drake (1957). Included in the starting model are values for P-wave velocity, S-wave velocity, density, and Q. The algorithm maintains the Vp/Vs values of the starting model for each layer and inverts for density. Q was set to a high value (500) and not inverted for. The starting model was constructed to provide a modest gradient in properties approximated by the generally increasing values of velocity for each layer. The layers near the surface, in particular, are thinner because of the higher gradient expected near the surface (Gibbs et al. 2001).

Fig. 4
figure 4

Observed dispersion curve, fit to the data using the best-fit model and the corresponding shear-wave velocity model. Both the starting model and ending model are shown

Table 1 Starting Model, SURF96

The program was run twice to accommodate certain profiles that did not invert easily (high final error). The first time the inversion ran in four steps (a step is one command line that instructs the program to invert for velocity or several other possibilities such as layer thickness) and at each step we increased the damping from 0.3 to 1.0 (each time by 0.2 except the last at 0.3) with the goal of achieving a stable final model. In the first step, we use eight groups of commands to alternately solve for velocity and layer thickness or just velocity. At each successive step, we reduce the number of commands, reduce how often it solves for thickness, and increase the damping. This was done to avoid converging too fast on a less than best model. After this attempt we check all nodes for goodness-of-fit. For all nodes where the fit was worse than 98% (in fitting the dispersion curve), the code is rerun with a simpler scheme. In the second run, we used just five groups of commands that solve for velocity or layer thickness and a damping of 0.3. At the end, all nodes (101 × 101) were fit to better than 98%.

Figure 4 also shows the fit to the dispersion data. The fit is remarkably good, though visually the shortest periods are not quite fit as well. The left side of the figure shows the starting model and ending model. For this data, the ending model moved to higher velocities compared to the starting model, and has a sharp gradient near the surface (0.7 km/s). The final model, discussed later, was constructed using the above procedure.

However, this method does not provide a straightforward estimate of uncertainty. Different starting models could be tried, but only some starting models would produce a stable result. Consequently, the models would need to be selected carefully, biasing the result. One tactic would be to use fewer layers, but the number chosen is necessary to produce the expected gradients near the surface. In light of these concerns, we also tested an optimization algorithm that tries many models during a run and selects the best-fitting models to project to the end. These optimization schemes are better at estimating errors because many models that fit a certain criteria (such as goodness-of-fit) can be grouped and for which a variance can then be calculated.

5 Genetic Inversions

To this end, we used the genetic code documented in Yamanaka and Ishida (1996) to invert three nodes for 1-D shear-wave velocity models (Goldberg 1989, Sen and Stoffa 1992) to compare to the previous output. This algorithm is a global optimization system in comparison to a linear, least squares inversion. Genetic algorithms as well as simulated-annealing algorithms have the primary benefit that they are able to find a range of unbiased models that fit to some criteria. In both methods, the starting model is a random model and not sensitive to the user biases. In both methods, a set of parameters have to be set that describe the number of layers and the range of velocities for each layer that the algorithm will search. The genetic code uses an L2 norm for a fit function (the dispersion curve). Yamanaka and Ishida (1996) invert synthetic and actual Love wave data for a 1-D model. This program has been modified to invert Rayleigh wave group velocities. The genetic algorithm starts with a given set of possible values (e.g., a set of values for velocity). In the genetic method, possible choices are coded in binary (for example, the second possible velocity in a range that has three choices would be coded 10). All the binary-coded parameters are then put end-to-end to develop a chromosome (each model). In each generation, the code keeps track of the best-fitting model and mutates the present set of chromosomes (changes a 0 to a 1 and vice versa). The best model which fits the dispersion data with an L2 norm from early generations then proceeds to the next generation. The chromosomes are mutated in two different ways (see Yamanaka and Ishida 1996) to also develop new individuals in each generation while retaining the best model from the previous generation. The program assumes a Vp/Vs ratio that decreases with depth. We changed the original algorithm to Vp = 400 + 1.6Vs. This relationship seemed to match the assumed decrease of Vp/Vs with depth that was input to SURF96. We provide estimates of density in the input files. We tried changing the fitness function to include a penalty for non-smooth models by squaring the difference between velocities of adjacent layers to simulate the effect of differential smoothing in SURF96, but we found that this did not improve the convergence to a best-fit model. Eventually, we just used the best-fit cost function. We run the algorithm for 300 generations with 40 models in each generation.

The method is not strictly comparable to SURF96 as the genetic method does not solve for density and fixes the P-wave velocity according to a prescribed equation. But the input data is the same and the parameters that describe each framework (e.g., number of layers, etc.) are comparable. With these limitations in mind, we chose three sample nodes and ran the genetic algorithm with five different seeds to the random number generator that starts the program at each node. The input data are the values of dispersion for every 0.5 s from 3.5 s to 18 s (31 points). The dispersion curve and the best model out of five for each node is shown in Fig. 5. The parameters used are given in Table 2, which sets the number of generations, the number of models in each generation, the range of various parameters such as shear-wave velocity. To calculate the error, we computed the standard deviation of the five best models from the five runs, which is also plotted in Fig. 5. As an aside, the original program tried to form a consensus model from the best model (closest fit to the original dispersion data) plus all models within some percentage (say 3%) of the error for the best model and then form a final model by averaging all of the models in that set. This did not produce a good-fitting model to the dispersion data, suggesting a consistent bias in good-fitting, but not best-fitting models. Instead, we settle on using the best fit from each of the five runs and comparing those models.

Fig. 5
figure 5

Observed and calculated dispersion curves for three nodes (nodes run from 1 to 101 east–west and 1 to 101 north–south). The blue line is the observed Rayleigh wave group velocity, red is calculated from the best model using the genetic algorithm. The thin red lines in the right-hand plots are estimates of error from calculating the standard deviation of the best models from the five runs. The gray line is the estimate for that node from the SURF96 inversion

Table 2 Parameters for using genetic algorithm

From Fig. 5 left side, it is apparent that the fits to the dispersion curves from running the forward problem for the best model are remarkably good fits to the observed data. Two of the three show a close fit of the velocity values from the genetic algorithm to those from SURF96 (Fig. 5 right side). The genetically derived velocity values for node 40–50 (x and y indices) are slightly higher at depths less than 7 km and lower at depths below 7 km. The error bars nicely bracket the estimates of velocity for node 30–40, but the estimates of velocity fall near the limits for node 40–50. The error bars are remarkably tight for node 50–55. This comparison suggests that the velocity models from SURF96 are close to those from an unbiased global optimization search for equivalent models. Because SURF96 can be run more efficiently we use SURF96 to invert for the final, complete model.

6 Depth Resolution

The sensitivities of the Rayleigh waves to different depth intervals are shown in Fig. 6 (Fletcher et al. 2016). The Rayleigh wave sensitivities are a function of period with the shortest period being more sensitive to the upper few kilometers. The 4 s period peaks between 3 and 4 km depth but is above 0.10 until it is shallower than 1 km (peak is at a value of 0.3 for comparison). Consequently, the data (Rayleigh wave group velocities at around 4 s period) are probably not sensitive to the shear-wave velocity at depths of just a few hundred meters.

Fig. 6
figure 6

Sensitivity of the Rayleigh wave data versus depth. Note that the longer periods are more sensitive to the greater depths. Note that the shortest periods (about 4s) have some sensitivity at shallow depths

7 Results

Maps of spatial averages of the complete velocity model using SURF96 in selected depth intervals are shown in Fig. 7 with local faults. We averaged over 2 km intervals to suppress noise or small scale anomalies. Smoothing has been applied to the 3-D model in addition to the averaging over depth for each image. The kernel for smoothing covers three nodes in each horizontal dimension. This smoothing is done only in the horizontal dimension. There is no smoothing in the plotting function. The algorithm fills out every node in the defined space (which covers most of Central Valley and extends into the foothills of the Sierra Nevada Mountains and the Coast Ranges), whether rays are present or not. Many nodes near the edge of the map are given a default value, which is the average. This is necessary because the algorithm does not know ahead of time which nodes are crossed by rays. The weights, shown in Fig. 3, may be used to judge which nodes have values that are well determined. The map from 1 to 3 km mimics the map of the spatial distribution of 4.5 s Rayleigh wave group velocity (Fig. 2) with low velocities in the Central Valley compared to both the Sierra Nevada and Coast Ranges. Note that the color model shows absolute velocities over a broad range (0.8–4.5 km/s) compared to the model of group velocity variations in Fletcher et al. (2016). In the shear-wave model, we see a low-velocity notch (red in color) north of 38° on the western boundary of the Central Valley between the Kirby Hills and Green Valley faults, which was also prominent in the Rayleigh wave velocity data in Fletcher et al. (2016). Near the Montezuma Hills zone this region coincides with the Rio Vista basin, but the low velocities extend further to the west. In general, the shear-wave model has boundaries that fit well to the orientation and location of local faults in the western edge of the Central Valley. The eastern edge of the Central Valley has a scalloped appearance where low velocities of the upper Central Valley abut faster velocities in the Sierra Nevada foothills. The area around Mt. Diablo at 37.8°N, −121.8°W also appears higher in velocity compared to the surroundings at shallow depths possibly due to more mafic rocks associated with the Mt. Diablo ophiolite (Williams 1984). At greater depths, the average velocities increase and the stark contrast in the Central Valley that is seen in the 1–3 km depth range becomes less prominent in the depth range of 3–5 km and absent at 9–11 km, indicating the basement is more homogeneous across the Coast Ranges, Central Valley, and Sierra Nevada Mountains. Also, shown in Fig. 7 are local faults. The orientation and number of faults suggest the west side of the Central Valley is fault-controlled and the distribution of shear-wave velocities reflects the geometry of faults. For example, the westward notch at 38.2°N is reflected in the orientation of the Cordelia and Green Valley faults apparent in the 1–3 km depth image. There are spots of lower velocities (yellow in color) in the 9–11 km slice that are mostly correlated with the faulting on the west side of the Central Valley. One spot at 38.2°N is associated with the deeper sections of the Rio Vista basin.

Fig. 7
figure 7

Maps of the spatial distribution of shear-velocity at various depth intervals. Each map is an average over 2 km with shallowest on the left. The values of shear-wave velocity have been smoothed in the horizontal dimension using a kernel of 3. Color map ranges from 0.8 to 4.5 km/s. Also shows local faults in the Central Valley and eastern Coast Ranges. The colors near the edge of the figure are default values and Fig. 3 should be used to judge which nodes are well determined. The Rio Vista basin is approximately co-located with the Montezuma Hills zone. The faulting and other features are from DRMS (2006)

The contrast between slow and fast velocities seems less at greater depths with the exception of a lineament of high velocities that trends NNW at the right edge of the array. Because this area is close to the eastern border of the stations used in the ambient noise studies (Fig. 1), the fast anomalies may not be real (one possibility is that there are high velocities in this region, but the shape is not well controlled). This lineament is close to a Central Valley high in aeromagnetics (Jachens et al. 1995), which may reflect an ophiolite terrane.

The slices shown in Fig. 8 show the asymmetric synclinal structure of the Central Valley (see cross-section in supplementary materials), particularly at about 38.2°N as shown in various geologic cross-sections (e.g., Sullivan and Sullivan 2012) which probably reflects a back arc basin associated with the subduction of the Farallon plate. The Rio Vista Basin is apparent in slices of the model shown in Fig. 8. The basin is approximately at 38.2°N and between the Kirby Hills and South Midland faults. However, there is also a significant basin with low velocities near the surface to the west of the Kirby Hills fault. This is the region of the notch previously mentioned and is near the Montezuma Hills. There is a strong gravity low (Fletcher et al. 2016 and Langenheim, personal communication 2016) that marks the Rio Vista Basin, suggesting that the Rio Vista Basin is a major structure. While our data sees the Rio Vista Basin, it identifies another deeper, slower structure in the shallow crust just west of the Kirby Hills fault.

Fig. 8
figure 8

East–west xz slices through the model. The slices are displayed with x (longitude) and z (depth) axes. The slices are arranged in pairs (vertically) and go from the upper left to upper right to lower left. The color model goes from 1 to 4 km/s for each frame

Interesting features of the Central Valley are included in Fig. 9 such as the major rivers of the western slope of the Sierra Nevada Mountains and the Stockton Arch (Barstow 1990), which is a major east–west basement high nar 38°N that demarcates the north Central Valley from the South. The rivers, such as the Cosumnes and Mokelumne, and Stanislaus seem to correlate with slow velocities in the shallow western Sierra Nevada Mountains although the American River does not (it is located near a large patch in the image that is poorly determined). The Stockton fault falls in a slight local high in velocities just north of a patch of lower velocities (dark red in color) directly correlated with the location of the Stockton Arch. The correlation is stronger in the 7–9 km image in Fig. 7. The dark gray patches, which are mostly gas producing zones are correlated with slow velocity regions. The location of the cross-section included in supplementary materials is shown as white line labeled C–D and shows the asymmetrical synclinal form of the Central Valley.

Fig. 9
figure 9

The image from Fig. 7 for the 1–3 km depth and geologic features of the Central Valley such as the Stockton Arch and gas fields (dark patches). The cross-section from supplementary materials is noted as C–D in the figure. Major rivers draining the Sierra Nevada Mountains are noted

Lin et al. (2010) provide a state-wide P- and S-wave velocity model from inverting arrival times at local and regional networks. Unfortunately, the area around the Delta has few estimates of S-wave velocity presumably from lack of adequate ray coverage. However, they show a southwest–northeast cross-section in P-wave velocity (7b in Lin et al. 2010) that is close to the Delta. Lin et al. (2010) find low velocities associated with the shallow Central Valley similar to the ones depicted in Figs. 7 and 8. They also show higher velocities at depths of about 10 km under the Central Valley similar to that shown for 38.4° and 38.2° cross-sections in Fig. 8. Although there can be varying Vp/Vs ratios, the P-wave velocity model of Lin et al. (2010) appears similar to the S-wave model in this study.

8 Conclusions

3-D Rayleigh wave group velocities for the Central Valley, California, have been inverted for shear-wave velocity using a linear, least squares routine and compared with the result from using a genetic algorithm. The region covers most of the central and northern Central Valley centered on the Sacramento/San Joaquin Delta. This model will facilitate the simulation of ground motions in the Delta from sources at local and regional faults. Features of the model show low velocities at shallow depths (less than 3–5 km) in the Central Valley compared to either the Coast Ranges to the west or the foothills of the Sierra Nevada Mountains to the east. This shear-wave velocity model reflects the tectonic history of the region where the Central Valley was a forearc basin from the subduction of the Farallon plate. In particular, low shear velocities are associated with the Rio Vista Basin and extend further to the west near the Potrero and Kirby Hills faults. At greater depths, the contrast is not as strong with spots of low velocities in the eastern Coast Ranges compared to the average. Mt. Diablo is visible as a local high at shallow depths. A lineament of high velocities near the eastern edge of the Central Valley may be caused by an ophiolite body associated with a high in aeromagnetic and gravity data, however, it is near the eastern edge of the array of stations used in the ambient noise tomography, and consequently the spatial resolution is poor in this region. Simulations using this model will permit a better estimate of the seismic hazard to the Delta, one of the California’s most important resources.