Introduction

Solar radiation is a driver of photosynthesis and evapotranspiration and could become one of the most important sources of electrical energy in the near future [1, 2]. Accurate estimation of spatially continuous long-term solar radiation data is necessary information for solar electricity projects, building design processes, irrigation scheduling strategies, hydrologic models, crop simulation models, and different land management techniques such as reforestation.

Collecting radiation data in most countries is costly, difficult, and involves uncertainties, so solar radiation data are usually only recorded by a few meteorological stations [3]. For this reason, over the last few decades, many models have been developed for representing the spatio-temporal variability of global solar radiation that are applied in design and planning of solar energy systems, including recent satellite-based models, or predictive models based on correlations of solar radiation with sunshine duration, air temperature, cloud observations, and other weather data (Table 1). These spatial models are based on solar geometries and take into account site latitude, local topography, and shadowing effects. However, they use different methods to calculate solar radiation, and they need different data for implementing atmospheric attenuation (Table 1).

Table 1 A brief summary of the spatially based solar radiation models, their corresponded errors and testing locations

Atmospheric attenuation often varies substantially with surface radiation. However, most locations lack long-term accurate radiation, cloudiness, and/or satellite data for many climate stations, which are needed in order to identify and estimate atmospheric attenuation. For example, models such as r.sun [16, 17], Solei-32 [17, 18] and, ESRA [5, 6, 8] calculate the overcast radiation from clear sky values and a clear-sky index called Linke (Table 1, [23, 24]). The Linke must be derived from one of the three methods: (1) from the ratio between measured global radiation and computed values of clear sky global radiation, (2) from other climatologic data such as cloudiness [23], or (3) directly from shortwave surface irradiance measured by satellites [24]. Although satellite-based models may be suitable for solar radiation estimation in large regions, their disadvantages are high cost and lack of historical records, because these methods are comparably new [25].

Sunshine duration is recognized as the most widely used meteorological parameter for solar radiation estimation, in the literature. This variable is more available and gives relatively better results than other variables [26]. However, air temperature data can be used instead, when the sunshine duration is not available at a specific location [27] as air temperature data have been collected at many locations globally and generally for a much longer period relative to any meteorological records [28].

However, using only air temperature, or air temperature along with wind speed or relative humidity data, to predict solar radiation may result in better estimation of solar radiation in clear sky conditions (higher solar radiation values) compared with cloudy sky conditions (lower solar radiation values) [29, 30]. When hourly or other short-term estimation of solar radiation is necessary, for example for solar energy production processes, it may be more practical to start with daily estimations and calculate hourly values form daily values [31]. However, the method of decomposing hourly solar radiation values from daily values also performs well for clear days [29, 30]. Nonetheless, the error caused in cloudy sky (lower solar radiation) conditions may not be very important when using the model for applications such as solar energy generation, because power generation is relatively small in these kinds of conditions and clear days produce most of the outputs from these systems [30]. In those applications, the number of sunny versus cloudy days would be a well-known variable, already taken into account.

Furthermore, empirical models, either unified and continuous or as combinations of empirical models for certain ranges, can estimate solar radiation for specific climate or specific region, because they use empirical coefficients estimated using correlations between global solar radiation and other climate variables for that specific area [27]. Accurate and simple models that are applicable for different types of climates or regions have been used in this way.

According to the spatially based solar radiation models’ average uncertainty reported in Table 1, all these models are comparable with the measured values with a mean bias error (\({\text{MBE}}\)) of about − 5 to 10% except Solar Analyst [17, 19, 20] that shows an under-estimation of about 25% relative to the corresponded observed values and also its reliability decreases in cloudy sky conditions. Myers (2005) [32] reviewed uncertainties in several solar radiation models and concluded that the best model uncertainties are representing the uncertainties in existing measured data which is corresponded to the mean absolute bias error (\({\text{MABE}}\)) of 25–100 W m−2. However, developing models with fewer input parameters under different climate conditions remains a challenge [32].Therefore, a daily global solar radiation model, within the usual range of accuracy, with lower input parameters, with more available input data, and suitable for different climate conditions applied to estimate daily and hourly solar radiation convergence curves can be usefully applied for accurate estimation of solar energy production.

Solar Analyst [17, 19, 20], which has been implemented as a tool in ESRI ArcGIS, estimates solar radiation for any geographical locations specified by a latitude and longitude or for any study area as a sum of Direct Radiation (\(D_{\text{R}}\)) and Diffuse Radiation (\(F_{\text{R}}\)) [20]. More specifically, Solar Analyst [17, 19, 20] considers atmospheric attenuation using direct atmospheric transmissivity \((\tau )\), defined as the proportion of extra-terrestrial radiation (\(H_{0}\)) transmitted as direct radiation at sea level along the shortest atmospheric path, and diffuse fraction \((K_{\text{D}} ),\) which is the fraction of global normal radiation flux that is diffused [20].

Here, we present a daily solar radiation estimation method which uses ambient air temperature, a Digital Elevation Model (DEM), time of year, and monthly radiation estimates from Solar Analyst (\(G_{{{\text{R}}_{\text{m}} }}\)) [17, 19, 20]. Our objective was to provide improved, spatially and more widely applicable estimation using air temperature-based empirical models for atmospheric transmissivity (\(K_{\text{T}}\)) and diffuse fraction to vary total monthly radiation estimation from Solar Analyst [17, 19, 20] and then calculate total daily radiation (\(G_{{{\text{R}}_{{d_{\text{TRAD}} }} }}\)) as a fraction of total monthly radiation by applying a daily transmissivity-based ratio using TRAD (temperature-estimation of radiation) daily model, as air temperature data is readily available at most locations on the planet. Our model stems from the observation that the difference between maximum and minimum air temperature \(\left( {\Delta T} \right)\) has a strong correlation with daily average shortwave radiation [33]. Furthermore, hourly solar radiation can be decomposed from the daily values using a ratio between daily and hourly radiation [34, 35].

Methods

TRAD (temperature-estimation of radiation) daily model

Our presented TRAD daily model assumed that the daily variations in irradiation for each month are a function of daily variations in atmospheric transmissivity, the proportion of daily irradiance reaching the earth’s surface to the daily extra-terrestrial insolation, in the same month. Irradiation is the amount of solar energy arriving on a unit area over a stated time interval (Wh m−2) [36]. We proposed TRAD model to estimate daily irradiation values in any ground station by Eq. 1:

$$G_{{{\text{R}}_{{d_{\text{TRAD}} }} }} = \left[ {{\raise0.7ex\hbox{${K_{{{\text{T}}_{{d_{{{\text{B}} - {\text{C}}}} }} }} }$} \!\mathord{\left/ {\vphantom {{K_{{{\text{T}}_{{d_{{{\text{B}} - {\text{C}}}} }} }} } {\mathop \sum \nolimits_{d = 1}^{n} K_{{{\text{T}}_{{d_{{{\text{B}} - {\text{C}}}} }} }} }}}\right.\kern-0pt} \!\lower0.7ex\hbox{${\mathop \sum \nolimits_{d = 1}^{n} K_{{{\text{T}}_{{d_{{{\text{B}} - {\text{C}}}} }} }} }$}}} \right] \times G_{{{\text{R}}_{\text{m}} }}$$
(1)

where \(G_{{{\text{R}}_{{d_{\text{TRAD}} }} }}\) is the irradiation for the day in question (Wh m−2), \(K_{{{\text{T}}_{{d_{{{\text{B}} - {\text{C}}}} }} }}\) is the corresponding atmospheric transmissivity of that day calculated using the Bristow and Campbell (1984) model [33] by only using maximum and minimum air temperature inputs, \(\sum\nolimits_{d = 1}^{n} {K_{{{\text{T}}_{{d_{{{\text{B}} - {\text{C}}}} }} }} }\) is sum of all days atmospheric transmissivity values in the corresponding month, and \(G_{{{\text{R}}_{\text{m}} }}\) is the total monthly irradiation (Wh m−2) from Solar Analyst [17, 19, 20]. \(G_{{{\text{R}}_{\text{m}} }}\) is estimated using a DEM, latitude and longitude of the study location and, average annual \(K_{{{\text{T}}_{{d_{{{\text{B}} - {\text{C}}}} }} }}\) at sea level (\(\left( {K_{{{\text{T}}_{{d_{{{\text{B}} - {\text{C}} ({\text{sl}})}} }} }} } \right)\) [33] and also average annual daily diffuse fraction calculated by Carroll (1985) model \(\left( {K_{{{\text{D}}_{{d_{\text{Ca}} }} }} } \right)\) [37].

Values of \(G_{{{\text{R}}_{{d_{\text{TRAD}} }} }}\)(Wh m−2) were calculated for different study locations for their available observed daily irradiance (W m−2) record period. Daily irradiance (W m−2), instantaneous energy received on a unit area per unit time [36] was calculated from \(G_{{{\text{R}}_{{d_{\text{TRAD}} }} }}\) (Wh m−2). Following, we bring descriptions of daily atmospheric transmissivity and diffuse fraction models [33, 37] and also Solar Analyst [17, 19, 20] which is used in the daily irradiation estimation using TRAD.

Daily atmospheric transmissivity and diffuse fraction models

Daily atmospheric transmissivity using the Bristow and Campbell (1984) model used to estimate daily diffuse fraction using Carroll’s model [37]. \(K_{{{\text{T}}_{{d_{{{\text{B}} - {\text{C}}}} }} }}\) and \(K_{{{\text{D}}_{{d_{\text{Ca}} }} }}\) then used to calculate average annual \(K_{{{\text{T}}_{{d_{{{\text{B}} - {\text{C}} ({\text{sl}})}} }} }}\) and average annual \(K_{{{\text{D}}_{{d_{\text{Ca}} }} }}\) in order to estimate \(G_{{{\text{R}}_{\text{m}} }}\) values using Solar Analyst [17, 19, 20] (Eq. 1). Then, \(K_{{{\text{T}}_{{d_{{{\text{B}} - {\text{C}}}} }} }}\) and \(G_{{{\text{R}}_{\text{m}} }}\) were used in TRAD model to calculate \(G_{{{\text{R}}_{{d_{\text{TRAD}} }} }}\).

Bristow and Campbell (1984) [33] proposed the following model for daily atmospheric transmissivity using the difference between maximum and minimum air temperature \((\Delta T)\):

$$K_{{{\text{T}}_{{d_{{{\text{B}} - {\text{C}}}} }} }} = A \times \left( {1 - \exp \left( { - B \times \Delta T^{C} } \right)} \right)$$
(2)

In this equation, \(A\), \(B\), and C are empirical coefficients. Although empirical in nature, these coefficients represent the physics involved in the relationship, where \(A\) represents the maximum clear sky atmospheric transmissivity characteristics of the study region, which varies with elevation and pollution content of the air. Bristow and Campbell (1984) [33] developed and tested their method using data from three different sites in the northwestern United States and showed that this method provides accurate estimates of daily atmospheric transmissivity at these stations. Thornton and Running (1998) [38] reformulated the Bristow–Campbell model [33] using daily measured temperature, humidity, and precipitation to calculate daily solar radiation in 40 stations in locations with different climates. Their model gives a spatially and temporally variable estimation of \(A\). However, Bristow and Campbell (1984) [33] used a constant value of \(A\) over all their study stations. \(B\) and \(C\) display the partitioning of energy which is characteristic of the region of interest. Bristow and Campbell (1984) [33] found it adequate to hold \(C\) constant at 2.4 for their study sites and consider \(B\) related to monthly mean \(\Delta T\) via Relation 3:

$$B = 0.036 \times \exp \left( { - 0.154 \times \overline{\Delta T} } \right)$$
(3)

In our study, \(A\), \(B\) and \(C\) were applied as they were used in Bristow and Campbell (1984) [33]. \(K_{{{\text{T}}_{{d_{{{\text{B}} - {\text{C}}}} }} }}\) was corrected based on elevation of the stations by the following equation, in order to use in Solar Analyst to estimate \(G_{{{\text{R}}_{\text{m}} }}\) [17, 19, 20]:

$$K_{{T_{{d_{{{\text{B}} - {\text{C}} ({\text{sl}})}} }} }} = K_{{T_{{d_{{{\text{B}} - {\text{C}}}} }} }} - (t_{\text{lapse }} \times Z ),$$
(4)

where \(K_{{{\text{T}}_{{d_{{{\text{B}} - {\text{C}}\;({\text{sl}})}} }} }}\) is the atmospheric transmissivity at sea level, \(t_{\text{lapse }}\) is typically equal to 0.00008 m−1 and \(Z\) is elevation in m above the sea level [21, 39].

Liu and Jordan (1960) [34] suggested that diffuse fraction should be a well-behaved function atmospheric transmissivity. Carroll (1985) [37] proposed two relations for diffuse fraction based on atmospheric transmissivity. We used atmospheric transmissivity from Bristow and Campbell (1984) [33] model in Carroll’s model [37]. The following equations show how we estimated diffuse fraction using Carroll’s model [37] and \(K_{{{\text{T}}_{{d_{{{\text{B}} - {\text{C}}}} }} }}\).

$${\text{for cloud}} - {\text{free conditions:}}\;K_{{{\text{D}}_{{d_{\text{Ca}} }} }} = 0.88 - 1.024 \times K_{{{\text{T}}_{{d_{{{\text{B}} - {\text{C}}}} }} }}.$$
(5)
$${\text{and for cloudy conditions:}}\;K_{{{\text{D}}_{{d_{{{\text{Ca}}}} }} }} =\min \left\{ {\begin{array}{l} {1.11 - 1.16 \times K_{{{\text{T}}_{{d_{{{\text{B}} - {\text{C}}}} }} }} } \\ {{\text{or}}} \\ {1.0} \\ \end{array} } \right. .$$
(6)

We applied the thresholds proposed by Colli et al. (2016) [40] for classification of daily average sky conditions of a ground station (Table 2). The model was evaluated for three different groups of days based on different sky conditions including cloudy days, partly cloudy days and sunny or clear days (Table 2).

Table 2 Daily atmospheric transmissivity thresholds for day classification [40]

Application of solar analyst

Total annual global solar radiation in monthly intervals \(\left( {G_{{{\text{R}}_{\text{m}} }} } \right)\) was calculated using Solar Analyst [17, 19, 20] by a DEM, latitude and longitude of the study location and also, average annual \(K_{{{\text{T}}_{{d_{{{\text{B}} - {\text{C}} ({\text{sl}})}} }} }}\) and average annual \(K_{{{\text{D}}_{{d_{\text{Ca}} }} }}\).

Four calculations are contained in the upward-looking hemispherical algorithm applied in Solar Analyst [17, 19, 20]: viewshed, sunmap, skymap calculation, and a concluding calculation that uses the previous three calculations to estimate a solar radiation value. The total amount of global solar radiation in Wh m−2 is obtained by the sum of direct solar radiation (Wh m−2) for all sunmap sectors and diffuse solar radiation for all skymap sectors (Wh m−2) [20]. The direct solar radiation from a sunmap sector with a centroid at solar zenith angle (degrees) and solar azimuth angle (degrees) is a function of the solar constant (1367 W m−2), \(\tau\), the elevation above sea level (meters), the time duration represented by the sky factor, the surface zenith angle (degrees), the surface azimuth angle (degrees), and the gap fraction for the sunmap sector. Correspondingly, diffuse solar radiation for each sky sector at the same centroid is related diffuse fraction, the elevation above sea level (meters), the bounding zenith angles of the sky sector (degrees), the number of azimuthal divisions in the sky map, the time interval for analysis, fraction of visible sky for the sky sector, and the angle of incidence between the centroid of the sky sector and the intercepting surface (degrees). However, providing a correct value for \(\tau\) is difficult because it is sensitive to the presence of clouds [17]. For this reason, we applied \(K_{{{\text{T}}_{{d_{{{\text{B}} - {\text{C}} ({\text{sl}})}} }} }}\) and \(K_{{{\text{D}}_{{d_{\text{Ca}} }} }}\) in the calculation of \(G_{{{\text{R}}_{\text{m}} }}\) using Solar Analyst [17, 19, 20].

Observed daily atmospheric transmissivity

In the present study, observed daily atmospheric transmissivity on a horizontal surface was calculated using Eq. 7 which defines atmospheric transmissivity \(\left( {K_{\text{t}} } \right)\) as a ratio of a day’s radiation \(\left( {G_{\text{R}} } \right)\) to that day’s extra-terrestrial radiation \((H_{0} )\)

$$K_{\text{t}} = \frac{{G_{\text{R}} }}{{H_{0} }}.$$
(7)

The value of \(G_{\text{R}}\) (W m−2) is provided using the available pyranometer measurements, but \(H_{0}\) (J m−2) is calculated by the following method [41]

$$H_{0} = \frac{{24 \times 3600G_{\text{sc}} }}{\pi } \left( {1 + 0.033\cos \frac{360n}{365}} \right) \times \left( {\cos \phi \cos \delta \sin \omega_{\text{s}} + \frac{{\pi \omega_{\text{s}} }}{180}\sin \phi \sin \delta } \right)$$
(8)

\(G_{\text{sc}}\) is the solar constant (1367 W m−2), \(D\) is the number of the day of the year starting from January 1 = 1, and \(\phi\) is the latitude. \(H_{0}\) then converted to W m−2 to use in Eq. 7.

The declination \(\delta\) is found from the equation below [41] (23.45 is the Earth’s rotational axis vector to the ecliptic plane in degrees):

$$\delta = 23.45 \sin \left( {360\frac{284 + D}{365}} \right).$$
(9)

The sunset hour angle \(\omega_{\text{s}}\) in degrees from horizontal is calculated using the following equation [41]:

$$\cos \omega_{\text{s}} = - \tan \phi \tan \delta .$$
(10)

Hourly solar radiation model

Accurate hourly solar radiation data are necessary in many studies, for example in solar photovoltaic projects. We can apply a ratio to the estimated daily solar radiation values from the previous steps (Eq. 11) and decompose the hourly solar radiation values. Liu and Jordan (1960) [34] improved the study by Whillier (1956) [42] to calculate the ratio of hourly to daily global solar radiation using a set of regression curves. Collares-Pereira and Rabl (1979) [31] validated their approach and presented Eq. 11 for estimating this ratio, which only needs site latitude and day number;

$$r_{\text{G}} = \frac{\pi }{24}\left( {a + b\cos \omega } \right)\frac{{\cos \omega - \cos \omega_{\text{s}} }}{{\sin \omega - \omega_{\text{s}} \cos \omega_{\text{s}} }}$$
(11)

where \(r_{\text{G}}\) is the ratio of hourly to daily global radiation, \(a\) and \(b\) are dependent on sites (Eq. 12 and 13), \(\omega\) is hour angle in degrees and defined as the angular displacement of the sun east or west of the local meridian due to rotation of the earth on its axis at 15°/h, morning negative, afternoon positive, and \(\omega_{\text{s}}\) is the sunset hour angle in degrees that is calculated by Eq. 10.

$$a = 0.4090 + 0.5016\sin \left( {\omega_{\text{s}} - 1.047} \right)$$
(12)
$$b = 0.6609 - 0.4767\sin \left( {\omega_{\text{s}} - 1.047} \right).$$
(13)

Equation 11 has been developed based on data from measurement sites in the US, but it also has been validated for 13 Indian locations by Hawas and Muneer (1984) [43] and 16 sites in the UK [35].

Data availability for verification

The Surface Radiation Budget Network (SURFRAD) was established in 1993 in the US [44] and became an official part of the Global Climate Observing System (GCOS) in April 2004 [45]. Currently, seven SURFRAD stations are operating in diverse climate regions in the USA, in Montana, Colorado, Illinois, Mississippi, Pennsylvania, Nevada, and South Dakota (Table 3 and Fig. 1). The stations are not surrounded by trees or any other obstacles, but may be located in areas with small shrubs or agriculture.

Table 3 SURFRAD network information
Fig. 1
figure 1

A geographical map of SURFRAD stations located in the US

An upward-looking pyranometer measures total global solar radiation (W m−2) on its main platform. Accuracy of the pyranometer is about 5%, which is achieved by the standards of calibrations and operations recommended by the Baseline Surface Radiation Network (BSRN), sponsored by the World Climate Research Program (WCRP) of the World Meteorological Organization (WMO). SURFRAD data are available into daily files of one- or three-minute data. We extracted daily maximum and minimum air temperature (°C) and mean irradiance (W m−2) from the one- or three-minute data records for the available historical time period at each station.

A small number of spurious values in air temperature and irradiance records were removed from the analysis. Irradiance data that had negative values due to cooling of the thermopile near dusk and dawn [45] and values higher than the extra-terrestrial solar radiation were filtered out as well. Missing data were not used in the analysis (Table 4). However, the amount of missing data was small (Table 4) as the data have already been quality controlled by SURFRAD (Table 4) [44]. Furthermore, in order to analyze the results in both a large and a small sample size in terms of years, we used 2015 as a case study. A 30 × 30 m DEM covering all the SURFRAD sites was applied [46].

Table 4 The time period and data used in the study

The extracted observed daily maximum and minimum air temperature (°C) data were used in the proposed solar radiation method to estimate daily mean irradiance (W m−2, Eqs. 16) in order to compare it with the observed mean irradiance (W m−2) at different SURFRAD stations. The observed daily mean irradiance (W m−2) was also used to find observed daily atmospheric transmissivity (Eqs. 710) to compare it with the calculated daily atmospheric transmissivity values from the Bristow and Campbell (1984) model [33] (Eqs. 24).

Statistical validation methods

In the present study, the predictive efficiency of the model was tested using the mean bias error \(({\text{MBE}})\) and the mean absolute bias error \(({\text{MABE}})\). These terms are defined by the following equations;

$${\text{MBE}} = \frac{1}{n}\mathop \sum \limits_{i = 1}^{n} \left( {y_{i} - x_{i} } \right)$$
(14)
$${\text{MABE}} = \frac{1}{n}\mathop \sum \limits_{i = 1}^{n} \left| {y_{i} - x_{i} } \right|$$
(15)

where \(x_{i}\) is the \(i\) th measured value, \(y_{i}\) is the \(i\) th calculated value, and \(n\) is the total number of observations. The \({\text{MBE}}\) is a measure of the systematic error of a model. It evaluates the tendency of a model to under- or over-estimate the measured values and for an accurate model is equal to zero [47]. The \({\text{MABE}}\) is a measure of the goodness of the fit for a model, and a natural measure of average error and a good test for inter-comparisons of the average model performance error. For precise data modeling, \({\text{MABE}}\) should be close to zero [47]. We also provide the Pearson correlation coefficient between the observed and modeled data. The Pearson correlation produces a sample correlation coefficient, \(r\), which measures the strength and direction of linear relationships, negative or positive, between paired continuous variables.

Results and discussion

All years

Daily radiation verification

Pearson correlation test results showed a strong positive correlation between the measured and estimated irradiance values and this correlation increased from cloudy to sunny days (Table 5). However, exceptions were Fort Peck and Penn State, which had very small differences in correlation between partly cloudy and sunny days (Table 5). Considering all the stations together for the entire study years, in case of cloudy days, there was a significant positive correlation (r = 0.55; Table 5) and partly cloudy and sunny days showed a stronger positive correlation (r = 0.85 and r = 0.89, respectively; Table 5).

Table 5 Statistical comparison of the observed daily atmospheric transmissivity and daily irradiance against the corresponding modeled values using \({\text{MBE,}}\;{\text{MABE}}\), and Pearson correlation coefficients (r) between the observed and modeled values for each study site individually, and also for all the seven sites together in their total study years (\({\text{MBE}}\) and \({\text{MABE }}\) in % are related to the mean observed value)

The result showed the \({\text{MABE}}\) decreased significantly from cloudy (about 112%) and partly cloudy (about 38%) to sunny days (about 18%) (Table 5). This error was more than three times larger under cloudy sky conditions than partly cloudy and more than six times larger in cloudy days than sunny days (Fig. 3b).

\({\text{MBE}}\) calculations showed the model over-estimated the measured daily irradiance for cloudy days (about 88%) (Table 5; Fig. 3d). In partly cloudy conditions, the over-estimation (about 16%) slightly decreased compared to the cloudy days (Tables 5; Fig. 3d). However, clear or sunny days showed under-estimations relative to the observed values (about 8%) (Tables 5; Fig. 3d). The only exception was Fort Peck station with under-estimations on partly cloudy days (Fig. 3d).

The results showed that the model is less able to account for partly cloudy and cloudy days. However, it gives a reasonable estimation for solar radiation during the clear and sunny days, taking into account the average 8% percent under-estimation or 18% error in overall model performance. One of the likely sources of error is using a value for maximum atmospheric transmissivity of 0.7 in Bristow–Campbell model (1984) [33], although this value is known to differ with elevation and air particle content [33]. Variable maximum atmospheric transmissivity can be applied using the reformulated Bristow–Campbell model [33] by Thornton and Running (1998) [38]; however, their model uses precipitation and humidity data plus air temperature data. The \({\text{MABE}}\) and \({\text{MBE}}\) in estimating daily solar radiation in clear and sunny days using the presented model in this study, which are around 18 and 8%, respectively, are reasonable compared to the same errors in estimation daily solar radiation by Thornton and Running (1998) [38] which are 15 and 4%, respectively. The average \({\text{MBE}}\) of our model (8%) on clear and sunny days is also within the range of \({\text{MBE}}\) of the Solei-32 [17, 18], Solar Analyst [17, 19, 20], SRAD [17, 21] and r.sun [16, 17] models in the same sky condition, which is under 10%. However, our model depends mainly on air temperature data, whereas other models need accurate radiation, cloudiness, and/or satellite data as input. TRAD model (Eq. 1) is also a function of total monthly radiation estimated by Solar Analyst [17, 19, 20]. Atmospheric attenuation for calculating total monthly radiation in Solar Analyst [17, 19, 20] was applied using the estimated average annual \(K_{{{\text{T}}_{{d_{{{\text{B}} - {\text{C}} ({\text{sl}})}} }} }}\) and \(K_{{{\text{D}}_{{d_{\text{Ca}} }} }} \varvec{ }\) may affect the results and consequently the daily solar radiation estimation in TRAD model (Eq. 1). Furthermore, the accuracy of the measured data should be considered (accuracy of the pyranometer is about 5%).

Desert Rock station showed the best overall model performance (\({\text{MABE}}\) about 17%) between all other stations when considering all sky conditions (Table 5). This is because this station had the highest average percentage of clear and sunny days (89%) (Fig. 2).

Fig. 2
figure 2

Average percentage of each sky condition in different sites

Daily atmospheric transmissivity verification

The Pearson correlation test between the estimated and measured daily atmospheric transmissivity values in cloudy and partly cloudy conditions applied for all the study years showed that there was a low positive correlation between the estimated and measured values in all stations, except for Desert Rock and Boulder in cloudy days and Sioux Falls in partly cloudy days (Table 5). These correlations also increased from cloudy and partly cloudy to sunny days in all stations except in Fort Peck (Table 5).

The \({\text{MABE}}\) for daily atmospheric transmissivity is considerably higher for cloudy days (about 135%) than partly cloudy days (about 40%) (Table 5; Fig. 3a). The \({\text{MBE}}\) test showed that the Bristow–Campbell model [33] over-estimated the measured daily atmospheric transmissivity for cloudy days (118%) (Fig. 3c). Over-estimations for partly cloudy days (about 25%) decreased compared to the cloudy days (Fig. 3c). However, clear or sunny days showed under-estimations (about 9%) (Fig. 3c). The only exception was Fort Peck station with over-estimations on partly cloudy days (Fig. 3c).

Fig. 3
figure 3

\({\text{MABE}}\) (%) of the estimated daily atmospheric transmissivity and estimated daily irradiance (a and b) and also \({\text{MBE}}\) (%) of the estimated daily atmospheric transmissivity and estimated daily irradiance (c and d) for each sky condition in different sites for the entire study time period

The result showed that the Bristow and Campbell (1984) [33] model gives better estimation (about 15% error) of daily atmospheric transmissivity for sunny days than for partly cloudy or cloudy sky conditions, which also describes the performance of the daily solar radiation model using TRAD model (Eq. 1) in sunny and clear days. The average \({\text{MBE}}\) of Bristow and Campbell (1984) [33] considering all study years and sky conditions together is around 4% (Table 5).

Year 2015

Daily radiation verification

The result of Pearson correlation tests showed that there always was a strong positive correlation between the measured and estimated irradiance values, and this correlation increased from cloudy to sunny days in year 2015 (Table 6). However, exceptions were very small differences in correlation between partly cloudy and sunny days in Fort Peck and Penn State (Table 6).

Table 6 Same as Table 5 in 2015

The result showed that the \({\text{MABE}}\) declined significantly from cloudy (105%) and partly cloudy (about 38%) to sunny days (17%) (Table 6). This error was more than three times higher under cloudy days than partly cloudy and more than six times higher in cloudy sky conditions than sunny (Fig. 4b).

Fig. 4
figure 4

\({\text{MABE}}\) (%) of the estimated daily atmospheric transmissivity and estimated daily irradiance (a and b) and also \({\text{MBE}}\) (%) of the estimated daily atmospheric transmissivity and estimated daily irradiance (c and d) for each sky condition in different sites for 2015

\({\text{MBE}}\) result indicated that the model over-estimated the measured daily irradiance for cloudy days (about 83%) (Tables 6; Figs. 4d and 5a–h). In partly cloudy days, the over-estimation (about 19%) decreased relative to the cloudy conditions (Table 6; Figs. 4d and 5a–h). However, clear or sunny sky condition showed under-estimations relative to the observed values (6%) (Table 6; Figs. 4d and 5a–h). The only exception was Fort Peck station with under-estimations in partly cloudy days (Figs. 4d and 5a–h).

Fig. 5
figure 5

Comparison of the measured versus predicted daily irradiance (W m−2) for Bondville, IL (a), Boulder, CO (b), Desert Rock, NV (c), Fort Peck, MT (d), Goodwin Creek, MS (e), Penn State, PA (f), Sioux Falls, SD (g) and all the SURFRAD stations (h) in 2015, as an example of a single year

Daily atmospheric transmissivity verification

In 2015, there was no correlation between the estimated and measured daily atmospheric transmissivity values in cloudy and partly cloudy days in any stations, except there were significant correlations in Fort Peck and Penn State in cloudy days and Goodwin Creek and Penn State in partly cloudy days (Table 6). On sunny days, there was a positive correlation in all stations except Fort Peck and Bondville (Table 6).

The \({\text{MABE}}\) for daily atmospheric transmissivity was considerably higher for cloudy days (145%) than partly cloudy days (42%) (Table 6; Fig. 4a). The \({\text{MBE}}\) test showed Bristow–Campbell model [33] over-estimated the measured daily atmospheric transmissivity for cloudy days (about 130%) (Fig. 4c). Over-estimations for partly cloudy days (about 28%) decreased compared to the cloudy days (Fig. 4c). However, clear or sunny days showed under-estimations (about 9%) (Fig. 4c). The only exception was Fort Peck station with over-estimations on sunny sky conditions (Fig. 4c).

Conclusions

In this study, we presented a method for estimating daily solar radiation using only maximum and minimum air temperature, topography, and time of year. We designed the method based on TRAD, a daily solar radiation model that used estimated daily atmospheric transmissivity from the Bristow and Campbell (1984) [33] model and calculated total monthly solar radiation values by Solar Analyst [17, 19, 20]. We used estimated average annual atmospheric transmissivity at sea level and diffuse fraction according to the models of Bristow and Campbell (1984) [33] and Carroll (1984) [37] to calculate total monthly solar radiation using Solar Analyst [17, 19, 20]. In addition, hourly solar radiation values can be calculated using the ratio between daily and hourly radiation estimates suggested in the previous literature, combined with the estimated daily insolation from the method presented in this paper. The method was validated using seven different sites in climatologically different areas over the United States. Overall, results showed that daily solar radiation can be estimated very well with \({\text{MABE}}\) of about 40 to 53 W m−2 or \({\text{MBE}}\) of ± 10% under all sky conditions between the seven sites using the presented method, which is an improvement over previously used methods with \({\text{MBE}}\) of under 10% because the modified approach and model presented here also require significantly less input data. This method can be very useful especially for those stations with substantially higher number of sunny days than cloudy or partly cloudy days, assuming the availability of air temperature data. The estimated values for those days that the model is not able to estimate accurately can be corrected using available measured data. The implemented DEM environment of this method makes it applicable in many studies that need spatial estimation of solar radiation, especially solar energy generation projects.