Introduction

Multiphase flow correlations are routinely used to calculate pressure (and temperature) drop in the tubing and flow lines for production and injection wells. The pressure drop calculations have many applications in artificial lift design and optimization, well deliverability forecasting, and flow assurance applications. Over several decades, many investigators developed multiphase flow correlations to calculate pressure drop in tubing at variety of conditions. Each correlation was derived for certain conditions. Several comparison studies between different correlations for specific sets of data are also available in the literature. Predicting multiphase flow pressure drop can sometimes present a challenge for practicing engineers.

Multiphase flow occurs due to the variations of phases that flow through one pipe. These phases may be oil, water, and gas. Due to the different properties of gas compared with oil and water for viscosities and other properties, the gas tends to flow faster than oil and past the liquid in the upward flow (gas slippage). The in situ liquid volume in the pipe is defined as the liquid holdup.

The pressure drop calculations depend on the properties of the mixture of liquid including oil and water in addition to the gas. The total pressure drop is equal to the summation of the hydrostatic, friction, and acceleration pressure drops. The hydrostatic term depends on the mixture density, which is a function of the liquid holdup. The friction term depends on the friction factor for the mixture. The acceleration term appears due to the change in velocity resulting from the change in pressure. The friction calculations depend on the flow regime, which in turn depends on the superficial velocity of liquid and gas phases.

The most common flow regimes for vertical wells include single liquid flow, bubble, slug, churn, annular, and mist flow. The possible flow regimes for horizontal wells usually include dispersed bubble, annular, slug, stratified wavy, and stratified smooth.

Over the years, both empirical correlations and mechanistic models were derived to calculate the pressure drop in pipes. Empirical correlations can be classified into three categories (Ansari et al. 1990):

  1. 1.

    Correlations that assume no slip and do not consider flow patterns. Examples include Poettmann and Carpenter (1952) and Fancher and Brown (1963).

  2. 2.

    Correlations that consider the slip but not the flow patterns. Examples include Gray (1978) and Hagedorn and Brown (1964 and 1965), etc.

  3. 3.

    Correlations that consider both the slip and the flow patterns. Examples include correlations by Duns and Ros (1963) and Beggs and Brill (1973).

The mechanistic models, however, usually include some physical phenomena with analytical relations to predict the flow patterns. In these models, mostly analytical expressions are used to calculate the different parameters for each flow regime. Examples for mechanistic models include Aziz et al. (1972) and Ansari et al. (1990).

Although many correlations to calculate the pressure drop in pipes are available, clear rules on how to select appropriate correlations do not exist in the absence of actual pressure measurements. In this work, an expert system is developed to select the best correlation(s) for specific well and flow conditions.

Expert system development

The first step in developing the expert system was to collect a large database of pressure drop points covering a wide variety of well and flow conditions.

Abd El-Moniem (2016) and Abd El-Moniem and El-Banbi (2015) collected bottom-hole flowing pressure data points from both the literature from Ashiem (1986), Baxendell and Thomas (1961), Chierici et al. (1974), Hill and Wood (1994), Peffer et al. (1988) and Reinicke et al. (1987) and actual oil wells covering different well and flow conditions. The current study used around 2730 oil well data points for developing the expert system and 144 different data points for its validation. The range of the data is listed in Table 1.

Table 1 Range of data used in developing and validating the expert system

To cover all the points in the database, more than 560 well models were developed using a commercial pipe modeling software. We performed around 1042 simulation runs and calculated the predicted bottom-hole flowing pressure using 14 different correlations for each point. We, then, calculated the error from each correlation for each point.

Table 2 shows the average and mean absolute percent error for each correlation through the entire database. The minimum and maximum absolute percent errors are also shown. The mean absolute percent error ranges from 12.67 to 57.52 for the best and worst correlation over the entire dataset. More importantly, the table shows large range of error between minimum and maximum errors for all correlations.

Table 2 Average error and average absolute error of the different fourteen correlations for the entire database

As shown in Figs. 1 and 2 as examples, we show the error for two selected correlations for all the data points in the database to show the correlation performance for different points. The large variations in the error (some points show very small error, while other points show very large error); we can conclude that no correlation is good for all the points within the database. However, each correlation has reasonable prediction capabilities for some range of well and flow conditions. Our objective was to divide the data into clusters (groups) and to identify the best correlation(s) within each data cluster.

Fig. 1
figure 1

Hagedorn and Brown correlation performance through the entire database

Fig. 2
figure 2

Mukherjee and Brill correlation performance through the entire database

The database was divided into clusters according to well depth, well geometry and tubing size as shown in Fig. 3.

Fig. 3
figure 3

Clusters map

Shallow depth is for points that are less than 2000 ft. TVD, deep is for pressure points that are more than 9000 ft. in depth, and the moderate depth is between.

We, then, divided the data into sub-clusters according to production criteria. Table 3 shows the selected criteria based on oil rate, water cut, and GOR. Therefore, each cluster of the major ones in Fig. 3 was further divided into smaller segments (sub-cluster) representing the different production conditions given in Table 3. The selection of cluster and sub-cluster ranges was essentially arbitrary. However, we wanted to make sure enough data points were available for every cluster and sub-cluster. We also wanted to have the cluster and sub-clusters bounded by easy values to remember.

Table 3 Sub-clusters production criteria for oil wells [after Abd El-Moniem and El-Banbi (2015)]

These sub-clusters within the clusters were given different code numbers (reference numbers) as listed in Table 4. Each segment of data is then defined by which cluster it lies in and its reference number (sub-cluster number). Table 4 also contains the number of data points available for every sub-cluster. The number of data points is given in the green cells. The sub-clusters represented by the red cells did not contain data points.

Table 4 Sub-clusters reference number and data points for oil wells

A Microsoft Access program was developed for the entire dataset to store the actual values of pressure, calculated values from all 14 correlations, and the error for every point. The Access Database was then used to extract a total of 131 sub-clusters for tubular flow and identify the best correlation(s) in each sub-cluster.

The clusters (13) are divided based on geometry (well depth, well deviation, and ID). The sub-clusters division is based on flow conditions (oil rate, WC, and GOR). According to these divisions, we should theoretically have 13 × 36 sub-cluster. However, many sub-clusters did not contain data (e.g., it is not possible to have high oil rate wells with small ID, and it is not possible to have high GOR wells with high oil rate and high WC). The sub-clusters that contained enough data points to draw conclusions were 131.

As listed in Table 4, some sub-cluster contained large number of data points, and others contained fewer data points. We further classified the sub-clusters according to how many points they had into three categories:

  1. 1.

    Strong sub-cluster containing more than 20 points.

  2. 2.

    Moderate sub-cluster containing between 5 and 20 points.

  3. 3.

    Weak sub-cluster containing less than 5 points.

The identification of different levels of correlation strength was covered by defining a correlation “strength factor” and covered in Abd El-Moniem and El-Banbi (2015). They also reported the error for every correlation in every sub-cluster.

Results

The detailed results of sub-clusters can be found in Abd El-Moniem (2016). A sample of the results is listed in Table 5. The tables given in the paper and the appendix are sufficient to fully utilize the expert system.

Table 5 Sample of complete results for one sub-cluster [after Abd El-Moniem and El-Banbi (2015)]

Application of the expert system

The use of the expert system is based on selecting the appropriate cluster as shown in Fig. 3. The selection of the cluster is based on (1) well type (vertical, deviated, or horizontal), (2) tubing size, and (3) depth. The following step is to select the sub-cluster based on (4) oil production rate, (5) gas/oil ratio (GOR), and (6) water cut as listed in Table 4. The sub-cluster is determined from Fig. 3 and Table 4. The detailed sub-cluster tables to determine the best correlation, its strength, and expected error range are given in “Appendix A.” An example on how to use the expert system is also given in “Appendix B.”

We also developed a VBA (Visual Basic code) for Microsoft Excel to facilitate the selection of the appropriate correlation. The user can input the six inputs required by the expert system, and the VBA code will automatically find the following information: (1) the cluster and sub-cluster where the user data point is located, (2) the multiphase correlation(s) that should be used to predict the pressure drop for this particular data point, (3) the expected error of this best correlation(s) in the database, and (4) the strength factor for the selected correlation(s) within the sub-cluster. The authors can share the VBA code with interested readers.

An example run of the VBA code is shown in Fig. 4. The expert system run for this example case came up with two best correlations (not one). The strength factor for the first correlation (MB) is 43%, and the strength factor for the second correlation (HYDRO) is 35%. It is the same for the mean absolute percent error (1.54% for MB and 2.22% for HYDRO).

Fig. 4
figure 4

Expert system best correlation/strength factor/mean absolute percent error output window

Validation of the expert system

A new dataset with 144 points from 62 wells was used in validating the expert system. All these data points were not used in developing the expert system. The validation dataset was distributed among 48 sub-clusters of data. All 14 correlations were run for all 144 points, and the best correlation (and its error) was determined for each point. The expert system was also run for all 144 points, and the best correlation selected by the expert system was compared to the best correlation based on the error calculation. The expert system validation was done on two levels. First, Table 6 shows a summary for all 144 points, and it shows that the expert system was able to predict the best correlation in 70% of the cases. Secondly, the predicted correlation by the expert system was used to compute the pressure drop for every point in the 144 points. Equations (1) and (2) are used to calculate the mean absolute percent error and the mean percent error.

$$ {\text{MAPE}} = \frac{100}{n}\mathop \sum \limits_{i = 1}^{n} \left| {\frac{{A_{t} - F_{t} }}{{A_{t} }}} \right| $$
(1)
$$ {\text{MPE}} = \frac{100}{n}\mathop \sum \limits_{i = 1}^{n} \frac{{A_{t} - F_{t} }}{{A_{t} }} $$
(2)

Table 7 shows the error results for the expert system prediction over the 144 points (whether the predicted correlation from the expert system was the best one or not) in comparison with each correlation error over the same data (144 data points). The mean absolute percent error is around 8.8% using the expert system compared to more than 21% for the best single correlation over the validation dataset. The validation results also show that even in the cases where the best correlation was not selected by the expert system (30% of the cases), the selected correlation always produced reasonably good results compared with the actual values (i.e., when the expert system fails to select the absolute best correlation, it will select one of the second best correlations).

Table 6 Expert system validated points status
Table 7 Performance results of the expert system versus all correlations for validation dataset

Sensitivity of the expert system to uncertain input parameters

In this section, we wanted to test the strength of the expert system and how it reacts to inaccurate input data. The measured GOR is usually among the most uncertain parameters in oil well operations. Other parameters that may carry some uncertainty include API gravity of the oil, specific gravity of the gas, and tubing roughness. In each one of these four parameters, a random subset of the database was used to test the effect of the parameter sensitivity on the expert system accuracy. In the sensitivity analysis study, we used a randomly selected dataset out of the database. We made sure the selected dataset covered points from most sub-clusters. The sensitivity analysis work involves changing a parameter (e.g., GOR or API) and running all correlations again with the new value for GOR or API, then extracting the data, calculating errors, and comparing correlations with actual data. Enough points were tested to validate and generalize the conclusions regarding the effect of these parameters.

For each point within the subset of the database, the parameter was perturbed by 10–20% and the expert system was run to predict the best correlation(s). The suggested correlation by the expert system was also used (with the perturbed input parameter) to predict the pressure drop and compared with the actual value of the pressure. The following paragraphs summarize our findings from the sensitivity study.

GOR sensitivity

GOR affects both the input to the expert system and the input to correlation calculations. Four groups of wells were selected: (1) low GOR and high oil rate wells, (2) high GOR and high oil rate wells, (3) low GOR and low oil rate wells, and (4) high GOR and low oil rate wells.

For low GOR and high rate wells, the expert system could predict the best correlation with a success rate of 56%, when the GOR input (one of the sensitive input parameters) was changed by ± 20%. This reflects that changing GOR for low GOR high rate wells has a moderate effect in changing the order of the best correlation.

For high GOR and high rate wells, the expert system could predict the best correlation by 75%. It reflects that changing GOR for high GOR high rate wells by a percentage of ± 20% has a slight effect in changing the order of the best correlation.

For low GOR and low rate wells, the expert system could predict the best correlation by 37.5%. It reflects that changing GOR for low GOR low rate wells by a percentage of ± 20% has a significant effect in changing the order of the best correlation.

For high GOR and low rate wells, the expert system could predict the best correlation 95% of the cases. We can safely conclude that a ± 20% error in GOR for those wells had minor effect in changing the order of the best correlation.

GOR was found to have considerable effect on the calculated pressure drop from all correlations, especially for low GOR wells (in both high oil rate and low oil rate wells). This is probably due to changing flow regimes in the calculation of pressure drop. Although GOR here is the producing GOR, solution gas/oil ratio (Rs) is responsible for part of the producing GOR, especially in low GOR wells. Therefore, for low GOR wells, flow regimes may change rapidly due to the release of gas with the decline of pressure and temperature in the tubing and consequently change the order of the best correlation as listed in Table 8.

Table 8 Effect of different sensitive parameters on correlation prediction

Other parameters sensitivity

Other parameters were investigated to check their uncertainty on the expert system and the error of the calculated pressure by the selected correlation(s). These parameters include API, gas specific gravity and tubing roughness.

Changing the API of the oil by ± 10% had slight effect on both the expert system selection and the calculated pressure drop by the selected correlation. In more than 90% of the cases, the expert system was successful in picking up the best correlation even with the ± 10% error in API.

For gas specific gravity, changing gas specific gravity with ± 10% had minor effect on changing the order of the best correlation as the expert system predicted the best correlation with 90%. The exception was for low GOR low rate wells, as the expert system could obtain the best correlation with only 37.5%.

For tubing roughness, the expert system could obtain the best correlations 95% of the time. The exception was for high GOR high rate wells, where the expert system could predict the best correlation with only 37.5%.

The selection of the range of sensitivity analysis parameters was mainly based on measurement errors in operations of oil fields. GOR measurement error may reach ± 20%, while API and specific gravity of gas are usually estimated with much more certainty. Table 8 summarizes the results of the sensitivity analysis of these parameters and their effect on changing the order of the best correlation and consequently the capability of the expert system to predict the best correlation even with some measurements errors.

Conclusions

We utilized a large dataset to test the accuracy of 14 correlations among the most widely used multiphase flow correlations. We also used an approach to take advantage of the fact that each correlation is strong under certain conditions of well geometry, flow conditions, and PVT properties. The approach is based on data clustering. The following conclusions can be made:

  1. 1.

    The overall mean absolute percent error is around 12.7% for the best multiphase flow correlation in the entire dataset, while the range of errors for the best correlation(s) in different data sub-clusters is from 0.01 to 3% for most cases with accurate PVT.

  2. 2.

    In the validation dataset, the mean absolute percent error was 8.8% when the expert system was used. This represents approximately one-third of the error value when any single correlation is used over the entire dataset. (Error for using single correlation ranges from more than 21 to 29%.)

  3. 3.

    From the sensitivity runs, we found that GOR, API, and gas specific gravity may have different degrees of effectiveness on the correlation selection. PVT data are probably one of the most important multiphase flow correlations input data affecting the accuracy of the correlation.

  4. 4.

    In practice, an accurate gas rate should be determined through production test separator especially for low rate oil wells as GOR has important effect on pressure drop results. GOR was found to have impact on changing the best correlations for different clusters of oil data in case of low GOR wells. However, GOR has minor effect on selection of best correlations for high GOR wells.

  5. 5.

    Tubing roughness was found to have small impact on changing the best correlations for different clusters of oil well data (except for high GOR and high oil rate wells).