Abstract
Background
The goals of our study are to determine the most appropriate model for alcohol consumption as an exposure for burden of disease, to analyze the effect of the chosen alcohol consumption distribution on the estimation of the alcohol Population Attributable Fractions (PAFs), and to characterize the chosen alcohol consumption distribution by exploring if there is a global relationship within the distribution.
Methods
To identify the best model, the LogNormal, Gamma, and Weibull prevalence distributions were examined using data from 41 surveys from Gender, Alcohol and Culture: An International Study (GENACIS) and from the European Comparative Alcohol Study. To assess the effect of these distributions on the estimated alcohol PAFs, we calculated the alcohol PAF for diabetes, breast cancer, and pancreatitis using the three abovenamed distributions and using the more traditional approach based on categories. The relationship between the mean and the standard deviation from the Gamma distribution was estimated using data from 851 datasets for 66 countries from GENACIS and from the STEPwise approach to Surveillance from the World Health Organization.
Results
The LogNormal distribution provided a poor fit for the survey data, with Gamma and Weibull distributions providing better fits. Additionally, our analyses showed that there were no marked differences for the alcohol PAF estimates based on the Gamma or Weibull distributions compared to PAFs based on categorical alcohol consumption estimates. The standard deviation of the alcohol distribution was highly dependent on the mean, with a unit increase in alcohol consumption associated with a unit increase in the mean of 1.258 (95% CI: 1.223 to 1.293) (R^{2 }= 0.9207) for women and 1.171 (95% CI: 1.144 to 1.197) (R^{2 }= 0. 9474) for men.
Conclusions
Although the Gamma distribution and the Weibull distribution provided similar results, the Gamma distribution is recommended to model alcohol consumption from population surveys due to its fit, flexibility, and the ease with which it can be modified. The results showed that a large degree of variance of the standard deviation of the alcohol consumption Gamma distribution was explained by the mean alcohol consumption, allowing for alcohol consumption to be modeled through a Gamma distribution using only average consumption.
Keywords:
Alcohol consumption; Empirical distribution; Gamma distribution; LogNormal distribution; Weibull distribution; PopulationAttributable Fraction; Exposure distribution; Upestimation; Per capita consumption; Mean; Standard deviationIntroduction
Alcohol consumption is a component cause [1] for over 200 International Classification of Diseases (ICD10) threedigit codes [2,3]. In other words, a fraction, usually called the PopulationAttributable Fraction (PAF) of the incidence of these diseases, would disappear if exposure to one of the causal components was eliminated [47] (in the case of alcohol, under the counterfactual scenario of every person being a lifetime abstainer). The proportion of the diseases caused by alcohol consumption in a component cause model for a population is determined by both the patterns and volume of alcohol consumption and by the relative risks associated with each exposure level [3,8]. For most major diseases where alcohol plays a role (for example, alcoholattributable cancers, pancreatitis, and cirrhosis of the liver), the average volume of alcohol consumption alone was found to be an adequate predictor of the risk [3,810]; however, some diseases and injuries (for example, ischemic heart disease, unintentional injuries, and intentional injuries) were found to be also dependent on drinking patterns [1114].
The calculation of an alcohol PAF involves a threestage process: 1) estimation of an exposure distribution of alcohol, 2) establishment of the relative risk function, and 3) the solving of the equation for the PAF [15]. Since the distribution of alcohol consumption on an international level has not been agreed upon, the common approach is to estimate the PAF using categorical measurements rather than modeling it in a more mathematically appropriate continuous manner [16,17]. The mathematical expression is as follows:(Formula 1)
where i is the exposure category with baseline exposure or no exposure, i = 0, RR_{i }is the relative risk at exposure level i compared to no consumption, and P_{i }is the prevalence of the j^{th }category of exposure.
When a continuous distribution for the volume of alcohol consumption is used, this calculation can be represented by the following formula:(Formula 2)
where P_{a }is the prevalence of lifetime abstainers, RR_{a }is the relative risk of lifetime abstainers, P_{ex }is the prevalence of former drinkers, RR_{ex }is the relative risk of former drinkers, x is the average volume of alcohol consumption per day, P(x) is the prevalence of alcohol consumption, and RR(x) is the relative risk of drinkers [15]. Although this is the most accurate way to calculate a PAF, it requires that the distribution of alcohol consumption be known. Previous attempts at modeling alcohol consumption using a LogNormal distribution have been criticized for various reasons [18,19]; however, the LogNormal distribution has provided adequate approximations for most applications [20,21]. Recently, more adaptable distributions such as the Gamma distribution have been favored over the LogNormal distribution [15,22], and it has been suggested that a mixing of distributions is needed to separately model the frequency of drinking and the quantity of alcohol consumed [23].
There are two main instruments to monitor alcohol exposure currently used by countries and international organizations: 1) general population surveys and 2) estimates of per capita consumption, where per capita consumption is an aggregate measure of recorded, unrecorded, and tourist per capita consumption of alcohol (derived from sales, production, and other economic statistics) [9,24,25]. These instruments, however, have limitations [26].
There are no available surveys for many countries, and in some cases where they do exist they do not allow for the accurate estimation of the volume of consumption, as these surveys only ask about the absence or presence of drinking [27]. Existing surveys often considerably underestimate real consumption levels [2830] by typically covering only 30% to 60% of alcohol sales [26]. As a result, per capita consumption figures are considered to be a best estimate of overall volume of consumption in a country [31]; however, per capita consumption does not provide any disaggregated statistic and, thus, does not provide age and genderspecific consumption estimates. Since in some instances the risk relationship between alcohol consumption and diseasespecific mortality is dependent on gender as well as on age, alcohol exposure by gender and age is required to estimate the PAF and to calculate the alcoholattributable burden of disease in a population [3].
The problems noted above with respect to surveys lead to an underestimated burden of disease attributable to alcohol consumption when PAFs are calculated from population data without adjustment. As a consequence, methods have been developed to triangulate both average alcohol consumption derived from population surveys and from per capita consumption information [15,26]. However, current PAF calculation methods are based on categorical estimates of consumption with alcohol consumption being corrected by multiplying the two top alcohol consumption categories by the inverse of the estimated undercoverage (per capita consumption/the estimated per capita consumption from the survey) [17]. For most categories of disease where there is an association with volume of alcohol consumption, the doseresponse relationship is nonlinear and, thus, distribution estimates of alcohol consumption by age and gender are required for accurate estimates of alcohol PAFs [3].
Given the recent recognition of the need to strengthen and disseminate information about alcohol as outlined in the World Health Organization's strategy to reduce harmful consumption of alcohol [32], there is a need to find an appropriate model for exposure, prevalence, and distribution of alcohol consumption that can easily be modeled to make the fit more compatible with per capita consumption data and that also has properties that make it possible to estimate the exposure distribution for countries that lack survey data except for estimates of prevalence of abstention. Thus, the first aim of this study is to assess internationally if alcohol consistently follows one of the three wellknown rightskewed distributions, LogNormal, Gamma, or Weibull, and to determine if the chosen exposure distribution has a significant effect on the estimation of a PAF, using the PAFs for pancreatitis, diabetes, and breast cancer as examples. The second aim of this study is to investigate if a global relationship between parameters exists so that a distribution of alcohol consumption can be estimated based on mean alcohol consumption.
Methods
Description of underlying surveys
This study used data from Gender, Alcohol and Culture: An International Study (GENACIS), from the European Comparative Alcohol Study (ECAS), and from the STEPwise approach to Surveillance (STEPS). Survey data were collected for the average volume of consumption for Argentina, Australia (two surveys from Australia were used: Australia and Australia1), Austria, Belize, Brazil, Canada, Costa Rica, Czech Republic, Denmark, Finland, France, Germany, Hungary, Iceland, India, Ireland, Isle of Man, Israel, Italy, Japan, Kazakhstan, Mexico, Netherlands, Nicaragua, Nigeria, Norway, Peru, Spain, Sri Lanka, Sweden, Switzerland, Uganda, United Kingdom, Uruguay, and the United States of America from GENACIS (three surveys from the United States of America were used: USA1, USA2, and USA3; USA1 was a 2001 longitudinal study that surveyed women only, and USA2 and USA3 were 19951996 and 2000 National Alcohol Surveys, respectively); for Finland, France, Germany, Italy, Sweden, and the United Kingdom from ECAS; and for Cameroon, Côte D' Ivoire, Dominica, Democratic Republic of the Congo, Eritrea, Kuwait, Mali, Mozambique, American Samoa, Barbados, Benin, Botswana, Cape Verde, Republic of the Congo, Cook Islands, Indonesia, Madagascar, St. Kitts and Nevis, Swaziland, Zambia, Fiji, Kiribati, Marshall Islands, Mongolia, Nauru, Solomon Islands, Tokelau, Tonga, Vanuatu, Micronesia, and Samoa from STEPS. (For information on sampling methodology and the questions used in GENACIS surveys see [3335], ECAS see [30], and STEPS see [36]). For most of the GENACIS surveys and for the ECAS surveys alcohol consumption was measured by a beveragespecific usual quantityfrequency technique (i.e., asking separate questions on usual frequency of drinking, and then eliciting the usual quantity per drinking occasion), and in the remaining GENACIS surveys alcohol consumption was measured by a global quantityfrequency measure. In the STEPS surveys alcohol consumption was measured in standard drinks consumed in the seven days preceding the survey.
All data from surveys were divided by sex and age into eight age groups; 1524, 2534, 3544, 4554, 5564, 6574, 7584, and 85 +.
Methods for fitting the distributions
As alcohol consumption distributions have been shown to have a unimodal shape, [19,37,38] we evaluated the fit of the LogNormal, Gamma, and Weibull distributions (unimodal distributions commonly used to fit rightskewed empirical data) to determine the most appropriate distribution to model alcohol consumption from national survey data. The LogNormal, Gamma, and Weibull probability densities are similar in shape, but have significantly different tail behaviors. In the past, alcohol consumption has been more commonly modeled by the LogNormal distribution as it is used to model continuous random quantities that are rightskewed and is based on the normal distribution, making it easy to fit, test, and modify [20,21]. Although alcohol consumption is frequently modeled using the LogNormal distribution, empirical distributions often deviate considerably from the LogNormal model. In comparison, the Gamma and Weibull distributions have a scale parameter and a shape parameter, making them more adaptable since the scale parameter can stretch or compress the distribution.
The LogNormal distribution is a function of the mean (μ) and standard deviation (σ) parameters, and describes a random variable x where log (x) is normally distributed. The probability density function of the LogNormal distribution can be expressed as follows:
where x > 0 and ∞ < μ < ∞, σ > 0 The Gamma distribution is characterized by a shape (κ) and a scale parameter (θ), has a mean of κθ and a standard deviation of The probability density function of the Gamma distribution can be expressed as follows:
where x > 0, κ > 0, θ > 0 and Similar to the Gamma distribution, the Weibull distribution is commonly characterized by a shape (γ) and a scale parameter (θ). The Weibull distribution has a mean of and a standard deviation of , where is the Gamma function evaluated at x. The probability density function of the Weibull distribution is expressed as follows:
where x ≥ 0, γ > 0, θ > 0 Maximum likelihood estimation was used to fit all three distribution models to the drinking population data obtained from GENACIS and ECAS. All missing values were excluded from the fitted models. The NewtonRaphson algorithm was used to optimize the likelihood equations solving for the maximum likelihood estimates of the unknown parameters [39]. Data values of alcohol consumption over 300 g/day were truncated to 300 g/day. Numerical integration utilizing the trapezoidal rule was used to characterize each distribution.
Method for deriving the alcohol PAF
We performed a sensitivity analysis where the alcohol PAFs for pancreatitis, diabetes, and breast cancer were calculated using a continuous model (LogNormal, Gamma, and Weibull) and using a categorical model in order to see if the chosen exposure distribution had an effect on the estimation of the alcohol PAF. All PAFs were calculated with zero alcohol consumption as the counterfactual scenario, similarly to the Comparative Risk Analysis for alcohol. This counterfactual scenario under certain circumstances of a light drinking average alcohol consumption without heavy drinking occasions may not reflect the theoretical minimum risk depending on the distribution of diseases and cause of death in a society. However, for this paper these considerations are not relevant. The relative risks of lifetime abstainers and former drinkers for pancreatitis, diabetes, and breast cancer were obtained from the metaanalysis [4042].
In order to illustrate that the alcohol PAF estimates based on the Gamma distribution model deviated only slightly from the PAF derived from the categorical model, we calculated the difference between the PAFs calculated for both models.
Methods for characterizing the gamma distributions
The Gamma distribution can be characterized by a shape (κ) and a scale parameter (θ), where the mean and the standard deviation of the Gamma distribution can be obtained directly from the parameter estimates as follows:
Since the mean of the Gamma distribution is equal to the mean of the empirical distribution, the mean of the Gamma distribution does not need to be estimated from the shape and scale parameters.
A maximum likelihood algorithm (see description above) was used to obtain the shape and scale parameters using the maximum likelihood function for the shape and scale parameters of the Gamma distribution:
Regression analysis
The maximum likelihood method was used to fit a Gamma model in order to summarize the alcohol consumption of 66 countries by gender and age (in total 851 datasets [422 for women; 429 for men]). After the data was fit by a Gamma model, the relationship between the Gamma mean and the Gamma standard deviation was examined using various general linear models. The performance of the general linear models was then assessed by how well the assumption of homoscedasticity was upheld and based on the distribution of the residuals.
All data analyses were performed in R version 2.13.0 [43].
Results
Modeling alcohol consumption as a distribution
The three distributions, LogNormal, Gamma, and Weibull, were fit to 41 datasets; parameter estimates are outlined in Table 1 for women and in Table 2 for men. The mean and standard deviation estimates from the empirical data and the estimates from each fitted model are summarized in Table 3 for women and in Table 4 for men. When comparing the empirical mean to each distribution's mean, we observed that the mean estimates from the Weibull distribution were much closer to the empirical mean than were the LogNormal distribution mean estimates, while the mean estimates from the Gamma distribution were equal to the empirical mean. When comparing the standard deviation estimates, the estimates from the LogNormal distribution deviated furthest from the empirical data, while there was no statistically significant difference between the empirical standard deviation estimate and the standard deviation estimates from either of the Weibull or the Gamma distributions.
Table 1. Parameter estimates from LogNormal, Gamma, and Weibull models for women from 43 datasets
Table 2. Parameter estimates from LogNormal, Gamma, and Weibull models for men from 41 datasets
Table 3. Mean and standard deviation estimates from the empirical data, LogNormal model, Gamma model, and the Weibull model for alcohol consumption of women from 43 datasets
Table 4. Mean and standard deviation estimates from the empirical data, LogNormal model, Gamma model, and the Weibull model for alcohol consumption of men from 41 datasets
Three countries with diverse economic conditions and drinking patterns, namely Germany, Sri Lanka, and Uganda, were selected to display their density curves (LogNormal, Gamma, and Weibull) superimposed on the populationbased data histograms; see Figures 1, 2, 3, 4, 5, and 6 for both women and men. We observed a common trend among men in Figures 2, 4, and 6: the LogNormal distribution tended to underestimate the number of men who drank 25 g/day to 50 g/day, whereas the Gamma and Weibull distributions accurately estimated alcohol consumption for these populations. A similar trend was observed with respect to women from Germany and Uganda who drank between 10 g/day to 30 g/day and for Sri Lankan women who drank between 0.5 g/day to 2.0 g/day.
Figure 1. Alcohol consumption distribution in grams per day of pure alcohol for women in Germany. Alcohol consumption distribution in grams per day of pure alcohol for women in Germany.
Figure 2. Alcohol consumption distribution in grams per day of pure alcohol for men in Germany.
Figure 3. Alcohol consumption distribution in grams per day of pure alcohol for women in Sri Lanka.
Figure 4. Alcohol consumption distribution in grams per day of pure alcohol for men in Sri Lanka. Alcohol consumption distribution in grams per day of pure alcohol for men in Sri Lanka.
Figure 5. Alcohol consumption distribution in grams per day of pure alcohol for women in Uganda.
Figure 6. Alcohol consumption distribution in grams per day of pure alcohol for men in Uganda.
Alcohol PAF estimates modeled using the LogNormal, Gamma, and Weibull distributions, together with the proportion estimates for lifetime abstainers and former drinkers, are listed in Table 5 for breast cancer (women), Tables 6 and 7 for diabetes (women and men, respectively), and Tables 8 and 9 for pancreatitis (women and men, respectively).
Table 5. Proportion estimates for lifetime abstainers and former drinkers, as well as PopulationAttributable Fraction (PAF) estimates for breast cancer using a categorical model and continuous models (Gamma, LogNormal, and Weibull) for women
Table 6. Proportion estimates for lifetime abstainers and former drinkers, as well as PopulationAttributable Fraction (PAF) estimates for diabetes using a categorical model and continuous models (Gamma, LogNormal, and Weibull) for women
Table 7. Proportion estimates for lifetime abstainers and former drinkers, as well as PopulationAttributable Fraction (PAF) estimates for diabetes using a categorical model and continuous models (Gamma, LogNormal, and Weibull) for men
Table 8. Proportion estimates for lifetime abstainers and former drinkers, as well as PopulationAttributable Fraction (PAF) estimates for pancreatitis using a categorical model and continuous models (Gamma, LogNormal, and Weibull) for women
Table 9. Proportion estimates for lifetime abstainers and former drinkers, as well as PopulationAttributable Fraction (PAF) estimates for pancreatitis using a categorical model and continuous models (Gamma, LogNormal, and Weibull) for men
The alcohol PAF estimates that incorporated the Gamma and Weibull distributions are very similar and, for the most part, are within 1% of one another. Since the LogNormal distribution is known to have a heavy tail, and this study includes data values for alcohol consumption up to 300 g/day, the alcohol PAF estimates from the LogNormal distribution tend to be much larger and unrealistic when compared to the estimates from the Gamma and Weibull distributions.
Overall, the PAF estimates from the categorical model, Gamma model, and Weibull model are relatively similar when the survey data are more compact, but for those countries where data are more spread out, PAF estimates are more susceptible to sampling bias for diseases with a relatively linear or exponential risk relationship with alcohol, such as pancreatitis and breast cancer. For example, for Brazilian men the alcohol consumption prevalence data tend to be very spread out when compared to men from France, leading to a small difference in the PAFs for pancreatitis. However, this trend does not apply when we look at a disease, such as diabetes, that has a Jshaped relative risk function. If we look at the same example, we find that the alcohol PAFs for diabetes provide similar estimates from the categorical model, Gamma model, LogNormal model, and Weibull model for men from both Brazil and France. This is due to the fact that the relative risk functions are exponential for pancreatitis and are Jshaped for diabetes and thus have different properties. The Jshaped curve in some cases leads to a negative PAF (which represents the fraction of deaths prevented) as the risk of diabetes at the population level is less under current levels of alcohol consumption than under the counterfactual scenario of no alcohol consumption.
Characterizing the alcohol consumption gamma distribution
Based on data from GENACIS and STEPS, the mean daily average per capita alcohol consumption among drinkers was estimated to be 7.549 grams for women (the Gamma standard deviation was 9.862) and 18.292 grams for men (the Gamma standard deviation was 22.015) (see Table 10).
Table 10. Descriptive statistics of the alcohol surveys from 66 countries
After analyzing the association between the Gamma mean and the Gamma standard deviation, a strong linear relationship was established. Analysis of the residuals of various general linear models led to the conclusion that a general linear model with a normal distribution and an identity link (i.e., a linear regression model) is the best possible model to characterize the relationship between the standard deviation of the Gamma distribution and the mean of the Gamma distribution. As a statistical interaction was determined to be present by gender for the relationship between the Gamma mean and the Gamma standard deviation, this linear relationship was modeled separately for men and for women.
Figures 7 and 8 illustrate the linear fit for women and men, respectively. The linear regressions indicate that a unit increase in mean alcohol consumption is associated with an increase of 1.258 (95% CI: 1.223 to 1.293) in the standard deviation of the Gamma alcohol consumption distribution for women and 1.171 (95% CI: 1.144 to 1.197) in the standard deviation of the Gamma alcohol consumption distribution for men. Additionally, for women the linear regression indicated that 92.07% of the variation of the standard deviation of the Gamma distribution was explained by the mean, while for men 94.74% of the variation of the standard deviation of the Gamma distribution was explained by the mean.
Figure 7. Regression analysis and scatter plot for the mean and standard deviation of the alcohol consumption Gamma distribution for women.
Figure 8. Regression analysis and scatter plot for the mean and standard deviation of the alcohol consumption Gamma distribution for men.
Regression diagnostics indicated that there were some outliers. For women, two data points from Nigeria and one from Uganda were identified as influential observations, while for men, two observations in Germany and one in Nigeria were identified as influential observations. There was no indication of a lack of homoscedasticity for any of the regression models (Additional file 1).
Additional file 1. Web Appendix. This web appendix includes parameter estimates using nontruncated data for women and men from LogNormal, Gamma, and Weibull models, proportion estimates for lifetime abstainers and former drinkers, as well as Population Attributable Fraction (PAF) estimates for breast cancer, diabetes, pancreatitis using a categorical model and a continuous model. Count, proportion and weighted global proportion estimates for women and men drinkers that drink ≤ 96 g/day, > 96 g/day, ≤ 120 g/day, and > 120 g/day were also included. Proportion estimates for the decomposition of alcohol Population Attributable Fraction (PAF) are listed for breast cancer and pancreatitis consisting of drinkers that drink ≤ 96 g/day and > 96 g/day, ≤ 120 g/day and > 120 g/day, ≤ 150 g/day and > 150 g/day, and ≤ 200 g/day and > 200 g/day using a continuous model (Gamma, LogNormal, and Weibull) for women and men.
Format: DOC Size: 1.7MB Download file
This file can be viewed with: Microsoft Word Viewer
Discussion
Both the Gamma and the Weibull distributions summarized the population distribution of average volume of alcohol consumption more accurately than did the LogNormal distribution. Moreover, for the Gamma and Weibull distributions the ratio of mean to standard deviation was comparable across all countries, irrespective of drinking patterns and the survey measure used to measure alcohol consumption. Overall, both the Gamma and Weibull distributions yield similar PAFs and could be used in descriptive alcohol epidemiology. Although not examined specifically, these outcomes would also apply to PAFs that are calculated when using a counterfactual scenario where alcohol consumption is decreased due to a policy or intervention such as taxation. Since the Weibull distribution is a more complicated distribution and less flexible than the Gamma distribution, and since it is possible to shift the Gamma distribution upwards (necessary in modeling the burden of disease attributable to alcohol consumption), the Gamma distribution is the best distribution for modeling alcohol consumption.
Modeling survey alcohol consumption data alone without correcting the distribution for undercoverage will lead to inaccurate alcohol PAFs as selfreported survey data typically underestimate alcohol consumption based on sales or taxation (e.g., [26]). In other words, alcohol surveys often do not accurately represent the population due to undercoverage where some members of the population are inadequately represented (or excluded) or due to response bias [30]. Accordingly, a method must be developed that will shift the exposure distribution so that it is consistent with per capita consumption data in order to correct for survey bias and allow for a more accurate estimation of the true alcohol consumption distribution and for an accurate comparison of the alcoholattributable burden of disease across countries.
Given the relationship between the mean and the standard deviation of alcohol consumption [15], modeling alcohol consumption using the Gamma distribution, upestimating this distribution using the relationship between the mean and the standard deviation, and using per capita consumption data, allows us to correct for the biases that lead to undercoverage (for specifics on the upshifting methods see [15]) and allows for the estimation of the distribution of alcohol consumption in a country as if it were measured by a survey with a much higher coverage rate. Additionally, based on the relationship between the mean and the standard deviation of the alcohol consumption Gamma distribution, we can use the mean alcohol consumption from sales and taxation data to obtain the κ and θ parameters for the alcohol exposure distribution for those countries where no survey data exist. Due to great variations in the populations surveyed, and in the sampling frame, response rate, and coverage rate for each of the individual surveys within the main survey groups of GENACIS, ECAS, and STEPS, our observations that alcohol consumption can best be modeled through a Gamma distribution and that the mean is highly correlated with the standard deviation of the alcohol consumption Gamma distribution indicate that these results are applicable to a wide range of countries and are valid for population surveys that use different methodologies.
An interesting finding from our study was the identification as outliers of some of the observations from Nigeria. This could be due to multiple factors. The number of observations from Nigeria upon which the mean and the standard deviation of the alcohol consumption Gamma distribution are based are fewer than the number of observations from other countries. A further factor is that the relationship between the mean and standard deviation of the alcohol consumption Gamma distribution for Nigeria may be different when compared to other countries. Given that only some age groups in Nigeria were identified by the regression diagnostics as outliers, it is very likely that these outliers were due to the low number of individuals surveyed in Nigeria. Future research will focus on modeling alcohol consumption by global region (such as by using the 2005 Comparative Risk Assessment regions [44]) to see if there are regional differences in the relationship between the mean and the standard deviation of the alcohol consumption Gamma distribution.
Conclusion
When comparing the LogNormal, Weibull, and Gamma distributions to calculate average consumption of alcohol, the Gamma distribution and the Weibull distribution outperform the LogNormal distribution in fitting the empirical consumption distribution. Of these two distributions, the Gamma distribution appears to be the best choice for modeling as it has two parameters that can easily be shifted to make the fit more compatible with the per capita consumption data, thus making it possible to estimate the exposure distribution of countries with only aggregate per capita consumption reported, as long as prevalence of abstention is known (see [15]). Thus, shifting the mean upwards is possible, as the Gamma distribution can be described by two parameters (mean and standard deviation), which empirically can be reduced to one, as a large degree of variance of the standard deviation of the alcohol consumption Gamma distribution is explained by the mean alcohol consumption. Accurate modeling of alcohol consumption as an upshifted distribution will provide public health decisionmakers with accurate data to assess the impact of alcohol consumption within and across countries and will aid in determining public health priorities and where to allocate resources.
Competing interests
The authors declare that they have no competing interests.
Authors' contributions
TK, GG, and JR conceptualized the overall article. TK, GG, KDS, GG, and JR contributed to the methodology, identified sources for risk relations and exposure, and contributed to the writing. TK performed all statistical analyses. All authors have approved the final version.
Acknowledgements
This paper uses data from Gender, Alcohol and Culture: An International Study (GENACIS). GENACIS is a collaborative international project affiliated with the Kettil Bruun Society for Social and Epidemiological Research on Alcohol and coordinated by GENACIS partners from the University of North Dakota, the University of Southern Denmark, the Charité University Medicine Berlin, the Pan American Health Organization (PAHO), and the Swiss Institute for the Prevention of Alcohol and Drug Problems. Support for aspects of the project comes from the World Health Organization (WHO), the Quality of Life and Management of Living Resources Programme of the European Commission (Concerted Action QLG4CT20010196), the United States National Institute on Alcohol Abuse and Alcoholism/National Institutes of Health (Grant Numbers R21 AA012941 and R01 AA015775), the German Federal Ministry of Health, PAHO, and Swiss national funds. Support for individual country surveys was provided by government agencies and other national sources. The study leaders and funding sources for datasets used in this study are:
Argentina (Myriam Munné, WHO); Australia (Jillian Fleming, National Campaign Against Drug Abuse, National Centre for Epidemiology and Population Health, Australian National University; Paul Dietze, National Health and Medical Research Council (Grant 398500)); Austria (Irmgard EisenbachStangl, Boltzmann Institute); Belize (Claudina Cayetano, PAHO); Brazil (Florence KerrCorrea; Foundation for the Support of Sao Paulo State Research (Fundação de Amparo a Pesquisa do Estado de São Paulo, FAPESP) (Grant 01/031506)); Canada (Kate Graham; Canadian Institutes of Health Research (CIHR)); Costa Rica (Julio Bejarano, WHO); Czech Republic (Ladislav Csémy, Ministry of Health (Grant MZ 23752)); Denmark (Kim Bloomfield, Sygekassernes Helsefond; Danish Medical Research Council); Finland (Pia Mäkelä, National Research and Development Centre for Welfare and Health (STAKES)); France (Francois Beck, National Institute of Prevention and Heath Education (INPES)); Germany (Ludwig Kraus, German Federal Ministry of Health (BMGS) and in cooperation with the Institute for Therapy Research, Munich, Germany); Hungary (Zsuzsanna Elekes, Ministry of Youth and Sport); Iceland (Hildigunnur Ólafsdóttir, Alcohol and Drug Abuse Prevention Council, Public Health Institute of Iceland, Reykjavík, Iceland); India (Vivek Benegal, WHO); Ireland (Ann Hope, Department of Health and Children (HPU)); Isle of Man (Martin Plant, Moira Plant, Isle of Man Medical Research Council; University of the West of England, Bristol); Israel (Giora Rahav, Meir Teichman, Anti Drugs Authority of Israel); Italy (Allaman Allamani, Centro Alcologico, Florence Health Agency, Regional Health Agency of Tuscany); Japan (Shinji Shimizu, Japan Society for the Promotion of Science (Grant 13410072)); Kazakhstan (Bedel Sarbayev, WHO); Mexico (MariaElena MedinaMora, Ministry of Health, Mexico, Office of Antinarcotics Issues; US Embassy in Mexico; National Institute of Psychiatry; National Council Against Addictions; General; Directorate of Epidemiology and Subsecretary of Prevention and Control of Diseases, Ministry of Health, Mexico); Netherlands (Ronald Knibbe, Ministry of Health and Welfare of the Netherlands); Nicaragua (Jose Trinidad Caldera, PAHO); Nigeria (AkanidomoIbanga, WHO); Norway (Sturla Nordlund, Norwegian Institute for Alcohol and Drug Research); Peru (Marina Piazza, PAHO); Spain (Juan C. Valderrama, Dirección General de Atención a la Dependencia, Conselleria de Sanidad, Generalitat Valenciana; Comisionado do Plan de Galicia sobre Drogas, Conselleria de Sanidade, Xunta de Galicia; Dirección General de Drogodependencias y Servicios Sociales, Gobierno de Cantabria); Sri Lanka (Siri Hettige, WHO); Sweden (Karin Bergmark, Ministry for Social Affairs and Health, Sweden); Switzerland (Gerhard Gmel, Swiss Federal Office for Education and Science (Contract 01.0366); Swiss Federal Statistical Office; Uganda (Nazarius Mbona Tumwesigye, WHO); Uruguay (Raquel Magri, WHO); UK (Martin Plant, Moira Plant, Alcohol Education and Research Council; European Forum for Responsible Drinking; University of the West of England, Bristol); US (Sharon C. Wilsnack, Richard W. Wilsnack, Thomas Greenfield, National Institute on Alcohol Abuse and Alcoholism/National Institutes of Health (Grants; R01 AA015775 and R21 AA012941; P50 AA05595; P50 AA05595); University of North Dakota (Subcontract No. 254, Amendment No.2 UND Fund 41530425)).
This paper also uses data from STEPwise approach to Surveillance (STEPS). We would like to thank the following individuals who have supported the concept of STEPS:
Hyppolyte Agbuton, Kingsley Akinroye, Annette Akinsete, Tim Albion, Julia Alfred, Ala Alwan, Ezzat Amine, Krishnan Anand, Craig Anderson, Martha Anker, N.K. Arora, Kjell Asplund, Nahla Baba, Albert Barcelo, Abdul Bari Abdulla, Kidist Bartolomeos, Robert Beaglehole, Mohammed Belhocine, Lydia Bendib, Rafael Bengoa, Ruth Berkelman, Pedro Mas Bermejo, I.P. Bhagwat, Tran Huu Bich, Steve Blair, Leigh Blizzard, Martin Bobak, Pascal Bouvet, Debbie Bradshaw, Joanna Broad, Fiona Bull, Peter Byass, Peter Callan, Dennis Calvert, Lucimar Cannon, Barbro Carlsson, Vikashni Chand, Jie Chen, Bernard Choi, Miriam Claeson, Alberto ConchaEastman, Stephen Corber, Margaret Cornelius, Vera Costa e Silva, Albertino Damasceno, Isabel Danel, Niklas Danielsson, Ian Darnton Hill, N.G. Desai, Abolghassem Djazayery, Hind Djerrari, Annette Dobson, Kathy Douglas, Terry Dwyer, Joan Dzenowagis, Anders Emmelin, Alfredo Espinosa Brito, Sarah Faletoese, Anna FerroLuzzi, Antonio Filipe, Noela Fitzgerald, Limbo Fiu, Sunia Foliaki, Monica Fong, Terrence Forrester, Jayne Fryer, Gauden Galea, Deborah Galuska, Elize Gershater, JeanPierre Gervaisoni, Mariano Bonet Gorbea, Vilnius Grabauskas, Robert Granger, P.C. Gupta, Rajeev Gupta, Djohar Hannoun, Toshihiko Hasegawa, Richard Heller, Susilowati Herman, Dionisio Herreira, Helen Hermann, John Jabbour, Samer Jabbour, Rally Jim, S.K. Jindal, Abraham Joseph, Prashant Joshi, Umesh Kapil, S.K. Kapoor, Oussama Khatib, Robert KimFarley, Hilary King, Makeleta Koloi, Lingzhi Kong, Andrea Kriska, Etienne Krug, Thomas Kurian, Kerry Kutch, Kari Kuulasmaa, Louise Hayes, Gael Kernen, Stevenson Kuartei, Justina Langidrik, H. Latiri, Jerzy Leowski, Dominique LeFévre, Xinhua Li, L. Lili'o, Kipier Lippwe, Alan Lopez, Heather MacDonald, Nancy Macdonald, Sarah MacFarlane, Judith Mackay, Nejma Macklai, U.A. Maga, Blerta Maliqi, JeanClaude Mbanya, Tony Mbewu, Laura McDougall, David McQueen, Shanthi Mendis, George Mensah, Airambiata Metai, Dan Miller, Anoop Misra, V. Mohan, Maristela Monteiro, Alfredo Morabia, D. McDonald Mtotha, David McQueen, Ferdinand Mugusi, Gano Mwarewo, Shakila Naidu, Richard Nesbit, Angela Newill, Nawi Ng, Chizuru Nishida, Robyn Norton, Ayoade OlatunbosunAlakija, Pedro Ordunez, Stipe Orešković, Fred Paccaud, Arvind Pandey, Lili Pasat, Margie Peden, Rachel Pedersen, Janina Petkeviciene, Pirjo Pietinen, Barry Popkin, Rimina Potemkinov, Viliami Puloka, Pekka Puska, Jan Pryor, Mahmudur Rahman, Sawat Ramaboot, Lars Ramstrom, K Srinath Reddy, Peter Redert, Nina Rehn, Claude Renaud, Sylvia Robles, Paz Rodriquez, Gojka Roglic, Salanieta Taka Saketa, Susana Sans, Shekhar Saxena, Cristina Schneider, Jimaima Schultz, Cecilia Sepulveda, Bella Shah, Aushra Shatchkute, Prakash Shetty, N. Short, Padam Singh, S.K. Sinha, Michael Sjöström, Seilini Soakai, Lakshmi Somatunga, C. Sookram, Soeharsono Soemantri, Harley John Stanton, Krisela Steyn, Kathleen Strong, T.N. Sugathan, Hirotu Suzuki, Karen Tairea, Julita Tellei, K.R. Thankappan, Benete Tokanang, Hanna Tolonen, Steve Tollman, O. Tommaso, Thomas Truelsen, Nigel Unwin, Ulla Uusitalo, Cherian, Barbora Vozarova, Godfrey Waidubu, Stig Wall, Lepani Waqatakirewa, Franklin White, Derek Yach, Zaida Yadon, Mabel Yap, Helena Zabina, Paul Zimmet. Support from the Governments of Australia, the Netherlands, Sweden, and the United Kingdom toward the development and implementation of the WHO STEPwise approach to Surveillance (STEPS) is also gratefully acknowledged.
References

Murray CJL, Lopez A: Global mortality, disability, and the contribution of risk factors: global burden of disease study.
Lancet 1997, 349:14361442. PubMed Abstract  Publisher Full Text

World Health Organization: International classification of diseases and related health problems, 10th revision (version for 2007). Geneva, Switzerland: World Health Organization; 2007.

Rehm J, Baliunas D, Borges GLG, Graham K, Irving HM, Kehoe T, et al.: The relation between different dimensions of alcohol consumption and burden of disease  An overview.
Addiction 2010, 105:817843. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Eide G, Heuch I: Attributable fractions: fundamental concepts and their visualization.
Stat Methods Med Res 2001, 10:159193. PubMed Abstract  Publisher Full Text

Rothman KJ, Greenland S, Lash TL: Modern Epidemiology. 3rd edition. PA, USA: Lippincott Williams & Wilkins; 2008.

Walter SD: The estimation and interpretation of attributable risk in health research.
Biometrics 1976, 32:829849. PubMed Abstract  Publisher Full Text

Walter SD: Prevention of multifactorial disease.
Am J Epidemiol 1980, 112:409416. PubMed Abstract

Rehm J, Room R, Graham K, Monteiro M, Gmel G, Sempos C: The relationship of average volume of alcohol consumption and patterns of drinking to burden of disease  An overview.
Addiction 2003, 98:12091228. PubMed Abstract  Publisher Full Text

Rehm J, Room R, Monteiro M, Gmel G, Graham K, Rehn N, et al.: Alcohol as a risk factor for global burden of disease.
Eur Addict Res 2003, 9:157164. PubMed Abstract  Publisher Full Text

Gutjahr E, Gmel G, Rehm J: The relation between average alcohol consumption and disease: an overview.
Eur Addict Res 2001, 7:117127. PubMed Abstract  Publisher Full Text

Roerecke M, Rehm J: Irregular heavy drinking occasions and risk of ischemic heart disease: a systematic review and metaanalysis.
Am J Epidemiol 2010, 171:633644. PubMed Abstract  Publisher Full Text

Puddey IB, Rakic V, Dimmitt SB, Beilin LJ: Influence of pattern of drinking on cardiovascular disease and cardiovascular risk factors  a review.
Addiction 1999, 94:649663. PubMed Abstract  Publisher Full Text

Taylor B, Irving HM, Kanteres F, Room R, Borges G, Cherpitel C, et al.: The more you drink, the harder you fall: a systematic review and metaanalysis of how acute alcohol consumption and injury or collision risk increase together.
Drug Alcohol Depend 2010, 110:108116. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Gmel G, Kuntsche E, Rehm J: Risky single occasion drinking: bingeing is not bingeing.
Addiction 2011, 106:10371045. PubMed Abstract  Publisher Full Text

Rehm J, Kehoe T, Gmel G, Stinson F, Grant B, Gmel G: Statistical modeling of volume of alcohol exposure for epidemiological studies of population health: the example of the US.
Popul Health Metr 2010, 8:3. PubMed Abstract  BioMed Central Full Text  PubMed Central Full Text

Rehm J, Mathers C, Popova S, Thavorncharoensap M, Teerawattananon Y, Patra J: Global burden of disease and injury and economic cost attributable to alcohol use and alcohol use disorders.
Lancet 2009, 373:22232233. PubMed Abstract  Publisher Full Text

Patra J, Taylor B, Rehm J, Baliunas D, Popova S: Substanceattributable morbidity and mortality changes to Canada's epidemiological profile: measurable differences over a tenyear period.
Can J Public Health 2007, 98:228234. PubMed Abstract

Duffy JC: The distribution of alcohol consumption  30 years on.
Br J Addict 1986, 81:735741. PubMed Abstract  Publisher Full Text

Skog OJ: The distribution of alcohol consumption. In Part I. A critical discussion of the Ledermann Model. Oslo, Norway: National Institute for Alcohol Research; 1982.

Guttorp P, Hiang H: A note on the distribution of alcohol consumption.

Skog OJ: A note on the distribution of alcohol consumption; Gamma vs Lognormal distributions.
A reply to Guttorp and Song. Drinking and Drug Practices Surveyor 1979, 14:36.

Skog OJ: The tail of the alcohol consumption distribution.
Addiction 1993, 88:601610. PubMed Abstract  Publisher Full Text

Gruenewald P, Nephew T: Drinking in California: Theoretical and empirical analyses of alcohol consumption patterns.
Addiction 1994, 89:707723. PubMed Abstract  Publisher Full Text

Rehm J, Room R: Monitoring of alcohol use and attributable harm from an international perspective.

World Health Organization: Global status report on alcohol and health. Geneva, Switzerland, World Health Organization;

Rehm J, Klotsche J, Patra J: Comparative quantification of alcohol exposure as risk factor for global burden of disease.
Int J Methods Psychiatr Res 2007, 16:6676. PubMed Abstract  Publisher Full Text

Rehm J, Room R, Monteiro M, Gmel G, Graham K, Rehn N, et al.: Alcohol Use. In Comparative quantification of health risks: global and regional burden of disease attributable to selected major risk factors. Edited by Ezzati M, Lopez AD, Rodgers A, Murray CJL. Geneva, Switzerland: World Health Organization; 2004:9591109.

Midanik LT: The validity of selfreported alcohol consumption and alcohol problems: a literature review.
Br J Addict 1982, 77:357382. PubMed Abstract  Publisher Full Text

Midanik L: Validity of selfreported alcohol use: a literature review and assessment.
Br J Addict 1988, 83:10191029. PubMed Abstract  Publisher Full Text

Shield K, Rehm J: Difficulties with telephonebased surveys on alcohol in highincome countries: the Canadian example.
International Journal of Methods in Psychiatric Research 2009, in press.

World Health Organization: Global strategy to reduce the harmful use of alcohol. [http://www.who.int/substance_abuse/activities/globalstrategy/en/index.html] webcite
Geneva, Switzerland: World Health Organization; 2010.

Bloomfield K, Allamani A, Back F, Bergmark KH, Csemy L, EisenbachStang I, et al.: Gender, culture and alcohol problems: a multinational study. An EU concerted Action. Project final report. Berlin, Germany: Institut for Medical Informatics, Biometrics & Epidemiology; 2005.

Bloomfield K, Gmel G, Wilsnack S: Introduction to special issue 'Gender, Culture and Alcohol Problems: a Multinational Study'.
Alcohol 2006, 41:37. Publisher Full Text

Taylor B, Rehm J, Trinidad J, Aburto C, Bejarano J, Cayetano C, et al.: Alcohol, gender, culture and harms in the Americas: PAHO Multicentric Study final report. Washington, D.C.: Pan American Health Organization (PAHO); 2007.

World Health Organization: WHO STEPS Surveillance Manual. Geneva, Switzerland: World Health Organization; 2008.

Ledermann S: Alcool, Alcoolisme, Alcoolisation. Volume Volume I. Paris, France: Presses Universitaires de France; 1956.

Ledermann S: Alcool, Alcoolisme, Alcoolisation. Volume Volume II. Paris, France: Presses Universitaires de France; 1964.

Ypma TJ: Historical development of the NewtonRaphson method.
Society for Industrial and Applied Mathematics 1995, 37:531551.

Baliunas D, Taylor B, Irving H, Roerecke M, Patra J, Mohapatra S, et al.: Alcohol as a risk factor for type 2 diabetes  A systematic review and metaanalysis.
Diabetes Care 2009, 32:21232132. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Corrao G, Bagnardi V, Zambon A, La Vecchia C: A metaanalysis of alcohol consumption and the risk of 15 diseases.
Prev Med 2004, 38:613619. PubMed Abstract  Publisher Full Text

Irving HM, Samokhvalov A, Rehm J: Alcohol as a risk factor for pancreatitis. A systematic review and metaanalysis.
JOP 2009, 10:387392. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

R Development Core Team: R: A Language and Environment for Statistical Computing (version 2.13.0). Vienna, Austria: R Foundation for Statistical Computing; 2011.

Institute for Health Metrics and Evaluation: GBD operations manual final draft. [cited 2011]. Available from: URL: http://www.who.int/healthinfo/global_burden_disease/GBD_2005_study/en/index.html webcite