Genetic Background and Sex: Impact on Generalizability of Research Findings in Pharmacology Studies

Open Access
Part of the Handbook of Experimental Pharmacology book series (HEP, volume 257)


Animal models consisting of inbred laboratory rodent strains have been a powerful tool for decades, helping to unravel the underpinnings of biological problems and employed to evaluate potential therapeutic treatments in drug discovery. While inbred strains demonstrate relatively reliable and predictable responses, using a single inbred strain alone or as a background to a mutation is analogous to running a clinical trial in a single individual and their identical twins. Indeed, complex etiologies drive the most common human diseases, and a single inbred strain that is a surrogate of a single genome, or data generated from a single sex, is not representative of the genetically diverse patient populations. Further, pharmacological and toxicology data generated in otherwise healthy animals may not translate to disease states where physiology, metabolism, and general health are compromised. The purpose of this chapter is to provide guidance for improving generalizability of preclinical studies by providing insight into necessary considerations for introducing systematic variation within the study design, such as genetic diversity, the use of both sexes, and selection of appropriate age and disease model. The outcome of implementing these considerations should be that reproducibility and generalizability of significant results are significantly enhanced leading to improved clinical translation.


Animal models Genetic diversity Pharmacodynamics Pharmacokinetics Sex  

1 Introduction

There are many perspectives on what defines an “animal model,” but at the most fundamental level, it reflects an animal with a disease or condition with either face or construct validity to that observed in humans. Spontaneous animal models represent the truest form of the definition and are best exemplified by cross-species diseases such as cancer and diabetes where a particular species naturally develops the condition as observed in humans. However even in these models with high face, and seemingly construct validity, care must be taken when extrapolating from the animal phenotype to the human disease as the underlying mechanisms driving the disease may not be identical across species.

Animal models serve two primary purposes. The first use of animal models is to elucidate biological mechanisms and processes. A key assumption in this approach is that the animal species being examined has comparable enough physiology to reasonably allow for extrapolation to human biology and disease states. An extension of the first purpose is to use animal models for estimating efficacy and safety of new therapeutic treatments for alleviating human disorders. In both of these uses, the fidelity of the animal model is critically dependent upon the homology of the physiology between the animal model and human. The best model for human is human, and the greater divergence from human across the phylogenetic scale (e.g., nonhuman primates > rodents > zebrafish > drosophila) introduces increasingly larger gaps in genetic and physiological homology. For complex human-specific disorders such as schizophrenia or Alzheimer’s disease, our confidence in findings from animal models must be guarded as there is not a spontaneous animal model of these human conditions. For instance, besides humans, there is no animal that spontaneously exhibits Aβ plaques and neurofibrillary tangles that define the pathology of Alzheimer’s disease. Moreover, the complex spectrum of cognitive dysfunction and neuropsychiatric comorbidities that these diseases produce cannot be fully recapitulated or assessed (e.g., language impairment) in lower animal species. In such cases, animal models are relegated to attempts in simulating specific symptoms of the disorder (e.g., increasing striatal dopamine in rodents to model the striatal hyperdopaminergia observed in schizophrenia patients and thought to underlie the emergence of positive symptoms) or to model specific pathological processes observed in the human disease (e.g., generation of amyloid precursor protein overexpressing mice to model the Aβ deposition seen in Alzheimer’s disease patients). In this latter example, it is important to note that the translation of transgenic mice Aβ deposition and mechanisms that reduce its accumulation have translated well into human AD patients; however, because this is an incomplete representation of the disease, agents that reduce Aβ deposition in both animals and human AD patients have yet to prove successful in delaying disease progression.

Reproducibility and generalizability are two aspects of preclinical research that have come under much scrutiny over the last several years. Examples of failures to reproduce research findings, even in high-impact journals, are numerous and well described in the literature (Jarvis and Williams 2016). Perhaps the most obvious factor impacting across-lab reproducibility are deficiencies to note important methodological variables of the study. As we discuss later in this chapter, it is surprising how often key experimental variables such as specific strain or sex of animal used are omitted in the methods section. In a direct attempt to improve scientific reporting practices, initiatives such as use of the ARRIVE guidelines (Kilkenny et al. 2010) have been instituted across the majority of scientific journals. Such factors can also affect intra-lab reproducibility; for instance, when a particular student that ran the initial study has left the lab or the lab itself has relocated to another institution and the primary investigator reports challenges in reestablishing the model.

Challenges in generalizability of research findings are best exemplified by noted failures in the realm of drug development in which a novel compound exhibits robust efficacy in putative animal models of a human condition but fails to demonstrate therapeutic benefit in subsequent clinical trials. Indeed, medication development for Alzheimer’s disease has a remarkable failure rate of over 99% with a large percentage of drug development terminations attributed to lack of efficacy in Phase II or Phase III clinical trials (Cummings et al. 2014).

It is interesting to speculate that improvements in reproducibility of preclinical research may not necessarily translate into improved generalizability to the human condition (see Würbel 2002). For instance, close adherence in using the same age, substrain of mouse, husbandry conditions, and experimental details should improve the likelihood of reproducing another lab’s findings. However, it also follows that if a reported research finding is highly dependent upon a specific experimental configuration and the finding is lost with subtle procedural variations, then how likely is the finding to translate to the human condition? Humans are highly heterogeneous in their genetic composition and environmental determinants, often resulting in subpopulations of patients that are responsive to certain treatments and others that are described as treatment resistant. In preclinical research the best balance of improving both reproducibility and generalizability is to institute the inclusion of both sexes and incorporation of another strain or species. This approach will most certainly reduce the number of positive findings across these additional variables, but those findings that are consistent and robust will likely result in increased reproducibility across labs and to translate into clinical benefit. In the sections that follow, we highlight the importance of genetic background and sex in conducting preclinical research.

2 Genetic Background: The Importance of Strain and Substrain

Dating back to the early 1900s, researchers have recognized the value of genetic uniformity and stability of inbred strains, which have provided such benefits as reducing study variability and needed samples sizes and improving reliability of data. To date more than 20 Nobel Prizes have resulted from work in inbred strains, and this knowledge has provided significant medical and health benefits (Festing 2014). Certainly, it continues to be an acceptable strategy to conduct research on a single inbred strain of mice, provided that the context of the results is reported to not suggest that the data are generalizable to other strains and species (e.g., humans). A single inbred strain is not representative of the genetically diverse patient populations and is instead representative of a single genome. Moreover, even different substrains of a common strain of mice (e.g., C57BL/6J, C57BL/6N, C57BL/6NTac) exhibit unique genetic dispositions resulting in surprisingly divergent phenotypes (reviewed in Casellas 2011). Therefore, a major constraint in translational research has been the common practice of limiting preclinical pharmacology studies to that of a single strain of mice.

Within the context of rodent studies, one example where lack of generalizability of strain, substrain, and sex has been well documented is the rodent experimental autoimmune encephalomyelitis (EAE) model of multiple sclerosis (MS). MS is an autoimmune disease caused by demyelination in the CNS, which results in a spectrum of clinical presentations accompanied by progressive neuromuscular disorders and paralysis (reviewed in Summers deLuca et al. 2010 ). In mice, immunization with myelin/oligodendrocyte glycoprotein peptide induces EAE; however the variability of disease presentation across mouse models has been a major hindrance for facilitating drug development. In line with the genetic contributions to MS in human patients, mouse strains and substrains are genetically and phenotypically divergent which introduces heterogeneous loading of risk alleles and variations in phenotypes that contribute to the variability in disease onset and severity (Guapp et al. 2003 ). The MS field is not unique to the challenges of experimental variability resulting from the choice of genetic background in their rodent model and has been documented in most fields of study (Grarup et al. 2014; Jackson et al. 2015; Loscher et al. 2017; Nilson et al. 2000). While known for decades that mouse substrains are genetically and phenotypically diverse from each other, many in the research community are still not aware of this important caveat and the implication on experimental findings.

Case in point, the C57BL/6 mouse strain is one of the most common and widely used inbred strains with many substrains derived from the original lineage and now maintained as separate substrain colonies. The C57BL/6J line originated at the Jackson Laboratory by C.C. Little in the 1920s, and in the 1950s a cohort of mice were shipped to the National Institutes of Health where a colony was established and aptly named C57BL/6N (the suffix “N” refers to the NIH colony, while the “J” suffix refers to the Jackson Laboratory colony) (reviewed in Kiselycznyk and Holmes 2011). At some point spontaneous mutations (i.e., genetic drift) occurred in each of these colonies resulting in these two substrains becoming genetically distinct from each other with recent reports citing >10,000 putative and 279 confirmed variant differences as well as several phenotypic differences between C57BL/6 substrains (Keane et al. 2011; Simon et al. 2013). These genetic and phenotypic differences between substrains are not unique to C57BL/6 as 129 substrains, among others, and also have similar genetic diversity issues that must be considered when reporting and extrapolating research (Kiselycznyk and Holmes 2011). Important to note is that substrain nomenclature alone is not the sole information that identifies genetic and phenotypic diversity. Individual or private colonies established for >20 generations either at a commercial vendor or an academic institution are considered a substrain and hence must adhere to the guidelines for nomenclature of mouse and rat strains as established by the International Committee on Standardized Genetic Nomenclature for Mice (reviewed in Sundberg and Schofield 2010). Laboratory code which follows substrain notation annotates for strain/substrain source including commercial vendor (e.g., C57BL/6NHsd and C57BL/6NTac, respectively, for Harlan and Taconic) and is a critical piece of information to researchers that a substrain may have further genetic variation, as in the case for C57BL/6N, than the original NIH colony. The implication on research findings where failure to understand the role of substrain differences, as well as failures to prevent inadvertent backcrossing of substrains, has been highlighted recently (Mahajan et al. 2016 ; Bourdi et al. 2011; McCracken et al. 2017). In one example, Bourdi and colleagues reported that JNK2−/− knockout mice were more susceptible than their WT controls to acetaminophen-induced liver injury which was in contrast to findings from other laboratories demonstrating that JNK2−/− and inhibitors of JNK were protective from acetaminophen-induced liver injury (Bourdi et al. 2011). Through careful retrospective analysis, the researchers were able to determine that backcrossing on two different background substrains conferred either toxicity or protective effects (Bourdi et al. 2011).

This issue of genetic drift is not unique to mice. For instance, in the study of hypertension and attention-deficit/hyperactivity disorder (ADHD), one of the most studied rat models are the spontaneously hypertensive (SHR) and Wistar Kyoto (WKY) ratlines. In terms of ADHD, the SHR rats display symptoms of inattention, hyperactivity, and impulsiveness in various behavioral paradigms (Sagvolden et al. 2009). However like the C57BL/6 substrains, numerous SHR and WKY substrains have been generated over the years. The SHR ratline was derived originally from a WKY male with marked hypertension and a female with moderate blood pressure elevations. Brother-sister matings continued with selection pressure for spontaneous hypertension. The SHR line arrived at the National Institutes of Health (NIH) in 1966 from the Kyoto School of Medicine. From the NIH colony (SHR/N), SHR lines were derived by Charles River, Germany (SHR/NCrl), and the Møllegaard Breeding Centre, Denmark (SHR/NMol), as well as other institutions over the years. The SHR rat strains exhibit an ADHD-like phenotype, whereas the WKY line serves as a normative control. A problem exists, in that, while the WKY strain was established from the same parental Wistar stock as the SHR line, there is considerable genetic variability among WKY strains because the WKY breeding stock was not fully inbred prior to being distributed to different institutions for breeding which resulted in accelerated genetic drift. A further issue for using the WKY strain as a genetic and behavioral control for the SHR strain is that the inbreeding for the WKY strain was initiated over 10 years later than that of the SHR strain which calls into question the validity of the WKY rats as a proper control for findings in SHR rats (Louis and Howes 1990). As one might expect from such genetic diversity in SHR and WKY lines, findings from both cardiovascular blood pressure and ADHD phenotypes have at times been contradictory, and much commentary has been made about the appropriate selection of controls when studying phenotypes associated with these strains of rats (St. Lezin et al. 1992).

3 Importance of Including Sex as a Variable

The X and Y chromosomes are not the only difference that separates a female from a male. In preclinical studies there has been a pervasively, flawed assumption that male and female rodents have similar phenotypes. Publications that include such general statements as “data were combined for sex since no sex effect was observed” without the inclusion of the analysis, or simply reporting “data not shown” for the evaluation of effects of sex, are unacceptable. From basic physiological phenotypes (e.g., body weight, lean and fat mass) to any number of neuroendocrine, immune, and behavioral phenotypes beyond reproductive behaviors, males and females differ (reviewed in Hughes 2007; Karp et al. 2017). Furthermore, many human diseases affect males and females differently, whereas the influence of sex can affect disease susceptibility, symptom presentation and progression, and treatment outcomes. Well-documented sex differences exist for cardiovascular disease, autoimmune diseases, chronic pain, and neuropsychiatric disorders with females generally having greater incidences than males (reviewed in Regitz-Zagrosek 2012; IOM 2011). Therefore, ignorance of sex-specific effects in study design, phenotypes, pharmacokinetics, pharmacodynamic measures, or interpretation of data without sex as a covariate are failing to provide accuracy in reporting of the data. To this end, in 2014 the NIH issued a directive to ensure that both male and female subjects are represented in preclinical studies, an extension of the 1993 initiative to include women as participants in clinical trials receiving NIH funding (Clayton and Collins 2014).

With respect to animal models used in pharmacology experiments, sex differences in disease presentation and progression have also been reported. For example, while women have a higher prevalence of chronic pain and related disorders, preclinical studies have largely focused on male subjects. Problematically, after hundreds of studies historically employed male mice to study nociceptive responses mediated by the toll-like 4 receptor (TLR4), and subsequent pharmacology studies targeting TLR4 for analgesia, it was later discovered that the involvement of TLR4 in pain behaviors in male mice was dependent on testosterone (Sorge et al. 2011). Therefore, these results and any potential therapeutics for the treatment of pain with a mechanism of action targeting TLR4 could not be generalized to both sexes (Sorge et al. 2011). In another example, the NOD mouse model of Type 1 diabetes has a higher incidence and an earlier onset of diabetes symptoms in females than males (Leiter 1997). Consequently, female NOD mice are much more widely used than males although the incidence in the clinic is nearly 1:1 for males/females which may present a conundrum when potential novel treatments are only studied in a single sex as in the TLR4 experiments highlighted above. Furthermore, in neuropsychiatric disorders whereas major depressive disorder, for example, has a higher incidence in females than males, preclinical studies have largely used only males for testing – even though sex differences in rodent emotional behavior exist (Dalla et al. 2010; Kreiner et al. 2013; Kokras et al. 2015; Laman-Maharg et al. 2018). One of the more common arguments made for not including female subjects in preclinical studies is that they have larger variability, likely contributed to by the estrus cycle. However, a meta-analysis of 293 publications revealed that variability in endpoints using female mice was not greater than those in males, inclusive of variations in the estrus cycle as a source of variability in the females (Becker et al. 2016; Prendergast et al. 2014; Mogil and Chanda 2005). There are, however, baseline differences for males versus females across behavioral phenotypes that further highlight the need to study both sexes and with data analyzed within sex when drug treatment is evaluated in both sexes.

4 Pharmacokinetic and Pharmacodynamic Differences Attributable to Sex

In addition to the observation of sex differences across disease and behavioral phenotypes, sex differences are also commonly observed in pharmacokinetic (PK) and drug efficacy studies; yet for many years test subjects in both clinical and preclinical studies have most commonly been male. A survey of the pain and neuroscience field in the early 1990s revealed that only 12% of published papers had used both male and female subjects and 45% failed to reveal the sex of the subjects included in the studies (Berkley 1992). A later study building on this revealed that between 1996 and 2005 although researchers now reliably reported the sex of their preclinical subjects (97%), most studies (79%) were still performed on male animals (Mogil and Chanda 2005). Although the translatability of preclinical sex differences to human may not always be clear-cut, assessments of these parameters in both sexes can provide additional information during phenotyping and genetic studies, as well as the drug discovery and development process.

In the drug discovery field, there are multiple examples in the clinical literature of sex differences in both measured exposure and pharmacological effect in response to novel drugs. A meta-analysis of 300 new drug applications (NDAs) reviewed by the FDA between 1995 and 2000 showed that 163 of these included a PK analysis by sex. Of these 163, 11 studies showed greater than 40% difference in PK parameters between males and females (Anderson 2005). There are important implications for sex differences in exposure levels. For example, zolpidem (Ambien®) results in exposure levels 40–50% higher in females when administered sublingually (Greenblatt et al. 2014 ). These sex differences in exposure levels for zolpidem were also observed in rats, with maximal concentration (Cmax) and area under the curve (AUC) both significantly higher in females relative to males (Peer et al. 2016). While Ambien was approved in 1992, in 2013 the FDA recommended decreasing the dose by half for females due to reports of greater adverse events including daytime drowsiness observed in female patients (United States Food and Drug Agency 2018 ).

While any aspect of a drug’s pharmacokinetic properties could potentially lead to sex differences in measured drug exposure, sexually divergent differences in metabolism appear to be the most concerning (Waxman and Holloway 2009). In multiple species, enzymes responsible for drug metabolism show sexually dimorphic expression patterns that affect the rate of metabolism of different drugs. In humans, females show higher cytochrome p450 (CYP) 3A4 levels in the liver as measured by both mRNA and protein (Wolbold et al. 2003 ). Studies have also observed higher activity of this enzyme in females (Hunt et al. 1992). In rodents, both the mouse (Clodfelter et al. 2006, 2007; Yang et al. 2006) and rat (Wautheir and Waxman 2008) liver show a large degree of sexually dimorphic gene expression. For instance, rats exhibit a male-specific CYP2C11 expression pattern, whereas CYP2C12 shows a female-specific one (Shapiro et al. 1995). While rodent sex differences may not necessarily translate into similar patterns in humans, the complexity of metabolic pathways underscore the importance of understanding drug exposure in each sex, at the relevant time point, and in the relevant tissue when making pharmacodynamic measurements.

With respect to pharmacodynamics, sex differences exist in functional outcome measures, both with respect to baseline activity in the absence of drug, and in response to treatments. As critically highlighted in the field of preclinical pain research, a meta-analysis reported sex differences in sensitivity to painful stimuli in acute thermal pain and in chemically induced inflammatory pain (Mogil and Chanda 2005). For example, in a study by Kest and colleagues, baseline nociceptive responses and sensitivity to thermal stimuli were examined across males and females of 11 inbred mouse strains (Kest et al. 1999). Results of this study not only revealed divergent phenotypic responses across genotypes for pain sensitivity but also sex by genotype interactions. Moreover, when morphine was administered directly into the CNS, the analgesic effects varied across both strain and sex, further highlighting the importance of including both sexes in pharmacodynamic studies, as well as considering subject populations beyond a single inbred strain. These sex differences are not specific to morphine as they have also been demonstrated in rats and mice for sensitivity to the effects of other mu opioid receptor agonists (Dahan et al. 2008). Importantly, both sexually dimorphic circuitry and differences in receptor expression levels mediating pain perception and pharmacological responses, likely driven by genetics, are suggested to contribute to these differences (Mogil and Bailey 2010).

In clinical pain research, sex differences in pharmacodynamic responses have been highlighted by reports from clinical trials with MorphiDex, a potential medication for the treatment of pain that combined an NMDA antagonist with morphine (Galer et al. 2005). While many preclinical studies demonstrated robust and reliable efficacy, these reports were almost exclusively conducted in male subjects. During clinical trials where both men and women were included, the drug failed to produce any clinical benefit over standard pain medications (Galer et al. 2005). Intriguingly, it was later determined that while the drug was efficacious in men, it was ineffective in women with retrospective experiments in female mice corroborating these data (discussed in IOM 2011; Grisel et al. 2005). Overall, while we may not fully understand the biological underpinnings of sex differences in responses to pharmacology, profiling both sexes in preclinical pharmacology studies should provide insight into the differences and potentially enable better clinical trial design.

5 Improving Reproducibility Through Heterogeneity

While the major attention on the “reproducibility crisis” in biomedical research has generally been focused on the lack of translation related to issues with experimental design and publication bias, recent literature has provided insight to the concept that researchers might be practicing “overstandardization” as good research practices. For example, the considerations for controlling as much as possible within an experiment (i.e., sex, strain, vendor, housing conditions, etc.), and across experiments within a given laboratory in order to enable replication (i.e., same day of week, same technician, same procedure room), have not necessarily been previously considered an issue with respect to contributing to lack of reproducibility. However, as recently highlighted by several publications, this “standardization fallacy” suggests that the more control and homogeneity given to an experiment within a laboratory may lead to the inability for others to reproduce the findings given the inherent differences in environment that cannot be standardized across laboratories (Würbel 2000; Voelkl et al. 2018; Kafkafi et al. 2018). In this respect, there is indeed value in applying various levels of systematic variation to address a research question, both through intra- and interlaboratory experiments. One approach to improve heterogeneity beyond including both sexes within an experiment and extending experimental findings to multiple laboratories (interlaboratory reproducibility) is to also introduce genetic diversity. While it may be cost prohibitive to engineer genetic mutations across multiple lines of mouse strains in a given study, one could alternatively employ strategically developed recombinant mouse populations such as the Collaborative Cross (CC) (Churchill et al. 2004). The CC are recombinant inbred mouse strains that were created by cross breeding eight different common inbred strains resulting in increased genetic and phenotypic diversity. CC lines include contributions from the common inbred C57BL/6J strain as well as two inbred strains with high susceptibility for Type 1 and Type II diabetes, two inbred strains with high susceptibility for developing cancers (129S1/SvlmJ and A/J), and three wild-derived strains (Srivastava et al. 2017). A recent study from Nachshon et al. (2016) highlighted the value of using a CC population for studying the impact of genetic variation on drug metabolism, while Mosedale et al. (2017) have demonstrated the utility of CC lines for studying potential toxicological effects of drugs on genetic variation in kidney disease (Nachshon et al. 2016 ; Mosedale et al. 2017).

6 Good Research Practices in Pharmacology Include Considerations for Sex, Strain, and Age: Advantages and Limitations

Improving translation from mouse to man requires selection of the appropriate animal model, age, and disease-relevant state. Behavioral pharmacology studies with functional outcome measures that planned for enablement of translational efficacy studies should include pharmacokinetics and PK/PD modeling in the animal model at the pathologically relevant age. It should not be expected that PK data in young, healthy subjects would generalize to PK data in aged, diseased subjects or across both sexes. Similarly, pharmacodynamic measures including behavior, neuroendocrine, immune, metabolic, cardiovascular, and physiology may not generalize across age, sex, or disease state. Figure 1 depicts a sample flow diagram of experimental design parameters required for deliberation where species, strain, substrain, age, sex, and disease state are crucial considerations.
Fig. 1

Example flow diagram for preclinical study design

7 Conclusions and Recommendations

Drug discovery in both preclinical studies and in the clinic has only begun to harness the power of genetic diversity. Large-scale clinical trials have focused on recruitment of patients (i.e., enrollment metrics) based on “all comers” symptom presentation for enrollment. It is tempting to believe that at least some of the high clinical attrition of new therapeutic agents can be attributable to a failure to consider patient heterogeneity. It is a common adage that a rule of thirds exist in patient treatment response to a medication: a third of patients show robust efficacy, a third exhibit partial benefit to the agent, and a third are termed “treatment resistant.” One reason that much of the pharmaceutical industry has moved away from developing antidepressant medications is that established antidepressant medications, such as SSRIs, when used as a positive control, do not separate from placebo in 30–50% of the trials, resulting in a “busted” clinical trial (reviewed in Mora et al. 2011). Importantly, the preclinical studies that have enabled these trials have largely used male subjects and frequently in otherwise healthy mice of a single inbred strain such as C57BL/6J mice (reviewed in Caldarone et al. 2015; reviewed in Belzung 2014). It is possible that preclinical studies focused on treatment response in both sexes and in genetically divergent populations with face and construct validity would be in a better position to translate to a heterogeneous treatment resistant clinical population.

Within the last decade, however, as the genetic contributions of diseases become known, precision medicine approaches that recruit patients with specific genetic factors (e.g., ApoE4 carriers at risk for Alzheimer’s disease) to test specific mechanisms of action will continue to evolve over recruitment for “all comers” patients with a diagnosis of Alzheimer’s disease (Watson et al. 2014). In this respect, in animal studies, analogous genetic factors (e.g., mouse model homozygous for the Apoe4 allele), and at an analogous mouse to human age comparison, to test a similar hypothesis are critical.

As previously stated above, the best model for human is human. In drug discovery prior to the FDA enabling clinical trials in humans, it is critical that the best approach to translation is the design and rigorous execution of preclinical pharmacology studies that best mirror the intended patient population. In this respect, for pharmacokinetic and pharmacodynamics studies, careful consideration should be taken for ensuring that the animal model used has face and construct validity, that both sexes are included and at an analogous age relevant to the disease trajectory, and that studies consider gene by environment interactions as ways to improve reliability, reproducibility, and translation from the bench to the clinic.


  1. Anderson GD (2005) Sex and racial differences in pharmacological response: where is the evidence? Pharmacogenetics, pharmacokinetics, and pharmacodynamics. J Womens Health (Larchmt) 14(1):19–29. ReviewCrossRefGoogle Scholar
  2. Becker JB, Prendergast BJ, Liang JW (2016) Female rats are not more variable than male rats: a meta-analysis of neuroscience studies. Biol Sex Differ 7:34. CrossRefPubMedPubMedCentralGoogle Scholar
  3. Belzung C (2014) Innovative drugs to treat depression: did animal models fail to be predictive or did clinical trials fail to detect effects? Neuropsychopharmacology 39(5):1041–1051. CrossRefPubMedPubMedCentralGoogle Scholar
  4. Berkley KJ (1992) Vive la différence! Trends Neurosci 15(9):331–332. Review. PMID: 1382330CrossRefGoogle Scholar
  5. Bourdi M, Davies JS, Pohl LR (2011) Mispairing C57BL/6 substrains of genetically engineered mice and wild-type controls can lead to confounding results as it did in studies of JNK2 in Acetaminophen and Concanavalin a liver injury. Chem Res Toxicol 24(6):794–796. CrossRefPubMedPubMedCentralGoogle Scholar
  6. Caldarone BJ, Zachariou V, King SL (2015) Rodent models of treatment-resistant depression. Eur J Pharmacol 753:51–65. CrossRefPubMedGoogle Scholar
  7. Casellas J (2011) Inbred mouse strains and genetic stability: a review. Animal 5:1–7CrossRefGoogle Scholar
  8. Churchill GA, Airey DC, Allayee H, Angel JM, Attie AD et al (2004) The collaborative cross, a community resource for the genetic analysis of complex traits. Nat Genet 36:1133–1137CrossRefGoogle Scholar
  9. Clayton JA, Collins FS (2014) Policy: NIH to balance sex in cell and animal studies. Nature 509(7500):282–283. PMID: 24834516CrossRefGoogle Scholar
  10. Clodfelter KH, Holloway MG, Hodor P, Park SH, Ray WJ, Waxman DJ (2006) Sex-dependent liver gene expression is extensive and largely dependent upon signal transducer and activator of transcription 5b (STAT5b): STAT5b-dependent activation of male genes and repression of female genes revealed by microarray analysis. MolEndocrinol 20(6):1333–1351. Epub 2006 Feb 9. PMID: 16469768Google Scholar
  11. Clodfelter KH, Miles GD, Wauthier V, Holloway MG, Zhang X, Hodor P, Ray WJ, Waxman DJ (2007) Role of STAT5a in regulation of sex-specific gene expression in female but not male mouse liver revealed by microarray analysis. Physiol Genomics 31(1):63–74. Epub 2007 May 29. PMID: 17536022; PMCID: PMC2586676CrossRefGoogle Scholar
  12. Cummings JL, Morstorf T, Zhong K (2014) Alzheimer’s disease drug-development pipeline: few candidates, frequent failures. Alzheimers Res Ther 6:37CrossRefGoogle Scholar
  13. Dahan A, Kest B, Waxman AR, Sarton E (2008) Sex-specific responses to opiates: animal and human studies. Anesth Analg 107(1):83–95. Review. PMID: 18635471CrossRefPubMedGoogle Scholar
  14. Dalla C, Pitychoutis PM, Kokras N, Papadopoulou-Daifoti Z (2010) Sex differences in animal models of depression and antidepressant response. Basic Clin Pharmacol Toxicol 106(3):226–233. PMID: 20050844CrossRefPubMedGoogle Scholar
  15. Festing MF (2014) Evidence should trump intuition by preferring inbred strains to outbred stocks in preclinical research. ILAR J 55(3):399–404. CrossRefPubMedPubMedCentralGoogle Scholar
  16. Galer BS, Lee D, Ma T, Nagle B, Schlagheck TG (2005) MorphiDex (morphine sulfate/dextromethorphan hydrobromide combination) in the treatment of chronic pain: three multicenter, randomized, double-blind, controlled clinical trials fail to demonstrate enhanced opioid analgesia or reduction in tolerance. Pain 115(3):284–295. Epub 2005 Apr 20CrossRefGoogle Scholar
  17. Guapp S, Pitt D, Kuziel WA, Cannella B, Raine CS (2003) Experimental autoimmune encephalomyelitis (EAE) in CCR2(−/−) mice: susceptibility in multiple strains. Am J Pathol 162:139–150Google Scholar
  18. Grarup N, Sandholt CH, Hansen T, Pedersen O (2014) Genetic susceptibility to type 2 diabetes and obesity: from genome-wide association studies to rare variants and beyond. Diabetologia 57:1528–1541CrossRefGoogle Scholar
  19. Greenblatt DJ, Harmatz JS, Singh NN, Steinberg F, Roth T, Moline ML, Harris SC, Kapil RP (2014) Gender differences in pharmacokinetics and pharmacodynamics of zolpidem following sublingual administration. J Clin Pharmacol 54(3):282–290. Epub 2013 Nov 27. PMID: 24203450CrossRefPubMedGoogle Scholar
  20. Grisel JE, Allen S, Nemmani KV, Fee JR, Carliss R (2005) The influence of dextromethorphan on morphine analgesia in Swiss Webster mice is sex-specific. Pharmacol Biochem Behav 81(1):131–138CrossRefGoogle Scholar
  21. Hughes RN (2007) Sex does matter: comments on the prevalence of male-only investigations of drug effects on rodent behaviour. Behav Pharmacol 18:583–589CrossRefGoogle Scholar
  22. Hunt CM, Westerkam WR, Stave GM (1992) Effect of age and gender on the activity of human hepatic CYP3A. Biochem Pharmacol 44(2):275–283. PMID: 1642641CrossRefGoogle Scholar
  23. IOM (Institute of Medicine US) (2011) Forum on neuroscience and nervous system disorders. Sex differences and implications for translational neuroscience research: workshop summary. National Academies Press, Washington DCGoogle Scholar
  24. Jackson HM, Onos KD, Pepper KW, Graham LC, Akeson EC, Byers C, Reinholdt LG, Frankel WN, Howell GR (2015) DBA/2J genetic background exacerbates spontaneous lethal seizures but lessens amyloid deposition in a mouse model of Alzheimer’s disease. PLoS One 10:e0125897CrossRefGoogle Scholar
  25. Jarvis MF, Williams M (2016) Irreproducibility in preclinical biomedical research: perceptions, uncertainties, and knowledge gaps. Trends Pharmacol Sci 37:290–302CrossRefGoogle Scholar
  26. Kafkafi N, Agassi J, Chesler EJ, Crabbe JC, Crusio WE, Eilam D, Gerlai R, Golani I, Gomez-Marin A, Heller R, Iraqi F, Jalijuli I, Karp NA, Morgan H, Nicholson G, Pfaff DW, Richter H, Stark PB, Stiedl O, Stodden V, Tarantino LM, Tucci V, Valdar W, Williams RW, Wurbel H, Benjamini Y (2018) Reproducibility and replicability of rodent phenotyping in preclinical studies. Neurosci Biobehav Rev 87:218–232. CrossRefPubMedPubMedCentralGoogle Scholar
  27. Karp NA, Mason J, Beaudet AL et al (2017) Prevalence of sexual dimorphism in mammalian phenotypic traits. Nat Commun 8:15475. CrossRefPubMedPubMedCentralGoogle Scholar
  28. Keane TM, Goodstadt L, Danecek P, White MA, Wong K, Yalcin B, Heger A, Agam A, Slater G, Goodson M, Furlotte NA, Eskin E, Nellåker C, Whitley H, Cleak J, Janowitz D, Hernandez-Pliego P, Edwards A, Belgard TG, Oliver PL, McIntyre RE, Bhomra A, Nicod J, Gan X, Yuan W, van der Weyden L, Steward CA, Bala S, Stalker J, Mott R, Durbin R, Jackson IJ, Czechanski A, Guerra-Assunção JA, Donahue LR, Reinholdt LG, Payseur BA, Ponting CP, Birney E, Flint J, Adams DJ (2011) Mouse genomic variation and its effect on phenotypes and gene regulation. Nature 477(7364):289–294. CrossRefPubMedPubMedCentralGoogle Scholar
  29. Kest B, Wilson SG, Mogil JS (1999) Sex differences in supraspinal morphine analgesia are dependent on genotype. J Pharmacol Exp Ther 289(3):1370–1375. PMID: 10336528PubMedGoogle Scholar
  30. Kilkenny C, Browne WJ, Cuthill IC, Emerson M, Altman DG (2010) Improving bioscience research reporting: the ARRIVE guidelines for reporting animal research. PLoS Biol 8:e1000412CrossRefGoogle Scholar
  31. Kiselycznyk C, Holmes A (2011) All (C57BL/6) mice are not created equal. Front Neurosci 5:10. CrossRefPubMedPubMedCentralGoogle Scholar
  32. Kokras N, Antoniou K, Mikail HG, Kafetzopoulos V, Papadopoulou-Daifoti Z, Dalla C (2015) Forced swim test: what about females? Neuropharmacology 99:408–421. Epub 2015 Apr 1. Review. PMID: 25839894CrossRefPubMedGoogle Scholar
  33. Kreiner G, Chmielarz P, Roman A, Nalepa I (2013) Gender differences in genetic mouse models evaluated for depressive-like and antidepressant behavior. Pharmacol Rep 65(6):1580–1590. Review. PMID: 24553006CrossRefGoogle Scholar
  34. Laman-Maharg A, Williams AV, Zufelt MD, Minie VA, Ramos-Maciel S, Hao R, Ordoñes Sanchez E, Copeland T, Silverman JL, Leigh A, Snyder R, Carroll FI, Fennell TR, Trainor BC (2018) Sex differences in the effects of a kappa opioid receptor antagonist in the forced swim test. Front Pharmacol 9:93. PMID: 29491835CrossRefPubMedPubMedCentralGoogle Scholar
  35. Leiter EH (1997) The NOD mouse: a model for insulin-dependent diabetes mellitus. Curr Protoc Immunol 24(Suppl):15.9.1–15.9.23Google Scholar
  36. Loscher W, Ferland RJ, Ferraro TN (2017) The relevance of inter- and intrastrain differences in mice and rats and their implications of seizures and epilepsy. Epilepsy Behav 73:214–235CrossRefGoogle Scholar
  37. Louis WJ, Howes LG (1990) Genealogy of the spontaneously hypertensive rat and Wistar-Kyoto rat strains: implications for studies of inherited hypertension. J Cardiovasc Pharmacol 16(Suppl 7):S1–S5CrossRefGoogle Scholar
  38. Mahajan VS, Demissie E, Mattoo H, Viswanadham V, Varki A, Morris R, Pillai S (2016) Striking immune phenotypes in gene-targeted mice are driven by a copy-number variant originating from a commercially available C57BL/6 strain. Cell Rep 15(9):1901–1909. CrossRefPubMedPubMedCentralGoogle Scholar
  39. McCracken JM, Chalise P, Briley SM et al (2017) C57BL/6 substrains exhibit different responses to acute carbon tetrachloride exposure: implications for work involving transgenic mice. Gene Expr 17(3):187–205. CrossRefPubMedPubMedCentralGoogle Scholar
  40. Mogil JS, Bailey AL (2010) Sex and gender differences in pain and analgesia. Prog Brain Res 186:141–157. Review. PMID: 21094890CrossRefPubMedGoogle Scholar
  41. Mogil JS, Chanda ML (2005) The case for the inclusion of female subjects in basic science studies of pain. Pain 117(1-2):1–5. Review. PMID: 16098670CrossRefGoogle Scholar
  42. Mora MS, Nestoriuc Y, Rief W (2011) Lessons learned from placebo groups in antidepressant trials. Philos Trans R Soc Lond B Biol Sci 366(1572):1879–1888. CrossRefPubMedPubMedCentralGoogle Scholar
  43. Mosedale M, Kim Y, Brock WJ, Roth SE, Wiltshire T, Eaddy JS, Keele GR, Corty RW, Xie Y, Valdar W, Watkins PB (2017) Candidate risk factors and mechanisms for Tolvaptan-induced liver injury are identified using a collaborative cross approach. Toxicol Sci 156(2):438–454. CrossRefPubMedPubMedCentralGoogle Scholar
  44. Nachshon A, Abu-Toamih Atamni HJ, Steuerman Y et al (2016) Dissecting the effect of genetic variation on the hepatic expression of drug disposition genes across the collaborative cross mouse strains. Front Genet 7:172. CrossRefPubMedPubMedCentralGoogle Scholar
  45. Nilson JH, Abbud RA, Keri RA, Quirk CC (2000) Chronic hypersecretion of luteinizing hormone in transgenic mice disrupts both ovarian and pituitary function, with some effects modified by the genetic background. Recent Prog Horm Res 55:69–89PubMedGoogle Scholar
  46. Peer CJ, Strope JD, Beedie S, Ley AM, Holly A, Calis K, Farkas R, Parepally J, Men A, Fadiran EO, Scott P, Jenkins M, Theodore WH, Sissung TM (2016) Alcohol and aldehyde dehydrogenases contribute to sex-related differences in clearance of Zolpidem in rats. Front Pharmacol 7:260. eCollection 2016. PMID: 27574509CrossRefPubMedPubMedCentralGoogle Scholar
  47. Prendergast BJ, Onishi KG, Zucker I (2014) Female mice liberated for inclusion in neuroscience and biomedical research. Neurosci Biobehav Rev 40:1–5. Epub 2014 Jan 20. PMID: 24456941CrossRefPubMedGoogle Scholar
  48. Regitz-Zagrosek V (2012) Sex and gender differences in health: Science & Society Series on sex and science. EMBO Rep 13(7):596–603. CrossRefPubMedPubMedCentralGoogle Scholar
  49. Sagvolden T, Johansen EB, Wøien G, Walaas SI, Storm-Mathisen J, Bergersen LH, Hvalby Ø, Jensen V, Aase H, Russell VA, Killeen PR, DasBanerjee T, Middleton F, Faraone SV (2009) The spontaneously hypertensive rat model of ADHD – the importance of selecting the appropriate reference strain. Neuropharmacology 57(7-8):619–626CrossRefGoogle Scholar
  50. Shapiro BH, Agrawal AK, Pampori NA (1995) Gender differences in drug metabolism regulated by growth hormone. Int J Biochem Cell Biol 27(1):9–20. Review. PMID: 7757886CrossRefGoogle Scholar
  51. Simon MM, Greenaway S, White JK, Fuchs H, Gailus-Durner V, Wells S, Sorg T, Wong K, Bedu E, Cartwright EJ, Dacquin R, Djebali S, Estabel J, Graw J, Ingham NJ, Jackson IJ, Lengeling A, Mandillo S, Marvel J, Meziane H, Preitner F, Puk O, Roux M, Adams DJ, Atkins S, Ayadi A, Becker L, Blake A, Brooker D, Cater H, Champy MF, Combe R, Danecek P, di Fenza A, Gates H, Gerdin AK, Golini E, Hancock JM, Hans W, Hölter SM, Hough T, Jurdic P, Keane TM, Morgan H, Müller W, Neff F, Nicholson G, Pasche B, Roberson LA, Rozman J, Sanderson M, Santos L, Selloum M, Shannon C, Southwell A, Tocchini-Valentini GP, Vancollie VE, Westerberg H, Wurst W, Zi M, Yalcin B, Ramirez-Solis R, Steel KP, Mallon AM, de Angelis MH, Herault Y, Brown SD (2013) A comparative phenotypic and genomic analysis of C57BL/6J and C57BL/6N mouse strains. Genome Biol 14(7):R82. CrossRefPubMedPubMedCentralGoogle Scholar
  52. Sorge RE, LaCroix-Fralish ML, Tuttle AH et al (2011) Spinal cord toll-like receptor 4 mediates inflammatory and neuropathic hypersensitivity in male but not female mice. J Neurosci 31(43):15450–15454. CrossRefPubMedPubMedCentralGoogle Scholar
  53. Srivastava A, Morgan AP, Najarian ML et al (2017) Genomes of the mouse collaborative cross. Genetics 206(2):537–556. CrossRefPubMedPubMedCentralGoogle Scholar
  54. St Lezin E, Simonet L, Pravenec M, Kurtz TW (1992) Hypertensive strains and normotensive ‘control’ strains. How closely are they related? Hypertension 19:419–424CrossRefGoogle Scholar
  55. Summers deLuca LE, Pikor NB, O’Leary J et al (2010) Substrain differences reveal novel disease-modifying gene candidates that alter the clinical course of a rodent model of multiple sclerosis. J Immunol 184(6):3174–3185. CrossRefGoogle Scholar
  56. Sundberg JP, Schofield PN (2010) Mouse genetic nomenclature: standardization of strain, gene, and protein symbols. Vet Pathol 47(6):1100–1104. CrossRefPubMedPubMedCentralGoogle Scholar
  57. United States Food and Drug Agency (2018) Drug safety communication: risk of next-morning impairment after use of insomnia drugs; FDA requires lower recommended doses for certain drugs containing zolpidem (Ambien, Ambien CR, Edluar, and Zolpimist). Accessed 23 Sept 2018
  58. Voelkl B, Vogt L, Sena ES, Würbel H (2018) Reproducibility of preclinical animal research improves with heterogeneity of study samples. PLoS Biol 16(2):e2003693. CrossRefPubMedPubMedCentralGoogle Scholar
  59. Watson JL, Ryan L, Silverberg N, Cahan V, Bernard MA (2014) Obstacles and opportunities in Alzheimer’s clinical trial recruitment. Health Aff (Millwood) 33(4):574–579. CrossRefGoogle Scholar
  60. Wautheir V, Waxman DJ (2008) Sex-specific early growth hormone response genes in rat liver. Mol Endocrinol 22(8):1962–1974Google Scholar
  61. Waxman DJ, Holloway MG (2009) Sex differences in the expression of hepatic drug metabolizing enzymes. Mol Pharmacol 76(2):215–228. CrossRefPubMedPubMedCentralGoogle Scholar
  62. Wolbold R, Klein K, Burk O, Nüssler AK, Neuhaus P, Eichelbaum M, Schwab M, Zanger UM (2003) Sex is a major determinant of CYP3A4 expression in human liver. Hepatology 38(4):978–988. PMID: 14512885CrossRefGoogle Scholar
  63. Würbel H (2000) Behaviour and the standardization fallacy. Nat Genet 26:263CrossRefGoogle Scholar
  64. Würbel H (2002) Behavioral phenotyping enhanced—beyond (environmental standardization). Genes Brain Behav 1:3–8CrossRefGoogle Scholar
  65. Yang X, Schadt EE, Wang S, Wang H, Arnold AP, Ingram-Drake L, Drake TA, Lusis AJ (2006) Tissue-specific expression and regulation of sexually dimorphic genes in mice. Genome Res 16(8):995–1004. Epub 2006 Jul 6. PMID: 16825664; PMCID: PMC1524872CrossRefGoogle Scholar

Copyright information

© The Author(s) 2019

Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence and indicate if changes were made.

The images or other third party material in this chapter are included in the chapter's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

Authors and Affiliations

  1. 1.University of Pittsburgh School of MedicinePittsburghUSA
  2. 2.Sage TherapeuticsCambridgeUSA
  3. 3.Indiana University School of MedicineIndianapolisUSA

Personalised recommendations