Skip to main content

Data Processing: How Good Are My Data Really?

  • Conference paper
  • First Online:
Advancing Methods for Biomolecular Crystallography

Abstract

Since its inception more than 60 years ago, a “reliability index”, later called “R-value”, has been used to measure the agreement of model and averaged data, and a similar quantity, Rmerge, has been defined to assess the quality of the averaged data.

However, a little known fact is that the two kinds of R-values have very different properties and asymptotic behaviors, and cannot be compared with each other. This is the reason that decisions concerning the high-resolution cutoff of data that are based on these R-values are questionable, and also helps explain why disagreements between journal authors and the manuscript reviewers have been so common.

Here, the authors will show that a different statistic can be used to overcome these deficiencies, and will establish a direct quantitative relation between data and model quality. This relation is important to judge the extent to which the data are useful, and also gives insight into the quality of the model that is derived from the data. The theoretical and practical consequences are at variance with several commonly employed crystallographic concepts and procedures.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Arndt UW, Crowther RA, Mallett JFW (1968) A computer-linked cathode-ray tube microdensitometer for X-ray crystallography. J Phys E Sci Instrum 1:510

    Article  CAS  Google Scholar 

  2. Diederichs K, Karplus PA (1997) Improved R-factors for diffraction data analysis in macromolecular crystallography. Nature Struct Biol 4:269–275

    Article  CAS  Google Scholar 

  3. Evans P (2011) An introduction to data reduction: space-group determination, scaling and intensity statistics. Acta Crystallogr D 67:282–292

    Article  Google Scholar 

  4. Karplus PA, Diederichs K (2012) Linking crystallographic model and data quality. Science 336:1030–1033

    Article  CAS  Google Scholar 

  5. Rupp B (2009) Biomolecular crystallography: principles, practice, and application to structural biology. Garland Science, New York

    Google Scholar 

  6. Schneider TR, Sheldrick GM (2002) Substructure solution with SHELXD. Acta Crystallogr D 58:1772–1779

    Article  Google Scholar 

  7. Simmons CR, Liu Q, Huang Q, Hao Q, Begley TP, Karplus PA, Stipanuk MH (2006) Crystal structure of Mammalian cysteine dioxygenase. J Biol Chem 281:18723

    Article  CAS  Google Scholar 

  8. Simmons CR, Krishnamoorthy K, Granett SL, Schuller DJ, Dominy JE Jr, Begley TP, Stipanuk MH, Karplus PA (2008) A putative Fe2+-bound persulfenate intermediate in cysteine dioxygenase. Biochemistry 47:11390

    Article  CAS  Google Scholar 

  9. Usón I, Stevenson CE, Lawson DM, Sheldrick GM (2007) Structure determination of the O-methyltransferase NovP using the ‘free lunch algorithm’ as implemented in SHELXE. Acta Crystallogr D 63:1069–1074

    Article  Google Scholar 

  10. Weiss MS (2001) Global indicators of X-ray data quality. J Appl Crystallogr 34:130–135

    Article  CAS  Google Scholar 

  11. Weiss MS, Metzner HJ, Hilgenfeld R (1998) Two non-proline cis peptide bonds may be important for factor XIII function. FEBS Lett 423:291–296

    Article  CAS  Google Scholar 

  12. Wilson AJC (1950) Largest likely values for the reliability index. Acta Crystallogr 3:397–398

    Article  Google Scholar 

  13. Wlodawer A, Minor W, Dauter Z, Jaskolski M (2008) Protein crystallography for non-crystallographers, or how to get the best (but not more) from published macromolecular structures. FEBS J 275:1–21

    Article  CAS  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kay Diederichs .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer Science+Business Media Dordrecht

About this paper

Cite this paper

Diederichs, K., Karplus, P.A. (2013). Data Processing: How Good Are My Data Really?. In: Read, R., Urzhumtsev, A., Lunin, V. (eds) Advancing Methods for Biomolecular Crystallography. NATO Science for Peace and Security Series A: Chemistry and Biology. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-6232-9_6

Download citation

Publish with us

Policies and ethics