Skip to main content

The Deeper Roles of Mathematics in Physical Laws

  • Chapter
  • First Online:
Trick or Truth?

Part of the book series: The Frontiers Collection ((FRONTCOLL))

Abstract

Many have wondered how mathematics, which appears to be the result of both human creativity and human discovery, can possibly exhibit the degree of success and seemingly-universal applicability to quantifying the physical world as exemplified by the laws of physics. In this essay, I claim that much of the utility of mathematics arises from our choice of description of the physical world coupled with our desire to quantify it. This will be demonstrated in a practical sense by considering one of the most fundamental concepts of mathematics: additivity. This example will be used to show how many physical laws can be derived as constraint equations enforcing relevant symmetries in a sense that is far more fundamental than commonly appreciated.

Familiarity breeds the illusion of understanding

–Anonymous

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 49.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 64.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 89.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    In preparing for this essay, I was pleased to find that Hamming had posed a similar question: “I have tried, with little success, to get some of my friends to understand my amazement that the abstraction of integers for counting is both possible and useful. Is it not remarkable that 6 sheep plus 7 sheep make 13 sheep; that 6 stones plus 7 stones make 13 stones? Is it not a miracle that the universe is so constructed that such a simple abstraction as a number is possible? To me this is one of the strongest examples of the unreasonable effectiveness of mathematics. Indeed, l find it both strange and unexplainable” [17].

  2. 2.

    Einstein expressed a similar sentiment about particle physics: “I would just like to know what an electron is.”

  3. 3.

    This is especially difficult since additivity of set measure is assumed by mathematicians. That is where they decided to start. Where else would one start? This highlights one of the subtle and insidious difficulties faced by those in foundational studies. Once it is decided to adopt an assumption as a foundational construct it precludes the ability to delve deeper within that framework. Furthermore, it discourages others from doing so through what I call “The Curse of Familiarity” where one’s familiarity with a problem fosters an illusion of understanding that blinds one from seeing subtle clues, hints, connections and/or difficulties. The parable of Newton and the apple is an example where Newton momentarily saw through the familiarity of falling apples to realize a connection with the falling of the Moon about the Earth.

  4. 4.

    Finite sets are sufficient for our purposes here as we are not attempting to model an infinite world.

  5. 5.

    It would be better to call it a heterarchy since sets in general cannot be linearly ordered.

  6. 6.

    Since the goal is to rank elements via quantification (mapping elements to a total order), it may be helpful to formalize the desired systematic preservation of inequality by making explicit the assumption of cancellativity where for disjoint elements x, y, and z, where \(v(x) \le v(y) \le v(z)\), we have \(v(x) \oplus v(z) \le v(y) \oplus v(z)\), which is implicit in any generally-useful (non-degenerate) notion of ranking.

  7. 7.

    Do we select a particular concept of ordering? Of course we do; an example is given in Fig. 2 where the counting numbers are ordered in two different ways. Selecting one way of ordering gives you one set of laws (min and max) and selecting the other gives you another set of laws (GCD and LCM). The entire of field of number theory results as an attempt to study relationships between these two resulting sets of laws.

  8. 8.

    The entropies appearing in the definition of mutual information derive from probabilities, yet no one insists that probability theory is an exotic form of information theory. Entropy and probability are related in a very specific way with probabilities being used to compute entropies. Similarly, quantum amplitudes are related to probabilities in a very specific way with amplitudes being used to compute probabilities via the Born Rule.

References

  1. Aczél, J., Lectures on Functional Equations and Their Applications, Academic Press, New York, 1966.

    MATH  Google Scholar 

  2. Birkhoff, Lattice Theory, American Mathematical Society, Providence, 1967.

    MATH  Google Scholar 

  3. Cover, T. M., Thomas, J. A. 2012. Elements of information theory. John Wiley & Sons.

    Google Scholar 

  4. Craigen, R., Páles, Z. 1989. The associativity equation revisited. Aequationes Mathematicae, 37(2–3), 306–312.

    Article  MATH  Google Scholar 

  5. Davey, B. A., and Priestley, H. A., Introduction to Lattices and Order, Cambridge Univ. Press, Cambridge, 2002.

    Book  MATH  Google Scholar 

  6. Jaynes, E. T. 1959, Probability Theory in Science and Engineering, No. 4 in Colloquium Lectures in Pure and Applied Science, Socony-Mobil Oil Co. USA.

    Google Scholar 

  7. Klain, D. A., and Rota, G.-C., Introduction to Geometric Probability, Cambridge Univ. Press, Cambridge, 1997.

    MATH  Google Scholar 

  8. Knuth K.H. 2003. Deriving laws from ordering relations. In: G.J. Erickson, Y. Zhai (eds.), Bayesian Inference and Maximum Entropy Methods in Science and Engineering, Jackson Hole WY 2003, AIP Conference Proceedings 707, American Institute of Physics, Melville NY, pp. 204–235. arXiv:physics/0403031 [physics.data-an].

  9. Knuth K.H. 2010. Information physics: The new frontier. P. Bessiere, J.-F. Bercher, A. Mohammad-Djafari (eds.) Bayesian Inference and Maximum Entropy Methods in Science and Engineering, Chamonix, France, 2010, AIP Conference Proceedings 1305, American Institute of Physics, Melville NY, 3–19. arXiv:1009.5161v1 [math-ph].

  10. Knuth K.H., Skilling J. 2012. Foundations of Inference. Axioms 1:38–73, doi:10.3390/axioms1010038, arXiv:1008.4831 [math.PR].

    Google Scholar 

  11. Knuth K.H. 2014. Information-based physics: an observer-centric foundation. Contemporary Physics, 55(1), 12–32, (Invited Submission). doi:10.1080/00107514.2013.853426. arXiv:1310.1667 [quant-ph]

    Google Scholar 

  12. Knuth K.H., Bahreyni N. 2014. A potential foundation for emergent space-time, Journal of Mathematical Physics, 55, 112501. doi:10.1063/1.4899081, arXiv:1209.0881 [math-ph].

    Google Scholar 

  13. Knuth K.H. 2015. Information-based physics and the influence network. In: It from Bit or Bit from It? On Physics and Information, Springer Frontiers Collection, Springer-Verlag, Heidelberg pp. 65–78. FQXi 2013 Essay Contest Entry (Third Place Prize). arXiv:1308.3337 [quant-ph].

    Google Scholar 

  14. Goyal, P. 2015. Informational approach to the quantum symmetrization postulate. New Journal of Physics, 17(1), 013043. arXiv:1309.0478 [quant-ph].

    Google Scholar 

  15. Goyal P., Knuth K.H. 2011. Quantum theory and probability theory: their relationship and origin in symmetry, Symmetry 3(2):171–206. doi:10.3390/sym3020171

    Google Scholar 

  16. Goyal P., Knuth K.H., Skilling J. 2010. Origin of complex quantum amplitudes and Feynman’s rules, Physical Review A 81, 022109. arXiv:0907.0909v3 [quant-ph].

  17. Hamming, R. W. 1980. The unreasonable effectiveness of mathematics. American Mathematical Monthly, 81–90.

    Google Scholar 

  18. Pölya, G., and Szegö, G., Aufgaben und Lehrsätze aus der Analysis, 2 vols., Springer, Berlin, 1964, 3rd ed.

    Google Scholar 

  19. Sorkin, R.D. 1994. Quantum mechanics as quantum measure theory, Mod. Phys. Lett. A9 (1994) 3119–3128.

    Google Scholar 

  20. Wigner, E. 1960. The Unreasonable Effectiveness of Mathematics in the Natural Sciences, in Communications in Pure and Applied Mathematics, vol. 13, No. I (February 1960). New York: John Wiley & Sons Inc.

    Google Scholar 

Download references

Acknowledgments

I would like to thank John Skilling, David Hestenes, and Rob MacDuff for their insightful comments and ongoing discussions on foundations. I would also like to thank James Walsh for his careful proofreading, and Bertrand Carado and Yuchao Ma for their critical reading of the manuscript.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kevin H. Knuth .

Editor information

Editors and Affiliations

Technical Endnotes

Technical Endnotes

This technical section provides the formal definitions of a partially-ordered set (poset) and a lattice as well as short derivation of the generalized sum rule for non-exclusive join.

A partially-ordered set (poset) P is a set of elements S along with a binary ordering relation, generically denoted \(\le \), postulated to have the following properties for elements \(x, y, z \subseteq P\)

$$\begin{aligned} \begin{array}{lll} &{}\mathrm{For}\,\mathrm{all}\,\, x, x\le x &{}\qquad \qquad \qquad \mathrm{Reflexivity}\\ &{}\mathrm{If}\,\, x \le y \,\, \mathrm{and} \,\, y \le x, \,\, x = y &{}\qquad \qquad \qquad \mathrm{Antisymmetry}\\ &{}\mathrm{If}\,\, x \le y \,\, \mathrm{and}\,\, y \le z, \,\, x\le z &{}\qquad \qquad \qquad \mathrm{Transitivity} \end{array} \end{aligned}$$

A poset \(P = (S,\le )\) is referred to as a partially ordered set since not all elements are assumed to be comparable. That is, there may exist elements \(x, y \subseteq P\) where it is neither true that \(x \le y\) nor that \(y \le x\). In these situations, we say that x and y are incomparable, which is denoted \(x\, \Vert \, y\).

A lattice L is a poset where each pair of elements has a supremum or least upper bound (LUB) called the join, and an infimum or greatest lower bound (GLB) called the meet. The meet of two elements \(x,y \in L\) is denoted \(x \wedge y\) and the join is denoted \(x\vee y\). The meet and the join can be thought of as algebraic operators that take two lattice elements to a third lattice element. It is in this sense that every lattice is an algebra. The meet and join are assumed to obey the following relations

$$\begin{aligned} \begin{array}{lll} &{}x \vee y = y \vee x &{}\qquad \qquad \qquad {\mathrm{Commutativity}}\\ &{}x \wedge y = y \wedge x \\ &{}x \vee (y \vee z) = (x \vee y) \vee z &{}\qquad \qquad \qquad {\mathrm{Associativity}} \\ &{}x \wedge (y \wedge z) = (x \wedge y) \wedge z \\ &{}x \vee (x \wedge y) = x &{}\qquad \qquad \qquad \mathrm{Absorption}\\ &{}x \wedge (x \vee y) = x \end{array} \end{aligned}$$

Lattice elements (and poset elements in general) can be quantified by assigning a real number (or more generally a set of real numbers) to each element. This is performed via a function v called a valuation, which takes each lattice element to a real number: \(v:x \in L \rightarrow \mathbb {R}\).

Fig. 3
figure 3

(Left) An illustration of the join, \(x \vee y\), of two disjoint elements x and y resulting in an additive measure. Disjoint elements have a null meet so that technically this structure is known as a join semi-lattice. (Right) The situation considered in the derivation of the Sum Rule, which applies to general cases

Valuations are meant to encode the ordering of elements in the lattice and this is accomplished by insisting that for \(x\le y\) we have that \(v(x) \le v(y)\). Furthermore, if the valuation is to encode the relationships among the elements, it must be that the valuation \(v(x \vee y)\) assigned to the join of two disjoint elements x and y (Fig. 3, left) can be expressed as a function of the valuations v(x) and v(y) assigned to those two elements. We write this as

$$\begin{aligned} v(x \vee y) = v(x) \oplus v(y). \end{aligned}$$

where the operator \(\oplus \) is to be determined. The concept of a generally-useful quantification by valuation (non-degenerate ranking) implies that the operator \(\oplus \) obeys a cancellativity property for disjoint elements x, y, and z, where for \(v(x) \le v(y) \le v(z)\) we have \(v(x) \oplus v(z) \le v(y) \oplus v(z)\). Commutativity of the lattice join requires that the operator \(\oplus \) is commutative, and associativity of the lattice join requires that the operator \(\oplus \) is associative. The associative relationship represents a functional equation, known as the Associativity Equation, for the operator \(\oplus \), whose solution is known to be an invertible transform of additivity [1, 4, 10], which can be written as

$$\begin{aligned} a\oplus b = f^{-1}(f(a) + f(b)), \end{aligned}$$

where the function f is an arbitrary invertible function. In terms of the valuations this is

$$\begin{aligned} f(v(x) \oplus v(y)) = f(v(x)) + f(v(y)) \end{aligned}$$

which is

$$\begin{aligned} f(v(x \vee y)) = f(v(x)) + f(v(y)). \end{aligned}$$

This suggests that one can always choose a simpler quantification than the valuations v by instead assigning values u(x) defined by \(u(x) = f(v(x))\) so that \(\oplus \) transforms to simple addition for disjoint x and y:

$$\begin{aligned} u(x \vee y) = u(x) + u(y). \end{aligned}$$

Recall that this result holds only for disjoint elements (in a join semi-lattice). We now derive the result for two lattice elements in general. Consider the elements \(x \wedge y\) and z illustrated in Fig. 3 (right). Since their join is y, we have that

$$\begin{aligned} u(y) = u(x \wedge y) + u(z). \qquad (\mathrm{A1}) \end{aligned}$$

Next consider that elements x and z are disjoint and their join is \(x \vee y\). This allows us to write

$$\begin{aligned} u(x \vee y) = u(x) + u(z). \qquad (\mathrm{A2}) \end{aligned}$$

Solving (A1) for u(z) and substituting into (A2) we have the Sum Rule

$$\begin{aligned} u(x \vee y) = u(x) + u(y) - u(x \wedge y), \qquad {\mathrm{Sum}\, \mathrm{Rule}} \end{aligned}$$

which holds for general elements x and y [810].

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Knuth, K.H. (2016). The Deeper Roles of Mathematics in Physical Laws. In: Aguirre, A., Foster, B., Merali, Z. (eds) Trick or Truth?. The Frontiers Collection. Springer, Cham. https://doi.org/10.1007/978-3-319-27495-9_7

Download citation

Publish with us

Policies and ethics