World Library  
Flag as Inappropriate
Email this Article

Occam's razor

Article Id: WHEBN0000036797
Reproduction Date:

Title: Occam's razor  
Author: World Heritage Encyclopedia
Language: English
Subject: William of Ockham, Philosophy of religion, Scientific method, Problem of evil, Hanlon's razor
Publisher: World Heritage Encyclopedia

Occam's razor

The motions of the sun, moon and other solar system planets can be calculated using a geocentric model (the earth is at the center) or using a heliocentric model (the sun is at the center). Both work, but the geocentric system requires many more assumptions than the heliocentric system, which has only seven. This was pointed out in a preface to Copernicus' first edition of De revolutionibus orbium coelestium.

Occam's razor (also written as Ockham's razor and in Latin lex parsimoniae) is a problem-solving principle devised by William of Ockham (c. 1287–1347), who was an English Franciscan friar and scholastic philosopher and theologian. The principle states that among competing hypotheses, the one with the fewest assumptions should be selected. Other, more complicated solutions may ultimately prove correct, but—in the absence of certainty—the fewer assumptions that are made, the better.

The application of the principle can be used to shift the burden of proof in a discussion. However, Alan Baker, who suggests this in the online Stanford Encyclopedia of Philosophy, is careful to point out that his suggestion should not be taken generally, but only as it applies in a particular context, that is: philosophers who argue in opposition to metaphysical theories that involve allegedly “superfluous ontological apparatus”.[1] Baker then notices that principles, including Occam’s Razor, are often expressed in a way that is not clear regarding which facet of “simplicity” — parsimony or elegance — is being referred to, and that in a hypothetical formulation the facets of simplicity may work in different directions: a simpler description may refer to a more complex hypothesis, and a more complex description may refer to a simpler hypothesis.[2]

Solomonoff's theory of inductive inference is a mathematically formalized Occam's Razor:[2][3][4][5][6][7] Shorter computable theories have more weight when calculating the probability of the next observation, using all computable theories which perfectly describe previous observations.

In science, Occam's Razor is used as a heuristic (discovery tool) to guide scientists in the development of theoretical models rather than as an arbiter between published models.[8][9] In the scientific method, Occam's Razor is not considered an irrefutable principle of logic or a scientific result; the preference for simplicity in the scientific method is based on the falsifiability criterion. For each accepted explanation of a phenomenon, there is always an infinite number of possible and more complex alternatives, because one can always burden failing explanations with ad hoc hypothesis to prevent them from being falsified; therefore, simpler theories are preferable to more complex ones because they are better testable and falsifiable.[1][10][11]


The term "Occam's Razor" first appeared in 1852 in the works of Sir William Hamilton, 9th Baronet (1788–1856), centuries after William of Ockham's death in 1347.[12] Ockham did not invent this "razor"; its association with him may be due to the frequency and effectiveness with which he used it (Ariew 1976). Ockham stated the principle in various ways, but the most popular version "entities must not be multiplied beyond necessity" was written by John Punch from Cork in 1639 (Meyer 1957).[13]

Formulations before Ockham

Part of a page from Duns Scotus' book Ordinatio: "Pluralitas non est ponenda sine necessitate", i.e., "Plurality is not to be posited without necessity"

The origins of what has come to be known as Occam's Razor are traceable to the works of earlier philosophers such as John Duns Scotus (1265–1308), Robert Grosseteste (1175-1253), Maimonides (Moses ben-Maimon, 1138–1204), and even Aristotle (384–322 BC).[14][15] Aristotle writes in his Posterior Analytics, "we may assume the superiority ceteris paribus [all things being equal] of the demonstration which derives from fewer postulates or hypotheses."[16] Ptolemy (c. AD 90 – c. AD 168) stated, "We consider it a good principle to explain the phenomena by the simplest hypothesis possible."[17]

Phrases such as "It is vain to do with more what can be done with fewer" and "A plurality is not to be posited without necessity" were commonplace in 13th-century scholastic writing.[17] Robert Grosseteste, in Commentary on [Aristotle's] the Posterior Analytics Books (Commentarius in Posteriorum Analyticorum Libros) (c. 1217–1220), declares: "That is better and more valuable which requires fewer, other circumstances being equal... For if one thing were demonstrated from many and another thing from fewer equally known premises, clearly that is better which is from fewer because it makes us know quickly, just as a universal demonstration is better than particular because it produces knowledge from fewer premises. Similarly in natural science, in moral science, and in metaphysics the best is that which needs no premises and the better that which needs the fewer, other circumstances being equal."[18] The Summa Theologica of Thomas Aquinas (1225–1274) states that "it is superfluous to suppose that what can be accounted for by a few principles has been produced by many". Aquinas uses this principle to construct an objection to God's existence, an objection that he in turn answers and refutes generally (cf. quinque viae), and specifically, through an argument based on causality.[19] Hence, Aquinas acknowledges the principle that today is known as Occam's Razor, but prefers causal explanations to other simple explanations (cf. also Correlation does not imply causation).

Madhva in verse 400 of his Vishnu-Tattva-Nirnaya says: "dvidhAkalpane kalpanAgauravamiti" (To make two suppositions when one is enough is to err by way of excessive supposition).


William of Ockham (c. 1287–1347) was an English Franciscan friar and theologian, an influential medieval philosopher and a nominalist. His popular fame as a great logician rests chiefly on the maxim attributed to him and known as Ockham's razor. The term razor refers to distinguishing between two hypotheses either by "shaving away" unnecessary assumptions or cutting apart two similar conclusions.

This maxim seems to represent the general tendency of Ockham's philosophy, but it has not been found in any of his writings.[20] His nearest pronouncement seems to be Numquam ponenda est pluralitas sine necessitate [Plurality must never be posited without necessity], which occurs in his theological work on the 'Sentences of Peter Lombard' (Quaestiones et decisiones in quattuor libros Sententiarum Petri Lombardi (ed. Lugd., 1495), i, dist. 27, qu. 2, K).

The words attributed to Ockham, entia non sunt multiplicanda praeter necessitatem (entities must not be multiplied beyond necessity), are absent in his extant works;[21] this particular phrasing owes more to John Punch.[22] Indeed, Ockham's contribution seems to be to restrict the operation of this principle in matters pertaining to miracles and God's power: so, in the Eucharist, a plurality of miracles is possible, simply because it pleases God.[17]

This principle is sometimes phrased as pluralitas non est ponenda sine necessitate ("plurality should not be posited without necessity").[23] In his Summa Totius Logicae, i. 12, Ockham cites the principle of economy, Frustra fit per plura quod potest fieri per pauciora [It is futile to do with more things that which can be done with fewer]. (Thorburn, 1918, pp. 352–3; Kneale and Kneale, 1962, p. 243.)

Later formulations

To quote Isaac Newton, "We are to admit no more causes of natural things than such as are both true and sufficient to explain their appearances. Therefore, to the same natural effects we must, so far as possible, assign the same causes."[24][25]

Bertrand Russell offers a particular version of Occam's Razor: "Whenever possible, substitute constructions out of known entities for inferences to unknown entities."[26]

Around 1960, Ray Solomonoff founded the theory of universal inductive inference, the theory of prediction based on observations; for example, predicting the next symbol based upon a given series of symbols. The only assumption is that the environment follows some unknown but computable probability distribution. This theory is a mathematical formalization of Occam's Razor.[2][3][4][5][27]

Another technical approach to Occam's Razor is ontological parsimony.[28]


Beginning in the 20th century, epistemological justifications based on induction, logic, pragmatism, and especially probability theory have become more popular among philosophers.


Prior to the 20th century, it was a commonly held belief that nature itself was simple and that simpler hypotheses about nature were thus more likely to be true. This notion was deeply rooted in the aesthetic value simplicity holds for human thought and the justifications presented for it often drew from theology. Thomas Aquinas made this argument in the 13th century, writing, "If a thing can be done adequately by means of one, it is superfluous to do it by means of several; for we observe that nature does not employ two instruments [if] one suffices."[29]


Occam's Razor has gained strong empirical support as far as helping to converge on better theories (see "Applications" section below for some examples).

In the related concept of overfitting, excessively complex models are affected by statistical noise (a problem also known as the bias-variance trade-off), whereas simpler models may capture the underlying structure better and may thus have better predictive performance. It is, however, often difficult to deduce which part of the data is noise (cf. model selection, test set, minimum description length, Bayesian inference, etc.).

Testing the razor

The razor's statement that "simpler explanations are, other things being equal, generally better than more complex ones" is amenable to empirical testing. Although, another interpretation of the razor's statement would be that "simpler hypotheses (not conclusions i.e. explanations) are generally better than the complex ones". The procedure to test the former interpretation would compare the track records of simple and comparatively complex explanations. If you accept the first interpretation, the validity of Occam's Razor as a tool would then have to be rejected if the more complex explanations were more often correct than the less complex ones (while the converse would lend support to its use). If the latter interpretation is accepted, the validity of Occam's Razor as a tool could possibly be accepted if the simpler hypotheses led to correct conclusions more often than not.

Possible explanations can become needlessly complex. It is coherent, for instance, to add the involvement of leprechauns to any explanation, but Occam's Razor would prevent such additions, unless they were necessary.

In the history of competing hypotheses, it is the case that the simpler hypotheses have led to mathematically rigorous and empirically verifiable theories. In the history of competing explanations this is not the case. At least, not generally (some increases in complexity are sometimes necessary), and so there remains a justified general bias towards the simpler of two competing explanations. To understand why, consider that, for each accepted explanation of a phenomenon, there is always an infinite number of possible, more complex, and ultimately incorrect alternatives. This is so because one can always burden failing explanations with ad hoc hypothesis. Ad hoc hypotheses are justifications that prevent theories from being falsified. Even other empirical criteria like consilience can never truly eliminate such explanations as competition. Each true explanation, then, may have had many alternatives that were simpler and false, but also an infinite number of alternatives that were more complex and false. However, if an alternate ad hoc hypothesis were indeed justifiable, its implicit conclusions would be empirically verifiable. On a commonly accepted repeatability principle, these alternate theories have never been observed and continue to not be observed. In addition, we do not say an explanation is true if it has not withstood this principle.

Put another way, any new, and even more complex theory can still possibly be true. For example: If an individual makes supernatural claims that Leprechauns were responsible for breaking a vase, the simpler explanation would be that he is mistaken, but ongoing ad hoc justifications (e.g. "And, that's not me on film, they tampered with that too") successfully prevent outright falsification. This endless supply of elaborate competing explanations, called saving hypotheses, cannot be ruled out—but by using Occam's Razor.[30][31][32]

Practical considerations and pragmatism

The common form of the razor, used to distinguish between equally explanatory hypotheses, may be supported by the practical fact that simpler theories are easier to understand.

Some argue that Occam's Razor is not an inference-driven model, but a heuristic maxim for choosing among other models and instead underlies induction.

Alternatively, if we want to have reasonable discussion we may be practically forced to accept Occam's Razor in the same way we are simply forced to accept the laws of thought and inductive reasoning (given the problem of induction). Philosopher Elliott Sober states that not even reason itself can be justified on any reasonable grounds, and that we must start with first principles of some kind (otherwise an infinite regress occurs).

The pragmatist may go on, as David Hume did on the topic of induction, that there is no satisfying alternative to granting this premise. Though one may claim that Occam's Razor is invalid as a premise helping to regulate theories, putting this doubt into practice would mean doubting whether every step forward will result in locomotion or a nuclear explosion. In other words still: "What's the alternative?"


One justification of Occam's Razor is a direct result of basic probability theory. By definition, all assumptions introduce possibilities for error; if an assumption does not improve the accuracy of a theory, its only effect is to increase the probability that the overall theory is wrong.

There have also been other attempts to derive Occam's Razor from probability theory, including notable attempts made by Harold Jeffreys and E. T. Jaynes. The probabilistic (Bayesian) basis for Occam's Razor is elaborated by David J. C. MacKay in chapter 28 of his book Information Theory, Inference, and Learning Algorithms,[33] where he emphasises that a prior bias in favour of simpler models is not required.

William H. Jefferys (no relation to Harold Jeffreys) and James O. Berger (1991) generalize and quantify the original formulation's "assumptions" concept as the degree to which a proposition is unnecessarily accommodating to possible observable data.[34] They state "a hypothesis with fewer adjustable parameters will automatically have an enhanced posterior probability, due to the fact that the predictions it makes are sharp".[34] The model they propose balances the precision of a theory's predictions against their sharpness; theories which sharply made their correct predictions are preferred over theories which would have accommodated a wide range of other possible results. This, again, reflects the mathematical relationship between key concepts in Bayesian inference (namely marginal probability, conditional probability, and posterior probability).

Other views

Karl Popper

Karl Popper argues that a preference for simple theories need not appeal to practical or aesthetic considerations. Our preference for simplicity may be justified by its falsifiability criterion: We prefer simpler theories to more complex ones "because their empirical content is greater; and because they are better testable" (Popper 1992). The idea here is that a simple theory applies to more cases than a more complex one, and is thus more easily falsifiable. This is again comparing a simple theory to a more complex theory where both explain the data equally well.

Elliott Sober

The philosopher of science Elliott Sober once argued along the same lines as Popper, tying simplicity with "informativeness": The simplest theory is the more informative one, in the sense that less information is required in order to answer one's questions.[35] He has since rejected this account of simplicity, purportedly because it fails to provide an epistemic justification for simplicity. He now expresses views to the effect that simplicity considerations (and considerations of parsimony in particular) do not count unless they reflect something more fundamental. Philosophers, he suggests, may have made the error of hypostatizing simplicity (i.e. endowed it with a sui generis existence), when it has meaning only when embedded in a specific context (Sober 1992). If we fail to justify simplicity considerations on the basis of the context in which we make use of them, we may have no non-circular justification: "just as the question 'why be rational?' may have no non-circular answer, the same may be true of the question 'why should simplicity be considered in evaluating the plausibility of hypotheses?'".[36]

Richard Swinburne

Richard Swinburne argues for simplicity on logical grounds:

... the simplest hypothesis proposed as an explanation of phenomena is more likely to be the true one than is any other available hypothesis, that its predictions are more likely to be true than those of any other available hypothesis, and that it is an ultimate a priori epistemic principle that simplicity is evidence for truth.
—Swinburne 1997

According to Swinburne, since our choice of theory cannot be determined by data (see Underdetermination and Quine-Duhem thesis), we must rely on some criterion to determine which theory to use. Since it is absurd to have no logical method by which to settle on one hypothesis amongst an infinite number of equally data-compliant hypotheses, we should choose the simplest theory: "either science is irrational [in the way it judges theories and predictions probable] or the principle of simplicity is a fundamental synthetic a priori truth" (Swinburne 1997).

Ludwig Wittgenstein

From the Tractatus Logico-Philosophicus:

  • 3.328 If a sign is not necessary then it is meaningless. That is the meaning of Occam's Razor.
(If everything in the symbolism works as though a sign had meaning, then it has meaning.)
  • 4.04 In the proposition there must be exactly as many things distinguishable as there are in the state of affairs which it represents. They must both possess the same logical (mathematical) multiplicity (cf. Hertz's Mechanics, on Dynamic Models).
  • 5.47321 Occam's Razor is, of course, not an arbitrary rule nor one justified by its practical success. It simply says that unnecessary elements in a symbolism mean nothing. Signs which serve one purpose are logically equivalent, signs which serve no purpose are logically meaningless.

and on the related concept of "simplicity":

  • 6.363 The procedure of induction consists in accepting as true the simplest law that can be reconciled with our experiences.


Science and the scientific method

In science, Occam's Razor is used as a heuristic (rule of thumb) to guide scientists in the development of theoretical models rather than as an arbiter between published models.[8][9] In physics, parsimony was an important heuristic in the formulation of special relativity by Albert Einstein,[37][38] the development and application of the principle of least action by Pierre Louis Maupertuis and Leonhard Euler,[39] and the development of quantum mechanics by Max Planck, Werner Heisenberg and Louis de Broglie.[9][40] In chemistry, Occam's Razor is often an important heuristic when developing a model of a reaction mechanism.[41][42] However, while it is useful as a heuristic in developing models of reaction mechanisms, it has been shown to fail as a criterion for selecting among some selected published models.[9] In this context, Einstein himself expressed caution when he formulated Einstein's Constraint: "It can scarcely be denied that the supreme goal of all theory is to make the irreducible basic elements as simple and as few as possible without having to surrender the adequate representation of a single datum of experience". An often-quoted version of this constraint (that cannot be verified as being posited by Einstein himself)[43] says "Everything should be kept as simple as possible, but no simpler."

In the scientific method, parsimony is an epistemological, metaphysical or heuristic preference, not an irrefutable principle of logic or a scientific result.[1][10][44] As a logical principle, Occam's Razor would demand that scientists accept the simplest possible theoretical explanation for existing data. However, science has shown repeatedly that future data often supports more complex theories than existing data. Science prefers the simplest explanation that is consistent with the data available at a given time, but the simplest explanation may be ruled out as new data become available.[8][10] That is, science is open to the possibility that future experiments might support more complex theories than demanded by current data and is more interested in designing experiments to discriminate between competing theories than favoring one theory over another based merely on philosophical principles.[1][10][11]

When scientists use the idea of parsimony, it only has meaning in a very specific context of inquiry. A number of background assumptions are required for parsimony to connect with plausibility in a particular research problem. The reasonableness of parsimony in one research context may have nothing to do with its reasonableness in another. It is a mistake to think that there is a single global principle that spans diverse subject matter.[11] It has been suggested that Occam's Razor is a widely accepted example of extraevidential consideration, even though it is entirely a metaphysical assumption. There is little empirical evidence that the world is actually simple or that simple accounts are more likely than complex ones to be true.[45] Most of the time, Occam's Razor is a conservative tool, cutting out crazy, complicated constructions and assuring that hypotheses are grounded in the science of the day, thus yielding "normal" science: models of explanation and prediction. There are, however, notable exceptions where Occam's Razor turns a conservative scientist into a reluctant revolutionary. For example, Max Planck interpolated between the Wien and Jeans radiation laws and used an Occam's Razor logic to formulate the quantum hypothesis, and even resisting that hypothesis as it became more obvious that it was correct.[9]

However, appeals to simplicity were used to argue against the phenomena of meteorites, ball lightning, continental drift, and reverse transcriptase. One can also argue for atomic building blocks for matter, because it provides a simpler explanation for the observed reversibility of both mixing and chemical reactions as simple separation and re-arrangements of the atomic building blocks. However, at the time, the atomic theory was considered more complex because it implied the existence of invisible particles which had not been directly detected. Ernst Mach and the logical positivists rejected the atomic theory of John Dalton, until the reality of atoms was more evident in Brownian motion, as shown by Albert Einstein.[46] In the same way, postulating the aether is more complex than transmission of light through a vacuum. However, at the time, all known waves propagated through a physical medium, and it seemed simpler to postulate the existence of a medium rather than theorize about wave propagation without a medium. Likewise, Newton's idea of light particles seemed simpler than Christiaan Huygens's idea of waves, so many favored it; however in this case, as it turned out, neither the wave- nor the particle-explanation alone suffices, since light behaves like waves as well as like particles.

Three axioms presupposed by the scientific method are realism (the existence of objective reality), the existence of natural laws, and the constancy of natural law. Rather than depend on provability of these axioms, science depends on the fact that they have not been objectively falsified. Occam's Razor and parsimony support, but do not prove these general axioms of science. The general principle of science is that theories (or models) of natural law must be consistent with repeatable experimental observations. This ultimate arbiter (selection criterion) rests upon the axioms mentioned above.[10]

There are examples where Occam's Razor would have picked the wrong theory given the available data. Simplicity principles are useful philosophical preferences for choosing a more likely theory from among several possibilities that are each consistent with available data. A single instance of Occam's Razor picking a wrong theory falsifies the razor as a general principle.[10] Michael Lee and others[47] provide cases where a parsimonious approach does not guarantee a correct conclusion and, if based on incorrect working hypotheses or interpretations of incomplete data, may even strongly support a false conclusion. He states, "When parsimony ceases to be a guideline and is instead elevated to an ex cathedra pronouncement, parsimony analysis ceases to be science."

If multiple models of natural law make exactly the same testable predictions, they are equivalent and there is no need for parsimony to choose one that is preferred. For example, Newtonian, Hamiltonian, and Lagrangian classical mechanics are equivalent. Physicists have no interest in using Occam's Razor to say the other two are wrong. Likewise, there is no demand for simplicity principles to arbitrate between wave and matrix formulations of quantum mechanics. Science often does not demand arbitration or selection criteria between models which make the same testable predictions.[10]


Biologists or philosophers of biology use Occam's Razor in either of two contexts both in

However, more recent biological analyses, such as Richard Dawkins' The Selfish Gene, have contended that Occam's view is not the simplest and most basic. Dawkins argues the way evolution works is that the genes propagated in most copies will end up determining the development of that particular species, i.e., natural selection turns out to select specific genes, and this is really the fundamental underlying principle, that automatically gives individual and group selection as emergent features of evolution.

Zoology provides an example. Muskoxen, when threatened by wolves, will form a circle with the males on the outside and the females and young on the inside. This is an example of a behavior by the males that seems to be altruistic. The behavior is disadvantageous to them individually but beneficial to the group as a whole and was thus seen by some to support the group selection theory.

However, a much better explanation immediately offers itself once one considers that natural selection works on genes. If the male musk ox runs off, leaving his offspring to the wolves, his genes will not be propagated. If however he takes up the fight his genes will live on in his offspring. And thus the "stay-and-fight" gene prevails. This is an example of kin selection. An underlying general principle thus offers a much simpler explanation, without retreating to special principles as group selection.

genealogy alone should determine classification and pheneticists contend that similarity over propinquity of descent is the determining criterion while evolutionary taxonomists say that both genealogy and similarity count in classification.[48]

It is among the cladists that Occam's Razor is to be found, although their term for it is cladistic parsimony. Cladistic parsimony (or maximum parsimony) is a method of phylogenetic inference in the construction of types of phylogenetic trees (more specifically, cladograms). Cladograms are branching, tree-like structures used to represent lines of descent based on one or more evolutionary changes. Cladistic parsimony is used to support the hypotheses that require the fewest evolutionary changes. For some types of tree, it will consistently produce the wrong results regardless of how much data is collected (this is called long branch attraction). For a full treatment of cladistic parsimony, see Elliott Sober's Reconstructing the Past: Parsimony, Evolution, and Inference (1988). For a discussion of both uses of Occam's Razor in biology, see Sober's article "Let's Razor Ockham's Razor" (1990).

Other methods for inferring evolutionary relationships use parsimony in a more traditional way. Likelihood methods for phylogeny use parsimony as they do for all likelihood tests, with hypotheses requiring few differing parameters (i.e., numbers of different rates of character change or different frequencies of character state transitions) being treated as null hypotheses relative to hypotheses requiring many differing parameters. Thus, complex hypotheses must predict data much better than do simple hypotheses before researchers reject the simple hypotheses. Recent advances employ information theory, a close cousin of likelihood, which uses Occam's Razor in the same way.

Francis Crick has commented on potential limitations of Occam's Razor in biology. He advances the argument that because biological systems are the products of (an on-going) natural selection, the mechanisms are not necessarily optimal in an obvious sense. He cautions: "While Ockham's razor is a useful tool in the physical sciences, it can be a very dangerous implement in biology. It is thus very rash to use simplicity and elegance as a guide in biological research."[49]

In organisms. Given the phylogenetic tree, ancestral migrations are inferred to be those that require the minimum amount of total movement.


When discussing Occam's Razor in contemporary medicine, doctors and philosophers of medicine speak of diagnostic parsimony. Diagnostic parsimony advocates that when diagnosing a given injury, ailment, illness, or disease a doctor should strive to look for the fewest possible causes that will account for all the symptoms. This philosophy is one of several demonstrated in the popular medical adage "when you hear hoofbeats behind you, think horses, not zebras". While diagnostic parsimony might often be beneficial, credence should also be given to the counter-argument modernly known as Hickam's dictum, which succinctly states that "patients can have as many diseases as they damn well please". It is often statistically more likely that a patient has several common diseases, rather than having a single rarer disease which explains their myriad symptoms. Also, independently of statistical likelihood, some patients do in fact turn out to have multiple diseases, which by common sense nullifies the approach of insisting to explain any given collection of symptoms with one disease. These misgivings emerge from simple probability theory—which is already taken into account in many modern variations of the razor—and from the fact that the loss function is much greater in medicine than in most of general science. Because misdiagnosis can result in the loss of a person's health and potentially life, it is considered better to test and pursue all reasonable theories even if there is some theory that appears the most likely.

Diagnostic parsimony and the counterbalance it finds in Hickam's dictum have very important implications in medical practice. Any set of symptoms could be indicative of a range of possible diseases and disease combinations; though at no point is a diagnosis rejected or accepted just on the basis of one disease appearing more likely than another, the continuous flow of hypothesis formulation, testing and modification benefits greatly from estimates regarding which diseases (or sets of diseases) are relatively more likely to be responsible for a set of symptoms, given the patient's environment, habits, medical history and so on. For example, if a hypothetical patient's immediately apparent symptoms include fatigue and cirrhosis and they test negative for Hepatitis C, their doctor might formulate a working hypothesis that the cirrhosis was caused by their drinking problem, and then seek symptoms and perform tests to formulate and rule out hypotheses as to what has been causing the fatigue; but if the doctor were to further discover that the patient's breath inexplicably smells of garlic and they are suffering from pulmonary edema, they might decide to test for the relatively rare condition of selenium poisoning.


In the philosophy of religion, Occam's Razor is sometimes applied to the existence of God. William of Ockham himself was a Christian. He believed in God, and in the authority of Scripture; he writes that "nothing ought to be posited without a reason given, unless it is self-evident (literally, known through itself) or known by experience or proved by the authority of Sacred Scripture."[50] In Ockham's view, an explanation has no sufficient basis in reality when it does not harmonize with reason, experience, or the Bible. However, unlike many theologians of his time, Ockham did not believe God could be logically proven with arguments. To Ockham, science was a matter of discovery, but theology was a matter of revelation and faith. He states: "only faith gives us access to theological truths. The ways of God are not open to reason, for God has freely chosen to create a world and establish a way of salvation within it apart from any necessary laws that human logic or rationality can uncover."[51]

St. Thomas Aquinas, in the Summa Theologica, uses a formulation of Occam's Razor to construct an objection to the idea that God exists, which he refutes directly with a counterargument:[52]

Further, it is superfluous to suppose that what can be accounted for by a few principles has been produced by many. But it seems that everything we see in the world can be accounted for by other principles, supposing God did not exist. For all natural things can be reduced to one principle which is nature; and all voluntary things can be reduced to one principle which is human reason, or will. Therefore there is no need to suppose God's existence.

In turn, Aquinas answers this with the quinque viae, and addresses the particular objection above with the following answer:

Since nature works for a determinate end under the direction of a higher agent, whatever is done by nature must needs be traced back to God, as to its first cause. So also whatever is done voluntarily must also be traced back to some higher cause other than human reason or will, since these can change or fail; for all things that are changeable and capable of defect must be traced back to an immovable and self-necessary first principle, as was shown in the body of the Article.

Rather than argue for the necessity of God, some theists consider their belief to be based on grounds independent of, or prior to, reason, making Occam's Razor irrelevant. This was the stance of Søren Kierkegaard, who viewed belief in God as a leap of faith which sometimes directly opposed reason.[53] This is also the same basic view of Clarkian Presuppositional apologetics, with the exception that Clark never thought the leap of faith was contrary to reason. (See also: Fideism).

There are various arguments in favour of God which establish God as a useful or even necessary assumption. Contrastinghly, atheists hold firmly to the belief that assuming the existence of God would introduce unnecessary complexity (Schmitt 2005, e.g. the Ultimate Boeing 747 gambit). Taking a nuanced position, philosopher Del Ratzsch[54] suggests that the application of the razor to God may not be so simple, least of all when we are comparing that hypothesis with theories postulating multiple invisible universes.[55]

Another application of the principle is to be found in the work of materialism, stating that matter was not required by his metaphysic and was thus eliminable. One potential problem with this view is that it's possible, given Berkeley's position, to find solipsism itself more in line with the razor than a God-mediated world beyond a single thinker.

In his article "Sensations and Brain Processes" (1959), J. J. C. Smart invoked Occam's Razor with the aim to justify his preference of the mind-brain identity theory over spirit-body dualism. Dualists state that there are two kinds of substances in the universe: physical (including the body) and spiritual, which is non-physical. In contrast, identity theorists state that everything is physical, including consciousness, and that there is nothing nonphysical. Despite the fact that it is impossible to appreciate the spiritual when limiting oneself to the physical, Smart maintained that identity theory explains all phenomena by assuming only a physical reality. Subsequently, Smart has been severely criticized for his (mis)use of Occam's Razor and ultimately retracted his advocacy of it in this context. Paul Churchland (1984) states that by itself Occam's Razor is inconclusive regarding duality. In a similar way, Dale Jacquette (1994) stated that Occam's Razor has been used in attempts to justify eliminativism and reductionism in the philosophy of mind. Eliminativism is the thesis that the ontology of folk psychology including such entities as "pain", "joy", "desire", "fear", etc., are eliminable in favor of an ontology of a completed neuroscience.

Penal ethics

In penal theory and the philosophy of punishment, parsimony refers specifically to taking care in the distribution of punishment in order to avoid excessive punishment. In the utilitarian approach to the philosophy of punishment, Jeremy Bentham's "parsimony principle" states that any punishment greater than is required to achieve its end is unjust. The concept is related but not identical to the legal concept of proportionality. Parsimony is a key consideration of the modern restorative justice, and is a component of utilitarian approaches to punishment, as well as the prison abolition movement. Bentham believed that true parsimony would require punishment to be individualised to take account of the sensibility of the individual—an individual more sensitive to punishment should be given a proportionately lesser one, since otherwise needless pain would be inflicted. Later utilitarian writers have tended to abandon this idea, in large part due to the impracticality of determining each alleged criminal's relative sensitivity to specific punishments.[56]

Probability theory and statistics

Marcus Hutter's universal artificial intelligence builds upon Solomonoff's mathematical formalization of the razor to calculate the expected value of an action.

There are various papers in scholarly journals deriving formal versions of Occam's Razor from probability theory, applying it in statistical inference, and using it to come up with criteria for penalizing complexity in statistical inference. Recent papers have suggested a connection between Occam's Razor and Kolmogorov complexity.[57]

One of the problems with the original formulation of the razor is that it only applies to models with the same explanatory power (i.e. it only tells us to prefer the simplest of equally good models). A more general form of the razor can be derived from Bayesian model comparison, which is based on Bayes factors and can be used to compare models that don't fit the data equally well. These methods can sometimes optimally balance the complexity and power of a model. Generally the exact Occam factor is intractable but approximations such as Akaike information criterion, Bayesian information criterion, Variational Bayesian methods, false discovery rate, and Laplace's method are used. Many artificial intelligence researchers are now employing such techniques.

The statistical view leads to a more rigorous formulation of the razor than that which came of previous philosophical discussions. In particular, it shows that "simplicity" must first be defined in some way before the razor may be used, and that this definition will always be subjective. For example, in the Kolmogorov-Chaitin minimum description length approach, the subject must pick a Turing machine whose operations describe the basic operations believed to represent "simplicity" by the subject. However, one could always choose a Turing machine with a simple operation that happened to construct one's entire theory and would hence score highly under the razor. This has led to two opposing views of the objectivity of Occam's Razor.

Objective razor

The minimum instruction set of a universal Turing machine requires approximately the same length description across different formulations, and is small compared to the Kolmogorov complexity of most practical theories. Marcus Hutter has used this consistency to define a "natural" Turing machine of small size as the proper basis for excluding arbitrarily complex instruction sets in the formulation of razors.[58] Describing the program for the universal program as the "hypothesis", and the representation of the evidence as program data, it has been formally proven under Zermelo–Fraenkel set theory that "the sum of the log universal probability of the model plus the log of the probability of the data given the model should be minimized."[59] Interpreting this as minimising the total length of a two-part message encoding model followed by data given model gives us the minimum message length (MML) principle.[60][61]

One possible conclusion from mixing the concepts of Kolmogorov complexity and Occam's Razor is that an ideal data compressor would also be a scientific explanation/formulation generator. Some attempts have been made to re-derive known laws from considerations of simplicity or compressibility.[62][63]

According to Jürgen Schmidhuber, the appropriate mathematical theory of Occam's Razor already exists, namely, Solomonoff's theory of optimal inductive inference[64] and its extensions.[65] See discussions in David L. Dowe's "Foreword re C. S. Wallace"[66] for the subtle distinctions between the algorithmic probability work of Solomonoff and the MML work of Chris Wallace, and see Dowe's "MML, hybrid Bayesian network graphical models, statistical consistency, invariance and uniqueness"[67] both for such discussions and for (in section 4) discussions of MML and Occam's Razor. For a specific example of MML as Occam's Razor in the problem of decision tree induction, see Dowe and Needham's "Message Length as an Effective Ockham's Razor in Decision Tree Induction".[68]

Controversial aspects of the razor

Occam's Razor is not an embargo against the positing of any kind of entity, or a recommendation of the simplest theory come what may.[3] Occam's Razor is used to adjudicate between theories that have already passed "theoretical scrutiny" tests, and which are equally well-supported by the evidence.[4] Furthermore, it may be used to prioritize empirical testing between two equally plausible but unequally testable hypotheses; thereby minimizing costs and wastes while increasing chances of falsification of the simpler-to-test hypothesis.

Another contentious aspect of the razor is that a theory can become more complex in terms of its structure (or syntax), while its ontology (or semantics) becomes simpler, or vice versa.[5] Quine, in a discussion on definition, referred to these two perspectives as "economy of practical expression" and "economy in grammar and vocabulary", respectively.[70] The theory of relativity is often given as an example of the proliferation of complex words to describe a simple concept.

Galileo Galilei lampooned the misuse of Occam's Razor in his Dialogue. The principle is represented in the dialogue by Simplicio. The telling point that Galileo presented ironically was that if you really wanted to start from a small number of entities, you could always consider the letters of the alphabet as the fundamental entities, since you could construct the whole of human knowledge out of them.


Occam's Razor has met some opposition from people who have considered it too extreme or rash. Walter Chatton (c. 1290–1343) was a contemporary of William of Ockham (c. 1287–1347) who took exception to Occam's Razor and Ockham's use of it. In response he devised his own anti-razor: "If three things are not enough to verify an affirmative proposition about things, a fourth must be added, and so on." Although there have been a number of philosophers who have formulated similar anti-razors since Chatton's time, no one anti-razor has perpetuated in as much notability as Chatton's anti-razor, although this could be the case of the Late Renaissance Italian motto of unknown attribution Se non è vero, è ben trovato ("Even if it is not true, it is well conceived") when referred to a particularly artful explanation. For further information, see "Ockham's Razor and Chatton's Anti-Razor" (1984) by Armand Maurer.

Anti-razors have also been created by Gottfried Wilhelm Leibniz (1646–1716), Immanuel Kant (1724–1804), and Karl Menger (1902–1985). Leibniz's version took the form of a principle of plenitude, as Arthur Lovejoy has called it: The idea being that God created the most varied and populous of possible worlds. Kant felt a need to moderate the effects of Occam's Razor and thus created his own counter-razor: "The variety of beings should not rashly be diminished."[71]

Karl Menger found mathematicians to be too parsimonious with regard to variables, so he formulated his Law Against Miserliness, which took one of two forms: "Entities must not be reduced to the point of inadequacy" and "It is vain to do with fewer what requires more." A less serious, but (some might say) even more extremist anti-razor is Tlön, Uqbar, Orbis Tertius". There is also Crabtree's Bludgeon, which takes a cynical view that "[n]o set of mutually inconsistent observations can exist for which some human intellect cannot conceive a coherent explanation, however complicated."

See also


  1. ^ "The aim of appeals to simplicity in such contexts seem to be more about shifting the burden of proof, and less about refuting the less simple theory outright."[1]
  2. ^ "In analyzing simplicity, it can be difficult to keep its two facets – elegance and parsimony – apart. Principles such as Occam's Razor are frequently stated in a way which is ambiguous between the two notions ... While these two facets of simplicity are frequently conflated, it is important to treat them as distinct. One reason for doing so is that considerations of parsimony and of elegance typically pull in different directions."[1]
  3. ^ "Ockham's razor does not say that the more simple a hypothesis, the better."[69]
  4. ^ "Today, we think of the principle of parsimony as a heuristic device. We don't assume that the simpler theory is correct and the more complex one false. We know from experience that more often than not the theory that requires more complicated machinations is wrong. Until proved otherwise, the more complex theory competing with a simpler explanation should be put on the back burner, but not thrown onto the trash heap of history until proven false."[69]
  5. ^ "While these two facets of simplicity are frequently conflated, it is important to treat them as distinct. One reason for doing so is that considerations of parsimony and of elegance typically pull in different directions. Postulating extra entities may allow a theory to be formulated more simply, while reducing the ontology of a theory may only be possible at the price of making it syntactically more complex."[1]


  1. ^ a b c d e f Alan Baker (2010) [2004]. "Simplicity". Stanford Encyclopedia of Philosophy. California: Stanford University.  
  2. ^ a b Induction: From Kolmogorov and Solomonoff to De Finetti and Back to Kolmogorov JJ McCall - Metroeconomica, 2004 - Wiley Online Library.
  3. ^ a b Foundations of Occam's Razor and parsimony in learning from ricoh.comD Stork - NIPS 2001 Workshop, 2001.
  4. ^ a b A.N. Soklakov (2002). "Occam's Razor as a formal basis for a physical theory". Foundations of Physics Letters (Springer). 
  5. ^ a b J. HERNANDEZ-ORALLO (2000). "Beyond the Turing Test". Journal of Logic, Language, and ... 
  6. ^ M. Hutter (2003). "Algorithmic Learning Theory". Springer. 
  7. ^ Samuel Rathmanner;  
  8. ^ a b c Hugh G. Gauch, Scientific Method in Practice, Cambridge University Press, 2003, ISBN 0-521-01708-4, ISBN 978-0-521-01708-4.
  9. ^ a b c d e Roald Hoffmann, Vladimir I. Minkin, Barry K. Carpenter, Ockham's Razor and Chemistry, HYLE—International Journal for Philosophy of Chemistry, Vol. 3, pp. 3–28, (1997).
  10. ^ a b c d e f g Courtney A, Courtney M (2008). """Comments Regarding "On the Nature Of Science. Physics in Canada 64 (3): 7–8. Retrieved 1 August 2012. 
  11. ^ a b c Elliott Sober, Let's Razor Occam's Razor, pp. 73–93, from Dudley Knowles (ed.) Explanation and Its Limits, Cambridge University Press (1994).
  12. ^ Vogel Carey, Toni (Oct 2010). Lewis, Rick, ed. "Parsimony (In as few words as possible)". Philosophy Now (UK) (81). Retrieved 27 October 2012. 
  13. ^ Bauer, Laurie (2007). The linguistics Student's Handbook. Edinburgh: Edinburgh University Press.  p. 155.
  14. ^ Aristotle, Physics 189a15, On the Heavens 271a33. See also Franklin, op cit. note 44 to chap. 9.
  15. ^ Charlesworth, M. J. (1956). "Aristotle's Razor". Philosophical Studies (Ireland)
  16. ^ WorldHeritagens, Complexity and Dynamics citing Richard McKeon (tr.) Aristotle's Posterior Analytics (1963) p.150
  17. ^ a b c   Chap 9. p. 241.
  18. ^ Alistair Cameron Crombie, Robert Grosseteste and the Origins of Experimental Science 1100–1700 (1953) pp. 85–86
  19. ^ "SUMMA THEOLOGICA: The existence of God (Prima Pars, Q. 2)". Retrieved 2013-03-26. 
  20. ^ "What Ockham really said". Boing Boing. 2013-02-11. Retrieved 2013-03-26. 
  21. ^   p. 253.
  22. ^ Alistair Cameron Crombie (1959), Medieval and Early Modern Philosophy, Cambridge, MA: Harvard, Vol. 2, p. 30.
  23. ^ "Ockham's razor". Encyclopædia Britannica. Encyclopædia Britannica Online. 2010. Retrieved 12 June 2010. 
  24. ^  
  25. ^ Primary source: Newton (2011, p. 387) wrote the following two "philosophizing rules" at the beginning of part 3 of the Principia 1726 edition.
    Regula I. Causas rerum naturalium non plures admitti debere, quam quæ & veræ sint & earum phænomenis explicandis sufficiant.
    Regula II. Ideoque effectuum naturalium ejusdem generis eædem assignandæ sunt causæ, quatenus fieri potest.
  26. ^ Stanford Encyclopedia of Philosophy, 'Logical Construction'
  27. ^ On the existence and convergence of computable universal priors from M Hutter – Algorithmic Learning Theory, 2003 – Springer.
  28. ^ Baker, Alan (Feb 25, 2010). Edward N. Zalta, ed, ed. "Simplicity". The Stanford Encyclopedia of Philosophy (Summer 2011 Edition). 
  29. ^ Pegis 1945.
  30. ^ Stanovich, Keith E. (2007). How to Think Straight About Psychology. Boston: Pearson Education, pp. 19–33.
  31. ^ . 22 Jun. 2008.The Skeptic's DictionaryCarroll, Robert T. "Ad hoc hypothesis."
  32. ^ Swinburne 1997 and Williams, Gareth T, 2008.
  33. ^ Information Theory, Inference, and Learning Algorithms | url=
  34. ^ a b Jefferys, William H.; Berger, James O. (1991). "Ockham's Razor and Bayesian Statistics (Preprint available as "Sharpening Occam's Razor on a Bayesian Strop")".  
  35. ^  
  36. ^ Sober, Elliott (2004). "What is the Problem of Simplicity?". In Arnold Zellner,   paper as pdf
  37. ^ Einstein, Albert (1905). "Annalen der Physik" (in German) (18). pp. 639–41.  .
  38. ^ L Nash, The Nature of the Natural Sciences, Boston: Little, Brown (1963).
  39. ^ de Maupertuis, PLM (1744). "Mémoires de l'Académie Royale" (in French). p. 423. .
  40. ^ de Broglie, L (1925). "Annales de Physique" (in French) (3/10). pp. 22–128. .
  41. ^ RA Jackson, Mechanism: An Introduction to the Study of Organic Reactions, Clarendon, Oxford, 1972.
  42. ^ BK Carpenter, Determination of Organic Reaction Mechanism, Wiley-Interscience, New York, 1984.
  43. ^ Quote Investigator: "Everything Should Be Made as Simple as Possible, But Not Simpler"
  44. ^ Sober, Eliot (1994). "Let's Razor Occam's Razor". In Knowles, Dudley. Explanation and Its Limits. Cambridge University Press. pp. 73–93. .
  45. ^ Naomi Oreskes, Kristin Shrader-Frechette, Kenneth Belitz (Feb 4, 1994). "Verification, Validation, and Confirmation of Numerical Models in the Earth Sciences". Science, 263 (5147): 641–646.  
  46. ^ Paul Pojman (2009). "Ernst Mach". The Stanford Encyclopedia of Philosophy. California: Stanford University.  
  47. ^ Lee, M. S. Y. (2002): "Divergent evolution, hierarchy and cladistics." Zool. Scripta 31(2): 217–219. doi:10.1046/j.1463-6409.2002.00101.xPDF fulltext
  48. ^ Sober, Elliot (1998). Reconstructing the Past: Parsimony, Evolution, and Inference (2nd ed.). Massacusetts Institute of Technology: The MIT Press. p. 7.  
  49. ^ Crick 1988, p. 146.
  50. ^ "Encyclopedia of Philosophy". Stanford.  .
  51. ^ Dale T Irvin & Scott W Sunquist. History of World Christian Movement Volume, I: Earliest Christianity to 1453, p. 434. ISBN 9781570753961.
  52. ^ "SUMMA THEOLOGICA: The existence of God (Prima Pars, Q. 2)". Retrieved 2013-03-26. 
  53. ^ McDonald 2005.
  54. ^ "Ratzsch, Del". Calvin. .
  55. ^ "Encyclopedia of Philosophy". Stanford.  .
  56. ^ Tonry, Michael (2005): Obsolescence and Immanence in Penal Theory and Policy. Columbia Law Review 105: 1233–1275. PDF fulltext
  57. ^ Nannen, Volker. "A short introduction to Model Selection, Kolmogorov Complexity and Minimum Description Length". Retrieved 2010-07-03. 
  58. ^ Algorithmic Information Theory
  59. ^ Paul M. B. Vitányi and Ming Li; IEEE Transactions on Information Theory, Volume 46, Issue 2, Mar 2000 Page(s):446–464, "Minimum Description Length Induction, Bayesianism and Kolmogorov Complexity."
  60. ^ Chris S. Wallace and David M. Boulton; Computer Journal, Volume 11, Issue 2, 1968 Page(s):185–194, "An information measure for classification."
  61. ^ Chris S. Wallace and David L. Dowe; Computer Journal, Volume 42, Issue 4, Sep 1999 Page(s):270–283, "Minimum Message Length and Kolmogorov Complexity."
  62. ^ 'Occam’s razor as a formal basis for a physical theory' by Andrei N. Soklakov
  63. ^ 'Why Occam's Razor' by Russell Standish
  64. ^ Ray Solomonoff (1964): "A formal theory of inductive inference. Part I." Information and Control, 7:1–22, 1964.
  65. ^ J. Schmidhuber (2006) "The New AI: General & Sound & Relevant for Physics." In B. Goertzel and C. Pennachin, eds.: Artificial General Intelligence, pp. 177–200
  66. ^ David L. Dowe (2008): Foreword re C. S. Wallace; Computer Journal, Volume 51, Issue 5, Sept 2008 Pages:523–560.
  67. ^ David L. Dowe (2010): "MML, hybrid Bayesian network graphical models, statistical consistency, invariance and uniqueness. A formal theory of inductive inference." Handbook of the Philosophy of Science – (HPS Volume 7) Philosophy of Statistics, Elsevier 2010 Page(s):901–982.
  68. ^ Scott Needham and David L. Dowe (2001):" Message Length as an Effective Ockham's Razor in Decision Tree Induction." Proc. 8th International Workshop on Artificial Intelligence and Statistics (AI+STATS 2001), Key West, Florida, U.S.A., Jan. 2001 Page(s):253–260
  69. ^ a b  
  70. ^  
  71. ^  

Further reading

External links

This article was sourced from Creative Commons Attribution-ShareAlike License; additional terms may apply. World Heritage Encyclopedia content is assembled from numerous content providers, Open Access Publishing, and in compliance with The Fair Access to Science and Technology Research Act (FASTR), Wikimedia Foundation, Inc., Public Library of Science, The Encyclopedia of Life, Open Book Publishers (OBP), PubMed, U.S. National Library of Medicine, National Center for Biotechnology Information, U.S. National Library of Medicine, National Institutes of Health (NIH), U.S. Department of Health & Human Services, and, which sources content from all federal, state, local, tribal, and territorial government publication portals (.gov, .mil, .edu). Funding for and content contributors is made possible from the U.S. Congress, E-Government Act of 2002.
Crowd sourced content that is contributed to World Heritage Encyclopedia is peer reviewed and edited by our editorial staff to ensure quality scholarly research articles.
By using this site, you agree to the Terms of Use and Privacy Policy. World Heritage Encyclopedia™ is a registered trademark of the World Public Library Association, a non-profit organization.

Copyright © World Library Foundation. All rights reserved. eBooks from World eBook Library are sponsored by the World Library Foundation,
a 501c(4) Member's Support Non-Profit Organization, and is NOT affiliated with any governmental agency or department.