SCIENCE in the EYES of a SCIENTIST   

                                                                                               By Mark Perakh  

Posted on November 3, 2001

 

INTRODUCTION 

            In many articles on this site, especially those discussing books and papers which are devoted to the most recent incarnation of the creationism, disguised as the so-called “intelligent design theory,” or which purportedly assert the harmony between science and the Bible, the question will repeatedly arise of what constitutes legitimate science and how to distinguish it from either bad science or pseudo-science. In this paper I intend to discuss the above question in general, thus laying foundation for the subsequent discussion of that question in particular books and articles.

Obviously, this distinction is possible only if a certain definition of science has first been agreed upon.  Unfortunately, it seems that a rigorous definition of science is hard to offer, and perhaps even impossible, for reasons which may become clear from the following discourse.  In a non-rigorous and most general way, science can perhaps be defined as a human endeavor consciously aimed at acquiring knowledge about the world in a systematic and logically consistent manner, based on factual evidence obtained by observation and experimentation.  

As it should become clear from the following discourse, it is hard to consistently adhere to the above definition of science because the latter seems to inevitably encompass elements beyond the limitations of that definition.

            The subject of this article is traditionally discussed within the framework of the philosophy of science. My intention is to look at the problem from a different standpoint – that of a scientist rather than of a philosopher.  I envision this as a glance from within science rather than from a philosophical outside.  Such an approach entails considerable difficulty.  Whereas philosophers may legitimately avoid discussing the specific nuts and bolts of scientific work, a glance from within requires analyzing specific details of the process of scientific inquiry and this may pose a serious obstacle to non-scientists trying to understand my thesis. I will have, on the one hand, to avoid the Scylla of engaging in arcane specifics of the modern highly sophisticated scientific methodology and, on the other hand, the Charybdis of oversimplification by limiting myself only to the most general notions and thus actually becoming a dilettante philosopher.

            I don’t know if I can succeed in such an act of balance on a thin wire between the two approaches and probably will not be able to eschew the trap of becoming  an amateur philosopher anyway. 

I also realize that it may be simply impossible to analyze the essence of science as though all of its branches were characterized by the same set of features.  Science is very complex and multifaceted so an analysis which is quite adequate for some fields of science may happen to be irrelevant in other fields.  My approach will be relevant first and most to such fields as physics, chemistry, engineering disciplines, and biology, while many aspects of my discourse may be less applicable to medical science, and even less to history, esthetics, literary criticism and the like. 

            One of the first questions in the discussion of science is what is the driving force of science.  While different scientists may be driven by different motivations, I submit that the main reason for the existence of scientific research is the curiosity which seems to be  inherent in human nature.  This statement does not imply that every man and woman necessarily possesses such curiosity. On the contrary, the vast majority of people lack it.  As the great Russian poet Aleksandr Pushkin said (he meant only the Russian people, but his words can probably be applied to humanity as a whole) “We are lazy and not curious.”  However, there has always existed a fraction of people differing from others in that they are driven by insatiable curiosity.  Apparently, some ethnic groups, at different historical periods and for unknown reasons, had a larger fraction of such people than others.

            It seems that historically attempts to satisfy the curiosity in question were made in three distinctive ways. These are religion, philosophy and science.

            (As any simple scheme, the above classification does not fully represent all aspects of the human drive to understand the world. For example, the above three-part scheme leaves out such important forms of human endeavor as art.  However, this scheme is convenient for the purpose of the main subject of this article).

            At the dawn of civilization, when humankind’s knowledge of the world was extremely limited by the narrow experience within a small locality, the curiosity about the surrounding world could be satisfied only by means of religion.  For example, until recently, the Indian tribes in America, as well as native tribes in Africa and Polynesia, all had certain religious systems, but none of them had developed either philosophy or science.

            Religion is a way of explaining the world which is not based on evidence but largely on imagination and fantasy. The religious explanation of the world does not require either factual evidence or logic.  The holy books of the world’s largest religions not only are incompatible with each other but are also each full of internal inconsistencies and descriptions of improbable events, but this has in no way impaired their impact on large masses of people.  Although religions often borrowed concepts from each other, most of them claim the monopoly on truth.  They have an enormous power of survival despite any evidence against their tenets.

            As the boundaries of the explored world expanded, the next steps in the attempts at explaining the world were philosophy and science. It is hard to establish a definite chronological sequence of their emergence. In some cases science preceded philosophy. Indeed, well developed science is known to have existed in the ancient Babylon and Egypt while philosophy does not seem to have left signs of existence in that remote past. In other cases philosophy and science seemed to emerge simultaneously, often being for a while indistinguishable from each other.  Whatever simple scheme can be suggested, it necessarily will be incomplete and contradict at least some features of the complex relationship between the three ways of comprehending the world I listed above.

            Like religion, philosophy does not necessarily require a factual evidence (although it may) but, unlike religion, philosophy requires logic.  Unlike religion, philosophy rarely claims the monopoly on truth (although some philosophical theories did just that – an example is the Marxist philosophy).

            Finally, as more factual evidence was accumulated about the world, science took over as a way to gain knowledge in a systematic matter, trying to base its conclusions both on factual evidence and logic.

            In the course of millennia science underwent multiple modifications, for many centuries remaining indistinguishable from philosophy.  The modern science which is quite distinctive from philosophy developed largely during the few most recent centuries.

            In a certain sense, the relation between science, philosophy and religion can be represented, necessarily simplifying the actual situation, by three concentric circles. The inner circle encompasses scientific knowledge.  Its radius is constantly increasing, as more and more knowledge is accumulated.  Beyond that inner circle of scientific knowledge is the concentric circle of a larger radius, the area between the two circles representing the domain of philosophy.  Philosophy reigns where science has yet not acquired sufficient knowledge, thus leaving room for speculations limited only by logic.  Finally, an even larger concentric circle surrounds the two mentioned circles – that of science and that of philosophy.  This is the domain of religion where neither science nor philosophy has so far managed to offer good explanations of the world. The inner circle – that of science – is continually expanding at an ever increasing rate, as science accumulates more and more knowledge.  This expansion of science pushes philosophy into a gradually narrowing domain where it still has the freedom of logical explanations not based on rigorously collected factual data. This, in turn, pushes religion even further to the margins of the world’s comprehension.  The enormous power of survival obviously inherent in religions is  due not to their gradually shrinking explanatory abilities but rather to their ability to satisfy the emotional needs of man.                

FEATURES OF GOOD, BAD AND PSEUDO-SCIENCE 

We have to distinguish between “good” and “bad” science as well as between genuine and pseudo-science.  While these two dichotomies are of different nature, sometimes it is difficult to distinguish between bad science and pseudo-science.

Bad science differs from good science in that it employs the same legitimate approach to inquiry as good science but at some stage of that inquiry fails, often inadvertently, to follow the path of an objective and/or logical process and goes astray in its conclusions.  Generally speaking, good science serves the progress of science itself and of its derivatives in engineering, medicine, and technologies of various kinds whereas bad science is rather promptly discarded. Since bad science is usually recognized rather swiftly, it is often just a nuisance and is rarely so harmful as to seriously impede the progress of good science.

A case of bad science can be exemplified by the story of the so-called cold fusion.  The discovery of cold fusion was announced by two groups of scientists, of which one consisted of professors of electrochemistry Martin Fleischman and B. Stanley Pons  and their associate M. Hawkings, and the other was a group headed by Professor Steven Jones.

Whereas I never heard of Pons, Hawkins, and Jones before they announced their discovery, I was acquainted with Professor Fleischman. In the seventies, we both served as members of the Council of the International Society of Electrochemistry (of which Fleischman later served as president).  I also twice visited the University of Southampton where Fleischman was a professor of electrochemistry, and gave a talk there.  Since Fleischman’s research was in a field rather remote from my own, I was not really familiar with his scientific production, but I knew of his reputation as a serious electrochemist. Therefore, when I first heard about cold fusion I was both greatly impressed and puzzled.  I was impressed by the mere claim that two electrochemists, using an experimental set-up which only could be characterized as primitive, reportedly achieved a revolutionary result of immense importance. I was puzzled because that result seemed rather unlikely (in particular because these researchers did not observe emission of neutrons from an alleged nuclear fusion process, and, moreover, their description of the experimental procedure seemed to be too vague.)   On the one hand, it seemed hardly probable that experienced scientists would have made such an extravagant claim without having meticulously studied the claimed phenomenon, and on the other hand, it seemed equally hardly probable (although not impossible) that their claim was true.  Soon,  several groups of researchers were busy with verification of the sensational cold fusion reports.  Most of them rather rapidly came to the conclusion that the cold fusion effect could not be reproduced.  The sensation died and most of the researchers discarded the cold fusion report as bad science.  (Despite the disappointment by the evident irreproducibility of the initial results by Fleischman and Pons, some research  in that direction is still going on, and regular conferences devoted to that subject are being held; although no actual cold fusion has been reported so far, these efforts may be useful if we remember a saying attributed to Rutherford. This famous physicist was once asked why he allowed one of the researchers in his lab to conduct an obviously hopelessly futile study. Reportedly, Rutherford answered: "Let him do it. He hardly will find what he is looking for but maybe he'll find something else which happens to be useful.")

 Pseudo-science, on the other hand, is an endeavor which essentially is not real science but something else disguised in quasi-scientific clothes for the sake of a certain agenda, usually having nothing to do with science and often hostile to science.  It may be quite detrimental to genuine science because of its ability to successfully disguise itself as genuine science and thereby forcing on science an unnecessary defensive effort requiring the spending of valuable human resources needed for the progress of genuine science.   A little later in this article I will suggest some criteria for discriminating between genuine and pseudo-science and suggest appropriate examples.    

It seems useful to distinguish between the language of science (and definitions as a part of it) and its building blocks.  

THE LANGUAGE OF SCIENCE 

            The language of science is overwhelmingly mathematical.  First, this statement can justifiably be viewed as a simplification.  The interaction between science and mathematics is neither straightforward nor one-dimensional.  Second, an adequate analysis of mathematics as a predominant language of science would require a preliminary analysis of what is a language. This would, though, lead well beyond the scope of my topic. Third, this statement may be disparaged as allegedly dismissing non-mathematically expressed knowledge as non-science.  Of course, it is just a question of semantics.  Is psychology a science? Is history a science? Surely, these and many other fields of inquiry are legitimately listed among sciences even though mathematics has so far found only a limited room for itself within those sciences.  (A different story is that mathematics finds its way into those sciences via the ever increasing penetration of computers and artificial intelligence.)  What is more important, the path these sciences follow seems to repeat that traveled by physics, chemistry, and biology.  Centuries ago, actually before Johannes Kepler and Galileo Galilei, physics used little mathematics and was largely a descriptive science, much like psychology or history are today.  It took time for the marriage of physics and mathematics to occur, but when it happened, the impetus for the development of powerful theories of physics was enormous.  “Mathematics is a language,” said the great American physicist Josiah Willard Gibbs.  Modern mathematics is so multifaceted that it is probably more proper to speak about mathematical languages in plural.   

           Mathematics is, though, more than a mere language (or languages).  It is a compressed logic immensely facilitating a theoretical comprehension of experimental facts.  Without mathematics, contemporary physics would not exist.  Hence, while saying that the language of science is mathematics, this is fully applicable to the physics of today but also, probably, to the psychology or history of tomorrow.

            The mathematical language is overwhelmingly used in science not because it is more powerful than the polymorphous natural vernacular.  The opposite is true – the polymorphous natural language is, generally speaking,  more powerful.  The use of mathematical language in science is dictated not by its power but by necessity, as will be discussed in more detail when we turn to the concept of definition.  Actually, whereas mathematical language immensely facilitated and accelerated the development of science, toward the middle of the 20th century it has become evident that the scientific study of the so-called large or poorly-organized systems (which will be discussed in subsequent sections) is sometimes impeded by a strict adherence to the rigorously enforced use of a uniform mathematical language.  The sharp demarcation between natural language with its inherent ambiguity of terms and the rigorous uniform mathematical language started gradually diffusing.

            Hence, although the mathematical language is still the prevalent, largely preferred and often simply necessary way of scientific discourse, polymorphous language gradually finds its way into that discourse, a phenomenon which is often viewed simply as a modification of the mathematical language itself.  Indeed, mathematics constantly develops new tools (such a chaos theory, non-standard logic, and the like) which help it catch up with the needs of physics and other disciplines in refining their mathematical representation.

The question of why mathematics is applicable in physics puzzled many scientists and philosophers alike.

Physics is a discipline in which mathematics is so intertwined with the body of that science that often it seems hard to see where physics ends and mathematics starts.  Physics and mathematics have historically developed, constantly exchanging ideas and methods.  The greatest mathematicians often happened to be also outstanding physicists and vice versa. Isaac Newton developed some of the most fundamental theories in physics and also invented a revolutionary part of mathematics – differential calculus.  Leonard Euler made enormous contributions both to mathematics and to physics.  New mathematical methods which seemed to be of a highly abstract nature (like functional analysis) swiftly found their way into specific fields of physics while methods developed to tackle specific problems of physics (like vector algebra and then vector calculus originally invented in hydrodynamics) very soon were generalized to become chapters of abstract mathematics.

            There is something fascinating in that mysterious usefulness of mathematics in physics despite the obvious difference in their treatment of objects.  Here is a simple example. We all know that matter has a discrete structure. It consists of tiny particles. It is impossible to visualize these particles since their behavior is enormously different from the behavior of the macroscopic bodies we see, hear, taste, smell and touch.  Whatever these particles really are like, we know that matter is not continuous.  However, we routinely apply in physics an abstract mathematical concept of infinitesimally small quantities, like infinitesimally small electric charge and the like.  This mathematical trick works very effectively despite its obvious contradiction of our knowledge of the discrete structure of matter.  The concept of an infinitesimally small charge is contrary to the reliably established facts of the discrete nature of electric charge.

            I believe the described discrepancy between mathematics and physics does not undermine the extreme usefulness of mathematics in physics for the same reason that the use of physical models does not undermine the study of real objects.  One of the following sections will discuss the use of models in science in detail.  Preempting that discussion, we may note that models differ from the real objects in that the models share with the real objects they represent only a small fraction of properties while ignoring the overwhelming majority of the latter.  A good model shares with the real object only those few properties which are crucial for the problem at hand. For example, in Newton’s celestial mechanics the model of a planet is a point mass. It has only one property common with a real planet – its mass.  Replacing a real object with its model, while introducing a certain, usually insignificant imprecision, enables scientists to tackle otherwise enormously complex problems.  Similarly, applying an abstract mathematical concept such as infinitesimally small charge, is similar to using a model in which the discrete structure of the real electric charge is ignored as inconsequential for the problem at hand.  Since those properties of an electric charge which are crucial for the problem at hand are accounted for in the theory assuming a continual structure of the electric charge, the powerful mathematical machine works very well, immensely facilitating the analysis of the behavior of real charges.

         Therefore the contradiction between abstract mathematical concepts and the real properties of the objects of study in physics is more of philosophical than of practical significance.

            Still, the amazing usefulness of mathematics in physics seems to be a vexing problem for curious minds. It has so far not received a commonly accepted interpretation, although more than once various explanations have been suggested. 

            I believe that the fact of mathematics being the language of science can be understood through a rather simple and general idea.  One of the requirements for an approach to be genuinely scientific, which is not limited to science alone, but is equally valid for any discourse which may be viewed as reasonable, is to be logical.  Logic itself cannot serve to distinguish between correct and incorrect statements, because logic is simply the proper path from a premise (or premises) to a conclusion.  The logical conclusion is only as good as the premise.  However, if the premise is correct the conclusion reached via a logical procedure is reliable.  Therefore it is imperative to use logic at every stage of a scientific inquiry.  Mathematics is the most powerful and the most concise form of logic.  Naturally, science had to resort to mathematics as an extremely powerful tool of logical discourse.

Leaving this philosophical problem aside, we may assert, that, generally speaking, mathematics is the proper language for science. 

DEFINITIONS 

In many of the following articles, the question will arise of what constitutes a proper definition of the concepts under discussion (see, for example, http://members.cox.net/perakm/dembski.htm , sections on probability and complexity). Therefore it seems appropriate to provide first a general discussion of what the features of proper definitions are and which types of definitions can be acceptable in a scientific discourse.

Obviously, a fruitful scientific discourse is possible only if certain definitions are agreed upon. Proper definitions are therefore of the utmost importance and introducing them requires the maximum possible care.  

It seems a platitude to say that a definition is essentially an agreement regarding the meaning of a term.

In the everyday discourse, we are rarely concerned with the precise definition of  words used in the vernacular. The vagueness of the vernacular is actually its advantage, supplying flexibility to the discourse and enlivening the latter.  The vagueness of the vernacular is also connected to its redundancy, which facilitates communication.  Redundancy, providing a certain amount of excessive information (in the layman’s sense of that term), helps to compensate for the possible distortion of the message which is due to the unavoidable noise accompanying the transmission of a message.

Scientific terms, on the other hand, usually require precise definitions and this is one of the features which distinguish scientific discourse from an everyday chat. 

A term may have quite different definitions depending on the branch of science in which it is being utilized. For example the term vector in mathematics and its applications in physics means a quantity fully determined by its magnitude and direction (as opposite to a scalar which is determined by its magnitude alone, without any reference to direction). On the other hand, in pathology the same term, vector, means an organism, such as a mosquito or tick, that carries disease-causing microorganisms from one host to another. This example illustrates the utmost importance of rigorous definitions of concepts used in each field of science.

            Science is a collective enterprise. It is impossible without communication within the community of participants in that enterprise.  Communication is predicated on the acceptance of certain common definitions of concepts of which the edifice of a particular science is composed. 

Formulating a definition of a concept is a logical procedure which, although it may vary in particular details, is subordinated to certain general rules common for all the variations of definitions.  One of the most salient features of a definition is that it presupposes a certain preliminary knowledge relevant to the subject of definition. No concept can be defined unless the meaning of certain underlying concepts has already been agreed upon.  If the definition of a concept is attempted in the absence of some underlying relevant knowledge, such an attempt will not provide a legitimate definition, but at best a quasi-definition which will necessarily be ambiguous and logically deficient.

            A corollary to the last statement is that every field of science starts by introducing a seminal concept for which no real definition can be given.  The word “starts” in that sentence implies a logical rather than chronological order of steps in building up a field of science.

There are several possible classifications of definitions. 

One such classification is based on the logical structure of a definition. According to that classification, three types of definition can be distinguished.  They are a) Deductive definition; b) Inductive definition, and c) Descriptive definition. 

A deductive definition has the form of a triad.  It necessarily comprises three elements which are: a) a general concept which encompasses a certain set A of elements and which is assumed to be commonly understood within the appropriate circle of the participants of a discourse; b) a particular concept which is the target of the definition and which encompasses a certain subset B of elements of the set A, and c) qualifiers which specify those features of the elements of subset B distinguishing the latter from all other elements of set A not included in subset B.  Obviously, for a deductive definition to be meaningful, the participants of the discourse must have a preliminary understanding, assumed to be identical for all participants of the discourse, first of the general concept which determines set A and second of the qualifiers characterizing subset B.

A deductive definition also usually contains sentential elements, either explicitly or implicitly,  in the form of  “such…. that,” “that (those)…which (who)” and the like. These sentential elements can be omitted but they are nevertheless implied.

For example here is a deductive definition of the term “colt.” 

A colt is such a  horse that is a) young, and b) male.

In this definition the general concept is that of “horse.”  It is assumed to be known to all participants of the discourse. If it were not known, the definition would make no sense as it would define the target - “colt” – through an unknown concept requiring its own preceding definition.  By the same token, the qualifiers, of which there are two in this case (young and male) are assumed to be commonly understood by all participants of the discourse, otherwise the definition of a colt would be useless. All horses, regardless of their being colts or not colts (i.e. fillies, geldings, stallions and mares) are elements of set A, while only those horses which meet the qualifiers (i.e. are male and young)  are elements of subset B.

Here is another example. 

Daughters of Ann and Andrew are those of their kids who are female.

I have intentionally chosen an example which sounds primitive in the extreme because, despite its simplicity, it is representative of a deductive definition. 

According to the goal of a definition, deductive definitions can be of three basic types, a) phenomenological, b) taxonomic, and c) explanatory definitions.  The distinction between these three types of definitions may sometimes be quite clear, while in some other cases this distinction may be blurred.  For example, a taxonomic definition may be at the same time also either phenomenological or explanatory. Also, a definition may be viewed as either phenomenological or explanatory depending on the requirements to the degree of its explanatory power.

The distinction in question is determined not as much by the contents of a definition as by its goal.

A taxonomic definition is always a member of a set of “parallel” definitions all of which contain the same general concept but differ in qualifiers.  For example, the definition of a colt discussed above is taxonomic, as it is a member of a set, including also definitions of filly, mare, stallion and gelding.

From another standpoint, deductive definitions can be either qualitative or quantitative.

Let us review an example of a definition taken from the elementary course of physics: Semiconductors are those materials whose electric conductivity increases with temperature. 

In this case set A comprises all materials whereas subset B comprises only semiconductors, while the qualifier is a certain feature of the behavior of the elements of subset B which distinguishes them from all other materials – namely the temperature dependence of their electric conductivity. This definition belongs to the phenomenological type.  It determines the target of definition by specifying a certain behavior of elements of subset B without providing an explanation of why they behave as they do.   Assume now that we need a definition of “semiconductor” which is of explanatory type, i.e. containing a reference to the mechanism underlying the behavior of conductivity described in the phenomenological definition.  We discover that the mechanism in question is not the same for all semiconductors, so that there are three distinctive classes of semiconductors, in all of which electric conductivity increases with temperature, but the underlying mechanisms of that phenomenon are different. Therefore, in order to provide an explanatory rather than a phenomenological definition of a semiconductor, we have to first introduce taxonomic definitions for each of the three classes of semiconductors.  One such class is the so-called intrinsic semiconductors (the two other classes are the donor-type or n-type semiconductors, and the acceptor-type or p-type semiconductors).  A taxonomic definition of the intrinsic semiconductors can be given as: An intrinsic semiconductor is such a  semiconductor that is not doped by either donor or acceptor elements. (The term doping means deliberately inserting into a material A small amounts of another material, B,  which alters the properties of A in a desired manner).  Now, as the concept of an intrinsic semiconductor has been taxonomically defined,  we can give an explanatory definition of that concept:  Intrinsic semiconductors are such materials in which the energy gap between the conduction and the valence energy bands is narrow.

 This definition is better in that it specifies the intrinsic property of elements of subset B which predetermines their behavior.  However this definition,  while explanatory, is only qualitative. A still better scientific definition would be quantitative.  It would numerically specify the meaning of the term “narrow.”

            If either a general concept encompassing set A or the qualifiers distinguishing subset B from the rest of set A are unknown, the triad-like deductive definition is impossible.

The first type of situation is common if a seminal concept of a science is to be chosen.  Since at the starting point of a new field of science there are yet no defined concepts in the arsenal of that science, its starting concept cannot be deductively defined.  In this case the first element of the triad is absent, hence the triad cannot be logically constructed.  A good example is the concept of energy in physics.

            Energy is the most fundamental concept in physics. It is also its most seminal concept (not in a chronological but in a logical sense). In a certain sense, physics is a science about energy transformations. The law of energy conservation is the most fundamental basis of the science of physics. However, since physics has no concept which is more general than energy, the latter cannot be deductively defined. 

            Of course, one can find in textbooks on physics, especially those published many years ago, any number of alleged definitions of energy, but most of them are logically deficient and are not real rigorous definitions but rather attempts at explaining the meaning of that concept without really defining it in a logically consistent way.  Some of those quasi-definitions are plainly absurd.  One can see in some old textbooks such quasi-definition of energy as one asserting that energy is “the ability of a body to perform work,” which is a meaningless statement, appealing to the laymen’s understanding of the term “work.”

The famous physicist Richard Feynman, in his excellent course “The Feynman Lectures on Physics,” prudently avoided attempts to define energy in a rigorous way, providing instead an explanation of the term energy as “a certain quantity, which we call energy, that does not change in the manifold changes which nature undergoes.” (R. P. Feynman, R.B. Leyton and M. Sands, “The Feynman Lectures on Physics, Addison-Wesley Publishing Co, 1963, v.1, p. 4-1). This is a descriptive definition which cannot be replaced by a rigorous triad because we lack any concept which is more general than energy, so the vague term a certain quantity is utilized to avoid the unsolvable problem of defining the seminal concept.

 Another case when the rigorous triad-like deductive definition is impossible is when no qualifiers are known.  In such cases we know the general concept and know that, in principle, the target of the definition is a sub-category within that general concept, but we don’t know what distinguishes that sub-object from other sub-objects within the same general concept.  For example,  let us say John and Mike are identical twins. Obviously they are two different persons, both being, though, sons of Andrew and Mary. However, we don’t know how to distinguish between John and Mike so we can’t provide a triad forming the proper definition of either who Mike is or who John is. Any attempt to provide a definition, say, of Mike, will result in a tautology which, although necessarily true, is of no substance (for example, an assertion that Mike is he of the twins whose name is Mike).  At best, such a definition can be of what can be called “accidental” type, which is not really useful because it refers to some irreproducible (accidental) qualifiers and therefore cannot be applied under conditions which are in any way different from the specific situation used for generating such qualifiers. For example, such a definition can sound like: Mike is that son of Andrew and Mary who on Tuesday, January 3, 1978 opened the door of their house when we rang the bell. Although that definition formally is in the form of a deductive triad, its qualifier is of an accidental type and cannot be used to distinguish between the twins at any time other than Tuesday, January 3. 1978. 

While a deductive quantitative explanatory definition is the most desirable scientific definition, quite often we have to satisfy ourselves with different types of definitions.  The reason for that is not necessarily the principal impossibility of formulating a deductive triad, but often simply the history of the development of a certain range of concepts.  If an experimental work generates a set of data, this often provides the basis for formulating an inductive definition of a new concept which becomes indistinguishable from a specific type of law of science.  (We will discuss the concept of laws of science in detail a little later, but at this point we can note that a law of science which results from a process of induction is not equivalent to those laws which constitute parts of a scientific theory.  Inductive laws, which are often no more than inductive definitions, belong to the phenomenological type.  They state facts without explaining them.)

Inductive definitions which spell out a law can often be characterized also as relational definitions – they state a certain relation between two or more concepts.  For example, assume a scientist studied the behavior of the metal Gallium at various temperatures.  Repeating the tests many times, she found that at a pressure of 105  Pascal, pure Gallium melts at 302.5 Kelvin.  She postulates that her data, consistently showing that result, are not accidental but reflect a law.  She formulates a law: “Pure Gallium melts at 302.5 K if the pressure is 105 Pascal.” Her postulate resulted from the procedure of induction wherein she proceeded from a set of individual events to generalizing it as a law.  Her formulation defines a law and therefore is also a definition.  It is a relational definition as it establishes a relation between a certain event (melting of Gallium) and two factors – temperature and pressure.  It is also a phenomenological definition since its states a fact without explaining its nature.

Even if an inductive definition is not stating a law, the path to such a definition is similar to the procedure of scientific induction wherein the results of observations or experiments repeated many times are generalized, postulating that the data in question are the manifestation of a law of nature. We will discuss the transition from a set of observational or experimental data to a postulate of a law in detail in a subsequent section.  At this point, let us simply illustrate the idea of inductive definition by an example.

            Imagine that a zoologist whose name is Johns went to explore jungles in, say, some part of Africa, and discovered there a number of animals hitherto unknown. Let’s say all these newly discovered animals happened to be various kinds of cats – cat M, cat N, cat L, etc. Each of these cats differs in certain characteristics from other cats, but all of these cats have certain features in common distinguishing them from cats found in other parts of the world. The zoologist suggests a name for this newly discovered subfamily of cats, say Johns’s cats. Johns provides a detailed description of each of the kinds of John’s cats – M, N, L etc.  He has then to provide a definition of the new subfamily of cats as a whole. He defines them as follows: “Johns’s cats is a subfamily of cats comprising cats M, N and L.”  He may also provide a description of the features which are common for all Johns’s cats, but are absent in all other cats (for example, all Johns’s cats may have a peculiar shape of their ears, which is not observed in any other cats).  This is an inductive definition, generalizing subcategories into a broader super-category. 

Inductive definitions, by their nature, are open definitions, in that the list of their elements can be expanded if new elements are discovered which can be included into the definition.  On the other hand, deductive definitions are closed in that they encompass all possible elements which can be included in the definition.              

The lack of a rigorous definition, agreed upon by the participants of a discourse, may render an otherwise sophisticated discourse too vague to be genuinely scientific and may deprive even a very ingenious argument of its evidentiary value. 

THE BUILDING BLOCKS OF SCIENCE 

The principal building blocks of science are 

            1) Methods of experimentation,

            2) Data,

            3) Bridging hypotheses,

            4) Laws,

            5) Models,

            6) Cognitive hypotheses,

            7) Theories. 

A warning seems to be in order. The above list may create the impression that science can be presented by a neat straightforward scheme, as a combination of clearly distinguishable building blocks.  No such neat scheme exists.  The above listed building blocks of science can overlap, intersect and emerge in an order different from the above list.  However, the division of the body of science into supposedly separate building blocks, besides providing convenience in analyzing the extremely complex and multifaceted form of human endeavor we call science, also reflects the real composition of science, even though the boundaries between its constituents may sometimes be diffuse.

We will discuss the above concepts in detail. Before doing that, note that for each of the above elements of good science there is a corresponding element of bad science, i.e. bad methods, bad data, bad hypotheses, bad laws, bad models, and bad theories. On the other hand, pseudo-science differs from genuine science not just because one or several of the above building blocks can be characterized by adding the term pseudo (which, of course, is possible in principle) but because pseudo-science typically lacks legitimate data.  Since data are that part of science which provides evidence, the absence of real data means the absence of evidence which would support the hypotheses, laws, models and theories.  A hypothesis, law, model, or theory suggested without the real supporting data is typical of pseudo-science. 

The statements in the preceding paragraph require clarification of what distinguishes good data from bad data and from wrong data. This question will be discussed in the section on data.

            Whereas bad science is usually swiftly recognized as such, pseudo-science sometimes features a strong power of mimicry, disguising it as genuine science.  The litmus test enabling one to distinguish genuine science from pseudo-science is in looking for the data underlying the hypotheses, laws, models, and theories of the supposed science.  Quite often, discovering no data, i.e. no evidence supporting those hypotheses, laws, models and theories, serves as an indication that we are dealing with pseudo-science, regardless of how sophisticated the hypotheses, laws, models and theories in question may seem to be and regardless of how eloquently they are presented.

Review a few examples of pseudo-science.  Probably the most egregious example of a pseudo-science which had tragic consequences on an enormous scale is that of Marxism. Created in the 19th century Europe mainly by Karl Marx and Friedrich Engels, it so successfully disguised as science that it won a large number of adherents despite its obvious contradiction with facts.  Marxism comprised several parts which can be roughly defined first as a philosophy of the so-called dialectical materialism, second as the alleged historical analysis of society’s development (the so-called historical materialism) and third, as the Marxist political economy. The dialectical materialism was an eclectic philosophical theory combining elements of Feuerbachs’s materialism and Hegel’s dialectics.  Neither Marx nor Engels contributed much to the development of their predecessors’ philosophical ideas but they reformulated those ideas in the simplified form of a neat set of simple statements which could be much easier comprehended by non-philosophers (such as a vulgarized version of Hegel’s principles of the transition of quantity into quality or negation of negation, etc). (It is interesting to note that a frequent feature of pseudo-scientific theories is that they often suggest a neat and simple scheme allegedly representing complex reality). The core of Marxism was though their theory of the class struggle whose essence they themselves succinctly expressed by the maxim that the history of mankind is the history of the struggle of classes.  That theory reduced all the complexity of human history to one factor – the struggle of economic classes.  Marx and Engels did not seem to be worried by the fact that their representation of history was completely contrary to the histories of such countries as, for example,  China or India, and reflected even the history of Europe only in a rather strained way.  Such absurdities as interpreting religious wars as the results of the struggle of economic classes did not seem to make the creators of “scientific Marxism” pause and allow for the role of any factors besides the class struggle in the history of the mankind. Their extremely narrow concept ignored a multitude of various factors that affected the history of the human society.  In other words, they arbitrarily chose a very narrow subset of data from the much wider set of available data in order to fit the data to their preconceived theory.  This is a classic example of pseudo-science, whose main fault is the absence of a sufficient scope of data so that the theory is built accounting for only a deliberately selected tiny fraction of factual evidence. Since, however, Marxism had all the appearance of a scientific theory, its predictions won wide popularity, with one of the consequences being the bloody revolution in Russia.  Although the actual revolution and its consequences quite obviously did not fit the blueprint predicted by Marxism, the latter acquired the status of a godless religion, believed in with a fanatical stubbornness despite its obvious futility and the blood bath it caused. Indeed, like in religions, one of the persistent claims by Marxism was that it was “omnipotent because it was true.”

Another example of a pseudo-science is Lysenko’s pseudo-biology which was imposed by decree of the Soviet communist rulers as the only true biology compatible with Marxism-Leninism.  Lysenko’s theory was a complete hogwash with no basis in factual data whatsoever. Its imposition also had tragic consequences, as many genuine scientists were arrested and perished in the Gulag because they tried to refute Lysenko’s pseudo-science by pointing to the numerous data which it plainly contradicted. 

One more, this time rather comic, example was the allegedly revolutionary theory of viruses suggested in the late forties in the USSR by a semi-literate veterinary physician Boshian.  According to that theory, which was officially approved by the communist party, viruses constantly convert into bacteria and vice versa. Of course, Boshian’s theory was based on fictional data.  When, after many unsuccessful attempts, a commission of scientists finally gained access to Boshian’s secret laboratory, all they found, instead of samples of the alleged virus-bacteria colonies, was plain dirt. 

Other examples of pseudo-science include the so-called “creation science,” with its arrogant distortion and misuse of facts, as well as its more recent and more refined reincarnation under the label of the “intelligent design theory.”

 This theory, promoted by a large group of writers, including many with scientific degrees from prestigious universities and with long lists of publications, and propagated on various levels of sophistication, has all the appearance of scientific research, as it offers definitions, hypotheses, laws, models, and theories like a genuine science.  What is, though, absent in the “intelligent design theory,” is evidence.  No relevant data which support hypotheses, laws, models, and theories could be found in the papers and books by  proponents of intelligent design, only unsubstantiated assumptions.  Therefore it can justifiably be viewed as pseudo-science. I will also touch on this subject in the section on the admissibility of the supernatural into science, as well as in other articles on this website.  

METHODS OF EXPERIMENTATION 

          The statement that methods of experimentation are the engine of the progress of science seems to be quite trivial   Whereas this statement is especially transparent in physics, chemistry, biology and in engineering sciences, it is valid as well for any other science even if it may sometimes be not as obvious as it is for the listed fields of inquiry.

            When Galileo built his telescope, this provided an enormous impetus to the development of astronomy.  When the Hubble telescope was put in orbit, astronomy underwent another powerful push forward. When Antony van Leeuwenhoek built his microscope, it revolutionized biological science.  When the first electron microscope was built, it again revolutionized several fields of science, leading to the amazing modern  achievements like actually seeing and even manipulating individual atoms.

            These well known examples are only the tip of the iceberg, because the progress of science is pushed forward by the multiple everyday inventions of various ingenious methods of experiment or observation.

            In thousands of research laboratories scientists whose names are usually unknown to anybody except for the narrow circle of their colleagues, every day apply their ingenuity and inventiveness to creating new, ever more subtle methods to question nature.  Without these often extremely ingenious experimental methods and highly sophisticated experimental set-ups there would be no progress of scientific penetration into objective reality, which often guards its secrets very jealously. 

            There are two paths experimental science takes to progress.

            One path is the ever increasing capabilities of devices designed as versatile tools of scientific inquiry and of engineering or medical development, such as electron microscopes, telescopes, and electronic devices (signal generators, potentiometers, recorders,  oscilloscopes, etc).

            The other path is the design of unique experimental contraptions designed for specific experiments and measurements.

            These two path intersect at many points, as designing a specific new experimental contraptions is often mightily assisted by the availability of the more advanced tools of a versatile type.  

            Whereas inventions such as a microscope or radio transmitters and receivers justifiably gain wide publicity, many amazingly ingenious methods invented by scores of not-very-famous scientists usually remain unknown except for a narrow circle of researchers working in that particular field of science.

            Here is a typical example. In the late seventies, at one of the foremost research institutions in the USA, a group of highly qualified and trained scientists was developing certain types of magnetic memory for computers.  In one part of that study a magnetic wire was pulled at a certain speed through a tube filled with an electrolyte, and certain electrochemical processes on the surface of the wire were investigated.  The researchers encountered a problem: as the electrolyte was pumped through the tube, the friction between the electrolyte and the walls of the tube caused the pressure’s drop along the wire and this distorted the data. Considerable effort was invested in order to somehow eliminate or at least alleviate the described effect, but the effect persisted. Then a guest scientist arrived who has never before dealt with the experiment in question.  At the very first meeting where the detrimental effect of the pressure drop was discussed, the guest scientist looked at the diagram of the tube and the wire moving along the latter, and said: “Why, instead of pumping the electrolyte through the tube, just eliminate the pump, make a rifle-type spiral groove on the inner surface of the tube and put the tube into rotation.  The groove will push the liquid with a pressure which will be automatically uniform all along the wire.”

            This is a very simple example of how a fresh look at a problem may suddenly solve a seemingly difficult problem in a very simple way, but this simple example is typical and exemplifies those thousands and thousands of small and sometimes not very small inventions and innovations which occur every day in numerous research laboratories. 

            The unstoppable development of experimental methods, from designing giant accelerators of elementary particles to small improvements in measuring and observational methods, is what underlies the progress of science. 

            Therefore, the statement that a revolution in science starts when the experimental technique proceeds to the next digit after the decimal point reflects an important aspect of the progress of science. 

DATA 

          a) General discussion of data 

            In the words of the famous Russian physiologist Ivan Pavlov, facts are the bread of science. The term data is synonymous with facts. It is also synonymous, in a certain sense, with evidence. Without data there is no science, only pseudo-science.  In order to function as evidence, data must be reliable.  There are two main types of data, observational and experimental.  Both are equally legitimate, the difference being in that observational data are usually obtained in a passive manner, by merely recording whatever nature deigns to display by itself, whereas experimental data are gained as the result of a deliberate procedure of a planned research in which specific conditions are artificially created to force nature to display answers to specific questions.  The demarcation between observation and experiment are not quite sharply defined since observation may be a natural part of a designed experiment.

            Observation, and even some primitive form of experimentation, may happen to occur in completely non-scientific activities.  In a popular comic Russian poem for children, a Russian orthodox priest counts ravens sitting on trees, just for the heck of it.  It is an observation and it is accompanied by a measurement, but this does not make the priest’s activity scientific. Scientific observation or experimentation, while driven mainly by curiosity, always has an either conscious or subconscious purpose – to establish facts which may shed light on the intrinsic structure and functioning of the real world.

            A crucial element of a scientific experiment is measurement.  Measurement makes data quantitative, thus enormously enhancing the data’s cognitive value. There is a theory of measurement which teaches us about the precautions necessary to ensure the reliability of the measured quantities and about the proper estimate of unavoidable errors of measurement. This theory is beyond the scope of this essay, but we have to realize that, on the one hand, the data obtained via a properly conducted experimental procedure are reasonably reliable, but on the other hand they are always true only approximately and cannot be relied upon to an extent exceeding that determined by the margin of a properly estimated error.

            Consider some examples.

            Astronomy is a science wherein observation is by far more prevalent than experimentation.  A classic example of highly valuable observational data is the tables of the planets’ positions compiled by Tycho de Brahe over the course of many years of painstaking observations accompanied by meticulous measurements.  These tables, after they wound up in the possession of Johannes Kepler, served as the data the latter used to derive his famous three laws of planetary motion.  We will discuss later the transition from data via hypotheses to laws, and from laws to theories.  My point now is to illustrate how reliable data, gained via a proper observational procedure accompanied by measurement became a legitimate part of a scientific arsenal. 

            A classic example of the experimental acquisition of data is the discovery, in the early years of the 20th century, of superconductivity by a group of researchers in Leyden, the Netherlands, headed by Heike Kamerlingh Onnes.  These researchers systematically measured the electric resistivity of various materials at ever lower temperatures.  In a specially designed experiment, samples of material were gradually inserted deeper and deeper into a Dewar flask on whose bottom there was a puddle of liquid helium, while their electric resistance was measured.  Analysis of the obtained data led to the conclusion that at certain very low temperatures the electric resistivity of certain materials dropped to zero.  This unexpected result seemed contrary to the common understanding of electric resistance prevalent at that time.  However, data take precedence over theories.  The scientists of Leyden, however puzzled and astonished by their data, came to believe in those data’s reliability and thus announced the discovery of superconductivity.  Their data were then reproduced by other scientists and a new law asserting the fact of superconductivity was formulated.  The formulation of a law did not mean a theory of superconductivity was forthcoming.  It took half a century until a theory of superconductivity was developed.  Neither the law in question nor the theory of that phenomenon would be possible without first acquiring reliable data on the behavior of superconducting materials, these data acquired in a deliberately designed specific experiment.

            As we will see, the path from data acquisition to a law can be quite arduous and prolonged. It involves steps necessarily requiring imagination and inventiveness, because no law automatically follows from data. This is even more true for the path from data, via law, to theory.  However, only those laws and theories which stem from reliable data are constituents of genuine science.  (The question of what constitutes reliable data will be the subject of subsequent sections wherein the reproducibility of data and the problem of errors will be discussed.)

            I would like to illustrate the last statement by an example. In 1994, three Israelis, Witztum, Rips, and Rosenberg (WRR), published a paper in the mathematical journal “Statistical Science.”  In that paper they claimed to have discovered in the Hebrew text of the book of Genesis a statistically significant effect of the so-called “codes.”  According to WRR, the codes in question contained information about the names and dates of birth and death of many famous rabbis who lived thousands of years after the book of Genesis was written.  If WRR’s claim were true, its only explanation could be that the author of the book of Genesis knew the future.  This would be a powerful argument for the divine origin of that book.  The paper by WRR caused a prolonged and heated discussion.  As a result of a thorough analysis of WRR’s methodology, the overwhelming majority of experts in mathematical statistics concluded that the WRR’s data were obtained in a procedure which in many respects was contrary to the rules of a proper statistical analysis. In other words, the community of experts concluded that WRR’s claim stemmed from bad data.  Therefore, WRR’s work was rejected as bad science.  It was not, though, originally rejected as  pseudo-science, because they based their law on certain data to which they applied a statistical treatment (although the latter was partially flawed as well).  However, despite the scientific community’s almost unanimous rejection of WRR’s work as based on unreliable data, two of them (Witztum and Rips) as well as a small circle of their adherents, stubbornly continued to insist that they made a genuine discovery, thus converting their theory from bad science to pseudo-science.  Because the story of the alleged Bible code is quite educational with regard to how pseudo-science appears and persists, it is discussed at length in a separate article ( see http://members.cox.net/mkarep/ )

            On the other hand, claims such as those by Immanuel Velikovski have been rejected from the very beginning not as bad science but rather as pseudo-science because his claims did not stem from any data but only from arbitrary assumptions. In a book published in 1950 and titled “Worlds in Collision” Velikovski offered a whole bunch of wild theories allegedly explaining many mysteries that contemporary science could not explain. For example, one of his suggestions was that when, according to the biblical story, Yehoshua (Joshua) stopped the sun in the sky, the earth indeed stopped its rotation. Another hypothesis by Velikovski postulated a near-collision of Venus and Mars with Earth, thus allegedly explaining  numerous biblical miracles.  Of course, there are no data whatsoever which would serve as evidence for such theories, therefore they were justifiably relegated to pseudo-science. While Velikovski acquired substantial notoriety and was compared in non-scientific publications to Newton, Einstein, and other great scientists, no scientific magazine accepted his papers because they, while plainly contradicting Newtonian mechanics, did not offer a shred of evidence which would support his claims.  

b) Reproducibility and causality 

            To be legitimately useful in science,  data must meet several requirements, one of which is reproducibility.  Neither the reputation of the scientists claiming certain experimental results nor the impressive appearance of their data seemingly conforming to the strict requirements of a properly conducted experiment are sufficient for the data to be accepted as a contribution to science.  Data become a part of science only after they have been reproduced by other scientists.  Indeed, the demise of the cold fusion (at least as it stands for now) was due precisely to the fact that other groups of researchers could not reproduce the data claimed by Pons-Fleischman and Jones.  Likewise, the data claimed by Witztum and Rips could not be reproduced by other scientists,  which was the main reason their theory was rejected by the scientific community.

            The requirement of reproducibility is based on the assumption of causality as a universal law of nature.  This assumption presupposes that reproducing certain experimental conditions must necessarily lead to reproducing the outcome of the experiment.  This supposition is of course a philosophical principle to be accepted a priori. This principle has an ancient origin. It was already discussed in detail by Aristotle who introduced the concept of a hierarchy of four causes, the so-called material causes, the efficient causes, the formal causes, and the final causes.  In more recent times, the principle of causality known as the principle of determinism was formulated by Laplace, and has been universally accepted in science for at least two centuries.

            The advent of quantum mechanics seemed to have shattered that principle.  The very fact that many outstanding scientists were prepared to abolish a principle that had been a foundation of experimental science for so long testifies against the claims by  adherents of the so-called intelligent design theory who assert that scientists are dogmatically adhering to “icons” of metaphysical concepts rather than keeping open minds.

            However, the interpretation of quantum-mechanical effects as a breach of causality is by no means unavoidable.  Rather, it testifies to the insufficient understanding of submicroscopic processes wherein causality is allegedly absent.

            First of all, even in the case of quantum-mechanical events, causality is obviously present as long as the macroscopic manifestations of those events are observed.  Recall the  example of the alleged absence of causality, namely experiments with microscopic particles passing through slits in a partition.  If only one slit is open, on the screen behind the partition a diffuse image of the slit is observed.  If, though, two slits are open, the image on the screen is a set of fringes.  On macroscopic level, the outcome of the experiment is reliably predictable and fully consistent with the principle of causality.  Indeed, the image on the screen is always the same if only one slit is open, and can be easily reproduced at any time, any number of times, anywhere in the world.  If two slits are open, the picture on the screen is different from the case of only one opened slit, but it is reliably predictable and can be easily reproduced at any time anywhere in the world.  Hence, causality is present as long as the macroscopic effects of the experiment are the issue. A question about the validity of causality is raised when the details of the event on a microscopic level are considered.

If an electron is a particle, it cannot pass simultaneously through two slits.  Therefore we could expect that in the case of two opened slits, the image on the screen would be two separate diffuse images of slits rather than a whole set of fringes. Indeed, how can an electron passing a particular slit “know” whether the other slit is open or not?  However, the behavior of individual electrons is different, depending on whether the other slit is open or not.

Thus, the results of the described experiments are sometimes interpreted as indicating that the same microscopic conditions lead to different outcomes, depending on the variations in macroscopic conditions.  On the microscopic level, as this argument goes, the same conditions may result in different outcomes thus causality is absent. 

Quantum mechanics tells us that the explanation of the described phenomena is in that electrons are not particles like those we can see, touch, etc.  They are very different entities, which under certain conditions behave like waves rather than like particles.  It does not seem possible to visualize an electron, because it is unlike anything we can interact with by means of our senses in our macroscopic world.  Of course, it is well known that macroscopic waves (for example, sea waves) can very well pass several openings simultaneously.  Therefore, the two slit experiment is not really an indication of the absence of causality on a microscopic level, but rather an indication of our inability to visualize a subatomic particle.

Richard Feynman said that nobody understands quantum mechanics. To interpret this statement, we have first to agree what the term “understanding” means.  We can’t visualize an electron, and in this sense we may say that we don’t understand its behavior.  However, we can reasonably well describe the electron’s behavior using the mathematical apparatus of quantum mechanics, and can successfully predict many features of that behavior.  In that sense, we can say that we indeed do understand quantum mechanics.

Since we can’t visualize the intrinsic details of the electron’s behavior, we are actually uncertain of whether, on the microscopic level, its behavior is indeed non-deterministic, or we simply don’t know what causes this or that seemingly random path of the electron.  And since we are uncertain,  there is no reason to assert that the principle of causality breaks down at the microscopic level.

Review another situation wherein causality is often claimed to be absent. If we have a lump of a radioactive material, we can easily measure the rate of its atoms’ decay.  This rate is a constant for a specific material. For example, we can experimentally find that, in the course of a prolonged experiment, the fraction of the atoms which decayed every second was, on the average, 0.01.   Thus, if initially the lump consisted of N0 atoms, one second later the number of atoms of that material was N1< N0 ,  one more second later, the number of atoms was N2 < N1, etc, whereas the ratios N1/N0,   N2/N1, etc, averaged over the duration of the experiment, were 0.99.  This result is well reproducible, thus, again, on the macroscopic level causality seems to be intact.  However, if we discuss the phenomenon on microscopic level, the question arises of why at any particular moment of time a particular atom decays while other atoms do not.  All atoms within the lump of material are, from a macroscopic viewpoint, in the same situation.  There is a certain probability of an atom’s decay, which is exactly the same for every atom in that lump of material.  However, at every moment some atoms decay whereas others do not.  Therefore, the argument goes, causality does not seem to be at work in that process.  All atoms are in the same situation, with the same probability of decay, but only a certain fraction of them decays without any evident reason which would distinguish the decaying atoms from those still waiting their turn.

The described problem of the possible “hidden parameters” affecting microscopic processes is the subject of a rather heated discussion among scientists, wherein no consensus has so far been reached. For example, Feynman maintained that an explanation referring to “hidden parameters” is impossible, because nature itself “does not know in advance” which atom will decay at which moment, or which path a particular electron will choose at a given moment. With all due respect to the views of the brilliant  scientist Feynman, I find it hard to accept his position. I express here my personal view of the problem. 

In my view, when we assert that all atoms are in the same situation, this is true on macroscopic level. We have insufficient knowledge of the situation of each individual atom on the microscopic level.  The deficit of knowledge forces us to discuss the situation in probabilistic terms. Probability is a quantity whose cognitive value is determined by the amount of knowledge about the situation at hand. If we knew exactly the intrinsic details of the situation each atom is in, we could, instead of probabilities, discuss the matter in terms of certainties.  Rather than abolish the principle of causality, the much more natural interpretation of the described process of atoms’ decay is that there are definite causes of individual atoms decays at this or that moment of time, but we simply do not possess sufficient knowledge of the details of the process and therefore cannot predict which atom will decay at which moment.  Our lack of knowledge does not necessarily mean the decay of this or that particular atom occurs without a specific cause.

Actually, an analogous situation can be observed on a macroscopic level as well.  This happens, for example, in the so-called Keno game in Las Vegas casinos. In that game, players choose a set of several numbers out of a table containing, say, 49 numbers. The winning set is determined by a machine in which a pile of small balls is constantly mechanically shuffled in a shaking plastic vessel. Each ball bears a number. Balls move up and down the pile, constantly exchanging their positions, some of them pushed up by the rest of the balls, some others sinking deeper into the pile. Every couple of minutes, one of the balls is pushed high enough up from the pile and rolls out of the vessel.  The number on that ball becomes part of the winning set.  This is a chance process in which each ball has the same probability of rolling out at any given minute.  This situation, albeit on a macroscopic level, is rather similar to the atoms’ decay.  At any moment of time, a particular ball rolls out while the rest of the balls remain within the vessel, waiting for their turn to get out. Why a particular ball happens to get out, but not any other ball, despite all of them having the same probability of being pushed out?  We don’t have a sufficient knowledge of the intricate web of interactions between the balls in the constantly shuffled pile, therefore we resort to probabilistic approach.  However, if we knew precisely all the intricate details of the multiple encounters of those fifty or so balls in the shuffling machine, we would be able, instead of a probabilistic estimate, to predict precisely which ball will roll out and when.  The lack of detailed knowledge, while forcing a probabilistic approach, does not mean that the observed occurrence – the choice of particular ball through a chance procedure - did not have a definite cause making the fallout of that particular ball at that particular moment inevitable while for no other ball there existed a similar cause to get pushed out.

The atoms’ decay occurs on a microscopic level, and we have much less knowledge about the intrinsic conditions within and between the decaying atoms that we have in the case of the balls in a Keno game.  This by no means must be interpreted as an indication that the acts of decay have no definite causes.  The principle of causality can be preserved, even accounting for the quantum-mechanical effects. 

Whatever interpretation of causality is preferred, if there is no causality, there is no science. 

c) Ceteris Paribus

The next question to be discussed in relation to data as a foundation of science is the so-called principle of ceteris paribus.  This Latin expression means “everything else equal.”  The idea of that principle is as follows: Imagine that in a certain experiment the behavior of a quantity X is studied.  This quantity is affected by a number of factors, referred to as factors A, B, C, D, etc. Some of those factors may be known while some others may be not.  From a philosophical standpoint, everything in nature is interconnected and therefore the number of factors affecting X is immensely large, so we never can account for all of them.  However, an overwhelming majority of that multitude of factors have only a very minor effect on X and therefore, in practical terms, most of them are of no consequence for the study at hand.  Still, there is a set of factors whose effect on X cannot be ignored if we want to gain meaningful data from our study.

The principle of ceteris paribus is a prescription regarding how to conduct the study. Namely, according to that principle, the process of research must be divided into a number of independent series of experiments. In each series only one of the factors - either A, or B, or C, etc., has to be controllably changed while the rest of the factors must be kept unchanged. For example, if A is being changed, B, C, and D have to be kept constant, and the behavior of X has to be recorded for each value of A. Then another series of experiments has to be conducted wherein only factor B is being changed and X recorded while A, C, D, etc., are kept constant.

Of course, the application of the described procedure is based on the assumption that factors A, B, C, etc., are independent of each other, so that it is possible to change at will any one of those factors without causing all the rest of those factors to change simultaneously.

There are two problems inseparable from the above approach.  One is related to the errors of measurement and the other to the very core of the ceteris paribus principle, i.e. to the above mentioned assumption of the factors’ independence.   

            The problem of errors stems from the fact that no measurement can be absolutely precise and accurate ( the difference between precision and accuracy is defined in the theory of measurement and is beyond the scope of our discourse).  In particular, even if factors A, B, C, etc,. are independent of each other, it is impossible to guarantee that while one of them is being changed, the rest of them remain indeed absolutely constant, because the values of B, C, D, etc., while in principle they are supposed to remain constant, cannot be measured with absolute precision and accuracy.   Moreover, the values of factor A itself, which are supposed to be changed in a controlled way, are never guaranteed to have precisely the desired values, and, finally, the target of the study,  X itself cannot be measured with absolute precision and accuracy.

The problem with the principal foundation of the ceteris paribus methods stems from the fact that in many systems, factors A, B, C, etc., are in principle not independent of each other. Hence, as factor A is being changed,  it may be impossible in principle to suppress the simultaneous changes of factors B, C, D etc, which are supposed to remain constant.  Such systems are sometimes referred to as “large” or “complex,” or “diffuse,” or “poorly organized” systems and their study necessarily can be conducted only beyond the confines of ceteris paribus.

Philosophically speaking, all systems are complex, so that factors A, B, C, etc., are always to a certain extent interdependent. However, the extent of their interdependence may be insignificant to a degree which makes the assumption of ceteris paribus reasonably substantiated. 

For example, assume we study the behavior of ice/water in the vicinity of 273 K, which is, of course, 00 C or 320 F.  The factor we change is temperature.  Besides temperature, the behavior of ice/water is affected by a multitude of other factors.  Most of them are insignificant and can be ignored because they have only a very minor effect.  However, at least two factors have a pronounced effect – pressure and the purity of the water. In this case it is relatively easy to ensure the condition of ceteris paribus.  We can keep a sample of ice in a vessel where pressure is automatically controlled and kept at a predetermined level, regardless of changes in temperature.  Likewise, the purity of ice can be maintained at a constant level at all values of temperature investigated.  In such a study we find that ice containing, say, not more than 0.01% impurities, at a pressure of about 105 Pascal, melts at 273 K.  

d)  Errors 

            We have to distinguish between various types of errors.  Sometimes errors stem from a basically wrong approach to scientific inquiry. It may be due to an insufficient understanding of the subject by a researcher, which results in an improperly designed experiment. It may be due to a preconceived view or belief, or to a strong desire on the part of a researcher to confirm his preferred theory.  The researcher may subconsciously ignore results which are contrary to his expectations and inadvertently choose from the set of measured data only those which jibe with his/her expectations. Errors of these type, while by no means uncommon, should be viewed as pathological; they produce wrong data, which are a version of bad data, and are one of the constituents of bad science. An example of such bad data is the already discussed case of the alleged Bible codes.

            However, even in the most thoroughly designed experiments, errors inevitably occur despite the most strenuous effort on the part of a researcher to avoid or at least to minimize them. These honest errors are due to the inevitable imperfection of experimental setups and to the equally inevitable imperfection of the researcher’s performance.  Let us discuss these honest errors in experimental studies. 

            Until recently, a common notion in the theory of experiment was the distinction between systematic and random errors. While this distinction is certainly logically meaningful and often can be made quite rigorously, lately it has become evident that on a deeper level the demarcation between these two types of error is rather diffuse.  Moreover, the error can often be viewed as either systematic or random depending on the formulation of the problem.   For example, an error can be viewed as systematic if it occurs systematically provided we always conduct measurements in a specific laboratory A.  However, the same error can be viewed as random if we have a choice among many laboratories A, B, C, etc., and choose laboratory A, to conduct the measurement on a particular day, by chance.  The deeper analysis of that question, as well as of the distinction between precision and accuracy of measurements, which is a proper subject of the theory of experiment, is beyond the scope of this paper, whose subject is the general discussion of the nature of science.

The question of experimental errors is, though, related to the problem of causality.   If data are a legitimate component of science only if they are reproducible, the immediate question is: What are the legitimate boundaries of reproducibility?  It is an indisputable fact that the expression “precise data,” if interpreted literally, is an oxymoron.  If an experiment is repeated many times, in each run the measured data will be to a certain extent different.  There is no way to ensure the absolute reproducibility of data.  The conventional interpretation of that fact is not that it negates the principle of causality but rather that in each experiment there are uncontrollable factors which change despite the most strenuous effort on the part of the experimenter to keep all conditions of the experiment under control.

Therefore the principle of causality is not a direct product of the experimental evidence but a metaphysical principle borne out by the total body of science.  Science is based on the assumption of causality without which it simply would make no sense.

Therefore the inevitable errors of measurement are something science has to live with, making, though, a very strong effort to distill objective truth from the chaos of measured numbers.  Data which are a legitimate foundation of laws and theories are therefore the result of a complex process in which wheat is to be separated from chaff and regularities have to be extracted from the error-laden experimental numbers thus building a bridge from data to laws. The bridge from data to laws will be discussed in subsequent sections.

It seems appropriate to discuss at this juncture of the discourse two points related to the manner in which an experiment is conducted.

One of these points is calibration.  Calibration is a very powerful tool for ensuring the meaningfulness of an experiment, wherein the effects of many factors, which are extraneous to the subject of the study and whose contribution can only mask the phenomenon under study, are summarily neutralized in one step.  For example, when Cavendish performed his famous experiment in which he, in his words, “weighed the earth,” he needed to measure the attraction between lead balls.  The force of attraction to be measured was determined by the angle of rotation of a rod to whose ends the lead balls were attached.  The rod hung on a string, so that the elastic forces in the string created a torque which counterbalanced the forces of gravitational attraction. That torque was approximately proportional to the angle of the rod’s rotation. Instead of calculating the forces which would cause a certain angle of rotation, Cavendish simply calibrated his contraption by applying known forces to the rod and measuring the correspondent angle of rotation. He obtained the “calibration curve,” which incorporated all known and unknown factors that could affect the angle of rotation thus eliminating in one step – calibration – most of the sources of error.  Calibration in various forms is routinely used in scientific experiments whenever possible, thus substantially eliminating many sources of error. 

Another point to discuss is the difference between the discrete and the continual methods of recording experimental data. Each of these two methods has both advantages and shortcomings.

If a discrete recording is employed, the target of study - X - is measured for a set of discrete values of the controlled parameter A, while all other parameters affecting X are kept constant.  Factor A is given values A1, A 2 ,A 3  etc, and at each of those values of A, X is measured, yielding a set X1 , X2 , X3 ,  etc. 

As a shortcoming of such method, it is often asserted that there is never certainty regarding the values of X between the measured points.  However close the values of A at which X is measured are, there are always gaps between those values of X, and to fill those gaps the researcher has an infinitely large number of choices.  It is often expressed as an assertion that there are infinitely many curves which can be drawn through the same set of experimental points,  regardless of how close those points are to each other. (Of course, a more accurate assertion should be that the curves may be drawn through the margins of error surrounding each experimental point).

While the above assertion is certainly correct in an abstract philosophical way, it rarely has serious practical implications, as will be discussed in the section on the bridging hypotheses.

The mentioned shortcoming of the discrete method of experiment seems to be eliminated if a continual method is employed instead.  In this method, the controlled factor A (or often several controlled factors A, B, C, etc.) are being continually changed at a certain rate, while the values of the target of the study, X, are simultaneously and continually recorded.

An obvious advantage of this method is the substantial saving of time and effort necessary to accumulate information about the phenomenon under study.  The wide availability of sophisticated recording equipment in the 20th century made continual recording the favorite method of experimental work in physics, chemistry and other fields of study.

Since the method of continual recording generates continuous curves rather than sets of experimental points, it may seem to eliminate the arbitrariness involved in drawing curves through discrete experimental points.  Unfortunately, this supposed advantage of the continual method is illusory.  The illusion is related to the problem of a system’s response time to a perturbation, i.e. to an abrupt change of parameters.  If any parameter affecting the system’s behavior is abruptly changed, it sets in motion a process of the system’s transition to a new state, this process proceeding with a certain finite speed.  If a system was in a state wherein X had a certain value X1 while factor A had the value A1, and at some moment of time t1 parameter A is changed to A2,  the system’s measured property X does not instantly jump to a new value X2, but rather undergoes a transition to the new value which takes a cert