Deductive explanation of scientific laws

This paper is mainly a response to Charles Morgan's criticisms (this journal, pp. 511–25) of the author's model of the (formal aspects of) explanation. It is claimed in the paper that with two modifications and some additional specifications the model withstands Morgan's criticisms.

Part 1: Truth and Inquiry Any attempt to develop an account of truth within the context of the theory of argument must confront two sources of tension that characterize the core notion of truth in logic. The first is a commitment to some notion of correspondence, whether of the ordinary 'cat is on the mat variety' or the more technical notion, after Tarski, that rests on the availability of the truths of arithmetic independent of logical argument. The parallel intuition that true sentences are those that can be mapped onto an arithmetic model known to be true and the correlative notion that true sentences can be mapped onto existing states of affairs. The second grows out of the irrelevance of both of these images to much of the vast domain within which argument is crucial. There is no point in arguing about cats on mats since the truth is obvious, and arguments about mathematical truths have to do with their being provable, a standard much to high to apply to the overwhelming majority of things one can argue about. Ordinary argumentation most often deals with issues for which a corresponding reality is none to clear. Especially when it engages with inquiry, that is, an attempt to ascertain what is the case, and therefore confronts a situation in which what is the case is not available to correspond to our conjectures during an inquiry. Rather, what is the case appears as our conjectures are seen to bear fruit through the process itself.

RAIMO DEDUCTIVE TUOMELA EXPLANATION SCIENTIFIC OF LAWS* Since the appearance of Hempel’s and Oppenheim’s famous paper (1948) on the logic of scientific explanation a lively discussion has been going on concerning the logical criteria of adequacy for the explanation of singular events (states of affairs, processes, etc.). However, the theoretically much more important topic of the logic of the explanation of scientific laws has been neglected to a great extent in recent discussion. Notable exceptions to this claim are the works by Campbell (1920), Nagel (1961), Bunge (1967), and partly the recent articles by Ackermann (1965), Ackermann and Stenner (1966) and Omer (1970). Furthermore, in the debate between the representatives of the (or a) two-level picture of science (e.g. Hempel, Feigl) and the ‘omnitheoreticians’ (e.g. Feyerabend, Kuhn) the logical aspects of the explanation of (empirical) laws and theories by more developed theories has been treated to some extent. In any case the discussion on the explanation of laws has in general lacked the formal rigour and sophistication characteristic of the discussion of the explanation of singular events. It has often been argued that the notion of (scientific) explanation is a pragmatic notion which does not have a clear-cut unambiguous objective formal structure. Sometimes these arguments purport to show that the relation of explanation holding between an explanans and an explanandum ceases to hold under the substitution of logically equivalent explanantia or explananda. Or then some other commonly accepted formal invariance conditions are claimed to fail (cf. p. 386). Often this type of argumentation is invoked to show the intensional (nontruthfunctional) character of the notion of explanation, which thus - qua intensional - cannot be clarified by an explicate (or explicates) within standard extensional logic. However, as we do not know of any convincing arguments to prove the latter we will proceed under the working hypothesis JournaI of Phikophical Logic 1(1972) 369-392. All R&htx Reserved Copyright 0 1972 by D. Reidel Publishing Company, Dordrecht-Holland 370 RAIMO TUOMELA that for the time being it is meaningful to look for formal criteria that acceptable scientific explanations should satisfy. On the other hand, it seems clear to us that the enormously difficult and complex notion of explanation will have a number of different objectified and idealized explicates - and not only one. This partly follows from the contextual characteristics of ordinary common sense explanations from which the notion or rather the various notions of scientific explanation have been obtained by the processes of abstraction, idealization, schematization, and objectification, and so forth. Common sense explanation seems to be dependent on the properties of the explainer and the explainee as well as on many other aspects of the context. All this is part of what if often meant by referring to explanation as a pragmatic notion. There are a number of pragmatic features which cannot be abstracted away in a full-blown theory of scientific explanation without too great a philosophical loss. Lacking a developed adequate pragmatics for the philosophy of science - which would be needed - something like Kuhnian paradigms presently seem to provide the best pragmatic background frame for a theory of scientific explanation. Indeed, the general framework within which the problems in the present paper can be embedded is briefly this. Consider a fixed Kuhnian paradigm and within it a research programme incorporating a temporal sequence of theories T,, T, ,..., T,, T,,, ,.... We shall assume that the paradigm determines which (essentially general) statements qualify as scientific laws and which (sets of) statements represent acceptable theories. Furthermore, it will be assumed that the paradigm specifies the philosophical purposes of a scientific explanation. Until these conditions are fulfilled it does not seem meaningful to ask for a clarification of the logical and methodological features of scientific explanations. Within the present frame one can start asking various questions pertaining to the growth of science as seen from the point of view of the present paradigm. For instance, what kind of philosophically, methodologically, and logically interesting transitions from some T. to T,+, can be found? In the present paper we shall consider only one special type of scientific growth occurring within this scheme, viz. that where T,+, (deductively) explains and thus supersedes Tn. Here T,, can be an empirical law or theory and T,+, a more comprehensive theory often introducing new explanatory ideas. DEDUCTIVE EXPLANATION OF SCIENTIFIC LAWS 371 It has often been contended that in actual science a superseding theory T n+i almost never deductively implies T,. Rather T,+ i implies a theory or law T,,* which can be considered an approximation of T. and which is at least initially in experimental agreement with T.. (The explanation of Kepler’s laws by Newtonian mechanics is often mentioned as an example at this point.) Another way of putting this is to say that T,,, explains T, by correcting it at the same time. Undoubtedly this kind of correction process often takes place in science. Nevertheless, one may argue that this does not basically contradict the very idea of deductive explanation. For one may always argue that what one really aims at in this situation is an explanation and deeper understanding of the (extralinguistic) regularity allegedly described by T.. However, in the process of explanation it is found that Tn* rather than T, seems to be a correct description of this regularity. If this characterization of the situation is accepted there is nothing seriously wrong with deductive explanation, not at least on this point, even if deductive explanation alone does not then seem capable of accounting for transitions within actual scientific research programmes. (It should be remarked here that our notions of paradigm and research programme are to be understood in such a way that really revolutionary transitions always involve abandoning a paradigm and a research programme associated with it. Hence we do not have to take stand to questions of meaning variance, etc., which seem to involve a change of paradigm.) Apart from some comments in the final section we shall in this paper concentrate on the logical and methodological rather than on the epistemological and metaphysical aspects of scientific explanation. Our starting point will be to regard explanation as an informative or informationproviding argument. Unless considerably explicated and developed this idea is of course platitudinous. Explanation of laws can and has been regarded, for instance, as aiming at finding hidden causes for observable phenomena, or alternatively a deeper description of reality. Both of these basic types of explanation give us reasons for believing in the laws to be explained. At the same time it seems that an explanation (such as each of the above kinds) always carries a proper amount of relevant information concerning the explanandum and that this piece of information indeed constitutes the grounds for our believing in the explanandum. In 372 RAIMO TUOMELA this paper we shall mainly discuss information as measured by the logical (and especially quantificational) strength of statements. The sense or senses of information due to the introduction of new explanatory and perhaps ontologically more basic theoretical ideas will be emphasized, too. II 1. Our main task in this paper is to examine some of the logical and methodological conditions of adequacy to be satisfied by deductive explanations of laws. Among other things we shall make an attempt to give a set (or actually several alternative sets) of necessary and sufficient logical conditions for an explanatory relation to hold between a theory and an empirical (or observational) generalization. Among theattempts to give exact formal criteria of adequacy for deductive explanation, only those by Ackermann (1965), Ackermann and Stenner (1966), and Omer (1970) were also concerned with the explanation of general explananda in addition to singular ones. Ackerman& (1965) paper has been criticized and shown to be faulty in Ackermann and Stenner (1966). The latter paper again has been criticized by Omer (1970) and by Kgsbauer (reported in Stegmtiller, (1969). Omer also presented a model of explanation of his own based on general information-theoretic considerations. We shall start by a concise description of Omer’s model of explanation. It will be followed by a criticism. Finally we shall try to amend Omer’s model in some respects and discuss the implications of our amended and enriched models. The final amended model to be presented is intended to apply to the explanation of laws only. Whether or not it can easily be applied to the explanation of singular explananda we shall not discuss here in any detail. Hence there is no need to go through the whole welter of counterexamples to various explicates of the deductive-nomological model of explanation, as these counterexamples have generally been restricted to the explanation of singular explananda. Let us thus go on to a review of Omer’s model of explanation with whose main ideas we in general sympathize. Omer starts by applying to scientific explanation one of Grice’s principles concerning the informativeness of assertive discourse. This gives the first general criterion for explanation (see Omer, 1970, p. 419): DEDUCTIVE RI EXPLANATION SCIENTIFIC LAWS 373 In the explanans no sentence which is less informative in the topic should be given when it is possible to give a more informative one. Next Omer states as ‘an important RIS OF subcase’ of this general requirement: No sentence in the explanans should be of less informative content than the explanandum. The explanandum-sentence is considered by Omer to be a true statement ‘on the topic’. R, then indeed follows from RI with the additional premise that it is possible to give an explanans more informative than the explanandum. One important question left open by the above requirements is how to measure the information content of a sentence. Omer uses essentially the logical content explicate due to Carnap’s theory of semantic information. According to it, the content of a statement is identified with its logical strength (and formally measured as one minus the logical probability of the statement). However, following Omer, we do not below utilize the metrical cant-measure at all. The only information-theoretic notion concretely needed in Omer’s account is that of noncomparability of statements. We shall say that two statements. p and q are noncomparable with respect to their information content if and only if not l-p 2 q and not F q up. It is important to notice what is involved in this notion of noncomparability. Two statements are noncomparable exactly when each of them includes some content-elements not included in the other one. Thus it is quite possible that they contain common content-elements. Indeed, unless this were the case within deductive explanation, the requirements concerning the noncomparability of the elements of the explanans and the explanandum to be considered below could not be reconciled with the idea of the explanans logically implying the explanandum. (Consider e.g. Craig’s interpolation theorem from which it follows that there has to be some amount of common content between two statements one of which implies the other.) As a side remark it can be pointed out here that within nondeductive explanation, measures of transmitted information (the inf-, cant- and 374 RAIMO TUOMBLA entropy-measures) have been used to measure the goodness of an explanation. The more an explanans transmits or conveys information concerning the explanandum, the better is the explanation. However, in this sense all deductive explanations are maximally good. For if the explanans logically implies the explanandum, our coming to know the explanans completely relieves us of our agnosticism or uncertainty (these notions understood in the sense of the semantic theory of information) concerning the explanandum. Thus within deductive explanation this kind of information-theoretic explicate of explanation is of no help. Let us now go back to Omer’s general requirements for deductive explanation. In terms of logical content we now have (Omer, 1970, p. 422): In the explanans no sentence in the topic which is of less logical content should be given when it is possible to give a sentence with more logical content. No sentence in the explanans should be of less logical content than the explanandum. The content of the requirement R; (over and above that of R;,) can be illustrated by the following example. Suppose the statements p v r and p v -r are proposed to be included in an explanans. Now jointly they are equivalent to p but both of them are individually of less logical content than p as p implies them. Thus, according to R; instead of p v r and p v -r we should use p in the explanans. Next consider the following example where T is a theory and L a law: T TDL L (Notice that here the second premise is logically equivalent with -T v L, with clearly follows logically from L.) The requirement R;, clearly blocks, among other things, this kind of trivial deductions from being explanations. But it does not block the following paradoxical case given in Hempel and Oppenheim (1948) p. 273) : B K ii DEDUCTIVE EXPLANATION OF SCIENTIFIC LAWS 375 In this self-explanation K could stand for Kepler’s laws and B for Boyle’s law. Apparently RiS has to be strengthened. Omer also comes to this conclusion, although on the basis of examples with singular explananda. Omer now proposes (Omer, 1970, p. 423): The explanandum should be noncomparable sentences of the explanans. with any of the But what is meant by a sentence here? Is the conjunction of all the sentences occurring in the explanans a sentence? If so, then certainly R& is not acceptable as it is clearly in blatant contradiction with the basic requirement that the explanans should logically imply the explanandum. Apparently some clarification is needed. This may be what Omer thinks, too, as he proceeds to give a more technical equivalent to RI;,, even if he does not motivate his move. Before giving the final version of the noncomparability requirement we shall have to introduce some technical apparatus. Let us consider some scientific language 9 with specified logical and extralogical vocabulary, well formed formulas, logical axioms and rules of inference. In the manner of Ackermann and Stenner (1966, p. 169) we now define the notions of a sequence of truth functional components and a set of ultimate sentential conjuncts of a given formula (sentence) T. A sequence of statemental well formed formulas (W,, WZ, . . . . W,,) of 9 is a sequence of truth functional components of T if and only if T may be built up from the sequence by the formation rules of $P, such that each member of the sequence is used exactly once in the application of the rules in question. The WI’s are thus to be construed as tokens. Thus, for instance, ((x) F(x), (x) F(x)) and not ((x) F(x)) is a sequence of truth functional components of the formula (x) F(X) & (x) F(x). The formation rules of 2 naturally have to be specified in order to see the exact meaning of the notion of a sequence of truth functional components of a theory finitely axiomatized by a sentence T. A set of ultimate sentential conjuncts Tc of a sentence T is any set whose members are the well formed formulas of a longest sequence (W,, W,, . . ., W,> of truth functional components of T such that T and W1 & WZ BE 8~. . . & W, are logically equivalent. If T is a set of sentences then the set Tc of ultimate conjuncts of T is the union of the sets of ultimate sentential conjuncts of each member of T. We may notice here that although by 376 RAIMO TUOMELA definition the Tc-sets of two logically equivalent theories are logically equivalent they need not be the same. Now we are ready to state the final version of the noncomparability requirement for a Tc of a theory T constituting an explanans (cf. Omer, 1970, p. 424): For any Tci in Tc, Tc, is noncomparable dum. with the explanan- The requirement R;I:. seems to avoid at least the paradoxical examples of self-explanation concerned with singular explananda to be found in the literature. Furthermore, it seems to do the same in the case of the explanation of laws as well (cf. the example by Hempel and Oppenheim given above). Next we shall state some necessary conditions for an explicate of deduo tive scientific explanation which also Omer accepts (although in a somewhat different formulation). Let us write E(L, T) for ‘L is potentially explained by T’. Here T and L are statements of our scientific language _Lp.(Sometimes we shall also denote by T and L sets of statements, which does not make any essential difference as long as these sets are finite.) As above, Tc is a set of ultimate sentential conjuncts of T. For simplicity, in the sequel we shall be concerned with E(L, Tc) rather than E(L, T). (Recall that Tc and T are logically equivalent. When speaking of E (L, Tc) Tc is to be understood as the conjunction of its members rather than a set, which makes no essential difference here.) Now we are ready to give some conditions for an explanatory relation E(L, Tc) to hold between Tc and L: E(L, Tc) satisfies the logical criteria of adequacy for deductive explanation only if (1) (2) (3) (4) {L, Tc) is consistent; Tc!-L; some Tci in Tc is a universal law; for any Tci in Tc, TCi is noncomparable with L. In view of what was said in Section I of this paper, (1) seems acceptable. Requirement (2) is obvious. As to (3) we shall here require of a law only the syntactic feature that it essentially contains at least one universal DEDUCTIVE EXPLANATION OF SCIENTIFIC LAWS 377 quantifier, and no fuller treatment of lawlikeness, beyond giving this necessary condition, is attempted at in this paper. Requirement (4) is just R;:. Notice that in this model L can be either a singular or a general statement. Can the conjunction of the conditions (l)-(4) be regarded as a sufficient condition of explanation as well? It seems that the resulting model of explanation is too wide. For if we have found an explanans for a statement, say G(a), within this model, the same explanans will also be allowed to be an explanans for all the disjunctions in which G(u) occurs as a disjunct and which do not contradict condition (4). In accordance with Omer we are here inclined to consider this consequence unacceptable (or at least undesirable). How can we exclude such counterintuitive explanations? Let us first consider Omer’s own argumentation. (See Omer, 1970, p. 425) First he argues that the phrase ‘in the topic’ of the original R, secures that all additional pieces of information are relevant information. This seems acceptable, though vague. But then he claims that RI is to be interpreted or qualified so as to rule out all redundant information (i.e. redundant over and above what is needed for T to imply L). This requirement, however, is in clear contradiction with R; (supposedly equivalent with RI). Indeed, as RI clearly seems to call for maximally informative explanantia, Omer on the contrary is here led to look - ceteris paribus - for minimally informative explanantia. This is seen from the following, which - apart from an index correction - is Omer’s last condition (Omer, 1970), p. 426): (5) It is not possible to find sentences Si, . . . . S, (r > 1) such that for some TCj, . . ., Tc, (n 2 1) : TCj &.a. & TC”tSi &.a. 8~ S, not S, & . . . & S,~TCj &... &Tc, Tc,tL where Tc, is the result of the replacement of TCj, ..*, Tc, by Si, ..., S, in Tc. Omer now claims that the conditions (l)-(5) are necessary and sufficient for deductive explanation. Condition (5) requires that within Omer’s model of explanation we 378 RAIMO TUOMELA choose as the explanans a minimally strong set of statements. To explain something is then to give a specific (minimal) amount of (proper) information. Omer claims with examples that his model is free from the paradoxes found against the previous models of deductive explanation. (See Omer, 1970, pp. 422f., 427-432 for the discussion of the arguments by Hempel and Oppenheim, 1948; Eberle et al., 1961; Kaplan, 1961; Kim 1963; Ackermann, 1965; Ackermann and Stenner, 1966.) We shall not comment upon them here directly, partly because the paradoxes deal only with singular explananda. Furthermore, we shall below show that there are so obvious flaws in Omer’s model that they lead to the rejection of it in its present form. 2. Let us now proceed to a criticism of Omer’s model which will lead us to two amended versions. Let us consider the following paradigm case for the explanation of singular explananda: (4 (F(x) = G(x)) F(a) G(a) Does it satisfy Omer’s conditions? Somewhat it surprisingly it does not, for it does not satisfy condition (5). Contrary to Omer’s claim that conformity to the above condition (5) assures the relevance of the law more than ever, our explanans would here become simply (G(a)}. But this explanans obviously contradicts both condition (3) and (4). In other words, Omer’s model of explanation is clearly inconsistent. An apparent and trivial flaw in Omer’s model seems to be this. A requirement to the effect that the statements S 1,. . ., S, should be (universal) laws whenever the statements TCj, . . . , Tc, are, seems to be missing from condition (5). But this correction does not yet suffice to establish a consistent model of explanation. For the amended condition (5) will still always lead to a contradiction in those cases where the explanandum is a law statement. That this is the case is seen by taking simply the explanandum as the only Si. This makes the amended condition (5) true but it falsifies (4). To block this we have to insert a conditional consistency requirement. Thus we propose the following amendation of (5): DEDUCTIVE (5’) EXPLANATION OF SCIENTIFIC LAWS 379 It is not possible without contradicting any of the previous conditions to find sentences Si, . .., S, (r> 1) at least some of which are essentially universal such that for some TCj, ,.. Tc,, (n>l): Tcj &... & Tc,l-S, &... &S, not Si &... & S,FTCj &... & Tc, Tc,FL where Tc, is the result of the replacement of TCj, . . . . Tc, by Si, . . ., S, in Tc. The model satisfying (l), (2), (3), (4), and (5’) (call it Omer’s corrected model) really seems to be a ‘minimal law’ model according to which an explanation provides a minimal amount of specific information. To illustrate, if we have a singular explanandum G(a) v H(u) and the initial condition statement F(a), then we have to choose in the explanans the law (x) (F(X) 3 G(x) v H(x)) and not, say, (x) (F(X) 3 F(x)). The latter law is stronger than the former, and it, together with F(u), suffices to imply G(u)v H(u). This model also seems to be a possible candidate for the deductive explanation of scientific laws. As we noticed, neither the model (l)-(5) nor the present version (l)-(5’) satisfies the general information theoretic principle R;; and they satisfy the original version Rr only if it is interpreted in a rather queer way which is indeed the opposite of our (and Omer’s) original reading of it. We suggest that the present model should rather be considered as satisfying something like the following strong modification of R1 (actually compatible with what Omer says on p. 425 of his paper): RF The explanans should contain redundant information is it relevant. only if The notions of information and redundant information are to be understood in the previous information theoretic sense in which information content and logical strength go together. The problem left then is what is to be understood by relevancy. One possibility is that formalized by (5’). But it is not the only possible explicate. Consider the following two proposed explanations for the same singular explanandum : 380 RAIMO (4 (*I (Y) (F(x, F(a, TUOMELA Y) = G(x, YN Tc: Tclf b) G(a, b) (Y) @(a, Y) = G(a, YN (**I Tc:* Tcf* F(a, b) W, b) The more general proposal (*) is not accepted as an explanation by Omer’s corrected model, but (**) is. Now one may argue that there is a sense of explanation in which (*) counts as an explanation of G(a, b), perhaps even as a much better one than (**). In this sense of explanation the information provided by the additional quantificational generality of the law Tc: over and above the generality of the law Tc:* may be considered relevant redundant information (in the sense of RF). (Cf. e.g. Hintikka, 1970, for an argument concerning the increase of ‘surface’ information with quantificational depth, and cf. Hintikka and Tuomela, 1970, for an argument concerning how observational information content increases with quantificational depth due to the introduction of new explanatory concepts.) Let us thus accept that there is a sense of information in which the increase in quantificational generality provides relevant information. How can this observation be taken into account in a more technical fashion? We propose the following. Let us write p iq when q is deducible from p by using sentential logic only. Now (5’) is to be replaced by: (5”) It is not possible, without contradicting any of the previous conditions for explanation, to find sentences S i, ...r S, (r> 1) at least some of which are essentially universal such that for some Tcj, . . . . Tc, (n>, 1): Tc, &... & TC,~Si not Si &... & S,tTCj &... 8r S, &... & Tc, Tc,tL where Tc, is the result of the replacement of TCj, . . . , Tc, by Si, . . .. S, in Tc. The resulting model of deductive explanation thus explicitly defines the DEDUCTIVE EXPLANATION OF SCIENTIFIC LAWS 381 relation E(L, Tc) by the conditions (I), (2), (3), (4), and (5”). This model accepts both (*) and (**) as explanations. In principle the explananda L can again be either singular or general. To get a simple example with a general explanandum we can change (**) by adding a third argument to the two-place predicates F and G and by universally quantifying over the new argument (say z). The resulting explanation (call it (“)) qualifies in our present model but not in Omer’s corrected model. The same holds true for the analogue (‘) of (*). The only explanatory argument which in the present frame would be accepted both by our present model and Omer’s corrected model is one in which only z is quantified. (Call this explanation (“).) We propose that the conditions (1)-(5”) are to be accepted as necessary and sufficient for determining the logical structure of the explanation of (at least) singular explananda. (Below we shall still require something more of good explanations of scientific ~uws.) It should be noticed that the conditions (1)-(5”) are not restricted to any specified formal language, as most of the previous models of explanation have been. Therefore it should be widely applicable. In addition, we have not made any philosophical restrictions as to the substantial content of explanations, but have restricted our analysis to the purely formal aspects of explanation. 3. At this point we shall recall one of our claims from Section I of this paper. There we mentioned that particularly in the explanation of laws one introduces theories which contain new explanatory concepts. To make justice to this we shall assume that we are dealing with an interpreted scientific language whose extra-logical vocabulary is dichotomized into a set 1 of observational (or descriptive, or old) predicates and a set p of theoretical (or explanatory, or new) predicates. The only thing we essentially require of this dichotomy here is that the members of p cannot in aZZcontexts be used in direct reporting. Therefore they have to be connected by correspondence rules to the members of L, at least in some contexts, to be methodologically useful (and perhaps they are not even fully intelligible without that). We shall below assume that our explanandum laws contain only members of A, and that the explanantia always contain at least some formulas in which members of p occur. In the explanation of a law L(A) the explanans will be a theory axiomatized by a sentence T(1 u p) (or, alternatively, a Tc(3, u ,u) correspon- 382 RAIMO TUOMELA ding to T(3,up)). One can now distinguish the following three parts in the theory: (1) the Campbellian part (or core theory) which consists of those statements of T(Jup) which are solely in the vocabulary ~1; (2) the observational content of T(1 u ,u) which consists of those statements of T(nup) which contain members of 1 only; (3) the correspondence rules of T(J u p) which consists of all the other statements of T(Jup) and are in the vocabulary Iup. Let us call the deductively closed sets comprising the components (l), (2), and (3) by H(P), WI, and C(n u p), respectively. For first-order languages, the decomposition of a sentence T(1up) can be accomplished by means of, for instance, the reduction operation defined in Hintikka and Tuomela (1970). More generally, Craig’s general replacement method shows how this can be done for deductive systems in the case of almost any formal system (cf. Craig, 1956). (We shall below denote by Cn(T(lup)) (= H(,u)uO(~)UC(JU~)) the deductive closure of T(lup), and by the Craigian p-reduct of a deductive system we shall mean the recursively axiomatizable part of Cn(TLup)) solely in the vocabulary PJ When we below investigate how a theory T(rZup) explains a law L(A) this problem can thus also be considered as how a core theory Hb) conjoined with some suitable correspondence rules C(n up) explains a law L(A) belonging to O(A). As we have repeatedly pointed out, there are many ways to explicate the notion of explanation. Furthermore, one may argue about which properties of the relation E(L, Tc) are analytic and which are synthetic. We shall not take a definite standpoint to this here but, rather, we shall liberally try to take into account all the logically and methodologically important features of E(L, Tc). In the model of explanation satisfying (l), (2), (3), (4), and (5”) a number of important aspects of explanation (e.g. proper informativeness, prohibition of completely circular explanation, etc.) are already incorporated. But there are still at least two logically explicable properties to be considered (nonlogical ones will be discussed later). These are the requirements of observational creativity of the explanans over the explanandum law and the theoretical noncreativity of the explanans with respect to its Campbellian part. DEDUCTIVE EXPLANATION OF SCIENTIFIC LAWS 383 As to the first of these let us quote Nagel. “At least one of the premises in the explanation of a given law will meet the following requirements: when conjoined with suitable additional assumptions the premise should be capable of explaining other laws than the given one; on the other hand it should not in turn be possible to explain the premise with the help of the given law even when those additional assumptions are adjoined to the law” (Nagel, 1961), p. 36). We shall accept a strengthened form of this condition, and its formalized version will appear below as condition (6) among our additional desiderata for explanation. The theoretical noncreativity condition has been formulated by Nagel as follows: “the introduction of new correspondence rules does not change either the formal structure or the intended meaning of the theory, though new rules may enlarge the theory’s range of application” (Nagel, 1961, p. 102. Also see Campbell, 1920, p. 133). In Hintikka (1972) this condition was dubbed the theoretical adhockery condition, as it requires that no ad hoc theoretical principles, but only (or at most) new correspondence rules, should be added when applying the theory to explain new laws. However, this condition cannot be applied unqualified to all explanation as clearly the core theory H(p) must be allowed to grow if this growth takes place ‘in a fruitful way’. We shall not include the theoretical noncreativity (or adhockery) requirement among the conditions of our model proper but as a separate condition ((7) below) to be applied when the above qualification can be accepted. (After this accounting for this requirement, all of Nagel’s logical conditions of adequacy for the explanation of laws are taken case of by our conditions; cf. Nagel, 1961, pp. 33 ff, and p. 102.) Now our final model of explanation of scientific laws can be stated in full as follows. As before, let thus T be a statement. Tc a set of ultimate sentential components of T (or actually a conjunction of components in the context E(L, Tc)), and L a statement to be explained. Then we say that the relation E(L, Tc) satisfies the ZogicaZconditions of adequacy for the deductive explanation of scientific laws if and only if (1) (2) (3’) {L, Tc} is consistent; TcbL; Tc contains universal formulas at least some of which contain members of ~1and L contains only general statements in A; 384 (4) (5”) RAIMO TUOMELA for any TCi in Tc, Tc, is noncomparable with L; it is not possible, without contradicting any of the previous conditions for explanation, to find sentences Si, . . ., S, (r > 1) at least some of which are essentially universal such that for some TCj,...,TCn(n>l): TCj &... &Tc”~S~ &... &S, not Si &..v & S,FTCj &... & Tc~ Tc,l-L where Tc, is the result of the replacement of TCj, . . ., Tc, by S*,..., S, in Tc. As an additional desideratum we consider: (6) If Cn(Tc)=H(p)uO(A)uC(Aup), then there is a L’#L(LEO@), L’EO’(2)) such that not l- LD L’ and such that E(L’, Tc’) for some Tc’ (of a T’) for which Cn(Tc’)= =H’QuO’(A) uC’(Jup) and HcH’; in addition, not {L}u(H’-H)u(O’ -O)u(C’-C)t-H. (I owe Prof. W. Stegmtiller my present formulation of the last part of condition (6).) The optional theoretical adhockery condition, to be applied if no fruitful growth is present, is: (7) If H@) is an explanatory-core theory, and if, for some L, E(L, Tc) with Cn(Tc)=H,(~)uO(~)uC(~u&), then the difference H-H, = 0. (H, is the Craigian p-reduct of Cn (Tc)). It can be noticed that none of the above conditions is restricted to a specific formal system (such as first-order predicate calculus). Therefore they should have a wide area of application. Let us call a model of deductive explanation of laws for which (l), (2), (3’), (4), and (5”) hold the DEL-model. If it is not required that theoretical concepts be essential for deductive explanation our old (3) can be substituted for (3’) in the above model (and the appropriate notational changes can be made elsewhere) and we get the model discussed in subsection 11.2. Call it the weak DEL-model. (An analogous remark holds for the situation where one is not willing to dichotomize the extralogical concepts in the manner assumed here.) Earlier we described three simple law-explanations (‘), (“), and (“‘) DEDUCTIVE EXPLANATION OF SCIENTIFIC LAWS 385 which all qualified as explanations within the weak DEL-model. If, in addition, the three-place predicate F is theoretical, all these explanations are accepted by the DEGmodel, too. Explanations (‘) and (“) carry more relevant and deeper (quantificational) information than (“‘). One consequence of this is that they in addition satisfy (6) which (“‘) fails to satisfy. (Notice that the fate of condition (7) cannot be determined before fixing an initial core theory H@).) As an additional example, consider the following simple chain explanation : (4 (G, (4 = F(x)) (4 (F(x) = Gz(4) (4 (GI (4 = G, (4) where Iz= { G1, G,} and p = {F}. Here G, and G2 might be observable symptoms of a virus F.) This explanation is accepted by the DEL-model. But its shallowness is reflected by the fact that it does not satisfy (6). As a scientific illustration of the DEL-model we can consider, for instance, the explanation of the well-known probability matching law. This law is explained by the linear learning theory of Bush and Mosteller. But it is also explainable by the general stimulus sampling theory of Suppes and Estes which in turn explains the linear learning theory as its special case. Let us next state some formal properties of E(L, Tc) in the DEL-model. Most of them will also apply to the weak DEL-model, as can easily be seen. First, it is easy to verify that in the DEL-model (a) (b> (c) E(L, Tc) is asymmetric E(L, Tc) is irreflexive E(L, Tc) is not transitive. In the weak DEL-model, E(L, Tc) is easily seen to be irreflexive and neither symmetric nor transitive. To see that it is not transitive assume that E(L, Tc) and E(Tc, Tc’) for some L, Tc, Tc’. Then we may possibly have I-Tc’~zTc,, though not I-Tc] DL, for some TC’i ETc’, Tcj ETC. Thus even if not bTcj 3 L it may happen that tTc’* 2 L. Hence condition (4) fails to hold. But still E(L, Tc) is not intransitive in the weak DEL-model. 386 RAIMO TUOMELA Next we observe that: (4 (4 (0 If E(L, Tc) and if for some Tc’, t Tc’xTc (assuming not ~Tc’ I>Tc), then E(L, Tc’), provided every Tci~Tc’ is noncomparable with L. If E(L, Tc) and if for some T’ such that k TET’, T and T’ possess identical sets of ultimate sentential components (i.e. TccTc’), then E(L, Tc’). If E(L, Tc) and for some L, ~LEL’, then E(L’, Tc), provided that, for all Tc, in Tc, Tc, and L’ are noncomparable. It should be noticed, however, that a DEL-explanation is not invariant with respect to the substitution of logically equivalent explanantia or explananda. This lack of linguistic invariance may be taken to reflect the pragmatic nature of our notion of explanation. How you state your explanatory argument makes a difference. For some purposes it may be adequate to define a notion of indirect explanation. We may say that if tT = T’, E(L, Tc) for some Tc of T and Tc#Tc’ (for all Tc’-sets of T’), then T’ explains L indirectly. The properties (d), (e), (f) and (g) below hold for the weak DEL-model as well. It is easily seen that the explanatory relations of the DEL-model and of the weak DEL-model do notsatisfy the following debated subset and conjunction properties : If E(L, Tc) and L’ is a conjunct of L, then E(L’, Tc). (Subset property) If E(L, TC) for all L, of an L=L, &... & Li & . . . &L”, then E(L, Tc). (Conjunction property) This means that the explanandum can be neither weakened nor strengthened without the explanation possibly ceasing to be valid. We have already noticed that in the DEGmodel a law L may have explanantia differing in their quantificational strength (and information content). The DEGmodel does require ‘minimal’ explanation (in Omer’s sense) only within explanans-candidates of the same quantificational strength. Thus we have a tree-structured hierarchy of explanations such that on each level of this quantificational hierarchythe minimalityprinciple DEDUCTIVE EXPLANATION OF SCIENTIFIC LAWS 387 is satisfied.We can finally notice that the branches of the explanation-tree for an L may be created in diverse logically imcompatible ways: (g) If E(L, Tc) and E(L, Tc’), then it is possible that Tc and Tc’ (and hence the corresponding theories T and T’) are mutually logically incompatible. The explanantia in one and the same branch are of course logically compatible, as the deeper explanantia quantification-theoretically imply the shallower explanantia for L. III We shall conclude this paper by making a few sketchy philosophical and methodological comments concerning the above results and by briefly discussing a few ideal nonlogical conditions of adequacy for the deductive explanation of scientific 1aws.l The various definitions for deductive explanation given in the previous section were only concerned with potential explanation. To make a potential explanation actual some requirements concerning the truth and/or the evidential support of the explanandum and its explanans are needed. One plausible candidate, which we shall adopt, is simply: If E(L, Tc) then L and Tc should be accepted as true. 03) The term ‘true’ of course means something like ‘true in the actual world or ‘true in intended model(s)‘. The notion of acceptance is a notoriously tricky one, and here we shall not attempt to give an analysis of it. (It may be the case that most actually accepted scientific theories are false and even known to be false, but this does not speak against accepting (8) as an ideal for good explanatory theories.) There is, however, one familiar requirement related to how we accept our explanans as true that we want to make more explicit. This is a requirement related to condition (6) and prohibiting what might be called ‘confirmational adhockery’ (cf. Nagel (1961), p. 43). We shall adopt the following weak version of it for our DEL-model: If Cn(Tc(lZup))=H&)uO(ll)uC(IZup) and if E(L, Tc), (9) then Hh) is - through some set C’(A u ,u) of correspondence rules - adequately supported by evidence based on data other than the observational data upon which the acceptance of the explanandum law L is based. 388 RAIMO TUOMELA (Notice that (9) still allows that O(1)= {L}, although it prohibits that 0’ (A) = {L}.) The conditions (8) and (9) are necessary conditions to make the DEL-model for potential deductive explanation a model for actual (in the above sense) deductive explanation as well. As we have seen, the DEL-model (together with conditions (6) and (7)) seems to be capable of giving an adequate account of many troublesome issues in the theory of explanation while it correctly emphasizes some features considered essential for good deductive explanations of scientific laws. Based on our discussion in Section I we found condition (1) - the consistency condition - of the DEL-model acceptable. Condition (2) is obviously also acceptable. Condition (3’) will be commented upon below. Conditions (4) and (5”) were arrived at from general information-theoretic considerations. (4) was found to take care of circular explanations of the most vicious kind. (5”) is supposed to guarantee that an explanation provides a proper amount of relevant (information-theoretically) redundant information. Condition (6) refers to the dynamics of theorizing. According to (6), it is essential that an explanatory theory be at least capable of growth. Condition (9) then actually realizes this possibility by requiring the growth of theory in the evidential sense. It can be noticed here that our condition (6) guarantes against ad hoc,-type and condition (9) against ad hoc,-type of the three kinds of ad hoc-explanations in Lakatos (1970). In addition, our theoretical adhockery condition (7) seems to be a special kind of Lakatos’ ad hoc,-explanations. Condition (3’) serves to indicate our emphasis on the importance of new theoretical ideas (concepts). As Campbell has frequently emphasized, an explanatory theory shall ‘add to our ideas and the ideas which it adds shall be acceptable’ (Campbell (1921), p. 83). Even if our condition (3’) as such says nothing about the difficult and debated notion of ‘acceptability’ of the concepts in ~1,still it - and the DEL-model as a whole - refers to interpretative rather than phenomenological theorizing (cf. Bunge, 1968). The explanation of observational laws by theories introducing new ideas (hidden causes, etc.) is often regarded as ontologically and epistemologitally deeper, and it has been considered to add to our understanding of the world more than mere phenomenological explanation (explaining laws with the help of theories where no new kind of concepts occur). As we noticed, this kind of penetration into deeper levels of reality was in DEDUCTIVE EXPLANATION OF SCIENTIFIC LAWS 389 part and in addition made possible by the hierarchical organization (tree-structure) of explanations accomplished by condition (5”). (In a model of explanation based only on minimal theory explanations - such as Omer’s corrected model - this would not have been possible.) The additional and optional desideratum - our condition (7) - was introduced to avoid theoretical adhockery. This condition was above also called the theoretical noncreativeness condition as it requires that new correspondence rules added to the core theory to make it explain some new observational laws should be noncreative with respect to their A-content. Hence one way to satisfy this condition is to give noncreative definitions of the observational concepts in terms of the theoretical concepts of the full theory. For instance, explicit definitions of the observables in terms of the theoreticals - rather than vice versa - would be such definitions (cf. Ramsey, 1931; Sellars, 1961; Carnap, 1966; and Hintikka 1972). In this case one can even theoretically eliminate observational or L-concepts in terms of theoretical ones. Theoretical explanation (such as reductive explanation) seen as redescription or reinterpretation can be considered to require ultimately just this. Applied to intertheoretic reductive explanation this is seen to imply that every case of ‘principle-reduction’ can be performed at least as well by accomplishing a ‘concept-reduction’ simultaneously (see Hempel, 1969, for these notions). Interpretative theorizing thus seems ideally to require explicit definitions of observables in terms of theoreticals while phenomenological theorizing seems to imply the reverse at least asymptotically. Nagel has required that in the explanation of laws the explaining theory should be more general than the explanandum law (Nagel, 1961, pp. 37-8). As the generality in question is denied to be proportional to logical strength, the following suggestion seems to be at hand. A sufficient, though perhaps not necessary, condition for arriving at Nagel’s generality would be to define explicitly the concepts occurring in the law in terms of theoreticals such that each definition is a conjunction of at least twotheoreticals. For then one can in an obvious sense say that observable concepts are applications of theoretical concepts and that every theoretical concept is more general and broader in its scope than each observational concept in whose definiens it occurs. Our final remark is concerned with the relation between explanation and various notions of inductive support. It seems that between such 390 RAIMO TUOMBLA notions as explanation, generalization, cormrmation, corroboration, etc., there are many connected features. Indeed, it might be required that theories of explanation and inductive support should always be developed simultaneously. One argument for this arises from an investigation by Smokler (1968). He pointed out that there are two essentially different notions of inductive support, one corresponding to ‘abductive inference’ and the other to ‘enumerative induction’) the latter includes eliminative induction). Within abductive inference it is true to say that evidence supports a hypothesis if the hypothesis explains the evidence. Of the familiar principles of qualitative confirmation abductive inference satisfies the conditions of consistency, nonuuiversalizability, converse entailment, converse consequence and equivalence but it does not satisfy the conditions of entailment and special consequence. On the other hand, enumerative induction makes true the conditions of consistency, nonuniversalizability, entailment, special consequence, and equivalence but it does not make true converse entailment nor converse consequence condition. (See e.g. Smokler, 1968, for an exact formulation of these conditions.) As the reader can easily verify, apart from minor technicalities, the notion of theoretical explanation that our DEL-model makes precise (clearly) belongs to abductive inference and thus not to inductive enumeration. (One might now also argue that our DEL-model also explicates a notion of corroboration peculiar to abductive inference.) Corresponding to Smokler’s notion of enumerative induction we of course also get an implicit definition of a converse relation which might be dubbed inductive systematization or inductive generalization. The picture that emerges from this is that we have a dichotomy of theorizing both within explanation (interpretive versus phenomenological theorizing) and inductive support (abductive inference versus enumerative induction). Interpretive or theoretical explanation demonstrably goes together with abductive inference (corroboration). It can als be argued that some important forms of phenomenological theorizing go together with the above wide notion of enumerative induction and with a high probability-criterion of confirmation (cf. Niiniluoto and Tuomela, 1972; and Tuomela, 1972). McGill University, Montreal2 DEDUCTIVE EXPLANATION OF SCIENTIFIC LAWS 391 NOTES * I want to thank Mrs. Marja-Liisa Kakkuri-Ketonen, Mr. Seppo Miettinen, and Mr. Ilkka Niiniluoto for suggestionsand criticismsconcerning an earlier version of this paper. 1 For a fuller discussionof the methodological and philosophical applications of the DEL-model of explanation seeTuomela (1972), Chapters VII and VIII. a On leave of absencefrom the University of Helsinki, Helsinki, Finland. BIBLIOGRAPHY Ackermann, R., 1965, ‘Deductive Scientific Explanation’, Philosophy of Science 32, 155-67. Ackermann, R. and Sterner, A., 1966,‘A Corrected Model of Explanation’, Philosophy of Science 33, 168-71. Btmge, M., 1967,Scientific Research ZZ,Springer-Verlag,New York. Bunge, M., 1968, ‘The Maturation of Science’,in I. Lakatos and A. Musgrave (eds.), Problems in the Philosophy of Science, North Holland PublishingCompany, Amsterdam. Campbell, N., 1920, Physics: The Elements; reprinted as: Foundations of Science, Dover, New York. Campbell, N., 1921, What is Science?, Dover, New York. Carnap, R., 1966, Philosophical Foundations of Physics (ed. by M. Gardner), Basic Books, New York. Craig, W., 1956,‘Replacementof Auxiliary Expressions’,Z7zePhilosophical Review 65, 38-55. Eberle, R., Kaplan, D., and Montague, R., 1961, ‘Hempel and Oppenheim on Explanation’, Philosophy of Science 28, 418-28. Hempel, C., 1969, ‘Reduction: Ontological and Linguistic Facets’,in S. Morgenbesser, P. Suppes, and M. White (eds.), Philosophy, Science, and Method, St. Martin’s, New York, pp. 179-99. Hempel, C. and Oppenheim, P., 1948, ‘Studiesin the Logic of Explanation’, Philosophy of Science, 15,135-75; reprinted in Hempel, C., 1965,Aspects of Scientific Explanation, The Free Press,New York, pp. 245-90. Hintikka, J., 1970, ‘SurfaceInformation and Depth Information’, in J. Hintikka and P. Suppes, (eds.), Znformation aud Inference, D. Reidel Publishing Company, Dordrecht. Hintikka, J., 1972,‘On the Different Ingredients of an Empirical Theory’, forthcoming in the Proceedings of the ZVth Znternatiozzal Congress in Logic, Methodology, and Philosophy of Science. Hintikka, J. and Tuomela, R., 1970,‘Towards a GeneralTheory of Auxiliary Concepts and Definability in First-Order Theories’, in J. Hintikka and P. Suppes (eds.), Information azzdInference, D. Reidel Publishing Company, Dordrecht, pp. 92-124. Kaplan, D. 1961, ‘Explanation Revisited’,Philosophy of Science 28,429-36. Kim, J. 1963, ‘On the Logical Conditions of Deductive Explanation’, Philosophy of Science 30, 286-91. 392 RAIMO TUOMELA Lakatos, I., 1970, ‘Falsificationism and the Methodology of ScientificResearchProgrammes’,in I. Lakatos and A. Musgrave (eds.), Criticism and the Growth oflynowledge, Cambridge University Press,Cambridge. Nagel, E. 1961, Z’he Structure ofscience, Harcourt, Braceand World, London. Niiniluoto, I. and Tuomela, R., 1972, Theoretical Concepts Within Inductive Systematization, forthcoming. Omer, I., 1970,‘On the D-N Model of ScientificExplanation’, Phirosophyofscience 37, 417-33. Ramsey, F., 1931, The Foundations of Mathematics and Other Logical Essays, Littlefield, Adams and Company, Paterson,N. J. Sellars,W., 1961, ‘The Language of Theories’, in H. Feigl and G. Maxwell (eds.), Current Zssues in the Philosophy of Science,Holt, Rinehart and Winston, Inc., New York. Smokler, H., 1968, ‘Conflicting Conceptsof Confirmation’, The Journal of Philosophy 65, 300-312. Stegmiiller, W., 1969, Wissenschaftliche Erkltirung und Beg&dung, Springer Verlag, Berlin, Heidelberg and New York. Tuomela, R., 1972, Theoretical Concepts, Springer-Verlag,Vienna.

RELATED PAPERS

RELATED TOPICS

Log In

Deductive explanation of scientific laws

Deductive explanation of scientific laws