9/08/2016

Defining 'inductive argument' should not be this difficult

Carnap, give me strength. I cannot define ‘inductive argument’ and ‘fallacious argument’ in a way that correctly captures the intuitive boundary between inductive and fallacious arguments.

Like (almost) everyone else, I define ‘deductively correct’, i.e. ‘valid’, as follows:
An argument A is deductively correct (valid) =
If all of A’s premises are true, then A’s conclusion must be true =
If all of A’s premises are true, then it is impossible for A’s conclusion to be untrue.

Now, there are two (or maybe three) definitions of ‘inductive argument’ that follow this pattern of definition.

(Definition 1: probable simpliciter)
An argument B is inductively correct =
If all of B’s premises are true, then B’s conclusion is probably true =
If all of B’s premises are true, it is unlikely that B’s conclusion is untrue

(Definition 2: more probable)
An argument C is inductively correct =
If all of C’s premises are true, then C’s conclusion is more likely to be true than it would otherwise be =
If all of C’s premises are true, the probability of the conclusion’s untruth decreases.

In other words:
Definition 1: Pr(Conclusion | Premises) > 0.5
Definition 2: Pr(Conclusion | Premises) > Pr(Conclusion)

(If you think >0.5 is too low, you can pick whatever higher cutoff you like. My current problem is different from picking where to set that threshold number.)
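
To have something concrete to poke at, here is a minimal sketch of the two definitions as predicates on probabilities (the Python function names and the 0.9 in the example are mine, purely for illustration):

```python
# Minimal sketch of Definitions 1 and 2 as predicates on two numbers:
# p_c_given_p = Pr(Conclusion | Premises) and p_c = Pr(Conclusion).
# The 0.5 cutoff is just the placeholder threshold from Definition 1.

def def1_probable_simpliciter(p_c_given_p, threshold=0.5):
    return p_c_given_p > threshold

def def2_more_probable(p_c_given_p, p_c):
    return p_c_given_p > p_c

# A non sequitur whose conclusion is antecedently very probable and whose
# premises are irrelevant to it: Definition 1 calls it inductively correct,
# Definition 2 does not.
print(def1_probable_simpliciter(0.9))   # True
print(def2_more_probable(0.9, 0.9))     # False
```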

Now I can state my problem: it looks like neither definition makes the correct classifications for some paradigm examples of fallacies.

On Definition 1, any argument whose conclusion is highly probable independently of the truth or falsity of the premises will count as inductively correct. That is, any non sequitur whose conclusion is probably true will count as inductively correct. (This is the inductive analog of the fact that a logical truth is a deductive consequence of any set of premises. But it just feels much more wrong in the inductive case, for some reason; maybe just because I've been exposed to this claim about deductive inference for so long that it has lost its un-intuitiveness?)

On Definition 2, hasty generalization (i.e. sample size too small) will count as inductively correct: suppose I talk to 3 likely US Presidential voters and all 3 say they are voting for Clinton. It is intuitively fallacious to conclude that Clinton will win the Presidency, but surely those 3 responses give some (small, tiny) boost to the hypothesis that she will win the Presidency.
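
Just to get a feel for the size of that boost, here is one toy Bayesian model (all of the numbers are invented for illustration, not real polling assumptions): treat ‘Clinton will win’ as the hypothesis that she has 52% support among likely voters, its negation as 48% support, start from a 50/50 prior, and update on three-for-three Clinton responses.

```python
# Toy Bayesian model of the three-voter example (all numbers invented).
# H = 'Clinton will win', modeled as 52% support among likely voters;
# ~H modeled as 48% support. Prior Pr(H) = 0.5.

prior = 0.5
p_clinton_voter_if_H = 0.52
p_clinton_voter_if_notH = 0.48
n = 3  # all three sampled voters say they are voting for Clinton

lik_H = p_clinton_voter_if_H ** n
lik_notH = p_clinton_voter_if_notH ** n

posterior = lik_H * prior / (lik_H * prior + lik_notH * (1 - prior))
print(round(posterior, 3))  # ~0.56
```

So the premises do raise the probability of the conclusion, and Definition 2 is satisfied, even though nobody should treat three responses as settling the race; under a less opinionated model the boost would be smaller still.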

But non sequiturs and hasty generalizations are both paradigm examples of fallacies, so neither Definition 1 nor Definition 2 will work.

I said above that there might be a third definition. This would simply be the conjunction of Definitions 1 and 2: if the premises are true, then the conclusion must be BOTH probable simpliciter (Def. 1) AND more probable (Def. 2). It seems like this would rule out both non sequiturs (because a non sequitur’s premises fail to raise the probability of its conclusion, so it fails Def. 2) and hasty generalizations (because the conclusion wouldn’t be probable simpliciter, so they fail Def. 1).

Problem solved? I don’t think so, because there could be a hasty generalization whose conclusion is probable regardless of whether the premises are true. Given our current background information (as of Sept. 6, 2016) about the US Presidential race, the above example probably fits this description: ‘Clinton will win’ is more likely to be true than not, and the sample of three voters would boost a rational agent’s confidence in that claim (by a minuscule amount). That said, I will grant that a reasonable person might think this example is NOT a fallacy, but rather just an inductively correct argument that is so weak it is ALMOST a fallacy.
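
The same toy model from above makes the worry vivid if the prior already favors the conclusion (again, the numbers are invented stand-ins for the September 2016 background information):

```python
# Same toy model, but the prior already favors 'Clinton will win'.
prior = 0.7                              # invented stand-in for background information
lik_H, lik_notH = 0.52 ** 3, 0.48 ** 3   # three of three sampled voters for Clinton

posterior = lik_H * prior / (lik_H * prior + lik_notH * (1 - prior))
print(round(prior, 2), round(posterior, 2))  # 0.7 -> ~0.75

# Pr(C|P) > 0.5 and Pr(C|P) > Pr(C), so the conjunction of Definitions 1 and 2
# still counts this hasty generalization as inductively correct.
```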

Before signing off, I will float a fourth candidate definition:

Definition 4: Pr(Conclusion | Premises) >> Pr(Conclusion)
Put otherwise:
Definition 4: Pr(Conclusion | Premises) > Pr(Conclusion)+n, for some non-tiny n>0.

You could also conjoin this with Definition 1 if you wanted. This would take care of hasty generalizations. But does it create other problems? (You might object “That n will be arbitrary!” My initial reaction is that setting the line between inductive and fallacious at >0.5 [or wherever you choose to set it] is probably arbitrary in a similar way.)
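
Here is Definition 4 as a predicate, with a hypothetical n = 0.1 chosen only to make the arbitrariness worry concrete:

```python
# Sketch of Definition 4 with a hypothetical margin n (the 0.1 is arbitrary).
def def4_substantial_boost(p_c_given_p, p_c, n=0.1):
    return p_c_given_p > p_c + n

# The three-voter example above moved a 0.70 prior to about 0.75: not enough.
print(def4_substantial_boost(0.75, 0.70))   # False -> classified as fallacious
# A large, well-run poll that moved a rational agent from 0.5 to 0.9: enough.
print(def4_substantial_boost(0.90, 0.50))   # True  -> inductively correct
```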

3/12/2016

the no-miracles argument may not commit the base-rate fallacy

Certain philosophers argue that the No-Miracles Argument for realism (Colin Howson, Peter Lipton), the Pessimistic Induction against realism (Peter Lewis), or both arguments (P.D. Magnus and Craig Callender) commit the base-rate fallacy. I am not sure these objections are correct, and will try to articulate the reason for my doubt here.

I need to give some set-up; many readers will be familiar with some or all of this. So you can skip the next few paragraphs if you already know about the base-rate objection to the No-Miracles Argument and the Pessimistic Induction.

I suspect many readers are familiar with the base-rate fallacy; there are plenty of explanations of it around the internet. But just to have a concrete example, let’s consider a classic case of base-rate neglect. We are given information like the following, about a disease and a diagnostic test for this disease:

(1) There is a disease D that, at any given time, 1 in every 1000 members of the population has: Pr(D)=.001.

(2) If someone actually has disease D, then the test always comes back positive: Pr(+|D)=1.

(3) But the test has a false positive rate of 5%. That is, if someone does NOT have D, there is a 5% chance the test still comes back positive: Pr(+|~D)=.05.

Now suppose a patient tests positive. What is the probability that this patient actually has disease D?
Someone commits the base-rate fallacy if they say the probability is fairly high, because they discount or ignore the information about the ‘base rate’ of the disease in the population. Only 1 in 1000 people have the disease. But for every 1000 people who don’t have it, 50 people will test positive. You have to use Bayes’ Theorem to get the exact probability that someone who tests positive has the disease; the probability turns out to be slightly under 2%.
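
Here is the Bayes’ Theorem calculation spelled out; it is just arithmetic on (1)-(3) above:

```python
# Bayes' Theorem applied to (1)-(3) above.
p_D = 0.001              # (1) base rate of the disease
p_pos_given_D = 1.0      # (2) Pr(+ | D)
p_pos_given_notD = 0.05  # (3) Pr(+ | ~D), the false-positive rate

p_pos = p_pos_given_D * p_D + p_pos_given_notD * (1 - p_D)
p_D_given_pos = p_pos_given_D * p_D / p_pos
print(round(p_D_given_pos, 4))  # 0.0196: slightly under 2%
```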

In the context of the No-Miracles and Pessimistic Induction arguments, the objection is that both arguments ignore a relevant base rate. For example, the No-Miracles argument says:

(A) Pr (T is empirically successful | T is approximately true) = 1

(B) Pr (T is empirically successful | ~(T is approximately true)) << 1

Inequality (B) is supposed to capture the ‘no-miracles intuition’: the probability that a false theory would be empirically successful is so low that it would be a MIRACLE if that theory were empirically successful. Hopefully you can see that (A) corresponds to (2) in the original, medical base-rate fallacy example, and (B) corresponds to (3). Empirical success is analogous to a positive test for the truth of a theory, and the no-miracles intuition is that the false-positive rate is very low (so low that a false positive would be a miracle).

The base-rate objection to the No-Miracles argument is just that the No-Miracles argument ignores the base rate of true theories in the population of theories. In other words, in the NMA, there is no analogue of (1) in the original example. Without that information, even a very low false-positive rate cannot license the conclusion that an arbitrary empirically successful theory is probably true. (And furthermore, that base rate is somewhere between extremely difficult and impossible to obtain: what exactly is the probability that an arbitrary theory in the space of all possible theories is approximately true?)
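
To see what the objectors are pointing at, here is the same Bayes’ Theorem arithmetic with no-miracles-style numbers plugged in and the base rate left as a free parameter (the 0.01 false-positive rate and the candidate base rates are all hypothetical):

```python
# The NMA 'posterior' as a function of the missing base rate (numbers hypothetical).
p_success_given_true = 1.0    # analogue of (A)
p_success_given_false = 0.01  # analogue of (B): 'it would be a miracle'

for base_rate in (0.5, 0.1, 0.001, 0.000001):
    p_success = (p_success_given_true * base_rate
                 + p_success_given_false * (1 - base_rate))
    p_true_given_success = p_success_given_true * base_rate / p_success
    print(base_rate, round(p_true_given_success, 4))

# The posterior runs from roughly 0.99 down to roughly 0.0001, so without some
# handle on the base rate, the low false-positive rate alone settles nothing.
```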

OK, that concludes the set-up. Now I can state my concern: I am not sure the objectors’ demand for the base rate of approximately true theories in the space of all possible theories is legitimate. Why? Think about the original medical example again. There, we are simply GIVEN the base rate, namely (1). But how would one acquire that sort of information, if one did not already have it? Well, you would have to run tests on large numbers of people in the population at large, to determine whether or not they had disease D. These tests need not be fast-diagnosing blood or swab tests; they might involve looking for symptoms more ‘directly,’ but they will still be tests. And this test, which we are using to establish the base rate of D in the population, will still presumably have SOME false positives. (I’m guessing that most diagnostic tests are not perfect.) But if there are some false positives, and we don’t yet know the base rate of the disease in the population, then—if we follow the reasoning of the base-rate objectors to the NMA and the PI—any conclusion we draw about the proportion of the population that has the disease is fallacious, for we have neglected the base rate. But on that reasoning, we can never determine the base rate of a disease (unless we have an absolutely perfect diagnostic test), because of an infinite regress.

In short: if the NMA commits the base-rate fallacy, then any attempt to discover a base rate (when detection tools have false positives) also commits the base-rate fallacy. But presumably, we do sometimes discover base rates (at least approximately) without committing the base-rate fallacy, so by modus tollens, the NMA does not commit the base-rate fallacy.
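
For what it’s worth, here is a sketch of how base rates do in fact get estimated from a test with false positives, assuming the test’s error rates are known (I believe this is the standard prevalence correction sometimes attributed to Rogan and Gladen, though I am going from memory); the numbers are just the ones from the medical example above:

```python
# Back-of-the-envelope prevalence estimate from an imperfect test
# (assumes the test's sensitivity and false-positive rate are known).
sensitivity = 1.0                 # Pr(+ | D), as in (2)
false_positive_rate = 0.05        # Pr(+ | ~D), as in (3)
observed_positive_rate = 0.05095  # fraction of a large survey that tests positive

# Observed rate = sensitivity*prevalence + false_positive_rate*(1 - prevalence);
# solve that equation for the prevalence, i.e. the base rate:
prevalence = ((observed_positive_rate - false_positive_rate)
              / (sensitivity - false_positive_rate))
print(round(prevalence, 4))  # 0.001, recovering the base rate from (1)
```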

The NMA does not commit the base-rate fallacy, because it does not ignore AVAILABLE evidence about the base rate of true theories in the population of theories. In the medical example above, the base rate (1) is available information; ignoring or under-weighing it is what generates the fallacy. In the scientific realism case, however, the base rate is not available. If we did somehow have the base rate of approximately true theories in the population of all theories (the gods of science revealed it to us, say), then yes, it would be fallacious to ignore or discount that information when drawing conclusions about the approximate truth of a theory from its empirical success, i.e. the NMA would be committing the base-rate fallacy. But unfortunately the gods of science have not revealed that information to us. Not taking into account unavailable information is not a fallacy; in other words, the base-rate fallacy only occurs when one fails to take into account available information.

I am not certain about the above. I definitely want to talk to some more statistically savvy people about this. Any thoughts?


2/15/2016

huh

wait, what?

I'm not sure I can articulate why, but this makes me want to stop blogging...