Lies, Damned Lies, or Statistics – How to Tell the Truth with Statistics (Poritz)

Categories:

Recommended

Descriptive Statistics

The first instinct of the scientist should be to organize carefully a question of interest, and to collect some data about this question. How to collect good data is a real and important issue, but one we discuss later. Let us instead assume for the moment that we have some data, good or bad, and first consider what to do with them . In particular, we want to describe them, both graphically and with numbers that summarize some of their features.

We will start by making some basic definitions of terminology – words like individual, population, variable, mean, median, etc. – which it will be important for the student to understand carefully and completely. So let’s briefly discuss what a definition is, in mathematics.

Mathematical definitions should be perfectly precise because they do not describe something which is observed out there in the world, since such descriptive definitions might have fuzzy edges. In biology, for example, whether a virus is considered “alive” could be subject to some debate: viruses have some of the characteristics of life, but not others. This makes a mathematician nervous.

When we look at math, however, we should always know exactly which objects satisfy some definition and which do not. For example, an even number is a whole number which is two times some other whole number. We can always tell whether some number is even, then, by simply checking if there is some other number for which the arithmetic statement is true: if so, is even, if not, is not even. If you claim a number is even, you need just state what is the corresponding ; if claim it is not even, you have to somehow give a convincing, detailed explanation (dare we call it a “proof”) that such a simply does not exist.

So it is important to learn mathematical definitions carefully, to know what the criteria are for a definition, to know examples that satisfy some definition and other examples which do not.

Note, finally, that in statistics, since we are using mathematics in the real world, there will be some terms (like individual and population) which will not be exclusively in the mathematical realm and will therefore have less perfectly mathematical definitions. Nevertheless, students should try to be as clear and precise as possible.

The material in this Part is naturally broken into two cases, depending upon whether we measure a single thing about a collection of individuals or we make several measurements. The first case is called one-variable statistics, and will be our first major topic. The second case could potentially go as far as multi-variable statistics, but we will mostly talk about situations where we make two measurements, our second major topic. In this case of bivariate statistics, we will not only describe each variable separately (both graphically and numerically), but we will also describe their relationship, graphically and numerically as well.

Category:

Attribution

“Book: Lies, Damned Lies, or Statistics – How to Tell the Truth with Statistics (Poritz)” by Jonathan A. Poritz, LibreTexts is licensed under CC BY-SA .

VP Flipbook Maker

Created a flipbook like this. This flipbook is made with Visual Paradigm Online. Try this free flipbook maker and create you own flipbook now!