__Data Project #1, part 1__

**For Data Project #1**--**Submit at least 3 ideas by Thursday, June 6 for your Data Project #1.**- You will be collecting data that is
(**quantitative****continuous**) from two different populations that it makes sense to compare.

**IDEAS ONLY (that's what "part 1" is about) -- DO NOT START COLLECTING DATA YET!**- ONE VARIABLE -- The
**variable of interest**should be thefor both samples and we will compare the two samples later on.**same**__You should take all the measurements yourself__.

- TWO POPULATIONS -- The two populations should be distinct and there should be no question about which population any sampling unit belongs in.

*NOTE: The sample sizes are to be at least 30 for each of your two samples.*

Here are some examples: -
- height of women vs height of men (variable: height, two populations: men, women) -- Note:
*You must measure the height yourself*--__don't__just ask people "how tall are you?" - distance a horseshoe is thrown for people under 30 vs people 30 and over (variable: distance, two populations: people under 30, people 30 and over) -- Note that is is clear which population any person belongs to, rather than having "young" and "old"
- weight of "10-lb bags" of potatoes at Safeway vs at the Coop (variable: weight, two populations: Safeway, Coop)
- length of time a smoker can hold his/her breath vs a non-smoker (variable: time, two populations: smokers, non-smokers

- height of women vs height of men (variable: height, two populations: men, women) -- Note:
*Do not start collecting data yet!*(at least) so that I can help you select something appropriate for this part of the project. To "turn them in," write your ideas in the Forum for this -- so that people can see what other people are thinking about -- makes it more interesting and you might inspire each other. (People on the waitlist will have to email me their ideas.)**Turn in (by Thursday) three ideas****Be sure to state clearly (a) what the variable is and (b) describe the two populations clearly.**

**Math 15 Data Project #1, Part 2 -- DATA COLLECTION AND REPRESENTATION**Due Thursday, June 13

__Do__*NOT*start collecting data until your project has been approved!You should define clearly (1)

**one quantitative variable**, and (2)

**two populations**that it would make sense to compare.

__DATA COLLECTION__Collect data on

__one continuous quantitative variable__from

__two populations__.

- The two populations should be defined clearly so that every person (or unit) indisputably belongs to one of the two populations and not BOTH (though some units might not belong in either population, in which case you would not include them in your data collection).
- Your sample sizes should be AT LEAST 30 (of course more would be better) for each of the two samples and are not required to be the same (for example you might have a sample size of 36 for Population 1 and 52 for Population 2, for some reason.)
- The variable you measure should be the same for both populations.
- The method you use to do the measuring should be the same every time you do it, and you should do all the measuring yourself -- don't just ask people, for instance.
- You should end up with two quantitative data sets (i.e., two lists of numerical measurements), one sample from each population.

__WHAT TO TURN IN:__- an electronic spreadsheet or text file with your raw data in two columns. Label the columns clearly. Use this format for the file name: ‘
**Math15_RawData_firstlast.xls**’ (for example, mine might be ‘Math15_RawData_TamiMatsumoto.xls’). Submit your file here, or paste in your data -- that is, if you prefer, just type in the two lists of numbers, one after the other, as text. - a written paper – just 1 or 2 pages (no more than 2) – do not write on the “back” of the paper – including:
. Don't worry about whether your method was statistically sound – we have yet to learn about that – just report exactly what you did. Include a description of how, out of the entire population, you decided to select the particular units that you measured.__a description of your data collection choice and the data collection process you used__(separately – do not combine the data all together). Present the information in a table (for easier comparison of the two data sets). Include, for each sample, the following: sample size, the min, the max, the mean, median, and any mode(s).**descriptive statistics for each of your data sets**of your data (do not write on the back). Do not pool both sets of data together, but take care to present your data so that the viewer can make visual comparisons.**at least three different visual displays**

NOTE: Your paper should have enough information visible on it so that a reader can understand what the numbers represent and where they came from. If we were to post your paper on the wall, people walking up to it and looking at it should be able to understand what it is you are displaying.

__:__

**For this assignment, you will be graded on**- Appropriateness of Data
- Description of your sampling process
- Descriptive Statistics
- Data Displays
- Overall Visual Presentation of the Data