You may not discuss this exam with anyone except the instructor. I will have office hours in 362 Ford on: Fri Dec. 14, 1:15-2:15, Mon Dec 17, 3:30-4:30, and Wed Dec. 18, 11-12 and 1-3. I can be reached by email any time. If there are any changes or corrections to the exam, I will send email.
Description of variables

These data give the number of known crustacean zooplankton species for 69 world lakes. Also included are a number of characteristics of each lake. There are missing values.
Name       Type     n   Info
Area       Variate  69  Lake area, in hectares
Cond       Variate  50  Specific conductance, microsiemens
Dist       Variate  69  Distance to nearest lake, km
Elev       Variate  69  Elevation, m
Lat        Variate  69  N latitude, degrees
Long       Variate  69  W longitude, degrees
MaxDepth   Variate  69  Maximum lake depth, m
MeanDepth  Variate  69  Mean lake depth, m
NLakes     Variate  69  Number of lakes within 20 km
Photo      Variate  47  Rate of photosynthesis, mostly by the 14C method
Species    Variate  69  Number of zooplankton species
Lake       Text     69  Name of lake
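Because Cond and Photo are recorded for only 50 and 47 of the 69 lakes, any analysis that uses them works with fewer cases than one that does not. The Python sketch below uses invented rows (not the exam data) and a hypothetical helper, complete_cases, to show how dropping incomplete cases shrinks the sample. It is an illustration of the bookkeeping only, not a prescribed way of handling the missing values.

```python
# Illustration only: the rows below are invented, not taken from the lake data.
# None marks a missing value.
rows = [
    {"Lake": "A", "Area": 120.0, "Cond": 85.0, "Photo": 30.0},
    {"Lake": "B", "Area": 4500.0, "Cond": None, "Photo": 110.0},
    {"Lake": "C", "Area": 15.0, "Cond": 40.0, "Photo": None},
    {"Lake": "D", "Area": 980.0, "Cond": 210.0, "Photo": 75.0},
]


def complete_cases(rows, variables):
    """Return the rows in which every listed variable is observed."""
    return [r for r in rows if all(r[v] is not None for v in variables)]


# Cases available when only Area is used, versus Area, Cond and Photo together.
n_area = len(complete_cases(rows, ["Area"]))
n_all = len(complete_cases(rows, ["Area", "Cond", "Photo"]))
print(n_area, n_all)
```

In this toy example every lake has an area, but only two of the four lakes remain once Cond and Photo are both required.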
Several questions are of interest to the ecologists, and your assignment is to provide specific answers to these specific questions. Each answer should consist of two parts: (a) your answer to the question, and (b) how you got that answer. Short answers are generally adequate. Some of the questions may not be answerable from these data. If there is no answer, state why no answer is possible. As is true in the real world, some questions are poorly worded and vague, and will require interpretation on your part, so many different answers may be possible. Some of the questions require only a little work, while others require lots of work.
The data file drugcost.lsp can be obtained from the class web page. On Linux, type (load "drugcost"). The data include the response, three primary predictors of interest, and a few other variables that characterize the health plan:
COST  = Average cost to plan for 1 prescription for 1 day, THE RESPONSE
RXPM  = Number of prescriptions per member per year
GS    = % generic substitution
RI    = Restrictiveness index (0=none, 100=total)
OC    = 1 if oral contraceptives are covered, 0 else
COPAY = Average prescription co-payment
AGE   = Average age
F     = % female members
MM    = Member months = number of members * average number of months per member
ID    = Plan name

The variable RXPM measures the overall use of drugs by the health plan; larger values mean more drug use. MM is a measure of the size of the plan.
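As a small worked example of the MM definition, the Python sketch below computes member months for two made-up plans. The enrollment figures and the function name member_months are assumptions for illustration; they do not come from drugcost.lsp.

```python
# Hypothetical enrollment figures; not values from drugcost.lsp.
def member_months(members, avg_months_per_member):
    """MM = number of members * average number of months per member."""
    return members * avg_months_per_member


small_plan = member_months(1_000, 10.5)   # 1,000 members, 10.5 months each on average
large_plan = member_months(25_000, 11.0)  # 25,000 members, 11 months each on average
print(small_plan, large_plan)
```

The second plan has many more member months than the first, which is the sense in which MM measures plan size.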
What to turn in

The question you are to answer is: How do COPAY, GS and RI impact prescription drug costs? Your solution should consist of two parts, a "Summary" and "Supporting Evidence." The summary will consist of:
(1) A statement of your conclusions, with relevant summary statistics and probability statements. This should be at most 300 words. Your conclusions may be equivocal: for example, they might depend on whether or not a specific case is treated as an outlier.
(2) AT MOST two graphical or numerical displays that are designed to convince someone familiar with statistical analysis that your analysis is sound, and that your conclusions are justified. Just giving a graph is NOT enough: you must explain what the graph shows and why it is interesting.
Your supporting evidence will consist of:
(1) AT MOST 500 words explaining how you got your answer, with up to five figures/tables that support your text. Unlabeled or unreferenced computer output will count against you. Word limits will be strictly enforced.