AP Statistics Glossary

07-08

GLOSSARY

Alternative hypothesis—the theory that the researcher hopes to confirm by rejecting the null hypothesis

Association—when some of the variability in one variable can be accounted for by the other

Bar graph—graph in which the frequencies of categories are displayed with bars; analogous to a histogram for numerical data

Bimodal—distribution with two (or more) most common values; see mode

Binomial distribution—probability distribution for a random variable X in a binomial setting; $P(X = x) = \binom{n}{x} p^x (1 - p)^{n - x}$, where n is the number of independent trials, p is the probability of success on each trial, and x is the count of successes out of the n trials
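As a rough illustration of the binomial formula, the following Python sketch computes P(X = x) directly. The function name `binomial_pmf` and the example numbers (10 trials, p = 0.5) are assumptions chosen for this example and do not come from the glossary.

```python
from math import comb


def binomial_pmf(x: int, n: int, p: float) -> float:
    """Return P(X = x) for a binomial random variable with n trials
    and success probability p."""
    return comb(n, x) * p**x * (1 - p) ** (n - x)


# Probability of exactly 3 successes in 10 independent trials with p = 0.5
probability_of_three = binomial_pmf(3, 10, 0.5)
print(round(probability_of_three, 4))

# The probabilities over all possible counts 0..n should add up to 1
total = sum(binomial_pmf(x, 10, 0.5) for x in range(11))
print(round(total, 6))
```

Summing the probabilities over every possible count is a quick sanity check that the formula describes a complete probability distribution.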

Binomial setting (experiment)—when each of a fixed number, n, of observations either succeeds or fails, independently, with probability p

Bivariate data—having to do with two variables

Block—a grouping of experimental units thought to be related to the response to the treatment

Block design—procedure by which experimental units are put into homogeneous groups in an attempt to control for the effects of the group on the response

Blocking—see block design

Boxplot (box and whisker plot)—graphical representation of the five-number summary of a dataset. Each value in the five-number summary is located over its corresponding value on a number line. A box is drawn that ranges from Q1 to Q3, and “whiskers” extend from Q1 and Q3 to the minimum and maximum values.

Categorical data—see qualitative data

Census—attempt to contact every member of a population

Center—the “middle” of a distribution; either the mean or the median

Central limit theorem—theorem that states that the sampling distribution of a sample mean becomes approximately normal when the sample size is large
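The central limit theorem can be explored with a small simulation. The sketch below is only an illustration under stated assumptions: the population is Uniform(0, 1), the sample size is 40, and 2000 samples are drawn. The function name `simulate_sample_means` and the fixed seed are hypothetical choices made for repeatability.

```python
import random
from statistics import mean, stdev


def simulate_sample_means(sample_size, repetitions, seed=0):
    """Draw repeated samples from Uniform(0, 1) and return their means."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    return [
        mean(rng.random() for _ in range(sample_size))
        for _ in range(repetitions)
    ]


means = simulate_sample_means(sample_size=40, repetitions=2000)
center, spread = mean(means), stdev(means)

# In a normal distribution, about 68% of values lie within one
# standard deviation of the mean, so this share should be near 0.68.
share_within_one_sd = sum(abs(m - center) <= spread for m in means) / len(means)
print(round(center, 3), round(spread, 3), round(share_within_one_sd, 3))
```

Although the population itself is flat rather than bell-shaped, the sample means should cluster near 0.5 with a roughly normal shape.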

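Chi-square (χ²) goodness-of-fit test—compares a set of observed categorical values to a set of expected values under a set of hypothesized proportions for the categories; $\chi^2 = \sum \frac{(O - E)^2}{E}$, where O is an observed count and E is the corresponding expected count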
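A minimal sketch of the chi-square statistic follows. The die-rolling counts are invented for illustration, and `chi_square_statistic` is a hypothetical helper name. Finding a P-value would additionally require the chi-square distribution, which this sketch does not cover.

```python
def chi_square_statistic(observed, expected):
    """Sum of (O - E)^2 / E over all categories."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Hypothetical data: 60 rolls of a die, testing whether the die is fair.
observed_counts = [8, 12, 9, 11, 10, 10]
expected_counts = [10] * 6  # 60 rolls times the hypothesized proportion 1/6
statistic = chi_square_statistic(observed_counts, expected_counts)
degrees_of_freedom = len(observed_counts) - 1
print(round(statistic, 4), degrees_of_freedom)
```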

Cluster sample—The population is first divided into sections or “clusters.” Then we randomly select an entire cluster, or clusters, and include all of the members of the cluster(s) in the sample.

Coefficient of determination (r²)—measures the proportion of variation in the response variable explained by regression on the explanatory variable

Complement of an event—set of all outcomes in the sample space that are not in the event

Completely randomized design—when all subjects (or experimental units) are randomly assigned to treatments in an experiment

Conditional probability—the probability of one event succeeding given that some other event has already occurred

Confidence interval—an interval that, with a given level of confidence, is likely to contain a population value; (estimate) ± (margin of error)
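To make the (estimate) ± (margin of error) pattern concrete, here is a sketch for one common case, a large-sample interval for a single proportion. The choice of this particular interval, the function name `proportion_interval`, and the example counts are assumptions for illustration; the glossary itself does not single out any one procedure.

```python
from math import sqrt
from statistics import NormalDist


def proportion_interval(successes, n, confidence=0.95):
    """Return (low, high) for a large-sample confidence interval
    for a population proportion."""
    p_hat = successes / n                      # the estimate
    z_star = NormalDist().inv_cdf(1 - (1 - confidence) / 2)  # critical value
    standard_error = sqrt(p_hat * (1 - p_hat) / n)
    margin_of_error = z_star * standard_error
    return p_hat - margin_of_error, p_hat + margin_of_error


# Hypothetical survey: 50 of 100 respondents answer "yes"
low, high = proportion_interval(50, 100)
print(round(low, 3), round(high, 3))
```

Raising the confidence level increases the critical value and therefore widens the interval.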

Confidence level—the probability that the procedure used to construct an interval will generate an interval that does contain the population value

Confounding variable—variable that has an effect on the outcomes of the study but whose effects cannot be separated from those of the treatment variable

Contingency table—see two-way table

Continuous data—data that can be measured, or take on values in an interval; the set of possible values cannot be counted

Continuous random variable—a random variable whose values are continuous data; takes all values in an interval

Control—see statistical control

Convenience sample—sample chosen without any random mechanism; chooses individuals based on ease of selection

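Correlation coefficient (r)—measures the strength of the linear relationship between two quantitative variables; $r = \frac{1}{n - 1} \sum \left( \frac{x_i - \bar{x}}{s_x} \right) \left( \frac{y_i - \bar{y}}{s_y} \right)$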
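The correlation coefficient can be computed as the average product of standardized values, dividing by n - 1. The sketch below assumes that form; the helper name `correlation` and the data are made up for the example.

```python
from statistics import mean, stdev


def correlation(xs, ys):
    """Correlation coefficient r as the sum of products of
    standardized x and y values, divided by n - 1."""
    n = len(xs)
    x_bar, y_bar = mean(xs), mean(ys)
    s_x, s_y = stdev(xs), stdev(ys)  # sample standard deviations
    products = (
        ((x - x_bar) / s_x) * ((y - y_bar) / s_y)
        for x, y in zip(xs, ys)
    )
    return sum(products) / (n - 1)


hours = [1, 2, 3, 4, 5]        # hypothetical explanatory values
scores = [52, 60, 61, 70, 74]  # hypothetical response values
r = correlation(hours, scores)
print(round(r, 3), round(r**2, 3))  # r and the coefficient of determination
```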

Correlation is not causation—just because two variables correlate strongly does not mean that one caused the other

Critical value—values in a distribution that identify certain specified areas of the distribution

Degrees of freedom—number of independent data points in a distribution

Density function—a function that is everywhere non-negative and has a total area equal to 1 underneath it and above the horizontal axis

Descriptive statistics—process of examining data analytically and graphically

Dimension—size of a two-way table; r × c

Discrete data—data that can be counted (possibly infinite) or placed in order

Discrete random variable—random variable whose values are discrete data

Dotplot—graph in which data values are identified as dots placed above their corresponding values on a number line

Double blind—experimental design in which neither the subjects nor the study administrators know what treatment a subject has received

Empirical Rule (68-95-99.7 Rule)—states that, in a normal distribution, about 68% of the terms are within one standard deviation of the mean, about 95% are within two standard deviations, and about 99.7% are within three standard deviations
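The percentages in the Empirical Rule can be checked against the standard normal curve. This sketch uses `NormalDist` from Python's standard library; the helper name `share_within` is an assumption for the example.

```python
from statistics import NormalDist

standard_normal = NormalDist(mu=0, sigma=1)


def share_within(k):
    """Area under the standard normal curve within k standard
    deviations of the mean."""
    return standard_normal.cdf(k) - standard_normal.cdf(-k)


for k in (1, 2, 3):
    print(k, round(share_within(k), 4))
```

The three areas come out close to 0.68, 0.95, and 0.997, which is where the rule gets its name.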

Estimate—sample value used to approximate a value of a parameter

Event—in probability, a subset of a sample space; a set of one or more simple outcomes

Expected value—mean value of a discrete random variable

Experiment—study in which a researcher measures the responses to a treatment variable, or variables, imposed and controlled by the researcher

Experimental units—individuals on which experiments are conducted

Explanatory variable—explains changes in response variable; treatment variable; independent variable

Extrapolation—predictions about the value of a variable based on the value of another variable outside the range of measured values

First quartile—25th percentile

Five-number summary—for a dataset, [minimum value, Q1, median, Q3, maximum value]
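A five-number summary can be sketched in a few lines. Quartile conventions differ between textbooks and software; the version below assumes Q1 and Q3 are the medians of the lower and upper halves of the ordered data, leaving out the overall median when the count is odd. The function name `five_number_summary` and the data are hypothetical.

```python
from statistics import median


def five_number_summary(data):
    """Return (minimum, Q1, median, Q3, maximum), taking Q1 and Q3 as
    the medians of the lower and upper halves of the ordered data."""
    values = sorted(data)
    n = len(values)
    lower_half = values[: n // 2]
    upper_half = values[(n + 1) // 2 :]
    return (
        values[0],
        median(lower_half),
        median(values),
        median(upper_half),
        values[-1],
    )


minimum, q1, med, q3, maximum = five_number_summary([7, 1, 9, 3, 5, 2, 8, 4, 6])
interquartile_range = q3 - q1
print(minimum, q1, med, q3, maximum, interquartile_range)
```

The same five numbers are what a boxplot draws, and Q3 minus Q1 gives the interquartile range.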

Geometric setting—independent observations, each of which succeeds or fails with the same probability p; number of trials needed until first success is variable of interest

Histogram—graph in which the frequencies of numerical data are displayed with bars; analogous to a bar graph for categorical data

Homogeneity of proportions—chi-square hypothesis in which proportions of a categorical variable are tested for homogeneity across two or more populations

Independent events—knowing one event occurs does not change the probability that the other occurs; P(A) = P(A|B)

Independent variable—see explanatory variable

Inferential statistics—use of sample data to make inferences about populations

Influential observation—observation, usually in the x direction, whose removal would have a marked impact on the slope of the regression line

Interpolation—predictions about the value of a variable based on the value of another variable within the range of measured values

Interquartile range—value of the third quartile minus the value of the first quartile; contains middle 50% of the data

Least-squares regression line—of all possible lines, the line that minimizes the sum of squared errors (residuals) from the line
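The least-squares line can be found from the standard closed-form slope and intercept. The sketch below is one way to do it; `least_squares_line`, `sum_squared_residuals`, and the data are illustrative names and values, not taken from the glossary.

```python
from statistics import mean


def least_squares_line(xs, ys):
    """Return (intercept, slope) of the line that minimizes the sum
    of squared residuals."""
    x_bar, y_bar = mean(xs), mean(ys)
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    slope = s_xy / s_xx
    intercept = y_bar - slope * x_bar
    return intercept, slope


def sum_squared_residuals(xs, ys, intercept, slope):
    """Sum of (actual - predicted)^2 for a candidate line."""
    return sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))


xs = [1, 2, 3, 4]
ys = [2, 4, 5, 8]
intercept, slope = least_squares_line(xs, ys)

# Residual = actual value minus predicted value
residuals = [y - (intercept + slope * x) for x, y in zip(xs, ys)]
print(round(intercept, 3), round(slope, 3))
print([round(e, 3) for e in residuals])
```

Nudging the slope or intercept away from the fitted values should only increase the sum of squared residuals, which is a handy way to check the result.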

Line of best fit—see least-squares regression line

Lurking variable—one that has an effect on the outcomes of the study but whose influence was not part of the investigation

Margin of error—measure of uncertainty in the estimate of a parameter; (critical value) · (standard error)

Marginal totals—row and column totals in a two-way table

Matched pairs—experimental units paired by a researcher based on some common characteristic or characteristics

Matched pairs design—experimental design that utilizes each pair as a block; one unit receives one treatment, and the other unit receives the other treatment

Mean—sum of all the values in a dataset divided by the number of values

Median—halfway through an ordered dataset, below and above which lies an equal number of data values; 50th percentile

Mode—most common value in a distribution

Mound-shaped (bell-shaped)—distribution in which data values tend to cluster about the center of the distribution; characteristic of a normal distribution

Mutually exclusive events—events that cannot occur simultaneously; if one occurs, the other doesn’t

Negatively associated—larger values of one variable are associated with smaller values of the other; see association

Nonresponse bias—occurs when subjects selected for a sample do not respond

Normal curve—familiar bell-shaped density curve; symmetric about its mean; defined in terms of its mean and standard deviation; $f(x) = \frac{1}{\sigma \sqrt{2\pi}} e^{-\frac{1}{2} \left( \frac{x - \mu}{\sigma} \right)^2}$

Normal distribution—distribution of a random variable X so that P(a < X < b) is the area under the normal curve between a and b

Null hypothesis—hypothesis being tested; usually a statement that there is no effect or difference between treatments; what a researcher wants to disprove to support his/her alternative

Numerical data—see quantitative data

Observational study—when variables of interest are observed and measured but no treatment is imposed in an attempt to influence the response

Observed values—counts of outcomes in an experiment or study; compared with expected values in a chi-square analysis

One-sided alternative—alternative hypothesis that varies from the null in only one direction

One-sided test—used when an alternative hypothesis states that the true value is less than or greater than the hypothesized value

Outcome—simple event in a probability experiment

Outlier—a data value that is far removed from the general pattern of the data

P(A and B)—probability that both A and B occur; P(A and B) = P(A) · P(B|A)

P(A or B)—probability that either A or B occurs; P(A or B) = P(A) + P(B) - P(A and B)
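The two probability rules can be checked by counting outcomes for a single roll of a fair die. The events chosen here (A: the roll is even, B: the roll is greater than 3) and the helper name `probability` are assumptions for the sake of the example.

```python
from fractions import Fraction

sample_space = {1, 2, 3, 4, 5, 6}
event_a = {2, 4, 6}  # A: the roll is even
event_b = {4, 5, 6}  # B: the roll is greater than 3


def probability(event, given=None):
    """P(event), or P(event | given) when a conditioning event is
    supplied, assuming equally likely outcomes."""
    space = sample_space if given is None else given
    return Fraction(len(event & space), len(space))


p_a = probability(event_a)
p_b = probability(event_b)
p_b_given_a = probability(event_b, given=event_a)

p_a_and_b = p_a * p_b_given_a    # multiplication rule
p_a_or_b = p_a + p_b - p_a_and_b  # addition rule
print(p_a_and_b, p_a_or_b)
```

Both results agree with counting the outcomes in the intersection and union of the two events directly.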

P-value—probability of getting a sample value at least as extreme as that obtained by chance alone assuming the null hypothesis is true
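As one concrete case, a P-value for a two-sided test about a single proportion can be sketched with the normal approximation. The choice of this test, the function name `one_proportion_z_test`, and the sample counts are assumptions made for illustration only.

```python
from math import sqrt
from statistics import NormalDist


def one_proportion_z_test(successes, n, p0):
    """Return (z, two-sided P-value) for the null hypothesis that the
    population proportion equals p0, using the normal approximation."""
    p_hat = successes / n
    standard_error = sqrt(p0 * (1 - p0) / n)  # computed under the null
    z = (p_hat - p0) / standard_error
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value


# Hypothetical sample: 60 successes in 100 trials, null value p0 = 0.5
z, p_value = one_proportion_z_test(60, 100, 0.5)
print(round(z, 2), round(p_value, 4))
```

A small P-value means a sample result this far from the null value would be unusual if the null hypothesis were true.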

Parameter—measure that describes a population

Percentile rank—proportion of terms in the distribution less than the value being considered

Placebo—an inactive procedure or treatment

Placebo effect—effect, often positive, attributable to the patient’s expectation that the treatment will have an effect

Point estimate—value based on sample data that represents a likely value for a population parameter

Positively associated—larger values of one variable are associated with larger values of the other; see association

Power of the test—probability of correctly rejecting the null hypothesis when a specific alternative is true

Probability distribution—identification of the outcomes of a random variable together with the probabilities associated with those outcomes

Probability histogram—histogram for a probability distribution; horizontal axis shows the outcomes, vertical axis shows the probabilities of those outcomes

Probability of an event—relative frequency of the number of ways an event can succeed to the total number of ways it can succeed or fail

Probability sample—sampling technique that uses a random mechanism to select the members of the sample

Proportion—ratio of the count of a particular outcome to the total number of outcomes

Qualitative data—data whose values range over categories rather than values

Quantitative data—data whose values are numerical

Quartiles—25th, 50th, and 75th percentiles of a dataset

Random phenomenon—unclear how any one trial will turn out, but there is a regular distribution of outcomes in a large number of trials

Random sample—sample in which each member of the sample is chosen by chance and each member of the population has an equal chance to be in the sample

Random variable—numerical outcome of a random phenomenon (random experiment)

Randomization—random assignment of experimental units to treatments

Range—difference between the maximum and minimum values of a dataset

Replication—repetition of each treatment enough times to help control for chance variation

Representative sample—sample that possesses the essential characteristics of the population from which it was taken

Residual—in a regression, the actual value minus the predicted value

Resistant statistic—one whose numerical value is not influenced by extreme values in the dataset

Response bias—bias that stems from respondents’ inaccurate or untruthful response

Response variable—measures the outcome of a study

Robust—when a procedure may still be useful even if the conditions needed to justify it are not completely satisfied

Robust procedure—procedure that still works reasonably well even if the assumptions needed for it are violated; the t-procedures are robust against the assumption of normality as long as there are no outliers or severe skewness.

Sample space—set of all possible mutually exclusive outcomes of a probability experiment

Sample survey—using a sample from a population to obtain responses to questions from individuals

Sampling distribution of a statistic—distribution of all possible values of a statistic for samples of a given size

Sampling frame—list of experimental units from which the sample is selected

Scatterplot—graphical representation of a set of ordered pairs; horizontal axis is first element in the pair, vertical axis is the second

Shape—geometric description of a dataset: mound-shaped, symmetric, uniform, skewed, etc.

Significance level (α)—probability value that, when compared to the P-value, determines whether a finding is statistically significant

Simple random sample (SRS)—sample in which all possible samples of the same size are equally likely to be the sample chosen

Simulation—random imitation of a probabilistic situation

Skewed—distribution that is asymmetrical

Skewed left (right)—asymmetrical with more of a tail on the left (right) than on the right (left)

Spread—variability of a distribution

Standard deviation—square root of the variance; for a sample, $s = \sqrt{\frac{1}{n - 1} \sum (x_i - \bar{x})^2}$

Standard error—estimate of the standard deviation of a statistic's sampling distribution, based on sample data

Standard normal distribution—normal distribution with a mean of 0 and a standard deviation of 1

Standard normal probability—normal probability calculated from the standard normal distribution

Statistic—measure that describes a sample (e.g., sample mean)

Statistical control—holding constant variables in an experiment that might affect the response but are not one of the treatment variables

Statistically significant—a finding that is unlikely to have occurred by chance

Statistics—science of data

Stemplot (stem-and-leaf plot)—graph in which ordinal data are broken into “stems” and “leaves”; visually similar to a histogram except that all the data are retained

Stratified random sample—groups of interest (strata) chosen in such a way that they appear in approximately the same proportions in the sample as in the population

Subjects—human experimental units

Survey—obtaining responses to questions from individuals

Symmetric—data values distributed equally above and below the center of the distribution

Systematic bias—the mean of the sampling distribution of a statistic does not equal the population parameter being estimated; see unbiased estimate

Systematic sample—probability sample in which one of the first n subjects is chosen at random for the sample and then each nth person after that is chosen for the sample

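t-distribution—the distribution with n - 1 degrees of freedom for the t statistic; $t = \frac{\bar{x} - \mu}{s / \sqrt{n}}$

Test statistic—$\frac{\text{estimator} - \text{hypothesized value}}{\text{standard error}}$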

Third quartile—75th percentile

Treatment variable—see explanatory variable

Tree diagram—graphical technique for showing all possible outcomes in a probability experiment

Two-sided alternative—alternative hypothesis that can vary from the null in either direction; values much greater than or much less than the null provide evidence against the null

Two-sided test—a hypothesis test with a two-sided alternative

Two-way table—table that lists the outcomes of two categorical variables; the values of one category are given as the row variable, and the values of the other category are given as the column variable; also called a contingency table

Type-I error—the error made when a true null hypothesis is rejected

Type-II error—the error made when a false null hypothesis is not rejected

Unbiased estimate—mean of the sampling distribution of the estimate equals the parameter being estimated

Undercoverage—some groups in a population are not included in a sample from that population

Uniform—distribution in which all data values have the same frequency of occurrence

Univariate data—having to do with a single variable

Variance—average of the squared deviations from their mean of a set of observations; for a sample, $s^2 = \frac{1}{n - 1} \sum (x_i - \bar{x})^2$
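The sample variance and standard deviation can be computed by hand and compared with Python's `statistics` module. This sketch assumes the sample form that divides by n - 1; the function name `sample_variance` and the data are hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def sample_variance(values):
    """Sum of squared deviations from the mean, divided by n - 1."""
    x_bar = mean(values)
    return sum((x - x_bar) ** 2 for x in values) / (len(values) - 1)


data = [2, 4, 4, 4, 5, 5, 7, 9]
s_squared = sample_variance(data)
s = sqrt(s_squared)  # the standard deviation is the square root of the variance

# statistics.variance also divides by n - 1, so the two should agree
print(round(s_squared, 4), round(variance(data), 4), round(s, 4))
```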

Voluntary response bias—bias inherent when people choose to respond to a survey or poll; bias is typically toward opinions of those who feel most strongly

Voluntary response sample—sample in which participants are free to respond or not to a survey or a poll

Wording bias—creation of response bias attributable to the phrasing of a question

z-score—number of standard deviations a term is above or below the mean; $z = \frac{x - \bar{x}}{s}$ for a sample, or $z = \frac{x - \mu}{\sigma}$ for a population
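A short sketch of z-scores for a small sample follows. It assumes the sample mean and sample standard deviation are used for standardizing; the function name `z_scores` and the data are made up for the example.

```python
from statistics import mean, stdev


def z_scores(values):
    """Standardize each value: (value - mean) / standard deviation."""
    x_bar, s = mean(values), stdev(values)
    return [(x - x_bar) / s for x in values]


scores = z_scores([2, 4, 6])
print(scores)
```

A positive z-score marks a value above the mean and a negative one marks a value below it, measured in standard deviations.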
