首页 > 其他分享 >SciTech-Mathmatics-Probability+Statistics-Population-Sampling-Population vs. Sample: What's the

SciTech-Mathmatics-Probability+Statistics-Population-Sampling-Population vs. Sample: What's the

时间:2024-09-09 19:13:19浏览次数:1  
标签:What Statistics might sample collect every Population data population

Difference: Population vs. Sample

BY ZACH BOBBITTPOSTED ON NOVEMBER 27, 2020

Often in statistics we're interested in collecting data so that we can answer some research question.
For example, we might want to answer the following questions:

  1. What is the median household income in Miami, Florida?
  2. What is the mean weight of a certain population of turtles?
  3. What percentage of residents in a certain county support a certain law?

In each scenario, we are interested in answering some question about a population, which represents every possible individual element that we're interested in measuring.

However, instead of collecting data on every individual in a population we instead collect data on a sample of the population, which represents a portion of the population.

Population: Every possible individual element that we are interested in measuring.
Sample: A portion of the population.

Here is an example of a population vs. a sample in the three intro examples.

Three Examples

  1. What is the median household income in Miami, Florida?
    The entire population might include 500,000 households,
    but we might only collect data on a sample of 2,000 total households.
  2. What is the mean weight of a certain population of turtles?
    The entire population might include 800 turtles,
    but we might only collect data on a sample of 30 turtles.
  3. What percentage of residents in a certain county support a certain law?
    The entire population might include 50,000 residents,
    but we might only collect data on a sample of 1,000 residents.

Why Use Samples?

There are several reasons that we typically collect data on samples instead of entire populations, including:

  1. It is too time-consuming to collect data on an entire population. For example, if we want to know the median household income in Miami, Florida, it might take months or even years to go around and gather income for each household. By the time we collect all of this data, the population may have changed or the research question of interest might no longer be of interest.
  2. It is too costly to collect data on an entire population. It is often too expensive to go around and collect data for every individual in a population, which is why we instead choose to collect data on a sample instead.
  3. It is unfeasible to collect data on an entire population. In many cases it's simply not possible to collect data for every individual in a population. For example, it may be extraordinarily difficult to track down and weigh every turtle in a certain population that we're interested in.

By collecting data on samples, we're able to gather information about a given population much faster and cheaper.

And if our sample is representative of the population, then we can generalize the findings from a sample to the larger population with a high level of confidence.

The Importance of Representative Samples

When we collect a sample from a population,
we ideally want the sample to be like a "mini version" of our population.

For example, suppose we want to understand the movie preferences of students in a certain school district that has a population of 5,000 total students. Since it would take too long to survey every individual student, we might instead take a sample of 100 students and ask them about their preferences.

If the overall student population is composed of 50% girls and 50% boys, our sample would not be representative if it included 90% boys and only 10% girls.

Or if the overall population is composed of equal parts freshman, sophomores, juniors, and seniors, then our sample would not be representative if it only included freshman.

A sample is representative of a population if the characteristics of the individuals in the sample \(\large closely\ matches\) the characteristics of the individuals in the overall population.

When this occurs, we can generalize the findings from the sample to the overall population with confidence.

How to Obtain Samples

There are many different methods we can use to obtain samples from populations.

To maximize the chances that we obtain a representative sample, we can use one of the three following methods:

  • Simple random sampling:Randomly select $\large individuals $ through the use of \(\large a\ random\ number\ generator\) or \(\large some\ means\ of\ random\ selection\).

  • Stratified random sampling: Split \(\large a\ population\) into \(\large groups\). Randomly select some \(\large members\) from \(\large each\ group\) to be in the sample.

  • Systematic random sampling: Put every member of a population into some order. Choose a random starting point and select every \(\large n\)th member to be in the sample.

In each of these methods, every individual in the population has an equal probability of being included in the sample. This maximizes the chances that we obtain a sample that is a “mini version” of the population.

标签:What,Statistics,might,sample,collect,every,Population,data,population
From: https://www.cnblogs.com/abaelhe/p/18405119

相关文章

  • MAST90083: Computational Statistics and Data Science
    SchoolofMathematicsandStatisticsMAST90083:ComputationalStatisticsandDataScienceAssignment1Duedate:Nolaterthan08:00amonMonday9thSeptember2024Weight:20%Question1LinearRegressionInthisquestion,wewillapplylinearregression,......
  • SciTech-Mathmatics-Probability+Statistics-Population-Sampling-Types of Sampling
    TypesofSamplingMethods(WithExamples)BYZACHBOBBITTPOSTEDONSEPTEMBER24,2018Researchersareofteninterestedinansweringquestionsaboutpopulationslike:Whatistheaverageheightofacertainspeciesofplant?Whatistheaverageweightofa......
  • SciTech-Mathmatics-Probability+Statistics-Population-Sampling of Region of Popul
    SciTech-Mathmatics-Probability+Statistics-Population-Sample:SampleProportionvs.SampleMean:TheDifferenceSampleProportionvs.SampleMean:TheDifferenceBYZACHBOBBITTPOSTEDONMAY5,2021TwotermsthatareoftenusedinstatisticsareSamplePr......
  • SciTech-Mathmatics-Probability+Statistics: Differences: Population VS Sampling(D
    SciTech-Mathmatics-Probability+Statistics:Differences:PopulationVSSampling(DistributionofParameters)Likelyhood(Assumption)VSProbability(ModelofSolidParameters)Likelihoodvs.Probability:What'stheDifference?BYZACHBOBBITTPOSTEDONAU......
  • What is Convolutional Neural Network(CNN)?
    笔记核心部分摘抄以及自己的理解[附有样例,可以轻松理解]:卷积网络的卷积层中使用了卷积操作,这个操作可以捕捉到图像中的局部特征而不受其位置的影响。在外围边缘补充若干圈0,方便从初始位置以步长为单位可以刚好滑倒末尾位置,通俗地讲就是为了总长能被步长整除。池化层通过减......
  • SciTech-Mathmatics-Probability+Statistics: Statistical Inference统计推断- Estima
    轻松学统计:https://zh-cn.statisticseasily.com/词汇表/什么是统计推断/StatisticalInference:SI(统计推断)的类型SI(统计推断)主要有两种类型:Estimation:根据样本数据确定总体的特征;PointEstimation:提供总体参数的单一值估计;ConfidenceInterval:提供......
  • 【Python】ValueError:Sample larger than population or is negative的原因
    ValueError:Samplelargerthanpopulationorisnegative的原因以及解决办法1.出现这几天重构kook-valorant-bot的代码的时候,遇到了这个问题Job"vip_roll_task(trigger:interval[0:01:20],nextrunat:2023-01-2519:59:21CST)"raisedanexceptionTraceback(mostrec......
  • What's new in PikiwiDB(Pika) v3.5.5
    尊敬的社区朋友及PikiwiDB(Pika)用户们:非常高兴地宣布---PikiwiDB(Pika)【下文简称Pika】长期维护版本v3.5的最新版本---v3.5.5今天正式发布。在这个版本中,除了修复所有已知的主从复制bug外,通过引入RTC特性其性能又有了大幅度提升,且主从failover机制兼容了Redis-Se......
  • [ICPC2020 WF] What's Our Vector, Victor?
    给出\(d\)维空间的\(n\)个点及它们到某个定点的距离,你需要解出这个定点的坐标。保证有解,任意输出一组解即可。\(n,d\le500\),所有点的坐标(包括定点)随机。拜谢zhy大师!给出了一个十分简单好理解的做法!线代不好怎么办?我们可以猜猜结论!首先这道题相当于是给了\(n\)个形......
  • SciTech-Mathmatics-Probability+Statistics: Understanding the $\large Null\ and
    NullHypothesisforLinearRegressionLinearregressionisatechniquewecanusetounderstandtherelationshipbetweenoneormorepredictorvariablesandaresponsevariable.SimpleLinearRegressionIfweonlyhaveonepredictorvariableandonerespo......