Introduction to Sampling
Sampling is the process of selecting a subset of individuals from a larger population to study. Since it's usually impossible to study entire populations, researchers use carefully selected samples to make inferences about populations. Understanding sampling is essential for conducting valid research and interpreting findings appropriately.
Key Terminology
Population
Definition: The entire group you want to study and draw conclusions about
Examples:
- All university students in Thailand
- All patients with diabetes worldwide
- All small businesses in Bangkok
- All high school teachers in the United States
Note: Populations can be finite (countable) or infinite (theoretical)
Sample
Definition: A subset of the population that you actually study
Examples:
- 500 university students from 5 Bangkok universities
- 200 diabetes patients from two hospitals
- 50 small businesses in the Sukhumvit area
- 300 high school teachers from California
Goal: Sample should be representative of the population
Sampling Frame
Definition: A list of all elements in the population from which you'll draw your sample
Examples:
- Student enrollment database
- Hospital patient registry
- Business registration list
- Telephone directory
- Email list of organization members
Important: Sampling frame may not perfectly match the target population
Sampling Unit
Definition: The element or set of elements considered for selection at each stage
Examples:
- Individual persons
- Households
- Schools or classrooms
- Organizations
- Geographic areas
Parameter
Definition: A numerical characteristic of the population (what you want to know)
Examples:
- Population mean (μ)
- Population proportion (p)
- Population standard deviation (σ)
Note: Usually unknown—that's why we sample!
Statistic
Definition: A numerical characteristic of the sample (what you calculate from data)
Examples:
- Sample mean (x̄)
- Sample proportion (p̂)
- Sample standard deviation (s)
Goal: Use statistics to estimate parameters
Why We Sample
Cost-Effective
Studying entire populations is usually prohibitively expensive. Sampling dramatically reduces costs while still providing accurate estimates.
Example: Rather than surveying all 5 million university students in a country, survey 1,000 students for reliable estimates at a fraction of the cost.
Time-Efficient
Collecting data from entire populations takes too long. Sampling allows timely completion of research projects.
Example: Election polls sample voters to predict outcomes quickly rather than waiting for everyone to vote.
Practical Feasibility
Some populations are impossible to access completely or don't have complete lists.
Example: No complete list exists of all people with depression, so sampling from accessible sources is necessary.
Sometimes More Accurate
Well-designed samples with careful data collection can be more accurate than careless population censuses.
Example: A careful sample survey with high response rates may be more accurate than a census with many non-responses.
Destructive Testing
When testing destroys the item, you must sample rather than test everything.
Example: Testing battery life requires using batteries until they die—can't test every battery produced!
Infinite Populations
Some populations are theoretical and infinite, making sampling the only option.
Example: All possible measurements of a physical constant or all potential outcomes of a process.
When NOT to Sample: Census
Use a Census (Study Everyone) When:
- Population is small: With only 30 people in your department, just survey everyone
- Resources allow: You have sufficient time and money for complete enumeration
- Precision required: Need exact counts, not estimates (e.g., organizational records)
- Political/ethical reasons: Everyone should have opportunity to participate (e.g., employee satisfaction surveys)
- High variability: Population so diverse that large sample would be needed anyway
Representative Samples
What Makes a Sample Representative?
A representative sample accurately reflects the characteristics of the population it's drawn from. Key characteristics should be present in the sample in the same proportions as in the population.
Example: University Student Population
| Characteristic | Population | Representative Sample | Biased Sample |
|---|---|---|---|
| Gender | 55% Female, 45% Male | 54% Female, 46% Male ✓ | 70% Female, 30% Male ✗ |
| Year Level | 30% 1st, 25% 2nd, 25% 3rd, 20% 4th | 29% 1st, 26% 2nd, 24% 3rd, 21% 4th ✓ | 50% 1st, 30% 2nd, 15% 3rd, 5% 4th ✗ |
| Major | 40% STEM, 35% Social Sci, 25% Humanities | 39% STEM, 36% Social Sci, 25% Humanities ✓ | 60% STEM, 25% Social Sci, 15% Humanities ✗ |
Warning: Convenience Samples Are Rarely Representative
Just because a sample is large doesn't mean it's representative. A sample of 10,000 people recruited from social media may be less representative than a carefully selected random sample of 400 people.
Two Main Sampling Approaches
Probability Sampling
Every member of the population has a known, non-zero chance of being selected
Characteristics:
- Random selection
- Known probability of selection
- Allows generalization to population
- Can calculate sampling error
- More rigorous and defensible
Use when: Generalizability is important and you have access to sampling frame
Gold standard for quantitative research
Non-Probability Sampling
Not every member has a known or equal chance of being selected
Characteristics:
- Non-random selection
- Unknown probability of selection
- Limited generalizability
- Cannot calculate sampling error statistically
- More practical and accessible
Use when: Exploratory research, hard-to-reach populations, qualitative studies, or practical constraints prevent probability sampling
Very common in practice, especially qualitative research
Choosing Between Probability and Non-Probability
The choice depends on:
- Research goals: Need to generalize to population or explore in-depth?
- Resources: Time, money, and access to sampling frame
- Population characteristics: Accessible or hard-to-reach?
- Research design: Quantitative hypothesis-testing or qualitative exploration?
Remember: Non-probability sampling doesn't mean bad sampling. It's appropriate for many research purposes—just be clear about limitations.