Wednesday, April 18, 2007

Systematic Sampling



Definition-

Any sample drawn from a list using a random start and a fixed sampling interval. This differs from simple random sampling in not giving equal probability of selection to all possible samples which could be taken from the population. This method is widely used in the UK. This type of sampling involves calculating the sample interval, by finding the ratio of the population to the sample. A random number is then chosen between 1 and the sampling interval. ...



Meaning-
Methodology for sampling in which units are selected from the population at a regular interval (e.g., once an hour, every other lot, etc.). ! From the sampling frame, a starting point is chosen at random, and thereafter at regular intervals.
For example,
Suppose you want to sample 8 houses from a street of 120 houses.
120/8=15, so every 15th house is chosen after a random starting point between 1 and 15. If the random starting point is 11, then the houses selected are 11, 26, 41, 56, 71, 86, 101, and 116. If there were 125 houses, 125/8=15.625, so should you take every 15th house or every 16th house? If you take every 16th house, 8*16=128 so there is a risk that the last house chosen does not exist. To overcome this the random starting point should be between 1 and 10. On the other hand if you take every 15th house, 8*15=120 so the last five houses will never be selected. The random starting point should now be between 1 and 20 to ensure that every house has some chance of being selected.
Systematic sampling is especially useful in situations when you want to:
  • Sample over extended periods of time.
  • Reduce the impact of a variety of conditions to prevent systematic errors from influencing the analysis of the data.
  • One reason for this type of sampling is to see if the process has any variations due to different times of the day, shift workers changing, temperature changes during day vs. night times, etc.

Advantages.
Spreads the sample more evenly over the population
easier to conduct than a simple random sample
Easier, faster and less chance of making a sampling mistake, compared with SRS.
Could give a more precise estimate than SRS.

Disadvantages
the system may interact with some hidden pattern in the population, e.g. every third house along the street might always be the middle one of a terrace of three
Could be less precise than SRS
Effect of periodicity (bias caused by particular characteristics arising in the sampling frame at regular units). An example of this would occur if you used a sampling frame of adult residents in an area composed of predominantly couples or young families. If this list was arranged: Husband / Wife / Husband / Wife etc. and if every tenth person was to be interviewed, there would be an increased chance of males being selected.


3 comments:

Hemangi said...

hey ! i thought we were supposed to discuss this abt ssystematic smapling..you have already put up a post about systematic sampling..!! its good though..but i was looking for some kind of interaction...

jailee said...
This comment has been removed by the author.
Hemangi said...

Jailee,
Hey Jailee, I’m really happy that you managed to do systematic sampling on your own and also made efforts to communicate with me about the proceedings of it, somehow due to some communication gap it never worked out. But I’m glad you went ahead and still uploaded the post.
I’d like to say that the first picture you uploaded explains systematic sampling well. I just had a doubt about the illustration of a clock in that picture, does it signify the interval that systematic sampling uses in its procedure or does it show something else, please tell me if you have any ideas on this one.
The house example is also good. Further, you have pointed out times and conditions when systematic sampling can be used and have presented three bullets for it. Out of that I do not understand the first and the third.
Could you please elaborate on how systematic sampling can help if we want to sample over extended periods of time. And also how will this sampling type help me if I wanted to find out see if the process has any variations due to different times of the day?
The advantages and disadvantages seem to be rightly pointed out, but you do not seem to mention what is SRS (check if the full form is mentioned or not).

I would also request you to give 2 web resources as I had earlier said in my post.