Home/Notes/Collection and Organisation of Data
Board Exam Notes

Collection and Organisation of Data Notes

Questions

3–5 questions per paper

Difficulty

Easy

Importance

Foundation for Class 11/12 statistics

Overview

Collection and Organisation of Data forms the foundation of statistical analysis in the NCERT curriculum. Mastering this topic is essential as it serves as the prerequisite for calculating measures of central tendency and dispersion, which carry significant weight in board examinations.

Primary vs Secondary Data

Data can be classified based on its source and original purpose. Primary data is collected specifically for the research objective, while secondary data is sourced from existing records or published documents.

  • Primary data: Original, time-consuming, and costly to collect
  • Secondary data: Pre-existing, economical, but requires careful verification
  • Example of Primary: Personal interviews or census enumeration
  • Example of Secondary: Government publications or NGO reports

Methods of Data Collection

Researchers use specific methodologies to gather raw data depending on the scope of the study. Each method has distinct advantages regarding accuracy and sample coverage.

  • Direct Personal Investigation: High accuracy, small coverage
  • Indirect Oral Investigation: Useful for hostile or large areas
  • Mailed Questionnaire: Cost-effective for widespread respondents
  • Information from Local Sources: Quick but lacks precision

Frequency Distribution

Frequency distribution organizes raw data into groups to identify patterns. Converting raw data into a continuous series is a critical skill for calculating further statistical parameters like mean and median.

  • Class interval: Difference between upper and lower limit
  • Frequency: Number of observations in a class
  • Mid-point: (Upper limit + Lower limit) / 2
  • Exclusive series: Upper limit of one class is lower limit of next
  • Inclusive series: Both limits included; requires conversion to exclusive

Tabulation of Data

Tabulation is the systematic arrangement of data in rows and columns to enable comparison and logical analysis. A well-constructed table must be self-explanatory and contain standard components.

  • Table number and title are mandatory for identification
  • Headnote indicates units of measurement
  • Stub: Designates rows
  • Caption: Designates columns
  • Body: Contains the actual numerical data

Formula Sheet

Mid-value = (L1 + L2) / 2

Range = Maximum Value - Minimum Value

Total Frequency (N) = Σf

Exam Tip

Always ensure your frequency distribution is continuous (Exclusive Series) before attempting any further statistical calculations like Mean or Median.

Common Mistakes

  • Confusing inclusive series with exclusive series and failing to convert them before calculating mid-points
  • Ignoring the necessity of a 'Footnote' or 'Source' note in formal tabulation questions
  • Misidentifying secondary data sources as primary simply because they are used in current research

More Revision Notes

Ready to test yourself?

Play topic-wise Collection and Organisation of Data questions in Aspirant Arcade — gamified MCQ practice.

Download Free