Questions
3–5 questions per paper
Difficulty
Easy
Importance
Foundation for Class 11/12 statistics
Overview
Collection and Organisation of Data forms the foundation of statistical analysis in the NCERT curriculum. Mastering this topic is essential as it serves as the prerequisite for calculating measures of central tendency and dispersion, which carry significant weight in board examinations.
Primary vs Secondary Data
Data can be classified based on its source and original purpose. Primary data is collected specifically for the research objective, while secondary data is sourced from existing records or published documents.
- Primary data: Original, time-consuming, and costly to collect
- Secondary data: Pre-existing, economical, but requires careful verification
- Example of Primary: Personal interviews or census enumeration
- Example of Secondary: Government publications or NGO reports
Methods of Data Collection
Researchers use specific methodologies to gather raw data depending on the scope of the study. Each method has distinct advantages regarding accuracy and sample coverage.
- Direct Personal Investigation: High accuracy, small coverage
- Indirect Oral Investigation: Useful for hostile or large areas
- Mailed Questionnaire: Cost-effective for widespread respondents
- Information from Local Sources: Quick but lacks precision
Frequency Distribution
Frequency distribution organizes raw data into groups to identify patterns. Converting raw data into a continuous series is a critical skill for calculating further statistical parameters like mean and median.
- Class interval: Difference between upper and lower limit
- Frequency: Number of observations in a class
- Mid-point: (Upper limit + Lower limit) / 2
- Exclusive series: Upper limit of one class is lower limit of next
- Inclusive series: Both limits included; requires conversion to exclusive
Tabulation of Data
Tabulation is the systematic arrangement of data in rows and columns to enable comparison and logical analysis. A well-constructed table must be self-explanatory and contain standard components.
- Table number and title are mandatory for identification
- Headnote indicates units of measurement
- Stub: Designates rows
- Caption: Designates columns
- Body: Contains the actual numerical data
Formula Sheet
Mid-value = (L1 + L2) / 2
Range = Maximum Value - Minimum Value
Total Frequency (N) = Σf
Exam Tip
Always ensure your frequency distribution is continuous (Exclusive Series) before attempting any further statistical calculations like Mean or Median.
Common Mistakes
- Confusing inclusive series with exclusive series and failing to convert them before calculating mid-points
- Ignoring the necessity of a 'Footnote' or 'Source' note in formal tabulation questions
- Misidentifying secondary data sources as primary simply because they are used in current research
More Revision Notes
Ready to test yourself?
Play topic-wise Collection and Organisation of Data questions in Aspirant Arcade — gamified MCQ practice.
Download Free