Probability Distributions & Descriptive Statistics

22 important questions on Probability Distributions & Descriptive Statistics

What are descriptive vs inferential statistics?

Descriptive: summarize data (mean sales per month)
Inferential: draw conclusions about a population from a sample (estimating national average income)

What is a population and what is a sample?

  • Population (N): the entire group you want to study.
  • Sample (n): a smaller subset used to estimate the population.
  • What are the most common probability distributions?

    Table:
    • Higher grades + faster learning
    • Never study anything twice
    • 100% sure, 100% understanding
    Discover Study Smart

    What is the mean, variance, and standard deviation?

    ​ They measure the center and spread of a distribution.

    What is the Central Limit Theorem (CLT)?

    When n is large, the sample mean
    Xˉ\bar{X}  is approximately normally distributed:

    ✅ This allows us to use normal theory for inference even when the population isn’t normal.

    What is a point estimate and an interval estimate?

  • Point estimate: a single best guess (e.g. Xˉ\bar{X} X ˉ)
  • Interval estimate (Confidence Interval): a range that likely contains the true mean.
  • Formula for a confidence interval (for μ):

    Formula:

    What is hypothesis testing?

    A method to test if a claim about a population parameter is supported by sample data.

    What is the 5-step plan?

    1️⃣ State H0H_0 (null) and H1H_1 (alternative)
    2️⃣ Choose α (significance level)
    3️⃣ Calculate test statistic (z or t)
    4️⃣ Compare with critical value or find p-value
    5️⃣ Conclude (reject or not reject H0H_0 )

    What is α (alpha level)?

    The significance level, or the maximum risk of rejecting a true null hypothesis.

    What is simple linear regression?

    Models the relationship between a dependent variable (Y) and one independent variable (X):
    Y i​ = β 0​ + β 1​ X i​ + ϵ i​
    β0​ : intercept
    β1 slope (effect of X on Y)
    ϵi random error term

    How do you interpret β1​?

    β 1​ = average change in Y for a 1-unit increase in X.
    Example: if β₁ = 2.5 → each extra euro spent on ads increases sales by 2.5 units.

    What is R² (coefficient of determination)?

    Formula:

    What is the F-test in regression?

    Tests if the overall regression model is significant:
    H0:β1=β2=...=0H_0: \beta_1 = \beta_2 = ... = 0 If p < α, the model explains significant variation in Y.

    What happens if assumptions are violated?

    • Non-linear → model misspecification
    • Unequal variance → biased standard errors
    • Multicollinearity → unstable β estimates
      → check with plots and VIF (Variance Inflation Factor)

    What are the main regression assumptions?

    1. Linearity
    2. Independence of errors
    3. Homoscedasticity (equal variance)
    4. Normality of residuals
    5. No multicollinearity (in MLR)

    What is ANOVA in regression?

    ANOVA (Analysis of Variance) decomposes variation:
    SST = SSR + SSE
    • SST: total variation
    • SSR: explained by regression
    • SSE: unexplained (residual)

    What is the difference between t-test and F-test?

    • t-test → one coefficient (is β₁ ≠ 0?)
    • F-test → multiple coefficients at once (is the model significant?)

    What are confidence intervals used for in regression?

    To estimate the likely range of β₁ values:

    What reduces sampling error?

    Larger sample size (n↑) & Better sampling method (random selection)

    What is a Type I and Type II error?

    I = rejecting H0 when true
    II = failing to reject a false H0

    What is the standard error of the mean?

    Shows how much the sample mean varies between samples.

    The question on the page originate from the summary of the following study material:

    • A unique study and practice tool
    • Never study anything twice again
    • Get the grades you hope for
    • 100% sure, 100% understanding
    Remember faster, study better. Scientifically proven.
    Trustpilot Logo