datarekha

Independence, Span, Basis & Dimension

Linear independence, span, basis and dimension in one pass — plus why orthonormal sets are automatically independent and why a space has many valid bases, not one.

8 min read Intermediate GATE DA Lesson 20 of 122

What you'll learn

  • Linear independence: the only solution to the sum of c_i v_i = 0 is all c_i = 0
  • Span, basis (an independent spanning set), and dimension (the count of basis vectors)
  • Orthonormal sets are mutually perpendicular unit vectors and are automatically independent
  • A vector space has MANY valid bases — independence is weaker than orthogonality

Before you start

Pick a handful of arrows. How small can you make that handful and still reach every point in the space using only additions and scalings of them? That’s the question this lesson lives on. Four tightly linked words come out of it — independence, span, basis, dimension — and GATE tests them as a cluster. The mental picture is simple: independent vectors each point in a genuinely new direction the others can’t reach, and a basis is just enough of those to build the whole space without anything redundant. This is the exact language data scientists use when two features are perfectly correlated (one is a multiple of another — redundant, “dependent”) or when they ask how many truly independent directions a dataset actually has.

Linear independence — no redundant directions

Vectors v₁, …, vₙ are linearly independent when no one of them is a combination of the others. The crisp algebraic test: the only way to make

c₁v₁ + c₂v₂ + … + cₙvₙ = 0

is to take all coefficients c_i = 0. If some nonzero choice of coefficients also gives zero, the vectors are dependent — at least one is redundant.

Independent: span a planev1v2different directions fill the whole planeDependent: only a lineu2ucollinear: the second adds no new direction
Independent vectors open up a plane; collinear (dependent) vectors stay stuck on one line.

Drag the two arrows. When they point in different directions, every combination of them fills out the plane — independence in action. Now drag one to lie on top of (or opposite) the other and the combinations collapse onto a single line: the hallmark of dependence.

Span, basis, dimension

The span of a set of vectors is everything you can build from them by addition and scaling — all their linear combinations. Two independent vectors in span the whole plane; one nonzero vector spans only a line.

A basis of a space is a set that is both linearly independent and spans the space — enough vectors to reach everything, with none redundant. The dimension is simply the number of vectors in a basis (every basis of a given space has the same count). has dimension 2, has dimension 3.

A space has many valid bases — any set of n independent vectors that spans an n-dimensional space qualifies. There is nothing unique about the “standard” one.

Orthonormal sets — perpendicular and unit length

A set is orthonormal when the vectors are mutually perpendicular (every pair has dot product 0) and each has length 1. The standard basis (1,0) and (0,1) is orthonormal. A key fact GATE tests: an orthonormal set is automatically linearly independent — perpendicular directions can never be combinations of one another. But the converse is weaker: independent vectors need not be orthogonal.

How GATE asks this

GATE DA poses these as MCQ/MSQ items — and it appeared in 2025. Typical prompts: “are these vectors linearly independent?”, “which of these sets form a basis of ?”, or a multi-select on properties of orthonormal sets. The 2025 trap played on uniqueness — assuming a space has a single orthonormal basis. Remember: many orthonormal bases exist (rotate the standard one and it is still orthonormal). To check independence fast for n vectors in Rⁿ, form a matrix and ask whether its determinant is nonzero (nonzero means independent).

Worked example — independent vs dependent

(1) Are (1, 0) and (1, 1) independent, and do they form a basis of ? (2) Are (1, 2) and (2, 4) independent?

Pair 1 — independent, and a basis. Solve c₁(1,0) + c₂(1,1) = (0,0). The second coordinate gives c₂ = 0; the first then gives c₁ + 0 = 0, so c₁ = 0. Only the all-zero solution exists, so they are independent. Two independent vectors in the 2-dimensional space automatically span it, so they form a basis — note this is a different basis from the standard (1,0), (0,1).

Pair 2 — dependent. Observe (2, 4) = 2·(1, 2): the second vector is exactly twice the first. So 2·(1,2) − 1·(2,4) = (0,0) is a nonzero combination giving zero. They are dependent and span only a line, not the plane.

det [1 1]  = 1·1 − 0·1 = 1  ≠ 0   →  independent (a basis of R²)
    [0 1]

det [1 2]  = 1·4 − 2·2 = 0        →  dependent
    [2 4]

Quick check

Quick check

0/6
Q1How many vectors are in any basis of R³ (i.e. its dimension)?numerical answer — type a number
Q2What is the maximum number of linearly independent vectors among (1,2), (2,4), (0,1) in R²?numerical answer — type a number
Q3Which statements are TRUE? (select all that apply)select all that apply
Q4Are the vectors (1, 2) and (3, 6) linearly independent?
Q5Which sets form a basis of R²? (select all that apply)select all that apply
Q6Three nonzero vectors lie in R². What can you conclude about their linear independence?

Practice this in an interview

All questions
What are the core assumptions of linear regression, and what breaks when each is violated?

OLS linear regression rests on five assumptions: linearity, independence of errors, homoscedasticity, normality of residuals, and no perfect multicollinearity. Violating any one of them degrades coefficient estimates, standard errors, or the validity of hypothesis tests.

What is PCA, when should you use it, and what are its key limitations?

PCA finds the orthogonal directions of maximum variance in the data and projects onto a lower-dimensional subspace, reducing features while retaining most information. It is most useful before distance-based models or when training is bottlenecked by dimensionality. Its main limits are loss of interpretability, sensitivity to scale, and an assumption of linear structure.

What is the curse of dimensionality, and how does it affect machine learning models?

As the number of features grows, the volume of the feature space increases exponentially, so training data becomes exponentially sparse. Distance-based algorithms degrade because points become approximately equidistant; density estimation requires data that grows exponentially; and overfitting risk rises for any fixed training set size.

What are 1NF, 2NF, and 3NF, and when would you intentionally denormalize?

1NF eliminates repeating groups and requires atomic column values. 2NF further removes partial dependencies on a composite key. 3NF removes transitive dependencies — every non-key column must depend on the key, the whole key, and nothing but the key. Denormalization trades update anomalies for read performance, and is appropriate when the read path dominates and write correctness can be enforced at the application layer or with materialized views.

Sign in to track your progress

Completed lessons, your XP, level, and streak save to your account — it's free and takes a few seconds.

Explore further

Related lessons

Skip to content