4
Data Science
Unit 4 - Data Merging
201+ practice questions available
Sample Practice Questions
5 questions shown — tap “Show Answer” to check yours.
1
Easy
Q3 (3rd quartile) corresponds to which percentile?
A50th
B60th
C75th
D99th
2
Easy
'Duplicate rows' after merge can result from:
AOne-to-many relationships in the join
BNo data
CEmpty tables
DMean values only
3
Medium
Standardising before clustering is needed because:
ADistance-based methods are sensitive to feature scale
BAlways optional
CHas no effect
DIncreases bias
4
Medium
Empirical CDF at value v is:
AFraction of data ≤ v
BMean
CMode
DRange
5
Hard
Best way to merge datasets that have similar but not identical strings as keys (e.g., 'New York' vs 'New york'):
AFuzzy matching or string normalisation before merge
BDirect match
CSkip the merge
DInner join only
Practice all 201+ Unit 4 - Data Merging questions
Adaptive difficulty, instant explanations, XP rewards, chapter mastery tracking.
Start Free — No Credit Card →