4

Data Science

Unit 4 - Data Merging

201+ practice questions available

Sample Practice Questions

5 questions shown — tap “Show Answer” to check yours.

1
Easy

Q3 (3rd quartile) corresponds to which percentile?

A50th
B60th
C75th
D99th
2
Easy

'Duplicate rows' after merge can result from:

AOne-to-many relationships in the join
BNo data
CEmpty tables
DMean values only
3
Medium

Standardising before clustering is needed because:

ADistance-based methods are sensitive to feature scale
BAlways optional
CHas no effect
DIncreases bias
4
Medium

Empirical CDF at value v is:

AFraction of data ≤ v
BMean
CMode
DRange
5
Hard

Best way to merge datasets that have similar but not identical strings as keys (e.g., 'New York' vs 'New york'):

AFuzzy matching or string normalisation before merge
BDirect match
CSkip the merge
DInner join only

Practice all 201+ Unit 4 - Data Merging questions

Adaptive difficulty, instant explanations, XP rewards, chapter mastery tracking.

Start Free — No Credit Card →