Interview prep
Pandas & Data Wrangling interview questions
25 of the most common Pandas & Data Wrangling questions for data and AI interviews — each with a worked answer, the trap to avoid, and a link to learn it properly. Cleaning, reshaping, GroupBy, joins, performance.
Filter by role
- How does boolean indexing work in pandas, and what are the common pitfalls? Easy ·Google·Netflix·Uber
- How do you apply multiple aggregation functions to different columns in a single GroupBy call? Easy ·Amazon·Walmart·Netflix
- How do you detect and remove duplicate rows in pandas, and how do you control which duplicate to keep? Easy ·Amazon·Accenture·Walmart
- What is the difference between loc and iloc in pandas, and when should you use each? Easy ·Amazon·Microsoft·Walmart
- How do common SQL operations map to pandas, and when should you use SQL instead of pandas? Easy ·Amazon·Google·Meta
- What is the split-apply-combine model in pandas GroupBy? Easy ·Amazon·Google·Microsoft
- How do you work with string data in pandas using the .str accessor, and how does it compare to applying Python string methods manually? Easy ·Amazon·Google·Meta
- What is the difference between wide and long (tidy) data formats, and why does it matter for analysis? Easy ·Meta·Netflix·Airbnb
- When should you use apply, map, or applymap versus vectorized pandas operations, and what are the performance implications? Medium ·Google·Uber·Airbnb
- How does the categorical dtype reduce memory and speed up operations in pandas? Medium ·Netflix·Lyft·DoorDash
- How does the category dtype work in pandas and when should you use it? Medium ·Databricks·Amazon·Snowflake
- How do you parse, manipulate, and extract features from datetime columns in pandas? Medium ·Amazon·Microsoft·Uber
- When should you use pandas eval() and query(), and what are their limitations? Medium ·Palantir·Two Sigma·Citadel
- What are the different strategies for handling missing data in pandas — isna, fillna, dropna, and interpolate? Medium ·Amazon·Microsoft·Databricks
- What merge types does pandas support, and what does the validate parameter do? Medium ·Airbnb·Stripe·Shopify
- What is the difference between merge, join, and concat in pandas? Medium ·Google·Meta·Microsoft
- What is method chaining in pandas and how do you use pipe() to extend it? Medium
- How do GroupBy and multi-index interact in pandas, and how do you flatten a multi-index result? Medium ·Bloomberg·Two Sigma·Databricks
- How does Polars differ from pandas, and when should you choose one over the other? Medium ·Databricks·Stripe·Anthropic
- What is the difference between pivot, pivot_table, and melt in pandas, and when do you use each? Medium ·Amazon·Meta·Microsoft
- How do you reduce memory usage in a pandas DataFrame using dtypes, category encoding, and downcasting? Medium ·Databricks·Snowflake·Amazon
- How do rolling and expanding windows work in pandas, and when do you use each? Medium ·Two Sigma·Goldman Sachs·Spotify
- What causes SettingWithCopyWarning in pandas and how do you fix it? Medium ·Airbnb·LinkedIn·Stripe
- What is the difference between GroupBy transform and agg in pandas? Medium ·Amazon·Uber·Airbnb
- Why is pandas slow, and what are the main strategies to speed it up? Hard ·Meta·Uber·Databricks
No questions tagged for that role yet.