datarekha
Blog Production AI · GATE DA · how concepts actually work

From theory to the systems that demand real understanding.

Long-form pieces on how production-AI teams orchestrate agents, serve models, and build the stack, plus GATE DA essays that turn probability, linear algebra, DBMS, ML, and AI into durable concepts.

346 posts · 13 categories
Latest Infrastructure · Jun 8, 2026

AI's real bottleneck isn't intelligence — it's electricity

AI's hardest 2026 limit isn't chips or money — it's electricity. Why data-center power is the bottleneck, and how inference makes every AI query an energy cost.

7 min read
Read story
LLMs Jun 8, 2026

Attention is O(n²) — and Mamba's linear escape

Attention costs O(n²), so long context gets expensive fast. State-space models like Mamba scale linearly — and 2026's winning architectures are hybrids of both.

9 min read Read
LLMs Jun 8, 2026

Beyond next-token: world models and the next paradigm

World models predict the next state of the world, not the next token — making them simulators agents can plan inside. The two camps racing past LLMs in 2026.

8 min read Read
LLMs Jun 8, 2026

Diffusion language models: when AI writes text all at once

Diffusion language models generate text all at once, refining a masked sequence over a few parallel steps — hitting 1,000+ tokens/sec versus left-to-right LLMs.

8 min read Read
LLMs Jun 8, 2026

Reading a model's mind: sparse autoencoders explained

Sparse autoencoders pull human-readable features out of an LLM's tangled activations — the breakthrough tool of mechanistic interpretability.

8 min read Read
LLMs Jun 8, 2026

o3-level reasoning on your laptop: how distillation works

Reasoning distillation trains a small model on a big model's chains of thought — putting o3-level reasoning on a laptop, and beating bigger models.

8 min read Read
LLMs Jun 8, 2026

RLHF is being replaced: how DPO teaches models what good means

RLHF aligned chat models with a reward model and a fragile RL loop. DPO drops both, learning the same preferences directly from chosen-vs-rejected pairs.

8 min read Read
LLMs Jun 8, 2026

The big-model era is ending: the rise of small models

Small language models fine-tuned for a task now beat giants on it — on your laptop or phone, cheaper and private. Why bigger isn't always better in 2026.

7 min read Read
LLMs Jun 8, 2026

Why your AI can't learn after training: catastrophic forgetting

A trained model's weights are frozen, and fine-tuning erases old skills — catastrophic forgetting. Why continual learning is AI's open problem in 2026.

8 min read Read
LLMs Jun 8, 2026

Test-time compute: why thinking longer beats thinking bigger

Test-time compute lets a model think before answering, so a small reasoning model can beat a far larger one. How inference-time scaling works.

10 min read Read
Business Analytics Jun 7, 2026

A/B testing in practice: sample size, p-values, and the traps

A/B testing done wrong wastes months of effort. Master p-values, sample size, and the six traps — peeking, SRM, and more — before your next experiment.

13 min read Read
Business Analytics Jun 7, 2026

Cohort analysis: how to actually read a retention curve

Cohort analysis reveals what aggregate retention metrics hide — learn to build a cohort table, read a retention curve, and spot product-market fit signals.

12 min read Read
Time Series Jun 7, 2026

Evaluating forecasts: MAE, RMSE, MAPE, and honest backtesting

A practical guide to forecast accuracy using MAE RMSE MAPE and MASE, plus rolling-origin backtesting to avoid self-deception in time series.

12 min read Read
CLI Jun 7, 2026

find and xargs: bulk file operations without fear

Master the find command and xargs for safe, efficient bulk file operations: handle spaces in filenames, batch deletes, renames, and parallel processing.

11 min read Read
Time Series Jun 7, 2026

The forecasting baselines that quietly beat fancy models

Why forecasting baselines like the naive forecast and seasonal-naive outperform complex models—and how to pick the right one before you build anything fancy.

11 min read Read
Business Analytics Jun 7, 2026

Funnel analysis: finding exactly where users drop off

Master funnel analysis to pinpoint conversion leaks, prioritize fixes by impact, and turn step-by-step drop-off data into real growth.

11 min read Read
Git Jun 7, 2026

git bisect: find the commit that broke it in log(n) steps

Use git bisect to find the exact commit that introduced a regression or bug with binary search over history — O(log n) instead of O(n).

10 min read Read
Git Jun 7, 2026

Branching strategies that scale: trunk-based vs Git Flow vs GitHub Flow

Compare every major branching strategy: trunk-based development, Git Flow, and GitHub Flow — and know which fits your release cadence.

12 min read Read
Git Jun 7, 2026

Git merge vs rebase: when to use which (without wrecking history)

Git merge vs rebase both integrate branches but produce different histories. Learn when each is right, the golden rule, and how to recover when things go wrong.

12 min read Read
Git Jun 7, 2026

Git's three trees: the mental model that makes Git click

Understand git's working directory, staging area, and HEAD so every git command finally makes sense — no more mystery, no more fear.

11 min read Read
CLI Jun 7, 2026

grep, sed, awk: the text-processing trio worth mastering

grep sed awk command line text processing explained: when to use each tool, real recipes, regex fundamentals, and how pipelines compose all three.

13 min read Read
Business Analytics Jun 7, 2026

LTV and CAC: the unit economics every analyst should model

Master LTV and CAC unit economics: correct formulas, cohort methods, payback periods, and the pitfalls that make most models wrong.

12 min read Read
Business Analytics Jun 7, 2026

North Star metrics: the one number that actually moves a business

What a north star metric is, how to choose one, and why a single well-chosen number beats a dashboard of 40 KPIs.

11 min read Read
Time Series Jun 7, 2026

Stationarity, differencing, and why ARIMA needs a flat series

Understand stationarity, why differencing transforms a trending series, and how the d in ARIMA(p,d,q) bridges raw data to a forecastable model.

13 min read Read
Time Series Jun 7, 2026

Decomposition: reading trend, seasonality, and residual

A practical guide to time series decomposition — separating trend, seasonality, and residual to reveal what a signal is actually doing.

11 min read Read
CLI Jun 7, 2026

Understanding $PATH: how the shell actually finds your commands

Demystify the PATH environment variable and 'command not found': how the shell searches, why order matters, and how to manage it safely.

10 min read Read
Git Jun 7, 2026

Undoing things in Git: reset, revert, restore, and reflog

A practical guide to undo in git: when to reach for reset revert reflog restore or amend, and how to recover commits you thought were gone forever.

12 min read Read
CLI Jun 7, 2026

Unix file permissions: rwx, chmod, and the octal that confuses everyone

Understand Unix file permissions end-to-end: read the ls -l string, master chmod octal and symbolic modes, chown, umask, and special bits.

11 min read Read
CLI Jun 7, 2026

Pipes and redirection: how Unix composes small sharp tools

How unix pipes and redirection wire stdin stdout and stderr into composable data pipelines — and why that one-liner on your terminal is a tiny distributed system.

11 min read Read
Time Series Jun 7, 2026

Why you can't shuffle a time series: splits and leakage

Random k-fold on a time series leaks the future into training. Learn the correct time-based train test split, rolling-origin CV, and how to avoid leakage.

11 min read Read
Agents Jun 3, 2026

Memory poisoning: when your agent remembers a lie

Persistent agent memory is a delayed-action injection vector. Untrusted content read in one session gets written to long-term memory and silently steers a clean session later — sometimes for a different user. Here is how the attack works, the 2024-2026 research, and why the only real fix is treating every memory write as untrusted input.

12 min read Read
Patterns Jun 3, 2026

The silent revenue drop: how drift actually breaks production models

Drift does not crash your service or page your on-call. It quietly degrades a model for weeks while every dashboard stays green, until finance asks why a revenue line is bleeding. This is the operational war story the KS-test tutorials skip.

13 min read Read
Patterns Jun 3, 2026

Why your model made that prediction: SHAP in production

SHAP is the most-used model explainability library on earth, and most teams misread it. A field guide to what Shapley values actually compute, where TreeSHAP beats KernelSHAP by orders of magnitude, and the four ways SHAP quietly lies to you.

12 min read Read
Infrastructure Jun 3, 2026

Training-serving skew: the bug feature stores exist to kill

The model scores 0.95 offline and dies in production. Almost always the cause is one bug: training-serving skew, where the feature the model learned offline is computed differently online — or worse, was joined from the future. Here's the bug, why your holdout set can't catch it, and the three fixes that actually work.

12 min read Read
Agents Jun 3, 2026

Why agents need permissions: the lethal trifecta and least privilege

Prompt injection has no reliable fix at the model layer. Simon Willison's lethal trifecta and OWASP's Excessive Agency say the same thing: agent security must be designed at the system layer, with least privilege.

12 min read Read
Patterns Jun 2, 2026

A/B testing is a sample-size problem wearing a statistics costume

Most A/B-test failures aren't bad statistics — they're underpowered tests that never had a chance of seeing the effect they were designed to find.

10 min read Read
Patterns Jun 2, 2026

Accuracy lies on imbalanced data

A model that never flags a single fraud transaction can score 99% accuracy — and that number is not wrong, it is just completely useless.

8 min read Read
Career Jun 2, 2026

Active listening: the skill most people skip

Most people listen to reply, not to understand — and it costs them rework, trust, and credibility they never see leaving.

8 min read Read
Career Jun 2, 2026

The AI velocity paradox: why faster code means working weekends

AI writes code in minutes, but review, testing, and deployment still run at human speed — and the AI velocity paradox is landing on engineers as weekends.

9 min read Read
Career Jun 2, 2026

Asking for a promotion

Promotions are given for work you are already doing at the next level, not as a reward for time served — and most people lose them by waiting for someone else to notice.

8 min read Read
Career Jun 2, 2026

Async etiquette: working across time zones

Distributed teams that default to async ship faster — but only if everyone writes as if the reader is asleep when they send it.

8 min read Read
Patterns Jun 2, 2026

Attention, explained without the matrices

Self-attention is a soft dictionary lookup: every word asks a question, every other word posts a label, and the answer is a weighted average of what each word actually contains.

9 min read Read
Patterns Jun 2, 2026

Why the average customer does not exist

The mean is a liar on skewed data, and almost all business data is skewed — here is how to stop building products for a customer who never existed.

10 min read Read
Career Jun 2, 2026

Avoiding burnout before it avoids you

Burnout is not a single bad week — it is a slow structural collapse across three dimensions, and by the time most people name it, they have already lost months they cannot get back.

8 min read Read
Patterns Jun 2, 2026

Backprop is the chain rule with good bookkeeping

Backpropagation is not a mysterious learning algorithm — it is systematic calculus, applied backward through a network, made cheap by refusing to compute the same number twice.

9 min read Read
Patterns Jun 2, 2026

Bayes' theorem is just updating beliefs with evidence

A 99%-accurate test sounds iron-clad until you realize that, for rare diseases, a positive result is probably wrong — and Bayes explains exactly why.

8 min read Read
Patterns Jun 2, 2026

Always beat the baseline first

A 0.91 F1 score is impressive until you discover a constant predictor scores 0.90 — which means your model's only real contribution is the gap between those two numbers.

10 min read Read
Career Jun 2, 2026

Beating procrastination: the two-minute rule and friends

Procrastination is not a time-management failure — it is an emotion-regulation problem, and shrinking the first step until it is laughably small is the only lever that reliably breaks the cycle.

8 min read Read
Patterns Jun 2, 2026

The bias-variance tradeoff, drawn from scratch

Model error is not one thing — it is the sum of two competing forces, and the art of machine learning is finding the narrow valley between them.

10 min read Read
Patterns Jun 2, 2026

When O(n squared) quietly kills your data pipeline

An O(n squared) operation is invisible at a thousand rows and catastrophic at a million — and it almost always reaches production disguised as clean-looking code.

10 min read Read
Patterns Jun 2, 2026

Binary search is everywhere once you see it

Binary search is not a data structure trick — it is a way of thinking about any monotonic question, and once you internalize it, you see it lurking inside problems that look nothing like sorted arrays.

9 min read Read
Career Jun 2, 2026

Presenting to executives: lead with the answer

Executives decide in the first 30 seconds — bury the recommendation and you lose the room before you have made a single argument.

8 min read Read
Patterns Jun 2, 2026

Broadcasting is the NumPy idea that takes a week to click

Broadcasting — NumPy's rule for combining arrays of different shapes by silently stretching size-1 dimensions — looks like magic until you see the geometry, and then you can never unsee it.

9 min read Read
Career Jun 2, 2026

Building a track record people remember

Careers are not built on busyness — they are built on a handful of shipped, visible outcomes that compound into a reputation others can point to.

8 min read Read
Patterns Jun 2, 2026

Categoricals: the Pandas dtype that pays for itself

A column of repeated country codes that looks like strings is silently eating ten times the memory it needs — and making every groupby slower than it has to be.

9 min read Read
Patterns Jun 2, 2026

The chart you reach for is an argument, not a default

Every chart type encodes a claim about your data — and reaching for the wrong one doesn't just look bad, it actively misleads the people making decisions.

10 min read Read
Patterns Jun 2, 2026

Cohorts, not totals: how a flat dashboard hides a dying product

A steady active-user count can mask catastrophic churn — and the only way to see the rot is to stop looking at totals and start looking at cohorts.

10 min read Read
Patterns Jun 2, 2026

Correlation isn't causation — but here's what it actually is

Pearson's r is a precise, fragile number: it measures linear co-movement on a scale from -1 to 1, and almost everything interesting about causality lies in what it cannot see.

9 min read Read
Career Jun 2, 2026

Cross-functional work: speaking other teams' languages

The colleague who moves initiatives forward fastest is rarely the most expert person in the room — they are the one who translates.

8 min read Read
Patterns Jun 2, 2026

Cross-validation: a score you didn't overfit to

A single train/test split gives you one lucky or unlucky number; k-fold cross-validation gives you five honest ones — and the variance between them tells you as much as the mean.

9 min read Read
Patterns Jun 2, 2026

Decorators are just functions that wrap functions

The @ symbol is pure syntactic sugar for a pattern you already know: pass a function in, get a smarter function back.

8 min read Read
Career Jun 2, 2026

Deep work in an open office

Knowledge work rewards focus, but the modern office is engineered to destroy it — here is a practical system for reclaiming your output.

8 min read Read
Career Jun 2, 2026

Delegation: letting go without losing control

Most new leaders either micromanage every keystroke or throw ownership over the wall and hope — both fail the team, and both fail the leader.

8 min read Read
Career Jun 2, 2026

Difficult conversations: a playbook

The conversation you keep postponing is already costing you — here is a four-step structure that ends avoidance and actually moves things forward.

8 min read Read
Career Jun 2, 2026

Giving direction without micromanaging

The manager who explains the destination and trusts the route gets better work, faster — and keeps the people worth keeping.

8 min read Read
Career Jun 2, 2026

Disagree and commit: the decision discipline

Teams that re-litigate every decision after the fact pay a tax on execution that no amount of being right can recoup.

8 min read Read
Career Jun 2, 2026

Disagreeing without being disagreeable

You can fight the idea without fighting the person — and the difference between the two is mostly a matter of technique, not temperament.

8 min read Read
Patterns Jun 2, 2026

Eigenvectors: the directions a matrix doesn't turn

Most vectors get rotated when a matrix acts on them — eigenvectors are the stubborn exceptions, and that stubbornness is exactly why they run machine learning.

9 min read Read
Career Jun 2, 2026

Writing email that gets read and acted on

Most work email fails because the ask is buried in paragraph three — here is how to front-load the decision and get a response in one round-trip.

7 min read Read
Patterns Jun 2, 2026

Expected value: how professionals make peace with uncertainty

The number you should optimize for isn't the most likely outcome — it's the probability-weighted average of all outcomes, and ignoring that difference is the hidden tax on every bad business decision.

10 min read Read
Patterns Jun 2, 2026

EXPLAIN: reading the plan your database hands you

Every SQL query is secretly a program your database compiles; EXPLAIN shows you that program, and once you can read it, a 30-second query becomes a 30-millisecond one.

9 min read Read
Patterns Jun 2, 2026

Feature scaling: the step KNN and gradient descent never forgive

When one feature spans dollars and another spans years, the dollar column dominates every distance calculation and every gradient update — silently wrecking models that otherwise look fine.

8 min read Read
Career Jun 2, 2026

Giving feedback that lands: the SBI model

Vague feedback is not kind — it is noise dressed up as a conversation, and the SBI model (Situation, Behavior, Impact) is the antidote.

8 min read Read
Career Jun 2, 2026

Finding a mentor (and being worth mentoring)

Good mentorship is earned through low-cost, specific asks and ruthless follow-through — not assigned by HR or requested with a vague 'will you mentor me?'

8 min read Read
Career Jun 2, 2026

Your first 90 days in a new job

The first quarter shapes how you are seen for years — listen and map before you opine, earn credibility through small visible wins, and diagnose before you ever prescribe.

8 min read Read
Patterns Jun 2, 2026

The conversion funnel is a multiplication problem

Overall conversion is the product of every step rate — not a vague average — and that single fact tells you exactly where to spend your next dollar.

9 min read Read
Patterns Jun 2, 2026

Comprehensions, generators, and the art of not building the list

The gap between a list comprehension and a generator expression is just two characters, but at ten million rows it is the difference between a program that breathes and one that chokes.

10 min read Read
Patterns Jun 2, 2026

GroupBy is a three-act play: split, apply, combine

The split-apply-combine model is the single mental frame that makes Pandas GroupBy, SQL GROUP BY, and every variant of agg/transform/filter finally cohere.

8 min read Read
Patterns Jun 2, 2026

Hash tables: O(1) until they don't

A hash table promises constant-time lookup in the average case — but that asterisk hides a story about collisions, load factors, and adversarial keys that every practitioner should understand.

8 min read Read
Career Jun 2, 2026

Imposter syndrome: naming it and moving past it

Feeling like a fraud at work is nearly universal among capable people — and the antidote is not confidence, it is evidence.

8 min read Read
Patterns Jun 2, 2026

Indexes: the data structure your WHERE clause is begging for

A single missing index turned a 12-millisecond query into a 4-second crawl — not because the database broke, but because it had no choice but to read every row.

9 min read Read
Career Jun 2, 2026

How to get buy-in: the influence playbook

A great idea with no buy-in dies in the meeting it was supposed to win — here is how to pre-wire the room so the vote is a formality.

8 min read Read
Career Jun 2, 2026

Leading without authority

The most consequential moves in your career will be made without a single direct report — here is how to make them land.

8 min read Read
Career Jun 2, 2026

Managing conflict on a team

Unaddressed conflict does not disappear — it goes underground, where it costs far more than the argument you were avoiding.

8 min read Read
Career Jun 2, 2026

Manage your energy, not just your time

A full calendar is not the same as full capacity — learning to match the right work to the right hour is the leverage move that most time-management advice misses.

8 min read Read
Career Jun 2, 2026

Managing up: making your manager's job easier

Managing up is not sucking up — it is the deliberate practice of reducing your manager's uncertainty about you, so trust accumulates instead of eroding.

8 min read Read
Agents Jun 2, 2026

MCP vs A2A vs ACP vs ANP: the agent protocol stack

Four acronyms, one stack. MCP connects an agent to its tools; A2A, ACP, and ANP connect agents to each other — differing only by how open the world is, from the enterprise intranet to the decentralized agentic web.

8 min read Read
Agents Jun 2, 2026

MCP isn't enough: who lets your agents talk to each other?

MCP solved the agent-to-tool problem and solved it well. But the moment one agent needs to hand work to another agent it does not own, MCP runs out of road — and that is exactly the gap A2A and ACP were built to fill.

8 min read Read
Career Jun 2, 2026

How to run a meeting people don't dread

Most meetings are status updates in disguise — learn the three legitimate reasons to gather, and a decision framework that turns calendar dread into momentum.

8 min read Read
Patterns Jun 2, 2026

Your Pandas merge silently 10x'd the rows — here's why

A join between two tables where the key is not unique on one side silently multiplies rows for every matching pair, and pandas will not warn you unless you tell it to.

9 min read Read
Infrastructure Jun 2, 2026

MLOps is a loop, not a pipeline

Shipping a model is not the finish line — it is the starting gun for a feedback loop that runs as long as the model serves traffic, and the teams that forget this find out when a customer calls.

10 min read Read
Patterns Jun 2, 2026

Net revenue retention: the one number investors obsess over

NRR reveals whether your existing customers are worth more or less than they were a year ago — and a single number above 100% changes everything about how you can grow.

9 min read Read
Career Jun 2, 2026

Networking for people who hate networking

Networking is not collecting contacts — it is building a small number of genuine relationships before you need them, and the introverts who understand this tend to do it better.

8 min read Read
Career Jun 2, 2026

Building trust as a new manager

The fastest way to lose your team in your first 90 days is to prove yourself by changing everything — here is the slower, harder, more durable path.

8 min read Read
Career Jun 2, 2026

Note-taking that actually helps you think

Notes you never reread are just expensive transcription — here is how to build a system that generates ideas, not just archives them.

8 min read Read
Patterns Jun 2, 2026

NULL is not a value, and that wrecks your WHERE clause

NULL means unknown, so every comparison involving it yields unknown — not false — and that silent third outcome is responsible for more wrong query results than any other SQL trap.

9 min read Read
Career Jun 2, 2026

Office politics for honest people

Politics is just how decisions and resources get allocated among people — and navigating it with integrity is a learnable skill, not a moral compromise.

8 min read Read
Patterns Jun 2, 2026

One-hot encoding and the curse of high cardinality

One-hot encoding a zip code column turns 42,000 rows into a sparse desert — thousands of nearly-empty columns that starve your model of signal while drowning it in noise.

9 min read Read
Career Jun 2, 2026

One-on-ones that aren't a waste of time

Most 1:1s quietly die because managers treat them as a status meeting and reports let them, but reclaiming one hour a week can change the entire arc of someone's career.

8 min read Read
Patterns Jun 2, 2026

Overfitting is memorizing the answer key

A model that aces its training data but collapses on new data has not learned anything — it has memorized noise, and the gap between its training score and its test score is the confession.

9 min read Read
Patterns Jun 2, 2026

p-values are not the probability you are right

A p-value of 0.04 does not mean there is a 96 percent chance your variant wins — it means something far stranger, and knowing the difference is what separates analysts who ship good decisions from analysts who ship confident noise.

10 min read Read
Patterns Jun 2, 2026

Method chaining: writing pandas like a pipeline, not a pile

Chaining assign, query, groupby, and agg into one readable pipeline beats a pile of intermediate variables — it reads top to bottom like a recipe, kills stale-variable bugs, and sidesteps the SettingWithCopy trap.

10 min read Read
Career Jun 2, 2026

The performance review: making your case

Your manager cannot remember everything you did — your job is to arrive with evidence, not hope.

8 min read Read
Career Jun 2, 2026

Your personal brand at work

Your reputation is built in the moments you are not watching — and most people are either invisible or noisy, with very few landing in the narrow band that actually earns trust and opportunity.

8 min read Read
Career Jun 2, 2026

Prioritization: telling urgent from important

Busy is not the same as effective — the Eisenhower Matrix exposes the one quadrant where real careers are built, and why almost everyone starves it.

8 min read Read
Career Jun 2, 2026

Psychological safety: what it is and how to build it

The team that never disagrees in meetings is not your best team — it is your most dangerous one.

8 min read Read
Career Jun 2, 2026

Public speaking without the panic

Stage fright is just energy without a container — once you know how to structure a talk and practice it out loud, the nerves stop being the enemy and start doing the work for you.

8 min read Read
Patterns Jun 2, 2026

The mutable default argument, and other Python footguns

Python evaluates default argument values exactly once, at function definition time, which means a mutable default is a shared object that silently accumulates state across every call.

8 min read Read
Career Jun 2, 2026

Quiet quitting to quiet cracking: a field guide to disengagement

Quiet quitting, quiet cracking, boreout, conscious unbossing — work keeps inventing words for checking out. A field guide to what each signals.

9 min read Read
Patterns Jun 2, 2026

RAG in one diagram: retrieve, augment, generate

The cleanest mental model for retrieval-augmented generation: why pasting your docs into a prompt beats hoping the LLM already knows them, and the two failure modes that kill every RAG system in production.

8 min read Read
Patterns Jun 2, 2026

How to read a P&L in sixty seconds

The income statement is the one page every business speaks fluently, and once you know the grammar you can judge a company before your coffee gets cold.

9 min read Read
Patterns Jun 2, 2026

CTEs turned my unreadable query into something I can follow

A common table expression is less a feature than a cognitive tool — it lets you read a complex query from top to bottom, in the order you actually think, instead of inside-out from the deepest parenthesis.

8 min read Read
Career Jun 2, 2026

Reading the room

The same message can land perfectly or detonate silently — the difference is rarely the words you chose, it is whether you read the room before you opened your mouth.

8 min read Read
Career Jun 2, 2026

How to receive feedback without getting defensive

The instinct to defend yourself when criticized is perfectly human — and it is the single habit most reliably separating people who plateau from people who compound.

8 min read Read
Patterns Jun 2, 2026

Regularization is a tax on complexity

Your model will always overfit given enough rope — regularization is the mechanism that prices complexity so the model only grows elaborate when the data actually demands it.

9 min read Read
Career Jun 2, 2026

Remote work that does not tank your career

Out of sight becomes out of mind faster than you think — but visibility is a skill you can build deliberately, even from a spare bedroom.

8 min read Read
Career Jun 2, 2026

Running a retrospective that changes things

Most retros are theater — same complaints, different quarter — because the team conflates venting with deciding and leaves without a single named owner.

8 min read Read
Career Jun 2, 2026

Negotiating your salary (and why you must)

The raise you fail to negotiate in year one quietly compounds into a six-figure gap by mid-career — and no one will hand it back to you.

8 min read Read
Career Jun 2, 2026

Saying no without burning bridges

Every yes you give carelessly is a debt you will pay with your credibility — learning to decline well is not a career risk, it is what makes your yeses worth anything.

8 min read Read
Patterns Jun 2, 2026

SettingWithCopyWarning, finally explained

The most-Googled pandas warning is not about copying — it is about whether the memory you are writing to is the DataFrame you think it is.

10 min read Read
Career Jun 2, 2026

Sharing credit (and why it makes you look better)

Hoarding the spotlight is the fastest way to ensure no one wants to shine it on you again.

7 min read Read
Patterns Jun 2, 2026

Standard deviation, explained without the formula

Two archers with identical averages and completely different groupings reveal everything you need to know about spread, punishment for outliers, and why the formula does what it does.

8 min read Read
Career Jun 2, 2026

The STAR method: answering behavioral interviews

Behavioral questions separate candidates who did interesting work from those who can articulate what they did and why it mattered — and STAR is the structural difference.

7 min read Read
Career Jun 2, 2026

The art of the status update

A well-written status update is not a chore — it is the single cheapest act that earns you trust, autonomy, and the benefit of the doubt when things go sideways.

8 min read Read
Career Jun 2, 2026

Storytelling at work: the structure that makes people care

Data persuades the head; story moves people to act — and once you learn the four-beat narrative arc, you will never give a flat project update again.

8 min read Read
Career Jun 2, 2026

Switching jobs gracefully

How you leave a job is remembered longer than most of what you did there, and the industry is far smaller than it looks from the inside.

8 min read Read
Career Jun 2, 2026

Taming Slack and email

Every notification is someone else's priority quietly colonizing your calendar — here is the system to take it back.

7 min read Read
Patterns Jun 2, 2026

Temperature, top-p, top-k: the three knobs on an LLM

Every LLM call hides a small probability lottery behind its confident prose — temperature, top-k, and top-p are the rules of that lottery, and most practitioners tune them without understanding what they are reshaping.

9 min read Read
Agents Jun 2, 2026

The agentic web: how ANP wants to be the HTTP of agents

MCP, A2A, and ACP wire agents to tools and to each other inside the enterprise. ANP aims further: an open, decentralized network where any agent can find and trust any other across the internet, with no central authority — using W3C Decentralized Identifiers, the did:wba method, and schema.org self-descriptions.

11 min read Read
Patterns Jun 2, 2026

The axis argument everyone gets backwards

Axis in NumPy and Pandas names the dimension you destroy, not the one you keep — and once that clicks, every aggregation you write becomes obvious.

8 min read Read
Career Jun 2, 2026

The art of the handoff

Most dropped balls happen not because people are careless but because the seam between two people was never properly closed.

8 min read Read
Career Jun 2, 2026

Time-blocking: designing your week on purpose

A calendar full of back-to-back meetings is not fate — it is an accidental design decision you can undo in forty minutes on a Friday afternoon.

8 min read Read
Patterns Jun 2, 2026

Transfer learning: standing on a pretrained model's shoulders

You do not train from scratch in 2026 — you inherit a model's years of learned representations and spend your compute budget on the narrow task that actually matters.

9 min read Read
Patterns Jun 2, 2026

Two pointers: the trick that turns O(n squared) into O(n)

Most array pair problems feel like they need a nested loop until you notice that sorting gives you a monotonic structure you can exploit to skip the entire inner loop.

8 min read Read
Patterns Jun 2, 2026

Type hints are documentation that can't go stale

A docstring lies without consequence, but a type hint that disagrees with the code is caught by your editor before you even save the file.

9 min read Read
Patterns Jun 2, 2026

Unit economics is the only growth math that matters

Whether pouring money into growth is brilliant or fatal comes down to one ratio — and most people compute it wrong.

9 min read Read
Career Jun 2, 2026

The weekly review: the habit that compounds

Most people run their week reactively and never look up — a 30-minute weekly review is the only place where strategy actually meets the calendar.

7 min read Read
Career Jun 2, 2026

Does wellbeing spending actually work? The case against wellbeing-washing

Companies pour tens of billions into mindfulness apps and resilience courses, but in controlled trials most wellbeing programs show no benefit.

9 min read Read
Patterns Jun 2, 2026

Why everything looks normal: the central limit theorem

The bell curve colonizes measurement not because the world is Gaussian but because averaging destroys the shape of almost any distribution.

8 min read Read
Patterns Jun 2, 2026

Why neural networks need activation functions

Without a nonlinear activation, stacking a thousand layers is mathematically identical to stacking one — and every straight-line model in history could have told you that.

9 min read Read
Patterns Jun 2, 2026

Why Python is slow — and the times it actually matters

A tight Python loop over a million numbers can be 100x slower than C, but that rarely matters — until it suddenly, catastrophically does.

9 min read Read
Patterns Jun 2, 2026

Why we normalize: batch norm, intuitively

Internal covariate shift is not a subtle bug — it is the reason deep networks used to need weeks of careful babysitting, and batch norm is why that era quietly ended.

9 min read Read
Patterns Jun 2, 2026

Window functions changed how I write SQL

The leap from GROUP BY to window functions is the single biggest level-up an analyst makes — and the intuition behind OVER, PARTITION BY, and LAG is simpler than most tutorials admit.

10 min read Read
Career Jun 1, 2026

'40% fear AI will take their job': what the data actually says

Four in ten workers fear AI will take their job, and some cuts are real — but they rarely pay off, and the forecast still points to a net job gain.

9 min read Read
Career Jun 1, 2026

The $8.9 trillion problem: inside the global engagement crisis

Only a fifth of the world's workers are engaged, and the lost productivity runs to $8.9 trillion — but the headline hides what engaged even means.

9 min read Read
Career Jun 1, 2026

The four-day week, backed by the biggest trial yet

The largest four-day-week trial found burnout falling sharply and output holding — but only because firms redesigned the work first. The evidence.

9 min read Read
Career May 31, 2026

The thinning bottom rung: AI and the vanishing entry-level job

AI is best at the repetitive work juniors once learned the craft on, so the entry-level rung is wearing thin. What the data shows and how to stay hireable.

9 min read Read
Career May 31, 2026

The collapsing middle: why managers are cracking first

The clearest signal in 2025 workplace data is not that employees are struggling — it is that managers are cracking faster, and the damage flows downhill.

9 min read Read
Career May 30, 2026

The two-minute interruption: focus, surveillance, and always-on work

For coding and data work, focus is the scarce resource — yet the workday shreds it every two minutes with pings, overtime, and monitoring.

9 min read Read
Career May 30, 2026

Money on the brain: how financial stress steals focus at work

Financial stress is the quiet undercurrent beneath much of what we call burnout, following people to work as distraction and lost sleep. What helps, in order.

9 min read Read
GATE DA May 30, 2026

GATE DA 2026 paper analysis: what the official paper really tested

A concept-first reading of the official GATE DA 2026 paper: the 15+85 mark split, where marks sit by subject, the cutoff, and what the questions really tested.

11 min read Read
Agents May 30, 2026

The Responses API is becoming the agent operating system

Why modern agent stacks are moving from chat-shaped endpoints to response objects that preserve tool calls, traces, citations, and state. Agentic AI guide with sources.

10 min read Read
Patterns May 29, 2026

Agent evals are product specs, not benchmark decoration

Why the best agent teams write evals before they scale prompts, tools, or model upgrades — running capability, regression, and adversarial suites as the real product spec.

11 min read Read
Career May 29, 2026

Boreout: the burnout that comes from too little

Boreout is burnout's overlooked twin — chronic under-stimulation that breeds the same fatigue and cynicism, and quietly hits skilled people hardest.

9 min read Read
GATE DA May 29, 2026

GATE DA cutoff, rank, and score: how to interpret the 2026 numbers

A sober read of the 2026 GATE DA numbers: the GEN/OBC/SC cutoffs, how raw marks become a score and an all-India rank, and why a rank target beats a cutoff target.

9 min read Read
Career May 29, 2026

The lonely org chart: workplace loneliness in the hybrid era

Workplace loneliness carries real health and retention costs, and remote work dismantled the job's social fabric — hitting early-career staff hardest.

9 min read Read
Agents May 28, 2026

The agent harness — the code around the model that actually makes an agent

The LLM is the engine, but the harness is the car. Most of an agent product's real value lives in the scaffolding wrapped around the model — the loop runner, tool dispatcher, context manager, permission gate, and the dozen other things that turn a stateless completion API into something that can work for twenty minutes unsupervised.

15 min read Read
Career May 28, 2026

Conscious unbossing: why Gen Z is turning down the promotion

Gen Z is declining the step into management, and the data says conscious unbossing is less a failure of ambition than a rational read of a quietly broken job.

9 min read Read
Infrastructure May 28, 2026

Denial of wallet: the attack that bankrupts your AI app

Classic DoS tries to take you down. Denial-of-wallet keeps you up and runs your token bill to five figures by lunch. Why request limits don't save you.

8 min read Read
GATE DA May 28, 2026

MCQ, MSQ, NAT in GATE DA: three formats, three risk models

MCQ, MSQ, and NAT carry different GATE marking rules, so each needs a different decision under uncertainty. How negative marking changes when to guess, eliminate, or skip.

10 min read Read
Infrastructure May 28, 2026

MCP security after the hype cycle: what actually needs defending

MCP won mindshare by making tools portable, but it is only a protocol boundary. Auth, sandboxing, output filtering, and approvals still live in your server, not the protocol.

10 min read Read
Career May 28, 2026

Sleep: the performance lever nobody optimizes

For coding, analysis, and exam prep, sleep quietly sets the ceiling on everything else — yet always-on work erodes it first, and the fix is workload design.

9 min read Read
Patterns May 28, 2026

The supervisor-worker pattern: when one agent isn't enough

Anthropic's orchestrator-worker pattern is the only multi-agent topology that consistently ships. Here's how to recognise when you actually need it, what the LangGraph supervisor implementation looks like in practice, and the failure modes nobody warns you about.

13 min read Read
Frameworks May 27, 2026

A2A protocol: agents as services

A2A is not another prompt format; it is the service boundary for delegating work to opaque agents owned by other teams or vendors — above MCP for tools and a workflow engine for orchestration.

10 min read Read
GATE DA May 27, 2026

GATE DA syllabus map: the seven clusters that actually matter

The official GATE DA syllabus collapses into seven connected clusters: probability, linear algebra, calculus, Python-DSA, DBMS, machine learning, and AI. Study them as a graph.

11 min read Read
Agents May 27, 2026

Operator, Claude Computer Use, and Project Mariner: the browser agent shootout

Three browser agents shipped within twelve months — each with a different abstraction over the browser. Here's how the screenshot-and-click, DOM-and-accessibility-tree, and Chrome-extension approaches stack up on WebVoyager, OSWorld, latency, cost, and real-world reliability.

13 min read Read
Career May 27, 2026

996 and the fetishization of overwork

996 — nine to nine, six days a week — is lionized again in startups, but the hours data and WHO death toll check the urge to confuse long with serious.

9 min read Read
Infrastructure May 27, 2026

Surviving 10,000 concurrent requests to your LLM API

An LLM request spends 99% of its life waiting. Design around that one fact and 10k concurrent users stops being scary.

8 min read Read
Career May 27, 2026

The Big Stay: why nobody's quitting, and why that's worse

A falling quit rate looks like loyalty, but the Big Stay can mean the opposite — people who want out but feel trapped, fraying quietly at their desks.

9 min read Read
Agents May 26, 2026

The three kinds of memory production agents actually use

Working memory, episodic memory, semantic memory — the cognitive science taxonomy that every agent memory startup borrows from. Here's what each one means in practice, what Letta, Mem0, and Zep actually ship, and why most production agents only need the cheapest one.

13 min read Read
Infrastructure May 26, 2026

Cache the question, not just the bytes

Your embedding API answers the same query a thousand times a day. Three caches cut LLM cost and latency — and one of them can hand back a confidently wrong answer.

8 min read Read
GATE DA May 26, 2026

Conditional probability in GATE DA: the trap is the condition

Why GATE DA conditional-probability questions are really language parsing, and why mutually exclusive is not the same as independent. With a two-circle diagram.

10 min read Read
Frameworks May 26, 2026

Microsoft Agent Framework: explicit orchestration wins

Sequential, concurrent, handoff, group chat, and Magentic: the five orchestration patterns Microsoft Agent Framework names, and how to pick the right one for a workflow.

9 min read Read
Career May 26, 2026

The recognition gap: the highest-ROI lever teams ignore

Managers think they give feedback and praise constantly; reports barely feel it. That recognition gap is the cheapest, highest-return lever most teams ignore.

9 min read Read
Career May 26, 2026

What actually re-engages people when the perks don't

Ping-pong tables and meditation apps do not move engagement. What the evidence says re-engages people, ranked by leverage, for data and engineering leads.

9 min read Read
Infrastructure May 25, 2026

50,000 documents, one summarization pipeline

You can't loop 50k LLM calls inside a request. The shape that works — a queue, a pool of workers, and the boring reliability details that decide whether it finishes.

8 min read Read
Patterns May 25, 2026

The five patterns that ship real agents in 2026

Anthropic's December 2024 paper became the production reference for agent design. Two years on, here's which patterns actually survived contact with shipped products — and why the simple ones won.

12 min read Read
GATE DA May 25, 2026

Random variables and distributions for GATE DA

Read the distribution from the wording: what Bernoulli, binomial, Poisson, exponential, and normal each count or measure, plus the support traps GATE DA exploits.

10 min read Read
Frameworks May 25, 2026

Google ADK and the return of boring multi-agent architecture

Google ADK's strongest idea is not novelty; it is making agents look like testable software components: explicit sub-agents with tool contracts, evals, and deployment paths.

9 min read Read
Career May 25, 2026

Psychological safety: the precondition every wellbeing program forgets

Before any wellbeing program can work, people must feel safe enough to speak — and most teams fail that psychological-safety test before anyone admits it.

9 min read Read
Career May 25, 2026

Is sitting the new smoking? The desk-bound body of data work

Coding and analysis are some of the most sedentary work ever invented. What sitting does to the body, why 'the new smoking' overstates it, and what helps.

9 min read Read
Patterns May 25, 2026

Why ReAct lost — and structured planning won

The 2023 ReAct paper defined an era of agent design. Its Thought-Action-Observation loop is also why so many agents burn tokens, loop in circles, and time out on long tasks. Here's how Plan-and-Solve, ReWOO, and LLMCompiler displaced it in production — and the concrete numbers that explain why.

13 min read Read
Agents May 24, 2026

Computer-use agents need permission ladders, not bravery

The right product shape for computer-use agents is progressive permission: observe, draft, simulate, ask, act, verify, with approval gated before any irreversible action.

10 min read Read
GATE DA May 24, 2026

CLT, confidence intervals, and tests for GATE DA

A connected view of CLT, confidence intervals, and z/t/chi-square tests for GATE DA — one chain from sampling distribution to interval to decision rule.

10 min read Read
Career May 24, 2026

The right to disconnect: a global map of after-hours work

Right-to-disconnect laws now let workers ignore the boss after hours — France, Australia, Ireland — while the US has none. What the split means for teams.

9 min read Read
Career May 24, 2026

The return-to-office fight, weighed honestly

Return-to-office mandates became a wellbeing flashpoint: the office aids connection and mentorship, but blanket rules read as control and push people out.

10 min read Read
Agents May 24, 2026

Tool selection at 1000 tools: routing techniques that ship

Cramming hundreds of MCP tools into your system prompt destroys both latency and accuracy. Vector retrieval, hierarchical menus, RAG-on-tools, and code mode each take a different bet. Here are the numbers, the production deployments, and the pattern that's winning.

13 min read Read
Infrastructure May 24, 2026

Where your Python packages actually live

venv isolates them, uv makes installing them almost free, and pex bundles them into one file you can throw at any machine. A field guide to three tools and the one question they each answer.

7 min read Read
Agents May 23, 2026

Browser agents in production: Manus, BrowserBase, and Stagehand

Headless browser agents looked like science fiction a year ago. Today they cost real money, fail real ways, and are mostly an infrastructure play, not a model play. Here's where they work, where they don't, and which layer of the stack is actually winning.

13 min read Read
Career May 23, 2026

The caregiving squeeze: work and the sandwich generation

Caring for children and ageing parents at once is quietly pushing experienced people, especially women, to downshift or leave as office mandates collide.

9 min read Read
Infrastructure May 23, 2026

Edge AI in practice: Vercel AI SDK + Cloudflare Workers AI

Some inference belongs at the edge — the user's nearest POP — not in a central GPU cluster. The Vercel AI SDK and Cloudflare Workers AI made that practical. Here's where edge wins, how the cold-start tricks work, and what the streaming-from-the-edge architecture looks like.

12 min read Read
GATE DA May 23, 2026

Projection matrices in GATE DA: idempotent, symmetric, and geometric

Why projection questions in GATE DA are really geometry in matrix notation: idempotent and symmetric properties, eigenvalues of 0 and 1, and rank as subspace dimension.

10 min read Read
Career May 23, 2026

Neurodiversity at work: the talent case beyond accommodation

Autism, ADHD, and dyslexia are common in technical fields, yet work is built around a neurotypical default. Stop bolting on accommodation — design for it.

9 min read Read
Agents May 23, 2026

The eval loop that actually ships agents

Every team that ships AI agents reliably has built roughly the same eval system: a golden set kept honest by humans, an LLM-judge they don't fully trust, a regression dashboard that gates merges, and a culture of 'no win on the eval, no merge.' Here's what that loop actually looks like in practice.

14 min read Read
Infrastructure May 23, 2026

Tool search is the context-budget hack serious agents needed

As tool catalogs grow, stop stuffing every schema into the prompt. Tool search retrieves a small, risk-tagged candidate set per task so the model chooses better on a tighter budget.

9 min read Read
Patterns May 22, 2026

Evals that actually work: beyond the LLM-as-judge trap

LLM-as-judge evaluators are the second most popular eval pattern in 2026. They are also the most over-trusted. The eval pipelines at the teams shipping the fastest agents look almost nothing like the ones described in vendor blog posts — and the difference matters.

13 min read Read
GATE DA May 22, 2026

Rank-nullity and linear systems: one theorem, many GATE DA traps

How comparing rank(A) with rank([A|b]) decides whether a linear system has no solution, one solution, or infinitely many, and the determinant trap GATE DA sets for it.

10 min read Read
Patterns May 22, 2026

Human-in-the-loop is the control plane for agentic software

Human approval is not a UX interruption; it is the control plane that lets an agent pause at a risk boundary, take a typed accept/edit/reject decision, and resume.

9 min read Read
Agents May 22, 2026

Self-correction without infinite loops: agent stopping criteria that actually work

Reflection loops promised agents that fix their own mistakes. In production they more often produce agents that 'improve' working code into garbage, or worse, loop forever. Here's what the teams shipping real agents do instead.

13 min read Read
RAG May 22, 2026

When not to use RAG

RAG is the default answer to 'how do I give my LLM custom knowledge?' — and for a wide class of problems, it's the wrong answer. Long-context prompt caching, fine-tuning, and in-context learning each beat RAG in regimes where the costs and assumptions actually pencil out. Here's the decision tree.

12 min read Read
Career May 22, 2026

The workplace health gap: menstruation, fertility, and menopause

Menstruation, fertility, and menopause quietly shape the careers of much of the workforce, and the silence around them pushes experienced women out the door.

9 min read Read
Agents May 21, 2026

Claude Code, Cursor, and Aider — three answers to the same question

Three coding agents, three different bets on autonomy, context, and where the human belongs in the loop. The interesting part isn't which one wins — it's which assumptions each one refused to compromise on.

14 min read Read
Patterns May 21, 2026

Deterministic workflows vs autonomous agents: the 2026 boundary

The smartest agent architecture decision is often choosing not to let the model decide: fixed steps belong in a workflow, genuine uncertainty belongs in an autonomous agent.

10 min read Read
GATE DA May 21, 2026

SVD and PCA for GATE DA: the geometry behind dimensionality reduction

A concept-first bridge from singular values to principal components: how the SVD factorization A = UΣVᵀ becomes PCA's directions of maximum variance in centered data.

11 min read Read
Agents May 21, 2026

Replit Agent's architecture, two years in

Replit Agent launched in September 2024 and turned a nine-year-old IDE into a $150M-ARR business. The architecture is unfashionably explicit — plan, confirm, execute, checkpoint — and the bet that explicit checkpoints beat full autonomy is paying off. Here's how it actually works.

13 min read Read
Infrastructure May 20, 2026

Agent observability: traces that explain decisions

A trace with tokens and timings is not enough; production teams need decision spans, route, retrieve, select, approve, act, verify, that show why the agent chose each path.

9 min read Read
GATE DA May 20, 2026

Calculus and optimization: why derivative sign beats formula hoarding

Limits, continuity, differentiability, Taylor series, maxima, minima, and single-variable optimization as one story — read by derivative sign, not memorized formulas.

10 min read Read
Frameworks May 20, 2026

MCP is the USB-C of AI tools, finally

Anthropic shipped the Model Context Protocol in late 2024 as a small, JSON-RPC-shaped spec for connecting LLM clients to tools and data sources. Eighteen months later, it's the most adopted open protocol in the AI space — and the hype is, for once, an undercount.

12 min read Read
Agents May 20, 2026

v0 by Vercel: how prompt-to-UI actually works

v0 makes 'build me a dashboard' produce real Next.js + shadcn/ui code that ships. The trick wasn't a smarter model — it was a narrow constraint, a streaming preview loop, and an opinionated component library acting as scaffolding. Here's the architecture, the competitive landscape, and why most clones missed the point.

13 min read Read
RAG May 19, 2026

Agentic RAG needs retrieval evidence contracts

When retrieval becomes an agent tool, each result must carry provenance and evidence quality, not just text chunks, so the agent can judge whether it is actually grounded.

9 min read Read
Agents May 19, 2026

AI SREs in production: Resolve.ai, Cleric, Parity

A new category of on-call agent is being shipped into real production environments — agents that watch alerts, gather evidence, and propose remediations. Most of them are deliberately read-only. The architecture is converging, the MTTR numbers are real, and PagerDuty is responding from the incumbent side. Here's what's actually working.

13 min read Read
GATE DA May 19, 2026

Python and pseudocode tracing: the underrated scoring zone in GATE DA

How to turn GATE DA programming questions into marks by tracing state on paper: loop variables, data-structure updates, recursion frames, and scope, one row at a time.

9 min read Read
Infrastructure May 19, 2026

Speculative decoding in the wild: how labs cut latency by 2-3x

A small fast model proposes, the big slow model verifies. Across Medusa, EAGLE, and draft-model approaches, speculative decoding is the latency-reduction lever that frontier labs reach for when they can't make the model itself any smaller.

12 min read Read
Agents May 19, 2026

Why multi-agent swarms keep failing to ship

Two years after AutoGen and CrewAI promised teams of cooperating AI agents, the production scoreboard is brutal: orchestrator-workers ships, agent teams mostly don't. The reasons are structural, not stylistic — and they explain why every serious agent team has quietly converged on a much more boring shape.

13 min read Read
Patterns May 18, 2026

Agent memory is a product decision before it is a vector database

The hard part of agent memory is not storing facts but governing them: separating preferences, durable facts, task state, and audit history, each with its own retention and consent.

10 min read Read
GATE DA May 18, 2026

DSA complexity for GATE DA: count operations, not vibes

Solve GATE DA time-complexity questions by counting operations across loops, recursion, and data structures, instead of pattern-matching every nested loop to O(n squared).

10 min read Read
Patterns May 18, 2026

Getting tool calling to 99% reliability in production

Most production agent failures are not model failures. They are tool-call failures — the model picked the right tool but wrote the wrong arguments, or hallucinated an argument that doesn't exist. The fix is unglamorous and mostly about schema design.

12 min read Read
Agents May 18, 2026

Voice agents at scale: the Vapi, Retell, Bland.ai engineering

Voice agents are the unsexy success story of the agent era. Underneath the marketing they're all the same five-box pipeline — STT, LLM, TTS, turn-taker, telephony — fighting for the same 500ms latency budget. Here's how the three biggest platforms actually build it, what each one optimizes for, and where the real cost goes.

13 min read Read
Frameworks May 17, 2026

Agents as tools vs handoffs

Calling a specialist agent as a tool is not the same as handing the conversation to it: one keeps the main agent in control, the other transfers ownership of the dialogue.

9 min read Read
Agents May 17, 2026

Devin's architecture, anatomised

Cognition's launch demo promised a 'fully autonomous software engineer.' The product that actually ships is more interesting — a planner-executor split running in a sandboxed VM, with explicit memory windows, hard pruning rules, and a shockingly disciplined view of where the model is allowed to be creative.

14 min read Read
GATE DA May 17, 2026

Sorting and searching in GATE DA: when the simple algorithms matter

How GATE DA tests selection, bubble, insertion, merge sort, quicksort, and binary search: the invariants and single-pass tracing where the marks actually live.

10 min read Read
Frameworks May 17, 2026

We built the same agent three times: MAF, LangGraph, and ADK

Three frameworks, one problem: an invoice-processing agent that ingests PDFs, validates against a database, asks a human when uncertain, and writes back to an ERP. What each framework made easy. What each one made painful.

14 min read Read
Agents May 17, 2026

Sierra's customer-service playbook

Bret Taylor and Clay Bavor built the highest-profile agent company by making three opinionated bets — every brand needs its own AI, agents must run inside the brand's workflows, and pricing should track outcomes. Two years on, the bets are paying off in ways the industry is still copying.

13 min read Read
Infrastructure May 16, 2026

Agent cost engineering: tokens, tools, latency

The expensive part of agents is not just output tokens; it is repeated context, tool schemas, retries, and wall time. Measure cost per completed task, not cost per model call.

10 min read Read
Patterns May 16, 2026

AutoGPT to 2026: what survived

Three years after AutoGPT briefly broke GitHub's star counter, almost nothing in its original form has shipped. The interesting question is what *did* survive — and it's a sharper, more honest answer than the discourse of 2023 ever got close to.

12 min read Read
GATE DA May 16, 2026

Graph algorithms in GATE DA: BFS, DFS, shortest paths

Graph questions turn mechanical once you track visited, frontier, and distance. A GATE DA guide to BFS, DFS, and when shortest paths need Dijkstra instead of BFS.

10 min read Read
RAG May 16, 2026

Self-RAG and Corrective RAG, the loops that actually help

Two papers from 2023 argued RAG should be reflective: retrieve, judge, re-query if the evidence is bad. By 2026 the practitioner verdict is in — Self-RAG and CRAG actually do help, but only on the queries that need them, and the latency cost is real.

12 min read Read
Cloud May 15, 2026

Bedrock, Vertex, Foundry: pick the one whose spirit matches yours

Every comparison of the three hyperscaler AI platforms is a feature matrix that nobody reads twice. The actual decision is about which company's spirit your team can live with for the next five years — and that's a different question.

12 min read Read
Agents May 15, 2026

Coding agents need spec grounding more than longer prompts

The next jump in coding-agent reliability comes from grounding each phase in the codebase's real contracts: tests, types, and conventions, not from stuffing a longer prompt.

10 min read Read
Infrastructure May 15, 2026

Continuous batching: the one trick that made LLM serving 10x cheaper

Static batching wastes GPUs by padding to the longest sequence and waiting for stragglers. Continuous batching reschedules every iteration, evicts finished requests, and admits new ones mid-stream. Here's why that single shift bought a 10-23x throughput win.

11 min read Read
GATE DA May 15, 2026

Relational algebra in GATE DA: joins are set logic with column names

Read selection, projection, natural join, and set operators in GATE DA as row-and-column logic, so you stop losing rows by joining on the wrong shared attribute.

10 min read Read
Agents May 15, 2026

How Cursor's Composer actually works

Multi-file edits feel atomic in Cursor not because the model got smarter, but because the team built a stack of careful workarounds — speculative diffs, a separate Apply Model, and an indexer that stays one step ahead of you. Here's the engineering.

13 min read Read
Infrastructure May 14, 2026

Agent identity across MCP, A2A, and audit

Once agents call tools and delegate to other agents, every action needs an identity chain tying user, agent, tool, policy tier, and approval into one auditable record across MCP and A2A.

9 min read Read
Infrastructure May 14, 2026

Computer-use latency engineering: getting browser agents under a second

A naive browser-use loop is 4-8 seconds per step. Production systems run at 600ms. The gap is closed by half a dozen techniques — differential screenshots, prompt caching, batched actions, vision-model routing — each of which sounds boring until you measure the difference.

13 min read Read
GATE DA May 14, 2026

SQL NULLs, keys, and constraints: tiny DBMS details that move marks

Three-valued logic, candidate keys, and referential integrity are the small DBMS details GATE DA loves to test. Here is how NULL, keys, and constraints actually behave in SQL.

10 min read Read
Frameworks May 14, 2026

Google's Agent Development Kit, and when to reach for it

ADK is Google's bet on agent infrastructure: a thin SDK that takes Gemini's strengths — tool calling, grounding, multimodality — and makes them deployable to Vertex AI Agent Engine in one command. It is less framework than LangGraph, more framework than calling the model directly, and uniquely useful inside Google Cloud.

11 min read Read
RAG May 14, 2026

Late interaction, or why ColBERT keeps coming back

Single-vector dense retrieval crushes one nuance per chunk into one point in space. Late-interaction models like ColBERT keep a vector per token and decide similarity at query time. The math is beautiful. The storage cost is brutal. Here's when it's worth it.

11 min read Read
Agents May 13, 2026

Agentic analytics needs SQL guardrails before clever reasoning

A data-analysis agent is only trustworthy when its SQL, permissions, and metric definitions are controlled. Agentic AI guide with sources, production risks, eval checks.

10 min read Read
GATE DA May 13, 2026

Data warehousing and OLAP for GATE DA

How GATE DA tests analytical data shape: fact-table grain, star vs snowflake schemas, concept hierarchies, and the additive-versus-ratio measure trap on OLAP rollups.

9 min read Read
Patterns May 13, 2026

Sub-agents, handoffs, supervisors — pick exactly one

Three topologies dominate multi-agent systems in 2026, and teams keep mixing them in ways that break debuggability. Sub-agents are parallel scoped work that returns to a parent. Handoffs are baton-passing with no return. Supervisors are explicit central routers. They are not interchangeable. Here's the framework for picking, and the failure modes when you don't.

13 min read Read
Infrastructure May 12, 2026

Prompt caching: the 90% cost cut explained

Anthropic, OpenAI, DeepSeek and Google all ship prompt caching now, but the pricing models diverge sharply. For a tool-using agent with a long system prompt, getting the cache pattern right is the difference between a viable product and a P&L disaster.

12 min read Read
Cloud May 12, 2026

Cloud Run is the most underrated platform for AI agents

Everyone deploys agents on GKE or EC2. The teams quietly shipping the fastest, cheapest production agents have moved to Cloud Run + Gemini + Firestore — and the reasons are not what you'd guess from the marketing pages.

11 min read Read
GATE DA May 12, 2026

Regression, ridge, and bias-variance: the ML core of GATE DA

How OLS, ridge, the bias-variance trade-off, and cross-validation fit together: ridge trades a little bias for less variance, and the U-shaped error curve shows the sweet spot.

11 min read Read
RAG May 12, 2026

Hybrid search, when one retriever isn't enough

Pure vector search misses exact identifiers. Pure BM25 misses paraphrases. Every serious production RAG stack — Notion, Perplexity, Vespa, Glean — runs both and fuses the results. Here's the math, the latency budget, and the production knobs that matter.

12 min read Read
Agents May 12, 2026

Research agents need citation quality, not quantity

A long bibliography can hide weak grounding. Strong research agents grade each claim on source type, recency, directness, and conflict status instead of counting links.

9 min read Read
Infrastructure May 12, 2026

vLLM vs TGI vs SGLang: choosing your inference server in 2026

Three open-source serving stacks, three different bets. vLLM optimises raw throughput. TGI optimises ease and HuggingFace integration. SGLang optimises structured generation and prefix-cache reuse. Here's how to pick.

13 min read Read
Agents May 11, 2026

The 2026 agentic AI stack map: model, memory, tools, workflow, evals

A clear map of the layers that now appear in serious agent products. Agentic AI guide with sources, production risks, eval checks, and datarekha learning links.

11 min read Read
GATE DA May 11, 2026

Logistic regression, Naive Bayes, and LDA for GATE DA

Logistic regression, Naive Bayes, and LDA compared the way GATE DA tests them: discriminative versus generative, and the exact assumption each one makes about the data.

11 min read Read
Patterns May 11, 2026

Long-horizon agent failure modes: context rot, drift, looping

An agent that handles ten turns gracefully will likely collapse at fifty. The reasons aren't bugs — they're emergent properties of how transformers handle growing context. Here's a field guide to how agents disintegrate, with the public failure stories that mapped the territory.

13 min read Read
Frameworks May 11, 2026

Microsoft Agent Framework, six months in

Microsoft consolidated Semantic Kernel and AutoGen into a single Agent Framework in late 2025. Half a year later, the picture is clear: MAF is the Azure-native option for shops that already speak C# and .NET, and its workflow/agent split is the most pragmatic in the industry.

11 min read Read
Agents May 10, 2026

Coding agents in 2026: Cursor, Devin, Sweep, Aider, Claude Code compared

Two years after Devin launched and froze the term 'AI software engineer' in the popular imagination, here's where everyone actually landed. The dominant tools didn't converge — they specialized, and the ones that won did so by picking a workflow lane and dominating it.

14 min read Read
RAG May 10, 2026

Contextual Retrieval, two years on

Anthropic's September 2024 trick — prepend a sentence of context to each chunk before embedding — quietly became the new RAG default. Here's why a 49% retrieval-error reduction held up, what the numbers actually say, and when you still shouldn't bother.

11 min read Read
GATE DA May 10, 2026

SVM, KNN, and decision trees: geometry, memory, and splits

Three high-yield classifiers, three different inductive biases: SVM maximizes a margin, KNN votes over local neighbours, and decision trees split feature space recursively.

10 min read Read
Agents May 10, 2026

Sandbox agents are the new runtime for serious long-horizon work

Why long-horizon agents moved from plain API calls to controlled sandboxes with files, commands, fixed tools, snapshots, and a declared output directory.

10 min read Read
Infrastructure May 9, 2026

Agent manifests make workspaces portable

A manifest declares an agent's inputs, mounts, storage, and outputs so the same workspace rebuilds identically from local prototype to production sandbox, and stays replayable.

9 min read Read
Agents May 9, 2026

Deep research agents explained: Perplexity, GPT Deep Research, Gemini Deep Research

Deep Research was OpenAI's first agent product to actually ship, and it landed in a market where Google had already been live for months and Perplexity had built the search-citation pipeline the others were chasing. The three converged on similar capabilities but radically different architectures.

13 min read Read
GATE DA May 9, 2026

K-means and hierarchical clustering for GATE DA

What GATE DA expects from clustering: the k-means assign/update loop and its WCSS objective, hierarchical linkage rules, and why cluster labels are never ground-truth classes.

10 min read Read
Infrastructure May 9, 2026

How Anthropic serves a hundred million tokens a second

The frontier labs and their serving partners — Anthropic, OpenAI, Together AI, Fireworks — hide an arsenal of inference optimisations behind the simple-looking chat endpoint. Here's the hierarchy, what each layer bought, and the public numbers behind it.

13 min read Read
Infrastructure May 8, 2026

Snapshotting and rehydration: durable execution for agents

How snapshotting and rehydration keep long-running agents alive across container loss, tool timeouts, and user pauses by moving task state out of disposable compute.

10 min read Read
Agents May 8, 2026

Agentic data analysis: PandasAI, Hex Magic, Julius — and why it's harder than it looks

'Just talk to your data' has been the cleanest agentic AI demo for two years. The shipped reality is much narrower than the demos suggest, because the gap between answering a question and answering it cheaply, safely, and correctly is enormous. Here's where the production line lives.

13 min read Read
GATE DA May 8, 2026

MLP and backprop for GATE DA

Read an MLP as stacked affine-plus-activation layers and backprop as the chain rule over a computation graph. A GATE DA guide with a worked forward-and-backward pass.

10 min read Read
Frameworks May 8, 2026

LangGraph is what you reach for when the agent has to remember

By mid-2026 LangGraph is the default orchestration layer for stateful, long-running, human-in-the-loop agents. Not because of the API — because of the checkpoints, the interrupts, and the fact that the state outlives the request.

11 min read Read
Patterns May 7, 2026

The agent observability stack: LangSmith, Langfuse, Helicone, Arize

You can't ship agents without observability. The 2026 stack has settled into three layers — traces, evals, drift — and five vendors who each won by being unmistakably the best at one of them. Here's who picks which, and why.

13 min read Read
GATE DA May 7, 2026

AI search in GATE DA: admissibility, consistency, and why A* works

A clean mental model for uninformed, informed, and adversarial search in GATE DA: what admissible and consistent mean, and why an admissible heuristic makes A* optimal.

10 min read Read
Patterns May 7, 2026

Skills are progressive disclosure for agents

Why the next agent primitive is not another tool but a skill: a compact package of instructions, examples, schemas, and tools the agent loads only when the task calls for it.

9 min read Read
Agents May 6, 2026

Agents for legal, finance, healthcare — the high-stakes pattern

Three industries where hallucination is malpractice. The agent companies winning in legal, finance, and healthcare have converged on the same shape — scoped tasks, retrieval over generation, mandatory human checkpoints, audit logging by default. Here's what that pattern looks like in production at Harvey, Hebbia, Abridge, and Suki.

14 min read Read
Agents May 6, 2026

AGENTS.md is becoming the repo contract for coding agents

A repo-level AGENTS.md gives coding agents what humans already rely on: local conventions, test commands, and boundaries, versioned in the repo instead of lost in a chat prompt.

9 min read Read
GATE DA May 6, 2026

Minimax and alpha-beta: adversarial search as disciplined pessimism

How GATE DA game-tree questions become mechanical once you respect player turns and bounds: back up MIN/MAX values, then prune with alpha-beta.

9 min read Read
Infrastructure May 5, 2026

Agentic commerce needs payment protocols, not just checkout buttons

When agents buy things or meter services, authorization, settlement, receipts, and disputes become protocol problems. Agentic AI guide with sources, production risks.

10 min read Read
GATE DA May 5, 2026

Logic in GATE DA: propositional form before predicate ambition

A compact GATE DA guide to propositional and predicate logic: implication truth values, quantifier negation, and the universal-implication, existential-conjunction pairing.

9 min read Read
Patterns May 5, 2026

Why XGBoost is still winning in 2026

Twelve years after the original paper, and despite a parade of transformer-based challengers, gradient-boosted trees remain the default for tabular data at most enterprises. Here's why the boring answer keeps winning — and what TabPFN, CatBoost, and LightGBM actually changed.

13 min read Read
Frameworks May 4, 2026

Agent Builder vs SDK: product surface or engineering surface?

A hosted agent builder is a product surface and an SDK is an engineering surface; choose by who gets paged for failures, not by which demo looks slicker.

9 min read Read
GATE DA May 4, 2026

Bayesian networks and inference for GATE DA

Conditional independence, exact inference by variable elimination, and approximate inference by sampling — the Bayesian-network ideas GATE DA actually tests.

11 min read Read
Infrastructure May 4, 2026

MLOps platform consolidation: Databricks, Snowflake AI, SageMaker

The MLOps landscape of 2023 — a dozen point tools, three feature stores, four monitoring vendors, two training frameworks — has collapsed. By 2026 the workload lives on Databricks, Snowflake Cortex, SageMaker Unified Studio, or Vertex. Here's what each platform actually does, what got swept under, and who wins which workload.

13 min read Read
Infrastructure May 3, 2026

Feature stores in 2026: Tecton, Feast, Hopsworks — death and rebirth

The feature store hype cycle went peak (2021), trough (2023, 'just use dbt'), and into a quieter rebirth as the data-for-AI layer for both classical ML and LLM agents. Tecton sold to Databricks. Feast survives as the open-source baseline. Hopsworks redefined itself as an AI lakehouse. Here's what the modern feature store actually is in 2026.

13 min read Read
GATE DA May 3, 2026

Data preprocessing in GATE DA: leakage and scaling

How normalization, discretization, and sampling change what a GATE DA model learns, and why fitting them before the train-test split is the leakage trap examiners test.

10 min read Read
Patterns May 3, 2026

Guardrails are runtime checks, not moral adjectives

A guardrail earns its name only when it changes control flow. The useful ones are runtime predicates that block, redact, route, or pause an action for approval.

9 min read Read
GATE DA May 2, 2026

Cross-validation in GATE DA: estimate generalization, not happiness

LOO and k-fold cross-validation for honest model selection: why every preprocessing step belongs inside the fold, and how tuning on the test set leaks into your estimate.

9 min read Read
Infrastructure May 2, 2026

MCP code execution without context bloat

Anthropic's MCP code-execution pattern points to a deeper rule: compute belongs in a sandboxed workspace, and only compact results — summaries, artifacts, previews — return to the prompt.

10 min read Read
Infrastructure May 2, 2026

The vector database shakeout: Pinecone, Weaviate, Qdrant, Chroma, pgvector

The pure-play vector DB category isn't dead, but it's smaller than the 2023 hype suggested. Postgres ate the low end, Pinecone owns serverless ops, Qdrant is the open-source cost winner, and Weaviate's hybrid search is its real moat. Here's the 2026 picture by the numbers.

12 min read Read
Patterns May 1, 2026

Agent UI needs event streams, not fake typing indicators

A useful agent interface shows plans, tool calls, approvals, artifacts, and progress as structured events. Agentic AI guide with sources, production risks, eval checks.

9 min read Read
RAG May 1, 2026

Embeddings in 2026: Voyage, Cohere v4, OpenAI text-embedding-3, mxbai, BGE

The embedding model market split clean. Proprietary models (Voyage 3, Cohere v4, OpenAI v3) own the quality top; open-source (mxbai, BGE-M3, Jina v3, E5) own cost and self-host. Matryoshka rewrote the cost economics. Here's how to actually choose.

12 min read Read
GATE DA May 1, 2026

Classification metrics: GATE DA’s confusion-matrix language

How a 2x2 confusion matrix defines accuracy, precision, recall, and F1 for GATE DA, plus the imbalanced-data trap and a words-before-formulas drill.

10 min read Read
Infrastructure Apr 30, 2026

Agent run state is an API surface

The run object is where product logic meets model behavior. Agentic AI guide with sources, production risks, eval checks, and datarekha learning links.

9 min read Read
GATE DA Apr 30, 2026

General Aptitude for GATE DA: 15 marks done right

General Aptitude is a fixed 15 marks on every GATE paper, split across verbal, quantitative, analytical, and spatial reasoning. How DA candidates should prepare it.

9 min read Read
RAG Apr 30, 2026

Reranking at scale: cross-encoders, ColBERT, mxbai

Retrieve-then-rerank is the single biggest accuracy improvement in production RAG. Three approaches dominate — cross-encoders, late interaction, API rerankers. Here's what each costs, what each wins, and the latency budget that decides between them.

12 min read Read
Frameworks Apr 29, 2026

A2A agent cards are service discovery for agents

An A2A agent card is service discovery for agents: the document that tells a caller what a remote agent can do, how to authenticate, and where its limits are before any work is delegated.

9 min read Read
Infrastructure Apr 29, 2026

Distributed training: FSDP vs DeepSpeed vs Megatron in production

For serious pretraining or fine-tuning, you pick from three: FSDP2 (PyTorch native), DeepSpeed (ZeRO stages), or Megatron-LM (NVIDIA, 3D parallel). The frontier labs have made their bets — Llama 3 went Megatron, FSDP2 is the open-source default under 70B, and DeepSpeed survives where ZeRO offload is necessary. Here's how to choose.

13 min read Read
GATE DA Apr 29, 2026

The PYQ method for GATE DA: solve less randomly, learn more deeply

A repeatable topic-wise PYQ workflow for GATE DA: solve, classify by concept and trap, write a number-swapped variant, then log it in a ledger and retry.

10 min read Read
Infrastructure Apr 28, 2026

The agent protocol stack: MCP, A2A, payments, and identity

Production agents split tool access, delegation, payments, and identity into distinct protocol layers: MCP for tools, A2A for delegation, payment mandates, and identity for audit.

10 min read Read
GATE DA Apr 28, 2026

A 90-day GATE DA revision plan that respects how memory actually works

A 90-day GATE DA plan built on retrieval practice, interleaving, PYQs, and mock feedback — three phases that respect how memory actually consolidates.

10 min read Read
Infrastructure Apr 28, 2026

Mixture of experts in production: Mixtral, DeepSeek, Llama 4

MoE went from research curiosity in 2021 to production default in 2025. The new frontier-model template is sparse, with 5-15% active parameters per token. Here's how the serving infrastructure caught up, what vLLM and SGLang actually do with experts, and the cost math that makes MoE the future.

13 min read Read
GATE DA Apr 27, 2026

GATE DA formula sheets that work: properties over decoration

A GATE DA formula sheet that works gives each entry five fields: formula, when it is valid, the common trap, a tiny example, and a linked PYQ — not just equations.

9 min read Read
Infrastructure Apr 27, 2026

Quantization in production: GPTQ, AWQ, GGUF, FP8 — what to ship

16-bit serving is dead for most production workloads. Here's how to pick between weight-only post-training quantization, activation-aware quantization, and the FP8 native path — without measuring it wrong.

12 min read Read
Agents Apr 27, 2026

Voice agents need turn-taking architecture

Voice agents are not chatbots with speech bolted on. Barge-in, latency, and spoken confirmation demand a turn-taking state machine: listen, acknowledge, clarify, act, confirm.

9 min read Read
Patterns Apr 26, 2026

Agent privacy is data retention plus tool policy

Agents touch files, tools, logs, memory, and external systems, so privacy means a retention policy per object type plus a tool policy, not just securing the model call.

9 min read Read
GATE DA Apr 26, 2026

What GATE DA teaches about real data science

The GATE DA syllabus is a compact map of the data science stack: probability, linear algebra, calculus, Python, DSA, DBMS, ML, and AI each cover one real layer.

11 min read Read
Patterns Apr 26, 2026

ML platform build vs buy: a decision framework for 2026

Most teams default to one of three answers — Databricks, SageMaker, or 'we'll roll our own.' All three are wrong for the wrong team. Here's how to pick by team size, model variety, and latency budget — with the war stories that explain why.

13 min read Read
Patterns Apr 25, 2026

The agent deployment checklist: what must exist before launch

A production agent launch needs more than a passing demo: evals, tracing, permission boundaries, rollback, rate limits, incident playbooks, and user-visible failure states all in place first.

10 min read Read
GATE DA Apr 25, 2026

Build a GATE DA error ledger: the highest-ROI notebook you can keep

A structured GATE DA error ledger turns mocks into learning events: log every lost mark, classify it as a concept or decision error, repair it, then schedule a retry.

9 min read Read
Patterns Apr 25, 2026

Model monitoring in 2026: from accuracy to behavior drift

Classical ML monitoring tracks accuracy decay. LLM monitoring tracks something stranger — the model itself silently changing underneath you. Here's what production observability looks like when the failure modes don't fit the dashboards you built five years ago.

12 min read Read
Infrastructure Apr 24, 2026

Agents and background jobs: queue the work, stream the milestones

Long agent tasks should look like jobs, not HTTP requests that forgot to end: return a run id, enqueue the work, stream milestones, and let the backend own retries and state.

9 min read Read
GATE DA Apr 24, 2026

Interleaving GATE DA topics for mixed-practice gains

Blocked practice feels fluent but trains the wrong skill. Why interleaving GATE DA topics builds the question-recognition the real exam demands, and how to schedule it.

9 min read Read
Patterns Apr 24, 2026

LLM recommenders: Netflix, Spotify, and the post-collaborative-filtering era

Recommendation systems are absorbing LLMs, but almost never the way Twitter expects. Netflix and Spotify both run hybrid stacks — classical CF and content-based ranking as the workhorse, LLMs at the edges for explanation, cold start, and conversation. Pure LLM-as-recommender doesn't survive contact with YouTube-scale latency or freshness.

13 min read Read
GATE DA Apr 23, 2026

The high-value links between linear algebra and ML in GATE DA

Regression is projection, PCA is eigenvectors of covariance, SVM is hyperplanes — a GATE DA concept map linking the linear algebra objects under each ML method.

10 min read Read
RAG Apr 23, 2026

Multi-hop RAG without the hallucinations

Single-hop RAG works until your users start asking questions that require synthesis. Naive multi-hop hallucinates intermediate facts. Here's the working architecture — explicit decomposition, iterative retrieval, self-verification — with the production examples from Harvey and Hebbia that actually ship it.

13 min read Read
Infrastructure Apr 23, 2026

Tool-output prompt injection in agent systems

The moment an agent reads external data, instructions can arrive disguised as evidence. How to keep tool output as data with trust boundaries, minimal permissions, and eval checks.

10 min read Read
Patterns Apr 22, 2026

Agent quality is replayability

If you cannot replay an agent failure, you cannot reliably improve it. Capture enough to reconstruct each run, promote failures into evals, and keep them fixed before merge.

9 min read Read
GATE DA Apr 22, 2026

For GATE DA, primary sources beat rumor: how to read official material

A source-literacy guide for GATE DA: trust the official syllabus, papers, answer keys, and cutoffs first, treat coaching commentary as secondary, and rumor last.

9 min read Read
RAG Apr 22, 2026

GraphRAG: when knowledge graphs beat vector search

Microsoft's GraphRAG paper showed that building a knowledge graph from documents and querying community summaries beats vector RAG on global questions. Two years on, the receipts are mixed — the technique works, but the indexing bill is what most teams remember.

13 min read Read
Agents Apr 21, 2026

Agentic tutors need misconception tracking, not answer vending

Why the best learning agents diagnose a learner's misconception before explaining: ask for the working, classify the error, give the smallest hint, then adapt.

10 min read Read
RAG Apr 21, 2026

Agentic RAG: when the agent decides whether to retrieve

Naive RAG retrieves on every query. Agentic RAG turns retrieval into a tool the agent calls when the answer isn't already obvious — and the production wins are about latency and accuracy at once, not either alone.

12 min read Read
GATE DA Apr 21, 2026

Counting for GATE DA: avoid overcounting

Counting is the base layer under most probability questions: when order matters use permutations, when it does not use combinations, and the main trap is double-counting.

10 min read Read
GATE DA Apr 20, 2026

Covariance and correlation: the geometry GATE DA expects

Correlation is covariance rescaled to [-1, 1]: how to read sign and magnitude, why units cancel, and why zero correlation does not by itself mean independence.

10 min read Read
RAG Apr 20, 2026

Long-context vs RAG: the 2026 verdict

Gemini's 2M-token context, GPT-5's million, Claude's 200K. The argument that long context would kill RAG turned out wrong — but for reasons more interesting than 'cost is high'. Here's where the equilibrium actually settled.

13 min read Read
GATE DA Apr 19, 2026

Determinant, inverse, and eigenvalues: one linear algebra triangle

For a square matrix, invertibility, nonzero determinant, full rank, and a missing zero eigenvalue are the same fact — the one linear algebra triangle GATE DA tests.

10 min read Read
RAG Apr 19, 2026

PDF parsing remains unsolved: LlamaParse, Reducto, Unstructured, Marker

Two years into the production RAG era, the single biggest blocker for most enterprise deployments isn't the LLM, the vector store, or the retrieval algorithm. It's whether your PDF parser got the table right. The vendors have multiplied; the problem hasn't been solved.

13 min read Read
GATE DA Apr 18, 2026

LU decomposition and Gaussian elimination: row operations with memory

LU decomposition is Gaussian elimination with its steps stored as triangular factors. A GATE DA guide to L, U, the role of pivoting, and why it makes repeated solves cheap.

9 min read Read
RAG Apr 18, 2026

RAG evaluation in production: Ragas, Phoenix, Braintrust

RAG evaluation is harder than LLM evaluation. Retrieval and generation fail for different reasons and need different metrics. Two years after Ragas formalized the four-metric core, the tooling has split into three camps — here's what each one actually buys you.

13 min read Read
Patterns Apr 17, 2026

Fine-tuning vs RAG: the settled debate of 2026

Three years of Twitter brawling about whether to fine-tune or retrieve has ended in a boring, useful answer: RAG for knowledge, fine-tuning for behaviour, and both when you actually need both. Here's the decision table production teams actually use.

12 min read Read
GATE DA Apr 17, 2026

Quadratic forms and convexity: why eigenvalues decide shape

Why the eigenvalue signs of a symmetric matrix decide whether a quadratic form is a convex bowl, a concave dome, or a saddle, and how that drives optimization in GATE DA.

10 min read Read
GATE DA Apr 16, 2026

Taylor, L'Hopital, and limit techniques: choose the smallest weapon

Most GATE DA limit questions fall to algebra or a standard approximation before Taylor or L'Hopital are needed. Name the indeterminate form first, then pick the smallest tool.

9 min read Read
Infrastructure Apr 16, 2026

KV cache management: paged attention, prefix caching, LMCache

The KV cache is the dominant memory bottleneck in LLM serving, and three ideas — PagedAttention, prefix caching, and cross-instance LMCache — have rewired how it's managed. Here's how each layer earns its place in production.

13 min read Read
GATE DA Apr 15, 2026

Functional dependencies and normal forms for GATE DA

Normal forms become clear once you compute attribute closure first. A GATE DA guide to functional dependencies, candidate keys, and the 2NF, 3NF, BCNF tests.

10 min read Read
Infrastructure Apr 15, 2026

NVIDIA Dynamo, vLLM, SGLang: serving stacks at scale

NVIDIA's Dynamo (GTC 2025) reframes LLM serving around prefill/decode disaggregation. vLLM, SGLang and TensorRT-LLM all sit underneath. Here's how the four-layer stack actually works, what the throughput numbers really mean, and who picks which.

13 min read Read
GATE DA Apr 14, 2026

Indexing and file organization: the DBMS performance line in GATE DA

How primary, secondary, hash, and B+-tree indexes change DBMS access paths in GATE DA — what each speeds up, what it costs on writes, and the traps examiners set.

9 min read Read
Infrastructure Apr 14, 2026

Inference routing: sending each query to the cheapest model that can answer it

By 2026 the biggest lever on inference cost isn't quantisation or batching — it's deciding which model touches each query. Four routing patterns, three rounds of vendor consolidation, and a real case study where a customer support agent cut spend 80% with no measurable quality drop.

13 min read Read
GATE DA Apr 13, 2026

ER model to relational schema: how GATE DA turns diagrams into tables

How GATE DA converts an ER diagram into tables: strong and weak entities, foreign keys for one-to-many, and junction tables for many-to-many, without dropping cardinality.

9 min read Read
Cloud Apr 13, 2026

Serverless LLM platforms: Modal, Together AI, Fireworks, Replicate

Four platforms, four different bets on what serverless inference should be. Modal is the GPU-as-Python-function. Together is the model marketplace. Fireworks is the speed obsessive. Replicate is the prosumer API. Here's who picks which, and the cost numbers behind each.

13 min read Read
GATE DA Apr 12, 2026

Tuple calculus: DBMS queries as logic

Tuple relational calculus reads cleanly once you treat it as quantified logic over rows: it describes which rows you want, not the steps to fetch them, unlike relational algebra.

9 min read Read
Patterns Apr 12, 2026

Structured outputs engineering: JSON mode, function calling, constrained decoding

Three families of techniques get LLMs to return parseable data. One of them is a guarantee. The other two are negotiations. Here's when each one earns its keep — and the production regression that nobody warns you about.

12 min read Read
Frameworks Apr 11, 2026

CrewAI vs LangGraph vs AutoGen: the ecosystem reality of 2026

Three years into the agent framework era, only two of those three names belong in the same sentence in 2026. Here's the actual usage picture — stars, downloads, who's running what in production, and where each one quietly wins.

12 min read Read
GATE DA Apr 11, 2026

Conditional expectation and variance: beyond basic probability

Why E[X|Y] is a random variable until Y is observed, how the law of total expectation and variance decomposition work, and the conditioning traps GATE DA likes to set.

10 min read Read
Frameworks Apr 10, 2026

DSPy: declarative prompting in production

Stanford's DSPy proposes 'programming, not prompting' — declare a signature, compose modules, let an optimizer write the prompt. The pitch is irresistible; the production reality is more selective. Where DSPy actually ships, and why most teams quietly keep their prompts.

12 min read Read
GATE DA Apr 10, 2026

Poisson and exponential: counts and waiting times in one story

Poisson counts events in an interval; exponential times the wait until the next one — both run on one rate lambda. A GATE DA guide to the formulas and traps.

9 min read Read
GATE DA Apr 9, 2026

Gradient descent in GATE DA: step size is the algorithm

The gradient-descent update is one line, w minus eta times the gradient, but the step size eta decides everything. A GATE DA guide to the one-step NAT and its traps.

10 min read Read
Frameworks Apr 9, 2026

Pydantic AI: typed agents for the Python ecosystem

The Pydantic team built the framework most FastAPI engineers were quietly wishing for. Schema-first, model-agnostic, async-native — the agent framework that feels like Python production code instead of a research demo.

12 min read Read
GATE DA Apr 8, 2026

Perceptron vs logistic regression for GATE DA

Perceptron and logistic regression both draw a linear boundary, but one updates only on mistakes and the other minimizes a probabilistic loss. A GATE DA comparison.

9 min read Read
Frameworks Apr 8, 2026

Inspect AI: the UK AISI's eval framework everyone copied

A government safety institute released an MIT-licensed eval framework in 2024. By 2026 it's the de facto standard at Anthropic, OpenAI, DeepMind, and the enterprise teams that take evals seriously. Here's why it won — and what the alternatives still get right.

12 min read Read
GATE DA Apr 7, 2026

MLP parameter counts: the easiest neural-network marks to stop losing

Count MLP parameters layer by layer for GATE DA: a dense layer is d*h weights plus h biases. Includes a worked 3-4-2 network and the bias trap to avoid.

9 min read Read
GATE DA Apr 6, 2026

KNN and distance scaling: when units secretly choose the neighbor

In KNN, distance defines the neighbours, so unscaled features quietly decide the prediction. A GATE DA walk-through of the majority vote and why normalization matters.

9 min read Read
GATE DA Apr 5, 2026

SVM margin and kernel: the GATE DA version without mysticism

The core SVM idea is margin maximization, and support vectors are the points that pin the boundary; kernels just change the feature space where that maximization happens.

10 min read Read
GATE DA Apr 4, 2026

Data interpretation for GATE DA: tables and charts as executable text

Data interpretation is a scoring zone once you fix the denominator before computing: reading tables and charts as ratios, percentages, and trends without shifting the base.

9 min read Read
GATE DA Apr 3, 2026

Reading comprehension for technical GATE DA learners

Reading comprehension in GATE is evidence discipline, not literary taste: classify each question, answer only from the passage, and underline the line that proves it.

9 min read Read
GATE DA Apr 2, 2026

Spatial aptitude: visual reasoning without panic

Rotation, reflection, adjacency, and overlap questions become manageable when you track one invariant feature at a time instead of mentally spinning the whole figure.

9 min read Read
GATE DA Apr 1, 2026

Variable elimination workflow: exact inference as factor bookkeeping

Variable elimination is just factor bookkeeping: gather the factors holding a hidden variable, multiply, sum it out, and carry the new factor forward until the query remains.

10 min read Read
Skip to content