Friday, June 26, 2026

Worst Case Thinking

2026-Jun-26
WORST CASE SCENARIO       

11 Jun: Abhishek Chakraborty disclosed the termination news — taking my forced resignation instead.
26 Jun: 15 days later.
No other job offer in hand.
A couple of disappointments from a couple of interviews.

Yesterday, Abhishek Chakraborty said I may be getting terminated for deleting my resignation record from my SuccessFactor profile.
That means no salary for June and a marked documentation and blacklisting at IBM.
Read this again: "No salary for June."
I don't have enough in my current account even for next 30 days.
34 + 8K is non-negotiable (42K).
Let's say 50–60K with some expenses even if I live in Tri Nagar.

I have about 2.4L in savings/investments.
That money will last for about 4–5 months.

So, I have about 4 months to find a job (or find a source of money).

Worst case thinking -- If I don't find a job in 4 months:

# I would default on Home Loan.
# Run out of ~2.5 lakhs of savings.
# Would have to redeem other/remaining 10L at (possibly) a loss.

Okay, so 10L might buy me another 20 months (1 year and 8 months) of time.
And after that, I would run out of every penny.
Would have to say goodbye to ***.
AND: Too big and wide a gap in my resume.

How much money needed to live by?

19 Jun 2026

The problem isn't dying — but rather in living.
If I were dying, I wouldn't have to worry about finances or relationships. But if I were to live I'd have to calculate the bare minimum I would need to live by my months and days.
After a more rigorous thinking, I have reduced my monthly spends down to 70–80K (which is not a low number either) but down from 1L by 20–30%.
In this 70–80K:

35K is for paying Home loan EMI.
Rest 35K–45K is for everything else.

Breakdown:

35K: Home loan EMI

8K: Society dues (monthly)

1.2K: Internet

3K: Electricity

4.5K: Food

1K: Milk

4K: Subscriptions like DL, AI, WebMedia, PythonAnywhere...

That's about 57K.
So, 13K for reasons like:

1: Medicine
2: Gadgets
3: Clothes
4: Travel
5: Car/Scooty maintenance
6: Petrol/Gas
7: Cooking gas
8: Or any other luxuries
9: House maintenance (MMV and Tri Nagar)

Effectively, we would run out of 70K, or average out over the time at 70K.
Doing a rough math for an FD @ 6.5% p.a. interest rate: we would need about 1.3 Cr to make enough money for our monthly spends.


If I were to live longer, I may need to have some relationships. And "Building and Managing Relationships" is what I am worst at. Totally clueless.

Interview at IBM For Nestle (Round 1) for Lead Data Scientist Role

Index For Interviews Preparation    « Previously    Next »

Interview Critique Report

Lead Data Scientist
Performance Review

Interviewer : Aman Candidate : Ashish Mode : Technical Screening Questions Covered : 6
Section 01

Structured Transcript

The call was disrupted by two power outages, causing the self-introduction to be delivered twice. The exchange below is reconstructed in order, stripped of repetitions, filler, and the informal closing chat about location.

#
Interviewer Question
Candidate Response — Summary
01
Tell me about yourself.
13 years total (11 AI/ML, 2 software engineering). With IBM since April; prior at Accenture building an agentic analytics platform (text-to-SQL, RAG, visualization, narrative writing). Stack: LangChain, LangGraph, CrewAI, HuggingFace, PyTorch, TensorFlow, scikit-learn, PySpark.
02
Explain the bias-variance trade-off.
Bias = simplistic model assumptions, underfitting (high train + test error). Variance = overfitting (low train error, high test error). Mitigations: increase complexity and add features for bias; apply regularization and reduce features for variance.
03
What is the attention mechanism?
Creates a matrix of token-to-token relevance in transformers. Illustrated via co-reference: in "the cat sat on the mat because it was tired," attention determines whether "it" refers to "cat" or "mat."
04
What is the role of the feed-forward network in a transformer, after the attention layer?
"Basically used to get the probabilities… uses a softmax layer as the activation step." Candidate acknowledged the details were not fully clear.
05
Explain random forest — qualities, assumptions, and construction.
Ensemble of decision trees to reduce single-tree overfitting. Two sources of randomness: (a) random subset of training data per tree, (b) random subset of features per tree. Final prediction via hard or soft voting across all trees.
06
What is the trade-off between fine-tuning and RAG? Why choose RAG over fine-tuning?
RAG = retrieve context from proprietary data, augment the prompt before generation. Fine-tuning = adapt LLM behavior and output format to reduce repetitive prompt instructions. Used text-to-SQL as example: fine-tuned on Q-A pairs so the model defaults to PostgreSQL syntax without being told each time. Framed as complementary tools, not competing choices.
Section 02

Critique — Question by Question

Each exchange is assessed for accuracy, depth, and delivery. The left border signals overall quality: green = strong, amber = partial / gaps, red = factual error.

01 Self-Introduction Needs Work
What You Did

You covered the right inventory — experience quantum, current and prior company, flagship project with four named capabilities, and tech stack. The second delivery (post power cut) was more composed. Naming the agentic platform and its four distinct capabilities was the strongest part.

Weak Points
  • No business impact. "I built an agentic platform" is an activity, not an achievement. How many enterprise clients? What latency reduction, adoption rate, or time-to-insight gain did your work produce? The absence of numbers makes the experience abstract.
  • Weak qualifiers throughout. "Pretty good," "pretty familiar," and "some familiarity with PySpark" are automatic status-reducers. A Senior candidate either owns a skill or precisely names the boundary — not hedges.
  • No positioning thesis. The introduction was a chronological inventory of dates, companies, and tools — not a narrative about why you are the right candidate for this specific role.
  • Structural disarray from the power cut compounded the impression of an unpolished opener. The first 60 seconds of an interview set the credibility frame for every answer that follows.
A Stronger Version (~75 words)

"I'm an AI/ML engineer with 13 years of experience, the last six focused on production LLM systems. At Accenture I led an enterprise agentic analytics platform — text-to-SQL, RAG, visualization, and narrative generation — which reduced analyst report turnaround from hours to minutes for multiple clients. Since April I've been at IBM expanding into multi-agent architectures. I'm looking for a role where I can take more ownership at the system-design level."

Concise, positioned, impactful — and not a single qualifier.

02 Bias-Variance Trade-off Good — Minor Gaps
What You Did

Conceptually solid. You correctly mapped bias to underfitting and variance to overfitting, stated the error patterns accurately (high train + test error vs. low train / high test), and laid out sensible mitigation strategies. Connecting regularization to weight magnitude was correct.

What Was Missing
  • The formal decomposition. At senior level, the expected anchor is: Total Error = Bias² + Variance + Irreducible Noise. The irreducible noise term matters — it defines the floor no model can beat, regardless of complexity.
  • The complexity curve. As model complexity increases, bias falls monotonically and variance rises — the optimal model sits at their sum's minimum. Describing this curve shows architectural thinking, not just definition recall.
  • Cross-validation as the empirical tool for navigating the trade-off was absent. This is how the trade-off is actually managed in practice.
  • Regularization specificity. "High regularization → low weights" is correct for L2 (Ridge), but L1 (Lasso) produces sparse solutions by driving weak feature weights to exactly zero — a meaningfully different mechanism worth distinguishing.
What to Add Next Time

"Expected error decomposes as Bias² + Variance + Irreducible Noise — we can shrink the first two, but not the third. The trade-off is visible in complexity curves: as you add parameters, bias falls and variance rises; the sweet spot is their sum's minimum. I use k-fold cross-validation to find it empirically — when training and validation error diverge, variance is showing up. For regularization: L2 shrinks all weights toward zero, L1 drives sparse solutions by zeroing out weak features entirely."

03 Attention Mechanism Surface-Level
What You Did

The co-reference example — "the cat sat on the mat because it was tired" — was well-chosen and demonstrated genuine intuition about what attention accomplishes. Describing it as a token-relevance matrix is directionally correct at a conceptual level. Good instinct; incomplete mechanism.

What Was Missing
  • Query, Key, Value vectors — the actual mechanism — were entirely absent. These are what interviewers are testing for when they ask this question.
  • The formula: Attention(Q,K,V) = softmax(QKáµ€ / √dâ‚–) · V. Knowing this cold is a seniority signal.
  • Scaling by √dâ‚–. Without scaling, dot products in high dimensions grow large, pushing softmax into saturated zones with near-zero gradients — training becomes unstable. Knowing why the scale factor exists distinguishes an implementer from an architect.
  • Multi-head attention. Running H attention heads in parallel lets the model capture different relationship types simultaneously — syntactic dependencies, semantic similarity, co-reference, positional patterns. Each head learns its own Q/K/V projections.
A Stronger Version

"Each token gets projected into three vectors: a Query (what am I looking for?), a Key (what do I offer others?), and a Value (what information do I carry?). We take the dot product of each token's Query with all Keys, scale by √dâ‚– to prevent softmax saturation in high dimensions, apply softmax to get normalized attention weights, then sum the Values weighted by those scores. In multi-head attention, this runs in parallel across H independent heads — each learning to attend to different relationship types. Your cat/mat example is precisely what a co-reference head learns to resolve."

04 Feed-Forward Network in Transformer Factual Error
What You Said

The FFN "is basically used to get the probabilities… uses a softmax layer as the activation step." You acknowledged uncertainty and moved on.

What Is Actually Wrong

This is a clear factual error — the most consequential one in the interview. The FFN inside each transformer block does not produce token probabilities. Softmax over the vocabulary appears exactly once in a transformer: at the final output layer (the language model head), after all transformer blocks have run. Confusing these two suggests the internal architecture is understood narratively but not mechanically.

The FFN within each block has an entirely different job: it is a position-wise, two-layer fully connected network applied independently to each token after the attention sub-layer. Its architecture is FFN(x) = max(0, xW₁ + b₁)W₂ + b₂ (ReLU or GELU activation). It projects to a much larger intermediate dimension — typically 4× the model dimension — and back. Its purpose is to apply non-linear, token-level transformations that give the model representational capacity that pure attention cannot provide, since attention is essentially a weighted sum (a linear operation). The FFN is where the model stores "factual" associations — recent mechanistic interpretability research has shown individual FFN neurons activate for specific semantic patterns.

This question is frequently used to probe precisely this gap: candidates who know transformers at the narrative level reliably confuse the FFN with the output head.

What to Say Next Time

"After the attention sub-layer, each token passes through a position-wise feed-forward network — same weights applied independently at every position. It's two linear layers with a ReLU or GELU in between, and it typically expands to 4× the model dimension before projecting back. Its job is to add non-linearity: attention is essentially a weighted sum, so it's linear — the FFN is where you get the representational capacity to encode complex patterns. The softmax over the vocabulary is completely separate — it's in the final output head, applied once after all transformer blocks."

05 Random Forest Strong — Precision Gaps
What You Did

Best technical answer of the interview. You correctly identified the ensemble motivation (reducing single-tree overfitting), named both sources of randomness accurately, explained hard vs. soft voting, and tied the design back to the generalization goal. Structured and self-consistent at the conceptual level — you even caught yourself and checked you had answered the full question. That metacognition is a positive signal.

What Would Sharpen It Further
  • Bootstrap sampling = sampling with replacement. Each tree trains on a bootstrap sample of ~63% unique observations. The remaining ~37% are Out-of-Bag (OOB) samples — and this matters for the next point.
  • OOB error estimation. The ~37% of data not seen by each tree can be used to estimate generalization error without a separate validation set — a native property of random forests that is practically very useful. Naming this demonstrates depth.
  • Feature selection is per-split, not just per tree. At every node split, only a random subset of features is considered (controlled by max_features). This is the primary driver of tree diversity — it is what allows trees trained on similar bootstrap samples to still diverge structurally.
  • Feature importance. Random forests produce variable importance scores via mean decrease in Gini impurity across trees — a useful output for feature selection that is worth mentioning as a practical benefit.
06 Fine-Tuning vs. RAG Good Intuition, Weak Framework
What You Did

Definitions were accurate and grounded in real experience. The text-to-SQL PostgreSQL example was vivid and clearly drawn from production work — this is exactly the kind of concrete illustration that builds credibility. Framing the two as complementary (not competing) tools was a genuinely sophisticated view that many candidates miss.

What Was Missing
  • You deflected the actual question. "Why choose RAG over fine-tuning?" was answered with "it depends on the use case" — then the use case definitions. That is a non-answer. Senior candidates give the decision criteria directly.
  • Fine-tuning does not add new factual knowledge. This is a critical distinction that should have been stated explicitly: fine-tuning changes model behavior and style; the model can still hallucinate facts it was not exposed to during pre-training. RAG is the correct tool when factual accuracy and grounding are the primary requirements.
  • Data dynamism. RAG is far better suited to frequently updated knowledge bases — you cannot retrain a model every time a policy document changes. Fine-tuning assumes stable domain knowledge.
  • Latency and cost trade-offs. RAG adds retrieval overhead per inference; fine-tuning has a one-time training cost but cleaner inference latency at scale. This is a practical architectural consideration worth naming.
A Stronger Decision-Framework Answer

"Choose RAG when knowledge changes frequently — you can't retrain every time a policy or document updates; when factual accuracy is critical and you need traceable source citations; or when data sensitivity prevents sharing training data with an external fine-tuning API. Choose fine-tuning when you need consistent output format or domain behavior — like always producing valid PostgreSQL without prompt scaffolding — and your training data is stable and high quality. The key insight is that fine-tuning changes behavior, not knowledge — the model can still hallucinate facts it never saw. In production I'd often combine them: fine-tune for output discipline, RAG for dynamic factual grounding."

Section 03

How to Improve From Here

Six targeted actions, ordered by impact. The first two are not technical — they are the highest-leverage changes you can make before the next interview.

1
Rewrite and rehearse your self-introduction

Write a 75-to-90-word version: one positioning thesis, one flagship project with one metric, what you are looking for next. Deliver it out loud every day for two weeks until it is effortless — not memorized, but automatic. An interview's first 60 seconds set the credibility frame for every technical answer that follows. You cannot recover a weak opening by being technically strong later.

2
Eliminate weak qualifiers — permanently

Record yourself answering two practice questions and flag every "pretty good," "pretty familiar," "I think," and "I'm not sure but." Replace each with either a confident claim or a precise limitation: "I've built production RAG pipelines" or "I haven't used Pinecone at scale — my production experience is FAISS and Azure AI Search." Hedged confidence reads as incompetence; confident precision reads as seniority.

3
Master transformer internals — Q/K/V, FFN, and output head

This is a hard gap with a known fix. Spend three focused hours with the "Attention Is All You Need" paper and Andrej Karpathy's "Let's build GPT from scratch" video. Know the scaled dot-product attention formula cold. Know exactly what the FFN does and why it is architecturally separate from the output head. These three components — attention sub-layer, FFN, language model head — are among the most consistently probed questions for any senior AI/ML role.

4
Build decision frameworks, not just definitions

For every "X vs. Y" topic in your prep list, write a three-part framework: (a) what each optimizes for, (b) the key decision criteria, (c) when to combine them. RAG vs. fine-tuning, bias vs. variance controls, bagging vs. boosting — each needs a framework, not a definition. When an interviewer asks "why choose X?", they are testing your architectural judgment. "It depends" is not an answer; the decision criteria are the answer.

5
Extract and memorize five project metrics

Return to your Accenture project and excavate real or conservative estimates for each capability you built: latency reduction, user adoption, error rate, time-to-insight gain, or cost savings. Find at least one metric per capability (text-to-SQL, RAG, visualization, narrative writing). Weave one number naturally into every project-related answer. Metrics are credibility anchors — they transform abstract experience into verifiable engineering.

6
Invest in terminological precision, not broader breadth

You have solid breadth — that is not the gap. What is missing is precision at depth: bootstrap sampling with replacement vs. random subsampling, OOB error estimation, FFN vs. output head, L1 vs. L2 regularization mechanics. Create a one-page "precision glossary" covering your core topics and review it the morning before any interview. One precisely deployed term signals more seniority than three approximate descriptions.

This report is a structured critique based on a reconstructed transcript. All assessments are intended for professional development purposes only.


Index For Interviews Preparation    « Previously    Next »

Tuesday, June 23, 2026

Interview at Dentsu for Lead AI Engineer Role (2026 Jun 18)

Index For Interviews Preparation    « Previously

Interview Reconstruction & Critical Analysis

Lead AI Engineer — Interview Report

Candidate: Ashish
Interviewer: Swaroop
Format: Video Call
Total Topics: 13
Section I

Organized Candidate Transcript

T-01  ·  Self Introduction
13 years of experience: 11 in AI/ML, 2 in software engineering. Progression from scikit-learn, TensorFlow, PyTorch (traditional ML) through generative AI to agentic systems (LangGraph, CrewAI, AI Suite). Last project was a Text2SQL-based AI capability suite — deployed as Network Engineering Assistant (telecom), Business Intelligence via NLQ (healthcare), Data Analytics Platform (telco), and AOBI — comprising four capabilities: Text2SQL generation, RAG, a generic fallback bot, and a visualization + narrator agent, all routed by an orchestrating agent.
T-02  ·  Career History & IBM Tenure
Confirmed 7 companies in 13 years. Currently at IBM since 24th April; serving notice until 29th June. Stated reason: was promised a project, client did not agree, was moved to CSR work which felt stressful, decided to seek outside opportunities. No offer currently in hand.
T-03  ·  Cloud Technologies
Has Azure familiarity; has not specifically used Azure AI Foundry or GCP Vertex AI. Explained that in most use cases, LLMs are accessed via API directly. When pressed on Azure resources: Azure Functions / App Services / AKS for backend; Azure PostgreSQL for database; Azure Blob Storage for RAG document storage; Azure DevOps for project management; Azure AI Search for vector retrieval; Azure Databricks for data integrations.
T-04  ·  Multi-Agent Frameworks
Has worked with LangGraph and CrewAI (two.ai). LangGraph preference over cloud ADKs argued on four grounds: (1) stateful graph model with shared state across agent nodes vs. LangChain's sequential chains; (2) company platform (AI Refinery) was already built on LangGraph; (3) architectural fit — LangGraph suits defined workflows, CrewAI/Autogen suits autonomous multi-agent swarms; (4) client preferences on open-source, budget, and deployment environment.
T-05  ·  Figma AI
Has not personally used Figma AI. Identified it as a design, editing, and animation tool. Stated willingness to learn quickly if required.
T-06  ·  LLM Cost Reduction (50%)
Initial answers: downgrade model variant (nano/mini), switch to cheaper provider (DeepSeek), use self-hosted LLMs. Raised fine-tuning but noted it increases upfront cost. After interviewer prompt: acknowledged that reducing token count per request — by embedding repetitive behavioral instructions into a fine-tuned model — directly reduces cost at scale.
T-07  ·  Governance for a $10K Claims Approval Agent
Proposed three governance layers: (1) Guardrails — PII/confidential data protection, output formatting validation; (2) Human-in-the-loop — review node in the workflow; (3) Monitoring — accuracy tracking over time, detection of data drift (e.g., claim value thresholds shifting), logging to database or log files.
T-08  ·  Azure AI Search vs. Custom RAG Stack
Offered cost, non-Azure client deployment, and open-source preference as reasons to build custom. Could not independently reach the architecture-level answer. Interviewer supplied it: the need for advanced hybrid retrieval customization, domain-specific re-ranking, and large-scale knowledge indexing strategies beyond managed vector search capabilities.
T-09  ·  When NOT to Use Agentic AI
Four examples: (1) Autonomous coding agents that can corrupt codebases; (2) Defense — autonomous lethal weapons raise accountability questions; (3) Medicine — robotic surgery still needs human oversight; (4) Legal — courts and juries should not be autonomous agents. Framed as "how far can we take agentic AI" rather than engineering trade-off scenarios.
T-10  ·  RAG Pipeline Degradation at Scale
Approach: collect erroneous production cases, sample 100, perform manual error analysis, identify most frequent failure type (prompt/context/hallucination/logic), prioritize, trace to responsible pipeline component, build a golden dataset, iterate. Interviewer redirected to architecture-level answer: retrieval metrics analysis, multi-stage retrieval, semantic caching, context optimization, chunking strategy revision, vector DB overload management.
T-11  ·  Coding Experience
Writes 70-80% of code independently; uses documentation or AI assistance for the remaining 20-30%. Primary languages: Python and SQL.
T-12  ·  Candidate Questions
Asked about the project and company. Interviewer explained: AI innovation team delivering POCs across domains — media, creative, ad design, market research, data science — either as short-cycle innovation work or as longer client-embedded engagements (1.8–2.5 years).
Section II

Reconstructed Q&A — Full Interview

Could you start with a brief introduction about yourself?
13 years in AI/ML and software engineering. Built production systems from traditional ML through to agentic AI — LangGraph, CrewAI. Last project: a multi-modal Text2SQL platform deployed across telecom, healthcare, and other enterprise clients under different product names, combining Text2SQL, RAG, a generic fallback bot, and a visualization agent, all orchestrated by a router.
I see you've worked at 7 companies in 13 years — how do you account for that?
Yes, that's correct. [No narrative offered to contextualize the moves.]
You're currently at IBM — what's your tenure, and what's your situation there?
Joined 24th April, serving notice until 29th June. Was promised a project, client declined, was moved to CSR work, which I found stressful. No offer currently in hand.
What's your experience with cloud AI platforms — specifically Azure AI Foundry and GCP Vertex AI?
I have not used those specifically. In my work, we access LLMs directly via API. My Azure experience has been at the service level — AKS, Blob Storage, Azure AI Search, Azure DevOps, Databricks.
Can you walk me through the Azure AI resources you've worked with and how they fit together architecturally?
For backend hosting: Azure Functions, App Services, or AKS. For storage: Azure PostgreSQL and Blob Storage (used Blob for RAG document ingestion). For retrieval: Azure AI Search as vector database. For project management: Azure DevOps. For data integrations: Azure Databricks.
What multi-agent frameworks have you worked with, and why would a developer choose LangGraph over a cloud-native Agent Development Kit?
LangGraph and CrewAI. Four reasons for LangGraph: its stateful graph model is architecturally superior to LangChain's sequential chains; our internal platform (AI Refinery) was built on it; LangGraph fits structured workflows better than autonomous swarms, for which CrewAI/Autogen is preferable; and client preferences on open-source and budget.
Have you worked with Figma AI or Figma-to-HTML use cases?
Not personally. I understand it's for design, editing, and animations. I learn quickly if that's a requirement here.
How would you reduce LLM cost by 50% without reducing answer quality?
Options include downgrading to a lighter model variant after benchmarking, switching to a lower-cost provider like DeepSeek, using self-hosted LLMs, or fine-tuning the model to compress prompt tokens — so repetitive behavioral instructions don't need to be sent on every request.
I have an AI agent approving insurance claims up to $10,000. How would you enforce a governance layer?
Three layers: guardrails for PII protection and output formatting/validation; human-in-the-loop as a review node for edge cases; and monitoring for accuracy drift, data drift, and full audit logging.
Under what circumstances would you avoid Azure AI Search — or GCP/AWS equivalents — and build your own custom RAG stack?
Cost, client's non-Azure deployment requirement, or preference for open-source tooling. [Could not independently arrive at the retrieval-architecture reason; interviewer provided: advanced hybrid search customization and domain-specific re-ranking needs.]
You're being hired for a senior agentic AI role. When would you tell a client NOT to use an agentic AI solution? Give me real-world examples.
Autonomous coding agents that modify codebases without oversight. Autonomous defense weapons with no human accountability. Robotic surgery without a human in the room. Autonomous legal representation in courtrooms. [Framed as societal/ethical limits rather than engineering trade-offs.]
A customer support RAG system performed at 90%+ accuracy at pilot (500–1,000 users), but at 50,000–1 lakh users: hallucinations increased, latency rose from 3s to 15s, retrieval quality dropped, and LLM cost increased 8–10x. As lead architect, how do you diagnose and redesign?
Collect erroneous production instances, sample 100, manually classify failure type (prompt / context / hallucination / logic), identify the most frequent failure, trace it to the responsible pipeline component, build a golden dataset, and iterate. [Interviewer redirected: should have started with retrieval metrics, multi-stage retrieval, semantic caching, chunking strategy, and vector DB overload.]
What's your hands-on coding experience? Do you write from scratch or rely on AI assistance?
I write 70–80% of code independently in Python and SQL. The remaining 20–30% I look up in documentation or use AI assistance.
Section III

Critical Analysis & Better Answers

The critique below evaluates each answer against the altitude expected of a Lead / Principal AI Engineer — where the standard is architectural decision-making, not implementation narration.
Q-01 Self Introduction
What Went Wrong

This was a portfolio recitation, not a leadership pitch. The answer listed tools (scikit-learn → PyTorch → LangGraph) and project names without anchoring any of it to outcomes, decisions, or the scale of problems solved. Over two minutes of speaking produced no memorable claim the interviewer could hold onto. The question "why hire you?" was left unanswered.

Better Answer

Lead with impact, not inventory. Open with one high-signal sentence about what you built and why it was hard.

"I'm Ashish — Lead AI Engineer with 13 years of experience, the last four focused on designing production agentic and RAG systems. Most recently I led the architecture for a multi-client Text2SQL platform that processed natural language queries over both structured and unstructured data, deployed across telecom and healthcare enterprises. My M.Tech from BITS Pilani gave me the mathematical foundation; building these systems at production scale gave me everything else."
Q-02 7 Companies in 13 Years
What Went Wrong

Confirming "yes, that's correct" and staying silent is the worst possible response to this pattern. It leaves the interviewer to fill the silence with skepticism. No career arc was offered, no intentionality was demonstrated.

Better Answer

Pre-empt with a story. Name the turning point. Show that recent moves were deliberate, not reactive.

"The first few were early-career exploration until I found my domain in AI around 2015-16. Since then, every move has been toward increasing ownership of AI architecture. My IBM role is an exception — I've been bench-allocated and I've decided not to wait. Every other move has been a promotion of scope."
Q-03 Leaving IBM / No Offer in Hand
What Went Wrong

Volunteering "no offer in hand" weakens every subsequent negotiation. Explaining departure as "CSR work was stressful" frames the move as running away from a problem rather than toward an opportunity. Both facts, while honest, cost leverage.

Better Answer

Keep the reason forward-looking and factual without over-sharing.

"At IBM I've been on the bench — the client engagement I was being prepared for didn't materialize. Rather than wait indefinitely for allocation, I decided to be proactive. I'm looking for a role with genuine architectural ownership from day one, which is what this position seems to offer."
Q-04 Azure AI Resources — Architectural Walkthrough
What Went Wrong

The answer was a laundry list of Azure service names without any architectural rationale. "We used AKS" is not a decision — the decision is why AKS over Azure Functions for this workload, or why Azure AI Search over a standalone Pinecone deployment. The interviewer was testing architectural reasoning, not Azure documentation recall.

Better Answer

Organize the answer as a layered architecture with at least one explicit trade-off decision at each layer.

"Our stack had four layers: compute (AKS for the FastAPI orchestration layer — chosen over Azure Functions because our LangGraph workflows exceeded Function timeout limits); storage (Blob for raw documents, PostgreSQL for structured client data); retrieval (Azure AI Search as the vector index — we chose it over standalone Pinecone because of native Azure AD integration and data residency compliance); and observability (Azure Monitor plus custom logging). The hardest call was Azure OpenAI vs. direct OpenAI API — we chose Azure OpenAI for the enterprise data security guarantees."
Q-05 LangGraph vs. Cloud-Native ADKs
What Went Wrong

This was one of the stronger answers — four structured reasons, a genuine architectural insight about stateful graphs vs. sequential chains. Minor issue: the LangChain history detour consumed too much time before reaching the core insight. The answer also buried the most important reason (stateful graph model) after the less important ones.

Better Answer

Lead with the architectural differentiator, then supporting context.

"LangGraph's defining advantage is its stateful graph model — each node is an agent with read/write access to a shared state object, which makes conditional flows, human-in-the-loop interrupts, and multi-step tool-use tractable in a way LangChain's sequential chains never were. Cloud ADKs are tightly coupled to their vendor's runtime and observability stack, which matters when clients need cloud-neutral contracts or full auditability. For fully autonomous multi-agent swarms with back-and-forth communication, CrewAI or AutoGen is the stronger fit — LangGraph excels when the control flow is known and needs to be deterministic."
Q-06 LLM Cost Reduction by 50%
What Went Wrong

The first three answers — model downgrade, provider switch, self-hosting — are procurement and operations decisions, not engineering solutions. Fine-tuning was raised but immediately self-contradicted ("it elevates cost"). The correct engineering answer — semantic caching, prompt compression, intelligent model routing — only emerged after the interviewer had to prompt for it. An architect should have led with the highest-leverage technical levers.

Better Answer "The fastest path to 50% cost reduction in a production RAG system is semantic caching — store embedding-matched responses and serve cache hits for semantically similar queries. In high-volume customer support, 30–50% of queries cluster around the same 20–30 canonical intents. Second: prompt compression via fine-tuning — if your system prompt is 800 tokens of behavioral instructions repeated on every request, one fine-tuning run amortizes that cost across millions of calls. Third: intelligent model routing — classify query complexity and route straightforward queries to a mini/nano model, reserving the full model for ambiguous or multi-step reasoning. Model-switching and self-hosting are last-resort levers because they introduce latency and reliability risk that typically outweigh the savings at moderate scale."
Q-07 Governance for a $10K Claims Approval Agent
What Went Wrong

The answer wandered into PII guardrails and data formatting before reaching the core of the question. Governance for a financial approval agent is fundamentally about accountability, auditability, and decision boundaries — none of which were named clearly. Missing entirely: approval thresholds with hard business rules, explainability requirements, role-based escalation paths, and regulatory compliance dimensions.

Better Answer "Three layers. First, hard decision boundaries: the agent cannot override certain rules regardless of model confidence — e.g., any claim above $8,000 must pass through a human review node. Second, full auditability: every approval decision must be logged with the input claim, retrieved evidence, the agent's reasoning chain, and the output — to satisfy SOC 2 and internal audit requirements. Third, drift monitoring: track the approval rate distribution over rolling windows. If the model begins approving 92% of claims where the historical baseline is 75%, that's a signal of data drift, prompt injection, or adversarial input — and should trigger automatic hold and review. I'd also add a confidence gate: structured output with a confidence score below a threshold automatically routes to human review."
Q-08 Azure AI Search vs. Custom RAG Stack
What Went Wrong

All three answers (cost, cloud preference, open-source vs. closed-source) are procurement/policy decisions — none are engineering reasons. The interviewer had to provide the answer directly. This was a significant gap: a Lead AI Engineer should know immediately that the architectural reason to build custom is when retrieval quality is the primary differentiator and managed vector search doesn't support the required retrieval strategies.

Better Answer "The core engineering reason to build custom is when your retrieval logic exceeds what managed vector search can configure. Azure AI Search offers hybrid dense+sparse search with configurable blend weights — that covers most enterprise RAG cases. But if you need late-interaction re-ranking (ColBERT trained on your domain corpus), multi-hop iterative query refinement, or a custom scoring function incorporating document freshness, entity salience, and proprietary business rules — none of that is configurable in a managed service. You build custom when retrieval quality is your primary differentiator and the algorithm is part of the IP."
Q-09 When NOT to Use Agentic AI — Real-World Examples
What Went Wrong

The examples given (autonomous weapons, robotic surgery, courtroom lawyers) are philosophical and societal — macro-level ethical questions about AI in civilization, not engineering trade-off decisions. The interviewer explicitly said "I'm looking for real-world examples from a senior agentic AI position," which means: scenarios where you would tell an engineering client that agentic AI is the wrong technical choice for their specific problem.

Better Answer "Agentic AI is the wrong choice when: (1) the problem is fully deterministic and bounded — a tax calculation engine or a document format validator; agents introduce latency, cost, and non-determinism where a decision tree is faster, cheaper, and auditable. (2) Sub-100ms latency is required — real-time fraud scoring at transaction time cannot have an LLM in the critical path. (3) High-frequency, low-complexity classification — routing 10,000 support tickets per hour to the right team is a fine-tuned classifier problem, not an agent problem. (4) Regulated determinism — a medical device calculating drug dosage requires certified, deterministic software, not probabilistic inference. The general principle: if the solution space is fully specified and bounded, deterministic code wins on every dimension. Agents earn their cost when the task requires open-ended reasoning, multi-step planning, or natural language understanding."
Q-10 RAG Pipeline Degradation at Scale (3s → 15s latency, 8-10x cost)
What Went Wrong

The answer addressed the problem from a QA / data-science perspective (error sampling, error classification, golden dataset) rather than an infrastructure and retrieval architecture perspective. At a Lead Architect level, the first question is not "which errors are most common?" — it's "which layer is the bottleneck?" The interviewer had to intervene and supply the correct altitude: retrieval metrics, multi-stage retrieval, semantic caching, chunking, vector DB overload.

Better Answer "I'd diagnose in layers. Latency (3s → 15s): separate the LLM call latency from the vector DB query latency using distributed tracing. At 50x user scale, the vector index is almost certainly the bottleneck — crowded ANN indices degrade query time. Fix: switch from flat index to HNSW, add read replicas, or shard the index. Retrieval quality: measure recall@5 and MRR at current scale. Crowded indices push relevant chunks below the top-k cutoff. Fix: add a cross-encoder re-ranker as a second retrieval stage, tighten similarity thresholds, and revisit chunk size — smaller, denser chunks improve precision at scale. Cost (8-10x): almost always points to zero caching. Implement semantic caching with a similarity threshold — most support queries cluster around 20-30 intents. Hallucinations: typically a symptom of poor context — the LLM is receiving irrelevant chunks. Tightening retrieval and adding metadata filters usually resolves this without touching the LLM at all."
Overall Assessment

The interview demonstrated genuine hands-on depth — the LangGraph answer was well-structured, the governance and monitoring instincts were sound, and the coding honesty was appropriate. The recurring failure pattern, however, was consistent: answers were delivered at the altitude of an engineer describing what was built, not an architect explaining why decisions were made.

The interviewer explicitly redirected on Q-06 (cost reduction), Q-08 (custom RAG), Q-09 (anti-patterns), and Q-10 (scaling diagnosis) — all four times asking for architecture-level reasoning and receiving implementation-level narration. This pattern is the single most important thing to correct before the next interview.

The practical fix: for every technical question, answer the "why this, not that" question before the "what we built" question. Lead with the decision and the trade-off, not the outcome.

✓ Strong — LangGraph rationale ✓ Strong — Monitoring & governance instincts ✓ Strong — Coding honesty ✗ Weak — Cost reduction (missed caching) ✗ Weak — Custom RAG trade-offs ✗ Weak — Scale diagnosis altitude △ Mid — Introduction (no impact framing) △ Mid — Career narrative (no story)

Index For Interviews Preparation    « Previously
Tags: Interview Preparation,

Kant's Trap: Why Everything Is Never Enough

See All Articles


5 Key Takeaways

  • Getting everything we want does not guarantee lasting fulfillment because human desires are endlessly expanding.
  • Greed creates a cycle where satisfaction is always one more acquisition away, never truly achieved.
  • True happiness stems from gratitude, purpose, and meaningful relationships, not from external rewards.
  • Self-reliance and an internal moral compass protect against being controlled by circumstances and endless wanting.
  • Contentment is a deliberate practice that requires auditing priorities and recognizing what is genuinely enough.



This HTML delivers a longform blog post centered on Immanuel Kant’s quote about desire and fulfillment, styled as a polished, typographic reading experience without any external dependencies. ```html
Give a man everything he wants and at that moment everything is not everything.
Immanuel Kant 1724 – 1804

There’s a quiet trap hidden inside every major success story, and the 18th-century philosopher Immanuel Kant captured it in one sentence. At first glance it reads like a riddle. Spend a moment with it, however, and it unpacks something unsettling about human nature — the realisation that getting exactly what we chase rarely delivers the permanent fulfilment we imagined. This single line, recently highlighted as a quote of the day, unpacks deep lessons about greed, contentment, self-reliance and the architecture of genuine happiness.

Who Was Kant?

Immanuel Kant was born on April 22, 1724, in Königsberg, Prussia, a Baltic port city that today is Kaliningrad, Russia. Over a quiet, rigorously disciplined life that ended on February 12, 1804, he reshaped Western philosophy. His three towering works — Critique of Pure Reason, Critique of Practical Reason and Critique of Judgment — built a system centred on reason, moral responsibility and the limits of human knowledge. Kant never chased headlines, yet his thoughts on ethics and autonomy have echoed for more than two centuries. The quote about having everything is less a clever aphorism and more a compressed summary of his broader warning: external rewards alone cannot anchor a well-lived life.

To understand what Kant means, it helps to pull the sentence apart. The first half — Give a man everything he wants — sounds like a fantasy. Imagine a person handed perfect health, unlimited wealth, status, admiration, every possession they desire. In that moment, Kant says, everything is not everything. He is arguing that the very idea of “everything” is a moving target. Human desire works like an escalator. Step off at one floor and a new staircase of wants immediately unfolds. The promotion that was supposed to bring lasting pride suddenly feels ordinary; the dream car becomes just a vehicle; the admiration of others demands constant topping up. Satisfaction, in this framing, is not a finish line but a fleeting way-station.

The Engine of Greed

This observation leads directly to the problem of greed. Greed is not simply wanting more. It is the engine that convinces us each new acquisition will plug the gap left by the last one. And the engine never switches off. People assume that a higher salary or a bigger house will silence the inner voice demanding more. What they often discover is that once those things arrive, expectations recalibrate. A raise becomes the new normal, a larger home produces new furnishings to buy. Kant’s quote pulls back the curtain on this cycle. It does not say desire is bad. It says that treating the removal of all desire as the route to peace is a fundamental misunderstanding of how the mind works. Peace of mind does not come from the pile of things accumulated; it comes from learning when the pile is enough.

Satisfaction and the Roots of Happiness

That distinction is where the connection between satisfaction and happiness sharpens. Satisfaction is the ability to look at what is already on the table and feel nourished by it. Happiness, in turn, often grows out of that soil, not out of the next delivery at the door. Kant’s words invite a mental check: if happiness were purely the sum of achieved desires, every billionaire would be blissful and every title-holder permanently serene. Reality tells a different story. Emotional well-being leans heavily on gratitude, purpose and meaningful relationships. It draws from experiences that cannot be added to a shopping cart. When Kant says everything is not everything, he is urging people not to mistake the menu for the meal. External rewards can delight, but they cannot substitute for a sense of direction or the quiet satisfaction of valuing what is already present.

A Call for Self-Reliance

Implicit in this argument is a call for self-reliance. Kant was a fierce advocate of personal responsibility and the use of one’s own reason. If happiness depended solely on things the world gives or takes away — money, praise, status — then a person would be a hostage to circumstances. Self-reliance flips that. It means building an internal compass: developing confidence, skills and moral values that remain steady even when the world withholds applause. A person anchored by purpose can weather a storm of unmet wants without capsizing. Kant is not pushing an extreme of isolation or self-sufficiency. He is pointing out that when people look inside for fulfilment, they are less likely to be crushed by the endless hunger for more.


Countercultural Wisdom for the 21st Century

That message feels almost countercultural in the 21st century. Modern life is a torrent of comparison. Social media feeds serve an infinite scroll of other people’s achievements, vacations, promotions and perfectly lit living rooms. Advertising primes the brain to see existing possessions as slightly shabby and one purchase away from satisfaction. Competitive workplaces turn status into a ladder with no top rung. Kant, writing long before the invention of the smartphone, already diagnosed the illness: the belief that one more win will make everything whole. His quote challenges that mindset by asking a blunt question — if you could have it all right now, would the yearning actually stop? For most, the honest answer is no. That recognition pushes people toward contentment as a deliberate practice, not a default state. It asks them to audit their priorities and decide what is genuinely enough.

The Man Behind the Words

Understanding who Kant was adds weight to the lesson. He did not arrive at this view through a life of luxury. He lived modestly, kept such a predictable routine that neighbours were said to set their clocks by his afternoon walk, and devoted his energy to untangling the architecture of knowledge and morality. His philosophy insisted that humans are not just bundles of appetites. They possess reason, and with it the capacity to reflect on their desires rather than simply obey them. This is the intellectual backbone of the everything is not everything idea. Reason gives people the ability to step back, see the moving target for what it is, and choose a different measure of success.

Practical Lessons

The practical lessons from the quote are simple to list and harder to master:

  • Happiness does not come only from getting what we want.
  • Greed can create a loop where satisfaction is always one acquisition away, and that loop never closes.
  • Contentment is not a luxury reserved for people with fewer ambitions; it is a skill that protects mental well-being.
  • Success is hollow if it omits personal growth and a sense of purpose.
  • Self-reliance acts as a buffer, helping people find fulfilment that is not glued to material rewards.
  • Gratitude is a stronger foundation than accumulation.
  • Genuine happiness tends to emerge from balance — the ability to strive without being enslaved by striving, to enjoy rewards without mistaking them for identity.

Kant left behind a vast philosophical system, but sometimes a single sentence carries the force of a whole lecture. Give a man everything he wants and at that moment everything is not everything is not a counsel of despair. It is an invitation to stop running long enough to notice where the road actually leads. The pursuit of goals is not wrong. The danger lies in believing that the pursuit ends at the next trophy. By recognising that human wants are elastic, people can step off the escalator and find that they already have a great deal of what matters. That shift in perspective is what Kant’s words continue to offer — a compass pointing away from the mirage of total acquisition and toward the more solid ground of contentment, meaning and inner steadiness.


Read more