Skip to main content
Knowledge Hub · Give Back Initiative

HUB_STATUS: OPERATIONAL // 20_YRS_OF_KNOWLEDGE · FREE_ACCESS

Two Decades of Engineering Knowledge,Given Back. For Free.

Thousands of interview questions, real-world errors with root-cause solutions, reusable code archives, and structured learning paths — built through 20 years of actual engineering.

One lamp can light a hundred more without losing its own flame. This knowledge hub is not a product. It is not a funnel. It is a contribution — to every developer who once searched alone at 2 AM for an answer that did not exist anywhere on the internet. It exists now. Here.

"A lamp loses nothing by lighting another lamp. This is why this knowledge exists — not to be held, but to be shared."
— Debasis Bhattacharjee
3,500+
Interview Questions

Across 18 languages & frameworks

1,200+
Debug Solutions

Real errors. Root-cause fixes.

800+
Code Snippets

Copy-paste ready. Production tested.

24
Learning Paths

Beginner → Advanced, structured

Section IV · Knowledge Domains

DOMAINS_MAPPED // PHP · JS · PYTHON · AI · SECURITY · ARCHITECTURE

Explore the Ecosystem

View All Domains →
01 · DOMAIN
Interview Questions

Categorized by language, role, and difficulty. From junior to architect-level. With curated model answers built from real hiring experience.

3,500+ questions Explore →
02 · DOMAIN
Error & Debug Archive

Searchable archive of real runtime errors, stack traces, and exceptions — each with root cause analysis and tested fix. Like Stack Overflow, but curated.

1,200+ solutions Explore →
03 · DOMAIN
Code Snippet Library

Reusable, production-tested code patterns across PHP, Python, JavaScript, VB.NET, SQL and more. No fluff — just working implementations.

800+ snippets Explore →
04 · DOMAIN
System Design Notes

Architecture patterns, design principles, scalability thinking, and real-world system breakdowns explained from an engineer who has built them.

150+ case studies Explore →
05 · DOMAIN
Learning Paths

Structured progression from beginner to professional — curriculum-style roadmaps with sequenced topics, milestones, and recommended resources.

24 paths Explore →
06 · DOMAIN
Security & Ethical Hacking

Penetration testing concepts, vulnerability patterns, OWASP deep dives, and defensive coding practices drawn from real security consulting work.

200+ topics Explore →
Section V · Interview Preparation

INTERVIEW_PREP: ACTIVE // JUNIOR · MID · SENIOR · ARCHITECT

Questions & Answers

All 54 Questions →
Q·011 What is the difference between zero-shot few-shot and fine-tuned models in production?
AI Integration AI Integration Advanced

Zero-shot uses the base model with only instructions (no examples). Few-shot includes examples in the prompt. Fine-tuned models are retrained on domain data. The tradeoff is cost and flexibility versus consistency and performance.

Deep Dive: Zero-shot: just the task description in the prompt. Relies entirely on the model's pretraining. Fast to deploy requires no labeled data. Performance varies by task complexity. Best for: common well-defined tasks (summarization translation sentiment). Few-shot: include 3-10 task examples in the prompt. Dramatically improves consistency and format adherence. Cost: larger prompts = more tokens per call. Performance ceiling limited by context window and what can be communicated via examples. Best for: uncommon tasks new formats specific style requirements. Fine-tuned: domain-specific retraining. Bakes behavior into model weights instead of prompt tokens. Shorter prompts lower inference cost better consistency on trained tasks. Requires labeled data (minimum 100-1000 high-quality examples) compute for training. Not updatable without retraining. Best for: consistent structured output domain-specific terminology and behaviors classification with specific categories.

Real-World: A legal clause extraction system evolution: zero-shot (78% accuracy) → few-shot with 5 examples (86% accuracy) → few-shot with 20 examples (89% accuracy) → fine-tuned on 3000 examples (96% accuracy lower latency lower cost per call). Each step required more investment but delivered better ROI at the production volume they were operating at.

⚠ Common Mistakes: Jumping to fine-tuning before exhausting prompt engineering (expensive and inflexible). Using few-shot examples that are low quality or inconsistent — few-shot examples teach the model a behavior; bad examples teach bad behavior. Not measuring whether the performance gain justifies the cost of fine-tuning. Fine-tuning on a narrow task and breaking general capabilities (catastrophic forgetting).

🏭 Production Scenario: A startup building a document AI product started with zero-shot (fast prototype) discovered insufficient performance moved to few-shot (8 examples in prompt fixed 70% of failures) then fine-tuned only their highest-volume document type (processing 100K documents/month — fine-tuning ROI was clear) while keeping few-shot for lower-volume types. This staged approach minimized cost while maximizing quality where it mattered.

Follow-up questions: How do you measure whether fine-tuning improved over few-shot? What is instruction tuning and how is it different from task-specific fine-tuning? What is RLHF and how does it relate to fine-tuning?

// ID: AI-ADV-004  ·  DIFFICULTY: 7/10  ·  ★★★★★★★☆☆☆

Q·012 How does Retrieval-Augmented Generation (RAG) work and what are its main failure modes?
AI Integration AI Integration Advanced

RAG retrieves relevant documents from a vector database using semantic similarity search injects them into the LLM context and generates a response grounded in the retrieved content. Main failure modes are retrieval failures context window overflow and hallucinations about retrieved content.

Deep Dive: RAG has three main components: indexing (documents are chunked embedded using an embedding model and stored in a vector database like Pinecone Weaviate or pgvector) retrieval (the user query is embedded and semantically similar chunks are retrieved using approximate nearest neighbor search) and generation (retrieved chunks are inserted into the LLM prompt as context and the model generates a response). Key design decisions: chunk size (too small loses context too large wastes context window and dilutes relevance) embedding model choice number of retrieved chunks (k) whether to use reranking to improve retrieved chunk ordering and metadata filtering to constrain retrieval. Advanced patterns include hybrid search (semantic + keyword/BM25) HyDE (hypothetical document embeddings) and multi-hop retrieval for complex questions.

Real-World: A legal research assistant RAG system at a law firm used chunk sizes of 512 tokens for case documents. Attorneys complained answers lacked context. Investigation showed important legal reasoning spanned across chunk boundaries. Implementing larger overlapping chunks (1024 tokens with 200 token overlap) and a reranker (Cohere Rerank) improved answer quality significantly.

⚠ Common Mistakes: Chunking documents arbitrarily without considering semantic boundaries (splitting mid-paragraph). Using cosine similarity retrieval without reranking causing less relevant chunks to appear in context and confuse the model. Not handling the case where no relevant documents are retrieved — the model hallucinates instead of saying it does not know. Embedding the entire document instead of chunking exceeding context limits.

🏭 Production Scenario: A production customer support RAG system was giving confidently wrong answers about product return policies. Investigation revealed the retrieval was returning chunks from old policy documents because they had higher semantic similarity scores than newer updates. Implementing date-based metadata filtering to prefer recent documents and adding a retrieval confidence threshold solved the problem.

Follow-up questions: What is the difference between RAG and fine-tuning — when do you use each? What is a vector database and how does HNSW indexing work? What is RAGAS and how do you evaluate a RAG system?

// ID: AI-ADV-001  ·  DIFFICULTY: 8/10  ·  ★★★★★★★★☆☆

Q·013 What is an AI agent and how is it architecturally different from a simple LLM API call?
AI Integration AI Integration Advanced

An AI agent uses an LLM as a reasoning engine to autonomously plan use tools and complete multi-step tasks. Unlike a single LLM call that maps input to output an agent operates in a loop: observe think act observe again — until the task is complete.

Deep Dive: The ReAct pattern (Reason + Act) describes the core agent loop: the LLM receives a task and available tools generates a thought (reasoning about what to do) selects an action (a tool call) receives the observation (tool output) and repeats until producing a final answer. Tools are functions the LLM can invoke: web search code execution database queries API calls file operations. Agent architectures range from simple (single LLM with tools) to complex (multi-agent systems where specialized agents collaborate with a planner/orchestrator agent routing tasks). Key engineering challenges: tool design (tools must have clear descriptions for the LLM to select them correctly) error handling (agents can get stuck in loops or make wrong tool calls) context management (the agent's action history grows and fills the context window) and cost control (multi-step agents can make many API calls).

Real-World: A customer onboarding agent at a SaaS company replaces a 12-step manual process: it receives a new customer email calls the CRM API to create a contact queries the provisioning API to set up an account generates and sends a personalized welcome email creates a Jira ticket for account review and posts a Slack notification to the account manager — all autonomously from a single trigger.

⚠ Common Mistakes: Building agents without observability — impossible to debug why an agent made wrong decisions without logging the full thought-action-observation trace. Not implementing maximum step limits — agents can loop indefinitely on ambiguous tasks. Giving agents too many tools — LLMs struggle to select from large tool sets. Not handling tool failures gracefully in the agent loop.

🏭 Production Scenario: A document processing agent for an insurance company was processing claims autonomously. Without a step limit it entered an infinite loop trying to resolve a document parsing error making 10000 API calls in 8 minutes and generating a $400 API bill before being detected. Implementing a 20-step maximum and exponential backoff on tool errors fixed the runaway behavior.

Follow-up questions: What is the difference between ReAct Plan-and-Execute and Reflexion agent patterns? How do you implement agent memory (short-term vs long-term)? What is LangGraph and how does it implement agent state machines?

// ID: AI-ADV-002  ·  DIFFICULTY: 8/10  ·  ★★★★★★★★☆☆

Q·014 What is fine-tuning an LLM and when should you fine-tune versus use RAG or prompt engineering?
AI Integration AI Integration Advanced

Fine-tuning adjusts the model weights on domain-specific data to internalize knowledge or style. Use it when the task requires consistent behavior style or format the base model cannot achieve through prompting alone. RAG is better for factual grounding; prompt engineering first for most tasks.

Deep Dive: Fine-tuning: continue training a pretrained LLM on a curated dataset of examples in your target format/domain. Changes the model weights permanently for that task. Types: full fine-tuning (expensive updates all parameters) parameter-efficient fine-tuning (PEFT — LoRA QLORA update a small fraction of parameters cheaply). When to fine-tune: consistent output format the base model keeps breaking (code generation with specific conventions) domain-specific style or tone (legal writing medical reports) task-specific behavior patterns (classification schema extraction) or reducing prompt length at inference (baking instructions into the model). When NOT to fine-tune: you need up-to-date information (use RAG) you are still exploring requirements (use prompting first) you have less than 1000 high-quality examples (insufficient for fine-tuning) or the base model already performs the task well with prompting.

Real-World: A financial services company needed an LLM to consistently extract structured data from loan applications into a specific JSON schema. Prompt engineering achieved 78% schema compliance. RAG did not help (the schema was fixed not document-dependent). Fine-tuning with 5000 labeled examples achieved 97% schema compliance with shorter prompts reducing inference cost.

⚠ Common Mistakes: Fine-tuning with low-quality or insufficient examples — produces a model worse than the base model. Fine-tuning when prompt engineering would suffice — expensive and inflexible. Forgetting that fine-tuned models still hallucinate and still need RAG for factual grounding. Not evaluating catastrophic forgetting — fine-tuning on a narrow dataset can degrade performance on general tasks.

🏭 Production Scenario: A customer service company fine-tuned an LLM on 2000 examples of customer conversations expecting it to handle all intents. In production the model lost general language capabilities and failed on intents not well-represented in the training data. Rebuilding with a larger curated dataset (15000 examples across all intents) with proper evaluation resolved the regression.

Follow-up questions: What is LoRA and how does it make fine-tuning parameter-efficient? What is catastrophic forgetting in fine-tuning? How do you create a high-quality fine-tuning dataset?

// ID: AI-ADV-003  ·  DIFFICULTY: 8/10  ·  ★★★★★★★★☆☆

Q·015 How do you evaluate the quality of an LLM-powered application in production?
AI Integration AI Integration Advanced

LLM application quality requires a multi-layered evaluation strategy: offline evals (automated benchmarks using LLM-as-judge) online monitoring (latency cost error rates) and human evaluation for quality calibration. There is no single metric — you need task-specific criteria.

Deep Dive: Evaluation layers: automated offline evals (run test cases through the system compare outputs against reference answers using another LLM as judge — e.g. GPT-4 scoring responses on accuracy relevance groundedness and format compliance) human evaluation (sample of outputs reviewed by domain experts to calibrate the LLM judge and catch systematic failures) production monitoring (latency per-call cost API error rates user feedback signals like thumbs up/down) and A/B testing (compare system versions on real user traffic). RAGAS framework evaluates RAG systems specifically: faithfulness (is the answer grounded in retrieved context?) answer relevancy (does the answer address the question?) context recall and context precision. For agents: task completion rate steps per completion tool error rate and cost per successful task completion.

Real-World: At a legal document AI company: automated evals used a curated set of 500 document-question pairs with reference answers GPT-4 as judge scored faithfulness and accuracy monthly human review by paralegals calibrated the automated judge real-time dashboards showed per-endpoint latency and cost and a thumbs-down button collected user feedback that triggered human review for systematic issues.

⚠ Common Mistakes: Using only automated LLM-as-judge evaluation without human calibration — the judge model has its own biases and blind spots. Not evaluating on adversarial cases (edge cases failure modes). Measuring only technical metrics (latency cost) and not quality metrics. Not separating evaluation of the retrieval step from the generation step in RAG systems.

🏭 Production Scenario: A customer service AI showed consistently positive automated evaluation scores but had a growing volume of user complaints. The disconnect was because the LLM judge was evaluating response quality in isolation while users were frustrated by the system's failure to resolve their issues (task completion rate was not measured). Adding task completion as a primary metric revealed the real problem.

Follow-up questions: What is LLM-as-judge and what are its limitations? What is RAGAS and how do you set it up? How do you A/B test prompt changes safely in production?

// ID: AI-MLO-001  ·  DIFFICULTY: 8/10  ·  ★★★★★★★★☆☆

Showing 5 of 15 questions

Section VI · Error & Debug Archive

DEBUG_ARCHIVE: LIVE // REAL_ERRORS · ANNOTATED_FIXES

Real Errors. Root-Cause Fixes.

All 1,200 Solutions →
PHP ERROR E_FATAL · #DB-001
Undefined variable: $conn — PDO connection not persisted across scope
Fatal error: Uncaught Error: Call to a member function query() on null

Connection object passed by value. Fix: pass by reference or use dependency injection through constructor.

4,200 views Read Fix →
JAVASCRIPT RUNTIME · #JS-044
Cannot read properties of undefined — React state not yet populated on first render
TypeError: Cannot read properties of undefined (reading 'map')

State initialized as undefined, not empty array. Fix: initialize with useState([]) and guard with optional chaining.

7,800 views Read Fix →
SQL ERROR CONSTRAINT · #SQL-019
Foreign key constraint fails on INSERT — parent row not found in referenced table
ERROR 1452: Cannot add or update a child row: a foreign key constraint fails

Insertion order violation. Fix: insert parent record first, or disable FK checks during bulk migration with SET FOREIGN_KEY_CHECKS=0.

3,100 views Read Fix →
PYTHON IMPORT · #PY-007
ModuleNotFoundError in virtual environment — pip installed globally but not inside venv
ModuleNotFoundError: No module named 'requests'

Package installed to system Python, not active venv. Fix: activate venv first, then pip install. Verify with which python.

5,400 views Read Fix →
VB.NET RUNTIME · #VB-031
NullReferenceException on DataGridView load — DataSource bound before data fetched
System.NullReferenceException: Object reference not set to an instance

Binding fires before async fetch completes. Fix: await the data load, then set DataSource. Use BindingSource for dynamic updates.

2,700 views Read Fix →
WORDPRESS PLUGIN · #WP-012
White Screen of Death after plugin activation — memory limit exhausted on init hook
Fatal error: Allowed memory size of 67108864 bytes exhausted

Plugin loading heavy library on every request. Fix: lazy-load on relevant admin pages only. Increase WP_MEMORY_LIMIT in wp-config as temporary measure.

6,200 views Read Fix →
Section VII · Code Archive

Copy. Adapt. Ship.

All 800 Snippets →
PHP · PATTERN
Singleton Database Connection

Thread-safe PDO connection with single instance guarantee. Works with MySQL, PostgreSQL, SQLite.

private static ?self $instance = null;
12 uses this week View →
PYTHON · UTILITY
Rate-Limited API Client

Async HTTP client with automatic retry, exponential backoff, and per-domain rate limiting.

async def fetch_with_retry(url, max=3):
28 uses this week View →
SQL · QUERY
Recursive CTE Hierarchy

Self-referencing table traversal for category trees, org charts, and menu structures using Common Table Expressions.

WITH RECURSIVE tree AS (SELECT ...)
19 uses this week View →
JAVASCRIPT · HOOK
Custom useDebounce Hook

React hook for debouncing search inputs, form fields, and resize events. Prevents excessive API calls.

const useDebounce = (value, delay) => {
41 uses this week View →
Section VIII · Structured Learning

LEARNING_PATHS: READY // 4_TRACKS · STRUCTURED · MENTOR_GUIDED

Learning Paths

All 24 Paths →

PHP Developer: Zero to Production

Beginner

From syntax fundamentals to building RESTful APIs and WordPress plugins. Designed for complete beginners with no prior programming background.

PHP Syntax & Data Types
OOP: Classes, Interfaces, Traits
Database: PDO & MySQL
REST API Design
WordPress Plugin Development
18 modules · ~40 hrs Start Path →

Full-Stack JavaScript: React + Node

Mid-Level

Modern full-stack development with React, Node.js, Express, and PostgreSQL. Includes deployment, auth, and real project builds.

Modern ES2024 JavaScript
React: State, Hooks, Context
Node.js & Express APIs
Auth: JWT & OAuth 2.0
CI/CD & Deployment
22 modules · ~60 hrs Start Path →

Software Architecture Mastery

Advanced

Design patterns, SOLID principles, microservices, event-driven architecture, and real-world system design interview preparation.

Design Patterns: GoF 23
Domain-Driven Design
Microservices & Event Bus
Scalability Patterns
System Design Interviews
16 modules · ~35 hrs Start Path →

AI Integration for Developers

Mid-Level

Practical AI integration using Claude API, OpenAI, and MCP. Build real AI-powered applications, tools, and automation workflows.

LLM Fundamentals & Prompting
Claude API & OpenAI SDK
Model Context Protocol (MCP)
RAG Systems & Embeddings
Deploying AI-Powered Apps
14 modules · ~28 hrs Start Path →

"The best engineering knowledge is not found in textbooks — it is extracted from late nights, broken builds, angry clients, and the stubborn refusal to stop until the problem is solved."

— Debasis Bhattacharjee · Software Architect · 20 Years in Production

Section X · The Ecosystem Grows

ARCHIVE_GROWING // CONTRIBUTIONS_OPEN · LIVING_DOCUMENT

This Is a Living Archive. Not a Static Library.

Every week, new errors are documented, new interview patterns are added, and new solutions are tested in production. The knowledge hub grows because real problems keep appearing — and every answer earns its place here by actually working.

If you found a fix that saved your project, or spotted an answer that could be better — the door is always open. This ecosystem belongs to everyone who uses it.

Submit via Email
Send your question, error, or solution directly
Submit →
Leave a Testimonial
Did something here help you? Share your experience
Share →
Comment on Facebook
Find us at @iamdebasisbhattacharjee
Visit →
Get Update Alerts
Subscribe to be notified of new additions
Subscribe →
Section XI · Let's Talk

Knowledge is Free.
Mentorship is Personal.

The hub is open to everyone — but if you need structured guidance, 1-on-1 mentorship, or corporate training, that's a different conversation. Let's have it.

hello@debasisbhattacharjee.com  ·  +91 8777088548  ·  Mon–Fri, 9AM–6PM IST