What is the difference between batch gradient descent, stochastic gradient descent, and mini-batch gradient descent?

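Batch gradient descent (batch GD) computes the gradient over the entire dataset for every parameter update, so each update is slow but stable. Stochastic gradient descent (SGD) computes the gradient on a single example per update, so each update is fast but noisy. Mini-batch gradient descent computes the gradient on a small subset (typically 32-256 examples) per update, balancing speed and stability. Mini-batch is the standard for deep learning.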
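
As a rough illustration, the three variants can be written as one training loop that differs only in how many examples feed each gradient computation. The sketch below assumes a linear-regression model with mean squared error, synthetic data, and NumPy; the function name `gradient_descent`, its parameters, and the learning rates are hypothetical choices made for this example, not part of any library.

```python
import numpy as np

def gradient_descent(X, y, batch_size, lr=0.1, epochs=50, seed=0):
    """Fit linear-regression weights by minimizing mean squared error.

    batch_size == len(X)  -> batch gradient descent (one update per epoch)
    batch_size == 1       -> stochastic gradient descent (one update per example)
    anything in between   -> mini-batch gradient descent
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(epochs):
        order = rng.permutation(n)  # reshuffle the examples every epoch
        for start in range(0, n, batch_size):
            idx = order[start:start + batch_size]
            Xb, yb = X[idx], y[idx]
            # Gradient of the mean squared error over the current batch only
            grad = 2.0 * Xb.T @ (Xb @ w - yb) / len(idx)
            w -= lr * grad
    return w

# Synthetic data (an assumption for this sketch): y = X @ true_w + small noise
data_rng = np.random.default_rng(42)
X = data_rng.normal(size=(512, 3))
true_w = np.array([2.0, -1.0, 0.5])
y = X @ true_w + 0.01 * data_rng.normal(size=512)

w_batch = gradient_descent(X, y, batch_size=len(X))   # batch GD
w_sgd = gradient_descent(X, y, batch_size=1, lr=0.01)  # SGD, smaller step to tame noise
w_mini = gradient_descent(X, y, batch_size=64)         # mini-batch GD
```

On this small, well-conditioned problem all three runs should land close to `true_w`. The practical difference is in the number of updates per epoch: one for batch GD, 512 for SGD, and 8 for mini-batches of 64.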
