What Actually Happens When You Set Model Weights to Zero (and Why Gradients Still Work)
A common fear in deep learning is: “If I set some weights to zero, won’t I break differentiability?”
Read more →Short updates, context, and essays that complement the long-form notes in the bookshelf.
Looking for a specific topic? Search all posts.
A common fear in deep learning is: “If I set some weights to zero, won’t I break differentiability?”
Read more →If you want to understand Hollywood color, you do not start with knobs and curves. You start with a pipeline.
Read more →Taylor Series Expansion: A Local Lens for Functions
Read more →Learning rate schedules are the steering wheel of optimization. The learning rate controls how far each step moves in parameter space; the schedule controls how that step size e...
Read more →Dijkstra’s algorithm is the workhorse for finding shortest paths in weighted graphs with non‑negative edges. It’s taught in algorithms courses, but it also shows up quietly insi...
Read more →Symmetric Groups: Understanding Permutations from First Principles
Read more →Bijective Functions: The Perfect Correspondence
Read more →This post explores one of the most beautiful concepts in linear algebra: signed volume. We’ll see why volumes need signs, what orientation means geometrically, and how this simp...
Read more →This post explores the determinant—one of the most fundamental concepts in linear algebra. We’ll build intuition from geometric interpretations, derive the Leibniz formula, and ...
Read more →This post explores how modern video editing software has integrated advanced matting and mask refinement technology into production workflows. We’ll compare Adobe Premiere Pro’s...
Read more →This post explores how to extract foreground objects from video sequences with temporally consistent alpha mattes. We build on image matting concepts to handle the additional ch...
Read more →This post explores how to extract foreground objects with accurate transparency information at edges, enabling professional-quality compositing. We’ll cover the mathematical fou...
Read more →This post explores how computers perceive depth in images, bridging classical geometric methods with modern neural approaches. Familiarity with basic linear algebra, projective ...
Read more →This interactive tool lets you explore the fundamental relationship between depth and disparity in stereo vision systems. Experiment with camera parameters to build intuition ab...
Read more →Reading Time: 40-50 minutes This comprehensive article includes rigorous mathematical derivations requiring careful thought. Plan for extra time to work through the ELBO derivat...
Read more →This post provides the mathematical foundation for understanding expectation, a concept central to probability theory and machine learning. This explanation supports the ELBO de...
Read more →Imagine you have thousands of images and need to build a system that can predict their contrast characteristics. The catch? You have no labeled ground truth. No “correct” contra...
Read more →This post builds on concepts from Stochastic Processes and the Art of Sampling Uncertainty. For a broader introduction to differential equations (ODEs, PDEs, SDEs), see The Land...
Read more →About Atul Singh
Read more →Color balance is the practice of aligning the relative intensities of primaries so that neutral surfaces appear neutral and hues stay believable under different illuminants. Whe...
Read more →Note: This post builds on Understanding Contrast in Images: From Perception to Computation, Understanding Contrast in Color Images: Beyond Luminance, Measuring Contrast Betwe...
Read more →This post provides the mathematical foundation for understanding Brownian Motion and Modern Generative Models. It can be read independently or as a prerequisite for that post.
Read more →Table of Contents
Read more →When comparing contrast between two images, there’s a fundamental distinction that’s often overlooked: Are we comparing different versions of the same scene, or are we comparing...
Read more →When working with image processing pipelines, quality assessment, or change detection systems, we often need to quantify how much the contrast differs between two images. This g...
Read more →This post assumes familiarity with basic calculus and Brownian motion. For a broader context on differential equations, see The Landscape of Differential Equations.
Read more →Contrast is one of those terms that feels intuitive until you try to define it precisely. “High contrast” evokes vivid images with deep blacks and brilliant whites, while “low c...
Read more →In grayscale images, contrast is relatively straightforward—it’s all about differences in brightness. But when color enters the picture, the story becomes fascinatingly complex....
Read more →This post provides a detailed mathematical analysis of total variation for Brownian motion. It builds on concepts from Mathematical Properties of Brownian Motion and connects to...
Read more →This post establishes foundational concepts that underpin many advanced topics. For historical context on why this distinction matters, see Why Discriminative Learning Dominated...
Read more →This post provides historical context for the paradigm shift discussed in Machine Learning Paradigms: Learning Distributions vs Approximating Functions.
Read more →This post explores one of the fundamental obstacles in generative modeling, discussed in Why Discriminative Learning Dominated First. For the broader context, see Machine Learni...
Read more →This post explores another fundamental obstacle in machine learning, discussed in Why Discriminative Learning Dominated First. For the broader context, see Machine Learning Para...
Read more →TL;DR
Read more →TL;DR
Read more →Modern C++ gives you several kinds of references, each solving different problems around binding, lifetime, and ownership. Before C++11, you had one tool: the lvalue reference. ...
Read more →Question: The model learns to generate a panoptic mask, which is an array of discrete tokens, conditioned on an input image. This avoids the inductive biases of traditional m...
Read more →Directly sampling from a probability density function (PDF) or probability mass function (PMF) sounds deceptively simple: evaluate the function, interpret the number, and draw a...
Read more →Introduction
Read more →From Elementary Mathematics to Vision Algorithms: The Hidden Life of Normalized Power Sums
Read more →Light is never just “white.” It has a shape, a fingerprint, a dance across wavelengths. Cameras record it, eyes interpret it, and both struggle in their own ways. In this post, ...
Read more →From Elementary Mathematics to Vision Algorithms: The Hidden Life of Normalized Power Sums
Read more →Linear algebra did not start as an abstract theory. It started as a practical question: when does a system of equations have a solution, and how many are there? From that concre...
Read more →Image filters feel natural in the spatial domain because we see pixels and neighborhoods. Yet many operations become simpler—or even possible only—in the frequency domain. This ...
Read more →Kullback–Leibler (KL) divergence quantifies how one probability distribution departs from another. It shows up everywhere—from maximum-likelihood training to variational inferen...
Read more →Color pipelines that ingest YUV video frames but render or analyze in RGB often have to answer two expensive questions before any correction work begins:
Read more →Table of Contents
Read more →Tensor indexing feels natural once you see how dimensions line up. This walkthrough starts with 1D arrays and climbs to N-dimensional tensors, highlighting how PyTorch treats sl...
Read more →Looking for a single place to trace how the notebook has evolved? This reference page aggregates every post that landed on the site, sliced by month, quarter, half-year, and ful...
Read more →If you have written SIMD intrinsics for SSE or AVX, many of the mental models translate directly to CUDA or HIP kernels. Both ecosystems ask you to think in vectors, manage memo...
Read more →Why the Vocabulary Feels Slippery
Read more →Modern GPUs deliver petaflops by letting thousands of lightweight threads execute in lockstep. To harness that power, you need to understand how your kernels are scheduled and w...
Read more →Why Templates Still Matter
Read more →Modern CPUs have wide vector units capable of executing the same operation across multiple data elements at once. Compiler auto-vectorization handles the simple cases, but when ...
Read more →The lαβ color model from Reinhard, Ashikhmin, Gooch, and Shirley (2001) fuses two simple ideas: compress channel intensities with a logarithm so that multiplicative illumination...
Read more →When Theory Meets Practice in Sampling
Read more →Why Stochastic Thinking Matters
Read more →std::async is the standard library’s shortest path to running work asynchronously and receiving the result as a std::future. Used well, it hides thread management, propagates ex...
Read more →In the previous post we focused on the contract between a single promise and future. Real systems rarely stop there. Data pipelines, UI flows, and service backends routinely lau...
Read more →Modern C++ puts powerful concurrency tools in your hands, but raw threads still make it easy to race on shared state, forget to join, or swallow exceptions. Futures and promises...
Read more →Introduction
Read more →Why the Implicit Function Theorem Matters
Read more →Interactive Figures Spin, zoom, and inspect these plots to see how gradient and Hessian information shapes the local geometry and degeneracies we meet in vision pipelines.
Read more →Interactive Figures
Read more →Why Intersection Fails in Lagrange Multipliers: The Geometry of Optimization
Read more →Mathematics often rewards patient observers with patterns that seemed invisible at first glance. One of my favorite examples links two functions that appear to live on separate ...
Read more →Color pipelines can feel abstract until you tie each transform to what your eye sees. In this post we follow a Rec.709 image as it moves into ACES, gets graded in a scene-referr...
Read more →Optical character recognition rarely gets the spotlight, yet it’s the bridge between physical documents and searchable data. In 2005, HP open-sourced its internal Tesseract engi...
Read more →Negative feelings often get lumped together, but the words we choose point to very different shades of experience. I like to think of resent as the seed in a small emotional eco...
Read more →MathJax Test Page
Read more →This is Part 8 (Final) of an 8-part series on Kalman Filtering. Part 7 covered nonlinear extensions.
Read more →This is Part 7 of an 8-part series on Kalman Filtering. Part 6 explored real-world applications.
Read more →This is Part 6 of an 8-part series on Kalman Filtering. Part 5 covered Python implementation details.
Read more →This is Part 5 of an 8-part series on Kalman Filtering. Part 4 provided the complete mathematical derivation.
Read more →This is Part 4 of an 8-part series on Kalman Filtering. Part 3 established the Bayesian foundations.
Read more →This is Part 3 of an 8-part series on Kalman Filtering. Part 2 explored recursive filtering fundamentals.
Read more →This is Part 2 of an 8-part series on Kalman Filtering. Part 1 introduced state estimation concepts.
Read more →This is Part 1 of an 8-part series on Kalman Filtering. This series will take you from basic concepts to advanced applications and implementations.
Read more →One of the benefits of keeping research inside Obsidian is that the vault can double as a content pipeline. Folders map to learning projects, daily notes track reading sessions,...
Read more →The books and long-form research that live in this repository tend to move slowly. They collect notes, citations, and iterative drafts over weeks or months before they finally b...
Read more →Canny’s blend of Gaussian smoothing, gradient estimation, non-maximum suppression, and hysteresis linking made it the go-to detector for general-purpose pipelines in 2013. Below...
Read more →Edge detection has been a foundational pre-processing step in computer vision since the earliest digital imaging pipelines. Around 2013, practitioners still relied on a handful ...
Read more →