companion content · math depth

Convolution as Matrix Multiplication

Convolution can be expressed as multiplication by a Toeplitz matrix, a structured matrix where each diagonal is constant.

Instructor

In the Signal Processing module, you applied convolution as a sliding window over data. That's the intuitive view. But mathematically, convolution is matrix multiplication in disguise. The kernel becomes a special structured matrix called a Toeplitz matrix, and the "sliding" becomes a single matrix-vector multiply. GPUs are built for matrix math, which is why they run convolutions so fast.

Learning Objectives

○Understand the Toeplitz matrix and how it encodes a sliding window
○Convert a 1D convolution into an equivalent matrix multiplication
○Explain why convolution in the time domain equals multiplication in the frequency domain
○Connect CSS transform matrices to the structured matrices used in convolution
○Reason about the computational cost of direct convolution vs. FFT-based approaches

The Sliding Window as a Matrix

When you convolve a kernel [a, b, c] with an input [x1, x2, x3, x4, x5], you slide the kernel across the input and compute a dot product at each position. But you can express the whole operation as a single matrix-vector multiply:

Frontend

CSS Transform Matrix

transform: matrix(a, b, c, d, tx, ty)

Machine Learning

Toeplitz Matrix

const output = tf.matMul(toeplitzMatrix, inputVector)

Structural Bridge

Where the analogy ends

A CSS transform matrix is a fixed matrix you write by hand: matrix(a, b, c, d, tx, ty) fills in the entries of a 3x3 homogeneous matrix. The Toeplitz matrix that represents a convolution is derived automatically from the kernel, is sparse and structured, and serves only as a mathematical equivalence; practical implementations exploit the structure to avoid building the explicit matrix.
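
To make the bridge concrete, here is a minimal, dependency-free sketch showing that both sides reduce to the same matrix-vector multiply. The helper names `cssMatrix` and `matVec` and the sample values are illustrative assumptions, not part of any library.

```typescript
// matrix(a, b, c, d, tx, ty) expands to this 3x3 homogeneous matrix.
function cssMatrix(a: number, b: number, c: number, d: number, tx: number, ty: number): number[][] {
  return [
    [a, c, tx],
    [b, d, ty],
    [0, 0, 1],
  ];
}

// Generic matrix-vector multiply: one dot product per row.
function matVec(matrix: number[][], vector: number[]): number[] {
  return matrix.map(row => row.reduce((sum, value, j) => sum + value * vector[j], 0));
}

// Hypothetical transform: scale by 2, then translate by (10, 5).
const transform = cssMatrix(2, 0, 0, 2, 10, 5);
const point = matVec(transform, [3, 4, 1]); // the point (3, 4) in homogeneous form
console.log(point); // [16, 13, 1]

// The same helper applies a Toeplitz matrix built from the kernel [1, 0, -1].
const toeplitzRows = [
  [1, 0, -1, 0, 0],
  [0, 1, 0, -1, 0],
  [0, 0, 1, 0, -1],
];
console.log(matVec(toeplitzRows, [3, 7, 2, 8, 4])); // [1, -1, -2]
```

The only difference is where the matrix comes from: the CSS entries are chosen by hand, while every Toeplitz row is a shifted copy of the kernel.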

toeplitz-basic.ts · typescript

import * as tf from '@tensorflow/tfjs';

// Kernel: [1, 0, -1] (edge detector, like a CSS filter)
// Input:  [3, 7, 2, 8, 4]

// Sliding window (the intuitive way):
// Position 0: 1*3 + 0*7 + (-1)*2 = 1
// Position 1: 1*7 + 0*2 + (-1)*8 = -1
// Position 2: 1*2 + 0*8 + (-1)*4 = -2

const kernel = tf.tensor1d([1, 0, -1]);
const input = tf.tensor1d([3, 7, 2, 8, 4]);

// The SAME operation as a Toeplitz matrix:
const toeplitz = tf.tensor2d([
  [1,  0, -1,  0,  0],   // kernel at position 0
  [0,  1,  0, -1,  0],   // kernel at position 1
  [0,  0,  1,  0, -1],   // kernel at position 2
]);

const resultMatrix = tf.matMul(toeplitz, input.reshape([5, 1]));
console.log('Matrix result:', await resultMatrix.array());
// [[1], [-1], [-2]]: identical to the sliding window

// Using tf.conv1d for comparison
const resultConv = tf.conv1d(
  input.reshape([1, 5, 1]),
  kernel.reshape([3, 1, 1]),
  1,    // stride
  'valid'
);
console.log('Conv1d result:', await resultConv.array());
// [[[1], [-1], [-2]]]: same answer
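
One detail worth flagging: `tf.conv1d`, like most deep learning libraries, slides the kernel without flipping it, which signal processing calls cross-correlation. Textbook convolution reverses the kernel first. The dependency-free sketch below (helper names are hypothetical) shows the difference on the same data; for symmetric kernels the two coincide.

```typescript
// Sliding dot product without flipping the kernel (the ML meaning of "convolution").
function crossCorrelate(input: number[], kernel: number[]): number[] {
  const outputLength = input.length - kernel.length + 1; // 'valid' padding
  const output: number[] = [];
  for (let i = 0; i < outputLength; i++) {
    let sum = 0;
    for (let j = 0; j < kernel.length; j++) {
      sum += input[i + j] * kernel[j];
    }
    output.push(sum);
  }
  return output;
}

// Textbook convolution: the same sliding window, but with the kernel reversed.
function convolveValid(input: number[], kernel: number[]): number[] {
  return crossCorrelate(input, [...kernel].reverse());
}

const signal = [3, 7, 2, 8, 4];
console.log(crossCorrelate(signal, [1, 0, -1])); // [1, -1, -2]
console.log(convolveValid(signal, [1, 0, -1]));  // [-1, 1, 2]
```

Either version has a Toeplitz matrix; the rows simply hold the kernel in forward or reversed order.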

Building a Toeplitz Matrix

The pattern is simple: each row of the Toeplitz matrix is the kernel shifted one position to the right, padded with zeros. The diagonals are constant. That's the defining property of a Toeplitz matrix.
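
The constant-diagonal property is easy to check in code. The following is a small sketch, with `isToeplitz` as a hypothetical helper name, that compares each entry with its upper-left neighbour.

```typescript
// Returns true when every top-left-to-bottom-right diagonal holds a single value,
// i.e. matrix[i][j] === matrix[i - 1][j - 1] wherever both entries exist.
function isToeplitz(matrix: number[][]): boolean {
  for (let i = 1; i < matrix.length; i++) {
    for (let j = 1; j < matrix[i].length; j++) {
      if (matrix[i][j] !== matrix[i - 1][j - 1]) return false;
    }
  }
  return true;
}

const shiftedKernelRows = [
  [1, 0, -1, 0, 0],
  [0, 1, 0, -1, 0],
  [0, 0, 1, 0, -1],
];
const notToeplitz = [
  [1, 0, -1, 0, 0],
  [0, 2, 0, -1, 0], // the 2 breaks the main diagonal
  [0, 0, 1, 0, -1],
];
console.log(isToeplitz(shiftedKernelRows)); // true
console.log(isToeplitz(notToeplitz));       // false
```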

build-toeplitz.ts · typescript

import * as tf from '@tensorflow/tfjs';

function buildToeplitz(kernel: number[], inputLength: number): number[][] {
  const kernelLength = kernel.length;
  const outputLength = inputLength - kernelLength + 1; // 'valid' padding
  const matrix: number[][] = [];

  for (let i = 0; i < outputLength; i++) {
    const row = new Array(inputLength).fill(0);
    for (let j = 0; j < kernelLength; j++) {
      row[i + j] = kernel[j];
    }
    matrix.push(row);
  }
  return matrix;
}

// Build Toeplitz for a 3-element kernel on 7-element input
const kernel = [1, -2, 1]; // second-difference operator
const T = buildToeplitz(kernel, 7);

console.log('Toeplitz matrix:');
T.forEach(row => console.log(row.map(v => v.toString().padStart(3)).join(' ')));
// Each row is the kernel shifted right by one position:
//   1 -2  1  0  0  0  0
//   0  1 -2  1  0  0  0
//   0  0  1 -2  1  0  0
//   0  0  0  1 -2  1  0
//   0  0  0  0  1 -2  1

// Apply it
const input = tf.tensor1d([1, 4, 6, 4, 1, 0, 2]);
const toeplitzTensor = tf.tensor2d(T);
const result = tf.matMul(toeplitzTensor, input.reshape([7, 1]));
console.log('Result:', await result.flatten().array());

The Convolution Theorem

One of the most beautiful results in mathematics: convolution in the time domain equals element-wise multiplication in the frequency domain. This means you can compute convolution by:

1. FFT the input
2. FFT the kernel
3. Multiply element-wise
4. Inverse FFT

For large kernels, this is dramatically faster than sliding the kernel across the input.
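
The four steps can be sketched without any library. The code below is an illustration under stated assumptions: it uses a naive O(n²) DFT as a stand-in for a real FFT, zero-pads both signals to length n + k - 1 so the circular result matches linear convolution, and uses textbook (kernel-flipped) convolution for the direct version. All helper names are hypothetical.

```typescript
type Complex = { re: number; im: number };

// Naive O(n^2) DFT standing in for an FFT. inverse = true applies the 1/n scaling.
function dft(values: Complex[], inverse = false): Complex[] {
  const n = values.length;
  const sign = inverse ? 1 : -1;
  const result: Complex[] = [];
  for (let k = 0; k < n; k++) {
    let re = 0;
    let im = 0;
    for (let t = 0; t < n; t++) {
      const angle = (sign * 2 * Math.PI * k * t) / n;
      re += values[t].re * Math.cos(angle) - values[t].im * Math.sin(angle);
      im += values[t].re * Math.sin(angle) + values[t].im * Math.cos(angle);
    }
    result.push(inverse ? { re: re / n, im: im / n } : { re, im });
  }
  return result;
}

// Zero-pad a real signal to the given length and lift it to complex numbers.
function padToComplex(signal: number[], length: number): Complex[] {
  return Array.from({ length }, (_, i) => ({ re: i < signal.length ? signal[i] : 0, im: 0 }));
}

// Direct 'full' linear convolution for comparison.
function convolveFull(input: number[], kernel: number[]): number[] {
  const output = new Array<number>(input.length + kernel.length - 1).fill(0);
  input.forEach((x, i) => kernel.forEach((h, j) => { output[i + j] += x * h; }));
  return output;
}

// Transform both signals, multiply element-wise, transform back.
function convolveViaDft(input: number[], kernel: number[]): number[] {
  const size = input.length + kernel.length - 1; // padding prevents wrap-around
  const A = dft(padToComplex(input, size));
  const B = dft(padToComplex(kernel, size));
  const product = A.map((a, i) => ({
    re: a.re * B[i].re - a.im * B[i].im,
    im: a.re * B[i].im + a.im * B[i].re,
  }));
  return dft(product, true).map(c => c.re);
}

const direct = convolveFull([3, 7, 2, 8, 4], [0.25, 0.5, 0.25]);
const viaDft = convolveViaDft([3, 7, 2, 8, 4], [0.25, 0.5, 0.25]);
console.log(direct); // [0.75, 3.25, 4.75, 4.75, 5.5, 4, 1]
console.log(viaDft.map(v => Number(v.toFixed(6)))); // same values up to rounding
```

The naive DFT here is slower than direct convolution; the speedup only appears once the transform is a true FFT.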

convolution-theorem.ts · typescript

import * as tf from '@tensorflow/tfjs';

// Demonstrate the convolution theorem
// For simplicity, we'll work with compatible sizes

// The idea: conv(a, b) = ifft(fft(a) * fft(b))
// This is why audio processing uses FFT-based convolution
// for reverb effects: the impulse response kernel can be huge

// Direct convolution: O(n * k) where n=input size, k=kernel size
// FFT convolution:    O(n * log(n))
// When k is large (e.g., audio reverb with 48000-sample kernel),
// FFT wins massively

// In neural networks, kernels are small (3x3, 5x5)
// so direct convolution (via matrix multiply) is usually faster.
// But for signal processing with large kernels, FFT is essential.

// The connection to matrix multiplication:
// For CIRCULAR convolution, the Toeplitz matrix becomes circulant,
// and circulant matrices are diagonalized by the DFT matrix:
// C = F^(-1) * D * F, where D has the FFT of the kernel on diagonal.
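// This is WHY the convolution theorem works. For ordinary (linear) convolution,
// zero-pad both signals to length n + k - 1 first so the circular wrap-around
// only adds zeros; the 'valid' outputs are then a slice of that result.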

const inputSize = 8;
const input = tf.randomNormal([inputSize]);
const kernel = tf.tensor1d([0.25, 0.5, 0.25]); // smoothing kernel

// Direct convolution
const directResult = tf.conv1d(
  input.reshape([1, inputSize, 1]),
  kernel.reshape([3, 1, 1]),
  1,
  'valid'
);

console.log('Direct convolution:', await directResult.flatten().array());
// FFT-based would give the same result, but is only faster for large kernels
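
To get a feel for where the crossover sits, here is a rough cost model. It is a sketch only: it counts multiplies, ignores constant factors and memory traffic, and assumes the FFT route needs three transforms of a power-of-two size. The function names and the sample sizes are illustrative assumptions.

```typescript
// Direct convolution: roughly one multiply per (output position, kernel tap) pair.
function directCost(n: number, k: number): number {
  return n * k;
}

// FFT route: three transforms (input, kernel, inverse) of size m, where m is the
// next power of two >= n + k - 1, each costing about m * log2(m) operations.
function fftCost(n: number, k: number): number {
  let m = 1;
  while (m < n + k - 1) m *= 2;
  return 3 * m * Math.log2(m);
}

const cases: Array<[number, number]> = [
  [1000, 3],      // neural-network-sized kernel
  [1000, 64],
  [48000, 48000], // one second of audio against a one-second impulse response
];

for (const [n, k] of cases) {
  const direct = directCost(n, k);
  const fft = fftCost(n, k);
  const winner = direct < fft ? 'direct' : 'FFT';
  console.log(`n=${n}, k=${k}: direct ~${direct}, FFT ~${Math.round(fft)} -> ${winner}`);
}
```

Under this model the 3-tap kernel favors direct convolution by about an order of magnitude, while the 48000-sample kernel favors the FFT by more than two orders of magnitude. Real crossover points depend on the hardware and the implementation.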

Why This Matters for GPUs

GPUs have thousands of cores optimized for matrix multiplication. By expressing convolution as a matrix multiply (via Toeplitz or the related im2col approach), convolutions can run at close to peak GPU throughput. This is a major reason deep learning took off once GPUs became accessible.

performance-insight.ts · typescript

import * as tf from '@tensorflow/tfjs';

// In practice, frameworks use im2col (image to column):
// 1. Rearrange input patches into columns of a matrix
// 2. Reshape kernel into a matrix
// 3. Single matrix multiply = all convolution outputs
//
// This trades memory for speed: classic CS tradeoff

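// Benchmark tf.conv1d (how it executes internally depends on the backend)
const inputLength = 1000;
const kernelSize = 5;
const input = tf.randomNormal([1, inputLength, 1]);
const filter = tf.randomNormal([kernelSize, 1, 1]);

// Warm-up run so one-time setup (e.g. shader compilation) is not timed
const warmup = tf.conv1d(input, filter, 1, 'valid');
await warmup.data();
warmup.dispose();

const start = performance.now();
for (let i = 0; i < 100; i++) {
  const result = tf.conv1d(input, filter, 1, 'valid');
  // GPU backends queue work asynchronously; wait on the last result
  // so the timer covers execution, not just dispatch
  if (i === 99) await result.data();
  result.dispose();
}
const elapsed = performance.now() - start;
console.log(`100 convolutions: ${elapsed.toFixed(1)}ms`);
// Fast because the backend maps the work onto parallel, matrix-style kernels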
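
The im2col idea described in the comments above can be sketched in one dimension without a GPU. This is an illustrative, dependency-free version with hypothetical helper names; it stores patches as rows, whereas real frameworks pick their own layout and add batching and channels.

```typescript
// Gather every length-k window of the input into one row of a patch matrix.
function im2col1d(input: number[], kernelSize: number): number[][] {
  const outputLength = input.length - kernelSize + 1; // 'valid' padding
  const patches: number[][] = [];
  for (let i = 0; i < outputLength; i++) {
    patches.push(input.slice(i, i + kernelSize));
  }
  return patches;
}

// Plain matrix multiply: (rows x inner) times (inner x cols).
function matMul(a: number[][], b: number[][]): number[][] {
  return a.map(row =>
    b[0].map((_, col) => row.reduce((sum, value, i) => sum + value * b[i][col], 0))
  );
}

const signal = [3, 7, 2, 8, 4];
const patches = im2col1d(signal, 3);
// patches = [[3, 7, 2], [7, 2, 8], [2, 8, 4]]

// Two filters stored as columns: an edge detector and a smoother.
const filters = [
  [1, 0.25],
  [0, 0.5],
  [-1, 0.25],
];

// One multiply produces every output position for every filter.
console.log(matMul(patches, filters));
// [[1, 4.75], [-1, 4.75], [-2, 5.5]]
```

Compared with the Toeplitz form, im2col duplicates input values instead of kernel values, which is the memory-for-speed trade mentioned above.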

Challenge

Build a Toeplitz matrix from a kernel and verify it matches TensorFlow.js convolution output.

Recall Prompt

How does expressing a convolution as a Toeplitz matrix multiplication explain why GPUs run convolutional layers so fast?

Lesson Recap

What you learned

✓A convolution can be rewritten as a single matrix-vector multiply using a Toeplitz matrix, where each row is the kernel shifted by one position.
✓The convolution theorem states that convolution in the time domain equals element-wise multiplication in the frequency domain, which is why FFT-based convolution is faster for large kernels.
✓For small kernels (typical in neural networks) the Toeplitz/im2col matrix approach wins; for large kernels (audio reverb) the FFT approach wins.

The bridge

A CSS `transform: matrix()` is a fixed matrix you write by hand; a Toeplitz matrix is automatically derived from a convolutional kernel and exploits the same matrix-multiply hardware path that makes GPU transforms fast.

You can now

Build a Toeplitz matrix from a kernel and verify its output matches `tf.conv1d`, then reason about when to prefer matrix versus FFT convolution.
