Can I learn machine learning with JavaScript?

Yes. Tensorcraft teaches machine learning entirely in JavaScript and TypeScript using TensorFlow.js. Models train and run in the browser, no Python required. The curriculum covers neural networks, LSTMs, CNNs, and transformers through hands-on tutorials built for frontend developers.

How does Tensorcraft teach ML to frontend developers?

Through 50+ 'bridge' analogies that map frontend concepts you already know (like useState, Array.map, and fetch) to their ML equivalents (model weights, tensor operations, and inference APIs). Each course is a story-driven narrative where you build real ML models.

How much math do I need?

None upfront. You need working JavaScript: comfortable with functions, arrays, and async. The math is there when you want it: derivations sit in optional expandable drawers, and you can finish every module without opening one.

Module 1 of Deep Orbit, the live theme, is free, with no account or credit card required. The other four themes ship in waves, each with a waitlist. Full themes cost $59 each, with bundle discounts available up to $159 for all 5 themes.

What ML topics does Tensorcraft cover?

Five specializations: Time-Series & Signals (RNNs, LSTMs), Computer Vision (CNNs, YOLO), NLP & Text Intelligence (Transformers, BERT), Multimodal & Generative AI (GANs, Diffusion), and Edge AI & Production ML (quantization, MLOps).

What if it turns out not to be for me?

Module 1 is free before any money moves. After purchase there's a 14-day money-back guarantee: full refund if you've used less than 20% of a theme.

companion content · math depth

The Calculus of Neurons

Partial derivatives and the chain rule are the engine behind every gradient update in neural networks.

Instructor

In the Neural Networks module, you used backpropagation to train models. The framework handled the math. But understanding what's actually happening (partial derivatives flowing backward through a computational graph) gives you the intuition to debug vanishing gradients, pick better architectures, and understand why certain tricks work.

Learning Objectives

○ Compute partial derivatives for multi-variable functions
○ Apply the chain rule to composite functions step by step
○ Trace gradient flow through a simple neural network by hand
○ Use TensorFlow.js automatic differentiation to verify manual gradients
○ Connect the concept of derivatives to rates of change in animation code

Derivatives as Rates of Change

You already think in derivatives when you write animation code. In requestAnimationFrame, velocity is the derivative of position with respect to time: how fast the position changes per frame.

Frontend: requestAnimationFrame

const velocity = (pos - lastPos) / deltaTime

Machine Learning: Gradient

const grad = tf.grad(loss)(weights)

Structural Bridge

Where the analogy ends

rAF velocity is a finite difference of positions you observed frame by frame. tf.grad uses reverse-mode autodiff: it walks the computational graph backward, applying the chain rule to compute exact partial derivatives. Velocity tells you what already happened; the gradient tells you which way to step next, because moving a weight against its gradient reduces the loss.

A gradient in ML is the same idea: how fast does the loss change when you nudge a weight? If the gradient is large, a small weight change causes a big loss change. If it's near zero, that weight isn't doing much.

derivative-intuition.ts · typescript

import * as tf from '@tensorflow/tfjs';

// In frontend: velocity = rate of change of position
function animationDerivative(lastPos: number, currentPos: number, dt: number) {
  return (currentPos - lastPos) / dt;  // Finite-difference estimate of the derivative
}

// In ML: gradient = rate of change of loss w.r.t. weight
// For f(x) = x^2, the derivative is f'(x) = 2x
const f = (x: tf.Tensor) => x.square();
const df = tf.grad(f);

const x = tf.scalar(3);
const gradient = df(x);
console.log(await gradient.array()); // 6, because 2 * 3 = 6
// At x=3, increasing x by a tiny amount increases x^2 by ~6 times that amount
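
The link between the animation formula and the exact derivative can be checked numerically. The sketch below is not part of the original lesson: `forwardDifference` and `centralDifference` are hypothetical helper names, and the step sizes are arbitrary choices. It uses plain TypeScript with no TensorFlow.js, and shows the finite-difference estimate for f(x) = x^2 closing in on the exact value 2x = 6 at x = 3 as the step shrinks.

```typescript
// f(x) = x^2, whose exact derivative is f'(x) = 2x.
const squareFn = (x: number): number => x * x;

// Forward difference: the same shape as the rAF velocity formula,
// (value now - value before) / step.
function forwardDifference(fn: (x: number) => number, x: number, h: number): number {
  return (fn(x + h) - fn(x)) / h;
}

// Central difference: slope of the line through the points at x - h and x + h.
// Usually a better estimate than the forward difference for the same h.
function centralDifference(fn: (x: number) => number, x: number, h: number): number {
  return (fn(x + h) - fn(x - h)) / (2 * h);
}

// For x^2 the forward estimate is 6 + h, so the error shrinks with the step.
for (const h of [1, 0.1, 0.001]) {
  const forward = forwardDifference(squareFn, 3, h);
  const central = centralDifference(squareFn, 3, h);
  console.log(`h=${h} forward=${forward.toFixed(4)} central=${central.toFixed(4)}`);
}
```

Autodiff avoids this step-size trade-off entirely, which is why `tf.grad` reports the derivative itself rather than an estimate that depends on h.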

Partial Derivatives

Neural networks have many weights. A partial derivative tells you how the loss changes when you nudge one weight while holding all others fixed.

partial-derivatives.ts · typescript

import * as tf from '@tensorflow/tfjs';

// f(x, y) = x^2 * y + y^3
// Partial with respect to x: df/dx = 2xy (treat y as constant)
// Partial with respect to y: df/dy = x^2 + 3y^2 (treat x as constant)

// Manual computation at point (2, 3):
// df/dx = 2 * 2 * 3 = 12
// df/dy = 2^2 + 3 * 3^2 = 4 + 27 = 31

// Verify with TensorFlow.js
const f = (x: tf.Tensor, y: tf.Tensor) =>
  x.square().mul(y).add(y.pow(3));

// Gradient with respect to x
const dfdx = tf.grad((x) => f(x, tf.scalar(3)));
console.log(await dfdx(tf.scalar(2)).array()); // 12

// Gradient with respect to y
const dfdy = tf.grad((y) => f(tf.scalar(2), y));
console.log(await dfdy(tf.scalar(3)).array()); // 31
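
"Nudge one input while holding the others fixed" can also be written out literally. The following is a minimal sketch in plain TypeScript, not the lesson's own code: `partialX`, `partialY`, and the default step `h = 1e-5` are assumptions chosen for illustration. It estimates both partial derivatives of the same f(x, y) = x^2 * y + y^3 at (2, 3) with central differences.

```typescript
type TwoVarFn = (x: number, y: number) => number;

// The same function as in the listing above.
const surface: TwoVarFn = (x, y) => x ** 2 * y + y ** 3;

// Nudge x only; y is passed through unchanged (held fixed).
function partialX(fn: TwoVarFn, x: number, y: number, h = 1e-5): number {
  return (fn(x + h, y) - fn(x - h, y)) / (2 * h);
}

// Nudge y only; x is held fixed.
function partialY(fn: TwoVarFn, x: number, y: number, h = 1e-5): number {
  return (fn(x, y + h) - fn(x, y - h)) / (2 * h);
}

const approxDfdx = partialX(surface, 2, 3);
const approxDfdy = partialY(surface, 2, 3);

// Both estimates should land very close to the hand-computed 12 and 31.
console.log('df/dx ≈', approxDfdx.toFixed(4));
console.log('df/dy ≈', approxDfdy.toFixed(4));
```

A numerical check like this costs two function evaluations per input, so it is practical for debugging a handful of values but not for training a network with many weights.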

The Chain Rule

The chain rule is the single most important idea in deep learning. Every neural network is a deeply nested composite function, and the chain rule tells you how to compute the derivative of the whole thing.

chain-rule.ts · typescript

import * as tf from '@tensorflow/tfjs';

// Consider a 2-layer network (no bias, for simplicity):
//   z1 = w1 * x         (layer 1: linear)
//   a1 = relu(z1)       (activation)
//   z2 = w2 * a1        (layer 2: linear)
//   loss = (z2 - y)^2   (MSE loss)
//
// Chain rule for dLoss/dw1:
//   dLoss/dw1 = dLoss/dz2 * dz2/da1 * da1/dz1 * dz1/dw1
//
// Each factor is a simple local derivative. Let's compute them:

const x = 2.0;
const y = 1.0;   // target
const w1 = 0.5;
const w2 = -0.3;

// Forward pass
const z1 = w1 * x;           // 1.0
const a1 = Math.max(0, z1);  // 1.0 (ReLU)
const z2 = w2 * a1;          // -0.3
const loss = (z2 - y) ** 2;  // 1.69

// Backward pass (chain rule, right to left)
const dLoss_dz2 = 2 * (z2 - y);         // 2 * (-1.3) = -2.6
const dz2_da1 = w2;                      // -0.3
const da1_dz1 = z1 > 0 ? 1 : 0;         // 1 (ReLU derivative)
const dz1_dw1 = x;                       // 2

// Chain them together:
const dLoss_dw1 = dLoss_dz2 * dz2_da1 * da1_dz1 * dz1_dw1;
console.log('Manual gradient dL/dw1:', dLoss_dw1); // 1.56

// Verify with TensorFlow.js auto-diff
const computeLoss = (w1t: tf.Tensor) => {
  const z1t = w1t.mul(x);
  const a1t = z1t.relu();
  const z2t = tf.scalar(w2).mul(a1t);
  return z2t.sub(y).square();
};

const autoGrad = tf.grad(computeLoss);
console.log('Auto gradient dL/dw1:', await autoGrad(tf.scalar(w1)).array());
// Same value, about 1.56 (tensors default to float32, so expect small rounding differences)
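
The gradient computed above is the input to a gradient update. As a hedged sketch of that next step, in plain TypeScript and with an assumed learning rate of 0.1 (the lesson does not specify one), the code below recomputes the chain-rule gradients for both weights of the same two-layer network and takes one step against them. The names `forwardLoss`, `chainRuleGradients`, and `learningRate` are hypothetical.

```typescript
interface Weights {
  w1: number;
  w2: number;
}

const input = 2.0;  // x in the listing above
const target = 1.0; // y in the listing above

// Forward pass: loss = (w2 * relu(w1 * x) - y)^2
function forwardLoss({ w1, w2 }: Weights): number {
  const a1 = Math.max(0, w1 * input);
  return (w2 * a1 - target) ** 2;
}

// Backward pass: multiply the local derivatives, exactly as in the manual chain rule.
function chainRuleGradients({ w1, w2 }: Weights): Weights {
  const z1 = w1 * input;
  const a1 = Math.max(0, z1);
  const dLoss_dz2 = 2 * (w2 * a1 - target);
  const reluSlope = z1 > 0 ? 1 : 0;
  return {
    w1: dLoss_dz2 * w2 * reluSlope * input, // dL/dz2 * dz2/da1 * da1/dz1 * dz1/dw1
    w2: dLoss_dz2 * a1,                     // dL/dz2 * dz2/dw2
  };
}

const learningRate = 0.1; // assumed value, for illustration only
const before: Weights = { w1: 0.5, w2: -0.3 };
const grads = chainRuleGradients(before);

// Step each weight against its gradient.
const after: Weights = {
  w1: before.w1 - learningRate * grads.w1,
  w2: before.w2 - learningRate * grads.w2,
};

// The loss drops from about 1.69 to about 1.06 after this single step.
console.log(forwardLoss(before).toFixed(2), '->', forwardLoss(after).toFixed(2));
```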

The Computational Graph

Every neural network builds a computational graph during the forward pass. Backpropagation walks this graph in reverse, applying the chain rule at each node. This is why TensorFlow is called TensorFlow: tensors flow through a graph of operations.

computational-graph.ts · typescript

import * as tf from '@tensorflow/tfjs';

// Visualize a computational graph as a pipeline:
//
//   x ──┐
//       ├─ [multiply] ─ z1 ─ [relu] ─ a1 ──┐
//  w1 ──┘                                    ├─ [multiply] ─ z2 ─ [sub y] ─ [square] ─ loss
//                                       w2 ──┘
//
// Forward: left to right (compute values)
// Backward: right to left (compute gradients via chain rule)

// With multiple weights, tf.grads returns all gradients at once.
// The function receives each weight as its own argument; the returned
// gradient function takes the concrete values as an array.
const networkLoss = (w1: tf.Tensor, w2: tf.Tensor) => {
  const x = tf.scalar(2);
  const y = tf.scalar(1);

  const z1 = w1.mul(x);
  const a1 = z1.relu();
  const z2 = w2.mul(a1);
  return z2.sub(y).square();
};

const gradFn = tf.grads(networkLoss);
const [dw1, dw2] = gradFn([tf.scalar(0.5), tf.scalar(-0.3)]);

console.log('dL/dw1:', await dw1.array()); // 1.56
console.log('dL/dw2:', await dw2.array()); // -2.6 (= dL/dz2 * a1 = -2.6 * 1.0)
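
To make "walks this graph in reverse" concrete, here is a toy reverse-mode autodiff in plain TypeScript. It is a teaching sketch under stated assumptions, not how TensorFlow.js is implemented: the `GraphNode` class and its methods are hypothetical, it handles scalars only, and it supports just the four operations this network needs.

```typescript
// One node of the computational graph: a value, an accumulated gradient,
// the nodes it was computed from, and a rule for pushing its gradient back to them.
class GraphNode {
  value: number;
  grad = 0;
  private parents: GraphNode[];
  private pushBack: (out: GraphNode) => void;

  constructor(value: number, parents: GraphNode[] = [], pushBack: (out: GraphNode) => void = () => {}) {
    this.value = value;
    this.parents = parents;
    this.pushBack = pushBack;
  }

  mul(other: GraphNode): GraphNode {
    return new GraphNode(this.value * other.value, [this, other], (out) => {
      this.grad += other.value * out.grad; // d(a*b)/da = b
      other.grad += this.value * out.grad; // d(a*b)/db = a
    });
  }

  sub(other: GraphNode): GraphNode {
    return new GraphNode(this.value - other.value, [this, other], (out) => {
      this.grad += out.grad;
      other.grad -= out.grad;
    });
  }

  relu(): GraphNode {
    return new GraphNode(Math.max(0, this.value), [this], (out) => {
      this.grad += (this.value > 0 ? 1 : 0) * out.grad;
    });
  }

  square(): GraphNode {
    return new GraphNode(this.value ** 2, [this], (out) => {
      this.grad += 2 * this.value * out.grad;
    });
  }

  // Backward pass: order nodes so inputs come before outputs, then visit
  // them in reverse, applying each node's local chain-rule step.
  backward(): void {
    const order: GraphNode[] = [];
    const seen = new Set<GraphNode>();
    const visit = (node: GraphNode): void => {
      if (seen.has(node)) return;
      seen.add(node);
      node.parents.forEach(visit);
      order.push(node);
    };
    visit(this);
    this.grad = 1; // dLoss/dLoss
    for (const node of order.reverse()) node.pushBack(node);
  }
}

// Forward pass builds the same graph as the diagram above.
const weight1 = new GraphNode(0.5);
const weight2 = new GraphNode(-0.3);
const inputNode = new GraphNode(2);
const targetNode = new GraphNode(1);
const lossNode = weight2.mul(weight1.mul(inputNode).relu()).sub(targetNode).square();

lossNode.backward();
console.log('dL/dw1 ≈', weight1.grad.toFixed(2)); // about 1.56
console.log('dL/dw2 ≈', weight2.grad.toFixed(2)); // about -2.60
```

Each operation only knows its own local derivative; the reverse walk is what chains them together into a gradient for every weight.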

Challenge

Compute gradients by hand for a simple network, then verify with TensorFlow.js.

Recall Prompt

What mathematical rule makes backpropagation work, and how does it let you compute a gradient through many layers?

Lesson Recap

What you learned

✓ A partial derivative measures how much the loss changes when you nudge exactly one weight while holding all others fixed.
✓ The chain rule decomposes the derivative of a deeply nested function into a product of simple local derivatives, one per operation.
✓ Backpropagation walks the computational graph in reverse, applying the chain rule at each node to deliver a gradient to every weight.

The bridge

Just as `requestAnimationFrame` velocity estimates the rate of change of position over time with a finite difference, `tf.grad()` gives the rate of change of the loss with respect to a weight. The difference is that `tf.grad()` computes it exactly, using reverse-mode autodiff over the computational graph.

You can now

Trace gradient flow through a simple network by hand and use `tf.grad()` to verify the result.
