Can I learn machine learning with JavaScript?

Yes. Tensorcraft teaches machine learning entirely in JavaScript and TypeScript using TensorFlow.js. Models train and run in the browser, no Python required. The curriculum covers neural networks, LSTMs, CNNs, and transformers through hands-on tutorials built for frontend developers.

How does Tensorcraft teach ML to frontend developers?

Through 50+ 'bridge' analogies that map frontend concepts you already know (like useState, Array.map, and fetch) to their ML equivalents (model weights, tensor operations, and inference APIs). Each course is a story-driven narrative where you build real ML models.

How much math do I need?

None upfront. You need working JavaScript: comfortable with functions, arrays, and async. The math is there when you want it: derivations sit in optional expandable drawers, and you can finish every module without opening one.

Module 1 of Deep Orbit, the live theme, is free, no account or credit card required. The other four themes ship in waves, each with a waitlist. Full themes cost $59 each, with bundle discounts available up to $159 for all 5 themes.

What ML topics does Tensorcraft cover?

Five specializations: Time-Series & Signals (RNNs, LSTMs), Computer Vision (CNNs, YOLO), NLP & Text Intelligence (Transformers, BERT), Multimodal & Generative AI (GANs, Diffusion), and Edge AI & Production ML (quantization, MLOps).

What if it turns out not to be for me?

Module 1 is free before any money moves. After purchase there's a 14-day money-back guarantee: full refund if you've used less than 20% of a theme.

Extras/ethics-responsibility/dataset-bias

companion content · math depth

Dataset Bias: Garbage In, Discrimination Out

Dataset bias occurs when your training data doesn't represent the real world, leading to models that work well for some people and fail for others.

Instructor

Before we write a single line of model code, we need to talk about something that will determine whether your model helps people or harms them: the data you train on.

In 2018, researchers tested commercial face-recognition systems and found error rates up to 34.7% for darker-skinned women. For light-skinned men: under 1%. The models weren't broken. Their training data was. That gap, between who your data represents and who your model serves, is dataset , and the consequences run far past a broken layout.

Learning Objectives

○Identify the three main types of dataset bias: selection, measurement, and historical
○Recognize how biased data leads to discriminatory model outputs
○Audit a dataset for demographic representation gaps
○Connect dataset bias to the familiar problem of insufficient cross-browser testing

The "Works on My Machine" Problem

Frontend

Browser-Only Testing

// Only tested in Chrome on macOS: 'works for me!'

Machine Learning

Biased Training Data

// Only trained on English text; fails on multilingual input

Intuition Bridge

Where the analogy ends

Cross-browser bugs surface immediately on a Firefox/Safari pageview; the failure mode is loud (broken layout, JS error). Dataset bias is silent: a face-recognition model trained mostly on light-skinned faces may still return a confident match on a darker-skinned face that is wrong. Detecting bias requires per-subgroup evaluation against a labelled holdout. There's no equivalent of 'open it in Firefox' that surfaces it on demand.

You know the drill. You build a beautiful UI, test it in Chrome on your MacBook, ship it, and immediately get bug reports from Firefox users, Windows users, and screen reader users. The problem wasn't your code. It was your testing coverage. You only validated against a narrow slice of your actual user base.

Dataset bias is the same trap. If your training data only represents certain demographics, your model will "work" for those groups and silently fail for everyone else.

Three Types of Bias

Selection Bias

Your dataset doesn't represent the full population. If you train a facial recognition model on photos that are 80% light-skinned faces, it will perform poorly on darker skin tones. The frontend version: testing only on desktop when 60% of your traffic is mobile.

Measurement Bias

Your data collection method itself introduces distortion. If you measure "employee productivity" using metrics that favor in-office workers, remote workers will be systematically rated lower. Not because they're less productive, but because your measuring tool is broken.

Historical Bias

Your data faithfully reflects an unfair world. If you train a hiring model on historical hiring decisions, and those decisions were biased against women in engineering, your model will learn to replicate that discrimination.

audit-dataset.tstypescript

// Auditing a dataset for representation bias
interface DataPoint {
age: number;
gender: string;
ethnicity: string;
income: number;
label: number; // 0 = loan denied, 1 = loan approved
}

function auditRepresentation(data: DataPoint[]) {
const total = data.length;

// Count representation by group
const genderCounts: Record<string, number> = {};
const ethnicityCounts: Record<string, number> = {};

for (const point of data) {
  genderCounts[point.gender] = (genderCounts[point.gender] || 0) + 1;
  ethnicityCounts[point.ethnicity] = (ethnicityCounts[point.ethnicity] || 0) + 1;
}

// Compute proportions and flag underrepresentation
const report: string[] = [];

for (const [group, count] of Object.entries(genderCounts)) {
  const proportion = count / total;
  report.push(`Gender "${group}": ${(proportion * 100).toFixed(1)}% (${count}/${total})`);
  if (proportion < 0.1) {
    report.push(`  ⚠ WARNING: "${group}" is severely underrepresented`);
  }
}

for (const [group, count] of Object.entries(ethnicityCounts)) {
  const proportion = count / total;
  report.push(`Ethnicity "${group}": ${(proportion * 100).toFixed(1)}% (${count}/${total})`);
  if (proportion < 0.1) {
    report.push(`  ⚠ WARNING: "${group}" is severely underrepresented`);
  }
}

return report;
}

// Also check label distribution per group; disparate outcomes signal bias
function auditOutcomes(data: DataPoint[]) {
const groupOutcomes: Record<string, { approved: number; total: number }> = {};

for (const point of data) {
  const key = point.gender;
  if (!groupOutcomes[key]) groupOutcomes[key] = { approved: 0, total: 0 };
  groupOutcomes[key].total++;
  if (point.label === 1) groupOutcomes[key].approved++;
}

for (const [group, stats] of Object.entries(groupOutcomes)) {
  const rate = stats.approved / stats.total;
  console.log(`${group}: ${(rate * 100).toFixed(1)}% approval rate`);
}
}

Real-World Consequences

These aren't hypothetical problems. Amazon scrapped an AI recruiting tool that penalized resumes containing the word "women's" because it was trained on a decade of male-dominated hiring. Healthcare algorithms allocated less care to Black patients because they used healthcare spending (influenced by systemic inequality) as a proxy for health needs.

You would never close a bug report with "works for me." Hold your training data to the same standard: works for whom, exactly?

Challenge

Audit a dataset and identify representation gaps across demographic groups.

Loading editor…

Recall Prompt

What are the three main types of dataset bias, and what does each one describe?

Lesson Recap

What you learned

✓Biased training data produces biased model outputs regardless of algorithm quality; auditing demographic representation before training is non-negotiable.
✓The three types of dataset bias (selection, measurement, historical) have distinct root causes and require different remedies.
✓Checking label distribution per subgroup, not just overall accuracy, is the only way to surface silent failures that affect specific groups.

The bridge

Just as testing only in Chrome misses bugs real users hit on Firefox and mobile, training only on data from dominant groups misses failures that real users from underrepresented groups will experience.

You can now

Audit a dataset for demographic representation gaps and flag groups that fall below acceptable thresholds.

Need a hint?

Guidance

Solution

← All Extras