Machine learning,
from to clusters.

An open community for learning, writing, and tinkering on the infrastructure behind modern AI — inference engines, training systems, ml stacks, and everything in between.

Read the archive →Write with us

articles

contributors

FIG. 1.1Per-head attention pattern at layer 14. Causal mask + induction circuit + sink token, visualized over one sentence.

Latest

Recently published.

All articles →

Architecture

Neural Networks From Zero: From a Single Number to a Billion Parameters

A neural network never sees a word, an image, or a sound — only a list of numbers. Starting from that one fact and a single neuron, this guide builds the whole machine: how any input becomes numbers, why weights, biases, and activations each exist, and how neurons stack into layers and layers into a model.

DineshJul 12, 2026 · 14 min

Index

Browse by topic.

Full index →

Neural Networks From Zero: From a Single Number to a Billion Parameters
A neural network never sees a word, an image, or a sound — only a list of numbers. Starting from that one fact and a single neuron, this guide builds the whole machine: how any input becomes numbers, why weights, biases, and activations each exist, and how neurons stack into layers and layers into a model.
DineshJul 12, 2026

Tools

Run the math yourself.

All tools →

Discourse

The conversation.

Where to talk →

Forum Discussions

Ask questions, share what you built, run a poll, or just discuss ML systems.

Open the Forum →

Discord

Real-time chat for the working day. Quick questions, debugging help, paper club, and the occasional argument about whether MoE is overrated.

Join the server →

Machine learning,
from to clusters.

Recently published.

Neural Networks From Zero: From a Single Number to a Billion Parameters

Browse by topic.

Neural Networks From Zero: From a Single Number to a Billion Parameters

Run the math yourself.

Attention Visualizer

Throughput Calculator

Training Memory Calculator

Eval Harness Playground

Model Card Generator

Kernel Benchmark

The conversation.

Forum Discussions

Discord

Share knowledge that
moves the field forward.

Machine learning,from kernels to clusters.

Recently published.

Neural Networks From Zero: From a Single Number to a Billion Parameters

Browse by topic.

Neural Networks From Zero: From a Single Number to a Billion Parameters

Run the math yourself.

Attention Visualizer

Throughput Calculator

Training Memory Calculator

Eval Harness Playground

Model Card Generator

Kernel Benchmark

The conversation.

Forum Discussions

Discord

Share knowledge thatmoves the field forward.

Machine learning,
from to clusters.

Share knowledge that
moves the field forward.