LearningTree · AI

Artificial Intelligence
Foundation

A structured reference covering the full spectrum of AI — from history and mathematics to deep learning, agentic systems, and responsible AI practice.

5

Tiers

12

Domains

80+

Chapters

All

Levels

This reference takes you from zero to frontier AI — structured across five progressive tiers so beginners, developers, and researchers each find their entry point and grow from there.

Tier 1 · Beginner → Intermediate

Foundations

History, philosophy, and mathematics — the intellectual bedrock before the algorithms begin.

Domain 01 — Foundations of AI

7 chapters ~45 min Beginner → Intermediate

Where AI came from, what it actually is, the key paradigms, and the foundational frameworks that every practitioner needs before touching a single algorithm.

What Is Artificial Intelligence?

Definitions, the four capabilities (perceive, reason, learn, act), AI vs ML vs DL

History of AI — Origin to Present

Dartmouth 1956 → two AI winters → AlexNet → Transformers → ChatGPT → Agents

Symbolic AI & Knowledge Representation

Expert systems, logic, search algorithms, why GOFAI failed to scale

The Turing Test & Philosophy of Mind

Chinese Room, consciousness, symbol grounding, strong vs weak AI

Problem Solving & Rational Agents

PEAS framework, environment types, reflex → utility → learning agents

Key Paradigms & Schools of Thought

Connectionism vs symbolicism, Bayesian AI, evolutionary computation

The AI Landscape Today & Tomorrow

Foundation models, benchmarks, open problems: reasoning, causality, robustness

📋 Domain 1 — What you'll learn

AI = optimisation toward a goal using data and computation — not magic
Two AI winters caused by data, compute, and algorithm gaps — now all solved
2012 AlexNet → 2017 Transformer → 2022 ChatGPT are the three inflection points
PEAS framework: the foundation of modern agentic AI in Domain 8
Modern LLMs = hybrid of all paradigms — not purely one approach

Domain 02 — Mathematics & Statistics for AI

7 chapters ~60 min Intermediate

The mathematical language of machine learning — linear algebra, calculus, probability, and optimisation. You don't need to master all of it upfront, but understanding the core ideas makes every algorithm click.

Vectors, matrices, eigenvalues, SVD — the language of data

Calculus & Gradients

Derivatives, chain rule, Jacobians — how backprop works

Probability Theory

Distributions, expectation, variance, conditional probability

Bayesian Inference

Bayes' theorem, MLE, MAP estimation, priors

Information Theory

Entropy, KL-divergence, cross-entropy loss explained

Gradient descent, Adam, convexity, learning rate schedules

Graphs in AI: knowledge graphs, GNNs, network analysis

Tier 2 · Intermediate

Core Machine Learning

Traditional algorithms and the deep learning architectures that power everything in modern AI.

Domain 03 — Classical Machine Learning

7 chapters ~60 min Intermediate

Supervised, unsupervised, and ensemble methods — the core ML toolkit that still powers ~80% of production systems.

The ML Landscape

Supervised, unsupervised, reinforcement — taxonomy, bias-variance, the ML workflow

Linear & logistic regression, cost functions, gradient descent, regularisation

Decision boundaries, k-NN, Naive Bayes, precision/recall, ROC curves

Decision Trees & Ensembles

CART, Random Forests, Gradient Boosting, XGBoost — bagging vs boosting

Support Vector Machines

Hyperplanes, margins, kernel trick, soft margins, SVM vs modern methods

Clustering & Unsupervised Learning

K-Means, DBSCAN, hierarchical clustering, PCA, dimensionality reduction

Model Evaluation, Metrics & Validation

Cross-validation, confusion matrix, F1, AUC-ROC, overfitting, pipelines

📋 Domain 3 — What you'll learn

Supervised learning: input→output mapping via labelled data — regression & classification
Ensemble methods (Random Forest, XGBoost) are the most-used ML in production today
Bias-variance tradeoff: underfitting vs overfitting — the central tension in ML
Unsupervised learning discovers structure without labels — clustering & dimensionality reduction
Model evaluation: cross-validation + right metric matters more than the algorithm choice

Domain 04 — Deep Learning

9 chapters ~75 min Intermediate → Advanced

Neural networks from perceptron to Transformer — activation functions, backprop, CNNs, RNNs, attention, and modern architectures.

Perceptron to MLP

The artificial neuron, multi-layer perceptrons, universal approximation

Activation Functions

Sigmoid, tanh, ReLU, GELU — why non-linearity matters

Backpropagation & Gradient Flow

Chain rule, computational graphs, vanishing/exploding gradients

Training Deep Networks

Optimisers (SGD, Adam), batch norm, dropout, learning rate schedules

Convolutional Neural Networks

Convolutions, pooling, feature maps — AlexNet to ResNet to EfficientNet

Sequential processing, gating mechanisms, why Transformers replaced them

The Transformer

Self-attention, multi-head attention, positional encoding — the architecture that changed AI

Transfer Learning & Fine-Tuning

Pre-train once, fine-tune many — LoRA, adapters, the modern paradigm

Generative Models

VAEs, GANs, diffusion models — learning to create, not just classify

📋 Domain 4 — What you'll learn

Neural networks learn via backpropagation + gradient descent — no magic
CNNs exploit spatial locality for images; RNNs process sequences step-by-step
The Transformer replaced both with self-attention — O(1) depth, fully parallel
Transfer learning: pre-train on large data, fine-tune on your task — the dominant paradigm
Generative models (GANs, diffusion) learn to create new data, not just classify existing data

Tier 3 · Advanced

Advanced & Specialized AI

Language, vision, and decision-making — the three major specialisations of modern AI.

Domain 05 — NLP & Large Language Models

10 chapters Advanced

From word embeddings to GPT-4 and beyond — tokenisation, pre-training, fine-tuning, prompting, RAG, alignment, and evaluation.

NLP Fundamentals & Text Preprocessing

Four ambiguity layers, stopwords, stemming, TF-IDF, the NLP stack

BPE, WordPiece, SentencePiece — how text becomes numbers

Word Embeddings

Word2Vec, GloVe, vector arithmetic — king − man + woman ≈ queen

Contextual Embeddings

ELMo, sentence-transformers — same word, different vector per context

GPT-1 to GPT-4, scaling laws, Chinchilla, emergent abilities, open-source LLMs

BERT & Encoder Models

Bidirectional attention, MLM, RoBERTa, DistilBERT, DeBERTa

Prompt Engineering

Zero-shot, few-shot, chain-of-thought, structured prompting, pitfalls

Retrieval-Augmented Generation

RAG architecture, vector databases, chunking, advanced RAG patterns

Hallucination, Alignment & Evaluation

Why LLMs confabulate, HHH, RLHF, BLEU/ROUGE, benchmarks

LLM Fine-Tuning in Practice

SFT, LoRA, QLoRA, DPO — the complete practitioner's guide

📋 Domain 5 — What you'll learn

BPE tokenisation: iteratively merge most frequent pairs — text → numbers
Word2Vec geometry encodes meaning: king − man + woman ≈ queen
GPT = decoder-only, BERT = encoder-only — generation vs understanding
Prompt engineering + RAG + fine-tuning = the three tools every AI practitioner uses daily
Hallucination is a feature, not a bug — LLMs are trained for fluency, not facts

Domain 06 — Computer Vision

8 chapters Available Advanced

Image fundamentals, CNN architectures, object detection, segmentation, GANs, Vision Transformers, multimodal AI, and video/3D vision.

Image Fundamentals & Classical CV

Pixels, colour spaces, filters, edge detection, HOG, SIFT — and why deep learning won

CNN Architectures for Vision

AlexNet, VGG, Inception, ResNet, MobileNet, EfficientNet — the architecture lineage

Object Detection

R-CNN family, YOLO, SSD, anchor boxes, NMS — detecting objects at scale

Image Segmentation

Semantic, instance & panoptic segmentation — U-Net, Mask R-CNN, SAM

Generative Adversarial Networks

GAN fundamentals, DCGAN, StyleGAN, CycleGAN — learning to generate images

Vision Transformers & Modern Architectures

ViT, DeiT, Swin Transformer, ConvNeXt — attention meets vision

CLIP, DALL-E, vision-language models, GPT-4V, Gemini — images and text together

Video Understanding & 3D Vision

Optical flow, video models, depth estimation, point clouds, NeRF

📋 Domain 6 — What you'll learn

CNNs exploit spatial locality and weight sharing — the key inductive bias for images
ResNet solved the vanishing gradient problem with skip connections — enabled 100+ layer networks
YOLO unified detection as a single regression problem — orders of magnitude faster than R-CNN
Vision Transformers (ViT) prove attention works on images — treating patches as tokens
CLIP learns visual-semantic alignment from image-text pairs — the foundation of multimodal AI

Domain 07 — Reinforcement Learning

8 chapters Available Advanced

MDPs, dynamic programming, TD learning, Q-learning, deep RL, policy gradients, PPO, and RL in real-world systems including RLHF.

Agent-environment loop, MDPs, rewards, returns, policies, value functions

Dynamic Programming

Bellman equations, policy evaluation, policy iteration, value iteration

Monte Carlo & TD Learning

MC prediction & control, TD(0), TD(λ), eligibility traces

Q-Learning & SARSA

Off-policy vs on-policy, Q-tables, function approximation, convergence

Deep Q-Networks

DQN, experience replay, target networks, Double DQN, Dueling DQN

Policy Gradients & Actor-Critic

REINFORCE, baseline subtraction, A2C, A3C — optimising policies directly

PPO, SAC & Model-Based RL

Proximal Policy Optimisation, Soft Actor-Critic, world models, Dyna

RL in the Real World

RLHF for LLMs, robotics, games, sparse rewards, safety in RL

📋 Domain 7 — What you'll learn

RL = learn by doing — no labels, only reward signals from environment interaction
Bellman equations express recursive value decomposition — the foundation of all RL algorithms
DQN combined Q-learning with deep networks — first superhuman Atari performance
PPO is the workhorse of modern RL — used in RLHF to align LLMs like ChatGPT
Real-world RL: sparse rewards, safety constraints, sim-to-real gap — harder than games

Tier 4 · Advanced → Expert

Agentic & Systems AI

Autonomous agents that plan, act and adapt — plus the engineering discipline of deploying AI at scale.

Domain 08 — AI Agents

8 chapters Available Expert ⭐ Frontier

LLM agents with tool use, memory, planning, multi-agent collaboration, and production deployment — the frontier of AI engineering.

What Are AI Agents?

Agent definition, perception-action loop, LLM agents vs classical agents, taxonomy

Tool Use & Function Calling

APIs, code execution, web search, structured outputs, tool schemas

ReAct & Reasoning Loops

ReAct, Chain-of-Thought, Reflexion, self-correction, scratchpad reasoning

In-context, episodic, semantic, procedural memory — long-horizon agent behaviour

Planning & Task Decomposition

HiGP, Tree-of-Thoughts, MCTS for agents, hierarchical task planning

Multi-Agent Systems

Agent orchestration, AutoGen, CrewAI, debate, specialisation, coordination

Evaluation, Safety & Reliability

Agent evals, prompt injection, hallucination in agents, failure modes

Production Agents

Observability, tracing, latency, cost, LangSmith, deployment patterns

📋 Domain 8 — What you'll learn

An LLM agent = LLM + tools + memory + planning loop — not just a chatbot
ReAct: interleave reasoning and acting — grounding decisions in observable tool outputs
Memory gives agents persistence across sessions — the missing piece for long-horizon tasks
Multi-agent systems enable parallelism, specialisation, and debate — better than one large agent
Production agents require observability, error recovery, and cost management — not just capability

Domain 09 — MLOps & AI Engineering

8 chapters Available Advanced

Building, deploying & operating AI systems — data pipelines, experiment tracking, model serving, CI/CD for ML, monitoring, drift detection, vector databases, LLM APIs, and infrastructure.

The ML Lifecycle & MLOps Overview

ML lifecycle, MLOps vs DevOps, maturity levels, tech stack map

Data Pipelines & Feature Engineering

Batch vs streaming, Feast, Tecton, Great Expectations, DVC

Experiment Tracking & Model Registry

MLflow, Weights & Biases, model versioning, lifecycle stages

Model Serving & Deployment Patterns

Batch vs real-time, TorchServe, Triton, vLLM, canary/blue-green/shadow

CI/CD for Machine Learning

ML-specific CI/CD, Airflow, Kubeflow, Prefect, pipeline orchestration

Monitoring, Drift & Observability

Data drift, concept drift, Evidently, WhyLabs, retraining triggers

Vector Databases & LLM Infrastructure

Pinecone, Qdrant, pgvector, LLM gateways, LiteLLM, prompt caching

Containerisation, Orchestration & Cost

Docker for ML, GPU scheduling, spot instances, quantisation, cloud vs on-prem

📋 Domain 9 — What you'll learn

87% of models never reach production — engineering, not algorithms, is the bottleneck
Feature stores bridge training and serving — eliminating training/serving skew
ML CI/CD tests code + data + model quality — triggered by code, data, or drift
Models break silently — drift detection is the early warning system
Vector databases power RAG — HNSW index, ANN search, LLM gateway patterns

Tier 5 · All Levels

Applied & Responsible AI

Real-world use cases, ethical practice, and where the field is headed — the human side of AI.

Domain 10 — Ethics & Safety

8 chapters Available All Levels

Bias, fairness, explainability, privacy, AI safety, alignment, governance, disinformation, societal impact, and long-term existential risk.

AI Fairness — Bias & Discrimination

Bias sources, fairness definitions, impossibility theorem, COMPAS, mitigation strategies

Explainability & Interpretability

LIME, SHAP, attention maps, model cards, legal right to explanation

Privacy & Data Governance

LLM memorisation, differential privacy, federated learning, right to be forgotten

AI Safety — Technical Alignment

Alignment problem, adversarial robustness, specification gaming, RLHF, scalable oversight

Societal Impact

Labour markets, power concentration, environmental costs, inequality, ghost workers

AI Governance & Regulation

EU AI Act, US approach, international frameworks, NIST AI RMF, risk pyramid

Disinformation & Information Integrity

AI-generated disinformation, deepfakes, detection, C2PA provenance, elections

Long-Term AI Safety & Existential Risk

The expert debate, catastrophic risk scenarios, safety research, responsible development

📋 Domain 10 — What you'll learn

Fairness is a value judgement — multiple definitions exist and the impossibility theorem proves they cannot all hold simultaneously
LIME and SHAP explain individual predictions; model cards document per-subgroup performance
Differential privacy provides provable privacy guarantees — ε controls the privacy-utility tradeoff
EU AI Act: risk pyramid from banned (social scoring) to minimal risk — world's first comprehensive AI law
Long-term AI risk is genuinely contested among serious experts — not a mainstream-vs-fringe debate

Domain 11 — AI Applications & Industry

8 chapters All Levels Complete

Real-world AI across healthcare, finance, robotics, autonomous vehicles, education, scientific discovery, code generation, creative AI, and enterprise.

AI in Healthcare

Finance & Trading

Scientific Discovery

Code Generation

Creative AI & Media

Domain 12 — Emerging Technologies

12 chapters Frontier

Foundation models, world models, neuromorphic computing, quantum AI, embodied AI, AGI concepts, mixture of experts, neuro-symbolic AI, edge AI, AI hardware, consciousness, and the road ahead.

Foundation Models

Scaling laws, Chinchilla, multimodal fusion, post-training, test-time compute

World Models & Simulation

Sora, JEPA, Genie 2, latent-space prediction, physical understanding

Neuromorphic Computing

Intel Loihi 2, IBM NorthPole, spiking neural networks, brain-inspired chips

Qubits, NISQ era, quantum ML algorithms, quantum advantage spectrum

RT-2, sim-to-real, humanoid robots, vision-language-action models

Definitions, DeepMind levels, paths to AGI, timeline debate, ARC benchmark

Mixture of Experts

Sparse activation, routing, Switch Transformer, Mixtral, DeepSeek-V3

Neuro-Symbolic AI

AlphaGeometry, LLM + code, GraphRAG, neural + symbolic reasoning

Edge AI & Federated Learning

On-device LLMs, federated learning, quantisation, model compression

H100, B200, TPU v5p, Groq LPU, TSMC geopolitics, training economics

AI Consciousness

Hard problem, functionalism, IIT, Chinese Room, sentience debate

Open problems, reasoning, alignment, energy, governance, predictions