Artificial Intelligence
Foundation
A structured reference covering the full spectrum of AI β from history and mathematics to deep learning, agentic systems, and responsible AI practice.
This reference takes you from zero to frontier AI β structured across five progressive tiers so beginners, developers, and researchers each find their entry point and grow from there.
Where AI came from, what it actually is, the key paradigms, and the foundational frameworks that every practitioner needs before touching a single algorithm.
- AI = optimisation toward a goal using data and computation β not magic
- Two AI winters caused by data, compute, and algorithm gaps β now all solved
- 2012 AlexNet β 2017 Transformer β 2022 ChatGPT are the three inflection points
- PEAS framework: the foundation of modern agentic AI in Domain 8
- Modern LLMs = hybrid of all paradigms β not purely one approach
The mathematical language of machine learning β linear algebra, calculus, probability, and optimisation. You don't need to master all of it upfront, but understanding the core ideas makes every algorithm click.
Supervised, unsupervised, and ensemble methods β the core ML toolkit that still powers ~80% of production systems.
- Supervised learning: inputβoutput mapping via labelled data β regression & classification
- Ensemble methods (Random Forest, XGBoost) are the most-used ML in production today
- Bias-variance tradeoff: underfitting vs overfitting β the central tension in ML
- Unsupervised learning discovers structure without labels β clustering & dimensionality reduction
- Model evaluation: cross-validation + right metric matters more than the algorithm choice
Neural networks from perceptron to Transformer β activation functions, backprop, CNNs, RNNs, attention, and modern architectures.
- Neural networks learn via backpropagation + gradient descent β no magic
- CNNs exploit spatial locality for images; RNNs process sequences step-by-step
- The Transformer replaced both with self-attention β O(1) depth, fully parallel
- Transfer learning: pre-train on large data, fine-tune on your task β the dominant paradigm
- Generative models (GANs, diffusion) learn to create new data, not just classify existing data
From word embeddings to GPT-4 and beyond β tokenisation, pre-training, fine-tuning, prompting, RAG, alignment, and evaluation.
- BPE tokenisation: iteratively merge most frequent pairs β text β numbers
- Word2Vec geometry encodes meaning: king β man + woman β queen
- GPT = decoder-only, BERT = encoder-only β generation vs understanding
- Prompt engineering + RAG + fine-tuning = the three tools every AI practitioner uses daily
- Hallucination is a feature, not a bug β LLMs are trained for fluency, not facts
Image fundamentals, CNN architectures, object detection, segmentation, GANs, Vision Transformers, multimodal AI, and video/3D vision.
- CNNs exploit spatial locality and weight sharing β the key inductive bias for images
- ResNet solved the vanishing gradient problem with skip connections β enabled 100+ layer networks
- YOLO unified detection as a single regression problem β orders of magnitude faster than R-CNN
- Vision Transformers (ViT) prove attention works on images β treating patches as tokens
- CLIP learns visual-semantic alignment from image-text pairs β the foundation of multimodal AI
MDPs, dynamic programming, TD learning, Q-learning, deep RL, policy gradients, PPO, and RL in real-world systems including RLHF.
- RL = learn by doing β no labels, only reward signals from environment interaction
- Bellman equations express recursive value decomposition β the foundation of all RL algorithms
- DQN combined Q-learning with deep networks β first superhuman Atari performance
- PPO is the workhorse of modern RL β used in RLHF to align LLMs like ChatGPT
- Real-world RL: sparse rewards, safety constraints, sim-to-real gap β harder than games
LLM agents with tool use, memory, planning, multi-agent collaboration, and production deployment β the frontier of AI engineering.
- An LLM agent = LLM + tools + memory + planning loop β not just a chatbot
- ReAct: interleave reasoning and acting β grounding decisions in observable tool outputs
- Memory gives agents persistence across sessions β the missing piece for long-horizon tasks
- Multi-agent systems enable parallelism, specialisation, and debate β better than one large agent
- Production agents require observability, error recovery, and cost management β not just capability
Building, deploying & operating AI systems β data pipelines, experiment tracking, model serving, CI/CD for ML, monitoring, drift detection, vector databases, LLM APIs, and infrastructure.
- 87% of models never reach production β engineering, not algorithms, is the bottleneck
- Feature stores bridge training and serving β eliminating training/serving skew
- ML CI/CD tests code + data + model quality β triggered by code, data, or drift
- Models break silently β drift detection is the early warning system
- Vector databases power RAG β HNSW index, ANN search, LLM gateway patterns
Bias, fairness, explainability, privacy, AI safety, alignment, governance, disinformation, societal impact, and long-term existential risk.
- Fairness is a value judgement β multiple definitions exist and the impossibility theorem proves they cannot all hold simultaneously
- LIME and SHAP explain individual predictions; model cards document per-subgroup performance
- Differential privacy provides provable privacy guarantees β Ξ΅ controls the privacy-utility tradeoff
- EU AI Act: risk pyramid from banned (social scoring) to minimal risk β world's first comprehensive AI law
- Long-term AI risk is genuinely contested among serious experts β not a mainstream-vs-fringe debate
Real-world AI across healthcare, finance, robotics, autonomous vehicles, education, scientific discovery, code generation, creative AI, and enterprise.
Foundation models, world models, neuromorphic computing, quantum AI, embodied AI, AGI concepts, mixture of experts, neuro-symbolic AI, edge AI, AI hardware, consciousness, and the road ahead.